Original article excerpt
Server-side extracted preview paragraphs from the original source.
Developers can now fine-tune GPT-4o with images and text to improve vision capabilities
Developers can now fine-tune GPT‑4o with images and text to improve vision capabilities
Today, we’re introducing vision fine-tuning(opens in a new window) on GPT‑4o1, making it possible to fine-tune with images, in addition to text. Developers can customize the model to have stronger image understanding capabilities which enables applications like enhanced visual search functionality, improved object detection for autonomous vehicles or smart cities, and more accurate medical image analysis.