Introducing vision to the fine-tuning API

OpenAI has added vision capabilities to its fine-tuning API, allowing developers to customize models with image understanding. This update enables more tailored AI applications that combine text and visual inputs. It matters because it expands the versatility and practical use cases of AI models in various industries.

ArchiveLaunch

Signal trust

Single sourceEarly signal

PublishedTuesday, October 1, 2024 at 12:04 PMOct 1, 12:04 PM

FreshnessArchive

Story ID#514

Back to feed Original report

Original article excerpt

Server-side extracted preview paragraphs from the original source.

Developers can now fine-tune GPT-4o with images and text to improve vision capabilities

Developers can now fine-tune GPT‑4o with images and text to improve vision capabilities

Today, we’re introducing vision fine-tuning⁠(opens in a new window) on GPT‑4o1, making it possible to fine-tune with images, in addition to text. Developers can customize the model to have stronger image understanding capabilities which enables applications like enhanced visual search functionality, improved object detection for autonomous vehicles or smart cities, and more accurate medical image analysis.

Opening the briefing

Introducing vision to the fine-tuning API

Original article excerpt