Original article excerpt
Server-side extracted preview paragraphs from the original source.
A universal interface for AI to interact with the digital world.
Powering Operator with Computer-Using Agent, a universal interface for AI to interact with the digital world.
Today we introduced a research preview of Operator(opens in a new window), an agent that can go to the web to perform tasks for you. Powering Operator is Computer-Using Agent (CUA), a model that combines GPT‑4o's vision capabilities with advanced reasoning through reinforcement learning. CUA is trained to interact with graphical user interfaces (GUIs)—the buttons, menus, and text fields people see on a screen—just as humans do. This gives it the flexibility to perform digital tasks without using OS-or web-specific APIs.