OpenAI CEO Sam Altman kicked off this year by saying in a blog post that 2025 would be big for AI agents, tools that can automate tasks and take actions on your behalf.
Now, we’re seeing OpenAI’s first real attempt.
OpenAI announced on Thursday that it is launching a research preview of Operator, a general-purpose AI agent that can take control of a web browser and independently perform certain actions.
Operator is coming to U.S. users on ChatGPT’s $200 Pro subscription plan first. OpenAI says it plans to roll this feature out to more users in its Plus, Team, and Enterprise tiers eventually.
“[Operator] will be [in] other countries soon,” OpenAI CEO Sam Altman said during a livestream Thursday. “Europe will, unfortunately, take a while.”
This initial research preview is available through operator.chatgpt.com, but soon, OpenAI says it wants to integrate Operator into all of its ChatGPT clients.
Operator promises to automate tasks such as booking travel accommodations, making restaurant reservations, and shopping online, according to OpenAI. There are several task categories users can choose from within the Operator interface, including shopping, delivery, dining, and travel — all of which enable different kinds of automation.
When ChatGPT users activate Operator, a small window will pop up showing a dedicated web browser that the agent uses to complete tasks, along with explanations of specific actions the agent is performing. Users can still take control of their screen while Operator is working, as Operator uses its own dedicated browser.
OpenAI says that Operator is powered by a computer-using agent, or CUA, that combines the vision capabilities of the company’s GPT-4o model with the reasoning abilities from OpenAI’s more advanced models. The CUA is trained to interact with the front-end of websites, meaning it doesn’t need to use developer-facing APIs to tap into different services.
In other words, the CUA can use buttons, navigate menus, and fill out forms on a webpage — much like a human would.
OpenAI says it’s collaborating with companies like DoorDash, Instacart, Priceline, StubHub, and Uber to ensure that Operator respects these businesses’ terms of service agreements.
“The CUA model is trained to ask for user confirmation before finalizing tasks with external side effects, for example before submitting an order, sending an email, etc., so that the user can double-check the model’s work before it becomes permanent,” OpenAI writes in materials provided to TechCrunch. “[It] has already proven useful in a variety of cases, and we aim to extend that reliability across a wider range of tasks.”
But OpenAI warns the CUA isn’t perfect. The company says it “[doesn’t] expect [the] CUA to perform reliably in all scenarios just yet.”
Out of an abundance of caution, OpenAI is also requiring supervision for some tasks, like banking transactions, the CUA and Operator could perform entirely on their own. Users will need to take over to put in credit card information, for example. OpenAI says that Operator doesn’t collect or screenshot any data.
“On particularly sensitive websites, such as email, Operator requires active user supervision, ensuring users can directly catch and address any potential mistakes the model might make,” OpenAI says in its support materials.
Operator is OpenAI’s boldest attempt yet at creating an AI agent. Last week, OpenAI released Tasks, giving ChatGPT simple automation features such as the ability to set reminders and schedule prompts to run at a set time every day.
Tasks gave ChatGPT users some familiar, but necessary, features to make ChatGPT as practical to use as Siri or Alexa. However, Operator shows off capabilities that the previous generation of virtual assistants could never do.
AI agents have been pitched as the next big thing in AI after ChatGPT: a new technology that will change how people use the internet and their PCs. Instead of simply delivering and processing information, agents can — in theory — take actions and do things.
As OpenAI releases its first concrete take on agents, it’ll soon become clear just how realistic that vision is.