Listen Live
In this photo illustration, the OpenAI logo seen displayed...
Source: SOPA Images / Getty

ChatGPT’s First AI Agent, Operator, Redefines What AI Can Do

OpenAI has officially launched its first AI agent, called Operator, marking a major leap in artificial intelligence capabilities.

Unlike traditional chatbots that respond to queries, Operator is designed to autonomously complete tasks online.

It can make dinner reservations, book travel, buy groceries, and even fill out forms on websites.

This innovation is available as a limited research preview to ChatGPT Pro subscribers in the U.S., with plans to expand availability in the future.

https://platform.twitter.com/widgets.js

The technology behind Operator is groundbreaking.

It is powered by the Computer Using Agent (CUA) model, an advanced iteration of GPT-4o.

This model gives Operator the ability to “see” web content using screenshots and “interact” with it just like a human would—with mouse and keyboard actions.

It can understand buttons, menus, and search boxes within a browser, allowing it to execute tasks step-by-step.

For instance, you can ask Operator to book a reservation for dinner, specify a time, and even choose a platform like OpenTable.

If you’re shopping, you can upload a handwritten grocery list, and Operator will process it, search for items, and make purchases through a designated website like Instacart.

Operator also provides a reasoning summary in a sidebar, allowing users to track its steps and identify possible areas of error.

RELATED | U.S. Department of Labor Announces New Guidelines For AI in the Workplace

While Operator demonstrates impressive functionality, it is still an early-stage “research preview.”

OpenAI acknowledges its limitations, noting it occasionally struggles with complex web interactions.

Currently, Operator navigates websites and operating systems with somewhat lower efficiency than humans.

However, the technology is evolving quickly, and each refinement brings it closer to the goal of making artificial general intelligence (AGI) a reality.

window.addEventListener(‘interaction’, function () {

setTimeout(function () {

var s = document.createElement(‘script’), el = document.getElementsByTagName(‘script’)[ 0 ];

s.async = true;

s.src = ‘https://platform.twitter.com/widgets.js’;

el.parentNode.insertBefore(s, el);

}, 1000)

});

ChatGPT’s First AI Agent, Operator, Redefines What AI Can Do  was originally published on wibc.com