OpenAI on Thursday introduced Operator, its first artificial intelligence (AI) agent, which can “go to the web to perform tasks for you”. It marks the latest entry into the agents segment by a major player, following the likes of Google and Salesforce. ET explains what Operator can do, how it works and who can access it.
Budget with ET Budget 2025: A CFO’s playbook for operational excellence and long-term growth Rising Bharat may need to take center stage for India’s game-changing plans Will Indian Railways accelerate to global standards with govt’s budgetary allocation? What can Operator do? Users can ask Operator to carry out a range of repetitive browser tasks such as filling out forms, ordering groceries and even creating memes, OpenAI said in a blog post. Some who have access shared on social media that they tried using the agent to order dinner ingredients based on pictures and recipes, schedule a barber appointment by checking Google calendar availability, plan a trip by parsing recommendations on Reddit that would be within budget, among other tasks. OpenAI is collaborating with firms including food delivery app DoorDash, ecommerce site eBay, grocery delivery platform Instacart, taxi aggregator Uber, sports and entertainment ticket booking app StubHub to ensure conformity with their terms of service agreements.
Artificial Intelligence(AI) Java Programming with ChatGPT: Learn using Generative AI By - Metla Sudha Sekhar, IT Specialist and Developer View Program Artificial Intelligence(AI) Basics of Generative AI: Unveiling Tomorrows Innovations By - Metla Sudha Sekhar, IT Specialist and Developer View Program Artificial Intelligence(AI) Generative AI for Dynamic Java Web Applications with ChatGPT By - Metla Sudha Sekhar, IT Specialist and Developer View Program Artificial Intelligence(AI) Mastering C++ Fundamentals with Generative AI: A Hands-On By - Metla Sudha Sekhar, IT Specialist and Developer View Program Artificial Intelligence(AI) Master in Python Language Quickly Using the ChatGPT Open AI By - Metla Sudha Sekhar, IT Specialist and Developer View Program Marketing Performance Marketing for eCommerce Brands By - Zafer Mukeri, Founder- Inara Marketers View Program Office Productivity Zero to Hero in Microsoft Excel: Complete Excel guide 2024 By - Metla Sudha Sekhar, IT Specialist and Developer View Program Finance A2Z Of Money By - elearnmarkets, Financial Education by StockEdge View Program Marketing Modern Marketing Masterclass by Seth Godin By - Seth Godin, Former dot com Business Executive and Best Selling Author View Program Astrology Vastu Shastra Course By - Sachenkumar Rai, Vastu Shashtri View Program Strategy Succession Planning Masterclass By - Nigel Penny, Global Strategy Advisor: NSP Strategy Facilitation Ltd. View Program Data Science SQL for Data Science along with Data Analytics and Data Visualization By - Metla Sudha Sekhar, IT Specialist and Developer View Program Artificial Intelligence(AI) AI and Analytics based Business Strategy By - Tanusree De, Managing Director- Accenture Technology Lead, Trustworthy AI Center of Excellence: ATCI View Program Web Development A Comprehensive ASP.NET Core MVC 6 Project Guide for 2024 By - Metla Sudha Sekhar, IT Specialist and Developer View Program Marketing Digital Marketing Masterclass by Pam Moore By - Pam Moore, Digital Transformation and Social Media Expert View Program Artificial Intelligence(AI) AI-Powered Python Mastery with Tabnine: Boost Your Coding Skills By - Metla Sudha Sekhar, IT Specialist and Developer View Program Office Productivity Mastering Microsoft Office: Word, Excel, PowerPoint, and 365 By - Metla Sudha Sekhar, IT Specialist and Developer View Program Marketing Digital marketing - Wordpress Website Development By - Shraddha Somani, Digital Marketing Trainer, Consultant, Strategiest and Subject Matter expert View Program Office Productivity Mastering Google Sheets: Unleash the Power of Excel and Advance Analysis By - Metla Sudha Sekhar, IT Specialist and Developer View Program Web Development Mastering Full Stack Development: From Frontend to Backend Excellence By - Metla Sudha Sekhar, IT Specialist and Developer View Program Finance Financial Literacy i.
e Lets Crack the Billionaire Code By - CA Rahul Gupta, CA with 10+ years of experience and Accounting Educator View Program Data Science SQL Server Bootcamp 2024: Transform from Beginner to Pro By - Metla Sudha Sekhar, IT Specialist and Developer View Program “It (Operator) has limitations and will evolve based on user feedback,” OpenAI said. Discover the stories of your interest Blockchain 5 Stories Cyber-safety 7 Stories Fintech 9 Stories E-comm 9 Stories ML 8 Stories Edtech 6 Stories It added, however, that the agent has produced state-of-the-art results, setting new benchmarks when evaluated for full computer use tasks (38% success rate on the OSWorld benchmark) and web-based tasks (58% and 87% success rates on WebArena and WebVoyager benchmarks, respectively). How does it work? Operator processes raw pixel data to understand what’s happening on the screen and uses a virtual mouse and keyboard to complete actions.
It can recognise buttons, menus and text fields people see on a screen. It does not need to use back-end application programming interfaces (APIs) to interact with platforms. The agent is powered by a new model called Computer-Using Agent.
This combines the vision capabilities of its most advanced generative AI model GPT-4o with advanced reasoning through reinforcement learning. The ability to use the same interfaces and tools that humans interact with on a daily basis broadens the utility of AI, helping people save time on everyday tasks while opening up new engagement opportunities for businesses, the company said. OpenAI CEO Sam Altman said during the launch livestream that AI agents are “going to be a big trend in AI and really impact the work people can do, how productive they can be, how creative they can be, what they can accomplish”.
Who is able to access it? Operator is currently a research preview, available to Pro users in the United States. The company plans to expand access to Plus, Team and Enterprise users and integrate Operator’s capabilities into ChatGPT in the future. It will also be available in other countries “soon”, Altman said during the livestream.
“Europe will, unfortunately, take a while,” he added..
Technology
Meet ‘Operator’, a web-enabled AI agent that performs tasks for You
OpenAI's Operator is an AI agent that automates web tasks like ordering food, scheduling appointments, and creating memes. It uses GPT-4o and reinforcement learning to understand and interact with websites, improving user productivity. Currently in research preview for US Pro users, Operator will expand to other user tiers and regions soon.