OpenAI just announced Operator - an agent that uses a remote web browser (which you can watch in real time and take control of) to perform tasks on your behalf.

It’s a research preview, so it’s a bit rough around the edges and limited in the sites it can visit.

Operator is based on a model called Computer-Using Agent (CUA), which combines GPT-4o's vision capabilities with advanced reasoning. It’s trained to interact with buttons, menus and text fields on UIs to accomplish the task it’s given.

You’ll remember that ChatGPT started as text-only, then became multimodal, allowing it to “see, hear and talk.” This is the next iteration of chatbots, which will now be able to perform tasks for you.

Another AI building block. Slowly but surely, Tony Stark’s Jarvis will come to life, and we’ll all eventually have ultra-capable super-assistants who know us extremely well.

#openai #chatgpt #operator #ai #tech