ChatGPT Intros Agent Mode with Web Navigation and Slide Creation Features – The Fast Mode


Technologies
Services
Business
Devices

AI for Sustainability
OpenAI announced that ChatGPT can now perform complex tasks from start to finish using its own virtual computer, marking a significant advancement in AI-powered task automation.
With this new capability, users can ask ChatGPT to handle requests such as “look at my calendar and brief me on upcoming client meetings based on recent news,” “plan and buy ingredients to make Japanese breakfast for four,” or “analyze three competitors and create a slide deck.” 

Comarch Comarch
ChatGPT can now intelligently navigate websites, filter results, prompt secure login when needed, run code, conduct analysis, and deliver editable slideshows and spreadsheets that summarize its findings.
At the core of this new capability is a unified agentic system. It brings together three key strengths from earlier innovations: Operator’s ability to interact with websites, deep research’s capacity to synthesize information, and ChatGPT’s intelligence and conversational fluency.
ChatGPT performs these tasks using its own virtual computer, seamlessly shifting between reasoning and action to handle complex workflows from start to finish, based on user instructions.
Importantly, users remain in control. ChatGPT requests permission before taking consequential actions, and users can interrupt, take over the browser, or stop tasks at any time.
Beginning today, Pro, Plus, and Team users can activate ChatGPT’s new agentic capabilities directly through the tools dropdown from the composer by selecting ‘agent mode’ in any conversation.
While the ChatGPT agent already offers powerful functionality for managing complex tasks, this launch marks only the beginning. OpenAI plans to deliver significant improvements on a regular basis, enhancing the tool’s capabilities and usefulness over time.
A Natural Evolution of Operator and Deep Research
Previously, Operator and deep research each offered distinct capabilities: Operator could scroll, click, and type on the web, while deep research excelled at analyzing and summarizing information. However, they worked best in different scenarios. Operator lacked the ability to perform deep analysis or generate detailed reports, while deep research could not interact with websites or access content requiring user authentication. In many cases, user queries attempted with Operator were better suited for deep research.
To address this, OpenAI integrated the complementary strengths of both systems into ChatGPT and introduced additional tools—unlocking new capabilities within a single model. ChatGPT can now actively engage websites, clicking, filtering, and gathering more precise, efficient results. Users can transition naturally from a conversation to direct action within the same chat session.

Ray Sharma is an Industry Analyst and Editor at The Fast Mode. He has over 15 years of experience in mobile broadband technologies and solutions, conducting research and analysis on various technology segments and producing articles and write-ups on the latest developments within the sector. He is also in charge of social media engagement and industry liaisons.
Follow him on LinkedIn or Facebook. He can be reached at ray.sharma@thefastmode.com
PREVIOUS POST
NEXT POST
NEWSLETTER
Get updates and alerts
delivered to your inbox
Comarch Comarch ANDREW
NEWSLETTER
Get updates and alerts
delivered to your inbox
Rohde ACC 2025 ACC 2025

COPYRIGHT © 2013 – 2025 THE FAST MODE

source

Jesse
https://playwithchatgtp.com