Categories: Safe Search

OpenAI Unveils GPT-4o: Faster, Multimodal, and Game-Changing for Developers

OpenAI has unveiled GPT-4o, an enhanced version of its GPT-4 language model that powers the popular ChatGPT assistant. According to Mira Murati, OpenAI's Chief Technology Officer, the updated model offers significant performance improvements, boasting faster speeds and enhanced capabilities across text, vision, and audio modalities. While GPT-4o will be available free of charge to all users, paid subscribers will continue to enjoy up to five times higher capacity limits compared to free users. In a blog post, OpenAI revealed that GPT-4o's capabilities will be rolled out incrementally, with text and image functionalities becoming available in ChatGPT starting today. Sam Altman, OpenAI's CEO, highlighted that the model is "natively multimodal," enabling it to generate content and comprehend commands across various formats, including voice, text, and images. Developers interested in exploring GPT-4o will gain access to an API that promises to be twice as fast as GPT-4 Turbo while costing only half the price. One of the notable new features introduced with GPT-4o is an enhanced voice mode for ChatGPT. This mode will allow the app to function as a real-time voice assistant, akin to the fictional AI system depicted in the movie "Her," enabling it to observe its surroundings and respond accordingly. The current voice mode, in contrast, is more limited, responding to prompts one at a time and relying solely on audio input. In a reflective blog post following the livestream event, Altman acknowledged a shift in OpenAI's vision. While the company's original goal was to "create all sorts of…

OpenAI has unveiled GPT-4o, an enhanced version of its GPT-4 language model that powers the popular ChatGPT assistant. According to Mira Murati, OpenAI’s Chief Technology Officer, the updated model offers significant performance improvements, boasting faster speeds and enhanced capabilities across text, vision, and audio modalities. While GPT-4o will be available free of charge to all users, paid subscribers will continue to enjoy up to five times higher capacity limits compared to free users.

In a blog post, OpenAI revealed that GPT-4o’s capabilities will be rolled out incrementally, with text and image functionalities becoming available in ChatGPT starting today. Sam Altman, OpenAI’s CEO, highlighted that the model is “natively multimodal,” enabling it to generate content and comprehend commands across various formats, including voice, text, and images. Developers interested in exploring GPT-4o will gain access to an API that promises to be twice as fast as GPT-4 Turbo while costing only half the price.

One of the notable new features introduced with GPT-4o is an enhanced voice mode for ChatGPT. This mode will allow the app to function as a real-time voice assistant, akin to the fictional AI system depicted in the movie “Her,” enabling it to observe its surroundings and respond accordingly. The current voice mode, in contrast, is more limited, responding to prompts one at a time and relying solely on audio input.

In a reflective blog post following the livestream event, Altman acknowledged a shift in OpenAI’s vision. While the company’s original goal was to “create all sorts of benefits for the world,” Altman recognized that the focus has evolved. Amid criticism for not open-sourcing its advanced AI models, Altman suggested that OpenAI’s role has shifted towards creating AI models and making them available to developers through paid APIs, allowing third parties to leverage these models to create innovative solutions that benefit society.

Prior to the GPT-4o launch, conflicting reports speculated that OpenAI might announce an AI search engine to rival Google, a voice assistant integrated into GPT-4 named Perplexity, or a entirely new and improved model called GPT-5. Notably, OpenAI timed this launch strategically ahead of Google I/O, the tech giant’s flagship conference, where various AI products from the Gemini team are expected to be unveiled.

Was this Answer helpful?

YesNo

Sabyasachi Roy

NextNvidia Unveils Rubin: Next-Gen AI Chip Platform Coming in 2026 »

Previous « Google Acquires Swiss AI Startup Vizerai to Bolster Computer Vision and Model Compression Capabilities

Published by

Sabyasachi Roy

2 years ago

Google DeepMind Launches AlphaEarth Foundations: A “Virtual Satellite” for Environmental Insight

When satellites fall short, AI is now stepping in. In a powerful blend of climate…

8 months ago

Safe Search

OpenAI Builds Europe’s First AI “Gigafactory” in Norway: A Bold Leap Toward Sustainable AI

A new chapter in AI is unfolding—not in Silicon Valley, but along the fjords of…

8 months ago

Safe Search

Indian Railways Enters a New Digital Era with AI, Cybersecurity, and Analytics Overhaul

The Ministry of Railways, through its technology arm — the Centre for Railway Information Systems…

9 months ago

Safe Search

Apple Doubles Down on Child Safety and Parental Controls in iOS 26 and Beyond

As digital devices continue to play an ever-larger role in children’s lives, Apple is stepping…

9 months ago

Safe Search

Inside Google’s Flow: The Future of Filmmaking or the End of Human Creativity?

On May 22, 2025, Google quietly unveiled what may be its most ambitious AI-driven creative…

10 months ago

Safe Search

ChatGPT Could Soon Become Your Personal Storefront Thanks to Shopify Integration

In a bold move that could redefine how we shop online, OpenAI’s ChatGPT might soon…

11 months ago