Categories: Safe Search

OpenAI Unveils GPT-4o: Faster, Multimodal, and Game-Changing for Developers

OpenAI has unveiled GPT-4o, an enhanced version of its GPT-4 language model that powers the popular ChatGPT assistant. According to Mira Murati, OpenAI's Chief Technology Officer, the updated model offers significant performance improvements, boasting faster speeds and enhanced capabilities across text, vision, and audio modalities. While GPT-4o will be available free of charge to all users, paid subscribers will continue to enjoy up to five times higher capacity limits compared to free users. In a blog post, OpenAI revealed that GPT-4o's capabilities will be rolled out incrementally, with text and image functionalities becoming available in ChatGPT starting today. Sam Altman, OpenAI's CEO, highlighted that the model is "natively multimodal," enabling it to generate content and comprehend commands across various formats, including voice, text, and images. Developers interested in exploring GPT-4o will gain access to an API that promises to be twice as fast as GPT-4 Turbo while costing only half the price. One of the notable new features introduced with GPT-4o is an enhanced voice mode for ChatGPT. This mode will allow the app to function as a real-time voice assistant, akin to the fictional AI system depicted in the movie "Her," enabling it to observe its surroundings and respond accordingly. The current voice mode, in contrast, is more limited, responding to prompts one at a time and relying solely on audio input. In a reflective blog post following the livestream event, Altman acknowledged a shift in OpenAI's vision. While the company's original goal was to "create all sorts of…

OpenAI has unveiled GPT-4o, an enhanced version of its GPT-4 language model that powers the popular ChatGPT assistant. According to Mira Murati, OpenAI’s Chief Technology Officer, the updated model offers significant performance improvements, boasting faster speeds and enhanced capabilities across text, vision, and audio modalities. While GPT-4o will be available free of charge to all users, paid subscribers will continue to enjoy up to five times higher capacity limits compared to free users.

In a blog post, OpenAI revealed that GPT-4o’s capabilities will be rolled out incrementally, with text and image functionalities becoming available in ChatGPT starting today. Sam Altman, OpenAI’s CEO, highlighted that the model is “natively multimodal,” enabling it to generate content and comprehend commands across various formats, including voice, text, and images. Developers interested in exploring GPT-4o will gain access to an API that promises to be twice as fast as GPT-4 Turbo while costing only half the price.

One of the notable new features introduced with GPT-4o is an enhanced voice mode for ChatGPT. This mode will allow the app to function as a real-time voice assistant, akin to the fictional AI system depicted in the movie “Her,” enabling it to observe its surroundings and respond accordingly. The current voice mode, in contrast, is more limited, responding to prompts one at a time and relying solely on audio input.

In a reflective blog post following the livestream event, Altman acknowledged a shift in OpenAI’s vision. While the company’s original goal was to “create all sorts of benefits for the world,” Altman recognized that the focus has evolved. Amid criticism for not open-sourcing its advanced AI models, Altman suggested that OpenAI’s role has shifted towards creating AI models and making them available to developers through paid APIs, allowing third parties to leverage these models to create innovative solutions that benefit society.

Prior to the GPT-4o launch, conflicting reports speculated that OpenAI might announce an AI search engine to rival Google, a voice assistant integrated into GPT-4 named Perplexity, or a entirely new and improved model called GPT-5. Notably, OpenAI timed this launch strategically ahead of Google I/O, the tech giant’s flagship conference, where various AI products from the Gemini team are expected to be unveiled.

Was this Answer helpful?
YesNo
Sabyasachi Roy

Recent Posts

Google Enhances Pixel Security with Live Threat Detection and Scam Call Alerts

Google is introducing powerful new security features to its Pixel smartphones, aiming to strengthen defenses…

1 week ago

OpenAI and Perplexity AI Challenge Google in the Search Domain

Summary OpenAI and Perplexity AI are emerging as formidable competitors to Google in the search…

3 weeks ago

The Future of AI Search: Promises and Pitfalls

As the internet's knowledge graph continues to expand at an unprecedented rate, traditional search methods…

1 month ago

Microsoft’s New AI Hype: What You Actually Need to Know

Summary Microsoft just dropped a bunch of updates to their AI assistant Copilot, and honestly,…

2 months ago

New Malware Uses Deceptive Tactics to Steal Google Credentials

Cybercriminals are getting creative, and a recently uncovered malware is no exception. Instead of using…

2 months ago

Google Abandons Third-Party Cookie Plan, Introduces User-Centric Tracking Control

Google's decision to prioritize user privacy over cookie-based tracking marks a significant shift in the…

3 months ago