On May 13, 2024, OpenAI announced GPT-4o, its newest flagship AI model, which works across text, images, and voice. The model supports real-time, lifelike voice conversation, a notable advance for the AI industry.
GPT-4o, where the 'o' stands for 'omni', is a step toward more natural human-computer interaction. The model can accept any combination of text, audio, and image inputs and generate any combination of them as output. It responds to audio input in as little as 232 milliseconds, with an average of 320 milliseconds, which is comparable to human response time in conversation. GPT-4o also matches GPT-4 Turbo on English text and coding, performs better on non-English text, and is faster and 50% cheaper in the API. According to OpenAI, it is also stronger at visual and audio understanding than existing models.
OpenAI demonstrated GPT-4o in a video on its website, highlighting its ability to hold real-time conversations. Users can talk with ChatGPT without noticeable delay and can even interrupt it while it is speaking, a capability expected of advanced AI voice assistants.
OpenAI CEO Sam Altman stated in a blog post, "This feels like the AI from the movies... Talking to a computer has never felt so natural, and it is now a reality."
The demonstration video showcased GPT-4o's new voice assistant feature, in which ChatGPT used both its visual and audio capabilities to help a researcher solve a math equation written on paper. Another demonstration showed GPT-4o engaging in playful conversation, responding humorously to a researcher's compliments.
OpenAI's CTO, Mira Murati, announced that GPT-4o would be available for free because it is more cost-effective than previous models. She added that paid users will have higher capacity limits than free users. The company confirmed that GPT-4o will be integrated into ChatGPT in the coming weeks.
With this launch, OpenAI positions GPT-4o as a significant development in AI, promising more natural and efficient interaction across text, audio, and images.