OpenAI releases new flagship generative AI model GPT-4o: smoother voice conversations, free of charge

OpenAI announced the launch of its latest flagship generative AI model, GPT-4o, which will be integrated into OpenAI's products in stages over the next few weeks. The most surprising thing is that GPT-4o will be free for all users.

OpenAI releases new flagship generative AI model GPT-4o: smoother voice conversations, free of charge

According to TechCrunch and other foreign media reports, OpenAI Chief Technology Officer Muri Murati said that GPT-4o will provide the same level of intelligence as GPT-4, but will have further improvements in text, image, and speech processing.

“GPT-4o can reason with a combination of speech, text, and visual information,” Murati said during a keynote at OpenAI’s headquarters. GPT-4o adds speech processing capabilities to OpenAI’s previous flagship model, GPT-4, which can process information mixed with images and text and complete tasks such as extracting text from images or describing the content of images.

GPT-4o will run much faster, and the biggest highlight is that its voice interaction mode uses new technology. OpenAI has been committed to allowing users to communicate with ChatGPT through voice, just like talking to a real person. However, the previous version seriously affected the immersiveness of the conversation due to latency issues. GPT-4o uses a brand-new technology to greatly improve the response speed of chatbot conversations.

At the conference, OpenAI showed a demonstration of using GPT-4o for voice conversation. After the presenter finished asking questions, GPT-4o responded almost instantly and read aloud through the text-to-speech function, making the conversation feel more natural and realistic.

Another demonstration showed GPT-4o adjusting its tone of voice when speaking upon request, and GPT-4o could change its voice according to instructions, from exaggerated and dramatic to cold and mechanical, showing excellent plasticity. Finally, the demonstration also showed GPT-4o's singing function.

In the past, when OpenAI released a new version of the ChatGPT model, it usually placed it behind a paywall. However, this time GPT-4o will be available to all users for free, and paying users can enjoy five times the call quota.

In addition, OpenAI also released a desktop version of ChatGPT and a new user interface. "We recognize that these models are becoming more and more complex," Murati said, "but we hope that the user's interaction experience with the artificial intelligence model can be more natural and easy, so that users can focus entirely on collaborating with the model without having to care about the interface itself."

This article comes from online submissions and does not represent the analysis of kookeey. If you have any questions, please contact us

Like (0)
kookeeykookeey
Previous May 13, 2024 4:12 pm
Next May 14, 2024 5:26 pm

Related recommendations