13/05/2024
OpenAI Updates 👇
⠀
OpenAI announced a new model, GPT-4o, which outperforms GPT-4 and, most importantly, is going to be free for all users! 🚀
⠀
An updated interface and a desktop app with voice control and screen-sharing capabilities have been introduced 🎤
⠀
Natively multimodal: from now on, text, images, and voice are all handled by ONE model 🗣️
⠀
Developers are not forgotten either: GPT-4o will be available in the API, where it is 2x faster, 50% cheaper, and has 5x higher rate limits than GPT-4 Turbo 💰
⠀
Voice mode has been greatly improved: you can now interrupt the model at any time instead of waiting for it to finish. OpenAI also managed to bring speech generation up to real-time speed and, most importantly, to make ChatGPT’s voice sound truly lifelike! 🎶
⠀
There is also an interactive mode, where you can share the view from your camera and talk with ChatGPT about it at the same time! 💬
⠀
Our take: It’s not a groundbreaking change but rather a step forward we anticipated. The audio integration is impressive, especially how intuitively it is paired with visuals. They are likely not streaming live video to the model but sending specific images instead, which makes sense.
⠀
Overall, OpenAI continues to excel in user experience: first by making language models user-friendly in ChatGPT, and now with a package combining audio, image, and possibly video capabilities. This marks OpenAI’s transition from research releases to product development.
⠀
Hopefully, in a year or two, we’ll have similar tools running locally, enhancing data security for sensitive work.