Abhi I.’s Post

OpenAI Unveils GPT-4o: The New Battle for the Gateway UI!

OpenAI demoed the capabilities of its newest flagship model, GPT-4o, and is making it available through an updated UI as well as through the API. GPT-4o ("Omni") is a conversational model that improves its capabilities across text, audio, and vision. The new model reasons across voice, text, and vision instead of merely stitching them together, and it can be combined with real-time vision (both still images and video), memory, GPTs, browsing, and advanced data analysis. The result is a model roughly 2x faster and, crucially, 50% cheaper per token, with 5x higher rate limits compared to GPT-4.

The most powerful impact, though, is how conversational and natural it sounds, with the ability to use different tones and even resume after an interruption because it understands the context and tone of the conversation. The demo was eye-opening, with no perceptible lag. Demos ranged from math tutoring to bedtime stories to real-time translation between Italian and English. The model clearly picked up on emotions and changed its tone and voice to express them. What was magical, and just slightly creepy, was that it gave the appearance of enjoying the interaction! Those of you who were fascinated by the movie "Her" know where this could go.

With this demo, OpenAI has put all other providers of natural-language assistants on notice; don't be surprised to see Apple, Microsoft, Google, and everyone else in this space fire back. In a bid to make ChatGPT even more ubiquitous, OpenAI also rolled out a desktop version of ChatGPT and an improved web UI. This feels like shades of the last browser battle playing out all over again. If you recall that bygone era, Netscape pioneered the paradigm, only to be caught and passed by Microsoft, Google, and Apple, each bundling its own entrant with its devices and operating systems.
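Since the post mentions that GPT-4o is also available through the API, here is a minimal sketch of what a call might look like. The endpoint and payload shape follow OpenAI's public Chat Completions API; the helper function names are my own, and the example only builds the request locally rather than asserting anything about live API behavior.

```python
"""Minimal sketch: calling GPT-4o via the Chat Completions API.

Assumptions (not from the post): OPENAI_API_KEY is set in the
environment before send_chat_request() is actually invoked.
"""
import json
import os
import urllib.request


def build_chat_request(prompt: str, model: str = "gpt-4o") -> dict:
    # Chat Completions request body: a model name plus a list of messages.
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }


def send_chat_request(payload: dict) -> dict:
    # POST the JSON payload to OpenAI's endpoint; requires a valid API key.
    req = urllib.request.Request(
        "https://api.openai.com/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


# Build (but do not send) a request, to show the payload shape.
payload = build_chat_request("Summarize the GPT-4o announcement in one line.")
print(payload["model"])  # gpt-4o
```

The same model name works with the official `openai` Python package, where the equivalent call is `client.chat.completions.create(model="gpt-4o", messages=...)`.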
Mira Murati, who led the introduction, also emphasized OpenAI's focus on safety, not a surprise given the drama of the last year.

In terms of implications for others, what comes to mind is a recent interview with Sam Altman and Brad Lightcap (I am paraphrasing): "If your company builds on a foundation model and you cheer a 10x improvement, then you are likely building the right way and have a sustainable business model; if it sends a chill down your spine, you may want to reformulate your product/company thesis."

Would love to hear your thoughts on this announcement. Do you agree that this represents a browser-like moment to capture the new UI gateway for the masses? Who is most at risk? Who benefits the most? What would you do differently as a result of what you saw in the demo? #genai #openai

Introducing GPT-4o

https://www.youtube.com/

Thanks for sharing, Abhi. This is great for an improved experience. As a general user, the value-add will come primarily in two forms: faster responses during peak hours and more effective engagement. With GPT-4, it is often necessary to refine queries to get the optimal result. Hopefully, the experience with this new version will be more exciting and engaging, with enhanced empathy and emotional intelligence to better interpret tone during human interactions. Exciting indeed...

Nasir Mahmood

Generative AI Consultant | Ex-PwC, Deloitte, AWS, Accenture Executive | Trusted Advisor to Fortune 100 Companies.

2mo

Thanks for sharing, Abhi I.. I've been using 4o since its recent release, and the results have been truly astonishing in comparison.

Multimodality is a huge game changer, imo. Exciting (and scary) times ahead! And I hope they bring Scarlett Johansson's voice back 😁
