- AI Valley
- Posts
- 🐶 Google revealed 2 BIG AI models
🐶 Google revealed 2 BIG AI models
PLUS: Google launched Project Astra...
Together with
Howdy, happy Wednesday, AI family! 🐶
🧵Here are some useful AI updates and tools I gathered today:
🍋 OpenAI announced GPT-4o. Here are the key takeaways
🦾 Google revealed 2 BIG AI models
🔍️ + new AI Tools, Resources, and other news
Reading time - 4 minutes only
OPENAI
OpenAI has unveiled a new AI model called GPT-4o, a multimodal system capable of handling text, images, and audio. The "o" in GPT-4o stands for "omni," reflecting the model's ability to manage 50 different languages with enhanced speed and quality.
Although the announcement livestream is great, the real gold nuggets are in the 22 demo videos they posted on their channel.
We watched all of them, and here are the key takeaways and use cases we all should know:
1. Accessibility for the Blind
GPT-4o can now look at your surroundings and describe them for you.
Why should you care? Imagine sending it the visual feed from something like the Meta Rayban glasses, and your AI assistant can describe what you’re seeing, and help you navigate your surroundings like never before (e.g., “Is what I’m holding a jar of peanut butter or a jar of vegemite?”).
This will be a game-changer for how the visually impaired lives their daily lives.
2. The Ultimate Learning Partner
Give GPT-4o a view of the math problem you’re working on, or the objects you want to learn the language translation of, and it can teach you like no other tool can.
3. Prepare for Interviews like Never Before
Have GPT-4o act like the company you’re interviewing for.
Why should you care? What’s changed is that the AI can now “see” you. So instead of just giving feedback on what you say, it can also give feedback on how you say it. Layer this on top of an AI avatar, and maybe you can simulate the interview itself in the future.
4. Your Personal Language Translator, wherever you go
Ask ChatGPT to translate between languages, and then speak normally.
5. Share Screen with your AI Assistant
Share the screen with your AI partner, and have them guide you through your work.
Why should you care? Now, this is something that will happen pretty soon. Being able to “share screen” with your AI assistant can help not just with coding, but even with other non-programmer tasks such as work in Excel, PowerPoint, etc.
6. A future where AIs interact with each other
Two GPT-4os are interacting with each other, which sounds indistinguishable from two people talking. (They even sang a song together!)
7. Brainstorm with two GPTs
The demo shows how you can talk to two GPT-4os at once.
Why should you care? The demo video is centered around harmonizing singing for some reason, but I think the real use case is being able to brainstorm with two specific AI personalities at once:
One’s a Devil’s Advocate, the other’s an Angel’s Advocate?
One provides the pros (the optimist), and the other gives the cons (the pessimist).
PS: In response to these, Google has launched some AI models, which I will share with you in a bit 👀
TOGETHER WITH MIXO
Traditional website builders often promise simplicity, but making a sleek, professional website can be far trickier than expected. Navigating through design choices and dealing with complex code can be daunting.
But with AI platforms like Mixo, with just a simple prompt you can create an impressively professional multi-page website in seconds.
What's more, editing the design and content is straightforward with the help of the integrated AI editor.
This smart tech manages not just the content and structure of your website but also oversees customer subscriptions, SEO, and more, saving you time and leaving you to focus on launching your business idea.
Credit: Google
The AI race is hot right now, folks! It’s been two days since OpenAI demoed its captivating AI, GPT-4o, and now Google is trying to steal some of that spotlight with their I/O 2024 event.
Google has introduced two major AI projects:
1. Project Astra
A multimodal AI assistant that can interpret visual and audio inputs in real-time, identify objects, locate misplaced items, and explain code.
This is Google's response to OpenAI’s GOT-4o.
2. Veo
Credit: Google
A text-to-video generator that allows users to create AI-generated videos from text prompts.
Veo has “an advanced understanding of natural language,” enabling the model to understand cinematic terms like “timelapse” or “aerial shots of a landscape.”
Here are some of the visuals generated by Veo.
Other good releases from the events are:
🌟 Gemini 1.5 Pro and Gemini Flash: Gemini 1.5 Pro and Flash, improved for tasks like translation, coding, and reasoning, are now available in preview globally and will launch in June.
🗣️ Gemini Live: A feature that enables voice-based AI interactions with Google's AI assistant.
🌐 Gemini Nano in Chrome: Google's lightweight LLM, Nano, will be integrated into the Chrome browser, enabling on-device AI features like text generation.
Useful AI Links
🔍️ Trending Tools
Brilliant - Its ever-expanding library of content will help you master the basics of AI with bite-sized lessons you can do in minutes a day, whenever, wherever (sponsored)
Mindsera - an AI-powered journal that offers personalized mentorship and feedback to enhance your mindset, cognitive skills, mental health, and fitness.
Chatmind - AI that quickly generates a complete mind map.
Clay - AI-powered tools for cultivating amazing personal and professional relationships.
Find AI - Research engine for companies and people.
Wegic - The first AI web designer & developer by your side.
Callfluent - A tool to create voice agents for automating business calls.
🤗 Resources
How to make your content go viral with ChatGPT (prompts you can copy and paste)
On-boarding your AI Intern.
The Next Big Programming Language Is English
3 prompts to generate 100% human-like content in ChatGPT
🤗 Good Vids
Google I/O Keynote in 17 minutes.
😮 Things you might like
1/4 Ilya and OpenAI are going to part ways.
Ilya and OpenAI are going to part ways. This is very sad to me; Ilya is easily one of the greatest minds of our generation, a guiding light of our field, and a dear friend. His brilliance and vision are well known; his warmth and compassion are less well known but no less… x.com/i/web/status/1…
— Sam Altman (@sama)
11:02 PM • May 14, 2024
2/4 Recap from GoogleIO.
Google vs OpenAI is the beef we’re all here for
Dumping updates from I/O in this 🧵
— Jerry Liu (@jerryjliu0)
5:30 PM • May 14, 2024
3/4 Audience: That OpenAI voice sounds a little too sexy
Unitree: hold my beer
Good lord
We’re cooked— Nick St. Pierre (@nickfloats)
2:52 PM • May 14, 2024
4/4 Seven features announced by OpenAI on Monday
OpenAI wins the internet with another big breakthrough in AI.
It takes their ChatGPT capabilities to a whole new level.
Here are 7 revolutionary innovations they unveiled today:
— Barsee 🐶 (@heyBarsee)
6:03 PM • May 13, 2024
100% completed
That’s all for today
Thanks for reading. See you next time! Uff uff 🐶
💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee
👍️ Like what you see? Subscribe Now
Reach 75K+ Founders, Software Engineers, & Operators
If you’re interested in advertising with us, send an email over to [email protected] with the subject “AI Valley Ads”.
Written with 💚 by Barsee (me) and Jet (my 🐶 )
What did you think of today's newsletter: Your feedback helps to create better content for you. |