• AI Valley
  • Posts
  • 🐶 Google revealed 2 BIG AI models

🐶 Google revealed 2 BIG AI models

PLUS: Google launched Project Astra...

Together with

Howdy, happy Wednesday, AI family! 🐶

🧵Here are some useful AI updates and tools I gathered today:

  • 🍋 OpenAI announced GPT-4o. Here are the key takeaways

  • 🦾 Google revealed 2 BIG AI models

  • 🔍️ + new AI Tools, Resources, and other news

Reading time - 4 minutes only

OPENAI

OpenAI has unveiled a new AI model called GPT-4o, a multimodal system capable of handling text, images, and audio. The "o" in GPT-4o stands for "omni," reflecting the model's ability to manage 50 different languages with enhanced speed and quality.

Although the announcement livestream is great, the real gold nuggets are in the 22 demo videos they posted on their channel.

We watched all of them, and here are the key takeaways and use cases we all should know:

1. Accessibility for the Blind

GPT-4o can now look at your surroundings and describe them for you.

Why should you care? Imagine sending it the visual feed from something like the Meta Rayban glasses, and your AI assistant can describe what you’re seeing, and help you navigate your surroundings like never before (e.g., “Is what I’m holding a jar of peanut butter or a jar of vegemite?”).

This will be a game-changer for how the visually impaired lives their daily lives.

2. The Ultimate Learning Partner

Give GPT-4o a view of the math problem you’re working on, or the objects you want to learn the language translation of, and it can teach you like no other tool can.

3. Prepare for Interviews like Never Before

Have GPT-4o act like the company you’re interviewing for.

Why should you care? What’s changed is that the AI can now “see” you. So instead of just giving feedback on what you say, it can also give feedback on how you say it. Layer this on top of an AI avatar, and maybe you can simulate the interview itself in the future.

4. Your Personal Language Translator, wherever you go

Ask ChatGPT to translate between languages, and then speak normally.

5. Share Screen with your AI Assistant

Share the screen with your AI partner, and have them guide you through your work.

Why should you care? Now, this is something that will happen pretty soon. Being able to “share screen” with your AI assistant can help not just with coding, but even with other non-programmer tasks such as work in Excel, PowerPoint, etc.

6. A future where AIs interact with each other

Two GPT-4os are interacting with each other, which sounds indistinguishable from two people talking. (They even sang a song together!)

7. Brainstorm with two GPTs

The demo shows how you can talk to two GPT-4os at once.

Why should you care? The demo video is centered around harmonizing singing for some reason, but I think the real use case is being able to brainstorm with two specific AI personalities at once:

  • One’s a Devil’s Advocate, the other’s an Angel’s Advocate?

  • One provides the pros (the optimist), and the other gives the cons (the pessimist).

PS: In response to these, Google has launched some AI models, which I will share with you in a bit 👀 

TOGETHER WITH MIXO

Traditional website builders often promise simplicity, but making a sleek, professional website can be far trickier than expected. Navigating through design choices and dealing with complex code can be daunting.

But with AI platforms like Mixo, with just a simple prompt you can create an impressively professional multi-page website in seconds.

What's more, editing the design and content is straightforward with the help of the integrated AI editor.

This smart tech manages not just the content and structure of your website but also oversees customer subscriptions, SEO, and more, saving you time and leaving you to focus on launching your business idea.

GOOGLE

Credit: Google

The AI race is hot right now, folks! It’s been two days since OpenAI demoed its captivating AI, GPT-4o, and now Google is trying to steal some of that spotlight with their I/O 2024 event.

Google has introduced two major AI projects:

1. Project Astra

A multimodal AI assistant that can interpret visual and audio inputs in real-time, identify objects, locate misplaced items, and explain code.

This is Google's response to OpenAI’s GOT-4o.

2. Veo

Credit: Google

A text-to-video generator that allows users to create AI-generated videos from text prompts.

Veo has “an advanced understanding of natural language,” enabling the model to understand cinematic terms like “timelapse” or “aerial shots of a landscape.”

Here are some of the visuals generated by Veo.

Other good releases from the events are: 

  • 🌟 Gemini 1.5 Pro and Gemini Flash: Gemini 1.5 Pro and Flash, improved for tasks like translation, coding, and reasoning, are now available in preview globally and will launch in June.

  • 🗣️ Gemini Live: A feature that enables voice-based AI interactions with Google's AI assistant.

  • 🌐 Gemini Nano in Chrome: Google's lightweight LLM, Nano, will be integrated into the Chrome browser, enabling on-device AI features like text generation.

Useful AI Links

🔍️ Trending Tools

  • Brilliant - Its ever-expanding library of content will help you master the basics of AI with bite-sized lessons you can do in minutes a day, whenever, wherever (sponsored)

  • Mindsera - an AI-powered journal that offers personalized mentorship and feedback to enhance your mindset, cognitive skills, mental health, and fitness.

  • Chatmind - AI that quickly generates a complete mind map.

  • Clay - AI-powered tools for cultivating amazing personal and professional relationships.

  • Find AI - Research engine for companies and people.

  • Wegic - The first AI web designer & developer by your side.

  • Callfluent - A tool to create voice agents for automating business calls.

🤗 Resources

🤗 Good Vids

Google I/O Keynote in 17 minutes.

😮 Things you might like

1/4 Ilya and OpenAI are going to part ways.

2/4 Recap from GoogleIO.

3/4 Audience: That OpenAI voice sounds a little too sexy

Unitree: hold my beer

4/4 Seven features announced by OpenAI on Monday

100% completed

That’s all for today

Thanks for reading. See you next time! Uff uff 🐶

💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee

👍️ Like what you see? Subscribe Now

Reach 75K+ Founders, Software Engineers, & Operators

If you’re interested in advertising with us, send an email over to [email protected] with the subject “AI Valley Ads”.

Written with 💚 by Barsee (me) and Jet (my 🐶 )

What did you think of today's newsletter:

Your feedback helps to create better content for you.

Login or Subscribe to participate in polls.