- AI Valley
- Posts
- 🐶 Meta's just dropped SAM 2
🐶 Meta's just dropped SAM 2
PLUS: OpenAI rolls out “Her” like Voice Mode
Together with
Howdy,
It’s Barsee, again. Happy Wednesday, AI family, and welcome back to AI Valley 🐶
In today’s edition:
🎥 Meta’s new AI tracks every object in Videos
🗣️ OpenAI starts rolling out “Her” like Voice Mode for ChatGPT
🔍️ Plus trending AI tools and resources
Ready, set, go…
🤖🍨 MAIN SCOOP
Meta's just dropped SAM 2, a game-changer for real-time image and video segmentation. This new AI model can recognize and track the movement of objects in your videos, with just a few clicks.
What's new?
SAM 2 now works with video, unlike its predecessor, which only handled still images.
It's 6x faster and more accurate than the original SAM.
Analyzes video at about 44 frames per second, making it almost real-time.
Needs three times fewer human interactions as compared to older models.
It comes with the new SA-V dataset, which is 4.5 times larger than previous datasets and features 53 times more annotations.
And the best part? It’s under the Apache 2.0 license, so anyone can use it for free.
Why it matters?
SAM 2 will take video editing and AI-based video generation to the next level. Meta says it can also help in creating new experiences for their mixed-reality projects and could improve computer vision for autonomous vehicles by providing precise object tracking.
I found Merlin, one AI chat with all AI models, including GPT-4o, Claude, Gemini, and more.
Here is what I found relevant:
Chat with PDF
Chat with Websites
Web-scraping
Content repurposing
Summarization/transcription
All of it in 1-click feels like pure magic. It's as if Merlin can read my mind and deliver exactly what I need, every time.
They have 1 million+ users on the Chrome store, check here. But that's not it, Merlin integrates with my LinkedIn & Twitter so that I can create engaging content faster and more effortlessly.
*Indicates our partner’s link
🤝 THE LATEST IN
TECH
Logitech wants to sell a subscription-based ‘Forever Mouse’ (3 min read)
Why The Tech Industry Is Moving From The Metaverse To Spatial Computing (2 min read)
AI
GPT-5: Everything you need to know (10 min read)
AI boyfriends are on the rise (3 min read)
Apple releases the first preview of its long-awaited iPhone AI (2 min read)
You can now turn still images into AI videos with Runway Gen-3 Alpha (3 min read)
Google's AI Olympics commercial is backfiring in a big way (2 min read)
Perplexity AI will share revenue with publishers after plagiarism accusations (2 min read)
Meta is rolling out its AI Studio in the US for creators to build AI chatbots (3 min read)
BUSINESS
The top business use cases for generative AI (5 min read)
IBM's generative AI business is small but booming (3 min read)
🔗 USEFUL AI LINKS
TRENDING TOOLS
Chatling > build and deploy advanced AI chatbots on any website in less than 15 minutes. *
Jamie > get human-quality meeting summaries after each meeting.
Beloga > a read-it-later app with advanced insights.
Topview AI > an online AI video editor that turns your links or media assets into viral videos with one click.
Simply Draw > it helps you learn to draw by providing a customized learning journey.
Jovu > a tool that transforms ideas into production-ready code in an instant.
Greptile > an AI code review bot that has full context of your codebase. *
HOT FINDINGS / RESOURCES
I asked 10 businesses how they ac use AI.
Learn about Autonomous AI Agents: From concept to real-world application.
Tools vs. Chatbots։ How to write a business plan with AI.
Video: SearchGPT vs PerplexityAI.
13 hidden open-source libraries will help you become an AI wizard.
7 best generative AI use cases for business.

📰🍨 SIDE SCOOP
OpenAI is now rolling out its Advanced Voice Mode to a select group of paid ChatGPT users.
So, what’s the deal?
This feature was first revealed at the GPT-4o launch event in May, where the AI voice, Sky, sounded strikingly like Scarlett Johansson.
The resemblance caused quite a stir, leading OpenAI to pause and run more safety tests.
They’ve since tested GPT-4o's voice capabilities with over 100 external experts in 45 languages. The model now uses only four preset voices, which means no more mimicking celebrities.
What’s new with this update?
The new feature lets ChatGPT respond in real time and handle interruptions smoothly. It can also pick up on humor, sarcasm, and more. Plus, the new setup skips the speech-to-text conversion, which makes interactions faster and more natural.
🔥 DAILY DOSE OF CONTENTS
Friend, an AI necklace (with no voice output) promises to never let you feel lonely anymore.
AI and The Next Computing Platforms With Jensen Huang and Mark Zuckerberg (59-minute video)
What It's Like Using a Brain Implant With ChatGPT. (5-minute video)
😃 WEDNESDAY MOTIVATION
“The person who is willing to suffer the longest wins.”
👋 THAT’S ALL FOR TODAY
Thanks for reading.


💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee
👍️ Like what you see? Subscribe here
REACH 90K+ READERS
Acquire new customers and drive revenue by partnering with us
Sponsor AI Valley and reach over 90,000+ entrepreneurs, founders, software engineers, investors, etc.
If you’re interested in sponsoring us, email [email protected] with the subject “AI Valley Ads”.
Help me improve AI Valley |
We appreciate your continued support! We'll catch you in the next edition 👋
💚 Written and edited by Barsee, Jet, and Akash Thapa.