Google unveils 'Project Jarvis'
PLUS: Meta releases NotebookLlama
Howdy! It’s Barsee again.
Happy Monday, AI family, and welcome back to AI Valley.
In today’s edition:
🤖💡 Google working on new ‘Project Jarvis’
📓 Meta releases NotebookLlama
🤖 Plus trending AI tools, posts, and resources.
Ready, set, go…
TOGETHER WITH STORYTELL
Struggling to make sense of endless data? Stop drowning and start making faster, smarter, and data-driven decisions with Storytell.ai
This AI tool automates complex data analysis, turning unstructured information within organizations into clear, actionable insights. Whether you're pulling data from various sources or preparing for critical decisions, it ensures you're not just ahead of the curve—you're leading the charge.
No more wasting hours sifting through data manually. With Storytell.ai, your team can focus on what truly matters—driving impactful business outcomes.
Ready to unlock the full potential of your data? Let AI power your next big business move.
Try it now or request a demo at [email protected]
🤖💡 Google working on new ‘Project Jarvis’
Google is reportedly developing "Jarvis," an AI agent designed to carry out tasks on behalf of users, such as research, shopping, and booking flights directly within their web browser.
Here's what you need to know:
Project Jarvis is expected to integrate with Google Chrome, automating tasks by analyzing screenshots of web content and performing actions like clicking buttons or typing.
It is expected to run on the upcoming Gemini 2.0 model, which is reportedly set to debut in December.
Google is expected to unveil Project Jarvis around that time alongside the new Gemini model, with initial access likely limited to a small group for testing.
The system currently has speed limitations, requiring a few seconds to process each action.
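The screenshot-then-act loop described above can be sketched roughly as follows. This is only an illustration: the `Action` type and the `take_screenshot` and `choose_action` stand-ins are hypothetical names, not anything from Google, and the real system's internals have not been published.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    """A hypothetical browser action: click a target or type text into it."""
    kind: str        # "click" or "type"
    target: str      # label of the on-page element
    text: str = ""   # only used when kind == "type"


def run_agent(
    take_screenshot: Callable[[], str],
    choose_action: Callable[[str, str], Optional[Action]],
    goal: str,
    max_steps: int = 10,
) -> List[Action]:
    """Repeat: capture the screen, ask the model for one action, record it.

    Both callables are stand-ins (assumptions). In a real agent, choose_action
    would be a model call, which is where a per-step delay would come from.
    """
    performed: List[Action] = []
    for _ in range(max_steps):
        screen = take_screenshot()
        action = choose_action(screen, goal)
        if action is None:  # the model signals that the task is done
            break
        performed.append(action)
    return performed


def make_scripted_model(script: List[Action]) -> Callable[[str, str], Optional[Action]]:
    """Build a fake model that replays a fixed list of actions, then stops."""
    remaining = list(script)

    def choose(screen: str, goal: str) -> Optional[Action]:
        return remaining.pop(0) if remaining else None

    return choose
```

In a loop like this, every action costs one model call, so a task that needs many clicks and keystrokes would take roughly that many multiples of the per-action delay.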
Why it matters:
The biggest AI companies are all working on models that perform similar tasks.
Microsoft’s Copilot Vision will let you talk with it about webpages you’re viewing.
Apple Intelligence is expected to be aware of what’s on your screen.
Anthropic debuted a Claude beta feature that can use a computer for you, and OpenAI is reportedly working on its own version.
With Project Jarvis, Google is joining the race to develop "computer-using agents," aiming to make AI-driven task management a part of daily life.
META
📓 Meta releases NotebookLlama
Meta has rolled out NotebookLlama, an open-source take on the podcast-generation feature of Google’s popular NotebookLM tool.
NotebookLlama leverages Meta's own Llama models for processing tasks, and similar to its Google counterpart, it can create conversational-style summaries from text files that users upload.
Here's what you need to know:
NotebookLlama works by converting a file, like a PDF, into a transcript, then adding dramatization and interruptions before passing the script to text-to-speech models.
While the voices in NotebookLlama samples can sound overly robotic and occasionally overlap unexpectedly, Meta's team believes they can fix these issues with more advanced models.
They also proposed an alternative podcast creation method where two agents debate the topic and draft the podcast outline, rather than relying on a single model.
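The transcript, dramatization, and text-to-speech stages described above can be sketched as a simple pipeline. This is a toy illustration, not Meta's code: every function name here is hypothetical, the Llama-based steps are replaced with trivial string handling, the input is assumed to be text already extracted from the PDF, and the `tts` callable is a placeholder for a real speech model.

```python
from typing import Callable, List, Tuple

Line = Tuple[str, str]  # (speaker, text)


def write_transcript(source_text: str) -> List[Line]:
    """Stand-in for the model step that turns source text into a script.

    Here each sentence simply alternates between two hypothetical speakers.
    """
    sentences = [s.strip() for s in source_text.split(".") if s.strip()]
    return [(f"Speaker {i % 2 + 1}", s + ".") for i, s in enumerate(sentences)]


def dramatize(transcript: List[Line]) -> List[Line]:
    """Stand-in for the rewrite step that adds interruptions.

    After every line except the last, the next speaker interjects briefly.
    """
    out: List[Line] = []
    for i, (speaker, text) in enumerate(transcript):
        out.append((speaker, text))
        if i + 1 < len(transcript):
            next_speaker = transcript[i + 1][0]
            out.append((next_speaker, "Oh, interesting!"))
    return out


def synthesize(transcript: List[Line], tts: Callable[[str, str], str]) -> List[str]:
    """Send each line to a text-to-speech callable (a placeholder here)."""
    return [tts(speaker, text) for speaker, text in transcript]


def make_podcast(source_text: str, tts: Callable[[str, str], str]) -> List[str]:
    """Chain the three stages: transcript, dramatization, speech."""
    return synthesize(dramatize(write_transcript(source_text)), tts)
```

Keeping each stage as a separate function mirrors the idea that any one of them, such as the speech model, could be swapped for a more advanced one without touching the rest.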
The hallucination problem in AI-generated podcasts:
AI-generated podcasts still face the 'hallucination problem,' with attempts to replicate NotebookLM’s podcast feature falling short. Despite some progress, NotebookLlama also struggles with this, signaling room for improvement in future versions.
PRESENTED BY STORYTELL
🚀 Level up as a product manager with AI
Discover how AI can help you extract user insights, automate tedious PM tasks, and drive smarter decisions across cross-functional teams.
BREAKING BYTES
Meta has signed a multi-year deal with Reuters to incorporate its news content into the Meta AI chatbot, allowing it to provide real-time answers to user queries about news and current events (4 minutes)
Waymo has secured $5.6 billion in new funding to expand its robotaxi service to new cities in 2025 (2 minutes)
IC-Light v2 was just released. It now runs on FLUX and looks like one of the best relighting tools available (demo)
MuVi can generate music that matches the visuals of videos by analyzing important features. It uses rhythmic synchronization and can control the style and genre of the music (4 minutes)
xAI adds image understanding capabilities to its Grok AI chatbot (2 minutes)
People are cloning The Rock and Lex Fridman using 100% local and open source AI tools like F5-TTS and Facefusion (2 minutes)
TRENDING TOOLS
ElevenLabs Voice Design > Generate a custom voice based on a text prompt. (link)
Loomos > Transform raw screen recordings into studio-quality videos in a single click. (link)
VisualSiteMap > Automatically generate beautiful visual sitemaps + high-resolution screenshots of any public or private website. (link)
Sequel > Obtain instant insights and visualizations by simply asking questions about your data. (link)
COOL FINDINGS / RESOURCES
Shockingly good super-intelligent summarization prompt. (link)
How to set Perplexity as your default search. You won't regret it. (link)
Robotics will soon have its ChatGPT moment. Here’s where to start with robotics. (link)
ElevenLabs recently dropped a Conversational AI SDK, making it super easy to build voice assistants. Here is the code. (link)
Notes on Anthropic’s computer use capability. (link)
Notes on the new Claude analysis JavaScript code execution tool. (link)
NotebookLM crash course. (link)
CONTENT CORNER
1/ This is a game-changer announcement by Apple around cryptography. It is the “HTTPS moment for AI” in some ways.
Here is what this means: your private confidential data can be pooled with other data sources and used to securely improve your UX and that of the wider community… x.com/i/web/status/1…
— Varun (@varun_mathur)
11:37 AM • Oct 27, 2024
2/ How AI could help us talk to animals.
3/ How AI is about to transform the world’s economy.
THAT’S ALL FOR TODAY
That’s all for today’s issue, folks.
💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee
👍️ Like what you see? Subscribe here
Thanks for being here.
REACH 100K+ READERS
Acquire new customers and drive revenue by partnering with us
Sponsor AI Valley and reach more than 100,000 entrepreneurs, founders, software engineers, investors, and more.
If you’re interested in sponsoring us, email [email protected] with the subject “AI Valley Ads”.