Microsoft Unveils Autonomous AI Agents
PLUS: Figure 02 Humanoids are now 400% Faster
Howdy! It’s Barsee again.
Happy Wednesday, AI family, and welcome back to AI Valley.
In today’s edition:
Figure 02 Humanoids are now 400% Faster
Microsoft Unveils Autonomous AI Agents
Plus trending AI tools, posts, and resources
Ready, set, go…
FIGURE
Figure 02 Humanoids are now 400% Faster
Figure founder Brett Adcock recently shared a glimpse of Figure 02 humanoid robots performing industrial tasks autonomously for BMW.
Here's what you need to know:
Figure 02 is now 4x faster, 7x more accurate, and significantly more reliable than in its initial trial three months ago.
The robots operate fully autonomously, completing around 1,000 placements daily.
Robots are trained daily using a replica of BMW’s South Carolina plant and NVIDIA’s digital twin technology, ahead of their full-time deployment in January 2025.
Figure is committed to scaling production, with plans to deploy millions of humanoid robots for commercial and domestic use.
Engineers are already working on the next-gen Figure 03 robot, with the company hiring 100+ engineers across various roles to support development.
Why it matters:
Founded just two years ago, Figure is rapidly advancing humanoid robotics, aiming to lead a market with transformative potential for industries and everyday life.
SIDE UPDATES
OpenAI is rolling out its Advanced Voice Mode for ChatGPT on the web, enabling users to have real-time, natural conversations directly from their browsers. Using GPT-4o's audio capabilities, Advanced Voice Mode allows ChatGPT to pick up on non-verbal cues like speaking speed and respond with emotion, making chats more dynamic.
Google’s Gemini AI now has a memory feature that remembers users' interests and preferences for more relevant responses. Users can easily view, edit, or delete shared information and see when it’s used. The feature is available to Gemini Advanced subscribers in English.
Niantic is building a “Large Geospatial Model” (LGM) using data from millions of players to help AI navigate physical spaces. This model, similar to Large Language Models, will enable machines to understand and interact with environments. By training AI on geolocated images from Niantic’s games like Pokémon Go, the LGM aims to advance augmented reality, robotics, and autonomous systems.
Allen AI's OpenScholar is a new model that answers research questions by searching for relevant papers and generating responses based on them. Using a 45M paper database and an 8B parameter LLM, it helps researchers explore scientific literature. On ScholarBench, OpenScholar-8B beat GPT-4o and other models in accuracy and citations, while being much more cost-effective.
New experimental models have appeared on lmarena, including the "Anonymous Chatbot," which may be the 4.0 update from November 11, or something more advanced. Google has released two models: "Secret Chatbot" (Gemini), which performs well, and "Mystery Gemini 3," which underperforms. With many models now available, it's unclear which are breakthroughs. lmarena remains a hub for anonymous AI experiments.
O2 has launched Daisy, an AI-powered "granny" designed to waste scammers’ time with lifelike, rambling conversations. Trained in partnership with YouTuber Jim Browning, Daisy is added to scam call lists to keep fraudsters on the line and prevent them from targeting real victims.
Kim Kardashian showcased Tesla's Gen 2 humanoid robot, Optimus, on social media. The robot is expected to enter full production by 2026, with an estimated cost of $20,000 to $30,000.
MICROSOFT
Microsoft Unveils Autonomous AI Agents For Copilot
At Ignite 2024, Microsoft released several ready-made AI agents in Copilot that can handle everything from simple tasks to complex multi-step processes.
Here's what you need to know:
The SharePoint agent lets users create personalized agents for real-time answers, which can be shared across chats, meetings, and emails while ensuring data privacy.
The Interpreter agent can clone participants' voices and offer real-time translation in up to 9 languages during Teams meetings.
The Employee Self-Service agent handles workplace policy questions and automates HR and IT tasks, like retrieving payroll info or processing leave requests.
The Facilitator agent takes real-time notes in Teams, summarizing key information as conversations unfold.
The Project Manager agent automates tasks in Planner, creating projects, assigning tasks, tracking progress, and generating status reports.
Why it matters:
These AI agents help businesses cut costs, boost creativity, and speed up innovation by automating tasks and allowing employees to focus on more important work.
TRENDING TOOLS
Thoughtly > AI-powered, human-like phone agents. Reduce costs, increase efficiency, and seamlessly integrate into your systems [Link] *
Mistral > Free ChatGPT Plus Alternative with web search, image generation, agents (like GPTs) and even a Canvas feature [Link]
Documind > Open-source platform for extracting structured data from documents [Link]
Layer > Build visual tree structures of your projects and goals in just a few clicks [Link]
COOL FINDINGS / RESOURCES
Build your own screen recorder app with Replit AI Agent - by Paul [Link]
How to connect AI automations to Bolt.new and build your first SaaS - by Kevin [Link]
Our brains are vector databases, here’s why that’s helpful when using AI - by VentureBeat [Link]
Discover how to spot an underserved market niche and create a prototype with Bolt AI - by Nodus Labs [Link]
CONTENT CORNER
Future of programming with AI.
AI predictions for the next five years.
predictions for the next five years.
1. multimodal mastery:
- models will seamlessly handle text, images, video, audio
- we'll probably see llms that can understand and manipulate 3d spaces
- real-time processing of multiple inputs simultaneously
- ability to generate and edit… x.com/i/web/status/1…— 🍓🍓🍓 (@iruletheworldmo)
7:13 PM • Nov 18, 2024
THAT’S ALL FOR TODAY
That’s all for today’s issue, folks.
💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee
Thanks for being here.