• AI Valley
  • Posts
  • Gemini can now watch your screen in real-time

Gemini can now watch your screen in real-time

PLUS: 1 Worker + AI = 2 Workers

Together with

Howdy again. It’s Barsee, and welcome back to AI Valley.

Another day, another AI adventure.

Today’s climb through the Valley reveals:

  • DeepSeek has quietly released DeepSeek-V3-0324

  • 1 worker + AI = 2 workers

  • Gemini can now watch your screen in real-time

  • Plus trending AI tools, posts, and resources

Let’s dive into the Valley of AI…

PEAK OF THE DAY

DeepSeek unveils updated V3 model and takes the lead

Source: Artificial Analysis

DeepSeek has quietly released DeepSeek-V3-0324, an upgraded language model with major improvements in reasoning and real-world programming tasks, outperforming its predecessor and even top competitors.

Here’s what you need to know:

  • It’s way better at problem-solving, coding, and math than its previous version and even beats top competitors like Claude 3.7 Sonnet.

  • Uses a special "Mixture-of-Experts" system with 685B parameters but only activates 37B at a time, making it faster and easier to run.

  • The model is about 641 GB, but with compression, it shrinks to 352 GB and can generate over 20 tokens per second on devices like the Mac Studio with an M3 Ultra chip.

  • The model is open-source on Hugging Face under an MIT license, allowing developers to freely modify and deploy it for commercial use.

  • It is also available for demo access via OpenRouter, where users can test its capabilities directly through a chat interface or API.

  • Costs way less than competitors — $0.14 per million input tokens vs. Claude’s $3.

Why it matters:

Every week, a new model drops with higher benchmark scores, raising the bar for future model releases. Recent notable releases include:

  • Gemma 3  (2 weeks ago) 

  • Mistral Small 3.1 (a week ago) 

  • Deepseek-v3-0324  (now)

DeepSeek V3-0324 represents a significant leap in AI reasoning capabilities, enabling more accurate problem-solving, decision-making, and logical analysis. This update is expected to lay the groundwork for DeepSeek R2, a more advanced reasoning model anticipated in early May 2025, potentially setting new standards in AI performance.

NVIDIA Blackwell GPU Clusters Now Live on Lambda

Image Source: Lambda

Multi-node NVIDIA HGX B200-accelerated clusters are now available on demand through Lambda 1-Click Clusters. Time for AI teams to start innovating faster with the latest and greatest NVIDIA GPUs, without the overhead of long-term contracts or complex infrastructure management.

  • 3x faster training

  • 15x faster Inference

  • Zero lock-in

*This is sponsored

VALLEY VIEW

Image Source: SSRN

A Procter & Gamble study with 776 professionals revealed that AI significantly boosts product development performance. Teams with access to GPT-4 or GPT-4o outperformed individuals without AI by 40%, while individuals using AI improved their results by 37%, matching the performance of non-AI teams. AI-supported teams delivered the highest quality overall and were three times more likely to produce top-tier solutions.

Google's search engine remains hugely profitable, but the company is urgently focusing on generative AI. Independent web publishers report declining traffic as Google's AI overviews display information directly in search results, sparking tensions between Google and content creators. Google frames this as part of search's evolving nature, but as major changes loom, the future of Search and the web could shift dramatically.

Google is rolling out screen sharing and real-time video interaction for Gemini Live as part of its Project Astra initiative. These features let the assistant "see" your screen and camera feed, providing contextual responses to user queries. Currently, they’re exclusive to Gemini Advanced subscribers on Android via the Google One AI Premium plan, with broader availability expected soon.

Cloudflare's new "AI Labyrinth" combats unauthorized AI data scraping by luring bots into fake AI-generated content, protecting websites while wasting crawler resources. Invisible to regular users, this digital maze prevents search engine indexing and marks a new front in the battle over unapproved data collection, as AI crawler requests surge to 50 billion daily.

Microsoft has introduced 11 new AI agents for its Security Copilot platform to automate repetitive cybersecurity tasks and enhance efficiency. These agents handle jobs like triaging phishing emails and generating regulatory notifications after data breaches, easing the burden on security teams managing thousands of alerts daily. Starting next month, six Microsoft-developed agents and five from partners will be available for preview, integrated across Microsoft’s security tools.

Alibaba has unveiled the Large Animatable Human Reconstruction Model (LHM), an AI tool that transforms a single photo into a lifelike, movable 3D human avatar. This technology enables the creation of realistic digital humans with diverse poses and clothing styles, opening new possibilities in movies, video games, and online shopping. The model is available as open-source code on GitHub and HuggingFace.

The first AI agent that delivers human-quality service

Intercom has rolled out some exciting updates to its AI agent, Fin, making customer support faster and more intuitive:

  • Understands Images: Fin can now process photos and screenshots, so customers can show issues instead of just describing them.

  • Custom Guidance: Fine-tune Fin’s behavior to align with your support needs.

  • Task Completion: Beyond answering questions, Fin can now complete tasks on behalf of your customers.

  • Voice Support: Fin works over voice and phone, bringing AI-powered calls into the mix.

Additionally, Intercom is introducing real-time AI translation in the human support inbox, enabling agents and customers to communicate seamlessly in any language.

*This is sponsored

TRENDING TOOLS

  • Base44 > Create fully functional apps or products without coding or manual integrations.

  • Sider 5.0 > A sidebar extension that conducts deep research and create expert-level reports effortlessly.

  • Alta > Provides virtual agents to automate sales tasks and boost revenue growth.

  • GPT Rules > Supercharge your AI chats with cursor style rules.

THINK PIECES / BRAIN BOOST

VALLEY GEMS

1/ Google is set to unveil a new model for Gemini this week.

2/ An exploratory study was performed on the feasibility of humanoid robots performing direct clinical tasks through teleoperation.

The system is evaluated across seven diverse medical procedures, including physical examinations, emergency interventions, and precision needle tasks.

3/ Autonomous drone delivery is going to change the entire commerce industry. Coming to a doorstep near you.

SUNSET IN THE VALLEY

Thank you for reading today’s edition. That’s all for today’s issue.

💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee

👍️ New reader? Subscribe here

Thanks for being here.

REACH 100K+ READERS

Acquire new customers and drive revenue by partnering with us

Sponsor AI Valley and reach over 100,000+ entrepreneurs, founders, software engineers, investors, etc.

If you’re interested in sponsoring us, email [email protected] with the subject “AI Valley Ads”.