- AI Valley
- Posts
- An open-source model outperforms GPT-5
An open-source model outperforms GPT-5
PLUS: Perplexity to pay Snapchat $400M to power search
Together with
Howdy, it’s Barsee.
Happy Friday, AI family, and welcome to another AI Valley edition. This issue takes 4 minutes to read.
Today’s climb through the Valley reveals:
Kimi-K2 Thinking outperforms GPT-5
Perplexity to pay Snapchat $400M to power search
Plus trending AI tools, posts, and resources
Let’s dive into the Valley of AI…
VOICES.COM
The best AI voices don’t feel robotic, because they’re not. They’re powered by real, talented humans. Customers want a voice AI experience that feels real and authentic, and for voice AI to truly represent your company or your product, it needs to sound natural, distinct, and unmistakably on-brand.
Join Voices on November 13th at 1pm ET to hear how Voices is helping brands discover and source professional voices to bring their AI experiences to life.
You’ll also hear from BMW on how they create authentic in-car AI experiences with voice and how Voices helped them do it.
*This is sponsored
THROUGH THE VALLEY
Moonshot AI has released Kimi K2 Thinking, a trillion-parameter open-source model that just beat OpenAI’s GPT-5 and Anthropic’s Claude Sonnet 4.5 in key reasoning and coding benchmarks.
The model’s architecture activates 32 billion parameters per inference, supports 256k-token context windows, and can autonomously perform 200–300 sequential tool calls without human input.
Despite its size, it’s fast and cheap, costing just $0.15 per 1M tokens and $2.50 per 1M output tokens, nearly 10× cheaper than GPT-5 and 20x cheaper than Sonnet 4.5.
Moonshot built K2 to think like a human. Each answer shows every step of logic before reaching a conclusion, making its reasoning transparent and easy to audit.
Here are some of its benchmark results:
Humanity’s Last Exam: 44.9 % (with tools enabled)
BrowseComp: 60.2 % (web reasoning + search)
SWE-Bench Verified: 71.3 % (coding + tool use)
Across these tests, K2 Thinking consistently outperforms GPT-5, Claude Sonnet 4.5, and xAI’s Grok-4, setting a new bar for open-source reasoning models.
The release includes APIs for chat, reasoning, and automation. It’s fully open-source under a Modified MIT License. The only rule: if your app serves more than 100 million users or earns over 20 million dollars a month, you must credit “Kimi K2” in the interface.
Why does it matter?
For the first time, an open-weight AI model has not only caught up with but surpassed the world’s leading proprietary systems, and it’s free. While OpenAI pours trillions into chips and datacenters, Moonshot is proving that clever architecture > infinite compute. If enterprises can now get GPT-5-level reasoning at 1/10 the cost, the case for paying premium prices for closed systems starts to crumble.

AI search startup Perplexity will integrate its “answer engine” directly into Snapchat over the next year in a paid deal. Under the agreement, Perplexity will pay Snap $400 million over one year, through a combination of cash and equity
For Snap, this could be more than an AI experiment; it’s a strategic pivot. The company’s built-in chatbot, My AI, has gained users but struggled to stand out against Meta’s expanding AI features. Partnering with Perplexity could give Snap a fresh tech edge, helping reposition it from a social app into an AI-powered discovery hub.
For Perplexity, the partnership is massive: Snap’s 943 million-strong user base offers a direct line to a notoriously hard audience to capture (young users who increasingly get information from messaging apps instead of search engines).
Why does it matter?
The deal signals a generational shift in how people search the web. Instead of typing queries into Google, millions of younger users could soon be asking questions inside Snapchat, powered by Perplexity’s AI. It’s a bold, costly bet (nearly a third of Perplexity’s $1.5 billion funding), but if it works, it could cement Perplexity as the default search engine for many Snapchat users.
TRENDING TOOLS
Kimi K2 Thinking > Moonshot AI’s new open-source agent that pushes state-of-the-art reasoning to new heights
Firecrawl > Instantly extract a brand’s entire DNA from any website (colors, logos, frameworks, and more)
Cotera > Build AI agents directly in chat, connect them to hundreds of apps, and automate repetitive work
Stream Ring > A sleek AI smart ring that lets you record voice notes with just a whisper
TryCaddy > Control your computer and every app using only your voice
Manus 1.5 > A general-purpose AI agent that builds full-stack web apps through natural conversation
Orgo > Virtual computers for AI agents, they can now create files, browse the web, and run desktop apps autonomously
Gemini Deep Research > Adds new connectors for Gmail, Drive, Docs, and Chat to supercharge research workflows
MatterAl > A self-improving code intelligence layer that reviews, optimizes, and verifies code against your company’s architecture, security, and performance standards.
THINK PIECES / BRAIN BOOST
Tesla shareholders approve Elon Musk's $1 trillion pay package
If AI keeps doubling its task horizon every six months, the economy could hit breakneck growth
Sam Altman on trust, persuasion, and the future of intelligence
Ten robot dogs leading the charge in military, research, and industrial applications
Nvidia's Jensen Huang says, 'China is going to win the AI race,' FT reports
Quantinuum launches Helios, claiming “most accurate quantum computer in the world”
THE VALLEY GEMS
THAT’S ALL FOR TODAY
Thank you for reading today’s edition. That’s all for today’s issue.

💡 Help me get better and suggest new ideas at [email protected] or @heyBarsee
👍️ New reader? Subscribe here
Thanks for being here.
HOW WAS TODAY'S NEWSLETTER |
REACH 100K+ READERS
Acquire new customers and drive revenue by partnering with us
Sponsor AI Valley and reach over 100,000+ entrepreneurs, founders, software engineers, investors, etc.
If you’re interested in sponsoring us, email [email protected] with the subject “AI Valley Ads”.



