Friday, April 3, 2026

Today's big story is OpenAI acquiring TBPN, the popular tech talk show that interviews founders and AI execs live every weekday. Google's countering with Gemma 4, a new family of open models that punch way above their weight — and they've switched to Apache 2.0 licensing. Meanwhile, AMD just dropped Lemonade, a fast local LLM server that uses both GPU and NPU, and Microsoft released three new foundational models for transcription, audio, and images.

Big Moves

OpenAI Acquires TBPN

OpenAI just bought TBPN, the cult-favorite tech podcast that goes live every weekday for up to three hours. The show has featured Sam Altman, executives from Meta, Microsoft, Palantir, and Andreessen Horowitz — basically everyone in AI. It'll operate independently under Chris Lehane's oversight. Discussion on HN.
OpenAI

Mistral Secures $830M in Debt Financing

Mistral raised $830 million in debt to fund a new AI data center. Not equity — debt financing for infrastructure, which signals they're betting big on having their own compute. It's a different play than most AI companies are making right now.
CNBC

New Models & Tools

Google Releases Gemma 4 with Apache 2.0 License

Four new vision-capable models from Google DeepMind: 2B, 4B, 31B, plus a 26B Mixture-of-Experts. Google emphasizes "unprecedented intelligence-per-parameter" — these are small models that punch way above their weight. And they switched to Apache 2.0 licensing, which is a big deal for commercial use. Simon Willison's breakdown is worth reading. HN discussion.
Google DeepMind

AMD's Lemonade: Fast Local LLM Server Using GPU and NPU

AMD released Lemonade, an open-source local LLM server that uses both GPU and NPU (neural processing unit) to run models faster on consumer hardware. If you've been wanting to run models locally without melting your laptop, this is worth checking out. Popular on Hacker News.
Hacker News

Microsoft Drops Three New Foundational Models

Microsoft's new MAI group (formed six months ago) just released three models: one for voice transcription, one for audio generation, and one for images. These are Microsoft's first serious in-house models outside of Copilot. Worth watching to see if they gain traction.
TechCrunch

Google Vids Gets AI Upgrade with Veo and Lyria

Google's video creation tool Vids now uses Veo 3.1 for video generation and Lyria 3 for music, both at no extra cost. You can also direct AI avatars through prompts now. If you've been waiting for an accessible way to prototype with generated video and audio, the barrier just dropped. More details.
Google AI Blog

Updates & Improvements

Google Introduces Flex and Priority Inference Tiers

Google added two new inference tiers to the Gemini API: Flex (cheaper, slower) and Priority (faster, costs more). This gives developers more control over the cost-latency tradeoff instead of one-size-fits-all pricing.
Google AI Blog

Google Home's Gemini Gets Better at Understanding Commands

Google updated the Home app so Gemini can better understand natural language commands for smart home control. You can now describe lighting like "the color of the ocean" instead of fighting with presets. Small quality-of-life improvement if you use Google Home.
The Verge

In Case You Missed It

The Claude Code Leak Breakdown

Following up on yesterday's leak story: this is a solid breakdown of what was in the Claude Code source that leaked via NPM. Ben's Bites also has a good analysis of the files. If you're curious what Anthropic's been building, these writeups are worth your time. HN discussion.
Build.ms

Anthropic's DMCA Takedown Hit 8,100 Legit GitHub Repos

Anthropic's effort to stop the Claude Code leak accidentally DMCA'd over 8,000 legitimate GitHub repos. They're working to reverse it, but it's a reminder of how blunt DMCA takedowns are. The leaked code is already out there — this is an uphill battle.
Ars Technica

Worth a Listen

Agentic Engineering on Lenny's Podcast

Simon Willison was on Lenny Rachitsky's podcast talking about agentic engineering, automation timelines, and the current state of AI. Simon's writeup includes highlights and relevant links from the conversation. Available on YouTube, Spotify, and Apple Podcasts.
Simon Willison

Moonlake on World Models

Latent Space interviewed Chris Manning and Fan-yun Sun from Moonlake AI about building long-running, multiplayer, interactive world models using agents bootstrapped from game engines. If you've been following the world models conversation, this caps it off nicely.
Latent Space

Today’s Sources