AI Friday — Thursday, June 25, 2026

Hardware & Infrastructure

OpenAI unveils its first custom chip, built by Broadcom

OpenAI is building its own AI inference chips in partnership with Broadcom—a major signal that relying on Nvidia isn't enough anymore. This isn't about training; it's about running models faster and cheaper at scale. Discussion on HN.

TechCrunch

IBM claims world's first sub-1 nanometer chip technology

IBM announced a breakthrough in transistor density that could push performance and efficiency forward. This matters for anyone running large models locally or in the cloud—better chips mean faster inference and lower power costs.

Ars Technica

New Models & Capabilities

Computer use in Gemini 3.5 Flash

Google just gave Gemini the ability to see and interact with your screen—similar to Claude's computer use feature. This opens up automation possibilities: take screenshots, click buttons, fill forms. It's in Gemini 3.5 Flash, so it's fast enough for real workflows. HN discussion.

Google DeepMind

How agents are transforming work

OpenAI published new research on agentic AI—systems that can plan, execute, and iterate without constant human input. The paper shows real productivity gains in customer support, coding, and research workflows. If you're building with agents, this is worth reading for framing what's actually possible.

OpenAI Blog

Geopolitics & Access Control

Anthropic says Alibaba illicitly extracted Claude AI model capabilities

Anthropic accused Alibaba of extracting Claude's capabilities without authorization—adding fuel to ongoing tensions between U.S. AI labs and foreign competitors. HN thread has more context on the broader export control landscape.

Reuters

NSA lost access to Mythos amid Anthropic dispute

The NSA apparently had access to Anthropic's security research tools, but that relationship soured. This signals how fragile government-industry AI partnerships can be. HN discussion.

The New York Times

Tools & Frameworks

RubyLLM: A Ruby framework for all major AI providers

Tired of writing custom code for every API? RubyLLM gives you a unified interface to OpenAI, Anthropic, Google, Mistral, and others. Write once, swap providers with a config change. If you're building Rails apps with AI, this could save you hours. HN thread.

Hacker News

Haystack: Open-Source AI Framework for Production Ready Agents and RAG

Haystack is a solid open-source framework for building retrieval-augmented generation (RAG) and agentic systems. If you're past the prototype stage and need something production-ready, it's worth evaluating alongside LangChain and LlamaIndex.

Hacker News

Interesting Reads & Experiments

For most of the world, open-source AI is the only way forward

This piece argues that proprietary AI from OpenAI, Anthropic, and Google creates a two-tier world—access for the wealthy, exclusion for everyone else. Open models like Llama and Mistral are the only path to democratization. Worth reading if you care about AI equity. HN discussion.

TechStrong

Big AI labs are hiring philosophers

OpenAI, DeepMind, and Anthropic are all building philosophy teams. Not because they've gone soft—they're grappling with real questions about agency, alignment, and what it means when AI systems can make decisions. It's a signal that the industry knows it's past the point of pure engineering.

The Economist

What I'm Finding About LLM Code Style and Token Costs

A practical breakdown of how code formatting affects token usage—and thus cost—when feeding code to LLMs. Simple stuff like variable naming and spacing can change your bill. Useful if you're optimizing prompts for production.

Hacker News

Also

Qwen-AgentWorld: Language World Models for General Agents — New research on world models for agentic systems—understanding environment dynamics without explicit training.
Mistral OCR 4: Document Reading at Scale — We covered this yesterday, but if you missed it: fast, accurate text extraction from images and PDFs. Now with better multilingual support.
Recall: Local Project Memory for Claude Code — Community project that adds persistent memory to Claude in VS Code. Solid if you're using Claude for extended coding sessions.
Claude Tag: Agents That Work Together in Slack — Yesterday's big launch is still generating momentum. If you use Slack, this is worth setting up today.
Meta's Creator Studio has been revived as an AI companion app — Meta is bringing back Creator Studio as a standalone AI assistant for creators. Part of their broader push into agentic tools.
Building Reliable Agentic AI Systems: A Real-World Framework — Martin Fowler's thoughtful piece on making agents reliable in production—error handling, retries, human-in-the-loop patterns.
LLMs Are Complicated Now — A grounded take on why the simple 'prompt + LLM' era is over. Systems thinking is back in style.
Bible as RAG Database — A clever demo showing RAG applied to biblical text—search, retrieve, and reason over scripture. Fun proof of concept.
DiffusionBench: Evaluating Generative Diffusion Transformers — New benchmark for diffusion-based models. Useful if you're evaluating image or video generation tools.
Europe is pushing back on Washington's chip war — As the U.S. restricts chip exports to China, Europe is pushing back on being dragged into trade wars. Geopolitics colliding with AI infrastructure.
Team Topologies for Agentic AI: Rethinking How Teams Structure Around Agents — How should teams organize when half the work is done by AI? A thoughtful reframing of org structure for the agent era.

Hardware & Infrastructure

New Models & Capabilities

Geopolitics & Access Control

Tools & Frameworks

Interesting Reads & Experiments

Also

Today’s Sources