AI Friday — Saturday, May 9, 2026

AI Safety & Security: The Hard Problems

AI is breaking two vulnerability cultures

Security researchers rely on coordinated disclosure—quietly reporting bugs before going public. AI makes this harder: you can't keep an exploit secret when the vulnerability exists in a model weights file that's already been shared globally. This piece cuts through the noise to explain why traditional vulnerability disclosure is collapsing and what we do about it. Discussion on HN.

Hacker News

Claude Code CVE-2026-39861: Sandbox escape via symlink

A real vulnerability found in Claude Code's sandbox—agents could escape using symlinks. It's now patched, but this is the kind of thing you want to see caught and fixed early. The fact that this exists and is being openly tracked says something good about how the space is maturing.

GitHub Security Advisories

Can LLMs model real-world systems in TLA+?

Formal verification—proving your system is correct before it breaks—is hard. Can AI help? This deep-dive explores whether LLMs can write TLA+ specifications (a formal logic language) for real systems. Spoiler: it's complicated, but there's promise. HN discussion.

Hacker News

Understanding Claude's Thinking

Teaching Claude Why: Explaining reasoning in language

Anthropic released a new research piece on how Claude learns to explain its own reasoning—why it chose a particular answer, not just what the answer is. This matters for trust and debugging. When Claude can tell you *why* it did something, you can actually use it for high-stakes work. HN discussion.

Anthropic Research

A recent experience with ChatGPT 5.5 Pro

Real-world test drive: a mathematician uses GPT-5.5 Pro on actual work problems. It's honest about what the model can and can't do—useful for getting a sense of where the ceiling is right now. Not hype, just evidence. HN thread.

Hacker News

Tools & Things People Built

Git for AI Agents (re_gent)

Someone built version control for AI agent workflows—tracking agent state, branching, and rollback like you would for code commits. This is exactly the kind of boring infrastructure that makes agents actually usable in production. Show HN discussion.

Hacker News

Using Claude Code: The Unreasonable Effectiveness of HTML

Thariq Shihipar from the Claude Code team argues that HTML—not Markdown—is the right format for agents to work with. It's semantic, it's flexible, and it lets agents actually see structure. A smart piece on what makes agentic coding actually work in practice.

Simon Willison

Ben's Builds #3: An Email App

Ben Tossell is shipping an AI-native email app. This is the kind of "I'm actually building something" energy the industry needs—not a white paper, not a panel discussion, just shipping. Watch what works and what breaks.

Ben's Bites

CyberSecQwen-4B: Small, specialized security models

A 4B parameter open-source model trained for cybersecurity work—fast enough to run locally, specialized enough to actually be useful. Part of a broader shift: instead of one huge model for everything, people are training lean specialists.

Hugging Face Blog

The Quiet Structural Story

Anthropic growing 10x/year while the industry lays off >10%

While OpenAI and other labs are cutting costs, Anthropic is hiring aggressively—10x headcount growth. This is a structural signal about who investors think will win. It's quiet but important context for understanding where the industry is actually moving.

Latent Space

Cloudflare says AI made 1,100 jobs obsolete—while revenue hits record high

CEO Matthew Prince: because of AI, we don't need as many support roles. The company is cutting headcount while growing revenue. This is the actual economic story—not "AI will kill jobs someday," but "it's happening now, and businesses are optimizing."

TechCrunch

Worth a Listen

The Week the AI Story Shifted

A week-in-review podcast that steps back from the noise. The narrative around AI is forking: away from job-apocalypse panic toward a more realistic picture of how AI actually diffuses through the economy. Worth an hour of your time if you're thinking strategically.

AI Daily Brief Podcast

Also

Google is linking more sources in AI Overviews — Google's AI search is adding better attribution and source links. Small move, big signal about how AI search will compete with traditional search.
Running Codex safely at OpenAI — OpenAI's playbook for running Codex securely: sandboxing, approvals, network policies, telemetry. Practical security architecture for coding agents.
Intel's comeback story is wilder than it seems — Intel stock up 490% in a year. Wall Street is betting on a chip architecture comeback. Context for why compute competition matters.
Sony says AI tools will flood the game market — PlayStation maker predicts AI will lower barriers to game creation. Human artists "must remain at the center." The supply question everyone's asking.
All my clients wanted a carousel, now it's an AI chatbot — A freelancer's honest take: client expectations have shifted. Everyone wants "AI" now. Funny and real.
GPT-5.5 Price Increase: What It Costs — We covered this yesterday; still useful as reference for cost estimates.
EMO: Mixture of experts for emergent modularity — Allen AI research on training models with selective specialization. Technical but elegant—shows how next-gen models might be structured.
Nanoleaf bets its future on robots, red light therapy, and AI — Smart lighting company pivoting to robotics and wellness. Interesting bet on where the consumer AI market goes.
Data centers are the physical foundation for AI — The infrastructure story: massive power requirements, grid strain, environmental questions. You can't talk about AI adoption without talking about this.
Microsoft worried OpenAI would 'shit-talk' Azure to Amazon — Court documents reveal internal anxiety during early Microsoft-OpenAI partnership talks. Behind-the-scenes politics around the AI compute deals.

AI Safety & Security: The Hard Problems

Understanding Claude's Thinking

Tools & Things People Built

The Quiet Structural Story

Worth a Listen

Also

Today’s Sources