OpenAI is building its own AI inference chips in partnership with Broadcom—a major signal that relying on Nvidia isn't enough anymore. This isn't about training; it's about running models faster and cheaper at scale. Discussion on HN.
IBM announced a breakthrough in transistor density that could push performance and efficiency forward. This matters for anyone running large models locally or in the cloud—better chips mean faster inference and lower power costs.
Google just gave Gemini the ability to see and interact with your screen—similar to Claude's computer use feature. This opens up automation possibilities: take screenshots, click buttons, fill forms. It's in Gemini 3.5 Flash, so it's fast enough for real workflows. HN discussion.
OpenAI published new research on agentic AI—systems that can plan, execute, and iterate without constant human input. The paper shows real productivity gains in customer support, coding, and research workflows. If you're building with agents, this is worth reading for framing what's actually possible.
Anthropic accused Alibaba of extracting Claude's capabilities without authorization—adding fuel to ongoing tensions between U.S. AI labs and foreign competitors. HN thread has more context on the broader export control landscape.
The NSA apparently had access to Anthropic's security research tools, but that relationship soured. This signals how fragile government-industry AI partnerships can be. HN discussion.
Tired of writing custom code for every API? RubyLLM gives you a unified interface to OpenAI, Anthropic, Google, Mistral, and others. Write once, swap providers with a config change. If you're building Rails apps with AI, this could save you hours. HN thread.
Haystack is a solid open-source framework for building retrieval-augmented generation (RAG) and agentic systems. If you're past the prototype stage and need something production-ready, it's worth evaluating alongside LangChain and LlamaIndex.
This piece argues that proprietary AI from OpenAI, Anthropic, and Google creates a two-tier world—access for the wealthy, exclusion for everyone else. Open models like Llama and Mistral are the only path to democratization. Worth reading if you care about AI equity. HN discussion.
OpenAI, DeepMind, and Anthropic are all building philosophy teams. Not because they've gone soft—they're grappling with real questions about agency, alignment, and what it means when AI systems can make decisions. It's a signal that the industry knows it's past the point of pure engineering.
A practical breakdown of how code formatting affects token usage—and thus cost—when feeding code to LLMs. Simple stuff like variable naming and spacing can change your bill. Useful if you're optimizing prompts for production.