Posts

How Google DeepMind Runs Agents at Scale — KP Sawhney & Ian Ballantyne, Google DeepMind

How Google DeepMind Runs Agents at Scale — KP Sawhney & Ian Ballantyne, Google DeepMind

KP Sawhney from Google DeepMind discusses the internal strategies for scaling agentic AI, including managing token-hungry workflows, curating a 'Darwinian' skills library, and evolving the Deep Research pipeline from large context blobs to a collaborative file system.

⚡️ Google's Open AI Strategy — Omar Sanseviero, Google DeepMind

⚡️ Google's Open AI Strategy — Omar Sanseviero, Google DeepMind

An in-depth look at Gemma 4's novel transformer architecture with per-layer embeddings, enabling efficient parameter offloading for on-device inference. The discussion also covers its native multimodality, the state of fine-tuning, text-based diffusion models, and the growing intersection of research and engineering.

⚡️ Why you should build Science Fiction — Sunil Pai, Cloudflare

⚡️ Why you should build Science Fiction — Sunil Pai, Cloudflare

Sunil Pai from Cloudflare discusses building efficient AI agent architectures using Durable Objects and Dynamic Workers as an alternative to platforms like Anthropic's. He explores the search for a standardized 'React-like' framework for agents, the culture of forking in open source, and encourages developers to pursue original, 'sci-fi' style projects.

Scaling the Next Paradigm of Heterogeneous Intelligence — Adrian Bertagnoli, Callosum

Scaling the Next Paradigm of Heterogeneous Intelligence — Adrian Bertagnoli, Callosum

Adrian Bertagnoli from Callosum argues that the era of scaling monolithic models on homogeneous GPU clusters is ending. He introduces "heterogeneous intelligence," a new paradigm where model architectures, chip types, and workflows are optimized together. By routing subtasks to the most efficient model and hardware, this approach achieves significant performance gains, as demonstrated by two key results: a 7x cost reduction in recursive reasoning tasks using Cerebras, and state-of-the-art performance on the Video Web Arena benchmark, outperforming leading GPT and Gemini models at a fraction of the cost and time.

Five AI Risks That Can Get You Fired—And How to Avoid Them

Five AI Risks That Can Get You Fired—And How to Avoid Them

Martin Keen explains five real-world AI risks that can lead to job loss: shadow AI, data leakage, hallucinations, prompt injection, and unauthorized AI agents. He emphasizes the critical need for strong AI governance to ensure safe and productive AI adoption in the workplace.

Your Agent Is an Infinite Canvas — RL Nabors, Dressed for Space

Your Agent Is an Infinite Canvas — RL Nabors, Dressed for Space

RL Nabors makes a case against chat as the permanent UI for agents, arguing it's the "terminal" to the future "iPhone" of rich, interactive experiences. She demonstrates how to build these experiences using web technologies, introducing MCP Apps for in-agent UIs and WebMCP for making existing websites agent-callable, positioning the web platform as the ultimate "infinite canvas".