Feature

Evals Are Not Unit Tests — Ido Pesok, Vercel v0

Evals Are Not Unit Tests — Ido Pesok, Vercel v0

Ido Pesok from Vercel explains why LLM-based applications often fail in production despite successful demos, and presents a systematic framework for building reliable AI systems using application-layer evaluations ("evals").

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

Tanmai Gopal, CEO of Hasura, discusses a Gen AI-driven automation strategy that addresses the "Automation Paradox" by empowering non-technical users. This approach uses a domain-specific language (DSL) to translate natural language into deterministic, executable plans, aiming to drive over $100M in annual impact for a healthcare partner.

From the Dot-Com Crash to the AI Era: How Builders Survive Waves of Disruption

From the Dot-Com Crash to the AI Era: How Builders Survive Waves of Disruption

Leaders from VMware and Cisco share hard-won lessons on navigating disruption, fostering a founder's mindset at scale, and rebuilding for the AI era. They cover GTM strategies for new products, the importance of storytelling as strategy, and why AI represents a 100x shift for infrastructure.

The Hidden Bottlenecks Slowing Down AI Agents

The Hidden Bottlenecks Slowing Down AI Agents

Paul van der Boor and Bruce Martens from Prosus discuss the real bottlenecks in AI agent development, arguing that the primary challenges are not tools, but rather evaluation, data quality, and feedback loops. They detail their 'buy-first' philosophy, the practical reasons they often build in-house, and how new coding agents like Devon and Cursor are changing their development workflows.

AI Coding Agents Change Software Development Forever

AI Coding Agents Change Software Development Forever

A discussion on the promise and limitations of coding agents, covering key challenges like verification and debugging, and exploring how they can support developers through improved abstraction, collaboration, and handling long-term tasks.

DeepMind's Secret AI Project That Will Change Everything [EXCLUSIVE]

DeepMind's Secret AI Project That Will Change Everything [EXCLUSIVE]

Google DeepMind's Genie 3 is a new generative interactive environment that creates photorealistic, controllable 3D worlds from text prompts in real-time. This summary explores its architecture, the concept of emergent consistency, and its primary application as a powerful simulator for training embodied AI agents.