Feature

This Week in AI: GPT-5 Ships, 4o Pulled Back, Grok Imagine Goes Social

Partners Olivia and Justine Moore discuss the latest in consumer AI, including Grok's uniquely social and fast image generation, Google's interactive world model Genie 3, the user backlash to GPT-5's personality changes, ElevenLabs' licensed AI music model, and the emerging fragmentation of "vibecoding" platforms for technical and non-technical users.

12-factor Agents - Patterns of reliable LLM applications // Dexter Horthy

Drawing from conversations with top AI builders, Dex argues that production-grade AI agents are not magical loops but well-architected software. This talk introduces "12-Factor Agents," a methodology centered on "Context Engineering" to build reliable, high-performance LLM-powered applications by applying rigorous software engineering principles.

EDD: The Science of Improving AI Agents // Shahul Elavakkattil Shereef // Agents in Production 2025

This talk introduces Eval-Driven Development (EDD) as a scientific alternative to 'vibe-based' iteration for improving AI agents. It covers quantitative evaluation (choosing strong end-to-end metrics, aligning LLM judges) and qualitative evaluation (using error and attribution analysis to debug failures), providing a structured framework for consistent agent improvement.
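As a rough illustration of what "aligning LLM judges" can mean in practice, one common check is to compare a judge's verdicts against human labels on the same examples and measure agreement beyond chance. The sketch below uses Cohen's kappa for that purpose; the metric choice, the function name, and the sample labels are assumptions made for illustration and are not taken from the talk.

```python
from collections import Counter


def cohens_kappa(human, judge):
    """Chance-corrected agreement between two equal-length label lists."""
    if len(human) != len(judge) or not human:
        raise ValueError("label lists must be non-empty and the same length")
    n = len(human)
    # Fraction of examples where the judge matches the human label.
    observed = sum(h == j for h, j in zip(human, judge)) / n
    # Agreement expected if both raters labelled independently at their own rates.
    human_counts = Counter(human)
    judge_counts = Counter(judge)
    labels = set(human) | set(judge)
    expected = sum(human_counts[l] * judge_counts[l] for l in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical label throughout
    return (observed - expected) / (1 - expected)


# Hypothetical labels for eight agent runs (illustrative data only).
human_labels = ["pass", "pass", "fail", "pass", "fail", "fail", "pass", "fail"]
judge_labels = ["pass", "pass", "fail", "fail", "fail", "fail", "pass", "pass"]

if __name__ == "__main__":
    print(f"kappa = {cohens_kappa(human_labels, judge_labels):.2f}")
```

A low score would suggest revising the judge prompt or rubric before trusting the judge as an end-to-end metric.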

How Grounded Synthetic Data is Saving the Publishing Industry // Robert Caulk

Robert from Emergent Methods discusses how grounded synthetic news data can solve the publisher revenue crisis in the AI era. He details the process of 'Context Engineering' news into token-optimized, objective data for high-stakes AI agent tasks, covering their open-source models for entity extraction and bias mitigation, and the on-premise infrastructure that protects publisher content.

When Agents Hire Their Own Team: Inside Hypermode’s Concierge // Ryan Fox-Tyler

Ryan Fox-Tyler from Hypermode explains their philosophy of empowering AI agents to design and deploy other agents. He introduces Concierge, an agent that builds other agents, and details the underlying actor-based runtime built for scalability, fault tolerance, and efficient, event-driven execution of thousands of parallel agent instances.
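To make the actor idea concrete, here is a minimal sketch of an actor-style, event-driven loop using Python's asyncio. It is not Hypermode's runtime: the class `AgentActor`, the function `run_demo`, and the message format are all hypothetical, chosen only to show the general pattern of one mailbox per agent, with idle agents costing nothing until a message arrives and a failing message not taking the agent down.

```python
import asyncio


class AgentActor:
    """Hypothetical actor: private state plus a mailbox, processed one message at a time."""

    def __init__(self, name):
        self.name = name
        self.mailbox = asyncio.Queue()
        self.handled = []

    def handle(self, message):
        if not isinstance(message, str):
            raise TypeError("expected a text message")
        return f"{self.name} handled {message}"

    async def run(self):
        while True:
            # The actor sleeps here until an event arrives (event-driven execution).
            message = await self.mailbox.get()
            if message is None:  # sentinel used in this sketch to stop the actor
                break
            try:
                self.handled.append(self.handle(message))
            except Exception as exc:
                # Record the failure and keep serving later messages.
                self.handled.append(f"error: {exc}")


async def run_demo(num_agents=1000):
    actors = [AgentActor(f"agent-{i}") for i in range(num_agents)]
    tasks = [asyncio.create_task(actor.run()) for actor in actors]
    for actor in actors:
        await actor.mailbox.put("ping")
        await actor.mailbox.put(None)
    await asyncio.gather(*tasks)
    return actors


if __name__ == "__main__":
    finished = asyncio.run(run_demo())
    print(len(finished), finished[0].handled)
```

A production runtime would add supervision, persistence, and distribution across machines, none of which this sketch attempts.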

Making Your Data Agent-Ready with EnrichMCP // Simba Khadder // Agents in Production 2025

Simba Khadder explains that the primary bottleneck for LLM agents is not intelligence, but access to structured data. He introduces EnrichMCP, an open-source framework that creates a semantic layer over data models, enabling agents to discover, reason about, and query data sources like SQL databases effectively, moving beyond the limitations of RAG and direct API conversions.