AI Agents

Aug 21, 2025

Genie 3: An infinite world model with Shlomi Fruchter and Jack Parker-Holder

Professor Hannah Fry speaks with Jack Parker-Holder and Shlomi Fruchter about Genie 3, a general-purpose world model that generates diverse, interactive environments from prompts. The discussion covers its auto-regressive nature, which enables the creation of consistent, explorable worlds, its key differences from video models like Veo, and its foundational role in training AI agents and advancing toward AGI.

Aug 21, 2025

How Intercom rose from the ashes by betting everything on AI | Eoghan McCabe (founder and CEO)

Eoghan McCabe, founder and CEO of Intercom, shares the unfiltered story of transforming a multi-billion dollar, stagnating SaaS business into a rapidly growing, AI-first company. He details the necessity of 'founder mode,' a radical cultural overhaul, and why having nothing to lose is the ultimate advantage in the AI era.

Aug 19, 2025

How Reinforcement Learning can Improve your Agent

This talk addresses the unreliability of current AI agents, arguing that prompting is insufficient. It posits that Reinforcement Learning (RL) is the most promising solution, delving into the mechanisms of RLHF and RLVR. The core challenge identified is 'reward hacking', and the discussion explores future directions to overcome it, such as RLAIF, data augmentation, and the development of interactive, online models that can learn in real-time.

Aug 13, 2025

12-factor Agents - Patterns of reliable LLM applications // Dexter Horthy

Drawing from conversations with top AI builders, Dex argues that production-grade AI agents are not magical loops but well-architected software. This talk introduces "12-Factor Agents," a methodology centered on "Context Engineering" to build reliable, high-performance LLM-powered applications by applying rigorous software engineering principles.

Aug 13, 2025

EDD: The Science of Improving AI Agents // Shahul Elavakkattil Shereef // Agents in Production 2025

This talk introduces Eval-Driven Development (EDD) as a scientific alternative to 'vibe-based' iteration for improving AI agents. It covers quantitative evaluation (choosing strong end-to-end metrics, aligning LLM judges) and qualitative evaluation (using error and attribution analysis to debug failures), providing a structured framework for consistent agent improvement.

Aug 13, 2025

How Grounded Synthetic Data is Saving the Publishing Industry // Robert Caulk

Robert from Emergent Methods discusses how grounded synthetic news data can solve the publisher revenue crisis in the AI era. He details the process of 'Context Engineering' news into token-optimized, objective data for high-stakes AI agent tasks, covering their open-source models for entity extraction and bias mitigation, and the on-premise infrastructure that protects publisher content.

← Previous Next →