Llm

The Latency Goldilocks Zone Explained

The Latency Goldilocks Zone Explained

Rafael Borger and Daniel Wolbert from iFood discuss the engineering and product strategy behind ILO-Agent, their conversational AI for 200 million users. They cover hyper-personalized recommendation systems, the "Latency Goldilocks Zone" where AI responses can be too fast for users to trust, and the architectural challenges of building multi-channel agents for text and voice.

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

A deep dive into Despegar's GenAI travel agent, Sofia. Explore its multi-agent architecture, the custom orchestration layer 'Chappi' built before MCP was a standard, and the strategy of decentralizing agent development across company squads to cover the entire five-phase travel arc.

Personalization in the Era of LLMs - Shivam Verma, Spotify

Personalization in the Era of LLMs - Shivam Verma, Spotify

Spotify is personalizing open-weight LLMs without full fine-tuning by combining three key components: foundational user embeddings from streaming history, 'Semantic IDs' that tokenize its 100M+ item catalog, and a 'soft tokenization' layer that projects a user's embedding directly into the LLM's context. This allows the model to autoregressively generate the next song or podcast as the next token in a sequence.

How to Build AI-First Organizations — with Jacob Miller and Jeremy Mumford

How to Build AI-First Organizations — with Jacob Miller and Jeremy Mumford

Jacob Miller and Jeremy Mumford, authors of 'Architected Intelligence', discuss the enduring principles for building successful AI products and organizations. They cover why velocity is the only durable moat, why hallucinations are a data curation issue, and the proper progression from skills to workflows to agents, emphasizing a shift from focusing on models to focusing on process and speed.

Tech Truth: Agile Evolution & the Future of SW Engineering • Martin Fowler & Kent Beck • GOTO 2025

Tech Truth: Agile Evolution & the Future of SW Engineering • Martin Fowler & Kent Beck • GOTO 2025

Martin Fowler and Kent Beck, two authors of the Agile Manifesto, reunite for an unscripted conversation about their 30-year friendship and the evolution of software engineering. They delve into their practical, daily use of AI (the "Genie"), reflect on the legacy of Extreme Programming and the Agile Manifesto, and offer timeless advice to the next generation of developers.

Fighting AI with AI — Lawrence Jones, Incident

Fighting AI with AI — Lawrence Jones, Incident

Lawrence Jones from Incident.io explains why their team needed AI to debug their complex AI SRE product. He details three powerful patterns: a CLI for agents to manage large evaluation files, serializing debug UIs into downloadable file systems for agent-based tracing, and multi-agent pipelines for fleet-scale failure analysis.