Conversational ai

The Latency Goldilocks Zone Explained

The Latency Goldilocks Zone Explained

Rafael Borger and Daniel Wolbert from iFood discuss the engineering and product strategy behind ILO-Agent, their conversational AI for 200 million users. They cover hyper-personalized recommendation systems, the "Latency Goldilocks Zone" where AI responses can be too fast for users to trust, and the architectural challenges of building multi-channel agents for text and voice.

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

Building MCP Before MCP Existed: Inside Despegar's Sofia Agent

A deep dive into Despegar's GenAI travel agent, Sofia. Explore its multi-agent architecture, the custom orchestration layer 'Chappi' built before MCP was a standard, and the strategy of decentralizing agent development across company squads to cover the entire five-phase travel arc.

Physical AI Forum | Builders Reveal the New Moat & Playbook | Creator & Founder's Cut | Mar 2026 |4K

Physical AI Forum | Builders Reveal the New Moat & Playbook | Creator & Founder's Cut | Mar 2026 |4K

In a live panel at the Physical AI Builders Forum, founders and operators in computer vision, robotics, and multimodal AI share their 2026 playbooks. The discussion covers the architectural differences between physical and generative AI, the strategic shift from frame AI to scene AI for enterprise value, and the critical skills needed to build and scale a modern AI business.

Distant conversational speech recognition: Challenges and Opportunities

Distant conversational speech recognition: Challenges and Opportunities

Dr. Samuele Cornell from Carnegie Mellon University discusses the persistent challenges in distant automatic speech recognition (DASR) for spontaneous, multi-party conversations. He explains why state-of-the-art systems falter in real-world scenarios and presents recent advancements through three key efforts: (1) insights from the CHiME-7/8 DASR challenges, which benchmark robust meeting transcription; (2) progress towards unified end-to-end models that jointly handle diarization and recognition; and (3) novel techniques for generating realistic, large-scale training data using a combination of large language models and multi-speaker text-to-speech systems.

How LiveKit Became An AI Company By Accident

How LiveKit Became An AI Company By Accident

Russ d'Sa, CEO of LiveKit, recounts the company's unexpected journey from a pandemic-era open-source WebRTC project to becoming a crucial infrastructure provider for AI voice interfaces, most notably for OpenAI's ChatGPT. He details the serendipitous moments that led to this pivot and shares his vision for LiveKit as the nervous system for a multimodal AI future.

Delphi’s Dara Ladjevardian: How AI Digital Minds Can Scale Human Connection

Delphi’s Dara Ladjevardian: How AI Digital Minds Can Scale Human Connection

Dara Ladjevardian, founder of Delphi, discusses creating "digital minds" using an adaptive temporal knowledge graph. Inspired by Ray Kurzweil's theory of the mind, this technology aims to scale human thought and expertise, transforming content consumption from static feeds into interactive, conversational media while emphasizing the premium value of authentic human connection.