Speech recognition

Building Voice Agents Just Got Easier

Building Voice Agents Just Got Easier

Anoop Dawar from Deepgram discusses the evolution of voice AI, from basic transcription to sophisticated, real-time voice agents. He covers the key technical challenges in production, such as latency and interruption handling, and introduces Deepgram's Flux system. The talk concludes with a look at the future of speech-to-speech models that can understand emotional nuance, moving closer to passing the audio Turing Test.

How DeepL Built a Translation Powerhouse with AI with CEO Jarek Kutylowski

How DeepL Built a Translation Powerhouse with AI with CEO Jarek Kutylowski

Jarek Kutylowski, CEO of DeepL, discusses the company's technical strategy for competing with large language models in the translation space. He covers their focus on specialized model architectures, the critical role of curated data, the engineering challenges of building custom GPU data centers and large-scale inference systems, and the future of AI-driven translation in enterprise workflows.