Distant asr

Distant conversational speech recognition: Challenges and Opportunities

Distant conversational speech recognition: Challenges and Opportunities

Dr. Samuele Cornell from Carnegie Mellon University discusses the persistent challenges in distant automatic speech recognition (DASR) for spontaneous, multi-party conversations. He explains why state-of-the-art systems falter in real-world scenarios and presents recent advancements through three key efforts: (1) insights from the CHiME-7/8 DASR challenges, which benchmark robust meeting transcription; (2) progress towards unified end-to-end models that jointly handle diarization and recognition; and (3) novel techniques for generating realistic, large-scale training data using a combination of large language models and multi-speaker text-to-speech systems.