CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Shinji Watanabe; Michael Mandel; Jon Barker; Emmanuel Vincent; Ashish Arora; Xuankai Chang; Sanjeev Khudanpur; Vimal Manohar; Daniel Povey; Desh Raj; David Snyder; Aswin Shanmugam Subramanian; Jan Trmal; Bar Ben Yair; Christoph Boeddeker; Zhaoheng Ni; Yusuke Fujita; Shota Horiguchi; Naoyuki Kanda; Takuya Yoshioka; Neville Ryant

CHiME-6 Challenge: Tackling Multispeaker Speech Recognition for Unsegmented Recordings

Shinji Watanabe ,
Michael Mandel ,
Jon Barker ,
Emmanuel Vincent ,
Ashish Arora ,
Xuankai Chang ,
Sanjeev Khudanpur ,
Vimal Manohar ,
Daniel Povey ,
Desh Raj ,
David Snyder ,
Aswin Shanmugam Subramanian ,
Jan Trmal ,
Bar Ben Yair ,
Christoph Boeddeker ,
Zhaoheng Ni ,
Yusuke Fujita ,
Shota Horiguchi ,
Naoyuki Kanda ,
Takuya Yoshioka ,
Neville Ryant

arXiv:2004.09249 | April 2020

Download BibTex

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the 6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge revisits the previous CHiME-5 challenge and further considers the problem of distant multi-microphone conversational speech diarization and recognition in everyday home environments. Speech material is the same as the previous CHiME-5 recordings except for accurate array synchronization. The material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech. This paper provides a baseline description of the CHiME-6 challenge for both segmented multispeaker speech recognition (Track 1) and unsegmented multispeaker speech recognition (Track 2). Of note, Track 2 is the first challenge activity in the community to tackle an unsegmented multispeaker speech recognition scenario with a complete set of reproducible open source baselines providing speech enhancement, speaker diarization, and speech recognition modules.