Towards all-purpose full-sphere binaural localization
- Shoken Kaneko ,
- Hannes Gamper
IEEE Eusipco |
Sound source localization from binaural signals has important applications ranging from machine listening to psychoacoustics, yet challenges including generalization and robustness to noise and reverberation remain. Here we propose a binaural localizer (BL) framework that produces a full-sphere spatial activity map for every audio input frame. The framework enables individual-agnostic training of a convolutional neural network using head-related impulse response (HRIR) sets with arbitrary measurement grids and is shown to perform well on unseen HRIRs and binaural recordings. Unlike BLs trained with the HRIRs of a specific known subject or dummy head, the proposed individual-agnostic BL is intended to perform robustly without any a priori knowledge about the process creating the binaural signals. Localization tests with binaural speech renderings and recordings show that the proposed BL performs well in the presence of noise and reverberation and compares favorably to individual-specific BLs. Furthermore, preliminary results indicate that the proposed BL is applicable to the localization of multiple simultaneous and moving sources.