Blind reverberation time estimation using a convolutional neural network
Proc. International Workshop on Acoustic Signal Enhancement (IWAENC) |
Published by IEEE
Nominated for best paper award
The reverberation time of an acoustic environment is a useful parameter for applications including source localisation, speech recognition and mixed reality. However, estimating the reverberation time blindly and on the fly remains a challenge. Here we propose formulating the estimation as a regression problem and using a convolutional neural network (CNN) to estimate the reverberation time directly from a four second long single-channel recording of reverberant speech in noise. Evaluation on the ACE Challenge data corpus suggests that the proposed method is computationally efficient and outperforms state-of-the-art methods.
![CNN block diagram](https://www.microsoft.com/en-us/research/uploads/prod/2018/09/CNN_diagram-1024x187.png)
Block diagram of convolutional neural network architecture
![T60 confusion matrix](https://www.microsoft.com/en-us/research/uploads/prod/2018/09/confusion_matrix-1024x495.png)
Confusion matrices of ground truth and estimated T60 for training set (left) and evaluation set (right). Results are binned by T60 with a resolution of 0.1 s.