SilentVoice: Unnoticeable Voice Input by Ingressive Speech

Masaaki FUKUMOTO

SilentVoice: Unnoticeable Voice Input by Ingressive Speech

Masaaki FUKUMOTO

UIST 2018 Conference Proceedings, pp.237-246 | October 2018

Published by ACM – Association for Computing Machinery

Download BibTex

SilentVoice is a new voice input interface device that penetrates the speech-based natural user interface (NUI) in daily life. The proposed “ingressive speech” method enables placement of a microphone very close to the front of the mouth without suffering from pop-noise, capturing very soft speech sounds with a good S/N ratio. It realizes ultra-small (less than 39dB(A)) voice leakage, allowing us to use voice input without annoying surrounding people in public and mobile situations as well as offices and homes. By measuring airflow direction, SilentVoice can easily be separated from normal utterances with 98.8% accuracy; no activation words are needed. It can be used for voice-activated systems with a specially trained voice recognizer; evaluation results yield word error rates (WERs) of 1.8% (speaker-dependent condition), and 7.0% (speaker-independent condition) with a limited dictionary of 85 command sentences. A whisper-like natural voice can also be used for real-time voice communication.

[Awards]
– Best Talk
– Honorable Mention

[Demo Video]
– full version (opens in new tab)
– short version (opens in new tab)