Reinforcement Learning for Turn-Taking Management in Incremental Spoken Dialogue Systems

Hatim Khouzaimi; Romain Laroche; Fabrice Lefevre

Reinforcement Learning for Turn-Taking Management in Incremental Spoken Dialogue Systems

Hatim Khouzaimi ,
Romain Laroche ,
Fabrice Lefevre

Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI) | July 2016

Download BibTex

In this article, reinforcement learning is used to learn an optimal turn-taking strategy for vocal human-machine dialogue. The Orange Labs’ Ma-jordomo dialogue system, which allows the users to have conversations within a smart home, has been upgraded to an incremental version. First, a user simulator is built in order to generate a dialogue corpus which thereafter is used to optimise the turn-taking strategy from delayed rewards with the Fitted-Q reinforcement learning algorithm. Real users test and evaluate the new learnt strategy, versus a non-incremental and a handcrafted incremen-tal strategies. The data-driven strategy is shown to significantly improve the task completion ratio and to be preferred by the users according to subjective metrics.