MISC: A data set of information-seeking conversations

Proceedings of the 1st International Workshop on Conversational Approaches to Information Retrieval

Conversational interfaces to information retrieval systems, via software agents such as Siri or Cortana, are of commercial and research interest. To build or evaluate these software interfaces, it is natural to consider how people act in the same role, but there is little public, fine-grained data on interactions with intermediaries for web tasks.

We introduce the Microsoft Information-Seeking Conversation data (MISC), a set of recordings of information-seeking conversations between human “seekers” and “intermediaries”. MISC includes audio and video signals; transcripts of conversation; affectual and physiological signals; recordings of search and other computer use; and post-task surveys on emotion, success, and effort. We hope that these recordings will support work on conversational retrieval interfaces in both engineering (how can we make “natural” systems?) and evaluation (what does a “good” conversation look like?).