Optimizing Query Evaluations Using Reinforcement Learning for Web Search

Corby Rosset; Damien Jose; Gargi Ghosh; Bhaskar Mitra; Saurabh Tiwary

Optimizing Query Evaluations Using Reinforcement Learning for Web Search

Corby Rosset ,
Damien Jose ,
Gargi Ghosh ,
Bhaskar Mitra ,
Saurabh Tiwary

Proceedings of the 41st international ACM SIGIR conference on Research & development in information retrieval | May 2018

Published by ACM

Download BibTex

In web search, typically a candidate generation step selects a small set of documents—from collections containing as many as billions of web pages—that are subsequently ranked and pruned before being presented to the user. In Bing, the candidate generation involves scanning the index using statically designed match plans that prescribe sequences of different match criteria and stopping conditions. In this work, we pose match planning as a reinforcement learning task and observe up to 20% reduction in index blocks accessed, with small or no degradation in the quality of the candidate sets.