Optimizing Query Evaluations Using Reinforcement Learning for Web Search

  • Corby Rosset ,
  • Damien Jose ,
  • Gargi Ghosh ,
  • ,
  • Saurabh Tiwary

Proceedings of the 41st international ACM SIGIR conference on Research & development in information retrieval |

Published by ACM

In web search, typically a candidate generation step selects a small set of documents—from collections containing as many as billions of web pages—that are subsequently ranked and pruned before being presented to the user. In Bing, the candidate generation involves scanning the index using statically designed match plans that prescribe sequences of different match criteria and stopping conditions. In this work, we pose match planning as a reinforcement learning task and observe up to 20% reduction in index blocks accessed, with small or no degradation in the quality of the candidate sets.