Offline Reinforcement Learning

Microsoft Research blog