NeurIPS 2020

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Meta Review

This paper proposes an interesting way to use hypothesis testing to incorporate expert knowledge into offline RL. The proposed approach is exciting and good enough to be published at NeurIPS. The experimental results are interesting as well. However, the authors should address the concerns about the presentation and theoretical results raised by Reviewer 1 in the camera-ready version of the paper. At the very least, they should discuss these concerns as a limitation of the approach in the paper's conclusion.