Talk
in
Workshop: Offline Reinforcement Learning
Advances in (High-Confidence) Off-Policy Evaluation
Philip Thomas
Abstract: