Showing results 1 to 1 of 1
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation Lee, Jongmin; Jeon, Wonseok; Lee, Byung-Jun; Pineau, Joelle; Kim, Kee-Eung, International Conference on Machine Learning (ICML), JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2021-07 |
Discover