NO | Title, Author(s) (Publication Title, Volume Issue, Page, Issue Date) |
---|---|
1 | OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation. Lee, Jongmin; Jeon, Wonseok; Lee, Byung-Jun; Pineau, Joelle; Kim, Kee-Eung (International Conference on Machine Learning (ICML), JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2021-07) |
2 | Monte-Carlo Planning and Learning with Language Action Value Estimates. Jang, Youngsoo; Kim, Kee-Eung; Seo, Seokin; Lee, Jongmin (The Ninth International Conference on Learning Representations (ICLR), International Conference on Learning Representations, 2021-05) |
3 | Representation Balancing Offline Model-based Reinforcement Learning. Lee, Byung-Jun; Kim, Kee-Eung; Lee, Jongmin (The Ninth International Conference on Learning Representations (ICLR), International Conference on Learning Representations, 2021-05) |