AlberDICE: Addressing Out-Of-Distribution Joint Actions in Offline Multi-Agent RL via Alternating Stationary Distribution Correction Estimation Matsunaga, Daiki Eddy; Lee, Jongmin; Yoon, Jaeseok; Leonardos, Stefanos; Abbeel, Pieter; Kim, Kee-Eung, The 37th Conference on Neural Information Processing Systems (NeurIPS 2023), Neural Information Processing Systems Foundation, 2023-12-13 |
Controllability-Aware Unsupervised Skill Discovery Park, Seohong; Lee, Kimin; Lee, Youngwoon; Abbeel, Pieter, 40th International Conference on Machine Learning, ICML 2023, International Machine Learning Society (IMLS), 2023-07-26 |
HARP: Autoregressive Latent Video Prediction with High-Fidelity Image Generator Seo, Younggyo; Lee, Kimin; Liu, Fangchen; James, Stephen; Abbeel, Pieter, 29th IEEE International Conference on Image Processing, ICIP 2022, pp.3943 - 3947, IEEE International Conference on Image Processing, 2022-10 |
Masked World Models for Visual Control Seo, Younggyo; Hafner, Danijar; Liu, Hao; Liu, Fangchen; James, Stephen; Lee, Kimin; Abbeel, Pieter, 6th Conference on Robot Learning, CoRL 2022, pp.1332 - 1344, ML Research Press, 2022-12-17 |
Multi-View Masked World Models for Visual Robotic Manipulation Seo, Younggyo; Kim, Junsu; James, Stephen; Lee, Kimin; Shin, Jinwoo; Abbeel, Pieter, 40th International Conference on Machine Learning, ICML 2023, International Machine Learning Society (IMLS), 2023-07-25 |
Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble Lee, Seunghyun; Seo, Younggyo; Lee, Kimin; Abbeel, Pieter; Shin, Jinwoo, 5th Annual Conference on Robot Learning (CoRL 2021), CoRL Conference Chair, 2021-11 |
Preference Transformer: Modeling Human Preferences using Transformers for RL Kim, Changyeon; Park, Jongjin; Shin, Jinwoo; Lee, Honglak; Abbeel, Pieter; Lee, Kimin, Eleventh International Conference on Learning Representations, ICLR 2023, International Conference on Learning Representations, 2023-05-01 |
Programmatic Modeling and Generation of Real-time Strategic Soccer Environments for Reinforcement Learning Azad, Abdus Salam; Kim, Edward; Wu, Qiancheng; Lee, Kimin; Stoica, Ion; Abbeel, Pieter; Seshia, Sanjit A, 36th AAAI Conference on Artificial Intelligence, AAAI 2022, Association for the Advancement of Artificial Intelligence, 2022-02-26 |
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models Fan, Ying; Watkins, Olivia; Du, Yuqing; Liu, Hao; Ryu, Moonkyung; Boutilier, Craig; Abbeel, Pieter; et al, Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS) 2023, Neural Information Processing Systems Foundation, 2023-12-12 |
Reinforcement Learning with Action-Free Pre-Training from Videos Seo, Younggyo; Lee, Kimin; James, Stephen; Abbeel, Pieter, 39th International Conference on Machine Learning (ICML), pp.19561 - 19579, JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2022-07-20 |
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Liang, Xinran; Shu, Katherine; Lee, Kimin; Abbeel, Pieter, 10th International Conference on Learning Representations, ICLR 2022, International Conference on Learning Representations, 2022-04-27 |
State Entropy Maximization with Random Encoders for Efficient Exploration Seo, Younggyo; Chen, Lili; Shin, Jinwoo; Lee, Honglak; Abbeel, Pieter; Lee, Kimin, 38th International Conference on Machine Learning, ICML 2021, JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2021-07 |
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning Park, Jongjin; Seo, Younggyo; Shin, Jinwoo; Lee, Honglak; Abbeel, Pieter; Lee, Kimin, 10th International Conference on Learning Representations, ICLR 2022, International Conference on Learning Representations, 2022-04-26 |
Towards More Generalizable One-shot Visual Imitation Learning Mandi, Zhao; Liu, Fangchen; Lee, Kimin; Abbeel, Pieter, 39th IEEE International Conference on Robotics and Automation, ICRA 2022, pp.2434 - 2444, Institute of Electrical and Electronics Engineers Inc., 2022-05 |
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning Seo, Younggyo; Lee, Kimin; Clavera, Ignasi; Kurutach, Thanard; Shin, Jinwoo; Abbeel, Pieter, 34th Conference on Neural Information Processing Systems (NeurIPS) 2020, Neural Information Processing Systems, 2020-12-07 |