Ahn, Kwangjun; Cheng, Xiang; Song, Minhak; Yun, Chulhee; Jadbabaie, Ali; Sra, Suvrit, 12th International Conference on Learning Representations, ICLR 2024, 2024-05-07
최윤선; 김기응; 반성현, Korea Software Congress 2023 (KSC2023), 2023-12-20
Kim, Sung-Yub; Yang, Eunho, 37th Conference on Neural Information Processing Systems, NeurIPS 2023, 2023-12-15
Kang, Minki; Lee, Seanie; Baek, Jinheon; Kawaguchi, Kenji; Hwang, Sung Ju, 37th Conference on Neural Information Processing Systems, NeurIPS 2023, 2023-12-14
Seo, Seokin; Hwang, Hyeongjoo; Yang, Hongseok; Kim, Kee-Eung, The 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023-12-13
Matsunaga, Daiki Eddy; Lee, Jongmin; Yoon, Jaeseok; Leonardos, Stefanos; Abbeel, Pieter; Kim, Kee-Eung, The 37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023-12-13
Sohn, Kihyuk; Ruiz, Nataniel; Lee, Kimin; Chin, Daniel Castro; Blok, Irina; Chang, Huiwen; Barber, Jarred; Jiang, Lu; Entis, Glenn; Li, Yuanzhen; Hao, Yuan; Essa, Irfan; Rubinstein, Michael; Krishnan, Dilip, 37th Conference on Neural Information Processing Systems (NeurIPS), 2023-12-13
Yun, Jihun; Yang, Eunho, 37th Conference on Neural Information Processing Systems, NeurIPS 2023, 2023-12-13
Lee, Hojoon; Cho, Hanseul; Kim, Hyunseung; Gwak, Daehoon; Kim, Joonkee; Choo, Jaegul; Yun, Se-Young; Yun, Chulhee, 37th Annual Conference on Neural Information Processing Systems, 2023-12-13
Song, Minhak; Yun, Chulhee, 37th Annual Conference on Neural Information Processing Systems, 2023-12-13
Linear attention is (maybe) all you need (to understand Transformer optimization) Ahn, Kwangjun; Cheng, Xiang; Song, Minhak; Yun, Chulhee; Jadbabaie, Ali; Sra, Suvrit, 12th International Conference on Learning Representations, ICLR 2024, International Conference on Learning Representations (ICLR), 2024-05-07 |
Trajectory Alignment: Understanding the Edge of Stability Phenomenon via Bifurcation Theory Song, Minhak; Yun, Chulhee, 37th Annual Conference on Neural Information Processing Systems, Neural Information Processing Systems, 2023-12-13 |
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning Lee, Hojoon; Cho, Hanseul; Kim, Hyunseung; Gwak, Daehoon; Kim, Joonkee; Choo, Jaegul; Yun, Se-Young; et al, 37th Annual Conference on Neural Information Processing Systems, Neural Information Processing Systems, 2023-12-13 |
Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima Si, Dongkuk; Yun, Chulhee, 37th Annual Conference on Neural Information Processing Systems, Neural Information Processing Systems, 2023-12-12 |
Fair Streaming Principal Component Analysis: Statistical and Algorithmic Viewpoint Lee, Junghyun; Cho, Hanseul; Yun, Se-Young; Yun, Chulhee, 37th Annual Conference on Neural Information Processing Systems, Neural Information Processing Systems, 2023-12-12 |
Dynamic Control for On-Demand Interference-Managed WLAN Infrastructures Kim, Seokhyun; Lee, Kimin; Yeonkeun Kim; Jinwoo Shin; Seungwon Shin; Song Chong, IEEE/ACM Transactions on Networking, IEEE Communications Society, 2020-02 |
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback Wang, Xiaofei; Lee, Kimin; Kourosh Hakhamaneshi; Pieter Abbeel; Michael Laskin, 5th Conference on Robot Learning, CoRL 2021, ML Research Press, 2021-11 |
Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings Chen, Lili; Lee, Kimin; Aravind Srinivas; Pieter Abbeel, 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12 |
URLB: Unsupervised Reinforcement Learning Benchmark Laskin, Michael; Denis Yarats; Hao Liu; Lee, Kimin; Albert Zhan; Kevin Lu; Catherine Cang; et al, 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12 |
B-Pref: Benchmarking Preference-Based Reinforcement Learning Lee, Kimin; Laura Smith; Anca Dragan; Pieter Abbeel, 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12 |
Decoupling Representation Learning from Reinforcement Learning Stooke, Adam; Lee, Kimin; Pieter Abbeel; Michael Laskin, 38th International Conference on Machine Learning, ICML 2021, International Machine Learning Society (IMLS), 2021-07 |
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning Lee, Kimin; Michael Laskin; Aravind Srinivas; Pieter Abbeel, 38th International Conference on Machine Learning, ICML 2021, International Machine Learning Society (IMLS), 2021-07 |
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training Lee, Kimin; Laura Smith; Pieter Abbeel, 38th International Conference on Machine Learning, ICML 2021, International Machine Learning Society (IMLS), 2021-07 |
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment Carvalho, Wilka; Anthony Liang; Lee, Kimin; Sungryull Sohn; Honglak Lee; Richard L. Lewis; Satinder Singh, 30th International Joint Conference on Artificial Intelligence (IJCAI-21), International Joint Conference on Artificial Intelligence Organization, 2021-08 |
Decision Transformer: Reinforcement Learning via Sequence Modeling Chen, Lili; Kevin Lu; Aravind Rajeswaran; Lee, Kimin; Aditya Grover; Misha Laskin; Pieter Abbeel; et al, 35th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2021-12 |
Reinforcement Learning with Augmented Data Laskin, Misha; Lee, Kimin; Adam Stooke; Lerrel Pinto; Pieter Abbeel; Aravind Srinivas, 34th Conference on Neural Information Processing Systems (NeurIPS), Neural Information Processing Systems Foundation, 2020-12-09 |
MAP inference for Bayesian inverse reinforcement learning Choi, Jae-Deug; Kim, Kee-Eung, 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011, Neural Information Processing Systems Foundation, 2011-12 |
A POMDP-Based Optimal Control of P300-Based Brain-Computer Interfaces Park, Jaeyoung; Kim, Kee-Eung; Song, Yoon-Kyu, 25th AAAI Conference on Artificial Intelligence, AAAI 2011, pp.1559 - 1562, Association for the Advancement of Artificial Intelligence (AAAI), 2011-08 |
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning Park, Jongjin; Seo, Younggyo; Shin, Jinwoo; Lee, Honglak; Abbeel, Pieter; Lee, Kimin, 10th International Conference on Learning Representations, ICLR 2022, International Conference on Learning Representations, 2022-04-26 |
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning Liang, Xinran; Shu, Katherine; Lee, Kimin; Abbeel, Pieter, 10th International Conference on Learning Representations, ICLR 2022, International Conference on Learning Representations, 2022-04-27 |