Showing results 1 to 22 of 22
A Bayesian Approach to Generative Adversarial Imitation Learning Jeon, Wonseok; Seo, Seokin; Kim, Kee-Eung, 32nd Conference on Neural Information Processing Systems (NIPS 2018), Neural Information Processing Systems, 2018-12-06 |
Batch Reinforcement Learning with Hyperparameter Gradients Lee, Jongmin; Lee, Byung-Jun; Vrancx, Peter; Kim, Dongho; Kim, Kee-Eung, The 37th International Conference on Machine Learning (ICML 2020), pp.5681 - 5691, International Conference on Machine Learning, 2020-07-16 |
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues Jang, Youngsoo; Lee, Jongmin; Kim, Kee-Eung, NeurIPS Workshop on Conversational AI (ConvAI), NeurIPS Workshop on Conversational AI (ConvAI), 2019-12-14 |
Bayes-Adaptive Monte-Carlo Planning and Learning for Goal-Oriented Dialogues Jang, Youngsoo; Lee, Jongmin; Kim, Kee-Eung, 34th AAAI Conference on Artificial Intelligence (AAAI 2020), pp.7994 - 8001, Association for the Advancement of Artificial Intelligence, 2020-02-11 |
Bayesian Reinforcement Learning with Behavioral Feedback Hong, Teakgyu; Lee, Jongmin; Kim, Kee-Eung; Ortega, Pedro A.; Lee, Daniel, 25th International Joint Conference on Artificial Intelligence, pp.1571 - 1577, International Joint Conferences on Artificial Intelligence Organization (IJCAI), 2016-07-14 |
Dual Correction Strategy for Ranking Distillation in Top-N Recommender System Lee, Youngjune; Kim, Kee-Eung, 30th ACM International Conference on Information and Knowledge Management, CIKM 2021, pp.3186 - 3190, Association for Computing Machinery, 2021-11-02 |
End-to-End Document-Grounded Conversation with Encoder-Decoder Pre-Trained Language Model Kim, Jinhyeon; Kim, Kee-Eung; Ham, Donghoon; Lee, Jeong-Gwan, AAAI Conference on Artificial Intelligence (AAAI) DSTC9 Workshop, Association for the Advancement of Artificial Intelligence, 2021-02-08 |
End-to-End Neural Pipeline for Goal-Oriented Dialogue System using GPT-2 Ham, Donghoon; Lee, Jeong-Gwan; Jang, Youngsoo; Kim, Kee-Eung, The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020), pp.583 - 592, Association for Computational Linguistics, 2020-07-05 |
Imitation Learning via Kernel Mean Embedding Kim, Kee-Eung; Park, Hyun Soo, 32nd AAAI Conference on Artificial Intelligence and the 20th Innovative Applications of Artificial Intelligence Conference, AAAI-18/IAAI-18, pp.3415 - 3422, Association for the Advancement of Artificial Intelligence, 2018-02-06 |
Monte-Carlo Planning and Learning with Language Action Value Estimates Jang, Youngsoo; Kim, Kee-Eung; Seo, Seokin; Lee, Jongmin, The Ninth International Conference on Learning Representations (ICLR), International Conference on Learning Representations, 2021-05 |
Monte-Carlo Tree Search for Constrained POMDPs Lee, Jongmin; Kim, Geon-Hyeong; Poupart, Pascal; Kim, Kee-Eung, 32nd Conference on Neural Information Processing Systems (NIPS 2018), Neural Information Processing Systems, 2018-12-06 |
Monte-Carlo Tree Search in Continuous Action Spaces with Value Gradients Lee, Jongmin; Jeon, Wonseok; Kim, Geon-Hyeong; Kim, Kee-Eung, 34th AAAI Conference on Artificial Intelligence (AAAI 2020), pp.4561 - 4568, Association for the Advancement of Artificial Intelligence, 2020-02-10 |
Multi-View Representation Learning via Total Correlation Objective Hwang, Hyeongjoo; Kim, Geon-hyeong; Hong, Seunghoon; Kim, Kee-Eung, Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021), Neural information processing systems foundation, 2021-12-06 |
OP-CAS: Collision Avoidance with Overtaking Maneuvers Cha, Eun Sang; Kim, Kee-Eung; Longo, Stefano; Mehta, Ankur, 21st IEEE International Conference on Intelligent Transportation Systems (ITSC), pp.1715 - 1720, IEEE, 2018-11-06 |
OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation Lee, Jongmin; Jeon, Wonseok; Lee, Byung-Jun; Pineau, Joelle; Kim, Kee-Eung, International Conference on Machine Learning (ICML), JMLR-JOURNAL MACHINE LEARNING RESEARCH, 2021-07 |
Reinforcement Learning for Control with Multiple Frequencies Lee, Jongmin; Lee, Byung-Jun; Kim, Kee-Eung, Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020), Neural information processing systems foundation, 2020-12-10 |
Representation Balancing Offline Model-based Reinforcement Learning Lee, Byung-Jun; Kim, Kee-Eung; Lee, Jongmin, The Ninth International Conference on Learning Representations (ICLR), International Conference on Learning Representations, 2021-05 |
Residual Neural Processes Lee, Byung-Jun; Hong, Seunghoon; Kim, Kee-Eung, 34th AAAI Conference on Artificial Intelligence (AAAI 2020), pp.4545 - 4552, Association for the Advancement of Artificial Intelligence, 2020-02-11 |
Simulated Physics for High Speed Aerial Systems Kang, MinKu; Kim, Kee-Eung, 18th International Conference on Control, Automation and Systems (ICCAS), pp.800 - 803, Institute of Control, Robotics, and Systems (ICROS), 2018-10-19 |
Trust Region Sequential Variational Inference Kim, Geon-Hyeong; Jang, Youngsoo; Lee, Jongmin; Jeon, Wonseok; Yang, Hongseok; Kim, Kee-Eung, Conference on Asian Conference on Machine Learning (ACML 2019), Asian Conference on Machine Learning, 2019-11-19 |
Variational Inference for Sequential Data with Future Likelihood Estimates Kim, Geon-Hyeong; Jang, Youngsoo; Yang, Hongseok; Kim, Kee-Eung, The 37th International Conference on Machine Learning (ICML 2020), pp.5252 - 5261, ICML Organisation, 2020-07-16 |
Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic Yoon, Deunsol; Hong, Sunghoon; Kim, Kee-Eung; Lee, Byung-Jun, The Ninth International Conference on Learning Representations, International Conference on Learning Representations, 2021-05 |
Discover