DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Kee-Eung | - |
dc.contributor.advisor | 김기응 | - |
dc.contributor.author | Choi, Jae-Deug | - |
dc.contributor.author | 최재득 | - |
dc.date.accessioned | 2011-12-13T06:08:24Z | - |
dc.date.available | 2011-12-13T06:08:24Z | - |
dc.date.issued | 2009 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327349&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/34884 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전산학전공, 2009. 8., [ v, 36 p. ] | - |
dc.description.abstract | Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward function from the behavior of an expert. Most of the existing algorithms for IRL assume that the expert`s environment is modeled as a Markov decision process (MDP), although they should be able to handle partially observable settings in order to widen the applicability to more realistic scenarios. In this paper, we present an extension of the classical IRL algorithm by Ng and Russell to partially observable environments. We discuss technical issues and challenges, and present the experimental results on some of the benchmark partially observable domains. | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Machine Learning. | - |
dc.subject | Reinforcement Learning. | - |
dc.subject | Partially Observable Markov Decision Processes(POMDPs). | - |
dc.subject | Inverse Reinforcement Learning. | - |
dc.subject | 기계학습. | - |
dc.subject | 강화학습. | - |
dc.subject | 부분관찰마르코프의사결정과정. | - |
dc.subject | 역강화학습. | - |
dc.subject | Machine Learning. | - |
dc.subject | Reinforcement Learning. | - |
dc.subject | Partially Observable Markov Decision Processes(POMDPs). | - |
dc.subject | Inverse Reinforcement Learning. | - |
dc.subject | 기계학습. | - |
dc.subject | 강화학습. | - |
dc.subject | 부분관찰마르코프의사결정과정. | - |
dc.subject | 역강화학습. | - |
dc.title | Inverse reinforcement learning in partially observable environments | - |
dc.title.alternative | 부분관찰환경에서의 역강화학습 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 327349/325007 | - |
dc.description.department | 한국과학기술원 : 전산학전공, | - |
dc.identifier.uid | 020083539 | - |
dc.contributor.localauthor | Kim, Kee-Eung | - |
dc.contributor.localauthor | 김기응 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.