Inverse reinforcement learning in partially observable environments부분관찰환경에서의 역강화학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 478
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorKim, Kee-Eung-
dc.contributor.advisor김기응-
dc.contributor.authorChoi, Jae-Deug-
dc.contributor.author최재득-
dc.date.accessioned2011-12-13T06:08:24Z-
dc.date.available2011-12-13T06:08:24Z-
dc.date.issued2009-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327349&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/34884-
dc.description학위논문(석사) - 한국과학기술원 : 전산학전공, 2009. 8., [ v, 36 p. ]-
dc.description.abstractInverse reinforcement learning (IRL) is the problem of recovering the underlying reward function from the behavior of an expert. Most of the existing algorithms for IRL assume that the expert`s environment is modeled as a Markov decision process (MDP), although they should be able to handle partially observable settings in order to widen the applicability to more realistic scenarios. In this paper, we present an extension of the classical IRL algorithm by Ng and Russell to partially observable environments. We discuss technical issues and challenges, and present the experimental results on some of the benchmark partially observable domains.eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectMachine Learning.-
dc.subjectReinforcement Learning.-
dc.subjectPartially Observable Markov Decision Processes(POMDPs).-
dc.subjectInverse Reinforcement Learning.-
dc.subject기계학습.-
dc.subject강화학습.-
dc.subject부분관찰마르코프의사결정과정.-
dc.subject역강화학습.-
dc.subjectMachine Learning.-
dc.subjectReinforcement Learning.-
dc.subjectPartially Observable Markov Decision Processes(POMDPs).-
dc.subjectInverse Reinforcement Learning.-
dc.subject기계학습.-
dc.subject강화학습.-
dc.subject부분관찰마르코프의사결정과정.-
dc.subject역강화학습.-
dc.titleInverse reinforcement learning in partially observable environments-
dc.title.alternative부분관찰환경에서의 역강화학습-
dc.typeThesis(Master)-
dc.identifier.CNRN327349/325007 -
dc.description.department한국과학기술원 : 전산학전공, -
dc.identifier.uid020083539-
dc.contributor.localauthorKim, Kee-Eung-
dc.contributor.localauthor김기응-
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0