DSpace at KOASAS: Inverse reinforcement learning in partially observable environments


Inverse reinforcement learning in partially observable environments (부분관찰환경에서의 역강화학습)


DC Field	Value	Language
dc.contributor.advisor	Kim, Kee-Eung	-
dc.contributor.advisor	김기응	-
dc.contributor.author	Choi, Jae-Deug	-
dc.contributor.author	최재득	-
dc.date.accessioned	2011-12-13T06:08:24Z	-
dc.date.available	2011-12-13T06:08:24Z	-
dc.date.issued	2009	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=327349&flag=dissertation	-
dc.identifier.uri	http://hdl.handle.net/10203/34884	-
dc.description	Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science, 2009. 8., [ v, 36 p. ]	-
dc.description.abstract	Inverse reinforcement learning (IRL) is the problem of recovering the underlying reward function from the behavior of an expert. Most existing IRL algorithms assume that the expert's environment is modeled as a Markov decision process (MDP), although they should be able to handle partially observable settings in order to widen their applicability to more realistic scenarios. In this paper, we present an extension of the classical IRL algorithm by Ng and Russell to partially observable environments. We discuss technical issues and challenges, and present experimental results on several benchmark partially observable domains.	eng
dc.language	eng	-
dc.publisher	Korea Advanced Institute of Science and Technology (KAIST)	-
dc.subject	Machine Learning.	-
dc.subject	Reinforcement Learning.	-
dc.subject	Partially Observable Markov Decision Processes(POMDPs).	-
dc.subject	Inverse Reinforcement Learning.	-
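dc.subject	기계학습 (Machine Learning).	-
dc.subject	강화학습 (Reinforcement Learning).	-
dc.subject	부분관찰마르코프의사결정과정 (Partially Observable Markov Decision Processes).	-
dc.subject	역강화학습 (Inverse Reinforcement Learning).	-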
dc.title	Inverse reinforcement learning in partially observable environments	-
dc.title.alternative	부분관찰환경에서의 역강화학습	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	327349/325007	-
dc.description.department	Korea Advanced Institute of Science and Technology (KAIST) : Department of Computer Science	-
dc.identifier.uid	020083539	-
dc.contributor.localauthor	Kim, Kee-Eung	-
dc.contributor.localauthor	김기응	-

Appears in Collection: CS-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.


Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
