DSpace at KOASAS: Bayesian Reinforcement Learning with Behavioral Feedback

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Conference Papers(학술회의논문)

Bayesian Reinforcement Learning with Behavioral Feedback

Cited 0 time in webofscience

Cited 0 time in

Hit : 398
Download : 30

Export

DC Field	Value	Language
dc.contributor.author	Hong, Teakgyu	ko
dc.contributor.author	Lee, Jongmin	ko
dc.contributor.author	Kim, Kee-Eung	ko
dc.contributor.author	Ortega, Pedro A.	ko
dc.contributor.author	Lee, Daniel	ko
dc.date.accessioned	2016-12-01T01:34:22Z	-
dc.date.available	2016-12-01T01:34:22Z	-
dc.date.created	2016-11-18	-
dc.date.created	2016-11-18	-
dc.date.issued	2016-07-14	-
dc.identifier.citation	25th International Joint Conference on Artificial Intelligence, pp.1571 - 1577	-
dc.identifier.uri	http://hdl.handle.net/10203/214342	-
dc.description.abstract	In the standard reinforcement learning setting, the agent learns optimal policy solely from state transitions and rewards from the environment. We consider an extended setting where a trainer additionally provides feedback on the actions executed by the agent. This requires appropriately incorporating the feedback, even when the feedback is not necessarily accurate. In this paper, we present a Bayesian approach to this extended reinforcement learning setting. Specifically, we extend Kalman Temporal Difference learning to compute the posterior distribution over Q-values given the state transitions and rewards from the environment as well as the feedback from the trainer. Through experiments on standard reinforcement learning tasks, we show that learning performance can be significantly improved even with inaccurate feedback.	-
dc.language	English	-
dc.publisher	International Joint Conferences on Artificial Intelligence Organization (IJCAI)	-
dc.title	Bayesian Reinforcement Learning with Behavioral Feedback	-
dc.type	Conference	-
dc.identifier.scopusid	2-s2.0-85006154182	-
dc.type.rims	CONF	-
dc.citation.beginningpage	1571	-
dc.citation.endingpage	1577	-
dc.citation.publicationname	25th International Joint Conference on Artificial Intelligence	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	New York City, NY	-
dc.embargo.liftdate	9999-12-31	-
dc.embargo.terms	9999-12-31	-
dc.contributor.localauthor	Kim, Kee-Eung	-
dc.contributor.nonIdAuthor	Hong, Teakgyu	-
dc.contributor.nonIdAuthor	Lee, Jongmin	-
dc.contributor.nonIdAuthor	Ortega, Pedro A.	-
dc.contributor.nonIdAuthor	Lee, Daniel	-

Appears in Collection: RIMS Conference Papers

Files in This Item

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Bayesian Reinforcement Learning with Behavioral Feedback

KOASAS

Communities & Collections