DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hong, Teakgyu | ko |
dc.contributor.author | Lee, Jongmin | ko |
dc.contributor.author | Kim, Kee-Eung | ko |
dc.contributor.author | Ortega, Pedro A. | ko |
dc.contributor.author | Lee, Daniel | ko |
dc.date.accessioned | 2016-12-01T01:34:22Z | - |
dc.date.available | 2016-12-01T01:34:22Z | - |
dc.date.created | 2016-11-18 | - |
dc.date.created | 2016-11-18 | - |
dc.date.issued | 2016-07-14 | - |
dc.identifier.citation | 25th International Joint Conference on Artificial Intelligence, pp.1571 - 1577 | - |
dc.identifier.uri | http://hdl.handle.net/10203/214342 | - |
dc.description.abstract | In the standard reinforcement learning setting, the agent learns optimal policy solely from state transitions and rewards from the environment. We consider an extended setting where a trainer additionally provides feedback on the actions executed by the agent. This requires appropriately incorporating the feedback, even when the feedback is not necessarily accurate. In this paper, we present a Bayesian approach to this extended reinforcement learning setting. Specifically, we extend Kalman Temporal Difference learning to compute the posterior distribution over Q-values given the state transitions and rewards from the environment as well as the feedback from the trainer. Through experiments on standard reinforcement learning tasks, we show that learning performance can be significantly improved even with inaccurate feedback. | - |
dc.language | English | - |
dc.publisher | International Joint Conferences on Artificial Intelligence Organization (IJCAI) | - |
dc.title | Bayesian Reinforcement Learning with Behavioral Feedback | - |
dc.type | Conference | - |
dc.identifier.scopusid | 2-s2.0-85006154182 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 1571 | - |
dc.citation.endingpage | 1577 | - |
dc.citation.publicationname | 25th International Joint Conference on Artificial Intelligence | - |
dc.identifier.conferencecountry | US | - |
dc.identifier.conferencelocation | New York City, NY | - |
dc.embargo.liftdate | 9999-12-31 | - |
dc.embargo.terms | 9999-12-31 | - |
dc.contributor.localauthor | Kim, Kee-Eung | - |
dc.contributor.nonIdAuthor | Hong, Teakgyu | - |
dc.contributor.nonIdAuthor | Lee, Jongmin | - |
dc.contributor.nonIdAuthor | Ortega, Pedro A. | - |
dc.contributor.nonIdAuthor | Lee, Daniel | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.