A Bayesian Approach to Generative Adversarial Imitation Learning

Cited 2 time in webofscience Cited 0 time in scopus
  • Hit : 268
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorJeon, Wonseokko
dc.contributor.authorSeo, Seokinko
dc.contributor.authorKim, Kee-Eungko
dc.date.accessioned2019-03-19T01:38:51Z-
dc.date.available2019-03-19T01:38:51Z-
dc.date.created2019-03-09-
dc.date.created2019-03-09-
dc.date.created2019-03-09-
dc.date.issued2018-12-06-
dc.identifier.citation32nd Conference on Neural Information Processing Systems (NIPS 2018)-
dc.identifier.urihttp://hdl.handle.net/10203/251739-
dc.description.abstractGenerative adversarial training for imitation learning has shown promising results on high-dimensional and continuous control tasks. This paradigm is based on reducing the imitation learning problem to the density matching problem, where the agent iteratively refines the policy to match the empirical state-action visitation frequency of the expert demonstration. Although this approach can robustly learn to imitate even with scarce demonstration, one must still address the inherent challenge that collecting trajectory samples in each iteration is a costly operation. To address this issue, we first propose a Bayesian formulation of generative adversarialimitation learning (GAIL), where the imitation policy and the cost function are represented as stochastic neural networks. Then, we show that we can significantly enhance the sample efficiency of GAIL leveraging the predictive density of the cost, on an extensive set of imitation learning tasks with high-dimensional states and actions.-
dc.languageEnglish-
dc.publisherNeural Information Processing Systems-
dc.titleA Bayesian Approach to Generative Adversarial Imitation Learning-
dc.typeConference-
dc.identifier.wosid000461852002002-
dc.identifier.scopusid2-s2.0-85064832431-
dc.type.rimsCONF-
dc.citation.publicationname32nd Conference on Neural Information Processing Systems (NIPS 2018)-
dc.identifier.conferencecountryCN-
dc.identifier.conferencelocationMontreal Convention Centre-
dc.contributor.localauthorKim, Kee-Eung-
dc.contributor.nonIdAuthorJeon, Wonseok-
dc.contributor.nonIdAuthorSeo, Seokin-
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 2 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0