Bayesian multi-task reinforcement learning베이지안 다중환경 강화학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 378
  • Download : 0
In this paper, we will discuss a Bayesian learning technique in multitask reinforcement learning situations. Multitask reinforcement learning aims to make learning more efficient and faster by sharing the information that is learned in various tasks, that is, information that can be commonly applied in multiple tasks. In recent research, it has been possible to share initial policy information by restricting the Kullback-Leibler divergence between the learning policies of each task and the initial policy shared by all tasks. However, if there are environments with conflicting goals in a given set of environments, this initial policy sharing scheme may prevent agents from learning the conflicting environments because it transfers unprofitable knowledge to each task. To solve this problem, this paper suggests the Bayesian methodology in learning an initial policy and a method to learn Bayesian deep artificial neural network policies. By using the Bayesian methodology the distribution of initial policy parameter is learned. So each task-specific policy can deal with the uncertainty of the knowledge transferred from the initial policy. The proposed method is evaluated in the grid world environments and shows more stable and higher performance than the prior work.
Advisors
Kim, Kee-Eungresearcher김기응researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2019
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2019.2,[iii, 23 p. :]

Keywords

multitask reinforcement learning▼abayesian reinforcement learning▼adeep neural networks; 다중환경 강화학습▼a베이지안 강화학습▼a심층 인공 신경망

URI
http://hdl.handle.net/10203/267008
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=843535&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0