Imitation learning from compressed observation domain

In general, when a novice attempts to imitate an expert’s behavior, the novice cannot fully grasp the expert’s internal state. For example, when we learn the movements of a professional soccer player, we perform imitation learning by watching video of the player, and we do not see internal state such as the state of the player’s joints or the player’s psychological state. In this case, the expert’s state information is compressed before it reaches the novice, who observes only the compressed information and must perform imitation learning from it. The novice therefore needs the ability to learn by imitation from expert state information projected onto a compressed observation domain. People do this naturally through empathy with others, but it is a very challenging task for machines, because machines require many cross-domain pairs between the original state domain and the compressed observation domain. In this thesis, we propose an algorithm that performs imitation learning from the compressed observation domain using only a very small number of expert state samples, without requiring cross-domain pairs between the state domain and the compressed observation domain.
Advisors
Sung, Youngchul (성영철)
Description
Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2018
Identifier
325007
Language
eng
Description

Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): School of Electrical Engineering, 2018.2, [iii, 27 p.]

Keywords

Reinforcement learning; Generative adversarial network; Imitation learning

URI
http://hdl.handle.net/10203/266815
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=734010&flag=dissertation
Appears in Collection
EE-Theses_Master (Master's theses)
Files in This Item
There are no files associated with this item.
