DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Sung, Youngchul | - |
dc.contributor.advisor | 성영철 | - |
dc.contributor.author | Noh, Hyungcheol | - |
dc.date.accessioned | 2019-09-04T02:42:18Z | - |
dc.date.available | 2019-09-04T02:42:18Z | - |
dc.date.issued | 2018 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=734010&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/266815 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2018.2,[iii, 27 p. :] | - |
dc.description.abstract | In general, when a novice attempts to imitate an expert’s behavior, the novice can not fully grasp the expert’s internal state. For example, when we learn the movement of a professional soccer player, we performe imitation learning while watching the video of the player, and we do not see the internal state such as the player’s joint or psychological state. In this case, the state information of the expert is compressed and transmitted to the novice, and the novice observes the compressed state information and performs imitation learning from this information. Therefore, the novice needs the ability of imitation learning from the expert state information projected onto the compressed observation domain. In the case of a person, this work can be done naturally through empathy with others, but it is a very challenging task for machines because machines require a lot of cross-domain pairs between the original state domain and the compressed observation domain. In this thesis, we propose an algorithm that can perform imitation learning from the compressed observation domain with only a very small amount of expert state sample, without the need of the cross-domain pairs between the state domain and the compressed observation domain. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Reinforcement learning▼aGenerative adversarial network▼aImitation learning | - |
dc.subject | 강화학습▼a생성 적대 네트워크▼a모방 학습 | - |
dc.title | Imitation learning from compressed observation domain | - |
dc.title.alternative | 압축된 관찰 영역으로부터의 모방 학습 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
dc.contributor.alternativeauthor | 노형철 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.