DSpace at KOASAS: Imitation learning from compressed observation domain

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

Imitation learning from compressed observation domain압축된 관찰 영역으로부터의 모방 학습

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 382
Download : 0

Export

Noh, Hyungcheol

In general, when a novice attempts to imitate an expert’s behavior, the novice can not fully grasp the expert’s internal state. For example, when we learn the movement of a professional soccer player, we performe imitation learning while watching the video of the player, and we do not see the internal state such as the player’s joint or psychological state. In this case, the state information of the expert is compressed and transmitted to the novice, and the novice observes the compressed state information and performs imitation learning from this information. Therefore, the novice needs the ability of imitation learning from the expert state information projected onto the compressed observation domain. In the case of a person, this work can be done naturally through empathy with others, but it is a very challenging task for machines because machines require a lot of cross-domain pairs between the original state domain and the compressed observation domain. In this thesis, we propose an algorithm that can perform imitation learning from the compressed observation domain with only a very small amount of expert state sample, without the need of the cross-domain pairs between the state domain and the compressed observation domain.

Advisors: Sung, Youngchul researcher; 성영철 researcher

Description: 한국과학기술원 :전기및전자공학부,

Publisher: 한국과학기술원

Issue Date: 2018

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2018.2,[iii, 27 p. :]

Keywords: Reinforcement learning▼aGenerative adversarial network▼aImitation learning; 강화학습▼a생성 적대 네트워크▼a모방 학습

URI: http://hdl.handle.net/10203/266815

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=734010&flag=dissertation

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Imitation learning from compressed observation domain압축된 관찰 영역으로부터의 모방 학습

KOASAS

Communities & Collections