Two mode Q-learning using failure experience of the agent and its application to biped robot개체의 실패 경험을 활용한 Two mode Q-학습과 이족보행 로봇에의 적용

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 673
  • Download : 0
In this thesis, a two mode Q-learning method is proposed for fast convergence, extending Q-learning, a well-known scheme in reinforcement learning. It employs a separate failure Q value that keeps track of failure experiences and uses this to modify the exploratory behavior of a learning agent. The effectiveness of the pro-posed two mode Q-learning method is verified in a grid world environment. Its performance is also evaluated against conventional Q-learning in training a soccer agent to perform goalkeeping. The acquired knowledge of two mode Q-learning is implemented on the goalie robot of the NaroSot soccer system. The effects of varying parameters in two mode Q-learning is investigated. Also, a biped robot, called HSR-IV, is used as a test bed to compare the performance of both algorithms. An external force that is generated in the sagittal plane, is applied to the HSR-IV and its standing posture is investigated. Q and two mode Q-learning are employed to select an action for resisting the external force. In the frontal plane, an external force is generated and impacts the HSR-IV. In this situation, more than two actuators that were considered in this thesis, exist for resisting the external force. For implementing Q and two mode Q-learning in this situation, a curse of dimensionality must be considered. To solve this problem, a module-based scheme is adopted. The effectiveness of module-based two mode Q-learning is verified by real experiments using HSR-IV.
Advisors
Kim, Jong-Hwanresearcher김종환researcher
Description
한국과학기술원 : 전기및전자공학전공,
Publisher
한국과학기술원
Issue Date
2004
Identifier
237638/325007  / 000995132
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기및전자공학전공, 2004.2, [ ix, 154 p. ]

Keywords

Two mode Q-learning; Failure experience; Q-learning; Biped robot; 이족보행 로봇; Two mode Q-학습; 실패 경험; Q-학습

URI
http://hdl.handle.net/10203/35208
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=237638&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0