Quadruped locomotion on non-rigid terrain using reinforcement learning강화학습을 사용한 비경직 지형에서의 사족보행 설계

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 170
  • Download : 0
Generating quadruped locomotion using reinforcement learning has shown promising results so far. Imitation learning was used to create diverse locomotion styles. Methods based on learning without data showed improved results in speed and accuracy on following commands than methods using a manually designed controller. Indeed, quadruped locomotion both in computer graphics and robotics are becoming more robust. Despite these progresses, most current works solved locomotion on rigid terrain. Since we can easily imagine non-rigid terrain in a real environment such as sand or mud, locomotion on non-rigid terrain should be solved. Our work aims to tackle locomotion on non-rigid terrain. We first showed the effectiveness of our method on the flat rigid terrain. Then we extend our method to work in non-rigid terrain. Our method is based on creating three Bezier curves for the current moving end-effector, the base position, and the base orientation respectively. Once the Bezier curve is created through action space, inverse kinematics is used to calculate the joint angles needed to reach that point. We chose control points of the Bezier curve as an action space. Curriculum learning that enlarged the freedom of control point position enhanced the performance by producing longer stable locomotion. In observation space, we included the configuration of the robot and the minimum amount of environment which was induced by the sparse ray projected from the robot. The necessity of each reward is proven by showing the result motion without each of the rewards subtracted.
Advisors
Lee, Sung Heeresearcher이성희researcher
Description
한국과학기술원 :문화기술대학원,
Publisher
한국과학기술원
Issue Date
2021
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2021.2,[iv, 31 p. :]

Keywords

Motion Control▼aPhysics-based Locomotion▼aReinforcement Learning▼aQuadruped Locomotion▼aNon-rigid Terrain; 움직임 제어▼a물리기반 보행▼a강화학습▼a사족보행▼a비경직 지형

URI
http://hdl.handle.net/10203/295119
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=948612&flag=dissertation
Appears in Collection
GCT-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0