DSpace at KOASAS: Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning

DSpace at KOASAS

RIMS Collection RIMS Conference Papers

Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning

Cited 0 time in webofscience

Cited 0 time in

Hit : 144
Download : 0

Export

Kim, Junsu / Seo, Younggyo / Shin, Jinwoo researcher

Goal-conditioned hierarchical reinforcement learning (HRL) has shown promising results for solving complex and long-horizon RL tasks. However, the action space of high-level policy in the goal-conditioned HRL is often large, so it results in poor exploration, leading to inefficiency in training. In this paper, we present HIerarchical reinforcement learning Guided by Landmarks (HIGL), a novel framework for training a high-level policy with a reduced action space guided by landmarks, i.e., promising states to explore. The key component of HIGL is twofold: (a) sampling landmarks that are informative for exploration and (b) encouraging the high level policy to generate a subgoal towards a selected landmark. For (a), we consider two criteria: coverage of the entire visited state space (i.e., dispersion of states) and novelty of states (i.e., prediction error of a state). For (b), we select a landmark as the very first landmark in the shortest path in a graph whose nodes are landmarks. Our experiments demonstrate that our framework outperforms prior-arts across a variety of control tasks, thanks to efficient exploration guided by landmarks.

Publisher: Neural Information Processing Systems

Issue Date: 2021-12-07

Language: English

Citation: 35th Conference on Neural Information Processing Systems, NeurIPS 2021

URI: http://hdl.handle.net/10203/290297

Appears in Collection: AI-Conference Papers(학술대회논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning

KOASAS

Communities & Collections