DSpace at KOASAS: Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration

Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration

Authors: Han, Seungyul / Sung, Youngchul

In this paper, sample-aware policy entropy regularization is proposed to enhance the conventional policy entropy regularization for better exploration. Exploiting the sample distribution obtainable from the replay buffer, the proposed sample-aware entropy regularization maximizes the entropy of the weighted sum of the policy action distribution and the sample action distribution from the replay buffer for sample-efficient exploration. A practical algorithm named diversity actor-critic (DAC) is developed by applying policy iteration to the objective function with the proposed sample-aware entropy regularization. Numerical results show that DAC significantly outperforms existing recent algorithms for reinforcement learning.
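The core idea of the abstract, taking the entropy of a weighted sum of the policy action distribution and the replay-buffer action distribution, can be illustrated with a small discrete-action sketch. This is not the DAC algorithm itself: the discrete action space, the fixed weight `alpha`, and the function names `entropy` and `sample_aware_entropy` are assumptions made here for illustration, and the abstract does not specify how the weighting is chosen.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    p = np.asarray(p, dtype=float)
    nz = p > 0  # treat 0 * log(0) as 0
    return float(-np.sum(p[nz] * np.log(p[nz])))

def sample_aware_entropy(policy_probs, buffer_actions, alpha=0.5):
    """Entropy of the mixture alpha * policy + (1 - alpha) * buffer distribution.

    `alpha` is a hypothetical fixed mixing weight used only in this sketch.
    """
    policy_probs = np.asarray(policy_probs, dtype=float)
    n_actions = policy_probs.shape[0]
    # Empirical action distribution of the samples stored in the buffer
    # (assumes the buffer is non-empty).
    counts = np.bincount(buffer_actions, minlength=n_actions)
    buffer_probs = counts / counts.sum()
    mixture = alpha * policy_probs + (1.0 - alpha) * buffer_probs
    return entropy(mixture)

# Toy setting: three actions, and a buffer that mostly contains action 0.
buffer_actions = np.array([0, 0, 0, 1])
uniform_policy = np.array([1 / 3, 1 / 3, 1 / 3])
novel_policy = np.array([0.0, 0.2, 0.8])  # favours actions the buffer lacks

# Plain policy entropy prefers the uniform policy ...
print(entropy(uniform_policy), entropy(novel_policy))
# ... while the mixture entropy prefers the policy that complements the buffer.
print(sample_aware_entropy(uniform_policy, buffer_actions),
      sample_aware_entropy(novel_policy, buffer_actions))
```

In this toy case the mixture entropy is higher for the policy that concentrates on rarely sampled actions, even though its own entropy is lower, which is the sense in which the regularizer is "sample-aware". Setting `alpha=1.0` recovers the conventional policy entropy.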

Publisher: International Conference on Machine Learning (ICML)

Issue Date: 2021-07

Language: English

Citation: International Conference on Machine Learning (ICML)

ISSN: 2640-3498

URI: http://hdl.handle.net/10203/286838

Appears in Collection: EE-Conference Papers (학술회의논문, "Conference Papers")

Files in This Item: There are no files associated with this item.

