DSpace at KOASAS: Sampling Rate Decay in Hindsight Experience Replay for Robot Control

Sampling Rate Decay in Hindsight Experience Replay for Robot Control

DC Field	Value	Language
dc.contributor.author	Vecchietti, Luiz Felipe	ko
dc.contributor.author	Seo, Minah	ko
dc.contributor.author	Har, Dongsoo	ko
dc.date.accessioned	2022-04-14T06:52:32Z	-
dc.date.available	2022-04-14T06:52:32Z	-
dc.date.created	2020-04-17	-
dc.date.issued	2022-03	-
dc.identifier.citation	IEEE TRANSACTIONS ON CYBERNETICS, v.52, no.3, pp.1515 - 1526	-
dc.identifier.issn	2168-2267	-
dc.identifier.uri	http://hdl.handle.net/10203/292822	-
dc.description.abstract	Training agents via deep reinforcement learning with sparse rewards for robotic control tasks in vast state spaces is a major challenge because successful experiences are rare. To address this problem, two recent breakthrough methods, hindsight experience replay (HER) and aggressive rewards to counter bias in HER (ARCHER), treat unsuccessful experiences as successful experiences that achieve different goals, that is, hindsight experiences. In these methods, hindsight experience is used at a fixed sampling rate throughout training. However, this use of hindsight experience introduces bias, due to a distinct optimal policy, and does not allow hindsight experience to take on different importance at different stages of training. In this article, we investigate the impact of a variable sampling rate, representing a variable rate of hindsight experience, on training performance and propose a sampling rate decay strategy that decreases the number of hindsight experiences as training proceeds. The proposed method is validated on three robotic control tasks included in the OpenAI Gym suite. The experimental results demonstrate that the proposed method achieves improved training performance and increased convergence speed over HER and ARCHER on two of the three tasks, and comparable training performance and convergence speed on the third.	-
dc.language	English	-
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC	-
dc.title	Sampling Rate Decay in Hindsight Experience Replay for Robot Control	-
dc.type	Article	-
dc.identifier.wosid	000795863600013	-
dc.identifier.scopusid	2-s2.0-85126389132	-
dc.type.rims	ART	-
dc.citation.volume	52	-
dc.citation.issue	3	-
dc.citation.beginningpage	1515	-
dc.citation.endingpage	1526	-
dc.citation.publicationname	IEEE TRANSACTIONS ON CYBERNETICS	-
dc.identifier.doi	10.1109/TCYB.2020.2990722	-
dc.contributor.localauthor	Har, Dongsoo	-
dc.contributor.nonIdAuthor	Vecchietti, Luiz Felipe	-
dc.contributor.nonIdAuthor	Seo, Minah	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Task analysis	-
dc.subject.keywordAuthor	Erbium	-
dc.subject.keywordAuthor	Robot control	-
dc.subject.keywordAuthor	Neural networks	-
dc.subject.keywordAuthor	Aerospace electronics	-
dc.subject.keywordAuthor	Hindsight experience replay (HER)	-
dc.subject.keywordAuthor	machine learning	-
dc.subject.keywordAuthor	reinforcement learning (RL)	-
dc.subject.keywordAuthor	robot control	-
dc.subject.keywordAuthor	sampling rate decay	-
dc.subject.keywordPlus	GAME	-
dc.subject.keywordPlus	GO	-
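
The abstract describes replacing a fixed hindsight sampling rate with one that decreases as training proceeds. The following is a minimal Python sketch of that idea and not the authors' implementation: this record does not state the decay schedule, so a linear decay from 0.8 to 0.0 is assumed, and the names `hindsight_rate` and `sample_batch`, the episode tuple layout, the "future" goal relabeling, and the 0 / -1 sparse reward are all illustrative assumptions.

```python
import random


def hindsight_rate(epoch, total_epochs, initial_rate=0.8, final_rate=0.0):
    """Fraction of a batch drawn as hindsight experience at a given epoch.

    Assumption: linear decay. The source only says the rate decreases
    as training proceeds; the actual schedule may differ.
    """
    if total_epochs <= 1:
        return final_rate
    progress = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    return initial_rate + (final_rate - initial_rate) * progress


def sample_batch(episode, batch_size, rate, rng):
    """Sample transitions from one episode, relabeling goals at `rate`.

    `episode` is a list of (state, action, achieved_goal, desired_goal)
    tuples (hypothetical layout). A relabeled transition takes as its goal
    something achieved at the same or a later step of the episode.
    """
    batch = []
    for _ in range(batch_size):
        t = rng.randrange(len(episode))
        state, action, achieved, goal = episode[t]
        is_hindsight = rng.random() < rate
        if is_hindsight:
            future_t = rng.randrange(t, len(episode))
            goal = episode[future_t][2]
        # Sparse reward (assumed convention): 0 on success, -1 otherwise.
        reward = 0.0 if achieved == goal else -1.0
        batch.append((state, action, goal, reward, is_hindsight))
    return batch
```

In a training loop, `hindsight_rate(epoch, total_epochs)` would be recomputed each epoch and passed to `sample_batch`, so early batches are dominated by hindsight experience and later batches by the original goals.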

Appears in Collection: GT-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

Cited by 18 items in WoS.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
