Sampling Rate Decay in Hindsight Experience Replay for Robot Control

Cited 18 time in webofscience Cited 0 time in scopus
  • Hit : 225
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorVecchietti, Luiz Felipeko
dc.contributor.authorSeo, Minahko
dc.contributor.authorHar, Dongsooko
dc.date.accessioned2022-04-14T06:52:32Z-
dc.date.available2022-04-14T06:52:32Z-
dc.date.created2020-04-17-
dc.date.created2020-04-17-
dc.date.created2020-04-17-
dc.date.issued2022-03-
dc.identifier.citationIEEE TRANSACTIONS ON CYBERNETICS, v.52, no.3, pp.1515 - 1526-
dc.identifier.issn2168-2267-
dc.identifier.urihttp://hdl.handle.net/10203/292822-
dc.description.abstractTraining agents via deep reinforcement learning with sparse rewards for robotic control tasks in vast state space are a big challenge, due to the rareness of successful experience. To solve this problem, recent breakthrough methods, the hindsight experience replay (HER) and aggressive rewards to counter bias in HER (ARCHER), use unsuccessful experiences and consider them as successful experiences achieving different goals, for example, hindsight experiences. According to these methods, hindsight experience is used at a fixed sampling rate during training. However, this usage of hindsight experience introduces bias, due to a distinct optimal policy, and does not allow the hindsight experience to take variable importance at different stages of training. In this article, we investigate the impact of a variable sampling rate, representing the variable rate of hindsight experience, on training performance and propose a sampling rate decay strategy that decreases the number of hindsight experiences as training proceeds. The proposed method is validated with three robotic control tasks included in the OpenAI Gym suite. The experimental results demonstrate that the proposed method achieves improved training performance and increased convergence speed over the HER and ARCHER with two of the three tasks and comparable training performance and convergence speed with the other one.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleSampling Rate Decay in Hindsight Experience Replay for Robot Control-
dc.typeArticle-
dc.identifier.wosid000795863600013-
dc.identifier.scopusid2-s2.0-85126389132-
dc.type.rimsART-
dc.citation.volume52-
dc.citation.issue3-
dc.citation.beginningpage1515-
dc.citation.endingpage1526-
dc.citation.publicationnameIEEE TRANSACTIONS ON CYBERNETICS-
dc.identifier.doi10.1109/TCYB.2020.2990722-
dc.contributor.localauthorHar, Dongsoo-
dc.contributor.nonIdAuthorVecchietti, Luiz Felipe-
dc.contributor.nonIdAuthorSeo, Minah-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorErbium-
dc.subject.keywordAuthorRobot control-
dc.subject.keywordAuthorNeural networks-
dc.subject.keywordAuthorAerospace electronics-
dc.subject.keywordAuthorHindsight experience replay (HER)-
dc.subject.keywordAuthormachine learning-
dc.subject.keywordAuthorreinforcement learning (RL)-
dc.subject.keywordAuthorrobot control-
dc.subject.keywordAuthorsampling rate decay-
dc.subject.keywordPlusGAME-
dc.subject.keywordPlusGO-
Appears in Collection
GT-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 18 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0