Deterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 51
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorLho, Daehwanko
dc.contributor.authorPark, Hyunwookko
dc.contributor.authorKim, Keunwooko
dc.contributor.authorKim, SeongGukko
dc.contributor.authorSim, Boogyoko
dc.contributor.authorSon, Kyungjuneko
dc.contributor.authorSon, Keeyoungko
dc.contributor.authorKim, Jihunko
dc.contributor.authorChoi, Seongukko
dc.contributor.authorPark, Joonsangko
dc.contributor.authorKim, Haeyeonko
dc.contributor.authorKong, Kyubongko
dc.contributor.authorKim, Jounghoko
dc.date.accessioned2023-09-15T06:00:50Z-
dc.date.available2023-09-15T06:00:50Z-
dc.date.created2023-09-15-
dc.date.issued2022-10-
dc.identifier.citation31st IEEE Conference on Electrical Performance of Electronic Packaging and Systems, EPEPS 2022-
dc.identifier.issn2165-410-
dc.identifier.urihttp://hdl.handle.net/10203/312667-
dc.description.abstractIn this paper, we propose the deterministic policy gradient-based reinforcement learning for DDR5 memory signaling architecture optimization considering signal integrity. We convert the complex DDR5 memory signaling architecture optimization to the Markov decision process (MDP). The key limitation factor was found through the analysis of the hierarchical channel, and MDP was configured to solve it. The deterministic policy is essential for optimizing high-dimensional problems that have many continuous design parameters. For verification, we compare the proposed method with conventional methods such as random search (RS) and Bayesian optimization (BO) and other reinforcement learning algorithms such as the advantage actor-critic (A2C) and proximal policy optimization (PPO). RS and BO could not be properly optimized even after 10000 iterations of 1000 times, respectively, and A2C and PPO failed to optimize. As a result of comparison, the proposed method has the highest optimality, low computing time, and reusability.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleDeterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity-
dc.typeConference-
dc.identifier.wosid000919898800010-
dc.identifier.scopusid2-s2.0-85143439883-
dc.type.rimsCONF-
dc.citation.publicationname31st IEEE Conference on Electrical Performance of Electronic Packaging and Systems, EPEPS 2022-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationSan Jose-
dc.identifier.doi10.1109/EPEPS53828.2022.9947119-
dc.contributor.localauthorKim, Joungho-
dc.contributor.nonIdAuthorPark, Hyunwook-
dc.contributor.nonIdAuthorKim, Jihun-
dc.contributor.nonIdAuthorKong, Kyubong-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0