DSpace at KOASAS: Deterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Conference Papers(학술회의논문)

Deterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity

Cited 1 time in

Cited 0 time in

Hit : 51
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Lho, Daehwan	ko
dc.contributor.author	Park, Hyunwook	ko
dc.contributor.author	Kim, Keunwoo	ko
dc.contributor.author	Kim, SeongGuk	ko
dc.contributor.author	Sim, Boogyo	ko
dc.contributor.author	Son, Kyungjune	ko
dc.contributor.author	Son, Keeyoung	ko
dc.contributor.author	Kim, Jihun	ko
dc.contributor.author	Choi, Seonguk	ko
dc.contributor.author	Park, Joonsang	ko
dc.contributor.author	Kim, Haeyeon	ko
dc.contributor.author	Kong, Kyubong	ko
dc.contributor.author	Kim, Joungho	ko
dc.date.accessioned	2023-09-15T06:00:50Z	-
dc.date.available	2023-09-15T06:00:50Z	-
dc.date.created	2023-09-15	-
dc.date.issued	2022-10	-
dc.identifier.citation	31st IEEE Conference on Electrical Performance of Electronic Packaging and Systems, EPEPS 2022	-
dc.identifier.issn	2165-410	-
dc.identifier.uri	http://hdl.handle.net/10203/312667	-
dc.description.abstract	In this paper, we propose the deterministic policy gradient-based reinforcement learning for DDR5 memory signaling architecture optimization considering signal integrity. We convert the complex DDR5 memory signaling architecture optimization to the Markov decision process (MDP). The key limitation factor was found through the analysis of the hierarchical channel, and MDP was configured to solve it. The deterministic policy is essential for optimizing high-dimensional problems that have many continuous design parameters. For verification, we compare the proposed method with conventional methods such as random search (RS) and Bayesian optimization (BO) and other reinforcement learning algorithms such as the advantage actor-critic (A2C) and proximal policy optimization (PPO). RS and BO could not be properly optimized even after 10000 iterations of 1000 times, respectively, and A2C and PPO failed to optimize. As a result of comparison, the proposed method has the highest optimality, low computing time, and reusability.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Deterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity	-
dc.type	Conference	-
dc.identifier.wosid	000919898800010	-
dc.identifier.scopusid	2-s2.0-85143439883	-
dc.type.rims	CONF	-
dc.citation.publicationname	31st IEEE Conference on Electrical Performance of Electronic Packaging and Systems, EPEPS 2022	-
dc.identifier.conferencecountry	US	-
dc.identifier.conferencelocation	San Jose	-
dc.identifier.doi	10.1109/EPEPS53828.2022.9947119	-
dc.contributor.localauthor	Kim, Joungho	-
dc.contributor.nonIdAuthor	Park, Hyunwook	-
dc.contributor.nonIdAuthor	Kim, Jihun	-
dc.contributor.nonIdAuthor	Kong, Kyubong	-

Appears in Collection: EE-Conference Papers(학술회의논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 1 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Deterministic Policy Gradient-based Reinforcement Learning for DDR5 Memory Signaling Architecture Optimization considering Signal Integrity

This item is cited by other documents in WoS

KOASAS

Communities & Collections