A 8.81 TFLOPS/W Deep-Reinforcement-Learning Accelerator With Delta-Based Weight Sharing and Block-Mantissa Reconfigurable PE Array

DC Field | Value | Language
dc.contributor.author | An, Sanghyuk | ko
dc.contributor.author | Ryu, Junha | ko
dc.contributor.author | Park, Gwangtae | ko
dc.contributor.author | Yoo, Hoi-Jun | ko
dc.date.accessioned | 2024-08-30T03:00:14Z | -
dc.date.available | 2024-08-30T03:00:14Z | -
dc.date.created | 2024-08-29 | -
dc.date.issued | 2024-05 | -
dc.identifier.citation | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, v.71, no.5, pp.2529 - 2533 | -
dc.identifier.issn | 1549-7747 | -
dc.identifier.uri | http://hdl.handle.net/10203/322487 | -
dc.description.abstract | TD3 is one of the highest-performing Deep Reinforcement Learning (DRL) algorithms, providing high training stability and rewards. However, it suffers from low energy efficiency due to heavy External Memory Access (EMA) and floating-point operations. To mitigate this issue and achieve higher throughput and energy efficiency, we propose a DRL accelerator with three features: 1) Delta-based Weight Sharing (DWS) represents weights by referencing the corresponding network and exploits data locality, reducing EMA by up to 64.3% in the feed-forward stage and 39.7% in the gradient-generation and weight-update stage. 2) The Block-Mantissa Reconfigurable PE Array (BMRPA) supports operations with variable block sizes and mantissa widths to provide the optimal precision for each layer, yielding up to a 4x increase in throughput. 3) The Multi-mode Data Fetcher (MDF) supports bit-width-adaptive data fetching, achieving twice the bandwidth with an average read overhead of 5.3%. Combined with the BMRPA, the accelerator attains an energy efficiency of 8.81 TFLOPS/W. (An illustrative sketch of the delta-based weight-sharing idea follows the metadata listing below.) | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | A 8.81 TFLOPS/W Deep-Reinforcement-Learning Accelerator With Delta-Based Weight Sharing and Block-Mantissa Reconfigurable PE Array | -
dc.type | Article | -
dc.identifier.wosid | 001230987700069 | -
dc.identifier.scopusid | 2-s2.0-85188005568 | -
dc.type.rims | ART | -
dc.citation.volume | 71 | -
dc.citation.issue | 5 | -
dc.citation.beginningpage | 2529 | -
dc.citation.endingpage | 2533 | -
dc.citation.publicationname | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS | -
dc.identifier.doi | 10.1109/TCSII.2024.3374725 | -
dc.contributor.localauthor | An, Sanghyuk | -
dc.contributor.localauthor | Yoo, Hoi-Jun | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Training | -
dc.subject.keywordAuthor | Throughput | -
dc.subject.keywordAuthor | Energy efficiency | -
dc.subject.keywordAuthor | Decoding | -
dc.subject.keywordAuthor | Artificial neural networks | -
dc.subject.keywordAuthor | Vectors | -
dc.subject.keywordAuthor | Task analysis | -
dc.subject.keywordAuthor | Deep reinforcement learning | -
dc.subject.keywordAuthor | TD3 | -
dc.subject.keywordAuthor | external memory access | -
dc.subject.keywordAuthor | block floating point | -
dc.subject.keywordAuthor | reconfigurable | -
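
The abstract describes Delta-based Weight Sharing only at a high level. Below is a minimal Python sketch, not the authors' hardware design, of the general idea it relies on: in TD3 the target networks track their online networks through Polyak averaging, so a target layer can be stored as small quantized deltas against a reference copy instead of as a second full FP32 tensor, which is one way such correlation can reduce external memory traffic. The function names, the 8-bit delta format, and the tau value are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def polyak_update(w_online: np.ndarray, w_target: np.ndarray, tau: float = 0.005) -> np.ndarray:
    """Standard TD3 soft target update: w_target <- tau*w_online + (1-tau)*w_target."""
    return tau * w_online + (1.0 - tau) * w_target

def encode_delta(w_ref: np.ndarray, w: np.ndarray, bits: int = 8):
    """Quantize (w - w_ref) to a narrow signed-integer payload plus one scale factor."""
    delta = w - w_ref
    scale = float(np.max(np.abs(delta))) / (2 ** (bits - 1) - 1) + 1e-12
    q = np.round(delta / scale).astype(np.int8)   # narrow-width payload
    return q, scale

def decode_delta(w_ref: np.ndarray, q: np.ndarray, scale: float) -> np.ndarray:
    """Reconstruct approximate weights from the reference copy plus quantized deltas."""
    return w_ref + q.astype(np.float32) * scale

# Toy demo: after a soft update the target layer differs from the online layer
# only slightly, so 8-bit deltas reconstruct it almost exactly while the stored
# payload is 4x smaller than a second FP32 copy of the weights.
rng = np.random.default_rng(0)
w_online = rng.standard_normal((256, 256)).astype(np.float32)
w_target = w_online.copy()                                    # target starts as a copy
w_online += 0.01 * rng.standard_normal(w_online.shape).astype(np.float32)  # one "training step"
w_target = polyak_update(w_online, w_target)

q, scale = encode_delta(w_online, w_target)
w_rec = decode_delta(w_online, q, scale)
print("max reconstruction error:", float(np.abs(w_rec - w_target).max()))
print("bytes for FP32 copy:", w_target.nbytes, "| bytes for int8 deltas:", q.nbytes)
```

The same trade-off motivates the paper's reported EMA reductions: the reference weights are fetched once and only the compact deltas are moved for the related network, at the cost of a small decode step.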
Appears in Collections
RIMS Journal Papers; EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.
