DSpace at KOASAS: (A) fixed-point deep reinforcement learning platform with quantization-aware training and adaptive parallelism

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Theses_Master(석사논문)

(A) fixed-point deep reinforcement learning platform with quantization-aware training and adaptive parallelism동적 비트 양자화와 적응형 병렬화를 이용한 고정 소수점 기반 심층강화학습 가속 플랫폼

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 82
Download : 0

Export

DC Field	Value	Language
dc.contributor.advisor	Kim, Joo-Young	-
dc.contributor.advisor	김주영	-
dc.contributor.author	Yang, Je	-
dc.date.accessioned	2023-06-26T19:33:47Z	-
dc.date.available	2023-06-26T19:33:47Z	-
dc.date.issued	2022	-
dc.identifier.uri	http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=997256&flag=dissertation	en_US
dc.identifier.uri	http://hdl.handle.net/10203/309859	-
dc.description	학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2022.2,[iv, 25 p. :]	-
dc.description.abstract	We present a deep reinforcement learning acceleration platform named FIXAR, which employs fixed-point data types and arithmetic units for the first time using a SW/HW co-design approach. We propose a quantization-aware training algorithm in fixed-point, which enables to reduce the data precision by half after a certain amount of training time without losing accuracy. We also design a FPGA accelerator that employs adaptive dataflow and parallelism to handle both inference and training operations. Its processing element has configurable datapath to efficiently support the proposed quantized-aware training. We validate our FIXAR platform, where the host CPU emulates the DRL environment and the FPGA accelerates the agent’s DNN operations, by running multiple benchmarks in continuous action spaces based on a latest DRL algorithm called DDPG. Finally, the FIXAR platform achieves 25293.3 inferences per second (IPS) training throughput, which is 2.7 times higher than the CPU-GPU platform. In addition, its FPGA accelerator shows 53826.8 IPS and 2638.0 IPS/W energy efficiency, which are 5.5 times higher and 15.4 times more energy efficient than those of GPU, respectively. FIXAR also shows the best IPS throughput and energy efficiency among other state-of-the-art acceleration platforms using FPGA, even it targets one of the most complex DNN models.	-
dc.language	eng	-
dc.publisher	한국과학기술원	-
dc.title	(A) fixed-point deep reinforcement learning platform with quantization-aware training and adaptive parallelism	-
dc.title.alternative	동적 비트 양자화와 적응형 병렬화를 이용한 고정 소수점 기반 심층강화학습 가속 플랫폼	-
dc.type	Thesis(Master)	-
dc.identifier.CNRN	325007	-
dc.description.department	한국과학기술원 :전기및전자공학부,	-
dc.contributor.alternativeauthor	양제	-

Appears in Collection: EE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

(A) fixed-point deep reinforcement learning platform with quantization-aware training and adaptive parallelism동적 비트 양자화와 적응형 병렬화를 이용한 고정 소수점 기반 심층강화학습 가속 플랫폼

KOASAS

Communities & Collections