An Energy-Efficient Deep Convolutional Neural Network Training Accelerator for In-Situ Personalization on Smart Devices

Cited 25 times in Web of Science, 16 times in Scopus
  • Hits: 318
  • Downloads: 0
DC Field | Value | Language
dc.contributor.author | Choi, Seungkyu | ko
dc.contributor.author | Sim, Jaehyeong | ko
dc.contributor.author | Kang, Myeonggu | ko
dc.contributor.author | Choi, Yeongjae | ko
dc.contributor.author | Kim, Hyeonuk | ko
dc.contributor.author | Kim, Lee-Sup | ko
dc.date.accessioned | 2020-10-14T02:55:10Z | -
dc.date.available | 2020-10-14T02:55:10Z | -
dc.date.created | 2020-08-12 | -
dc.date.issued | 2020-10 | -
dc.identifier.citation | IEEE JOURNAL OF SOLID-STATE CIRCUITS, v.55, no.10, pp.2691 - 2702 | -
dc.identifier.issn | 0018-9200 | -
dc.identifier.uri | http://hdl.handle.net/10203/276545 | -
dc.description.abstract | A scalable deep-learning accelerator supporting the training process is implemented for device personalization of deep convolutional neural networks (CNNs). It consists of three processor cores operating with distinct energy-efficient dataflows for the different types of computation in CNN training. Unlike previous works, which apply design techniques that exploit the same characteristics as inference, we analyze the major issues that arise from training in a resource-constrained system and resolve the bottlenecks. A masking scheme in the propagation core reduces the massive amount of intermediate activation data storage, eliminating the frequent off-chip memory accesses otherwise needed to hold the generated activation data until the backward path. A disparate dataflow architecture is implemented for the weight-gradient computation to enhance PE utilization while maximally reusing the input data. Furthermore, the modified weight update system enables an 8-bit fixed-point computing datapath. The processor is implemented in 65-nm CMOS technology and occupies 10.24 mm² of core area. It operates at supply voltages from 0.63 to 1.0 V, and the computing engine runs at a near-threshold voltage of 0.5 V. The chip consumes 40.7 mW at 50 MHz at its highest efficiency point and achieves 47.4 µJ/epoch of training efficiency for the customized CNN model. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.title | An Energy-Efficient Deep Convolutional Neural Network Training Accelerator for In-Situ Personalization on Smart Devices | -
dc.type | Article | -
dc.identifier.wosid | 000572629500007 | -
dc.identifier.scopusid | 2-s2.0-85089364270 | -
dc.type.rims | ART | -
dc.citation.volume | 55 | -
dc.citation.issue | 10 | -
dc.citation.beginningpage | 2691 | -
dc.citation.endingpage | 2702 | -
dc.citation.publicationname | IEEE JOURNAL OF SOLID-STATE CIRCUITS | -
dc.identifier.doi | 10.1109/JSSC.2020.3005786 | -
dc.contributor.localauthor | Kim, Lee-Sup | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordPlus | Convolutional neural network (CNN) | -
dc.subject.keywordPlus | dataflow | -
dc.subject.keywordPlus | deep-learning application-specific integrated circuit (ASIC) | -
dc.subject.keywordPlus | deep learning | -
dc.subject.keywordPlus | neural network training | -
dc.subject.keywordPlus | training processor | -
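
The masking scheme summarized in the abstract rests on a general property of ReLU layers: their backward pass needs only the sign of the forward activation, not its value, so a 1-bit mask can stand in for the full stored activation. Below is a minimal NumPy sketch of that general idea; the function names and storage format are illustrative assumptions, not the chip's actual propagation-core implementation.

```python
import numpy as np

def relu_forward(x):
    """Forward ReLU; retain only a 1-bit mask instead of the activation."""
    mask = x > 0                                       # 1 bit per element
    y = np.where(mask, x, 0.0).astype(x.dtype)
    return y, mask                                     # keep `mask`, not `x`, for backward

def relu_backward(dy, mask):
    """Backward ReLU uses only the sign information captured in `mask`."""
    return np.where(mask, dy, 0.0).astype(dy.dtype)

# Usage: the 1-bit mask replaces, e.g., a 16-bit stored activation,
# cutting the data held for this layer's backward path by roughly 16x.
x = np.random.randn(4, 4).astype(np.float32)
y, mask = relu_forward(x)
dy = np.ones_like(y)
dx = relu_backward(dy, mask)
```

Holding compact masks instead of full activations between the forward and backward paths is consistent with how the abstract describes eliminating frequent off-chip memory accesses for the generated activation data.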
Appears in Collection
EE-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.