A Deep Neural Network Training Architecture with Inference-aware Heterogeneous Data-type

Cited 4 times in Web of Science; cited 0 times in Scopus
DC Field: Value (Language)
dc.contributor.author: Choi, Seungkyu (ko)
dc.contributor.author: Shin, Jaekang (ko)
dc.contributor.author: Kim, Lee-Sup (ko)
dc.date.accessioned: 2022-04-25T10:00:12Z
dc.date.available: 2022-04-25T10:00:12Z
dc.date.created: 2021-06-15
dc.date.issued: 2022-05
dc.identifier.citation: IEEE TRANSACTIONS ON COMPUTERS, v.71, no.5, pp.1216 - 1229
dc.identifier.issn: 0018-9340
dc.identifier.uri: http://hdl.handle.net/10203/295891
dc.description.abstract: As deep learning applications often suffer accuracy degradation from inputs distorted by varying environmental conditions, training with personal data has become essential for edge devices, and trainable deep learning accelerators for on-device training have been actively studied. Nevertheless, previous research does not consider the fundamental datapath for training or the importance of retaining high performance for inference tasks. In this work, we propose NeuroFlix, a deep neural network training accelerator that supports heterogeneous data types, floating-point and fixed-point, for its input operands. Guided by two goals, (1) a separate precision decision for each input operand and (2) maintaining high inference performance, we represent activations and weights in low-bit fixed-point and error gradients in floating-point with up to half precision. A novel MAC architecture computes low- and high-precision modes for the different input combinations. By replacing costly floating-point additions with brick-level separate accumulations, we achieve both an area-efficient architecture and high throughput for low-precision computation. Consequently, NeuroFlix outperforms previous state-of-the-art architectures, proving its efficiency in both training and inference. Compared with an off-the-shelf bfloat16-based accelerator, it achieves 1.2x speedup and 2.0x energy efficiency in training, rising to 3.6x and 4.5x in inference.
dc.language: English
dc.publisher: IEEE COMPUTER SOC
dc.title: A Deep Neural Network Training Architecture with Inference-aware Heterogeneous Data-type
dc.type: Article
dc.identifier.wosid: 000778905700018
dc.identifier.scopusid: 2-s2.0-85105844527
dc.type.rims: ART
dc.citation.volume: 71
dc.citation.issue: 5
dc.citation.beginningpage: 1216
dc.citation.endingpage: 1229
dc.citation.publicationname: IEEE TRANSACTIONS ON COMPUTERS
dc.identifier.doi: 10.1109/TC.2021.3078316
dc.contributor.localauthor: Kim, Lee-Sup
dc.description.isOpenAccess: N
dc.type.journalArticle: Article
dc.subject.keywordAuthor: Training
dc.subject.keywordAuthor: Computer architecture
dc.subject.keywordAuthor: Throughput
dc.subject.keywordAuthor: Quantization (signal)
dc.subject.keywordAuthor: Neural networks
dc.subject.keywordAuthor: Computational modeling
dc.subject.keywordAuthor: Performance evaluation
dc.subject.keywordAuthor: Deep neural network
dc.subject.keywordAuthor: on-device training
dc.subject.keywordAuthor: multiply-and-accumulate unit
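
The abstract above describes a MAC datapath that takes low-bit fixed-point activations and weights alongside floating-point error gradients. The Python/NumPy sketch below is only a rough illustration of that heterogeneous data-type idea, not the paper's NeuroFlix unit or its brick-level accumulation scheme; the function names, 8-bit width, and half-precision gradient type are assumptions made here for illustration.

```python
# Conceptual sketch (assumed, not the NeuroFlix datapath): a MAC mixing
# low-bit fixed-point activations/weights with floating-point terms.
import numpy as np

def quantize_fixed(x, bits=8):
    """Symmetric fixed-point quantization: return integer codes and a scale."""
    qmax = 2 ** (bits - 1) - 1
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -qmax - 1, qmax).astype(np.int32)
    return q, scale

def heterogeneous_mac(activations, weights, grad_fp16, bits=8):
    """Forward MAC in low-bit fixed-point; gradient term kept in half precision."""
    qa, sa = quantize_fixed(activations, bits)
    qw, sw = quantize_fixed(weights, bits)
    acc_int = np.dot(qa, qw)                 # cheap integer multiply-accumulate
    forward = np.float32(acc_int) * sa * sw  # one rescale back to real values
    # Backward-style term: error gradients stay floating-point (fp16 here).
    backward = np.dot(activations.astype(np.float16), grad_fp16)
    return forward, np.float32(backward)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    a = rng.standard_normal(64).astype(np.float32)
    w = rng.standard_normal(64).astype(np.float32)
    g = rng.standard_normal(64).astype(np.float16)
    fwd, bwd = heterogeneous_mac(a, w, g)
    print("forward (int8 MAC):", fwd, "reference:", float(np.dot(a, w)))
    print("backward (fp16 gradient term):", bwd)
```

The split mirrors the precision choice the abstract argues for: inference-side operands (activations, weights) stay in cheap fixed-point arithmetic, while the training-only error gradient keeps a floating-point representation.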
Appears in Collection
EE - Journal Papers
Files in This Item
There are no files associated with this item.