DSpace at KOASAS: An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS

DSpace at KOASAS

College of Engineering(공과대학)School of Electrical Engineering(전기및전자공학부)EE-Journal Papers(저널논문)

An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS

Cited 37 time in

Cited 26 time in

Hit : 474
Download : 0

Export

DC Field	Value	Language
dc.contributor.author	Sim, Jaehyeong	ko
dc.contributor.author	Lee, Somin	ko
dc.contributor.author	Kim, Lee-Sup	ko
dc.date.accessioned	2020-01-29T03:20:07Z	-
dc.date.available	2020-01-29T03:20:07Z	-
dc.date.created	2020-01-29	-
dc.date.created	2020-01-29	-
dc.date.created	2020-01-29	-
dc.date.issued	2020-01	-
dc.identifier.citation	IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, v.28, no.1, pp.87 - 100	-
dc.identifier.issn	1063-8210	-
dc.identifier.uri	http://hdl.handle.net/10203/271828	-
dc.description.abstract	We propose a deep convolutional neural network (CNN) inference processor based on a novel enhanced output stationary (EOS) dataflow. Based on the observation that some activations are commonly used in two successive convolutions, the EOS dataflow employs dedicated register files (RFs) for storing such reused activation data to eliminate redundant memory accesses for highly energy-consuming SRAM banks. In addition, processing elements (PEs) are split into multiple small groups such that each group covers a tile of input activation map to increase the usability of activation RFs (ARFs). The processor has two different voltage/frequency domains. The computation domain with 512 PEs operates at near-threshold voltage (NTV) (0.4 V) and 60-MHz frequency to increase energy efficiency, while the rest of the processors including 848-KB SRAMs run at 0.7 V and 120-MHz frequency to increase both on-chip and off-chip memory bandwidths. The measurement results show that our processor is capable of running AlexNet at 831 GOPS/W, VGG-16 at 1151 GOPS/W, ResNet-18 at 1004 GOPS/W, and MobileNet at 948 GOPS/W energy efficiency.	-
dc.language	English	-
dc.publisher	IEEE, Institute of Electrical and Electronics Engineers	-
dc.title	An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS	-
dc.type	Article	-
dc.identifier.wosid	000506608100009	-
dc.identifier.scopusid	2-s2.0-85077823130	-
dc.type.rims	ART	-
dc.citation.volume	28	-
dc.citation.issue	1	-
dc.citation.beginningpage	87	-
dc.citation.endingpage	100	-
dc.citation.publicationname	IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS	-
dc.identifier.doi	10.1109/TVLSI.2019.2935251	-
dc.contributor.localauthor	Kim, Lee-Sup	-
dc.contributor.nonIdAuthor	Lee, Somin	-
dc.description.isOpenAccess	N	-
dc.type.journalArticle	Article	-
dc.subject.keywordAuthor	Earth Observing System	-
dc.subject.keywordAuthor	Radio frequency	-
dc.subject.keywordAuthor	Energy consumption	-
dc.subject.keywordAuthor	System-on-chip	-
dc.subject.keywordAuthor	Memory management	-
dc.subject.keywordAuthor	Registers	-
dc.subject.keywordAuthor	Random access memory	-
dc.subject.keywordAuthor	Convolutional neural network (CNN)	-
dc.subject.keywordAuthor	dataflow	-
dc.subject.keywordAuthor	deep learning	-
dc.subject.keywordAuthor	energy-efficient processor	-
dc.subject.keywordAuthor	near-threshold voltage (NTV)	-

Appears in Collection: EE-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

This item is cited by other documents in WoS

⊙ Detail Information in WoSⓡ	Click to see
⊙ Cited 37 items in WoS	Click to see citing articles in

Display Simple Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

An Energy-Efficient Deep Convolutional Neural Network Inference Processor With Enhanced Output Stationary Dataflow in 65-nm CMOS

This item is cited by other documents in WoS

KOASAS

Communities & Collections