FlexBlock: A Flexible DNN Training Accelerator With Multi-Mode Block Floating Point Support

Cited 2 times in Web of Science; cited 0 times in Scopus
DC Field | Value | Language
dc.contributor.author | Noh, Seock-Hwan | ko
dc.contributor.author | Koo, Jahyun | ko
dc.contributor.author | Lee, Seunghyun | ko
dc.contributor.author | Park, Jongse | ko
dc.contributor.author | Kung, Jaeha | ko
dc.date.accessioned | 2023-08-28T08:00:22Z | -
dc.date.available | 2023-08-28T08:00:22Z | -
dc.date.created | 2023-08-28 | -
dc.date.issued | 2023-09 | -
dc.identifier.citation | IEEE TRANSACTIONS ON COMPUTERS, v.72, no.9, pp.2522 - 2535 | -
dc.identifier.issn | 0018-9340 | -
dc.identifier.uri | http://hdl.handle.net/10203/311897 | -
dc.description.abstract | When training deep neural networks (DNNs), expensive floating point arithmetic units are used in GPUs or custom neural processing units (NPUs). To reduce the burden of floating point arithmetic, the community has started exploring the use of more efficient data representations, e.g., block floating point (BFP). The BFP format allows a group of values to share an exponent, which effectively reduces the memory footprint and enables cheaper fixed point arithmetic for multiply-accumulate (MAC) operations (illustrated in the sketch following this table). However, existing BFP-based DNN accelerators target a specific precision, making them less versatile. In this paper, we present FlexBlock, a DNN training accelerator with three BFP modes, possibly different among activation, weight, and gradient tensors. By configuring FlexBlock to a lower BFP precision, the number of MACs handled by the core increases by up to 4x in 8-bit mode or 16x in 4-bit mode compared to 16-bit mode. To reach this theoretical upper bound, FlexBlock maximizes core utilization across various precision levels and layer types, and allows dynamic precision control to keep throughput at its peak without sacrificing training accuracy. We evaluate the effectiveness of FlexBlock using representative DNNs on the CIFAR, ImageNet, and WMT14 datasets. As a result, training in FlexBlock significantly improves training speed by 1.5-5.3x and energy efficiency by 2.4-7.0x compared to other training accelerators. | -
dc.language | English | -
dc.publisher | IEEE COMPUTER SOC | -
dc.title | FlexBlock: A Flexible DNN Training Accelerator With Multi-Mode Block Floating Point Support | -
dc.type | Article | -
dc.identifier.wosid | 001047175700008 | -
dc.identifier.scopusid | 2-s2.0-85149901909 | -
dc.type.rims | ART | -
dc.citation.volume | 72 | -
dc.citation.issue | 9 | -
dc.citation.beginningpage | 2522 | -
dc.citation.endingpage | 2535 | -
dc.citation.publicationname | IEEE TRANSACTIONS ON COMPUTERS | -
dc.identifier.doi | 10.1109/TC.2023.3253050 | -
dc.contributor.localauthor | Park, Jongse | -
dc.contributor.nonIdAuthor | Noh, Seock-Hwan | -
dc.contributor.nonIdAuthor | Koo, Jahyun | -
dc.contributor.nonIdAuthor | Lee, Seunghyun | -
dc.contributor.nonIdAuthor | Kung, Jaeha | -
dc.description.isOpenAccess | N | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Training | -
dc.subject.keywordAuthor | Tensors | -
dc.subject.keywordAuthor | Hardware | -
dc.subject.keywordAuthor | Arithmetic | -
dc.subject.keywordAuthor | Parallel processing | -
dc.subject.keywordAuthor | Deep learning | -
dc.subject.keywordAuthor | Scalability | -
dc.subject.keywordAuthor | Block floating point | -
dc.subject.keywordAuthor | DNN training accelerator | -
dc.subject.keywordAuthor | low precision training | -
dc.subject.keywordAuthor | precision scalability | -
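The abstract above describes block floating point (BFP): a block of values shares a single exponent, so each value carries only a small signed fixed-point mantissa and MAC operations reduce to integer multiply-accumulates. The following is a minimal NumPy sketch of that idea, not FlexBlock's actual datapath; the 16-element block size and 8-bit mantissa width are illustrative assumptions, and the paper's 4/8/16-bit modes and dynamic precision control are not modeled.

```python
import numpy as np

def to_bfp(x, mantissa_bits=8, block_size=16):
    """Quantize a 1-D array to block floating point (BFP): every block of
    `block_size` values shares one exponent, and each value keeps only a
    signed fixed-point mantissa of `mantissa_bits` bits."""
    x = np.asarray(x, dtype=np.float64)
    x = np.pad(x, (0, (-len(x)) % block_size))   # pad up to a whole block
    blocks = x.reshape(-1, block_size)

    # Shared exponent per block, chosen so the largest magnitude in the
    # block fits into the signed mantissa range.
    max_abs = np.max(np.abs(blocks), axis=1, keepdims=True)
    exp = np.where(max_abs > 0, np.ceil(np.log2(max_abs + 1e-38)), 0.0)
    scale = 2.0 ** (exp - (mantissa_bits - 1))

    mant = np.clip(np.round(blocks / scale),
                   -(2 ** (mantissa_bits - 1)),
                   2 ** (mantissa_bits - 1) - 1).astype(np.int32)
    return mant, exp.astype(np.int32)

def bfp_dot(mant_a, exp_a, mant_b, exp_b, mantissa_bits=8):
    """Per-block dot product in BFP: integer multiply-accumulate on the
    mantissas, plus one exponent addition and one final scaling per block."""
    acc = np.sum(mant_a.astype(np.int64) * mant_b.astype(np.int64), axis=1)
    shift = (exp_a + exp_b).squeeze(-1) - 2 * (mantissa_bits - 1)
    return acc * (2.0 ** shift)

# Usage: one 16-element block, compared against the float64 reference.
rng = np.random.default_rng(0)
a, b = rng.standard_normal(16), rng.standard_normal(16)
ma, ea = to_bfp(a)
mb, eb = to_bfp(b)
print(float(bfp_dot(ma, ea, mb, eb)[0]), float(np.dot(a, b)))
```

The final lines compare one BFP dot product against the float64 reference; the memory saving the abstract refers to comes from storing one exponent per block instead of one per value, and the cheaper arithmetic comes from the integer multiply-accumulates on the mantissas.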
Appears in Collection
CS-Journal Papers (Journal Papers)
Files in This Item
There are no files associated with this item.