DSpace at KOASAS: Segmenting 2K-Videos at 36.5 FPS with 24.3 GFLOPs: Accurate and Lightweight Realtime Semantic Segmentation Network

Segmenting 2K-Videos at 36.5 FPS with 24.3 GFLOPs: Accurate and Lightweight Realtime Semantic Segmentation Network


DC Field	Value	Language
dc.contributor.author	Oh, Dokwan	ko
dc.contributor.author	Ji, Daehyun	ko
dc.contributor.author	Jang, Cheolhun	ko
dc.contributor.author	Hyun, Yoonsuk	ko
dc.contributor.author	Bae, Hong S.	ko
dc.contributor.author	Hwang, Sungju	ko
dc.date.accessioned	2020-12-21T09:10:23Z	-
dc.date.available	2020-12-21T09:10:23Z	-
dc.date.created	2020-12-03	-
dc.date.issued	2020-05-31	-
dc.identifier.citation	IEEE International Conference on Robotics and Automation, ICRA 2020, pp.3153 - 3160	-
dc.identifier.issn	1050-4729	-
dc.identifier.uri	http://hdl.handle.net/10203/278851	-
dc.description.abstract	We propose NfS-SegNet, a fast and lightweight end-to-end convolutional network architecture for real-time segmentation of high-resolution videos, which can segment 2K-videos at 36.5 FPS with 24.3 GFLOPs. This speed and computational efficiency stem from three design choices: 1) The encoder network, NfS-Net, is optimized for speed, using simple building blocks without memory-heavy operations such as depthwise convolutions, and outperforms state-of-the-art lightweight CNN architectures such as SqueezeNet [2], MobileNet v1 [3] and v2 [4], and ShuffleNet v1 [5] and v2 [6] on image classification while running significantly faster. 2) NfS-SegNet has an asymmetric architecture with a deeper encoder and a shallow decoder, a design based on our empirical finding that the decoder is the main computational bottleneck while contributing relatively little to the final performance. 3) Our novel uncertainty-aware knowledge distillation method guides the teacher model to focus its knowledge transfer on the most difficult image regions. We validate NfS-SegNet on the Cityscapes [1] benchmark, on which it achieves state-of-the-art performance among lightweight segmentation models in terms of both accuracy and speed.	-
dc.language	English	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Segmenting 2K-Videos at 36.5 FPS with 24.3 GFLOPs: Accurate and Lightweight Realtime Semantic Segmentation Network	-
dc.type	Conference	-
dc.identifier.wosid	000712319502039	-
dc.identifier.scopusid	2-s2.0-85092716742	-
dc.type.rims	CONF	-
dc.citation.beginningpage	3153	-
dc.citation.endingpage	3160	-
dc.citation.publicationname	IEEE International Conference on Robotics and Automation, ICRA 2020	-
dc.identifier.conferencecountry	FR	-
dc.identifier.conferencelocation	Virtual	-
dc.identifier.doi	10.1109/ICRA40945.2020.9196510	-
dc.contributor.localauthor	Hwang, Sungju	-
dc.contributor.nonIdAuthor	Oh, Dokwan	-
dc.contributor.nonIdAuthor	Ji, Daehyun	-
dc.contributor.nonIdAuthor	Jang, Cheolhun	-
dc.contributor.nonIdAuthor	Hyun, Yoonsuk	-
dc.contributor.nonIdAuthor	Bae, Hong S.	-
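
The abstract's third point, uncertainty-aware knowledge distillation, can be illustrated with a minimal sketch. The record does not give the paper's actual formulation, so everything here is an assumption: the teacher's per-pixel predictive entropy stands in for "difficulty", the per-pixel loss is KL(teacher || student), and the function names are hypothetical. It shows only the general idea of concentrating the distillation signal on uncertain pixels.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one pixel's class logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy (in nats) of a probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two probability vectors of equal length."""
    return sum(pi * math.log(pi / max(qi, eps))
               for pi, qi in zip(p, q) if pi > 0.0)


def uncertainty_weighted_distillation(teacher_logits, student_logits):
    """Weighted mean of per-pixel KL(teacher || student).

    Assumption (not from the paper): each pixel's weight is the teacher's
    entropy normalized by log(num_classes), so confident pixels contribute
    little and uncertain pixels contribute most.
    """
    num_classes = len(teacher_logits[0])
    max_entropy = math.log(num_classes)
    weighted_sum = 0.0
    weight_total = 0.0
    for t_logit, s_logit in zip(teacher_logits, student_logits):
        t_prob = softmax(t_logit)
        s_prob = softmax(s_logit)
        weight = entropy(t_prob) / max_entropy  # lies in [0, 1]
        weighted_sum += weight * kl_divergence(t_prob, s_prob)
        weight_total += weight
    # If the teacher is fully confident everywhere, no pixel carries weight.
    return weighted_sum / weight_total if weight_total > 0.0 else 0.0


# Two "pixels" with three classes each: one confident, one uncertain.
teacher = [[5.0, 0.0, 0.0], [0.1, 0.0, 0.0]]
student = [[0.0, 5.0, 0.0], [0.0, 0.1, 0.0]]
loss = uncertainty_weighted_distillation(teacher, student)
```

In this toy input the second pixel has a nearly uniform teacher distribution, so it receives a weight close to 1, while the confident first pixel is down-weighted. A real implementation would operate on batched tensors in a deep learning framework rather than Python lists.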

Appears in Collection: AI-Conference Papers (conference papers)

Files in This Item: There are no files associated with this item.
