DC Field | Value | Language |
---|---|---|
dc.contributor.author | Oh, Dokwan | ko |
dc.contributor.author | Ji, Daehyun | ko |
dc.contributor.author | Jang, Cheolhun | ko |
dc.contributor.author | Hyunv, Yoonsuk | ko |
dc.contributor.author | Bae, Hong S. | ko |
dc.contributor.author | Hwang, Sungju | ko |
dc.date.accessioned | 2020-12-21T09:10:23Z | - |
dc.date.available | 2020-12-21T09:10:23Z | - |
dc.date.created | 2020-12-03 | - |
dc.date.created | 2020-12-03 | - |
dc.date.created | 2020-12-03 | - |
dc.date.issued | 2020-05-31 | - |
dc.identifier.citation | IEEE International Conference on Robotics and Automation, ICRA 2020, pp.3153 - 3160 | - |
dc.identifier.issn | 1050-4729 | - |
dc.identifier.uri | http://hdl.handle.net/10203/278851 | - |
dc.description.abstract | We propose a fast and lightweight end-to-end convolutional network architecture for real-time segmentation of high resolution videos, NfS-SegNet, that can segement 2K-videos at 36.5 FPS with 24.3 GFLOPS. This speed and computation-efficiency is due to following reasons: 1) The encoder network, NfS-Net, is optimized for speed with simple building blocks without memory-heavy operations such as depthwise convolutions, and outperforms state-of-the-art lightweight CNN architectures such as SqueezeNet [2], Mo- bileNet v1 [3] v2 [4] and ShuffleNet v1 [5] v2 [6] on image classification with significantly higher speed. 2) The NfS- SegNet has an asymmetric architecture with deeper encoder and shallow decoder, whose design is based on our empirical finding that the decoder is the main bottleneck in computation with relatively small contribution to the final performance. 3) Our novel uncertainty-aware knowledge distillation method guides the teacher model to focus its knowledge transfer on the most difficult image regions. We validate the performance of NfS-SegNet with the CITYSCAPE [1] benchmark, on which it achieves state-of-the-art performance among lightweight segementation models in terms of both accuracy and speed. | - |
dc.language | English | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Segmenting 2K-Videos at 36.5 FPS with 24.3 GFLOPs: Accurate and Lightweight Realtime Semantic Segmentation Network | - |
dc.type | Conference | - |
dc.identifier.wosid | 000712319502039 | - |
dc.identifier.scopusid | 2-s2.0-85092716742 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 3153 | - |
dc.citation.endingpage | 3160 | - |
dc.citation.publicationname | IEEE International Conference on Robotics and Automation, ICRA 2020 | - |
dc.identifier.conferencecountry | FR | - |
dc.identifier.conferencelocation | Virtual | - |
dc.identifier.doi | 10.1109/ICRA40945.2020.9196510 | - |
dc.contributor.localauthor | Hwang, Sungju | - |
dc.contributor.nonIdAuthor | Oh, Dokwan | - |
dc.contributor.nonIdAuthor | Ji, Daehyun | - |
dc.contributor.nonIdAuthor | Jang, Cheolhun | - |
dc.contributor.nonIdAuthor | Hyunv, Yoonsuk | - |
dc.contributor.nonIdAuthor | Bae, Hong S. | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.