Transformer based lightweight monocular depth estimation단안 깊이 추정을 위한 트랜스포머 기반 경량 인공 신경망 연구

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 28
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor김준모-
dc.contributor.authorLee, Dong-Jae-
dc.contributor.author이동재-
dc.date.accessioned2024-07-30T19:31:38Z-
dc.date.available2024-07-30T19:31:38Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097218&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/321646-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[iv, 23 p. :]-
dc.description.abstractDepth estimation is an important task in various robotics systems and applications. In mobile robotics systems, monocular depth estimation is desirable since a single RGB camera can be deployable at a low cost and compact size. Due to its significant and growing needs, many lightweight monocular depth estimation networks have been proposed for mobile robotics systems. While most lightweight monocular depth estimation methods have been developed using convolution neural networks, the Transformer has been gradually utilized in monocular depth estimation recently. However, massive parameters and large computational costs in the Transformer disturb the deployment to embedded devices. In this paper, we present a Token-Sharing Transformer (TST), an architecture using the Transformer for monocular depth estimation, optimized especially in embedded devices. The proposed TST utilizes global token sharing, which enables the model to obtain an accurate depth prediction with high throughput in embedded devices. Experimental results show that TST outperforms the existing lightweight monocular depth estimation methods. On the NYU Depth $v2$ dataset, TST can deliver depth maps up to $63.4$ FPS in NVIDIA Jetson nano and $142.6$ FPS in NVIDIA Jetson $TX2$, with lower errors than the existing methods. Furthermore, TST achieves real-time depth estimation of high-resolution images on Jetson $TX2$ with competitive results.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject단안 깊이 추정▼a경량 인공 신경망▼a트랜스포머-
dc.subjectMonocular Depth estimation▼aLightweight Neural Network▼aTransformer-
dc.titleTransformer based lightweight monocular depth estimation-
dc.title.alternative단안 깊이 추정을 위한 트랜스포머 기반 경량 인공 신경망 연구-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthorKim, Junmo-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0