
CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation


Authors: Yu, Qihang / Wang, Huiyu / Kim, Dahun / Qiao, Siyuan / Collins, Maxwell / Zhu, Yukun / Adam, Hartwig / Yuille, Alan / Chen, Liang-Chieh

We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based framework for panoptic segmentation designed around clustering. It rethinks the existing transformer architectures used in segmentation and detection; CMT-DeepLab considers the object queries as cluster centers, which fill the role of grouping the pixels when applied to segmentation. The clustering is computed with an alternating procedure, by first assigning pixels to the clusters by their feature affinity, and then updating the cluster centers and pixel features. Together, these operations comprise the Clustering Mask Transformer (CMT) layer, which produces cross-attention that is denser and more consistent with the final segmentation task. CMT-DeepLab improves the performance over prior art significantly by 4.4% PQ, achieving a new state-of-the-art of 55.7% PQ on the COCO test-dev set.
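
The alternating procedure described in the abstract (assign pixels to clusters by feature affinity, then update the cluster centers) resembles a soft k-means iteration. The following is a minimal sketch of that general idea only, not the authors' CMT layer: it assumes dot-product affinity, a softmax taken over clusters, and a weighted-mean center update, and it omits the pixel-feature update and all learned projections. The function names (`assign_pixels`, `update_centers`, `cluster`) and the toy data are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length feature vectors."""
    return sum(x * y for x, y in zip(a, b))


def assign_pixels(pixels, centers):
    """Step 1: soft-assign each pixel to the clusters by feature affinity.

    Assumption: affinity is a plain dot product, and the softmax runs over
    clusters, so each pixel's assignment weights sum to 1.
    """
    return [softmax([dot(p, c) for c in centers]) for p in pixels]


def update_centers(pixels, weights, centers):
    """Step 2: move each center to the assignment-weighted mean of the pixels."""
    dim = len(pixels[0])
    new_centers = []
    for k, old in enumerate(centers):
        mass = sum(w[k] for w in weights)
        if mass < 1e-12:
            # A cluster that attracted no pixels keeps its previous center.
            new_centers.append(list(old))
            continue
        new_centers.append(
            [sum(w[k] * p[d] for w, p in zip(weights, pixels)) / mass
             for d in range(dim)]
        )
    return new_centers


def cluster(pixels, centers, num_iters=5):
    """Alternate assignment and center update, then return hard labels."""
    for _ in range(num_iters):
        weights = assign_pixels(pixels, centers)
        centers = update_centers(pixels, weights, centers)
    weights = assign_pixels(pixels, centers)
    labels = [max(range(len(centers)), key=lambda k: w[k]) for w in weights]
    return labels, centers


# Toy example: four 2-D "pixel" features and two initial cluster centers.
pixels = [[5.0, 0.0], [4.0, 0.0], [0.0, 5.0], [0.0, 4.0]]
centers = [[1.0, 0.0], [0.0, 1.0]]
labels, centers = cluster(pixels, centers)
```

In this toy setup each center plays the role the abstract gives to an object query: it accumulates the features of the pixels assigned to it, and the final per-pixel argmax over clusters acts as a crude segmentation mask.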

Publisher: IEEE Computer Society

Issue Date: 2022-06

Language: English

Citation: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, pp. 2550-2560

ISSN: 1063-6919

DOI: 10.1109/CVPR52688.2022.00259

URI: http://hdl.handle.net/10203/312790

Appears in Collection: RIMS Conference Papers

Files in This Item: There are no files associated with this item.
