CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

Cited 25 times in Web of Science · Cited 0 times in Scopus
DC Field | Value | Language
dc.contributor.author | Yu, Qihang | ko
dc.contributor.author | Wang, Huiyu | ko
dc.contributor.author | Kim, Dahun | ko
dc.contributor.author | Qiao, Siyuan | ko
dc.contributor.author | Collins, Maxwell | ko
dc.contributor.author | Zhu, Yukun | ko
dc.contributor.author | Adam, Hartwig | ko
dc.contributor.author | Yuille, Alan | ko
dc.contributor.author | Chen, Liang-Chieh | ko
dc.date.accessioned | 2023-09-21T01:00:31Z | -
dc.date.available | 2023-09-21T01:00:31Z | -
dc.date.created | 2023-09-21 | -
dc.date.issued | 2022-06 | -
dc.identifier.citation | 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022, pp. 2550-2560 | -
dc.identifier.issn | 1063-6919 | -
dc.identifier.uri | http://hdl.handle.net/10203/312790 | -
dc.description.abstract | We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based framework for panoptic segmentation designed around clustering. It rethinks the existing transformer architectures used in segmentation and detection: CMT-DeepLab treats the object queries as cluster centers, which fill the role of grouping the pixels when applied to segmentation. The clustering is computed with an alternating procedure, by first assigning pixels to the clusters by their feature affinity, and then updating the cluster centers and pixel features. Together, these operations comprise the Clustering Mask Transformer (CMT) layer, which produces cross-attention that is denser and more consistent with the final segmentation task. CMT-DeepLab significantly improves over prior art by 4.4% PQ, achieving a new state-of-the-art of 55.7% PQ on the COCO test-dev set. | -
dc.language | English | -
dc.publisher | IEEE Computer Society | -
dc.title | CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation | -
dc.type | Conference | -
dc.identifier.wosid | 000867754202080 | -
dc.identifier.scopusid | 2-s2.0-85141776953 | -
dc.type.rims | CONF | -
dc.citation.beginningpage | 2550 | -
dc.citation.endingpage | 2560 | -
dc.citation.publicationname | 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 | -
dc.identifier.conferencecountry | US | -
dc.identifier.conferencelocation | New Orleans, LA | -
dc.identifier.doi | 10.1109/CVPR52688.2022.00259 | -
dc.contributor.localauthor | Kim, Dahun | -
dc.contributor.nonIdAuthor | Yu, Qihang | -
dc.contributor.nonIdAuthor | Wang, Huiyu | -
dc.contributor.nonIdAuthor | Qiao, Siyuan | -
dc.contributor.nonIdAuthor | Collins, Maxwell | -
dc.contributor.nonIdAuthor | Zhu, Yukun | -
dc.contributor.nonIdAuthor | Adam, Hartwig | -
dc.contributor.nonIdAuthor | Yuille, Alan | -
dc.contributor.nonIdAuthor | Chen, Liang-Chieh | -
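The alternating procedure described in the abstract — assign pixels to clusters by feature affinity, then update the cluster centers and pixel features — can be sketched roughly as below. This is an illustrative NumPy sketch under assumptions, not the authors' implementation: the function name, the softmax-based soft assignment, and the residual pixel update are all hypothetical stand-ins for the paper's learned CMT layer.

```python
import numpy as np

def cmt_clustering_step(pixels, centers, temp=1.0):
    """One illustrative alternating clustering update (hypothetical sketch,
    not the paper's actual CMT layer).

    pixels:  (N, D) array of pixel features
    centers: (K, D) array of cluster-center (object-query) features
    """
    # 1) Assign pixels to clusters by feature affinity:
    #    dot-product affinities followed by a softmax over the K clusters.
    affinity = pixels @ centers.T / temp                       # (N, K)
    assign = np.exp(affinity - affinity.max(axis=1, keepdims=True))
    assign /= assign.sum(axis=1, keepdims=True)                # rows sum to 1

    # 2) Update cluster centers as assignment-weighted averages of pixels.
    weights = assign / (assign.sum(axis=0, keepdims=True) + 1e-6)  # (N, K)
    new_centers = weights.T @ pixels                           # (K, D)

    # 3) Update pixel features toward their assigned centers (residual mix).
    new_pixels = pixels + assign @ new_centers                 # (N, D)
    return new_pixels, new_centers, assign
```

Iterating this step corresponds loosely to stacking CMT layers: the soft assignment plays the role of the cross-attention map, which is why it is denser and more directly tied to the final segmentation than standard query-to-pixel attention.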
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.