DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Sumin | ko |
dc.contributor.author | Woo, Sangmin | ko |
dc.contributor.author | Park, Yeonju | ko |
dc.contributor.author | Adi Nugroho, Muhammad | ko |
dc.contributor.author | Kim, Changick | ko |
dc.date.accessioned | 2023-04-04T05:00:17Z | - |
dc.date.available | 2023-04-04T05:00:17Z | - |
dc.date.created | 2023-03-31 | - |
dc.date.issued | 2023-01 | - |
dc.identifier.citation | 23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, pp.3297 - 3306 | - |
dc.identifier.uri | http://hdl.handle.net/10203/305982 | - |
dc.description.abstract | In multi-modal action recognition, it is important to consider not only the complementary nature of different modalities but also global action content. In this paper, we propose a novel network, named Modality Mixer (M-Mixer) network, to leverage complementary information across modalities and temporal context of an action for multi-modal action recognition. We also introduce a simple yet effective recurrent unit, called Multi-modal Contextualization Unit (MCU), which is a core component of M-Mixer. Our MCU temporally encodes a sequence of one modality (e.g., RGB) with action content features of other modalities (e.g., depth, IR). This process encourages M-Mixer to exploit global action content and also to supplement complementary information of other modalities. As a result, our proposed method outperforms state-of-the-art methods on NTU RGB+D 60, NTU RGB+D 120, and NW-UCLA datasets. Moreover, we demonstrate the effectiveness of M-Mixer by conducting comprehensive ablation studies. | - |
dc.language | English | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Modality Mixer for Multi-modal Action Recognition | - |
dc.type | Conference | - |
dc.identifier.scopusid | 2-s2.0-85148995785 | - |
dc.type.rims | CONF | - |
dc.citation.beginningpage | 3297 | - |
dc.citation.endingpage | 3306 | - |
dc.citation.publicationname | 23rd IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023 | - |
dc.identifier.conferencecountry | US | - |
dc.identifier.conferencelocation | Waikoloa, HI | - |
dc.identifier.doi | 10.1109/WACV56688.2023.00331 | - |
dc.contributor.localauthor | Kim, Changick | - |
dc.contributor.nonIdAuthor | Lee, Sumin | - |
dc.contributor.nonIdAuthor | Woo, Sangmin | - |
dc.contributor.nonIdAuthor | Park, Yeonju | - |
dc.contributor.nonIdAuthor | Adi Nugroho, Muhammad | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.