Mitigating the modality bias in multispectral pedestrian detection using counterfactual strategies반사실적 기법들을 이용한 다중 스펙트럼 보행자 검출의 모달리티 편향 완화

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 3
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor노용만-
dc.contributor.authorShin, Sebin-
dc.contributor.author신세빈-
dc.date.accessioned2024-07-30T19:31:23Z-
dc.date.available2024-07-30T19:31:23Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1096790&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/321572-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[viii, 82 p. :]-
dc.description.abstractMultispectral pedestrian detection using RGB and thermal sensors (RGBT) has emerged as a promising solution for safety-critical vision applications that require non-stop operations all day/night. However, there are unsolved issues in multispectral pedestrian detection, including the modality bias problem. The imbalanced modality distribution in RGBT datasets provoke modality bias, where models tend to rely on one modality (thermal) over the other (RGB). Therefore, it is necessary to address the modality bias problem in order to learn multimodal relationships robustly in real-world environments. We deal with modality bias problems for multimodal representation through counterfactual approaches that can compensate for modality imbalance in datasets. First, we propose a novel model framework: Causal Mode Multiplexer (CMM) based on counterfactual intervention and guide the model to learn the causality between multimodal inputs and outputs. Different from the symmetrical fusion topology of existing methods, the proposed approach leverages two distinct causal graphs that are tailored to the multimodal data type. Second, we introduce a novel data augmentation framework: Prototypical Cross-modal Balancing (PCB) based on counterfactual image generation. Unlike existing augmentation methods, PCB generates multimodal data considering the modality balance of multimodal data. Each of the proposed methods from the model and data perspective are validated under extensive experiments including comparisons to the state-of-the-art methods, ablation studies, and further qualitative/quantitative results.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject멀티모달▼a모달리티 편향 문제▼a반사실적 개입▼a프로토타입 균형▼a다중 스펙트럼 보행자 검출▼a인과관계-
dc.subjectMultimodal▼aModality bias problems▼aCounterfactual intervention▼aPrototypical balancing▼aMultispectral pedestrian detection▼aCausality-
dc.titleMitigating the modality bias in multispectral pedestrian detection using counterfactual strategies-
dc.title.alternative반사실적 기법들을 이용한 다중 스펙트럼 보행자 검출의 모달리티 편향 완화-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthorRo, Yong Man-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0