Selective Query-Guided Debiasing for Video Corpus Moment Retrieval

Cited 4 time in webofscience Cited 0 time in scopus
  • Hit : 80
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYoon, Sunjaeko
dc.contributor.authorHong, Ji Wooko
dc.contributor.authorYoon, Eunseopko
dc.contributor.authorKim, Dahyunko
dc.contributor.authorKim, Junyeongko
dc.contributor.authorYoon, Hee Sukko
dc.contributor.authorYoo, Chang-Dongko
dc.date.accessioned2022-11-15T09:00:26Z-
dc.date.available2022-11-15T09:00:26Z-
dc.date.created2022-11-15-
dc.date.created2022-11-15-
dc.date.issued2022-10-
dc.identifier.citationEuropean Conference on Computer Vision, ECCV 2022, pp.185 - 200-
dc.identifier.issn0302-9743-
dc.identifier.urihttp://hdl.handle.net/10203/299665-
dc.description.abstractVideo moment retrieval (VMR) aims to localize target moments in untrimmed videos pertinent to a given textual query. Existing retrieval systems tend to rely on retrieval bias as a shortcut and thus, fail to sufficiently learn multi-modal interactions between query and video. This retrieval bias stems from learning frequent co-occurrence patterns between query and moments, which spuriously correlate objects (e.g., a pencil) referred in the query with moments (e.g., scene of writing with a pencil) where the objects frequently appear in the video, such that they converge into biased moment predictions. Although recent debiasing methods have focused on removing this retrieval bias, we argue that these biased predictions sometimes should be preserved because there are many queries where biased predictions are rather helpful. To conjugate this retrieval bias, we propose a Selective Query-guided Debiasing network (SQuiDNet), which incorporates the following two main properties: (1) Biased Moment Retrieval that intentionally uncovers the biased moments inherent in objects of the query and (2) Selective Query-guided Debiasing that performs selective debiasing guided by the meaning of the query. Our experimental results on three moment retrieval benchmarks (i.e., TVR, ActivityNet, DiDeMo) show the effectiveness of SQuiDNet and qualitative analysis shows improved interpretability.-
dc.languageEnglish-
dc.publisherSpringer Nature Switzerland-
dc.titleSelective Query-Guided Debiasing for Video Corpus Moment Retrieval-
dc.typeConference-
dc.identifier.wosid000903751800011-
dc.identifier.scopusid2-s2.0-85142672536-
dc.type.rimsCONF-
dc.citation.beginningpage185-
dc.citation.endingpage200-
dc.citation.publicationnameEuropean Conference on Computer Vision, ECCV 2022-
dc.identifier.conferencecountryIS-
dc.identifier.conferencelocationTel Aviv-
dc.identifier.doi10.1007/978-3-031-20059-5_11-
dc.contributor.localauthorYoo, Chang-Dong-
dc.contributor.nonIdAuthorYoon, Sunjae-
dc.contributor.nonIdAuthorHong, Ji Woo-
dc.contributor.nonIdAuthorYoon, Eunseop-
dc.contributor.nonIdAuthorKim, Dahyun-
dc.contributor.nonIdAuthorKim, Junyeong-
dc.contributor.nonIdAuthorYoon, Hee Suk-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0