Cascaded MPN: Cascaded Moment Proposal Network for Video Corpus Moment Retrieval

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 150
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYoon, Sunjaeko
dc.contributor.authorKim, Dahyunko
dc.contributor.authorKim, Junyeongko
dc.contributor.authorYoo, Chang-Dongko
dc.date.accessioned2022-06-26T01:02:07Z-
dc.date.available2022-06-26T01:02:07Z-
dc.date.created2022-06-25-
dc.date.created2022-06-25-
dc.date.created2022-06-25-
dc.date.created2022-06-25-
dc.date.issued2022-
dc.identifier.citationIEEE ACCESS, v.10, pp.64560 - 64568-
dc.identifier.issn2169-3536-
dc.identifier.urihttp://hdl.handle.net/10203/297081-
dc.description.abstractVideo corpus moment retrieval aims to localize temporal moments corresponding to textual query in a large video corpus. Previous moment retrieval systems are largely grouped into two categories: (1) anchor-based method which presets a set of video segment proposals (via sliding window) and predicts proposal that best matches with the query, and (2) anchor-free method which directly predicts frame-level start-end time of the moment related to the query (via regression). Both methods have their own inherent weaknesses: (1) anchor-based method is vulnerable to heuristic rules of generating video proposals, which causes restrictive moment prediction in variant length; and (2) anchor-free method, as is based on frame-level interplay, suffers from insufficient understanding of contextual semantics from long and sequential video. To overcome the aforementioned challenges, our proposed Cascaded Moment Proposal Network incorporates the following two main properties: (1) Hierarchical Semantic Reasoning which provides video understanding from anchor-free level to anchor-based level via building hierarchical video graph, and (2) Cascaded Moment Proposal Generation which precisely performs moment retrieval via devising cascaded multi-modal feature interaction among anchor-free and anchor-based video semantics. Extensive experiments show state-of-the-art performance on three moment retrieval benchmarks (TVR, ActivityNet, DiDeMo), while qualitative analysis shows improved interpretability. The code will be made publicly available.-
dc.languageEnglish-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleCascaded MPN: Cascaded Moment Proposal Network for Video Corpus Moment Retrieval-
dc.typeArticle-
dc.identifier.wosid000814549500001-
dc.identifier.scopusid2-s2.0-85132713223-
dc.type.rimsART-
dc.citation.volume10-
dc.citation.beginningpage64560-
dc.citation.endingpage64568-
dc.citation.publicationnameIEEE ACCESS-
dc.identifier.doi10.1109/access.2022.3183106-
dc.contributor.localauthorYoo, Chang-Dong-
dc.contributor.nonIdAuthorKim, Dahyun-
dc.contributor.nonIdAuthorKim, Junyeong-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorProposals-
dc.subject.keywordAuthorSemantics-
dc.subject.keywordAuthorStreaming media-
dc.subject.keywordAuthorCognition-
dc.subject.keywordAuthorBipartite graph-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorVideo corpus moment retrieval-
dc.subject.keywordAuthorcascaded moment proposal-
dc.subject.keywordAuthormulti-modal interaction-
dc.subject.keywordAuthorvision-language system-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0