Pixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks

Cited 131 time in webofscience Cited 0 time in scopus
  • Hit : 268
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorYoon, Jae Shinko
dc.contributor.authorRameau, Francoisko
dc.contributor.authorKim, Junsikko
dc.contributor.authorLee, Seokjuko
dc.contributor.authorShin, Seunghakko
dc.contributor.authorKweon, In-Soko
dc.date.accessioned2017-12-05T02:20:31Z-
dc.date.available2017-12-05T02:20:31Z-
dc.date.created2017-11-29-
dc.date.created2017-11-29-
dc.date.created2017-11-29-
dc.date.issued2017-10-
dc.identifier.citation16th IEEE International Conference on Computer Vision (ICCV), pp.2186 - 2195-
dc.identifier.issn1550-5499-
dc.identifier.urihttp://hdl.handle.net/10203/227591-
dc.description.abstractWe propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spatial details and the category-level semantic information. Furthermore, we propose a feature compression technique that drastically reduces the memory requirements while maintaining the capability of feature representation. Two-stage training (pretraining and fine-tuning) allows our network to handle any target object regardless of its category (even if the object's type does not belong to the pre-training data) or of variations in its appearance through a video sequence. Experiments on large datasets demonstrate the effectiveness of our model -against related methods - in terms of accuracy, speed, and stability. Finally, we introduce the transferability of our network to different domains, such as the infrared data domain.-
dc.languageEnglish-
dc.publisherIEEE Computer Society and the Computer Vision Foundation (CVF)-
dc.titlePixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks-
dc.typeConference-
dc.identifier.wosid000425498402026-
dc.identifier.scopusid2-s2.0-85041906830-
dc.type.rimsCONF-
dc.citation.beginningpage2186-
dc.citation.endingpage2195-
dc.citation.publicationname16th IEEE International Conference on Computer Vision (ICCV)-
dc.identifier.conferencecountryIT-
dc.identifier.conferencelocationVenice Convention Center, Venice-
dc.identifier.doi10.1109/ICCV.2017.238-
dc.contributor.localauthorKweon, In-So-
dc.contributor.nonIdAuthorYoon, Jae Shin-
dc.contributor.nonIdAuthorRameau, Francois-
dc.contributor.nonIdAuthorKim, Junsik-
dc.contributor.nonIdAuthorLee, Seokju-
dc.contributor.nonIdAuthorShin, Seunghak-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 131 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0