Pixel-Level Matching for Video Object Segmentation using Convolutional Neural Networks

Cited 131 time in webofscience Cited 0 time in scopus
  • Hit : 267
  • Download : 0
We propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spatial details and the category-level semantic information. Furthermore, we propose a feature compression technique that drastically reduces the memory requirements while maintaining the capability of feature representation. Two-stage training (pretraining and fine-tuning) allows our network to handle any target object regardless of its category (even if the object's type does not belong to the pre-training data) or of variations in its appearance through a video sequence. Experiments on large datasets demonstrate the effectiveness of our model -against related methods - in terms of accuracy, speed, and stability. Finally, we introduce the transferability of our network to different domains, such as the infrared data domain.
Publisher
IEEE Computer Society and the Computer Vision Foundation (CVF)
Issue Date
2017-10
Language
English
Citation

16th IEEE International Conference on Computer Vision (ICCV), pp.2186 - 2195

ISSN
1550-5499
DOI
10.1109/ICCV.2017.238
URI
http://hdl.handle.net/10203/227591
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 131 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0