Revisiting the Receptive Field of Conv-GRU in DROID-SLAM

Cited 1 time in webofscience Cited 0 time in scopus
  • Hit : 89
  • Download : 0
This work focuses on improving the Conv-GRU-based optical flow update within a DROID-SLAM framework. Prior optical flow models typically follow a UNet or coarse-to-fine architecture in order to extract long-range cross-correlation and context cues. This helps flow estimation in the presence of large motion and challenging image regions, e.g., textureless regions. We propose modifications to the Conv-GRU module which follows the rationale of these prior models by integrating (Atrous) Spatial Pyramid Pooling and global self-attention into the Conv-GRU block. By enlarging the receptive field through the aforementioned modifications, the model is able to integrate information from a larger context window, thus improving the robustness even when given inputs that comprise challenging image regions. We show empirically through extensive experiments the gain in accuracy through these modifications.
Publisher
IEEE Computer Society
Issue Date
2022-06
Language
English
Citation

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, CVPRW 2022, pp.1905 - 1915

ISSN
2160-7508
DOI
10.1109/CVPRW56347.2022.00207
URI
http://hdl.handle.net/10203/312765
Appears in Collection
ME-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 1 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0