Skip to main navigation Skip to search Skip to main content

MRDFlow: Unsupervised Optical Flow Estimation Network with Multi-Scale Recurrent Decoder

  • Rui Zhao
  • , Ruiqin Xiong*
  • , Ziluo Ding
  • , Xiaopeng Fan
  • , Jian Zhang
  • , Tiejun Huang
  • *Corresponding author for this work
  • Peking University

Research output: Contribution to journalArticlepeer-review

Abstract

Optical flow estimation is a fundamental task in computer vision and image processing. Due to the difficulty in obtaining the ground truth of flow field, unsupervised learning approaches attract more and more research interests in recent years. However, despite of their good generalization capability, unsupervised optical flow methods suffer in the scenarios with large displacement, small objects, and occlusions. In this work, we propose a novel optical flow network based on decoder with multi-scale kernels. Different from previous U-Net like or pyramidal methods, we design our network based on RAFT architecture that with a 4D correlation layer and recurrent decoder. More importantly, we incorporate three novel ideas with regard to the input, information processing and output of the update units improve the performance. Firstly, we utilize various motion-related information as input to the update units. Secondly, we propose a module of multi-scale update unit. Thirdly, for the final flow up-sampling procedure, we propose an image-guided up-sampling loss to guide the learning of up-sampling masks. Our model is trained by the occlusion-aware photometric loss, edge-aware smoothness loss, self-supervised loss, and image-guided up-sampling loss. Experimental results demonstrate that our model achieves the state-of-the-art performance on both Sintel and KITTI and outperforms other unsupervised optical flow methods remarkably.

Original languageEnglish
Pages (from-to)4639-4652
Number of pages14
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume32
Issue number7
DOIs
StatePublished - 1 Jul 2022

Keywords

  • Optical flow
  • multi-scale decoder
  • recurrent decoder
  • unsupervised learning

Fingerprint

Dive into the research topics of 'MRDFlow: Unsupervised Optical Flow Estimation Network with Multi-Scale Recurrent Decoder'. Together they form a unique fingerprint.

Cite this