DifFlow3D: Toward Robust Uncertainty-Aware Scene Flow Estimation with Iterative Diffusion-Based Refinement
Jiuming Liu,Guangming Wang,Weicai Ye,Chaokang Jiang,Jinru Han,Zhe Liu,Guofeng Zhang,Dalong Du,Hesheng Wang
DOI: https://doi.org/10.1109/cvpr52733.2024.01431
2024-01-01
Computer Vision and Pattern Recognition
Abstract:Scene flow estimation, which aims to predict per-point 3D displacements ofdynamic scenes, is a fundamental task in the computer vision field. However,previous works commonly suffer from unreliable correlation caused by locallyconstrained searching ranges, and struggle with accumulated inaccuracy arisingfrom the coarse-to-fine structure. To alleviate these problems, we propose anovel uncertainty-aware scene flow estimation network (DifFlow3D) with thediffusion probabilistic model. Iterative diffusion-based refinement is designedto enhance the correlation robustness and resilience to challenging cases, e.g.dynamics, noisy inputs, repetitive patterns, etc. To restrain the generationdiversity, three key flow-related features are leveraged as conditions in ourdiffusion model. Furthermore, we also develop an uncertainty estimation modulewithin diffusion to evaluate the reliability of estimated scene flow. OurDifFlow3D achieves state-of-the-art performance, with 24.0reduction respectively on FlyingThings3D and KITTI 2015 datasets. Notably, ourmethod achieves an unprecedented millimeter-level accuracy (0.0078m in EPE3D)on the KITTI dataset. Additionally, our diffusion-based refinement paradigm canbe readily integrated as a plug-and-play module into existing scene flownetworks, significantly increasing their estimation accuracy. Codes arereleased at https://github.com/IRMVLab/DifFlow3D.