Abstract:Optical flow estimation is extensively used in autonomous driving and video editing. While existing models demonstrate state-of-the-art performance across various benchmarks, the robustness of these methods has been infrequently investigated. Despite some research focusing on the robustness of optical flow models against adversarial attacks, there has been a lack of studies investigating their robustness to common corruptions. Taking into account the unique temporal characteristics of optical flow, we introduce 7 temporal corruptions specifically designed for benchmarking the robustness of optical flow models, in addition to 17 classical single-image corruptions, in which advanced PSF Blur simulation method is performed. Two robustness benchmarks, KITTI-FC and GoPro-FC, are subsequently established as the first corruption robustness benchmark for optical flow estimation, with Out-Of-Domain (OOD) and In-Domain (ID) settings to facilitate comprehensive studies. Robustness metrics, Corruption Robustness Error (CRE), Corruption Robustness Error ratio (CREr), and Relative Corruption Robustness Error (RCRE) are further introduced to quantify the optical flow estimation robustness. 29 model variants from 15 optical flow methods are evaluated, yielding 10 intriguing observations, such as 1) the absolute robustness of the model is heavily dependent on the estimation performance; 2) the corruptions that diminish local information are more serious than that reduce visual effects. We also give suggestions for the design and application of optical flow models. We anticipate that our benchmark will serve as a foundational resource for advancing research in robust optical flow estimation. The benchmarks and source code will be released at <a class="link-external link-https" href="https://github.com/ZhonghuaYi/optical_flow_robustness_benchmark" rel="external noopener nofollow">this https URL</a>.

A Survey on the Robustness of Computer Vision Models against Common Corruptions

Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions

Enhanced Model Robustness to Input Corruptions by Per-corruption Adaptation of Normalization Statistics

Assessing Visually-Continuous Corruption Robustness of Neural Networks Relative to Human Performance

Benchmarking Robustness of 3D Point Cloud Recognition Against Common Corruptions

Benchmarking the Robustness of Semantic Segmentation Models with Respect to Common Corruptions

Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital Pathology

Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm Corruptions

Deeper Insights into the Robustness of ViTs towards Common Corruptions

Common Corruptions for Enhancing and Evaluating Robustness in Air-to-Air Visual Object Detection

Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving

Benchmarking Neural Network Robustness to Common Corruptions and Perturbations

PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions

R-Bench: Are your Large Multimodal Model Robust to Real-world Corruptions?

Benchmarking Object Detection Robustness against Real-World Corruptions

Benchmarking the Robustness of Optical Flow Estimation to Corruptions

Benchmarking and Analyzing Point Cloud Classification under Corruptions

Exploring the Robustness of Human Parsers Toward Common Corruptions

RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions

MedMNIST-C: Comprehensive benchmark and improved classifier robustness by simulating realistic image corruptions

Assessing and Enhancing Robustness of Deep Learning Models with Corruption Emulation in Digital Pathology