Abstract:Self-supervised stereo matching holds great promise for application and research due to its independence from expensive labeled data. However, direct self-supervised stereo matching paradigms based on photometric loss functions have consistently struggled with performance issues due to the occlusion challenge. The crux of the occlusion challenge lies in the fact that the positions of occluded pixels consistently align with the epipolar search direction defined by the input stereo images, leading to persistent information loss and erroneous feedback at fixed locations during self-supervised training. In this work, we propose a simple yet highly effective pseudo-stereo inputs strategy to address the core occlusion challenge. This strategy decouples the input and feedback images, compelling the network to probabilistically sample information from both sides of the occluding objects. As a result, the persistent lack of information in the aforementioned fixed occlusion areas is mitigated. Building upon this, we further address feedback conflicts and overfitting issues arising from the strategy. By integrating these components, our method achieves stable and significant performance improvements compared to existing methods. Quantitative experiments are conducted to evaluate the performance. Qualitative experiments further demonstrate accurate disparity inference even at occluded regions. These results demonstrate a significant advancement over previous methods in the field of direct self-supervised stereo matching based on photometric loss. The proposed pseudo-stereo inputs strategy, due to its simplicity and effectiveness, has the potential to serve as a new paradigm for direct self-supervised stereo matching. Code is available at <a class="link-external link-https" href="https://github.com/qrzyang/Pseudo-Stereo" rel="external noopener nofollow">this https URL</a>.

PVStereo: Pyramid Voting Module for End-to-End Self-Supervised Stereo Matching

Learning Local Event-based Descriptor for Patch-based Stereo Matching

PQMPV: Parallel and Quantified Dense Stereo Disparity Estimation Based on Multi-Path Viterbi

MC-Stereo: Multi-peak Lookup and Cascade Search Range for Stereo Matching

Playing to Vision Foundation Model's Strengths in Stereo Matching

Stereo Matching by Self-supervision of Multiscopic Vision.

Faster Self-adaptive Deep Stereo.

Co-Teaching: An Ark to Unsupervised Stereo Matching

Multi-scale Cross-form Pyramid Network for Stereo Matching

Self-Supervised Learning for Stereo Matching with Self-Improving Ability

SCV-Stereo: Learning Stereo Matching from a Sparse Cost Volume

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach

Stereo Anything: Unifying Stereo Matching with Large-Scale Mixed Data

Multi-level Pyramid Fusion for Efficient Stereo Matching

Pseudo-Stereo Inputs: A Solution to the Occlusion Challenge in Self-Supervised Stereo Matching

EAI-Stereo: Error Aware Iterative Network for Stereo Matching

Deep Stereo Matching With Hysteresis Attention and Supervised Cost Volume Construction

EdgeStereo: an Effective Multi-Task Learning Network for Stereo Matching and Edge Detection.

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching

PCW-Net: Pyramid Combination and Warping Cost Volume for Stereo Matching