Abstract: Stereo matching is a core component in many computer vision and robotics systems. Despite significant advances over the last decade, handling matching ambiguities in ill-posed regions and large disparities remains an open challenge. In this paper, we propose a new deep network architecture, called IGEV++, for stereo matching. IGEV++ builds Multi-range Geometry Encoding Volumes (MGEV) that encode coarse-grained geometry information for ill-posed regions and large disparities, and fine-grained geometry information for details and small disparities. To construct MGEV, we introduce an adaptive patch matching module that efficiently computes matching costs for large disparity ranges and/or ill-posed regions. We further propose a selective geometry feature fusion module that adaptively fuses the multi-range and multi-granularity geometry features in MGEV. We then index the fused geometry features and feed them to ConvGRUs, which iteratively update the disparity map. MGEV enables efficient handling of large disparities and ill-posed regions, such as occlusions and textureless regions, and converges rapidly during iterations. IGEV++ achieves the best performance on the Scene Flow test set across all disparity ranges, up to 768px. It also achieves state-of-the-art accuracy on the Middlebury, ETH3D, KITTI 2012, and KITTI 2015 benchmarks. Specifically, IGEV++ achieves a 3.23% 2-pixel outlier rate (Bad 2.0) on Middlebury, a large-disparity benchmark, representing error reductions of 31.9% and 54.8% compared to RAFT-Stereo and GMStereo, respectively. We also present a real-time version of IGEV++ that achieves the best performance among all published real-time methods on the KITTI benchmarks. The code is publicly available at https://github.com/gangweiX/IGEV-plusplus
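
The Bad 2.0 figure and the relative error reductions quoted in the abstract can be made concrete with a small sketch. It assumes the usual reading of a 2-pixel outlier rate (the percentage of pixels with valid ground truth whose absolute disparity error exceeds 2 px) and is not taken from the IGEV++ code base. The function names `bad_rate` and `implied_baseline` are hypothetical, and the toy disparity values are illustrative only.

```python
def bad_rate(pred, gt, tau=2.0):
    """Percentage of pixels whose absolute disparity error exceeds tau pixels.

    pred and gt are flat sequences of disparities. A None entry in gt marks
    a pixel without ground truth and is skipped (an assumed convention).
    """
    errors = [abs(p - g) for p, g in zip(pred, gt) if g is not None]
    if not errors:
        raise ValueError("no valid ground-truth pixels")
    return 100.0 * sum(e > tau for e in errors) / len(errors)


def implied_baseline(ours, reduction_pct):
    """Baseline error implied by our error and a relative reduction in percent.

    Solves reduction = (baseline - ours) / baseline for baseline.
    """
    return ours / (1.0 - reduction_pct / 100.0)


# Toy data (illustrative only): one of three valid pixels is off by more than 2 px.
pred = [10.0, 20.5, 33.0, 47.9]
gt = [10.4, 20.0, 30.0, None]
print(f"Bad 2.0 on toy data: {bad_rate(pred, gt):.2f}%")

# Baseline outlier rates implied by the abstract's figures
# (3.23% with 31.9% and 54.8% reductions): roughly 4.74% and 7.15%.
print(f"implied RAFT-Stereo Bad 2.0: {implied_baseline(3.23, 31.9):.2f}%")
print(f"implied GMStereo Bad 2.0: {implied_baseline(3.23, 54.8):.2f}%")
```

The implied baseline values are derived purely from the percentages stated in the abstract; they are a consistency check, not independently reported results.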

Eglcr: Edge Structure Guidance and Scale Adaptive Attention for Iterative Stereo Matching

Efficient Large Scale Stereo Matching Based on Cross-Scale.

Edge supervision and multi-scale cost volume for stereo matching

EAI-Stereo: Error Aware Iterative Network for Stereo Matching

Edge-preserving Guided Filtering Based Cost Aggregation for Stereo Matching.

A Novel Cell Structure‐based Disparity Estimation for Unsupervised Stereo Matching

Left ventricular mass and diastolic function in normotensive young adults with autosomal dominant polycystic kidney disease.

Parallax attention stereo matching network based on the improved group-wise correlation stereo network

Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation

Superpixel Guided Network for Three-Dimensional Stereo Matching

IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching

Multi-Scale Context Attention Network for Stereo Matching

End-to-End Edge-Guided Multi-Scale Matching Network for Optical Satellite Stereo Image Pairs

Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching

Accurate Real-Time Stereo Correspondence Using Intra- and Inter-Scanline Optimization

MC-Stereo: Multi-peak Lookup and Cascade Search Range for Stereo Matching

Improved real-time three-dimensional stereo matching with local consistency

Stereo Matching Method with Integrated Geometric Encoding for Disparity Refinement

Deep Stereo Matching With Hysteresis Attention and Supervised Cost Volume Construction

Stereo Matching Method for Remote Sensing Images Based on Attention and Scale Fusion

Guided aggregation and disparity refinement for real-time stereo matching