A Multimodal-Based Feature Generalization Model for Binocular Stereo Matching

Wenfeng Qiu,Chihao Ma,Jianhua Li,Ren Qian,Yun Zhao,Yujun Fan,Yong Zhao
DOI: https://doi.org/10.1145/3695080.3695182
2024-01-01
Abstract:Addressing the generalization issue in stereo matching, we propose the CSD4Stereo network, featuring style-content information decoupling. Through the design of a style normalization module, instance-level style information is stripped away, while a content restoration module adaptively preserves and recovers content details. The bidirectional compensation loss significantly enhances the decoupling effect. Utilizing generalized cosine similarity in a decoupled space to construct the cost volume further boosts the generalization and discriminative power of stereo matching. Cross-domain evaluations on the Middlebury and KITTI datasets demonstrate respective performance improvements of 23% and 17.3%. Similarly, notable generalization capabilities are achieved on the complex scene autonomous driving dataset DrivingStereo.
What problem does this paper attempt to address?