Domain Adaptative Video Semantic Segmentation Via Motion-Guided Domain Bridge
Wenchao Guan,Ben Wang,Xiaoxiao Ma,Junkang Dai,Chengfeng Kong,Lin Chen,Zhixiang Wei,Yi Jin,Huaian Chen
DOI: https://doi.org/10.1109/tim.2024.3480199
IF: 5.6
2024-01-01
IEEE Transactions on Instrumentation and Measurement
Abstract:Domain bridging aims to eliminate discrepancies across domains by establishing an intermediary domain, enabling gradual domain adaptation. However, in the realm of video segmentation, existing methods often yield inaccurate motion within this intermediary domain, especially when tackling complex temporal sequences involving both object and camera movements. To overcome this challenge, we introduce a Motion-guided Domain Bridge (MDBridge), which constructs an aligned video intermediary domain and strengthens motion awareness. Central to MDBridge is the Motion-Aligned Domain Bridge , which unifies the relative camera movements into a consistent ego-motion by leveraging depth and motion priors. Complementing this, the Motion Perception Module , which utilizes monocular and binocular vision to establish a spatial perception structure, is developed to capture and utilize aligned motion information. Notably, both motion and depth information are derived using a self-supervised depth estimation strategy, without the need for external depth ground truth, allowing us to directly integrate into existing tasks without extra annotation costs. These progressive steps collectively extract and leverage temporal information from complex temporal scenarios, ensuring smooth domain transition during video segmentation training. Extensive experiments validate the superiority of our approach over previous state-of-the-art methods.