Lightweight cross-guided contextual perceptive network for visible–infrared urban road scene parsing

Jinfu Liu,Wujie Zhou,Meixin Fang,Shanshan Mao,Rongwang Yang
DOI: https://doi.org/10.1016/j.infrared.2024.105167
IF: 2.997
2024-01-21
Infrared Physics & Technology
Abstract:Visible–infrared urban road scene parsing is attracting increasing attention because it can extract complementary cues from the visible and infrared imaging modalities. However, most existing parsing methods adopt complicated models, which incur large computational costs and limit real-time performance. Moreover, parsing methods may inadequately explore and apply high-level semantic information, considerably undermining the parsing accuracy. To solve these problems, we introduce a lightweight high-performance network called cross-guided contextual perceptive network (CCPNet). A lightweight backbone equipped with adaptive refined fusion modules reduces the size of CCPNet. Additionally, a cross-guided contextual perceptive module extracts and enhances semantic cues from high-level features. Experimental results indicate that CCPNet achieves state-of-the-art performance for visible–infrared scene parsing with few parameters (7.34 million), a small model (29.9 MB), and real-time inference (50.03 fps). The CCPNet code and results are available at: https://github.com/Jinfu0913/CCPNet .
optics,physics, applied,instruments & instrumentation
What problem does this paper attempt to address?