SSTNet: A Network Based on Modeling of Semantic-spatio-temporal Information for Semantic Change Detection of Remote Sensing Images

Qingyu Xia,Yiming Yan,Wei Hou,Baoyu Ge,Nan Su,Shou Feng,Chunhui Zhao
DOI: https://doi.org/10.1109/jstars.2024.3464117
2024-01-01
Abstract:Semantic change detection (SCD) aiming to localize changes and identify their types is a complex pixel-level classification task. In SCD, detecting changes of narrow and long objects (NLOs) common in various scenarios is more challenging than detecting these of typical small objects, primarily because NLOs' narrowness makes them carry local details, while their length gives them a global view. The fusion of semantic with spatial information and the interaction of semantic with spatio-temporal information are crucial to accurately detecting changed NLOs. Insufficient utilization of these pieces of information carried by multi-layer semantic feature and change feature would limit the representation of the characteristics of changed NLOs, thereby leading to missed detection and false detection. We propose a new model that integrates semantic, spatial, and temporal information, named SSTNet, for semantic change detection. This model utilizes a designed multi-layer feature fusion (MFF) module to fuse features from different layers in the encoder, leveraging global semantic information and local spatial information. Additionally, the semantic change feature interaction (SCFI) module facilitates the interaction between semantic feature and change feature, enhancing the correlation between semantic information and spatial information. A series of experiments conducted on two public datasets demonstrate that SSTNet improves the detection accuracy of changed NLOs and outperforms state-of-the-art methods.
What problem does this paper attempt to address?