Continuous Urban Change Detection from Satellite Image Time Series with Temporal Feature Refinement and Multi-Task Integration

Sebastian Hafner,Heng Fang,Hossein Azizpour,Yifang Ban
2024-06-25
Abstract:Urbanization advances at unprecedented rates, resulting in negative effects on the environment and human well-being. Remote sensing has the potential to mitigate these effects by supporting sustainable development strategies with accurate information on urban growth. Deep learning-based methods have achieved promising urban change detection results from optical satellite image pairs using convolutional neural networks (ConvNets), transformers, and a multi-task learning setup. However, transformers have not been leveraged for urban change detection with multi-temporal data, i.e., >2 images, and multi-task learning methods lack integration approaches that combine change and segmentation outputs. To fill this research gap, we propose a continuous urban change detection method that identifies changes in each consecutive image pair of a satellite image time series. Specifically, we propose a temporal feature refinement (TFR) module that utilizes self-attention to improve ConvNet-based multi-temporal building representations. Furthermore, we propose a multi-task integration (MTI) module that utilizes Markov networks to find an optimal building map time series based on segmentation and dense change outputs. The proposed method effectively identifies urban changes based on high-resolution satellite image time series acquired by the PlanetScope constellation (F1 score 0.551) and Gaofen-2 (F1 score 0.440). Moreover, our experiments on two challenging datasets demonstrate the effectiveness of the proposed method compared to bi-temporal and multi-temporal urban change detection and segmentation methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper primarily aims to address the following two key issues: 1. **Utilizing self-attention mechanisms to handle temporal features in multi-temporal satellite image sequences**: Most current urban change detection methods are designed based on bi-temporal image pairs, which have limitations when dealing with rapid urbanization processes. The paper proposes a continuous urban change detection method that captures long-term dependencies across all timestamps by applying self-attention mechanisms in the time series of satellite images, thereby improving the multi-temporal building representations extracted by Convolutional Neural Networks (ConvNets). 2. **Integration of multi-task prediction results**: Multi-task learning frameworks typically combine building segmentation and change detection tasks to enhance detection performance. However, existing methods rarely focus on how to effectively integrate segmentation and change prediction results. This paper proposes a novel Multi-Task Integration (MTI) module that uses a Markov network to combine segmentation and change predictions, thereby finding the most probable series of building maps at each time point. The main contributions of the paper include: - Proposing a Transformer-based Temporal Feature Refinement (TFR) module that can effectively model temporal information in satellite image sequences, thereby improving building representations. - Introducing a new multi-task integration method that represents segmentation and change outputs in a Markov network to find the most probable building maps in each image time series. - Experimental results on the SpaceNet 7 and Wuhan Urban Semantic Understanding (WUSU) datasets demonstrate that the proposed continuous urban change detection method is more effective compared to existing methods. In summary, this research aims to fill two gaps in the current urban change detection field: first, utilizing self-attention mechanisms to handle temporal features in multi-temporal satellite image sequences; second, integrating segmentation and change prediction results within a multi-task learning framework.