Combined multi-level context aggregation and attention mechanism method for photovoltaic panel extraction from high resolution remote sensing images

Qingqing Qi,Jinghao Zhao,Lu Lin,Xiaoqing Zhang,Yajun Tian
DOI: https://doi.org/10.1080/01431161.2024.2347527
IF: 3.531
2024-05-17
International Journal of Remote Sensing
Abstract:In the context of global carbon emission reduction, solar photovoltaic (PV) technology is experiencing rapid development. Using high-resolution remote sensing images to accurately obtain PV information over a large region, including location and size, has the advantages of high statistical efficiency and timely data update for the PV energy management. Due to the intra-class diversity of PV panels and the intricate variability in their deployment environments, existing semantic segmentation methods often have problems such as under-segmentation and mis-segmentation. To alleviate these problems, this paper proposes an improved DeepLabv3+ semantic segmentation network to more accurately extract PV panels from high-resolution remote sensing images. With the aim of alleviating under-segmentation, a multi-level context aggregation module is developed. This module can enhance the model's ability to learn the characteristics of PV panels and their surrounding environment by aggregating rich contextual information from multi-scale and semantic levels. To alleviate the problem of mis-segmentation, a hybrid attention module is introduced. This module sequentially and adaptively adjusts the weight distribution in both the channel and spatial dimensions, thus enabling the model to focus more on the feature information and spatial positions of PV objects. Experiments conducted on a self-constructed Beijing PV segmentation dataset show that the method in this paper has advantages of completeness and accuracy in extracting PV panels compared to the baseline model and current mainstream semantic segmentation network. In addition, the results of experiments on extracting PV panels in real region show that our model also has good stability and generalization capability.
imaging science & photographic technology,remote sensing
What problem does this paper attempt to address?