Dual Backbone Interaction Network for Burned Area Segmentation in Optical Remote Sensing Images

Wei Fang,Yuxiang Fu,Victor S. Sheng
DOI: https://doi.org/10.1109/lgrs.2024.3369619
IF: 5.343
2024-01-01
IEEE Geoscience and Remote Sensing Letters
Abstract:The existing methods for burned area segmentation (BAS) in optical remote sensing images (ORSIs) mainly adopt convolution neural network (CNN) as the backbone, which has limited receptive filed and suffers from long-term dependencies problem. To address this issue, we propose a novel salient object detection (SOD) network (DBINet) based on dual interactive convolution-transformer backbone for BAS in ORSIs. DBINet combines the benefits of CNN and transformer: CNN is good at extracting local spatial information, while transformer does well in modeling long-term dependencies. The core component of DBINet is three newly designed modules: dual-feature fusion module (DFM), convertor, and a novel decoder. Specifically, DFM is proposed to bridge two different backbones. Convertor is designed to fuse the multiscale coarse features from the main encoder and produce the fined feature for the decoder. The decoder has a multilevel feature aggregating process and a self-refining process, which restores the resolution and generates the prediction results. Experiments on three datasets demonstrate that our DBINet outperforms the state-of-the-art methods and achieves the best S-measure on three datasets: 0.847, 0.888, and 0.883. Code is available at: https://github.com/Voruarn/DBINet.
What problem does this paper attempt to address?