ResCount: A Residual Feature Fusion Network for Ship Counting in Remote Sensing Images

Kai Yan,Kai Yang,Jinghao Huang,Yaxiong Chen,Shengwu Xiong
DOI: https://doi.org/10.1109/lgrs.2024.3462444
IF: 5.343
2024-10-04
IEEE Geoscience and Remote Sensing Letters
Abstract:Ship counting is used to count the number of ships in an image. It has a wide range of research backgrounds in areas such as port management and maritime security. In specific areas such as ports, due to their large number of ships, the ships captured by remote sensing images often have problems of uneven distribution and large differences in ship sizes, which will affect the performance of ship counting. To address the above problems, this letter proposes a residual feature fusion network for ship counting (ResCount). The model first uses a feature extraction network to extract the feature map of the image, and then uses a dual-branch structure to further enhance the feature map. One branch uses a visual encoder module to learn the connection between different regions in the image to improve the problem of decreased counting accuracy in scenes with uneven distribution of ships. However, the visual encoder will lose information such as the outline and texture of the ship. Therefore, the other branch uses a regional context feature fusion module (CAF) proposed in this letter to extract local features of different scales and context features of ships to improve the counting accuracy in scenes with large differences in ship size. In addition, this letter proposes a residual feature fusion (RFF) module to enhance the model's attention to sparse areas and finally regress to obtain a density map. In addition, we conducted a large number of experiments to verify the method. Finally, on the remote sensing object counting dataset (RSOC), the mean absolute error (MAE) index reached 60.08 and the root mean squared error (RMSE) index reached 79.62.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?