Scene Recognition Based on an Improved DenseNet with Attention Mechanism

Angran Li,Mingzhu Sun
DOI: https://doi.org/10.1109/mita60795.2024.10751728
2024-01-01
Abstract:High-resolution remote sensing image classification is of great significance to scene recognition and plays a pivotal role in surface cover classification, environmental monitoring, geological exploration and resource management. Convolutional neural network (CNN) has a powerful feature learning capability of automatic extraction of features and can categorize different scenes with high accuracy in scene recognition. In this study, we train on the Aerial Image Dataset (AID) and reprocess it with data augmentation. We enhance feature focus within densely connected blocks by incorporating Efficient Channel Attention (ECA), thereby proposing the ECA-DenseNet architecture. Moreover, we employ transfer learning and pre-train the network to accelerate the training process. The accuracy, precision, recall and f1 score of the final model reached ${9 7 . 1 \%, 9 7 . 0 \%, 9 6 . 8 \%}$ and ${9 6 . 9 \%}$, respectively. The experimental results show that ECA-DenseNet fully utilizes the different features extracted from each convolutional kernel and captures the key information in the scene images more effectively, which effectively improves the classification accuracy of scene recognition.
What problem does this paper attempt to address?