Large-scale Remote Sensing Image Target Recognition and Automatic Annotation

Wuzheng Dong
2024-11-12
Abstract:This paper presents a method for object recognition and automatic labeling in large-area remote sensing images called LRSAA. The method integrates YOLOv11 and MobileNetV3-SSD object detection algorithms through ensemble learning to enhance model performance. Furthermore, it employs Poisson disk sampling segmentation techniques and the EIOU metric to optimize the training and inference processes of segmented images, followed by the integration of results. This approach not only reduces the demand for computational resources but also achieves a good balance between accuracy and speed. The source code for this project has been made publicly available on <a class="link-external link-https" href="https://github.com/anaerovane/LRSAA" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problems that this paper attempts to solve mainly focus on the challenges faced by large - scale remote sensing image target recognition and automatic annotation technologies. Specifically, the paper focuses on the following aspects: 1. **Limitations of single - model training and prediction**: Most of the current remote sensing target recognition technologies rely on a single model for training and prediction. Although this strategy simplifies the prediction architecture and reduces the computational cost, it may sacrifice the generalization ability and robustness of the model. 2. **Lack of multi - model integration framework**: Currently, there is a lack of effective frameworks to integrate the outputs of different remote sensing algorithms, which limits the improvement of recognition performance. 3. **Efficiency and accuracy issues in large - scale image processing**: The existing remote sensing target recognition schemes have problems of low recognition accuracy and excessive computational resource requirements when processing large - scale images, which hinders their application in large - scale remote sensing image automatic annotation systems. To address these challenges, the paper proposes an innovative framework - LRSAA (Large - Scale Remote Sensing Automatic Annotation), aiming to improve the performance of large - scale remote sensing image target recognition and automatic annotation. This framework is implemented through the following key technologies: - **Ensemble learning**: Combine two advanced target detection models, MobileNetV3 - SSD and YOLOv11, and use the ensemble learning method to combine the advantages of both to achieve better performance. - **Enhanced non - maximum suppression (NMS)**: Use the enhanced intersection - over - union (EIOU) metric instead of the traditional intersection - over - union (IoU) metric to improve the NMS process and consider more geometric information. - **Poisson - disk sampling**: Use Poisson - disk sampling technology to segment and recognize the data set, ensure the balance and representativeness of the sample distribution, and thus improve the quality of the data. Through these technological improvements, the paper not only improves the applicability and efficiency of the model on large - scale data sets, but also reduces the dependence on hardware resources through optimized algorithm design, thereby promoting more extensive practical applications.