Multidimensional Evaluation Methods for Deep Learning Models in Target Detection for SAR Images

Pengcheng Wang,Huanyu Liu,Xinrui Zhou,Zhijun Xue,Liang Ni,Qi Han,Junbao Li
DOI: https://doi.org/10.3390/rs16061097
IF: 5
2024-03-21
Remote Sensing
Abstract:As artificial intelligence technology advances, the application of object detection technology in the field of SAR (synthetic aperture radar) imagery is becoming increasingly widespread. However, it also faces challenges such as resource limitations in spaceborne environments and significant uncertainty in the intensity of interference in application scenarios. These factors make the performance evaluation of object detection key to ensuring the smooth execution of tasks. In the face of such complex and harsh application scenarios, methods that rely on single-dimensional evaluation to assess models have had their limitations highlighted. Therefore, this paper proposes a multi-dimensional evaluation method for deep learning models used in SAR image object detection. This method evaluates models in a multi-dimensional manner, covering the training, testing, and application stages of the model, and constructs a multi-dimensional evaluation index system. The training stage includes assessing training efficiency and the impact of training samples; the testing stage includes model performance evaluation, application-based evaluation, and task-based evaluation; and the application stage includes model operation evaluation and model deployment evaluation. The evaluations of these three stages constitute the key links in the performance evaluation of deep learning models. Furthermore, this paper proposes a multi-indicator comprehensive evaluation method based on entropy weight correlation scaling, which calculates the weights of each evaluation indicator through test data, thereby providing a balanced and comprehensive evaluation mechanism for model performance. In the experiments, we designed specific interferences for SAR images in the testing stage and tested three models from the YOLO series. Finally, we constructed a multi-dimensional performance profile diagram for deep learning object detection models, providing a new visualization method to comprehensively characterize model performance in complex application scenarios. This can provide more accurate and comprehensive model performance evaluation for remote sensing data processing, thereby guiding model selection and optimization. The evaluation method proposed in this study adopts a multi-dimensional perspective, comprehensively assessing the three core stages of a model's lifecycle: training, testing, and application. This framework demonstrates significant versatility and adaptability, enabling it to transcend the boundaries of remote sensing technology and provide support for a wide range of model evaluation and optimization tasks.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the limitations of the existing performance evaluation methods for SAR (Synthetic Aperture Radar) image target detection models. Specifically, the current evaluation methods have two main problems: 1. **Limitations of single - dimension evaluation**: The existing evaluation methods are often limited to the testing phase, ignoring the performance of the model during training and in practical applications. This leads to an inability to comprehensively capture the performance of the model in actual deployment. Especially when dealing with SAR images, the model faces different challenges and performance requirements during the training, testing, and application stages. For example, a model may perform well in the testing phase, but it requires a large amount of computing resources during training, or it cannot effectively adapt to new data types and complex scenarios when processing actual SAR images. Therefore, single - stage evaluation cannot provide in - depth understanding of the comprehensive performance and applicability of the model when processing SAR image data. 2. **Lack of a comprehensive evaluation mechanism**: The current evaluation methods usually simply list multiple detection results without providing a comprehensive evaluation of the overall performance of the model or in - depth analysis of its applicability. Although these methods can observe the performance of the model from multiple dimensions, they lack a comprehensive performance evaluation system, making it difficult to guide model optimization and practical application decisions. In the field of SAR image applications, whether the model can adapt to different environmental conditions, its robustness in handling high data diversity, and its performance in terms of resource consumption and execution efficiency are key indicators for evaluating whether it meets the actual application requirements. Without a comprehensive evaluation mechanism, it is difficult to comprehensively evaluate the true value and potential application range of the model when processing SAR images. To solve these problems, the paper proposes a multi - dimensional evaluation method for evaluating the performance of deep - learning models in SAR image target detection. This method covers the three core stages in the model's life cycle: **training stage**, **testing stage**, and **application stage**, and constructs a multi - dimensional evaluation index system. In addition, the paper also proposes a multi - index comprehensive evaluation method based on entropy - weight - related scales, aiming to provide a balanced and comprehensive model performance evaluation mechanism. Finally, the paper constructs a multi - dimensional deep - learning target - detection performance profile to visually display the multi - dimensional performance of the target - detection model. ### Specific contributions: - **Proposing a multi - dimensional evaluation index system**: Covering the training stage, testing stage, and application stage. The training stage includes training efficiency and the influence of training samples; the testing stage includes model performance evaluation, evaluation based on model applications, and evaluation based on model tasks; the application stage includes model running evaluation and model deployment evaluation. - **Proposing a multi - index comprehensive evaluation method based on entropy - weight - related scales**: Aiming to comprehensively evaluate multi - dimensional indicators and provide a balanced and comprehensive model performance evaluation mechanism. - **Constructing a multi - dimensional deep - learning target - detection performance profile**: Displaying the performance of the target - detection model in a multi - dimensional visual way. Through these methods, the paper aims to provide more accurate and comprehensive model performance evaluation, thereby guiding model selection and optimization, especially in complex SAR image processing application scenarios.