Improvement of ship target detection algorithm for YOLOv7‐tiny
Huixia Zhang,Haishen Yu,Yadong Tao,Wenliang Zhu,Kaige Zhang
DOI: https://doi.org/10.1049/ipr2.13054
IF: 2.3
2024-03-19
IET Image Processing
Abstract:This paper presents an enhanced ship detection algorithm for YOLOv7‐tiny, addressing occlusion challenges in multi‐target ship detection. The modifications include integrating Convolutional Block Attention Module for position emphasis, GSConv in the Neck section, Content‐Aware ReAssembly of Features for up‐sampling, and SIOU Loss for better accuracy. The improved model achieves a precision of 91.2%, mAP@0.5 of 94.5%, and an F1‐score of 90.7%, surpassing the original model by 3.7%, 5.5%, and 4.2%, respectively, significantly boosting detection accuracy. In addressing the challenge of ships being prone to occlusion in multi‐target situations during ship target detection, leading to missed and false detections, this paper proposes an enhanced ship detection algorithm for YOLOv7‐tiny. The proposed method incorporates several key modifications. Firstly, it introduces the Convolutional Block Attention Module in the Backbone section of the original model, emphasizing position information while attending to channel features to enhance the network's ability to extract crucial information. Secondly, it replaces standard convolution with GSConv convolution in the Neck section, preserving detailed information and reducing computational load. Subsequently, the lightweight operator Content‐Aware ReAssembly of Features is employed to replace the original nearest‐neighbour interpolation, mitigating the loss of feature information during the up‐sampling process. Finally, the localization loss function, SIOU Loss, is utilized to calculate loss, expedite training convergence, and enhance detection accuracy. The research results indicate that the precision of the improved model is 91.2%, mAP@0.5 is 94.5%, and the F1‐score is 90.7%. These values are 3.7%, 5.5%, and 4.2% higher than those of the original YOLOv7‐tiny model, respectively. The improved model effectively enhances detection accuracy. Additionally, the improved model achieves an FPS of 145.4, meeting real‐time requirements.
computer science, artificial intelligence,engineering, electrical & electronic,imaging science & photographic technology