YOLOv9-YX: lightweight algorithm for underwater target detection

Qiang Cen,QiGuang Zhu,YuXin Wang,WeiDong Chen,Shuo Liu
DOI: https://doi.org/10.1007/s00371-024-03644-4
IF: 2.835
2024-10-11
The Visual Computer
Abstract:The issue of target misdetection and omission resulting from the limited memory and computational capacity of underwater devices and the inferior quality of underwater images necessitates the maintenance of detection accuracy while minimizing computational overhead. This paper proposes an enhanced network, YOLOv9-YX, based on YOLOv9, as a solution to this problem. Firstly, to address the issue of poor underwater image quality, an adaptive histogram equalization algorithm based on gamma-adjusted contrast constrained is proposed to enhance the image quality. Secondly, the C3_ECA module is employed to replace the RepNCSPELAN4 as the backbone feature extraction network, which significantly reduces the computational cost of the model. Subsequently, the CDown module is employed to replace the ADown and Conv modules as the downsampling module, thereby further reducing the computational cost. Finally, the FSSPF module is utilized to replace the SPPELAN as the multi-scale fusion module, which is capable of more fully fusing the features of different layers and improving the feature expression of the model on underwater images. The experimental results demonstrate that on the DUO underwater target detection dataset, YOLOv9-YX FPS achieves 293.1 frames per second, representing a 153.8 frames per second improvement over the original model. Additionally, the number of model parameters is reduced by 65.6%, the computational cost is decreased by 65.3%, and the detection accuracy of YOLOv9-YX is only 1.4% lower than the original network. It can be demonstrated that the proposed algorithm achieves an optimal balance between accuracy and speed.
computer science, software engineering
What problem does this paper attempt to address?