YOLO‐UOD: An underwater small object detector via improved efficient layer aggregation network

Weiwen Chen, Tingting Zhuang, Yuanfang Zhang, Teng Mei, Xiaoyu Tang
DOI: https://doi.org/10.1049/ipr2.13112
IF: 2.3
2024-04-26
IET Image Processing
Abstract: Here, we propose a novel convolutional omni‐efficient layer aggregation network structure and an optimized loss function based on Gaussian modelling, and apply underwater image enhancement techniques to our detector, achieving better performance than existing detectors. Accurate detection of underwater objects is a key enabling technology for marine development and application, and is of great importance to fields including marine military defense and seafood aquaculture. Detecting underwater targets efficiently and rapidly remains a crucial technological challenge in this field. To address this challenge, this study applies the convolutional omni‐efficient layer aggregation network (CO‐ELAN) module to the detector backbone to improve the network's ability to extract underwater objects from image information. The module improves the feature representation of the gradient branches through multi‐dimensional dynamic convolution and an attention mechanism. For loss calculation, an optimized normalized Wasserstein distance approach models the predicted boxes as probability distributions, which yields comparable distances to the ground‐truth boxes and better samples for small‐target labels. An underwater image enhancement algorithm based on white balance and underwater blur fusion is used to obtain clear images that improve detector performance. Verification experiments on the URPC2018 dataset show that the detector has better detection ability than other detectors in complex underwater environments. The proposed method achieves a 2.4% improvement over the YOLOv7 baseline model while reducing computation costs by 5%.
computer science, artificial intelligence; engineering, electrical & electronic; imaging science & photographic technology
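
The abstract describes a loss built on Gaussian modelling of boxes and a normalized Wasserstein distance. As a rough illustration of that idea, the sketch below computes the commonly used closed-form normalized Wasserstein distance between two axis-aligned boxes, each treated as a 2D Gaussian. It is not the paper's optimized variant, whose details the abstract does not give. The function name `nwd`, the `(cx, cy, w, h)` box layout, and the normalizing constant `c` (with its default value) are assumptions made for illustration only.

```python
import math


def nwd(box_a, box_b, c=12.8):
    """Normalized Wasserstein distance between two boxes given as (cx, cy, w, h).

    Illustrative sketch: `c` is a hypothetical, dataset-dependent constant,
    and this is the plain closed form, not the paper's optimized version.
    """
    cxa, cya, wa, ha = box_a
    cxb, cyb, wb, hb = box_b
    # Each box is modelled as a 2D Gaussian with mean (cx, cy) and
    # covariance diag(w^2 / 4, h^2 / 4). For two such Gaussians the squared
    # 2-Wasserstein distance reduces to a squared Euclidean distance
    # between the vectors (cx, cy, w / 2, h / 2).
    w2_squared = (
        (cxa - cxb) ** 2
        + (cya - cyb) ** 2
        + ((wa - wb) / 2) ** 2
        + ((ha - hb) / 2) ** 2
    )
    # Exponential normalization maps the distance into (0, 1],
    # with 1 meaning the two boxes are identical.
    return math.exp(-math.sqrt(w2_squared) / c)


if __name__ == "__main__":
    predicted = (10.0, 10.0, 4.0, 4.0)
    ground_truth = (13.0, 14.0, 4.0, 4.0)
    similarity = nwd(predicted, ground_truth)
    # One simple way to turn the similarity into a loss term (an assumption).
    print("similarity:", similarity, "loss:", 1.0 - similarity)
```

Unlike IoU, this similarity stays non-zero and varies smoothly even when two small boxes do not overlap at all, which is the usual motivation for using it with small targets.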