Abstract:The underwater imaging environment is complex, and the application of conventional target detection algorithms to the underwater environment has yet to provide satisfactory results. Therefore, underwater optical image target detection remains one of the most challenging tasks involved with neighborhood-based techniques in the field of computer vision. Small underwater targets, dispersion, and sources of distortion (such as sediment and particles) often render neighborhood-based techniques insufficient, as existing target detection algorithms primarily focus on improving detection accuracy and enhancing algorithm complexity and computing power. However, excessive extraction of deep-level features leads to the loss of small targets and decrease in detection accuracy. Moreover, most underwater optical image target detection is performed by underwater unmanned platforms, which have a high demand of algorithm lightweight requirements due to the limited computing power of the underwater unmanned platform with the mobile vision processing platform. In order to meet the lightweight requirements of the underwater unmanned platform without affecting the detection accuracy of the target, we propose an underwater target detection model based on mobile vision transformer (MobileViT) and YOLOX, and we design a new coordinate attention (CA) mechanism named a double CA (DCA) mechanism. This model utilizes MobileViT as the algorithm backbone network, improving the global feature extraction ability of the algorithm and reducing the amount of algorithm parameters. The double CA (DCA) mechanism can improve the extraction of shallow features as well as the detection accuracy, even for difficult targets, using a minimum of parameters. Research validated in the Underwater Robot Professional Contest 2020 (URPC2020) dataset revealed that this method has an average accuracy rate of 72.00%. In addition, YOLOX's ability to compress the model parameters by 49.6% efficiently achieves a balance between underwater optical image detection accuracy and parameter quantity. Compared with the existing algorithm, the proposed algorithm can carry on the underwater unmanned platform better.

Underwater Small Target Detection Based on YOLOX Combined with MobileViT and Double Coordinate Attention

YoloXT: A Object Detection Algorithm for Marine Benthos

Research on Underwater Small Target Detection Algorithm Based on Improved YOLOv3

Underwater Object Detection Based on Enhanced YOLO

MDM-YOLO: Research on Object Detection Algorithm Based on Improved YOLOv4 for Marine Organisms.

Underwater target detection algorithm based on improved YOLOv4 with SemiDSConv and FIoU loss function

Underwater small and occlusion object detection with feature fusion and global context decoupling head-based YOLO

Underwater Object Detection Using TC-YOLO with Attention Mechanisms

YOLOv5s-CA: A Modified YOLOv5s Network with Coordinate Attention for Underwater Target Detection

UTD-Yolov5: A Real-time Underwater Targets Detection Method based on Attention Improved YOLOv5

Underwater Target Detection Algorithm Based on Improved YOLOv5

Underwater Robot Target Detection Algorithm Based on YOLOv8

Underwater small target detection based on dynamic convolution and attention mechanism

Research on Underwater Small Target Detection Technology Based on Single-Stage USSTD-YOLOv8n

Attention-Based Lightweight YOLOv8 Underwater Target Recognition Algorithm

Underwater object detection algorithm based on attention mechanism and cross-stage partial fast spatial pyramidal pooling

Underwater Target Detection Algorithm Based on Feature Fusion Enhancement

Research on multi-scale fusion image enhancement and improved YOLOv5s lightweight ROV underwater target detection method

Underwater Object Detection Algorithm Based on an Improved YOLOv8

Underwater Target Detection Based On Modified YOLOv5

Novel Dynamic Feature Fusion Stragegy for Detection of Small Underwater Marine Object