Abstract:Underwater object detection has been shown to exhibit significant potential for exploring underwater environments. However, underwater datasets often suffer from degeneration due to uneven underwater light distribution, complex underwater environment, and crowded underwater dynamic background. Thus, object detection performance would be degraded accordingly. In this paper, a large kernel convolutional object detection network based on self-attention and long-range relationship capture is proposed. Firstly, a hybrid dilated large kernel attention mechanism is proposed, which adopts the idea of hybrid dilated convolution and combines the advantages of large kernel attention mechanism and self-attention. This attention mechanism can avoid self-attention defects while achieving self-attention adaptiveness and long-range relevance. Secondly, a feature enhancement block called residual reconstructed module is proposed, which captures long-range dependencies in the network and extracts more critical contextual information, thus solving the problem of network degradation and accuracy degradation. Thirdly, an adaptive spatial feature fusion object detection head is constructed, which can directly learn how to filter different features at different feature layers spatially; useless information is filtered out, and only useful information is kept for combination to enhance the detection capability of the network further. Finally, network for underwater object detection is proposed based on the above three techniques. Extensive experiments were conducted on the well-known datasets of RUOD, Aquarium, URPC, and MS COCO. Compared to the prior state-of-the-art methods, the experimental findings demonstrate that the proposed approach obtains the highest mAP of 88.7%, 86.5%, 98.9%, and 71.4%, respectively. This represents an improvement of 1.2, 1.5, 8.5, and 0.2 percentage, in that order. The proposed model shows the capacity to function by applying self-attention to local details, as well as the capacity to grasp global long-range relationships, prioritize essential data, and spatially filter irrelevant information.

HAR-Net: Joint Learning of Hybrid Attention for Single-Stage Object Detection

Object detection based on an adaptive attention mechanism

Single-Shot Object Detector Based on Attention Mechanism.

HAM: Hybrid Attention Module in Deep Convolutional Neural Networks for Image Classification

HCNET: A Point Cloud Object Detection Network Based on Height and Channel Attention

Two Cases of Sinusitis Induced by Immune Checkpoint Inhibition.

Attention CoupleNet: Fully Convolutional Attention Coupling Network for Object Detection

Learning Synergistic Attention for Light Field Salient Object Detection

Single-Stage Detector With Dual Feature Alignment for Remote Sensing Object Detection

Mutual-Assistance Learning for Object Detection.

An Adaptive Attention Fusion Mechanism Convolutional Network for Object Detection in Remote Sensing Images

Exploring Reciprocal Attention for Salient Object Detection by Cooperative Learning

MAFNet: Multi-style attention fusion network for salient object detection

Cross-Scale Hybrid Gaussian Attention Network for Object Detection in Remote Sensing Images

YOLO V4 with hybrid dilated convolution attention module for object detection in the aerial dataset

DanHAR: Dual Attention Network for multimodal human activity recognition using wearable sensors

Aggregating Attentional Dilated Features for Salient Object Detection

Self-attention and long-range relationship capture network for underwater object detection

Dual Attention Based Image Pyramid Network for Object Detection.

Joint-attention feature fusion network and dual-adaptive NMS for object detection

AMA-Det: Enhancing Shared Head of One-Stage Object Detection with Adaptation, Merging, and Alignment