Abstract:In recent years, the marine economy has developed rapidly, and human demand for marine resources has increased greatly. At present, target detection technology has a wide range of applications and prospects in seabed observation and ocean engineering. However, the accuracy and robustness of existing target detection methods are low due to the complex underwater environment, poor lighting, and poor quality of undersea images and videos. To solve these problems, this paper proposes YoloXT, a new quantitative identification method for marine benthos. YoloXT introduces the DECA (Deformable Coordinate Attention) module, which expands the spatial awareness in feature extraction and can learn image features more effectively. Meanwhile FPST-PAN (Feature Pyramid S2win Transformer, Improved Path Aggregation Network) was proposed to deal with the problem of marine benthic target diversity. It further integrates deep and shallow features through multi-scale skip-connection and Transformer and improves the model's ability to deal with complex and changeable marine environments. Finally, the positive and negative sample assignment strategy OAA (Optimal Anchor Assignment) applied to the detection head is proposed. It effectively avoids the problem of unbalanced distribution of positive and negative samples caused by traditional sample assignment methods and marine benthos image noise. Experiments on the IOC-URPC dataset show that the mAP of YoloXT is 3.9% higher than that of YoloX, reaching 70.9%. YoloXT has demonstrated excellent performance in quantitative identification task of marine organisms, which can effectively contribute to the exploitation and conservation of marine re-sources. The source code is publicly available at https://github.com/F1veZhang/YOLOXT.

Object Pose Estimation Based on Improved YOLOX Algorithm

YoloXT: A Object Detection Algorithm for Marine Benthos

An Object Detection Method Based on Improved YOLOX

YOLO-Rlepose: Improved YOLO Based on Swin Transformer and Rle-Oks Loss for Multi-Person Pose Estimation

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation.

An enhanced real-time human pose estimation method based on modified YOLOv8 framework

RNNPose: 6-DoF Object Pose Estimation Via Recurrent Correspondence Field Estimation and Pose Optimization

YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation

KSL-POSE: A Real-Time 2D Human Pose Estimation Method Based on Modified YOLOv8-Pose Framework

Research on Human Posture Estimation Algorithm Based on YOLO-Pose

Robust RGB-based 6-DoF Pose Estimation without Real Pose Annotations

RA-YOLOX: Re-parameterization align decoupled head and novel label assignment scheme based on YOLOX

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

MDA-YOLO Person: a 2D human pose estimation model based on YOLO detection framework

RFA-YOLO-POSE: A Fusion Algorithm for Pose Detection and Object Identification Amidst Complex Crowds

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

Object Detection Algorithm Based on Improved YOLOv3

Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images

6DoF Pose Estimation of Transparent Object from a Single RGB-D Image