Transactions of the Institute of Measurement and Control, Ahead of Print.

Abstract: In autonomous driving, object detection presents several complex challenges, particularly the accurate identification of small and salient objects. This paper introduces DL-YOLOX (Dilated Enhancement YOLOX), which flexibly uses dilated convolution to enhance features and thereby improve the detection of small and salient objects. A large receptive field covers a larger area and carries more contextual information, which favors the detection of large targets; a small receptive field captures local details and is better suited to small targets. To strengthen the representation of objects across scales, we propose Dilated Adaptive Feature Fusion (DAFF), which adaptively fuses features with different receptive fields. This fusion mechanism gives a more comprehensive representation of objects and improves detection accuracy across object sizes. In addition, we address the loss of small objects during feature propagation by introducing the Stack Dilated Module (SDM), which mitigates this loss and contributes to better detection performance. We further improve small object detection by replacing the conventional Intersection over Union (IoU) metric with the Normalized Gaussian Wasserstein Distance (NWD), a distance metric that is more effective for gauging small object detection and thus raises the precision of our algorithm. To evaluate the robustness and generalization of the proposed method, we conduct extensive experiments on two benchmark datasets, MS COCO 2017 and BDD100K.
The results confirm significant improvements in multi-scale object detection and show that the approach retains real-time capability. The performance across both datasets demonstrates the potential of DL-YOLOX for object detection in autonomous driving.
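The two quantitative ideas in the abstract, the receptive field of dilated convolutions and NWD as a replacement for IoU, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, the receptive-field formulas are the standard ones for stride-1 convolutions, and the NWD follows the commonly cited formulation in which a box (cx, cy, w, h) is modeled as a 2D Gaussian. The constant `c` is dataset-dependent, and the value used here is only a placeholder assumption.

```python
import math


def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by one dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def stacked_receptive_field(layers) -> int:
    """Receptive field of stacked stride-1 convolutions.

    `layers` is a sequence of (kernel_size, dilation) pairs. Each layer
    widens the receptive field by (kernel_size - 1) * dilation.
    """
    rf = 1
    for kernel_size, dilation in layers:
        rf += (kernel_size - 1) * dilation
    return rf


def iou(box_a, box_b) -> float:
    """Plain IoU for boxes given as (cx, cy, w, h)."""
    ax1, ay1 = box_a[0] - box_a[2] / 2, box_a[1] - box_a[3] / 2
    ax2, ay2 = box_a[0] + box_a[2] / 2, box_a[1] + box_a[3] / 2
    bx1, by1 = box_b[0] - box_b[2] / 2, box_b[1] - box_b[3] / 2
    bx2, by2 = box_b[0] + box_b[2] / 2, box_b[1] + box_b[3] / 2
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = box_a[2] * box_a[3] + box_b[2] * box_b[3] - inter
    return inter / union if union > 0 else 0.0


def nwd(box_a, box_b, c: float = 12.8) -> float:
    """Normalized Gaussian Wasserstein Distance for (cx, cy, w, h) boxes.

    Each box is treated as a Gaussian with mean (cx, cy) and covariance
    diag(w^2 / 4, h^2 / 4). The squared 2-Wasserstein distance then reduces
    to a squared Euclidean distance between (cx, cy, w/2, h/2) vectors.
    `c` is an assumed placeholder; it should be tuned per dataset.
    """
    w2_squared = (
        (box_a[0] - box_b[0]) ** 2
        + (box_a[1] - box_b[1]) ** 2
        + ((box_a[2] - box_b[2]) / 2) ** 2
        + ((box_a[3] - box_b[3]) / 2) ** 2
    )
    return math.exp(-math.sqrt(w2_squared) / c)


if __name__ == "__main__":
    # Three stacked 3x3 convolutions with dilations 1, 2, 3.
    print(stacked_receptive_field([(3, 1), (3, 2), (3, 3)]))

    # A 6x6 box shifted by 6 pixels no longer overlaps its original.
    small_a = (10.0, 10.0, 6.0, 6.0)
    small_b = (16.0, 10.0, 6.0, 6.0)
    print(iou(small_a, small_b), nwd(small_a, small_b))
```

For the two small boxes in the usage example, IoU drops to zero as soon as the boxes stop overlapping, while NWD still decays smoothly with the center offset. This is the usual motivation for preferring it on small objects, where a shift of a few pixels can otherwise erase all overlap.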

A study on a target detection model for autonomous driving tasks

Research on Autonomous Driving Image Recognition Based on a New Real-Time Object Detection Model YOLOv5st

High-precision real-time autonomous driving target detection based on YOLOv8

A Study on the Performance Improvement of a Conical Bucket Detection Algorithm Based on YOLOv8s

An improved YOLOv8 algorithm for small object detection in autonomous driving

Optimization of Autonomous Driving Image Detection Based on RFAConv and Triplet Attention

YOLOv8-QSD: An Improved Small Object Detection Algorithm for Autonomous Vehicles Based on YOLOv8

DAN-YOLO: A Lightweight and Accurate Object Detector Using Dilated Aggregation Network for Autonomous Driving

CF-YOLOX: An Autonomous Driving Detection Model for Multi-Scale Object Detection

YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving

SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving

Research on Improved Automatic Driving Target Detection Algorithm for Yolo v5

YOLOv3-MT: A YOLOv3 using multi-target tracking for vehicle visual detection

Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5

SNCE-YOLO: An Improved Target Detection Algorithm in Complex Road Scenes

DL-YOLOX: Real-time object detection via adjustable dilated enhancement for autonomous driving scene

Research on target detection method of distracted driving behavior based on improved YOLOv8

Research on Small Target Detection in Driving Scenarios Based on Improved Yolo Network

YOLO-MPAM: Efficient real-time neural networks based on multi-channel feature fusion

Improving real-time object detection in Internet-of-Things smart city traffic with YOLOv8-DSAF method

Enhancing YOLOv8's Performance in Complex Traffic Scenarios: Optimization Design for Handling Long-Distance Dependencies and Complex Feature Relationships