Abstract:Transactions of the Institute of Measurement and Control, Ahead of Print. In the domain of autonomous driving, object detection presents several complex challenges, particularly concerning the accurate identification of small and salient objects. This paper introduces DL-YOLOX (Dilated Enhancement YOLOX), which flexibly uses dilated convolution to enhance features to achieve the purpose of improving small objects and silent objects. As we all know, a large receptive field covers a larger area and has greater contextual information, which is more advantageous for detecting large targets. A small receptive field helps capture local details and has better detection capabilities for detecting small targets. To bolster the representation of objects across various scales, we propose the integration of Dilated Adaptive Feature Fusion (DAFF) which has the ability to adaptively fuse features with different receptive fields. This innovative fusion mechanism allows for a more comprehensive understanding of objects, enabling improved detection accuracy even for objects of varying sizes. In addition, we tackle the issue of small object loss during feature propagation by introducing Stack Dilated Module (SDM), a powerful module that mitigates this phenomenon and contributes to better detection performance. Moreover, we endeavor to enhance small object detection further by replacing the conventional Intersection over Union (IoU) metric with Normalized Gaussian Wasserstein Distance (NWD), a novel distance metric that proves to be more effective in accurately gauging small object detection, thus elevating the precision of our algorithm. To thoroughly evaluate the robustness and generalization capabilities of our proposed method, we conduct extensive experiments on two benchmark datasets, namely MS COCO 2017 and BDD100K. The results from our evaluation not only affirm the significant improvements achieved in multi-scale object detection but also highlight the real-time capability of our approach. The impressive performance across these datasets demonstrates the promising potential of DL-YOLOX in revolutionizing object detection techniques in the context of autonomous driving.

PAHD-YOLOv5:Parallel Attention and Hybrid Dilated Convolution for Autonomous Driving Object Detection

YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving

DAN-YOLO: A Lightweight and Accurate Object Detector Using Dilated Aggregation Network for Autonomous Driving

YOLO V4 with hybrid dilated convolution attention module for object detection in the aerial dataset

SA-YOLOv3: An Efficient and Accurate Object Detector Using Self-Attention Mechanism for Autonomous Driving

DCW-YOLO: Road Object Detection Algorithms for Autonomous Driving

An improved YOLOv8 algorithm for small object detection in autonomous driving

Improved YOLOv4 Based on Dilated Coordinate Attention for Object Detection

Enhanced YOLOX with United Attention Head for Road Detetion When Driving

YED-YOLO: an Object Detection Algorithm for Automatic Driving

High-precision real-time autonomous driving target detection based on YOLOv8

BiGA-YOLO: A Lightweight Object Detection Network Based on YOLOv5 for Autonomous Driving

DL-YOLOX: Real-time object detection via adjustable dilated enhancement for autonomous driving scene

Enhanced YOLOv5: An Efficient Road Object Detection Method

YOLO-SDH: improved YOLOv5 using scaled decoupled head for object detection

A Real-Time Object Detector for Autonomous Vehicles Based on YOLOv4

MobileYOLO: Real-Time Object Detection Algorithm in Autonomous Driving Scenarios

YOLOMH: you only look once for multi-task driving perception with high efficiency

Object Detection for Intelligent Driving Based on Improved YOLOv5

Real-Time Object Detection Algorithm of Autonomous Vehicles Based on Improved YOLOv5s

PDT-YOLO: A Roadside Object-Detection Algorithm for Multiscale and Occluded Targets