Abstract: The rapid development of Autonomous Vehicles (AVs) increases the need for accurate detection of objects in the vicinity to ensure safer journeys. Sensors such as Three-Dimensional Light Detection and Ranging (3D LiDAR) and cameras can be used to detect objects effectively. The 3D LiDAR sensor captures the 3D shape of an object and produces point cloud data that describes its geometrical structure. However, LiDAR-only detectors may produce false detections, or miss objects entirely, at long distances. The camera sensor captures RGB images with enough detail to identify objects distinctly, and its high-resolution images support precise object classification. Nevertheless, hindrances such as the absence of depth information in the images, the unstructured nature of point clouds, and the gap between the two modalities hamper detection and degrade environmental perception. To this end, this paper proposes an object detection mechanism that fuses the data received from the camera sensor and the 3D LiDAR sensor (OD-C3DL). The 3D LiDAR sensor provides point clouds that describe the distance, position, and geometric shape of the object. OD-C3DL employs Convolutional Neural Networks (CNNs) to further process the data obtained from the 3D LiDAR sensor and the camera sensor and recognize objects effectively. The LiDAR point cloud is enhanced and fused with the image space on the Regions of Interest (ROIs) to make objects easier to recognize. The evaluation results show that OD-C3DL detects an average of 89 objects per frame in real time and reduces the extraction time while achieving a recall rate of 94%. The average processing time is 65 ms, which makes the OD-C3DL model well suited to AV perception. Furthermore, the mean accuracy of OD-C3DL in identifying automobiles and pedestrians at a moderate degree of difficulty, 79.13% and 88.76% respectively, is higher than that of previous models.
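
The fusion of LiDAR points with the image space on an ROI can be illustrated with a minimal sketch. This is not the OD-C3DL implementation, only a generic pinhole-camera projection under stated assumptions: the LiDAR points are assumed to be already transformed into the camera frame (x right, y down, z forward), the intrinsics `fx`, `fy`, `cx`, `cy` are illustrative values, and the helper names `project_point`, `points_in_roi`, and `median_depth` are hypothetical.

```python
from typing import List, Optional, Tuple

Point3D = Tuple[float, float, float]      # (x, y, z) in the camera frame, metres
Box = Tuple[float, float, float, float]   # (u_min, v_min, u_max, v_max) in pixels


def project_point(p: Point3D, fx: float, fy: float,
                  cx: float, cy: float) -> Optional[Tuple[float, float]]:
    """Project a camera-frame 3D point to pixel coordinates (pinhole model).

    Returns None for points at or behind the image plane (z <= 0).
    """
    x, y, z = p
    if z <= 0:
        return None
    return (fx * x / z + cx, fy * y / z + cy)


def points_in_roi(points: List[Point3D], roi: Box, fx: float, fy: float,
                  cx: float, cy: float) -> List[Point3D]:
    """Keep the 3D points whose projection falls inside a 2D image ROI."""
    u_min, v_min, u_max, v_max = roi
    kept = []
    for p in points:
        uv = project_point(p, fx, fy, cx, cy)
        if uv is None:
            continue
        u, v = uv
        if u_min <= u <= u_max and v_min <= v <= v_max:
            kept.append(p)
    return kept


def median_depth(points: List[Point3D]) -> Optional[float]:
    """Median forward distance (z) of the points, or None if there are none."""
    depths = sorted(p[2] for p in points)
    if not depths:
        return None
    mid = len(depths) // 2
    if len(depths) % 2:
        return depths[mid]
    return 0.5 * (depths[mid - 1] + depths[mid])


if __name__ == "__main__":
    # Assumed intrinsics and a hypothetical ROI from an image-based detector.
    fx = fy = 500.0
    cx, cy = 320.0, 240.0
    roi = (300.0, 200.0, 400.0, 280.0)
    cloud = [(0.0, 0.0, 10.0), (1.0, 0.0, 10.0), (5.0, 0.0, 10.0), (0.0, 0.0, -2.0)]
    inside = points_in_roi(cloud, roi, fx, fy, cx, cy)
    print(len(inside), median_depth(inside))
```

In this sketch the ROI supplies the image-side evidence of an object, while the points that project into it supply the depth that the image lacks; the median is used only as a simple, outlier-tolerant way to summarize that depth.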

Realtime Single-Shot Refinement Neural Network With Adaptive Receptive Field for 3D Object Detection From LiDAR Point Cloud

Real-Time And Robust 3D Object Detection with Roadside LiDARs

Real-Time 3D Object Detection From Point Cloud Through Foreground Segmentation

6DoF-3D: Efficient and accurate 3D object detection using six degrees-of-freedom for autonomous driving

Real-Time 3D Object Detection on Crowded Pedestrians

RT3D: Real-Time 3-D Vehicle Detection in LiDAR Point Cloud for Autonomous Driving

Up-Sampling Method for Low-Resolution LiDAR Point Cloud to Enhance 3D Object Detection in an Autonomous Driving Environment

Real-Time 3D Object Detection and Classification in Autonomous Driving Environment Using 3D LiDAR and Camera Sensors

Research on 3D Point Cloud Object Detection Algorithm for Autonomous Driving

Stereo RGB and Deeper LIDAR Based Network for 3D Object Detection

Pre-Segmented Down-Sampling Accelerates Graph Neural Network-Based 3D Object Detection in Autonomous Driving

KDA3D: Key-Point Densification and Multi-Attention Guidance for 3D Object Detection

Three-Attention Mechanisms for One-Stage 3-D Object Detection Based on LiDAR and Camera

RoarNet: A Robust 3D Object Detection based on RegiOn Approximation Refinement

Generalized Few-Shot 3D Object Detection of LiDAR Point Cloud for Autonomous Driving

Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data

A LiDAR Multi-Object Detection Algorithm for Autonomous Driving

Lidar Point Cloud Guided Monocular 3D Object Detection

Multi-Sensor 3D Object Box Refinement for Autonomous Driving

HRNet: 3D object detection network for point cloud with hierarchical refinement

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection