Abstract:In the field of autonomous driving, object detection under point clouds is indispensable for environmental perception. In order to achieve the goal of reducing blind spots in perception, many autonomous driving schemes have added low-cost blind-filling LiDAR on the side of the vehicle. Unlike point cloud target detection based on high-performance LiDAR, the blind-filling LiDARs have low vertical angular resolution and are mounted on the side of the vehicle, resulting in easily mixed point clouds of pedestrian targets in close proximity to each other. These characteristics are harmful for target detection. Currently, many research works focus on target detection under high-density LiDAR. These methods cannot effectively deal with the high sparsity of the point clouds, and the recall and detection accuracy of crowded pedestrian targets tend to be low. To overcome these problems, we propose a real-time detection model for crowded pedestrian targets, namely RTCP. To improve computational efficiency, we utilize an attention-based point sampling method to reduce the redundancy of the point clouds, then we obtain new feature tensors by the quantization of the point cloud space and neighborhood fusion in polar coordinates. In order to make it easier for the model to focus on the center position of the target, we propose an object alignment attention module (OAA) for position alignment, and we utilize an additional branch of the targets' location occupied heatmap to guide the training of the OAA module. These methods improve the model's robustness against the occlusion of crowded pedestrian targets. Finally, we evaluate the detector on KITTI, JRDB, and our own blind-filling LiDAR dataset, and our algorithm achieved the best trade-off of detection accuracy against runtime efficiency.

Accurate and Real-Time 3D Pedestrian Detection Using an Efficient Attentive Pillar Network

See Extensively While Focusing on the Core Area for Pedestrian Detection.

A Novel Approach to Design the Fast Pedestrian Detection for Video Surveillance System

Towards Accurate Dense Pedestrian Detection Via Occlusion-Prediction Aware Label Assignment and Hierarchical-Nms.

RPEA: A Residual Path Network with Efficient Attention for 3D pedestrian detection from LiDAR point clouds

Real-Time 3D Object Detection on Crowded Pedestrians

PillarBAPI: Enhancing Pillar-Based 3D Object Detection Through Attentive Pseudo-Image Feature Extraction

PIDNet: an Efficient Network for Dynamic Pedestrian Intrusion Detection

PEPillar: a point-enhanced pillar network for efficient 3D object detection in autonomous driving

Flexible Neural Network for Fast and Accurate Road Scene Perception.

Pillar-Based 3D Object Detection from Point Cloud with Multiattention Mechanism

An Efficient Multi-Task Network for Pedestrian Intrusion Detection

EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

Pillar-based multilayer pseudo-image 3D object detection

Fusion-attention network using dense scale-invariant feature transform flow image and point cloud for 3D pedestrian detection

Pedestrian As Points: an Improved Anchor-Free Method for Center-Based Pedestrian Detection.

Real‐time 3D multi‐pedestrian detection and tracking using 3D LiDAR point cloud for mobile robot

Fast Pedestrian Detection with Attention-Enhanced Multi-Scale RPN and Soft-Cascaded Decision Trees

A Real-Time Predictive Pedestrian Collision Warning Service for Cooperative Intelligent Transportation Systems Using 3D Pose Estimation

S-AT GCN: Spatial-Attention Graph Convolution Network based Feature Enhancement for 3D Object Detection

FastPillars: A Deployment-friendly Pillar-based 3D Detector