Abstract:LiDAR (Light Detection And Ranging) is an essential and widely adopted sensor for autonomous vehicles, particularly for those vehicles operating at higher levels (L4-L5) of autonomy. Recent work has demonstrated the promise of deep-learning approaches for LiDAR-based detection. However, deep-learning algorithms are extremely data hungry, requiring large amounts of labeled point-cloud data for training and evaluation. Annotating LiDAR point cloud data is challenging due to the following issues: 1) A LiDAR point cloud is usually sparse and has low resolution, making it difficult for human annotators to recognize objects. 2) Compared to annotation on 2D images, the operation of drawing 3D bounding boxes or even point-wise labels on LiDAR point clouds is more complex and time-consuming. 3) LiDAR data are usually collected in sequences, so consecutive frames are highly correlated, leading to repeated annotations. To tackle these challenges, we propose LATTE, an open-sourced annotation tool for LiDAR point clouds. LATTE features the following innovations: 1) Sensor fusion: We utilize image-based detection algorithms to automatically pre-label a calibrated image, and transfer the labels to the point cloud. 2) One-click annotation: Instead of drawing 3D bounding boxes or point-wise labels, we simplify the annotation to just one click on the target object, and automatically generate the bounding box for the target. 3) Tracking: we integrate tracking into sequence annotation such that we can transfer labels from one frame to subsequent ones and therefore significantly reduce repeated labeling. Experiments show the proposed features accelerate the annotation speed by 6.2x and significantly improve label quality with 23.6% and 2.2% higher instance-level precision and recall, and 2.0% higher bounding box IoU. LATTE is open-sourced at <a class="link-external link-https" href="https://github.com/bernwang/latte" rel="external noopener nofollow">this https URL</a>.

OpenAnnotate2: Multi-Modal Auto-Annotating for Autonomous Driving

OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal 3D Data

From Time to Space: Automatic Annotation of Unmarked Traffic Scene Based on Trajectory Data.

ALGPT: Multi-Agent Cooperative Framework for Open-Vocabulary Multi-Modal Auto-Annotating in Autonomous Driving

Proximity based automatic data annotation for autonomous driving

OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection

OpenMPD: An Open Multimodal Perception Dataset for Autonomous Driving

A Semi-Automatic Video Labeling Tool for Autonomous Driving Based on Multi-Object Detector and Tracker

An Efficient Semi-Automated Scheme for Infrastructure LiDAR Annotation

Annotator: A Generic Active Learning Baseline for LiDAR Semantic Segmentation

LATTE: Accelerating LiDAR Point Cloud Annotation via Sensor Fusion, One-Click Annotation, and Tracking

Open 3D World in Autonomous Driving

MindReaD: Enhancing Pedestrian-Vehicle Interaction with Micro-Level Reasoning Data Annotation

PanopticNeRF-360: Panoramic 3D-to-2D Label Transfer in Urban Scenes

Smartannotator an Interactive Tool for Annotating Indoor Rgbd Images

Automated Data Annotation for 6-DoF AI-Based Navigation Algorithm Development

No Need to Sacrifice Data Quality for Quantity: Crowd-Informed Machine Annotation for Cost-Effective Understanding of Visual Data

Open-sourced Data Ecosystem in Autonomous Driving: the Present and Future

Open-Vocabulary High-Resolution 3D (OVHR3D) Data Segmentation and Annotation Framework

Fine-Grained Vehicle Perception via 3D Part-Guided Visual Data Augmentation

PALF: Pre-Annotation and Camera-LiDAR Late Fusion for the Easy Annotation of Point Clouds