Moving Object Segmentation in 3D LiDAR Data: A Learning-based Approach Exploiting Sequential Data

Xieyuanli Chen,Shijie Li,Benedikt Mersch,Louis Wiesmann,Jürgen Gall,Jens Behley,Cyrill Stachniss
DOI: https://doi.org/10.1109/LRA.2021.3093567
2021-07-14
Abstract:The ability to detect and segment moving objects in a scene is essential for building consistent maps, making future state predictions, avoiding collisions, and planning. In this paper, we address the problem of moving object segmentation from 3D LiDAR scans. We propose a novel approach that pushes the current state of the art in LiDAR-only moving object segmentation forward to provide relevant information for autonomous robots and other vehicles. Instead of segmenting the point cloud semantically, i.e., predicting the semantic classes such as vehicles, pedestrians, roads, etc., our approach accurately segments the scene into moving and static objects, i.e., also distinguishing between moving cars vs. parked cars. Our proposed approach exploits sequential range images from a rotating 3D LiDAR sensor as an intermediate representation combined with a convolutional neural network and runs faster than the frame rate of the sensor. We compare our approach to several other state-of-the-art methods showing superior segmentation quality in urban environments. Additionally, we created a new benchmark for LiDAR-based moving object segmentation based on SemanticKITTI. We published it to allow other researchers to compare their approaches transparently and we furthermore published our code.
Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve real - time Moving Object Segmentation (MOS) in 3D LiDAR data. Specifically, the authors propose a method based on Convolutional Neural Network (CNN), using range images generated from 3D LiDAR scans and their residual images as input to distinguish between moving and static objects in the scene. This method aims to provide relevant information useful for self - driving cars and other vehicles, can operate at a speed exceeding the sensor frame rate, and shows better segmentation quality than other existing methods in urban environments. The main contributions of the paper are: 1. Propose a novel method to achieve moving object segmentation through CNN using range images and residual images generated from 3D LiDAR scans as input. 2. By combining range images and residual images, the network can utilize temporal information to effectively distinguish between moving and static objects. 3. Create a new LiDAR moving object segmentation benchmark based on SemanticKITTI, allowing other researchers to transparently compare their methods and publish the code. The goal of the paper is to improve the accuracy of moving object segmentation, especially in the case of online operation without pre - constructed maps, using only past LiDAR scan data. This helps to enhance the safety and reliability of autonomous navigation and support tasks such as future state prediction, collision avoidance and planning.