Moving Object Segmentation for LiDAR-based Localization and Mapping

Xieyuanli Chen
2021-01-01
Abstract: The ability to identify which parts of the environment are static and which ones are moving is key to safe and reliable autonomous navigation. It supports the task of predicting the future state of the surroundings, collision avoidance, and planning. This knowledge can also improve and robustify pose estimation, sensor data registration, and simultaneous localization and mapping (SLAM). Thus, accurate and reliable moving object segmentation (MOS) in sensor data at frame rate is a crucial capability supporting most autonomous mobile systems.

Depending on the application domain and chosen sensor setup, moving object segmentation can be a challenging task. While there has been considerable interest in vision-based moving object segmentation [27, 23, 20] and in approaches combining vision and LiDAR sensors [33, 22], we concentrate here on approaches using only LiDAR sensors. Below, we distinguish between map-based and map-free approaches.

Map-based approaches. Most of the existing LiDAR-based approaches target the cleaning of a point cloud map. These methods mostly run offline and rely on a pre-built map. Some methods use time-consuming voxel ray casting and require accurately aligned poses to clean the dense terrestrial laser scans [12, 25]. To alleviate the computational burden, visibility-based methods have been proposed [21, 32]. These types of methods associate a query point cloud with a map point within a narrow field of view, e.g., the cone-shaped one used by Pomerleau et al. [21]. More recently, Pagad et al. [18] proposed an occupancy map-based method to remove dynamic points in LiDAR scans. Kim et al. [14] proposed a range-image-based method, which exploits a consistency check between the query scan and the pre-built map to remove dynamic points. Even though such map-based methods can separate moving objects from the background, they need a pre-built and cleaned map and therefore usually cannot achieve online operation.

Map-free approaches.
Recently, LiDAR-based semantic segmentation methods operating only on the sensor data have achieved great success [17, 8, 15]. Wang et al. [31] tackle the problem of segmenting things that could move, e.g., cars, pedestrians, and bicyclists, from 3D laser scans of urban scenes. Ruchti and Burgard [24] also propose a learning-based method to predict the probabilities of potentially movable objects. Dewan et al. [10] propose a LiDAR-based scene flow method that estimates motion vectors for rigid bodies. Based on that, they recently developed a semantic segmentation
[Figure: raw point cloud and segmented point cloud]
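
To make the range-image-based consistency check mentioned under the map-based approaches more concrete, the following is a minimal sketch in Python with NumPy. It is not the method of Kim et al. [14] or of any other cited work. The function names `range_image` and `flag_dynamic`, the image size, the vertical field of view, and the `min_gap` threshold are illustrative assumptions. The sketch also assumes that the map points have already been transformed into the query sensor frame, which is exactly where the need for accurately aligned poses comes from.

```python
import numpy as np


def range_image(points, height=16, width=64, fov_up_deg=15.0, fov_down_deg=-15.0):
    """Spherically project an (N, 3) point cloud into a (height, width) range image.

    Returns the image (nearest range per pixel, inf where empty) together with
    the per-point range and pixel coordinates. All parameters are assumptions.
    """
    points = np.asarray(points, dtype=float)
    r = np.linalg.norm(points, axis=1)
    yaw = np.arctan2(points[:, 1], points[:, 0])
    pitch = np.arcsin(points[:, 2] / np.maximum(r, 1e-9))

    fov_up = np.radians(fov_up_deg)
    fov_down = np.radians(fov_down_deg)

    # Map the angles to pixel coordinates and clamp them to the image bounds.
    u = 0.5 * (1.0 - yaw / np.pi) * width
    v = (fov_up - pitch) / (fov_up - fov_down) * height
    u = np.clip(np.floor(u), 0, width - 1).astype(int)
    v = np.clip(np.floor(v), 0, height - 1).astype(int)

    # Keep the nearest return when several points fall into the same pixel.
    image = np.full((height, width), np.inf)
    np.minimum.at(image, (v, u), r)
    return image, r, v, u


def flag_dynamic(query_points, map_points, min_gap=0.5, **projection):
    """Flag query points lying at least min_gap metres in front of the map surface.

    Both clouds must be expressed in the query sensor frame (assumption).
    """
    map_image, _, _, _ = range_image(map_points, **projection)
    _, r, v, u = range_image(query_points, **projection)
    map_r = map_image[v, u]
    # A query return much closer than the map along the same ray occupies space
    # that the map regards as free, so it is treated as a dynamic candidate.
    return np.isfinite(map_r) & (map_r - r > min_gap)
```

For example, if `map_points` contains a wall 10 m ahead and `query_points` contains the same wall plus one return 5 m ahead on the same ray, `flag_dynamic(query_points, map_points)` marks only the closer return. A pixel-wise comparison of this kind avoids voxel ray casting, but it depends on the image resolution and on pose accuracy, which is consistent with the limitations of map-based methods noted above.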