Moving Object Segmentation in Point Cloud Data using Hidden Markov Models

Vedant Bhandari,Jasmin James,Tyson Phillips,P. Ross McAree
2024-10-24
Abstract:Autonomous agents require the capability to identify dynamic objects in their environment for safe planning and navigation. Incomplete and erroneous dynamic detections jeopardize the agent's ability to accomplish its task. Dynamic detection is a challenging problem due to the numerous sources of uncertainty inherent in the problem's inputs and the wide variety of applications, which often lead to use-case-tailored solutions. We propose a robust learning-free approach to segment moving objects in point cloud data. The foundation of the approach lies in modelling each voxel using a hidden Markov model (HMM), and probabilistically integrating beliefs into a map using an HMM filter. The proposed approach is tested on benchmark datasets and consistently performs better than or as well as state-of-the-art methods with strong generalized performance across sensor characteristics and environments. The approach is open-sourced at <a class="link-external link-https" href="https://github.com/vb44/HMM-MOS" rel="external noopener nofollow">this https URL</a>.
Robotics,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **Accurately and robustly segment moving objects (Moving Object Segmentation, MOS) in point cloud data**. Specifically, autonomous agents (such as robots or self - driving vehicles) need to have the ability to identify dynamic objects in the environment to ensure safe planning and navigation. However, due to the uncertainties in the input data (such as sensor noise, environmental changes, etc.) and the diversity of application scenarios, dynamic object detection becomes a challenging problem. ### Problem Background: 1. **Importance of Dynamic Object Detection**: For autonomous agents, detecting motion in the workspace is a key ability. These agents use sensors such as cameras and LiDAR (Light Detection and Ranging) to perceive the environment. 2. **Limitations of Existing Methods**: Existing MOS methods usually rely on specific application scenarios and it is difficult to maintain consistent performance under different environments, platform dynamics and sensor characteristics. In addition, learning - based methods may require a large amount of labeled data and perform poorly in new environments. ### Method Proposed in the Paper: To solve the above problems, the author proposes a **learning - free MOS method**, which is achieved in the following ways: - **Voxel Representation Based on Hidden Markov Model (HMM)**: The state of each voxel is modeled by a three - state HMM, which are unobserved, occupied and free respectively. By probabilistically updating the voxel states, it can be inferred which voxels may be dynamic objects. - **Map Update Mechanism**: Convert the point cloud data into the map coordinate system and find all the observed voxels through ray casting. Update the occupancy probability of voxels according to the sensor observation results. - **Dynamic Point Identification**: Identify potential dynamic objects by detecting changes in voxel states and use 4D convolution to suppress noise detection and identify missed dynamic objects. ### Main Contributions: 1. **Robustness and Generalization Performance**: This method can perform well on multiple datasets without the need to re - configure parameters. 2. **Real - Time Performance**: Although the computational cost is high, real - time processing can be achieved within a certain range (20 - 50 meters). 3. **Open - Source Implementation**: The code has been open - sourced, which is convenient for other researchers to reproduce and improve. ### Experimental Verification: The paper was tested on multiple benchmark datasets (such as DOALS, Sipailou Campus and HeLiMOS), and the results show that the performance of this method is better than or comparable to the existing state - of - the - art methods. In conclusion, this paper aims to provide a robust and general - purpose MOS solution applicable to applications with different environments and sensor characteristics.