Enhancing Moving Object Segmentation with Spatio-Temporal Information Fusion

Siyu Chen, Yilei Huang, Qilin Li, Ruosong Wang, Zhenhai Zhang
DOI: https://doi.org/10.1109/icma61710.2024.10633122
2024-01-01
Abstract: Accurately sensing moving objects provides information about dynamic changes in the environment, and segmenting those objects helps autonomous systems make better decisions and improves SLAM. Effective use of spatio-temporal information is paramount for LiDAR Moving Object Segmentation (LiDAR-MOS). We propose an efficient approach that attains more accurate point cloud segmentation by leveraging spatio-temporal information from multiple LiDAR scans and their corresponding poses. Specifically, using the acquired pose information, we first transform the point clouds of the sequence into the coordinate system of the current frame. The aligned point clouds are then discretized to generate a BEV occupancy representation. Subsequently, we employ a Spatio-Temporal Excitation (STE) module to excite the spatio-temporal features of the superimposed representations, which are then fed into a Spatio-Temporal Pyramid Network (STPN) for dual-head decoding and result fusion. We trained and evaluated our network on the nuScenes dataset. The results of comparative and ablation studies demonstrate the advantages of the proposed method.
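The first two steps of the abstract (pose-based alignment into the current frame, then discretization into a BEV occupancy grid) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes each pose is a 4x4 homogeneous sensor-to-world matrix, and the function names, grid ranges, and cell sizes are hypothetical choices made for illustration. It uses plain Python lists so that it runs without third-party libraries.

```python
import math


def invert_rigid(T):
    """Invert a 4x4 rigid transform [R|t] as [R^T | -R^T t]."""
    R_t = [[T[j][i] for j in range(3)] for i in range(3)]
    t = [T[i][3] for i in range(3)]
    new_t = [-sum(R_t[i][k] * t[k] for k in range(3)) for i in range(3)]
    return [R_t[i] + [new_t[i]] for i in range(3)] + [[0.0, 0.0, 0.0, 1.0]]


def matmul4(A, B):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def align_to_current(past_points, past_pose, current_pose):
    """Map points from a past sensor frame into the current sensor frame.

    Assumption: both poses are sensor-to-world transforms, so the chain is
    past frame -> world -> current frame.
    """
    T = matmul4(invert_rigid(current_pose), past_pose)
    return [
        tuple(T[i][0] * x + T[i][1] * y + T[i][2] * z + T[i][3]
              for i in range(3))
        for x, y, z in past_points
    ]


def bev_occupancy(points, x_range, y_range, z_range, cell, z_cell):
    """Build a binary grid indexed as grid[ix][iy][iz].

    The height bins (iz) play the role of channels in a BEV image.
    Points outside the given ranges are dropped.
    """
    nx = int(round((x_range[1] - x_range[0]) / cell))
    ny = int(round((y_range[1] - y_range[0]) / cell))
    nz = int(round((z_range[1] - z_range[0]) / z_cell))
    grid = [[[0] * nz for _ in range(ny)] for _ in range(nx)]
    for x, y, z in points:
        ix = math.floor((x - x_range[0]) / cell)
        iy = math.floor((y - y_range[0]) / cell)
        iz = math.floor((z - z_range[0]) / z_cell)
        if 0 <= ix < nx and 0 <= iy < ny and 0 <= iz < nz:
            grid[ix][iy][iz] = 1
    return grid
```

In a multi-scan setting, one such grid would be built per aligned scan and the grids stacked along a time axis before being passed to the network. The stacking order and grid resolution are not stated in the abstract, so they are left open here.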