RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds

Remco Royen,Adrian Munteanu
DOI: https://doi.org/10.1109/ICIP49359.2023.10222338
2024-04-10
Abstract:While deep learning-based methods have demonstrated outstanding results in numerous domains, some important functionalities are missing. Resolution scalability is one of them. In this work, we introduce a novel architecture, dubbed RESSCAL3D, providing resolution-scalable 3D semantic segmentation of point clouds. In contrast to existing works, the proposed method does not require the whole point cloud to be available to start inference. Once a low-resolution version of the input point cloud is available, first semantic predictions can be generated in an extremely fast manner. This enables early decision-making in subsequent processing steps. As additional points become available, these are processed in parallel. To improve performance, features from previously computed scales are employed as prior knowledge at the current scale. Our experiments show that RESSCAL3D is 31-62% faster than the non-scalable baseline while keeping a limited impact on performance. To the best of our knowledge, the proposed method is the first to propose a resolution-scalable approach for 3D semantic segmentation of point clouds based on deep learning.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve resolution scalability in 3D semantic segmentation of point clouds. Although existing deep - learning - based methods have shown excellent results in many fields, when dealing with point cloud data, they usually need the entire point cloud dataset to start the inference process, which limits their flexibility and efficiency in practical applications. Specifically, these methods cannot handle the gradually increasing point cloud density over time, nor can they process low - resolution data while obtaining higher resolution. To overcome these problems, the paper proposes the RESSCAL3D architecture, a new deep - learning method that can perform 3D semantic segmentation on point clouds at different resolutions and can dynamically update prediction results when new points are acquired without re - processing all existing points. This method not only improves the processing speed but also enables early decision - making because it can generate preliminary semantic predictions even when only low - resolution data is available. The main contributions of the paper include: - Proposing the first deep - learning - based resolution - scalable 3D semantic segmentation method. - Designing a fusion module that can fuse features at different resolution levels to improve performance. - Experimental results show that compared with non - scalable baseline methods, RESSCAL3D improves the processing speed by 31 - 62% at the highest spatial resolution with limited impact on performance. In this way, RESSCAL3D not only improves the processing efficiency but also enhances the practicality and flexibility of the system, especially in application scenarios that require real - time processing and gradually increasing precision.