Abstract: While deep learning-based methods have demonstrated outstanding results in numerous domains, some important functionalities are missing. Resolution scalability is one of them. In this work, we introduce a novel architecture, dubbed RESSCAL3D, providing resolution-scalable 3D semantic segmentation of point clouds. In contrast to existing works, the proposed method does not require the whole point cloud to be available to start inference. Once a low-resolution version of the input point cloud is available, first semantic predictions can be generated in an extremely fast manner. This enables early decision-making in subsequent processing steps. As additional points become available, these are processed in parallel. To improve performance, features from previously computed scales are employed as prior knowledge at the current scale. Our experiments show that RESSCAL3D is 31-62% faster than the non-scalable baseline while keeping a limited impact on performance. To the best of our knowledge, the proposed method is the first to propose a resolution-scalable approach for 3D semantic segmentation of point clouds based on deep learning.

What problem does this paper attempt to address?

The paper aims to bring resolution scalability to 3D semantic segmentation of point clouds. Although existing deep-learning-based methods achieve excellent results in many fields, they usually need the entire point cloud before inference can start, which limits their flexibility and efficiency in practical applications. Specifically, they cannot handle a point cloud whose density increases gradually over time, nor can they process low-resolution data while higher-resolution data is still being acquired. To overcome these limitations, the paper proposes RESSCAL3D, a new deep-learning architecture that performs 3D semantic segmentation at different resolutions and updates its predictions as new points are acquired, without re-processing all existing points. This improves processing speed and enables early decision-making, because preliminary semantic predictions can be generated even when only low-resolution data is available.

The main contributions of the paper include:

- Proposing the first deep-learning-based resolution-scalable 3D semantic segmentation method.
- Designing a fusion module that fuses features from different resolution levels to improve performance.
- Experiments showing that, compared with the non-scalable baseline, RESSCAL3D is 31-62% faster at the highest spatial resolution, with limited impact on performance.

In this way, RESSCAL3D improves processing efficiency and makes the system more practical and flexible, especially in applications that require real-time processing with gradually increasing precision.
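The scale-by-scale idea summarized above can be illustrated with a small NumPy sketch. It is not the authors' implementation: the random disjoint partitioning, the doubling of partition sizes, the hand-written `toy_features` encoder, the nearest-neighbour lookup, and the averaging used as "fusion" are all assumptions chosen only to show the control flow, namely that each scale processes only its newly arrived points and reuses features already computed at earlier scales.

```python
import numpy as np


def split_into_scales(num_points, num_scales, seed=0):
    """Partition point indices into disjoint subsets, each about twice the
    size of the previous one (a hypothetical partitioning scheme)."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(num_points)
    weights = 2.0 ** np.arange(num_scales)
    bounds = np.round(np.cumsum(weights) / weights.sum() * num_points).astype(int)
    return np.split(order, bounds[:-1])


def nearest_neighbor_indices(query, reference):
    """For each query point, return the index of its closest reference point
    (brute force, fine for small toy inputs)."""
    d2 = ((query[:, None, :] - reference[None, :, :]) ** 2).sum(axis=-1)
    return d2.argmin(axis=1)


def toy_features(points):
    """Stand-in for a learned per-point encoder: xyz plus distance to origin."""
    radius = np.linalg.norm(points, axis=1, keepdims=True)
    return np.concatenate([points, radius], axis=1)


def process_scales(points, num_scales=3):
    """Process the cloud scale by scale, touching only the new points each time."""
    seen_pts = np.empty((0, 3))
    seen_feat = np.empty((0, 4))
    outputs = []
    for idx in split_into_scales(len(points), num_scales):
        pts = points[idx]
        feat = toy_features(pts)
        if len(seen_pts) > 0:
            # Reuse earlier-scale features as prior knowledge (toy fusion:
            # average with the feature of the nearest already-seen point).
            nn = nearest_neighbor_indices(pts, seen_pts)
            feat = 0.5 * (feat + seen_feat[nn])
        outputs.append((idx, feat))
        seen_pts = np.concatenate([seen_pts, pts])
        seen_feat = np.concatenate([seen_feat, feat])
    return outputs


if __name__ == "__main__":
    cloud = np.random.default_rng(1).normal(size=(700, 3))
    for scale, (idx, feat) in enumerate(process_scales(cloud)):
        print(f"scale {scale}: {len(idx)} new points, feature shape {feat.shape}")
```

In a real system, a per-point classifier would turn each scale's features into semantic labels, so the first predictions would be available as soon as the smallest partition has been processed.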

RESSCAL3D: Resolution Scalable 3D Semantic Segmentation of Point Clouds

RESSCAL3D++: Joint Acquisition and Semantic Segmentation of 3D Point Clouds

Pass3d: Precise And Accelerated Semantic Segmentation For 3d Point Cloud

Deep Projective 3D Semantic Segmentation

ProtoSeg: A Prototype-Based Point Cloud Instance Segmentation Method

SEGCloud: Semantic Segmentation of 3D Point Clouds

FA-ResNet: Feature affine residual network for large-scale point cloud segmentation

Voxel-based 3D Point Cloud Semantic Segmentation: Unsupervised Geometric and Relationship Featuring vs Deep Learning Methods

Multi-Scale Point-Wise Convolutional Neural Networks for 3D Object Segmentation From LiDAR Point Clouds in Large-Scale Environments

RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds

Semantic Segmentation of Point Cloud Scene via Multi-Scale Feature Aggregation and Adaptive Fusion

Dilated Nearest-Neighbor Encoding for 3D Semantic Segmentation of Point Clouds

Rethinking 3D LiDAR Point Cloud Segmentation

Rethinking Design and Evaluation of 3D Point Cloud Segmentation Models

Semantic segmentation of large-scale point clouds based on dilated nearest neighbors graph

Exploring Spatial Context for 3D Semantic Segmentation of Point Clouds

3D Semantic Segmentation of Large-Scale Point-Clouds in Urban Areas Using Deep Learning

PointResNet: Residual Network for 3D Point Cloud Segmentation and Classification

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds

Fusion of images and point clouds for the semantic segmentation of large-scale 3D scenes based on deep learning

Scalable 3D Panoptic Segmentation As Superpoint Graph Clustering