Abstract: Point cloud completion aims to reconstruct a complete 3D shape from an incomplete point cloud, and it is crucial for tasks such as 3D object detection and segmentation. Despite continuous advances in point cloud analysis, feature extraction methods still face clear limitations. Most methods take a sparsely sampled point cloud as input, which often loses some global structure information. Meanwhile, traditional local feature extraction methods usually struggle to capture intricate geometric details. To overcome these drawbacks, we introduce PointCFormer, a transformer framework optimized for robust global structure retention and precise local detail capture in point cloud completion. The framework makes three main contributions. First, we propose a relation-based local feature extraction method to perceive delicate local geometric characteristics. This approach establishes a fine-grained relationship metric between the target point and its k-nearest neighbors, quantifying each neighboring point's contribution to the target point's local features. Second, we introduce a progressive feature extractor that integrates our local feature perception method with self-attention. Starting with a denser sampling of points as input, it iteratively queries long-distance global dependencies and local neighborhood relationships. This extractor preserves more global structure and finer local details without substantial computational overhead. Third, we develop a correction module, applied after generating point proxies in the latent space, that reintroduces denser information from the input points and enhances the representation capability of the point proxies. PointCFormer demonstrates state-of-the-art performance on several widely used benchmarks.
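
To make the idea of a relation metric over k-nearest neighbors concrete, the following is a minimal sketch and not the PointCFormer implementation. The abstract does not define the metric, so this sketch assumes a softmax over negative squared Euclidean distances as a stand-in. The function names (`knn_indices`, `relation_weights`, `local_feature`) and the `temperature` parameter are hypothetical. It uses only the Python standard library to stay dependency-free.

```python
import math


def knn_indices(points, target_idx, k):
    """Indices of the k points nearest to points[target_idx], excluding itself."""
    target = points[target_idx]
    dists = [(math.dist(target, p), i)
             for i, p in enumerate(points) if i != target_idx]
    dists.sort()  # ascending distance; ties are broken by index
    return [i for _, i in dists[:k]]


def relation_weights(points, target_idx, neighbor_idx, temperature=1.0):
    """ASSUMED relation metric: softmax over negative squared distances.

    Closer neighbors receive larger weights; the weights sum to 1.
    """
    target = points[target_idx]
    scores = [-math.dist(target, points[j]) ** 2 / temperature
              for j in neighbor_idx]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def local_feature(points, features, target_idx, k=3):
    """Aggregate neighbor features, weighted by each neighbor's relation score."""
    nbrs = knn_indices(points, target_idx, k)
    weights = relation_weights(points, target_idx, nbrs)
    dim = len(features[0])
    aggregated = [
        sum(w * features[j][d] for w, j in zip(weights, nbrs))
        for d in range(dim)
    ]
    return nbrs, weights, aggregated


if __name__ == "__main__":
    # Toy cloud: four 3D points, each with a 2D feature vector.
    pts = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 2.0, 0.0), (5.0, 5.0, 5.0)]
    feats = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(local_feature(pts, feats, target_idx=0, k=2))
```

In this sketch each neighbor's weight plays the role of its "contribution" to the target point's local feature. A learned metric, which the abstract's wording suggests but does not specify, would replace the fixed distance-based score.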

PointCFormer: a Relation-based Progressive Feature Extraction Network for Point Cloud Completion
