Abstract:The evolution of 3D visualization techniques has fundamentally transformed how we interact with digital content. At the forefront of this change is point cloud technology, offering an immersive experience that surpasses traditional 2D representations. However, the massive data size of point clouds presents significant challenges in data compression. Current methods for lossy point cloud attribute compression (PCAC) generally focus on reconstructing the original point clouds with minimal error. However, for point cloud visualization scenarios, the reconstructed point clouds with distortion still need to undergo a complex rendering process, which affects the final user-perceived quality. In this paper, we propose an end-to-end deep learning framework that seamlessly integrates PCAC with differentiable rendering, denoted as rendering-oriented PCAC (RO-PCAC), directly targeting the quality of rendered multiview images for viewing. In a differentiable manner, the impact of the rendering process on the reconstructed point clouds is taken into account. Moreover, we characterize point clouds as sparse tensors and propose a sparse tensor-based transformer, called SP-Trans. By aligning with the local density of the point cloud and utilizing an enhanced local attention mechanism, SP-Trans captures the intricate relationships within the point cloud, further improving feature analysis and synthesis within the framework. Extensive experiments demonstrate that the proposed RO-PCAC achieves state-of-the-art compression performance, compared to existing reconstruction-oriented methods, including traditional, learning-based, and hybrid methods.

Transformer and Upsampling-Based Point Cloud Compression

3QNet: 3D Point Cloud Geometry Quantization Compression Network

3QNet

Point Cloud Compression with Implicit Neural Representations: A Unified Framework

DeepCompress: Efficient Point Cloud Geometry Compression

Multiscale Point Cloud Geometry Compression

Multi-Scale end-to-End Learning for Point Cloud Geometry Compression

Point cloud upsampling via a coarse-to-fine network with transformer-encoder

Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression

Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor

PIVOT-Net: Heterogeneous Point-Voxel-Tree-based Framework for Point Cloud Compression

OctFormer: Efficient Octree-Based Transformer for Point Cloud Compression with Local Enhancement

Point Cloud Geometry Compression Based on Multi-Layer Residual Structure

Learned Point Cloud Geometry Compression

Deep Geometry Post-Processing for Decompressed Point Clouds.

Embedded Coding of Point Cloud Attributes

Rendering-Oriented 3D Point Cloud Attribute Compression using Sparse Tensor-based Transformer

Fast Point Cloud Geometry Compression with Context-based Residual Coding and INR-based Refinement

3D Point Cloud Geometry Compression on Deep Learning

Scalable Point Cloud Attribute Compression

Density-preserving Deep Point Cloud Compression