Abstract: We propose an end-to-end attribute compression method for dense point clouds. The method combines a frequency sampling module, a geometry-assisted adaptive-scale feature extraction module, and a global hyperprior entropy model. The frequency sampling module uses a Hamming window and the Fast Fourier Transform to extract the high-frequency components of the point cloud. The difference between the original point cloud and the sampled point cloud is divided into multiple sub-point clouds, which are then partitioned using an octree to provide a structured input for feature extraction. The feature extraction module integrates adaptive convolutional layers and uses offset-attention to capture both local and global features. A geometry-assisted attribute feature refinement module then refines the extracted attribute features. Finally, a global hyperprior model is introduced for entropy coding. This model propagates hyperprior parameters from the deepest (base) layer to the other layers, further improving coding efficiency. At the decoder, a mirrored network progressively restores the features and reconstructs the color attributes through transposed convolutional layers. The method encodes base-layer information at a low bitrate and progressively adds enhancement-layer information to improve reconstruction accuracy. Compared to the latest G-PCC test model (TMC13v23) under the MPEG common test conditions (CTCs), the proposed method achieves an average Bjøntegaard delta (BD) bitrate reduction of 24.58% for the Y component (21.23% for YUV combined) on the MPEG Category Solid dataset and 22.48% for the Y component (17.19% for YUV combined) on the MPEG Category Dense dataset. This is the first instance of a learning-based codec outperforming the G-PCC standard on these datasets under the MPEG CTCs.
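The reported gains are Bjøntegaard delta bitrate figures, i.e. the average bitrate difference between two codecs at equal quality. As a rough illustration of how such a figure can be computed, the following is a minimal sketch of the classical variant that fits a cubic polynomial to log-rate as a function of PSNR. The function name `bd_rate` and the sample rate and PSNR values are hypothetical, and the exact interpolation used for the MPEG CTC evaluation may differ.

```python
import numpy as np


def bd_rate(rate_anchor, psnr_anchor, rate_test, psnr_test):
    """Average bitrate change (%) of the test codec relative to the anchor
    at equal PSNR. Negative values mean the test codec saves bits.

    Assumption: classical Bjontegaard method with a cubic polynomial fit
    of log10(rate) over PSNR; four rate points per curve are typical.
    """
    log_ra = np.log10(np.asarray(rate_anchor, dtype=float))
    log_rt = np.log10(np.asarray(rate_test, dtype=float))
    pa = np.asarray(psnr_anchor, dtype=float)
    pt = np.asarray(psnr_test, dtype=float)

    # Fit log-rate as a cubic function of quality for each codec.
    poly_a = np.polyfit(pa, log_ra, 3)
    poly_t = np.polyfit(pt, log_rt, 3)

    # Integrate both fits over the overlapping PSNR interval only.
    lo = max(pa.min(), pt.min())
    hi = min(pa.max(), pt.max())
    int_a = np.polyint(poly_a)
    int_t = np.polyint(poly_t)
    avg_a = (np.polyval(int_a, hi) - np.polyval(int_a, lo)) / (hi - lo)
    avg_t = (np.polyval(int_t, hi) - np.polyval(int_t, lo)) / (hi - lo)

    # Convert the mean log-rate difference back to a percentage.
    return (10.0 ** (avg_t - avg_a) - 1.0) * 100.0


# Illustrative numbers only (not taken from the paper).
anchor_rate = [0.10, 0.20, 0.40, 0.80]   # bits per point
anchor_psnr = [30.0, 34.0, 38.0, 42.0]   # Y-PSNR in dB
test_rate = [0.8 * r for r in anchor_rate]
print(bd_rate(anchor_rate, anchor_psnr, test_rate, anchor_psnr))
```

In this toy setup the test codec needs 20% fewer bits at every quality level, so the function returns approximately -20, which would be reported as a 20% BD bitrate reduction.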

Scale-Adaptive Asymmetric Sparse Variational AutoEncoder for Point Cloud Compression

Point AE-DCGAN: A Deep Learning Model for 3D Point Cloud Lossy Geometry Compression

Sparse Representation based Deep Residual Geometry Compression Network for Large-scale Point Clouds

Sparse Tensor-Based Multiscale Representation for Point Cloud Geometry Compression

Multiscale Point Cloud Geometry Compression

Decoupled Sparse Priors Guided Diffusion Compression Model for Point Clouds

TSC-PCAC: Voxel Transformer and Sparse Convolution Based Point Cloud Attribute Compression for 3D Broadcasting

Learned Point Cloud Geometry Compression

LVAC: Learned Volumetric Attribute Compression for Point Clouds using Coordinate Based Networks

Deep-PCAC: an End-to-End Deep Lossy Compression Framework for Point Cloud Attributes

Deep Compression for Dense Point Cloud Maps

DeepCompress: Efficient Point Cloud Geometry Compression

Spatially Scalable Video-Based Point Cloud Compression

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds

Lossy Point Cloud Geometry Compression Via End-to-End Learning

SPCGC: Scalable Point Cloud Geometry Compression for Machine Vision

Color Enhancement for V-PCC Compressed Point Cloud via 2D Attribute Map Optimization

SPAC: Sampling-based Progressive Attribute Compression for Dense Point Clouds

PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression

Scalable Point Cloud Attribute Compression

Preserving High Quality in A Learning-based Compression Model for Point Cloud Videos