Abstract:The recent adoption of deep learning-based models for the processing and coding of multimedia signals has brought noticeable gains in performance, which have established deep learning-based solutions as the uncontested state-of-the-art both for computer vision tasks, targeting machine consumption, as well as, more recently, coding applications, targeting human visualization. Traditionally, applications requiring both coding and computer vision processing require first decoding the bitstream and then applying the computer vision methods to the decompressed multimedia signals. However, the adoption of deep learning-based solutions enables the use of compressed domain computer vision processing, with gains in performance and computational complexity over the decompressed domain approach. For point clouds (PCs), these gains have been demonstrated in the single available compressed domain computer vision processing solution, named Compressed Domain PC Classifier, which processes JPEG Pleno PC coding (PCC) compressed streams using a PC classifier largely compatible with the state-of-the-art spatial domain PointGrid classifier. However, the available Compressed Domain PC Classifier presents strong limitations by imposing a single, specific input size which is associated to specific JPEG Pleno PCC configurations; this limits the compression performance as these configurations are not ideal for all PCs due to their different characteristics, notably density. To overcome these limitations, this paper proposes the first Adaptive Compressed Domain PC Classifier solution which includes a novel adaptive bridge model that allows to process the JPEG Pleno PCC encoded bit streams using different coding configurations, now maximizing the compression efficiency. Experimental results show that the novel Adaptive Compressed Domain PC Classifier allows JPEG PCC to achieve better compression performance by not imposing a single, specific coding configuration for all PCs, regardless of its different characteristics. Moreover, the added adaptability power can achieve slightly better PC classification performance than the previous Compressed Domain PC Classifier and largely better PC classification performance (and lower number of weights) than the PointGrid PC classifier working in the decompressed domain.

Learned Point Cloud Compression for Classification

Scalable Human-Machine Point Cloud Compression

Learned Compression for Images and Point Clouds

3QNet: 3D Point Cloud Geometry Quantization Compression Network

3QNet

Deep Learning-based Compressed Domain Multimedia for Man and Machine: A Taxonomy and Application to Point Cloud Classification

DeepCompress: Efficient Point Cloud Geometry Compression

Adaptive bridge model for compressed domain point cloud classification

Point Cloud Compression with Bits-back Coding

Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor

Learned Point Cloud Geometry Compression

BiPointNet: Binary Neural Network for Point Clouds

Double Deep Learning-based Event Data Coding and Classification

Transformer and Upsampling-Based Point Cloud Compression

Scalable Point Cloud Attribute Compression

End-to-end learned Lossy Dynamic Point Cloud Attribute Compression

Lossy Point Cloud Geometry Compression Via End-to-End Learning

SPCGC: Scalable Point Cloud Geometry Compression for Machine Vision

Deep-PCAC: an End-to-End Deep Lossy Compression Framework for Point Cloud Attributes

DeepPCC: Learned Lossy Point Cloud Compression

Deep AutoEncoder-based Lossy Geometry Compression for Point Clouds