Learned Point Cloud Compression for Classification

Mateen Ulhaq,Ivan V. Bajić
2023-08-11
Abstract:Deep learning is increasingly being used to perform machine vision tasks such as classification, object detection, and segmentation on 3D point cloud data. However, deep learning inference is computationally expensive. The limited computational capabilities of end devices thus necessitate a codec for transmitting point cloud data over the network for server-side processing. Such a codec must be lightweight and capable of achieving high compression ratios without sacrificing accuracy. Motivated by this, we present a novel point cloud codec that is highly specialized for the machine task of classification. Our codec, based on PointNet, achieves a significantly better rate-accuracy trade-off in comparison to alternative methods. In particular, it achieves a 94% reduction in BD-bitrate over non-specialized codecs on the ModelNet40 dataset. For low-resource end devices, we also propose two lightweight configurations of our encoder that achieve similar BD-bitrate reductions of 93% and 92% with 3% and 5% drops in top-1 accuracy, while consuming only 0.470 and 0.048 encoder-side kMACs/point, respectively. Our codec demonstrates the potential of specialized codecs for machine analysis of point clouds, and provides a basis for extension to more complex tasks and datasets in the future.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve efficient data compression while maintaining high accuracy in the classification task of point cloud data. Specifically, the paper focuses on how to design a point - cloud codec specifically for machine analysis to overcome the problems existing in existing codecs when compressing point - cloud data, such as insufficient compression ratio or sacrificing the accuracy of classification tasks. The paper proposes a new point - cloud codec based on PointNet, which can significantly improve the compression ratio without compromising the performance of classification tasks. In addition, the paper also explores the possibility of running a lightweight version of the codec on resource - constrained devices, aiming to reduce network transmission requirements and improve the computational efficiency and adaptability of end - devices.