Improving Deep Learning on Point Cloud by Maximizing Mutual Information Across Layers

Di Wang, Lulu Tang, Xu Wang, Luqing Luo, Zhi-Xin Yang
DOI: https://doi.org/10.1016/j.patcog.2022.108892
IF: 8
2022-07-10
Pattern Recognition
Abstract: It is a fundamental and vital task to enhance the perception capability of the point cloud learning network in 3D machine vision applications. Most existing methods utilize feature fusion and geometric transformation to improve point cloud learning without paying enough attention to mining further intrinsic information across multiple network layers. Motivated to improve consistency between hierarchical features and strengthen the perception capability of the point cloud network, we propose exploring whether maximizing the mutual information (MI) across shallow and deep layers is beneficial to improve representation learning on point clouds. A novel design of Maximizing Mutual Information (MMI) Module is proposed, which assists the training process of the main network to capture discriminative features of the input point clouds. Specifically, the MMI-based loss function is employed to constrain the differences of semantic information in two hierarchical features extracted from the shallow and deep layers of the network. Extensive experiments show that our method is generally applicable to point cloud tasks, including classification, shape retrieval, indoor scene segmentation, 3D object detection, and completion, and illustrate the efficacy of our proposed method and its advantages over existing ones. Our source code will be available at https://github.com/wendydidi/MMI.git.
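The abstract does not say which MI estimator the MMI Module uses. As a rough illustration of what "maximizing MI between shallow and deep features" can look like, the sketch below assumes an InfoNCE-style contrastive lower bound, which is one common choice and not necessarily the authors' formulation. The function names, the `temperature` parameter, and the assumption that shallow and deep features have already been projected to the same dimension are all hypothetical.

```python
import math


def _normalize(v):
    """Scale a vector to unit L2 norm so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def info_nce_loss(shallow, deep, temperature=0.1):
    """InfoNCE loss over paired feature lists (assumed estimator, not the paper's).

    shallow[i] and deep[i] are features of the same input taken from a shallow
    and a deep layer (the positive pair); deep[j] with j != i act as negatives.
    """
    s = [_normalize(v) for v in shallow]
    d = [_normalize(v) for v in deep]
    total = 0.0
    for i, s_i in enumerate(s):
        # Similarity of sample i's shallow feature to every deep feature.
        logits = [sum(a * b for a, b in zip(s_i, d_j)) / temperature for d_j in d]
        # Log-sum-exp with the max subtracted for numerical stability.
        m = max(logits)
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        # Negative log-probability of picking the matching deep feature.
        total += log_z - logits[i]
    return total / len(s)


def mi_lower_bound(shallow, deep, temperature=0.1):
    """InfoNCE bound: I(shallow; deep) >= log(N) - loss, for a batch of size N."""
    return math.log(len(shallow)) - info_nce_loss(shallow, deep, temperature)
```

Under these assumptions, minimizing `info_nce_loss` raises the lower bound on MI, so in training it would typically be added to the main task loss as an auxiliary term with some weighting coefficient. Consistent shallow/deep pairs give a loss near zero, while mismatched pairs give a large loss.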
Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic