Abstract: Geometric deep learning, i.e., designing neural networks to handle the ubiquitous geometric data such as point clouds and graphs, have achieved great successes in the last decade. One critical inductive bias is that the model can maintain invariance towards various transformations such as translation, rotation, and scaling. The existing graph neural network (GNN) approaches can only maintain permutation-invariance, failing to guarantee invariance with respect to other transformations. Besides GNNs, other works design sophisticated transformation-invariant layers, which are computationally expensive and difficult to be extended. To solve this problem, we revisit why the existing neural networks cannot maintain transformation invariance when handling geometric data. Our findings show that transformation-invariant and distance-preserving initial representations are sufficient to achieve transformation invariance rather than needing sophisticated neural layer designs. Motivated by these findings, we propose Transformation Invariant Neural Networks (TinvNN), a straightforward and general framework for geometric data. Specifically, we realize transformation-invariant and distance-preserving initial point representations by modifying multi-dimensional scaling before feeding the representations into neural networks. We prove that TinvNN can strictly guarantee transformation invariance, being general and flexible enough to be combined with the existing neural networks. Extensive experimental results on point cloud analysis and combinatorial optimization demonstrate the effectiveness and general applicability of our proposed method. Based on the experimental results, we advocate that TinvNN should be considered a new starting point and an essential baseline for further studies of transformation-invariant geometric deep learning.

Transformation-invariant Gabor Convolutional Networks

Adaptive Gabor convolutional networks

Transform-Invariant Convolutional Neural Networks for Image Classification and Search

Orientation Convolutional Networks for Image Recognition

Group Equivariant Convolutional Networks

Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?

Gradient-Aligned convolution neural network

Gaussian Context Transformer.

Convolutional Neural Networks with Gated Recurrent Connections

Learning Transformation-Invariant Representations for Image Recognition with Drop Transformation Networks.

GIFT: Learning Transformation-Invariant Dense Visual Descriptors Via Group CNNs

HCFNN: High-order Coverage Function Neural Network for Image Classification

[Tuberculosis in miprant workers in the Netherlands].

Twisted Convolutional Networks (TCNs): Enhancing Feature Interactions for Non-Spatial Data Classification

ScaleGCN: Efficient and Effective Graph Convolution Via Channel-Wise Scale Transformation

OACNNs: Orientation adaptive convolutional neural networks

Enhanced Convolutional Neural Tangent Kernels

Convolutional Networks with Cross-Layer Neurons for Image Recognition

Isometric Transformation Invariant Graph-based Deep Neural Network

G$^2$CN: Graph Gaussian Convolution Networks with Concentrated Graph Filters

Dynamic Group Convolution for Accelerating Convolutional Neural Networks