Abstract: Geometric deep learning, i.e., designing neural networks to handle ubiquitous geometric data such as point clouds and graphs, has achieved great success in the last decade. One critical inductive bias is that the model should remain invariant to transformations such as translation, rotation, and scaling. Existing graph neural network (GNN) approaches maintain only permutation invariance and fail to guarantee invariance with respect to other transformations. Besides GNNs, other works design sophisticated transformation-invariant layers, which are computationally expensive and difficult to extend. To solve this problem, we revisit why existing neural networks cannot maintain transformation invariance when handling geometric data. Our findings show that transformation-invariant and distance-preserving initial representations are sufficient to achieve transformation invariance, without the need for sophisticated neural layer designs. Motivated by these findings, we propose Transformation Invariant Neural Networks (TinvNN), a straightforward and general framework for geometric data. Specifically, we realize transformation-invariant and distance-preserving initial point representations by modifying multi-dimensional scaling before feeding the representations into neural networks. We prove that TinvNN strictly guarantees transformation invariance and is general and flexible enough to be combined with existing neural networks. Extensive experimental results on point cloud analysis and combinatorial optimization demonstrate the effectiveness and general applicability of our proposed method. Based on the experimental results, we advocate that TinvNN should be considered a new starting point and an essential baseline for further studies of transformation-invariant geometric deep learning.
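As a rough illustration of why initial representations derived from pairwise distances can be transformation-invariant and distance-preserving, the sketch below applies plain classical multi-dimensional scaling to a small point cloud before and after a rigid motion. It is not the modified multi-dimensional scaling proposed in the paper, whose details the abstract does not give; the function names `classical_mds` and `pairwise_distances` and the toy data are assumptions made only for this example. Scaling invariance is not handled here, and the eigenvector sign ambiguity of classical MDS is left unresolved.

```python
import numpy as np

def pairwise_distances(x):
    """Euclidean distance between every pair of rows of x."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt(np.sum(diff ** 2, axis=-1))

def classical_mds(points, dim=None):
    """Hypothetical helper: embed points using only their pairwise distances."""
    n = points.shape[0]
    dim = points.shape[1] if dim is None else dim
    # Squared distances are unchanged by rotation and translation.
    d2 = pairwise_distances(points) ** 2
    # Double centering converts squared distances into a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ d2 @ centering
    # Keep the eigenpairs with the largest eigenvalues.
    evals, evecs = np.linalg.eigh(gram)
    order = np.argsort(evals)[::-1][:dim]
    top_vals = np.clip(evals[order], 0.0, None)  # guard against tiny negatives
    return evecs[:, order] * np.sqrt(top_vals)

rng = np.random.default_rng(0)
cloud = rng.normal(size=(6, 3))            # toy point cloud (assumed data)

# Build a random rotation from a QR factorization, then translate.
q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
if np.linalg.det(q) < 0:
    q[:, 0] = -q[:, 0]                     # force a proper rotation
moved = cloud @ q.T + np.array([1.0, -2.0, 0.5])

emb_a = classical_mds(cloud)
emb_b = classical_mds(moved)

# Both embeddings reproduce the original pairwise distances.
print(np.allclose(pairwise_distances(emb_a), pairwise_distances(cloud), atol=1e-6))
print(np.allclose(pairwise_distances(emb_b), pairwise_distances(cloud), atol=1e-6))
```

Because the embedding depends on the input only through its distance matrix, any network that consumes `emb_a` instead of raw coordinates sees the same input, up to eigenvector signs, regardless of how the cloud was rotated or translated.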

RC-CNN: Representation-Consistent Convolutional Neural Networks for Achieving Transformation Invariance

Transform-Invariant Convolutional Neural Networks for Image Classification and Search

Inability of spatial transformations of CNN feature maps to support invariant recognition

RIC-CNN: Rotation-Invariant Coordinate Convolutional Neural Network

What Does CNN Shift Invariance Look Like? A Visualization Study

Achieving Rotation Invariance in Convolution Operations: Shifting from Data-Driven to Mechanism-Assured

Learning Transformation-Invariant Representations for Image Recognition with Drop Transformation Networks

Convolutional Kernel Networks

CNN Architectures for Geometric Transformation-Invariant Feature Representation in Computer Vision: A Review

Learning Geometric Invariance Features and Discrimination Representation for Image Classification via Spatial Transform Network and XGBoost Modeling

Rotation Invariance Neural Network

Patch Reordering: a Novel Way to Achieve Rotation and Translation Invariance in Convolutional Neural Networks

RRL: Regional Rotation Layer in Convolutional Neural Networks

Equivariance-bridged SO(2)-Invariant Representation Learning using Graph Convolutional Network

Understanding when spatial transformer networks do not support invariance, and what to do about it

Conformer: Local Features Coupling Global Representations for Visual Recognition

Rotation Invariant Local Binary Convolution Neural Networks

Gradient-Aligned convolution neural network

Sorting Convolution Operation for Achieving Rotational Invariance

Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?