Abstract: Artificial and biological agents cannot learn from completely random and unstructured data. The structure of data is encoded in the metric relationships between data points. In the context of neural networks, neuronal activity within a layer forms a representation reflecting the transformation that the layer implements on its inputs. To use the structure in the data faithfully, such representations should reflect the input distances and thus be continuous and isometric. Supporting this statement, recent findings in neuroscience propose that generalization and robustness are tied to neural representations being continuously differentiable. In machine learning, most algorithms lack robustness and are generally thought to rely on aspects of the data that differ from those that humans use, as is commonly seen in adversarial attacks. During cross-entropy classification, the metric and structural properties of network representations are usually broken both between and within classes. This side effect of training can lead to instabilities under perturbations near locations where such structure is not preserved. One of the standard solutions to obtain robustness is to add ad hoc regularization terms, but to our knowledge, forcing representations to preserve the metric structure of the input data as a stabilising mechanism has not yet been studied. In this work, we train neural networks to perform classification while simultaneously maintaining within-class metric structure, leading to isometric within-class representations. Such network representations turn out to be beneficial for accurate and robust inference. By stacking layers with this property, we create a network architecture that facilitates hierarchical manipulation of internal neural representations. Finally, we verify that isometric regularization improves robustness to adversarial attacks on MNIST.
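The abstract does not state the exact form of the regularizer, so the following is only a minimal sketch of what a within-class isometry penalty could look like, assuming Euclidean distances and a mean-squared mismatch between input-space and representation-space distances. The function names `pairwise_distances` and `within_class_isometry_penalty` are hypothetical and are not taken from the paper.

```python
import numpy as np

def pairwise_distances(x):
    """Euclidean distance matrix between the rows of x (n_samples x n_features)."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def within_class_isometry_penalty(inputs, representations, labels):
    """Mean squared mismatch between input and representation distances.

    Only pairs of distinct samples that share a label contribute, so the
    penalty is zero when the layer is an isometry within each class.
    This form is an assumption, not the paper's stated loss.
    """
    d_in = pairwise_distances(inputs)
    d_rep = pairwise_distances(representations)
    same_class = labels[:, None] == labels[None, :]
    np.fill_diagonal(same_class, False)  # drop trivial self-pairs
    if not same_class.any():
        return 0.0
    return float(((d_in - d_rep) ** 2)[same_class].mean())

# Usage: a rotation preserves distances, a uniform rescaling does not.
rng = np.random.default_rng(0)
x = rng.normal(size=(6, 3))
labels = np.array([0, 0, 0, 1, 1, 1])
theta = 0.3
rotation = np.array([
    [np.cos(theta), -np.sin(theta), 0.0],
    [np.sin(theta),  np.cos(theta), 0.0],
    [0.0,            0.0,           1.0],
])
penalty_rotation = within_class_isometry_penalty(x, x @ rotation, labels)
penalty_scaling = within_class_isometry_penalty(x, 2.0 * x, labels)
```

In a training loop, a term like this would presumably be added to the cross-entropy loss with a weighting coefficient; that combination and the choice of coefficient are likewise assumptions here.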

Isometric Representations in Neural Networks Improve Robustness

On 1/n neural representation and robustness

Exploring mechanisms of Neural Robustness: probing the bridge between geometry and spectrum

An Empirical Study on the Relation between Network Interpretability and Adversarial Robustness

Adversarial Robustness with Partial Isometry

Fixed Inter-Neuron Covariability Induces Adversarial Robustness

Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

Hierarchical binding in convolutional neural networks: Making adversarial attacks geometrically challenging

Dense Associative Memory is Robust to Adversarial Inputs

Understanding Robust Learning through the Lens of Representation Similarities

Layerwise Hebbian/anti-Hebbian (HaH) Learning In Deep Networks: A Neuro-inspired Approach To Robustness

On the Robustness of Neural Collapse and the Neural Collapse of Robustness

Learning From Brains How to Regularize Machines

Globally-Robust Neural Networks

Not So Robust After All: Evaluating the Robustness of Deep Neural Networks to Unseen Adversarial Attacks

On the relationship between class selectivity, dimensionality, and robustness

An Extended Study of Human-like Behavior under Adversarial Training

Leveraging the Human Ventral Visual Stream to Improve Neural Network Robustness

Relational Constraints On Neural Networks Reproduce Human Biases towards Abstract Geometric Regularity

Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing Their Input Gradients

Measuring Neural Net Robustness with Constraints