Abstract: We present a complete mechanistic description of the algorithm learned by a minimal non-linear sparse data autoencoder in the limit of large input dimension. The model, originally presented in <a class="link-https" data-arxiv-id="2209.10652" href="https://arxiv.org/abs/2209.10652">arXiv:2209.10652</a>, compresses sparse data vectors through a linear layer and decompresses them using another linear layer followed by a ReLU activation. We notice that when the data is permutation symmetric (no input feature is privileged), large models reliably learn an algorithm that is sensitive to individual weights only through their large-scale statistics. For these models, the loss function becomes analytically tractable. Using this understanding, we give the explicit scalings of the loss at high sparsity, and show that the model is near-optimal among recently proposed architectures. In particular, changing the activation function, or adding any elementwise or filtering operation to it, can at best improve the model's performance by a constant factor. Finally, we forward-engineer a model with the requisite symmetries and show that its loss precisely matches that of the trained models. Unlike the trained model weights, the low randomness in the artificial weights results in miraculous fractal structures resembling a Persian rug, to which the algorithm is oblivious. Our work contributes to neural network interpretability by introducing techniques for understanding the structure of autoencoders. Code to reproduce our results can be found at <a class="link-external link-https" href="https://github.com/KfirD/PersianRug" rel="external noopener nofollow">this https URL</a>.
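The architecture described in the abstract (a linear compression layer, then a linear decompression layer followed by a ReLU) can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' code: the names `sample_sparse_batch`, `autoencoder_forward`, and `mse_loss`, the bias term, the uniform distribution of active feature values, and the mean-squared-error loss are all assumptions made here for concreteness. The abstract itself only specifies the two linear layers, the ReLU, and the sparse, permutation-symmetric data.

```python
import numpy as np


def sample_sparse_batch(rng, batch, n_features, p_active):
    """Sample sparse inputs; every feature is treated identically.

    Assumption: each feature is active independently with probability
    p_active, and active values are uniform on [0, 1).
    """
    mask = rng.random((batch, n_features)) < p_active
    values = rng.random((batch, n_features))
    return mask * values


def autoencoder_forward(x, w_in, w_out, bias):
    """Compress with one linear layer, decompress with another, then ReLU.

    x:     (batch, n_features) sparse inputs
    w_in:  (n_hidden, n_features) compression weights
    w_out: (n_features, n_hidden) decompression weights
    bias:  (n_features,) output bias (an assumption of this sketch)
    """
    hidden = x @ w_in.T                 # (batch, n_hidden)
    pre_activation = hidden @ w_out.T + bias
    return np.maximum(pre_activation, 0.0)  # elementwise ReLU


def mse_loss(x, x_hat):
    """Mean squared reconstruction error (assumed loss for this sketch)."""
    return float(np.mean((x - x_hat) ** 2))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_features, n_hidden = 64, 16       # hypothetical sizes
    x = sample_sparse_batch(rng, batch=32, n_features=n_features, p_active=0.05)
    w_in = rng.normal(scale=0.1, size=(n_hidden, n_features))
    w_out = rng.normal(scale=0.1, size=(n_features, n_hidden))
    bias = np.zeros(n_features)
    x_hat = autoencoder_forward(x, w_in, w_out, bias)
    print("reconstruction loss with random weights:", mse_loss(x, x_hat))
```

The weights here are random and untrained, so the printed loss only shows the forward pass in action; it says nothing about the scalings or optimality results claimed in the abstract.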

Compression of Structured Data with Autoencoders: Provable Benefit of Nonlinearities and Depth

On the convergence of group-sparse autoencoders

The dynamics of representation learning in shallow, non-linear autoencoders

Why should autoencoders work?

Sparse $L^1$-Autoencoders for Scientific Data Compression

Autoencoders Learn Generative Linear Models

Compute Optimal Inference and Provable Amortisation Gap in Sparse Autoencoders

An Introduction to Neural Data Compression

Image Compression: Sparse Coding vs. Bottleneck Autoencoders

Video Compression With Rate-Distortion Autoencoders

Analyzing noise in autoencoders and deep networks

Thinner Latent Spaces: Detecting dimension and imposing invariance through autoencoder gradient constraints

A simple connection from loss flatness to compressed representations in neural networks

Learning a Compressed Sensing Measurement Matrix via Gradient Unrolling

HSAE: A Hessian Regularized Sparse Auto-Encoders.

High-Ratio Lossy Compression: Exploring the Autoencoder to Compress Scientific Data

Deep Learning of Nonnegativity-Constrained Autoencoders for Enhanced Understanding of Data

Deep Nonparametric Estimation of Intrinsic Data Structures by Chart Autoencoders: Generalization Error and Robustness

Feedback Recurrent Autoencoder for Video Compression

The Persian Rug: solving toy models of superposition using large-scale symmetries

Contractive Auto-Encoders: Explicit Invariance During Feature Extraction