Abstract: Recent advances in deep neural networks (DNNs) owe their success to training algorithms that use backpropagation and gradient descent. Backpropagation, while highly effective on von Neumann architectures, becomes inefficient when scaling to large networks. Each neuron's dependence on the weights and errors located deeper in the network, commonly referred to as the weight transport problem, requires exhaustive data movement, which presents a key problem in enhancing the performance and energy efficiency of machine-learning hardware. In this work, we propose a bio-plausible alternative to backpropagation, drawing from advances in feedback alignment algorithms, in which the error computation at a single synapse reduces to the product of three scalar values. Using a sparse feedback matrix, we show that a neuron needs only a fraction of the information previously used by the feedback alignment algorithms. Consequently, memory and compute can be partitioned and distributed in whichever way produces the most efficient forward pass, so long as a single error can be delivered to each neuron. We evaluate our algorithm using standard datasets, including ImageNet, to address the concern of scaling to challenging problems. Our results show orders of magnitude improvement in data movement and a 2× improvement in multiply-and-accumulate operations over backpropagation. Like previous work, we observe that any variant of feedback alignment suffers significant losses in classification accuracy on deep convolutional neural networks. By transferring trained convolutional layers and training the fully connected layers using direct feedback alignment, we demonstrate that direct feedback alignment can obtain results competitive with backpropagation. Furthermore, we observe that using an extremely sparse feedback matrix, rather than a dense one, results in a small accuracy drop while yielding hardware advantages. All the code and results are available at https://github.com/bcrafton/ssdfa.
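The idea of delivering a single error to each neuron can be illustrated with a minimal sketch. It is not taken from the linked repository. It assumes one reading of the abstract, in which each hidden neuron has exactly one nonzero feedback entry, so its error signal is the product of a fixed random feedback weight, one output error, and the local activation derivative. The function names, the tanh activation, and the uniform weight range are all hypothetical choices made for illustration.

```python
import math
import random


def make_sparse_feedback(n_out, n_hidden, rng):
    """Give each hidden neuron one output-error index and one fixed random weight.

    This stores a sparse feedback matrix column by column, with a single
    nonzero entry per hidden neuron (an assumption made for this sketch).
    """
    fb_idx = [rng.randrange(n_out) for _ in range(n_hidden)]
    fb_w = [rng.uniform(-1.0, 1.0) for _ in range(n_hidden)]
    return fb_idx, fb_w


def hidden_deltas(error, pre_act, fb_idx, fb_w):
    """Error signal per hidden neuron as a product of three scalars."""
    deltas = []
    for j, a in enumerate(pre_act):
        deriv = 1.0 - math.tanh(a) ** 2  # tanh'(a), local to neuron j
        deltas.append(fb_w[j] * error[fb_idx[j]] * deriv)
    return deltas


def weight_grads(deltas, inputs):
    """Gradient for the weight from input i to hidden neuron j: delta_j * x_i."""
    return [[d * x for x in inputs] for d in deltas]


if __name__ == "__main__":
    rng = random.Random(0)
    fb_idx, fb_w = make_sparse_feedback(n_out=3, n_hidden=4, rng=rng)
    error = [0.2, -0.5, 0.1]         # output-layer error (prediction - target)
    pre_act = [0.0, 0.3, -1.2, 0.8]  # hidden pre-activations
    deltas = hidden_deltas(error, pre_act, fb_idx, fb_w)
    print(weight_grads(deltas, inputs=[1.0, 0.5]))
```

In this sketch no forward weights from deeper layers are read during the update. Each neuron needs only its own feedback weight, one error value, and its own pre-activation, which is the property the abstract ties to reduced data movement.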

Equivariant Deep Weight Space Alignment

Improving Federated Relational Data Modeling via Basis Alignment and Weight Penalty

Weight Scope Alignment: A Frustratingly Easy Method for Model Merging

Scalable unsupervised alignment of general metric and non-metric structures

ALIGNet: Partial-Shape Agnostic Alignment via Unsupervised Learning

DNA: Dynamic Social Network Alignment

Deep Learning without Weight Symmetry

Deep Weighted Consensus: Dense correspondence confidence maps for 3D shape registration

Deep graph alignment network

Deep Cross-Network Alignment with Anchor Node Pair Diverse Local Structure.

Training-time Neuron Alignment through Permutation Subspace for Improving Linear Mode Connectivity and Model Fusion

Attent: Active Attributed Network Alignment

Learning Symmetries via Weight-Sharing with Doubly Stochastic Tensors

Direct Feedback Alignment With Sparse Connections for Local Learning

Adaptive Network Alignment with Unsupervised and Multi-order Convolutional Networks

SAlign: A Graph Neural Attention Framework for Aligning Structurally Heterogeneous Networks

Shared-latent Variable Network Alignment

DeepMatching: A Structural Seed Identification Framework for Social Network Alignment

Manifold Alignment Based on Sparse Local Structures of More Corresponding Pairs.

Git Re-Basin: Merging Models modulo Permutation Symmetries