An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification

Ben Crulis, Barthelemy Serres, Cyril de Runz, Gilles Venturini

2024-08-08

Abstract: Current artificial neural networks are trained with parameters encoded as floating-point numbers, which occupy a large amount of memory at inference time. As deep learning models grow in size, training and running artificial neural networks on edge devices is becoming very difficult. Binary neural networks promise to reduce the size of deep neural network models, increase inference speed, and decrease energy consumption, and may therefore allow more powerful models to be deployed on edge devices. However, binary neural networks still prove difficult to train with the backpropagation-based gradient descent scheme. This paper extends the work of \cite{crulis2023alternatives}, which proposed adapting to binary neural networks two promising alternatives to backpropagation originally designed for continuous neural networks, and experimented with them on simple image classification datasets. This paper proposes new experiments on the ImageNette dataset, compares three different model architectures for image classification, and adds two further alternatives to backpropagation.
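
The size and speed claims in the abstract can be made concrete with a small sketch. It is not taken from the paper: the helper names (`binarize`, `pack_bits`, `binary_dot`) and the sample values are hypothetical, and the sketch assumes weights and activations are both constrained to +1/-1 so that a dot product reduces to an XNOR followed by a bit count.

```python
def binarize(weights):
    """Map each real-valued weight to +1 or -1 by its sign (0 is treated as +1)."""
    return [1 if w >= 0 else -1 for w in weights]


def pack_bits(signs):
    """Pack a list of +1/-1 values into one integer; bit i is 1 when signs[i] == +1."""
    word = 0
    for i, s in enumerate(signs):
        if s == 1:
            word |= 1 << i
    return word


def binary_dot(a_word, b_word, n):
    """Dot product of two packed +1/-1 vectors of length n via XNOR and popcount."""
    mask = (1 << n) - 1
    agree = ~(a_word ^ b_word) & mask      # bit set wherever the two signs agree
    matches = bin(agree).count("1")        # popcount
    return 2 * matches - n                 # each agreement adds +1, each disagreement -1


# Hypothetical example values, chosen only for illustration.
weights = [0.7, -1.2, 0.05, -0.3, 2.1, -0.8, 0.4, -0.1]
inputs = [1, 1, -1, -1, 1, -1, -1, 1]

w_signs = binarize(weights)
reference = sum(w * x for w, x in zip(w_signs, inputs))          # ordinary dot product
packed = binary_dot(pack_bits(w_signs), pack_bits(inputs), len(weights))

float32_bytes = 4 * len(weights)            # storage if each weight were a 32-bit float
binary_bytes = (len(weights) + 7) // 8      # storage at one bit per weight

print(reference == packed, float32_bytes, binary_bytes)
```

Storing one bit per weight instead of a 32-bit float is where the memory reduction comes from, and replacing multiply-accumulate with bitwise operations is the usual source of the inference speed and energy gains.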

Machine Learning

What problem does this paper attempt to address?

### Problems Addressed by the Paper

This paper studies training methods for Binary Neural Networks (BNNs) in image classification tasks and compares the impact of different training algorithms on BNN performance. Specifically:

1. **Challenges of BNNs**:
   - Current Artificial Neural Networks (ANNs) store their parameters as floating-point numbers, which occupy a large amount of memory during inference.
   - The increasing scale of deep learning models makes it very difficult to train and use ANNs on edge devices (such as smartphones).
   - BNNs can significantly reduce model size, improve inference speed, and lower energy consumption by using binary operations, making them better suited to deployment on resource-constrained edge devices.
   - However, BNNs remain difficult to train with gradient descent schemes based on backpropagation.
2. **Research on Alternatives**:
   - This paper extends previous work [1], which adapted two alternatives to backpropagation to BNNs and evaluated them on simple image classification datasets.
   - The paper conducts new experiments on the ImageNette dataset, comparing three different model architectures and adding two further alternatives to backpropagation.
3. **Main Contributions**:
   - Extended the experiments from previous work by adding new alternative algorithms and model architectures.
   - Provided extensive test results on the ImageNette dataset.
   - Released an open-source PyTorch framework that can train both binary and non-binary neural networks with different algorithms.

Through these studies, the paper seeks more suitable methods for training BNNs and explores their potential in practical applications.
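
One way to see why backpropagation struggles with BNNs is that the sign function has zero gradient almost everywhere. The sketch below illustrates this on a single latent weight and contrasts it with the straight-through estimator (STE), a commonly used workaround. This is an assumption for illustration only: the summary above does not say which estimator the paper's backpropagation baseline uses, and all names and values here (`ste_grad`, the learning rate, the toy loss) are hypothetical.

```python
def sign(x):
    """Binarization used in the forward pass (0 is treated as +1)."""
    return 1.0 if x >= 0 else -1.0


def loss(w, x, target):
    """Squared error of a one-weight 'network' whose weight is binarized."""
    return (sign(w) * x - target) ** 2


def numerical_grad(f, v, eps=1e-4):
    """Central finite difference, used here to expose the true gradient."""
    return (f(v + eps) - f(v - eps)) / (2 * eps)


def ste_grad(w, upstream, clip=1.0):
    """Straight-through estimator: treat d sign(w)/dw as 1 inside [-clip, clip], else 0."""
    return upstream if abs(w) <= clip else 0.0


# Hypothetical toy problem: the binarized weight should flip from +1 to -1.
w, x, target, lr = 0.3, 1.0, -1.0, 0.1

# The true gradient of the loss with respect to the latent weight is zero,
# so plain gradient descent would never move w.
true_grad = numerical_grad(lambda v: loss(v, x, target), w)

for _ in range(5):
    y = sign(w) * x                          # forward pass with the binarized weight
    dloss_dy = 2.0 * (y - target)            # gradient of the squared error w.r.t. y
    grad_w = ste_grad(w, upstream=dloss_dy * x)
    w -= lr * grad_w                         # update the real-valued latent weight

final_loss = loss(w, x, target)
print(true_grad, sign(w), final_loss)
```

The surrogate gradient lets the latent weight cross zero and flip its sign, but it is a biased approximation of the true gradient, which is one reason alternatives to backpropagation are of interest for BNNs.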

Shallow Binary Features Enhance the Robustness of Deep Convolutional Neural Networks

BinaryConnect: Training Deep Neural Networks with binary weights during propagations

Enabling Binary Neural Network Training on the Edge

Transfer Learning with Binary Neural Networks

On Study of the Binarized Deep Neural Network for Image Classification.

Learning Adaptive Gradients for Binary Neural Networks

Improving energy efficiency and classification accuracy of neuromorphic chips by learning binary synaptic crossbars

Deep Spiking Neural Networks with Binary Weights for Object Recognition

An adiabatic method to train binarized artificial neural networks

Exploring the Connection Between Binary and Spiking Neural Networks

Training Multi-Layer Binary Neural Networks With Local Binary Error Signals

Algorithm and hardware design of discrete-time spiking neural networks based on back propagation with binary activations

Models of binary classification of the semantic colouring of texts

Enabling On-device Continual Learning with Binary Neural Networks

Exploring Optimized Spiking Neural Network Architectures for Classification Tasks on Embedded Platforms

Binarizing by Classification: is Soft Function Really Necessary?

A More Biologically Plausible Local Learning Rule for ANNs

DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactual Explanations

A Robust Backpropagation-Free Framework for Images

Hyperparameter Optimization in Binary Communication Networks for Neuromorphic Deployment