An experimental comparative study of backpropagation and alternatives for training binary neural networks for image classification

Ben Crulis,Barthelemy Serres,Cyril de Runz,Gilles Venturini
2024-08-08
Abstract:Current artificial neural networks are trained with parameters encoded as floating point numbers that occupy lots of memory space at inference time. Due to the increase in the size of deep learning models, it is becoming very difficult to consider training and using artificial neural networks on edge devices. Binary neural networks promise to reduce the size of deep neural network models, as well as to increase inference speed while decreasing energy consumption. Thus, they may allow the deployment of more powerful models on edge devices. However, binary neural networks are still proven to be difficult to train using the backpropagation-based gradient descent scheme. This paper extends the work of \cite{crulis2023alternatives}, which proposed adapting to binary neural networks two promising alternatives to backpropagation originally designed for continuous neural networks, and experimented with them on simple image classification datasets. This paper proposes new experiments on the ImageNette dataset, compares three different model architectures for image classification, and adds two additional alternatives to backpropagation.
Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to study the training methods of Binary Neural Networks (BNNs) in image classification tasks and compare the impact of different training algorithms on BNN performance. Specifically: 1. **Challenges of BNNs**: - Current Artificial Neural Networks (ANNs) require a large amount of memory to store floating-point parameters, occupying significant space during inference. - The increasing scale of deep learning models makes it very difficult to train and use ANNs on edge devices (such as smartphones). - Binary Neural Networks can significantly reduce model size, improve inference speed, and lower energy consumption by using binary operations, making them more suitable for deployment on resource-constrained edge devices. - However, BNNs still face difficulties when trained using gradient descent schemes based on backpropagation. 2. **Research on Alternatives**: - This paper extends previous work [1] by proposing two alternative methods to backpropagation and conducting experiments on simple image classification datasets. - The paper further conducts new experiments on the ImageNette dataset, comparing three different model architectures and adding two new backpropagation alternative algorithms. 3. **Main Contributions**: - Extended the experiments from previous work by adding new alternative algorithms and model architectures. - Provided extensive test results on the ImageNette dataset. - Released an open-source Pytorch framework that can train both binary and non-binary neural networks under different algorithms. Through these studies, the paper seeks to find more suitable methods for training BNNs and explore their potential in practical applications.