Examining and Mitigating the Impact of Crossbar Non-idealities for Accurate Implementation of Sparse Deep Neural Networks

Abhiroop Bhattacharjee,Lakshya Bhatnagar,Priyadarshini Panda
DOI: https://doi.org/10.48550/arXiv.2201.05229
2022-01-14
Abstract:Recently several structured pruning techniques have been introduced for energy-efficient implementation of Deep Neural Networks (DNNs) with lesser number of crossbars. Although, these techniques have claimed to preserve the accuracy of the sparse DNNs on crossbars, none have studied the impact of the inexorable crossbar non-idealities on the actual performance of the pruned networks. To this end, we perform a comprehensive study to show how highly sparse DNNs, that result in significant crossbar-compression-rate, can lead to severe accuracy losses compared to unpruned DNNs mapped onto non-ideal crossbars. We perform experiments with multiple structured-pruning approaches (such as, C/F pruning, XCS and XRS) on VGG11 and VGG16 DNNs with benchmark datasets (CIFAR10 and CIFAR100). We propose two mitigation approaches - Crossbar column rearrangement and Weight-Constrained-Training (WCT) - that can be integrated with the crossbar-mapping of the sparse DNNs to minimize accuracy losses incurred by the pruned models. These help in mitigating non-idealities by increasing the proportion of low conductance synapses on crossbars, thereby improving their computational accuracies.
Machine Learning,Emerging Technologies
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **Research and mitigate the impact of crossbar non - idealities on the hardware implementation of sparse deep neural networks (DNNs)**. Specifically, the paper focuses on the following issues: 1. **Performance degradation of highly sparse DNNs on non - ideal crossbars**: Although structured pruning techniques can achieve more efficient DNNs on hardware, these techniques do not fully consider the non - idealities of crossbars (such as interconnect parasitic effects, synaptic nonlinearity and variation, etc.) in practical applications, resulting in a significant decrease in the inference accuracy of sparse DNNs when implemented on non - ideal crossbars. 2. **Trade - off between resource efficiency and performance**: As the structured sparsity of DNNs increases, although the hardware resource efficiency (area and energy) improves, the performance (inference accuracy) is sacrificed. To address these problems, the paper has carried out the following work: - **Experimental verification**: Experiments were carried out on VGG11 and VGG16 DNNs through multiple structured pruning methods (such as C/F pruning, XCS and XRS), using the CIFAR10 and CIFAR100 datasets, demonstrating the performance degradation of highly sparse DNNs on non - ideal crossbars. - **Proposing mitigation strategies**: Two hardware - friendly non - ideality mitigation strategies were proposed: - **Crossbar - column rearrangement**: By rearranging the columns of the weight matrix, the proportion of low - conductance synapses is increased, thereby reducing the impact of non - idealities. - **Weight - Constrained - Training (WCT)**: Training the structured - pruned DNN on software, restricting the weight range so that more weights are in a low - conductance state, thereby reducing the impact of non - idealities. Through these methods, the paper aims to improve the performance of sparse DNNs on non - ideal crossbars and provide references for future research.