ShiftAddAug: Augment Multiplication-Free Tiny Neural Network with Hybrid Computation

Yipin Guo, Zihao Li, Yilin Lang, Qinyuan Ren
2024-07-03
Abstract: Operators devoid of multiplication, such as Shift and Add, have gained prominence for their compatibility with hardware. However, neural networks (NNs) employing these operators typically exhibit lower accuracy than conventional NNs with identical structures. ShiftAddAug uses costly multiplication to augment efficient but less powerful multiplication-free operators, improving performance without any inference overhead. It puts a ShiftAdd tiny NN into a large multiplicative model and encourages it to be trained as a sub-model to obtain additional supervision. To solve the weight discrepancy problem between hybrid operators, a new weight sharing method is proposed. Additionally, a novel two-stage neural architecture search is used to obtain better augmentation effects for smaller but stronger multiplication-free tiny neural networks. The superiority of ShiftAddAug is validated through experiments in image classification and semantic segmentation, consistently delivering noteworthy enhancements. Remarkably, it secures up to a 4.95% increase in accuracy on CIFAR100 compared to its directly trained counterparts, even surpassing the performance of multiplicative NNs.
Machine Learning, Artificial Intelligence, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the high energy consumption and computational cost of deploying deep neural networks (DNNs) on resource-constrained platforms. It proposes ShiftAddAug, a method that augments small multiplication-free neural networks through hybrid computation, improving their accuracy without increasing inference overhead. The main contributions of the paper include:

1. **Hybrid Computation Enhancement**: Augmenting small multiplication-free neural networks with multiplication operations, making the target network more expressive while keeping its structure unchanged.
2. **New Weight Sharing Strategy**: Proposing a new weight sharing strategy for hybrid computation enhancement, addressing the weight discrepancy between different operators.
3. **Two-Stage Neural Architecture Search Method**: Building on the augmentation concept, adopting a two-stage neural architecture search strategy to find efficient, multiplication-free small neural networks.

In experiments on image classification and semantic segmentation, ShiftAddAug delivers consistent improvements. On the CIFAR100 dataset, it improves accuracy by up to 4.95% compared to directly trained multiplication-free networks. The method also improves hardware efficiency, for example reducing energy consumption on CIFAR100 by 67.75% to 69.09%.
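To make the term "multiplication-free" concrete, the following is a minimal sketch of how a Shift operator can stand in for a multiplication: a weight is rounded to a signed power of two, and multiplying an integer activation by it reduces to a bit shift. This illustrates the general idea only and is not the paper's implementation; the function names `quantize_to_power_of_two` and `shift_multiply` are hypothetical, and the rounding rule (nearest exponent in the log2 domain) is an assumption.

```python
import math
from typing import Tuple


def quantize_to_power_of_two(w: float) -> Tuple[int, int]:
    """Approximate w as sign * 2**exponent (assumed rounding rule: nearest exponent in log2 space)."""
    if w == 0:
        return 0, 0
    sign = 1 if w > 0 else -1
    exponent = round(math.log2(abs(w)))
    return sign, exponent


def shift_multiply(x: int, sign: int, exponent: int) -> int:
    """Compute x * sign * 2**exponent using only a bit shift and a negation."""
    if sign == 0:
        return 0
    # A non-negative exponent is a left shift; a negative one is a right shift
    # (which floors the result for integer inputs).
    shifted = x << exponent if exponent >= 0 else x >> -exponent
    return -shifted if sign < 0 else shifted


# Hypothetical example: the weight 3.7 is approximated by +2**2,
# so "multiplying" the activation 12 by it becomes 12 << 2.
sign, exponent = quantize_to_power_of_two(3.7)
result = shift_multiply(12, sign, exponent)
print(sign, exponent, result)
```

The coarse approximation of 3.7 by 4 hints at why shift-based networks tend to lose accuracy relative to multiplicative ones with the same structure, which is the gap the augmentation described above is meant to narrow.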