Model encoding of binary neural networks

Xichuan Zhou,Haijun Liu,Cong Shi,Ji Liu
DOI: https://doi.org/10.1016/b978-0-32-385783-3.00013-2
2022-01-01
Abstract:Though the low-bit quantization method is more efficient than the full-precision models, there is still a space to improve the efficiency of deep neural networks in both computation and memory requirements. As the last chapter introduced, the low-precision quantization networks enjoy the memory saving and efficient fixed-point computation. However, in some embedded devices, the energy consumption of quantization networks is still preventing the model deployment for real applications. To address this issue, in this chapter, we focus on a extreme quantization network, which constrains both the weight parameters and activations into discrete binary values {−1,+1}. It presents a hardware-friendly approach to improve the classification accuracy of efficient deep neural networks with binary weights and activations. Different from the standard feed-forward neural networks, the Cellular Binary Neural Network consists of multiple subnets of binary neurons, which are connected via global lateral connections. The brain-inspired lateral connections between different subnets are assumed to be sparse and grouped with respect to different source layers. Technically, the lateral connections are optimized by group sparse regularization and knowledge distillation. Experiments on the CIFAR-10 and ImageNet datasets showed that by incorporating optimized group-sparse lateral paths the Cellular Binary Neural Network can achieve higher classification accuracy than other binary deep neural networks.
What problem does this paper attempt to address?