Expansion-Squeeze-Block: Linear Over-parameterization with Shortcut Connections to Train Compact Convolutional Networks

Linzhuo Yang, Lan Zhang
DOI: https://doi.org/10.1109/bigcom57025.2022.00011
2022-01-01
Abstract: We propose a new structure, the Expansion-Squeeze-Block, which leverages over-parameterization to train a given compact neural network. The structure expands the width of convolutional layers and adds shortcut connections to improve performance without adding any nonlinearity. Because the block is purely linear, the expanded network can be contracted algebraically back to the original architecture at inference time without loss of information. In addition, we introduce a new initialization method that uses the weights of the pretrained original network and further improves accuracy. We evaluate our methods on the CIFAR-100 and Tiny-ImageNet datasets. In our experiments, Expansion-Squeeze-Block outperforms the baselines in most cases, with an average improvement of 0.81%, rising to 1.15% when combined with our initialization method. We also demonstrate the effectiveness of a partial expansion strategy that expands only the most important layers in VGG11-BN0.5x and ResNet20. Compared with expanding all layers, partial expansion saves on average 58.82% of #MACs and 38.82% of #Params.
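The algebraic contraction mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes the simplest case of channel-mixing matrices (how a 1x1 convolution acts at each pixel), an identity shortcut with equal input and output channels, and hypothetical names (`w_expand`, `w_squeeze`, `contract`, `ratio`). Under those assumptions, the wide training-time block and the single merged layer produce the same output.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def matadd(a, b):
    """Add two matrices of the same shape element by element."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def rand_matrix(rows, cols, rng):
    """Draw a rows x cols matrix with standard normal entries."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]

def contract(w_expand, w_squeeze, shortcut):
    """Merge squeeze(expand(x)) + shortcut(x) into one weight matrix.

    Valid only because no nonlinearity sits between the layers (assumption
    of this sketch, matching the abstract's "without adding any nonlinearity").
    """
    return matadd(matmul(w_squeeze, w_expand), shortcut)

rng = random.Random(0)
c_in, c_out, ratio = 4, 4, 8          # hypothetical sizes; ratio = width expansion
hidden = ratio * c_in

w_expand = rand_matrix(hidden, c_in, rng)    # compact width -> expanded width
w_squeeze = rand_matrix(c_out, hidden, rng)  # expanded width -> compact width
# Identity shortcut; assumes c_in == c_out.
shortcut = [[1.0 if i == j else 0.0 for j in range(c_in)] for i in range(c_out)]

x = rand_matrix(c_in, 5, rng)  # 5 input columns, e.g. 5 pixels

# Training-time forward pass through the wide block.
y_train = matadd(matmul(w_squeeze, matmul(w_expand, x)), matmul(shortcut, x))

# Inference-time forward pass through the single contracted layer.
w_merged = contract(w_expand, w_squeeze, shortcut)
y_infer = matmul(w_merged, x)

max_abs_diff = max(abs(p - q) for rp, rq in zip(y_train, y_infer)
                   for p, q in zip(rp, rq))
print("merged weight shape:", len(w_merged), "x", len(w_merged[0]))
print("outputs agree up to rounding:", max_abs_diff < 1e-9)
```

The merged matrix has the compact layer's shape, so the extra parameters exist only during training. Convolutions with larger kernels would need the kernels composed rather than simply multiplied, which this sketch does not cover.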