Abstract:Deep convolutional neural networks (DCCNs) have shown pleasing performance in single image super-resolution (SISR). To deploy them onto real devices with limited storage and computational resources, a promising solution is to binarize the network, i.e., quantize each float-point weight and activation into 1 bit. However, existing works on binarizing DCNNs still suffer from severe performance degradation in SISR. To mitigate this problem, we argue that the performance degradation mainly comes from no appropriate constraint on the network weights, which causes it difficult to sensitively reverse the binarization results of these weights using the backpropagated gradient during training and thus limits the flexibility of network in respect of fitting extensive training samples. Inspired by this, we present an embarrassingly simple but effective binarization scheme for SISR, which can obviously relieve the performance degeneration resulted from network binarization and is applicable to different DCNN architectures. Specifically, we force each weight to follow a compact uniform prior, with which the weight will be given a very small absolute value close to zero and its binarization result can be straightforwardly reversed even by a small backpropagated gradient. By doing this, the flexibility and the generalization performance of the binarized network can be improved. Moreover, such a prior performs much better when introducing real identity shortcuts into the network. In addition, to avoid falling into bad local minima during training, we employ a pixel-wise curriculum learning strategy to learn the constrained weights in an easy-to-hard manner. Experiments on four SISR benchmark datasets demonstrate the effectiveness of the proposed binarization method in terms of binarizing different SISR network architectures, e.g., it even achieves performance comparable to the baseline with 5 quantization bits.

SBNN: Slimming binarized neural network

Learning to Slim Deep Networks with Bandit Channel Pruning

Loss Constrains Added Squeeze and Excitation Blocks for Pruning Deep Neural Networks

Batch-Normalization-based Soft Filter Pruning for Deep Convolutional Neural Networks

Efficient Structure Slimming for Spiking Neural Networks

Learning Slimming SSD Through Pruning and Knowledge Distillation

Self-distribution binary neural networks

Embarrassingly Simple Binarization for Deep Single Imagery Super-Resolution Networks

SiMaN: Sign-to-Magnitude Network Binarization

Network Binarization via Contrastive Learning

Distribution-sensitive Information Retention for Accurate Binary Neural Network

Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks

Loss-aware Binarization of Deep Networks.

Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis

Improving the Accuracy of Binarized Neural Networks and Application on Remote Sensing Data

Neural Network Compression using Binarization and Few Full-Precision Weights

Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers

CBin-NN: An Inference Engine for Binarized Neural Networks

BiPointNet: Binary Neural Network for Point Clouds

AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets

Ultra-low Latency Adaptive Local Binary Spiking Neural Network with Accuracy Loss Estimator