Abstract:Recent advances of deep neural networks (DNNs) promote low-level vision applications in real-world scenarios, e.g. , image enhancement, dehazing. Nevertheless, DNN-based methods encounter challenges in terms of high computational and memory requirements, especially when deployed on real-world devices with limited resources. Quantization is one of effective compression techniques that significantly reduces computational and memory requirements by employing low-bit parameters and bit-wise operations. However, low-bit quantization for computational imaging ( Q-Imaging ) remains largely unexplored and usually suffer from a significant performance drop compared with the real-valued counterparts. In this work, through empirical analysis, we identify the main factor responsible for such significant performance drop underlies in the large gradient estimation error from non-differentiable weight quantization methods, and the activation information degeneration along with the activation quantization. To address these issues, we introduce a differentiable quantization search (DQS) method to learn the quantized weights and an information boosting module (IBM) for network activation quantization. Our DQS method allows us to treat the discrete weights in a quantized neural network as variables that can be searched. We achieve this end by using a differential approach to accurately search for these weights. In specific, each weight is represented as a probability distribution across a set of discrete values. During training, these probabilities are optimized, and the values with the highest probabilities are chosen to construct the desired quantized network. Moreover, our IBM module can rectify the activation distribution before quantization to maximize the self-information entropy, which retains the maximum information during the quantization process. Extensive experiments across a range of image processing tasks, including enhancement, super-resolution, denoising and dehazing, validate the effectiveness of our Q-Imaging along with superior performances compared to a variety of state-of-the-art quantization methods. In particular, the method in Q-Imaging also achieves a strong generalization performance when composing a detection network for the dark object detection task.

Zero-Centered Fixed-Point Quantization With Iterative Retraining for Deep Convolutional Neural Network-Based Object Detectors

Hessian-based Mixed-Precision Quantization with Transition Aware Training for Neural Networks

AQD: Towards Accurate Quantized Object Detection

GradQuant: Low-Loss Quantization for Remote-Sensing Object Detection

Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms

Q-YOLO: Efficient Inference for Real-time Object Detection

Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing

Channel Pruning and Quantization-Based Learning for Object Detection with Computing Source Limited Application

IFQ-Net: Integrated Fixed-point Quantization Networks for Embedded Vision

Improving Post-Training Quantization on Object Detection with Task Loss-Guided Lp Metric

SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks

Bag of Tricks with Quantized Convolutional Neural Networks for image classification

Post-Training Non-Uniform Quantization for Convolutional Neural Networks

Towards Super Compressed Neural Networks for Object Identification: Quantized Low-Rank Tensor Decomposition with Self-Attention

Unsupervised Network Quantization via Fixed-Point Factorization

Compression for Text Detection and Recognition Based on Low Bit-Width Quantization

Fixed Point Quantization of Deep Convolutional Networks

Fixed-Point Back-Propagation Training

Learning Accurate Low-bit Quantization towards Efficient Computational Imaging

Adaptive Global Power-of-Two Ternary Quantization Algorithm Based on Unfixed Boundary Thresholds

Direct Quantization for Training Highly Accurate Low Bit-width Deep Neural Networks