Abstract:Recent advances of deep neural networks (DNNs) promote low-level vision applications in real-world scenarios, e.g. , image enhancement, dehazing. Nevertheless, DNN-based methods encounter challenges in terms of high computational and memory requirements, especially when deployed on real-world devices with limited resources. Quantization is one of effective compression techniques that significantly reduces computational and memory requirements by employing low-bit parameters and bit-wise operations. However, low-bit quantization for computational imaging ( Q-Imaging ) remains largely unexplored and usually suffer from a significant performance drop compared with the real-valued counterparts. In this work, through empirical analysis, we identify the main factor responsible for such significant performance drop underlies in the large gradient estimation error from non-differentiable weight quantization methods, and the activation information degeneration along with the activation quantization. To address these issues, we introduce a differentiable quantization search (DQS) method to learn the quantized weights and an information boosting module (IBM) for network activation quantization. Our DQS method allows us to treat the discrete weights in a quantized neural network as variables that can be searched. We achieve this end by using a differential approach to accurately search for these weights. In specific, each weight is represented as a probability distribution across a set of discrete values. During training, these probabilities are optimized, and the values with the highest probabilities are chosen to construct the desired quantized network. Moreover, our IBM module can rectify the activation distribution before quantization to maximize the self-information entropy, which retains the maximum information during the quantization process. Extensive experiments across a range of image processing tasks, including enhancement, super-resolution, denoising and dehazing, validate the effectiveness of our Q-Imaging along with superior performances compared to a variety of state-of-the-art quantization methods. In particular, the method in Q-Imaging also achieves a strong generalization performance when composing a detection network for the dark object detection task.

Efficient Neural Image Decoding Via Fixed-Point Inference

Computationally-Efficient Neural Image Compression with Shallow Decoders

Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations

Flexible Neural Image Compression via Code Editing

Accelerating Neural Network Inference by Overflow Aware Quantization

Unsupervised Network Quantization via Fixed-Point Factorization

Bandwidth-efficient Inference for Neural Image Compression

Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Learning Accurate Low-bit Quantization towards Efficient Computational Imaging

Efficient Neural Compression with Inference-time Decoding

Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation

IFQ-Net: Integrated Fixed-point Quantization Networks for Embedded Vision

Fixed Point Quantization of Deep Convolutional Networks

Towards On-demand Transmission: Joint Feature and Image Coding with Reversible Neural Networks

Standard compliant video coding using low complexity, switchable neural wrappers

Learned Image Compression with Generalized Octave Convolution and Cross-Resolution Parameter Estimation

Efficient Integer-Arithmetic-Only Convolutional Neural Networks

Fully Connected Network-Based Intra Prediction for Image Coding.

A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding

Advancing The Rate-Distortion-Computation Frontier For Neural Image Compression

FxpNet: Training a deep convolutional neural network in fixed-point representation