Abstract: Extensive convolutional neural network (CNN)-based methods have been widely used in remote sensing scene classification. However, the dense operation and huge memory storage of the state-of-the-art models hinder their deployment on low-power embedded devices. In this letter, we propose a mixed-precision quantization method to compress the model size without accuracy degradation. In this method, we propose a symmetric nonlinear quantization scheme to reduce the quantization error. A corresponding three-step training strategy is proposed to improve the performance of the quantized network. Finally, based on the proposed scheme and training strategy, we propose a neural architecture search (NAS)-based quantization bit-width search (NQBS) method. This method can automatically select a bit width for each quantized layer to obtain a mixed-precision network with an optimal model size. We apply the proposed method to the ResNet-34 and SqueezeNet networks and evaluate the quantized networks on the NWPU-RESISC45 data set. The experimental results show that the mixed-precision quantized networks under the proposed method strike a satisfying tradeoff between classification accuracy and model size.

What problem does this paper attempt to address?

The problem this paper addresses is how to compress convolutional neural network (CNN) models for remote sensing scene classification through mixed-precision quantization, without reducing classification accuracy, so that the models can be deployed on low-power embedded devices. Specifically, the paper proposes solutions to the following challenges:

1. **High computational and memory requirements**: Current state-of-the-art CNN models require billions of arithmetic operations and millions of parameters, which makes them difficult to use in real-time processing applications, especially on low-power embedded devices.
2. **Errors in the quantization process**: Traditional quantization methods are vulnerable to outliers in the weights and activation values, resulting in significant quantization errors.
3. **Vanishing gradients during training**: Training directly on quantized weights and activation values may lead to vanishing gradients, especially at low bit widths, which makes training more difficult.
4. **Manually set quantization bit width**: Previous works usually set a single quantization bit width by hand for the entire network, which makes it difficult to minimize the model size while maintaining network accuracy.

To solve these problems, the paper proposes the following methods:

- **Symmetric Nonlinear Quantization (SNLQ) scheme**: Introduces a nonlinear transformation to reduce the influence of outliers on the quantization error, thereby improving the performance of the quantized model.
- **Three-Step Training (TST) strategy**: Trains the network step by step. First the weights are quantized, then the activation values, and finally the weights and activation values are quantized simultaneously, which reduces the accuracy loss that quantization may cause.
- **Neural Architecture Search (NAS)-based Quantization Bit-Width Search (NQBS) method**: Automatically selects a quantization bit width for each quantized layer to obtain a mixed-precision network that achieves a satisfactory balance between model size and classification accuracy.

Through these methods, the paper aims to realize a remote sensing scene classification system that effectively compresses the model size while maintaining high classification accuracy.
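
The two ideas above that lend themselves to a short illustration are outlier-robust symmetric quantization and the size of a mixed-precision model. The Python sketch below is a minimal illustration under stated assumptions, not the paper's implementation: the summary does not give the SNLQ formula, so `tanh` is used here only as a stand-in nonlinear transformation, and the function names, layer sizes, and bit widths are all hypothetical.

```python
import math


def quantize_symmetric(values, bits, nonlinear=False):
    """Map floats to signed integer codes on a grid symmetric around zero.

    With nonlinear=True the values are first squashed with tanh, which
    compresses outliers before the uniform grid is applied. The tanh choice
    is an assumption for illustration, not the SNLQ formula from the paper.
    """
    if bits < 2:
        raise ValueError("a signed symmetric grid needs at least 2 bits")
    if not values:
        return []
    levels = 2 ** (bits - 1) - 1  # e.g. 7 positive levels for 4 bits
    transformed = [math.tanh(v) for v in values] if nonlinear else list(values)
    max_abs = max(abs(t) for t in transformed)
    if max_abs == 0.0:
        return [0 for _ in transformed]
    scale = max_abs / levels  # step size of the uniform grid
    return [round(t / scale) for t in transformed]


def model_size_bytes(layer_param_counts, layer_bits):
    """Storage needed for the weights when each layer has its own bit width."""
    total_bits = sum(n * b for n, b in zip(layer_param_counts, layer_bits))
    return total_bits / 8


# Hypothetical weights with one outlier (8.0).
weights = [0.1, -0.2, 0.3, 8.0]
linear_codes = quantize_symmetric(weights, bits=4)
nonlinear_codes = quantize_symmetric(weights, bits=4, nonlinear=True)
print(linear_codes)     # [0, 0, 0, 7]: the outlier stretches the grid, small weights collapse to 0
print(nonlinear_codes)  # [1, -1, 2, 7]: small weights stay distinguishable

# Hypothetical three-layer network: parameter counts and per-layer bit widths.
param_counts = [1000, 2000, 500]
mixed_bits = [8, 4, 2]
mixed_size = model_size_bytes(param_counts, mixed_bits)
fp32_size = model_size_bytes(param_counts, [32, 32, 32])
print(mixed_size, fp32_size)  # 2125.0 14000.0
```

In this toy example, a single outlier forces the linear quantizer to spend its whole 4-bit range on one value, while the squashed version keeps the small weights apart. The size calculation shows what a per-layer bit-width search trades off: each layer's bit width directly scales its share of the model size, so the search has to weigh that saving against the accuracy lost in that layer.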

Mixed-Precision Quantization for CNN-Based Remote Sensing Scene Classification

Hessian-based Mixed-Precision Quantization with Transition Aware Training for Neural Networks

Pse: Mixed Quantization Framework of Neural Networks for Efficient Deployment

Single-shot Pruning and Quantization for Hardware-Friendly Neural Network Acceleration

Low-precision CNN Model Quantization based on Optimal Scaling Factor Estimation

GradQuant: Low-Loss Quantization for Remote-Sensing Object Detection

Focused Quantization for Sparse CNNs

Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights

Bag of Tricks with Quantized Convolutional Neural Networks for image classification

Mixed-Precision Neural Network Quantization Via Learned Layer-Wise Importance

Quantizing separable convolution of MobileNets with mixed precision

Joint Optimization of Dimension Reduction and Mixed-Precision Quantization for Activation Compression of Neural Networks

CSMPQ: Class Separability Based Mixed-Precision Quantization

Remote Sensing Image Scene Classification in Hybrid Classical–Quantum Transferring CNN with Small Samples

LSMQ: A Layer-Wise Sensitivity-Based Mixed-Precision Quantization Method for Bit-Flexible CNN Accelerator

Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization

Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms

Deep Hybrid Compression Network for Lidar Point Cloud Classification and Segmentation

SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks

GroupQ: Group-Wise Quantization With Multi-Objective Optimization for CNN Accelerators

Learning Low Resource Consumption CNN through Pruning and Quantization