Mixed-Precision Quantization for CNN-Based Remote Sensing Scene Classification

Xin Wei, He Chen, Wenchao Liu, Yizhuang Xie
DOI: https://doi.org/10.1109/lgrs.2020.3007575
IF: 5.343
2021-10-01
IEEE Geoscience and Remote Sensing Letters
Abstract: Extensive convolutional neural network (CNN)-based methods have been widely used in remote sensing scene classification. However, the dense operation and huge memory storage of the state-of-the-art models hinder their deployment on low-power embedded devices. In this letter, we propose a mixed-precision quantization method to compress the model size without accuracy degradation. In this method, we propose a symmetric nonlinear quantization scheme to reduce the quantization error. A corresponding three-step training strategy is proposed to improve the performance of the quantized network. Finally, based on the proposed scheme and training strategy, we propose a neural architecture search (NAS)-based quantization bit-width search (NQBS) method. This method can automatically select a bit width for each quantized layer to obtain a mixed-precision network with an optimal model size. We apply the proposed method to the ResNet-34 and SqueezeNet networks and evaluate the quantized networks on the NWPU-RESISC45 data set. The experimental results show that the mixed-precision quantized networks under the proposed method strike a satisfying tradeoff between classification accuracy and model size.
imaging science & photographic technology; remote sensing; engineering, electrical & electronic; geochemistry & geophysics
What problem does this paper attempt to address?
This paper addresses how to compress convolutional neural network (CNN) models for remote sensing scene classification through mixed-precision quantization, without reducing classification accuracy, so that the models can be deployed on low-power embedded devices. Specifically, it targets the following challenges:

1. **High computational and memory requirements**: State-of-the-art CNN models involve billions of arithmetic operations and millions of parameters, which makes them difficult to use in real-time processing applications, especially on low-power embedded devices.
2. **Errors in the quantization process**: Traditional quantization methods are sensitive to outliers in the weights and activation values, which leads to significant quantization errors.
3. **Vanishing gradients during training**: Training directly on quantized weights and activation values may cause vanishing gradients, especially in low-bit quantization, which makes training more difficult.
4. **Manually set quantization bit width**: Previous works usually set the quantization bit width of the entire network by hand, which makes it difficult to minimize the model size while maintaining network accuracy.

To solve these problems, the paper proposes the following methods:

- **Symmetric Non-Linear Quantization (SNLQ) scheme**: Introduces a non-linear transformation that reduces the influence of outliers on the quantization error, thereby improving the performance of the quantized model.
- **Three-Step Training (TST) strategy**: Trains the network step by step. First the weights are quantized, then the activation values, and finally the weights and activation values are quantized simultaneously, which reduces the accuracy loss that quantization may cause.
- **Neural Architecture Search (NAS)-based Quantization Bit-Width Search (NQBS) method**: Automatically selects a quantization bit width for each quantized layer to obtain a mixed-precision network that achieves a satisfactory balance between model size and classification accuracy.

Through these methods, the paper aims to realize a remote sensing scene classification system that effectively compresses the model size while maintaining high classification accuracy.
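
Two of the ideas summarized above can be made concrete with a small sketch: a symmetric quantizer that optionally applies a non-linear transform before rounding, and the storage arithmetic behind a mixed-precision network. This summary does not give the SNLQ formula, so the `tanh` squashing used here is only an assumed stand-in for "a non-linear transform that damps outliers". The function names, sample weights, layer sizes, and bit widths are all hypothetical.

```python
import math


def quantize_symmetric(values, bits, nonlinear=False):
    """Map real values to signed integer codes on a symmetric grid.

    With nonlinear=True the values are first squashed with tanh. This is an
    assumed stand-in transform, not the SNLQ formula from the paper.
    """
    if bits < 2:
        raise ValueError("need at least 2 bits for a signed symmetric grid")
    transformed = [math.tanh(v) for v in values] if nonlinear else list(values)
    # Largest magnitude sets the scale; fall back to 1.0 for all-zero input.
    scale = max((abs(t) for t in transformed), default=0.0) or 1.0
    levels = 2 ** (bits - 1) - 1  # e.g. codes in [-7, 7] for 4 bits
    return [round(t / scale * levels) for t in transformed]


def model_size_bytes(param_counts, bit_widths):
    """Weight storage when layer i uses bit_widths[i] bits per parameter."""
    if len(param_counts) != len(bit_widths):
        raise ValueError("one bit width is required per layer")
    return sum(n * b for n, b in zip(param_counts, bit_widths)) / 8


# Hypothetical weights with a single outlier (4.0).
weights = [-0.3, -0.1, 0.0, 0.05, 0.2, 4.0]
linear_codes = quantize_symmetric(weights, bits=4)
nonlinear_codes = quantize_symmetric(weights, bits=4, nonlinear=True)
print("distinct linear codes:   ", len(set(linear_codes)))
print("distinct nonlinear codes:", len(set(nonlinear_codes)))

# Hypothetical three-layer network: parameter counts and per-layer bit widths.
params = [1000, 2000, 500]
print("uniform 8-bit size:  ", model_size_bytes(params, [8, 8, 8]), "bytes")
print("mixed-precision size:", model_size_bytes(params, [8, 4, 2]), "bytes")
```

In this toy input the outlier stretches the linear grid so far that most small weights collapse onto the same code, while the squashed variant keeps more of them distinct. The non-linear codes live in the transformed domain, so recovering approximate weights would require inverting the transform. The size function shows why per-layer bit widths matter: lowering the width of a large layer shrinks the model more than lowering that of a small one, and a search method such as NQBS weighs that saving against accuracy.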