Application of Convolutional Neural Network-Based Feature Extraction and Data Fusion for Geographical Origin Identification of Radix Astragali by Visible/Short-Wave Near-Infrared and Near Infrared Hyperspectral Imaging

Qinlin Xiao, Xiulin Bai, Pan Gao, Yong He
DOI: https://doi.org/10.3390/s20174940
IF: 3.9
2020-09-01
Sensors
Abstract: Radix Astragali is a prized traditional Chinese functional food that is used for both medicine and food purposes, with various benefits such as immunomodulation, anti-tumor, and anti-oxidation. The geographical origin of Radix Astragali has a significant impact on its quality attributes. Determining the geographical origins of Radix Astragali is essential for quality evaluation. Hyperspectral imaging covering the visible/short-wave near-infrared range (Vis-NIR, 380–1030 nm) and near-infrared range (NIR, 874–1734 nm) was applied to identify Radix Astragali from five different geographical origins. Principal component analysis (PCA) was utilized to form score images to achieve preliminary qualitative identification. PCA and convolutional neural network (CNN) were used for feature extraction. Measurement-level fusion and feature-level fusion were performed on the original spectra at different spectral ranges and the corresponding features. Support vector machine (SVM), logistic regression (LR), and CNN models based on full wavelengths, extracted features, and fusion datasets were established with excellent results; all the models obtained an accuracy of over 98% for different datasets. The results illustrate that hyperspectral imaging combined with CNN and fusion strategy could be an effective method for origin identification of Radix Astragali.
engineering, electrical & electronic; chemistry, analytical; instruments & instrumentation
What problem does this paper attempt to address?
The problem this paper addresses is how to identify the geographical origin of Radix Astragali effectively. Specifically, the research aims to classify Radix Astragali from different geographical origins accurately by combining visible/short-wave near-infrared (Vis-NIR, 380–1030 nm) and near-infrared (NIR, 874–1734 nm) hyperspectral imaging with convolutional neural networks (CNN) and data fusion strategies.

### Background and the Importance of the Problem

Radix Astragali is an important traditional Chinese medicinal material, widely used as both medicine and food, with multiple health functions such as immunomodulation, anti-tumor, and antioxidant effects. Its quality attributes are significantly affected by its geographical origin, so determining that origin is crucial for quality assessment. Although traditional chemical analysis methods are effective, they are time-consuming, destructive, and difficult to apply on a large scale. In addition, these methods are applicable only under laboratory conditions and are inefficient.

### Research Objectives

To solve the above problems, the research set the following specific objectives:

1. **Explore Feasibility**: Verify whether Vis-NIR and NIR hyperspectral imaging can effectively distinguish Radix Astragali from five different geographical origins.
2. **Feature Extraction**: Use principal component analysis (PCA) and convolutional neural networks (CNN), respectively, to extract features from the original spectra.
3. **Build Classification Models**: Based on full-wavelength spectra, PCA score features, and deep spectral features, construct support vector machine (SVM), logistic regression (LR), and CNN classification models to quantitatively identify Radix Astragali from different geographical origins.
4. **Generate Prediction Maps**: Generate prediction maps of Radix Astragali from different geographical origins in the Vis-NIR and NIR spectral ranges.
5. **Fusion Strategies**: Based on measurement-level and feature-level fusion datasets, construct SVM, LR, and CNN classification models to further improve classification performance.

### Method Overview

- **Sample Preparation**: Radix Astragali samples were collected from Gansu province, Heilongjiang province, Inner Mongolia autonomous region, Shanxi province, and Xinjiang Uygur autonomous region in China.
- **Hyperspectral Image Acquisition**: Two hyperspectral imaging systems were used to cover the Vis-NIR and NIR spectral ranges, respectively.
- **Image Pre-processing and Spectral Extraction**: The spectral information of each sample was extracted by segmenting the region of interest (ROI) from the background and then pre-processed.
- **Data Analysis Methods**: These include PCA, CNN, and two fusion strategies (measurement-level fusion and feature-level fusion).
- **Traditional Discriminant Models**: SVM and LR were used for comparative analysis.
- **Model Evaluation**: Model performance was evaluated with measures such as classification accuracy and cross-validation.

Using these methods, the study shows that hyperspectral imaging combined with CNN and fusion strategies can effectively identify the geographical origin of Radix Astragali, with all models exceeding 98% accuracy.
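
The difference between the two fusion strategies named in the Method Overview can be illustrated with a minimal sketch. This is not the authors' implementation: the data are synthetic, the band counts (50 and 30) and the number of components (5) are arbitrary assumptions, and the function names are hypothetical. Only PCA is shown as the feature extractor, computed here through a singular value decomposition; the paper also uses CNN-extracted features.

```python
import numpy as np


def pca_scores(spectra, n_components):
    """Project mean-centered spectra onto the leading principal components."""
    centered = spectra - spectra.mean(axis=0)
    # Rows of vt are the principal axes, ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def measurement_level_fusion(vis_nir, nir):
    """Concatenate the raw spectra of both ranges, sample by sample."""
    return np.hstack([vis_nir, nir])


def feature_level_fusion(vis_nir, nir, n_components):
    """Extract features from each range separately, then concatenate them."""
    return np.hstack([
        pca_scores(vis_nir, n_components),
        pca_scores(nir, n_components),
    ])


# Synthetic stand-in data: 20 samples, hypothetical band counts per range.
rng = np.random.default_rng(0)
n_samples = 20
vis_nir = rng.normal(size=(n_samples, 50))
nir = rng.normal(size=(n_samples, 30))

fused_raw = measurement_level_fusion(vis_nir, nir)
fused_feat = feature_level_fusion(vis_nir, nir, n_components=5)
print(fused_raw.shape, fused_feat.shape)
```

Either fused matrix would then be passed to a classifier such as SVM, LR, or a CNN. In a real pipeline the PCA axes would be fitted on the training set only and reused to transform the test set; the sketch fits them on all samples for brevity.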