Abstract:The feasibility was explored for identifying health, nutrient deficiency and citrus greening leaves based on near infrared (NIR) spectroscopy combined with machine learning methods. 232 samples were divided into the calibration and prediction sets for calibrating the models and accessing their performance according to the proportion of 3:1. The calibration set included citrus greening samples of 54, nutrient deficiency samples of 64 and healthy samples of 54. The prediction set included citrus greening samples of 21, nutrient deficiency samples of 17 and healthy samples of 22. The spectra of health, nutrient deficiency and citrus greening leaves were recorded in the wavelength range of 4 000-9 000 cm-1. After compared the representative spectra of health, nutrient deficiency and citrus greening, it was found that two significant differences appeared in the wavenumber bands of 5 100 and 6 880 cm-1. The peak around 6 880 cm-1 was caused by the stretching vibration of O-H first overtone of water and sugar. The difference between the spectra of health and citrus greening leaves was significant around 6 880 cm-1. The spectral intensity of citrus greening leaf was larger than health leaf. The ability of water absorption for citrus greening leaf was interfered with citrus greening. The peak around 5100 cm-1 was associated with the asymmetric vibration of N-H bond. Therefore, the spectral intensity of citrus greening leaf was lower than health leaf in the wavenumber of 5 100 cm-1. This may be related to the loss of nutrient elements in leaves of citrus greening. The study used different preprocessing methods as first derivative, smoothing and multiple scattered correction for spectral calibration. The preprocssing method of first derivative had removed baseline drift and enlarged the role of feature information. And the amplification characteristics of information can also lead to high frequency noise. Therefore, the further pretreatment was conducted by the method of smoothing. Then the scattering effect caused by the uneven thickness of the leaves was eliminated used the multiple scattering correction. Compared with other methods, it was found that the combination of first derivative, smoothing and multiple scatter correction can effectively eliminated the baseline drift and scattering phenomena. The machine learning methods of partial least square discriminate analysis (PLS-DA) and least square support vector machine (LS-SVM) were used to develop the classification models for identifying health, nutrient deficiency and citrus greening leaves. The principal component analysis (PCA) method was applied to optimize the input vectors of PLS-DA and LS-SVM models compared with full spectra. The first 14 and 11 principal components (PCs) were used to the input vectors for PLS-DA and LS-SVM models, respectively. And the regularization factor and the type of kernel function were optimized by the two-step grid search method. Compared to PLS-DA model, LS-SVM model yielded the best results with accuracy rate of 100% for identifying the health, nutrient deficiency and citrus greening. The kernel function type and regularization factor (γ) of the best LS-SVM model were linear kernel function and 2.25. The experimental results showed that it was feasible to identify health, nutrient deficiency and citrus greening leaves by NIR spectroscopy coupled with machine learning method of LS-SVM.

Classification of Orange Growing Locations Based on the Near-infrared Spectroscopy Using Data Mining.

Near-Infrared Spectroscopy For Classification Of Oranges And Prediction Of The Sugar Content

Combination and comparison of multivariate analysis for the identification of orange varieties using visible and near infrared reflectance spectroscopy

[Research on Discrimination Method of Orange Juice Variety Based on Spectroscopy Technology].

Determination of geographical origin of navel orange by near infrared spectroscopy

Nondestructive Detection of Citrus Greening by Near Infrared Spectroscopy

Cluster Analysis Of Citrus Genotypes Using Near-Infrared Spectroscopy

Preliminary Study on the Application of Near Infrared Spectroscopy and Pattern Recognition Methods to Classify Different Types of Apple Samples.

Soluble Solids Content Binary Classification of Miyagawa Satsuma in Chongming Island Based on Near Infrared Spectroscopy

On-line detection of orange soluble solid content using visible and near infrared transmission measurements

Navel Orange Maturity Classification by Multispectral Indexes Based on Hyperspectral Diffuse Transmittance Imaging

Application of near infrared spectroscopy and clustering analysis to classify wines from different origins

A Novel Strategy of Near-Infrared Spectroscopy Dimensionality Reduction for Discrimination of Grades, Varieties and Origins of Green Tea

Discrimination of varieties of apple using near infrared spectroscopy

Automatic and Rapid Discrimination of Cotton Genotypes by Near Infrared Spectroscopy and Chemometrics

Rapid On-site Identification of Geographical Origin and Storage Age of Tangerine Peel by Near-infrared Spectroscopy.

Rapid Classification of Corn Varieties by Using Near Infrared Spectroscopy

A Near-Infrared Reflectance Spectroscopy Method For Direct Analysis Of Several Chemical Components And Properties Of Fruit, For Example, Chinese Hawthorn

On-site Variety Discrimination of Tomato Plant Using Visible-Near Infrared Reflectance Spectroscopy.

Discrimination of varieties of tea using the near infrared spectroscopy

Multi-platform integration based on NIR and UV-Vis spectroscopies for the geographical traceability of the fruits of Amomum tsao-ko