Quantification of Lodging Scores of Soybean Breeding Lines Using UAV-based Imagery and Machine Learning
Shagor Sarkar,Jing Zhou,Andrew Scaboo,Jianfeng Zhou
DOI: https://doi.org/10.13031/aim.202300520
2023-01-01
Abstract:Soybean lodging identification on breeding purpose relies on manual measurements and visual inspection by the breeders, which is inefficient, time-consuming, and very subjective to human errors. From the past few years, remote sensing image has become one of the most popular tools for crop lodging identification due to the tools provides features for building machine learning models for the lodging prediction of the cultivars. The goal of this study was to investigate the potential use of UAV-based imagery in identifying lodging of soybean breeding lines that can be used to select lodging tolerance soybean genotypes using UAV-based imagery and machine learning methods. An UAV platform equipped with an RGB (red-green-blue) camera was used to collect the imagery data at flight height of 30m from 1266 four-row plots of soybean breeding fields at the reproductive stage. Soybean lodging scores were visually assessed and measured by the breeders and the scores were grouped into four classes namely, non-lodged, moderate lodged, high lodged and severe lodged. Fourteen textural features such as angular second moment, contrast, correlation, variance, inverse difference moment, sum average, sum variance, sum entropy, entropy, difference variance, difference entropy, information measure of correlation I, information measure of correlation II and maximal correlation coefficient were extracted to classify the soybean lodging scores that are associated with the image features. Using Random Forest (RF) Recursive Feature Elimination (RFE) method was used to select important features. In the preprocessing step, Synthetic Minority Oversampling Technique (SMOTE) and Edited Nearest Neighbors (ENN) method were employed to treat the imbalanced lodging dataset. Later, random forest (RF), K-nearest neighbor (KNN), artificial neural network (ANN) models were developed to classify the four classes of soybean lodging using the image derived texture features. Among all the classification models developed, ANN classification using normalized image feature showed higher precision, recall and overall accuracy (95%) to classify the soybean classes. Whereas RF also showed promising overall accuracy of 94% with minimum misclassification rate of 3%, 3%, 3% and 1%, for non-lodging, medium lodging, high lodging, and severe lodging classes, respectively. The higher classification accuracy of soybean lodging classes reflected and demonstrate that the use of UAV based imagery techniques has the high potentiality to identify and select the non-lodging soybean genotypes.