Investigation of genetic diversity of different spring rapeseed (Brassica napus L.) genotypes and yield prediction using machine learning models

Mohamad Amin Norouzi,Leila Ahangar,Kamal Payghamzadeh,Hossein Sabouri,Sayed Javad Sajadi
DOI: https://doi.org/10.1007/s10722-024-01915-6
2024-03-08
Genetic Resources and Crop Evolution
Abstract:Seed yield is influenced by the combined effects of genes, including additive and non-additive interactions. Therefore, accurately predicting seed yield holds significant importance in rapeseed breeding. Nonetheless, limited information exists regarding yield estimation for canola using neural networks. This study employs multi-layer perceptron (MLP) neural network, radial basis function neural network and support vector machine, to forecast rapeseed yield. The models are trained using phenological, morphological, yield and yield-related data, as well as molecular marker information from 8 genotypes and 56 hybrids. Comparative analysis of the models reveals that the MLP model effectively forecasts hybrid yield with root mean square error (RMSE), mean absolute error (MAE) and coefficient of determination (R 2 ) values of 226, 183, and 92%, respectively. Among the 40 primers examined, the ISJ10 primer demonstrates superior discriminatory power compared to others. The use of molecular and phenotypic data as inputs in the model highlights the MLP model's superiority, presenting lower RMSE and MAE values, along with a higher R 2 , compared to direct crosses in predicting the performance of reciprocal crosses. The proposed neural network model enables performance estimation of hybrids prior to crossing parent studied, thereby enabling spring rapeseed breeders to focus on the most promising hybrids.
plant sciences,agronomy
What problem does this paper attempt to address?