Abstract:Genomic Selection (GS) has been proved to be a powerful tool for estimating genetic values in plant and livestock breeding. Newly developed sequencing technologies have dramatically reduced the cost of genotyping and significantly increased the scale of genotype data that used for GS. Meanwhile, state-of-the-art statistical methods were developed to make the best use of high marker density genotype data. In this study, 14 traits from four data sets of three species (maize, cattle, and pig) and five influential factors that affect the prediction accuracy were evaluated, including marker density (from 1 to ~600 k), statistical method (GBLUP-A, GBLUP-AD, and BayesR), minor allele frequency (MAF), heritability, and genetic architecture. Results indicate that in the GBLUP method, higher marker density leads to a higher prediction accuracy. In contrast, BayesR method needs more Monte Carlo Markov Chain (MCMC) iterations to reach the convergence and get reliable prediction values. BayesR outperforms GBLUP in predicting high or medium heritability trait that affected by one or several genes with large effects, while GBLUP performs similarly or slightly better than BayesR in predicting low heritability trait that controlled by a large amount of genes with minor effects. Prediction accuracy of trait with complex genetic architecture can be improved by increasing the marker density. Interestingly, for simple traits that controlled by one or several genes with large effects, higher marker density can cause a lower prediction accuracy if the QTN is included, but leads to a higher prediction accuracy if the QTN is excluded. The quantity of genetic markers with low MAF would not significantly affect the prediction accuracy of GBLUP, but results in a bad prediction accuracy performance of BayesR method. Compared with GBLUP-A, GBLUP-AD didn't show any advantages in capturing the non-additive variance for the traits with high heritability. The factors that affected prediction accuracy are discussed in this study and indicate that a combination of either GBLUP or BayesR method with moderate marker density and favorable polymorphism single nucleotide polymorphisms (SNPs) (~25 k SNPs) would always produce a good and stable prediction accuracy with acceptable breeding and computational costs.

Training population selection for (breeding value) prediction

Optimization of genomic selection training populations with a genetic algorithm

The effects of training population design on genomic prediction accuracy in wheat

The Value of Expanding the Training Population to Improve Genomic Selection Models in Tetraploid Potato

Enhancing Across-Population Genomic Prediction for Maize Hybrids

Impact of selective genotyping in the training population on accuracy and bias of genomic selection

Factors Affecting the Accuracy of Genomic Selection for Agricultural Economic Traits in Maize, Cattle, and Pig Populations

Ability of Genomic Prediction to Bi-Parent-Derived Breeding Population Using Public Data for Soybean Oil and Protein Content

Harnessing Genetic Diversity in the USDA Pea Germplasm Collection Through Genomic Prediction

Simulations of multiple breeding strategy scenarios in common bean for assessing genomic selection accuracy and model updating

A Modified Bayesian Optimization Approach for Determining a Training Set to Identify the Best Genotypes from a Candidate Population in Genomic Selection

Effect of genomic prediction on response to selection in forest tree breeding

Optimizing Training Population Data and Validation of Genomic Selection for Economic Traits in Soft Winter Wheat

Maximizing efficiency in sunflower breeding through historical data optimization

Selective Genotyping and Phenotyping for Optimization of Genomic Prediction Models for Populations with Different Diversity

Training Set Optimization for Sparse Phenotyping in Genomic Selection: A Conceptual Overview

Genomic-inferred cross-selection methods for multi-trait improvement in a recurrent selection breeding program

Genomic Selection on Ear Height, Plant Height and Grain Yield in the Primary Testing Stage of Maize Hybrids

Efficient Breeding by Genomic Mating

Improving the Efficiency of Genomic Selection

Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines