Selective Genotyping and Phenotyping for Optimization of Genomic Prediction Models for Populations with Different Diversity

Marina Ćeran,Vuk Đorđević,Jegor Miladinović,Marjana Vasiljević,Vojin Đukić,Predrag Ranđelović,Simona Jaćimović
DOI: https://doi.org/10.3390/plants13070975
2024-03-29
Plants
Abstract:To overcome the different challenges to food security caused by a growing population and climate change, soybean (Glycine max (L.) Merr.) breeders are creating novel cultivars that have the potential to improve productivity while maintaining environmental sustainability. Genomic selection (GS) is an advanced approach that may accelerate the rate of genetic gain in breeding using genome-wide molecular markers. The accuracy of genomic selection can be affected by trait architecture and heritability, marker density, linkage disequilibrium, statistical models, and training set. The selection of a minimal and optimal marker set with high prediction accuracy can lower genotyping costs, computational time, and multicollinearity. Selective phenotyping could reduce the number of genotypes tested in the field while preserving the genetic diversity of the initial population. This study aimed to evaluate different methods of selective genotyping and phenotyping on the accuracy of genomic prediction for soybean yield. The evaluation was performed on three populations: recombinant inbred lines, multifamily diverse lines, and germplasm collection. Strategies adopted for marker selection were as follows: SNP (single nucleotide polymorphism) pruning, estimation of marker effects, randomly selected markers, and genome-wide association study. Reduction of the number of genotypes was performed by selecting a core set from the initial population based on marker data, yet maintaining the original population's genetic diversity. Prediction ability using all markers and genotypes was different among examined populations. The subsets obtained by the model-based strategy can be considered the most suitable for marker selection for all populations. The selective phenotyping based on makers in all cases had higher values of prediction ability compared to minimal values of prediction ability of multiple cycles of random selection, with the highest values of prediction obtained using AN approach and 75% population size. The obtained results indicate that selective genotyping and phenotyping hold great potential and can be integrated as tools for improving or retaining selection accuracy by reducing genotyping or phenotyping costs for genomic selection.
plant sciences
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the accuracy of genomic prediction models for soybean yield while reducing the costs of genotyping and phenotyping. Specifically, researchers optimize genomic prediction models in populations with different genetic diversities through the methods of selective genotyping and selective phenotyping. These methods aim to determine the smallest and optimal marker sets to maintain high prediction accuracy, while reducing genotyping costs and computing time and reducing multicollinearity. In addition, the selective phenotyping method aims to reduce the number of genotypes in field tests while retaining the genetic diversity of the initial population. ### Research Background With the growth of the global population and the impact of climate change, soybean (*Glycine max (L.) Merr.*) breeders are faced with the challenges of increasing yield and maintaining environmental sustainability. Genomic Selection (GS) is an advanced breeding technique that can accelerate genetic gain through genome - wide molecular markers. However, the accuracy of genomic selection is affected by multiple factors, including trait architecture and heritability, marker density, linkage disequilibrium, statistical models, and the selection of training sets. Therefore, determining the smallest and optimal marker sets and designing efficient training sets are crucial for improving the efficiency of genomic selection. ### Research Objectives 1. **Evaluate different selective genotyping and phenotyping methods**: Study the impact of different selective genotyping and phenotyping methods on the accuracy of genomic prediction in soybean populations with different genetic diversities. 2. **Optimize genomic prediction models**: Reduce the number of markers and genotypes, reduce the costs of genotyping and phenotyping, while maintaining or improving the accuracy of genomic prediction. 3. **Improve the efficiency of genomic selection**: Explore the feasibility and economy of applying genomic selection in actual breeding projects. ### Research Methods - **Population selection**: The study used three different soybean populations, including Recombinant Inbred Lines (RIL), Multi - family Diverse Lines (MDL), and Germplasm Pool (GPL). - **Marker selection strategies**: Including SNP pruning, marker effect estimation, random marker selection, and Genome - Wide Association Studies (GWAS). - **Phenotype selection strategies**: Select a core set based on marker data to reduce the number of genotypes in field tests while maintaining the genetic diversity of the initial population. ### Main Findings - **Selective genotyping**: In all populations, model - based strategies (such as RE - MoB) are considered the best methods for marker selection. In particular, using a subset of 48 SNPs can maintain prediction ability comparable to that of using all markers in some cases. - **Selective phenotyping**: The Average Nearest - neighbor Distance of Accessions based on Markers (AN) method generally performs better than the random sample selection method, especially in smaller training set sizes. - **Comprehensive impact**: The impact of simultaneously reducing the number of markers and genotypes on prediction ability varies among populations. In the MDL population, when the population size is greater than 50%, the population size has a greater impact on prediction ability; while in smaller population sizes, the number of markers has a more significant impact on prediction ability. ### Conclusions Selective genotyping and phenotyping methods can effectively improve the accuracy of genomic prediction while reducing the costs of genotyping and phenotyping. These methods show good applicability in soybean populations with different genetic diversities and provide strong support for genomic selection in actual breeding projects.