Abstract:The size of the reference group is among the most critical determinants of genomic estimated breeding values (GEBVs) accuracy. However, small- and medium-sized pig farms often need help accumulating adequate reference data, posing significant challenges to breeding programs. To solve this problem, exploring the potential benefits of combining reference groups of different sizes is necessary to improve GEBV accuracy. The primary objective of this investigation was to assess a more effective statistical model for combined multi-populations and its potential to enhance the accuracy of GEBVs for small and medium populations. Three populations were simulated using the QMSim software, each consisting of different sizes (300, 600, and 1 500, respectively). To assess the impact of heritability on the accuracy of GEBVs, four different levels of heritability (0.05, 0.15, 0.35, and 0.5) were simulated. Simultaneously, to investigate the impact of kinship on multi-populations, the study created four distinct scenarios for the three sizes of populations. These scenarios included: (1) the three groups are all independent, (2) the large group and the small group with a familial connection (n = 1 800), a middle group (n = 600) acting independently with no kinship, (3) the large group with a familial connection to the middle group (n = 2 100) but no connection to the small group (n = 300), and (4) the small group with a familial connection to the middle group (n = 900), while the large group (n = 1 500) acted independently with no kinship. This study evaluates and compares the accuracy of predicting breeding values using four different methods, including genomic best linear unbiased prediction (GBLUP), single-stepGBLUP (ssGBLUP), and two Bayesian models (Bayes A and Bayes B), with varying sizes of reference groups. In each scenario, three different prediction strategies were compared: (1) Merging all three different sizes of populations for predicting, (2) predicting each independent population separately, and (3) the other two populations predict the population. Our findings reveal that combining populations enhances the Bayesian models, with Bayes B yielding the highest accuracy. In independent populations, the best linear unbiased prediction (BLUP) models demonstrated the highest accuracy. However, in cases where populations were related and the heritability was high, the Bayes B model exhibited the highest overall accuracy (slightly higher than BLUP models) in the independent population. Our results underscore the importance of considering population combinations when using genetic models to predict breeding values, particularly for pig farmers with limited resources.

Genomic Prediction Using LD-Based Haplotypes in Combined Pig Populations

Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs

Using Genomic Selection to Improve the Accuracy of Genomic Prediction for Multi-Populations in Pigs

Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations

The Construction of a Haplotype Reference Panel Using Extremely Low Coverage Whole Genome Sequences and Its Application in Genome-Wide Association Studies and Genomic Prediction in Duroc Pigs.

The effect of high-density genotypic data and different methods on joint genomic prediction: A case study in large white pigs

Genomic Prediction for Growth and Reproduction Traits in Pig Using an Admixed Reference Population

Factors Affecting the Accuracy of Genomic Prediction in Joint Pig Populations.

Genomic Prediction of Growth Traits in Yorkshire Pigs of Different Reference Group Sizes Using Different Estimated Breeding Value Models

Improving Genomic Prediction Accuracy of Pig Reproductive Traits Based on Genotype Imputation Using Pre-Selected Markers with Different Imputation Platforms

The Superiority of Multi-Trait Models with Genotype-by-environment Interactions in a Limited Number of Environments for Genomic Prediction in Pigs

Improving Genomic Prediction for Two Yorkshire Populations with a Limited Size Using the Single-Step Method

Integration of Ssgwas and ROH Analyses for Uncovering Genetic Variants Associated with Reproduction Traits in Large White Pigs.

Strategies for Obtaining and Pruning Imputed Whole-Genome Sequence Data for Genomic Prediction

Evaluating the Performance of Genomic Selection on Purebred Population by Incorporating Crossbred Data in Pigs

The Genetic Connectedness Calculated from Genomic Information and Its Effect on the Accuracy of Genomic Prediction

Genomic prediction based on preselected single‐nucleotide polymorphisms from genome‐wide association study and imputed whole‐genome sequence data annotation for growth traits in Duroc pigs

Incorporating genomic annotation into single-step genomic prediction with imputed whole-genome sequence data

Haplotype genomic prediction of phenotypic values based on chromosome distance and gene boundaries using low-coverage sequencing in Duroc pigs

Using Different Single-Step Strategies to Improve the Efficiency of Genomic Prediction on Body Measurement Traits in Pig

A Comprehensive Evaluation of Factors Affecting the Accuracy of Pig Genotype Imputation Using a Single or Multi-Breed Reference Population