Abstract:The strategy of combining reference populations has been widely recognized as an effective way to enhance the accuracy of genomic prediction (GP). This study investigated the efficiency of genomic prediction using prior information and combined reference population. In total, prior information considering trait-associated single nucleotide polymorphisms (SNPs) obtained from meta-analysis of genome-wide association studies (GWAS meta-analysis) was incorporated into three models to assess the performance of GP using combined reference populations. Two different Yorkshire populations with imputed whole genome sequence (WGS) data (9,741,620 SNPs), named as P1 (1259 individuals) and P2 (1018 individuals), were used to predict genomic estimated breeding values for three live carcass traits, including backfat thickness, loin muscle area, and loin muscle depth. A 10 × 5 fold cross-validation was used to evaluate the prediction accuracy of 203 randomly selected candidate pigs from the P2 population and the reference population consisted of the remaining pigs from P2 and the stepwise added pigs from P1. By integrating SNPs with different p-value thresholds from GWAS meta-analysis downloaded from PigGTEx Project, the prediction accuracy of GBLUP, genomic feature BLUP (GFBLUP) and GBLUP given genetic architecture (BLUP|GA) were compared. Moreover, we explored effects of reference population size and heritability enrichment of genomic features on the prediction accuracy improvement of GFBLUP and BLUP|GA relative to GBLUP. The prediction accuracy of GBLUP using all WGS markers showed average improvement of 4.380% using the P1 + P2 reference population compared with the P2 reference population. Using the combined reference population, GFBLUP and BLUP|GA yielded 6.179% and 5.525% higher accuracies than GBLUP using all SNPs based on the single reference population, respectively. Positive regression coefficients were estimated in relation to the improvement in prediction accuracy (between GFBLUP/BLUP|GA and GBLUP) and the size of the reference as well as the heritability enrichment of genomic features. Compared to the classic GBLUP model, GFBLUP and BLUP|GA models integrating GWAS meta-analysis information increase the prediction accuracy and using combined populations with enlarged reference population size further enhances prediction accuracy of the two approaches. The heritability enrichment of genomic features can be used as an indicator to reflect weather prior information is accurately presented.

Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations

High density marker panels, SNPs prioritizing and accuracy of genomic selection

Integration of Ssgwas and ROH Analyses for Uncovering Genetic Variants Associated with Reproduction Traits in Large White Pigs.

The effect of high-density genotypic data and different methods on joint genomic prediction: A case study in large white pigs

The Construction of a Haplotype Reference Panel Using Extremely Low Coverage Whole Genome Sequences and Its Application in Genome-Wide Association Studies and Genomic Prediction in Duroc Pigs.

Improving GWAS discovery and genomic prediction accuracy in biobank data

Improving lodgepole pine genomic evaluation using spatial correlation structure and SNP selection with single-step GBLUP

GWABLUP: genome-wide association assisted best linear unbiased prediction of genetic values

Using imputation-based whole-genome sequencing data to improve the accuracy of genomic prediction for combined populations in pigs

Utility of whole-genome sequence data for across-breed genomic prediction

Large-scale Genotyping of Complex DNA

Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets

A method to obtain exact single-step GBLUP for non-genotyped descendants when the genomic relationship matrix of ancestors is not available

Integrating large-scale meta-analysis of genome-wide association studies improve the genomic prediction accuracy for combined pig populations

Prediction of Complex Human Traits Using the Genomic Best Linear Unbiased Predictor

3D-GBS: a universal genotyping-by-sequencing approach for genomic selection and other high-throughput low-cost applications in species with small to medium-sized genomes

Meta‐analysis of genome‐wide association from genomic prediction models

Comparing algorithms to approximate accuracies for single-step genomic best linear unbiased predictor

Factors Affecting the Accuracy of Genomic Selection for Agricultural Economic Traits in Maize, Cattle, and Pig Populations

Efficient weighting methods for genomic best linear-unbiased prediction (BLUP) adapted to the genetic architectures of quantitative traits

The Usage of an SNP-SNP Relationship Matrix for Best Linear Unbiased Prediction (BLUP) Analysis Using a Community-Based Cohort Study