Abstract:Heritability enrichment analysis is an important means of exploring the genetic architecture of complex traits in human genetics. Heritability enrichment is typically defined as the proportion of an SNP subset explained heritability, divided by the proportion of SNPs. Heritability enrichment enables better study of underlying complex traits, such as functional variant/gene subsets, biological networks and metabolic pathways detected through integrating explosively increased omics data. This would be beneficial for genomic prediction of disease risk in humans and genetic values estimation of important economical traits in livestock and plant species. However, in livestock, factors affecting the heritability enrichment estimation of complex traits have not been examined. Previous studies on humans reported that the frequencies, effect sizes, and levels of linkage disequilibrium (LD) of underlying causal variants (CVs) would affect the heritability enrichment estimation. Therefore, the distribution of heritability across the genome should be fully considered to obtain the unbiased estimation of heritability enrichment. To explore the performance of different heritability enrichment models in livestock populations, we used the VanRaden, GCTA and α models, assuming different α values, and the LDAK model, considering LD weight. We simulated three types of phenotypes, with CVs from various minor allele frequency (MAF) ranges: genome-wide (0.005 ≤ MAF ≤ 0.5), common (0.05 ≤ MAF ≤ 0.5), and uncommon (0.01 ≤ MAF < 0.05). The performances of the models with two different subsets (one of which contained known CVs and the other consisting of randomly selected markers) were compared to verify the accuracy of heritability enrichment estimation of functional variant sets. Our results showed that models with known CV subsets provided more robust enrichment estimation. Models with different α values tended to provide stable and accurate estimates for common and genome-wide CVs (relative deviation 0.5–2.2%), while tending to underestimate the enrichment of uncommon CVs. As the α value increased, enrichments from 15.73% higher than true value (i.e., 3.00) to 48.93% lower than true value for uncommon CVs were observed. In addition, the long-range LD windows (e.g., 5000 kb) led to large bias of the enrichment estimations for both common and uncommon CVs. Overall, heritability enrichment estimations were sensitive for the α value assumption and LD weight consideration of different models. Accuracy would be greatly improved by using a suitable model. This study would be helpful in understanding the genetic architecture of complex traits and provides a reference for genetic analysis in the livestock population.

Impact of Linkage Disequilibrium Heterogeneity along the Genome on Genomic Prediction and Heritability Estimation

Accuracy of Genomic Prediction Using Low-Density Marker Panels.

The Impact of Genetic Relationship and Linkage Disequilibrium on Genomic Selection

The Genetic Connectedness Calculated from Genomic Information and Its Effect on the Accuracy of Genomic Prediction

Genomic Prediction Based on Selective Linkage Disequilibrium Pruning of Low-Coverage Whole-Genome Sequence Variants in a Pure Duroc Population

An Efficient Unified Model for Genome-Wide Association Studies and Genomic Selection

The Construction of a Haplotype Reference Panel Using Extremely Low Coverage Whole Genome Sequences and Its Application in Genome-Wide Association Studies and Genomic Prediction in Duroc Pigs.

Leveraging LD eigenvalue regression to improve the estimation of SNP heritability and confounding inflation

The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium.

Association Test Between Haplotypes and Longitudinal Traits in Complex Pedigrees.

Accurate and Efficient Estimation of Local Heritability using Summary Statistics and LD Matrix

Model Comparison of Heritability Enrichment Analysis in Livestock Population

Genomic Prediction Using LD-Based Haplotypes in Combined Pig Populations

An Atlas of Linkage Disequilibrium Across Species

Weighted Single-Step Genomic Best Linear Unbiased Prediction Integrating Variants Selected from Sequencing Data by Association and Bioinformatics Analyses.

Improvement of Genomic Prediction by Integrating Additional Single Nucleotide Polymorphisms Selected from Imputed Whole Genome Sequencing Data.

High density marker panels, SNPs prioritizing and accuracy of genomic selection

Robust Genomic Prediction and Heritability Estimation using Density Power Divergence

The effect of high-density genotypic data and different methods on joint genomic prediction: A case study in large white pigs

A Method to Estimate the Contribution of Regional Genetic Associations to Complex Traits from Summary Association Statistics

Modeling Linkage Disequilibrium Between a Polymorphic Marker Locus and a Locus Affecting Complex Dichotomous Traits in Natural Populations.