Imputation of single nucleotide polymorphism genotypes in ungenotyped sport horses from the genotypes of their progeny

J L Doyle,S Egan,D P Berry
DOI: https://doi.org/10.1016/j.animal.2024.101278
IF: 3.73
animal
Abstract:Many sport horse studbooks worldwide use microsatellite markers for parentage verification. However, many have expressed a desire to introduce genomic selection using genome-wide dense single nucleotide polymorphism (SNP) genotypes to complement their current breeding programmes. Hence, it does not make sense to genotype the same animal for both microsatellite markers and SNP markers. Transitioning to SNP-based parentage verification is an obvious solution but one barrier to this transition is the lack of SNP data on parents from which to verify parentage against. Therefore, the objective of this study was to assess the ability to impute the SNP genotype of a stallion from the genotypes of its progeny, with or without the consideration of the genotype of the progeny's dam. Genotype information from 55 935 SNPs was available on 13 327 horses. A total of 98 stallions had genotype data on 10 progeny and the genotypes of these stallions were used as a test population. Genome-wide genotype imputation was undertaken by combining a family- and population-based imputation approach. Several different scenarios were assessed to quantify the ability to accurately impute the genotype of a stallion based on genotype data of incrementally more half-sibling progeny. Using genomic information from four progeny the average genotype concordance of the imputed sire genotype compared to the actual sire's genotype was 0.932. The average genotype concordance rate increased to 0.960 when the genotypes of 10 progeny were included in the imputation process. The inclusion of the genotypes of the dams of the progeny improved the concordance rate from 0.932 to 0.977 when based on the genotype of just four progeny and their dams and from 0.960 to 0.996 when based on the genotype of 10 progeny and their dams. These results suggest it is possible to impute the genotype of a non-genotyped horse from the genotypes of its progeny and that the inclusion of the genotypes of the dams of the progeny improves this imputation accuracy further.
What problem does this paper attempt to address?