A Novel Single Nucleotide Polymorphisms Quality Control Method in Genome-Wide Association Studies

Yuliang Sun,Renfa Li,Bo Liao,Xiong Li,Zhi Cao
DOI: https://doi.org/10.1166/jctn.2014.3545
2014-01-01
Journal of Computational and Theoretical Nanoscience
Abstract:The quality of single nucleotide polymorphisms (SNPs) is of paramount importance for genome-wide association studies (GWAS) to reduce potential false findings. The SNP genotyping data are not always accurate because of various reasons such as experimental systematic errors. SNP quality control methods commonly use filter-by-extreme filters based on quality control variables of Hardy-Weinberg equilibrium (HWE), missing frequency (MiF) and minor allele frequency (MAF), to remove outliers. These filters neglect the fact that variables may contribute differently for different SNP clusters, and their implementation requires arbitrary thresholds. For this problem, a novel filtering method based on weighted fuzzy kernel clustering algorithm (WFKCA) is described to identify outlier SNPs.
What problem does this paper attempt to address?