MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach

Yicheng Liu, Tianyun Zhang, Ningyuan You, Sai Wu, Ning Shen
DOI: https://doi.org/10.1186/s13073-023-01274-4
IF: 15.266
2024-01-11
Genome Medicine
Abstract: Identifying pathogenic variants from the vast majority of nucleotide variation remains a challenge. We present a method named Multimodal Annotation Generated Pathogenic Impact Evaluator (MAGPIE) that predicts the pathogenicity of multi-type variants. MAGPIE uses the ClinVar dataset for training and demonstrates superior performance in both the independent test set and multiple orthogonal validation datasets, accurately predicting variant pathogenicity. Notably, MAGPIE performs best in predicting the pathogenicity of rare variants and highly imbalanced datasets. Overall, results underline the robustness of MAGPIE as a valuable tool for predicting pathogenicity in various types of human genome variations. MAGPIE is available at https://github.com/shenlab-genomics/magpie.
genetics & heredity