Abstract: The clinical interpretation of missense variants is critically important in diagnostics because these variants can alter protein structure and cause mild-to-severe effects on phenotype, with significant consequences for disease outcomes and patient management. Many computational tools, known as in silico pathogenicity predictors (ISPPs), have been developed to support the assessment of variant pathogenicity, each addressing different aspects of variant interpretation. Despite their abundance, ISPP predictions often lack accuracy and consistency, primarily due to limited data availability and the presence of erroneous data. This inconsistency can lead to false positive or false negative results in pathogenicity evaluation, highlighting the need for standardization. The sheer number of ISPPs and their varied performance also make it challenging to achieve consensus in predictions, so a comprehensive statistical approach to evaluating and integrating these predictors is essential to improve accuracy. Here, we present a comprehensive statistical analysis comparing 52 available ISPPs, with the aim of enhancing the precision of variant classification. Our work introduces the Variant Analysis with Multiple Pathogenicity Predictors-score (VAMPP-score), a novel statistical framework for the assessment of missense variants. The VAMPP-score leverages the best gene-ISPP matches based on ISPP accuracies, providing a combinatorial weighted score that improves missense variant interpretation. We chose to develop a statistical framework rather than a new ISPP in order to capitalize on the strengths of existing predictors and to address their limitations through an integrative approach.
This approach not only improves the evaluation of missense variants but also offers a flexible statistical framework designed to identify and utilize the best-performing ISPPs. By enhancing the accuracy of genetic diagnostics, particularly in the reanalysis of rare and undiagnosed cases, our framework aims to improve patient outcomes and advance the field of genetic research.
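
To make the idea of a gene-specific, accuracy-weighted combination concrete, the following is a minimal sketch and not the published VAMPP-score formula, which the abstract does not specify. It assumes that each predictor's score has already been rescaled to [0, 1], that a per-gene accuracy is available for each predictor, and that the "best matches" are simply the `top_k` most accurate predictors for the gene. The function name, its parameters, and the weighting rule are all hypothetical.

```python
from typing import Dict


def weighted_gene_score(
    scores: Dict[str, float],
    gene_accuracy: Dict[str, float],
    top_k: int = 3,
) -> float:
    """Accuracy-weighted mean of the top_k most accurate predictors for one gene.

    scores: predictor name -> score for the variant, assumed rescaled to [0, 1]
    gene_accuracy: predictor name -> accuracy of that predictor on this gene
    (Both inputs and the weighting rule are illustrative assumptions.)
    """
    # Only predictors with both a score and a gene-level accuracy can be used.
    usable = [p for p in scores if p in gene_accuracy]
    if not usable:
        raise ValueError("no predictor has both a score and an accuracy")

    # Keep the best-performing predictors for this gene.
    best = sorted(usable, key=lambda p: gene_accuracy[p], reverse=True)[:top_k]

    total_weight = sum(gene_accuracy[p] for p in best)
    if total_weight == 0:
        raise ValueError("all selected accuracies are zero")

    # Weight each retained score by its predictor's accuracy on this gene.
    return sum(gene_accuracy[p] * scores[p] for p in best) / total_weight
```

In this sketch, a predictor that performs poorly on a given gene either contributes little weight or is excluded altogether, which is one simple way to favour the best gene-ISPP matches.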
