Discordance between a deep learning model and clinical-grade variant pathogenicity classification in a rare disease cohort

Sek Won Kong,In-Hee Lee,Lauren V Collen,Arjun K Manrai,Scott B Snapper,Kenneth D Mandl
DOI: https://doi.org/10.1101/2024.05.22.24307756
2024-05-23
Abstract:Genetic testing has become an essential component in the diagnosis and management of a wide range of clinical conditions, from cancer to developmental disorders, especially in rare Mendelian diseases. Efforts to identify rare phenotype-associated variants have predominantly focused on protein-truncating variants, while the interpretation of missense variants presents a considerable challenge. Deep learning algorithms excel in various applications across biomedical tasks, yet accurately distinguishing between pathogenic and benign genetic variants remains an elusive goal. Specifically, even the most sophisticated models encounter difficulties in accurately assessing the pathogenicity of missense variants of uncertain significance (VUS). Our investigation of AlphaMissense (AM), the latest iteration of deep learning methods for predicting the potential functional impact of missense variants and assessing gene essentiality, reveals important limitations in its ability to identify pathogenic missense variants within a rare disease cohort. Indeed, AM struggles to accurately assess the pathogenicity of variants in intrinsically disordered regions (IDRs), leading to unreliable gene-level essentiality scores for certain genes containing IDRs. This limitation highlights the challenges in applying AM faces in the context of clinical genetics.
What problem does this paper attempt to address?