Genome Sequencing reveals the impact of non-canonical exon inclusions in rare genetic disease

Georgia Pitsava,Megan Hawley,Light Auriga,Ivan de Dios,Arthur Ko,Sofia Marmolejos,Miguel Almalvez,Ingrid Chen,Kaylee Scozzaro,Jianhua Zhao,Rebekah Barrick,Nicholas Ah Mew,Vincent A. Fusaro,Jonathan LoTempio,Matthew Taylor,Luisa Mestroni,Sharon Graw,Dianna Milewicz,Dongchuan Guo,David R. Murdock,Kinga M. Bujakowska,UCI-GREGoR Consortium,Changrui Xiao Xiao,Emmanuele C. Delot,Seth I. Berger,Eric Vilain
DOI: https://doi.org/10.1101/2024.12.21.24318325
2024-12-26
Abstract:Advancements in sequencing technologies have significantly improved clinical genetic testing, yet the diagnostic yield remains around 30-40%. Emerging sequencing technologies are now being deployed in the clinical setting to address the remaining diagnostic gap. We tested whether short-read genome sequencing could increase diagnostic yield in individuals enrolled into the UCI-GREGoR research study, who had suspected Mendelian conditions and prior inconclusive clinical genetic testing. Two other collaborative research cohorts, focused on aortopathy and dilated cardiomyopathy, consisted of individuals who were undiagnosed but had not undergone harmonized prior testing. We sequenced 353 families (754 participants) and found a molecular diagnosis in 54 (15.3%) of them. Of these diagnoses, 55.5% were previously missed because the causative variants were in regions not interrogated by the original testing. In 9 cases, they were deep intronic variants, 5 of which led to abnormal splicing and cryptic exon inclusion, as directly shown by RNA sequencing. All 5 of these variants had inconclusive spliceAI scores. In 26% of newly diagnosed cases, the causal variant could have been detected by exome sequencing reanalysis. Genome sequencing overcomes multiple limitations of clinical genetic testing, such as inability to call intronic variants and technical limitations. Our findings highlight cryptic exon inclusion as a common mechanism via which deep intronic variants cause Mendelian disease. However, they also reinforce that reanalysis of exome datasets can be a fruitful approach.
What problem does this paper attempt to address?