Machine learning-optimized targeted detection of alternative splicing

Kevin Yang,Nathaniel Islas,San Jewell,Anupama Jha,Caleb M. Radens,Jeffrey A. Pleiss,Kristen W. Lynch,Yoseph Barash,Peter S. Choi
DOI: https://doi.org/10.1101/2024.09.20.614162
2024-09-24
Abstract:RNA-sequencing (RNA-seq) is widely adopted for transcriptome analysis but has inherent biases which hinder the comprehensive detection and quantification of alternative splicing. To address this, we present an efficient targeted RNA-seq method that greatly enriches for splicing-informative junction-spanning reads. Local Splicing Variation sequencing (LSV-seq) utilizes multiplexed reverse transcription from highly scalable pools of primers anchored near splicing events of interest. Primers are designed using Optimal Prime, a novel machine learning algorithm trained on the performance of thousands of primer sequences. In experimental benchmarks, LSV-seq achieves high on-target capture rates and concordance with RNA-seq, while requiring significantly lower sequencing depth. Leveraging deep learning splicing code predictions, we used LSV-seq to target events with low coverage in GTEx RNA-seq data and newly discover hundreds of tissue-specific splicing events. Our results demonstrate the ability of LSV-seq to quantify splicing of events of interest at high-throughput and with exceptional sensitivity.
Genomics
What problem does this paper attempt to address?