Long-read sequencing transcriptome quantification with lr-kallisto

Rebekah K Loving,Delaney K Sullivan,Fairlie Reese,Elisabeth Rebboah,Jasmine Sakr,Narges Rezaie,Heidi Y Liang,Ghassan Filimban,Shimako Kawauchi,Conrad Oakes,Diane Trout,Brian A Williams,Grant MacGregor,Barbara Wold,Ali Mortazavi,Lior Pachter
DOI: https://doi.org/10.1101/2024.07.19.604364
2024-07-19
Abstract:RNA abundance quantification has become routine and affordable thanks to high-throughput short-read technologies that provide accurate molecule counts at the gene level. Similarly accurate and affordable quantification of definitive full-length, transcript isoforms has remained a stubborn challenge, despite its obvious biological significance across a wide range of problems. Long-read sequencing platforms now produce data-types that can, in principle, drive routine definitive isoform quantification. However some particulars of contemporary long-read datatypes, together with isoform complexity and genetic variation, present bioinformatic challenges. We show here, using ONT data, that fast and accurate quantification of long-read data is possible and that it is improved by exome capture. To perform quantifications we developed lr-kallisto, which adapts the kallisto bulk and single-cell RNA-seq quantification methods for long-read technologies.
Biology
What problem does this paper attempt to address?