SQANTI-reads: a tool for the quality assessment of long read data in multi-sample lrRNA-seq experiments.

Netanya Keil,Carolina Monzó,Lauren McIntyre,Ana Conesa
DOI: https://doi.org/10.1101/2024.08.23.609463
2024-09-17
Abstract:SQANTI-reads leverages SQANTI3, a tool for the analysis of the quality of transcript models, to develop a quality control protocol for replicated long-read RNA-seq experiments. The number/distribution of reads, as well as the number/distribution of unique junction chains (transcript splicing patterns), in SQANTI3 structural categories are compiled. Multi-sample visualizations of QC metrics can also be separated by experimental design factors. We introduce new metrics for 1) the identification of potentially under-annotated genes and putative novel transcripts and 2) variation in junction donors and acceptors. All scripts are open source and customizable. Using two different datasets, one from Drosophila and one benchmark dataset from the LRGASP project, we demonstrate how low coverage does not automatically indicate low quality and how strong/weak splicing sites can be readily identified genome wide. SQANTI-reads is open source and available for download at GitHub.
Bioinformatics
What problem does this paper attempt to address?