Abstract: Even though alternative RNA splicing was discovered nearly 50 years ago (1977), we still understand very little about most isoforms arising from a single gene, including in which tissues they are expressed and whether their functions differ. Human gene annotations suggest remarkable transcriptional complexity, with approximately 252,798 distinct RNA isoform annotations from 62,710 gene bodies (Ensembl v109; 2023), emphasizing the need to understand their biological effects. For example, 256 gene bodies have ≥50 annotated isoforms and 30 have ≥100, where one protein-coding gene ( ) even has 192 distinct RNA isoform annotations. Whether such isoform diversity results from biological redundancy or spurious alternative splicing (i.e., noise), or whether individual isoforms have specialized functions (even if subtle), remains a mystery for most genes. Recent studies by Aguzzoli-Heberle et al., Leung et al., and Glinos et al. demonstrated that long-read RNAseq enables improved RNA isoform quantification for essentially any tissue, cell type, or biological condition (e.g., disease, development, aging), making it possible to better assess individual isoform expression and function. While each study provided important discoveries related to RNA isoform diversity, deeper exploration is needed. We sought to quantify and characterize real isoform usage across tissues (compared to annotations). We used long-read RNAseq data from 58 GTEx samples across nine tissues (three brain, two heart, muscle, lung, liver, and cultured fibroblasts) generated by Glinos et al. and found considerable isoform diversity within and across tissues. Cerebellar hemisphere was the most transcriptionally complex tissue (22,522 distinct isoforms; 3,726 unique); liver was the least diverse (12,435 distinct isoforms; 1,039 unique). We highlight gene clusters exhibiting high tissue-specific isoform diversity per tissue (e.g., expresses 19 in heart's atrial appendage).
We also validated 447 of the 700 new isoforms discovered by Aguzzoli-Heberle et al. and found that 88 were expressed in all nine tissues, while 58 were specific to a single tissue. This study represents a broad survey of the RNA isoform landscape; it demonstrates isoform diversity across nine tissues and emphasizes the need to better understand how individual isoforms from a single gene body contribute to human health and disease.
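
The per-tissue counts reported above (distinct isoforms, isoforms unique to one tissue, and isoforms expressed in all tissues) can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `summarize_isoform_usage`, the input layout (a mapping from tissue name to the set of isoform IDs called "expressed"), and the toy isoform IDs are all assumptions made for illustration, and the expression threshold that defines "expressed" is assumed to be applied upstream.

```python
from collections import Counter


def summarize_isoform_usage(expressed_by_tissue):
    """Summarize isoform usage across tissues.

    expressed_by_tissue: dict mapping tissue name -> set of isoform IDs
    considered expressed in that tissue (hypothetical input layout; the
    expression cutoff is applied before calling this function).
    """
    # Count in how many tissues each isoform is expressed.
    tissue_counts = Counter(
        iso for isoforms in expressed_by_tissue.values() for iso in set(isoforms)
    )
    n_tissues = len(expressed_by_tissue)

    # Per tissue: number of distinct isoforms, and how many of those
    # are expressed in no other tissue.
    per_tissue = {
        tissue: {
            "distinct": len(set(isoforms)),
            "unique": sum(1 for iso in set(isoforms) if tissue_counts[iso] == 1),
        }
        for tissue, isoforms in expressed_by_tissue.items()
    }

    # Isoforms expressed in every tissue.
    shared_by_all = sum(1 for c in tissue_counts.values() if c == n_tissues)
    return per_tissue, shared_by_all


# Toy input with made-up isoform IDs, not data from the study.
toy = {
    "liver": {"iso1", "iso2"},
    "lung": {"iso1", "iso3", "iso4"},
    "cerebellar_hemisphere": {"iso1", "iso3", "iso5", "iso6"},
}
per_tissue, shared_by_all = summarize_isoform_usage(toy)
```

In this toy input, `iso1` is the only isoform shared by all three tissues, and `iso2` is the only one unique to liver. Representing each tissue as a set keeps the summary independent of how expression was quantified, so a different threshold only changes the input sets.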

FLIBase: a Comprehensive Repository of Full-Length Isoforms Across Human Cancers and Tissues

Improving the Diversity of Captured Full-Length Isoforms Using a Normalized Single-Molecule RNA-sequencing Method

LAFITE Reveals the Complexity of Transcript Isoforms in Subcellular Fractions

Targeted transcriptome analysis using synthetic long read sequencing uncovers isoform reprograming in the progression of colon cancer

Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing

Abstract 4356: An isoform-resolution transcriptomic atlas of colorectal cancer from long-read single-cell sequencing

An isoform-resolution transcriptomic atlas of colorectal cancer from long-read single-cell sequencing

Comprehensive analysis of full-length transcripts reveal aberrations of splicing variants in liver cancer

RJunBase: a database of RNA splice junctions in human normal and cancerous tissues

An optimized workflow of full-length transcriptome sequencing for accurate fusion transcript identification

IFDlong: an isoform and fusion detector for accurate annotation and quantification of long-read RNA-seq data

Long-read transcriptome landscapes of primary and metastatic liver cancers at transcript resolution

Full-Length Immune Repertoire Reconstruction and Profiling at the Transcriptome Level Using Long-Read Sequencing

CTAT-LR-fusion: accurate fusion transcript identification from long and short read isoform sequencing at bulk or single cell resolution

An Expanded Landscape of Human Long Noncoding RNA

Surveying the landscape of RNA isoform diversity and expression across 9 GTEx tissues using long-read sequencing data

Discovery of Novel Genes and Gene Isoforms by Integrating Transcriptomic and Proteomic Profiling from Mouse Liver

Systematic Characterization of Cancer Transcriptome at Transcript Resolution

Full-length transcript characterization of SF3B1 mutation in chronic lymphocytic leukemia reveals downregulation of retained introns

Single-cell long-read targeted sequencing reveals transcriptional variation in ovarian cancer

Fine mapping of RNA isoform diversity using an innovative targeted long-read RNA sequencing protocol with novel dedicated bioinformatics pipeline