Improving the Diversity of Captured Full-Length Isoforms Using a Normalized Single-Molecule RNA-sequencing Method

Yueming Hu,Xing-Sheng Shu,Jiaxian Yu,Ming-an Sun,Zewei Chen,Xianming Liu,Qiongfang Fang,Wei Zhang,Xinjie Hui,Ying,Li Fu,Desheng Lu,Rakesh Kumar,Yejun Wang
DOI: https://doi.org/10.1038/s42003-020-01125-7
IF: 6.548
2020-01-01
Communications Biology
Abstract:Human genes form a large variety of isoforms after transcription, encoding distinct transcripts to exert different functions. Single-molecule RNA sequencing facilitates accurate identification of the isoforms by extending nucleotide read length significantly. However, the gene or isoform diversity is lowly represented by the mRNA molecules captured by single-molecule RNA sequencing. Here, we show that a cDNA normalization procedure before the library preparation for PacBio RS II sequencing captures 3.2–6.0 fold more full-length high-quality isoform species for different human samples, as compared to the non-normalized capture procedure. Many lowly expressed, functionally important isoforms can be detected. In addition, normalized PacBio RNA sequencing also resolves more allele-specific haplotype transcripts. Finally, we apply the cDNA normalization based long-read RNA sequencing method to profile the transcriptome of human gastric signet-ring cell carcinomas, identify new cancer-specific transcriptome signatures, and thus, bring out the utility of the improved protocols in gene expression studies.
What problem does this paper attempt to address?