An isoform-resolution transcriptomic atlas of colorectal cancer from long-read single-cell sequencing

Zhongxiao Li, Bin Zhang, Jia Jia Chan, Hossein Tabatabaeian, Qing Yun Tong, Xiao Hong Chew, Xiaonan Fan, Patrick Driguez, Charlene Chan, Faith Cheong, Shi Wang, Bei En Siew, Ian Jse-Wei Tan, Kai-Yin Lee, Bettina Lieske, Wai-Kit Cheong, Dennis Kappei, Ker-Kan Tan, Xin Gao, Yvonne Tay
DOI: https://doi.org/10.1016/j.xgen.2024.100641
2024-08-23
Abstract: Colorectal cancer (CRC) ranks as the second leading cause of cancer deaths globally. In recent years, short-read single-cell RNA sequencing (scRNA-seq) has been instrumental in deciphering tumor heterogeneities. However, these studies enable only gene-level quantification and neglect alterations in transcript structures arising from alternative end processing or splicing. In this study, we integrated short- and long-read scRNA-seq of CRC samples to build an isoform-resolution CRC transcriptomic atlas. First, we identified 394 dysregulated transcript structures in tumor epithelial cells, including 299 resulting from various combinations of splicing events. Second, we characterized genes and isoforms associated with epithelial lineages and subpopulations exhibiting distinct prognoses. Among 31,935 isoforms with novel junctions, 330 were supported by The Cancer Genome Atlas RNA-seq and mass spectrometry data. Finally, we built an algorithm that integrated novel peptides derived from open reading frames of recurrent tumor-specific transcripts with mass spectrometry data and identified recurring neoepitopes that may aid the development of cancer vaccines.