Efficient Detection of the Alternative Spliced Human Proteome Using Translatome Sequencing
Chun Wu,Xiaolong Lu,Shaohua Lu,Hongwei Wang,Dehua Li,Jing Zhao,Jingjie Jin,Zhenghua Sun,Qing-Yu He,Yang Chen,Gong Zhang
DOI: https://doi.org/10.3389/fmolb.2022.895746
IF: 6.113
2022-01-01
Frontiers in Molecular Biosciences
Abstract:Alternative splicing (AS) isoforms create numerous proteoforms, expanding the complexity of the genome. Highly similar sequences, incomplete reference databases and the insufficient sequence coverage of mass spectrometry limit the identification of AS proteoforms. Here, we demonstrated full-length translating mRNAs (ribosome nascent-chain complex-bound mRNAs, RNC-mRNAs) sequencing (RNC-seq) strategy to sequence the entire translating mRNA using next-generation sequencing, including short-read and long-read technologies, to construct a protein database containing all translating AS isoforms. Taking the advantage of read length, short-read RNC-seq identified up to 15,289 genes and 15,906 AS isoforms in a single human cell line, much more than the Ribo-seq. The single-molecule long-read RNC-seq supplemented 4,429 annotated AS isoforms that were not identified by short-read datasets, and 4,525 novel AS isoforms that were not included in the public databases. Using such RNC-seq-guided database, we identified 6,766 annotated protein isoforms and 50 novel protein isoforms in mass spectrometry datasets. These results demonstrated the potential of full-length RNC-seq in investigating the proteome of AS isoforms.