A Hidden Human Proteome Encoded by 'Non-Coding' Genes

Shaohua Lu, Jing Zhang, Xinlei Lian, Li Sun, Kun Meng, Yang Chen, Zhenghua Sun, Xingfeng Yin, Yaxing Li, Jing Zhao, Tong Wang, Gong Zhang, Qing-Yu He
DOI: https://doi.org/10.1093/nar/gkz646
IF: 14.9
2019-01-01
Nucleic Acids Research
Abstract: It has been a long debate whether the 98% 'noncoding' fraction of human genome can encode functional proteins besides short peptides. With full-length translating mRNA sequencing and ribosome profiling, we found that up to 3330 long non-coding RNAs (lncRNAs) were bound to ribosomes with active translation elongation. With shotgun proteomics, 308 lncRNA-encoded new proteins were detected. A total of 207 unique peptides of these new proteins were verified by multiple reaction monitoring (MRM) and/or parallel reaction monitoring (PRM); and 10 new proteins were verified by immunoblotting. We found that these new proteins deviated from the canonical proteins with various physical and chemical properties, and emerged mostly in primates during evolution. We further deduced the protein functions by the assays of translation efficiency, RNA folding and intracellular localizations. As the new protein UBAP1-AST6 is localized in the nucleoli and is preferentially expressed by lung cancer cell lines, we biologically verified that it has a function associated with cell proliferation. In sum, we experimentally evidenced a hidden human functional proteome encoded by purported lncRNAs, suggesting a resource for annotating new human proteins.