Discovering putative peptides encoded from non-coding RNAs in ribosome profiling data of Arabidopsis thaliana.
Qilin Li,Md Asif Ahsan,Hongjun Chen,Jitong Xue,Ming Chen
DOI: https://doi.org/10.1021/acssynbio.7b00386
IF: 5.249
2018-01-01
ACS Synthetic Biology
Abstract:Most of non-coding RNAs are considered to express at low levels and have a limited phylogenetic distribution in the cytoplasm, meaning that they may be only involved in specific biological processes. However, recent studies showed the protein-coding potential of ncRNAs, indicating that they might be source of some special proteins. Although there are increasing non-coding RNAs identified to be able to code proteins, it is challenging to distinguish coding RNAs from previously annotated ncRNAs, and to detect the proteins from their translation. In this article, we designed a pipeline consisted of CIPHER, TransDecoder and other perl scripts to identify these non-coding RNAs in Arabidopsis thaliana from three NCBI GEO datasets with coding potential and predict their translation products. 31,311 non-coding RNAs were predicted to be translated into peptides, and they showed lower conservation rate than common proteins. In addition, we built an interaction network between these peptides and annotated Arabidopsis proteins using BIPS, which included 69 peptides from non-coding RNAs. Peptides in the interaction network showed different characteristics from other non-coding RNA-derived peptides, and they participated in several crucial biological processes, such as photorespiration and stress-responses. All the Information of putative ncPEPs and their interaction with proteins predicted above are finally integrated in a PHP-MySQL database, PncPEPDB (http://bis.zju.edu.cn/PncPEPDB). These results showed that peptides derived from non-coding RNAs may be important roles in non-coding RNA regulation, which provided another hypothesis that non-coding RNA may regulate the metabolism via their translation products.