Quantitative model suggests both intrinsic and contextual features contribute to the transcript coding ability determination in cells

Yu-Jian Kang, Jing-Yi Li, Lan Ke, Shuai Jiang, De-Chang Yang, Mei Hou, Ge Gao
DOI: https://doi.org/10.1093/bib/bbab483
IF: 9.5
2021-11-28
Briefings in Bioinformatics
Abstract: Gene transcription and protein translation are two key steps of the ‘central dogma.’ It is still a major challenge to quantitatively deconvolute factors contributing to the coding ability of transcripts in mammals. Here, we propose ribosome calculator (RiboCalc) for quantitatively modeling the coding ability of RNAs in the human genome. In addition to effectively predicting the experimentally confirmed coding abundance via sequence and transcription features with high accuracy, RiboCalc provides interpretable parameters with biological information. Large-scale analysis further revealed a number of transcripts whose coding ability varies across distinct types of cells (i.e. context-dependent coding transcripts), suggesting that, contrary to conventional wisdom, a transcript’s coding ability should be modeled as a continuous spectrum with a context-dependent nature.
biochemical research methods, mathematical & computational biology