An exploration of 3'-end processing signals and their tissue distribution in Oryza sativa.

Haitao Dong,Ye Deng,Jack Chen,Sheng Wang,Sihua Peng,Cheng'en Dai,Yongqi Fang,Jing Shao,Yichun Lou,Debao Li
DOI: https://doi.org/10.1016/j.gene.2006.10.015
IF: 3.913
2007-01-01
Gene
Abstract:The 3′ untranslated regions deeply affect many properties of eukaryotic mRNA. In plants, the polyadenine control signals contained in these regions seem to be more variable than of mammals. Three cDNA libraries derived from the leaf, endosperm and stem tissues of rice were sequenced from the 3′-end. Of the 9911 transcripts analyzed, 5723 unique transcripts were identified from the leaf sequences, 2934 from the endosperm and 1254 from the stem. The information entropy and two statistical methods were used to compile a list of rice poly(A) control signals. Based on their distribution, these signals can be roughly grouped into far-upstream element (FUE), near-upstream element (NUE), T-rich region (TRE) and downstream element (DE). The distribution of rice conserved regions is similar to the previous model from Arabidopsis and yeast, with a few differences in word constructions. Interestingly, we also found the word distributions were diverse in the cleavage site of downstream sequences of different rice tissues. The signal bias in downstream sequences may lead mRNA to be differently cleaved in different rice tissues.
What problem does this paper attempt to address?