Identification Of Rna Editing Sites In Chimpanzee By Transcriptome-Wide Sequencing Data

王端青,何涛,汪莉,王玉民,邵卫东
DOI: https://doi.org/10.3724/SP.J.1206.2011.00328
2012-01-01
PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS
Abstract:RNA editing is a widespread post-transcriptional modification mechanism that alters genetic information at the RNA level by nucleotide insertions, deletions or substitutions, which can contribute to the diversification of the transcriptome and proteome. Although tens of thousands of A-to-I RNA editing events have been found in humans, there is limited knowledge of RNA editing in other nonhuman primates. For exploring the mechanism as well as potential functions of the RNA editing events in chimpanzee, we identified RNA editing sites based on chimpanzee RNA-Seq data here. By aligning between RNA-Seq data and chimpanzee genome sequences with TopHat software, all RNA-DNA mismatch sites were regarded as a candidate set. Low quality sites were filtered out by using both genome and transcriptome sequencing quality scores. The other filters containing uncertainty of sequencing at 3'-terminial positions, read coverage, SNP sites and estimated editing level were also applied on the candidate set. Statistical tests based on the Binomial distribution and Bonferroni multiple testing correction were performed on each candidate site to remove random errors between genome and transcriptome. Then, we detected tissue- and sex-specific RNA editing sites using bioinformatics approaches based on the Fisher's exact test and the Bonferroni multiple testing correction. The Two Sample Logo software was used to analyze the feature of the sequences surrounding the RNA editing site. A total of 8 334 RNA editing sites were identified in chimpanzee transcriptome and all. 12 possible categories of discordances were observed. The top four distributions were A-to-G, U-to-C, G-to-A and C-to-U editing sites, which contained 1 995, 1 452, 1 293 and 1 101 sites, respectively. Forty-one editing sites alter amino acid residues, one of them creates a new stop codon which may shorten the KRT31 protein and affect its activity. Three editing sites damage the binding of microRNA potentially. Six hundred and forty and eight hundred and seventy-two RNA editing sites were identified to be tissue-specific and sex-specific respectively. The analysis of base frequencies indicated that all substitution editings have preferences for certain neighbouring nucleotides. RNA editing is widespread in chimpanzee and has important biology function. Our findings paved the way for further exploration of the mechanism of RNA editing in primates.
What problem does this paper attempt to address?