Prediction and Feature Analysis of Intron Retention Events in Plant Genome

Ying Cui,Chao Zhang,Meng Cai
DOI: https://doi.org/10.1016/j.compbiolchem.2017.04.004
IF: 3.737
2017-01-01
Computational Biology and Chemistry
Abstract:Alternative splicing (AS) is a major contributor to increase the potential informational content of eukaryotic genomes by creating multiple mRNA species and proteins from a single gene. In plants, up to 60% genes are alternatively spliced and the most common type of AS is intron retention (IR). Genomic analyses of IR have illuminated its crucial role in shaping the evolution of genomes, in the control of developmental processes, and in the dynamic regulation of the transcriptome to influence phenotype. To explore the relationship between the sequence feature and the formation mechanism of IR, we statistically analyzed the retained introns and proposed an improved random forest-based hybrid method to predict intron retention events in plant genome. The results indicate that IR has significant relationship with individual introns which have weaker 5’ splice sites, lower GC content and less termination codon occurrence. By the method we proposed, 93.48% retained introns can be correctly distinguished from constitutive introns. Strikingly, our study will facilitate a better understanding of underlying mechanisms involved in intron retention.
What problem does this paper attempt to address?