A Proteogenomic Approach to Understand Splice Isoform Functions Through Sequence and Expression-Based Computational Modeling.

Hong-Dong Li,Gilbert S. Omenn,Yuanfang Guan
DOI: https://doi.org/10.1093/bib/bbv109
IF: 9.5
2016-01-01
Briefings in Bioinformatics
Abstract:The products of multi-exon genes are a mixture of alternatively spliced isoforms, from which the translated proteins can have similar, different or even opposing functions. It is therefore essential to differentiate and annotate functions for individual isoforms. Computational approaches provide an efficient complement to expensive and time-consuming experimental studies. The input data of these methods range from DNA sequence, to RNA selection pressure, to expressed sequence tags, to full-length complementary DNA, to exon array, to RNA-seq expression, to proteomic data. Notably, RNA-seq technology generates quantitative profiling of transcript expression at the genome scale, with an unprecedented amount of expression data available for developing isoform function prediction methods. Integrative analysis of these data at different molecular levels enables a proteogenomic approach to systematically interrogate isoform functions. Here, we briefly review the state-of-the-art methods according to their input data sources, discuss their advantages and limitations and point out potential ways to improve prediction accuracies.
What problem does this paper attempt to address?