Protein Function Prediction with High-Throughput Data

Xing-Ming Zhao,Luonan Chen,Kazuyuki Aihara
DOI: https://doi.org/10.1007/s00726-008-0077-y
IF: 3.7891
2008-01-01
Amino Acids
Abstract:Protein function prediction is one of the main challenges in post-genomic era. The availability of large amounts of high-throughput data provides an alternative approach to handling this problem from the computational viewpoint. In this review, we provide a comprehensive description of the computational methods that are currently applicable to protein function prediction, especially from the perspective of machine learning. Machine learning techniques can generally be classified as supervised learning, semi-supervised learning and unsupervised learning. By classifying the existing computational methods for protein annotation into these three groups, we are able to present a comprehensive framework on protein annotation based on machine learning techniques. In addition to describing recently developed theoretical methodologies, we also cover representative databases and software tools that are widely utilized in the prediction of protein function.
What problem does this paper attempt to address?