Learning the Drug Target-Likeness of a Protein

Huan Xu, HangYang Xu, MingZhi Lin, Wei Wang, Zimu Li, Jiaju Huang, YuZong Chen, Xin Chen
DOI: https://doi.org/10.1002/pmic.200700062
2007-01-01
PROTEOMICS
Abstract: Current drug discovery and development approaches rely extensively on the identification and validation of appropriate targets; for example, those with marketable and robust therapeutics. Wide-ranging efforts have been directed at this problem and various approaches have been developed to identify disease-associated genes as candidates. In this work, we show with statistical significance that successful drug targets, in addition to their linkage to disease, share common characteristics that are disease-independent. For example, marked differences in functional category, tissue specificity, and sequence variability are observed between known targets and average proteins. These results lead to an interesting hypothesis: potentially good drug targets shall have some desired properties, which we refer to as "drug target-likeness" that are beyond their disease-associations. Because of the limited availability of comprehensive protein characteristics data, we tried to learn the drug target-likeness property at the sequence level. Results show that a support vector machine model is able to accurately distinguish targets from non-targets entirely with sequence features. It is our hope that these encouraging results will invite future systematic proteomic scale experiments to gather necessary protein characteristics data for the accurate and predictive definition of "drug target-likeness", providing a new perspective toward understanding and pursuing effective therapeutics.
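To make "sequence features" concrete, here is a minimal sketch of one common sequence-level descriptor, amino-acid composition. The abstract does not say which features the authors used, so the choice of descriptor, the function name `composition_features`, and the toy sequences are all assumptions made for illustration, not the paper's method. Vectors of this kind are what one would typically pass, with target/non-target labels, to a support vector machine.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so every vector has the same layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition_features(sequence):
    """Return the fraction of each standard amino acid in `sequence`.

    Assumption: amino-acid composition stands in for the paper's
    unspecified sequence features. Non-standard residues are ignored.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Hypothetical toy sequences, not real target or non-target proteins.
toy_sequences = {"toy_a": "MKTAYIAKQR", "toy_b": "GGGGSSSSAA"}
features = {name: composition_features(seq) for name, seq in toy_sequences.items()}
```

Each protein becomes a fixed-length vector of 20 fractions that sum to 1, regardless of sequence length, which is what lets a classifier compare proteins of different sizes.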