Identification of Human Protein Drug Targets Homologues with Data Mining

Feng Yanghe, Wang Tengjiao
2013-01-01
Research Journal of Biotechnology
Abstract: Identifying and validating potential target proteins is the first step in drug discovery and design. An accurate drug target classifier helps test new drug targets more efficiently and economically. In this paper, we analyzed 522 drug targets and 5371 non-drug targets using 38 chemical and physical properties to identify how the two groups differ. The significant differences can be summarized in 9 properties. Based on these sequence features, we trained drug target classifiers with four data mining techniques and obtained lists of potential target proteins. Ten-fold cross-validation shows that the support vector machine (SVM) achieves an accuracy of 81.4%, the highest among these classifiers. By integrating the lists predicted by our classifiers, we identify a set of drug target homologues to support drug discovery.
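As a rough illustration of the 10-fold cross-validation protocol mentioned in the abstract, the sketch below splits a data set of the same size (522 positives, 5371 negatives, 9 features) into ten class-stratified folds and averages the held-out accuracy. Everything in it is an assumption made for illustration: the feature values are synthetic Gaussian noise rather than real protein properties, the function names are hypothetical, the use of stratification is a choice of this sketch, and a simple nearest-centroid classifier stands in for the SVM so that the example needs only the Python standard library. It does not reproduce the paper's method or its 81.4% figure.

```python
import random


def stratified_folds(labels, k=10, seed=0):
    """Split sample indices into k folds, keeping the class ratio in each fold."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        # Deal the shuffled indices of this class round-robin across the folds.
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds


def centroid(rows):
    """Component-wise mean of a list of feature vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def cross_validate(X, y, k=10, seed=0):
    """Mean held-out accuracy of a nearest-centroid classifier over k folds.

    The nearest-centroid rule is a stand-in for the SVM named in the abstract.
    """
    folds = stratified_folds(y, k, seed)
    accuracies = []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        # Fit: one centroid per class, computed from the training folds only.
        centroids = {
            cls: centroid([X[i] for i in train_idx if y[i] == cls])
            for cls in set(y)
        }
        # Predict: assign each held-out sample to the closest centroid.
        correct = sum(
            1
            for i in test_idx
            if min(centroids, key=lambda c: sq_dist(X[i], centroids[c])) == y[i]
        )
        accuracies.append(correct / len(test_idx))
    return sum(accuracies) / k


# Synthetic stand-in data (assumption): 9 features per protein, with the
# "drug target" class shifted by one unit in every feature.
rng = random.Random(42)
N_TARGETS, N_NON_TARGETS, N_FEATURES = 522, 5371, 9
X = [[rng.gauss(1.0, 1.0) for _ in range(N_FEATURES)] for _ in range(N_TARGETS)]
X += [[rng.gauss(0.0, 1.0) for _ in range(N_FEATURES)] for _ in range(N_NON_TARGETS)]
y = [1] * N_TARGETS + [0] * N_NON_TARGETS

folds = stratified_folds(y, k=10)
mean_accuracy = cross_validate(X, y, k=10)
print(f"mean 10-fold accuracy on synthetic data: {mean_accuracy:.3f}")
```

Stratifying the folds matters here because the two classes are very unevenly sized: with roughly ten non-targets per target, an unstratified split could leave some folds with very few positives, making the per-fold accuracy unstable.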