Qsar Models for the Dermal Penetration of Polycyclic Aromatic Hydrocarbons Based on Gene Expression Programming

Tao Wang,Hongzong Si,Pingping Chen,Kejun Zhang,Xiaojun Yao
DOI: https://doi.org/10.1002/qsar.200710153
2008-01-01
QSAR & Combinatorial Science
Abstract:Gene Expression Programming (GEP) is a novel machine learning technique. The GEP is used to build nonlinear quantitative structure activity relationship model for the prediction of the Percent of Applied Dose Dermally Absorbed (PADA) over 24 h for polycyclic aromatic hydrocarbons. This model is based on descriptors which are calculated from the molecular structure. Three descriptors are selected from the descriptors pool by Heuristic Method (HM) to build a multivariable linear model. The GEP method produced a nonlinear quantitative model with a correlation coefficient and a mean error of 0.92 and 4.70 for the training set, 0.91 and 7.65 for the test set, respectively. It is shown that the GEP predicted results are in good agreement with experimental ones.
What problem does this paper attempt to address?