Prediction of Retention Times for a Large Set of Pesticides Based on Improved Gene Expression Programming

Kejun Zhang,Shouqian Sun,Hongzong Si
DOI: https://doi.org/10.1145/1389095.1389429
2008-01-01
Abstract:The purpose of the paper is to present a novel way to building Quantitative structure-retention relationship (QSRR) models. Studies was reported for predicting the retention times (RTs) of 110 pesticides which were detected by gas chromatography (GC) with mass selective detector (MSD). Chemical descriptors were calculated from the molecular structure of pesticides and the QSRR models of RTs with descriptors was built using the heuristic method (HM) and Improved Gene Expression Programming (IGEP), respectively. The obtained linear model of HM had a correlation coefficient R 2 = 0.913, with a root mean square error (RMS) S 2 of 0.0387 for the training set, while R2 =0.907, and RMS =0.0408 for the test set. The nonlinear model by IGEP gave better results: for the training set R 2 = 0.971, S 2 = 0.0176 and for the test set R 2 =0.951, S 2 =0.0267. The prediction results from nonlinear model are in agreement with the experimental values The QSRR model also reveals that the gas chromatographic RTs are associated with physicochemical property of pesticides.
What problem does this paper attempt to address?