Genetic Algorithm Applied to the Selection of Factors in Principal Component: A QSAR Study of Aromatic Hydrocarbons Toxicity to Chlorella vulgaris

Yang Sheng-Long, Wu Yang, Wang Cui-Hua, Yu Hong-Xia, Wang Lian-Shen
DOI: https://doi.org/10.4028/www.scientific.net/AMM.321-324.2065
2013-01-01
Applied Mechanics and Materials
Abstract: Marine ecosystems are affected by aromatic hydrocarbons. Predicting the toxicity of untested aromatic hydrocarbons with quantitative structure-activity relationship (QSAR) models is one of the tasks of safety precaution. To establish a QSAR model relating the physicochemical properties of aromatic hydrocarbons to their inhibition of Chlorella vulgaris (C. vulgaris), the geometries of 25 aromatic hydrocarbons with measured 96 h EC50 values for C. vulgaris were optimized at the B3LYP/6-311G** level by density functional theory (DFT) calculation. Using MATLAB 2010a, the genetic algorithm principal component regression (GAPCR) method was used to develop the QSAR model, which was compared with a traditional principal component regression (PCR) model. The GAPCR method finally selected PC1+PC3+PC5+PC6+PC8. The R² values for the training set, the prediction set, and leave-one-out (LOO) cross-validation were 0.918, 0.956, and 0.933, respectively. The corresponding values for PCR were 0.949, 0.755, and 0.825. Although PCR fits the training set slightly better, GAPCR gives clearly higher R² for prediction and cross-validation, which indicates good generalization capability. The comparison of the two methods indicates that GAPCR gives superior results to the traditional PCR procedure.
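
The abstract does not give the GAPCR implementation (the authors worked in MATLAB), so the following is only a minimal Python sketch of the general idea: a genetic algorithm searches over binary masks of principal components, scoring each mask by the LOO cross-validated R² of a least-squares fit on the selected scores. Everything here is an assumption for illustration: the function names `pca_scores`, `loo_q2`, and `ga_select`, the GA settings (tournament selection, uniform crossover, bit-flip mutation, elitism), the choice of 8 candidate components, and the data, which are synthetic random numbers rather than the paper's descriptors or EC50 values. For brevity the PCA is computed once on all samples rather than inside each LOO fold.

```python
import numpy as np

def pca_scores(X, n_pc):
    """Scores of the first n_pc principal components of autoscaled X."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    U, s, _ = np.linalg.svd(Z, full_matrices=False)
    return U[:, :n_pc] * s[:n_pc]

def loo_q2(T, y, mask):
    """Leave-one-out cross-validated R^2 of least squares on the selected PCs."""
    if not mask.any():
        return -np.inf  # an empty model is never acceptable
    A = np.column_stack([np.ones(len(y)), T[:, mask]])
    press = 0.0
    for i in range(len(y)):
        keep = np.arange(len(y)) != i
        coef, *_ = np.linalg.lstsq(A[keep], y[keep], rcond=None)
        press += (y[i] - A[i] @ coef) ** 2
    return 1.0 - press / np.sum((y - y.mean()) ** 2)

def ga_select(T, y, pop_size=30, n_gen=40, p_mut=0.1, seed=0):
    """Search PC subsets (boolean masks) that maximise LOO R^2."""
    rng = np.random.default_rng(seed)
    n_pc = T.shape[1]
    pop = rng.random((pop_size, n_pc)) < 0.5
    best_mask, best_fit = np.ones(n_pc, dtype=bool), -np.inf
    for _ in range(n_gen):
        fit = np.array([loo_q2(T, y, m) for m in pop])
        i_best = int(np.argmax(fit))
        if fit[i_best] > best_fit:
            best_mask, best_fit = pop[i_best].copy(), fit[i_best]
        # Binary tournament selection
        a, b = rng.integers(pop_size, size=(2, pop_size))
        parents = np.where((fit[a] >= fit[b])[:, None], pop[a], pop[b])
        # Uniform crossover with randomly shuffled partners
        mates = parents[rng.permutation(pop_size)]
        children = np.where(rng.random((pop_size, n_pc)) < 0.5, parents, mates)
        # Bit-flip mutation, then elitism: keep the best mask seen so far
        children ^= rng.random((pop_size, n_pc)) < p_mut
        children[0] = best_mask
        pop = children
    return best_mask, best_fit

# Synthetic stand-in data (NOT the paper's): 25 samples, 10 descriptors,
# response driven by PC1 and PC3 plus noise.
rng = np.random.default_rng(1)
X = rng.normal(size=(25, 10))
T = pca_scores(X, n_pc=8)
y = 1.5 * T[:, 0] - 2.0 * T[:, 2] + rng.normal(scale=0.3, size=25)

mask, q2 = ga_select(T, y)
print("selected PCs:", [f"PC{j + 1}" for j in np.flatnonzero(mask)])
print("LOO R^2:", round(float(q2), 3))
```

Traditional PCR would instead keep the first k components in order of explained variance; in this sketch that corresponds to evaluating `loo_q2` on a mask whose first k entries are true, which makes the two approaches directly comparable on the same scores.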