QSAR Studies on the Acute Toxicity of Aliphatic Compounds Based on the Supporting Vector Machines

崔毅,蒋军成,潘勇,曹洪印,王睿
DOI: https://doi.org/10.3969/j.issn.1009-6094.2009.05.005
2009-01-01
Abstract:This paper is concerned about its study on the quantitative relationship between the acute toxicity (LC_(50)) and the molecular structure of 106 aliphatic compounds based on the quantitative structure -activity relationship (QSAR) model. The so-called QSAR model is by nature a newly developed method for predicting the properties of chemo informatics based on the basic theory of chemistry that molecular properties are determined by the molecular structures and the intrinsic quantitative relation between molecular structures and the properties of the organic compounds. Aliphatic compounds, as is known, are various with a great deal of uses in our daily life. However, a considerable part of the aliphatic compounds hasn' t yet been tested for their toxicity. For this purpose, we began to relate the properties under question.to the structural parameters in hope to develop a corresponding quantitative model, believing that QSAR can be used to predict such properties of organic compounds from their molecular structures alone. In this paper, we have chosen 4 descriptors which may contribute greatly to the LC_(50) with a variable selection method of genetic algorithm (GA). At the same time, we have also used both the multi-linear regression (MLR) and the new chemo-informatic method in supporting the vector machine (SVM) to simulate the likely quanti- tative relation lying between the above said selected descriptors and LC_(50). Then, we began to test the proposed models with their internal and external validations thoroughly checked. The results of our study prove the robustness and highly predictive ability as well as the deductive power of our generalization. The mean absolute error for the training set and prediction set of SVM model turn out to be 0. 336 and 0. 364, the results of the MLR model have thus been proved credible. Therefore, it can be concluded that our model for testing the quantitative relationship between the acute toxicity and molecular structures of aliphatic compounds is true to the testing results, which can not only be used to predict the acute toxicity of aliphatic compounds for engineering, but also to reveal the major structural factors and their affecting regularities on the acute toxicity of the aliphatic compounds from the point of view of the molecular structure.
What problem does this paper attempt to address?