Rapid Prediction of Chemical Ecotoxicity Through Genetic Algorithm Optimized Neural Network Models

Ping Hou,Bu Zhao,Olivier Jolliet,Ji Zhu,Peng Wang,Ming Xu
DOI: https://doi.org/10.1021/acssuschemeng.0c03660
2020-07-20
Abstract:Evaluating potentially hazardous effects of chemicals on ecosystems has always been an important research topic traditionally studied using laboratory or field experiments. Experiment-based ecotoxicity test results are only available for a limited number of chemicals due to the extensive experimental effort and cost. Given the ever-increasing number of chemicals involved in the modern production process and products, rapidly characterizing chemical ecotoxicity at lower costs has become critical for guiding technology and policy development for chemical risk management. In this study, artificial neural network models are developed to predict chemical ecotoxicity (HC<sub>50</sub>) based on experimental data to fill data gaps in a widely used database (USEtox). To reduce the manual tuning effort on optimal network architecture, a genetic algorithm is investigated to automatically search and configure the network architecture. The resulting neural network model reached an average test <i>R</i><sup>2</sup> of 0.632 and had a trivial difference with the global optimal regarding validation MSE. The findings of this study can rapidly predict the ecotoxicity of chemicals and further help to understand the potential risk of chemicals and develop strategies for risk management.The Supporting Information is available free of charge at <a class="ext-link" href="/doi/10.1021/acssuschemeng.0c03660?goto=supporting-info">https://pubs.acs.org/doi/10.1021/acssuschemeng.0c03660</a>.Commonly used activation functions; data filtering result; fitness (i.e., validation MSE) of the best model along eight generations (<a class="ext-link" href="/doi/suppl/10.1021/acssuschemeng.0c03660/suppl_file/sc0c03660_si_001.pdf">PDF</a>)Predicted HC<sub>50</sub> values in USEtox; application domain of the predicted HC<sub>50</sub> values; input variables for prediction; training data (<a class="ext-link" href="/doi/suppl/10.1021/acssuschemeng.0c03660/suppl_file/sc0c03660_si_002.xlsx">XLSX</a>)This article has not yet been cited by other publications.
chemistry, multidisciplinary,engineering, chemical,green & sustainable science & technology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to predict the potential harmful effects of chemicals on the ecosystem quickly and at low cost, especially to predict the ecotoxicity (HC50) of chemicals through a neural network model optimized by a genetic algorithm. Since traditional laboratory or field experimental methods are costly, time - consuming and have low throughput, and cannot meet the needs of the increasing number of chemicals in modern production processes, it is necessary to develop a method that can quickly fill data gaps to guide the technological and policy development of chemical risk management. The model proposed in the paper aims to use the existing experimental data to improve the prediction efficiency and accuracy of chemical ecotoxicity through machine - learning techniques.