QSRR Model for Identification and Screening of Emerging Pollutants Based on Artificial Intelligence Algorithms

Qi He, Hua Li, Binyan Jin, Wei Li, Bing Shao, Li Zhang
DOI: https://doi.org/10.1080/26395940.2022.2106311
2022-01-01
Environmental Pollutants and Bioavailability
Abstract: There is an urgent need to identify and screen emerging pollutants (EPs), which have caused great harm to human health and the environment. When EPs are detected by liquid chromatography-mass spectrometry (LC-MS), a quantitative structure-retention relationship (QSRR) model offers a simple and efficient way to predict the retention behavior of compounds. In the present work, we collected relative retention time (RRT) data for 490 compounds and filtered the molecular descriptors using lasso regression and multiple linear regression analysis. Ten important molecular descriptors were then selected and used to build QSRR models with a deep neural network (DNN), multiple linear regression (MLR), and a support vector machine. The DNN model was the most accurate, with a correlation coefficient R² of 0.913. Finally, we defined the applicability domain of the DNN model through the value ranges of the descriptors, so that the model can assist in the identification and screening of EPs.
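The applicability check described at the end of the abstract can be illustrated with a minimal sketch. The abstract does not say how the descriptor value range is computed, so this assumes the simplest reading: a compound is inside the domain when each of its descriptor values lies between the minimum and maximum seen in the training set. The descriptor names (`logP`, `TPSA`, `MW`) and all numeric values are hypothetical and are not taken from the paper.

```python
from typing import Dict, List, Tuple

Descriptors = Dict[str, float]
Ranges = Dict[str, Tuple[float, float]]


def descriptor_ranges(training_set: List[Descriptors]) -> Ranges:
    """Return the (min, max) observed for each descriptor in the training set."""
    names = training_set[0].keys()
    return {
        name: (
            min(compound[name] for compound in training_set),
            max(compound[name] for compound in training_set),
        )
        for name in names
    }


def outside_domain(compound: Descriptors, ranges: Ranges) -> List[str]:
    """List the descriptors whose value falls outside the training range."""
    return [
        name
        for name, (low, high) in ranges.items()
        if not low <= compound[name] <= high
    ]


# Hypothetical descriptor values, invented for illustration only.
training = [
    {"logP": 1.2, "TPSA": 40.5, "MW": 180.2},
    {"logP": 3.4, "TPSA": 75.0, "MW": 310.4},
    {"logP": 0.3, "TPSA": 20.1, "MW": 150.1},
]
ranges = descriptor_ranges(training)

inside = {"logP": 2.0, "TPSA": 50.0, "MW": 200.0}
outside = {"logP": 5.1, "TPSA": 50.0, "MW": 200.0}

print(outside_domain(inside, ranges))   # []
print(outside_domain(outside, ranges))  # ['logP']
```

An empty list means the prediction is made within the range the model was trained on; a non-empty list names the descriptors for which the model would be extrapolating, so the predicted retention time should be treated with more caution.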