Computational Models to Predict Endocrine-Disrupting Chemical Binding with Androgen or Oestrogen Receptors.

Yingjie Chen,Feixiong Cheng,Lu Sun,Weihua Li,Guixia Liu,Yun Tang
DOI: https://doi.org/10.1016/j.ecoenv.2014.08.026
IF: 7.129
2014-01-01
Ecotoxicology and Environmental Safety
Abstract:Rapidly and correctly identifying endocrine-disrupting chemicals (EDCs) is an important issue in environmental risk assessment. Major EDCs are associated with the androgen receptor (AR) and oestrogen receptors (ERs). Because of the high cost and time-consuming nature of experimental tests, in silico methods are valuable alternative tools for the identification of EDCs. In this study, a large dataset related to EDCs was constructed. Each molecule was represented with seven fingerprints, and computational models were subsequently developed to predict AR and ER binders via machine learning methods including k-nearest neighbour (kNN), C4.5 decision tree (C4.5 DT), naïve Bayes (NB), and support vector machine (SVM) algorithms. The best model for predicting AR binders was PubChem Fingerprint-SVM, which exhibited an accuracy of 0.84. For ER binders, the best method was Extended Fingerprint-SVM with an accuracy of 0.79. Moreover, several representative substructure alerts for characterizing EDCs, such as phenol, trifluoromethyl, and annelated rings, were identified using the combination of information gain and substructure frequency analysis. Our study involved a systematic computational assessment of EDCs related to AR and ERs, and provides significant information on the structural characteristics of these chemicals, which are a great help in identifying EDCs.
What problem does this paper attempt to address?