Combining Molecular Dynamics and Machine Learning to Predict Self-Solvation Free Energies and Limiting Activity Coefficients
Julia Gebhardt,Matthias Kiesel,Sereina Riniker,Niels Hansen
DOI: https://doi.org/10.1021/acs.jcim.0c00479
IF: 6.162
2020-08-10
Journal of Chemical Information and Modeling
Abstract:Computational prediction of limiting activity coefficients is of great relevance for process design. For highly nonideal mixtures including molecules with directed interactions, methods that maintain the molecular character of the solvent are most promising. Computational expense and force-field deficiencies are the main limiting factors that prevent the use of high-throughput molecular dynamics (MD) simulations in a predictive setup. The combination of MD simulations and machine learning used in this work accounts for both issues. Comparison to published data including free-energy simulations, COSMO-RS and UNIFAC models, reveals competitive prediction accuracy.The Supporting Information is available free of charge at <a class="ext-link" href="/doi/10.1021/acs.jcim.0c00479?goto=supporting-info">https://pubs.acs.org/doi/10.1021/acs.jcim.0c00479</a>.Table S1 with molecule names and identifiers of 300 molecules representing the set union of the seven data sets employed. Tables S2–S9 reporting additional analysis and method comparison. Additional Figures S1 and S2 presenting learning curves for the SVR model, Figure S3 comparing MDFP to SM12 and MOSCED, and Figures S4–S7 related to the feature importance analysis (<a class="ext-link" href="/doi/suppl/10.1021/acs.jcim.0c00479/suppl_file/ci0c00479_si_001.pdf">PDF</a>)Lists of molecule names, identifiers, and experimental values for the seven different data sets (<a class="ext-link" href="/doi/suppl/10.1021/acs.jcim.0c00479/suppl_file/ci0c00479_si_002.zip">ZIP</a>)This article has not yet been cited by other publications.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems