Permutationally Invariant Deep Learning Approach to Molecular Fingerprinting with Application to Compound Mixtures

Andrei Buin,Hung Yi Chiang,S. Andrew Gadsden,Faraz A. Alderson
DOI: https://doi.org/10.1021/acs.jcim.0c01097
IF: 6.162
2021-02-04
Journal of Chemical Information and Modeling
Abstract:Recent advancements in deep learning have led to widespread applications of its algorithms to synthetic planning and reaction predictions in the field of chemistry. One major area, known as supervised learning, is being explored for predicting certain properties such as reaction yields and types. Many chemical descriptors known as fingerprints are being explored as potential candidates for reaction properties prediction. However, there are few studies that describe the permutational invariance of chemical fingerprints, which are concatenated at some stage before being fed to deep learning architecture. In this work, we show that by utilizing permutational invariance, we consistently see improved results in terms of accuracy relative to previously published studies. Furthermore, we are able to accurately predict hydrogen peroxide loss with our own dataset, which consists of more than 20 ingredients in each chemical formulation.The Supporting Information is available free of charge at <a class="ext-link" href="/doi/10.1021/acs.jcim.0c01097?goto=supporting-info">https://pubs.acs.org/doi/10.1021/acs.jcim.0c01097</a>.Wei's Original model with neural fingerprints; fingerprints with permutational layers used in modified classification task; fingerprints with permutational layers used in modified classification task; Baseline1 model; and confusion matrices using neural fingerprints on test data Code and data are available at: <a class="extLink" href="https://github.com/phquanta/DeepPermInvFp.git">https://github.com/phquanta/DeepPermInvFp.git</a> (<a class="ext-link" href="/doi/suppl/10.1021/acs.jcim.0c01097/suppl_file/ci0c01097_si_001.pdf">PDF</a>)This article has not yet been cited by other publications.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems
What problem does this paper attempt to address?