Machine Learning Prediction of Structure‐Performance Relationship in Organic Synthesis

Li‐Cheng Yang,Lu‐Jing Zhu,Shuo‐Qing Zhang,Xin Hong
DOI: https://doi.org/10.1002/cjoc.202200039
IF: 5.56
2022-06-06
Chinese Journal of Chemistry
Abstract:Comprehensive Summary Data‐driven approach has emerged as a powerful strategy in the construction of structure‐performance relationships in organic synthesis. To close the gap between mechanistic understanding and synthetic prediction, we have made efforts to implement mechanistic knowledge in machine learning modelling of organic transformation, as a way to achieve accurate predictions of reactivity, regio‐ and stereoselectivity. We have constructed a comprehensive and balanced computational database for target radical transformations (arene C–H functionalization and HAT reaction), which laid the foundation for the reactivity and selectivity prediction. Furthermore, we found that the combination of computational statistics and physical organic descriptors offers a practical solution to build machine learning structure‐performance models for reactivity and regioselectivity. To allow machine learning modelling of stereoselectivity, a structured database of asymmetric hydrogenation of olefins was built, and we designed a chemical heuristics‐based hierarchical learning approach to effectively use the big data in the early stage of catalysis screening. Our studies reflect a tiny portion of the exciting developments of machine learning in organic chemistry. The synergy between mechanistic knowledge and machine learning will continue to generate a strong momentum to push the limit of reaction performance prediction in organic chemistry. This article is protected by copyright. All rights reserved.
chemistry, multidisciplinary
What problem does this paper attempt to address?