Hard-threshold-Neural-Network based Prediction of Organic Synthetic Outcomes

Haoyang Hu,Zhihong Yuan
DOI: https://doi.org/10.21203/rs.2.16734/v1
2019-01-01
Abstract:Abstract Retrosynthetic analysis is the canonical technique to plan the synthesis route of organic molecules in medicine development. In this technique, the screening of synthetic tree branches requires accurate forward reaction prediction, but existing software is still far from completing this step independently. Previous studies have attempted to apply neural network in the forward reaction prediction, but the accuracy is not satisfying. Through using the Edit-based Description and Extended-Connectivity Fingerprints to transform reaction into vector, the presented work focuses on the update of neural network to improve the template-based forward reaction prediction. Hard-threshold activation and target propagation algorithm are implemented by introducing the mixed-convex combinatorial optimization. Comparative tests are conducted to explore the optimal hyperparameter set. Using 15 000 experimental reaction records from granted United States patents, the proposed hard-threshold neural network is systematically trained and tested. The results demonstrate that a higher prediction accuracy is obtained when compared to the traditional neural network with backpropagation algorithm. Indeed, the prediction accuracy of the proposed hard-threshold neural network can reach 73.9% which is higher than Coley’s result with 71.8% ( Coley et al. ACS Cent. Sci, 2017 ). Some successfully predicted reaction examples are also briefly discussed.
What problem does this paper attempt to address?