RASA: A Rapid Retrosynthesis-Based Scoring Method for the Assessment of Synthetic Accessibility of Drug-like Molecules

Qi Huang,Lin-Li Li,Sheng-Yong Yang
DOI: https://doi.org/10.1021/ci100216g
IF: 6.162
2011-09-21
Journal of Chemical Information and Modeling
Abstract:In this account, a rapid retrosynthesis-based scoring method for the assessment of synthetic accessibility of drug-like molecules, called RASA (Retrosynthesis-based Assessment of Synthetic Accessibility) is devised. RASA first constructs a synthesis tree for the target molecule based on retrosynthetic analysis; in this process a series of strategies are suggested for limiting combinatorial explosion of the synthesis tree. A scoring function (RASA-score) for the assessment of synthetic accessibility is then proposed based on the optional effective synthetic routes, the complexity of reaction, and the difficulty of separation/purification associated with the most favorable synthetic route. The contributions of individual components are calibrated by linear regression analysis based on the synthetic accessibility estimates of a training set (100 compounds) given by a group of medicinal chemists (G1). Two external test sets (TS1 and TS2), whose synthetic accessibility estimates were given by the group G1 medicinal chemists and another group (G2) of medicinal chemists (from literature), respectively, were adopted for the evaluation of RASA. The correlation coefficient between the calculated RASA-score values and the estimated scores by medicinal chemists for TS1 is 0.807 and that for TS2 is 0.792, which demonstrate the validity and reliability of RASA. The validity and reliability as well as the high speed of RASA and its capability of suggesting synthetic routes enable it a useful tool in drug discovery.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems
What problem does this paper attempt to address?