“Quantum-Chemoinformatics” for Design and Discovery of New Molecules and Reactions

Hiroko Satoh,Vincenz-Maria Steiner,Jürg Hutter
DOI: https://doi.org/10.26434/chemrxiv-2024-808lg
2024-03-08
Abstract:We give an overview of the role of “quantum-chemoinformatics” in drug development. Quantum- chemoinformatics is a data-driven chemistry using descriptors on the basis of theoretical chemistry, especially quantum chemistry (QC) and ab initio molecular dynamics (MD) simulations. We focus especially on quantum-chemoinformatics for chemical reaction design and prediction, which is one of the important processes in basic research of drug development. We start with a brief historical overview and then introduces two projects of quantum-cheminformatics. The RMap project uses QC-based chemical reaction route networks for discovery and design of new molecules and reactions. The other project is related to environmental pollution by drug molecules, a property which should be taken into account in drug design and evaluation. The last section describes our recent attempt to accelerate QC-data acquisition by utilizing a limited amount of experimental data and machine learning (ML) technology.
Chemistry
What problem does this paper attempt to address?
The paper focuses on the design and prediction of chemical reactions in drug development, particularly through "quantum-chemoinformatics," a data-driven approach to chemistry. Quantum-chemoinformatics utilizes descriptors based on theoretical chemistry, especially quantum chemistry and ab-initio molecular dynamics simulations, to study chemical reactions. The paper introduces the application of this method in the design and prediction of chemical reaction pathways, which are critical steps in the fundamental research phase of drug development, aiming to efficiently synthesize target molecules, improve yields, reduce byproducts, and decrease costs and environmental impacts. The paper provides an overview of the historical development, with a particular mention of the RMap project, which uses quantum chemistry-based chemical reaction route networks to discover and design new molecules and reactions. Another project focuses on the impact of drug molecules on environmental pollution, which needs to be considered in drug design and evaluation. Additionally, the authors discuss how to utilize limited experimental data and machine learning techniques to accelerate the acquisition of quantum chemistry data and improve the accuracy and efficiency of predictions. The paper mentions the development of various systems for chemical reaction design and prediction, from early rule-based or logic-driven methods to later data-driven methods, and the increasing application of these systems in recent years with the development of artificial intelligence and machine learning technologies. The importance of selecting appropriate machine learning methods, data quality and quantity, and descriptor design for the performance of data-driven systems is emphasized in the research. Finally, the paper proposes a strategy to accelerate the acquisition of quantum chemistry data based on limited experimental data by constructing large datasets and using machine learning models to predict parameters at a lower level of quantum chemistry calculation, in order to achieve more efficient prediction of drug molecule degradation.