An Empirical Evaluation of Zeroth-Order Optimization Methods on AI-driven Molecule Optimization

Elvin Lo,Pin-Yu Chen
DOI: https://doi.org/10.48550/arXiv.2210.16099
IF: 6.064
2022-10-27
Biomolecules
Abstract:Molecule optimization is an important problem in chemical discovery and has been approached using many techniques, including generative modeling, reinforcement learning, genetic algorithms, and much more. Recent work has also applied zeroth-order (ZO) optimization, a subset of gradient-free optimization that solves problems similarly to gradient-based methods, for optimizing latent vector representations from an autoencoder. In this paper, we study the effectiveness of various ZO optimization methods for optimizing molecular objectives, which are characterized by variable smoothness, infrequent optima, and other challenges. We provide insights on the robustness of various ZO optimizers in this setting, show the advantages of ZO sign-based gradient descent (ZO-signGD), discuss how ZO optimization can be used practically in realistic discovery tasks, and demonstrate the potential effectiveness of ZO optimization methods on widely used benchmark tasks from the Guacamol suite. Code is available at: https://github.com/IBM/QMO-bench.
What problem does this paper attempt to address?
This paper attempts to solve several key problems in molecular optimization, which are crucial in chemical discovery. Specifically, the paper focuses on how to use Zeroth - Order (ZO) optimization methods to optimize molecular targets, which usually have varying smoothness, sparse optimal values and other challenges. The main contribution of the paper lies in evaluating the effectiveness of different ZO optimization methods on molecular optimization problems, especially in terms of convergence speed, convergence accuracy and robustness to unfriendly landscapes of molecular objective functions. The following are the main research points of the paper: 1. **Evaluating the effectiveness of ZO optimization methods**: The paper experimentally compares several different ZO optimization methods, such as ZO - Gradient Descent (ZO - GD), Sign - based ZO - Gradient Descent (ZO - signGD) and ZO - Adaptive Momentum method (ZO - Adam), to evaluate their performance in molecular optimization tasks. 2. **Characteristics of molecular objective functions**: The paper explores the special properties of objective functions in molecular optimization problems, such as activity cliffs, sparse optimal regions and discrete jumps, which make molecular optimization very difficult. 3. **Advantages of Zeroth - Order optimization methods**: The research shows that ZO - signGD is not only superior to other methods in convergence speed, but also shows higher robustness when dealing with unfriendly landscapes of molecular objective functions. Although ZO - signGD may have lower convergence accuracy in other problems (such as adversarial attacks), it shows higher effectiveness and accuracy in molecular optimization. 4. **Practical applications and hybrid methods**: The paper discusses how to apply ZO optimization methods in actual drug discovery scenarios, including the identification of new lead molecules and the optimization of known lead molecules. In addition, a hybrid method that combines ZO optimization methods with other generative models is proposed to improve search efficiency and the ability to meet multiple design constraints. 5. **Experimental verification**: In order to verify the effectiveness of the above methods, the paper conducts experiments on three tasks in the Guacamol benchmark test suite, including perindopril_mpo, zaleplon_mpo and deco_hop. The experimental results show that ZO - signGD performs best in these tasks, especially when dealing with difficult objective function landscapes. In conclusion, through systematic experiments and analysis, this paper shows the potential of ZO optimization methods in molecular optimization, especially the superior performance of ZO - signGD when dealing with complex molecular objective functions. This provides new tools and methods for future drug discovery.