Decomposed Direct Preference Optimization for Structure-Based Drug Design

Xiwei Cheng,Xiangxin Zhou,Yuwei Yang,Yu Bao,Quanquan Gu
2024-10-28
Abstract:Diffusion models have achieved promising results for Structure-Based Drug Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are relatively scarce, which hinders the models' generation capabilities. Recently, Direct Preference Optimization (DPO) has emerged as a pivotal tool for aligning generative models with human preferences. In this paper, we propose DecompDPO, a structure-based optimization method aligns diffusion models with pharmaceutical needs using multi-granularity preference pairs. DecompDPO introduces decomposition into the optimization objectives and obtains preference pairs at the molecule or decomposed substructure level based on each objective's decomposability. Additionally, DecompDPO introduces a physics-informed energy term to ensure reasonable molecular conformations in the optimization results. Notably, DecompDPO can be effectively used for two main purposes: (1) fine-tuning pretrained diffusion models for molecule generation across various protein families, and (2) molecular optimization given a specific protein subpocket after generation. Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO significantly improves model performance, achieving up to 95.2% Med. High Affinity and a 36.2% success rate for molecule generation, and 100% Med. High Affinity and a 52.1% success rate for molecular optimization.
Biomolecules,Machine Learning
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to address several key challenges in structure - based drug design (SBDD): 1. **Scarcity of high - quality data**: High - resolution protein sub - pocket and ligand data are relatively scarce, which limits the generation ability of the generative model. High - quality data is crucial for training effective generative models, but in the field of drug design, collecting such data is very difficult due to the complex and resource - intensive experimental process. 2. **Alignment of generative models with actual requirements**: Although existing generative models have made certain progress in generating molecules, the molecules they generate often cannot fully meet the actual requirements of drug design. For example, the generated molecules may have low binding affinity or unreasonable molecular conformations. 3. **Flexibility in multi - objective optimization**: Drug design usually needs to consider multiple target properties, such as binding affinity, drug - likeness (QED), synthetic accessibility (SA), etc. Existing optimization methods have certain limitations in handling multi - objective optimization and it is difficult to optimize multiple properties simultaneously. ### Solutions To solve the above problems, the authors propose **DECOMP DPO** (Decomposed Direct Preference Optimization), a structure - based optimization method that aligns the generative model through multi - granularity preferences to meet the actual requirements of drug design. Specifically: - **Decomposing the optimization objective**: DECOMP DPO introduces the concept of decomposition, decomposing the optimization objective into molecular - level or sub - structure - level preference pairs. This method improves the flexibility of optimization, enabling the model to more effectively align preferences at different granularities. - **Energy term with physical constraints**: To ensure that the generated molecules have reasonable conformations, DECOMP DPO introduces an energy term with physical constraints, penalizing those molecular conformations that do not conform to physical rules. - **Linear beta scheduling**: To improve the optimization efficiency, DECOMP DPO proposes a linear beta scheduling method, gradually reducing the influence of regularization, thereby better aligning the required properties. ### Application scenarios DECOMP DPO can be applied to two main scenarios: 1. **Molecular generation across protein families**: By fine - tuning the pre - trained diffusion model, generate high - quality molecules for different protein families. 2. **Molecular optimization of specific protein sub - pockets**: After generating molecules, perform molecular optimization for specific protein sub - pockets to further improve the binding affinity and other properties of the molecules. ### Experimental results The experimental results show that DECOMP DPO significantly improves the model performance on the CrossDocked2020 benchmark dataset, specifically: - **Molecular generation**: It reaches a median high - affinity of 95.2% with a success rate of 36.2%. - **Molecular optimization**: It reaches a median high - affinity of 100% with a success rate of 52.1%. These results indicate that DECOMP DPO has significant advantages in generating and optimizing high - quality molecules.