Abstract: Diffusion models have achieved promising results for Structure-Based Drug Design (SBDD). Nevertheless, high-quality protein subpocket and ligand data are relatively scarce, which hinders the models' generation capabilities. Recently, Direct Preference Optimization (DPO) has emerged as a pivotal tool for aligning generative models with human preferences. In this paper, we propose DecompDPO, a structure-based optimization method that aligns diffusion models with pharmaceutical needs using multi-granularity preference pairs. DecompDPO introduces decomposition into the optimization objectives and obtains preference pairs at the molecule or decomposed substructure level based on each objective's decomposability. Additionally, DecompDPO introduces a physics-informed energy term to ensure reasonable molecular conformations in the optimization results. Notably, DecompDPO can be effectively used for two main purposes: (1) fine-tuning pretrained diffusion models for molecule generation across various protein families, and (2) molecular optimization given a specific protein subpocket after generation. Extensive experiments on the CrossDocked2020 benchmark show that DecompDPO significantly improves model performance, achieving up to 95.2% Med. High Affinity and a 36.2% success rate for molecule generation, and 100% Med. High Affinity and a 52.1% success rate for molecular optimization.

What problem does this paper attempt to address?

### Problems the paper attempts to solve

This paper aims to address several key challenges in structure-based drug design (SBDD):

1. **Scarcity of high-quality data**: High-quality protein subpocket and ligand data are relatively scarce, which limits the generation ability of generative models. Such data is crucial for training effective generative models, but in drug design it is difficult to collect because the experimental process is complex and resource-intensive.
2. **Alignment of generative models with actual requirements**: Although existing generative models have made progress in generating molecules, the molecules they produce often fail to meet the practical requirements of drug design. For example, generated molecules may have low binding affinity or unreasonable conformations.
3. **Flexibility in multi-objective optimization**: Drug design usually has to consider multiple target properties, such as binding affinity, drug-likeness (QED), and synthetic accessibility (SA). Existing optimization methods are limited in how they handle multiple objectives and struggle to optimize several properties simultaneously.

### Solutions

To solve the above problems, the authors propose **DecompDPO** (Decomposed Direct Preference Optimization), a structure-based optimization method that aligns the generative model with the practical requirements of drug design through multi-granularity preferences. Specifically:

- **Decomposing the optimization objective**: DecompDPO introduces decomposition into the optimization objectives, building preference pairs at the molecule level or the substructure level depending on whether each objective is decomposable. This makes the optimization more flexible and lets the model align preferences at different granularities.
- **Energy term with physical constraints**: To ensure that the generated molecules have reasonable conformations, DecompDPO introduces a physics-informed energy term that penalizes molecular conformations violating physical rules.
- **Linear beta scheduling**: To improve optimization efficiency, DecompDPO proposes a linear beta schedule that gradually reduces the influence of regularization, so the model aligns better with the desired properties.

### Application scenarios

DecompDPO can be applied to two main scenarios:

1. **Molecular generation across protein families**: Fine-tuning a pretrained diffusion model to generate high-quality molecules for different protein families.
2. **Molecular optimization for specific protein subpockets**: After molecules are generated, optimizing them for a specific protein subpocket to further improve binding affinity and other properties.

### Experimental results

The experimental results show that DecompDPO significantly improves model performance on the CrossDocked2020 benchmark:

- **Molecular generation**: up to 95.2% Med. High Affinity with a 36.2% success rate.
- **Molecular optimization**: 100% Med. High Affinity with a 52.1% success rate.

These results indicate that DecompDPO has significant advantages in generating and optimizing high-quality molecules.
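The preference-alignment ideas listed under Solutions (pairwise preferences, substructure-level decomposition, and a linear beta schedule) can be illustrated with a minimal Python sketch. It uses the generic pairwise DPO loss, `-log sigmoid(beta * margin)`, rather than the paper's diffusion-specific objective, which the summary does not spell out. The function names, the default `beta_start` and `beta_end` values, the choice to index the schedule by a generic step, and the averaging of per-substructure losses are all assumptions made for illustration.

```python
import math


def linear_beta(step, total_steps, beta_start=1.0, beta_end=0.1):
    """Linearly interpolate beta from beta_start to beta_end.

    Assumption: the schedule is indexed by a generic step counter; the
    summary does not say whether the paper uses training steps or
    diffusion timesteps, and the endpoint values here are placeholders.
    """
    frac = step / max(total_steps - 1, 1)
    frac = min(max(frac, 0.0), 1.0)  # clamp to [0, 1]
    return beta_start + frac * (beta_end - beta_start)


def dpo_loss(logp_win, logp_lose, ref_logp_win, ref_logp_lose, beta):
    """Generic pairwise DPO loss: -log sigmoid(beta * margin).

    The margin compares how much more the fine-tuned model favors the
    preferred sample over the rejected one, relative to the reference.
    """
    margin = (logp_win - ref_logp_win) - (logp_lose - ref_logp_lose)
    x = beta * margin
    # -log(sigmoid(x)) equals softplus(-x); this form avoids overflow.
    return max(-x, 0.0) + math.log1p(math.exp(-abs(x)))


def decomposed_dpo_loss(substructure_pairs, beta):
    """Average the pairwise loss over substructure-level preference pairs.

    Each pair is (logp_win, logp_lose, ref_logp_win, ref_logp_lose).
    Averaging is an assumption of this sketch, not the paper's stated rule.
    A molecule-level objective is simply the case of a single pair.
    """
    losses = [dpo_loss(*pair, beta) for pair in substructure_pairs]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    total = 11
    pairs = [(-1.0, -2.0, -1.5, -1.5), (-0.5, -0.4, -0.5, -0.5)]
    for step in (0, 5, 10):
        beta = linear_beta(step, total)
        print(step, round(beta, 3), round(decomposed_dpo_loss(pairs, beta), 4))
```

A smaller beta weakens the pull toward the reference model, which matches the summary's description of gradually reducing the influence of regularization.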

Decomposed Direct Preference Optimization for Structure-Based Drug Design
