Abstract:The integration of deep learning, particularly AI-Generated Content, with high-quality data derived from ab initio calculations has emerged as a promising avenue for transforming the landscape of scientific research. However, the challenge of designing molecular drugs or materials that incorporate multi-modality prior knowledge remains a critical and complex undertaking. Specifically, achieving a practical molecular design necessitates not only meeting the diversity requirements but also addressing structural and textural constraints with various symmetries outlined by domain experts. In this article, we present an innovative approach to tackle this inverse design problem by formulating it as a multi-modality guidance optimization task. Our proposed solution involves a textural-structure alignment symmetric diffusion framework for the implementation of molecular optimization tasks, namely 3DToMolo. 3DToMolo aims to harmonize diverse modalities including textual description features and graph structural features, aligning them seamlessly to produce molecular structures adhere to specified symmetric structural and textural constraints by experts in the field. Experimental trials across three guidance optimization settings have shown a superior hit optimization performance compared to state-of-the-art methodologies. Moreover, 3DToMolo demonstrates the capability to discover potential novel molecules, incorporating specified target substructures, without the need for prior knowledge. This work not only holds general significance for the advancement of deep learning methodologies but also paves the way for a transformative shift in molecular design strategies. 3DToMolo creates opportunities for a more nuanced and effective exploration of the vast chemical space, opening new frontiers in the development of molecular entities with tailored properties and functionalities.

Preference Optimization for Molecular Language Models

PrefixMol: Target- and Chemistry-aware Molecule Design Via Prefix Embedding

Small Molecule Optimization with Large Language Models

Leveraging language model for advanced multiproperty molecular optimization via prompt engineering

Adaptive language model training for molecular design

Bayesian molecular design with a chemical language model

Optimizing molecules using efficient queries from property evaluations

Preference Optimization for Molecule Synthesis with Conditional Residual Energy-based Models

Decomposed Direct Preference Optimization for Structure-Based Drug Design

Conditional Latent Space Molecular Scaffold Optimization for Accelerated Molecular Design

Domain-Agnostic Molecular Generation with Chemical Feedback

Graph Polish: A Novel Graph Generation Paradigm for Molecular Optimization

Preference optimization of protein language models as a multi-objective binder design paradigm

Computer-aided multi-objective optimization in small molecule discovery

MOLER: Incorporate Molecule-Level Reward to Enhance Deep Generative Model for Molecule Optimization

Directly Optimizing for Synthesizability in Generative Molecular Design using Retrosynthesis Models

Chemical Language Model Linker: blending text and molecules with modular adapters

DrugAssist: A Large Language Model for Molecule Optimization

Sculpting Molecules in Text-3D Space: A Flexible Substructure Aware Framework for Text-Oriented Molecular Optimization

Versatile Molecular Editing via Multimodal and Group-optimized Generative Learning