Abstract:Molecular recognition is fundamental in biology, underpinning intricate processes through specific protein–ligand interactions. This understanding is pivotal in drug discovery, yet traditional experimental methods face limitations in exploring the vast chemical space. Computational approaches, notably quantitative structure–activity/property relationship analysis, have gained prominence. Molecular fingerprints encode molecular structures and serve as property profiles, which are essential in drug discovery. While two-dimensional (2D) fingerprints are commonly used, three-dimensional (3D) structural interaction fingerprints offer enhanced structural features specific to target proteins. Machine learning models trained on interaction fingerprints enable precise binding prediction. Recent focus has shifted to structure-based predictive modeling, with machine-learning scoring functions excelling due to feature engineering guided by key interactions. Notably, 3D interaction fingerprints are gaining ground due to their robustness. Various structural interaction fingerprints have been developed and used in drug discovery, each with unique capabilities. This review recapitulates the developed structural interaction fingerprints and provides two case studies to illustrate the power of interaction fingerprint-driven machine learning. The first elucidates structure–activity relationships in β2 adrenoceptor ligands, demonstrating the ability to differentiate agonists and antagonists. The second employs a retrosynthesis-based pre-trained molecular representation to predict protein–ligand dissociation rates, offering insights into binding kinetics. Despite remarkable progress, challenges persist in interpreting complex machine learning models built on 3D fingerprints, emphasizing the need for strategies to make predictions interpretable. Binding site plasticity and induced fit effects pose additional complexities. Interaction fingerprints are promising but require continued research to harness their full potential.

Leveraging non-structural data to predict structures of protein–ligand complexes

Leveraging nonstructural data to predict structures and affinities of protein-ligand complexes

State-specific protein-ligand complex structure prediction with a multi-scale deep generative model

Estimating protein–ligand interactions with geometric deep learning and mixture density models

New trends in computational structure prediction of ligand-protein complexes for receptor-based drug design

On Machine Learning Approaches for Protein-Ligand Binding Affinity Prediction

Synergistic Application of Molecular Docking and Machine Learning for Improved Binding Pose

State-specific protein–ligand complex structure prediction with a multiscale deep generative model

Fingerprinting Interactions between Proteins and Ligands for Facilitating Machine Learning in Drug Discovery

Machine Learning for Sequence and Structure-Based Protein–Ligand Interaction Prediction

Leveraging binding-site structure for drug discovery with point-cloud methods

Accurate prediction of protein–ligand interactions by combining physical energy functions and graph-neural networks

Structure prediction of protein-ligand complexes from sequence information with Umol

Protein-ligand binding affinity prediction: Is 3D binding pose needed?

Structural properties and interaction energies affecting drug design. An approach combining molecular simulations, statistics, interaction energies and neural networks

Binding Affinity Prediction with 3D Machine Learning: Training Data and Challenging External Testing

Modern machine‐learning for binding affinity estimation of protein–ligand complexes: Progress, opportunities, and challenges

Accurate Protein-Ligand Complex Structure Prediction using Geometric Deep Learning

Physics-Guided Deep Generative Model for New Ligand Discovery

Improved prediction of ligand-protein binding affinities by meta-modeling

Structure-based, deep-learning models for protein-ligand binding affinity prediction