Abstract: The goal of molecular optimization (MO) is to discover molecules with improved pharmaceutical properties relative to a known starting molecule. Despite many recent successes of new approaches for MO, these methods were typically developed for particular properties with abundant annotated training examples. They are therefore difficult to apply in real-world settings, where only a small amount of pharmaceutical data is usually available because data collection is expensive and labor-intensive. Here, we propose a new approach, Meta-MO, for molecular optimization with a handful of training samples, based on well-recognized first-order meta-learning algorithms. Using a set of meta tasks with abundant training samples, Meta-MO trains a meta model through meta-learning optimization and adapts the learned model to new low-resource MO tasks. Meta-MO consistently outperformed several pretraining and multitask training procedures, providing an average improvement in success rate of 4.3% on a large-scale bioactivity data set with diverse target variations. We also observed that Meta-MO yielded the best-performing models across fine-tuning sets with only dozens of samples. To the best of our knowledge, this is the first study to apply meta-learning to MO tasks.
More importantly, such a strategy could be further extended to many low-resource scenarios in real-world drug design.

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acs.jcim.0c01416.

Detailed descriptions of graph encoder and Transformer architecture; Table S1, model input representations for atoms; Table S2, R², RMSE, and MAE metrics for query task scoring models; Table S3, standard deviations of data in Table 4; and Figures S1–S4, distributions of source molecule weight, synthetic accessibility score, logP score, and bioactivity (PDF)

Data set splits of tasks (XLSX)
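The train-then-adapt procedure summarized in the abstract can be illustrated with a minimal sketch of one first-order meta-learning algorithm. The abstract does not say which first-order algorithm Meta-MO uses, so the example below assumes a Reptile-style update, and it replaces molecular tasks with a hypothetical family of toy 1-D linear regression tasks. All function names, the task distribution, and the hyperparameters are illustrative assumptions, not the authors' implementation.

```python
import random

def sgd_adapt(params, data, lr=0.02, steps=5):
    """Inner loop: a few gradient steps on one task's squared error for y ~ w*x + b."""
    w, b = params
    for _ in range(steps):
        gw = gb = 0.0
        for x, y in data:
            err = (w * x + b) - y
            gw += 2 * err * x / len(data)
            gb += 2 * err / len(data)
        w -= lr * gw
        b -= lr * gb
    return (w, b)

def mse(params, data):
    """Mean squared error of the linear model on a task's data."""
    w, b = params
    return sum(((w * x + b) - y) ** 2 for x, y in data) / len(data)

def sample_task(rng, n):
    """Hypothetical task family (assumption): lines with slope near 3, intercept near -1."""
    a = 3.0 + rng.uniform(-0.5, 0.5)
    c = -1.0 + rng.uniform(-0.5, 0.5)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    return [(x, a * x + c) for x in xs]

def reptile(rng, meta_steps=2000, meta_lr=0.1):
    """Outer loop (Reptile-style, first-order): nudge the shared initialization
    toward the parameters reached after adapting to each sampled meta task."""
    theta = (0.0, 0.0)
    for _ in range(meta_steps):
        task = sample_task(rng, 20)       # data-rich meta task
        phi = sgd_adapt(theta, task)      # task-adapted parameters
        theta = tuple(t + meta_lr * (p - t) for t, p in zip(theta, phi))
    return theta

rng = random.Random(0)
meta_init = reptile(rng)

# New low-resource task: only a dozen samples, same small adaptation budget.
new_task = sample_task(rng, 12)
loss_meta = mse(sgd_adapt(meta_init, new_task), new_task)
loss_scratch = mse(sgd_adapt((0.0, 0.0), new_task), new_task)
print("adapted from meta-learned init:", loss_meta)
print("adapted from zero init:        ", loss_scratch)
```

In this toy setting, the meta-learned initialization starts close to the region where the task optima lie, so the same few adaptation steps on a small sample reach a lower loss than starting from an uninformed initialization. Whether this carries over to molecular optimization depends on the model and task distribution described in the paper itself.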

Meta-Learning Initializations for Low-Resource Drug Discovery

Meta-Learning GNN Initializations for Low-Resource Molecular Property Prediction

Meta Learning for Low-Resource Molecular Optimization

A Case-Based Meta-Learning Algorithm Boosts the Performance of Structure-Based Virtual Screening

Meta Learning With Graph Attention Networks for Low-Data Drug Discovery

Meta-MolNet: A Cross-Domain Benchmark for Few Examples Drug Discovery

Meta-QSAR: a large-scale application of meta-learning to drug design and discovery

Task‐similarity is a crucial factor for few‐shot meta‐learning of structure‐activity relationships

Learning Together: Towards foundational models for machine learning interatomic potentials with meta-learning

Learning together: Towards foundation models for machine learning interatomic potentials with meta-learning

Model Agnostic Semi-Supervised Meta-Learning Elucidates Understudied Out-of-distribution Molecular Interactions

Low Data Drug Discovery with One-Shot Learning

MAC: a meta-learning approach for feature learning and recombination

Advancing Drug Discovery with Deep Learning: Harnessing Reinforcement Learning and One-Shot Learning for Molecular Design in Low-Data Situations

A bioactivity foundation model using pairwise meta-learning

MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

Less is more: sampling chemical space with active learning

Improved prediction of ligand-protein binding affinities by meta-modeling

Semi-supervised meta-learning elucidates understudied molecular interactions

Inductive transfer learning for molecular activity prediction: Next-Gen QSAR Models with MolPMoFiT

When MAML Can Adapt Fast and How to Assist When It Cannot