Abstract: Recent advances in computer hardware and software have led to a revolution in deep neural networks that has impacted fields ranging from language translation to computer vision. Deep learning has also impacted a number of areas in drug discovery, including the analysis of cellular images and the design of novel routes for the synthesis of organic molecules. While work in these areas has been impactful, a complete review of the applications of deep learning in drug discovery would be beyond the scope of a single Account. In this Account, we will focus on two key areas where deep learning has impacted molecular design: the prediction of molecular properties and the de novo generation of suggestions for new molecules.

One of the most significant advances in the development of quantitative structure–activity relationships (QSARs) has come from the application of deep learning methods to the prediction of the biological activity and physical properties of molecules in drug discovery programs. Rather than employing the expert-derived chemical features typically used to build predictive models, researchers are now using deep learning to develop novel molecular representations. These representations, coupled with the ability of deep neural networks to uncover complex, nonlinear relationships, have led to state-of-the-art performance. While deep learning has changed the way that many researchers approach QSARs, it is not a panacea. As with any other machine learning task, the design of predictive models is dependent on the quality, quantity, and relevance of available data. Seemingly fundamental issues, such as optimal methods for creating a training set, are still open questions for the field. Another critical area that is still the subject of multiple research efforts is the development of methods for assessing the confidence in a model.

Deep learning has also contributed to a renaissance in the application of de novo molecule generation. Rather than relying on manually defined heuristics, deep learning methods learn to generate new molecules based on sets of existing molecules. Techniques that were originally developed for areas such as image generation and language translation have been adapted to the generation of molecules. These deep learning methods have been coupled with the predictive models described above and are being used to generate new molecules with specific predicted biological activity profiles. While these generative algorithms appear promising, there have been only a few reports on the synthesis and testing of molecules based on designs proposed by generative models. The evaluation of the diversity, quality, and ultimate value of molecules produced by generative models is still an open question. While the field has produced a number of benchmarks, it has yet to agree on how one should ultimately assess molecules "invented" by an algorithm.
