Abstract:A few decades ago, drug discovery and development were limited to a bunch of medicinal chemists working in a lab with enormous amount of testing, validations, and synthetic procedures, all contributing to considerable investments in time and wealth to get one drug out into the clinics. The advancements in computational techniques combined with a boom in multi-omics data led to the development of various bioinformatics/pharmacoinformatics/cheminformatics tools that have helped speed up the drug development process. But with the advent of artificial intelligence (AI), machine learning (ML) and deep learning (DL), the conventional drug discovery process has been further rationalized. Extensive biological data in the form of big data present in various databases across the globe acts as the raw materials for the ML/DL-based approaches and helps in accurate identifications of patterns and models which can be used to identify therapeutically active molecules with much fewer investments on time, workforce and wealth. In this review, we have begun by introducing the general concepts in the drug discovery pipeline, followed by an outline of the fields in the drug discovery process where ML/DL can be utilized. We have also introduced ML and DL along with their applications, various learning methods, and training models used to develop the ML/DL-based algorithms. Furthermore, we have summarized various DL-based tools existing in the public domain with their application in the drug discovery paradigm which includes DL tools for identification of drug targets and drug–target interaction such as DeepCPI, DeepDTA, WideDTA, PADME DeepAffinity, and DeepPocket. Additionally, we have discussed various DL-based models used in protein structure prediction, de novo design of new chemical scaffolds, virtual screening of chemical libraries for hit identification, absorption, distribution, metabolism, excretion, and toxicity (ADMET) prediction, metabolite prediction, clinical trial design, and oral bioavailability prediction. In the end, we have tried to shed light on some of the successful ML/DL-based models used in the drug discovery and development pipeline while also discussing the current challenges and prospects of the application of DL tools in drug discovery and development. We believe that this review will be useful for medicinal and computational chemists searching for DL tools for use in their drug discovery projects.

Deep learning for low-data drug discovery: hurdles and opportunities

A compact review of progress and prospects of deep learning in drug discovery

Deep Learning in Drug Discovery: Current Landscape and Future Prospects

Traversing Chemical Space with Active Deep Learning: A Computational Framework for Low-data Drug Discovery

Deep learning tools for advancing drug discovery and development

Low Data Drug Discovery with One-Shot Learning

The rise of deep learning in drug discovery

Structure-based drug discovery with deep learning

Advancing Drug Discovery with Deep Learning: Harnessing Reinforcement Learning and One-Shot Learning for Molecular Design in Low-Data Situations

A Comprehensive Review on Machine Learning and Deep Learning Methods in Drug Discovery

Deep Learning Methods for Small Molecule Drug Discovery: A Survey

Current strategies to address data scarcity in artificial intelligence-based drug discovery: A comprehensive review

Recent Progress of Deep Learning in Drug Discovery

Artificial intelligence in drug discovery: recent advances and future perspectives

Spectrum of deep learning algorithms in drug discovery

Advances in Deep Learning Assisted Drug Discovery Methods: A Self-review

A Review Of Deep Learning In Computer-Aided Drug Design

Transfer Learning for Drug Discovery

Status and Prospects of Research on Deep Learning-based De Novo Generation of Drug Molecules

Deep Learning in Drug Discovery and Medicine; Scratching the Surface

Data Integration Using Advances in Machine Learning in Drug Discovery and Molecular Biology