Abstract:Abstract We have developed an algorithm and implemented it in a software platform for the purpose of developing new anti-tumor drugs in the form of small molecules. In this study, we focused on generating molecules specifically for the treatment of lung cancer patients. To begin with, we employed deep learning (DL) techniques to evaluate the genes associated with poor clinical outcomes in lung cancer patients. By utilizing generative adversarial neural networks (GAN), we acquired additional patient data. The results of each experiment were presented as a list of genes ordered by their impact on the desired effect. We then intersected the lists of genes obtained from experiments with overall survival (OS) and progression-free interval (PFI) data. This allowed us to identify a set of genes whose expression was correlated with poor prognosis. In order to enhance the precision, we trained another DL model to distinguish between normal and tumor tissue based on gene expression. By doing so, we were able to identify the smaller set of genes that could be targeted. Subsequently, we developed a module that predicts the interactions between inhibitors and proteins. This involved representing protein amino acid sequences and chemical compound formulas in vector form, and a virtual screening of the Pubchem database. The implementation of the Drug-protein interactions module resulted in a dataset of 118,379 pairs, including 19,250 pairs describing compounds bound to proteins, and 99,129 precedents describing non-bound ones. DLwas applied, yielding a ROC-AUC of 0.86. Following the search for candidate molecules, we obtained 160,000 pairs with a predicted interaction probability above 0.99, as well as 2,921 pairs with probability of 1.0. Additionally, we created a DL-based module to predict the IC50 values in cell line experiments. Virtual pre-clinical trials were conducted using the selected inhibitors to identify relevant cell lines for subsequent laboratory experiments. Through this process, we obtained formulas for several molecules that demonstrated predicted binding to specific proteins. During the cell experiment emulation, our feature importance algorithm selected 129 genes. For the cell experiment emulation stage, we specifically chose interactions with a probability of at least 0.9. We prioritized molecules that acted on the minimum number of cell lines with a higher probability, thus ensuring higher specificity. Ultimately, we selected 5 small molecules as potential candidates, as well as certain cell lines for their validation. The NLP technologies utilized in this study demonstrated their effectiveness in processing tens of thousands of articles. The pipeline of methods presented in this paper lays the groundwork for automated AI-driven drug discovery. We have showcased the application of modern machine learning methods, particularly DL, as well as the methods used to prepare the initial data for the learning algorithms. The performance of these methods has been validated through cross-validation using data from publicly available sources. Citation Format: Dmitrii K Chebanov, Vsevolod A Misyurin, Nadezhda S Tatevosova. Deep learning-driven drug discovery: A breakthrough algorithm and its implication in lung cancer therapy development [abstract]. In: Proceedings of the AACR-NCI-EORTC Virtual International Conference on Molecular Targets and Cancer Therapeutics; 2023 Oct 11-15; Boston, MA. Philadelphia (PA): AACR; Mol Cancer Ther 2023;22(12 Suppl):Abstract nr A014.

Abstract 7393: Tumor model to tumor treatment: Applying deep learning approaches to map multimodal data from cancer model systems to patients

Abstract 4951: Integrative multi-omic machine learning model predicts neoadjuvant immunotherapy response using molecular data and deep learning-derived features from digital pathology

Abstract 2723: Model-based cancer therapy selection by linking tumor vulnerabilities to drug mechanism

Abstract 5381: A broad-use deep learning model based on multi-dimensional morphology to identify and characterize tumor cell heterogeneity

Abstract 3537: Assessing machine learning models of drug response predictions using orthotopic patient-derived xenograft

Abstract 4970: Multi-modal machine learning approaches for predicting cancer type and Gleason grade leveraging public TCGA data

Abstract 3522: Application of large language models to nucleotide sequences for profiling signaling pathway disruptions in ovarian cancer patients

Abstract A014: Deep learning-driven drug discovery: A breakthrough algorithm and its implication in lung cancer therapy development

Abstract 6191: Simultaneous prediction of tumor microenvironment biomarkers from pathology slides using multi-task deep regression

Abstract 1174: Enhancing the utilization of deep learning to predict patient response in small immunotherapy cohorts using real-world data

Abstract 469: A multi-scale analysis and visualization platform for cancer data - deriving tumor microenvironment behavior from pathology and transcriptomics

Abstract 1789: Machine learning-based method to analyze metabolic fluxes of patient tumors

Abstract B023: Modeling of new drugs clinical trials outcome with patients’ digital twins cohorts

Abstract B082: Early risk stratification of ER+/HER2– breast cancer patients using digital pathology and multi-task, weakly-supervised deep learning

Abstract 896: Predicting metastatic transcriptomes of patient tumors with deep learning

Abstract 6189: Deep-learning model trained on multiplex immunofluorescence-stained tissue samples predicts the survival of patients with non-small cell lung cancer better than PD-L1 TPS alone

Abstract 5721: Automated annotation for large-scale clinicogenomic models of lung cancer treatment response and overall survival

Abstract 2059: Machine learning integration of transcriptome-wide spatial sequencing data and ultra-high plex spatial proteomic data enables the prioritization of cancer drug targets

Abstract 910: Clinical inference and biological dissection of tumor ploidy and heterogeneity in cutaneous melanoma for immunotherapy response using deep learning

Abstract 188: Deep learning enables label-free profiling of the tumor microenvironment and enrichment of rare cancer cells

Abstract IA019: Multiscale systems approach to target tumor ecosystem responses for therapeutic benefit