Abstract: Patient-derived xenografts (PDXs) are an appealing platform for preclinical drug studies because their in vivo environment helps preserve tumor heterogeneity and usually mimics the drug response of patients with cancer better than cancer cell lines (CCLs) do. We investigate a multimodal neural network (MM-Net) and data augmentation for drug response prediction in PDXs. The MM-Net learns to predict response from drug descriptors, gene expressions (GE), and histology whole-slide images (WSIs), where the multi-modality refers to the tumor features. We explore whether integrating WSIs with GE improves predictions compared with models that use GE alone. We use two methods to address the limited number of response values: 1) homogenizing drug representations, which allows single-drug and drug-pair treatments to be combined into a single dataset, and 2) augmenting drug-pair samples by switching the order of the drug features, which doubles the number of drug-pair samples. Together, these methods allow us to train multimodal and unimodal neural networks (NNs) on single-drug and drug-pair treatments without changing architectures or the dataset. To assess the contribution of the data augmentation methods, we compare the prediction performance of three unimodal NNs that use GE. The NN trained on the full dataset, which includes the original and augmented drug-pair treatments as well as the single-drug treatments, significantly outperforms the NNs that ignore either the augmented drug pairs or the single-drug treatments. In assessing the contribution of multimodal learning based on the Matthews correlation coefficient (MCC), MM-Net statistically significantly outperforms all the baselines. Our results show that data augmentation and the integration of histology images with GE can improve the prediction of drug response in PDXs.
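
The two data-handling methods described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not say how single-drug treatments are homogenized, so the sketch assumes a single drug is encoded by repeating its descriptor vector in both drug slots. The function names `build_drug_features` and `augment_drug_pairs` and the sample dictionary keys are hypothetical.

```python
import numpy as np


def build_drug_features(drug1, drug2=None):
    """Return one fixed-length drug vector [drug1 | drug2].

    Assumption (not stated in the abstract): a single-drug treatment is
    encoded by repeating its descriptors in both slots, so single-drug and
    drug-pair samples share the same input shape.
    """
    d1 = np.asarray(drug1, dtype=float)
    d2 = d1 if drug2 is None else np.asarray(drug2, dtype=float)
    return np.concatenate([d1, d2])


def augment_drug_pairs(samples):
    """Homogenize drug features and add an order-swapped copy of each pair.

    `samples` is a list of dicts with the hypothetical keys 'ge', 'drug1',
    'drug2' (None for single-drug treatments), and 'response'.
    """
    out = []
    for s in samples:
        # Original sample, with drugs in the given order.
        out.append({
            "ge": s["ge"],
            "drug": build_drug_features(s["drug1"], s["drug2"]),
            "response": s["response"],
        })
        # Drug pairs get a second copy with the two drugs switched;
        # the tumor features and the response label are unchanged.
        if s["drug2"] is not None:
            out.append({
                "ge": s["ge"],
                "drug": build_drug_features(s["drug2"], s["drug1"]),
                "response": s["response"],
            })
    return out
```

Under these assumptions, a dataset with `p` drug-pair samples and `s` single-drug samples yields `2p + s` samples, all with drug vectors of the same length, so the same network architecture can consume both treatment types.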

Data augmentation and multimodal learning for predicting drug response in patient-derived xenografts from gene expressions and histology images
