DrugCLIP: Contrastive Drug-Disease Interaction For Drug Repurposing

Yingzhou Lu,Yaojun Hu,Chenhao Li
2024-07-02
Abstract:Bringing a novel drug from the original idea to market typically requires more than ten years and billions of dollars. To alleviate the heavy burden, a natural idea is to reuse the approved drug to treat new diseases. The process is also known as drug repurposing or drug repositioning. Machine learning methods exhibited huge potential in automating drug repurposing. However, it still encounter some challenges, such as lack of labels and multimodal feature representation. To address these issues, we design DrugCLIP, a cutting-edge contrastive learning method, to learn drug and disease's interaction without negative labels. Additionally, we have curated a drug repurposing dataset based on real-world clinical trial records. Thorough empirical studies are conducted to validate the effectiveness of the proposed DrugCLIP method.
Machine Learning,Biomolecules
What problem does this paper attempt to address?
The paper proposes a method to address some key challenges in Drug Repurposing. Drug repurposing refers to the use of approved drugs for treating new diseases to reduce the time and cost of new drug development. Currently, the main issues faced by this approach include the lack of high-quality data, the absence of labels (especially negative samples), and difficulties in representing multimodal features. To tackle these challenges, the paper introduces a contrastive learning framework called DrugCLIP, which can learn drug-disease interactions without the need for negative samples. Additionally, the researchers establish a drug repurposing dataset based on real-world clinical trial records. Through empirical studies, they validate the effectiveness of the DrugCLIP method, demonstrating its superior performance in predicting drug-disease interactions compared to existing baseline methods, especially with a 16.5% improvement in hit rate. The main contributions of the paper include: 1. Creation of a high-quality dataset for drug repurposing derived from clinical trial records. 2. Development of DrugCLIP, a deep contrastive learning method tailored for drug repurposing, capable of modeling the interactions between drug molecules and disease codes. 3. Proposal of representation learning methods for drug molecules and disease codes. The paper also reviews the applications of artificial intelligence in drug discovery and clinical trials, and introduces the proposed solutions, including data preprocessing, contrastive learning strategies, and multimodal data representation. Experimental results demonstrate the superiority of DrugCLIP in the task of drug repurposing.