Multiscale Fusion Network Drives the Repurposing of Anticancer Drugs.
Zhaoman Wan,Nan Jiang,Mingming Su,Xinlei Zhang,Yang Cao,Aiping Wu,Peng Zhang,Taijiao Jiang
DOI: https://doi.org/10.1002/ctm2.1745
IF: 8.554
2024-01-01
Clinical and Translational Medicine
Abstract:Dear Editor, Drug repurposing is at the forefront of a transformative shift in computational methods driving new applications of approved or investigational drugs.1, 2 With the development of network pharmacology, repositioning algorithms for drug effects or drug targets are constantly expanding, but integrating multidimensional data to achieve precise repurposing is still a challenge.3-9 We focus on drug attribute characteristics and propose a scalable systematic paradigm. Using the Genomics of Drug Sensitivity in Cancer (GDSC) database for anti-tumor drugs, a integrated drug similarity network (iDSN) derived from different drug similarity networks (DSNs) based on chemical structure and drug target sequence data is constructed to infer potential drug pathways from drug properties and realise drug repurposing. Initially, we processed drug profile data by vectorizing it (Figure 1A). Based on chemical and pharmacological properties, we constructed two separate DSNs: chem-DSN and pharm-DSN. These were then merged into an iDSN using a nonlinear fusion algorithm called Similarity Network Fusion (SNF) (Figure 1B). To validate the iDSN's potential in therapeutic similarity, we utilized a spectral clustering model with seven gold-standard annotations from PubChem (Figure 1C). Downstream analysis was delineated across three dimensions for drug repurposing (Figure 1D): (1) identifying similar components within classes, amalgamating pharmacological mechanisms with pathway annotation; (2) establishing associations between drug network clusters and distinct biological pathways; (3) prioritizing higher-ranked drug pairs for drug repositioning. Employing spectral clustering, iDSN exhibited a more distinct clustering structure compared to the chem-DSN and a more evenly distributed structure than the pharm-DSN (Figures 2A and S1). With the advantage of framework transparency, pharmacological properties contribute more than chemical properties through the quantitative assessment in clustering (Figure 2B). In 11 clusters, pharmacological features accounted for over 70% of edge similarity, and four clusters were entirely determined by pharm-DSN. In comparison to single-property DSNs, iDSN demonstrated superior performance across all three metrics (Figure 2C). Evaluation using six diverse benchmark datasets of drug categories confirmed iDSN's stronger correlations with all benchmark annotations compared to single-property DSNs (Table S1). Comparing clustering performance among single-property DSNs, pharm-DSN displayed better interaction with cell line (ARI = .470) and pathway (ARI = .585) annotations (Figure 2D). Importantly, iDSN based on the cross-fusion network algorithm achieves higher performance on IC50 (ARI = .502, NMI = .53), indicating improved generalisation and accuracy through multi-feature fusion (Figures 2E and S2). Data contribution analysis within the IC50-based cluster of the three DSNs highlighted iDSN's predominant contribution (77.23%), while chem-DSN and pharm-DSN contributed less (9.83% and 12.94%, respectively), underscoring iDSN's dominance in the IC50-based network (Figure 2F). To validate the superior performance, our method was compared with state-of-the-art approaches, encompassing traditional machine learning, network propagation and matrix factorisation. The framework demonstrated a significantly higher value (SC = .58) compared to other methods. Similar results were observed with the NMI index using the IC50 dataset, where our framework outperformed in interactivity score (ARI = .512) (Table S2). To facilitate drug precision repositioning, we uncovered the drug preferences within each cluster for various molecular functions using the KEGG pathway and Gene Ontology (GO) annotations10 (Figure 3). Notably, certain highly similar drug pairs within clusters exhibited consistent downstream pathway annotations, indicating the potential for drug repositioning by leveraging common targets or similar cellular signalling pathways to achieve therapeutic effects. The results revealed that several drug clusters exhibited significant enrichment annotations on GO analysis, such as Cluster 4, Cluster 2, and Cluster 5 included in the histone deacetylation, ADP ribosylation and phosphorylation respectively based on biological process. Furthermore, we observed that some individual drug clusters met different KEGG enrichment pathways under secondary classification, but specific on GO enrichment analysis in biological processes, cell components or molecular functions. For instance, in Cluster 9, we observed enrichment in KEGG pathways related to cancer and cell growth and death, with emphasis on apoptosis and molecular functions associated with dimerisation in biological processes, which suggests that Cluster 9 may exhibit a pharmacodynamic pattern, potentially influencing protein dimerisation and participating in pathways related to cancer or cell growth and death through apoptosis regulation. Exploring drug pairs with high similarity in the iDSN reveals potential drug repositioning opportunities. For the top 100 similar drug pairs, 86% had consistent pathway annotations, confirming the reliability of our drug similarity calculations. However, some pairs with high similarity scores had different annotations, mainly linked to six pathways in four classification clusters (Figure 4A). For instance, a closely related set of drug pairs, including BMS-536924, BMS-754807, GSK1904529A, Linsitinib and NVP-ADW742, exhibited connections to annotations in the IGF1R signalling and RTK signalling pathways, suggesting potential shared targets or interactions with similar cellular signalling pathways. In a major cluster, drugs associated with the kinases pathway (KIN001-244) showed high similarity to drugs linked to the Metabolism pathway (BX-912 and OSU-03012) and the Mitosis pathway (MPS-1-IN-1), unveiling potential crosstalk for therapeutic strategies (Table S3). Among them, the drug pair with the highest similarity is CMK-LJI308 (ranked sixth), annotated in the kinase pathway and the PI3K/MTOR signalling pathway, respectively. Drug pairs with different annotation pathways in specific spectral clustering clusters indicate distinct subgroups with unique downstream pathway preferences. Some clusters may exhibit pathway-specific therapeutic effects, while others show divergent pathway orientations (Figure 4B). To explore the global repositioning associations, we aligned pathway annotations with clustering results and observed a well-balanced distribution of downstream pathways across drug clusters (Figure 4C). Within one cluster containing nine drugs, four were associated with the IGF1R signalling pathway, and the remaining drugs were linked to the RTK signalling pathway. In another cluster with 37 drugs, 14 were mapped to the RTK pathway, while the remaining drugs were connected to the kinase pathway. Furthermore, we independently analysed highly similar drug pairs within these clusters (Figure 4D). In conclusion, this scalable structure-derived framework offers fresh insights into deducing characteristic downstream pathways and repurposing drugs via common drug structural properties. With the accumulation of drug informatics data and the development of future drugs, we will continue to expand our data, to deepen our understanding of feature integration, and to further improve the algorithm's performance for new drug development. Zhaoman Wan performed the analysis and prepared the manuscript with the help of Yang Cao, Mingming Su, Xinlei Zhang, L.Y. and H.X. Aiping Wu, Peng Zhang and Taijiao Jiang supervised the studies, designed the analysis and revised the manuscript. All authors reviewed and approved the manuscript. M.S. and X.Z. are the co-founders of Beijing Cloudna Technology Co., Ltd., and the other authors declare no competing interests. Please note: The publisher is not responsible for the content or functionality of any supporting information supplied by the authors. Any queries (other than missing content) should be directed to the corresponding author for the article.