Transfer Learning with Molecular Graph Convolutional Networks for Accurate Modeling and Representation of Bioactivities of Ligands Targeting GPCRs Without Sufficient Data
Jiansheng Wu, Chuangchuang Lan, Zheming Mei, Xiaohuyan Chen, Yanxiang Zhu, Haifeng Hu, Yemin Diao
DOI: https://doi.org/10.1016/j.compbiolchem.2022.107664
IF: 3.737
2022-01-01
Computational Biology and Chemistry
Abstract: Many new or potential drug targets among G protein-coupled receptors (GPCRs) lack sufficient ligand associations, and drug discovery targeting these GPCRs is both essential and urgent. Precisely modeling and representing ligand bioactivities is essential for screening and optimizing drugs that target these GPCRs, yet the shortage of samples makes this difficult to achieve. Transfer learning aims to solve this problem by introducing rich information from related source domains that have sufficient ligand training samples. In addition, ligand molecules naturally form a graph structure, which molecular graph convolutional networks can exploit for end-to-end, multi-level representation learning. This study proposes a novel method, TL-MGCN, which uses transfer learning with molecular graph convolutional networks to precisely model and represent the bioactivities of ligands targeting GPCRs without sufficient data. TL-MGCN was tested on a series of 54 representative target-domain datasets that cover most human GPCR subfamilies, 44 of which have fewer than 600 ligand associations. TL-MGCN obtained average improvements of 28.74%, 17.28%, 10.05%, 77.83%, 43.65% and 14.65% on correlation coefficient (r2) and 11.90%, 7.43%, 14.86%, 41.46%, 31.02% and 22.94% on root-mean-square error (RMSE) compared with the WDL-RF, transfer learning version of WDL-RF (TR-WDL-RF), Attentive FP, GIN, Weave and MPNN predictors, respectively. The TL-MGCN web server, along with the source code and datasets, is freely available for academic purposes at http://www.noveldelta.com/TL_MGCN.
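The two evaluation metrics named in the abstract, and the percentage improvements reported on them, can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: r2 is taken to be the squared Pearson correlation between measured and predicted bioactivities, and "improvement" is taken to be the relative change against a baseline predictor (an increase for r2, a decrease for RMSE). The function names are hypothetical and do not come from the TL-MGCN source code.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between measured and predicted values."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r2_pearson(y_true, y_pred):
    """Squared Pearson correlation (assumed meaning of r2 here)."""
    n = len(y_true)
    mean_t = sum(y_true) / n
    mean_p = sum(y_pred) / n
    cov = sum((t - mean_t) * (p - mean_p) for t, p in zip(y_true, y_pred))
    var_t = sum((t - mean_t) ** 2 for t in y_true)
    var_p = sum((p - mean_p) ** 2 for p in y_pred)
    if var_t == 0 or var_p == 0:
        return 0.0  # correlation is undefined for constant inputs; report 0
    return (cov * cov) / (var_t * var_p)


def relative_improvement(baseline, new, higher_is_better=True):
    """Percentage improvement of `new` over `baseline` (assumed definition).

    For metrics where higher is better (r2) this is the relative increase;
    for error metrics (RMSE) it is the relative decrease.
    """
    if higher_is_better:
        return 100.0 * (new - baseline) / baseline
    return 100.0 * (baseline - new) / baseline
```

Under these assumptions, a baseline r2 of 0.50 raised to 0.60 would count as a 20% improvement, and a baseline RMSE of 2.0 lowered to 1.5 would count as a 25% improvement. These numbers are purely illustrative and are not results from the paper.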