A Hybrid-scales Graph Contrastive Learning Framework for Discovering Regularities in Traditional Chinese Medicine Formula

Yingpei Wu, Zecheng Yin, Kaiyuan Zhou, Ruofei Wang, Yun Yang, Zepeng Yin, Chunyang Ruan, Yanchun Zhang
DOI: https://doi.org/10.1109/bibm52615.2021.9669658
2021-01-01
Abstract: Discovering regularities in Traditional Chinese Medicine (TCM) formulae has been a hot topic in assisting TCM clinical treatment and poly-pharmacology research. Several machine learning methods, such as topic models, auto-encoders, and graph neural networks (GNNs), have been proposed for discovering regularities in TCM. However, they are often limited by data challenges specific to TCM formulae (e.g., complex relations with rich TCM knowledge, sparsity and ambiguity, and expensive data labeling). To address these challenges, we first establish a TCM Attributed Heterogeneous Information Network (TAHIN) for modeling massive formulae, which can assemble various types of additional information and capture their relations. Based on the TAHIN, we further propose a novel hybrid-scales graph contrastive learning framework that learns high-quality node representations in a fully unsupervised manner, which can be helpful for various regularity-discovery tasks such as herb classification and herb similarity search. Extensive experiments demonstrate the effectiveness and interpretability of our method. Our source code and datasets are available at https://github.com/Yonggie/HsCTRD.