mosGraphGPT: a foundation model for multi-omic signaling graphs using generative AI

Heming Zhang,Di Huang,Emily Chen,Dekang Cao,Tim Xu,Ben Dizdar,Guangfu Li,Yixin Chen,Philip Payne,Michael Province,Fuhai Li
DOI: https://doi.org/10.1101/2024.08.01.606222
2024-08-06
bioRxiv
Abstract:Generative pretrained models represent a significant advancement in natural language processing and computer vision, which can generate coherent and contextually relevant content based on the pre-training on large general datasets and fine-tune for specific tasks. Building foundation models using large scale omic data is promising to decode and understand the complex signaling language patterns within cells. Different from existing foundation models of omic data, we build a foundation model, mosGraphGPT, for multi-omic signaling (mos) graphs, in which the multi-omic data was integrated and interpreted using a multi-level signaling graph. The model was pretrained using multi-omic data of cancers in The Cancer Genome Atlas (TCGA), and fine-turned for multi-omic data of Alzheimer's Disease (AD). The experimental evaluation results showed that the model can not only improve the disease classification accuracy, but also is interpretable by uncovering disease targets and signaling interactions. And the model code are uploaded via GitHub with link: https://github.com/mosGraph/mosGraphGPT.
What problem does this paper attempt to address?