WGTDA: A Topological Perspective to Biomarker Discovery in Gene Expression Data

Ndivhuwo Nyase, Lebohang Mashatola, Aviwe Kohlakala, Kahn Rhrissorrakrai, Stephanie Muller
2024-02-14
Abstract: Advancing the discovery of prognostic cancer biomarkers is crucial for comprehending disease mechanisms, refining treatment plans, and improving patient outcomes. This study introduces Weighted Gene Topological Data Analysis (WGTDA), a framework that uses topological principles to identify gene interactions and distinctive biomarker features. WGTDA is evaluated against Weighted Gene Co-expression Network Analysis (WGCNA), and the comparison indicates that topology-based biomarkers are more reliable predictors of survival probability than WGCNA's hub genes. Furthermore, WGTDA identifies gene signatures that are significant to survival probability, irrespective of whether the expression is above or below the median. WGTDA provides a new perspective on biomarker discovery by uncovering intricate gene-to-gene relationships often overlooked by conventional correlation-based analyses. This highlights the potential advantage of leveraging topological concepts to extract crucial information about gene-gene interactions.
Quantitative Methods
What problem does this paper attempt to address?
This paper introduces a new method for discovering biomarkers in gene expression data, called Weighted Gene Topological Data Analysis (WGTDA). Existing methods, such as Weighted Gene Co-expression Network Analysis (WGCNA), rely on correlation and hierarchical clustering, whereas WGTDA uses topological principles to identify gene interactions and distinctive biomarker features. In a comparison with WGCNA, the study reports that WGTDA's topology-based biomarkers are more reliable predictors of survival probability than WGCNA's hub genes, and that WGTDA identifies gene signatures that are significant to survival probability regardless of whether their expression is above or below the median. This suggests that WGTDA can reveal complex gene-to-gene relationships that conventional correlation-based analyses often overlook, and it highlights the potential of topological concepts for extracting key information about gene interactions. The evaluation uses survival analysis of breast, lung, and colorectal cancer data from the TCGA dataset, where WGTDA identifies potential gene biomarkers for targeted cancer research.
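To make the "above or below the median" comparison concrete, the following is a minimal sketch of a median-split survival comparison using a two-group log-rank test. It is an assumption about how such a comparison is commonly set up, not the paper's actual pipeline: the function names `median_split` and `logrank_test` are hypothetical, and the expression values and survival times are made-up toy numbers, not TCGA data.

```python
import math
from statistics import median


def median_split(expression):
    """Flag samples whose expression is strictly above the cohort median."""
    cutoff = median(expression)
    return [value > cutoff for value in expression]


def logrank_test(times, events, in_high):
    """Two-group log-rank test; returns (chi2, p_value) with 1 degree of freedom.

    times   : follow-up time per sample
    events  : 1 if the event (death) was observed, 0 if censored
    in_high : True if the sample belongs to the high-expression group
    """
    observed_minus_expected = 0.0
    variance = 0.0
    # Iterate over the distinct times at which at least one event occurred.
    event_times = sorted({t for t, e in zip(times, events) if e})
    for t in event_times:
        n = sum(1 for ti in times if ti >= t)  # at risk, both groups
        n_high = sum(1 for ti, h in zip(times, in_high) if ti >= t and h)
        d = sum(1 for ti, e in zip(times, events) if ti == t and e)
        d_high = sum(
            1 for ti, e, h in zip(times, events, in_high) if ti == t and e and h
        )
        # Expected events in the high group under the null hypothesis.
        observed_minus_expected += d_high - d * n_high / n
        if n > 1:
            frac = n_high / n
            variance += d * frac * (1 - frac) * (n - d) / (n - 1)
    if variance == 0:
        return 0.0, 1.0
    chi2 = observed_minus_expected ** 2 / variance
    # Survival function of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Toy cohort (illustrative values only): one candidate gene, ten patients.
expression = [9.0, 8.0, 7.5, 8.2, 9.9, 1.0, 2.0, 1.5, 2.2, 0.9]
times = [1, 2, 3, 4, 5, 10, 11, 12, 13, 14]
events = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1]

in_high = median_split(expression)
chi2, p_value = logrank_test(times, events, in_high)
print(f"chi2={chi2:.3f}, p={p_value:.4f}")
```

A small p-value here only says that the two expression groups have different survival curves in the toy data. In practice a library such as lifelines would typically replace the hand-written test, and p-values across many genes would need multiple-testing correction.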