Patient Health Representation Learning via Correlational Sparse Prior of Medical Features

Xinyu Ma,Yasha Wang,Xu Chu,Liantao Ma,Wen Tang,Junfeng Zhao,Ye Yuan,Guoren Wang
DOI: https://doi.org/10.1109/tkde.2022.3230454
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Exploiting the correlations between medical features is essential to the success of healthcare data analysis. However, most existing methods are either suffering large estimation variance for data insufficiency or inflexible in terms of demanding task-specific medical knowledge. In this paper, we propose a novel patient health representation learning framework dubbed SAFARI. SAFARI learns a compact representation by imposing a clinical-fact-inspired task-agnostic correlational sparsity prior to the correlations of medical feature pairs. Specifically, we learn the compact representation by solving the bi-level optimization problem, which involves solving the high-level inter-group correlations and the nested lower-level intra-group correlations. We leverage the Laplacian kernel as a robust metric for feature grouping and graph neural networks for solving the bi-level optimization problem following the optimal value reformulation paradigm. Experiments on five datasets of various inputs and tasks demonstrate the efficacy of SAFARI. The discovered findings are also consistent with our insights and medical literature, which can provide valuable clinical explanations.
computer science, information systems, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?