Grouped Correlational Generative Adversarial Networks for Discrete Electronic Health Records.
Fan Yang, Zhongping Yu, Yunfan Liang, Xiaolu Gan, Kaibiao Lin, Quan Zou, Yifeng Zeng
DOI: https://doi.org/10.1109/bibm47256.2019.8983215
2019-01-01
Abstract: Using Generative Adversarial Networks (GANs) to generate synthetic Electronic Health Records (EHRs) has attracted increasing attention. However, existing approaches treat the events in EHRs as separate variables and feed them into the model indiscriminately, without taking their meaning or grouping into account. In addition, the efficacy of treatment is often neglected. In this paper, we first embed efficacy information into the disease diagnosis, and then propose the Grouped Correlational GAN (GcGAN) to explicitly learn the inherent correlations between different groups of variables. We also introduce a dense connection to strengthen the capacity of the generator in GcGAN. Experimental results on real-world data demonstrate that data generated by GcGAN can simulate real-world data in terms of distribution statistics. Results on multi-label treatment recommendation tasks show that augmenting the training dataset with the generated data improves performance, and that GcGAN outperforms state-of-the-art approaches. GcGAN can also automatically distinguish between disease-specific drugs and adjuvant drugs, which enhances model interpretability.