Comparative assessment of synthetic time series generation approaches in healthcare: leveraging patient metadata for accurate data synthesis

Imanol Isasa,Mikel Hernandez,Gorka Epelde,Francisco Londoño,Andoni Beristain,Xabat Larrea,Ane Alberdi,Panagiotis Bamidis,Evdokimos Konstantinidis
DOI: https://doi.org/10.1186/s12911-024-02427-0
IF: 3.298
2024-02-01
BMC Medical Informatics and Decision Making
Abstract:Synthetic data is an emerging approach for addressing legal and regulatory concerns in biomedical research that deals with personal and clinical data, whether as a single tool or through its combination with other privacy enhancing technologies. Generating uncompromised synthetic data could significantly benefit external researchers performing secondary analyses by providing unlimited access to information while fulfilling pertinent regulations. However, the original data to be synthesized (e.g., data acquired in Living Labs) may consist of subjects' metadata (static) and a longitudinal component (set of time-dependent measurements), making it challenging to produce coherent synthetic counterparts.
medical informatics
What problem does this paper attempt to address?