Spatiotemporal self-supervised representation learning from multi-lead ECG signals

Rui Hu,Jie Chen,Li Zhou
DOI: https://doi.org/10.1016/j.bspc.2023.104772
IF: 5.1
2023-07-01
Biomedical Signal Processing and Control
Abstract:Automatic analysis of electrocardiogram(ECG) signals is one of the applications in the medical domain where deep learning methods demonstrate impressive performance. However, label scarcity remains a critical obstacle limiting the development and clinical application of deep learning methods. Self-supervised pretraining exhibits its potential to alleviate this issue. Motivated by the idea that multi-lead ECG signals gathered simultaneously represent similar underlying cardiac status, we propose ECG-MAE, a novel generative self-supervised pretraining approach. Specifically, an efficient encoder is designed to learn spatiotemporal representations by reconstructing 12-lead ECG signals that have been randomly masked in both time and lead dimensions. Experiments on a multi-label classification problem reveal that our pretrained model obtains a macro-averaged AUC of 0.9474, exceeding prior studies. We observe enhanced downstream performance and label efficiency compared with purely supervised models, allowing us to train more powerful models with the same amount of data and diagnose rare diseases beyond the reach of conventional methods.
engineering, biomedical
What problem does this paper attempt to address?