Multi-source Unsupervised Domain Adaptation for ECG Classification

Fucheng Deng,Shikui Tu,Lei Xu
DOI: https://doi.org/10.1109/bibm52615.2021.9669755
2021-01-01
Abstract:It is challenging to build a machine learning model for automatic arrhythmia diagnosis from Electrocardiograph (ECG) signals, because the variation in ECG signals is big between different patients or over time, and the available training datasets usually contain limited, unbalanced number of data for multiple disease types. Most existing methods relied on labeled data from a single dataset, and the performance is poor when generalizing to unseen heart disease types, limited labels, or distribution shifts. In this paper, we propose a multi-source unsupervised domain adaption (MUDA) neural network for ECG classification, to make effective use of data of multiple sources and improve the model’s generalization ability. Our model is featured by a two-branch domain adaption and a sample-imbalance aware mixing strategy to fuse the information across domains. Specifically, one branch is devised to learn domain-invariant representation, while the other is to extract domain-specific features. The two branches align the ECG in the target domain to individual source domain in an exclusive and complementary manner, leading to enhanced discriminative features for domain invariant/specific classifiers. The final prediction, which is a linear combination of the domain classification decisions, is very robust and accurate, by making use of the prior distribution of sample size across domains to place confidence scores over each classifier. Experiments on five ECG datasets indicate superior performance of our method over the existing ones.
What problem does this paper attempt to address?