Improving generalization performance of electrocardiogram classification models
Hyeongrok Han,Seongjae Park,Seonwoo Min,Eunji Kim,HyunGi Kim,Sangha Park,Jinkook Kim,Junsang Park,Junho An,Kwanglo Lee,Wonsun Jeong,Sangil Chon,Kwonwoo Ha,Myungkyu Han,Hyun-Soo Choi,Sungroh Yoon
DOI: https://doi.org/10.1088/1361-6579/acb30f
2023-01-15
Physiological Measurement
Abstract:Objective. Recently, many electrocardiogram (ECG) classification algorithms using deep learning have been proposed. Because the ECG characteristics vary across datasets due to variations in factors such as recorded hospitals and the race of participants, the model needs to have a consistently high generalization performance across datasets. In this study, as part of the PhysioNet/Computing in Cardiology Challenge 2021, we present a model to classify cardiac abnormalities from the 12- and the reduced-lead ECGs. Approach. To improve the generalization performance of our earlier proposed model, we adopted a practical suite of techniques, i.e., constant-weighted cross-entropy loss, additional features, mixup augmentation, squeeze/excitation block, and OneCycle learning rate scheduler. We evaluated its generalization performance using the leave-one-dataset-out cross-validation setting. Furthermore, we showed that the knowledge distillation from the 12-lead model and the large teacher model improved the performance of the reduced-lead model and the small student model. Main results. With the proposed model, our DSAIL SNU team has received challenge scores of 0.55, 0.58, 0.58, 0.57, and 0.57 (ranked 2nd, 1st, 1st, 2nd, and 2nd of 39 teams) for the 12-, 6-, 4-, 3-, and 2-lead versions of the hidden test set, respectively. Significance. The proposed model achieved a higher generalization performance over six different hidden test datasets than the one we submitted to the PhysioNet Challenge 2020.
engineering, biomedical,biophysics,physiology