Detection of multiple cardiac abnormalities using Convolution, Positional Encoder and Transformer on 12-lead ECG recordings

Adam Ford,Jessica Lan,Kyle Ng
DOI: https://doi.org/10.1101/2024.06.17.24309016
2024-06-17
Abstract:Objectives: Firstly, we aimed to develop a system capable of detecting multiple cardiac abnormalities simultaneously from 12-lead ECG recordings. Secondly, we tried to improve the detection by analyzing the relationship between imbalanced datasets and optimal classification thresholds. Methods: A novel fusion of Convolution Positional Encoder and Transformer Encoder was used to solve the multi-label classification problem. We used a proper evaluation metric called area under the precision-recall curve (AUPRC) that enabled us to analyze the precision-recall trade-off and find the optimal thresholds. Results: Having outperformed other popular deep networks, the model achieved the highest AUPRC of 0.96 and -score of 0.90 on 42511-sample datasets. We also found the negative correlation coefficient of -0.68 between optimal thresholds and the proportion of positive samples. Significance & Conclusion: This study compared the performances of different deep learning architectures on a medical problem and showed the potential of advanced techniques in capturing spatial, temporal features alongside attention mechanisms. It also introduced how to reduce the impact of imbalanced datasets and find optimal classification thresholds.
Health Informatics
What problem does this paper attempt to address?