Abstract:Background: Although deep learning methods have shown great promise for identification of structural and functional cardiac abnormalities using electrocardiographic data, these methods are data hungry, posing a challenge for critically important tasks where ground truth labels are relatively scarce. Impaired coronary microvascular and vasomotor function is difficult to identify with standard clinical methods of cardiovascular testing such as coronary angiography and noninvasive single photon emission tomography (SPECT) myocardial perfusion imaging (MPI). Gold standard data from positron emission tomography (PET) are gaining emphasis in clinical guidelines but are expensive and only available in relatively limited centers. We hypothesized that signals embedded within resting and stress electrocardiograms (ECGs) identify individuals with microvascular and vasomotor dysfunction. Methods: We developed and pretrained a self-supervised foundation vision transformer model using a large database of unlabeled ECG waveforms (N=800,035). We then fine-tuned the foundation model for two clinical tasks: the difficult problem of identifying patients with impaired myocardial flow reserve (AI-MFR), and the relatively easier problem of detecting impaired LVEF (AI-LVEF). A second ECG database was labeled with task-specific annotations derived from quantitative PET MPI (N=4167). Diagnostic accuracy of AI predictions was tested in a holdout set of patients undergoing PET MPI (N=1031). Prognostic evaluation was performed in the PET holdout cohort, as well as independent cohorts of patients undergoing pharmacologic or exercise stress SPECT MPI (N=6635). Results: The diagnostic accuracy of AI-MFR with SSL pretraining increased significantly compared to de novo supervised training (AUROC, sensitivity, specificity: 0.758, 70.1%, 69.4% vs. 0.632, 66.1%, 57.3%, p<0.0001). SSL pretraining also produced a smaller increase in AI-LVEF accuracy (AUROC, sensitivity, specificity: 0.946, 89.4%, 85.9% vs. 0.918, 87.6%, 82.5%, p<0.02). Abnormal AI-MFR was found to be significantly associated with mortality risk in all three test cohorts (Hazard Ratio (HR) 2.61 [95% CI 1.83, 3.71], p<0.0001, PET cohort; HR 2.30 [2.03, 2.61], p<0.0001, pharmacologic stress SPECT cohort; HR 3.76 [2.36, 5.99], p<0.0001, exercise stress SPECT cohort). Conclusion: SSL pretraining of a vision transformer foundation model enabled identification of signals predictive of impaired MFR, a hallmark of microvascular and vasomotor dysfunction, and impaired LV function in resting and stress ECG waveforms. These signals are powerful predictors of prognosis in patients undergoing routine noninvasive stress testing and could enable more efficient diagnosis and management of these common conditions.

COMFORT: A Continual Fine-Tuning Framework for Foundation Models Targeted at Consumer Healthcare

Consformer: Consciousness Detection Using Transformer Networks With Correntropy-Based Measures

DOCTOR: A Multi-Disease Detection Continual Learning Framework Based on Wearable Medical Sensors

Assessing Foundation Models' Transferability to Physiological Signals in Precision Medicine

Safe physical interaction with cobots: a multi-modal fusion approach for health monitoring

Self-supervised deep representation learning of a foundation transformer model enabling efficient ECG-based assessment of cardiac and coronary function with limited labels

Revolutionizing health monitoring: Integrating transformer models with multi-head attention for precise human activity recognition using wearable devices

SiamQuality: A ConvNet-Based Foundation Model for Imperfect Physiological Signals

Demo Abstract: CaringFM: an Interactive In-home Healthcare System Empowered by Large Foundation Models

Toward Foundation Model for Multivariate Wearable Sensing of Physiological Signals

Large-scale Training of Foundation Models for Wearable Biosignals

IoT-enabled healthcare transformation leveraging deep learning for advanced patient monitoring and diagnosis

BEHRT: Transformer for Electronic Health Records

Medformer: A Multi-Granularity Patching Transformer for Medical Time-Series Classification

A Multi-Center Study on the Adaptability of a Shared Foundation Model for Electronic Health Records

An Improved ConvNeXt with Multimodal Transformer for Physiological Signal Classification

MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report

Self-supervised Pretraining and Transfer Learning Enable Flu and COVID-19 Predictions in Small Mobile Sensing Datasets

Probing the Efficacy of Federated Parameter-Efficient Fine-Tuning of Vision Transformers for Medical Image Classification