Abstract:Numerous supervised learning models aimed at classifying 12-lead electrocardiograms into different groups have shown impressive performance by utilizing deep learning algorithms. However, few studies are dedicated to applying the Generative Pre-trained Transformer (GPT) model in interpreting electrocardiogram (ECG) using natural language. Thus, we are pioneering the exploration of this uncharted territory by employing the CardioGPT model to tackle this challenge. We used a dataset of ECGs (standard 10s, 12-channel format) from adult patients, with 60 distinct rhythms or conduction abnormalities annotated by board-certified, actively practicing cardiologists. The ECGs were collected from The First Affiliated Hospital of Ningbo University and Shanghai East Hospital. The dataset is partitioned into training (80%), validation (10%), and test (10%) cohorts for comprehensive evaluation. Each cohort contains ECGs from distinct patients, considering some patients took repeated ECG measurements. The proposed algorithm is evaluated in two levels, self-performance measurement and comparison with the residual neural network classification model. Two scores are used for self-performance measurement, including Bilingual Evaluation Understudy (BLEU) and Recall-Oriented Understudy for Gisting Evaluation (ROUGE). To compare the performance of the proposed model with the residual neural network model, we assessed the F1 score and area under the receiver operating characteristic curve (AUC). We have observed promising performance metrics across multiple evaluation criteria through an extensive evaluation of a large 12-lead ECG database comprising 1,128,553 ECG readings from 754,920 patients. The CardioGPT model exhibited high BLEU and ROUGE scores with 0.68 (95% CI: 0.66, 0.71) and 0.81 (95% CI: 0.79, 0.84). Furthermore, in the classification performance measurement setting, the CardioGPT achieved an average F1-score of 0.91(95% CI: 0.89, 0.93) and AUC of 0.82(95% CI: 0.79, 0.84) and has higher scores than that of the convolutional neural network model, indicating its proficiency in accurately classifying ECG recordings. By leveraging the power of transformer structure model and natural language processing, the GPT model addresses the challenge of imbalanced learning commonly encountered in ECG classification tasks. The results indicate that the GPT model can accurately interpret ECG using natural language, providing valuable insights into the underlying patterns and abnormalities present in the data. Significance: The pioneering application of the GPT model for interpreting ECGs with natural language demonstrates its potential to address ECG classification challenges and offer valuable insights into cardiac health.

ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling

Critical Care Studies Using Large Language Models Based on Electronic Healthcare Records: A Technical Note

Electrocardiogram-Language Model for Few-Shot Question Answering with Meta Learning

ECG Semantic Integrator (ESI): A Foundation ECG Model Pretrained with LLM-Enhanced Cardiological Text

ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis

ECGBERT: Understanding Hidden Language of ECGs with Self-Supervised Representation Learning

Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

De-biased Multimodal Electrocardiogram Analysis

Transfer Knowledge from Natural Language to Electrocardiography: Can We Detect Cardiovascular Disease Through Language Models?

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

BELT-2: Bootstrapping EEG-to-Language representation alignment for multi-task brain decoding

Automated Cardiovascular Record Retrieval by Multimodal Learning between Electrocardiogram and Clinical Report

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Towards Linguistic Neural Representation Learning and Sentence Retrieval from Electroencephalogram Recordings

Deep Representation Learning for Open Vocabulary Electroencephalography-to-Text Decoding

CardioGPT: An ECG Interpretation Generation Model

Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling

Enhancing Electrocardiogram Signal Analysis Using NLP-Inspired Techniques: A Novel Approach with Embedding and Self-Attention

Hidden States in LLMs Improve EEG Representation Learning and Visual Decoding

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities