0662 Accurate Automated Sleep Staging of Narcoleptic Patients Using a Machine Learning Model

Ahmet Cakir,David Josephs,Dave Kleinschmidt,Jay Pathmanathan,Jacob Donoghue,Alexander Chan
DOI: https://doi.org/10.1093/sleep/zsae067.0662
IF: 6.313
2024-04-20
SLEEP
Abstract:Abstract Introduction Accurate sleep staging of EEG data from polysomnography (PSG) is important in the diagnosis of narcolepsy. Human sleep staging is costly and labor intensive, but automated sleep staging algorithms must be rigorously tested in narcoleptic patients to ensure valid performance. PSGs of narcoleptic patients often tend to be more fragmented and variable than in non-narcoleptic populations, making it challenging for both humans and automated algorithms to accurately stage sleep. Here, we evaluate the performance of a deep learning model validated in a general sleep clinic population for staging nocturnal PSGs in patients with narcolepsy. Methods SleepStageMLTM, a deep-learning model for performing sleep staging on EEG signals, was trained on a large database of polysomnography recordings from a heterogenous population within the Beacon Clinico-PSG Database. The algorithm was evaluated on a held-out set of 28 overnight PSGs from patients with narcolepsy or hypersomnolence and 57 overnight PSGs from individuals without narcolepsy or hypersomnolence. Each PSG was manually scored by a human expert, and the performance of the automated algorithm was compared across the two cohorts. Results Automated sleep staging performance was high across both cohorts. The average F1-score for the control cohort and the narcolepsy cohort was 0.758 and 0.744 respectively. The positive percent agreements (PPAs) for the control cohort were 87%, 38%, 84%, 91%, and 93% for stages W, N1, N2, N3, and R respectively. For the narcolepsy cohort, the PPAs across the same stages were 91%, 33%, 81%, 86%, and 88% respectively. The algorithm’s median absolute error in estimating REM latency, REM duration, and REM percentage in the control cohort was 1.25 minutes, 8.5 minutes, and 2% points, respectively. The same metrics for the narcolepsy cohort were 2.75 minutes, 11.75 minutes, and 3% points respectively. Conclusion A deep-learning model trained on diverse data automatically and accurately staged PSGs from narcoleptic patients and was comparable to performance of a human expert. The algorithm estimated REM parameters accurately in both cohorts. Automated staging algorithms like the one described here have the potential to accelerate diagnosis and monitor therapeutic efficacy for narcolepsy treatments by more efficiently and consistently staging sleep. Support (if any)
neurosciences,clinical neurology
What problem does this paper attempt to address?
The paper primarily discusses two independent research topics: ### 1. Study One: Investigating Changes in Slow Wave Activity in Hypersomnolence Disorder (HD) - **Research Objective**: This study aims to assess the Slow Wave Activity (SWA) and Slow Wave Characteristics (SWC) of patients with Hypersomnolence Disorder during Non-Rapid Eye Movement (NREM) sleep and compare them with a healthy control group. - **Background**: Hypersomnolence Disorder is an illness of unknown etiology, with excessive daytime sleepiness as its core symptom. Currently, there is a lack of understanding of biomarkers for this disease. Given the association between slow waves and the restorative properties of sleep, researchers hypothesize that there may be alterations in the slow wave activity of patients with Hypersomnolence Disorder. - **Main Findings**: The results indicate that patients with Hypersomnolence Disorder have significantly reduced slow wave activity across all brain regions, particularly in the frontal and central areas, with more pronounced reductions in the left hemisphere than the right. Additionally, all slow wave characteristics, such as occurrence frequency, peak amplitude, and slope, showed a consistent pattern of reduction. ### 2. Study Two: Utilizing Machine Learning Models for Accurate Automatic Staging of Polysomnography in Patients with Narcolepsy - **Research Objective**: This study aims to evaluate the performance of a deep learning model (SleepStageML™) in the automatic staging of Polysomnography (PSG) for patients with narcolepsy. The model has been trained and validated on PSG data from the general population. - **Background**: Accurate staging of Polysomnography is crucial for the diagnosis of narcolepsy, but manual staging is time-consuming and costly. PSGs of patients with narcolepsy are often more fragmented and variable, making accurate staging challenging for both humans and automated algorithms. - **Main Findings**: The study shows that the deep learning model performs well in the patient group with narcolepsy, with an average F1 score of 0.744, comparable to that of the healthy control group (average F1 score of 0.758). Notably, the estimation of Rapid Eye Movement (REM) stage parameters showed high accuracy in both groups. ### 3. Study Three: Predictive Accuracy of Quantitative Electroencephalography in the Diagnosis of Narcolepsy - **Research Objective**: The goal of this study is to evaluate the predictive accuracy of Quantitative Electroencephalography (qEEG) in diagnosing narcolepsy and to compare it with the standard diagnostic tool—the Multiple Sleep Latency Test (MSLT). - **Background**: Although the Multiple Sleep Latency Test is considered the gold standard for diagnosing narcolepsy, it has limitations, such as scheduling difficulties and potential influences from other factors. Previous research has indicated that patients with narcolepsy exhibit imbalances in EEG signals at specific frequencies and brain regions. All three studies are dedicated to improving or deepening our understanding and diagnostic methods for the two sleep disorders, Hypersomnolence Disorder and narcolepsy, through modern technological means.