Predicting post-stroke cognitive impairment using electronic health record data

Jeffrey M Ashburner,Yuchiao Chang,Bianca Porneala,Sanjula D Singh,Nirupama Yechoor,Jonathan M Rosand,Daniel E Singer,Christopher D Anderson,Steven J Atlas
DOI: https://doi.org/10.1177/17474930241246156
2024-03-28
International Journal of Stroke
Abstract:Background: Secondary prevention interventions to reduce post-stroke cognitive impairment (PSCI) can be aided by the early identification of high-risk individuals who would benefit from risk factor modification. Aims: To develop and evaluate a predictive model to identify patients at increased risk of PSCI over 5 years using data easily accessible from electronic health records. Methods: Cohort study that included primary care patients from two academic medical centers. Patients were 45 years or older, without prior stroke or prevalent cognitive impairment, with primary care visits and an incident ischemic stroke between 2003-2016 (development/internal validation cohort) or 2010-2022 (external validation cohort). Predictors of PSCI were ascertained from the electronic health record. The outcome was incident dementia/cognitive impairment within 5 years and beginning 3 months following stroke, ascertained using ICD-9/10 codes. For model variable selection, we considered potential predictors of PSCI and constructed 400 bootstrap samples with two-thirds of the model derivation sample. We ran 10-fold cross-validated Cox proportional hazards models using a least absolute shrinkage and selection operator (LASSO) penalty. Variables selected in >25% of samples were included. Results: The analysis included 332 incident diagnoses of PSCI in the development cohort (n=3,741), and 161 and 128 incident diagnoses in the internal (n=1,925) and external (n=2,237) validation cohorts. The c-statistic for predicting PSCI was 0.731 (95% CI: 0.694-0.768) in the internal validation cohort, and 0.724 (95% CI: 0.681-0.766) in the external validation cohort. A risk score based on the beta coefficients of predictors from the development cohort stratified patients into low (0-7 points), intermediate (8-11 points), and high (12-35 points) risk groups. The hazard ratios for incident PSCI were significantly different by risk categories in internal (High, HR: 6.2, 95% CI 4.1-9.3; Intermediate, HR 2.7, 95% CI: 1.8-4.1) and external (High, HR: 6.1, 95% CI: 3.9-9.6; Intermediate, HR 2.8, 95% CI: 1.9-4.3) validation cohorts. Conclusions: Five-year risk of PSCI can be accurately predicted using routinely collected data. Model output can be used to risk stratify and identify individuals at increased risk for PSCI for preventive efforts. Data access statement: Mass General Brigham data contains protected health information and cannot be shared publicly. The data processing scripts used to perform analyses will be made available to interested researchers upon reasonable request to the corresponding author.
peripheral vascular disease,clinical neurology
What problem does this paper attempt to address?