Logo

American Heart Association

  1
  0


Final ID: WMP49

A machine learning approach to classifying ischemic stroke etiology using variables available in the Get-with-the-Guidelines Stroke Registry

Abstract Body: Background: There is a scarcity of US data about the epidemiology of ischemic stroke etiologies. The stroke etiology data field in the Get-with-the-Guidelines (GWTG) Stroke Registry has a 43% missing rate. We sought to develop a scalable machine-learning model to predict ischemic stroke etiology using variables already collected in the GWTG Stroke registry.

Methods: We linked data from two GWTG Stroke registries from YNHH and MGH from 2015-2020 with non-missing, non-cryptogenic stroke etiology inputs adjudicated by the agreement of at least 2 board-certified vascular neurologists’ review of the EHR data from the acute ischemic stroke hospitalization. We trained machine-learning classifiers using XGBoost to predict non-cryptogenic stroke etiology per 4-level TOAST classification using sequentially increasing number of features along with imputation of missing values: (1) Model-1 with age and sex, (2) Model-2 with all GWTG variables excluding ICD codes, and (3) Model-3 with all GWTG variables including ICD codes. Each model underwent Bayesian hyperparameter optimization with 1,000 trials and 5-fold cross-validation. Model performance was evaluated by the following metrics: AUCROC, accuracy, and Cohen’s kappa.

Results: The cohort included 2,724 patients (median age 70; males = 53%). The number of features was 80 in Model 2 and 2,733 in Model 3. In Model-1, the cross-validation mean AUCROC was 0.64, accuracy 0.45, F1 0.30, and kappa 0.063. Model-2's mean AUCROC was 0.78, accuracy 0.57, F1 0.56, and kappa 0.36. Model-3's mean AUCROC was 0.83, accuracy 0.62, F1 0.62, and kappa 0.44. The performance of the three models with respect to diagnosing each etiology is presented in Figure 1.

Conclusion: We developed and validated a proof-of-concept machine-learning model built using variables available in the GWTG Stroke Registry including embedded ICD codes that yield a predicted ischemic stroke etiology in moderate agreement with board-certified vascular neurologists. Upon further improvement, its application to the National GWTG Stroke Registry could allow automatic population of the stroke etiology variable and facilitate etiologic-specific quality evaluation and epidemiologic studies.
  • Lee, Ho-joon  ( Yale School of Medicine , New Haven , Connecticut , United States )
  • Schwamm, Lee  ( Yale School of Medicine , New Haven , Connecticut , United States )
  • Turner, Ashby  ( Massachusetts General Hospital , Chestnut Hill , Massachusetts , United States )
  • De Havenon, Adam  ( Yale University , New Haven , Connecticut , United States )
  • Kamel, Hooman  ( Weill Cornell Medicine , New York , New York , United States )
  • Brandt, Cynthia  ( Yale University , New Haven , Connecticut , United States )
  • Zhao, Hongyu  ( Yale School of Medicine , New Haven , Connecticut , United States )
  • Krumholz, Harlan  ( Yale University , New Haven , Connecticut , United States )
  • Sharma, Richa  ( Yale School of Medicine , Hamden , Connecticut , United States )
  • Author Disclosures:
    Ho-Joon Lee: DO have relevant financial relationships ; Consultant:Guidepoint:Active (exists now) | Lee Schwamm: DO have relevant financial relationships ; Consultant:genentech:Active (exists now) ; Advisor:Penumbra:Past (completed) ; Consultant:medtronic:Active (exists now) | Ashby Turner: DO NOT have relevant financial relationships | Adam de Havenon: DO have relevant financial relationships ; Research Funding (PI or named investigator):NIH/NINDS:Active (exists now) ; Researcher:UptoDate:Active (exists now) ; Individual Stocks/Stock Options:Certus:Active (exists now) ; Individual Stocks/Stock Options:TitinKM:Active (exists now) ; Consultant:Novo Nordisk:Active (exists now) ; Research Funding (PI or named investigator):AAN:Active (exists now) | Hooman Kamel: DO have relevant financial relationships ; Other (please indicate in the box next to the company name):Financial disclosures for Hooman Kamel: a PI role in the ARCADIA trial, which received in-kind study drug from the BMS-Pfizer Alliance for Eliquis and ancillary study support from Roche Diagnostics; a Deputy Editor role for JAMA Neurology; clinical trial steering/executive committee roles for the STROKE-AF (Medtronic), LIBREXIA-AF (Janssen), and LAAOS-4 (Boston Scientific) trials; consulting or endpoint adjudication committee roles for AbbVie, AstraZeneca, Boehringer Ingelheim, and Novo Nordisk; and household ownership interests in TETMedical, Spectrum Plastics Group, and Ascential Technologies.:Active (exists now) | cynthia brandt: DO NOT have relevant financial relationships | Hongyu Zhao: DO NOT have relevant financial relationships | Harlan Krumholz: DO have relevant financial relationships ; Individual Stocks/Stock Options:Element Science:Active (exists now) ; Research Funding (PI or named investigator):Pfizer:Active (exists now) ; Research Funding (PI or named investigator):Novartis:Active (exists now) ; Research Funding (PI or named investigator):Kenvue:Active (exists now) ; Research Funding (PI or named investigator):Janssen:Active (exists now) ; Ownership Interest:ENSIGHT-AI:Active (exists now) ; Ownership Interest:Refactor Health:Active (exists now) ; Ownership Interest:Hugo Health:Active (exists now) ; Advisor:F-Prime:Active (exists now) ; Individual Stocks/Stock Options:Identifeye:Active (exists now) | Richa Sharma: DO NOT have relevant financial relationships
Meeting Info:
Session Info:

Health Services, Quality Improvement, and Patient-Centered Outcomes Moderated Poster Tour I

Wednesday, 02/05/2025 , 06:00PM - 07:00PM

Moderated Poster Abstract Session

More abstracts on this topic:
A Machine Learning Approach to Simplify Risk Stratification of Patients with Atherosclerotic Cardiovascular Disease

Li Hsin Fang, Gluckman Ty, Nute Andrew, Weerasinghe Roshanthi, Wendt Staci, Wilson Eleni, Sidelnikov Eduard, Kathe Niranjan, Swihart Charissa, Jones Laney

Advanced Cardiac MRI Parameters Phenomapping to Predict Outcomes in Patients with Nonischemic Cardiomyopathy

Ammoury Carl, Tang Wai Hong, Wang Xiaofeng, Kwon Deborah, Wu Yanjun, Korkerdsup Theerawat, Wang Tom Kai Ming, Rizkallah Diane, Bodi Kashyap, Calcagno Tess, Chen David, Nguyen Christopher

More abstracts from these authors:
Health and Coverage: How Co-Morbidities and Insurance Status Affect Post-Stroke Blood Pressure Control

Williams Jasmin, Forman Rachel, Wu Qifan, Sharma Richa, Sheth Kevin, De Havenon Adam, Lu Yuan

Evolution of Ischemic Stroke Etiology Diagnosis During Stroke Hospitalization

Ellsworth Patrick, Lee Ho-joon, Sansing Lauren, Schwamm Lee, Thottempudi Neeharika, Sharma Richa

You have to be authorized to contact abstract author. Please, Login
Not Available

Readers' Comments

We encourage you to enter the discussion by posting your comments and questions below.

Presenters will be notified of your post so that they can respond as appropriate.

This discussion platform is provided to foster engagement, and simulate conversation and knowledge sharing.

 

You have to be authorized to post a comment. Please, Login or Signup.


   Rate this abstract  (Maximum characters: 500)