Logo

American Heart Association

  1
  0


Final ID: WP148

Assessment of the modified Rankin Scale in Electronic Health Records with Deep Learning

Abstract Body: The modified Rankin Scale (mRS) is a commonly used functional outcome assessment in stroke clinical trials and has become an essential metric for stroke research. However, the mRS is not routinely obtained in clinical practice. When necessary, the mRS can be derived retrospectively from electronic health record (EHR) documentation review, but this process is labor-intensive and subjective. Large language models, such as GPT-4, have shown promise in automating chart review for various health outcomes. We developed and tested the ability of a large language model to extract an mRS score from EHR text.

Acute ischemic stroke patients hospitalized within the M Health Fairview system between August 2020 and June 2023 had their charts reviewed at discharge and 90 days post-hospitalization. The mRS score at each clinical encounter was assessed by two independent researchers with any disagreements resolved through case discussion; a third researcher’s assessment was utilized when consensus could not be met. Additionally, each reviewer collected critical EHR text for mRS determination. Extracted EHR text and the corresponding mRS score for each encounter were used for model training and evaluation.
Two separate models were trained on the data: the first received all seven classes of the mRS, and the second received binomial mRS scores reflecting functional independence versus non-independence (mRS 0-2 vs. 3-6). Four-fold cross-validation was conducted, using accuracy and Cohen's kappa as model performance metrics. The base language model employed was Gatortron.

A total of 1616 EHR texts and mRS scores were included in the analysis. The first model—considering all seven classes of the mRS—presented with an accuracy of 74% and a Cohen's Kappa of 0.69. The highest class specific accuracy was achieved on mRS 4 determination (90%). The lowest class specific accuracy was achieved on mRS 2 determination (28%). The second model—considering only 2 classes—achieved an accuracy of 93% and Cohen's Kappa of 0.84.

Our findings demonstrate that large language models can be successfully trained to determine mRS scores through EHR text analysis. The multi-class model had the lowest class specific precision when determining an mRS score of 2, but the outstanding accuracy of the two-class model suggests that this could be improved with more training examples. Validation of our findings in different institutions is warranted to ensure model generalizability.
  • Milani, Marcus  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Tessmer, Megan  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Cooper, Dawson  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Guo, Yugene  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Stangebye, Alex  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Valcinord, Carl-lewis  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Yeung, Jeremy  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Lakshminarayan, Kamakshi  ( UNIVERSITY OF MINNESOTA , Minneapolis , Minnesota , United States )
  • Streib, Christopher  ( UNIVERSITY OF MINNESOTA , Minneapolis , Minnesota , United States )
  • Silva, Luis  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Bindra, Sohum  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Kumar, Kompal  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Jagtap, Pramit  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Chakravarthula, Nitin Ramanujam  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Ergen, Halil  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Frederickson, Kaylee  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Datta, Abhigyan  ( University of Minnesota , Minneapolis , Minnesota , United States )
  • Author Disclosures:
    Marcus Milani: DO NOT have relevant financial relationships | Megan Tessmer: DO NOT have relevant financial relationships | Dawson Cooper: DO NOT have relevant financial relationships | Yugene Guo: DO NOT have relevant financial relationships | Alex Stangebye: DO NOT have relevant financial relationships | Carl-Lewis Valcinord: DO NOT have relevant financial relationships | Jeremy Yeung: No Answer | Kamakshi Lakshminarayan: DO NOT have relevant financial relationships | Christopher Streib: DO NOT have relevant financial relationships | Luis Silva: DO NOT have relevant financial relationships | Sohum Bindra: No Answer | Kompal Kumar: DO NOT have relevant financial relationships | Pramit Jagtap: DO NOT have relevant financial relationships | Nitin Ramanujam Chakravarthula: DO NOT have relevant financial relationships | Halil Ergen: DO NOT have relevant financial relationships | Kaylee Frederickson: DO NOT have relevant financial relationships | Abhigyan Datta: DO NOT have relevant financial relationships
Meeting Info:
Session Info:

Health Services, Quality Improvement, and Patient-Centered Outcomes Posters I

Wednesday, 02/05/2025 , 07:00PM - 07:30PM

Poster Abstract Session

More abstracts on this topic:
10-Year Trends in Last Known Well to Arrival Time in Acute Ischemic Stroke Patients: 2014-2023

Ferrone Nicholas, Sanmartin Maria, O'hara Joseph, Jimenez Jean, Ferrone Sophia, Wang Jason, Katz Jeffrey, Sanelli Pina

Accuracy, Hallucinations, and Misgeneralizations of Large Language Models in Reviewing Cardiology Literature

Fang Spencer, Pillai Joshua, Mahin Baharullah, Kim Sarah

More abstracts from these authors:
A Real-World Pilot for Diagnostic Yield of Cardiac CTA vs Echocardiography in Acute Ischemic Stroke

Chakravarthula Nitin Ramanujam, Milani Marcus, Tessmer Megan, Staugaitis Abbey, Akimoto Kai, Markowitz Jeremy, Kalra Rajat, Nijjar Prabhjot, Streib Christopher

Targeted versus High-Intensity Monitoring Following Intravenous Thrombolysis in Acute Ischemic Stroke

Valcinord Carl-lewis, Roberts Rebecca, Bindra Sohum, Milani Marcus, Staugaitis Abbey, Tessmer Megan, Streib Christopher

You have to be authorized to contact abstract author. Please, Login
Not Available

Readers' Comments

We encourage you to enter the discussion by posting your comments and questions below.

Presenters will be notified of your post so that they can respond as appropriate.

This discussion platform is provided to foster engagement, and simulate conversation and knowledge sharing.

 

You have to be authorized to post a comment. Please, Login or Signup.


   Rate this abstract  (Maximum characters: 500)