Logo

American Heart Association

  1
  0


Final ID: 66

Diagnostic Accuracy of ChatGPT4.o for TIA or Stroke Using Patient Symptoms and Demographics

Abstract Body: Introduction: Many patients may not recognize the initial symptoms of TIA or stroke and delay seeking urgent medical care, leading to missed treatments and worse outcomes. Diagnostic decision support (DDSS) systems may help these patients recognize and act on early symptoms of stroke or TIA. Large language models (LLMs) are available and utilized by the public. We evaluated the efficacy of GPT 4.o, a recent LLM, for the prediction of stroke or TIA from data collected at presentation among patients admitted to an emergency department TIA/Stroke observational unit (ED-OU).

Methods: 1466 patients admitted to the ED-OU for suspected TIA in a large, urban, academic ED, from 3/2013 - 2/2020 were included. A thorough history and physical were obtained, including presenting symptoms, symptom time course, vital signs, and past medical history. The outcome was a discharge diagnosis of stroke, TIA, or an alternate diagnosis, from a consulting neurologist and confirmed with neuroimaging. For a random sample of 500 records demographics and symptom data were entered into GPT 4.o via OpenAI’s Application Programming Interface (API). We used a validated prompt that requests the 5 most likely diagnoses. The sensitivity of the GPT 4.o diagnoses for the neurologist’s diagnosis was calculated as a match in the top 1 (M1), top 3 (M3) or top 5 (M5). Sensitivity to a combined diagnosis of TIA or stroke was also calculated. Results were compared to (1) a random sample of 100 patients from the same dataset manually entered into the ChatGPT4.o interface by a research assistant, and (2) results from evaluation of ChatGPT 4.0 with data from a study of patients requesting urgent primary care, who entered their clinical data into a DDSS app.

Results: The 500 cases included 257 with TIA, 73 with stroke, and 170 with other diagnoses. Table 1 shows results of diagnostic matches. Diagnostic lists of 18.4% of cases had no match between GPT4.o and the neurologist’s diagnosis. 5.4% of cases had no clear neurologist diagnosis. GPT4 sensitivity just for combined diagnosis of TIA or Stroke was 98.8%.

Conclusions: DDSS like ChatGPT/GPT4.o have potential to aid patients’ prompt recognition of TIA or stroke symptoms which could shorten time to care. To better define usability, accuracy and safety of DDSS, we are studying direct data collection from patients in the ED or urgent primary care, including stroke patients or their companions, and evaluating other DDSS tools including symptom checkers.
  • Khatri, Ishaani  ( Alpert Medical School of BrownU , Providence , Rhode Island , United States )
  • Zahiri, Anita  ( BROWN UNIVERSITY , Providence , Rhode Island , United States )
  • Abdullahi, Tassallah  ( BROWN UNIVERSITY , Providence , Rhode Island , United States )
  • Bacher, Ian  ( BROWN UNIVERSITY , Providence , Rhode Island , United States )
  • Raman, Sasha  ( Alpert Medical School of BrownU , Providence , Rhode Island , United States )
  • Fraser, Hamish  ( BROWN UNIVERSITY , Providence , Rhode Island , United States )
  • Madsen, Tracy  ( Alpert Medical School of BrownU , Providence , Rhode Island , United States )
  • Author Disclosures:
    Ishaani Khatri: DO NOT have relevant financial relationships | Anita Zahiri: DO NOT have relevant financial relationships | Tassallah Abdullahi: DO NOT have relevant financial relationships | Ian Bacher: DO NOT have relevant financial relationships | Sasha Raman: DO NOT have relevant financial relationships | Hamish Fraser: DO NOT have relevant financial relationships | Tracy Madsen: DO have relevant financial relationships ; Research Funding (PI or named investigator):American Heart Association:Active (exists now) ; Research Funding (PI or named investigator):NIH:Active (exists now)
Meeting Info:
Session Info:

Health Services, Quality Improvement, and Patient-Centered Outcomes Oral Abstracts II

Wednesday, 02/05/2025 , 04:45PM - 05:45PM

Oral Abstract Session

More abstracts on this topic:
A Deep Learning Digital Biomarker for Mitral Valve Prolapse using Echocardiogram Videos

Al-alusi Mostafa, Khurshid Shaan, Sanborn Danita, Picard Michael, Ho Jennifer, Maddah Mahnaz, Ellinor Patrick, Lau Emily, Small Aeron, Reeder Christopher, Shnitzer Dery Tal, Andrews Carl, Kany Shinwan, Ramo Joel, Haimovich Julian

10-Year Trend Analysis of Medicare Payment in Stroke Inpatient Hospital Admission

Wong Ka-ho, Krothapalli Neeharika, Littig Lauren, Champagne Alison, Majersik Jennifer, Reddy Vivek, De Havenon Adam

More abstracts from these authors:
RESCUE: Proof of Concept Trial With RNS60 Shows Safety, Reduced Infarct Growth, and Numerical Improvement in all Prespecified Efficacy Endpoints in Subjects With Ischemic Stroke Receiving Mechanical Thrombectomy

Ghosh Supurna, Jayaraman Mahesh, Moldovan Krisztina, Torabi Radmehr, Ansari Sameer, Favilla Christopher, Chiu David, Clark Wayne, Liebeskind David, Mctaggart Ryan, Dubow Jordan, Sutherland Jocelyn, Kalmes Andreas, Mock Jarrad, Baird Grayson, Smith Wendy, Cook Douglas, Madsen Tracy

Sex-specific Trends in the Prevalence of Stroke Risk Factors and Medication Use in the GCNKSS

Madsen Tracy, Kleindorfer Dawn, Ding Lili, Robinson David, Stanton Robert, Khoury Jane, Rademacher Eric, Anderson Aaron, Broderick Joseph, Kissela Brett

You have to be authorized to contact abstract author. Please, Login
Not Available

Readers' Comments

We encourage you to enter the discussion by posting your comments and questions below.

Presenters will be notified of your post so that they can respond as appropriate.

This discussion platform is provided to foster engagement, and simulate conversation and knowledge sharing.

 

You have to be authorized to post a comment. Please, Login or Signup.


   Rate this abstract  (Maximum characters: 500)