Logo

American Heart Association

  19
  0


Final ID: Su2001

The Latest Large Language Model, Grok: Can It Provide Education about Atrial Fibrillation for Diverse Populations?

Abstract Body (Do not enter title and authors here): Background: Large language models (LLMs) are used by atrial fibrillation patients. ChatGPT (OpenAI, San Francisco) and Grok (X.ai, San Francisco) have 450 M, 35 M monthly users, respectively. Grok is the newest LLM, open-sourced, uses Mixture of Experts algorithms, has 314 billion parameters, known for STEM answers and the use of X (formerly Twitter) as a data source. Grok was meant to be conversational in tone. LLMS are trained by data sets initially trained by software engineers and later by AI in part or exclusively. It is not known whether Grok responses about atrial fibrillation queries differ by patient gender and race/ethnicity.

Methods: We used the query: “I am a 68-year-old [ethnic/racial group] [male/female] with atrial fibrillation. I had a heart attack 2 years ago with stents. What can I expect from my cardiologist?” Three ethnic groups (White, African American, and Latinx) and male/female gender. Response analysis: Word Count (WC) and Flesch-Kincaid Grade Level (FK). ChatGPT4.5 reviewed the LLM responses for cultural sensitivity.

Results: Average WC: ChatGPT= 312.5±110.5, Grok= 830.7±104.7. Average FK: ChatGPT=10.7±0.9, Grok=10.3±1.0. Grok showed high cultural sensitivity, for African American female and Latinx users, e.g. diet, cardiovascular risk factors. Both male and female prompts were treated equitably in tone, depth, and scope. However, Grok did not incorporate culturally relevant content for White male or female users. For the Hispanic prompt, Grok mentioned the existence of “language services” but no website links or related organizations for further help. CHA2DS2-VASc is mentioned by both ChatGPT and Grok. Grok has a lower reading grade level for White males, Black females, Hispanic males than that of ChatGPT which may reflect their use of X (formerly Twitter) data. Grok had the longest response for Black females versus all other ethnic groups in this small study.

Conclusion: Grok, the latest LLM, competes well with ChatGPT with its thoroughness and factual medical education answers. Reading level however varies by racial/ethnic group and gender.
  • Khan, Obaid  ( California Health Sciences University , Clovis , California , United States )
  • Toram, Riki  ( UCSF School of Medicine , San Jose , California , United States )
  • Wang, Margaret  ( Santa Clara University , Santa Clara , California , United States )
  • Wu, Gloria  ( UCSF School of Medicine , San Jose , California , United States )
  • Paliath-pathiyal, Hrishi  ( Nova Southeastern University , Fort Lauderdale , Florida , United States )
  • Wang, Paul  ( Stanford University School of Medicine , Palo Alto , California , United States )
  • Chim, Ivan  ( University of California, San Diego , San Diego , California , United States )
  • Hoang, Brian  ( Unviersity of California, Davis , Davis , California , United States )
  • Chung, Emily  ( Boston University , Boston , Massachusetts , United States )
  • Mendoza, Noemi  ( San Francisco State University , San Francisco , California , United States )
  • Toram, Viki  ( UCSF School of Medicine , San Jose , California , United States )
  • Author Disclosures:
    Obaid Khan: DO NOT have relevant financial relationships | Riki Toram: No Answer | Margaret Wang: No Answer | Gloria Wu: DO NOT have relevant financial relationships | Hrishi Paliath-Pathiyal: No Answer | Paul Wang: DO have relevant financial relationships ; Individual Stocks/Stock Options:Soneira:Active (exists now) ; Ownership Interest:EndoEpiAF:Active (exists now) ; Ownership Interest:HrtEx:Active (exists now) | Ivan Chim: No Answer | Brian Hoang: No Answer | Emily Chung: No Answer | Noemi Mendoza: No Answer | Viki Toram: No Answer
Meeting Info:

Scientific Sessions 2025

2025

New Orleans, Louisiana

Session Info:
More abstracts on this topic:
Barriers and Facilitators to Outpatient Cardiac Rehabilitation Attendance Among Patients with Low Socioeconomic Status: A Qualitative Study

Mathews Lena, Okonkwo Miriam Chiamaka, Tolefree Tionna, Stewart Kerry, Benz Scott Lisa, Cooper Lisa, Ndumele Chiadi, Matsushita Kunihiro, Riekert Kristin

A Fat Chance: Paradoxical Embolic Stroke from Lipomatous Hypertrophy of the Interatrial Septum

Kalathoor Abraham

More abstracts from these authors:

Large Language Models for Atrial Fibrillation Health Education for Asian Subgroups

Khan Obaid, Toram Riki, Wang Margaret, Wu Gloria, Paliath-pathiyal Hrishi, Wang Paul, Hoang Brian, Chim Ivan, Chung Emily, Toram Viki, Mendoza Noemi

Large Language Models for Patient Education for Atrial Fibrillation

Paliath-pathiyal Hrishi, Toram Riki, Wang Margaret, Wu Gloria, Khan Obaid, Wang Paul, Hoang Brian, Chim Ivan, Chung Emily, Mendoza Noemi, Toram Viki

You have to be authorized to contact abstract author. Please, Login
Not Available