Boosting Prediction: Machine Learning Models Outperform Traditional Risk Scores for Primary Cardiovascular Prevention AHA Conference Repository

American Heart Association

Final ID: MP1923

Boosting Prediction: Machine Learning Models Outperform Traditional Risk Scores for Primary Cardiovascular Prevention

Abstract Body (Do not enter title and authors here): Background:
Standard CVD risk calculators assume linear relationships among risk factors. ML methods (gradient boosting, random forests, neural networks, support vector machines) capture nonlinear interactions. We compared ML models with established scores in adults without prior CVD.
Research Question:
We hypothesized that ML algorithms would show superior discrimination and calibration compared with conventional scores in primary-prevention cohorts.
Methods:
We searched MEDLINE, EMBASE, CENTRAL, Web of Science, and IEEE Xplore (January 2000–December 2024) for studies comparing ML algorithms to traditional risk calculators in adults (≥18 years) without CVD. Forty-two studies (total n ≈ 3.3 million) met inclusion criteria. Extracted metrics included AUC, Brier score or calibration slope, sensitivity, specificity, and net reclassification improvement (NRI). Random-effects meta-analysis pooled differences across these outcomes. Heterogeneity was assessed via I^2; publication bias was evaluated using funnel plots and Egger’s test. PROBAST assessed risk of bias; GRADE evaluated evidence certainty.
Results:
ML models outperformed traditional scores in discrimination (AUC_ML 0.83 vs AUC_Trad 0.76; ΔAUC +0.07; 95% CI 0.06–0.08) and calibration (median Brier 0.08 vs 0.12; Δ –0.04; 95% CI –0.05 to –0.03). Sensitivity increased from 0.69 to 0.78 (Δ +0.09; 95% CI 0.07–0.11) and specificity from 0.82 to 0.85 (Δ +0.03; 95% CI 0.02–0.04). Pooled NRI was +0.38 (95% CI 0.32–0.44). Gradient boosting showed the largest ΔAUC (+0.09) and highest NRI (+0.42). Asian cohorts had the greatest discrimination gain (ΔAUC +0.12) and a 31% higher detection of high-risk patients. Sensitivity analyses restricted to low-bias studies, large cohorts (> 50 000), and real-world electronic health record data confirmed ML’s advantage (ΔAUC +0.04–0.06). Overall heterogeneity was moderate to high.
Conclusions:
In adults without prior CVD, ML-based models offer modest but statistically and clinically significant improvements in discrimination, calibration, sensitivity, specificity, and reclassification compared with traditional scores. Gradient boosting and neural networks provided the largest benefits, especially in Asian and high-risk groups. High heterogeneity and study limitations underscore the need for externally validated ML models with robust calibration and evaluation in real-world clinical settings.

Prajapati, Hem ( medical college baroda , Vadodara , India )
Patel, Rahul ( UNC health , Moraganton , North Carolina , United States )
Rathwa, Jigar ( medical college baroda , Vadodara , India )

Author Disclosures:

Hem prajapati:

DO NOT have relevant financial relationships

Rahul Patel:

No Answer

Jigar Rathwa:

No Answer

Meeting Info:

Scientific Sessions 2025

2025

New Orleans, Louisiana

Session Info:

Deep Learning and Enhancing Risk Evaluation

Monday, 11/10/2025 , 01:45PM - 02:35PM

Moderated Digital Poster Session

More abstracts from these authors:

Comprehensive Geriatric Assessment Reduces Cardiovascular Mortality and Heart Failure Hospitalizations in Older Adults: A Systematic Review and Meta-analysis

Prajapati Hem, Chauhan Yesha, Patel Rahul, Wasir Amanpreet, Kakadiya Jay

Diagnostic Accuracy of the Systemic Immune-Inflammation Index for Predicting Contrast-Induced Nephropathy in Patients With Myocardial Infarction Undergoing Percutaneous Coronary Intervention: A Meta-Analysis

Odat Ramez, Aldamen Ali, Al Zoubi Bashar M., Khalefa Basma, Patel Rahul, Altarawneh Tala, Soni Kriti, Agrawal Siddharth, Jain Hritvik