Machine Learning-Based Prediction of Sudden Cardiac Death in the General Population Using EHR Data: An Innovative Approach to Prevention
Abstract Body: Background: The vast majority of sudden cardiac death (SCD) cases occur in the general population with few known risk factors instead of just patients already identified to be at high risk, making the prediction of SCD very difficult. Therefore, a screening tool should be developed to facilitate early identification. Methods: To estimate the risk of SCD, we trained and validated a machine learning model on electronic health records (EHR) data covering 17,172,359 drug prescriptions and 1,639,057 hospital diagnoses up to five years prior to SCD. The data were obtained from a cohort of 12,338 SCD cases in Greater Paris and 12,338 controls from 2011 to 2015. We then validated the results on two external cohorts: a temporal cohort in the same area from 2016 and 2020 with 11,620 SCD cases and 11,620 controls and a geographical cohort from the University of Washington (Seattle, USA) with 892 SCD cases and 892 controls from 2013 to 2021. Results: In the five years preceding the SCD, cardiovascular diagnoses were prevalent in only a few patients, scarce in many patients, and totally nonexistent for 25.7% of subjects. Our model achieved an AUC of 0.81 (95% CI: 0.80–0.82) in the validation cohort. The prediction model discriminated SCD from the general population, especially in the highest decile, where the model detected 26% and 33% of all SCD in the Paris and Seattle datasets, respectively. Our prediction model was specific to SCD and was not predictive of myocardial infarction. In addition to classical cardiovascular risk factors, various non-cardiovascular drugs and diagnoses contributed to the prediction model. Conclusion: We propose a prediction model designed to identify individuals at high risk of SCD among the general population. When combined with EHR, this artificial intelligence model could be utilized as an efficient screening tool.
Youssfi, Younes
(
Inserm u970
, Paris , France )
Chocron, Richard
(
Inserm u970
, Paris , France )
Laurenceau, Thomas
(
Inserm u970
, Paris , France )
Beganton, Frankie
(
Inserm u970
, Paris , France )
Empana, Jean-philippe
(
Inserm u970
, Paris , France )
Rea, Tom
(
University of Washington
, Seattle , Washington , United States )
Chopin, Nicolas
(
National School for Statistics and Economic Administration
, Paris , France )
Bougouin, Wulfran
(
Inserm u970
, Paris , France )