ABSTRACT
Importance National guidelines for primary prevention suggest consideration of lifetime risk for cardiovascular (CV) disease in addition to 10-year risk, however, it is unclear if the predictors of 10-year vs lifetime (10-26 years) CV events are similar.
Objective To use a combination of machine learning methods with deep phenotyping to differentiate 10-year versus lifetime predictors of CV outcomes.
Design, Setting, and Participants This retrospective analysis used the prospectively collected data from the CARDIA (Coronary Artery Risk Development in Young Adults) study, a cohort of White and Black participants recruited from four clinical centers in the US. The analysis included 4314 participants, aged 23-35 years who were then followed up over 25 years through August 2018.
Main Outcomes and Measures 449 variables collected in 1990-91 from imaging and noninvasive tests, questionnaires, and biomarker panels were included. We used machine learning techniques to identify the top-20 predictors of both 10-year and lifetime (10-26 years) CV events (coronary heart disease, myocardial infarction, acute coronary syndrome, stroke, transient ischemic attack, heart failure, peripheral arterial disease, and CV death).
Results Kidney disease, family history of CV disease, and echocardiographic parameters of left ventricular systolic and diastolic dysfunction, and hypertrophy were important markers of 10-year CV events. Traditional risk factors and indices of body size featured heavily as top predictors of lifetime CV risk. Among the different machine learning techniques, Random Survival Forest and Nnet-survival performed the best (C-index of 0.80 for 10-year and 0.72 for lifetime). These models outperformed Cox models including traditional CV risk factors.
Conclusions and Relevance Family history of CVD, kidney disease, and subclinical phenotyping of CVD using echocardiography are important for 10-year risk estimation. However, traditional CV risk factors alone may be adequate in estimating lifetime CV risk.
Question Do machine learning (ML) and deep learning (DL)-based survival analysis models help differentiate 10-year versus lifetime predictors of cardiovascular (CV) outcomes in young adults?
Findings In this retrospective analysis of 4314 participants in the CARDIA study, ML and DL survival analysis improved CVD risk prediction over traditional Cox models and revealed the top 20 predictors among 449 variables. Top 10-year risk predictors include kidney disease, family history of CV disease, and echocardiographic parameters, where as traditional risk factors and indices of body size featured heavily as top predictors of lifetime CV risk.
Meaning Family history, kidney disease, and subclinical phenotyping of CVD using echocardiography play a prominent role for 10-year risk estimation, while traditional CV risk factors alone may be adequate in estimating lifetime CV risk in young adults.
Competing Interest Statement
The authors have declared no competing interest.
Clinical Protocols
https://www.cardia.dopm.uab.edu
Funding Statement
The Coronary Artery Risk Development in Young Adults Study (CARDIA) is conducted and supported by the National Heart, Lung, and Blood Institute (NHLBI) in collaboration with the University of Alabama at Birmingham (HHSN268201800005I & HHSN268201800007I), Northwestern University (HHSN268201800003I), University of Minnesota (HHSN268201800006I), and Kaiser Foundation Research Institute (HHSN268201800004I). This manuscript has been reviewed by CARDIA for scientific content.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
Institutional Review Board from each CARDIA center and the coordinating center (University of Minnesota, University of Alabama at Birmingham, Northwestern University, and Kaiser-Permanente Foundation Research Institute) gave ethical approval for this work. All participants provided written informed consent.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
bambale1{at}jhmi.edu, hnguye78{at}jhmi.edu, reisip{at}mail.nih.gov, wuc{at}nhlbi.nih.gov, j.jeffrey.carr{at}vanderbilt.edu, chikenwabuo{at}gmail.com, samuel.gidding{at}gmail.com, eguallar{at}jhu.edu, jlima1{at}jhmi.edu
Data Availability
The CARDIA data can be requested via the study website https://www.cardia.dopm.uab.edu/. Data access is available through the CARDIA Coordinating Center following approval by the CARDIA Publications and Presentations Committee. Please see the study website for further details: http://www.cardia.dopm.uab.edu/invitation-to-new-investigators.