Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Deep Learning of Electrocardiograms Enables Scalable Human Disease Profiling

Rachael A. Venn, View ORCID ProfileXin Wang, Sam Freesun Friedman, Nate Diamant, Shaan Khurshid, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, View ORCID ProfilePatrick T. Ellinor, Jennifer E. Ho, Puneet Batra, View ORCID ProfileSteven A. Lubitz
doi: https://doi.org/10.1101/2022.12.21.22283757
Rachael A. Venn
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
3Demoulas Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, Massachusetts, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xin Wang
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Xin Wang
Sam Freesun Friedman
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nate Diamant
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
BS
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shaan Khurshid
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
3Demoulas Center for Cardiac Arrhythmias, Massachusetts General Hospital, Boston, Massachusetts, USA
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Paolo Di Achille
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lu-Chen Weng
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Seung Hoan Choi
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christopher Reeder
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
James P. Pirruccello
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
5Division of Cardiology, Massachusetts General Hospital, Boston, Massachusetts, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Pulkit Singh
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
BS
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Emily S. Lau
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
5Division of Cardiology, Massachusetts General Hospital, Boston, Massachusetts, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Anthony Philippakis
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
6Eric and Wendy Schmidt Center, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Christopher D. Anderson
7Department of Neurology, Brigham and Women’s Hospital, Boston, Massachusetts, USA
8Center for Genomic Medicine, Massachusetts General Hospital, Boston, Massachusetts, USA
9Henry and Allison McCance Center for Brain Health, Massachusetts General Hospital, Boston, Massachusetts, USA
MD, MMSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Patrick T. Ellinor
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
5Division of Cardiology, Massachusetts General Hospital, Boston, Massachusetts, USA
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Patrick T. Ellinor
Jennifer E. Ho
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
10CardioVascular Institute and Division of Cardiology, Department of Medicine, Beth Israel Deaconess Medical Center, Boston, Massachusetts, USA
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Puneet Batra
4Data Sciences Platform, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Steven A. Lubitz
1Cardiovascular Research Center, Massachusetts General Hospital, Boston, Massachusetts, USA
2Cardiovascular Disease Initiative, The Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
5Division of Cardiology, Massachusetts General Hospital, Boston, Massachusetts, USA
MD, MPH
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Steven A. Lubitz
  • For correspondence: slubitz{at}mgh.harvard.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

The electrocardiogram (ECG) is an inexpensive and widely available diagnostic tool, and therefore has great potential to facilitate disease detection in large-scale populations. Both cardiac and noncardiac diseases may alter the appearance of the ECG, though the extent to which diseases across the human phenotypic landscape can be detected on the ECG remains unclear. We developed a deep learning variational autoencoder model that encodes and reconstructs ECG waveform data within a multidimensional latent space. We then systematically evaluated whether associations between ECG encodings and a broad range of disease phenotypes could be detected using the latent space model by deriving disease vectors and projecting individual ECG encodings onto the vectors. We developed models for both 12- and single-lead ECGs, akin to those used in wearable ECG technology. We leveraged phecodes to generate disease labels using International Classification of Disease (ICD) codes for about 1,600 phenotypes in three different datasets linked to electronic health record data. We tested associations between ECG encodings and disease phenotypes using a phenome-wide association study approach in each dataset, and meta-analyzed the results. We observed that the latent space ECG model identified associations for 645 (40%) diseases tested in the 12-lead model. Associations were enriched for diseases of the circulatory (n=140, 82% of category-specific diseases), respiratory (n=53, 62%), and endocrine/metabolic (n=73, 45%) systems, with additional associations evident across the human phenome; results were similar for the single-lead models. The top ECG latent space association was with hypertension in the 12-lead ECG model, and cardiomyopathy in the single-lead ECG model (p<2.2×10-308 for each). The ECG latent space model demonstrated a greater number of associations than ECG models using standard ECG intervals alone, and generally resulted in improvements in discrimination of diseases compared to models comprising only age, sex, and race. We further demonstrate how a latent space model can be used to generate disease-specific ECG waveforms and facilitate disease profiling for individual patients.

Competing Interest Statement

Dr. Lubitz has received sponsored research support from Bristol Myers Squibb, Pfizer, Boehringer Ingelheim, Fitbit, Medtronic, Premier, and IBM, and has consulted for Bristol Myers Squibb, Pfizer, Blackstone Life Sciences, and Invitae. Dr. Anderson receives sponsored research support from Bayer AG and Massachusetts General Hospital and has consulted for ApoPharma. Dr. Weng receives sponsored research support from IBM to the Broad Institute. Dr. Ellinor has received sponsored research support from Bayer AG and IBM Health, and he has consulted for Bayer AG, Novartis and MyoKardia. Dr. Batra, Dr. Reeder and Dr. Friedman have received sponsored research support from Bayer AG and IBM Health.

Funding Statement

Dr. Lubitz is a full-time employee of Novartis Institutes for Biomedical Research as of July 18, 2022. Dr. Lubitz previously received support from NIH grants R01HL139731 and R01HL157635, and American Heart Association 18SFRN34250007. Dr. Anderson is supported by NIH grants R01NS103924 and U01NS069763 and American Heart Association grants 18SFRN34250007 and 21SFRN812095. Dr. Weng is supported by National Institutes of Health (NIH) grant 1R01HL139731. Dr. Choi is supported by the NHLBI BioData Catalyst Fellows program. Dr. Ellinor is supported by the NIH (1R01HL092577, K24HL105780), AHA (18SFRN34110082) and by MAESTRIA (965286). Dr. Lau is supported by the American Heart Association (853922).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Use of Mass General Brigham (MGB) and UK Biobank (application 7089) data were approved by the MGB Institutional Review Board.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The Mass General Brigham source data are not publicly available because they are electronic health records. Making the data publicly available without additional consent or ethical approval could compromise privacy. Source data from the UK Biobank are available to qualified investigators via application at https://www.ukbiobank.ac.uk.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted December 22, 2022.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Deep Learning of Electrocardiograms Enables Scalable Human Disease Profiling
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Deep Learning of Electrocardiograms Enables Scalable Human Disease Profiling
Rachael A. Venn, Xin Wang, Sam Freesun Friedman, Nate Diamant, Shaan Khurshid, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Patrick T. Ellinor, Jennifer E. Ho, Puneet Batra, Steven A. Lubitz
medRxiv 2022.12.21.22283757; doi: https://doi.org/10.1101/2022.12.21.22283757
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Deep Learning of Electrocardiograms Enables Scalable Human Disease Profiling
Rachael A. Venn, Xin Wang, Sam Freesun Friedman, Nate Diamant, Shaan Khurshid, Paolo Di Achille, Lu-Chen Weng, Seung Hoan Choi, Christopher Reeder, James P. Pirruccello, Pulkit Singh, Emily S. Lau, Anthony Philippakis, Christopher D. Anderson, Patrick T. Ellinor, Jennifer E. Ho, Puneet Batra, Steven A. Lubitz
medRxiv 2022.12.21.22283757; doi: https://doi.org/10.1101/2022.12.21.22283757

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)