
Machine learning models for blood pressure phenotypes combining multiple polygenic risk scores

Yana Hrytsenko, Benjamin Shea, Michael Elgart, Nuzulul Kurniansyah, Genevieve Lyons, Alanna C. Morrison, April P. Carson, Bernhard Haring, Braxton D. Mitchel, Bruce M. Psaty, Byron C. Jaeger, C Charles Gu, Charles Kooperberg, Daniel Levy, Donald Lloyd-Jones, Eunhee Choi, Jennifer A Brody, Jennifer A Smith, Jerome I. Rotter, Matthew Moll, Myriam Fornage, Noah Simon, Peter Castaldi, Ramon Casanova, Ren-Hua Chung, Robert Kaplan, Ruth J.F. Loos, Sharon L. R. Kardia, Stephen S. Rich, Susan Redline, Tanika Kelly, Timothy O’Connor, Wei Zhao, Wonji Kim, Xiuqing Guo, Yii Der Ida Chen, the Trans-Omics in Precision Medicine Consortium, Tamar Sofer
doi: https://doi.org/10.1101/2023.12.13.23299909
Yana Hrytsenko
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
2Department of Medicine, Harvard Medical School, Boston, MA
3CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
Benjamin Shea
3CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
Michael Elgart
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
2Department of Medicine, Harvard Medical School, Boston, MA
Nuzulul Kurniansyah
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
Genevieve Lyons
4Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA
Alanna C. Morrison
5Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
April P. Carson
6Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA
Bernhard Haring
7Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
8Department of Medicine III, Saarland University, Homburg, Saarland, Germany
Braxton D. Mitchel
9Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Bruce M. Psaty
10Department of Medicine, University of Washington, Seattle, WA, USA
11Department of Epidemiology, University of Washington, Seattle, WA, USA
12Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
13Health Systems and Population Health, University of Washington, Seattle, WA, USA
Byron C. Jaeger
14Department of Biostatistics and Data Science, Wake Forest University School of Medicine, Winston-Salem, NC, USA
C Charles Gu
15The Center for Biostatistics and Data Science, Washington University, St. Louis, USA
Charles Kooperberg
16Division of Public Health Sciences, Fred Hutchinson Cancer Center, Seattle, WA, USA
Daniel Levy
17The Population Sciences Branch of the National Heart, Lung and Blood Institute, Bethesda, MD, USA
18The Framingham Heart Study, Framingham, MA, USA
Donald Lloyd-Jones
19Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
Eunhee Choi
20Columbia Hypertension Laboratory, Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
Jennifer A Brody
10Department of Medicine, University of Washington, Seattle, WA, USA
12Cardiovascular Health Research Unit, University of Washington, Seattle, WA, USA
Jennifer A Smith
21Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
22Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Jerome I. Rotter
23The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Matthew Moll
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
2Department of Medicine, Harvard Medical School, Boston, MA
24VA Boston Healthcare System, West Roxbury, MA, USA
Myriam Fornage
5Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
25Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
Noah Simon
26Department of Biostatistics, School of Public Health, University of Washington, Seattle, WA
Peter Castaldi
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
2Department of Medicine, Harvard Medical School, Boston, MA
Ramon Casanova
13Health Systems and Population Health, University of Washington, Seattle, WA, USA
Ren-Hua Chung
27Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Taipei City, Taiwan
Robert Kaplan
28Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
29Department of Epidemiology & Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
Ruth J.F. Loos
30The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
31Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty for Health and Medical Sciences, University of Copenhagen, Denmark, DK
Sharon L. R. Kardia
21Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
Stephen S. Rich
32Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
Susan Redline
2Department of Medicine, Harvard Medical School, Boston, MA
33Division of Sleep and Circadian Disorders, Brigham and Women’s Hospital, Boston, MA, USA
Tanika Kelly
34Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
Timothy O’Connor
8Department of Medicine III, Saarland University, Homburg, Saarland, Germany
Wei Zhao
21Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
22Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, USA
Wonji Kim
35Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital
Xiuqing Guo
23The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Yii Der Ida Chen
23The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Tamar Sofer
1Department of Medicine, Brigham and Women’s Hospital, Boston, MA
2Department of Medicine, Harvard Medical School, Boston, MA
3CardioVascular Institute (CVI), Beth Israel Deaconess Medical Center, Boston, MA
4Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA
  • For correspondence: tsofer{at}bidmc.harvard.edu

Abstract

We constructed non-linear machine learning (ML) prediction models for systolic and diastolic blood pressure (SBP, DBP) using demographic and clinical variables and polygenic risk scores (PRSs). We developed a two-model ensemble consisting of a baseline model, in which prediction is based on demographic and clinical variables only, and a genetic model, in which we also include PRSs. We evaluated a linear versus a non-linear model at both the baseline and the genetic model levels and assessed the improvement in performance when incorporating multiple PRSs. We report the ensemble model’s performance as percentage variance explained (PVE) on a held-out test dataset. A non-linear baseline model improved the PVE from 28.1% to 30.1% (SBP) and from 14.3% to 17.4% (DBP) compared with a linear baseline model. Including in the genetic model seven PRSs computed from the largest available GWAS of SBP/DBP improved the genetic model PVE from 4.8% to 5.1% (SBP) and from 4.7% to 5% (DBP) compared with using a single PRS. Adding 14 additional PRSs computed from two independent GWASs further increased the genetic model PVE to 6.3% (SBP) and 5.7% (DBP). PVE differed across self-reported race/ethnicity groups, with primarily the non-White groups benefitting from the inclusion of additional PRSs.
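
The two-model ensemble and the PVE metric described in the abstract can be illustrated with a minimal sketch. Several things here are assumptions rather than details taken from the study: the genetic model is fit to the residuals of the baseline model, PVE is computed as 100 × (1 − SS_res / SS_tot) on held-out rows, both models are single-predictor linear fits (the study compares linear and non-linear models), and the data, coefficients, and helper names (`pve`, `fit_line`, `predict`, `ensemble_predict`) are hypothetical.

```python
import random
from statistics import fmean


def pve(y_true, y_pred):
    """Percentage variance explained, taken here as 100 * (1 - SS_res / SS_tot)."""
    mean_y = fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 100.0 * (1.0 - ss_res / ss_tot)


def fit_line(x, y):
    """Single-predictor ordinary least squares; returns (intercept, slope)."""
    mean_x, mean_y = fmean(x), fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def predict(model, x):
    """Apply an (intercept, slope) model to a list of predictor values."""
    intercept, slope = model
    return [intercept + slope * xi for xi in x]


# Synthetic data (hypothetical): SBP depends on age and on one PRS, plus noise.
rng = random.Random(0)
n = 400
age = [rng.uniform(30, 80) for _ in range(n)]
prs = [rng.gauss(0, 1) for _ in range(n)]
sbp = [100 + 0.5 * a + 3.0 * g + rng.gauss(0, 10) for a, g in zip(age, prs)]

split = 300  # first 300 rows for training, the remaining rows are held out
tr, te = slice(0, split), slice(split, n)

# Baseline model: demographic/clinical predictor only.
baseline = fit_line(age[tr], sbp[tr])
resid_tr = [y - p for y, p in zip(sbp[tr], predict(baseline, age[tr]))]

# Genetic model: fit to what the baseline leaves unexplained (assumed design).
genetic = fit_line(prs[tr], resid_tr)


def ensemble_predict(age_x, prs_x):
    """Ensemble prediction: baseline prediction plus genetic-model prediction."""
    base = predict(baseline, age_x)
    gen = predict(genetic, prs_x)
    return [b + g for b, g in zip(base, gen)]


pve_baseline = pve(sbp[te], predict(baseline, age[te]))
pve_ensemble = pve(sbp[te], ensemble_predict(age[te], prs[te]))
print(f"held-out PVE, baseline: {pve_baseline:.1f}%")
print(f"held-out PVE, ensemble: {pve_ensemble:.1f}%")
```

On the training rows, the residual fit can never lower the PVE below that of the baseline alone, because least squares with an intercept includes the zero prediction as a special case. On held-out rows there is no such guarantee, which is why performance is reported on a test set.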

Competing Interest Statement

B Psaty serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. G Lyons is currently an employee of Alexion Pharmaceuticals; however, her contributions to the present manuscript were performed as part of her previous affiliation at the Harvard T.H. Chan School of Public Health, and this work is not related to her current occupation and affiliation. M Moll has received grant funding from Bayer and consulting fees from TriNetX, 2ndMD, TheaHealth, Sitka, Verona Pharma, and Axon Advisors.

Funding Statement

This study was supported by National Heart, Lung, and Blood Institute (NHLBI) grant R01HL161012 to TS. MM was supported by NHLBI grant K08HL159318.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This work was approved by the Mass General Brigham Institutional Review Board and by the Beth Israel Deaconess Medical Center Committee on Clinical Investigations.

All study protocols were approved by the institutional review board at the University of Maryland Baltimore. Informed consent was obtained from each study participant.

The ARIC study has been approved by a single Institutional Review Board (sIRB) at Johns Hopkins School of Medicine and Institutional Review Boards (IRB) at all participating institutions: University of North Carolina at Chapel Hill IRB, Johns Hopkins University School of Public Health IRB, University of Minnesota IRB, Wake Forest University Health Sciences IRB, and University of Mississippi Medical Center IRB. Study participants provided written informed consent at all study visits.

The BioMe cohort was approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai. All BioMe participants provided written, informed consent for genomic data sharing.

All CARDIA participants provided informed consent, and the study was approved by the Institutional Review Boards of the University of Alabama at Birmingham and the University of Texas Health Science Center at Houston.

The Cleveland Family Study was approved by the Institutional Review Board (IRB) of Case Western Reserve University and Mass General Brigham (formerly Partners HealthCare). Written informed consent was obtained from all participants.

All CHS participants provided informed consent, and the study was approved by the Institutional Review Board [or ethics review committee] of the University of Washington.

All COPDGene participants provided written informed consent, and the study was approved by the Institutional Review Boards of the participating clinical centers.

The Framingham Heart Study was approved by the Institutional Review Board of the Boston University Medical Center. All study participants provided written informed consent.

Written informed consent was obtained from all subjects and approval was granted by participating institutional review boards (University of Michigan, University of Mississippi Medical Center, and Mayo Clinic).

All subjects provided informed consent and the GenSalt study was approved by the Institutional Review Board (IRB) of all participating institutes in the US and China.

This study was approved by the institutional review boards (IRBs) at each field center, where all participants gave written informed consent, and by the Non-Biomedical IRB at the University of North Carolina at Chapel Hill, to the HCHS/SOL Data Coordinating Center. All IRBs approving the study are: Non-Biomedical IRB at the University of North Carolina at Chapel Hill, Chapel Hill, NC; Einstein IRB at the Albert Einstein College of Medicine of Yeshiva University, Bronx, NY; IRB at Office for the Protection of Research Subjects (OPRS), University of Illinois at Chicago, Chicago, IL; Human Subject Research Office, University of Miami, Miami, FL; Institutional Review Board of San Diego State University, San Diego, CA.

The Institutional Review Boards at Jackson State University, Tougaloo College, and the University of Mississippi Medical Center approved the study, and all participants provided written informed consent.

All MESA participants provided written informed consent, and the study was approved by the Institutional Review Boards at The Lundquist Institute (formerly Los Angeles BioMedical Research Institute) at Harbor-UCLA Medical Center, University of Washington, Wake Forest School of Medicine, Northwestern University, University of Minnesota, Columbia University, and Johns Hopkins University.

All THRV participants provided informed consent, and the study was approved by the Institutional Review Board at The Lundquist Institute (formerly Los Angeles BioMedical Research Institute, or LA BioMed) at Harbor-UCLA Medical Center, and at Washington University in St. Louis.

All WHI participants provided informed consent and the study was approved by the Institutional Review Board (IRB) of the Fred Hutchinson Cancer Research Center.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data availability

TOPMed freeze 8 WGS data and harmonized BP phenotypes are available by application to dbGaP according to the study-specific accessions: Amish: “phs000956”, ARIC: “phs001211”, BioMe: “phs001644”, CARDIA: “phs001612”, CFS: “phs000954”, CHS: “phs001368”, COPDGene: “phs000951”, FHS: “phs000974”, GENOA: “phs001345”, GenSalt: “phs001217”, HCHS/SOL: “phs001395”, JHS: “phs000964”, MESA: “phs001211”, THRV: “phs001387”, WHI: “phs001237”. Summary statistics from the MVP BP GWAS are available from dbGaP by application to study accession “phs001672”. The summary statistics from the UKBB + ICBP BP GWAS are available at https://grasp.nhlbi.nih.gov/FullResults.aspx. MGB Biobank genotyping and phenotypic data are available to Mass General Brigham investigators with required approval from the Mass General Brigham Institutional Review Board (IRB). Data needed to construct the selected BP PRSs generated in this study will become publicly available in a Zenodo repository upon paper acceptance and will include variants, alleles, and weights for each PRS based on GWAS of SBP and DBP. The BED files that define the LD regions used for construction of local PRSs are available in the Bitbucket repository at https://bitbucket.org/nygcresearch/ldetect-xsdata/src/master/.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-ND 4.0 International license.
Posted December 14, 2023.

Subject Area

  • Epidemiology