Real-time dynamic polygenic prediction for streaming data

Justin D. Tubbs, Yu Chen, Rui Duan, Hailiang Huang, Tian Ge
doi: https://doi.org/10.1101/2024.07.12.24310357
Justin D. Tubbs
1Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
2Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA
Yu Chen
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA
4Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA
5Department of Medicine, Massachusetts General Hospital, Boston, MA
Rui Duan
6Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA
Hailiang Huang
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA
4Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA
5Department of Medicine, Massachusetts General Hospital, Boston, MA
Tian Ge
1Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA
2Center for Precision Psychiatry, Department of Psychiatry, Massachusetts General Hospital, Boston, MA
3Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA
  • For correspondence: tge1{at}mgh.harvard.edu

Abstract

Polygenic risk scores (PRSs) are promising tools for advancing precision medicine. However, existing PRS construction methods rely on static summary statistics derived from genome-wide association studies (GWASs), which are often updated at lengthy intervals. As genetic data and health outcomes are continuously being generated at an ever-increasing pace, the current PRS training and deployment paradigm is suboptimal for maximizing the prediction accuracy of PRSs for incoming patients in healthcare settings. Here, we introduce real-time PRS-CS (rtPRS-CS), which enables online, dynamic refinement and calibration of PRSs as each new sample is collected, without the need to perform intermediate GWASs. Through extensive simulation studies, we evaluate the performance of rtPRS-CS across various genetic architectures and training sample sizes. Leveraging quantitative traits from the Mass General Brigham Biobank and UK Biobank, we show that rtPRS-CS can integrate massive streaming data to enhance PRS prediction over time. We further apply rtPRS-CS to 22 schizophrenia cohorts in 7 Asian regions, demonstrating the clinical utility of rtPRS-CS in dynamically predicting and stratifying disease risk across diverse genetic ancestries.
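
To make the idea of per-sample refinement concrete, the following is a minimal sketch of a generic online linear predictor that scores each incoming sample first and only then updates its variant weights. It is not the rtPRS-CS algorithm, whose details are not given on this page: the class name `OnlinePolygenicScore`, the stochastic-gradient update with a small shrinkage term, the learning rate, and the simulated standardized genotypes are all assumptions chosen for illustration.

```python
import random


class OnlinePolygenicScore:
    """Generic online linear predictor (hypothetical; NOT rtPRS-CS itself)."""

    def __init__(self, n_variants, learning_rate=0.01, shrinkage=0.001):
        self.weights = [0.0] * n_variants
        self.learning_rate = learning_rate  # assumed step size
        self.shrinkage = shrinkage          # assumed ridge-style penalty

    def predict(self, genotypes):
        # Score = weighted sum of (standardized) genotypes.
        return sum(w * g for w, g in zip(self.weights, genotypes))

    def update(self, genotypes, phenotype):
        # Predict with the current weights, then take one gradient step.
        residual = phenotype - self.predict(genotypes)
        lr = self.learning_rate
        self.weights = [
            w + lr * (residual * g - self.shrinkage * w)
            for w, g in zip(self.weights, genotypes)
        ]
        return residual  # prediction error made before the update


def simulate_stream(n_samples=3000, seed=0):
    # Simulated stream: 5 variants with made-up effect sizes.
    rng = random.Random(seed)
    true_effects = [1.0, -0.8, 0.5, 0.0, 0.3]
    model = OnlinePolygenicScore(len(true_effects))
    squared_errors = []
    for _ in range(n_samples):
        genotypes = [rng.gauss(0.0, 1.0) for _ in true_effects]
        signal = sum(b * g for b, g in zip(true_effects, genotypes))
        phenotype = signal + rng.gauss(0.0, 0.1)
        residual = model.update(genotypes, phenotype)
        squared_errors.append(residual ** 2)
    early_mse = sum(squared_errors[:100]) / 100
    late_mse = sum(squared_errors[-100:]) / 100
    return model.weights, true_effects, early_mse, late_mse


if __name__ == "__main__":
    weights, true_effects, early_mse, late_mse = simulate_stream()
    print("early MSE:", early_mse)
    print("late MSE:", late_mse)
```

In this toy setting, the prediction error on the most recent samples falls well below the error on the earliest ones, and no batch re-estimation step is ever run. That mirrors only the general motivation in the abstract, not the specific model or results of the paper.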

Competing Interest Statement

H.H. received consultancy fees from Ono Pharmaceutical and an honorarium from Xian Janssen Pharmaceutical. The other authors declare no competing interests.

Funding Statement

J.D.T. is supported by the Mass General Brigham Training Program in Precision and Genomic Medicine (T32HG010464). R.D. is supported by National Institute of General Medical Sciences (NIGMS) R01GM148494. H.H. acknowledges support from National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) K01DK114379 and R01DK129364, National Institute of Mental Health (NIMH) U01MH109539 and R01MH130675, Brain and Behavior Research Foundation Young Investigator Grant (28450), the Zhengxu and Ying He Foundation, and the Stanley Center for Psychiatric Research. T.G. is supported by National Human Genome Research Institute (NHGRI) R01HG012354, U01HG011723, and NIMH R01MH130899.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The use of Mass General Brigham Biobank (MGBB) data was approved by the Mass General Brigham Institutional Review Board. Collection of the UK Biobank (UKBB) data was approved by the Research Ethics Committee of the UKBB. UKBB data used in the present work were obtained under application 32568. The use of schizophrenia cohorts of East Asian ancestry in the present work was approved by the Stanley Global Asia Initiatives. The following institutions provided ethics oversight for the collection of schizophrenia samples: Samsung Medical Center; Bio-X Institutes of Shanghai Jiao Tong University; Xi'an Jiaotong University; The Second Xiangya Hospital of Central South University; Peking University Sixth Hospital; Fujita Health University; Tokyo Metropolitan Institute of Medical Science; University Medical Center Utrecht; The University of Western Australia; The University of Indonesia; RIKEN Center for Integrative Medical Sciences; Nagoya University; Osaka University; Niigata University; Chonnam National University Hospital; and Mass General Brigham (Protocols 2014P001342 and 2011P002207). Informed consent and permission to share the data were obtained from all subjects, in compliance with the guidelines specified by the recruiting center's institutional review board.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data availability

Mass General Brigham Biobank (MGBB) data are not publicly available due to privacy and ethical restrictions. De-identified data may be shared under an approved Data Use Agreement. UK Biobank (UKBB) data can be accessed under an approved application. The UKBB data used in the present study were obtained under application 32568. Data from schizophrenia cohorts are available through application to the Stanley Global Asia Initiatives: SGAI{at}broadinstitute.org. These data are subject to controlled access due to compliance requirements, participant consent and national laws. Application to access these data requires a brief research proposal that will be reviewed by the principal investigator of each cohort and, if necessary, by the respective ethics committee.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted July 14, 2024.

Subject Area

  • Genetic and Genomic Medicine