Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

On Cross-ancestry Cancer Polygenic Risk Scores

View ORCID ProfileLars G. Fritsche, Ying Ma, Daiwei Zhang, Maxwell Salvatore, Seunggeun Lee, Xiang Zhou, Bhramar Mukherjee
doi: https://doi.org/10.1101/2021.02.24.21252351
Lars G. Fritsche
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
3Center for Precision Health Data Science, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
4University of Michigan Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan 48109, United States of America
8Department of Biostatistics, School of Public Health University of Michigan, 1415 Washington Heights, SPH Tower Room 4636, Ann Arbor, MI 48109
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Lars G. Fritsche
  • For correspondence: larsf{at}umich.edu
Ying Ma
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Daiwei Zhang
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Maxwell Salvatore
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
3Center for Precision Health Data Science, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
5Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Seunggeun Lee
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
7Graduate School of Data Science, Seoul National University, Seoul, South Korea
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiang Zhou
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
3Center for Precision Health Data Science, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Bhramar Mukherjee
1Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
2Center for Statistical Genetics, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
3Center for Precision Health Data Science, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
4University of Michigan Rogel Cancer Center, University of Michigan, Ann Arbor, Michigan 48109, United States of America
5Department of Epidemiology, University of Michigan School of Public Health, Ann Arbor, Michigan 48109, United States of America
6Michigan Institute for Data Science, University of Michigan, Ann Arbor, Michigan 48109, United States of America
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Polygenic risk scores (PRS) can provide useful information for personalized risk stratification and disease risk assessment, especially when combined with non-genetic risk factors. However, their construction depends on the availability of summary statistics from genome-wide association studies (GWAS) independent from the target sample. For best compatibility, it was reported that GWAS and the target sample should match in terms of ancestries. Yet, GWAS, especially in the field of cancer, often lack diversity and are predominated by European ancestry. This bias is a limiting factor in PRS research. By using electronic health records and genetic data from the UK Biobank, we contrast the utility of breast and prostate cancer PRS derived from external European-ancestry-based GWAS across African, East Asian, European, and South Asian ancestry groups. We highlight differences in the PRS distributions of these groups that are amplified when PRS methods condense hundreds of thousands of variants into a single score. While European-GWAS-derived PRS were not directly transferrable across ancestries on an absolute scale, we establish their predictive potential when considering them separately within each group. For example, the top 10% of the breast cancer PRS distributions within each ancestry group each revealed significant enrichments of breast cancer cases compared to the bottom 90% (odds ratio of 2.81 [95%CI: 2.69,2.93] in European, 2.88 [1.85, 4.48] in African, 2.60 [1.25, 5.40] in East Asian, and 2.33 [1.55, 3.51] in South Asian individuals). Our findings highlight a compromise solution for PRS research to compensate for the lack of diversity in well-powered European GWAS efforts while recruitment of diverse participants in the field catches up.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This material is based in part upon work supported by the National Institutes of Health/NIH (NCI P30CA046592 [LGF, MS, BM]), by the University of Michigan (UM-Precision Health Investigators Award U063790 [LGF, SP, YM, BM]), by the National Research Foundation of Korea (BP+ Program [SL]) and by the National Science Foundation under grant number DMS-1712933. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The open-access UK Biobank data used in this study included questionnaire data, electronic health record data, and genotype and genotyped derived data. UK Biobank received ethical approval from the NHS National Research Ethics Service North West (11/NW/0382).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

Data cannot be shared publicly due to patient confidentiality. The data underlying the results presented in the study are available from the UK Biobank for researchers who meet the criteria for access to confidential data.

http://www.ukbiobank.ac.uk/register-apply/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC 4.0 International license.
Back to top
PreviousNext
Posted March 02, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
On Cross-ancestry Cancer Polygenic Risk Scores
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
On Cross-ancestry Cancer Polygenic Risk Scores
Lars G. Fritsche, Ying Ma, Daiwei Zhang, Maxwell Salvatore, Seunggeun Lee, Xiang Zhou, Bhramar Mukherjee
medRxiv 2021.02.24.21252351; doi: https://doi.org/10.1101/2021.02.24.21252351
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
On Cross-ancestry Cancer Polygenic Risk Scores
Lars G. Fritsche, Ying Ma, Daiwei Zhang, Maxwell Salvatore, Seunggeun Lee, Xiang Zhou, Bhramar Mukherjee
medRxiv 2021.02.24.21252351; doi: https://doi.org/10.1101/2021.02.24.21252351

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)