medRxiv

Exome-wide association studies discover germline mutation patterns and identify high-risk populations in human cancers

Sipeng Shen, Yunke Jiang, Guanrong Wang, Hongru Li, Dongfang You, Weiwei Duan, Ruyang Zhang, Yongyue Wei, Hongbing Shen, Zhibin Hu, David C. Christiani, Yang Zhao, Feng Chen
doi: https://doi.org/10.1101/2022.06.11.22275897
Sipeng Shen
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
2Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Jiangsu Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, 211166, Nanjing, China
3China International Cooperation Center of Environment and Human Health, Nanjing Medical University
PhD
  • For correspondence: fengchen{at}njmu.edu.cn zhaoyang{at}njmu.edu.cn sshen{at}njmu.edu.cn
Yunke Jiang
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
MS
Guanrong Wang
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
4Department of Epidemiology, Jiangsu Health Development Research Center, Jiangsu Province, Nanjing 210036, China
MS
Hongru Li
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
MS
Dongfang You
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
3China International Cooperation Center of Environment and Human Health, Nanjing Medical University
PhD
Weiwei Duan
6Department of Bioinformatics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, Jiangsu 211166, China
PhD
Ruyang Zhang
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
5Key Laboratory of Biomedical Big Data of Nanjing Medical University, Nanjing 211166, China
PhD
Yongyue Wei
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
3China International Cooperation Center of Environment and Human Health, Nanjing Medical University
PhD
Hongbing Shen
2Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Jiangsu Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, 211166, Nanjing, China
7Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
PhD
Zhibin Hu
2Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Jiangsu Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, 211166, Nanjing, China
7Department of Epidemiology, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu 211166, China
PhD
David C. Christiani
8Department of Environmental Health, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA 02115, USA
9Pulmonary and Critical Care Division, Massachusetts General Hospital, Department of Medicine, Harvard Medical School, Boston, MA 02114, USA
MD
Yang Zhao
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
5Key Laboratory of Biomedical Big Data of Nanjing Medical University, Nanjing 211166, China
PhD
  • For correspondence: fengchen{at}njmu.edu.cn zhaoyang{at}njmu.edu.cn sshen{at}njmu.edu.cn
Feng Chen
1Department of Biostatistics, Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing 211166, China
2Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Jiangsu Collaborative Innovation Center for Cancer Personalized Medicine, Nanjing Medical University, 211166, Nanjing, China
3China International Cooperation Center of Environment and Human Health, Nanjing Medical University
PhD
  • For correspondence: fengchen{at}njmu.edu.cn zhaoyang{at}njmu.edu.cn sshen{at}njmu.edu.cn

Abstract

Genome-wide association studies have discovered numerous common variants associated with human cancers. However, the contribution of rare exome-wide variants to cancer risk remains largely unexplored, especially for protein-coding variants. The UK Biobank provides detailed cancer follow-up information linked to whole-exome sequencing (WES) for approximately 450,000 participants, offering an unprecedented opportunity to evaluate the effect of exome variation across cancer types. Here, we performed exome-wide association studies (ExWAS) at the single-variant and gene levels to detect associations across 20 primary cancer types, analyzing the discovery set (WES-300k, N = 284,456) and the replication set (WES-150k, N = 143,478) separately. The ExWAS detected 143 independent variants at the variant level and 49 genes at the gene level; nine variants and eight genes were shared across cancers. In the cross-trait meta-analysis, we identified 239 additional independent pleiotropic variants, which mapped to genes shown to be functional in trans-omics analyses of transcriptomic and proteomic data. Further, we developed exome-wide risk scores (ERS) to identify high-risk populations based on rare variants with minor allele frequency (MAF) < 0.05. The ERS showed satisfactory performance in cancer risk stratification, especially for individuals at extremely high risk (top 5% of ERS), who were frequently risk allele carriers. In the replication set, the ERS (median C-index (IQR): 0.655 (0.636-0.667)) outperformed the traditional polygenic risk score (PRS) (median C-index (IQR): 0.585 (0.572-0.614)) in discrimination. Our findings offer further insight into the genetic architecture of human exomes for cancer susceptibility.
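
To make the two quantities named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes an additive score (a weighted sum of risk-allele counts restricted to variants with MAF < 0.05) and uses Harrell's C-index as the discrimination measure. The abstract does not state how the ERS weights are estimated or which C-index estimator was used, so the function names, the weights, and the toy data below are all hypothetical.

```python
from typing import Sequence


def risk_score(genotypes: Sequence[int],
               weights: Sequence[float],
               mafs: Sequence[float],
               maf_cutoff: float = 0.05) -> float:
    """Weighted sum of risk-allele counts over variants with MAF below the cutoff.

    genotypes: risk-allele counts (0, 1 or 2) for one person.
    weights:   per-variant effect weights (hypothetical; e.g. log hazard ratios).
    mafs:      per-variant minor allele frequencies.
    """
    return sum(g * w for g, w, f in zip(genotypes, weights, mafs) if f < maf_cutoff)


def harrell_c_index(times: Sequence[float],
                    events: Sequence[int],
                    scores: Sequence[float]) -> float:
    """Harrell's C-index for right-censored follow-up data.

    A pair (i, j) is comparable when person i had the event and person j was
    still under follow-up at a strictly later time. The pair is concordant
    when person i has the higher score; tied scores count as one half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # censored subjects cannot anchor a comparable pair
        for j in range(n):
            if times[j] > times[i]:
                comparable += 1
                if scores[i] > scores[j]:
                    concordant += 1.0
                elif scores[i] == scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


if __name__ == "__main__":
    # Toy data (entirely made up): 3 variants, the third is common (MAF 0.20)
    # and is therefore excluded from the score.
    weights = [0.5, 1.0, 0.25]
    mafs = [0.01, 0.02, 0.20]
    people = [[1, 1, 0], [1, 0, 2], [0, 0, 2], [0, 0, 0]]
    scores = [risk_score(g, weights, mafs) for g in people]
    times = [2.0, 4.0, 6.0, 8.0]   # follow-up time
    events = [1, 1, 0, 1]          # 1 = cancer diagnosed, 0 = censored
    print(scores, harrell_c_index(times, events, scores))
```

A C-index of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the reported medians of 0.655 (ERS) and 0.585 (PRS) are compared.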

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This study was supported by the National Natural Science Foundation of China (82103946 to S.S., 82173620 to Y.Z., 81530088 to F.C.), National Key Research and Development Program of China (2016YFE0204900 to F.C.), Natural Science Foundation of the Jiangsu Higher Education Institutions of China (21KJB330004 to S.S.).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Consent for publication: All authors have reviewed and approved this manuscript.

  • Conflicts of interest statements/Financial Disclosure statement: The authors report no conflicts of interest.


Data Availability

UK Biobank data is available from https://www.ukbiobank.ac.uk/. TCGA data is available from https://portal.gdc.cancer.gov/. GTEx data is available from https://www.gtexportal.org/home/. CPTAC data is available from https://pdc.esacinc.com/pdc/pdc.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Posted June 16, 2022.

Subject Area

  • Oncology