medRxiv

Navigating sample overlap, winner’s curse and weak instrument bias in Mendelian randomization studies using the UK Biobank

Ildar I Sadreev, Benjamin L Elsworth, Ruth E Mitchell, Lavinia Paternoster, Eleanor Sanderson, Neil M Davies, Louise AC Millard, George Davey Smith, Philip C Haycock, Jack Bowden, Tom R Gaunt, Gibran Hemani
doi: https://doi.org/10.1101/2021.06.28.21259622
Ildar I Sadreev
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Benjamin L Elsworth
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Ruth E Mitchell
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Lavinia Paternoster
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Eleanor Sanderson
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Neil M Davies
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
3K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Norway
Louise AC Millard
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
George Davey Smith
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Philip C Haycock
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Jack Bowden
4Exeter Diabetes Group (ExCEED), College of Medicine and Health, University of Exeter, Exeter, U.K.
1MRC Integrative Epidemiology Unit, University of Bristol
Tom R Gaunt
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
Gibran Hemani
1MRC Integrative Epidemiology Unit, University of Bristol
2Population Health Sciences, Bristol Medical School, University of Bristol
  • For correspondence: g.hemani{at}bristol.ac.uk

Abstract

We performed GWAS on 2514 complex traits from the UK Biobank using a linear mixed model, identifying 40,620 independent significant associations (p < 5×10⁻⁸). We estimate that winner’s curse incurs substantial overestimation of effect sizes in a mean of 35% of discovered associations per trait. We use these results to estimate that the polygenicity of most complex traits is below 10,000 common causal variants. We evaluated the impact of winner’s curse on causal effect estimation and hypothesis testing in Mendelian randomization analyses. We show that winner’s curse substantially amplifies the magnitude of weak instrument bias, though any inflation of false discovery rates tends to be low or modest. We designed a process of pseudo-replication within the UK Biobank data to generate GWAS estimates that minimise bias in MR studies using these data. Our resource is integrated into the OpenGWAS platform and enables a convenient framework for researchers to minimise bias or maximise precision of causal effect estimates.
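
The winner’s curse described above can be illustrated with a toy simulation. The sketch below is not the authors' pipeline: the function name, the shared true effect, the standard error and the z-score threshold are all hypothetical values chosen for illustration. Each variant gets two independent noisy estimates of the same true effect, standing in for a discovery sample and a replication sample. Variants are selected on the discovery estimate only, and the mean estimate among the selected variants is then compared across the two samples.

```python
import random
import statistics


def simulate_winners_curse(n_variants=2000, true_beta=0.02, se=0.01,
                           z_threshold=3.0, seed=1):
    """Toy simulation: select variants on a noisy discovery estimate and
    compare with an independent replication estimate of the same effect.

    All parameter values are illustrative assumptions, not values from the
    study.
    """
    rng = random.Random(seed)
    discovery_selected = []
    replication_selected = []
    for _ in range(n_variants):
        # Two independent estimates of the same true effect.
        beta_disc = rng.gauss(true_beta, se)
        beta_rep = rng.gauss(true_beta, se)
        # Selection uses the discovery estimate only.
        if abs(beta_disc / se) > z_threshold:
            discovery_selected.append(beta_disc)
            replication_selected.append(beta_rep)
    return {
        "n_selected": len(discovery_selected),
        "mean_discovery": statistics.fmean(discovery_selected),
        "mean_replication": statistics.fmean(replication_selected),
    }


result = simulate_winners_curse()
print(result)
```

Under these assumptions the mean discovery estimate among selected variants sits well above the true effect, because only estimates that happened to land above the threshold were kept. The replication estimates for the same variants stay close to the true effect, since their noise is independent of the selection step. This is the general motivation for taking effect estimates from a sample other than the one used for discovery.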

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This research was funded in part by the Wellcome Trust and the Royal Society [208806/Z/17/Z] and the UK Medical Research Council (MRC Integrative Epidemiology Unit MC_UU_00011/1 and MC_UU_00011/4). NMD is supported by a Norwegian Research Council Grant number 295989. JB is supported by an Expanding Excellence in England (E3) research grant awarded to the University of Exeter. For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

NA

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The GWAS summary data generated as part of this study are available in the OpenGWAS database under batch IDs ukb-b, ukb-ba and ukb-bb at https://gwas.mrcieu.ac.uk/. Code used to generate the GWAS summary data is available at https://github.com/MRCIEU/BiobankGWAS/. Code used for the other statistical analyses in the study is available at https://github.com/isadreev/UKBB_replication

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted July 01, 2021.
medRxiv 2021.06.28.21259622; doi: https://doi.org/10.1101/2021.06.28.21259622

Subject Area

  • Epidemiology