medRxiv

Large-scale trans-ethnic replication and discovery of genetic associations for rare diseases with self-reported medical data

Suyash S. Shringarpure, Wei Wang, Yunxuan Jiang, Alison Acevedo, Devika Dhamija, Briana Cameron, Adrian Jubb, Peng Yue, The 23andMe Research Team, Lea Sarov-Blat, Robert Gentleman, Adam Auton
doi: https://doi.org/10.1101/2021.06.09.21258643
Suyash S. Shringarpure
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
  • For correspondence: sshringarpure{at}23andme.com
Wei Wang
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Yunxuan Jiang
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Alison Acevedo
²GlaxoSmithKline, 1250 S Collegeville Rd, Collegeville PA 19426
Devika Dhamija
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Briana Cameron
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Adrian Jubb
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Peng Yue
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
The 23andMe Research Team
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Lea Sarov-Blat
²GlaxoSmithKline, 1250 S Collegeville Rd, Collegeville PA 19426
Robert Gentleman
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086
Adam Auton
¹23andMe Inc., 223 N Mathilda Ave, Sunnyvale, CA 94086

Abstract

A key challenge in the study of rare disease genetics is assembling large case cohorts for well-powered studies. We demonstrate the use of self-reported diagnosis data to study rare diseases at scale. We performed genome-wide association studies (GWAS) for 33 rare diseases using self-reported diagnosis phenotypes and re-discovered 29 known associations to validate our approach. In addition, we performed the first GWAS for Duane retraction syndrome, vestibular schwannoma and spontaneous pneumothorax, and report novel genome-wide significant associations for these diseases. We replicated these novel associations in non-European populations within the 23andMe, Inc. cohort as well as in the UK Biobank cohort. We also show that mixed model analyses including all ethnicities and related samples increase the power for finding associations in rare diseases. Our results, based on analysis of 19,084 rare disease cases for 33 diseases from 7 populations, show that large-scale online collection of self-reported data is a viable method for discovery and replication of genetic associations for rare diseases. This approach, which is complementary to sequencing-based approaches, will enable the discovery of more novel genetic associations for increasingly rare diseases across multiple ancestries and shed more light on the genetic architecture of rare diseases.

Competing Interest Statement

Adam Auton, Briana Cameron, Devika Dhamija, Robert Gentleman, Yunxuan Jiang, Adrian Jubb, Suyash Shringarpure, Wei Wang, and Peng Yue are current or former employees of 23andMe and hold stock or stock options in 23andMe, Inc. Alison Acevedo and Lea Sarov-Blat are employees of GlaxoSmithKline and own company stock.

Funding Statement

No external funding

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Participants provided informed consent and participated in the research online, under a protocol approved by the external AAHRPP-accredited IRB, Ethical & Independent Review Services (E&I Review). Participants were included in the analysis on the basis of consent status as checked at the time data analyses were initiated.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The full GWAS summary statistics for the 23andMe discovery data set will be made available through 23andMe to qualified researchers under an agreement with 23andMe that protects the privacy of the 23andMe participants. Please visit https://research.23andme.com/collaborate/#dataset-access/ for more information and to apply to access the data.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Posted June 16, 2021.

Subject Area

  • Genetic and Genomic Medicine