medRxiv

Advancing Human Genetics Research and Drug Discovery through Exome Sequencing of the UK Biobank

Joseph D. Szustakowski, Suganthi Balasubramanian, Ariella Sasson, Shareef Khalid, Paola G. Bronson, Erika Kvikstad, Emily Wong, Daren Liu, J. Wade Davis, Carolina Haefliger, A. Katrina Loomis, Rajesh Mikkilineni, Hyun Ji Noh, Samir Wadhawan, Xiaodong Bai, Alicia Hawes, Olga Krasheninina, Ricardo Ulloa, Alex Lopez, Erin N. Smith, Jeff Waring, Christopher D. Whelan, Ellen A. Tsai, John Overton, William Salerno, Howard Jacob, Sandor Szalma, Heiko Runz, Greg Hinkle, Paul Nioi, Slavé Petrovski, Melissa R. Miller, Aris Baras, Lyndon Mitnaul, Jeffrey G. Reid, on behalf of the UKB-ESC Research Team
doi: https://doi.org/10.1101/2020.11.02.20222232

Author Affiliations

  • 1 Bristol Myers Squibb, Route 206 and Province Line Road, Princeton, NJ 08543: Joseph D. Szustakowski, Ariella Sasson, Erika Kvikstad, Samir Wadhawan
  • 2 Regeneron Pharmaceuticals Inc., 777 Old Saw Mill River Road, Tarrytown, New York 10591: Suganthi Balasubramanian, Shareef Khalid, Daren Liu, Xiaodong Bai, Alicia Hawes, Olga Krasheninina, Ricardo Ulloa, Alex Lopez, John Overton, William Salerno, Aris Baras, Lyndon Mitnaul, Jeffrey G. Reid
  • 3 Biogen Inc., 225 Binney Street, Cambridge, MA 02139: Paola G. Bronson, Christopher D. Whelan, Ellen A. Tsai, Heiko Runz
  • 4 Takeda Pharmaceutical Company Ltd, 1-1, Nihonbashi-Honcho 2-chome, Chuo-ku, Tokyo, Japan: Emily Wong, Rajesh Mikkilineni, Erin N. Smith, Sandor Szalma
  • 5 Abbvie Inc., 1 N. Waukegan Rd, North Chicago, IL 60064: J. Wade Davis, Hyun Ji Noh, Jeff Waring, Howard Jacob
  • 6 Alnylam Pharmaceuticals, 675 West Kendall St, Cambridge, MA 02142: Greg Hinkle, Paul Nioi
  • 7 AstraZeneca Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, Cambridge, UK: Carolina Haefliger, Slavé Petrovski
  • 8 Pfizer, Inc., 1 Portland St., Cambridge, MA 02139: A. Katrina Loomis, Melissa R. Miller

For correspondence: Jeffrey G. Reid, jeffrey.reid{at}regeneron.com

Abstract

The UK Biobank Exome Sequencing Consortium (UKB-ESC) is a unique private/public partnership between the UK Biobank and eight biopharma companies that will sequence the exomes of all ∼500,000 UK Biobank participants. Here we describe early results from the exome sequence data generated by this consortium for the first ∼200,000 UKB subjects and the key features of this project that enabled the UKB-ESC to come together and generate this data.

Exome sequencing data from the first 200,643 UKB enrollees are now accessible to the research community. Approximately 10M variants were observed within the targeted regions, including 8,086,176 SNPs, 370,958 indels and 1,596,984 multi-allelic variants. Of the ∼8M variants observed, 84.5% are coding variants and include 2,139,318 (25.3%) synonymous, 4,549,694 (53.8%) missense, and 453,733 (5.4%) predicted loss-of-function (LOF) variants (initiation codon loss, premature stop codons, stop codon loss, splicing and frameshift variants) affecting at least one coding transcript. These open access data provide a rich resource of coding variants for rare variant genetic studies, and are particularly valuable for drug discovery efforts that utilize rare, functionally consequential variants.
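
The variant counts above can be checked with a few lines of arithmetic. The sketch below is illustrative only and is not part of the consortium's analysis. It assumes that the "∼8M" denominator for the percentages is the sum of SNPs and indels, with multi-allelic variants excluded. The abstract does not state this, but it is the reading under which the reported percentages are reproduced. The variable and function names are hypothetical.

```python
# Counts copied from the abstract.
snps = 8_086_176
indels = 370_958
multi_allelic = 1_596_984

coding_counts = {
    "synonymous": 2_139_318,
    "missense": 4_549_694,
    "predicted LOF": 453_733,
}

# All variants observed in the targeted regions (the "approximately 10M").
total_variants = snps + indels + multi_allelic

# ASSUMPTION: the "~8M" base for the percentages is SNPs + indels only.
denominator = snps + indels


def percent(count: int, base: int) -> float:
    """Return count as a percentage of base, rounded to one decimal place."""
    return round(100 * count / base, 1)


shares = {name: percent(n, denominator) for name, n in coding_counts.items()}
coding_share = percent(sum(coding_counts.values()), denominator)

print(f"total variants: {total_variants:,}")
print(f"assumed denominator: {denominator:,}")
for name, share in shares.items():
    print(f"{name}: {share}%")
print(f"all coding: {coding_share}%")
```

Under this assumption the three category shares and the overall coding share match the figures quoted in the abstract to one decimal place.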

Over the past decade, the biopharma industry has increasingly leveraged human genetics as part of their drug discovery and development strategies. This shift was motivated by technical advances that enabled cost-effective human genetics research at scale, the emergence of electronic health records and biobanks, and a maturing understanding of how human genetics can increase the probability of successful drug development. Recognizing the need for large-scale human genetics data to drive drug discovery, and the unique value of the open data access policies and contribution terms of the UK Biobank, the UKB-ESC was formed. This precompetitive collaboration has further strengthened the ties between academia and industry and provided teams an unprecedented opportunity to interact with and learn from the wider research community.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

Funding for this work was provided by the authors' institutions, and no external funding was received for the generation and analysis of the 200k exome data results presented here. UK Biobank generally receives external funding from a variety of sources.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This work was reviewed and approved by UK Biobank, for details see the UKB ethics and governance framework: https://www.ukbiobank.ac.uk/wp-content/uploads/2011/05/EGF20082.pdf

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data are available through the UK Biobank (https://www.ukbiobank.ac.uk/) to bona fide researchers who have submitted compliant research applications.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Posted November 04, 2020.

Subject Area

  • Genetic and Genomic Medicine