Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

A Framework for Automated Gene Selection in Genomic Screening

View ORCID ProfileL Lazo de la Vega, W Yu, K Machini, View ORCID ProfileCA Austin-Tse, L Hao, CL Blout Zawatsky, View ORCID ProfileH Mason-Suares, RC Green, View ORCID ProfileHL Rehm, View ORCID ProfileMS Lebo
doi: https://doi.org/10.1101/2020.12.11.20231449
L Lazo de la Vega
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
2Department of Pathology, Brigham & Women’s Hospital, Boston, MA
3Harvard Medical School, Boston, MA
4Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for L Lazo de la Vega
W Yu
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
K Machini
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
2Department of Pathology, Brigham & Women’s Hospital, Boston, MA
3Harvard Medical School, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
CA Austin-Tse
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
3Harvard Medical School, Boston, MA
5Center for Genomic Medicine and Departments of Pathology and Medicine, Massachusetts General Hospital, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for CA Austin-Tse
L Hao
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
3Harvard Medical School, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
CL Blout Zawatsky
6Division of Genetics, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
H Mason-Suares
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
2Department of Pathology, Brigham & Women’s Hospital, Boston, MA
3Harvard Medical School, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for H Mason-Suares
RC Green
3Harvard Medical School, Boston, MA
4Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA
6Division of Genetics, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, USA
7Ariadne Labs, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
HL Rehm
2Department of Pathology, Brigham & Women’s Hospital, Boston, MA
3Harvard Medical School, Boston, MA
4Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA
5Center for Genomic Medicine and Departments of Pathology and Medicine, Massachusetts General Hospital, Boston, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for HL Rehm
MS Lebo
1Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA
2Department of Pathology, Brigham & Women’s Hospital, Boston, MA
3Harvard Medical School, Boston, MA
4Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for MS Lebo
  • For correspondence: mlebo{at}bwh.harvard.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

An efficient framework to identify disease-causing genes is needed to evaluate genomic data for both individuals with an unknown disease etiology and those undergoing genomic screening. Here, we propose a framework for gene selection used in genomic analyses, including screening applications limited to genes with strong or established evidence levels and diagnostic applications that includes genes with less or emerging evidence of disease association. We extracted genes with evidence for gene-disease association from the Human Gene Mutation Database, Online Mendelian Inheritance in Man, and ClinVar to build a diagnostic gene list of 5,973 genes. Next, we applied stringent filters in conjunction with computationally curated evidence (DisGeNET) to create a list limited to 3,600 genes with stronger levels of evidence for disease association. When compared to manual gene curation efforts, including the Clinical Genome Resource, genes with strong or definitive disease associations are included in both gene lists at high percentages, while genes with limited evidence are largely removed. We further confirmed the utility of this approach in the screening of 45 ostensibly healthy genomes. Our approach efficiently creates highly sensitive gene lists for genomic applications, while remaining dynamic and updatable, enabling time savings in gene curation and review.

Competing Interest Statement

Ms. Blout Zawatsky, Dr. Lazo de la Vega, Dr. Lebo, report grants from National Heart Blood and Lung Institute, during the conduct of the study. Dr. Austin-Tse, Dr. Mason-Suares, Dr. Machini, Dr. Hao, and Dr. Yu have nothing to disclose. Dr. Rehm reports grants from NIH, grants from National Heart Blood and Lung Institute, during the conduct of the study; personal fees from Genome Medical, outside the submitted work. Dr. Green reports grants from National Heart Blood and Lung Institute, during the conduct of the study; personal fees from AIA, personal fees from SavvySherpa, personal fees from Verily, personal fees from Wamberg, and is co-founder of Genome Medical, outside the submitted work.

Funding Statement

Funding support was partly provided by grant 5R01HL143295 from the National Heart, Lung, and Blood Institute (LLV, CLZ, RCG, HLR, MSL).

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

This project has been reviewed and approved by the Mass General Brigham IRB.

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

The resulting genes lists and information used to create them can be found in the Supplemental Materials. This gene list will be provided on-line for easy access and on-going updates.

https://docs.google.com/spreadsheets/d/1VabFne_4TqEHxczwurwC8gokE0Q9fXCfbLlS5-9SqYs/edit#gid=1463456679

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted December 15, 2020.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
A Framework for Automated Gene Selection in Genomic Screening
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
A Framework for Automated Gene Selection in Genomic Screening
L Lazo de la Vega, W Yu, K Machini, CA Austin-Tse, L Hao, CL Blout Zawatsky, H Mason-Suares, RC Green, HL Rehm, MS Lebo
medRxiv 2020.12.11.20231449; doi: https://doi.org/10.1101/2020.12.11.20231449
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
A Framework for Automated Gene Selection in Genomic Screening
L Lazo de la Vega, W Yu, K Machini, CA Austin-Tse, L Hao, CL Blout Zawatsky, H Mason-Suares, RC Green, HL Rehm, MS Lebo
medRxiv 2020.12.11.20231449; doi: https://doi.org/10.1101/2020.12.11.20231449

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)