Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

The performance of AlphaMissense to identify genes causing disease

Yiheng Chen, View ORCID ProfileGuillaume Butler-Laporte, Kevin Y. H. Liang, View ORCID ProfileYann Ilboudo, Summaira Yasmeen, Takayoshi Sasako, Claudia Langenberg, View ORCID ProfileCelia M.T. Greenwood, J Brent Richards
doi: https://doi.org/10.1101/2024.03.05.24303647
Yiheng Chen
1Department of Human Genetics, McGill University, Montréal, QC, Canada
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Guillaume Butler-Laporte
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
3Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Guillaume Butler-Laporte
Kevin Y. H. Liang
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
4Quantitative Life Sciences Program, McGill University, Montreal, Quebec, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yann Ilboudo
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Yann Ilboudo
Summaira Yasmeen
5Computational Medicine, Berlin Institute of Health at Charité—Universitätsmedizin Berlin, Berlin, Germany
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Takayoshi Sasako
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
6Tanaka Diabetes Clinic Omiya, Saitama, Japan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Claudia Langenberg
7Precision Healthcare University Research Institute, Queen Mary University of London, London, UK
5Computational Medicine, Berlin Institute of Health at Charité—Universitätsmedizin Berlin, Berlin, Germany
8MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Celia M.T. Greenwood
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
3Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC, Canada
4Quantitative Life Sciences Program, McGill University, Montreal, Quebec, Canada
9Gerald Bronfman Department of Oncology, McGill University, Montreal, Quebec, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Celia M.T. Greenwood
J Brent Richards
1Department of Human Genetics, McGill University, Montréal, QC, Canada
2Lady Davis Institute, Jewish General Hospital, McGill University, Montréal, QC, Canada
3Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC, Canada
105 Prime Sciences Inc, Montréal, Quebec, Canada
11Department of Medicine, McGill University, Montréal, Quebec, Canada
12Department of Twin Research, King’s College London, London, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: brent.richards{at}mcgill.ca
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

A novel algorithm, AlphaMissense, has been shown to have an improved ability to predict the pathogenicity of rare missense genetic variants. However, it is not known whether AlphaMissense improves the ability of gene-based testing to identify disease-causing genes. Using whole-exome sequencing data from the UK Biobank, we compared gene-based association analysis strategies including sets of deleterious variants: predicted loss-of-function (pLoF) variants only, pLoF plus AlphaMissense pathogenic variants, pLoF with missense variants predicted to be deleterious by any of five commonly utilized annotation methods (Missense (1/5)) or only variants predicted to be deleterious by all five methods (Missense (5/5)). We measured performance to identify 519 previously identified positive control genes, which can cause Mendelian diseases, or are the targets of successfully developed medicines. These strategies identified 850k pLoF variants and 5 million deleterious missense variants, including 22k likely pathogenic missense variants identified exclusively by AlphaMissense. The gene-based association tests found 608 significant gene associations (at P<1.25×10−7) across 24 common traits and diseases. Compared to pLOFs plus Missense (5/5), tests using pLoFs and AlphaMissense variants found slightly more significant gene-disease and gene-trait associations, albeit with a marginally lower proportion of positive control genes. Nevertheless, their overall performance was similar. Merging AlphaMissense with Missense (5/5), whether through their intersection or union, did not yield any further enhancement in performance. In summary, employing AlphaMissense to select deleterious variants for gene-based testing did not improve the ability to identify genes that are known to cause disease.

Competing Interest Statement

J.B.R is the CEO of 5 Prime Sciences (www.5primesciences.com), which provides research services for biotech, pharma, and venture capital companies for projects unrelated to this research. He has served as an advisor to GlaxoSmithKline and Deerfield Capital. The institution of J.B.R. has received investigator-initiated grant funding from Eli Lilly, GlaxoSmithKline, and Biogen for projects unrelated to this research. YC is an employee of 5 Prime Sciences.

Funding Statement

The Richards research group is supported by the Canadian Institutes of Health Research (CIHR: 365825, 409511, 100558, 169303), the McGill Interdisciplinary Initiative in Infection and Immunity (MI4), the Lady Davis Institute of the Jewish General Hospital, the Jewish General Hospital Foundation, the Canadian Foundation for Innovation, the NIH Foundation, Cancer Research UK, Genome Quebec, the Public Health Agency of Canada, McGill University, Cancer Research UK, and the Fonds de Recherche Quebec Sante (FRQS). J.B.R. is supported by an FRQS Merite Clinical Research Scholarship. Support from Calcul Quebec and Compute Canada is acknowledged. TwinsUK is funded by the Welcome Trust, Medical Research Council, European Union, the National Institute for Health Research (NIHR)-funded BioResource, Clinical Research Facility and Biomedical Research Centre based at Guys and St Thomas NHS Foundation Trust in partnership with Kings College London. Y.C. is supported by an FRQS doctoral training fellowship and the Lady Davis Institute/TD Bank Studentship Award. G.B.L. is supported by scholarships from the FRQS, the CIHR, and Quebec ministry of health and social services.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The UK Biobank was approved by the North West Multi-centre Research Ethics Committee and informed consent was obtained from all participants prior to participation.This research has been conducted using UK Biobank data under application ID 27449.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data availability

Individual-level genotype, exome sequencing, and phenotype data is available to approved researchers via UK Biobank at: https://www.ukbiobank.ac.uk. ExWAS summary statistics will be made available at GWAS Catalog (https://www.ebi.ac.uk/gwas/).

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted March 07, 2024.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
The performance of AlphaMissense to identify genes causing disease
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
The performance of AlphaMissense to identify genes causing disease
Yiheng Chen, Guillaume Butler-Laporte, Kevin Y. H. Liang, Yann Ilboudo, Summaira Yasmeen, Takayoshi Sasako, Claudia Langenberg, Celia M.T. Greenwood, J Brent Richards
medRxiv 2024.03.05.24303647; doi: https://doi.org/10.1101/2024.03.05.24303647
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
The performance of AlphaMissense to identify genes causing disease
Yiheng Chen, Guillaume Butler-Laporte, Kevin Y. H. Liang, Yann Ilboudo, Summaira Yasmeen, Takayoshi Sasako, Claudia Langenberg, Celia M.T. Greenwood, J Brent Richards
medRxiv 2024.03.05.24303647; doi: https://doi.org/10.1101/2024.03.05.24303647

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Genetic and Genomic Medicine
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)