Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Exploring the extent of uncatalogued genetic variation in antimicrobial resistance gene families in Escherichia coli

View ORCID ProfileSamuel Lipworth, Derrick Crook, A. Sarah Walker, Tim Peto, View ORCID ProfileNicole Stoesser
doi: https://doi.org/10.1101/2023.03.14.23287259
Samuel Lipworth
1Nuffield Department of Medicine, University of Oxford
2Oxford University Hospitals NHS Foundation Trust
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Samuel Lipworth
  • For correspondence: samuel.lipworth{at}ndm.ox.ac.uk
Derrick Crook
1Nuffield Department of Medicine, University of Oxford
2Oxford University Hospitals NHS Foundation Trust
3NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU
4NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with UKHSA, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
A. Sarah Walker
1Nuffield Department of Medicine, University of Oxford
3NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU
4NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with UKHSA, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tim Peto
1Nuffield Department of Medicine, University of Oxford
2Oxford University Hospitals NHS Foundation Trust
3NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicole Stoesser
1Nuffield Department of Medicine, University of Oxford
2Oxford University Hospitals NHS Foundation Trust
3NIHR Oxford Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford OX3 9DU
4NIHR Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance at University of Oxford in partnership with UKHSA, Oxford, UK
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nicole Stoesser
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

ABSTRACT

Background Antimicrobial resistance (AMR) in E. coli is a global problem associated with substantial morbidity and mortality. AMR-associated genes are typically annotated based on similarity to a variants in a curated reference database with an implicit assumption that uncatalogued genetic variation within these is phenotypically unimportant. In this study we evaluated the potential for discovering new AMR-associated gene families and characterising variation within existing ones to improve genotype-to-susceptibility-phenotype prediction in E. coli.

Methods We assembled a global dataset of 9001 E. coli sequences of which 8586 had linked antibiotic susceptibility data. Raw reads were assembled using Shovill and AMR genes extracted using the NCBI AMRFinder tool. Mash was used to calculate the similarity between extracted genes using Jaccard distances. We empirically reclustered extracted gene sequences into AMR-associated gene families (70% match) and alleles (ARGs, 100% match).

Results The performance of the AMRFinder database for genotype-to-phenotype predictions using strict 100% identity and coverage thresholds did not meet FDA thresholds for any of the eight antibiotics evaluated. Relaxing filters to default settings improved sensitivity with a specificity cost. For all antibiotics, a small number of genes explained most resistance although a proportion could not be explained by known ARGs; this ranged from 75.1% for co-amoxiclav to 3.4% for ciprofloxacin. Only 17,177/36,637 (47%) of ARGs detected had a 100% identity and coverage match in the AMRFinder database. After empirically reclassifying genes at 100% nucleotide sequence identity, we identified 1292 unique ARGs of which 158 (12%) were present ≥10 times, 374 (29%) were present 2-9 times and 760 (59%) only once. Simulated accumulation curves revealed that discovery of new (100%-match) ARGs present more than once in the dataset plateaued relatively quickly whereas new singleton ARGs were discovered even after many thousands of isolates had been included. We identified a strong correlation (Spearman coefficient 0.76 (95% CI 0.72-0.79, p<0.001)) between the number of times an ARG was observed in Oxfordshire and the number of times it was seen internationally, with ARGs that were observed 7 times in Oxfordshire always being found elsewhere. Finally, using the example of blaTEM-1, we demonstrated that uncatalogued variation, including synonymous variation, is associated with potentially important phenotypic differences (e.g. two common, uncatalogued blaTEM-1 alleles with only synonymous mutations compared to the known reference were associated with reduced resistance to co-amoxiclav [aOR 0.57, 95%CI 0.34-0.93, p=0.03] and piperacillin-tazobactam [aOR 0.54, 95%CI 0.32-0.87, p=0.01]).

Conclusions Overall we highlight substantial uncatalogued genetic variation with respect to known ARGs, although a relatively small proportion of these alleles are repeatedly observed in a large international dataset suggesting strong selection pressures. The current approach of using fuzzy matching for ARG detection, ignoring the unknown effects of uncatalogued variation, is unlikely to be acceptable for future clinical deployment. The association of synonymous mutations with potentially important phenotypic differences suggests that relying solely on amino acid-based gene detection to predict resistance is unlikely to be sufficient. Finally, the inability to explain all resistance using existing knowledge highlights the importance of new target gene discovery.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

We would like to thank the authors of the datasets used in this study for making their data freely available for public use. The computational aspects of this research were funded from the NIHR Oxford BRC with additional support from the Wellcome Trust Core Award Grant Number 203141/Z/16/Z. SL was funded by an MRC Clinical Research Training Fellowship MR/T001151/1. ASW and TEAP are also supported by the NIHR Oxford Biomedical Research Centre. ASW is an NIHR Senior Investigator. NS is an NIHR Oxford BRC Senior Fellow. This research is supported by the National Institute for Health Research (NIHR) Health Protection Research Unit in Healthcare Associated Infections and Antimicrobial Resistance (NIHR200915), a partnership between the UK Health Security Agency (UKHSA) and the University of Oxford. The views expressed are those of the author(s) and not necessarily those of the NIHR, UKHSA or the Department of Health and Social Care. This research was supported by the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Footnotes

  • ↵* Joint senior authors

Data Availability

All assemblies are available at 10.6084/m9.figshare.22220212 and associated metadata can be found in supplementary dataset 1. All code used for the analysis can be found at https://github.com/samlipworth/resistome_variation where there is also a binder environment in which the key aspects of the analysis can be replicated.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted March 15, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Exploring the extent of uncatalogued genetic variation in antimicrobial resistance gene families in Escherichia coli
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Exploring the extent of uncatalogued genetic variation in antimicrobial resistance gene families in Escherichia coli
Samuel Lipworth, Derrick Crook, A. Sarah Walker, Tim Peto, Nicole Stoesser
medRxiv 2023.03.14.23287259; doi: https://doi.org/10.1101/2023.03.14.23287259
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Exploring the extent of uncatalogued genetic variation in antimicrobial resistance gene families in Escherichia coli
Samuel Lipworth, Derrick Crook, A. Sarah Walker, Tim Peto, Nicole Stoesser
medRxiv 2023.03.14.23287259; doi: https://doi.org/10.1101/2023.03.14.23287259

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Infectious Diseases (except HIV/AIDS)
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)