Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Single nucleotide variants in Pseudomonas aeruginosa populations from sputum correlate with baseline lung function and predict disease progression in individuals with cystic fibrosis

Morteza M. Saber, Jannik Donner, Inès Levade, Nicole Acosta, Michael D. Parkins, Brian Boyle, View ORCID ProfileRoger Levesque, View ORCID ProfileDao Nguyen, View ORCID ProfileB. Jesse Shapiro
doi: https://doi.org/10.1101/2021.10.04.21264421
Morteza M. Saber
1Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jannik Donner
2Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, Quebec H4A 3J1, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Inès Levade
2Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, Quebec H4A 3J1, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nicole Acosta
3Department of Microbiology, Immunology and Infectious Disease, University of Calgary, Calgary, AB, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael D. Parkins
3Department of Microbiology, Immunology and Infectious Disease, University of Calgary, Calgary, AB, Canada
4Department of Medicine, University of Calgary, Calgary, AB, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brian Boyle
5Integrative Systems Biology Institute, Université Laval, Québec, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Roger Levesque
5Integrative Systems Biology Institute, Université Laval, Québec, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Roger Levesque
Dao Nguyen
1Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
2Department of Medicine, Research Institute of the McGill University Health Centre, Montreal, Quebec H4A 3J1, Canada
6Meakins Christie Laboratories, Research Institute of the McGill University Health Centre, Montreal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dao Nguyen
  • For correspondence: dao.nguyen{at}mcgill.ca jesse.shapiro{at}mcgill.ca
B. Jesse Shapiro
1Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada
7McGill Genome Centre, Montreal, QC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for B. Jesse Shapiro
  • For correspondence: dao.nguyen{at}mcgill.ca jesse.shapiro{at}mcgill.ca
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Complex polymicrobial communities inhabit the lungs of individuals with cystic fibrosis (CF) and contribute to the decline in lung function. However, the severity of lung disease and its progression in CF patients are highly variable and imperfectly predicted by host clinical factors at baseline, CFTR mutations in the host genome, or sputum polymicrobial community variation. The opportunistic pathogen Pseudomonas aeruginosa (Pa) dominates airway infections in the majority of CF adults. Here we hypothesized that genetic variation within Pa populations would be predictive of lung disease severity. To quantify Pa genetic variation within whole CF sputum samples, we used deep amplicon sequencing on a newly developed custom Ion AmpliSeq panel of 209 Pa genes previously associated with the host pathoadaptation and pathogenesis of CF infection. We trained machine learning models using Pa single nucleotide variants (SNVs), clinical and microbiome diversity data to classify lung disease severity at the time of sputum sampling, and to predict future lung function decline over five years in a cohort of 54 adult CF patients with chronic Pa infection. The models using Pa SNVs alone classified baseline lung disease with good sensitivity and specificity, with an area under the receiver operating characteristic curve (AUROC) of 0.87. While the models were less predictive of future lung function decline, they still achieved an AUROC of 0.74. The addition of clinical data to the models, but not microbiome community data, yielded modest improvements (baseline lung function: AUROC=0.92; lung function decline: AUROC=0.79), highlighting the predictive value of the AmpliSeq data. Together, our work provides a proof-of-principle that Pa genetic variation in sputum is strongly associated with baseline lung disease, moderately predicts future lung function decline, and provides insight into the pathobiology of Pa’s effect on CF.

Importance Cystic fibrosis (CF) is among the most common, life-limiting inherited disorder, caused by mutations in the CF transmembrane conductance regulator (CFTR) gene. CF causes progressive damage to the lungs, the major cause of morbidity and mortality in CF patients. However, the rate of lung function decline is highly variable across CF patients, and cannot be fully explained using existing biomarkers in the human genome or patient co-morbidities. Pseudomonas aeruginosa (Pa) is known to evolve and adapt within chronic CF infections. We hypothesized that within-patient Pa diversity could affect lung disease severity. In a CF cohort study, we demonstrate the utility of machine learning tools for predictive modeling of baseline lung function and subsequent decline in CF patients using deep within-patient Pa amplicon sequencing. Our findings show the potential of these models to identify high-risk CF patients based on Pa diversity within the lung.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

The project was supported by funding from CIHR (PJT-148827 to DN) and a Vertex Research Innovation Award (DN), and salary support from the Cystic Fibrosis Canada Research Fellowship (Award ID 558850 to JD), the Leopoldina Foundation (German National Academy of Sciences Leopoldina, Award ID LPDS 2017-17), the Reseau en Sante respiratoire (IL), and the Fonds de Recherche en Sante Quebec (IL, DN). MMS and BJS were supported by a Genome Canada and Genome Quebec Bioinformatics and Computational Biology grant.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The study was carried out with the approval from the Research Ethics Boards from the University of Calgary (15-0854) and McGill University Health Centre (15-623).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All amplicon sequencing data generating in this project are deposited in NCBI GenBank under BioProject PRJNA763719.

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
Back to top
PreviousNext
Posted October 05, 2021.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Single nucleotide variants in Pseudomonas aeruginosa populations from sputum correlate with baseline lung function and predict disease progression in individuals with cystic fibrosis
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Single nucleotide variants in Pseudomonas aeruginosa populations from sputum correlate with baseline lung function and predict disease progression in individuals with cystic fibrosis
Morteza M. Saber, Jannik Donner, Inès Levade, Nicole Acosta, Michael D. Parkins, Brian Boyle, Roger Levesque, Dao Nguyen, B. Jesse Shapiro
medRxiv 2021.10.04.21264421; doi: https://doi.org/10.1101/2021.10.04.21264421
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Single nucleotide variants in Pseudomonas aeruginosa populations from sputum correlate with baseline lung function and predict disease progression in individuals with cystic fibrosis
Morteza M. Saber, Jannik Donner, Inès Levade, Nicole Acosta, Michael D. Parkins, Brian Boyle, Roger Levesque, Dao Nguyen, B. Jesse Shapiro
medRxiv 2021.10.04.21264421; doi: https://doi.org/10.1101/2021.10.04.21264421

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Infectious Diseases (except HIV/AIDS)
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)