ABSTRACT
Studies of the health effects of the microbiome often measure overall associations using diversity metrics and test individual taxa in separate analyses, but they do not account for the correlated relationships among taxa. In this study, we applied random subset weighted quantile sum regression with repeated holdouts (WQSRSRH), a mixture method that has been successfully applied to ‘omic data to account for relationships among many predictors, to processed amplicon sequencing data from the Human Microbiome Project. We simulated a binary variable associated with 20 operational taxonomic units (OTUs). WQSRSRH was used to test for an association between the microbiome and the simulated variable, adjusted for sex, and sensitivity and specificity were calculated. WQSRSRH was also compared with other standard methods for microbiome analysis. The method was further illustrated using real data from the Growth and Obesity Cohort in Chile to assess the association between the gut microbiome and body mass index. In the analysis with simulated data, WQSRSRH predicted the correct direction of association between the microbiome and the simulated variable, with an average sensitivity and specificity of 75% and 70%, respectively, in identifying the 20 associated OTUs. WQSRSRH performed better than all comparison methods. In the illustrative analysis of the gut microbiome and obesity, WQSRSRH identified an inverse association between body mass index and the gut microbe mixture, with Bacteroides, Clostridium, and Ruminococcus, among others, identified as important genera in the negative association. Applying WQSRSRH to the microbiome allows analysis of the mixture effect of all taxa in the microbiome while simultaneously identifying the taxa most important to the mixture and allowing for covariate adjustment. It outperformed other methods on simulated data, and in the analysis of real data it produced results consistent with other study findings.
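The sensitivity and specificity reported above measure how well a method recovers the OTUs that were simulated as associated with the outcome. The following is a minimal sketch of that calculation, not the authors' code. It assumes that an OTU counts as "identified" when its estimated mixture weight exceeds 1/p, where p is the number of OTUs. This cutoff is a common convention for weighted quantile sum regression, but it is not stated in the abstract. All names, weights, and counts below are hypothetical toy values.

```python
from typing import Dict, Iterable, Set, Tuple


def select_by_weight(weights: Dict[str, float]) -> Set[str]:
    """Flag OTUs whose weight exceeds 1/p (assumed cutoff; p = number of OTUs)."""
    threshold = 1.0 / len(weights)
    return {otu for otu, w in weights.items() if w > threshold}


def sensitivity_specificity(
    all_otus: Iterable[str],
    truly_associated: Iterable[str],
    flagged: Iterable[str],
) -> Tuple[float, float]:
    """Compare flagged OTUs with the simulated truth and return both metrics."""
    universe = set(all_otus)
    truth = set(truly_associated) & universe
    called = set(flagged) & universe
    tp = len(truth & called)             # associated and flagged
    fn = len(truth - called)             # associated but missed
    fp = len(called - truth)             # flagged but not associated
    tn = len(universe - truth - called)  # neither associated nor flagged
    return tp / (tp + fn), tn / (tn + fp)


# Toy setup: 100 OTUs, the first 20 simulated as associated with the outcome.
otus = [f"OTU_{i}" for i in range(100)]
truth = set(otus[:20])

# Hypothetical weights: 16 associated and 10 unassociated OTUs receive a
# weight of 0.03 (above 1/100); the other 74 share the remaining 0.22.
high = set(otus[:16]) | set(otus[20:30])
low_weight = (1.0 - 0.03 * len(high)) / (len(otus) - len(high))
weights = {otu: (0.03 if otu in high else low_weight) for otu in otus}

flagged = select_by_weight(weights)
sensitivity, specificity = sensitivity_specificity(otus, truth, flagged)
print(f"sensitivity={sensitivity:.3f}, specificity={specificity:.3f}")
```

In a simulation study, this comparison would be repeated across simulated datasets and the two metrics averaged.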
Competing Interest Statement
The authors have declared no competing interest.
Clinical Protocols
https://github.com/ShoshannahE/WQS-Microbiome/
Funding Statement
S.E. was supported by the Eunice Kennedy Shriver National Institute of Child Health and Human Development (T32 HD049311) and the National Institute of Environmental Health Sciences (K99 ES032884). M.B. and C.G. were supported by the National Institute of Environmental Health Sciences through the Mount Sinai Transdisciplinary Center on Early Environmental Exposures Biostatistics and Bioinformatics Facility Core (P30ES023515) and the HHEAR Statistical Services and Analysis Resource Core (U2CES026555).
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
This study is exempt from review by the Mount Sinai Institutional Review Board as the data are de-identified and publicly available.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.
Yes
Footnotes
Additional comparison methods have been added to the analysis of BMI and the gut microbiome in the GOCS cohort. Other minor revisions have also been made for clarity.
Data Availability
Data come from the Human Microbiome Project I, the Growth and Obesity Cohort Study, and the Human Health Exposure Analysis Resource, and are publicly available at https://www.hmpdacc.org/hmp/ and under the DOIs doi.org/10.36043/1977_480 and doi.org/10.36043/1977_490. Code used in this analysis is available at github.com/ShoshannahE/WQS-Microbiome (DOI: 10.5281/zenodo.7017101).