medRxiv

Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis

Lluís Revilla, Aida Mayorgas, Ana Maria Corraliza, Maria C. Masamunt, Amira Metwaly, Dirk Haller, Eva Tristán, Anna Carrasco, Maria Esteve, Julian Panés, Elena Ricart, Juan J. Lozano, Azucena Salas
doi: https://doi.org/10.1101/2020.04.16.20031492
Lluís Revilla
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Aida Mayorgas
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Ana Maria Corraliza
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Maria C. Masamunt
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Amira Metwaly
3Chair of Nutrition and Immunology, Technical University of Munich, Freising-Weihenstephan, Germany
Dirk Haller
3Chair of Nutrition and Immunology, Technical University of Munich, Freising-Weihenstephan, Germany
4ZIEL Institute for Food and Health, Technical University of Munich, Freising-Weihenstephan, Germany
Eva Tristán
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
5Department of Gastroenterology, Hospital Universitari Mútua Terrassa, Barcelona, Spain
Anna Carrasco
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
5Department of Gastroenterology, Hospital Universitari Mútua Terrassa, Barcelona, Spain
Maria Esteve
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
5Department of Gastroenterology, Hospital Universitari Mútua Terrassa, Barcelona, Spain
Julian Panés
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Elena Ricart
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
Juan J. Lozano
1Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Barcelona, Spain
Azucena Salas
2Department of Gastroenterology, IDIBAPS, Hospital Clínic, Barcelona, Spain
  • For correspondence: asalas1{at}clinic.cat

Abstract

Background Personalized medicine requires finding relationships between variables that influence a patient’s phenotype and predicting an outcome. Sparse generalized canonical correlation analysis identifies relationships between different groups of variables. This method requires establishing a model of the expected interaction between those variables. Describing these interactions is challenging when the relationship is unknown or when there is no pre-established hypothesis.

Aim To develop a method to find the relationships between microbiome and transcriptome data and the relevant clinical variables in a complex disease, such as Crohn’s disease.

Results We present a method to identify interactions based on canonical correlation analysis. Our main contribution is to show that the model is the most important factor for identifying relationships between blocks. Analyses were conducted on three independent data sets: glioma, Crohn’s disease and pouchitis. We describe how to select the optimum hyperparameters on the glioma data set. Using these hyperparameters on the Crohn’s disease data set, our analysis revealed the best model for identifying relationships between transcriptome, gut microbiome and clinically relevant variables. With the pouchitis data set, our analysis revealed that adding the clinically relevant variables improves the average variance explained by the model.

Conclusions The methodology described herein provides a framework for identifying interactions between sets of (omic) data and clinically relevant variables. Following this method, we found genes and microorganisms that were related to each other independently of the model, while others were specific to the model used. Thus, model selection proved crucial to finding the existing relationships in multi-omics datasets.
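
To make the notion of a "model" concrete: in generalized canonical correlation analysis, the expected interactions between blocks are commonly encoded as a symmetric design matrix whose entry (j, k) weights the link between blocks j and k. The abstract does not specify this encoding, so the sketch below is an illustration under that assumption rather than the authors' implementation. The block names, the weight grid (0, 0.5, 1) and the helper names `candidate_designs` and `is_connected` are all hypothetical. It enumerates candidate models for three blocks and keeps those in which no block is isolated from the rest.

```python
from itertools import product

import numpy as np


def candidate_designs(n_blocks, weights=(0.0, 0.5, 1.0)):
    """Yield symmetric design matrices with a zero diagonal.

    Each off-diagonal pair (i, j) takes one value from `weights`
    (an assumed grid, not taken from the source).
    """
    pairs = [(i, j) for i in range(n_blocks) for j in range(i + 1, n_blocks)]
    for combo in product(weights, repeat=len(pairs)):
        design = np.zeros((n_blocks, n_blocks))
        for (i, j), w in zip(pairs, combo):
            design[i, j] = design[j, i] = w
        yield design


def is_connected(design):
    """Return True if every block is reachable from block 0 via nonzero links."""
    n = design.shape[0]
    seen = {0}
    stack = [0]
    while stack:
        i = stack.pop()
        for j in range(n):
            if design[i, j] > 0 and j not in seen:
                seen.add(j)
                stack.append(j)
    return len(seen) == n


# Hypothetical block labels, chosen to mirror the data types named in the abstract.
blocks = ["transcriptome", "microbiome", "clinical"]
designs = [d for d in candidate_designs(len(blocks)) if is_connected(d)]
print(f"{len(designs)} candidate models for {len(blocks)} blocks")
```

Each retained matrix would then be passed to a canonical correlation solver and the resulting models compared, for example by average variance explained; that fitting step is omitted here.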

  • Integration
  • canonical correlation analysis
  • inflammatory bowel disease
  • interaction model
  • machine learning
  • multi-omics

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by the Leona and Harry Helmsley Charitable Trust grant 2015PG-IBD005, including the work of AMC, AmM, DH, ET, AC, ME. LLRS, AC, ET, ME and JJL are supported by the Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), AiM by the grant SAF2015-66379-R to AS of the Ministerio de Ciencia, Innovación y Universidades.

Author Declarations

All relevant ethical guidelines have been followed; any necessary IRB and/or ethics committee approvals have been obtained and details of the IRB/oversight body are included in the manuscript.

Yes

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Authors’ information LR: lrevilla{at}clinic.cat; AiM: mayorgas{at}clinic.cat; AMC: corraliza{at}clinic.cat; MCM: mmasamun{at}clinic.cat; AmM: amira.metwaly{at}tum.de; DH: dirk.haller{at}tum.de; ET: etristan{at}mutuaterrasa.cat; AC: anna.carrasco.garcia{at}gmail.com; ME: mariaesteve{at}mutuaterrassa.cat; JP: jpanes{at}clinic.cat; ER: ericart{at}clinic.cat; JJL: juanjo.lozano{at}ciberehd.org; AS: asalas1{at}clinic.cat

Data Availability

Glioma data are available at https://biodev.cea.fr/sgcca/. The datasets supporting the conclusions of this article are available in the Gene Expression Omnibus repository: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE139179 (RNA-seq) and https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE139680 (microbiome) for the CD dataset, and https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE65270 for the pouchitis dataset and its additional file(s).

  • Abbreviations

    IBD: inflammatory bowel disease
    CD: Crohn’s disease
    RGCCA: regularized generalized canonical correlation analysis
    SRGCCA: sparse regularized generalized canonical correlation analysis
    AVE: average variance explained
    CGH: comparative genomic hybridization
    HSCT: hematopoietic stem cell transplantation
    SESCD: simple endoscopic score for Crohn’s disease
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Posted April 22, 2020.
    Multi-omic modelling of inflammatory bowel disease with regularized canonical correlation analysis
    Lluís Revilla, Aida Mayorgas, Ana Maria Corraliza, Maria C. Masamunt, Amira Metwaly, Dirk Haller, Eva Tristán, Anna Carrasco, Maria Esteve, Julian Panés, Elena Ricart, Juan J. Lozano, Azucena Salas
    medRxiv 2020.04.16.20031492; doi: https://doi.org/10.1101/2020.04.16.20031492

    Subject Area

    • Gastroenterology