Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Identifying COVID-19 phenotypes using cluster analysis and assessing their clinical outcomes

View ORCID ProfileEric Yamga, Louis Mullie, Madeleine Durand, Alexandre Cadrin-Chenevert, View ORCID ProfileAn Tang, Emmanuel Montagnon, Carl Chartrand-Lefebvre, View ORCID ProfileMichaël Chassé
doi: https://doi.org/10.1101/2022.05.27.22275708
Eric Yamga
1Department of Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
MD, MBI
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Eric Yamga
Louis Mullie
1Department of Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
MD, MSC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Madeleine Durand
1Department of Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
2Centre de recherche du Centre Hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada
MD, MSC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandre Cadrin-Chenevert
3Department of Medical Imaging, CISSS Lanaudière, Université Laval, Joliette, Quebec, Canada
MD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
An Tang
2Centre de recherche du Centre Hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada
4Department of Radiology, Radiation Oncology and Nuclear Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
MD, MSC
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for An Tang
Emmanuel Montagnon
2Centre de recherche du Centre Hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada
PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Carl Chartrand-Lefebvre
2Centre de recherche du Centre Hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada
4Department of Radiology, Radiation Oncology and Nuclear Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
MD, MSc
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michaël Chassé
1Department of Medicine, Centre Hospitalier de l’Université de Montréal, Montreal, Quebec, Canada
2Centre de recherche du Centre Hospitalier de l’Université de Montréal (CRCHUM), Montréal, Québec, Canada
MD, PhD
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Michaël Chassé
  • For correspondence: michael.chasse{at}umontreal.ca
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

Multiple clinical phenotypes have been proposed for COVID-19, but few have stemmed from data-driven methods. We aimed to identify distinct phenotypes in patients admitted with COVID-19 using cluster analysis, and compare their respective characteristics and clinical outcomes.

We analyzed the data from 547 patients hospitalized with COVID-19 in a Canadian academic hospital from January 1, 2020, to January 30, 2021. We compared four clustering algorithms: K-means, PAM (partition around medoids), divisive and agglomerative hierarchical clustering. We used imaging data and 34 clinical variables collected within the first 24 hours of admission to train our algorithm. We then conducted survival analysis to compare clinical outcomes across phenotypes and trained a classification and regression tree (CART) to facilitate phenotype interpretation and phenotype assignment.

We identified three clinical phenotypes, with 61 patients (17%) in Cluster 1, 221 patients (40%) in Cluster 2 and 235 (43%) in Cluster 3. Cluster 2 and Cluster 3 were both characterized by a low-risk respiratory and inflammatory profile, but differed in terms of demographics. Compared with Cluster 3, Cluster 2 comprised older patients with more comorbidities. Cluster 1 represented the group with the most severe clinical presentation, as inferred by the highest rate of hypoxemia and the highest radiological burden. Mortality, mechanical ventilation and ICU admission risk were all significantly different across phenotypes.

We conducted a phenotypic analysis of adult inpatients with COVID-19 and identified three distinct phenotypes associated with different clinical outcomes. Further research is needed to determine how to properly incorporate those phenotypes in the management of patients with COVID-19.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

M.C. is supported by a Fonds de Recherche Québec Santé (FRQS) Clinical Research scholarship. M.D. is supported by a FRQS Clinical Research scholarship. A.T. is supported by a FRQS Clinical Research Scholarship and a Fondation de l'Association des Radiologistes du Québec (FARQ) Clinical Research Scholarship. This work was partly funded by a Quebec Bio-Imaging Network research grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Not Applicable

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

The Institutional Review Board of the CHUM (Centre Hospitalier de l’Université de Montréal) approved the study and informed consent was waived because of its low risk and retrospective nature.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Not Applicable

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Not Applicable

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Not Applicable

Data Availability

The entire code excluding the dataset is publicly available on GitHub (https://github.com/CODA-19/models/tree/master/phenotyper) The data that support the findings of this study are available on request from the corresponding author, MC.

  • Abbreviations

    APN
    average proportion of non-overlap
    AD
    average distance
    ADM
    average distance between means
    CART
    classification and regression tree
    CCI
    Charlson Comorbidity Index
    CXR
    chest radiographs
    FAMD
    factor analysis of mixed data
    FOM
    figure of merit
    ICU
    intensive care unit
    MCI
    Medicines Comorbidity Index
    MV
    mechanical ventilation
    NLR
    neutrophil-to-lymphocyte ratio
    PAM
    partition around medoids
    PCR
    polymerase chain reaction
    POLST
    Physician Orders for Life-Sustaining Treatment
    VIA
    variable importance analysis
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY 4.0 International license.
    Back to top
    PreviousNext
    Posted May 29, 2022.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    Identifying COVID-19 phenotypes using cluster analysis and assessing their clinical outcomes
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    Identifying COVID-19 phenotypes using cluster analysis and assessing their clinical outcomes
    Eric Yamga, Louis Mullie, Madeleine Durand, Alexandre Cadrin-Chenevert, An Tang, Emmanuel Montagnon, Carl Chartrand-Lefebvre, Michaël Chassé
    medRxiv 2022.05.27.22275708; doi: https://doi.org/10.1101/2022.05.27.22275708
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    Identifying COVID-19 phenotypes using cluster analysis and assessing their clinical outcomes
    Eric Yamga, Louis Mullie, Madeleine Durand, Alexandre Cadrin-Chenevert, An Tang, Emmanuel Montagnon, Carl Chartrand-Lefebvre, Michaël Chassé
    medRxiv 2022.05.27.22275708; doi: https://doi.org/10.1101/2022.05.27.22275708

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Health Informatics
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)