Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Unsupervised Machine Learning Unveil Easily Identifiable Subphenotypes of COVID-19 With Differing Disease Trajectories

Jacky Chen, Jocelyn Hsu, Alexandra Szewc, Clotilde Balucini, Tej D. Azad, Kirby Gong, Han Kim, Robert D Stevens
doi: https://doi.org/10.1101/2023.04.07.23288152
Jacky Chen
1Department of Computer Science, Johns Hopkins University
2Department of Biophysics, Johns Hopkins University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jocelyn Hsu
1Department of Computer Science, Johns Hopkins University
3Department of Biomedical Engineering, Johns Hopkins University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Alexandra Szewc
1Department of Computer Science, Johns Hopkins University
3Department of Biomedical Engineering, Johns Hopkins University
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Clotilde Balucini
4Department of Neurology, Neurocritical Care, NYU Langone
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Tej D. Azad
5Department of Neurosurgery, Johns Hopkins
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Kirby Gong
6Department of Anesthesiology & Critical Care Medicine, Johns Hopkins School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Han Kim
6Department of Anesthesiology & Critical Care Medicine, Johns Hopkins School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Robert D Stevens
6Department of Anesthesiology & Critical Care Medicine, Johns Hopkins School of Medicine
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: rstevens{at}jhmi.edu
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Supplementary material
  • Data/Code
  • Preview PDF
Loading

Abstract

Background Given the clinical heterogeneity of COVID-19 infection, we hypothesize the existence of subphenotypes based on early inflammatory responses that are associated with mortality and additional complications.

Methods For this cross-sectional study, we extracted electronic health data from adults hospitalized patients between March 1, 2020 and May 5, 2021, with confirmed primary diagnosis of COVID-19 across five Johns Hopkins Hospitals. We obtained all electronic health records from the first 24h of the patient’s hospitalization. Mortality was the primary endpoint explored while myocardial infarction (MI), pulmonary embolism (PE), deep vein thrombosis (DVT), stroke, delirium, length of stay (LOS), ICU admission and intubation status were secondary outcomes of interest. First, we employed clustering analysis to identify COVID-19 subphenotypes on admission with only biomarker data and assigned each patient to a subphenotype. We then performed Chi-Squared and Mann-Whitney-U tests to examine associations between COVID-19 subphenotype assignment and outcomes. In addition, correlations between subphenotype and pre-existing comorbidities were measured using Chi-Squared analysis.

Results A total of 7076 patients were included. Analysis revealed three distinct subgroups by level of inflammation: hypoinflammatory, intermediate, and hyperinflammatory subphenotypes. More than 25% of patients in the hyperinflammatory subphenotype died compared to less than 3% hypoinflammatory subphenotype (p<0.05). Additional analysis found statistically significant increases in the rate of MI, DVT, PE, stroke, delirium and ICU admission as well as LOS in the hyperinflammatory subphenotype.

Conclusion We identify three distinct inflammatory subphenotypes that predict a range of outcomes, including mortality, MI, DVT, PE, stroke, delirium, ICU admission and LOS. The three subphenotypes are easily identifiable and may aid in clinical decision making.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

There was no funding for this research.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

IRB of Johns Hopkins Medicine gave ethical approval for this work. This work was approved under IRB00281125

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.

Yes

Data Availability

The data used were part of JH-CROWN: The COVID Precision Medicine Analytics Platform Registry.Data and analysis code will be made available upon researcher request as allowable by the terms of the registry and the Johns Hopkins Medicine Institutional Review Board (IRB)

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
Back to top
PreviousNext
Posted April 15, 2023.
Download PDF

Supplementary Material

Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Unsupervised Machine Learning Unveil Easily Identifiable Subphenotypes of COVID-19 With Differing Disease Trajectories
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Unsupervised Machine Learning Unveil Easily Identifiable Subphenotypes of COVID-19 With Differing Disease Trajectories
Jacky Chen, Jocelyn Hsu, Alexandra Szewc, Clotilde Balucini, Tej D. Azad, Kirby Gong, Han Kim, Robert D Stevens
medRxiv 2023.04.07.23288152; doi: https://doi.org/10.1101/2023.04.07.23288152
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Unsupervised Machine Learning Unveil Easily Identifiable Subphenotypes of COVID-19 With Differing Disease Trajectories
Jacky Chen, Jocelyn Hsu, Alexandra Szewc, Clotilde Balucini, Tej D. Azad, Kirby Gong, Han Kim, Robert D Stevens
medRxiv 2023.04.07.23288152; doi: https://doi.org/10.1101/2023.04.07.23288152

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)