medRxiv

Data driven phenotyping and COVID-19 case definitions: a pattern recognition approach

George D. Vavougios, Christoforos Konstantatos, Pavlos-Christoforos Sinigalias, Sotirios G. Zarogiannis, Konstantinos Kolomvatsos, George Stamoulis, Konstantinos I. Gourgoulianis
doi: https://doi.org/10.1101/2021.04.30.21256219
George D. Vavougios
1Department of Computer Science and Telecommunications, University of Thessaly, Papasiopoulou 2–4, Galaneika, Lamia 35131, Greece
2Department of Respiratory Medicine, Faculty of Medicine, School of Health Sciences, University of Thessaly, Biopolis, Larissa 41500, Greece
  • For correspondence: dantevavougios{at}hotmail.com gvavougyios{at}uth.gr
Christoforos Konstantatos
3Department of Business Administration, University of Patras, University Campus – Rio, Patras 26504, Greece
Pavlos-Christoforos Sinigalias
Sotirios G. Zarogiannis
2Department of Respiratory Medicine, Faculty of Medicine, School of Health Sciences, University of Thessaly, Biopolis, Larissa 41500, Greece
5Department of Physiology, Faculty of Medicine, School of Health Sciences, University of Thessaly, BIOPOLIS, Larissa 41500, Greece
Konstantinos Kolomvatsos
1Department of Computer Science and Telecommunications, University of Thessaly, Papasiopoulou 2–4, Galaneika, Lamia 35131, Greece
George Stamoulis
1Department of Computer Science and Telecommunications, University of Thessaly, Papasiopoulou 2–4, Galaneika, Lamia 35131, Greece
Konstantinos I. Gourgoulianis
2Department of Respiratory Medicine, Faculty of Medicine, School of Health Sciences, University of Thessaly, Biopolis, Larissa 41500, Greece

Abstract

Introduction COVID-19 has pulmonary as well as several extrapulmonary pathological manifestations, so patients may present with many different symptoms. The aim of our study was to determine COVID-19 syndromic phenotypes in a data-driven manner using survey results from Carnegie Mellon University’s Delphi Group.

Methods Monthly survey results (>1 million responders per month) were used sequentially to identify and validate COVID-19 syndromic phenotypes; 320,326 responders with a positive COVID-19 test and disease duration <30 days were included in this study. Logistic Regression Weighted Multiple Correspondence Analysis (LRW-MCA) was used as a preprocessing step to weight the symptoms recorded by the survey and transform them to eigenspace coordinates (i.e. object scores per case and dimension), with the goal of capturing >75% of total variance. These scores, along with symptom duration, were then used by the Two Step Clustering algorithm to produce symptom clusters. Post-hoc logistic regression models adjusting for age, gender and comorbidities, and confirmatory linear principal components analyses, were used to further explore the data. The model, created from the 66,165 responders included in August, was subsequently validated on data from March to December 2020.
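
The >75% variance target mentioned above can be illustrated with a minimal sketch. It assumes the eigenvalues (inertias) of the retained dimensions are already available from an MCA or PCA run; the function name `dimensions_for_variance` and the example eigenvalues are hypothetical and are not taken from the study, and this is not the authors' own implementation.

```python
from itertools import accumulate


def dimensions_for_variance(eigenvalues, threshold=0.75):
    """Return (k, share): the smallest number of leading dimensions whose
    cumulative share of total variance exceeds `threshold`, and that share."""
    if not eigenvalues or any(ev < 0 for ev in eigenvalues):
        raise ValueError("eigenvalues must be a non-empty list of non-negative numbers")
    total = sum(eigenvalues)
    if total == 0:
        raise ValueError("total variance is zero")
    # Dimensions are conventionally ordered from largest to smallest eigenvalue.
    ordered = sorted(eigenvalues, reverse=True)
    for k, running in enumerate(accumulate(ordered), start=1):
        share = running / total
        if share > threshold:
            return k, share
    # Threshold cannot be exceeded (e.g. threshold >= 1): keep every dimension.
    return len(ordered), 1.0


# Hypothetical eigenvalues for illustration only.
example_eigenvalues = [4.0, 2.0, 1.0, 0.5, 0.5]
k, share = dimensions_for_variance(example_eigenvalues)
print(k, round(share, 3))  # prints: 3 0.875
```

In this example two dimensions explain exactly 75% of the variance, which does not satisfy the strict ">75%" criterion, so a third dimension is retained.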

Results Five validated COVID-19 syndromes were identified in August: 1. Afebrile (0%), Non-Coughing (0%), Oligosymptomatic (ANCOS); 2. Febrile (100%) Multisymptomatic (FMS); 3. Afebrile (0%), Coughing (100%), Oligosymptomatic (ACOS); 4. Oligosymptomatic with additional self-described symptoms (100%; OSDS); and 5. Olfaction / Gustatory Impairment Predominant (100%; OGIP).

Discussion We present 5 distinct symptom phenotypes within the COVID-19 spectrum that remain stable within 9–12 days of first symptom onset. The typical febrile respiratory phenotype represents a minority among the identified syndromes, a finding that may have implications for both epidemiological surveillance norms and transmission dynamics.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

No funding to be reported.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Deidentified data were provided by Carnegie Mellon University to the University of Thessaly via a project-based collaboration between the two institutions, consolidated and outlined by a Research Data Use Agreement. The Research Data Use Agreement was made and entered into as of August 21, 2020 by and between Carnegie Mellon University, a Pennsylvania non-profit corporation, and the Department of Respiratory Medicine, Faculty of Medicine, University of Thessaly, a non-profit organization / university having its principal place of business at Biopolis, P.C. 41500 Larissa, Greece. The Carnegie Mellon University (CMU) Institutional Review Board approved the original survey protocol and instrument (IRB Approval Registration Number: STUDY2020_00000162).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Footnotes

  • Conflict of Interest Statement: None declared.

Data Availability

The analyses and all related files are available upon request.

https://cmu-delphi.github.io/delphi-epidata/symptom-survey/

Copyright 
The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
Posted May 03, 2021.
Subject Area

  • Epidemiology