Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

High-Dimensional Multinomial Multiclass Severity Scoring of COVID-19 Pneumonia Using CT Radiomics Features and Machine Learning Algorithms

View ORCID ProfileIsaac Shiri, Shayan Mostafaei, Atlas Haddadi Avval, Yazdan Salimi, Amirhossein Sanaat, Azadeh Akhavanallaf, Hossein Arabi, View ORCID ProfileArman Rahmim, View ORCID ProfileHabib Zaidi
doi: https://doi.org/10.1101/2022.04.27.22274369
Isaac Shiri
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Isaac Shiri
Shayan Mostafaei
2Division of Clinical Geriatrics, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Stockholm, Sweden
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Atlas Haddadi Avval
3School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Yazdan Salimi
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Amirhossein Sanaat
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Azadeh Akhavanallaf
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hossein Arabi
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Arman Rahmim
4Departments of Radiology and Physics, University of British Columbia, Vancouver BC, Canada
5Department of Integrative Oncology, BC Cancer Research Institute, Vancouver BC, Canada
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Arman Rahmim
Habib Zaidi
1Division of Nuclear Medicine and Molecular Imaging, Geneva University Hospital, CH-1211 Geneva, Switzerland
6Geneva University Neurocenter, Geneva University, Geneva, Switzerland
7Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
8Department of Nuclear Medicine, University of Southern Denmark, Odense, Denmark
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Habib Zaidi
  • For correspondence: habib.zaidi{at}hcuge.ch
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

We aimed to construct a prediction model based on computed tomography (CT) radiomics features to classify COVID-19 patients into severe-, moderate-, mild-, and non-pneumonic. A total of 1110 patients were studied from a publicly available dataset with 4-class severity scoring performed by a radiologist (based on CT images and clinical features). CT scans were preprocessed with bin discretization and resized, followed by segmentation of the entire lung and extraction of radiomics features. We utilized two feature selection algorithms, namely Bagging Random Forest (BRF) and Multivariate Adaptive Regression Splines (MARS), each coupled to a classifier, namely multinomial logistic regression (MLR), to construct multiclass classification models. Subsequently, 10-fold cross-validation with bootstrapping (n=1000) was performed to validate the classification results. The performance of multi-class models was assessed using precision, recall, F1-score, and accuracy based on the 4×4 confusion matrices. In addition, the areas under the receiver operating characteristic (ROC) curve (AUCs) for multi-class classifications were calculated and compared for both models using “multiROC” and “pROC” R packages. Using BRF, 19 radiomics features were selected, 9 from first-order, 6 from GLCM, 1 from GLDM, 1 from shape, 1 from NGTDM, and 1 from GLSZM radiomics features. Ten features were selected using the MARS algorithm, namely 2 from first-order, 1 from GLDM, 2 from GLRLM, 2 from GLSZM, and 3 from GLCM features. The Mean Absolute Deviation and Median from first-order, Small Area Emphasis from GLSZM, and Correlation from GLCM features were selected by both BRF and MARS algorithms. Except for the Inverse Variance feature from GLCM, all selected features by BRF or MARS were significantly associated with four-class outcomes as assessed within MLR (All p-values<0.05). BRF+MLR and MARS+MLR resulted in pseudo-R2 prediction performances of 0.295 and 0.256, respectively. Meanwhile, there were no significant differences between the feature selection models when using a likelihood ratio test (p-value =0.319). Based on confusion matrices for BRF+MLR and MARS+MLR algorithms, the precision was 0.861 and 0.825, the recall was 0.844 and 0.793, whereas the accuracy was 0.933 and 0.922, respectively. AUCs (95% CI)) for multi-class classification were 0.823 (0.795-0.852) and 0.816 (0.788-0.844) for BRF+MLR and MARS+MLR algorithms, respectively. Our models based on the utilization of radiomics features, coupled with machine learning, were able to accurately classify patients according to the severity of pneumonia, thus highlighting the potential of this emerging paradigm in the prognostication and management of COVID-19 patients.

Competing Interest Statement

The authors have declared no competing interest.

Funding Statement

This work was supported by the Swiss National Science Foundation under grant SNRF 320030_176052.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes

Data Availability

All data produced are available online at

https://mosmed.ai/datasets/covid19_1110

  • Abbreviations

    CT
    Computed Tomography
    COVID-19
    Coronavirus disease 2019
    AUC
    Area under the receiver operating characteristic curve
    BRF
    Bagging Random Forest
    FS
    Feature Selection
    GGO
    Ground Glass Opacity
    IBSI
    The Image Biomarker Standardization Initiative
    MARS
    Multivariate Adaptive Regression Splines
    MLR
    Multinomial Logistic Regression
    RT-PCR
    Reverse transcription polymerase chain reaction
    GLCM
    Gray-Level Co-Occurrence Matrix
    GLSZM
    Gray-Level Size-Zone Matrix
    NGTDM
    Neighbouring Gray Tone Difference Matrix
    GLRLM
    Gray-Level Run-Length Matrix
    GLDM
    Gray-Level Dependence Matrix
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.
    Back to top
    PreviousNext
    Posted April 28, 2022.
    Download PDF
    Data/Code
    Email

    Thank you for your interest in spreading the word about medRxiv.

    NOTE: Your email address is requested solely to identify you as the sender of this article.

    Enter multiple addresses on separate lines or separate them with commas.
    High-Dimensional Multinomial Multiclass Severity Scoring of COVID-19 Pneumonia Using CT Radiomics Features and Machine Learning Algorithms
    (Your Name) has forwarded a page to you from medRxiv
    (Your Name) thought you would like to see this page from the medRxiv website.
    CAPTCHA
    This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
    Share
    High-Dimensional Multinomial Multiclass Severity Scoring of COVID-19 Pneumonia Using CT Radiomics Features and Machine Learning Algorithms
    Isaac Shiri, Shayan Mostafaei, Atlas Haddadi Avval, Yazdan Salimi, Amirhossein Sanaat, Azadeh Akhavanallaf, Hossein Arabi, Arman Rahmim, Habib Zaidi
    medRxiv 2022.04.27.22274369; doi: https://doi.org/10.1101/2022.04.27.22274369
    Twitter logo Facebook logo LinkedIn logo Mendeley logo
    Citation Tools
    High-Dimensional Multinomial Multiclass Severity Scoring of COVID-19 Pneumonia Using CT Radiomics Features and Machine Learning Algorithms
    Isaac Shiri, Shayan Mostafaei, Atlas Haddadi Avval, Yazdan Salimi, Amirhossein Sanaat, Azadeh Akhavanallaf, Hossein Arabi, Arman Rahmim, Habib Zaidi
    medRxiv 2022.04.27.22274369; doi: https://doi.org/10.1101/2022.04.27.22274369

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
    • Tweet Widget
    • Facebook Like
    • Google Plus One

    Subject Area

    • Radiology and Imaging
    Subject Areas
    All Articles
    • Addiction Medicine (349)
    • Allergy and Immunology (668)
    • Allergy and Immunology (668)
    • Anesthesia (181)
    • Cardiovascular Medicine (2648)
    • Dentistry and Oral Medicine (316)
    • Dermatology (223)
    • Emergency Medicine (399)
    • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
    • Epidemiology (12228)
    • Forensic Medicine (10)
    • Gastroenterology (759)
    • Genetic and Genomic Medicine (4103)
    • Geriatric Medicine (387)
    • Health Economics (680)
    • Health Informatics (2657)
    • Health Policy (1005)
    • Health Systems and Quality Improvement (985)
    • Hematology (363)
    • HIV/AIDS (851)
    • Infectious Diseases (except HIV/AIDS) (13695)
    • Intensive Care and Critical Care Medicine (797)
    • Medical Education (399)
    • Medical Ethics (109)
    • Nephrology (436)
    • Neurology (3882)
    • Nursing (209)
    • Nutrition (577)
    • Obstetrics and Gynecology (739)
    • Occupational and Environmental Health (695)
    • Oncology (2030)
    • Ophthalmology (585)
    • Orthopedics (240)
    • Otolaryngology (306)
    • Pain Medicine (250)
    • Palliative Medicine (75)
    • Pathology (473)
    • Pediatrics (1115)
    • Pharmacology and Therapeutics (466)
    • Primary Care Research (452)
    • Psychiatry and Clinical Psychology (3432)
    • Public and Global Health (6527)
    • Radiology and Imaging (1403)
    • Rehabilitation Medicine and Physical Therapy (814)
    • Respiratory Medicine (871)
    • Rheumatology (409)
    • Sexual and Reproductive Health (410)
    • Sports Medicine (342)
    • Surgery (448)
    • Toxicology (53)
    • Transplantation (185)
    • Urology (165)