medRxiv

Imputation of PaO2 from SpO2 values from the MIMIC-III Critical Care Database Using Machine-Learning Based Algorithms

Shuangxia Ren, Jill Zupetic, Mehdi Nouraie, Xinghua Lu, Richard D. Boyce, Janet S. Lee
doi: https://doi.org/10.1101/2021.04.21.21255877
Shuangxia Ren, PhD
  1 Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
Jill Zupetic, MD
  2 Division of Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA
  3 Acute Lung Injury Center of Excellence, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Mehdi Nouraie, MD PhD
  2 Division of Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA
  3 Acute Lung Injury Center of Excellence, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Xinghua Lu, PhD
  1 Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
  4 Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
Richard D. Boyce, PhD
  1 Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, USA
  4 Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA, USA
Janet S. Lee, MD
  2 Division of Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh, Pittsburgh, PA, USA
  3 Acute Lung Injury Center of Excellence, Department of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
  • For correspondence: leejs3{at}upmc.edu

Abstract

Background The ratio of the partial pressure of oxygen (PaO2) to the fraction of oxygen delivered (FIO2) is the reference standard for assessing hypoxemia in mechanically ventilated patients. Non-invasive monitoring of the peripheral saturation of oxygen (SpO2) is increasingly used to estimate PaO2 because it does not require invasive sampling. Several equations have been reported to impute PaO2/FIO2 from SpO2/FIO2. However, machine-learning algorithms that impute PaO2 from SpO2 have not been compared to published equations.
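To make the idea of an imputation equation concrete, here is a minimal Python sketch of one widely cited non-linear relationship: the Severinghaus oxyhemoglobin dissociation equation together with Ellis's closed-form inversion. The abstract does not say which published equations were compared, so treating this as one of them is an assumption, and the function names are hypothetical.

```python
import math


def severinghaus_so2(pao2_mmhg: float) -> float:
    """Severinghaus equation: fractional O2 saturation for a PaO2 in mmHg."""
    return 1.0 / (23400.0 / (pao2_mmhg ** 3 + 150.0 * pao2_mmhg) + 1.0)


def ellis_pao2(spo2_fraction: float) -> float:
    """Ellis inversion of the Severinghaus equation: PaO2 (mmHg) from saturation.

    Only defined for saturations strictly between 0 and 1; at 100% the
    dissociation curve has no finite inverse.
    """
    if not 0.0 < spo2_fraction < 1.0:
        raise ValueError("saturation must be strictly between 0 and 1")
    a = 11700.0 / (1.0 / spo2_fraction - 1.0)
    b = math.sqrt(50.0 ** 3 + a * a)
    # (b - a) equals 50^3 / (b + a); this form avoids subtracting two
    # nearly equal floating-point numbers when the saturation is high.
    return (b + a) ** (1.0 / 3.0) - (50.0 ** 3 / (b + a)) ** (1.0 / 3.0)


def imputed_pf_ratio(spo2_percent: float, fio2_fraction: float) -> float:
    """Imputed PaO2/FIO2 from SpO2 (percent) and FIO2 (fraction, 0.21 to 1.0)."""
    return ellis_pao2(spo2_percent / 100.0) / fio2_fraction


if __name__ == "__main__":
    for spo2 in (88.0, 92.0, 97.0):
        print(spo2, round(ellis_pao2(spo2 / 100.0), 1))
```

Because the dissociation curve flattens near full saturation, small changes in SpO2 at the upper end map to large changes in the imputed PaO2, which is why any equation of this kind becomes less reliable as SpO2 approaches 100%.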

Research Question How do machine learning algorithms perform at predicting the PaO2 from SpO2 compared to previously published equations?

Methods Three machine learning algorithms (neural network, regression, and kernel-based methods) were developed with data from mechanically ventilated patients in the Medical Information Mart for Intensive Care (MIMIC) III database, first using 7 clinical features as model inputs (n=9,900 ICU events) and subsequently 3 features (n=20,198 ICU events). As a regression task, the machine learning models were used to impute PaO2 values. As a classification task, the models were used to predict patients with moderate-to-severe hypoxemic respiratory failure based on a clinically relevant cut-off of PaO2/FIO2 ≤ 150. The accuracy of the machine learning models was compared to published log-linear and non-linear equations. An online imputation calculator was created.
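The two evaluation tasks described in the Methods can be sketched in a few lines of Python. This is an illustration only: the abstract does not name the error metric, so mean absolute error is an assumption here, the function names are hypothetical, and the numbers in the usage lines are made-up toy values, not study data. Only the PaO2/FIO2 ≤ 150 cut-off comes from the abstract.

```python
PF_CUTOFF = 150.0  # cut-off for moderate-to-severe hypoxemia, from the abstract


def pf_ratio(pao2_mmhg: float, fio2_fraction: float) -> float:
    """PaO2/FIO2 ratio, with FIO2 given as a fraction between 0.21 and 1.0."""
    if not 0.21 <= fio2_fraction <= 1.0:
        raise ValueError("FIO2 must be a fraction between 0.21 and 1.0")
    return pao2_mmhg / fio2_fraction


def is_moderate_to_severe(pao2_mmhg: float, fio2_fraction: float) -> bool:
    """Classification label: True when PaO2/FIO2 is at or below the cut-off."""
    return pf_ratio(pao2_mmhg, fio2_fraction) <= PF_CUTOFF


def evaluate_imputation(measured_pao2, imputed_pao2, fio2):
    """Return (mean absolute error in mmHg, classification accuracy).

    Mean absolute error is an assumed metric chosen for illustration.
    """
    if not (len(measured_pao2) == len(imputed_pao2) == len(fio2)):
        raise ValueError("all inputs must have the same length")
    if not measured_pao2:
        raise ValueError("inputs must not be empty")
    n = len(measured_pao2)
    # Regression task: how far the imputed PaO2 is from the measured PaO2.
    mae = sum(abs(m - i) for m, i in zip(measured_pao2, imputed_pao2)) / n
    # Classification task: agreement of labels derived from measured
    # and imputed PaO2 at the same FIO2.
    agree = sum(
        is_moderate_to_severe(m, f) == is_moderate_to_severe(i, f)
        for m, i, f in zip(measured_pao2, imputed_pao2, fio2)
    )
    return mae, agree / n


if __name__ == "__main__":
    # Toy values for illustration only.
    measured = [60.0, 75.0, 120.0, 90.0]
    imputed = [66.0, 70.0, 110.0, 110.0]
    fio2 = [0.5, 0.4, 0.6, 0.7]
    print(evaluate_imputation(measured, imputed, fio2))
```

The same `evaluate_imputation` call could be applied to the output of any imputation method, whether a trained model or a published equation, so that both are scored on identical events.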

Results Compared to seven features, three features (SpO2, FIO2 and PEEP) were sufficient to impute the PaO2/FIO2 ratio using a large dataset. Each of the tested machine learning models imputed PaO2/FIO2 from SpO2/FIO2 with lower error and predicted PaO2/FIO2 ≤ 150 with greater accuracy than the published equations. Using three features, the machine learning models showed superior performance in imputing PaO2 across the entire span of SpO2 values, including those ≥ 97%.

Interpretation The improved performance shown for the machine learning algorithms suggests a promising framework for future use in large datasets.

Competing Interest Statement

J.S. Lee discloses a paid consultantship with Janssen Pharmaceuticals, Inc. unrelated to this study. The authors have no other relevant conflicts of interest to disclose.

Funding Statement

This work was supported by the National Heart, Lung, and Blood Institute of the National Institutes of Health under Award Numbers F32 HL152504 (J.Z.); P01 HL114453, R01 HL136143, R01 HL142084, K24 HL143285 (J.S.L.), and R01 LM012011 (X.L. and S.R.). The University of Pittsburgh holds a Physician-Scientist Institutional Award from the Burroughs Wellcome Fund (J.Z.). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or any other sponsoring agency.

Author Declarations

I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.

Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below:

Our study was determined by the University of Pittsburgh Institutional Review Board to be exempt (STUDY19100068).

All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.

Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).

Yes

I have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.

Yes


Data Availability

The data utilized in this manuscript was obtained from the MIMIC-III database v1.4. The MIMIC-III database is an openly available dataset developed by the Massachusetts Institute of Technology Lab for Computational Physiology that contains de-identified health data associated with approximately 60,000 intensive care unit admissions.

https://mimic.physionet.org

  • Abbreviations List

    PaO2: partial pressure of oxygen
    FIO2: fraction of oxygen
    PaO2/FIO2: PF ratio
    SpO2: peripheral saturation of oxygen
    PEEP: positive end expiratory pressure
    TV: tidal volume
    MAP: mean arterial pressure
  • Copyright 
    The copyright holder for this preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. All rights reserved. No reuse allowed without permission.
    Posted April 25, 2021.
    Subject Area

    • Respiratory Medicine