Skip to main content
medRxiv
  • Home
  • About
  • Submit
  • ALERTS / RSS
Advanced Search

Development & Deployment of a Real-time Healthcare Predictive Analytics Platform

Aaron Boussina, Supreeth Shashikumar, Fatemeh Amrollahi, Hayden Pour, Michael Hogarth, Shamim Nemati
doi: https://doi.org/10.1101/2023.04.10.23288373
Aaron Boussina
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: aboussina{at}health.ucsd.edu
Supreeth Shashikumar
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Fatemeh Amrollahi
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Hayden Pour
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael Hogarth
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Shamim Nemati
1Division of Biomedical Informatics, University of California San Diego, San Diego, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Abstract
  • Full Text
  • Info/History
  • Metrics
  • Data/Code
  • Preview PDF
Loading

Abstract

The deployment of predictive analytic algorithms that can safely and seamlessly integrate into existing healthcare workflows remains a significant challenge. Here, we present a scalable, cloud-based, fault-tolerant platform that is capable of extracting and processing electronic health record (EHR) data for any patient at any time following admission and transferring results back into the EHR. This platform has been successfully deployed within the UC San Diego Health system and utilizes interoperable data standards to enable portability.

Clinical relevance This platform is currently hosting a deep learning model for the early prediction of sepsis that is operational in two emergency departments.

I. INTRODUCTION

Despite the rapid growth in the number of predictive models developed for healthcare applications, there has been a relative dearth of successful implementations into clinical practice [1, 2]. One major reason for this is the large technical barrier to accessing EHR data in real-time and providing timely results back to clinicians [3, 4]. The challenges are multifactorial and include security, interoperability, availability, and scalability. Existing clinical decision support (CDS) solutions tend to be EHR-vendor or hospital specific and have not been generalized to new institutions [5]. Further, customized on-premise solutions to EHR integration are susceptible to system interruptions and are difficult to scale. In this work, we present a secure, high-availability, cloud-based platform that can process EHR data on any patient within a hospital at any point during their admission. We demonstrate how this platform can close the CDS loop and provide realtime recommendations to clinicians natively within the EHR. We further describe how process control tooling is leveraged to ensure system availability and model fidelity. Finally, we showcase the deployment of a deep learning model for the early prediction of sepsis onto this platform and into clinical practice [6].

The deployment of this sepsis model is clinically significant since sepsis (a life-threatening condition arising from the body’s overwhelming response to infection) is a major cause of mortality and morbidity globally [7-10]. The early recognition and treatment of sepsis has been shown to significantly improve outcomes [11-13]. The use of deep learning at the patient bedside can assist with risk stratification for sepsis management and has the potential to improve clinical outcomes.

This work is distinct from prior publications [14-15] in real-time healthcare analytics in the following ways: (1) It describes a platform for real-time predictions across all inpatient settings including Intensive Care Units (ICUs), Emergency Departments (EDs), and wards. (2) It utilizes collection of data elements beyond laboratory results, vital-signs, and demographics information, including problem-list items, procedures, and clinical notes. (3) It closes the CDS loop and enables outputs from predictive models to be written directly to the EHR. (4) It demonstrates the deployment of the platform into clinical practice.

II. METHODS

A. Platform Architecture

Figure 1 shows the high-level architecture for the predictive analytics platform. Data are extracted from the EHR at routine intervals and subsequently preprocessed for model consumption. An Indications-for-Use module assesses the clinical context of the patient including where the patient is in their treatment timeline to determine whether the patient is included/excluded for CDS. The predictive model is then run and outputs are sent back to the EHR to provide clinicians with relevant recommendations. The system is built in a modular plug-and-play manner such that any number of predictive modules can be added.

Fig. 1.
  • Download figure
  • Open in new tab
Fig. 1.

Architecture of the predictive analytics platform.

B. Cloud Implementation

Our predictive analytics platform is hosted within a HIPAA-compliant Amazon Web Services (AWS) environment (Figure 2). The environment is an isolated enclave with communications only permitted through whitelisted ports. The application layer is hosted on a single EC2 instance with data stored in a MySQL Relational Database Service (RDS). The EC2 instance is part of an Auto Scaling Group (ASG) that is connected to a Network Load Balancer (NLB). This configuration enables a new copy of the EC2 instance to be brought online immediately in the event of primary instance failure. Similarly, the RDS is configured for regular backup which enables switchover to the secondary database in the event of failure.

Fig. 2.
  • Download figure
  • Open in new tab
Fig. 2.

AWS implementation of the analytics platform.

For security, all system credentials such as private keys are stored within the AWS Secrets Manager and automatically updated on a routine schedule. For traceability, all system logs are preserved within S3 storage buckets. For portability, the architecture is captured within terraform scripts that automate the AWS build.

C. Data Pipeline

All active admitted patients within the healthcare system are identified from HL7v2 ADT messages [16]. All ADT messages are forwarded from the hospital’s integration engine to MirthConnect software running within the application EC2 instance. The patient contact serial numbers (CSNs) from the PID segment are converted to patient FHIR IDs by calling the “Patient.Search” API from the hospital’s Epic FHIR server. The platform application authenticates its requests to the FHIR server with OAuth 2.0 using a backend private key.

With the patient FHIR IDs retrieved for every admitted patient, the application makes regular calls to the FHIR server to retrieve updates to the Patient, ServiceRequest, Observation, MedicationRequest, Condition, and Procedure resources. The resources are returned as JSON bundles which are then parsed and preserved within the RDS as a condensed JSONB column containing all updated data for a patient within an elapsed timeframe.

The semi-structured data are then passed to the Data Preprocessing Module where they are converted into a structured format with columns for every feature and missing values imputed by a predefined sample-and-hold. The Data Preprocessing Module also enforces that the values are physiologically possible and not the result of inadvertent entry by applying upper and lower limits on the features.

Models are then directly deployed using these database tables as inputs. Model outputs such as the risk score and top contributing features are placed into OBX segments and an outbound HL7v2 message is constructed and sent back to the integration engine. The observation identifier (field OBX.3) is registered within the EHR allowing all model outputs to be filed to the flowsheet. EHR-native decision support, such as Best Practice Advisories (BPAs), then utilize these flowsheet items to generate clinician-facing alerts.

D. Model Deployment

Using this platform, we deployed the COMPOSER deep learning model for the early prediction of sepsis described in [6]. The model was run in silent-mode evaluation over the course of 6 months in which sepsis risk scores were filed to patient flowsheets, but alerts were not displayed to clinicians. During this period, the performance of the model was evaluated and routine chart reviews with a panel of experts were conducted to assess the clinical utility of the silent alerts. These reviews informed the development of the indications for use of this algorithm listed in Table 1.

View this table:
  • View inline
  • View popup
  • Download powerpoint
TABLE I

Indications for Use of the COMPOSER Algorithm

Design sessions with nursing teams were conducted over the span of three months to build the display of the final EHR-native BPA (Figure 3). Following prospective validation of the model performance, training was conducted across two emergency departments within the UC San Diego Health system prior to deployment of the BPA into clinical workflow.

Fig. 3.
  • Download figure
  • Open in new tab
Fig. 3.

Sample display of the COMPOSER BPA.

E. Process Control

To ensure high-availability of the platform, AWS Cloud-Watch alerts were created to notify the development team of any event that brought the system out of a state of control (e.g. if a service was unreachable). These CloudWatch alerts were integrated with PagerDuty to ensure 24/7 front-line support. Further, the platform was registered with healthcare IT within ServiceNow to enable end-users to escalate issues directly to the development team.

In addition to system interruptions, deployed models are at risk of model drift due to changes in the data distribution over time [17-19]. To ensure detection of possible model drift we implemented a quality dashboard that automatically tracks a model’s inputs, outputs, and performance. Specifically, the median values of measurement inputs and risk score outputs are monitored to ensure they don’t pass the upper or lower quartiles from the training cohort. If any value falls outside of those limits, it is flagged for review by the development team. Similarly, model performance metrics such as the positive predictive value (PPV) and sensitivity are tracked on a weekly basis. Finally, the quality dashboard tracks the rate of rejection from conformal prediction [6]. Conformal prediction is a method for detecting out-of-distribution data within the lower dimensional representations of a neural network. Changes in rejection rates, therefore, are expected to correspond to changes in data distribution that affect the model’s predictions.

III. RESULTS

A. Clinical Population

Table 2 shows the patient population processed by the platform from June 1st, 2022 to January 1st, 2023. During this 7-month period, 63,133 patients and 1,368,763 patient-hours were processed across 63 care units.

View this table:
  • View inline
  • View popup
  • Download powerpoint
TABLE II

Demographics and clinical characteristics of patients processed by the real-time analytics platform.

B. Sepsis Model Performance

Figure 4 shows outputs exported from the quality dashboard for the median values of a sample input feature and COMPOSER’s sepsis risk score relative to the training set following clinical deployment on 2022-12-07. Also shown are the model’s PPV and rate of conformal rejection. The dashboard demonstrates that the input feature distributions and model outputs had not drifted substantially post-deployment. Further, the model’s performance in real-time using the analytics platform did not differ significantly from retrospective performance.

Fig. 4.
  • Download figure
  • Open in new tab
Fig. 4.

Median values and 95% confidence intervals of input heart rate feature and output COMPOSER risk score relative to the training dataset. Trends of conformal rejection rate and PPV over time.

C. System Availability

From June 1st, 2022 to January 1st, 2023 the platform experienced a total of 0 hours of system downtime and 28 hours of inter-connectivity interruptions. This corresponds to an overall uptime of 99.44%. The single largest instance of downtime (10 hours) was related to a significant update to the FHIR API protocol which resulted in a difference in version parity with the platform.

IV. CONCLUSIONS

Using cloud architecture and interoperable data standards we have built a production-grade system to enable safe, rapid deployment of predictive analytics into the clinic. We have leveraged best-practices in software engineering and process control to ensure that the platform is sustainable and robust. We have designed the platform together with our clinical collaborators to ensure that model predictions are relevant and clinically actionable. This work aims to address the growing divide between the abundance of new deep learning models and the relative paucity of predictive models in clinical practice. While in this work we have only showcased a single model at a single institution, we have developed the system with portability and scalability in mind. We are currently developing prediction models for other clinical use cases on this platform and targeting additional institutions for deployment.

Data Availability

All data produced in the present work are contained in the manuscript.

ACKNOWLEDGMENT

S.N. is funded by the National Institutes of Health (#R01LM013998, #R01HL157985, #R35GM143121). He is co-founder of a UCSD start-up, Healcisio Inc., which is focused on commercialization of advanced analytical decision support tools. Mr. Boussina is funded by the National Library of Medicine (#2T15LM011271-11). Dr. Shashikumar has no sources of funding to declare. The opinions or assertions contained herein are the private ones of the author and are not to be construed as official or reflecting the views of the NIH or any other agency of the US Government.

References

  1. [1].↵
    Wilkinson J, Arnold KF, Murray EJ, et al. Time to reality check the promises of machine learning-powered precision medicine. Lancet Digit Health 2020;2:e677.–80. doi:10.1016/S2589-7500(20)30200-4
    OpenUrlCrossRef
  2. [2].↵
    van de Sande D, van Genderen ME, Huiskens J, et al. Moving from bytes to bedside: a systematic review on the use of artificial intelligence in the intensive care unit. Intensive Care Med 2021;47:750-760. doi:10.1007/s00134-021-06446-7
    OpenUrlCrossRefPubMed
  3. [3].↵
    Sudat, S. E., Robinson, S. C., Mudiganti, S., Mani, A., & Pressman, A. R. (2021). Mind the clinical-analytic gap: Electronic health records and COVID-19 pandemic response. Journal of Biomedical Informatics, 116, 103715.
    OpenUrlCrossRefPubMed
  4. [4].↵
    J. Norrie, “The challenge of implementing AI models in the ICU,” The Lancet Respiratory Medicine, Vol. 6, no. 12, pp. 886–888, 2018.
  5. [5].↵
    Kanbar LJ, Wissel B, Ni Y, Pajor N, Glauser T, Pestian J, Dexheimer JW Implementation of Machine Learning Pipelines for Clinical Practice: Development and Validation Study JMIR Med Inform 2022;10(12):e37833
    OpenUrl
  6. [6].↵
    Shashikumar, S.P., Wardi, G., Malhotra, A. et al. Artificial intelligence sepsis prediction algorithm learns to say “I don’t know”. npj Digit. Med. 4, 134 (2021). https://doi.org/10.1038/s41746-021-00504-6
    OpenUrl
  7. [7].↵
    Seymour CW, Liu VX, Iwashyna TJ, et al. Assessment of clinical criteria for sepsis: for the third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA. 2016;315(8):762–774. doi:10.1001/jama.2016.0288
    OpenUrlCrossRefPubMed
  8. [8].
    Liu, V et al. Hospital deaths in patients with sepsis from 2 independent cohorts. JAMA 312, 90–92 (2014).
  9. [9].
    Rhee, C et al. Incidence and Trends of Sepsis in US Hospitals Using Clinical vs Claims Data, 2009-2014. JAMA 318, 1241–1249 (2017).
  10. [10].↵
    Singer, M., Deutschman, C. S., Seymour, C. W., Shankar-Hari, M., Annane, D., Bauer, M., … & Angus, D. C. (2016). The third international consensus definitions for sepsis and septic shock (Sepsis-3). Jama, 315(8), 801–810.
    OpenUrlCrossRefPubMed
  11. [11].↵
    Ferrer, R., Martin-Loeches, I., Phillips, G., Osborn, T. M., Townsend, S., Dellinger, R. P., … & Levy, M. M. (2014). Empiric antibiotic treatment reduces mortality in severe sepsis and septic shock from the first hour: results from a guideline-based performance improvement program. Critical care medicine, 42(8), 1749–1755.
    OpenUrlCrossRefPubMedWeb of Science
  12. [12].
    Rhodes, A., Phillips, G., Beale, R., Cecconi, M., Chiche, J. D., De Backer, D., … & Levy, M. (2015). The surviving sepsis campaign bundles and outcome: results from the international multicentre prevalence study on sepsis (the IMPreSS study). Intensive care medicine, 41(9), 1620–1628.
    OpenUrlCrossRefPubMed
  13. [13].↵
    Sterling, S. A., Miller, W. R., Pryor, J., Puskarich, M. A., & Jones, A. E. (2015). The impact of timing of antibiotics on outcomes in severe sepsis and septic shock: a systematic review and meta-analysis. Critical care medicine, 43(9), 1907.
    OpenUrlCrossRefPubMed
  14. [14].↵
    Henry, J. R., Lynch, D., Mals, J., Shashikumar, S. P., Holder, A., Sharma, A., & Nemati, S. (2018, July). A FHIR-enabled streaming sepsis prediction system for ICUs. In 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp. 4093-4096). IEEE.
  15. [15].↵
    Amrollahi, F., Shashikumar, S. P., Kathiravelu, P., Sharma, A., & Nemati, S. (2020, July). AIDEx-an open-source platform for realtime forecasting sepsis and a case study on taking ML algorithms to production. In 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) (pp. 5610-5614). IEEE.
  16. [16].↵
    Benson, T., & Grieve, G. (2016). Hl7 version 2. In Principles of health interoperability (pp. 223–242). Springer, Cham.
  17. [17].↵
    Davis, S. E., Greevy Jr, R. A., Lasko, T. A., Walsh, C. G., & Matheny, M. E. (2020). Detection of calibration drift in clinical prediction models to inform model updating. Journal of biomedical informatics, 112, 103611.
    OpenUrlCrossRef
  18. [18].
    Davis, S. E., Greevy Jr, R. A., Fonnesbeck, C., Lasko, T. A., Walsh, C. G., & Matheny, M. E. (2019). A nonparametric updating method to correct clinical prediction model drift. Journal of the American Medical Informatics Association, 26(12), 1448–1457.
    OpenUrlPubMed
  19. [19].↵
    Liu, A., Lu, J., & Zhang, G. (2020). Concept drift detection via equal intensity k-means space partitioning. IEEE transactions on cybernetics, 51(6), 3198–3211.
    OpenUrl
Back to top
PreviousNext
Posted April 11, 2023.
Download PDF
Data/Code
Email

Thank you for your interest in spreading the word about medRxiv.

NOTE: Your email address is requested solely to identify you as the sender of this article.

Enter multiple addresses on separate lines or separate them with commas.
Development & Deployment of a Real-time Healthcare Predictive Analytics Platform
(Your Name) has forwarded a page to you from medRxiv
(Your Name) thought you would like to see this page from the medRxiv website.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Development & Deployment of a Real-time Healthcare Predictive Analytics Platform
Aaron Boussina, Supreeth Shashikumar, Fatemeh Amrollahi, Hayden Pour, Michael Hogarth, Shamim Nemati
medRxiv 2023.04.10.23288373; doi: https://doi.org/10.1101/2023.04.10.23288373
Twitter logo Facebook logo LinkedIn logo Mendeley logo
Citation Tools
Development & Deployment of a Real-time Healthcare Predictive Analytics Platform
Aaron Boussina, Supreeth Shashikumar, Fatemeh Amrollahi, Hayden Pour, Michael Hogarth, Shamim Nemati
medRxiv 2023.04.10.23288373; doi: https://doi.org/10.1101/2023.04.10.23288373

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Subject Area

  • Health Informatics
Subject Areas
All Articles
  • Addiction Medicine (349)
  • Allergy and Immunology (668)
  • Allergy and Immunology (668)
  • Anesthesia (181)
  • Cardiovascular Medicine (2648)
  • Dentistry and Oral Medicine (316)
  • Dermatology (223)
  • Emergency Medicine (399)
  • Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
  • Epidemiology (12228)
  • Forensic Medicine (10)
  • Gastroenterology (759)
  • Genetic and Genomic Medicine (4103)
  • Geriatric Medicine (387)
  • Health Economics (680)
  • Health Informatics (2657)
  • Health Policy (1005)
  • Health Systems and Quality Improvement (985)
  • Hematology (363)
  • HIV/AIDS (851)
  • Infectious Diseases (except HIV/AIDS) (13695)
  • Intensive Care and Critical Care Medicine (797)
  • Medical Education (399)
  • Medical Ethics (109)
  • Nephrology (436)
  • Neurology (3882)
  • Nursing (209)
  • Nutrition (577)
  • Obstetrics and Gynecology (739)
  • Occupational and Environmental Health (695)
  • Oncology (2030)
  • Ophthalmology (585)
  • Orthopedics (240)
  • Otolaryngology (306)
  • Pain Medicine (250)
  • Palliative Medicine (75)
  • Pathology (473)
  • Pediatrics (1115)
  • Pharmacology and Therapeutics (466)
  • Primary Care Research (452)
  • Psychiatry and Clinical Psychology (3432)
  • Public and Global Health (6527)
  • Radiology and Imaging (1403)
  • Rehabilitation Medicine and Physical Therapy (814)
  • Respiratory Medicine (871)
  • Rheumatology (409)
  • Sexual and Reproductive Health (410)
  • Sports Medicine (342)
  • Surgery (448)
  • Toxicology (53)
  • Transplantation (185)
  • Urology (165)