Forecasting Acute Kidney Injury and Resource Utilization in ICU patients using longitudinal, multimodal models

Yukun Tan; Merve Dede; Vakul Mohanty; Jinzhuang Dou; Holly Hill; Elmer Bernstam; Ken Chen

doi:10.1101/2024.03.14.24304230

Abstract

Background Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive.

Objective This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database.

Methods We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT.

Results Our multimodal model achieved a lead time of at least 12 hours ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT.

Conclusion Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.

1. Introduction

AKI, characterized by a substantial elevation in serum creatinine (SCr) or a pronounced decline in urine output¹, represents a severe clinical complication marked by a sudden decline in kidney function. AKI is a profoundly serious condition and needs to be intercepted immediately to prevent lasting kidney damage. This condition is linked to elevated hospital costs and extended length of stay^2–4, with a prevalence exceeding 50% among intensive care units (ICU) patients and accompanied by an alarming mortality rate, up to 50% within the ICU settings^5,6. The International Society of Nephrology⁷ emphasizes the significance of early identification of individuals at increased risk of developing AKI for potentially better outcomes. Early detection allows for therapeutic intervention before the onset of complications, such as anuria, and associated issues like acidosis, hyperkalemia, volume overload, as well as long-term complications such as lung injury, sepsis, and chronic kidney disease.^8–11. Beyond the immediate health implications, AKI significantly impacts healthcare resource allocation, particularly in the context of dialysis. Among available dialysis techniques, Continuous Renal Replacement Therapy (CRRT) is the most commonly used method for hemodynamically unstable patients, accounting for up to 75% of instances in ICUs¹². However, the decision to initiate CRRT is influenced by multiple factors. While there are guidelines and recommendations, there is not standardized criteria. Instead, the decision often depends on specific clinical context and individual patient conditions, highlighting the complexity of managing AKI-induced resource utilization and underscoring the need for a comprehensive approach to enhance clinical decision-making and resource allocation strategies.

The widespread adoption of EHRs and the rapid advancements in computing offer new avenues for disease prediction and prevention. These advancements have been particularly transformative in developing integrated clinical decision support systems^13,14. The volume of routine data collection during hospital stays, considering its high temporal resolution, often exceeds human cognitive processing capabilities¹⁵. In response to this data complexity, Artificial Intelligence (AI) emerges as a promising solution to effectively process such large-scale, high-dimensional datasets. This is particularly relevant in the context of AKI and CRRT prediction. AKI, characterized by its well-defined temporal progression and CRRT, are especially suited for EHR-based predictive modeling¹⁶, utilizing techniques such as logistic regression (LR), Extreme Gradient Boosting (XGBoost), Transformer and Long Short-Term Memory (LSTM). Meanwhile, in the realm of clinical records, researchers have actively sought to harness the capabilities of natural language processing (NLP) for medical prediction tasks. This pursuit has led to the exploration of various methodologies, including a convolutional document embedding method grounded in the unstructured textual content of clinical records¹⁷, an open set of biomedical word vectors/embeddings that integrates domain-specific biomedical knowledge^18,19, and the latest Bidirectional Encoder Representations from Transformers (BERT) model²⁰ which addresses the limitation of representing each word with only one vector²¹. Although several AKI or CRRT prediction models have been reported recently, notable limitations exist. First, these models have often been derived and validated within specific patient cohorts, such as those undergoing cardiac surgery or catheterization^22–25, children²⁶, elderly adults²⁷, or the critically ill²⁸. This specificity may limit their generalizability to broader patient populations. Second, a common short fall in these models is their reliance on static features or single-point measurements^29–32. This approach does not fully capture the dynamic and rapidly evolving nature of patients’ health status, especially in the context of AKI development as well as in postoperative settings where physiological parameters can change quickly and unpredictably. The need for models that adapt to these rapid changes in a patient’s condition is crucial for accurate AKI and CRRT prediction. Third, many studies do not account for non-zero gap times in their predictive frameworks, leading to the risk of temporal data leakage of label information during model training^24,33,34. For instance, utilizing fixed 6-hour input windows may inadvertently include data points just before the AKI diagnosis, potentially skewing the prediction by using information already known to the care team. Additionally, most existing models are unimodal, focusing either solely on laboratory measurements or clinical notes, thus lacking the comprehensive view provided by a multimodal approach. This limitation is particularly critical in scenarios, where data availability is constrained or highly imbalanced. Finally, it is noteworthy that a considerable portion of existing literature tends to limit its predictive analyses by relying on basic measurements, thus failing to optimize the full potential of the available datasets.

To address these gaps, we have developed a multi-modal model that integrates the embeddings of LSTM and BioMedBERT. This advanced model leverages a blend of both unstructured clinical notes and structured time series measurements, incorporating over 50 features enhanced by medical insights derived from EHRs. It enables the early prediction of both AKI and the anticipation of the need for CRRT by providing a minimum of 12-hour lead time before clinical manifestation. Recognizing the clinical need for actionable insights, our model is designed not just to predict AKI earlier and with greater accuracy but also to influence ICU interventions. By identifying patients at high risk for AKI/CRRT, our model can potentially inform clinical decisions to minimize renal risk, such as the judicious use and dosage of nephrotoxic drugs, careful consideration of imaging procedures involving ionizing contrast dye, avoiding hypotension, and meticulous management of hydration levels. Understanding a patient’s heightened risk of renal failure recalibrates the risk/benefit ratio of common ICU interventions, allowing clinicians to consider treatment strategies that reduce the likelihood of AKI and the need for CRRT, thereby potentially improving patient outcomes and resource utilization.

View this table:

2. Materials and methods

2.1 Dataset description

The dataset utilized in this study was sourced from the MIMIC-IV database, spanning a period from 2008 to 2009 and including intensive care unit (ICU) patient stays at Beth Israel Deaconess Medical Center. The database comprises a comprehensive array of patient-related information, including demographics, laboratory test results, procedures, medications, clinical notes, imaging reports and mortality data, encompassing post-hospital discharge outcomes (Figure 1(a)). Within the MIMIC-IV database, patients are denoted as Subject_IDs, with each patient having one or more hospital admissions named Hadm_IDs. Within a single admission, a patient may undergo one or more ICU stays, referred to as Icustay_IDs. A clinical event is defined as an individual measurement, observation, or treatment.

Figure 1.

(a) MIMIC-IV built by merging the various data sources (the BIDMC data warehouse, the ICU information system, and external sources) into a single schema with five modules. (b) Comparative analysis of three model types (XGBoost, Logistic Regression, LSTM) for processing 6-hour time series lab measurement data.

2.2 Methodological framework

In this study, we adopt the classification system proposed by the Acute Kidney Injury Working Group of the Kidney Disease: Improving Global Outcomes (KDIGO) Foundation to standardize the definition of AKI. As per the KDIGO criteria, AKI is categorized into three stages^1,35,36, with Stage 1 indicating the initial phase of AKI, often asymptomatic. Stages 2 and 3 signify increasing severity of renal dysfunction. Our research focuses on patients diagnosed with Stage 2 and Stage 3 AKI, given their clinical significance and the progressive nature of the condition at these stages. Furthermore, we extend our analysis to predict the need for CRRT.

The predictive model we propose is specifically designed to analyze time series laboratory measurements collected in the first 6 hours post-admission. Our goal is to forecast the probability of a patient being diagnosed with AKI or requiring CRRT within a 12-hour period after this initial 6-hour data collection window. This design ensures that our predictions apply to patients diagnosed at least 18 hours after admission, thereby reducing the risk of predicting AKI in individuals who might have been diagnosed prior to hospitalization.

In terms of clinical notes, our model selectively incorporates notes recorded at the time of patient admission. This includes information of the chief complaint, history of present illness, past medical history, initial physical examination findings, and initial clinical impressions. Importantly, we exclude discharge notes to focus on data relevant to the initial phase of hospitalization, therefore enhancing the model’s precision and clinical utility.

2.3 Time series embedding model

We designed the Time Series Embedding Model to acquire the temporal representation of patients utilizing 50 distinct features selected based on medical knowledge. Our investigation delved into three prominent classification methodologies: logistic regression (LR), Extreme Gradient Boosting (XGBoost), and Long Short-Term Memory (LSTM). These models are widely acknowledged and extensively employed in the biomedical domain, as depicted visually in Figure 1(b). While Transformer-based architectures^37–39, have gained significant traction in recent years, their substantial data requirements, e.g., included several million samples in³⁸, make them less suitable for our investigation.

2.3.1 Logistic regression (LR)

LR is characterized as a linear classification model with limited complexity and moderate interpretability. Due to its inherent limitation in handling temporal data, we concatenated 6 one-hour measurements from patients into a single vector. In this model, we used a more elaborate version of the hand-engineered features^40,41, incorporating six distinct sample statistic features for each variable within a given time series. These features encompass the minimum, maximum, mean, standard deviation, skew, and the number of measurements within each subsequence.

2.3.2 Extreme Gradient Boosting (XGBoost)

XGBoost is a state-of-the-art machine learning algorithm utilizing gradient-boosting trees for predictive modeling⁴². Its notable advantages include high predictive accuracy, automatic modeling of non-linearities and high-order interactions, as well as robustness to multicollinearity. Similar to LR, we utilized XGBoost to process temporal data by concatenating 6-hour measurements into a single vector.

2.3.3 Long Short-Term Memory (LSTM)

LSTM models⁴³, a popular variant of recurrent neural networks (RNNs), exhibit the capacity to process sequences of arbitrary lengths in a non-linear and high-capacity manner. This enables the learning of sequential relations and long-term temporal dependencies within time series data. The crucial role in achieving this function lies in the gate and state of the network, with the forget gate determines whether to delete the previous state. The final hidden state of the LSTM serves as the time series embedding, utilized for subsequent predictions.

It is noteworthy that both LR and XGBoost generate predictions based on a flattened vector of the 6-hour time series data, while LSTM can directly process the time series input to make binary predictions. Our results have shown that the LSTM model outperforms the other two models — LR and XGBoost (Table 1). Consequently, we have made the informed decision to incorporate the LSTM model as a crucial component of our multimodal model.

View this table:

Table 1. LSTM, LR, and XGBoost performance comparison for (a) AKI (b) prediction.

2.4 Clinical notes embedding model

Our study delves into the exploration of two established BERT-based models, namely BioMedBERT and ClinicalBERT, specifically tailored to enhance natural language processing within the biomedical and clinical literature domains. We enhance this capability through the fine-tuning process on specific tasks, thereby constructing a specialized BERT-based AKI/CRRT prediction model.

2.4.1 BioMedBERT⁴⁴

BioMedBERT, previously named PubMedBERT, specially pretrained from scratch using abstracts from PubMed and full-text articles from PubMed Central, excels in capturing the nuances of medical terminology, scientific jargon, and the vast array of topics within the biomedical field. It has demonstrated remarkable performance in tasks, such as document classification and information retrieval within the biomedical domain.

2.4.2 ClinicalBERT⁴⁵

ClinicalBERT focuses on clinical notes/EHRs, and other healthcare-related documents. By training a BERT model on clinical text, ClinicalBERT has been optimized to understand the specific language nuances present in patient records, medical reports, and uncovers relationships between medical concepts that match physician judgment.

2.4.3 Probability calculation of solely BERT-based model

The computation of predictions for patients with an extensive volume of clinical notes entails a binning process, wherein predictions are aggregated for each subsequence. In the context of a patient whose notes are segmented into n subsequences, the BERT-based model generates a probability estimate for each respective subsequence. As per the methodology established by Huang⁴⁵, the probability of AKI and CRRT for a given patient is expressed as: where c is the scaling factor to control over the influence of the number of subsequences n, and h_patient denotes the implicit representation encompassing all notes associated with a specific patient. The probabilities and represent the maximum and mean probabilities of AKI/CRRT over n subsequences.

The underlying concept posits that certain subsequences may lack informative content regarding AKI/CRRT, while others are relevant. Therefore, computation of AKI/CRRT risk is selectively performed using subsequences that exhibit correlation with the target outcome, thereby minimizing the impact of less informative subsequences. However, it is worth noting that maximum probability may be caused by noise. In this context, we include the mean probability to have a trade-off between the mean and maximum probabilities. Finally, to adjust for patients with varying quantities of clinical notes, we introduce a scaling factor n/c, providing a larger weight for the mean prediction in cases where patients possess a higher volume of notes. Notably, empirical findings by Huang⁴⁵ indicate that c=2 yields optimal performance based on validation data.

The comparison results from the experiment section reveal that BioMedBERT has a superior performance compared to ClinicalBERT. Notably, one of the key differentiators lies in the foundational pretraining approach employed for each model. BioMedBERT is pretrained from scratch, allowing it to capture intricate nuances and domain-specific intricacies. On the other hand, ClinicalBERT is built upon the BERT model, with its training grounded in general-domain language models. As a result, we decided to integrate the BioMedBERT model as an essential element within our multimodal framework.

2.5 Multimodal Learning

All those models above use only one data source to make predictions. Previous studies show that the combination of time series and clinical record is useful for the prediction^18,46,47. Therefore, we used a representation of the clinical record in conjunction with the time series portion of the patient data and incorporated the fusion module to improve the performance of the prediction.

As depicted in Figure 2, our proposed model seamlessly integrates two distinct modalities. Specifically, we concatenate the embeddings of two top-performing unimodal models—LSTM for time-series measurements and BioMedBERT for clinical notes. The multimodal encoder harmonizes these embeddings, fusing the information from both modalities and projecting them into a shared space.

Figure 2.

Schematic Representation of the Proposed Multimodal Model: The diagram illustrates our proposed framework, wherein we concatenate embeddings derived from 6 hours’ time-series measurements using LSTM and clinical notes processed by BioMedBERT. This combined representation is utilized for the prediction of AKI/CRRT, aiming to forecast clinical manifestation at least 12 hours in advance.

In this study, we utilize time series measurements denoted as X_ts ∈ R^2×4, where X_ts represents the time series measurements with L length of measurements counted by hours, and D variables. In our case, L is set to 6, and D is 100, as following Harutyunyan’s setup⁴¹ that X_ts is composed of original time series variables and binary time series variables which indicates whether the corresponding variables are observed or imputed. The details regarding data curation are described in the experiment section.

The final hidden state of LSTM model, denoted as E_ts, serves as the embedding of the time-series data with default 64 dimensions. Concurrently, X_notes represents clinical notes with a dimension of 512, complying with the maximum input size requirement of BioMedBERT. As illustrated in Figure 2, a necessity arises to partition the clinical notes into several subsequences to meet the specified requirement. The embedding matrix E_notes ∈ R^516×789 is elucidated, where 512 corresponds to the dimensionality of the embedding for each token, encapsulating contextual relationships of the word within a 512-dimensional space. The value 768 designates the hidden size in the transformer model, representing the broader contextual understanding of the input sequence which determined by the transformer’s self-attention mechanism.

The calculation of Ē_notes involves two steps: initially, the average of the embeddings of all the subsequences from the same patient is computed, followed by taking the mean in the 512 dimensions, representing the average contribution of each dimension across all embeddings, and providing a condensed representation of the overall information captured by the embedding. Consequently, E_notes is a vector with 768 dimensions. The symbol ⊕ denotes the concatenate operation, wherein the embeddings from LSTM and BioMedBERT are combined to form the final embedding E₁₁ for predictive tasks. As depicted in Figure 2, the ultimate embedding undergoes processing through a fully connected layer and a Softmax layer to derive predictions for the designated task.

3 Results

3.1 Data processing

In this study, we utilized the complete MIMIC-IV database⁴⁸, which includes approximately 73,181 ICU stays across 50,920 critical care patients. We curated time series data from this resource inspired by prior work^41,49,50. Figure 3 summarizes the data extraction and processing steps.

Figure 3:

MIMIC-IV Data Processing Diagram: First, a cohort is established by applying predefined selection criteria. Next, the data undergoes manipulation involving the aggregation of measurements, unit conversion, outlier filtration, discretization, imputation, and normalization. Finally, data extraction is conducted in alignment with our task definition.

To select our cohort as illustrated in Figure 3, we extracted pertinent data from the raw MIMIC-IV tables and organizing them by patient as the initial step. Exclusion criteria are applied to admissions and ICU stays, first excluding hospital admissions with multiple ICU stays or transfers between different ICU units to mitigate outcome ambiguity associated with hospital admissions. Additionally, we excluded ICU stays involving patients under 18 due to significant physiological differences between adults and pediatric patients. The resulting root cohort comprises 45,127 unique patients, with a total of 56,908 ICU stays and over 300 million clinical events.

We next moved into the data processing phase, where clinical aggregation is initially performed. Each measurement in the MIMIC-IV database is associated with a unique ItemID, but multiple ItemIDs may match to semantically equivalent clinical features because of the different version of EHR or the same measurement at various places, e.g., admission and ICU. For example, “Weight” may be recorded under 224639, 226512, or 226531. Then a clinical taxonomy⁴⁹ is employed to group semantically equivalent ItemIDs into more robust “clinical aggregate” features, thereby reducing data missingness and duplicate measures. Subsequently, unit conversion and outlier detection are executed to address data unit inconsistencies and handle outliers. Measurements are standardized into consistent units, with clinical experts’ input used to define valid clinical measure ranges and thresholds for detecting unusable outliers⁴¹. This comprehensive data curation ensures the integrity and quality of the dataset for subsequent analyses.

After the data cleaning phase, we generated time-series data and imputed missing values. In the MIMIC-IV raw data, fine-grained timestamps are provided for each laboratory measurement and recorded vital sign. However, the frequency of most measurements is notably sparse, with certain variables having infrequent occurrences. To create a more interpretable and machine-learning-friendly representation, we employed an aggregation strategy for the observations within each ICU stay’s time-series.

Specifically, we discretized the data into hourly buckets, facilitating a denser representation conducive to modern machine learning methods designed for time-series analyses that expect discretized time representations. In cases where multiple measurements of the same variable are within a single hour, we selected the measurement that is nearest to the discrete integer time point. Conversely, if no measurement occurred within an hour, we resorted to imputation by selecting the value from the previous hour. In the last step, we performed normalization, ensuring that the time-series data is scaled for seamless integration into the subsequent modeling stages.

For the clinical notes, while the patients typically possess a myriad of diverse notes, the inherent constraints of BERT-based models necessitate adherence to a fixed maximum input sequence length. To address this, we systematically divided notes into subsequences, each containing up to 300 words. It is pertinent to note that BERT processes input using sub-word units (WordPieces) rather than entire words. Given BERT’s restriction to a maximum sequence of 512 sub-word unit tokens, equivalent to approximately 300 words, we aligned our subsequence length accordingly. In addition, we kept the relevant information of the clinical notes (described in 2.2 Methodological framework section) and converted words to lowercase, removing line breaks and carriage returns, de-identifying brackets, and eliminating special characters such as “==“, “--” ⁴⁵.

3.2 Generation of training and testing data

Guided by the task definition in the Figure 3 and description in 2.2 Methodological framework section, we generated training and testing datasets tailored for AKI and CRRT prediction. Given that patients with time-series data and with clinical notes do not exhibit a one-to-one correspondence, we selected patients who possess both types of information. Subsequently, we randomly allocated approximately 80% of the events as training data and the remaining 20% as testing data for both AKI and CRRT prediction. The resultant dataset comprised 38,400 records for AKI training data, of which 14,168 are positive instances. For AKI testing data, we had 10,586 records, with 2,103 of them being positive instances. Similarly, the CRRT training data encompassed 18,697 records, with 764 positive instances. The CRRT testing data comprised 4,473 records, with 52 positive instances. It is imperative to highlight the inherent challenge of class imbalance in CRRT prediction. The marked disproportion in the number of positive instances poses an additional layer of complexity to the predictive modeling task.

3.3 Performance comparison

Next, we evaluated and compared the performance of corresponding unimodal models on the same tasks across the two distinct data types with the goal of identifying the most effective models within their respective domains to serve as baselines for comparison with the performance of the multimodal model we have proposed.

3.3.1 Time-series unimodal models’ comparison

The results presented in Table 1 provide a comparative analysis of unimodal models utilizing time-series data. Notably, LSTM consistently outperforms the other two models across both tasks, particularly when evaluating metrics such as AUPRC, precision, and recall.

3.3.2 Clinical notes unimodal models’ comparison

Table 2 illustrate the performance comparison of unimodal models utilizing clinical notes data. Although the ClinicalBERT model exhibits slightly superior recall, considering numerous factors, especially the AUPRC, which is indicative of performance in imbalanced data scenarios, we conclude that BioMedBERT is the more suitable model for this comparison.

View this table:

Table 2. BioMedBERT and ClinicalBERT performance comparison for (a) AKI (b) CRRT prediction.

3.3.3 Performance of multimodal model

Next, we compared the performance of our multimodal model with that of the top-performing unimodal model. The results as detailed in Table 3 and Figure 4 consistently highlights the superior efficacy of the multimodal model with p-value 3.7x10⁻³ (AKI) and 2.68x10⁻⁵ (CRRT) of McNemar’s test comparing with LSTM. Analysis of precision and recall in CRRT predictions (Table 3(b)) indicates that our multimodal model identifies over 25% more patients who require dialysis than the unimodal model (LSTM). This translates to an increase in prediction accuracy for CRRT of approximately 25%, a significant improvement with the potential to reduce hospital complications and improve patient outcomes.

View this table:

Table 3. Multimodal and unimodal model performance comparison for (a) AKI (b) CRRT prediction.

Figure 4.

Multimodal and unimodal model performance comparison for (a) AKI (b) CRRT prediction.

Moreover, the AUPRC (Figure 4(b)) for our multimodal model shows an improvement of more than 20% over the unimodal model (LSTM), suggesting that the multimodal model can effectively minimize unnecessary treatments or interventions while accurately identifying critical cases. This finding is critical in the context of CRRT data because it is highly imbalanced and provides limited input for unimodal models. The strength of our multimodal approach is its ability to leverage complementary insights from different data sources, leading to a more comprehensive understanding of underlying patient data patterns.

For clinical tasks, where false positives are of paramount concern, a precision threshold of 80% is set to minimize the risk of alarm fatigue. Our results, specifically AUPRC plots in Figure 4(b), show that at the level of precision, our multimodal model achieves almost 50% higher recall than the unimodal model (yellow dotted line). This strategic evaluation approach ensures the minimization of false positives while optimizing recall, contributing to the clinical utility and reliability of the unimodal model in real-world applications.

3.4 Elucidating Feature Importance in AKI and CRRT Prediction Using SHAP Values

Building upon the enhanced performance of our multimodal model, we prioritized interpretability, which is essential for clinical adoption. Leveraging concepts from Integrated Gradients and SmoothGrad^51,52, the integration of SHapley Additive exPlanation (SHAP) values with the expected gradients algorithm allowed us to dissect the model’s predictive mechanics at various time points, providing actionable insights into the features most contributory to the risk of AKI and CRRT requirement.

Analysis of the top SHAP values across six time points reveals that the model assigns varying degrees of importance to different clinical features in the prediction of AKI (Figure 5(a)). These features include physiological parameters, laboratory results, and patient vitals, which are known to be associated with kidney function and AKI risk. At the first time-point; we identified urine output, oxygen saturation, and diastolic blood pressure as the most influential features. Urine output is a direct measure of kidney function, with deviations from normal ranges indicating potential renal compromise. Oxygen saturation reflects systemic oxygen delivery, where hypoxia may contribute to renal ischemia⁵³. Diastolic blood pressure is indicative of renal perfusion, with both hypotension and hypertension potentially leading to AKI. As the patients’ clinical course progresses at the second timepoint, urine output remains a predominant feature, emphasizing its critical role in ongoing AKI risk assessment. Oxygen saturation and diastolic blood pressure continue to be significant, alongside red blood cell count, which may indicate changes in blood volume or hematologic conditions affecting kidney function. By the third timepoint, additionally, red blood cell count, and weight emerge as key indicators, with weight potentially reflecting fluid balance, a crucial factor in AKI development⁵⁴. Respiratory rate and neutrophils also rank highly, the former potentially signifying respiratory compensation for metabolic acidosis, and the latter suggesting an inflammatory or infectious process that could impact kidney health. At the fourth timepoint, urine output, red blood cell count, and oxygen saturation continue to dominate the model’s attention, sustaining their roles as essential markers for AKI prediction. Notably, albumin and lactate levels come into focus, where hypoalbuminemia can be associated with chronic illness impacting the kidneys⁵⁵, and elevated lactate may indicate tissue hypoxia and metabolic distress, often preceding renal injury^56,57. By the 5^th and 6^th time-points, the significance of urine output in AKI prediction is highlighted as it remains as the feature with the highest SHAP value.

Figure 5:

(a) AKI (b) CRRT prediction SHAP values across timepoints

The consistent prominence of urine output across all timepoints stresses its essential role in assessing renal function and in AKI staging criteria. The presence of vital signs such as blood pressure and oxygen saturation across multiple timepoints highlights the systemic nature of AKI, where renal health is closely tied to overall hemodynamic stability of the patient. Laboratory parameters such as red blood cell count and albumin levels provide insight into the patient’s volume status and nutritional state, which are integral to kidney function. These trends align with the known pathophysiology of AKI, where both acute changes and underlying chronic conditions converge to impact renal health.

Similarly for our predictive model for CRRT requirement, we observed the urine output as the most important and consistent predictor across all time points (Figure 5(b)). The anion gap is a marker for metabolic acidosis, a common occurrence in renal failure and an indicator for CRRT⁵⁸. Beginning from the second time point, we begin observing anion gap as an important feature and its feature importance rank increases with time. Elevated phosphate levels can indicate renal dysfunction, as kidneys play a central role in phosphate regulation. Our results show phosphate levels are among the most important and persistent features that predict CRRT need in our patient cohort. By the fourth timepoint, the model highlights the anion gap and respiratory rate, the latter of which may signify respiratory compensation for metabolic acidosis—a condition often necessitating CRRT. In the final timepoint analysis, urine output, alongside anion gap and phosphate, remains a principal feature. Our analyses reveal a clear trend: certain features, particularly urine output, the anion gap, and phosphate, are consistently prominent across all timepoints in predicting the need for CRRT. This indicates a strong association of these features with the underlying pathophysiological processes leading to requirement for CRRT, such as fluid overload, electrolyte imbalances, and acid-base disturbances.

Collectively, the interpretative power of SHAP values in our models not only validates the clinical relevance of the identified features but also enhances the model’s utility by offering clinicians a detailed profile of AKI features and CRRT risk factors as they evolve over time. Recognizing these features’ significance can assist clinicians in early identification of patients who may benefit from CRRT, thereby potentially improving outcomes in critically ill patients, through timely and targeted interventions.

3.5 Clinical notes analysis using BioMedBERT embeddings

The incorporation of BioMedBERT allowed our multimodal model to analyze unstructured clinical notes from the MIMIC-IV database for the prediction of AKI and CRRT. Through this analysis, we identified key terms (tokens) that BioMedBERT attributed with high importance scores, potentially revealing clinical narratives associated with these conditions. Here it is important to remember that we only included notes recorded at the time of patient admission and excluded discharge notes to focus on data relevant to the initial phase of hospitalization.

Our results showed that for AKI prediction, the term ‘hemodialysis’ appeared with the highest importance score and similarly, ‘dialysis’ was captured as significant, which may initially seem inconsistent with our focus on the initial phase of hospitalization. However, these terms likely represent significant risk factors in the patient’s past medical history, which are crucial for predicting the development of AKI. The presence of these tokens shows our model’s ability to identify patients with pre-existing conditions that elevate the risk of AKI, even if the actual treatment or condition is not an immediate concern at the point of hospital admission. We also observed ‘hd’ (hemodialysis) and ‘laparotomy’ (which is a surgical procedure involving a large incision through the abdominal wall) in the token list, suggesting possible postoperative kidney stress, which is a recognized risk factor for AKI development^59,60. Moreover, the presence of terms such as ‘ckd’ (chronic kidney disease), ‘cirrhosis,’ and ‘ascites’ indicates underlying chronic conditions that can predispose patients to AKI. Additionally, tokens like ‘jaundice,’ and ‘ed’ (emergency department) highlight symptoms and care settings that may be relevant to the patient’s acute or chronic health challenges.

For the predictions related to CRRT, the token ‘endotracheal’ is associated with the highest importance score. It is relevant as it indicates airway management typically required for critically ill patients, who may also be at an elevated risk for CRRT. The presence of ‘obese’ and ‘intubation’ as important tokens correlates with known risk factors for AKI, which can lead to CRRT. Obesity is associated with several comorbidities that predispose individuals to kidney stress⁶¹, and intubation is often part of the management of severely ill patients who are at an increased risk for renal complications.

Overall, our multimodal model’s ability to identify significant tokens from clinical admission notes offers a promising avenue for enhancing patient risk assessment during the critical initial phase of hospitalization. The detection of terms related to prior renal health problems provides a deeper insight into the patient’s condition, facilitating a proactive approach to AKI and CRRT management. This can be particularly beneficial for tailoring both monitoring and treatment strategies, potentially reducing the progression to severe renal events.

4. Conclusions

In this study, we developed a longitudinal, multimodal model that capitalizes on both unstructured clinical notes and structured time-series measurements derived from EHRs for the early AKI prediction and the prospective identification of the necessity for CRRT. Our model demonstrates exceptional predictive accuracy, offering valuable insights into AKI and CRRT needs at least 12 hours ahead of clinical manifestation. The achieved AUROC and AUPRC metrics further underscore the robustness of our system. The incorporation of interpretability measures, including SHAP values and the expected gradients algorithm, ensures transparency and positions our model as a reliable candidate for clinical deployment. This research holds promise in optimizing AKI management, contributing to improved patient outcomes, and marks a significant stride toward enhancing healthcare practices.

Data Availability

All data produced are available online at MIMIC-IV

Funding

This project has been made possible in part by grant U01CA247760 and U01CA281902 to KC from National Cancer Institute.

CRediT authorship contribution statement

Yukun Tan: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Validation, Writing - original draft, Writing - review & editing. Merve Dede: Conceptualization, Investigation, Formal analysis, Validation, Writing - original draft, Writing - review & editing. Vakul Mohanty: Conceptualization, Visualization. Jinzhuang Dou: Conceptualization, Software. Holly Hill: Conceptualization, Writing - review & editing. Elmer Bernstam: Supervision, Validation, Writing - review & editing. Ken Chen: Conceptualization, Supervision, Project administration, Funding acquisition, Writing – review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

1.↵
Khwaja A. KDIGO clinical practice guidelines for acute kidney injury. Nephron Clin Pract. 2012;120(4):c179–184. doi:10.1159/000339789
OpenUrl CrossRef PubMed
2.↵
Hobson C, Ozrazgat-Baslanti T, Kuxhausen A, et al. Cost and Mortality Associated With Postoperative Acute Kidney Injury. Ann Surg. 2015;261(6):1207–1214. doi:10.1097/SLA.0000000000000732
OpenUrl CrossRef PubMed
3.
Silver SA, Long J, Zheng Y, Chertow GM. Cost of Acute Kidney Injury in Hospitalized Patients. J Hosp Med. 2017;12(2):70–76. doi:10.12788/jhm.2683
OpenUrl CrossRef PubMed
4.↵
Silver SA, Chertow GM. The Economic Consequences of Acute Kidney Injury. Nephron. 2017;137(4):297–301. doi:10.1159/000475607
OpenUrl CrossRef
5.↵
Cheng P, Waitman LR, Hu Y, Liu M. Predicting Inpatient Acute Kidney Injury over Different Time Horizons: How Early and Accurate? AMIA Annu Symp Proc. 2018;2017:565–574.
OpenUrl
6.↵
Brown JR, Rezaee ME, Marshall EJ, Matheny ME. Hospital Mortality in the United States following Acute Kidney Injury. Biomed Res Int. 2016;2016:4278579. doi:10.1155/2016/4278579
OpenUrl CrossRef
7.↵
Gage BF, Waterman AD, Shannon W, Boechler M, Rich MW, Radford MJ. Validation of Clinical Classification Schemes for Predicting StrokeResults From the National Registry of Atrial Fibrillation. JAMA. 2001;285(22):2864–2870. doi:10.1001/jama.285.22.2864
OpenUrl CrossRef PubMed Web of Science
8.↵
Faubel S, Shah PB. Immediate Consequences of Acute Kidney Injury: The Impact of Traditional and Nontraditional Complications on Mortality in Acute Kidney Injury. Adv Chronic Kidney Dis. 2016;23(3):179–185. doi:10.1053/j.ackd.2016.02.007
OpenUrl CrossRef
9.
Hsia CCW, Ravikumar P, Ye J. Acute lung injury complicating acute kidney injury: A model of endogenous αKlotho deficiency and distant organ dysfunction. Bone. 2017;100:100–109. doi:10.1016/j.bone.2017.03.047
OpenUrl CrossRef
10.
Mehta RL, Bouchard J, Soroko SB, et al. Sepsis as a cause and consequence of acute kidney injury: Program to Improve Care in Acute Renal Disease. Intensive Care Med. 2011;37(2):241–248. doi:10.1007/s00134-010-2089-9
OpenUrl CrossRef PubMed Web of Science
11.↵
Coca SG, Singanamala S, Parikh CR. Chronic kidney disease after acute kidney injury: a systematic review and meta-analysis. Kidney Int. 2012;81(5):442–448. doi:10.1038/ki.2011.379
OpenUrl CrossRef PubMed Web of Science
12.↵
Hoste EAJ, Bagshaw SM, Bellomo R, et al. Epidemiology of acute kidney injury in critically ill patients: the multinational AKI-EPI study. Intensive Care Med. 2015;41(8):1411–1423. doi:10.1007/s00134-015-3934-7
OpenUrl CrossRef PubMed
13.↵
Janke AT, Overbeek DL, Kocher KE, Levy PD. Exploring the Potential of Predictive Analytics and Big Data in Emergency Care. Annals of Emergency Medicine. 2016;67(2):227–236. doi:10.1016/j.annemergmed.2015.06.024
OpenUrl CrossRef PubMed
14.↵
Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13(6):395–405. doi:10.1038/nrg3208
OpenUrl CrossRef PubMed
15.↵
Halford GS, Baker R, McCredden JE, Bain JD. How many variables can humans process? Psychol Sci. 2005;16(1):70–76. doi:10.1111/j.0956-7976.2005.00782.x
OpenUrl CrossRef PubMed Web of Science
16.↵
Sutherland SM, Chawla LS, Kane-Gill SL, et al. Utilizing electronic health records to predict acute kidney injury risk and outcomes: Workgroup statements from the 15th ADQI Consensus Conference. Canadian Journal of Kidney Health and Disease. 2016;3(1). doi:10.1186/s40697-016-0099-4
OpenUrl CrossRef
17.↵
Grnarova P, Schmidt F, Hyland SL, Eickhoff C. Neural Document Embeddings for Intensive Care Patient Mortality Prediction. Published online December 1, 2016. doi:10.48550/arXiv.1612.00467
OpenUrl CrossRef
18.↵
Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Sci Data. 2019;6(1):52. doi:10.1038/s41597-019-0055-0
OpenUrl CrossRef PubMed
19.↵
Zhang X, Dou D, Wu J. Learning Conceptual-Contextual Embeddings for Medical Text. Published online March 11, 2020. doi:10.48550/arXiv.1908.06203
OpenUrl CrossRef
20.↵
Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Published online May 24, 2019. doi:10.48550/arXiv.1810.04805
OpenUrl CrossRef
21.↵
Niu K, Zhang K, Peng X, Pan Y, Xiao N. Deep multi-modal intermediate fusion of clinical record and time series data in mortality prediction. Front Mol Biosci. 2023;10:1136071. doi:10.3389/fmolb.2023.1136071
OpenUrl CrossRef
22.↵
Wijeysundera DN, Karkouti K, Dupuis JY, et al. Derivation and Validation of a Simplified Predictive Index for Renal Replacement Therapy After Cardiac Surgery. JAMA. 2007;297(16):1801–1809. doi:10.1001/jama.297.16.1801
OpenUrl CrossRef PubMed Web of Science
23.
Gurm HS, Seth M, Kooiman J, Share D. A Novel Tool for Reliable and Accurate Prediction of Renal Complications in Patients Undergoing Percutaneous Coronary Intervention. Journal of the American College of Cardiology. 2013;61(22):2242–2248. doi:10.1016/j.jacc.2013.03.026
OpenUrl FREE Full Text
24.↵
Rank N, Pfahringer B, Kempfert J, et al. Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance. npj Digit Med. 2020;3(1):1–12. doi:10.1038/s41746-020-00346-8
OpenUrl CrossRef
25.↵
Sun C, Chen D, Jin X, et al. Association between acute kidney injury and prognoses of cardiac surgery patients: Analysis of the MIMIC-III database. Frontiers in Surgery. 2023;9. Accessed December 6, 2023. https://www.frontiersin.org/articles/10.3389/fsurg.2022.1044937
26.↵
Sanchez-Pinto LN, Khemani RG. Development of a Prediction Model of Early Acute Kidney Injury in Critically Ill Children Using Electronic Health Record Data. Pediatric Critical Care Medicine. 2016;17(6):508–515. doi:10.1097/PCC.0000000000000750
OpenUrl CrossRef PubMed
27.↵
Kate RJ, Perez RM, Mazumdar D, Pasupathy KS, Nilakantan V. Prediction and detection models for acute kidney injury in hospitalized older adults. BMC Medical Informatics and Decision Making. 2016;16(1). doi:10.1186/s12911-016-0277-4
OpenUrl CrossRef
28.↵
Chiofolo C, Chbat N, Ghosh E, Eshelman L, Kashani K. Automated Continuous Acute Kidney Injury Prediction and Surveillance: A Random Forest Model. Mayo Clinic Proceedings. 2019;94(5):783–792. doi:10.1016/j.mayocp.2019.02.009
OpenUrl CrossRef PubMed
29.↵
Xia Z, Xu P, Xiong Y, Lai Y, Huang Z. Survival Prediction in Patients with Hypertensive Chronic Kidney Disease in Intensive Care Unit: A Retrospective Analysis Based on the MIMIC-III Database. Journal of Immunology Research. 2022;2022:e3377030. doi:10.1155/2022/3377030
OpenUrl CrossRef
30.
Yue S, Li S, Huang X, et al. Machine learning for the prediction of acute kidney injury in patients with sepsis. Journal of Translational Medicine. 2022;20(1):215. doi:10.1186/s12967-022-03364-0
OpenUrl CrossRef
31.
Deng YH, Luo XQ, Yan P, Zhang NY, Liu Y, Duan SB. Outcome prediction for acute kidney injury among hospitalized children via eXtreme Gradient Boosting algorithm. Sci Rep. 2022;12:8956. doi:10.1038/s41598-022-13152-x
OpenUrl CrossRef
32.↵
Liu J, Wu J, Liu S, Li M, Hu K, Li K. Predicting mortality of patients with acute kidney injury in the ICU using XGBoost model. PLOS ONE. 2021;16(2):e0246306. doi:10.1371/journal.pone.0246306
OpenUrl CrossRef
33.↵
Xu Z, Feng Y, Li Y, et al. Predictive Modeling of the Risk of Acute Kidney Injury in Critical Care: A Systematic Investigation of The Class Imbalance Problem. AMIA Jt Summits Transl Sci Proc. 2019;2019:809–818.
OpenUrl
34.↵
Martinez DA, Levin SR, Klein EY, et al. Early Prediction of Acute Kidney Injury in the Emergency Department With Machine-Learning Methods Applied to Electronic Health Record Data. Annals of Emergency Medicine. 2020;76(4):501–514. doi:10.1016/j.annemergmed.2020.05.026
OpenUrl CrossRef
35.↵
Machado MN, Nakazone MA, Maia LN. Acute kidney injury based on KDIGO (Kidney Disease Improving Global Outcomes) criteria in patients with elevated baseline serum creatinine undergoing cardiac surgery. Rev Bras Cir Cardiovasc. 2014;29(3):299–307. doi:10.5935/1678-9741.20140049
OpenUrl CrossRef PubMed
36.↵
Makris K, Spanou L. Acute Kidney Injury: Definition, Pathophysiology and Clinical Phenotypes. Clin Biochem Rev. 2016;37(2):85–98.
OpenUrl PubMed
37.↵
Chowdhury RR, Zhang X, Shang J, Gupta RK, Hong D. TARNet: Task-Aware Reconstruction for Time-Series Transformer. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. KDD ‘22. Association for Computing Machinery; 2022:212–220. doi:10.1145/3534678.3539329
OpenUrl CrossRef
38.↵
Placido D, Yuan B, Hjaltelin JX, et al. A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories. Nat Med. 2023;29(5):1113–1122. doi:10.1038/s41591-023-02332-5
OpenUrl CrossRef PubMed
39.↵
Tipirneni S, Reddy CK. Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series. Published online February 16, 2022. Accessed March 13, 2024. http://arxiv.org/abs/2107.14293
40.↵
Lipton ZC, Kale DC, Elkan C, Wetzel R. Learning to Diagnose with LSTM Recurrent Neural Networks. Published online March 21, 2017. doi:10.48550/arXiv.1511.03677
OpenUrl CrossRef
41.↵
Multitask learning and benchmarking with clinical time series data | Scientific Data. Accessed August 17, 2023. https://www.nature.com/articles/s41597-019-0103-9
42.↵
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ‘16. Association for Computing Machinery; 2016:785–794. doi:10.1145/2939672.2939785
OpenUrl CrossRef
43.↵
Long Short-Term Memory | Neural Computation | MIT Press. Accessed December 7, 2023. https://direct.mit.edu/neco/article-abstract/9/8/1735/6109/Long-Short-Term-Memory?redirectedFrom=fulltext
44.↵
Gu Y, Tinn R, Cheng H, et al. Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. ACM Trans Comput Healthcare. 2021;3(1):2:1-2:23. doi:10.1145/3458754
OpenUrl CrossRef
45.↵
Huang K, Altosaar J, Ranganath R. ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission. Published online November 28, 2020. doi:10.48550/arXiv.1904.05342
OpenUrl CrossRef
46.↵
Khadanga S, Aggarwal K, Joty S, Srivastava J. Using Clinical Notes with Time Series Data for ICU Management. Published online January 2, 2020. doi:10.48550/arXiv.1909.09702
OpenUrl CrossRef
47.↵
Deznabi I, Iyyer M, Fiterau M. Predicting in-hospital mortality by combining clinical notes with time-series data. In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. Association for Computational Linguistics; 2021:4026–4031. doi:10.18653/v1/2021.findings-acl.352
OpenUrl CrossRef
48.↵
Johnson AEW, Bulgarelli L, Shen L, et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. 2023;10(1):1. doi:10.1038/s41597-022-01899-x
OpenUrl CrossRef PubMed
49.↵
Wang S, McDermott MBA, Chauhan G, Ghassemi M, Hughes MC, Naumann T. MIMIC-Extract: a data extraction, preprocessing, and representation pipeline for MIMIC-III. In: Proceedings of the ACM Conference on Health, Inference, and Learning. ACM; 2020:222–235. doi:10.1145/3368555.3384469
OpenUrl CrossRef
50.↵
Gupta M, Gallamoza B, Cutrona N, Dhakal P, Poulain R, Beheshti R. An Extensive Data Processing Pipeline for MIMIC-IV. Published online December 20, 2022. doi:10.48550/arXiv.2204.13841
OpenUrl CrossRef
51.↵
Kokhlikyan N, Miglani V, Martin M, et al. Captum: A unified and generic model interpretability library for PyTorch. Published online September 16, 2020. doi:10.48550/arXiv.2009.07896
OpenUrl CrossRef
52.↵
Lundberg S, Lee SI. A Unified Approach to Interpreting Model Predictions. Published online November 24, 2017. doi:10.48550/arXiv.1705.07874
OpenUrl CrossRef
53.↵
Eckardt KU, Bernhardt WW, Weidemann A, et al. Role of hypoxia in the pathogenesis of renal disease. Kidney International. 2005;68:S46–S51. doi:10.1111/j.1523-1755.2005.09909.x
OpenUrl CrossRef PubMed
54.↵
Godin M, Bouchard J, Mehta RL. Fluid Balance in Patients with Acute Kidney Injury: Emerging Concepts. Nephron Clinical Practice. 2013;123(3-4):238–245. doi:10.1159/000354713
OpenUrl CrossRef PubMed
55.↵
Haller C. Hypoalbuminemia in Renal Failure: Pathogenesis and Therapeutic Considerations. Kidney and Blood Pressure Research. 2006;28(5-6):307–310. doi:10.1159/000090185
OpenUrl CrossRef
56.↵
Andersen LW, Mackenhauer J, Roberts JC, Berg KM, Cocchi MN, Donnino MW. Etiology and Therapeutic Approach to Elevated Lactate Levels. Mayo Clinic Proceedings. 2013;88(10):1127–1140. doi:10.1016/j.mayocp.2013.06.012
OpenUrl CrossRef PubMed Web of Science
57.↵
Weyker PD, Pérez XL, Liu KD. Management of Acute Kidney Injury and Acid-Base Balance in the Septic Patient. Clin Chest Med. 2016;37(2):277–288. doi:10.1016/j.ccm.2016.01.012
OpenUrl CrossRef
58.↵
Jentzer JC, Bihorac A, Brusca SB, et al. Contemporary Management of Severe Acute Kidney Injury and Refractory Cardiorenal Syndrome: JACC Council Perspectives. Journal of the American College of Cardiology. 2020;76(9):1084–1101. doi:10.1016/j.jacc.2020.06.070
OpenUrl FREE Full Text
59.↵
Grams ME, Sang Y, Coresh J, et al. Acute Kidney Injury After Major Surgery: A Retrospective Analysis of Veterans Health Administration Data. American Journal of Kidney Diseases. 2016;67(6):872–880. doi:10.1053/j.ajkd.2015.07.022
OpenUrl CrossRef PubMed
60.↵
Saran R, Robinson B, Abbott KC, et al. US Renal Data System 2017 Annual Data Report: Epidemiology of Kidney Disease in the United States. American Journal of Kidney Diseases. 2018;71(3):A7. doi:10.1053/j.ajkd.2018.01.002
OpenUrl CrossRef
61.↵
Danziger J, Chen KP, Lee J, et al. Obesity, Acute Kidney Injury, and Mortality in Critical Illness. Critical Care Medicine. 2016;44(2):328. doi:10.1097/CCM.0000000000001398
OpenUrl CrossRef PubMed

View the discussion thread.

Posted March 15, 2024.

Download PDF

Data/Code

Citation Tools

Subject Area

Intensive Care and Critical Care Medicine

Subject Areas

All Articles

Addiction Medicine (349)
Allergy and Immunology (668)
Allergy and Immunology (668)
Anesthesia (181)
Cardiovascular Medicine (2648)
Dentistry and Oral Medicine (316)
Dermatology (223)
Emergency Medicine (399)
Endocrinology (including Diabetes Mellitus and Metabolic Disease) (942)
Epidemiology (12228)
Forensic Medicine (10)
Gastroenterology (759)
Genetic and Genomic Medicine (4103)
Geriatric Medicine (387)
Health Economics (680)
Health Informatics (2657)
Health Policy (1005)
Health Systems and Quality Improvement (985)
Hematology (363)
HIV/AIDS (851)
Infectious Diseases (except HIV/AIDS) (13695)
Intensive Care and Critical Care Medicine (797)
Medical Education (399)
Medical Ethics (109)
Nephrology (436)
Neurology (3882)
Nursing (209)
Nutrition (577)
Obstetrics and Gynecology (739)
Occupational and Environmental Health (695)
Oncology (2030)
Ophthalmology (585)
Orthopedics (240)
Otolaryngology (306)
Pain Medicine (250)
Palliative Medicine (75)
Pathology (473)
Pediatrics (1115)
Pharmacology and Therapeutics (466)
Primary Care Research (452)
Psychiatry and Clinical Psychology (3432)
Public and Global Health (6527)
Radiology and Imaging (1403)
Rehabilitation Medicine and Physical Therapy (814)
Respiratory Medicine (871)
Rheumatology (409)
Sexual and Reproductive Health (410)
Sports Medicine (342)
Surgery (448)
Toxicology (53)
Transplantation (185)
Urology (165)

[1] 1.↵
Khwaja A. KDIGO clinical practice guidelines for acute kidney injury. Nephron Clin Pract. 2012;120(4):c179–184. doi:10.1159/000339789
OpenUrl CrossRef PubMed

[2] 2.↵
Hobson C, Ozrazgat-Baslanti T, Kuxhausen A, et al. Cost and Mortality Associated With Postoperative Acute Kidney Injury. Ann Surg. 2015;261(6):1207–1214. doi:10.1097/SLA.0000000000000732
OpenUrl CrossRef PubMed

[3] 3.
Silver SA, Long J, Zheng Y, Chertow GM. Cost of Acute Kidney Injury in Hospitalized Patients. J Hosp Med. 2017;12(2):70–76. doi:10.12788/jhm.2683
OpenUrl CrossRef PubMed

[4] 4.↵
Silver SA, Chertow GM. The Economic Consequences of Acute Kidney Injury. Nephron. 2017;137(4):297–301. doi:10.1159/000475607
OpenUrl CrossRef

[5] 5.↵
Cheng P, Waitman LR, Hu Y, Liu M. Predicting Inpatient Acute Kidney Injury over Different Time Horizons: How Early and Accurate? AMIA Annu Symp Proc. 2018;2017:565–574.
OpenUrl

[6] 6.↵
Brown JR, Rezaee ME, Marshall EJ, Matheny ME. Hospital Mortality in the United States following Acute Kidney Injury. Biomed Res Int. 2016;2016:4278579. doi:10.1155/2016/4278579
OpenUrl CrossRef

[7] 7.↵
Gage BF, Waterman AD, Shannon W, Boechler M, Rich MW, Radford MJ. Validation of Clinical Classification Schemes for Predicting StrokeResults From the National Registry of Atrial Fibrillation. JAMA. 2001;285(22):2864–2870. doi:10.1001/jama.285.22.2864
OpenUrl CrossRef PubMed Web of Science

[8] 8.↵
Faubel S, Shah PB. Immediate Consequences of Acute Kidney Injury: The Impact of Traditional and Nontraditional Complications on Mortality in Acute Kidney Injury. Adv Chronic Kidney Dis. 2016;23(3):179–185. doi:10.1053/j.ackd.2016.02.007
OpenUrl CrossRef

[9] 9.
Hsia CCW, Ravikumar P, Ye J. Acute lung injury complicating acute kidney injury: A model of endogenous αKlotho deficiency and distant organ dysfunction. Bone. 2017;100:100–109. doi:10.1016/j.bone.2017.03.047
OpenUrl CrossRef

[10] 10.
Mehta RL, Bouchard J, Soroko SB, et al. Sepsis as a cause and consequence of acute kidney injury: Program to Improve Care in Acute Renal Disease. Intensive Care Med. 2011;37(2):241–248. doi:10.1007/s00134-010-2089-9
OpenUrl CrossRef PubMed Web of Science

[11] 11.↵
Coca SG, Singanamala S, Parikh CR. Chronic kidney disease after acute kidney injury: a systematic review and meta-analysis. Kidney Int. 2012;81(5):442–448. doi:10.1038/ki.2011.379
OpenUrl CrossRef PubMed Web of Science

[12] 12.↵
Hoste EAJ, Bagshaw SM, Bellomo R, et al. Epidemiology of acute kidney injury in critically ill patients: the multinational AKI-EPI study. Intensive Care Med. 2015;41(8):1411–1423. doi:10.1007/s00134-015-3934-7
OpenUrl CrossRef PubMed

[13] 13.↵
Janke AT, Overbeek DL, Kocher KE, Levy PD. Exploring the Potential of Predictive Analytics and Big Data in Emergency Care. Annals of Emergency Medicine. 2016;67(2):227–236. doi:10.1016/j.annemergmed.2015.06.024
OpenUrl CrossRef PubMed

[14] 14.↵
Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat Rev Genet. 2012;13(6):395–405. doi:10.1038/nrg3208
OpenUrl CrossRef PubMed

[15] 15.↵
Halford GS, Baker R, McCredden JE, Bain JD. How many variables can humans process? Psychol Sci. 2005;16(1):70–76. doi:10.1111/j.0956-7976.2005.00782.x
OpenUrl CrossRef PubMed Web of Science

[16] 16.↵
Sutherland SM, Chawla LS, Kane-Gill SL, et al. Utilizing electronic health records to predict acute kidney injury risk and outcomes: Workgroup statements from the 15th ADQI Consensus Conference. Canadian Journal of Kidney Health and Disease. 2016;3(1). doi:10.1186/s40697-016-0099-4
OpenUrl CrossRef

[17] 17.↵
Grnarova P, Schmidt F, Hyland SL, Eickhoff C. Neural Document Embeddings for Intensive Care Patient Mortality Prediction. Published online December 1, 2016. doi:10.48550/arXiv.1612.00467
OpenUrl CrossRef

[18] 18.↵
Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Sci Data. 2019;6(1):52. doi:10.1038/s41597-019-0055-0
OpenUrl CrossRef PubMed

[19] 19.↵
Zhang X, Dou D, Wu J. Learning Conceptual-Contextual Embeddings for Medical Text. Published online March 11, 2020. doi:10.48550/arXiv.1908.06203
OpenUrl CrossRef

[20] 20.↵
Devlin J, Chang MW, Lee K, Toutanova K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. Published online May 24, 2019. doi:10.48550/arXiv.1810.04805
OpenUrl CrossRef

[21] 21.↵
Niu K, Zhang K, Peng X, Pan Y, Xiao N. Deep multi-modal intermediate fusion of clinical record and time series data in mortality prediction. Front Mol Biosci. 2023;10:1136071. doi:10.3389/fmolb.2023.1136071
OpenUrl CrossRef

[22] 22.↵
Wijeysundera DN, Karkouti K, Dupuis JY, et al. Derivation and Validation of a Simplified Predictive Index for Renal Replacement Therapy After Cardiac Surgery. JAMA. 2007;297(16):1801–1809. doi:10.1001/jama.297.16.1801
OpenUrl CrossRef PubMed Web of Science

[23] 23.
Gurm HS, Seth M, Kooiman J, Share D. A Novel Tool for Reliable and Accurate Prediction of Renal Complications in Patients Undergoing Percutaneous Coronary Intervention. Journal of the American College of Cardiology. 2013;61(22):2242–2248. doi:10.1016/j.jacc.2013.03.026
OpenUrl FREE Full Text

[24] 24.↵
Rank N, Pfahringer B, Kempfert J, et al. Deep-learning-based real-time prediction of acute kidney injury outperforms human predictive performance. npj Digit Med. 2020;3(1):1–12. doi:10.1038/s41746-020-00346-8
OpenUrl CrossRef

[25] 25.↵
Sun C, Chen D, Jin X, et al. Association between acute kidney injury and prognoses of cardiac surgery patients: Analysis of the MIMIC-III database. Frontiers in Surgery. 2023;9. Accessed December 6, 2023. https://www.frontiersin.org/articles/10.3389/fsurg.2022.1044937

[26] 26.↵
Sanchez-Pinto LN, Khemani RG. Development of a Prediction Model of Early Acute Kidney Injury in Critically Ill Children Using Electronic Health Record Data. Pediatric Critical Care Medicine. 2016;17(6):508–515. doi:10.1097/PCC.0000000000000750
OpenUrl CrossRef PubMed

[27] 27.↵
Kate RJ, Perez RM, Mazumdar D, Pasupathy KS, Nilakantan V. Prediction and detection models for acute kidney injury in hospitalized older adults. BMC Medical Informatics and Decision Making. 2016;16(1). doi:10.1186/s12911-016-0277-4
OpenUrl CrossRef

[28] 28.↵
Chiofolo C, Chbat N, Ghosh E, Eshelman L, Kashani K. Automated Continuous Acute Kidney Injury Prediction and Surveillance: A Random Forest Model. Mayo Clinic Proceedings. 2019;94(5):783–792. doi:10.1016/j.mayocp.2019.02.009
OpenUrl CrossRef PubMed

[29] 29.↵
Xia Z, Xu P, Xiong Y, Lai Y, Huang Z. Survival Prediction in Patients with Hypertensive Chronic Kidney Disease in Intensive Care Unit: A Retrospective Analysis Based on the MIMIC-III Database. Journal of Immunology Research. 2022;2022:e3377030. doi:10.1155/2022/3377030
OpenUrl CrossRef

[30] 30.
Yue S, Li S, Huang X, et al. Machine learning for the prediction of acute kidney injury in patients with sepsis. Journal of Translational Medicine. 2022;20(1):215. doi:10.1186/s12967-022-03364-0
OpenUrl CrossRef

[31] 31.
Deng YH, Luo XQ, Yan P, Zhang NY, Liu Y, Duan SB. Outcome prediction for acute kidney injury among hospitalized children via eXtreme Gradient Boosting algorithm. Sci Rep. 2022;12:8956. doi:10.1038/s41598-022-13152-x
OpenUrl CrossRef

[32] 32.↵
Liu J, Wu J, Liu S, Li M, Hu K, Li K. Predicting mortality of patients with acute kidney injury in the ICU using XGBoost model. PLOS ONE. 2021;16(2):e0246306. doi:10.1371/journal.pone.0246306
OpenUrl CrossRef

[33] 33.↵
Xu Z, Feng Y, Li Y, et al. Predictive Modeling of the Risk of Acute Kidney Injury in Critical Care: A Systematic Investigation of The Class Imbalance Problem. AMIA Jt Summits Transl Sci Proc. 2019;2019:809–818.
OpenUrl

[34] 34.↵
Martinez DA, Levin SR, Klein EY, et al. Early Prediction of Acute Kidney Injury in the Emergency Department With Machine-Learning Methods Applied to Electronic Health Record Data. Annals of Emergency Medicine. 2020;76(4):501–514. doi:10.1016/j.annemergmed.2020.05.026
OpenUrl CrossRef

[35] 35.↵
Machado MN, Nakazone MA, Maia LN. Acute kidney injury based on KDIGO (Kidney Disease Improving Global Outcomes) criteria in patients with elevated baseline serum creatinine undergoing cardiac surgery. Rev Bras Cir Cardiovasc. 2014;29(3):299–307. doi:10.5935/1678-9741.20140049
OpenUrl CrossRef PubMed

[36] 36.↵
Makris K, Spanou L. Acute Kidney Injury: Definition, Pathophysiology and Clinical Phenotypes. Clin Biochem Rev. 2016;37(2):85–98.
OpenUrl PubMed

[37] 37.↵
Chowdhury RR, Zhang X, Shang J, Gupta RK, Hong D. TARNet: Task-Aware Reconstruction for Time-Series Transformer. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. KDD ‘22. Association for Computing Machinery; 2022:212–220. doi:10.1145/3534678.3539329
OpenUrl CrossRef

[38] 38.↵
Placido D, Yuan B, Hjaltelin JX, et al. A deep learning algorithm to predict risk of pancreatic cancer from disease trajectories. Nat Med. 2023;29(5):1113–1122. doi:10.1038/s41591-023-02332-5
OpenUrl CrossRef PubMed

[39] 39.↵
Tipirneni S, Reddy CK. Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series. Published online February 16, 2022. Accessed March 13, 2024. http://arxiv.org/abs/2107.14293

[40] 40.↵
Lipton ZC, Kale DC, Elkan C, Wetzel R. Learning to Diagnose with LSTM Recurrent Neural Networks. Published online March 21, 2017. doi:10.48550/arXiv.1511.03677
OpenUrl CrossRef

[41] 41.↵
Multitask learning and benchmarking with clinical time series data | Scientific Data. Accessed August 17, 2023. https://www.nature.com/articles/s41597-019-0103-9

[42] 42.↵
Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. KDD ‘16. Association for Computing Machinery; 2016:785–794. doi:10.1145/2939672.2939785
OpenUrl CrossRef

[43] 43.↵
Long Short-Term Memory | Neural Computation | MIT Press. Accessed December 7, 2023. https://direct.mit.edu/neco/article-abstract/9/8/1735/6109/Long-Short-Term-Memory?redirectedFrom=fulltext

[44] 44.↵
Gu Y, Tinn R, Cheng H, et al. Domain-Specific Language Model Pretraining for Biomedical Natural Language Processing. ACM Trans Comput Healthcare. 2021;3(1):2:1-2:23. doi:10.1145/3458754
OpenUrl CrossRef

[45] 45.↵
Huang K, Altosaar J, Ranganath R. ClinicalBERT: Modeling Clinical Notes and Predicting Hospital Readmission. Published online November 28, 2020. doi:10.48550/arXiv.1904.05342
OpenUrl CrossRef

[46] 46.↵
Khadanga S, Aggarwal K, Joty S, Srivastava J. Using Clinical Notes with Time Series Data for ICU Management. Published online January 2, 2020. doi:10.48550/arXiv.1909.09702
OpenUrl CrossRef

[47] 47.↵
Deznabi I, Iyyer M, Fiterau M. Predicting in-hospital mortality by combining clinical notes with time-series data. In: Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. Association for Computational Linguistics; 2021:4026–4031. doi:10.18653/v1/2021.findings-acl.352
OpenUrl CrossRef

[48] 48.↵
Johnson AEW, Bulgarelli L, Shen L, et al. MIMIC-IV, a freely accessible electronic health record dataset. Sci Data. 2023;10(1):1. doi:10.1038/s41597-022-01899-x
OpenUrl CrossRef PubMed

[49] 49.↵
Wang S, McDermott MBA, Chauhan G, Ghassemi M, Hughes MC, Naumann T. MIMIC-Extract: a data extraction, preprocessing, and representation pipeline for MIMIC-III. In: Proceedings of the ACM Conference on Health, Inference, and Learning. ACM; 2020:222–235. doi:10.1145/3368555.3384469
OpenUrl CrossRef

[50] 50.↵
Gupta M, Gallamoza B, Cutrona N, Dhakal P, Poulain R, Beheshti R. An Extensive Data Processing Pipeline for MIMIC-IV. Published online December 20, 2022. doi:10.48550/arXiv.2204.13841
OpenUrl CrossRef

[51] 51.↵
Kokhlikyan N, Miglani V, Martin M, et al. Captum: A unified and generic model interpretability library for PyTorch. Published online September 16, 2020. doi:10.48550/arXiv.2009.07896
OpenUrl CrossRef

[52] 52.↵
Lundberg S, Lee SI. A Unified Approach to Interpreting Model Predictions. Published online November 24, 2017. doi:10.48550/arXiv.1705.07874
OpenUrl CrossRef

[53] 53.↵
Eckardt KU, Bernhardt WW, Weidemann A, et al. Role of hypoxia in the pathogenesis of renal disease. Kidney International. 2005;68:S46–S51. doi:10.1111/j.1523-1755.2005.09909.x
OpenUrl CrossRef PubMed

[54] 54.↵
Godin M, Bouchard J, Mehta RL. Fluid Balance in Patients with Acute Kidney Injury: Emerging Concepts. Nephron Clinical Practice. 2013;123(3-4):238–245. doi:10.1159/000354713
OpenUrl CrossRef PubMed

[55] 55.↵
Haller C. Hypoalbuminemia in Renal Failure: Pathogenesis and Therapeutic Considerations. Kidney and Blood Pressure Research. 2006;28(5-6):307–310. doi:10.1159/000090185
OpenUrl CrossRef

[56] 56.↵
Andersen LW, Mackenhauer J, Roberts JC, Berg KM, Cocchi MN, Donnino MW. Etiology and Therapeutic Approach to Elevated Lactate Levels. Mayo Clinic Proceedings. 2013;88(10):1127–1140. doi:10.1016/j.mayocp.2013.06.012
OpenUrl CrossRef PubMed Web of Science

[57] 57.↵
Weyker PD, Pérez XL, Liu KD. Management of Acute Kidney Injury and Acid-Base Balance in the Septic Patient. Clin Chest Med. 2016;37(2):277–288. doi:10.1016/j.ccm.2016.01.012
OpenUrl CrossRef

[58] 58.↵
Jentzer JC, Bihorac A, Brusca SB, et al. Contemporary Management of Severe Acute Kidney Injury and Refractory Cardiorenal Syndrome: JACC Council Perspectives. Journal of the American College of Cardiology. 2020;76(9):1084–1101. doi:10.1016/j.jacc.2020.06.070
OpenUrl FREE Full Text

[59] 59.↵
Grams ME, Sang Y, Coresh J, et al. Acute Kidney Injury After Major Surgery: A Retrospective Analysis of Veterans Health Administration Data. American Journal of Kidney Diseases. 2016;67(6):872–880. doi:10.1053/j.ajkd.2015.07.022
OpenUrl CrossRef PubMed

[60] 60.↵
Saran R, Robinson B, Abbott KC, et al. US Renal Data System 2017 Annual Data Report: Epidemiology of Kidney Disease in the United States. American Journal of Kidney Diseases. 2018;71(3):A7. doi:10.1053/j.ajkd.2018.01.002
OpenUrl CrossRef

[61] 61.↵
Danziger J, Chen KP, Lee J, et al. Obesity, Acute Kidney Injury, and Mortality in Critical Illness. Critical Care Medicine. 2016;44(2):328. doi:10.1097/CCM.0000000000001398
OpenUrl CrossRef PubMed

Forecasting Acute Kidney Injury and Resource Utilization in ICU patients using longitudinal, multimodal models

Abstract

1. Introduction

2. Materials and methods

2.1 Dataset description

2.2 Methodological framework

2.3 Time series embedding model

2.3.1 Logistic regression (LR)

2.3.2 Extreme Gradient Boosting (XGBoost)

2.3.3 Long Short-Term Memory (LSTM)

2.4 Clinical notes embedding model

2.4.1 BioMedBERT44

2.4.2 ClinicalBERT45

2.4.3 Probability calculation of solely BERT-based model

2.5 Multimodal Learning

3 Results

3.1 Data processing

3.2 Generation of training and testing data

3.3 Performance comparison

3.3.1 Time-series unimodal models’ comparison

3.3.2 Clinical notes unimodal models’ comparison

3.3.3 Performance of multimodal model

3.4 Elucidating Feature Importance in AKI and CRRT Prediction Using SHAP Values

3.5 Clinical notes analysis using BioMedBERT embeddings

4. Conclusions

Data Availability

Funding

CRediT authorship contribution statement

Declaration of competing interest

References

Citation Manager Formats

Subject Area

2.4.1 BioMedBERT⁴⁴

2.4.2 ClinicalBERT⁴⁵