Classifying the risk for myasthenic crisis using data-driven explainable machine learning with informative feature design and variance control – a pilot study
================================================================================================================================================================

* Sivan Bershan
* Andreas Meisel
* Philipp Mergenthaler

## Abstract

**Importance** Myasthenic crisis (MC) is a critical progression of Myasthenia gravis (MG), requiring intensive care treatment and invasive therapies. Classifying patients at high-risk for MC facilitates treatment decisions and helps prevent disease progression.

**Objective** To test whether machine learning models trained with real-world routine clinical data can aid precisely identifying MG patients at risk for MC.

**Design** This is a pseudo-prospective cohort study of MG patients presenting since January 2010.

**Setting** Single center.

**Participants** A cohort of 51 MG patients was used for model training based on a defined set of real-world clinical data. The cohort was created from a convenience sample of 13 MC patients matched based on sex, five-year age band, antibody status, thymus pathology with MG patients who had not suffered an MC. Data analyses and model refinements were performed from June 2022 to May 2023.

**Exposure** Classification of MG patients to high or low risk for MC using Lasso regression or random forest machine learning models.

**Main Outcomes and Measures** The accuracy of the risk classification was assessed by patient.

**Results** This study included 51 MG patients (13 MC, 38 non-MC; median age MC group 70.5, non-MC group 65.5). The mean cross-validated AUC classifying MG patients as high or low risk for MC based on simple or compound features derived from real-world routine clinical data showed a predictive accuracy of 68.8% for the regularized Lasso regression and of 76.5% for the random forest model. Feature importance scores suggest that multimorbidity may play a role in risk classification. Different thresholds were applied to tune model performance to optimal parameters. Studying result stability across 100 runs further indicated that the random forest model was better suited to cope with feature variance. Studying feature importance across 5100 model runs identified explainable features to distinguish MG patients at high or low risk for MC.

**Conclusions and Relevance** In this study, feasibility of classifying risk for MC based on real-world routine clinical data using machine learning was shown. The models showed accurate and consistent performance indicating the utility of personalized risk assessment in MG patients using machine learning models.

**Question** Can machine learning models be used to classify Myasthenia gravis patients into groups at high or low risk for myasthenic crisis with high precision based on explainable data-driven features derived from real-world clinical data?

**Findings** In this pseudo-prospective study of 51 Myasthenia gravis patients, the risk of myasthenic crisis using real-world clinical data was accurately classified employing two machine learning models with explainable features.

**Meaning** These findings suggest that it is possible to classify the risk for myasthenic crisis in patients based on real-world clinical data with high precision.

Keywords
*   Myasthenic crisis
*   myasthenia gravis
*   risk classification
*   explainable machine learning
*   precision medicine
*   rare disease

## Introduction

Myasthenia gravis (MG) is a rare chronic autoimmune disease causing fatigable muscle weakness due to auto-antibody-mediated decrease in neuromuscular transmission with a prevalence of 40-180 per 1 million people1,2. Myasthenic crisis (MC) defines critical exacerbation of MG which can be life threatening due to respiratory insufficiency and requires intensive care treatment, mechanical ventilation as well as invasive therapeutic procedures such as plasmapheresis. Up to 15-20% of MG patients develop MC over the course of their lifetime3,4. Although there has been a significant decrease in mortality of MC over the last decades, current figures for mortality are variable, but still reported as up to 5-12%4–6. Even though the presence of thymoma, MuSK autoantibodies7, stress, infections or inappropriate treatment are known risk factors for MC, among others, it is still impossible to anticipate MC or predict which patients develop MC in a cohort of patients at risk. The ability to predict which patients have a high risk of MC will help make confident and personalized treatment decisions and thereby help utilize resources more effectively.

In predictive modeling the objective is to accurately project the chances that a specific event will or will not happen, thereby optimizing for prediction accuracy and not for the understanding of root causes. While byproducts such as the feature importance can give insight into why an event occurs, the primary interest lies in predicting if it will occur8. In principle, two classes of models are suited for prediction: regression, generally used for predicting a continuous numeric outcome, and classification for categorical outcomes.

Here, we investigated whether it is possible to reliably classify Myasthenia gravis patients into groups at low- or high risk of MC based entirely on routine medical data in a proof-of-concept pseudo-prospective pilot study. Ultimately, our goal is to support making treatment decisions in a clinical context. Thus, we used real-world routine medical data such as common laboratory values and other case-associated data to classify patients from a pilot cohort of MG patients into MC risk groups. In an explainable data-driven approach, we investigated how to best classify patients into risk groups using either a regularized linear machine learning model to account for highly correlated features or a random forest classifier to minimize noise.

## Methods

### Protocol approval and patient consent

This study was approved by the local ethics committee (no. EA4/068/22). Informed consent was not required for this retrospective analysis.

### Study design and participants

This study is a pilot study to demonstrate feasibility of MC prediction based on real-world routine clinical data. In this study our dependent variable allowed for the two categories: “myasthenic crisis” or “no myasthenic crisis”. Thus, we treated this as a classification model.

We chose a 2-step approach in a pseudo-prospective manner (i.e., occurrence of MC was unknown to the machine learning models). First, we used univariate logistic regression to assess feature importance, and then we compared regularized regression with random forest classification to classify risk into low or high risk for MC. Details of model generation, performance testing and validation are given in the statistical analysis section below. In order to perform pseudo-prospective predictive analysis, we designed a cohort of MG patients from retrospective medical data of patients all treated since January 2010 until recent at the Integrated Myasthenia gravis Center of the Dept. of Neurology at Charité – Universitätsmedizin Berlin, a large academic tertiary care center, certified for applying standardized clinical pathways and patient management by the German Myasthenia Gravis Society. 53 patients with MC admitted to the neurological intensive care unit were screened, of which 13 were included in this pilot study (Table 1). To establish the final cohort for analysis, we initially matched each MC patient with up to four MG patients without MC based on (in order of priority) sex, five-year age band, antibody status, thymus pathology. After data cleanup (below), we were able to match 38 control patients.

View this table:
[Table 1:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/T1)

Table 1: Clinical and demographic characteristics of patients

MC was defined as exacerbation of myasthenic symptoms with bulbar or general weakness requiring mechanical ventilation. Diagnosis of MG was established based on antibody findings, repetitive nerve stimulation or clinical assessment. Current data analyses and model refinements were performed from June 2022 to May 2023.

### Data curation and preprocessing

All data were obtained from the patients’ electronic health records or from Charité’s Health Data Platform (HDP) which hosts up-to-date retrospective snapshots of the entire hospital management system, including laboratory values. To eliminate hindsight bias, we removed data that were generated within 6 months after the MC.

Input data were routine clinical data such as bloodwork, data relating to current hospital admission (e.g., length of stay, path through hospital, etc.), as well as treatment details (e.g., medication, procedures, etc.). A full list of considered features is shown in Supplementary Table 1. The data were cleaned from impossible values (e.g., negative laboratory values), normalized and standardized. The matching criteria were excluded as classifiers, since by design they were similar in test and control groups. Two patients with MC had to be excluded from the analysis because they had no remaining data after eliminating the data generated in the half year period after the crisis.

### Statistical and Machine Learning Analysis

#### Modeling Approach

Analyses were performed in R version 3.6.1 (R Project for Statistical Computing). For a full list of libraries used see Supplementary Table 2.

We used several feature categories for analyses: bloodwork, hospitalization, and treatment details (see Data Curation above for details). In regression models it is typical to use one row by patient and to depict trend in the feature design. An example is the number of encounter days by patient in general and the corresponding trend feature would be the average encounter days by patient per six months. If applicable, we added minimum, maximum, median, and standard deviation for each value as simple features. For features that did not change over time (e.g., age of onset), this was omitted. For the regularized regression, we also allowed pairwise interactions. In the random forest model, we allowed only simple features and no interactions, because the tree structure itself allows for nonlinear relationships. For the full list of features used see the labels in Fig. 3.

We set a minimum completeness level of 80% per feature, meaning that at least 41 patients had to have a value for a particular feature. Out of originally more than 2000 possible features, 696 feature candidates reached the completeness threshold to be considered in the models. The missing values for the features used were computed with the mice package9 using predictive mean matching for numeric features. This method predicts the value to be imputed based on all other values except the dependent variable. Then it draws a small set of candidate donors closest to the predicted value and draws one of these randomly10. Factor data follows the same process with the exception that the prediction is performed with a polyregression.

In a first step, we determined feature importance in a logistic regression model. Features with a p-value of ≤ 0.05 were then used in a regularized Least Absolute Shrinkage and Selection Operator (Lasso) regression11 to account for many features being highly correlated among the top 50 from the first step. The Lasso regression algorithm identified 8 - 11 features per run that were most predictive. The parameter λ controls the strength of the shrinkage, where an increase in λ results in an increase in shrinkage and an increase in variance. Due to the significant reduction in features, variance is introduced through the model. We thus also calculated a random forest model to gauge if variance was controlled well.

#### Performance Metrics and Validation

Our primary model performance metric in both second phase models was the mean of the cross-validated area under the receiver-operator curve (AUC) over 100 runs of training. AUC is a classification threshold independent metric, contrary to comparison metrics such as sensitivity and specificity which are highly dependent on what threshold is chosen to distinguish the groups.

Accounting for the small data set, we performed leave-one-out cross-validation12. Standard metrics such as sensitivity, specificity, and precision were used to evaluate model performance. We also ran 100 cycles of training to account for two sources of randomness – imputation and the L1-regularization. L1-regularization penalizes the sum of absolute values and is sparse, meaning it sets all variables but the top ones to zero and doesn’t use them. Finally, we scrambled the target variable and verified that the results had a significantly lower AUC.

### Data and code availability

Feature categories and lists are published as supplement to this manuscript (Supplementary Table 1). Ethical approval currently does not permit sharing of raw data. The analysis code will be made available upon reasonable request.

## Results

### Demographics and clinical characteristics

The cohort consisted of 51 Myasthenia gravis patients with 13 patients who suffered from at least one MC (9 patients had one MC, 4 had two or more MC) and 38 controls (Table 1). The median age in the MC group was 70.5, whereas the non-MC group showed a median age of 65.5. Overall, 38 patients were AChR antibody positive and the remaining 13 were antibody negative. One patient tested positive for both AChR as well as MuSK, who we matched against AChR single-positive patients due to a lack of other controls.

### Model Performance

The Lasso regression model allowed the distinction between the MC and non-MC groups with a mean [standard deviation, sd] AUC of 68.8% [8.1%] (Figure 1A, Table 2), and random forest with a mean AUC of 76.7% [4%] (Figure 1B, Table 2).

![Fig. 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F1.medium.gif)

[Fig. 1:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F1)

Fig. 1: 
Prediction results independent of threshold (**A)** for the Lasso regression, and (**B**) for the random forest prediction. The random forest prediction performs better in terms of AUC. The black line is the area under the curve for the prediction, the shaded dark blue area represents one standard deviation confidence intervals (CI) and the light blue 2 standard deviations CI. The labels show the thresholds and the respective CI. The purple line represents the prediction with the randomized target variable.

View this table:
[Table 2:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/T2)

Table 2: Result summary at different thresholds (TH)

Three configuration examples of the Lasso regression model (Table 2) show that the best accuracy (i.e., classifying most patients correctly) is not what the model should be optimized for. It is our major aim to correctly identify patients at risk for MC. It is therefore critical to reduce false negatives, because false negatives mean that patients with high risk of suffering an MC would be classified as “low risk” and may be overlooked. We thus shifted the threshold splitting the two categories to reduce false negatives as much as possible. For this, we looked at the average predicted score by patient.

The confusion matrices for the Lasso regression prediction (Fig. 2A) show that at a threshold of 0.1, 35 out of 51 patients would be considered high risk for MC. Ten of these were correctly classified. The high number of 25 false positives was accompanied by a low number of false negatives (3 patients). Accordingly, at a threshold of 0.3, 13 patients were categorized as high risk, 7 of these true positives. 38 patients were considered low risk, whereas 6 patients were false negatives. In this setting, the number of false negatives doubled compared to the lowest threshold. To gauge the result range, at a threshold of 0.8, 50 patients were considered low risk, 1 patient was considered high risk, and 12 MC patients were false negatives. In the random forest model (Fig. 2B) and at a threshold of 0.1, 48 patients (94%) were considered high risk. 13 of these were correctly classified, whereas 0 patients were false negatives. At a threshold of 0.3, 29 patients (57%) were considered high risk. Of these, 9 were correctly classified and 4 patients were false negatives. At a threshold of 0.8, 0 patients (0%) were considered high risk. The goal is to maintain reasonable group sizes allowing the allocation of significantly more resources to those in high risk. False negatives should be avoided, even if this means that the precision of the model is lowered.

![Fig. 2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F2.medium.gif)

[Fig. 2:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F2)

Fig. 2: 
Confusion matrices of the classification results for the **(A)** Lasso regression model, and the **(B)** random forest model. *Prediction on the x-axis and ground truth on the y-axis*.

![Fig. 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F3.medium.gif)

[Fig. 3:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F3)

Fig. 3: 
Feature importance according to mean coefficient across 100 predictions. **(A)** Feature importance for the Lasso regression. The occurrence by feature varies, because of the L1 regularization. The shading of the bars indicates the number of runs in which the feature was considered important enough (light grey = few runs, dark grey = many runs). In this context, feature importance is measured with negative or positive correlation with the dependent variable. The values around 0 are the least important ones. **(B)** Feature importance for the random forest prediction scaled to −1 to 3. In this case, the feature importance is calculated by comparing the prediction results after leaving out a feature. In this case, the negative values indicate that a feature is harmful to the prediction.

### Predictor Importance and result stability

Result stability varied by patient (Supplementary Fig. 1). However, while there were five particularly volatile prediction sets (#9, #10, #13, #20, #28) in the Lasso regression, there was only one volatile prediction set (#20) in the random forest model (Supplementary Fig. 1). This indicates that particularly the accuracy of the Lasso regression would benefit from more feature data per patient. These five patients all had an MC but were predicted as low-risk several times. We then compared the Lasso regression model with a random forest model to evaluate how much control can be asserted to the variance in the data, especially since the Lasso regression introduced variance by the chosen lambda (Supplementary Fig. 2). The high variance across runs in AUC as well as the high standard deviation of predictive score by patient across 100 predictions (Supplementary Fig. 3) suggest that using a higher number of patients as well as more data points (i.e., features) per patient would increase the prediction accuracy and reduce the number of cases when patients switch groups across runs. The chosen level of 20% imputation maximum was adequate in terms of not superseding the variance introduced by the model itself. (Supplementary Fig. 2).

Feature importance for the Lasso regression is shown with coefficients from −20 to 50 (Fig. 3A). Negative coefficients indicate anti-correlation of a feature with MC. In the Lasso regression, the features used most were the mean trend of lymphocytes (mean [sd] coefficient 191.9 [64.9]) and the path of the patient through hospital units throughout the hospital stay (mean [sd] coefficient 40.8 [14.0]) across a total of 5100 runs of the model (51 patients * 100 runs). Both were used in 99% of the runs. The most important feature in the random forest was the relative minimum trend of creatinine measurements. As the use of each feature across runs is similar in the random forest model, feature importance is measured by its accuracy coefficient after leave-one-out validation (Fig. 3B). For the random forest, the feature importance ranges from −1 to 3. Contrary to the Lasso regression, negative feature importance indicates that these features are harmful to the accuracy of the model. A full list of considered features is shown in Supplementary Table 1. Both machine learning models identified creatinine in various forms as a highly relevant feature for classification. This may suggest that multimorbidity may play a role in MC risk classification.

## Discussion

MC is the most critical presentation of MG and poses a significant burden to MG patients. It is still associated with a high morbidity, mortality, negative impact on quality of life, and requires intensive care medical treatment4,6,7,13,14. Thus, accurately predicting patients at risk for MC could aid treating patients preventively to avoid critical MG progression to MC as well as properly directing scarce clinical resources.

There is a growing number of studies using machine learning and artificial intelligence for research on autoimmune diseases (e.g., on type 1 diabetes15, multiple sclerosis16, rheumatoid arthritis17, and Crohn’s disease18,19), yet, in many cases focusing on genetic risk assessment. Furthermore, some applications of predictive modeling in medicine have focused on predicting Parkinson’s disease in patients before the actual clinical diagnosis20 or predicting the risk for exacerbation in autoimmune diseases21.

Although epidemiological assessment22, diagnosis23, classification of disease subtypes24, and therapeutic discovery25 for rare diseases are thought to be aided by various machine learning approaches, studies employing machine learning to monitor disease progression in MG are scarce. Further, even though disease progression models to describe disease course over time are now frequently used in drug development26, clinical use of disease progression modeling, e.g., to aid clinical decision making, is uncommon, particularly in rare diseases including MG. Recently, basic clinical data have been used to train a random forest classifier to predict short term clinical outcome in MG27. However, classifying the risk of MC based on real-world clinical data of MG patients, particularly using routine laboratory values and further case-associated data readily available at the point of care, has not been the subject of studies using machine learning for predictive modeling or risk classification.

Challenges in predictive studies in medicine are general availability of relevant clinical data, high variance in treatment procedures as well as in treatment quality across institutions and health care systems. Furthermore, in most published cases machine learning models and data have been generated specifically for the purpose of a particular prediction task in the context of a controlled trial21,28,29. Thus, risk prediction of new patients outside of clinical trials using real-world medical data which is generated as a part of routine treatment will be difficult.

Thus, as proof of concept, in this study we focused on using only real-world clinical data to infer patterns from patient data. We ultimately aim to make the results applicable within existing treatment procedures towards personalized disease management such as by aiding to define individual monitoring intervals or to quantify the risk of disease progression posed by a treatment change. Real-world clinical data most faithfully represents the acute disease phenotype of patients, particularly in the case of rare diseases30. It could also simply enhance a patient’s quality of life by easing the mental burden and adjusting the monitoring intervals31. Furthermore, accurate prediction of clinical exacerbation of disease using real-world clinical data aids establishing individual therapeutic concepts and tailored treatment decisions. To this end, our goal was to predict the risk for MC using two different machine learning models trained on real-world routine clinical data. Indeed, our data suggest that it is possible to discern MG patients at risk for MC from patients not at risk for MC with comparable performance as in other predictive studies on auto-immune diseases18. Intriguingly, the identification of creatinine, a marker of kidney injury, in various feature forms as a highly scoring features might suggest that multimorbidity, which commonly involves kidney injury, might place MG patients at high risk for MC. Indeed, multimorbidity is a known factor contributing to poor outcome in MC4.

Studies on risk prediction in MG usually use classical statistical models27 to address clinical subtypes such as thymoma patients32–34, specific clinical situations such as initial steroid treatment35 or MG subtypes classified by specific autoantibodies and prediction of factors for clinical remission. Our study used multidimensional data from a heterogenous MG cohort to learn distinctive features with the aim of predicting MC in general, and thus this work contributes a generalizable model. Furthermore, all previously available studies have a prognostic focus31–34, i.e., the objective is to understand which features are predictive. On the contrary, our approach is feature agnostic. We here primarily aimed at high fidelity, i.e., robust model performance maintaining reasonable group sizes as well as avoiding false negatives, in predicting whether a patient is at high risk of MC.

Classically, precision medicine has considered large scale sequencing data to tailor individual treatment decisions. Technological advances have made the use and integration of genomic, transcriptomic, and proteomic data possible36. While these additional data, without a doubt, retain analytic value, restricted availability of these data in a day-to-day treatment context creates a barrier between such diagnostic instruments and their utility in guiding treatment decisions. Indeed, the utility of large-scale genomic data in predicting the risk in complex sporadic conditions has been questioned37. Thus, advanced precision medicine should consider multimodal clinical data beyond the classical OMICS approaches. Real-world clinical data more closely reflect the medical phenotype of the patient30 and thereby aid understanding individual disease patterns and using individual risk factors for managing disease beyond a patient’s genotype38. To address this, we here investigated whether it is possible to support clinical decision making by unbiased analysis of readily available clinical data in a routine treatment setting.

Both machine learning models used herein (Lasso regression and random forest) are well understood and widely used machine learning algorithms to predict disease progression in a large variety of clinical scenarios20,39–43. The difference in AUC between the two models was expected, since Lasso regression was used due to the high correlation of features. The Lasso regression algorithm is known to introduce randomness (i.e., noise) because of the chosen lambda. Random forest classification was chosen specifically to control for noise in the data. In our dataset, trend features are highly predictive for the risk of MC (Fig. 3). Afterall, it is plausible to see a worsening of features in case of an imminent but not yet apparent MC. Vice versa, a patient who may be at high risk for a MC during a given visit may be in much better health (i.e., lower risk for MC) a few months later. Thus, risk for MC is not only dependent on the patient, but it can also be quite different for the same patient at two different points in time. Thus, model choice and feature design not only enabling, but focusing on a non-linear view of disease progression likely produce a more predictive result of the disease trajectory. The reduction of variance across and by patient can be addressed by including more patients as well as more data points per patient to the training set. Finally, the use of data contained in electronic health records, including laboratory parameters as features allows analyzing the reason for the good performance of the prediction algorithms. From an ethical point of view, this is a critical consideration as these models will have the capacity to alter medical decisions44.

### Limitations

Our findings are limited by the small sample size of 51 patients and uneven matching in some subgroups. Furthermore, for proof of concept only a selection of the available data points from the health care records were used. To prevent selection bias and better and more stable score distinction by patient, future studies will benefit from larger sample sizes as well as larger data sets including more – if not all – direct and indirect clinical parameters per patient. Furthermore, the context in which the patients live (e.g., whether patients were received nursing care or lived alone/independent or their geographic location of country vs. city) seemed to impact the outcome when reviewing the narrative of individual patient charts. Making this information machine-interpretable, would likely contribute a novel set of predictive features.

## Conclusions

This study shows that it is possible to classify the risk for MC in MG patients using longitudinal real-world clinical data. Both models showed accurate and consistent performance indicating the utility of personalized risk assessment in MG patients using machine learning models. As our models are improved, we anticipate that they will become valuable clinical tools for clinical decision support and allow unraveling the heterogeneity of MG disease phenotypes.

## Author contributions

S.B. devised and performed analytical strategies, S.B. and P.M. conceived and P.M. supervised the project, P.M. and A.M. provided clinical data, S.B. and P.M. wrote and edited the paper, A.M. edited the paper for intellectual content.

## Data Availability

Feature categories and lists are published as supplement to this manuscript (Supplementary Table 1). Ethical approval currently does not permit sharing of raw data. The analysis code will be made available upon reasonable request.

## Funding

This study did not receive dedicated funding. PM is Einstein Junior Fellow funded by the Einstein Foundation Berlin and acknowledges funding support by the Einstein Foundation Berlin (EJF- 2020–602; EVF-2021–619) and the Leducq Foundation for Cardiovascular and Neurovascular Research (Consortium International pour la Recherche Circadienne sur l’AVC). Besides funding, the sponsoring organizations did not play any role in the design and conduct of the consensus meetings; preparation, review, or approval of the manuscript; or decision to submit the manuscript for publication.

## Disclosures / Conflicts of interest

S.B. is co-owner of exago.ml, a geoanalytics-focused machine learning company. A.M. has received speaker honoraria, consulting fees or (institutional) financial research support from Alexion Pharmaceuticals Inc., Argenx, Grifols SA, Hormosan Pharma GmbH, Janssen, Octapharma, and UCB. He is chairman of the medical advisory board of the German Myasthenia Gravis Society. P.M. has been on the board of HealthNextGen.

## Supplementary Data

![Supplementary Figure 1:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F4.medium.gif)

[Supplementary Figure 1:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F4)

Supplementary Figure 1: 
The two graphs show the stability of each patient’s predictions in the two sets of runs with different models (left: lasso regression (glm), right: random forest (rF). The black number in the middle is the mean predicted risk score for MC by patient. Each bar is a representation of all runs of the model (100 in each case). Bars that are entirely light yellow, are patients who had a crisis and were consistently predicted high risk (above the threshold (here 0.3). The bars that are light blue represent patients that were false positives, meaning they were considered high risk, but had no crisis. The high number of false positives is intended to reduce the numbers of false negatives. The dark gray bars mean that these are below the threshold and thus considered low risk. Dark gray in combination with light yellow means that these were test group members but alternated in their prediction between the groups. In both models, there are patients that were below the threshold, but had a crisis. Examples are #9, #10, #12, #13, #20, and #28 in the Lasso regression and #8, #9 and #20 in the random forest. Dark gray in combination with light blue means that these were control group members but alternated in their prediction between the groups. One interesting example for this is patient #51 in both predictions. While the patient is classified as low risk in 80% of cases in the Lasso regression model and predicted as high risk in 80% of the cases of the random forest model, this patient had no myasthenic crisis.

![Supplementary Figure 2:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F5.medium.gif)

[Supplementary Figure 2:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F5)

Supplementary Figure 2: 
Imputed values above threshold: This graph compares the runs of the Lasso regression above the threshold with the number of imputed values by patient. One dot represents one patient. Across the number of runs we find patients with imputed values above the threshold. On the lower right of the plot there are 3 patients that were predicted above the threshold in 80-100% of the cases. At the same time, they had up to 8 imputed values. There is no obvious relationship between “predicted above the threshold” and the number of imputed values. This suggests that most of the variance by patient stems from the Lasso regression’s feature selection process. Thus, selecting a model less impacted by randomness such as the random forest is likely to show better results.

![Supplementary Figure 3:](http://medrxiv.org/http://medrxiv.stage.highwire.org/content/medrxiv/early/2023/08/21/2023.08.19.23294175/F6.medium.gif)

[Supplementary Figure 3:](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/F6)

Supplementary Figure 3: 
This plot shows the mean predicted value by patient in 100 runs. A) Results by patient for the Lasso regression, B) results by patient for the random forest model. The orange bar represents the standard deviation by patient. High variation suggests that both models would benefit from more in-depth data by patient. For instance, one difference that seems relevant is whether patients receive professional nursing care or live in care facilities. However, in our data set, such data are currently only available from written physicians’ notes which at present cannot quantitatively be analyzed.

View this table:
[Supplementary Table 1](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/T3)

Supplementary Table 1 List of all considered features.

View this table:
[Supplementary Table 2](http://medrxiv.org/content/early/2023/08/21/2023.08.19.23294175/T4)

Supplementary Table 2 R-packages used and their version.

## Acknowledgments

We are grateful for support by the team of the Health Data Platform at the Berlin Institute of Health at Charité and for administrative support by S. Märschenz and S. Lischewski.

*   Received August 19, 2023.
*   Revision received August 19, 2023.
*   Accepted August 21, 2023.


*   © 2023, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Huijbers MG, Marx A, Plomp JJ, Le Panse R, Phillips WD. Advances in the understanding of disease mechanisms of autoimmune neuromuscular junction disorders. Lancet Neurol. Feb 2022;21(2):163–175. doi:10.1016/S1474-4422(21)00357-4
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1474-4422(21)00357-4&link_type=DOI) 

2.  2.Punga AR, Maddison P, Heckmann JM, Guptill JT, Evoli A. Epidemiology, diagnostics, and biomarkers of autoimmune neuromuscular junction disorders. Lancet Neurol. Feb 2022;21(2):176–188. doi:10.1016/S1474-4422(21)00297-0
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S1474-4422(21)00297-0&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

3.  3.Alshekhlee A, Miles JD, Katirji B, Preston DC, Kaminski HJ. Incidence and mortality rates of myasthenia gravis and myasthenic crisis in US hospitals. Neurology. May 5 2009;72(18):1548–54. doi:10.1212/WNL.0b013e3181a41211
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1212/WNL.0b013e3181a41211&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19414721&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

4.  4.Neumann B, Angstwurm K, Mergenthaler P, et al. Myasthenic crisis demanding mechanical ventilation: A multicenter analysis of 250 cases. Neurology. Jan 21 2020;94(3):e299–e313. doi:10.1212/WNL.0000000000008688
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToibmV1cm9sb2d5IjtzOjU6InJlc2lkIjtzOjk6Ijk0LzMvZTI5OSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIzLzA4LzIxLzIwMjMuMDguMTkuMjMyOTQxNzUuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

5.  5.Jani-Acsadi A, Lisak RP. Myasthenic crisis: guidelines for prevention and treatment. J Neurol Sci. Oct 15 2007;261(1-2):127–33. doi:10.1016/j.jns.2007.04.045
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jns.2007.04.045&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17544450&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000249874500016&link_type=ISI) 

6.  6.Mergenthaler P, Stetefeld HR, Dohmen C, et al. Seronegative myasthenic crisis: a multicenter analysis. J Neurol. Jul 2022;269(7):3904–3911. doi:10.1007/s00415-022-11023-z
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00415-022-11023-z&link_type=DOI) 

7.  7.Konig N, Stetefeld HR, Dohmen C, et al. MuSK-antibodies are associated with worse outcome in myasthenic crisis requiring mechanical ventilation. J Neurol. Dec 2021;268(12):4824–4833. doi:10.1007/s00415-021-10603-9
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s00415-021-10603-9&link_type=DOI) 

8.  8.Kuhn M, Johnson K. Applied Predictive Modeling. Springer New York; 2013.
    
    
9.  9.van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate Imputation by Chained Equations in R. J Stat Soft. 2011 2011;45(3)doi:10.18637/jss.v045.i03
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.18637/jss.v045.i03&link_type=DOI) 

10. 10.van Buuren S. Flexible Imputation of Missing Data, Second Edition. 2 ed. Chapman and Hall/CRC; 2018.
    
    
11. 11.Kassambara A. Machine learning essentials. Edition 1 ed. STHDA; 2017:197.
    
    
12. 12.Evgeniou T, Pontil M, Elisseeff A. Leave One Out Error, Stability, and Generalization of Voting Combinations of Classifiers. Machine Learning. 2004/04/01 2004;55(1):71–97. doi:10.1023/B:MACH.0000019805.88351.60
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1023/B:MACH.0000019805.88351.60&link_type=DOI) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000220203500004&link_type=ISI) 

13. 13.Angstwurm K, Vidal A, Stetefeld H, et al. Early Tracheostomy Is Associated With Shorter Ventilation Time and Duration of ICU Stay in Patients With Myasthenic Crisis-A Multicenter Analysis. J Intensive Care Med. Jan 2022;37(1):32–40. doi:10.1177/0885066620967646
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/0885066620967646&link_type=DOI) 

14. 14.Boscoe AN, Xin H, L’Italien GJ, Harris LA, Cutter GR. Impact of Refractory Myasthenia Gravis on Health-Related Quality of Life. J Clin Neuromuscul Dis. Jun 2019;20(4):173–181. doi:10.1097/CND.0000000000000257
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1097/CND.0000000000000257&link_type=DOI) 

15. 15.Wei Z, Wang K, Qu HQ, et al. From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes. PLoS Genet. Oct 2009;5(10):e1000678. doi:10.1371/journal.pgen.1000678
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pgen.1000678&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=19816555&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

16. 16.Corvol JC, Pelletier D, Henry RG, et al. Abrogation of T cell quiescence characterizes patients at high risk for multiple sclerosis after the initial neurological event. Proc Natl Acad Sci U S A. Aug 19 2008;105(33):11839–44. doi:10.1073/pnas.0805065105
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoicG5hcyI7czo1OiJyZXNpZCI7czoxMjoiMTA1LzMzLzExODM5IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDgvMjEvMjAyMy4wOC4xOS4yMzI5NDE3NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

17. 17.Chin CY, Hsieh SY, Tseng VS. eDRAM: Effective early disease risk assessment with matrix factorization on a large-scale medical database: A case study on rheumatoid arthritis. PLoS ONE. 2018/11/26/ 2018;13(11):e0207579. doi:10.1371/journal.pone.0207579
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0207579&link_type=DOI) 

18. 18.Giollo M, Jones DT, Carraro M, Leonardi E, Ferrari C, Tosatto SCE. Crohn disease risk prediction-Best practices and pitfalls with exome data. Hum Mutat. Sep 2017;38(9):1193–1200. doi:10.1002/humu.23177
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/humu.23177&link_type=DOI) 

19. 19.Wei Z, Wang W, Bradfield J, et al. Large sample size, wide variant spectrum, and advanced machine-learning technique boost risk prediction for inflammatory bowel disease. Am J Hum Genet. Jun 6 2013;92(6):1008–12. doi:10.1016/j.ajhg.2013.05.002
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ajhg.2013.05.002&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23731541&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

20. 20.Searles Nielsen S, Warden MN, Camacho-Soto A, Willis AW, Wright BA, Racette BA. A predictive model to identify Parkinson disease from administrative claims data. Neurology. Oct 3 2017;89(14):1448–1456. doi:10.1212/WNL.0000000000004536
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6OToibmV1cm9sb2d5IjtzOjU6InJlc2lkIjtzOjEwOiI4OS8xNC8xNDQ4IjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDgvMjEvMjAyMy4wOC4xOS4yMzI5NDE3NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

21. 21.Stafford IS, Kellermann M, Mossotto E, Beattie RM, MacArthur BD, Ennis S. A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. NPJ Digit Med. 2020/12// 2020;3(1):30. doi:10.1038/s41746-020-0229-3
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41746-020-0229-3&link_type=DOI) 

22. 22.Kariampuzha WZ, Alyea G, Qu S, et al. Precision information extraction for rare disease epidemiology at scale. J Transl Med. Feb 28 2023;21(1):157. doi:10.1186/s12967-023-04011-y
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12967-023-04011-y&link_type=DOI) 

23. 23.Miyachi Y, Ishii O, Torigoe K. Design, implementation, and evaluation of the computer-aided clinical decision support system based on learning-to-rank: collaboration between physicians and machine learning in the differential diagnosis process. BMC Med Inform Decis Mak. Feb 2 2023;23(1):26. doi:10.1186/s12911-023-02123-5
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1186/s12911-023-02123-5&link_type=DOI) 

24. 24.Faghri F, Brunn F, Dadu A, et al. Identifying and predicting amyotrophic lateral sclerosis clinical subgroups: a population-based machine-learning study. Lancet Digit Health. May 2022;4(5):e359–e369. doi:10.1016/S2589-7500(21)00274-0
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S2589-7500(21)00274-0&link_type=DOI) 

25. 25.Alves VM, Korn D, Pervitsky V, et al. Knowledge-based approaches to drug discovery for rare diseases. Drug Discov Today. Feb 2022;27(2):490–502. doi:10.1016/j.drudis.2021.10.014
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.drudis.2021.10.014&link_type=DOI) 

26. 26.Barrett JS, Nicholas T, Azer K, Corrigan BW. Role of Disease Progression Models in Drug Development. Pharm Res. Aug 2022;39(8):1803–1815. doi:10.1007/s11095-022-03257-3
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11095-022-03257-3&link_type=DOI) 

27. 27.Zhong H, Ruan Z, Yan C, et al. Short-term outcome prediction for myasthenia gravis: an explainable machine learning model. Ther Adv Neurol Disord. 2023/01// 2023;16:17562864231154976. doi:10.1177/17562864231154976
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1177/17562864231154976&link_type=DOI) 

28. 28.1.  Hameurlain A, 
    2.  Liddle SW, 
    3.  Schewe K-D, 
    4.  Zhou X
    
    Bisbal J, Engelbrecht G, Villa-Uriol M-C, Frangi AF. Prediction of Cerebral Aneurysm Rupture Using Hemodynamic, Morphologic and Clinical Features: A Data Mining Approach. In: Hameurlain A, Liddle SW, Schewe K-D, Zhou X, eds. Database and Expert Systems Applications. Springer Berlin Heidelberg; 2011:59–73.
    
    
29. 29.Takahashi N, Lee Y, Tsai DY, Matsuyama E, Kinoshita T, Ishii K. An automated detection method for the MCA dot sign of acute stroke in unenhanced CT. Radiol Phys Technol. Jan 2014;7(1):79–88. doi:10.1007/s12194-013-0234-1
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s12194-013-0234-1&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23996520&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

30. 30.Liu J, Barrett JS, Leonardi ET, Lee L, Roychoudhury S, Chen Y, Trifillis P. Natural History and Real-World Data in Rare Diseases: Applications, Limitations, and Future Perspectives. J Clin Pharmacol. Dec 2022;62 Suppl 2(Suppl 2):S38–S55. doi:10.1002/jcph.2134
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/jcph.2134&link_type=DOI) 

31. 31.Keles H, Ekici A, Ekici M, Bulcun E, Altinkaya V. Effect of chronic diseases and associated psychological distress on health-related quality of life. Intern Med J. Jan 2007;37(1):6–11. doi:10.1111/j.1445-5994.2006.01215.x
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1111/j.1445-5994.2006.01215.x&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=17199838&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

32. 32.Kim H, Lim YM, Lee EJ, Oh YJ, Kim KK. Factors predicting remission in thymectomized patients with acetylcholine receptor antibody-positive myasthenia gravis. Muscle Nerve. Dec 2018;58(6):796–800. doi:10.1002/mus.26300
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1002/mus.26300&link_type=DOI) 

33. 33.Takahagi A, Omasa M, Chen-Yoshikawa TF, et al. Anterior mediastinal tissue volume is correlated with antiacetylcholine receptor antibody level in myasthenia gravis. J Thorac Cardiovasc Surg. Jun 2018;155(6):2738–2744. doi:10.1016/j.jtcvs.2017.10.082
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jtcvs.2017.10.082&link_type=DOI) 

34. 34.Xue L, Wang L, Dong J, et al. Risk factors of myasthenic crisis after thymectomy for thymoma patients with myasthenia gravis. Eur J Cardiothorac Surg. Oct 1 2017;52(4):692–697. doi:10.1093/ejcts/ezx163
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/ejcts/ezx163&link_type=DOI) 

35. 35.Kanai T, Uzawa A, Kawaguchi N, Oda F, Ozawa Y, Himuro K, Kuwabara S. Predictive score for oral corticosteroid-induced initial worsening of seropositive generalized myasthenia gravis. J Neurol Sci. Jan 15 2019;396:8–11. doi:10.1016/j.jns.2018.10.018
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jns.2018.10.018&link_type=DOI) 

36. 36.Rajewsky N, Almouzni G, Gorski SA, et al. LifeTime and improving European healthcare through cell-based interceptive medicine. Nature. Nov 2020;587(7834):377–386. doi:10.1038/s41586-020-2715-9
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41586-020-2715-9&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom) 

37. 37.Dekkers OM, Mulder JM. When will individuals meet their personalized probabilities? A philosophical note on risk prediction. Eur J Epidemiol. Dec 2020;35(12):1115–1121. doi:10.1007/s10654-020-00700-w
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s10654-020-00700-w&link_type=DOI) 

38. 38.Uddin M, Wang Y, Woodbury-Smith M. Artificial intelligence for precision medicine in neurodevelopmental disorders. NPJ Digit Med. 2019/12// 2019;2(1):112. doi:10.1038/s41746-019-0191-0
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41746-019-0191-0&link_type=DOI) 

39. 39.Aheto JMK, Duah HO, Agbadi P, Nakua EK. A predictive model, and predictors of under-five child malaria prevalence in Ghana: How do LASSO, Ridge and Elastic net regression approaches compare? Prev Med Rep. Sep 2021;23:101475. doi:10.1016/j.pmedr.2021.101475
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.pmedr.2021.101475&link_type=DOI) 

40. 40.Fujino Y, Murata H, Mayama C, Asaoka R. Applying “Lasso” Regression to Predict Future Visual Field Progression in Glaucoma Patients. Invest Ophthalmol Vis Sci. Apr 2015;56(4):2334–9. doi:10.1167/iovs.15-16445
    
    [Abstract](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NDoiaW92cyI7czo1OiJyZXNpZCI7czoxNToiaW92cy4xNS0xNjQ0NXYxIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjMvMDgvMjEvMjAyMy4wOC4xOS4yMzI5NDE3NS5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

41. 41.Gokten ES, Uyulan C. Prediction of the development of depression and post-traumatic stress disorder in sexually abused children using a random forest classifier. J Affect Disord. Jan 15 2021;279:256–265. doi:10.1016/j.jad.2020.10.006
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.jad.2020.10.006&link_type=DOI) 

42. 42.Macaulay BO, Aribisala BS, Akande SA, Akinnuwesi BA, Olabanjo OA. Breast cancer risk prediction in African women using Random Forest Classifier. Cancer Treat Res Commun. 2021 2021;28:100396. doi:10.1016/j.ctarc.2021.100396
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.ctarc.2021.100396&link_type=DOI) 

43. 43.Manibardo E, Irusta U, Ser JD, et al. ECG-based Random Forest Classifier for Cardiac Arrest Rhythms. In: 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE; 2019:1504–1508.
    
    
44. 44.Kundu S. AI in medicine must be explainable. Nat Med. Aug 2021;27(8):1328. doi:10.1038/s41591-021-01461-z
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1038/s41591-021-01461-z&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=34326551&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2023%2F08%2F21%2F2023.08.19.23294175.atom)