Risk stratification of elderly patients with ischemic stroke (IS) admitted to the intensive care unit (ICU) remains challenging. This study aimed to establish and validate predictive models, based on novel machine learning (ML) algorithms, for 28-day in-hospital mortality in elderly patients with IS admitted to the ICU.
Data on elderly patients with IS admitted between 2014 and 2015 were extracted from the electronic intensive care unit (eICU) Collaborative Research Database (eICU-CRD). All selected participants were randomly divided into a training set and a validation set in an 8:2 ratio. ML algorithms, including Naïve Bayes (NB), eXtreme Gradient Boosting (xgboost), and logistic regression (LR), were applied for model construction with 10-fold cross-validation. Model performance was measured by the area under the receiver operating characteristic curve (AUC) and by accuracy. The SHapley Additive exPlanations (SHAP) method was used to make the models interpretable and to provide insight into their predictions.
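The splitting and evaluation steps described above can be sketched in plain Python. This is a minimal illustration, not the study's pipeline: the function names, the random seed, and the 100 placeholder record ids are assumptions, and no model is fitted here.

```python
import random

def split_80_20(ids, seed=0):
    """Shuffle record ids and split them 8:2 into training and validation lists."""
    rng = random.Random(seed)  # seed is an arbitrary assumption
    shuffled = list(ids)
    rng.shuffle(shuffled)
    cut = int(round(0.8 * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]

def k_folds(ids, k=10):
    """Yield (fit_ids, holdout_ids) pairs for k-fold cross-validation."""
    folds = [ids[i::k] for i in range(k)]
    for i in range(k):
        fit = [x for j, fold in enumerate(folds) if j != i for x in fold]
        yield fit, folds[i]

def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def accuracy(labels, scores, threshold=0.5):
    """Fraction of records whose thresholded score matches the label."""
    predicted = [1 if s >= threshold else 0 for s in scores]
    return sum(p == y for p, y in zip(predicted, labels)) / len(labels)

# Hypothetical placeholder ids, not the study cohort.
train_ids, valid_ids = split_80_20(range(100))
```

In this sketch the 10 folds would be drawn from `train_ids` only, so the validation set stays untouched until the final AUC and accuracy are computed.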
The analysis included 1,236 elderly patients with IS in the ICU, of whom 164 (13.3%) died during hospitalization. Eight features were selected for model construction. In the training set, the xgboost and NB models showed specificity values of 0.989 and 0.767, respectively. In the internal validation set, the xgboost model identified patients who died with an AUC of 0.733, outperforming the LR model (AUC 0.627) and the NB model (AUC 0.672).
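As a brief worked check of the quantities reported above, the sketch below recomputes the in-hospital mortality proportion from the stated counts and shows how a specificity value is derived. The confusion-matrix counts in the specificity example are hypothetical and are not taken from the study.

```python
def specificity(true_negatives, false_positives):
    """Share of survivors that the model correctly classifies as survivors."""
    return true_negatives / (true_negatives + false_positives)

# Counts stated in the results: 164 deaths among 1,236 patients.
deaths, total = 164, 1236
mortality_rate = deaths / total  # about 0.133, i.e. 13.3%

# Hypothetical counts, for illustration only.
example_specificity = specificity(true_negatives=90, false_positives=10)
```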
Among the models compared, the xgboost model showed the best performance in predicting mortality in elderly patients with IS in the ICU. Making the ML model explainable may help physicians better understand the reasoning behind the predicted outcome.
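The explainability mentioned here rests on Shapley values, which attribute a single prediction to the individual input features. The sketch below computes exact Shapley values for a toy two-feature risk score by averaging marginal contributions over all feature orderings. It is an illustration of the underlying idea only: it does not use the SHAP library (which relies on faster model-specific algorithms), and the scoring function and feature names are hypothetical rather than the study's model or features.

```python
from itertools import permutations

def shapley_values(predict, instance, baseline):
    """Exact Shapley values: average marginal contribution over all feature orderings."""
    names = list(instance)
    totals = {name: 0.0 for name in names}
    orderings = list(permutations(names))
    for order in orderings:
        current = dict(baseline)  # start from the reference input
        previous = predict(current)
        for name in order:
            current[name] = instance[name]  # switch one feature to its actual value
            value = predict(current)
            totals[name] += value - previous
            previous = value
    return {name: total / len(orderings) for name, total in totals.items()}

def toy_risk(x):
    """Hypothetical risk score with an interaction term (not the study's model)."""
    return 0.1 * x["feature_a"] + 0.2 * x["feature_b"] + 0.4 * x["feature_a"] * x["feature_b"]

patient = {"feature_a": 1, "feature_b": 1}
reference = {"feature_a": 0, "feature_b": 0}
contributions = shapley_values(toy_risk, patient, reference)
```

The contributions sum to the difference between the prediction for `patient` and the prediction for `reference`, which is the property that lets a per-feature breakdown be read alongside a single risk estimate.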