Machine learning for the prediction of all-cause mortality in patients with sepsis-associated acute kidney injury during hospitalization

Zhou, Hongshan; Liu, Leping; Zhao, Qinyu; Jin, Xin; Peng, Zhangzhe; Wang, Wei; Huang, Ling; Xie, Yanyun; Xu, Hui; Tao, Lijian; Xiao, Xiangcheng; Nie, Wannian; Liu, Fang; Li, Li; Yuan, Qiongjing

doi:10.3389/fimmu.2023.1140755

ORIGINAL RESEARCH article

Front. Immunol., 03 April 2023

Sec. Systems Immunology

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1140755

This article is part of the Research Topic: Clinical Application of Artificial Intelligence in Emergency and Critical Care Medicine, Volume IV

Hongshan Zhou¹†

Leping Liu²†

Qinyu Zhao³†

Xin Jin⁴

Zhangzhe Peng¹,⁵,⁶

Wei Wang¹,⁵,⁶

Ling Huang¹,⁵,⁶

Yanyun Xie¹,⁵,⁶

Hui Xu¹

Lijian Tao¹,⁵,⁶

Xiangcheng Xiao¹

Wannian Nie¹

Fang Liu⁷*

Li Li⁸*

Qiongjing Yuan¹,⁵,⁶,⁹*

¹Department of Nephrology, Xiangya Hospital of Central South University, Changsha, Hunan, China
²Department of Pediatrics, The Third Xiangya Hospital, Central South University, Changsha, China
³College of Engineering and Computer Science, Australian National University, Canberra, ACT, Australia
⁴Critical Care Medicine, The Third Xiangya Hospital, Central South University, Changsha, Hunan, China
⁵Organ Fibrosis Key Lab of Hunan Province, Central South University, Changsha, Hunan, China
⁶National International Joint Research Center for Medical Metabolomics, Xiangya Hospital, Central South University, Changsha, Hunan, China
⁷Health Management Center, Xiangya Hospital of Central South University, Changsha, Hunan, China
⁸Critical Care Medicine, Xiangya Hospital of Central South University, Changsha, Hunan, China
⁹National Clinical Medical Research Center for Geriatric Diseases, Xiangya Hospital of Central South University, Changsha, Hunan, China

Background: Sepsis-associated acute kidney injury (S-AKI) is associated with high morbidity and mortality, so a widely accepted model for predicting mortality is urgently needed. This study used machine learning to identify key variables associated with in-hospital mortality in S-AKI patients and to predict the risk of in-hospital death. We hope that this model can help identify high-risk patients early and support the reasonable allocation of medical resources in the intensive care unit (ICU).

Methods: A total of 16,154 S-AKI patients from the Medical Information Mart for Intensive Care IV database were divided into a training set (80%) and a validation set (20%). Variables (129 in total) were collected, including basic patient information, diagnosis, clinical data, and medication records. We developed and validated machine learning models using 11 different algorithms and selected the one that performed the best. Afterward, recursive feature elimination was used to select key variables. Different indicators were used to compare the prediction performance of each model. The SHapley Additive exPlanations package was applied to interpret the best machine learning model, which was provided as a web tool for clinicians. Finally, we collected clinical data of S-AKI patients from two hospitals for external validation.

Results: Fifteen critical variables were finally selected: urine output, maximum blood urea nitrogen, rate of injection of norepinephrine, maximum anion gap, maximum creatinine, maximum red blood cell volume distribution width, minimum international normalized ratio, maximum heart rate, maximum temperature, maximum respiratory rate, minimum fraction of inspired O₂, minimum creatinine, minimum Glasgow Coma Scale, and diagnoses of diabetes and stroke. The categorical boosting (CatBoost) model showed markedly better predictive performance than the other models, with an area under the receiver operating characteristic curve (AUROC) of 0.83 [accuracy (ACC): 75%, Youden index: 50%, sensitivity: 75%, specificity: 75%, F1 score: 0.56, positive predictive value (PPV): 44%, and negative predictive value (NPV): 92%]. The model also performed well on external validation data from two hospitals in China (AUROC: 0.75).

Conclusions: Using 15 key variables, a machine learning-based model for predicting the mortality of S-AKI patients was successfully established, and the CatBoost model demonstrated the best predictive performance.

Introduction

Sepsis is one of the principal causes of mortality worldwide and affects more than 19 million people every year (1–3). The Third International Consensus Definitions for Sepsis and Septic Shock (Sepsis-3) define it as life-threatening organ dysfunction caused by a dysregulated host response to infection. Similarly, the Kidney Disease: Improving Global Outcomes (KDIGO) group integrated previous diagnostic criteria and proposed an international consensus definition of acute kidney injury (AKI) as any of the following: (i) an increase in serum creatinine (SCr) level by more than 26.5 μmol/L (0.3 mg/dl) within 48 h; (ii) an increase in SCr level to more than 1.5 times the baseline (confirmed or presumed to have occurred within 7 days); or (iii) urine volume <0.5 ml/(kg·h) lasting for more than 6 h (4). In critically ill patients, sepsis has long been considered the main cause of AKI, and 45%–70% of AKI patients are considered to have sepsis (5). Thus, sepsis-associated acute kidney injury (S-AKI) should be defined as a syndrome that simultaneously meets the Sepsis-3 and KDIGO criteria (6).

The epidemiology of S-AKI has not been fully clarified, probably because the criteria used in epidemiological studies of sepsis and AKI have not been harmonized, but the global incidence is estimated to be 6 million cases annually (6). The mortality of S-AKI was reported to be 45.99% in the intensive care unit (ICU) (7), and a retrospective cohort study found that S-AKI was associated with a significantly higher mortality rate than sepsis without AKI (71.7% vs. 21.3%) (8). Many studies have shown that S-AKI imposes a heavy burden on patients. In a review, Hoste et al. summarized that the occurrence of AKI was related to the severity of sepsis and that S-AKI was responsible for increased disease acuity and burden of organ dysfunction (9). Bagshaw et al. conducted a multinational, multicenter observational cohort study, which reported that S-AKI was associated with a high crude in-hospital case fatality rate (51.8%) (5). Furthermore, a multicenter retrospective cohort study in China concluded that sepsis accounted for 32.0% of hospital-acquired AKI and 15.2% of community-acquired AKI, and that AKI was associated with high mortality, longer length of stay, and higher daily costs during hospitalization (10). Finally, an observational study of 618 ICU patients with AKI, the Program to Improve Care in Acute Renal Disease (PICARD), revealed that the in-hospital mortality rate of S-AKI was high regardless of whether sepsis occurred before AKI (48%) or after AKI (44%) (11).

Considering that S-AKI patients experience high morbidity and mortality, the precise prediction of their prognosis is necessary. Novel biomarkers like tissue inhibitor of metalloproteinases-2 (TIMP-2), neutrophil gelatinase-associated lipocalin (NGAL), and insulin-like growth factor binding protein-7 (IGFBP-7) have been evaluated to forecast the prognosis of S-AKI; however, their sensitivity has not been verified in large multicenter studies (12). Conventional scoring systems of severity, such as Sequential Organ Failure Assessment (SOFA) and Acute Physiology and Chronic Health Evaluation II (APACHE II), have been widely used in the ICU to predict outcomes. Regrettably, they lack discrimination and prediction accuracy, and external validation is required before application to S-AKI cohorts (13). Consequently, it is essential to establish a new model that efficiently and accurately predicts the outcomes of S-AKI.

Machine learning has been used in various medical fields owing to its ability to develop robust risk models and improve predictive power (14, 15). The accuracy of machine learning in predicting the occurrence of S-AKI has been confirmed (16–18). However, it has not yet been applied to predict the mortality of patients with S-AKI, which is equally important. Gradient boosted decision trees (GBDTs) are powerful machine learning ensemble techniques, particularly when massive amounts of data are involved in classification and regression tasks. As a member of the GBDT family, CatBoost is well suited to processing categorical, heterogeneous data (19). Since its debut, CatBoost has been used in some medical studies and has demonstrated excellent predictive ability.

Building on earlier research that focused on predicting the occurrence of S-AKI, this study aimed to identify the risk factors associated with mortality in patients with S-AKI and to develop a machine learning model to predict in-hospital death. The performance of this model was compared with that of 10 other machine learning models to test whether it performed better.

Materials and methods

Study subjects

The Medical Information Mart for Intensive Care IV (MIMIC-IV) is a database containing patient data from all ICU and emergency departments at Beth Israel Deaconess Medical Center from 2008 to 2019. The contents of the database include basic patient information, diagnosis, clinical data, and medication records, among others. We extracted the data of patients with sepsis and AKI after admission from the MIMIC-IV database as training and validation sets. Then, we collected the data of patients with sepsis and AKI in the ICU of Xiangya Hospital (from 2015 to 2022) and Third Xiangya Hospital (from 2022) of Central South University, Changsha, China as an external validation set (Figure 1).

Figure 1 | (A) The workflow of the study. (B) The algorithm chart of the study.

According to the KDIGO guidelines, AKI is characterized by one or more of the following: (i) an increase in serum creatinine (SCr) level by more than 26.5 μmol/L (0.3 mg/dl) within 48 h; (ii) an increase in SCr level to more than 1.5 times the baseline (confirmed or presumed to have occurred within 7 days); and (iii) urine volume <0.5 ml/(kg·h) lasting for more than 6 h. According to Sepsis-3, sepsis is characterized by life-threatening organ dysfunction caused by a dysregulated host response to infection, and organ dysfunction is identified as an increase in the total SOFA score of 2 points or more as a result of infection. Patients who were younger than 18 years of age, had stayed in the ICU for less than 24 h, or were missing essential data were excluded. We used multiple imputation to fill in missing values. The death group comprised patients who died in the hospital, and the alive group comprised patients who did not die during hospitalization.

All procedures in this study were conducted in accordance with the ethical standards of the responsible committee on human experimentation in China and with the Helsinki Declaration of 1975. The study followed the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) guideline (Supplementary Figure 1). The Xiangya Hospital of Central South University Ethics Committee reviewed and approved this study, which used machine learning to predict all-cause in-hospital mortality among patients with S-AKI, on 27 April 2022 (protocol number 202204101).

Study design and data collection

We collected 129 variables within 24 h of admission. The collected variables included patients’ basic information, diagnosis, medication records, clinical data such as temperature, blood pressure, concomitant disease, laboratory indicators, urine output (24-h urine volume after diagnosis of S-AKI), injection rate of norepinephrine (initial concentration of norepinephrine after diagnosis of S-AKI), and commonly used scores such as Simplified Acute Physiology Score II (SAPS-II), SOFA score, and Glasgow Coma Scale (GCS). The external validation set was derived from the electronic health record systems of Xiangya Hospital and Third Xiangya Hospital. The data were collected by two authors (LL and HZ). Data collected by different hospitals were converted to common units. For example, a norepinephrine injection rate of 1 mcg/kg/min equals 1 μg/kg/min, and creatinine concentrations were converted using 1 mg/dl = 88.4 μmol/L.
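The unit harmonization described above can be illustrated with a minimal Python sketch. The function names and the set of accepted unit spellings are assumptions made for illustration only; the sole value taken from the text is the creatinine factor of 88.4 μmol/L per mg/dl.

```python
# Conversion factor quoted in the text: 1 mg/dl of creatinine = 88.4 umol/L.
UMOL_PER_L_PER_MG_PER_DL = 88.4


def _normalize(unit):
    """Lower-case a unit label and map both micro symbols to 'u'."""
    return unit.strip().lower().replace("µ", "u").replace("μ", "u")


def creatinine_to_mg_dl(value, unit):
    """Return a creatinine value expressed in mg/dl (hypothetical helper)."""
    normalized = _normalize(unit)
    if normalized == "mg/dl":
        return float(value)
    if normalized == "umol/l":
        return value / UMOL_PER_L_PER_MG_PER_DL
    raise ValueError(f"Unsupported creatinine unit: {unit!r}")


def normalize_rate_unit(unit):
    """Map 'mcg/kg/min' and 'ug/kg/min' spellings to a single label.

    Both spellings denote the same unit, so only the label changes.
    """
    if _normalize(unit) in ("mcg/kg/min", "ug/kg/min"):
        return "mcg/kg/min"
    raise ValueError(f"Unsupported infusion-rate unit: {unit!r}")


print(creatinine_to_mg_dl(185.64, "μmol/L"))
print(normalize_rate_unit("μg/kg/min"))
```

Rejecting unknown units with an error, rather than passing values through unchanged, is a design choice of this sketch so that an unconverted value cannot silently enter a merged dataset.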

Statistical analysis

As appropriate, continuous variables were compared between the death and alive groups using either Student’s t-test or the rank-sum test. A chi-square test or Fisher’s exact test was used to compare categorical variables.

Then, the data were standardized such that the mean value was 0 and the standard deviation was 1. The K-nearest neighbor (KNN) algorithm was used to impute missing values. Next, the dataset was randomly split into a training set (80%) and a validation set (20%). On the training set, the recursive feature elimination (RFE) algorithm was used to identify crucial variables, and we developed a machine learning model based on categorical boosting (CatBoost) (20). RFE selects features by repeatedly fitting a model on progressively smaller feature sets until a given termination criterion is reached. In each loop, the importance of every feature in the trained model is ranked, and the lowest-ranked feature is removed; this recursive removal reduces dependencies and collinearity among the remaining features. As a final step, the most important features were retained, and the CatBoost model was developed based on this final set of features. Other features were not included because they brought only a small increment in the area under the receiver operating characteristic curve (AUROC) but significantly increased the difficulty of applying the model. The trained model was validated on the validation set, and the AUROC was calculated.
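The preprocessing and elimination loop described above can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the study's code: the data are invented, KNN imputation is omitted, and the importance function (the gap between class means) is a simple placeholder for the importance that would be read from a fitted model such as CatBoost. All function names are hypothetical.

```python
import random
import statistics


def standardize(column):
    """Rescale a numeric column to mean 0 and standard deviation 1."""
    mean = statistics.mean(column)
    sd = statistics.pstdev(column)
    return [(x - mean) / sd for x in column]


def split_rows(n_rows, train_fraction=0.8, seed=0):
    """Randomly split row indices into training and validation subsets."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    cut = round(n_rows * train_fraction)
    return indices[:cut], indices[cut:]


def class_mean_gap(features, labels):
    """Placeholder importance: absolute gap between the two class means."""
    scores = {}
    for name, column in features.items():
        died = [x for x, y in zip(column, labels) if y == 1]
        alive = [x for x, y in zip(column, labels) if y == 0]
        scores[name] = abs(statistics.mean(died) - statistics.mean(alive))
    return scores


def recursive_feature_elimination(features, labels, n_keep, importance_fn):
    """Drop the lowest-ranked feature, re-rank the rest, and repeat."""
    remaining = dict(features)
    while len(remaining) > n_keep:
        scores = importance_fn(remaining, labels)
        weakest = min(scores, key=scores.get)
        del remaining[weakest]
    return sorted(remaining)


# Invented toy data: 20 rows; the label alternates between 0 and 1.
labels = [i % 2 for i in range(20)]
raw = {
    "signal": [10 * (i % 2) + i % 3 for i in range(20)],  # tracks the label
    "weak": [(i % 2) + i % 5 for i in range(20)],         # weakly related
    "noise": [(i // 2) % 4 for i in range(20)],           # unrelated
}

scaled = {name: standardize(column) for name, column in raw.items()}
train_idx, val_idx = split_rows(len(labels), train_fraction=0.8, seed=0)
train_features = {n: [c[i] for i in train_idx] for n, c in scaled.items()}
train_labels = [labels[i] for i in train_idx]

selected = recursive_feature_elimination(
    train_features, train_labels, n_keep=1, importance_fn=class_mean_gap
)
print(selected)
```

Because the placeholder importance is computed per feature, re-ranking after each removal does not change the order here; with a model-based importance, the ranking can shift as correlated features are removed, which is the reason for the recursion.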

The proposed model was compared with 10 other machine learning models, namely, KNN, AdaBoost, multilayer perceptron (MLP), support vector machine (SVM), logistic regression (LR), naive Bayes, gradient boosting decision tree (GBDT), random forest, light gradient boosting (LightGBM), and extreme gradient boosting (XGBoost). These models were also developed on the training set and validated on the validation set. AUROC values were compared between these models and our CatBoost model. Additionally, other performance measures were examined, such as accuracy (ACC), Youden index, sensitivity, specificity, F1 score, positive predictive value (PPV), and negative predictive value (NPV).
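For readers less familiar with these measures, the following sketch shows one way to compute the AUROC, the cutoff that maximizes the Youden index, and the threshold-dependent metrics. The scores and labels are invented, the function names are hypothetical, and treating a score equal to the cutoff as a positive prediction is an assumption of this sketch.

```python
def auroc(scores, labels):
    """Chance that a random positive outranks a random negative (ties = 1/2)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def _ratio(numerator, denominator):
    """Divide, returning NaN when the denominator is zero."""
    return numerator / denominator if denominator else float("nan")


def metrics_at_cutoff(scores, labels, cutoff):
    """Confusion-matrix metrics when score >= cutoff is predicted positive."""
    tp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= cutoff and y == 0)
    fn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 1)
    tn = sum(1 for s, y in zip(scores, labels) if s < cutoff and y == 0)
    sensitivity = _ratio(tp, tp + fn)
    specificity = _ratio(tn, tn + fp)
    return {
        "accuracy": _ratio(tp + tn, tp + fp + fn + tn),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "youden": sensitivity + specificity - 1,
        "ppv": _ratio(tp, tp + fp),
        "npv": _ratio(tn, tn + fn),
        "f1": _ratio(2 * tp, 2 * tp + fp + fn),
    }


def best_cutoff_by_youden(scores, labels):
    """Return the observed score that maximizes the Youden index."""
    candidates = sorted(set(scores))
    return max(
        candidates,
        key=lambda c: metrics_at_cutoff(scores, labels, c)["youden"],
    )


# Invented predictions for nine patients (1 = died, 0 = survived).
labels = [0, 0, 0, 0, 0, 1, 1, 1, 1]
scores = [0.05, 0.10, 0.15, 0.20, 0.30, 0.25, 0.40, 0.60, 0.80]

cutoff = best_cutoff_by_youden(scores, labels)
print(auroc(scores, labels))
print(cutoff, metrics_at_cutoff(scores, labels, cutoff))
```

The AUROC summarizes ranking quality across all thresholds, whereas the other measures depend on the chosen cutoff, which is why the best cutoff is reported alongside them.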

To explain the model, the SHapley Additive exPlanations (SHAP) package in Python was used. The SHAP package uses a game-theoretic approach to interpret the output of a machine learning model (21). It connects optimal credit allocation with local explanations for each individual prediction. Two cases were analyzed using SHAP values to examine model interpretability. All statistical analyses in the present study were performed using Python (version 3.7.6); p < 0.05 was considered statistically significant.
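The credit-allocation idea behind SHAP can be illustrated by computing exact Shapley values for a tiny model. The sketch below is not the SHAP package and not the study's model: the risk function and its coefficients are invented, and a feature that is "absent" is simply set to a baseline value, which is a simplification of how tree explainers handle missing features.

```python
from itertools import permutations


def shapley_values(model, instance, baseline):
    """Average each feature's marginal contribution over all orderings."""
    names = list(instance)
    totals = {name: 0.0 for name in names}
    orderings = list(permutations(names))
    for order in orderings:
        current = dict(baseline)          # start with every feature "absent"
        previous = model(current)
        for name in order:
            current[name] = instance[name]  # switch this feature "on"
            updated = model(current)
            totals[name] += updated - previous
            previous = updated
    return {name: total / len(orderings) for name, total in totals.items()}


def toy_risk(x):
    """Invented risk score with one interaction term (illustration only)."""
    return (
        0.10
        + 0.20 * x["low_urine_output"]
        + 0.15 * x["on_norepinephrine"]
        + 0.10 * x["low_urine_output"] * x["on_norepinephrine"]
    )


patient = {"low_urine_output": 1, "on_norepinephrine": 1, "unused_flag": 1}
baseline = {"low_urine_output": 0, "on_norepinephrine": 0, "unused_flag": 0}

values = shapley_values(toy_risk, patient, baseline)
print(values)
print(sum(values.values()), toy_risk(patient) - toy_risk(baseline))
```

The contributions sum to the difference between the patient's prediction and the baseline prediction, and a feature the model never uses receives zero credit; these two properties are what make the per-feature contributions readable as an additive explanation of one prediction.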

Results

Study population

There were 16,154 patients included in the MIMIC-IV set, and relevant information on the cohort is presented in Table 1. The average age of the patients was 67.7 years, men accounted for 42.3%, and the average body mass index (BMI) was 30.9. In the cohort, 20.5% of the patients died in the hospital, and their length of stay in the ICU was 3.7 days, longer than that of patients in the alive group. Information on the external validation cohort is shown in Supplementary Table 1, and the overall workflow and algorithm chart are shown in Figure 1.

Table 1 | Most of the variables that differ between the two groups in the MIMIC-IV set.

Key variables

After utilizing the RFE algorithm, 15 essential variables were selected, namely, urine output, maximum blood urea nitrogen (BUN), rate of injection of norepinephrine, maximum anion gap, maximum creatinine, maximum red blood cell volume distribution width (RDW), minimum international normalized ratio (INR), maximum heart rate, maximum temperature, maximum respiratory rate, minimum fraction of inspired O₂ (FiO₂), minimum creatinine, minimum GCS score, and diagnosis of diabetes and stroke (Figure 2).

Figure 2 | The importance of each feature to the machine learning model.

Machine learning was then used to predict in-hospital death. The AUROC of the proposed CatBoost model was 0.827 (Figure 3). The CatBoost model markedly outperformed conventional LR (AUROC: 0.788) and the nine other machine learning models. As described in Table 2, the ACC, best cutoff, Youden index, sensitivity, specificity, F1 score, PPV, and NPV of the CatBoost model were 75%, 19.5%, 50%, 75%, 75%, 56%, 44%, and 92%, respectively. The corresponding values for LR were 73%, 20.1%, 44%, 71%, 74%, 52%, 41%, and 90%, respectively. In addition, the AUROC in the validation set reached 0.75, indicating the good applicability of our model (Supplementary Figure 2). For comparison with a conventional scoring system, a CatBoost model based on the SOFA score was built; its predictive ability was inferior to that of the proposed model in both the training and validation sets (Supplementary Figure 3). Aspartate aminotransferase (AST) was almost twice as high in the death group, and in the raw data the number of patients with AST greater than 45 U/L was almost equal to the number of patients with liver disease. Therefore, a CatBoost model was also established for a liver disease subgroup analysis, which likewise showed good predictive power for S-AKI mortality in this subgroup (Supplementary Figure 4).
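The reported CatBoost metrics are mutually consistent, which can be checked with a short calculation. The sketch below derives ACC, Youden index, PPV, NPV, and F1 score from the reported sensitivity and specificity (both 75%), under the assumption that the proportion of deaths in the evaluated set is close to the 20.5% in-hospital mortality of the MIMIC-IV cohort; the function name is hypothetical.

```python
def metrics_from_rates(sensitivity, specificity, prevalence):
    """Derive threshold metrics from sensitivity, specificity, prevalence."""
    tp = sensitivity * prevalence              # true-positive fraction
    fn = (1 - sensitivity) * prevalence        # false-negative fraction
    tn = specificity * (1 - prevalence)        # true-negative fraction
    fp = (1 - specificity) * (1 - prevalence)  # false-positive fraction
    return {
        "accuracy": tp + tn,
        "youden": sensitivity + specificity - 1,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "f1": 2 * tp / (2 * tp + fp + fn),
    }


# Sensitivity and specificity as reported; prevalence is an assumption
# taken from the in-hospital mortality of the whole MIMIC-IV cohort.
derived = metrics_from_rates(sensitivity=0.75, specificity=0.75,
                             prevalence=0.205)
for name, value in derived.items():
    print(f"{name}: {value:.3f}")
```

Under this assumption, the derived values fall close to those reported in Table 2. The calculation also shows why the PPV stays well below the sensitivity: with roughly one death in five patients, false positives from the larger surviving group outnumber true positives at this cutoff.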

Figure 3 | Receiver operating characteristic curves for the machine learning models and logistic regression in the training set. CatBoost, categorical boosting; GBDT, gradient boosting decision tree; LightGBM, light gradient boosting; AdaBoost, adaptive boosting; XGBoost, extreme gradient boosting; KNN, K-nearest neighbor; MLP, multilayer perceptron; LR, logistic regression; SVM, support vector machine.

Table 2 | Performance of machine learning models.

Application of the model

Analysis of the whole cohort with the SHAP package showed the crucial variables for predicting death (Figure 4). In the first example, the following patient information was entered into the model: history of stroke, minimum GCS score of 15, maximum heart rate of 121 beats per minute, maximum temperature of 36.56°C, maximum respiratory rate of 68 breaths per minute, maximum BUN level of 73 mg/dl, minimum INR of 2.9, maximum creatinine level of 3 mg/dl, minimum creatinine level of 2.1 mg/dl, maximum RDW of 16.8%, minimum FiO₂ of 100%, maximum anion gap of 31 mEq/L, urine output of 405 ml/day, and a norepinephrine injection rate of 0.499 mcg/kg/min. The model estimated the risk of in-hospital mortality at 28.9% (higher than the best cutoff), suggesting that the patient had a high risk of death (Example 1, Figure 4). In the second example, the patient information was as follows: no history of stroke or diabetes, minimum GCS score of 15, maximum heart rate of 86 beats per minute, maximum temperature of 36.94°C, maximum respiratory rate of 28 breaths per minute, maximum BUN level of 74 mg/dl, minimum INR of 1.1, maximum creatinine level of 4.1 mg/dl, minimum creatinine level of 3.5 mg/dl, maximum RDW of 14.9%, minimum FiO₂ of 70%, maximum anion gap of 18 mEq/L, urine output of 1,060 ml/day, and a norepinephrine injection rate of 0 mcg/kg/min. The probability of in-hospital mortality was predicted to be 18.37%, suggesting a good prognosis (Example 2, Figure 4).
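The interpretation of the two examples amounts to comparing each predicted probability with the best cutoff of 19.5% reported for the CatBoost model. A minimal sketch follows; the function name and the handling of a probability exactly equal to the cutoff are assumptions.

```python
BEST_CUTOFF = 0.195  # best cutoff reported for the CatBoost model


def classify_risk(predicted_probability, cutoff=BEST_CUTOFF):
    """Label a prediction relative to the cutoff (hypothetical helper)."""
    if predicted_probability > cutoff:
        return "high risk"
    return "lower risk"


# Predicted in-hospital mortality for the two examples in the text.
for label, probability in (("Example 1", 0.289), ("Example 2", 0.1837)):
    print(label, probability, classify_risk(probability))
```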

Figure 4 | Two examples of website tool usage. Enter the values of the 15 key variables to predict the risk of death and show the contribution of each value to the outcome. Example 1 has a higher risk of death, and Example 2 may have a better prognosis.

Discussion

Machine learning has been widely applied to medical and clinical problems and has become a popular research topic. Because of their shortcomings, novel biomarkers and conventional scoring systems lack sufficient power to estimate the mortality of S-AKI patients (12, 13). In this article, we examined whether machine learning improves mortality prediction in S-AKI patients and selected the model with the strongest predictive ability.

From the MIMIC-IV database used as a training set, 15 crucial variables were selected using the RFE algorithm. These variables are routinely available in various clinical settings, which means that they can be easily obtained and that the application of the model will not be limited by variables that are difficult to measure. Some studies have focused on the relative importance of individual variables in predicting prognosis. For example, a retrospective analysis of a prospective cohort by Sukmark et al. suggested that a lower GCS score was associated with in-ICU mortality, with an adjusted odds ratio of 4.16 (3.10, 5.60) (22). Serum creatinine has been extensively used as a predictor in severity scores that assess renal function and the adverse effects of renal dysfunction, such as SOFA and APACHE II. In addition, BUN has been reported to be associated with multiorgan failure, including kidney failure, and with long-term mortality in ICU patients regardless of admission diagnosis (23). Sukmark et al. also suggested that BUN possibly reflects multiorgan failure better than serum creatinine (22). Thus, several of these variables have individually been found to correlate with prognosis. However, few studies have combined them in one prediction model and quantified their ability to predict mortality.

After these 15 variables were identified, machine learning was applied to predict the mortality of patients during hospitalization. CatBoost is an open-source package implementing a GBDT algorithm that was announced in 2017. Compared with other GBDT algorithms, it performs better at handling categorical variables and reducing overfitting (24). To assess the CatBoost model, it was compared with 10 other machine learning models and with SOFA. The proposed model outperformed the others, with an AUROC of 0.827. Furthermore, we collected data from Xiangya Hospital and Third Xiangya Hospital, Central South University, China, as an external validation set, on which the AUROC was 0.754.

Compared with several other S-AKI-related clinical model studies (16–18), this study has several strengths. First, the fourth edition of the MIMIC database includes more patients, from 2017 to 2019, than the third edition, providing a larger and more recent dataset. Second, in contrast to the related studies, emphasis was placed on predicting the mortality of S-AKI patients for the first time. Third, this study not only used data from the database but also collected data from other hospitals for validation, making the model more reliable. In addition, our training set is from a Western country, while the external validation set is from China, suggesting that the model is applicable to different populations. Moreover, instead of using only one machine learning algorithm to build the model, we compared multiple algorithms and selected the one that performed best. Finally, since the chosen variables are easily accessible, the prediction model can be applied widely, including in areas with different levels of medical resources.

However, our study has some limitations. First, the training set data originated from only one database, while the external validation data came from two hospitals in one region; thus, selection bias may have occurred. Even so, the model built on the MIMIC-IV database retained good performance on the validation set from China, which supports its robustness. Nevertheless, more external validation is needed. Second, the variables were selected by the RFE algorithm, but the mechanisms underlying their association with mortality were not explored in our study.

As found in previous studies, S-AKI patients were more likely to receive mechanical ventilation and vasoactive therapy (9), as well as dialysis (70%) (11), and also had longer hospital stays (5). Prolonged hospital stays and expensive treatments impose an increasingly large economic burden on patients and medical insurance. Meanwhile, it is sometimes challenging for clinicians to decide which treatment to prioritize next when a patient's condition deteriorates rapidly. Consequently, applying the CatBoost-based model to identify high-risk S-AKI patients, predict prognoses in a timely and accurate manner, and provide clinicians with treatment decision-making suggestions may help reduce these burdens. In conclusion, we hope that the proposed model will assist clinicians in making better decisions and allocating medical resources reasonably.

Conclusions

This study demonstrates that predicting the mortality of S-AKI patients in the ICU is critical and that the CatBoost-based model we proposed outperformed conventional LR and nine other machine learning models. Further validation across diverse study centers will help verify the reliability and improve the validation efficiency of this model.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

This study was reviewed and approved by the Ethics Committee of the Xiangya Hospital of Central South University on 27 April 2022 (protocol number 202204101).

Author contributions

HZ: resources and writing—original draft; LPL: methodology, resources, validation, visualization, and writing—original draft; QZ: formal analysis, methodology, and validation; XJ: resources; ZP: investigation; WW: investigation; LH: investigation; YX: investigation; HX: supervision; LT: supervision; XX: supervision; WN: investigation; FL: review and editing; LL: review and editing; QY: review and editing and supervision. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Natural Science Foundation of Hunan Province, China (Grant Nos. 2020JJ5942, 2019JJ40515, and 2019JJ20035), the Major Program of the National Natural Science Foundation of China (Grant No. 82090024), the General Programs of the National Natural Science Foundation of China (Grant No. 82173877), and the Key Research and Development Program of Hunan Province (Grant No. 2021SK2015).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1140755/full#supplementary-material

Supplementary Figure 1 | TRIPOD Checklist: Prediction Model Development.

Supplementary Figure 2 | Receiver operating characteristic curves for the machine learning model and logistic regression in the validation set.

Supplementary Figure 3 | Receiver operating characteristic curves for the machine learning model of SOFA score. (A) ROC of the training set. (B) ROC of the validation set.

Supplementary Figure 4 | Receiver operating characteristic curves for the machine learning model of liver disease subgroup. (A) ROC of the training set. (B) ROC of the validation set.

Supplementary Table 1 | Demographic and comorbidity information on the external validation cohort.

References

1. Cecconi M, Evans L, Levy M, Rhodes A. Sepsis and septic shock. Lancet (2018) 392:75–87. doi: 10.1016/S0140-6736(18)30696-2

2. Prescott HC, Angus DC. Enhancing recovery from sepsis: A review. JAMA (2018) 319:62–75. doi: 10.1001/jama.2017.17687

3. Singer M, Deutschman CS, Seymour CW, Shankar-Hari M, Annane D, Bauer M, et al. The third international consensus definitions for sepsis and septic shock (Sepsis-3). JAMA (2016) 315:801–10. doi: 10.1001/jama.2016.0287

4. Palevsky PM, Liu KD, Brophy PD, Chawla LS, Parikh CR, Thakar CV, et al. KDOQI US commentary on the 2012 KDIGO clinical practice guideline for acute kidney injury. Am J Kidney Dis (2013) 61:649–72. doi: 10.1053/j.ajkd.2013.02.349

5. Bagshaw SM, Uchino S, Bellomo R, Morimatsu H, Morgera S, Schetz M, et al. Septic acute kidney injury in critically ill patients: clinical characteristics and outcomes. Clin J Am Soc Nephrol (2007) 2:431–9. doi: 10.2215/CJN.03681106

6. Peerapornratana S, Manrique-Caballero CL, Gómez H, Kellum JA. Acute kidney injury from sepsis: current concepts, epidemiology, pathophysiology, prevention and treatment. Kidney Int (2019) 96:1083–99. doi: 10.1016/j.kint.2019.05.026

7. Liu J, Xie H, Ye Z, Li F, Wang L. Rates, predictors, and mortality of sepsis-associated acute kidney injury: a systematic review and meta-analysis. BMC Nephrol (2020) 21:318. doi: 10.1186/s12882-020-01974-8

8. Bouchard J, Acharya A, Cerda J, Maccariello ER, Madarasu RC, Tolwani AJ, et al. A prospective international multicenter study of AKI in the intensive care unit. Clin J Am Soc Nephrol (2015) 10:1324–31. doi: 10.2215/CJN.04360514

9. Hoste EAJ, Kellum JA, Selby NM, Zarbock A, Palevsky PM, Bagshaw SM, et al. Global epidemiology and outcomes of acute kidney injury. Nat Rev Nephrol (2018) 14:607–25. doi: 10.1038/s41581-018-0052-0

10. Xu X, Nie S, Liu Z, Chen C, Xu G, Zha Y, et al. Epidemiology and clinical correlates of AKI in Chinese hospitalized adults. Clin J Am Soc Nephrol (2015) 10:1510–8. doi: 10.2215/CJN.02140215

11. Honoré PM, Jacobs R, Boer W, Joannes-Boyau O. Sepsis and AKI: more complex than just a simple question of chicken and egg. Intensive Care Med (2011) 37:186–9. doi: 10.1007/s00134-010-2097-9

12. Bellomo R, Kellum JA, Ronco C, Wald R, Martensson J, Maiden M, et al. Acute kidney injury in sepsis. Intensive Care Med (2017) 43:816–28. doi: 10.1007/s00134-017-4755-7

13. Demirjian S, Chertow GM, Zhang JH, O'Connor TZ, Vitale J, Paganini EP, et al. Model to predict mortality in critically ill adults with acute kidney injury. Clin J Am Soc Nephrol (2011) 6:2114–20. doi: 10.2215/CJN.02900311

14. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: machine learning and the future of medicine. J Intern Med (2018) 284:603–19. doi: 10.1111/joim.12822

15. Deo RC. Machine learning in medicine. Circulation (2015) 132:1920–30. doi: 10.1161/CIRCULATIONAHA.115.001593

16. Chaudhary K, Vaid A, Duffy Á, Paranjpe I, Jaladanki S, Paranjpe M, et al. Utilization of deep learning for subphenotype identification in sepsis-associated acute kidney injury. Clin J Am Soc Nephrol (2020) 15:1557–65. doi: 10.2215/CJN.09330819

17. He J, Lin J, Duan M. Application of machine learning to predict acute kidney disease in patients with sepsis associated acute kidney injury. Front Med (Lausanne) (2021) 8:792974. doi: 10.3389/fmed.2021.792974

18. Luo XQ, Yan P, Zhang NY, Luo B, Wang M, Deng YH, et al. Machine learning for early discrimination between transient and persistent acute kidney injury in critically ill patients with sepsis. Sci Rep (2021) 11:20269. doi: 10.1038/s41598-021-99840-6

19. Hancock JT, Khoshgoftaar TM. CatBoost for big data: an interdisciplinary review. J Big Data (2020) 7:94. doi: 10.1186/s40537-020-00369-8

20. Zhao QY, Liu LP, Luo JC, Luo YW, Wang H, Zhang YJ, et al. A machine-learning approach for dynamic prediction of sepsis-induced coagulopathy in critically ill patients with sepsis. Front Med (Lausanne) (2021) 7:637434. doi: 10.3389/fmed.2020.637434

21. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9

22. Sukmark T, Lumlertgul N, Praditpornsilpa K, Tungsanga K, Eiam-Ong S, Srisawat N. THAI-ICU score as a simplified severity score for critically ill patients in a resource limited setting: Result from SEA-AKI study group. J Crit Care (2020) 55:56–63. doi: 10.1016/j.jcrc.2019.10.010

23. Arihan O, Wernly B, Lichtenauer M, Franz M, Kabisch B, Muessig J, et al. Blood urea nitrogen (BUN) is independently associated with mortality in critically ill patients admitted to ICU. PLoS One (2018) 13:e0191697. doi: 10.1371/journal.pone.0191697

24. Dorogush AV, et al. CatBoost: gradient boosting with categorical features support. Available at: https://arxiv.org/abs/1810.11363 (Accessed September 1, 2020).

Keywords: sepsis, acute kidney injury, mortality, predictive model, machine learning

Citation: Zhou H, Liu L, Zhao Q, Jin X, Peng Z, Wang W, Huang L, Xie Y, Xu H, Tao L, Xiao X, Nie W, Liu F, Li L and Yuan Q (2023) Machine learning for the prediction of all-cause mortality in patients with sepsis-associated acute kidney injury during hospitalization. Front. Immunol. 14:1140755. doi: 10.3389/fimmu.2023.1140755

Received: 09 January 2023; Accepted: 17 March 2023;
Published: 03 April 2023.

Edited by:

Rahul Kashyap, WellSpan Health, United States

Reviewed by:

Pratikkumar Vekaria, University of South Carolina, United States
Pranjal Sharma, Northeast Ohio Medical University, United States

Copyright © 2023 Zhou, Liu, Zhao, Jin, Peng, Wang, Huang, Xie, Xu, Tao, Xiao, Nie, Liu, Li and Yuan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fang Liu, xyliufang@csu.edu.cn; Li Li, llicu@qq.com; Qiongjing Yuan, yuanqiongjing@csu.edu.cn

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
