- 1Department of Nephrology, The Affiliated Hospital of Qingdao University, Qingdao, China
- 2Department of Nephrology, Linyi People's Hospital, Linyi, China
Introduction: Acute kidney injury (AKI) is a prevalent complication in older people, elevating the risks of acute kidney disease (AKD) and mortality. AKD reflects the adverse events developing after AKI. We aimed to develop and validate machine learning models for predicting the occurrence of AKD, AKI and mortality in older patients.
Methods: We retrospectively reviewed the medical records of older patients (aged 65 years and above). To explore the trajectory of kidney dysfunction, patients were categorized into four groups: no kidney disease, AKI recovery, AKD without AKI, or AKD with AKI. We developed eight machine learning models to predict AKD, AKI, and mortality. The best-performing model was identified based on the area under the receiver operating characteristic curve (AUC) and interpreted using the Shapley additive explanations (SHAP) method.
Results: A total of 22,005 patients were finally included in our study. Among them, 4,434 patients (20.15%) developed AKD, 4,000 (18.18%) occurred AKI, and 866 (3.94%) patients deceased. Light gradient boosting machine (LGBM) outperformed in predicting AKD, AKI, and mortality, and the final lite models with 15 features had AUC values of 0.760, 0.767, and 0.927, respectively. The SHAP method revealed that AKI stage, albumin, lactate dehydrogenase, aspirin and coronary heart disease were the top 5 predictors of AKD. An online prediction website for AKD and mortality was developed based on the final models.
Discussion: The LGBM models provide a valuable tool for early prediction of AKD, AKI, and mortality in older patients, facilitating timely interventions. This study highlights the potential of machine learning in improving older adult care, with the developed online tool offering practical utility for healthcare professionals. Further research should aim at external validation and integration of these models into clinical practice.
1 Introduction
Acute kidney injury (AKI), a complex public health concern, prevalent in about 12% of patients (1–3) and often accompanied by multiple organ failure especially in older people, which leads to up to 1.7 million annual deaths (4–8). Studies reported that older AKI survivors face a considerable risk of progressing to chronic kidney disease (CKD) (9). The poor prognosis of older patients with kidney disease poses significant challenges to the healthcare system and result in a substantial economic burden on families due to multi-system damage or long-term hemodialysis treatment.
Current evidence indicates that AKI can progress to an intermediate stage called acute kidney disease (AKD), defined by the 16th Acute Disease Quality Initiative (ADQI) meeting as acute or subacute damage and/or loss of kidney function for 7–90 days after an AKI-initiating event (9). Distinguishing between AKD and AKI in clinical practice is crucial, as the management strategies and prognostic implications for these conditions differ. While AKI represents a sudden decline in kidney function, AKD encompasses a broader timeframe and includes patients who do not fully recover from an episode of AKI, presenting a poorer prognosis in older patients, with a study showing a 31.8% in-hospital mortality rate for older patients in the validation cohorts (10). Explaining this clinical distinction is vital for understanding the progression of kidney diseases and the necessity for targeted prediction models. As a transitional period, AKD may serve as a turning point for improving patients’ renal function and presents significant potential for clinical research. Developing accurate prediction models for AKD has substantial clinical implications. These models can facilitate early identification of at-risk patients, enabling timely interventions that may prevent further kidney damage and improve patient outcomes. However, current studies mainly focus on AKI, with insufficient exploration of AKD’s impacts and trajectories in the older adult, underscoring the importance of targeted research on AKD.
Recently, several studies have demonstrated that the superior predictive capabilities of machine learning (ML) models over traditional statistical methods in predicting AKI. For instance, in pediatric critical care, the prediction of Stage 2/3 AKI by a ML model showed an AUROC of 0.89 (11). The random forest (RF) model for predicting AKI in patients undergoing cardiac surgery achieved an AUC of 0.839 (12). Despite ML’s complexity, the SHapley Additive exPlanation (SHAP) method has been developed to make these models more interpretable (13, 14). Nevertheless, the application of ML and SHAP methods for the prediction of AKD in older patients remains limited.
Hence, the primary aim of this study was to investigate the incidence rates of AKD, AKI, and mortality among older patients, addressing a gap in the epidemiology of kidney injury trajectories in the older adult. Secondly, we aimed to pioneer the development of predictive ML models for AKD, AKI, and mortality. Furthermore, we integrated the SHAP approach to bolster the interpretability of prediction models. Finally, we have also developed an innovative online risk calculator rooted in ML algorithms. These may provide a critical window for early targeted interventions to improve the prognosis of the older adult, thereby alleviating pressure on healthcare systems.
2 Materials and methods
2.1 Data collection
We retrospectively reviewed the medical records of 40,325 patients aged ≥65 years between October 2012 and October 2019. Patients were excluded if they met one of the following criteria: continuous dialysis, renal transplantation before AKD diagnosis, less than two serum creatinine (Scr) tests during hospitalization or missing inpatient data and the duration of hospitalization <48 h. We collected data on demographic characteristics, comorbidities, laboratory parameters, and medications from the hospital information system. Comorbidities mentioned in this study were all defined according to the International Classification of Disease (ICD) 10th Revision. The study was approved by the Institutional Review Board (IRB; QYFY WZLL 28250), ensuring patient confidentiality through anonymized data collection and adherence to privacy protocols.
2.2 Definition
The primary outcome was the occurrence of AKD, with secondary outcomes including AKI and mortality. AKI was diagnosed based on Kidney Disease: Improving Global Outcomes (KDIGO) 2012 as follows: Scr level > 26.5 mmol/L (0.3 mg/dL) within 48 h; an increase in Scr to more than 1.5-fold the baseline-confirmed value or an increase presumed to have occurred within 7 days; or urine output <0.5 mL/kg/h for more than 6 h (15). AKD was defined following the 2017 ADQI as acute or subacute damage and/or loss of kidney function for a duration of between 7 and 90 days after exposure to an AKI initiating event (9). Diagnosis and staging of AKI and AKD were determined at the first fulfillment of these criteria.
Based on the diagnostic criteria of AKI and AKD, patients were classified into the following four groups. AKI Recovery: This group included patients whose Scr levels returned to baseline within 7 days, indicating a renal impairment duration of less than 7 days or a rapid recovery within that timeframe. AKD without AKI: This group comprised patients whose Scr levels increased gradually but remained elevated for more than 7 days, indicating subacute AKD without meeting the AKI criteria. AKD with AKI: Patients in this category experienced stage ≥1 AKI that persisted for at least 7 days after the initial AKI event, indicating a continuous progression from AKI to AKD. No Kidney Disease (NKD): Patients falling into this category had an eGFR of 60 mL/min/1.73 m2 or higher, no detectable albuminuria, and did not meet the criteria for either AKI or AKD. To thoroughly assess the influence of evolving kidney injury patterns on mortality among older patients, we integrated AKI and AKD into a unified metric termed ‘dynamic’ during the mortality model’s construction. The ‘dynamic’ variable adopts values 0, 1, 2 and 3 corresponding to NKD, AKI recovery, AKD without AKI, and AKD with AKI, respectively. Baseline Scr was defined as the first Scr value measured during hospitalization. The baseline estimated glomerular filtration rate (eGFR) was calculated using the Chronic Kidney Disease Epidemiology Collaboration formula (16).
2.3 Model development
We engineered predictive models for AKD, AKI, and mortality, respectively. Scikit-learn (https://github.com/scikit-learn/scikitlearn) package was used to build models including logistic regression (LR), support vector machine (SVM), random forest (RF), naïve byes (NB), k-nearest neighbor (KNN), multi-layer perceptron (MLP), gradient boosting machine (GBM) and light gradient boosting machine (LGBM). The data were divided, with 80% utilized for training and 20% for testing. Grid search method with ten-fold cross validation was used in the training set to prevent overfitting and to identify the optimal hyperparameters for each model. To address the disparity in the distribution of positive and negative samples, we implemented a strategy of class weight adjustment during the training phase of the ML model (17).
2.4 Model interpretation and evaluation
SHAP method was designed to address the “black-box” issue in prediction models by providing a means to rank the importance of input features and explain model results (14, 18). This approach offers both global and local explanations, enhancing our understanding of the model’s decision-making process. Globally, it provides consistent attribution values for each feature, revealing associations. Locally, it explains specific predictions for individual cases, enhancing interpretability. In our pursuit of feature optimization, we also utilized the SHAP method for feature selection in the optimal model. SHAP value-assisted feature selection was utilized to identify the top 20, 15, 10, and 5 features for model construction. This approach was to find the best balance between accuracy and complexity, leading to a final lite model. SHAP method was implemented using Python shap package (https://shap.readthedocs.io/en/latest/).
The performance of our predictive models was evaluated on the test set, focusing on their discriminative ability and clinical utility. Discrimination was quantitatively assessed using a suite of performance metrics, including area under curve (AUC) of the receiver operating characteristic (ROC) curve (19), sensitivity, specificity, recall, accuracy, F1 score, Brier score and Matthews correlation coefficient (MCC). The model demonstrating the highest AUC was designated as the optimal one. For clinical applicability, decision curve analysis (DCA) was employed, which calculated the net benefit of the final model by contrasting the predicted benefits against the expected risks associated with the outcomes (20). Furthermore, the performance of the final model was showed through precision-recall (PR) curves, Kolmogorov–Smirnov (KS) plots, and confusion matrix.
2.5 Online prediction website
We created an online web-based risk calculator utilizing the Streamlit Python framework, employing the model with the optimal number of features. Upon the values of corresponding features are provided, the website can return the probability of AKD and mortality, respectively. This tool showed the practical application of our research in a clinical setting.
2.6 Sensitivity analysis
A sensitivity analysis was performed to thoroughly examine the predictive efficacy of the models, focusing specifically on stages 2–3 of AKD. Additionally, the models’ performance underwent a thorough assessment across various subgroups, with a particular emphasis on patients stratified by age brackets: 65–74 years, 75–84 years, and those aged over 85 years.
2.7 Statistical analysis
Variables with over 15% missing values were excluded, while those with less than 15% missing data were imputed using the Multivariate Imputation by Chained Equations (MICE) algorithm (21). Continuous variables were shown as mean with standard deviation, or median with interquartile range and compared by the Independent-sample T test or Wilcoxon rank-sum test. Categorial variables were expressed in quantities and percentages and compared by the Chi-square tests. All analyses were carried out with Python version 3.10.11, R version 4.3.1, and SPSS version 25.0. A 2-tailed p value of <0.05 was considered statistically significant.
3 Results
3.1 Patient characteristics
In total, this study enrolled 22,005 patients (Supplementary Figure S1), in which 4,434 patients (20.15%) developed AKD, and 4,000 (18.18%) occurred AKI. Specifically, 2,237 patients (10.17%) had AKD with AKI, 2,671 (12.14%) had AKD without AKI, and 1,763 (8.01%) had AKI recovery. On top of that, there were 866 (3.94%) patients deceased. In the AKD group, 3,553 patients (16.15%) were at stage 1, 663 (3.01%) at stage 2, and 218 (0.99%) at stage 3. These findings suggested that the high occurrence of AKI and AKD among older patients.
The differences in characteristics between kidney injury group and NKD group are partially shown in Table 1, with a detailed comparison of all characteristics provided in Supplementary Table S1. In brief, compared to the NKD group, patients with acute/subacute kidney dysfunction were older on average (75.00 ± 13.00 vs. 73.00 ± 12.00, p < 0.05) with more risk factors like smoking, alcohol use, diabetes and other conditions. The baseline lab tests including eGFR, blood urea nitrogen (BUN), cystatin C (Cys), blood glucose, lipid profiles, uric acid (UA) and others were also worst in kidney dysfunction group (p < 0.05). Furthermore, the data indicated that patients with renal impairment endured longer hospital stays (18.00 ± 14.00 vs. 17.00 ± 9.00 days, p < 0.05) and encountered higher hospital mortality rates (9.6% vs. 1.5%, p < 0.05) in comparison to the NKD group. This signified that older patients with kidney dysfunction were susceptible to a worsening prognosis.
3.2 Feature selection and model performance
Eight ML models were developed to predict AKD occurrence in older patients, by utilizing all available features, with the ROC curves illustrated in Figure 1A. The LGBM model emerged as the most efficacious in predicting AKD, achieving an AUC of 0.781. The performance metrics of these eight ML models in predicting AKD were comprehensively tabulated in Table 2. Given LGBM’s superior performance, we subsequently conducted a feature selection process specifically within the LGBM model framework. Additionally, Supplementary Figure S2 presented a correlation matrix heatmap, delineating the interrelationships between the predictive outcomes of the various ML models.
Figure 1. Performance of eight ML models for different outcomes with all features. (A) The ROC curve of AKD. (B) The ROC curve of AKI. (C) The ROC curve of mortality.
To identify the most significant features, we ranked the importance of LGBM features using the SHAP method in the training set. The evaluation metrics for LGBM models with different numbers of features were presented in Table 3. The model’s AUC increased to 0.760 when considering the top 15 features, leading to notable improvements in accuracy and precision. However, expanding the feature set to 20 did not yield a substantial uplift in AUC, and the other performance metrics exhibited a tendency toward stabilization. Given that, we selected the top 15 critical variables as the final lite prediction model for AKD (Figure 2A). Performance of the final lite LGBM model for AKD were presented in Supplementary Figure S3. We showed a DCA demonstrating the model’s substantial clinical utility. Furthermore, the confusion matrix, KS plot, and PR curve demonstrated the model exhibited satisfactory classification capabilities and maintained a favorable balance between precision and recall.
Figure 2. Importance matrix plot and SHAP summary plot of the final lite LGBM model. (A) The importance ranking of the first 15 features of the LGBM model. (B) The SHAP summary plot demonstrates the general importance of each feature in LGBM model. The color bar on the right indicates the relative value of a feature in each case. Red dots indicate high values and blue dots indicate low values. The violin graph lining up on the midline is the aggregation of dots representing each case in the train set. The distance between the upper and lower margin of the violin graph represents the amount of the cases that end up with the same SHAP values offered by this feature. SHAP force plots of 4 examples of patients. Categorical features including AKI stage, CHD, Omeprazole and β-lactam antibiotics were represented by 0 and 1, while “0” means “No” and “1” means “Yes.” *ALB, albumin; LDH, lactate dehydrogenase, CHD, coronary heart disease; CK, creatine kinase; Cys, cystatin C; GGT, gamma-glutamyl transferase; Scr, serum creatinine, CCB, calcium channel blocker; RBC, red blood cell count.
We employed the aforementioned methodology to derive features and construct models for both AKI and mortality prediction, with detailed results included in the supplementary files. The ROC curves utilizing all available features were illustrated in Figures 1B,C. The LGBM emerged as the optimal model for both AKI and mortality predictions (Supplementary Tables S2, S3), with 15 features identified as the ideal number for model performance (Supplementary Tables S4, S5). The refined model of AKI had an AUC of 0.767. In addition, it’s worth emphasizing that the final lite LGBM model of mortality showed impressive predictive capabilities, achieving an AUC of 0.927, and high recall and accuracy at 0.731 and 0.933, respectively. The ROC curves and DCA of the final lite LGBM model for AKI and mortality were presented in Supplementary Figures S4, S5.
3.3 Model interpretations
The SHAP summary plot (Figure 2B) displayed the contributions of the feature to the model. The analysis revealed that the primary factors influencing the model’s predictions were AKI stage, albumin (ALB), lactate dehydrogenase (LDH), the use of aspirin, and coronary heart disease (CHD). SHAP dependence plots (Figure 3) facilitated understanding how a single feature affected the output of the prediction model and showed the relationship between two features at the same time. For instance, as the value of Cys increased, so did the SHAP value and AKI stage, which implied a rising risk of developing AKD and a positive correlation between Cys and AKI stage (Figure 3A). The SHAP interaction plot (Supplementary Figure S6) revealed the interactions between all features. Furthermore, local explanation analyzed how features contributing to a particular prediction for an individual. The force plots (Figure 4) mainly presented the major factors that contributed to the final model output in a certain individual. Furthermore, the SHAP decision plots for other four patients (Supplementary Figure S7) provided a clear visualization of the decision-making paths attributed to each feature.
Figure 3. SHAP dependence plots demonstrate the distribution of SHAP output value of a single feature. The colors on the dependence plot correspond to another feature that could potentially interact with the feature being analyzed. (A) The relationship between Cys and AKI stage SHAP values, with the color bar indicating various levels of AKI stage. (B) The relationship between Cys and Scr SHAP values, where the color bar represents different levels of Scr. (C) The relationship between Scr and AKI stage SHAP values, with the color bar also denoting distinct AKI stage levels. (D) The relationship between Scr and ALB SHAP values, with the color bar reflecting varying ALB levels. *ALB, albumin; Cys, cystatin C; Scr, serum creatinine.
Figure 4. Force plots of the final lite LGBM model. (A,B) Show the examples of patients predicted to have AKD. (C,D) Show the examples of patients predicted to be non-AKD. The features shown in red represent a higher risk of AKD, while the features shown in blue represent a lower risk. The plots help physicians identify the main features in the model that have high decision power at the individual level. Categorical features including AKI stage, CHD, Omeprazole and β-lactam antibiotics were represented by 0 and 1, while “0” means “No” and “1” means “Yes.” *ALB, albumin; LDH, lactate dehydrogenase, CHD, coronary heart disease; CK, creatine kinase; Cys, cystatin C; GGT, gamma-glutamyl transferase; Scr, serum creatinine, CCB, calcium channel blocker; RBC, red blood cell count.
The SHAP method was also used for the AKI and mortality models, and detailed results were in the supplementary files. For the AKI model, Scr was the top contributing factor, as expected (Supplementary Figure S8). In the mortality model, the ‘dynamic’ variable ranked second in terms of significance (Supplementary Figure S9). The increasing ‘dynamic’ grade correlated with rising SHAP values, suggesting a higher mortality risk, highlighting the significant impact of kidney injury trajectory on older patients’ survival rates.
3.4 Online prediction website
Based on the lite prediction models, we developed an online risk website to streamline external validation and assess AKD and mortality risk in older patients. https://xuly94-elderly-hospitalized-patients-app-app-dxfrws.streamlit.app/, which can promptly generate the estimated risk for AKD and mortality offering immediate support for clinical decision-making.
3.5 Sensitivity analysis
The LGBM model demonstrated robust predictive accuracy for AKD stages 2–3, achieving an AUC of 0.843 in the test set (Supplementary Figure S10A). This indicated the model’s enhanced capability in predicting more severe cases of AKD, which was crucial to improve patient outcomes. When tested across various age groups, the performance of the model also remained stable (Supplementary Figures S10B–D). Specifically, the model yielded its highest performance in the 65–74 age subgroup, with an AUC of 0.755.
4 Discussion
In this retrospective cohort study, we developed and validated ML algorithms to forecast AKD, AKI, and mortality among older patients. The LGBM algorithm exhibited the strongest discrimination capability across all three outcomes. Additionally, SHAP was used for individualized patient interpretations, and an online AKD and mortality risk calculator for older patients was created, aiding early prediction and intervention. To the best of our knowledge, our study is the first to establish ML models for AKD, AKI and mortality in older patients that are valuable for risk assessment and clinical decision-making.
Several investigations have been conducted to explore the epidemiology of AKD before. James et al. reported that among more than one million Canadian residents, AKD without AKI was common — the incidence per 100 of the population tested was 3.8 in individuals without preexisting CKD and 0.6 in individuals with pre-existing CKD (22). Su et al. reported the incidence rate of community-acquired AKD was 4.60%, while it was 28.2% for hospital-acquired AKD (23). In our own study cohort, we observed that 4,434 patients, accounting for 20.15% of the total, satisfied the criteria for AKD.
In recent years, ML methods have been widely employed in predicting AKI (24–27). However, there is comparatively limited research on predicting AKD, particularly in older patients. A nomogram was developed and validated to predict the transition from AKI to AKD in patients undergoing partial nephrectomy for renal masses, demonstrating good discrimination with a concordance index of 0.891 (95% CI: 0.830, 0.953) (28). Chen et al. demonstrated that predictive models of acute decompensated heart failure (ADHF) patients had C-statistics of 0.726 (95% CI: 0.712–0.740) for AKD (29). Li et al. found that SVM showed better discrimination in older patients admitted to the intensive care unit (ICU) with AUC of 0.810 and 0.776 in the training and external validation cohorts, respectively (10). Unlike their study, which focused on older patients with AKD in the ICU, our research encompassed a broader spectrum, targeting the entire older patient population within hospital settings. What’s more, we have crafted models for predicting not only AKD but also AKI and mortality among older patients.
International consensus emphasizes the importance of early detection and prevention of AKD to mitigate its impact on patients and healthcare systems (9). Although, in theory, all older patients would benefit from comprehensive preventive measures against AKD, technical limitations often hinder early intervention. To address this issue, the ML algorithm simplifies early prediction. Furthermore, an online prediction website utilizing LGBM models can quickly identify high-risk older patients. This enables early detection and preventive interventions to enhance the prognosis for older individuals. The SHAP summary plot and force plots in Figure 2 enhanced understanding of the model’s decision-making process and can further assist physicians in implementing targeted preventive interventions for AKD.
In our study, the importance of variables showed that AKI stage, ALB, LDH, the use of aspirin and CHD were the most important factors that contributed to the predicted occurrence of AKD among older patients. Numerous studies have shown that AKI is intricately linked to the development of AKD (23, 29–31). Although current studies predominantly focus on AKI, they also suggest that these factors are risk for renal function impairment, consistent with our findings. Specifically, low serum albumin levels and elevated LDH levels are both associated with AKI AND poor outcomes (32–40). Aspirin, a common NSAID, and CHD have also been identified as independent risk factors for AKI, particularly among older people (34, 41–44).
This study has several key clinical implications. Firstly, it represents the initial effort to compare the baseline characteristics and hospital mortality across three distinct renal function trajectories post-injury. Secondly, we have successfully formulated succinct yet highly discriminative LGBM models for AKD, AKI, and mortality. Thirdly, the application of the SHAP method mitigated the opacity of ML models by globally and locally identifying and elucidating the most influential features for all three outcomes. In addition, we selected an optimal number of features for our final model to ensure a balance between complexity and clinical applicability, emphasizing its practicality with features that are readily obtainable in standard clinical settings. Furthermore, our models have been designed for direct clinical use, exemplified by a web-based risk calculator that assesses the risk of AKD and mortality in older patients, thus providing physicians with a valuable tool to enhance decision-making.
Our study faced several limitations. Firstly, it had a single-center design and a lack of ethnic diversity, which may affect the generalizability of our findings. Additionally, the identification of AKD and AKI could benefit from incorporating more early diagnostic markers, such as cystatin C, to improve predictive accuracy. Besides, the retrospective nature of our data collection introduces potential recall and selection biases. To address these issues, future research should aim for nationwide, multi-center prospective trials to enhance the validation and reliability of our predictive models, ensuring their applicability across diverse populations, including testing and verifying the model among people of other ethnicities. Last but not least, this article aims to predict kidney injury in older adult patients without specifically distinguishing the etiology. Due to the complex conditions of older adult patients, including numerous underlying diseases, susceptibility to infections, use of nephrotoxic drugs, and other common causes of kidney injury, it is often the result of multiple factors combined (8). Therefore, we have established a universal, comprehensive, and representative risk prediction model. However, its effectiveness in predicting kidney injury caused by different specific factors may not be optimal. Consequently, in future research, we plan to conduct separate studies on kidney injury caused by specific factors, such as sepsis.
This study highlights the increased susceptibility of older patients to AKD. We presented LGBM models to forecast AKD, AKI, and mortality at the time of admission. Furthermore, the web tool we developed to identify high-risk AKD and mortality cases in older patients can aid in clinical decision-making. Moving forward, we will conduct nationwide, multi-center trials with diverse participation, validating our predictive models across various ethnic groups.
Data availability statement
The data underlying this article will be shared upon reasonable request to the corresponding author.
Ethics statement
The study was approved by the Institutional Review Board (IRB; QYFY WZLL 28250) of the Affiliated Hospital of Qingdao University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
XW: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing. LX: Writing – review & editing, Formal analysis, Methodology, Project administration, Software. CG: Formal analysis, Writing – review & editing, Data curation, Investigation. DX: Data curation, Formal analysis, Writing – review & editing. LC: Data curation, Writing – review & editing. YW: Data curation, Writing – review & editing. XM: Data curation, Writing – review & editing. CL: Data curation, Investigation, Project administration, Supervision, Validation, Writing – review & editing. YX: Funding acquisition, Project administration, Resources, Supervision, Validation, Visualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Taishan Scholar Program of Shandong Province (grant number tstp20230665); the National Natural Science Foundation of China (grant numbers 81970582 and 82270724); the Qingdao Key Health Discipline Development Fund; and the Qingdao Key Clinical Specialty Elite Discipline.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2024.1407354/full#supplementary-material
References
1. Mehta, RL, Cerdá, J, Burdmann, EA, Tonelli, M, García-García, G, Jha, V, et al. International Society of Nephrology's 0by25 initiative for acute kidney injury (zero preventable deaths by 2025): a human rights case for nephrology. Lancet. (2015) 385:2616–43. doi: 10.1016/S0140-6736(15)60126-X
2. al-Jaghbeer, M, Dealmeida, D, Bilderback, A, Ambrosino, R, and Kellum, JA. Clinical Decision Support for In-Hospital AKI. J Am Soc Nephrol. (2018) 29:654–60. doi: 10.1681/ASN.2017070765
3. Lewington, AJ, Cerdá, J, and Mehta, RL. Raising awareness of acute kidney injury: a global perspective of a silent killer. Kidney Int Suppl. (2013) 84:457–67. doi: 10.1038/ki.2013.153
4. Ronco, C, Bellomo, R, and Kellum, JA. Acute kidney injury. Lancet. (2019) 394:1949–64. doi: 10.1016/S0140-6736(19)32563-2
5. Ishani, A, Xue, JL, Himmelfarb, J, Eggers, PW, Kimmel, PL, Molitoris, BA, et al. Acute kidney injury increases risk of ESRD among elderly. J Am Soc Nephrol. (2009) 20:223–8. doi: 10.1681/ASN.2007080837
6. Fabbian, F, Savriè, C, De Giorgi, A, Cappadona, R, Di Simone, E, Boari, B, et al. Acute kidney injury and in-hospital mortality: a retrospective analysis of a Nationwide administrative database of elderly subjects in Italy. J Clin Med. (2019) 8:1371. doi: 10.3390/jcm8091371
7. Druml, W, Lenz, K, and Laggner, AN. Our paper 20 years later: from acute renal failure to acute kidney injury--the metamorphosis of a syndrome. Intensive Care Med. (2015) 41:1941–9. doi: 10.1007/s00134-015-3989-5
8. Chronopoulos, A, Cruz, DN, and Ronco, C. Hospital-acquired acute kidney injury in the elderly. Nat Clin Pract Nephrol. (2010) 6:141–9. doi: 10.1038/nrneph.2009.234
9. Chawla, LS, Bellomo, R, Bihorac, A, Goldstein, SL, Siew, ED, Bagshaw, SM, et al. Acute kidney disease and renal recovery: consensus report of the acute disease quality initiative (ADQI) 16 workgroup. Nat Rev Nephrol. (2017) 13:241–257. doi: 10.1038/nrneph.2017.2
10. Li, MA-O, Zhuang, Q, Zhao, S, Huang, L, Hu, C, Zhang, B, et al. Development and deployment of interpretable machine-learning model for predicting in-hospital mortality in elderly patients with acute kidney disease. Uremia Invest. (2022) 44:1886–96. doi: 10.1080/0886022X.2022.2142139
11. Dong, JA-O, Feng, T, Thapa-Chhetry, B, Cho, BG, Shum, T, Inwald, DP, et al. Machine learning model for early prediction of acute kidney injury (AKI) in pediatric critical care. Crit Care in pediatric critical care. (2021) 25:288. doi: 10.1186/s13054-021-03724-0
12. Tseng, PY, Chen, YT, Wang, CH, Chiu, KM, Peng, YS, Hsu, SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. (2020) 24:478. doi: 10.1186/s13054-020-03179-9
13. Azodi, CB, Tang, J, and Shiu, SH. Opening the Black Box: Interpretable Machine Learning for Geneticists. Trends Genet. (2020) 36:442–55. doi: 10.1016/j.tig.2020.03.005
14. Lundberg, S, and Lee, SI. A unified approach to interpreting model predictions. Proceedings of the 31st International Conference on Neural Information Processing Systems. (2017) 10:4768–4777. doi: 10.5555/3295222.3295230
15. Palevsky, PM, Liu, KD, Brophy, PD, Chawla, LS, Parikh, CR, Thakar, CV, et al. KDOQI US commentary on the 2012 KDIGO clinical practice guideline for acute kidney injury. Am J Kidney Dis. (2013) 61:649–72. doi: 10.1053/j.ajkd.2013.02.349
16. Levey, AS, Stevens, LA, Schmid, CH, Zhang, YL, Castro, AF 3rd, Feldman, HI, et al. A new equation to estimate glomerular filtration rate. Ann Intern Med. (2009) 150:604–12. doi: 10.7326/0003-4819-150-9-200905050-00006
17. Ren, YA-O, Wu, DA-O, Tong, YA-O, López-DeFede, AA-O, and Gareau, S. Issue of Data Imbalance on Low Birthweight Baby Outcomes Prediction and Associated Risk Factors Identification: Establishment of Benchmarking Key Machine Learning Models With Data Rebalancing Strategies. J Med Educ. (2023) 25:e44081. doi: 10.2196/44081
18. Lundberg, SM, Erion, G, Chen, H, DeGrave, A, Prutkin, JM, Nair, B, et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9
19. Hanley, JA, and McNeil, BJ. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology. (1983) 148:839–43. doi: 10.1148/radiology.148.3.6878708
20. Zhang, Z, Rousson, V, Lee, WC, Ferdynus, C, Chen, M, Qian, X, et al. Decision curve analysis: a technical note. Ann Transl Med. (2018) 6:308. doi: 10.21037/atm.2018.07.02
21. Azur, MJ, Stuart, EA, Frangakis, C, and Leaf, PJ. Multiple imputation by chained equations: what is it and how does it work? Int J Methods Psychiatr Res. (2011) 20:40–9. doi: 10.1002/mpr.329
22. James, MT, Levey, AS, Tonelli, M, Tan, Z, Barry, R, Pannu, N, et al. Incidence and prognosis of acute kidney diseases and disorders using an integrated approach to laboratory measurements in a universal health care system. JAMA Netw Open. (2019) 2:e191795. doi: 10.1001/jamanetworkopen.2019.1795
23. Su, CC, Chen, JY, Chen, SY, Shiao, CC, Neyra, JA, Matsuura, R, et al. Outcomes associated with acute kidney disease: a systematic review and meta-analysis. EClinicalMedicine. (2023) 55:101760. doi: 10.1016/j.eclinm.2022.101760
24. Bihorac, A, Ozrazgat-Baslanti, T, Ebadi, A, Motaei, A, Madkour, M, Pardalos, PM, et al. MySurgeryRisk: development and validation of a machine-learning risk algorithm for major complications and death after surgery. Ann Surg. (2019) 269:652–62. doi: 10.1097/SLA.0000000000002706
25. Guven, G, Brankovic, M, Constantinescu, AA, Brugts, JJ, Hesselink, DA, Akin, S, et al. Preoperative right heart hemodynamics predict postoperative acute kidney injury after heart transplantation. Intensive Care Med. (2018) 44:588–97. doi: 10.1007/s00134-018-5159-z
26. Hofer, IS, Lee, C, Gabel, EA-O, Baldi, P, and Cannesson, M. Development and validation of a deep neural network model to predict postoperative mortality, acute kidney injury, and reintubation using a single feature set. Digital Med. (2020) 3:58. doi: 10.1038/s41746-020-0248-0
27. Xue, B, Li, D, Lu, C, King, CR, Wildes, T, Avidan, MS, et al. Use of machine learning to develop and evaluate models using preoperative and intraoperative data to identify risks of postoperative complications. JAMA Netw Open. (2021) 4:e212240. doi: 10.1001/jamanetworkopen.2021.2240
28. Zhang, S, Jin, D, Zhang, Y, and Wang, T. Risk factors and predictive model for acute kidney Injury Transition to acute kidney disease in patients following partial nephrectomy. Urology. (2023) 23:156. doi: 10.1186/s12894-023-01325-3
29. Chen, JJ, Lee, TH, Kuo, G, Yen, CL, Chen, SW, Chu, PH, et al. Acute kidney disease after acute decompensated heart failure. Kidney Int. (2022) 7:526–36. doi: 10.1016/j.ekir.2021.12.033
30. See, EJ, Polkinghorne, KR, Toussaint, ND, Bailey, M, Johnson, DW, and Bellomo, R. Epidemiology and outcomes of acute kidney diseases: a comparative analysis. Am J Nephrol. (2021) 52:342–50. doi: 10.1159/000515231
31. Xu, L, Li, C, Li, N, Zhao, L, Zhu, Z, Zhang, X, et al. Incidence and prognosis of acute kidney injury versus acute kidney disease among 71 041 inpatients. NDT Plus. (2023) 16:1993–2002. doi: 10.1093/ckj/sfad208
32. Chen, L, Wu, X, Qin, H, and Zhu, H. The PCT to albumin ratio predicts mortality in patients with acute kidney injury caused by abdominal infection-evoked Sepsis. Front Nutr. (2021) 8:584461. doi: 10.3389/fnut.2021.584461
33. Ji, MS, Wu, R, Feng, Z, Wang, YD, Wang, Y, Zhang, L, et al. Incidence, risk factors and prognosis of acute kidney injury in patients treated with immune checkpoint inhibitors: a retrospective study. Sci Rep. (2022) 12:18752. doi: 10.1038/s41598-022-21912-y
34. Xu, L, Li, C, Zhao, L, Zhou, B, Luo, C, Man, X, et al. Acute kidney injury after nephrectomy: a new nomogram to predict postoperative renal function. Nephrology. (2020) 21:181. doi: 10.1186/s12882-020-01839-0
35. James, MT, Ghali, WA, Tonelli, M, Faris, P, Knudtson, ML, Pannu, N, et al. Acute kidney injury following coronary angiography is associated with a long-term decline in kidney function. Kidney Int. (2010) 78:803–9. doi: 10.1038/ki.2010.258
37. Wu, Y, Lu, C, Pan, N, Zhang, M, An, Y, Xu, M, et al. Serum lactate dehydrogenase activities as systems biomarkers for 48 types of human diseases. Sci Rep. (2021) 11:12997. doi: 10.1038/s41598-021-92430-6
38. Guan, C, Li, C, Xu, L, Zhen, L, Zhang, Y, Zhao, L, et al. Risk factors of cardiac surgery-associated acute kidney injury: development and validation of a perioperative predictive nomogram. J Nephrol. (2019) 32:937–45. doi: 10.1007/s40620-019-00624-z
39. Zhang, Z, Hu, X, Jiang, Q, Hu, W, Li, A, Deng, L, et al. Clinical characteristics and outcomes of acute kidney injury in patients with severe fever with thrombocytopenia syndrome. Front Virol. (2023) 14:6091. doi: 10.3389/fmicb.2023.1236091
40. Rizk, S, Abdel Moneim, AA-O, Abdel-Gaber, RA, Alquraishi, MI, Santourlidis, S, and Dkhil, MA. Nephroprotective Efficacy of Echinops spinosus against a Glycerol-Induced Acute Kidney Injury Model. American Chem. Soci. Omega. (2023) 8:41865–75. doi: 10.1021/acsomega.3c06792
41. Yuan, Y, Qiu, H, Hu, XY, Luo, T, Gao, XJ, Zhao, XY, et al. Risk factors of contrast-induced acute kidney injury in patients undergoing emergency percutaneous coronary intervention. Chin Med J. (2017) 130:45–50. doi: 10.4103/0366-6999.196578
42. Mittal, A, Tamer, P, Shah, I, Cortes, A, and Hinman, AD. Postoperative acute kidney injury with dual NSAID use after outpatient primary Total joint arthroplasty. J Am Acad Orthop Surg. (2022) 30:676–81. doi: 10.5435/JAAOS-D-21-00934
43. Kim, JY, Yee, J, Yoon, HY, Han, JM, and Gwak, HS. Risk factors for vancomycin‐associated acute kidney injury: a systematic review and meta‐analysis. Br J Clin Pharmacol. (2022) 88:3977–89. doi: 10.1111/bcp.15429
Keywords: acute kidney disease, hospital mortality, risk prediction, machine learning, older people
Citation: Wang X, Xu L, Guan C, Xu D, Che L, Wang Y, Man X, Li C and Xu Y (2024) Machine learning-based risk prediction of acute kidney disease and hospital mortality in older patients. Front. Med. 11:1407354. doi: 10.3389/fmed.2024.1407354
Edited by:
Tao-Hsin Tung, Taizhou Hospital of Zhejiang Province Affiliated to Wenzhou Medical University, ChinaReviewed by:
Pranjal Sharma, Northeast Ohio Medical University, United StatesBin Yi, Army Medical University, China
Copyright © 2024 Wang, Xu, Guan, Xu, Che, Wang, Man, Li and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chenyu Li, bGljbnl1QDE2My5jb20=; Yan Xu, eHV5YW5AcWR1LmVkdS5jbg==
†These authors have contributed equally to this work and share first authorship