ORIGINAL RESEARCH article

Front. Pharmacol., 23 May 2024

Sec. Renal Pharmacology

Volume 15 - 2024 | https://doi.org/10.3389/fphar.2024.1361923

Mortality prediction and influencing factors for intensive care unit patients with acute tubular necrosis: random survival forest and cox regression analysis

  • 1. Department of Epidemiology and Health Statistics, School of Public Health, Hangzhou Normal University, Hangzhou, China

  • 2. Department of Occupational and Environmental Health, School of Public Health, Hangzhou Normal University, Hangzhou, China

  • 3. Research on Accurate Diagnosis and Treatment of Tumor, School of Pharmacy, Hangzhou Normal University, Hangzhou, China

  • 4. Department of Nutrition and Toxicology, School of Public Health, Hangzhou Normal University, Hangzhou, China

Abstract

Background: Patients with acute tubular necrosis (ATN) not only have severe renal failure, but also have many comorbidities, which can be life-threatening and require timely treatment. Identifying the influencing factors of ATN and taking appropriate interventions can effectively shorten the duration of the disease to reduce mortality and improve patient prognosis.

Methods: Mortality prediction models were constructed by using the random survival forest (RSF) algorithm and the Cox regression. Next, the performance of both models was assessed by the out-of-bag (OOB) error rate, the integrated brier score, the prediction error curve, and area under the curve (AUC) at 30, 60 and 90 days. Finally, the optimal prediction model was selected and the decision curve analysis and nomogram were established.

Results: RSF model was constructed under the optimal combination of parameters (mtry = 10, nodesize = 88). Vasopressors, international normalized ratio (INR)_min, chloride_max, base excess_min, bicarbonate_max, anion gap_min, and metastatic solid tumor were identified as risk factors that had strong influence on mortality in ATN patients. Uni-variate and multivariate regression analyses were used to establish the Cox regression model. Nor-epinephrine, vasopressors, INR_min, severe liver disease, and metastatic solid tumor were identified as important risk factors. The discrimination and calibration ability of both predictive models were demonstrated by the OOB error rate and the integrated brier score. However, the prediction error curve of Cox regression model was consistently lower than that of RSF model, indicating that Cox regression model was more stable and reliable. Then, Cox regression model was also more accurate in predicting mortality of ATN patients based on the AUC at different time points (30, 60 and 90 days). The analysis of decision curve analysis shows that the net benefit range of Cox regression model at different time points is large, indicating that the model has good clinical effectiveness. Finally, a nomogram predicting the risk of death was created based on Cox model.

Conclusion: The Cox regression model is superior to the RSF algorithm model in predicting mortality of patients with ATN. Moreover, the model has certain clinical utility, which can provide clinicians with some reference basis in the treatment of ATN and contribute to improve patient prognosis.

1 Background

Acute tubular necrosis (ATN) is the most common type of acute renal failure (accounting for approximately 70%–80%) (Zhou et al., 2010). It is a clinical syndrome caused by ischemia of renal tissue or necrosis of tubular epithelial cells due to toxic damage, and resulting in a dramatic decrease in the glomerular filtration rate (An et al., 2022). It is often manifested as progressive azotemia, electrolyte disturbance, acid-base balance disorder, and a host of other symptoms. ATN patients not only have severe renal failure, but also have many comorbidities, such as myocardial infarction, congestive heart failure, peptic ulcer disease, etc., which can be life-threatening and require timely treatment. ATN is associated with high mortality, especially for those patients in the intensive care unit (ICU) (Rosen and Stillman, 2008). Understanding the influencing factors of ATN and taking appropriate interventions can effectively shorten the duration of the disease to reduce mortality and improve patient prognosis. Previous studies have shown that pH, base excess, creatinine, and blood urea nitrogen (BUN) are common influencing factors of ATN (Liaño et al., 1989). However, other potential risk factors that may affect the prognosis of patients with ATN have not yet been identified. In recent years, the continuous development of medical information technology and the popularization of electronic medical record systems have generated a large quantity of data for prognostic model evaluation and other clinical applications.

Cox regression model is the most common semi-parametric regression model, which can analyze the influence of multiple factors on outcome events and carry out statistical analysis on data with censoring (Moolgavkar et al., 2018). This method improves the efficiency and reliability of survival analysis by considering multiple predictor variables simultaneously (Koletsi and Pandis, 2017). It has been widely used in medical research, such as evaluating the survival rate of cancer patients, the risk of heart failure, and the prognosis prediction of patients (Hippisley-Cox and Coupland, 2015; Tang M. et al., 2021; Wang et al., 2022). On the other hand, machine learning (ML) is a data-based algorithmic technology that automatically analyzes and learns patterns and regularities in data to predict and optimize future outcomes, and is a fast-growing area. ML technology has also been applied in many aspects of medical research, which is important for improving medical care and promoting human health (Noorbakhsh-Sabet et al., 2019; Issa et al., 2021). Random survival forest (RSF) is a comprehensive method of random forest and survival analysis, which processes right censored data. Different from the general binary classification method, the target variable of survival analysis method is survival time. By training a large number of survival trees, the model votes out the final prediction results weighted from individual trees in the form of voting. RSF is a ML algorithm based on decision tree (Yosefian et al., 2015), which has good prediction accuracy without over-fitting, and it is suitable for survival analysis of many diseases (Farhadian et al., 2021; Roshanaei et al., 2022). The most important feature of the RSF algorithm is that it can rank the importance of variables in order to filter out those that have a greater impact on outcome indicators (Adham et al., 2017; Wang and Zhou, 2017). Moreover, RSF can effectively deal with the problem of data imbalance, when there is classification imbalance, RSF can balance the data error. Currently, it has been used in the construction of prognostic models for many different diseases, such as heart failure, arrhythmia, multiple myeloma, etc. (Hsich et al., 2011; Miao et al., 2015; Morvan et al., 2020). In addition, several studies have compared the performance of RSF with the classical Cox regression model, while some have found that RSF is more accurate than Cox regression (Sloan et al., 2016; Ma et al., 2020; Tapak et al., 2020), others have reached the opposite conclusion (Qiu et al., 2020). Hitherto, there were no comparative studies of two models in the ATN based on large sample data. Therefore, this study aimed to investigate the prediction of mortality in ATN patients in the ICU and the associated influencing factors by using RSF algorithm and Cox regression method.

2 Materials and methods

2.1 Data source and study population

Medical Information Mart for Intensive Care IV (MIMIC-IV) is a large-scale public database containing clinical information of patients at Beth Israel Deaconess Medical Center between 2008 and 2019, which was established by the Massachusetts Institute of Technology and Beth Israel Deaconess Medical Center. In the MIMIC-IV database, the patients’ true identifying information is hidden, therefore, there is no need to obtain informed consent from patients. However, researchers are required to complete relevant training courses and receive certificate before accessing the database. Datasets were obtained from the Physionet official website (http://mimic.physionet.org/).

A total of 4,031 patients were diagnosed with ATN in the database. For this study, the inclusion criteria were: over 18 years old and admission to the ICU longer than 24 h. Exclusion criteria were: patients who died within 24 h of the ICU admission and patients with incomplete data. For patients with multiple ICU admissions, and only data from their first admission were taken. Ultimately, a total of 3,220 patients were enrolled in this study.

2.2 Data extraction

Datasets were extracted by using structured query language. Basic information of ATN patients included: age at admission, gender, ethnicity, weight, length of stay in ICU, etc. Treatment measures included: antibiotic use, vasopressors use, nor-epinephrine use, the use of continuous renal replacement therapy, etc. Related comorbidities included the following: myocardial infarction, congestive heart failure, peripheral vascular disease, chronic pulmonary disease, cerebrovascular disease, rheumatic disease, mild liver disease, peptic ulcer disease, paraplegia, malignant cancer, severe liver disease, metastatic solid tumor, etc. The first laboratory test results after ICU admission included: hemoglobin, white blood cells, base excess, pH, anion gap (AG), bicarbonate, international normalized ratio (INR), prothrombin time, urine output, arterial partial pressure of oxygen, creatinine, BUN, chloride, glucose, etc. Vital signs after ICU admission included: heart rate, respiratory rate, systolic blood pressure (SBP), diastolic blood pressure (DBP), body temperature, etc. Because of the high sampling frequency, the maximum, the minimum and the average values were used to represent vital signs and laboratory test results.

2.3 Model construction

Patients with ATN were randomly divided into the training set and validation set in an 8:2 ratio. The training set was used to construct RSF or Cox model and the validation set was used to evaluate the performance of the two predictive models. RSF model was constructed on the basis of optimal parameter combination. The out-of-bag (OOB) error rate under different parameter combination, which is calculated by grid search method, and it was used to determine the optimal parameter combination of the model (Wang et al., 2019). RSF algorithm has the ability to assess the importance of each variable that contributed to the outcome indicators. In this study, the minimum depth method was used to measure and rank the importance of each variable (Peng et al., 2016). On the other hand, uni-variate and multivariate regression analysis were carried out for the Cox regression model. All variables were first analyzed in the uni-variate Cox regression model, and those with p-values less than 0.05 were selected and subjected to multivariate Cox regression analysis.

2.4 Model comparison

The OOB error rate is equivalent to the value of 1-C index, which is used to evaluate the prediction ability of the model. The smaller the out-of-bag error rate is, the stronger the differentiation ability of the model is. Brier score is an evaluation index to evaluate different survival models and can represent the prediction accuracy of prediction models. Brier score can be viewed as a “calibrated” measure of a set of probabilistic predictions. The OOB error rate and the integrated Brier score were first calculated to determine the discrimination and calibration ability of the two models. The smaller the OOB error rate, the better the discrimination ability of the predictive model (Banerjee et al., 2012). The model is well calibrated when the Brier score is less than 0.25 (Mogensen et al., 2012). And the smaller the Brier score, the better the calibration of the model. Then, the prediction error curves of two models were plotted for judging the prediction performance. To further assess the prognostic ability of two models, 30-day, 60-day and 90-day dependent receiver operating characteristic (ROC) curves were plotted. A larger area under the curve (AUC) value indicates a stronger predictive ability of the model when the AUC values are greater than 0.5 (Bansal and Heagerty, 2018). To analyze the decision curve analysis (DCA) of the Cox model, 30-day, 60-day, and 90-day DCA were plotted. When the net benefit of DCA is large, it indicates that the clinical application value of the model is high (Vickers and Holland, 2021). The superior performance model was utilized to construct a nomogram as individual prediction tool for ATN mortality risk.

2.5 Statistical analysis

For descriptive variables, the median and quartile were superior to the means and standard deviation values in several statistical guides (Kattan and Vickers, 2020). Therefore, continuous variables were represented by using the median and quartile and were compared by Mann-Whitney U test. Categorical variables were expressed in terms of frequency or percentage and compared using Chi-square tests or Fisher’s exact tests. In this study, indicators with the missing degree greater than 20% were removed, and then the remaining missing data were filled with multipe interpolation method (Zhang et al., 2022). Since the presence of outliers reduces the accuracy of RSF algorithm, the outliers were first identified by using box-plots, then the outlier indicators that exceed 10% were removed and median replacement was performed on the remaining outlier data (Dutta et al., 2022). The software packages used in the data analysis and processing included: randomForestSRC, survival, ggRandomForests, timeROC, and ggplot2.

3 Results

3.1 Baseline characteristics

A total of 3,220 patients with ATN were included in this study, of which 2,457 patients survived and 763 patients died during hospitalization. Comparisons between two groups showed that there were significant differences in the age, weight, length of stay in ICU, vasopressors, mild liver disease, severe liver disease, metastatic solid tumors, AG_min, AG_max, bicarbonate_min, bicarbonate_max, chloride_min, chloride_max, creatinine_min, creatinine_max, base excess_min, base excess_max, temperature_min, temperature_mean, urine output, DBP_min, etc. There were no statistically significant differences in the variables including gender, myocardial infarction, peripheral vascular disease, paraplegia, rheumatic disease, peptic ulcer disease, glucose_min, glucose_max, etc. Other baseline characteristics were shown in Tables 13.

TABLE 1

DeathSurvivalp-value
Number (sample size)7632,457
Age,year72.16(60.12,80.56)66.32(54.98,77.06)< 0.0001
Gender(%)0.0529
   Female330(43.3)966(39.3)
 Male433(56.7)1,491(60.7)
Weight, kg81(68.30,94.40)83.2(70.20,98.80)0.0002
Ethnicity (%)< 0.0001
   White500(65.5)1,647(67.0)
 Black64(8.4)296(12.0)
 Yellow22(2.9)65(2.6)
 Other177(23.2)449(18.4)
Length of stay in the ICU, day7.89(3.86,14.65)5.80(2.78,12.21)< 0.0001
First care unit (%)< 0.0001
   CCU82(10.7)311(12.6)
 SICU118(15.5)365(14.9)
 MICU191(25.1)715(29.1)
 CVICU72(9.4)295(12.0)
 Other300(39.3)771(31.4)

General information of the patients.

ICU, intensive care unit; CCU, coronary care unit; SICU, surgical intensive care unit; MICU, medical intensive care unit; CVICU, cardiac vascular intensive care unit, p-value less than 0.05 are shown in bold text.

TABLE 2

DeathSurvivalp-value
The treatment
Antibiotic (%)< 0.0001
 No15(2.0)158(6.4)
 Yes748(98.0)2,299(93.6)
Dobutamine (%)0.0008
 No707(92.7)2,351(95.7)
 Yes56(7.3)106(4.3)
Dopamine (%)0.0367
 No700(91.7)2,307(93.9)
 Yes63(8.3)150(6.1)
Nerve blockers (%)< 0.0001
 No671(87.9)2,298(93.5)
 Yes92(12.1)159(6.5)
Epinephrine (%)< 0.0001
 No644(84.4)2,263(92.1)
 Yes119(15.6)194(7.9)
Norepinephrine (%)< 0.0001
 No244(32.0)1,388(56.5)
 Yes519(68.0)1,069(43.5)
Phenylephrine (%)< 0.0001
 No485(63.6)1815(73.9)
 Yes278(36.4)642(26.1)
Vasopressor (%)< 0.0001
 No441(57.8)2061(83.9)
 Yes322(42.2)396(16.1)
CRRT(%)0.3576
   No716(93.8)2,327(94.7)
 Yes47(6.2)130(5.3)
Comorbidity
Myocardial infarct (%)0.1509
   No576(75.5)1916(78.0)
 Yes187(24.5)541(22.0)
Congestive heart failure (%)0.0358
 No415(54.4)1,442(58.7)
 Yes348(45.6)1,015(41.3)
Peripheral vascular disease (%)0.9299
   No646(84.7)2077(84.5)
 Yes117(15.3)380(15.5)
Cerebrovascular disease (%)0.0047
 No647(84.8)2,178(88.6)
 Yes116(15.2)279(11.4)
Dementia (%)0.9653
   No735(96.3)2,366(96.3)
 Yes28(3.7)91(3.7)
Chronic pulmonary disease (%)0.0409
   No521(68.3)1772(72.1)
 Yes242(31.7)685(27.9)
Rheumatic disease (%)0.5712
   No730(95.7)2,362(96.1)
 Yes33(4.3)95(3.9)
Peptic ulcer disease (%)0.7695
   No725(95.0)2,341(95.3)
 Yes38(5.0)116(4.7)
Mild liver disease (%)< 0.0001
   No487(63.8)1855(75.5)
 Yes276(36.2)602(24.5)
Diabetes uncomplicated (%)0.0777
   No564(73.9)1735(70.6)
 Yes199(26.1)722(29.4)
Diabetes complicated (%)0.0082
 No658(86.2)2018(82.1)
 Yes105(13.8)439(17.9)
Paraplegia (%)0.9334
   No738(96.7)2,378(96.8)
 Yes25(3.3)79(3.2)
Renal disease (%)0.0001
 No496(65.0)1,407(57.3)
 Yes267(35.0)1,050(42.7)
Malignant cancer (%)< 0.0001
 No553(72.5)2,101(85.5)
 Yes210(27.5)356(14.5)
Severe liver disease (%)< 0.0001
 No597(78.2)2,137(87.0)
 Yes166(21.8)320(13.0)
Metastatic solid tumor (%)< 0.0001
 No653(85.6)2,334(95.0)
 Yes110(14.4)123(5.0)
Aids (%)0.3996
   No758(99.3)2,431(98.9)
 Yes5(0.7)26(1.1)

The treatment and comorbidity of the patients.

CRRT, continuous renal replacement therapy, p-value less than 0.05 are shown in bold text.

TABLE 3

DeathSurvivalp-value
Laboratory tests
Hematocrit_min(%)26.10(22.70,31.00)27.00(23.40,32.00)0.0006
Hematocrit_max(%)31.20(27.20,35.85)31.80(28.10,36.90)0.0004
Hemoglobin_min(g/dL)8.50(7.35,10.00)8.80(7.70,10.50)< 0.0001
Hemoglobin_max(g/dL)10.10(8.80,11.60)10.40(9.10,12.10)< 0.0001
Platelets_min(k/uL)138.00(69.0,203.5)146.00(100.00,213.00)< 0.0001
Platelets_max(k/uL)183.00(105.00,247.00)187.00(138.00,260.00)< 0.0001
WBC_min(k/uL)9.90(6.30,13.90)9.90(6.70,13.30)0.8786
WBC_max(k/uL)13.80(9.40,19.30)13.80(9.90,18.60)0.9863
AG_min(mEq/L)15.00(12.00,17.00)14.00(12.00,17.00)0.0016
AG_max(mEq/L)19.00(16.00,22.00)18.00(15.00,22.00)0.0114
Bicarbonate_min(mEq/L)18.00(15.00,22.00)19.00(16.00,22.00)0.0003
Bicarbonate_max(mEq/L)22.00(19.00,25.00)22.00(20.00,25.00)0.0007
BUN_min(mg/dL)33.00(21.00,50.50)33.00(21.00,52.00)0.7024
BUN_max(mg/dL)41.00(26.00,59.50)41.00(27.00,62.00)0.2251
Calcium_min(EU/dL)7.80(7.20,8.40)7.80(7.30,8.40)0.7065
Calcium_max(EU/dL)8.40(8.0,9.0)8.40(7.90,8.90)0.1561
Chloride_min(mEq/L)101.00(96.00,105.00)101.00(97.00,106.00)0.0206
Chloride_max(mEq/L)105.00(100.00,110.00)105.00(101.00,110.00)0.0202
Creatinine_min(g/dL)1.60(1.00,2.30)1.70(1.20,2.60)< 0.0001
Creatinine_max(g/dL)2.00(1.30,3.00)2.20(1.60,3.30)< 0.0001
Sodium_min(mEq/L)136.00(133.00,139.00)136.00(133.00,139.00)0.9822
Sodium_max(mEq/L)140.00(136.00,143.00)140.00(137.00,142.00)0.9765
Potassium_min(mEq/L)3.90(3.50,4.40)4.00(3.60,4.40)0.0801
Potassium_max(mEq/L)4.60(4.10,5.20)4.70(4.20,5.30)0.0874
INR_min(s)1.30(1.20,1.70)1.30(1.10,1.50)< 0.0001
INR_max(s)1.50(1.40,2.00)1.40(1.20,1.70)< 0.0001
PT_max(s)16.20(14.75,21.45)15.70(13.40,18.90)< 0.0001
PTT_min(s)31.30(28.35,37.90)30.30(26.90,34.20)< 0.0001
pH_min7.29(7.19,7.36)7.30(7.22,7.36)0.1004
pH_max7.40(7.35,7.45)7.40(7.35,7.45)0.8361
PaO2_min(mmHg)51.00(37.00,70.00)53.00(39.00,79.00)< 0.0001
PaO2_max(mmHg)131.00(80.00,221.50)134.00(79.00,243.00)0.2706
PaCO2_min(mmHg)33.00(28.00,39.00)34.00(29.00,39.00)0.0035
PaCO2_max(mmHg)45.00(37.50,52.00)45.00(39.00,52.00)0.2290
TotalCO2_min20.00(16.00,24.00)21.00(17.00,24.00)0.0014
TotalCO2_max24.00(21.00,28.00)25.00(22.00,28.00)0.0045
Base excess_min(mEq/L)−5.00(-11.00,-1.00)−5.00(-9.00,-1.00)0.0033
Base excess_max(mEq/L)−1.00(-4.00,1.00)−1.00(-4.00,1.00)0.0376
Glucose_min(mg/dL)108.00(86.50,132.00)108.00(91.00,131.00)0.1577
Glucose_max(mg/dL)157.00(125.00,197.00)157.00(127.00,204.00)0.2225
Urine output(ml)775.0(304.50,1392.50)970.0(545.00,1732.00)< 0.0001
Vital Signs
Heart rate_min (min-1)76.00(65.00,89.00)73.00(63.00,84.00)< 0.0001
Heart rate_max (min-1)110.00(95.00,127.00)104.00(90.00,120.00)< 0.0001
Heart rate_mean (min-1)91.41(80.08,103.60)86.66(76.44,99.61)< 0.0001
SBP_min(mmHg)83.00(75.00,91.00)85.00(78.00,96.00)< 0.0001
SBP_max(mmHg)139.0(125.00,152.00)143.0(131.00,159.00)< 0.0001
SBP_mean(mmHg)106.73(99.52,116.70)110.96(103.97,121.60)< 0.0001
DBP_min(mmHg)42.00(35.00,48.00)44.00(38.00,50.00)< 0.0001
DBP_max(mmHg)83.00(71.75,96.00)84.00(73.00,98.00)0.0249
DBP_mean(mmHg)58.20(52.01,64.63)59.91(54.12,66.88)< 0.0001
Respiratory rate_min (min-1)13.00(11.00,16.00)13.00(10.00,15.00)< 0.0001
Respiratory rate_max(min-1)30.00(26.00,35.00)28.00(25.00,32.00)< 0.0001
Respiratory rate_mean(min-1)20.89(18.10,24.0)19.64(17.31,22.52)< 0.0001
Temperature_min(°C)36.44(36.11,36.61)36.44(36.17,36.67)0.0062
Temperature_max(°C)37.22(36.89,37.61)37.22(36.94,37.61)0.0505
Temperature_mean(°C)36.78(36.51,37.03)36.80(36.58,37.09)0.0002
SpO2_min (%)92.00(90.00,94.00)92.00(90.00,95.00)0.0005
SpO2_mean (%)96.88(95.33,98.18)97.00(95.66,98.44)0.0003

Laboratory tests and vital signs of the patients.

WBC, white blood cells; AG, anion gap; BUN, blood urea nitrogen; INR, international normalized ratio; PT, prothrombin time; PTT, partial thromboplastin time, pH potential of hydrogen, PaO2 partial pressure of oxygen, PaCO2 partial pressure of carbon dioxide, SBP, systolic blood pressure; DBP, diastolic blood pressure, SpO2 pulse oxygen saturation, Max maximum, Min minimum, p-value less than 0.05 are shown in bold text.

3.2 Model construction

3.2.1 RSF model

The OOB error rates were calculated by grid search method under different combination of parameters. As shown in Figure 1, RSF model achieved the lowest OOB error rate (21.4%) under the parameter combination of mtry = 10 and nodesize = 88. Different shades of the same color indicate the level of OOB error rate, while a shift from yellow to purple indicates an increase in the OOB error rate. The lower the OOB error rate, the better the predictive ability of model (Lines et al., 2021). As shown in Figure 2, the OOB error rate of model stabilized when 500 survival trees were reached. As shown in Table 4, the variables were ranked in importance by using the minimum depth method. Vasopressors, age, length of stay in ICU, metastatic solid tumors, INR_min, respiratory rate_min, chloride_max, calcium_min, base excess_min, bicarbonate_max, AG_min, potassium_min, pH_max, bicarbonate_min, DBP_min were the top 15 variables, indicating that these variables have strong predictive ability and significant effect on the outcome.

FIGURE 1

FIGURE 2

TABLE 4

VariableDepthImportance
Vasopressor0.3901
Age1.3642
Length of stay in the ICU1.3903
Metastatic solid tumor3.4604
INR_min4.5785
Respiratory rate_min5.9426
Chloride_max6.4307
Calcium_min6.5248
Base excess_min7.0509
Bicarbonate_max7.24810
AG_min7.30211
Potassium_min7.87412
pH_max7.95813
Bicarbonate_min7.96414
DBP_min8.17815
Calcium_max8.21416
SBP_min8.45217
WBC_max8.55018
Sodium_max8.75019
pH_min8.88220
PTT_min8.96421
Hemoglobin_min9.13222
Temperature_max9.13823
Weight9.18624
Chloride_min9.20225
Potassium_max9.21226
WBC_min9.23027
Hematocrit_min9.78828
Platelets_max9.86629
Platelets_min9.88630
PaO2_min10.12431
AG_max10.31232
PaCO2_min10.36433
Base excess_max10.49834
TotalCO2_max10.59635
BUN_min10.68436
Heart_rate_max10.70037
Creatinine_min10.99638
Glucose_min11.00439
SpO2_mean11.16640
Hemoglobin_max11.38441
Hematocrit_max11.52242
DBP_max11.66043
Sodium_min11.75044
Resp_rate_mean11.89645
BUN_max12.01246
PaCO2_max12.19447
Glucose_max12.20248
Urine output12.32449
Temperature_min12.34650
PaO2_max12.96451
Creatinine_max12.99852
Respiratory rate_max13.39253
SBP_max13.65054
SBP_mean13.93655
INR_max13.99656
Heart_rate_min14.96857
SpO2_min15.23658
DBP_mean15.35259

Variable importance ranking for RSF.

The 59 variables in the depth threshold and the corresponding minimum depth value; ICU, intensive care unit; INR, international normalized ratio; AG, anion gap, pH potential of hydrogen, DBP, diastolic blood pressure; SBP, systolic blood pressure; WBC, white blood cells; PTT, partial thromboplastin time, PaO2 partial pressure of oxygen, PaCO2 partial pressure of carbon dioxide, BUN, blood urea nitrogen, SpO2 pulse oxygen saturation, Max maximum, Min minimum.

3.2.2 Cox regression model

Uni-variate Cox regression analysis results revealed that variables included age, length of stay in ICU, dopamine, epinephrine, nor-epinephrine, vasopressors, AG_min, AG_max, bicarbonate_min, bicarbonate_max, INR_min, INR_max, pH_min, pH_max, base excess_min, base excess_max, urine output, SBP_min, SBP_max, SBP_mean, temperature_min, temperature_max, temperature_mean, cerebrovascular disease, chronic pulmonary disease, mild liver disease, severe liver disease, and metastatic solid tumors had p-value ≤0.05. These variables were included in the multivariate Cox regression model for analysis. Dopamine, epinephrine, nor-epinephrine, vasopressors, INR_min, cerebrovascular disease, chronic pulmonary disease, mild liver disease, severe liver disease, metastatic solid tumors were important risk factors that increased the risk of death in ATN patients. However, length of stay in ICU was protective factor. The shorter the stay, the lower the risk of death. Detailed information of Cox regression analysis was shown in Table 5.

TABLE 5

VariableUni-variate Cox regressionMultivariate Cox regression
HR(95%CI)p-ValueHR(95%CI)p-Value
Age1(1-1)< 0.00011(1-1)< 0.0001
Gender0.82(0.7-0.97)0.020.89(0.75-1.1)0.2
Ethnicity1.1(0.99-1.1)0.095
Weight0.99(0.99-1)< 0.00011(0.99-1)0.095
First care unit1(0.95-1.1)0.76
Length of stay in the ICU0.96(0.95-0.97)< 0.00010.95(0.94-0.95)< 0.0001
Antibiotic1.1(0.58-1.9)0.85
Dobutamine1.3(0.96-1.8)0.084
Dopamine1.4(1-1.8)0.0371.4(1-1.9)0.045
Nerve blockers1.2(0.91-1.5)0.21
Epinephrine1.6(1.3-2)< 0.00011.7(1.3-2.2)0.0002
Norepinephrine1.9(1.6-2.3)< 0.00011.5(1.2-1.9)0.0004
Phenylephrine1.2(1-1.4)0.0331(0.86-1.3)0.64
Vasopressor2.4(2-2.8)< 0.00012(1.6-2.5)< 0.0001
CRRT1(0.74-1.4)0.89
Hematocrit_min1(1-1)0.17
Hematocrit_max1(1-1)0.2
Hemoglobin_min1(0.96-1)0.77
Hemoglobin_max1(0.96-1)0.9
Platelets_min1(1-1)0.0271(1-1)0.99
Platelets_max1(1-1)0.0441(1-1)0.4
WBC_min1(1-1)0.13
WBC_max1(1-1)0.12
AG_min1(1-1.1)< 0.00010.99(0.96-1)0.51
AG_max1(1-1.1)< 0.00011(0.98-1)0.65
Bicarbonate_min0.96(0.95-0.98)< 0.00011(0.97-1)0.68
Bicarbonate_max0.96(0.94-0.98)< 0.00011(0.96-1)0.97
BUN_min1(1-1)0.58
BUN_max1(1-1)0.98
Calcium_min0.93(0.85-1)0.16
Calcium_max1(0.92-1.1)0.71
Chloride_min1(0.99-1)0.69
Chloride_max1(0.99-1)0.86
Creatinine_min0.95(0.88-1)0.2
Creatinine_max0.94(0.89-1)0.061
Glucose_min1(1-1)0.0441(1-1)0.25
Glucose_max1(1-1)0.14
Sodium_min1(0.99-1)0.69
Sodium_max1(0.99-1)0.18
Potassium_min0.95(0.84-1.1)0.37
Potassium_max0.98(0.89-1.1)0.73
INR_min2.1(1.7-2.6)< 0.00011.5(1.1-2.1)0.0047
INR_max1.3(1.1-1.5)0.000540.96(0.7-1.3)0.78
PT_max1(1-1)0.00130.99(0.96-1)0.44
PTT_min1(1-1)< 0.00011(1-1)0.11
pH_min0.38(0.19-0.77)0.00710.71(0.21-2.4)0.58
pH_max0.21(0.07-0.62)0.00482.4(0.44-13)0.31
PaO2_min0.99(0.99-1)< 0.00011(0.99-1)0.049
PaO2_max1(1-1)0.00831(1-1)0.82
PaCO2_min0.99(0.98-1)0.0141(0.99-1)0.74
PaCO2_max0.99(0.99-1)0.05
Base excess_min0.98(0.96-0.99)0.000281(0.98-1)0.5
Base excess_max0.96(0.94-0.98)< 0.00010.99(0.95-1)0.66
TotalCO2_min0.97(0.96-0.99)0.000221(0.98-1.1)0.37
TotalCO2_max0.96(0.95-0.98)< 0.00010.97(0.94-1)0.068
Urine output1(1-1)< 0.00011(1-1)0.0019
Heart_rate_min1(1-1)0.13
Heart_rate_max1(1-1)0.0241(0.99-1)0.9
Heart_rate_mean1(1-1)0.0391(1-1)0.13
SBP_min0.98(0.98-0.99)< 0.00011(1-1)0.021
SBP_max0.99(0.99-1)< 0.00011(0.99-1)0.84
SBP_mean0.98(0.97-0.99)< 0.00010.99(0.98-1)0.28
DBP_min0.97(0.96-0.98)< 0.00010.99(0.98-1)0.07
DBP_max1(0.99-1)0.19
DBP_mean0.98(0.97-0.99)< 0.00011(0.99-1)0.58
Respiratory rate_min1(1-1.1)0.00331(0.99-1.1)0.11
Respiratory rate_max1(1-1)0.000171(0.99-1)0.19
Respiratory rate_mean1(1-1.1)0.000111(0.98-1)0.37
Temperature_min0.78(0.66-0.91)0.00190.84(0.68-1)0.1
Temperature_max0.85(0.74-0.97)0.0191.1(0.9-1.3)0.38
Temperature_mean0.75(0.63-0.9)0.00150.9(0.67-1.2)0.45
SpO2_min0.97(0.95-0.99)0.00271(0.97-1)0.99
SpO2_mean0.91(0.88-0.95)< 0.00010.97(0.92-1)0.25
Myocardial infarct1.3(1.1-1.5)0.0131.1(0.88-1.3)0.44
Congestive heart failure1.2(0.98-1.4)0.087
Peripheral vascular disease1.1(0.85-1.3)0.6
Cerebrovascular disease1.3(1-1.6)0.0381.5(1.2-1.9)0.0007
Dementia1.6(1-2.4)0.0410.97(0.62-1.5)0.9
Chronic pulmonary disease1.3(1.1-1.5)0.0121.3(1.1-1.6)0.0071
Rheumatic disease1.4(0.96-2.1)0.081
Peptic ulcer disease0.69(0.46-1)0.08
Mild liver disease1.4(1.2-1.6)0.000221.5(1.2-1.9)0.0003
Diabetes uncomplicated0.96(0.8-1.2)0.7
Diabetes complicated0.83(0.65-1)0.11
Paraplegia0.73(0.46-1.2)0.17
Renal disease0.89(0.75-1.1)0.2
Malignant cancer1.5(1.3-1.8)< 0.00011.1(0.86-1.3)0.59
Severe liver disease1.2(1-1.5)0.0451.4(1-1.8)0.03
Metastatic solid tumor2.4(1.9-3.1)< 0.00012.3(1.8-3.1)< 0.0001
Aids0.82(0.34-2)0.66

Cox regression analysis results.

HR, hazard ratio; CI, confidence interval; ICU, intensive care unit; CRRT, continuous renal replacement therapy; WBC, white blood cells; AG, anion gap; BUN, blood urea nitrogen; INR, international normalized ratio; PT, prothrombin time; PTT, partial thromboplastin time, pH potential of hydrogen, PaO2 partial pressure of oxygen, PaCO2 partial pressure of carbon dioxide, SBP, systolic blood pressure; DBP, diastolic blood pressure, SpO2 pulse oxygen saturation, Max maximum, Min minimum, p-value less than 0.05 are shown in bold text.

3.3 Model comparison

As shown in Table 6, the OOB error rates for RSF model and Cox regression model were 0.214 and 0.215. The OOB error rates for two models were small, indicating that both predictive models had good discrimination ability. The integrated Brier score of two models were 0.199 (RSF) and 0.154 (Cox), both of which were less than 0.25, suggesting that two models had good calibration ability. As shown in Figure 3, the prediction error curve of Cox regression model was lower than that of RSF model, indicating that Cox regression model was more stable and reliable. In the validation set, RSF model had AUC values of 0.788, 0.719, and 0.715 at 30, 60 and 90 days (Figure 4). Cox regression model had AUC values of 0.833, 0.736, and 0.732 at 30, 60 and 90 days (Figure 4). Thus, it appeared that Cox regression model was more accurate in predicting mortality of ATN patients. As shown in Figure 5, the net benefit of Cox model at different time points was large, indicating that the Cox model had high clinical practical value.

TABLE 6

ModelOOBIBS
Random survival forest0.2140.199
Multivariate cox regression0.2150.154

Performance comparison of the two models.

OOB, out-of-bag error rate; IBS, integrated brier score.

FIGURE 3

FIGURE 4

FIGURE 5

3.4 Nomogram for predicting risk of ATN mortality

Given that Cox model outperforms RSF model in the discrimination and calibration ability, a nomogram was constructed on the basis of Cox model to predict the probability of death at the individual level in ATN patients. The nomogram was constructed to predict 30-day, 60-day, and 90-day mortality risk based on 15 significant variables in the training set. In the nomogram, an individual score for each factor is obtained by projecting the value of each factor vertically onto the first row of “points”. For each participant, the total score was calculated by adding the scores for each factor. By projecting the total score vertically to the bottom, we can get a picture of the risk of death for ATN patients. Assuming a 68-year-old patient with ATN has a score of 82 for metastatic solid tumor, 83 for severe liver disease, and 87 for mild liver disease. Chronic pulmonary disease score was 84, cerebrovascular disease score was 82, SBP_min score was 83, urine output score was 83, and PaO2_min score was 82. INR_min score was 80, vasopressor score was 91, nor-epinephrine score was 86, epinephrine score was 88, dopamine score was 81, length of stay in the ICU score was 55, with an age score of 83 out of 1,230, the estimated risk of death at 30, 60, and 90 days was 15%, 37.6%, and 61.6% (Figure 6).

FIGURE 6

4 Discussion

ATN is one of the most common types of acute kidney injury, which seriously affects the quality of life and even threatens their lives, and it is characterized by high morbidity, high mortality and poor prognosis (Hoste et al., 2018). Therefore, it is important to identify the influencing factors for ATN, which can help screen those patients at high risk and receive proper treatment.

Although ML methods represented by RSF algorithm performed well in several fields, this does not mean that MLalgorithms have an absolute advantage over traditional methods. For example, Cuthbert et al. (Cuthbert et al., 2022) analyzed the prediction of 8-year revision risk following total knee and hip arthroplasty, and Tang X et al. (Tang X. et al., 2021) studied prognostic prediction in metastatic non-small cell lung cancer patients receiving EGFR-TKI osiertinib treatment, both studies have proved that traditional statistical methods have certain strengths. Nonetheless, RSF algorithm also has its own unique advantages. RSF algorithm can directly and quantitatively calculate the minimum depth of each variable to reflect the magnitude of importance, which facilitates the comparison among variables (Taylor, 2011). This is a feature which has not been found in the traditional Cox regression method. Therefore, it is better to construct predictive models by using several methods and compare them to identify the best ones.

Since Cox model outperforms RSF model, we constructed a nomogram based on the Cox model. The nomogram is a visualization tool used to generate the probability of clinical outcome (Park, 2018). Studies have demonstrated that nomogram enable accurate compared to traditional scoring systems (Wang et al., 2021). It is now widely used for risk prediction of many diseases (Li et al., 2021; Tan et al., 2022; Wang et al., 2023). Therefore, the creation of mortality risk nomogram based on the information about ATN patients can inform existing critical care assessment programs. Clinical staff can use the total score to predict the probability of death in ATN patients, thus assisting them in developing a more rational treatment plan.

In this study, two different models were constructed to explore the influencing factors of ATN. Cox regression analysis concluded that vasopressors, nor-epinephrine, INR_max, severe liver disease, and metastatic solid tumors were the important risk factors. RSF model concluded that vasopressors, INR_max, and metastatic solid tumors were the important influencing factors based on the importance rank of the variables. The influencing factors identified by two methods of analysis are basically similar, indicating that they are probably true factors associated with ATN.

Among the variables associated with predicting ATN patients, the most important ones are the AG, pH, base excess, BUN and bicarbonate, which can be used to determine whether patients have symptoms of acid-base imbalance, azotemia and electrolyte disturbances (Bellomo, 2011). Urine output is the most common factor affecting ATN. This is due to decreased urine output can cause hypovolemia, which increases the risk of death from the disease (Xu et al., 2020). Timely rehydration therapy can restore the circulating blood volume and improve the impaired renal perfusion function.

ATN is often associated with many comorbidities, and the presence of these comorbidities also increases the risk of death from ATN. Severe liver disease is a relatively common comorbidity in patients with ATN. Due to the presence of large amounts of peritoneal fluid in patients with severe hepatitis, it can lead to insufficient circulating blood volume and uneven distribution of intrarenal blood flow, which ultimately increases the probability of death in ATN (Chancharoenthana and Leelahavanichkul, 2019). In addition, metastatic solid tumors are the common comorbidity that increases the risk of death from this disease (Wu et al., 2020), and the main reason is that neutrophils in solid tumors can enhance cytotoxicity and lead to necrosis of renal tubular epithelial cells, thus reducing patient survival (Liao and Liaw, 2020).

Relevant studies have shown that certain drugs also increase the risk of death in ATN. For example, vasopressor drugs can increase glomerular perfusion pressure and urine output, thus affecting renal function (Shi and Wang, 2017). Nor-epinephrine can increase patients’ blood pressure and reduce renal blood flow, resulting in renal function impairment (Kim et al., 2021). INR is a preferred monitoring indicator for oral anticoagulants. Since overdose of anticoagulants increases the probability of death from ATN, the level of this indicator also reflects the risk of death occurring from ATN (Lim and Campbell, 2013).

However, the association between these drugs and disease needs further study. Because machine learning is one of the main methods of drug knowledge discovery. In the follow-up study, we plan to use machine learning, text mining and other technical methods to mine the process of drug tacit knowledge contained in the data, so as to explore whether there is a potential association between drugs and some biomedical entities, such as drug-disease association, and the association between drugs and side effects, etc.

The main advantage of this study is that it was the first to use the RSF algorithm and the Cox regression method to predict hospital mortality of patients with ATN from the MIMIC-IV database. Cox regression model has improved accuracy and precision compared to RSF model. This study also has some limitations: firstly, it is a single-center study and lacks external validation. Secondly, this was a retrospective observational study in which the majority of patients were white, and there may have been unobserved confounding factors that could have influenced the outcome. Finally, although the predictive ability of Cox regression model in this study is superior toRSF model, ML algorithms are evolving rapidly and new algorithms are constantly proposed, and further comparative studies are needed in practical applications.

5 Conclusion

Cox regression model is superior to RSF algorithm model in predicting mortality of patients with ATN. Vasopressors, nor-epinephrine, INR_min, and metastatic solid tumors were imporant factors that also significantly influence prognosis. Therefore, the mortality risk nomogram based on information about ATN patients can inform existing critical care assessment programs. Moreover, the model has certain clinical utility, which can provide clinicians with some reference basis in the treatment of ATN and contribute to improve patient prognosis.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://physionet.org/content/mimiciv/1.0/.

Ethics statement

The studies involving humans were approved by this study was conducted in accordance with the Declaration of Helsinki. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because this study was conducted in accordance with the Declaration of Helsinki. Institutional review board approval and informed consent were not required in current study because MIMIC-IV research data is publicly available and all patient data are de-identified. Informed consent of all subjects and/or their legal guardians was obtained when MIMIC-IV was established.

Author contributions

JZ: Conceptualization, Data curation, Formal Analysis, Methodology, Software, Writing–original draft. MZ: Data curation, Formal Analysis, Investigation, Validation, Writing–review and editing. JD: Investigation, Validation, Writing–review and editing. JH: Investigation, Writing–review and editing. QS: Project administration, Writing–review and editing. TD: Visualization, Writing–review and editing. JY: Supervision, Writing–review and editing. YW: Supervision, Writing–review and editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported in part by grants from the National Natural Science Foundation of China (No. 32270186) and the Huadong Medicine Joint Funds of the Zhejiang Provincail Natural Science Foundation (No. LHDMZ24H040001).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Glossary

ATNAcute tubular necrosis
RSFRandom survival forest
OOBOut-of-bag
ICUIntensive care unit
pHPotential of hydrogen
BUNBlood urea nitrogen
MLMachine learning
MIMICMedical information mart for intensive care
AGAnion gap
INRInternational normalized ratio
SBPSystolic blood pressure
DBPDiastolic blood pressure
ROCReceiver operating characteristic
AUCArea under the curve
MaxMaximum
MinMinimum
CCUCoronary care unit
SICUSurgical intensive care unit
MICUMedical intensive care unit
CVICUCardiac vascular intensive care unit
CRRTContinuous renal replacement therapy
WBCWhite blood cells
PTProthrombin time
PaO2Partial pressure of oxygen
PTTPartial thromboplastin time
PaCO2Partial pressure of carbon dioxide
SpO2Peripheral capillary oxygen saturation
HRHazard ratio
CIConfidence interval

References

  • 1

    AdhamD.AbbasgholizadehN.AbazariM. (2017). Prognostic factors for survival in patients with gastric cancer using a random survival forest. Asian Pac J. Cancer Prev.18 (1), 129134. 10.22034/APJCP.2017.18.1.129

  • 2

    AnY.LiangJ.LiuZ. H. (2022). The value of glomerular filtration rate slope change as a renal replacement endpoint in clinical trials. J. Nephrol. Dialysis Ren. Transplant.31 (6), 556560. 10.3969/j.issn.1006-298X.2022.06.012

  • 3

    BanerjeeM.DingY.NooneA. M. (2012). Identifying representative trees from ensembles. Stat. Med.31 (15), 16011616. 10.1002/sim.4492

  • 4

    BansalA.HeagertyP. J. (2018). A tutorial on evaluating the time-varying discrimination accuracy of survival models used in dynamic decision making. Med. Decis. Mak.38 (8), 904916. 10.1177/0272989X18801312

  • 5

    BellomoR. (2011). Acute renal failure. Semin. Respir. Crit. Care Med.32 (5), 639650. 10.1055/s-0031-1287872

  • 6

    ChancharoenthanaW.LeelahavanichkulA. (2019). Acute kidney injury spectrum in patients with chronic liver disease: where do we stand?World J. Gastroenterol.25 (28), 36843703. 10.3748/wjg.v25.i28.3684

  • 7

    CuthbertA. R.GilesL. C.GlonekG.Kalisch EllettL. M.PrattN. L. (2022). A comparison of survival models for prediction of eight-year revision risk following total knee and hip arthroplasty. BMC Med. Res. Methodol.22 (1), 164. 10.1186/s12874-022-01644-3

  • 8

    DuttaA.HasanM. K.AhmadM.AwalM. A.IslamM. A.MasudM.et al (2022). Early prediction of diabetes using an ensemble of machine learning models. Int. J. Environ. Res. Public Health19 (19), 12378. 10.3390/ijerph191912378

  • 9

    FarhadianM.Dehdar KarsidaniS.MozayanimonfaredA.MahjubH. (2021). Risk factors associated with major adverse cardiac and cerebrovascular events following percutaneous coronary intervention: a 10-year follow-up comparing random survival forest and Cox proportional-hazards model. BMC Cardiovasc Disord.21 (1), 38. 10.1186/s12872-020-01834-1

  • 10

    Hippisley-CoxJ.CouplandC. (2015). Development and validation of risk prediction equations to estimate future risk of heart failure in patients with diabetes: a prospective cohort study. BMJ Open5 (9), e008503. 10.1136/bmjopen-2015-008503

  • 11

    HosteE. A. J.KellumJ. A.SelbyN. M.ZarbockA.PalevskyP. M.BagshawS. M.et al (2018). Global epidemiology and outcomes of acute kidney injury. Nat. Rev. Nephrol.14 (10), 607625. 10.1038/s41581-018-0052-0

  • 12

    HsichE.GorodeskiE. Z.BlackstoneE. H.IshwaranH.LauerM. S. (2011). Identifying important risk factors for survival in patient with systolic heart failure using random survival forests. Circulation Cardiovasc. Qual. Outcomes4 (1), 3945. 10.1161/CIRCOUTCOMES.110.939371

  • 13

    IssaN. T.StathiasV.SchürerS.DakshanamurthyS. (2021). Machine and deep learning approaches for cancer drug repurposing. Semin. Cancer Biol.68, 132142. 10.1016/j.semcancer.2019.12.011

  • 14

    KattanM. W.VickersA. J. (2020). Statistical analysis and reporting guidelines for CHEST. Chest158 (1), S3S11. 10.1016/j.chest.2019.10.064

  • 15

    KimN. Y.ChaeD. W.BetalK.ParkK.KimS. Y. (2021). Development of a risk scoring system for predicting acute kidney injury after minimally invasive partial and radical nephrectomy: a retrospective study. Surg. Endosc.35 (4), 16261635. 10.1007/s00464-020-07545-0

  • 16

    KoletsiD.PandisN. (2017). Survival analysis, part 3: Cox regression. Am. J. Orthod. Dentofac. Orthop.152 (5), 722723. 10.1016/j.ajodo.2017.07.009

  • 17

    LiS.ZhaoZ.YangH.WangD.SunW.LiS.et al (2021). Construction and validation of a nomogram for the preoperative prediction of lymph node metastasis in gastric cancer. Cancer Control.28, 10732748211027160. 10.1177/10732748211027160

  • 18

    LiañoF.García-MartínF.GallegoA.OrteL.TeruelJ. L.MarcénR.et al (1989). Easy and early prognosis in acute tubular necrosis: a forward analysis of 228 cases. Nephron51 (3), 307313. 10.1159/000185314

  • 19

    LiaoT. Y.LiawC. C. (2020). Retrospective analysis of mortality cases in advanced and metastatic solid tumors with concurrent prerenal azotemia. Vivo34 (3), 15151519. 10.21873/invivo.11939

  • 20

    LimA. K.CampbellD. A. (2013). Haematuria and acute kidney injury in elderly patients admitted to hospital with supratherapeutic warfarin anticoagulation. Int. Urol. Nephrol.45 (2), 561570. 10.1007/s11255-012-0364-0

  • 21

    LinesL. M.CohenJ.KirschnerJ.HalpernM. T.KentE. E.MollicaM. A.et al (2021). Random survival forests using linked data to measure illness burden among individuals before or after a cancer diagnosis: development and internal validation of the SEER-CAHPS illness burden index. Int. J. Med. Inf.145, 104305. 10.1016/j.ijmedinf.2020.104305

  • 22

    MaB.GengY.MengF.YanG.SongF. (2020). Identification of a sixteen-gene prognostic biomarker for lung adenocarcinoma using a machine learning method. J. Cancer11 (5), 12881298. 10.7150/jca.34585

  • 23

    MiaoF.CaiY. P.ZhangY. X.LiY.ZhangY. T. (2015). Risk prediction of one-year mortality in patients with cardiac arrhythmias using random survival forest. Comput. Math. Methods Med.2015, 303250. 10.1155/2015/303250

  • 24

    MogensenU. B.IshwaranH.GerdsT. A. (2012). Evaluating random forests for survival analysis using prediction error curves. J. Stat. Softw.50 (11), 123. 10.18637/jss.v050.i11

  • 25

    MoolgavkarS. H.ChangE. T.WatsonH. N.LauE. C. (2018). An assessment of the Cox proportional hazards regression model for epidemiologic studies. Risk Anal.38 (4), 777794. 10.1111/risa.12865

  • 26

    MorvanL.CarlierT.JametB.BaillyC.Bodet-MilinC.MoreauP.et al (2020). Leveraging RSF and PET images for prognosis of multiple myeloma at diagnosis. Int. J. Comput. Assist. Radiol. Surg.15 (1), 129139. 10.1007/s11548-019-02015-y

  • 27

    Noorbakhsh-SabetN.ZandR.ZhangY.AbediV. (2019). Artificial intelligence transforms the future of health care. Am. J. Med.132 (7), 795801. 10.1016/j.amjmed.2019.01.017

  • 28

    ParkS. Y. (2018). Nomogram: an analogue tool to deliver digital knowledge. J. Thorac. Cardiovasc Surg.155 (4), 1793. 10.1016/j.jtcvs.2017.12.107

  • 29

    PengH. X.LinK.HeB. S.PanY. Q.YingH. Q.HuX. X.et al (2016). Platelet-to-lymphocyte ratio could be a promising prognostic biomarker for survival of colorectal cancer: a systematic review and meta-analysis. FEBS Open Bio6 (7), 742750. 10.1002/2211-5463.12083

  • 30

    QiuX.GaoJ.YangJ.HuJ.HuW.KongL.et al (2020). A comparison study of machine learning (random survival forest) and classic statistic (Cox proportional hazards) for predicting progression in high-grade glioma after proton and carbon ion radiotherapy. Front. Oncol.10, 551420. 10.3389/fonc.2020.551420

  • 31

    RosenS.StillmanI. E. (2008). Acute tubular necrosis is a syndrome of physiologic and pathologic dissociation. J. Am. Soc. Nephrol.19 (5), 871875. 10.1681/ASN.2007080913

  • 32

    RoshanaeiG.SafariM.FaradmalJ.AbbasiM.KhazaeiS. (2022). Factors affecting the survival of patients with colorectal cancer using random survival forest. J. Gastrointest. Cancer53 (1), 6471. 10.1007/s12029-020-00544-3

  • 33

    ShiQ.WangG. Q. (2017). Risk factors of acute renal function injury in severe patients with complex abdominal infection. Mod. Med. Hyg.33 (24), 37143716. 10.3969/j.issn.1009-5519.2017.24.007

  • 34

    SloanR. A.HaalandB. A.SawadaS. S.LeeI. M.SuiX.LeeD. C.et al (2016). A fit-fat index for predicting incident diabetes in apparently healthy men: a prospective cohort study. PLoS One11 (6), e0157703. 10.1371/journal.pone.0157703

  • 35

    TanP. Y.HuoM.ZhouX. H.ZhaoB. L. (2022). Development and validation of a nomogram for predicting the risk of nursing home-acquired pneumonia. Eur. Rev. Med. Pharmacol. Sci.26 (22), 82768288. 10.26355/eurrev_202211_30360

  • 36

    TangM.WangH.CaoY.ZengZ.ShanX.WangL. (2021a). Nomogram for predicting occurrence and prognosis of liver metastasis in colorectal cancer: a population-based study. Int. J. Colorectal Dis.36 (2), 271282. 10.1007/s00384-020-03722-8

  • 37

    TangX.LiY.YanW. F.QianW. L.PangT.GongY. L.et al (2021b). Machine learning-based CT radiomics analysis for prognostic prediction in metastatic non-small cell lung cancer patients with EGFR-t790m mutation receiving third-generation EGFR-TKI osimertinib treatment. Front. Oncol.11, 719919. 10.3389/fonc.2021.719919

  • 38

    TapakL.SheikhV.JenabiE.KhazaeiS. (2020). Predictors of mortality among hemodialysis patients in Hamadan province using random survival forests. J. Prev. Med. Hyg.61 (3), E482-E488E488. 10.15167/2421-4248/jpmh2020.61.3.1421

  • 39

    TaylorJ. M. (2011). Random survival forests. J. Thorac. Oncol.6 (12), 19741975. 10.1097/JTO.0b013e318233d835

  • 40

    VickersA. J.HollandF. (2021). Decision curve analysis to evaluate the clinical benefit of prediction models. Spine J.21 (10), 16431648. 10.1016/j.spinee.2021.02.024

  • 41

    WangH.OuY.FanT.ZhaoJ.KangM.DongR.et al (2021). Development and internal validation of a nomogram to predict mortality during the ICU stay of thoracic fracture patients without neurological compromise: an analysis of the MIMIC-III clinical database. Front. Public Health9, 818439. 10.3389/fpubh.2021.818439

  • 42

    WangH.ZhouL. (2017). Random survival forest with space extensions for censored data. Artif. Intell. Med.79, 5261. 10.1016/j.artmed.2017.06.005

  • 43

    WangJ.TangJ.ChenT.YueS.FuW.XieZ.et al (2022). A web-based prediction model for overall survival of elderly patients with early renal cell carcinoma: a population-based study. J. Transl. Med.20 (1), 90. 10.1186/s12967-022-03287-w

  • 44

    WangX.GongG.LiN.QiuS. (2019). Detection analysis of epileptic EEG using a novel random forest model combined with grid search optimization. Front. Hum. Neurosci.13, 52. 10.3389/fnhum.2019.00052

  • 45

    WangY.ZhangY.NiB.JiangY.OuyangY. (2023). Development and validation of a depression risk prediction nomogram for US Adults with hypertension, based on NHANES 2007-2018. PLoS One18 (4), e0284113. 10.1371/journal.pone.0284113

  • 46

    WuY.HaoW.ChenY.ChenY.ChenS.LiuW.et al (2020). Clinical features, risk factors and clinical burden of acute kidney injury in older adults. Ren. Fail42 (1), 11271134. 10.1080/0886022X.2020.1843491

  • 47

    XuL. B.ZhuL. G.LiuG. P.YuL. (2020). A retrospective study of acute kidney injury in elderly patients in Inner Mongolia. Chin. J. Integr. Traditional West. Nephrop.21 (10), 871874. 10.3969/j.issn.1009-587X.2020.10.007

  • 48

    YosefianI.FarkhaniE. M.BaneshiM. R. (2015). Application of random forest survival models to increase generalizability of decision trees: a case study in acute myocardial infarction. Comput. Math. Methods Med.2015, 576413. 10.1155/2015/576413

  • 49

    ZhangL.HuangT.XuF.LiS.ZhengS.LyuJ.et al (2022). Prediction of prognosis in elderly patients with sepsis based on machine learning (random survival forest). BMC Emerg. Med.22 (1), 26. 10.1186/s12873-022-00582-z

  • 50

    ZhouQ.ShengY. H.ZhangT.ZhangZ. A. (2010). The role of chemical mediators in acute tubular necrosis. Med. Rev.16 (5), 653655. 10.3969/j.issn.1006-2084.2010.05.005

Summary

Keywords

acute tubular necrosis, random survival forest, cox regression, nomogram, risk factors

Citation

Zeng J, Zhang M, Du J, Han J, Song Q, Duan T, Yang J and Wu Y (2024) Mortality prediction and influencing factors for intensive care unit patients with acute tubular necrosis: random survival forest and cox regression analysis. Front. Pharmacol. 15:1361923. doi: 10.3389/fphar.2024.1361923

Received

27 December 2023

Accepted

22 April 2024

Published

23 May 2024

Volume

15 - 2024

Edited by

Norberto Perico, Mario Negri Institute for Pharmacological Research (IRCCS), Italy

Reviewed by

Karim Abbasi, Sharif University of Technology, Iran

Haoran Zhang, Peking Union Medical College Hospital (CAMS), China

Updates

Copyright

*Correspondence: Jun Yang, ; Yinyin Wu,

†These authors have contributed equally to this work and share first authorship

‡These authors have contributed equally to this work and share last authorship

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics