Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 26 April 2022
Sec. Computational Genomics
This article is part of the Research Topic Machine Learning Techniques on Gene Function Prediction Volume II View all 25 articles

Clinical Prediction of Heart Failure in Hemodialysis Patients: Based on the Extreme Gradient Boosting Method

Yanfeng Wang&#x;Yanfeng Wang1Xisha Miao&#x;Xisha Miao1Gang Xiao&#x;Gang Xiao2Chun HuangChun Huang1Junwei SunJunwei Sun1Ying WangYing Wang3Panlong Li
Panlong Li1*Xu You
Xu You2*
  • 1The School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou, China
  • 2Department of Clinical Laboratory, The Third Affiliated Hospital, Southern Medical University, Guangzhou, China
  • 3Department of Medical Statistics, School of Public Health, Sun Yat-sen University, Guangzhou, China

Background: Heart failure (HF) is the main cause of mortality in hemodialysis (HD) patients. However, it is still a challenge for the prediction of HF in HD patients. Therefore, we aimed to establish and validate a prediction model to predict HF events in HD patients.

Methods: A total of 355 maintenance HD patients from two hospitals were included in this retrospective study. A total of 21 variables, including traditional demographic characteristics, medical history, and blood biochemical indicators, were used. Two classification models were established based on the extreme gradient boosting (XGBoost) algorithm and traditional linear logistic regression. The performance of the two models was evaluated based on calibration curves and area under the receiver operating characteristic curves (AUCs). Feature importance and SHapley Additive exPlanation (SHAP) were used to recognize risk factors from the variables. The Kaplan–Meier curve of each risk factor was constructed and compared with the log-rank test.

Results: Compared with the traditional linear logistic regression, the XGBoost model had better performance in accuracy (78.5 vs. 74.8%), sensitivity (79.6 vs. 75.6%), specificity (78.1 vs. 74.4%), and AUC (0.814 vs. 0.722). The feature importance and SHAP value of XGBoost indicated that age, hypertension, platelet count (PLT), C-reactive protein (CRP), and white blood cell count (WBC) were risk factors of HF. These results were further confirmed by Kaplan–Meier curves.

Conclusions: The HF prediction model based on XGBoost had a satisfactory performance in predicting HF events, which could prove to be a useful tool for the early prediction of HF in HD.

Introduction

Heart failure (HF) as a clinical syndrome is one of the main causes of mortality (Ebong et al., 2014; Virani et al., 2021). More than 60 million people are affected by HF worldwide, and the number of HF patients is increasing every year (Groenewegen et al., 2020). Compared with the general population, the prevalence of HF in patients with chronic kidney disease (CKD) is much higher, especially in patients with end-stage renal disease (ESRD) (Rangaswami and McCullough, 2018; Rapa et al., 2019). Currently, hemodialysis (HD) as a renal replacement therapy (RRT) is the main treatment of ESRD (Akash et al., 2014). However, more than 40% of HD patients suffer from HF, which increase the medical care, economic burden, and mortality (House et al., 2019; Sun et al., 2019). Therefore, identification, pre-estimation, and timely interventions can improve the prognosis and prolong the survival time for the population who show a high risk of HF (Abdo, 2017).

The causes of HF in HD patients are multifactorial (Wang and Sanderson, 2011). Similar to the general population, traditional risk factors such as aging, hypertension, diabetes mellitus (DM), and atherosclerosis are associated with HF in HD patients (Liu et al., 2014; Cozzolino et al., 2018). However, excess HF prevalence and mortality in HD population are not fully accounted for by the traditional risk factors (Schefold et al., 2016). Some researchers have found that dialysis can increase the risk of HF events in ESRD patients (Rangaswami and McCullough, 2018; Samanta et al., 2019). Evidence shows that the malnutrition–inflammation syndrome in HD patients is associated with HF (Almeida et al., 2008; Chang et al., 2020). In addition, changed hemodynamic and blood pressure by HD can increase the risk of HF (Dorairajan et al., 2010). Several traditional risk prediction models were proposed to predict the risk of HF in population without HD (Ouwerkerk et al., 2014; Voors et al., 2017). However, these linear models missed specific related factors for HD people and oversimplified the complicated relationships between factors, which may lead to a decrease in performance and miss important risk factors (Mpanya et al., 2021).

Recent advances in extreme gradient boosting (XGBoost), a new integrated machine learning algorithm, have provided a robust method to identify the complex non-linear relationship between multiple variables and outcomes (Kagiyama et al., 2019). This algorithm has strong model generalization ability, fast operating speed, and high model accuracy, which has been applied in orthopedic auxiliary classification, prediction of interaction, and analysis of hypertension-related symptoms to improve accuracy in complex clinical decision-making (Chang et al., 2019; Li and Zhang, 2020; Yu et al., 2020).

Therefore, we aimed to establish a prediction model that integrated patient-specific information and non-traditional factors of HF in HD patients based on XGBoost 1) to accurately predict HF events and 2) to assess the risk of HF in patients with HD treatment.

Materials and Methods

Study Population

This retrospective study analyzed the data of 410 ESRD patients who underwent HD treatment in the HD centers of the Third Affiliated Hospital of Southern Medical University and the Third Affiliated Hospital of Sun Yat-sen University between January 2015 and September 2019. All the patients were older than 18 years and received HD treatment for at least 3 months. Patients with the following conditions were excluded from this study: 1) history of renal transplantation; 2) malignancy, acute infection, hepatic, and pulmonary dysfunction; and 3) lack of data on demographic characteristics, laboratory examinations, or physical examination. This study was approved by the Ethics Committee of the Third Affiliated Hospital of Southern Medical University.

Data Collection

A total of 21 factors were collected at the start of HD therapy. The basic information consisted of the patient’s gender, age, and history of various diseases. After fasting overnight for at least 8 h, the patient’s venous blood samples were collected before dialysis therapy. The laboratory indicators included C-reactive protein (CRP), blood urea nitrogen (BUN), calcium (CA), hemoglobin (HB), phosphorus (P), cholesterol (CHOL), serum creatinine (CRE), lymphocyte (LYMPH), uric acid (UA), low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), intact parathyroid hormone (IPTH), white blood cell count (WBC), platelet count (PLT), and neutrophil count (NEUT). All the abovementioned examinations were conducted at the laboratory centers in the Third Affiliated Hospital of Southern Medical University and the Third Affiliated Hospital of Sun Yat-sen University.

Outcome and Follow-Up

The occurrence of HF events (fatal or not) after HD was recorded. The HF diagnosis was made based on Framingham criteria (Groenewegen et al., 2020). Follow-up visits were performed by the HD centers. Patients who developed HF after HD were censored at the earliest date of a HF event. The HF events were recorded up to May 1, 2020.

Statistical Analysis

All statistical analyses and model establishment were performed using SPSS 26.0 for Windows and Python 3.7.6. All continuous variables were described as mean ± standard deviation (SD), and these variables were compared between groups by performing t-tests. In addition, discrete variables were expressed in numbers (n) and percentages (%), and the chi-square test was applied. All results were considered to be statistically significant within a two-sided test with p <0.05.

Two classification models were established based on XGBoost and traditional linear logistic regression models. The multiple logistic regression was selected using a stepwise method. In the XGBoost model, all the characteristics of HD patients were included. The data set was randomly divided into a training set by 75% and a validation set by 25%. The performance of the models was evaluated using calibration curves and area under the receiver operating characteristic curves (AUCs). To evaluate the influence of different variables on the results, the SHAP value of important variables was calculated (Rodriguez-Perez and Bajorath, 2020). A risk curve was constructed using the Kaplan–Meier analysis, and differences between groups were compared by a log-rank test to further evaluate the performance of our model and verify the risk factors selected by the mode. To make our model be an easy-to-use tool, a nomogram was developed based on the risk factors selected by our model.

Results

Demographic and Clinical Characteristics of Study Population

According to the inclusion and exclusion criteria, a total of 353 patients were finally included in this study. The basic variables of study population are shown in Table 1. Among these patients, 96 patients (73 males and 23 females) developed HF during the follow-up duration and 257 patients (166 males and 91 females) did not. In general, the average age of these patients was 54.92 ± 15.29, and the proportion of males was 67.71%. Compared with patients without HF, those with HF were older and had a higher ratio of hypertension and diabetes, and there was a significant difference in gender. In terms of laboratory data, for patients in the HF group, their WBC, NEUT, CRP, and TIBC levels were significantly (p <0.05) higher than those of the other group. There was no significant difference in other characteristics.

TABLE 1
www.frontiersin.org

TABLE 1. Clinical characteristics and laboratory parameters of patients in the HF group and the non-HF group.

Logistic Regression Analysis

As presented in Table 2, the following factors were included in a multivariate stepwise logistic regression model: HTN (OR: 3.786, 95% CI: 1.640–8.739, and p = 0.002), WBC (OR: 1.115, 95% CI: 1.026–1.212, and p = 0.011), CRP (OR: 1.020, 95% CI: 1.009–1.032, and p = 0.001), and age (OR: 1.026, 95% CI: 1.008–1.044, and p = 0.004). The receiver operating characteristic (ROC) curve of the logistic regression model is shown in Figure 1, and the performance of this model was evaluated using the calibration curve, as shown in Figure 2. The area under the curve (AUC) was 0.722 for the validation group and 0.735 for the training group. In addition, accuracy, sensitivity, and specificity of the model were 74.8, 75.6, and 74.4%, respectively, evaluated by the validation group.

TABLE 2
www.frontiersin.org

TABLE 2. Multivariate logistic regression model.

FIGURE 1
www.frontiersin.org

FIGURE 1. Receiver operating characteristic curves of two models for predicting objective response in the training cohort (A) and validation cohort (B). AUC, area under the curve.

FIGURE 2
www.frontiersin.org

FIGURE 2. Calibration curve demonstrating predictions from the model to the actual observed probability. (A) Training cohort and (B) validation cohort.

Extreme Gradient Boosting Model

The ROC curve and the calibration curve of the XGBoost model are presented in Figure 1 and Figure 2, respectively. It was found that the accuracy, sensitivity, specificity, and AUC value were 78.5, 79.6, 78.1%, and 0.814, respectively, evaluated by the validation cohort. The feature importance ranking which represents the contribution of the corresponding feature to the model prediction based on gain is shown in Figure 3. It was found that HTN, age, CRP, WBC, and PLT were more important for the prediction of HC than other features.

FIGURE 3
www.frontiersin.org

FIGURE 3. Related factors of HF were ranked by feature gain. (A) showed the top ten gain features; age, HTN, WBC, PLT, and CRP had a more significant impact. (B), (C), (D), (E), and (F) represented the SHAP value of HTN, age, PLT, WBC, and CRP in the XGBoost model. It reflected the influence of relevant factors on the results. WBC, white blood cell count; HTN, hypertension; UA, uric acid; TG, triglycerides; BUN, blood urea nitrogen; HDL, high-density lipoprotein cholesterol; PLT, platelet count; CRE, serum creatinine; CRP, C-reactive protein.

Interpretation of the Extreme Gradient Boosting Model

The SHAP value quantifies the contribution of data to the outcome in the XGBoost model; the size of the value reflects the influence of the corresponding eigenvalue on the outcome, and the positive and negative values reflect whether the results are promoted. Based on the established XGBoost model, the SHAP values of the five relevant factors showed how the feature affected the results, as shown in Figure 3.

Kaplan–Meier Analysis

The Kaplan–Meier analysis was used to confirm the effects of risk factors on the incidence of HF. The risk factors were stratified by the SHAP value when it was 0. As shown in Figure 4, the risk curve revealed that considering the influence of different outcome time, patients with high levels of WBC, CRP, PLT, old age, and hypertension had a higher incidence of HF, and the effect of UA was not obvious enough. The nomogram of the XGBoost model was constructed, as shown in Figure 5.

FIGURE 4
www.frontiersin.org

FIGURE 4. Risk curve of six factors based on the Kaplan–Meier analysis. (A), (B), (C), (D), (E), and (F) represented risk curves of age, hypertension, CRP, WBC, PLT, and UA, respectively. WBC, white blood cell count; UA, uric acid; CRP, C-reactive protein; PLT, platelet count.

FIGURE 5
www.frontiersin.org

FIGURE 5. Nomogram scaled by the coefficient of each risk factor selected by the XGBoost model. HTN, hypertension; WBC, white blood cell count; CVD, cardiovascular disease; CRP, C-reactive protein; PLT, platelet count.

Discussion

In this retrospective study, we established and validated a clinical prediction model to predict HF in HD patients based on the XGBoost algorithm. The results indicated that our model could effectively identify the individuals who suffer from HF using routinely clinical parameters. Our result indicated that XGBoost as a non-linear model had better performance than the logistic model and showed stronger ability in identification of risk factors.

In recent years, machine learning has been widely used in risk prediction and disease screening, which has obtained excellent performance (Kwon et al., 2020; Meng et al., 2020; Cobb et al., 2021; Koteluk et al., 2021; Kourou et al., 2021). Therefore, in this research, XGBoost, an integrated machine learning algorithm, was applied to identify the complex non-linear relationship between HF and clinical variables, as well as to evaluate the importance of the variables to the HF. Although traditional multivariable analysis methods have been applied for the identification of HF in HD patients (Gedfew et al., 2020; Bramania et al., 2021), to our knowledge, this was the first machine learning model for the prediction of HF. Our results showed that the performance of the XGBoost model was better than the traditional logistic regression model in prediction of HF. To make our XGBoost model interpretable, the feature importance and SHAP value were applied to evaluate the contribution of variables to HF. Except four risk factors including age, hypertension, WBC, and CRP found in the logistic regression model, a new risk factor, PLT, was found in the XGBoost model. This showed the advantages of our XGBoost mode, which could improve the accuracy of the classification and the efficiency of identification.

In addition to HF response, the performance of our model was evaluated by the Kaplan–Meier analysis. The differences between the Kaplan–Meier curves of different risk levels indicated that the risk factors selected by our XGBoost model were important prognostic factors of HF. The results of Kaplan–Meier analysis also showed that age more than 60, WBC more than 6.5 × 109, CRP more than 15 mg/L, PLT more than 250 × 109, and hypertension were independent risk factors which could increase the probability of HF. Moreover, based on the five risk factors selected by the XGBoost model and the Kaplan–Meier analysis, the nomogram (Yao et al., 2019) was designed to give an easy-to-use tool for the prediction of HF.

In our study, the proportion of hypertension in HF patients was significantly greater than that in the non-HF group, which is consistent with a recent study (Bramania et al., 2021). The analysis of characteristics showed that hypertension was a risk factor for HF events, which is consistent with the results of a recent study (Cozzolino et al., 2017). Patients with hypertension already had a huge cardiovascular burden before dialysis (Van Buren, 2016). There are pieces of evidence that dialysis treatment can lead to abnormal fluctuations in blood pressure and affect patient’s hemodynamics (Chou et al., 2018; Douvris et al., 2019). The vascular intima is more damaged by the abnormal fluctuations in the blood pressure, and HF is more likely to occur (Buren and Inrig, 2012).

Age is an important risk factor of HF. In old age, the shape and function of the vascular wall are changed due to oxidative stress, cell aging, and inflammation (Ghebre et al., 2016). In addition, with increasing age, body functions gradually decline, physiological compensatory function decreases, chronic diseases such as high blood pressure and diabetes occur frequently, and symptoms such as anemia, malnutrition, and chronic inflammation are prone to occur; these factors increase the risk of HF events (McHugh and Gil, 2018; Groenewegen et al., 2020). It has been reported that approximately 80% of HF patients were more than 60 years old, and the prevalence increased with age (Vigen et al., 2012). According to the results of our study, the risk of HF continued to increase from about 45 years old, which was consistent with the report from the American Heart Association (Virani et al., 2021).

WBC and CRP are widely used to reflect the inflammatory state in patients (Rienstra et al., 2012; Adamo et al., 2020). Inflammation and HF are strongly interconnected, and higher levels of inflammation markers indicated an increased mortality and morbidity of HF in patients (Ekdahl et al., 2017). Some studies have confirmed that inflammation markers are implicated in the development of HF because the immune system leads to deterioration of the structure and function of the cardiovascular system by the inflammatory response (Van Linthout and Tschope, 2017; Castillo et al., 2020). A study has demonstrated that high levels of WBC are closely related to the incidence of HF (Bekwelem et al., 2011). CRP is one of the recognized risk factors for CVD events, and a meta-analysis has reported that CRP plays an important role in the development of HF (Emerging Risk Factors Collaboration et al., 2010). In this study, our result was consistent with these previous findings. In addition, our results showed that other serum biomarkers, such as an anomalous level of PLT, were related to HF. Excessive PLT can promote inflammation, negatively affect the left ventricular function, and increase the risk of atherosclerosis; these diseases would increase the probability of HF (Gary et al., 2013; Chen and Yang, 2020).

All these data required in this study can be easily obtained through routine clinical examination, not limited by many additional conditions, and can make full use of existing resources. The XGBoost model showed a better prediction effect by using higher indicators such as AUC, accuracy, sensitivity, and atherosclerosis specificity. It can be a useful tool for doctors to evaluate the HF risk of HD patients and carry out personalized intervention in advance. However, there are still some limitations to our study. We only focused on the event of HF in HD population of China; it needs to be verified by different populations and races. In addition, this study is a retrospective examination which lacks longitudinal observation. Therefore, in further research, a longitudinal data analysis can help evaluate the changes in these risk factors over time and further validate the results in our model.

In conclusion, we developed and validated a clinical prediction model based on the XGBoost algorithm for HF in end-stage renal disease patients. The model had an excellent predictive performance and provided a useful tool for the early prediction of HF in HD patients.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by the Ethics Committee of the Third Affiliated Hospital of Southern Medical University. The patients/participants provided their written informed consent to participate in this study.

Author Contributions

PL and XY contributed to the concept and design of the sutdy. YIW and XM organized the database. YAW, XM, and GX wrote the first draft of the manuscript. CH and JS wrote sections fo the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.

Funding

This work was supported by the Third Affiliated Hospital of Sun Yat-Sen University, Clinical Research Program (P000-277), the Science and Technology Planning Project of Guangdong Province, China (2019B020228001), the Meizhou Science and Technology Project (2019B0203003), the Open Fund of State Key Laboratory of EC Prevention and Treatment (K2020-0010 and K2020-0011), and the National Natural Science Foundation of China (U1804262, 61632002). Zhongyuan Thound Talents Program (204200510003).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abdo, A. S. (2017). Diagnosis and Management of Heart Failure in Long-Term Dialysis Patients. Curr. Heart Fail. Rep. 14 (5), 404–409. doi:10.1007/s11897-017-0354-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Adamo, L., Rocha-Resende, C., Prabhu, S. D., and Mann, D. L. (2020). Reappraising the Role of Inflammation in Heart Failure. Nat. Rev. Cardiol. 17 (5), 269–285. doi:10.1038/s41569-019-0315-x

PubMed Abstract | CrossRef Full Text | Google Scholar

Akash, N. K., Giacomo, M., Enrico, R., and Claudio, R. (2014). The Role of Economies of Scale in the Cost of Dialysis across the World: a Macroeconomic Perspective. Nephrol. Dial. Transpl. 29 (4), 885–891. doi:10.1093/ndt/gft528

CrossRef Full Text | Google Scholar

Almeida, H. R. M., Santos, E. M. C., Dourado, K., Mota, C., and Peixoto, R. (2008). Malnutrition Associated with Inflammation in the Chronic Renal Patient on Hemodialysis. Rev. Assoc. Med. Bras (1992) 64 (9), 837–844. doi:10.1590/1806-9282.64.09.837

PubMed Abstract | CrossRef Full Text | Google Scholar

Bekwelem, W., Lutsey, P. L., Loehr, L. R., Agarwal, S. K., Astor, B. C., Guild, C., et al. (2011). White Blood Cell Count, C-Reactive Protein, and Incident Heart Failure in the Atherosclerosis Risk in Communities (ARIC) Study. Ann. Epidemiol. 21 (10), 739–748. doi:10.1016/j.annepidem.2011.06.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Bramania, P. K., Ruggajo, P. J., and Furia, F. F. (2021). Prevalence and Predictors of Heart Failure Among Patients on Maintenance Hemodialysis Therapy at Muhimbili National Hospital in Tanzania: a Cross-Sectional Study. Egypt. Heart J. 73 (1), 97. doi:10.1186/s43044-021-00223-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Buren, P., and Inrig, J. K. (2012). Hypertension and Hemodialysis: Pathophysiology and Outcomes in Adult and Pediatric Populations. Pediatr. Nephrol. 27 (3), 339–350. doi:10.1007/s00467-011-1775-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Castillo, E. C., Vazquez-Garza, E., Yee-Trejo, D., Garcia-Rivas, G., and Torre-Amione, G. (2020). What Is the Role of the Inflammation in the Pathogenesis of Heart Failure? Curr. Cardiol. Rep. 22 (11), 139. doi:10.1007/s11886-020-01382-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, J. F., Wu, C. C., Hsieh, C. Y., Li, Y. Y., Wang, T. M., and Liou, J. C. (2020). A Joint Evaluation of Impaired Cardiac Sympathetic Responses and Malnutrition-Inflammation Cachexia for Mortality Risks in Hemodialysis Patients. Front. Med. (Lausanne). 7, 99. doi:10.3389/fmed.2020.00099

PubMed Abstract | CrossRef Full Text | Google Scholar

Chang, W., Liu, Y., Xiao, Y., Xu, X., Zhou, S., Lu, X., et al. (2019). Probability Analysis of Hypertension-Related Symptoms Based on XGBoost and Clustering Algorithm. Appl. Sci. 9 (6), 1215. doi:10.3390/app9061215

CrossRef Full Text | Google Scholar

Chen, T., and Yang, M. (2020). Platelet-to-lymphocyte Ratio Is Associated with Cardiovascular Disease in Continuous Ambulatory Peritoneal Dialysis Patients. Int. Immunopharmacol 78, 106063. doi:10.1016/j.intimp.2019.106063

PubMed Abstract | CrossRef Full Text | Google Scholar

Chou, J. A., Streja, E., Nguyen, D. V., Rhee, C. M., Obi, Y., Inrig, J. K., et al. (2018). Intradialytic Hypotension, Blood Pressure Changes and Mortality Risk in Incident Hemodialysis Patients. Nephrol. Dial. Transpl. 33 (1), 149–159. doi:10.1093/ndt/gfx037

CrossRef Full Text | Google Scholar

Cobb, J. S., Seale, M. A., and Janorkar, A. V. (2021). Evaluation of Machine Learning Algorithms to Predict the Hydrodynamic Radii and Transition Temperatures of Chemo-Biologically Synthesized Copolymers. Comput. Biol. Med. 128, 104134. doi:10.1016/j.compbiomed.2020.104134

PubMed Abstract | CrossRef Full Text | Google Scholar

Cozzolino, M., Galassi, A., Pivari, F., Ciceri, P., and Conte, F. (2017). The Cardiovascular Burden in End-Stage Renal Disease. Contrib. Nephrol. 191, 44–57. doi:10.1159/000479250

PubMed Abstract | CrossRef Full Text | Google Scholar

Cozzolino, M., Mangano, M., Stucchi, A., Ciceri, P., Conte, F., and Galassi, A. (2018). Cardiovascular Disease in Dialysis Patients. Nephrol. Dial. Transplant. 33 (Suppl. l_3), iii28–iii34. doi:10.1093/ndt/gfy174

PubMed Abstract | CrossRef Full Text | Google Scholar

Dorairajan, S., Chockalingam, A., and Misra, M. (2010). Myocardial Stunning in Hemodialysis: what Is the Overall Message? Hemodial Int. 14 (4), 447–450. doi:10.1111/j.1542-4758.2010.00495.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Douvris, A., Zeid, K., Hiremath, S., Bagshaw, S. M., Wald, R., Beaubien-Souligny, W., et al. (2019). Mechanisms for Hemodynamic Instability Related to Renal Replacement Therapy: a Narrative Review. Intensive Care Med. 45 (10), 1333–1346. doi:10.1007/s00134-019-05707-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Ebong, I. A., Goff, D. C., Rodriguez, C. J., Chen, H., and Bertoni, A. G. (2014). Mechanisms of Heart Failure in Obesity. Obes. Res. Clin. Pract. 8 (6), e540–e548. doi:10.1016/j.orcp.2013.12.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Ekdahl, K. N., Soveri, I., Hilborn, J., Fellstrom, B., and Nilsson, B. (2017). Cardiovascular Disease in Haemodialysis: Role of the Intravascular Innate Immune System. Nat. Rev. Nephrol. 13 (5), 285–296. doi:10.1038/nrneph.2017.17

PubMed Abstract | CrossRef Full Text | Google Scholar

Emerging Risk Factors Collaboration Kaptoge, S., Di Angelantonio, E., Lowe, G., Pepys, M. B., Thompson, S. G., Collins, R., et al. (2010). C-reactive Protein Concentration and Risk of Coronary Heart Disease, Stroke, and Mortality: an Individual Participant Meta-Analysis. The Lancet 375 (9709), 132–140. doi:10.1016/S0140-6736(09)61717-7

CrossRef Full Text | Google Scholar

Gary, T., Pichler, M., Belaj, K., Hafner, F., Gerger, A., Froehlich, H., et al. (2013). Platelet-to-lymphocyte Ratio: a Novel Marker for Critical Limb Ischemia in Peripheral Arterial Occlusive Disease Patients. PLoS One 8 (7), e67688. doi:10.1371/journal.pone.0067688

PubMed Abstract | CrossRef Full Text | Google Scholar

Gedfew, M., Ayenew, T., Mengst, B., Yirga, T., Zelalem, M., Worku, Y., et al. (2020). Incidence and Predictors of Congestive Heart Failure Among Hemodialysis Patients at Felege Hiote Referral Hospital, Northwest Ethiopia, 2020: Retrospective Cohort Study. Res. Rep. Clin. Cardiol. 11, 65–79. doi:10.2147/rrcc.s274942

CrossRef Full Text | Google Scholar

Ghebre, Y. T., Yakubov, E., Wong, W. T., and Krishnamurthy, P. (2016). Vascular Aging: Implications for Cardiovascular Disease and Therapy. Transl Med. 6 (04). doi:10.4172/2161-1025.1000183

PubMed Abstract | CrossRef Full Text | Google Scholar

Groenewegen, A., Rutten, F. H., Mosterd, A., and Hoes, A. W. (2020). Epidemiology of Heart Failure. Eur. J. Heart Fail. 22 (8), 1342–1356. doi:10.1002/ejhf.1858

PubMed Abstract | CrossRef Full Text | Google Scholar

House, A. A., Wanner, C., Sarnak, M. J., Pina, I. L., McIntyre, C. W., Komenda, P., et al. (2019). Heart Failure in Chronic Kidney Disease: Conclusions from a Kidney Disease: Improving Global Outcomes (KDIGO) Controversies Conference. Kidney Int. 95 (6), 1304–1317. doi:10.1016/j.kint.2019.02.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Kagiyama, N., Shrestha, S., Farjo, P. D., and Sengupta, P. P. (2019). Artificial Intelligence: Practical Primer for Clinical Research in Cardiovascular Disease. J. Am. Heart Assoc. 8 (17), e012788. doi:10.1161/JAHA.119.012788

PubMed Abstract | CrossRef Full Text | Google Scholar

Koteluk, O., Wartecki, A., Mazurek, S., Kolodziejczak, I., and Mackiewicz, A. (2021). How Do Machines Learn? Artificial Intelligence as a New Era in Medicine. J. Pers Med. 11 (1), 32. doi:10.3390/jpm11010032

PubMed Abstract | CrossRef Full Text | Google Scholar

Kourou, K., Manikis, G., Poikonen-Saksela, P., Mazzocco, K., Pat-Horenczyk, R., Sousa, B., et al. (2021). A Machine Learning-Based Pipeline for Modeling Medical, Socio-Demographic, Lifestyle and Self-Reported Psychological Traits as Predictors of Mental Health Outcomes after Breast Cancer Diagnosis: An Initial Effort to Define Resilience Effects. Comput. Biol. Med. 131, 104266. doi:10.1016/j.compbiomed.2021.104266

PubMed Abstract | CrossRef Full Text | Google Scholar

Kwon, E., Cho, M., Kim, H., and Son, H. S. (2020). A Study on Host Tropism Determinants of Influenza Virus Using Machine Learning. Curr. Bioinform 15 (2), 121–134. doi:10.2174/1574893614666191104160927

CrossRef Full Text | Google Scholar

Li, S., and Zhang, X. (2020). Research on Orthopedic Auxiliary Classification and Prediction Model Based on XGBoost Algorithm. Neural Comput. Appl. 32 (59), 1971–1979. doi:10.1007/s00521-019-04378-4

CrossRef Full Text | Google Scholar

Liu, M., Li, X. C., Lu, L., Cao, Y., Sun, R. R., Chen, S., et al. (2014). Cardiovascular Disease and its Relationship with Chronic Kidney Disease. Eur. Rev. Med. Pharmaco 18 (19), 2918–2926.

Google Scholar

McHugh, D., and Gil, J. (2018). Senescence and Aging: Causes, Consequences, and Therapeutic Avenues. J. Cel Biol 217 (1), 65–77. doi:10.1083/jcb.201708092

PubMed Abstract | CrossRef Full Text | Google Scholar

Meng, C., Zhang, J., Ye, X., Guo, F., and Zou, Q. (2020). Review and Comparative Analysis of Machine Learning-Based Phage Virion Protein Identification Methods. Biochim. Biophys. Acta Proteins Proteom 1868 (6), 140406. doi:10.1016/j.bbapap.2020.140406

PubMed Abstract | CrossRef Full Text | Google Scholar

Mpanya, D., Celik, T., Klug, E., and Ntsinjana, H. (2021). Machine Learning and Statistical Methods for Predicting Mortality in Heart Failure. Heart Fail. Rev. 26 (3), 545–552. doi:10.1007/s10741-020-10052-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Ouwerkerk, W., Voors, A. A., and Zwinderman, A. H. (2014). Factors Influencing the Predictive Power of Models for Predicting Mortality And/or Heart Failure Hospitalization in Patients with Heart Failure. JACC Heart Fail. 2 (5), 429–436. doi:10.1016/j.jchf.2014.04.006

PubMed Abstract | CrossRef Full Text | Google Scholar

Rangaswami, J., and McCullough, P. A. (2018). Heart Failure in End-Stage Kidney Disease: Pathophysiology, Diagnosis, and Therapeutic Strategies. Semin. Nephrol. 38 (6), 600–617. doi:10.1016/j.semnephrol.2018.08.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Rapa, S. F., Di Iorio, B. R., Campiglia, P., Heidland, A., and Marzocco, S. (2019). Inflammation and Oxidative Stress in Chronic Kidney Disease-Potential Therapeutic Role of Minerals, Vitamins and Plant-Derived Metabolites. Int. J. Mol. Sci. 21 (1), 263. doi:10.3390/ijms21010263

PubMed Abstract | CrossRef Full Text | Google Scholar

Rienstra, M., Sun, J. X., Magnani, J. W., Sinner, M. F., Lubitz, S. A., Sullivan, L. M., et al. (2012). White Blood Cell Count and Risk of Incident Atrial Fibrillation (From the Framingham Heart Study). Am. J. Cardiol. 109 (4), 533–537. doi:10.1016/j.amjcard.2011.09.049

PubMed Abstract | CrossRef Full Text | Google Scholar

Rodriguez-Perez, R., and Bajorath, J. (2020). Interpretation of Machine Learning Models Using Shapley Values: Application to Compound Potency and Multi-Target Activity Predictions. J. Comput. Aided Mol. Des. 34 (10), 1013–1026. doi:10.1007/s10822-020-00314-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Samanta, R., Chan, C., and Chauhan, V. S. (2019). Arrhythmias and Sudden Cardiac Death in End Stage Renal Disease: Epidemiology, Risk Factors, and Management. Can. J. Cardiol. 35 (9), 1228–1240. doi:10.1016/j.cjca.2019.05.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Schefold, J. C., Filippatos, G., Hasenfuss, G., Anker, S. D., and von Haehling, S. (2016). Heart Failure and Kidney Dysfunction: Epidemiology, Mechanisms and Management. Nat. Rev. Nephrol. 12 (10), 610–623. doi:10.1038/nrneph.2016.113

PubMed Abstract | CrossRef Full Text | Google Scholar

Sun, C. Y., Sung, J. M., Wang, J. D., Li, C. Y., Kuo, Y. T., Lee, C. C., et al. (2019). A Comparison of the Risk of Congestive Heart Failure-Related Hospitalizations in Patients Receiving Hemodialysis and Peritoneal Dialysis - A Retrospective Propensity Score-Matched Study. PLoS One 14 (10). doi:10.1371/journal.pone.0223336

CrossRef Full Text | Google Scholar

Van Buren, P. N. (2016). Evaluation and Treatment of Hypertension in End-Stage Renal Disease Patients on Hemodialysis. Curr. Cardiol. Rep. 18 (12), 125. doi:10.1007/s11886-016-0805-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Linthout, S., and Tschope, C. (2017). Inflammation - Cause or Consequence of Heart Failure or Both? Curr. Heart Fail. Rep. 14 (4), 251–265. doi:10.1007/s11897-017-0337-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Vigen, R., Maddox, T. M., and Allen, L. A. (2012). Aging of the United States Population: Impact on Heart Failure. Curr. Heart Fail. Rep. 9 (4), 369–374. doi:10.1007/s11897-012-0114-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Virani, S. S., Alonso, A., Aparicio, H. J., Benjamin, E. J., Bittencourt, M. S., Callaway, C. W., et al. (2021). Heart Disease and Stroke Statistics-2021 Update: A Report from the American Heart Association. Circulation 143 (8), e254–e743. doi:10.1161/CIR.0000000000000950

PubMed Abstract | CrossRef Full Text | Google Scholar

Voors, A. A., Ouwerkerk, W., Zannad, F., van Veldhuisen, D. J., Samani, N. J., Ponikowski, P., et al. (2017). Development and Validation of Multivariable Models to Predict Mortality and Hospitalization in Patients with Heart Failure. Eur. J. Heart Fail. 19 (5), 627–634. doi:10.1002/ejhf.785

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, A. Y., and Sanderson, J. E. (2011). Current Perspectives on Diagnosis of Heart Failure in Long-Term Dialysis Patients. Am. J. Kidney Dis. 57 (2), 308–319. doi:10.1053/j.ajkd.2010.07.019

PubMed Abstract | CrossRef Full Text | Google Scholar

Yao, Z., Zheng, Z., Ke, W., Wang, R., Mu, X., Sun, F., et al. (2019). Prognostic Nomogram for Bladder Cancer with Brain Metastases: a National Cancer Database Analysis. J. Transl Med. 17 (1), 411. doi:10.1186/s12967-019-2109-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, X., Zhou, J., Zhao, M., Yi, C., Duan, Q., Zhou, W., et al. (2020). Exploiting XG Boost for Predicting Enhancer-Promoter Interactions. Curr. Bioinform 15 (9), 1036–1045. doi:10.2174/1574893615666200120103948

CrossRef Full Text | Google Scholar

Keywords: machine learning, extreme gradient boosting, heart failure prediction, hemodialysis, risk factors

Citation: Wang Y, Miao X, Xiao G, Huang C, Sun J, Wang Y, Li P and You X (2022) Clinical Prediction of Heart Failure in Hemodialysis Patients: Based on the Extreme Gradient Boosting Method. Front. Genet. 13:889378. doi: 10.3389/fgene.2022.889378

Received: 04 March 2022; Accepted: 15 March 2022;
Published: 26 April 2022.

Edited by:

Quan Zou, University of Electronic Science and Technology of China, China

Reviewed by:

Leimin Wang, China University of Geosciences Wuhan, China
Yansen Su, Anhui University, China

Copyright © 2022 Wang, Miao, Xiao, Huang, Sun, Wang, Li and You. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Panlong Li, 2019021@zzuli.edu.cn; Xu You, maja_1983@126.com

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.