
ORIGINAL RESEARCH article
Front. Neurosci., 25 October 2021
Sec. Brain Imaging Methods
Volume 15 - 2021 | https://doi.org/10.3389/fnins.2021.773208
This article is part of the Research Topic: Advanced Computational Intelligence Methods for Processing Brain Imaging Data
Detecting and treating cerebrovascular diseases are essential for the survival of patients with chronic kidney disease (CKD). Machine learning algorithms can be used to effectively predict stroke risk in patients with end-stage renal disease (ESRD). An imbalance in the amount of collected data associated with different risk levels can influence the classification task. Therefore, we propose the use of a kernelized k-local hyperplane nearest-neighbor model (KHKNN) for the effective prediction of stroke risk in patients with ESRD. We compared our proposed method with other conventional machine learning methods, which revealed that our method could effectively perform the task of classifying stroke risk.
Chronic kidney disease (CKD) has become a prominent disease affecting global health. According to existing research, the global incidence of CKD is approximately 8–16% and has been increasing yearly (Jha et al., 2013). Cerebrovascular diseases, such as stroke, represent major CKD complications that lead to neurological dysfunction and death, with negative impacts on prognosis in patients with CKD. Cerebral apoplexy, which is a primary cause of death among patients with CKD, refers to a series of adverse events, including cerebral ischemia, hypoxia, and cerebral dysfunction, caused by acute cerebral vascular rupture or acute cerebrovascular embolism (Kelly and Rothwell, 2020). A cohort study showed that CKD progression and a decline in the glomerular filtration rate increased the stroke risk among patients with CKD by nearly 40%, accompanied by a significant increase in the mortality rate (Toyoda and Ninomiya, 2014). Therefore, exploring the risk factors associated with stroke among the CKD population and identifying effective early interventions are necessary steps to reducing morbidity and mortality due to stroke.
Hypertension, diabetes, and dyslipidemia are traditional risk factors that contribute to the development of cerebrovascular diseases in patients with CKD. In addition, recent studies have revealed non-traditional risk factors, such as inflammation, oxidative stress, and CKD-mineral bone disease (CKD-MBD), that impact the occurrence and development of cerebrovascular diseases among patients with CKD. These non-traditional risk factors accelerate a series of pathological processes, such as cerebrovascular endothelial injury and sclerosis, in patients with CKD, leading to cerebrovascular calcification, further changing hemodynamics, and ultimately causing cerebrovascular events (Allen and Bayraktutan, 2009).
Our previous work identified abnormal FGF23 and Klotho levels, inflammatory status, and malnutrition as non-traditional risk factors for vascular calcification and CKD-MBD in patients with end-stage renal failure (Maraj et al., 2018). Using machine learning methods to analyze the risk factors of CKD-MBD in patients with end-stage renal failure, we found that an elevated serum FGF23 level in patients with ESRD is an independent risk factor for abdominal aortic calcification (Liu et al., 2021). Recent studies have also identified FGF23 as an independent risk factor for cerebrovascular diseases in both CKD and non-CKD populations (Wright et al., 2016). The CHADS2 (congestive heart failure, hypertension, age ≥ 75 years, diabetes mellitus, stroke) and CHA2DS2-VASc (congestive heart failure, hypertension, age ≥ 75 years, diabetes mellitus, stroke or transient ischemic attack, vascular disease, age 65 to 74 years, sex category) scores are currently well-recognized methods for predicting the risk of stroke in patients with CKD (Hsu et al., 2020). Therefore, building on these findings, we used machine learning algorithms to develop models that assess the validity of combining traditional and non-traditional risk factors with the CHADS2 stroke scoring tool and an abdominal aortic calcification score for predicting stroke risk in patients with ESRD. Such models could help clinicians identify cerebrovascular disease and intervene early on the basis of the assessed risk factors, potentially delaying the occurrence and development of stroke, reducing morbidity and mortality, and improving prognosis among patients with ESRD.
We calculated the CHADS2 score as follows (Asher et al., 2021): 1 point each was assigned for age ≥ 75 years, hypertension, diabetes mellitus, and congestive heart failure, and 2 points were assigned for a transient ischemic attack or a history of stroke. In addition, we calculated the CHA2DS2-VASc score as follows: 1 point each was assigned for congestive heart failure, hypertension, age between 65 and 74 years, diabetes mellitus, female sex, and vascular disease, whereas 2 points each were assigned for a history of stroke and age ≥ 75 years. CKD was defined as an estimated glomerular filtration rate (eGFR) < 60 mL/min/1.73 m2 combined with kidney damage lasting for longer than 3 months, and was classified as stage 3, 4, or 5 based on the eGFR level (30–59, 15–29, or <15 mL/min/1.73 m2, respectively). Patients were categorized into two groups according to their CHADS2 and CHA2DS2-VASc risk scores: (1) a low-risk group (score of 0–1) and (2) a high-risk group (score ≥ 2).
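The scoring rules above can be written down in a few lines. The following is a minimal sketch, not the authors' implementation: the `Patient` record and its field names are hypothetical, and a single `stroke_or_tia` flag is assumed to drive the 2-point item in both scores.

```python
from dataclasses import dataclass


@dataclass
class Patient:
    # Hypothetical record layout; field names are illustrative only.
    age: int
    female: bool
    heart_failure: bool
    hypertension: bool
    diabetes: bool
    stroke_or_tia: bool
    vascular_disease: bool


def chads2(p: Patient) -> int:
    """1 point each for CHF, hypertension, age >= 75, diabetes; 2 for stroke/TIA."""
    score = sum([p.heart_failure, p.hypertension, p.age >= 75, p.diabetes])
    return score + (2 if p.stroke_or_tia else 0)


def cha2ds2_vasc(p: Patient) -> int:
    """1 point each for CHF, hypertension, age 65-74, diabetes, female sex and
    vascular disease; 2 points each for a stroke history and age >= 75."""
    score = sum([p.heart_failure, p.hypertension, 65 <= p.age <= 74,
                 p.diabetes, p.female, p.vascular_disease])
    score += 2 if p.stroke_or_tia else 0
    score += 2 if p.age >= 75 else 0
    return score


def risk_group(score: int) -> str:
    """Low risk for a score of 0-1, high risk for a score of 2 or more."""
    return "high" if score >= 2 else "low"


example = Patient(age=70, female=True, heart_failure=False, hypertension=True,
                  diabetes=False, stroke_or_tia=False, vascular_disease=False)
print(chads2(example), risk_group(chads2(example)))
print(cha2ds2_vasc(example), risk_group(cha2ds2_vasc(example)))
```

The same patient can fall into different risk groups under the two scores, because CHA2DS2-VASc adds points for female sex and for age 65 to 74 years.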
The Geriatric Nutrition Risk Index (GNRI) = [14.89 × serum albumin (g/dl)] + [41.7 × (actual body weight/ideal body weight)] (Yamada et al., 2020). Serum levels of intact FGF23, klotho, fetuin-A, and interleukin-6 were determined using two-site enzyme-linked immunosorbent assays (Elabscience Biotech, Wuhan, China).
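As a worked instance of the GNRI formula, the sketch below evaluates it for one set of inputs. The function name, argument names, and example values are hypothetical; how the ideal body weight is derived is not stated here, so it is taken as an input.

```python
def gnri(albumin_g_dl: float, actual_weight_kg: float, ideal_weight_kg: float) -> float:
    """GNRI = 14.89 * serum albumin (g/dl) + 41.7 * (actual / ideal body weight)."""
    if ideal_weight_kg <= 0:
        raise ValueError("ideal body weight must be positive")
    return 14.89 * albumin_g_dl + 41.7 * (actual_weight_kg / ideal_weight_kg)


# Illustrative values only: albumin 4.0 g/dl, actual weight equal to ideal weight.
print(round(gnri(4.0, 60.0, 60.0), 2))
```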
All patients underwent lateral lumbar X-ray examinations within 1 week of biochemical blood examinations to assess abdominal aortic calcification corresponding to L1 to L4 (de Bie et al., 2017). Each patient was scored based on the length of the calcified plaques identified on the anterior and posterior walls of the abdominal aorta, with each segment scored between 0 and 3 points, as follows: 0 points for no calcification; 1 point for calcification covering less than one-third of the arterial wall length; 2 points for calcification covering between one-third and two-thirds of the arterial wall length; and 3 points for calcification covering more than two-thirds of the arterial wall length. Each lumbar segment was scored separately for both the posterior and anterior walls, resulting in a total score of 0–24 points. Table 1 shows the demographic and clinical details of our data set.
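The calcification score described above is a sum of eight graded wall segments. A minimal sketch follows, assuming each wall is summarized as the calcified fraction of its length (a hypothetical input format) and that the boundary values of exactly one-third and two-thirds fall into the 2-point grade, which the text does not specify.

```python
SEGMENTS = ("L1", "L2", "L3", "L4")


def grade(calcified_fraction: float) -> int:
    """Map the calcified fraction of one wall segment to 0-3 points."""
    if not 0.0 <= calcified_fraction <= 1.0:
        raise ValueError("fraction must lie in [0, 1]")
    if calcified_fraction == 0.0:
        return 0
    if calcified_fraction < 1 / 3:
        return 1
    if calcified_fraction <= 2 / 3:  # boundary handling is an assumption
        return 2
    return 3


def aac_score(anterior: dict, posterior: dict) -> int:
    """Sum the grades of the anterior and posterior walls over L1-L4 (0-24)."""
    return sum(grade(anterior[s]) + grade(posterior[s]) for s in SEGMENTS)


# Illustrative patient: anterior wall increasingly calcified, posterior wall clear.
anterior = {"L1": 0.0, "L2": 0.2, "L3": 0.5, "L4": 0.9}
posterior = {s: 0.0 for s in SEGMENTS}
print(aac_score(anterior, posterior))
```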
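Vincent and Bengio developed an improved version of the k-nearest-neighbor algorithm (KNN), called the k-local hyperplane (LH) nearest-neighbor algorithm (HKNN) (Vincent and Bengio, 2002). HKNN estimates the distance from the test sample to the LH of each class, where each LH is built from the k samples of that class nearest to the test sample. Suppose there are C classes; HKNN obtains the prediction for the test sample x by finding the minimum distance over the C LHs. For the c-th class, the c-th LH is based on the k nearest neighbors of x among the training samples that belong to the c-th class. The c-th hyperplane is expressed as follows:
$$LH_c(x) = \left\{ s \;\middle|\; s = \bar{N}_c + \sum_{i=1}^{k} \alpha_{c,i} V_i^c,\ \alpha_c \in \mathbb{R}^{k} \right\} \tag{1}$$
where $\bar{N}_c = \frac{1}{k}\sum_{i=1}^{k} N_i^c$ is the centroid of the k neighbors of x in class c; $N_i^c$ is the ith neighbor of the test sample x in class c; and $V_i^c = N_i^c - \bar{N}_c$. The objective function of the test sample x to the cth LH is as follows:
$$\min_{\alpha_c} \left\| x - \bar{N}_c - \sum_{i=1}^{k} \alpha_{c,i} V_i^c \right\|_2^2 + \lambda \left\| \alpha_c \right\|_2^2 \tag{2}$$
where λ is the parameter of the regularization term. $\alpha_c$ can be calculated as:
$$\alpha_c = \left( V_c^{T} V_c + \lambda I \right)^{-1} V_c^{T} U_c \tag{3}$$
where $U_c = x - \bar{N}_c$ and $V_c = [V_1^c, \ldots, V_k^c]$. The predictive result for test sample x is defined as:
$$\hat{y}(x) = \arg\min_{c \in \{1, \ldots, C\}} \left\| U_c - V_c \alpha_c \right\|_2^2 \tag{4}$$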
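To further improve the performance of the model, we applied feature mapping and the kernel trick to HKNN to obtain a kernelized HKNN model (KHKNN). Let x map to f by ϕ: χ → F and set $\Phi(V_c) = [\phi(V_1^c), \ldots, \phi(V_k^c)]$. With the centered test sample $U_c = x - \bar{N}_c$, Eq. 2 can then be rewritten as follows:
$$\min_{\alpha_c} \left\| \phi(U_c) - \Phi(V_c)\alpha_c \right\|_2^2 + \lambda \left\| \alpha_c \right\|_2^2 \tag{5}$$
Setting the derivative of Eq. 5 with respect to $\alpha_c$ to zero, we obtained:
$$\alpha_c = \left( K(V_c, V_c) + \lambda I \right)^{-1} K(V_c, U_c) \tag{6}$$
where $K(V_c, V_c) \in \mathbb{R}^{k \times k}$ is a Gram matrix calculated by the radial basis function (RBF), and $K(V_c, U_c) \in \mathbb{R}^{k \times 1}$ is a vector. The RBF is defined as:
$$K(x_i, x_j) = \exp\left( -\gamma \left\| x_i - x_j \right\|_2^2 \right) \tag{7}$$
where γ is the Gaussian kernel bandwidth.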
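To make the role of the Gram matrix concrete, the sketch below builds one for a handful of vectors. It assumes the common RBF form K(u, v) = exp(−γ‖u − v‖²); the function names and sample vectors are illustrative and are not taken from the study.

```python
import math


def rbf(u, v, gamma):
    """RBF kernel value exp(-gamma * squared Euclidean distance)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * squared_distance)


def gram_matrix(vectors, gamma):
    """k x k matrix of pairwise kernel values between the given vectors."""
    return [[rbf(u, v, gamma) for v in vectors] for u in vectors]


# Three illustrative 2-D vectors and gamma = 2^-3.
neighbors = [[0.0, 0.0], [2.0, 0.0], [0.0, 1.0]]
for row in gram_matrix(neighbors, gamma=2.0 ** -3):
    print([round(value, 4) for value in row])
```

The matrix is symmetric with ones on the diagonal, and a larger γ drives the off-diagonal entries toward zero, so only very close neighbors remain similar in the feature space.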
To avoid overfitting, KHKNN employs two strategies: (1) For each test sample, KHKNN separately constructs a local hyperplane for each category through a linear representation of the neighborhood samples, which alleviates the sensitivity to the number of neighbors k. (2) When constructing the hyperplane, an L2 regularization term is added on the coefficients. A schematic diagram of KHKNN is shown in Figure 1.
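The two strategies can be illustrated with the un-kernelized local-hyperplane classifier of Vincent and Bengio (2002). The following is a minimal sketch under stated assumptions, not the authors' code: it works in the original feature space (no kernel), uses a small hand-written linear solver to stay dependency-free, and the toy data and function names are hypothetical. It requires λ > 0 so that the regularized system is always solvable.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def hyperplane_distance(x, class_samples, k, lam):
    """Distance from x to the regularized local hyperplane of one class."""
    # Strategy 1: use the k nearest samples of this class only.
    neighbors = sorted(class_samples, key=lambda s: math.dist(s, x))[:k]
    k, dim = len(neighbors), len(x)
    centroid = [sum(s[d] for s in neighbors) / k for d in range(dim)]
    v = [[s[d] - centroid[d] for d in range(dim)] for s in neighbors]
    u = [x[d] - centroid[d] for d in range(dim)]
    # Strategy 2: the L2 term adds lam to the diagonal of the normal equations.
    a = [[dot(v[i], v[j]) + (lam if i == j else 0.0) for j in range(k)]
         for i in range(k)]
    alpha = solve(a, [dot(v[i], u) for i in range(k)])
    residual = [u[d] - sum(alpha[i] * v[i][d] for i in range(k))
                for d in range(dim)]
    return math.sqrt(dot(residual, residual))


def hknn_predict(x, train, k=2, lam=1.0):
    """train maps each class label to a list of feature vectors."""
    return min(train, key=lambda c: hyperplane_distance(x, train[c], k, lam))


# Hypothetical two-class toy data.
train = {"low": [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]],
         "high": [[5.0, 5.0], [6.0, 5.0], [5.0, 6.0]]}
print(hknn_predict([0.2, 0.1], train))
print(hknn_predict([5.5, 5.2], train, k=3))
```

Because the prediction depends on the distance to a fitted hyperplane instead of a vote among k samples, a moderate change in k shifts the hyperplane only slightly, which is the intuition behind the reduced sensitivity to k.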
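In this study, accuracy (ACC), sensitivity (SN), specificity (SP), positive predictive value (PE), negative predictive value (NPV), the harmonic mean of the PE and sensitivity (Fscore), and Matthews correlation coefficient (MCC) were calculated as follows:
$$ACC = \frac{TP + TN}{TP + TN + FP + FN}$$
$$SN = \frac{TP}{TP + FN}, \qquad SP = \frac{TN}{TN + FP}$$
$$PE = \frac{TP}{TP + FP}, \qquad NPV = \frac{TN}{TN + FN}$$
$$Fscore = \frac{2 \times PE \times SN}{PE + SN}$$
$$MCC = \frac{TP \times TN - FP \times FN}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}}$$
where TP, TN, FN, and FP are the number of true positive, true negative, false negative, and false positive results, respectively. The high-risk patients are positive samples, whereas the low-risk patients are negative samples.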
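These measures follow directly from the four confusion-matrix counts. The sketch below uses the standard definitions; the counts in the example are hypothetical and are not results from this study, and every denominator except the MCC one is assumed to be non-zero.

```python
import math


def classification_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    """Standard binary classification measures from confusion-matrix counts."""
    sn = tp / (tp + fn)   # sensitivity
    sp = tn / (tn + fp)   # specificity
    pe = tp / (tp + fp)   # positive predictive value
    npv = tn / (tn + fn)  # negative predictive value
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "ACC": (tp + tn) / (tp + tn + fp + fn),
        "SN": sn,
        "SP": sp,
        "PE": pe,
        "NPV": npv,
        "Fscore": 2 * pe * sn / (pe + sn),
        # MCC is conventionally set to 0 when a marginal total is empty.
        "MCC": (tp * tn - fp * fn) / denominator if denominator else 0.0,
    }


# Hypothetical counts for illustration only.
for name, value in classification_metrics(tp=8, tn=6, fp=2, fn=4).items():
    print(f"{name}: {value:.4f}")
```

MCC uses all four counts, so on an imbalanced data set it is not inflated by the majority class in the way ACC can be.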
In our model, three parameters (k, γ, and λ) must be selected. To make the model robust, we set λ to 1. The optimal values of k and γ were selected using a grid search. The value of k ranges from 2 to 8 (maximum number of negative samples), with a step of 1. The value of γ ranges from $2^{-5}$ to $2^{5}$, with a multiplicative step of $2^{1}$. The results are shown in Figure 2, which shows the predictive performance of the model using different parameters. When k and γ are 4 and $2^{-3}$, respectively, the best MCC (0.5393) value was obtained.
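The grid search can be sketched as an exhaustive loop over the candidate pairs. This assumes the γ grid is the powers of two from 2^-5 to 2^5; `toy_score` is a hypothetical stand-in for the validated MCC of a model trained with a given (k, γ), shaped so that its maximum sits at one known grid point.

```python
import math
from itertools import product

K_VALUES = list(range(2, 9))                     # k = 2, 3, ..., 8
GAMMA_VALUES = [2.0 ** e for e in range(-5, 6)]  # 2^-5, 2^-4, ..., 2^5


def grid_search(score_fn):
    """Return the (k, gamma) pair with the highest score, and that score."""
    best = max(product(K_VALUES, GAMMA_VALUES), key=lambda kg: score_fn(*kg))
    return best, score_fn(*best)


def toy_score(k, gamma):
    # Hypothetical scoring surface; a real run would train and validate
    # the classifier for each (k, gamma) and return its MCC.
    return -((k - 4) ** 2) - (math.log2(gamma) + 3) ** 2


best_params, best_score = grid_search(toy_score)
print(best_params, best_score)
```

The grid holds 7 × 11 = 77 candidate pairs, which is small enough to evaluate exhaustively even when each evaluation is a full resampling run.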
We compared the performance of several traditional classifiers with our method using the obtained data set. The results are shown in Table 2. The compared classifiers included an artificial neural network with backpropagation (ANN-BP), support vector machines (SVM), the Takagi-Sugeno-Kang fuzzy system (TSK-FS), KNN, HKNN, and KHKNN. For small samples, KNN (ACC: 86.67%), HKNN (ACC: 85.00%), and TSK-FS (ACC: 78.33%) achieved good results. Our method (KHKNN) achieved the best MCC (0.5393) and ACC (89.67%) on this small data set. KHKNN was found to have an SN of 94.00%, with SP reaching 60.00%. Our method achieved a relatively balanced performance for the recognition of both positive and negative samples.
Table 2. Comparison of performance between our method and other existing methods using the PDB1075 data set (Jackknife test evaluation).
Stroke is one of the most serious complications among patients with CKD, leading to brain dysfunction and even death. Over the past 10 years, scholars have conducted a large number of mechanistic studies and epidemiological investigations exploring the kidney–brain interaction. The results of these studies have indicated that the kidney and brain have similar anatomical and functional characteristics. For example, both organs feature an arterial system that automatically adjusts perfusion pressure to ensure a continuous and relatively stable blood flow. In patients with CKD, cerebrovascular sclerosis occurs due to calcifications that form in the arterial system, disrupting the autoregulation function and allowing cerebrovascular events to occur (Lau et al., 2017).
The risk of stroke in patients with CKD is much higher than that in patients without CKD (Chen et al., 2012), and the stroke risk increases further as renal functional defects progress to ESRD. In recent years, in addition to traditional risk factors, such as hypertension, diabetes, and dyslipidemia, the influence of non-traditional risk factors on the occurrence of cerebrovascular calcification in patients with CKD has gained increasing attention, including inflammation, malnutrition, and the FGF23/klotho axis. The results of previous studies performed at our center have indicated that abnormal FGF23, klotho, and fetuin-A levels and malnutrition represent risk factors for abdominal aortic calcification in patients with ESRD (Maraj et al., 2018). FGF23 has been shown to play an important role in phosphate regulation. Klotho is the receptor protein for FGF23, which participates in regulating bone, calcium, and phosphorus metabolism; protecting the integrity of blood vessels; and inhibiting vascular calcification through the formation of FGF23-klotho complexes. The FGF23/klotho axis is a key participant in CKD-MBD and is closely related to vascular calcification and cerebrovascular diseases (Moldovan et al., 2014). Relevant studies have shown that an elevated FGF23 level is a risk factor for ischemic and hemorrhagic stroke in patients with CKD (Wright et al., 2014).
The CHADS2 and CHA2DS2-VASc scores are considered to be effective evaluation tools for predicting the risk of ischemic stroke in patients with CKD. Among patients with a high risk of ischemic cerebrovascular accidents, the administration of secondary prevention agents, such as anticoagulant and antithrombotic agents, when indicated by the cerebral ischemia score warning system, has been shown to greatly reduce the incidence of stroke and improve prognosis (Toyoda et al., 2014). The results of this study revealed a correlation between the CHADS2 and CHA2DS2-VASc scores and vascular calcification in patients with ESRD, which indirectly suggests the existence of an important relationship between vascular calcification and ischemic stroke. Moreover, when we combined the ischemic stroke scoring tool with stroke risk factors, such as vascular calcification, to predict the risk of stroke in CKD, the prediction was more reasonable and had a stronger scientific basis than the use of the stroke scoring tool alone.
Because the sample size is very small, a simple machine learning model is preferred for this classification problem, and KNN and SVM are suitable candidates. However, the KNN algorithm is very sensitive to the number of neighboring samples, and in the original feature space the model cannot achieve satisfactory prediction results. Therefore, we proposed KHKNN on the basis of KNN and HKNN to address these two problems. As reported in the results, KHKNN obtained good prediction results. KHKNN separately constructs a local hyperplane for each category for a given test sample, and the prediction is determined by the minimum distance from the test sample to the hyperplane of each category. This alleviates the parameter sensitivity of KNN and helps avoid overfitting.
In our study, KHKNN was employed to predict the risk of cerebrovascular disease among patients with ESRD. KHKNN estimates the distance from the test sample in each class to its corresponding LH in a high-dimensional feature space. Unlike KNN, KHKNN is not as sensitive to the parameter k, and its prediction performance is better than that of the ANN-BP and SVM models for small data sets. Compared with other models (ANN-BP, SVM, TSK-FS, KNN, and HKNN), our model achieved the best MCC (0.5393) and ACC (89.67%) values, showing that our method has good robustness and may be useful for determining clinical risk in the future.
We proposed a KHKNN method to filter noise samples, improve the generalization ability of the model, and obtain good results. Although our method achieves a relatively balanced performance for the recognition of positive and negative samples, the following limitations and implementation notes must be acknowledged. (1) The sample size must be further increased to minimize prediction bias. (2) No detailed analysis was performed to examine the contribution of various patient factors. (3) Although the kernel function was used to map the original space to further improve the performance, the interpretability of the model was reduced; fuzzy systems will be introduced in the future to improve interpretability. (4) The k-dimensional tree is employed to speed up the search for the nearest-neighbor samples, and parallel computing can further increase the search speed. At present, artificial intelligence technology has been used for large-scale medical information processing (Jian et al., 2019; Guo et al., 2021; Jiang et al., 2021a, b; Zhang et al., 2021a, b) and bioinformatics (Qian et al., 2021; Zou et al., 2021), with good performance. In the future, we will use artificial intelligence methods to solve additional clinical problems.
The original contributions presented in the study are included in the article/supplementary material; further inquiries can be directed to the corresponding author/s.
The studies involving human participants were reviewed and approved by the Human Ethics Committee (Wuxi People’s Hospital Ethics Committee, No. KS2019041). The patients/participants provided their written informed consent to participate in this study.
XL: methodology, data curation, and writing – original draft preparation. XZ: methodology and writing – original draft preparation. WS and XG: methodology and data curation. YD: methodology and software. LW: methodology, supervision, and writing – reviewing and editing. All authors contributed to the article and approved the submitted version.
This work is supported by a grant from the Top Talent Support Program for young and middle-aged people of Wuxi Health Committee [HB2020008]; the Scientific Research Project of Wuxi Health Committee [MS201927; Z201914]; the Scientific Research Project of Jiangsu Provincial Health Commission [LGY201801]; Jiangsu Province “333” project [BRA2020142]; Medical and Public Health Project of Wuxi Sci-Tech Development Fund [WX18 II AN047]; and Maternal and Child Health Research Project of Jiangsu Province [F202033].
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
We thank the Department of Nephrology of Wuxi People’s Hospital for collecting data in this study.
Allen, C. L., and Bayraktutan, U. (2009). Oxidative stress and its role in the pathogenesis of ischaemic stroke. Int. J. Stroke. 4, 461–470. doi: 10.1111/j.1747-4949.2009.00387.x
Asher, E., Abu-Much, A., Bragazzi, N. L., Younis, A., Younis, A., Masalha, E., et al. (2021). CHADS2 and CHA2DS2-VASc scores as predictors of platelet reactivity in acute coronary syndrome. J. Cardiol. 77, 375–379. doi: 10.1016/j.jjcc.2020.09.010
Chen, Y. C., Su, Y. C., Lee, C. C., Huang, Y. S., and Hwang, S. J. (2012). Chronic kidney disease itself is a causal risk factor for stroke beyond traditional cardiovascular risk factors: a Nationwide Cohort Study in Taiwan. PLoS One 7:e36332. doi: 10.1371/journal.pone.0036332
de Bie, M. K., Buiten, M. S., Rotmans, J. I., Hogenbirk, M., Schalij, M. J., Rabelink, T. J., et al. (2017). Abdominal aortic calcification on a plain X-Ray and the relation with significant coronary artery disease in asymptomatic chronic dialysis patients. BMC Nephrol. 18:82. doi: 10.1186/s12882-017-0480-2
Guo, X., Zhou, W., Shi, B., Wang, X., Du, A., Ding, Y., et al. (2021). An efficient multiple kernel support vector regression model for assessing dry weight of hemodialysis patients. Curr. Bioinform. 16, 284–293. doi: 10.2174/1574893615999200614172536
Hsu, P. C., Lee, W. H., Chen, S. C., Tsai, Y. C., Chen, Y. C., Chu, C. Y., et al. (2020). Using CHADS2 and CHA2DS2-VASc Scores for mortality prediction in patients with chronic kidney disease. Sci. Rep. 10:18942. doi: 10.1038/s41598-020-76098-y
Jha, V., Garcia-Garcia, G., Iseki, K., Naicker, S., Plattner, B., Saran, R., et al. (2013). Chronic kidney disease: global dimension and perspectives. Lancet 382, 260–272. doi: 10.1016/S0140-6736(13)60687-X
Jian, Y., Zhao, K., Xia, K., Xue, J., Xhou, L., Ding, Y., et al. (2019). A novel distributed multitask fuzzy clustering algorithm for automatic MR brain image segmentation. J. Med. Syst. 43:118. doi: 10.1007/s10916-019-1245-1
Jiang, Y., Gu, X., Wu, D., Hang, W., Xue, J., Qiu, S., et al. (2021a). A novel negative-transfer-resistant fuzzy clustering model with a shared cross-domain transfer latent space and its application to brain CT image segmentation. IEEE ACM Trans. Comput. Biol. Bioinform. 18, 40–52. doi: 10.1109/TCBB.2019.2963873
Jiang, Y., Zhang, Y., Lin, C., Wu, D., and Lin, C.-T. (2021b). EEG-Based Driver Drowsiness Estimation Using an Online Multi-View and Transfer TSK Fuzzy System. IEEE Trans. Intell. Transp. Syst. 22, 1752–1764. doi: 10.1109/TITS.2020.2973673
Kelly, D., and Rothwell, P. M. (2020). Disentangling the Multiple Links Between Renal Dysfunction and Cerebrovascular Disease. J. Neurol. Neurosurg. Psychiatry 91, 88–97. doi: 10.1136/jnnp-2019-320526
Lau, W. L., Huisa, B. N., and Fisher, M. (2017). The cerebrovascular-chronic kidney disease connection: perspectives and mechanisms. Transl. Stroke Res. 8, 67–76. doi: 10.1007/s12975-016-0499-x
Liu, X. B., Zhang, X. R., Guo, X. Y., Ding, Y., Shan, W., Wang, L., et al. (2021). A self-representation-based fuzzy SVM model for predicting vascular calcification of hemodialysis patients. Comput. Math Methods Med. 2021:2464821. doi: 10.1155/2021/2464821
Maraj, M., Kuśnierz-Cabala, B., Dumnicka, P., Gala-Błądzińska, A., Gawlik, K., Pawlica-Gosiewska, D., et al. (2018). Malnutrition, inflammation, atherosclerosis syndrome (MIA) and diet recommendations among end-stage renal disease patients treated with maintenance hemodialysis. Nutrients 10:69. doi: 10.3390/nu10010069
Moldovan, D., Moldovan, I., Rusu, C., Kacso, I., Patiu, I. M., and Gherman-Caprioara, M. (2014). FGF-23, vascular calcification, and cardiovascular diseases in chronic hemodialysis patients. Int. Urol. Nephrol. 46, 121–128. doi: 10.1007/s11255-013-0422-2
Qian, Y., Meng, H., Lu, W., Liao, Z., Ding, Y., and Wu, H. (2021). Identification of DNA-binding proteins via hypergraph based laplacian support vector machine. Curr. Bioinform. doi: 10.2174/1574893616666210806091922 [Epub ahead of print].
Toyoda, K., and Ninomiya, T. (2014). Stroke and cerebrovascular diseases in patients with chronic kidney disease. Lancet Neurol. 13, 823–833. doi: 10.1016/S1474-4422(14)70026-2
Toyoda, K., Yasaka, M., Uchiyama, S., Iwade, K., Koretsune, Y., Nagata, K., et al. (2014). CHADS2 and CHA2DS2-VASc scores as bleeding risk indices for patients with atrial fibrillation: the bleeding with antithrombotic therapy study. Hypertens. Res. 37, 463–466. doi: 10.1038/hr.2013.150
Vincent, P., and Bengio, Y. (2002). K-local hyperplane and convex distance nearest neighbor algorithms. Adv. Neural Inf. Process. Syst. 14, 985–992.
Wright, C. B., Dong, C., Stark, M., Silverberg, S., Rundek, T., Elkind, M. S., et al. (2014). Plasma FGF23 and the risk of stroke: the Northern Manhattan Study (NOMAS). Neurology 82, 1700–1706. doi: 10.1212/WNL.0000000000000410
Wright, C. B., Shah, N. H., Mendez, A. J., DeRosa, J. T., Yoshita, M., Elkind, M. S., et al. (2016). Fibroblast growth factor 23 is associated with subclinical cerebrovascular damage: the northern manhattan study. Stroke 47, 923–928. doi: 10.1161/STROKEAHA.115.012379
Yamada, S., Yamamoto, S., Fukuma, S., Nakano, T., Tsuruya, K., and Inaba, M. (2020). Geriatric nutritional risk index (GNRI) and creatinine index equally predict the risk of mortality in hemodialysis patients: J-DOPPS. Sci. Rep. 10:5756. doi: 10.1038/s41598-020-62720-6
Zhang, Y. P., Jian, Y., Qi, L., Bhuiyan, M. Z. A., and Quian, P. (2021a). Epilepsy diagnosis using multi-view & multi-medoid entropy-based clustering with privacy protection. ACM Trans. Internet Technol. 21, 1–20. doi: 10.1145/3404893
Zhang, Y. P., Wang, S. H., Xia, K. J., Jiang, Y. Z., and Qian, Q. J. (2021b). Alzheimer’s disease multiclass diagnosis via multimodal neuroimaging embedding feature selection and fusion. Inf. Fusion. 66, 170–183. doi: 10.1016/j.inffus.2020.09.002
Keywords: cerebrovascular disease, end-stage renal disease, local hyperplane, klotho, FGF23
Citation: Liu X, Zhang X, Zhang Y, Ding Y, Shan W, Huang Y, Wang L and Guo X (2021) Kernelized k-Local Hyperplane Distance Nearest-Neighbor Model for Predicting Cerebrovascular Disease in Patients With End-Stage Renal Disease. Front. Neurosci. 15:773208. doi: 10.3389/fnins.2021.773208
Received: 09 September 2021; Accepted: 04 October 2021;
Published: 25 October 2021.
Edited by:
Yuanpeng Zhang, Nantong University, China
Reviewed by:
Xiaoqing Gu, Changzhou University, China
Copyright © 2021 Liu, Zhang, Zhang, Ding, Shan, Huang, Wang and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Liang Wang; Xiaoyi Guo
†These authors share first authorship