- 1Department of Neurology, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
- 2Key Laboratory of Neurological Diseases, The Second Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
- 3Department of Neurosurgery, The Affiliated Hospital of Xuzhou Medical University, Xuzhou, Jiangsu, China
- 4Department of Orthopaedic Surgery, The First Affiliated Hospital of Nanchang University, Nanchang, China
- 5Department of Dermatology, Xianyang Central Hospital, Xianyang, China
- 6Faculty of Medicine, Macau University of Science and Technology, Taipa, Macao SAR, China
- 7The State Key Laboratory of Molecular Vaccinology and Molecular Diagnostics and Center for Molecular Imaging and Translational Medicine, School of Public Health, Xiamen University, Xiamen, China
Objective: To explore the predictors of death in acute ischemic stroke (AIS) patients within 1 year based on machine learning (ML) algorithms.
Methods: This study retrospectively analyzed the clinical data of patients hospitalized and diagnosed with AIS in the Second Affiliated Hospital of Xuzhou Medical University between August 2017 and July 2019. The patients were randomly divided into training and validation sets at a ratio of 7:3, and the clinical characteristic variables of the patients were screened using univariate and multivariate logistics regression. Six ML algorithms, including logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGB), random forest (RF), decision tree (DT), and naive Bayes classifier (NBC), were applied to develop models to predict death in AIS patients within 1 year. During training, a 10-fold cross-validation approach was used to validate the training set internally, and the models were interpreted using important ranking and the SHapley Additive exPlanations (SHAP) principle. The validation set was used to externally validate the models. Ultimately, the highest-performing model was selected to build a web-based calculator.
Results: Multivariate logistic regression analysis revealed that C-reactive protein (CRP), homocysteine (HCY) levels, stroke severity (SS), and the number of stroke lesions (NOS) were independent risk factors for death within 1 year in patients with AIS. The area under the curve value of the XGB model was 0.846, which was the highest among the six ML algorithms. Therefore, we built an ML network calculator (https://mlmedicine-de-stroke-de-stroke-m5pijk.streamlitapp.com/) based on XGB to predict death in AIS patients within 1 year.
Conclusions: The network calculator based on the XGB model developed in this study can help clinicians make more personalized and rational clinical decisions.
1. Introduction
Acute ischemic stroke (AIS) is a disease caused by the occlusion of cerebral arteries, accompanied by brain tissue infarction and neuronal cell damage, causing severe trauma to the body. AIS is the leading cause of disability in adults and the primary cause of human death worldwide (1, 2). In 2019, there were 7,630,800 cases of AIS globally, an 87.55% increase compared to the previous 30 years. The high morbidity, mortality, and disability rates associated with AIS impose a severe economic burden on society and families (3). Several factors may have a significant impact on the pathogenesis and prognosis of patients with AIS, including the immune inflammatory response during AIS development, with the involvement of different pathways and sources of activated inflammatory factors, and is an important regulator of stroke progression, post-stroke damage, cerebral function repair and death (4–6). Approximately 10% of AIS patients, representing a type of morbidity, experience a fatal event within 1 year (7). There is an urgent need to identify the early and effective predictors of death 1 year after the onset of AIS. The construction of a model of death prediction in stroke patients within 1 year could provide clinicians with a reliable tool to assess the condition of their patients. However, there are few reports in this area.
ML-assisted clinical decision-making and analysis have been widely used in clinical settings (8–11), especially in the screening phase of big data feature variables (12, 13). The superior performance demonstrated by ML algorithms in medical big data makes it possible to obtain better predictive tools than traditional statistical models under certain conditions. However, few studies have been conducted to screen the risk factors of death in AIS patients within 1 year using ML algorithms.
Therefore, this study aimed to develop and validate an interpretable ML model that used clinically relevant variables to predict death within 1 year in AIS patients and construct an easy-to-use web calculator as a convenient and practical protective tool for clinical practitioners to provide valid information for AIS patients.
2. Materials and methods
2.1. Subjects
Patients who were hospitalized in the Department of Neurology of the Second Affiliated Hospital of Xuzhou Medical University and diagnosed with AIS between August 2017 and July 2019 were retrospectively analyzed. A total of 677 patients with AIS were included in this study, 32 of whom died after admission and during follow-up. The study was approved by the Ethics Committee of the Second Affiliated Hospital of Xuzhou Medical University [ethics number: [2020] 081603], and all patients provided written informed consent.
2.2. Inclusion and exclusion criteria
The inclusion criteria were a diagnosis of AIS in accordance with the World Health Organization criteria, and the time between onset and hospital admission did not exceed 24 h. The exclusion criteria were: (1) incomplete clinical data, (2) those with severely abnormal organ function, (3) inadequate ancillary investigations, (4) follow-up of <1 year, and (5) Patients who discontinued treatment for various reasons according to their relatives.
2.3. Methods
2.3.1. Observational variables
In this study, clinical data were collected from the enrolled patients, including demographics (age and sex); vascular risk factors (hypertension, diabetes mellitus, and ischemic heart disease); baseline blood pressure [systolic blood pressure (SBP) and diastolic blood pressure (DBP)]; Trial of Org 10 172 in Acute Stroke Treatment (TOAST) [large-artery atherosclerosis, cardioembolism, small-vessel occlusion, acute stroke of other determined etiology, stroke of undetermined etiology]; stroke severity (SS) [defined as mild stroke according to the National Institutes of Health Stroke Scale (NIHSS) scores of ≤ 8, moderate-to-severe stroke according to NIHSS scores of ≥9; all assessments completed on admission]; magnetic resonance imaging (MRI) features [stroke distribution (SD; anterior circulation, posterior circulation, and anterior/posterior circulation), side of hemisphere (SOH; left, right, and bilateral), number of stroke lesions (NOSs; single and multiple stroke lesions), site of stroke lesions (SOSs; cortical, cortico-subcortical, subcortical, brainstem, and cerebellum)]; laboratory tests [total cholesterol, triglycerides, low-density lipoprotein (LDL), fasting blood glucose (FBG), homocysteine (HCY), uric acid (UA), fibrinogen (FIB), myoglobin (MB), C-reactive protein (CRP), D-dimer brain natriuretic peptide (BNP), HBALC, neuron-specific enolase (NSE), and S-100β levels], treatment regimen [intravenous thrombolysis, arterial thrombolysis, antiplatelet, anticoagulation, statin, and proton pump inhibitor therapy (PPI)]; and stroke comorbidities [dysphagia and stroke-associated pneumonia (SAP)].
2.3.2. Statistical methods
This study used R version 4.0.5 software for data processing and statistical analyses. Continuous variables are expressed as the median or interquartile range (IQR) while categorical variables are presented as frequencies (percentage, %). The continuous variables were compared by independent samples t-tests and the categorical variables were compared using χ2-tests. Understanding the relationship between the independent and dependent variables was clinically meaningful and P-values of <0.05 were considered statistically significant (two-sided).
2.3.3. Modeling of machine learning algorithms
Univariate and multivariate logistic regression analyses were used to assess the risk factors of death within 1 year in the training group study population. The odds ratio (OR) and 95% confidence interval (CI) were calculated, with an OR of > 1 indicating that the variable was a risk factor, and P < 0.05 considered to indicate a statistically significant difference. Then, the factors that were significant in both univariate and multivariate logistic regression were included and subjected to stepwise regression analysis. The factors selected by stepwise regression were used as input variables to construct ML models.
The ML algorithm process was based on Python (V3.7) software and the scikit-learn (version 0.24) library. First, the original dataset was randomly divided into training and test sets at a ratio of 7:3. Then, six machine algorithms [logistic regression (LR), gradient boosting machine (GBM), extreme gradient boosting (XGB), random forest (RF), decision tree (DT), and naive Bayes classifier (NBC)] were used to analyze the data and construct the model. To validate the predictive power of the model, the 10-fold cross-validation method was used for internal validation against the training group. The random search method was used to adjust the hyperparameters of the models.
In the test group, the area under the receiver operating characteristic curve (ROC-AUC), classification accuracy, recall, specificity, and F1 score were used to evaluate the prediction models. We also plotted the prediction recall curve (PRC) as a complementary metric to evaluate the model performance.
2.3.4. Interpretation of the model and importance of features
To illustrate the risk factors of death within 1 year in AIS patients, Shapley Additive explanation (SHAP) analysis was used to interpret the predictive models ranked in terms of feature importance. SHAP analysis is a tool proposed by Lloyd Shapley in game theory to explain the output of machine learning models. The core idea is to calculate the marginal contribution of a variable feature when it is added to the model, and then to interpret the global and local levels of the “black box model” in an additive explanatory model (14, 15). That is, it can assign predictive values to each feature and evaluate and visualize the contribution of each feature to the outcome of the machine learning model (16). Ultimately, a web-based calculator based on the best-performing model was created for inputting patient data to facilitate the clinicians' assessment of death within 1 year in AIS patients.
3. Results
3.1. Baseline patient data characteristics
In this study, clinical information was collected on 677 AIS patients, of whom 645 survived and 32 died of AIS (Table 1). In the observed population, 383 patients (56.6%) were aged <60 years and 294 patients (43.4%) were aged ≥ 60 years, 398 (58.8%) were male and 279 (41.2%) were female. AIS lesions occurred in 270 (39.9%) patients in the anterior circulation, 252 (37.2%) in the posterior circulation, and 155 (22.9%) in both anterior and posterior circulations. The distribution of lesions in the left and right hemispheres was approximately equal, with 283 (41.8%) in the left and 270 (39.9%) in the right, and a relatively small number [124 (18.3%)] in the bilateral cerebral hemispheres. The location of the lesions was mainly subcortical in 186 patients (27.5%), cortical and cortico-subcortical in 155 patients (22.9%), the brainstem in 104 patients (15.4%), and the cerebellum in 77 patients (11.4%). Four hundred and seventy patients (69.4%) had a single AIS lesion, while multiple lesions were found in only 207 patients (30.6%). Two hundred and four patients (30.1%) received intravenous thrombolytic therapy, and 473 (69.9%) did not. Six hundred and forty-four patients (95.1%) did not receive arterial thrombolytic therapy, and 33 (4.9%) did. Five hundred and fifty-five patients (82.0%) received antiplatelet therapy, and 122 (18.0%) did not. Five hundred and seventy-six patients (85.1%) did not receive anticoagulation therapy, and 101 (14.9%) did. The majority of the patients (574, 84.8%) received statin therapy and 103 (15.2%) did not.
The median systolic and diastolic blood pressure was 143 mmHg (IQR 132.0,156.0) and 87 mmHg (IQR 74.0, 97.0), respectively. Total cholesterol, triglycerides, HDL, blood HCY, blood UA, and median FIB, MB, ultrasensitive CRP, D-dimer BNP, atrial natriuretic peptide, NSE, and S-100β were 5.3 mmol/L [4.4, 6.2], 2.2 mmol/L [1.9, 2.4], 4.8 mmol/L [4.3, 4.9], 15.7 μmol/L [12.4, 19.1], 349.8 μmol/L [309.8, 408.1], 4. 3 g/L [4.0, 4.8], 97. 7 ng/mL [75.1, 147.8], 12. 2 mg/L [7.2, 18.1], 174.0 ng/mL [133.0, 221.0], 93.0 ng/mL [73.0, 162.0], 16.2 ng/mL [12.7, 18.6], and 275.0 ng/mL [224.0, 290.0], respectively.
3.2. Univariate and multivariate regression analysis of death within 1 year in AIS patients
In the univariate regression analysis of death within 1 year in AIS patients (Table 2), there was a statistically significant difference (P < 0.05) in the overall population for death within 1 year according to NOS, FBG, HBALC, MB, and CRP levels, as well as anticoagulation therapy, PPI treatment, and SS.
All parameters that were statistically different in the univariate analysis above were included in the multivariate logistic regression analysis. The results suggested that NOS (OR = 3.44, 95% CI: 1.41 – 8.36, P = 0.007), HCY (OR = 1.29, 95% CI: 1.16 – 1.45, P < 0.001), CRP (OR = 1.15, 95% CI: 1.07 – 1.23, P < 0.001), and SS (OR = 3.12, 95% CI: 1.03 – 9.83, P = 0.046) were independent predictors of death within 1 year in AIS patients.
3.3. Machine learning model building and validation
To compare the predictive performance of the six ML algorithm models, this study performed 10-fold cross-validation within the training group. The results are shown in Figure 1. Figure 2 shows the ROC curves of the predictive performance differences of the six ML algorithm models after external validation, and Figure 3 shows the result of radar plot analysis, which is a blanket, clear, intuitive, and easy-to-judge analysis and is suitable for comprehensive evaluation as it can show the AUC value, accuracy, recall, and F1 value of the models in multiple dimensions (Figure 3, Table 3) to more clearly reflect the performance of the models. The PRC curves of the mortality prediction model are shown in Supplementary Figure 1.
Figure 1. Ten-fold cross validation test. LR, logistic regression; NBC, Naive Bayesian classification; DT, Decision Tree; RF, Random Forest; GBM, gradient boosting machine; XGB, extreme gradient boosting.
The results suggest that the XGB model performed best in predicting death within 1 year in AIS patients after a comprehensive evaluation. The remaining models were ranked in descending order according to their predictive performance.
In summary, we finally adopted the XGB model as the preferred predictive model.
3.4. Relative importance of variables in ML algorithms
A SHAP interpretability study was used to analyze the results of the ML models. Generally, the higher the SHAP value of a feature, the higher the probability of the occurrence of the target event. In SHAP analysis, red represents the eigenvalues with positive impact on the model and blue represents the eigenvalues with negative impact on the model (17). The results of the study suggest that SS was the most important variable, followed by CRP, HCY, and NOS in descending order of importance, as shown in Figure 4.
3.5. The web calculator
A web-based calculator based on the XGB model was developed in this study. By entering the clinical characteristic variables of a patient with AIS, clinicians could predict their risk of death within 1 year (https://mlmedicine-de-stroke-de-stroke-m5pijk.streamlitapp.com/; Figure 5).
4. Discussion
In this study, we retrospectively analyzed the clinical data of AIS patients and developed a web-based calculator with ML algorithms to predict the risk of death within 1 year. The accuracy and rationality of the model were validated by 10-fold cross-validation, allowing the model to be used for clinical practice to help clinicians make more rational treatment decisions.
ML is an emerging field of medicine that has demonstrated an extraordinary ability to handle large, complicated, and disparate data, and is the future of biomedical research, personalized medicine, and computer-aided diagnosis. It holds the promise of significantly advancing global healthcare (18, 19). Unlike traditional predictive models, ML is very good at discovering complex structures in selected variables in high-dimensional data and can easily combine a large number of variables (20, 21). ML has been reported to improve the predictive accuracy of long-term prognoses for AIS patients (8, 10).
In this study, six ML methods were used to analyze and construct a model of death prediction within 1 year in AIS patients, and the performance of the six ML algorithms was compared to each other. The XGB algorithm performed best (Figure 1), with a better AUC value than the other five algorithmic models, and the highest accuracy, sensitivity, and F1 score. Therefore, the XGB algorithm model was finally chosen.
ML models are often considered to be a black box where is difficult to explain the predictive performance, and it becomes extremely important to study the interpretability of machine learning models. Therefore, this study attempted to introduce SHAP analysis, a new method for interpreting various black-box ML models that have been previously validated based on their interpretability performance. It can achieve both local and global interpretability and has a solid theoretical foundation compared to other methods (22). The SHAP analysis used in this study could interpret the model prediction results well, and its intuitive visualization is more easily accepted. This study further built a web-based calculator to estimate the probability of death within 1 year in AIS patients to make better use of the model.
AIS is characterized by a high morbidity rate, which increases the economic burden on society and families (23). It is significant to explore the factors influencing the risk of death within 1 year for patients. In this study, the mortality rate of AIS patients within 1 year was only 4.7% (32/677), which was significantly lower than the 10% reported in previous studies (7), probably because of the exclusion of those whose families discontinued treatment for various reasons. Previously, an 8-point scoring system was constructed to predict the risk of death within 7 days of hospitalization (24). Factors influencing death within 6 months of stroke onset were also reported, with variables such as the Barthel index and platelet/lymphocyte ratio screened by LASSO regression and multiple logistic regression (25). A 30-year stroke burden predictive model was established (26). In contrast, unlike many previous studies, this study innovatively used machine learning algorithms to screen variables and, to our knowledge, was the first to develop a predictive model using machine learning algorithms to assess the probability of death within 1 year in patients with AIS.
There is a growing body of research on the relationship between serum inflammatory biomarkers and AIS. A number of studies showed that AIS could induce an inflammatory response, which plays a major role in late ischemic damage to the brain parenchyma, and that inflammatory responses caused by various clinical factors could lead to an increase in inflammatory factors (27, 28). It is also an inflammatory factor that can indirectly indicate the presence of pathogenic microorganisms in patients when it is upraised, which can help the physician in the diagnosis and treatment. In this study, we concluded that CRP levels were the most important predictor of death within 1 year in AIS patients. Elevated CRP levels were previously reported to reflect the severity of AIS, correlate with stroke subtype and risk stratification (27, 28), and be an independent predictor of long-term mortality after ischemic stroke (29). Elevated CRP levels can lead to increased mortality after stroke, which may be related to inflammation-induced endothelial cell dysfunction and platelet activation (30). HCY is a sulfur-containing non-essential amino acid produced by metabolism in vivo as a derivative of methionine cycle demethylation. It is also an inflammatory substance that induces the activation of nuclear factor (NF)-kB, which is a transcription factor common to inflammation and the immune response. Elevated levels of HCY are associated with a variety of diseases, which may lead to endothelial dysfunction, neurotoxicity, and the upregulation of thrombogenic factors. At the same time, monitoring HCY levels may provide a good indication of the development of related diseases (31). Previous studies also showed that elevated HCY levels were associated with AIS dysfunction and recurrent stroke (32). A multicenter study suggested that high levels of serum HCY were an independent predictor of early neurological deterioration in AIS patients (33). This study concluded that HCY levels significantly influenced the risk of death within 1 year in AIS patients. The risk of death in patients with high HCY having 1.29 times (95% CI 1.16 – 1.45) compared to ones with normal HCY.
The NIHSS is a common scale used in neurology as a quantitative indicator of disease severity (34). The present study classified SS with the help of the NIHSS scale, and an NIHSS score of ≥9 was defined as moderate-to-severe stroke. Fischer et al. (35) suggested that patients with low NIHSS scores tended to have a better prognosis, which is consistent with the current study. The present study concluded that SS had a significant influence on death within 1 year in AIS patients. The risk of death in patients with moderate to severe stroke having 3.12 times (95% CI 1.03 – 9.83) higher than those with mild stroke.
Neurological deficits have been associated with lesions in different brain regions (36, 37), but the relationship between the number of lesions and AIS has rarely been reported. In this study, the number of lesions was innovatively included in the analysis, and the results suggested that the number of lesions was a significant factor in death within 1 year in AIS patients. The risk of death in patients with multiple lesions was 3.44 times (95% CI: 1.41 – 8.36) higher than patients with a single lesion.
There were several limitations to this study. First, the retrospective study design may have introduced selection bias, while the data imbalance that emerged from real-world studies resulted in PRC effects without the AUC number. Secondly, although our model showed good performance, its data source was limited to one medical center, which may limit its generalizability, and we will follow up with an additional multicenter study. Thirdly, further independent external validation is needed to confirm these findings. Finally, we collected AIS-related variables as comprehensively as possible, but there were still some important variables that were not available in a timely manner, which may also limit the generalizability of the study. Future research is needed to examine this issue further.
5. Conclusion
The results of this study suggest that serum inflammatory markers (CRP and HCY), SS, and NOS are independent risk factors of death within 1 year in AIS patients. The XGB algorithm showed good performance as a tool to predict death within 1 year in AIS patients. Using this web-based calculator can effectively prevent death, reduce mortality, and assist physicians in making treatment decisions.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The study was written approved by the Ethics Committee of the Second Affiliated Hospital of Xuzhou Medical University [ethics number: [2020] 081603]. The patients/participants provided their written informed consent to participate in this study.
Author contributions
WLi, LR, and XW completed the study design. KW, WLiu, and WLi performed the study and collected and analyzed the data. LG and WLi drafted the manuscript. LR, XW, KW, and HL provided the expert consultations and suggestions. CX and CY conceived of the study, participated in its design and coordination, and helped to embellish language. All authors reviewed the final version of the manuscript.
Funding
This study was supported by Scientific Research Project of Jiangsu Health Committee (No. H2019054), the Xuzhou Science and Technology Planning Project (No. KC21220), Development Fund of Affiliated Hospital of Xuzhou Medical University (No. XYFY2020013), and Shaanxi Provincial Health and Health Research Fund Project (2022E006).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2023.1092534/full#supplementary-material
Supplementary Figure 1. Precision recall curve (PRC) for machine learning algorithm.
References
1. Ding Q, Liu S, Yao Y, Liu H, Cai T, Han L. Global, regional, and national burden of ischemic stroke, 1990–2019. Neurology. (2022) 98:e279–90. doi: 10.1212/WNL.0000000000013115
2. Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet. (2020) 396:1204–22. doi: 10.1016/S0140-6736(20)30925-9
3. Mendelson SJ, Prabhakaran S. Diagnosis and management of transient ischemic attack and acute ischemic stroke: A review. J Am Med Assoc. (2021) 325:1088–98. doi: 10.1001/jama.2020.26867
4. Jayaraj RL, Azimullah S, Beiram R, Jalal FY, Rosenberg GA. Neuroinflammation: friend and foe for ischemic stroke. J Neuroinflammation. (2019) 16:142. doi: 10.1186/s12974-019-1516-2
5. Lasek-Bal A, Jedrzejowska-Szypulka H, Student S, Warsz-Wianecka A, Zareba K, Puz P, et al. The importance of selected markers of inflammation and blood-brain barrier damage for short-term ischemic stroke prognosis. J Physiol Pharmacol. (2019) 70:503953. doi: 10.1101/503953
6. Xu S, Lu J, Shao A, Zhang JH, Zhang J. Glial cells: Role of the immune response in ischemic stroke. Front Immunol. (2020) 11:294. doi: 10.3389/fimmu.2020.00294
7. Morotti A, Poli L, Costa P. Acute stroke. Semin Neurol. (2019) 39:61–72. doi: 10.1055/s-0038-1676992
8. Heo J, Yoon JG, Park H, Kim YD, Nam HS, Heo JH. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. (2019) 50:1263–5. doi: 10.1161/STROKEAHA.118.024293
9. Chae SH, Kim Y, Lee KS, Park HS. Development and clinical evaluation of a web-based upper limb home rehabilitation system using a smartwatch and machine learning model for chronic stroke survivors: Prospective comparative study. JMIR mHealth uHealth. (2020) 8:e17216. doi: 10.2196/17216
10. Castaneda-Vega S, Katiyar P, Russo F, Patzwaldt K, Schnabel L, Mathes S, et al. Machine learning identifies stroke features between species. Theranostics. (2021) 11:3017–34. doi: 10.7150/thno.51887
11. Bonkhoff A, Hong S, Bretzner M, Schirmer M, Regenhardt R, Arsava E, et al. Association of stroke lesion pattern and white matter hyperintensity burden with stroke severity and outcome. Neurology. (2022) 99:e1364–79. doi: 10.1212/WNL.0000000000200926
12. Choi RY, Coyner AS, Kalpathy-Cramer J, Chiang MF, Campbell JP. Introduction to machine learning, neural networks, and deep learning. Transl Vis Sci Technol. (2020) 9:14. doi: 10.1167/tvst.9.2.14
13. Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. (2022) 23:40–55. doi: 10.1038/s41580-021-00407-0
14. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. Adv Neural Informat Process Syst. (2017) 2017:4765–74. doi: 10.48550/arXiv.1705.07874
15. Chen H, Lundberg SM, Lee SI. Explaining a series of models by propagating Shapley values. Nat Commun. (2022) 13:4512. doi: 10.1038/s41467-022-31384-3
16. Nohara Y, Matsumoto K, Soejima H, Nakashima N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput Methods Programs Biomed. (2022) 214:106584. doi: 10.1016/j.cmpb.2021.106584
17. Tseng PY, Chen YT, Wang CH, Chiu KM, Peng YS, Hsu SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. (2020) 24:478. doi: 10.1186/s13054-020-03179-9
18. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. eDoctor: Machine learning and the future of medicine. J Intern Med. (2018) 284:603–19. doi: 10.1111/joim.12822
19. Orchard P, Agakova A, Pinnock H, Burton CD, Sarran C, Agakov F, et al. Improving prediction of risk of hospital admission in chronic obstructive pulmonary disease: Application of machine learning to telemonitoring data. J Med Internet Res. (2018) 20:e263. doi: 10.2196/jmir.9227
21. Shek A, Jiang Z, Teo J, Au Yeung J, Bhalla A, Richardson MP, et al. Machine learning-enabled multitrust audit of stroke comorbidities using natural language processing. Eur J Neurol. (2021) 28:4090–7. doi: 10.1111/ene.15071
22. Ou C, Liu J, Qian Y, Chong W, Zhang X, Liu W, et al. Rupture risk assessment for cerebral aneurysm using interpretable machine learning on multidimensional data. Front Neurol. (2020) 11:570181. doi: 10.3389/fneur.2020.570181
23. Herpich F, Rincon F. Management of acute ischemic stroke. Crit Care Med. (2020) 48:1654–63. doi: 10.1097/CCM.0000000000004597
24. Myint PK, Clark AB, Kwok CS, Davis J, Durairaj R, Dixit AK, et al. The SOAR (Stroke subtype, Oxford Community Stroke Project classification, Age, prestroke modified Rankin) score strongly predicts early outcomes in acute stroke. Int J Stroke. (2014) 9:278–83. doi: 10.1111/ijs.12088
25. Sha L, Xu T, Ge X, Shi L, Zhang J, Guo H. Predictors of death within 6 months of stroke onset: A model with Barthel index, platelet/lymphocyte ratio and serum albumin. Nurs Open. (2021) 8:1380–92. doi: 10.1002/nop2.754
26. Wafa HA, Wolfe CDA, Emmett E, Roth GA, Johnson CO, Wang Y. Burden of stroke in Europe: Thirty-year projections of incidence, prevalence, deaths, and disability-adjusted life years. Stroke. (2020) 51:2418–27. doi: 10.1161/STROKEAHA.120.029606
27. Luo Y, Wang Z, Li J, Xu Y. Serum CRP concentrations and severity of ischemic stroke subtypes. Can J Neurol Sci. (2012) 39:69–73. doi: 10.1017/S0317167100012713
28. Hu Z, Lai J, Chen L, Yi Y, Li R, Liao W. Can baseline C-reactive protein level predict functional outcome in acute ischaemic stroke? A meta-analysis. Biomarkers. (2020) 25:525–32. doi: 10.1080/1354750X.2020.1817982
29. Idicula TT, Brogger J, Naess H, Waje-Andreassen U, Thomassen L. Admission C-reactive protein after acute ischemic stroke is associated with stroke severity and mortality: The “Bergen stroke study”. BMC Neurol. (2009) 9:18. doi: 10.1186/1471-2377-9-18
30. Shantikumar S, Grant PJ, Catto AJ, Bamford JM, Carter AM. Elevated C-reactive protein and long-term mortality after ischaemic stroke: Relationship with markers of endothelial cell and platelet activation. Stroke. (2009) 40:977–9. doi: 10.1161/STROKEAHA.108.525105
31. Luo Y, Jin H, Guo ZN, Zhang P, Zhang LY, Chen J, et al. Effect of hyperhomocysteinemia on clinical outcome and hemorrhagic transformation after thrombolysis in ischemic stroke patients. Front Neurol. (2019) 10:592. doi: 10.3389/fneur.2019.00592
32. He Y, Li Y, Chen Y, Feng L, Nie Z. Homocysteine level and risk of different stroke types: A meta-analysis of prospective observational studies. Nutr Metabol Cardiovasc Dis. (2014) 24:1158–65. doi: 10.1016/j.numecd.2014.05.011
33. Kwon HM, Lee YS, Bae HJ, Kang DW. Homocysteine as a predictor of early neurological deterioration in acute ischemic stroke. Stroke. (2014) 45:871–3. doi: 10.1161/STROKEAHA.113.004099
34. Garavelli F, Ghelfi AM, Kilstein JG. Usefulness of NIHSS score as a predictor of non-neurological in-hospital complications in stroke. Med Clin. (2021) 157:434–7. doi: 10.1016/j.medcli.2020.07.034
35. Fischer U, Baumgartner A, Arnold M, Nedeltchev K, Gralla J, De Marchis GM, et al. What is a minor stroke? Stroke. (2010) 41:661–6. doi: 10.1161/STROKEAHA.109.572883
36. Fröhlich K, Macha K, Gerner ST, Bobinger T, Schmidt M, Dörfler A, et al. Angioedema in stroke patients with thrombolysis. Stroke. (2019) 50:1682–7. doi: 10.1161/STROKEAHA.119.025260
Keywords: ischemic stroke, biomarkers, machine learning, prediction model, web calculator
Citation: Wang K, Gu L, Liu W, Xu C, Yin C, Liu H, Rong L, Li W and Wei X (2023) The predictors of death within 1 year in acute ischemic stroke patients based on machine learning. Front. Neurol. 14:1092534. doi: 10.3389/fneur.2023.1092534
Received: 20 November 2022; Accepted: 02 February 2023;
Published: 23 February 2023.
Edited by:
Daniel Donoho, Children's National Hospital, United StatesReviewed by:
Kais Gadhoumi, Duke University, United StatesXiaohuan Xia, Tongji Hospital Affiliated to Tongji University, China
Copyright © 2023 Wang, Gu, Liu, Xu, Yin, Liu, Rong, Li and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Wenle Li, ZHJsZWUwOTEwJiN4MDAwNDA7MTYzLmNvbQ==; Liangqun Rong, cm9uZ2xpYW5ncXVuJiN4MDAwNDA7MTYzLmNvbQ==; Xiu'e Wei, d3hlcXEmI3gwMDA0MDsxNjMuY29t
†These authors have contributed equally to this work
‡ORCID: Wenle Li orcid.org/0000-0002-2933-646X