Skip to main content

ORIGINAL RESEARCH article

Front. Neurol., 12 June 2024
Sec. Neurological Biomarkers

Machine learning is an effective method to predict the 3-month prognosis of patients with acute ischemic stroke

Qing Huang&#x;Qing Huang1Guang-Li Shou&#x;Guang-Li Shou2Bo ShiBo Shi3Meng-Lei LiMeng-Lei Li4Sai ZhangSai Zhang3Mei HanMei Han1Fu-Yong Hu
Fu-Yong Hu1*
  • 1School of Public Health, Bengbu Medical University, Bengbu, Anhui, China
  • 2Department of Neurology, The Second Affiliated Hospital, Bengbu Medical University, Anhui, China
  • 3School of Medical Imaging, Bengbu Medical University, Anhui, China
  • 4Department of Emergency Medicine, The Second Affiliated Hospital, Bengbu Medical University, Anhui, China

Background and objectives: Upwards of 50% of acute ischemic stroke (AIS) survivors endure varying degrees of disability, with a recurrence rate of 17.7%. Thus, the prediction of outcomes in AIS may be useful for treatment decisions. This study aimed to determine the applicability of a machine learning approach for forecasting early outcomes in AIS patients.

Methods: A total of 659 patients with new-onset AIS admitted to the Department of Neurology of both the First and Second Affiliated Hospitals of Bengbu Medical University from January 2020 to October 2022 included in the study. The patient’ demographic information, medical history, Trial of Org 10,172 in Acute Stroke Treatment (TOAST), National Institute of Health Stroke Scale (NIHSS) and laboratory indicators at 24 h of admission data were collected. The Modified Rankine Scale (mRS) was used to assess the 3-mouth outcome of participants’ prognosis. We constructed nine machine learning models based on 18 parameters and compared their accuracies for outcome variables.

Results: Feature selection through the Least Absolute Shrinkage and Selection Operator cross-validation (Lasso CV) method identified the most critical predictors for early prognosis in AIS patients as white blood cell (WBC), homocysteine (HCY), D-Dimer, baseline NIHSS, fibrinogen degradation product (FDP), and glucose (GLU). Among the nine machine learning models evaluated, the Random Forest model exhibited superior performance in the test set, achieving an Area Under the Curve (AUC) of 0.852, an accuracy rate of 0.818, a sensitivity of 0.654, a specificity of 0.945, and a recall rate of 0.900.

Conclusion: These findings indicate that RF models utilizing general clinical and laboratory data from the initial 24 h of admission can effectively predict the early prognosis of AIS patients.

Introduction

The 2019 Global Burden of Disease Study findings reveal that stroke continues to be the second leading cause of death and ranks third in causing disability worldwide (1). Since 2015, stroke has emerged as the preeminent cause of death and disability in China, significantly impacting the nation’s public health and exerting substantial pressure on its healthcare infrastructure (2). Acute ischemic stroke (AIS), constituting over 80% of stroke cases (3), remains the predominant type. Statistics indicate that upwards of 50% of stroke survivors endure varying degrees of disability, with a recurrence rate of 17.7% (4), thereby placing a considerable strain on both families and society at large. Consequently, the early detection and prognosis of individuals at elevated risk of AIS, coupled with prompt intervention and treatment, hold critical importance in enhancing the quality of life for AIS patients.

Presently, studies on clinical prediction of ischemic stroke outcomes predominantly employ established scales or conventional models, such as the Essen stroke risk score, RRE-90 score, Logistic regression, and Cox regression (57). These tools, however, are limited by their inclusion of only a select few traditional risk factors and an inability to capture complex nonlinear interactions among various prognostic elements. In contrast, Machine Learning (ML)—a field exploring the simulation of human learning processes by computer algorithms (8)—holds distinct advantages. ML eschews the prerequisite assumptions about variable relationships and model structures inherent to traditional statistics and can adeptly manage datasets with incomplete entries. Additionally, ML models offer explanatory insights that surpass classical statistical methods (9). With the advent of big data analytics in healthcare research, machine learning is integrating increasingly vast data arrays to more accurately reflect human physiological complexity and the unpredictable aspects of disease traits. This progression heralds an innovative trajectory for disease diagnosis, prognosis, and risk evaluation, offering substantial practical application potential (1012).

In this study, we develop and validate a predictive model for the early prognosis of patients experiencing their first AIS using relevant medical data. We employ various ML algorithms to assess the efficacy of the model and compare the predictive utility of each to determine the optimal mathematical model for forecasting early outcomes in AIS patients. The goal is to facilitate the identification of high-risk individuals with potentially poor prognoses at an early stage, thereby providing a scientific basis for selecting appropriate clinical treatment strategies.

Objects and methods

Subjects

Our study encompassed 659 patients who experienced their first AIS and were admitted to the Department of Neurology at both the First and Second Affiliated Hospitals of Bengbu Medical University from January 2020 to October 2022. The inclusion criteria mandated that (1) the disease presented within 2 weeks of onset, and (2) all participants must conform to the ischemic stroke diagnostic guidelines as revised at the Fourth National Academic Conference on Cerebrovascular Disease, with this incident being their initial occurrence confirmed via CT scan. The exclusion criteria excluded patients with (1) incomplete medical records, (2) CT or MRI evidence of intracranial hemorrhage, expansive infarctions across multiple lobes, tumors, or vascular malformations, (3) severe concurrent conditions involving the cardiac, hepatic, or renal systems, along with malignancies, (4) a history of significant cranial trauma or neurosurgery, and (5) coexisting autoimmune diseases, hematologic disorders, or severe infectious diseases. Ethical endorsement for this research adhered to the Declaration of Helsinki and was secured from the Medical Ethics Committee of Bengbu Medical University. Informed consent was obtained from the patients for this study.

Data collection

Data on patients were gathered via an electronic case system, comprising (1) demographic information such as age and gender; (2) medical history, including high blood pressure, type II diabetes, coronary heart disease, atrial fibrillation; (3) TOAST etiologic classification of stroke; and (4) scores on the National Institute of Health Stroke Scale (NIHSS) (13) upon admission. Additionally, laboratory test results within 24 h of admission featured routine blood parameters like white blood cell (WBC), platelet (PLT), neutrophil (Neu), lymphocyte (Lym), monocyte (Mon), red blood cell (RBC) counts, and hemoglobin (HGB) concentration. Coagulation markers assessed were prothrombin time (PT), PT ratio (PTR), international normalized ratio (INR), activated partial thromboplastin time (APTT), clotting time (TT), fibrinogen (FIB), D-dimer, fibrinogen degradation product (FDP), and others. The analysis also included total cholesterol (CHOL), triacylglycerol (TG), low-density lipoprotein (LDL), high-density lipoprotein (HDL), uric acid (UA), C-reactive protein (CRP), homocysteine (HCY), glucose (GLU), and essential thyroid indices such as thyroid-stimulating hormone (TSH), free tri-iodothyronine (FT3), free thyroxine (FT4), total tri-iodothyronine (T3), and total thyroxine (T4). Calculations for the Neutrophil-to-lymphocyte ratio (NLR), Platelet-to-lymphocyte ratio (PLR), and Monocyte-to-lymphocyte ratio (MLR) were also included, defined, respectively, as Neu/Lym, PLT/Lym, and Mon/Lym ratios.

Prognostic assessment

Prognostic information on participants was collected via telephone or during outpatient follow-up 3-months post-treatment, employing the Modified Rankin Scale (mRS) to assess outcomes. The mRS scores range from 0 to 6, with 0 signifying no symptoms and 6 denoting death. Scores are detailed as follows: (1) reflects mild symptoms without significant disability, allowing independence in daily tasks; (2) represents slight disability, hindering the performance of previous tasks yet permitting self-managed daily activities; (3–5) indicate escalating levels of disability, requiring assistance to varying degrees, with 5 specifically denoting severe disability, paralysis, incontinence, and the need for constant care. A good prognosis is defined by an mRS score of ≤2, while scores >2 signal a poor prognosis (14).

Machine learning modeling

Random forests were initially employed to impute missing values, after which the data were split into a training set and a test set in a 7:3 ratio. The training set was utilized for parameter calculation and model construction, whereas the test set was used to assess prediction accuracy. Feature selection within the training set was conducted using the Least Absolute Shrinkage and Selection Operator cross-validation (Lasso CV) method, enabling the inclusion of feature variables with non-zero coefficients into various algorithms—eXtreme Gradient Boosting (XGB), Logistic Regression (LR), LGBM Classifier, Random Forest Classifier (RF), Ada Boost Classifier (AdaBoost), Decision Tree Classifier (DT), Gradient Boosting Classifier (GBDT), Multi-layer Perceptron Classifier (MLP), and Support Vector Machine (SVM)—to develop prediction models for the early prognosis of AIS patients. Model parameters were refined through 10-fold cross-validation, enhancing model optimization for performance evaluation on the training set. The most effective model was identified, and its classification efficacy on the test set was assessed using metrics including Area Under the Curve (AUC), accuracy, sensitivity, specificity, and recall.

Data filling was conducted utilizing the “missForest ()” function within the “missForest” package of R software version 4.2.3. Feature filtering was performed using the Python machine learning library “scikit-learn” version 1.1.3. For the modeling, we used the “XGBoost” (15) (version 2.0.1), “lightgbm” (16) (version 3.2.1) and “scikit-learn” (17) (version 1.1.3) Python Packages. All development and validation of machine learning models were carried out using Python version 3.11.4.

Statistical analysis

For measures that followed a normal distribution, information was presented as mean ± standard deviation (x̅ ± s), while for those that did not follow a normal distribution, information was presented as the median (interquartile range, IQR). Count data were expressed as frequency and percentage (%). The chi-square (χ2) test was utilized to compare count data between groups, whereas the t-test or Mann–Whitney U test was applied for measured data comparisons. All statistical analyses were conducted using SPSS version 19.0, with a significance level set at α = 0.05.

Results

General conditions of the study population

This study encompassed 659 patients experiencing their first-ever AIS (AIS), comprising 370 males and 289 females, with an average age of 68.34 ± 12.36 years. Three-months post-treatment, 507 patients displayed a favorable prognosis (modified Rankin Scale, mRS ≤ 2), while 152 patients fell into the poor prognosis category (mRS > 2), marking an early poor prognosis incidence of 23.07%. Table 1 presents the detailed clinical data of the subjects categorized into these two groups.

Table 1
www.frontiersin.org

Table 1. Clinical feature comparison between the two outcome groups.

Feature screening

In the training dataset, the Lasso CV identified 18 features with non-zero coefficients: NIHSS, APTT, TT, D-Dimer, FDP, GLU, UA, HCY, CRP, WBC, NEU, RBC, HGB, PLT, PLR, MLR, TSH, and FT4, optimizing the regularization parameter λ to 0.081. The absolute values of the coefficients were sorted in descending order, revealing WBC, HCY, D-Dimer, NIHSS, FDP, and GLU as the most significant variables. The coefficients of these features are visualized in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Coefficients in the Lasso model.

Diagnostic performance of the machine learning models

Nine machine learning models were developed using 18 previously identified features, with their performance illustrated in Figure 2, which displays the AUC with 95% Confidence Interval (CI) from 10-fold cross-validation on the validation set. Notably, the RF model demonstrated the highest AUC (0.876 with a 95% CI of 0.754–0.991). Calibration curves (Figure 3) revealed that the RF model exhibited the most accurate alignment between predicted and actual probabilities in assessing the early prognosis of AIS patients, achieving a Brier score of 0.113. This RF model, having shown promising results, underwent further evaluation on a test set, achieving an AUC of 0.852, an accuracy rate of 0.818, a sensitivity of 0.654, a specificity of 0.945, a recall rate of 0.900, and an F1-Score of 0.757. The Receiver Operating Characteristic (ROC) curves for the RF model across training, validation, and test sets are depicted in Figures 4AC. Additionally, Figure 4D presents the decision curves for the RF model on the test set, highlighting its substantial net clinical benefit across a range of critical risk thresholds. Comprehensive performance metrics of the RF model, including AUC, accuracy, sensitivity, specificity, and recall for training, validation, and test sets, are detailed in Table 2.

Figure 2
www.frontiersin.org

Figure 2. Receiver operating characteristic curve (ROC) for 9 ML models on the validation set.

Figure 3
www.frontiersin.org

Figure 3. Calibration curves for the 9 ML models on the validation set. The dotted line represents the perfect calibration curve, i.e., the predicted probability matches the true probability perfectly. The numbers in the legend represent the Brier scores of the ML models; the smaller the Brier score, the closer the predicted probability of the ML model is to the true probability.

Figure 4
www.frontiersin.org

Figure 4. Receiver operating characteristic (ROC) curves of the RF model on the training set (A), validation set (B), test set (C), and the decision curve on the test set (D).

Table 2
www.frontiersin.org

Table 2. Predictive metrics of RF model on training set, validation set and test set.

Discussion

In this study, we developed several machine learning models to predict the early prognosis of AIS patients using general clinical and laboratory data collected within 24 h of admission. Feature selection through the Lasso CV method identified the most critical predictors for early prognosis in AIS patients as WBC, HCY, D-Dimer, baseline NIHSS, FDP, and GLU. Among the nine machine learning models evaluated, the Random Forest model exhibited superior performance in the test set, achieving an AUC of 0.852, an accuracy rate of 0.818, a sensitivity of 0.654, a specificity of 0.945, and a recall rate of 0.900. These findings indicate that RF models utilizing general clinical and laboratory data from the initial 24 h of admission can effectively predict the early prognosis of AIS patients.

ML predictive models leverage algorithms to analyze data and forecast future occurrences or trends. These models have the capability to discern patterns in vast, complex datasets and integrate these patterns in a non-linear and highly interactive fashion (18). The application of machine learning in the medical domain, particularly in identifying risk factors and developing prognostic models for AIS patients, has seen significant growth in recent years, with notable contributions from researchers like Veerbeek et al. (19, 20), Xie et al. (12), Lin et al. (21), Ramos et al. (10), Su et al. (11), and Chen M et al. (22). These studies primarily derive their data from hospital records and publicly accessible databases. For instance, Lin et al. (21) assessed the capabilities of various ML models (SVM, RF, ANN, HANN) in predicting the 90-day mRS outcomes for stroke patients within the Taiwan Stroke Registry. Similarly, Su et al. (11) investigated the predictive performance of 4 ML models (SVM, RF, LGBM, DNN) in forecasting mRS scores and hospitalization outcomes at discharge among 2,780 AIS patients registered in the Chang Gung Healthcare System. Public databases offer vast data volume, broad coverage, and access ease, beneficial for swiftly compiling data required for predictive modeling. Nonetheless, ensuring the quality of data from these sources presents challenges, including risks of inconsistent patient assessment and treatment, incomplete records, and lost follow-up data (22). Predominantly, hospital data-driven studies incorporate imaging, demographic, and clinical information. For instance, Xie et al. (12) utilized imaging data and NIHSS scores to develop prediction models, while Ramos et al. (10) employed imaging and clinical baseline data for AIS prognosis. Common clinical assessments, like routine blood, coagulation, and biochemical tests, serve as conventional, accessible, and cost-effective diagnostics for AIS patients. Yet, the construction of an ML predictive model encompassing demographic, clinical, and 24-h admission blood test data for forecasting 3-month AIS patient outcomes lacks representation in the Chinese population. This study aims to fill this research void.

Previous research primarily focused on the prognostic impact of age (19, 20), atrial fibrillation (4, 23, 24), NIHSS scores at admission (2527), D-dimer (28, 29), HCY (3032), and GLU (3335) on AIS outcomes. In the current study, we employed Lasso CV for feature selection, identifying WBC, HCY, D-dimer, NIHSS, FDP, and GLU as critical variables for developing prognostic models for AIS patients. Notably, WBC and FDP, indicators of the body’s inflammatory response and the fibrinolytic system’s activity, respectively, have been under-explored in AIS prognostication. While WBC has been associated with stroke prognosis and an increased risk of disability at discharge following every 1 × 109/L increment at stroke onset (36), its independent predictive value remains unclear, potentially due to limited sample sizes in previous studies. Similarly, FDP and D-dimer serve as markers for hypercoagulability and hyper-fibrinolysis, pivotal for early diagnosis, monitoring treatment efficacy, and prognosticating thrombogenic conditions. Our findings underscore the importance of including D-dimer and FDP in early prognostic assessments for AIS patients, highlighting their significant role in predicting outcomes.

In this study, we utilized nine machine learning models for analysis, based on selected variables. Of these, the RF model demonstrated superior performance on the test set, achieving an AUC of 0.876 (95% CI, 0.754–0.991). This model provided a significantly better prediction capability. The RF algorithm, known for its robust noise resistance and low susceptibility to overfitting, has found extensive application in healthcare (37). Moreover, several studies employing the RF model for predicting stroke patient outcomes have affirmed its effective predictive power (10, 38, 39).

The current study is constrained by several limitations. Primarily, the modest sample size, despite being derived from two different centers, necessitates a larger dataset for robust validation. Also, this study lacked external validation. In the future, we will conduct a validation study with an external cohort. Moreover, while the study encompasses demographic information, past medical history, blood counts within 24 h of admission, and biochemical assays of AIS patients, it lacks imaging data pertaining to the location and extent of cerebral infarctions. The absence of these imaging details could potentially impact the predictive accuracy of the model. Finally, treatment decision-making is a complex process involving multiple considerations. Our model is just one of many aids whose results should be considered along with clinical judgment and other patient-specific factors, and is intended to provide additional information to support the decision-making process, not to replace professional medical judgment.

Conclusion

In conclusion, our study illustrates that utilizing demographic data, past medical histories, and 24-h laboratory information upon admission for ML modeling is a feasible approach to predicting the short-term prognosis of patients experiencing their first AIS. Notably, it reveals that the RF model could serve as an effective predictive tool. This finding is crucial for the early identification and prediction of individuals at elevated risk of developing AIS with an unfavorable prognosis. It underscores the importance of prompt and early intervention and treatment to enhance the quality of life for AIS patients.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Medical Ethics Committee of Bengbu Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

QH: Writing – original draft, Methodology. G-LS: Data curation, Investigation, Writing – original draft. BS: Formal analysis, Methodology, Writing – review & editing. M-LL: Data curation, Investigation, Writing – review & editing. SZ: Methodology, Writing – original draft. MH: Methodology, Writing – review & editing. F-YH: Writing – original draft.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Natural Science Key Project of the Anhui Provincial Education Department (no. 2023AH051915 and no. 2022AH051533) and the Key Project of Natural Science Foundation of Bengbu Medical University (2021byzd035).

Acknowledgments

We gratefully acknowledge the developers and maintainers of R and Python software for their contributions.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2024.1407152/full#supplementary-material

References

1. Feigin, VL, Stark, BA, Johnson, CO, Roth, GA, Bisignano, C, Abady, GG, et al. Global, regional, and national burden of stroke and its risk factors, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet Neurol. (2021) 20:795–820. English. doi: 10.1016/S1474-4422(21)00252-0

PubMed Abstract | Crossref Full Text | Google Scholar

2. Tu, WJ, and Wang, LD. China stroke surveillance report 2021. Mil Med Res. (2023) 10:33. doi: 10.1186/s40779-023-00463-x

Crossref Full Text | Google Scholar

3. Wang, W, Jiang, B, Sun, H, Ru, X, Sun, D, Wang, L, et al. Prevalence, incidence, and mortality of stroke in China: results from a Nationwide population-based survey of 480 687 adults. Circulation. (2017) 135:759–71. doi: 10.1161/CIRCULATIONAHA.116.025250

PubMed Abstract | Crossref Full Text | Google Scholar

4. Kang, K, Park, TH, Kim, N, Jang, MU, Park, S-S, Park, J-M, et al. Recurrent stroke, myocardial infarction, and major vascular events during the first year after acute ischemic stroke: the multicenter prospective observational study about recurrence and its determinants after acute ischemic stroke I. J Stroke Cerebrovasc Dis. (2016) 25:656–64. doi: 10.1016/j.jstrokecerebrovasdis.2015.11.036

Crossref Full Text | Google Scholar

5. Drozdowska, BA, Singh, S, and Quinn, TJ. Thinking about the future: a review of prognostic scales used in acute stroke. Front Neurol. (2019) 10:274. doi: 10.3389/fneur.2019.00274

Crossref Full Text | Google Scholar

6. Raza, SA, and Rangaraju, S. A review of pre-intervention prognostic scores for early prognostication and patient selection in endovascular management of large vessel occlusion stroke. Interv Neurol. (2018) 7:171–81. doi: 10.1159/000486539

Crossref Full Text | Google Scholar

7. Song, B, Liu, Y, Nyame, L, Chen, X, Jiang, T, Wang, W, et al. A COACHS nomogram to predict the probability of three-month unfavorable outcome after acute ischemic stroke in Chinese patients. Cerebrovasc Dis. (2019) 47:80–7. doi: 10.1159/000497243

Crossref Full Text | Google Scholar

8. Huang, J-C, Ko, K-M, Shu, M-H, and Hsu, B-M. Application and comparison of several machine learning algorithms and their integration models in regression problems. Neural Comput & Applic. (2019) 32:5461–9. doi: 10.1007/s00521-019-04644-5

Crossref Full Text | Google Scholar

9. Bzdok, D. Classical statistics and statistical learning in imaging neuroscience. Front Neurosci. (2017) 11:543. doi: 10.3389/fnins.2017.00543

Crossref Full Text | Google Scholar

10. Ramos, LA, Hv, O, Hilbert, A, Olabarriaga, SD, Avd, L, Roos, YBWEM, et al. Combination of radiological and clinical baseline data for outcome prediction of patients with an acute ischemic stroke. Front Neurol. (2022) 13:809343. doi: 10.3389/fneur.2022.809343

PubMed Abstract | Crossref Full Text | Google Scholar

11. Su, PA-O, Wei, YA-O, Luo, HA-OX, Liu, CA-O, Huang, WA-O, Chen, KA-O, et al. Machine learning models for predicting influential factors of early outcomes in acute ischemic stroke: registry-based study. JMIR Med Inform. (2022) 10:e32508. doi: 10.2196/32508

PubMed Abstract | Crossref Full Text | Google Scholar

12. Xie, Y, Jiang, B, Gong, E, Li, Y, Zhu, G, Michel, P, et al. Use of gradient boosting machine learning to predict patient outcome in acute ischemic stroke on the basis of imaging, demographic, and clinical information. Am J Roentgenol. (2019) 212:44–51. doi: 10.2214/AJR.18.20260

PubMed Abstract | Crossref Full Text | Google Scholar

13. HPAJ, TB, Olinger, CP, Marler, JR, Barsan, WG, Biller, J, Spilker, J, et al. Measurements of acute cerebral infarction: a clinical examination scale. Stroke. (1989) 20:864–70. doi: 10.1161/01.STR.20.7.864

Crossref Full Text | Google Scholar

14. Huybrechts, KF, Caro, JJ, Xenakis, JJ, and Vemmos, KN. The prognostic value of the modified Rankin scale score for long-term survival after first-ever stroke. Results from the Athens stroke registry. Cerebrovasc Dis. (2008) 26:381–7. doi: 10.1159/000151678

PubMed Abstract | Crossref Full Text | Google Scholar

15. Chen, T, and Guestrin, C. (2016). XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining. New York, NY, USA: Association for Computing Machinery

Google Scholar

16. Ke, G, Meng, Q, Finley, T, Wang, T, Chen, W, Ma, W, et al. (2017). LightGBM: a highly efficient gradient boosting decision tree. In 31st Conference on Neural Information Processing Systems Long Beach, CA, USA: NIPS.

Google Scholar

17. Pedregosa, F. Scikit-learn: machine learning in Python. J Mach Learn Res. (2011) 12:2825–30.

Google Scholar

18. Obermeyer, Z, and Emanuel, EJ. Predicting the future — big data, machine learning, and clinical medicine. N Engl J Med. (2016) 375:1216–9. doi: 10.1056/NEJMp1606181

Crossref Full Text | Google Scholar

19. Meyer, MJ, Pereira, S, McClure, A, Teasell, R, Thind, A, Koval, J, et al. A systematic review of studies reporting multivariable models to predict functional outcomes after post-stroke inpatient rehabilitation. Disabil Rehabil. (2015) 37:1316–23. doi: 10.3109/09638288.2014.963706

PubMed Abstract | Crossref Full Text | Google Scholar

20. Veerbeek, JM, Kwakkel, G, van Wegen, EEH, Ket, JCF, and Heymans, MW. Early prediction of outcome of activities of daily living after stroke. Stroke. (2011) 42:1482–8. doi: 10.1161/STROKEAHA.110.604090

Crossref Full Text | Google Scholar

21. Lin, C-H, Hsu, K-C, Johnson, KR, Fann, YC, Tsai, C-H, Sun, Y, et al. Evaluation of machine learning methods to stroke outcome prediction using a nationwide disease registry. Comput Methods Prog Biomed. (2020) 190:105381. doi: 10.1016/j.cmpb.2020.105381

PubMed Abstract | Crossref Full Text | Google Scholar

22. Chen, M, Qian, D, Wang, Y, An, J, Meng, K, Xu, S, et al. Systematic review of machine learning applied to the secondary prevention of ischemic stroke. J Med Syst. (2024) 48:8. doi: 10.1007/s10916-023-02020-4

Crossref Full Text | Google Scholar

23. Flach, C, Muruet, W, Wolfe, CDA, Bhalla, A, and Douiri, A. Risk and secondary prevention of stroke recurrence. Stroke. (2020) 51:2435–44. doi: 10.1161/STROKEAHA.120.028992

PubMed Abstract | Crossref Full Text | Google Scholar

24. Ryu, W-S, Schellingerhout, D, Hong, K-S, Jeong, S-W, Jang, MU, Park, M-S, et al. White matter hyperintensity load on stroke recurrence and mortality at 1 year after ischemic stroke. Neurology. (2019) 93:e578–89. doi: 10.1212/WNL.0000000000007896

PubMed Abstract | Crossref Full Text | Google Scholar

25. Li, F, Yang, L, Yang, R, Xu, W, Chen, F-p, Li, N, et al. Ischemic stroke in young adults of northern China: characteristics and risk factors for recurrence. Eur Neurol. (2017) 77:115–22. doi: 10.1159/000455093

PubMed Abstract | Crossref Full Text | Google Scholar

26. Omori, T, Kawagoe, M, Moriyama, M, Yasuda, T, Ito, Y, Hyakuta, T, et al. Multifactorial analysis of factors affecting recurrence of stroke in Japan. Asia Pac J Public Health. (2012) 27:NP333–40. doi: 10.1177/1010539512441821

Crossref Full Text | Google Scholar

27. Zhuo, Y, Wu, J, Qu, Y, Yu, H, Huang, X, Zee, B, et al. Clinical risk factors associated with recurrence of ischemic stroke within two years. Medicine. (2020) 99:e20830. doi: 10.1097/MD.0000000000020830

PubMed Abstract | Crossref Full Text | Google Scholar

28. Yuan, B, Yang, T, Yan, T, Cheng, W, and Bu, X. Relationships between D-dimer levels and stroke risk as well as adverse clinical outcomes after acute ischemic stroke or transient ischemic attack: a systematic review and Meta-analysis. Front Neurol. (2021) 12:670730. doi: 10.3389/fneur.2021.670730

Crossref Full Text | Google Scholar

29. Zhang, P, Wang, C, Wu, JH, and Zhang, SL. A systematic review of the predictive value of plasma D-dimer levels for predicting stroke outcome. Front Neurol. (2021) 12:693524. doi: 10.3389/fneur.2021.693524

Crossref Full Text | Google Scholar

30. Geng, H-H, Wang, Q, Li, B, Cui, B-B, Jin, Y-P, Fu, R-L, et al. Early neurological deterioration during the acute phase as a predictor of long-term outcome after first-ever ischemic stroke. Medicine. (2017) 96:e9068. doi: 10.1097/MD.0000000000009068

PubMed Abstract | Crossref Full Text | Google Scholar

31. Mathisen, SM, Dalen, I, Larsen, JP, and Kurz, M. Long-term mortality and its risk factors in stroke survivors. J Stroke Cerebrovasc Dis. (2016) 25:635–41. doi: 10.1016/j.jstrokecerebrovasdis.2015.11.039

Crossref Full Text | Google Scholar

32. Tu, W-J, Zhao, S-J, Liu, T-G, Yang, D-G, and Chen, H. Combination of high-sensitivity C-reactive protein and homocysteine predicts the short-term outcomes of Chinese patients with acute ischemic stroke. Neurol Res. (2013) 35:912–21. doi: 10.1179/1743132813Y.0000000228

PubMed Abstract | Crossref Full Text | Google Scholar

33. Bril, F, McPhaul, MJ, Caulfield, MP, Castille, J-M, Poynard, T, Soldevila-Pico, C, et al. Performance of the SteatoTest, ActiTest, NashTest and FibroTest in a multiethnic cohort of patients with type 2 diabetes mellitus. J Investig Med. (2023) 67:303–11. doi: 10.1136/jim-2018-000864

Crossref Full Text | Google Scholar

34. Heldner, MR, Zubler, C, Mattle, HP, Schroth, G, Weck, A, Mono, M-L, et al. National Institutes of Health stroke scale score and vessel occlusion in 2152 patients with acute ischemic stroke. Stroke. (2013) 44:1153–7. doi: 10.1161/STROKEAHA.111.000604

PubMed Abstract | Crossref Full Text | Google Scholar

35. Wang, X, Kim, YS, Kim, C, Jung, K-H, Kwon, H-M, Heo, SH, et al. Range of glucose as a glycemic variability and 3–month outcome in diabetic patients with acute ischemic stroke. PLoS One. (2017) 12:183894. doi: 10.1371/journal.pone.0183894

Crossref Full Text | Google Scholar

36. Barow, E, Quandt, F, Cheng, B, Gelderblom, M, Jensen, M, Königsberg, A, et al. Association of White Blood Cell Count with Clinical Outcome Independent of treatment with Alteplase in acute ischemic stroke. Front Neurol. (2022) 13:877367. doi: 10.3389/fneur.2022.877367

PubMed Abstract | Crossref Full Text | Google Scholar

37. Fernandez-Lozano, C, Hervella, P, Mato-Abad, V, Rodríguez-Yáñez, M, Suárez-Garaboa, S, López-Dequidt, I, et al. Random forest-based prediction of stroke outcome. Sci Rep. (2021) 11:10071. doi: 10.1038/s41598-021-89434-7

PubMed Abstract | Crossref Full Text | Google Scholar

38. Feng, X, Hua, Y, Zou, J, Jia, S, Ji, J, Xing, Y, et al. Intelligible models for HealthCare: predicting the probability of 6-month unfavorable outcome in patients with ischemic stroke. Neuroinformatics. (2021) 20:575–85. doi: 10.1007/s12021-021-09535-6

PubMed Abstract | Crossref Full Text | Google Scholar

39. Ramos, LA, Kappelhof, M, van Os, HJA, Chalos, V, Van Kranendonk, K, Kruyt, ND, et al. Predicting poor outcome before endovascular treatment in patients with acute ischemic stroke. Front Neurol. (2020) 11:580957. doi: 10.3389/fneur.2020.580957

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: ischemic stroke, machine learning, prognosis, prediction model, random forest

Citation: Huang Q, Shou G-L, Shi B, Li M-L, Zhang S, Han M and Hu F-Y (2024) Machine learning is an effective method to predict the 3-month prognosis of patients with acute ischemic stroke. Front. Neurol. 15:1407152. doi: 10.3389/fneur.2024.1407152

Received: 30 March 2024; Accepted: 27 May 2024;
Published: 12 June 2024.

Edited by:

Jean-charles Sanchez, University of Geneva, Switzerland

Reviewed by:

Xavier Robin, University of Zurich, Switzerland
Guoqi Cai, Anhui Medical University, China

Copyright © 2024 Huang, Shou, Shi, Li, Zhang, Han and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fu-Yong Hu, aHVmdXlvbmdAMTI2LmNvbQ==

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.