Stroke is an acute disorder and dysfunction of the focal neurological system that has long been recognized as one of the leading causes of death and severe disability in most regions globally. This study aimed to supplement and exploit multiple comorbidities, laboratory tests and demographic factors to more accurately predict death related to stroke, and furthermore, to make inferences about the heterogeneity of treatment in stroke patients to guide better treatment planning.
We extracted data from the Medical Information Mart from the Intensive Care (MIMIC)-IV database. We compared the distribution of the demographic factors between the control and death groups. Subsequently, we also developed machine learning (ML) models to predict mortality among stroke patients. Furthermore, we used meta-learner to recognize the heterogeneity effects of warfarin and human albumin. We comprehensively evaluated and interpreted these models using Shapley Additive Explanation (SHAP) analysis.
We included 7,483 patients with MIMIC-IV in this study. Of these, 1,414 (18.9%) patients died during hospitalization or 30 days after discharge. We found that the distributions of age, marital status, insurance type, and BMI differed between the two groups. Our machine learning model achieved the highest level of accuracy to date in predicting mortality in stroke patients. We also observed that patients who were consistent with the model determination had significantly better survival outcomes than the inconsistent population and were better than the overall treatment group.
We used several highly interpretive machine learning models to predict stroke prognosis with the highest accuracy to date and to identify heterogeneous treatment effects of warfarin and human albumin in stroke patients. Our interpretation of the model yielded a number of findings that are consistent with clinical knowledge and warrant further study and verification.