- 1Department of Neurosurgery, Tokyo Medical and Dental University, Bunkyo-ku, Japan
- 2Institute of Education, Innovative Human Resource Development Division, Tokyo Medical and Dental University, Bunkyo-ku, Japan
- 3Department of Neurosurgery, School of Medicine, International University of Health and Welfare, Narita, Japan
- 4Emergency Medical Care Center, Saiseikai Shiga Hospital, Ritto, Shiga, Japan
- 5Department of Neurosurgery, NHO Disaster Medical Center, Tachikawa, Japan
- 6Department of Neurosurgery, Tsuchiura Kyodo General Hospital, Tsuchiura, Ibaraki, Japan
- 7Department of Neurosurgery, Kumamoto Red Cross Hospital, Kumamoto, Japan
- 8Department of Neurosurgery, Sendai City Hospital, Sendai, Miyagi, Japan
- 9Department of Neurosurgery, Chiba Emergency Medical Center, Chiba, Japan
- 10Emergency Medical Center, Kagawa University Hospital, Kita-gun, Kagawa, Japan
- 11Department of Neurosurgery, Yamaguchi University School of Medicine, Ube, Yamaguchi, Japan
- 12Department of Neurosurgery, Hyogo Prefectural Kakogawa Medical Center, Kakogawa, Hyogo, Japan
- 13Department of Emergency and Critical Care Medicine, Graduate School of Medicine, Nippon Medical School, Bunkyo-ku, Japan
- 14Department of Neurological Surgery, Nihon University School of Medicine, Itabashi-ku, Japan
- 15Department of Emergency Medicine, Teikyo University School of Medicine, Itabashi-ku, Japan
- 16Department of Neurosurgery, St. Marianna University School of Medicine, Kawasaki, Kanagawa, Japan
- 17Iwate Prefectural Advanced Critical Care and Emergency Center, Iwate Medical University, Yahaba, Iwate, Japan
Objective: Neurological deterioration after mild traumatic brain injury (TBI) has been recognized as a poor prognostic factor. Early detection of neurological deterioration would allow appropriate monitoring and timely therapeutic interventions to improve patient outcomes. In this study, we developed a machine learning model to predict the occurrence of neurological deterioration after mild TBI using information obtained on admission.
Methods: This was a retrospective cohort study of data from the Think FAST registry, a multicenter prospective observational study of elderly TBI patients in Japan. Patients with an admission Glasgow Coma Scale (GCS) score of 12 or below or who underwent surgical treatment immediately upon admission were excluded. Neurological deterioration was defined as a decrease of 2 or more points from a GCS score of 13 or more within 24 h of hospital admission. The model predictive accuracy was judged with the area under the receiver operating characteristic curve (AUROC) and the area under the precision-recall curve (AUPRC), and the Youden index was used to determine the cutoff value.
Results: A total of 421 of 721 patients registered in the Think FAST registry between December 2019 and May 2021 were included in our study, among whom 25 demonstrated neurological deterioration. Among several machine learning algorithms, eXtreme Gradient Boosting (XGBoost) demonstrated the highest predictive accuracy in cross-validation, with an AUROC of 0.81 (±0.07) and an AUPRC of 0.33 (±0.08). Through SHapley Additive exPlanations (SHAP) analysis, five important features (D-dimer, fibrinogen, acute subdural hematoma thickness, cerebral contusion size, and systolic blood pressure) were identified and used to construct a better performing model (cross-validation AUROC of 0.84 and AUPRC of 0.34; testing data AUROC of 0.77 and AUPRC of 0.19). At the cutoff value from the Youden index, the model showed a sensitivity, specificity, and positive predictive value of 60, 96, and 38%, respectively. When neurosurgeons attempted to predict neurological deterioration using the same testing data, their values were 20, 94, and 19%, respectively.
Conclusion: In this study, our predictive model showed an acceptable performance in detecting neurological deterioration after mild TBI. Further validation through prospective studies is necessary to confirm these results.
Introduction
With population aging, the number of elderly patients with traumatic brain injury (TBI) continues to increase worldwide (1, 2). Although the causes of head injury among elderly patients with TBI, such as falls, are considered minor in this population, they are known to result in worse life and functional outcomes in elderly patients than in younger patients (3–6). One factor contributing to the poor prognosis of TBI in elderly patients is the increased incidence of neurological deterioration, which refers to the progression from an initial mild state—where patients can talk and communicate—to a state where consciousness impairment advances within a short period. Neurological deterioration has been recognized as a poor prognostic indicator in the management of head trauma. In particular, elderly patients are thought to be prone to neurological deterioration due to age-related brain atrophy, which may obscure the manifestations of intracranial hemorrhage or brain swelling in the early stages of injury, leading to underestimation of the initial severity of head trauma (7). Predicting the progression to neurological deterioration and subsequently initiating appropriate monitoring and interventions before the occurrence of deterioration may lead to improved outcomes for elderly patients with TBI.
Factors that have been reported to affect neurological deterioration include the presence of acute subdural hematoma, the use of anticoagulant medications (8), and elevated D-dimer levels on admission. However, there is no established way to assess the risk of neurological deterioration in each patient, possibly because the occurrence of neurological deterioration is relatively rare, so it is difficult to extract data on multiple independent factors related to this condition and construct predictive models via conventional statistical methods.
In recent years, many studies have demonstrated that machine learning enables the development of more accurate predictive models than traditional statistical methods do, such as in predicting the length of stay in the ICU for trauma patients or the risk of developing epilepsy after TBI. Although one of the weaknesses in utilizing machine learning analysis lies in its black-box nature, which possibly hinders its widespread use owing to the lack of clinical interpretability, recent advancements in interpreting the decisions made by machine learning models are expected to facilitate the application of machine learning in clinical settings (9).
In this study, we developed a predictive model using machine learning algorithms to predict the occurrence of neurological deterioration in elderly patients with mild TBI. Furthermore, we examined the clinical validity of the constructed predictive model by using the SHapley Additive exPlanations (SHAP) (10) explainable artificial intelligence (XAI) method.
Methods
Study population
This study was approved by the Medical Research Ethics Committee of Tokyo Medical and Dental University (M2019-210) and all the participating institutions. The requirement for informed consent was waived because of the observational nature of this study.
In this study, analysis was conducted using data from the Think FAST registry (11), a multicenter prospective database that contains data on hospitalized patients aged 65 years and older with head injuries. The participating institutions include Iwate Medical University, Sendai City Hospital, Tsuchiura Kyodo General Hospital, Chiba Emergency Medical Center, Teikyo University Hospital, Nippon Medical School Hospital, Nihon University Hospital, National Disaster Medical Center, St. Marianna University Hospital, Tokyo Medical and Dental University Hospital, Saiseikai Shiga Hospital, Hyogo Prefectural Kakogawa Medical Center, Kagawa University Hospital, Yamaguchi University Hospital, and the Japanese Red Cross Kumamoto Hospital. Patient registration was conducted between December 2019 and May 2021, resulting in the inclusion of data from 721 patients. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guidelines (12).
Definition of neurological deterioration
In this study, neurological deterioration was defined as a decrease of 2 or more points in the Glasgow Coma Scale (GCS) score in patients whose admission GCS score was 13 points or higher and who were treated with conservative management on admission.
Features
The Think Fast registry contains the prospectively registered data of hospitalized patients aged 65 years and older with head trauma. In this study, the following data were extracted from the database: age, sex, vital signs (systolic blood pressure, heart rate), consciousness level (evaluated via the GCS), laboratory data (platelet count, prothrombin time-international normalized ratio (PT-INR), activated partial thromboplastin time (APTT), D-dimer level, fibrinogen level), head computed tomography (CT) findings [acute subdural hematoma thickness (mm), acute epidural hematoma thickness (mm), cerebral contusion diameter (mm), presence of traumatic subarachnoid hemorrhage, skull vault fracture, skull base fracture, midline shift (mm), appearance of basal cisterns (normal/compressed/disappeared)], mechanism of injury (traffic accident/fall), time from injury to hospital arrival (minutes), administration of hemostatic agents (tranexamic acid, carbazochrome), antithrombotic drug intake (number of antiplatelet drugs or anticoagulants), and reversal therapy (vitamin K, fresh frozen plasma, four-factor prothrombin complex concentrate, idarucizumab, platelet transfusion). Vital signs, consciousness levels, and laboratory data were collected at the time of the patient’s arrival at the emergency room. CT findings were also based on examinations performed at the time of arrival, and the measurements were calculated from the images of each patient.
Machine learning algorithms
Prediction models, including logistic regression (13), support vector machine (with linear and radial basis function (RBF) kernels) (14), eXtreme Gradient Boosting (XGBoost) (15, 16), and random forest models (17, 18), were constructed via machine learning algorithms.
The predictive accuracy of the models was compared through cross-validation. The best performing algorithm was then selected, and SHAP (10) values were computed to visualize their relative importance for each feature. Furthermore, the dimensionality of the feature space was reduced according to the importance values, and the resulting features were used to create the most accurate prediction model.
Python version 3.9 was used to build the machine learning models and analyze missing values and SHAP values. Several Python modules were employed for this task, including numpy 1.23.2, scikit-learn 1.4.0, matplotlib-base 3.6.3, pandas 1.5.3, XGBoost 1.7.1, pyampute 0.0.3, and SHAP 0.41.0.
Preprocessing
Supplementary Figure 1 illustrates the percentage of missing values for each feature in the whole dataset. In particular, the proportion of missing values related to the coagulation system was greater than that related to other systems. Little’s missing completely at random (MCAR) test yielded a p value of 0.93, indicating that the missing values were not MCAR. Therefore, we adopted four approaches to handle the missing data in this study: (1) k-nearest neighbors (19); (2) multiple imputation (20); (3) random forest regression (21); and (4) no imputation, employed only in the XGBoost model since only XGBoost can handle data containing missing values.
The entire dataset was randomly divided at a 3:2 ratio into training and validation datasets (60% of the data), which were used for model creation with threefold cross-validation (k = 3), and a testing dataset (the remaining 40% of the data), which was used to test the performance of the model. During cross-validation, hyperparameter tuning within the ranges (shown in Supplementary Table 1) was performed to optimize the hyperparameters of each algorithm.
As shown in Supplementary Table 2, the dataset was imbalanced due to the rarity of neurological deterioration (approximately 6%) among the patients with mild TBI. Thus, class weighting was applied to modify the loss function during model training to solve the problems associated with imbalanced data. Specifically, we assigned an approximately 13 times greater weight to the positive cases than to the negative cases, as reported previously (22).
Performance evaluation
The evaluation metrics used to compare the predictive performance of the machine learning algorithms included the area under the receiver operating characteristic curve (AUROC) and the area under the precision–recall curve (AUPRC). The Youden index was used to establish the cutoff value, and the sensitivity, specificity, positive predictive value, and negative predictive value were calculated.
Statistical analysis
R version 4.0.3 was used as the statistical analysis software. Student’s t test and Welch’s t test were applied to compare continuous and normally distributed variables between groups. For nonnormally distributed variables, the Mann–Whitney U test was used for between-group comparisons. For categorical variables, groups were compared with the chi-square test.
All tests were two-sided, and a significance level of 0.05 was used. Bonferroni adjustment was applied to mitigate the risk of Type I errors arising from performing multiple comparison tests on the same data.
Results
The Think FAST registry includes data from a total of 721 head trauma patients aged 65 years and older, among whom 421 patients had an admission GCS score of 13 points or higher and were treated with conservative management at admission (Figure 1). Neurological deterioration occurred in 25 patients (6%). The distributions of each variable in the training and validation datasets and the testing dataset are presented in Table 1.
Figure 1. The whole study design. XGBoost, eXtreme Gradient Boosting; SHAP, the SHapley Additive exPlanations.
The distribution of missing values is illustrated in Supplementary Figure 1. High rates of missing values were observed, particularly in the coagulation profiles, notably the D-dimer and fibrinogen levels. Little’s MCAR test revealed a p value of 0.93, indicating that none of the missing values were MCAR. Missing values were imputed via k-nearest neighbor, multiple imputation, and random forest regression methods.
All the data were randomly divided into a training/validation dataset and a testing dataset such that the proportion of data corresponding to patients who experienced neurological deterioration remained consistent across both datasets. Sixty percent of the data (those in the training/validation dataset) were utilized to construct the predictive model through cross-validation, whereas the remaining 40% (testing dataset) were used to test the performance of the constructed predictive models.
Initially, predictive models were created using all available data extracted from the Think FAST registry. XGBoost, random forest, support vector machine (with a linear RBF kernel), and logistic regression frameworks were employed to construct the predictive models. XGBoost, in particular, is designed to allow the construction of predictive models without the need to impute missing values. We then compared the performance of each machine learning model and found that the model constructed using XGBoost without imputing missing values had the highest predictive performance, with an AUROC of 0.81 [0.07] and an AUPRC of 0.33 [0.08] (Table 2 and Supplementary Table 3). The mean absolute SHAP values were also calculated and are presented in Figure 2.
Table 2. Area under the receiver operating characteristics curve in each algorithm with each imputation method.
Next, variables with low mean absolute SHAP values were sequentially removed for dimensionality reduction. The AUROC trends in the training/validation dataset and testing dataset after reconstruction of the XGBoost model with the different sets of dimensionally reduced features are shown in Figure 3. Among the various models, the predictive model constructed using the top five variables identified through SHAP analysis demonstrated the highest AUROC during cross-validation and showed minimal differences in predictive accuracy in the testing data. For this model, the cross-validation AUROC was 0.84, and the AUPRC was 0.34, whereas in the testing dataset, the AUROC was 0.77, and the AUPRC was 0.19 (Figure 4 and Supplementary Figure 2). Using the bootstrap method, the 95% confidence interval for the AUROC in the testing dataset was calculated to be 0.57–0.94. This range indicates a significantly higher predictive accuracy of this model than that of random guessing.
Figure 4. The AUROC trends in the training/validation dataset and testing dataset after dimension reduction based on SHAP values. The red circle indicates the results of feature numbers that showed superior AUROC in cross-validation and test. AUROC, area under the receiver operating characteristic curve.
The SHAP dependence plot, which illustrates the distribution between the numerical values of each variable and their corresponding SHAP values in the prediction model, is presented in Figure 5. For D-dimer levels, a transition in SHAP values from negative to positive occurred when the level surpassed 30 μg/mL, indicating a tendency for the SHAP value to increase as the D-dimer level increased. Fibrinogen levels displayed a shift toward positive SHAP values when at levels below 200 mg/dL, at which they contributed more to neurological deterioration. For the acute subdural hematoma (ASDH) thickness, a transition from negative to positive SHAP values occurred at a thickness of 5 mm, whereas a cerebral contusion exceeding 10 mm in size showed a similar trend toward neurological deterioration (Figure 6).
Figure 5. Receiver operating characteristic (ROC) curves in the threefold stratified cross-validation (A) and in the testing dataset (B). AUC, area under the curve.
At the cutoff value for this model determined via the Youden index, the sensitivity was 60%, the specificity was 96%, the positive predictive value was 37.5%, and the negative predictive value was 90%. When three neurosurgeons reviewed the data for each variable in the test dataset to predict the occurrence of neurological deterioration, the sensitivity was 20%, the specificity was 94%, the positive predictive value was 19%, and the negative predictive value was 94% (Table 3).
Table 3. Comparison of neurological deterioration as assessed by the XGBoost model and by neurosurgeons.
Discussion
In this study, we constructed a machine learning model to predict neurological deterioration among patients with mild TBI via data from the Think FAST registry database, which comprises data prospectively collected from multiple facilities, particularly from patients aged 65 years and older. Among several machine learning models constructed with one of several imputation methods, the model constructed via the XGBoost framework without imputation showed comparatively better predictive performance than the other models did, with an AUROC of 0.81 according to cross-validation. Furthermore, by conducting feature dimension reduction on the basis of the feature SHAP values, we built a predictive model using XGBoost that could predict neurological deterioration with an AUROC of 0.84 in cross-validation and an AUROC of 0.77 in the testing dataset.
Neurological deterioration after mild TBI has long been recognized; patients with this condition are said to “talk and deteriorate” or “talk and die” (23–25). Marshal et al.’s study published in 1984 (26) reported a TBI incidence of approximately 10% in head trauma patients, with over half of them dying or being left comatose. According to the report by Lobato et al. (27), among patients with severe TBI, approximately 25% were able to converse before deteriorating to a severe state, with 32% of these patients ultimately dying. Therefore, neurological deterioration has been acknowledged as a condition that emerges in some patients with TBI and results in a poor prognosis, posing a clinical challenge to differentiate patients who experience neurological deterioration from those who do not. Recent meta-analyses have indicated that neurological deterioration occurs in approximately 12% of patients with mild TBI; however, statistically significant prognostic factors for neurological deterioration have not been identified (28). Moreover, to our knowledge, methods for stratifying the risk of neurological deterioration remain elusive. In this study, the best predictive model we constructed achieved good predictive accuracy, with an AUROC of 0.77 for predicting neurological deterioration in the testing dataset. This performance indicates a predictive accuracy superior to that of random guessing and suggests the potential for better sensitivity and a greater positive predictive value than predictions made by neurosurgeons.
For this predictive model, we extracted data on variables that are highly correlated with neurological deterioration via SHAP analysis. The D-dimer level, fibrinogen level, ASDH thickness, systolic blood pressure, and cerebral contusion diameter were selected as the most important variables. The trends in the SHAP values for these variables in this study were consistent with the results of previous reports on the severity of TBI and neurological deterioration. For example, regarding the relationship between coagulation parameters such as D-dimer and fibrinogen levels and TBI, Nakae et al. reported that the D-dimer level tends to increase while the fibrinogen level tends to decrease early after injury (29). Moreover, patients with TBI and elevated D-dimer levels upon admission are more prone to hemorrhage progression and neurological deterioration, resulting in poorer outcomes for patients with elevated D-dimer levels than those without elevated D-dimer levels (30, 31). Head CT studies have revealed that ASDH thickness is a risk factor for subsequent exacerbation (32), and cerebral contusions measuring larger than 20 mm are significantly associated with hemorrhage progression (33). These trends align with the dependence plots shown for each variable, indicating the consistency of these results. In this study, we not only focused on these individual factors but also analyzed their complex interrelationships via machine learning techniques to construct a prediction model to accurately predict neurological deterioration.
The number of studies using machine learning in the field of medicine has rapidly increased in recent years (34). In particular, predictive models that utilize ensemble learning or deep learning have been shown to demonstrate superior predictive accuracy to traditional logistic regression models in many studies (35). However, the black-box nature of the prediction process inherent in these complex models poses a barrier to their practical application in clinical settings. SHAP analysis is a method that was developed for interpreting machine learning models (36), allowing interpretation of the meaning of the features used in the construction of predictive models. In this study, by visualizing the trends of feature importance via dependence plots, the information obtained from the analysis could be used to increase the reliability of the machine learning model. The predictive model developed in this study is publicly available on the GitHub website (37) in a format that can be easily implemented. By utilizing our model, it may be possible to detect cases of deterioration at an early stage after hospitalization. This could contribute to improving the prognosis of TBI by implementing strict monitoring and early follow-up for high-risk patients. Furthermore, enhancing predictive models, such as the one in this study, is expected to advance personalized medicine for patients with TBI.
Limitations
As the Think FAST registry used in this study contains data from multiple facilities collected in a prospective manner, the impact of domain shifts is considered relatively minimal. Although the predictive model we developed did not show a significant discrepancy in AUROC between cross-validation and testing, the overall sample size was too small to establish an external validation dataset. Future external validation studies will be necessary to confirm the validity of this model. In addition, since the Think FAST registry includes data from patients aged 65 years and older, our predictive model can only be applied to older patients. Since the mechanisms of injury, types of hemorrhage, and clinical courses of TBI may differ between younger and older patients, applying our present model to populations with different prior probabilities could yield incorrect results. Therefore, for younger patients, it is necessary to develop another predictive model or at least validate the present model using data with a wider age range.
Furthermore, the variables used to create this model included only those for which data, such as clinical examination data, laboratory data and descriptive findings from head CT scans, are commonly acquired in the management of head trauma. In the future, CT images could be incorporated into predictive models via convolutional neural networks to increase the predictive accuracy of these models. Additionally, since this study focused solely on the progression of consciousness impairment after admission as the outcome, further investigation is needed to understand how this information could be utilized in treatment decision-making and its impact on patient outcomes.
Conclusion
The application of machine learning models suggests the potential to detect the occurrence of neurological deterioration in elderly patients with mild TBI using only admission data. Further validation with an external dataset is needed in the future.
Data availability statement
The data analyzed in this study is subject to the following licenses/restrictions: data sharing requires the permission by Think FAST registry group. Requests to access these datasets should be directed to aW5hbW5zcmdAdG1kLmFjLmpw.
Ethics statement
The studies involving humans were approved by the Medical Research Ethics Committee of Tokyo Medical and Dental University. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because because of the observational nature of this study.
Author contributions
DA: Conceptualization, Formal analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. MI: Conceptualization, Data curation, Supervision, Writing – review & editing. TH: Methodology, Supervision, Writing – review & editing. ES: Data curation, Funding acquisition, Writing – review & editing. NS: Data curation, Writing – review & editing. HY: Data curation, Writing – review & editing. SHi: Data curation, Writing – review & editing. SHa: Data curation, Writing – review & editing. HK: Data curation, Writing – review & editing. AM: Data curation, Writing – review & editing. KK: Data curation, Writing – review & editing. KH: Data curation, Writing – review & editing. HA: Data curation, Writing – review & editing. SY: Data curation, Writing – review & editing. TMaed: Data curation, Writing – review & editing. TO: Data curation, Writing – review & editing. KO: Data curation, Writing – review & editing. NK: Data curation, Writing – review & editing. MS: Data curation, Writing – review & editing. TMaeh: Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by CSL Behring K.K. The fund was used for collecting data and English editing fee. The funder was not involved in writing this manuscript or submission for publication.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Gen AI was used in the creation of this manuscript.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2024.1502153/full#supplementary-material
References
1. James, SL, Theadom, A, Ellenbogen, RG, Bannick, MS, Montjoy-Venning, W, Lucchesi, LR, et al. Global, regional, and national burden of traumatic brain injury and spinal cord injury, 1990–2016: a systematic analysis for the global burden of disease study 2016. Lancet Neurol. (2019) 18:56–87. doi: 10.1016/S1474-4422(18)30415-0
2. Maas, AIR, Menon, DK, Adelson, PD, Andelic, N, Bell, MJ, Belli, A, et al. Traumatic brain injury: integrated approaches to improve prevention, clinical care, and research. Lancet Neurol. (2017) 16:987–1048. doi: 10.1016/S1474-4422(17)30371-X
3. Hawley, C, Sakr, M, Scapinello, S, Salvo, J, and Wrenn, P. Traumatic brain injuries in older adults-6 years of data for one UK trauma Centre: retrospective analysis of prospectively collected data. Emerg Med J. (2017) 34:509–16. doi: 10.1136/emermed-2016-206506
4. Gardner, RC, Dams-O’Connor, K, Morrissey, MR, and Manley, GT. Geriatric traumatic brain injury: epidemiology, outcomes, knowledge gaps, and future directions. J Neurotrauma. (2018) 35:889–906. doi: 10.1089/neu.2017.5371
5. Yang, C, Lang, L, He, Z, Hui, J, Jiang, J, Gao, G, et al. Epidemiological characteristics of older patients with traumatic brain injury in China. J Neurotrauma. (2022) 39:850–9. doi: 10.1089/neu.2021.0275
6. Yamagami, K, Kurogi, R, Kurogi, A, Nishimura, K, Onozuka, D, Ren, N, et al. The influence of age on the outcomes of traumatic brain injury: findings from a Japanese Nationwide survey (J-ASPECT study-traumatic brain injury). World Neurosurg. (2019) 130:e26–46. doi: 10.1016/j.wneu.2019.05.140
7. Kehoe, A, Rennie, S, and Smith, JE. Glasgow coma scale is unreliable for the prediction of severe head injury in elderly trauma patients. Emerg Med J. (2015) 32:613–5. doi: 10.1136/emermed-2013-203488
8. Karibe, H, Hayashi, T, Narisawa, A, Kameyama, M, Nakagawa, A, and Tominaga, T. Clinical characteristics and outcome in elderly patients with traumatic brain injury: for establishment of management strategy. Neurol Med Chir (Tokyo). (2017) 57:418–25. doi: 10.2176/nmc.st.2017-0058
9. Pierce, RL, Van Biesen, W, Van Cauwenberge, D, Decruyenaere, J, and Sterckx, S. Explainability in medicine in an era of AI-based clinical decision support systems. Front Genet. (2022) 13:903600. doi: 10.3389/fgene.2022.903600
10. Lundberg, Scott M., and Lee, S-I. A unified approach to interpreting model predictions. NIPS’17: Proceedings of the 31st International Conference on Neural Information Processing Systems. Curran Associates, Inc. (2017). 4768–77.
11. Suehiro, E, Shiomi, N, Yatsushige, H, Hirota, S, Hasegawa, S, Karibe, H, et al. The current status of reversal therapy in Japan for elderly patients with head injury treated with antithrombotic agents: A prospective multicenter observational study. Heliyon. (2024) 10:e25193. doi: 10.1016/j.heliyon.2024.e25193
12. von Elm, E, Altman, DG, Egger, M, Pocock, SJ, Gøtzsche, PC, and Vandenbroucke, JP. The strengthening the reporting of observational studies in epidemiology (STROBE) statement: guidelines for reporting observational studies. Lancet. (2007) 370:1453–7. doi: 10.1016/S0140-6736(07)61602-X
13. Hosmer, DW Jr, Lemeshow, S, and Sturdivant, RX. Applied logistic regression. 3rd ed. Hoboken, NJ: John Wiley & Sons (2013).
14. Jun, Z. The development and application of support vector machine. J Phys Conf Ser. (2021) 1748:052006. doi: 10.1088/1742-6596/1748/5/052006
15. Tianqi Chen, CG. XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (Association for Computing Machinery). (2016):785–94.
16. Liu, J, Wu, J, Liu, S, Li, M, Hu, K, and Li, K. Predicting mortality of patients with acute kidney injury in the ICU using XGBoost model. PLoS One. (2021) 16:e0246306. doi: 10.1371/journal.pone.0246306
18. Ge, B. Analysis of a random forests model. J Mach Learn Res. (2012) 13:1063–95. doi: 10.1.1.236.1818
19. Frossard, DEN, Nunes, IO, and Krohling, RA. An approach to dealing with missing values in heterogeneous data using k-nearest neighbors Computer Science, Mathematics. Cham, Switzerland: Springer. (2016).
20. Austin, PC, White, IR, Lee, DS, and van Buuren, S. Missing data in clinical research: a tutorial on multiple imputation. Can J Cardiol. (2021) 37:1322–31. doi: 10.1016/j.cjca.2020.11.010
21. Tang, F, and Ishwaran, H. Random Forest missing data algorithms. Stat Anal Data Min. (2017) 10:363–77. doi: 10.1002/sam.11348
22. Xinyu Dong, SR, Wang, Y, Hajagos, J, Zhao, X, Richard, N, Rosenthal, JK, et al. Machine learning based opioid overdose prediction using electronic health records. AIMA Ann Symposium Proc. (2019) 2019:389–98.
23. Reilly, PL, Graham, DI, Adams, JH, and Jennett, B. Patients with head injury who talk and die. Lancet. (1975) 2:375–7. doi: 10.1016/S0140-6736(75)92893-7
24. Rockswold, GL, Leonard, PR, and Nagib, MG. Analysis of management in thirty-three closed head injury patients who “talked and deteriorated”. Neurosurgery. (1987) 21:51–5. doi: 10.1227/00006123-198707000-00010
25. Ratanalert, S, Chompikul, J, and Hirunpat, S. Talked and deteriorated head injury patients: how many poor outcomes can be avoided? J Clin Neurosci. (2002) 9:640–3. doi: 10.1054/jocn.2002.1085
26. Marshall, LF, Toole, BM, and Bowers, SA. The National Traumatic Coma Data Bank. Part 2: patients who talk and deteriorate: implications for treatment. J Neurosurg. (1983) 59:285–8. doi: 10.3171/jns.1983.59.2.0285
27. Lobato, RD, Rivas, JJ, Gomez, PA, Castaneda, M, Canizal, JM, Sarabia, R, et al. Head-injured patients who talk and deteriorate into coma. Analysis of 211 cases studied with computerized tomography. J Neurosurg. (1991) 75:256–61. doi: 10.3171/jns.1991.75.2.0256
28. Marincowitz, C, Lecky, FE, Townend, W, Borakati, A, Fabbri, A, and Sheldon, TA. The risk of deterioration in GCS13-15 patients with traumatic brain injury identified by computed tomography imaging: a systematic review and Meta-analysis. J Neurotrauma. (2018) 35:703–18. doi: 10.1089/neu.2017.5259
29. Nakae, R, Murai, Y, Morita, A, and Yokobori, S. Coagulopathy and traumatic brain injury: overview of new diagnostic and therapeutic strategies. Neurol Med Chir. (2022) 62:261–9. doi: 10.2176/jns-nmc.2022-0018
30. Tong, WS, Zheng, P, Zeng, JS, Guo, YJ, Yang, WJ, Li, GY, et al. Prognosis analysis and risk factors related to progressive intracranial haemorrhage in patients with acute traumatic brain injury. Brain Inj. (2012) 26:1136–42. doi: 10.3109/02699052.2012.666437
31. Suehiro, E, Koizumi, H, Fujiyama, Y, Yoneda, H, and Suzuki, M. Predictors of deterioration indicating a requirement for surgery in mild to moderate traumatic brain injury. Clin Neurol Neurosurg. (2014) 127:97–100. doi: 10.1016/j.clineuro.2014.10.007
32. Bajsarowicz, P, Prakash, I, Lamoureux, J, Saluja, RS, Feyz, M, Maleki, M, et al. Nonsurgical acute traumatic subdural hematoma: what is the risk? J Neurosurg. (2015) 123:1176–83. doi: 10.3171/2014.10.JNS141728
33. Rehman, L, Afzal, A, Aziz, HF, Akbar, S, Abbas, A, and Rizvi, R. Radiological parameters to predict hemorrhagic progression of traumatic Contusional brain injury. J Neurosci Rural Pract. (2019) 10:212–7. doi: 10.4103/jnrp.jnrp_335_18
34. Kline, A, Wang, H, Li, Y, Dennis, S, Hutch, M, Xu, Z, et al. Multimodal machine learning in precision health: A scoping review. NPJ Digit Med. (2022) 5:171. doi: 10.1038/s41746-022-00712-8
35. Xie, F, Ong, MEH, Liew, J, Tan, KBK, Ho, AFW, Nadarajan, GD, et al. Development and assessment of an interpretable machine learning triage tool for estimating mortality after emergency admissions. JAMA Netw Open. (2021) 4:e2118467. doi: 10.1001/jamanetworkopen.2021.18467
36. Tseng, PY, Chen, YT, Wang, CH, Chiu, KM, Peng, YS, Hsu, SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. (2020) 24:478. doi: 10.1186/s13054-020-03179-9
37. GitHub. DaisuAbe/deterioration prediction. Available at: http://www.github.com/DaisuAbe/DeteriorationPrediction (Accessed December 17, 2024).
Keywords: mild traumatic brain injury, neurological deterioration, machine learning, predictive model, XGBoost
Citation: Abe D, Inaji M, Hase T, Suehiro E, Shiomi N, Yatsushige H, Hirota S, Hasegawa S, Karibe H, Miyata A, Kawakita K, Haji K, Aihara H, Yokobori S, Maeda T, Onuki T, Oshio K, Komoribayashi N, Suzuki M and Maehara T (2025) A machine learning model to predict neurological deterioration after mild traumatic brain injury in older adults. Front. Neurol. 15:1502153. doi: 10.3389/fneur.2024.1502153
Edited by:
V. Sujith Sajja, Walter Reed Army Institute of Research, United StatesReviewed by:
Marco Fidel Avila-Rodriguez, Tolima University, ColombiaJinn-Rung Kuo, Chi Mei Medical Center, Taiwan
Copyright © 2025 Abe, Inaji, Hase, Suehiro, Shiomi, Yatsushige, Hirota, Hasegawa, Karibe, Miyata, Kawakita, Haji, Aihara, Yokobori, Maeda, Onuki, Oshio, Komoribayashi, Suzuki and Maehara. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Motoki Inaji, aW5hbW5zcmdAdG1kLmFjLmpw