Prediction of posttraumatic functional recovery in middle-aged and older patients through dynamic ensemble selection modeling

Nhu, Nguyen Thanh; Kang, Jiunn-Horng; Yeh, Tian-Shin; Wu, Chia-Chieh; Tsai, Cheng-Yu; Piravej, Krisna; Lam, Carlos

doi:10.3389/fpubh.2023.1164820

ORIGINAL RESEARCH article

Front. Public Health, 20 June 2023

Sec. Aging and Public Health

Volume 11 - 2023 | https://doi.org/10.3389/fpubh.2023.1164820

This article is part of the Research TopicInnovations in Older Adult Care and Health Service Management: A Focus on the Asia-Pacific RegionView all 15 articles

Prediction of posttraumatic functional recovery in middle-aged and older patients through dynamic ensemble selection modeling

Nguyen Thanh Nhu^1,2^†

Jiunn-Horng Kang^1,3,4,5,6^†

Tian-Shin Yeh^3,7,8,9

Chia-Chieh Wu^10,11

Cheng-Yu Tsai¹²

Krisna Piravej^13,14

Carlos Lam^10,11^*

¹International Ph.D. Program in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
²Faculty of Medicine, Can Tho University of Medicine and Pharmacy, Can Tho, Vietnam
³Department of Physical Medicine and Rehabilitation, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
⁴Department of Physical Medicine and Rehabilitation, Taipei Medical University Hospital, Taipei, Taiwan
⁵Graduate Institute of Nanomedicine and Medical Engineering, College of Biomedical Engineering, Taipei Medical University, Taipei, Taiwan
⁶Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
⁷Department of Physical Medicine and Rehabilitation, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
⁸Department of Epidemiology and Nutrition, Harvard T. H. Chan School of Public Health, Harvard University, Boston, MA, United States
⁹Nuffield Department of Population Health, University of Oxford, Oxford, United Kingdom
¹⁰Emergency Department, Wan Fang Hospital, Taipei Medical University, Taipei, Taiwan
¹¹Department of Emergency, School of Medicine, College of Medicine, Taipei Medical University, Taipei, Taiwan
¹²Centre for Transport Studies, Department of Civil and Environmental Engineering, Imperial College London, London, United Kingdom
¹³Department of Rehabilitation Medicine, Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
¹⁴Department of Chula Neuroscience Center, King Chulalongkorn Memorial Hospital, Bangkok, Thailand

Introduction: Age-specific risk factors may delay posttraumatic functional recovery; complex interactions exist between these factors. In this study, we investigated the prediction ability of machine learning models for posttraumatic (6 months) functional recovery in middle-aged and older patients on the basis of their preexisting health conditions.

Methods: Data obtained from injured patients aged ≥45 years were divided into training–validation (n = 368) and test (n = 159) data sets. The input features were the sociodemographic characteristics and baseline health conditions of the patients. The output feature was functional status 6 months after injury; this was assessed using the Barthel Index (BI). On the basis of their BI scores, the patients were categorized into functionally independent (BI >60) and functionally dependent (BI ≤60) groups. The permutation feature importance method was used for feature selection. Six algorithms were validated through cross-validation with hyperparameter optimization. The algorithms exhibiting satisfactory performance were subjected to bagging to construct stacking, voting, and dynamic ensemble selection models. The best model was evaluated on the test data set. Partial dependence (PD) and individual conditional expectation (ICE) plots were created.

Results: In total, nineteen of twenty-seven features were selected. Logistic regression, linear discrimination analysis, and Gaussian Naive Bayes algorithms exhibited satisfactory performances and were, therefore, used to construct ensemble models. The k-Nearest Oracle Elimination model outperformed the other models when evaluated on the training–validation data set (sensitivity: 0.732, 95% CI: 0.702–0.761; specificity: 0.813, 95% CI: 0.805–0.822); it exhibited compatible performance on the test data set (sensitivity: 0.779, 95% CI: 0.559–0.950; specificity: 0.859, 95% CI: 0.799–0.912). The PD and ICE plots showed consistent patterns with practical tendencies.

Conclusion: Preexisting health conditions can predict long-term functional outcomes in injured middle-aged and older patients, thus predicting prognosis and facilitating clinical decision-making.

1. Introduction

Traumatic injuries are a leading cause of morbidity and mortality in middle-aged and older individuals (1, 2). Due to various sociodemographic factors and frailty, these patients have posttraumatic complications, prolonged hospitalization, and a poor quality of life, which, in turn, increase the risk of functional disability (3–7). Although prognosis and clinical decision-making are highly complicated in injured middle-aged and older patients, these factors are crucial for ensuring optimal care and rehabilitation, which may minimize mortality rates and improve functional independence in these patients (8, 9).

Approximately 50% of injured older patients have comorbidities, which result in functional limitations and poor physical health (10). Severe complications are associated with certain demographic characteristics and preexisting diseases (e.g., diabetes, cardiovascular disease, liver disease, and psychological conditions) (11); these factors may delay functional recovery. In patients with hip bone fractures, neurological and renal disorders are associated with reduced performance of activities of daily living (ADL), as assessed using the Barthel Index (BI) (6). These findings imply that demographics and baseline health conditions are associated with the risk of functional dependence in injured middle-aged and older patients. Therefore, those features might be used to build the predictive model for predicting long-term functional outcomes in this clinical population, which could support physicians in clinical practice.

In medicine, machine learning (ML) is an effective approach for making diagnoses and predicting prognoses; ML models exhibit satisfactory performance on clinical data sets, which are generally highly dimensional and imbalanced (12). An ML model was successfully used to predict knee pain in middle-aged and older individuals by using demographic, body measurement, and blood test data; the sensitivity and specificity of the prediction were 0.72 and 0.71, respectively (13). Furthermore, the CatBoost algorithm was used to identify depression in this population (sensitivity: 0.71; specificity: 0.89) (14). A study using an ML model demonstrated that physiological biomarkers are associated with mortality, extremity mobility, and ADL in middle-aged and older individuals (15). In addition, previous studies also evaluated the predictions of ML models on poststroke functional recovery, supporting rehabilitation practice (16, 17). These findings suggest that ML explores complicated patterns in clinical data to predict functional outcomes in middle-aged and older individuals in different clinical conditions. However, to the best of our knowledge, no predictive ML model has been constructed to predict the risk of long-term functional dependence in this population on the basis of preexisting health conditions. Therefore, by adopting an ML approach, we investigated the prediction ability of ML models for the risk of functional dependence in middle-aged and older patients 6 months after injury on the basis of their demographic characteristics and preexisting health conditions. We hypothesized that baseline features can be used to classify patients with and without functional dependency 6 months after injury and to predict functional prognosis in clinical practice.

2. Patients and methods

2.1. Study design

The protocol of this prospective observational study was approved by the Institution Research Board of Taipei Medical University, Taiwan (approval number: N202002099). All participants provided informed consent. This study was conducted and reported following the STROBE checklist. The funders had no roles in analyzing the data, interpreting the data, or drawing study conclusions.

2.2. Participants and data sets

We included 670 middle-aged and older patients with primary injury who were admitted to the Emergency Department of Wan Fang Hospital, Taipei Medical University, Taiwan, between August 2020 and March 2022. The inclusion criteria were an age of ≥45 years and the ability to provide informed consent. After the completion of treatment, the researchers contacted eligible patients and explained the study to them. After informed consent was obtained, the patients were interviewed and under physical examination. Data regarding the patients’ sociodemographic characteristics, preexisting diseases, and baseline clinical characteristics were collected. For patients with severe injury, the interviews and assessments were conducted after their conditions had stabilized. Clinical events during hospitalization (e.g., intensive care unit [ICU] admission) were also recorded. After 6 months, a follow-up assessment was performed through telephonic interviews. All assessments were performed by experienced physicians and nurses. After excluding 9 patients who died in the hospital, 134 patients who were lost to follow-up, and 26 patients who died during follow-up, 527 patients were included in this study (Supplementary Figure S1).

Clinical characteristics were assessed using a standardized format. The BI, a 10-item scale that assesses ADL, was used to evaluate the patients’ baseline (preinjury) levels of functional dependence (18). The total BI score ranges from 0 to 100, and a BI score of ≤60 indicates severe or complete functional dependence (19). The Clinical Frailty Scale (CFS), a 9-point scale that assesses physical frailty and cognitive impairment, was used to evaluate overall frailty (20). The CFS score from one to three indicates that an individual is healthy or has well-controlled medical problems, whereas the CFS score from four to nine indicates that an individual has “very mild frailty” to “terminally ill” (20). The Charlson comorbidity index (CCI), a 17-item tool with high validity and reliability, was used to assess the risk of 1-year mortality in the patients (21). The death rate is smaller than 0.5% when the CCI score is equal to zero, whereas this rate is approximately 20–25% when the CCI score is higher than six (21). Furthermore, the revised trauma score (RTS) was used to assess functional outcomes after injury (22, 23). The injury severity score, which is used to assess the severity of injury in six body systems, was calculated for the patients upon admission to the emergency department (ED-ISS), during hospitalization (HOSP-ISS), and before discharge (ISS) (24). The RTS score smaller than seven might suggest the high rates of survival and the low rate of complication in an individual (22, 23).

2.3. Independent input and output features

We used the patients’ baseline health conditions to predict their functional recovery 6 months after injury. The initial input features were sociodemographic characteristics (e.g., age, sex, marital status, employment status, education level, and body mass index), causes of trauma, preexisting diseases (e.g., diabetes, hypertension, heart failure, chronic kidney disease, liver diseases, chronic obstructive pulmonary disorder, stroke, anemia, hip fracture, Parkinson’s disease, and dementia), clinical events (e.g., ICU admission and hospital rehabilitation), and baseline assessment scores (e.g., BI score, CCI score, CFS score, RTS, ED_ISS, HOSP_ISS, and ISS). The outcome feature was functional status determined using the patients’ BI scores calculated 6 months after injury; on the basis of their BI scores, the patients were categorized into two groups: functionally independent (BI >60) and functionally dependent (BI ≤60) groups (16, 25).

2.4. Descriptive statistical analysis

Data are presented as mean ± standard deviation values for continuous variables and number and percentage values for categorical variables. The functionally independent and dependent groups were compared in terms of demographic and clinical characteristics by using the independent samples t test (for continuous variables) or chi-square test (for categorical variables). The BI scores and the number of patients with functional dependency were compared between baseline and 6 months after injury by using paired t and McNemar tests, respectively. In addition, the validation and independent data sets were compared in terms of patient characteristics. A p value of <0.05 indicated statistical significance. R (version 4.1.2; R Foundation for Statistical Computing, Vienna, Austria) and Jeffrey’s Amazing Statistics Program (version 0.16.3; The JASP Team, 2020) were used for statistical analyses.

2.5. ML process

Figure 1 illustrates the ML process, which included data preprocessing, feature selection, model construction, model validation and testing, and model interpretation. All process steps were performed using Python 3.7 with Scikit-learn 1.1.3 (26) and DESlib library 0.3.5 (27).

FIGURE 1

Figure 1. Machine learning process. Data obtained from the patients were first checked for missing values and outliners and then divided into training–validation data set (for model construction) and independent test (for model evaluation) data sets. The permutation feature importance method was used for feature selection. Because our data were imbalanced, random oversampling was applied on each training fold during cross-validation. Six algorithms, including support vector machine (SVM), logistic regression (LR), k-nearest neighbor (KNN), Gaussian Naive Bayes (GNB), linear discrimination analysis (LDA), and decision tree (DT), were validated through five-fold cross-validation. The algorithms exhibiting satisfactory performance (LR, GNB, and LDA) were subjected to bagging; afterward, they were used as the pool of classifiers for ensemble models (i.e., stacking, voting, and dynamic ensemble selection [DES] models). The ensemble models were compared in terms of performance, and the best model (k-nearest oracle elimination [KNORA-E]) was evaluated on the independent test data set. Partial dependence (PD) and individual conditional expectation (ICE) plots were constructed to investigate the interpretability of the model.

2.5.1. Data preprocessing

The data set was first analyzed for missing values and outliers and then randomly divided into training–validation and test data sets by the ratio 70:30 that is commonly used in ML (28, 29). The training–validation data set (n = 368) was used to train and construct ML models, whereas the independent test data set (n = 159) was used for evaluate the constructed models.

2.5.2. Feature selection

The imputation feature importance method was used to select important features and exclude noise. The logistic regression (LR) algorithm was used as an estimator with 1,000 repeats. The features with negative or 0 important scores were eliminated. The remaining features were used to construct the ML model.

2.5.3. Model construction

We first evaluated the classification performance of individual algorithms. The following six algorithms were selected: support vector machine (SVM), LR, k-nearest neighbor (KNN), Gaussian Naive Bayes (GNB), linear discrimination analysis (LDA), and decision tree (DT); these algorithms are commonly used for classification based on highly dimensional clinical data (13, 15, 16). Hyperparameter optimization was performed using GridSearchCV (Supplementary Table S1). The algorithms exhibiting poor performance were excluded.

For classification, an ensemble of classifiers is generally considered to be superior to a single classifier and could reduce the risk of overfitting (30). Therefore, the algorithms exhibiting satisfactory performance were subjected to bagging (a number of estimators were searched using GridSearchCV); then, these were used as base learners to construct stacking, voting, and dynamic ensemble selection (DES) models, including k-nearest oracle union (KNORA-U), k-nearest oracle elimination (KNORA-E), DES performance (DES-P), and meta learning for DES (META-DES).

2.5.4. Cross-validation and internal validation on the test data set

The models were trained on the training–validation data set through stratified five-fold cross-validation, repeated twenty times. Because our data were imbalanced, random oversampling was separately applied on each training fold (but not the testing folds) during cross-validation. The DES and single-algorithm models were compared in terms of performance. The best model was evaluated on the independent test data set for assessing its performance on unseen data. In the cross-validation process, we estimated a 95% confidence interval for each performance indicator of all algorithms, defined by a mean score ± 1.96*validated standard error. When testing the final model on the independent test data set, the 95% CIs for performance indicators were calculated by conducting the bootstrap method (1,000 repeats) on the independent test data set.

2.5.5. Performance matrix

The performance matrix included accuracy, sensitivity, specificity, F1 score, and area under the receiver operating characteristic curve (ROC-AUC). Because our aim was to investigate the prediction ability of the models for the risk of functional dependence, minimizing false negative rates was important. Therefore, we preferred sensitivity over specificity for model evaluation. In addition, because our data sets were imbalanced, a dummy classifier (uniform strategy) was used to construct a no-skill model, whose performance matrix was used as the baseline cutoff score for comparisons.

2.5.6. Model interpretability analysis

Regarding the intuitive interpretation of the models, partial dependence (PD) and individual conditional expectation (ICE) plots were constructed to compare the effects of features on the outcomes predicted by the ML models with clinical tendencies. Given that the assessment scores indicated the effects of preexisting diseases on the model outcome, we constructed PD and ICE plots only for the assessment scores to evaluate the effects of changes in these scores on the risk of functional dependence 6 months after injury.

3. Results

3.1. Patient characteristics

Tables 1, 2 summarize the patient’s sociodemographic characteristics, injury severity levels, and functional outcomes. The most common cause of trauma was falling (69.1%; Table 1). Of the patients, >50% had at least one chronic disease. As shown in Table 2, the most common comorbidities were hypertension (53.7%), diabetes (29.2%), and heart failure (24.7%). The proportion of patients with functional dependency significantly increased 6 months after injury (from 5.6 to 11.4% in total; p < 0.01; Table 2). The patients’ demographic and clinical characteristics did not vary significantly between the training–validation and independent test data sets (p > 0.05; Tables 1, 2), which indicates that the independent test data set can be used for validation.

TABLE 1

Table 1. Sociodemographic characteristics of the patients and causes of trauma.

TABLE 2

Table 2. Clinical characteristics of the patients.

3.2. Selection of informative features for classification

To reduce noise and select the most efficient features for predicting the risk of functional dependence 6 months after injury, feature selection was performed using the permutation feature importance method. Of the total 27 features, 8 had 0 or negative important scores and thus were removed (Figure 2). The remaining 19 features were used as input features to construct the classification models.

FIGURE 2

Figure 2. Permutation feature importance selection. The figure presents important scores of the features assessed using the permutation importance ranking method (1,000 repeats). CKD, chronic kidney disease; COPD, chronic obstructive pulmonary disorder; HOSP_ISS, injury severity score during hospitalization, ED_ISS, injury severity score upon admission to the emergency department; CCI score, Charlson comorbidity index score; RTS, revised traumatic score; EDU, education level; ISS, injury severity score before discharge; DM, diabetes mellitus; LIVER_DIS, liver diseases; HF, heart failure; HT, hypertension; HIP_FX, hip bone fracture; CFS_BASE, baseline score on the Clinical Frailty Scale; and BI_BASE, baseline score on the Barthel Index.

3.3. Performance of single-algorithm models for predicting the risk of functional dependence

Six single-algorithm classification models were constructed through hyperparameter optimization. The LR, LDA, and GNB models exhibited balanced performance. LR exhibited the highest sensitivity (0.789, 95% CI: 0.761–0.817) and specificity (0.789, 95% CI: 0.778–0.799), followed by GNB (sensitivity: 0.719, 95% CI: 0.690–0.745; specificity: 0.761, 95% CI: 0.739–0.783) and LDA (sensitivity: 0.687, 95% CI: 0.656–0.718; specificity: 0.777, 95% CI: 0.766–0.787). The F1 scores of the LR, LDA, and GNB models were higher than the baseline F1 score of the no-skill model, i.e., 0.462 (95% CI: 0.446–0.478), 0.416 (95% CI: 0.397–0.436), and 0.402 (95% CI: 0.386–0.419) vs. 0.181 (95% CI: 0.168–0.193), respectively. By contrast, the SVM, KNN, and DT models exhibited imbalanced performance with high specificity but low sensitivity (Table 3).

TABLE 3

Table 3. Performance of single classifiers on the training–validation data set.

3.4. Performance of ensemble models for predicting the risk of functional dependence

To investigate whether the ensemble models could improve the prediction ability of the input features, stacking, voting, and DES models were constructed through the bagging of LR, LDA, and GNB as the pool of classifiers (Table 4). The KNORA-E model outperformed the other ensemble models (sensitivity: 0.732, 95% CI: 0.702–0.761; specificity: 0.813, 95% CI: 0.805–0.822; Table 4; Figure 3A). Furthermore, the F1 score of this model (0.460, 95% CI: 0.444–0.477) was higher than the baseline F1 score of the no-skill model 0.181 (95% CI: 0.168–0.193) and the F1 scores of the other models.

TABLE 4

Table 4. Performance of heterogeneous assemble models on the training–validation data set.

FIGURE 3

Figure 3. Area under the receiver operating characteristic curve (ROC-AUC) of the k-nearest oracle elimination (KNORA-E) model. (A) ROC-AUC of the KNORA-E model evaluated using the cross-validation data set. (B) ROC-AUC of the KNORA-E model evaluated using the test data set.

3.5. Validation on the independent test data set

To investigate the performance of the KNORA-E model on unseen data, the model was applied to the independent test data set. The performance matrix on the test data was similar to the performance on the train-validation data set, with accuracy, sensitivity, specificity, F1 score, and ROC-AUC values of 0.850 (95% CI: 0.792–0.899), 0.779 (95% CI: 0.559–0.950), 0.859 (95% CI: 0.799–0.912), 0.534 (95% CI: 0.359–0.680), and 0.886 (95% CI: 0.768–0.976), respectively (Figure 3B).

3.6. Model interpretability

PD and ICE plots were constructed for continuous predictive features (e.g., BI_BASE score, CFS_BASE score, ISS, and RTS) to investigate the effects of changes in the scores on the risk of functional dependence. These plots showed a consistent trend. The effects of predictive features on the model-predicted outcomes were consistent with the practical tendency. The risk of functional dependence decreased with increasing baseline BI scores and RTS, particularly when the scores were 40–60 and > 7, respectively. By contrast, this risk increased with increasing baseline CFS scores (nearly linear increase) and ISS scores, particularly when the score was >4 and > 20, respectively (Figure 4).

FIGURE 4

Figure 4. Partial dependence (PD) plots (blue lines) with individual conditional expectation (ICE) plots (grey lines). The plots indicate the overall (for PD) and individual (for ICE) effects of the features, including the baseline Barthel Index (BI_BASE) score, baseline Clinical Frailty Scale (CFS_BASE) score, revised traumatic score (RTS), and injury severity score (ISS).

4. Discussion

Pathophysiological changes may worsen posttraumatic functional outcomes in middle-aged and older individuals (3–7). The pattern of functional recovery in these patients may vary from that in younger patients (2). Developing a model to predict long-term functional dependency in middle-aged and older individuals may facilitate treatment and rehabilitation, thus reducing the risk of functional disability. In the present study, using the DES models including selected features, we found that baseline sociodemographic characteristics, functional assessment scores, and preexisting diseases successfully predicted the risk of functional dependence in the study population 6 months after injury. The assessment scores may be non-linearly correlated with functional outcomes, and the correlation may be stronger in some score ranges. Our findings suggest that preexisting health conditions considerably affect functional recovery in injured middle-aged and older individuals; furthermore, ML models constructed using these input features can be used to predict prognosis in this population.

High dimensionality and imbalance are the characteristics of real-life clinical data; these characteristics complicate the analysis of data using ML models and reduce the applicability of these models (12). In the present study, the selection of features through the permutation importance method helped remove noise from the input features and improve model performance. Using the remaining features, several single-algorithm models, such as the LR, LDA, and GNB models, were constructed; these models predicted functional outcomes with acceptable performance. Although these algorithm exhibited acceptable classification performance on highly dimensional data after appropriate feature selection, they may exhibit overfitting in several circumstances, particularly in the case of data imbalance (31). Thus, we constructed heterogeneous ensemble models; these models may have a reduced overfitting risk and improved prediction ability (30). In the present study, the KNORA-E model exhibited the highest, balanced performance on both the training–validation and independent test data sets, which indicated that the levels of bias and variance were low for this model. Because of the nature of ML on extremely imbalanced data, the different gap between sensitivity and specificity in the final model in our study might be acceptable, which was similar to a previous study conducted an on extremely imbalanced cancer dataset (32). This finding corroborates that DES models exhibit improved classification performance on imbalanced data (33). Thus, DES modeling with feature selection may be an effective ML approach for prognosis using clinical data.

In this study, baseline BI and CFS scores (indicating functional level and frailty, respectively) were found to be the best features for predicting the risk of functional dependence in middle-aged and older patients 6 months after injury. The other assessment scores used in this study, such as RTS and ISS, also contributed to the model-based prediction of functional recovery. The BI is a common scale used for evaluating functional independence in patients with various health conditions; this scale has high validity and reliability (19). The CFS score indicates the requirement of additional health care support for injured patients (34). However, the BI and CFS scores are influenced by comorbidities, age, and other sociodemographic factors (18, 35). The use of only assessment scores may not be sufficient for effectively predicting long-term functional outcomes in injured patients; therefore, additional features, such as sociodemographic characteristics and preexisting health conditions, must be included in the models.

Several preexisting diseases, including anemia, hip bone fracture, hypertension, heart failure, stroke, liver disease, and diabetes, helped predict functional recovery in older patients 6 months after injury. Hypertension, stroke, and diabetes markedly reduced the function and quality of life in patients with those diseases, particularly middle-aged and older patients (36). Anemia reduces physical and cognitive functions, thus worsening functional outcomes in middle-aged and older individuals (37, 38). Hip fractures are common in this population; this reduces their mobility and quality of life (39–41). In a relevant study, approximately 40% of all patients with heart failure experienced moderate to severe difficulties in performing ADL; these challenges were associated with mortality and hospitalization (42). Liver diseases may alter the structure and function of the brain and heart, thus worsening patients’ functional outcomes (43, 44). In summary, preexisting diseases may worsen functional outcomes in injured middle-aged and older individuals and may help predict the risk of long-term functional dependence in this population.

Using PD and ICE plots, we found that several assessment scores nonlinearly affected the risk of functional dependence. Higher baseline BI (particularly >40) scores exerted stronger effects on functional outcomes, which is consistent with Sinoff’s interpretation that older individuals with BI scores of <40 may exhibit severe or complete functional dependence (45). In patients who experienced an acute stroke, those with BI scores of ≥40 exhibited considerable improvements in their ADL compared with those with BI scores of <40 (46). In the present study, CFS scores of >4 markedly increased the risk of functional dependence. According to the CFS guideline, patients with CFS scores of 4 exhibit limited performance of activities, and those with CFS scores of >5 require assistance in performing ADL (47). Prolonged hospitalization in acute medicine units has been reported in patients with CFS scores of >4 (48). In the present study, RTSs of >7 strongly reduced the risk of functional dependence; this is consistent with the findings of other studies indicating that an RTS of 7 serves as the cutoff value for predicting mortality and complication development in injured patients (22, 23). Taken together, the findings support the interpretability of our ML models and their feasibility in clinical practice.

This study has some limitations. First, our data sets were imbalanced; although we validated our models using the independent test data set, the patient sample was regional and may not represent the general population. Thus, external validation is necessary to evaluate the generalizability of the model. Second, we assessed functional outcomes only at baseline and the 6-month follow-up; thus, time-related changes in functional outcomes, which may differ across preexisting health conditions, could not be recorded. Finally, our model did not include preclinical data; thus, the predictive value of preclinical features for long-term functional outcomes could not be estimated.

In conclusion, our model constructed through feature selection and DES modeling exhibited high performance for predicting the risk of functional dependence in injured middle-aged and older patients on the basis of their sociodemographic characteristics and preexisting health conditions. The model showed practical interpretability. This study may facilitate further large-scale studies on the prediction ability of baseline information and its application for the prediction of long-term functional prognosis in injured patients.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving human participants were reviewed and approved by Taipei Medical University Joint Institution Research Board. The patients/participants provided their written informed consent to participate in this study.

Author contributions

J-HK and CL: study design, manuscript revising, and editing. C-CW and CL: patient enrollment and data collection. NTN, J-HK, T-SY, C-CW, C-YT, KP, and CL: data analysis and interpretation. NTN, J-HK, and CL: manuscript drafting. All authors contributed to the article and approved the submitted version.

Funding

This research was jointly supported by grants from the National Science and Technology Council (Grant number: MOST 109-2314-B-038-079), Wan Fang Hospital, Taipei Medical University (Grant number: 110-wf-f-5), National Taipei University of Technology and Wan Fang Hospital, Taipei Medical University Joint Research Program (Grant number: 112-wf-ntut-05), and Injury Prevention and Disaster Medicine Research Foundation. The funders had no role in the design of the study, collection or analysis of data, decision to publish, or preparation of the manuscript.

Acknowledgments

The authors gratefully acknowledge support from the Department of Physical Medicine and Rehabilitation, Taipei Medical University Hospital, and Emergency Department, Wan Fang Hospital, Taipei Medical University, Taiwan. This manuscript was edited by Wallace Academic Editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2023.1164820/full#supplementary-material

References

1. Jiang, L, Zheng, Z, and Zhang, M. The incidence of geriatric trauma is increasing and comparison of different scoring tools for the prediction of in-hospital mortality in geriatric trauma patients. World J Emerg Surg. (2020) 15:59. doi: 10.1186/s13017-020-00340-1

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Gale, SC, Peters, J, Murry, JS, Crystal, JS, and Dombrovskiy, VY. Injury patterns and outcomes in late middle age (55-65): the intersecting comorbidity with high-risk activity - a retrospective cohort study. Ann Med Surg (Lond). (2018) 27:22–5. doi: 10.1016/j.amsu.2018.01.005

PubMed Abstract | CrossRef Full Text | Google Scholar

3. McGrath, R, Al Snih, S, Markides, K, Hall, O, and Peterson, M. The burden of health conditions for middle-aged and older adults in the United States: disability-adjusted life years. BMC Geriatr. (2019) 19:100. doi: 10.1186/s12877-019-1110-6

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Fan, ZY, Yang, Y, Zhang, CH, Yin, RY, Tang, L, and Zhang, F. Prevalence and patterns of comorbidity among middle-aged and elderly people in China: a cross-sectional study based on CHARLS data. Int J Gen Med. (2021) 14:1449–55. doi: 10.2147/IJGM.S309783

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Low, LL, Kwan, YH, Ko, MSM, Yeam, CT, Lee, VSY, Tan, WB, et al. Epidemiologic characteristics of multimorbidity and sociodemographic factors associated with multimorbidity in a rapidly aging Asian country. JAMA Netw Open. (2019) 2:e1915245. doi: 10.1001/jamanetworkopen.2019.15245

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Bliemel, C, Buecking, B, Oberkircher, L, Knobe, M, Ruchholtz, S, and Eschbach, D. The impact of pre-existing conditions on functional outcome and mortality in geriatric hip fracture patients. Int Orthop. (2017) 41:1995–2000. doi: 10.1007/s00264-017-3591-2

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Benoit, E, Stephen, AH, Monaghan, SF, Lueckel, SN, and Adams, CA Jr. Geriatric Trauma. Hode Island Med J. (2019) 102:19–22.

Google Scholar

8. Li, A, Wang, D, Lin, S, Chu, M, Huang, S, Lee, CY, et al. Depression and life satisfaction among middle-aged and older adults: mediation effect of functional disability. Front Psychol. (2021) 12:755220. doi: 10.3389/fpsyg.2021.755220

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Madni, TD, Ekeh, AP, Brakenridge, SC, Brasel, KJ, Joseph, B, Inaba, K, et al. A comparison of prognosis calculators for geriatric trauma: a prognostic assessment of life and limitations after trauma in the elderly consortium study. J Trauma Acute Care Surg. (2017) 83:90–6. doi: 10.1097/TA.0000000000001506

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Sokas, C, Herrera-Escobar, JP, Klepp, T, Stanek, E, Kaafarani, H, Salim, A, et al. Impact of chronic illness on functional outcomes and quality of life among injured older adults. Injury. (2021) 52:2638–44. doi: 10.1016/j.injury.2021.03.052

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Earl-Royal, E, Kaufman, EJ, Hsu, JY, Wiebe, DJ, Reilly, PM, and Holena, DN. Age and preexisting conditions as risk factors for severe adverse events and failure to rescue after injury. J Surg Res. (2016) 205:368–77. doi: 10.1016/j.jss.2016.06.082

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Zhou, PY, and Wong, AKC. Explanation and prediction of clinical data with imbalanced class distribution based on pattern discovery and disentanglement. BMC Med Inform Decis Mak. (2021) 21:16. doi: 10.1186/s12911-020-01356-y

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Liu, L, Zhu, MM, Cai, LL, and Zhang, X. Predictive models for knee pain in middle-aged and elderly individuals based on machine learning methods. Comput Math Methods Med. (2022) 2022:1–7. doi: 10.1155/2022/5005195

CrossRef Full Text | Google Scholar

14. Zhang, C, Chen, X, Wang, S, Hu, J, Wang, C, and Liu, X. Using CatBoost algorithm to identify middle-aged and elderly depression, national health and nutrition examination survey 2011-2018. Psychiatry Res. (2021) 306:114261. doi: 10.1016/j.psychres.2021.114261

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Cao, X, Yang, G, Jin, X, He, L, Li, X, Zheng, Z, et al. A machine learning-based aging measure among middle-aged and older Chinese adults: the China health and retirement longitudinal study. Front Med (Lausanne). (2021) 8:698851. doi: 10.3389/fmed.2021.698851

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Chang, SC, Chu, CL, Chen, CK, Chang, HN, Wong, AMK, Chen, YP, et al. The comparison and interpretation of machine-learning models in post-stroke functional outcome prediction. Diagnostics (Basel). (2021) 11:1784. doi: 10.3390/diagnostics11101784

CrossRef Full Text | Google Scholar

17. Lin, WY, Chen, CH, Tseng, YJ, Tsai, YT, Chang, CY, Wang, HY, et al. Predicting post-stroke activities of daily living through a machine learning-based approach on initiating rehabilitation. Int J Med Inform. (2018) 111:159–64. doi: 10.1016/j.ijmedinf.2018.01.002

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Pan, H, Zhao, Y, Wang, H, Li, X, Leung, E, Chen, F, et al. Influencing factors of Barthel index scores among the community-dwelling elderly in Hong Kong: a random intercept model. BMC Geriatr. (2021) 21:484. doi: 10.1186/s12877-021-02422-4

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Shah, S, Vanclay, F, and Cooper, B. Improving the sensitivity of the Barthel index for stroke rehabilitation. J Clin Epidemiol. (1989) 42:703–9. doi: 10.1016/0895-4356(89)90065-6

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Chou, YC, Tsou, HH, Chan, DD, Wen, CJ, Lu, FP, Lin, KP, et al. Validation of clinical frailty scale in Chinese translation. BMC Geriatr. (2022) 22:604. doi: 10.1186/s12877-022-03287-x

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Sundararajan, V, Henderson, T, Perry, C, Muggivan, A, Quan, H, and Ghali, WA. New ICD-10 version of the Charlson comorbidity index predicted in-hospital mortality. J Clin Epidemiol. (2004) 57:1288–94. doi: 10.1016/j.jclinepi.2004.03.012

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Tranca, S, Petrisor, C, Hagau, N, and Ciuce, C. Can APACHE II, SOFA, ISS, and RTS severity scores be used to predict septic complications in multiple trauma patients? J Crit Care Med (Targu Mures). (2016) 2:124–30. doi: 10.1515/jccm-2016-0019

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Alvarez, BD, Razente, DM, Lacerda, DAM, Lother, NS, von-Bahten, LC, and Stahlschmidt, CMM. Analysis of the revised trauma score (RTS) in 200 victims of different trauma mechanisms. Rev Col Bras Cir. (2016) 43:334–40. doi: 10.1590/0100-69912016005010

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Palmer, C. Major trauma and the injury severity score - where should we set the Bar? Annu Proc Assoc Adv Automot Med. (2007) 51:13–29.

PubMed Abstract | Google Scholar

25. Zhang, M, Guo, M, Wang, Z, Liu, H, Bai, X, Cui, S, et al. Predictive model for early functional outcomes following acute care after traumatic brain injuries: a machine learning-based development and validation study. Injury. (2023) 54:896–903. doi: 10.1016/j.injury.2023.01.004

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Pedregosa, F, Varoquaux, G, Gramfort, A, Michel, V, Thirion, B, Grisel, O, et al. Scikit-learn: machine learning in Python. JMLR. (2011) 12:2825–30.

Google Scholar

27. Cruz, RMO, Hafemann, LG, Sabourin, R, and GDC, C. DESlib: A dynamic ensemble selection library in Python. arXiv. preprint. (2018). doi: 10.48550/arXiv.1802.04967

CrossRef Full Text | Google Scholar

28. Hadanny, A, Shouval, R, Wu, J, Gale, CP, Unger, R, Zahger, D, et al. Machine learning-based prediction of 1-year mortality for acute coronary syndrome(✰). J Cardiol. (2022) 79:342–51. doi: 10.1016/j.jjcc.2021.11.006

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Chen, SD, You, J, Yang, XM, Gu, HQ, Huang, XY, Liu, H, et al. Machine learning is an effective method to predict the 90-day prognosis of patients with transient ischemic attack and minor stroke. BMC Med Res Methodol. (2022) 22:195. doi: 10.1186/s12874-022-01672-z

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Ea, A, Al-hassan, M, and Aloqaily, A. Effective heterogeneous ensemble classification: an alternative approach for selecting base classifiers. ICT Express. (2021) 7:342–9. doi: 10.1016/j.icte.2020.11.005

CrossRef Full Text | Google Scholar

31. Dong, X, Yu, Z, Cao, W, Shi, Y, and Ma, Q. A survey on ensemble learning. Front Comp Sci. (2019) 14:241–58. doi: 10.1007/s11704-019-8208-z

CrossRef Full Text | Google Scholar

32. Zhang, J, Chen, L, and Abid, F. Prediction of breast Cancer from imbalance respect using cluster-based Undersampling method. J Healthc Eng. (2019) 2019:1–10. doi: 10.1155/2019/7294582

CrossRef Full Text | Google Scholar

33. Zhao, D, Wang, X, Mu, Y, and Wang, L. Experimental study and comparison of imbalance ensemble classifiers with dynamic selection strategy. Entropy (Basel). (2021) 23:822. doi: 10.3390/e23070822

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Thompson, A, Gida, S, Nassif, Y, Hope, C, and Brooks, A. The impact of frailty on trauma outcomes using the clinical frailty scale. Eur J Trauma Emerg Surg. (2022) 48:1271–6. doi: 10.1007/s00068-021-01627-x

CrossRef Full Text | Google Scholar

35. Church, S, Rogers, E, Rockwood, K, and Theou, O. A scoping review of the clinical frailty scale. BMC Geriatr. (2020) 20:393. doi: 10.1186/s12877-020-01801-7

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Prince, MJ, Wu, F, Guo, Y, Gutierrez Robledo, LM, O'Donnell, M, Sullivan, R, et al. The burden of disease in older people and implications for health policy and practice. Lancet. (2015) 385:549–62. doi: 10.1016/S0140-6736(14)61347-7

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Qin, T, Yan, M, Fu, Z, Song, Y, Lu, W, Fu, A, et al. Association between anemia and cognitive decline among Chinese middle-aged and elderly: evidence from the China health and retirement longitudinal study. BMC Geriatr. (2019) 19:305. doi: 10.1186/s12877-019-1308-7

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Patel, KV, and Guralnik, JM. Prognostic implications of anemia in older adults. Haematologica. (2009) 94:1–2. doi: 10.3324/haematol.2008.001289

PubMed Abstract | CrossRef Full Text | Google Scholar

39. de Joode, S, Kalmet, PHS, Fiddelers, AAA, Poeze, M, and Blokhuis, TJ. Long-term functional outcome after a low-energy hip fracture in elderly patients. J Orthop Traumatol. (2019) 20:20. doi: 10.1186/s10195-019-0529-z

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Long, H, Cao, R, Zhang, H, Qiu, Y, Yin, H, Yu, H, et al. Incidence of hip fracture among middle-aged and older Chinese from 2013 to 2015: results from a nationally representative study. Arch Osteoporos. (2022) 17:48. doi: 10.1007/s11657-022-01082-0

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Ray, RI, Aitken, SA, McQueen, MM, Court-Brown, CM, and Ralston, SH. Predictors of poor clinical outcome following hip fracture in middle aged-patients. Injury. (2015) 46:709–12. doi: 10.1016/j.injury.2014.11.005

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Dunlay, SM, Manemann, SM, Chamberlain, AM, Cheville, AL, Jiang, R, Weston, SA, et al. Activities of daily living and outcomes in heart failure. Circ Heart Fail. (2015) 8:261–7. doi: 10.1161/CIRCHEARTFAILURE.114.001542

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Weinstein, G, Zelber-Sagi, S, Preis, SR, Beiser, AS, DeCarli, C, Speliotes, EK, et al. Association of Nonalcoholic Fatty Liver Disease with Lower Brain Volume in healthy middle-aged adults in the Framingham study. JAMA Neurol. (2018) 75:97–104. doi: 10.1001/jamaneurol.2017.3229

PubMed Abstract | CrossRef Full Text | Google Scholar

44. VanWagner, LB, Wilcox, JE, Ning, H, Lewis, CE, Carr, JJ, Rinella, ME, et al. Longitudinal Association of non-Alcoholic Fatty Liver Disease with Changes in myocardial structure and function: the CARDIA study. J Am Heart Assoc. (2020) 9:e014279. doi: 10.1161/JAHA.119.014279

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Sinoff, G, and Ore, L. The Barthel activities of daily living index: self-reporting versus actual performance in the old-old (> or = 75 years). J Am Geriatr Soc. (1997) 45:832–6. doi: 10.1111/j.1532-5415.1997.tb01510.x

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Nakao, S, Takata, S, Uemura, H, Kashihara, M, Osawa, T, Komatsu, K, et al. Relationship between Barthel index scores during the acute phase of rehabilitation and subsequent ADL in stroke patients. J Med Investig. (2010) 57:81–8. doi: 10.2152/jmi.57.81

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Rockwood, K, Song, X, MacKnight, C, Bergman, H, Hogan, DB, McDowell, I, et al. A global clinical measure of fitness and frailty in elderly people. CMAJ. (2005) 173:489–95. doi: 10.1503/cmaj.050051

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Juma, S, Taabazuing, MM, and Montero-Odasso, M. Clinical frailty scale in an acute medicine unit: a simple tool that predicts length of stay. Can Geriatr J. (2016) 19:34–9. doi: 10.5770/cgj.19.196

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: dynamic ensemble selection, machine learning, middle-aged patient, older patient, traumatic injury

Citation: Nhu NT, Kang J-H, Yeh T-S, Wu C-C, Tsai C-Y, Piravej K and Lam C (2023) Prediction of posttraumatic functional recovery in middle-aged and older patients through dynamic ensemble selection modeling. Front. Public Health. 11:1164820. doi: 10.3389/fpubh.2023.1164820

Received: 13 February 2023; Accepted: 17 May 2023;
Published: 20 June 2023.

Edited by:

Madhan Balasubramanian, Flinders University, Australia

Reviewed by:

Hsin-Yao Wang, Linkou Chang Gung Memorial Hospital, Taiwan
Luis Rafael Moscote-Salazar, Colombian Clinical Research Group in Neurocritical Care, Colombia
Zhichao Hao, Southwest University, China

Copyright © 2023 Nhu, Kang, Yeh, Wu, Tsai, Piravej and Lam. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Carlos Lam, bHNrQHcudG11LmVkdS50dw==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.