Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 17 December 2024
Sec. Aging and Public Health
This article is part of the Research Topic Diagnosing and Treating Frailty and Sarcopenia in Middle-aged and Older Adults View all 3 articles

Predicting grip strength-related frailty in middle-aged and older Chinese adults using interpretable machine learning models: a prospective cohort study

Lisheng Yu,Lisheng Yu1,2Shunshun CaoShunshun Cao3Botian SongBotian Song4Yangyang Hu
Yangyang Hu4*
  • 1Neurosurgery, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
  • 2Wenzhou Municipal Key Laboratory of Neurodevelopmental Pathology and Physiology, Wenzhou Medical University, Wenzhou, China
  • 3Pediatric Endocrinology, Genetics and Metabolism, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
  • 4Reproductive Medicine Center, Obstetrics and Gynecology, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, China

Introduction: Frailty is an emerging global health burden, and there is no consensus on the precise prediction of frailty. We aimed to explore the association between grip strength and frailty and interpret the optimal machine learning (ML) model using the SHapley Additive exPlanation (SHAP) to predict the risk of frailty.

Methods: Data for the study were extracted from the China Health and Retirement Longitudinal Study (CHARLS) database. Socio-demographic, medical history, anthropometric, psychological, and sleep parameters were analyzed in this study. We used the least absolute shrinkage and selection operator (LASSO) regression to filter the model for the best predictor variables and constructed six ML models for predicting frailty. The feature performance of six ML models was compared based on the area under the receiver operating characteristic curve (AUROC) and the light gradient boosting machine (LightGBM) model was selected as the best predictive frailty model. We used SHAP to interpret the LightGBM model and to reveal the decision-making process by which the model predicts frailty.

Results: A total of 10,834 eligible participants were included in the study. Using the lowest quartile of grip strength as a reference, grip strength was negatively associated with the risk of frailty when grip strength was >29.00 kg for males or >19.00 kg for females (p < 0.001). The LightGBM model predicted frailty with optimal performance with an AUROC of 0.768 (95% CI 0.741 ~ 0.795). The SHAP summary plot showed that all features predicted frailty in order of importance, with cognitive function being considered the most important predictive feature. The poorer the cognitive function, nighttime sleep duration, body mass index (BMI), and grip strength, the higher the risk of frailty in middle-aged and older adults. The SHAP individual force plot clearly shows that the LightGBM model predicts frailty in the individual decision-making process.

Conclusion: The grip strength-related LightGBM prediction model based on SHAP has high accuracy and robustness in predicting the risk of frailty. Increasing grip strength, cognitive function, nighttime sleep duration, and BMI reduce the risk of frailty and may provide strategies for individualized management of frailty.

1 Introduction

Frailty is a complex, multidimensional biopsychosocial syndrome associated with aging, encompassing the activity, physiological, cognitive, social, and psychological domains, and characterized by a decline in the physiological capacity of multiple organ systems leading to increased sensitivity to stressors (1). Epidemiologic surveys of people aged 50 years and older in 15 countries showed an average prevalence of frailty of 17.4% (ranging from 3.9 to 51.4%), with a higher prevalence of frailty among older persons in upper-middle-income countries than in high-income countries (2). A prospective study of people over 85 years of age found that frail patients had more than twice the risk of death as non-frail patients at 7 years, and that frailty led to a 54 to 101% increase in healthcare expenditures (3). Therefore, early recognition and treatment of frailty have important public health and clinical implications.

Currently, several well-established methods are available to assess frailty, each offering distinct strengths and limitations. The frailty phenotype model, introduced by Fried LP et al., characterizes frailty based on five physical criteria: unintentional weight loss, exhaustion, low physical activity, slow walking speed, and weak grip strength (4). This model is widely adopted due to its simplicity and emphasis on physical attributes. In contrast, the cumulative deficit model, developed by Rockwood et al., evaluates frailty by tallying health deficits across multiple domains—including physical, cognitive, and social factors—to calculate a frailty index (5). Additionally, tools such as the Tilburg Frailty Indicator (TFI) and the Edmonton Frail Scale (EFS) provide multidimensional assessments that encompass psychological and social domains (6).

Among these methods, grip strength has been extensively validated as a simple, objective, and reliable biomarker for assessing physical frailty. Its use is particularly advantageous in large-scale cohort studies due to its ease of measurement and established association with overall health status (7). Studies have shown grip strength to be generally consistent with the assessment of overall body strength, upper extremity function, fractures, bone density, falls, aging, malnutrition, cognitive dysfunction, depression, sleep disorders, diabetes mellitus, and co-morbidities, and to be associated with all-cause mortality and disease-specific mortality (8). A Mendelian randomization study showed that grip strength was positively and causally related to lumbar spine bone density at the best incidence of osteoporotic fracture, but not to heel forearm or femoral neck bone density (9). A review of the relationship between grip strength and diabetes mellitus by Hamasaki H et al. suggests that grip strength is a reliable indicator for identifying the risk of diabetes mellitus, cardiovascular disease, and mortality, as well as for assessing skeletal muscle mass and strength (10). In addition, Hadzibegovic S et al. concluded that grip strength can also be used to assess the strength and functional status of cancer patients, especially those with cancer cachexia, and is a widely used functional test (11).

Machine learning (ML) represents a powerful tool in predictive disease modeling, offering significant advantages over traditional statistical methods, especially in handling complex, high-dimensional datasets (12). There are many different algorithms for ML, each with different characteristics for analyzing data. Various ML methods have been widely applied in health research, including Support Vector Machine (SVM) known for its classification precision in high-dimensional spaces, Random Forest (RF) for its robustness in variable selection, and Gradient Boosting methods like Light Gradient Boosting Machine (LightGBM) and Extreme Gradient Boosting (XGBoost) for their superior performance in imbalanced datasets. Interpretable ML overcomes the shortcomings of the black-box nature of ML, which allows users to better understand the underlying logic of the predictive model, increases the credibility of the predictive model, and enables the timely detection of possible biases in the model, as well as the diagnosis and repair of the model (13). To the best of our knowledge, no studies have explored grip strength as a predictor of frailty in middle-aged and older Chinese using interpretable ML methods and explored the relationship between grip strength and frailty. To fill this gap, we hypothesized that grip strength could be used as an independent predictor of frailty in middle-aged and older Chinese adults. Based on the data from the China Health and Retirement Longitudinal Study (CHARLS), we tested whether grip strength is an independent predictor of frailty in a representative national population and confirmed their relationship.

The purpose of the study is to explore the relationship between grip strength and frailty in middle-aged and older Chinese adults using the CHARLS dataset and to select the optimal interpretable ML model to predict frailty in middle-aged and older adults. The SHapley Additive exPlanation (SHAP) tool was used to explain the optimal model for predicting frailty in middle-aged and older adults, to explore independent risk factors for frailty, and to provide individualized strategies for the management and prevention of frailty in middle-aged and older adults.

2 Materials and methods

2.1 Study design and population

Our study data were extracted from the CHARLS wave 2 (2013) dataset, which is openly available at http://charls.pku.edu.cn/. CHARLS is a nationally representative longitudinal research cohort in China that collects high-quality data on households and individuals of middle-aged and older adults aged 45 years and above to analyze population aging. As described by Liu Y et al. CHARLS began in 2011–2012 with a baseline survey in 28 provinces, 150 counties, 450 villages, and 10,257 households across the country. Respondents were assessed every 2 years with face-to-face personal interviews, sociodemographics, lifestyle, anthropometric measurements, and laboratory analysis (14).

Since the CHARLS study in wave 2 (2013) has more complete information on the variables, we chose this survey cycle as the data source for our study. A total of 18,605 people were collected in the initial survey, and 10,834 participants were eventually included in the study after gradual screening. The inclusion criteria were as follows: (1) the age ≥ 45 years; (2) completed grip strength tests in both hands; and (3) obtained informed consent. Exclusion criteria were (1) the age < 45 years; (2) missing age, gender, grip strength, and frailty data; and (3) the history of wrist or hand surgery or pain in the last 3 months. The screening process for study participants and the study design are shown in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Study participants’ selection and design flowchart.

2.2 Ethics statement

The study procedures followed the Declaration of Helsinki, and the CHARLS study was approved by the Ethics Committee of Peking University (Approval No. IRB00001052-11015), and all participants signed the written informed consent. We followed the CHARLS guidelines for data use for study analysis only. All participants’ protected personal information is anonymized during our use of CHARLS data.

2.3 Feature extraction

Based on previous studies (1517) and expert opinions (three independent geriatricians from the Second Affiliated Hospital of Wenzhou Medical University), we selected 31 variables including demographics, anthropometric measurements, lifestyles, medical histories, laboratory analyses, socioeconomic information, and psychiatric interview data (Supplementary material 1) as the original candidates for the prediction of frailty model. We excluded candidate variables with more than 15% missing values, including tobacco use and activities of daily living (ADL) score. To avoid overfitting and multicollinearity, we used the least absolute shrinkage and selection operator (LASSO) regression to screen candidate predictor variables for predicting the frailty model (18, 19). We used 10-fold cross-validation to confirm the appropriate tuning parameter (λ) of the LASSO regression analysis to screen the optimal predictor frailty model variables and the predictor variables with a p-value less than 0.05 were used as the final predictor frailty model variables (Figure 2). For this reason, we removed marital status, hypertension, diabetes, cancer, heart disease, stroke, dyslipidemia, liver disease, kidney disease, asthma, alcohol consumption, tobacco use, insurance, social activities, depression, ADL score, orientation, and pain. After variable screening, inclusion in the final predictive frailty model consisted of 13 features and 1 label, with 5 categorical and 8 continuous variables.

Figure 2
www.frontiersin.org

Figure 2. Screening for optimal model variables using the LASSO regression model. (A) To generate coefficient curves based on log(λ) sequences and to create nonzero coefficients by optimal λ. (B) The best λ in the LASSO model was selected by 10-fold cross-validation and using a minimum value criterion. The binomial deviation curves relative to the log(λ) are plotted and a virtual vertical line is drawn at the optimal value using one SE of the minimum criterion.

2.4 Definition of features and the label

Diagnostic components of frailty based on the description of Bu F et al. include fatigue, weakness, decreased physical activity, weight loss, and slow movement, and frailty is defined as meeting 3 or more of the 5 indicators (20). Whereas those who meet 1 or 2 indicators are pre-frail, those who meet 0 are non-frail (21). Nighttime sleep duration was obtained by asking the question “During the past month, how many hours of actual sleep did you get at night (average hours for one night)?.” Poor sleep quality was assessed based on “my sleep was restless” and was categorized into 4 groups. Chronic diseases (hypertension, diabetes, cancer, chronic lung disease, heart disease, stroke, mental disease, arthritis or rheumatism, dyslipidemia, liver disease, kidney disease, digestive disease, and asthma) and pain were defined by self-report. The Katz Activities of Daily Living (ADL) score was used to assess the ability to perform activities of daily living, which are included in the CHARLS questionnaire: eating, dressing, transferring, toileting, bathing, and continence (22). Cognitive functioning was assessed through visuospatial skills, memory, orientation, and attention (23). The specific methods were as follows: (1) visuospatial skills were assessed by redrawing two overlapping pentagons; (2) memory was measured by the average score of immediate and delayed recall of 10 Chinese words; (3) orientation and attention were measured by cognitive status telephone interview, which was calculated based on responses to questions about the year, month, day, week, and season. The sum of the above items is the total cognitive functioning score, and the score ranges from 0 to 21, the higher the score the better the cognitive functioning (24). Mental health status was assessed using the Center for Epidemiologic Studies Depression Scale (CES-D) as described by Bergenfeld I et al. with a total score of 30 and depression defined as 10 and above (25).

2.5 Handling of missing values

We observed that the presence of missing data for some variables is common in the CHARLS dataset, and that direct deletion of information about individuals with missing data can lead to wasted medical information and participant selection bias. To optimally preserve participant information and biological characteristics of the original dataset, we performed an algorithm based on the K Nearest Neighbors (KNN) for missing value interpolation using the DMwR2 (version0.0.2) R package for variables with less than 15% missing data. To validate the effectiveness of the KNN interpolation method, we compared it with the average interpolation and multiple interpolation methods on the missing dataset using the Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) as the evaluation metrics. The KNN interpolation method performs well in preserving data integrity and minimizing the prediction error.

2.6 Predictive modeling strategy

We randomly split the original dataset into 80:20 to better evaluate the performance and generalization ability of the frailty prediction model on unknown data, where 80% of the data is used as the model training set (N = 8,667) and 20% of the data is used as the model testing set (N = 2,167). We trained the predictive frailty model using the training set data, tested the predictive frailty model on the test set data, and tuned the hyperparameters using grid search to improve the generalization performance of the predictive frailty model (26). To better represent the generalization ability of the predictive frailty model, we used 5-fold cross-validation to evaluate each hyperparameter combination of the predictive frailty model as a way to identify and select the optimal hyperparameter combination (27). Since the prevalence of frailty in the participants of this study was only 10.61%, the data had class-imbalanced characteristics, which tends to bias the accuracy towards more classes when predicting frailty using ML models. To overcome such problems, we processed the class-imbalanced data for predicting frailty using the Synthetic Minority Oversampling Technique (SMOTE) to achieve class balance in the data (28). To eliminate the differences in the data magnitude of different features of the predictive frailty model and convert the data to a uniform scale, we normalized the continuous features using MinMaxScaler and converted the categorical features to a numerical form that can be used for the training of the predictive frailty model using one-hot coding (29, 30).

Different ML algorithms have different applicability properties and performance to the original dataset, so we used six ML models including SVM, Gradient Boosting Decision Tree (GBDT), LightGBM, Category Boosting (CatBoost), XGBoost, and RF to predict the grip strength-related frailty in middle age and older adults. The optimal prediction frailty model is identified based on the area under the receiver operating characteristic curve (AUROC) metrics of different prediction frailty models, and the optimal prediction frailty model is interpreted using the SHAP tool.

2.7 Interpretability tool for SHAP-based predictive frailty modeling

We used SHAP as an interpretable tool for optimal prediction of frailty models in middle-aged and older adults. Traditional variable importance only ranks the importance of the variables in predicting the frailty model and cannot explain how the variables affect the final frailty prediction of the model (31). SHAP is based on cooperative game theory, where the magnitude of the SHAP value accurately assesses the value of each feature’s contribution to the frailty prediction, and the positivity or negativity of the SHAP value suggests the directionality of the contribution (32).

2.8 Statistical analysis

Depending on whether the data conformed to a normal distribution, continuous variables were expressed as mean (standard deviation) or median (quartiles 1–3), and categorical variables were expressed as frequency (percentage). We divided all participants into two groups according to whether they were frail or non-frail. We compared each continuous variable between the two groups of patients using the Mann–Whitney U test, and compared differences in categorical variables using either the χ2 test or Fisher’s precision probability test. The relationship between grip strength and frailty was analyzed using multivariate logistic regression, trend tests, and interactions. We used Python (version 3.11.5) to analyze the ML models and evaluated the performance of each ML model using metrics such as AUROC, Accuracy, Precision, Recall, F1 Score, Brier Score, and Area under the Precise Recall (P-R) curve (AP). Comparisons between groups and logistic regression analysis of variables were statistically analyzed using R (version 4.3.1). The R packages we used include haven, gtsummary, DMwR2, dplyr, plyr, interactions, tidyverse, caret, arsenal, glmnet, and ggplot2. We used the Python libraries Scikit-learn (version 1.2.2) and imblearn library (version 0.10.1). The difference was considered statistically significant with a two-sided p value <0.05.

3 Results

3.1 Baseline characteristics of the participants

Based on the inclusion and exclusion criteria, the final study cohort included a total of 10,834 study participants, of which females: 5,749, and males: 5,085 were included in the baseline frailty status analysis, and the baseline characteristics of these participants are shown in Table 1. The prevalence of frailty in the present study was 10.61% (1,150/10,834) and the patients with frailty were older, with 22.09% being ≥75 years of age. Frail patients were more often unmarried (22.70%) and more often in poor self-perceived health (59.00%). The incidence of frailty was 62.00% in females and 38.00% in males, which is a significant difference. In the frail group, height, weight, body mass index (BMI), alcohol consumption, social activities, nighttime sleep duration, ADL score, orientation, cognitive function, waistline, and grip strength were significantly lower than that of the non-frail group, and the difference was statistically significant (p < 0.01). Hypertension, diabetes, cancer, chronic lung disease, heart disease, stroke, mental disease, arthritis or rheumatism, liver disease, kidney disease, disability, insurance, poor sleep quality, depression, poor life satisfaction, poor hearing, poor vision, and pain were significantly higher in the frail group than in the non-frail group. Disease, digestive disease, asthma, insurance, poor sleep quality, depression, poor life satisfaction, poor hearing, poor vision, and pain were significantly higher than those of the non-frail group, and the difference was statistically significant (p < 0.01).

Table 1
www.frontiersin.org

Table 1. Baseline characteristics of study participants.

3.2 Multivariate logistic regression analysis of grip strength and the risk of frailty

To ensure that a wide range of variations in different grip strength levels could be captured while avoiding bias due to the subjective setting of thresholds, we explored thresholds and dose–response relationships for grip strength and risk of frailty by stratifying grip strength by quartiles (33). Table 2 presents the results of a multivariate logistic regression analysis examining the association between grip strength and the risk of frailty in males. In Model I, a negative association was observed between grip strength and frailty risk, with higher grip strength associated with a reduced risk (OR = 0.932, 95% CI: 0.923–0.942, p < 0.001). This association remained statistically significant in Model II after adjusting for age (OR = 0.939, 95% CI: 0.929–0.950, p < 0.001). In Model III, which included full adjustment for covariates, the association persisted, with each unit increase in grip strength corresponding to a 5.7% decrease in frailty risk (OR = 0.943, 95% CI: 0.931–0.954, p < 0.001). Moreover, in the fully adjusted Model III, males in the highest quartile of grip strength demonstrated a 63.9% lower risk of frailty compared to those in the lowest quartile (OR = 0.361, 95% CI: 0.252–0.509, p < 0.001). In addition, to explore whether grip strength interacted with other important variables, we analyzed the interaction of grip strength with the top three important variables of the SHAP summary plot of the optimal model, respectively. The results showed that there was an interaction effect between grip strength and cognitive function (p < 0.01). Table 3 presents the analysis of the association between female grip strength and the risk of frailty. In Model I, a negative association was identified, with higher grip strength associated with a reduced risk of frailty (OR = 0.928, 95% CI: 0.917–0.939, p < 0.001). This association remained statistically significant in Model II after adjusting for age (OR = 0.947, 95% CI: 0.935–0.958, p < 0.001). In Model III, after full adjustment for covariates, a consistent 4.9% reduction in frailty risk was observed for each unit increase in grip strength (OR = 0.951, 95% CI: 0.939–0.963, p < 0.001). Furthermore, in the fully adjusted Model III, individuals in the highest quartile of grip strength exhibited a 54.3% lower risk of frailty compared to those in the lowest quartile (OR = 0.457, 95% CI: 0.351–0.590, p < 0.001).

Table 2
www.frontiersin.org

Table 2. Multivariate logistic regression analysis of grip strength and the risk of frailty in males.

Table 3
www.frontiersin.org

Table 3. Multivariate logistic regression analysis of grip strength and the risk of frailty in females.

3.3 Performance evaluation and comparison of six ML models

Figure 3 listed seven performance discriminators for SVM, LightGBM, GBDT, CatBoost, XGBoost, and RF models used to predict frailty. We trained six predictive frailty ML models on the training set data and tested each predictive frailty model on the test set data for seven performance discriminators, including AUROC, accuracy, precision, recall, F1 score, Brier score, and AP. In addition, we performed 1,000 resamplings using Bootstrap to calculate the AUROC 95% CI for each predicted frailty model. Based on AUROC, the LightGBM model had the best predictive frailty performance compared to the other five models with an AUROC of 0.768 and 95% CI of 0.741–0.795 (Figure 4). According to the confusion matrix, the LightGBM model predicts frailty with high accuracy and robustness with accuracy, precision, recall, F1 score, Brier score, and AP of 96.9, 76.9, 100%, 0.869, 0.111, and 0.322, respectively, (Figure 5; Figure 6). Ultimately, we use the LightGBM model as the optimal model for predicting frailty and further interpret the model using the SHAP.

Figure 3
www.frontiersin.org

Figure 3. Heatmap comparing the performance of six ML models.

Figure 4
www.frontiersin.org

Figure 4. AUROC comparison of six ML models for predicting frailty. (A) SVM, (B) LightGBM, (C) GBDT, (D) CatBoost, (E) XGBoost, and (F) RF.

Figure 5
www.frontiersin.org

Figure 5. Comparison of confusion matrices for six models for predicting frailty. (A) SVM, (B) LightGBM, (C) GBDT, (D) CatBoost, (E) XGBoost, and (F) RF.

Figure 6
www.frontiersin.org

Figure 6. The AP comparison of six models for predicting frailty. (A) SVM, (B) LightGBM, (C) GBDT, (D) CatBoost, (E) XGBoost, and (F) RF.

3.4 Interpreting the LightGBM model for predicting frailty using SHAP summary plot

The SHAP summary plot listed the SHAP values of each feature, which was used to obtain the magnitude of each feature’s contribution to the LightGBM model’s prediction of frailty, and ordered the importance of each feature from highest to lowest (Figure 7). Cognitive function was the most important feature of the LightGBM model for predicting frailty, followed by nighttime sleep duration and BMI. Higher feature values in the SHAP summary plot are indicated by a reddish color, and a bluish color indicates lower feature values and the horizontal direction represents whether the SHAP value of the feature has a positive or negative effect on the prediction of frailty by the LightGBM model. We observed from the SHAP summary plots that a decreased value of the cognitive function feature has a positive effect on the model predicting frailty, while an increased value of the nighttime sleep duration feature has a negative effect on the model predicting frailty.

Figure 7
www.frontiersin.org

Figure 7. The SHAP global summary plot for the LightGBM model.

3.5 The SHAP individual force plot for the predicted frailty LightGBM model

Figure 8 shows the individual force plots for (A) non-frailty and (B) frailty. The SHAP value for each feature is labeled below the arrows, indicating the contribution of each feature to the model’s frailty prediction, and the number on the horizontal axis is the probabilistic prediction value f(x). The red arrows on the left side represent features that positively influence the model’s frailty prediction, the blue arrows on the right side represent features that negatively influence the model’s frailty prediction, and the length of the arrows indicates the magnitude of the feature’s contribution value to the model’s frailty prediction. Figure 8A demonstrates that the LightGBM model predicted a frail individual, and Figure 8B indicates that the LightGBM model predicted a non-frail individual.

Figure 8
www.frontiersin.org

Figure 8. The SHAP force plot of individuals with different frailty prediction outcomes. (A) frailty, (B) non-frailty, cognition: cognitive function, sleephr: nighttime sleep duration, CLD: chronic lung disease, waist: waistline.

4 Discussion

This study is the first to explore the relationship between grip strength and frailty in middle-aged and older adults using the interpretable ML method, which showed a prevalence of frailty of 10.61%, which is consistent with the previously reported prevalence of frailty of 4.6% ~ 17.1% (34). Studies have shown that frailty is closely associated with hospitalizations, falls, fractures, and death, so accurately identifying those at risk for frailty is important for preventing frailty and related adverse outcomes (35). Our analysis revealed a negative association between grip strength and the risk of frailty, with this relationship becoming evident when grip strength exceeded 29.00 kg in males and 19.00 kg in females. This finding suggests the presence of a grip strength threshold for assessing frailty, which varies by gender. Specifically, grip strength is negatively correlated with frailty risk and can serve as an indicator of frailty only when it exceeds 29.00 kg in males and 19.00 kg in females. We constructed the SHAP-based interpretable lightGBM model to predict grip strength-related frailty in the middle-aged and older adults. In addition, we trained five other ML models, among which the LightGBM model has the optimal performance. The performance metrics of the LightGBM model, including AUROC, accuracy, precision, recall, F1 score, Brier score, and AP, were 0.768 (95% CI, 0.741–0.795), 96.9, 76.9, 100%, 0.869, 0.111, and 0.322, respectively. These results demonstrate the model’s high reliability, robustness, and accuracy in predicting frailty. The model demonstrated moderate to good discriminatory ability in predicting frailty, as an AUROC between 0.7 and 0.8 is generally regarded as acceptable in medical prediction tasks (36).

The LightGBM model’s superior performance is rooted in its efficient computational design and advanced feature handling capabilities. By using histogram-based learning, LightGBM significantly reduces computational overhead, making it faster and more memory-efficient than other gradient-boosting frameworks like XGBoost. Its leaf-wise tree growth strategy enhances the model’s ability to identify complex interactions between variables, leading to higher predictive accuracy. Moreover, the integrated regularization mechanisms effectively prevent overfitting, a critical advantage when working with imbalanced datasets like ours. These factors collectively contribute to the model’s robustness and reliability, underscoring its suitability for predicting frailty in middle-aged and older populations. Using SHAP individual force plots to interpret the LightGBM model allows for a better understanding of the decision-making process for model prediction frailty. Using the SHAP global summary plot, we also identified and ranked the importance of significant risk factors that predict frailty in the middle-aged and older adults. Cognitive function is the most important predictor variable for the LightGBM model, followed by nighttime sleep duration, BMI, and grip strength. Ma W et al. also confirmed the existence of a significant association between frailty and cognitive function and found that depression partially mediated the link between frailty and cognitive function (37).

Using the interpretable LightGBM model to predict frailty, we found that lower grip strength, cognitive function, BMI, and nighttime sleep duration were associated with a higher risk of frailty in middle-aged and older adults. Older age and female gender were also associated with a higher risk of frailty. A cross-sectional study of the relationship between frailty and muscle response by Suzuki Y et al. showed that grip reaction speed decreases with frailty, and that measuring grip strength in both hands can help to study the relationship between frailty and grip strength (38). A review by Vaishya R et al. stated that hand grip strength is a basic indicator for assessing muscle function and overall body function, is particularly relevant to old age, identifies a variety of health problems, and is an important biomarker of health (39). A cross-sectional observational study by Choe et al. found that muscle mass did not directly affect frailty and that grip strength mediated the relationship between muscle mass and frailty in older Korean adults (40). A systematic review and meta-analysis by Robinson TL et al. noted that frailty has a significant and negative impact on cognitive function, and their combined effect may increase the risk of adverse health outcomes in older adults (41). Jayanama K et al. examined the relationship between high BMI and frailty and mortality and found that being overweight or obese was associated with a higher risk of frailty and that the relationship between BMI and frailty was partly due to body fat (42). Arias-Fernández L et al. studied the relationship between sleep disorders and impaired physical functioning in older adults and suggested that self-reported poor sleep quality was associated with frailty and muscle weakness (43). These points confirmed our findings, and we hypothesized that grip strength, cognitive function, BMI, and nighttime sleep duration could serve as independent predictors of frailty.

Gordon EH et al. suggested that aging is a major risk factor for most chronic diseases and that frailty is associated with many factors over the life course, particularly smoking, chronic disease, obesity, and economic deprivation (44). Bellelli F et al. showed that frailty increases with age and is associated with shorter years of education and that there is an exponential interaction between education and age in influencing frailty (45). Park C et al. and Zeidan RS et al. found that frailty is a manifestation of age-related decline in physiologic reserve and increased vulnerability and that there is a significant gender difference in the prevalence of frailty, with frailty being more common and severe in females, but with a relatively lower risk of death in females (3, 46). A cross-sectional study of frailty and gender differences in Europe found that females in Europe were more frail and had more comorbidities than males (47). These points are consistent with our findings and also suggest that age and gender may serve as independent predictors of frailty.

The interpretable LightGBM predictive model developed in this study demonstrates significant clinical utility in the management of frailty. For middle-aged and older adults, grip strength can be enhanced through strength training and nutritional interventions as part of personalized frailty management strategies. Given the observed relationship between nocturnal sleep duration and frailty risk, measures such as behavioral therapies, reduction in the use of central nervous system depressants, and optimization of the sleep environment may help improve sleep quality. Cognitive function, identified as a critical predictor of frailty, can be targeted through interventions like cognitive training and social engagement activities. For patients with excessively low body weight and muscle mass, maintaining a healthy BMI through dietary modifications and moderate exercise can mitigate the risk of frailty. The SHAP force plot offers a valuable tool for visualizing the contribution of individual features to a patient’s frailty risk. For instance, patients with low grip strength but high BMI may benefit from prioritized interventions focused on strength training and weight management. Furthermore, the SHAP global feature importance map highlights key predictors, such as cognitive function and sleep duration, which can be incorporated into clinical screening protocols to guide early intervention efforts. By leveraging SHAP values, specific frailty management pathways can be designed to identify high-risk individuals. These patients can then be allocated to tailored intervention programs, such as grip strength training or cognitive function rehabilitation. The interpretability of the model provides clinicians with actionable decision support, particularly when managing a large cohort of patients with limited resources. Through feature prioritization, the model facilitates the efficient allocation of intervention resources, ensuring that high-impact strategies are applied where they are most needed.

This study has some strengths and limitations regarding the prediction of frailty. First, this study is based on the CHARLS data, which is nationally representative and high-quality data. It also uses the big data analytic power of ML to predict frailty in China’s middle-aged and older adults. Second, the SHAP-based interpretable LightGBM model makes it easy for users to understand how it predicts frailty in middle-aged and older adults. Third, since the CHARLS data was a prospective study, it was possible to infer the existence of a causal relationship between the variables and the outcome, making the LightGBM model highly reliable in predicting frailty. The potential underrepresentation of rural populations in the CHARLS dataset introduces a limitation to our study. Rural communities often experience unique sociodemographic and health-related challenges, including lower socioeconomic status, limited healthcare access, and higher prevalence of malnutrition, which may influence the relationship between grip strength and frailty. Future studies should aim to validate our findings using datasets with a more balanced representation of rural and urban populations, or by employing stratified sampling techniques to minimize bias.

5 Conclusion

The grip strength-related LightGBM prediction model, constructed and tested based on SHAP, has high accuracy, robustness, and reliability in predicting the risk of frailty in middle-aged and older Chinese adults. The interpretable LightGBM predictive model we created accurately explores risk factors for frailty in middle-aged and older adults and has important clinical applications. Reducing grip strength, cognitive function, nighttime sleep duration, and BMI increases the risk of frailty and may provide strategies for individualized management of frailty in middle-aged and older adults.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by the ethics committee of Peking University, Approval No. IRB00001052-11015. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

LY: Investigation, Methodology, Resources, Software, Writing – review & editing. SC: Conceptualization, Data curation, Formal analysis, Funding acquisition, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. BS: Methodology, Resources, Validation, Visualization, Writing – review & editing. YH: Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Software, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Wenzhou Basic Scientific Research Project of China (Y20220414 and Y2023304).

Acknowledgments

The authors would like to thank all those who participated in this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2024.1489848/full#supplementary-material

References

1. Dent, E, Martin, FC, Bergman, H, Woo, J, Romero-Ortuno, R, and Walston, JD. Management of frailty: opportunities, challenges, and future directions. Lancet. (2019) 394:1376–86. doi: 10.1016/S0140-6736(19)31785-4

PubMed Abstract | Crossref Full Text | Google Scholar

2. Cohen, CI, Benyaminov, R, Rahman, M, Ngu, D, and Reinhardt, M. Frailty: a multidimensional biopsychosocial syndrome. Med Clin North Am. (2023) 107:183–97. doi: 10.1016/j.mcna.2022.04.006

PubMed Abstract | Crossref Full Text | Google Scholar

3. Park, C, and Ko, FC. The science of frailty: sex differences. Clin Geriatr Med. (2021) 37:625–38. doi: 10.1016/j.cger.2021.05.008

PubMed Abstract | Crossref Full Text | Google Scholar

4. Fried, LP, Tangen, CM, Walston, J, Newman, AB, Hirsch, C, Gottdiener, J, et al. Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci. (2001) 56:M146–57. doi: 10.1093/gerona/56.3.M146

PubMed Abstract | Crossref Full Text | Google Scholar

5. Rockwood, K, and Mitnitski, A. Frailty in relation to the accumulation of deficits. J Gerontol A Biol Sci Med Sci. (2007) 62:722–7. doi: 10.1093/gerona/62.7.722

PubMed Abstract | Crossref Full Text | Google Scholar

6. Sison, S, Shi, SM, Oh, G, Jeong, S, McCarthy, EP, and Kim, DH. Claims-based frailty index and its relationship with commonly used clinical frailty measures. J Gerontol A Biol Sci Med Sci. (2024) 79:glae094. doi: 10.1093/gerona/glae094

PubMed Abstract | Crossref Full Text | Google Scholar

7. Petermann-Rocha, F, Gray, SR, Forrest, E, Welsh, P, Sattar, N, Celis-Morales, C, et al. Associations of muscle mass and grip strength with severe NAFLD: a prospective study of 333, 295 UK biobank participants. J Hepatol. (2022) 76:1021–9. doi: 10.1016/j.jhep.2022.01.010

PubMed Abstract | Crossref Full Text | Google Scholar

8. Bohannon, RW. Grip Strength: An Indispensable Biomarker For Older Adults. Clin Interv Aging. (2019) 14:1681–91. doi: 10.2147/CIA.S194543

PubMed Abstract | Crossref Full Text | Google Scholar

9. Song, J, Liu, T, Zhao, J, Wang, S, Dang, X, and Wang, W. Causal associations of hand grip strength with bone mineral density and fracture risk: a mendelian randomization study. Front Endocrinol (Lausanne). (2022) 13:1020750. doi: 10.3389/fendo.2022.1020750

PubMed Abstract | Crossref Full Text | Google Scholar

10. Hamasaki, H. What can hand grip strength tell us about type 2 diabetes?: mortality, morbidities and risk of diabetes. Expert Rev Endocrinol Metab. (2021) 16:237–50. doi: 10.1080/17446651.2021.1967743

PubMed Abstract | Crossref Full Text | Google Scholar

11. Hadzibegovic, S, Porthun, J, Lena, A, Weinländer, P, Lück, LC, Potthoff, SK, et al. Hand grip strength in patients with advanced cancer: a prospective study. J Cachexia Sarcopenia Muscle. (2023) 14:1682–94. doi: 10.1002/jcsm.13248

PubMed Abstract | Crossref Full Text | Google Scholar

12. Malik, MA, Faraone, SV, Michoel, T, and Haavik, J. Use of big data and machine learning algorithms to extract possible treatment targets in neurodevelopmental disorders. Pharmacol Ther. (2023) 250:108530. doi: 10.1016/j.pharmthera.2023.108530

PubMed Abstract | Crossref Full Text | Google Scholar

13. Cao, S, and Hu, Y. Interpretable machine learning framework to predict gout associated with dietary fiber and triglyceride-glucose index. Nutr Metab (Lond). (2024) 21:25. doi: 10.1186/s12986-024-00802-2

PubMed Abstract | Crossref Full Text | Google Scholar

14. Liu, Y, Cui, J, Cao, L, Stubbendorff, A, and Zhang, S. Association of depression with incident sarcopenia and modified effect from healthy lifestyle: the first longitudinal evidence from the CHARLS. J Affect Disord. (2024) 344:373–9. doi: 10.1016/j.jad.2023.10.012

PubMed Abstract | Crossref Full Text | Google Scholar

15. Kolle, AT, Lewis, KB, Lalonde, M, and Backman, C. Reversing frailty in older adults: a scoping review. BMC Geriatr. (2023) 23:751. doi: 10.1186/s12877-023-04309-y

PubMed Abstract | Crossref Full Text | Google Scholar

16. Verduri, A, Carter, B, Laraman, J, Rice, C, Clini, E, Maskell, NA, et al. Frailty and its influence on mortality and morbidity in COPD: a systematic review and Meta-analysis. Intern Emerg Med. (2023) 18:2423–34. doi: 10.1007/s11739-023-03405-6

PubMed Abstract | Crossref Full Text | Google Scholar

17. Mishra, M, Wu, J, Kane, AE, and Howlett, SE. The intersection of frailty and metabolism. Cell Metab. (2024) 36:893–911. doi: 10.1016/j.cmet.2024.03.012

PubMed Abstract | Crossref Full Text | Google Scholar

18. Ali, H, Shahzad, M, Sarfraz, S, Sewell, KB, Alqalyoobi, S, and Mohan, BP. Application and impact of Lasso regression in gastroenterology: a systematic review. Indian J Gastroenterol. (2023) 42:780–90. doi: 10.1007/s12664-023-01426-9

PubMed Abstract | Crossref Full Text | Google Scholar

19. Andriopoulos, V, and Kornaros, M. LASSO regression with multiple imputations for the selection of key variables affecting the fatty acid profile of Nannochloropsis oculata. Mar Drugs. (2023) 21:483. doi: 10.3390/md21090483

PubMed Abstract | Crossref Full Text | Google Scholar

20. Bu, F, Deng, XH, Zhan, NN, Cheng, H, Wang, ZL, Tang, L, et al. Development and validation of a risk prediction model for frailty in patients with diabetes. BMC Geriatr. (2023) 23:172. doi: 10.1186/s12877-023-03823-3

Crossref Full Text | Google Scholar

21. Yin, H, Gao, C, Quan, Z, and Zhang, Y. The relationship between frailty, walking ability, and depression in elderly Chinese people. Medicine (Baltimore). (2023) 102:e35876. doi: 10.1097/MD.0000000000035876

PubMed Abstract | Crossref Full Text | Google Scholar

22. Masoudi, N, Sarbazi, E, Soleimanpour, H, Abbasian, M, Ghasemi, M, Rostami, Z, et al. Loneliness and its correlation with self-care and activities of daily living among older adults: a partial least squares model. BMC Geriatr. (2024) 24:621. doi: 10.1186/s12877-024-05215-7

Crossref Full Text | Google Scholar

23. Ye, Z, Li, X, Lang, H, and Fang, Y. Long-term PM2.5 exposure, lung function, and cognitive function among middle-aged and older adults in China. J Gerontol A Biol Sci Med Sci. (2023) 78:2333–41. doi: 10.1093/gerona/glad180

Crossref Full Text | Google Scholar

24. Hou, X, Luo, Y, Yang, F, Zhu, X, Gao, X, Wang, W, et al. The mediating role of children's intergenerational support in association between grandparenting and cognitive function among middle-aged and older Chinese: findings from the CHARLS cohort study. BMC Public Health. (2024) 24:597. doi: 10.1186/s12889-024-18106-8

PubMed Abstract | Crossref Full Text | Google Scholar

25. Bergenfeld, I, Kaslow, NJ, Yount, KM, Cheong, YF, Johnson, ER, and Clark, CJ. Measurement invariance of the Center for Epidemiologic Studies Scale-Depression within and across six diverse intervention trials. Psychol Assess. (2023) 35:805–20. doi: 10.1037/pas0001262

PubMed Abstract | Crossref Full Text | Google Scholar

26. Mahardika, TNQ, Fuadah, YN, Jeong, DU, and Lim, KM. PPG signals-based blood-pressure estimation using grid search in Hyperparameter optimization of CNN-LSTM. Diagnostics (Basel). (2023) 13:2566. doi: 10.3390/diagnostics13152566

PubMed Abstract | Crossref Full Text | Google Scholar

27. Fernandes, M, Sun, H, Chemali, Z, Mukerji, SS, Moura, MVR, Zafar, SF, et al. Brain health scores to predict neurological outcomes from electronic health records. Int J Med Inform. (2023) 180:105270. doi: 10.1016/j.ijmedinf.2023.105270

PubMed Abstract | Crossref Full Text | Google Scholar

28. Huang, X, Schmelter, F, Irshad, MT, Piet, A, Nisar, MA, Sina, C, et al. Optimizing sleep staging on multimodal time series: leveraging borderline synthetic minority oversampling technique and supervised convolutional contrastive learning. Comput Biol Med. (2023) 166:107501. doi: 10.1016/j.compbiomed.2023.107501

PubMed Abstract | Crossref Full Text | Google Scholar

29. Singh, VK, Maurya, NS, Mani, A, and Yadav, RS. Machine learning method using position-specific mutation based classification outperforms one hot coding for disease severity prediction in haemophilia 'A'. Genomics. (2020) 112:5122–8. doi: 10.1016/j.ygeno.2020.09.020

PubMed Abstract | Crossref Full Text | Google Scholar

30. Shrivastava, T, Singh, V, and Agrawal, A. Autism spectrum disorder detection with kNN imputer and machine learning classifiers via questionnaire mode of screening. Health Inf Sci Syst. (2024) 12:18. doi: 10.1007/s13755-024-00277-8

PubMed Abstract | Crossref Full Text | Google Scholar

31. Tseng, PY, Chen, YT, Wang, CH, Chiu, KM, Peng, YS, Hsu, SP, et al. Prediction of the development of acute kidney injury following cardiac surgery by machine learning. Crit Care. (2020) 24:478. doi: 10.1186/s13054-020-03179-9

Crossref Full Text | Google Scholar

32. Li, X, Zhao, Y, Zhang, D, Kuang, L, Huang, H, Chen, W, et al. Development of an interpretable machine learning model associated with heavy metals' exposure to identify coronary heart disease among US adults via SHAP: findings of the US NHANES from 2003 to 2018. Chemosphere. (2023) 311:137039. doi: 10.1016/j.chemosphere.2022.137039

PubMed Abstract | Crossref Full Text | Google Scholar

33. Park, HJ, Han, B, Chang, SY, Kang, SH, Lee, DW, and Kang, S. Hand grip strength, osteoporosis, and quality of life in middle-aged and older adults. Medicina (Kaunas). (2023) 59:2148. doi: 10.3390/medicina59122148

PubMed Abstract | Crossref Full Text | Google Scholar

34. Ishii, H, Doi, T, Tsutsumimoto, K, Nakakubo, S, Kurita, S, and Shimada, H. Driving cessation and physical frailty in community-dwelling older adults: a longitudinal study. Geriatr Gerontol Int. (2021) 21:1047-52. doi: 10.1111/ggi.14272

PubMed Abstract | Crossref Full Text | Google Scholar

35. Doody, P, Asamane, EA, Aunger, JA, Swales, B, Lord, JM, Greig, CA, et al. The prevalence of frailty and pre-frailty among geriatric hospital inpatients and its association with economic prosperity and healthcare expenditure: a systematic review and meta-analysis of 467, 779 geriatric hospital inpatients. Ageing Res Rev. (2022) 80:101666. doi: 10.1016/j.arr.2022.101666

PubMed Abstract | Crossref Full Text | Google Scholar

36. Tantai, XX, Liu, N, Yang, LB, Wei, ZC, Xiao, CL, Song, YH, et al. Prognostic value of risk scoring systems for cirrhotic patients with variceal bleeding. World J Gastroenterol. (2019) 25:6668–80. doi: 10.3748/wjg.v25.i45.6668

PubMed Abstract | Crossref Full Text | Google Scholar

37. Ma, W, Wu, B, Gao, X, and Zhong, R. Association between frailty and cognitive function in older Chinese people: a moderated mediation of social relationships and depressive symptoms. J Affect Disord. (2022) 316:223–32. doi: 10.1016/j.jad.2022.08.032

PubMed Abstract | Crossref Full Text | Google Scholar

38. Suzuki, Y, Matsui, Y, Hirano, Y, Kondo, I, Nemoto, T, Tanimoto, M, et al. Relationships among grip strength measurement, response time, and frailty criteria. J Frailty Aging. (2023) 12:182–8. doi: 10.14283/jfa.2023.18

PubMed Abstract | Crossref Full Text | Google Scholar

39. Vaishya, R, Misra, A, Vaish, A, Ursino, N, and D'Ambrosi, R. Hand grip strength as a proposed new vital sign of health: a narrative review of evidences. J Health Popul Nutr. (2024) 43:7. doi: 10.1186/s41043-024-00500-y

PubMed Abstract | Crossref Full Text | Google Scholar

40. Choe, YR, Jeong, JR, and Kim, YP. Grip strength mediates the relationship between muscle mass and frailty. J Cachexia Sarcopenia Muscle. (2020) 11:441–51. doi: 10.1002/jcsm.12510

PubMed Abstract | Crossref Full Text | Google Scholar

41. Robinson, TL, Gogniat, MA, and Miller, LS. Frailty and cognitive function in older adults: a systematic review and Meta-analysis of cross-sectional studies. Neuropsychol Rev. (2022) 32:274–93. doi: 10.1007/s11065-021-09497-1

PubMed Abstract | Crossref Full Text | Google Scholar

42. Jayanama, K, Theou, O, Godin, J, Mayo, A, Cahill, L, and Rockwood, K. Relationship of body mass index with frailty and all-cause mortality among middle-aged and older adults. BMC Med. (2022) 20:404. doi: 10.1186/s12916-022-02596-7

PubMed Abstract | Crossref Full Text | Google Scholar

43. Arias-Fernández, L, Smith-Plaza, AM, Barrera-Castillo, M, Prado-Suárez, J, Lopez-Garcia, E, Rodríguez-Artalejo, F, et al. Sleep patterns and physical function in older adults attending primary health care. Fam Pract. (2021) 38:146–52. doi: 10.1093/fampra/cmaa085

PubMed Abstract | Crossref Full Text | Google Scholar

44. Gordon, EH, and Hubbard, RE. Frailty: understanding the difference between age and ageing. Age Ageing. (2022) 51:afac185. doi: 10.1093/ageing/afac185

PubMed Abstract | Crossref Full Text | Google Scholar

45. Bellelli, F, Consorti, E, Hettiarachchige, T, Rossi, P, Lucchi, T, Froldi, M, et al. Relationship among age, education and frailty in older persons. J Frailty Aging. (2023) 12:326–8. doi: 10.14283/jfa.2023.39

PubMed Abstract | Crossref Full Text | Google Scholar

46. Zeidan, RS, McElroy, T, Rathor, L, Martenson, MS, Lin, Y, and Mankowski, RT. Sex differences in frailty among older adults. Exp Gerontol. (2023) 184:112333. doi: 10.1016/j.exger.2023.112333

PubMed Abstract | Crossref Full Text | Google Scholar

47. Ahrenfeldt, LJ, Möller, S, Thinggaard, M, Christensen, K, and Lindahl-Jacobsen, R. Sex differences in comorbidity and frailty in Europe. Int J Public Health. (2019) 64:1025–36. doi: 10.1007/s00038-019-01270-9

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: frailty, grip strength, machine learning, SHapley Additive exPlanation, CHARLS

Citation: Yu L, Cao S, Song B and Hu Y (2024) Predicting grip strength-related frailty in middle-aged and older Chinese adults using interpretable machine learning models: a prospective cohort study. Front. Public Health. 12:1489848. doi: 10.3389/fpubh.2024.1489848

Received: 02 September 2024; Accepted: 02 December 2024;
Published: 17 December 2024.

Edited by:

Diogo Luís Marques, University of Beira Interior, Portugal

Reviewed by:

Chen-Cheng Yang, Kaohsiung Municipal Siaogang Hospital, Taiwan
Paulo Jorge Coelho, Polytechnic Institute of Leiria, Portugal

Copyright © 2024 Yu, Cao, Song and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yangyang Hu, MjA5MjA0QHd6aGVhbHRoLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.