ORIGINAL RESEARCH article

Front. Public Health, 11 March 2025

Sec. Aging and Public Health

Volume 13 - 2025 | https://doi.org/10.3389/fpubh.2025.1518472

This article is part of the Research TopicInnovations and Strategies for Comprehensive Frailty Management in Older PeopleView all articles

Determinants and risk prediction models for frailty among community-living older adults in eastern China

Lin Qi,Lin Qi1,2Jianyu LiuJianyu Liu2Xuhui SongXuhui Song2Xinle WangXinle Wang2Mengmeng YangMengmeng Yang2Xinyi CaoXinyi Cao2Yan He
Yan He1*
  • 1College of Management, Hainan Medical University, Haikou, China
  • 2College of Public Health, Zhengzhou University, Zhengzhou, China

Objective: The purpose of this study is to develop predictive models for frailty risk among community-dwelling older adults in eastern China using machine learning techniques. This approach aims to facilitate early detection of high-risk individuals and inform the design of tailored interventions, with the ultimate goals of enhancing quality of life and mitigating frailty progression in the older adult population.

Methods: This study involved 1,263 participants aged 60 years or older, who were selected through stratified cluster sampling. Frailty was assessed using the Tilburg Frailty Indicator (TFI), which encompasses physical, psychological, and social dimensions. Predictive models were constructed using decision trees, random forests, and XGBoost algorithms, implemented in R software (version 4.4.2). The performance of these models was evaluated using metrics such as the area under the receiver operating characteristic curve (AUC), ROC curves, and confusion matrices.

Results: The results showed that 64.77% of the older adult were physically weak. Body mass index (BMI), living arrangements, frequency of visits and smoking status are the main factors contributing to frailty. When comparing predictive model metrics, random forest and extreme Gradient Lift (XGBoost) outperform decision tree models in terms of accuracy and applicability.

Conclusion: Older adults living in communities in eastern China showed slight frailty, and many factors influenced their frailty scores. Random forest and XGBoost models outperform decision tree models in predicting frailty in older adults, so identifying high-risk individuals early and developing personalized interventions can help slow the development of frailty and improve quality of life in older adults.

1 Introduction

Frailty is marked by the decline of multiple physiological systems, reducing the ability to cope with external stressors and increasing the risk of adverse health outcomes (13). It is a significant challenge in aging populations and has become a major public health concern as global aging accelerates (4). Our study shows that frailty rates rise sharply with age, exceeding 20% among those aged 80 and older. Frailty affects individuals on physical, psychological, and social levels and is closely tied to chronic diseases like heart disease, diabetes, and respiratory conditions, which worsen frailty (58). Sarcopenia is also a key contributor (9), while mental health issues such as depression and anxiety, often linked to social isolation, further exacerbate frailty (10, 11). These multifaceted impacts increase health risks, reduce quality of life, and drive up healthcare costs, placing a heavy burden on families and society (12). Identifying frailty risk factors and developing effective predictive models are now central to geriatric research.

To better assess and prevent frailty, various tools have been developed. The Tilburg Frailty Indicator (TFI) is widely used, covering physical, psychological, and social dimensions through 15 items (13). Other tools, such as the Fried Frailty Phenotype and Frailty Index, are also used to identify high-risk individuals and guide interventions (1416). In this study, we chose the TFI to assess frailty among older adult individuals in eastern China, capturing its multidimensional nature. Research shows that machine learning models like decision trees, random forests, and XGBoost are effective for predicting frailty risk. However, many studies lack systematic internal validation after model construction, limiting the evaluation of model stability and reliability. Even when validation is conducted, some studies use simpler methods, such as Bootstrap resampling without cross-validation (1720). This study aims to conduct systematic internal validation, compare model performance, and select an optimal model with strong generalization and robustness for frailty risk prediction. Early detection and appropriate interventions, such as exercise, nutritional support, and psychological care, can significantly slow frailty progression and improve quality of life in older adults (21). Interventions should focus on managing reversible diseases and preventing frailty through early identification. This study uses a machine learning model to predict frailty risk, aiming to provide personalized intervention recommendations for middle-aged and older adult individuals in the community and offer a scientific basis for future public health policies.

2 Methods

2.1 Data collection

This study targeted older adults living in communities in eastern China and collected data through a cross-sectional survey conducted by public health staff between September and December 2021 as part of a household survey. The method of multi-stage stratified cluster sampling is adopted. The first step is to randomly select Shandong, Jiangsu and Guangdong from 12 provinces in eastern China. The second step is to randomly select one municipality from each selected province: Qingdao, Suzhou, and Guangzhou. Step 3: Jimo District of Qingdao, Kunshan County of Suzhou and Yuexiu District of Guangzhou shall set up one district or county each. The fourth step is to randomly select two streets from each district, and then select two neighborhoods from each street. Participants were randomly sampled from these communities. The inclusion criteria were older adult individuals aged 60 or older, with at least two chronic diseases diagnosed by a doctor, and those who consented to participate after being informed about the survey. Exclusion criteria included incomplete information on chronic disease conditions or responses with logical errors or unclear answers. Out of 1,380 collected questionnaires, 1,323 were retained after excluding those with missing key variables. Further exclusions were made for errors or unreasonable data related to height and weight, leaving 1,263 older adult individuals for the final analysis. The recruitment process of participants is shown in Figure 1.

Figure 1
www.frontiersin.org

Figure 1. Flowchart on participant recruitment.

2.2 Covariates and frailty assessment

The following factors are considered covariates. Demographic factors include gender, age, whether you live in rural or urban areas, race, education level, marital status, frequency of medical visits, living arrangements, monthly income, BMI, and pension insurance. Lifestyle and chronic disease factors included smoking status, drinking habits, number of chronic diseases, type of medication, length of sleep, and the frequency of participants’ outdoor activities. The study defined drinking status as “yes” for those who had consumed alcohol and were still drinking. People who have never drunk alcohol, and people who have stopped drinking are rated as “no.” Smoking is defined as “yes” for those who have smoked and are still smoking, “no” for those who have never smoked, and “no” for those who have quit smoking.

The Tilburg Frailty Indicator (TFI) was chosen to assess frailty in older adults. Developed by Gobbens et al. (13), this instrument integrates multiple frailty models to evaluate frailty status in the older adult (22). It comprises 15 items divided into three domains: physical, psychological, and social. The physical domain (8 items) covers conditions like poor health, unexplained weight loss, walking difficulties, balance issues, vision and hearing impairments, reduced grip strength, and persistent fatigue. The psychological domain (4 items) examines memory decline, depression, anxiety, and problem-solving difficulties. The social domain (3 items) assesses social isolation, lack of interpersonal relationships, and insufficient social support. The TFI uses a binary scoring system (1 point for frailty, 0 for no frailty; some items score 0.5). Scores range from 0 to 15, with ≥5 indicating frailty and higher scores reflecting greater severity. Participants were categorized as frail or non-frail based on these scores. The scale exhibited good internal consistency, with a Cronbach’s α of 0.72.

2.3 Principle and evaluation method of decision tree, random forest and XGBoost model

2.3.1 Decision tree model

The decision tree is a tree-based model used for classification and regression. It recursively divides the dataset into smaller subsets to build the model. The fundamental concept involves selecting features that maximize data purity and using these features to split the data.

Formula:

Information entropy: used to measure the uncertainty of data, the formula is calculated as:

Entropy ( S ) = Σ i = 1 n p i log 2 ( p i )

Where pi is the proportion of Class i samples in the data set.

Information gain: used to measure the effect of features on reducing uncertainty, calculated by:

Gain ( S , F ) = Entropy ( S ) Σ v V a l u e s ( F ) | S v | | S | Entropy ( S v )

Where F is the feature, Values(F) is all possible values of feature F, and Sv is the subdataset of feature F with value v.

2.3.2 Random forest model

Random forest enhances model accuracy and robustness by constructing multiple decision trees and aggregating their predictions via voting or averaging. It employs Bootstrap sampling to create subsamples from the dataset. Each subsample generates a decision tree, with random feature subsets selected at each split. The final prediction is obtained by consolidating the outputs of all trees. The construction of individual trees in random forest follows the principles of a standalone decision tree, using information gain or the Gini index to determine optimal splits.

2.3.3 XGBoost model

XGBoost is a gradient-boosting-based ensemble algorithm that enhances model performance by iteratively adding decision trees. Each new tree is built to address the prediction errors of its predecessor, thereby progressively reducing model bias.

Formula:

The objective function of XGBoost is:

obj θ = 1 N Σ i = 1 N L y i , f x i + Ω f

L(yi, f(xi)) represents the loss function, which quantifies the discrepancy between the predicted and actual values. Meanwhile, Ω(f) serves as the regularization term to manage the model’s complexity.

When splitting a node, XGBoost determines whether to split by calculating the gain after splitting:

Gain = 1 2 ( G L 2 H L + λ G R 2 H R + λ ( G L + G R ) 2 H L + H R + λ γ )

GL and GR denote the first-order derivatives of the left and right subtrees before and after splitting, respectively. HL and HR represent the sums of the second-order derivatives. Meanwhile, λ and γ are regularization parameters.

2.3.4 Performance evaluation of prediction model

To evaluate the model’s performance comprehensively, the dataset was randomly split into training and test sets at a 7:3 ratio. A confusion matrix was generated based on this division to assess the model’s predictive accuracy. The key performance metrics calculated include Accuracy (ACC), Recall (R), Precision (P), F1 score, and AUC (Area Under the ROC Curve). The confusion matrix is detailed in Table 1.

Table 1
www.frontiersin.org

Table 1. Confusion matrix of binary classification model.

The calculation formula of each index is as follows:

(1)  Accuracy rate represents the probability that the prediction sample is correctly identified:

ACC = TP + TN TP + FP + TN + FN     (1)

(2)  The recall rate represents the proportion of demand that is correctly predicted as a percentage of all real demand:

R = TP TP + FN     (2)

(3)  The accuracy rate represents the true percentage of those who are really in demand:

P = TN FP + TN     (3)

(4)  F1 score represents the harmonic mean of accuracy rate and recall rate, which can balance the relationship between recall rate and accuracy rate:

F 1 = 2 × R × P R + P     (4)

(5)  AUC represents the area under ROC curve. The AUC ranges from 0 to 1, with an AUC of 0.5 indicating no predictive power, 0.5 to 0.7 low predictive power, 0.7 to 0.9 medium predictive power, and higher than 0.9 high predictive power.

Given that accuracy, recall, and precision are single metrics and cannot fully capture model performance, this study uses both the F1 score and AUC value to provide a comprehensive assessment of the model’s predictive ability.

2.4 Statistical analysis

Statistical analyses were performed using SPSS 27.0, with significance set at p < 0.05. Categorical data were described using counts (n) and percentages (%), and chi-square tests were used to assess group differences. Variables significant in univariate analyses (p < 0.05) were included in binary logistic regression models to identify frailty-related factors. For predictive modeling, R 4.4.2 was employed with three machine learning packages: rpart for decision trees, randomForest for random forests, and xgboost for XGBoost. The dataset (n = 1,263) was split into training (n = 885, 70%) and testing sets (n = 378, 30%) to prevent overfitting and ensure model generalizability.

3 Results

3.1 Frailty status and demographic characteristics of community-dwelling older adults

According to the scores of the three frailty dimensions, most older adult individuals had physical scores between 0 and 3. This indicates that only a small number of them showed significant physical frailty. The psychological scores were mostly concentrated at 2, with fewer individuals scoring 0 (no psychological frailty) or 3–4. This suggests that most older adult individuals experienced mild to moderate psychological frailty. Similarly, social dimension scores were also centered at 2 points, with fewer older adult scoring 0 or 3. This implies that extreme social isolation or strong social support were uncommon. Overall, the majority of older adult individuals scored between 5 and 6, indicating that a higher percentage of community-dwelling older adult were mildly frail. Physical and social frailty were more noticeable, while psychological frailty, though less frequent, still needs attention (Figure 2).

Figure 2
www.frontiersin.org

Figure 2. Overall frailty and various dimensions of frailty in the older adult.

3.2 Univariate and multivariate analysis of frailty risk factors

The study involved 1,263 older adult participants, comprising 47.5% males and 52.5% females. Their ages were distributed as follows: 24.2% were aged 60–69, 29.9% were 70–79, and 45.9% were 80 or older. The majority (97.1%) were Han Chinese, with 2.9% belonging to ethnic minorities. Educational backgrounds varied, with 18.0% illiterate, 34.7% having primary education, 21.3% with lower secondary education, and 26.0% holding high school or higher qualifications. Marital status showed that 50.4% were married, 11.4% were single, and 35.5% were widowed. Residence-wise, 55.7% lived in urban areas and 44.3% in rural areas. Regarding monthly income, 59.2% earned ≤3,000 yuan, while 60.7% had pension insurance. BMI data revealed that 9.9% were underweight, 65.7% were normal weight, 22.2% were overweight, and 2.2% were obese. Participants were categorized into frail (818) and non-frail (445) groups. Significant differences between the groups were observed in age, marital status, monthly income, pension insurance, BMI, living arrangements, visit frequency, smoking status, sleep duration, outdoor activity frequency, medication type, and number of chronic diseases (p < 0.05). The results indicate that older adults are more likely to be frail if they are older, widowed, have lower incomes, lack pension insurance, are underweight or overweight, live alone, have fewer visits, smoke, take multiple medications, or have multiple chronic conditions (Table 2).

Table 2
www.frontiersin.org

Table 2. Demographic characteristics and single factor analysis of older adult people living at home in community.

To identify factors associated with frailty, a binary logistic regression analysis was conducted. Variables significant in univariate analyses were included, with the last category serving as the reference group. Thirteen variables were examined, and collinearity diagnostics (VIF < 5) confirmed no multicollinearity. The analysis revealed that age, BMI, monthly income, living arrangement, visit frequency, pension insurance, smoking status, number of chronic diseases, and type of medication significantly predicted frailty (p < 0.05). Conversely, gender and marital status had no significant impact on frailty. Notably, BMI, living arrangements, visit frequency, and smoking status emerged as key risk factors. Overweight older adults were 5.139 times more likely to be frailty than those of normal weight (OR = 5.139, 95%CI = 2.111–12.507). Similarly, those who lived alone were 2.831 times more likely to be frail than those who did not (OR = 2.831, 95%CI = 1.958–4.092). Older adults who went out less frequently were 2.229 times more likely to be frail than those who went out more frequently (OR = 2.229, 95%CI = 1.509–3.294). Smokers are 1.443 times more likely to be weak than non-smokers (OR = 1.443, 95%CI = 1.034–2.015) (Table 3).

Table 3
www.frontiersin.org

Table 3. Multifactor analysis of frailty of older adult people living at home in community.

3.3 Application of decision tree, random forest, and XGBoost models in predicting frailty status of community-dwelling older adults

To enhance the performance of the prediction models and ensure comprehensive results, this study selected potential risk factors from univariate frailty analyses with p < 0.05 as the inclusion criterion. Thirteen independent variables were included for model construction. Three machine learning algorithms—decision tree, random forest, and XGBoost—were employed to predict frailty status among older adult individuals in eastern China. The dataset was split into 70% for training and 30% for testing.

The Decision Tree model uses recursive splitting to identify optimal features based on information gain or the Gini index. While interpretable, it is prone to overfitting, leading to poor generalization. In this study, the rpart package in R was used to build the model. To improve generalization, key parameters were optimized using grid search with the caret package. The optimal parameters were: (criterion = “gini”, max_depth = 7, min_samples_leaf = 3, min_samples_split = 8, splitter = “best”). The model achieved an AUC of 0.78 and an F1 score of 0.8239 on the training set, but the AUC dropped to 0.66 and the F1 score to 0.7649 on the test set, indicating some overfitting (Figures 3, 4).

Figure 3
www.frontiersin.org

Figure 3. ROC curves and AUC values of the three models.

Figure 4
www.frontiersin.org

Figure 4. Confusion matrix results for the training set (top) and test set (bottom) of the three models.

Random forest improves robustness and accuracy by constructing multiple decision trees and aggregating predictions through majority voting, reducing the risk of overfitting. The randomForest package in R was used, with 5-fold cross-validation (method = “cv,” number = 5) and grid search for parameter tuning. The optimal parameters were: (ntree = 500, mtry = 2, nodesize = 5). The model achieved an AUC of 0.99 and an F1 score of 0.9604 on the training set, but the AUC dropped to 0.73 and the F1 score to 0.7583 on the test set, revealing significant overfitting (Figures 3, 4).

XGBoost enhances predictive performance by training and combining weak classifiers. It is efficient for large datasets, handles missing values, and mitigates overfitting through regularization. The xgboost package in R was used, with 5-fold cross-validation and grid search for parameter tuning. The optimal parameters were: (nrounds = 100, max_depth = 4, eta = 0.1). The model achieved an AUC of 0.81 on the training set and 0.72 on the test set, with an F1 score of 0.7986 on the training set and 0.7595 on the test set, indicating a slight overfitting issue (Figures 3, 4).

3.4 Comparison of model performance and feature importance analysis

Machine learning algorithms, whether single or ensemble-based, are widely utilized in the medical field. In studies predicting frailty risk among older populations, data are often collected through questionnaires. Given the relatively small sample size and numerous variables, this study employs both common single algorithms and ensemble algorithms to construct a binary classification prediction model. These include decision tree, random forest, and XGBoost. The performance of these models was evaluated on a test set, as detailed in Table 4.

Table 4
www.frontiersin.org

Table 4. Comparison of three model evaluation indexes.

According to Table 4, the random forest model achieved an AUC of 0.7350 and an F1 score of 0.7583, outperforming the XGBoost model, which had an AUC of 0.7204 and an F1 score of 0.7595. The Decision Tree model had the lowest AUC at 0.6593, with an F1 score of 0.7649. While random forest exhibited the best performance, it also showed significant overfitting, likely due to its complexity in capturing noise in the training data. Overall, both random forest and XGBoost models performed better than the Decision Tree model, mainly due to their higher AUC values.

To further analyze model performance, the importance and xgb.importance functions in R were used to rank and visualize variable importance in both the random forest and XGBoost models. As depicted in Figures 5, 6, “Type of Medication” emerged as a key predictor of frailty risk in both models. Although the feature importance scores varied slightly, “Living Arrangement,” “Number of Chronic Diseases,” and “Sleep Duration” were also identified as significant factors. Given that XGBoost and random forest models excel at capturing complex and non-linear relationships, these factors should be prioritized in managing the daily lives of the older adult.

Figure 5
www.frontiersin.org

Figure 5. Visualize the feature importance in random forest model.

Figure 6
www.frontiersin.org

Figure 6. Visualize the feature importance in XGBoost model.

4 Discussion

In this study, among the 1,263 older adult participants, 818 were diagnosed with frailty syndrome, while 445 were not, yielding a frailty prevalence of 64.77% among community-dwelling older adult in eastern China. When compared to international studies, the pre-frailty rate was 50.9% in Northern Thailand (23), 55.5% in Indonesia (24), and 21% of participants in Australia were frail, with an additional 48% in the pre-frailty stage (25). The prevalence rates in this study are higher than those reported in other regions. This difference may be due to the use of the Tilburg Frailty Indicator, which assesses physical, psychological, and social dimensions, while other studies often rely on the Fried frailty phenotype, focusing solely on physical aspects. Moreover, the higher prevalence in this study might also be related to the COVID-19 pandemic. During this period, many older adult individuals stayed indoors for extended periods, leading to reduced physical activity and functional decline, which likely contributed to increased frailty risk.

The study reveals that frailty in older adult individuals is influenced by multiple factors, with BMI, living arrangement, visit frequency, and smoking status being the primary contributors among community-dwelling older adult in this research. BMI is a key indicator of nutritional status and is widely used to assess the impact of various diseases in the older adult, such as hypertension, dyslipidemia, and chronic conditions. The findings show that being overweight or obese significantly increases frailty risk among the older adult. Obesity and excess weight, as metabolic conditions, can accelerate functional impairments and disabilities, aligning with observations in Pemecutan Village, Bali (26). Yingzhen Gu’s research (27) further highlights a positive correlation between frailty risk and BMI, noting that higher fat mass, reduced physical activity, and muscle loss are associated with greater frailty risk. Additionally, longer duration of obesity exacerbates this risk. Kai Guo’s study (28) also links elevated TyG-BMI (triglyceride-glucose body mass index) to rapid frailty progression, emphasizing the challenge for older adult individuals to maintain stable weight. Given the frequent need for medication to manage blood glucose and triglyceride levels among these participants, identifying an optimal TyG-BMI range and developing strategies to maintain it is crucial. Thus, BMI plays a significant role in frailty among community-dwelling older adult, underscoring the importance of community workers and caregivers providing targeted interventions and encouraging older adult individuals to monitor their BMI to prevent excessive weight gain and mitigate frailty risk.

Living arrangement and visit frequency are significant factors influencing frailty. Older adult individuals who live alone or receive few visits face a higher risk of frailty. Social isolation or loneliness can directly impact their physiology through neuroendocrine and immune system responses, often leading to conditions like depression and cardiovascular diseases. These individuals may also display personality traits such as low self-esteem and poor self-control, making them more susceptible to stress and increasing their risk of frailty and mortality. This aligns with findings from Hoogendijk et al. (11). A Singaporean study (29) further demonstrated that active social engagement can slow the progression of frailty, highlighting the importance of promoting social participation and addressing loneliness to prevent frailty. A large-scale UK survey (30) suggested a bidirectional relationship between frailty and loneliness: frailty may increase future loneliness, while lonely older adult individuals are more likely to be inactive, leading to reduced muscle mass and strength, and potentially impacting their eating habits and overall health. To support older adult individuals who live alone or are infrequently visited, options like senior apartments or community-based shared living spaces should be considered. Communities can organize regular visits or group activities by staff or volunteers, which not only improve physical functioning but also help build social networks and reduce loneliness.

Smoking significantly increases the risk of frailty in older adults, with smokers being 1.443 times more likely to develop frailty than non-smokers. Research by Lv et al. (31) indicates that lifelong smoking, influenced by genetic factors, is associated with frailty, suggesting that smoking is a causal factor. The harmful chemicals in cigarettes can damage multiple organs, contributing to both physical and mental health issues that increase the likelihood of frailty. Chronic inflammation and oxidative stress are key pathways linking smoking to frailty. This study aligns with the findings of Marcel et al. (32), showing that smoking impacts physical health as well as mental and social well-being. Smoking may alter neural pathways, increasing sensitivity to stress and contributing to conditions like depression or anxiety. Additionally, the social stigma surrounding smoking can lead to feelings of isolation and loneliness. Therefore, quitting smoking can improve physical health, enhance mental well-being, and increase social engagement in older adults. In addition to the previously mentioned factors, two key features identified in frailty risk prediction models warrant special attention: Type of Medication and Number of Chronic Diseases. Lv et al. (5) found that the cumulative effect of chronic diseases significantly raises the risk of frailty among the older adult. This study confirmed that the number of chronic diseases is a critical predictor of frailty. Moreover, an increased number of medications is strongly associated with frailty (OR = 1.156). Polypharmacy can lead to drug interactions, which may exacerbate frailty risk (33). The type of medication is also a critical factor, as a higher number of medications increases the likelihood of adverse reactions and interactions, potentially accelerating physical decline and raising the likelihood of frailty.

Machine learning, a widely used data mining tool, has been used to predict vulnerability risk in older adults. A study by Wang et al. (34) used a decision tree model to predict the risk of physical limitations in long-term care in older adults. Although these models are intuitive and easy to interpret, they tend to overfit and have lower predictive performance than other models. This is consistent with the results of this study, in which decision trees demonstrated their effectiveness in initial screening, but were less effective in predicting fatal risk. In contrast, the random forest model shows better generalization ability and is especially good at processing high-dimensional data. Wu’s study (35) confirmed the high accuracy and stability of random forest model in predicting decline trajectory, and affirmed its practicability in predicting vulnerability risk. The work of Han and Wang (36), and Noh (37) demonstrated the ability of XGBoost models to handle nonlinear problems and large-scale data, effectively capturing complex relationships between variables. In conclusion, both random forest and XGBoost models have their advantages in vulnerability risk prediction. These models will play an important role in future vulnerability risk assessment and health management for older people. By analyzing the main characteristics and influencing factors of vulnerability risk, it is helpful to deeply understand the degree of vulnerability risk, strengthen early prevention, and effectively mitigate its occurrence and development, so as to improve the quality of life of the older adult. The study also highlights the potential of machine learning technologies for use in healthcare. Provides new ideas for data analysis and forecasting in healthcare (36). These technologies can be widely used to predict and manage other health problems, thereby improving the efficiency and accuracy of the healthcare system. By promoting frailty risk prediction models based on machine learning, frailty in the older adult can be better addressed and healthy aging of the global older population can be promoted (2). The multi-dimensional intervention strategy emphasized in this study also provides an important reference for global public health policy making.

5 Limitation

This cross-sectional study does not establish causality between frailty and its associated risk factors, precluding the model from making prospective predictions about future frailty. Therefore, future research should incorporate longitudinal studies to elucidate the causal mechanisms and progression of frailty. Additionally, the study’s sample, drawn exclusively from eastern China, limits the generalizability of the findings to older adult populations in other regions. Moreover, the decision tree, random forest, and XGBoost models lack external validation. Future work should prioritize multi-center external validation to enhance model predictive performance and develop a frailty prediction model applicable to older adult individuals across the country.

6 Conclusion

In this research, BMI, living arrangements, visit frequency, and smoking status emerged as critical predictors of frailty among the older adult. A machine learning approach was utilized to forecast frailty risk within a community in eastern China. Ensemble models like random forest and XGBoost exhibited enhanced generalizability and robustness over single-model algorithms. Early identification of at-risk individuals and the implementation of personalized interventions can effectively slow frailty progression and improve quality of life in older age.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical approval for this study was obtained from the Zhengzhou University Life Science Ethics Committee (Approval Code: ZZUIRB2022-07). All participants provided informed consent, and the study adhered to local regulations and institutional guidelines. Written consent was secured from each participant before their involvement.

Author contributions

LQ: Writing – original draft, Writing – review & editing. JL: Conceptualization, Writing – review & editing. XS: Writing – review & editing. XW: Writing – review & editing. MY: Writing – review & editing. XC: Writing – review & editing. YH: Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. Funding sources include the Henan Province Key Research and Development and Promotion Special Projects (Scientific and Technological Research; grant no. 212102310814), the National Key R&D Program for Active Health and Aging Science and Technology Response Key Special Project (grant no. 2020YFC2006100), and the Annual Project of Henan Provincial Philosophy and Social Science Planning (grant no. 2020BSH017).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Hoogendijk, EO, Afilalo, J, Ensrud, KE, Kowal, P, Onder, G, and Fried, LP. Frailty: implications for clinical practice and public health. Lancet. (2019) 394:1365–75. doi: 10.1016/S0140-6736(19)31786-6

PubMed Abstract | Crossref Full Text | Google Scholar

2. Thillainadesan, J, Scott, IA, and Le Couteur, DG. Frailty, a multisystem ageing syndrome. Age Ageing. (2020) 49:758–63. doi: 10.1093/ageing/afaa112

PubMed Abstract | Crossref Full Text | Google Scholar

3. Satake, S, and Arai, H. Chapter 1 frailty: definition, diagnosis, epidemiology. Geriatr Gerontol Int. (2020) 20:7–13. doi: 10.1111/ggi.13830

PubMed Abstract | Crossref Full Text | Google Scholar

4. Buckinx, F, Rolland, Y, Reginster, JY, Ricour, C, Petermans, J, and Bruyère, O. Burden of frailty in the elderly population: perspectives for a public health challenge. Arch Public Health. (2015) 73:19. doi: 10.1186/s13690-015-0068-x

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lv, J, Li, R, Yuan, L, Yang, XL, Wang, Y, Ye, ZW, et al. Research on the frailty status and adverse outcomes of elderly patients with multimorbidity. BMC Geriatr. (2022) 22:560. doi: 10.1186/s12877-022-03194-1

PubMed Abstract | Crossref Full Text | Google Scholar

6. Pandey, A, Kitzman, D, and Reeves, G. Frailty is intertwined with heart failure: mechanisms, prevalence, prognosis, assessment, and management. JACC Heart Fail. (2019) 7:1001–11. doi: 10.1016/j.jchf.2019.10.005

PubMed Abstract | Crossref Full Text | Google Scholar

7. Bu, F, Deng, XH, Zhan, NN, Cheng, H, Wang, ZL, Tang, L, et al. Development and validation of a risk prediction model for frailty in patients with diabetes. BMC Geriatr. (2023) 23:172. doi: 10.1186/s12877-023-03823-3

PubMed Abstract | Crossref Full Text | Google Scholar

8. Ramanathan, PL, Muthusamy, S, Kumar, JK, and Krishnan, A. Prevalence of frailty among chronic respiratory disease patients. Lung India. (2024) 41:181–4. doi: 10.4103/lungindia.lungindia_464_23

PubMed Abstract | Crossref Full Text | Google Scholar

9. Nascimento, CM, Ingles, M, Salvador-Pascual, A, Cominetti, MR, Gomez-Cabrera, MC, and Viña, J. Sarcopenia, frailty and their prevention by exercise. Free Radic Biol Med. (2019) 132:42–9. doi: 10.1016/j.freeradbiomed.2018.08.035

PubMed Abstract | Crossref Full Text | Google Scholar

10. Zhang, Y, Yu, G, Bai, W, Wu, S, Geng, X, Zhang, W, et al. Association of depression and sleep quality with frailty: a cross-sectional study in China. Front Public Health. (2024) 12:1361745. doi: 10.3389/fpubh.2024.1361745

PubMed Abstract | Crossref Full Text | Google Scholar

11. Hoogendijk, EO, Smit, AP, van Dam, C, Schuster, NA, de Breij, S, Holwerda, TJ, et al. Frailty combined with loneliness or social isolation: an elevated risk for mortality in later life. J Am Geriatr Soc. (2020) 68:2587–93. doi: 10.1111/jgs.16716

PubMed Abstract | Crossref Full Text | Google Scholar

12. De Lepeleire, J, Iliffe, S, Mann, E, and Degryse, JM. Frailty: an emerging concept for general practice. Br J Gen Pract. (2009) 59:e177–82. doi: 10.3399/bjgp09X420653

PubMed Abstract | Crossref Full Text | Google Scholar

13. Gobbens, RJ, van Assen, MA, Luijkx, KG, Wijnen-Sponselee, MT, and Schols, JM. The Tilburg frailty Indicator: psychometric properties. J Am Med Dir Assoc. (2010) 11:344–55. doi: 10.1016/j.jamda.2009.11.003

PubMed Abstract | Crossref Full Text | Google Scholar

14. Choi, J, Ahn, A, Kim, S, and Won, CW. Global prevalence of physical frailty by Fried's criteria in community-dwelling elderly with National Population-Based Surveys. J Am Med Dir Assoc. (2015) 16:548–50. doi: 10.1016/j.jamda.2015.02.004

PubMed Abstract | Crossref Full Text | Google Scholar

15. Op Het Veld, LP, Van Rossum, E, Kempen, GI, De Vet, HC, Hajema, K, and Beurskens, AJ. Fried phenotype of frailty: cross-sectional comparison of three frailty stages on various health domains. BMC Geriatr. (2015) 15:77. doi: 10.1186/s12877-015-0078-0

PubMed Abstract | Crossref Full Text | Google Scholar

16. de Vries, NM, Staal, JB, van Ravensberg, CD, Hobbelen, JS, Olde Rikkert, MG, and Nijhuis-van der Sanden, MW. Outcome instruments to measure frailty: a systematic review. Ageing Res Rev. (2011) 10:104–14. doi: 10.1016/j.arr.2010.09.001

PubMed Abstract | Crossref Full Text | Google Scholar

17. Le Cossec, C, Perrine, AL, Beltzer, N, Fuhrman, C, and Carcaillon-Bentata, L. Pre-frailty, frailty, and multimorbidity: prevalences and associated characteristics from two French National Surveys. J Nutr Health Aging. (2016) 20:860–9. doi: 10.1007/s12603-016-0802-2

PubMed Abstract | Crossref Full Text | Google Scholar

18. Jayanama, K, Theou, O, Godin, J, Mayo, A, Cahill, L, and Rockwood, K. Relationship of body mass index with frailty and all-cause mortality among middle-aged and older adults. BMC Med. (2022) 20:404. doi: 10.1186/s12916-022-02596-7

PubMed Abstract | Crossref Full Text | Google Scholar

19. Apóstolo, J, Cooke, R, Bobrowicz-Campos, E, Santana, S, Marcucci, M, Cano, A, et al. Effectiveness of interventions to prevent pre-frailty and frailty progression in older adults: a systematic review. JBI Database System Rev Implement Rep. (2018) 16:140–232. doi: 10.11124/JBISRIR-2017-003382

PubMed Abstract | Crossref Full Text | Google Scholar

20. Travers, J, Romero-Ortuno, R, Bailey, J, and Cooney, MT. Delaying and reversing frailty: a systematic review of primary care interventions. Br J Gen Pract. (2019) 69:e61–9. doi: 10.3399/bjgp18X700241

PubMed Abstract | Crossref Full Text | Google Scholar

21. Morley, JE, Vellas, B, van Kan, GA, Anker, SD, Bauer, JM, Bernabei, R, et al. Frailty consensus: a call to action. J Am Med Dir Assoc. (2013) 14:392–7. doi: 10.1016/j.jamda.2013.03.022

PubMed Abstract | Crossref Full Text | Google Scholar

22. Gobbens, RJ, van Assen, MA, Luijkx, KG, and Schols, JM. The predictive validity of the Tilburg frailty Indicator: disability, health care utilization, and quality of life in a population at risk. Gerontologist. (2012) 52:619–31. doi: 10.1093/geront/gnr135

PubMed Abstract | Crossref Full Text | Google Scholar

23. Thinuan, P, Siviroj, P, Lerttrakarnnon, P, and Lorga, T. Prevalence and potential predictors of frailty among community-dwelling older persons in northern Thailand: a cross-sectional study. Int J Environ Res Public Health. (2020) 17:4077. doi: 10.3390/ijerph17114077

PubMed Abstract | Crossref Full Text | Google Scholar

24. Pradana, AA, Chiu, HL, Lin, CJ, and Lee, SC. Prevalence of frailty in Indonesia: a systematic review and meta-analysis. BMC Geriatr. (2023) 23:778. doi: 10.1186/s12877-023-04468-y

PubMed Abstract | Crossref Full Text | Google Scholar

25. Thompson, MQ, Theou, O, Karnon, J, Adams, RJ, and Visvanathan, R. Frailty prevalence in Australia: findings from four pooled Australian cohort studies. Australas J Ageing. (2018) 37:155–8. doi: 10.1111/ajag.12483

PubMed Abstract | Crossref Full Text | Google Scholar

26. Triguna, IPB, Kuswardhani, RAT, and Purnami, NKR. Correlation between body mass index and frailty in elderly men at Pemecutan Village, West Denpasar District, Bali. J Penyakit Dalam Udayana. (2021) 5:9–13. doi: 10.36216/jpd.v5i1.162

Crossref Full Text | Google Scholar

27. Gu, Y, Li, Z, Dang, A, Zhang, W, Liu, J, Han, X, et al. Obesity, birth weight, and lifestyle factors for frailty: a Mendelian randomization study. Aging (Albany NY). (2023) 15:14066–85. doi: 10.18632/aging.205290

PubMed Abstract | Crossref Full Text | Google Scholar

28. Guo, K, Wang, Q, Zhang, L, Qiao, R, Huo, Y, Jing, L, et al. Relationship between changes in the triglyceride glucose-body mass index and frail development trajectory and incidence in middle-aged and elderly individuals: a national cohort study. Cardiovasc Diabetol. (2024) 23:304. doi: 10.1186/s12933-024-02373-1

PubMed Abstract | Crossref Full Text | Google Scholar

29. Ge, L, Yap, CW, and Heng, BH. Associations of social isolation, social participation, and loneliness with frailty in older adults in Singapore: a panel data analysis. BMC Geriatr. (2022) 22:26. doi: 10.1186/s12877-021-02745-2

PubMed Abstract | Crossref Full Text | Google Scholar

30. Gale, CR, Westbury, L, and Cooper, C. Social isolation and loneliness as risk factors for the progression of frailty: the English longitudinal study of ageing. Age Ageing. (2018) 47:392–7. doi: 10.1093/ageing/afx188

PubMed Abstract | Crossref Full Text | Google Scholar

31. Lv, J, Wu, L, Sun, S, Yu, H, Shen, Z, Xu, J, et al. Smoking, alcohol consumption, and frailty: a Mendelian randomization study. Front Genet. (2023) 14:1092410. doi: 10.3389/fgene.2023.1092410

PubMed Abstract | Crossref Full Text | Google Scholar

32. van Assen, MALM, Helmink, JHM, and Gobbens, RJJ. Associations between lifestyle factors and multidimensional frailty: a cross-sectional study among community-dwelling older people. BMC Geriatr. (2022) 22:7. doi: 10.1186/s12877-021-02704-x

PubMed Abstract | Crossref Full Text | Google Scholar

33. Setiati, S, Soejono, CH, Harimurti, K, Dwimartutie, N, Aryana, IGPS, Sunarti, S, et al. Frailty and its associated risk factors: first phase analysis of multicentre Indonesia longitudinal aging study. Front Med (Lausanne). (2021) 8:658580. doi: 10.3389/fmed.2021.658580

PubMed Abstract | Crossref Full Text | Google Scholar

34. Wang, J, Chen, H, Wang, H, Liu, W, Peng, D, Zhao, Q, et al. A risk prediction model for physical restraints among older Chinese adults in long-term care facilities: machine learning study. J Med Internet Res. (2023) 25:e43815. doi: 10.2196/43815

PubMed Abstract | Crossref Full Text | Google Scholar

35. Wu, Y, Jia, M, Xiang, C, and Fang, Y. Latent trajectories of frailty and risk prediction models among geriatric community dwellers: an interpretable machine learning perspective. BMC Geriatr. (2022) 22:900. doi: 10.1186/s12877-022-03576-5

PubMed Abstract | Crossref Full Text | Google Scholar

36. Han, Y, and Wang, S. Disability risk prediction model based on machine learning among Chinese healthy older adults: results from the China health and retirement longitudinal study. Front Public Health. (2023) 11:1271595. doi: 10.3389/fpubh.2023.1271595

PubMed Abstract | Crossref Full Text | Google Scholar

37. Noh, B, Youm, C, Goh, E, Lee, M, Park, H, Jeon, H, et al. XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes. Sci Rep. (2021) 11:12183. doi: 10.1038/s41598-021-91797-w

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: frailty, community-dwelling older adults, random forest, XGBoost, Tilburg frailty indicator, risk prediction

Citation: Qi L, Liu J, Song X, Wang X, Yang M, Cao X and He Y (2025) Determinants and risk prediction models for frailty among community-living older adults in eastern China. Front. Public Health. 13:1518472. doi: 10.3389/fpubh.2025.1518472

Received: 28 October 2024; Accepted: 10 February 2025;
Published: 11 March 2025.

Edited by:

Jie Li, Huazhong University of Science and Technology, China

Reviewed by:

José Alexandre Bachur, University of Franca, Brazil
Mack Shelley, Iowa State University, United States
Sandra Milena Castellar, National University of Colombia, Colombia

Copyright © 2025 Qi, Liu, Song, Wang, Yang, Cao and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yan He, aDE4NjM4Nzg3Mzk3QDE2My5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Research integrity at Frontiers

Man ultramarathon runner in the mountains he trains at sunset

94% of researchers rate our articles as excellent or good

Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


Find out more