- Department of Clinical Nutrition, Beijing Shijitan Hospital, Capital Medical University, Beijing, China
Objective: Nutritional intervention prior to the occurrence of cachexia will significantly improve the survival rate of lung cancer patients. This study aimed to establish an ensemble learning model based on anthropometry and blood indicators without information on body weight loss to identify the risk factors of cachexia for early administration of nutritional support and for preventing the occurrence of cachexia in lung cancer patients.
Methods: This multicenter study included 4,712 lung cancer patients. The least absolute shrinkage and selection operator (LASSO) method was used to obtain the key indexes. The characteristics excluded weight loss information, and the study data were randomly divided into a training set (70%) and a test set (30%). The training set was used to select the optimal model among 18 machine learning models, and the test set was used to verify model performance. Performance in predicting the occurrence of cachexia was determined using area under the curve (AUC), accuracy, precision, recall, F1 score, Matthews correlation coefficient (MCC), and kappa.
Results: Among 4,712 patients, 1,392 (29.5%) patients were diagnosed with cachexia based on the framework of Fearon et al. A 17-variable gradient boosting classifier (GBC) model including body mass index (BMI), feeding situation, tumor stage, neutrophil-to-lymphocyte ratio (NLR), and some gastrointestinal symptoms was selected among the 18 machine learning models. The GBC model showed good performance in predicting cachexia in the training set (AUC = 0.854, accuracy = 0.819, precision = 0.771, recall = 0.574, F1 score = 0.658, MCC = 0.549, and kappa = 0.538). These indicator values were confirmed in the test set (AUC = 0.859, accuracy = 0.818, precision = 0.801, recall = 0.550, F1 score = 0.652, MCC = 0.552, and kappa = 0.535). The learning curve, decision boundary, precision-recall (PR) curve, receiver operating characteristic (ROC) curve, classification report, and confusion matrix in the test set demonstrated good performance. The feature importance diagram showed the contribution of each feature to the model.
Conclusions: The GBC model established in this study could facilitate the identification of cancer cachexia in lung cancer patients without weight loss information, which would guide early implementation of nutritional interventions to decrease the occurrence of cachexia and improve the overall survival (OS).
1 Introduction
Lung cancer, the most prevalent malignancy globally and a primary contributor to cancer-related fatalities (constituting ~18% of all cancer deaths) (1), is often accompanied by cachexia. Cachexia is a multifactorial and multi-organ syndrome prevalent in the late stages of chronic conditions (2), especially among cancer patients. As a metabolic syndrome, it is characterized by loss of muscle mass, with or without concurrent loss of fat mass, and diminished bodily functions. This condition is frequently attributed to an inflammatory state or reduced food intake (2). It manifests in nearly 50% of lung cancer patients and indirectly contributes to at least 20% of deaths among all cancer patients (3, 4). Cachexia during cancer treatment is a severe complication because it is linked to increased chemotherapy-related side effects, fewer completed cycles of chemotherapy, and reduced survival rates (5, 6).
Despite the widespread occurrence of cachexia in clinical practice, its prevention, early identification, and intervention remain a challenge. The impact of cancer cachexia on quality of life, treatment-related toxicity (7, 8), physical function impairment, and mortality is well documented. However, relying on weight loss to establish a clinically meaningful definition for the diagnosis of cachexia has proven challenging, because patients often cannot recall their weight history and accurate weight loss information is therefore lacking. Early identification and management of cachexia are critical for its prevention. Moreover, cachexia is characterized not just by weight loss but also by changes in body composition and loss of body function (9). Importantly, emaciation is a late symptom of cachexia and a characteristic of its advanced phase. Although attempts to define cachexia more comprehensively through body composition, body function, and molecular biomarkers are promising, they have not been routinely incorporated into clinical practice (9–11). To date, there are no effective pharmacological interventions that can completely reverse cachexia. Adequate and early nutritional support remains the mainstay of cachexia treatment (12).
In the established diagnostic framework (12), cachexia is defined as an involuntary weight loss of more than 5% within the past 6 months, or a body mass index (BMI) of <20 kg/m² with sustained weight loss of more than 2%. The definition also includes cases of sarcopenia combined with sustained weight loss of more than 2%. Cancer cachexia is categorized into three stages: pre-cachexia, cachexia, and refractory cachexia. Notably, refractory cachexia has been the focus of numerous clinical trials of novel interventional drugs (12). However, demonstrating therapeutic efficacy at this stage is challenging because of the catabolic state's resistance to anticancer therapy, the low performance status, and an expected survival of <3 months. Currently, the focus of academic research has shifted from advanced cachexia to the etiology of cachexia (13). Evidence has shown that different inflammatory, metabolic, and neuromodulatory drivers can initiate processes that eventually lead to advanced cachexia (14).
This study aimed to use nutrition-related parameters without weight loss information to establish an ensemble learning model for predicting and diagnosing cachexia in lung cancer patients before weight loss occurs or when weight loss information is unavailable, which would be beneficial for administering early nutritional support and improving the overall survival (OS) of lung cancer patients.
2 Methods
2.1 Population
All patients were enrolled from the Investigation on Nutrition Status and its Clinical Outcome of Common Cancers (INSCOC) project, which enrolled participants from various clinical centers across China, starting in 2013. The trial registration can be accessed at the Chinese Clinical Trial Registry under the identifier: ChiCTR1800020329. The design, methods, and development of the INSCOC study were conducted as previously described (15).
A total of 5,160 patients diagnosed with lung cancer through pathological examination were admitted for cancer treatment starting in 2013. After excluding 188 participants due to the presence of other primary tumors and 260 participants who were unable to recall weight loss information, we finally included 4,712 patients. Prior to the initiation of the study, all selected patients signed informed consent forms within 48 h of hospital admission. The study protocol adhered to the ethical guidelines outlined in the 1975 Declaration of Helsinki and received approval from the Institutional Review Committee of Beijing Shijitan Hospital.
2.2 Baseline data collection
The baseline information of the patients encompassed age, sex, smoking history, comorbidities such as diabetes, hypertension, and anemia, a family history of cancer, and weight loss (with at least 1 month of weight loss information recall). In addition, hematological examination indicators such as creatinine, total protein, albumin, prealbumin, blood urea nitrogen (BUN), total bilirubin (TBil), direct bilirubin (DBil), aspartate aminotransferase (AST), alanine aminotransferase (ALT), blood glucose, hemoglobin, red blood cell (RBC) count, white blood cell (WBC) count, neutrophils, lymphocytes, and platelets (PLT) were also included in the study. A comprehensive interview, conducted by a dietitian or clinician, was carried out with each patient to collect information on recent nutrition. This included assessments such as the Nutritional Risk Screening 2002 (NRS-2002), the Karnofsky Performance Score (KPS), and the European Organization for Research and Treatment of Cancer QLQ-C30 score (QLQ-C30). The NRS-2002 is a tool for nutritional risk screening that was established through the analysis of 128 trials. It has been endorsed by the European Society for Clinical Nutrition and Metabolism (ESPEN) for use in clinical settings (16). NRS2002 partial, defined as the NRS-2002 with the weight loss information removed, was included in the model. In addition, we determined the value of appendicular skeletal muscle mass (ASM) based on earlier research: ASM (kg) = 0.193*body weight (kg) + 0.107*height (cm) – 4.157*sex (male: 1, female: 2) – 0.037*age (years) – 2.631. The appendicular skeletal muscle index (ASMI) was defined as ASM (kg)/height² (m²). Patients were classified as having low muscle mass when ASMI was <7.0 kg/m² for men or <5.4 kg/m² for women (17, 18). According to the etiologic criteria of the Global Leadership Initiative on Malnutrition (GLIM) (19), we included gastrointestinal symptoms, reduced food intake, and inflammatory status. Gastrointestinal symptoms included anorexia, nausea, and vomiting.
Four indicators, namely, neutrophil-to-lymphocyte ratio (NLR), advanced lung cancer inflammation index (ALI), platelet-to-lymphocyte ratio (PLR), and nutritional risk index (NRI), were included. The ALI was calculated from the following formula: BMI*albumin/NLR. The NRI was calculated from the following formula: 1.519*albumin + 41.7*present body weight/ideal body weight. The ideal body weight was defined as (height – 100)*0.9. Anthropometric measurements have been widely used as nutritional indicators to evaluate the nutritional status of individuals and populations (20). This study also included the following anthropometric indicators: height, weight, BMI (weight/height²), mid-arm circumference (MAC), triceps skinfold thickness (TSF), hand grip strength (HGS), mid-arm muscle circumference (MAMC), and calf circumference (CC). MAC was measured to the nearest 0.5 cm midway between the acromion and the olecranon. TSF was determined with a skinfold thickness meter. MAMC was calculated by using the following formula: MAMC (mm) = MAC (mm) – [3.14 × TSF (mm)]. HGS was measured by using a hand dynamometer, and CC was assessed at the thickest part of the calf with a flexible anthropometric tape.
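The ASM equation and the ASMI cut-offs described above can be sketched in a few lines of Python. This is a minimal illustration only: the function names and the two example patients are hypothetical, and the sex coding (1 = male, 2 = female) follows the text.

```python
def appendicular_skeletal_muscle_mass(weight_kg, height_cm, sex, age_years):
    """ASM (kg) from the anthropometric equation quoted in the text.

    sex is coded as in the text: 1 = male, 2 = female.
    """
    return (0.193 * weight_kg + 0.107 * height_cm
            - 4.157 * sex - 0.037 * age_years - 2.631)


def appendicular_skeletal_muscle_index(asm_kg, height_cm):
    """ASMI (kg/m^2) = ASM divided by height in metres squared."""
    height_m = height_cm / 100.0
    return asm_kg / height_m ** 2


def has_low_muscle_mass(asmi, sex):
    """Apply the cut-offs from the text: <7.0 for men, <5.4 for women."""
    cutoff = 7.0 if sex == 1 else 5.4
    return asmi < cutoff


# Hypothetical patients, for illustration only.
asm_man = appendicular_skeletal_muscle_mass(60.0, 170.0, sex=1, age_years=61)
asmi_man = appendicular_skeletal_muscle_index(asm_man, 170.0)

asm_woman = appendicular_skeletal_muscle_mass(45.0, 158.0, sex=2, age_years=65)
asmi_woman = appendicular_skeletal_muscle_index(asm_woman, 158.0)

print(round(asm_man, 3), round(asmi_man, 2), has_low_muscle_mass(asmi_man, 1))
print(round(asm_woman, 3), round(asmi_woman, 2), has_low_muscle_mass(asmi_woman, 2))
```

Because the equation uses body weight and height directly, the resulting ASMI is strongly tied to BMI, which is worth keeping in mind when interpreting its contribution alongside BMI in a model.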
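As a worked illustration of the formulas in this paragraph, the following Python sketch computes the derived indices for one hypothetical patient. NLR and PLR are taken as the plain ratios their names describe. The text does not state the units used for albumin or the blood counts, so the values below are placeholders and only the arithmetic is meaningful; all function names are illustrative.

```python
def bmi(weight_kg, height_cm):
    """Body mass index: weight (kg) / height (m) squared."""
    return weight_kg / (height_cm / 100.0) ** 2


def nlr(neutrophils, lymphocytes):
    """Neutrophil-to-lymphocyte ratio."""
    return neutrophils / lymphocytes


def plr(platelets, lymphocytes):
    """Platelet-to-lymphocyte ratio."""
    return platelets / lymphocytes


def ali(bmi_value, albumin, nlr_value):
    """Advanced lung cancer inflammation index: BMI * albumin / NLR."""
    return bmi_value * albumin / nlr_value


def ideal_body_weight(height_cm):
    """Ideal body weight as defined in the text: (height - 100) * 0.9."""
    return (height_cm - 100.0) * 0.9


def nri(albumin, weight_kg, height_cm):
    """Nutritional risk index: 1.519*albumin + 41.7*weight/ideal weight."""
    return 1.519 * albumin + 41.7 * weight_kg / ideal_body_weight(height_cm)


def mamc(mac_mm, tsf_mm):
    """Mid-arm muscle circumference: MAC - 3.14 * TSF (all in mm)."""
    return mac_mm - 3.14 * tsf_mm


# Hypothetical values for one patient (units are assumptions).
patient_bmi = bmi(60.0, 170.0)
patient_nlr = nlr(4.0, 2.0)
patient_plr = plr(250.0, 2.0)
patient_ali = ali(patient_bmi, 40.0, patient_nlr)
patient_nri = nri(40.0, 60.0, 170.0)
patient_mamc = mamc(250.0, 12.0)
```

Note that MAC is recorded in centimetres in the measurement protocol, so it would need converting to millimetres before the MAMC formula is applied.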
2.3 Statistical analysis
Continuous variables were expressed as median (M) and interquartile range (IQR), while categorical variables were presented as frequency and percentage. Group comparisons of patients' tumor characteristics used the chi-squared test for categorical variables and the independent-samples t-test for continuous variables. The analyses were performed using Python Version 3.7.3 and IBM SPSS (Version 26.0; IBM, Armonk, NY, USA). Statistical significance was defined as a two-sided P-value of <0.05.
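For readers unfamiliar with the chi-squared test used for categorical comparisons, the sketch below computes the Pearson statistic for a 2 × 2 table with the Python standard library. It assumes no continuity correction, and the counts are hypothetical rather than study data. For one degree of freedom the P-value has the closed form erfc(sqrt(chi²/2)), which is used here instead of a statistics package.

```python
import math


def chi_squared_2x2(table):
    """Pearson chi-squared test (no continuity correction) for a 2x2 table.

    table is [[a, b], [c, d]]; returns (statistic, two-sided P-value).
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = [a + b, c + d]
    col_totals = [a + c, b + d]
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    # Survival function of the chi-squared distribution with 1 df.
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Hypothetical counts: rows = symptom present/absent, columns = cachexia yes/no.
statistic, p_value = chi_squared_2x2([[10, 20], [20, 10]])
print(round(statistic, 3), p_value < 0.05)
```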
3 Results
3.1 Baseline characteristics
A total of 4,712 lung cancer patients from a multicenter study were analyzed. A flow chart of participant selection is shown in Figure 1, and the baseline characteristics of the study population are displayed in Table 1, encompassing routinely available data on demographics, pathology, tumor stages, blood indices, anthropometric parameters, weight loss, feeding situation, gastrointestinal symptoms, and the physical performance status of patients. A total of 1,392 patients were found to have cachexia, accounting for 29.5% of all patients, which was consistent with previous data (21). In addition, the prevalence of cachexia in patients with advanced disease was 49.7%, which was consistent with data of previous studies (50%) (3). There were 2,410 (51.1%) patients with non-small cell lung cancer (NSCLC), 602 (12.8%) with small cell lung cancer (SCLC), and 1,700 (36.1%) patients with lung cancer who lacked a clear pathological classification. The median age of patients was 61 [54, 67] years, and most patients were male (66.6%, n = 3,139). Among the patients, 59.8% were smokers, and a substantial number (49.7%) exhibited distant metastasis. There were 1,987 (39.96%) patients with nutritional risk assessed by NRS2002 partial excluding weight loss information and 2,304 (46.34%) patients with nutritional risk assessed with NRS2002 including weight loss information (chi-squared test, P < 0.05). The four indicators NLR, ALI, PLR, and NRI differed significantly between patients with and without cachexia (P < 0.05). Patients with cachexia showed higher NLR, higher PLR, lower NRI, and lower ALI, which was consistent with the observations in previous studies (22–25).
3.2 Feature selection using the least absolute shrinkage and selection operator (LASSO)
The study population was divided into a training set (n = 3,298, 70%) for model derivation and a test set (n = 1,414, 30%) for assessing model performance. The LASSO (26) approach with 10-fold cross-validation was used to eliminate redundant features before modeling (Figures 2A, B). By introducing an L1 penalty term, LASSO shrinks the coefficients of some variables, setting the least informative ones to exactly zero. It is well suited to high-dimensional data and filters variables effectively. The GBC model is an ensemble learning model based on decision trees and has strong generalization ability. The optimal model, based on 17 variables selected from the training and test sets, is detailed in Table 2. The 17 features selected by LASSO were BMI, feed, NRS2002 partial, NLR, EORTC QLQ-C30, prealbumin, handgrip strength, albumin, anorexia, hemoglobin, vomiting, activity, pathology, TSF, tumor stage, age, and comorbidity. Pearson's correlation analysis indicated no collinearity/multicollinearity among the LASSO-selected features (Figure 2C), supporting their use in subsequent model establishment.
Figure 2. Feature selection using the least absolute shrinkage and selection operator (LASSO). (A) The LASSO coefficient profiles and cross-validation for the classification model. (B) The LASSO regression coefficient path diagram. (C) Pearson's correlation analysis of the 17 variables. BMI, body mass index; EORTC, European Organization for Research and Treatment of Cancer; EORTC QLQ-C30, The EORTC QLG Core Questionnaire; NLR, neutrophil-to-lymphocyte ratio; NRS-2002 partial, Nutritional Risk Screening 2002 partial.
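The 70/30 split and the 10-fold partition of the training set can be illustrated with a short standard-library sketch. The seed, the function names, and the use of a simple (non-stratified) shuffle are assumptions made for illustration; the text states only that the split was random.

```python
import random


def split_indices(n_samples, train_fraction=0.7, seed=0):
    """Randomly split sample indices into training and test index lists."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seeded for reproducibility
    cut = int(train_fraction * n_samples)
    return indices[:cut], indices[cut:]


def k_folds(indices, k=10):
    """Partition indices into k folds of near-equal size."""
    return [indices[i::k] for i in range(k)]


train_idx, test_idx = split_indices(4712)
folds = k_folds(train_idx, k=10)

print(len(train_idx), len(test_idx))   # 3298 1414
print([len(fold) for fold in folds])   # eight folds of 330, two of 329
```

In each cross-validation round, one fold is held out for validation and the remaining nine are used for fitting, so every training-set patient is used for validation exactly once.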
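To make the shrinkage mechanism of LASSO concrete, the sketch below fits an L1-penalized linear regression by coordinate descent in plain Python. It is a simplified stand-in, not the procedure used in the study: it uses a squared-error loss, a fixed penalty instead of one chosen by 10-fold cross-validation, and a tiny hypothetical data set with two centered features. The point is only to show how a weak predictor's coefficient is set to exactly zero while a strong predictor's coefficient is shrunk.

```python
def soft_threshold(value, penalty):
    """Shrink value toward zero by penalty; return 0 if it does not exceed it."""
    if value > penalty:
        return value - penalty
    if value < -penalty:
        return value + penalty
    return 0.0


def lasso_coordinate_descent(X, y, alpha, n_iter=100):
    """Minimize (1/2n)*||y - X b||^2 + alpha*||b||_1 (no intercept term)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            rho = 0.0   # correlation of feature j with the partial residual
            norm = 0.0  # squared norm of feature j
            for i in range(n):
                others = sum(X[i][k] * beta[k] for k in range(p) if k != j)
                rho += X[i][j] * (y[i] - others)
                norm += X[i][j] ** 2
            beta[j] = soft_threshold(rho / n, alpha) / (norm / n)
    return beta


# Hypothetical centered features: column 0 is a strong predictor, column 1 a weak one.
X = [[1.0, 1.0], [1.0, -1.0], [-1.0, 1.0], [-1.0, -1.0]]
y = [2.1, 1.9, -1.9, -2.1]   # equals 2.0 * column 0 + 0.1 * column 1

beta = lasso_coordinate_descent(X, y, alpha=0.5)
print(beta)  # strong coefficient shrunk from 2.0 to about 1.5, weak one dropped
```

Features whose coefficients remain nonzero at the selected penalty are the ones carried forward, which is how a reduced variable set such as the 17 features above arises.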
The learning curve showed that, as the amount of data increased, the GBC model tended to converge, with the score converging at ~0.85 (Figure 3A). The decision boundary of the GBC model in the test set (Figure 3B) showed the non-linear boundary between cachexia and non-cachexia in the GBC model classification. The feature importance diagram (Figure 3C) showed the contribution of each feature to the model. BMI was the most important feature in the GBC ensemble learning model (Figure 3C). The top five features contributing to the GBC model were BMI, feed, NRS2002 partial, NLR, and EORTC QLQ-C30. At present, the standardized nutritional support therapy steps recommended by domestic and foreign guidelines include nutritional screening, nutritional assessment, and nutritional intervention and monitoring (27). Nutritional screening is the first step. The “nutritional risk” derived from NRS2002 is relevant to patient clinical outcomes, has a basis in evidence-based medicine, has been validated in retrospective and prospective clinical studies, and is currently the preferred screening tool recommended by many guidelines (28–30). QLQ-C30 consists of 30 questions that measure patients' quality of life in terms of function and symptoms and is the most widely used international instrument for measuring quality of life in cancer patients (31). In addition, low BMI and reduced food intake, both of which are recognized in clinical practice, and NLR, a marker of systemic inflammation that has been shown to be significantly related to nutrition and prognosis (32, 33), contributed significantly to the GBC model for predicting cachexia.
Figure 3. (A) Learning curve of the GBC using the training data. (B) Decision boundary of the GBC using the test data. PCA, principal component analysis. (C) The feature importance in the GBC ensemble learning model using the test data.
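The idea behind a gradient boosting classifier, an additive ensemble of small decision trees in which each tree is fitted to the errors of the current ensemble, can be sketched without any library. The example below is a deliberately minimal illustration under several assumptions: a single hypothetical feature (BMI), depth-1 trees (stumps), a log-loss objective, and made-up labels. It is not the model reported in this study.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_stump(xs, residuals):
    """Least-squares regression stump: returns (threshold, left, right)."""
    best = None
    values = sorted(set(xs))
    for lo, hi in zip(values, values[1:]):
        thr = (lo + hi) / 2.0
        left = [r for x, r in zip(xs, residuals) if x <= thr]
        right = [r for x, r in zip(xs, residuals) if x > thr]
        left_value = sum(left) / len(left)
        right_value = sum(right) / len(right)
        sse = (sum((r - left_value) ** 2 for r in left)
               + sum((r - right_value) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, thr, left_value, right_value)
    return best[1], best[2], best[3]


def fit_boosted_stumps(xs, ys, n_rounds=50, learning_rate=0.5):
    """Gradient boosting on log-loss with stumps as weak learners."""
    prevalence = sum(ys) / len(ys)
    base_score = math.log(prevalence / (1.0 - prevalence))  # initial log-odds
    scores = [base_score] * len(xs)
    stumps = []
    for _ in range(n_rounds):
        # Negative gradient of the log-loss with respect to the score.
        residuals = [y - sigmoid(s) for y, s in zip(ys, scores)]
        thr, left_value, right_value = fit_stump(xs, residuals)
        stumps.append((thr, left_value, right_value))
        scores = [s + learning_rate * (left_value if x <= thr else right_value)
                  for x, s in zip(xs, scores)]
    return base_score, learning_rate, stumps


def predict_probability(model, x):
    base_score, learning_rate, stumps = model
    score = base_score
    for thr, left_value, right_value in stumps:
        score += learning_rate * (left_value if x <= thr else right_value)
    return sigmoid(score)


# Hypothetical toy data: BMI (kg/m^2) and a made-up cachexia label.
bmi_values = [17.5, 18.2, 19.0, 19.8, 21.5, 23.0, 24.4, 26.1]
cachexia = [1, 1, 1, 1, 0, 0, 0, 0]

model = fit_boosted_stumps(bmi_values, cachexia)
print(predict_probability(model, 18.0) > 0.5)  # True
print(predict_probability(model, 25.0) > 0.5)  # False
```

Library implementations add deeper trees, many features, subsampling, and regularization, and typically derive feature importance from how much each feature's splits reduce the loss across all trees.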
3.3 The performance of different machine learning models and performance demonstration of the GBC model
We independently developed 18 types of machine learning models to predict the response labels using the training data (Table 3). The training set underwent 10-fold cross-validation, and the effectiveness of the 18 models was compared using seven criteria, namely, accuracy, AUC, precision, recall, F1-score, MCC, and kappa. The gradient boosting classifier (GBC) and the CatBoost classifier exhibited the highest AUC (0.854 vs. 0.853). The F1-score was used to further compare the performance of the machine learning models. The F1-score is the harmonic mean of precision and recall and is suited to evaluating models on imbalanced data sets (34, 35). The F1-score of the GBC model was higher than that of the CatBoost classifier (0.658 vs. 0.654). In addition to the F1-score, the accuracy, precision, MCC, and kappa consistently indicated that the GBC model was optimal (Table 3, Supplementary Figure S1). We then evaluated the five models that performed best in the training set on the test set, as shown in Table 4. The GBC model also performed well in the test set (AUC = 0.859, accuracy = 0.818, precision = 0.801, recall = 0.550, F1-score = 0.652, MCC = 0.552, and kappa = 0.535). We thus selected the GBC model for future use.
The receiver operating characteristic (ROC) curve showed that the AUC was 0.859 (Figure 4A). The area under the precision-recall (PR) curve was 0.794 (Figure 4B). The GBC model classification report of the test set is shown in Figure 4C. The confusion matrix showed that 91% of patients without cachexia and 55% of patients with cachexia were correctly predicted (Figure 4D). Across all indicators, the GBC model showed clear advantages in predicting cachexia in lung cancer patients who could not provide weight loss information, and it is expected to support the clinical diagnosis of cachexia in these patients.
Figure 4. The performance demonstration of the GBC model. (A) ROC curves in the test set for the five models that performed best in the training set. Random forest, Random Forest Classifier; catboosting, CatBoost Classifier; GBC, Gradient Boosting Classifier; extra, Extreme Gradient Boosting; lda, Linear Discriminant Analysis. (B) Precision-recall curve for the GBC model in the test data. (C) Classification report for the GBC model using the test data. (D) Confusion matrix for the GBC using the test data.
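All of the threshold-based criteria above (accuracy, precision, recall, F1-score, MCC, and kappa) can be derived from the four cells of a binary confusion matrix. The sketch below shows the standard formulas; the counts are hypothetical and chosen only for easy arithmetic, not taken from Table 3 or Table 4.

```python
import math


def classification_metrics(tp, fp, fn, tn):
    """Standard binary classification metrics from confusion-matrix counts."""
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Cohen's kappa: observed agreement corrected for chance agreement.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    kappa = (accuracy - expected) / (1 - expected)
    return {"accuracy": accuracy, "precision": precision, "recall": recall,
            "f1": f1, "mcc": mcc, "kappa": kappa}


# Hypothetical counts: 100 patients with cachexia, 100 without.
metrics = classification_metrics(tp=55, fp=9, fn=45, tn=91)
for name, value in metrics.items():
    print(name, round(value, 3))
```

In this toy example precision is high while recall is moderate, the same qualitative pattern as the reported GBC results: most predicted cases are true cases, but a share of true cases is missed. MCC and kappa summarize that trade-off in a single number that is less flattered by class imbalance than accuracy.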
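The AUC of a ROC curve has a simple rank interpretation: it is the probability that a randomly chosen patient with cachexia receives a higher predicted score than a randomly chosen patient without cachexia. The sketch below computes it by direct pairwise comparison on hypothetical scores; it illustrates the definition and is not the evaluation code used in the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for pos in positives:
        for neg in negatives:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical predicted probabilities for six patients.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.2]

auc = roc_auc(labels, scores)
print(round(auc, 3))  # 0.889 (8 of 9 pairs ranked correctly)
```

Unlike accuracy or recall, this quantity does not depend on the probability threshold used to call a patient cachectic, which is why AUC and threshold-based metrics can tell different stories about the same model.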
4 Discussion
This study was a retrospective cohort investigation of 4,712 lung cancer patients at multiple centers in China. The study focused on a real-world clinical challenge, namely predicting the onset of cancer cachexia before significant weight loss occurs. To our knowledge, this study is the first large-scale investigation to tackle this challenge through ensemble learning methods grounded in traditional clinical characteristics. Our findings offer insights that may assist clinicians or nutritionists in decision-making for the treatment of high-risk lung cancer patients, guiding effective management strategies to improve patients' outcomes.
Weight loss is a major factor in the diagnosis of cachexia (12, 36). However, because patients often pay little attention to their weight in daily life, delayed and inaccurate recall of weight loss makes the diagnosis of cachexia difficult (37). Previous evidence showed that, once patients lose more than 5% of their body weight, the mortality rate increases significantly (9). In addition, notable treatment-related toxicity and increased mortality have been observed in obese patients with low muscle mass, a condition referred to as sarcopenic obesity (38, 39). Clinical identification of these patients poses a challenge, especially since sarcopenia can be concealed by a body mass index (BMI) in the obese range (40). Consequently, beyond weight loss, there is an increased need for sensitive criteria to detect patients in the early stages of cachexia. Such assessments require measurements beyond standard body weight, including instruments for evaluating muscle mass and/or physical activity (38). In the context of cancer, because of the complexity of body composition, some researchers have suggested that measurements of specific body compartments using methods such as computed tomography should outweigh weight loss in evaluating cachexia (41). We used a convenient and cost-effective GBC ensemble learning model that incorporated demographic, anthropometric, and laboratory parameters, excluding baseline weight loss, to construct an effective predictive model for early identification of cachexia. A total of 18 machine learning models were evaluated, and a GBC ensemble learning model with optimal performance was obtained. The GBC model performed comparably well in the training set and the test set.
In our study, we eliminated weight loss, a dynamically changing feature, and used only currently measurable characteristics to construct a diagnostic model of cachexia. Low BMI, reduced food intake, digestive symptoms, and NLR contributed more to the GBC model. Currently, the GLIM framework suggests the inclusion of at least one phenotypic criterion and one etiological criterion for diagnosing malnutrition. Etiological criteria encompass reduced food intake, digestive or absorption disorders, inflammation, or disease burden in patients (42). BMI is the most important indicator of the nutritional status of the human body and is one of the diagnostic criteria for cachexia (12). The mechanism of anorexia involves tumor-derived humoral factors that can induce cancer anorexia by modulating neuropeptide hormones in the brain associated with eating (43). Furthermore, elucidation of this process indicates that anorexia precedes tissue wasting in cachexia. In a previous study, severe and moderate reduction in food intake was also found to independently predict OS. When anorexia, vomiting, or other digestive symptoms occurred or food intake decreased, patients were at higher risk for cachexia, which was consistent with findings of previous research (44). In the tumor context, systemic inflammation plays an important role in the onset and progression of cancer, is present during cancer-associated cachexia, and serves as a diagnostic tool for cancer-associated cachexia (25, 45). Recent meta-analyses demonstrated an association between elevated NLR and reduced progression-free survival and OS in both NSCLC and SCLC (33, 46). A prior study identified an association between elevated NLR and weight loss, as well as an increased prevalence of cachexia in patients with advanced colon, lung, or prostate cancer (22).
It has also been found that a high NLR at baseline and a progressive increase in NLR during treatment are associated with progressive disease, low OS, and weight loss in NSCLC patients (47). In our GBC model, NLR was one of the top five clinical features contributing to the model, indicating the important value of NLR in the diagnosis of lung cancer cachexia.
However, this study had several potential limitations. First, weight loss relied on patient-reported historical weight, introducing the possibility of recall bias. To mitigate this, only data from patients who were able to provide past weight information were included. Second, the ASMI in this study was derived from an anthropometric equation validated for the Chinese population. Although the equation has demonstrated good agreement with dual-energy X-ray absorptiometry (DXA) and anthropometric data are readily available (17), more precise muscle mass measurements for diagnosing cachexia may be obtained through techniques such as DXA or bioimpedance analysis (48). Our GBC model showed that the ASMI classification was of relatively little importance in the diagnosis of cachexia. Imaging indicators that assess muscle mass more accurately might improve the performance of our model. For example, transverse CT images, typically at standard landmarks frequently captured in abdominal CT (such as the third lumbar vertebra), have established correlations with DXA-equivalent total body fat and muscle mass (38, 49). Future studies using more advanced techniques to measure muscle mass are required. Third, as etiological indicators of cachexia, inflammatory indicators are also prognostic indicators of advanced lung cancer, as observed in previous studies. One serum biomarker commonly used in clinical practice is C-reactive protein (CRP) (25), which is used to identify patients at risk for cancer cachexia when combined with weight loss and nutrient intake factors (50). Because of the substantial amount of missing CRP data, we could not include CRP and its related systemic inflammation indicators; this will be addressed in subsequent data collection.
Nevertheless, our study demonstrated a significant correlation between NLR/ALI/PLR and cachexia through a univariate analysis (P < 0.05). Our model also included NLR, a marker of systemic inflammation, which could compensate for the missing CRP data to some extent. Future studies could analyze and clarify the role of systemic inflammatory indicators in the diagnosis of cachexia.
In conclusion, the GBC ensemble learning model using clinical data without weight loss information could identify patients with lung cancer cachexia. Feature importance showed the contribution of each clinical feature to the diagnosis of cachexia. This study may support patient counseling and targeted interventions so that nutritional treatment can be started early to improve patient prognosis and quality of life.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Institutional Review Committee of Beijing Shijitan Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
PJ: Data curation, Funding acquisition, Project administration, Supervision, Validation, Writing—original draft, Writing—review & editing. QZ: Investigation, Methodology, Writing—review & editing, Writing—original draft. XWu: Data curation, Investigation, Methodology, Writing—original draft, Writing—review & editing. FS: Data curation, Methodology, Writing—original draft, Writing—review & editing. KS: Investigation, Methodology, Writing—original draft, Writing—review & editing. XWa: Data curation, Investigation, Methodology, Writing—original draft, Writing—review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was in part supported by the National Natural Science Foundation of China (Grant No. 82273940) and the National Key Research and Development Program of China (Grant Nos. 2022YFC2010100 and 2022YFC2010101).
Acknowledgments
The authors would like to thank the INSCOC project members for their substantial work on data collection and patient follow-up.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnut.2024.1380949/full#supplementary-material
References
1. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2019. CA Cancer J Clin. (2019) 69:7–34. doi: 10.3322/caac.21551
2. Baracos VE, Martin L, Korc M, Guttridge DC, Fearon KCH. Cancer-associated cachexia. Nat Rev Dis Primers. (2018) 4:17105. doi: 10.1038/nrdp.2017.105
3. Argilés JM, Busquets S, Stemmler B, López-Soriano FJ. Cancer cachexia: understanding the molecular basis. Nat Rev Cancer. (2014) 14:754–62. doi: 10.1038/nrc3829
4. Dolly A, Dumas JF, Servais S. Cancer cachexia and skeletal muscle atrophy in clinical studies: what do we really know? J Cachexia Sarcopenia Muscle. (2020) 11:1413–28. doi: 10.1002/jcsm.12633
5. Suzuki H, Asakawa A, Amitani H, Nakamura N, Inui A. Cancer cachexia–pathophysiology and management. J Gastroenterol. (2013) 48:574–94. doi: 10.1007/s00535-013-0787-0
6. Caillet P, Liuu E, Raynaud Simon A, Bonnefoy M, Guerin O, Berrut G, et al. Association between cachexia, chemotherapy and outcomes in older cancer patients: a systematic review. Clin Nutr. (2017) 36:1473–82. doi: 10.1016/j.clnu.2016.12.003
7. Dewys WD, Begg C, Lavin PT, Band PR, Bennett JM, Bertino JR, et al. Prognostic effect of weight loss prior to chemotherapy in cancer patients. Eastern Cooperative Oncology Group. Am J Med. (1980) 69:491–7. doi: 10.1016/S0149-2918(05)80001-3
8. Vagnildhaug OM, Balstad TR, Almberg SS, Brunelli C, Knudsen AK, Kaasa S, et al. A cross-sectional study examining the prevalence of cachexia and areas of unmet need in patients with cancer. Support Care Cancer. (2018) 26:1871–80. doi: 10.1007/s00520-017-4022-z
9. Bruggeman AR, Kamal AH, LeBlanc TW, Ma JD, Baracos VE, Roeland EJ. Cancer cachexia: beyond weight loss. J Oncol Pract. (2016) 12:1163–71. doi: 10.1200/JOP.2016.016832
10. von Haehling S, Anker SD. Cachexia as a major underestimated and unmet medical need: facts and numbers. J Cachexia Sarcopenia Muscle. (2010) 1:1–5. doi: 10.1007/s13539-010-0002-6
11. Tang M, Shi H. From value-based medicine to value-based nutrition. Precis Nutr. (2022) 1:e00020. doi: 10.1097/PN9.0000000000000020
12. Fearon K, Strasser F, Anker SD, Bosaeus I, Bruera E, Fainsinger RL, et al. Definition and classification of cancer cachexia: an international consensus. Lancet Oncol. (2011) 12:489–95. doi: 10.1016/S1470-2045(10)70218-7
13. Nishikawa H, Goto M, Fukunishi S, Asai A, Nishiguchi S, Higuchi K. Cancer cachexia: its mechanism and clinical significance. Int J Mol Sci. (2021) 22:8491. doi: 10.3390/ijms22168491
14. Ferrer M, Anthony TG, Ayres JS, Biffi G, Brown JC, Caan BJ, et al. Cachexia: a systemic consequence of progressive, unresolved disease. Cell. (2023) 186:1824–45. doi: 10.1016/j.cell.2023.03.028
15. Xu H, Song C, Yin L, Wang C, Fu Z, Guo Z, et al. Extension protocol for the Investigation on Nutrition Status and Clinical Outcome of Patients with Common Cancers in China (INSCOC) study: 2021 update. Precis Nutr. (2022) 1:e00014. doi: 10.1097/PN9.0000000000000014
16. Arends J, Bachmann P, Baracos V, Barthelemy N, Bertz H, Bozzetti F, et al. ESPEN guidelines on nutrition in cancer patients. Clin Nutr. (2017) 36:11–48. doi: 10.1016/j.clnu.2016.07.015
17. Wen X, Wang M, Jiang CM, Zhang YM. Anthropometric equation for estimation of appendicular skeletal muscle mass in Chinese adults. Asia Pac J Clin Nutr. (2011) 20:551–6. doi: 10.6133/APJCN.2011.20.4.08
18. Choi SJ, Lee MS, Kang DH, Ko GJ, Lim HS, Yu BC, et al. Myostatin/appendicular skeletal muscle mass (ASM) ratio, not myostatin, is associated with low handgrip strength in community-dwelling older women. Int J Environ Res Public Health. (2021) 18:7344. doi: 10.3390/ijerph18147344
19. Cederholm T, Jensen GL, Correia M, Gonzalez MC, Fukushima R, Higashiguchi T, et al. GLIM criteria for the diagnosis of malnutrition - a consensus report from the global clinical nutrition community. J Cachexia Sarcopenia Muscle. (2019) 10:207–17. doi: 10.1002/jcsm.12383
20. Obaje SG, Danborno B, Akuyam SA, Timbuak JA. Anthropometric measurements as nutritional indicators and association with sociodemographic factors among the Idoma ethnic group in Nigeria. Precis Nutr. (2023) 2:e00048. doi: 10.1097/PN9.0000000000000048
21. Li X, Hu C, Zhang Q, Wang K, Li W, Xu H, et al. Cancer cachexia statistics in China. Precis Nutr. (2022) 1:28–38. doi: 10.1097/PN9.0000000000000008
22. Barker T, Fulde G, Moulton B, Nadauld LD, Rhodes T. An elevated neutrophil-to-lymphocyte ratio associates with weight loss and cachexia in cancer. Sci Rep. (2020) 10:7535. doi: 10.1038/s41598-020-64282-z
23. Lipshitz M, Visser J, Anderson R, Nel DG, Smit T, Steel HC, et al. Emerging markers of cancer cachexia and their relationship to sarcopenia. J Cancer Res Clin Oncol. (2023) 149:17511–27. doi: 10.1007/s00432-023-05465-9
24. Adejumo OL, Koelling TM, Hummel SL. Nutritional Risk Index predicts mortality in hospitalized advanced heart failure patients. J Heart Lung Transplant. (2015) 34:1385–9. doi: 10.1016/j.healun.2015.05.027
25. Song M, Zhang Q, Song C, Liu T, Zhang X, Ruan G, et al. The advanced lung cancer inflammation index is the optimal inflammatory biomarker of overall survival in patients with lung cancer. J Cachexia Sarcopenia Muscle. (2022) 13:2504–14. doi: 10.1002/jcsm.13032
26. Tibshirani R. Regression shrinkage and selection via the lasso. J R Stat Soc Ser B Methodol. (1996) 58:267–88. doi: 10.1111/j.2517-6161.1996.tb02080.x
27. Mueller C, Compher C, Ellen DM. ASPEN clinical guidelines: nutrition screening, assessment, and intervention in adults. JPEN J Parenter Enteral Nutr. (2011) 35:16–24. doi: 10.1177/0148607110389335
28. Kondrup J, Rasmussen HH, Hamberg O, Stanga Z. Nutritional risk screening (NRS 2002): a new method based on an analysis of controlled clinical trials. Clin Nutr. (2003) 22:321–36. doi: 10.1016/S0261-5614(02)00214-5
29. Xu J, Jiang Z. Different risk scores consider different types of risks: the deficiencies of the 2015 ESPEN consensus on diagnostic criteria for malnutrition. Eur J Clin Nutr. (2018) 72:936–41. doi: 10.1038/s41430-018-0120-3
30. Cederholm T, Barazzoni R, Austin P, Ballmer P, Biolo G, Bischoff SC, et al. ESPEN guidelines on definitions and terminology of clinical nutrition. Clin Nutr. (2017) 36:49–64. doi: 10.1016/j.clnu.2016.09.004
31. Aaronson NK, Ahmedzai S, Bergman B, Bullinger M, Cull A, Duez NJ, et al. The European Organization for Research and Treatment of Cancer QLQ-C30: a quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst. (1993) 85:365–76.
32. Tu J, Wu B, Xiu J, Deng J, Lin S, Lu J, et al. Advanced lung cancer inflammation index is associated with long-term cardiovascular death in hypertensive patients: national health and nutrition examination study, 1999-2018. Front Physiol. (2023) 14:1074672. doi: 10.3389/fphys.2023.1074672
33. Zhao QT, Yang Y, Xu S, Zhang XP, Wang HE, Zhang H, et al. Prognostic role of neutrophil to lymphocyte ratio in lung cancers: a meta-analysis including 7,054 patients. Onco Targets Ther. (2015) 8:2731–8. doi: 10.2147/OTT.S90875
34. Sokolova M, Lapalme G. A systematic analysis of performance measures for classification tasks. Inf Process Manag. (2009) 45:427–37. doi: 10.1016/j.ipm.2009.03.002
35. Bekkar M, Djemaa HK, Alitouche TA. Evaluation measures for models assessment over imbalanced data sets. J Inf Eng Appl. (2013) 3:27–38. doi: 10.5121/ijdkp.2013.3402
36. Blum D, Stene GB, Solheim TS, Fayers P, Hjermstad MJ, Baracos VE, et al. Validation of the Consensus-Definition for Cancer Cachexia and evaluation of a classification model–a study based on data from an international multicentre project (EPCRC-CSA). Ann Oncol. (2014) 25:1635–42. doi: 10.1093/annonc/mdu086
37. Tamakoshi K, Yatsuya H, Kondo T, Hirano T, Hori Y, Yoshida T, et al. The accuracy of long-term recall of past body weight in Japanese adult men. Int J Obes Relat Metab Disord. (2003) 27:247–52. doi: 10.1038/sj.ijo.802195
38. Martin L, Birdsell L, Macdonald N, Reiman T, Clandinin MT, McCargar LJ, et al. Cancer cachexia in the age of obesity: skeletal muscle depletion is a powerful prognostic factor, independent of body mass index. J Clin Oncol. (2013) 31:1539–47. doi: 10.1200/JCO.2012.45.2722
39. Batsis JA, Villareal DT. Sarcopenic obesity in older adults: aetiology, epidemiology and treatment strategies. Nat Rev Endocrinol. (2018) 14:513–37. doi: 10.1038/s41574-018-0062-9
40. Baracos VE, Arribas L. Sarcopenic obesity: hidden muscle wasting and its impact for survival and complications of cancer therapy. Ann Oncol. (2018) 29(suppl_2):ii1–9. doi: 10.1093/annonc/mdx810
41. Holmes CJ, Racette SB. The utility of body composition assessment in nutrition and clinical practice: an overview of current methodology. Nutrients. (2021) 13:2493. doi: 10.3390/nu13082493
42. Cederholm T, Jensen GL, Correia M, Gonzalez MC, Fukushima R, Higashiguchi T, et al. GLIM criteria for the diagnosis of malnutrition - a consensus report from the global clinical nutrition community. Clin Nutr. (2019) 38:1–9. doi: 10.1016/j.clnu.2019.02.033
43. Yeom E, Yu K. Understanding the molecular basis of anorexia and tissue wasting in cancer cachexia. Exp Mol Med. (2022) 54:426–32. doi: 10.1038/s12276-022-00752-w
44. Martin L, Muscaritoli M, Bourdel-Marchasson I, Kubrak C, Laird B, Gagnon B, et al. Diagnostic criteria for cancer cachexia: reduced food intake and inflammation predict weight loss and survival in an international, multi-cohort analysis. J Cachexia Sarcopenia Muscle. (2021) 12:1189–202. doi: 10.1002/jcsm.12756
45. Zhang Q, Song MM, Zhang X, Ding JS, Ruan GT, Zhang XW, et al. Association of systemic inflammation with survival in patients with cancer cachexia: results from a multicentre cohort study. J Cachexia Sarcopenia Muscle. (2021) 12:1466–76. doi: 10.1002/jcsm.12761
46. Gu XB, Tian T, Tian XJ, Zhang XJ. Prognostic significance of neutrophil-to-lymphocyte ratio in non-small cell lung cancer: a meta-analysis. Sci Rep. (2015) 5:12493. doi: 10.1038/srep12493
47. Derman BA, Macklis JN, Azeem MS, Sayidine S, Basu S, Batus M, et al. Relationships between longitudinal neutrophil to lymphocyte ratios, body weight changes, and overall survival in patients with non-small cell lung cancer. BMC Cancer. (2017) 17:141. doi: 10.1186/s12885-017-3122-y
48. Chen LK, Woo J, Assantachai P, Auyeung TW, Chou MY, Iijima K, et al. Asian working group for sarcopenia: 2019 consensus update on sarcopenia diagnosis and treatment. J Am Med Dir Assoc. (2020) 21:300–7.e2. doi: 10.1016/j.jamda.2019.12.012
49. Prado CM, Lieffers JR, McCargar LJ, Reiman T, Sawyer MB, Martin L, et al. Prevalence and clinical implications of sarcopenic obesity in patients with solid tumours of the respiratory and gastrointestinal tracts: a population-based study. Lancet Oncol. (2008) 9:629–35. doi: 10.1016/S1470-2045(08)70153-0
Keywords: lung cancer, cachexia, weight loss, ensemble learning, cohort study
Citation: Jia P, Zhao Q, Wu X, Shen F, Sun K and Wang X (2024) Identification of cachexia in lung cancer patients with an ensemble learning approach. Front. Nutr. 11:1380949. doi: 10.3389/fnut.2024.1380949
Received: 02 February 2024; Accepted: 14 May 2024; Published: 30 May 2024.
Edited by:
Marilia Seelaender, University of São Paulo, Brazil
Reviewed by:
Yixin Zhao, First Affiliated Hospital of Jilin University, China
Monica Cattafesta, Federal University of Espirito Santo, Brazil
Copyright © 2024 Jia, Zhao, Wu, Shen, Sun and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Pingping Jia, pingpingj@ccmu.edu.cn
†These authors have contributed equally to this work