Skip to main content

ORIGINAL RESEARCH article

Front. Oncol.
Sec. Thoracic Oncology
Volume 14 - 2024 | doi: 10.3389/fonc.2024.1482374

Machine Learning-Based Prediction of 5-Year Survival in Elderly NSCLC Patients Using Oxidative Stress Markers

Provisionally accepted
Hao Chen Hao Chen 1Jiangjiang Xu Jiangjiang Xu 2Qiang Zhang Qiang Zhang 2Pengfei Chen Pengfei Chen 2Qiuxia Liu Qiuxia Liu 2Bindong Xu Bindong Xu 2*
  • 1 Fuding Hospital, Ningde, Fujian Province, China
  • 2 Putian University, Putian, China

The final, formatted version of the article will be published soon.

    Background: Oxidative stress plays a significant role in aging and cancer, yet there is currently a lack of research utilizing machine learning models to examine the relationship between oxidative stress and prognosis in elderly non-small cell lung cancer (NSCLC) patients.This study included elderly NSCLC patients who underwent radical lung cancer resection from January 2012 to April 2018, exploring the relationship between Oxidative Stress Score (OSS) and prognosis. Machine learning techniques, including Decision Trees (DT), Random Forest (RF), and Support Vector Machine (SVM), were employed to develop predictive models for 5-year overall survival (OS).The datasets consisted of 1647 patients in the training set, 705 in the internal validation set, and 516 in the external validation set. An OSS was formulated from six systemic oxidative stress biomarkers, such as albumin, total bilirubin, and blood urea nitrogen, among others. Boruta variable importance analysis identified low OSS as a key indicator of poor prognosis. The OSS was subsequently integrated into the DT, RF, and SVM models for training. These models, optimized through hyperparameter tuning on the training set, were then evaluated on the internal and external validation sets. The RF model demonstrated the highest predictive performance, with an Area Under the Receiver Operating Characteristic Curve (AUC) of 0.794 in the internal validation set, compared to AUCs of 0.711 and 0.760 for the DT and SVM models, respectively. Similarly, in the external validation set, the RF model achieved an AUC of 0.784, outperforming the DT and SVM models, which had AUCs of 0.699 and 0.730, respectively. Calibration plots confirmed the RF model's superior calibration, followed by the SVM model, with the DT model performing the poorest.The OSS-based clinical prediction model, constructed using machine learning methodologies, effectively predicts the prognosis of elderly NSCLC patients post-radical surgery.

    Keywords: Elderly, NSCLC, Oxidative Stress, machine learning, overall survival

    Received: 18 Aug 2024; Accepted: 24 Sep 2024.

    Copyright: © 2024 Chen, Xu, Zhang, Chen, Liu and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Bindong Xu, Putian University, Putian, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.