Skip to main content

ORIGINAL RESEARCH article

Front. Oncol.
Sec. Gynecological Oncology
Volume 14 - 2024 | doi: 10.3389/fonc.2024.1457294
This article is part of the Research Topic Insights, Controversies, and New Developments in the Initial Treatment Decisions for Advanced Epithelial Ovarian Cancer View all 6 articles

Machine learning for epithelial ovarian cancer platinum resistance recurrence identification using routine clinical data

Provisionally accepted
Lirong Yang Lirong Yang 1,2*Mei Yang Mei Yang 2Jie Ge Jie Ge 3Liu-Ling Chen Liu-Ling Chen 1Lin-Hui Li Lin-Hui Li 1Yuan He Yuan He 2Zong-Ting Meng Zong-Ting Meng 2Wan-Qi Wang Wan-Qi Wang 2Feng Li Feng Li 2Zhi-jin LIU Zhi-jin LIU 2Yu-Feng Wang Yu-Feng Wang 2Yong-Ling Shen Yong-Ling Shen 1
  • 1 the Southern Central Hospital of Yunnan Province, Honghe, China
  • 2 Yunnan Cancer Hospital, Kunming, China
  • 3 Pu'er People's Hospital, Pu'er, Yunnan Province, China

The final, formatted version of the article will be published soon.

    We used laboratory and clinical data to create models based on machine learning for EOC platinum resistance recurrence identification. This study was designed as a retrospective cohort analysis. Initially, we identified 1,392 patients diagnosed with epithelial ovarian cancer who underwent platinum-based chemotherapy at Yunnan Cancer Hospital between January 1, 2012, and June 30, 2022. We collected data on the patients' clinicopathologic characteristics, routine laboratory results, surgical information, details of chemotherapy regimens, and survival outcomes. Subsequently, to identify relevant variables influencing the recurrence of platinum resistance, we screened thirty potential factors using two distinct variable selection methods: Lasso regression and multiple logistic regression analysis. Following this screening process, five machine learning algorithms were employed to develop predictive models based on the selected variables. These included decision tree analysis (DTA), K-Nearest neighbor (KNN), support vector machine (SVM), random forest (RF), and eXtreme gradient boosting (XGBoost). The performance of these models was compared against that of traditional logistic regression. To ensure robust internal validation and facilitate comparison among model performance metrics, a five-fold cross-validation method was implemented. Key performance indicators for the models included the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and average accuracy. Finally, we will visualize these models through nomograms, decision tree diagrams, variable importance plots, etc., to assist clinicians in their practice.Based on the findings from both Lasso regression and multiple logistic regression analysis, models were developed using these 7 and 8 factors. Among these, the XGBoost model derived from multiple logistic regression exhibited superior performance and demonstrated good discrimination during internal validation, achieving an AUC of 0.784, a sensitivity of 0.735, a specificity of 0.713, an average accuracy of 80.4%, with a cut-off value set at 0.240.

    Keywords: platinum resistance, Recurrence, Model, nomogram, Early Detection of Cancer

    Received: 30 Jun 2024; Accepted: 16 Oct 2024.

    Copyright: © 2024 Yang, Yang, Ge, Chen, Li, He, Meng, Wang, Li, LIU, Wang and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Lirong Yang, the Southern Central Hospital of Yunnan Province, Honghe, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.