AUTHOR=Zuo Yan , Liu Qiufang , Li Nan , Li Panli , Zhang Jianping , Song Shaoli TITLE=Optimal 18F-FDG PET/CT radiomics model development for predicting EGFR mutation status and prognosis in lung adenocarcinoma: a multicentric study JOURNAL=Frontiers in Oncology VOLUME=13 YEAR=2023 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2023.1173355 DOI=10.3389/fonc.2023.1173355 ISSN=2234-943X ABSTRACT=Purpose

To develop and interpret optimal predictive models to identify epidermal growth factor receptor (EGFR) mutation status and subtypes in patients with lung adenocarcinoma based on multicentric 18F-FDG PET/CT data, and further construct a prognostic model to predict their clinical outcome.

Methods

The 18F-FDG PET/CT imaging and clinical characters of 767 patients with lung adenocarcinoma from 4 cohorts were collected. Seventy-six radiomics candidates using cross-combination method to identity EGFR mutation status and subtypes were built. Further, Shapley additive explanations and local interpretable model-agnostic explanations were used for optimal models’ interpretation. Moreover, in order to predict the overall survival, a multivariate Cox proportional hazard model based on handcrafted radiomics features and clinical characteristics was constructed. The predictive performance and clinical net benefit of the models were evaluated via area under receiver operating characteristic (AUC), C-index and decision curve analysis.

Results

Among the 76 radiomics candidates, light gradient boosting machine classifier (LGBM) combined with recursive feature elimination wrapped LGBM feature selection method achieved best performance in predicting EGFR mutation status (AUC reached 0.80, 0.61, 0.71 in the internal test cohort and two external test cohorts, respectively). And extreme gradient boosting classifier combined with support vector machine feature selection method achieved best performance in predicting EGFR subtypes (AUC reached 0.76, 0.63, 0.61 in the internal test cohort and two external test cohorts, respectively). The C-index of the Cox proportional hazard model achieved 0.863.

Conclusions

The integration of cross-combination method and the external validation from multi-center data achieved a good prediction and generalization performance in predicting EGFR mutation status and its subtypes. The combination of handcrafted radiomics features and clinical factors achieved good performance in predicting prognosis. With the urgent needs of multicentric 18F-FDG PET/CT trails, robust and explainable radiomics models have great potential in decision making and prognosis prediction of lung adenocarcinoma.