AUTHOR=Chang Cheng , Sun Xiaoyan , Wang Gang , Yu Hong , Zhao Wenlu , Ge Yaqiong , Duan Shaofeng , Qian Xiaohua , Wang Rui , Lei Bei , Wang Lihua , Liu Liu , Ruan Maomei , Yan Hui , Liu Ciyi , Chen Jie , Xie Wenhui TITLE=A Machine Learning Model Based on PET/CT Radiomics and Clinical Characteristics Predicts ALK Rearrangement Status in Lung Adenocarcinoma JOURNAL=Frontiers in Oncology VOLUME=11 YEAR=2021 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2021.603882 DOI=10.3389/fonc.2021.603882 ISSN=2234-943X ABSTRACT=Objectives

Anaplastic lymphoma kinase (ALK) rearrangement status examination has been widely used in clinic for non-small cell lung cancer (NSCLC) patients in order to find patients that can be treated with targeted ALK inhibitors. This study intended to non-invasively predict the ALK rearrangement status in lung adenocarcinomas by developing a machine learning model that combines PET/CT radiomic features and clinical characteristics.

Methods

Five hundred twenty-six patients of lung adenocarcinoma with PET/CT scan examination were enrolled, including 109 positive and 417 negative patients for ALK rearrangements from February 2016 to March 2019. The Artificial Intelligence Kit software was used to extract radiomic features of PET/CT images. The maximum relevance minimum redundancy (mRMR) and least absolute shrinkage and selection operator (LASSO) logistic regression were further employed to select the most distinguishable radiomic features to construct predictive models. The mRMR is a feature selection method, which selects the features with high correlation to the pathological results (maximum correlation), meanwhile retain the features with minimum correlation between them (minimum redundancy). LASSO is a statistical formula whose main purpose is the feature selection and regularization of data model. LASSO method regularizes model parameters by shrinking the regression coefficients, reducing some of them to zero. The feature selection phase occurs after the shrinkage, where every non-zero value is selected to be used in the model. Receiver operating characteristic (ROC) analysis was used to evaluate the performance of the models, and the performance of different models was compared by the DeLong test.

Results

A total of 22 radiomic features were extracted from PET/CT images for constructing the PET/CT radiomic model, and majority of these features used were based on CT features (20 out of 22), only 2 PET features were included (PET percentile 10 and PET difference entropy). Moreover, three clinical features associated with ALK mutation (age, burr and pleural effusion) were also employed to construct a combined model of PET/CT and clinical model. We found that this combined model PET/CT-clinical model has a significant advantage to predict the ALK mutation status in the training group (AUC = 0.87) and the testing group (AUC = 0.88) compared with the clinical model alone in the training group (AUC = 0.76) and the testing group (AUC = 0.74) respectively. However, there is no significant difference between the combined model and PET/CT radiomic model.

Conclusions

This study demonstrated that PET/CT radiomics-based machine learning model has potential to be used as a non-invasive diagnostic method to help diagnose ALK mutation status for lung adenocarcinoma patients in the clinic.