AUTHOR=Liu Yun-Fan , Shu Xin , Qiao Xiao-Feng , Ai Guang-Yong , Liu Li , Liao Jun , Qian Shuang , He Xiao-Jing TITLE=Radiomics-Based Machine Learning Models for Predicting P504s/P63 Immunohistochemical Expression: A Noninvasive Diagnostic Tool for Prostate Cancer JOURNAL=Frontiers in Oncology VOLUME=12 YEAR=2022 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.911426 DOI=10.3389/fonc.2022.911426 ISSN=2234-943X ABSTRACT=Objective

To develop and validate a noninvasive radiomic-based machine learning (ML) model to identify P504s/P63 status and further achieve the diagnosis of prostate cancer (PCa).

Methods

A retrospective dataset of patients with preoperative prostate MRI examination and P504s/P63 pathological immunohistochemical results between June 2016 and February 2021 was conducted. As indicated by P504s/P63 expression, the patients were divided into label 0 (atypical prostatic hyperplasia), label 1 (benign prostatic hyperplasia, BPH) and label 2 (PCa) groups. This study employed T2WI, DWI and ADC sequences to assess prostate diseases and manually segmented regions of interest (ROIs) with Artificial Intelligence Kit software for radiomics feature acquisition. Feature dimensionality reduction and selection were performed by using a mutual information algorithm. Based on screened features, P504s/P63 prediction models were established by random forest (RF), gradient boosting decision tree (GBDT), logistic regression (LR), adaptive boosting (AdaBoost) and k-nearest neighbor (KNN) algorithms. The performance was evaluated by the area under the ROC curve (AUC) and accuracy.

Results

A total of 315 patients were enrolled. Among the 851 radiomic features, the 32 top features were derived from T2WI, in which the gray-level run length matrix (GLRLM) and gray-level cooccurrence matrix (GLCM) features accounted for the largest proportion. Among the five models, the RF algorithm performed best in general evaluations (microaverage AUC=0.920, macroaverage AUC=0.870) and provided the most accurate result in further sublabel prediction (the accuracies of label 0, 1, and 2 were 0.831, 0.831, and 0.932, respectively). In comparative sequence analyses, T2WI was the best single-sequence candidate (microaverage AUC=0.94 and macroaverage AUC=0.78). The merged datasets of T2WI, DWI, and ADC yielded optimal AUCs (microaverage AUC=0.930 and macroaverage AUC=0.900).

Conclusions

The radiomic-based RF classifier has the potential to be used to evaluate the presurgical P504s/P63 status and further diagnose PCa noninvasively and accurately.