AUTHOR=Deng Xiaobin , Li Tianyu , Mo Linjian , Wang Fubo , Ji Jin , He Xing , Mohamud Bashir Hussein , Pradhan Swadhin , Cheng Jiwen TITLE=Machine learning model for the prediction of prostate cancer in patients with low prostate-specific antigen levels: A multicenter retrospective analysis JOURNAL=Frontiers in Oncology VOLUME=12 YEAR=2022 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2022.985940 DOI=10.3389/fonc.2022.985940 ISSN=2234-943X ABSTRACT=Objective

The aim of this study was to develop a predictive model to improve the accuracy of prostate cancer (PCa) detection in patients with prostate specific antigen (PSA) levels ≤20 ng/mL at the initial puncture biopsy.

Methods

A total of 146 patients (46 with Pca, 31.5%) with PSA ≤20 ng/mL who had undergone transrectal ultrasound-guided 12+X prostate puncture biopsy with clear pathological results at the First Affiliated Hospital of Guangxi Medical University (November 2015 to December 2021) were retrospectively evaluated. The validation group was 116 patients drawn from Changhai Hospital(52 with Pca, 44.8%). Age, body mass index (BMI), serum PSA, PSA-derived indices, several peripheral blood biomarkers, and ultrasound findings were considered as predictive factors and were analyzed by logistic regression. Significant predictors (P < 0.05) were included in five machine learning algorithm models. The performance of the models was evaluated by receiver operating characteristic curves. Decision curve analysis (DCA) was performed to estimate the clinical utility of the models. Ten-fold cross-validation was applied in the training process.

Results

Prostate-specific antigen density, alanine transaminase-to-aspartate transaminase ratio, BMI, and urine red blood cell levels were identified as independent predictors for the differential diagnosis of PCa according to multivariate logistic regression analysis. The RandomForest model exhibited the best predictive performance and had the highest net benefit when compared with the other algorithms, with an area under the curve of 0.871. In addition, DCA had the highest net benefit across the whole range of cut-off points examined.

Conclusion

The RandomForest-based model generated showed good prediction ability for the risk of PCa. Thus, this model could help urologists in the treatment decision-making process.