Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 07 April 2022
Sec. Genitourinary Oncology
This article is part of the Research Topic Advances in Prostate Cancer: Model Systems, Molecular and Cellular Mechanisms, Early Detection, and Therapies View all 21 articles

Machine Learning-Based Prediction of Pathological Upgrade From Combined Transperineal Systematic and MRI-Targeted Prostate Biopsy to Final Pathology: A Multicenter Retrospective Study

Junlong Zhuang,&#x;Junlong Zhuang1,2†Yansheng Kan&#x;Yansheng Kan1†Yuwen Wang,,&#x;Yuwen Wang1,2,3†Alessandro MarquisAlessandro Marquis4Xuefeng Qiu,Xuefeng Qiu1,2Marco OderdaMarco Oderda4Haifeng Huang,Haifeng Huang1,2Marco GattiMarco Gatti5Fan Zhang,Fan Zhang1,2Paolo GonteroPaolo Gontero4Linfeng Xu,Linfeng Xu1,2Giorgio CallerisGiorgio Calleris4Yao FuYao Fu6Bing ZhangBing Zhang7Giancarlo Marra,*Giancarlo Marra4,8*Hongqian Guo,*Hongqian Guo1,2*
  • 1Department of Urology, Affiliated Drum Tower Hospital, Medical School of Nanjing University, Nanjing, China
  • 2Institute of Urology, Nanjing University, Nanjing, China
  • 3Medical School of Southeast University, Nanjing Drum Tower Hospital, Nanjing, China
  • 4Department of Urology, San Giovanni Battista Hospital, Città della Salute e della Scienza and University of Turin, Turin, Italy
  • 5Department of Radiology, San Giovanni Battista Hospital, Città della Salute e della Scienza and University of Turin, Turin, Italy
  • 6Department of Pathology, Affiliated Drum Tower Hospital, Medical School of Nanjing University, Nanjing, China
  • 7Department of Radiology, Affiliated Drum Tower Hospital, Medical School of Nanjing University, Nanjing, China
  • 8Department of Urology and Clinical Research Group on Predictive Onco-Urology, APHP, Sorbonne University, Paris, France

Objective: This study aimed to evaluate the pathological concordance from combined systematic and MRI-targeted prostate biopsy to final pathology and to verify the effectiveness of a machine learning-based model with targeted biopsy (TB) features in predicting pathological upgrade.

Materials and Methods: All patients in this study underwent prostate multiparametric MRI (mpMRI), transperineal systematic plus transperineal targeted prostate biopsy under local anesthesia, and robot-assisted laparoscopic radical prostatectomy (RARP) for prostate cancer (PCa) sequentially from October 2016 to February 2020 in two referral centers. For cores with cancer, grade group (GG) and Gleason score were determined by using the 2014 International Society of Urological Pathology (ISUP) guidelines. Four supervised machine learning methods were employed, including two base classifiers and two ensemble learning-based classifiers. In all classifiers, the training set was 395 of 565 (70%) patients, and the test set was the remaining 170 patients. The prediction performance of each model was evaluated by area under the receiver operating characteristic curve (AUC). The Gini index was used to evaluate the importance of all features and to figure out the most contributed features. A nomogram was established to visually predict the risk of upgrading. Predicted probability was a prevalence rate calculated by a proposed nomogram.

Results: A total of 515 patients were included in our cohort. The combined biopsy had a better concordance of postoperative histopathology than a systematic biopsy (SB) only (48.15% vs. 40.19%, p = 0.012). The combined biopsy could significantly reduce the upgrading rate of postoperative pathology, in comparison to SB only (23.30% vs. 39.61%, p < 0.0001) or TB only (23.30% vs. 40.19%, p < 0.0001). The most common pathological upgrade occurred in ISUP GG1 and GG2, accounting for 53.28% and 20.42%, respectively. All machine learning methods had satisfactory predictive efficacy. The overall accuracy was 0.703, 0.768, 0.794, and 0.761 for logistic regression, random forest, eXtreme Gradient Boosting, and support vector machine, respectively. TB-related features were among the most contributed features of a prediction model for upgrade prediction.

Conclusion: The combined effect of SB plus TB led to a better pathological concordance rate and less upgrading from biopsy to RP. Machine learning models with features of TB to predict PCa GG upgrading have a satisfactory predictive efficacy.

Introduction

Biopsy-derived tumor grade is currently used for risk stratification and clinical decision-making of prostate cancer (PCa) (1). However, up to 36% of patients with low-grade biopsy upgrade after radical prostatectomy (RP) (2, 3), leading to the potential risk of underestimation and following undertreatment. The risk of the pathological upgrade has been historically predicted using multivariable tools based on clinical parameters (46).

In recent years, multiparametric MRI (mpMRI) and mpMRI-targeted biopsy (mpMRI-TB) have been shown to improve the detection of clinically significant PCa (csPCa) (7, 8). Currently, the guideline recommends mpMRI before biopsy and mpMRI-TB combined with 12-core systematic biopsy (SB) for patients with positive mpMRI results (2021). Several models have been developed to predict pathological upgrading using final pathology as the reference (9). A significant added value of mpMRI and mpMRI-TB to the clinical parameters in predicting the risk of upgrading as well as reducing the number of unnecessary repeat prostate biopsies has been previously investigated (9, 10).

Machine learning techniques have been increasingly used in the medical field due to their high accuracy. Compared to the traditional predictive models, machine learning-based models can incorporate a larger number of variables (11). Liu et al. developed a risk model to predict upgrading from biopsy to RP using learning machine-assisted decision-support models (12). However, this tool was developed in patients undergoing transrectal ultrasound (TRUS)-guided SB, with big concerns about its applicability in the era of mpMRI and mpMRI-TB.

Therefore, based on a previous multicenter prospective cohort study of mpMRI-TB (13, 14), we aimed to develop a machine learning-based model for the identification of patients at high risk of upgrading from combined SB and mpMRI-TB to RP.

Material and Methods

Study Design

The institutional review board of two hospitals approved this retrospective study from a prospective cohort of mpMRI fusion-targeted biopsies (14) and waived the requirement for informed consent. All patients in this study underwent prostate mpMRI and transperineal systematic plus targeted prostate biopsy under local anesthesia sequentially from October 2016 to February 2020 in Drum Tower Hospital and Molinette Hospital, as previously described (13, 14). Those who were diagnosed with PCa and subsequently treated with robot-assisted laparoscopic RP (RARP) were included. More details about the criteria are shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1 Study flowchart. Data from four out of five included patients were trained for prediction model building and the rest for model validation.

Multiparametric MRI Technique

All patients underwent mpMRI performed with a 1.5- or 3-Tesla system (details shown in Supplementary Table 1) (15, 16). Two radiologists with over 10-year experience in urology image analyses supervised the results using the Prostate Imaging-Reporting and Data System (PI-RADS) v2 standards (17). All radiologists had at least 1,000 prostate MRI images of reading experience. The protocol consisted of T2-weighted (T2W) imaging in three planes, diffusion-weighted imaging (DWI) with the calculation of apparent diffusion coefficient (ADC) maps, high b-value images (b > 1,500 s/mm2), and dynamic contrast-enhanced imaging (18). In case there was disagreement over the outcome of the image, the radiologists would discuss it until they reached a consensus.

Biopsy

Biopsy targets were defined as regions of interest (ROIs), which were lesions with PI-RADS score ≥ 3. Before the biopsy, operators annotated targets using an ultrasound system (Esaote Real Time Virtual Sonography, Hitachi Medical Corporation, Tokyo, Japan) with reference to the reports from the two radiologists. All patients underwent transperineal MRI–ultrasonography (MRI-US) fusion biopsy (18G needles with sampling length 17 mm) with a US diagnostic system (MyLab Twice, Esaote S.p.A., Genoa, Italy) consisting of 12-core SB and 2- to 4-core targeted biopsies for each ROI under local anesthesia, as previously described (13, 14).

Histopathology

Histopathology of prostate biopsies was performed by specialized urological pathologists independent of MRI results. For cores with cancer, grade group (GG) and Gleason score (GS) were determined by using the 2014 International Society of Urological Pathology (ISUP) guidelines (19). csPCa was defined as different criteria according to MRI-FIRST study (csPCa-1, ISUP GG 2 or higher tumors; csPCa-2, ISUP GG 3 or higher tumors) (20).

Data Collection

Retrospective collection of urology, radiology, and histopathology data for all patients included the following clinical characteristics: histopathology result of biopsy and RP, age, height, weight, body mass index (BMI), pre-biopsy serum prostate-specific antigen (PSA) value (ng/ml), prostate volume, the maximum diameter of lesion, gap days between biopsy and RP, number of cores, and PI-RADS. The length of the prostate on anterior–posterior (AP), head–foot (HF), and right–left (RL) directions were also calculated on T2W mpMRI images.

Data Visualization and Machine Learning Classifiers

“Upgrade” was defined as ISUP in RP pathology higher than ISUP in biopsy pathology, while “downgrade” was defined as ISUP in RP pathology lower than ISUP in biopsy pathology. These data were visualized under different biopsy situations including SB only, TB only, and SB combined with TB. At the same time, the upgrading and downgrading of different ISUP scores were also visualized. All the visualizations were conducted in Python v3.6.5 (Python Software Foundation) along with the machine learning classifiers with 10-fold cross-validation.

The machine learning algorithms were written using the Python SciKit-learn library except for eXtreme Gradient Boosting (XGBoost), which was written in an individual library called “XgBoost” (21). The logistic regression implemented in SciKit-Learn library was regularized logistic regression. Four classifiers were used including two base classifiers and two ensemble learning-based classifiers. In all classifiers, the training set was 395 of 565 (70%) patients, and the test set was the remaining 170 patients in the two institutions. Among the whole dataset, discrete variables were input to the one-hot encoder, denoting values taken on by categorical (discrete) features. The output would be a sparse matrix where each column corresponded to one possible value of one feature.

Firstly, logistic regression, one of the most classic classifiers to estimate the probability that a patient would have a particular outcome on the basis of related information or clinical characteristics, was used (22). Secondly, a support vector machine (SVM), a mathematical entity for maximizing a particular mathematical function with respect to a given collection of data, was used. The kernel function was used to “space up” the data to higher dimensions, which allowed the data to be linearly separated (23).

A scalable end-to-end tree-boosting system called XGBoost was then used, which could be built with a smaller sample size than most existing systems for ensemble learning-based classifiers. XGBoost is based on gradient tree boosting, an algorithm with which new models are created that predict the residuals of prior models and are then added together for the final prediction (24).

Finally, a random forest within a multiple decision tree model was used (25). Each tree was developed from a bootstrap sample of the training dataset, and each node was the part that was the best among a haphazardly chosen subset of features. The class predictions created by each tree within the forest were amassed, and the ultimate prediction was based on the lion’s share vote (26).

Parameter Tuning

For all classifiers, we needed to choose optimal parameters based on the training set. A grid search was applied to iterate through each parameter combination. The best parameter set was confirmed by first plotting the receiver operating characteristic (ROC) curve and then selecting the one with the maximum area under the ROC curve (AUC). All classifiers were used to make predictions on the test set.

Model Evaluation

The performance of these classifiers was mainly evaluated by using overall accuracy, sensitivity, specificity, and AUC. Overall accuracy was the total correct ratio on all test sets. Sensitivity, similar to recall, indicated the proportion of upgraded PCa patients correctly identified by the classifier. Specificity indicated the proportion of patients who were correctly classified as “no-upgrade.” AUC indicated the probability that the classifier would have a higher prediction between “upgrade” and “no-upgrade” cases. All randomizations involved a random seed number of 0.

Feature Importance

To evaluate the importance of these features and to figure out the most contributed features, the Gini index was leveraged. In the random-forest classifier, every time a node was split on a single feature, the Gini impurity criterion for the two descendent nodes was less than that of the parent node. Adding up the Gini decreases for each feature over all trees in the forest gives fast feature importance, defined as follows:

Gini(p)=k=1Kpk(1pk)

In the formula, K represents the total number of categories and pk the probability that a case is divided into k categories in the case of a single feature (16). Our study was a binary classification, so K was equal to 2.

Nomogram and Evaluation

A nomogram was proposed to visually predict the risk of upgrading. A nomogram was composed of graphical lines of risk factors, points, total points, and upgrading probability. A calibration plot was used to validate how well the nomogram was calibrated. The x-axis of the calibration plot was defined as predicted probability, and the y-axis was defined as the actual probability (27).

Predicted probability was a prevalence rate calculated by a proposed nomogram. Actual probability could be calculated by dividing the number of patients with the same predicted probability by the total number of patients. An ideal line was drawn at a 45° angle in the calibration plot. The nomogram and calibration plot were plotted with R version 3.6.0 in package “rms” version 5.1-4.

Results

Baseline Information

From October 2016 to February 2020, a total of 694 patients underwent RARP after prostate biopsy in Drum Tower Hospital and Molinette Hospital. Of these patients, 179 were excluded because they did not meet the criteria. The remaining 515 were included (Figure 1). Clinical characteristics of patients are summarized in Table 1.

TABLE 1
www.frontiersin.org

Table 1 Clinical characteristics of patients.

Upgrading and Downgrading Under Different Biopsy Methods

Regardless of the combination of SB and TB, of these 515 patients, only 245 (48.15%) cases were in accord with postoperative whole-mount histopathology, accompanied by 120 (23.30%) cases upgrading and 147 (27.18%) cases downgrading (Figure 2 and Table 2).

FIGURE 2
www.frontiersin.org

Figure 2 Volume plot of pathological results from combined biopsy to final RARP according to ISUP grade group. The shade of color reflects the number. ISUP, International Society of Urological Pathology; RARP, robot-assisted laparoscopic radical prostatectomy.

TABLE 2
www.frontiersin.org

Table 2 Concordance, upgrade, and downgrade of Gleason score according to different biopsy methods.

The combined biopsy had a better concordance of postoperative histopathology than SB only (48.15% vs. 40.19%, p = 0.012). But there was no difference between the combined biopsy and TB only (48.15% vs. 42.33%, p = 0.069). The combined biopsy could significantly reduce the upgrading rate of postoperative pathology, in comparison to SB only (23.30% vs. 39.61%, p < 0.0001) or TB only (23.30% vs. 40.19%, p < 0.0001). Meanwhile, the combined biopsy would lead to a higher postoperative pathology downgrade rate (27.18% vs. 20.19% in SB, p = 0.002; and 27.18% vs. 17.48% in TB, p < 0.0001) (Table 2 and Supplementary Figures 1, 2). The pathology detail of primary and secondary Gleason patterns from SB and TB to RP is shown in Supplementary Table 2.

Upgrading and Downgrading Under Different International Society of Urological Pathology

Under different ISUP guidelines, 65 (53.28%), 29 (20.42%), 14 (11.29%), and 12 (10.81%) patients with ISUP1, ISUP2, ISUP3, and ISUP4, respectively, had upgrades as compared with postoperative whole-mount histopathology. At the same time, ISUP2, ISUP3, ISUP4, and ISUP5 had 8 (5.63%), 54 (43.54%), 74 (66.67%), and 10 (62.50%) patients with histopathological downgrades, respectively. One patient with confirmed ISUP1 by biopsy was found to have no PCa in the final histopathology (Figure 2 and Supplementary Table 3).

For different levels of upgrading and downgrading, most patients would upgrade to ISUP2 from ISUP1, regardless of the biopsy method. Very few patients upgrade (1.55%) or downgrade (9.13%) two levels or more (Supplementary Table 3).

When we identified clinically significant PCa using different ISUP levels (csPC-1: ISUP > 1; csPCa-2: ISUP > 2), there were 65 (53.28%) or 34 (12.88%) cases of clinically significant upgrading and 14 (3.56%) or 85 (33.86%) cases of clinically significant downgrading (Figure 3).

FIGURE 3
www.frontiersin.org

Figure 3 Change plot of clinically significant pathological upgrade or downgrade from combined biopsy to final radical prostatectomy according to ISUP grade group. csPCa-1 was defined as ISUP grade group 2 or higher tumors. csPCa-2 was defined as ISUP grade group 3 or higher tumors. The size of dot reflects the number.csPCa, clinically significant prostate cancer; nsPCa, non-significant prostate cancer; ISUP, International Society of Urological Pathology.

Different Classifier Results

We used several supervised machine learning algorithms to predict pathology upgrades at RP. We established some dichotomous models in which the label was set to be an upgrade or otherwise. The overall accuracy was 0.703, 0.768, 0.794, and 0.761 for logistic regression, random forest, XGBoost, and SVM, respectively, adjusted for optimal parameters. Their AUC values were 0.674, 0.670, 0.711, and 0.679, respectively. The detailed parameters are described in Table 3, and the ROC curve is shown in Figure 4.

TABLE 3
www.frontiersin.org

Table 3 Parameters and performance of machine learning algorithms.

FIGURE 4
www.frontiersin.org

Figure 4 The ROC results of machine learning models. ROC, receiver operating characteristic curve; LR, logistic regression; XGboost, eXtreme Gradient Boosting; SVM, support vector machine.

Subsequently, the feature importance calculated using the Gini index was demonstrated, as shown in Supplementary Figure 3. The top four important features were ISUP score in a TB, primary Gleason pattern (G1) score in a TB, ISUP score in an SB, and G1 score in an SB. These features were later used in the construction of nomograms.

Nomogram Construction for Upgrade

We constructed the nomogram to predict the risk of upgrading using 10 risk factors including the top four important features, shown in Figure 5A. The longer the line for each feature, the higher the score. When these scores were summed, the higher the total points, the greater the probability of pathology upgrading. When the total points were below 520, the probability of pathology upgrading was less than 10%, while when the total points were above 650, the probability of pathology upgrading would rise to about 0.7.

FIGURE 5
www.frontiersin.org

Figure 5 (A) The nomogram of pathological upgrade prediction. (B) Calibration plots of observed and predicted probability of pathological upgrade. ISUP, International Society of Urological Pathology; TB, targeted biopsy; G1, primary Gleason pattern; SB, systematic biopsy; G2, secondary Gleason pattern; RL, right–left diameter; PSA, prostate-specific antigen; PI-RADS, Prostate Imaging-Reporting and Data System.

A calibration plot was created with 1,000 repetitions boot in Figure 5B. The mean absolute error was 0.04 in these 515 patients.

Discussion

Accurate GS is still the strongest decision factor of PCa management and predictor of oncologic outcomes. However, overall pathological characteristics of the prostate cannot be presented by biopsy sampling, as the GG in needle biopsies has poor reproducibility and lack correlation with corresponding RP specimens (28). Approximately 30%–50% of cases will be misled by the biopsy grading system (5, 29). In our multicenter cohort, we found that the overall concordance rate of GG was 48.15% from biopsy to RP.

In recent decades, a relatively growing number of men with PCa are opting for therapies other than RP, such as focal therapy, radiation therapy, or active surveillance, which made biopsy grade more important in therapeutic choices (30). Clinicians are more concerned about the upgrade rates, as the only tissue sampled is from the needle biopsy. The confirmatory biopsy setting was upgraded with a range of 18%–30% of cases, who are previously detected to have PCa and were on active surveillance (31).

Pre-biopsy MRI scan and MRI-guided TB have been widely recommended for men with naïve biopsy or repeat biopsy, as several studies have been proven the improvement of detective rates of PCa or clinically significant PCa (8, 20, 32). Moreover, after RP, pathological upgrade is less likely to occur with MRI-guided TB as previously described (33, 34). Our results showed a significantly lower upgrade rate in the combined biopsy, versus SB or TB alone (both p < 0.0001). There was no difference in GG upgrade between SB (39.61%) and TB (40.19%). Some studies indicated that TB alone had better results in pathologic disease upgrade than SB. A possible explanation is in these cohorts of four to six cores, or even saturated biopsy for each lesion was taken (35, 36). Conversely, only two to four cores were performed for each lesion in our protocol.

Similar to other studies (29, 37, 38), the most common pathological upgrade occurred in ISUP GG1 and GG2, accounting for 53.28% and 20.42%, respectively. And these are the two populations of PCa who were potentially selected for active surveillance and/or focal therapy. When we identified csPCa with criteria of ISUP GG3 or higher tumors (csPCa-2 in our study) as previously reported (20), the upgrade rate was 12.88%. But when we made use of other definition criteria, being ISUP GG2 or higher tumors (csPCa-1), the upgrade rate was as high as 53.28%. All these data suggest that upgrading GG may have consequences on clinical outcomes from clinically non-significant PCa to csPCa.

To solve these discrepancies, researchers have sought to predict this pathological upgrade in order to identify patients who are more suitable for active surveillance or needed better and earlier intervention. Several predictive models or analyses of adverse factors were reported, but almost all of them are based on MRI and using PI-RADS (6, 9, 39). Several investigators have included quantitative histologic features to better predict the stage and risk of disease recurrence (40, 41). However, the methods previously used are mainly regression analysis.

Along with the improvement of computer technology, machine learning-based analysis has become better at processing existing data more comprehensively and eliminating more errors (12). Moreover, machine learning methods can construct classifiers with good predictive efficacy. Recently, others combined texture features of mpMRI and machine learning methods to predict GG upgrading (42). They suggested that ADC maps of mpMRI could predict PCa GG upgrading from biopsy to RP non-invasively with satisfactory predictive efficacy. However, their analysis was carried out on a retrospective study of a small group of patients from a single center. Moreover, only TRUS-guided systemic biopsies were included.

To our knowledge, this is the first study to use machine learning to predict pathological GG upgrading including TB. Our transperineal TB cohort was derived from a prospective data collection (13, 14). We used four kinds of supervised machine learning algorithms to predict pathological upgrades from biopsy to RP. Not logistic regression, but XGBoost, was the most accurate algorithm in our cohort. Moreover, we analyzed almost all the relevant characteristics, including TB ISUP score and primary Gleason pattern (G1) score. The top four important features were the ISUP score and G1 score in TB and SB, respectively. We constructed the nomogram to predict the risk of upgrading using 10 risk factors including the top four important features. Similar to another study (6), PI-RADS was one of the most significant predictors, but we demonstrated that the ISUP score and G1 score from TB played a more important role in the prediction model. We hold the opinion that adding features of TB to the prediction model is a notable advance in the selection of candidates for active surveillance. Moreover, the results of the relatively high upgrade rate of ISUP 1 (53.28%) and ISUP2 (20.42%) group might explain the fact that patients with relatively low risk at biopsy suffer from metastasis or even death from PCa. It might suggest that the ISUP 1 and ISUP 2 population, who normally tend to receive active surveillance or focal therapy, might not be suitable candidates.

Some limitations need to be highlighted. Firstly, our study was performed on a retrospective analysis. More prospective data are needed to validate our findings. Secondly, the relatively high number of different operators performed the procedure. Likely, results would have even improved if performed by experienced operators only. In our opinion, the use of multiple operators, together with the multicenter nature, reinforces our results in terms of reproducibility. Despite the limitations, we firmly believe that the principal results of our preliminary study are sufficiently valid.

In conclusion, the combined effect of SB plus TB led to a better pathological concordance rate and the less upgrading rate from biopsy to RP. Machine learning models to predict PCa GG upgrading had satisfactory predictive efficacy. Adding features of TB to the prediction model is a notable advance in the selection of appropriate therapeutic strategies.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics Statement

Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

JZ, HG, and GM conceived the study and revised the paper. AM, MO, HH, MG, FZ, LX, YF, and BZ performed the project. YK, YW, GC, and PG collected and analyzed data. JZ, YK, and XQ wrote the draft. JZ, YW, GM, and XQ interpreted the data.

Funding

This study was supported by grants from the National Natural Science Foundation of China (81602232, 81802535, and 81974394), Natural Science Foundation of Jiangsu Province for Excellent Young Scholars (BK20200051), and Nanjing Medical Science and technique Development Foundation (GRX17127).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.785684/full#supplementary-material

References

1. Mottet M, Cornford P, van den Bergh RCN, Briers E, De Santis M, Gillessen S, et al. EAU - ESTRO ESUR - SIOG Guidelines on Prostate Cancer 2021. Arnhem, The Netherlands: European Association of Urology Guidelines Office. (2021).

PubMed Abstract | Google Scholar

2. Gearman DJ, Morlacco A, Cheville JC, Rangel LJ, Karnes RJ. Comparison of Pathological and Oncologic Outcomes of Favorable Risk Gleason Score 3 + 4 and Low Risk Gleason Score 6 Prostate Cancer: Considerations for Active Surveillance. J Urol (2018) 199(5):1188–95. doi: 10.1016/j.juro.2017.11.116

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Yang DD, Mahal BA, Muralidhar V, Nezolosky MD, Vastola ME, Labe SA, et al. Risk of Upgrading and Upstaging Among 10 000 Patients With Gleason 3+4 Favorable Intermediate-Risk Prostate Cancer. Eur Urol Focus (2019) 5(1):69–76. doi: 10.1016/j.euf.2017.05.011

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Le JD, Stephenson S, Brugger M, Lu DY, Lieu P, Sonn GA, et al. Magnetic Resonance Imaging-Ultrasound Fusion Biopsy for Prediction of Final Prostate Pathology. J Urol (2014) 192(5):1367–73. doi: 10.1016/j.juro.2014.04.094

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Audenet F, Rozet F, Resche-Rigon M, Bernard R, Ingels A, Prapotnich D, et al. Grade Group Underestimation in Prostate Biopsy: Predictive Factors and Outcomes in Candidates for Active Surveillance. Clin Genitourin Cancer (2017) 15(6):e907–13. doi: 10.1016/j.clgc.2017.04.024

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Alqahtani S, Wei C, Zhang Y, Szewczyk-Bieda M, Wilson J, Huang Z, et al. Prediction of Prostate Cancer Gleason Score Upgrading From Biopsy to Radical Prostatectomy Using Pre-Biopsy Multiparametric MRI PIRADS Scoring System. Sci Rep (2020) 10(1):7722. doi: 10.1038/s41598-020-64693-y

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Ahmed HU, El-Shater Bosaily A, Brown LC, Gabe R, Kaplan R, Parmar MK, et al. Diagnostic Accuracy of Multi-Parametric MRI and TRUS Biopsy in Prostate Cancer (PROMIS): A Paired Validating Confirmatory Study. Lancet (2017) 389(10071):815–22. doi: 10.1016/s0140-6736(16)32401-1

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kasivisvanathan V, Rannikko AS, Borghi M, Panebianco V, Mynderse LA, Vaarala MH, et al. MRI-Targeted or Standard Biopsy for Prostate-Cancer Diagnosis. N Engl J Med (2018) 378(19):1767–77. doi: 10.1056/NEJMoa1801993

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Gandaglia G, Ploussard G, Valerio M, Mattei A, Fiori C, Roumiguie M, et al. The Key Combined Value of Multiparametric Magnetic Resonance Imaging, and Magnetic Resonance Imaging-Targeted and Concomitant Systematic Biopsies for the Prediction of Adverse Pathological Features in Prostate Cancer Patients Undergoing Radical Prostatectomy. Eur Urol (2020) 77(6):733–41. doi: 10.1016/j.eururo.2019.09.005

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Pepe P, Garufi A, Priolo GD, Galia A, Fraggetta F, Pennisi M. Is it Time to Perform Only Magnetic Resonance Imaging Targeted Cores? Our Experience With 1,032 Men Who Underwent Prostate Biopsy. J Urol (2018) 200(4):774–8. doi: 10.1016/j.juro.2018.04.061

PubMed Abstract | CrossRef Full Text | Google Scholar

11. LeCun Y, Bengio Y, Hinton G. Deep Learning. Nature (2015) 521(7553):436–44. doi: 10.1038/nature14539

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Liu H, Tang K, Peng E, Wang L, Xia D, Chen Z. Predicting Prostate Cancer Upgrading of Biopsy Gleason Grade Group at Radical Prostatectomy Using Machine Learning-Assisted Decision-Support Models. Cancer Manag Res (2020) 12:13099–110. doi: 10.2147/CMAR.S286167

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Marra G, Zhuang J, Marquis A, Zhao X, Calleris G, Kan Y, et al. Pain in Men Undergoing Transperineal Free-Hand Multiparametric Magnetic Resonance Imaging Fusion Targeted Biopsies Under Local Anesthesia: Outcomes and Predictors From a Multicenter Study of 1,008 Patients. J Urol (2020) 204(6):1209–15. doi: 10.1097/ju.0000000000001234

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Marra G, Zhuang J, Beltrami M, Calleris G, Zhao X, Marquis A, et al. Transperineal Freehand Multiparametric MRI Fusion Targeted Biopsies Under Local Anaesthesia for Prostate Cancer Diagnosis: A Multicentre Prospective Study of 1014 Cases. BJU Int (2021) 127(1):122–30. doi: 10.1111/bju.15121

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Huang H, Wang W, Lin T, Zhang Q, Zhao X, Lian H, et al. Comparison of the Complications of Traditional 12 Cores Transrectal Prostate Biopsy With Image Fusion Guided Transperineal Prostate Biopsy. BMC Urol (2016) 16(1):68. doi: 10.1186/s12894-016-0185-z

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Kan Y, Zhang Q, Hao J, Wang W, Zhuang J, Gao J, et al. Clinico-Radiological Characteristic-Based Machine Learning in Reducing Unnecessary Prostate Biopsies of PI-RADS 3 Lesions With Dual Validation. Eur Radiol (2020) 30(11):6274–84. doi: 10.1007/s00330-020-06958-8

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Barentsz JO, Weinreb JC, Verma S, Thoeny HC, Tempany CM, Shtern F, et al. Synopsis of the PI-RADS V2 Guidelines for Multiparametric Prostate Magnetic Resonance Imaging and Recommendations for Use. Eur Urol (2016) 69(1):41–9. doi: 10.1016/j.eururo.2015.08.038

PubMed Abstract | CrossRef Full Text | Google Scholar

18. van der Leest M, Cornel E, Israel B, Hendriks R, Padhani AR, Hoogenboom M, et al. Head-To-Head Comparison of Transrectal Ultrasound-Guided Prostate Biopsy Versus Multiparametric Prostate Resonance Imaging With Subsequent Magnetic Resonance-Guided Biopsy in Biopsy-Naive Men With Elevated Prostate-Specific Antigen: A Large Prospective Multicenter Clinical Study. Eur Urol (2019) 75(4):570–8. doi: 10.1016/j.eururo.2018.11.023

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Epstein JI, Egevad L, Amin MB, Delahunt B, Srigley JR, Humphrey PA, et al. The 2014 International Society of Urological Pathology (ISUP) Consensus Conference on Gleason Grading of Prostatic Carcinoma: Definition of Grading Patterns and Proposal for a New Grading System. Am J Surg Pathol (2016) 40(2):244–52. doi: 10.1097/PAS.0000000000000530

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Rouvière O, Puech P, Renard-Penna R, Claudon M, Roy C, Mège-Lechevallier F, et al. Use of Prostate Systematic and Targeted Biopsy on the Basis of Multiparametric MRI in Biopsy-Naive Patients (MRI-FIRST): A Prospective, Multicentre, Paired Diagnostic Study. Lancet Oncol (2019) 20(1):100–9. doi: 10.1016/s1470-2045(18)30569-2

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Abraham A, Pedregosa F, Eickenberg M, Gervais P, Mueller A, Kossaifi J, et al. Machine Learning for Neuroimaging With Scikit-Learn. Front Neuroinform (2014) 8:14. doi: 10.3389/fninf.2014.00014

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Tolles J, Meurer WJ. Logistic Regression: Relating Patient Characteristics to Outcomes. JAMA (2016) 316(5):533–4. doi: 10.1001/jama.2016.7653

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Huang S, Cai N, Pacheco PP, Narrandes S, Wang Y, Xu W. Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genomics Proteomics (2018) 15(1):41–51. doi: 10.21873/cgp.20063

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Ogunleye A, Wang QG. XGBoost Model for Chronic Kidney Disease Diagnosis. IEEE/ACM Trans Comput Biol Bioinform (2020) 17(6):2131–40. doi: 10.1109/TCBB.2019.2911071

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Yang L, Wu H, Jin X, Zheng P, Hu S, Xu X, et al. Study of Cardiovascular Disease Prediction Model Based on Random Forest in Eastern China. Sci Rep (2020) 10(1):5245. doi: 10.1038/s41598-020-62133-5

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Pavey TG, Gilson ND, Gomersall SR, Clark B, Trost SG. Field Evaluation of a Random Forest Activity Classifier for Wrist-Worn Accelerometer Data. J Sci Med Sport (2017) 20(1):75–80. doi: 10.1016/j.jsams.2016.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Iasonos A, Schrag D, Raj GV, Panageas KS. How to Build and Interpret a Nomogram for Cancer Prognosis. J Clin Oncol (2008) 26(8):1364–70. doi: 10.1200/JCO.2007.12.9791

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Epstein JI, Feng Z, Trock BJ, Pierorazio PM. Upgrading and Downgrading of Prostate Cancer From Biopsy to Radical Prostatectomy: Incidence and Predictive Factors Using the Modified Gleason Grading System and Factoring in Tertiary Grades. Eur Urol (2012) 61(5):1019–24. doi: 10.1016/j.eururo.2012.01.050

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Athanazio D, Gotto G, Shea-Budgell M, Yilmaz A, Trpkov K. Global Gleason Grade Groups in Prostate Cancer: Concordance of Biopsy and Radical Prostatectomy Grades and Predictors of Upgrade and Downgrade. Histopathology (2017) 70(7):1098–106. doi: 10.1111/his.13179

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Marra G, Ploussard G, Ost P, De Visschere PJL, Briganti A, Gandaglia G, et al. Focal Therapy in Localised Prostate Cancer: Real-World Urological Perspective Explored in a Cross-Sectional European Survey. Urol Oncol (2018) 36(12):529.e511–529.e522. doi: 10.1016/j.urolonc.2018.08.013

CrossRef Full Text | Google Scholar

31. Shapiro DD, Gregg JR, Lim AH, Nogueras-González GM, Choi H, Kang HC, et al. Comparing Confirmatory Biopsy Outcomes Between MRI-Targeted Biopsy and Standard Systematic Biopsy Among Men Being Enrolled in Prostate Cancer Active Surveillance. BJU Int (2021) 127(3):340–8. doi: 10.1111/bju.15100

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Ahdoot M, Wilbur AR, Reese SE, Lebastchi AH, Mehralivand S, Gomella PT, et al. MRI-Targeted, Systematic, and Combined Biopsy for Prostate Cancer Diagnosis. N Engl J Med (2020) 382(10):917–28. doi: 10.1056/NEJMoa1910038

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Goel S, Shoag JE, Gross MD, Al Hussein Al Awamlh B, Robinson B, Khani F, et al. Concordance Between Biopsy and Radical Prostatectomy Pathology in the Era of Targeted Biopsy: A Systematic Review and Meta-Analysis. Eur Urol Oncol (2020) 3(1):10–20. doi: 10.1016/j.euo.2019.08.001

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Wenzel M, Preisser F, Wittler C, Hoeh B, Wild PJ, Tschabunin A, et al. Correlation of MRI-Lesion Targeted Biopsy vs. Systematic Biopsy Gleason Score With Final Pathological Gleason Score After Radical Prostatectomy. Diagn (Basel) (2021) 11(5):882. doi: 10.3390/diagnostics11050882

CrossRef Full Text | Google Scholar

35. Porpiglia F, Passera R, Manfredi M, Mele F, Bollito E, De Pascale A, et al. Multiparametric-Magnetic Resonance/Ultrasound Fusion Targeted Prostate Biopsy Improves Agreement Between Biopsy and Radical Prostatectomy Gleason Score. Anticancer Res (2016) 36(9):4833–9. doi: 10.21873/anticanres.11045

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Calio BP, Sidana A, Sugano D, Gaur S, Maruf M, Jain AL, et al. Risk of Upgrading From Prostate Biopsy to Radical Prostatectomy Pathology-Does Saturation Biopsy of Index Lesion During Multiparametric Magnetic Resonance Imaging-Transrectal Ultrasound Fusion Biopsy Help? J Urol (2018) 199(4):976–82. doi: 10.1016/j.juro.2017.10.048

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Demirtaş A, Sönmez G, Tombul Ş T, Demirtaş T, Akgün H. Comparison of the Upgrading Rates of International Society of Urological Pathology Grades and Tumor Laterality in Patients Undergoing Standard 12-Core Prostate Biopsy Versus Fusion Prostate Biopsy for Prostate Cancer. Urol Int (2019) 103(3):256–61. doi: 10.1159/000501528

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Shoag JE, Cai PY, Gross MD, Gaffney C, Li D, Mao J, et al. Impact of Prebiopsy Magnetic Resonance Imaging on Biopsy and Radical Prostatectomy Grade Concordance. Cancer (2020) 126(13):2986–90. doi: 10.1002/cncr.32821

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Liu H, Tang K, Xia D, Wang X, Zhu W, Wang L, et al. Added Value of Biparametric MRI and TRUS-Guided Systematic Biopsies to Clinical Parameters in Predicting Adverse Pathology in Prostate Cancer. Cancer Manag Res (2020) 12:7761–70. doi: 10.2147/CMAR.S260986

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Freedland SJ, Terris MK, Csathy GS, Kane CJ, Amling CL, Presti JC Jr., et al. Preoperative Model for Predicting Prostate Specific Antigen Recurrence After Radical Prostatectomy Using Percent of Biopsy Tissue With Cancer, Biopsy Gleason Grade and Serum Prostate Specific Antigen. J Urol (2004) 171(6 Pt 1):2215–20. doi: 10.1097/01.ju.0000124463.13319.0a

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Pepe P, Fraggetta F, Galia A, Grasso G, Piccolo S, Aragona F. Is Quantitative Histologic Examination Useful to Predict Nonorgan-Confined Prostate Cancer When Saturation Biopsy is Performed? Urology (2008) 72(6):1198–202. doi: 10.1016/j.urology.2008.05.045

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Xie J, Li B, Min X, Zhang P, Fan C, Li Q, et al. Prediction of Pathological Upgrading at Radical Prostatectomy in Prostate Cancer Eligible for Active Surveillance: A Texture Features and Machine Learning-Based Analysis of Apparent Diffusion Coefficient Maps. Front Oncol (2020) 10:604266. doi: 10.3389/fonc.2020.604266

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: prostate cancer, biopsy, upgrade, prostatectomy, prediction, machine learning

Citation: Zhuang J, Kan Y, Wang Y, Marquis A, Qiu X, Oderda M, Huang H, Gatti M, Zhang F, Gontero P, Xu L, Calleris G, Fu Y, Zhang B, Marra G and Guo H (2022) Machine Learning-Based Prediction of Pathological Upgrade From Combined Transperineal Systematic and MRI-Targeted Prostate Biopsy to Final Pathology: A Multicenter Retrospective Study. Front. Oncol. 12:785684. doi: 10.3389/fonc.2022.785684

Received: 29 September 2021; Accepted: 01 March 2022;
Published: 07 April 2022.

Edited by:

Antonina Mitrofanova, Rutgers, The State University of New Jersey, United States

Reviewed by:

Ning Li, Fourth Affiliated Hospital of China Medical University, China
Pietro Pepe, Cannizzaro Hospital, Italy
Jose Eduardo Tavora, Faculdade de Ciências Médicas de Minas Gerais (FCMMG), Brazil

Copyright © 2022 Zhuang, Kan, Wang, Marquis, Qiu, Oderda, Huang, Gatti, Zhang, Gontero, Xu, Calleris, Fu, Zhang, Marra and Guo. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Giancarlo Marra, ZHJnaWFuY2FybG9tYXJyYUBnbWFpbC5jb20=; Hongqian Guo, ZHIuZ2hxQG5qdS5lZHUuY24=

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.