Evaluation of the Efficiency of MRI-Based Radiomics Classifiers in the Diagnosis of Prostate Lesions

Li, Linghao; Gu, Lili; Kang, Bin; Yang, Jiaojiao; Wu, Ying; Liu, Hao; Lai, Shasha; Wu, Xueting; Jiang, Jian

doi:10.3389/fonc.2022.934108

ORIGINAL RESEARCH article

Front. Oncol., 05 July 2022

Sec. Genitourinary Oncology

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.934108

This article is part of the Research TopicRadiomics and Radiogenomics in Genitourinary Oncology: Artificial Intelligence and Deep Learning ApplicationsView all 9 articles

Evaluation of the Efficiency of MRI-Based Radiomics Classifiers in the Diagnosis of Prostate Lesions

Linghao Li^1†

Lili Gu^2†

Bin Kang¹

Jiaojiao Yang¹

Ying Wu¹

Hao Liu¹

Shasha Lai¹

Xueting Wu¹

Jian Jiang^1*

¹Department of Radiology, the First Affiliated Hospital, Nanchang University, Nanchang, China
²Department of Pain, the First Affiliated Hospital, Nanchang University, Nanchang, China

Objective: To compare the performance of different imaging classifiers in the prospective diagnosis of prostate diseases based on multiparameter MRI.

Methods: A total of 238 patients with pathological outcomes were enrolled from September 2019 to July 2021, including 142 in the training set and 96 in the test set. After the regions of interest were manually segmented, decision tree (DT), Gaussian naive Bayes (GNB), XGBoost, logistic regression, random forest (RF) and support vector machine classifier (SVC) models were established on the training set and tested on the independent test set. The prospective diagnostic performance of each classifier was compared by using the AUC, F1-score and Brier score.

Results: In the patient-based data set, the top three classifiers of combined sequences in terms of the AUC were logistic regression (0.865), RF (0.862), and DT (0.852); RF “was significantly different from the other two classifiers (P =0.022, P =0.005), while logistic regression and DT had no statistical significance (P =0.802). In the lesions-based data set, the top three classifiers of combined sequences in terms of the AUC were RF (0.931), logistic regression (0.922) and GNB (0.922). These three classifiers were significantly different from.

Conclusion: The results of this experiment show that radiomics has a high diagnostic efficiency for prostate lesions. The RF classifier generally performed better overall than the other classifiers in the experiment. The XGBoost and logistic regression models also had high classification value in the lesions-based data set.

Introduction

Benign prostatic hyperplasia and prostate cancer (PCa) are common diseases in middle-aged and elderly men worldwide. The incidence of PCa has remained high in China, and the trend is increasing year by year. It is an important disease that seriously affects men’s health (1, 2). Prospectively, the diagnosis and staging of prostate diseases is of important clinical value and greatly influences the follow-up treatment and prognosis of patients (3).

Multiparameter magnetic resonance imaging (MP-MRI), including T2-weighted imaging (T2WI), diffusion-weighted imaging (DWI), and dynamic contrast-enhanced (DCE) imaging, has been considered promising by the Prostate Imaging Reporting and Data System (PI-RADS) v2 (4). Combined with transrectal ultrasound biopsy, it can provide an effective diagnostic approach for prostate lesions (5). However, as an invasive examination, additional medical burdens and patient trauma may occur in actual clinical work. PI-RADS 2.1, updated in 2019, proposed biparametric MRI (bpMRI), including T2WI and DWI (6), and several studies have suggested that the application of bpMRI will not reduce the diagnostic accuracy of PCa (7, 8). Although PI-RADS v2 has been used as an important reference tool for the clinical assessment of benign and malignant prostate lesions, problems still exist in that it is limited by the depth and experience of the user and the user’s understanding of the guidelines. Therefore, it is of great value to explore a noninvasive, highly accurate and quantitative analysis diagnostic method.

Through image processing technology, radiomics uses a large amount of feature data extracted from medical images to explore possible high-latitude histopathological information with low visual recognition, which can be used to build a machine learning algorithm model. The diagnostic accuracy of radiomics mainly depends on the selected features and classifiers. A growing number of studies have demonstrated the potential of radiomics in the diagnosis of prostate diseases (9–11). Kendrick J et al. analyzed recent prostate imaging studies and suggested that radiomic analysis showed significant potential for diagnosis, prognosis and prediction in the clinical management of metastatic PCa (mPCa) (12). The radiomic line diagram established by Li et al. showed high accuracy in predicting PI-RADS = 3 prostate lesions (13). The study of Qi et al. confirmed that the introduction of prostate imaging diagnosis can effectively predict PCa before surgery and reduce unnecessary biopsy (14). Bourbonne, V et al. established an omics and neural learning network to predict lymph node invasion of PCa (15). In the above studies, due to different data sets and data processing methods, it is difficult to objectively compare the classification efficiency among various classifiers. At the same time, there are differences in the management of prostate lesions. The region of interest (ROI) delineation of (16) and (17) et al. was based on lesion region division of whole glandular tissue, which meant that the image of a single patient would only be used as a single data point. Bonekamp D et al. (18), on the basis of the former division, treated each lesion area as a separate data, which means that multiple experimental data points may be derived from the same glandular tissue, and they constructed a data set based on lesions. The former tests the diagnostic ability of imaging for benign and malignant glandular tissues, while the latter tends to explore the classification performance of the model in specific lesion areas.

This study used a large amount of data from clinical MRI images, and double parameters were used based on the whole ROI sketch of glands and the pathological changes in each area. Meanwhile, some of the same steps as the above studies were taken, such as image preprocessing, feature extraction and imaging. This study aims to verify the use of radiomics in the diagnosis of prostate diseases and to develop more image omics classifiers for prostate lesions based on bpMRI.

Materials and Methods

Patient Information

This study was approved by the Ethics Committee of the First Affiliated Hospital of Nanchang University. From September 2019 to July 2021, the Department of Imaging at the First Affiliated Hospital of Nanchang University recruited a total of 872 patients who underwent 1.5 T prostate mpMRI. The inclusion criteria were as follows: (1) mpMRI scan, including ADC, DWI and T2WI-FS(T2-weighted Fat-sat imaging), was performed; (2) after MRI examination, transrectal ultrasound (TRUS)-guided prostate biopsy or radical prostatectomy was performed, and pathological results were obtained; and (3) there was no prostate endocrine therapy, biopsy, surgery or radiotherapy performed before MRI examination. The exclusion criteria were as follows: (1) incomplete image sequence; (2) inability to determine the location or boundary of specific lesions on MRI; and (3) serious artifacts on mpMRI.

Ultimately, a total of 238 patients were recruited for the study: 114 patients with PCa and 124 patients pathologically confirmed to have no tumor cells. The patients were randomly divided into two groups (training group and test group) at a ratio of 6:4. The recruitment process is shown in Figure 1.

FIGURE 1

Figure 1 Flowchart of patient recruitment and screening.

MRI Parameters

All patients underwent MRI scans with a Siemens 1.5 T magnetic resonance scanner. The patients were placed in the supine position and scanned from the iliac spine to the lower margin of the symphysis pubis. The parameters of DWI were as follows: fast spin echo sequence, field of view of 180 mm×200 mm, layer thickness of 4 mm, layer spacing of 1, TR of 4100 ms, TE of 91 ms, matrix of 256×256, and b values of 0, 800, and 1,600. Automatically after b =800, processing and reconstruction of the ADC image were performed.

Pathology Reference Standard

The pathological data consisted of TRUS biopsy results and postoperative examination results of radical prostatic eradication. All patients underwent TRUS-guided 12-core systematic biopsies, and needle biopsies were performed on the suspected lesion areas on MRI. An ESAOTE Mylab Twice High-end Color Doppler diagnostic instrument was used as the end-injection dual-plane cavity probe (TRT33, convex array frequency 5.5-8.5 mHz, linear array frequency 5.5-10 mHz). The biopsy was performed by a senior urologist with over 5 years of experience. Histopathological specimens were evaluated by experienced pathologists from our hospital according to the Gleason Scoring system updated by the International Society of Urology Pathology (ISUP) in 2014.

Lesion Segmentation and PI-RADS Assessment

Two researchers (with more than three years of experience in PCa diagnosis) used ITK-SNAP (http://www.itksnap.org/pmwiki/pmwiki.php?n=Downloads.SNAP3) to sketch the ROI of the same set of images independently without regard to other clinical and pathological information. Consensus was reached on any conflicts during this process through discussion. The PI-RADS score was independently assigned by two investigators (with more than three years of PCa diagnostic experience). Two weeks later, the given score sample was assessed for the second time. Divergent scores were resolved after discussion. The final segmented images and scores were reviewed by a diagnostic urological imaging specialist (Figure 2). A total of 151 positive lesions (i.e., tumor cells were found in pathological reports) and 139 negative lesions (i.e., no tumor cells were found in pathological reports) were obtained.

FIGURE 2

Figure 2 A 59-year-old man diagnosed with csPCa in PZ (FPSA, 0.04 ng/mL; TPSA, 4.27 ng/mL; biopsy GS, 4 + 4 = 8). Example segmentations (red masks) of the tumor overlaid on axial T2-weighted fat-sat imaging (T2WI-fs) (A), apparent diffusion coefficient (ADC) map (B), and diffusion-weighted imaging (DWI) (C).

Feature Extraction and Selection

All images were normalized before feature extraction. The images were normalized with the mean and standard deviation as the center. The Sitk.sitkBSpline interpolation method was used to resample all the voxels of the image as 1*1*1 mm, and the bin width was set to 25. Radiomics feature calculations were performed using the PyRadiomics package (https://github.com/Radiomics/pyradiomics). In each volume of interest (VOI), seven imaging radiomics features were calculated, including first-order statistics, shape-based features, gray level co-occurrence matrix (GLCM), gray level run length matrix (GLRLM), gray level size zone matrix (GLSZM)), neighboring gray tone difference matrix (NGTDM) and gray level dependence matrix (GLDM). A total of 321 radiomic features could be extracted from a single sample. To eliminate the characteristic error caused by intergroup and intragroup differences, two radiologists independently plotted ROIs on 50 patient images and calculated the intergroup and intragroup correlation coefficients. The following analysis only included features with intraclass correlation coefficients (ICCs) greater than 0.90. Before feature screening, all data were standardized. The variance test algorithm and t test were used to filter the extracted features. Then, the least absolute shrinkage and selection operator (LASSO) regression method was used to select the best performing features for the classifier model, and fivefold cross-validation was used.

Model Construction and Statistical Analysis

Model construction and statistical analysis were based on Python 3.7.9 (v. 3.7.9; https://www.python.org/; email exchange with factcheck.org on 28 September 2020). Decision tree, Gaussian naive Bayes (GNB), XGBoost (XGB), logistic regression, random forest (RF), and support vector machine classifier (SVC) models were all constructed by Scikit-learn (http://scikit-learn.org/stable/index.html). For the combined sequence model, the feature data of three sequences were connected before screening and model operation. Mesh traversal and cross-validation were used to optimize the model parameters. Statistical analysis included variance test, t test, receiver operating characteristic (ROC), precision, recall, F1-score, and Brier score. ROC describes the performance of a binary classification system under varying discrimination thresholds. Precision refers to the proportion of positive samples in positive cases determined by the classifier, while recall refers to the proportion of positive cases predicted to the total number of positive cases. The F1-score is a measure of classification problems. Some machine learning competitions with multiple classification problems often use the F1-score as the final evaluation method. The harmonic mean of precision and recall was calculated. Brier scores are primarily used to measure the accuracy of predictions and are applicable to tasks in which probabilities must be assigned to a set of mutually exclusive discrete outcomes. Lower Brier scores indicate that the predicted results are closer to the actual classification. The above statistical calculations are based on the SCIPY library (http://www.lfd.uci.edu/~gohlke/pythonlibs/#scipy). T tests and DeLong tests of two independent samples were used to compare the statistical significance of the combined sequence model and the difference in the ROC curves. The above two tests were performed with SPSS 26.0 and MedCalc, respectively.

Result

Subject Characteristics and Distribution of Prostate Lesions

A total of 238 patients were enrolled: 114 patients with PCa and 124 patients pathologically confirmed to be tumor-free. A total of 151 lesions were delineated in patients with PCA, including 5 PI-RADS 3, 87 PI-RADS 4, and 59 PI-RADS 5 lesions. A total of 139 lesions were delineated in the 124 benign patients, including 81 PI-RADS 2 and 58 PI-RADS 3 lesions. The patients and lesions were randomly divided into two groups at a ratio of 6:4. There were 142 patients in the training set and 96 patients in the test set. There were 174 lesions in the training set and 116 lesions in the test set. Epidemiological data are shown in Table 1.

TABLE 1

Table 1 Patient characteristics.

Patient-Based Classification Results

With the data of 238 patients, we constructed a T2WI-FS model using 7 features, an ADC model using 12 features, and a DWI model using 15 features. Three T2WI-FS features, 6 ADC features and 4 DWI features were used to construct a hybrid model. The top five most important features of each sequence are shown in Table 2.

TABLE 2

Table 2 Top five most important parameters in each model.

The ADC sequence and DWI sequence showed high accuracy and specificity in each classifier. In the ADC model, the top two classifiers with the highest area under the curve (AUC) values were XGB (0.907) and SVC (0.893). The top two classifiers with the lowest Brier scores were also the above two models, with scores of 0.072 and 0.086, respectively. In the DWI model, the top two classifiers with the highest AUC values were RF (0.910) and logistic regression (0.870). The top three classifiers with the lowest Brier scores were SVC (0.083), RF (0.094) and logistic regression (0.094). In T2WI-FS, RF (0.813) and SVC (0.804) had the highest AUC values. The top two classifiers with the lowest Brier scores were SVC (0.133) and RF (0.141). In the combined sequences, the top two classifiers with the highest AUC values were logistic regression (0.865) and RF (0.862). The top two classifiers with the lowest Brier scores were RF (0.105) and SVC (0.108).

On the t test of two independent samples based on the combined sequence, RF showed significant differences from the other five classifiers except XGB. SVC showed significant differences from GNB, XGB and RF. Based on DeLong test of the combined sequence, RF showed significant differences from DT, GNB and XGB. SVC showed a significant difference from GNB and XGB.

The specific data are shown in Table 3, and the p values and DeLong tests of each classifier on the combined sequence model are shown in Table 4.

TABLE 3

Table 3 Accuracy, precision, recall, F1-score, AUC, and Brier score results of mpMRI and combined models based on patients for predicting PCa.

TABLE 4

Table 4 P value and DeLong test of each classifier on the sequence-combined model based on patients for predicting PCa.

Lesions-Based Classification Results

With the data of 290 lesions, 12 features were selected to construct the T2WI-FS model, 9 features were selected to construct the ADC model, and 11 features were selected to construct the DWI model. Three T2WI-FS features, 6 ADC features and 5 DWI features were used to construct a hybrid model. The top five most important features and their weights are shown in Table 2.

Except for T2WI-FS, the accuracy and specificity of the focus-based model were improved compared with the former. ADC and DWI also showed generally higher classification efficiencies than T2WI-FS in this experiment. In the ADC model, the top two classifiers with the highest AUC values were GNB (0.940) and SVC (0.927). The top two classifiers with the lowest Brier scores were RF (0.054) and GNB (0.055). In the DWI model, the top two classifiers with the highest AUC values were XGB (0.957) and logistic regression (0.940). The top two classifiers with the lowest Brier scores were XGB (0.048) and logistic regression (0.061). In T2WI-FS, the top three classifiers with the highest AUC values were RF (0.784), SVC (0.741) and logistic regression (0.741). The top two classifiers with the lowest Brier scores were SVC (0.164) and RF (0.169). In the combined sequences, the top three classifiers with the highest AUC values were RF (0.931), logistic regression (0.922) and GNB (0.922). The top two classifiers with the lowest Brier scores were XGB (0.063) and GNB (0.071).

In the t test of two independent samples based on the combined sequence, most of the classifiers showed significant differences from the predicted results of other classifiers. The DeLong test showed that all RF classifiers except GNB had significant differences in ROC curves. The specific data are shown in Table 5. The p value and DeLong test of each classifier on the combined sequence model are shown in Table 6. The comparison of the two data sets and the calibration curve of the combined model are shown in Figure 3.

TABLE 5

Table 5 Accuracy, precision, recall, F1-score, AUC, and Brier score results of mpMRI and combined models based on lesions for predicting PCa.

TABLE 6

Table 6 P value and DeLong test of each classifier on the sequence-combined model based on lesions for predicting PCa.

FIGURE 3

Figure 3 AUC and Brier score of the combined model based on two data sets (A, B); ROC curve of the combined model based on patients (C); ROC curve of the combined model based on lesions (D); combined model calibration curve based on patients (E); combined model calibration curve based on lesions (F).

4 Discussion

The use of MRI is of great value in the diagnosis and staging of PCa. T2WI-FS is usually used to show the correlation between changes in the internal anatomical structure of the prostate and surrounding tissues. DWI images quantify the activity degree of water molecule movement. Tumor tissues on DWI images with high b values often show high signals due to limited water molecule movement. ADC images calculate the signal change rate relative to the b value through the DWI of different b values, and tumor tissues often show a lower change rate than normal tissues. In this study, the above three MRI sequences were used to quantitatively evaluate the diagnosis of prostate lesions by radiomics, and the diagnostic efficacy of six radiomics classifiers was tested. Both the AUC and Brier score can effectively represent model classification ability. The P value and DeLong test were used to reflect whether there was a significant difference in the output results of the classifier. In terms of the current experimental results, the classification effect and stability of RF were better, and there were significant differences with other classifiers in most cases. This is similar to the experimental conclusion in the imaging classification of PCa recently reported by Zhang (19). SVC, XGB, logistic regression, and GNB also have good performance in ADC and DWI. This is similar to the logistic regression model based on imaging and clinical data established by Li et al. and the SVM model based on mpMRI established by Wang et al. (20, 21). Recent studies on the reproducibility of mpMRI imaging in PCa confirmed the diagnostic value of feature-based mpMRI (22, 23).

The diagnostic value of ADC and DWI sequences in PCa has been confirmed by a large number of studies (24, 25). The latest version of the PI-RADS guidelines proposed a bpMRI scoring scheme including T2WI-FS and DWI, and several studies have investigated its diagnostic efficacy and aggressiveness. We found that in the patient-based and focus-based ADC data sets, first-order features accounted for 30%, texture features accounted for 50%, and shape features accounted for 20% of the top five selected features. It is suggested that in addition to ADC image texture and shape information, its own features and ADC values also have certain diagnostic value. The research of Xu (26) and Zhang (27) has proven this conjecture.

At the same time, we found that ADC and DWI models performed better than T2WI-FS models in most cases, whether on patient-based or lesions-based data sets. In addition, on ADC and DWI, the classification performance of a single-lesion-based classifier is generally higher than that of a patient-based classifier. This is similar to the experimental results of the RF classifier constructed by using ADC and T2WI-FS combinations in a previous study (18). The bpMRI-based sequence tested in study (28) had similar results. We suspect the following reasons for the poor classification effect of T2WI-FS: (1) as anatomical imaging, T2WI-FS images, compared with functional MRI such as ADC and DWI, lay more emphasis on the display of the physiological structure, with more complex signals and greater interference from surrounding tissues. (2) The shapes and textures of tumor tissues on T2WI-FS are more diversified. When multiple lesions appear simultaneously in a sample, the similar signal performance can provide a certain reference for the classification of the model. (3) The ROI was manually sketched in this study, although many methods were adopted to avoid the influence of subjective factors such as the doctor’s experience. However, it is undeniable that there are still some errors, especially in the division of tumor tissue boundaries. In T2WI-FS images, the contrast between the tumor tissue signal and normal tissue signal is usually lower than that in ADC and DWI, and the volume of single lesion tissue is usually smaller than that of all lesions based on patients. To some extent, the diagnostic ability of the classifier affected by artificial error may decline more obviously.

In our study, multiple researchers collaborated to complete the ROI delineation of the image, and the clinical information and pathological results of the patients were not obtained before ROI delineation. At the same time, when a single sequence was sketched, no reference was made to other sequence images, and the sketching range was only related to the organizational signals in the sketched image. In different sequences, there may be some differences in ROI at the same location, which is more obvious in tumor tissue boundaries and benign hyperplasia. In a single sequence experiment, all the information provided to the algorithm comes from a single sequence image. The heterogeneity of different sequences may be more beneficial to the algorithm.

In this experiment, the DCE sequence was not included in the data range. The diagnostic value of DCE is pointed out by PI-RADS, and theoretically, the analysis of tissue blood supply will be useful in the identification of tumors. However, several studies suggested that there was no significant difference between mpMRI and bpMRI, including DCE, in the diagnosis of csPCa (29–31). Additionally, clinical indicators were not included in the model construction. A series of clinical indicators, including PSA, can improve classification ability in combination with radiomics. This experiment focuses on exploring the advantages and disadvantages of different classifiers in different sequences.

Single institution data were used in this study. Though single institution experimental data will help researchers to understand data structure, balance categories and conduct in-depth analysis and discussion, there are shortcomings in verifying the universality of the model. In order to overcome such problems, our experiment through gray normalization and resampling; The robustness and repeatability of the model have been improved by using large data volume and as standardized parameter setting as possible. Thus, the differences caused by feature instability and device heterogeneity could be alleviated to some extent. The establishment of large number of data, multi-center database, and comprehensive analysis of multi-device parameter images may discover more correlation between radiomics, genomics and pathophysiology, and strengthen mutual demonstration and in-depth research among various fields. This will be the follow-up research direction of our study (32–34).

The images of indwelling catheters or complications of inflammation were retained. An earlier review suggested that ureteral stents were susceptible to bacterial infesting and that patients with long-term indwelling catheters were at increased risk of urinary tract infections (35). In practice, some imaging manifestations of inflammation and malignant lesions overlap, especially when they are accompanied by benign prostatic hyperplasia and hypertrophy. In order to make the model more widely usable, the researchers responsible for the delineation used PI-RADS guidelines and experience to determine the degree of malignancy in suspected inflammatory areas. For areas that are more likely to be malignant, we used malignant label, but cases will still be classified according to pathological findings.

The limitations of this experiment are as follows: a) This study is a single institution retrospective study, and multi institution data can be used for subsequent evaluation (36). (b) The PZ and TZ regions were not distinguished, although previous studies have shown that the sensitivity and specificity of models can be improved to some extent by distinguishing them. However, in our research data, due to TZ hyperplasia, hypertrophy and PZ atrophy, it was difficult to accurately divide some samples into regions. (c) It was difficult to provide follow-up imaging data because some of the volunteers underwent prostate eradication after MRI scanning, so all the recruited volunteers provided only one MRI sample in our experiment. This means that characteristic data fluctuations caused by individual differences will be unavoidable, and difficult to carry out follow-up and disease progression studies

In conclusion, our study once again demonstrates the value of radiomics in the diagnosis of prostate disease. ADC and DWI were superior to T2WI-FS in the vast majority of cases on both datasets. Among the six classifiers included in the experiment, the classification performance of RF was more accurate and stable. In this study, feature extraction, model construction and other steps were based on the open Python algorithm, which can be easily and quickly constructed and operate in clinical practices, and diagnosis prostate lesions accurately.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by First affiliated hospital of Nanchang University. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

BK, JY, YW, and SL collected the data. HL, and LL analyzed the data. LG, and XW discussed the results. All authors contributed to the article and approved the submitted version.

Funding

This study was funded by the National Natural Science Foundation of China (Grant/Award Number: “81960313”) and Key research and development plans of Jiangxi Provincial Department of Science and Technology (Grant/Award Numbers: “S2020ZPYFB2343”).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We are grateful to all the members who have devoted time and energy to this project, and pay high tribute to all the subjects who have contributed data.

Abbreviations

MRI, Magnetic resonance imaging; MP-MRI, Multiparameter magnetic resonance imaging; DT, decision tree; GNB, Gaussian naive Bayes; XGBoost, eXtreme Gradient Boosting; RF, Random Forest; SVC, Support vector Classifier; AUC, Area under the curve; DWI, Diffusion-weighted imaging; T2WI-FS, T2-weighted Fat-sat imaging; ADC, Apparent diffusion coefficient; PCA, prostate cancer; TZ, Transition zone; PZ, Peripheral; LASSO, Least Absolute Shrinkage and Selection Operator; ROC, Receiver operating characteristic; VOI, Volumetric interest; ROI, Region of interest.

References

1. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global Cancer Statistics 2018: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: Cancer J Clin (2018) 68(6):394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Perez-Lopez R, Tunariu N, Padhani AR, Oyen WJG, Fanti S, Vargas HA, et al. Imaging Diagnosis and Follow-Up of Advanced Prostate Cancer: Clinical Perspectives and State of the Art. Radiology (2019) 292(2):273–86. doi: 10.1148/radiol.2019181931

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Weinreb JC, Barentsz JO, Choyke PL, Cornud F, Haider MA, Macura KJ, et al. PI-RADS Prostate Imaging - Reporting and Data System: 2015, Version 2. Eur Urol (2016) 69(1):16–40. doi: 10.1016/j.eururo.2015.08.052

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Vasavada SR, Dobbs RW, Kajdacsy-Balla AA, Abern MR, Moreira DM. Inflammation on Prostate Needle Biopsy is Associated With Lower Prostate Cancer Risk: A Meta-Analysis. J Urol (2018) 199(5):1174–81. doi: 10.1016/j.juro.2017.11.120

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Turkbey B, Rosenkrantz AB, Haider MA, Padhani AR, Villeirs G, Macura KJ, et al. Prostate Imaging Reporting and Data System Version 2.1: 2019 Update of Prostate Imaging Reporting and Data System Version 2. Eur Urol (2019) 76(3):340–51. doi: 10.1016/j.eururo.2019.02.033

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Liang Z, Hu R, Yang Y, An N, Duo X, Liu Z, et al. Is Dynamic Contrast Enhancement Still Necessary in Multiparametric Magnetic Resonance for Diagnosis of Prostate Cancer: A Systematic Review and Meta-Analysis. Trans Androl Urol (2020) 9(2):553–73. doi: 10.21037/tau.2020.02.03

CrossRef Full Text | Google Scholar

8. Chen T, Zhang Z, Tan S, Zhang Y, Wei C, Wang S, et al. MRI Based Radiomics Compared With the PI-RADS V2.1 in the Prediction of Clinically Significant Prostate Cancer: Biparametric vs Multiparametric MRI. Front Oncol (2021) 11:792456. doi: 10.3389/fonc.2021.792456

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Stanzione A, Gambardella M, Cuocolo R, Ponsiglione A, Romeo V, Imbriaco M. Prostate MRI Radiomics: A Systematic Review and Radiomic Quality Score Assessment. Eur J Radiol (2020) 129:109095. doi: 10.1016/j.ejrad.2020.109095

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Bi WL, Hosny A, Schabath MB, Giger ML, Birkbak NJ, Mehrtash A, et al. Artificial Intelligence in Cancer Imaging: Clinical Challenges and Applications. CA: Cancer J Clin (2019) 69(2):127–57. doi: 10.3322/caac.21552

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Checcucci E, Autorino R, Cacciamani GE, Amparore D, De Cillis S, Piana A, et al. Artificial Intelligence and Neural Networks in Urology: Current Clinical Applications. Minerva Urol Nefrol Ital J Urol Nephrol (2020) 72(1):49–57. doi: 10.23736/S0393-2249.19.03613-0

CrossRef Full Text | Google Scholar

12. Kendrick J, Francis R, Hassan GM, Rowshanfarzad P, Jeraj R, Kasisi C, et al. Radiomics for Identification and Prediction in Metastatic Prostate Cancer: A Review of Studies. Front Oncol (2021) 11:771787. doi: 10.3389/fonc.2021.771787

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Li T, Sun L, Li Q, Luo X, Luo M, Xie H, et al. Development and Validation of a Radiomics Nomogram for Predicting Clinically Significant Prostate Cancer in PI-RADS 3 Lesions. Front Oncol (2021) 11:825429. doi: 10.3389/fonc.2021.825429

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Qi Y, Zhang S, Wei J, Zhang G, Lei J, Yan W, et al. Multiparametric MRI-Based Radiomics for Prostate Cancer Screening With PSA in 4-10 Ng/mL to Reduce Unnecessary Biopsies. J Magnetic Resonance Imaging JMRI (2020) 51(6):1890–9. doi: 10.1002/jmri.27008

CrossRef Full Text | Google Scholar

15. Bourbonne V, Jaouen V, Nguyen TA, Tissot V, Doucet L, Hatt M, et al. Development of a Radiomic-Based Model Predicting Lymph Node Involvement in Prostate Cancer Patients. Cancers (2021) 13(22):5672. doi: 10.3390/cancers13225672

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Bagher-Ebadian H, Janic B, Liu C, Pantelic M, Hearshen D, Elshaikh M, et al. Detection of Dominant Intra-Prostatic Lesions in Patients With Prostate Cancer Using an Artificial Neural Network and MR Multi-Modal Radiomics Analysis. Front Oncol (2019) 9:1313. doi: 10.3389/fonc.2019.01313

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Bertelli E, Mercatelli L, Marzi C, Pachetti E, Baccini M, Barucci A, et al. Machine and Deep Learning Prediction Of Prostate Cancer Aggressiveness Using Multiparametric MRI. Front Oncol (2021) 11:802964. doi: 10.3389/fonc.2021.802964

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Bonekamp D, Kohl S, Wiesenfarth M, Schelb P, Radtke JP, Götz M, et al. Radiomic Machine Learning for Characterization of Prostate Lesions With MRI: Comparison to ADC Values. Radiology (2018) 289(1):128–37. doi: 10.1148/radiol.2018173064

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Zhang L, Zhe X, Tang M, Zhang J, Ren J, Zhang X, et al. Predicting the Grade of Prostate Cancer Based on a Biparametric MRI Radiomics Signature. Contrast Media Mol Imag (2021) 2021:7830909. doi: 10.1155/2021/7830909

CrossRef Full Text | Google Scholar

20. Li M, Chen T, Zhao W, Wei C, Li X, Duan S, et al. Radiomics Prediction Model for the Improved Diagnosis of Clinically Significant Prostate Cancer on Biparametric MRI. Quantitative Imaging Med Surg (2020) 10(2):368–79. doi: 10.21037/qims.2019.12.06

CrossRef Full Text | Google Scholar

21. Wang J, Wu CJ, Bao ML, Zhang J, Wang XN, Zhang YD. Machine Learning-Based Analysis of MR Radiomics Can Help to Improve the Diagnostic Performance of PI-RADS V2 in Clinically Relevant Prostate Cancer. Eur Radiol (2017) 27(10):4082–90. doi: 10.1007/s00330-017-4800-5

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Lu H, Parra NA, Qi J, Gage K, Li Q, Fan S, et al. Repeatability of Quantitative Imaging Features in Prostate Magnetic Resonance Imaging. Front Oncol (2020) 10:551. doi: 10.3389/fonc.2020.00551

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Russo F, Manfredi M, Panebianco V, Armando E, De Luca S, Mazzetti S, et al. Radiological Wheeler Staging System: A Retrospective Cohort Analysis to Improve the Local Staging of Prostate Cancer With Multiparametric MRI. Minerva Urol Nefrol Ital J Urol Nephrol (2019) 71(3):264–72. doi: 10.23736/S0393-2249.19.03248-X

CrossRef Full Text | Google Scholar

24. Stabile A, Dell'Oglio P, Soligo M, De Cobelli F, Gandaglia G, Fossati N, et al. Assessing the Clinical Value of Positive Multiparametric Magnetic Resonance Imaging in Young Men With a Suspicion of Prostate Cancer. Eur Urol Oncol (2021) 4(4):594–600. doi: 10.1016/j.euo.2019.05.006

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Salvaggio G, Calamia M, Purpura P, Bartolotta TV, Picone D, Dispensa N, et al. Role of Apparent Diffusion Coefficient Values in Prostate Diseases Characterization on Diffusion-Weighted Magnetic Resonance Imaging. Minerva Urol Nefrol Ital J Urol Nephrol (2019) 71(2):154–60. doi: 10.23736/S0393-2249.18.03065-5

CrossRef Full Text | Google Scholar

26. Xu L, Zhang G, Zhao L, Mao L, Li X, Yan W, et al. Radiomics Based on Multiparametric Magnetic Resonance Imaging to Predict Extraprostatic Extension of Prostate Cancer. Front Oncol (2020) 10:940. doi: 10.3389/fonc.2020.00940

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Zhang KS, Schelb P, Kohl S, Radtke JP, Wiesenfarth M, Schimmöller L, et al. Improvement of PI-RADS-Dependent Prostate Cancer Classification by Quantitative Image Assessment Using Radiomics or Mean ADC. Magnetic Resonance Imag (2021) 82:9–17. doi: 10.1016/j.mri.2021.06.013

CrossRef Full Text | Google Scholar

28. Xu M, Fang M, Zou J, Yang S, Yu D, Zhong L, et al. Using Biparametric MRI Radiomics Signature to Differentiate Between Benign and Malignant Prostate Lesions. Eur J Radiol (2019) 114:38–44. doi: 10.1016/j.ejrad.2019.02.032

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Alabousi M, Salameh JP, Gusenbauer K, Samoilov L, Jafri A, Yu H, et al. Biparametric vs Multiparametric Prostate Magnetic Resonance Imaging for the Detection of Prostate Cancer in Treatment-Naïve Patients: A Diagnostic Test Accuracy Systematic Review and Meta-Analysis. BJU Int (2019) 124(2):209–20. doi: 10.1111/bju.14759

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Tamada T, Kido A, Yamamoto A, Takeuchi M, Miyaji Y, Moriya T, et al. Comparison of Biparametric and Multiparametric MRI for Clinically Significant Prostate Cancer Detection With PI-RADS Version 2. J Magnetic Resonance Imaging JMRI (2021) 53(1):283–91. doi: 10.1002/jmri.27283

CrossRef Full Text | Google Scholar

31. Xu L, Zhang G, Shi B, Liu Y, Zou T, Yan W, et al. Comparison of Biparametric and Multiparametric MRI in the Diagnosis of Prostate Cancer. Cancer Imaging Off Publ Int Cancer Imaging Soc (2019) 19(1):90. doi: 10.1186/s40644-019-0274-9

CrossRef Full Text | Google Scholar

32. Mayerhoefer ME, Materka A, Langs G, Häggström I, Szczypiński P, Gibbs P, et al. Introduction to Radiomics. J Nucl Med Off Publ Soc Nucl Med (2020) 61(4):488–95. doi: 10.2967/jnumed.118.222893

CrossRef Full Text | Google Scholar

33. Fiz F, Viganò L, Gennaro N, Costa G, La Bella L, Boichuk A, et al. Radiomics of Liver Metastases: A Systematic Review. Cancers (2020) 12(10):2881. doi: 10.3390/cancers12102881

CrossRef Full Text | Google Scholar

34. Fornacon-Wood I, Faivre-Finn C, O'Connor JPB, Price GJ. Radiomics as a Personalized Medicine Tool in Lung Cancer: Separating the Hope From the Hype. Lung Cancer (Amsterdam Netherlands) (2020) 146:197–208. doi: 10.1016/j.lungcan.2020.05.028

CrossRef Full Text | Google Scholar

35. Cormio L, La Forgia P, La Forgia D, Siitonen A, Ruutu M. Is It Possible to Prevent Bacterial Adhesion Onto Ureteric Stents? Urol Res (1997) 25(3):213–6. doi: 10.1007/BF00941985

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Urraro F, Nardone V, Reginelli A, Varelli C, Angrisani A, Patanè V, et al. MRI Radiomics in Prostate Cancer: A Reliability Study. Front Oncol (2021) 11:805137. doi: 10.3389/fonc.2021.805137

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: MRI, RF, SVC, radiomic, prostate cancer

Citation: Li L, Gu L, Kang B, Yang J, Wu Y, Liu H, Lai S, Wu X and Jiang J (2022) Evaluation of the Efficiency of MRI-Based Radiomics Classifiers in the Diagnosis of Prostate Lesions. Front. Oncol. 12:934108. doi: 10.3389/fonc.2022.934108

Received: 02 May 2022; Accepted: 07 June 2022;
Published: 05 July 2022.

Edited by:

Elena Bertelli, Careggi University Hospital, Italy

Reviewed by:

Enrico Checcucci, IRCCS Candiolo Cancer Institute, Italy
Daniele La Forgia, IRCCS Istituto Tumori Giovanni Paolo II, Italy

Copyright © 2022 Li, Gu, Kang, Yang, Wu, Liu, Lai, Wu and Jiang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jian Jiang, amlqMjAwMmNuQDEyNi5jb20=

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Evaluation of the Efficiency of MRI-Based Radiomics Classifiers in the Diagnosis of Prostate Lesions

Introduction

Materials and Methods

Patient Information

MRI Parameters

Pathology Reference Standard

Lesion Segmentation and PI-RADS Assessment

Feature Extraction and Selection

Model Construction and Statistical Analysis

Result

Subject Characteristics and Distribution of Prostate Lesions

Patient-Based Classification Results

Lesions-Based Classification Results

4 Discussion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

Acknowledgments

Abbreviations

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good