- 1Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, Hong Kong SAR, China
- 2Department of Biomedical Engineering, The Hong Kong Polytechnic University, Hong Kong, Hong Kong SAR, China
- 3Research Institute for Smart Aging, The Hong Kong Polytechnic University, Hong Kong, Hong Kong SAR, China
Objective: High-risk prostate cancer (PCa) is often treated by prostate-only radiotherapy (PORT) owing to its favourable toxicity profile compared to whole-pelvic radiotherapy. Unfortunately, more than 50% patients still developed disease progression following PORT. Conventional clinical factors may be unable to identify at-risk subgroups in the era of precision medicine. In this study, we aimed to investigate the prognostic value of pre-treatment planning computed tomography (pCT)-based radiomic features and clinical attributes to predict 5-year progression-free survival (PFS) in high-risk PCa patients following PORT.
Materials and methods: A total of 176 biopsy-confirmed PCa patients who were treated at the Hong Kong Princess Margaret Hospital were retrospectively screened for eligibility. Clinical data and pCT of one hundred eligible high-risk PCa patients were analysed. Radiomic features were extracted from the gross-tumour-volume (GTV) with and without applying Laplacian-of-Gaussian (LoG) filter. The entire patient cohort was temporally stratified into a training and an independent validation cohort in a ratio of 3:1. Radiomics (R), clinical (C) and radiomic-clinical (RC) combined models were developed by Ridge regression through 5-fold cross-validation with 100 iterations on the training cohort. A model score was calculated for each model based on the included features. Model classification performance on 5-year PFS was evaluated in the independent validation cohort by average area-under-curve (AUC) of receiver-operating-characteristics (ROC) curve and precision-recall curve (PRC). Delong’s test was used for model comparison.
Results: The RC combined model which contains 6 predictive features (tumour flatness, root-mean-square on fine LoG-filtered image, prostate-specific antigen serum concentration, Gleason score, Roach score and GTV volume) was the best-performing model (AUC = 0.797, 95%CI = 0.768-0.826), which significantly outperformed the R-model (AUC = 0.795, 95%CI = 0.774-0.816) and C-model (AUC = 0.625, 95%CI = 0.585-0.665) in the independent validation cohort. Besides, only the RC model score significantly classified patients in both cohorts into progression and progression-free groups regarding their 5-year PFS (p< 0.05).
Conclusion: Combining pCT-based radiomic and clinical attributes provided superior prognostication value regarding 5-year PFS in high-risk PCa patients following PORT. A large multi-centre study will potentially aid clinicians in implementing personalised treatment for this vulnerable subgroup in the future.
1 Introduction
Prostate cancer (PCa) ranks the second highest globally in terms of the prevalence of male malignancies, with more than 1.4 million new cases diagnosed in 2020 (1). High-risk PCa accounts for over one-third of the newly diagnosed PCa population, with a three-fold greater risk of developing distant metastasis compared to their low-risk counterparts. The management strategies of these two cohorts differ drastically. The low-risk PCa usually requires only active surveillance, while high-risk ones require combined modality therapy such as surgery, radiotherapy, systemic chemotherapy or hormonal therapy (2). Optimising treatment strategy for the high-risk is challenging.
Clinically undetectable occult pelvic nodal metastasis is commonly present in up to 40% of high-risk PCa patients (3). However, reliable detection of occult pelvic lymph node (PLN) metastasis is yet available for clinical use (4). It remains as an unresolved clinical challenge as to whether PLN should be prophylactically treated. The survival benefits and toxicity profiles of prostate-only radiotherapy (PORT), or prophylactic whole-pelvic radiotherapy (WPRT) were vigorously investigated in large randomised controlled trials (e.g. RTOG-9413), national database analysis and retrospective studies among the western population (5–9). Hence, the trade-off between better survival with WPRT and reduced toxicities with PORT is still highly debated.
Although the 5-year survival of high-risk PCa patients drastically increased by 23% over the years with greater emphasis on health-related quality of life (HRQoL) (10, 11), over 50% of high-risk PCa patients receiving PORT experienced recurrence, which was far higher than the 12.5% recurrence from the WPRT cohort (6). It is evident that a more refined subgrouping is necessary to predict which high-risk patient receiving PORT would experience recurrence within 5 years to support the clinical decision. Two commonly used conventional risk stratification tools are the National Institute for Health and Care Excellence (NICE) guideline and the Roach formula. The NICE guideline stratified patients into low, intermediate and high risk by clinical (c)T stage, prostate-specific antigen (PSA) serum concentration and Gleason score (GS) (12). Although all are prognostic markers (13–15), the cT stage is determined by digital rectal examination (DRE) that is subjected to high interobserver variability because (16) only a small portion of the prostate is palpable (17). The Roach score (RS) is also commonly used for risk stratification, based on the PSA and GS (18). A score of ≥15% is considered high-risk. A recent study reported that the RS has statistically significantly higher predictive power than the NICE guideline, with a concordance index of 0.724 and 0.715 respectively (13). However, the RS tends to overestimate the risk of occult PLN disease by 2.5 to 4 times among high-risk patients. This would result in the over-treatment of patients and compromise the therapeutic index (19, 20). Therefore, both NICE and RS may not be sufficiently effective in the era of precision medicine. There is a growing demand for developing a more refined, personalised risk stratification method for predicting treatment outcomes of high-risk PCa patients.
Recent advancement in artificial intelligence and radiomics accelerates the development of precision and personalised medicine. Radiomics adopts high-throughput methods to extract high-dimensional radiological features, transforming them into imaging biomarkers to improve clinical decisions (21). Magnetic resonance imaging (MRI) has been extensively studied, employing derived radiomic features for diagnosis or risk prediction of PCa. They showed promising classification performance on clinical endpoints such as GS or biochemical recurrence (BCR) (22–26). By contrast, very few investigations were conducted on the prognostic value of imaging biomarkers derived from the pCT of high-risk PCa patients, despite that the prognostic power of CT-based radiomics has been widely reported in other types of primary solid tumours such as non-small cell lung cancer, nasopharyngeal carcinoma and renal cell carcinoma (27–30). Franzese et al. (31) was the only study employing pCT-derived radiomic features to predict metastasis-free survival in PCa patients treated by external beam radiotherapy (EBRT). However, the included patients in their study were treated with various kinds of treatment, including trans-urethral resection of the prostate, high-intensity focused ultrasound, and EBRT. The prognostic power of their model specifically on PORT-treated high-risk PCa patients remains to be explored.
To our best knowledge, this is the first study to investigate the feasibility of utilising pCT-derived radiomic features and clinical attributes to predict 5-year progression-free survival (PFS) in high-risk localised PCa patients following PORT. Recently, a systematic review in prostate radiomics suggested that incorporating clinical features into a radiomic model may improve its clinical utility (32). Given the inherently heterogeneous nature of the disease, conventional clinical factors may provide additional prognostic value (33). The success of this study may provide insightful findings for clinicians to optimise treatment strategies for managing high-risk PCa patients.
2 Methods and materials
2.1 Patients
2.1.1 Patient source
The present study was approved by the Human Subjects Ethics Sub-committee of the Hong Kong Polytechnic University (Reference: HSEARS20220406011) and Kowloon West Cluster Research Ethics Committee (KWC-REC) of the Hong Kong Hospital Authority (Reference: KW/EX-21-155 (165–05)). The requirement for individual informed consent was waived due to the retrospective nature of this study. One hundred and seventy-six biopsy-proven PCa patients who underwent definitive PORT at the Princess Margaret Hospital (PMH) in Hong Kong between February 2011 and December 2016 were retrospectively screened for eligibility. Following the inclusion and exclusion criteria (IEC) shown in Figure 1, 100 patients with localised (cT1-3, N0, M0) disease, with RS ≥ 15% (i.e. high-risk PCa) were included in this study.
Figure 1 Patient stratification and inclusion-exclusion criteria. PMH, Princess Margaret Hospital; n denotes number of patients; TRUS, transrectal ultrasound; RS, Roach Score; pCT, planning computed tomography; GTV, gross tumour volume; PORT, prostate-only radiotherapy; AJCC, American Joint Committee on Cancer; PSA, prostate specific antigen; GS, Gleason score.
2.1.2 Patient data
Clinical information such as the age at diagnosis, PORT start date and disease characteristics (clinical tumour (T), nodal (N) and distant metastatic (M) staging, histology, PSA serum concentration, GS, RS); treatment information (prescription and period of PORT and hormonal therapy, treatment techniques and organ contours); imaging information (pCT registered with organ contours); and clinical outcome (status of biochemical failure, regional and distant metastasis) were collected retrospectively.
2.1.3 Treatment approach and clinical endpoint
All patients were treated with PORT using intensity-modulated radiotherapy (IMRT), to a total of 70 or 74Gy. The treatment regimen also included neoadjuvant and concurrent antiandrogen and/or luteinising hormone-releasing hormone analogue (LHRHa) for 8-12 weeks, and 3 years of adjuvant LHRHa.
For this study, the clinical endpoint was the 5-year PFS status. Patients with disease progression manifested as biochemical recurrence, regional or distant metastasis, or death (34) were labelled “PFS-1”, with the others labelled as “PFS-0”. The Phoenix criteria of biochemical recurrence, defined as > 2 ng/mL rise from nadir PSA (35), was adopted in this study.
2.1.4 Patient stratification
The enrolled patients with treatment commenced on or before the date of 08/11/2016 were allocated to a training dataset (n = 75), and the remaining ones were assigned to an independent validation dataset (n = 25). The PFS-1 to PFS-0 ratio between the training and independent validation cohort was set at 6:4, referencing a similar work adopting temporal stratification (36). This patient stratification approach has been widely adopted in similar studies (36–38), which complies with the recommendation provided in the Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) type-2b classification (39).
2.2 Planning CT acquisition and volume-of-interest segmentation
Patients underwent iodinated contrast-enhanced (intravenous injection of 120mL Omnipaque 300 mg/dL at 3 mL/second with 75 seconds scan delay) on one of the two CT scanners: 16-slice GE LightSpeed RT16 or GE BrightSpeed (GE Medical Systems, WI, USA). The pCT acquisition parameters included: X-ray tube voltage 120 kVp or 140 kVp; X-ray tube current 114-376 mA; field-of-view 500-650 mm; body filter; standard convolution kernel; matrix size 512x512; pixel spacing 0.98-1.27 mm; and reconstruction thickness 2.5 mm. The pCT scans were acquired 24 (range, 6-47) days on average before PORT commencement. All pCT scans were stored in Picture Archiving and Communication System (PACS) in Digital Imaging and Communications in Medicine (DICOM) format. All pCTs scans were registered with GTV segmentation, which was the VOI for radiomic feature extraction and modelling in this study. The GTV of all patients were segmented by experienced oncologists on the Eclipse ARIA system (version 8.6.15 or 13.0.26, Varian Medical System). The delineation of VOI is performed according to the ESTRO ACROP consensus guideline (40). A team of oncologists with expertise in prostate cancer radiotherapy delineated the ROI. To address the inter-observer variabilities, the contours were all reviewed and approved by a Consultant Oncologist before use. Calcification, necrosis, nor artifacts due to fiducial markers were not found in all the included cases in this study. The GTV segmentation of a representative patient is illustrated in Figure 2.
Figure 2 Representative example of a GTV segmentation on contrast-enhanced pCT of a high-risk prostate cancer patient.
2.3 CT image pre-processing
All pCT images were pre-processed before radiomic feature extraction, in compliance with the recommendations provided in the internationally accepted Image Biomarker Standardisation Initiative (IBSI) guideline (41). All steps were implemented by an in-house developed pipeline (IhDP) which used an open-source Pyradiomics v2.2.0 with SimpleITK v1.2.4 package on Python v3.7.3 platform. pCT was first resampled to 1 mm isotropic voxels by B-spline interpolation to account for voxel spacing variation while avoiding longitudinal image oversampling (41). Resampled pCT were discretized to 10-Hounsfield unit (HU) bins to homogenise intensity resolution (42). The GTV was re-segmented by HU thresholding, bounded by -150 and 180 HU (43). Six kernel sizes of Laplacian of Gaussian (LoG) filters, 0.5, 1, 2, 3, 4 and 5 mm, were used to reconstruct the image, facilitating fine and coarse texture feature extraction at different resolutions (44).
2.4 Feature extraction & feature normalization
A total of 665 radiomic features from the GTV were extracted using the IhDP. Types of the extracted features of both unfiltered and LoG-filtered images included shape (n = 14), first-order intensity (n = 108) and texture (n = 543). Details of the extracted features classes and their distributions are shown in Supplementary Material.
Before analysis, all radiomic features were normalised by using z-score transformation using R software (version 4.1.3). Specifically, features were centred and scaled firstly in the training cohort in which each feature had a mean of 0 and a standard deviation of 1. The normalisation factors obtained in the training cohort were then used to perform feature normalisation in the independent validation cohort.
2.5 Feature selection & model development
Relevant feature selection processes including the Spearman correlation coefficient (SCC) and Mann-Whitney U test were performed on the training cohort before the use of Ridge regression algorithms (44–46). The independent validation cohort was adopted for independent model validation. All model training and validation were performed using the R software.
For the radiomic model development, the SCC was first calculated for each pair of radiomic features in the training cohort using the R package “caret”. When the absolute correlation coefficient (r) was ≥ 0.6 in any of the feature pairs, the feature with a higher mean absolute correlation was removed from the original feature set to minimise the likelihood of multicollinearity and model overfitting (21). The clinical significance of the remaining features was assessed by using a two-sided, unpaired Mann-Whitney U test in the training cohort by executing the “wilcox.test’ function in the R software, while features with p > 0.1 were removed. A L2 regularisation was performed using ridge regression in the R package “glmnet”. Ridge regression penalises regression coefficients through hyperparameter (λ) tuning in a grid search. The λ yielding a minimum cross-validation error was then chosen. The objective function was optimised through cyclical coordinate descent in the R package “glmnet”.
For the clinical model development, clinical features including PSA serum concentration, GS, RS and GTV volume formed the initial feature set. PSA was categorised into< 10, 10-20 and > 20 ng/mL while GS was classified into five groups (14). RS was divided into 4 categories: 15-25%, > 25-35%, > 35%-45% and > 45% (18). All clinical features were tested for correlation using the SCC test in the training cohort. The same correlation threshold of r ≥ 0.6 as in the R model was applied. Ridge regression was also performed.
To develop the radiomic-clinical (RC) combined model, all selected radiomic and clinical features were combined and fitted into the ridge regression. A 5-fold cross-validation was employed with 100 iterations to obtain the average predictive performance model. The model score was calculated as
Three model scores were calculated for each patient: R (R-score), C (C-score) and RC combined (RC-score) models. The R package “cvAUC” and “PRROC” were used to compute the averaged area under the receiver-operating characteristics (ROC-AUC) curves and precision-recall curve (PRC). Other performance indicators including sensitivity, specificity, accuracy and the Youden index (YI) were calculated by the R package “pROC”. The optimal cut-off in each model was determined using the Youden method (47).
2.6 Statistical analysis
All continuous variables have been verified for conformity by the Shapiro-Wilk test. Statistical differences of continuous clinical and demographic variables were evaluated by the Mann-Whitney U test, while categorical variables were assessed by Chi’s square or Fisher-exact test as appropriate. Performance of the R, C and RC combined models were compared using the Delong test, based on their averaged ROC-AUC in both training and independent validation cohorts. The correlations between RC-score, radiomic and clinical variables were evaluated by SCC to reveal any dominant features in the model that contribute heavily to the RC-score. SCC was also used to assess inter-feature correlation in the training cohort for feature screening. All statistical tests were two-sided with a value of p< 0.05 considered statistically significant.
3 Results
3.1 Patient characteristics
Among 176 patients screened for eligibility, 100 cases met the inclusion criteria. 10 included patients who experienced disease progression were labelled as “PFS-1”, in which 6 (8%) and 4 (16%) were allocated to the training and independent validation cohorts respectively. Table 1 summarises the characteristics of the enrolled patients.
No statistically significant difference was detected in the distribution of age, PSA serum concentration, GS, RS, GTV volume, neoadjuvant hormonal therapy (NHT), concurrent hormonal therapy (CHT) and adjuvant hormonal therapy (AHT) regimen between training and independent validation cohorts, except for the PORT dose scheme (p< 0.001). Furthermore, PSA, GS, RS and GTV volumes were not significantly different between PFS-1 and PFS-0 patients in both cohorts, except for the CHT regime.
3.2 Model development
A simplified modelling workflow is illustrated in Figure 3. In the R model, 40 features with high independence (r< 0.6) shown in the correlation map (Supplementary Material) underwent further elimination. Among them, the unfiltered-shape-flatness and LoG-1mm-filtered root-mean-square (RMS) features were chosen for the development of the R model. In the C model, the Spearman correlations between PSA, GS, RS and GTV volume were less than 0.6 (Supplementary Material). Therefore, all 4 clinical features were used for modelling. The R and C models consisted of 2 radiomic and 4 clinical features respectively, while these 6 features were combined in the RC model.
Figure 3 Models construction workflow. R, Radiomics; RC, Radiomic-clinical; C, clinical; n denotes number of features; ROC, receiver-operating characteristic; PR, precision-recall; YI, Youden index.
Table 2 lists the intercepts and coefficients of all three models. Patients with model scores higher than the optimal cut-off were classified as high-risk of having disease progression within 5 years since the commencement of treatment, or vice versa.
Table 2 Intercepts and selected radiomic and/or clinical features of the R, C and RC combined models.
3.3 Model evaluation
The performance of each model in both the training and independent validation cohorts is summarised in Table 3. The ROC and PR curves are shown in Figure 4. Boxplots of the model scores are presented in Figure 5.
Figure 4 The ROC and PR curves of R, C and RC combined models. Boxplots (A, B) Averaged ROC curves of the training and independent validation cohort. Boxplots (C, D) The PR curves of both cohorts. ROC, receiver-operating characteristic; PR, precision-recall; R, radiomic; C, clinical; RC, radiomic-clinical; AUC, area under curve.
Figure 5 Boxplots of model scores of R, C and RC combined models. The boxplots in first (A–C) and second rows (D–F) display the distribution of model scores calculated by different models in training and validation cohorts, respectively. R, Radiomics; C, clinical; RC, Radiomic-clinical; T, training cohort; V, validation cohort; PFS, progression-free survival.
The RC combined model yielded the highest ROC-AUC (0.797, 95%CI = 0.786-0.826) in the independent validation cohort compared to the R (0.795, 95%CI = 0.774-0.816) and C (0.625, 95%CI = 0.585-0.665) models. The DeLong test showed that the RC combined model had a ROC-AUC significantly higher than the C model in both training (0.747 vs. 0.554, p< 0.001) and independent validation (0.797 vs. 0.625, p< 0.001) cohorts. A similar finding was also observed with the RC model demonstrating a higher ROC-AUC than the R model in the independent validation cohort (0.797 vs. 0.795, p<.001). Moreover, the RC combined model had the highest PR-AUC, accuracy and YI compared to R and C models in the independent validation cohort. The RC combined model also attained a relatively high sensitivity and specificity of 0.808 and 0.722 in the training cohort and 0.793 and 0.653 in the independent validation cohort, respectively.
Of note, the RC combined model was the only model that resulted in a significantly different RC-score between PFS-1 and PFS-0 patients in both the training (median: -2.428 vs. -2.447, p = 0.01) and independent validation cohorts (median: -2.411 vs. -2.451, p = 0.03). On the other hand, the R model failed to render a significantly different R-score in the independent validation cohort (median: -2.368 vs. -2.491, p = 0.08). Similarly, the C model did not yield any significant difference in C-score for either the training (median: -2.405 vs. -2.440) or independent validation cohort (-2.383 vs. -2.425).
3.4 Correlation analysis among RC-scores, radiomic and clinical features
The average absolute SCC (r) of R features with RC-scores (r = 0.603) was ~2.6 times higher than that of C features with RC-scores (r = 0.235). Among radiomic features, LoG-1mm-filtered RMS had the highest correlation with the RC-score (r = 0.797), followed by the unfiltered-shape-flatness (r = 0.409). The clinical features with the highest and lowest correlation with RC-score were GS (r = 0.411) and GTV volume (r = 0.105) respectively. Between radiomic and clinical features, unfiltered-shape-flatness and LoG-1mm-filtered RMS had the highest correlation with RS (r = 0.372) and GTV volume (r = 0.228), respectively. The correlation matrix (Supplementary Material) described in detail the correlation between RC-scores of patients in the independent validation cohort and their corresponding R and C features.
4 Discussion
For the first time, we demonstrated that combining CT-based radiomic (unfiltered-shape-flatness, LoG-1mm-filtered RMS) and clinical attributes (PSA, GS group, RS group and GTV volume) provided superior prognostic value for 5-year PFS in high-risk PCa patients following PORT. The DeLong test revealed that the ROC-AUC of the RC combined model was significantly higher than those of the R and C models in the independent validation cohort. The RC model had the highest overall accuracy and YI. Only the RC model score significantly classified patients into “progression” (PFS-1) and “progression-free” (PFS-0) groups according to their 5-year PFS in both training and independent validation cohorts (both p< 0.05). These findings suggested the potential of the RC combined model in supporting clinicians to implement personalized treatment for this vulnerable patient subgroup in the future. For instance, if a patient is classified into the “progression” (PFS-1) group ahead of the commencement of the PORT treatment, clinicians may consider a more aggressive therapy (e.g., WPRT) for improving the prognosis of the given patient.
The two identified radiomic features (unfiltered-shape-flatness and LoG-1mm-filtered RMS) are in line with previous studies using CT images (27, 28, 48, 49). Unfiltered-shape-flatness is a shape feature calculated by the square root of the least axis length divided by the major axis length (48). A value closer to zero indicates the tumour is flatter. This feature was reported in another CT radiomic model for predicting tumour response in lung cancer patients receiving EBRT (49). Additionally, the RC-score was negatively correlated to the flatness, with a lower flatness value indicating a poorer prognosis. Such a finding is consistent with Khodabakhshi’s study, which predicted the OS of patients with renal cell carcinoma (27). The feature LoG-1 mm-filtered RMS is calculated by the mean of all squared intensity values in LoG-1mm-filtered ROI (48). Similar to flatness, it is negatively correlated to the RC-score. The RMS has been reported in a CT radiomic model study for lung adenocarcinoma, in which a lower value was associated with poorer PFS and OS (28). The prognostic implications of these features have not been reported for PCa, and the association between the selected pCT radiomic features and biological properties remains unclear. Indeed, the RC model was dominated by the R features since the absolute SCC of R features with RC-score was ~2.6 times higher than those of clinical features (Supplementary Material). Further investigation is needed to explore the biological mechanism of radiomic features.
Another notable finding is the lack of textural features in the RC combined model. It may be attributed to the intrinsic property of pCT images and the clinical endpoint of our study. Most prostate radiomic studies use MRI-derived features for prognostic prediction (22–24, 50). In a study using T2-weighted MRI, features with the highest predictive value originated from the gray-level run-length matrix (GLRLM) texture feature class (23). Another two studies using apparent diffusion coefficient (ADC) MRI for radiomics modelling (23, 24) suggested that the gray-level co-occurrence matrix (GLCM) texture feature class contained the most predictive features. However, in a prostate radiomic study (51) exploring interfraction cone-beam CT, both shape and first-order features have excellent capability in predicting patient outcomes, which was similar to our study. In addition, these two features were also found to be capable of predicting PFS in both nasopharyngeal carcinoma (NPC) and NSCLC (29, 30). Meanwhile, texture features dominate in MRI-based NPC radiomic studies (52, 53). These observations suggested that the feature class selected for PFS prediction could be influenced by the choice of imaging modality. This may be explained by the inherently lower soft-tissue contrast characteristic in CT than in MRI, resulting in less prominent texture features that may be relevant to the PFS. Texture features in CT and MRI have been regarded as the manifestation of tumour heterogeneity. Hence, the dominant feature class may vary according to different clinical endpoints. For instance, texture features are often selected in both CT and MRI prostate radiomic studies when GS is chosen as the clinical endpoint (25, 42). It is not surprising since GS is the gold standard for characterising prostate heterogeneity, while texture features measure the same physical property. The current study demonstrated that both shape (unfiltered-shape-flatness) and first-order features (LoG-1mm-filtered RMS) have similar predictive performance compared to MRI radiomic studies (23, 24, 50). Nonetheless, a further study involving a larger external validation cohort is needed to validate the performance of these two feature classes in predicting the PFS of high-risk PCa patients.
The C model constructed based on clinical factors had the poorest performance. Univariate analysis did not show any statistical significance of these clinical features between PFS-1 and PFS-0 patients in both cohorts. The most probable explanation is that these clinical features are homogeneous within the high-risk subgroup of PCa patients (32). Thus, using the C model alone would lead to the poorest prediction performance. Statistically, the C model also failed to differentiate the 5-year PFS status in both cohorts. These findings are in line with those reported by Fernandes et al. (23) and Bourbonne et al. (26), who have also incorporated PSA and GS in their clinical models. They also retrospectively investigated the clinical predictive model for high-risk PCa patients treated by EBRT (23) and radical prostatectomy during a 5-year follow-up (26). The sensitivity and specificity of the C model were also comparable to those reported by Bourbonne et al. (26) (Sensitivity: 0.53 vs 0.68; Specificity: 0.53 vs. 0.59). These clinical features, however, are not without predictive values when combined with the R model. It is demonstrated by the increase in overall accuracy in both the training and independent validation cohorts after combining R with C models. The potential complementary role between R and C features for prognosis warrants future investigation (54).
Overall, this study explored the potential of combining pCT-derived radiomic and clinical features in the prognostic prediction of high-risk PCa patients receiving PORT. Our study has demonstrated that the classification performance of the combined RC model was comparable to the combined MRI-derived radiomic and clinical models. Among all included features, the shape and first-order features are considered more intuitive than other complex features when interpreting the classifier in predicting 5-year PFS of high-risk patients in the clinical setting (51). Our patient stratification methodology is highly standardised by adopting a temporal validation as in type-2b study according to the TRIPOD guideline (39). Moreover, the use of pCT is preferable due to its higher standardisation, repeatability and calibration of CT over diagnostic MRI (33). For instance, HU in pCT directly quantifies the electron density of the tissue while the pixel value in MR is arbitrarily allocated.
This study has several limitations. First, the sample size is relatively small. This can be attributed to the strict inclusion and exclusion criteria enforced to ensure high-quality radiomics and clinical data. Nonetheless, numerous studies have also contributed insightful findings to the community with a similar sample size as our work (32, 55–57). Second, this work only performed modelling based on data from a single centre as COVID-19 has restricted the research team from conducting data collection from multiple centres. Third, owing to the small sample size and temporal stratification, a class imbalance exists in which only 10% are PFS-1 patients. Although a similar situation was observed in Bosetti’s (2020) work analysing cone-beam CT performance for predicting prostate cancer clinical progression, further study with a larger sample size and proportion of PFS-1 patients would be beneficial to minimise modelling and evaluation bias. Minority class-boosting techniques should also be implemented if appropriate. Fourth, an external testing set was not employed. This would be essential to demonstrate satisfying model generalisability in a multi-centre study before actual clinical application (58). Moreover, Ridge regression, which was a relatively straightforward modelling strategy, was adopted in this work for demonstrating the proposed feasibility of using radiomic-clincial factors to predict patient outcomes. To render the model fit for clinical application, more robust modelling methodologies such as non-linear machine learning techniques and random oversampling should be incorporated when processing multi-centre data sets.
5 Conclusion
This study demonstrated the feasibility and potential of using pCT-derived radiomic and clinical features for predicting 5-year PFS in high-risk PCa patients receiving PORT, which is an important clinical research gap that previously lacks investigation The RC combined model provided statistically superior predictive performance than both R and C models in the independent temporal validation cohort. These findings lay the ground for the future development of a combined radiomic-clinical model involving robust modelling techniques, multicentre data and external validation. We hope that this work will bring attention to the academic community and encourage future work to address this on a larger scale towards clinical implementation. Ultimately, it could potentially act as a supportive decision tool predicting the outcome of different treatment regimens to facilitate personalised management of high-risk PCa patients.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by Human Subjects Ethics Sub-committee of The Hong Kong Polytechnic University and Kowloon West Cluster Research Ethics Committee of the Hong Kong Hospital Authority. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.
Author contributions
JCC, SL, WSL and SWL contributed to the study design, methodology development, results interpretation, and manuscript review. WL offered administrative and material support for data collection. JCC, AOL, and JCK collected patients’ clinical and imaging data. JCC, SL and AOL contributed to image pre-processing and feature extraction. JCC, SL, CCL, AOL and JCK constructed the models. JCC wrote the manuscript. WSL co-supervised the study. SWL supervised the study. All authors contributed to the article and approved the submitted version.
Acknowledgments
The authors would like to express their appreciation to Mr. Jiang Zhang and Mr. Alex Nicol for their expert advice on computation modelling. Mr. Jiang Zhang is the developer of the in-house developed pipeline for fast feature extraction. Finally, our team also thanks staff of the Department of Oncology, Princess Margaret Hospital, Hong Kong for their logistical support on data collection.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1060687/full#supplementary-material
References
1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2021) 71:209–49. doi: 10.3322/caac.21660
2. Roy S, Morgan SC. Who dies from prostate cancer? an analysis of the surveillance, epidemiology and end results database. Clin Oncol (R Coll Radiol) (2019) 31:630–36. doi: 10.1016/j.clon.2019.04.012
3. Dirix P, Joniau S, Van den Bergh L, Isebaert S, Oyen R, Deroose CM, et al. The role of elective pelvic radiotherapy in clinically node-negative prostate cancer: a systematic review. Radiother Oncol (2014) 110:45–54. doi: 10.1016/j.radonc.2013.06.046
4. Muteganya R, Goldman S, Aoun F, Roumeguère T, Albisinni S. Current imaging techniques for lymph node staging in prostate cancer: a review. Front Surg (2018) 5:74. doi: 10.3389/fsurg.2018.00074
5. Roach M, Moughan J, Lawton CAF, Dicker AP, Zeitzer KL, Gore EM, et al. Sequence of hormonal therapy and radiotherapy field size in unfavourable, localised prostate cancer (NRG/RTOG 9413): long-term results of a randomised, phase 3 trial. Lancet Oncol (2018) 19:1504–15. doi: 10.1016/S1470-2045(18)30528-X
6. Murthy V, Maitre P, Kannan S, Panigrahi G, Krishnatry R, Bakshi G, et al. Prostate-only versus whole-pelvic radiation therapy in high-risk and very high-risk prostate cancer (POP-RT): outcomes from phase III randomized controlled trial. J Clin Oncol (2021) 39:1234–42. doi: 10.1200/JCO.20.03282
7. Amini A, Jones BL, Yeh N, Rusthoven CG, Armstrong H, Kavanagh BD. Survival outcomes of whole-pelvic versus prostate-only radiation therapy for high-risk prostate cancer patients with use of the national cancer data base. Int J Radiat Oncol Biol Phys (2015) 93:1052–63. doi: 10.1016/j.ijrobp.2015.09.006
8. Pommier P, Chabaud S, Lagrange J-L, Richaud P, Le Prise E, Wagner J-P, et al. Is there a role for pelvic irradiation in localized prostate adenocarcinoma? update of the long-term survival results of the GETUG-01 randomized study. Int J Radiat Oncol Biol Phys (2016) 96:759–69. doi: 10.1016/j.ijrobp.2016.06.2455
9. Aizer AA, Yu JB, McKeon AM, Decker RH, Colberg JW, Peschel RE. Whole pelvic radiotherapy versus prostate only radiotherapy in the management of locally advanced or aggressive prostate adenocarcinoma. Int J Radiat Oncol Biol Phys (2009) 75:1344–9. doi: 10.1016/j.ijrobp.2008.12.082
10. Kim MM, Hoffman KE, Levy LB, Frank SJ, Pugh TJ, Choi S, et al. Improvement in prostate cancer survival over time: a 20-year analysis. Cancer J (2012) 18:1–8. doi: 10.1097/ppo.0b013e3182467419
11. Wang S, Tang W, Luo H, Jin F, Wang Y. Efficacy and toxicity of whole pelvic radiotherapy versus prostate-only radiotherapy in localized prostate cancer: a systematic review and meta-analysis. Front Oncol (2021) 11:796907. doi: 10.3389/fonc.2021.796907
12. Prostate cancer: diagnosis and management NICE guideline. org.uk (2021). Available at: https://www.nice.org.uk/guidance/ng131/resources/%20prostate-cancer-%20diagnosis-and-management-pdf-66141714312133 (Accessed March 18, 2021).
13. Chan MFT, Lam QYN, Lee VHF, Ho PPY, Sze HCK, Siu SWK, et al. Using the roach formula to stratify patients with localised prostate cancer treated with intensity-modulated radiotherapy. Hong Kong J Radiol (2017) 20:103–9. doi: 10.12809/hkjr1715378
14. Joniau S, Briganti A, Gontero P, Gandaglia G, Tosco L, Fieuws S, et al. Stratification of high-risk prostate cancer into prognostic categories: a European multi-institutional study. Eur Urol (2015) 67:157–64. doi: 10.1016/j.eururo.2014.01.020
15. Rodrigues G, Warde P, Pickles T, Crook J, Brundage M, Souhami L, et al. Pre-treatment risk stratification of prostate cancer patients: a critical review. Can Urol Assoc J (2012) 6:121–7. doi: 10.5489/cuaj.11085
16. Gosselaar C, Kranse R, Roobol MJ, Roemeling S, Schröder FH. The interobserver variability of digital rectal examination in a large randomized trial for the screening of prostate cancer. Prostate (2008) 68:985–93. doi: 10.1002/pros.20759
17. Koulikov D, Mamber A, Fridmans A, Abu Arafeh W, Shenfeld OZ. Why I cannot find the prostate? behind the subjectivity of rectal exam. ISRN Urol (2012) 2012:456821. doi: 10.5402/2012/456821
18. Morikawa LK, Roach M 3rd. Pelvic nodal radiotherapy in patients with unfavorable intermediate and high-risk prostate cancer: evidence, rationale, and future directions. Int J Radiat Oncol Biol Phys (2011) 80:6–16. doi: 10.1016/j.ijrobp.2010.11.074
19. Rahman S, Cosmatos H, Dave G, Williams S, Tome M. Predicting pelvic lymph node involvement in current-era prostate cancer. Int J Radiat Oncol Biol Phys (2012) 82:906–10. doi: 10.1016/j.ijrobp.2010.11.053
20. Nguyen PL, Chen MH, Hoffman KE, Katz MS, D’Amico AV. Predicting the risk of pelvic node involvement among men with prostate cancer in the contemporary era. Int J Radiat Oncol Biol Phys (2009) 74:104–9. doi: 10.1016/j.ijrobp.2008.07.053
21. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141
22. Gnep K, Fargeas A, Gutiérrez-Carvajal RE, Commandeur F, Mathieu R, Ospina JD, et al. Haralick textural features onT2-weighted MRI are associated with biochemical recurrence following radiotherapy for peripheral zone prostate cancer: impact of MRI in prostate cancer. J Magn Reson Imaging (2017) 45:103–17. doi: 10.1002/jmri.25335
23. Dinis Fernandes C, Dinh CV, Walraven I, Heijmink SW, Smolic M, van Griethuysen JJM, et al. Biochemical recurrence prediction after radiotherapy for prostate cancer with T2w magnetic resonance imaging radiomic features. Phys Imaging Radiat Oncol (2018) 7:9–15. doi: 10.1016/j.phro.2018.06.005
24. Shiradkar R, Ghose S, Jambor I, Taimen P, Ettala O, Purysko AS, et al. Radiomic features from pretreatment biparametric MRI predict prostate cancer biochemical recurrence: preliminary findings. J Magn Reson Imaging (2018) 48:1626–36. doi: 10.1002/jmri.26178
25. Toivonen J, Montoya Perez I, Movahedi P, Merisaari H, Pesola M, Taimen P, et al. Radiomics and machine learning of multisequence multiparametric prostate MRI: towards improved non-invasive prostate cancer characterization. PloS One (2019) 14:e0217702. doi: 10.1371/journal.pone.0217702
26. Bourbonne V, Fournier G, Vallières M, Lucia F, Doucet L, Tissot V, et al. External validation of an MRI-derived radiomics model to predict biochemical recurrence after surgery for high-risk prostate cancer. Cancers (Basel) (2020) 12:814. doi: 10.3390/cancers12040814
27. Khodabakhshi Z, Amini M, Mostafaei S, Haddadi Avval A, Nazari M, Oveisi M, et al. Overall survival prediction in renal cell carcinoma patients using computed tomography radiomic and clinical information. J Digit Imaging (2021) 34:1086–98. doi: 10.1007/s10278-021-00500-y
28. Pérez-Morales J, Tunali I, Stringfield O, Eschrich SA, Balagurunathan Y, Gillies RJ, et al. Peritumoral and intratumoral radiomic features predict survival outcomes among patients diagnosed in lung cancer screening. Sci Rep (2020) 10:10528. doi: 10.1038/s41598-020-67378-8
29. Intarak S, Chongpison Y, Vimolnoch M, Oonsiri S, Kitpanit S, Prayongrat A, et al. Tumor prognostic prediction of nasopharyngeal carcinoma using CT-based radiomics in non-Chinese patients. Front Oncol (2022) 12:775248. doi: 10.3389/fonc.2022.775248
30. Forouzannezhad P, Maes D, Hippe DS, Thammasorn P, Iranzad R, Han J, et al. Multitask learning radiomics on longitudinal imaging to predict survival outcomes following risk-adaptive chemoradiation for non-small cell lung cancer. Cancers (Basel) (2022) 14:1228. doi: 10.3390/cancers14051228
31. Franzese C, Cozzi L, Badalamenti M, Baldaccini D, D’Agostino G, Fogliata A, et al. Radiomics-based prognosis classification for high-risk prostate cancer treated with radiotherapy. Strahlenther Onkol (2022) 198:710–8. doi: 10.1007/s00066-021-01886-y
32. Ghezzo S, Bezzi C, Presotto L, Mapelli P, Bettinardi V, Savi A, et al. State of the art of radiomic analysis in the clinical management of prostate cancer: a systematic review. Crit Rev Oncol Hematol (2022) 169:103544. doi: 10.1016/j.critrevonc.2021.103544
33. Delgadillo R, Ford JC, Abramowitz MC, Dal Pra A, Pollack A, Stoyanova R. The role of radiomics in prostate cancer radiotherapy. Strahlenther Onkol (2020) 196:900–12. doi: 10.1007/s00066-020-01679-9
34. Halabi S, Vogelzang NJ, Ou SS, Owzar K, Archer L, Small EJ. Progression-free survival as a predictor of overall survival in men with castrate-resistant prostate cancer. J Clin Oncol (2009) 27:2766–71. doi: 10.1200/JCO.2008.18.9159
35. Roach M 3rd, Hanks G, Thames H Jr, Schellhammer P, Shipley WU, Sokol GH, et al. Defining biochemical failure following radiotherapy with or without hormonal therapy in men with clinically localized prostate cancer: recommendations of the RTOG-ASTRO phoenix consensus conference. Int J Radiat Oncol Biol Phys (2006) 65:965–74. doi: 10.1016/j.ijrobp.2006.04.029
36. He L, Huang Y, Yan L, Zheng J, Liang C, Liu Z. Radiomics-based predictive risk score: a scoring system for preoperatively predicting risk of lymph node metastasis in patients with resectable non-small cell lung cancer. Chin J Cancer Res (2019) 31:641–52. doi: 10.21147/j.issn.1000-9604.2019.04.08
37. Feng XL, Wang SZ, Chen HH, Huang YX, Xin YK, Zhang T, et al. Optimizing the radiomics-machine-learning model based on non-contrast enhanced CT for the simplified risk categorization of thymic epithelial tumors: a large cohort retrospective study. Lung Cancer (2022) 166:150–60. doi: 10.1016/j.lungcan.2022.03.007
38. Dikaios N, Alkalbani J, Sidhu HS, Fujiwara T, Abd-Alazeez M, Kirkham A, et al. Logistic regression model for diagnosis of transition zone prostate cancer on multi-parametric MRI. Eur Radiol (2015) 25:523–32. doi: 10.1007/s00330-014-3386-4
39. Collins GS, Reitsma JB, Altman DG, Moons KGM. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Eur Urol (2015) 67:1142–51. doi: 10.1016/j.eururo.2014.11.025
40. Harris VA, Staffurth J, Naismith O, Esmail A, Gulliford S, Khoo V, et al. Consensus guidelines and contouring atlas for pelvic node delineation in prostate and pelvic node intensity modulated radiation therapy. Int J Radiat Oncol Biol Phys (2015) 92(4):874–83. doi: 10.1016/j.ijrobp.2015.03.021
41. Zwanenburg A. EP-1677: multicentre initiative for standardisation of image biomarkers. Radiother Oncol (2017) 123:S914–5. doi: 10.1016/s0167-8140(17)32209-0
42. Osman SOS, Leijenaar RTH, Cole AJ, Lyons CA, Hounsell AR, Prise KM, et al. Computed tomography-based radiomics for risk stratification in prostate cancer. Int J Radiat Oncol Biol Phys (2019) 105:448–56. doi: 10.1016/j.ijrobp.2019.06.2504
43. Tanadini-Lang S, Bogowicz M, Veit-Haibach P, Huellner M, Pauli C, Shukla V, et al. Exploratory Radiomics in Computed Tomography Perfusion of Prostate Cancer. Anticancer Res (2017) 38(2):685–90. doi: 10.21873/anticanres.12273
44. Sun J, Liu K, Tong H, Liu H, Li X, Luo Y, et al. CT texture analysis for differentiating bronchiolar adenoma, adenocarcinoma in situ, and minimally invasive adenocarcinoma of the lung. Front Oncol (2021) 26:634564(11). doi: 10.3389/fonc.2021.634564
45. Jiang X, Jia H, Zhang Z, Wei C, Wang C, Dong J. The feasibility of combining ADC value with texture analysis of T2WI, DWI and CE-T1WI to preoperatively predict the expression levels of ki-67 and p53 of endometrial carcinoma. Front Oncol (2021) 11:805545. doi: 10.3389/fonc.2021.805545
46. Chen H, Deng X, Pan Y, Chen J, Liu Y, Chen W, et al. Pancreatic serous cystic neoplasms and mucinous cystic neoplasms: differential diagnosis by combining imaging features and enhanced CT texture analysis. Front Oncol (2021) 11:745001. doi: 10.3389/fonc.2021.745001
47. Youden WJ. Index for rating diagnostic tests. Cancer (1950) 3:32–5. doi: 10.1002/1097-0142(1950)3:1<32::aid-cncr2820030106>3.0.co;2-3
48. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res (2017) 77:e104–7. doi: 10.1158/0008-5472.can-17-0339
49. Yan M, Wang W. Radiomic analysis of CT predicts tumor response in human lung cancer with radiotherapy. J Digit Imaging (2020) 33:1401–3. doi: 10.1007/s10278-020-00385-3
50. Zhong QZ, Long LH, Liu A, Li CM, Xiu X, Hou XY, et al. Radiomics of multiparametric MRI to predict biochemical recurrence of localized prostate cancer after radiation therapy. Front Oncol (2020) 10:731. doi: 10.3389/fonc.2020.00731
51. Bosetti DG, Ruinelli L, Piliero MA, van der Gaag LC, Pesce GA, Valli M, et al. Cone-beam computed tomography-based radiomics in prostate cancer: a mono-institutional study. Strahlenther Onkol (2020) 196:943–51. doi: 10.1007/s00066-020-01677-x
52. Shen H, Wang Y, Liu D, Lv R, Huang Y, Peng C, et al. Predicting progression-free survival using MRI-based radiomics for patients with nonmetastatic nasopharyngeal carcinoma. Front Oncol (2020) 10:618. doi: 10.3389/fonc.2020.00618
53. Zhang B, Ouyang F, Gu D, Dong Y, Zhang L, Mo X, et al. Advanced nasopharyngeal carcinoma: pre-treatment prediction of progression based on multi-parametric MRI radiomics. Oncotarget (2017) 8:72457–65. doi: 10.18632/oncotarget.19799
54. Kittler J, Hatef M, Duin RPW, Matas J. On combining classifiers. IEEE Trans Pattern Anal Mach Intell (1998) 20:226–39. doi: 10.1109/34.667881
55. Zhu C, Huang H, Liu X, Chen H, Jiang H, Liao C, et al. A clinical-radiomics nomogram based on computed tomography for predicting risk of local recurrence after radiotherapy in nasopharyngeal carcinoma. Front Oncol (2021) 11:637687. doi: 10.3389/fonc.2021.637687
56. Lam SK, Zhang J, Zhang YP, Li B, Ni RY, Zhou T, et al. A multi-center study of CT-based neck nodal radiomics for predicting an adaptive radiotherapy trigger of ill-fitted thermoplastic masks in patients with nasopharyngeal carcinoma. Life (2022) 12(2):241. doi: 10.3390/life12020241
57. Yang L, Chu W, Li M, Xu P, Wang M, Peng M, et al. Radiomics in gastric cancer: first clinical investigation to predict lymph vascular invasion and survival outcome using 18 f-FDG PET/CT images. Front Oncol (2022) 12:836098. doi: 10.3389/fonc.2022.836098
Keywords: radiomic, high-risk, prostate cancer, prognosis, progression-free survival (PFS), radiation therapy, prostate-only radiotherapy, radiomic-clinical model
Citation: Ching JCF, Lam S, Lam CCH, Lui AOY, Kwong JCK, Lo AYH, Chan JWH, Cai J, Leung WS and Lee SWY (2023) Integrating CT-based radiomic model with clinical features improves long-term prognostication in high-risk prostate cancer. Front. Oncol. 13:1060687. doi: 10.3389/fonc.2023.1060687
Received: 03 October 2022; Accepted: 10 April 2023;
Published: 27 April 2023.
Edited by:
Sharon R. Pine, University of Colorado Anschutz Medical Campus, United StatesReviewed by:
Yankun Gao, First Affiliated Hospital of Anhui Medical University, ChinaVarut Vardhanabhuti, The University of Hong Kong, Hong Kong SAR, China
Copyright © 2023 Ching, Lam, Lam, Lui, Kwong, Lo, Chan, Cai, Leung and Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Shara W. Y. Lee, shara.lee@polyu.edu.hk; W. S. Leung, wsv.leung@polyu.edu.hk