- 1Department of Radiology, The Netherlands Cancer Institute, Amsterdam, Netherlands
- 2Department of Radiation Biology, Oslo University Hospital, Oslo, Norway
- 3Department of Radiology, Medical University of Warsaw, Warsaw, Poland
- 4Department of Urology, The Netherlands Cancer Institute, Amsterdam, Netherlands
- 5Department of Urology, Amsterdam University Medical Centers, Amsterdam, Netherlands
- 6Department of Pathology, The Netherlands Cancer Institute, Amsterdam, Netherlands
- 7Department of Physics, University of Oslo, Oslo, Norway
- 8Department of Radiation Oncology, The Netherlands Cancer Institute, Amsterdam, Netherlands
- 9Department of Radiology & Nuclear Medicine, Erasmus University Medical Center, Rotterdam, Netherlands
Introduction: Prostate cancer hypoxia is a negative prognostic biomarker. A promising MRI-based tool to assess hypoxia is the ‘Consumption and Supply based Hypoxia’ (CSH) model based on diffusion-weighted imaging (DWI). The aim of the study was to validate the association between the CSH hypoxia fraction (HFDWI) with pathological Grade Group (pGG) and pathological T-staging (pTstage) in an external prostatectomy cohort.
Methods: Apparent diffusion coefficient (ADC) and fractional blood volume (fBV) maps were assessed from DWI data from 291 prostatectomies and combined by the CSH model. HFDWI was calculated for each lesion after median scaling of ADC and fBV to address differences in acquisition and analysis between centers. The absolute HFDWI values and the associations of HFDWI between pGG < 3 versus ≥ 3, and pTstage = 2 versus = 3 in the Netherlands Cancer Institute (NKI) cohort were compared to the obtained by original cohort (Oslo cohort). Statistical T- and Mann-Whitney tests (p<0.05) were performed. Pearson correlation was determined between HFDWI and individual pGG groups.
Results: The HFDWI showed comparable absolute values and similar metric performance as in the original published cohort. Higher HFDWI values were observed for higher pGG (Oslo: 0.27; NKI: 0.24) compared to lower pGG (Oslo: 0.11; NKI: 0.17). Similar results were obtained for pTstage. Furthermore, HFDWI demonstrated a significant positive correlation with pGG groups 1-5 (ρ = 0.41, p<0.001).
Conclusion: The CSH model exhibited sufficient robustness in the external cohort, suggesting a plausible reflection of true hypoxia and enabling the use of the HFDWI metric for further research into prostate cancer and hypoxia.
1 Introduction
Hypoxia in prostate cancer (PCa) has been related to radiation treatment resistance and metastatic disease (1–3). Thus, hypoxia assessment at diagnosis is of great interest for patient stratification and treatment decisions. Given that MRI is the main imaging modality in PCa patients and widely available in modern hospitals, the consideration of MRI biomarkers emerges as an attractive tool for treatment personalization. A promising approach, using diffusion-weighted images (DWI) related to oxygen consumption and supply, was developed by Hompland et al. (4). This model, known as Consumption and Supply based Hypoxia (CSH) imaging, relies on apparent diffusion coefficient (ADC) and fractional blood volume (fBV) maps and its application to PCa patients was based on the underlying assumption that the estimated ADC is linked to oxygen consumption while fBV is linked to oxygen supply.
The CSH model was trained using DWI data of PCa patients who received a hypoxia marker (pimonidazole) administration prior to prostatectomy as a ground truth to find the optimal combination of ADC and fBV representing hypoxia. In a separate test cohort, the hypoxia fraction (HFDWI) of the index lesion showed a robust correlation to hypoxia estimated from pimonidazole staining of the surgical specimen. Consequently, the CSH model appears as a promising non-invasive tool for hypoxia assessment, offering potential for personalized treatment decisions. Indeed this approach appeared to be quite successful for correlating the HFDWI to pimonidazole-derived hypoxia in cancers such as breast and cervix (4–6).
To apply such a model more widely in clinical practice, external validation is necessary. A first hurdle is the potential variability in quantitative MRI parameters between scanners and centers (7, 8). Here we propose a calibration method by scaling the quantitative MRI parameters in matched cohorts of patients.
A second hurdle for validation of HFDWI in relation to true hypoxia in prostate cancer is that it relies on the availability of pimonidazole staining. In the absence of pimonidazole stained specimen, we can make use of the established associations between hypoxia and pathological Grade Group (pGG), as well as Tstage (2, 4). Therefore, in this study we aim to compare the association between HFDWI and pGG and pTstage in a cohort of patients who received a prostatectomy at the Netherlands Cancer Institute (NKI) and compare these associations with those originally obtained by Hompland et al (4).
2 Materials and methods
2.1 Cohort description
The original dataset from Oslo University Hospital consisted of 106 patients enrolled into the FuncProst study (NTC01464216) (4). For the external dataset, men with biopsy-proven prostate cancer and pre-operative MRI, who underwent radical prostatectomy between January 2010 and December 2020, were retrospectively included after approval of the institutional review board (IRBd21-108) at the NKI. Exclusion criteria were men with prior transurethral resection of the prostate, incomplete or technically poor-quality MRI or incomplete pathological specimens. A total of 291 patients were subjected to further analysis.
2.2 MRI acquisition and data analysis of the NKI cohort
MRI data in the NKI cohort were acquired using mostly a 3T scanner (Achieva [n=164], Achieva dStream [n=103] and Ingenia [n=21], Philips Healthcare, Best, the Netherlands). The MRI exam consisted of T2-weighted (T2w), DWI and a separate high b-value DWI acquisition (b= 1400 or 2000 s/mm2). Axial T2-weighted (T2w) turbo spin-echo images were acquired (Repetition time = [2175 – 10233 ms], Echo time = [110 – 130 ms]) with a field-of-view from 140 mm x 140 mm to 284 mm x 284 mm and slice thickness of 2.5 to 4 mm. DWI data were acquired with single-shot echo planar imaging sequences with b-values of 0, 200, 800 or 0,50,300,800 s/mm2. A detailed comparison of DWI acquisition parameters between centers can be found in Supplementary Table S1.
ADC and fBV maps were calculated using an intravoxel incoherent motion model with a segmented fit approach (9) 1. Details of this analysis can be found in Supplementary Analysis.
Tumor delineations on MR images (T2w, ADC and high b-value DWI scans) were performed by two observers (M.D.C. and M.F.S., both 1-2 years’ reading experience) blinded to the pathological ground truth. If required, consensus was reached after discussion with an experienced radiologist (I.G.S. 13 years of experience). Histological evaluation was performed by a pathologist (M.A.S.G., 10 years of experience) using SlideScore software [https://www.slidescore.com/]. Lesions larger than 3 mm were graded according to the 2019 International Society of Urological Pathology recommendations (10). Staging was performed according the TNM classification guidelines (11). For patients with multiple lesions, only the lesion with the highest pGG was used.
2.3 CSH model and calculation of hypoxia fraction
Previously, Hompland et al (4) showed that the discrimination of hypoxic and non-hypoxic regions could be approximated to a linear curve expressed as Equation (1)
where ADC0 and fBV0 are the intersections with the ADC and fBV axes on a pixel-level plot (Figure 1). ADC0 = 0.79 x 10-3 mm2/s and fBV0 = 0.43 a.u. were the intercepts for which HFDWI had the highest correlation with the hypoxia score from pimonidazole staining (HSpimo) in the Oslo cohort.
Figure 1 Example of the CSH model application on a lesion (red contour) of the NKI cohort. HL, hypoxia level (pixel-level); HFDWI, hypoxia fraction (lesion-level).
As differences in acquisition and analysis protocols between the two centers were present (Supplementary Table S1), specifically in the numbers of b-values used, median scaling of the ADC and fBV distributions was applied. However, to make sure that differences in clinical characteristics were not also affecting these variations, the cohorts were first matched based on clinical characteristics. For this purpose, propensity score matching was used and 106 patients from the NKI cohort were selected to clinically match the 106 patients dataset from the original institution (using pGG and pTstage as covariates). The decision to prioritize matching based on pGG and pTstage was driven by their known association with hypoxia (2, 4). When multiple lesions were present, the patient’s pGG corresponded to the lesion with the highest pGG. The R library MatchIt package (12) was used with optimal matching method, logistic regression distances and ‘Average Treatment Effect on the Control’ estimand (ATC), where only NKI patients are allowed to be dropped during the matching process, while maintaining Oslo number of patients fixed. This produced the best match based on standard mean differences.
For median scaling, the ADC and fBV voxel intensities distributions from all lesions of the matched NKI cohort were scaled towards the distributions of the Oslo dataset by multiplying them with the calculated scaled factors [FADC and FfBV, Equation (2)].
Once the scaling was performed for the clinical matched set of patients, the HFDWI metric was computed for each lesion using the ADC0 and fBV0 intercepts (Figure 1).
2.4 Hypoxia fraction validation and application
The absolute HFDWI values were investigated by comparing the distributions within each pGG (low: pGG < 3 and high: pGG ≥ 3) and each pTstage group between centers. This comparison aimed to validate the consistency of HFDWI values between the two centers for each specific clinical subgroup.
Moreover, HFDWI differences were examined between patients with low versus high pGG, and pTstage = 2 versus pTstage = 3 in the matched NKI cohort. These differences were then compared to the corresponding differences observed in the Oslo cohort. T-tests or Mann-Whitney tests (p< 0.05) were used for these comparisons, depending on normality and homogeneity (i.e. with equal variances) assessed by the Shapiro-Wilk test and Levene’s test, respectively.
Lastly, the CSH model was applied to the full NKI cohort (n=319 lesions from 291 patients) to investigate the HFDWI relation and correlation across individual pGG groups using a Kruskal-Wallis (p< 0.05) and the Pearson correlation coefficient tests (ρ, p< 0.05). All analyses were performed using Python v3.7.
3 Results
The Oslo cohort was well balanced in terms of low and high pGG while overall dominated by pTstage 3 patients. In contrast, the full NKI cohort was dominated by low pGG and pTstage 2 patients. After propensity score matching, the matched NKI cohort had similar patient characteristics as the Oslo cohort (Table 1).
Table 1 Patient demographics and tumor characteristics for the original cohort (Oslo), the matched external cohort (NKI, pGG of the index lesion) and the full external cohort (Full NKI, with pGG information for all 319 lesions in 291 patients).
With similar patient characteristics between the two cohorts we expect differences in ADC and fBV values to mostly reflect variations in the imaging acquisitions and MRI apparatus.
Prior to scaling, the matched NKI dataset showed slightly higher ADC values (median (IQR) was 0.84 (0.41) x10-3 mm/s2) compared to the Oslo cohort (0.70 (0.26) x10-3 mm/s2) and slightly lower fBV values (NKI: 0.08 (0.15) a.u. vs Oslo: 0.12 (0.11) a.u.) (Supplementary Figure S1). The obtained scaling factors FADC and FfBV were 0.83 and 1.54, respectively.
No statistically significant differences were observed when comparing the matched NKI distributions of absolute HFDWI values to the Oslo cohort for each specific pGG and pTstage subgroup (Figure 2, p(pGG<3) = 0.3, p(pGG>=3) = 0.8, p(pTstage=2) = 0.1, p(pTstage=3) = 0.4). HFDWI median values for Oslo were 0.11 vs 0.27 for low and high pGG, and 0.08 vs 0.26 for pTstage = 2 and pTstage = 3. Median values for NKI were 0.17 vs 0.24 and 0.17 vs 0.22 for pGG and pTstage groups, respectively. The association between HFDWI and pGG and pTstage was consistent between the Oslo and the matched NKI dataset (Figure 3): both showed that higher HFDWI values were associated with higher pGG (Oslo p<0.001, NKI p=0.004) and pTstage groups (Oslo p<0.001, NKI p=0.03). Furthermore, significant differences were observed among all pGG groups in the full NKI cohort (Figure 4, p< 0.001), in addition to a significant positive correlation (ρ = 0.41, p< 0.001).
Figure 2 HFDWI comparison between the Oslo (n=106) and NKI (n=106) datasets for specific pGG and pTstage sub-groups.
Figure 3 HFDWI distributions for pGG and pTstage groups for both Oslo and NKI datasets. Diamond symbols represent outliers of the distributions.
Figure 4 CSH model applied to the full NKI cohort (319 lesions from 291 patients) showing the correlation between hypoxia fraction with pGG individually. Kruskal-Wallis p< 0.001 indicates significant differences among groups.
4 Discussion
The aim of this study was to independently validate the CSH model in prostate cancer by evaluating the HFDWI association with pGG and pTstage in an independent external prostatectomy cohort.
After clinically matching and scaling, the HFDWI in the NKI dataset exhibited similar associations with the pGG and pTstage groups as observed in the Oslo dataset. In agreement with Hompland et al. (4), a patient with a high HFDWI in the NKI cohort was likely to exhibit more aggressive characteristics, such as a higher pTstage or pGG, in comparison to a patient with a lower HFDWI. Furthermore, the HFDWI absolute values obtained in the NKI dataset were comparable to those of the Oslo dataset for both pGG and pTstage subgroups. While pimonidazole staining was unavailable in the NKI cohort, this similarity suggests that HFdwi may also indicate hypoxia in the NKI cohort.
Prior to scaling, the ADC and fBV distributions of the Oslo and NKI datasets were not very different (Supplementary Figure S1). This is an interesting observation as technical differences (including variations in vendor, sequence protocols, and data analysis methods) were present between the two cohorts. Nonetheless, for future applications of the CSH model in other datasets, ADC and fBV distributions need to be carefully compared with the Oslo data for an accurate use of the hypoxia metric. To allow other institutes to apply this method and determine scaling parameters for their cohort, all voxel values of ADC and fBV for the Oslo dataset and NKI matched dataset can be found in Supplementary Data S1, Data S2 respectively. The framework for the CSH model application between different cohorts presented in this study offers a valuable template for transferring other quantitative MRI biomarkers between different cohorts.
As an example, the model was applied to the full available NKI cohort of 291 patients, showing consistent results for individual pGG categories. The positive correlation observed between HFDWI and the individual pGG categories potentially positions the CSH-model as a non-invasive method to identify or classify patients into specific pGG groups.
A limitation of this study is the lack of pimonidazole staining to biologically validate the HFDWI metric in an external cohort. Nevertheless, we showed that correlations between HFDWI and measures of tumor aggressiveness and spread could be replicated in an external dataset.
MRI has the potential to non-invasively assess hypoxia in prostate cancer, shown by the current external validation of the relationship between HFDWI and pGG/pTstage, previously shown by Hompland et al. (4). MRI may thereby be capable in stratifying patients who are at a higher risk of worse clinical outcomes, e.g. disease progression or radiation resistance. Future research should focus on external validations across diverse clinical settings to ensure the robustness and generalizability of the CSH model.
5 Conclusion
The CSH model exhibited sufficient robustness in the external cohort, suggesting a plausible reflection of true hypoxia and enabling the use of the HFDWI metric for further research into PCa and hypoxia.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Netherlands Cancer Institute - IRBd21-108. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because pursuant to national legislation prior to 25 May 2018 (Opt-out) and General hospital informed consent was considered.
Author contributions
MF: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing. TH: Conceptualization, Writing – review & editing, Software. M-DC: Data curation, Writing – review & editing. HV: Writing – review & editing. EB: Writing – review & editing. MG: Data curation, Writing – review & editing. HL: Writing – review & editing. UV: Supervision, Writing – review & editing. IS: Supervision, Writing – review & editing. PV: Conceptualization, Methodology, Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This project has been founded by a KWF grant. Project number: 12975.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1433197/full#supplementary-material
Footnotes
References
1. Milosevic M, Warde P, Ménard C, Chung P, Toi A, Ishkanian A, et al. Tumor hypoxia predicts biochemical failure following radiotherapy for clinically localized prostate cancer. Clin Cancer Res. (2012) 18:2108–14. doi: 10.1158/1078-0432.CCR-11-2711
2. Ragnum HB, Vlatkovic L, Lie AK, Axcrona K, Julin CH, Frikstad KM, et al. The tumour hypoxia marker pimonidazole reflects a transcriptional programme associated with aggressive prostate cancer. Br J Cancer. (2015) 112:382–90. doi: 10.1038/bjc.2014.604
3. Turaka A, Buyyounouski MK, Hanlon AL, Horwitz EM, Greenberg RE, Movsas B. Hypoxic prostate/muscle Po2 ratio predicts for outcome in patients with localized prostate cancer: long-term results. Int J Radiat Oncol Biol Phys. (2012) 82:e433–e9. doi: 10.1016/j.ijrobp.2011.05.037
4. Hompland T, Hole KH, Ragnum HB, Aarnes E-K, Vlatkovic L, Lie AK, et al. Combined MR imaging of oxygen consumption and supply reveals tumor hypoxia and aggressiveness in prostate cancer patients. Cancer Res. (2018) 78:4774–85. doi: 10.1158/0008-5472.CAN-17-3806
5. Hillestad T, Hompland T, Fjeldbo CS, Skingen VE, Salberg UB, Aarnes E-K, et al. MRI distinguishes tumor hypoxia levels of different prognostic and biological significance in cervical cancer. Cancer Res. (2020) 80:3993–4003. doi: 10.1158/0008-5472.CAN-20-0950
6. Mo T, Brandal SHB, Köhn-Luque A, Engebraaten O, Kristensen VN, Fleischer T, et al. Quantification of tumor hypoxia through unsupervised modelling of consumption and supply hypoxia MR imaging in breast cancer. Cancers. (2022) 14:1326. doi: 10.3390/cancers14051326
7. Shukla-Dave A, Obuchowski NA, Chenevert TL, Jambawalikar S, Schwartz LH, Malyarenko D, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE-MRI derived biomarkers in multicenter oncology trials. J Magnet Resonance Imaging. (2019) 49:e101–e21. doi: 10.1002/jmri.26518
8. Van Houdt PJ, Kallehauge JF, Tanderup K, Nout R, Zaletelj M, Tadic T, et al. Phantom-based quality assurance for multicenter quantitative MRI in locally advanced cervical cancer. Radiother Oncol. (2020) 153:114–21. doi: 10.1016/j.radonc.2020.09.013
9. Kooreman ES, Van Houdt PJ, Keesman R, Van Pelt VWJ, Nowee ME, Pos F, et al. Daily intravoxel incoherent motion (IVIM) in prostate cancer patients during MR-guided radiotherapy—A multicenter study. Front Oncol. (2021) 11. doi: 10.3389/fonc.2021.705964
10. van Leenders GJLH, van der Kwast TH, Grignon DJ, Evans AJ, Kristiansen G, Kweldam CF, et al. The 2019 international society of urological pathology (ISUP) consensus conference on grading of prostatic carcinoma. Am J Surg Pathol. (2020) 44:e87–99. doi: 10.1097/PAS.0000000000001497
11. Sobin LH GM, Wittekind C. TNM classification of Malignant tumours. 7th edition. West Sussex, United Kingdom: Wiley Blackwell (2009).
Keywords: hypoxia, prostate cancer, diffusion weighted images, prostatectomy, hypoxia biomarker
Citation: Fernandez Salamanca M, Hompland T, Deręgowska-Cylke M, Van der Poel H, Bekers E, Guimaraes MAS, Lyng H, Van der Heide UA, Schoots IG and Van Houdt PJ (2024) A DWI-based hypoxia model shows robustness in an external prostatectomy cohort. Front. Oncol. 14:1433197. doi: 10.3389/fonc.2024.1433197
Received: 15 May 2024; Accepted: 08 July 2024;
Published: 23 July 2024.
Edited by:
Constantinos Zamboglou, German Oncology Center, CyprusReviewed by:
Radka Stoyanova, University of Miami, United StatesFederico Mastroleo, European Institute of Oncology (IEO), Italy
Copyright © 2024 Fernandez Salamanca, Hompland, Deręgowska-Cylke, Van der Poel, Bekers, Guimaraes, Lyng, Van der Heide, Schoots and Van Houdt. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: P. J. Van Houdt, p.v.houdt@nki.nl