- 1Department of Radiation Oncology, University of Nebraska Medical Center, Omaha, NE, United States
- 2Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, United States
- 3Department of Radiation Oncology, Novant Health Cancer Institute, Winston-Salem, NC, United States
Purpose: The purpose of this study was to investigate the dosiomics features of the interplay between CT density and dose distribution in lung SBRT plans, and to develop a model to predict treatment failure following lung SBRT treatment.
Methods: A retrospective study was conducted involving 179 lung cancer patients treated with SBRT at the University of Nebraska Medical Center (UNMC) between October 2007 and June 2022. Features from the CT image, Biological Effective Dose (BED) and five interaction matrices between CT and BED were extracted using radiomics mathematics. Our in-house feature selection pipeline was utilized to evaluate and rank features based on their importance and redundancy, with only the selected non-redundant features being used for predictive modeling. We randomly selected 151 cases and 28 cases as training and test datasets. Four different models were trained utilizing the Balanced Random Forest framework on the same training dataset to differentiate between failure and non-failure cases. These four models utilized the same number of selected features extracted from CT-only, BED-only, a combination of CT and BED, and a composite of CT and BED including their interaction matrices, respectively.
Results: The cohort included 125 non-failure cases and 54 failure cases, with a median follow-up time of 34.4 months. We selected the top 17 important and non-redundant features (with the Pearsons’s coefficient < 0.5) in each model. When evaluated on the same independent test set, the four models—CT features-only, BED features-only, a combination of CT and BED features, and a composite model including features from CT and BED that includes their interaction matrices—achieved AUC values of 0.56, 0.75, 0.73, and 0.82, respectively, with corresponding accuracies of 0.61, 0.79, 0.71, and 0.79. The composite model demonstrated the highest AUC and accuracy, indicating that incorporating interactions between CT and BED reveals more predictive capabilities in distinguishing between failure and non-failure cases.
Conclusion: The dosiomics model integrating the interaction between CT and Dose can effectively predict treatment failure following lung SBRT treatment and may serve as a useful tool to proactively evaluate and select lung SBRT treatment plans to reduce treatment failure in the future.
Introduction
Despite the decline in the rate of new cases and deaths in recent years (1), lung cancer remains the leading cause of cancer-related deaths in the US, accounting for over 20% of all cancer-related deaths (2). Non-small cell lung cancer (NSCLC) is the predominant type, accounting for more than 80% of all lung cancers (3). With the rapid advances in lung cancer screening (4), NSCLC is increasingly diagnosed at earlier stages (3). Although surgery remains the standard of care for early-stage lung cancer (5), stereotactic body radiation therapy (SBRT) has rapidly been accepted as a standard treatment for patients with early-stage medically inoperable NSCLC (6–8). SBRT allows delivering ablative radiation dose to the target while maintaining tolerable toxicities (9–12) by utilizing modern techniques in motion management (13–16) and imaging guidance technologies (17, 18). Despite achieving an excellent local control rate, lung SBRT still exhibits a relatively high total failure rate, combining both regional and distant failures, at approximately 30% (6, 7). Novel treatment assessment technologies are still warranted to predict potential treatment failures, thereby improving the survival benefits.
Radiomics is a machine learning-based quantitative image analysis technology. Owing to its distinct advantages in providing a comprehensive and quantitative representation of the radiographic phenotype of a 3D target volume (19–25), it has become an active area of research for risk assessment and treatment response prediction in cancer management (26–30). However, radiomics is still limited in comprehensive treatment response assessment, especially in radiation oncology, because it only extracts and analyzes features from the medical images without considering radiation dose. Dose-volume histograms (DVH) are the standard metric in radiation oncology to assess dose coverage to the target and dose exposure to the organs at risk (OARs), assisting in the prediction of treatment outcomes and potential toxicities. However, DVHs do not possess spatial information concerning the dose distribution in a 3D organ or lesion. Limited studies have examined how the subtle difference in the 3D spatial distribution of the radiation dose impacts the treatment response for similar treatments; these types of studies are categorized as dosiomics. First cited in 2018, the term dosiomics refers to studies that attempt to combine radiomics predictive mathematics with 3D spatial dose information (31). These existing dosiomics studies (32), however, only investigated the dose distribution without considering the interplay between the dose and the specific tissue irradiated. Theoretically, the interaction between radiation and the irradiated tissue, as represented by medical images, essentially influences the treatment outcome.
In this study, we designed a novel dosiomics model incorporating the interplay between tissue density and the biologically effective dose to predict potential treatment failure.
Materials and methods
Ethical approval and patient selection
The Institutional Review Board at our institution approved this retrospective study (IRB 722-19-EP). We retrospectively enrolled consecutive NSCLC lung cancer patients treated with stereotactic body radiation therapy (SBRT) at the University of Nebraska Medical Center (UNMC) between October 2007 to June 2022. We exclusively selected cases determined to be the primary NSCLC based on multidisciplinary tumor board discussion about patient clinical history, imaging, and available pathology before radiation treatment. We excluded patients with the following conditions: 1) adjuvant treatment for positive surgical margins, 2) SBRT for metastatic disease, 3) recurrence/oligo-progression of previously treated primary lung tumor, and 4) patients with multiple targets who underwent more than one SBRT treatment simultaneously. In addition, we excluded patients with a follow-up time of less than a year, except those patients who presented with treatment failures between 6 months to a year after the SBRT treatment. Figure 1 summarizes the patient selection process of our study. As summarized in Table 1, the patient cohort included 84 male and 95 female patients. The median age is 73 in the range from 52 to 91. The median follow-up time is 34.4 months, ranging from 7.3 to 178.5 months, with a 10-90 percentile range of 73.5 months. 81 cases are adenocarcinoma, 60 cases are squamous cell carcinoma, 9 cases are NSCLC (not otherwise specified - NOS), and 2 cases are large cell carcinoma. The cohort also included 27 presumed NSCLC cases without tissue confirmation. After the medical record review, 125 cases are considered as non-failure cases, and 54 cases are determined to have treatment failure.
Clinical endpoints
Treatment failure in this study includes local, regional, and distant failures. Local failure refers to the recurrence of cancer within the same lobe as the primary site of disease. Regional failures pertain to regional nodal recurrence, while distant failure involves recurrence in a different lobe or in extrathoracic regions. These definitions of treatment failures are consistent with RTOG0236. Local failures were assessed using PET imaging and biopsy, when deemed safe. If a biopsy was not feasible, we relied on multidisciplinary discussions based on at least two consecutive scans to determine if a case represented a true local failure. Regional failures were similarly confirmed through PET and involved endobronchial ultrasound with biopsy confirmation. Distant failures were validated using PET scans and biopsies, except in cases of brain metastases, which were not always biopsied. We also have closely examined the medical record for the non-failure cases, with standard care being chest CT every 3-4 months during the follow-up time within two years after SBRT. All available CTs right before the death were reviewed in the process.
CT simulation
All the patients underwent a free-breathing computed tomography (CT) scan (FB-CT) and a 4D-CT scan using one of the three Siemens scanners: Sensation Open, Definition, or Confidence (Siemens Medical Solutions USA, Inc., PA, USA). All the scans were acquired using 120kVp with a slice thickness of 2 or 3 mm. The tube current used for the scans were ranging from 30 mA to 381 mA. Additionally, all images were acquired using the smooth convolution kernel equipped on the Siemens CT scanners (B31s or B31f).
Target segmentation and volumes of interest
The Gross Tumor Volume (GTV) and the Planning Target Volume (PTV) were delineated by the attending radiation oncologist based on the maximum intensity projection from the 4D CT scan during the treatment planning. To reduce the inter-observer variability, we created a semi-automatic GTV that encompasses the voxels with the HU value between (-550 to 2000) within the clinical GTV volume. We also defined a peritumoral region that included an outer 2 cm ring outside the PTV and a dose structure (ISO50) for the tissues that received equal to or more than 50% of the prescription dose outside the PTV. Our rationale to create these volumes of interest outside PTV is that peritumoral radiomics signatures have been demonstrated to be valuable in cancer prognosis assessment (27). In addition, the peritumoral region outside the PTV typically experiences a rapid dose fall-off in Lung SBRT, resulting in a notably nonuniform dose distribution across this area. The volumes of interest of an example patient were shown in Figure 2.
Figure 2. Volumes of Interest of our study: (A). Gross Tumor Volume (GTV), (B). Planning Target Volume (PTV), (C). Peritumoral Region, and (D). Normal Tissue receiving greater than 50% of Prescription Dose - ISO50.
Dose calculation
The treatment plans of 133 patients were created on the free breathing (FB) CT whereas the remaining 46 patients were planned on the average intensity projection (average CT) from their respective 4DCT scans. To ensure consistency among the plans in this study, all the plans with the radiotherapy structures that were created on the average CT images were copied to their respective FBCT images, and the radiotherapy plan was recalculated with the same monitor units (MU) without re-optimizing. Among the 179 patients, 3 patients were planned using the collapsed-cone-convolution superposition (CCC) algorithm in the Pinnacle treatment system (TPS) (Philips Medical Systems, Fitchburg,WI). 23 patients were planned in the iPlan treatment planning system (Brainlab AG, Feldkirchen, Germany). The remaining patients were planned the Analytical Anisotropic Algorithm (AAA) in Eclipse treatment planning system (Varian Medical Systems, Palo Alto, CA). For the 23 patients planned in iPlan, the machine was recommissioned in the Eclipse TPS, and the plans were recalculated by the AAA algorithm for improved heterogeneity correction.
Fractionation
The fractionation schemes are summarized in Table 1. 126 patients received 1000cGy x 5 fractions, 25 patients received 1200cGy x 4 fractions, 14 patients received 1250cGy x 4 fractions, 7 patients received 1800cGy x 3 fractions, 4 patients received 1200cGy x 5 fractions, 2 patients received 750cGy x 8 fractions and 1 patient received 700cGy x 10 fractions.
Data preprocessing
The CT volume images, target segmentation, and 3D dose distribution matrix of each patient were exported in DICOM format in Eclipse TPS. We converted all the DICOM files to NRRD format using a batch process in 3D slicer software (33) https://www.slicer.org/), as previously described (34, 35). We then resampled volume CT images, 3D dose distribution matrices, and the target segmentations with a 1 x 1 x 2mm2. Additionally, we performed cropping on all matrices using the segmentation mask encompassing the peritumoral region, along with a margin in all three directions. This process ensured that all matrices of a patient possess identical dimensions and resolution.
Voxelated biological effective dose
We used a linear quadratic (LQ) radiobiological model to generate a 3-D distribution of Biologically Effective Dose (BED) from the 3D physical dose distribution. The voxels within the PTV are considered cancerous and are assigned with an α/β ratio of 10Gy. The voxels outside the PTV are considered healthy tissue, and hence were assigned with an α/β ratio of 3Gy. Depending on the α/β ratio for each voxel, the corresponding BED is calculated using Equation 1, as shown below:
where BEDi,j,k is the BED for the voxel at position (i,j,k), n is the number of fractions, di,j,k is the dose per fraction at voxel (i,j,k), and is the alpha-beta ratio (10 for cancerous tissue or 3 for normal tissue) for the voxel at position (i,j,k) depending on if the voxel is inside or outside the PTV.
Calculation of the voxelated interaction matrices
The BED matrix and the planning CT image matrix exhibit a point-to-point correspondence. Specifically, each voxel within the BED matrix corresponds to a voxel within the CT image matrix with the same coordinates, representing the dose received by the tissue in the corresponding voxel of the CT image matrix. To delve deeper into the potential features concealed within the relationship between the CT image matrix and the BED matrix, we generated a series of interaction matrices to depict the interrelationship between the CT image and the Biological Effective Dose (BED). Specifically, we extracted a pair of three-dimensional patches (3 x 3 x 3) surrounding each voxel in both the CT image matrix and BED matrix. Subsequently, we computed various metrics, including Entropy, Spearman Rank Coefficient, Jensen-Shannon Divergence (JSD), and Wasserstein distance between these two patches (CT and BED). Each of these metric values was then recorded within an empty matrix, preserving the spatial alignment with the original voxel location within the CT image matrix. Figure 3 illustrated the process of calculating the 3-D interaction matrices between CT image and BED. In addition, we also calculated a pair-wise multiplication matrix between the CT image and the BED dose matrix.
Feature extraction
PyRadiomics (24), an open-source package, was used to extract radiomics features. Briefly, we converted the DICOM images and target delineation to NRRD format using a batch process in 3D slicer software (33) https://www.slicer.org/), as previously described (34, 35).
Data split for training and testing
We randomly split the dataset into training and test subsets, comprising 84% (151 cases) and 16% (28 cases) of the total data, respectively. Both datasets were balanced to maintain similar ratios of failure and non-failure cases.
Feature selection
Following the feature extraction, we selected the important and non-redundant features from the features extracted from the CT image, BED and the calculated interaction matrices using our in-house feature selection pipeline that has been previously reported (34, 35). The feature selection pipeline consists of two main steps: quantifying feature importance and identifying non-redundant features. Feature importance is determined through repeated 5-fold cross-validation using various classification algorithms from the training dataset (151 cases), including Random Forest, Extra Trees, AdaBoost, and XGBoost. The final importance score for each feature is the average of the scores from all algorithms. Pearson’s correlation coefficient was then calculated among these features in descending order to identify non-redundant features, with the top-ranked feature automatically considered non-redundant. For each subsequent feature, if the Pearson coefficient with any previously identified non-redundant feature exceeds 0.5, it is deemed redundant and discarded.
Classification predictive modeling
Four different models were trained utilizing the Balanced Random Forest (BRF) framework on the same training dataset (151 cases) to differentiate between failure and non-failure cases. These models utilized various combinations of features: CT-only, BED-only, a combination of CT and BED, and a composite of CT and BED including their interaction matrices, respectively. We utilized a grid-search approach to optimize the hyperparameters of the BRF classifier. We explored a parameter grid consisting of ‘‘max_depth’’, ‘‘n_estimator’’, ‘min_samples_split’, ‘‘min_samples_leaf’’, ‘‘max_feature’’, ‘‘bootstrap’’ and ‘‘criterion’’ with specific value ranges. The parameters are summarized in Table 2. We created a BRF classifier with the defined hyperparameters and utilized the GridSearchCV to perform the grid search by applying the 5-fold cross-validation on the training set. We constructed the BRF classifier model with the best hyperparameters identified in the grid search and tested the model performance on the test dataset.
Table 2. Summary of the search ranges for each parameter during hyperparameter tuning of the Balanced Random Forest (BRF) classification model.
Model performance evaluation
The performance of the models was quantified by the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, and specificity. Besides these metrics, we implemented Delong test (36) to compare the AUCs from different models. We also conducted Decision Curve Analysis (DCA) to evaluate the clinical utility of our predictive models. DCA assesses the net benefit of the model by considering true positives and false positives across a range of threshold probabilities, reflecting the trade-off between interventions and their potential outcomes. We calculated net benefit using standardized formulas, comparing the model’s performance to both default strategies—assuming all patients would receive intervention and assuming none would in Python (37). DCA plots were generated to visually represent the model’s clinical impact, enabling an assessment of its value in guiding patient decision-making. Additionally, we plotted the calibration curve and calculated the Brie score for each model. Briefly, calibration curves were computed using scikit-learn package in Python, providing mean predicted probabilities and corresponding fractions of positive outcomes. Brier score losses, from scikit-learn, were also calculated to quantify prediction accuracy. We visualized the calibration curves for all models on a single plot.
Competing risk time-to-event analysis
The Fine-Gray model (38) was used to predict the cumulative incidence of having treatment failure using the features identified by the four considered models, including CT-only, BED-only, a combination of CT and BED, and a composite of CT and BED including their interaction matrices, separately. The concordance index (c-index) was estimated to assess the models’ ability to separate individuals with treatment failure earlier than others. The c-index value ranges between 0 and 1, and a value of 0.5 suggests no discriminating ability, and a value of 1 suggests perfect discriminating ability. The Brier score was evaluated to assess both calibration (i.e. how close the prediction is to the true underlying risk of event) and discrimination. A smaller Brier score indicates a better model. The integrated Brier score (IBS) was calculated to integrate the Brier score values obtained at all follow up times to assess the overall performance of the considered models. The competing risk analyses using the prodlim, riskRegression, and pec packages of R statistical software (R version 4.4.1, http://www.r-project.org).
Results
BED and interaction matrices
We have calculated the biologically effective dose (BED) distribution from the physical dose distribution using a linear quadratic (LQ) radiobiological model as described above. Figure 4 showed an example of a planning CT, a physical dose distribution extracted directly from the treatment plan, and the corresponding biologically effective dose (BED) distribution. Figure 5 showed interaction matrices calculated using planning CT and biologically effective dose (BED) distribution. (A) the Entropy matrix, (B) the Jensen-Shannon Divergence matrix (C) the pairwise multiplication matrix, (D) the Spearman rank correlation matrix, and E) the Wasserstein distance matrix.
Figure 4. Exemplary representation of (A) planning CT image, (B) physical dose distribution extracted directly from the treatment plan, and (C) calculated biologically effective dose (BED) distribution from one patient.
Figure 5. Interaction matrices calculated using planning CT and biologically effective dose (BED) - (A) Entropy matrix, (B) Jensen-Shannon Divergence matrix (C) pairwise multiplication matrix, (D) Spearman rank correlation matrix, and (E) Wasserstein distance matrix.
Feature extraction
We extracted features from the four volumes of interest (GTV, PTV, peritumor region, and ISO50) on each set of the seven matrices of interest – the CT image matrix, LQ_BED matrix, and five interaction matrices. From each ROI in a single matrix, 105 features were extracted, including 14 shape-based features and 91 first-order and textual features. Using radiomics mathematics, features were extracted from the volumes of interest (GTV, PTV, Peritumoral Region, and ISO50) across various matrices (CT, BED, Entropy, Jensen-Shannon Divergence, pairwise multiplication, Spearman rank correlation, and Wasserstein distance matrices). This process resulted in 420, 420, 840, and 2940 features, respectively, which were then processed in our feature selection pipeline, and the selected features were then utilized to develop four predictive models: CT-only, BED-only, a combination of CT and BED, and a composite model of CT and BED that integrates their interaction matrices.
Feature selection
From the same training set (151 patients), our in-house feature selection pipeline resulted in 17 important and non-redundant features for each model. The importance ranking is shown in Figure 6, and non-redundancy, assessed by the Pearson Coefficient being less than 0.5, are shown in Figures 7A–D, respectively.
Figure 6. Feature importance of the four models: (A) CT-Only model; (B) BED-Only model; (C) CT-BED model; (D) Composite model.
Figure 7. Pearson coefficient of features for (A) CT-Only, (B) BED-Only, (C) CT-BED, and (D) composite model.
Model performance
Repeated 5-fold cross-validation on the training dataset comprising 151 subjects demonstrated that the Balanced Random Forest model attained an average area under the receiver operating characteristic curve (AUC) of 0.98, 0.94, 0.99, and 0.99, respectively (Figure 8). Upon evaluation with the same independent testing set of 28 subjects, each model achieved AUC values of 0.56, 0.75, 0.73, and 0.82, respectively, with corresponding accuracies of 0.61, 0.79, 0.71, and 0.79, as shown in Figures 9A–D). Sensitivity and specificity of each model were displayed in Figures 10A–D via confusion matrices. The p-values for each pairwise comparison between models, calculated using the Delong test, are listed in Table 3.
Figure 8. Receiver-operating characteristic (ROC) curves of (A) CT-Only, (B) BED-Only, (C) CT-BED, and (D) composite model on the same training set.
Figure 9. Receiver-operating characteristic (ROC) curves of (A) CT-Only, (B) BED-Only, (C) CT-BED, and (D) composite model on the same independent test set.
Figure 10. Confusion Matrix of (A) CT-Only, (B) BED-Only, (C) CT-BED, and (D) composite model on the same independent test set.
As shown in Figure 11, the Decision Curve Analysis plot illustrated the net benefit of four models (CT, BED, CT-BED, and Composite) compared to the “Treat All” and “Treat None” scenarios across various threshold probabilities. The Composite model (red line) showed the highest net benefit over a wide range of threshold probabilities, suggesting it provides the most clinical utility. The other models, on the other hand, demonstrated fluctuating net benefits, with the CT-BED (green line) also performing well at certain thresholds. Our results indicated that the Composite model may be the most effective for guiding clinical decisions within the threshold probability range shown. As shown in Figure 12, the composite model achieved a Brie score of 0.18 indicating its superiority compared to other models in reflecting the observed outcomes by its predicted probabilities. The Brie score for both BED and CT-BED models are 0.20 and 0.20, respectively, showing that they had more deviations from the perfect predictions compared to the composite model. The CT model, on the other hand, exhibited poor calibration, diverging from the diagonal line, suggesting that its probability estimates were less reliable.
Figure 11. The decision curve analysis plot illustrating the net benefit of four models (CT, BED, CT-BED, and Composite) compared to the "Treat All" and "Treat None" scenarios across various threshold probabilities.
Competing risk time-to-event analysis
Figure 13 showed the Brier score for the training and validation data, respectively. There is no treatment failure or competing death event after 8 years for the training data and 4.5 years for the test data following the treatment. Therefore, a time range of 1 year to 8 years after treatment was used for the plot on training data and 1 year to 4.5 years for the plot of the validation data. For the training data, the Brier scores are similar and less than 0.25 for all considered models at year 1 to year 8, indicating all these considered models provides good performance. For the validation data, all models have Brier scores less than 0.25, though the combination of CT and Bed model provides larger (worse) brier score at year 1 to year 4 than other three models. In addition, all considered models provide good performance on the training data (c-index >70; IBS values <0.25). Among all these models, the CT only model has the worst performance with the smallest c-index and the largest IBS value. For the validation data, the BED only model and the composite model have better performance (c-index >70, and low IBS values). All these results are presented in Table 4.
Discussion
With technical advancements along with the rapid digitization of information, we are witnessing an exponential increase in the accumulation of data, such as medical imaging data and 3D dose distribution, in Radiation Oncology. Traditionally, these data were primarily utilized for human perception or visual representation. Our analytical capabilities for evaluating medical imaging, specifically CT and the 3D dose distribution matrix, were constrained to simplistic, 1-dimensional methodologies, such as dose-volume histogram (DVH). However, recent advancements in computational capacities and artificial intelligence technologies have revolutionized our approach, equipping us to transform data such as CT and dose matrix into structured, minable datasets. This enables the application of sophisticated data mining techniques to unearth complex patterns and associations that may be indicative of specific clinical outcomes. This transition not only enhances our ability to explore underlying mechanisms but also opens up new avenues for predictive analytics and decision-making based on comprehensive, multi-dimensional data analysis.
Owing to its distinct advantages in providing a comprehensive and quantitative representation of the radiographic phenotype of a 3D target volume (19–25), radiomics has become an active area of research for risk assessment and treatment response prediction in cancer management (26–30). However, despite its utility, radiomics remains constrained in its capacity for comprehensive treatment response assessment, particularly in Radiation Oncology. This limitation arises because radiomics focuses on extracting and analyzing features predominantly from the images. With the recent increase in the application of radiomics techniques in the 3D dose distribution domain, dosiomics features are being increasingly examined for their potential in predictive modeling. However, existing dosiomics studies (32, 39–43) have focused more on the features extracted directly from 3D dose distribution rather than their correlation, or interaction with the medical images. Logically, it is the interplay between the radiation and the underlying characteristics of irradiated tissue, as represented by medical images, that fundamentally determines the treatment outcome. Thus, a more holistic approach that incorporates both medical images and its interaction with the radiation dose is possibly essential for a more accurate and thorough assessment of treatment response.
Our study addresses this gap by incorporating 3D dose matrix into the analytical framework and calculating five distinct interaction matrices between CT and dose. These matrices are designed to potentially represent the dynamic interplay or correlation between CT and dose. This approach enhances the comprehensiveness of our analysis, potentially providing a deeper understanding of the interactions that may influence outcomes within our study’s scope. Specifically, the Entropy matrix elucidates local variations or complexity within data, where components exhibiting low entropy are more uniform compared to those with high entropy. The Jensen-Shannon divergence (JSD) matrix and the Wasserstein distance matrix both quantify the dissimilarity between CT and dose distributions in the local environment based on distinct mathematical perspectives with different emphases. Spearman’s rank correlation matrix assesses the monotonic relationship between CT and dose distributions in the local environment. By integrating these matrices into our feature mining process, we can potentially uncover additional hidden features that represent the voxelated interplay or correlation between images and dose, which might be pivotal in determining treatment outcomes. Our approach potentially allows for a more nuanced understanding of the factors influencing therapeutic efficacy. As shown in our results, the composite model, characterized predominantly by its top features from these interaction matrices, demonstrated superior performance compared to other models, indicating these interaction matrices are informative and predictive.
Although delta-radiomics (44–48) has also been utilized to evaluate treatment response in radiation oncology, it relies on analyzing temporal changes in imaging to predict clinical outcomes post-treatment. This retrospective and passive approach does not facilitate modifications or improvements to treatment plans, which are typically assessed using relatively simplistic, 1D metrics such as DVH prior to treatment delivery. Conversely, our approach, upon thorough validation, could offer physicians the ability to proactively select more effective treatment plans, potentially reducing treatment failures.
Our study has a few limitations. First, we have a relatively small sample size, and our conclusion necessitates cautious interpretation and requires validation through external datasets. The observed discrepancy in AUC values between the training set and the test set suggests that the sample size may be insufficient for robust model evaluation. Additionally, we did not observe the significant difference in AUC for the four models, which is also likely caused by our limited sample size since we only had 28 patients in the test set. Second, although we utilized the biological effective dose (BED) to account for the variations of biological effect due to various fractionations, it should be noted that voxels within the PTV were assigned an alpha-beta ratio of 10, while voxels outside the PTV were assigned a value of 3 in our calculation of the voxelated BED, as there is currently no widely accepted model for voxelated BED based on tissue electron density. Furthermore, we chose an empirical value of 10 for the alpha-beta ratio for the cancerous tissue, although this may not be the most appropriate for NSCLC, where higher values have been suggested (49, 50). In addition, applying a uniform alpha-beta ratio across all NSCLC subtypes may be overly simplistic, given the reported variations in cell survival curves among different lung cancer (51), which potentially overlooks the nuanced biological differences. Our current analyses did not evaluate the potential impact on the model performance from these variations. We did not address these considerations in our current study primarily because these specific values have not yet been widely validated and accepted in clinical settings. Third, we did not further investigate the uncertainty of the features caused by various tube current, voxel size resampling and kernel reconstruction as it has been demonstrated that the variations in tube current do not significantly affect radiomic features (52) and that radiomic features have shown no notable correlation with exposure (53). Also, since resampling minimizes the effect of voxel size on the radiomic features (53, 54), the image sets used in our study were resampled to 1 x 1 x 2 mm³. Additionally, all images were acquired using either the B31s or B31f convolution kernel from the single vendor (Siemens), and both of B31s and B31f kernel are considered as smooth kernels with limited expected difference (55). Fourth, in our analysis, we selected free-breathing CT arbitrarily without conducting an extensive comparison of potential results using average CT, given the fact that the superiority of free-breathing CT compared to average CT within the context of clinical treatment planning remains a subject for debate. Although additional features and outcomes could be explored further using average CT, we have proved our concept that integrating the interaction between dose and CT enhances prediction accuracy compared to using CT or dose alone.
Future investigations following our proof-of-concept study are still warranted in the following areas. First, our model needs to be validated, and possibly evolved, with external datasets. We also plan to perform independent tests using newer patients once the follow-up time meets our requirements as an alternative approach for further independent validation. Second, the impact of various alpha-beta ratios techniques on model performance, for biological effectiveness of SBRT or various histology subtypes, needs to be explored. Third, we also intend to initiate a clinical trial that utilizes our dosiomics model as a plan evaluation metric, in addition to the existing DVH-based metrics, to further validate or refine the model within a prospective setting. This approach potentially mitigates the inherent limitations of most radiomics studies, which primarily are retrospective and thus often fail to incorporate all necessary biological variables.
Conclusion
Our proof-of-concept study demonstrated that a dosiomics model, which incorporates the interaction between CT and dose, has demonstrated effectiveness in predicting treatment failures following lung SBRT treatment. This model holds potential as a proactive tool for evaluating and selecting treatment plans, aimed at reducing future treatment failures.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Institutional Review Board at the University of Nebraska Medical Center. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because our study does not affect the rights and welfare of the study subjects since our retrospective medical record study involved no more than minimal risks.
Author contributions
AB: Data curation, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. KJ: Data curation, Formal analysis, Writing – original draft, Writing – review & editing. KO: Data curation, Formal analysis, Writing – review & editing. FY: Methodology, Formal analysis, Writing – original draft. LH: Data curation, Writing – review & editing. YL: Data curation, Writing – review & editing, Methodology. SWi: Writing – review & editing. SZ: Methodology, Resources, Writing – review & editing. MB: Methodology, Resources, Writing – review & editing. CL: Methodology, Resources, Writing – review & editing. CZ: Data curation, Methodology, Resources, Writing – review & editing. SWa: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Resources, Supervision, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This project is funded by Otis Glebe Medical Foundation (#01100674) and the Buffett Cancer Center, which is supported by the National Cancer Institute under award number CA036727.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Abbreviations
BED, Biologically Effective Dose; CT, Computed Tomography; 4D CT, Four-Dimensional Computed Tomography; FB-CT, Free Breathing Computed Tomography; DICOM, Digital Imaging and Communications in Medicine; DVH, Dose Volume Histogram; GTV, Gross Tumor Volume; HU, Hounsfield Units; ISO50, 50% of the prescription isodose volume outside the PTV; LQ, Linear Quadratic; NOS, Not Otherwise Specified; NRRD, Nearly Raw Raster Data; NSCLC, Non-Small Cell Lung Cancer; OAR, Organs at Risk; PTV, Planning Target Volume; TPS, Treatment Planning System.
References
1. SEER Database. SEER database: cancer stat facts: lung and bronchus cancer. Available online at: https://seer.cancer.gov/statfacts/html/lungb.html (accessed May 26, 2024).
2. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2018. CA Cancer J Clin. (2018) 68:7–30. doi: 10.3322/caac.21442
3. Ganti AK, Klein AB, Cotarla I, Seal B, Chou E. Update of incidence, prevalence, survival, and initial treatment in patients with non-small cell lung cancer in the US. JAMA Oncol. (2021) 7:1824–32. doi: 10.1001/jamaoncol.2021.4932
4. Li C, Wang H, Jiang Y, Fu W, Liu X, Zhong R, et al. Advances in lung cancer screening and early detection. Cancer Biol Med. (2022) 1(5):591–608. doi: 10.20892/j.issn.2095-3941.2021.0690
5. Mountain CF. Revisions in the international system for staging lung cancer. Chest. (1997) 111:1710–7. doi: 10.1378/chest.111.6.1710
6. Timmerman RD, Hu C, Michalski JM, Bradley JC, Galvin J, Johnstone DW, et al. Long-term results of stereotactic body radiation therapy in medically inoperable stage I non-small cell lung cancer. JAMA Oncol. (2018) 4:1287–8. doi: 10.1001/jamaoncol.2018.1258
7. Onishi H, Shirato H, Nagata Y, Hiraoka M, Fujino M, Gomi K, et al. Stereotactic body radiotherapy (SBRT) for operable stage I non-small-cell lung cancer: can SBRT be comparable to surgery? Int J Radiat Oncol Biol Phys. (2011) 81:1352–8. doi: 10.1016/j.ijrobp.2009.07.1751
8. Pan H, Rose BS, Simpson DR, Mell LK, Mundt AJ, Lawson JD. Clinical practice patterns of lung stereotactic body radiation therapy in the United States: a secondary analysis. Am J Clin Oncol. (2013) 36:269–72. doi: 10.1097/COC.0b013e3182467db3
9. Chang JY, Senan S, Paul MA, Mehran RJ, Louie AV, Balter P, et al. Stereotactic ablative radiotherapy versus lobectomy for operable stage I non-small-cell lung cancer: a pooled analysis of two randomised trials. Lancet Oncol. (2015) 16:630–7. doi: 10.1016/S1470-2045(15)70168-3
10. Crabtree TD, Denlinger CE, Meyers BF, El Naqa I, Zoole J, Krupnick AS, et al. Stereotactic body radiation therapy versus surgical resection for stage I non-small cell lung cancer. J Thorac Cardiovasc Surg. (2010) 140:377–86. doi: 10.1016/j.jtcvs.2009.12.054
11. Grills IS, Mangona VS, Welsh R, Chmielewski G, McInerney E, Martin S, et al. Outcomes after stereotactic lung radiotherapy or wedge resection for stage I non-small-cell lung cancer. J Clin Oncol. (2010) 28:928–35. doi: 10.1200/JCO.2009.25.0928
12. Nyman J, Hallqvist A, Lund JA, Brustugun OT, Bergman B, Bergstrom P, et al. SPACE - A randomized study of SBRT vs conventional fractionated radiotherapy in medically inoperable stage I NSCLC. Radiother Oncol. (2016) 121:1–8. doi: 10.1016/j.radonc.2016.08.015
13. Ball HJ, Santanam L, Senan S, Tanyi JA, van Herk M, Keall PJ. Results from the AAPM Task Group 324 respiratory motion management in radiation oncology survey. J Appl Clin Med Phys. (2022) 23:e13810. doi: 10.1002/acm2.13810
14. Korreman SS. Motion in radiotherapy: photon therapy. Phys Med Biol. (2012) 57:R161–91. doi: 10.1088/0031-9155/57/23/R161
15. Sharma M, Nano TF, Akkati M, Milano MT, Morin O, Feng M. A systematic review and meta-analysis of liver tumor position variability during SBRT using various motion management and IGRT strategies. Radiother Oncol. (2022) 166:195–202. doi: 10.1016/j.radonc.2021.11.022
16. Caillet V, Booth JT, Keall P. IGRT and motion management during lung SBRT delivery. Phys Med. (2017) 44:113–22. doi: 10.1016/j.ejmp.2017.06.006
17. Ding GX, Alaei P, Curran B, Flynn R, Gossman M, Mackie TR, et al. Image guidance doses delivered during radiotherapy: Quantification, management, and reduction: Report of the AAPM Therapy Physics Committee Task Group 180. Med Phys. (2018) 45:e84–99. doi: 10.1002/mp.12824
18. Jaffray DA. Image-guided radiotherapy: from current concept to future perspectives. Nat Rev Clin Oncol. (2012) 9:688–99. doi: 10.1038/nrclinonc.2012.194
19. Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5:4006. doi: 10.1038/ncomms5006
20. Gillies RJ, Kinahan PE, Hricak H. Radiomics: images are more than pictures, they are data. Radiology. (2016) 278:563–77. doi: 10.1148/radiol.2015151169
21. Lambin P, Leijenaar RTH, Deist TM, Peerlings J, de Jong EEC, van Timmeren J, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. (2017) 14:749–62. doi: 10.1038/nrclinonc.2017.141
22. Parmar C, Grossmann P, Bussink J, Lambin P, Aerts HJ. Machine learning methods for quantitative radiomic biomarkers. Sci Rep. (2015) 5:13087. doi: 10.1038/srep13087
23. Sanduleanu S, Woodruff HC, de Jong EEC, van Timmeren JE, Jochems A, Dubois L, et al. Tracking tumor biology with radiomics: A systematic review utilizing a radiomics quality score. Radiother Oncol. (2018) 127:349–60. doi: 10.1016/j.radonc.2018.03.033
24. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res. (2017) 77:e104–e7. doi: 10.1158/0008-5472.CAN-17-0339
25. Aerts HJ. The potential of radiomic-based phenotyping in precision medicine: A review. JAMA Oncol. (2016) 2:1636–42. doi: 10.1001/jamaoncol.2016.2631
26. Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, et al. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. (2014) 5:4006. doi: 10.1038/ncomms5006
27. Dou TH, Coroller TP, van Griethuysen JJM, Mak RH, Aerts HJWL. Peritumoral radiomics features predict distant metastasis in locally advanced NSCLC. PloS One. (2018) 13:e0206108. doi: 10.1371/journal.pone.0206108
28. Macyszyn L, Akbari H, Pisapia JM, Da X, Attiah M, Pigrish V, et al. Imaging patterns predict patient survival and molecular subtype in glioblastoma via machine learning techniques. Neuro Oncol. (2016) 18:417–25. doi: 10.1093/neuonc/nov127
29. Fave X, Zhang L, Yang J, Mackin D, Balter P, Gomez D, et al. Delta-radiomics features for the prediction of patient outcomes in non-small cell lung cancer. Sci Rep. (2017) 7:588. doi: 10.1038/s41598-017-00665-z
30. van Timmeren JE, Leijenaar RTH, van Elmpt W, Reymen B, Oberije C, Monshouwer R, et al. Survival prediction of non-small cell lung cancer patients using radiomics analyses of cone-beam CT images. Radiother Oncol. (2017) 123:363–9. doi: 10.1016/j.radonc.2017.04.016
31. Gabrys HS, Buettner F, Sterzing F, Hauswald H, Bangert M. Design and selection of machine learning methods using radiomics and dosiomics for normal tissue complication probability modeling of xerostomia. Front Oncol. (2018) 8:35. doi: 10.3389/fonc.2018.00035
32. Murakami Y, Soyano T, Kozuka T, Ushijima M, Koizumi Y, Miyauchi H, et al. Dose-based radiomic analysis (Dosiomics) for intensity modulated radiation therapy in patients with prostate cancer: correlation between planned dose distribution and biochemical failure. Int J Radiat Oncol Biol Phys. (2022) 112:247–59. doi: 10.1016/j.ijrobp.2021.07.1714
33. Pieper S, Lorensen B, Schroeder W, Kikinis R. (2006). The NA-MIC Kit: ITK, VTK, pipelines, grids and 3D slicer as an open platform for the medical image computing community. 3rd IEEE International Symposium on Biomedical Imaging: Nano to Macro. IEEE Corporate Headquarters, New York, NY: IEEE, Vol. 2006. pp. 6–9.
34. Wang S, Lin C, Kolomaya A, Ostdiek-Wille GP, Wong J, Cheng X, et al. Compute tomography radiomics analysis on whole pancreas between healthy individual and pancreatic ductal adenocarcinoma patients: uncertainty analysis and predictive modeling. Technol Cancer Res Treat. (2022) 21:15330338221126869. doi: 10.1177/15330338221126869
35. Huynh LM, Bonebrake B, Tran J, Marasco JT, Ahlering TE, Wang S, et al. Multi-institutional development and validation of a radiomic model to predict prostate cancer recurrence following radical prostatectomy. J Clin Med. (2023) 12:7322–7333. doi: 10.3390/jcm12237322
36. Sun X, Xu W. Fast implementation of deLong’s algorithm for comparing the areas under correlated receiver operating characteristic curves. IEEE Signal Process Letters. (2014) 21:1389–93. doi: 10.1109/LSP.2014.2337313
37. Vickers AJ, van Calster B, Steyerberg EW. A simple, step-by-step guide to interpreting decision curve analysis. Diagn Progn Res. (2019) 3:18. doi: 10.1186/s41512-019-0064-7
38. Fine JP, Gray RJ. A proportional hazards model for the subdistribution of a competing risk. J Am Stat Assoc. (1999) 94:496–509. doi: 10.2307/2670170
39. Sun L, Burke B, Quon H, Swallow A, Kirkby C, Smith W. Do dosiomic features extracted from planned 3-dimensional dose distribution improve biochemical failure-free survival prediction: an analysis based on a large multi-institutional data set. Adv Radiat Oncol. (2023) 8:101227. doi: 10.1016/j.adro.2023.101227
40. Liang B, Yan H, Tian Y, Chen X, Yan L, Zhang T, et al. Dosiomics: extracting 3D spatial features from dose distribution to predict incidence of radiation pneumonitis. Front Oncol. (2019) 9:269. doi: 10.3389/fonc.2019.00269
41. Huang Y, Feng A, Lin Y, Gu H, Chen H, Wang H, et al. Radiation pneumonitis prediction after stereotactic body radiation therapy based on 3D dose distribution: dosiomics and/or deep learning-based radiomics features. Radiat Oncol. (2022) 17:188. doi: 10.1186/s13014-022-02154-8
42. Yang SS, OuYang PY, Guo JG, Cai JJ, Zhang J, Peng QH, et al. Dosiomics risk model for predicting radiation induced temporal lobe injury and guiding individual intensity-modulated radiation therapy. Int J Radiat Oncol Biol Phys. (2023) 115:1291–300. doi: 10.1016/j.ijrobp.2022.11.036
43. Chounta S, Allodji R, Vakalopoulou M, Bentriou M, Do DT, De Vathaire F, et al. Dosiomics-based prediction of radiation-induced valvulopathy after childhood cancer. Cancers (Basel). (2023) 15:3107–3128. doi: 10.3390/cancers15123107
44. Wu RR, Zhou YM, Xie XY, Chen JY, Quan KR, Wei YT, et al. Delta radiomics analysis for prediction of intermediary- and high-risk factors for patients with locally advanced cervical cancer receiving neoadjuvant therapy. Sci Rep. (2023) 13:19409. doi: 10.1038/s41598-023-46621-y
45. Nardone V, Reginelli A, Grassi R, Boldrini L, Vacca G, D’Ippolito E, et al. Delta radiomics: a systematic review. Radiol Med. (2021) 126:1571–83. doi: 10.1007/s11547-021-01436-7
46. Cousin F, Louis T, Dheur S, Aboubakar F, Ghaye B, Occhipinti M, et al. Radiomics and delta-radiomics signatures to predict response and survival in patients with non-small-cell lung cancer treated with immune checkpoint inhibitors. Cancers (Basel). (2023) 15:1968–1982. doi: 10.3390/cancers15071968
47. Xie D, Xu F, Zhu W, Pu C, Huang S, Lou K, et al. Delta radiomics model for the prediction of progression-free survival time in advanced non-small-cell lung cancer patients after immunotherapy. Front Oncol. (2022) 12:990608. doi: 10.3389/fonc.2022.990608
48. Lin P, Yang PF, Chen S, Shao YY, Xu L, Wu Y, et al. A Delta-radiomics model for preoperative evaluation of Neoadjuvant chemotherapy response in high-grade osteosarcoma. Cancer Imaging. (2020) 20:7. doi: 10.1186/s40644-019-0283-8
49. Klement RJ, Sonke JJ, Allgauer M, Andratschke N, Appold S, Belderbos J, et al. Estimation of the alpha/beta ratio of non-small cell lung cancer treated with stereotactic body radiotherapy. Radiother Oncol. (2020) 142:210–6. doi: 10.1016/j.radonc.2019.07.008
50. Chi A, Wen S, Liao Z, Fowler J, Xu J, Nguyen NP, et al. What would be the most appropriate alpha/beta ratio in the setting of stereotactic body radiation therapy for early stage non-small cell lung cancer. BioMed Res Int. (2013) 2013:391021. doi: 10.1155/2013/391021
51. Morstyn G, Russo A, Carney DN, Karawya E, Wilson SH, Mitchell JB. Heterogeneity in the radiation survival curves and biochemical properties of human lung cancer cell lines. J Natl Cancer Inst. (1984) 73:801–7. doi: 10.1093/jnci/73.4.801
52. Mackin D, Ger R, Dodge C, Fave X, Chi PC, Zhang L, et al. Effect of tube current on computed tomography radiomic features. Sci Rep. (2018) 8:2354. doi: 10.1038/s41598-018-20713-6
53. Larue R, van Timmeren JE, de Jong EEC, Feliciani G, Leijenaar RTH, Schreurs WMJ, et al. Influence of gray level discretization on radiomic feature stability for different CT scanners, tube currents and slice thicknesses: a comprehensive phantom study. Acta Oncol. (2017) 56:1544–53. doi: 10.1080/0284186X.2017.1351624
54. Shafiq-Ul-Hassan M, Zhang GG, Latifi K, Ullah G, Hunt DC, Balagurunathan Y, et al. Intrinsic dependencies of CT radiomic features on voxel size and number of gray levels. Med Phys. (2017) 44:1050–62. doi: 10.1002/mp.12123
Keywords: radiomics, dosiomics, lung SBRT, modeling, CT-dose interaction, treatment failure, Non-Small Cell Lung Cancer (NSCLC)
Citation: Bhandari A, Johnson K, Oh K, Yu F, Huynh LM, Lei Y, Wisnoskie S, Zhou S, Baine MJ, Lin C, Zhang C and Wang S (2024) Developing a novel dosiomics model to predict treatment failures following lung stereotactic body radiation therapy. Front. Oncol. 14:1438861. doi: 10.3389/fonc.2024.1438861
Received: 26 May 2024; Accepted: 07 November 2024;
Published: 12 December 2024.
Edited by:
Guang Li, Memorial Sloan Kettering Cancer Center, United StatesCopyright © 2024 Bhandari, Johnson, Oh, Yu, Huynh, Lei, Wisnoskie, Zhou, Baine, Lin, Zhang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chi Zhang, Y2hpLnpoYW5nQHVubWMuZWR1; Shuo Wang, c2h1by53YW5nQHVubWMuZWR1