Deep-learning-driven dose prediction and verification for stereotactic radiosurgical treatment of isolated brain metastases

Pan, Jinghui; Xiao, Jinsheng; Ruan, Changli; Song, Qibin; Shi, Lei; Zhuo, Fengjiao; Jiang, Hao; Li, Xiangpan

doi:10.3389/fonc.2023.1285555

ORIGINAL RESEARCH article

Front. Oncol., 20 November 2023

Sec. Radiation Oncology

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1285555

Deep-learning-driven dose prediction and verification for stereotactic radiosurgical treatment of isolated brain metastases

Jinghui Pan^1,2

Jinsheng Xiao¹

Changli Ruan²

Qibin Song³

Lei Shi³

Fengjiao Zhuo⁴

Hao Jiang^1*

Xiangpan Li^2*

¹School of Electronic Information, Wuhan University, Wuhan, Hubei, China
²Department of Radiation Oncology, Renmin Hospital, Wuhan University, Wuhan, Hubei, China
³Department of Oncology, Renmin Hospital, Wuhan University, Wuhan, Hubei, China
⁴Department of Radiation Oncology, Jiangling County People’s Hospital, Jingzhou, Hubei, China

Purpose: While deep learning has shown promise for automated radiotherapy planning, its application to the specific scenario of stereotactic radiosurgery (SRS) for brain metastases using fixed-field intensity modulated radiation therapy (IMRT) on a linear accelerator remains limited. This work aimed to develop and verify a deep learning-guided automated planning protocol tailored for this scenario.

Methods: We collected 70 SRS plans for solitary brain metastases, of which 36 cases were for training and 34 for testing. Test cases were derived from two distinct clinical institutions. The envisioned automated planning process comprised (1): clinical dose prediction facilitated by deep-learning algorithms (2); transformation of the forecasted dose into executable plans via voxel-centric dose emulation (3); validation of the envisaged plan employing a precise dosimeter in conjunction with a linear accelerator. Dose prediction paradigms were established by engineering and refining two three-dimensional UNet architectures (UNet and AttUNet). Input parameters encompassed computed tomography scans from clinical plans and demarcations of the focal point alongside organs at potential risk (OARs); the ensuing output manifested as a 3D dose matrix tailored for each case under scrutiny.

Results: Dose estimations rendered by both models mirrored the manual plans and adhered to clinical stipulations. As projected by the dual models, the apex and average doses for OARs did not deviate appreciably from those delineated in the manual plan (P-value≥0.05). AttUNet showed promising results compared to the foundational UNet. Predicted doses showcased a pronounced dose gradient, with peak concentrations localized within the target vicinity. The executable plans conformed to clinical dosimetric benchmarks and aligned with their associated verification assessments (100% gamma approval rate at 3 mm/3%).

Conclusion: This study demonstrates an automated planning technique for fixed-field IMRT-based SRS for brain metastases. The envisaged plans met clinical requirements, were reproducible across centers, and achievable in deliveries. This represents progress toward automated paradigms for this specific scenario.

1 Introduction

Brain metastases rank as the most ubiquitous intracranial tumors and significantly contribute to the elevated mortality and disability rates associated with cancer. An estimated 25-40% of malignant neoplasms culminate in brain metastases (1, 2). Such metastatic occurrences predominantly manifest in patients diagnosed with non-small cell lung, breast, and melanoma. However, they can also arise from gastrointestinal tract malignancies, liver, pancreas, uterus, ovary, thyroid, adrenal gland, prostate, kidney, and bone (3). Present-day therapeutic interventions primarily involve surgery and radiotherapy, with Stereotactic radiosurgery (SRS) and whole brain radiation therapy (WBRT) (4, 5) emerging as the leading radiotherapeutic techniques. Due to its spatial dose distribution, exemplary conformal shape, reduced treatment sessions, enhanced tumor control rate and diminished adverse effects on healthy tissues compared to WBRT, SRS often holds a therapeutic advantage. Consequently, SRS is extensively employed in treating brain metastases (6, 7).

Current treatment planning strategies generally use the inverse intensity-modulated radiotherapy (IMRT) method, in which a clinical goal is first determined, followed by optimization of dose distribution to meet this goal. Physicists generally need to adjust the optimization parameters repeatedly to obtain a clinically plausible plan due to factors such as dose limitations for organs at risk, multileaf collimator physical constraints, etc. (8, 9). This planning procedure is time-consuming, labor-intensive, and may result in inconsistent treatment quality. Thus, automatic planning methods have been proposed to improve the quality and efficiency of IMRT planning. For example, the knowledge-based automatic planning method (10–12) predicts the dose distribution for a new patient by training on the dataset of accepted clinical plans. The predicted plan provides a better starting point, thereby reducing the frequency of trial and error. However, this method has certain limitations. On one hand, the method extracts the hand-design features from the patient’s anatomy, which separates the processes of feature extraction and dose prediction, resulting in a suboptimal solution; on the other hand, the predicted dose is usually a one-dimensional dose-volume histogram (DVH) or zero-dimensional dose point parameter, which cannot accurately reflect the three-dimensional (3D) dose distribution.

In recent years, advanced deep learning methods have made great progress in the automatic segmentation of organs at risk and multimodal image registration. This success has also inspired research on end-to-end 3D dose prediction using deep learning. In light of the excellent performance of ResNet (13) in image classification, Chen (14) and Fan (15) proposed deep learning methods based on ResNet to predict the 3D dose distribution of IMRT plans for head and neck cancer. However, ResNet performs resolution reduction operations on the images, resulting in low resolution of the predicted doses. To overcome this problem, Nguyen (16) introduced the 3D UNet (17) network, originally used for image segmentation, to facilitate dose prediction during IMRT planning of prostate cancer. UNet (18, 19) has a unique resolution-preserving feature that improves the resolution of the predicted dose distributions. UNet-based dose prediction has been predominantly applied to non-stereotactic intensity modulated radiation therapy (IMRT) plans (20–24), with limited exploration for stereotactic radiation therapy. A few studies have utilized UNet for stereotactic body radiation therapy (SBRT) dose prediction, including Kearney et al. (25) for prostate cancer and Momin et al. (26) for pancreatic cancer. Zhang et al. (27) developed a UNet model to predict Gamma Knife radiosurgery dose distributions for intracranial tumors. However, the application of deep learning for dose prediction in the specific scenario of linear accelerator-based stereotactic radiosurgery using fixed-field IMRT for brain metastases has not been extensively studied. Moreover, some commercial vendors such as Brainlab and Varian have partially implemented automated planning products using deep learning, though these are not tailored for the particular scenario addressed here.

Furthermore, while deep learning for dose prediction has been widely reported, conversion and delivery verification of predicted doses into clinically viable treatment plans remains an unmet challenge. Dose distributions forecasted on a voxel basis may not be achievable in practice due to beam and collimator limitations.

In summary, automated treatment planning for brain metastasis radiosurgery is an active research area with multiple academic and commercial solutions in various stages of development. Groups such as Zhang et al. have explored techniques like deep learning for this application. Commercial products from vendors including Brainlab and Varian perform automated planning tasks for radiosurgery of brain metastases using diverse approaches including deep-learning. However, exploring novel approaches may offer opportunities to improve on existing techniques. Here we describe a deep learning approach to planning brain metastasis radiosurgery as delivered by a linac radiosurgery platform employing a co-planar fixed-field IMRT delivery technique.

We proposed an automated treatment planning pipeline that integrates dose prediction, plan generation, and delivery verification. Two 3D UNet architectures were implemented to forecast dose distributions from CT and contour data. The proposed AttUNet model additionally leverages an attention mechanism to incorporate CT information. The predicted doses were transformed into deliverable plans using a dose mimicking approach (28). The resulting plans were validated on a 6MV linear accelerator with orthogonal stacked multileaf collimation. Testing on multi-institutional data affirmed the clinical applicability of the proposed models.

2 Materials and methods

2.1 Patient data

A total of 70 patients with solitary brain metastases treated with IMRT from April 2019 to July 2022 were recruited. Among them, 60 patients were selected from Radiotherapy Center 1, and the remaining 10 were selected from Radiotherapy Center 2. The training dataset consists of 36 patients randomly selected from Center 1. The subsequent evaluation of the models was conducted using the residual 34 patients from both centers.

The prescription dose for all patients was 30 Gy in 5 fractions. The dosimetric considerations for SRS were summarized as follows. The dose to 95% primary gross tumor volume (PGTV) reaches the prescription dose. Any areas receiving greater than 105% of the prescription dose, commonly referred to as high-dose spillage, are generally confined to the PGTV. For difficult cases, normal tissue volume receiving >105% of the prescription dose should be kept under 15% of the PGTV. Intermediate-dose spillage is responsible for most of the toxicity associated with SRS (29). The dose to any point 2 cm away from the PGTV surface (D2cm) should be below a limit. The ratio of 50% isodose volume (TV50%) to the PGTV volume (expressed as R50% = TV50%/PGTV) ought to be minimized to the greatest extent feasible.

2.2 Data processing

Using in-house Python scripts, the anatomical structure (RT-Struct), clinical dose distribution, and CT images were extracted from DICOM files. The coordinate systems of the structure and dose were aligned with the coordinate system of the CT image. Integer values denoted PGTV and organs at risk (OARs). Dose values were normalized to a range of -1 to 1. For enhanced image contrast, the CT values outside the range [-200, 300] were clipped to the interval edges. The resulting values were further normalized to [-1, 1]. To reduce computational complexity and save graphics memory, all data were cropped to 224 × 224 × 32 pixels with a pixel size of 2.5 × 2.5 × 2.5 mm. In the preliminary experiment, we found that the PGTV size greatly influences the model performance. As a result, we categorized training and test patients into two groups based on PGTV volume. Patients with a PGTV ≤ 20 cc were grouped under small PGTV, while those with a PGTV > 20 cc were labeled as large PGTV. In the training set, there were 36 cases in total: 15 cases in the large target area and 21 in the small target area. The test set comprised of 34 cases: 17 cases in the large target area and 17 in the small target area. Models were then trained and evaluated separately for each group.”

2.3 Neural network structure design and training

We employed the deep learning framework PyTorch (Meta, Menlo Park, CA) to build dose prediction models. The first one is the 3D UNet which has been applied extensively to dose prediction tasks. The UNet model takes anatomical structures as input and outputs dose distribution. The second is our proposed 3D AttUNet, which uses anatomical structures and CT images as inputs. A novel attention mechanism was developed to facilitate the information fusion between CT images and anatomical structures. Figure 1A illustrates the two dose prediction models.

FIGURE 1

Figure 1 (A) Workflow of the dose prediction models. (B) UNet network. (C) Multi-head attention mechanism. (D) UNet equipped with multi-head attention (AttUNet). CT, computed tomography; ReLU, rectified linear unit.

The widely used UNet network is designed to overcome the problem of resolution reduction caused by the pooling operation of convolutional neural networks. The network structure consists of two parts, namely, the encoder and the decoder. The encoder’s input layer is a 224×224×32 matrix, representing processed CT images and structural contours. The output consists of the extracted features of the anatomical structures. The decoder predicts a 224×224×32 dose matrix from the extracted features. As shown in Figure 1B, the encoder encompasses four layers bridged by 2x2x2 max pooling. Each layer incorporates dual 3x3x3 convolutions succeeded by batch normalization (BN) and rectified linear unit (ReLU) activation. The filter count commences at 32, and doubles post each pooling, peaking at 512 filters in the bottleneck layer. The decoder, mirroring the encoder, progressively upsamples the features to match the original input resolution. Skip connections amalgamate encoder and decoder features. UNet’s distinctive feature is its low-rank compact representation of anatomical structures, enhancing model generalizability.

The proposed AttUNet is an improved version of the above UNet. As illustrated in Figure 1D, two encoders extract features from the CTs and structures separately. The CT and structure features are fused by a multi-head attention layer before being passed to the decoder. The main purpose of attention is to allow an element in the structure features to look at all elements in CT features for clues that can help lead to a better encoding for this element. As illustrated in Figure 1C, we use 8 attention heads with 64 dimensional features per head. The attention layer introduces three matrices: Query, Key, and Value, to calculate the attention scores. The scores measure the importance of the key term compared to the query term related to a feature element. The key and value matrices are computed from the CT features with the dot product. The query matrix is obtained similarly from the structure features. The score is calculated by taking the dot product of the query with the key of the respective element we are scoring. The attention block is further refined by adding a mechanism called multi-head attention. It expands the model’s ability to focus on different positions and gives the attention layer multiple representation subspaces. The decoder, akin to UNet, comprises four upsampling layers.

We initialized the neural networks’ parameters with Kaiming initialization (30). The mean squared error between the inputs and outputs was used as the loss function. The network parameters were optimized on two NVIDIA 3090 GPUs using Adam optimizer with a learning rate set at 0.0001 and a batch size of 2. Whenever the validation loss plateaued, the learning rate was reduced by a factor of 10. Model training spanned 1000 epochs, incorporating an early stopping criterion if the validation loss saw no improvement over 100 consecutive epochs.

2.4 Dosimetric comparison between clinical and predicted doses

We employed specific dosimetric parameters (31) to juxtapose the predicted doses with clinical doses.

For PGTV, the dosimetric parameters encompass the doses covering 2%, 98%, and 95% of the target volume (D2%, D98%, and D95%), mean dose (Dmean), homogeneity index (HI), conformity index (CI), and intermediate-dose spillage (R50% and D2cm). The dosimetric parameters for OARs include the maximum point dose in brainstem volume (brainstem Dmax), the volume of brainstem receiving 23 Gy (brainstem V23), optic nerve Dmax, optic chiasm Dmax, lens Dmax, lens Dmean, eyeball Dmax, eyeball Dmean, and hypophysis Dmax.

The CI is defined in equation (1), indicating the ratio of the volume of the isodose shell that receives the prescription dose to the PGTV volume (32). VPGTV is the PGTV volume. VRX is the volume enclosed by the prescription isodose line. PGTVRX is the volume of PGTV enclosed by the prescription isodose line. The closer the value of the CI is to 1, the higher the conformity of the target volume.

\begin{array}{l} C I = \frac{{[P G T V_{R X}]}^{2}}{V_{P G T V} \times V_{R X}} & (1) \end{array}

HI is defined in equation (2), indicating the difference between the maximum dose, the minimum dose, and the average dose in the target region (33). The dose distribution in the target region is considered to be homogeneous when the HI = 0. The larger the HI value, the poorer the homogeneity of dose distribution in the target region.

\begin{array}{l} H I = \frac{D_{2 %} - D_{98 %}}{D_{m e a n}} & (2) \end{array}

R50% is defined in equation (3), representing ratio of 50% isodose volume V50%RX to the PGTV volume VPGTV. A larger R50% indicates a poor dose drop.

\begin{array}{l} R_{50 %} = \frac{V_{50 % R X}}{V_{P G T V}} & (3) \end{array}

D2cm is defined as the maximum dose within 2 cm outside the PGTV. A larger D2cm indicates a poor dose drop.

The dosimetric parameter difference between predicted and clinical doses was patient-wisely calculated as their absolute difference $| δD |$ and normalized to the prescription dose (30 Gy). The paired-sample t-test was used to assess the statistical significance of the difference mentioned above. A P-value > 0.05 indicates that the predicted dosimetric parameters do not deviate from the clinical ones, whereas a P-value< 0.05 implies the predicted plan is significantly different from the clinical plan.

The dose-point difference between clinical and predicted doses was computed point-wisely as $δ (r, r) = D_{c} (r) - D_{p} (r)$ , where r represents the point position. The mean ( $μ_{δ (r, r)}$ ), standard deviation ( $σ_{δ (r, r)}$ ), and mean absolute error (MAE) ( $MAE = \frac{1}{n} \sum_{i}^{n} ∣ D_{c} (r) - D_{p} (r) ∣_{i}$ ) of $δ (r, r)$ were calculated to assess the accuracy of the predicted dose.

We used the dice similarity coefficient (DSC) to assess the 3D isodose accuracy, $DSC (α, b) = \frac{2 | α \cap b |}{| α | + | b |}$ , where $α$ represents the isodose volume for the clinical plan and b is the isodose volume for the predicted outcome. We also used the 3D gamma passing rate at 3 mm/3% to evaluate the prediction accuracy (34–36).

For SRS plans, the dose gradient and the hot spot’s positioning are also of paramount importance. Hence, we extracted the dose profile along the cross-section’s axis to assess both the dose gradient and the location of the hot spot.

2.5 Plan delivery and dosimetric verification

The predicted dose distributions were imported into LinaTech TiGRT TPS (LinaTech, Sunnyvale, CA) for dose mimicking. This approach enabled the creation of a clinically viable plan within the TPS system that mirrored the predicted dose without necessitating any alterations to the commercial software. Initially, coplanar beams were auto-configured based on institutional guidelines, eliminating the need for user intervention. Specifically, the inaugural beam direction is established by linking the center of the entire brain to the PTV’s center. Subsequently, four additional beams are introduced clockwise at 20-degree intervals from the primary beam. Another quartet of beams is added counter-clockwise, culminating in nine uniformly spaced beams. The optimizer then progressively refines the fluence intensity maps to curtail the voxel-wise disparity between the actualized and anticipated doses. The refined fluence maps are then transitioned into deliverable MLC sequences using the TPS’s inherent beam sequencing algorithms. A novel dual-layer orthogonal stacked MLC was used for the MLC sequencing, which collimates the 6 MV flattening-filter-free beam from a VenusX LINAC, as depicted in Supplementary Figure 1. The generated plans were then compared with clinical plans using the dosimetric parameters described in section 2.4. Figure 2 provides the overview of the dose mimicking.

FIGURE 2

Figure 2 Pipeline of the dose mimicking. MLC, multi-leaf collimator; MC, Monte Carlo.

Based on our current hardware condition, the delivered doses were verified using the MatriXX Evolution 2D ionization chamber array and OmniPro IMRT (IBA Dosimetry, Schwarzenbruck, Germany) according to the routine for dose quality control. All the fields were set to gantry angle 0 and then exposed to the MatriXX to obtain the dose maps. The measured dose maps were then compared with the planned dose maps using the 2D gamma passing rate at 3 mm/3%.

3 Results

3.1 Dosimetry index statistics and DVH comparison

Figure 3A shows the DVHs of the predicted and clinical plans for two test patients from two clinical centers. For both models, the DVHs of the predicted doses were close to the DVHs of the clinical plans. And the PGTV D95% of the predicted doses reached the prescription dose.

FIGURE 3

Figure 3 (A) Comparison of the DVHs of two test cases from two clinical centers (solid: clinical dose, dashed-dotted: prediction dose of UNet, dotted: prediction dose of AttUNet). (B) Illustrated dose distribution and the corresponding dose difference for the two patients ( $D_{D o s e D i f f e r e n c e} = D_{\Pr e d i c t e d D o s e} - D_{C l i n i c a l D o s e}$ ). (C) Dose heatmaps of the two patients (top left: clinical dose, top right: UNet dose, bottom left: AttUNet dose, bottom right: normalized dose profile along the Y-axis of the cross lines). DVH, dose-volume histogram; PGTV, primary gross tumor volume.

Figure 3B shows the dose distributions of the clinical plan, the predicted plan, and their absolute difference for the two test patients. We observed two models produced similar dose distributions and achieved a slightly better intermediate-dose spillage than the clinical plans.

Figure 3C shows that both models controlled the hot spot within the target region. The intersecting point of the cross lines indicates the maximum dose. We also plot the normalized dose profile along the vertical cross line in the bottom right subfigure. We found the high-dose spillage was confined within the PGTV area.

3.2 Dosimetric parameters of the predicted dose

Table 1-1 presents the dosimetric values for the small PGTV group across the two models, while Table 1-2 delineates the dosimetric values for the large PGTV group. It was observed that the doses produced by both UNet and AttUNet closely matched the clinical benchmarks in terms of D98%, D95%, D50%, D2%, Dmean, HI, and CI. With UNet, the statistical analysis indicated that six out of the seven dosimetric parameters in the small PGTV group significantly deviated from the clinical dose. In contrast, only one parameter in the large PGTV group demonstrated a statistically significant variation from the clinical dose. Conversely, the proposed AttUNet yielded statistically indistinguishable dose distributions across all seven parameters in the small PGTV group. Notably, even though the predicted D95% value by both models in the small PGTV group was statistically different from the clinical dose, the actual and predicted dose values were remarkably similar in each instance, with only negligible differences. Additionally, both UNet and AttUNet exhibited a steeper dose fall-off surrounding the PGTV in the large PGTV group compared to the clinical plan, as evidenced by the D2cm and R50% values. AttUNet manifested a more consistent dose gradient than UNet, as highlighted by the P-value.

TABLE 1–1

Table 1–1 Target dosimetric parameters of small PGTV group.

TABLE 1–2

Table 1–2 Target dosimetric parameters of large PGTV group.

Table 2 summarizes the dosimetric parameters of OARs for the two models. For 11 out of 14 patients, the difference in dosimetric parameters between predicted and clinical doses were not statistically significant. For the three deviating parameters (left, right eye Dmean, and right eye Dmax), UNet achieved lower toxicity AttUNet, although the dose distribution produced by AttUNet did not exceed the dose limitation of the eye.

TABLE 2

Table 2 OARs dosimetric parameters.

In summary, Tables 1-1, 1-2, and 2 suggest that the two models produced clinically acceptable dose predictions, and AttUNet outperformed UNet slightly.

3.3 Comparison of the two prediction models

Dose difference between clinical and predicted distributions was computed point-wisely. Figure 4A plots the median, mean, and standard deviation of the dose difference for 12 test patients. The difference was rescaled relative to the prescription dose. The two models showed similar predictive ability among the test patients. For example, they both predicted large dose differences for patient 7 and 22. And they both achieved small dose differences for patient 1, 25, 28, and 34. Overall, AttUNet achieved a smaller average MAE (4.3%) for all patients than UNet (4.4%). The maximum MAE was 13.8% for UNet and 10.6% for AttUNet. These findings indicate that AttUNet performed more closely in line with the desired outcomes than UNet when assessing point-wise dose difference.

FIGURE 4

Figure 4 (A) point-wise dose difference between predicted and clinical doses for UNet (left) and AttUNet (right). Mean, median, and standard deviation are plotted in blue, red, and box, respectively. (B) patient-wise dosimetric difference of UNet (green) and AttUNet (purple). (C) dice score of the two models as a function of relative dose for 12 test patients. PGTV, primary gross tumor volume; OARs, organs at risk.

Figure 4B compares the dosimetric parameter difference of UNet and AttUNet. For 16 out of 20 parameters, AttUNet surpassed the UNet.

Figure 4C compares the DSC of clinical and predicted doses for 12 test patients. The DSC of 1 indicates an ideal match of 3D isodose surface distribution. The predicted doses were close to the clinical doses in the high-dose area, while the model performance in the low-dose area was relatively poor.

While AttUNet generally outperformed UNet, it is harder to train, as illustrated in Figure 5, where training loss (weight MSE) was plotted as the function of the training epoch. We found the training dynamic of UNet is quite stable, while AttUNet fluctuated over the training process.

FIGURE 5

Figure 5 Training loss as a function of the training epoch for the two models. MSE, mean square error.

3.4 Dose delivery verification

Two deliverable plans were generated from the predicted doses following the dose-mimicking pipeline shown in Figure 2. Dose mimicking is achieved by optimizing fluence maps to replicate the predicted dose distribution closely. Typically, this optimization spans between 1 and 5 minutes. Following this, the fluence maps are seamlessly transformed into MLC sequences, upon which a Monte Carlo dose algorithm calculates the ultimate delivery dose. This phase necessitates a duration of roughly 2 to 10 minutes.

Their DVHs, gamma passing rates, and dosimetric parameters are shown in Figures 6A, 6B, and Table 3, respectively.

FIGURE 6

Figure 6 (A) Comparison of DVHs of two test cases from two clinical centers (solid line indicates the clinical dose; dotted line indicates the delivery dose). (B) Gamma pass rate (%) of the two delivered plans under the 3 mm/3% criterion. DVH, dose-volume histogram.

TABLE 3

Table 3 Dosemetric parameters of the delivered plans.

Figure 6A compares the DVHs of the delivered plans and clinical plans. Table 2 reports the PGTV and OAR dosimetric parameters of the delivered plans and clinical plans. The results suggest that the delivered plans met the clinical criteria.

Figure 6B shows the gamma pass rate of the two delivered plans at the 3 mm/3% criterion. Both plans achieved a 100% passing rate, indicating successful delivery.

3.5 Automatic planned quality and efficiency assessment

The 20 auto-generated plans underwent a thorough reevaluation by a physician boasting a decade of planning expertise, confirming that all were clinically acceptable. Furthermore, when the automatic plans were juxtaposed with their corresponding manual versions for the 20 cases and presented to the physician for assessment, they were tasked with selecting the superior plan. Impressively, 14 of the auto-generated plans were the favored choice of the radiologist. Given that they fulfill clinical prerequisites, the automatic plans exhibit swifter fall-down in low doses and superior low-dose protection compared to their manual counterparts. Figure 7 provides a comprehensive visual representation of the clinical pass rate and the physician’s selection criteria.

FIGURE 7

Figure 7 Plan assessment conducted by a physician with 10 years of experience.

Supplementary Table 1 enumerates the optimization durations for both automatic and manual plans. Specifically, for solitary brain metastase plans, the mean and standard deviation of optimization time are tabulated as ([481.2 ± 55/1212 ± 237.7] s, [Automatic/manual]). In this study, the fluence map’s automated generation utilized a deposition matrix, a method proposed by M.C. The strategy ensures target coverage and observes organ-at-risk constraints. It also considers the dose gradient beyond the target region within the stereotactic radiotherapy plans’ context. This requires the harnessing of the patient’s entire CT voxel dataset. Consequently, this method processes a more substantial voxel dataset than the automatic plans designed for conventional fractionated doses, which might account for a more extended duration. However, creating automated plans that adhere to clinical standards undeniably offers a time-saving advantage over manual plan formulations.

4 Discussion

Two deep-learning dose prediction models based on 3D UNet have been developed and evaluated for brain metastases SRS. The comparison of the predicted and clinical doses shows that the average MAE was 4.4% and 4.3% for UNet and AttUNet, respectively. The dice similarity coefficient of isodose volume above 65% of the prescribed dose exceeded 85%. The predicted doses had a sharp dose gradient, and hot spots fell within the target region. The dose verification results show that the deliverable plans achieved a 100% gamma pass rate at 3 mm/3% and met the dose evaluation criteria. We also observed some unfavorable issues. For instance, in the low-dose area (below 65% of the prescribed dose), the model prediction results varied considerably from the clinical plan. For some patients, the DSC of isodose was less than 85%, suggesting the models’ heightened focus on PGTV. Training loss metrics highlighted that AttUNet experienced more fluctuating training dynamics compared to UNet’s steady convergence. This inconsistency could be attributed to several factors:

The integrated attention mechanism amplifies the model’s complexity, complicating optimization.

AttUNet grapples with assimilating information from two diverse data sources (CTs and structures), whereas UNet exclusively processes structures. Streamlining this fusion is challenging.

The restricted dataset size might be inadequate for effectively training the augmented parameters in AttUNet. Potential solutions encompass employing advanced optimization strategies, enhancing data augmentation, modifying architectural components, and rigorously monitoring validation performance. Further research is paramount to facilitate the robust and efficient training of intricate multimodal structures like AttUNet for dose prediction endeavors.

While deep learning has been rigorously explored for dose prediction in radiotherapy planning, making direct comparisons with extant models remains challenging due to differences in treatment modalities and patient datasets used across studies. Campbell (37) noted an MAE discrepancy for SBRT for pancreatic cancer not surpassing 10%, while our AttUNet model registered a peak MAE of 10.6%. Liu (38) disclosed a 3D gamma pass rate of 81.5-93.4% for helical tomotherapy for nasopharyngeal carcinoma utilizing a U-ResNet-D model. Our study’s corresponding figures were 87.9% for UNet and 85.5% for AttUNet at 3 mm/3% for brain metastases.

A prevalent limitation in contemporary dose prediction techniques is the omission of deliverable plan verification. The predicted dose distributions are not necessarily clinically deliverable, even though they are derived from previously deliverable treatment plans. The current work, therefore, completed the entire automated planning pipeline in a closed-loop framework. Furthermore, the proposed AttUNet incorporates the multi-head attention mechanism to combine features from CT images and anatomical structures. This integration aims to discern both commonalities and differences, potentially enhancing the model’s efficiency. Our experiments, based on this specific dataset, indicated that AttUNet’s performance was more in line with the desired outcomes than UNet across various criteria. However, it’s important to note that these findings might not be generalizable to other datasets given the limited scope of our testing. Tests across two centers validated the model’s predictive prowess in a multi-center setting. Utilizing the forecasted dose as an initial reference for planning assures consistent plan quality, independent of planner, TPS, and LINAC variables. The automated planning method proposed herein operates autonomously, devoid of manual intervention, culminating in 3-17 minutes. In contrast, traditional planning durations for multi-met SRS cases using HyperArc have been documented at 77 minutes (39). Thus, our novel methodology significantly curtails planning durations by obviating labor-intensive manual iterations.

While the models we have developed exhibit encouraging outcomes for single-target brain metastases SRS, several constraints warrant attention. Firstly, these models have been tailored and assessed exclusively for isolated brain metastases, a scenario markedly simpler than multi-lesion occurrences. Adapting these models to cater to multiple lesions may necessitate alterations to accommodate the interplay among adjacent abnormalities. In our investigations, mitigating the toxicities in surrounding healthy tissue proved straightforward, mainly because the organs at risk (OARs) were distinctly separated from the targeted region. Predicting doses for instances where OARs are juxtaposed or even encroach upon the targets could intensify the challenge. Thirdly, our models have been calibrated for a singular prescription dose tier, leaving their adaptability to fluctuating prescription doses uncharted. Fourthly, while our models forecast a solitary dose distribution, clinical protocols might see multiple potential dose distributions for a single patient. Variabilities might stem from institutional preferences, patient-centric considerations, or LINAC specifications. Peering into the future, enhancing the models might involve training on diversified datasets encompassing multiple metastases and an array of prescriptions. Introducing sophisticated network structures could amplify dose conformality and organ conservation. Estimating uncertainties could pinpoint scenarios where model predictions might waver. A culmination of biological modeling with radiomic data might pivot predictions towards clinical outcomes, transcending mere physical dose distributions. In essence, this research paves the way for automation in planning, yet a concerted effort is essential to navigate intricate clinical landscapes.

Our study developed and evaluated an automatic brain-metastases-SRS planning pipeline using deep learning for dose prediction. While automated planning techniques have been explored for Gamma Knife radiosurgery, this work represents a novel application of deep learning to predict dose distributions specifically for stereotactic radiosurgery of brain metastases delivered by a linear accelerator platform with multi-leaf collimator beam modulation, as well as verification of the deliverable treatment plans. Both models yielded clinically acceptable predicted doses in our study. The prediction outcomes from AttUNet were more consistent with the original clinical plan compared to those from UNet. The predicted dose distribution of the two models fulfilled the clinical prerequisites in terms of target region dose, dose drop, and protection of organs at risk. The deliverable plans agreed with the predicted doses regarding the gamma passing rate and achieved clinical criteria. When applied to other treatment sites or modalities, the proposed models may require adaptation. The 3D dose projections proffered by our models stand to refine radiotherapy planning blueprints, ensuring consistent plan quality and underscoring the potential for a fully automated radiotherapy treatment planning paradigm.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

JP: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review & editing. JX: Data curation, Methodology, Software, Writing – review & editing. CR: Methodology, Validation, Writing – review & editing. QS: Methodology, Writing – review & editing. LS: Funding acquisition, Project administration, Validation, Writing – review & editing. FZ: Formal analysis, Investigation, Resources, Supervision, Writing – review & editing. HJ: Conceptualization, Data curation, Formal analysis, Software, Writing – review & editing. XL: Data curation, Funding acquisition, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This investigation was financially supported by the Natural Science Foundation of Hubei Province under grant numbers 2019CFB721; the Health and Family Planning Commission of Hubei Province under grant numbers WJ2017M027 and WJ2021M157; the Cisco hausen Cancer Research Foundation under grant number Y-HS202101-0079; and the Teaching Research Project of Wuhan University Health Science Center under grant number 2020028; the Interdisciplinary Innovative Talents Foundation from Renmin Hospital of Wuhan University under grant number JCRCWL-2022-003; the Research Foundation on Cutting-edge Cancer Supportive Care under grant number cphcf-2022-146; the Key Research and Development Project of Hubei Province’s Technical Innovation Plan under grant number 2023BCB020; the National Natural Science Foundation of China Enterprise Innovation Development Key Project under grant number U19B2004.

Acknowledgments

The authors extend their gratitude to the Suzhou LinaTech Medical Science and Technology Co., Ltd. (Suzhou, China) for their invaluable technical guidance and insightful recommendations.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2023.1285555/full#supplementary-material

References

1. Cagney DN, Martin AM, Catalano PJ, Redig AJ, Lin NU, Lee EQ, et al. Incidence and prognosis of patients with brain metastases at diagnosis of systemic Malignancy: a population-based study. Neuro Oncol (2017) 19:1511–21. doi: 10.1093/neuonc/nox077

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Saria MG, Piccioni D, Carter J, Orosco H, Turpin T, Kesari S. Current perspectives in the management of brain metastases. Clin J Oncol Nurs (2015) 19:475–9. doi: 10.1188/15.CJON.475-478

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Boire A, Brastianos PK, Garzia L, Valiente M. Brain metastases. Nat Rev Cancer (2020) 20:4–11. doi: 10.1038/s41568-019-0220-y

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Dziggel L, Gebauer N, Bartscht T, Schild SE, Rades D. Performance status and number of metastatic extra-cerebral sites predict survival after radiotherapy of brain metastases from thyroid cancer. Anticancer Res (2018) 38:2391–4. doi: 10.21873/anticanres.12488

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Yamamoto M, Serizawa T, Higuchi Y, Sato Y, Kawagishi J, Yamanaka K, et al. A Multi-institutional prospective observational study of stereotactic radiosurgery for patients with multiple brain metastases (JLGK0901 Study Update): irradiation-related complications and long-term maintenance of Mini-Mental State Examination scores. Int J Radiat Oncol Biol Phys (2017) 99:31–40. doi: 10.1016/j.ijrobp.2017.04.037

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Nieder C, Grosu AL, Gaspar LE. Stereotactic radiosurgery (SRS) for brain metastases: a systematic review. Radiat Oncol (2014) 9:155. doi: 10.1186/1748-717X-9-155

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Brown PD, Jaeckle K, Ballman KV, Farace E, Cerhan JH, Anderson SK, et al. Effect of radiosurgery alone vs radiosurgery with whole brain radiation therapy on cognitive function in patients with 1 to 3 brain metastases: a randomized clinical trial. JAMA (2016) 316:401–9. doi: 10.1001/jama.2016.9839

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Schlegel W, Bortfeld T, Grosu AL. New technologies in radiation oncology. J Nucl Med (2008) 49:683–4. doi: 10.2967/jnumed.107.048827

CrossRef Full Text | Google Scholar

9. Craft DL, Hong TS, Shih HA, Bortfeld TR. Improved planning time and plan quality through multicriteria optimization for intensity-modulated radiotherapy. Int J Radiat Oncol Biol Phys (2012) 82:e83–90. doi: 10.1016/j.ijrobp.2010.12.007

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Shiraishi S, Tan J, Olsen LA, Moore KL. Knowledge-based prediction of plan quality metrics in intracranial stereotactic radiosurgery. Med Phys (2015) 42:908–17. doi: 10.1118/1.4906183

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Ge Y, Wu QJ. Knowledge-based planning for intensity-modulated radiation therapy: a review of data-driven approaches. Med Phys (2019) 46:2760–75. doi: 10.1002/mp.13526

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Kamima T, Ueda Y, Fukunaga JI, Shimizu Y, Tamura M, Ishikawa K, et al. Multi-institutional evaluation of knowledge-based planning performance of volumetric modulated arc therapy (VMAT) for head and neck cancer. Phys Med (2019) 64:174–81. doi: 10.1016/j.ejmp.2019.07.004

PubMed Abstract | CrossRef Full Text | Google Scholar

13. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. IEEE (2016) 90:770–8. doi: 10.1109/CVPR.2016.90

CrossRef Full Text | Google Scholar

14. Chen X, Men K, Li Y, Yi J, Dai J. A feasibility study on an automated method to generate patient-specific dose distributions for radiotherapy using deep learning. Med Phys (2019) 46:56–64. doi: 10.1002/mp.13262

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Fan J, Wang J, Chen Z, Hu C, Zhang Z, Hu W. Automatic treatment planning based on three-dimensional dose distribution predicted from deep learning technique. Med Phys (2019) 46:370–81. doi: 10.1002/mp.13271

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Nguyen D, Long T, Jia X, Lu W, Gu X, Iqbal Z, et al. A feasibility study for predicting optimal radiation therapy dose distributions of prostate cancer patients from patient anatomy using deep learning. Sci Rep (2019) 9:1076. doi: 10.1038/s41598-018-37741-x

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Ronneberger O, Fischer P, Brox T. U-net: convolutional networks for biomedical image segmentation. Springer International Publishing (2015) 9351:234–41. doi: 10.1007/978-3-319-24574-4_28

CrossRef Full Text | Google Scholar

18. LeCun Y, Bengio Y. Convolutional networks for images, speech, and time series Handbook of Brain Theory and Neural Networks. Arbib MA, editor. Cambridge: MA: MIT Press (1995). p. 3361.

Google Scholar

19. Long J, Shelhamer E, Darrell T, Berkeley U. Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. IEEE Trans Pattern Anal Mach Intell (2017) 39:640–51. doi: 10.1109/TPAMI.2016.2572683

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Mahmood R, Babier A, McNiven A, Diamant A, Chan TCY. Automated treatment planning in radiation therapy using generative adversarial networks. Mach Learn Healthcare Conf (2018) 85:1–14. doi: 10.48550/arXiv.1807.06489

CrossRef Full Text | Google Scholar

21. Nguyen D, Long T, Jia X, Lu W, Gu X, Iqbal Z, et al. A feasibility study for predicting optimal radiation therapy dose distributions of prostate cancer patients from patient anatomy using deep learning. Sci Rep (2019) 9:1–10. doi: 10.1038/s41598-018-37741-x

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Barragán-Montero AM, Nguyen D, Lu W, Lin MH, Norouzi-Kandalan R, Geets X, et al. Three-dimensional dose prediction for lung IMRT patients with deep neural networks: robust learning from heterogeneous beam configurations. Med Phys (2019) 46:3679–91. doi: 10.1002/mp.13597

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Zhou J, Peng Z, Song Y, Chang Y, Pei X, Sheng L, et al. A method of using deep learning to predict three dimensional dose distributions for intensity modulated radiotherapy of rectal cancer. J Appl Clin Med Phys (2020) 21:26–37. doi: 10.1002/acm2.12849

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Li X, Zhang J, Sheng Y, Chang Y, Yin FF, Ge Y, et al. Automatic IMRT planning via static field fluence prediction (AIP-SFFP): a deep learning algorithm for real-time prostate treatment planning. Phys Med Biol (2020) 65:175014. doi: 10.1088/1361-6560/aba5eb

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Kearney V, Chan JW, Haaf S, Descovich M, Solberg TD. DoseNet: a volumetric dose prediction algorithm using 3D fully-convolutional neural networks. Phys Med Biol (2018) 63:235022. doi: 10.1088/1361-6560/aaef74

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Momin S, Lei Y, Wang T, Zhang J, Roper J, Bradley JD, et al. Learning-based dose prediction for pancreatic stereotactic body radiation therapy using dual pyramid adversarial network. Phys Med Biol (2021) 66:1–19. doi: 10.1088/1361-6560/ac0856

CrossRef Full Text | Google Scholar

27. Zhang B, Babier A, Chan TCY, Ruschin M. 3D dose prediction for Gamma Knife radiosurgery using deep learning and data modification. Physica Med (2023) 106:102533. doi: 10.1016/j.ejmp.2023.102533

CrossRef Full Text | Google Scholar

28. McIntosh C, Welch M, McNiven A, Jaffray DA, Purdie TG. Fully automated treatment planning for head and neck radiotherapy using a voxel-based dose prediction and dose mimicking method. Phys Med Biol (2017) 62:5926–44. doi: 10.1088/1361-6560/aa71f8

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Milano MT, Grimm J, Niemierko A, Soltys SG, Moiseenko V, Redmond KJ, et al. Single-and multifraction stereotactic radiosurgery dose/volume tolerances of the brain. Int J Radiat Oncol Biol Phys (2021) 110:68–86. doi: 10.1016/j.ijrobp.2020.08.013

PubMed Abstract | CrossRef Full Text | Google Scholar

30. He K, Gkioxari G, Dollar P, Girshick R. Mask R-CNN. IEEE Trans Pattern Anal Mach Intell (2020) 42:386–97. doi: 10.1109/TPAMI.2018.2844175

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Menon SV, Paramu R, Bhasi S, Nair RK. Evaluation of plan quality metrics in stereotactic radiosurgery/radiotherapy in the treatment plans of arteriovenous malformations. J Med Phys (2018) 43:214–20. doi: 10.4103/jmp.JMP_25_18

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Paddick I. A simple scoring ratio to index the conformity of radiosurgical treatment plans. Tech Note. J Neurosurg (2000) 3:219–22. doi: 10.3171/jns.2000.93

CrossRef Full Text | Google Scholar

33. Hodapp N. The ICRU Report No. 83: prescribing, recording and reporting photon-beam intensity-modulated radiation therapy (IMRT). Strahlenther Onkol (2012) 188:97–9. doi: 10.1007/s00066-011-0015-x

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Harms WB Sr, Low DA, Wong JW, Purdy JA. A software tool for the quantitative evaluation of 3D dose calculation algorithms. Med Phys (1998) 25:1830–6. doi: 10.1118/1.598363

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Low DA, Harms WB, Mutic S, Purdy JA. A technique for the quantitative evaluation of dose distributions. Med Phys (1998) 25:656–61. doi: 10.1118/1.598248

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Hussein M, Clark CH, Nisbet A. Challenges in calculation of the gamma index in radiotherapy – Towards good practice. Phys Med (2017) 36:1–11. doi: 10.1016/j.ejmp.2017.03.001

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Campbell WG, Miften M, Olsen L, Stumpf P, Schefter T, Goodman KA, et al. Neural network dose models for knowledge-based planning in pancreatic SBRT. Med Phys (2017) 44:6148–58. doi: 10.1002/mp.12621

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Liu Z, Fan J, Li M, Yan H, Hu Z, Huang P, et al. A deep learning method for prediction of three-dimensional dose distribution of helical tomotherapy. Med Phys (2019) 46:1972–83. doi: 10.1002/mp.13490

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Popple Richard A, Brown MH, Thomas EM, Willey CD, Cardan RA, Covington EL, et al. Transition from manual to automated planning and delivery of volumetric modulated arc therapy stereotactic radiosurgery: clinical, dosimetric, and quality assurance results. Pract Radiat Oncol (2021) 11:e163–e171. doi: 10.1016/J.PRRO.2020.10.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: brain metastases, stereotactic radiosurgery, dose prediction, deep learning, radiation oncology

Citation: Pan J, Xiao J, Ruan C, Song Q, Shi L, Zhuo F, Jiang H and Li X (2023) Deep-learning-driven dose prediction and verification for stereotactic radiosurgical treatment of isolated brain metastases. Front. Oncol. 13:1285555. doi: 10.3389/fonc.2023.1285555

Received: 30 August 2023; Accepted: 31 October 2023;
Published: 20 November 2023.

Edited by:

Xiaodong Wu, The University of Iowa, United States

Reviewed by:

Raees Tonse, Baptist Hospital of Miami, United States
Aaron Simon, UCI Health, United States

Copyright © 2023 Pan, Xiao, Ruan, Song, Shi, Zhuo, Jiang and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hao Jiang, amhAd2h1LmVkdS5jbg==; Xiangpan Li, cm0wMDEyMjdAd2h1LmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.