Automated contouring, treatment planning, and quality assurance for VMAT craniospinal irradiation (VMAT-CSI)

Simiele, Eric; Romero, Ignacio O.; Wang, Jen-Yeu; Chen, Yizheng; Lozko, Yuliia; Severyn, Yuliia; Skinner, Lawrie; Yang, Yong; Xing, Lei; Gibbs, Iris; Hiniker, Susan M.; Kovalchuk, Nataliya

doi:10.3389/fonc.2024.1378449

ORIGINAL RESEARCH article

Front. Oncol. , 10 April 2024

Sec. Radiation Oncology

Volume 14 - 2024 | https://doi.org/10.3389/fonc.2024.1378449

Automated contouring, treatment planning, and quality assurance for VMAT craniospinal irradiation (VMAT-CSI)

Eric Simiele^†‡

Ignacio O. Romero^‡

Jen-Yeu Wang

Yizheng Chen

Yuliia Lozko

Yuliia Severyn

Iris Gibbs

Nataliya Kovalchuk^*§

Department of Radiation Oncology, Stanford University, Stanford, CA, United States

Purpose: Create a comprehensive automated solution for pediatric and adult VMAT-CSI including contouring, planning, and plan check to reduce planning time and improve plan quality.

Methods: Seventy-seven previously treated CSI patients (age, 2-67 years) were used for creation of an auto-contouring model to segment 25 organs at risk (OARs). The auto-contoured OARs were evaluated using the Dice Similarity Coefficient (DSC), 95% Hausdorff Distance (HD95), and a qualitative ranking by one physician and one physicist (scale: 1-acceptable, 2-minor edits, 3-major edits). The auto-planning script was developed using the Varian Eclipse Scripting API and tested with 20 patients previously treated with either low-dose VMAT-CSI (12 Gy) or high-dose VMAT-CSI (36 Gy + 18 Gy boost). Clinically relevant metrics, planning time, and blinded physician review were used to evaluate significance of differences between the auto and manual plans. Finally, the plan preparation for treatment and plan check processes were automated to improve efficiency and safety of VMAT-CSI.

Results: The auto-contours achieved an average DSC of 0.71 ± 0.15, HD95 of 4.81 ± 4.68, and reviewers’ ranking of 1.22 ± 0.39, indicating close to “acceptable-as-is” contours. Compared to the manual CSI plans, the auto-plans for both dose regimens achieved statistically significant reductions in body V50% and D_mean for parotids, submandibular, and thyroid glands. The variance in the dosimetric parameters decreased for the auto-plans as compared to the manual plans indicating better plan consistency. From the blinded review, the auto-plans were marked as equivalent or superior to the manual-plans 88.3% of the time. The required time for the auto-contouring and planning was consistently between 1-2 hours compared to an estimated 5-6 hours for manual contouring and planning.

Conclusions: Reductions in contouring and planning time without sacrificing plan quality were obtained using the developed auto-planning process. The auto-planning scripts and documentation will be made freely available to other institutions and clinics.

1 Introduction

Craniospinal irradiation (CSI) plays a crucial part in the multidisciplinary care of CNS tumors in children and adults (1–3). Postoperative CSI with chemotherapy is the current standard of care of medulloblastoma (1, 2, 4). Additionally, CSI in conjunction with total body irradiation (TBI) can be used as a conditioning regimen for stem cell transplantation for patients with acute lymphoblastic leukemia (ALL) with CNS involvement (5, 6).

Radiation treatments for CSI have historically involved simple 3D techniques that lead to significant short and long-term toxicities due to the unnecessary dose delivered to uninvolved organs. To reduce these effects, Chen et al. (7) introduced Volumetric Modulated Arc Therapy (VMAT), which is a modern alternative to the classic 3D treatment technique of these cancers. However, due to the extent and shape of the target volume, this approach is resource and time-intensive in terms of target and organs-at-risk (OAR) contouring, treatment planning, and treatment delivery. The European Society for Pediatric Oncology (SIOPE) radiotherapy group developed extensive guidelines for target volume delineation and provided the recommended OAR structures to consider for pediatric and young adult CSI treatments (8). Due to the large number of OARs to contour and the specifications of the target volume, CSI treatment planning can consume many hours from radiation oncologists, dosimetrists, and medical physicists. Furthermore, inter-planner variability can significantly impact the final plan quality of complex treatment techniques, such as VMAT CSI. Therefore, there is a need to standardize and automate VMAT-CSI treatment planning without sacrificing plan quality.

Previous groups have investigated the feasibility of automating the planning process for CSI (9–11). Wang et al. (10) developed an automated planning process for robust IMRT CSI in RayStation (RaySearch Laboratories, Stockholm, Sweden) and shared their script publicly. They found no significant differences between the manual and autoplans for the 10 patients in their study, but observed a decrease in planning time of approximately 50%. Zhang et al. (11) developed an automated planning technique for VMAT-CSI utilizing PlanIQ Feasibility software (Sun Nuclear, Melbourne, FL) integrated into the Pinnacle treatment planning system. They observed significant decreases in the doses to OARs compared to manual planning, but no significant changes compared to a conventional auto-planning technique without PlanIQ. They only observed a minor decrease in planning time of 12 minutes (on average) using their developed technique compared to manual planning. Hernandez et al. (9) developed an automated contouring and planning tool in RayStation for 3D conventional CSI treatments targeted at improving safety and efficiency in low and middle income countries. While these prior works have automated parts of the treatment planning process, no study has automated the entire chain of treatment planning from contouring to physics second check for a modern CSI treatment technique such as VMAT-CSI. Only Wang et al. (10) shared their developed scripts with the public via GitHub, however, many of the parameters including dose levels, structure names, and optimization structures are hard-coded parameters that follow conventions used in their clinic, which may limit widespread adoption of their technique.

This goal of this work was to automate the contouring, treatment planning, and physics plan check process for VMAT-CSI. All developed software in this work will target the Eclipse treatment planning system (Varian Medical Systems Inc., Palo Alto, CA). In addition, the results of this work, including developed software, will be shared publicly to enable institutions to adopt the developed auto-planning process into their own practice. To our knowledge, this is the first work to develop a comprehensive auto-contouring and auto-planning tool for VMAT-CSI.

2 Methods

A figure of the workflow for the developed auto-planning process is provided in the Supplementary Material (Supplementary Figure 1). Each step in the workflow is described in more detail in the following sections. The use of anonymized patient data for this study was approved by the institutional review board.

2.1 Auto-contouring

2.1.1 Training and test data

Auto-contouring was implemented for 25 OARs including brain, brainstem, optic chiasm, parotids, heart, lungs, kidneys, etc. A total of 77 previously treated CSI patients (age range, 2 - 67 years, median, 13.3 years) were used for the development of the auto-contouring model and divided into 50 training cases, 7 validation cases, and 20 test cases. For each case, the CT scan and RT structure set used for radiation treatment planning were thoroughly reviewed by two radiation oncologists and two physicists to standardize contours and structure names.

2.1.2 Auto-contouring model architecture and training details

A classic 2D U-Net architecture was selected to perform the auto-segmentation of 20 OARs, and a modified 2.5D U-Net architecture was selected to perform the auto-segmentation of the remaining 5 OARs (optic chiasm, left optic nerve, right optic nerve, left cochlea, and right cochlea). Two deep learning models were employed in this work as it was observed during training and testing that the 2D model performed well for larger structures, but poorly for small structures. Utilizing a 2.5D model significantly increased the accuracy of the segmentation of these smaller structures, but also significantly increased the time required for segmentation, with little gain in accuracy for the larger structures. Therefore, a hybrid approach consisting of two models represented an ideal balance between speed and accuracy of the auto-contouring process.

The input to both networks is a 192×352-pixel image, and the output is a 192×352-pixel binary mask for each of the 25 OARs. Both networks consist of an encoder section composed of four contraction blocks and a decoder section composed of four expansion blocks. Each contraction block consists of two 2D convolution layers followed by 2D maximum pooling and doubles the number of kernels to encode features of greater complexity. At the bottom of the networks, two standalone 2D convolution layers of kernel size 1024 connect the encoder section to the decoder section. Subsequently, each expansion block in the decoder section consists of a 2D transposed convolution operator followed by two 2D convolution layers to perform spatial up-sampling. Skip connections between symmetrical contraction and expansion blocks facilitate the concatenation of lower-level information to processed information in latter layers of the network. At the end of the decoder section, two standalone 2D convolution layers of kernel size 64 are followed by sigmoid activation, classifying each pixel as part of or outside of each of the OARs. In the 2.5D U-Net, three consecutive 2D image slices with an interval of 2, i.e., slices n-2, n, and n+2, are input into the network, and the auto-segmentation predictions of the three slices are output. The 2.5D network is applied for the whole image volume slice by slice, and the predictions for each 2D image slice are staked and averaged as the final auto-segmentation. This 2.5D method considers the continuity and variability of the organ structure, which provided auto-segmentation accuracy superior to that of the 2D method for the 5 OARs.

For training, the hyperparameters were set to 70 epochs, batch size of 8, initial learning rate of 0.001. The Adam optimizer and Dice loss function were used for the network optimization. The model was implemented, trained, and evaluated using Python 3.10 and the CUDA-enabled PyTorch 1.13 library. An Alienware Aurora R15 workstation with 13^th-generation Intel Core i9 CPU, NVIDIA GeForce RTX 4090 GPU, and 64 GB RAM was used to train the auto-contouring model. The model was deployed for production in the clinical setting on a Dell Precision workstation with Intel Xeon Silver 4110 CPU, NVIDIA Quadro P5000, and 32 GB RAM.

2.1.3 Auto-contouring evaluation metrics

To measure spatial overlap, Dice Similarity Coefficient (DSC) between predicted and ground truth labels was calculated using Equation 1 where A and B are the predicted and ground truth volumes, respectively:

\begin{array}{l} D S C = \frac{2 | A \cap^{} B |}{| A | + | B |}, & (1) \end{array}

To evaluate surface-to-surface distance, 95th percentile Hausdorff Distance (HD95) between predicted and ground truth labels was calculated using the MedPy 0.4.0 library. A qualitative ranking of auto-contours was also performed by one physician and one physicist using the clinical trials contour ranking scale: 1 - acceptable, 2 - minor edits, and 3 – major edits.

2.2 Auto-planning

2.2.1 Architecture

Multiple application programming interface (API) scripts were developed within version 15.6 of the Varian treatment planning system Eclipse Scripting API (ESAPI) to facilitate auto-planning. The framework for these scripts were derived from the results of our previously published works with VMAT TBI auto-planning (12, 13). The process was broken into two parts: preparation and optimization. By breaking the process into two distinct sections, the planner has a chance to review the generated tuning/optimization structures, the created plan(s), the isocenter and beam placement, and assigned optimization objectives prior to the optimization loop. Should a problem be discovered, the planner can easily fix the issue or rerun the preparation script rather than losing time optimizing plan(s) with a sub-optimal setup.

Once the auto-contours have been reviewed and the targets drawn and approved by the physician, the planner will launch the preparation script. The user will then select a plan template to use for the given patient, which will pre-populate all the relevant parameters for preparation. To guide the user where to go on the GUI, the tabs change color depending on the action that should be performed next: red indicates an action should be performed and green indicates the action on that tab is complete. Since generation of optimization/tuning structures and plan generation can take more than 30 seconds each (e.g., tuning structure generation can take up to 4 minutes depending on the complexity of the case and the resolution of the CT scan), asynchronous progress reporting was implemented for these operations so the user is aware of what the script is doing during these operations. Once the plan(s) have been prepared using the script, the user will save their changes, reload the patient in Eclipse, and review the output of the script.

Once the user is satisfied with the prepared plan(s), they will launch the optimization loop script, either from within Eclipse or on a thick-client workstation. The script will pick up where the preparation script left off and allow the user another chance to change the plan objectives and optimization constraints for this patient. Once the optimization loop is launched, it proceeds until all plan objectives are met or until the maximum number of user-requested iterations is reached. While the logic for the optimization loop was built on our previous code (13), several modifications were necessary to allow for sequential optimization of initial and boost CSI plans, specifically regarding plan evaluation. Since there is no available method within ESAPI v15.6 to generate a plan sum, a custom method was implemented to create a sum plan for evaluation of the plan objectives. The optimization loop script was designed to eliminate planner oversight/intervention so the user can launch the script at the end of the workday and return to plan(s) ready for review in the morning. Video demonstrations of both scripts are provided in the Supplementary Material.

The auto-planning code developed in this work has been made open-source under the MIT License via Github (https://github.com/esimiele/VMAT-TBI-CSI). In addition to the flexibility provided by the script GUIs, configuration and template files are provided with the scripts that are read at run-time. Users can simply adjust the configuration files and modify/add plan template files without having to adjust and recompile the underlying code, which allows users to adapt the scripts to their clinical environment and practice.

2.2.2 Evaluation

The developed scripts were tested on two patient cohorts previously treated at our institution: ten patients treated to 12 Gy in 6 fractions and ten patients treated to 36 Gy in 20 fractions followed by an 18 Gy boost in 10 fractions. The 12 Gy cohort represents a simple case of auto-planning a single plan only, whereas the 36 Gy cohort represents the more challenging case of auto-planning two plans simultaneously where the achieved quality of one plan influences the achieved quality of the other and the quality of the composite sum.

The quality of auto-plans was compared to their clinical (i.e., manual) plan counterparts on the basis of clinically relevant DVH metrics such as target coverage, target hotspot, low isodose spill, etc. Paired t-tests were used to evaluate the significance of any observed differences where a p ≤ 0.05 was considered statistically significant. To further evaluate the quality of the auto-plans, two physicians and one physicist were asked to review the 40 plans in a blinded retrospective manner where all identifying information was removed from the plans. The reviewers were asked for each patient: 1) if each plan was clinically acceptable, and 2) between the two sets of plans, which would they choose for treatment and why. Finally, the auto- and manual-plans were compared based on required planning time. The time for manual planning was estimated based on time stamps in the Varian Aria system whereas the time for auto-planning was determined from evaluating the log files produced from the scripts.

2.3 Auto-plan checking

2.3.1 Automated Plan Checker (APC)

The APC tool was adapted from a previously developed ESAPI script at our institution that focused on reducing treatment planning errors before they reached patient treatment (14). Liu et al. (14) utilized the Six Sigma DMAIC methodology combined with an FMEA analysis of our institution’s planning and treatment practice (14) to guide development of the APC. During testing and initial clinical implementation, they found APC to significantly reduce planning errors while simultaneously improving the efficiency of physics plan checking. This tool is still in use at our institution today and is routinely updated based on changes in workflow and planning practice. The present work built on the success of the APC and incorporated checks for identified failure modes during VMAT-CSI planning for both low-dose and high-dose CSI. The APC tool was modified and refined during the auto-planning development process and was thoroughly tested for false positives and negatives prior to clinical implementation.

2.3.2 Robustness evaluation

To ensure the auto-plans are insensitive to small changes in longitudinal positions of the isocenters, plan robustness was tested for the 12-Gy cohort by shifting the isocenters of the upper spine and lower spine fields 3 mm in the superior direction. The change in the global max dose between the nominal and shifted plan was then compared between the manual and auto-plans.

3 Results

3.1 Auto-contouring

3.1.1 Auto-contouring workflow

The auto-contouring model was deployed on the clinical workstation to read CT scans from a user-specified network location. The auto-planning script communicates with a DICOM Daemon to facilitate the delivery of planning CT scans to the network location, at which point auto-contouring would commence. The predicted mask for each organ-at-risk was transformed from a binary mask into an RT structure contour, and the output DICOM RT structure set file is identified by the auto-planning script and imported into the treatment planning system to be used for the subsequent planning procedure. The entire process takes less than 5 minutes.

3.1.2 Auto-contouring evaluation

Table 1 shows the auto-contour evaluation using Dice Similarity Coefficient (DSC), 95% Hausdorff distance (HD95), and a qualitative ranking by two physicians and one physicist (1 - acceptable, 2 - minor edits, 3 - major edits). The auto-contours achieved an average DSC of 0.71 ± 0.15, average HD95 of 4.81 ± 4.68, and average ranking of 1.22 ± 0.39 from the experts. The average ranking for all the structures was between 1 (acceptable) and 1.5. Optic chiasm was ranked with the worst score from the reviewers 1.45 ± 0.5 and the lowest DSC of 0.38 ± 0.21. On the other hand, brain received highest DSC of 0.97 ± 0.01. Reviewer 1 ranked 30 structures out of 500 (6%) as “3 - requiring major edits” with optic chiasm being most frequently ranked as “3” (4 out of 20 patients). Reviewer 2 ranked 16 structures out of 500 (3.2%) as “3 - requiring major edits” with optic chiasm (2 out of 20), pituitary (2 out of 20), cochlea R (2 out of 20) and cochlea L (2 out of 20) being most frequently ranked as “3”.

Table 1

Table 1 Auto-contouring evaluation parameters: Dice Similarity Coefficient (DSC), 95% Hausdorff Distance (HD95), and a qualitative ranking by one physician and one physicist (1 acceptable, 2 minor edits, 3 major edits).

Overall agreement rate between reviewers for all the structure ranking was 80.8%. For two patients with unusual anatomy (patient with missing anterior skull after surgery, and patient with hardware and hyper-flexed neck), the auto-segmentation performed the worst where 10 out of 25 structures required major editing based on Reviewer 1 ranking. Overall, 60% of structures had DSC ≥0.7 and reviewer ranking ≤1.4.

3.2 Auto-planning

Figure 1 shows the manual and auto plan dose distributions for a pediatric patient and an adult patient for the low-dose and high-dose regimens of VMAT-CSI. The dose cloud is thresholded to 50% of the initial plan prescription dose. As seen from the axial, coronal, and sagittal slices, the auto plan exhibits less anterior dose spillage compared to the manual plan.

Figure 1

Figure 1 Comparison of the auto-plans and manual plans for one adult and one pediatric patient for 12 Gy and 54 Gy dose regimens. The dose clouds have been thresholded to 50% of the initial plan prescription dose.

Dosimetric scatterplots of the six most statistically significant quantities between the manual and auto-plans for the low-dose and high-dose regimens of VMAT-CSI are illustrated in Figure 2. The low-dose regimen is represented by circles while the high-dose regimen is represented by triangles. A clear preference by the medical experts toward the auto-plans is seen by the color shading of the shapes.

Figure 2

Figure 2 Dosimetric scatterplots of the six most statistically significant quantities between the manual plans and auto-plans for 12 Gy (circles) and 54 Gy (triangles) dose regimens: body V50%, and Dmean for submandibular glands, parotids, thyroid, and heart and Dmax for brainstem. The Brainstem Dmax was normalized to the CSI Rx dose. The plan preference was based on the majority consensus response (from at least 2 reviewers). The black arrows in each plot indicate the direction of auto-plan superiority/inferiority relative to the line of plan equality.

The average achieved dosimetric indices for the auto- and clinical manual-plans are shown in Tables 2 and 3 for the 12 Gy cohort and 36 Gy initial with 18 Gy boost cohort, respectively. The majority of the observed differences were not statistically significant with some notable exceptions: intermediate-dose spill (body V50%), parotids D_mean, submandibulars D_mean, and thyroid D_mean were significantly lower for the auto-plans as compared to the manual plans for both cohorts. In addition, brainstem D_max (constraint of D_max<55 Gy) was significantly lower for the auto-plans compared to the manual plans for the 36 Gy cohort (Table 3).

Table 2

Table 2 Average dosimetric indices and their associated p-values for the auto- and manual plans for 12 Gy CSI.

Table 3

Table 3 Average dosimetric indices and their associated p-values for the auto- and manual plans for the CSI plan sum (36 Gy initial plan + 18 Gy boost).

Figure 3 shows the box plots of differences in achieved plan quality metrics (left) and reviewers’ ranking of the plans (right) indicating auto-plan preference 88.3% of the time.

Figure 3

Figure 3 Box plots on the left show the differences in achieved plan quality metrics between the auto-plans and manual plans for 12 Gy and 54 Gy CSI cohorts. Parameters plotted in the left part of the graph show auto-plan superiority. Red shading denotes statistically significant differences. The differences in PTV coverage and plan heterogeneity were not statistically significant. Plots on the right show the blinded plan review results by 3 experts: blue denotes auto-plan preference, gray denotes auto- and manual plan equivalency and red shows manual plan preference.

3.3 Auto-plan checking

3.3.1 Automated Plan Checker (APC)

A sample plan check report generated by the APC is available in the Supplementary Material. Information regarding the patient, course, and treatment plan sum name is displayed in the header of the report. The item checks of the VMAT-CSI plan sum are shown in the left column. The middle column reports the status of the item check. The right column provides a description for the result of the check. In addition to the routine checks e.g., prescription, dose, dose rate, energy, etc, the APC tool checks the geometry of the CSI beams, location of isocenters and verifies the entered shifts in the Aria dynamic document used by the therapists during treatment delivery.

3.3.2 Robustness evaluation

When evaluating robustness for all the plans by deliberately shifting isocenters into each other by 3 mm, the median global D_max increase was lower with auto-plans compared to manual plans (5.9% vs 9.6%, p = 0.037). The standard deviation of global D_max increase was also smaller for auto-plans compared to manual plans (4.3% vs 8.3%) indicating a more consistent change in plan heterogeneity with isocenter positioning uncertainty with the auto-plans as compared to the manual plans.

4 Discussion

Patients requiring CSI treatments can include pediatric, adolescent, and adult patient populations. Therefore, any automated planning tool would need to be applicable to a large range of patient, organ, and target sizes. The primary challenge in developing accurate deep learning auto-contouring models for pediatric patients is the greater variation in organ structure size and shape as compared to adult patients (15, 16).

Efforts have been made to develop robust auto contouring tools to alleviate the demands of pediatric treatment planning and remove inter-planner variability while improving planning time efficiency and plan accuracy. Qiu et al. (17) constructed and evaluated a Triplet-Attention U-Net (TAU-Net) auto-contouring model to contour important pediatric skeletal growth centers in the craniofacial, shoulder, and pelvic regions with the objectives of mitigating growth abnormalities induced by radiation treatment. Hernandez et al. (9) automated the end-to-end treatment planning process for 3D CSI to provide a tool for resource-constrained communities. Auto-contouring in their work was performed with a CNN model for HN organ structures and a nn-UNet model for the remaining structures required for 3D CSI. Adamson et al. (18) developed and evaluated a fully convolutional network for pediatric CT organ delineation which proved to be generalizable across CT scanner model protocols and patient age. Similar to the approach by Hernandez et al. (9), auto-contouring in this work was achieved by utilizing a combination of two models to achieve satisfactory contours: a 2.5D UNet model for contouring small HN structures (e.g., optic nerves, chiasm, etc.) and 2D UNet model for contouring the remaining OARs. This hybrid approached provided better flexibility and accuracy in generating accurate contours for very small structures (e.g., chiasm, optic nerves, etc.) and structures of varying size depending on patient size and age (e.g., lungs, kidneys, etc.).

Our group previously developed an automated planning process for VMAT-TBI (12, 13), which allowed us to leverage the framework from that study in the present work. Similar to our previous works (12, 13), multiple API scripts were developed to streamline the auto-planning process. Following the initial introduction of VMAT-CSI (7) into our clinical practice, planning for these cases consisted of using clinical protocol templates, which contained the necessary target and OAR structures, beam arrangement, optimization objectives, and plan information. While these templates are helpful, they have several limitations, the greatest of which is they are not patient-specific, which is critical for cases where the target shape and size can significantly vary. Thus, the planner would be required to modify the initial beam and plan arrangement to conform to the patient-specific anatomy. Furthermore, the template only contains the necessary OAR structures, but not any of the tuning/optimization structures, which must be generated manually by the planner. Finally, due to the large calculation volume, the memory requirements of optimization and dose calculation significantly slow down the planning process, which reduces the number of tasks a planner could work on simultaneously.

The developed preparation script can generate optimization structures, determine the required number and location of isocenters, generate the plan, place the appropriate number of beams, and assign optimization constraints in 2 minutes on average and 5 minutes maximum. Combined with the developed auto-contouring model, contouring normal tissues and preparing the patient plan for optimization can be achieved in less than 15 minutes as compared to several hours when done manually. The optimization loop script permitted autonomous optimization of the CSI plan(s) without requiring planner intervention.

The developed APC for VMAT-CSI allows the physicist performing the second check to readily review items that may be detrimental to patient safety during treatment. Of the items that are checked, verification of the dosimetric shifts between isocenters is particularly important for VMAT-CSI as without this information, the therapists will be missing important pieces of treatment delivery instructions. This may necessitate last-minute shift calculations, searching the treatment plan report documentation, etc., which can be an error-prone process when under time pressure. In the example provided in the Supplementary Material, the APC recognized that the dosimetric shifts document was not found and flagged it for review by the second checking physicist.

Significant improvements in multiple clinically relevant metrics were observed for both sets of patient cohorts considered in this study (12 Gy only and 36 Gy initial with 18 Gy boost) for the auto-plans compared to the manual plans (Tables 2, 3). Due to these improvements, the auto-plans were preferred by majority of medical experts for 18 out of the 20 patients selected for this study (90%) as highlighted in Figure 3. In the two remaining cases, the majority of reviewers assigned them as equivalent (n=1) and manual plan preferred (n=1). For patient 2 in the 12 Gy cohort all reviewers preferred the manual plan as the dose to the parotids, globes, and lenses were slightly lower with the manual plan compared with the auto-plan. However, the auto-plan dosimetric indices for these structures were still below the acceptable threshold. All 20 plans generated by the auto-planning scripts were deemed clinically acceptable by all 3 reviewers. Of particular concern with CSI treatments is intermediate-dose spill, which could result in secondary malignancies later in life due to the large target volume. To ensure tight conformity of the prescription dose to the target, the developed auto-planning scripts prioritize body V50% using a ring structure of 2 cm thickness and 1.5 cm margin from the target structure. Compared to the manual plans, the auto-plans achieved significant reductions in body V50% without sacrificing target coverage or heterogeneity.

While substantial advancements were made toward automating VMAT CSI treatment planning in this work, there were also several limitations to the present study. First, the size of both the low-dose (Rx dose = 12 Gy) and high-dose (36 Gy initial with 18 Gy boost) patient cohorts was relatively small at 10 patients per dose regimen. Second, the number of expert reviewers was relatively limited at three, which may skew the blinded plan review results. While the sample size of each CSI dose cohort was relatively limited for this work, feedback mechanisms from dosimetry have been implemented into our clinical practice to improve the accuracy and robustness of the automated planning scripts as more patients are treated with this technique. Third, the developed auto-planning scripts were built using version 15.6 of Eclipse and all API functions and calls were targeted at this version of ESAPI. There is no guarantee the scripts will work for any other version of Eclipse, which may limit widespread adoption of the scripts at other institutions. Future work includes expanding the number of patients for each dosing cohort to better understand the dosimetric differences between the manual and auto-plans. However, the described process and autoplanning software has been clinically implemented in the authors institution given the positive initial results from this work regarding efficiency and standardization improvements in VMAT-CSI treatment planning without comprosing plan quality.

5 Conclusions

Significant reductions in contouring and planning time were achieved without sacrificing plan quality using the developed auto-planning process. The required time for contouring and planning decreased from 5–6 hours (manual planning) to consistently between 1–2 hours (auto-planning). Furthermore, statistically significant improvements were achieved in OAR sparing using the developed scripts. The auto-plans were marked as equivalent or superior to the manual plans 88.3% of the time from the blinded review by the three experts. The scripts and documentation are publicly available on GitHub (https://github.com/esimiele/VMAT-TBI-CSI) so other institutions can adopt the developed auto-planning process into their own practice.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

Ethical approval was not required for the study involving humans in accordance with the local legislation and institutional requirements. Written informed consent to participate in this study was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and the institutional requirements.

Author contributions

ES: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. IR: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. J-YW: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing. YC: Writing – review & editing, Data curation, Methodology, Software. YL: Writing – review & editing, Data curation, Methodology. YS: Writing – review & editing, Data curation, Methodology. LS: Writing – review & editing, Methodology, Resources. YY: Methodology, Writing – review & editing. LX: Writing – review & editing, Methodology, Resources, Supervision. IG: Data curation, Investigation, Writing – review & editing. SH: Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing, Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation. NK: Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. Stanford Cancer Center Clinical Innovation Fund Grant (EHBBD).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1378449/full#supplementary-material

References

1. Gajjar A, Chintagumpala M, Ashley D, Kellie S, Kun L, Merchant T, et al. Risk-adapted craniospinal radiotherapy followed by high-dose chemotherapy and stem-cell rescue in children with newly diagnosed medulloblastoma (St. Jude Medulloblastoma-96): long-term results from a prospective. Multicentre Trial Lancet Oncol. (2006) 7:813–20. doi: 10.1016/S1470-2045(06)70867-1

CrossRef Full Text | Google Scholar

2. Packer R, Gajjar A, Vezina G, Rorke-Adams L, Burger P, Robertson P, et al. Phase III study of craniospinal radiation therapy followed by adjuvant chemotherapy for newly diagnosed average-risk medulloblastoma. J Clin Oncol. (2006) 24:4202–8. doi: 10.1200/JCO.2006.06.4980

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Pizer BL, Weston C, Robinson K, Ellison D, Ironside J, Saran F, et al. Analysis of patients with supratentorial primitive neuro-ectodermal tumours entered into the SIOP/UKCCSG PNET 3 study. Eur J Cancer. (2006) 42:1120–8. doi: 10.1016/j.ejca.2006.01.039

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Lannering B, Rutkowski S, Doz F, Pizer B, Gustafsson G, Navajas A, et al. Hyperfractionated versus conventional radiotherapy followed by chemotherapy in standard-risk medulloblastoma: results from the randomized multicenter HIT-SIOP PNET 4 trial. J Clin Oncol. (2012) 30:3187–93. doi: 10.1200/JCO.2011.39.8719

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Hiniker SM, Agarwal R, Modlin LA, Gray CC, Harris JP, Million L, et al. Survival and neurocognitive outcomes after cranial or craniospinal irradiation plus total-body irradiation before stem cell transplantation in pediatric leukemia patients with central nervous system involvement. Int J Radiat Oncol Biol Phys. (2014) 89:67–74. doi: 10.1016/j.ijrobp.2014.01.056

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Mayadev JS, Douglas JG, Storer BE, Appelbaum FR, Storb R. Impact of cranial irradiation added to intrathecal conditioning in hematopoietic cell transplantation in adult acute myeloid leukemia with central nervous system involvement. Int J Radiat Oncol Biol Phys. (2011) 80:193–8. doi: 10.1016/j.ijrobp.2010.01.062

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Chen J, Chen C, Atwood TF, Gibbs IC, Soltys SG, Fasola C, et al. Volumetric modulated arc therapy planning method for supine craniospinal irradiation. J Radiat Oncol. (2012) 1:291–7. doi: 10.1007/s13566-012-0028-9

CrossRef Full Text | Google Scholar

8. Ajithkumar T, Horan G, Padovani L, Thorp N, Timmermann B, Alapetite C, et al. SIOPE–Brain tumor group consensus guideline on craniospinal target volume delineation for high-precision radiotherapy. Radiother Oncol. (2018) 128:192–7. doi: 10.1016/j.radonc.2018.04.016

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Hernandez S, Nguyen C, Parkes J, Burger H, Rhee Joo Netherton D T, et al. Automating the treatment planning process for 3D-conformal pediatric craniospinal irradiation therapy. Pediatr Blood Cancer. (2023) 70:e30164. doi: 10.1002/pbc.30164

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Wang X, Li C, Li G, Zhao J, Bai S. A script-based automatic intensity modulated radiation therapy planning method with robust optimization for craniospinal irradiation. Pract Radiat Oncol. (2023) 13:e209–15. doi: 10.1016/j.prro.2022.08.015

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Zhang Y, Huang Y, Lin J, Ding S, Gong X, Liu Q, et al. Multi-isocenter VMAT craniospinal irradiation using feasibility dose-volume histogram-guided auto-planning technique. J Radiat Res. (2023) 64:612–21. doi: 10.1093/jrr/rrad026

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Kovalchuk N, Simiele E, Skinner L, Yang Y, Howell N, Lewis J, et al. The stanford volumetric modulated arc therapy total body irradiation technique. Pract Radiat Oncol. (2022) 12:245–58. doi: 10.1016/j.prro.2021.12.007

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Simiele E, Skinner L, Yang Y, Blomain E, Hoppe R, Hiniker S, et al. A step toward making VMAT TBI more prevalent: automating the treatment planning process. Pract Radiat Oncol. (2021) 11:415–23. doi: 10.1016/j.prro.2021.02.010

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Liu S, Bush K, Bertini J, Fu Y, Lewis J, Pham D, et al. Optimizing efficiency and safety in external beam radiotherapy using automated plan check (APC) tool and six sigma methodology. J Appl Clin Med Phys. (2019) 20:56–64. doi: 10.1002/acm2.12678

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Riggs BL, Melton LJ III, Robb RA, Camp JJ, Atkinson EJ, Peterson JM, et al. Population-based study of age and sex differences in bone volumetric density, size, geometry, and structure at different skeletal sites. J Bone Mineral Res. (2004) 19:1945–54. doi: 10.1359/jbmr.040916

CrossRef Full Text | Google Scholar

16. Chang HP, Kim SJ, Wu D, Shah K, Shah DK. Age-related changes in pediatric physiology: quantitative analysis of organ weights and blood flows: age-related changes in pediatric physiology. AAPS J. (2021) 23:1–15. doi: 10.1208/s12248-021-00581-1

CrossRef Full Text | Google Scholar

17. Qiu W, Zhang W, Ma X, Kong Y, Shi P, Fu M, et al. Auto-segmentation of important centers of growth in the pediatric skeleton to consider during radiation therapy based on deep learning. Med Phys. (2023) 50:284–296. doi: 10.1002/mp.15919

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Adamson PM, Bhattbhatt V, Principi S, Beriwal S, Strain LS, Offe M, et al. Evaluation of a V-Net autosegmentation algorithm for pediatric CT scans: Performance, generalizability, and application to patient-specific CT dosimetry. Med Phys. (2022) 49:2342–54. doi: 10.1002/mp.15521

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: auto-planning, craniospinal irradiation (CSI), VMAT-CSI, auto-contouring, Eclipse Scripting API (ESAPI), deep learning

Citation: Simiele E, Romero IO, Wang J-Y, Chen Y, Lozko Y, Severyn Y, Skinner L, Yang Y, Xing L, Gibbs I, Hiniker SM and Kovalchuk N (2024) Automated contouring, treatment planning, and quality assurance for VMAT craniospinal irradiation (VMAT-CSI). Front. Oncol. 14:1378449. doi: 10.3389/fonc.2024.1378449

Received: 29 January 2024; Accepted: 18 March 2024;
Published: 10 April 2024.

Edited by:

Guang Li, Memorial Sloan Kettering Cancer Center, United States

Reviewed by:

Enrica Seravalli, University Medical Center Utrecht, Netherlands
Ruijie Yang, Peking University Third Hospital, China
Jeff Buchsbaum, National Institutes of Health (NIH), United States

Copyright © 2024 Simiele, Romero, Wang, Chen, Lozko, Severyn, Skinner, Yang, Xing, Gibbs, Hiniker and Kovalchuk. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Nataliya Kovalchuk, bmF0a292YWxAc3RhbmZvcmQuZWR1

^†Present address: Eric Simiele, Department of Radiation Oncology, The University of Alabama at Birmingham, Birmingham, AL, United States

^‡These authors have contributed equally to this work and share first authorship

^§These authors have contributed equally to this work and share senior authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Automated contouring, treatment planning, and quality assurance for VMAT craniospinal irradiation (VMAT-CSI)

1 Introduction

2 Methods

2.1 Auto-contouring

2.1.1 Training and test data

2.1.2 Auto-contouring model architecture and training details

2.1.3 Auto-contouring evaluation metrics

2.2 Auto-planning

2.2.1 Architecture

2.2.2 Evaluation

2.3 Auto-plan checking

2.3.1 Automated Plan Checker (APC)

2.3.2 Robustness evaluation

3 Results

3.1 Auto-contouring

3.1.1 Auto-contouring workflow

3.1.2 Auto-contouring evaluation

3.2 Auto-planning

3.3 Auto-plan checking

3.3.1 Automated Plan Checker (APC)

3.3.2 Robustness evaluation

4 Discussion

5 Conclusions

Data availability statement

Ethics statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good