Skip to main content

ORIGINAL RESEARCH article

Front. Oncol., 14 June 2022
Sec. Radiation Oncology

Optimizing the Region for Evaluation of Global Gamma Analysis for Nasopharyngeal Cancer (NPC) Pretreatment IMRT QA by COMPASS: A Retrospective Study

  • Department of Oncology, The First Affiliated Hospital of Chongqing Medical University, Chongqing, China

Background: The global gamma passing rate is the most commonly used metric for patient-specific pretreatment quality assurance in radiation therapy. However, the optimal region for evaluation and specific action limits (ALs) need to be explored. Therefore, this study was carried out to explore the optimal region for evaluation of the global gamma passing rate and define ALs by using the COMPASS software.

Methods: A total of 93 intensity-modulated radiation therapy (IMRT) plans for nasopharyngeal cancer (NPC) patients, including 61 original plans and 32 multileaf collimator (MLC) error-introduced test plans, were selected for retrospective analysis. Firstly, the dose distribution was divided into six isodose regions (“≥10%”, “≥20%”, “≥30%”, “≥40%”, “≥50%”, and “≥60%”) based on the prescribed dose and one clinically oriented region for evaluation (“whole”) to perform the three-dimensional (3D) global gamma reanalysis. Meanwhile, the percentage gamma passing rate (%GP), mean gamma index (μGI) based on 3%/2 mm criteria, and percentage dose error (%DE) of the dose–volume histogram (DVH) metrics were recorded by COMPASS application. Secondly, the Pearson’s correlation coefficient was used to analyze the correlation between %GP and %DE and between μGI and %DE in different regions. Additionally, receiver operating characteristic (ROC) methodology was applied to quantify the fraction of patient-specific plans evaluated as “fail” and “pass”. In order to improve the correlation between gamma analysis result and clinical criteria, ROC analysis was carried out in accordance with hybridization analysis criteria (%DE ≤3%, %GP ≥90% and %DE ≤3%, μGI ≤0.6). ROC was performed for two purposes: 1) to analyze the sensitivity and specificity of %GP and μGI in different regions for evaluation and 2) to define the ALs of %GP and μGI in the optimal region for evaluation. Finally, the plans introduced with MLC errors were prepared for validation. Moreover, we also compared the positive rate of ALs of both %GP and μGI in detecting MLC error-introduced plans in different regions.

Results: 1) In our study, a number of DVH-based metrics were found to be correlated with the evaluation parameters. The corresponding number was 4, 2, 1, 1, 1, 1, and 3 in γwhole, γ10%, γ20%, γ30%, γ40%, γ50%, and γ60%, respectively, and 5, 3, 0, 1, 1, 4, and 2 in μGIwhole, μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, and μGI60%, respectively. The results by COMPASS have revealed that the %DE of specific structures has a slightly higher correlation with both %GP and μGI of the “whole” region compared with that of any other region. However, it is a moderate correlation (0.5 ≤ |r| < 0.8). 2) The areas under the curves (AUCs) of γwhole, μGIwhole, μGI40%, μGI50%, and μGI60% were >0.7 based on 3%/2 mm criteria. According to the Youden coefficient, we defined the ALs of γwhole ≥92%, μGIwhole ≤0.36, μGI40% ≤0.43, and μGI60% ≤0.40 based on 3%/2 mm criteria. 3) In the validation, for original plans, the accuracy of ALγwhole, ALγ10%, ALμGIwhole, ALμGI40%, ALμGI50%, and ALμGI60% was 23%, 9.8%, 90%, 80.3%, 9.8%, and 88.5%, respectively. For test plans with systematic MLC errors smaller than 0.8 mm, the positive rates of ALγwhole, ALγ10%, ALμGIwhole, ALμGI40%, ALμGI50%, and ALμGI60% were 25%, 58%, 92%, 92%, 42%, and 100%, respectively. For the plans with systematic MLC errors higher than 0.8 mm, the positive rates of all AL%GP and ALμGI were 100%. From the COMPASS validation results, the accuracy of γwhole, μGIwhole, μGI40%, and μGI60% was higher than that of the conventional γ10% and commonly used μGI50%.

Conclusions: Compared with the traditional evaluation region (i.e., the criteria with a threshold of 10% or a threshold of 50%, it was the same with the isodose regions of “≥10%”, “≥50%” based on the prescribed dose in our study), the “whole” region is more meaningful to the clinic by COMPASS. The accuracy of μGIwhole is higher than that of the conventional γ10% and the commonly used μGI50%.

Introduction

As an advanced radiation therapy technique, intensity-modulated radiation therapy (IMRT) allows for a complex dose distribution, with sharp dose gradients often achieved at the edge of the planning target volume (PTV). As IMRT is able to create an intentionally inhomogeneous dose distribution, it has routinely been employed as a radiotherapy technique (1). Due to the complex features of the inverse planning and the complexity in the delivery process, safety and quality assurance (QA) are important (2). As an essential and important process, patient-specific QA analyzes and evaluates the deviation between the predicted dose and the delivered/measured dose. This process can be carried out by using film dosimetry, ionization chambers, electronic portal imaging device (EPID), two-dimensional (2D) array detector, three-dimensional (3D) dosimetric system, gel dosimetry, log file, etc. (3, 4). Introduced in a seminal work by Low et al. (5), the global gamma index method is the most widely used technique for comparison of dose distribution for patient-specific pretreatment QA in radiation therapy (57). The 2D or 3D gamma analysis could be chosen based on different equipment. As a 3D volume analysis, the 3D gamma analysis can be used by COMPASS, Delta4, ArcCheck, Dolphin, OCTAVIUS 4D, etc. As a plane analysis, the 2D gamma analysis is often used by array detector, just like EPID, log file, Mapcheck, etc. Compared with the 2D gamma analysis, the 3D gamma analysis is increasingly winning high favor among medical physicists because it can provide more comprehensive QA information. However, the 3D gamma analysis could make the Region of Interest (ROI) selection more complex.

In clinical practice, the results of the patient-specific QA should reveal the percentage dose errors (%DEs) of dose–volume histogram (DVH) metrics (i.e., if the results of the patient-specific QA was acceptable, the %DE should be ≤3%), which are the important parameters used by radiation oncologists (8). However, Nelms et al. (9), Stasi et al. (10), and Fredh et al. (11) have found that the conventional global gamma analysis has limited sensitivity to dose errors of DVH metrics. Recommended by the TG-218 (3), the conventional global gamma analysis for QA measurements and corresponding treatment plan was performed in absolute dose mode, adopting global normalization (often using the maximum planned dose point) and a dose threshold of 10%. In some studies, it has been argued recently that the percentage gamma passing rate (%GP) of individual structures is more sensitive to dose errors than that in the conventional gamma method (12, 13). The %GP of individual structures provides a greater insight into the dose delivered to target volume and organs at risk (OARs). Stasi et al. (10), Heilemann et al. (12), and Yi et al. (13) have adopted the %GP of individual structures instead of conventional global gamma. Because the %GP of individual structures can not only improve the weak correlation between the %GP and %DE but also provide spatial information of dose errors that are unavailable in conventional gamma analysis. However, this method is only suitable for plans for one body site with a simple anatomy just like the IMRT plan for cervical cancer. It is difficult to be promoted in head and neck plan verification due to multiple targets and OARs that cause inefficiencies and increased clinical workload due to an abundance of DVH parameters that must now be reviewed.

In some reports (14), the current IMRT QA practice has been fundamentally questioned due to the poor sensitivity and specificity of the global gamma index implementation. Under such circumstances, Stojadinovic et al. (15) and Yu et al. (6, 16) have proposed the divide and conquer (D&C) gamma method. In this method, the dose distribution is divided into distinct regions: i) a high-dose (HD) region within the 90% isodose, ii) a high-gradient (HG) region with doses ranging from 50% to 90% isodose, iii) a medium-dose (MD) region with doses ranging from 20% to 50% isodose, and iv) a low-dose (LD) region with doses ranging from 10% to 20% isodose. It has been indicated that the conventional gamma method includes almost all dose regions (≥10% isodose level). However, from a clinical perspective, not all dose regions are equally essential. By using the D&C gamma method, more information can be provided for validation without further increasing the time-intensive nature of IMRT QA. Hence, it has challenged the adequacy of the conventional approach to patient-specific QA analysis. However, these publications have neither illustrated the correlation between the QA results and clinical DVH metrics nor verified the sensitivity and specificity of the D&C gamma method.

In addition, the %GP has been explored in most studies, but the mean gamma index (μGI) has been accounted for in only a few publications. In fact, Stock et al. (17) have proposed the initial concept of achieving the μGI action limits (ALs) for the hot area (receiving 50% or more of the prescribed dose) as early as in 2005. Recently, Visser et al. (18) have completed the 3D dose verification by combining μGI evaluation with %DE. Few publications have explored the relationship between μGI and %DE or the impact of different evaluation regions on μGI.

In this study, a measurement-guided dose calculation system was used to divide the measured dose distribution into six isodose regions and one global clinically concerned region (PTV plus all OARs). By doing so, we aimed to investigate the optimal region where the correlation between %GP and %DE and between μGI and %DE can be improved. Besides, this study was also implemented to prove a new method that does not further increase the time-intensive nature of IMRT QA.

Materials and Methods

We selected 93 IMRT plans for nasopharyngeal cancer (NPC), including 61 original plans and 32 multileaf collimator (MLC) error-introduced test plans for retrospective analysis. These plans had been performed with measurement-based patient-specific QA. With ALs based on 3%/2 mm (global normalization) criteria recommended in American Association of Physicists in Medicine (AAPM) TG-218 (3), these IMRT QA plans were reanalyzed by using the different regions for evaluation.

Definition of Regions for Evaluation

For the treatment planning, the dose distribution was computed with the anisotropic analytical algorithm (AAA). The dose calculation was carried out on a 2.5 mm isotropic dose grid in Eclipse treatment planning system (TPS) (Eclipse 13.5, VARIAN, USA) for a 6-MV photon beam on a UNIQUE linac (VARIAN, USA) equipped with a 120-leaf multileaf collimator (MLC) (millennium120 MLC, VARIAN, USA). In the dose verification, the structures were categorized into target volume and OAR as shown in Table 1. Figure 1 shows the dose distribution that was divided into six isodose regions (“10%”, “20%”, “30%”, “40%”, “50%”, and “60%”) and clinically oriented region (“whole”):

(1) The region of “10%” label refers to all the voxels with a dose value of 10% or greater. Universal ALs adopted in AAPM TG-218 report: gamma passing rate ≥90% based on 3%/2 mm and a 10% dose threshold. Therefore, the region of “10%” label corresponds to the most widely used region by the conventional global gamma method.

(2) The region of “50%” label refers to all the voxels with a dose value of 50% or greater. It is the same region with the commonly used μGI for the hot area (receiving 50% or more of the prescribed dose) in the patient CT based on Stock et al. (17).

(3) The regions of “20%”, “30%”, “40%”, and “60%” labels refer to all the voxels with a dose value of 20%, 30%, 40%, and 60% or greater, respectively.

(4) The region of “whole” label indicates all voxels of PTV plus all of the OARs (spinal cord, brain stem, parotid L/R).

TABLE 1
www.frontiersin.org

Table 1 The correlation between the %DE and the %GP in different regions for NPC patients.

FIGURE 1
www.frontiersin.org

Figure 1 Dose distribution of the IMRT plan for nasopharyngeal cancer. The dose distribution was divided into seven parts depicted in different colors: the pink line refers to the region of “10%” label; the yellow line refers to the region of “20%” label; the orange line refers to the region of “30%” label; the green line refers to the region of “40%” label; the magenta line refers to the region of “50%” label; the cyan line refers to the region of “60%” label; the blue line refers to the region of “whole” label.

Introduction of Multileaf Collimator Errors in Test Plans

An in-house Python program based on Pydicom (version 2.1.2) was used to introduce systematic MLC errors into the test plans by manipulation of DICOM RT files. As described in Figure 2, the introduced systematic MLC errors included an increase and a decrease of the distance between leaf pairs in the beam field. The magnitudes of systematic MLC errors were ±0.2, ± 0.4, ± 0.6, ± 0.8, ± 1.0, ± 2.0, ± 3.0, and ±4.0 mm, respectively. If any magnitude of MLC errors had led to a negative leaf gap in some of the leaf pairs, the gap of this leaf pair would have been set to 0.5 mm. After systematic MLC errors were introduced into the original plans, the modified RT files were reimported into TPS for dose calculation. A total of 32 plans were generated as a validation dataset.

FIGURE 2
www.frontiersin.org

Figure 2 Creation of the MLC error-introduced IMRT plan. (A) Original plan. (B) Plan introduced with positive MLC errors. (C) Plan introduced with negative MLC errors.

Quality Assurance Procedure

COMPASS (version 1.2. IBA Dosimetry, Germany) is a software that is used in conjunction with MatriXXEvolutionionization chamber-based 2D_IC array detector to provide a measurement-guided dose calculation. The COMPASS system is based on a beam model that includes the characteristics of the accelerator, a collapsed-cone convolution (CCC) algorithm. This allows pretreatment dose verification on CT images, including DVH. A strict commissioning of the COMPASS system, including the validation of accuracy for 2D_IC array (MatriXX, IBA Dosimetry) measurement, beam modeling, gantry angle sensor, and absolute dose calibration, was performed in advance according to the same standards as the clinic- used TPS. Figure 3 shows the flowchart of QA procedure that had been carried out in the study: i) Initially, all plans were transferred from the TPS to the COMPASS software, including all of the DICOM files (RT plans, RT doses, RT structures, and CT images). ii) Afterward, all plans were performed by a linear accelerator to obtain the dose fluence by COMPASS so as to get the measurement-based QA. Meanwhile, the MatriXX was hung under the collimator to obtain the dose fluence. iii) Finally, the 2D_IC array measurement doses were reconstructed on patients’ CT images using the COMPASS application that models the characteristics of the linear accelerator head.

FIGURE 3
www.frontiersin.org

Figure 3 Flowchart of the quality assurance process in the study.

Furthermore, the dose distribution was calculated by the CCC algorithm and on a 2.5 mm isotropic dose grid by using the COMPASS application. In comparing the treatment planning and measurement-based QA, we recorded %DE, %GP, and μGI in different regions for reanalysis.

i) The %DE between the TPS dose and DVH index of measurement-based QA for specific structures was calculated as follows:

%DEi=|D(i)evalD(i)refD(i)ref|×100%(1)

where Deval is the measurement-based QA dose acquired by COMPASS, Dref refers to the treatment planning dose, and i refers to the measured and calculated dose point pair.

ii) The global %GP is defined as follows:

Γi(rref,reval)=Δd2δd2+ΔD2δD2(2)
γ(reval)=min{Γ(rref,reval)}{reval}(3)

The γ function is the minimum value of generalized Γ function computed for arbitrary isodose point distance Δd and dose difference ΔD. Values of γ between 0 and 1 will indicate that the comparison is acceptable according to the dose and distance criteria. If γ(reval) ≤1, it will indicate that the comparison result of rref is acceptable. If γ(reval) >1, it will mean a failure in the comparison result of rref. The percentage of passing points in the gamma distribution is referred to as gamma pass rate (or %GP), δD/δd refers to the criteria of 3%/2 mm, and i refers to the measured and calculated dose point pair.

ΔDi=|D(i)evalD(i)refD(i)norm|×100%(4)

where Deval is the measurement-based QA dose acquired by COMPASS, Dref is the treatment planning dose, and Dnorm refers to the normalized max dose point. Normalization plays a critical role in the interpretation of dose comparison results. Specifically in global normalization, the dose difference between any measured and calculated dose point pair is normalized using the same value for all point pairs, often the maximum planned dose point. Here, i refers to the measured and calculated dose point pair.

Δdi=|r(i)evalr(i)ref|(5)

where Δd refers to the distance difference of the same dose between the measurement-based QA dose acquired by COMPASS and the treatment planning dose; i refers to the measured and calculated dose point pair.

iii) The definition of μGI is as follows:

μGIx=1nγ(reval)n(6)

μGIx refers to the mean γ (reval) for the measured and calculated dose point pair in x region, and n refers to the number of the measured and calculated dose point pairs in x region. A lower μGI is better.

Hybrid Analysis: Comprehensive Gamma-Based and Dose–Volume Histogram-Based Analysis

In conventional gamma analysis (i.e., 3%/2-mm global normalization and 10% dose threshold, hereafter referred to as γ10%), dimensionless gamma passing rate is adopted to combine the dose difference (DD) with the distance to agreement (DTA). The universal ALs of %GP ≥90% in AAPM TG-218 report was proposed.

The μGI (the gamma index for receiving 50% or more of the prescribed dose, hereafter referred to as μGI50%) was the gamma passing rate for the composite dose. It was obtained and categorized by a global 3D gamma analysis (17, 18). The μGI50% was categorized into “PASS” (acceptable for treatment), “EVAL” (to be evaluated by the medical physicist), or “FAIL” (rejected for treatment). Criteria for the μGI50% are classified into PASS (μGI50% ≤0.4, acceptable for treatment), EVAL (0.4 < μGI50% < 0.6, to be evaluated by the medical physicist), and FAIL (μGI50% ≥0.6, rejected for treatment).

DVH information was acquired for the Planning Target Volume (PTV), gross tumor volume (GTV), and four types of OARs (brain stem, spinal cord, parotid L, parotid R). The details of the DVH metrics are shown in Figure 4. According to the study by Cozzolino et al. (19) and Visser et al. (18), the DVH ALs should be set to 2%–5%. Therefore, we have chosen DVH ALs of 3%. It means that the %DE of all the structures must be ≤3%.

FIGURE 4
www.frontiersin.org

Figure 4 The %DE of DVH metrics based on 3%/2 mm criteria for NPC patients. D98, D95, D5, and D1 refers to the dose received by 98%, 95%, 5% and 1% volume of the structures including PTV, GTV, Parotid gland, Brainstem, Spinalcord, respectively. Dmean refers to the mean dose of the structures.

Based on available literature (13, 18), combinations of DVH information, %GP, and μGI were employed in this study. By doing so, we aimed to improve the correlation between gamma analysis result and clinical criteria. In particular, a measured plan can be considered to be clinically acceptable when the following two conditions are met (16, 20): i) γ10% ≥90% and the absolute value of %DE ≤3%; ii) μGI50% <0.6 and the absolute value of %DE ≤3%.

Data Analysis

Figure 5 shows the patient-specific QA reports of the 61 original IMRT plans that were categorized into four types based on the DVH_γ10% or DVH_ μGI50% ALs:

i) In DVH_γ10% evaluation chart: “true negatives” (TNs) refer to γ10% ≥90% and the absolute value of %DE ≤3%; “false negatives” (FNs) refer to γ10% ≥90% and the absolute value of %DE >3%; “true positives” (TPs) refer to γ10% <90% and the absolute value of %DE >3%; “false positives” (FPs) refer to γ10% <90% and the absolute value of %DE ≤3%

ii) In DVH_ μGI50% evaluation chart: “true negatives” (TNs) refer to μGI50% <0.6 and the absolute value of %DE ≤3%; “false negatives” (FNs) refer to μGI50% <0.6 and the absolute value of %DE >3%; “true positives” (TPs) refer to μGI50% ≥0.6 and the absolute value of %DE >3%; “false positives” (FPs) refer to μGI50% ≥0.6 and the absolute value of %DE ≤3%

FIGURE 5
www.frontiersin.org

Figure 5 The DVH_%GP and DVH_μGI evaluation charts.

When the correlation between %DE and %GP and between %DE and μGI in various regions was analyzed by the SPSS 19.0 (SPSS Inc., Chicago, IL, USA) software, we used Pearson’s correlation coefficient (r). If the p value was <0.05, it would have indicated that %DE has a correlation with both %GP and μGI. Different correlation coefficients (r) represent different degrees of correlation: If |r| ≥ 0.8, it will be considered to be highly correlated; if 0.5 ≤ |r| < 0.8, it will be considered to be moderately correlated; if |r| < 0.5, it will be considered to be weakly correlated.

In this study, the TP rate indicated the sensitivity and was defined as TP/(TP+FN), while the TN rate indicated the specificity and was given by TN/(TN+FP). The receiver operating characteristic (ROC) curves were generated based on the TN and TP rates to characterize %GP and μGI in different regions. The area under the curve (AUC) was used as the classification indicator as follows: If AUC = 1, it would have meant that the indicator can be used for classification precisely; If AUC = [0.8, 0.9], it would have indicated a very good effect; If AUC = [0.7, 0.8], it would have indicated an average effect; If AUC = [0.5, 0.7], it would have meant a low effect. Consequently, the ALs can also be defined according to the Youden coefficient (21). Youden coefficient is used to evaluate the authenticity of the screening test. Its value is equal to the sum of sensitivity and specificity of point in ROC minus 1. The higher the Youden coefficient, the better the authenticity. The value of the max Youden coefficient point in ROC is the cutoff.

Finally, in the validation, for the original plans and the MLC error-introduced test plans, the ALs were used to evaluate the accuracy of various regions.

Results

Dose–Volume Histogram Metrics Evaluation and 3D γ_Analysis

As shown in Figure 4, most of the %DE metrics were found to be within 3% criteria, except for D1 of the brain stem. As shown in Figure 6, the larger the volume of the region, the smaller the results of %GP. However, the results of μGI were consistent with volume changing of the different regions. The %GPs of the different regions (γ10%, γ20%, γ30%, γ40%, γ50%, γ60%, and γwhole) were higher than 90%, and the μGIs of these regions (μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, μGI60%, μGIwhole) were lower than 0.6.

FIGURE 6
www.frontiersin.org

Figure 6 (A) The %GP in different regions based on 3%/2 mm criteria for NPC patients. (B) The μGI in different regions based on 3%/2 mm criteria for NPC patients.

Correlation Analysis Between %DE and %GP and Between %DE and μGI in Different Regions

Ten DVH-based parameters were chosen for evaluation. Correlation analysis between the different %DEs of DVH metrics and %GP, μGI of different regions was performed one by one as shown in Table 1. It shows that a number of DVH-based metrics were found to be correlated with the evaluation parameters (i.e., the corresponding number was 4, 2, 1, 1, 1, 1, and 3 in γwhole, γ10%, γ20%, γ30%, γ40%, γ50%, and γ60%, respectively). i) Only γwhole is considered to be moderately correlated with the %DE of GTV-Dmean and GTV-D2 (0.5 ≤ |r| < 0.8, p < 0.05); ii) The γwhole, γ10%, γ20%, γ30%, γ40%, γ50%, and γ60% are considered to be weakly correlated with the other %DE (|r| < 0.5, p < 0.05).

Table 2 shows that a number of DVH-based metrics were found to be correlated with the evaluation parameters (i.e., the corresponding number was 5, 3, 0, 1, 1, 4, and 2 in μGIwhole, μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, and μGI60%, respectively). i) The μGIwhole is considered to be moderately correlated with the %DE of GTV-D2 (0.5 ≤ |r| < 0.8, p < 0.05); ii) The μGIwhole, μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, and μGI60% are considered to be weakly correlated with the other %DE (|r| < 0.5, p < 0.05).

TABLE 2
www.frontiersin.org

Table 2 The correlation between the %DE and the μGI in different regions for NPC patients.

Receiver Operating Characteristic Analysis

In accordance with the results, further analysis on sensitivity was performed by using the ROC of %GP (γwhole, γ10%, γ20%, γ30%, γ40%, γ50%, and γ60%) and μGI (μGIwhole, μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, and μGI60%). Figure 7 shows the areas under the curves (AUCs) of %GP and μGI in different regions. The AUCs of both %GP and μGI acquired for the “whole” region were higher than 0.700. On the other hand, the AUCs of μGI40%, μGI50%, and μGI60% were 0.714, 0.714, and 0.738, respectively. It would have meant that the %GP and the μGI of other areas have a very low efficiency in classification. Therefore, these parameters were chosen for further analysis.

FIGURE 7
www.frontiersin.org

Figure 7 (A) Comparison of the AUCs of %GP in different regions (γwhole, γ10%, γ20%, γ30%, γ40%, γ50%, and γ60%) and (B) μGI (μGIwhole, μGI10%, μGI20%, μGI30%, μGI40%, μGI50%, and μGI60%).

The Accuracy Analysis

According to the Youden coefficient, the ALs of the “whole”, “40%”, and “60%” regions were adopted for ALγwhole ≥92%, ALμGIwhole ≤0.36, ALμGI40% ≤0.43, and ALμGI60% ≤0.40, respectively, based on 3%/2 mm criteria.

In the validation, for the original plans and the MLC error-introduced test plans, the accuracy of the various regions was evaluated by ALs and compared with the hybrid analysis. As shown in Table 3, for original plans, the accuracy of ALγwhole, ALγ10%, ALμGIwhole, ALμGI40%, ALμGI50%, and ALμGI60% is 23%, 9.8%, 90%, 80.3%, 9.8%, and 88.5%, respectively. As shown in Figure 8, for test plans with systematic MLC errors smaller than 0.8 mm, the positive rates of ALγwhole, ALγ10%, ALμGIwhole, ALμGI40%, ALμGI50%, and ALμGI60% were 25%, 58%, 92%, 92%, 42%, and 100%, respectively; for test plans with systematic MLC errors higher than 0.8 mm, the positive rates of all the AL%GP&μGI in identifying MLC error-introduced plans were 100%.

TABLE 3
www.frontiersin.org

Table 3 The comparison of the hybrid analysis among the evaluation parameters (i.e., ALγ10%, ALγwhole, ALμGI50%, ALμGIwhole, ALμGI40%, ALμGI60%).

FIGURE 8
www.frontiersin.org

Figure 8 The positive rates of test plans introduced with different magnitudes of systematic MLC errors.

Discussion

In global gamma analysis, all points are normalized to one reference dose point where the maximum dose or isocenter dose is the most common choice (9). In general, errors are masked in the lower dose regions in the conventional global gamma because the low dose points are normalized to a high dose point (22). Furthermore, the TG-218 report has also implied that a low dose region (such as 10% of the prescription dose) can significantly affect the analysis (3). As a result, the conventional global gamma passing rate is often questioned regarding its limitations or clinical relevance (23, 24). Therefore, changing the evaluation region for the global gamma analysis (or changing the minimum threshold) is a preferred method because it can decrease the bias of the global gamma analysis.

In essence, the methodology proposed in our study computed the %GP and the μGI of reduced sample voxels of the dose distribution in different regions rather than that of all voxels. A conventional gamma analysis includes all voxels in the dose distribution (with dose values of 10% or greater). However, not all voxels have clinical significance in patient-specific QA. The targets and OARs are critical clinically relevant volume for making a final determination. The “whole” region for evaluation was explored with increasing dose level until it only contained the targets and OARs. The correlation between %GP, μGI, and %DE of various dosimetry metrics was used to assess the relevance of both %GP and μGI with clinically relevant observations.

From calculation equations 3–6, it is μGI rather than %GP that was able to more or less reflect the gamma value change of each evaluated point. The ROC analysis also demonstrated that the μGI in different regions was more sensitive than %GP. Therefore, it has implied that μGI reveals more clinically relevant problems compared with %GP.

In this study, both %GP and μGI results of seven regions for evaluation were found to have moderate correlation with the DVH metrics. This can be compared with the studies made by Zhen et al. (25) and Chan et al. (26). The μGIwhole was recommended to replace the conventional γ10% and μGI50% in the study. There are two reasons why the “whole” region is superior. One, from equations 3–6, both the spatial information and %DE were included in %GP and μGI. However, the dose difference information regarding specific structures was unavailable, which made %GP and μGI difficult to explore for assessing clinical implications (26). As the proposed “whole” region in our study only focused on the area of clinical concern, it can partially solve this problem. Moreover, Yu et al. (11) have demonstrated that different dose levels might include some other information such as the accuracy of TPS calculation in low-dose regions (10%–20% dose region), the accuracy of MLC modeling (50% dose region), and the expected information regarding the validity of the source model including beam energy, focal spot size, and collimator feature (high-dose regions). The “whole” region not only was the clinically concerned region but also provided the information in different dose levels. Although the sensitivities of μGI60% and μGIwhole were similar, the “whole” region included more dose levels. Hence, it provided greater insight into the dose delivered to specific structures. According to the study by Cozzolino et al. (19) and Visser et al. (18), the DVH ALs should be set to 2%–5%. Therefore, we chose the DVH ALs of 3%. However, in areas with a sharp dose gradient and in metrics with small volumes, relatively high dose differences such as point dose (D1) were observed. For results whose %DE exceeded DVH ALs, it was acceptable from the clinical point of view if the measured dose of the target volume was higher than the calculated dose or if the measured dose of OAR was lower than the calculated dose (27). The reason was that the results would not affect clinical efficiency. This view was proven by the %DE of the brain stem in Figure 5. The %DE of brain stem (D1) of 32 cases was >3%, but most of the dose deviations were negative. Hence, this DVH ALs were too strict for OARs near the target volume such as the brainstem within or near the PTV in the NPC plan. The DVH ALs had an influence on defining the ALs of both %GP and μGI directly. So, if values determined for μGIwhole exceed the ALs, we need to check whether they have a clinical impact.

MLC errors mainly include random MLC positional errors and systematic positional errors. Random MLC positional errors have little dosimetric effect on IMRT plans (28). Steers and Fraass (23) concluded that the shifts in different directions may be an indication of different types of systematic errors arising from TPS model issues, machine issues, or other device issues. In addition, in our study, for the systematic MLC errors smaller than 0.8 mm, the positive rate of μGIwhole was higher compared with 0.75 mm systematic MLC errors in some publications (29), which were claimed to be undetected due to the poor resolution of array detectors. The author analyzed and concluded that the reason was that it was too strict to set the DVH ALs of 3% for the NPC plan. However, there is no consensus on the tolerance of the systematic MLC errors for patient QA. Numerous publications (3032) have found that the systematic MLC errors up to 1 mm can produce a clinically relevant influence on the dose distribution. However, Rangel et al. (33) have suggested that systematic MLC errors need to be limited to 0.3 mm. Oliver et al. (34) have suggested that the systematic MLC errors should be within 0.63 mm to keep the PTV95 within 2%. In brief, μGIwhole can improve the detecting rate of small systematic MLC errors and the clinical effect of radiotherapy.

By gamma analysis, the verification results can be compressed into a single value (%GP or μGI). It can improve the efficiency of dose verification, which is usually important for clinics under a heavy workload. As proposed in this study, μGIwhole would not increase the clinical workload. Instead, it has a better correlation with DVH metrics than other commonly used evaluation metrics. Of course, these results also highlighted major limitations of our study. It was not a multicenter study and can potentially bias the analysis. In addition, these results may also have been affected by the specific choice of dosimeter, QA equipment, delivery system, and tumor sites (23). However, this study could be extended by stratifying the “whole” region by isodose level and anatomical site to provide more clinical information without increasing clinical work.

Conclusion

The empirical-type metrics, such as those in the patient-specific QA, may vary from institution to institution, since those metrics are based on local equipment, processes, and treatment types. Therefore, the conventional regions including “10%” and “50%” may not be suitable for all institutions. Different regions of the global gamma analysis have different diagnostic efficiencies. The results of μGIwhole can provide a new metric for the global gamma analysis.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Ethics Statement

This study was reviewed and approved by the Ethical and Scientific Committees of The First Affiliated Hospital of Chongqing Medical University (Chongqing, China). Written consent was obtained from the patient individual(s) for the publication of any potentially identifiable images or data included in this paper.

Author Contributions

WL, YL, and XY conceived and designed the study and contributed to the analysis and interpretation of trials and literature and development and review of the article. HC and HZ collected the data, reviewed literature, and wrote the paper. WH prepared the figures and participated in the revision of literature. All authors contributed to the article and approved the submitted version.

Funding

This work was sponsored by Natural Science Foundation of Chongqing, China (cstc2021jcyj-msxmX0138).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

NPC, Nasopharyngeal cancer; IMRT, intensity-modulated radiation therapy; MLC, multileaf collimator; %GP, gamma passing rate; μGI, mean gamma index; %DE, percentage dose error; DVH, dose–volume histogram; ROC, receiver operating characteristic; AUCs, areas under the curves; AL, action limit; PTV, planning target volume; QA, quality assurance; EPID, electronic portal imaging device; 2D, two-dimensional; 3D, three-dimensional; D&C, divide and conquer; HD, high-dose; HG, high-gradient; MD, medium-dose; LD, low-dose; OARs, organs at risk; AAA, anisotropic analytical algorithm; TPS, treatment planning system; CCC, collapsed-cone convolution; DD, dose difference; DTA, distance to agreement; TN, true negative; FN, false negative; TP, true positive; FP, false positive.

References

1. Taylor A. M E B Powell,Intensity-Modulated Radiotherapy–What is it? Cancer Imaging (2004) 4(2):68–73. doi: 10.1102/1470-7330.2004.0003

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Pan Y, Yang R, Zhang S, Li J, Dai J, Wang J, et al. National Survey of Patient Specific IMRT Quality Assurance in China. Radiat Oncol (2019) 14(1):69. doi: 10.1186/s13014-019-1273-5

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Miften M, Olch A, Mihailidis D, Moran J, Pawlicki T, Molineu A, et al. Tolerance Limits and Methodologies for IMRT Measurement-Based Verification QA: Recommendations of AAPM Task Group No. 218. Med Phys (2018) 45(4):e53–83. doi: 10.1002/mp.12810

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Ahmed S, Nelms B, Kozelka J, Zhang G, Moros E, Feygelman V. Validation of an Improved Helical Diode Array and Dose Reconstruction Software Using TG-244 Datasets and Stringent Dose Comparison Criteria. J Appl Clin Med Phys (2016) 17(6):163–78. doi: 10.1120/jacmp.v17i6.6414

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Low DA, Harms WB, Mutic S, Purdy JA. A Technique for the Quantitative Evaluation of Dose Distributions. Med Phys (1998) 25(5):656–61. doi: 10.1118/1.598248

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Yu L, Tang TLS, Cassim N, Livingstone A, Cassidy D, Kairn T, et al. Analysis of Dose Comparison Techniques for Patient-Specific Quality Assurance in Radiation Therapy. J Appl Clin Med Phys (2019) 20(11):189–98. doi: 10.1002/acm2.12726

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Low DA, Dempsey JF. Evaluation of the Gamma Dose Distribution Comparison Method. Med Phys (2003) 30(9):2455–64. doi: 10.1118/1.1598711

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Nelms BE, Simon JA. A Survey on Planar IMRT QA Analysis. J Appl Clin Med Phys (2007) 8(3):76–90. doi: 10.1120/jacmp.v8i3.2448

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Nelms BE, Zhen H, Per-beam T. Planar IMRT QA Passing Rates Do Not Predict Clinically Relevant Patient Dose Errors. Med Phys (2011) 38:1037–44. doi: 10.1118/1.3544657

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Stasi M, Bresciani S, Miranti A, Maggio A, Sapino V, Gabriele P. Pretreatment Patient-Specific IMRT Quality Assurance: A Correlation Study Between Gamma Index and Patient Clinical Dose Volume Histogram. Med Phys (2012) 39:7626–34. doi: 10.1118/1.4767763

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Fredh A, Scherman JB, Fog LS, Rosenschöld PMA. Patient QA Systems for Rotational Radiation Therapy: A Comparative Experimental Study With Intentional Errors. Med Phys (2013) 40:031716. doi: 10.1118/1.4788645

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Heilemann G, Poppe B, Laub W. On the Sensitivity of Common Gamma-Index Evaluation Methods to MLC Misalignments in Rapidarc Quality Assurance. Med Phys (2013) 40:031702. doi: 10.1118/1.4789580

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Yi J, Han C, Zheng X, Zhou Y, Deng Z, Xie C, et al. Individual Volume-Based 3D Gamma Indices for Pretreatment VMAT QA. J Appl Clin Med Phys (2017) 18:28–36. doi: 10.1002/acm2.12062

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Gordon J, Siebers J. Addressing a Gap in Current IMRT Quality Assurance. Int J Radiat Oncol Biol Phys (2013) 87(1):20–1. doi: 10.1016/j.ijrobp.2013.03.030

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Stojadinovic S, Ouyang L, Gu X, Pompoš A, Bao Q, Solberg TD. Breaking Bad IMRT QA Practice. J Appl Clin Med Phys (2015) 16(3):5242. doi: 10.1120/jacmp.v16i3.5242

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Yu L, Kairn T, Trapp JV, Crowe SB. Comparison of Global and Local Gamma Evaluation Results Using Isodose Levels. Phys Eng Sci Med (2021) 44:201–6. doi: 10.1007/s13246-020-00968-y

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Stock M, Kroupa B, Georg D. Interpretation and Evaluation of the Gamma Index and the Gamma Index Angle for the Verification of IMRT Hybrid Plans. Phys Med Biol (2005) 50(3):399–411. doi: 10.1088/0031-9155/50/3/001

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Visser R, Wauben DJL, De Groot M, Steenbakkers RJHM, Bijl HP, Godart J, et al. Evaluation of DVH-Based Treatment Plan Verification in Addition to Gamma Passing Rates for Head and Neck IMRT. Radiother Oncol (2014) 112:389–95. doi: 10.1016/j.radonc.2014.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Cozzolino M, Oliviero C, Califano G, Clemente S, Pedicini P, Caivano R, et al. Clinically Relevant Quality Assurance (QA) for Prostate RapidArc Plans: Gamma Maps and DVH_based Evaluation. Phys Med (2014) 30:462–72. doi: 10.1016/j.ejmp.2014.01.003

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Visser R, Wauben DJL, Groot M, Godart J, Langendijk JA, Veld AA, et al. Efficient and Reliable 3D Dose Quality Assurance for IMRT by Combining Independent Dose Calculations With Measurements. Med Phys (2013) 40(2):021710. doi: 10.1118/1.4774048

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Chen F, Xia J-l. Clinical Trial Statistics. Beijing: PEOPLE’S MEDICAL PUBLISHING HOUSE (2018). p. 368.

Google Scholar

22. Nelms BE, Chan MF, Jarry G, Lemire M, Lowden J, Hampton C, et al. Evaluating IMRT and VMAT Dose Accuracy: Practical Examples of Failure to Detect Systematic Errors When Applying a Commonly Used Metric and Action Levels. Med Phys (2013) 40(11):111722. doi: 10.1118/1.4826166

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Steers JM, Fraass BA. IMRT QA Selecting Gamma Criteria Based on Error Detection Sensitivity. Med Phys (2016) 43(4):1982–94. doi: 10.1118/1.4943953

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Bailey DW, Nelms BE, Attwood K, Kumaraswamy L, Podgorsak MB. Statistical Variability and Confidence Intervals for Planar Dose QA Pass Rates. Med Phys (2011) 38(11):6053–64. doi: 10.1118/1.3651695

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Zhen H, Nelms BE, Tome WA. Moving From Gamma Passing Rates to Patient DVH-Based QA Metrics in Pretreatment Dose QA. Med Phys (2011) 38(10):5477–89. doi: 10.1118/1.3633904

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Chan MF, Li J, Schupak K, Burman C. Using a Novel Dose QA Tool to Quantify the Impact of Systematic Errors Otherwise Undetected by Conventional QA Methods: Clinical Head and Neck Case Studies. Technol Cancer Res Treat (2014) 13(1):57–67. doi: 10.7785/tcrt.2012.500353

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Yi X, Lu W-L, Dang J, Huang W, Cui H-X, Wu W-C, et al. A Comprehensive and Clinical-Oriented Evaluation Criteria Based on DVH Information and Gamma Passing Rates Analysis for IMRT Plan 3D Verification. J Appl Clin Med Phys (2020) 21(8):47–55. doi: 10.1002/acm2.12910

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Wang Y, Pang X, Feng L, Wang H, Bai Y. Correlation Between Gamma Passing Rate and Complexity of IMRT Plan Due to MLC Position Errors. Phys Med (2018) 47:112–20. doi: 10.1016/j.ejmp.2018.03.003

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Woon W, Ravindran PB, Ekayanake P, Vikraman S, Lim YY, Khalid J. A Study on the Effect of Detector Resolution on Gamma Index Passing Rate for VMAT and IMRT QA. J Appl Clin Med Phys (2018) 19(2):230–48. doi: 10.1002/acm2.12285

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Wootton LS, Nyflot MJ, Chaovalitwongse WA, Ford E. Error Detection in Intensity-Modulated Radiation Therapy Quality Assurance Using Radiomic Analysis of Gamma Distributions. Int J Radiat Oncol Biol Phys (2018) 102(1):219–28. doi: 10.1016/j.ijrobp.2018.05.033

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Nithiyanantham K, Mani GK, Subramani V, Mueller L, Palaniappan KK, Kataria T. Analysis of Direct Clinical Consequences of MLC Positional Errors in Volumetric-Modulated Arc Therapy Using 3D Dosimetry System. J Appl Clin Med Phys (2015) 16(5):296–305. doi: 10.1120/jacmp.v16i5.5515

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Klein EE, Hanley J, Bayouth J, Yin F-F, Simon W, Dresser S, et al. Task Group 142 Report: Quality Assurance of Medical Accelerators. Med Phys (2009) 36(9):4197–212. doi: 10.1118/1.3190392

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Rangel A, Palte G, Dunscombe P. The Sensitivity of Patient Specific IMRT QC to Systematic MLC Leaf Bank Offset Errors. Med Phys (2010) 37(7):3862–7. doi: 10.1118/1.3453576

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Oliver M, Gagne I, Bush K, Zavgorodni S, Ansbacher W, Beckham W. Clinical Significance of Multi-Leaf Collimator Positional Errors for Volumetric Modulated Arc Therapy. Radiother Oncol (2010) 97(3):554–60. doi: 10.1016/j.radonc.2010.06.013

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: patient-specific quality assurance, region for evaluation, global gamma passing rate, percentage dose error, action limits

Citation: Lu WL, Li Y, Huang W, Cui HX, Zhang HY and Yi X (2022) Optimizing the Region for Evaluation of Global Gamma Analysis for Nasopharyngeal Cancer (NPC) Pretreatment IMRT QA by COMPASS: A Retrospective Study. Front. Oncol. 12:859415. doi: 10.3389/fonc.2022.859415

Received: 21 January 2022; Accepted: 10 May 2022;
Published: 14 June 2022.

Edited by:

Nesrin Dogan, University of Miami, United States

Reviewed by:

Gage Redler, Moffitt Cancer Center, United States
Perry Johnson, University of Florida, United States

Copyright © 2022 Lu, Li, Huang, Cui, Zhang and Yi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xin Yi, noodles@hospital.cqmu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.