Mutual Information-Based Non-Local Total Variation Denoiser for Low-Dose Cone-Beam Computed Tomography

Lee, Ho; Sung, Jiwon; Choi, Yeonho; Kim, Jun Won; Lee, Ik Jae

doi:10.3389/fonc.2021.751057

ORIGINAL RESEARCH article

Front. Oncol. , 21 October 2021

Sec. Radiation Oncology

Volume 11 - 2021 | https://doi.org/10.3389/fonc.2021.751057

This article is part of the Research Topic Dual-energy/spectral CT in Radiation Therapy. View all 6 articles

Mutual Information-Based Non-Local Total Variation Denoiser for Low-Dose Cone-Beam Computed Tomography

Jiwon Sung

Department of Radiation Oncology, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul, South Korea

Conventional non-local total variation (NLTV) approaches use the weight of a non-local means (NLM) filter, which degrades performance in low-dose cone-beam computed tomography (CBCT) images generated with a low milliampere-seconds (mAs) parameter value because a local patch used to determine the pixel weights comprises noisy-damaged pixels that reduce the similarity between corresponding patches. In this paper, we propose a novel type of NLTV based on a combination of mutual information (MI): MI-NLTV. It is based on a statistical measure for a similarity calculation between the corresponding bins of non-local patches vs. a reference patch. The weight is determined in terms of a statistical measure comprising the MI value between corresponding non-local patches and the reference-patch entropy. The MI-NLTV denoising process is applied to CBCT images generated by the analytical reconstruction algorithm using a ray-driven backprojector (RDB). The MI-NLTV objective function is minimized based on the steepest gradient descent optimization to augment the difference between a real structure and noise, cleaning noisy pixels without significant loss of the fine structure and details that remain in the reconstructed images. The proposed method was evaluated using patient data and actual phantom measurement data acquired with lower mAs. The results show that integrating the RDB further enhances the MI-NLTV denoising-based analytical reconstruction algorithm to achieve a higher CBCT image quality when compared with those generated by NLTV denoising-based approach, with an average of 15.97% higher contrast-to-noise ratio, 2.67% lower root mean square error, 0.12% lower spatial non-uniformity, 1.14% higher correlation, and an average of 18.11% higher detectability index. These quantitative results indicate that the incorporation of MI makes the NLTV more stable and robust than the conventional NLM filter for low-dose CBCT imaging. In addition, achieving clinically acceptable CBCT image quality despite low-mAs projection acquisition can reduce the burden on common online CBCT imaging, improving patient safety throughout the course of radiotherapy.

Introduction

Cone-beam computed tomography (CBCT) is used as an image guidance system in many radiotherapy institutions (1). It provides a transformation through registration with a reference image that can be used to adjust the patient’s position to align appropriately with the radiation isocenter, ensuring that the planned dose is accurately delivered to the patient during treatment (2–5). Although there is currently no clinical application of dual-energy CBCT in radiotherapy, with the increase in the use of CBCT imaging devices, the CBCT imaging guidance can potentially be extended to dual-energy capabilities to improve tumor visualization or localization in patients. The only method that is readily adaptable to existing commercial scanners involves two sequential CBCT scans at different tube voltage settings for a high-energy kilovoltage peak and a low-energy kilovoltage peak. Thus, tissues and other materials can be distinguished by exploiting the nature of X-ray attenuation changes in different photon spectra. However, the material decomposition process is highly sensitive to noise fluctuations owing to the overlap of x-ray spectra at low and high energies. Even this approach comes at the expense of doubling radiation dose. Accordingly, the need for patient exposure management is emerging. Particularly, considering the high radiation sensitivity of pediatric cancer patients, early detection of cancer, and increased life expectancy of cancer patients, it is necessary to appropriately manage the imaging dose accompanying image-guided radiation therapy (IGRT), according to the principle of “as low as reasonably achievable” (6, 7). The report of the American Association of Physicists in Medicine (AAPM) Task Group 75 also emphasizes the need for an imaging dose management in the field of radiation oncology (8). Of note, the CBCT image dose management has a correlation with the image quality. In general, it is correlated with lower dose values when the milliampere-second (mAs) parameter related to the tube current and exposure time per projection is adjusted toward a lower value (9, 10), resulting in excessive noise in the reconstructed CBCT images. If a clinically acceptable CBCT image quality can be obtained with a lower mAs of the projection data, the risk of radiation exposure to imaging doses is minimized (11). Therefore, it is necessary to develop an algorithm that can improve the CBCT image quality while appropriately managing the indication-oriented patient dose.

CBCT reconstruction generates a volumetric image from the projection data acquired at various angular positions during a single gantry rotation. The reconstruction problem is formulated as a set of mathematical relations considering the scanning geometry between the volume and projections and is solved through various assumptions and strategies. These solutions generally fall into the three broad categories of analytical, iterative, and deep learning approaches. Analytical methods calculate the volume directly from the voxel-driven backward projection transform, describing pixels of projections corresponding to each value of the voxels. Certain types of analytical methods are based on a filtered backprojection (FBP) algorithm involving two main steps, first performing filtering on each set of projection data and then backprojecting the filtered projection data (12–14). CBCT image quality is biased via an advanced denoising technique (10, 15–17) because the performance relies on the filtering operation. The total variation (TV) tended to keep the edge information too smooth by uniformly penalizing the local image gradient (18, 19). Anisotropic TV was introduced to reduce the blurriness at edge regions by deriving an adaptively weighted local image gradient (17). However, these local edge detection operators have limitations in reliably separating structures with low contrast to noise. NLTV was developed to allow for more global searches and non-uniform weights, depending on non-local means (NLM) filter, which accounts for the difference in intensity between pixel pairs (10, 20, 21). This NLM filter can reduce the similarity between corresponding patches when the patches used to determine the pixel weights contain noise-damaged pixels. Conversely, iterative reconstruction methods continually refine the volume to find an optimal solution by repetitively performing forward and backward projection operations. Accurate reconstructions have been realized by integrating physical, statistical, and/or heuristic modeling of a CT system (22–26). Its efficacy has been improved by incorporating a compressed sensing theory (19, 20, 27). A disadvantage of iterative techniques is that the noise properties are non-linear and unpredictable across the field of view (FOV) as compared to linear noise in analytical methods. Several deep learning-based reconstructions have recently been introduced as supervised training using paired images and unsupervised training using unpaired images (28–34). Achieving satisfactory results requires sufficient training data. The lack of stability and injectivity of deep learning approaches to image reconstruction poses potential problems that require further investigation (35).

In a previous study, we implemented a conventional non-local total variation (NLTV) based on NLM filter that focused mainly on optimizing the performance of the denoiser on the projection domain (10). However, in general, because the number and resolution of the projections are larger than those of the reconstructed images, the calculation time is longer than when the denoiser is applied in the image domain. Therefore, in this study, we propose a mutual information-based NLTV (MI-NLTV), which is a new type of NLTV applied to the image domain. To our knowledge, this study, integrating MI into NLTV, is the first to augment the image quality of low-dose CBCT generated with a low mAs parameter value. In particular, this approach is effective when used in conjunction with an analytical reconstruction algorithm using a ray-driven backprojector (RDB) (14). The RDB tends to be susceptible to aliasing artifacts but is potentially more accurate because it maintains the exact geometric path of the rays forming the projection data. The algorithm consists of two main steps. The FBP reconstruction algorithm based on RDB is first performed by calculating the length of the intersection between the ray paths and each voxel to be reconstructed. A post-refinement process is subsequently performed using the MI-NLTV denoiser by minimizing the weighted total variation objective function based on the steepest gradient descent optimization with an adaptive step size. This minimization process removes noisy pixels remaining in the reconstructed images and cleans the edges without losing a large amount of fine structure and details. The effectiveness of the proposed method is demonstrated using patient data and actual measurement data based on the Catphan^®503 and anthropomorphic head-and-neck phantoms.

Methods

Ray-Driven Backprojector Based on X-Ray Transmission Length Calculation

A circular pre-weighting factor on each projection was applied to circumvent the intensity drop caused by the cone angle effect (17). A one-dimensional (1D) ramp filter using the product of the Shepp–Logan filter and raised cosine window function was thereafter applied to repress the high frequency in the frequency domain (10). The ramp filter was multiplied by the Fourier-transformed values from each line parallel to the horizontal direction of the pre-weighted projection via 1D Fourier transformation. The filtered projection was obtained by applying an inverse 1D Fourier transformation.

In the proposed approach, the backprojection step is performed after this filtering step to achieve a volumetric image from the filtered projection data. In contrast to the use of a pixel-driven backprojector (PDB) that determines the corresponding location on the detector from the voxels to be reconstructed, in this work, we developed an RDB based on ray tracing (36). This RDB is potentially more accurate than a PDB because it retains the exact geometry of the rays that form the projection data (14). It computes the length of the intersection between the ray paths and each voxel and is expressed as follows:

\begin{array}{l} μ_{j} = \frac{Σ_{k} l^{j k} P_{k}}{Σ_{k} l^{j k}}, & (1) \end{array}

where μ _j is the attenuation coefficient to be reconstructed on the j th indexed voxel (V_j) in the volume, P_k is the value of the k-th indexed pixel in the filtered projection data, and l^jk is a voxel-dependent weighting factor that is the length of the intersection between V_j and the ray originating from P_k (l^jk = 0, if there is no intersection). This ray-tracing process is applied to all the pixels in the projection data. First, all the μ_j values are set to zero. To determine the voxels that affect P_k, a backprojection ray (α^k) that originates from the center position of P_k and travels back to the focal point is considered. The intersections between α^k and the voxels in the volume are subsequently calculated using the Siddon method. At each voxel V_j where α^k passes, the current value of μ_j is increased by l^jkP_k. The volume is finally normalized by dividing each μ_j by $Σ_{k} l^{j k} .$

Mutual Information-Based Non-Local Total Variation Denoiser

A weighted total variation denoising process is applied on the reconstructed images to enhance the intensity difference between the striking features and unsolicited noise by determining the similarities between non-local patches (10, 20, 37, 38). When ∅_i,j is the area affected by the center point of each patch, the state of the stationary patch is determined by calculating the statistical measurements with the moving patches included in the search set. The MI (39, 40) is used as a metric of image matching that does not require a linear relationship between the corresponding intensities. If the stationary patch and the moving patch do not match, then the MI has a low value. A stationary patch with a pattern comprising noise-corrupted pixels without noticeable features is considered a high-entropy region. Therefore, it is likely to be in the smoothing state. Conversely, the area to be placed in the preserving state may be a patch with low entropy and a high MI value. Moreover, the MI value is less than or equal to the amount of marginal entropy for each patch. Considering these characteristics, a statistical measure M(∅_i,j) is defined as follows:

\begin{array}{l} M (\emptyset_{i, j}) = \frac{M I (I_{A}, I_{B}^{Ω})}{H (I_{A})} & (2) \end{array}

Here, I_A indicates a region in the stationary patch and the search set for the non-local patch, I_B is defined as Ω. $I_{B}^{Ω}$ implies all the non-local patches included in the search set corresponding to the stationary patch.

\begin{array}{l} M I (I_{A}, I_{B}) = H (I_{A}) + H (I_{B}^{Ω}) - H (I_{A}, I_{B}^{Ω}), & (3) \end{array}

where H(I_A)and $H (I_{B}^{Ω})$ represent the marginal entropy of the stationary patch and moving patch, respectively. These entropies are defined as follows:

\begin{array}{l} H (I_{A}) = - Σ_{i \in I_{A}} p_{i} \log_{2} p_{i}, & (4) \end{array}

\begin{array}{l} H (I_{B}^{Ω}) = - Σ_{i \in I_{B}^{Ω}} p_{i} \log_{2} p_{i}, & (5) \end{array}

\begin{array}{l} H (I_{A}, I_{B}^{Ω}) = - Σ_{i \in I_{A}} Σ_{j \in I_{B}^{Ω}} p_{i, j} \log_{2} p_{i, j} . & (6) \end{array}

Here, Pi and P_i,j are the marginal probability distribution and the joint probability distribution of I_A and $I_{B}^{Ω}$ , respectively. For the probability distributions required to calculate these entropies, a joint histogram is used to consider the relationship between the intensities of the corresponding pixels in two stationary and moving patches. The joint histogram is developed by scaling the intensity value of each patch according to the binning size that is adjusted such that the maximum value of each axis of the joint histogram equals the specified value.

The penalty with different weights based on M(∅_i,j) is expressed as follows:

\begin{array}{l} w_{j} = \exp (- {(V_{j} / τ)}^{ρ} M (\emptyset_{i, j})), & (7) \end{array}

where j identifies the index of the voxel element in the image, and V_J is the jth voxel element. w_j denotes the weights between the voxels in Ω and the current voxel j. A patch size is defined to be (2a + 1) × (2a + 1) with unit variance and a was set to 2, such that the patch size is 5 × 5 in this study. The non-local search extent is 21 × 21 with unit variance. (V_j/τ)^ρ is the spatially encoded factor that reduces the weighted averaging effect at high-intensity localities to maintain the contrast (37). The normalization factor τ ensures that the V_j/τ ratio exceeds 1. It was set to 90% of the cumulative distribution function histogram that accumulates the intensity at each voxel of the reconstructed image. The scaling factor γ ensures smaller weights for higher intensities (in this study, ρ = 10). Figure 1 depicts the MI-NLTV weighting map calculated using Equation (7) on the CBCT image. The dark regions indicate the preservation voxels, and the light regions represent voxels affected by smoothing.

FIGURE 1

Figure 1 Weight map of the MI-NLTV denoiser that composes the MI-based statistical measure and the spatially encoded factor. The first row shows the original image generated by applying the analytical reconstruction algorithm based on RDB. The ROI in the left image identifies the location of the background for CNR calculation, and the ROI in the middle is for SNU calculation. The second row shows the corresponding weight map. These weight maps are displayed at the window (width and level) settings of (0.2, 0.9).

Minimizing the weighted TV objective function in Equation (8) indicates that the edges with high contrast relative to the non-local patches are preserved and that the noisy voxels with low contrast are smoothed.

\begin{array}{l} R (V) = Σ_{j} R (V_{j}) = Σ_{j} w_{j} D (V_{j}), & (8) \end{array}

\begin{array}{l} D (V_{j}) = \sqrt{{(V_{(x, y)} - V_{(x - 1, y)})}^{2} + {(V (x, y) - V_{(x, y - 1)})}^{2}}, & (9) \end{array}

where V_(x,y) is the voxel element at the 2D position (x, y). The weighted TV objective function is minimized based on the steepest gradient descent method with an adaptive step size that is expressed as follows:

\begin{array}{l} V_{j}^{t + 1} = V_{j}^{t} - λ \nabla R (V_{j}) / | \nabla R (V) |, & (10) \end{array}

\begin{array}{l} λ = γ \sqrt{Σ_{j} {(V_{j}^{t})}^{2}}, & (11) \end{array}

where λ is an adaptive parameter that reduces the smoothing degree as the iteration step progresses (10). By using the square root of all voxel elements updated in each step, λ is enforced to gradually smaller values with an increase in the number of iterations. To escape local minimization due to sudden changes, a scaling parameter γ is used and set to an initial value of 1.0. If R(V) in the current iteration step is larger than that in the preceding step, this value is linearly decreased by multiplying a constant value (r_red = 0.8. ∇R(Vj) is the gradient of the objective function R(V) at the jth indexed pixel. The root sum square of the gradient calculated at all the pixels, |∇R(V)|, is required for the normalized gradient calculation. The expression below is provided for clarity.

\begin{array}{l} \nabla R (V_{j}) = \frac{\partial R (V)}{\partial V_{j}} = \frac{\partial R (V)}{\partial V_{(x, y)}} = (\begin{matrix} w_{(x, y)} \frac{2 V_{(x, y)} - V_{(u - 1, v)} - V_{(u, v - 1)}}{\sqrt{{(V_{(x, y)} - V_{(x - 1, y)})}^{2} + {(V_{(x, y)} - V_{(x, y - 1)})}^{2}}} \\ + w_{(x + 1, y)} \frac{V_{(x, y)} - V_{(x + 1, y)}}{\sqrt{{(V_{(x + 1, y)} - V_{(x, y)})}^{2} + {(V_{(x + 1, y)} - V_{(x + 1, y - 1)})}^{2}}} \\ + w_{(x, y + 1)} \frac{V_{(x, y)} - V_{(x, y + 1)}}{\sqrt{{(V_{(x, y + 1)} - V_{(x - 1, y + 1)})}^{2} + {(V_{(x, y + 1)} - V_{(x, y)})}^{2}}} \end{matrix}), & (12) \end{array}

\begin{array}{l} | \nabla R (V) | = \sqrt{Σ_{j} {(\nabla R (V_{j}))}^{2} .} & (13) \end{array}

The number of iterations is fine-tuned for the gradient descent optimizer. In this study, it was set to 20, which was the optimal number of iterations. The pseudo-code of the MI-NLTV denoiser considers all the abovementioned components and is presented in Appendix A.

Experimental Studies

The Catphan^®503 phantom (The Phantom Laboratory Inc., Salem, NY) and anthropomorphic phantom CIRS ATOM 701 (Computerized Imaging Reference Systems Inc., Norfolk, VA, USA) were used for CBCT acquisition. The acquisitions were performed on an Infinity™ LINAC system with XVI R5.0 (Elekta Limited, Stockholm, Sweden) in its service mode. The Catphan^®503 phantom includes different modules for assessing the quality of 3D images. CBCT projection acquisitions were performed with full gantry rotation in a small FOV protocol using the S20 collimator and F0 filter cassettes into the kV source arm. The F0 filter cassette is a blank filter and has no effect on the X-ray beam. The S20 gives a nominal irradiation field width of 27.6 cm at the isocenter. The total number of scan projections was 670 for the Catphan^®503 phantom and 655 for the anthropomorphic phantom. The size of the X-ray projection at the detector panel was 409.6 mm² × 409.6 mm², containing 1,024 × 1,024 pixels. The source to detector panel distance and the source to axis distance were 1,536 and 1,000 mm, respectively. The low-dose acquisition protocol settings were 100 kVp and 0.1 mAs during each projection. The reconstructed CBCT images of the phantom were generated with 512 × 512 × 100 voxels, comprising 0.5 × 0.5 × 1.0 mm³ per voxel. For quantitative comparison, a high-dose CBCT with increased mAs settings (100 kVp and 1.6 mAs) was scanned as a benchmark image. A transformation to the Hounsfield unit (HU) was performed for all the reconstructed CBCT images. The quantifications of image quality were based on contrast-to-noise ratio (CNR), root mean square error (RMSE), spatial non-uniformity (SNU), and correlation (17, 19). Task-based image quality assessments were also performed using an open-source software (imQuest, Duke University, Durham, NC, USA), which includes a task-based transfer function (TTF) for the spatial resolution and a noise power spectrum (NPS) for the noise texture and magnitude (41, 42). The combined results of NPS and TTF were used to evaluate the detectability index (43). For the detectability index calculation, a non-prewhitening matched filter (NPW) and 2D option were used as an observation model and NPS interpolation method, respectively.

The Catphan^®503 CTP404 module was used for evaluating the correlation, the RMSE, the CNR, and the TTF. A central region of interest (ROI) shown in the left column of the first row in Figure 1 identifies the location of the background for CNR calculation. The TTF was calculated using a circular ROI around each density insert to derive an edge spread function followed by Fourier Transform. Twenty-one consecutive axial slices including density inserts were selected for analysis. The Catphan^®503 uniformity module was used for calculating the SNU and the NPS. The ROIs for SNU calculation are shown in the middle column of the first row of Figure 1. The 104 ROIs for NPS calculation were selected on 13 consecutive axial slices (8 ROIs/slice × 13 slices = 104 ROIs). Overall frequencies were compared using 1D profiles. To visually compare spatial resolution differences with bin sizes in the joint histogram, profiles were obtained using a resolution gauge in the Catphan^®503 phantom.

A patient with pelvic bone metastases was also scanned for clinical data collection. The patient projection data were acquired by a Versa HD™ LINAC system with XVI R5.0 in its clinical mode. The S20 collimator, F0 filter, x-ray tube current, pulse duration per projection, and the resolution and voxel size of the reconstructed images were set to the same values as those of the phantom acquisition. In contrast, the voltage of the x-ray tube was set to 120 kVp, and 185 projections were obtained, lower than the number of projections obtained during the phantom scan by a factor of 0.28, including partial scans within 200° instead of a full rotation. The resolution of each projection was 512 × 512 pixels, which was lower than the phantom setting.

Results

The MI-NLTV denoiser was applied to CBCTs generated by analytical reconstruction algorithms using PDB or RDB based on the Catphan^®503 phantom. Its performance was compared with that observed in case of the NLTV based on an NLM filter, as well as without a denoiser. Figure 2 shows a representative slice of the CBCT reconstructed by the analytical reconstruction algorithm based on the PDB without the denoiser and the PDB followed by NLTV and MI-NLTV as well as the RDB without the denoiser and the RDB followed by NLTV and MI-NLTV. The RDB evidently achieved substantial gains when no denoiser was used and when NLTV and MI-NLTV were employed. The artifacts were significantly suppressed in the CBCT images processed by the proposed RDB in conjunction with the MI-NLTV denoiser compared with those processed through other approaches. In particular, the CBCT images followed by the MI-NLTV denoiser were smoother and exhibited fewer artifacts for both PDB and RDB, resulting in slightly lower spatial resolution compared to NLTV.

FIGURE 2

Figure 2 Comparison between the same views of the CBCT images generated by applying the analytical reconstruction algorithm based on (A) PDB, (B) RDB, (C) PDB followed by NLTV, (D) RDB followed by NLTV, (E) PDB followed by MI-NLTV, and (F) RDB followed by MI-NLTV using the Catphan^®503 phantom. (G) Benchmark image. These images are displayed at the window width (975 HU) and the window level (0 HU).

In addition, we calculated the CNR for each image for all reconstructed CBCTs included in the Catphan^®503 CTP404 module. The CTP404 module contains seven sensitometry targets made from Delrin™, Teflon, air, polymethylpentene (PMP), low-density polyethylene (LDPE), polystyrene, and air. By selecting a central ROI and seven ROIs within the density target, the mean HU values and standard deviation were recorded to calculate the CNR for each CBCT image. The relative image contrast between the corresponding regions could be compared. Table 1 provides the average value of the CNRs calculated for each insert in the CBCT images obtained based on the analytical reconstruction algorithms followed by no denoiser, NLTV, and MI-NLTV. Table 2 provides the statistical significance (p-values) of comparing the CNR values of six combinations to determine whether the CNR values are statistically higher or lower for each type of insert.

TABLE 1

Table 1 Comparison of contrast-to-noise ratio (CNR) values at seven ROIs in the CBCT image generated by analytical reconstruction algorithms based on Pixel-driven backprojector (PDB) and Ray-driven backprojector (RDB) using low-dose projection data of the Catphan^®503 phantom.

TABLE 2

Table 2 Statistical analysis comparing the CNR values of RDB+MI-NLTV vs. different combinations and PDB+MI-NLTV vs. PDB+NLTV for each type of insert using the paired t-test.

The NLTV and MI-NLTV show improved CNRs in all ROIs when compared with those obtained with PDB or RDB without a denoiser. Compared with PDB+NLTV and RDB+NLTV, PDB+MI-NLTV and RDB+MI-NLTV showed higher CNRs in all ROIs except for those of Teflon. In the case of Teflon, which has the highest HU of the seven inserts, MI-NLTV combinations showed statistically insignificant results because the p-values were greater than 0.05 when compared with the NLTV combinations. The reason is that both NLTV and MI-NLTV use spatially encoded factors to maintain high-intensity contrast in Teflon. Although combining RDB in MI-NLTV yielded slightly higher CNR values than combining PDB in MI-NLTV (except for polystyrene), these values are considered statistically similar.

Figure 3 shows the NPS curves and the TTF curves at seven density inserts for all reconstruction types. The eight square ROIs were placed at different positions in the Catphan^®503 uniformity module, as shown on the right side in Figure 3A. The NPS peaks were lower for all RDB types than their corresponding PDB types. The lowest NPS peak value was obtained as 1,139 HU²mm² for RDB+MI-NLTV. The NPS average spatial frequency shifted to lower frequencies when combining NLTV or MI-NLTV with PDB or RDB. Numerically, the NPS average spatial frequencies were obtained as 0.29 mm^-1 for PDB, 0.27 mm^-1 for RDB, 0.13 mm^-1 for PDB+NLTV, 0.17 mm^-1 for RDB+NLTV, 0.11 mm^-1 for PDB+MI-NLTV, and 0.12 mm^-1 for RDB+MI-NLTV. In terms of spatial resolution, TTF values tended to decrease as the noise magnitude decreased for all ROIs. In particular, it should be noted that MI-NLTV denoiser did not appear to enhance the whole spatial resolution while reducing noise magnitude. In the comparison of MI-NLTV and NLTV, TTF_50% values with MI-NLTV denoiser were lower than with NLTV denoiser for both PDB and RDB. Conversely, TTF_10% values showed that MI-NLTV was more dominant than NLTV. As such, similar spatial resolutions were found between MI-NLTV and NLTV when considering TTF_50% and TTF_10%. However, the combined results of the NPS and the TTF showed that the detectability index of the seven density inserts was higher with RDB+MI-NLTV than with other reconstruction types. This means that the CBCT image quality can be improved with RDB+MI-NLTV because the noise reduction effect is much greater than the spatial resolution reduction.

FIGURE 3

Figure 3 Task-based image quality assessment for PDB, RDB, PDB+NLTV, RDB+NLTV, PDB+MI-NLTV, and RDB+MI-NLTV. (A) Noise power spectrum (NPS) for evaluating the noise texture and magnitude. (B–H) The task-based transfer function (TTF) curves calculated by selecting a circular ROI around each insert among seven different density targets. (I) TTF_50% values and (J) TTF_10% values for assessing the spatial resolution. (K) The detectability index based on NPS and TTF for estimating the ability to detect some regions.

Table 3 provides three quantitative measures (RMSE, correlation, and SNU) obtained from the CBCT images generated based on six combinations. After calculating the RMSE, correlation, and SNU for each image of all reconstructed slices included in the Catphan^®503 module section, the average values were recorded. A paired t-test was also used to determine whether the paired measurements of the proposed and alternative combinations were statistically significant. The p-values obtained by the paired t-test are listed in Table 4.

TABLE 3

Table 3 Quantitative comparisons based on three metrics in the CBCT image generated by six analytical reconstruction algorithms using low-dose projection data of the Catphan^®503 phantom.

TABLE 4

Table 4 Statistical analysis of the three metrics of RDB+MI-NLTV and five approaches using a paired t-test.

The RMSE was calculated to assess the differences in the reconstructed and benchmark images, whereas the correlation evaluates the concordance with the benchmark image. For the RMSE and correlation measurements, a circular measurement region completely inside the phantom was chosen, as shown in Figure 2G; the surrounding air region of the phantom was excluded. The SNU was also calculated by selecting five ROIs with uniform intensity distribution on the Catphan^®503 uniformity module, one being located in the center and four in peripheral positions symmetrically arranged around the center. The RDB could yield a lower RMSE, higher correlation, and lower SNU compared with the PDB. Applying either the NLTV or MI-NLTV denoiser produced better results regardless of employing the PDB or RDB for the Catphan^®503 phantom for a low-dose scenario. The correlation values varied widely between PDB and RDB but were close after the NLTV or MI-NLTV denoisers were applied. When MI-NLTV was applied to PDB, the recorded correlation was always better and the recorded RMSE and SNU were worse than when NLTV was applied, whereas RDB followed by the MI-NLTV denoiser produced the lowest RMSE, highest correlation, and lowest SNU. Except for the correlation between the RDB followed by NLTV vs. the proposed method, there was statistical significance between the proposed method and five combinations because all p-values were less than 0.05.

Figure 4 compares the maximum intensity projections (MIPs) of the low-dose CBCT images reconstructed by the RDB without denoiser, RDB followed by NLTV, and RDB followed by MI-NLTV. It consists of projecting the voxel with the highest value onto a 2D image, traveling the viewing ray in a longitudinal direction throughout the volume. The MIP image enables it to determine whether some regions with high contrast are preserved while reducing the noise. The proposed MI-NLTV method shows that the regions with remarkable features are well-preserved while reducing noisy pixels compared with the results obtained using only RDB as well as with NLTV incorporated. After incorporating the proposed MI-NLTV denoiser, the homogeneous region was smoother, whereas a point feature and line bars showing a high contrast in the red ROIs were almost preserved.

FIGURE 4

Figure 4 Comparison of maximum intensity projections (MIPs) of the CBCT images generated by applying analytical reconstruction algorithm based on (A) RDB, (B) RDB followed by NLTV, and (C) RDB followed by MI-NLTV using the Catphan^®503 phantom. These images are displayed at the window (width and level) settings of (1,500, 500) HU.

To support our claim that the MI-NLTV denoiser improves the image quality, we calculated the mean HU for the two red boxes in Figure 4 to provide a quantitative comparison of the image quality of MIP. For ROI1 and ROI2, the mean HU and standard deviation were calculated as 346 ± 365 HU and 91 ± 38 HU for RDB, 312 ± 379 HU and 46 ± 42 HU for RDB+NLTV, and 291 ± 383 HU and 12 ± 30 HU for RDB+MI-NLTV, respectively. The mean HU decreased as expected because the noisy pixels were reduced, whereas the high-contrast pixels were preserved.

In addition to the Catphan^®503 phantom, the anthropomorphic head-and-neck phantom was reconstructed based on six combinations. Because the anthropomorphic phantom is surrounded by the skull, MIP mainly displays voxels contained within the skull, which are composed of high intensities. Therefore, instead of the MIP, Figure 5 shows the minimum intensity projections (MinIPs) of the reconstruction images along the longitudinal direction. MinIP is the opposite of MIP and returns the lowest value among the voxels that it encounters along the ray. MinIP indicates whether the noisy pixels generated in areas other than the skull corrupt the striking structures. Adding the proposed MI-NLTV denoiser produced improved preservation of the details and reduced the noise markedly, whereas the FBP with PDB was almost completely obscured by noisy pixels. Although employing MI-NLTV remarkably reduced the noise, it was not able to completely restore the detail level of the structures compared with the RDB. Figure 6 compares the corresponding slices of the low-dose CBCT images reconstructed by the RDB without denoiser and the RDB followed by NLTV and MI-NLTV. MI-NLTV denoiser further reveals improved detail-preserving and noise-reduction effects compared with the NLTV denoiser.

FIGURE 5

Figure 5 Comparison of minimum intensity projections (MinIPs) of CBCT images generated by applying the analytical reconstruction algorithm based on (A) PDB, (B) RDB, (C) PDB followed by NLTV, (D) RDB followed by NLTV, (E) PDB followed by MI-NLTV, and (F) RDB followed by MI-NLTV using the anthropomorphic head-and-neck phantom. (G) Benchmark image. These images are displayed at the window settings of (width = 750, level = 0) HU.

FIGURE 6

Figure 6 Comparison between the same views of CBCT images generated by applying the analytical reconstruction algorithm based on (A) RDB, (B) RDB followed by NLTV, and (C) RDB followed by MI-NLTV using the anthropomorphic head-and-neck phantom. (D) Benchmark image. These CBCT images are displayed at the window settings of (width = 1,400, level = 200) HU.

For the quantitative comparison, the CNR was calculated for each image of all the reconstructed CBCTs included in the skull. By selecting an ROI within the soft tissue and an ROI within the skull, the mean HU value and standard deviation were recorded for the CNR calculation. Table 5 provides the average values of the CNRs calculated for each image in the CBCTs obtained based on RDB followed by no denoiser, NLTV, and MI-NLTV. Statistical testing was also performed to compare the outcomes of the different approaches. The RDB+MI-NLTV combination showed a higher CNR and statistically significant result because the p-value was less than 0.05 when compared with RDB as well as RDB+NLTV.

TABLE 5

Table 5 CNR comparisons in the CBCT image generated by three RDB-based reconstruction algorithms using low-dose projection data of the anthropomorphic phantom.

To further illustrate the edge information, the spatial resolution was measured using a resolution gauge in the Catphan^®503 phantom. Figure 7 shows the 1D HU profile along the orthogonal direction of line bars from 1 through 8 line pairs per cm on the reconstructed images generated by only RDB and RDB followed by MI-NLTVs, with different bin sizes (256 × 256, 128 × 128, and 64 × 64). In the cases of 128 × 128 and 256 × 256 bin sizes, it was possible to identify up to six line pairs/cm due to preservation of major features such as for RDB. Moreover, when using the 64 × 64 bin size, there was a slight difference from six line pairs/cm onward compared with the other bin sizes. To balance the reconstructed image quality with the computational demands, the bin size of the joint histogram was set as 128 × 128 pixels to afford more interesting results.

FIGURE 7

Figure 7 Comparison between the same views of the reconstruction image generated by (A) RDB only and RDB followed by MI-NLTV applying (B) joint histogram bin size of 256 × 256, (C) 128 × 128, (D) 64 × 64, and (E) line profile at the red square using the Catphan^®503 phantom. These images are displayed at the window settings of (width = 1,500, level = 876) HU.

Table 6 compares the computation times when generating low-dose CBCT images of the Catphan^®503 and anthropomorphic phantoms based on six combinations. Each algorithm was implemented by utilizing OpenMP for parallelization on an Intel Xeon CPU system with 48 logical processors on 24 physical cores. Compared with PDB, the calculation time using RDB was approximately 3.7 times longer for the low-dose CBCT generation. Approximately 100 s were required for NLTV and 240 s for MI-NLTV. Because MI generates a joint histogram as a similarity measure between non-local patches, the calculation time was 2.4 times higher than that of the case of a conventional NLTV using NLM filter. Therefore, the computation time using the proposed RDB as well as MI-NLTV was approximately 6.1 times and 1.7 times longer compared with PDB and RDB. Table 7 delineates the computation time of the proposed MI-NLTV in terms of the joint histogram with different bin sizes. The calculations of the marginal entropy and joint entropy required for MI computation are dependent on the bin size of the joint histogram. The computation time for the MI-NLTV with 128 × 128 bin sizes was half that of 256 × 256 bin sizes.

TABLE 6

Table 6 Computation time (s) when generating CBCTs of the Catphan^®503 and anthropomorphic phantoms based on six combinations.

TABLE 7

Table 7 Computation time (s) when applying MI-NLTVs with different bin sizes on low-dose CBCTs of the Catphan^®503 and anthropomorphic phantoms.

Figure 8 shows low-dose CBCT images generated from real patient data with pelvic bone metastases. Because the patient projections were acquired under a short scan mode, a short scan weighting in all projections was added to avoid discontinuous artifacts due to redundant scans at certain angles by modifying the Parker’s weighting used in fan-beam CT reconstructions (44). Compared to the phantom study, the low signal-to-noise level resulted in an overall increase in noise in the reconstructed image and inferior feature details due to the use of a small number of projection data and lower resolution projection data. However, the relative superiority of the proposed MI-NLTV denoiser was also the same in clinical data. PDB was poor at delineating the pelvic bone metastatic lesion due to increased noise, as shown in the red ROI. This lesion appeared as a noisy but faint transparency with the RDB. It was more visible on NLTV as poorly defined, having low intensity with some noise. Meanwhile, the proposed MI-NLTV denoiser could better recognize metastatic bone lesions with fewer noisy pixels than NLTV. It was confirmed that the proposed method can significantly reduce noise pixels and preserve the detailed structure well in patient data. As such, image processing approaches for noise removal and contrast enhancement are of great interest. Noise and low contrast have a direct impact on the delineation of various tumor regions, including enhancing and non-enhancing tumors, necrosis, and edema. They not only affect the ROI extraction, but also interfere with operation in various post-processing tasks such as registration, segmentation, and classification. Therefore, if the quality of low-dose CBCT images is improved through the proposed algorithm, more precise monitoring of the tumor target and movement of the organs at risk (OARs) is possible during IGRT for cancer patients, thereby improving the target accuracy and reducing the dose to the OARs adjacent to the tumor. Furthermore, recalculating the dose distribution for the tumor target and adjacent OARs using the obtained low-dose CBCT may establish a new clinical guideline. This is expected to result in a lower complication rate and improved overall survival rates for radiotherapy outcomes.

FIGURE 8

Figure 8 Comparison between the same views of CBCT images generated by applying the analytical reconstruction algorithm based on (A) PDB, (B) RDB, (C) RDB followed by NLTV, and (D) RDB followed by MI-NLTV using a patient data with the pelvic bone metastatic lesion. These CBCT images are displayed at the window settings of (width = 2,400, level = 400) HU.

Discussion

This study has aimed to address the feasibility of obtaining low-dose CBCT images using a combination of RDB and MI-NLTV denoiser. The results of this study demonstrate the enhanced performance of the proposed approach compared with the results of the five combinations (PDB, RDB, PDB+NLTV, RDB+NLTV, and PDB+MI-NLTV) for low-dose CBCT. The evaluation utilized is based on a comparison through the image quality phantom analysis. It is demonstrated that in the actual measurement data with the Catphan^®503 and anthropomorphic head-and-neck phantoms, combining the RDB enables the MI-NLTV denoising-based analytical reconstruction algorithm to be further enhanced. Thereby, a higher CBCT image quality with a lower mAs acquisition protocol is obtained in terms of visual inspection, CNR, RMSE, correlation, SNU, and detectability index. In addition, when using the MI-NLTV denoiser, the spatial resolution was slightly further reduced, but over-smoothing and loss of important features did not occur. Conversely, some noisy pixels remained when the conventional NLTV was employed. In particular, reducing the number of projections by increasing the gantry rotation speed to reduce the patient radiation dose is challenging to implement in the currently available commercial CBCT scanners, whereas the other method proposed in our work, involving lowering the mAs level, can be easily applied in existing commercial scanners.

The proposed MI-NLTV denoiser in the reconstruction process involves a distinct contrast compared with the conventional NLTV approaches using the weight of an NLM filter to account for the difference in intensity between the pixel pairs. The NLM filter degrades the performance in low-dose CBCT images generated with low mAs because the local patch used to determine the pixel weights contains noisy-damaged pixels that reduce the similarity between the corresponding patches. On the contrary, the MI can facilitate overcoming this shortcoming by utilizing a statistical measure for the robust similarity calculation between the corresponding non-local patches and the reference patch. It is almost invariant to the change in noisy pixels, making the NLTV more stable and robust than the NLM filter.

Because the acquisition of CBCT scans utilizes a large-area detector, deterioration in CBCT image quality due to beam scattering is inevitable. However, although this study included improvement in low-dose CBCT image quality via the proposed method, measurement-based scatter correction such as an anti-scatter grid (45) or beam blocker (19, 46) was not applied to the acquired CBCT projection data. Considering that scatter correction methods can be used to acquire CBCT projection data, further reductions in the SNU of low-dose CBCT image quality can be expected.

There is no consensus on the guidelines for noise reduction in CBCT images. However, in radiotherapy, CBCT, taken before or during patient treatment, is used to monitor interfraction or intrafraction differences in positional settings and anatomical changes (3). Based on the bone anatomy or soft tissue at the treatment site, rigid body image registration of CBCT and Plan-CT is performed to obtain an appropriate transformation vector to correct motion variations. Recently, it has also been used for deformable image registration between CBCT and Plan-CT for adaptive radiotherapy. The accuracy of this image registration is affected by the difference in image quality of the CBCT used. A prior study reported that higher image quality reduced the uncertainty of soft tissue image registration during IGRT (47). As such, high-quality CBCT images can improve the accuracy of image registration, thereby reducing the target margin during treatment planning. Therefore, reducing the noise level in the low-dose CBCT images provides the best visualization of bone and soft tissue structures, which reduces the uncertainty of image registration during IGRT. In clinical practice, an image processing technique with improved performance such as an MI-NLTV denoising algorithm to remove noise while maintaining the edge structure is needed.

The proposed MI-NLTV denoiser showed better quantitative and qualitative tendencies when combined with RDB instead of PDB. Because this method was applied in the image domain, applying a backprojector that can reduce the noticeable noise level in CBCT image generation appears to produce more stable weight values in MI-based statistical calculations. Instead of RDB, the proposed MI-NLTV can also be combined with other more sophisticated backprojector methods such as distance-driven methods (48) or separable footprints (49). When the MI-NLTV denoiser was applied to CBCT images generated from PDB, obtaining a robust weight value was difficult in the MI-based statistical calculation because the noise level of PDB was relatively high. If projection images could be acquired using slightly higher mAs to bring the NPS peak down to the RDB level, the MI-NLTV denoiser is expected to work well with PDB. Although the proposed MI-NLTV denoiser method was applied to a 2D inter-patched image, the method can be technically extended to 3D blocks (50). This would increase the computational cost because of the larger search area in the weight computations.

This study on low-dose CBCT reconstruction involves a few practical considerations that are discussed here. First, considering the time consumed during the proposed reconstruction approach, the time consumed by the RDB and MI-NLTV denoiser in the experiments is measured. The computation time of the PDB is affected by the reconstructed volume size because it estimates the projection pixel for each voxel in the reconstructed volume. Moreover, the RDB is affected by the number of projections and the size as well as the reconstruction volume size because it uses each pixel of the projection as the starting point of the ray and calculates the intersection length between the ray path and each voxel in the reconstructed volume. The MI-NLTV denoiser increases the computational cost owing to the generation of a joint histogram as a similarity measure between non-local patches when computing the weight function compared with the NLTV denoiser. These algorithms were implemented using OpenMP-based parallelization. The RDB process was parallelized in terms of multiple rays, and the MI-NLTV was mainly parallelized in multiple voxels. It should be noted that there is still a considerable potential for further accelerating the algorithm. A scalable approach would be to improve the computational ability using a GPU (14) or FPGA (51). Second, the parameter values pertaining to the number of iterations, search area, patch size, and spatially encoded factor were set to be equal for ensuring a fair comparison between MI-NLTV and NLTV. These parameters were empirically determined to balance the image quality with the computational load; however, the best performance of either algorithm is not ensured. Nonetheless, we found that the results are not highly sensitive to these parameters. The findings here are not far from optimal. Third, in the deep learning approaches, there are many data augmentation strategies to add more training data based on image processing techniques. The proposed MI-NLTV denoiser can be considered as an advanced augmentation technique for building better statistical models. Fourth, MI-NLTV-based analytical reconstruction algorithm can be used to generate an initial guess image or an image fidelity term for iterative reconstruction. In general, when using an iterative reconstruction algorithm, it has been observed that the better the initial guess image, the faster the convergence and the higher the contrast of the CBCT images produced.

Conclusion

The incorporation of MI has proven to be almost invariant to the change in the noisy pixels while maintaining the original advantages with similar properties of the conventional NLTV denoiser, making the NLTV more stable and robust than the conventional NLM filter. These differences indicate a preference for the MI in NLTV for low-dose CBCT imaging. Moreover, achieving clinically acceptable CBCT image quality despite low-mAs projection acquisition can reduce the burden on common online CBCT imaging, such as correcting setup errors and monitoring patient movements, thus making the use of IGRT widely available. The proposed approach can improve patient safety throughout the course of radiotherapy.

Data Availability Statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding author.

Ethics Statement

Ethical approval for this study was obtained from the institutional review board of Yonsei University Health System, Gangnam Severance Hospital (Approval No.: 3-2020-0464). Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

Conceptualization, HL and IL. Methodology, HL. Software, HL. Validation, IL and HL. Formal analysis, HL. Investigation, HL. Data acquisition, JS and YC. Data curation, JS and YC. Patient enrollment, JK. Writing—original draft preparation, HL. Writing—review and editing, IL. Supervision, IL. All authors contributed to the article and approved the submitted version.

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea funded by the Ministry of Education (grant number 2019R1I1A1A01062157) and a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) funded by the Ministry of Health & Welfare, Republic of Korea (grant number HI19C1330).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Nabavizadeh N, Elliott DA, Chen Y, Kusano AS, Mitin T, Thomas CR Jr, et al. Image Guided Radiation Therapy (IGRT) Practice Patterns and IGRT's Impact on Workflow and Treatment Planning: Results From a National Survey of American Society for Radiation Oncology Members. Int J Radiat Oncol Biol Phys (2016) 94(4):850–7. doi: 10.1016/j.ijrobp.2015.09.035

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Fuchs F, Habl G, Devečka M, Kampfer S, Combs SE, Kessel KA. Interfraction Variation and Dosimetric Changes During Image-Guided Radiation Therapy in Prostate Cancer Patients. Radiat Oncol J (2019) 37(2):127. doi: 10.3857/roj.2018.00514

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Lee JJB, Lee IJ, Choi Y, Jeon MJ, Jung IH, Lee H. Clinical Implications of Geometric and Dosimetric Uncertainties of Inter-And Intra-Fractional Movement During Volumetric Modulated Arc Therapy for Breast Cancer Patients. Cancers (Basel) (2021) 13(7):1651. doi: 10.3390/cancers13071651

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Fung CY, Chen E, Vapiwala N, Pohar S, Trifiletti D, Truong M-T, et al. The American Society for Radiation Oncology 2017 Radiation Oncologist Workforce Study. Int J Radiat Oncol Biol Phys (2019) 103(3):547–56. doi: 10.1016/j.ijrobp.2018.10.020

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Han Y. Current Status of Proton Therapy Techniques for Lung Cancer. Radiat Oncol J (2019) 37(4):232. doi: 10.3857/roj.2019.00633

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Aird E. Second Cancer Risk, Concomitant Exposures and IRMER (2000). Br Institute Radiol (2004) 77(924):983–5. doi: 10.1259/bjr/56613233

CrossRef Full Text | Google Scholar

7. Moon YM, Kim H-J, Kwak DW, Kang Y-R, Lee MW, Ro T-I, et al. Effective Dose Measurement for Cone Beam Computed Tomography Using Glass Dosimeter. Nucl Eng Technol (2014) 46(2):255–62. doi: 10.5516/NET.08.2012.080

CrossRef Full Text | Google Scholar

8. Murphy MJ, Balter J, Balter S, BenComo JA Jr, Das IJ, Jiang SB, et al. The Management of Imaging Dose During Image-Guided Radiotherapy: Report of the AAPM Task Group 75. Med Phys (2007) 34(10):4041–63. doi: 10.1118/1.2775667

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Wang J, Li T, Liang Z, Xing L. Dose Reduction for Kilovotage Cone-Beam Computed Tomography in Radiation Therapy. Phys Med Biol (2008) 53(11):2897. doi: 10.1088/0031-9155/53/11/009

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Lee H, Park J, Choi Y, Park KR, Min BJ, Lee IJ. Low-Dose CBCT Reconstruction via Joint Non-Local Total Variation Denoising and Cubic B-Spline Interpolation. Sci Rep (2021) 11(1):1–15. doi: 10.1038/s41598-021-83266-1

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Brenner DJ, Hall EJ. Computed Tomography—an Increasing Source of Radiation Exposure. N Engl J Med (2007) 357(22):2277–84. doi: 10.1056/NEJMra072149

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Feldkamp L, Davis L, Kress J. Practical Cone-Beam Algorithm. J Optical Soc America A (1984) 1(6):612–9. doi: 10.1364/JOSAA.1.000612

CrossRef Full Text | Google Scholar

13. Lee B, Lee H, Shin YG. Fast Hybrid CPU-And GPU-Based CT Reconstruction Algorithm Using Air Skipping Technique. J Xray Sci Technol (2010) 18(3):221–34. doi: 10.3233/XST-2010-0256

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Park H-G, Shin Y-G, Lee H. A Fully GPU-Based Ray-Driven Backprojector via a Ray-Culling Scheme With Voxel-Level Parallelization for Cone-Beam CT Reconstruction. Technol Cancer Res Treat (2015) 14(6):709–20. doi: 10.7785/tcrt.2012.500429

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Ahmadi R, Farahani JK, Sotudeh F, Zhaleh A, Garshasbi S. Survey of Image Denoising Techniques. Life Sci J (2013) 10(1):753–5. doi: 10.7537/marslsj100113.118

CrossRef Full Text | Google Scholar

16. Motwani MC, Gadiya MC, Motwani RC, Harris FC. Survey of Image Denoising Techniques. Proc GSPX (2004).

Google Scholar

17. Lee H, Yoon J, Lee E. Anisotropic Total Variation Denoising Technique for Low-Dose Cone-Beam Computed Tomography Imaging. Prog Med Phys (2018) 29(4):150–6. doi: 10.14316/pmp.2018.29.4.150

CrossRef Full Text | Google Scholar

18. Sidky EY, Pan X. Image Reconstruction in Circular Cone-Beam Computed Tomography by Constrained, Total-Variation Minimization. Phys Med Biol (2008) 53(17):4777. doi: 10.1088/0031-9155/53/17/021

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Lee H, Fahimian BP, Xing L. Binary Moving-Blocker-Based Scatter Correction in Cone-Beam Computed Tomography With Width-Truncated Projections: Proof of Concept. Phys Med Biol (2017) 62(6):2176. doi: 10.1088/1361-6560/aa5913

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Kim H, Chen J, Wang A, Chuang C, Held M, Pouliot J. Non-Local Total-Variation (NLTV) Minimization Combined With Reweighted L1-Norm for Compressed Sensing CT Reconstruction. Phys Med Biol (2016) 61(18):6878. doi: 10.1088/0031-9155/61/18/6878

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Sohn JJ, Kim C, Kim DH, Lee S-R, Zhou J, Yang X, et al. Analytical Low-Dose CBCT Reconstruction Using Non-Local Total Variation Regularization for Image Guided Radiation Therapy. Front Oncol (2020) 10:242. doi: 10.3389/fonc.2020.00242

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Wang J, Li T, Xing L. Iterative Image Reconstruction for CBCT Using Edge-Preserving Prior. Med Phys (2009) 36(1):252–60. doi: 10.1118/1.3036112

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Sun T, Sun N, Wang J, Tan S. Iterative CBCT Reconstruction Using Hessian Penalty. Phys Med Biol (2015) 60(5):1965. doi: 10.1088/0031-9155/60/5/1965

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wu J, Wang X, Mou X, Chen Y, Liu S. Low Dose CT Image Reconstruction Based on Structure Tensor Total Variation Using Accelerated Fast Iterative Shrinkage Thresholding Algorithm. Sensors (Basel) (2020) 20(6):1647. doi: 10.3390/s20061647

CrossRef Full Text | Google Scholar

25. Song Y, Zhang W, Zhang H, Wang Q, Xiao Q, Li Z, et al. Low-Dose Cone-Beam CT (LD-CBCT) Reconstruction for Image-Guided Radiation Therapy (IGRT) by Three-Dimensional Dual-Dictionary Learning. Radiat Oncol (2020) 15(1):1–9. doi: 10.1186/s13014-020-01630-3

CrossRef Full Text | Google Scholar

26. Liu L, Li X, Xiang K, Wang J, Tan S. Low-Dose CBCT Reconstruction Using Hessian Schatten Penalties. IEEE Trans Med Imaging (2017) 36(12):2588–99. doi: 10.1109/TMI.2017.2766185

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Lee H, Xing L, Davidi R, Li R, Qian J, Lee R. Improved Compressed Sensing-Based Cone-Beam CT Reconstruction Using Adaptive Prior Image Constraints. Phys Med Biol (2012) 57(8):2287. doi: 10.1088/0031-9155/57/8/2287

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Wang J, Liang J, Cheng J, Guo Y, Zeng L. Deep Learning Based Image Reconstruction Algorithm for Limited-Angle Translational Computed Tomography. PLoS One (2020) 15(1):e0226963. doi: 10.1371/journal.pone.0226963

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Yuan N, Dyer B, Rao S, Chen Q, Benedict S, Shang L, et al. Convolutional Neural Network Enhancement of Fast-Scan Low-Dose Cone-Beam CT Images for Head and Neck Radiotherapy. Phys Med Biol (2020) 65(3):035003. doi: 10.1088/1361-6560/ab6240

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Chen L, Liang X, Shen C, Jiang S, Wang J. Synthetic CT Generation From CBCT Images via Deep Learning. Med Phys (2020) 47(3):1115–25. doi: 10.1002/mp.13978

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Kida S, Nakamoto T, Nakano M, Nawa K, Haga A, Kotoku JI, et al. Cone Beam Computed Tomography Image Quality Improvement Using a Deep Convolutional Neural Network. Cureus (2018) 10(4). doi: 10.7759/cureus.2548

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Chen H, Zhang Y, Zhang W, Liao P, Li K, Zhou J, et al. Low-Dose CT via Convolutional Neural Network. BioMed Opt Express (2017) 8(2):679–94. doi: 10.1364/BOE.8.000679

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Lee H, Lee J, Kim H, Cho B, Cho S. Deep-Neural-Network-Based Sinogram Synthesis for Sparse-View CT Image Reconstruction. IEEE Trans Radiat Plasma Med Sci (2018) 3(2):109–19. doi: 10.1109/TRPMS.2018.2867611

CrossRef Full Text | Google Scholar

34. Liang X, Chen L, Nguyen D, Zhou Z, Gu X, Yang M, et al. Generating Synthesized Computed Tomography (CT) From Cone-Beam Computed Tomography (CBCT) Using CycleGAN for Adaptive Radiation Therapy. Phys Med Biol (2019) 64(12):125002. doi: 10.1088/1361-6560/ab22f9

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Sidky EY, Lorente I, Brankov JG, Pan X. Do CNNs Solve the CT Inverse Problem. IEEE Trans BioMed Eng (2021) 68(6):1799–810. doi: 10.1109/TBME.2020.3020741

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Choi S, Lee S, Lee Y-J, Kim Y-S, Lee H-H, Kim H-J. Feasibility of Ray-and Pixel-Driven Projector/Back-Projector in Linear Motion Tomosynthesis. SPIE Medical Imaging 2015: Physics of Medical Imaging. Orlando, Florida, United States: International Society for Optics and Photonics (2015). p. 94122Q.

Google Scholar

37. Kim K, El Fakhri G, Li Q. Low-Dose CT Reconstruction Using Spatially Encoded Nonlocal Penalty. Med Phys (2017) 44(10):e376–90. doi: 10.1002/mp.12523

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Kim Y, Kudo H. Nonlocal Total Variation Using the First and Second Order Derivatives and Its Application to CT Image Reconstruction. Sensors (Basel) (2020) 20(12):3494. doi: 10.3390/s20123494

CrossRef Full Text | Google Scholar

39. Pluim JP, Maintz JA, Viergever MA. Mutual-Information-Based Registration of Medical Images: A Survey. IEEE Trans Med Imaging (2003) 22(8):986–1004. doi: 10.1109/TMI.2003.815867

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Maes F, Collignon A, Vandermeulen D, Marchal G, Suetens P. Multimodality Image Registration by Maximization of Mutual Information. IEEE Trans Med Imaging (1997) 16(2):187–98. doi: 10.1109/42.563664

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Samei E, Bakalyar D, Boedeker KL, Brady S, Fan J, Leng S, et al. Performance Evaluation of Computed Tomography Systems: Summary of AAPM Task Group 233. Med Phys (2019) 46(11):e735–e56. doi: 10.1002/mp.13763

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Greffier J, Frandon J, Pereira F, Hamard A, Beregi J, Larbi A, et al. Optimization of Radiation Dose for CT Detection of Lytic and Sclerotic Bone Lesions: A Phantom Study. Eur Radiol (2020) 30(2):1075–8. doi: 10.1007/s00330-019-06425-z

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Greffier J, Hamard A, Pereira F, Barrau C, Pasquier H, Beregi JP, et al. Image Quality and Dose Reduction Opportunity of Deep Learning Image Reconstruction Algorithm for CT: A Phantom Study. Eur Radiol (2020) 30(7):3951–9. doi: 10.1007/s00330-020-06724-w

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Parker DL. Optimal Short Scan Convolution Reconstruction for Fan Beam CT. Med Phys (1982) 9(2):254–7. doi: 10.1118/1.595078

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Park Y, Alexeev T, Miller B, Miften M, Altunbas C. Evaluation of Scatter Rejection and Correction Performance of 2D Antiscatter Grids in Cone Beam Computed Tomography. Med Phys (2021) 48(4):1846–58. doi: 10.1002/mp.14756

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Zhu L, Wang J, Xing L. Noise Suppression in Scatter Correction for Cone-Beam CT. Med Phys (2009) 36(3):741–52. doi: 10.1118/1.3063001

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Morrow NV, Lawton CA, Qi XS, Li XA. Impact of Computed Tomography Image Quality on Image-Guided Radiation Therapy Based on Soft Tissue Registration. Int J Radiat Oncol Biol Phys (2012) 82(5):e733–e8. doi: 10.1016/j.ijrobp.2011.11.043

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Liu R, Fu L, De Man B, Yu H. GPU-Based Branchless Distance-Driven Projection and Backprojection. IEEE Trans Comput Imaging (2017) 3(4):617–32. doi: 10.1109/TCI.2017.2675705

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Long Y, Fessler JA, Balter JM. 3D Forward and Back-Projection for X-Ray CT Using Separable Footprints. IEEE Trans Med Imaging (2010) 29(11):1839–50. doi: 10.1109/TMI.2010.2050898

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Zhao T, Hoffman J, McNitt-Gray M, Ruan D. Ultra-Low-Dose CT Image Denoising Using Modified BM3D Scheme Tailored to Data Statistics. Med Phys (2019) 46(1):190–8. doi: 10.1002/mp.13252

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Kulkarni A, Mohsenin T. Accelerating Compressive Sensing Reconstruction OMP Algorithm With CPU, GPU, FPGA and Domain Specific Many-Core. 2015 IEEE International Symposium on Circuits and Systems (ISCAS). Lisbon, Portugal: IEEE (2015). p. 970–3.

Google Scholar

Appendix A

For n www.frontiersin.org 1 to all images do

r www.frontiersin.org 1, r_red 0.8, ε 3, a 2, Ω 10

For j www.frontiersin.org 1 to all voxels do

Calculate D(V_j) using Eq. (9)

Create intensity CDF histogram using V_j

End For

τ www.frontiersin.org Intensity at 90% of intensity CDF

R(V) www.frontiersin.org 0

For j www.frontiersin.org 1 to all voxels do

w_j www.frontiersin.org 0

For k www.frontiersin.org a to a do

Find the largest number in V_j₊_k and call it ′A_max′

End For

For i www.frontiersin.org j – Ω to j + Ω do

S www.frontiersin.org 0

For k www.frontiersin.org a to a do

Find the largest number in V_i₊_k and call it ′B_max′

End For

For k www.frontiersin.org a to a do

A www.frontiersin.org (V_j+k/A_max) × B in Size

B www.frontiersin.org (V_i+k/B_max) × B in Size

Voting at Bin of (A,B) of the joint histogram

End For

Calculate w_j using Eq. (7)

Calculate D(V_j) using Eq. (9)

R(V_j) www.frontiersin.org w_jD(V_j)

R(V) www.frontiersin.org R(V) + R(V_j)

End For

For t www.frontiersin.org to 20 do

For j www.frontiersin.org 1 to all voxels do

λ www.frontiersin.org $\sqrt{\sum_{j} V_{j}^{2}}$ λ λ × r

∂V_j ← www.frontiersin.org ∇R(V_j) calculated by Eq. (12)

$| \nabla R (V) |$ ← www.frontiersin.org $\sum \sqrt{{\sum_{j} (\partial V_{j})}^{2}} a s i n E q . (13)$

End For

For j www.frontiersin.org 1 to all voxels do

$V_{j}^{'}$ www.frontiersin.org $\sum V_{j} + λ \partial V_{j} / | \nabla R (V) |$

End For

While R $(V_{j}^{'})$ > R(V) do

r www.frontiersin.org r × r_red

λ www.frontiersin.org λ × r

For j www.frontiersin.org 1 to all voxels do

$V_{j}^{'}$ www.frontiersin.org $V_{j} + \frac{λ \partial V_{j}}{| \nabla R (V) |}$

End For

End While

$U p d a t e V_{j}^{'} t o V_{j}$

End For

Keywords: low-dose cone-beam computed tomography, mutual information, non-local total variation, ray-driven backprojector, low mAs, image-guided radiation therapy

Citation: Lee H, Sung J, Choi Y, Kim JW and Lee IJ (2021) Mutual Information-Based Non-Local Total Variation Denoiser for Low-Dose Cone-Beam Computed Tomography. Front. Oncol. 11:751057. doi: 10.3389/fonc.2021.751057

Received: 31 July 2021; Accepted: 28 September 2021;
Published: 21 October 2021.

Edited by:

Tonghe Wang, Emory University, United States

Reviewed by:

James Sohn, Virginia Commonwealth University, United States
Jordan Slagowski, University of Chicago, United States

Copyright © 2021 Lee, Sung, Choi, Kim and Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ik Jae Lee, aWtqYWU0MTJAeXVocy5hYw==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Mutual Information-Based Non-Local Total Variation Denoiser for Low-Dose Cone-Beam Computed Tomography

Introduction

Methods

Ray-Driven Backprojector Based on X-Ray Transmission Length Calculation

Mutual Information-Based Non-Local Total Variation Denoiser

Experimental Studies

Results

Discussion

Conclusion

Data Availability Statement

Ethics Statement

Author Contributions

Funding

Conflict of Interest

Publisher’s Note

References

Appendix A

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good