Motion guided Spatiotemporal Sparsity for high quality 4D-CBCT reconstruction

Liu, Yang; Tao, Xi; Ma, Jianhua; Bian, Zhaoying; Zeng, Dong; Feng, Qianjin; Chen, Wufan; Zhang, Hua

doi:10.1038/s41598-017-17668-5

Download PDF

Article
Open access
Published: 12 December 2017

Motion guided Spatiotemporal Sparsity for high quality 4D-CBCT reconstruction

Yang Liu ORCID: orcid.org/0000-0001-9088-0299¹,
Xi Tao¹,
Jianhua Ma¹,
Zhaoying Bian¹,
Dong Zeng¹,
Qianjin Feng¹,
Wufan Chen¹ &
…
Hua Zhang¹

Scientific Reports volume 7, Article number: 17461 (2017) Cite this article

1307 Accesses
6 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Conventional cone-beam computed tomography is often deteriorated by respiratory motion blur, which negatively affects target delineation. On the other side, the four dimensional cone-beam computed tomography (4D-CBCT) can be considered to describe tumor and organ motion. But for current on-board CBCT imaging system, the slow rotation speed limits the projection number at each phase, and the associated reconstructions are contaminated by noise and streak artifacts using the conventional algorithm. To address the problem, we propose a novel framework to reconstruct 4D-CBCT from the under-sampled measurements—Motion guided Spatiotemporal Sparsity (MgSS). In this algorithm, we try to divide the CBCT images at each phase into cubes (3D blocks) and track the cubes with estimated motion field vectors through phase, then apply regional spatiotemporal sparsity on the tracked cubes. Specifically, we recast the tracked cubes into four-dimensional matrix, and use the higher order singular value decomposition (HOSVD) technique to analyze the regional spatiotemporal sparsity. Subsequently, the blocky spatiotemporal sparsity is incorporated into a cost function for the image reconstruction. The phantom simulation and real patient data are used to evaluate this algorithm. Results show that the MgSS algorithm achieved improved 4D-CBCT image quality with less noise and artifacts compared to the conventional algorithms.

Comparative study of sub-second temporal resolution 4D-MRI and 4D-CT for target motion assessment in a phantom model

Article Open access 21 September 2023

Low-dose CBCT reconstruction via joint non-local total variation denoising and cubic B-spline interpolation

Article Open access 11 February 2021

Motion correction for routine X-ray lung CT imaging

Article Open access 12 February 2021

Introduction

Three-dimensional cone-beam computed tomography (3D-CBCT) has been widely used in image guided radiation therapy (IGRT)^1,2,3. It can provide the volumetric information for tumor localization in IGRT. But for thoracic and upper abdominal regions, the 3D-CBCT image is often deteriorated by motion blur⁴. To address the problem, four-dimensional CBCT (4D-CBCT) imaging that incorporating temporal (phase) information on the basis of 3D-CBCT was proposed^5,6,7. Compared to 3D-CBCT imaging, 4D-CBCT can provide the patient-specific respiratory motion information and multiple three-dimensional volumes to represent the different status in the breathing cycle^8,9,10. For 4D-CBCT imaging, the respiratory signal is recorded or estimated and motion contained cone-beam projections are usually sorted into 8–10 subsets according to the respiratory signal. However, the gantry rotation speed and frame rate of the flat-panel imager limit the total number of cone-beam projections (usually 600~800), which result in relatively fewer projections in each respiratory phase. Consequently, the reconstructed CBCT images by using the conventional Feldkamp–Davis–Kress (FDK) algorithm¹¹ suffer from significant artifacts and noise^12,13. In addition, the randomness of breathing would lead to the cone-beam projections bunched into several clusters, and the bunched sampling scheme will aggravate the noises and artifacts level in the reconstructed images¹⁴.

To address this problem, various strategies have been proposed to improve the image quality of 4D-CBCT¹⁵. On one side, increasing the number of angular sampling, multiple-gantry rotation and slow-gantry rotation schemes were structured^7,13. But these schemes will prolong the time of data acquisition and increase the risk of motion embracing. On the other side, lots of reconstruction algorithms^16,17,18 have been proposed to improve the quality of 4D-CBCT image, such like the total variation minimization based algorithms^15,19,20, the McKinnon–Bates (MKB) algorithm^21,22, the prior image constrained compressed sensing (PICCS) algorithm²³ and the auto-adaptive phase correlation (AAPC) algorithm²⁴. As shown in the study by Frank Bergnera et al., the reconstruction algorithms using an iterative scheme can remarkably reduce the 4D-CBCT specific artifacts²⁵. Nevertheless, algorithms that use the full data set, at least for initialization, such as MKB and PICCS algorithm, are only a trade-off and achieve sub-optimal temporal resolution, of which the residual motion could still be found in the reconstructed 4D-CBCT images²⁶. On the other hand, algorithms that only use the projections assigned to the current phase to reconstruct the final image can fully achieve the temporal resolution, such as the total variation minimization based algorithm. For these algorithm, when the projection number is very low or there exist small size and low-contrast objects, tiny structure may be usually erased with the piecewise smooth constraint.

In recent years, the image restoration or denoising algorithms based on block processing have shown increasing vitality. The earliest concept of ‘patch’ was proposed in Haralick’s study on textural features for image classification²⁷. In 1980, JS Lee pioneeringly used this concept for image enhancement²⁸. Latterly, algorithms such like the dictionary learning based image reconstruction/restoration approaches²⁹, the nonlocal means filter³⁰ and the BM3D algorithm³¹, et al.^32,33,34, have been successfully developed. Local patches often experience much less distortion than the global image and therefore it becomes easier to define the similarity between local patches. Low-rank and sparsity can be better reflected in patch based processing.

In this work, we propose to reconstruct 4D-CBCT volumes from the subset projections of current phase and incorporate the image domain phase-correlated information into the iterative procedure. Motived by the success of block based image restoration, we propose a Motion guided Spatiotemporal Sparsity (MgSS) to formulate the regularization for 4D-CBCT reconstruction. In this scheme, CBCT images of different phases are divided into small cubes (three-dimensional blocks) and the cubes are tracked with estimated motion vector fields through time (phase). After then, regional spatiotemporal sparsity is applied on the tracked cubes. Specifically, we recast the tracked cubes into four-dimensional matrix, and use the higher order singular value decomposition (HOSVD)³⁵ technique to analyze the regional spatiotemporal sparsity. Finally, a cost function is formulated with embedding the block based spatiotemporal sparsity. One simple but effective optimization algorithm was used for the cost function solution.

This paper is structured as follows: In the method part, we first present the flow chart of the proposed MgSS scheme and then give detailed introduction of each step. After this, we formulate the reconstruction framework that incorporates the MgSS scheme. In the experiment part, we exhibit the results of the NCAT phantom simulation data, 4DCT based simulation data and real patient data by using FDK, SART-TV and the MgSS algorithms. Lastly, in the discussion part, we simply discuss the superiority and limitation of the proposed algorithm.

Methods

Flow chart of the proposed MgSS algorithm

In this work, the proposed Motion guided Spatiotemporal Sparsity (MgSS) applying to 4D-CBCT reconstruction comprises the following steps;

(1)
Estimate the motion maps for each voxel between adjacent phases of the 4D-CBCT images.
(2)
Divide the CBCT sequence images into cubes (3D blocks), and track the cubes through time using the three-dimensional motion maps obtained in step 1.
(3)
Stack the tracked cubes and apply regional spatiotemporal sparsity on the tracked cubes by framing the 4D-CBCT reconstruction problem to be a constrained optimization problem, and then optimize the problem to get the reconstructions.

In the following sections, we will introduce these steps in detail.

Motion Maps Estimation

In this study, we are not devoted to develop new algorithm to achieve the three-dimensional motion vector fields (3D-MVFs) between CBCT images of adjacent phases, but rather we use the Real-Time Image-based Tracker (RTIT)³⁶ toolbox to estimate the three-dimensional motion maps. The RTIT toolbox is an available open source with implementation of the optical flow based registration algorithms^37,38. Assume that we obtain the initial/current 4D-CBCT estimation, with the RTIT, we can achieve the pixel based motion vector fields consisting of the changes in space coordinates that describe the distribution of the apparent motion velocities of intensity patterns in the sequences of images.

Cubes Tracking with MVFs

For the task of cubes tracking, we define the voxel space-time position as $u=(x,\,y,\,z,\,t)$, where x, y, z and t represent the spatial position of the voxel in the CBCT volumetric image and the temporal phase index, respectively. In MgSS, we fetch the cubes (3D blocks) from the images of the first phase and use the 3D-MVFs to track structurally similar cubes in the other phases. Specifically, the first phase image was initiated with highly overlapping cubes. In the extreme case, all the voxels in the first phase image can be defined as the central voxel of one cube, but this may result in huge computation burden. Thus in this study, rather than sliding by one voxel to every next, we use a step of N_step = 2 voxels to define the cubes. For the motion tracking, considering one cube $B({u}_{1})\in {{X}_{1}}^{{N}_{b}\times {N}_{b}\times {N}_{b}}$, here ${u}_{1}=({x}_{1},\,{y}_{1},\,{{\rm{z}}}_{1},\,{t}_{1})$ indicates the central pixel position of the cubes with spatial position to be $({x}_{1},\,{y}_{1},\,{{\rm{z}}}_{1})$ and temporal phase index to be t ₁. By using Δu to denote the displacement of the central voxel in the cube between t ₁ and t ₂ phases, we can use u ₁ + Δu to track the center pixel location for cube in the t ₂ phase. Considering u ₁ + Δu might be non-integral, the next block center voxel was taken as u ₂ = {u ₁ + Δu ₂}, where “{}” is a rounding operation. Extracting the voxels centered from u ₂, we can constitute a new cube B(u ₂) with the same block size. Following this schedule, B(u _n) can be tracked with u _n= {u ₁ + Δu _n} using the above block-center-tracking method. A block cluster can be ulteriorly constructed: ${{\rm{\Theta }}}_{MgSS}=[B({u}_{1}),B({u}_{2}),\mathrm{...},B({u}_{Nt})]$. Here the cluster Θ_MgSS is a four dimensional matrix with the size of ${N}_{b}\times {N}_{b}\times {N}_{b}\times {N}_{t}$. Based on the fact that not all the chest is moving during the CBCT scan, the tracked cubes for the static parts should be noisy blocks with same anatomy structures. As shown in Fig. 1, compared to the blocks of the same spatial position in the original phases, the tracked blocks exhibit more mutual similarity.

Regional Spatiotemporal Sparsity

The technology about matrix rank sparsity has been successfully investigated in the field of dynamic image reconstruction, such as the k-t SLR method³⁹. In some previous studies⁴⁰, matrix rank sparsity was often used to dispose the entire image. But the recent studies, such like Low-dimensional-structure self-learning and thresholding (LOST)⁴¹ and compartment-based k-t principal component analysis⁴² have indicated that the spatiotemporal sparsity and reconstruction quality could be further promoted by separating the entire image into blocks. Under the patch-based processing theory, decomposition of the tracked regions of dynamic datasets using the singular value decomposition (SVD) algorithm has been reported⁴³. In Chen’s work, blocks in the clusters were vectorized and each cluster Θ was first rearranged into a 2-D matrix $\tilde{{\rm{\Theta }}}\in {{\mathbb{C}}}^{{N}_{s}\times {N}_{t}}({N}_{s}={N}_{b}\times {N}_{b})$. The SVD technique was then adopted to decompose the cluster: ${\tilde{{\rm{\Theta }}}}_{svd}=U{S}_{svd}{V}^{\ast }$. If the cluster is truly spatiotemporal sparsity, a least number of significant values will be found in the singular matrix S _svd. For the standard SVD, the image blocks are manually vectorized and then the image structural properties in the spatial domain are ignored. To address the problem, the higher order singular value decomposition (HOSVD), which is able to directly decompose dynamic datasets into a multidimensional singular matrix rather than unfolding the blocks into column vectors, has been reported³⁵. Also, in this work, we use the HOSVD to decompose the clusters due to its naturality and flexibility. By using the HOSVD technique, a four-dimensional tensor Θ can be decomposed as:

$${\rm{\Theta }}={S}_{hosvd}{\times }_{1}{U}^{(1)}{\times }_{2}{U}^{(2)}{\times }_{3}{U}^{(3)}{\times }_{4}{U}^{(4)}$$

(1)

where U ⁽¹⁾, U ⁽²⁾, U ⁽³⁾, U ⁽⁴⁾ are orthogonal matrices that contain the orthonormal vectors spanning the column space of the matrix Θ. Here, the symbol ×_n stands for the n-th mode tensor product. The core tensor S _hosvd is not necessarily diagonal matrix, which implies that each dimension of the tensor can have a different rank. Generally, noise and artifacts can be attenuated by approximating the rank of the core tensor. However, the computation of the best rank approximation requires an iterative alternated least-square (ALS) algorithm and is quite time consuming⁴⁴. Moreover, the approximation obtained from simple truncation has been proved to be in most cases quite similar to the optimal approximation⁴⁵. Base on this, in our study, we apply a soft-thresholding strategy on the HOSVD coefficients to reduce noise and artifacts. The thresholding of the rank coefficients can be represented as follows:

$${\hat{S}}_{hosvd}={H}_{\tau }({S}_{hosvd})$$

(2)

where H _τ denotes the soft-thresholding operator with threshold τ. Then a new tensor can be synthesized by the inverse HOSVD transformation with truncated coefficients ${\hat{S}}_{hosvd}$ and the orthogonal matrices U ⁽¹⁾, U ⁽²⁾, U ⁽³⁾, U ⁽⁴⁾:

$${{\rm{\Theta }}}_{hosvd}^{\ast }={\hat{S}}_{hosvd}{\times }_{1}{U}^{(1)}{\times }_{2}{U}^{(2)}{\times }_{3}{U}^{(3)}{\times }_{4}{U}^{(4)}$$

(3)

The above operation is repeated for each reference cube, thus provides multiple estimates at the same coordinate. For this reason, the final estimates are aggregated by weighted averaging all the obtained block-wise estimates that overlapped at each voxel.

MgSS based 4D-CBCT reconstruction

To incorporate the motion tracking induced blocky spatiotemporal sparsity into the CBCT reconstruction framework, in this section, we formulate the following minimization scheme:

$$\begin{array}{c}{\rm{miniminze}}{}_{f,u}{\rm{\Phi }}({T}_{u}\cdot f)\\ s.t.{\Vert Af-y\Vert }^{2} < \delta \end{array}$$

(4)

where f represents the estimated dynamic images, A is the CBCT imaging system matrix with elements of a _ij, y denotes the projection measurements. Operator Φ is the cube based sparsity penalties, while ${T}_{u}\cdot f$ denotes the dataset that includes a sequence of 4D matrix of the tracked cubes. Operator T _u includes two steps: (1) Track three-dimensional blocks with the motion vector, and (2) Rearrange the three-dimensional blocks into 4D matrix.

To optimize the problem in (4), in this work, inspired by the techniques used by Pan et al.⁴⁶, we present an efficient way to solve (4), which can be summarized as:

(1) SART⁴⁷ step:

$${f}_{j}^{(n+1/2)}={f}_{j}^{(n)}+\lambda (\sum _{i}[{a}_{ij}\frac{{y}_{i}-\sum _{n=1}^{N}{a}_{in}\,{f}_{j}^{(n)}}{\sum _{n=1}^{N}{a}_{in}}]/\sum _{i}{a}_{ij})$$

(5)

Here λ is an over-relaxed factor.

(2) MgSS step:${f}^{(n+1)}=W({\tilde{{\rm{\Theta }}}}_{MgSS}^{\ast }({f}^{(n+1/2)}))$ Here, ${\tilde{{\rm{\Theta }}}}_{MgSS}^{\ast }({f}^{(n+1/2)})$ denotes the restored block-based clusters based on ${f}^{(n+1/2)}$ by using the HOSVD approach

$${{\rm{\Theta }}}_{MgSS}^{\ast }={\hat{S}}_{MgSS}{\times }_{1}{U}^{(1)}{\times }_{2}{U}^{(2)}{\times }_{3}{U}^{(3)}{\times }_{4}{U}^{(4)}$$

(6)

W is a weighted averaging operator indicates how the cubes are merged back into the image. The elements of W is the reciprocal of the times which one pixel was overlapped by different cubes.

In our implementation, the MgSS step is carried out after 10^th iteration of SART step until ${|{f}^{(n)}-{f}^{(n-1)}|}^{2} < \xi $ or reached the predetermined iterative number. Also one implicit motion vector field estimation step was performed using the intermediate reconstructions of SART.

Results

Data acquisition of digital simulation

NCAT phantom based simulation

In this work, the 4D NURBS-based Cardiac-Torso (NCAT)⁴⁸ phantom, which is capable of providing the realistic model of human anatomy and simulating cardiac and respiratory motion simultaneously, was used for data simulation. A dynamic phantom with ten respiratory phases and breathing period set to be 5 s was generated. The maximum diaphragm motion and the maximum chest anterior–posterior motion is 20 mm and 5 mm. We manually added several tumors with different sizes, contrast and shapes in the right lung field of phantom to test the robustness of the MgSS algorithm. The diameters of the spherical tumors were: 6 mm, 10 mm, 16 mm and 22 mm. The long diameter and short diameter of the non-spherical tumor is: 28 mm and 22 mm. For all simulations, the size of the digital phantom is 256 × 256 × 150, with voxel size of 2 × 2 × 2 mm³. The projections were generated by utilizing fast ray-tracing technique. The projection size of each angular view is 300 × 200 with detector pixel size 2 × 2 mm². In the process of simulation, projection views are evenly distributed over 360 degrees with the projection number of each phase range from 21 to 51. The noisy signal S _i at each detector bin i was simulated based on the Poisson noise model:

$${S}_{i}=Poisson({I}_{0}\,\exp (-{y}_{i}))+Normal(0,{\sigma }_{e}^{2})$$

(7)

Here I ₀ and ${\sigma }_{e}^{2}$ represent the incident x-ray intensity and the background noise, respectively. I ₀ is set to be 2 × 10⁶ and ${\sigma }_{e}^{2}$ is chosen to be 10.

4DCT based simulation

To further evaluate the performance of the MgSS algorithm, the 4DCT based digital phantom simulation was also performed. The 4DCT images were acquired on a 16-slice helical CT scanner (Brilliance Big Bore, Philips Medical Systems, Andover, MA). The three dimensional CT volumes at each phase were first interpolated to be isotropic data set with voxel size 1.0254 × 1.0254 × 1.0254 mm³. Then CB projections were computed from the reference 3D CT image using the projection matrix. The scan geometry was chosen according to the Varian On-Board Imager® and True Beam™ CBCT units.

Data acquisition of patient

The patient data was downloaded from an open data website (http://wiki.openrtk.org/index.php/ RTK/Examples/MCCBCTReconstruction). The cone-beam projections were acquired on the Elekta Synergy system. The clinical dataset consisted of 644 projections and were sorted to 10 phases based on the AS method⁴⁹. The size of the digital flat panel is 512 × 512 with the pixel size of 0.8 × 0.8 mm². In our experiments, the isotropic reconstruction target resolution was set to be 1.2 × 1.2 × 1.2 mm³ on a 256 × 256 × 200 matrix.

Evaluation metrics

To quantitative evaluate the performance of the proposed algorithm, we calculate the relative root mean square error (rRMSE) between the phantom images and the reconstructions. The rRMSE is defined as:

$${\rm{rRMSE}}=\sqrt{\sum _{m=1}^{Q}{|{f}_{m}-{f}_{{\rm{p}},m}|}^{2}/\sum _{m=1}^{Q}{|{f}_{{\rm{p}},m}|}^{2}}$$

(8)

Here, f denotes the target image, f _p denotes the phantom image, and m is the voxel index.

The universal quality index⁵⁰ (UQI) index was utilized to conduct region of interest (ROI) based analysis by evaluating the degree of similarity between the reconstructed and the reference images. We select ROIs including the tumor and lung details within the reconstructed and reference images, the mean, variance and covariance of intensities in the ROIs can be respectively calculated as:

$$\bar{f}=\frac{1}{Q}\sum _{m=1}^{Q}{f}_{m},\quad \,\,{\sigma }^{2}=\frac{1}{Q-1}\sum _{m=1}^{Q}{({f}_{m}-\bar{f})}^{2}$$

(9)

$${\bar{f}}_{ture}=\frac{1}{Q}\sum _{m=1}^{Q}{f}_{ture,m},{\sigma }_{ture}^{2}=\frac{1}{Q-1}\sum _{m=1}^{Q}{({f}_{ture,m}-{\bar{f}}_{ture})}^{2}$$

(10)

$$cov(f,{f}_{ture})=\frac{1}{Q-1}\sum _{m=1}^{Q}({f}_{m}-\bar{f})({f}_{ture,m}-{\bar{f}}_{ture})$$

(11)

the f _ture denotes the golden standard image, m is the voxel index, and Q denotes the number of voxels within the ROI. The UQI can be calculated as follows:

$$UQI=\frac{2cov(f,{f}_{ture})}{{\sigma }^{2}+{\sigma }_{ture}^{2}}\frac{2{\bar{f}}_{ture}\bar{f}}{{\bar{f}}^{2}+{\bar{f}}_{ture}^{2}}$$

(12)

UQI measures the intensity similarity between the two images, and its value ranges from zero to one. A UQI value closer to one suggests better similarity to the reference image.

Digital NCAT phantom study

Visual inspection

Figure 2 shows the results of the NCAT phantom reconstructed by using different methods at transverse, coronal, and sagittal planes for phase #1 with the projection number set to be 21. The columns one to three show the transverse, coronal, and sagittal images, respectively. First row in Fig. 2 shows the designed digital phantom images. The second row shows the results which were reconstructed by using the FDK algorithm from all projections. As we can see, the motion blurring artifacts are obvious. The third row shows the phase correlated 4D-CBCT images reconstructed by the FDK algorithm, and we can see that the FDK reconstructions are full of noise and streak artifacts. For the SART-TV reconstructions, the fine structures inside the lung area are severely blurred though the view aliasing artifacts are suppressed. Compared to FDK and SART-TV algorithms, the proposed MgSS approach can yield images with superior quality. Moreover, Figs 3 and 4 demonstrate the reconstructions of -31 and -51 views, which further illustrate the gains of the proposed MgSS approach.

Quantitative evaluation

Table 1 presents the results of rRMSE measures with the projection views range from 21 to 51 for each phase. The rRMSEs of ten phases reconstructions by using the proposed MgSS reduced by an average of 43% compared to those of FDK reconstructions. It can be obviously seen that the MgSS algorithm can achieve smaller rRMSE values compared to the other algorithms, which suggests the promising performance of the proposed MgSS approach.

Table 1 rRMSE measures on the reconstructions of the FDK, SART-TV and MgSS reconstructions, respectively.

Full size table

Figure 5 illustrates the horizontal profiles of the transverse plane images in Fig. 2. It can be observed that the profiles obtained from the MgSS reconstructions match much better than the others, which suggest that the present MgSS approach achieves more noticeable gains than other approaches.

Motion trajectory accuracy

Because of the various applications of IGRT is interested in the tumor position information, we are devoted to extract the motion trajectories from different reconstruction schemes: FDK, SART-TV and the proposed MgSS algorithms. We define the motion trajectories which extracted from the NCAT phantom with high-contrast tumor as a reference. In this paper, we extract the movement information from the center of tumor. The motion trajectories have been showed in Fig. 6. As we can see, the motion information of tumor extracted from MgSS algorithm matches well with the reference trajectory, while the trajectories extracted from the FDK diverges the reference trajectory.

Low contrast lesion detection study

To test the robustness of the MgSS algorithm in reproducing the low contrast lesion, Figs 7 and 8 present the reconstructions of NCAT phantom with different tumor sizes and shapes at the transverse and coronal planes. The first row shows the designed phantom image used for visual comparison. The second to fourth rows show 4D-CBCT at the begin-expiration phase reconstructed by using FDK, SART-TV and proposed MgSS algorithms, respectively. It can be seen, the low contrast tumors in the NCAT phantom can’t be completely rebuilt by the FDK and SART-TV algorithms. The morphology of tumors is partly destroyed. The phenomenon is particularly evident for tumors with small size. On the contrast, the MgSS algorithm can mostly recover the tumors with relatively complete morphological structures. Furthermore, Fig. 9 presents the UQI test on the ROIs shown in Fig. 8. All the results suggest that, with the introduction of phase-correlated information, the MgSS algorithm has better lesion detection ability than algorithms that only using the single phase information.

Parameter selection

In our method, there are two parameters need to be tuned: (1) the tracked cube size; (2) the soft-thresholding coefficients for the HOSVD processing. The cube size plays an important role in our work as it not only directly affects the accuracy of the results, but also affects the computation time. Traditionally, for the block-based techniques, the size of the cube is an empirical parameter specified by the user. A reasonable cube size can help us attend to the local structure characteristics while removing undesirable distortions.

To study the influence of the cube size on the proposed algorithm, we experimentally change the size of cube to get the associated reconstructions and calculate the rRMSEs between the reconstructions and the designed phantom image. We reconstruct the NCAT phantom from -21 projections with block size set from 3 × 3 × 3 to 23 × 23 × 23. Figure 10 shows the reconstructions of cube size to be 5 × 5 × 5, 7 × 7 × 7, 9 × 9 × 9, 13 × 13 × 13, 17 × 17 × 17, 23 × 23 × 23. As shown in Fig. 10, on one hand, boundary distortion could be observed when the cube size is too small. On the other hand, the image blur increases with the increase of the cube size. This phenomenon is due to the large cube contains too much structural information, the movement of central voxel in the cube is not enough to describe the whole cube. Thus, to generate high quality 4D-CBCT image, appropriate size of the cube is needed. Moreover, as shown in Fig. 11, the averaged rRMSEs of ten phases reconstructions indicate that with the cube size range from 7 × 7 × 7 to 11 × 11 × 11, we can get relative smaller rRMSEs. In order to balance the reconstruction quality and computational time, in our other experiments, the cube size was fixed to 9 × 9 × 9.

For the parameter of thresholding coefficients for the HOSVD processing, we choose with a threshold of $\sigma \sqrt{2\,\mathrm{log}\,{p}^{2}}$ to manipulate the coefficients of core tensor. This is also an experiential selection as used in other works⁵¹. Here, σ is defined as the standard deviation of a uniform region in the intermediary images during the iteration and p is the cube size.

Algorithm convergence

To validate and analyze the convergence of the present MgSS method, the $|{f}({\bf{n}})-{f}({\bf{n}}-1)|$ (absolute value of differences between two adjacent estimations) measuring on the entire to-be-reconstructed NCAT phantom image were performed. Figure 12 shows the $|{f}({\bf{n}})-{f}({\bf{n}}{-}1)|$ measures with respect to the number of iterations. Results show that the present MgSS algorithm can yield a steadily convergence solution.

Influence of motion tracking

To demonstrate the effect of the estimated motion fields on the reconstruction, we plot the rRMSE measures as a function of the number of iterations for the MgSS with and without motion tracking. Figure 13 shows the benefits of motion guidance, as MgSS with motion tracking can obtain reduced rRMSE as compared to MgSS without motion tracking. Figure 13 also illustrates the convergence of the proposed MgSS algorithm.

Realistic 4D CT based digital phantom study

Figure 14 shows the results of the realistic digital phantom reconstructed by using different methods at transverse, coronal, and sagittal planes for phase #1 with the projection number set to be 21. First column in Fig. 14 shows the 4DCT images using as the golden standard for comparison. The second column shows 4D-CBCT images reconstructed by the FDK algorithm. It can be seen that the FDK reconstructions are seriously contaminated by noise and artifacts, and some anatomical structures can’t be clearly seen. For the SART-TV reconstruction, some fine structures have been erased though most of the view aliasing artifacts are suppressed. Compared to FDK and SART-TV algorithms, the proposed MgSS approach can yield images with superior quality. In addition, the UQI test between the reconstructions and golden standard image were calculated. Figure 15 shows the test results of then phases at the transverse, sagittal and coronal planes, separately. For then phases, the UQI values of MgSS reconstructions are always higher than 0.95, which suggests the promising performance of the proposed MgSS algorithm.

Patient study

Figure 16 shows the representative reconstructions of patient data by using the FDK, SART-TV and MgSS methods. Each row of Fig. 16 shows the image of 4D-CBCT at different phase in respiratory cycle: 20–30%, 40–50%, 60–70%, 80–90%. The first column in Fig. 16 shows the sagittal view of 4D-CBCT images reconstructed by conventional FDK. Because of the limited number of projections at each phase, severe noise and artifacts present in the FDK reconstruction. The second column of Fig. 16 shows the results of SART-TV algorithm. Although noise and artifacts have been remarkably reduced, details within the lung area and edges of bony structure can’t be clearly seen. Last column of Fig. 16 shows the images reconstructed by MgSS method, from which we can observe noise is suppressed. The boundaries of bony structures as well as fine structures inside the lung are well preserved. To further illustrate the performance of our algorithm, zoomed images of the tumor areas in the then phasesof the reconstructions by using different reconstruction are presented (Fig. 17).

Discussion

In this work, we developed a MgSS algorithm to improve the image quality of 4D-CBCT. This algorithm is developed on the assumption that there exists high structural similarity between the images of neighbored phases, which is similar with Chen’s work on dynamic MR reconstruction⁴³. In Chen’s work, two dimensional patches are tracked with estimated motion maps and then SVD was used to decompose the tacked cluster. For the standard SVD, the image blocks are manually vectorized and then the image structural properties in the spatial domain are ignored. For the proposed MgSS algorithm, the 3D-MVFs between phases were utilized to track the three dimensional cubes and then form the cluster Θ_MgSS with the size of ${N}_{b}\times {N}_{b}\times {N}_{b}\times {N}_{t}$. To preserve the structural properties, in this work, the higher order singular value decomposition which is able to directly decompose dynamic datasets into a multidimensional singular matrix rather than unfolding the cubes into column vectors, was used to process the four dimensional cluster Θ_MgSS. The MgSS algorithm can reveal the fine details shared by grouped blocks and preserve the essential unique features of each individual block. Results of digital phantom and patient data demonstrate the proposed approach can significantly suppress the view aliasing artifacts and noise.

One important step in the proposed MgSS algorithm is: cube based motion tracking. We utilize the Real-Time Image-based Tracker (RTIT)³⁶ toolbox to obtain the voxel-by-voxel displacement maps between 3D images of different phases. Specifically, we rely mainly on the displacement of the center voxel of cube in the current phase to determine the cube in the next phase. We assume the displacements of region are changing smoothly. Thus, the central voxel is sufficient to describe the cube motion. And in this case, the accuracy of 3D-MVFs estimation would play an important role in finding cubes with similar structures. In this work, the initial 3D-MVFs are generated after the first SART iteration, which may not so accurate. Notably, as the iteration goes on, the 3D-MVFs would be updated with intermediate reconstructions, and therefore both the image quality and the accuracy of the 3D-MVFs will be improved. Other approaches such like the 3D-MVFs initialized with those got from the 4D planning CT of the same patient can be considered. There is a very important parameter need to be tuned for the proposed MgSS approach: the tracked cube size. A large size cube may include more details and local geometry, but on the other side, if the cube size is too large, the central voxel will be insufficient to describe the whole cube motion and also result in increasing computation time. Thus, the proper selection of the cube size is critical. In our studies, we change the block size from 3 × 3 × 3 to 23 × 23 × 23 and generate the reconstructions as shown in Fig. 10 and rRMSEs between the reconstructions and the phantom images. For visual inspection, too small or large cube size would lead to image blur or distortion. The rRMSE results suggest that when the cube size set to be 9 × 9 × 9, we can get relative higher quality image with smaller rRMSE for the NCAT phantom. Also as illustrated in other studies of HOSVD technique, the appropriate selection of the cube size should base on the structural complexity of the target image and may be different for different images. But overall, in our work for lung CT imaging, we found the cube size set to be 9 × 9 × 9 is robust enough to balance the structural information and computation time. Another issue of the proposed MgSS algorithm is the heavy computation burden. It takes about 30 minutes on a desktop computer (3.60 GHz Intel(R) i7 CPU with 8GB RAM) to run one iteration. On one hand, as mentioned above, we can short the computation time with a relatively small cube size as well as an elegant initialization. On the other hand, we can use a step of N_step pixels in transverse, coronal, and sagittal directions, respectively. Hence, the number of overlapping cubes is decreased by 1/N_step ³. But with large N_step or large motion, gaps between moved blocks may appear, then a mask of the uncovered areas (gaps) on the n-th frame after block motion tracking can be detected and non-motion tracking is performed for the gap blocks to avoid potential additional gaps. Other techniques including using the graphics processing unit (GPU) can be also considered to short the computation time.

In summary, we have developed a MgSS algorithm to improve the image quality of 4D-CBCT. This method effectively utilizes the correlated information from other phases to reconstruct any particular phase of 4D-CBCT. By enforcing the regional spatiotemporal sparsity on the tracked cubes, noise and artifacts can be suppressed and the image quality of 4D-CBCT can be substantially improved.

References

Jaffray, D. A., Siewerdsen, J. H., Wong, J. W. & Martinez, A. A. Flat-panel cone-beam computed tomography for image-guided radiation therapy. International Journal of Radiation Oncology* Biology* Physics 53, 1337–1349 (2002).
Article Google Scholar
Oldham, M. et al. Cone-beam-CT guided radiation therapy: A model for on-line application. Radiotherapy and oncology 75, 271. E1–271. E8 (2005).
Article Google Scholar
Pouliot, J. et al. Low-dose megavoltage cone-beam CT for radiation therapy. International Journal of Radiation Oncology* Biology* Physics 61, 552–560 (2005).
Article Google Scholar
Chen, G. T., Kung, J. H. & Beaudette, K. P. Artifacts in computed tomography scanning of moving objects. Semin Radiat Oncol 14, 19–26 (2004).
Article PubMed Google Scholar
Sonke, J., Zijp, L., Remeijer, P. & van Herk, M. Respiratory correlated cone beam CT. Medical physics 32, 1176–1186 (2005).
Article ADS PubMed Google Scholar
Li, T. et al. Four-dimensional cone-beam computed tomography using an on-board imager. Medical physics 33, 3825–3833 (2006).
Article ADS PubMed Google Scholar
Lu, J. et al. Four-dimensional cone beam CT with adaptive gantry rotation and adaptive data sampling. Medical physics 34, 3520–3529 (2007).
Article ADS PubMed Google Scholar
Ford, E. C., Mageras, G. S., Yorke, E. & Ling, C. C. Respiration-correlated spiral CT: a method of measuring respiratory-induced anatomic motion for radiation treatment planning. Medical physics 30, 88–97 (2003).
Article ADS CAS PubMed Google Scholar
Low, D. A. et al. A method for the reconstruction of four-dimensional synchronized CT scans acquired during free breathing. Medical physics 30, 1254–1263 (2003).
Article ADS PubMed Google Scholar
Pan, T., Lee, T., Rietzel, E. & Chen, G. T. 4D-CT imaging of a volume influenced by respiratory motion on multi-slice CT. Medical physics 31, 333–340 (2004).
Article ADS PubMed Google Scholar
Feldkamp, L. A., Davis, L. C. & Kress, J. W. Practical cone-beam algorithm. JOSA A 1, 612–619 (1984).
Article ADS Google Scholar
Li, T., Koong, A. & Xing, L. Enhanced 4D cone-beam CT with inter-phase motion model. Medical physics 34, 3688–3695 (2007).
Article ADS PubMed Google Scholar
Li, T. & Xing, L. Optimizing 4D cone-beam CT acquisition protocol for external beam radiotherapy. International Journal of Radiation Oncology* Biology* Physics 67, 1211–1219 (2007).
Article Google Scholar
Li, T., Schreibmann, E., Yang, Y. & Xing, L. Motion correction for improved target localization with on-board cone-beam computed tomography. Phys Med Biol 51, 253–67 (2006).
Article CAS PubMed Google Scholar
Zhang, H. et al. Few-view cone-beam CT reconstruction with deformed prior image. Med Phys 41, 121905 (2014).
Article PubMed Google Scholar
Wang, J., Li, T. & Xing, L. Iterative image reconstruction for CBCT using edge-preserving prior. Medical physics 36, 252–260 (2009).
Article ADS PubMed Google Scholar
Brock, R. S., Docef, A. & Murphy, M. J. Reconstruction of a cone-beam CT image via forward iterative projection matching. Medical physics 37, 6212–6220 (2010).
Article ADS PubMed PubMed Central Google Scholar
Jia, X., Dong, B., Lou, Y. & Jiang, S. B. GPU-based iterative cone-beam CT reconstruction using tight frame regularization. Physics in medicine and biology 56, 3787 (2011).
Article ADS PubMed Google Scholar
Sidky, E. Y. & Pan, X. Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization. Physics in medicine and biology 53, 4777 (2008).
Article ADS PubMed PubMed Central Google Scholar
Sidky, E. Y. et al. Enhanced imaging of microcalcifications in digital breast tomosynthesis through improved image-reconstruction algorithms. Medical Physics 36, 4920–32 (2009).
Article ADS PubMed PubMed Central Google Scholar
Kinnon, G. C. M. & Bates, R. H. T. Towards Imaging the Beating Heart Usefully with a Conventional CT Scanner. IEEE Transactions on Biomedical Engineering BME-28, 123–127 (1981).
Article Google Scholar
Leng, S. et al. Streaking artifacts reduction in four-dimensional cone-beam computed tomography. Medical Physics 35, 4649–59 (2008).
Article ADS PubMed PubMed Central Google Scholar
Leng, S. et al. High temporal resolution and streak-free four-dimensional cone-beam computed tomography. Physics in Medicine & Biology 53, 5653–73 (2008).
Article ADS Google Scholar
Bergner, F. et al. Autoadaptive phase-correlated (AAPC) reconstruction for 4D CBCT. Medical physics 36, 5695–5706 (2009).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Kida, S., Masutani, Y., Nakano, M., Imae, T. & Haga, A. Improvement of 4D Cone-beam CT image quality via iterative reconstruction method. Ieice Technical Report 112, 73–75 (2012).
Google Scholar
Bergner, F. et al. An investigation of 4D cone-beam CT algorithms for slowly rotating scanners. Medical Physics 37, 5044–5053 (2010).
Article ADS PubMed Google Scholar
Haralick, R. M. Statistical and structural approaches to texture. Proceedings of the IEEE 67, 786–804 (1979).
Article Google Scholar
Lee, J. S. Digital Image Enhancement and Noise Filtering by Use of Local Statistics. IEEE Transactions on Pattern Analysis & Machine Intelligence 2, 165–168 (1980).
Article ADS CAS Google Scholar
Aharon, M., Elad, M. & Bruckstein, A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing 54, 4311–4322 (2006).
Article ADS MATH Google Scholar
Buades, A., Coll, B. & Morel, J. M. A Review of Image Denoising Algorithms, with a New One. Siam Journal on Multiscale Modeling & Simulation 4, 490–530 (2005).
Article MathSciNet MATH Google Scholar
Dabov, K., Foi, A., Katkovnik, V. & Egiazarian, K. Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Transactions on Image Processing 16, 2080–95 (2007).
Article ADS MathSciNet PubMed Google Scholar
Liu, C. & Freeman, W. T. A high-quality video denoising algorithm based on reliable motion estimation. European Conference on Computer Vision 706–719 (2010).
Dwivedi, A. & Shrivastava, S. K. An efficient and fast patch reordering approach for image denoising without losing structural information. International Conference on Computer Communications 1–6 (2014).
Parameswaran, S., Luo, E. & Nguyen, T. Q. Patch Matching for Image Denoising Using Neighborhood-based Collaborative Filtering. IEEE Transactions on Circuits and Systems for Video Technology, 1–1 (2016).
Costantini, R., Sbaiz, L. & Süsstrunk, S. Higher order SVD analysis for dynamic texture synthesis. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 17, 42–52 (2008).
Article MathSciNet PubMed Google Scholar
Ries, M. et al. Real-time 3D target tracking in MRI guided focused ultrasound ablations in moving tissues. Magn Reson Med 64, 1704–12 (2010).
Article PubMed Google Scholar
Zachiu, C., Senneville, B. D. D., Moonen, C. & Ries, M. A framework for the correction of slow physiological drifts during MR-guided HIFU therapies: Proof of concept. Medical Physics 42 (2015).
Zachiu, C., Papadakis, N., Ries, M., Moonen, C. & Denis, D. S. B. An improved optical flow tracking technique for real-time MR-guided beam therapies in moving organs. Physics in Medicine & Biology 60, 9003–29 (2015).
Article ADS CAS Google Scholar
Lingala, S. G., Hu, Y., Dibella, E. & Jacob, M. Accelerated Dynamic MRI Exploiting Sparsity and Low-Rank Structure: k-t SLR. IEEE Transactions on Medical Imaging 30, 1042–1054 (2011).
Article PubMed PubMed Central Google Scholar
Cai, J. F., Jia, X., Gao, H. & Jiang, S. B. Cine Cone Beam CT Reconstruction Using Low-Rank Matrix Factorization: Algorithm and a Proof-of-Principle Study. IEEE Transactions on Medical Imaging 33, 1581–1591 (2014).
Article PubMed Google Scholar
Akcakaya, M. et al. Low-dimensional-structure self-learning and thresholding: regularization beyond compressed sensing for MRI reconstruction. Magn Reson Med 66, 756–67 (2011).
Article PubMed PubMed Central Google Scholar
Vitanis, V. et al. High resolution three-dimensional cardiac perfusion imaging using compartment-based k-t principal component analysis. Magn Reson Med 65, 575–87 (2011).
Article PubMed Google Scholar
Chen, X., Salerno, M., Yang, Y. & Epstein, F. H. Motion‐compensated compressed sensing for dynamic contrast‐enhanced MRI using regional spatiotemporal sparsity and region tracking: Block low‐rank sparsity with motion‐guidance (BLOSM). Magnetic resonance in medicine 72, 1028–1038 (2014).
Article PubMed Google Scholar
Lathauwer, L. D., Moor, B. D. & Vandewalle, J. On the Best Rank-1 and Rank-(R 1, R 2, …, R N) Approximation of Higher-Order Tensors. Siam Journal on Matrix Analysis & Applications 21, 1324–1342 (2000).
Article MathSciNet MATH Google Scholar
Pan, H., Huang, T. Z. & Ma, T. Two-step group-based adaptive soft-thresholding algorithm for image denoising. Optik - International Journal for Light and Electron Optics 127, 503–509 (2016).
Article Google Scholar
Jorgensen, J. S., Sidky, E. Y. & Pan, X. Quantifying admissible undersampling for sparsity-exploiting iterative image reconstruction in X-ray CT. IEEE Trans Med Imaging 32, 460–73 (2013).
Article PubMed Google Scholar
Andersen, A. H. & Kak, A. C. Simultaneous algebraic reconstruction technique (SART): a superior implementation of the ART algorithm. Ultrasonic imaging 6, 81–94 (1984).
Article CAS PubMed Google Scholar
Segars, W. P. et al. Development and application of the new dynamic Nurbs-based Cardiac-Torso (NCAT) phantom (2001).
Zijp, L., Sonke, J. J. & Herk, M. V. Extraction of the Respiratory Signal from Sequential Thorax Cone-Beam X-Ray Images. In ICCR (2004).
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 13, 600 (2004).
Article PubMed Google Scholar
Dabov, K., Foi, A. & Egiazarian, K. Image denoising with block-matching and 3D filtering. Proceedings of SPIE - The International Society for Optical Engineering 6064, 354–365 (2006).
ADS Google Scholar

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of Chinaunder Grants (No. 81501466, No. 61571214, No. 81371544), the Natural Science Foundation of Guangdong Provinceunder Grants (No. 2015A030310018), theGuangzhou Science and Technology Project (No. 201710010099). The authors thank the RTK Group very much for the supplying of the opensource data of CBCT. The authors would also like to thank the anonymous reviewers for their constructive comments and suggestions that greatly improved the quality of the manuscript.

Author information

Authors and Affiliations

Guangdong Provincial Key Laboratory of Medical Image Processing, Southern Medical University, Guangzhou, Guangdong, 510515, China
Yang Liu, Xi Tao, Jianhua Ma, Zhaoying Bian, Dong Zeng, Qianjin Feng, Wufan Chen & Hua Zhang

Authors

Yang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xi Tao
View author publications
You can also search for this author in PubMed Google Scholar
Jianhua Ma
View author publications
You can also search for this author in PubMed Google Scholar
Zhaoying Bian
View author publications
You can also search for this author in PubMed Google Scholar
Dong Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Qianjin Feng
View author publications
You can also search for this author in PubMed Google Scholar
Wufan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Hua Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.L. and H.Z. wrote the main manuscript text, Y.L., H.Z. and Z.B. did the whole experiments, X.T., D.Z., Q.F., J.M. and W.C. revised the manuscript. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Wufan Chen or Hua Zhang.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, Y., Tao, X., Ma, J. et al. Motion guided Spatiotemporal Sparsity for high quality 4D-CBCT reconstruction. Sci Rep 7, 17461 (2017). https://doi.org/10.1038/s41598-017-17668-5

Download citation

Received: 19 May 2017
Accepted: 29 November 2017
Published: 12 December 2017
DOI: https://doi.org/10.1038/s41598-017-17668-5

This article is cited by

Intracellular-to-extracellular localization switch of acidic lipase in Enterobacter cloacae through multi-objective medium optimization: aqueous two-phase purification and activity kinetics
- Atim Asitok
- Maurice Ekpenyong
- Sylvester Antai
World Journal of Microbiology and Biotechnology (2022)
Motion compensated micro-CT reconstruction for in-situ analysis of dynamic processes
- Thomas De Schryver
- Manuel Dierick
- Matthieu N. Boone
Scientific Reports (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.