Abstract
Conventional conebeam computed tomography is often deteriorated by respiratory motion blur, which negatively affects target delineation. On the other side, the four dimensional conebeam computed tomography (4DCBCT) can be considered to describe tumor and organ motion. But for current onboard CBCT imaging system, the slow rotation speed limits the projection number at each phase, and the associated reconstructions are contaminated by noise and streak artifacts using the conventional algorithm. To address the problem, we propose a novel framework to reconstruct 4DCBCT from the undersampled measurements—Motion guided Spatiotemporal Sparsity (MgSS). In this algorithm, we try to divide the CBCT images at each phase into cubes (3D blocks) and track the cubes with estimated motion field vectors through phase, then apply regional spatiotemporal sparsity on the tracked cubes. Specifically, we recast the tracked cubes into fourdimensional matrix, and use the higher order singular value decomposition (HOSVD) technique to analyze the regional spatiotemporal sparsity. Subsequently, the blocky spatiotemporal sparsity is incorporated into a cost function for the image reconstruction. The phantom simulation and real patient data are used to evaluate this algorithm. Results show that the MgSS algorithm achieved improved 4DCBCT image quality with less noise and artifacts compared to the conventional algorithms.
Introduction
Threedimensional conebeam computed tomography (3DCBCT) has been widely used in image guided radiation therapy (IGRT)^{1,2,3}. It can provide the volumetric information for tumor localization in IGRT. But for thoracic and upper abdominal regions, the 3DCBCT image is often deteriorated by motion blur^{4}. To address the problem, fourdimensional CBCT (4DCBCT) imaging that incorporating temporal (phase) information on the basis of 3DCBCT was proposed^{5,6,7}. Compared to 3DCBCT imaging, 4DCBCT can provide the patientspecific respiratory motion information and multiple threedimensional volumes to represent the different status in the breathing cycle^{8,9,10}. For 4DCBCT imaging, the respiratory signal is recorded or estimated and motion contained conebeam projections are usually sorted into 8–10 subsets according to the respiratory signal. However, the gantry rotation speed and frame rate of the flatpanel imager limit the total number of conebeam projections (usually 600~800), which result in relatively fewer projections in each respiratory phase. Consequently, the reconstructed CBCT images by using the conventional Feldkamp–Davis–Kress (FDK) algorithm^{11} suffer from significant artifacts and noise^{12,13}. In addition, the randomness of breathing would lead to the conebeam projections bunched into several clusters, and the bunched sampling scheme will aggravate the noises and artifacts level in the reconstructed images^{14}.
To address this problem, various strategies have been proposed to improve the image quality of 4DCBCT^{15}. On one side, increasing the number of angular sampling, multiplegantry rotation and slowgantry rotation schemes were structured^{7,13}. But these schemes will prolong the time of data acquisition and increase the risk of motion embracing. On the other side, lots of reconstruction algorithms^{16,17,18} have been proposed to improve the quality of 4DCBCT image, such like the total variation minimization based algorithms^{15,19,20}, the McKinnon–Bates (MKB) algorithm^{21,22}, the prior image constrained compressed sensing (PICCS) algorithm^{23} and the autoadaptive phase correlation (AAPC) algorithm^{24}. As shown in the study by Frank Bergnera et al., the reconstruction algorithms using an iterative scheme can remarkably reduce the 4DCBCT specific artifacts^{25}. Nevertheless, algorithms that use the full data set, at least for initialization, such as MKB and PICCS algorithm, are only a tradeoff and achieve suboptimal temporal resolution, of which the residual motion could still be found in the reconstructed 4DCBCT images^{26}. On the other hand, algorithms that only use the projections assigned to the current phase to reconstruct the final image can fully achieve the temporal resolution, such as the total variation minimization based algorithm. For these algorithm, when the projection number is very low or there exist small size and lowcontrast objects, tiny structure may be usually erased with the piecewise smooth constraint.
In recent years, the image restoration or denoising algorithms based on block processing have shown increasing vitality. The earliest concept of ‘patch’ was proposed in Haralick’s study on textural features for image classification^{27}. In 1980, JS Lee pioneeringly used this concept for image enhancement^{28}. Latterly, algorithms such like the dictionary learning based image reconstruction/restoration approaches^{29}, the nonlocal means filter^{30} and the BM3D algorithm^{31}, et al.^{32,33,34}, have been successfully developed. Local patches often experience much less distortion than the global image and therefore it becomes easier to define the similarity between local patches. Lowrank and sparsity can be better reflected in patch based processing.
In this work, we propose to reconstruct 4DCBCT volumes from the subset projections of current phase and incorporate the image domain phasecorrelated information into the iterative procedure. Motived by the success of block based image restoration, we propose a Motion guided Spatiotemporal Sparsity (MgSS) to formulate the regularization for 4DCBCT reconstruction. In this scheme, CBCT images of different phases are divided into small cubes (threedimensional blocks) and the cubes are tracked with estimated motion vector fields through time (phase). After then, regional spatiotemporal sparsity is applied on the tracked cubes. Specifically, we recast the tracked cubes into fourdimensional matrix, and use the higher order singular value decomposition (HOSVD)^{35} technique to analyze the regional spatiotemporal sparsity. Finally, a cost function is formulated with embedding the block based spatiotemporal sparsity. One simple but effective optimization algorithm was used for the cost function solution.
This paper is structured as follows: In the method part, we first present the flow chart of the proposed MgSS scheme and then give detailed introduction of each step. After this, we formulate the reconstruction framework that incorporates the MgSS scheme. In the experiment part, we exhibit the results of the NCAT phantom simulation data, 4DCT based simulation data and real patient data by using FDK, SARTTV and the MgSS algorithms. Lastly, in the discussion part, we simply discuss the superiority and limitation of the proposed algorithm.
Methods
Flow chart of the proposed MgSS algorithm
In this work, the proposed Motion guided Spatiotemporal Sparsity (MgSS) applying to 4DCBCT reconstruction comprises the following steps;

(1)
Estimate the motion maps for each voxel between adjacent phases of the 4DCBCT images.

(2)
Divide the CBCT sequence images into cubes (3D blocks), and track the cubes through time using the threedimensional motion maps obtained in step 1.

(3)
Stack the tracked cubes and apply regional spatiotemporal sparsity on the tracked cubes by framing the 4DCBCT reconstruction problem to be a constrained optimization problem, and then optimize the problem to get the reconstructions.
In the following sections, we will introduce these steps in detail.
Motion Maps Estimation
In this study, we are not devoted to develop new algorithm to achieve the threedimensional motion vector fields (3DMVFs) between CBCT images of adjacent phases, but rather we use the RealTime Imagebased Tracker (RTIT)^{36} toolbox to estimate the threedimensional motion maps. The RTIT toolbox is an available open source with implementation of the optical flow based registration algorithms^{37,38}. Assume that we obtain the initial/current 4DCBCT estimation, with the RTIT, we can achieve the pixel based motion vector fields consisting of the changes in space coordinates that describe the distribution of the apparent motion velocities of intensity patterns in the sequences of images.
Cubes Tracking with MVFs
For the task of cubes tracking, we define the voxel spacetime position as \(u=(x,\,y,\,z,\,t)\), where x, y, z and t represent the spatial position of the voxel in the CBCT volumetric image and the temporal phase index, respectively. In MgSS, we fetch the cubes (3D blocks) from the images of the first phase and use the 3DMVFs to track structurally similar cubes in the other phases. Specifically, the first phase image was initiated with highly overlapping cubes. In the extreme case, all the voxels in the first phase image can be defined as the central voxel of one cube, but this may result in huge computation burden. Thus in this study, rather than sliding by one voxel to every next, we use a step of N_{step} = 2 voxels to define the cubes. For the motion tracking, considering one cube \(B({u}_{1})\in {{X}_{1}}^{{N}_{b}\times {N}_{b}\times {N}_{b}}\), here \({u}_{1}=({x}_{1},\,{y}_{1},\,{{\rm{z}}}_{1},\,{t}_{1})\) indicates the central pixel position of the cubes with spatial position to be \(({x}_{1},\,{y}_{1},\,{{\rm{z}}}_{1})\) and temporal phase index to be t _{1}. By using Δu to denote the displacement of the central voxel in the cube between t _{1} and t _{2} phases, we can use u _{1} + Δu to track the center pixel location for cube in the t _{2} phase. Considering u _{1} + Δu might be nonintegral, the next block center voxel was taken as u _{2} = {u _{1} + Δu _{2}}, where “{}” is a rounding operation. Extracting the voxels centered from u _{2}, we can constitute a new cube B(u _{2}) with the same block size. Following this schedule, B(u _{ n }) can be tracked with u _{ n }= {u _{1} + Δu _{ n }} using the above blockcentertracking method. A block cluster can be ulteriorly constructed: \({{\rm{\Theta }}}_{MgSS}=[B({u}_{1}),B({u}_{2}),\mathrm{...},B({u}_{Nt})]\). Here the cluster Θ_{ MgSS } is a four dimensional matrix with the size of \({N}_{b}\times {N}_{b}\times {N}_{b}\times {N}_{t}\). Based on the fact that not all the chest is moving during the CBCT scan, the tracked cubes for the static parts should be noisy blocks with same anatomy structures. As shown in Fig. 1, compared to the blocks of the same spatial position in the original phases, the tracked blocks exhibit more mutual similarity.
Regional Spatiotemporal Sparsity
The technology about matrix rank sparsity has been successfully investigated in the field of dynamic image reconstruction, such as the kt SLR method^{39}. In some previous studies^{40}, matrix rank sparsity was often used to dispose the entire image. But the recent studies, such like Lowdimensionalstructure selflearning and thresholding (LOST)^{41} and compartmentbased kt principal component analysis^{42} have indicated that the spatiotemporal sparsity and reconstruction quality could be further promoted by separating the entire image into blocks. Under the patchbased processing theory, decomposition of the tracked regions of dynamic datasets using the singular value decomposition (SVD) algorithm has been reported^{43}. In Chen’s work, blocks in the clusters were vectorized and each cluster Θ was first rearranged into a 2D matrix \(\tilde{{\rm{\Theta }}}\in {{\mathbb{C}}}^{{N}_{s}\times {N}_{t}}({N}_{s}={N}_{b}\times {N}_{b})\). The SVD technique was then adopted to decompose the cluster: \({\tilde{{\rm{\Theta }}}}_{svd}=U{S}_{svd}{V}^{\ast }\). If the cluster is truly spatiotemporal sparsity, a least number of significant values will be found in the singular matrix S _{ svd }. For the standard SVD, the image blocks are manually vectorized and then the image structural properties in the spatial domain are ignored. To address the problem, the higher order singular value decomposition (HOSVD), which is able to directly decompose dynamic datasets into a multidimensional singular matrix rather than unfolding the blocks into column vectors, has been reported^{35}. Also, in this work, we use the HOSVD to decompose the clusters due to its naturality and flexibility. By using the HOSVD technique, a fourdimensional tensor Θ can be decomposed as:
where U ^{(1)}, U ^{(2)}, U ^{(3)}, U ^{(4)} are orthogonal matrices that contain the orthonormal vectors spanning the column space of the matrix Θ. Here, the symbol ×_{n} stands for the nth mode tensor product. The core tensor S _{ hosvd } is not necessarily diagonal matrix, which implies that each dimension of the tensor can have a different rank. Generally, noise and artifacts can be attenuated by approximating the rank of the core tensor. However, the computation of the best rank approximation requires an iterative alternated leastsquare (ALS) algorithm and is quite time consuming^{44}. Moreover, the approximation obtained from simple truncation has been proved to be in most cases quite similar to the optimal approximation^{45}. Base on this, in our study, we apply a softthresholding strategy on the HOSVD coefficients to reduce noise and artifacts. The thresholding of the rank coefficients can be represented as follows:
where H _{ τ } denotes the softthresholding operator with threshold τ. Then a new tensor can be synthesized by the inverse HOSVD transformation with truncated coefficients \({\hat{S}}_{hosvd}\) and the orthogonal matrices U ^{(1)}, U ^{(2)}, U ^{(3)}, U ^{(4)}:
The above operation is repeated for each reference cube, thus provides multiple estimates at the same coordinate. For this reason, the final estimates are aggregated by weighted averaging all the obtained blockwise estimates that overlapped at each voxel.
MgSS based 4DCBCT reconstruction
To incorporate the motion tracking induced blocky spatiotemporal sparsity into the CBCT reconstruction framework, in this section, we formulate the following minimization scheme:
where f represents the estimated dynamic images, A is the CBCT imaging system matrix with elements of a _{ ij }, y denotes the projection measurements. Operator Φ is the cube based sparsity penalties, while \({T}_{u}\cdot f\) denotes the dataset that includes a sequence of 4D matrix of the tracked cubes. Operator T _{ u } includes two steps: (1) Track threedimensional blocks with the motion vector, and (2) Rearrange the threedimensional blocks into 4D matrix.
To optimize the problem in (4), in this work, inspired by the techniques used by Pan et al.^{46}, we present an efficient way to solve (4), which can be summarized as:
(1) SART^{47} step:
Here λ is an overrelaxed factor.
(2) MgSS step:\({f}^{(n+1)}=W({\tilde{{\rm{\Theta }}}}_{MgSS}^{\ast }({f}^{(n+1/2)}))\) Here, \({\tilde{{\rm{\Theta }}}}_{MgSS}^{\ast }({f}^{(n+1/2)})\) denotes the restored blockbased clusters based on \({f}^{(n+1/2)}\) by using the HOSVD approach
W is a weighted averaging operator indicates how the cubes are merged back into the image. The elements of W is the reciprocal of the times which one pixel was overlapped by different cubes.
In our implementation, the MgSS step is carried out after 10^{th} iteration of SART step until \({{f}^{(n)}{f}^{(n1)}}^{2} < \xi \) or reached the predetermined iterative number. Also one implicit motion vector field estimation step was performed using the intermediate reconstructions of SART.
Results
Data acquisition of digital simulation
NCAT phantom based simulation
In this work, the 4D NURBSbased CardiacTorso (NCAT)^{48} phantom, which is capable of providing the realistic model of human anatomy and simulating cardiac and respiratory motion simultaneously, was used for data simulation. A dynamic phantom with ten respiratory phases and breathing period set to be 5 s was generated. The maximum diaphragm motion and the maximum chest anterior–posterior motion is 20 mm and 5 mm. We manually added several tumors with different sizes, contrast and shapes in the right lung field of phantom to test the robustness of the MgSS algorithm. The diameters of the spherical tumors were: 6 mm, 10 mm, 16 mm and 22 mm. The long diameter and short diameter of the nonspherical tumor is: 28 mm and 22 mm. For all simulations, the size of the digital phantom is 256 × 256 × 150, with voxel size of 2 × 2 × 2 mm^{3}. The projections were generated by utilizing fast raytracing technique. The projection size of each angular view is 300 × 200 with detector pixel size 2 × 2 mm^{2}. In the process of simulation, projection views are evenly distributed over 360 degrees with the projection number of each phase range from 21 to 51. The noisy signal S _{ i } at each detector bin i was simulated based on the Poisson noise model:
Here I _{0} and \({\sigma }_{e}^{2}\) represent the incident xray intensity and the background noise, respectively. I _{0} is set to be 2 × 10^{6} and \({\sigma }_{e}^{2}\) is chosen to be 10.
4DCT based simulation
To further evaluate the performance of the MgSS algorithm, the 4DCT based digital phantom simulation was also performed. The 4DCT images were acquired on a 16slice helical CT scanner (Brilliance Big Bore, Philips Medical Systems, Andover, MA). The three dimensional CT volumes at each phase were first interpolated to be isotropic data set with voxel size 1.0254 × 1.0254 × 1.0254 mm^{3}. Then CB projections were computed from the reference 3D CT image using the projection matrix. The scan geometry was chosen according to the Varian OnBoard Imager® and True Beam™ CBCT units.
Data acquisition of patient
The patient data was downloaded from an open data website (http://wiki.openrtk.org/index.php/ RTK/Examples/MCCBCTReconstruction). The conebeam projections were acquired on the Elekta Synergy system. The clinical dataset consisted of 644 projections and were sorted to 10 phases based on the AS method^{49}. The size of the digital flat panel is 512 × 512 with the pixel size of 0.8 × 0.8 mm^{2}. In our experiments, the isotropic reconstruction target resolution was set to be 1.2 × 1.2 × 1.2 mm^{3} on a 256 × 256 × 200 matrix.
Evaluation metrics
To quantitative evaluate the performance of the proposed algorithm, we calculate the relative root mean square error (rRMSE) between the phantom images and the reconstructions. The rRMSE is defined as:
Here, f denotes the target image, f _{ p } denotes the phantom image, and m is the voxel index.
The universal quality index^{50} (UQI) index was utilized to conduct region of interest (ROI) based analysis by evaluating the degree of similarity between the reconstructed and the reference images. We select ROIs including the tumor and lung details within the reconstructed and reference images, the mean, variance and covariance of intensities in the ROIs can be respectively calculated as:
the f _{ ture } denotes the golden standard image, m is the voxel index, and Q denotes the number of voxels within the ROI. The UQI can be calculated as follows:
UQI measures the intensity similarity between the two images, and its value ranges from zero to one. A UQI value closer to one suggests better similarity to the reference image.
Digital NCAT phantom study
Visual inspection
Figure 2 shows the results of the NCAT phantom reconstructed by using different methods at transverse, coronal, and sagittal planes for phase #1 with the projection number set to be 21. The columns one to three show the transverse, coronal, and sagittal images, respectively. First row in Fig. 2 shows the designed digital phantom images. The second row shows the results which were reconstructed by using the FDK algorithm from all projections. As we can see, the motion blurring artifacts are obvious. The third row shows the phase correlated 4DCBCT images reconstructed by the FDK algorithm, and we can see that the FDK reconstructions are full of noise and streak artifacts. For the SARTTV reconstructions, the fine structures inside the lung area are severely blurred though the view aliasing artifacts are suppressed. Compared to FDK and SARTTV algorithms, the proposed MgSS approach can yield images with superior quality. Moreover, Figs 3 and 4 demonstrate the reconstructions of 31 and 51 views, which further illustrate the gains of the proposed MgSS approach.
Quantitative evaluation
Table 1 presents the results of rRMSE measures with the projection views range from 21 to 51 for each phase. The rRMSEs of ten phases reconstructions by using the proposed MgSS reduced by an average of 43% compared to those of FDK reconstructions. It can be obviously seen that the MgSS algorithm can achieve smaller rRMSE values compared to the other algorithms, which suggests the promising performance of the proposed MgSS approach.
Figure 5 illustrates the horizontal profiles of the transverse plane images in Fig. 2. It can be observed that the profiles obtained from the MgSS reconstructions match much better than the others, which suggest that the present MgSS approach achieves more noticeable gains than other approaches.
Motion trajectory accuracy
Because of the various applications of IGRT is interested in the tumor position information, we are devoted to extract the motion trajectories from different reconstruction schemes: FDK, SARTTV and the proposed MgSS algorithms. We define the motion trajectories which extracted from the NCAT phantom with highcontrast tumor as a reference. In this paper, we extract the movement information from the center of tumor. The motion trajectories have been showed in Fig. 6. As we can see, the motion information of tumor extracted from MgSS algorithm matches well with the reference trajectory, while the trajectories extracted from the FDK diverges the reference trajectory.
Low contrast lesion detection study
To test the robustness of the MgSS algorithm in reproducing the low contrast lesion, Figs 7 and 8 present the reconstructions of NCAT phantom with different tumor sizes and shapes at the transverse and coronal planes. The first row shows the designed phantom image used for visual comparison. The second to fourth rows show 4DCBCT at the beginexpiration phase reconstructed by using FDK, SARTTV and proposed MgSS algorithms, respectively. It can be seen, the low contrast tumors in the NCAT phantom can’t be completely rebuilt by the FDK and SARTTV algorithms. The morphology of tumors is partly destroyed. The phenomenon is particularly evident for tumors with small size. On the contrast, the MgSS algorithm can mostly recover the tumors with relatively complete morphological structures. Furthermore, Fig. 9 presents the UQI test on the ROIs shown in Fig. 8. All the results suggest that, with the introduction of phasecorrelated information, the MgSS algorithm has better lesion detection ability than algorithms that only using the single phase information.
Parameter selection
In our method, there are two parameters need to be tuned: (1) the tracked cube size; (2) the softthresholding coefficients for the HOSVD processing. The cube size plays an important role in our work as it not only directly affects the accuracy of the results, but also affects the computation time. Traditionally, for the blockbased techniques, the size of the cube is an empirical parameter specified by the user. A reasonable cube size can help us attend to the local structure characteristics while removing undesirable distortions.
To study the influence of the cube size on the proposed algorithm, we experimentally change the size of cube to get the associated reconstructions and calculate the rRMSEs between the reconstructions and the designed phantom image. We reconstruct the NCAT phantom from 21 projections with block size set from 3 × 3 × 3 to 23 × 23 × 23. Figure 10 shows the reconstructions of cube size to be 5 × 5 × 5, 7 × 7 × 7, 9 × 9 × 9, 13 × 13 × 13, 17 × 17 × 17, 23 × 23 × 23. As shown in Fig. 10, on one hand, boundary distortion could be observed when the cube size is too small. On the other hand, the image blur increases with the increase of the cube size. This phenomenon is due to the large cube contains too much structural information, the movement of central voxel in the cube is not enough to describe the whole cube. Thus, to generate high quality 4DCBCT image, appropriate size of the cube is needed. Moreover, as shown in Fig. 11, the averaged rRMSEs of ten phases reconstructions indicate that with the cube size range from 7 × 7 × 7 to 11 × 11 × 11, we can get relative smaller rRMSEs. In order to balance the reconstruction quality and computational time, in our other experiments, the cube size was fixed to 9 × 9 × 9.
For the parameter of thresholding coefficients for the HOSVD processing, we choose with a threshold of \(\sigma \sqrt{2\,\mathrm{log}\,{p}^{2}}\) to manipulate the coefficients of core tensor. This is also an experiential selection as used in other works^{51}. Here, σ is defined as the standard deviation of a uniform region in the intermediary images during the iteration and p is the cube size.
Algorithm convergence
To validate and analyze the convergence of the present MgSS method, the \({f}({\bf{n}}){f}({\bf{n}}1)\) (absolute value of differences between two adjacent estimations) measuring on the entire tobereconstructed NCAT phantom image were performed. Figure 12 shows the \({f}({\bf{n}}){f}({\bf{n}}{}1)\) measures with respect to the number of iterations. Results show that the present MgSS algorithm can yield a steadily convergence solution.
Influence of motion tracking
To demonstrate the effect of the estimated motion fields on the reconstruction, we plot the rRMSE measures as a function of the number of iterations for the MgSS with and without motion tracking. Figure 13 shows the benefits of motion guidance, as MgSS with motion tracking can obtain reduced rRMSE as compared to MgSS without motion tracking. Figure 13 also illustrates the convergence of the proposed MgSS algorithm.
Realistic 4D CT based digital phantom study
Figure 14 shows the results of the realistic digital phantom reconstructed by using different methods at transverse, coronal, and sagittal planes for phase #1 with the projection number set to be 21. First column in Fig. 14 shows the 4DCT images using as the golden standard for comparison. The second column shows 4DCBCT images reconstructed by the FDK algorithm. It can be seen that the FDK reconstructions are seriously contaminated by noise and artifacts, and some anatomical structures can’t be clearly seen. For the SARTTV reconstruction, some fine structures have been erased though most of the view aliasing artifacts are suppressed. Compared to FDK and SARTTV algorithms, the proposed MgSS approach can yield images with superior quality. In addition, the UQI test between the reconstructions and golden standard image were calculated. Figure 15 shows the test results of then phases at the transverse, sagittal and coronal planes, separately. For then phases, the UQI values of MgSS reconstructions are always higher than 0.95, which suggests the promising performance of the proposed MgSS algorithm.
Patient study
Figure 16 shows the representative reconstructions of patient data by using the FDK, SARTTV and MgSS methods. Each row of Fig. 16 shows the image of 4DCBCT at different phase in respiratory cycle: 20–30%, 40–50%, 60–70%, 80–90%. The first column in Fig. 16 shows the sagittal view of 4DCBCT images reconstructed by conventional FDK. Because of the limited number of projections at each phase, severe noise and artifacts present in the FDK reconstruction. The second column of Fig. 16 shows the results of SARTTV algorithm. Although noise and artifacts have been remarkably reduced, details within the lung area and edges of bony structure can’t be clearly seen. Last column of Fig. 16 shows the images reconstructed by MgSS method, from which we can observe noise is suppressed. The boundaries of bony structures as well as fine structures inside the lung are well preserved. To further illustrate the performance of our algorithm, zoomed images of the tumor areas in the then phasesof the reconstructions by using different reconstruction are presented (Fig. 17).
Discussion
In this work, we developed a MgSS algorithm to improve the image quality of 4DCBCT. This algorithm is developed on the assumption that there exists high structural similarity between the images of neighbored phases, which is similar with Chen’s work on dynamic MR reconstruction^{43}. In Chen’s work, two dimensional patches are tracked with estimated motion maps and then SVD was used to decompose the tacked cluster. For the standard SVD, the image blocks are manually vectorized and then the image structural properties in the spatial domain are ignored. For the proposed MgSS algorithm, the 3DMVFs between phases were utilized to track the three dimensional cubes and then form the cluster Θ_{ MgSS } with the size of \({N}_{b}\times {N}_{b}\times {N}_{b}\times {N}_{t}\). To preserve the structural properties, in this work, the higher order singular value decomposition which is able to directly decompose dynamic datasets into a multidimensional singular matrix rather than unfolding the cubes into column vectors, was used to process the four dimensional cluster Θ_{ MgSS }. The MgSS algorithm can reveal the fine details shared by grouped blocks and preserve the essential unique features of each individual block. Results of digital phantom and patient data demonstrate the proposed approach can significantly suppress the view aliasing artifacts and noise.
One important step in the proposed MgSS algorithm is: cube based motion tracking. We utilize the RealTime Imagebased Tracker (RTIT)^{36} toolbox to obtain the voxelbyvoxel displacement maps between 3D images of different phases. Specifically, we rely mainly on the displacement of the center voxel of cube in the current phase to determine the cube in the next phase. We assume the displacements of region are changing smoothly. Thus, the central voxel is sufficient to describe the cube motion. And in this case, the accuracy of 3DMVFs estimation would play an important role in finding cubes with similar structures. In this work, the initial 3DMVFs are generated after the first SART iteration, which may not so accurate. Notably, as the iteration goes on, the 3DMVFs would be updated with intermediate reconstructions, and therefore both the image quality and the accuracy of the 3DMVFs will be improved. Other approaches such like the 3DMVFs initialized with those got from the 4D planning CT of the same patient can be considered. There is a very important parameter need to be tuned for the proposed MgSS approach: the tracked cube size. A large size cube may include more details and local geometry, but on the other side, if the cube size is too large, the central voxel will be insufficient to describe the whole cube motion and also result in increasing computation time. Thus, the proper selection of the cube size is critical. In our studies, we change the block size from 3 × 3 × 3 to 23 × 23 × 23 and generate the reconstructions as shown in Fig. 10 and rRMSEs between the reconstructions and the phantom images. For visual inspection, too small or large cube size would lead to image blur or distortion. The rRMSE results suggest that when the cube size set to be 9 × 9 × 9, we can get relative higher quality image with smaller rRMSE for the NCAT phantom. Also as illustrated in other studies of HOSVD technique, the appropriate selection of the cube size should base on the structural complexity of the target image and may be different for different images. But overall, in our work for lung CT imaging, we found the cube size set to be 9 × 9 × 9 is robust enough to balance the structural information and computation time. Another issue of the proposed MgSS algorithm is the heavy computation burden. It takes about 30 minutes on a desktop computer (3.60 GHz Intel(R) i7 CPU with 8GB RAM) to run one iteration. On one hand, as mentioned above, we can short the computation time with a relatively small cube size as well as an elegant initialization. On the other hand, we can use a step of N_{step} pixels in transverse, coronal, and sagittal directions, respectively. Hence, the number of overlapping cubes is decreased by 1/N_{step} ^{3}. But with large N_{step} or large motion, gaps between moved blocks may appear, then a mask of the uncovered areas (gaps) on the nth frame after block motion tracking can be detected and nonmotion tracking is performed for the gap blocks to avoid potential additional gaps. Other techniques including using the graphics processing unit (GPU) can be also considered to short the computation time.
In summary, we have developed a MgSS algorithm to improve the image quality of 4DCBCT. This method effectively utilizes the correlated information from other phases to reconstruct any particular phase of 4DCBCT. By enforcing the regional spatiotemporal sparsity on the tracked cubes, noise and artifacts can be suppressed and the image quality of 4DCBCT can be substantially improved.
References
 1.
Jaffray, D. A., Siewerdsen, J. H., Wong, J. W. & Martinez, A. A. Flatpanel conebeam computed tomography for imageguided radiation therapy. International Journal of Radiation Oncology* Biology* Physics 53, 1337–1349 (2002).
 2.
Oldham, M. et al. ConebeamCT guided radiation therapy: A model for online application. Radiotherapy and oncology 75, 271. E1–271. E8 (2005).
 3.
Pouliot, J. et al. Lowdose megavoltage conebeam CT for radiation therapy. International Journal of Radiation Oncology* Biology* Physics 61, 552–560 (2005).
 4.
Chen, G. T., Kung, J. H. & Beaudette, K. P. Artifacts in computed tomography scanning of moving objects. Semin Radiat Oncol 14, 19–26 (2004).
 5.
Sonke, J., Zijp, L., Remeijer, P. & van Herk, M. Respiratory correlated cone beam CT. Medical physics 32, 1176–1186 (2005).
 6.
Li, T. et al. Fourdimensional conebeam computed tomography using an onboard imager. Medical physics 33, 3825–3833 (2006).
 7.
Lu, J. et al. Fourdimensional cone beam CT with adaptive gantry rotation and adaptive data sampling. Medical physics 34, 3520–3529 (2007).
 8.
Ford, E. C., Mageras, G. S., Yorke, E. & Ling, C. C. Respirationcorrelated spiral CT: a method of measuring respiratoryinduced anatomic motion for radiation treatment planning. Medical physics 30, 88–97 (2003).
 9.
Low, D. A. et al. A method for the reconstruction of fourdimensional synchronized CT scans acquired during free breathing. Medical physics 30, 1254–1263 (2003).
 10.
Pan, T., Lee, T., Rietzel, E. & Chen, G. T. 4DCT imaging of a volume influenced by respiratory motion on multislice CT. Medical physics 31, 333–340 (2004).
 11.
Feldkamp, L. A., Davis, L. C. & Kress, J. W. Practical conebeam algorithm. JOSA A 1, 612–619 (1984).
 12.
Li, T., Koong, A. & Xing, L. Enhanced 4D conebeam CT with interphase motion model. Medical physics 34, 3688–3695 (2007).
 13.
Li, T. & Xing, L. Optimizing 4D conebeam CT acquisition protocol for external beam radiotherapy. International Journal of Radiation Oncology* Biology* Physics 67, 1211–1219 (2007).
 14.
Li, T., Schreibmann, E., Yang, Y. & Xing, L. Motion correction for improved target localization with onboard conebeam computed tomography. Phys Med Biol 51, 253–67 (2006).
 15.
Zhang, H. et al. Fewview conebeam CT reconstruction with deformed prior image. Med Phys 41, 121905 (2014).
 16.
Wang, J., Li, T. & Xing, L. Iterative image reconstruction for CBCT using edgepreserving prior. Medical physics 36, 252–260 (2009).
 17.
Brock, R. S., Docef, A. & Murphy, M. J. Reconstruction of a conebeam CT image via forward iterative projection matching. Medical physics 37, 6212–6220 (2010).
 18.
Jia, X., Dong, B., Lou, Y. & Jiang, S. B. GPUbased iterative conebeam CT reconstruction using tight frame regularization. Physics in medicine and biology 56, 3787 (2011).
 19.
Sidky, E. Y. & Pan, X. Image reconstruction in circular conebeam computed tomography by constrained, totalvariation minimization. Physics in medicine and biology 53, 4777 (2008).
 20.
Sidky, E. Y. et al. Enhanced imaging of microcalcifications in digital breast tomosynthesis through improved imagereconstruction algorithms. Medical Physics 36, 4920–32 (2009).
 21.
Kinnon, G. C. M. & Bates, R. H. T. Towards Imaging the Beating Heart Usefully with a Conventional CT Scanner. IEEE Transactions on Biomedical Engineering BME28, 123–127 (1981).
 22.
Leng, S. et al. Streaking artifacts reduction in fourdimensional conebeam computed tomography. Medical Physics 35, 4649–59 (2008).
 23.
Leng, S. et al. High temporal resolution and streakfree fourdimensional conebeam computed tomography. Physics in Medicine & Biology 53, 5653–73 (2008).
 24.
Bergner, F. et al. Autoadaptive phasecorrelated (AAPC) reconstruction for 4D CBCT. Medical physics 36, 5695–5706 (2009).
 25.
Kida, S., Masutani, Y., Nakano, M., Imae, T. & Haga, A. Improvement of 4D Conebeam CT image quality via iterative reconstruction method. Ieice Technical Report 112, 73–75 (2012).
 26.
Bergner, F. et al. An investigation of 4D conebeam CT algorithms for slowly rotating scanners. Medical Physics 37, 5044–5053 (2010).
 27.
Haralick, R. M. Statistical and structural approaches to texture. Proceedings of the IEEE 67, 786–804 (1979).
 28.
Lee, J. S. Digital Image Enhancement and Noise Filtering by Use of Local Statistics. IEEE Transactions on Pattern Analysis & Machine Intelligence 2, 165–168 (1980).
 29.
Aharon, M., Elad, M. & Bruckstein, A. KSVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing 54, 4311–4322 (2006).
 30.
Buades, A., Coll, B. & Morel, J. M. A Review of Image Denoising Algorithms, with a New One. Siam Journal on Multiscale Modeling & Simulation 4, 490–530 (2005).
 31.
Dabov, K., Foi, A., Katkovnik, V. & Egiazarian, K. Image denoising by sparse 3D transformdomain collaborative filtering. IEEE Transactions on Image Processing 16, 2080–95 (2007).
 32.
Liu, C. & Freeman, W. T. A highquality video denoising algorithm based on reliable motion estimation. European Conference on Computer Vision 706–719 (2010).
 33.
Dwivedi, A. & Shrivastava, S. K. An efficient and fast patch reordering approach for image denoising without losing structural information. International Conference on Computer Communications 1–6 (2014).
 34.
Parameswaran, S., Luo, E. & Nguyen, T. Q. Patch Matching for Image Denoising Using Neighborhoodbased Collaborative Filtering. IEEE Transactions on Circuits and Systems for Video Technology, 1–1 (2016).
 35.
Costantini, R., Sbaiz, L. & Süsstrunk, S. Higher order SVD analysis for dynamic texture synthesis. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 17, 42–52 (2008).
 36.
Ries, M. et al. Realtime 3D target tracking in MRI guided focused ultrasound ablations in moving tissues. Magn Reson Med 64, 1704–12 (2010).
 37.
Zachiu, C., Senneville, B. D. D., Moonen, C. & Ries, M. A framework for the correction of slow physiological drifts during MRguided HIFU therapies: Proof of concept. Medical Physics 42 (2015).
 38.
Zachiu, C., Papadakis, N., Ries, M., Moonen, C. & Denis, D. S. B. An improved optical flow tracking technique for realtime MRguided beam therapies in moving organs. Physics in Medicine & Biology 60, 9003–29 (2015).
 39.
Lingala, S. G., Hu, Y., Dibella, E. & Jacob, M. Accelerated Dynamic MRI Exploiting Sparsity and LowRank Structure: kt SLR. IEEE Transactions on Medical Imaging 30, 1042–1054 (2011).
 40.
Cai, J. F., Jia, X., Gao, H. & Jiang, S. B. Cine Cone Beam CT Reconstruction Using LowRank Matrix Factorization: Algorithm and a ProofofPrinciple Study. IEEE Transactions on Medical Imaging 33, 1581–1591 (2014).
 41.
Akcakaya, M. et al. Lowdimensionalstructure selflearning and thresholding: regularization beyond compressed sensing for MRI reconstruction. Magn Reson Med 66, 756–67 (2011).
 42.
Vitanis, V. et al. High resolution threedimensional cardiac perfusion imaging using compartmentbased kt principal component analysis. Magn Reson Med 65, 575–87 (2011).
 43.
Chen, X., Salerno, M., Yang, Y. & Epstein, F. H. Motion‐compensated compressed sensing for dynamic contrast‐enhanced MRI using regional spatiotemporal sparsity and region tracking: Block low‐rank sparsity with motion‐guidance (BLOSM). Magnetic resonance in medicine 72, 1028–1038 (2014).
 44.
Lathauwer, L. D., Moor, B. D. & Vandewalle, J. On the Best Rank1 and Rank(R 1, R 2, …, R N) Approximation of HigherOrder Tensors. Siam Journal on Matrix Analysis & Applications 21, 1324–1342 (2000).
 45.
Pan, H., Huang, T. Z. & Ma, T. Twostep groupbased adaptive softthresholding algorithm for image denoising. Optik  International Journal for Light and Electron Optics 127, 503–509 (2016).
 46.
Jorgensen, J. S., Sidky, E. Y. & Pan, X. Quantifying admissible undersampling for sparsityexploiting iterative image reconstruction in Xray CT. IEEE Trans Med Imaging 32, 460–73 (2013).
 47.
Andersen, A. H. & Kak, A. C. Simultaneous algebraic reconstruction technique (SART): a superior implementation of the ART algorithm. Ultrasonic imaging 6, 81–94 (1984).
 48.
Segars, W. P. et al. Development and application of the new dynamic Nurbsbased CardiacTorso (NCAT) phantom (2001).
 49.
Zijp, L., Sonke, J. J. & Herk, M. V. Extraction of the Respiratory Signal from Sequential Thorax ConeBeam XRay Images. In ICCR (2004).
 50.
Wang, Z., Bovik, A. C., Sheikh, H. R. & Simoncelli, E. P. Image quality assessment: from error visibility to structural similarity. IEEE Transactions on Image Processing A Publication of the IEEE Signal Processing Society 13, 600 (2004).
 51.
Dabov, K., Foi, A. & Egiazarian, K. Image denoising with blockmatching and 3D filtering. Proceedings of SPIE  The International Society for Optical Engineering 6064, 354–365 (2006).
Acknowledgements
This work was partially supported by the National Natural Science Foundation of Chinaunder Grants (No. 81501466, No. 61571214, No. 81371544), the Natural Science Foundation of Guangdong Provinceunder Grants (No. 2015A030310018), theGuangzhou Science and Technology Project (No. 201710010099). The authors thank the RTK Group very much for the supplying of the opensource data of CBCT. The authors would also like to thank the anonymous reviewers for their constructive comments and suggestions that greatly improved the quality of the manuscript.
Author information
Affiliations
Contributions
Y.L. and H.Z. wrote the main manuscript text, Y.L., H.Z. and Z.B. did the whole experiments, X.T., D.Z., Q.F., J.M. and W.C. revised the manuscript. All authors reviewed the manuscript.
Corresponding authors
Correspondence to Wufan Chen or Hua Zhang.
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Received
Accepted
Published
DOI
Further reading

Motion compensated microCT reconstruction for insitu analysis of dynamic processes
Scientific Reports (2018)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.