logo

Hybrid reconstruction algorithm for computed tomography based on diagonal total variation

NUCLEAR ELECTRONICS AND INSTRUMENTATION

Hybrid reconstruction algorithm for computed tomography based on diagonal total variation

Lu-Zhen Deng
Peng He
Shang-Hai Jiang
Mian-Yi Chen
Biao Wei
Peng Feng
Nuclear Science and TechniquesVol.29, No.3Article number 45Published in print 01 Mar 2018Available online 28 Feb 2018
58900

Inspired by total variation (TV), this paper represents a new iterative algorithm based on diagonal total variation (DTV) to address the computed tomography (CT) image reconstruction problem. To improve the quality of a reconstructed image, we used DTV to sparsely represent images when iterative convergence of the reconstructed algorithm with TV-constraint had no effect during the reconstruction process. To investigate our proposed algorithm, the numerical and experimental studies were performed, and root mean square error (RMSE) and structure similarity (SSIM) were used to evaluate the reconstructed image quality. The results demonstrated that the proposed method could effectively reduce noise, suppress artifacts, and reconstruct high-quality image from incomplete projection data.

Computed tomography (CT)Sparse view reconstructionDiagonal total variation (DTV)Compressive sensing (CS)

1. Introduction

X-ray computed tomography (CT) has been widely used in clinical and preclinical applications and plays a central role in the examination of diseases and procedures [1, 2]. X-ray radiation dose is harmful for patients, and low-dose CT reconstruction techniques are a research hotspot in the current medical CT field. To reduce the X-ray radiation dose, medical CT systems can decrease the X-ray intensity or the number of scanning projection angles. However, these two strategies result in a reconstructed image with a low signal-to-noise ratio (SNR), indicating the negative impact of noise and artifacts.

In comparison to traditional CT-reconstructed algorithms [3-5], the iterative algorithms based on compressive sensing (CS) [6] can be used to reconstruct high-quality CT images with incomplete or low SNR projection data. In the CT reconstruction, some advanced exemplary algorithms were employed, such as total variation (TV) minimization [7,8], soft-thresholding algorithms [9, 10], adaptive-weighted total variation (AWTV) [11], dictionary learning [12], multi-direction anisotropic total variation (MDATV) [13], and split-Bregman reconstruction [14]. Among them, TV using the x- and y-coordinate gradient operators as the sparse representation approach during the iteration process is one of the most popular algorithms. However, it still can be improved by combining with the diagonal total variation (DTV) [15-19], which accelerates the iterative convergence and reconstructs the high-quality image from incomplete projection data.

In this paper, the research goal was to reconstruct high-quality CT images with a low dose. There are many methods to reduce the delivered dose in CT scanning but we focused on a sparse view reconstruction strategy. To handle the noise and artifacts in the reconstructed images from few-view projections, we proposed a hybrid reconstruction approach that combined TV- and DTV-constraints aimed at exploring the sparse view reconstruction. We introduce the methodology including the proposed algorithm in Sect. 2, analyze the reconstructed results in Sect. 3, and discuss the issues related to our reconstruction process and the corresponding results in Sect. 4.

2. Methodology

2.1 Total Variation and Diagonal Total Variation

The reconstruction algorithm with TV-constraint can be defined as:

minfTVs.t.Afp22<σ2. (1)

Where f is the reconstructed image, A is the projection matrix, p is the projection data, σ is permissible error, and TV of f can be expressed as:

fTV=s,t|fs,t|=[Ds(fs,t)]2+[Dt(fs,t)]2. (2)

Where represents the local gradient operator, fs,t is a pixel value of f at position (s,t), Ds() and Dt() are the discrete differential operators along the s- and t-axes, respectively, defined as:

Ds(fs,t)=fs,tfs1,t (3) Dt(fs,t)=fs,tfs,t1. (4)

The positional relationship between fs,t and its neighboring pixels is shown in Figure 1.

Fig. 1.
Illustration of the pixel positions in a reconstructed image
pic

The DTV of f is defined as fDTV:

fDTV=s,t|Dfs,t|=[Dds(fs,t)]2+[Ddt(fs,t)]2 (5)

where D represents the local diagonal gradient operator and Dds() and Ddt() are the discrete differential operators along the diagonal s- and t-axes, respectively, given by

Dds(fs,t)=fs,tfs1,t-1 (6) Ddt(fs,t)=fs,tfs-1,t+1. (7)
2.2 Proposed Algorithm

In this paper, we used DTV to sparsely represent images when iterative convergence of the reconstructed algorithm with TV-constraint showed no change during the reconstruction process. To solve the optimization problem, we employed the steepest descent method [20]. Our proposed algorithm can be defined as follows:

minfTVs.t.Afp22<σ12 (8) minfDTVs.t.Afp22<σ22 (9)

where σ1 and σ2 are the permissible errors of TV and DTV, respectively.

The steepest descent method was applied to solve Eqs. (8) and (9), and we obtained the following formulas:

fl+1=fl-αgTVl (10) fk+1=fk-βgDTVk (11)

Where l and k denote the iteration indices of TV and DTV in the steepest descent method, respectively, and α and β are the gradient descent step sizes of TV and DTV, respectively. gTV is the normalized TV gradient while gTV is the TV gradient, and are related by gTV=gTV/gTV2. The individual elements of gTV can be defined as follows:

fTVfs,t(fs,tfs1,t)+(fs,tfs,t1)ε+(fs,tfs1,t)2+(fs,tfs,t1)2(fs+1,tfs,t)ε+(fs+1,tfs,t)2+(fs+1,tfs+1,t1)2(fs,t+1fs,t)ε+(fs,t+1fs,t)2+(fs,t+1fs1,t+1)2. (12)

gDTV is the normalized DTV gradient while gDTV is DTV gradient, and related by gDTV=gDTV/gDTV2. The individual elements of gDTV can be defined as follow:

fDTVfs,t(fs,tfs1,t1)+(fs,tfs1,t+1)ε+(fs,tfs1,t1)2+(fs,tfs1,t+1)2(fs+1,t-1fs,t)ε+(fs+1,t-1fs,t)2+(fs+1,t-1fs,t2)2(fs+1,t+1fs,t)ε+(fs+1,t+1fs,t)2+(fs+1,t+1fs,t+2)2 (13)

Where ε is a known positive integer. In our study, we selected ε =10-8.

We will now describe the iterative steps of the proposed algorithm. The whole iteration process contained two loops, the outside and inside loops operated the Algebraic Reconstruction Techniques (ART) [4] and TV gradient descent, respectively, then the DTV was used as a substitute when the iterative convergence of the reconstructed algorithm was stable after the NTV th iteration. The flow chart is shown in Table 1, where m=2,...,M denotes the projection angles, Am is the mth row vector of the projection matrix A=(A1,A2,...,Am,...,AM), p=(p1,p2,...,pm,...,pM) is the projection, and λ is the convergence parameter of the ART method. The inside loop is labeled by k and K is the iteration count for the TV and DTV minimizations.

Table 1.
Implementation steps of the TV+DTV reconstruction
      Algorithm TV+DTV
%Initialization
Given M,Am(m=1,2,...,M),pmm=12,...,M),λ,α,β, σ,K, NTV and Niter
f0=0;
%Main iteration loop
for n=1,2,...,Niter do
%ART Updating
fn,0=fn1
for m=12,...,M do
fn,m=fn,m1+λAm(pmAm.fn,m1)Am.Am
     end
    (fs,t)n={(fs,t)n,M0(fs,t)n,M0(fs,t)n,M<0
     %TV+DTV
    d(n)=fn1fn2
    for k=1,2,...,K do
      if nNTV do
gs,tn,k1=gTV(s,t)n,k1=fn,M+k1TV/fs,t
      else do
gs,tn,k1=gDTV(s,t)n,k1=fn,M+k1DTV/fs,t
      end
      gn,k1=gn,k1/gn,k12
      fn,M+k=fn,M+k1αd(n)gn,k1
    end
    %Image Updating
    fn+1=fn,M+k
    %Exit Criterion
    if Afp22<σ2 then
      exit
    end
  end    
Show more

3. Numerical and experimental studies

In this section, we present our numerical and experimental studies. There were two sets of experiments; FORBILD head phantom [21] was used in the numerical study and real projection data was used to reconstruct the aspirin images in the experimental study. The image quality was assessed with relative root mean square error (RMSE) and structure similarity (SSIM) [22].

The RMSE is defined as

RMSE=(0s<N0t<M(fs,tf*s,t)2)/(M×N) (14)

where fs,t is the pixel value of the original image at position (s,t) and f*s,t is the pixel value of the reconstructed image at position (s,t).

SSIM is defined as

SSIM=l(f,f*)δc(f,f*)γv(f,f*)η (15)

where

l(f,f*)=(2f¯×f¯*+c1)/((f¯)2+(f¯*)2+c1) (16) c(f,f*)=(2σfσf*+c2)/(σf2+σf*2+c2) (17) v(f,f*)=(σff*+c3)/(σfσf*+c3) (18)

where f¯ and f¯* are the mean of f and f*, respectively, σf and σf* are the variances of f and f*, respectively, σff* is the covariance of f and f*, and c1, c2, and c3 are three small positive constants used to avoid instability if the value of the denominator in Eq.(16) is very close to zero. δ, γ, and η are used to adjust the weights of the luminance l(f,f*), contrast c(f,f*), and structures v(f,f*). In our study, we selected δ=γ=η=1, c1=2×10-8, c2=1×10-8, and c3=0.5×c2=0.5×10-7, and the value of SSIM was between -1 and 1. When two images were the same, the SSIM value was 1.

The selection of the optimal parameters is very important. According to Table 1, there are three parameters: λ, α, and β. Previous studies reported a range for λ of [0,2], and the steepest-descent parameter (α or β) and descent iterative number (K) should be set so that α*K (or β*K) is of the order of, but larger than one [7]. In the cases considered here, the descent iterative numbers were 20, and the ranges of α and β were both [0.05, 1]. The following optimal parameters were selected from these ranges. We use the FORBILD head phantom with 30 projections as an example to determine the optimal parameters. We changed the value of λ to reconstruct images with the ART method and calculated the reconstruction error. Figure 2 shows that λ=1 was the optimal parameter where RMSE was the lowest and SSIM was approaching 1. When λ was fixed, we changed the value of α to reconstruct images with the TV method and calculated the reconstruction error. Figure 3 shows that α=0.55 was the optimal parameter. When λ and α were fixed, we changed the value of β to reconstruct images with the TV+DTV method and calculated the reconstruction error. Figure 4 shows that β =0.28 was the optimal parameter. Then we used λ=1, α=0.55, and β =0.28 in the next numerical simulation using the FORBILD head phantom. A similar search was conducted for real data and the optimal values of these parameters are shown in Table 2.

Table 2.
Optimum parameter selections for each dataset
Data λ α β
FORBILD head phantom 1 0.55 0.28
Real data 0.5 0.15 0.1
Show more
Fig. 2.
RMSE and SSIM of the analyses to find the optimum regularization parameter λ.
pic
Fig. 3.
RMSE and SSIM of the analyses to find the optimum regularization parameterα.
pic
Fig. 4.
RMSE and SSIM of analyses to find the optimum regularization parameter β.
pic
3.1 Numerical Simulation Study

In the numerical simulation, a FORBILD head phantom (256×256×8 Bits), shown in Figure 5(a) was used to analyze the performance of the proposed algorithm. The scanning range of the CT system was from 0 to 2π with a θ angular increment. The projection number was set to 30 so that θ= π/15. The scanned angle can be specified by ψ=θ*i (1≤i≤30). The iteration number was set to 1000 and the proposed algorithm (TV+DTV) included 600 TV iterations and 400 DTV iterations. Figures 5(b) and 5(c) were reconstructed by the TV and TV+DTV methods, respectively.

Fig. 5.
Reconstructed FORBILD head phantom for comparison.
pic

Although the reconstructed images obtained using the TV and TV+DTV methods were not significantly different in Figure 5, the positions of the arrows in Figure 6 showed that the profile of the reconstructed image using the TV+DTV method was more stable than that of the TV method. Furthermore, the zoomed in part of the reconstructed images are shown in Figure 7, and Figure 7(c) had fewer artifacts, clearer edges, and a more uniform distribution compared with Figure 7(b) reconstructed by TV method.

Fig. 6.
Profile of line 180 in the different reconstructed methods for the FORBILD head phantom.
pic
Fig. 7.
Magnified part of the reconstructed FORBILD head phantom for comparison.
pic

In Figure 8, the RMSE and SSIM are plotted with the iteration number. For both criteria, there was a similar trend in which the RMSE and SSIM of the reconstructed images became better when converted from TV-constraint to DTV-constraint after 600 iterations. Table 3 lists the RMSE and SSIM calculated from the reconstructed FORBILD head phantom with the TV and TV+DTV methods. It was evident that the RMSE of the reconstructed images using the TV+DTV method was considerably smaller than those obtained by the TV method, and the SSIM was significantly larger. Both the RMSE and SSIM showed that the TV+DTV method can be used to reconstruct images with higher quality.

Table 3.
RMSE and SSIM of reconstruction images
Methods FORBILD head phantom Real aspirin data
  TV TV+DTV TV TV+DTV
RMSE 0.0159 0.0143 0.1586 0.1359
SSIM 0.9987 0.9989 0.9745 0.9816
Show more
Fig. 8.
RMSE and SSIM lines of reconstructed FORBILD head phantoms with iteration number (from 100 to 1000) for the TV and TV+DTV methods.
pic
3.3 Real Projection Data Study

We applied our proposed algorithm to a set of real projection data, acquired from scanned aspirin. The source-to-object distance was 48.1551 cm, and the object to detector distance was 100.4449 cm. The detector pixel size was 0.0098 cm, and the number of detector elements was 1472. X-ray CT geometrical calibration via locally linear embedding [23] was used to calibrate the geometry and the calibrated rotation center and detector offsets were -0.3318 cm and -0.5364 cm, respectively. The number of projection angles was 360, equally spaced in the angular range [0, 2π]. To demonstrate the performance of our proposed approach, we reduced the number of views to 90 by setting the angular increment to θ= π/45. As shown in Figure 9(a), the reconstructed full-view image was considered to be the standard image. Figures 9(b) and 9(c) were reconstructed by the TV and TV+DTV methods with 90 projection views, respectively. The TV-constraint was converted into the DTV-constraint after 40 iteration numbers, and the lines of the RMSE and SSIM with respect to iteration number are shown in Figure 10. The RMSE and SSIM of reconstructed images are listed in Table 3. It was observed that the result of the TV+DTV method was considerably better than that of the TV method.

Fig. 9.
Reconstructed aspirin images for comparison.
pic
Fig. 10.
RMSE and SSIM lines of reconstructed aspirin projection data with respect to iteration number for the TV and TV+DTV methods.
pic

4. Discussions and Conclusion

There are several issues worth further discussion. Although the proposed hybrid method can be used to reconstruct high-quality images from sparse-views data, it should be noted that the DTV-constraint had no obvious advantages over the TV-constraint with a small number of iterations, due to the gradient operators for the sparse representation. Therefore, it is vital to find the appropriate iteration number to convert the TV-constraint into the DTV-constraint to accelerate the iterative convergence. In the reconstruction process, the iteration numbers were 600 and 40 in the numerical and experimental studies, respectively, according to the characteristics of the low and high frequency components in the reconstructed image.

Another issue regarding the feasibility of the reconstruction algorithm is whether the run-time is acceptable. The run-time depends on the computational environment. MatLabR2014b on a computer with the Intel(R) Core(TM) i5-4590 CPU @3.30 GHz, RAM 8.00 GB, 64-bit OS was used here. The implementation of the TV + DTV algorithm took 179 s for the FORBILD head phantom (total number of iterations was 1000) and 285 s for the real aspirin projection data (the total iterative number was 90). The run-time was acceptable, but could be further improved by many methods, such as parallel computing and CUDA acceleration.

For our proposed algorithm, the selection of the weights of the TV- and DTV-constraints were also an important issue in the reconstruction. If they are too small, the algorithm based on the TV- and DTV-constraints would not be able to reduce the artifacts and noise of the reconstructed image. If they are too large, the TV- and DTV-constraints would over-smooth the CT images. Thus, the weight parameter selection for the TV- and DTV-constraints depends on the levels of artifacts and noise. In this paper, the used parameters are shown in Table 2 according to the experimental analysis.

In conclusion, we proposed a hybrid reconstruction approach by combining the TV- and DTV-constraints, by operating the DTV-constraint to sparsely represent the images when the iterative convergence of the reconstructed algorithm with the TV-constraint did not vary during the reconstruction process. The numerical and experimental studies demonstrated that the proposed hybrid method can be used to reconstruct high-quality images from sparse-views data, and the RMSE and SSIM were improved when the TV-constraint was converted to the DTV-constraint after a set number of iterations. Further research will be performed to explore the directional TV optimization problem in the CT reconstruction.

References
[1] E.J. Hall, D.J. Brenner,

Cancer risks from diagnostic radiology

. British Journal of Radiology. 81(965), 362-378 (2008).doi: 10.1259/bjr/01948454
Baidu ScholarGoogle Scholar
[2] A. Berrington, M. Mahesh, K. P. Kim et al.,

Projected cancer risks from computed tomographic scans performed in the United States in 2007

. Arch Intern Med. 169(20), 2071-2077 (2009).doi: 10.1001/archinternmed.2009.440
Baidu ScholarGoogle Scholar
[3] J.D Cameron,

One-dimensional tomography: a comparison of Abel, onion-peeling, and filtered backprojection methods

.Applied Optics. 31(8), 1146-1152 (1992). doi: 10.1364/AO.31.001146
Baidu ScholarGoogle Scholar
[4] R. Gordon, R. Bender, G. Herman,

Algebraic reconstruction techniques (ART) for three-dimensional electron microscopy and x-ray photography

. Journal of Theoretical Biology. 29(3), 471-481 (1970).doi: 10.1016/0022-5193(70)90109-8
Baidu ScholarGoogle Scholar
[5] H. Andersen, A. Kak,

Simultaneous algebraic reconstruction technique (SART): asuperior implementation of the ART algorithm

. Ultrasonic Imaging. 6(1), 81-94 (1984).doi: 10.1177/016173468400600107
Baidu ScholarGoogle Scholar
[6] D. Donoho,

Compressed sensing

. IEEE Transactions on Information Theory. 52(4), 1289-1306 (2006). doi: 10.1109/TIT.2006.871582
Baidu ScholarGoogle Scholar
[7] E.Y. Sidky, X.C. Pan,

Image reconstruction in circular cone-beam computed tomography by constrained, total variation minimization

Phys. Med. Biol53, 4777-4807 (2008).doi: 10.1088/0031-9155/53/17/021
Baidu ScholarGoogle Scholar
[8] J. Bian, J. Siewerdsen et al.,

Evaluation of sparse-view reconstruction from flat-panel-detector cone-beam CT

.Phys. Med. Biol55(65), 75-99(2010).doi: 10.1088/0031-9155/55/22/001
Baidu ScholarGoogle Scholar
[9] H. Yu, G. Wang,

A soft-threshold flteringapproach for reconstruction from a limited number of projections

.Phys. Phys. Med. Biol55(13),3905-3916 (2010).doi: 10.1088/0031-9155/55/13/022
Baidu ScholarGoogle Scholar
[10] Q. Xu, X. Mou et al.,

Statistical interior tomography

. IEEE Transactions on Medical Imaging. 30(5), 1116-1128 (2011).doi: 10.1109/TMI.2011.2106161
Baidu ScholarGoogle Scholar
[11] Y. Liu, J. Ma, Y. Fan, Z. Liang,

Adaptive-weighted total variation minimization for sparse data toward low-dose x-ray computed tomography image reconstruction

Phys. Med. Biol.  57(23), 7923-7956(2012).doi: 10.1088/0031-9155/57/23/7923
Baidu ScholarGoogle Scholar
[12] Q. Xu, H. Yu, X. Mou, L. Zhang, J. Hsieh, G. Wang,

Low-dose x-ray CT reconstruction via dictionary learning, IEEE Transactions on Medical Imaging

31(9), 1682-1697 (2012).doi: 10.1109/TMI.2012.2195669
Baidu ScholarGoogle Scholar
[13] H. Li, X. Chen, Y. Wang, Z. Zhou, Q. Zhu, D. Yu,

Sparse CT reconstruction based on multi-direction anisotropic total variation (MDATV)

.BioMedical Engineering OnLine, 13, 92 (2014).doi: 10.1186/1475-925X-13-92
Baidu ScholarGoogle Scholar
[14] T. Goldstein, S. Osher,

The split bregmanmethod for L1-regularized problems

.SIAM Journal on Imaging Science. 2, 323-343 (2009).doi: 10.1137/080725891
Baidu ScholarGoogle Scholar
[15] M.Y. Chen, D.L. Mi, P. He, L.Z Deng, B. Wei,

A CT reconstruction algorithm based on L1/2 regularization

. Computational and Mathematical Methods in Medicine, 2014, 8 (2014).doi: 10.1155/2014/862910
Baidu ScholarGoogle Scholar
[16] L.Z Deng, P. Feng, M.Y. Chen, P. He, Q.S. Vo, B. Wei,

A CT reconstruction algorithm based on non-aliasing contourlet transform and compressive sensing

. Computational and Mathematical Methods in Medicine, 2014,9 (2014).doi: 10.1155/2014/753615
Baidu ScholarGoogle Scholar
[17] L.Z. Deng, D.L. Mi, P. He, P. Feng, M.Y. Chen, Z.C. Li, J. Wang, Wei Biao,

A CT reconstruction approach from sparse projection with adaptive-weighted diagonal total-variation in biomedical application

. Bio-Medical Materials and Engineering, 26, 1685-1693 (2015).doi: 10.3233/BME-151468
Baidu ScholarGoogle Scholar
[18] M.Y. Chen, Y. Ren, P. Feng, L.Z. Deng, P. He, B. Wei, Z.C. Li, J. Wang,

Computed tomography image reconstruction from few-views data by multi-directional total variation

. Journal of Medical Imaging and Health Informatics. 5, 309-316 (2015).doi: 10.1166/jmihi.2015.1392
Baidu ScholarGoogle Scholar
[19] L.Z. Deng, P. Feng, M.Y. Chen, P. He, Wei Biao,

An improved total variation minimization method using prior images and split-bregmanmethod in CT reconstruction

.BioMed Research International, 2016, 9 (2016).doi: 10.1155/2016/3094698
Baidu ScholarGoogle Scholar
[20] H. Zhou, P. Wang,

A simpler explicit iterative algorithm for a class of variationalinequalities in hilbertspaces

. Journal of Optimization Theory and Applications, 161, 716-727 (2014).doi: 10.1007/s10957-013-0470-x
Baidu ScholarGoogle Scholar
[21] Z. Yu, F. Noo, F. Dennerlein et al.,

Simulation tools for two dimensional experiments in x-ray computed tomographyusing the FORBILD head phantom

. Phys. Med. Biol. 57, 237-252 (2012). doi: 10.1088/0031-9155/57/13/N237
Baidu ScholarGoogle Scholar
[22] W. Zhou, C. Alan, R Hamid,P. Eero,

Image quality assessment: from error visibility tostructural similarity

.IEEE Transactions on Image Processing, 13, 600-612 (2004).doi: 10.1109/TIP.2003.819861
Baidu ScholarGoogle Scholar
[23] M. Y. Chen, Y. Xi, W.X. Cong, B.D. Liu, B. Wei,G. Wang,

X-ray CT geometrical calibration via locally linear embedding

. Journal of X-Ray Science and Technology. 24, 241-256 (2016).doi: 10.3233/XST-160548
Baidu ScholarGoogle Scholar