1. Introduction
CBCT is becoming widely used in many fields such as clinic diagnosis, public safety inspection, and non-destructive testing. In the field of CBCT reconstruction, analytic methods such as FDK0] are effective and practical to reconstruct 3D objects. To reduce the potential radiation risk, there are major efforts to lower X-ray radiation dose[2,3] and to speed up CT scans[4]. Because computer hardware has greatly evolved, researchers have looked to optimization-based iterative methods to reduce noise[5,6]. Iterative algorithms like ART (Algebraic Reconstruction Techniques)-type[5], SIR[7], and MBIR[8] have been used to reconstruct objects when projection data is noisy. In the past few years, compressed sensing (CS) algorithms have been studied[9,10]. These studies have shown that high-quality CT images can be reconstructed by iterative methods, which can incorporate both models of imaging physics and additional constraints[11-13]. A typical example is image denoising based on total variation (TV) regularization[14,15]. However, optimization-based iterative reconstruction is normally computationally expensive. Therefore, it is almost impossible to use such iterative algorithms when real-time reconstruction is required.
Recently, deep learning and convolutional neural networks (CNN) have been widely used in image processing[16-18]. In the CT field, researchers have published some studies using different kinds of CNNs to gain better reconstruction images [19-21]. Researchers have shown that deep learning methods can be used to help with low-dose image denoising, metal-artifact reduction, and sparse-data CT[22-25]. Recently, a U-net[26] structure for a large receptive field was also proposed to reduce noise in low-dose CT images[19]. The successful implementation of CNN in solving the problem of noisy data has shown that CT noise caused by the low-dose condition has characteristics that can be learned and reduced by CNNs. However, the networks mentioned above focused only on learning the characteristics in the image domain. Furthermore, not much study has been focused on the issue of cone-beam CT reconstruction. Considering the popular imaging modality of CBCT scan, we intended to catch the features in both the projection domain and the image domain, as well as employ the Radon transform relationship between these two domains. For a certain cross-section of an object, we started our work by trying to implement a residual U-net structure. That structure would estimate fan-beam projection data using cone-beam projection data. In this way, we aimed to transfer the problem from 3D imaging to 2D imaging. In this work, we combine the processes of projection estimation, image reconstruction, and image refinement together into one network. Hence, we incorporate the complete imaging physics model of CBCT, as well as characterize data features in both the projection and image domains. An end-to-end reconstruction network is built accordingly.
2. Theory and Method
In this section, we first introduce briefly the basic physics of a dental cone-beam CT system and address the problem of our interest. Then a Res-CNN type neural network is specifically constructed according to the physics and reconstruction theories of X-ray CT. Detailed network architecture is presented and explained together with the step-by-step training method.
2.1 Physics of a dental cone-beam CT imaging
A typical cone-beam CT imaging system with a circular orbit is shown in Fig. 1. The rotation center is set as the origin O. The point D is the projection of O on the detector plane. As shown in the front view of the detector in Fig. 1, the position of D is denoted as
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F001.jpg)
As shown in Fig. 1, in a cone-beam CT, the image μm is projected onto as many as R* rows on the detector, i.e., projections on these rows are related to μm.
2.2 Main architecture of the network
Our proposed reconstruction network is an end-to-end solution, i.e., the input is P3D and the output is
The whole network is composed of three sub-networks: the cone-to-fan transformation sub-network, the 2D analytical inversion sub-network, and the image refinement sub-network. Two residual U-nets are utilized in the cone-to-fan transformation and image-refinement sub-networks. As shown in Fig. 2, the U-net structure we use consists of four stages connected by pooling layers in the first half and upsampling layers in the second half. The number of channels for the first convolution layer of the U-net is χc.
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F002.jpg)
In the cone-to-fan transformation sub-network, P3D is the input. As shown in Fig. 2, in one branch, we reconstruct image μm0 with P3D using an analytical reconstruction algorithm[27]. The simulated available half of the fan-beam projection of μm0 is denoted as
According to our experience, CT reconstruction is sensitive to errors in projection data. It is hard for the cone-to-fan transformation sub-network to fully catch the characteristics of a sinogram. Hence, we configured a sinogram-to-image-domain transformation sub-network. We exactly follow the computation as an analytical reconstruction and hence refer to it as "2D analytical inversion sub-network". The procedure of this sub-network includes three steps: weighting, filtration, and back-projection. All three of these steps can be realized by matrix-vector multiplication. Therefore, we can formulate the 2D analytical inversion sub-network as FBP reconstruction steps:
Here, W is a diagonal matrix for ray-by-ray weighting, the matrix F represents a ramp filtration process in the detector axis for all views, and
The outputs of the 2D analytical inversion sub-network
2.3 Network training
When training, we first train the cone-to-fan transformation sub-network separately by a loss of l2- norm,
3. Experiments and Results
To examine the performance of the proposed method, we arranged our research to reconstruct a certain slice of a cone-beam dental CT.
3.1 Validating the network on simulated low-dose CT data
In total, 110 patients’ normal-dose CT projection data were obtained from a cone-beam dental CT. Among them, 100 randomly chosen patients’ data were used in training, while the other 10 patients’ data were used for validation. All these 110 patients were chosen randomly from hospitals. The personal information of these patients was anonymized. The data were taken from the same dental CT system. Training data and validation data were independent. For the cone-beam scan system, the source-to-origin distance (
To achieve labels for our network, we collected data scanned with the X-ray source set to be 100 kV and 4 mA. Each projection was acquired in 20 ms. This was deemed a normal dose situation, and the corresponding blank scan was denoted as IN. We simulated the low-dose projections
We denoted the projection data in the training and validation sets as
with μm3D and H3D the 3D object and the corresponding projection matrix respectively,
When training the cone-to-fan transformation sub-network separately, the loss function of the network
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F004.jpg)
As a demonstration, we show some intermediate results in Fig. 3. We can see that analytical reconstruction can provide high-frequency information for the slice of interest. The cone-to-fan transformation sub-network can recover the low-frequency information caused by the approximation in analytical reconstruction for the cone-beam problem. Details of intermediate results from the cone-to-fan transformation sub-network (
Methods | RRMSE | SSIM | SNR |
---|---|---|---|
DDL with U-net | 0.0589±0.0016 | 0.9957±0.0006 | 20.6768±0.5722 |
Estimation of μFBP | 0.0704±0.0027 | 0.9939±0.0007 | 19.1384±0.5251 |
DDL with Plain CNN | 0.0718±0.0017 | 0.9936±0.0008 | 18.9579±0.5572 |
Image-domain U-net | 0.0768±0.0017 | 0.9927±0.0009 | 18.3803±0.5373 |
Analytical reconstruction | 0.1629±0.0097 | 0.9682±0.0040 | 11.8569±0.6597 |
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F003.jpg)
Using low-dose data, we compared our method with U-net [19,26] only in the image domain and with analytical reconstruction[27]. The image-domain U-net-based method also used
Three cases in the validation data set are shown in Fig. 5 with zoom-in of the region of interest (ROI) in the blue boxes for demonstration. The horizontal profiles of the difference images between the reconstructions and the labels along the red line in Fig. 5 are plotted in Fig. 6. We can see that structural information is severely contaminated in analytical reconstructions for low-dose data. Image-domain U-net-based methods can work well in reducing noise but the edges are a little blurred. The proposed network performs best in recovering structural details comparable to the normal dose case.
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F005.jpg)
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F006.jpg)
Moreover, we quantitatively evaluated the image quality of all validation reconstructions in terms of relative root mean square error (RRMSE), the structural similarity (SSIM) index, and signal-to-noise ratio (SNR). The RRMSE index for each reconstruction image was computed as,
with
where
where j indexes the pixels. Results are shown in Table 1. It is shown that our DDL design is the most important factor for realizing high quality reconstructions in this low-dose CT reconstruction problem. The RRMSE, SSIM, and SNR of
3.2 Validating the network on practical CT data
We also imaged a skull head phantom (shown in Fig. 7) for practical experiments. The low-dose scans were done on the same dental CT system used above. The voltage of the X-ray source was set to 80 kV and the current was set to 2 mA. Under this low-dose situation, the blank scan IL was about 25% of IN. The reconstructions are shown in Fig. 8.
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F007.jpg)
-201904/1001-8042-30-04-007/alternativeImage/1001-8042-30-04-007-F008.jpg)
From these results, we can see that the proposed method can effectively reduce the noise in practical low-dose CT reconstructions. All main structures of the phantom are well reconstructed by the trained network.
4. Computational complexity
The computational complexity of the proposed method can be estimated from the computation load of the three sub-networks. The major computation load in both cone-to-fan transformation and image refinement sub-networks is in the convolution layers. If we count the multiplications only, there will be
multiplications for the lth convolution layer. Here, l is the layer index, with channels of dimension (Xl×Yl). Hence, the number of multiplications in the network will be approximately,
Here,
Layer # | Multiplications in U-net A | Layer # | Multiplications in U-net B |
---|---|---|---|
2 | 16 | ||
3 | 17 | ||
4 | 18 | ||
5 | 19 | ||
11 | 25 | ||
12 | 26 | ||
13 | 27 | ||
14 | 28 |
5. Conclusion
We propose a new framework of X-ray CT reconstruction based on deep learning for slice-wise reconstruction in a cone-beam CT system. The proposed method utilizes a novel structure containing three parts, which were designed for cone-to-fan projection estimation, 2D analytical inversion transform, and image refinement, respectively. The cone-to-fan transformation and the image refinement sub-networks are both built using residual U-net structures. A 2D analytical inversion transformation sub-network completes the domain transformation from projection domain to image domain. The cone-to-fan transformation sub-network is trained first. Then the whole network is trained using ultimate image-domain loss. Our results with a realistic phantom show that the proposed method can effectively reduce noise and recover detailed structures in scanned objects. Reconstructions have higher image quality than commonly used low-dose cone-beam CT reconstructions.
It is worth pointing out that when the targeted slice is farther away from the mid-plane of a cone-beam CT, the projection of the slice is contained in multiple detector rows. The data of the slice of interest are mixed with many other slices, and this makes it more challenging to obtain the 2D projection of the slice. Because the cone angle of practical CT systems is usually within -5~+5 degrees, we have researched the most difficult situation, where the cone angle is about 5 degrees, as an example. By building up similar branches for different slices, one could conveniently reconstruct a volume of interest or multiple inconsecutive slices.
This proposed network is initially designed to incorporate the imaging physics (modelled by a CT system matrix) in the network design so that it can learn the characteristics of both projection and image domains in an end-to-end mechanism. It combines the capability of physical models and information mining from big data sets. Moreover, this network simplifies the 3D imaging process by transferring it into a 2D form so that only a 2D system matrix is needed in the projection-to-image domain transfer thereby reducing the memory requirement. By decoupling the 3D projection into an independent 2D problem, significant computation time can be saved compared with 3D projection and back-projection in iterative methods. Finally, the reconstruction using the trained network can be completed with good speed using currently available parallel-computing power. These advantages could be greatly beneficial to real-time applications.
In this work, we use dental CBCT data to confirm the effectiveness of our method of reconstructing a certain slice of a scanned object. We do not consider the issue of metal artefacts in this work. Our group is working on restraining metal artifacts as a separate problem [30]. For future work, we plan to combine our work together and further optimize the network for 3D volumes. We will extend the method to other CT scan geometries as well.
A free-geometry cone beam CT and its FDK-type reconstruction
. J. X-Ray Sci. Technol., 15(3), 157-167 (2007)Epidemiological research on radiation-induced cancer in atomic bomb survivors
, J. Radiat. Res., 57(Suppl 1), i112-i117 (2016). doi: 10.1093/jrr/rrw005The use of computed tomography in pediatrics and the associated radiation exposure and estimated cancer risk
, Jama Pediatr., 167(8), 700-707 (2013). doi: 10.1001/jamapediatrics.2013.311Scan time and patient dose for thoracic imaging in neonates and small children using axial volumetric 320-detector row CT compared to helical 64-, 32-, and 16- detector row CT acquisitions
. Pediatr. Radiol., 40(3), 294-300 (2010). doi: 10.1007/s00247-009-1436-xInnovations in CT dose reduction strategy: Application of the adaptive statistical iterative reconstruction algorithm
. Ajr Am J. Roentgenol, 194(1), 191-199 (2010). doi: 10.2214/AJR.09.2953Iterative reconstruction technique for reducing body radiation dose at CT: Feasibility study
. Ajr Am J Roentgenol, 193(3), 764-771 (2009). doi: 10.2214/AJR.09.2397Statistical image reconstruction for polyenergetic X-ray computed tomography
, IEEE T. Med. Imaging, 21(2), 89-99 (2002). doi: 10.1109/42.993128Statistical model based iterative reconstruction (MBIR) in clinical CT systems: Experimental assessment of noise performance
, Med. Phys., 41(4) (2014). doi: 10.1118/1.4867863Prior image constrained compressed sensing: Implementation and performance evaluation
, Med. Phys., 39(1), 66-80 (2012). doi: 10.1118/1.3666946Prior image constrained compressed sensing (PICCS): A method to accurately reconstruct dynamic CT images from highly undersampled projection data sets
, Med. Phys., 35(2), 660-663 (2008). doi: 10.1118/1.28364233D feature constrained reconstruction for low dose CT imaging
, IEEE T. Circui. Systems Vid. Technol., 28(5), 1232-1247 (2016). doi: 10.1109/TCSVT.2016.2643009Artifact suppressed dictionary learning for low-dose CT image processing
, IEEE T. Med. Imaging 33(12), 2271-2292 (2014). doi: 10.1109/TMI.2014.2336860Discriminative Feature Representation to Improve Projection Data Inconsistency for Low Dose CT Imaging
, IEEE T. Med. Imaging, 36(12), 2499-2509 (2018). doi: 10.1109/TMI.2017.2739841Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization
, Phys. Med. Biol. 53(17), 4777-4807 (2008). doi: 10.1088/0031-9155/53/17/021GPU-based fast low-dose cone beam CT reconstruction via total variation
, J. X-ray Sci. Technol. 78(3), 139-154 (2010). doi: 10.3233/XST-2011-0283Deep learning
, Nature, 521(7553), 436 (2015). doi: 10.1038/nature14539Bilinear CNN Models for Fine-grained Visual Recognition
, IEEE I. Conf. Comp. Vis. 1449-1457 (2015). doi: 10.1109/ICCV.2015.170Deep learning for image-based cassava disease detection
. Front. Plant Sci., 8 (1852) (2017). doi: 10.3389/fpls.2017.01852A deep convolutional neural network using directional wavelets for low-dose X-ray CT reconstruction
, Med. Phys. 44(10), e360-e375 (2017). doi: 10.1002/mp.12344Low-Dose CT with a Residual Encoder-Decoder Convolutional Neural Network (RED-CNN)
, IEEE T. Med. Imaging, 99, 1-1 (2017). doi: 10.1109/TMI.2017.2715284Low-dose CT via convolutional neural network
, Biomed. Opt. Express, 8(2), 679-694 (2017). doi: 10.1364/BOE.8.000679Low-dose CT image denoising using a generative adversarial network with wasserstein distance and perceptual loss
. IEEE T. Med. Imaging, 37(6), 1348-1357 (2018). doi: 10.1109/TMI.2018.2827462Convolutional neural network based metal artifact reduction in X-ray computed tomography
. IEEE T. Med. Imaging, 37(6), 1370-1381 (2018). doi: 10.1109/TMI.2018.2823083Framing u-net via deep convolutional framelets: Application to sparse-view CT
. IEEE T. Med. Imaging, 37(6), 1418-1429 (2018). doi: 10.1109/TMI.2018.2823768LEARN: Learned experts' assessment-based reconstruction network for sparse-data CT
. IEEE T. Med. Imaging, 37(6), 1333-1347 (2018). doi: 10.1109/TMI.2018.2805692U-Net: Convolutional networks for biomedical image segmentation
. International Conference on Medical Image Computing and Computer-Assisted Intervention, 9351, 234-241 (2015). doi: 10.1007/978-3-319-24574-4_28A practical image reconstruction and processing method for symmetrically off-center detector CBCT system
, Nucl. Sci. Tech., 24(4), 17-22 (2013).Optimal short scan convolution reconstruction for fan beam CT
, Med. Phys., 9(2), 254-257 (1982). doi: 10.1118/1.595078Statistical image reconstruction for low-dose CT using nonlocal means-based regularization, Comput
. Med. Imaging Graph, 38(6), 423-435 (2014). doi: 10.1016/j.compmedimag.2014.05.002Optimize interpolation-based MAR for practical dental CT with deep learning
,