Adaptive-weighted total variation minimization for sparse data toward low-dose x-ray computed tomography image reconstruction

Yan Liu; Jianhua Ma; Yi Fan; Zhengrong Liang

doi:10.1088/0031-9155/57/23/7923

1. Introduction

In view of the negative effects of x-ray exposure to patients, minimizing the exposure risk has been one of the major endeavors in current computed tomography (CT) examinations (Brenner and Hall 2007, Einstein et al 2007). Up to now, many hardware-based optimal data-acquisition protocols have been proposed for dose reduction (Kalra et al 2003, Smith et al 2007, McCollough et al 2009). In the meanwhile, many software-based optimal technologies have been introduced to process low-dose data acquired from available CT scanners without hardware modification. It is well known that lowering the x-ray exposure (by lower x-ray tube current—measured by milliampere-seconds (mAs); lower x-ray tube voltage—measured by kilovoltage-peak (kVp); or less projection views—measured by the number of views per rotation, i.e. data sparsity) will unavoidably increase the data noise and/or the data inconsistence associated with the sparsity. As a result, high-quality diagnostic CT images cannot be yielded if no adequate handling of the data noise and/or the data inconsistence is applied during image reconstruction. Various image processing and reconstruction methods with noise suppression capability for the purpose of dose reduction have been reported (Li and Orchard 2000, Lu et al 2001, Li et al 2004).

As one of the major strategies for low-dose CT image reconstruction, restoring the line integrals from acquired low-mAs projection data has been explored (Lu et al 2001, Li et al 2004, Wang et al 2006). For example, Lu et al (2001) investigated the noise property of low-mAs CT sinogram data by analyzing repeatedly scanned data from a commercial CT scanner, and correspondingly presented a transform-based method to restore the line integrals from the low-mAs scans. Later, a nonlinear relationship between the variance and the mean of the acquired low-mAs sinogram data was determined by Li et al (2004), which provides a reasonable theoretical prediction of the variance of the projection data to facilitate the low-dose CT image reconstruction. Based on the relationship, Wang et al (2006) investigated a framework of image reconstruction from the low-mAs sinogram data by minimizing the penalized re-weighted least squares. The restoration principle can be applicable to acquired low-kVp scans. While they were not intended for low-dose CT applications, La Rivière et al (2006) presented an interesting penalized-likelihood sinogram restoration algorithm and Elbakri and Fessler (2002, 2003) reported a series of sophisticated CT image reconstruction algorithms in general.

Another major strategy, which could be applied for CT dose reduction, is to reduce the number of projection views per rotation around the body. In current CT exanimations, several hundred or even over a thousand of projections per rotation are acquired. Reducing the number by a half would cut the radiation by a half. In 2006, Donoho proposed the concept of ompressed ensing (CS) (Donoho 2006), which proves that an image of sparse signals could be satisfactorily reconstructed from far less measurements than what is usually considered necessary, according to the Nyquist sampling theorem, when the associated transfer matrix of the sparse signals satisfy some conditions (such conditions were later described as the restricted isometry property (RIP) (Candès and Wakin 2008)). In the same year of 2006, Candès et al introduced the CS concept into the K-space for image reconstruction in magnetic resonance imaging by solving the l₁-norm optimization problem (Candès et al 2006). Furthermore, if a previous image with high similarity to the to-be-estimated image can be introduced into the cost function, the data samples for satisfactory image reconstruction can be further reduced (Chen et al 2008, Ma et al 2011, Xu et al 2012, Liu et al 2011). However, for CT image reconstruction, particularly in low-dose CT applications, the associated transfer matrix of sparse signals in the transfer domain is less likely to have the RIP (Sidky et al 2006, Sidky and Pan 2008). Furthermore, except for parallel-beam geometry, the central slice theorem, which describes the direct connection between the projection data and the transformed data in the frequency domain in tomographical imaging, may not exist for nonparallel geometries such as fan and cone-beam geometries and, therefore, an exact implementation of the CS theorem for low-dose CT may not be feasible. Alternative means to reduce the number of projections seems to be needed.

In 1992, Rudin et al reported that the total variation (TV) norm of the to-be-estimated solution is essentially the l₁-norm of derivatives, and they further showed that this norm can be utilized to address the ill-posed image restoration problem (Ruding et al 1992). In 2006, Sidky et al adapted the concept of TV minimization to consider the piecewise constant or sparse source distribution and formulated an innovative algorithm, called TV-POCS (projection onto convex sets), to perform CT image reconstruction from sparse-sampled or sparse-view projection data (Sidky et al 2006). Later in 2008, Sidky et al presented an updating algorithm, i.e. the adaptive-steepest-descent-POCS (ASD-POCS) (Sidky and Pan 2008), for TV minimization with improved robustness against the cone-beam artifacts from sparse or limited projection views in comparison to other classical methods, e.g., the well-known EM algorithm. This ASD-POCS algorithm, simply called TV-POCS hereafter, can be considered as a new attempt to reconstruct images of sparse signals from under-sampled projection data for CT applications. Although the images reconstructed by the TV-POCS algorithm from sparse-sampled data are close to the true source distributions, over-smoothing in the reconstructed image is frequently seen due to the assumption of isotropic edge property in calculating the TV term. Recently, a TV-based edge preserving (EPTV) model (Tian et al 2011) was proposed to address the issue of the original TV, and was claimed to preserve edges by bringing in different weights in the TV term from edges and constant areas of the to-be-estimated image.

In this paper, different from the EPTV model, we consider the anisotropic (rather than isotropic) edge property of an image and propose a novel adaptive-weighted TV (AwTV) model for low-dose CT image reconstruction from sparse-sampled projection data. In order to achieve a reasonable balance between resolution and contrast-to-noise ratio in the reconstruction, the associated weights in the AwTV model are expressed as an exponential function, which can be adaptively adjusted with the local image-intensity gradient for the purpose of preserving the edge details. Inspired by the TV-POCS implementation (Sidky et al 2006, Sidky and Pan 2008), a similar implementation, called AwTV-POCS, is developed to minimize the AwTV with subjection to data and other constraints for the purpose of dose reduction via CT image reconstruction from sparse data.

The remainder of this paper is organized as follows. In section 2, the AwTV model and its associated cost function are presented, and then the POCS-based image reconstruction algorithm for solving the constrained AwTV minimization problem is described. In section 3, experimental results are reported. Finally, discussions and conclusions are given in section 4.

2. Method

In this section, two CT imaging models with the conventional TV and the presented AwTV minimizations are introduced respectively, and then the corresponding optimization strategy with POCS, i.e. the TV-POCS and AwTV-POCS algorithms, is described in detail.

2.1. CT imaging model with the conventional TV minimization

For CT image reconstruction from sparse-view or sparse-sampled data, the classic filtered back-projection (FBP) method always suffers from notable artifacts due to the ill condition of the measured data (Sidky et al 2006, Sidky and Pan 2008). To mitigate the ill condition, a satisfactory CT image may be yielded from sparse-viewed data by solving the following constrained optimization problem (Sidky et al 2006, Sidky and Pan 2008):

$\begin{equation} \mathop {\min }\limits_{\mu \ge 0} \| \mu \|_{{\rm TV}} \;\,\,{\rm subject}\;{\rm to}\,\,\;| {p - A\mu } | \le \varepsilon , \end{equation} \tag{ 1 }$

where the TV of the to-be-reconstructed image, i.e. ||μ||_TV, is defined as

$\begin{equation} \| \mu \|_{{\rm TV}} = \sum\limits_{s,t} {\sqrt {( {\mu _{s,t} - \mu _{s - 1,t} } )^2 + ( {\mu _{s,t} - \mu _{s,t - 1} } )^2 } } , \end{equation} \tag{ 2 }$

where μ denotes the vector of attenuation coefficients of the object with $\mu \in \mathbb{R}^M$ , wherein the object is discretized on a two-dimensional (2D) grid of M image elements or voxels, s and t are the indices of the location of the attenuation coefficients, p represents the linearized or log-transformed projections data, A is the system matrix which depends on the projection geometry (Joseph 1982), and its elements are usually modeled as the intersecting lengths of a ray path with the associated voxels on the path. Because many factors may cause inconsistency between the measurements and the desired data conditions, such as missing data and presence of noise in the measurements, the inequality constraint in (1) is used to control the data fidelity with an error tolerance factor ε. In practice, with the expectation of the l₁-norm measure between the acquired and the desired projection data under the tolerance of ε, an optimal solution within the feasible region of minimizing the TV term may be found by the use of an optimization strategy (Sidky et al 2006, Sidky and Pan 2008).

2.2. CT imaging model with the presented AwTV minimization

In theory, the conventional TV term in the cost function (1) is based on the assumption of piecewise constant distribution for the desired image, and the assumption often leads to the associated cost function optimization suffering from over-smoothing on the edges in the reconstructed images. Meanwhile, the edge details are vital information for diagnosis in clinic. In order to mitigate the over-smoothing of edges in the conventional TV minimization, a new imaging model with AwTV minimization is proposed as follows:

$\begin{equation} \mathop {\min }\limits_{\mu \ge 0} \| \mu \|_{{\rm AwTV}} \;\,\,{\rm subject}\;{\rm to}\,\,\;| {p - A\mu } | \le \varepsilon \end{equation} \tag{ 3 }$

where the AwTV of the to-be-reconstructed image, i.e. ||μ||_AwTV, is defined as

$\begin{equation} \fl \| \mu \|_{{\rm AwTV}} = \sum\limits_{s,t} {\sqrt {w_{s,s - 1,t,t} ( {\mu _{s,t} - \mu _{s - 1,t} } )^2 + w_{s,s,t,t - 1} ( {\mu _{s,t} - \mu _{s,t - 1} } )^2 } } , \end{equation} \tag{ 4 }$

$\begin{eqnarray} &&\fl w_{s,s - 1,t,t} = \exp \left[ { - \left( {\frac{{\mu _{s,t} - \mu _{s - 1,t} }}{\delta }} \right)^2 } \right]\;\quad {\rm and}\quad \;w_{s,s,t,t - 1} = \exp \left[ { - \left( {\frac{{\mu _{s,t} - \mu _{s,t - 1} }}{\delta }} \right)^2 } \right],\nonumber\\ \end{eqnarray} \tag{ 5 }$

where δ in the weights (w_{s, s − 1, t, t} and w_{s, s, t, t − 1}) is a scale factor which controls the strength of the diffusion during each iteration (Perona and Malik 1990, Wang et al 2008).

By the form of AwTV in (4), it is possible to fully consider the gradient of the desired image and also to include the change of local voxel intensities. Specifically, for a smaller change of voxel intensity, a stronger weight can be given, whereas for a larger change of voxel intensity, a weaker weight may be given. Through this diffusion-type weighting process, an adaptive smoothing is encouraged in reference to the difference between neighboring voxels' intensities. From the viewpoint of scale space in the diffusion framework, the AwTV of the desired image will no longer be linearly and uniformly calculated for each diffusion direction from a voxel; rather the calculation will be adaptive to the local information of the image with an exponential form. Intuitively, the AwTV model of (4) approaches to the conventional TV model of (2) as the weight goes to 1; thus, the TV model may be considered as a special case of the AwTV model when δ → ∞.

2.3. Brief review of the POCS strategy and the TV-POCS algorithm

The POCS strategy is a general iterative scheme to solve linear equations by successive and repeated applications of several projection operators. This strategy was investigated by Sidkey and co-workers (Sidky et al 2006, Sidky and Pan 2008) as a possible way to solve the constrained minimization problem of (1), named the TV-POCS algorithm. Two independent operating steps are involved in the implementation of their algorithm. In the first step, an initially estimated image is updated iteratively by the POCS strategy. This step is basically the operation of the well-known algebraic reconstruction technique (ART). For illustration purpose, we adopt the simultaneous ART (SART) (Andersen and Kak 1984, Jiang and Wang 2001) to solve the under-determined linear system of (1). More specifically, the SART algorithm is used to yield an image estimate from the initially estimated image by minimizing the distance between the measured and estimated projection data. The associative update scheme can be described as follows:

$\begin{equation} \mu _j^{(k + 1)} = \mu _j^{(k)} + \frac{\omega }{{A_{ + ,j} }}\sum\limits_{i = 1}^M {\frac{{A_{i,j} }}{{A_{i, + } }}( {p_i - \overline p _i (\mu ^{(k)} )} )} \end{equation} \tag{ 6 }$

$\begin{equation} A_{i, + } = \sum\limits_{j = 1}^N {A_{i,j} } \;{\rm for}\;i = 1, \ldots ,M, \end{equation} \tag{ 7 }$

$\begin{equation} A_{ + ,j} = \sum\limits_{i = 1}^M {A_{i,j} } \;{\rm for}\;j = 1, \ldots ,N, \end{equation} \tag{ 8 }$

$\begin{equation} \bar p(\mu ) = A\mu , \end{equation} \tag{ 9 }$

where A_{i, j} is an M × N system matrix according to the projection geometry (Joseph 1982) (M was defined before as the total number of image voxels and N is the total number of data samples). ω is a relax parameter for updating the current estimate of the image. k indicates the iterative number. Through the SART algorithm, the initially estimated image is updated iteratively to fulfill the data constraints and an intermediate image is yielded for further update by the second step below.

The second step of the TV-POCS algorithm updates iteratively the intermediate image estimated from the above first step to minimize the TV of the to-be-estimated image. Although many numerical methods can be implemented in this second step to solve the TV minimization problem, such as the TV superiorization strategy (Penfold et al 2010) and the surrogate TV term strategy (Dfrise et al 2011), we adapt the same gradient decent strategy as described in the TV-POCS algorithm (Sidky et al 2006, Sidky and Pan 2008) to avoid any bias on the results that may be caused by using different numerical calculating methods for the purpose of comparing the conventional TV and the presented AwTV models. In order to achieve a reasonable optimization solution, some stop criteria should be considered in the iterative process (to be discussed later).

2.4. Presentation of the AwTV-POCS algorithm

Due to the nonlinear form of the AwTV with respect to the image intensity, it is numerically difficult to utilize directly the second-order derivative for the purpose of effectively minimizing the objective function (3). Inspired by the optimization strategy as described in Ma et al (2010), the weights can be pre-computed at current iteration for the AwTV minimization at the next iteration. By this strategy, the gradient descent technique is adapted to minimize the AwTV of the SART-estimated intermediate image where only the first-order derivative of the AwTV term with respect to each voxel value is needed, which can be approximately expressed as

$\begin{eqnarray} &&\fl \frac{{\partial \| \mu \|_{{\rm AwTV}} }}{{\partial \mu _{s,t} }} \approx \frac{{2w_{s,s - 1,t,t} ( {\mu _{s,t} - \mu _{s - 1,t} } ) + 2w_{s,s,t,t - 1} ( {\mu _{s,t} - \mu _{s,t - 1} } )}}{{\sqrt {\xi + w_{s,s - 1,t,t} ( {\mu _{s,t} - \mu _{s - 1,t} } )^2 + w_{s,s,t,t - 1} ( {\mu _{s,t} - \mu _{s,t - 1} } )^2 } }} \nonumber\\ &&+ \frac{{ - 2w_{s + 1,s,t,t} ( {\mu _{s + 1,t} - \mu _{s,t} })}}{{\sqrt {\xi + w_{s + 1,s,t,t} ( {\mu _{s + 1,t} - \mu _{s,t} } )^2 + w_{s + 1,s + 1,t,t - 1} ( {\mu _{s + 1,t} - \mu _{s + 1,t - 1} } )^2 } }} \nonumber\\ &&+ \frac{{ - 2w_{s,s,t + 1,t} ( {\mu _{s,t + 1} - \mu _{s,t} } )}}{{\sqrt {\xi + w_{s,s,t + 1,t} ( {\mu _{s,t + 1} - \mu _{s,t} } )^2 + w_{s,s - 1,t + 1,t + 1} ( {\mu _{s,t + 1} - \mu _{s - 1,t + 1} } )^2 } }} \end{eqnarray} \tag{ 10 }$

where ξ is a relax parameter introduced to avoid the denominator going to zero.

Similar to the ASD-POCS approach (Sidky and Pan 2008), the optimization of the objective function (3) is implemented by the following iterative scheme, named the AwTV-POCS algorithm. For an image with the array size of S × T, each of the general iterations of I cycles includes J iteration cycles of POCS operation and K iteration cycles of AwTV minimization by gradient descent. The relax parameter ω in the POCS operation decreases as the iteration increases and the step-size τ of the gradient descend also decreases as the iteration increases. Summarily, the pseudo-code for the presented AwTV-POCS algorithm is listed as follows:

$\begin{equation*} \begin{array}{@{}l@{}} \;\;{\rm 1:}\,\,\,\,\,{\rm initial:}\;\mu _{s,t}^{(0)} : = 1;\;s = 1,2,...,S,t = 1,2,...,T; \\ \;\;{\rm 2:}\,\,\,\,\,{\rm initial:}\;\delta ,\varepsilon ,\tau ;\omega = 1{\rm ;} \\ \;\; {\rm 3:}\,\,\,\,\,{\rm while (stop\, criterion \,\, is\,\, not\,\, met) } \\ \;\; {\rm 4:}\,\,\,\,\quad {\rm for\,\, j} = 1,2,...,J{\rm ;}\ {(POCS)} \\ \;\; {\rm 5:}\qquad {\rm if\,\, j} = = 1; \\ \;\; {\rm 6:}\,\,\,\,\, \qquad\mu _{s,t}^{(j)} : = { SART}\big(\mu _{s,t}^{(0)} ,\omega \big); \\ \;\; {\rm 7: }\qquad {\rm else }\,\,\mu _{s,t}^{(j)} : = { SART}\big(\mu _{s,t}^{(j - 1)} ,\omega \big);\,\,\,\,\,\,\,\,\,\,s = 1,2,...,S,t = 1,2,...,T; \\ \;\; {\rm 8:}\qquad {\rm end\,\, if} \\ \;\; {\rm 9:}\,\,\,\,\quad{\rm end\,\, for} \\ {\rm 10:}\,\,\,\,\quad{\rm if}\,\,\;\mu _{s,t}^{(J)} > 0,\;{then}\,\,\mu _{s,t}^{(J)} = \mu _{s,t}^{(J)} ;{\rm }\,\,\,\,\,\,\,\,\,\,s = 1,2,...,S,t = 1,2,...,T; \\ {\rm 11:}\qquad {\rm else }\,\,\mu _{s,t}^{(J)} : = 0;\;\,\,\,\,\,\,\,\,\,\,\,\quad s = 1,2,...,S,t = 1,2,...,T{\rm ;} \\ {\rm 12:}\,\,\,\,\quad{\rm end\,\, if} \\ {\rm 13:}\,\,\,\,\quad {dp}: = \big\| {A\mu _{s,t}^{(J)} - A\mu _{s,t}^{(0)} } \big\|_2 ;\;\qquad \quad s = 1,2,...,S,t = 1,2,...,T; \\ {\rm 14:}\,\,\,\,\quad d\mu _{{\rm SART}} : = \big\| {\mu _{s,t}^{(J)} - \mu _{s,t}^{(0)} } \big\|_2 ;\; \qquad \quad s = 1,2,...,S,t = 1,2,...,T; \\ {\rm 15:}\,\,\,\,\quad{\rm }w_{s,s - 1,t,t} = \exp \left[ { - \left( {\frac{{\mu _{s,t}^{(J)} - \mu _{s - 1,t}^{(J)} }}{\delta }} \right)^2 } \right]{\rm and }\\ \ms \qquad w_{s,s,t,t - 1} = \exp \left[ { - \left( {\frac{{\mu _{s,t}^{(J)} - \mu _{s,t - 1}^{(J)} }}{\delta }} \right)^2 } \right]; \\ {\rm 16:}\qquad {\rm for }\,\,k = 1,2,...,K;\qquad{\rm (Aw\,\,TV\,\, gradient\,\, descent)} \\ {\rm 17:}\,\,\,\,\, \qquad\mu _{s,t}^{(J + k)} : = \mu _{s,t}^{(J + k - 1)} - d\,\mu _{{\rm SART}} \cdot \tau \cdot \frac{{\nabla \big\| {\mu _{s,t}^{(J + k - 1)} } \big\|_{{\rm AwTV}} }}{{\big| {\nabla \big\| {\mu _{s,t}^{(J + k - 1)} } \big\|_{{\rm AwTV}} } \big|}}; \\ {\rm 18:}\qquad {\rm end\,\, for} \\ {\rm 19:}\qquad {\rm if}\,\,dp < \varepsilon ; \\ {\rm 20:}\,\,\,\,\, \qquad\omega : = 0.995 \times \omega ; \\ {\rm 21:}\qquad {\rm end\,\, if} \\ {\rm 22:}\qquad \mu _{s,t}^{(0)} : = \mu _{s,t}^{(J + K)} ; \\ {\rm 23:}\,\,\,\,\,{\rm calculate\,\, the\,\, criterion;} \\ {\rm 24:}\,\,\,\,\, \tau = \tau *0.995; \\ {\rm 25:}\,\,\,\,\,{\rm end\,\, if\,\, stop\,\, criterion\,\, is\,\, satisfy} \\ \end{array} \end{equation*}$

In line 1, an initial estimate of the to-be-reconstructed image is set to be uniform with voxel value of 1. In line 2, four parameters, δ, ε, ω and τ, are initialized before the iteration starts. Specifically, the error tolerance ε is initialized based on the noise level of the data. The initial value of δ in the weights of the AwTV term will be discussed later in section 3, and so are the parameters ω and τ. Each outer loop (lines 3–24) is performed by two separated iteration steps, i.e. the POCS (or the SART) (lines 4–12) and the gradient descent for the AwTV minimization (lines 16–18). The weights are pre-computed using latest image estimation μ^(J)_{s, t} in line 15. By setting the weight to 1, the above pseudo-code for the presented AwTV-POCS algorithm is applicable to the TV-POCS algorithm (Sidky et al 2006, Sidky and Pan 2008). A brief discussion on the stop criterion for both TV-POCS and AwTV-POCS implementations is given below.

2.5. Stop criterion for implementation of the AwTV-POCS and TV-POCS algorithms

In order to ensure the solution of the objective function (3) obtained by the above-presented AwTV-POCS implementation is an optimal estimate, the associative Karush–Kuhn–Tucker (KKT) condition should be satisfied, similar to that in the TV-POCS implementation, as reported in Sidky and Pan (2008). For the TV-POCS algorithm implementation, the KKT condition can be satisfied with an indicator factor c_α = −1.0 where c_α is defined as

$\begin{equation} c_\alpha = \frac{{\vec d_{{\rm TV}} \cdot \vec d_{{\rm data}} }}{{| {\vec d_{{\rm TV}} } | \cdot | {\vec d_{{\rm data}} } |}}, \end{equation} \tag{ 11 }$

where $\vec d_{{\rm TV}}$ is a vector of derivative of the TV term, and $\vec d_{{\rm data}}$ is a vector of derivative of the data constraints using the Lagrangian multiplier. For the presented AwTV-POCS algorithm implementation, a similar indicator factor can also be used to describe the KKT condition for an optimal estimate. As stated in Sidky and Pan (2008), c_α = −1.0 is a necessary condition for an optimal solution for the TV minimization with sufficient data constraints. The necessary condition of c_α = −1.0 may not be reached unless a great number of iteration cycles are executed, which may not be practical. In the AwTV-POCS algorithm, we discovered that very small or imperceptible changes were notable in the reconstructed images when c_α went below −0.6. Thus, in our algorithm implementation, we used c_α < −0.6 as the stop criterion in line 23 of the above pseudo-code.

To evaluate the differences between the resulting images from the AwTV-POCS and TV-POCS approaches, several computer simulation and phantom experiment studies were performed and reported in the following section.

3. Results

For computer simulation studies, a modified Shepp–Logan mathematical phantom was designed based on the mass attenuation coefficients of different tissues in the objects in the phantom. For phantom experiment studies, two sets of cone-beam projection data were acquired from two different types of phantom models, respectively, using a commercial CT scanner. One phantom model is the CatPhan® 600 phantom and the other is an anthropomorphic head phantom. The TV-POCS algorithm (Sidky et al 2006, Sidky and Pan 2008) was implemented as the baseline or reference for comparison purpose. In addition, the EPTV model with incorporation of the POCS strategy, or the EPTV-POCS algorithm, was implemented by using the similar scheme as the TV-POCS algorithm.

3.1. Computer simulation studies

For simplicity, without loss of generality, a parallel-beam CT imaging geometry was used for the purpose of measuring the gain of the AwTV minimization in comparison to the conventional TV minimization. This geometry was modeled with 1024 bins on a 1D detector for 2D image reconstruction. The distance between the centers of two neighboring detector elements or bins is 0.25 mm. Given the digital phantom, the noise-free transmission data were computed by the use of the Lambert–Beer law, $I_i = I_i^o \exp ( - \bar p_i )$ , where $\bar p_i$ is the line integral of the phantom intensity distribution along the ray i and I^o_i is the mean number of incident photons. Given the noise-free data, the noisy transmission data were simulated based on the assumption for the statistical model of the measurements (to be discussed later in section 3.1.2).

3.1.1. Design of a modified Shepp–Logan phantom and computation of line integrals

According to the mass attenuation coefficients as listed in table 1 for different tissues at 80 keV in Khan (1984), a modified Shepp–Logan phantom was carefully designed as shown in figure 1 for simulation studies. The dimensions of the phantom are 256 × 256 mm², consisting of 512 × 512 pixels.

**Figure 1.** A modified Shepp–Logan phantom with display window [0, 0.0034] mm⁻¹.
Download figure:
Standard image

Table 1. Mass attenuation coefficients and the relate densities for different tissues.

	Mass attenuation coefficients	The density of tissue
Body tissue	µ/ρ (m² kg⁻¹)	(kg m⁻³) in 20 °C
Air, dry	1.661 × 10⁻²	1.205
Water	1.835 × 10⁻²	1000
Muscle	1.822 × 10⁻²	1040
Fat	1.805 × 10⁻²	920
Bone	2.083 × 10⁻²	1850

With the parallel-beam imaging geometry, the noise-free sinogram can be computed by the line integration of the attenuation coefficients along the corresponding projection paths:

$\begin{equation} \bar p = A\mu . \end{equation} \tag{ 12 }$

A set of noise-free sinograms was computed with 1024 detector bins per view and several different numbers of projection views, i.e. 20, 40 and 60, at equal angular increment of 360° around the phantom.

3.1.2. Noise model

Although the compound Poisson model (Whiting 2002) is more accurate for description of the noise of the detected photon numbers in CT imaging, it is numerically challenging to implement this model for data noise simulation. Several reports have discussed the approximation of this model by the Poisson model (La Rivière et al 2006, Elbakri and Fessler 2002, Whiting 2002, Lasio et al 2007). Based on these reports, the CT transmission data can be assumed to be a Poisson-distributed quantum noise plus Gaussian-distributed electronic noise (La Rivière et al 2006, Xu and Tsui 2009, Ma et al 2012). In our simulation, we assumed that the detected photon number follows the Poisson process plus the electronic noise background (Xu and Tsui 2009, Ma et al 2012); in other words,

$\begin{equation} \hat I_i = {\rm Poisson} (I_i) + {\rm Gaussian} \big(m_{ic} ,\sigma _{ie}^2 \big) \end{equation} \tag{ 13 }$

where $\hat I_i$ is the simulated noisy transmission datum, I_i is the mean number of photons or the noise-free transmission datum for detector bin i at a projection view, and m_ic and σ²_ie are the mean and variance of the electronic noise, respectively, for detector bin i. By system calibration, the mean value is usually set to zero, m_ic =0, and the variance σ²_ie ≈ 10 was found in some clinical CT scanners (Ma et al 2012).

Based on the noise model (13), the noisy transmission data can be simulated as follows. Given the modified mathematical Shepp–Logan phantom of figure 1, the line integral $\bar p_i$ was computed along the projection path or ray i. By the Lambert–Beer law, $I_i = I_i^o \exp ( - \bar p_i )$ , and the knowledge of I^o_i ≈1.0 × 10⁵ in routine clinical studies (Wang et al 2006, Dfrise et al 2011, Lasio et al 2007, Ma et al 2012, O'Sullivan and Benac 2007), the mean I_i was calculated. Given the mean and a Poisson random number generator, the first term of Poisson (I_i) in (13) was obtained. The second term in (13) was obtained by the use of a Gaussian random number generator with zero mean and variance of 10. After the noisy transmission datum $\hat I_i$ was simulated from the noise-free transmission datum by sampling the Poisson variable with mean I_i and the Gaussian variable with mean of zero and variance of 10, the corresponding noisy projection datum p_i was obtained by the logarithm transform of the noisy transmission datum:

$\begin{equation} p_i = \log \left( {\frac{{I_i^o }}{{\hat I_i }}} \right). \end{equation} \tag{ 14 }$

3.1.3. Parameter selection

To reconstruct the image of the Shepp–Logan phantom {μ_j} of figure 1 from the above-simulated noisy sinogram data {p_i}, we followed the description in Sidky et al (2006) and Sidky and Pan (2008) to implement their TV-POCS algorithm. In a similar way to implement our AwTV-POCS algorithm, the parameter of δ in the weight of (5) shall be determined. By some experimental trials, the value of this scale factor was set to 0.6 × 10⁻² to simulate the strength of the diffusion model (Perona and Malik 1990, Wang et al 2008). For the EPTV-POCS method, the scale factor was also set to 0.6 × 10⁻²for comparison purpose. In addition to this parameter, another factor of ξ = 1.0 × 10⁻⁵ in (10) was set to ensure that the denominators will not go to zero. For the TV-POCS, EPTV-POCS and AwTV-POCS algorithms, each of the general iterations consisted of ten POCS iterations and ten gradient descent iterations. The strop criterion was discussed in section 2.5. The error tolerance ε for the data constraint will be discussed later. The initial values of ω and τ were set as 1 and 0.7 × 10⁻⁵, respectively, similar to that in Sidky et al (2006) and Sidky and Pan (2008).

3.1.4. Visualization-based evaluation

In this evaluation study, two numerical experiments were performed: (1) image reconstruction from noise-free data and (2) image reconstruction from noisy data. In each numerical experiment, images were reconstructed from the data simulated with 20, 40 and 60 projection views, respectively, by the use of the AwTV-POCS algorithm in comparison to the TV-POCS and EPTV-POCS algorithms.

Noise-free cases.

Figure 2 shows the results from the noise-free experiment. It can be observed that the images reconstructed by the TV-POCS, EPTV-POCS and AwTV-POCS are visually much better than the results of FBP in all the cases of 20, 40 and 60 projection views. The difference between the images from the TV-POCS, EPTV-POCS and AwTV-POCS can be observed by using a narrow grayscale display window as shown in figure 3. Regions of interest (ROIs) in figure 3 were selected to examine some details of the reconstructed images. The corresponding ROI results are shown in figure 4. It can be seen that, in the case of 20 projection views, the results of the AwTV-POCS and EPTV-POCS algorithms demonstrate some gains in terms of edge preserving. Meanwhile, the gains gradually disappeared as more projection views were used. It is worth noting that a little over-enhancement at the edges in the EPTV-POCS reconstruction can be observed as shown in the second row of figure 4, which is consistent with the results published in Tian et al (2011). From 60 projection views, all the TV-POCS, EPTV-POCS and AwTV-POCS algorithms generated good quality images with high similarity.

**Figure 3.** The images reconstructed by the TV-POCS (top row), EPTV-POCS (middle row) and AwTV-POCS (bottom row) algorithms from 20 (left column), 40 (middle column) and 60 (right column) projection views, respectively. The display window is [0.0013, 0.0018] mm⁻¹.
Download figure:
Standard image

**Figure 4.** The ROIs of the images reconstructed by the TV-POCS (top row), EPTV-POCS (middle row) and AwTV-POCS (bottom row) algorithms from 20 (left column), 40 (middle column) and 60 (right column) projection views, respectively. The display window is [0.0013, 0.0018] mm⁻¹.
Download figure:
Standard image

To further visualize the difference between the three approaches in the cases of 20, 40 and 60 projection views, horizontal profiles of the resulting images were drawn across the 410th row for each case and are shown in figures 5–7, where the corresponding profile from the true phantom image is given for reference. In each case, three ROIs were selected to inspect the difference of the results. Figures 5(b)–(d) show that the AwTV-POCS and EPTV-POCS algorithms can achieve better profiles matching with the ideal ones than the TV-POCS algorithm. And the gain from the AwTV-POCS is observable as compared to the results of the EPTV algorithm. As the number of projection views increased, the results of the TV-POCS, EPTV-POCS and AwTV-POCS algorithms approached to that of the true phantom image. However, the improved edge preservation by the AwTV-POCS is still visible in the results from 60 projection views, see figure 7.

**Figure 5.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 20 projection views of noise-free data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

**Figure 6.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 40 projection views of noise-free data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

**Figure 7.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 60 projection views of noise-free data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

The above noise-free simulation studies concurred with our previous discussion in section 2.2 about the advantage of using adaptive weights for edge preservation in the AwTV model as compared to the conventional TV and EPTV models. To further support our previous discussion, studies on noisy projection data were performed and reported in the next section.

Noisy cases.

In this section, image reconstruction from noisy data was performed to analyze the robustness to noise of the AwTV-POCS algorithm. For all the AwTV-POCS, EPTV-POCS and TV-POCS algorithms, the values of the tolerance parameter ε were chosen to be 0.085, 0.082 and 0.078 for the 20, 40 and 60 projection views, respectively. A smaller ε value was chosen for a larger number of projection views by the reason that the constraints in (1) and (3) would be more restrictive for more data samples. Figure 8 shows that the FBP images have notable artifacts as compared to the images reconstructed by the TV-POCS, EPTV-POCS and AwTV-POCS algorithms from 20, 40 and 60 projection views of the noisy sinogram data.

**Figure 8.** The images reconstructed by the FBP (first row), TV-POCS (second row), EPTV-POCS (third row) and AwTV-POCS (fourth row) algorithms from 20 (left column), 40 (middle column) and 60 (right column) projection views of noisy sinogram data, respectively. The display window is [0, 0.0034] mm⁻¹.
Download figure:
Standard image

A narrow grayscale display window was presented to examine the differences among the results of the three latter approaches as shown in figures 9 and 10. Compared to the TV-POCS and EPTV-POCS algorithms, the AwTV-POCS algorithm preserved more edge details for 20 and 40 projection views and generated similar results for 60 projection views.

**Figure 9.** The images reconstructed by the TV-POCS (top row), EPTV-POCS (middle row) and AwTV-POCS (bottom row) algorithms from 20 (left column), 40 (middle column) and 60 (right column) projection views of noisy sinogram data, respectively. The display window is [0.0013, 0.0018] mm⁻¹.
Download figure:
Standard image

**Figure 10.** The ROIs of the images reconstructed by the TV-POCS (top row), EPTV-POCS (middle row) and AwTV-POCS (bottom row) algorithms from 20 (left column), 40 (middle column) and 60 (right column) projection views of noisy sinogram data, respectively. The display window is [0.0013, 0.0018] mm⁻¹.
Download figure:
Standard image

The horizontal profiles of the images reconstructed in the cases of 20, 40 and 60 projection views of noisy data along the 410th row are shown in figures 11–13, respectively, with the corresponding profile of the true phantom image as a reference. These profiles also show that the AwTV-POCS preserved the edge details better than the TV-POCS in the noisy cases for 20, 40 and 60 projection views, except for the display of figure 13(b) which shows similar performance. The profiles also show that the results of AwTV-POCS and EPTV-POCS strategy are very close but some gains from the present AwTV-POCS can be observed as shown in figures 11(b)–(d) and 12(b)–(d). These noisy simulation studies were consistent with our previous observations in the noise-free cases, and further concurred with our previous discussion in section 2.2 about the advantage of using the adaptive weights for edge preservation in the AwTV model as compared to the conventional TV model. With the same tendency as in the noise-free cases, the profiles in the noisy cases show that the reconstruction quality increased as the number of projection views increased. In the case of 60 projection views, the resulting images were close to the true phantom image by all the TV-POCS, EPTV-POCS and AwTV-POCS approaches.

**Figure 11.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 20 projection views of noisy data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

**Figure 12.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 40 projection views of noisy data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

**Figure 13.** Horizontal profiles (410th row) of the images reconstructed by different algorithms from 60 projection views of noisy data. Picture (a) shows the overall profiles. Pictures (b), (c) and (d) show the partial profiles of the three ROIs indicated in (a).
Download figure:
Standard image

For the purpose of focusing on the edge characterization of the AwTV model, quantitative evaluation using observer detection power and computer simulation data is given in appendix A of this paper.

3.2. Phantom experiment studies

To further realize the potential gain of the AwTV-POCS in comparison to the TV-POCS in more realistic cases, cone-beam data were acquired from two physical phantoms using a commercial CT scanner.

3.2.1. Experiment with the CatPhan® 600 phantom

An image slice of the CatPhan® 600 phantom is shown in figure 14. Cone-beam CT projection data were acquired by an Acuity simulator (Varian Medical System, Palo Alto, CA) (Wang et al 2008). The x-ray tube current was set at 80 mA and the duration of the x-ray pulse at each projection view was set to be 12 ms. A total of 634 projection views were acquired for a fully 360° rotation on a circular orbit. The distance of source-to-axis is 100 cm and source-to-detector distance is 150 cm. The voxel size in the reconstructed image is 0.776 × 0.776 × 0.776 mm³. The array size of the reconstructed image is 350 × 350 × 8. Sparse projection datasets can be extracted from the total 634 projection views. For example, 63, 79 and 158 views, respectively, were extracted which are evenly distributed over 360°. To ensure convergence to a stable solution, the parameter c_α was set as –0.6 for the AwTV-POCS algorithm and −0.5 for the TV-POCS algorithm. Two POCS iterations and 12 gradient descent iterations were performed in each general loop. The execution time for each general iteration step was around 45 s on a HP PC with Intel Xeon X5450 CPU and 24 gigabyte memory. The 3D AwTV term was defined similarly as the 2D AwTV term and can be expressed as

$\begin{eqnarray} &&\fl \| \mu \|_{{\rm AwTV} - 3{\rm D}} = \nonumber\\ &&\fl\selectfont $\sum\limits_{s,t,z} {\sqrt {w_{s,s - 1,t,t,z,z} (\mu _{s,t,z} - \mu _{s - 1,t,z} )^2 + w_{s,s,t,t,z,z - 1} (\mu _{s,t,z} - \mu _{s,t,z - 1} )^2 + w_{s,s,t,t - 1,z,z} (\mu _{s,t,z} - \mu _{s,t - 1,z} )^2 } }$} \end{eqnarray} \tag{ 15 }$

where z is the voxel's index along the z-axis direction. By setting the weight as 1, the conventional TV term is obtained. The reconstructed images are shown in figures 14 and 15. The reconstruction by the well-known Feldkamp–Davis–Kress (FDK) method with Hanning window at Nyquist frequency cutoff is shown as the reference image.

**Figure 15.** CatPhan® 600 phantom image reconstructions by different algorithms from 79 projection views. Column (a) shows the reconstruction by the FDK algorithm from the total 634 projection views as a reference. Column (b) shows the reconstruction by the FDK algorithm from 79 projection views. Column (c) shows the reconstruction by the AwTV-POCS algorithm from 79 projection views. Column (d) shows the reconstruction by the TV-POCS algorithm from 79 projection views. The bottom row shows the zoomed pictures. The display window of top row is [0, 0.024] mm⁻¹. The display window of bottom row is [0.008, 0.02] mm⁻¹.
Download figure:
Standard image

From figure 14, it is seen that both the AwTV-POCS and TV-POCS algorithms reconstructed much better images as compared to the result of the FDK method from 63 projection views. In addition, the result of the AwTV-POCS shows more details on the edges than the result of the TV-POCS as indicated by the arrows in figures 14(c) and (d). As the number of projection views increased, the visual difference on the results of the AwTV-POCS and TV-POCS algorithms became not significant except for some small difference between the spots as indicated by the arrows in figures 15(c) and (d). This observation is consistent with our previous conclusion in the Shepp–Logan numerical phantom simulation study.

3.2.2. Experiment with the anthropomorphic head phantom

An image slice of the anthropomorphic head phantom is shown in figure 16. Cone-beam projection data were acquired from the anthropomorphic head phantom by the same protocol as used for the CatPhan® 600 phantom study. In order to observe the difference between the results from the AwTV-POCS and TV-POCS algorithms, we extracted 79 and 158 projection views from the full views for sparse image reconstruction. An ROI was selected to inspect the fine structures of the reconstructed results as indicated in figure 16(a). The resulting image and the ROI observations are shown in figures 16 and 17.

**Figure 17.** Head phantom image reconstructions by different algorithms from 158 projection views. Column (a) shows the reconstruction by the FDK algorithm from the total 634 projection views as a reference. Column (b) shows the reconstruction by the FDK algorithm from 158 projection views. Column (c) shows the reconstruction by the AwTV-POCS algorithm from 158 projection views. Column (d) show the reconstruction by the TV-POCS algorithm from 158 projection views. The bottom row shows the zoomed pictures. The display window is [0, 0.03] mm⁻¹ for the first row and [0.01,0.03] mm⁻¹ for the second row.
Download figure:
Standard image

By inspecting the images reconstructed from 79 projection views as shown in figure 16, it can be seen that some fine structures of the soft tissue, such as the structures of ear, are lost for both AwTV and TV models due to the sparse projection views. Despite this, some gains from the AwTV model are notable at both the ear location and the cold spots as indicated in the figures 16(b) and (c). By comparison to the CatPhan® 600 phantom result of figure 15, the loss of the fine structures in the results of head phantom as shown in figure 16 indicates that the measurements required for sparse image reconstruction should be associated with the structure of the signals. Intuitively, more projection views are needed to recover the fine structures in the head phantom. Based on this intuition, we performed another experiment by the use of 158 projections. Figure 17 shows the reconstructed results from 158 projections. Significant improvement in recovering the small structures is seen by the use of more projections for both TV and AwTV models. The gain by the AwTV model is also notable as indicated by right lower circle in figures 17(c) and (d). These results are consistent with those from the Shepp–Logan phantom simulation study. This then suggests that the presented AwTV model can preserve the edge details better than the TV model for image reconstruction from sparse-viewed projections.

3.3. Quantitative evaluation

In this section, two metrics for quantitative evaluation were used to show the performance of the AwTV-POCS algorithm in comparison to the TV-POCS algorithm.

3.3.1. Full-width at half-maximum measurement

To quantitatively analyze the gain by using the AwTV model in comparison to the conventional TV model in the POCS framework, the full-width at half-maximum (FWHM) of two spots (a hot spot and a cold spot) of the CatPhan® 600 phantom as indicated in figures 14 and 15 is calculated. Figures 18 and 19 show the profiles passing through the two spots in the images reconstructed from 63 and 79 projection views, respectively. A Gaussian-like function is used to fit the profiles as indicated in the figures, then the FWHM of the fitted Gaussian broadening kernel is calculated by 2.35σ. From figures 18 and 19, we can observe that the peak value of the result from the conventional TV-POCS algorithm is lower than that from the AwTV-POCS algorithm, which indicates that the AwTV-POCS algorithm can gain in resolution. The FWHM values of the reconstructions from 63 and 79 projection cases by TV-POCS and AwTV-POCS algorithms are shown in table 2. Both of the cases reveal that the AwTV-POCS algorithm can produce a smaller FWHM value on both hot and cold spots compared to the TV-POCS strategy, which is consistent with our observation about the profile comparison.

**Figure 18.** Horizontal profiles of the CatPhan® 600 phantom images reconstructed by different algorithms from 63 projection views of noisy data. Picture (a) shows the profiles across the cold spot (416th row, 130th–150th column). Picture (b) shows the profiles across the hot spot (and 139th row, 195th–215th column).
Download figure:
Standard image

**Figure 19.** Horizontal profiles of the CatPhan® 600 phantom images reconstructed by different algorithms from 79 projection views of noisy data. Picture (a) shows the profiles across the cold spot (416th row, 130th–150th column). Picture (b) shows the profiles across the hot spot (and 139th row, 195th–215th column).
Download figure:
Standard image

Table 2. The FWHM values of the cold and hot spots.

63 projection views	Cold spot	Hot spot
TV-POCS	5.3580	4.7024
AwTV-POCS	4.8857	4.6690
79 projection views	Cold spot	Hot spot
TV-POCS	6.3168	5.3815
AwTV-POCS	4.8927	5.2922

3.3.2. Resolution–noise tradeoff study on the AwTV model

The parameter δ of the weight w_{s, s', t, t'} in the AwTV model (5) plays an important role for the AwTV-POCS algorithm. Its effect on the image resolution and noise tradeoff was investigated in this study. The image resolution was calculated from the edge spread function (ESF) (a measurement of the broadening of a step edge) along the horizontal profile on the small vertical ellipse which is indicated at the right bottom of figure 20(a). The calculation procedure is based on the descriptions in Wang et al (2006), La Rivière et al (2006), where the edge broadening kernel is assumed as a Gaussian function with standard deviation σ_R, and an error function parameterized by σ_R is used to describe the ESF. By fitting a horizontal profile through the center of the small vertical ellipse to an error function, the parameter σ_R can be obtained. With the similar concept as introduced in the previous section, the FWHM of the fitted Gaussian broadening kernel is calculated by 2.35σ_R, which indicates the resolution of the reconstructed image. In this study, the image noise was calculated from the pixels in a small square ROI, which was selected nearby the small vertical ellipse at the bottom right of figure 20(a). The standard deviation, σ_N, of the local uniform region in the ROI was used as the noise indicator. By varying the weight parameter δ from 0.3 × 10⁻² to 6.0, we can obtain a curve in the coordinates (σ_R, σ_N). Figure 20(b) shows three curves corresponding to the AwTV-POCS reconstructions from 20, 40 and 60 projection views, respectively. The resolution and noise tradeoff improved as the number of projection views increased. This observation concurs with the expectation in general sense, indicating the validity of the plots. For all the three cases of 20, 40 and 60 projection views, the standard deviation σ_N or noise measure of the reconstructed images decreased as δ increased, indicating that the images became 'smoother'. In the meanwhile, the resolution measure σ_R of the reconstructed images also increased as δ increased, indicating that the edges became more 'blurry'. This observation also concurs with the expectation in general sense, further indicating the validity of the plots. A similar evaluation was also performed using the reconstruction results from 63 and 79 projection views, respectively, of the CatPhan® 600 phantom. The corresponding resolution–noise tradeoff curve is shown in figure 21. Thus, according to the tendency of the resolution–noise tradeoff curves, it is possible to obtain an optimal resolution–noise tradeoff in the reconstruction by determining a proper value for δ. In all the simulation and experiment studies, a small value was used as the initial guess for the δ value. Staring from this small value, we increased the value empirically until a proper value δ was obtained, which rendered visual-appearing results. For example, δ = 0.6 × 10⁻² was found in the Shepp–Logan phantom cases, δ = 0.9 × 10⁻² in the CatPhan® 600 phantom cases, and δ = 0.01 in the anthropomorphic head phantom cases. Comparing to the TV-POCS reconstructions, the results from the AwTV-POCS algorithm did not show notable difference when δ > 1.

**Figure 20.** The resolution–noise tradeoff curves from the Shepp–Logan phantom study. Picture (a) shows the modified Shepp–Logan phantom with display window [0, 0.0034] mm⁻¹, where the square at the right-bottom location is the selected ROI, the line on the right-bottom small ellipse indicates the location of the profiles. Graph (b) shows the resolution–noise tradeoff curves from the reconstructed images using different values of δ for the 20, 40 and 60 projection views, respectively.
Download figure:
Standard image

**Figure 21.** The resolution–noise tradeoff curves from the CatPhan® 600 phantom study. Picture (a) shows the CatPhan® 600 phantom with display window [0, 0.024] mm⁻¹, where the square at the left-top location is the selected ROI, the line on the left-top small circle indicates the location of the profiles. Graph (b) shows the resolution–noise tradeoff curves from the reconstructed images using different values of δ for the 63 projection views.
Download figure:
Standard image

3.4. Convergence analysis

The signal-to-noise ratio (SNR) and mean-square-errors (MSE) metrics have been widely used to measure the noise level and image quality for a known signal, respectively. In this study, the convergence performance of the AwTV-POCS and TV-POCS algorithms was documented by calculating the SNR and the MSE versus the iteration steps. The definitions of SNR and MSE are listed as follows:

$\begin{equation} {\rm SNR} = \frac{{\sum\limits_{s,t}^M {\mu _{s,t}^2 } }}{{\sum\limits_{s,t}^M {( {\mu _{s,t} - \hat \mu _{s,t} } )^2 } }}, \end{equation} \tag{ 16 }$

$\begin{equation} {\rm MSE} = \frac{1}{M}\sum\limits_{s,t}^M {(\mu _{s,t} - \hat \mu _{s,t} )^2 } , \end{equation} \tag{ 17 }$

where μ_{s, t} is the true value of the attenuation coefficient at voxel location index (s, t) and $\hat \mu _{s,t}$ is the reconstructed attenuation coefficient at voxel (s, t), M was defined before as the total number of image voxels. Each algorithm was executed up to 1000 iteration steps to ensure its convergence to a stable solution.

Figure 22 shows the SNR and MSE versus the iteration steps for the AwTV-POCS and TV-POCS algorithms, respectively. Graphs in 22(a) and (b) indicate that both the two algorithms converged robustly and reached their stable solutions after around 450 iterations. In addition to the SNR and MRE measures, the stop criterion c_α of (11) was also considered. It dropped below −0.6 after 492 general iteration steps. As shown in graph 22(a), the SNR of the AwTV-POCS reconstructions approached to 38 dB at 1000 iterations, as compared to the 27.5 dB by the TV-POCS algorithm at the same number of iteration steps. This indicates that the AwTV-POCS algorithm can improve the SNR in reconstructions over the TV-POCS algorithm. From the curve of the MSE versus iteration steps, as shown in graph 22(b), it can be observed that the reconstructions of the AwTV-POCS algorithm have a lower MSE level than that of the TV-POCS algorithm, indicating that the reconstructed images by the AwTV-POCS can be more accurate than the results of the TV-POCS algorithm.

4. Discussion and conclusion

In this paper, we introduced a novel adaptive-weighted total variation (AwTV) minimization model for low-dose CT image reconstruction from sparse-view projection measurements. By introducing an anisotropic diffusion-based adaptive weight to preserve the edge information in the conventional TV minimization paradigm, the gain in mitigating the over-smoothing on the edges in the conventional TV minimization was observed by comparing the performance of the presented AwTV-POCS implementation with the established TV-POCS algorithm (Sidky et al 2006, Sidky and Pan 2008).

In the computer simulation studies, the visual comparison via displaying the results of AwTV-POCS, EPTV-POCS and TV-POCS algorithms showed that the AwTV model enabled to reconstruct image satisfactorily without introducing artifacts from 20 projection views in both noise-free and noisy data cases compared to the conventional TV model and the EPTV model. Moreover, it should be noted that as the number of projection views increased to 40 and 60, all the algorithms improved the reconstruction quality compared to the results from 20 projection views. Similar tendency has also been observed in experiment studies (i.e. the CatPhan® 600 and anthropomorphic head phantoms). The reason is that a denser sampling of the data, by increasing the number of projection views, has stronger constraints to the sparse-view reconstruction optimization problem and, therefore, restricts the result much closer to the true image. This observation is consistent with the previous work of Bian et al (2010). In addition, more importantly, the present AwTV model can yield notable gain in preserving the fine structures and edges than the conventional TV model. In addition to the visual inspection, several more quantitative merits were utilized to analyze the differences between the presented AwTV and the conventional TV models. The following conclusions can be drawn from these quantitative measures.

First, using the similar parameters for both TV and AwTV models (except parameter δ, which is only for the AwTV), the FWHM measure indicates that the results from the AwTV-POCS algorithm has higher peak and smaller values in both cold and hot spots as compared to the conventional TV-POCS algorithm. Thus, it could be concluded that the AwTV-POCS algorithm has a higher capability to preserve edge details compared to the conventional TV-POCS algorithm for sparse-view CT image reconstruction.

Second, the resolution–noise tradeoff study showed that the resolution in AwTV-POCS reconstructed images decreases with increasing the value of scale parameter δ. In the meanwhile, the standard deviation of image noise decreases. This observation indicates that a smoother image is obtained with a larger δ value. On the contrary, while decreasing the δ, the resolution and noise level increased, indicating a sharper image being obtained. Based on this observation, it could be concluded that the weight in AwTV model for edge information can give an optimal image quality by a proper value of δ, and the determination of this proper value is currently empirical.

Except the above quantitative measurements, the ROC study using the Shepp–Logan phantom (as shown in appendix A) indicated that both TV-POCS and AwTV-POCA algorithms have similar detection performances when the small lesion's contrast is too low or too high. In the former case both algorithms would certainly fail, while in the latter case both algorithms would certainly succeed. However, when the small lesion contrast is in between, the values of the area under the curve (AUC) of the receiver operating characteristics (ROC) of the AwTV-POCS are statistically significantly higher than that of the TV-POCS. In addition, the bias–variance tradeoff study (as shown in appendix B) indicated that both algorithms have smaller bias and variance values at lower noise levels and their values are very close when the noise level is very low. However, at the same variance level, the AwTV-POCS has less bias than the TV-POCS. Although both the results from the ROC study and bias–variance analysis indicated that the AwTV-POCS algorithm can have higher quantitative capability in its reconstructions than the TV-POCS algorithm, it should be mentioned that more studies should be conducted by using more realistic clinical data than the simulated Shepp–Logan phantom data. For this reason, both the ROC and bias–variance results are presented in the appendices.

Last but not the least, the convergence study showed that both TV-POCS and AwTV-POCS algorithms converged to their stable solutions, respectively, and had similar convergence rates, see figure 22. The converged solution of the AwTV-POCS had higher SNR and less MSE than that of the TV-POCS. Thus, it could be concluded that the AwTV-POCS can reconstruct more accurate images than the TV-POCS.

Based on both the qualitative inspection and quantitative measure of the reconstructions from the AwTV-POCS and TV-POCS algorithms, the gain by incorporating the edge characteristics into the AwTV model is notable. The gain shall be attributed to the AwTV model because both algorithms were implemented similarly in data constraints and numerical calculations, except for the TV and AwTV terms. Thus, it could be conjectured that the AwTV model can gain in different implementations in the case of both the parallel-beam projection geometry and nonparallel-beam projection geometries. In practice, the presented AwTV model can further incorporate different data constraints with associated optimization strategies for different applications. One typical example is to add data statistics into the cost function for penalized-likelihood image reconstruction (Yu and Wang 2009, Ma et al 2011), which will be an interesting topic for further research. In addition, the comparison between the AwTV model and EPTV model by using clinical data is also interested for further research. Besides the model analysis, many novel methods have been proposed to accelerate the convergence for solving similar inverse problems with TV regularization, such as the gradient-projection-Barzilai–Borwen method (Park et al 2012), the accelerated barrier optimization CS method (Niu and Zhu 2012) and the unknown-parameter Nesterov method (Jensen et al 2012). The common idea of these methods is using the single-step gradient calculation to solve the TV regularization problem instead of using the presented two-step alternative optimization scheme. Compared with the TV-POCS and AwTV-POCS algorithms, the single-step method can fast converge to the optimal solution without reducing image quality. However, the step size of the gradient calculation should be carefully designed to balance the convergence speed and accuracy. For example, the well-known 'line-search' scheme could guarantee the monotonic convergence, but it may need an intensive computation. In contrary, although a fixed large step length in the conventional gradient scheme can reduce the computing burden, the outcome could be unacceptable. Thus, proper strategies for parameter estimation are necessary when designing a new algorithm (Park et al 2012, Jensen et al 2012). To apply the proposed AwTV model to clinical data, where the image size is always large because of the need of high resolution, a new algorithm with less computation complexity would be always desired for our further research.

Acknowledgment

This work was partially supported by the NIH/NCI under grant nos CA143111 and CA082402. JM was partially supported by the NSF of China under grant nos 81000613 and 81101046 and the National Key Technologies R&D Program of China under grant no. 2011BAI12B03. The comments on the scientific aspects and the provision of cone-beam phantom data for evaluation studies of this work from Dr Jing Wang is also acknowledged.

Appendix A.: Receiver operating characteristic study

One of the important tasks for medical image analysis is helping the physicians to detect lesions or abnormalities. The ROC curve, which plots the tradeoff between the true-positive (TP) and true-negative scores, is extensively used as a valuable merit to evaluate the diagnostic accuracy for a medical imaging system and/or image reconstruction algorithm. In practice, the ROC curve can be generated by the pairs of the TP fraction and false-positive fraction (Wang et al 2006)⁵ with different confidence thresholds. The most common measure for comparison of the ROC curves is the AUC. An image reconstruction algorithm, which generates a larger AUC, usually has a higher capability for detection of abnormalities.

The human observer is one of the most desired observers, but the procedure needs an experienced physician to manually evaluate each case, which is time consuming for processing a large number of cases. The channelized hotelling observer (CHO) is one of the most efficient numerical observers that can help us to evaluate the algorithms without performing the human observer procedure. In our studies, we utilized the four octave-wide rotationally symmetric frequency channels proposed by Myers and Barrett (1987), which have been shown to give good predictions of a human observer procedure in abnormal detection. In our implementation of the CHO procedure, each reconstructed image generated a four-element feature vector according to the four channels, and the CHO was trained for the AwTV-POCS and TV-POCS algorithms, respectively. A group of scalar rating values were produced from different independent ensemble of the feature vectors of the reconstructed images in two classes of categories (i.e. with or without lesion) by using the CHO_MAT code⁶. The scores were subsequently analyzed using the ROCKIT (see footnote 5) and the AUC values were calculated to document the detection efficiency.

Since a large sample size is needed to perform the ROC study, computer simulation is usually the choice. For the detection task, a low-contrast small lesion of radius 3 mm was simulated as a growth from the big ellipse in the Shepp–Logan phantom as shown in figure 23, where the arrow indicates the lesion. Four intensity contrast levels of the added lesion were considered as 1.5%, 3.0%, 4.5% and 6.5%, respectively, higher than that of the background to evaluate the performance of the detection efficiency for the two reconstruction algorithms, i.e. TV-POCS and AwTV-POCS.

Noise-free projections from the Shepp–Logan phantom of figure 23(a) without and with the lesion at each lesion contrast level of figure 23(b) were first computed as described in section 3.1.1. A total of five sets of noise-free data were computed. One set has no lesion and the other four sets have the lesion with the four different contrast levels in figure 23(b). From each noise-free dataset, a total of 500 noisy realizations were generated using the noise model of section 3-1-2. These noisy sinogram data were then reconstructed by the two algorithms of TV-POCS and AwTV-POCS, respectively. A ROI of 19 × 19 pixel array size on each reconstructed image was selected around the lesion structure as the input of the CHO_MAT code. The series of ratings from the CHO output were subsequently analyzed using the ROCKIT package with bi-normal model. For each contrast level of the lesion, the ROC curves obtained from the two algorithms are shown in figure 24, and the AUC values are listed in table 3.

**Figure 24.** The ROC curves of the two algorithms: AwTV-POCS and TV-POCS. Graph (a) shows the ROCs for the lesion with 1.5% contrast level. Graph (b) shows the ROCs for the lesion with 3% contrast level. Graph (c) shows the ROCs for the lesion with 4.5% contrast level. Graph (d) shows the ROCs for the lesion with 4.5% contrast level.
Download figure:
Standard image

Table 3. The AUC measures and the one-tailed P-values for different lesion contrast levels from the AwTV-POCS and TV-POCS reconstructions. Note N/A in the right lower corner indicates that the value could not be obtained by the ROCKIT package.

Lesion's intensity	AwTV-POCS (AUC)	TV-POCS (AUC)	One-tailed P-value
1.5%	0.6496	0.6264	0.3473
3.0%	0.9301	0.8460	0.0089
4.5%	0.9796	0.8940	0.0033
6.5%	0.9964	0.9711	N/A

From figure 24 and table 3, it can be seen that at 1.5% level, the AUC value from the AwTV-POCS was 0.6496 and from the TV-POCS was 0.6264. The one-tailed P-value was 0.3473 (greater than 0.05), which indicates that the difference between the two algorithms is not statistically significant at the 1.5% contrast level. In other words, both algorithms could not be able to detect the low-contrast lesion effectively. At the higher contrast levels of 3.0% and 4.5%, the AUC values from the AwTV-POCS were 0.9301 and 0.9796, respectively, whereas 0.846 and 0.894 from the TV-POCS. The one-tailed P-values of the two algorithms were 0.0089 and 0.0033, respectively, which are less than 0.05, indicating the difference between these two algorithms is statistically significant at the 3% and 4.5% contrast levels. In other words, the AwTV-POCS can outperform the TV-POCS for the lesion contrast levels at 3.0% and 4.5%. To gain further insight into these two algorithms, we considered the next higher contrast level of 6.5%. At this level, the AUC value of the AwTV-POCS algorithm reached 0.9964, indicating a perfect detection performance, and the value of the TV-POCS algorithm is slightly smaller, i.e. 0.9711. At such high contrast level, both algorithms can detect the lesion successfully, and it is expected that they shall perform similarly.

From the above ROC studies for different lesion contrast levels, it can be observed that the AwTV-POCS can outperform the TV-POCS in detecting small low-contrast lesions because of the modeling of edge properties in the AwTV model. It is expected that both algorithms will perform similarly if the lesion contrast level is too low where both will surely fail, and too high where both will surely succeed. Although the results indicated that the AwTV-POCS has advantages compared to the TV-POCS strategy, more experiments using clinical data are needed in further studies.

Appendix B.: Bias versus variance tradeoff

Another common merit for imaging system evaluation is the bias versus variance tradeoff plot, which is also one of the general figures of merit for evaluating the quality of reconstructed images. The plot describes the strength of the signal in relationship to the quantity of noise.

In this study, we focused on the robustness to different noise levels of the two algorithms in their reconstructions from the 20 projection views, where these two algorithms showed notable difference in the computer simulation studies, see section 2.1. An ROI of 19×19 array size on the uniform image intensity was selected inside the top-middle ellipse, as indicated in figure 25(a). Six different values of I^o_i from 5.0 × 10³ to 2.5 × 10⁶ were selected to simulate noisy data at the corresponding noise levels. At each noise level, 100 noisy data samples were simulated and their reconstructions were performed by the use of the two algorithms, respectively. These reconstructions were then used to calculate the bias and variance. According to the description in Wernick et al (1999), Chen et al (2007), the bias and variance are expressed as follows:

$\begin{equation} {\rm bias}(\mu ) = \frac{{\sum_{s,t \in W}} {| {\bar {\hat {\mu _{s,t}}} - {\mu _{s,t}} } |} } {{\sum_{s,t \in W}} {| {\mu _{s,t} } |} } \end{equation} \tag{ B.1 }$

$\begin{equation} {\rm variance} = \frac{1}{{M_W }}\overline {\sum\limits_{s,t \in W} {\big( {\hat {\mu _{s,t}} - \bar {\hat {\mu _{s,t}}} } \big)} ^2 } , \end{equation} \tag{ B.2 }$

where μ_{s, t} is the true value of the attenuation coefficient at voxel location index (s, t), $\hat \mu _{s,t}$ is the reconstructed attenuation coefficient at voxel (s, t), and $\bar \mu _{s,t}$ is the sample mean from the 100 samples of the resulting images at voxel (s, t). The over bar in (B.1) and (B.2) denotes the mean over the 100 noise realization samples. W is the pixel's index within the ROI and M_W is the number of voxels in the ROI.

Figure 25(b) shows the bias–variance plots of the AwTV-POCS and TV-POCS algorithms. Both algorithms can yield very small bias and variance values at low noise level (approaching to the origin of the plot), indicating that they can reconstruct high-quality images at low noise level for the sparse-signal Shepp–Logan phantom with 20 projection views. When the noise level went up as the incident photon number went down below I^o_i = 1 × 10⁵, some difference between these two algorithms was observed. This observation concurs with the simulation results in section 3.1, indicating the validity of the plots. At the same variance or the same noise level, the images reconstructed by the AwTV-POCS have less bias as compared to the results of the TV-POCS. In other words, the AwTV-POCS can outperform the TV-POCS in terms of the bias–variance plots.

Adaptive-weighted total variation minimization for sparse data toward low-dose x-ray computed tomography image reconstruction

Article metrics

Permissions

Author e-mails

Author affiliations

Author notes

Dates