Elsevier

Academic Radiology

Volume 25, Issue 4, April 2018, Pages 509-518
Academic Radiology

Original Investigation
Quantitative Measurements Versus Receiver Operating Characteristics and Visual Grading Regression in CT Images Reconstructed with Iterative Reconstruction: A Phantom Study

https://doi.org/10.1016/j.acra.2017.10.020Get rights and content

Rationale and Objectives

This study aimed to evaluate the correlation of quantitative measurements with visual grading regression (VGR) and receiver operating characteristics (ROC) analysis in computed tomography (CT) images reconstructed with iterative reconstruction.

Materials and Methods

CT scans on a liver phantom were performed on CT scanners from GE, Philips, and Toshiba at three dose levels. Images were reconstructed with filtered back projection (FBP) and hybrid iterative techniques (ASiR, iDose, and AIDR 3D of different strengths). Images were visually assessed by five readers using a four- and five-grade ordinal scale for liver low contrast lesions and for 10 image quality criteria. The results were analyzed with ROC and VGR. Standard deviation, signal-to-noise ratios, and contrast-to-noise ratios were measured in the images.

Results

All data were compared to FBP. The results of the quantitative measurements were improved for all algorithms. ROC analysis showed improved lesion detection with ASiR and AIDR and decreased lesion detection with iDose. VGR found improved noise properties for all algorithms, increased sharpness with iDose and AIDR, and decreased artifacts from the spine with AIDR, whereas iDose increased the artifacts from the spine. The contrast in the spine decreased with ASiR and iDose.

Conclusions

Improved quantitative measurements in images reconstructed with iterative reconstruction compared to FBP are not equivalent to improved diagnostic image accuracy.

Introduction

Iterative reconstruction algorithms decrease image noise in computed tomography (CT) images compared to filtered back projection (FBP) 1, 2, 3. In FBP, the image noise is inversely proportional to the square of the radiation dose, but with iterative reconstruction, this relationship is changed. Some iterative algorithms change the image texture, which is shown by the different shape of the noise power spectrum 1, 4. The shape of the noise power spectrum can be dose-dependent (5), and thereby influence the relationship between noise and low contrast resolution. Studies have shown that regardless of vendors' claims of dose reduction because of use of iterative reconstruction, low contrast resolution does not benefit from the same improvement as noise 6, 7, 8, 9, 10.

In addition to noise, spatial resolution may influence the visibility of small low-contrast objects. Some iterative reconstruction algorithms improve spatial resolution 1, 11, 12, but there are also studies that show that the spatial resolution can be degraded 1, 13. Iterative reconstruction can reduce artifacts such as metal artifacts, beam hardening artifacts, and scattering artifacts 14, 15; however, they can also introduce phenomena perceived by viewers as artifacts, like an artificial or blotchy appearance 16, 17.

One advantage of iterative reconstruction is that it is easy to implement different models to correct for irregularities in the reconstruction. However, most commercial algorithms appear to the user as black boxes, not making information about models and corrections available to the users. Differences between the algorithms result in differences in noise power spectrum, as well as spatial resolution and artifact reduction (13). It is therefore important to evaluate all the algorithms to ensure diagnostic acceptable images.

Quantitative measurements as standard deviation (SD) of the noise, signal-to-noise ratios (SNR), and contrast-to-noise ratios (CNR) are often chosen to evaluate image quality because of their effectiveness. These evaluations assume that lower noise and better CNR improve diagnostic efficacy. However, these quantitative measures largely ignore changes in noise properties, texture, and spatial resolution introduced by iterative reconstruction. This may influence the relationship between quantitative measurements and diagnostic effectiveness as lesion conspicuity.

Visual grading experiments can be performed on all types of images. Actual diagnostic information present in the images or image quality properties are evaluated visually, and the analysis can be carried out with visual grading regression (VGR) 18, 19 or visual grading characteristics (20). This gives information about image quality, but may not necessarily give information about diagnostic effectiveness. The assumption is that visibility of pathology is correlated to the visibility of normal anatomic structures. Careful selection of structures or properties to score is crucial to ensure that the analysis is related to the actual diagnostic outcome.

With access to a reliable reference method (“gold standard”), it is possible to evaluate the diagnostic accuracy of an imaging procedure. Receiver operating characteristics (ROC) analysis (21) is often used for analysis in such studies. However, it may be practically difficult to perform ROC analysis. Even when a “gold standard” is available, studies of this kind are time- and cost-consuming, and still the truthfulness may be questioned. In phantom experiments, however, the ground truth may be available by the construction of the phantoms. However, phantom experiments have other limitations like lack of realistic patient appearance, lack of anatomic details, or that the images always have the same background (if only one phantom is used).

Previous studies have explored the relationship between technical and clinical image quality in general x-ray 22, 23 and in CT (24), showing that the correlation is dependent on the imaging task and technical measure. Studies also suggest a nonlinear relationship between technical and visual measures at low doses (25).

The purpose of this study was to evaluate the correlation between quantitative measurements, VGR and ROC, in CT images reconstructed with three different hybrid iterative reconstruction algorithms.

Section snippets

Phantom

The phantom used in this study was an upper abdominal anthropomorphic phantom custom-made for ROC analysis (St. Bartholomew's Hospital, Clinical Physics Group, London EC1A 7BE, UK) (Figure 1, Figure 2) (26). The dimensions were 35 cm (lateral direction), 27 cm (anterior-posterior), and 6 cm (superior-inferior). Four cylindrical inserts, each divided into eight sectors, were placed in the liver. Sixteen of these 32 sectors contained lesions with diameters ranging from 2 to 7 mm (mean 4.4 mm,

Results

Images of the phantom reconstructed with FBP and two strengths of iterative reconstruction at 10 mGy are shown for each vendor in Figure 3. Noise decreases with higher strengths of iterative reconstruction compared to FBP.

All the tested iterative reconstruction algorithms decreased the SD and increased the SNR and the CNR compared to FBP. Significant results are shown in Figure 4. At 10 mGy, images reconstructed with ASiR showed improved SD, SNR, and CNR compared to FBP (Fig 4, first column).

Discussion

The results from this study illustrate the different kinds of answers to image quality-related questions that may be given by the three used methodologies. Although certain findings agree between the methods, there are also discrepancies.

The positive quantitative measurements did not reflect the lesion conspicuity in the phantom liver, which was either improved, degraded, or no change seen compared to FBP. This is also shown in other studies 9, 10. Some of the discrepancies in this study can be

Conclusion

In conclusion, quantitative measurements gave limited information about diagnostic image quality in images reconstructed with iterative reconstruction algorithms compared to images reconstructed with FBP.

References (35)

  • A. Euler et al.

    Impact of model-based iterative reconstruction on low-contrast lesion detection and image quality in abdominal CT: a 12-reader-based comparative phantom study with filtered back projection at different tube voltages

    Eur Radiol

    (2017)
  • A.H. Goenka et al.

    Image noise, CNR, and detectability of low-contrast, low-attenuation liver lesions in a phantom: effects of radiation exposure, phantom size, integrated circuit detector and iterative reconstruction

    Radiology

    (2016)
  • A. Omotayo et al.

    Objective performance assessment of five computed tomography iterative reconstruction algorithms

    J Xray Sci Technol

    (2016)
  • J. Nuyts et al.

    Modelling the physics in iterative reconstruction for transmission computed tomography

    Phys Med Biol

    (2013)
  • E.F. Boas et al.

    CT artifacts: causes and reduction techniques

    Imaging Med

    (2012)
  • A. Kotsenas et al.

    CT metal artifact reduction in the spine: can an iterative reconstruction technique improve visualization?

    AJNR Am J Neuroradiol

    (2015)
  • F.A. Miéville et al.

    Paediatric cardiac CT examinations: impact of the iterative reconstruction method ASIR on image quality—preliminary findings

    Pediatr Radiol

    (2011)
  • View full text