Introduction
Chronic Obstructive Pulmonary Disease (COPD) is defined as a respiratory disease characterized mainly by chronic airflow limitation that is not fully reversible. It is a leading cause of death and chronic morbidity worldwide, and it represents a major public health problem [
1]. The airflow limitation often presents as dyspnoea and is caused by airway disease (chronic bronchitis) or destruction of lung parenchyma (emphysema). Computed tomography (CT) allows visualization of pathologic changes in the lung parenchyma and classification of patients into different phenotypes according to the presence of bronchitis or emphysema [
2]. CT analysis of lung attenuation is commonly used to quantify the extent of emphysema in the lungs by computing the emphysema score (ES): the percentage of voxels in the lung below a certain Hounsfield Unit (HU). ES is an accepted way of measuring the extent of emphysema and has been proven to correlate well with pulmonary function tests and pathology [
3‐
7]. It is well known that ES values are sensitive to, among other factors, scanner type, dose, slice thickness, and reconstruction filter kernel used [
8]. Previous studies have shown that the chosen filter kernel strongly affects emphysema quantification [
9,
10], where sharper (higher spatial resolution) reconstruction kernels generally result in higher ES than smoother (lower spatial resolution) kernels. As a result, it is impossible to make meaningful comparisons between emphysema quantifications from scans obtained with different parameters. This is an important issue for longitudinal and multi-center studies where it may be difficult or impossible to control scan parameter settings.
The purpose of this retrospective study was to reduce the variability in emphysema quantification by developing a method to normalize scans obtained with different scanners and reconstructed with different kernels, and to evaluate its effectiveness in a cohort of COPD patients from a multi-center study. We hypothesized that normalization of chest CTs with respect to the filter kernel would reduce the variability in emphysema quantification.
Materials and methods
Patient selection
The COPDGene Study is a multi-center observational investigation designed to analyse the genetic and epidemiologic factors associated with COPD [
11]. This study recruited 10,364 subjects from 21 institutions, with inclusion and exclusion criteria as described in [
11]. All subjects underwent spirometry and CT imaging of the chest at full inspiration (TLC) and relaxed expiration (FRC) [
11]. This research protocol has obtained institutional review board approval at every site and written informed consent was provided by all enrolees. For this work, the COPDGene study provided 366 scans from 183 subjects, obtained at one institution with two Siemens scanners, and 372 scans from 186 subjects, acquired at six institutions with three GE scanners. Characteristics of the patients included in our study are provided in Tables
1 and
2.
Table 1
Descriptive statistics for the study population of the Siemens group
N (%) | 12 (6.5) | 9 (5) | 64 (35) | 51 (28) | 35 (19) | 12 (6.5) | 183 |
Age, mean (SD) | 62.1 (10.1) | 63.2 (9.8) | 64.3 (8.0) | 63.1 (9.2) | 64.2 (7.2) | 62.2 (9.6) | 63.6 (8.5) |
Female, N (%) | 5 (42) | 7 (78) | 32 (50) | 29 (57) | 11 (31) | 10 (83) | 94 (51) |
White, N (%) | 11 (92) | 8 (89) | 63 (98) | 51 (100) | 34 (97) | 11 (92) | 178 (97) |
Pack Years, mean (SD) | 37.9 (20.6) | 47.4 (13.4) | 51.6 (22.5) | 49.2 (20.4) | 61.0 (25.5) | 51.5 (32.9) | 51.6 (23.3) |
Current Smoker, N (%) | 2 (17) | 2 (22) | 11 (17) | 12 (24) | 7 (20) | 5 (42) | 39 (21) |
Table 2
Descriptive statistics for the study population of the GE group
N (%) | 83 (44.6) | 12 (6.5) | 45 (24.2) | 17 (9.1) | 3 (1.6) | 26 (14) | 186 |
Age, mean (SD) | 57.2 (8.4) | 66.4 (10.8) | 61.6 (8.1) | 62.9 (5.7) | 66.6 (7.7) | 55.7 (7.9) | 59.3 (8.7) |
Female, N (%) | 46 (55) | 3 (25) | 25 (56) | 10 (59) | 0 (0) | 13 (50) | 97 (52) |
White, N (%) | 50 (60) | 11 (92) | 33 (73) | 10 (59) | 2 (67) | 14 (54) | 120 (64) |
Pack Years, mean (SD) | 32.2 (15.2) | 48.1 (16.5) | 46.3 (23.8) | 54.0 (25.0) | 69.9 (53.8) | 36.7 (17.1) | 39.9 (21.3) |
Current Smoker, N (%) | 42 (51) | 5 (42) | 18 (40) | 6 (35) | 1 (33) | 13 (50) | 85 (46) |
Imaging protocol
Whole-lung volumetric multi-detector CT was obtained with scanners from two different manufacturers: Siemens and GE. All scans were reconstructed applying filtered back projection (FBP) with two kernels: a standard one and a sharp one. For both manufacturers, CT scans were acquired at full inspiration without contrast medium at 120 KVp tube energy and 200 mAs effective dose. The reconstruction field-of-view was configured per patient to encompass the widest diameter of the lungs.
In the Siemens group, a Siemens Definition scanner (n = 122, 64 × 0.6 mm detector configuration, 1.1 pitch) or a Siemens Definition AS+ scanner (n = 61, 128 × 0.6 mm detector configuration, 1.0 pitch) were used. All images were reconstructed with two kernels: b31f (standard) and b45f (sharp), using a reconstruction field-of-view ranging from 260 to 410 mm and a 512 × 512 matrix, yielding a pixel size between 0.5 and 0.8 mm. The slice thickness used was 0.75 mm with an interval of 0.5 mm.
In the GE group, scans were taken using a GE LightSpeed 16 scanner (n = 105, 16 × 0.625 mm detector configuration, 1.375 pitch), a GE LightSpeed VCT scanner (n = 71, 16 × 0.625 mm detector configuration, 1.375 pitch) or a GE LightSpeed Pro 16 scanner (n = 10, 16 × 0.625 mm detector configuration, 1.375 pitch). Scans were reconstructed with STANDARD (standard) and BONE (sharp) kernels. All subjects underwent CT imaging with 0.625 mm slice thickness with an interval of 0.625 mm. The reconstruction field-of-view ranged from 260 to 500 mm and a 512 × 512 matrix, resulting in a pixel size between 0.5 and 1 mm.
Six virtual cohorts were constructed for analysis: four cohorts using scans reconstructed with a single kernel (b31f Siemens (n = 183), b45f Siemens (n = 183), STANDARD GE (n = 186), BONE GE (n = 186)) and two mixed cohorts. The single kernel cohorts allow direct comparison between standard and sharp kernels. They were used to illustrate the variation in emphysema scoring due to the reconstruction kernels, and to demonstrate that normalization reduces this variation while maintaining correlations with lung function parameters. The first mixed cohort contained all scans reconstructed with the standard kernel on the Siemens and GE scanners (n = 369). This mixed cohort represents a typical situation in multi-center studies in which different sites have different scanners and the most similar reconstruction settings are chosen. The second mixed cohort (n = 369) was constructed to contain scans with both standard and sharp kernels from both manufacturers: from the Siemens group, the b31f reconstruction was chosen for 92 randomly selected subjects, and the b45f reconstruction for the remaining 91 subjects. From the GE group, the STANDARD kernel was selected for 93 randomly selected subjects and the BONE kernel for the remaining 93 subjects. This mixed cohort simulates a possible situation in which scans from different studies are combined, or scans are retrospectively collected from different centers with different acquisition protocols.
Pulmonary function tests
Spirometry was performed using an EasyOne spirometer (ndd Medical Technologies, Andover, MA). The mean time between CT imaging and spirometry was 10.45 days (range 0–98).
Emphysema quantification
Quantification of emphysema was performed using CIRRUS Lung 13.08 (
http://cirrus.diagnijmegen.nl, Diagnostic Image Analysis Group, Nijmegen, The Netherlands; Fraunhofer MEVIS, Bremen, Germany). As a first step, the lungs were automatically segmented using methods based on region growing and morphological operations [
12]. All segmentations were visually checked and corrected when needed in CIRRUS Lung. The percentage of lung affected by emphysema was quantified in terms of ES, using -950 HU as an attenuation threshold.
To rule out possible variations in ES due to segmentation differences in data reconstructed with different filter kernels, we segmented the lungs in the image reconstructed with the standard kernel, and used this segmentation for both reconstructions.
Normalization
To reduce the variability in measured ES, the proposed method changes the appearance of data obtained with various reconstruction kernels so that it will have similar characteristics as a chosen reference reconstruction. The rationale behind the proposed normalization is the fact that filter kernels affect spatial resolution and image noise of the reconstructed data: sharper reconstruction kernels will preserve higher spatial frequencies but increase image noise [
8].
The proposed normalization decomposes the scan into frequency bands and alters the energy in each band according to the average energies observed in the set of scans reconstructed with a reference kernel. This will reduce image noise to a similar level. In this study, we selected Siemens b31f as the reference kernel. For a detailed description of the algorithm, see Appendix
1.
Statistical analysis
Statistical analysis was performed using statistical software (IBM SPSS Statistics, Release 20.0; IBM Corp, Armonk, NY). Medians and inter-quartile ranges were computed for the non-normally distributed ES. Differences in ES for two reconstructions of the same scan were computed as ES in scans reconstructed with the sharp kernel minus ES in scans reconstructed with the standard kernel. Mean and standard deviations were computed for the normally distributed differences. The Bland-Altman approach [
13] was used for analysis, with 95 % confidence intervals as limits of agreement.
Spearman correlation coefficients were calculated to evaluate the correlation between ES and forced expiratory volume in 1 second (FEV
1) and FEV
1-to-forced vital capacity (FVC) ratio. The statistical significance of the difference between correlation values was evaluated using the statistical test described by Steiger [
14], using the R statistical analysis package (R Foundation for Statistical Computing, Vienna, Austria (URL
http://www.R-project.org/).
Discussion
The filter kernel used for image reconstruction has a known influence on emphysema quantification on chest CT images [
9,
10], where sharper kernels lead to higher ES than smoother kernels. These effects may have implications in longitudinal or multi-center studies in which different reconstruction parameters are used. From Figs.
1 and
2, it can be observed that changing to a sharper kernel between visits could lead to false interpretations of disease progression in a longitudinal study where reconstruction settings may vary over time; for example, if scanner models and imaging software are replaced by new models. Furthermore, in the case of a multi-center study such as COPDGene, in which CT image data is obtained in various centers using several reconstruction algorithms and scanner manufacturers, these effects hamper the possibility to compare data obtained from different locations. Even though standardized imaging protocols are often used, exactly corresponding settings do not exist between manufacturers. The method presented in this paper reduces the variability in emphysema measurements due to the use of different reconstruction kernels and scanner manufacturers. The method therefore improves comparison of CT image data in settings in which reconstruction parameters are not homogeneous and can no longer be changed retrospectively. Figures
1 and
2 provide an illustration of the normalization effect in the CT data and in the emphysema mask. It can be observed that, after normalization, not only the ES values, but also the emphysema pattern, become much more similar.
The proposed normalization does not require knowledge of the proprietary reconstruction algorithms used. A set of energy coefficients is obtained by analyzing the energy in different frequency bands in a set of scans reconstructed with the desired reference kernel. These energy coefficients are then applied to the current scan to create a new normalized reconstruction. Thus, for every new scan received, regardless of the kernel or manufacturer used to reconstruct it, the same coefficients are used to create the normalized image. This makes the method very flexible, since there is no need to recompute the energy coefficients or have any knowledge of the filter kernel. In this work, we used the Siemens b31f kernel as the reference one, and we applied the energy coefficients computed in the reference to all the scans, independent of the kernel or manufacturer used to create the CT image.
Schaller et al. presented a technique to simulate smoother kernels by using a Gaussian filter to approximate the ratio between the smooth (desired) and sharp (original) kernel [
15]. This method needs information about the kernel function, which is usually not publicly available. Ohkubo et al. presented an image filtering technique to create the image for a desired kernel using an existing CT reconstructed with a different kernel [
16]. This image filtering uses the modulation transfer functions (MTFs) of the current kernel and the desired one. Contrary to the proposed method, the technique of Ohkubo et al. requires prior knowledge of the MTF of each kernel that may be used. The effect of the methods proposed by Schaller et al. and Ohkubo et al. on emphysema quantification has not been assessed. Bartel et al. proposed a method for equating emphysema measurements using a mathematical model to convert values between different kernels, requiring the construction of a model for every kernel separately [
17]. The technique calculates an equivalent ES, but the effect of the model cannot be visually observed in the scan. Furthermore, all these methods need to be re-calibrated for every new desired kernel.
In order to ensure the validity of the normalized ES measurements, correlation coefficients between ES and lung function parameters were calculated. As shown in Table
3, for the single kernel cohorts, there are no significant differences before and after normalization, indicating that the normalization does not change the correlations with lung function parameters as expected. The correlations after normalization are slightly higher for each cohort. However, the Bland-Altman plots in Figs.
3 and
4 show that the difference in emphysema quantification between the different reconstructions of the same scans is greatly reduced by the normalization.
The first mixed cohort was obtained from a multi-center study in which standardized imaging protocols were well controlled by selecting the most similar reconstruction kernels between vendors. Even in this well-controlled setting, there is variation in emphysema measurements, because exactly corresponding acquisition parameters do not exist across vendors. There was a significant correlation between emphysema quantification and lung function parameters. However, applying the normalization significantly improved this correlation. This suggests that the use of the normalization method presented is beneficial in multi-center studies, even when the acquisition parameters are well controlled.
The second mixed cohort was constructed to simulate a less well-controlled study, e.g., when different studies with different acquisition protocols are retrospectively combined. As expected, the correlation with lung function parameters in this cohort was lower than in the well-controlled setting. Applying the normalization to this cohort not only significantly improved this correlation, but it resulted in correlation coefficients similar to those of the controlled mixed cohort. This allowed us to compare emphysema quantifications obtained from data acquired with different parameters.
From the results in Table
3, it can be observed that the correlations between emphysema quantification and spirometry are higher for the mixed cohorts than for each individual cohort. This can be explained by the difference in emphysema severity in both cohorts; the Siemens cohort contains mostly subjects with mild to severe COPD, while the GE group includes mostly subjects with no or mild COPD. Combining the cohorts includes a more complete range of COPD and emphysema severity, and therefore increases the correlation with lung function parameters.
Although the study provides promising results, it also has limitations. We validated emphysema measurements against lung function parameters, but not against histopathological references. The latter would involve subjects undergoing lobe or lung resection, which is not a feasible option in our study. Therefore, since CT quantified emphysema has been shown to correlate both with histopathological reference standards [
3‐
5,
7] and pulmonary function testing [
6,
7], we chose to compare emphysema measurements with lung function parameters.
In the current study, the normalization was evaluated in scans reconstructed with two kernels from two vendors. However, these results suggest that this idea can be extended to more reconstruction kernels from the same scanner models, and can be applied to kernels from other scanner manufacturers. This normalization method should be evaluated in a future study including data reconstructed with different kernels and scanner manufacturers.
Furthermore, in this study we have analysed only filtered back projection algorithms, but ES values have been shown to vary, depending upon the choice of filtered back projection or iterative reconstruction algorithms [
18].
In conclusion, the lack of standards that guarantee the validity of CT measurements performed with different technical parameters makes it difficult to compare data obtained with different reconstruction settings. The proposed method may be a feasible solution to overcome this issue. It requires no prior knowledge about the filter kernels used, and allows one to obtain more reliable results that are independent of the reconstruction parameters chosen. Normalization of chest CT data reduces variation in emphysema quantification due to different reconstruction filters and scanner manufacturers, and improves correlation of emphysema quantifications and spirometry in data obtained with varying reconstruction settings.
Acknowledgments
The scientific guarantor of this publication is Eva van Rikxoort. The authors of this manuscript declare relationships with the following companies: MeVis Medical Solutions AG, Bremen, Germany; Thirona, Nijmegen, The Netherlands. This study has received funding by scientific grants from the Dutch Organization for Scientific Research (NWO) (VENI 016.128.007, VICI 016.130.326). No complex statistical methods were necessary for this paper. Institutional Review Board approval was obtained.
Written informed consent was obtained from all subjects (patients) in this study. Some study subjects or cohorts have been previously reported. There have been a number of publications using data from the COPDGene database for different purposes. However, none of them tries to develop a method to reduce the variability in emphysema quantifications obtained from CT scans reconstructed with different filter kernels. Some examples of publications using subjects from the COPDGene study for imaging research are: 1. Sieren JP, Newell JD, Judy PF, Lynch DA, Chan KS, Guo J, Hoffman EA (2012) Reference standard and statistical model for intersite and temporal comparisons of CT attenuation in a multicenter quantitative lung study. Med Phys 39: 5757–5767. 2. Ginsburg SB, Lynch DA, Bowler RP, Schroeder JD (2012) Automated texture-based quantification of centrilobular nodularity and centrilobular emphysema in chest CT images. Acad Radiol 19:1241-1251.
Methodology: retrospective, experimental, multicenter study.