Feasibility of PET/CT system performance harmonisation for quantitative multicentre 89Zr studies

Kaalep, Andres; Huisman, Marc; Sera, Terez; Vugts, Danielle; Boellaard, Ronald

doi:10.1186/s40658-018-0226-7

Short communication
Open access
Published: 21 November 2018

Feasibility of PET/CT system performance harmonisation for quantitative multicentre ⁸⁹Zr studies

Andres Kaalep ORCID: orcid.org/0000-0002-5593-8886¹,
Marc Huisman²,
Terez Sera^3,5,
Danielle Vugts² &
Ronald Boellaard^2,4,
on behalf of EARL,
EATRIS &
the TRISTAN Consortium (#IB4SD-116106)

EJNMMI Physics volume 5, Article number: 26 (2018) Cite this article

1971 Accesses
17 Citations
1 Altmetric
Metrics details

Abstract

Purpose

The aim of this study was to investigate the variability in quantitative performance and feasibility of quantitative harmonisation in ⁸⁹Zr PET/CT imaging.

Methods

Eight EANM EARL-accredited (Kaalep A et al., Eur J Nucl Med Mol Imaging 45:412–22, 2018) PET/CT systems were investigated using phantom acquisitions of uniform and NEMA NU2-2007 body phantoms. The phantoms were filled according to EANM EARL guidelines for [¹⁸F]FDG, but [¹⁸F]FDG solution was replaced by a ⁸⁹Zr calibration mixture. For each system, standard uptake value (SUV) accuracy and recovery coefficients (RC) using SUVmean, SUVmax and SUVpeak metrics were determined.

Results

All eight investigated systems demonstrated similarly shaped RC curves, and five of them exhibited closely aligning recoveries when SUV bias correction was applied. From the evaluated metrics, SUVpeak was found to be least sensitive to noise and reconstruction differences among different systems.

Conclusions

Harmonisation of PET/CT scanners for quantitative ⁸⁹Zr studies is feasible when proper scanner-dose calibrator cross-calibration and harmonised image reconstruction procedures are followed. An accreditation programme for PET/CT scanners would facilitate multicentre ⁸⁹Zr quantitative studies.

Introduction

The use of radiolabelled antibodies for diagnostic and therapeutic purposes has been going on for more than 50 years [1]. Their application as imaging probes in positron emission tomography (PET) combines the high sensitivity of PET with the high antigen specificity of monoclonal antibodies [2]. ⁸⁹Zr-based tracers are becoming widespread with increasingly available supply, advances in radiochemistry and successful pilot studies in humans. However, multicentre studies using ¹⁸F-labelled tracers have demonstrated the need for standardisation of image acquisition, reconstruction, and analysis procedures and international harmonisation programmes such as EANM and EARL aim to facilitate the use of FDG PET as a quantitative imaging biomarker [3, 4]. A detailed discussion on ⁸⁹Zr physics in PET has been published by Conti et al. [5].

The aim of this study was to investigate the variability in quantitative performance and feasibility of quantitative harmonisation in ⁸⁹Zr PET/CT imaging.

Materials and methods

Investigated systems and phantom experiments

Eight PET/CT systems (system 1–8), calibrated according to the manufacturer’s instructions, while also participating and accredited in the EANM/EARL [¹⁸F]FDG PET/CT accreditation programme, were selected for this study. The investigated systems were two General Electric Discovery 690, two General Electric Discovery 710, one Siemens Biograph 40 mCT, one Siemens Biograph 64 mCT, one Siemens Somatom Definition AS mCT and one Philips Ingenuity TF.

Two phantom experiments were carried out in accordance with EANM/EARL guidelines—Calibration QC and NEMA Phantom QC—where [¹⁸F]FDG was substituted with a ⁸⁹Zr calibration sample. In the first experiment, a uniform cylindrical phantom was filled with a solution containing 8–12 kBq/mL of ⁸⁹Zr. In the second experiment, the NEMA NU2-2007 body phantom background compartment and spheres were filled with a ⁸⁹Zr solution of 2 kBq/mL and 20 kBq/mL, respectively, so as a 10:1 sphere to background ratio can be achieved (Fig. 1). Exact amount of ⁸⁹Zr activity was measured for each scan using only local dose calibrators, which had not underwent specific cross-calibration for ⁸⁹Zr. In both experiments, the phantoms underwent a low-dose CT acquisition followed by PET acquisition of two consecutive bed positions of 5 min each. Images were reconstructed using EARL-compliant parameters routinely used by the corresponding sites for [¹⁸F]FDG quantitative imaging (Table 1).

Table 1 Reconstruction settings

Full size table

Data analysis

Reconstructed DICOM images were analysed using the EARL semi-automatic tool [3, 6] designed for quantitative analysis of images of uniform and NEMA NU2-2007 body phantoms. From the uniform phantom and the NEMA body phantom’s uniform background compartment, SUV accuracies for each system were determined. From the NEMA body phantom experiments, recovery coefficients (RC) were calculated as a function of sphere sizes, defined as ratio of activity concentration estimated from PET images to the expected activity concentration measured by dose calibrator. Different RC metric values were calculated based on 50% background-corrected isocontour VOI (SUVmean), maximum voxel value included in VOI (SUVmax) and spherical VOI with a diameter of 12 mm, positioned so as to yield the highest uptake (SUVpeak) [6,7,8]. Using data from the EARL database, relevant FDG RC curves of the corresponding scanners are displayed as a reference.

Additionally, RC curves were rescaled to correct for a global SUV bias, derived from the phantom’s background compartment, to mitigate the impact of cross-calibration error between PET/CT system and dose calibrator on the observed RC. In order to directly compare the RC curves’ shapes of all systems, the individual recovery coefficients of the NEMA body phantom spheres were normalised to the recovery coefficient of the largest (37 mm) sphere.

Results

The SUV bias from both phantom experiments is presented in Fig. 2. The results for SUVmean, SUVmax and SUVpeak together with corresponding information for EARL [¹⁸F]FDG can be seen in Fig. 3, while the results corrected for SUV bias calculated from the body phantom background are presented in Fig. 4. Figure 5 demonstrates the RC curves normalised to the largest 37-mm sphere recovery.

Discussion

In order to remain in the optimal measurement range of the dose calibrators, the ⁸⁹Zr activity used in the study was similar to what is injected to a patient in clinical practice, resulting in significantly higher activity concentrations in the phantoms compared to patients (due to the smaller phantom volumes). However, lower counts are expected to further increase the variability of the results and may have hampered comparing recoveries between systems, with current EARL specifications and with those seen with ¹⁸F. For clinical studies, low count rates potentially induce an upward bias when SUVmax is used. To mitigate this upward bias, SUVpeak is an alternative, which is less sensitive to scanner variation and image noise, and might therefore be the optimal metric to assess tracer uptake for ⁸⁹Zr. Consequently, in our phantom study, we included SUVpeak as well.

In addition to verifying the results of a recent study by Makris et al. [9], current study investigated the real-life scenario of using only local dose calibrators for ⁸⁹Zr measurement as well as by asking sites to perform the experiments themselves using the provided manuals and instructions. Out of the eight systems investigated in total, four Calibration QC and three NEMA Phantom QC experiments demonstrate a SUV bias of > 10% (Fig. 2). Since the scanners are EARL accredited for [¹⁸F]FDG-PET/CT, they comply with accreditation specifications for SUV bias (≤ 10%); it is therefore believed that the large global errors are due to inaccurate cross-calibration between the scanners and dose calibrators used to measure the ⁸⁹Zr solution activity on site. While each of the dose calibrators should be set up by the manufacturer to accurately measure ⁸⁹Zr, the results from our study underline the importance of a traceable calibration performance of dose calibrators used in ⁸⁹Zr quantitative PET/CT imaging.

From Fig. 2, it can be seen that SUV bias values derived from Calibration QC and NEMA Phantom QC background agree reasonably well, with the exception of only system 3 and to some extent system 4. These inconsistencies as well as the variable bias in RC curves (Fig. 3) are suggested to be related to activity measurement and phantom filling procedures on site.

The initial RC curves derived from the images (Fig. 3, a–c) demonstrate increased spread compared to the background-corrected ones (Fig. 4). After applying the SUV bias correction, the RC values of five systems show good alignment with each other and with EANM specifications for [¹⁸F]FDG. Two of the investigated systems (1 and 7) remain out of specifications even after correcting for SUV bias. The reason for this is unknown and would need further investigation. RC curves normalised to the largest (37 mm) sphere (Fig. 5) demonstrate similar shapes of RC curves for all investigated systems. This would suggest that, with further adjustment—meaning reduction of global SUV bias based on Calibration QC experiment data and possibly minor adjustment of the image reconstruction parameters—all of the systems should be able to achieve harmonisation.

The closest alignment of the RC curves can be observed when SUVpeak is used. This demonstrates the potential of this metric being used when quantitative harmonisation is desired. It should however be noted that with the use of SUVpeak, one should expect a decrease in overall contrast recovery, compared to SUVmax.

Finally, with the shape of ⁸⁹Zr RC curves shown to be similar to ¹⁸F in our pilot study, further harmonisation efforts could be focused on the cross-calibration of the dose calibrators, which is considered to be the largest source of uncertainty in this case. A future ⁸⁹Zr harmonisation scheme could therefore be based on a ⁸⁹Zr dose calibrator cross-calibration quality control, with a successful site [18F]FDG EARL accreditation being a prerequisite.

Conclusions

All eight investigated systems demonstrated similarly shaped RC curves, and five of them exhibited close alignment when SUV bias correction was applied. Use of SUVpeak as a metric, which proved to be the least sensitive to noise and reconstruction differences among systems, is strongly recommended for multicentre quantitative ⁸⁹Zr studies. When PET/CT and dose calibrator cross-calibration procedures are closely followed and the image reconstruction parameters adjusted, the quantitative harmonisation of scanners for ⁸⁹Zr PET studies is feasible. Yet, our results demonstrate the urgent need to set up a suitable cross-calibration and accreditation programme to facilitate multicentre ⁸⁹Zr quantitative studies.

References

McCardle RJ, Harper PV, Spar IL, Bale WF, Andros G, Jiminez F. Studies with iodine-131-labeled antibody to human fibrinogen for diagnosis and therapy of tumors. J. Nucl. Med. 1966;7:837–47. Available from: http://jnm.snmjournals.org/content/7/11/837.short
PubMed CAS Google Scholar
Zhang Y, Hong H, Cai W. PET tracers based on zirconium-89. Curr Radiopharm. 2011;4:131–9.
Article PubMed PubMed Central Google Scholar
Kaalep A, Sera T, Oyen W, Krause BJ, Chiti A, Liu Y, et al. EANM/EARL FDG-PET/CT accreditation - summary results from the first 200 accredited imaging systems. Eur J Nucl Med Mol Imaging. 2018;45:412–22
Aide N, Lasnon C, Veit-Haibach P, Sera T, Sattler B, Boellaard R. EANM/EARL harmonization strategies in PET quantification: from daily practice to multicentre oncological studies. Eur J Nucl Med Mol Imaging. 2017;
Conti M, Eriksson L. Physics of pure and non-pure positron emitters for PET: a review and a discussion. EJNMMI Phys. 2016:3. Available from: https://doi.org/10.1186/s40658-016-0144-5
Boellaard R, Delgado-Bolton R, Oyen WJG, Giammarile F, Tatsch K, Eschner W, et al. FDG PET/CT: EANM procedure guidelines for tumour imaging: version 2.0. Eur. J. Nucl. Med. Mol. Imaging. 2014;42:328–54.
Article PubMed PubMed Central CAS Google Scholar
Kinahan P, Wahl R, Shao L, Frank R, Perlman E. The QIBA profile for quantitative FDG-PET/CT oncology imaging. J. Nucl. Med. 2014;55:1520.
Google Scholar
Lodge MA, Chaudhry MA, Wahl RL. Noise considerations for PET quantification using maximum and peak standardized uptake value. J Nucl Med. 2012;53:1041–7.
Article PubMed PubMed Central CAS Google Scholar
Makris NE, Boellaard R, Visser EP, de Jong JR, Vanderlinden B, Wierts R, et al. Multicenter harmonization of 89Zr PET/CT performance. J Nucl Med. 2014;55:264–7. Available from: http://jnm.snmjournals.org/cgi/doi/10.2967/jnumed.113.130112
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We would like to thank the EARL sites that provided data for the experiments, namely:

University Hospital Olomouc, Olomouc, Czech Republic

Academisch Ziekenhuis Groeninge Hospital, Kortrijk, Belgium

University College London, London, UK

Turku PET Centre, Turku, Finland

VU University Medical Centre, Amsterdam, The Netherlands

Radboud University Medical Center, Nijmegen, The Netherlands

University Hospital of Navarra, Pamplona, Spain

Funding

The research leading to these results received funding from the Innovative Medicines Initiatives 2 Joint Undertaking under grant agreement no. 116106. This Joint Undertaking receives support from the European Union’s Horizon 2020 research and innovation programme and EFPIA.

Availability of data and materials

Data supporting the findings reported can be found at Zenodo.

Author information

Authors and Affiliations

Department of Medical Technology, North Estonia Medical Centre Foundation, J. Sutiste Str 19, 13419, Tallinn, Estonia
Andres Kaalep
Department of Radiology and Nuclear Medicine, VU University Medical Center, Amsterdam, The Netherlands
Marc Huisman, Danielle Vugts & Ronald Boellaard
Department of Nuclear Medicine, University of Szeged, Szeged, Hungary
Terez Sera
Department of Nuclear Medicine and Molecular Imaging, University of Groningen, University Medical Center Groningen, Hanzeplein 1, Groningen, The Netherlands
Ronald Boellaard
EANM Research Limited (EARL), Vienna, Austria
Terez Sera

Authors

Andres Kaalep
View author publications
You can also search for this author in PubMed Google Scholar
Marc Huisman
View author publications
You can also search for this author in PubMed Google Scholar
Terez Sera
View author publications
You can also search for this author in PubMed Google Scholar
Danielle Vugts
View author publications
You can also search for this author in PubMed Google Scholar
Ronald Boellaard
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

on behalf of EARL

EATRIS

the TRISTAN Consortium (#IB4SD-116106)

Contributions

RB conceived of the study, participated in data collection, study design and coordination and helped to draft the manuscript. TS helped to draft the manuscript. AK analysed the acquired data and drafted the manuscript. MH participated in the data collection and analysis. DV participated in the data collection. All authors provided critical review and approved the final manuscript.

Corresponding authors

Correspondence to Andres Kaalep or Ronald Boellaard.

Ethics declarations

Ethics approval and consent to participate

This article does not contain any studies with human participants or animals performed by any of the authors.

Consent for publication

Not applicable.

Competing interests

Andres Kaalep declares that he has no conflict of interest. Marc Huisman declares that he has no conflict of interest. Terez Sera is a member of the EARL scientific advisory board and has received travel grants and honoraria from EARL. Danielle Vugts declares that she has no conflict of interest. Ronald Boellaard is an unpaid member of the EARL scientific advisory board.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Kaalep, A., Huisman, M., Sera, T. et al. Feasibility of PET/CT system performance harmonisation for quantitative multicentre ⁸⁹Zr studies. EJNMMI Phys 5, 26 (2018). https://doi.org/10.1186/s40658-018-0226-7

Download citation

Received: 25 April 2018
Accepted: 02 August 2018
Published: 21 November 2018
DOI: https://doi.org/10.1186/s40658-018-0226-7

Feasibility of PET/CT system performance harmonisation for quantitative multicentre 89Zr studies

Abstract

Purpose

Methods

Results

Conclusions

Introduction

Materials and methods

Investigated systems and phantom experiments

Data analysis

Results

Discussion

Conclusions

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Consortia

on behalf of EARL

EATRIS

the TRISTAN Consortium (#IB4SD-116106)

Contributions

Corresponding authors

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Feasibility of PET/CT system performance harmonisation for quantitative multicentre ⁸⁹Zr studies