We investigated whether artificial intelligence (AI)-based denoising halves PET acquisition time in digital PET/CT.
One hundred ninety-five patients referred for [18F]FDG PET/CT were prospectively included. Body PET acquisitions were performed in list mode. Original “PET90” (90 s/bed position) was compared to reconstructed ½-duration PET (45 s/bed position) with and without AI-denoising, “PET45AI and PET45”. Denoising was performed by SubtlePET™ using deep convolutional neural networks. Visual global image quality (IQ) 3-point scores and lesion detectability were evaluated. Lesion maximal and peak standardized uptake values using lean body mass (SULmax and SULpeak), metabolic volumes (MV), and liver SULmean were measured, including both standard and EARL1 (European Association of Nuclear Medicine Research Ltd) compliant SUL. Lesion-to-liver SUL ratios (LLR) and liver coefficients of variation (CVliv) were calculated.
PET45 showed mediocre IQ (scored poor in 8% and moderate in 68%) and lesion concordance rate with PET90 (88.7%). In PET45AI, IQ scores were similar to PET90 (P = 0.80), good in 92% and moderate in 8% for both. The lesion concordance rate between PET90 and PET45AI was 836/856 (97.7%), with 7 lesions (0.8%) only detected in PET90 and 13 (1.5%) exclusively in PET45AI. Lesion EARL1 SULpeak was not significantly different between both PET (P = 0.09). Lesion standard SULpeak, standard and EARL1 SULmax, LLR and CVliv were lower in PET45AI than in PET90 (P < 0.0001), while lesion MV and liver SULmean were higher (P < 0.0001). Good to excellent intraclass correlation coefficients (ICC) between PET90 and PET45AI were observed for lesion SUL and MV (ICC ≥ 0.97) and for liver SULmean (ICC ≥ 0.87).
AI allows [18F]FDG PET duration in digital PET/CT to be halved, while restoring degraded ½-duration PET image quality. Future multicentric studies, including other PET radiopharmaceuticals, are warranted.
This article is part of the Topical Collection on Advanced Image Analyses (Radiomics and Artificial Intelligence)
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Recent research in PET has focused on decreasing noise and increasing signal-to-noise ratios (SNR) . Digital PET with silicon photomultipliers (SiPM) has led to improved timing, energy, spatial resolution, and effective time-of-flight (TOF) sensitivity [2‐5]. This has resulted in faster scanning with less injected activity . However, despite these advances, there is an ever-increasing demand for PET scans, which can contribute to significant delays in scheduling examinations and patient management.
Deep learning (DL), a subdivision of artificial intelligence (AI), has many emerging applications in nuclear medicine [6, 7]. DL is able to increase PET resolution, decrease noise, and thus enhance image quality [8‐12]. It may allow for reducing injected activity, acquisition time, or a combination of both [10, 13‐21].
DL-based denoising can either be associated with PET reconstruction or be used as a post-reconstruction tool. SubtlePET™ (Subtle Medical, Stanford, US, provided by Incepto, France) is a post-reconstruction PET denoising software that has been approved by the Food and Drug Administration and validated by the European Commission for [18F]FDG PET . The algorithm is based on deep convolutional neural networks (DCNN), the most common DL architecture [23, 24]. SubtlePET™ uses multi-slice 2.5D encoder-decoder U-Net DCNN. It takes the pixel’s neighborhood into account to reduce noise and increase image quality. Using a residual learning approach that is optimized for quantitative (L1 norm) and structural similarity (SSIM), it has learned to separate and suppress noise components while preserving non-noise components.
Recently, SubtlePET™ processed [18F]FDG PET images obtained with 33% less injected activity gave similar visual and quantitative performances to native PET in analog PET/CT without time-of-flight (TOF) . Promising results were also reported while reducing reconstructed PET acquisition time by 75% using analog and digital PET/CT (with or without TOF) in a smaller study population with a substantially higher original time-activity PET product . Our group demonstrated the stability of most [18F]FDG PET radiomics features while applying this software without study count reduction .
In this prospective study, we aimed to evaluate the feasibility of halving PET acquisition time in a routine clinical setting by using SubtlePET™ while preserving visual and semi-quantitative PET performances in digital TOF PET/CT.
Materials and methods
One hundred ninety-five adult patients referred to our comprehensive cancer center for initial or follow-up [18F]FDG PET/CT from end-January to end-February 2021 were prospectively included in this study. The only exclusion criterion was a specific acquisition protocol involving a longer acquisition time per bed position on the head and neck or liver areas.
This non-interventional clinical study was approved by the local institutional review board from the François Baclesse Comprehensive Cancer Center and was registered with the French Health Data Hub under reference N° F20210720123322 on 20 January 2021. All patients provided informed consent to the use of their data.
Imaging protocol and processing
All exams were performed in accordance with the EANM imaging guidelines  on a digital SiPM PET/CT (VEREOS, Philips Healthcare). After a 6-h fasting period, patients were injected with 3 MBq/kg [18F]FDG intravenously.
Before each PET scan, a low-dose non-contrast-enhanced CT scan was acquired for attenuation correction and as an anatomical reference. CT scan parameters were 100–140 kV, with variable mAs according to a dose right index of 14 and an iterative reconstruction Idose of 4:64 × 0.625-mm slice collimation, the pitch of 0.83, rotation time 0.5 s, 3D modulation, matrix 512 × 512 and voxel size 0.97 × 0.97 × 3 mm3.
PET acquisition, 1 h post-injection, was recorded in list-mode. Its field comprised at least the skull base to the upper thigh and was extended to total body acquisition if needed. Two PET reconstructions were performed: one for routine clinical purposes using the full acquisition time of 90 s per bed position (“PET90”), and a second one using 45 s per bed position for the purpose of this study (“PET45”). For both reconstructions, we used 3D ordered subset expectation maximization (3D-OSEM) with Point Spread Function (PSF), 4 iterations with 4 subsets (4i4s), a 2 × 2 × 2 mm3 voxel size, and 288 × 288 matrix size. Scatter, attenuation, and random corrections were computed.
PET45 images were processed by SubtlePET™ and are referred to hereafter as “PET45AI.” A fully automatic workflow allowed image transfer as well as denoising. A common and affordable NVIDIA 1080 GPU processor was used for SubtlePET™.
Original blinded PET90 and PET45AI were reviewed side-by-side by five experienced nuclear medicine physicians on a Syngo.via viewing server (version VB 30A, Siemens Healthcare). Each reader interpreted a unique part of the study population (all images per patient) and did not review PET/CT scans they had previously seen in clinical practice.
Readers attributed a global, whole-image quality (IQ) score to each PET series: 1 = poor; 2 = moderate; 3 = good. It was based on global and hepatic image noise and on normal tissue contrast.
All lesions with increased [18F]FDG uptake were notified on each PET series. For each lesion, the readers specified the preferred PET series for detection (related to the contrast-to-background ratio), the supposed nature, i.e. malignant (primary tumor, local recurrence, (nodal) metastasis), benign or indeterminate, and its location.
Additionally, to evaluate the incremental value of AI-based denoising, PET45 was compared to PET90 and PET45AI in 146 patients (due to missing data).
Lesions were independently and semi-automatically segmented on each PET series, using the 50% 3D-isocontour of the maximal pixel value.
In each lesion volume-of-interest (VOI), the following standardized uptake values based on lean body mass (SUL) were measured: SULmax and SULpeak. LBM was estimated using the Janma formula .
The metabolic volume (MV) of the lesion and, when feasible, its short and long axes on the associated CT were calculated.
In addition, the reference liver SULmean with its standard deviation (SD) were collected in a 3 cm-diameter VOI in the right liver lobe, identical for each PET series.
Both standard and EARL1 (European Association of Nuclear Medicine Research Ltd) SUL were analyzed. EARL1 SUL was obtained numerically by Gaussian post-filtering within Syngo.via (EQ.PET filter ), with a full width at half maximum (FWHM) of 7.2 mm for all PET series. Our center is EARL accredited, and we use EARL1 SUL in routine practice for quantification, as it is transposable to different PET cameras and reconstructions .
Lesion-to-liver ratios (LLR) were calculated as SUL/SULmean liver and the coefficient of variation in the liver (CVliv) as SD/ SULmean.
Shapiro–Wilk testing found all quantitative variables (except for denoising processing time and SUL differences) to be non-normally distributed, further expressed by the median and interquartile range (IQR).
IQ scores between two PET series were compared by the Wilcoxon signed-rank test with continuity correction for paired data. Concordance rates of lesion detection between PET90 and PET45AI and between PET90 and PET45 were compared by the chi-squared test. Differences in continuous quantitative variables (semi-quantitative PET measures) between two PET series were statistically analyzed by the Wilcoxon signed-rank test for paired data.
Intraclass correlation coefficients (ICC) between semi-quantitative measures in PET90 and PET 45AI were also calculated, considering 0.5–0.75 as moderate, 0.75–0.9 as good, and > 0.9 as excellent reliability . Absolute differences in SUL between PET series were calculated as SULPET45AI − SULPET90 and relative differences or delta ∆ as (SULPET45(AI) − SULPET90) / SULPET90, and likewise for MV.
Bland Altman plots were used to display absolute SUL differences between PET90 and PET45AI, with Limits of Agreement (LOA) computed as the mean difference ± 1.96 × SD. A logistic uni- and multivariable regression analysis was carried out to look for predictive factors of a decrease of over 10% in SULmax in PET45AI vs PET90.
This decrease threshold of 10% was set by the required accuracy of SUL calibration within 10% for VEREOS PET, according to the AAPM report 126 . Bonferroni correction for statistical significance level was used in univariable logistic regression analysis. Elsewhere, P-values < 0.05 were considered statistically significant. Analyses were conducted with R version 4.0.2.
Patient population and image processing
The main characteristics of the 195 patients included in this study are shown in Table 1.
N = 195
Age (years), median; IQR1
Gender, n (%)
Weight (kg), median; IQR
BMI 2 (kg/m2)
Scan delay (min) 3
Study indication (n (%) of patients)
Characterization (benign vs malignant)
Primary lesion (origin)
Gynecological (except breast)
1IQR, interquartile range between first and third quartile (Q1 and Q3); 2BMI, body mass index; 3 2 patients had a delay < 55 or > 65 min pi.(53 and 70 min); 4 Other primaries: prostate (4), melanoma (3), head-and-neck (3), esophagus and stomach (3), bladder (3), testicle (1), pancreas (1), anus (1), myeloma (1), mesothelioma (1), skin squamous cell carcinoma (1), Merkel cell carcinoma (1). 12 patients had more than one primary; 7 patients had no primary (sum > 195)
All two-fold count reduced PET series (PET45) were successfully treated by the denoising software with a mean processing time of 90 s (min–max; 45–122 s).
Visual analysis: comparison between PET90 and PET45AI
IQ scores were similar between original PET90 and PET45AI (P = 0.80), good (score 3) in 92% (n = 180 vs 179) and moderate (score 2) in 8% (n = 15 vs n = 16) of exams.
Concerning lesion detection, 33 out of 195 patients presented a normal and concordant examination on both PET series. In the remaining 162 patients, a total of 856 lesions were detected.
Of these, 836 lesions were visualized in both original PET90 and denoised PET45AI, resulting in a lesion concordance rate of 97.7%. Seven out of 856 (0.8%) small and low-uptake lesions were detected exclusively on PET90 in 6 patients (Table 2). Thirteen foci (1.5%) were detected only on PET45AI in 10 patients, mostly corresponding to indeterminate liver lesions. An illustration is shown in Fig. 1.
Description of discordant lesions between PET90 and PET45AI
Original PET90 only
There was no per-lesion preferred PET series for detection in 86% of lesions. On the other hand, original PET90 was preferred for 12% and PET45AI for 2%.
Semi-quantitative analysis: PET90 and PET45AI measures
Statistical comparison of standard values
Lesion SULmax, SULpeak, LLR, and CVliv were significantly lower in denoised PET45AI than in original PET90 (P < 0.0001) (Table 3). In contrast, lesion MV and liver SULmean were higher in PET45AI than in PET90 (P < 0.0001). Lesion SUL, MV, LLR, and liver SULmean showed a good-to-excellent correlation between both PET series (≥ 0.873 up to 0.998).
Standard semi-quantitative measures in original PET90 and denoised PET45AI
All values are expressed as median [interquartile range]. ICC, intraclass correlation coefficients between PET90 and PET45AI measures; MV, metabolic volume; CV, coefficient of variation; LLR, lesion-to-liver ratio
Statistical comparison of EARL1values
Lesion EARL1 SULpeak was not significantly different between both PET series (P = 0.09). Otherwise, the comparison of EARL1 SUL and derived measures between PET90 and PET45AI was similar to the comparison of standard measures.
Absolute and relative differences
Bland Altman (Fig. 2) plots show the absolute difference between both PET series in SULmax and SULpeak (both standard and EARL1 measures) for each lesion. The highest mean absolute difference reached − 0.38 g/mL [95% CI − 0.43, − 0.34] for standard SULmax in PET45AI vs original PET90. The other average absolute differences were close to 0.
The mean ± SD relative differences in PET45AI compared to PET90 reached − 9.48 ± 11.50% for standard SULmax, − 3.41 ± 7.17% for standard SULpeak, − 3.74 ± 7.34% for EARL1 SULmax, and − 1.37 ± 5.71% for EARL1 SULpeak of lesions. For liver SULmean, the mean relative difference was + 5.64 ± 4.75% and 5.88 ± 3.93% for standard and EARL1 measures, respectively.
Explanatory factors analysis of differences between PET90 and PET45AI
In visual lesion detection
Table 4 shows lesion characteristics (size and uptake) according to their detectability. Most discordant and preferred lesions had a low-to-moderate uptake and size.
Lesion features according to their detectability in original PET90 and denoised PET45AI
Size on CT (mm)
MV (ml) 1
4.4 [3.1–7.5] (2)
Discordant PET90 only
Discordant PET45AI only
Preferred serie for detection
4.6 [3.1–7.7] (2)
2.7 [2.0–3.6] (2)
2.2 [1.6–2.8] (2)
All measures are displayed as median [interquartile range]; 1MV, metabolic volume. Note that metabolic volumes of small lesions and with low contrast-to-background ratios are less accurate. 2 of original PET90; 3NA, not assessable (no measurable CT lesion)
In lesion SULmax
Multivariable logistic regression analysis indicated two independent predictors of a SULmax decrease of over 10% in PET45AI compared to PET90, namely SULmax in PET45AI (P < 0.0001) and CT long axis (P = 0.01) (Table 5). Supplementary Fig. 3 shows that the smaller the lesion size on CT and the lower the SULmax, the greater the probability of a negative SULmax bias over 10%.
Uni-and multivariable logistic regression analysis for predicting a negative ΔSULmax above 10% in PET45AI compared to PET90
OR [95% CI]
CT long axis
* statistically significant. 1 of PET45AI. OR, odds ratio; BMI, body mass index; MV, metabolic volume; CVliv, coefficient of variation in the liver. CVliv_Ratio = CVliv (PET45AI) / CVliv (PET90). PET45AI SULmax values were used to build a predictive model focusing on the end result, namely denoised and not original PET. However, we obtained the same results with original PET90 SULmax. A negative ΔSULmax above 10% concerned 383 lesions (46%). Few lesions showed an increase above 10% in SULmax (n = 9; 1.0%) on PET45AI vs PET90, not further analyzed
Evaluation of PET45
PET90, PET45, and PET 45AI were compared in 146 patients. Two cases are illustrated in Figs. 3 and 4.
IQ scores were lower in PET45 (median: 2) than in both PET90 and PET45AI (median: 3), P < 0.0001. Poor IQ scores (= 1) were exclusively found in PET45 scans (n = 12; 8%). IQ was scored moderate (= 2) in 99 (68%) PET45 examinations vs in 13 (9%) PET90 and 16 (11%) PET 45AI, the remainder being considered of good image quality.
In this subgroup of patients, the lesion detection concordance rate between PET90 and PET45 was 88.7% (582/656), while that between PET90 and PET45AI was 97.4% (589/605), P < 0.0001. The number of false-positive foci was higher in PET45 (n = 61; 9.3%) than in PET45AI (n = 10; 1.7%), P < 0.0001. Furthermore, 13 (2.0%) false-negative lesions were present in PET45 and 6 (1.0%) in PET45AI, P = 0.15.
Lesion standard SULmax was significantly higher in PET45 than in PET90 (P ≤ 0.0001, with an average ± SD relative bias of + 3.30 ± 10.34%). Lesion standard SULpeak, EARL1 SULpeak and EARL1 SULmax were similar in PET90 and PET45.
CVliv was significantly higher in PET45 (median 18.00; IQR 15.98–21.16%) than in PET90 (12.84; 11.88–14.27%) and than in PET45AI (10.80; 9.68–12.21%), P < 0.0001.
This prospective study shows good visual and semi-quantitative performances of AI-denoised half-count PET compared to original PET in a digital PET/CT. We simulated a two-fold reduction in the PET acquisition time and then applied a commercially available PET denoising software based on U-net DCNN. All PET series were successfully denoised within 2 min in an automatic workflow using a common GPU card. This makes it compatible with routine clinical use. Visually, global image quality scores were similar between PET90 and PET45AI but lower and clinically insufficient in half-count PET45 due to high noise. We obtained few discordances (2.3%) between original PET90 and denoised PET45AI in the absolute detection of 856 lesions.
A total of 0.8% of lesions were detected only on PET90 in 3% of patients. This concerned sub-centimetric or small lesions with a maximum SULmax of 3.1 g/ml. Most of these “original PET90-only or false-negative lesions in PET45AI” were classified as authentically malignant (71%) or indeterminate (29%). Many other concordant malignant lesions were detected in all but one of these patients.
A total of 1.5% of lesions were exclusively visualized on denoised PET45AI in 5% of patients. These “false positives” were predominantly located in the liver and interpreted as indeterminate or benign foci. For most lesions, there was no per-lesion preferred PET series for detection. However, in a minority of lesions (12%), original PET was preferred and less frequently (in 2%) denoised PET. Whether on original or on denoised PET, preferred lesions showed a variable uptake and size, mostly low-to-moderate. More expertise in the reading of these new denoised PET images could further improve the accuracy and comfort of readers.
A higher lesion detection discordance rate (> 10%) was found between PET90 and half-duration PET45 than between PET90 and PET45AI, with particularly additional false positives in PET45. This further renders half-count PET not compatible with routine clinical use. Similar results were observed in , with also a decrease in diagnostic confidence when dividing acquisition time by two.
Comparing semi-quantitative SUL measures in lesions between PET90 and PET45AI, only harmonized EARL1 SULpeak was not significantly different when using the same Gaussian post-filter for both PET series. Standard SULpeak and standard and EARL1 SULmax were lower in denoised PET45AI than in original PET90. The average relative difference remained below 10% for all lesion SUL. Greater SUL biases occurred especially in lesions with a moderate size and uptake and mostly “non-target and non-evaluable lesions” according to PERCIST criteria [34, 35]. In our quantitative study, all lesions were taken into account. The overrepresentation of small, low-uptake lesions negatively affected quantitative differences between both PET series.
On the other hand, SULmean in the reference liver was slightly higher (on average + 6%) in PET45AI than in the original PET90. Its standard deviation and thus its noise levels were lower (on average − 12% for standard CVliv). The decrease in CVliv highlights the denoising efficacy even when dividing study counts by two.
Some other research groups have found even lower SUV biases, despite a higher study count reduction, especially while using CycleGANs as DL architecture [16, 17] or Subtle PET™ (U-net) . However, their studies were performed on different and/or smaller cohorts.
A pilot study of 10 small lung nodules suggested that a fully 3D U-net compared to a 2.5D U- net, as used in our study, may offer better lesion quantitative performance, even though visual image quality was similar . However, 2.5D U-net is useful for routine clinical practice owing to its shorter computational time and lower processing capacity requirement.
Nevertheless, probably more important than these differences in semi-quantitative measures was their correlation between original PET90 and PET45AI, in particular for lesion SUL.
This inter-PET correlation was very high for lesion SUL and MV (with ICCs of at least 0.97) and high for liver SULmean (with ICCs of at least 0.87), testifying to the stability and reliability of these measures obtained after PET count reduction and denoising.
A strength of our study is a large number of lesions of very different sizes, uptake, nature, and location.
Study limitations are the side-by-side reading methodology which could have enhanced the detection accuracy in PET45(AI). Second, the clinical impact of denoised PET has not been properly established. Third, the unlimited lesion number per patient led to a potential statistical bias due to the over-representation of dependent lesions in the same patients. Fourth, the effect of AI-denoising on image artifacts was not studied. A final small drawback is a use of harmonized EARL1 SUL measures, which are still widely used, and not more recent EARL2 values .
Our study thus supports the routine use of Subtle PET™ combined with a two-fold faster PET acquisition.
The benefit of decreasing PET duration, thus reducing waiting time for appointments and helping patients who experience discomfort, outweighs the minor decrease in performance.
Although not properly studied, our findings could also lead to a reduction in injected activity or a combination of both (activity and time). Initially, an Italian group reported a similar performance of Subtle PET™ treated PET with 33% less injected [18F]FDG activity compared to native PET in non-TOF analog PET/CTs .
Further research should be carried out on ways to increase performances, e.g. by optimizing the DL-model and/or adapting acquisition time in liver and regions of interest. Furthermore, large multicentric studies with different PET cameras, reconstruction parameters, and various reductions in [18F]FDG PET acquisition time-activity product are necessary. Striking the optimal balance between performance and time savings is essential. Moreover, research with other PET radiopharmaceuticals is warranted.
This prospective study demonstrates the satisfactory preservation of [18F]FDG PET image quality and quantification when applying AI-based denoising on half-duration PET compared to original full-duration PET. AI restored degraded and clinically insufficient image quality of half-duration PET. It paves the way for a significant reduction in acquisition time and the optimization of PET imaging equipment in routine clinical practice.
We thank Ray Cooke for language editing.
This study was performed in line with the principles of the Declaration of Helsinki and compliance with the French Research Standard MR-004. Approval was granted by the Ethics Committee of Centre François Baclesse and registered (20/1/2021/ N° F20210720123322 on the French Health Data Hub).
Consent to participate
Informed consent was obtained from all individual participants included in the study.
Consent for publication
The authors affirm that human research participants provided informed consent for the publication of the images in Figs. 1, 3, and 4.
The authors declare no competing interests.
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Mit e.Med Radiologie erhalten Sie Zugang zu CME-Fortbildungen des Fachgebietes Radiologie, den Premium-Inhalten der radiologischen Fachzeitschriften, inklusive einer gedruckten Radiologie-Zeitschrift Ihrer Wahl.