Background
More than half a million people are diagnosed with head and neck squamous cell carcinoma (HNSCC) each year, accounting for nearly 10 % of cancers worldwide [1]. Despite recent progress in treating this disease, a substantial number of patients experience locoregional and/or systemic failure (LRSF) within the first 3 years of definitive therapy [2, 3]. The prognosis with such failure is poor, marked by median overall survival (OS) <1 year [4]. Hence, there is a clinical need for more timely detection of disease relapse or progression, enabling early intervention while disease burden is low.
Functional imaging with 18F-fluorodeoxyglucose (FDG) positron emission tomography/computed tomography (PET/CT) after definitive chemoradiotherapy (CRT) has been investigated as a useful means of detecting residual disease or recurrences earlier [2, 3]. Furthermore, PET/CT imaging has been contemplated as a source of prognostic biomarkers in HNSCC after definitive CRT [5, 6]. Despite the cumulative corroborative data that exist, use of PET/CT for this purpose remains contentious, primarily due to the lack of prospective trials that address the ramifications for patient management.
Salvage surgery is considered the most curative intervention for residual or recurrent disease in the aftermath of definitive CRT [7]. However, given the functional disability that generally results, selecting candidates appropriate for salvage surgery is often difficult. In addition, indications for salvage surgery and the survival benefits thereof are still anecdotal due to a limited body of evidence.
In the course of this study, we evaluated the predictive and prognostic value of PET/CT imaging in the context of immediate locoregional and/or systemic failure (iLRSF) after radical CRT, assessing any related impact on clinical outcomes of salvage surgery.
Methods
Study population
Patients treated at Seoul National University Hospital (SNUH) between January 2005 and January 2013 for locally advanced HNSCC (LA-HNSCC) were reviewed retrospectively. A total of 78 patients whose treatment responses were assessed by whole-body FDG PET/CT scans before and after definitive therapy qualified for the study. Primary sites were oropharynx, hypopharynx, larynx, oral cavity, or nasal cavity. Biopsy-proven squamous carcinomas of unknown origin in cervical lymph nodes were presumed to be head and neck cancers and were included. Patients having more than one measurable lesion according to the Response Evaluation Criteria in Solid Tumors (RECIST) v1.1 were also admissible, and Eastern Cooperative Oncology Group performance status (ECOG PS) 0–2 was required [8]. Staging was stipulated by the American Joint Committee on Cancer (7th edition).
Treatment
The modality of radical CRT was decided through a multidisciplinary approach by the SNUH Head and Neck Cancer Team. Bulky nodal status, higher T- or N-stage, and the possibility of organ preservation after induction chemotherapy influenced the decision-making process [9]. Patients in the IC/CRT group received induction chemotherapy (IC) upfront for two or three cycles every 3 weeks, followed by definitive CRT. Patients in the CRT group were given definitive CRT directly, without IC. IC regimens included docetaxel, cisplatin, 5-fluorouracil, or cetuximab. Radiotherapy was delivered daily, 5 days a week, using 3-dimensional conformal radiotherapy (3D-CRT) or intensity-modulated radiotherapy (IMRT) with cisplatin or cetuximab. On planning computed tomography images, gross tumor volumes at primary sites and metastatic nodes and clinical target volumes for occult tumor spread were delineated. Gross tumor volumes included any documented tumors in primary sites and metastatic lymph nodes with a margin of at least 5 mm. Selection of cervical lymph nodal stations for clinical target volumes took into account clinical stage, location of the primary tumor, and the physician’s discretion. 3D-CRT was delivered using conventional fractionation with a 1.8-Gy daily dose: gross tumor received 70 Gy or higher, while high-risk and low-risk regional nodal stations received 60 Gy and 45 Gy, respectively. For IMRT, a simultaneous integrated boost technique was used to deliver differential daily doses to the various target volumes in 30 daily fractions: 67.5 Gy to gross tumor, and 54 Gy and 48 Gy to high-risk and low-risk clinical target volumes, respectively. To account for setup errors, clinical target volumes were expanded by 3 mm to generate planning target volumes.
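The differential daily doses implied by the IMRT prescription above follow from dividing each total dose by the 30 fractions. The short Python sketch below works through that arithmetic only; the function and variable names are illustrative and are not part of the study's planning workflow.

```python
def dose_per_fraction(total_dose_gy: float, n_fractions: int) -> float:
    """Return the dose per fraction (Gy) for a total dose given in equal fractions."""
    if n_fractions <= 0:
        raise ValueError("n_fractions must be positive")
    return total_dose_gy / n_fractions


# Total IMRT doses (Gy) and fraction count as stated in the text.
# The dictionary keys are illustrative labels only.
IMRT_TOTAL_DOSE_GY = {
    "gross tumor": 67.5,
    "high-risk CTV": 54.0,
    "low-risk CTV": 48.0,
}
N_FRACTIONS = 30

per_fraction = {
    target: dose_per_fraction(total, N_FRACTIONS)
    for target, total in IMRT_TOTAL_DOSE_GY.items()
}

if __name__ == "__main__":
    for target, dose in per_fraction.items():
        print(f"{target}: {dose:.2f} Gy per fraction")
```

Under these figures the gross tumor receives 2.25 Gy per fraction, while the high-risk and low-risk clinical target volumes receive 1.8 Gy and 1.6 Gy per fraction, respectively.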
For second curative attempts after definitive CRT, the multidisciplinary team identified candidates for salvage surgery based on follow-up CT, MRI, PET/CT, and/or biopsy results of suspicious residual lesions. Technical feasibility and preexisting medical conditions were considered as well.
FDG PET/CT studies
Whole-body FDG PET/CT scans were acquired before (baseline PET/CT) and 3.2 ± 1.1 months after definitive therapy (post CRT PET/CT) for early metabolic response evaluations. FDG PET/CT was done using dedicated scanners (Gemini PET/CT: Philips Healthcare, Best, The Netherlands; Biograph 40 or Biograph 64 PET/CT: Siemens Healthcare, Munich, Germany). Patients fasted for at least 6 h before FDG injection. FDG (5.18 MBq/kg) was administered intravenously, and images were acquired approximately 60 min after injection. A CT scan for attenuation correction and anatomic correlation was done first (120 kVp, 50–160 mAs). Whole-body emission scans were obtained from the base of the skull to the proximal thigh for 2 min in the recumbent position. PET images were reconstructed using an iterative algorithm (ordered-subset expectation maximization) on a 256 × 256 matrix.
Standardized uptake values (SUVs) were calculated from the amount of injected FDG activity, body weight, and tissue uptake in the attenuation-corrected regional images as follows: SUV = (activity/unit volume) / (injected dose/body weight). For quantitative assessment of tumor FDG uptake, a spherical volume of interest (VOI) was manually drawn to include the highest radioactivity concentration of the tumor or regional lymph node, using an image analysis software package (Syngo.via; Siemens Healthcare). Maximum SUV (SUVmax) was defined as the highest SUV within the VOI of the tumor or regional lymph nodes.
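As a rough illustration of the SUV formula above, the following Python sketch computes a body-weight-normalized SUV and takes the maximum over a set of voxel values. It is a simplified sketch under stated assumptions, not the Syngo.via implementation: it assumes a tissue density of 1 g/mL so that the ratio is dimensionless, it ignores radioactive decay correction, and the function names and example numbers are hypothetical.

```python
from typing import Iterable


def suv(activity_kbq_per_ml: float, injected_dose_mbq: float, body_weight_kg: float) -> float:
    """SUV = (activity / unit volume) / (injected dose / body weight).

    Assumptions (not from the source): tissue density of 1 g/mL and no
    decay correction. Since 1 MBq/kg equals 1 kBq/g, kBq/mL divided by
    MBq/kg yields a dimensionless value under the density assumption.
    """
    if injected_dose_mbq <= 0 or body_weight_kg <= 0:
        raise ValueError("injected dose and body weight must be positive")
    return activity_kbq_per_ml / (injected_dose_mbq / body_weight_kg)


def suv_max(voxel_activities_kbq_per_ml: Iterable[float],
            injected_dose_mbq: float, body_weight_kg: float) -> float:
    """Highest SUV among the voxels of a volume of interest (VOI)."""
    return max(suv(a, injected_dose_mbq, body_weight_kg)
               for a in voxel_activities_kbq_per_ml)


if __name__ == "__main__":
    # Hypothetical 70-kg patient dosed at 5.18 MBq/kg (362.6 MBq in total).
    voxels = [10.36, 15.54, 22.792]  # hypothetical VOI activities in kBq/mL
    print(round(suv_max(voxels, 362.6, 70.0), 2))
```

With these hypothetical inputs, the hottest voxel (22.792 kBq/mL) corresponds to an SUVmax of about 4.4.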
Response evaluation
Complete physical examinations and all imaging studies, including MRI or CT of the head and neck and PET/CT images, were assessed, as well as any CT studies (chest, abdomen) and brain MRI obtained as indicated by specific symptoms or clinical suspicions. In keeping with our institutional protocol, baseline PET/CT was done in all patients with HNSCC prior to initiation of definitive therapy. To assess response of the primary tumor to CRT, CT of the primary site and neck and/or MRI with contrast were performed in combination with panendoscopy at 4–8 weeks after the end of CRT, as recommended in the National Comprehensive Cancer Network (NCCN) guideline [10]. Most patients underwent post CRT PET/CT 3 months after completion of definitive CRT. In some instances, post CRT PET/CT scans were done earlier or later than 3 months, as dictated by clinical suspicions of residual or recurrent disease. Follow-up imaging was performed after two or three cycles of IC, at 4–8 weeks after the end of CRT, and then every 3–6 months until progression or death. Responses to treatment were evaluated according to RECIST v1.1 [8]. Metabolic tumor response was assessed according to the SUV measurement criteria of the European Organization for Research and Treatment of Cancer [11]. Metabolic complete response (mCR) was defined as complete resolution of FDG uptake in the tumor, such that activity was less intense than that of the liver and indistinguishable from surrounding background blood pool levels.
Outcome measurement
iLRSF was defined as residual disease or locoregional and/or systemic relapse within 6 months after CRT, because in such cases HNSCC was considered to be platinum-refractory [12]. As primary outcome measures, the accuracy of post CRT PET/CT in predicting iLRSF and its prognostic value in terms of OS and progression-free survival (PFS) were evaluated. LRSF beyond 6 months was presumed independent of post CRT PET/CT findings. As a secondary objective, we evaluated the survival benefit of salvage surgery performed on the basis of post CRT PET/CT, measuring OS from the date of diagnosis until death or last follow-up visit (if censored). PFS was calculated from the first day of initial IC or CRT to the date of disease progression (confirmed by imaging or biopsy), death, or last follow-up visit (if censored).
Statistical analysis
Differences in clinicopathologic characteristics according to whether or not patients achieved mCR were tested for significance using the Mann-Whitney test for continuous variables and the chi-square test or Fisher’s exact test for categorical variables. Of the various metabolic parameters, optimal predictive indices and cutpoints were obtained via the Youden index [13]. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were estimated. OS and PFS in all patients and in various subgroups, namely those defined by predictive thresholds, were estimated by the Kaplan-Meier method. Between-group differences in OS and PFS were compared using the log-rank test. All reported P values were two-sided, with statistical significance set at P < 0.05. The above calculations relied on standard software (STATA version 11; StataCorp LP, College Station, Texas, USA).
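To make the cutpoint selection concrete, the following Python sketch shows one common way to choose a threshold by maximizing the Youden index (J = sensitivity + specificity − 1) and then to derive sensitivity, specificity, PPV, and NPV at that threshold. It is a minimal illustration and not the STATA procedure used in the study; the function names and the toy data in the usage example are hypothetical, and the sketch assumes a scan is called positive when the value is at or above the cutpoint.

```python
from typing import Dict, Sequence, Tuple


def confusion_counts(values: Sequence[float], failed: Sequence[bool],
                     cutpoint: float) -> Tuple[int, int, int, int]:
    """Return (tp, fp, tn, fn), calling a case positive when value >= cutpoint."""
    tp = fp = tn = fn = 0
    for value, has_failed in zip(values, failed):
        positive = value >= cutpoint
        if positive and has_failed:
            tp += 1
        elif positive:
            fp += 1
        elif has_failed:
            fn += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def diagnostic_metrics(tp: int, fp: int, tn: int, fn: int) -> Dict[str, float]:
    """Sensitivity, specificity, PPV, and NPV (NaN when a denominator is zero)."""
    def ratio(num: int, den: int) -> float:
        return num / den if den else float("nan")
    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


def youden_cutpoint(values: Sequence[float], failed: Sequence[bool]) -> Tuple[float, float]:
    """Return (cutpoint, J) maximizing J = sensitivity + specificity - 1.

    Every observed value is tried as a candidate cutpoint; ties keep the
    lowest candidate. Both outcome classes must be present.
    """
    if not any(failed) or all(failed):
        raise ValueError("both failure and non-failure cases are required")
    best_cut, best_j = None, float("-inf")
    for candidate in sorted(set(values)):
        m = diagnostic_metrics(*confusion_counts(values, failed, candidate))
        j = m["sensitivity"] + m["specificity"] - 1.0
        if j > best_j:
            best_cut, best_j = candidate, j
    return best_cut, best_j


if __name__ == "__main__":
    # Toy data only: hypothetical SUVmax values and failure labels.
    toy_suv = [2.0, 2.5, 3.0, 3.5, 4.5, 5.0, 6.0, 7.0]
    toy_failed = [False, False, False, False, True, False, True, True]
    cut, j = youden_cutpoint(toy_suv, toy_failed)
    print(cut, round(j, 2))
    print(diagnostic_metrics(*confusion_counts(toy_suv, toy_failed, cut)))
```

In the toy data the selected cutpoint is 4.5 with J = 0.8; these numbers are purely illustrative and unrelated to the study cohort.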
Ethical consideration
The study was approved by the Seoul National University Hospital Institutional Review Board (SNUH IRB) (IRB approval number: H-1307-051-504) and was conducted in accordance with the provisions of the Declaration of Helsinki. The requirement for patient informed consent was waived by the SNUH IRB because of the retrospective design of the study.
Discussion
Our analysis indicates that postSUVmax has value in predicting iLRSF after definitive CRT. High postSUVmax corresponded with poor prognosis, but OS was not significantly altered by early salvage surgery done on the basis of post CRT PET/CT findings.
In patients with HNSCC, recurrence is the dominant cause of treatment failure. Post-treatment follow-up is therefore well integrated into the management of HNSCC. Post-treatment PET/CT is now commonly used to gauge patient response after definitive CRT [14, 15]. Conventional imaging (including contrast-enhanced CT and MRI) has limited ability to distinguish radiation-induced inflammation or fibrosis from residual or recurrent disease. On the other hand, PET/CT has not only improved the precision of initial staging but also yielded significantly better results than CT/MRI for the detection of recurrent HNSCC after CRT [16]. The sensitivity, NPV, and PPV of postSUVmax in our analysis were superior to those of post CRT CT or MRI when predicting iLRSF.
As shown in previous studies, higher SUVmax values on post CRT PET/CT images may predict local recurrence and OS [17–20]. However, a decisive SUV cutpoint, enabling residual cancer to be distinguished from inflammation, has been lacking to date [21, 22]. Herein, we found that a postSUVmax cutpoint of 4.4 served well in predicting iLRSF. Furthermore, postSUVmax also held prognostic value: OS and PFS were poorer at postSUVmax ≥4.4 than at postSUVmax <4.4. Within a 6-month time frame after definitive CRT, postSUVmax ≥4.4 signals the likelihood of recurrent or progressive disease. On the other hand, postSUVmax <4.4 is indicative of inflammation in the aftermath of radiation or chemotherapy.
The PPV we determined for postSUVmax was relatively low (45.0 %) yet compared favorably with the 19–58 % range reported by others [21, 23–25], and it is thought that PPV may be a function of proper timing [15]. According to Schöder et al., post CRT PET/CT should not be performed until 10–12 weeks after treatment ends [15, 22, 26]. At a shorter interval (<4–8 weeks), inflammatory changes related to radiation or chemotherapy are apt to increase false-positive interpretations. Furthermore, small-volume residual disease may escape detection by PET/CT, potentially increasing the number of false-negative readings [27]. In our cohort, 39 patients (50.0 %) viewed as high-risk for immediate failure underwent post CRT PET/CT before 3 months had elapsed, which perhaps explains the relatively low PPV of postSUVmax. Still, the true value of PET/CT imaging in this context is its high NPV. We recorded a remarkably high NPV (98.3 %) for postSUVmax, as did several earlier retrospective studies [21, 22, 26], suggesting that negative PET/CT scans are exceedingly reliable for determining the absence of residual disease. In fact, this strategy has helped reduce post CRT neck dissections by up to 85 % in patients initially treated for bulky nodal disease [28, 29].
If residual/recurrent disease is resectable, salvage surgery is regarded as the standard of care. Nonetheless, even recent advances in extirpative and reconstructive surgical techniques have not diminished the inherent controversies. Many aspects of such surgeries are still of dubious benefit. Although studies by Bachar et al. and Goodwin et al. maintain that salvage surgery controls disease long-term [30, 31], other sources emphasize that these procedures (especially laryngectomy) are associated with high morbidity rates and poor overall or disease-specific survival [32, 33]. Higher rates of complications and impaired quality of life after salvage surgery have the potential to overshadow any theoretical gains [7, 32, 33]. Proper clinical selection of candidates for salvage surgery is therefore paramount. Our decision to perform salvage surgery was accordingly guided by the risk of immediate failure, based on post CRT PET/CT results. The rationale was that patients at high risk of immediate failure would benefit most. Immediate salvage surgery took place in 5 of 20 patients with postSUVmax ≥4.4, four of whom eventually developed iLRSF. Unfortunately, no significant survival benefit was demonstrated when comparing 3-year OS rates with and without aggressive surgical intervention. This disappointing outcome needs to be further addressed.
Our study has a number of acknowledged limitations, the first being its retrospective design. Although a heterogeneous patient population resulted, the multidisciplinary, individualized approach implemented by our institutional head and neck cancer team helped to minimize the consequences. Additionally, the sample size did not confer adequate statistical power, and our index/cutpoint for predicting iLRSF may not be applicable to other institutions where patient population, equipment, and imaging protocols differ. As much as we hope our findings may prove relevant in future research, this particular investigation was never expected to be conclusive. Finally, recurrence or progression of disease in five patients was only confirmed clinically, via imaging diagnostics. This leaves some uncertainty surrounding the potential for residual viable tumor. Of note, salvage surgery, including neck dissection, is not done routinely at our institution, so opportunities for histologic verification are limited. Given that previous reports have shown a relationship between metabolic and pathologic responses [34, 35], we believe that follow-up clinical and imaging data provide reasonable support here for a disease-free state.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
Designing the concept of the study: BK. Provision of study patients and chemotherapy: BK, TMK, DWK, DSH. Provision of study patients and surgery: SKK, JHH, TKK, MWS. Provision of study patients and radiation therapy: JHK, HGW. Imaging analysis: JCP. Data gathering, statistical analysis and interpretation: RK, CYO. Manuscript writing: RK. All authors read and approved the final manuscript.