Background
Lung cancer is the leading cause of cancer death and the second most frequently diagnosed cancer [
1]. The TNM classification is accepted as the standard for therapy stratification [
2]. Tumor staging based on the TNM classification is also known to be a strong predictor of prognosis [
2]. Age, race, gender, tumor size, histology, and grade have also been shown to be independent predictors of survival [
3]. Initial staging in patients with newly diagnosed NSCLC is needed to select the most appropriate therapeutic strategy and to determine prognosis. Combined positron emission tomography/computed tomography (PET/CT) using the tracer F-18-fluorodeoxyglucose (FDG) has been reported to be superior to conventional imaging modalities including CT and MRI in cancer staging especially for detection of nodal and metastatic site involvement [
4]. By providing metabolic tumor characterization beyond clinical and structural information [
5], FDG-PET has the potential to contribute independently to improved prediction of responsiveness or resistance to a specific treatment associated with in-vivo tumor biology.
Several studies suggest that high FDG uptake in the primary tumor at initial staging, mainly characterized by standardized uptake value (SUV), is associated with worse outcome in patients suffering from NSCLC [
6‐
9]. Other studies propose the metabolic tumor volume (MTV) as a prognostic factor of disease recurrence and survival [
10]. However, there are also studies in which the prognostic value of both these measures at initial staging is found to be unsatisfactory in NSCLC [
11‐
14].
There is increasing recognition that the heterogeneity of pretherapeutic FDG uptake in the primary tumor can provide predictive information in several solid tumors [
15,
16]. Quantifying the heterogeneity of FDG uptake appears promising for the prediction of therapy outcome as it might reflect the biological variability causing this intratumoral heterogeneity [
17].
An increasing number of different measures have been proposed to quantify the voxel-wise or shape heterogeneity of tracer uptake [
16,
18‐
20]. Some previous studies show encouraging results in prediction of treatment outcome from pretherapeutic FDG-PET, based on heterogeneity of uptake characterized by textural features in different carcinomas [
16,
20]. Heterogeneity of FDG uptake, as measured by textural features, was shown to be a predictive factor also in patients with NSCLC [
21,
22]. A measure of shape irregularity of the tumor’s FDG uptake was proposed by Eary et al. and has been shown to be associated with overall survival in certain types of sarcoma [
18].
We were able to show that ‘asphericity’ (ASP), as a parameter for quantification of the spatial irregularity of FDG uptake, is a promising prognostic factor for tumor progression and outcome in patients with primary head and neck cancer [
15]. The aim of this study was to evaluate the independent prognostic value of ASP in patients with NSCLC prior to therapy with respect to progression-free (PFS) and overall survival (OAS) in addition to conventional quantitative PET parameters such as SUVmax, SUVmean, metabolic tumor volume (MTV) and total lesion glycolysis (TLG) as well as relevant clinical parameters. Additionally, we compared ASP in terms of prognostic significance to another, previously described measure for quantitative characterization of shape irregularity of FDG uptake, the so-called “solidity” [
19].
Discussion
High heterogeneity of tumors with respect to various biological parameters is known to be associated with aggressive tumor behavior, response to therapy and survival in a number of cancer types [
17,
25]. This is the rationale for the quantitative evaluation of the heterogeneity of the FDG-PET uptake in tumor lesions, which is assumed to capture the heterogeneity of tumor biology to some extent, although the exact relationship has not yet been fully elucidated. FDG-PET based heterogeneity measures have been found to be superior to tumor volume and conventional FDG-PET based measures including SUVs, MTV and TLG for various indications [
16,
18,
19].
In the present study we have demonstrated that relevant improvement of outcome prediction in patients with NSCLC treated with curative intent can be achieved using ASP, a novel parameter for quantitative characterization of the shape irregularity of the FDG uptake in the primary tumor. In curatively treated patients, binarized ASP was an independent significant prognostic factor for both PFS (HR, 3.4; p = 0.001) and OAS (HR, 2.97; p = 0.03) as well as ‘primary surgical treatment’ (PFS: HR, 2.09; p = 0.05 and OAS: HR, 3.78; p = 0.01). The probability of 2-years PFS decreased from 52% in the patients with low ASP (≤ 46.6%) to 12% in the patients with high ASP (> 46.6%). A similar, significant effect was observed for OAS where 1-year OAS decreased from 91% to 67% in patients with high ASP (> 50.2%).
A pilot study of the novel parameter suggested that ASP is a strong independent predictor of outcome in patients with primary manifestation of head and neck cancer. Univariate Cox regression revealed hazard ratios of 7.8 and 7.4 for PFS and OAS, respectively [
15]. The hazard ratios associated with high ASP were somewhat lower in the NSCLC patient group of the present study.
In head and neck cancer we found that combining ASP with the MTV further improved the predictive power (HR, 22.7 for PFS and 13.2 for OAS), despite a moderate correlation between MTV and ASP similar to the correlation between MTV and ASP in the present study (rho = 0.54, Figure
2A). The factors that mediate a positive correlation between ASP and MTV include spatial resolution and necrosis. Limited spatial resolution of PET imaging causes small lesions to appear more spherical (lower ASP) than they actually might be. Necrosis, which results in increased ASP by producing additional internal surface area within the MTV (Figure
1), is more likely to occur in large tumors than in small ones. However, the fact that the correlation between ASP and MTV was only moderate suggests that the two parameters are not redundant. Nevertheless, in the present study the combination of ASP and MTV did not improve the predictive power over ASP alone (data not shown). This accords with the fact that MTV provides weaker prognostic power in NSCLC in comparison to head and neck cancer [
15].
None of the conventional PET metabolic parameters, including SUVmax, SUVmean, MTV and TLG, showed a significant predictive effect in the current study. Previous studies of the prognostic value of SUVs in NSCLC reported rather variable results. A few studies reported an association between high SUV and poor prognosis in early stage [
6‐
8] as well as locally advanced stage NSCLC [
9]. However, other studies did not support this finding [
11,
12]. For example, a large multicenter study including 250 patients treated with primary radiochemotherapy found no significant prognostic value of
pretreatment SUVs; only
posttreatment SUVs were found to be associated with survival [
12]. The level of evidence is similar for the volumetric PET parameters MTV and TLG. Most published studies report an association between survival and these volumetric PET parameters [
10,
26‐
28], however, contradictory results have also been published [
13,
14]. For example, Soussan et al. found only the post- to pretreatment difference of volumetric PET parameters to be predictive of outcome in patients with stage III NSCLC; pretreatment parameters alone were not predictive. The somewhat conflicting results on the prognostic value of conventional pretreatment FDG-PET measures in the literature suggest that their prognostic value is rather limited and therefore underline the importance of identifying novel PET parameters which provide stronger predictive power for risk stratification from the baseline PET prior to therapy.
Concerning clinical parameters, previous studies have demonstrated the prognostic value of both patient characteristics, e.g. age or ECOG performance status [
3], and tumor specific characteristics, e.g. tumor stage, histological differentiation, blood vessel infiltration, lymph vessel infiltration and biological markers [
29]. In the present study, only one of the clinical parameters considered, primary treatment strategy, reached the level of statistical significance as prognostic factor. This might be due to the fact that the sample size did not provide sufficient statistical power to detect such effects. This suggests that the prognostic value of these clinical variables is lower than the prognostic value of the ASP in pretreatment FDG-PET. Having undergone primary surgical treatment was a predictor of outcome in the present study, particularly with respect to OAS. This is in accordance with the results of large therapy trials [
30].
There are two recent studies on the use of textural heterogeneity measures in NSCLC. Cook et al. demonstrated that high heterogeneity of FDG uptake in the primary tumor, as characterized by textural features, was associated with non-response to chemoradiotherapy and poor prognosis. In agreement with the present study neither SUVs nor volumetric parameters were found to be predictive [
21]. Tixier and co-workers compared a visual score of heterogeneity with quantitative heterogeneity measures based on textural analysis in a mixed population of lung cancer patients (n = 102) [
22]. The authors found that the visual score correlated with the quantitative heterogeneity measures, but only the quantitative measure provided independent prognostic value. This finding supports the use of quantitative measures, which are determined either fully or semi-automatically and are therefore not limited by intra- and inter-reader variability. In addition, the continuous range of values of quantitative measures most probably allows for better discrimination than visual scores with only a few discrete values. Beside textural features, several other parameters were found to be predictive for OAS (but not PFS) in the study of Tixier et al. These were SUVmean and MTV, as well as surgical treatment; the latter in agreement with the present study.
The textural features used by Cook et al. and Tixier et al. mainly characterize the voxel-wise heterogeneity of the FDG uptake within the tumor. In contrast, ASP is a quantitative measure of the irregularity of the 3-dimensional contour of the tumor in its FDG-PET image, i.e. the ASP depends on the shape of the surface of the metabolically active volume, but it is not sensitive to the variability of the FDG uptake in interior tumor voxels. The shape-based measure ASP therefore depicts different information on tumor heterogeneity to textural features based on voxel-by-voxel variability of tracer uptake. An advantage of ASP compared to other measures of heterogeneity is that it is easily computed (just by counting voxels) from the ROI which delineates the metabolically active part of the tumor, since only the volume and the surface area of this ROI are required. Thus, computation of ASP is easily integrated in any existing software which provides a ROI tool [
15]. To the best of our knowledge, the present study is the first to evaluate the prognostic value of the shape irregularity of FDG uptake in NSCLC.
We compared ASP to another shape-based measure of heterogeneity, the parameter “solidity” proposed by El Naqa and co-workers for quantitative characterization of the convexity of the metabolically active tumor lesion [
19]. These authors identified solidity as a promising prognostic parameter in cervical as well as head and neck cancer. In ROC analyses solidity provided a larger area under the curve than SUVmax and some uptake-based textural features [
19]. In the present study, solidity was found to be inversely correlated with ASP, as was to be expected. However, in contrast to ASP, solidity did not provide significant prognostic information according to univariate Cox regression, either for PFS or for OAS. Only after binarization did solidity show a significant association with PFS (not OAS), and this was still considerably lower than that found for ASP (HR 2.2 versus 3.4), although the cut-off was optimized independently for both parameters. The higher prognostic value of ASP compared to solidity was confirmed by Kaplan-Meier and multivariate regression analyses.
Limitations of the present study are its retrospective character, the inclusion of patients from a single institution only, as well as the limited sample size. Our results must therefore still be considered as preliminary. We have initiated a prospective trial to confirm the prognostic value of ASP in a larger cohort of curatively treated patients with NSCLC and to prospectively test the cut-offs proposed in the present study. If the strong prognostic power of ASP is confirmed in this study, the clinical value of ASP for stratification of high risk patients to intensified primary chemoradiation of locally advanced NSCLC will be evaluated in further studies.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
IA and JR data acquisition and analysis, writing; RB statistics, expert reading of the paper; HW, HJA, JS and SR analysis and discussion on the clinical cases, definition of therapeutic strategy and patients outcome; CF data analysis, radiological counseling; AL, GS and FH methodological development, data processing, computation of parameters; HA scientific discussion and conception of the manuscript, approval of the final content of the manuscript; IGS scientific discussion, statistical concept, revising the manuscript. All authors read and approved the final manuscript.