Introduction
Hepatocellular carcinoma (HCC) is a major global health burden being the third most common cause of cancer-associated mortality worldwide [
1,
2]. In the majority of patients, HCC arises on a background of chronic liver disease. Hence, liver transplantation is often considered treatment of choice since it addresses both the underlying liver and the oncological disease [
3]. However, a large proportion of HCC patients is too old for transplantation, has major comorbidities, or presents with other contraindications, e.g., active alcoholism as well as advanced tumor stages precluding this approach. Moreover, the limited availability of liver grafts which result in strict allocation regulation and the excellent oncological outcome of surgery in small, solitary HCC underline the importance of liver resections as a major therapeutic option in patients suffering from HCC [
4,
5]. Despite recent advances in HCC surgery including the use of dynamic liver function tests, e.g., LiMAx (maximum liver function capacity) or indocyanine green (ICG) and the increasing implementation of minimally invasive liver surgery, a significant proportion of patients is usually regarded as not ideal candidates for surgery based on a high risk of post-hepatectomy liver failure or poor overall oncological prognosis [
6‐
11]. Therefore, identifying preoperative characteristics associated with a higher perioperative risk and prognostic value for oncological outcome has been in the center of interest in HCC. The latter has led to the development of various staging systems aiming to support clinical decision-making in HCC patients.
The most widespread preoperative staging systems are the Milan criteria and the Barcelona Clinic Liver Cancer (BCLC) staging system. While the Milan criteria is used to predict the outcome in HCC undergoing transplantation based on radiologic features, the BCLC system stratifies patients based on radiologic features, physical performance, and liver function. Hence, BCLC is commonly adopted in therapy guidelines, giving distinct recommendations regarding the treatment of choice for each subgroup of patients [
12,
13]. In particular, BCLC allocates patients with early stage tumors to curative-intent surgery, while more oncological progressed individuals or patients with impaired liver function are scheduled to interventional or systemic therapy [
12]. This traditional paradigm has been challenged by recent reports indicating a survival benefit of liver resection over other treatment modalities regardless of the pre-hepatectomy BCLC stage [
14,
15]. Subsequently, various other staging systems have been proposed to overcome limitations of the BCLC staging systems, e.g. ,Cancer of Liver Italian Program (CLIP) score, Groupe d’Etude et de Traitément du Carcinome Hepatocellulaire (GRETCH) score, Italian Liver Cancer (ITA.LI.CA) tumor staging and score, Hong Kong Liver Cancer (HKLC) staging and score, model to estimate survival in hepatocellular carcinoma (MESH) score, or Okuda staging (Table
1) [
12,
13,
16‐
21].
Table 1
Synopsis of staging systems for hepatocellular carcinoma
Milan | x | x | x | | | | | | | | |
BCLC | x | x | x | x | x | x | | x | x | x | |
HKLC | x | x | x | x | x | x | | x | x | x | |
Okuda | x | | | x | | | | x | x | | |
CLIP | x | x | | x | | x | x | x | x | x | |
ITA.LI.CA staging | x | x | x | | | | | | | | |
ITA.LI.CA score | x | x | x | x | x | x | x | x | x | x | |
MESH | x | x | x | x | x | x | x | x | x | x | x |
GRETCH | | x | | | x | | x | | x | | x |
Given the variety of staging systems, we aimed to evaluate the prognostic ability of each staging systems to determine the “best” performing model in a European cohort of patients undergoing curative-intent surgery for HCC.
Discussion
HCC represents one of the major global health issues with liver resection being the treatment of choice in patients with compensated liver function [
1‐
3]. Given this importance of the disease, a variety of staging systems reflecting oncological prognosis and guiding treatment decisions have been proposed, but no international consensus has been achieved which staging system should be preferred [
12,
13,
16‐
21]. In a European cohort of patients, we were able to demonstrate a superiority of ITA.LI.CA score and CLIP over various other staging systems in their prognostic ability for OS after surgical resection. Our data does further suggest that staging systems incorporating biochemical markers of tumor biology (AFP) provide more solid estimates for OS in surgical patients than staging systems focusing on radiological characteristics only. This suggests ITA.LI.CA score and CLIP as the preferable staging systems for preoperative risk assessment balancing oncological outcome with perioperative risks in patients with HCC scheduled for liver surgery.
The BCLC staging system is traditionally considered to guide treatment decision in European patients and provides the basis of the current guideline of the European Society for Medical Oncology (ESMO) [
25]. BCLC assigns early stage HCC in patients with compensated liver function and good performance status to surgical therapy, while more advanced disease stages or more compromised individuals are referred to ablative and locoregional therapies or palliative treatment [
12,
13]. However, such conservative interpretation of the BCLC staging has to be viewed critically in the era of modern HCC surgery using minimal invasive techniques and novel liver function tests which resulted in significantly improved patient selection and perioperative outcomes [
6‐
11]. Correspondingly, two multicenter studies have shown that liver resection for HCC patients results in survival benefit over medical or interventional therapy regardless of their BCLC stage [
14,
15]. These observations are further reassured by a randomized trial indicating better survival in BCLC B patients undergoing liver resection compared to TACE [
26]. Therefore, more patients with higher BCLC stages are nowadays considered candidates for surgery provided that severe liver dysfunction and a significantly impaired performance status are absent [
14,
15]. Despite this accepted expansion of the BCLC criteria, there is lacking evidence whether higher BCLC stages—which were originally not developed to predict survival in patients undergoing surgery—can be used to provide a basis for the selection of surgical candidates. Interestingly, our data does suggest significant differences in survival in between patients categorized BCLC 0/A compared to BCLC B/C but less discriminative value of the staging system in BCLC B and C categories (Fig.
2b).
One might argue that for patients undergoing surgery, the pathology-based Union for International Cancer Control (UICC) staging systems provides an excellent stratification for estimated postoperative OS. The TNM staging usually does predict OS well in HCC patients but does not take the underlying liver disease into account which limits its overall prognostic ability [
27]. Therefore, some staging systems do incorporate the pathological TNM staging and add individual patient characteristics to overcome this major limitation (e.g., Japanese Integrated System (JIS) or Chinese University Prognostic Index (CUPI)) [
28,
29]. However, pathological staging only allows a post hoc assessment and is not available for the preoperative decision-making and patient selection. We therefore decided not to include staging systems requiring data based on postoperative pathological examinations into our present analysis.
Similar to the BCLC system, all of the reported staging systems are originally designed to cover the whole disease spectrum of HCC but not exclusively to predict OS in surgical candidates [
12,
13,
16‐
21]. Furthermore, the initial publications regarding development and validation of the different prognostic staging systems are based on large heterogeneous cohorts using various treatment approaches including palliation. This explains the observation that advanced stages of some staging systems are not represented in our analysis (Table
3, Fig.
2). In addition, previous literature predominantly compares various prognostic scores using patient cohorts with a broad disease spectrum and different treatment modalities and, therefore, might have limited use for the selection and stratification of surgical candidates [
30]. Nonetheless, to the best of our knowledge, this analysis is the first report in the literature comparing a plethora of staging systems in a European cohort of HCC patients undergoing curative-intent surgery.
Based on our comparative analysis, we identified the ITA.LI.CA score and CLIP as preferable staging system for patients scheduled for liver resection. CLIP was introduced in 1998 and can be considered an advancement over the older Okuda staging which was published in 1985 [
19,
20]. CLIP basically uses the same set of variables as the Okuda staging but added AFP and the presence of portal vein thrombosis to the assessed patient characteristics [
20]. Despite being relatively old and simple compared to some more novel staging systems, CLIP performed well against other staging systems in comparative analyses. In particular, CLIP outranked BCLC, HCLC, JIS, GRETCH, CUPI, Okuda staging, and TNM staging in its overall prognostic performance in a large Taiwanese study with 3000 patients [
31]. Another study from China also demonstrated a higher prognostic value of CLIP for 3- and 6-month OS compared to other staging systems [
32]. Based on the small variable set included into calculating the score (tumor volume compared to liver volume, Child-Pugh category, AFP, and the presence of portal vein thrombosis), CLIP is feasible to be used in surgical candidates and does display decent prognostic ability in our current study. Although showing the best discriminatory ability among the investigated staging systems, CLIP was inferior to the novel ITA.LI.CA score in terms of homogeneity and overall explanatory ability. ITA.LI.CA is a complex system-based ITA.LI.CA staging which stratifies patients with respect to size and number of tumor nodules, vascular invasion, and metastasis into four main and some sub-stages [
18]. Interestingly, ITA.LI.CA staging performed inferior to the standard BCLC staging in our analysis. The ITA.LI.CA score utilizes the ITA.LI.CA staging and adds functional status, Child-Pugh score, and AFP to calculate a score ranging from 0 to 13 corresponding to overall prognosis in HCC patients [
18]. Of note, ITA.LI.CA score has already been validated with an external cohort of patients in a study analyzing 1500 patients undergoing various treatments and showed prognostic superiority over CLIP, HKLC, JIS, ITA.LI.CA staging, and BCLC [
33].
Despite showing the best mathematical abilities to predict survival, ITA.LI.CA and CLIP are certainly not perfect from a theoretical point of view as illustrated by the fact that patients with moderate HCC stages could have inferior OS compared to patients with higher cancer stages especially in the ITA.LI.CA staging (Fig.
2). This underlines that staging systems are helpful regarding patient selection but are just one of multiple characteristics guiding decision-making in this complex disease.
Interestingly, the two staging systems which showed the best prognostic ability (CLIP and ITA.LI.CA score) in our setting were originally developed using European patient cohorts. In contrast, the HKLC staging which is based on Asian HCC patients performed only slightly better than the Milan criteria which we have included in our analysis for reference reasons [
13,
21]. These observations may suggest a potential difference between Asian and European patient cohorts and its impact on the prognostic ability of the various staging systems. General disease etiology and even genomic characteristics vary between Asian and European patients [
34]. Also, the general approach to HCC seems to be more aggressive in Asian cohorts. This might partially be explained by the larger proportion of viral etiology in Eastern patients which results in a generally younger HCC population with often less severe underlying cirrhosis and fewer comorbidities [
35]. Therefore, staging systems developed for European cohorts might be more suitable for European patients. The same is true for Eastern patients as recently demonstrated within a large Singaporean cohort. In a comparative analysis of Selby et al. comprising 716 patients, HKLC showed a better performance in guiding treatment compared to BCLC [
36]. These considerations do also imply limitations when the results of comparative analyses of staging systems in Eastern patients are directly transferred to Western HCC patients.
Among the staging systems that do not include AFP to correlate radiological and clinical patient characteristics with tumor biology, BCLC showed good results in our cohort. However, in our subgroup of patients with available information on AFP levels, CLIP and ITA.LI.CA score provided a better overall staging performance. Based on this, AFP seems to be a major contributor for accurate staging of HCC patients undergoing surgery. AFP is a known predictor of OS in various clinical situations of HCC patients and characteristics of the tumor [
37]. Thus, it is not surprising that staging systems incorporating AFP might be superior in their overall prognostic performance. This observation does further underline the importance of tumor biology and the individual genetic pathogenesis of HCC. Nault et al. have recently proposed a gene score including 5 genes to predict OS and demonstrated significant prognostic accuracy in a surgical cohort of patients [
38]. It is therefore important that future staging systems integrate novel biomarkers to further increase the prognostic value of pretreatment staging in HCC patients.
Like any other clinical study, our analysis has certain inherent limitations. All HCC patients analyzed in this study underwent treatment in a monocentric setting reflecting our individual clinical approach to this particular disease, and the study is based on a retrospective data collection which was not obtained in a controlled prospective clinical trial. This also results in large proportion of ASA III patients and individuals with higher BCLC stages due to our liberal department policy. Further, our data set appears small compared to some other studies especially from Asian cohorts. Most importantly, however, the majority of studies focusing on staging systems for HCC comprise heterogeneous cohorts in which curative treatments are carried out in the minority of patients.
Notwithstanding the aforementioned limitations, we here provide a detailed analysis of a plethora of HCC staging systems in a European cohort of patients who underwent curative-intent liver resection, demonstrating ITA.LI.CA score and CLIP to be the most suitable staging systems for surgical candidates.
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.