Introduction
Intrahepatic cholangiocarcinoma (ICC), a rare malignant tumor originating from the epithelium lining and peribiliary glands of the secondary bile ducts extending to the terminal branches of the intrahepatic biliary tree, has seen an increasing incidence in recent decade and now accounting for approximately 10 − 15% of primary liver cancer (PLC) [
1,
2]. Surgical resection remains the fundamental and potentially the most efficacious approach for managing patients with resectable ICC [
3]. Unfortunately, only a small subset of patients with ICC present with resectable tumors, and long − term outcomes after even curative − intent resection remain dismal, with reported 5-year overall survival rates ranging from 20–35% [
4]. This can be largely attributed to the tumor progression and the nature history of recurrence [
5]. Even among patients with similar tumor morphology characteristics (e.g., tumor size and tumor number) and comparable tumor biological markers (e.g., carcinoembryonic antigen, abbreviated as CEA) or within the same TNM stage, there remains significant diversity in outcomes following ICC resection [
6,
7]. Given the suboptimal outcomes frequently observed postoperatively, there has been intense interest in identifying means to select patients better in the preoperative setting who might benefit more from curative − intent resection of ICC and who are better candidates for neoadjuvant treatment strategies.
To date, only a few clinical staging systems and personalized prediction models for ICC have been established to better understand the prognosis of specific patient groups and provide individual prognostic predictions [
8‐
12]. For instance, the Tumor − Node − Metastasis (TNM) classification system of ICC, promulgated by the American Joint Committee on Cancer (AJCC) and the Union for International Cancer Control, is the most extensively employed and accepted staging schema in clinical practice [
8]. In addition, the utilization of an alternative Liver Cancer Study Group of Japan (LCSGJ) staging system is infrequent and limited to local practices [
11]. However, the common predictors of these staging systems, such as tumor size, tumor number, vascular invasion, lymph node metastasis, and extrahepatic metastasis, are solely based on post − operative histopathological examination and cannot be effectively assessed preoperatively. Simultaneously, Wang et al. [
12] and Hyder et al. [
9] have sequentially established individual prediction models using nomograms that incorporate numerous clinical and pathological indicators. Despite the favorable predictive performance demonstrated by these models, the complexity of these prognostic models restricts their practicality and widespread application in patient care, even after validation [
13]. Establishing a preoperative prediction model with minimal parameters is urgently required to provide comprehensive prognostic information, accurately assess which patients may benefit the most from a given treatment, and inform discussions on long-term outcomes [
14].
The TBS, initially proposed by Sasaki et al. [
15], is a comprehensive morphological measure that integrates tumor size and tumor number for patients undergoing hepatic resection of colorectal liver metastasis (CRLM). Traditional assessments of tumor burden typically involve the tumor size and tumor number, with both variables having been incorporated into the AJCC − ICC − TNM staging system [
8]. However, in the majority of prognostic prediction models, when shifting these two continuous variables to categorical variables, the selection of cutoff values is often relatively arbitrary [
16‐
18]. TBS overcomes the limitations of this category classification by utilizing continuous variables, thereby avoiding arbitrary cutoff values that may diminish statistical power and potentially result in inaccurate inferences [
19]. In recent years, there has been a growing interest in studies on TBS in PLC, making it a clinical research hotspot. The TBS model has demonstrated its prognostic risk stratification capabilities in hepatocellular carcinoma (HCC) [
20], ICC [
21], and combined hepatocellular − cholangiocarcinoma [
22], while remaining user − friendly. Relevant studies have shown a strong correlation between radiological TBS and pathological TBS, with no significant difference observed in postoperative prediction ability [
23,
24]. Furthermore, several studies have suggested the integration of TBS with tumor markers (e.g., AFP, CA19 − 9) [
25,
26] or liver function surrogate index (e.g., ALBI) [
27] to develop innovative classification models. These prognostic models have demonstrated effectiveness in prognostic stratification, surpassing the standalone TBS model in certain scenarios.
Elevations in tumor markers, such as CEA, also serves as a significant adverse prognostic factor for ICC, indicating unfavorable tumor biology [
28,
29]. In 2022, Sanchez et al. [
7] demonstrated elevated serum CEA levels (cutoff value, 5 IU/ml) was observed in patients with locally advanced or metastatic (both
p < 0.001) ICC when compared to those with earlier − stage, liver − confined disease. Recently, Moazzam et al. [
30] utilized international multi − institutional data to incorporate TBS and CA19 − 9 into a composite CTC grading system, allowing for the categorization of patients into distinct subgroups. Importantly, higher CTC grades were correlated with worse outcomes in terms of recurrent free survival (RFS) and overall survival (OS). However, it is worth noting that most of these studies lack validation cohorts, which somewhat limiting their generalizability.
To date, there is a lack of literature reporting the efficacy of models that combine radiological TBS and serum CEA levels in preoperative evaluation. The objective of this study was to develop and externally validate a simplified prognostic CTC grading system through a large sample, multi − institutional database that is user − friendly in clinical practice, applicable in a preoperative setting, highly accurate, and discriminatory.
Methods
Study population and selection criteria
A retrospective study was conducted, collecting data from eight large tertiary medical institutions in China from 2011 to 2019. The derivation cohort data (812 patients) were collected from the Primary Liver Cancer Big Data (PLCBD) system [
31], which includes Mengchao Hepatobiliary Hospital of Fujian Medical University and Eastern Hepatobiliary Surgery Hospital of Naval Medical University. The data from the PLCBD system are prospectively collected and updated annually since 2019. The validation cohort data (266 patients) were obtained from the electronic medical record systems of The Southwest Hospital Affiliated to the Army Medical University, Beijing Friendship Hospital Affiliated to Capital Medical University in Beijing, Cancer Hospital of Chinese Academy of Medical Sciences, Tongji Hospital Affiliated to Tongji Medical College, Renji Hospital Affiliated to Shanghai Jiaotong University, and West China Hospital of Sichuan University. The patients in this study all provided informed consent prior to surgery, and strict adherence was maintained to the guidelines of the Declaration of Helsinki. Ethical approval was obtained from the Institutional Review Boards of all participating institutions (approval number 2022_077_01).
The inclusion criteria were as follows: (1) ECOG score of 0 − 2, (2) Child–Pugh score from A5 to B7, (3) Dynamic contrast − enhanced computed tomography (CT) or magnetic resonance imaging (MRI) assessment of tumor size and number within one month before surgery, (4) serum CEA level measured within one week before surgery, (5) undergoing curative − intent resection, (6) with postoperative pathological confirmation of ICC. The exclusion criteria were as follows: (1) Postoperative pathological confirmation of cHCC − CCA, (2) R2 resection, (3) receipt of therapies such as radiofrequency ablation, local interventional procedures, or chemotherapy before surgery, (4) recurrent ICC or simultaneously having malignancies in other anatomical sites, (5) Patients who died within 30 days postoperatively or were lost to follow − up shortly after surgery. Unexpectedly, intraoperative identification of extra − regional lymph node spread, peritoneal metastasis, and mesenteric metastasis, requiring concurrent resection with the primary lesion for complete eradication, and these data were not excluded.
Variables of interested
Demographic and clinicopathologic data were collected, including age, gender, hepatitis B virus (HBV) infection, American Society of Anesthesiologists Physical Status Classification (ASA) score, presence of cirrhosis, Child − Pugh grade. Hematological parameters included platelet count, neutrophil count, lymphocyte count, as well as CA19 − 9 and CEA levels. Neutrophil count and lymphocyte count together constituted the Neutrophil − to − Lymphocyte Ratio (NLR). Tumor number and the size of the largest lesion were evaluated by CT or MRI. Furthermore, data on the type of resection, histologic tumor grade, presence of macrovascular invasion, microvascular invasion, margin status [i.e., microscopically negative (R0), positive (R1)], postoperative 30 − day complication, and receipt of adjuvant therapy were also recorded. The TNM staging was performed according to the 8th edition of the AJCC staging manual.
TBS definition and CTC Grade evaluation
TBS is calculated as the Euclidean distance in a Cartesian plane, considering two variables: maximum tumor size (x − axis) and the number of tumors (y − axis). For patients with multiple nodules, tumor size was defined as the size of the largest lesion. The formula for TBS computation follows the Pythagorean theorem: TBS2 = (the size of the largest lesion)2 + (number of tumors)2.
The receiver operating characteristic (ROC) analysis was utilized to determine the optimal cutoff values for TBS (6.1 units) and CEA (5.1 U/mL). TBS and CEA levels were then classified into low and high categories, respectively. Subsequently, CTC grades were established by combining the categories: low CTC grade (low TBS/low CEA), intermediate CTC grade (low TBS/high CEA or high TBS/low CEA), and high CTC grade (high TBS/high CEA).
Definition of other important clinical and follow − up related variables
Major hepatectomy was defined as resection of three or more Couinaud segments according to the consensus of the Brisbane 2000 system [
32]. Postoperative complications occurring within 30 days were classified according to the Clavien − Dindo classification [
33]. The curative − intend surgery was to completely remove the macroscopic tumors with adequate resection margins. The presence of visible tumor remnants at the surgical margin was classified as an R2 resection and excluded from the study. Microvascular invasion (MVI) was defined as the presence of intraparenchymal vascular involvement identified on histological examination. Macrovascular invasion was defined as the involvement of primary and secondary branches of the portal vein or hepatic artery, or the invasion of one or more of the three major hepatic veins [
8]. The primary outcome of interest was OS, defined as the time interval between the date of ICC liver resection and the date of death or last follow − up. Secondary outcome was RFS, defined as the time between resection of ICC and date of recurrence or last follow − up. Recurrence of ICC was confirmed either by tumor biopsy or the identification of a suspicious lesion on follow − up imaging.
Statistical analysis
Continuous variables were presented as mean ± standard deviation (SD) or as median and interquartile range (IQR). Differences were assessed using one-way analysis of variance (ANOVA) or the Kruskal-Wallis H tests (K-W tests). Categorical variables were expressed as frequencies and percentages, and group comparisons were conducted using the Chi-square test. Survival analyses for OS and RFS were performed through Kaplan-Meier (KM) curves, with differences evaluated by the log-rank test. Independent prognostic factors for OS and RFS were identified using Cox regression analyses. The optimal cutoff values for TBS and CEA were determined through ROC analysis by maximizing the Youden Index. The discriminative ability of the model was analyzed by the Harrell c-index and areas under the receiver operating characteristic curve (AUC). Calibration of the model was analyzed using the calibration plot with Hosmer-Lemeshow-Test. Statistical analysis was executed using SPSS® version 25.0 (IBM, Armonk, New York, USA) and R program version 3.2.0 (
http://www.r-project.org/). A
p-value of less than 0.05 was considered statistically significant.
Discussion
In this study, we used a large multi-institutional database to develop and validate a simplified preoperative prognostic model, named the CTC model. This model successfully stratified long-term outcomes for patients undergoing curative-intent resection for ICC in both the derivation and validation cohorts. Our findings revealed that a high TBS grade and elevated CEA levels were independent risk factors for poor prognosis in ICC patients. These indicators demonstrated promise in predicting OS and RFS in ICC patients. When categorizing patients into low, intermediate, and high CTC groups based on varying TBS and CEA levels, we observed that patients in the high CTC group had worse OS and RFS rates compared to those in the median and low CTC groups. Consistency in these outcomes was observed in the external validation cohort. Cox regression analyses in the derivation cohort further confirmed that medium and high CTC grades were an independent risk factors for both OS (5 − year OS: low CTC, 48.9% vs. intermediate CTC, 18.7% vs. high CTC, 9.1%; p < 0.001) and RFS (5 − year RFS: low CTC, 38.2% vs. intermediate CTC, 17.1% vs. high CTC, 7.2%; p < 0.001). Importantly, the CTC grading system demonstrated moderate to well prognostic prediction capabilities in both the derivation and validation cohorts. These AUC values surpassed those of individual TBS and CEA level assessments. CTC grade system was further able to stratify prognosis within certain TNM stages (stages I and III/IV), indicating its potential as a valuable tool for estimating the prognosis of patients undergoing resection for ICC. In summary, our newly developed CTC grading system serves as a reliable prognostic predictor for ICC patients undergoing hepatectomy. This system can aid in risk stratification and treatment decision − making for ICC patients, particularly in estimating prognosis.
Relative to the potential benefits of surgical resection, risk stratification of patients is crucial for aiding in treatment decision − making and facilitating reliable discussions around prognosis with ICC patients. Our newly developed CTC grading system combines radiological TBS and serum CEA levels, both of which can be easily assessed preoperatively. Tumor markers such as CEA and CA19 − 9 have long been recognized as surrogates for tumor biology and are well − established predictors of long − term outcomes in ICC patients [
28,
34,
35]. They are commonly measured in clinical practice when ICC is suspected and inform prognosis after therapies. Our data showed that elevated CEA levels predicts worse prognosis (5 − year OS: high CEA levels, 9.0% vs. low CEA levels, 36.7%;
p < 0.001), consistent with our previous findings that preoperative elevation in serum CEA serves as a predictor of poorer outcomes [
12]. In the multivariable analysis, we also observed that elevated CA19 − 9 level also serves as an adverse prognostic factor for OS. It’s important to note that we did not conduct an additional in − depth analysis of CA19 − 9 in this study. Recently, Li et al. [
26] and Moazzam et al. [
30] successively constructed CTC grading models based on the combination of CA19 − 9 and TBS levels, consistently demonstrated that high CTC grading is associated with poor prognosis. However, it is worth noting that they employed different approaches to determine the cutoff value for CA19 − 9 level. Li utilized the normal upper limit of 37 U/ml as the threshold, whereas Moazzam derived a cutoff value of 125 U/ml through ROC analysis. Differences in cutoff values can result in misclassification of specific patients. In our study, we employed the ROC approach to determine the cut − off value of CEA level, yielding a 5.0 U/ml. This value is in proximity to the normal upper limit of 5.0 U/ml at our center and to the cutoff value of 5.0 U/ml utilized by Sanchez et al. [
7].
The concept of tumor burden, originating from the “Metro − ticket” system proposed by Mazzaferro et al. [
36], highlighted a negative correlation between tumor burden and OS. Substantial studies have supported that tumor size and tumor number are important prognostic factors. Notably, tumor size and tumor number indicate the extent of the tumor in ICC, and are included in the AJCC staging systems. In the 7th edition of the AJCC staging manual, T category did not take into account tumor size [
37]. Subsequently, in the revised 8th edition of the AJCC staging manual, T1 stage was further classified into T1a (isolated tumor ≤ 5 cm) and T1b (isolated tumor > 5 cm), emphasizing the effect of tumor size on outcomes [
8]. These two parameters are dichotomously categorized with arbitrary cutoffs in the AJCC staging manual, which may present limitations when assessing the prognosis of patients with variable tumor sizes and nodules. The simplistic T staging system may not sufficiently offer precise information for providing personalized treatment recommendations and making decisions for patients with resectable ICC. A composite metric of tumor morphology such as TBS may be helpful in capturing total tumor burden within the liver relative to prognosis. Following this, based on the “Metro − ticket” system, Sasaki et al. [
15] proposed the TBS model, which was based solely on the maximum tumor diameter and the number of lesions and represents the tumor burden in patients with CRLM. In our multivariate analysis, high TBS was a negative prognostic factor. Moreover, high TBS was associated with an inferior 5 − year RFS rate (low TBS, 32.1% vs. high TBS, 15.4%;
p < 0.001) and 5 − year OS rate (low TBS, 40.3% vs. high TBS, 17.6%;
p < 0.001).
Furthermore, TBS score combined with other primary tumor factors, demonstrates enhanced predictive efficacy for the prognosis of patients following ICC resection. Recently, Munir et al. [
27] have demonstrated that both TBS and ALBI grade significantly influence outcomes following ICC resection. Patients with both high TBS and ALBI grade had significantly higher hazards of death compared with those who had both low TBS and ALBI grade disease (HR 2.42, 95% CI 1.57–3.73;
p < 0.001). Ding et al. [
25] developed an “AFP − TBS (ATS)” prognostic model that incorporates TBS and AFP levels at the time of initial diagnosis before surgery and at the time of recurrence to predict post − recurrence survival following the initial resection of HCC. The ATS model was negatively correlated with post − recurrence survival time and demonstrated a time − dependent AUC value of 0.70 (95% CI 0.64 − 0.75), surpassing other staging systems. Importantly, a high ATS grades significantly correlates with poorer OS and RFS outcomes, highlighting its potential clinical utility in risk stratification and treatment decision − making for ICC patients. Similar findings were observed in our study, where higher CTC grades were associated with worse RFS and OS outcomes. The CTC model, outperforming both CEA and radiographic TBS with AUC values of 0.694, 0.642, and 0.616 for OS, respectively, in pairwise comparisons (all
p < 0.001), provides a comprehensive preoperative prognostic assessment.
It is worth noting that when analyzing subgroups across the TNM classification in the derivation cohort, patients in stage II with a high CTC grade exhibited better 5-year OS (17.6% for intermediate CTC and 28.5% for high CTC) and RFS (16.6% for intermediate CTC and 16.8% for high CTC) compared to those with an intermediate CTC grade. When interpreting this result, multiple factors may be influenced, so conclusions should be drawn with caution. Compared to high CTC grades, intermediate CTC grades are associated with higher rates of MVI (intermediate CTC:
n = 65, 18.3% vs. high CTC:
n = 14, 11.1%) and R1 resection (intermediate CTC:
n = 45, 12.7% vs. high CTC:
n = 12, 11.2%) (Table
1), which may partially influence the outcomes. MVI and R1 resection are both potential adverse prognostic factors for ICC. Additionally, incomplete pathological examination results may have underestimated the proportion of MVI. Another important factor may also be worth noting that TNM Stage II includes patients with both multiple tumors and presence of vascular invasion, implying that both multifocal tumors and vascular invasion have an equal impact on prognosis. According to our findings, the current Stage II may necessitate further subcategorization. Defects cannot belittle virtues; Our CTC grading system generally maintains good stability.
The CTC model effectively stratified OS and RFS in patients with ICC who underwent liver resection, demonstrating good predictive performance as evidenced by discrimination and calibration curves analyses. Patients in the high CTC grade group exhibited significantly lower 5-year OS and RFS rates compared to those in the low- and intermediate- CTC grade groups. Therefore, it is imperative to thoroughly discuss the risks and benefits of upfront surgery with patients who have a high CTC grade, and to ensure a more intensive follow-up postoperatively. Currently, the application of molecularly targeted agents and immunotherapies has expanded the treatment options for ICC and improved patient prognosis. A large retrospective cohort study has shown that despite typically being used for more advanced tumors, the neoadjuvant treatment is associated with significantly improved survival compared to up − front surgery for resectable ICC [
38]. We hypothesized that patients with a high CTC grade might derive potential benefits from neoadjuvant therapy, and the CTC model could serve as a reference for decision-making in preoperative neoadjuvant treatment.
When interpreting the data from the current study, several limitations should be considered. Firstly, like all retrospective studies, there is the potential selection bias in choosing patients for surgical resection. Secondly, while the multi − institutional database was a strength, the absence of a control group (i.e., patients receiving alternative or non − surgical treatments) limits the generalizability to non − curative intend ICC patients. Thirdly, the exact etiology of ICC remains unclear [
39,
40]; Chinese patients typically have a higher prevalence of viral hepatitis backgrounds, whereas Western patients are more commonly associated with non − alcoholic steatohepatitis. Therefore, caution should be exercised when assessing the applicability of CTC grading in Western countries, necessitating further comprehensive research and discussion on this issue. Moreover, elevated CA19 − 9 levels were also associated with adverse prognosis in the multivariable analysis. However, since this study primarily focused on the impact of TBS, CEA, and their combined CTC grading on prognosis, an extensive analysis of CA19 − 9 levels was not conducted. Therefore, future study integrating CA19 − 9 into the CTC model may enhance the performance of this model.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.