Background
The very recent epidemiologic study has shown that liver cancer ranks the sixth commonly diagnosed cancer and the fourth leading cause of cancer death in the world. An estimated 84,100 liver cancer cases occurred and 78,200 liver cancer cases died in 2018 [
1]. Hepatocellular carcinoma (HCC) compromises 75–85% of primary liver cancer [
1]. The main clinical curative treatments for HCC include liver transplantation, percutaneous radiofrequency ablation and liver resection, among which liver resection is the most employed treatment [
2]. Although 5-year overall survival rate reaches up to 50%, recurrence occurs in more than 70% HCC patients after curative surgery [
3]. Clinically, the recurrence within 2-year after resection is defined as early recurrence, whereas the recurrence > 2-year is defined as late recurrence. Compared to late recurrence, HCC patients with early recurrence usually showed poorer prognosis [
4].
Currently, many approaches, such as the TNM staging system of the American Joint Committee on Cancer (AJCC), the Barcelona Clinic Liver Cancer (BCLC) classification, and the Cancer of the Liver Italian Program (CLIP) staging system, have been employed to evaluate the prognosis of HCC patients [
5]. However, their assessment criteria mainly rely on the clinicopathological features of HCC patients but do not take into account the critical and complicated molecular pathogenesis, an important factor in determining the outcome of HCC. Therefore, their prognostic predictive performance was unsatisfactory [
6]. Meanwhile, serum alpha-fetoprotein (AFP) detection and medical imaging techniques are clinically used for post-surgery surveillance of recurrence in HCC patients, but with limited effectiveness due to the low specificity and sensitivity of those surveillance means [
7].
The advent of high throughput array/sequencing and high-efficiency big data analysis in past decades makes it possible and reliable to construct multi-gene signatures to evaluate prognosis and predict therapeutic response in cancer patients. For example, a 70-gene signature had been established to aid decision making of adjuvant chemotherapy in patients with estrogen receptor-positive early breast cancer [
8,
9]. More importantly, this 70-gene-signature based diagnostic test known as “MammaPrint” (Agendia, Amsterdam, The Netherlands) has been approved by the Food and Drug Administration (FDA) to predict breast cancer recurrence [
10], and been validated in several retrospective studies [
11,
12]. Additionally, an 18-gene signature ColoPrint (Agendia, Amsterdam, The Netherlands) was developed to predict disease relapse in patients with early-stage colorectal cancer (CRC) [
13], and had been validated in other independent studies [
14,
15]. Several multi-gene signatures have been constructed in HCC for prognosis evaluation. For example, Wei et al. developed a 20-miRNA signature to predict post-surgery survival in HCC patients [
16]; Nault et al. constructed a 5-gene signature to evaluate the overall survival in HCC patients [
17]; Kim et al. established a 233-gene signature to predict late recurrence in HCC patients [
18]. Recently, prognostic signatures based on specific groups of genes such as glycolyis-related genes, metabolic-related genes and autophagy-related genes were also reported [
19‐
21]. However, those multi-gene signatures of HCC mainly focus on overall survival and later recurrence, and few multi-gene signatures have been established to predict early recurrence in HCCs.
Long non-coding RNAs (lncRNAs) are a class of transcripts that are longer than 200 nucleotides (nt) and do not encode proteins [
22]. Accumulating evidence has indicated the involvement of lncRNAs in diverse biological processes and disease pathogenesis [
23]. Moreover, some lncRNAs have been reported to contribute to the initiation and progression in HCCs. For example, lncRNA-ANRIL has been reported to promote hepatocarcinoma cell proliferation [
24]; and lncRNA-MALAT1 could function as a proto-oncogene to transform hepatocytes and enhance hepatocarcinoma cell growth [
25]. In addition, some lncRNAs have been demonstrated to associate with HCC prognosis. For example, the overexpression of lncRNA-MVIH was associated with poor recurrence-free survival and overall survival in HCC patients [
26]; LncRNA-PTTG3P expression was positively associated with tumor size, TNM stage and poor survival in HCC patients [
27]. Although lncRNAs are involved in the progression and associated with prognosis in HCCs, lncRNA-based gene signatures for HCC prognostic evaluation, especially for early recurrence, are limited.
In this study, we analyzed the expression profile of lncRNAs and their association with early recurrence in the Liver Hepatocellular Carcinoma (LIHC) project from The Cancer Genome Atlas (TCGA) database (TCGA-LIHC). We constructed a 25-lncRNA signature significantly associated with HCC early recurrence. Based on this multi-lncRNA signature, HCC patients can be classified into low- and high-risk groups according to their risk scores. The early recurrence rate was significantly higher in the high-risk group than in the low-risk group. Moreover, the risk score negatively correlated with recurrence-free survival in HCC patients. Multivariate Cox regression analysis demonstrated that the 25-lncRNA signature, serum AFP, TNM stage and vascular invasion were 4 independent risk factors of HCC early recurrence. Compared with the other 3 risk factors, the 25-lncRNA signature had the best predictive performance for HCC early recurrence. Furthermore, the 25-lncRNA signature could synergize with serum AFP, TNM stage and vascular invasion to improve the prognosis evaluation for HCC early recurrence. In addition, in the context of this 25-lncRNA risk signature, we demonstrated that the “E2F TARGETS”, “G2M CHECKPOINT”, “MYC TARGETS V1” and “DNA REPAIR” were the most significantly enriched gene sets in the high-risk group. Moreover, the low-risk group showed greater tumor-infiltrating lymphocytes (TILs) compared to the high-risk group, and the 25-lncRNA prognostic signature was significantly negatively associated with the potent antitumor immune cells (i.e. type 1 T helper cell, effector memory CD8 T cell and activated CD8 T cell). Finally, the low-risk group was predicted to be more sensitive to immunotherapy like anti-PD-1 and chemotherapies like docetaxel, gefitinib and vinblastine, while the high-risk group was predicted to be more sensitive to doxorubicin, mitomycin C and paclitaxel. In conclusion, our findings may provide some insight into lncRNA-based personalized treatment and improve the strategy of post-surgery recurrence surveillance in HCC patients.
Discussion
As a class of non-coding transcripts, lncRNAs have been identified in all model organisms. So far, over 56,000 human lncRNAs have been reported in recent lncRNA annotations and the number of lncRNAs keeps growing [
61]. Unlike protein-coding genes, most lncRNAs are less conserved, which leads to neglect of the function of lncRNAs [
62]. However, accumulating evidence mainly at cellular level has indicated the involvement of lncRNAs in various biological processes such cell proliferation, apoptosis and nutrient sensing to cell differentiation [
63]. Moreover, dysregulation of lncRNAs has been implicated in the pathogenesis of various diseases including cancers [
64]. Many lncRNAs have shown their prognostic value in many types of cancers [
26,
27]. In this study, we established a 25-lncRNA risk signature to predict HCC early recurrence. We demonstrated that, compared to AFP, TNM and VI, this 25-lncRNA risk signature possesses the best prognostic potential for HCC early recurrence. Moreover, the combination of the lncRNA risk signature with AFP, TNM and VI could further improve the predictive performance.
In this study, we define the recurrence in 2-year post-surgery as HCC early recurrence. This is in agreement with a previous study showing that the slopes of early recurrence curve and late recurrence curve are different and the intercept time point of the two curves is defined as the cut-off to separate early and late recurrence [
4]. This separation criterion is widely adopted by many other studies [
65]. We also noticed that some studies define the recurrence in 1-year post-surgery as HCC early recurrence [
66,
67]. Therefore, we analyzed the association of the 25-lncRNA risk signature with both 1-year and 2-year recurrence in most of our analyses and found that this risk signature has great prognostic potential for both of them.
The 25-lncRNA risk signature includes 19 risk lncRNAs (coefficient > 0) and 6 protective lncRNAs (coefficient < 0) (Table
1). Among these lncRNAs, dysregulation of LINC02159, CLDN10-AS1, LOC643201, LRP4-AS1, LOC730100, LINC01697, LOC100505622, and LINC00261 has been reported in several types of cancers (Table
1). In addition, CLDN10-AS1 was reported to be involved in endothelial dysfunction in atherogenesis (Table
1). Some previous studies have suggested the association of LOC153910 with lung function development, risk of chronic obstructive pulmonary disease (COPD) and cardiovascular diseases (CVD) (Table
1). LINC00261 has shown to regulate endoderm differentiation, lung epithelial homeostasis and endometriosis (Table
1). Given the fact that most of lncRNAs demonstrate a tissue-specific expression pattern [
68], further investigation of the role of those 25 lncRNAs in HCCs is warranted.
To investigate the biological processes or pathways related to HCC early recurrence, we performed GSEA to explore the hallmarks of gene sets in the high-risk group. Total 8 gene sets were significantly enriched in the high-risk group. Among them, the gene sets of “E2F TARGETS”, “G2M CHECKPOINT”, “MYC TARGETS V1” and “DNA REPAIR” showed higher significance in enrichment. In fact, members of those four gene sets have been reported to associate with poor prognosis in many types of cancers including HCC [
69‐
72].
Accumulation of TILs is commonly related to an improved prognosis in many types of cancers. In the present study, greater intratumor accumulation of TILs was observed in the low-risk group compared to high-risk group. We demonstrated that the 25-signature risk score significantly and negatively associate with intratumor accumulation of type 1 T helper cell, effector memory CD8 T cell and activated CD8 T cell, which are well-known antitumor immune cells involved in cancer immune therapy [
73‐
75], further suggesting that this 25-lncRNA signature has potential to predict the post-surgery prognosis in HCC patients. In addition, the immunotherapies prediction based on this 25-lncRNA signature suggested that the low-risk group had more effective response to PD-1 inhibitor. Moreover, chemotherapies prediction indicated that the low- and high-risk showed different sensitivity to drugs such as docetaxel and paclitaxel, but not cisplatin and sorafenib. Thus, different therapies might be adapted to HCC patient in the low- and high risk group according to the 25-lncRNA signature.
Although the 25-lncRNA risk signature was validated in the TCGA internal validation cohort and displayed good prognostic potential in the enrolled 299 HCC patients, an external validation cohort is missing in this study. This is because we failed to find any suitable GEO datasets or International Cancer Genome Consortium (ICGC) database which could apply sufficient information on both lncRNA expression profile and clinical survival. For example, there are two GEO datasets, GSE67260 and GSE113850, possess satisfied data on lncRNA expression profile but without clinical records. Moreover, expression profiles of 22 lncRNAs in the 25-lncRNA signature could be extracted from two ICGC datasets including LIRI-JP and LICA-FR, but disease free survival information is missing. The Cancer Genome Atlas (TCGA) is a multi-institutional, cross-discipline effort led by the National Cancer Institute recruiting cancer samples from different countries. For example, the HCC samples recruited in the TCGA-LIHC database were derived from Vietnam, United States, Canada, South Korea, Russia [
76]. Therefore, those samples are actually derived from multi-centers and at certain level support the approach we used in this study by splitting them into a training cohort and a validation cohort. However, validation of this 25-lncRNA risk signature in an external cohort will be warranted as long as suitable data are available. Meanwhile, the validation of individual lncRNA included in this 25-lncRNA signature in clinical HCC tumor and paracancerous tissues were in processing but have not completed yet. LncRNAs ENSG00000231918, ENSG00000248596, and ENSG00000223392 were found to be upregulated in 36 HCC tumor tissues compared with paracancerous tissues (Fig. S
7, Table S
4).
Conclusions
In this study, we established a 25-lncRNA risk signature for HCC early recurrence. According to this risk signature, HCC patients could be accurately separated into the low- and high-risk groups. 1-year and 2-year recurrence rates were significantly higher in the high-risk group than those in the low-risk group. More importantly, the risk score significantly and negatively correlates with DFS in recurrent HCC patients in the high-risk group. Univariate and multivariate Cox regression analyses showed that the 25-lncRNA risk score, serum AFP, TNM stage and vascular invasion (VI) were independent prognostic factors for HCC early recurrence. Moreover, compared to serum AFP, TNM stage and VI, the 25-lncRNA risk signature showed better prognostic potential for HCC early recurrence. In addition, the combination of the 25-lncRNA risk signature with serum AFP, TNM stage and VI could further improve the prognostic potential for HCC early recurrence. Meanwhile, GSEA showed that several gene sets related to malignancy, such as “E2F TARGETS”, “G2M CHECKPOINT”, “MYC TARGETS V1” and “DNA REPAIR”, have significantly enriched in the high-risk group, suggesting that lncRNAs included in this risk signature may affect HCC progression through those biological pathways. Moreover, ssGSEA revealed greater TILs in the low-risk group compared to the high-risk group and the negative association between the 25-lncRNA risk signature score and the intratumor population of several key antitumor TILs such as type 1 T helper cell, effector memory CD8 T cell and activated CD8 T cell, and SubMap algorithm predicted that the low-risk group was more sensitive to anti-PD-1 therapy. Finally, Chemotherapies prediction revealed that the low risk was associated with sensitivity to docetaxel, gefitinib and vinblastine, while high risk was associated with sensitivity to doxorubicin, mitomycin C and paclitaxel.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.