Introduction
Colorectal cancer (CRC) represents approximately 10% of all cancers. CRC is the third most commonly diagnosed and second most fatal cancer globally. Approximately 9.4% of cancer-related deaths were due to CRC in 2020 [
1]. CRC is largely asymptomatic until alarm features develop at advanced stages [
2]. Clinically, more than half of colon cancer patients have undetectable small metastases before surgery [
3]. Thus, it is an urgent need to identify new biomarkers for the early diagnosis and treatment to effectively enhance prognosis of CRC.
Mammalian Cab45/reticulocalbin/ERC-45/calumenin (CREC) protein family consists of reticulocalbin, ER Ca
2+-binding protein of 55 kDa (ERC-55), reticulocalbin-3, Ca
2+-binding protein of 45 kDa (Cab45), and calumenin, and these five proteins are encoded by five genes RCN1, RCN2, RCN3, SDF4, and CALU, respectively [
4]. The CREC protein family contains multiple 'EF-hand' Ca
2+-binding motifs, and participates in secretory pathway, signal transduction, as well as several disease processes [
5,
6]. Although proteomic analysis showed that CREC family members are highly expressed in CRC, there was less experimental validation [
7,
8]. The aberrant expression of CREC family members is associated with poor prognosis of several malignant tumors, including non-small cell lung cancer [
9], bladder cancer [
10], and glioma [
11]. However, the prognostic value and the role of CREC family in colorectal cancer have yet to be fully elucidated and thus deserve extensive studies.
The present study aimed to identify the significance of CREC family in CRC progression. We comprehensively analyzed the expression patterns, prognostic values, immune cell infiltration and biological functions of CREC family in CRC using publicly accessible databases and further verified the bioinformatic results in human CRC cell lines and CRC tissues. We demonstrated the value of CREC family in CRC progression and the preliminary molecular mechanisms and suggested the possibility of CREC family as potential target for the treatment and prognosis evaluation of CRC.
Materials and methods
Oncomine analysis
The online gene expression array database Oncomine [
12] (
http://www.oncomine.org) was used to analyze the transcription levels of CREC family in different cancers. Student's
t test was used to generate
p values for the differences of the mRNA levels of CREC family members between clinical tumor specimen and normal control specimen. The critical
p value was defined as 0.05 and the fold change was set as 1.5.
GEPIA dataset analysis
The Gene Expression Profiling Interactive Analysis (GEPIA) web server is a valuable resource for gene expression analysis based on tumor and normal samples from the TCGA and the GTEx databases. GEPIA2 (
http://gepia2.cancer-pku.cn/) provides the analysis of RNA sequencing expression data from 198,619 isoforms and 84 cancer subtypes as well as the analysis of a specific cancer subtype, and comparison between subtypes [
13]. In our study, GEPIA was applied to verify the mRNA levels of CREC family and the prognostic value of CREC family in CRC.
UALCAN
The UALCAN Database (
http://ualcan.path.uab.edu) is an interactive portal for analyzing the RNA-seq and clinical data from TCGA [
14]. In this study, UALCAN was used to analyze the mRNA levels of CREC family and the relationship between CREC gene expression and tumor stage in Colon adenocarcinoma.
Kaplan–Meier plotter and ROC analysis
The Kaplan–Meier plotter [
15] (
https://kmplot.com/analysis/) is a web-based tool for analyzing the correlation between gene (mRNA, miRNA, protein) expression and survival. Here, Kaplan–Meier plotter was used to assess the prognostic value of CREC gene family through predicting the overall survival (OS) of CRC.
The Receiver Operating Characteristic (ROC) curve was established by “ROCR” package in R to further assess the sensitivity and specificity of the risk score for prognosis prediction. The area under ROC curve (AUC) was calculated. AUC > 0.6 was considered as a potential cancer biomarker for clinical utility. RNA-seq data and clinical data for CRC patients were obtained from the Gene Expression Omnibus (GEO) database GSE17538 in our analysis.
Collection of human CRC tissues
The CRC tissues and paired normal adjacent colorectal tissues tested in the present study were collected from eleven CRC patients in Shanxi Bethune Hospital, Taiyuan, China (Table S
1). The experiments on colorectal specimens from these patients were mainly designed to verify the reliability of bioinformatic analyses. The tissues harvested during surgeries were frozen in liquid nitrogen and stored at an ultra-low-temperature freezer for experiments of quantitative real-time PCR and western blot.
Ethics statement
This study involving human participants were reviewed and approved by the Shanxi Bethune Hospital (Approval no.: YXLL-2019–051), and conducted according to the principles expressed in the Declaration of Helsinki of 1964 and later versions. All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation. All patients have signed informed consent.
Quantitative real-time PCR
Quantitative real-time PCR (qPCR) was performed to examine the mRNA levels of CREC gene family both in CRC cell lines and CRC tissues, with normal cell line or normal colorectal tissues as controls. Briefly, total RNA was extracted using trizol reagent (Invitrogen, USA). The cDNA was synthesized with 2 μg RNA following the manufacturer’s instructions of the PrimeScript RT reagent kit (TAKARA, DRR047). The transcriptional levels of CREC family were analyzed by qPCR with SYBR Green PCR master Mix (Takara, DRR041A). Relative gene expression was normalized to the level of β-actin. Primers for β-actin was purchased from Sangon Biotech (Order NO. B661102). Primer sequences for qPCR are shown as follows:
RCN1 forward: 5'-GGATGGGTTTGTGGATCAGGATGAG-3',
RCN1 reverse: 5'-TCTTTGTCTAACTTCCCGTCCTTGTTC-3'.
RCN2 forward: 5'-CCTAATAATCAGGGCATTGCAC-3',
RCN2 reverse: 5'-CTTCAGAGAGCTTTTTGTCACC-3'.
RCN3 forward: 5'-GGGAACTTCCAGTACGACC-3',
RCN3 reverse: 5'-CTTTCCTCTGGGGTGAGTTG-3'.
SDF4 forward: 5'-GAGAGAGTAGCCAACAGGGAGGAG-3',
SDF4 reverse: 5'-CATCAAAGCCACCCAGGTCCTTG-3'.
CALU forward: 5'-TGGATTTACGAGGATGTAGAGC-3',
CALU reverse: 5'-TTTTAAACCTCCGCTCATCTCT-3'.
Western blot
Briefly, equal amounts of protein were applied to SDS–polyacrylamide gel and electroblotted onto poly-vinylidene difluoride membranes. The membrane was blocked with 5% non-fat milk in TBST for 1 h at room temperature, and then incubated with primary antibodies overnight at 4 °C. At room temperature, the membrane was incubated with peroxidase-conjugated secondary antibodies for 1 h. And proteins were detected by a super ECL Prime detection kit (SEVEN, SW134-01).
Immunohistochemistry
Tissue sections were deparaffinized in xylene and rehydrated with gradient ethanol. The tissue slides were placed in sodium citrate buffer (pH = 6.0) and bathed in water (94 − 99 °C) for 20 min to achieve antigen retrieval. Then 3% H2O2 was used to block endogenous peroxidase. Tissue sections were rinsed three times in phosphate buffer solution and then were incubated with primary antibody (CD206, 1:1000) at 4 °C for overnight. Subsequently, tissue sections were incubated with corresponding secondary antibody. The target protein was visualized with DAB chromogen. Three fields were obtained in each section. ImageJ was used to calculate the positive staining signals in each field.
HPA dataset analysis
The Human Protein Atlas (HPA) is an open access database to map all the human proteins in cells, tissues, and organs using an integration of various omics technologies, including antibody-based imaging, mass spectrometry-based proteomics, transcriptomics, and systems biology [
16]. In this study, immunohistochemical stains of CREC family were derived from HPA dataset.
TFs-target and miRNAs-target of CREC gene family
Transcription factors (TFs) are proteins capable of binding DNA in a sequence-specific manner and regulating transcription of gene [
17]. In this study, hTFtarget [
18] (
http://bioinfo.life.hust.edu.cn/hTFtarget), a comprehensive database for regulations of human TFs, was used to determine TFs regulating the expressions of CREC gene family. Besides, microRNAs (miRNAs) have been implicated in cell-fate determination and in various human diseases via inducing RNA-silencing and working as post-DNA transcription regulators [
19]. Here, Starbase [
19] (
http://starbase.sysu.edu.cn/) and Targetscan [
20] (
http://www.targetscan.org/vert_72/) were used to predict the upstream miRNAs regulating the genes expression of CREC family.
TIMER analysis
Tumor Immune Estimation Resource (TIMER) (
https://cistrome.shinyapps.io/timer/) is a comprehensive resource for systematic analysis of immune infiltration across diverse cancer types based on 32 cancer types and 10,897 samples from TCGA [
21]. TIMER was applied to determine the correlation between CREC family gene expression and the immune cell infiltration degree. Besides, TIMER was used to analyze the correlation of CREC family gene expressions.
cBioPortal analysis with TCGA
The Cancer Genome Atlas (TCGA) has genomic sequence, expression, methylation, and copy number variation data on over 11,000 individuals who represent over 30 different types of cancer [
22]. The cBioPortal for Cancer Genomics (
http://cbioportal.org) provides a web resource for exploring, visualizing, and analyzing multidimensional cancer genomics data [
23]. In this study, cBioPortal was used to analyze genetic and epigenetic alterations of CREC family from the colorectal adenocarcinoma (TCGA, Firehose Legacy) dataset with 640 samples.
Protein–protein interaction (PPI) network analysis
The Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) (
https://string-db.org/) is an online tool to analyze the interaction relationship between proteins [
24]. In this study, STRING was applied to obtain the top 50 co-expressed genes of CREC family and to construct the co-expression PPI network of CREC family.
Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis
Metascape is a web-based portal designed to provide a comprehensive gene list annotation and analysis resource for experimental biologists. In terms of design features, Metascape combines functional enrichment, interactome analysis, gene annotation, and membership search to leverage over 40 independent knowledgebases within one integrated portal [
25]. GO (
http://geneontology.org) provides structured, computable knowledge regarding the functions of genes and gene products [
26]. KEGG (
http://www.kegg.jp/ or
http://www.genome.jp/kegg/) is an encyclopedia of genes and genomes, including gene function and biological pathway information [
27]. Here, the GO functions and KEGG pathways of CREC family genes and their top co-expressed 50 genes were obtained using Metascape (
http://metascape.org).
Statistical analysis
Data from at least three separate experiments were presented as mean ± standard error (SEM) and analyzed by t-test with SPSS 17.0 software (SPSS Inc., Chicago, IL, USA). Differences at p < 0.05 were considered statistically significant.
Discussion
CREC family members have been implicated in cancer progression. Accumulating evidences suggest that RCN1 expression is significantly upregulated in various tumors, including CRC [
9,
35]. Upregulation of RCN2 facilitates cell malignant behaviors and angiogenesis in cervical cancer and hepatocellular carcinoma [
36,
37]. Overexpression of RCN3 is also associated with cancer progression [
38]. The SDF4 gene, which contains seven exons and maps to 1p36.33 in chromosomes, encodes the 362-amino acid Cab45 protein. Alternative splicing of Cab45 mRNA results in three members: Cab45 of golgiosome (Cab45-G), Cab45 cytosolic (Cab45-C) and Cab45 secreted (Cab45-S). Cab45-G exhibits an increased expression in cell lines with higher metastatic potential and promotes cell migration in multiple types of cancer cells [
39]. Cab45-S has been identified as a crucial modulator of tumor growth in cervical cancer cells [
40]. Calumenin shows an increased expression in clinical tissue samples of colon tumors and acts as a novel putative biomarker of CRC [
41,
42]. In the present study, the transcriptional levels of CREC family in CRC were systematically analyzed using Oncomine, GEPIA and UALCAN databases. Results indicated that RCN1, RCN2, RCN3 and CALU were all highly expressed in CRC compared with normal colorectal tissues. However, analysis based on GEPIA and UALCAN revealed that the transcriptional level of SDF4 was decreased in CRC, while analysis based on the Human Protein Atlas revealed that SDF4 protein expression significantly elevated in CRC. Our qPCR and western blot experimental data obtained from CRC tissues further showed that the expressions of CREC gene family were all significantly increased in CRC tissues compared with adjacent tissues. Hence, we speculated that the expression of SDF4 in CRC possessed tumor heterogeneity. An alternative potential explanation for differential SDF4 expression may be due to the difference of two subtypes (non-mucinous and mucinous type) of CRC. Besides, different expressions of SDF4 might also correlate with alternative splice variants. Given that the samples in the database are diverse, we will expand the sample size and include non-mucinous and mucinous type of CRC samples to investigate the above possibilities in our future study.
Genes of CREC family are also closely related to cell migration and cancer prognosis. RCN1 expression correlates with lymph node metastasis, migration and invasion of cancer cells [
43]. Besides, RCN1 is highly expressed in invasive breast cancer cell and colorectal cancer cell, suggesting that RCN1 is implicated in tumor cell invasiveness. We further found that silencing of RCN1 was able to suppress CRC cell migration (Fig.
8). Moreover, overexpression of RCN1 correlates with poor prognosis of non-small cell lung cancer and glioblastoma [
44‐
46]. Increased RCN2 level plays a vital role in hepatocellular carcinoma (HCC) proliferation, invasion and migration and predicted poor prognosis in HCC patients [
47]. RCN2 also enhances the proliferation and invasion of colorectal cancer cells [
48]. Cox's proportional hazards analysis showed that high RCN2 expression was an independent prognostic marker of poor outcome in colorectal cancer. Knockdown of RCN2 inhibited colorectal cancer cell proliferation both in vitro and in vivo [
49]. RCN3 is considered a fibroblast-specific biomarker of poorer prognosis of CRC [
50]. Upregulation of Cab45-S favors tumor growth and seems correlated with the cervical carcinoma grade [
40]. High expression levels of Cab45 are correlated with cancer progression and metastasis [
51] Silencing of Cab45-G remarkably inhibited cancer cell migration [
39]. Increasing evidence indicate that overexpression of CALU promotes cancer cell growth, migration, invasion and metastasis [
52,
53]. Knockdown of calumenin suppressed invasiveness of lung cancer cells [
53]. Here, we reported that genes of CREC family were significantly related to the tumor stage and prognosis of CRC. Furthermore, combination of CREC family genes performed as a better prognostic marker for CRC. Thus, our findings suggested that CREC family were key factors in CRC progression and acted as candidate biomarkers for CRC prognosis. In addition, the functional networks of the top 50 co-expressed genes of CREC family in CRC mainly enriched in cell migration, also suggesting that CREC family played an important role in tumor metastasis. In the next study, we will further elucidate the specific mechanism of each member in regulating CRC metastasis and progression, thus providing new targets for CRC therapy.
Tumor microenvironment is mainly composed of cancer-associated fibroblasts (CAFs), immunosuppressive immune cells (regulatory T cells, M2 macrophages, myeloid-derived suppressor cells), extracellular matrix, a variety of growth factors, and inflammatory factors [
54]. The infiltration degree of different immune cells is highly correlated with tumor survival and progression [
55]. TAMs represent one of the main tumor-infiltrating immune cell types, mostly with the phenotype of M2 macrophages. TAMs promote tumor metastasis and are closely related to poor prognosis [
56]. Here, we found that CD206 (a marker of M2 macrophage) in CRC tissues was highly expressed, indicating increased M2 macrophage infiltration in CRC. Therefore, we speculate that RCN1, RCN2, RCN3, and CALU are associated with poor prognosis for CRC by regulating macrophage infiltration. And this hypothesis will be further verified in future studies.
Conclusion
We systematically analyzed the expression, prognostic value, and molecular biological functions of CREC family in CRC. Results indicate that the expressions of RCN1, RCN2, RCN3, and CALU are significantly higher in CRC tissues than in normal adjacent tissues, whereas the expression of SDF4 is controversial. The expression of CREC family is significantly related to CRC progression. Combination of CREC family genes is a potential prognostic marker for CRC. Furthermore, CREC family may play an important role in CRC oncogenesis and invasion. Our findings suggest that genes of CREC family might be potential therapeutic targets for CRC.
Acknowledgements
We would like to thank the owners of TCGA, Oncomine, GEPIA, UALCAN, Kaplan-Meier plotter, HPA, hTFtarget, Starbase, Targetscan, TIMER, cBioPortal, STRING and Metascape for data sharing.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.