Background
Many experimental and theoretical studies indicate that most solid tumors are associated with immune infiltrate, as early as 15 years ago, immune response within colorectal cancers are associated with early metastatic invasion and survival were introduced by Franck Pagès et al. [
1]. In some digestive system neoplasms, immune cells may inhibit tumor progression, T cell infiltration is closely related to the patient prognosis of colorectal cancer, and types of lymphocytic infiltration, density, and intratumoral location may better predict prognosis than TNM or Duke’s classification [
1,
2]. With the deepening of research on immune-related mechanisms, immunotherapy and application of immune-checkpoint inhibitors make it possible to effective treatment or even cure several malignancies previously untreated [
3,
4]. However, the role and type of tumor-infiltrating immune cells in the prognosis of gastric cancer is unknown, identification of immune cells associated with tumor prognosis and new immune-related therapeutic targets in gastric cancer is the urgent need to solve practical problems.
Tumors are composed of many types of cells, the main part of which is a large number of malignant cells. Tumor-infiltrating immune cells are also one of the types that play an important role [
5,
6], for instance, T cells are one step in the elimination of cancer, they can specifically recognize and kill tumor cells and manage the delicate balance between the recognition of nonself and the prevention of autoimmunity [
7]. Quantification of infiltrating immune cells in tumors may untie the role of immune cells in tumor progression and provide a new direction for immunotherapy. Heretofore, immune infiltration has been primarily studied by immunohistochemistry, immunofluorescence and flow cytometry. But with the widely used of next-generation sequencing (NGS) technologies, tumor RNA-Seq data can be obtained from the database, such as the cancer genome atlas (TCGA) and Gene Expression Omnibus (GEO). Based on a set of immune-specific marker genes, MCP-counter, single-sample Gene set enrichment analysis (ssGSEA), CIBERSORT and other computational approaches can be used to quantify tumor-infiltrating immune cells from RNA sequencing data [
8‐
10]. Therefore, we attempted to quantify tumor-infiltrating immune cells across human healthy tissues and tumors based on the ssGSEA method and identify genes associated with prognosis-related immune cells.
Materials and methods
Data collection
We download gene expression data, somatic mutation data and clinical data of stomach adenocarcinoma (STAD) from the cancer genome atlas (TCGA) database by TCGAbiolinks and maftools packages in R (3.5.1, [
11,
12]). In order to verify the results of the study in the TCGA data, gene expression profile and clinical data in GSE84437 were downloaded from the Gene Expression Omnibus (GEO) database. In the TCGA dataset, samples with death reason of other malignancy, other and non-malignant disease, sample type is not “Primary Tumor”, and samples with incomplete overall survival information were excluded, total 360 samples were included finally. The GEO dataset included 433 gastric cancer tissues. For further investigate the underlying mechanisms in digestive system tumors, gene transcripts per million (TPM) data of Pan-cancer in TCGA and normal tissues in genotype tissue expression (GTEx) database were downloaded from the UCSC Xena database, which processed by the TOIL process, free of computational batch effects. All analyses and plots are done by R (3.6.0).
Data preprocessing and quantification of immune cells
Gastric cancer patients died of non-malignant disease and other malignancies were excluded, and samples with complete survival data were included. According to Gabriela Bindea et al., we obtained the marker genes of 24 immune cells, including aDC, B cells, CD8 T cells, Cytotoxic cells, DC, Eosinophils, iDC, Macrophages, Mast cells, Neutrophils, NK CD56 bright cells, NK CD56 dim cells, NK cells, pDC, T cells, T helper cells, Tcm, Tem, TFH, Tgd, Th1 cells, Th17 cells, Th2 cells and TReg [
10] . Then, based on the gene expression data and marker genes, infiltrating immune cells were quantified by ssGSEA.
Survival analysis
The association between immune cells and overall survival was carried out using univariate Cox regression, immune cells with statistically significant(
P < 0.05) in both groups be considered as effects of prognosis. For further evaluate the impact of the immune cells with statistically significant, patients were divided into 2 groups according to the method of best separation in “survminer” R package, then, overall survival were analyzed by “survival” R package. Kaplan Meier-plotter (KM plotter,
http://kmplot.com/analysis/) could assess the effect of hub genes on survival [
13] . The hazard ratio (HR) with 95% confidence intervals and log rank
P value were calculated and displayed on the plot.
Hub genes identification and validation
Hub genes were several genes that are related to immune cells. The methods of Pearson correlation coefficient and Spearman’s rank correlation coefficient were used for calculation of the correlation between gene expression and immune cells, genes with P < 0.01 and correlation> 0.3 were included in the follow-up study. To further study genes associated with immune cells. Genes in the intersection of all groups (genes associated with Th2cells, T helper cells, and mast cells in the TCGA and the GEO groups) were selected as hub genes. The method of survival analysis of hub genes is the same as the previous step.
Assessment of tumor mutational burden
Data of tumor mutational burden were downloaded by “TCGAbiolinks” R package, “Maftools” R package was used to read the maf files and count the number of variants in each sample. We tried to analyze whether there are differences in tumor mutational burden (TMB) between the high and low expression of hub genes and prognostic immune cells. 322 samples with complete survival information, gene expression data and TMB were included. According to the method of best separation in “survminer” R package, patients were divided into groups of high and low, and the Wilcoxon test was used to identify differences of tumor mutational burden(p < 0.05).
Differences in tumor and normal tissues
Gastric cancer is one of digestive system tumor, we further compare the differences of immune cells and hub gene expression between digestive tumors and normal tissue. Gene transcripts per million (TPM) of digestive system normal and tumors tissues were downloaded from UCSC Xena (
https://xenabrowser.net/datapages/), Normal tissue data is from Genotype tissue expression (GTEx) database, tumor tissue data is from TCGA database, and infiltrating immune cells were quantified by ssGSEA.
Functional annotation of hub genes
Gene counts of TCGA-STAD were downloaded by “TCGAbiolinks” R package, patients were divided into 2 groups according to the expression of hub genes by method of best separation. Then, differentially expressed genes (DEGs) screened between the high and low group, gene set enrichment analysis (GSEA) [
14] and enrichment analysis were performed with “clusterProfiler” package in R [
15]. We use “GOSemSim” package to calculate the similarity between Gene Ontology (GO) terms and then plot it with “ggtree” package.
Discussion
The clinical success of immune checkpoint therapy recently, demonstrate the enormous potential of immunotherapy in cancer treatment. Currently, the main treatment method of patients with cancer is blocking CTLA-4 and PD-1 pathways and CAR T cell therapy. These methods were dependent on a sequence of basic science discoveries [
16], Dong H et al. found out that antibodies blocking the PD-L1/PD-1 interaction lead to tumor regression in mice [
17], transduced T cells of chronic lymphocytic leukemia patients can effectively lyse autologous tumor cells [
18]. All these discoveries are based on research on immune cells and genes. Advances in next-generation sequencing permit the rapid research progress of mutant tumor neoantigens [
8]. This article presented research linking gastric cancer to immune cells based on data of sequencing, thus deepened the understanding of the immune mechanism of gastric cancer.
Single-sample gene set enrichment analysis (ssGSEA) can be used to quantify immune infiltrating cell types based on the marker genes of immune cells [
8,
10], based on RNA-seq data of Stomach adenocarcinoma (STAD) of TCGA and gene expression profile of GSE84437, ssGSEA was used to quantify immune infiltrating cell types of stomach adenocarcinoma samples. 24 kinds of tumor-infiltrating immune cells were quantified, and 3 kinds of immune cells (T helper type 2 (Th2) cells, T helper cells, and Mast cells) were identified as prognostic immune cells.
Through the analysis, Th2 cells and T helper cells were identified as protective factors, and Mast cells as a risk factor, but immune cells may play a dual role in cancers, even one kind of immune cells has a dual role. It has reported that Th2 cells can be used to eradicate cancer [
19], and Th2 cells may promoting the immune escape of urological tumor [
20]. T helper cells influence tumor antigen-specific ca cytotoxic T cell (CTL) response by producing many factors and further induce antitumor immunity [
21]. Mast cells have the ability to facilitate tumor proliferation and invasion directly,and indirectly promote tumor proliferation and invasion by regulating tumor microenvironment [
22], it may provide further evidence for Mast cells can be applied in the adjuvant treatment of mammary adenocarcinoma and melanoma [
23]. Previous studies have shown that Th2 cells, T helper cells, and Mast cells may play key roles in the development and invasion of cancer [
24‐
27], our studies show that these immune cells may play a role in gastric cancer.
However, the concrete mechanism is still unknown, further analysis was performed and two related hub genes (SUPV3L1 and SLC22A17) in three immune cells types of TCGA and GEO groups were regarded as hub genes for further validation, indicating that the two hub genes had a high connection with infiltration as well as prognosis. It has been reported that overexpression of SLC22A17 associated with poor prognosis of cancer, such as endometrial carcinoma, gliomas and hepatocellular [
28‐
30], and Lipocalin-2 (LCN2) has the potential to alter immune cell infiltration and the tumor microenvironment in pancreatic ductal adenocarcinoma by downregulation of LCN2-specific receptor SLC22A17 [
31]. These all indicate that SLC22A17 may influence prognosis through influencing immune cell infiltration and provided further evidence that SLC22A17 may play the same role in gastric cancer. But, research about SUPV3L1 on tumors is limited and further study is needed.
Below, we illustrated the differences between hub genes and prognostic immune cells in non-tumor tissues and tumor tissues within the context of specific gastrointestinal tumors. We can see the different infiltration of 3 kinds immune cells in normal and tumor tissues, Mast cells is less in tumor tissue, and Th2 cells is more in tumor tissue, it further suggested that immune cell infiltration may related to gastrointestinal tumors.
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit
http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (
http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.