Introduction

Primary central nervous system (CNS) lymphomas (PCNSLs) are extranodal non-Hodgkin’s lymphomas (NHLs), mostly diffuse large B-cell lymphomas (DLBCLs), that are localized to the brain, eye, meninges, and spinal cord and are distinct from systemic lymphomas1,2. PCNSLs account for approximately 3% of primary CNS tumors and approximately 1% of NHLs in adults3. Most PCNSLs are immune-privileged site-associated DLBCLs, according to the World Health Organization (WHO) diagnostic criteria1,2. Despite intensive treatments, including high-dose methotrexate (HD-MTX)-based polychemotherapy and deferred whole brain radiotherapy, PCNSLs have a poorer prognosis than extracerebral DLBCLs, with a median overall survival (OS) of approximately 4 years4.

Cancer immunotherapy has advanced by targeting antigens on cell surfaces, such as immune checkpoint molecules, which repress killer T cells and pro-inflammatory lymphocytes5. Checkpoint inhibitors are monoclonal antibodies that block inhibitory checkpoint antigens, which otherwise repress the stimulation of T cells, and thereby exert anticancer activity6. Monoclonal antibodies have been directed against programmed death 1 (PD-1), also known as cluster of differentiation (CD) 279, and cytotoxic T-lymphocyte-associated protein 4 (CTLA-4; also known as CD152), which suppress T-cell receptor (TCR) responses, in NHLs7,8,9,10,11. In particular, PD-1 blockade with nivolumab is effective in relapsed and/or refractory PCNSLs12,13. Recent studies have shown that signal transducer and activator of transcription 3 (STAT3) inhibitors abrogate the expression of PD-1 ligand 1 (PD-L1; also known as CD274) and PD-1 ligand 2 (PD-L2; also known as PDCD1LG2 or CD273) on a lymphoma cell line, HKBML, in addition to an adult T-cell leukemia-lymphoma cell line, ATL-T, and a splenic lymphoma with villous lymphocyte cell line, SLVL14. Stimulus-dependent expression of PD-L1 and indoleamine 2,3-dioxygenase 1 (IDO-1) induced by macrophage interaction causes immune evasion of the PCNSL-derived cell lines HKBML and TK15. In addition, a clinicopathological study of 64 PCNSL patients showed that the PD-L1 protein is detected in tumor microenvironments rather than in tumor cells and is correlated with the expression of interferon-gamma (IFN-γ) and CD4 and with OS16.

Despite various studies and the aforementioned molecular evidence, there are only a few diagnostic and/or prognostic marker candidates in PCNSL. Recently, clinical next-generation sequencing (NGS) has enabled ultra-high-throughput screening of whole-genome expression, copy number variation (CNV), single nucleotide variants (SNVs) across complete exons, and gene fusions for onco-driver mutations17,18,19,20,21,22. In this study, we conducted high-throughput RNA-sequencing using NGS on tumor tissues from 31 patients with PCNSL and performed multivariable analysis of their expression and its correlation with prognoses. We focused on the balance of Th-1 and Th-2 helper T-cell differentiation and the expression of immune checkpoint genes to investigate diagnostic and/or prognostic marker candidates and immune checkpoint blockade pathways against CNS lymphomas. We analyzed 84 selected transcript variants derived from 62 genes. Multivariable analysis of the expression data yielded formulas for prognostic prediction and revealed a correlation between the calculated scores of T-cell differentiation status and the expression of checkpoint genes, which was associated with the prognoses of PCNSL patients.

Results

Patient characteristics

This study was performed on specimens from 31 patients with PCNSL whose characteristics are described in Table 1. The median age of the patients was 67 years (range, 31–85 years). Of the 31 patients, 16 were female (51.61%) and 15 were male (48.39%). The median OS time was 765 days (range, 188–3611 days) (Suppl. Fig. S1A), and the survival status was “deceased” in 19 patients (61.29%) and “living” in 12 patients (38.71%). Univariable and multivariable analyses for OS were performed for gender, age, Karnofsky Performance Status (KPS), Memorial Sloan Kettering Cancer Center (MSKCC) risk score, International Extranodal Lymphoma Study Group (IELSG) risk score, and treatments including ionizing radiation (IR), polychemotherapy, and high-dose methotrexate (HD-MTX); however, none of the results was statistically significant, except for HD-MTX in univariable analysis (hazard ratio (HR) = 0.2098, 95% confidence interval (CI): 0.0571–0.989, p = 0.0486) (Table 1, Suppl. Fig. S1B–G).

Table 1 Characteristics of the PCNSL patients examined in this study.

Expression patterns of the transcript variants of the genes of interest in PCNSL

First, to examine the expression of transcript variants of the genes of interest in the 31 PCNSL specimens, we performed NGS on the Illumina HiSeq2000/2500 as high-throughput comprehensive RNA-sequencing for whole transcript variant detection. Recently, cancer immunotherapies have been dramatically improved by advanced profiling of immune cells and immune checkpoint molecules5,6,7,8,9,10,11. Therefore, in this study, we focused on cancer immunotherapy-related genes, especially immune checkpoint genes and genes related to Th-1 and Th-2 differentiation. The expression values of 84 transcript variants derived from a total of 62 selected genes were used in the subsequent multivariable analysis for diagnostic and/or prognostic marker prediction in PCNSL (Suppl. Table S1, Suppl. Fig. S2). Expression data are summarized in a heat map with hierarchical clustering for Th-1 and Th-2 differentiation and for stimulatory and inhibitory immune checkpoints (Fig. 1a). In particular, the representative transcript variants with high interquartile ranges (IQRs) in each category were: (i) STAT1-001/003/011, CD4-001, and TNFRSF1B-001 for Th-1 differentiation, (ii) CD4-001, STAT6-001, and IL2RB-001 for Th-2 differentiation, (iii) CD27-001, CD70-001, IL2RB-001, and CD40-001 for stimulatory checkpoint, and (iv) HAVCR2-001, ADORA2A-001, PDCD1LG2-001, CD274-001, PDCD1-001, BTLA-001/002, LAG3-001, and CTLA4-001 for inhibitory checkpoint (Fig. 1b). These data indicate that specific transcript variants are highly expressed in PCNSL, but that not all variants of a gene are expressed.

Figure 1

Expression patterns of transcript variants of the genes related to T helper cells type 1/2 (Th-1/Th-2) and immune checkpoints in primary central nervous system lymphoma (PCNSL). (a) Hierarchical clustering of relative expression among samples and (b) interquartile range (IQR) of the transcript variants of genes related to Th-1, Th-2, stimulatory checkpoint, and inhibitory checkpoint in 31 PCNSL patients. High and low expression is indicated by red and green, respectively, in the heat map.

Constitution of the prognosis prediction formulas in PCNSL

Second, based on the IQRs in each category, we calculated an index of the contribution of Th-1 and Th-2 differentiation and immune checkpoints to clinical outcome, measured as OS time, using a Cox regression model, random forests analysis, and principal component analysis (PCA) (Fig. 2a). In particular, STAT1-001, STAT6-001, CD40-001, CD70-001, CD274-001, and PDCD1-001, which had a high index, were also highly expressed in PCNSL (Fig. 1b). The index was normalized using the standard deviations of the variables and used to generate formulas, each being the sum of the products of the coefficients calculated from Cox regression analyses and the fragments per kilobase of exon per million mapped fragments (FPKM) values of the transcript variants, to estimate the status of patients with PCNSL, such as prognoses, as follows:

Figure 2

Survival prediction based on T helper cell type 1/2 (Th-1/Th-2) status and immune checkpoint activity in primary central nervous system lymphoma (PCNSL). (a) Index calculated from combined methods of random survival forests analysis and principal component analysis (PCA). (b) Kaplan-Meier analysis on the survival prediction formula in 31 patients with PCNSL. The patients were divided into two subgroups with high (black line) and low (red line) scores from the median score calculated based on the formula. (c) Kaplan-Meier analysis of the expression levels of representative genes for immune checkpoint. CD40 and CD70 as stimulatory checkpoint genes. Lymphocyte activation gene 3 (LAG3), programmed cell death ligand 2 (PDCD1LG2), and programmed cell death 1 (PDCD1) as inhibitory checkpoint genes. The patients were divided into two subgroups with high (black line) and low (red line) expression by the median expression of the gene. Hazard ratio (HR) with 95% confidence interval (CI) and p-value from log rank test were calculated. OS; overall survival.

Th-1 Status = 0.007 × CD4-001 + 0.012 × IFNG-001 + 0.017 × STAT1-001 + 0.012 × STAT1-002 + 0.006 × STAT1-003 + 0.003 × STAT1-010 + 0.001 × STAT1-011 + 0.007 × TNFRSF1A-001 + 0.006 × TNFRSF1A-004 + 0.004 × TNFRSF1B-001

Th-2 Status = 0.018 × CD28-001 + 0.002 × CD28-201 + 0.012 × CD28-202 + 0.011 × CD4-001 + 0.015 × IL18R1-001 + 0.001 × IL18R1-003 + 0.004 × IL18R1-201 + 0.001 × IL18R1-202 + 0.003 × IL2RB-001 + 0.01 × IL10-003 + 0.025 × IL6-001 + 0.002 × IL6-003 + 0.029 × IL6-004 + 0.015 × IL6-005 + 0.005 × IL6-006 + 0.017 × IL6-201 + 0.047 × STAT6-001 + 0.029 × TGFB3-001

Stimulatory checkpoint = 0.003 × IL2RB-001 + 0.013 × TNFRSF18-001 + 0.015 × TNFRSF18-002 + 0.005 × TNFRSF18-003 + 0.003 × CD27-001 + 0.013 × CD40-001 + 0.011 × CD40-002 + 0.015 × TNFRSF4-001 + 0.022 × CD70-001

Inhibitory checkpoint = 0.004 × ADORA2A-001 + 0.004 × BTLA-001 + 0.002 × BTLA-002 + 0.007 × PDCD1LG2-001 + 0.01 × PDCD1LG2-201 + 0.025 × CD274-001 + 0.023 × CD274-201 + 0.003 × HAVCR2-001 + 0.009 × IDO1-002 + 0.012 × LAG3-001 + 0.042 × PDCD1-001 + 0.034 × PDCD1-002 + 0.039 × PDCD1-003
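As an illustration only, the evaluation of such a formula and the median split used in the subsequent survival analyses can be sketched in Python. The coefficients are copied from the Th-1 Status formula above; the patient identifiers, FPKM values, and function names are hypothetical, and the handling of missing variants (treated as 0) and of scores tied at the median (assigned to "low") are assumptions that the text does not specify.

```python
from statistics import median

# Coefficients copied from the Th-1 Status formula in the text.
TH1_COEFFICIENTS = {
    "CD4-001": 0.007, "IFNG-001": 0.012, "STAT1-001": 0.017,
    "STAT1-002": 0.012, "STAT1-003": 0.006, "STAT1-010": 0.003,
    "STAT1-011": 0.001, "TNFRSF1A-001": 0.007, "TNFRSF1A-004": 0.006,
    "TNFRSF1B-001": 0.004,
}


def status_score(fpkm, coefficients):
    """Sum of coefficient x FPKM over the transcript variants of a formula.

    Variants absent from `fpkm` are treated as 0 (an assumption).
    """
    return sum(c * fpkm.get(name, 0.0) for name, c in coefficients.items())


def median_split(scores):
    """Label each patient 'high' or 'low' relative to the cohort median.

    Scores equal to the median are labeled 'low' (an assumption).
    """
    cut = median(scores.values())
    return {pid: ("high" if s > cut else "low") for pid, s in scores.items()}


# Hypothetical FPKM values for three made-up patients (illustration only).
patients = {
    "P1": {"CD4-001": 100.0, "STAT1-001": 200.0},
    "P2": {"CD4-001": 10.0, "STAT1-001": 20.0},
    "P3": {"CD4-001": 50.0, "STAT1-001": 100.0},
}
scores = {pid: status_score(f, TH1_COEFFICIENTS) for pid, f in patients.items()}
groups = median_split(scores)
```

The same two functions apply unchanged to the Th-2, stimulatory checkpoint, and inhibitory checkpoint formulas by swapping in the corresponding coefficient table.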

Candidates for the prognosis markers in PCNSL

Dividing the patients into subgroups at the median score calculated by each formula separated the Kaplan-Meier curves (Fig. 2b). In particular, the subgroups with Th-1low (hazard ratio (HR) = 3.8, 95% confidence interval (CI): 1.5–9.9, p = 0.0033) and stimulatory checkpointhigh (HR = 3.4, 95%CI: 1.2–10.0, p = 0.014) were significantly associated with poor prognoses (Fig. 2b). Th-2high (HR = 2.4, 95%CI: 0.8–6.6, p = 0.072) and inhibitory checkpointhigh (HR = 2.1, 95%CI: 0.8–5.6, p = 0.1) also tended toward poor prognoses, although without statistical significance (Fig. 2b). As representative marker candidates among stimulatory checkpoint genes, CD40-001high (HR = 2.5, 95%CI: 1.0–6.2, p = 0.043) and CD70-001high (HR = 2.5, 95%CI: 1.0–6.2, p = 0.04) were associated with poor prognoses (Fig. 2c). Among inhibitory checkpoint genes, LAG3-001high (HR = 2.8, 95%CI: 1.1–7.1, p = 0.019), PDCD1LG2-201low (HR = 2.9, 95%CI: 1.2–7.1, p = 0.018), PDCD1-001high (HR = 3.3, 95%CI: 1.2–9.1, p = 0.012), PDCD1-002high (HR = 9.3, 95%CI: 2.4–35.7, p = 8.4E-05), and PDCD1-003high (HR = 2.6, 95%CI: 1.1–6.7, p = 0.032) were associated with poor prognoses (Fig. 2c). In addition, among stimulatory checkpoint genes, the subgroups with IL2RB-001high, TNFRSF18-001high, TNFRSF18-002high, CD27-001high, CD40-002high, TNFRSF4-001high (Suppl. Fig. S3A), and TNFRSF18-003low (Suppl. Fig. S3B) tended toward poor prognoses, but the differences were not statistically significant. Similarly, among inhibitory checkpoint genes, ADORA2A-001high, PDCD1LG2-001high, HAVCR2-001high (Suppl. Fig. S4A), BTLA-001/002low, CD274-001/201low, and IDO1-002low (Suppl. Fig. S4B) tended toward poor prognoses, but the differences were not statistically significant. Thus, specific transcript variants of immune checkpoint genes, such as CD40-001, CD70-001, LAG3-001, PDCD1LG2-201, and PDCD1-001/002/003, are promising candidate prognostic factors for predicting the OS of PCNSL patients.

Assessment of the balance of Th-1 and Th-2 differentiation in PCNSL

We next sought factors that would better separate the Kaplan-Meier curves and/or enlarge the HR values in the survival analysis of PCNSL. We focused on the balance between Th-1 and Th-2 differentiation. As shown in Fig. 3a, the calculated Th-1 scores were distributed over a wide range, whereas the calculated Th-2 scores were clustered in a narrow range. Four subgroups, Th-1highTh-2high, Th-1highTh-2low, Th-1lowTh-2high, and Th-1lowTh-2low, were generated. Compared with Th-1highTh-2low, the other three subgroups were associated with poorer prognoses in the Kaplan-Meier curves (Fig. 3b,c). Th-1lowTh-2low was associated with the worst prognosis among the four subgroups, although the difference was not statistically significant (HR = 2.4, 95% CI: 0.6–9.2, p = 0.21) (Fig. 3b,c). These results suggest that Th-1 activity and Th-2 inactivity may contribute to prolonged OS in PCNSL patients.

Figure 3

Balance of T helper cell type 1/2 (Th-1/Th-2) predicts prognoses in primary central nervous system lymphoma (PCNSL). (a) The balance of Th-1 and Th-2 status in a scatter plot. (b) Kaplan-Meier analysis of the survival prediction formula in PCNSL patients. The patients were divided into four subgroups with Th-1highTh-2high (black), Th-1highTh-2low (red), Th-1lowTh-2high (green), and Th-1lowTh-2low (blue) by the median score calculated from the formula. OS; overall survival. (c) Comparison of survival risk in PCNSL according to the Th-1/Th-2 balance. Hazard ratio (HR) with 95% confidence interval (CI) compared with the Th-1highTh-2low subgroup.

Overlay of the transcript variant expression on the Th-1/Th-2 balance in PCNSL

The expression patterns of transcript variants of the genes of interest were examined to investigate their relationship with the Th-1/Th-2 balance. For stimulatory checkpoint genes, lower and higher expression was overlaid on the Th-1highTh-2low and Th-1lowTh-2high balances, respectively (Fig. 4a,c), whereas for inhibitory checkpoint genes, higher and lower expression was detected on the Th-1highTh-2low and Th-1lowTh-2high balances, respectively (Fig. 4b,c), indicating that stimulatory and inhibitory checkpoint genes show reciprocal patterns on the Th-1/Th-2 balance. In addition, the differences among the Th-1/Th-2 subgroups were significant for stimulatory checkpoint genes (p = 0.014) but not for inhibitory checkpoint genes (p = 0.381), as were the differences in Th-1 (p = 0.003) and Th-2 scores (p < 0.001) (Fig. 4c). Additionally, among the stimulatory checkpoint genes, lower and higher expression of CD70-001 was found on Th-1highTh-2low and Th-1lowTh-2high, respectively (Fig. 4d,e). CD27-001 and CD40-001/002 showed similar results without statistical significance (Suppl. Tables S2, S3, and Suppl. Fig. S5). Conversely, among the inhibitory checkpoint genes, higher and lower expression of PDCD1LG2-001/201, PDCD1-002, and HAVCR2-001 was found on Th-1highTh-2low and Th-1lowTh-2high, respectively (Fig. 4d,e, and Suppl. Table S2). BTLA-001/002, CD274-001/201, IDO1-002, LAG3-001, and PDCD1-001/003 showed similar results without statistical significance (Suppl. Tables S2, S4, and Suppl. Fig. S6). These results suggest that lower expression of stimulatory checkpoint genes is correlated with the Th-1highTh-2low balance, whereas higher expression of inhibitory checkpoint genes is correlated with the Th-1lowTh-2high balance. Coupled with the aforementioned results in Figs 2–4, these data suggest that higher expression of inhibitory checkpoint genes on the Th-1lowTh-2high balance is correlated with a poorer prognosis in PCNSL.

Figure 4

Comparative expression analysis of immune checkpoint-related genes on the balance of T helper cell type 1/2 (Th-1/Th-2) status in primary central nervous system lymphoma (PCNSL). (a,b) Relative expression patterns of immune checkpoint-related genes based on the balance of Th-1/Th-2 status. (a) Stimulatory checkpoint. (b) Inhibitory checkpoint. (c) Statistics for the four subgroups defined as Th-1highTh-2high, Th-1highTh-2low, Th-1lowTh-2high, and Th-1lowTh-2low. The p-value indicates one-way analysis of variance (ANOVA). (d) Statistics for the differential expression of the genes on the four groups. The p-value indicates one-way ANOVA. (e) The box-whisker plots of the expression of stimulatory and inhibitory immune checkpoint genes. The PCNSL patients were divided into four subgroups including Th-1highTh-2high, Th-1highTh-2low, Th-1lowTh-2high, and Th-1lowTh-2low.

Inhibitory checkpoint genes as central factors for prognosis prediction in PCNSL

Considering the aforementioned results, we next focused on the inhibitory checkpoint genes. Random survival forests analysis and PCA showed that PDCD1-001/002/003, KIR3DL1-002, PDCD1LG2-201, LAG3-001, and CD274-001 had particularly high variable importance for the OS of PCNSL patients (Fig. 5a). Cox regression analysis also revealed that higher expression of PDCD1-002 (HR = 15.83, 95%CI: 3.17–79.09, p = 0.001), CD160-006 (HR = 5.79, 95%CI: 1.68–20.02, p = 0.006), CD160-007 (HR = 72.39, 95%CI: 2.89–1811.75, p = 0.009), CEACAM1-202 (HR = 3.57, 95%CI: 1.85–6.89, p < 0.001), and LGALS9-005 (HR = 43.97, 95%CI: 6.21–311.43, p < 0.001) was correlated with higher hazard ratios for OS, whereas CD274-001 (HR = 0.94, 95%CI: 0.85–1.03, p = 0.176) was not significantly associated with OS (Fig. 5b).

Figure 5

Random forests survival analysis and Cox regression analysis to predict prognoses with the expression of inhibitory checkpoint genes in primary central nervous system lymphoma (PCNSL). (a) Variable importance derived from a random forests survival analysis. (b) Cox regression analysis for representative genes including programmed cell death 1 (PDCD1), CD274 (PD-L1), CD160, LGALS9, and CEACAM1. Akaike information criterion (AIC)-based optimization was performed, and hazard ratios with 95% confidence intervals (CIs) are shown. (c) Kaplan-Meier analysis of the expression levels of representative inhibitory checkpoint genes. The patients were divided into two subgroups with high (black line) and low (red line) expression by the cutoff score of the expression of the transcript variants, including programmed cell death 1 (PDCD1), programmed cell death ligand 2 (PDCD1LG2) (=PD-L2), CD80, LAG3, LGALS9, and CEACAM1. Hazard ratio (HR) with 95% confidence interval (CI) and p-value from the log rank test were calculated. OS; overall survival.

The Kaplan-Meier survival analysis also showed that cut-off values for the expression of PDCD1-001 (cut off = 6.87), PDCD1-002 (cut off = 0.37), PDCD1-003 (cut off = 0.29), PDCD1LG2-201 (cut off = 0.42), and LAG3-001 (cut off = 5.01) reproduced the Kaplan-Meier results obtained with their median expression (Figs 2c and 5c). In addition, higher expression of CD80-001 (cut off = 12.72), CEACAM1-011 (cut off = 0.3), CEACAM1-202 (cut off = 0.87), LGALS9-001 (cut off = 0.09), LGALS9-002 (cut off = 0.26), and LGALS9-005 (cut off = 0.48) was associated with poor prognoses (Fig. 5c). Higher expression of CD160-006/007/202, CD276-002/201, CD86-002, CD96-001, CEACAM1-003/004/005, CTLA4-001/005, HAVCR2-201, LGALS3-001, PDCD1LG2-001, PVR-002/003/006, TIGIT-201, TMIGD2-001, TNFRSF14-001/009, KIR3DL1-201, and VTCN1-201 by each cut-off value was also correlated with poorer prognoses, but without statistical significance in PCNSL (Suppl. Fig. S7). Conversely, lower expression of LGALS9-002 (cut off = 1.06) was associated with poorer prognoses (Fig. 5c). In addition, lower expression of BTLA-001/002, C10orf54-001, CD160-201, CD274-001/201, CD80-202, CD86-001/201, CD96-002, CEACAM1-001/002, IDO1-002, LGALS9-201, PVR-004, KIR3DL1-003, and VTCN1-001/002 by each cut-off value was associated with poor prognoses, but without statistical significance in PCNSL (Suppl. Fig. S8). These results suggest that specific transcript variants derived from inhibitory checkpoint genes, especially PDCD1-001/002/003, PDCD1LG2-201, and LAG3-001, are candidate central factors for prognosis prediction in PCNSL. In other words, these transcript variants may be promising prognostic marker candidates in PCNSL.

Correlation analysis among Th-1/Th-2 differentiation and immune checkpoint genes in PCNSL

To validate the correlation between the expression of checkpoint genes and the Th-1 and Th-2 status, we carried out an additional correlation analysis among Th-1/Th-2 differentiation and the expression of stimulatory and inhibitory immune checkpoint genes. The analysis of correlations between multiple pairs of variables returned Pearson’s correlation coefficient values (r), with statistical significance assessed by additional nonparametric analyses, which are summarized in a matrix (Suppl. Fig. S9A). Variable 1, including HAVCR2-001, LAG3-001, PDCD1LG2-001, ICOS-001, IDO1-002, and CTLA4-001, was correlated with variable 2, including CD28-001, STAT4-002, IFNG-001, CD4-001, CD28-202, and TBX21-001, with relatively high correlation coefficient values (r > 0.47, p < 0.05) (Suppl. Fig. S9A). These results suggest that a complex correlation network is formed between variable 1, mainly composed of inhibitory checkpoint genes, and variable 2, principally composed of genes reflecting Th-1 differentiation status (Suppl. Fig. S9A). Nonparametric analyses with Spearman’s rank correlation, Kendall’s rank correlation, and Hoeffding’s independence test also indicated that variable 1, including BTLA-002, CD274-001, HAVCR2-001, ICOS-001, LAG3-001, PDCD1LG2-001/201, STAT4-002, TNFRSF18-001, and TNFRSF1A-001, was correlated with variable 2, including BTLA-001, PDCD1LG2-001, TNFRSF1A-001, CD28-001, TBX21-001, CD4-001, STAT1-001, and IFNG-001, with relatively high correlations (r > 0.3, p < 0.05) (Suppl. Fig. S9B). In addition, for a subset of the genes analyzed, a schematic representation of their correlations obtained with the graphical lasso showed that HAVCR2-001 and PDCD1LG2-001, both inhibitory checkpoint genes, were pivotal nodes in the Th-1/Th-2 network, followed by TNFRSF1A-001, IFNG-001, STAT1-001, and CD4-001 (Fig. 6a). The CD28-202-to-LAG3-003 interaction connected the Th-1/Th-2 gene network and the immune checkpoint gene network, suggesting an important link between the two (Fig. 6a). Furthermore, when the inhibitory checkpoint network was extracted, PDCD1LG2-201, CD274-001, and VTCN1-001/002 appeared to be hubs of the complex inhibitory checkpoint gene network (Fig. 6b). These correlation analyses suggest that controlling the expression of hub genes with several nodes, including HAVCR2-001, PDCD1LG2-001/201, CD274-001, and VTCN1-001/002, could reshape the complex network composed of Th-1/Th-2 status and immune checkpoint genes and their balances.

Figure 6

Schematic representation of the correlation between the gene expression in the T helper cell type 1/2 (Th-1/Th-2) status and immune checkpoint in primary central nervous system lymphoma (PCNSL). (a) Correlation among Th-1 (red) and Th-2 (green), stimulatory checkpoint (blue), and inhibitory checkpoint (purple). (b) Correlation among inhibitory checkpoint molecules. Thick and thin lines with a distance indicate strong and weak correlation between the expression levels of the two genes. The numbers with circles indicate the numbers of nodes over two.

Pathway analysis on the cancer immunotherapy-related genes in PCNSL

Finally, we performed gene set enrichment analysis (GSEA) using the dataset. In this study, 7565 known genes were detected after sequencing, of which 337 were found in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. In contrast, 3140 genes were differentially expressed in the PCNSL subgroup with poor prognoses (cutoff by median OS) compared with the subgroup with good prognoses, of which 59 were found in KEGG. Of these, two genes by average expression, CD70 and PDCD1, and 12 isoforms derived from seven genes, CD40, CD70, IL6, IL10, STAT1, STAT6, and TNFRSF14, were cancer immunotherapy-related (false discovery rate (FDR) < 0.01) (Table 2). In the GSEA with KEGG, 10 pathways in the “expression analysis of the gene” and 16 pathways in the “expression analysis of the transcript variant” included 31 genes and 139 transcript variants, respectively (p < 0.05; Table 3, and FDR < 0.01; Suppl. Table S5). In particular, the T cell receptor signaling pathway (KEGG ID: hsa04660) (Suppl. Fig. S10), cytokine-cytokine receptor interaction (hsa04060) (Suppl. Fig. S11), and cell adhesion molecules (hsa04514) (Suppl. Fig. S12) may be involved in cancer immunotherapy. However, because the complete data of all genes or transcripts are less biased in the PCNSL clinical samples, systematic approaches may return different results.

Table 2 Differential expression of genes in the poor prognosis subgroup, compared to the good prognosis subgroup.
Table 3 The target pathway candidates in PCNSL.

Discussion

Here, we performed NGS for distinct transcript variant detection and multivariable analyses to evaluate prognoses in 31 patients with PCNSL and to discover a stimulus-dependent oncopathway input from tumor microenvironments, including activated T cells and stimulatory and inhibitory immune checkpoints. In particular, we focused on the correlation between checkpoint genes and Th-1/Th-2 differentiation to estimate the OS of PCNSL patients. Lower expression of Th-1 differentiation genes and higher expression of Th-2 differentiation genes were associated with poorer prognoses. CD40-001high and CD70-001high as stimulatory checkpoint genes and LAG3-001high, PDCD1 (PD-1)-001/002/003high, and PDCD1LG2 (PD-L2)-201low as inhibitory checkpoint genes were also associated with poorer prognoses. Th-1highTh-2low was correlated with lower expression of CD70-001 and with higher expression of the inhibitory checkpoint variants PDCD1LG2-001/201 and HAVCR2 (TIM-3)-001, and Th-1lowTh-2high with the opposite pattern. For inhibitory checkpoint genes, Cox regression analysis showed higher HRs for the expression of CD160-006/007, LGALS9-005, CEACAM1-202, and PDCD1-002, but not for CD274 (PD-L1)-001. Further, expression cut-off scores for inhibitory checkpoint genes, including PDCD1-001/002/003, PDCD1LG2-201, and LAG3-001, successfully reproduced the Kaplan-Meier curves estimated by the median expression. In addition, correlation coefficient analyses indicated that inhibitory checkpoint genes, including HAVCR2-001 and PDCD1LG2-001, were pivotal in the Th-1/Th-2 differentiation network. Moreover, CD28-202 interacted with LAG3-001, a hub gene of the checkpoint gene network bridging to the Th-1/Th-2 network in PCNSL. The GSEA with KEGG also identified gene networks involving differential expression of the PDCD1 gene, T-cell receptor signaling, cytokine interaction, and cell adhesion. These results suggest that the expression of specific transcript variants of inhibitory immune checkpoint genes, overlaid on the Th-1/Th-2 balance, enables the prediction of survival distributions in PCNSL patients.

We also examined a DLBCL dataset (n = 47) deposited in The Cancer Genome Atlas (TCGA) for the Th-1/Th-2 status and the immune checkpoint molecule scores (Suppl. Figs S13 and S14). The results for DLBCL were as follows: (i) the Th-1 score was correlated with the Th-2 score (Suppl. Fig. S13E); (ii) Th-1low was correlated with lower expression of inhibitory checkpoint genes (Suppl. Fig. S13I); (iii) LGALS9low was associated with poor prognoses (Suppl. Fig. S14F); and (iv) differential expression of PDCD1 or PDCD1LG2 did not separate the survival curves in DLBCL (Suppl. Fig. S14H and I). Hence, we consider that the correlation between the Th-1/Th-2 balance and checkpoint gene expression is significant in PCNSL but not in DLBCL. Thus, this study may provide insights for the development of molecular targeted therapies and the identification of diagnostic and prognostic markers based on NGS and multivariable analysis in PCNSL.

As described above, we examined only the Th-1/Th-2 balance and checkpoint genes, including PD-1, its ligands, and other antigen molecules. However, the differentiation status of other cells, such as regulatory T cells (Treg)23,24,25,26,27,28, Th-17 cells27, CD4+27, CD8+ cells16, and macrophages within tumor microenvironments15, contributes to immune checkpoint activity via intrinsic and extrinsic factors in tumor cells or T cells14,29,30,31. Meanwhile, MTX is an antifolate that inhibits DNA synthesis32 and the expression of glucocorticoid receptors in human blood cells33. HD-MTX treatment with deferred radiotherapy is a standard protocol for PCNSL treatment; nevertheless, most cases relapse with acquired resistance4. Recent studies showed that immune checkpoint blockade with monoclonal antibodies against cell surface antigens, including CTLA-4 (ipilimumab) and PD-1 (nivolumab and/or pembrolizumab)34, and combined anti-PD-1/CTLA-4 regimens (nivolumab-ipilimumab) have been effective against melanoma35, lung cancer31, gastrointestinal tract cancer36, urologic cancer37, and liver cancer38. However, it has also been reported that tumor and T-cell intrinsic and extrinsic factors contribute to immunotherapy resistance, such as adaptive immune and acquired resistance, apart from patients who have primary resistance to checkpoint inhibitors30. Hence, it is important to prevent recurrences with resistance to chemotherapy and checkpoint inhibition in PCNSL treatments39,40.

In addition to CTLA-4 and PD-1, recent attention has shifted to alternative inhibitory receptors and their mechanisms within tumor microenvironments. LAG-3 is considered the third inhibitory receptor candidate for next-generation clinical use41, whereas TIM-3 is expressed in FoxP3+ Treg and activates Treg function, and TIM-3 blockade has therapeutic effects in a preclinical model42. TIM-3 also functions on IFN-γ-producing T cells, macrophages, and dendritic cells, where it leads to the inhibition of Th-1 responses42. Therefore, multi-targeting of LAG-3, TIM-3, PD-1, and/or CTLA-4 may serve as a next-generation cancer immunotherapy. However, these molecules are also responsible for primary or adaptive resistance to immunotherapy30. PDCD1LG2, like PD-L1, also functions in PD-1 blockade, showing a potential resistance mechanism in immune checkpoint inhibition30. Hence, these inhibitory checkpoint genes may also be difficult to assign as target molecules in a subset of PCNSLs. This study identified promising diagnostic and/or prognostic marker candidates and potential target genes as hub genes (i.e., PDCD1LG2-001/201, HAVCR2-001, CD274-001, VTCN1-001/002, CD28-202, and LAG3-001/003) connecting the Th-1/Th-2 and checkpoint gene networks in PCNSL. Nevertheless, innovative methods (e.g., cell-based reprogramming of cancer cells themselves43) should be conceived and developed as alternatives to conventional immunotherapy with checkpoint blockade.

Methods

Patients and materials

A total of 31 patients with PCNSL were enrolled. Patients were diagnosed according to the WHO classification1,2 and treated at Chiba University, Toyama Prefectural Central Hospital, Wakayama Medical University School of Medicine, and Yamaguchi University. This study was approved by the Ethics Committee of Kyoto Prefectural University of Medicine (RBMR-G-146), which covered recruitment of patients from other centers. Prior informed consent was obtained from all patients. Biopsy or resected tumor tissues were collected and immediately snap-frozen. The experiments were performed in accordance with the institutional guidelines.

NGS

Total RNA was extracted from 100 mg of tumor biopsies or resected tissues using Isogen II (Nippongene). The quality of the extracted RNA was verified with the Bioanalyzer System using RNA Pico Chips (Agilent Technologies). NGS was performed using the Illumina HiSeq2000/2500 platform with a standard 124-bp paired-end read protocol44,45.

Clustering analysis

Expression of the genes of interest in the 31 PCNSL specimens was clustered with the hierarchical method using the JMP built-in modules (SAS Institute, Inc.)22.

Kaplan-Meier survival analysis

The Kaplan-Meier analysis was performed to estimate survival distributions for subgroups with the log-rank test using the JMP built-in modules (SAS Institute Inc.)22.
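For readers who wish to reproduce this type of analysis outside JMP, the product-limit (Kaplan-Meier) estimator and the two-group log-rank test can be sketched in plain Python as follows. This is a minimal illustration of the standard textbook formulas, not the JMP implementation used in this study; the function names and the toy follow-up data are hypothetical.

```python
import math


def kaplan_meier(times, events):
    """Product-limit estimate as [(time, S(time))] at each distinct event time.

    times: follow-up times (e.g. days); events: 1 = death observed, 0 = censored.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        # Collect everyone whose follow-up ends exactly at time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            surv *= 1.0 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


def logrank_p(times_a, events_a, times_b, events_b):
    """Two-sided p-value of the two-group log-rank test (chi-square, 1 df)."""
    event_times = sorted(
        {t for t, e in zip(times_a + times_b, events_a + events_b) if e}
    )
    obs_minus_exp = 0.0
    variance = 0.0
    for t in event_times:
        n_a = sum(1 for x in times_a if x >= t)  # at risk in group A
        n_b = sum(1 for x in times_b if x >= t)  # at risk in group B
        d_a = sum(1 for x, e in zip(times_a, events_a) if x == t and e)
        d_b = sum(1 for x, e in zip(times_b, events_b) if x == t and e)
        n, d = n_a + n_b, d_a + d_b
        obs_minus_exp += d_a - d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    if variance == 0.0:
        return 1.0  # no information to separate the groups
    chi2 = obs_minus_exp ** 2 / variance
    # Survival function of a chi-square variable with 1 degree of freedom.
    return math.erfc(math.sqrt(chi2 / 2.0))


# Toy data (hypothetical): the second patient is censored at day 2.
toy_curve = kaplan_meier([1, 2, 3, 4], [1, 0, 1, 1])
p_same = logrank_p([1, 2, 3], [1, 1, 1], [1, 2, 3], [1, 1, 1])
p_apart = logrank_p([1, 2, 3, 4, 5], [1] * 5, [6, 7, 8, 9, 10], [1] * 5)
```

In this sketch, the "high" and "low" subgroups defined by a median or cut-off value would be passed to `logrank_p` as the two groups.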

Random survival forest analysis

Random survival forest analysis was used to determine the factors with variable importance distinguishing the expression of transcript variants with NGS raw data46,47. Briefly, the values of variable importance reflected the relative contribution of each variable to the prediction for OS, and they were estimated by randomly permuting the values and recalculating the predictive accuracy of the model, which were expressed as the log rank test statistics. The method was implemented by using the randomForestSRC package of the statistical software R.
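The permutation principle behind variable importance can be illustrated with a short, generic Python sketch. It is not the randomForestSRC implementation and is not specific to survival forests: the model (`predict`), the accuracy measure (`score`), and the toy data are all hypothetical stand-ins, chosen only to show that shuffling a variable the model depends on lowers accuracy, whereas shuffling an unused variable does not.

```python
import random


def permutation_importance(predict, score, rows, outcomes, column,
                           n_repeats=20, seed=0):
    """Mean drop in `score` after randomly permuting one column of `rows`.

    predict: callable mapping one row (list of floats) to a prediction.
    score:   callable (outcomes, predictions) -> float, higher is better.
    """
    rng = random.Random(seed)
    baseline = score(outcomes, [predict(r) for r in rows])
    drops = []
    for _ in range(n_repeats):
        shuffled = [r[column] for r in rows]
        rng.shuffle(shuffled)
        # Rebuild every row with the permuted value in the chosen column.
        permuted = [r[:column] + [v] + r[column + 1:]
                    for r, v in zip(rows, shuffled)]
        drops.append(baseline - score(outcomes, [predict(r) for r in permuted]))
    return sum(drops) / n_repeats


def toy_predict(row):
    """Hypothetical model that uses only the first variable."""
    return row[0]


def neg_mse(outcomes, predictions):
    """Negative mean squared error, so that higher means more accurate."""
    return -sum((a - b) ** 2 for a, b in zip(outcomes, predictions)) / len(outcomes)


# Toy data (hypothetical): the outcome equals the first variable exactly.
rows = [[float(i), float((i * 7) % 5)] for i in range(8)]
outcomes = [r[0] for r in rows]
imp_used = permutation_importance(toy_predict, neg_mse, rows, outcomes, column=0)
imp_unused = permutation_importance(toy_predict, neg_mse, rows, outcomes, column=1)
```

In a survival setting, the accuracy measure would be replaced by one suited to censored data, as in the log rank test statistics described above.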

Cox proportional hazards analysis

The association between the expression of the genes of interest and OS was evaluated by multivariable analyses, with clinical characteristics as additional predictors, using the Cox proportional hazards regression model in the JMP built-in modules (SAS Institute Inc.)46.

Multivariable correlation coefficient analysis

Correlations among variables were analyzed by the graphical lasso using the glasso package in R48,49. Correlations between pairs of variables were analyzed using the JMP built-in modules (SAS Institute, Inc.)15. Briefly, the correlation and multivariable analyses of the multidimensional behavior of variables returned Pearson’s correlation coefficient values (r), in addition to statistical significance from nonparametric analyses by Spearman’s rank correlation, Kendall’s rank correlation, and Hoeffding’s test of independence.
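The difference between the parametric and rank-based coefficients mentioned here can be made concrete with a small Python sketch. It implements the standard definitions of Pearson's r and Spearman's rho (Pearson's r computed on average ranks) and is not the JMP implementation used in this study; the function names and toy values are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson's product-moment correlation coefficient."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def average_ranks(values):
    """1-based ranks; tied values share the mean of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson's r applied to average ranks."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Toy expression values (hypothetical): a monotonic but non-linear relation.
expr_a = [1.0, 2.0, 3.0, 4.0, 5.0]
expr_b = [1.0, 4.0, 9.0, 16.0, 25.0]
r_value = pearson_r(expr_a, expr_b)
rho_value = spearman_rho(expr_a, expr_b)
```

For a monotonic but non-linear relation such as this toy pair, the rank-based coefficient reaches 1 while Pearson's r stays below 1, which is one reason to report both kinds of measure.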

GSEA

GSEA was performed using a dataset constructed from the differential expression of genes defined by FPKM (FDR < 0.01)43. Differentially expressed genes were detected using the edgeR package in Bioconductor (http://bioconductor.org/packages/release/bioc/html/edgeR.html), followed by a survey of pathways using KEGG (https://www.genome.jp/kegg/).

DLBCL dataset

A dataset of 47 patients with DLBCL with available OS and gene expression data (RNA-Seq) was collected from The Cancer Genome Atlas (TCGA) (https://tcga-data.nci.nih.gov/docs/publications/tcga/?) via the cBioPortal for Cancer Genomics (https://www.cbioportal.org/)50.

Gene annotation

Genes of interest were annotated online using GOstat (http://gostat.wehi.edu.au/) and DAVID (https://david.ncifcrf.gov/)50.

Statistics

Statistical analysis was performed using the JMP built-in modules (SAS Institute Inc.)50. A p-value < 0.05 was considered statistically significant.