Published in: BMC Cancer 1/2020

Open Access 01.12.2020 | Research article

Sepsis-associated pathways segregate cancer groups

Authors: Himanshu Tripathi, Samanwoy Mukhopadhyay, Saroj Kant Mohapatra

Abstract

Background

Sepsis and cancer are both leading causes of death, and the occurrence of either increases the likelihood of the other. While cancer patients are susceptible to sepsis, survivors of sepsis are also susceptible to developing certain cancers. This mutual dependence for susceptibility suggests shared biology between the two disease categories. Earlier analysis had revealed a cancer-related pathway to be up-regulated in Septic Shock (SS), an advanced stage of sepsis. This has motivated a more comprehensive comparison of the transcriptomes of SS and cancer.

Methods

Gene Set Enrichment Analysis was performed to detect the pathways enriched in SS and cancer. Thereafter, hierarchical clustering was applied to identify relative segregation of 17 cancer types into two groups vis-a-vis SS. Biological significance of the selected pathways was explored by network analysis. Clinical significance of the pathways was tested by survival analysis. A robust classifier of cancer groups was developed based on machine learning.

Results

A total of 66 pathways were observed to be enriched in both SS and cancer. However, clustering segregated cancer types into two categories based on the direction of transcriptomic change. In general, there was up-regulation in SS and one group of cancer (termed Sepsis-Like Cancer, or SLC), but not in other cancers (termed Cancer Alone, or CA). The SLC group mainly consisted of malignancies of the gastrointestinal tract (head and neck, oesophagus, stomach, liver and biliary system), which are often associated with infection. A machine learning classifier segregated the two cancer groups with high accuracy (> 98%). Additionally, pathway up-regulation was observed to be associated with survival in the SLC group of cancers.

Conclusion

Transcriptome-based systems biology approach segregates cancer into two groups (SLC and CA) based on similarity with SS. Host response to infection plays a key role in pathogenesis of SS and SLC. However, we hypothesize that some component of the host response is protective in both SS and SLC.
Notes
Himanshu Tripathi and Samanwoy Mukhopadhyay contributed equally to this work.

Supplementary information

Supplementary information accompanies this paper at https://doi.org/10.1186/s12885-020-06774-9.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Abbreviations
BLCA: Bladder urothelial carcinoma
BRCA: Breast invasive carcinoma
CA: Cancer alone
CHOL: Cholangiocarcinoma
COAD: Colon adenocarcinoma
ESCA: Esophageal carcinoma
FPKM: Fragments Per Kilobase of transcript per Million mapped reads
GEO: Gene expression omnibus
HNSC: Head and neck squamous cell carcinoma
ICU: Intensive care unit
KEGG: Kyoto encyclopaedia of genes and genomes
KICH: Kidney chromophobe
KIRC: Kidney renal clear cell carcinoma
KIRP: Kidney renal papillary cell carcinoma
LFC: Log2(fold-change)
LIHC: Liver hepatocellular carcinoma
LUAD: Lung adenocarcinoma
LUSC: Lung squamous cell carcinoma
NCBI: National Center for Biotechnology Information
NN: Neural Net
PRAD: Prostate adenocarcinoma
READ: Rectum adenocarcinoma
SLC: Sepsis-Like Cancer
SS: Septic Shock
STAD: Stomach adenocarcinoma
SVM: Support Vector Machine
TCGA: The cancer genome atlas
THCA: Thyroid carcinoma
UCEC: Uterine corpus endometrial carcinoma

Background

Sepsis is a potentially life-threatening complication caused by dysregulated host response to infection, often leading to organ failure and death. The estimated global burden of sepsis in 2017 was more than 48.9 million cases, with 11 million deaths [1]. Septic shock is the advanced stage of sepsis with metabolic dysregulation and uncontrolled hypotension. Several epidemiological studies have linked sepsis and cancer [2, 3]. Liu et al. [2] conducted an association study between sepsis and ensuing risk of cancer in the elderly adult population of the United States, and observed that sepsis is significantly associated with increased risk for many cancers, including chronic myeloid leukemia, myelodysplastic syndrome, acute myeloid leukemia, and cancers of the cervix, liver, lung, rectum and colon. Another association study revealed a 2.5-fold increased risk of sepsis in survivors of cancer among community-dwelling adults (the risk is increased up to 10 times in hospitalized cancer patients) [3]. Co-occurrence of cancer with sepsis is associated with higher mortality than sepsis alone [4]. Conversely, sepsis is a common cause of death in critically ill patients with cancer, with high ICU and hospital mortality [5–7]. Interestingly, mortality due to sepsis varies widely, from 42 to 82%, across cancer tissue types [8], suggesting varying likelihood of survival of patients suffering from different cancers.
This study is motivated by multiple shared features of septic shock (SS) and cancer. The two entities co-occur, with a synergistic effect on mortality. Both are associated with inflammation at some stage of the disease. Inflammation is well understood to promote malignant growth with participation of diverse immune cells and molecules, such as cytokines [9]. Similarly, sepsis is understood as a non-resolving inflammatory response to infection that leads to organ dysfunction [10]. Both diseases are associated with anaerobic metabolism, with lactic acidosis being a hallmark of septic shock [9, 11]. In our earlier work, we observed a cancer-associated pathway to be significantly up-regulated in septic shock [12]. Additionally, there are previous reports on shared molecular changes in sepsis and cancer. Bergenfelz et al. [13] reported that Wnt5a induces an immunosuppressive phenotype of macrophages in sepsis and breast cancer patients. HMGB1, a key late inflammatory mediator of systemic inflammatory response syndrome associated with bacterial sepsis, is also implicated in tumorigenesis and disease progression [14]. Muscle wasting - observed in patients with cancer, severe injury and sepsis - is associated with increased expression of several genes, particularly transcription factors and nuclear cofactors, regulating different proteolytic pathways [15]. Methodologically, all of these studies employed gene-level analysis, i.e., considering the gene as the functional unit. In contrast, a gene set or pathway represents coordinated molecular activity and is a higher-order functional unit in a tissue or cell. Pathway-level analysis allows detection of a cumulative signal that is not accessible at the gene level. To our knowledge, there is no report in the literature on pathway-level comparison between sepsis and cancer.
In the present study, we have performed unbiased analysis of SS and cancer datasets to discover shared patterns of pathway-level transcriptional alteration underlying the two illnesses.

Methods

Gene expression data

TCGA data

Gene expression data as FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values were retrieved from The Cancer Genome Atlas (TCGA) database (https://portal.gdc.cancer.gov/) on July 5, 2018 for 17 different human cancers i.e., bladder urothelial carcinoma (BLCA), breast invasive carcinoma (BRCA), cholangiocarcinoma (CHOL), colon adenocarcinoma (COAD), esophageal carcinoma (ESCA), head and neck squamous cell carcinoma (HNSC), kidney chromophobe (KICH), kidney renal clear cell carcinoma (KIRC), kidney renal papillary cell carcinoma (KIRP), liver hepatocellular carcinoma (LIHC), lung adenocarcinoma (LUAD), lung squamous cell carcinoma (LUSC), prostate adenocarcinoma (PRAD), rectum adenocarcinoma (READ), stomach adenocarcinoma (STAD), thyroid carcinoma (THCA) and uterine corpus endometrial carcinoma (UCEC). For each cancer type, TCGA project code was provided in the search field and RNA-seq data for paired samples (each pair consisting of tissue from tumour zone and tissue from adjacent unaffected zone in the same individual) were downloaded. In all, gene expression data were derived from 687 patients with cancer. Data were transformed to logarithmic scale (base 2).

GEO data

Six studies of septic shock (SS) were selected following the procedure described earlier [12]. Normalized gene expression data from the series matrix files were retrieved from NCBI Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/) on April 10, 2019. Data were transformed to logarithmic scale (base 2). Expression intensity for each Entrez gene ID was calculated after removing duplicated probe sets. Genes common to all SS studies were included in the analysis. In all, gene expression data were derived from 445 patients with SS and 116 control subjects. Redundant samples (other than control and SS) were excluded from analysis.
Characteristics of 23 distinct data sets used in the current study (6 of SS and 17 of cancer) are listed in Table 1. Each data set consisted of transcriptome-wide expression data from a number of patients suffering from a single disease (SS or cancer). For SS, each data set consisted of blood transcriptome data. For cancer, each data set consisted of transcriptome data from a tissue. Control was defined as adjacent normal tissue for cancer (TCGA), and a healthy subject for SS (GEO).
Table 1
Characteristics (such as tissue of origin, source database, disease type, sample size of each study and details about the platform technology used to generate the data) of the 23 data sets (17 cancer + 6 septic shock) included in the analysis. Study_code refers to the code assigned by the source database (either TCGA or GEO) to the data set. TCGA stands for The Cancer Genome Atlas. GEO stands for NCBI Gene Expression Omnibus. Paired control refers to adjacent normal tissue in the same cancer patient. Technology for transcriptome assay is either sequencing (RNA-seq) or hybridization-based microarray (Affymetrix)
Study_Code | Data source | Disease | Tissue | Number of samples | Paired control | Technology
BLCA | TCGA | Cancer | Bladder | 38 | Yes | RNA-seq (Illumina)
BRCA | TCGA | Cancer | Breast | 220 | Yes | RNA-seq (Illumina)
CHOL | TCGA | Cancer | Gallbladder, liver and parts of biliary tract | 18 | Yes | RNA-seq (Illumina)
COAD | TCGA | Cancer | Colon and rectosigmoid junction | 82 | Yes | RNA-seq (Illumina)
ESCA | TCGA | Cancer | Esophagus | 16 | Yes | RNA-seq (Illumina)
HNSC | TCGA | Cancer | Base of tongue, floor of mouth, gum, hypo- and oro-pharynx, larynx, etc. | 86 | Yes | RNA-seq (Illumina)
KICH | TCGA | Cancer | Kidney | 48 | Yes | RNA-seq (Illumina)
KIRC | TCGA | Cancer | Kidney | 144 | Yes | RNA-seq (Illumina)
KIRP | TCGA | Cancer | Kidney | 64 | Yes | RNA-seq (Illumina)
LIHC | TCGA | Cancer | Liver and intrahepatic bile ducts | 100 | Yes | RNA-seq (Illumina)
LUAD | TCGA | Cancer | Bronchus and lung | 114 | Yes | RNA-seq (Illumina)
LUSC | TCGA | Cancer | Bronchus and lung | 98 | Yes | RNA-seq (Illumina)
PRAD | TCGA | Cancer | Prostate gland | 104 | Yes | RNA-seq (Illumina)
READ | TCGA | Cancer | Rectum rectosigmoid junction | 20 | Yes | RNA-seq (Illumina)
STAD | TCGA | Cancer | Stomach | 62 | Yes | RNA-seq (Illumina)
THCA | TCGA | Cancer | Thyroid gland | 116 | Yes | RNA-seq (Illumina)
UCEC | TCGA | Cancer | Corpus uteri | 46 | Yes | RNA-seq (Illumina)
GSE4607 | GEO | Septic Shock | Whole Blood | 84 | No | Microarray (Affymetrix HGU 133 Plus 2.0)
GSE8121 | GEO | Septic Shock | Whole Blood | 75 | No | Microarray (Affymetrix HGU 133 Plus 2.0)
GSE9692 | GEO | Septic Shock | Whole Blood | 45 | No | Microarray (Affymetrix HGU 133 Plus 2.0)
GSE13904 | GEO | Septic Shock | Whole Blood | 124 | No | Microarray (Affymetrix HGU 133 Plus 2.0)
GSE26378 | GEO | Septic Shock | Whole Blood | 103 | No | Microarray (Affymetrix HGU 133 Plus 2.0)
GSE26440 | GEO | Septic Shock | Whole Blood | 130 | No | Microarray (Affymetrix HGU 133 Plus 2.0)

Pathway enrichment analysis

Pathway (gene set) enrichment analysis was performed using the algorithm previously described [12, 16]. Gene sets were defined based on pathways annotated at KEGG [17]. Any pathway with 10 or fewer genes was discarded from analysis. For each gene, a t-statistic was computed to denote the change in gene expression in the case group compared to the control group. For each pathway, a score was calculated by weighted averaging of all gene-level t-statistics for the pathway (i.e., the sum of the gene-level t-statistics divided by the square root of the number of genes in the pathway). Significance of the observed pathway score was calculated by permutation testing, performed in the following manner. In each permutation, the samples were randomly re-labelled as case and control, and a simulated pathway score was calculated. This was done 10,000 times, generating 10,000 simulated values representing the null distribution of the pathway score. Deviation of the observed pathway score from the null distribution was quantified by the fraction of times that the simulated score was more extreme than the observed score. This fraction was assigned as the permutation p-value of the observed pathway score. Pathway enrichment analysis was performed using code modified from the R function gseattperm() of the package Category [18].
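The scoring and permutation steps described above can be sketched as follows. This is a minimal illustration in Python using only the standard library, not the authors' R code (which is derived from gseattperm()). The equal-variance t-statistic, the two-sided use of absolute values, the "at least as extreme" comparison, and the names t_statistic, pathway_score and permutation_p are assumptions made for the example. The toy data are hypothetical and use fewer permutations and fewer genes than the study.

```python
import math
import random
from statistics import mean, variance


def t_statistic(case, control):
    """Equal-variance two-sample t-statistic (an assumed variant)."""
    n1, n2 = len(case), len(control)
    pooled = ((n1 - 1) * variance(case) + (n2 - 1) * variance(control)) / (n1 + n2 - 2)
    return (mean(case) - mean(control)) / math.sqrt(pooled * (1 / n1 + 1 / n2))


def pathway_score(expr, labels, genes):
    """Sum of gene-level t-statistics divided by sqrt(number of genes)."""
    total = 0.0
    for gene in genes:
        case = [v for v, lab in zip(expr[gene], labels) if lab == 1]
        control = [v for v, lab in zip(expr[gene], labels) if lab == 0]
        total += t_statistic(case, control)
    return total / math.sqrt(len(genes))


def permutation_p(expr, labels, genes, n_perm=1000, seed=0):
    """Fraction of label shuffles whose score is at least as extreme as observed."""
    rng = random.Random(seed)
    observed = pathway_score(expr, labels, genes)
    shuffled = list(labels)
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # random re-labelling of case and control
        if abs(pathway_score(expr, shuffled, genes)) >= abs(observed):
            extreme += 1
    return observed, extreme / n_perm


# Toy log2 expression: 3 cases followed by 3 controls (hypothetical values).
# A real pathway in the study has more than 10 genes; two are used here for brevity.
expr = {
    "GENE_A": [5.1, 5.3, 5.2, 3.0, 3.2, 2.9],
    "GENE_B": [7.4, 7.9, 7.6, 6.1, 6.0, 6.3],
}
labels = [1, 1, 1, 0, 0, 0]
score, p_value = permutation_p(expr, labels, ["GENE_A", "GENE_B"])
```

With only six samples there are few distinct re-labellings, so the smallest attainable p-value is bounded away from zero; the 10,000 permutations used in the study presuppose larger sample sizes.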

Cluster analysis

Pathway scores of all 23 studies (6 SS + 17 cancer) were subjected to hierarchical clustering in the following manner. First, the Euclidean distance matrix was computed to capture the pairwise dissimilarity among the studies. Thereafter, the distance matrix was subjected to agglomerative hierarchical cluster analysis, with “complete” linkage method. Distance matrix computation and cluster analysis were performed using the R functions dist() and hclust() respectively. Output of the cluster analysis was plotted as a dendrogram.
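As an illustration of the procedure above, the following Python sketch computes Euclidean distances between studies and merges clusters by complete linkage. It is a simple stand-in for the R functions dist() and hclust(), not a reimplementation of them, and tie-breaking may differ. The study names and pathway scores are hypothetical.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two pathway-score vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def complete_linkage(points):
    """Agglomerative clustering; returns merges as (members_a, members_b, height)."""
    clusters = {name: [name] for name in points}
    merges = []
    while len(clusters) > 1:
        best = None
        keys = sorted(clusters)
        for i, a in enumerate(keys):
            for b in keys[i + 1:]:
                # Complete linkage: cluster distance is the largest pairwise distance.
                d = max(euclidean(points[x], points[y])
                        for x in clusters[a] for y in clusters[b])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), d))
        clusters[a + "+" + b] = clusters.pop(a) + clusters.pop(b)
    return merges


# Hypothetical pathway scores (three pathways per study).
scores = {
    "SS_1": [4.0, 3.5, 5.0],
    "SS_2": [4.2, 3.8, 4.7],
    "cancer_A": [3.9, 3.0, 4.5],
    "cancer_B": [-2.0, -1.5, -3.0],
    "cancer_C": [-2.5, -1.0, -2.8],
}
merges = complete_linkage(scores)
top_split = {tuple(merges[-1][0]), tuple(merges[-1][1])}
```

In this toy example the final merge joins the clade containing the two SS studies and cancer_A with the clade containing cancer_B and cancer_C, mirroring the kind of two-group split read off the dendrogram.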

Visualization of pathway-level and gene-level expression scores

A pathway was selected if it was significantly enriched (permutation p < 0.01) in 80% or more of the studies in one or both of the cancer groups (SLC, CA). The pathway score matrix was generated for all selected pathways across all 23 data sets. Heat map was generated with the R function heatmap.2() of the package gplots [19]. For each pathway, the mean pathway score across the disease group was calculated separately. Boxplot of the pathway scores for the three disease groups (SS, SLC and CA) was drawn using R function boxplot().
For generating gene-level heat maps of individual pathways, the following steps were followed. First, average log-expression level was calculated for the case group and control group separately. Log2(fold-change) or LFC was calculated by subtracting the average control value from the average case value. This was done for each gene. For each disease type (SS, SLC or CA), median LFC was calculated across the studies in that disease group. Combined heat map was generated based on the pathway genes and the three disease groups. Only those genes with similar LFC directionality in SLC and SS were included in visualization.
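The gene-level steps above can be sketched in Python as follows (the authors worked in R; this is only an illustration). The function names, gene names and expression values are hypothetical, and the rule "same sign of the median LFC" is an assumed reading of "similar LFC directionality".

```python
from statistics import mean, median


def log_fold_change(case_log2, control_log2):
    """LFC = mean log2 expression in cases minus mean in controls."""
    return mean(case_log2) - mean(control_log2)


def median_lfc(studies):
    """studies: list of dicts mapping gene -> (case_values, control_values)."""
    genes = studies[0].keys()
    return {g: median(log_fold_change(*s[g]) for s in studies) for g in genes}


# Hypothetical log2 expression values for two SS studies and one SLC study.
ss_studies = [
    {"GENE_A": ([8.0, 9.0], [6.0, 7.0]), "GENE_B": ([5.0, 5.0], [4.0, 4.0])},
    {"GENE_A": ([7.0, 8.0], [6.5, 6.5]), "GENE_B": ([6.0, 6.0], [5.5, 5.5])},
]
slc_studies = [
    {"GENE_A": ([10.0, 11.0], [9.0, 10.0]), "GENE_B": ([3.0, 3.0], [4.0, 4.0])},
]

ss = median_lfc(ss_studies)
slc = median_lfc(slc_studies)

# Keep only genes whose median LFC has the same sign in SS and SLC.
concordant = [g for g in ss if ss[g] * slc[g] > 0]
```

Here GENE_A is up-regulated in both groups and is kept, while GENE_B changes in opposite directions and is excluded from the visualization.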

Network analysis

Firstly, a background network was constructed by including all the pathways (KEGG) [17] with overlapping gene memberships, i.e., for a pathway to be included, it must share at least 5% of the total number of genes with another pathway. Pathways connected to less than 3 other pathways each were dropped from network analysis. In this network, each pathway was considered a node, and the edge between two nodes represented overlap between the two pathways. In this way, a network was constructed with 244 nodes and 5304 edges. Degree distribution of the network was calculated using the function degree_distribution() of the package igraph [20]. Plot of the degree distribution of the network was drawn. For each node in the network, degree and betweenness centrality were calculated with the R functions degree() and betweenness() respectively. In the network diagram, selected pathways were shown as nodes coloured in red. Box plot was drawn to show the difference of degrees between selected and other pathways.
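A minimal Python sketch of the network construction is given below; the authors used the R package igraph. Two points are assumptions, because the text does not specify them: the 5% overlap is measured relative to the smaller of the two pathways, and the removal of poorly connected pathways is applied once rather than iteratively. Pathway and gene names are hypothetical.

```python
from itertools import combinations


def build_overlap_network(pathways, min_fraction=0.05, min_degree=3):
    """pathways: dict name -> set of gene IDs. Returns (nodes, edges, degree)."""
    edges = set()
    for a, b in combinations(sorted(pathways), 2):
        shared = len(pathways[a] & pathways[b])
        smaller = min(len(pathways[a]), len(pathways[b]))
        # Assumed reading of the 5% rule: shared genes relative to the smaller pathway.
        if shared > 0 and shared / smaller >= min_fraction:
            edges.add((a, b))

    def degrees(edge_set, nodes):
        deg = {n: 0 for n in nodes}
        for a, b in edge_set:
            deg[a] += 1
            deg[b] += 1
        return deg

    # Drop pathways connected to fewer than min_degree others (single pass, assumed).
    first_pass = degrees(edges, pathways)
    nodes = {n for n, d in first_pass.items() if d >= min_degree}
    edges = {(a, b) for a, b in edges if a in nodes and b in nodes}
    return nodes, edges, degrees(edges, nodes)


# Hypothetical gene sets: P1 to P4 share gene g1; P5 overlaps only with P1.
toy_pathways = {
    "P1": {"g1", "g2", "g9"},
    "P2": {"g1", "g3"},
    "P3": {"g1", "g4"},
    "P4": {"g1", "g5"},
    "P5": {"g9", "g6"},
}
nodes, edges, degree = build_overlap_network(toy_pathways)
```

In the toy network P5 has a single neighbour and is removed, leaving four fully connected pathways. Betweenness centrality is not computed here; a graph library would normally be used for that step.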

Sample-level pathway score

Individual pathway scores for the cancer patients were computed in the following manner. For each subject, two genome-wide expression vectors were retrieved from TCGA: one for the tumour tissue (T) and another for the adjacent normal tissue (N). Normalized gene expression vector (E) was then calculated by subtracting the normal expression values from the tumour expression values (E = T - N). For each of the selected 66 pathways, a score (Z) was calculated as the weighted average of the expression of the genes in the pathway (i.e., sum of the individual gene expression scores divided by the square root of the number of genes in the pathway). This resulted in 66 pathway scores for each patient. These individual pathway scores were used for survival analysis.
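The per-patient calculation above, E = T - N followed by Z = (sum of E over pathway genes) / sqrt(number of genes), can be illustrated with a short Python sketch. The function name, gene names and expression values are hypothetical; the authors' own implementation is in R.

```python
import math


def sample_pathway_score(tumour, normal, genes):
    """Z = sum over pathway genes of (T - N), divided by sqrt(number of genes)."""
    diffs = [tumour[g] - normal[g] for g in genes]
    return sum(diffs) / math.sqrt(len(genes))


# Hypothetical log2 expression for one patient (tumour and adjacent normal tissue).
tumour = {"A": 6.0, "B": 7.0, "C": 4.5, "D": 3.5}
normal = {"A": 5.0, "B": 5.0, "C": 4.0, "D": 3.0}

# Differences are 1.0, 2.0, 0.5 and 0.5, so Z = 4.0 / sqrt(4) = 2.0.
z = sample_pathway_score(tumour, normal, ["A", "B", "C", "D"])
```

Dividing by the square root of the gene count, rather than the count itself, keeps scores of pathways of different sizes on a comparable scale when gene-level values are roughly independent.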

Machine learning

For the machine learning-based prediction, we retrieved additional transcriptome data for SLC (Sepsis-Like Cancer) and CA (Cancer Alone) cancers from NCBI GEO. Six datasets containing 542 cancer subjects (of both SLC and CA) and 180 healthy control subjects were included (Additional file 7: Table S4). The additional validation data sets were required because the TCGA data sets had been used for feature selection (i.e., 66 pathways) and segregation of cancer groups (SLC and CA). For each cancer sample, pathway scores were generated by weighted averaging of the gene-level t-statistic between the case group and control group (i.e., sum of the individual t-statistics divided by the square root of the number of genes in the pathway). The scores of the 66 pathways were used as input and the cancer group label (SLC or CA) as output in machine learning-based classification of cancer patients. Support Vector Machine (SVM) and Neural Net (NN) classifiers were implemented using the R package MLInterfaces [21]. SVM was used through the function call MLearn() with the learning function svmI. Neural Network was used through the function call MLearn() with the learning function nnetI and three nodes in the hidden layer, with weight decay set to 0.01. The confusion matrix from the classifier output of five-fold cross-validation was collected for calculation of misclassification rate (fraction of samples wrongly predicted by the algorithm) and accuracy.
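The balanced five-fold partitioning used for cross-validation can be sketched as below. This Python function is a generic stratified split written for illustration; it is not the partition function of MLInterfaces, and the name balanced_folds is hypothetical. The class sizes (350 CA and 192 SLC) are taken from the row totals of Table 2.

```python
import random
from collections import defaultdict


def balanced_folds(labels, k=5, seed=0):
    """Assign sample indices to k folds so each fold has a similar class mix."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, lab in enumerate(labels):
        by_class[lab].append(idx)
    folds = [[] for _ in range(k)]
    for lab in sorted(by_class):
        idxs = by_class[lab]
        rng.shuffle(idxs)
        # Deal the shuffled samples of this class round-robin across the folds.
        for pos, idx in enumerate(idxs):
            folds[pos % k].append(idx)
    return folds


labels = ["CA"] * 350 + ["SLC"] * 192
folds = balanced_folds(labels)
ca_per_fold = [sum(labels[i] == "CA" for i in fold) for fold in folds]
slc_per_fold = [sum(labels[i] == "SLC" for i in fold) for fold in folds]
```

Each fold serves once as the test set while the remaining four are used for training, so every sample is predicted exactly once and the predictions can be pooled into a single confusion matrix.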

Survival analysis

Based on the number of cases and available survival information, three SLC cancer types (HNSC, KIRC and LIHC) were selected for survival analysis. The following clinical metadata were collected as provided by TCGA: days (to death or last follow-up) and outcome (survivor or non-survivor). Patient-level (sample-level) pathway score was used for analysis. For each cancer type, the patients were divided into two groups (high-score and low-score) by applying the median pathway score as threshold. Survival plots were generated using the functions in the R package survival [22] as described here. First, a survival object was created by applying the function Surv() to the days and outcome. This survival object was passed to the function survfit(), which fitted two survival curves based on the level of pathway score (high or low). Survival plots were displayed using the function ggsurvplot() from the package survminer [23]. Significance of the difference between the two survival curves was calculated by the function surv_pvalue() from the same package.
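To make the median split and the survival curves concrete, the sketch below implements a Kaplan-Meier estimate in plain Python. It is an illustration only: the authors used Surv(), survfit() and ggsurvplot() in R, the significance test is omitted here, and assigning patients with a score exactly equal to the median to the low group is an assumption. Patient scores and follow-up times are hypothetical.

```python
from statistics import median


def kaplan_meier(times, events):
    """Return [(time, survival probability)] at each time with at least one death."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = removed = 0
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]   # event = 1 for death, 0 for censored
            removed += 1
            i += 1
        if deaths:
            surv *= 1 - deaths / at_risk
            curve.append((t, surv))
        at_risk -= removed
    return curve


# Hypothetical patients: (pathway score, days to death or last follow-up, event).
patients = [
    (2.1, 900, 0), (1.8, 750, 0), (1.5, 600, 1), (1.2, 820, 0),
    (0.4, 300, 1), (0.1, 200, 1), (-0.3, 450, 1), (-0.9, 500, 0),
]
threshold = median(score for score, _, _ in patients)
high = [p for p in patients if p[0] > threshold]
low = [p for p in patients if p[0] <= threshold]

km_high = kaplan_meier([p[1] for p in high], [p[2] for p in high])
km_low = kaplan_meier([p[1] for p in low], [p[2] for p in low])
```

In this toy cohort the high-score group retains a higher estimated survival probability than the low-score group throughout follow-up, which is the pattern that the survival plots are intended to reveal.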

Code and data availability

All programming was done in the R programming language [24]. Data and R code are available at the following link: https://figshare.com/ (https://doi.org/10.6084/m9.figshare.8118413.v3, https://doi.org/10.6084/m9.figshare.8118647.v5). The whole data can be downloaded as a single zip file (tcnibmgdoc.zip). Upon uncompressing the zip, instructions for running the code and generating the figures of this manuscript are available in the file howto.pdf.
Selection of the transcriptome data sets and analysis workflow leading to the final list of 66 significant pathways are described in Fig. 1. The study characteristics of the data sets are listed in Table 1.

Results

The transcriptomic data for 23 studies (17 cancer and 6 SS) consisted of 687 patients with cancer and 445 with SS. The data were subjected to progressive analysis and pathway filtering (Fig. 1).

Hierarchical clustering revealed two groups of cancers

At the outset, independent of our data, there were a total of 272 pathways in the KEGG database. Application of permutation-based testing of each pathway revealed that 90 of these were significantly (p < 0.01) enriched in each of the 6 SS data sets, thus constituting a robust set of SS-specific pathways. Hierarchical clustering on combined cancer and sepsis data sets for these 90 pathways revealed two groups of cancers, with one group segregating with SS (Additional file 1: Figure S1). The 6 cancers (HNSC, ESCA, STAD, LIHC, CHOL and KIRC) belonging to the SS clade were termed Sepsis-Like Cancer (SLC). The other 11 cancers (BLCA, BRCA, COAD, KICH, KIRP, LUAD, LUSC, PRAD, READ, THCA and UCEC) formed the other clade and were termed the CA (Cancer Alone) group.
In order to exclude those pathways that may be irrelevant to cancer biology, the 90 pathways were tested for enrichment in any or both of the cancer groups (i.e., significantly enriched in at least 80% of a group of cancer). 24 out of the 90 SS-specific pathways were observed not to be associated with any of the two cancer groups. Exclusion of these 24 non-significant pathways resulted in retention of 66 pathways significantly associated with both SS and cancer. Pathway score matrix of these 66 pathways across all diseases (including SS, SLC and CA) was visualized as a heat map (Fig. 2a). The sample dendrogram of this heat map recapitulated the segregation of the cancer groups as observed earlier with 90 pathways (Additional file 1: Figure S1). In general, the heat map demonstrated up-regulation of the pathways in SS and SLC. Further, for each pathway, mean score was calculated across all data sets in one disease group (SS, SLC or CA). Box plot of these scores clearly demonstrated the shared directionality in pathway dysregulation in SS and SLC, i.e., in both these conditions there was up-regulation of the pathway (Fig. 2b). Additionally, the average number of pathways up-regulated in a disease group was observed to be 63, 58 and 11 for SS, SLC and CA respectively (Additional file 4: Table S1), thus establishing general agreement in the direction of pathway transcriptional change between SS and SLC. The overall trend in pathway dysregulation was also reflected in the gene-level heat map for individual pathways (Additional file 2: Figure S2).

Network analysis

Genome-level biological significance of the selected pathways was revealed by network analysis. Here, the network consisted of all KEGG pathways as nodes and substantial pairwise gene-overlap among pathways as edges. The centrality of the selected pathways was further quantified in terms of degree and betweenness centrality. Degree captures short-range connectivity of a node (i.e., the number of its immediate neighbours), while betweenness centrality captures long-range centrality (i.e., how often the node falls on the shortest path between any two other nodes) and thus the essentiality of the node to the structure of the network. The degree and betweenness values for the selected 66 pathways were recorded (Additional file 5: Table S2).
The degree distribution of the network showed scale-free property of a biological network, i.e., many nodes with a few edges and a few nodes with large number of edges (Additional file 3: Figure S3 - panel A). Visual exploration revealed the selected pathways at the core of the network (Fig. 3), and this was confirmed by comparing the degree of these 66 pathways with other pathways in the human genome. As shown in Additional file 3: Figure S3 (panel B), selected pathways had higher degrees than the other pathways.

Machine learning-based prediction of cancer group (SLC/CA) from 66 pathway scores

In order to assess the clinical relevance of the selected pathways, we asked if it is possible to predict (by a machine learning algorithm) the cancer group based on the selected pathway scores of an individual patient of cancer. For each patient, 66 pathway scores were input for the classifier, with the expected output being SLC or CA. Five-fold cross-validation was employed with balanced partitioning in each fold. Both algorithms - Support Vector Machine (SVM) and Neural Net (NN) – performed with high accuracy in assigning the samples to their respective cancer groups. SVM correctly assigned 537 (99.08%) out of 542 patients, while NN correctly assigned 536 (98.89%) out of 542 patients, with a misclassification rate of 0.9 and 1.1% respectively (Table 2). This independent validation showed the clinical relevance of the selected 66 pathways for prediction of cancer group from individual patient-level data.
Table 2
Two machine learning algorithms - Support Vector Machine (SVM) and Neural Network (NN) - were implemented through the function call MLearn (of R package MLInterfaces). SVM was applied with default parameters. NN was applied with three nodes in the hidden layer, with weight decay set to 0.01. Five-fold cross-validation was performed by generating a partition function (for cross-validation), where the partitions are approximately balanced with respect to the distribution of cancer tissue types (SLC or CA) in training and test sets in each fold. Confusion matrix from the classifier output was collected for calculation of misclassification rates
Support Vector Machine (misclassification rate: 0.9%)
Given \ Predicted | CA | SLC
CA | 349 | 1
SLC | 4 | 188
Neural Network (misclassification rate: 1.1%)
Given \ Predicted | CA | SLC
CA | 347 | 3
SLC | 3 | 189
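The accuracy and misclassification rates quoted in the text follow directly from the counts in Table 2. The short Python check below reproduces the arithmetic; the function name rates is hypothetical.

```python
def rates(confusion):
    """Return (accuracy, misclassification rate) from a nested confusion matrix."""
    total = sum(sum(row.values()) for row in confusion.values())
    correct = sum(confusion[c][c] for c in confusion)
    return correct / total, (total - correct) / total


# Counts from Table 2: outer key = given class, inner key = predicted class.
svm = {"CA": {"CA": 349, "SLC": 1}, "SLC": {"CA": 4, "SLC": 188}}
nn = {"CA": {"CA": 347, "SLC": 3}, "SLC": {"CA": 3, "SLC": 189}}

svm_acc, svm_mis = rates(svm)   # 537 of 542 samples correct
nn_acc, nn_mis = rates(nn)      # 536 of 542 samples correct
```

Expressed as percentages, these values round to 99.08% and 0.9% for SVM, and 98.89% and 1.1% for NN.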

Survival analysis

Another measure of clinical relevance of pathway signature was provided by its association with survival. For a given pathway, the patients were divided into two groups of subjects (high-scoring and low-scoring) based on the level of pathway score. Survival probability for these two groups was assessed both graphically and statistically. For many of the pathways, there was a higher pathway score in survivors compared to non-survivors in SLC. This directionality was also maintained in SS, i.e., higher pathway score in survivors compared to non-survivors (Additional file 6: Table S3). The number of pathways associated (p < 0.1) with survival varied: 9 (14%) in HNSC, 28 (42%) in LIHC and 34 (52%) in KIRC (Fig. 4a). Representative plots were drawn to show survival differences between high-score and low-score patients for the pathway “Fc gamma R-mediated phagocytosis” (Fig. 4b-d).

Functional classification of the pathways

The selected 66 pathways were divided into categories based on their functional relevance. The categories included: immune system, infection, cancer, catabolism, signal transduction, ribosome biogenesis and carbohydrate metabolism (Additional file 9: Text S1).

Viral integration in TCGA samples

Out of the 687 cancer samples from TCGA analysed by us, 425 (61.9%) overlapped with the samples analysed for viral integration by Tang et al. [25]; of the 425 samples, 120 belonged to the SLC group and 305 belonged to the CA group. Viral integration was observed in 9 (7.5%) SLC samples and 2 (0.66%) CA samples, showing significantly (p = 0.0003) higher viral integration in SLC. In another study, by Cao et al. [26], there was an overlap of 533 (77.6%) of the 687 samples analysed by us. Of the 533, 200 (37.5%) were SLC while 333 (62.5%) were CA cancers. There was significantly (p = 0.001) higher viral integration in SLC (30%) compared to CA (17.4%). The report by Kazemian et al. [27] was not associated with supplementary sample-level data. However, re-analysis of the summary data provided (Additional file 8: Table S5) led to the finding of significantly (p = 4.10E-13) higher viral integration in SLC (18.7%) compared to CA (5.6%). Overall, the three comparisons consistently showed SLC cancers to have a greater likelihood of predicted viral integration.
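The first comparison above (9 of 120 SLC samples versus 2 of 305 CA samples) can be examined with a two-by-two test of association. The text does not state which test was used; the Python sketch below assumes a two-sided Fisher's exact test, and the function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Sums the probabilities of all tables with the same margins that are
    no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x positives in the first row.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


# Rows: SLC, CA. Columns: viral integration observed, not observed.
slc_pos, slc_neg = 9, 120 - 9
ca_pos, ca_neg = 2, 305 - 2

slc_fraction = slc_pos / 120   # 7.5% of SLC samples
ca_fraction = ca_pos / 305     # about 0.66% of CA samples
p = fisher_exact_two_sided(slc_pos, slc_neg, ca_pos, ca_neg)
```

Under this assumption the test gives a p-value well below 0.01, in line with the significant difference reported for the Tang et al. overlap.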

Discussion

Comprehensive system-level analysis reveals shared transcriptomic response between septic shock (SS) and a subset of cancers (SLC). The SLC group predominantly consists of cancer of the upper GI tract, including head and neck, oesophagus, stomach, liver and biliary tract. The striking segregation of the SLC group with SS suggests shared elements of pathophysiology operational in these disorders. Sixty-six pathways differentially expressed in both SS and SLC represent critical biological processes, such as metabolism, immune response and protection against infection. Network analysis reveals the selected pathways as a core functional module (Fig. 3) of the genome-scale network dysregulated in both SS and SLC. Many of these processes, such as metabolism and inflammatory response are known to contribute to the pathogenesis of both sepsis and cancer [9].
The segregation of cancer into two groups (SLC and CA) prompted us to investigate the prospect of the selected pathways as a classifier. Machine learning-based validation of an independent data set proved that the pathways indeed consist of a robust sample-level signature of the cancer groups. It is thus possible to predict, with high accuracy, whether a patient belongs to a particular cancer group (SLC or not). With plummeting costs and increasing acceptability of clinical transcriptomics, this signature may be useful in future for patient stratification.
Similarity in the direction of change in pathway activity immediately evokes an explanation in terms of shared immunological response in both conditions. SLC are often associated with infection (i.e., human papilloma virus in “head and neck”, H. pylori in stomach, hepatitis viruses in liver). In fact, analyses of TCGA data for the presence of viral sequence reads in RNA-seq datasets of several cancers have revealed the infection status of the samples. Interestingly, the majority of the infected samples belong to the GI-associated cancers, including head and neck (HNSC), oesophagus (ESCA), stomach (STAD) and liver (LIHC) [25–29]. Cao et al. (2016) [26] estimated a high percentage of infected samples in TCGA datasets for liver (21.2%), head and neck (16.6%), stomach (14.8%) and oesophagus (12%). It is known that sepsis starts out as an infection, but it is the uncontrolled host response that drives the phase of shock. Similarly, in cancers with an infectious origin, host response (e.g., inflammation) plays a role in malignant transformation [9]. Pathways related to immune response are consistently differentially expressed in both SS and SLC. Although the time scale of pathogenesis is much shorter in SS compared to cancer, our finding calls for greater attention to the shared cellular processes underlying these two distinct clinical phenotypes.
The finding of pathway up-regulation associated with survival brings new insight into the relationship between the shared transcriptomic patterns and disease outcome. Sepsis is a highly lethal dysregulated host response to infection, and SLC, by extension, appears to share a part of that response. Since the pathways are selected on the basis of up-regulation in both SLC and SS (compared to control), a plausible explanation is that the pathway up-regulation is protective rather than detrimental to the human host. It is worth mentioning that viral integration or bacterial infection in a setting of cancer has been reported to favour survival in SLC cancers of oropharynx [30], liver [31] and kidney [32].
Alteration of intestinal permeability is known in both sepsis and cancer [33]. It may be speculated that anatomical proximity increases the likelihood of the liver and upper gastrointestinal tissue to be exposed to the translocated microbial products through a permeable gut, eliciting a host response in both SS and SLC. Significant association between survival and up-regulation of phagocytosis-associated pathways (i.e., Fc gamma R-mediated phagocytosis) lends support to this view. In general, sepsis is less lethal in patients with cancer of the digestive system than cancer of other organs [8]. The role of gut in survival from sepsis and cancer is an active area of research with translational potential.

Conclusions

Firstly, a transcriptome-based systems biology approach segregates cancers into two groups (SLC and CA) based on similarity with SS. Secondly, the similarity rests on a set of pathways associated with the pathogenesis of sepsis and cancer. These pathways form a robust signature of a novel cancer grouping. Lastly, the association with survival suggests that up-regulation of the pathways is protective rather than detrimental to the patient. A mechanism of the shared protective host response is hypothesized in terms of immunocompetence induced by microbial products from the permeable gut. This work is a first step towards systems biology-based patient stratification. It is hoped that future work in this direction will generate actionable knowledge for the clinical management of both cancer and septic shock.

Supplementary information

Supplementary information accompanies this paper at https://doi.org/10.1186/s12885-020-06774-9.

Acknowledgements

We thank the National Cancer Institute for making TCGA data available in the public domain. We thank the National Center for Biotechnology Information for making GEO data available in the public domain. This study would not have been possible without data from these two resources. We thank the Director of the National Institute of Biomedical Genomics for making the computational resources accessible to us. HT and SKM thank the Department of Biotechnology, Government of India for financial support to carry out the analysis. SKM acknowledges very helpful advice received from Prof. Subrata Sinha during the revision of the manuscript.

Ethics approval and consent to participate

In this study, patient data from public databases (GEO and TCGA) have been analysed. We have not used any patient data that are not already available in the public domain.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Rudd KE, Johnson SC, Agesa KM, Shackelford KA, Tsoi D, Kievlan DR, et al. Global, regional, and national sepsis incidence and mortality, 1990–2017: analysis for the Global Burden of Disease Study. Lancet. 2020;395:200–11.
2. Liu Z, Mahale P, Engels EA. Sepsis and risk of cancer among elderly adults in the United States. Clin Infect Dis. 2019;68:717–24.
3. Moore JX, Akinyemiju T, Bartolucci A, Wang HE, Waterbor J, Griffin R. A prospective study of cancer survivors and risk of sepsis within the REGARDS cohort. Cancer Epidemiol. 2018;55:30–8.
4. Abou Dagher G, El Khuri C, Chehadeh AAH, Chami A, Bachir R, Zebian D, et al. Are patients with cancer with sepsis and bacteraemia at a higher risk of mortality? A retrospective chart review of patients presenting to a tertiary care centre in Lebanon. BMJ Open. 2017;7:1–8.
5. Torres VBL, Azevedo LCP, Silva UVA, Caruso P, Torelly AP, Silva E, et al. Sepsis-associated outcomes in critically ill patients with malignancies. Ann Am Thorac Soc. 2015;12:1185–92.
6. Rosolem MM, Rabello LSCF, Lisboa T, Caruso P, Costa RT, Leal JVR, et al. Critically ill patients with cancer and sepsis: clinical course and prognostic factors. J Crit Care. 2012;27:301–7.
7. Aygencel G, Turkoglu M, Turkoz Sucak G, Benekli M. Prognostic factors in critically ill cancer patients admitted to the intensive care unit. J Crit Care. 2014;29:618–26.
8. Wang Y, Zhou J, Wu K. High 28-day mortality in critically ill patients with sepsis and concomitant active cancer. J Int Med Res. 2018;46:5030–9.
9. Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144:646–74.
11.
12. Mukhopadhyay S, Thatoi PK, Pandey AD, Das BK, Ravindran B, Bhattacharjee S, et al. Transcriptomic meta-analysis reveals upregulation of gene expression functional in osteoclast differentiation in human septic shock. PLoS One. 2017;12:1–17.
13. Bergenfelz C, Medrek C, Ekström E, Jirström K, Janols H, Wullt M, et al. Wnt5a induces a tolerogenic phenotype of macrophages in sepsis and breast cancer patients. J Immunol. 2012;188:5448–58.
14. Diener KR, Al-Dasooqi N, Lousberg EL, Hayball JD. The multifunctional alarmin HMGB1 with roles in the pathophysiology of sepsis and cancer. Immunol Cell Biol. 2013;91:443–50.
15. Aversa Z, Alamdari N, Hasselgren PO. Molecules modulating gene transcription during muscle wasting in cancer, sepsis, and other critical illness. Crit Rev Clin Lab Sci. 2011;48:71–86.
16.
17.
18. Gentleman R. Category: Category Analysis. R package version 2018; 2.50.0. 2000.
21. Carey V, Gentleman R, Mar J, Vertrees J, Gatto L. MLInterfaces: Uniform interfaces to R machine learning procedures for data in Bioconductor containers. R package version. 2018; 1.58.1.
22. Therneau TM, Grambsch PM. Modeling survival data: extending the Cox model. New York: Springer; 2000. ISBN 0-387-98784-3.
24. R Core Team. R: A language and environment for statistical computing. 2013.
25. Tang KW, Alaei-Mahabadi B, Samuelsson T, Lindh M, Larsson E. The landscape of viral expression and host gene fusion and adaptation in human cancer. Nat Commun. 2013;4:1–9.
26. Cao S, Wendl MC, Wyczalkowski MA, Wylie K, Ye K, Jayasinghe R, et al. Divergent viral presentation among human tumors and adjacent normal tissues. Sci Rep. 2016;6:28294.
27. Kazemian M, Ren M, Lin J-X, Liao W, Spolski R, Leonard WJ. Possible human papillomavirus 38 contamination of endometrial cancer RNA sequencing samples in The Cancer Genome Atlas database. J Virol. 2015;89:8967–73.
28. Cantalupo PG, Katz JP, Pipas JM. Viral sequences in human cancer. Virology. 2018;513:208–16.
29. Salyakina D, Tsinoremas NF. Viral expression associated with gastrointestinal adenocarcinomas in TCGA high-throughput sequencing data. Human Genomics. 2013;7:1–12.
30. Chaturvedi AK, Engels EA, Pfeiffer RM, Hernandez BY, Xiao W, Kim E, et al. Human papillomavirus and rising oropharyngeal cancer incidence in the United States. J Clin Oncol. 2011;29:4294–301.
31. Chao JS, Zhao SL, Ou-Yang SW, Qian YB, Liu AQ, Tang HM, et al. Post-transplant infection improves outcome of hepatocellular carcinoma patients after orthotopic liver transplantation. World J Gastroenterol. 2019;25:5630–40.
32. Chen J, Lv Y, Mu F, Xu K. Long-term response of metastatic renal clear cell carcinoma following a subcutaneous injection of mixed bacterial vaccine: a case report. Onco Targets Ther. 2019;12:2531–8.
33. Bindels LB, Neyrinck AM, Loumaye A, Catry E, Walgrave H, Cherbuy C, et al. Increased gut permeability in cancer cachexia: mechanisms and clinical relevance. Oncotarget. 2018;9:18224–38.

Metadata

Title: Sepsis-associated pathways segregate cancer groups
Authors: Himanshu Tripathi, Samanwoy Mukhopadhyay, Saroj Kant Mohapatra
Publication date: 1 December 2020
Publisher: BioMed Central
Published in: BMC Cancer, Issue 1/2020
Electronic ISSN: 1471-2407
DOI: https://doi.org/10.1186/s12885-020-06774-9
