Background
Sarcoma is a group of malignant tumors originating from stromal cells, which can be divided into soft tissue sarcoma (including muscle, fat, nerve, blood vessel and other connective tissue) and bone sarcoma [
1]. According to the report of cancer statistics, the total incidence of sarcoma is close to 5 per 100,000 people per year, accounting for 15% of all malignant tumors in children and 1% of malignant tumors in adults [
2]. With the development of diagnostic and therapeutic technology, the 5-year survival rate was higher than 70% in some sarcomas. However, as a whole, the improvement in the diagnosis and treatment for sarcoma is still slowed down [
3,
4]. For advanced stage patients or patients with distant metastasis, the 5-year survival rate was lower than 20% [
4]. Although some strengthening therapies or new drugs were used for treating sarcoma patients, the outcome were still unsatisfactory [
5,
6]. Therefore, developing effective tools for early diagnosis and prognostic prediction plays a vital role in the tailor management of sarcoma patients. Nowadays, AJCC TNM staging system is widely used to evaluate the prognosis of tumors and guide treatment decisions. However, the accuracy of TNM staging is not satisfactory and the prognosis of patients with same TNM stage may be various.
Autophagy is a metabolic and circulatory process in which cells decompose damaged organelles and proteins and release decomposition products [
7]. It has been reported that autophagy regulates many pathological and physiological processes, such as cell differentiation and death, immune stress, inflammatory reaction, and anti-aging [
7‐
11]. Nevertheless, autophagy had dual effects in the process of tumorigenesis and tumor progression. In the early stages of tumorigenesis, autophagy can suppress tumor probably by reducing cell damage and chromosome instability [
12]. However, for existing tumor, autophagy can promote tumor progression by releasing decomposition products and providing nutrition for tumor cells [
13]. Recent study indicated that several autophagy-related genes (ARGs) affect the activation of autophagy and the mutation of ARGs play an important role in the pathogenesis of cancer [
14‐
17]. Moreover, ARGs may have potential value for prognostic prediction for tumor patients, including sarcoma. For example, Zhao et.al reported that the positive expression of ATG5 associated with TSSC3 can inhibit the metastasis and invasion of osteosarcoma cells [
18]. In addition, Muscolino et.al demonstrated that the inhibition of ARG expression was related to the immune escape of Kaposi’s sarcoma-associated herpesvirus [
19]. Although the role of autophagy in the occurrence and development of sarcoma were initially reported in present studies, most studies have focused on specific ARGs and no research performed a comprehensive analysis of ARGs for sarcoma.
In this study, based on a series of available data sets, we performed a comprehensive analysis of ARGs to explore the diagnostic and prognostic values of ARGs for sarcoma patients. Meanwhile, two ARGs-based prognostic nomograms were established and externally validated and the association between ARGs and tumor microenvironment were studied.
Methods
Data acquisition and processing
The RNA-sequencing data of 911 normal human muscle and adipose tissues and 259 primary sarcoma were obtained from the UCSC Xena browser (
https://xenabrowser.net/). For both normal tissue and sarcoma cohorts, RNA-sequencing data (FPKM values) were normalized into log2 (FPKM+ 1). Meanwhile, the corresponding clinical data were downloaded from the UCSC Xena browser and cBioPortal (
http://www.cbioportal.org/). Meanwhile, for independent cohorts, including GSE2719, GSE21122, GSE30929, and TARGET-OS, were included as validation cohorts. The first two cohorts were used to validate the diagnostic value of ARGs for sarcoma, while the remaining cohorts were used to validate the prognostic value of ARGs for sarcoma. The expression file and clinical data of TARGET-OS cohort were downloaded from the UCSC Xena browser, whereas the data of GSE2719, GSE21122, and GSE30929 cohorts were obtained from the Gene Expression Omnibus database (
https://www.ncbi.nlm.nih.gov/geo).
Profiling of DEARGs for sarcoma
To determine sarcoma-related genes, differential analyses were performed between tumor samples and normal tissues with “limma” package. We used R software to perform Wilcoxon signed-rank test to identify the differential expression genes (DEGs). The cutoff value of DEARGs was identified by | log2 (FC) | > 1 and adjusted
p value < 0.05. Furthermore, according to the ARGs list obtained from the HADb (
http://autophagy.lu/clustering/index.html), DEARGs were confirmed. Heatmap and circos plot were used to visualize DEGs and DEARGs.
Furthermore, to understand the function and related pathway of DEARGs, enrichment analyses were used for functional annotation of DEARGs. Gene Ontology (GO) analysis was performed to screen the potential function of DEARGs and Kyoto Encyclopedia of Genes and Genomes (KEGG) was used for pathway analysis of DEARGs. Terms with a
p-value< 0.05, a minimum count of 3, and an enrichment factor > 1 were collected and grouped into clusters based on their membership similarities. The enrichment analyses were performed in Metascape (
http://metascape.org).
Diagnostic value of ARGs for sarcoma
Early and accurate diagnosis for sarcoma is vital for systematic treatment. Although the diagnostic role of ARGs for several diseases were reported, the value for sarcoma remains unclear [
20,
21]. In the present study, several DEARGs were incorporated into the analysis of diagnostic value of ARGs. First, to avoid overfitting among DEARGs, Least absolute shrinkage and selection operator (LASSO) analysis was performed and significant genes were incorporated into receiver operating characteristic (ROC) curve analysis. The area under the curve (AUC) value was used to quantify the diagnostic value of AGRs. Furthermore, to validate the diagnostic value of AGRs for sarcoma patients. The ROC curves of diagnostic genes in two validate cohorts were generated and corresponding AUC values were determined.
Characteristic of autophagy-related phenotypes
Based on the DEARGs, ConsensusClusterPlus package was used to perform the unsupervised clustering analysis to explore the distinct autophagy patterns of sarcoma. In addition, “CIBERSORT” package was used to quantify the 22 types immune cells and the “ESTIMATE” package was used to determine the immune and stromal scores. The difference of clinical data, DEARSs, tumor microenvironment score, microsatellite instability, and immune cell fraction between clusters were analyzed.
Construction of ARG signatures for OS and DFS of sarcoma patients
Previous studies indicated that AGRs-based signature can serve as effective biomarkers for predicting patient’s prognosis [
22‐
25], but the role in sarcoma patients remains unclear. In the present study, two ARGs-based signatures were generated to predict the OS and DFS for sarcoma patients, respectively. Firstly, the TCGA-SARC cohort was used as the training cohort to develop signatures. Based on DEARGs, the univariate Cox regression model was performed. Genes with a
P value< 0.05 was considered to be statistically significant. Then, LASSO regression was performed to exclude overfitting genes. The significant genes in the LASSO analysis were further incorporated into the multivariate Cox regression model to develop signatures, and the risk scores of each patient were calculated by combining regression coefficient and gene expression. According to the median of risk score, 259 sarcoma patients were divided into low- and high-risk groups. The performance of two signatures was evaluated by Kaplan-Meier survival curve and the time-dependent ROC curves.
External validation of ARG signatures
External validation by another independent cohort is vital for prognostic signature. Therefore, we downloaded two independent cohorts (TARGET-OS and GSE30929) to validate our signatures. The risk score of patients in TARGET-OS were calculated by the OS signature, whereas the risk score of patients in GSE30929 were calculated by the DFS signature. The time-dependent ROC curves were generated for two validation cohorts. In addition, survival curves were generated to show the prognostic difference between low- and high-risk groups.
Development of ARG-clinical nomogram for sarcoma
Clinical variables, such tumor site, age and metastatic disease, were also important prognostic biomarkers for sarcoma patients. Therefore, to enhance the predictive ability of signature, we intended to establish two ARG-clinical nomograms. First, the univariate Cox analysis was performed to select OS- and DFS-related variables. Significant variables in the univariate Cox analysis were further incorporated into the multivariate Cox analysis to determine independent prognostic clinical biomarkers. Then, two nomograms that included corresponding independent prognostic biomarkers were developed by “rms” package. The concordance index(C-index) was calculated to show the discrimination and the calibration curve was used to show the calibration of nomogram.
Discussion
Sarcoma is a primary malignant tumor derived from mesenchymal tissue [
26]. Studies have confirmed that autophagy had impact on the tumorigenesis and progression of sarcoma [
27]. In this study, we comprehensively analyzed the role of ARGs in sarcoma patients. A total of 84 genes were determined as DEARGs in sarcoma, which were involved in several autophagy- and tumor-related functions and pathways. Further diagnostic analysis indicated that several ARGs were robust biomarkers for sarcoma diagnosis and survival analysis confirmed the prognostic value of DEARG. One OS autophagy signature and another DFS autophagy signature were developed and validated in independent cohorts. In addition to survival analyses, we firstly performed the ARG-based cluster analysis for sarcoma. Two clusters were determined and showed significantly associated with clinical variables and tumor microenvironment.
Compared with previous studies, this study has the following advantages. First of all, based on the identified DEARG, we not only analyzed the survival of patients with sarcoma, constructed an autophagy-related risk model, but also analyzed the immune microenvironment of sarcoma. Through multi-angle and multi-level analysis, we have a deeper understanding of the prognosis of sarcoma. Secondly, we identified 5 biological functions and pathways related to autophagy in sarcoma, which have been proved to be significantly related to the occurrence and development of malignant tumors. Then, for the first time, our team identified and verified the autophagy-related characteristics of sarcoma from three different databases: GEO, TCGA and TARGET. Finally, we developed two nomograms to predict 3- and 5-year OS and DFS in sarcoma patients and found that it had good predictive performance. Therefore, through the research and analysis of the autophagy-related prognostic model of OS and DFS in, we can provide individual survival prediction and promote the choice of better treatment strategies, which is of guiding significance in clinical application.
Consistent with previous studies, all the genes we identified in ARGs that predict the prognosis of patients with sarcomas have been reported to be closely related to the prognosis of sarcomas or other malignant tumors in previous studies. As early as 1999, some scholars proposed that vascular endothelial growth factor (VEGF) is related to the poor prognosis of osteosarcoma [
28]. VEGFA, a member of the VEGF family, is one of the key regulators of tumor angiogenesis and plays a role in cancer by stimulating VEGF receptor (VEGFR) on tumor cells [
29]. At present, drugs targeting VEGF receptors have been widely used in the treatment of sarcoma, colon cancer, breast cancer and other cancers [
30,
31]. Mika et al. Found that the expression of BIRC5/survivin decreased and sarcoma cells died after YM155 anti-cancer treatment experiment on human synovial sarcoma cell line and gene engineering mouse model of synovial sarcoma [
32]. Moreover, Wirries A et al. [
11] reported that survivin and its downstream target Bcl-2 were suppressed by Panobinostat in osteosarcoma cells, which provide new perspectives for the therapy of osteosarcoma. The above research results also support our analysis hypothesis that the expression of VEGFA and BIRC5 is too high, the prognosis of patients may be poor. Niedan et al. found that high expression of FOXO1 induces cell death in Ewing’s sarcoma through WS-FLI1 inhibitory signals [
33]. In addition to sarcomas, high expression of APOL1 can induce autophagy and autophagy-related cell death, which may be the key to maintaining cell homeostasis in the kidney. Similarly, APOL1 protects renal cells from renal cell carcinoma [
34]. Rutkovsky et al. have shown that the overexpression of EIF4EBP1 in patients with breast cancer is associated with poor prognosis. Silence of EIF4EBP1 can significantly inhibit the proliferation of breast cancer cells by promoting G1 cell cycle arrest [
35].
In addition, through functional enrichment analysis, we identified five biological functions of “autophagy”, “vacuole”, “apoptosis signal pathway”, “autophagy regulation” and “cellular stress response regulation”, as well as “autophagy-animal”, “mitosis-animal”, “cancer pathway”, “apoptosis” and “protein processing in endoplasmic reticulum”. The five autophagy signal pathways were significantly enriched in the sample DEARG. We speculate that the imbalance of expression of these functions and signaling pathways related to autophagy may be related to the biological pathway of sarcoma.
However, there are also some limitations in this study. Firstly, the data used in the study are taken from several public databases, from which the clinical information downloaded is limited and incomplete. Therefore, some important prognostic factors, such tumor grade, tumor size, surgery information, and distant metastatic data, were not available and were not included in the present study. Secondly, the muscle and fat tissues were selected to form as normal tissue to compare the sarcoma. However, some sarcomas, such as myxofibrosarcoma and synovial sarcoma, are not derived from these tissues in the first place. Therefore, this may also lead to some bias in the present study. Finally, as a regulatory factor in the occurrence and development of sarcoma, the molecular mechanism of ARGs regulation needs to be confirmed by further research, so as to promote the clinical application of ARGs as a prognostic marker and therapeutic target of sarcoma.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.