Background
Bladder cancer (BC) is one of the most difficult to treat and costly cancers due to its relapse tendency and chemoresistance [
1]. In total, 76,000 new cases and 16,000 deaths are attributed to BC in the USA per year [
2]. With such a large patient population, accurately diagnosing and effectively treating BC have become difficult challenges for basic medical researchers and urologists.
Epigenetic dysregulation is an important mechanism of tumorigenesis that affects the expression of numerous genes [
3]. Aberrant DNA methylation, i.e., hyper- or hypomethylation, on CpG islands of promoters is one such mechanism, resulting in aberrant gene expression and having a major impact on the biological behaviour of BC [
4,
5]. DNA methylation could also serve as a good biomarker for clinical diagnosis because of its stable and easily detectable attributes in many types of clinical specimen [
6,
7]. Dulaimi et al. [
8] reported that the detection of hypermethylation in the APC, RASSF1A, and ARF genes in BC patients may act as a non-invasive method for early diagnosis. Casadio et al. [
9] also indicated that the methylation frequencies of HIC1, GSTP1 and RASSF1A could predict BC recurrence. Ohad et al. [
10] found that CDH13 is downregulated by promoter methylation in BC patients, and this may be closely associated with tumour development.
The TCGA project aims to catalogue and discover major molecular changes to create a comprehensive “map” of the human cancer genome [
11]. The multiple dimensions of data and massive samples not only provide a more comprehensive view of cancer but also enable the finding of better biomarkers, which could affect cancer treatment and prognosis [
12]. DNA methylation data are also included in the massive data set, and a computational protocol called MethylMix can distinguish disease-specific hyper/hypomethylation genes, both of which are publicly available [
13]. Several studies have been conducted to assess methylation-driven genes using the MethylMix algorithm and TCGA database [
13‐
15].
In this study, we identified BC-related methylation-driven genes by using the data from the TCGA database. By coupling DNA methylation and gene transcriptome data, we identified methylation-driven genes and further constructed a model of DNA methylation status to predict prognosis in BC patients.
Discussion
Urothelial carcinoma is generally classified as non-muscle-invasive bladder cancer (NMIBC) or muscle-invasive bladder cancer (MIBC). The standard treatment for NMIBC is transurethral resection, and the universal treatment for MIBC is radical cystectomy, but a considerable number of NMIBC patients (50% to 80%) have tumour recurrence [
1,
2]. Pathological staging is a key factor in current clinical decision making and prognosis of BC; nevertheless, the clinical outcomes of patients with the same stage often differ, indicating that the current staging system is not sufficient to reflect biological heterogeneity, and accurately determining the prognosis of patients is challenging. A new prognostic evaluation model based on molecular entities could guide individualized treatment and improve the therapeutic effect.
DNA methylation is an epigenetic modification that affects the interaction between DNA and regulatory factors, which, in turn, regulates gene expression [
25]. Hypermethylation inhibits gene expression, while hypomethylation promotes gene expression. In addition, the DNA methylation status is faithfully inheritable through cell division but also revisable, it plays a very important role in the dynamic regulation of expression. Numerous studies based on either a genome-wide view or a gene-specific view have demonstrated that DNA methylation drives abnormal gene expression and is a crucial factor in the development and progression of tumours [
26]. Therefore, the methylation profiles of methylation-driven genes in tumour patients could serve as potential biomarkers [
27]. This phenomenon in BC patients is extensive, and many genes have been suggested to be factors involved in pathogenesis and are used as diagnostic and prognostic biomarkers [
28,
29]. Our study provides a comprehensive view of methylation-driven genes in BC, and a prognosis model based on the methylation profile of six genes was developed and has implications for both basic research and clinical applications.
We identified a cohort of 167 methylation-driven genes in BC. The functional annotation demonstrated that these genes are widely scattered in diverse biological processes and pathways ranging from signal transduction, gene regulation, and development to metabolism and cell structure. These results demonstrate that DNA methylation is involved in the dysregulation of genes with distinct functions and suggest possible mechanisms by which DNA methylation is functionally linked to outcomes in BC patients.
Six genes (IDH2, GSTM2, LURAP1, ARHGDIB, LINC00526, and ARL14) with methylation profiles closely related to survival were selected by a LASSO Cox regression. Based on their methylation level and coefficients with survival, a prognostic model was developed. The verification of this model in the whole patient set and subsets grouped by either clinical or molecular characteristics showed that the low-risk group has a better survival status. The AUC of the ROC curve of the whole cohort based on this model was 0.698 at 3 years of OS.
For further potential application of this model in clinical work, a nomogram was generated. The nomogram integrates multiple predictors and simplifies the statistical prediction model to the probability of outcome events; thus, the survival probability of individual patients can be calculated. The predicted survival rate is close to the actual survival situation (C-index: 0.694), and the nomogram has a prediction effectiveness similar to that of the ROC curve. These results indicate the excellent predictive ability of this model in the prognosis of BC patients.
The six genes included in the model were further analysed individually. The hypomethylation of IDH2 and hypermethylation of ARL14 were associated with poor prognosis, and a high expression matched hypomethylation of ARHGDIB and ARL14 was meaningfully correlated with better prognosis. Further analysis of the methylation sites showed that the hypermethylation of 5 sites in LURAP1 and GSTM2 is associated with better prognosis, and the hypermethylation of another 9 sites in ARHGDIB, LINC00526 and ARL14 is associated with poor prognosis in BC. Additionally, the methylation levels at several methylation sites were correlated with the expression levels of the associated genes, all with negative correlations, indicating that these individual methylation sites alone contributed to expression regulation.
The methylation levels of these six genes contributed to the risk score of this model either positively or negatively. Some of this contribution could be functionally explained by previous studies, but the remainder lacks explanation, as information regarding the role of these genes in cancer is very limited.
The methylation levels of ARHGDIB, LINC00526 and ARL14 are positively related to poor survival. ARHGDIB (Rho GDP dissociation inhibitor GDI beta), which is also known as RhoGDI2, is a member of the guanine nucleotide dissociation inhibitor (GDI) family [
30]. Mounting evidence suggests that the reduced expression of ARHGDIB is associated with the development of several types of cancer and that its hypermethylation contributes to its reduced expression [
31]. The CpG islands of ARHGDIB were relatively hypermethylated in cases of ovarian cancer relapse after chemotherapy [
32]. Huang et al. [
33] demonstrated that ARHGDIB is significantly associated with OS in lung cancer patients. In BC, the reduced expression of ARHGDIB is associated with shorter disease-free survival time [
34‐
36]. In our study, the methylation level matched gene expression analysis of ARHGDIB, and the analysis of CpG sites showed that hypomethylation in the ARHGDIB gene is associated with better survival. Our result is consistent with the results of previous studies. LINC00526 is a long intergenic non-protein-coding RNA, and one study has demonstrated that it suppresses glioma progression [
37]. ARL14 (ADP Ribosylation Factor Like GTPase 14) is a protein-coding gene that participates in GTP binding and signal transduction [
38]. However, information regarding the role of ARL14 in cancer is lacking.
The methylation level of IDH2, LURAP1 and GSTM2 is negatively related to poor survival. IDH2 is a protein-coding gene. The function of IDH2 in cancer has been relatively well documented. Li et al. [
39,
40] found that IDH2 promotes lung cancer cell growth and serves as a novel therapeutic target in lung cancer. Mutations of IDH2 are frequently observed in acute myeloid leukaemia [
41], colon cancer [
42,
43], and gliomas [
44], causing alterations in metabolism and DNA methylation; these mutations could represent a possible mechanism of tumorigenesis [
44] and provide potential avenues for therapeutic intervention. We found that hypermethylation in IDH2 is associated with a better prognosis in BC patients. In our study, the relationship among GSTM2, LURAP1 and prognosis showed similar characteristics to IDH2. Hypermethylation at 3 sites in GSTM2 and 2 sites in LURAP1 is correlated with a better prognosis. GSTM2 is a subtype of glutathione S-transferase (GSTs) that performs functions such as eliminating free radicals and is involved in cell protection and the regulation of cell growth. Consistent with our findings, Kresovich et al. [
45] found that a high methylation level in the GSTM2 promoter could be involved in ER/PR-negative breast cancer progression. Ashour et al. [
46] proved that the epigenetic silencing of GSTM2 is a common phenomenon in prostate cancer that could be used as a molecular marker for diagnosis.
To the best of our knowledge, these six genes have not been previously studied as a prognostic model in BC patients. Further verification of this model in other types of clinical specimen, such as urine sediment cells and circulating tumour cells from BC patients, could provide more information regarding its potential clinical application. For urologists, accurate prognostic assessments are critical for selecting the optimal treatment. Our nomogram is a predictive model that combines gene information and clinical factors to provide a prognostic indication for clinicians.
However, this study also has certain limitations. First, this is a retrospective study, and the application of this model requires further verification by increasing the sample size and performing prospective studies. Second, the treatments that the patients have received are highly heterogeneous and incomplete, thus we could not include this information in our analysis. Improving these aspects for future in-depth studies could further increase the persuasiveness of these results.
In summary, we screened methylation-driven genes in BC, and a six-gene model was constructed based on methylation profiles. This model was validated in groups with different disease characteristics and could be expected to serve as a predictive tool for clinical outcomes and guide personalized anticancer treatment. In addition, we analysed the relationships between individual CpG islands associated with these six genes and survival, which may provide important bioinformatic clues for mechanistic research related to the development and progression of BC.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.