Background
Urothelial bladder cancer (UBC) is among the most common malignant tumors of the urological system and one of the most prevalent cancers due to its chronic nature [
1]. As a consequence, it poses an enormous burden on health care systems [
2].
UBC also represents a paradigm of heterogeneous diseases with respect to its phenotype and prognosis. Approximately, 75 % of newly diagnosed UBCs do not invade the muscle (non-muscle invasive bladder cancer, NMIBC) at the time of diagnosis. Most of these cancers remain stable over the time after a transurethral resection (TUR); a high proportion relapse without invading the muscle (recurrence) while a lower proportion progress as a muscle invasive bladder cancer (MIBC). Based on tumor characteristics, mainly stage and grade, NMIBC are subsequently classified as “low risk” (LR) and “high risk” (HiR) of progression [
3].
Current prognostic tools for NMIBC are based on well-known clinico-pathological prognosticators such as pathological grade and stage, number and size of tumours, and presence of carcinoma in situ [
3,
4]. However, these factors do not have enough discriminative ability to predict, at the patient level, the risk of recurrence and progression [
5]. An accurate estimation of the outcome risk in the individual patient would help identifying the most appropriate therapy to avoid tumor progression and, hopefully reducing the number of follow-up cystoscopies in patients at low risk [
6].
There is a growing evidence for a role of germline genetic polymorphisms in cancer risk and prognosis, UBC being a paradigm [
7,
8]. However, the individual effect of the genetic variants is expected to be small and they may not be medically actionable. Multimarker analyses have been shown to capture a much higher percentage of the genetic variance than individual markers which passed the significant threshold in GWAS [
9‐
11].
Our objective was to investigate whether genome-wide common SNP profiles are able to predict the risk of recurrence and progression in NMIBC patients and to estimate how much they contribute to these predictions when combined with clinico-pathological prognosticators. To this end, we adapted Bayesian statistical learning strategies to be applied to the human prognosis field for the first time.
Discussion
Here we present a high dimensional model considering the time-to-event nature of the information and censored data enabling to accommodate a large number of variables in a relatively small number of individuals. To our knowledge, this is the first time that such a model is applied in the clinical and genetic epidemiology fields. More specifically, we have applied it to study the predictive ability of prognostic models for NMIBC patients.
The major goal in managing NMIBC patients is to prevent tumor relapse, this including both the high number of recurrences and the progression to MIBC. To this end, treatment needs to be tailored according to the aggressiveness of the disease. Therefore, accurate prognostic models are crucial. Currently, there are no validated prognostic molecular biomarkers to guide the clinical management of patients [
22,
23] and the therapeutic decisions are still based on risk tables only including clinico-pathological prognosticators [
3]. Here we have investigated the potential clinical utility of inherited genetic markers (SNP profiles) based on their robustness and precise measurements as well as on their time-independent nature in comparison to serological and histological markers. To this end we have assessed the ability to improve TFR and TP risk stratification in NMIBC patients of genome-wide common SNPs profiles. We have also evaluated the performance of well-known clinico-pathological prognosticators and how much the whole genome approach improved their performance to better classify patients.
Regarding the classification performance of clinico-pathological prognosticators alone, our sequential threshold models for both TFR and TP got similar estimates to those obtained previously by us with a Cox proportional hazard regression analysis [
11]. Discrimination of patients according to the risk of TFR using clinico-pathological prognosticators was poorer than previously reported by Hernandez et al [
24] (0.62 vs. 0.75), although better than that reported by Vedder et al [
25] in a large cohort including ours. Nevertheless, it is worth noting that the definition of the outcome differs (recurrence vs. first recurrence) between our and these studies [
24,
25]. Regarding TP outcome, our clinico-pathological prognosticators model classified the patients better than in Hernandez et al [
24] (0.76 vs. 0.54) and than in a Danish cohort using both EORTC (0.76 vs. 0.72) and CUETO (0.76 vs. 0.74) scores [
25]. However, it performed worse than in a Dutch cohort using the same classifiers: EORTC (0.76 vs. 0.81 and 0.77) and CUETO scores (0.76 vs. 0.82 and 0.81) [
25].
The prediction ability of clinico-pathological prognosticators depends on the outcome. They clearly perform better in predicting TP than TFR, both in terms of classification (AUC, 0.76 vs. 0.62) and proportion of the explained variance (
R
probit
2
, 5.4 % vs. 3.1 %). Their lower performance when predicting TFR could be due to the dependence of factors other than biological explanations such as the potential incomplete resection of the tumor during the TURB and the tumour cell reimplantation on first tumour recurrence [
23], factors that are difficult to be assessed and therefore are not accounted for in the model. When the patients were stratified according to their risk status, clinico-pathological prognosticators explained a larger proportion of the phenotypic variance (~15 %) in the HiR group than in the LR NMIBC, probably because these factors were specifically selected to identify patients with HiR tumors with a high potential of progression. However, the overall classification performance of HiR NMIBC patients was poorer (AUC = 0.57) than in the whole cohort. While the discriminatory ability of clinical-pathological parameters for both NMIBC outcomes is valuable, there is room for improvement. More accurate discriminatory models would better select patients for aggressive treatment as well as would avoid unnecessary treatments towards a better patient management. This justifies the search of further prognostic factors, among them tumour molecular alteration and inherited variation markers [
3,
26,
27].
Our results showed that common genome-wide SNPs similarly, though poorly, classified patients regarding both TFR and TP in the whole series and in the HiR and LR subcohorts, AUCs ranging from 0.55 to 0.58. Adding SNP to the models did not improve the classification performance of clinico-pathological prognosticators although improvements of
R
probit
2
were achieved for TFR (3–4) and TP in the HiR cohort (15.1 - 15.5 %). Surprisingly, adding SNP to clinico-pathological prognosticators worsened the percentage of phenotypic variance (
R
probit
2
) explained by the model with clinico-pathological prognosticators only by 7 and 25 % when predicting TP in the whole and the LR-NMIBC cohorts, respectively. The little improvement or even deterioration in terms of
R
probit
2
could be explained by a correlation between the prediction of clinico-pathological prognosticators and that of SNPs. To confirm this, we calculated the
R
probit
2
of a model with
\( \mathbf{X}\widehat{\boldsymbol{\upbeta}} \)obtained from clinico-pathological prognosticators only as dependent variable and the SNPs as independent variables (see Tables
2 and Additional file
1: Table S6). The proportion of the clinico-pathological prognosticators prediction variances of TFR and TP explained by SNPs was larger than that of the TFR and TP phenotypic variances. The calculation of
R
probit
2
allowed us to report the first
ĥ
2 for TFR and TP in the whole series and in the HiR and LR subcohorts. The largest
ĥ
2 corresponded to TFR (1 %) and to TP of patients at HiR (1 %), although they may be underestimated because of the sample size and the limitation on the number of SNPs included in the model [
28]. All the above explains the small or nil contribution of the SNPs to the predictive ability of clinico-pathological prognosticators of the phenotypes of interest. The poor predictive ability of common SNPs in NMIBC prognosis is in line with a previous study reporting low GWAS risk predictive values for UBC [
19], as well as with those obtained in studies predicting risk for other neoplasms, such as breast cancer [
29,
30]. The different results obtained with AUC and
R
probit
2
can be explained by the different scales in which the predictions are expressed (observable for AUC and liability for
R
probit
2
), their non-monotonic relationship, and the lower number of events, especially when the individuals were stratified.
Table 2
Estimates of the determination coefficient (R
probit
2
) measuring the proportion of variance of the liability to first recurrence (TFR) and progression (TP) risks in whole, high risk (TPHiR) and low risk (TPLR) cohorts of the clinicopathological prognosticators explained by the common SNPs
R
probit
2
| 0.0260 | 0.0165 | 0.0025 | 0.0066 |
While this is one of the largest and well-characterized NMIBC cohort worldwide, the restricted sample size in the subgroup analyses is one of the limitations we face here because the small number of events limits the prediction accuracy of the genomic profile achieved with the SNPs. This is even clearer when patients were further stratified as LR-NMIBC. Although increasing sample size of the study would be desirable, heterogeneity across studies regarding patient recruitment, pathological classifications applied, and treatment or patient management would increase random misclassification and, therefore, would dilute estimates. While we conducted a genome-wide exploration, the models did not include all genotyped SNPs (1 million) but a subset that were filtered by a restrict LD. When we applied a less restrictive LD threshold (r
2
< 0.8) and considered a larger number of common SNPs neither the classification performance nor the percentage of the phenotypic variance explained improved (results not shown). Including in the models both rare and structural variants may help in further characterizing and increase the precision of the predictive estimates. Application of other statistical modeling approaches could indeed yield improvements in the predictive power, for example by considering non-additive models that include epistatic interactions between SNPs or adding functional information in the model. Exploring the integration of other –omics data such as microRNAs, as well as considering possible interactions between treatment and variants could also help in this regard.
This study also presents several strengths as its population-based nature, detailed medical information, long follow-up, and centralized pathological review decreasing heterogeneity of the covariates stage and grade. The use of state-of-the art methodology applied here allowed to handle a highly dimensional problem and time-to-event data, as well as censoring. The application of such methodology allowed us to provide the first estimates of heritability for UBC outcomes.
Acknowledgements
We acknowledge the coordinators, field and administrative workers, technicians and study participants of the Spanish Bladder Cancer/EPICURO study.
Spanish Bladder Cancer (SBC)/EPICURO Study investigators: Institut Municipal d’Investigació Mèdica, Universitat Pompeu Fabra, Barcelona – Coordinating Center (M. Kogevinas, N. Malats, F.X. Real, M. Sala, G. Castaño, M. Torà, D. Puente, C. Villanueva, C. Murta-Nascimento, J. Fortuny, E. López, S. Hernández, R. Jaramillo, G. Vellalta, L. Palencia, F. Fermández, A. Amorós, A. Alfaro, G. Carretero); Hospital del Mar, Universitat Autònoma de Barcelona, Barcelona (J. Lloreta, S. Serrano, L. Ferrer, A. Gelabert, J. Carles, O. Bielsa, K. Villadiego), Hospital Germans Trias i Pujol, Badalona, Barcelona (L. Cecchini, J.M. Saladié, L. Ibarz); Hospital de Sant Boi, Sant Boi de Llobregat, Barcelona (M. Céspedes); Consorci Hospitalari Parc Taulí, Sabadell (C. Serra, D. García, J. Pujadas, R. Hernando, A. Cabezuelo, C. Abad, A. Prera, J. Prat); Centre Hospitalari i Cardiològic, Manresa, Barcelona (M. Domènech, J. Badal, J. Malet); Hospital Universitario de Canarias, La Laguna, Tenerife (R. García-Closas, J. Rodríguez de Vera, A.I. Martín); Hospital Universitario Nuestra Señora de la Candelaria, Tenerife (J. Taño, F. Cáceres); Hospital General Universitario de Elche, Universidad Miguel Hernández, Elche, Alicante (A. Carrato, F. García-López, M. Ull, A. Teruel, E. Andrada, A. Bustos, A. Castillejo, J.L. Soto); Universidad de Oviedo, Oviedo, Asturias (A. Tardón); Hospital San Agustín, Avilés, Asturias (J.L. Guate, J.M. Lanzas, J. Velasco); Hospital Central Covadonga, Oviedo, Asturias (J.M. Fernández, J.J. Rodríguez, A. Herrero), Hospital Central General, Oviedo, Asturias (R. Abascal, C. Manzano, T. Miralles); Hospital de Cabueñes, Gijón, Asturias (M. Rivas, M. Arguelles); Hospital de Jove, Gijón, Asturias (M. Díaz, J. Sánchez, O. González); Hospital de Cruz Roja, Gijón, Asturias (A. Mateos, V. Frade); Hospital Alvarez-Buylla (Mieres, Asturias): P. Muntañola, C. Pravia; Hospital Jarrio, Coaña, Asturias (A.M. Huescar, F. Huergo); Hospital Carmen y Severo Ochoa, Cangas, Asturias (J. Mosquera).