Introduction
Bladder cancer remains a common problem in the Western world [
1]. Approximately 75–85 % of bladder cancer presents as non-muscle-invasive bladder cancer (NMIBC); the remaining patients have muscle-invasive disease (MIBC) [
2].
Treatment of NMIBC consists of complete transurethral resection of the bladder tumor (TURBT), followed by a single immediate postoperative instillation (POI) with chemotherapy. Further treatment depends on patients’ and tumor characteristics. In the guidelines of the European Association of Urology (EAU), patients are divided into three risk groups [
2] (Supplementary Table [ST] 1). This stratification is similar to that provided by the International Bladder Cancer Group (IBCG) [
3] and is partially based on the well-known risk tables developed by the European Organization for Research and Treatment of Cancer (EORTC) [
4].
In general, treatment advises for low-risk and high-risk groups are clearly stated in the guidelines, but treatment advises for intermediate-risk (IR) patients are less clear. This is an important lack of information as the IR group covers a large number of patients with heterogeneous characteristics, making selection of appropriate therapy challenging. Therefore, we identified predictors of recurrence and developed a prediction model for recurrence probabilities for IR-NMIBC patients treated with intravesical chemotherapy.
Methods
Data of three prospective Dutch studies [
5‐
7] were available for analyses, providing us with individual data of 2006 patients. Treatment and follow-up have been described in detail before [
5‐
7] and can be found in ST2.
For the development of the prediction model, in this study, we included only patients with Ta G1/2 urothelial carcinoma
without carcinoma in situ (CIS) and without the combination ‘multiple & recurrent & diameter >3 cm.’ This is consistent with the definition of IR group according to the EAU guideline [
2]. All included patients received intravesical chemotherapy (either mitomycin C or epirubicin).
Primary outcome measurement was time to first recurrence (recurrence-free survival; RFS): time from randomization to the date of the first bladder recurrence.
Statistical methods
First, baseline demographics of the selected Dutch patients are presented (n = 724).
Univariable and multivariable Cox regression with selection procedures and likelihood analyses were used for selecting independent variables for RFS. Smoking and tumor diameter were removed from analyses based on the number of missing data and the hazard ratios (HRs). For the final model, the adjusted HRs are presented, including the 1, 2, and 5 years probabilities for recurrence. To assess the model’s accuracy (discrimination), Harrell’s bias corrected concordance index (c-index) was calculated at 1, 2, and 5 years and models were refitted 200 times with bootstrap resampling techniques.
Three risk groups were constructed based on the risk profiles of the final model: <P33, P33–P66, and >P66. In addition, sensitivity, specificity, negative, and positive predictive value (NPV, PPV) were calculated for the minor risk group at 2 years, as most recurrences occurred within 2 years.
External validation
Prospectively collected, independent, individual patient data provided by Fundacio Puigvert, Barcelona, Spain, were used to study the final prediction model (n = 137). However, this cohort included only data of primary patients. For the comparison, we used a subcohort of primary patients from the Dutch cohort (n = 305). The prediction model was applied to the data of these cohorts.
Statistical analyses were done with SPSS 20.0.0 for Windows (SPSS Inc., Chicago, Il, USA), in SAS 9.2 for Windows (SAS Institute Inc., Cary, NC, USA) and in R 2.2 for Windows.
Discussion
We present a study comparing recurrence outcome and treatment options for the heterogeneous spectrum of IR patients, and we propose a prediction model on recurrence probabilities with external validation. We found five relevant predictors for RFS: a history of recurrences, history of previous treatment, tumor grade 2, multiple tumors, and adjuvant treatment with epirubicin, with HRs of 1.48, 1.38, 1.22, 1.56, and 1.27, respectively. There is a huge difference between 1 and 5 years outcome, and between having none or all of the independent predictors (Table
4). We defined the IR group into three subgroups (minor, moderate, and major risk) in a way that each risk group needs to be considered for a less or more aggressive adjuvant treatment schedule or treatment type. The recurrence probabilities as predicted in Table
4 can be related to a risk group in Fig.
1.
The EORTC have developed risk tables based on a group of 2596 patients [
4]. However, the EORTC risk tables have several limitations: 22 % of patients received no intravesical treatment at all; only 171 patients (7 %) received treatment with bacillus Calmette–Guerin (BCG) and none received BCG maintenance. Therefore, the EORTC risk tables could be interpreted as probabilities of the untreated natural history of the disease, especially for progression. Another well-known prediction model is the scoring model of Club Urologico Español de Tratamiento Oncologico (CUETO) [
8]. Data of 1062 patients, all treated with BCG, were used to identify risk factors for recurrence and progression after BCG treatment. Several other prediction models have been developed for NMIBC [
9‐
15], but none of them included solely patients who, according to the guidelines, should have been and in fact were treated with intravesical chemotherapy.
Recently, Kamat et al. [
16] developed an algorithm specifically for IR patients based on the consensus of the IBCG. They consider tumor size, tumor multiplicity, timing and frequency of recurrences, and previous treatment to be key factors. Based on these key factors, they divide IR patients in three groups: low-risk patients, ‘true’ IR patients, and high-risk patients. Our analyses and model support these recommendations; only tumor size, which is also a significant predictor in the EORTC risk model [
4], is of no influence in our model, and tumor grade is not considered to be a key factor by the IBCG.
Concerning tumor size, Kamat et al. [
16] do mention that the well-known cutoff of 3 cm might be no longer relevant as the number of patients with large tumors is very low, which we could confirm (in the Dutch cohort only 6 %). Within the IBCG, it was suggested to further study a new cutoff of 1 cm. When analyzing this cutoff in our cohort, no statistical significant influence on recurrence outcome was seen (
p = 0.480), but this might be due to the high number of missing data, which is a limitation of our study.
We found that tumor diameter had many missing data and was no statistically significant predictor for RFS in the complete cohort. This is clearly different from other prediction models including the EORTC risk model [
4], but our group of patients all received adjuvant intravesical chemotherapy and are therefore not comparable with, e.g., the EORTC risk model patients.
The term ‘low grade’ is based on the WHO 2004 grading system, which was not yet available during the inclusion period of the three Dutch studies. Therefore, we considered G1 and G2 tumors to be low grade, but G2 tumors are a mixture of low-grade and high-grade tumors. According to Chen et al. [
17], approximately 80 % of the G2 tumors are low grade. Thus, in this study, we could have misclassified 75 patients, and consequently, these patients could be treated insufficiently with subsequently more recurrences.
The c-index of our model was 0.60, 0.62 and 0.63 at year 1, 2, and 5, respectively. This is comparable to the c-index for recurrence probabilities of the EORTC risk Tables (0.66 both at 1 and 5 years) and that of the CUETO scoring model (0.64 both at 1 and 5 years) [
4,
8]. However, the clinical relevance of the c-index is doubtful and there is no consensus how high the c-index should be to make a model clinically relevant.
For a more practical approach, based on their risk factors, we divided the patients in three subgroups: minor, moderate and major risk. As can be seen in Fig.
1, this subdivision is clearly related to recurrence outcome, and thus, the major risk group could be considered for more aggressive treatment and the minor risk group for less aggressive treatment. The relevance of this subdivision is also reflected in the predictive accuracy of our model (ST3). For treatment options, the NPV and PPV are more important than sensitivity and specificity, as this is associated with under- and overtreatment. Compared to the EORTC and CUETO, our model is clearly better in preventing overtreatment in minor risk patients as PPV is much higher (68 % versus 21 and 24 %), but NPV is somewhat lower (65 versus 94 and 92 %) which, however, is less of a problem in minor risk patients [
18]. Additionally, the external validation shows very good overlap in HR. However, as the Spanish cohort only included primary patients, it is in fact a partial external validation. Nevertheless, as agreement between the Dutch subcohort and the Spanish cohort is high, one could hypothesize that these results could be extrapolated to the total model. An external validation with primary and recurrent patients is needed to confirm our results.
Limitations of this study are the long inclusion period, the missing data, and the differences with the current standard of treatment including the quality of TURBT due to, e.g., the introduction of re-TURBT, the introduction of fluorescence cystoscopy, and the lack of immediate POIs in most patients (only 23 % of patients received immediate POI). On the other hand, fluorescence cystoscopy is most useful in CIS, but these patients were excluded in our analyses. In the Dutch cohort, no re-TURBT was done, but this is not always necessary in IR patients. Another limitation is the variability in adjuvant treatment, including the dose, the concentration of chemotherapy used, and the treatment schedule which might have influenced the outcome. Yet, the median number of instillations received was 10, and only 2.1 % of patients received less than six instillations. Furthermore, both European and American guidelines do not recommend specific chemotherapy schedules [
2,
19].
Acknowledgments
The 39 hospitals that participated in the original three studies are as follows: Onze Lieve Vrouwe Gasthuis, Amsterdam; Academic Medical Center, Amsterdam; Hospital Center Apeldoorn, Apeldoorn; Rijnstate Hospital, Arnhem; Wilhelmina Hospital, Assen; Lievensberg Hospital, Bergen op Zoom; Ignatius Hospital, Breda; Medical Center Haagland, Den Haag; Rode Kruis Hospital, Den Haag; St. Deventer Hospital, Deventer; Slingeland Hospital, Doetinchem; Gelderse Vallei, Ede; Catharina Hospital, Eindhoven; Medisch Spectrum Twente, Enschede; Martini Hospital, Groningen; St. Jansdal Hospital, Harderwijk; Jeroen Bosch Hospital, ’s-Hertogenbosch; Hospital Bethesda, Hoogeveen; Medical Center Leeuwarden Noorderbreedte, Leeuwarden; Diaconessen Hospital, Leiden; Leids University Medical Center, Leiden; Rijnland Hospital, Leiderdorp; Zuiderzee Hospital, Lelystad; Diaconessen Hospital, Meppel; Radboud University Nijmegen Medical Center, Nijmegen; Canisius Wilhelmina Hospital, Nijmegen; Pasteur Hospital, Oosterhout; St. Franciscus, Roermond; St. Laurentius, Roermond; St. Franciscus Hospital, Roosendaal; Academic Hospital Rotterdam, Rotterdam; Zuiderzee Hospital, Rotterdam; Elisabeth Hospital, Tilburg; Tweesteden Hospital, Tilburg; Academic Hospital Utrecht, Utrecht; Viecuri, Venlo; Hospital Walcheren, Vlissingen; Streek Hospital Queen Beatrix, Winterswijk; Isala, Zwolle. Furthermore, we would like to thank Christien T.M. Caris and Maria H.D. Janzing-Pastors from CuraTrial SMO & Research, Arnhem, the Netherlands, for collecting the data of the Dutch three studies. Finally, we would like to thank Dr. Sergio Skrobot for collecting the data in Fundacio Puigvert.