Introduction
Prostate cancer is the most common cancer in Europe among men [
1]. In 2012, there were 417,000 new cases of prostate cancer in Europe, representing 12.1 % of all new cancers [
1]. The economic burden associated with this high incidence is substantial. For example, the combined cost of direct healthcare, informal care and productivity loss associated with prostate cancer was estimated at €7,848 million in the European Union, in 2009 [
2].
Despite 80–90 % of metastatic prostate cancer patients responding to androgen deprivation therapy [
3], progression to castration-resistant disease occurs in most patients after 2–3 years, with a subsequent survival time of 24–48 months [
4]. The health-related quality of life (HRQoL) of patients with prostate cancer declines substantially toward the end of life [
5]. Therefore, treatment of metastatic castration-resistant prostate cancer (mCRPC) is mainly palliative, with the aim of prolonging survival, relieving symptoms and improving HRQoL. The European Association of Urology guidelines recommend docetaxel as first-line chemotherapy, together with corticosteroids, for the treatment of symptomatic mCRPC [
6]. Bisphosphonates are prescribed for the management of metastatic bone disease (present in >90 % of patients with mCRPC [
7]) to prevent skeletal-related events and improve symptom control [
6]. Radionuclides, radiotherapy and analgesics may also be considered for the management of bone pain [
6].
Cost-efficacy is important in the technical evaluation of new therapies by reimbursement agencies. Generic preference instruments, such as the EuroQol-5D (EQ-5D) [
8], can aid decision makers in resource allocation. These instruments generate health state utilities that can be used to compare quality-adjusted life years gained for interventions across different patient groups and diseases. However, measurements of HRQoL in clinical trials often use disease-specific instruments that address outcomes important to a particular patient population, thus limiting their usefulness in cost-utility analyses. One solution is to derive validated algorithms that map scores from disease-specific HRQoL instruments onto generic preference instruments. This approach has been accepted by bodies such as the UK’s National Institute for Health and Care Excellence, who specifically require EQ-5D utility values as part of health technology assessment submissions [
9].
In line with such requirements, an increasing number of strategies for mapping disease-specific responses to preference-based instruments have been published. A database held at the Health Economics Research Centre, Oxford University, lists ninety studies of statistical mapping to predict EQ-5D utilities [
10]. However, only one of these focused on mCRPC. Furthermore, 17 % of models were based on less than 200 observations and 39 % of studies used ordinary least squares (OLS) regression as the single statistical tool.
The mCRPC study included in the database demonstrated the feasibility of mapping the functional assessment of cancer therapy-prostate (FACT-P) questionnaire, which specifically measures the HRQoL of prostate cancer patients, to EQ-5D scores [
11]. However, application of the algorithm to an external data set was found to yield mean EQ-5D values greater than 1 [
12], and the algorithm requires a correction applicable to a truncated linear model [
13].
Therefore, a requirement remains for the development of a mapping function that adequately predicts EQ-5D utility values based on responses to FACT-P. In this article, we describe the construction of a prediction model using data obtained from a large, cross-sectional, observational study in patients with mCRPC. Furthermore, we assess the performance of four regression models to predict EQ-5D utility values from responses to the FACT-P questionnaire. In addition to OLS, Tobit, median and Gamma regression models were included to account for ceiling effects and to anticipate any violations of normality and homoscedasticity.
Methods
Study sample and data collection
Data were derived from a cross-sectional, observational study conducted in six countries: Belgium, France, Germany, Sweden, the Netherlands and the UK. The study enrolled male patients aged ≥18 years presenting with mCRPC at 47 specialist prostate cancer centers during a 10-month recruitment period. Consecutive patients who visited the clinic during regular follow-up visits were invited to participate. Patients were eligible for inclusion in the study if they had a histologically or cytologically confirmed diagnosis of adenocarcinoma of the prostate; prostate cancer progression documented by prostate-specific antigen according to Prostate Cancer Working Group 2 (PCWG2) criteria or radiographic progression, and disease progression despite surgical or medical castration [a testosterone level of <50 ng/dL (<1.735 nM) was required if testosterone levels were routinely measured]. Exclusion criteria included participation in any investigational drug study or any expanded access program during the observation period. Patients’ HRQoL was assessed at the inclusion visit by utilizing the EQ-5D and FACT-P questionnaires.
Study instruments
FACT-P is a questionnaire that has been validated to estimate HRQoL in men with prostate cancer [
14]. The tool comprises the 27-item FACT-General (FACT-G) questionnaire, which measures HRQoL in cancer patients, and a 12-item prostate cancer subscale, designed to measure prostate cancer-specific HRQoL. The FACT-P is scored by adding the subscales of the FACT-G plus the prostate cancer subscale to yield a comprehensive HRQoL score.
The EQ-5D comprises five domains, which measure general health status: mobility, self-care, usual activities, pain/discomfort and anxiety/depression. In this study, the ‘3l,’ rather than the ‘5l’ version of the tool was used, which subdivides each domain into three, rather than five levels. The EQ-5D provides a simple descriptive profile and a single utility index of health status and is widely used in health economic analyses [
8].
Model specifications—statistical analysis
Utility values were derived from EQ-5D profiles based on a UK-specific EQ-5D value set. The mapping exercise was conducted using responses from patients from multiple countries, and UK preference weights were applied.
The predictive validity of the five FACT-P subscales, patient demographics, comorbidities and prior chemotherapy for utility values was tested using four different regression models: (1) OLS regression was used to construct linear prediction models of EQ-5D, describing differences in mean EQ-5D as a function of mean patient characteristics; (2) median regression was used to describe differences in median health status; (3) generalized linear models (GLM) with log link and Gamma family predicting EQ-5D disutility (where disutility = 1—utility), which allows for skewed distribution of utility values and prevents prediction of utilities >1; (4) the Tobit model, also called a censored regression model, designed to estimate linear relationships between variables when there is either left censoring or right censoring in the dependent variable.
Model validation and predictive ability
A prediction model usually performs better with the data that were used in its development. Therefore, it is critical to evaluate how well the model works with other data sets. Similar to Wu et al. [
11], we estimated the cross-validation
R
2 as the primary indicator of prediction model performance. Tenfold cross-validation techniques were employed to derive goodness of fit statistics. To calculate the cross-validation model performance indicators, the study sample was first divided into 10 equally sized groups. Each group was used successively to test each model, and the remaining 90 % of the sample were used to fit the prediction model. The resulting estimated prediction model was then used to estimate the performance of the original 10 % of the sample. Finally, the estimated error terms were pooled to estimate the overall performance of the model. Additionally, the root mean square error and the mean absolute deviation were generated.
Predictive ability was assessed by comparing observed and predicted EQ-5D scores for three patient subgroups that were defined according to the chemotherapy status of patients at study inclusion: chemotherapy-naïve, undergoing chemotherapy and previously treated with chemotherapy.
Discussion
We have developed an algorithm to map FACT-P, a disease-specific instrument, to EQ-5D, a generic preference instrument, based on data collected from mCRPC patients. OLS was the best-performing model. It explained 61.2 % of the variation in EQ-5D values following tenfold cross-validation and provided good concordance between actual and mapped EQ-5D utility scores in predictive assessment.
A previous study demonstrated the feasibility of mapping FACT-P to EQ-5D scores [
11]. However, the equation cannot be used without correction [
13]. Furthermore, it has been reported that linear regressions may not always accurately predict the EQ-5D distribution for high and low EQ-5D values [
15,
16]. Our data tend to confirm this observation; the mapping formula tended to overpredict utility at the lower end of the scale (below 0.4; Fig.
1). Additionally, linear regression may not account for the bounded nature of the EQ-5D, leading to implausible estimates outside of the possible range of values (1 to −0.594). We used a range of regression models to estimate EQ-5D utility values. Tobit regression was included in our analyses to account for the ceiling effect, as it allows for censored dependent variables, and censored the predicted values at 1. However, the Tobit model operates poorly if assumptions of normality and homoscedasticity are violated [
17]. Median regression does not rely on these assumptions. However, it has been reported that median regression, while not explicitly dealing with censoring, is equivalent to censored least absolute deviation (used by other mapping studies) when censoring occurs in less than 50 % of the study sample [
18]. We also used a generalized linear model (Gamma regression) to account for any skewed distribution of utility values and prevent prediction of utilities >1.
Over the past decade, there has been an increase in the number of studies that have mapped disease-specific responses to preference-based instruments. In addition to the Oxford database [
10], a recent literature review identified ten studies that used mapping methods to determine utilities from two cancer-specific instruments (QLQ-C30 and FACT) [
19], of which only one study focused on mCRPC patients [
11].While statistical models differed across the ten studies, most employed an OLS method and did not conduct an out-of-sample validation. Most studies also used the statistical significance of the coefficients corresponding to different components of the HRQoL scale to determine which variables should be retained in the final model, with a parsimonious approach to final model selection. In doing so, most studies reported that OLS regression performed best, irrespective of its strict assumptions. All of the reviewed studies reported the models’ explanatory power in terms of R
2, with a range of values between 0.417 and 0.909.
In our study, OLS regression performed equally well to the median and Tobit models in predicting utility scores, with an
R
2 (0.612) in the middle of the range reported by the recent literature review [
19]. In view of the relative simplicity of applying OLS regression formulae to other datasets, this was retained as our final model.
The patients included in this study provided FACT-P and EQ-5D data in line with those previously reported. For example, Sullivan et al. measured FACT-P and EQ-5D scores in 280 patients with a mean time from initial diagnosis of prostate cancer to diagnosis of mCRPC of 3.51 years and mean time from diagnosis of mCRPC to study entry of 1.5 years [
20]. FACT-P prostate cancer subscale scores ranged from 27.3 to 30.7 and the EQ-5D utility ranged from 0.527 to 0.750 across the seven countries included. These compare to a mean FACT-P prostate cancer subscale of 29.7 and mean EQ-5D utility of 0.66 in our study. In both studies, the FACT-P scores and EQ-5D utility index indicate the significant impact of prostate cancer on patients’ HRQoL.
The mapping exercise in our study was similar to that published previously by Wu et al. [
11]. Both studies used data from multinational studies and applied UK preference weights. Mean observed values for EQ-5D and FACT-P were very similar in both studies (EQ-5D:0.66 and 0.64; FACT-P: 104 and 105 for the present study and Wu et al., respectively). In addition to the OLS and median regression employed by Wu et al., we explored two additional statistical models. However, in both studies, OLS was retained as the best-performing model. The estimates of the coefficients of the FACT-P subscales based on the OLS model were in similar directions, with a high weight assigned to the PWB subscale in both studies. However, the larger sample size in this present study (
N = 602) compared with Wu et al. (
N = 280) may allow for the generation of more precise parameter estimates.
The limitations of our study include the derivation of utility values using the UK-specific EQ-5D value set. Algorithms developed using country-specific preference weights may account for differences in preferences arising from cultural influences, and value sets should be appropriate to the economic analysis required. The extent to which our algorithm can be generalized is strengthened by the multinational nature of the population that was used. However, further analysis is required to validate the algorithm in other populations.
Although regression-based approaches are commonly used to map HRQol instruments, a recent publication by Fayers et al. (2014) suggests that such approaches may result in biased estimates as a result of regression to the mean [
20].
Disease-specific instruments have been developed to address aspects of health-related outcomes that are important to specific patient populations and can overcome the limitation of generic instruments, which may lack the responsiveness to detect meaningful differences in HRQoL. However, some studies have found that OLS regression tends to overestimate the true value of EQ-5D utilities for patients in poor health, while underestimating the true EQ-5D utilities at the upper end of the scale [
16,
21‐
23]. Such considerations reinforce the use of a preference-based measure when assessing HRQoL in clinical trials. Nevertheless, our analysis provides an algorithm that can effectively translate FACT-P scores to generic utility values.
This study has developed an algorithm for mapping EQ-5D index scores from FACT-P. The algorithm was found to have good predictive ability, with a high degree of correlation between observed and predictive EQ-5D-based utility scores in defined subgroups of patients with mCRPC. The algorithm provides an instrument for the calculation of appropriate preference-based HRQoL scores for use in analyses of interventions for mCRPC when a generic measure is not available.