1 Introduction
Cost-utility analysis (CUA) is often used to inform whether new treatments or interventions should be reimbursed within a healthcare system. CUA typically measures benefits in terms of quality-adjusted life-years (QALYs), combining the length of life with an index of health-related quality of life (HRQoL), often known as a utility, or values, of patients. Values can be obtained using a variety of direct and indirect methods, including using description of health states associated with a condition (i.e. vignettes), asking patients to value their own health directly or, more commonly, indirect valuation using a preference-based measure (PBM) of health [
1].
PBMs comprise a descriptive system through which health is described, and a value set reflecting strength of preference of members of the general public, or more rarely patients, for the health state described. PBMs can be generic (GPBMs) or condition-specific. While GPBMs can be used to describe health problems relevant across multiple diseases and conditions, condition-specific PBMs typically describe health problems occurring in a specific condition of interest [
1‐
3], often focussing on particular symptoms, aspects of functioning or side effects.
For use in CUA, prominent health technology assessment bodies such as the National Institute for Health and Care Excellence (NICE) in the UK and the Zorginstituut in the Netherlands express a preference for using GPBMs [
4,
5]. In other countries, for example in Spain, the choice of which measure to use for obtaining values for QALYs calculation is left to the analyst, who can consider either GPBM or condition-specific PBM [
6]. There are multiple arguments supporting the use of GPBM, including the fact that GPBMs detect the negative impact of a wide range of comorbidities alongside the positive impact of interventions, as well as avoiding labelling and focusing effects biases [
2]. More importantly, using a single GPBM for all assessments across all conditions allows for cross-program comparability [
7]. The potential concern with using a single GPBM alone is that the chosen GPBM may not be valid and responsive for the group of patients being examined. In those cases, using a different GPBM or condition-specific PBM is necessary [
5].
A limited number of GPBMs dominate the literature [
8], specifically the EQ-5D, the SF-6D, the Health Utility Index mark 3 (HUI3), the Assessment of Quality of Life (AQoL),the 15 Dimensions (15D) and the Quality of Wellbeing Self-Administered (QWB-SA) [
9,
10]. Although there is considerable evidence for their validity and responsiveness in many common medical problems such as skin, respiratory, genitourinary, endocrine, nutritional and metabolic diseases [
7,
11,
12], there is also mixed or inconsistent evidence for all these measures in some specific types of cancers [
7,
11,
13].
Recently, a partnership between the Multi-Attribute Utility in Cancer (MAUCa) Consortium and the European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Group led to the development of the EORTC QLU-C10D [
14]. The EORTC QLU-C10D is a condition-specific PBM derived from the EORTC QLQ-C30, one of the most commonly used patient-reported outcome measures in cancer randomized controlled trials [
15]. Given the widespread use of the EORTC QLQ-C30, the EORTC QLU-C10D might be an alternative when the preferred GPBM is not included in the trial of interest, or a useful measure to perform CUA sensitivity analysis in those cases in which the preferred GPBM reports mixed validity and responsiveness results. This is because the QLU-C10D allows values to be directly estimated from clinical studies that have used the QLQ-C30, without the need for mapping onto GPBMs or additional data collection.
The development of the EORTC QLU-C10D followed two stages. First, the health state classification system was developed to reduce the thirty items of the EORTC QLQ-C30 into 13 items covering ten dimensions [
16]. Subsequently, a valuation method using a discrete choice experiment (DCE) was developed [
17], which was then used to generate a value set from members of the general public of Australia [
18]. This valuation method is valid and has been increasingly used due to its ease of application and reduction in data collection burden [
19,
20].
For the conduct of CUAs, it is recommended to use country-specific value sets, as differences in preferences across countries might substantially alter the values obtained [
21‐
23] and consequently cost-effectiveness estimates. Hence, valuation studies for the EORTC QLU-C10D have been completed, or are currently being undertaken, in numerous countries, including Canada [
24], Germany [
14], the United Kingdom [
25], Austria, France, Italy, Poland [
26], the Netherlands, the United States [
14] and Singapore [
27]. The current study reports on the EORTC QLU-C10D valuation in Spain. Preferences were elicited from a representative sample of the Spanish general population, replicating the methods employed by King and colleagues [
18].
3 Results
3.1 Characteristics of the Sample
Of the 1625 participants who were invited to the study, 17 (1%) declined to participate, 236 (14.5%) entered the web link but dropped out before completing the survey (e.g. did not give informed consent, quit before the beginning of the valuation component or did not complete the valuation component, etc.) and 362 (22.3%) were excluded as their respective quotas were already full. The remaining 1010 participants completed the study. The median time taken to complete the survey progressively decreased from 33 seconds in choice set 1–9 s in choice set 16. In line with previous studies, there was variability in the time taken per choice set between respondents [
18,
25]. Appendix Figure 1 reports the median and percentile time stamps (i.e. median, fifth, twenty-fifth, seventy-fifth and ninety-fifth percentiles of time taken in seconds to complete each progressive choice set, i.e. first, second, third etc.) for the Spanish sample (see electronic supplementary material [ESM]).
Table
2 presents the sociodemographic characteristics of the included sample. The sample had an equal distribution between males (
n = 500, 49.5%) and females (
n = 510, 50.5%) and good representation of all age groups of the general population. Of the 1010 participants interviewed, 347 stated they had a chronic disease (34.4%). This was lower and statistically significantly different than the percentage of people with chronic disease in the general Spanish population. Regarding education, the most common responses were a university education (
n = 461, 45.7%), completed high school (
n = 319, 31.6%) and completed only compulsory education (
n = 230, 22.7%). This was statistically significantly different from the percentages found in the Spanish general population, where a higher number of compulsory education and a lower number of high school and university education responders were found.
Table 2
Background characteristics of participants
Gender | | | | |
Male | 500 | 49.5 | 49.4 | 0.977 |
Female | 510 | 50.5 | 50.6 |
Age (years) | | | | |
18–29 | 163 | 16.1 | 15.9 | 0.862 |
30–39 | 184 | 18.2 | 18.4 | 0.870 |
40–49 | 216 | 21.4 | 21.6 | 0.877 |
50–59 | 192 | 19.0 | 18.9 | 0.935 |
60–69 | 143 | 14.2 | 14.2 | 1.000 |
70–80 | 112 | 11.1 | 11.0 | 0.919 |
Chronic diseases | | | | |
Yes | 347 | 34.4 | 41.1 | < 0.000 |
No | 663 | 65.6 | 58.1 |
Education | | | | |
Compulsory | 230 | 22.7 | 45.0 | < 0.000 |
Higher secondary | 319 | 31.6 | 22.0 | < 0.000 |
University | 461 | 45.7 | 33.0 | < 0.000 |
EQ-5D-5L | | | | |
Mobility Level 1 | 809 | 80.1 | | |
Mobility Level 2 | 141 | 14.0 | | |
Mobility Level 3 | 41 | 4.0 | | |
Mobility Level 4 | 15 | 1.5 | | |
Mobility Level 5 | 4 | 0.4 | | |
Selfcare Level 1 | 934 | 92.5 | | |
Selfcare Level 2 | 44 | 4.3 | | |
Selfcare Level 3 | 29 | 2.9 | | |
Selfcare Level 4 | 2 | 0.2 | | |
Selfcare Level 5 | 1 | 0.1 | | |
Usual Activities Level 1 | 845 | 83.6 | | |
Usual Activities Level 2 | 112 | 11.1 | | |
Usual Activities Level 3 | 34 | 3.4 | | |
Usual Activities Level 4 | 14 | 1.4 | | |
Usual Activities Level 5 | 5 | 0.5 | | |
Pain/Discomfort Level 1 | 564 | 55.8 | | |
Pain/Discomfort Level 2 | 326 | 32.3 | | |
Pain/Discomfort Level 3 | 91 | 9.0 | | |
Pain/Discomfort Level 4 | 26 | 2.6 | | |
Pain/Discomfort Level 5 | 3 | 0.3 | | |
Anxiety/Depression Level 1 | 675 | 66.8 | | |
Anxiety/Depression Level 2 | 230 | 22.8 | | |
Anxiety/Depression Level 3 | 75 | 7.4 | | |
Anxiety/Depression Level 4 | 23 | 2.3 | | |
Anxiety/Depression Level 5 | 7 | 0.7 | | |
Kessler K-10 score, mean (SD) | 20.1 | (7.5) | | |
3.2 DCE Feedback Module
Table
3 reports the results of the questionnaire regarding survey difficulty. As can be seen, most responders (58.2%,
n = 587) found the survey easier or the same as other surveys. Of the 1010 participants, 235 considered the presentation of the health states as very clear (23.3%), 445 as clear (44.1%), 189 as neither clear nor unclear (18.7%), 106 as unclear (10.5%) and 35 as very unclear (3.5%). Choosing between health states was considered easy or very easy by 340 responders (33.7%), neither easy nor difficult by 302 (29.9%) and difficult or very difficult by 368 responders (36.4%). Most of the participants (31.7%,
n = 320) explained that they considered only the aspects that differ between choice options (i.e. highlighted in yellow), 187 (18.5%) stated that they considered all the aspects and 228 (22.6%) stated that they considered most of the aspects.
Table 3
Difficulty of survey
Was the survey easier or harder than most surveys? | Easier | 147 | 14.6 |
The same | 440 | 43.6 |
Harder | 391 | 38.7 |
Could not say | 32 | 3.2 |
Was the presentation of health states clear? | Very clear | 235 | 23.3 |
Clear | 445 | 44.1 |
Neither clear nor unclear | 189 | 18.7 |
Unclear | 106 | 10.5 |
Very unclear | 35 | 3.5 |
How difficult it was to choose between pairs of health states? | Very easy | 108 | 10.7 |
Easy | 232 | 23.0 |
Neither easy or difficult | 302 | 29.9 |
Difficult | 314 | 31.1 |
Very difficult | 54 | 5.3 |
Did you have a strategy for choosing between the health states? | Considered all aspects | 187 | 18.5 |
Considered most of the aspects | 228 | 22.6 |
Considered only aspects in yellow | 320 | 31.7 |
Considered only a few aspects | 161 | 15.9 |
Other | 43 | 4.3 |
Did not have a strategy | 71 | 7.0 |
3.3 Generalized Estimating Equation Utility Decrements (Non-imposed Monotonicity)
Table
4 reports the generalized estimating equation
β coefficients and associated utility decrements (i.e.
β/
α) for the EORTC QLU-C10 without adjustments for inconsistencies. The significance of coefficients indicate that the attribute level had a statistically significant impact on the responders’ choice. The sign of the coefficients indicates whether this impact was positive or negative.
Table 4
Generalized estimating equation QLU C10D utility decrements (not adjusted for non-monotonicities)
Time coefficient (α) | (linear) | | | 0.562 |
Physical functioning | 2 | − 0.051** | 0.021 | − 0.090 |
Physical functioning | 3 | − 0.091** | 0.022 | − 0.163 |
Physical functioning | 4 | − 0.143** | 0.019 | − 0.254 |
Role functioning | 2 | − 0.002 | 0.017 | − 0.003 |
Role functioning | 3 | − 0.059** | 0.017 | − 0.104 |
Role functioning | 4 | − 0.060** | 0.015 | − 0.107 |
Social functioning | 2 | -0.014 | 0.015 | − 0.024 |
Social functioning | 3 | − 0.052** | 0.015 | − 0.093 |
Social functioning | 4 | − 0.048** | 0.015 | − 0.085 |
Emotional functioning | 2 | − 0.007 | 0.015 | − 0.013 |
Emotional functioning | 3 | − 0.022* | 0.016 | − 0.039 |
Emotional functioning | 4 | − 0.044** | 0.014 | − 0.078 |
Pain | 2 | − 0.015 | 0.015 | − 0.026 |
Pain | 3 | − 0.066** | 0.017 | − 0.117 |
Pain | 4 | − 0.098** | 0.015 | − 0.174 |
Fatigue | 2 | − 0.030** | 0.014 | − 0.053 |
Fatigue | 3 | − 0.034** | 0.016 | − 0.061 |
Fatigue | 4 | − 0.043** | 0.014 | − 0.077 |
Sleep disorders | 2 | 0.003 | 0.015 | + 0.006 |
Sleep disorders | 3 | − 0.007 | 0.016 | − 0.013 |
Sleep disorders | 4 | − 0.017* | 0.013 | − 0.031 |
Lack of appetite | 2 | − 0.022* | 0.014 | − 0.038 |
Lack of appetite | 3 | − 0.033** | 0.015 | − 0.059 |
Lack of appetite | 4 | − 0.025* | 0.014 | − 0.045 |
Nausea | 2 | − 0.037** | 0.015 | − 0.066 |
Nausea | 3 | − 0.059** | 0.015 | − 0.105 |
Nausea | 4 | − 0.051** | 0.014 | − 0.091 |
Bowel problems | 2 | − 0.023* | 0.015 | − 0.042 |
Bowel problems | 3 | − 0.047** | 0.015 | − 0.084 |
Bowel problems | 4 | − 0.049** | 0.013 | − 0.087 |
As can be seen, for all attributes, level 4 was statistically significant, but level 2 and 3 of some attributes were not. More precisely, five attributes (i.e. physical functioning, fatigue, lack of appetite, nausea and bowel problems) registered negative and statistically significant coefficients for level 2, level 3 and level 4, four attributes for level 3 and level 4 only (i.e. role functioning, social functioning, emotional functioning and pain) and one attribute (i.e. sleep) for level 4 only.
The attribute resulting in the largest utility decrements was physical functioning, ranging between − 0.090 of level 2 and − 0.254 of level 4. The second largest utility decrements were associated with pain, that is, level 3 (− 0.117) and level 4 (− 0.174). Five attributes reported utility decrements < 0.1 for all their severity levels, and these were social functioning, emotional functioning, fatigue, sleep and bowel problems. Among them, the attribute with the smallest utility decrement was sleep (i.e. only level 4 had a statistically significant decrement and this was − 0.031).
Three of the EORTC QLU-C10 attributes were not monotonically decreasing (i.e. larger decrements associated with smaller levels of severity); these were social functioning, lack of appetite and nausea. For all three attributes, decrements were larger for level 3 than for level 4.
3.4 Generalized Estimating Equation Utility Decrements (Imposed Monotonicity)
Table
5 presents the generalized estimating equation
β coefficients and associated utility decrements (i.e.
β/
α) for the EORTC QLU-C10 in presence of adjustments for monotonicity. Appendix Figure 2 presents a graph showing the utility decrements per dimension (see ESM). Once again, the significance of coefficients indicates that the attribute level had a significant impact on the responders’ utility, while the sign indicates the direction of this impact.
Table 5
Generalized estimating equation QLU C10D utility decrements (adjusted for non-monotonicities)
Time coefficient (α) | (linear) | | | 0.560 |
Physical functioning | 2 | − 0.050** | 0.021 | − 0.089 |
Physical functioning | 3 | − 0.091** | 0.022 | − 0.162 |
Physical functioning | 4 | − 0.142** | 0.019 | − 0.254 |
Role functioning | 2 | − 0.002 | 0.017 | − 0.003 |
Role functioning | 3 | − 0.058** | 0.017 | − 0.104 |
Role functioning | 4 | − 0.060** | 0.015 | − 0.107 |
Social functioning | 2 | − 0.013 | 0.015 | − 0.023 |
Social functioning | 3 | − 0.049** | 0.014 | − 0.087 |
Social functioning | 4 | − 0.049** | 0.014 | − 0.087 |
Emotional functioning | 2 | − 0.007 | 0.015 | − 0.013 |
Emotional functioning | 3 | − 0.021* | 0.016 | − 0.037 |
Emotional functioning | 4 | − 0.044** | 0.014 | − 0.078 |
Pain | 2 | − 0.015 | 0.015 | − 0.027 |
Pain | 3 | − 0.066** | 0.017 | − 0.118 |
Pain | 4 | − 0.098** | 0.015 | − 0.175 |
Fatigue | 2 | − 0.030** | 0.014 | − 0.053 |
Fatigue | 3 | − 0.034** | 0.016 | − 0.061 |
Fatigue | 4 | − 0.043** | 0.014 | − 0.076 |
Sleep disorders | 2 | 0.000 | 0.000 | 0.000 |
Sleep disorders | 3 | − 0.008 | 0.014 | − 0.015 |
Sleep disorders | 4 | − 0.018* | 0.012 | − 0.033 |
Lack of appetite | 2 | − 0.020* | 0.014 | − 0.036 |
Lack of appetite | 3 | − 0.028** | 0.013 | − 0.050 |
Lack of appetite | 4 | − 0.028** | 0.013 | − 0.050 |
Nausea | 2 | − 0.036** | 0.015 | − 0.064 |
Nausea | 3 | − 0.054** | 0.012 | − 0.096 |
Nausea | 4 | − 0.054** | 0.012 | − 0.096 |
Bowel problems | 2 | − 0.024* | 0.015 | − 0.043 |
Bowel problems | 3 | − 0.047** | 0.015 | − 0.084 |
Bowel problems | 4 | − 0.049** | 0.013 | − 0.087 |
Similarly to the analysis without monotonicity adjustment, coefficients of level 4 were statistically significant for all attributes, but only five of the ten attributes reported significant coefficients in all their levels. These were physical functioning, fatigue, lack of appetite, nausea and bowel problems. The attribute with the largest decrements was physical functioning, followed by pain, while the one with the smallest decrements was sleep.
The collapsed utility decrements for the three attributes that were not monotonically decreasing in the unadjusted analysis were −0.087 for social functioning level 3 and level 4, − 0.096 for nausea level 3 and level 4 and −0.050 for lack of appetite level 3 and level 4.
3.5 Mixed Logit Utility Decrements (Imposed Monotonicity)
Table
6 presents the mixed logit
β coefficients and associated utility decrements (i.e.
β/
α) for the EORTC QLU-C10 in the presence of adjustments for monotonicity. The Akaike information criterion (AIC) was 15,594, the Bayesian information criterion (BIC) 16,013 and the log likelihood − 7747. Once again, the significance of coefficients indicate that the attribute level had a significant impact on the responders’ utility, while the sign indicates the direction of this impact.
Table 6
Mixed logit QLU C10D utility decrements (adjusted for non-monotonicities)
Time coefficient (α) | (linear) | | | | | 1.182 |
Physical functioning | 2 | − 0.137** | 0.018 | 0.127** | 0.021 | − 0.116 |
| 3 | − 0.163** | 0.020 | 0.158** | 0.023 | − 0.138 |
| 4 | − 0.216** | 0.018 | 0.181** | 0.019 | − 0.183 |
Role functioning | 2 | − 0.013 | 0.017 | 0.139** | 0.023 | − 0.011 |
| 3 | − 0.108** | 0.015 | 0.126** | 0.025 | − 0.092 |
| 4 | − 0.108** | 0.015 | 0.126** | 0.025 | − 0.092 |
Social functioning | 2 | 0.0 | | | | 0.000 |
| 3 | − 0.092** | 0.012 | 0.113** | 0.021 | − 0.078 |
| 4 | − 0.092** | 0.012 | 0.113** | 0.021 | − 0.078 |
Emotional functioning | 2 | − 0.000 | 0.017 | 0.161** | 0.024 | 0.000 |
| 3 | − 0.032 | 0.017 | 0.108** | 0.028 | − 0.027 |
| 4 | − 0.113** | 0.016 | 0.141** | 0.020 | − 0.095 |
Pain | 2 | − 0.004 | 0.017 | 0.138** | 0.027 | − 0.006 |
| 3 | − 0.125** | 0.017 | 0.145** | 0.024 | − 0.105 |
| 4 | − 0.177** | 0.016 | 0.247** | 0.021 | − 0.149 |
Fatigue | 2 | − 0.058** | 0.016 | 0.092** | 0.033 | − 0.049 |
| 3 | − 0.070** | 0.017 | 0.066* | 0.034 | − 0.059 |
| 4 | − 0.089** | 0.016 | 0.144** | 0.022 | − 0.076 |
Sleep disturbances | 2 | 0 | | | | 0 |
| 3 | 0 | | | | 0 |
| 4 | − 0.045** | 0.013 | 0.082* | 0.028 | − 0.038 |
Appetite loss | 2 | − 0.038* | 0.015 | 0.077** | 0.029 | − 0.032 |
| 3 | − 0.052** | 0.014 | 0.052** | 0.018 | − 0.044 |
| 4 | − 0.052** | 0.014 | 0.052** | 0.018 | − 0.044 |
Nausea | 2 | − 0.028 | 0.016 | − 0.086* | 0.045 | − 0.024 |
| 3 | − 0.086** | 0.017 | 0.114** | 0.022 | − 0.073 |
| 4 | − 0.099** | 0.015 | 0.093** | 0.028 | − 0.084 |
Bowel problems | 2 | − 0.019 | 0.016 | 0.107** | 0.027 | − 0.016 |
| 3 | − 0.063** | 0.017 | 0.122** | 0.028 | − 0.053 |
| 4 | − 0.065** | 0.015 | 0.098** | 0.026 | − 0.055 |
Similarly to the generalized estimating equation analysis with adjustment for monotonicity, coefficients of level 4 were statistically significant for all attributes. However, only four of the ten attributes reported significant coefficients in all their levels. These are physical functioning, fatigue, lack of appetite and nausea. The attribute with the largest decrement for level 4 was physical functioning, followed by pain, while the one with the smallest decrement was sleep.
Utility decrements of the generalized estimating equation and the mixed logit were generally of the same size and in the same direction, highlighting a high degree of consistency between the two analyses. The value range of the generalized estimating equation is slightly larger as evidenced by the generally larger utility decrements associated with level 4.
3.6 Sensitivity Analyses
Appendix table 1 in the ESM presents the results of the QLU C-10 utility decrements when estimated using the conditional logit model. As it can be seen, differences in utility decrements between the methods were ≤ 0.01 in absolute size and were not systematic. In the conditional logit model, role functioning level 4 was inconsistent, while this was not the case for the generalized estimating equation. The remaining inconsistencies occurred in both models.
The second sensitivity analysis tested the exclusion of responders who reported only focusing on a subset of dimensions. This sensitivity analysis had a non-systematic and modest effect on the value set (results available from the authors on request).
3.7 QLU-C10D Health State Value Estimation
Utility decrements from the monotonically ordered generalized estimating equation model can be used for the calculation of the utility associated with different EORTC QLU-C10 health states. This is done by subtracting the utility decrements associated with each of the item levels from 1 (i.e. full health). For example, EORTC QLU-C10 state 2411111111 would be calculated as: 1 − 0.089 − 0.107 − 0 − 0 − 0 − 0 − 0 − 0 − 0 – 0 = 0.804.
The utility associated with the PITS state (i.e. level 4 in all attributes), which indicates the worst possible attainable health, is − 0.043. This is similar to other international studies (e.g. [
18,
25]). Of the 1,048,0256 described by the QLU-C10, 316 are worse than death based on the Spanish value set.
4 Discussion
This study collected preferences for a set of EORTC QLU-C10D health states to derive a value set representative of the Spanish adult population. For all attributes, statistically significant decrements were associated with the worst level (level 4), reflecting that people have a strong preference to not live with this degree of dysfunction or symptom severity. For five of the attributes, namely physical functioning, fatigue, lack of appetite, nausea and bowel problems, decrements were statistically significant for all levels, while for four attributes, namely role, social and emotional functioning and pain, decrements were statistically significant for level 3 and 4 but not for level 2. Lack of monotonicity occurred in three attributes, but this was adjusted through collapsing the two dimensions levels and re-estimating the model. The monotonically ordered generalized estimating equation model represents a consistent tariff that can be used for the derivation of health state values to use in economic evaluations as a second-best option when the preferred GPBM is not included in the trial of interest, or as a useful alternative to performing CUA sensitivity analyses.
The rank order of QLU-C10 items found in this study is in line with those of previous QLU-C10 valuations, with generic items generally receiving larger weights compared with cancer-specific items. Among the generic items, physical functioning reported the largest decrements followed by pain and role functioning. These findings are in line with those in Germany [
14], Austria, Italy, Poland [
26] and France [
39]. Among the cancer-specific items, nausea reported the largest utility decrement followed by bowel problems, which mimicked the same rank order found in the Austrian valuation study [
16].
In the current study, weights for three items did not decrease monotonically. Lack of monotonicity for some of the QLU-C10 items was observed also in other valuation studies, for example those of Austria, Australia and the Netherlands, as well as in numerous valuations of other PBMs (e.g. [
40‐
43]). There are multiple possible explanations for the observed inconsistencies, such as, among the others, the size of the descriptive system (i.e. number of attributes) in relation to the sample of the study (i.e. number of participants), the wording of the attributes (i.e. responders’ understanding of the descriptors and labels), the administration mode (i.e. engagement of participants in online surveys) or the fact that those items do not have a stable impact on utility. While the current study accounted for the lack of monotonicity by collapsing levels 3 and 4, therefore generating a consistent tariff, the reasons underlining the observed inconsistencies are of interest and should be investigated in future research.
Two of the three cancer-specific items, fatigue and lack of appetite, were associated with small weights, a result that was in line with the attribute decrements of the QLU-C10 found in Australia and Germany [
14,
18] and with preferences registered in other PBM experimental studies [
44]. A possible explanation for this might be that members of the general public do not have experience of cancer symptoms, and how these severely impact the HRQoL of patients [
45‐
47], or may have some experience of those symptoms, but at a lesser degree of intensity. Valuation studies eliciting values from cancer patients are currently ongoing. Results of these studies will inform on whether the small weights associated with cancer attributes depend on the population performing the valuation task or the comparative relevance of these attributes in explaining HRQoL against generic attributes.
In the current study, two different modelling approaches were employed, namely generalized estimating equation and mixed logit. The models reported similar mean utility decrements. We chose to estimate the QLU-C10 tariff using a generalized estimating equation, as in economic evaluation mean responses are more important than variability in preferences between different groups (i.e. preference heterogeneity) [
18].
The current study developed a value set for the QLU-C10, a condition-specific PBM. Value sets for a number of other condition-specific PBMs exist, including those for the Amyotrophic Lateral Sclerosis utility index (ALS) [
48], the Health Assessment Questionnaire (HAQ) [
49], the Asthma Quality of Life Questionnaire (AQLQ) [
50], the Exacerbation Utility (Exact U) [
51], the Multi Sclerosis Impact Scale (MSIS) [
49], the NEWQoL 8D [
41] and the Dementia Quality of Life measure (DEMQoL) [
42]. Valuation studies of condition-specific PBMs differ substantially in terms of the chosen preference elicitation technique (discrete choice experiment, time trade-off, standard gamble, rating scale etc.), the population providing preference weights (general public, patients, professionals etc.) and the country in which preference weights were obtained [
52]. These choices have an impact on the relevance of the value sets for different application contexts. For example, a value set obtained from general public responders may be preferred over a value set obtained from patients by HTA bodies that take the payer perspective, while patients’ index scores may be better suited to investigations of large patient registries, population health studies and for personalized medicine [e.g.
53]. The Spanish value set for the QLU-C10 was developed to facilitate the conduct of economic evaluation in Spain, where condition-specific PBM values are accepted and used [
6].
This study has some limitations. First, it used an online administration procedure. While this has the advantage of being cheaper and less time consuming than face-to-face administration, it may be associated with poorer quality of data due to responders’ reduced engagement with the task and strategic behaviours [
54‐
56]. To assess whether this occurred, time stamps were investigated, without finding any relevant problem. Yet, this possibility cannot be entirely ruled out. Second, the colour coding approach used a yellow colour for the non-overlapping items. Currently, there is a debate in the DCE literature with some supporting this approach, and others championing intensity colour coding as a better alternative [
57]. While the approach adopted for task presentation is reasonable, it is important to acknowledge that other colour coding variants might have been used. Third, the sample reported higher education levels than the Spanish general population. From a theoretical point of view, this may have an impact on the values obtained, for example because of different understanding of the task, different priorities and different underlying health of responders with varying education levels. Yet, departures in the sample representativeness such as those reported in this study are similar to the ones of other valuation studies [
58]. Moreover, there is evidence that educational levels do not substantially impact values when elicited through time trade-off [
59,
60], albeit this evidence is not available for DCE. Fourth, in the DCE, the physical functioning domain was presented as two attributes (related to short walk and long walk) to simplify the wording of the attribute. However, the underlying design treated it as a single attribute. This may have had an impact on the values obtained, resulting in a larger decrement associated with physical functioning. Despite these limitations, this study also has important strengths. It generated the first value set for the QLU-C10 based on preferences elicited from the Spanish population. The availability of a value set for this population allows for a more theoretically sound alternative to the use of mapping techniques from non-PBMs to PBMs and from PBMs to non-PBMs [
24]. It also allows easy access to values for end users, researchers and policy makers. Furthermore, by being part of a broader European programme of research, the current study allows comparative assessments of preferences for health states relevant to cancer populations across European countries. Finally, the methods and the DCE design employed in the current research have been previously tested in an experimental study [
17] and have been already used for the estimation of QLU-C10 preference weights in different European countries, which increases the confidence in the results obtained.