Background
There has long been a concern that considerable bias in survey research can stem from respondents providing answers that are partly determined by social influences, in particular social desirability
[
1]. While the influence of social desirability bias has been found to vary according to the survey method, telephone and personal interviews have been found to be particularly prone to socially desirable responding
[
1‐
3]. Hence, social desirability bias may be a major threat to the validity of self-report outcomes data. Although there are several elements to its conceptualization, social desirability bias can generally be described as a response style exhibited by respondents who endorse items that represent traits and/or behaviors that they think stand for a socially acceptable or endorsed position
[
4]. Further, it can be differentiated between two dimensions: 1) the need for social approval, i.e. creation of a positive impression of oneself to receive approval from others (impression management), and 2) self-deception or defensiveness, i.e. avoidance of disapproval by denying socially undesirable traits and/or behaviors
[
5‐
8]. Social desirability has been found to be related to demographic variables; it is more likely to be identified in older women
[
9,
10], women of lower socio-economic status
[
10,
11], and higher age
[
12,
13]. Finally, social desirability has been found to be strongly related to the positive rating of the personal qualities of self, family and friends and not of ‘people in general’, the so-called ‘better than average’ effect
[
14].
While social desirability bias has been a general concern in evaluations based on self-reports
[
3], it may play a particularly important role in chronic disease health education interventions, in particular those that are offered to groups of people with chronic conditions who were initially unknown to each other. First, it is likely that individuals would be inclined to present themselves or certain health behaviors in a more positive light. This phenomenon would generally apply to any health-related outcomes assessment. Second, in the specific context of group-based interventions, it is intended that participants and course leaders build strong rapport during the intervention that may last several weeks or months. As a result, at the end of courses, participants may be inclined to provide socially desirable answers to endorse course leaders regardless of whether they truly benefited from the intervention. That is, participants may be aware that they are indirectly evaluating the performance of both the course leader and the organization and therefore provide socially desirable responses to appease leaders rather than showing how they really felt after graduating from the self-management course. Finally, in this setting, participants often fill out questionnaires in the presence of leaders and their peers which again may trigger socially desirable responses as they may feel pressurized to endorse the leaders’ performance. Hence, social desirability bias may have a particular influence on post-test scores and thus apparent change scores.
To measure the influence of potential socially desirable responses, several scales have been developed
[
5,
15‐
18]. Of these, the Marlowe-Crowne (MC) Social Desirability scale
[
16] is one of the most widely used indices
[
19]. It is commonly described as a measure of a person’s need for approval. Although the original authors defined the concept of social desirability in terms of two dimensions, i.e. need for approval and avoidance of disapproval
[
6,
20], they conceptualized the MC scale as a measure of a single dimension
[
6,
21]. However, subsequent studies found little support for this hypothesis, with results ranging from two-factor
[
5,
22] to multi-factor solutions
[
19,
21,
23‐
26]. While such findings cast some doubt on the measurement properties of the MC scale, these studies should be treated with caution. Only two studies applied rigorous psychometric statistical techniques to investigate the properties of the MC scale
[
19,
21]. Moreover, the generalizability of studies is questionable as almost all samples consisted of students
[
19,
24,
25,
27‐
29].
The original MC scale consists of 33 items. Therefore, for some respondents it may be a burden to complete, particularly if the scale is among a panel of scales. As a consequence, short forms have been developed, with Reynolds’ (1982) and Strahan and Gerbasi’s (1972) short forms being most frequently applied
[
19,
21]. Commentaries on the usefulness of the short forms vary substantially. While some suggest that all are unsatisfactory
[
19,
24], others show that they are improvements over the original
[
25,
26,
28]. However, these studies should also be treated with caution. Apart from one study
[
19] none applied rigorous statistical methods. Further, factor analyses on the short forms were generally aimed at confirming/rejecting the one-factor hypothesis, whereas none tested the scales for a potential two- or multi-factor solution. Of all short forms, Reynolds’ MC-C
[
30] has been explored extensively
[
31] and is one of the most frequently used short forms
[
32‐
34]. It has generally been described as a reliable alternative to the full scale
[
30,
31,
35] with acceptable internal consistency
[
24,
25,
30,
31,
34].
In summary, social desirability bias has received frequent attention in the literature
[
20,
36]. However, in view of its potential threat to the validity of scores derived from participants of health interventions, it is surprising that this bias has rarely been explored in contexts where social desirability is likely to be an important bias. Only two out of more than 100 controlled trials of chronic disease self-management courses considered social desirability as a potential covariate
[
37]. The aim of this study was to explore the influence of social desirability bias on change scores derived from data collected from groups of participants taking part in chronic disease self-management courses.
Results
Exploratory factor analyses of the MC-C using CEFA suggested that a one-factor solution did not fit the data well. With two eigenvalues clearly above one (3.4 and 1.9, respectively) and two further eigenvalues at 1.1, factor solutions ranging between two factors and four factors were explored. While fit statistics improved in all multi-factor solutions, models beyond two factors were not superior to the two-factor solution. Therefore, a two-factor solution – labeled SD1 ‘defensiveness’ and SD2 ‘self-presentation’ – appeared most suitable for the MC-C with a moderate correlation of the two factors (0.48). As shown in Table
2, this solution was confirmed in LISREL. While fit statistics were excellent (non-significant Satorra-Bentler chi-square
[
64,
65], RMSEA = 0.023 [90% CI, 0.0;0.043], CFI = 0.99, and SRMR = 0.079), some small factor loadings were obtained ranging from 0.33 to 0.76. Reliability was also relatively low, with Coefficient alpha at 0.59 for SD1 and 0.56 for SD2. As the validation of the MC-C was of exploratory nature
[
54], these values were deemed acceptable for the present study.
Table 2
Confirmatory factor analysis of the short form Marlowe-Crowne social desirability scale MC-C (n = 318)
Self-presentation | | |
1 | It is sometimes hard for me to go on with my work if I am not encouraged. | 0.44 | 0.81 |
2 | I sometimes feel resentful when I don’t get my way. | 0.76 | 0.43 |
3 | On a few occasions, I have given up doing something because I thought too little of my ability. | 0.51 | 0.74 |
4 | There have been times when I felt like rebelling against people in authority even though I knew they were right. | 0.54 | 0.71 |
6 | There have been occasions when I took advantage of someone. | 0.33 | 0.89 |
8 | I sometimes try to get even rather than forgive and forget. | 0.48 | 0.77 |
11 | There have been times when I was quite jealous of the good fortune of others. | 0.51 | 0.74 |
12 | I am sometimes irritated by people who ask favours of me. | 0.37 | 0.87 |
Defensiveness | | |
5 | No matter who I’m talking to, I’m always a good listener. | 0.54 | 0.71 |
7 | I’m always willing to admit it when I make a mistake. | 0.64 | 0.59 |
9 | I am always courteous, even to people who are disagreeable. | 0.70 | 0.51 |
10 | I have never been irked when people expressed ideas very different from my own. | 0.48 | 0.77 |
13 | I have never deliberately said something that hurt someone’s feelings. | 0.42 | 0.83 |
Social desirability in heiQ data
The first step of the 5-step procedure suggested that ‘defensiveness’ correlated significantly with pre-test data across all heiQ scales. Correlations ranged from 0.24 to 0.39, equivalent to a small to medium effect
[
59,
66]. In contrast, none of the heiQ scales indicated an association between ‘self-presentation’ and pre-test data (Table
3). Thus, only ‘defensiveness’ was explored as a potential partial mediating variable in heiQ data, while ‘self-presentation’ could be ruled out as a mediator.
Table 3
Covariance between ‘defensiveness’, ‘self-presentation’, and heiQ pre-test data
SD1-SD2 | Cov | 0.254* | 0.247* | 0.251* | 0.249* | 0.248* | 0.246* | 0.250* | 0.240* |
| (SE) | (0.060) | (0.058) | (0.059) | (0.058) | (0.058) | (0.058) | (0.059) | (0.058) |
| Corr | 0.482 | 0.480 | 0.473 | 0.478 | 0.475 | 0.477 | 0.480 | 0.477 |
SD1-pre | Cov | 0.208* | 0.192* | 0.233* | 0.312* | 0.360* | 0.255* | 0.264* | 0.199* |
| (SE) | (0.071) | (0.071) | (0.069) | (0.092) | (0.084) | (0.057) | (0.079) | (0.057) |
| Corr | 0.255 | 0.242 | 0.280 | 0.310 | 0.388 | 0.375 | 0.280 | 0.271 |
SD2-pre | Cov | 0.089 | 0.015 | −0.032 | 0.155 | 0.032 | 0.098 | 0.009 | 0.019 |
| (SE) | (0.068) | (0.068) | (0.065) | (0.098) | (0.076) | (0.061) | (0.070) | (0.066) |
| Corr | 0.116 | 0.020 | −0.041 | 0.156 | 0.037 | 0.153 | 0.010 | 0.026 |
In Step 2 it was found that all direct paths from pre-test to post-test were significant. While subscale Social integration and support showed the strongest association between the two scores, all heiQ subscales showed substantial paths from predictor to outcome (Table
4).
Table 4
Regression of heiQ post-test on heiQ pre-test datav
pre-post | Path | 0.648* | 0.829* | 0.758* | 0.593* | 0.672* | 0.610* | 0.774* | 0.743* |
| (SE) | (0.044) | (0.069) | (0.049) | (0.071) | (0.066) | (0.065) | (0.044) | (0.053) |
| stand. | 0.744 | 0.776 | 0.761 | 0.665 | 0.718 | 0.626 | 0.808 | 0.777 |
Finally, Table
5 presents the associations between pre-test and post-test once SD1 was included in the models. Again, paths between pre-test and ‘defensiveness’ were significant. Once pre-test data were controlled for ‘defensiveness’, Emotional distress was the only subscale that showed a significant association between ‘defensiveness’ and heiQ post-test data.
Table 5
Regression of ‘defensiveness’ and heiQ post-test data on heiQ pre-test data, and regression of heiQ post-test data on ‘defensiveness’
pre-post | Path | 0.654* | 0.826* | 0.696* | 0.575* | 0.698* | 0.606* | 0.765* | 0.757* |
| (SE) | (0.049) | (0.072) | (0.052) | (0.077) | (0.076) | (0.075) | (0.046) | (0.058) |
| stand. | 0.753 | 0.772 | 0.700 | 0.642 | 0.752 | 0.618 | 0.796 | 0.788 |
pre-SD1 | Path | 0.172* | 0.169* | 0.191* | 0.166* | 0.229* | 0.301* | 0.160* | 0.192* |
| (SE) | (0.057) | (0.059) | (0.056) | (0.049) | (0.052) | (0.067) | (0.047) | (0.053) |
| stand. | 0.259 | 0.252 | 0.290 | 0.326 | 0.399 | 0.388 | 0.284 | 0.283 |
SD1-post | Path | −0.038 | 0.026 | 0.322* | 0.128 | −0.141 | 0.024 | 0.070 | −0.060 |
(SE) | (0.088) | (0.107) | (0.099) | (0.132) | (0.123) | (0.101) | (0.092) | (0.102) |
| stand. | −0.029 | 0.016 | 0.213 | 0.073 | −0.087 | 0.019 | 0.041 | −0.043 |
As ‘defensiveness’ was found to be associated with Emotional distress, steps 4 and 5 were performed on this heiQ scale only. Once ‘defensiveness’ was included in the model, the path between pre-tests and post-tests decreased by 0.062, a significant effect, as it was more than twice its standard error
[
52], i.e. SE
M = √ (0.191
2 * 0.099
2 + 0.322
2 * 0.056
2) = 0.026. The magnitude of the effect, however, was small as it contributed only 8.2% of the total variation in change scores.
Discussion
This study explored the potential mediating effect of social desirability in the measurement of outcomes of chronic disease self-management courses. For this, we used rigorous statistical techniques – including exploratory and confirmatory factor analysis of the MC-C as well as applying a comprehensive 5-step model – to explore both direct and mediating effects on key outcomes. Surprisingly, while we had expected clear evidence of bias in estimates of change through socially desirable responding, virtually no social desirability bias was found. When analyzing social desirability bias as a potential mediating variable between heiQ pre-test and post-test data, only the ‘defensiveness’ factor but not the ‘self-presentation’ factor of the MC-C showed an association with pre-test data, a prerequisite for investigating mediation. The notion of ‘defense’ and ‘self-protection’ was introduced as one critical aspect of the approval motive
[
6]. Subsequent research, however, suggested that subjects’ motivations to present themselves in a socially desirable way was linked more strongly to ‘defensiveness’ rather than ‘self-presentation’
[
7,
67] which may explain our findings, i.e. the lack of association of pre-test data with ‘self-presentation’.
Despite the significant association of ‘defensiveness’ with all pre-tests, it exerted only little influence on heiQ post-test data once pre-test data were controlled for. Only one heiQ scale (Emotional distress) showed that ‘defensiveness’ operated as a true, albeit minor, mediator. Therefore, the influence of social desirability bias in heiQ data can largely be ruled out. This finding is contrary to our expectations. First, the specific context of group-based chronic disease self-management interventions, potential rapport among participants and between participants and course leader(s), and provision of data in the presence of course leaders are factors that may be conducive to exhibiting a socially desirable response style. Second, social desirability has been found to be related to a range of demographic variables. Among others, older women
[
9,
10], women of lower socio-economic status
[
10,
11], and older respondents
[
12,
13] have been found to be most prone to socially desirable responding. While we did not have socio-economic data, the remaining characteristics largely fit our sample, i.e. an additional argument for the presence of social desirability in our study.
There are several possible explanations for our findings, i.e. lack of social desirability bias in heiQ data. First, all heiQ items have been written in a way that discourage response styles
[
41]. That is, even people who are usually prone to socially desirable responding may have been discouraged to do so through the content and structure of heiQ items. The heiQ was developed using grounded approaches including the use of concepts and wording that were directly derived from patients. Second, the short form MC-C scale was used to explore a potential effect of social desirability bias. Although there is sufficient support in the literature that the MC-C is a valid alternative to the full MC scale, and our re-validation supported a two-factor solution with excellent fit statistics, it is possible that the analyses were hampered by a suboptimal performance of this shortened measure. Despite excellent fit indices in LISREL, low reliability and some small factor loadings may have limited the power of the analyses to detect mediational effects of social desirability.
In this study we applied a novel approach to testing the influence of social desirability bias in the context of chronic disease self-management programs. Apart from providing a detailed re-validation of the MC-C
[
30], with both exploratory and confirmatory analyses, a sophisticated model of partial mediation was developed that should have detected an association of social desirability if there had been any. However, it cannot be ruled out that the MC-C scale did not perform sufficiently well, while a potential co-existence of equivalent models also needs to be acknowledged
[
68,
69]. For example, it would have been plausible to define ‘defensiveness’ as a predictor of both pre-test and post-test or define a model of moderated mediation
[
57,
59], with variables such as age, gender, or education operating as moderating variables. It is possible that there was a mediating effect of socially desirable responding in older participants but not in their younger counterparts. The sample size of the dataset, however, did not allow for such modeling. Further, it is possible that social desirability moderated – rather than mediated – the effect between pre-test and post-test. However, as ‘social desirability’ was defined as a response style that was hypothesized to improve the prediction of post-test levels – i.e. the variable ‘social desirability’ was defined as part of the causal chain
[
56,
57] – current model definition was assumed to be most appropriate to test for socially desirable responding. In view of our specific research questions, the present model is a logical and theoretically sound approach
[
61]. That is, the path between pre-test and post-test was understood as the primary path in the model, and social desirability was defined as a response style that potentially partially mediated the relationship between heiQ pre-test and post-test data.
Competing interests
The authors state that there are no conflicts of interests.