Background
Internationally, the SCL-90 [
16,
21,
22] is the most used questionnaire for the assessment of psychological distress, especially in clinical practice [
45]. However, it is a very long and time-consuming questionnaire, which is why two Brief Symptom Inventory versions of the Symptom Checklist SCL-90-R were developed [
12,
18,
20,
21]: the Brief Symptom Inventory with 53 items (BSI; [
13,
17,
19,
23]), which measures psychological distress, and its shortened version, the Brief Symptom Inventory with 18 items (BSI-18; [
11,
14,
24]).
The Brief Symptom Inventory with 53 items was developed by Derogatis using a factor analysis and maintaining the scale structure with the reduced item number of the SCL-90-R (somatization, obsessive-compulsive, interpersonal sensitivity, depression, anxiety, anger-hostility, phobic anxiety paranoid ideation and psychoticism). In Germany, the BSI is mainly used for quality management in psychotherapy (e.g. [
28]).
In order to reduce and prevent an overload to the patients and to ensure an easy screening-tool, the BSI-18 was developed with highest clinical relevance. The BSI-18 contains only the three six-item scales somatization (SOMA), anxiety (ANX), depression (DEPR), and the global Scale Global Severity Index (GSI). (They are documented in Table
1). Contrary to the SCL-90-R and the BSI-53, the BSI-18 scores were calculated by sum scores. The GSI therefore ranges between 0 – 72 and the three scales between 0 – 24. The application studies demonstrated that the BSI-18 is a suitable instrument for measuring psychological distress and comorbidities in patients with different mental and somatic illnesses (e.g. [
1,
4,
8,
9,
10,
29,
38,
39,
46,
48]). This instrument is also used in longitudinal studies [
5,
6,
37].
Table 1
Item- and scale statistics, reliability, and convergent validity in the total sample (N = 2516)
Scale 1: Somatization (α = .82) |
∑1.46 (2.58) | | | .52** | .48** |
1 | Faintness or dizziness | 0.19 (0.51) | .56 | .80 | | |
4 | Pains in heart or chest | 0.23 (0.56) | .61 | .79 | | |
7 | Nausea or upset stomach | 0.28 (0.63) | .49 | .81 | | |
10 | Trouble getting your breath | 0.22 (0.57) | .65 | .78 | | |
13 | Numbness or tingling in parts of your body | 0.22 (0.60) | .55 | .80 | | |
16 | Feeling weak in parts of your body | 0.33 (0.69) | .66 | .77 | | |
Scale 2: Depression (α = .87) |
∑1.76 (3.23) | | | .72** | .71** |
2 | Feeling no interest in things | 0.36 (0.69) | .67 | .85 | | |
5 | Feeling lonely | 0.42 (0.82) | .66 | .85 | | |
8 | Feeling blue | 0.27 (0.67) | .78 | .83 | | |
11 | Feelings of worthlessness | 0.26 (0.69) | .74 | .83 | | |
14 | Feeling hopeless about the future | 0.39 (0.83) | .73 | .84 | | |
17 | Thoughts of ending your life | 0.07 (0.36) | .51 |
.88
| | |
Scale 3: Anxiety (α = .84) |
∑1.44 (2.59) | | | .63** | .72** |
3 | Nervousness or shakiness inside | 0.27 (0.62) | .66 | .80 | | |
6 | Feeling tense or keyed up | 0.50 (0.77) | .59 | .83 | | |
9 | Suddenly scared for no reason | 0.17 (0.50) | .62 | .81 | | |
12 | Spells of terror or panic | 0.10 (0.39) | .66 | .82 | | |
15 | Feeling so restless you couldn’t sit still | 0.23 (0.60) | .66 | .80 | | |
18 | Feeling fearful | 0.19 (0.53) | .63 | .81 | | |
Global Severity Index (α = .93) |
∑4.66 (7.44) | | | .71** | .73** |
Until now, there have only been three studies which address the applicability and psychometric properties of the German version of the BSI-18 in patients after renal transplantation [
26] and in hospitalized psychosomatic patients [
25,
49].
In contrast, the psychometric properties of the BSI-18 were discussed internationally in 13 publications. The reliability (Cronbachs α) ranged for SOMA between α
min
= .61 [
36] and α
max
= .84 [
50], for DEPR between α
min
= .64 [
36] and α
max
= .92 [
43], for ANX between α
min
= .71 [
2] and α
max
= .88 [
50] and for the GSI between α
min
= .84 [
36] and α
max
= .94 [
43]. The reliability was mostly above .80 and can thus be evaluated as good. The reliability for the American norm sample (
N = 1134; α-SOMA = .74, α-DEPR = .84, α-ANGS = .79, α -GSI = .89; [
14]) is to be rated as satisfactory.
The retest-reliability for
n = 103 psychological distressed patients after 15 days without intervention was satisfactory with values between
r
tt
= .68 and
r
tt
= .82 [
2]. For validity evidence based on internal structure, a strong first factor was discussed (e.g. [
3]) alike to the SCL-90-R and the BSI-53 [
42]. Based on an exploratory factor analysis, the original 3-scale structure could be replicated in
n = 638 hospitalized psychosomatic patients [
25]. In addition, the original scale structure was often tested by confirmatory factor analysis [
25,
26,
50]. Convergent validity was shown in several studies [
2,
49]. Sensitivity and specificity were first analyzed by Zabora et al. [
51] using the BSI-53.
As yet, psychometric properties based on a representative sample are still not available for Germany. Therefore, the aim of this study was to (1) describe the psychological distress within the German population, to present (2) the reliability, and (3) the factorial validity.
Materials
Sample description
The representative sample contains 2516 individuals (53.7% female) with an average of 50.5 years of age (SD = 18.6, Range = 14 – 94 years). A total of seven nearly equidistant age groups were set up: ages 14–24 (10.7%), 25–34 (11.6%), 35–44 (16.3%), 45–54 (17.3%), 55–64 (16.5%), 65–74 (17.5%), and 75–94 (10%). In the sample, 52% were married, 23.4% single, 11.3% divorced, and 13.2% separated. Employment: 37.9% had a full-time job, and 9.2% had a part-time job. The remainder of the sample was unemployed (8.1%), retired (33.9%), housewife/ house-husband (5.3%), and 9.7% had not yet completed their education. Educational background: 44.1% had a lower education, 36.3% an upper education, and 6.8% an advanced education; 6.6% were university students, 4.1% were still attending school, and 2% had not graduated.
Psychological assessments
Demographic information, the BSI-18, and further psychological assessments were collected in the survey. To investigate validity evidence based on external criteria, the 4-item version of the Patient Health Questionnaire was used to screen for depression and anxiety (PHQ-4; [
32‐
34]). All the questions apply to the two preceding weeks and are to be rated by using “0 = not at all”, “1 = several days”, “2 = more than half the days” and “3 = nearly every day”. For statistical calculations, the answer category “0” was to be opposed to the other three categories.
Statistics
The analyses were carried out using PASW and AMOS. First, a Missing Data Analysis led to the exclusion of four participants because they showed more than the tolerated amount of missing data (tolerated < 1 items of each scale, < 3 items in total). At last, a total of 0.09% of the answers were missing and not assigned randomly (Little MCAR-Test:
Chi-Quadrat = 550.971,
df = 333,
p < .0001). Therefore they were replaced by using Multiple Imputation (MCMC in LISREL 8.15; [
35]).
Descriptive statistics, reliability as well as discriminant and convergent correlations were estimated. Construct validity was tested by using the confirmatory factor analysis (CFA).
Using AMOS [
31], the respective fit of the two-factor and the three-factor model was tested using CFAs. Due to the lack of multivariate normality in the data tested with the Marida-test in AMOS, the Asymptotically Distribution Free-estimator (ADF) was used for model testing [
7]. According to Schermelleh-Engel, Moosbrugger, and Müller [
47], a good (acceptable) model fit is a given with SB χ
2/df index below 2.0 (below 3.0), Comparative Fit Index (CFI) as well as Tucker-Lewis-Index (TLI) above .95 (above .90), Standardized Root Mean Square Residual (SRMR) below .05 (below .10), and Root Mean Square Error of Approximation (RMSEA) below .05 (below .08).
Discussion
Up to now, the BSI-18 has not been used widely in Germany. The psychometric properties and benefits of the instrument were investigated in three samples [
25,
26,
49]. For the present representative sample, the questions concerning reliability and model fit could be answered.
The reliability (Cronbach’s α) of the BSI-18 (α-SOMA = .82, α-DEPR = .87, α-ANX = .84, α-GSI = .93) was good to very good and ranged higher than in the US standardization. The reliability of the American norm sample (
N = 1134; α-SOMA = .74, α-DEPR = .84, α-ANGS = .79, α -GSI = .89; [
14]) had to be rated as satisfactory. Therefore, it can be concluded that the internal consistency of the scales can be affected by a sufficient sample procedure [
41]. The internal consistency of the scale Depression could be increased by eliminating item 17 (thoughts of ending your life). This result is similar to that of other samples, but due to the clinical relevance the item should be retained.
Using the two-item scales Depression and Anxiety of the PHQ-4 [
30], to analyze convergent validity, the results were quite similar to the results by Spitzer et al. [
49] using a longer PHQ-version. On the one hand, corresponding BSI-18- and PHQ-subscales demonstrated highest correlations; on the other hand, the Anxiety scale of PHQ-4 correlated similarly with BSI-18-Anxiety and BSI-18-Depression. Non-corresponding scales like the BSI-18-SOMA showed lower correlations. The results by Spitzer et al. [
49] and our own results were found in non-clinical samples. Regarding clinical data [
25,
26], it could be concluded that the BSI-18 is more suitable to psychologically distressed than non-distressed populations.
Congruent with international [
27,
40,
42,
50] and German clinical studies [
25,
26] the three scales of the BSI-18 showed the best model fits by reproducing the scale structure using the confirmatory factor analysis. Nevertheless, the boundaries for a good model fit according to Schermelleh-Engel, Moosbrugger, and Müller [
47] could not be reached. The model fit based on RMSEA is good but that model fit based on CFI and TLI are too low.
The remarkable strength of the present sample is its good age distribution due to representative sampling: − young (
n = 270, aged 14 – 24), elderly (
n = 440, aged 65 – 74), and old age (
n = 252, aged 75 – 94). Besides the strength of a large sample size as a limitation, it is not possible to draw general conclusions based on the data from a representative sample since a large sample size could easily lead to significant effects. Since the sample was representative for the normal population, the results are not offhandedly applicable to highly distressed samples [
15]. In turn, the BSI-18 should be applied to different clinical samples to further replicate or reprobate the factorial structure.
In future research it would be productive to test the stability of the distress construct (test-retest reliability) and to explore connections to other distress questionnaires (convergent validity) or external ratings (criterion validity) [
44]. A design with repeated measurements would allow for the comparison of factor structures across time and the determination of possible cohort effects.
The available version of the used software to measure the factor analysis with categorical indicators was applied. This should be seen as a limitation of this study and advice for future research.
Conclusion
The BSI-18 is a very short, reliable instrument for the assessment of psychological distress. The factorial structure of the instrument is very good when using confirmatory factor analyses as well as the psychometric criteria. Therefore, it is an instrument that can be used to reliably assess psychological distress in clinical samples as well as in the general population. In addition, it can be used in psychotherapy research as well as in quality assurance for psychotherapeutic long-term effects. Taking into account the good internal consistency reliability estimates and the encouraging convergent validity estimates, this preliminary validation is a good step forward in validation studies which are iterative in nature.
Acknowledgements
We would like to thank Liz Orrison for the native speaker proof reading.