Published in: Journal of General Internal Medicine 4/2022

Open Access | 31 March 2021 | Original Research

Establishing Crosswalks Between Common Measures of Burnout in US Physicians

Authors: Keri J. S. Brady, PhD, MPH, Pengsheng Ni, MD, MPH, Lindsey Carlasare, MBA, Tait D. Shanafelt, MD, Christine A. Sinsky, MD, Mark Linzer, MD, Martin Stillman, MD, JD, Mickey T. Trockel, MD, PhD

Abstract

Background

Physician burnout is often assessed by healthcare organizations. Yet, scores from different burnout measures cannot currently be directly compared, limiting the interpretation of results across organizations or studies.

Objective

To link common measures of burnout to a single metric in psychometric analyses such that group-level scores from different assessments can be compared.

Design

Cross-sectional survey.

Setting

US practices.

Participants

A total of 1355 physicians sampled from the American Medical Association Physician Masterfile.

Main Measures

We linked the Stanford Professional Fulfillment Index (PFI) and Mini-Z Single-Item Burnout (MZSIB) scale to the Maslach Burnout Inventory (MBI) in item response theory (IRT) fixed-calibration and equipercentile analyses and created crosswalks mapping PFI and MZSIB scores to corresponding MBI scores. We evaluated the accuracy of the results by comparing physicians’ actual MBI scores to those predicted by linking and described the closest cut-point equivalencies across scales linked to the same MBI subscale using the resulting crosswalks.

Key Results

IRT linking produced the most accurate results and was used to create crosswalks mapping (1) PFI Work Exhaustion (PFI-WE) and MZSIB scores to MBI Emotional Exhaustion (MBI-EE) scores and (2) PFI Interpersonal Disengagement (PFI-ID) scores to MBI Depersonalization (MBI-DP) scores. The commonly used MBI-EE raw score cut-point of ≥27 corresponded most closely with respective PFI-WE and MZSIB raw score cut-points of ≥7 and ≥3. The commonly used MBI-DP raw score cut-point of ≥10 corresponded most closely with a PFI-ID raw score cut-point of ≥9.

Conclusions

Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1007/s11606-021-06661-4.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Abbreviations

IRT: Item response theory
MBI: Maslach Burnout Inventory-Human Services Survey for Medical Personnel
MBI-EE: Maslach Burnout Inventory-Human Services Survey for Medical Personnel Emotional Exhaustion scale
MBI-DP: Maslach Burnout Inventory-Human Services Survey for Medical Personnel Depersonalization scale
MZSIB: Mini-Z Single-Item Burnout scale
PFI: Stanford Professional Fulfillment Index
PFI-WE: Stanford Professional Fulfillment Index Work Exhaustion scale
PFI-ID: Stanford Professional Fulfillment Index Interpersonal Disengagement scale

INTRODUCTION

In the US, burnout is more common in physicians than in workers in other fields,1 and is characterized by work-related feelings of exhaustion and depersonalization or interpersonal disengagement.2, 3 Physician burnout is associated with poor physician health outcomes, reduced quality of care, and at least 4.6 billion dollars in excess health system costs annually.4–6 In an effort to curb physician burnout,7, 8 health systems across the nation are integrating measures of burnout into routine organizational assessments to monitor system functioning and evaluate the effectiveness of practice changes designed to improve physician well-being.9–11 This practice is recommended in the National Academy of Medicine’s consensus report on clinician burnout and regarded by healthcare leaders as a basic first step to addressing the problem.7, 10–14
With the widespread adoption of physician burnout assessment within US healthcare systems has come the problem of comparing outcomes across different burnout measures. With several validated options available that vary in length and cost, a number of different measures are currently in use in the US,9, 10 including the Maslach Burnout Inventory-Human Services Survey for Medical Personnel (MBI),15 Stanford Professional Fulfillment Index (PFI),16 and the Mini-Z Single-Item Burnout (MZSIB) scale.17 When two different burnout measures are used across organizations or within an organization over time, the scores are not comparable unless they are placed onto the same metric, or “linked,” in psychometric analyses. To date, no studies to our knowledge have linked common measures of physician burnout onto a single metric, which would allow healthcare organizations to compare burnout scores/rates across different measures.
The primary aim of this study was to link the PFI and MZSIB to the MBI metric and create crosswalks that map scores from the PFI and MZSIB to corresponding scores on the MBI. Using the crosswalks, we aimed to describe the closest cut-point equivalences for scales linked to the same metric. Our secondary aim was to examine the psychometric properties of scales linked to the same metric, including each scale’s reliability and associations with relevant adverse outcomes.

METHODS

Linking refers to the statistical process of placing two or more measures with different content and/or construct severity levels onto the same scale.18 Through this process, a relationship is established between the linked measures, such that for each score on Burnout Measure A, an equivalent score (within standard error) on Burnout Measure B is established.

Design and Participants

This study used a single-group linking design, whereby items from each burnout instrument were administered in a confidential, cross-sectional survey to all respondents from February to March 2019. To obtain a representative convenience sample, we randomly sampled physicians of all ages, sexes, and specialties from the American Medical Association Physician Masterfile. Physicians were emailed the survey and offered a small financial incentive to participate. The survey was administered in waves until we reached a target sample size of ≥1200 respondents, which was estimated as the minimum sample size needed for item response theory linking analyses. Physicians (including postgraduate trainees) practicing in the US at the time of the survey were eligible for inclusion.

Measures

We measured physician burnout using the MBI 9-item Emotional Exhaustion (MBI-EE) and 5-item Depersonalization (MBI-DP) subscales (0 = never, 1 = a few times a year or less, 2 = once a month or less, 3 = a few times a month, 4 = once a week, 5 = a few times a week, 6 = every day); the PFI 4-item Work Exhaustion (PFI-WE) and 6-item Interpersonal Disengagement (PFI-ID) subscales (0 = not at all, 1 = very little, 2 = moderately, 3 = a lot, 4 = extremely); and the single-item MZSIB (1 = no burnout; 2 = under stress; 3 = have one or more burnout symptom; 4 = burnout won’t go away; 5 = completely burned out; see Supplemental Appendix 1 for the complete MZSIB response options).17 The sequence in which each instrument was administered was randomized to prevent ordering effects.
The MBI and PFI are outcome measures, whereas the MZSIB scale is a screening measure. Commonly used raw (total) score cut-points for each scale are ≥27, ≥10, and ≥3 on the MBI-EE, MBI-DP, and MZSIB scales, respectively.1, 19, 20 The raw (total) score cut-point for the PFI Burnout Composite (PFI-BC) Scale is ≥14.16 Cut-points for PFI-WE and PFI-ID subscales have not been published and are identified in the current study.16
We also assessed physicians’ demographics, depressive symptoms (4-item PROMIS depression measure),21 distress as measured by the original, 7-item Physician Well-Being Index (WBI),22–24 and intent to leave one’s current practice or intent to leave medicine (for attending physicians and postgraduate trainees, respectively) in the next 2 years (1 item).17 All measures were scored such that higher scores indicate more of each construct.
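The raw (total) scoring and cut-points described above can be illustrated with a minimal sketch. It assumes that each raw score is the plain sum of a scale's item responses, that the PFI Burnout Composite is the sum of the PFI-WE and PFI-ID raw scores, and that MBI burnout is flagged when EE ≥27 and/or DP ≥10 (as reported in Table 2). The function names and the respondent's item responses are hypothetical and are not study data.

```python
from typing import Sequence

# Raw-score cut-points quoted in the text.
CUT_POINTS = {"MBI-EE": 27, "MBI-DP": 10, "MZSIB": 3, "PFI-BC": 14}


def raw_total(item_responses: Sequence[int]) -> int:
    """Raw (total) score: the sum of a scale's item responses."""
    return sum(item_responses)


def meets_cut_point(scale: str, raw_score: int) -> bool:
    """True when a raw score is at or above the scale's cut-point."""
    return raw_score >= CUT_POINTS[scale]


# Hypothetical respondent (illustrative responses only).
mbi_ee = raw_total([4, 3, 5, 2, 3, 4, 3, 2, 3])  # nine items, each scored 0-6
mbi_dp = raw_total([1, 2, 0, 1, 2])              # five items, each scored 0-6
pfi_we = raw_total([2, 3, 2, 1])                 # four items, each scored 0-4
pfi_id = raw_total([1, 0, 2, 1, 1, 0])           # six items, each scored 0-4
pfi_bc = pfi_we + pfi_id                         # composite = PFI-WE + PFI-ID

# MBI burnout: EE >= 27 and/or DP >= 10; PFI burnout: composite >= 14.
mbi_burnout = meets_cut_point("MBI-EE", mbi_ee) or meets_cut_point("MBI-DP", mbi_dp)
pfi_burnout = meets_cut_point("PFI-BC", pfi_bc)
```

In this made-up case the respondent meets the MBI criterion but falls one point short of the PFI composite cut-point, which is the kind of disagreement between instruments that motivates placing the scales on a common metric.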

Linking Analyses

Our methods were informed by those used in the PROsetta Stone Project.25, 26 Scales were linked in item sets, consisting of two scales: a target measure and an anchor measure. In linking analyses, a target measure is linked to an anchor measure, which places the target measure onto the metric of the anchor measure. Because the MBI is historically the most common physician burnout assessment,27 we selected the MBI-EE and MBI-DP scales as anchor measures. Target measures included the PFI-WE, PFI-ID, and MZSIB scales.
Prior to conducting linking analyses, we qualitatively and quantitatively examined the degree to which the scales that we aimed to link assess essentially the same construct, a key assumption of linking.18, 28 Scales assessing essentially the same construct were expected to (1) have very similar item content as determined by two independent subject domain expert raters (TS, ML); (2) be highly correlated (inter-scale Pearson’s r of ≥0.75); and (3) be essentially unidimensional as determined in confirmatory factor analyses (CFAs) (see Supplemental Appendix 2 for additional assumption assessment details).25
For each item set, we conducted item response theory (IRT) fixed-calibration linking and equipercentile linking analyses using a fivefold cross validation process (Supplemental Appendix 3). In IRT linking, raw (total) scores on each target measure were linked to t-scores on each MBI anchor scale. A t-score is a standardized score ranging from 0 to 100, with a mean score and standard deviation equal to 50 and 10, respectively. T-scores on each MBI anchor scale were then mapped to corresponding MBI raw scores. In our IRT linking analyses, we derived the MBI-EE and MBI-DP anchor metrics from a prior IRT calibration of the MBI in a 2014 national sample of US physicians.29, 30 In equipercentile linking, the MBI metric was derived from the primary survey data collected in this study. We evaluated the accuracy of each linking method for each item set by calculating the correlation, mean difference, and standard deviation (SD) of the difference between physicians’ predicted and actual t-scores on the MBI anchor scale, using pooled predicted and actual t-scores produced from a fivefold cross validation process. The method that yielded the highest correlations, lowest mean differences, and lowest SD of difference across all item sets was used to create a crosswalk mapping raw scores on the target measure to corresponding t-scores and raw scores on the MBI anchor measure. Once each item set was linked, we (1) identified the closest cut-point equivalencies across scales linked to the same metric and (2) described the reliability of scales linked to the same metric (Supplemental Appendix 4).31 We used the Brady et al. 29 IRT analysis to identify the t-scores corresponding with (1) each MBI-EE and MBI-DP raw score cut-point and (2) each raw score on the MBI predicted by equipercentile linking.
Finally, we computed correlations between each scale and measures of physician depressive symptoms, distress, and intent to leave to compare the magnitude of each scale’s associations with these outcomes. Analyses were conducted in R (v3.5.1) using the psych, lavaan, mirt, and equate packages.32–36 This study was approved by the University of Illinois at Chicago Institutional Review Board.

RESULTS

Sample

The overall sample included 1355 US physicians (Table 1). The most common demographic characteristics of respondents were White race, male sex, non-primary care specialty, and ≤44 years of age. Thirty-one percent of respondents were trainees. In subgroup invariance analyses, we found support for the invariance of our linking results across early versus late responders (where late responders were used as a proxy for non-responders; Supplemental Appendix 5, Table 5.5). Overall, mean raw scores on the MBI-EE, PFI-WE, MZSIB, MBI-DP, and PFI-ID scales were 21.82, 6.06, 2.45, 7.86, and 6.63, respectively (Table 2) (see Supplemental Appendix 6 for specialty-level descriptive scale statistics).

Table 1
Overall Sample Characteristics (n = 1355)

| Characteristic | n (%) [a] |
|---|---|
| Sex | |
| Male | 763 (57) |
| Female | 579 (43) |
| Missing | 13 (0.1) |
| Age group | |
| <35 years | 440 (33) |
| 35–44 years | 385 (28) |
| 45–54 years | 243 (18) |
| 55–64 years | 193 (14) |
| ≥65 years | 94 (7) |
| Race | |
| White/Caucasian | 894 (66) |
| Black/African American | 54 (4) |
| Asian | 292 (22) |
| Other | 115 (9) |
| Trainee status | |
| Trainee (resident/fellow) | 420 (31) |
| Non-trainee | 935 (69) |
| Primary care | |
| Primary care [b] | 442 (33) |
| Non-primary care | 913 (67) |
| Specialty | |
| Anesthesiology | 97 (7) |
| Dermatology | 24 (2) |
| Emergency medicine | 74 (6) |
| Family medicine | 167 (12) |
| General surgery | 62 (5) |
| General surgery subspecialty | 71 (5) |
| General internal medicine | 184 (14) |
| General pediatrics | 91 (7) |
| Internal medicine-subspecialty | 127 (9) |
| Neurology | 28 (2) |
| Obstetrics and gynecology | 96 (7) |
| Ophthalmology | 30 (2) |
| Other | 81 (6) |
| Pathology | 4 (0.3) |
| Pediatric subspecialty | 63 (5) |
| Physical medicine | 13 (1) |
| Psychiatry | 90 (7) |
| Radiology | 53 (4) |
| Practice type | |
| Non-governmental hospital | 473 (35) |
| Group practice | 404 (30) |
| City/county/state/federal government hospital | 130 (10) |
| Self-employed solo practice | 97 (7) |
| Other [c] | 250 (18) |
| Missing | 1 (0.1) |

[a] Percentages may not add to 100 due to rounding. Missingness is only specified for variables that had missing data.
[b] Includes physicians in general internal medicine, pediatrics, and family medicine specialties.
[c] Includes physicians practicing in/as HMO, locum tenens, medical school, two physician practice (full or part owner), other patient care, city/county/state government non-hospital setting, and no classification.

Table 2
Overall Descriptive Scale Statistics by Domain and Measure (n = 1346)

| Domain/measure | Statistic [a] |
|---|---|
| Emotional exhaustion | |
| MBI-EE, mean (SD) | 21.82 (12.16) |
| MBI-EE ≥27, n (%) | 469 (34.8) |
| PFI-WE, mean (SD) | 6.06 (3.46) |
| PFI-WE ≥7, n (%) | 582 (43.2) |
| MZSIB, mean (SD) | 2.45 (0.92) |
| MZSIB ≥3, n (%) | 589 (43.8) |
| Depersonalization | |
| MBI-DP, mean (SD) | 7.86 (6.41) |
| MBI-DP ≥10, n (%) | 458 (34.0) |
| PFI-ID, mean (SD) | 6.63 (4.77) |
| PFI-ID ≥9, n (%) | 470 (34.9) |
| Burnout | |
| MBI (EE ≥27 and/or DP ≥10), n (%) | 584 (43.4) |
| PFI-BC [b], mean (SD) | 12.68 (7.63) |
| PFI-BC [b] ≥14, n (%) | 599 (44.5) |

[a] Includes respondents with ≤1 missing item response for all scales. Cut-points presented are raw total scores on each scale.
[b] PFI-BC refers to the PFI Burnout Composite Scale, which is scored as the total raw score from both the PFI-WE and PFI-ID scales.

Assumption Assessment

In qualitative evaluations of each target and anchor scale’s item content overlap, both raters agreed that the following item sets assess essentially the same underlying construct: PFI-WE and MBI-EE (item set 1), PFI-ID and MBI-DP (item set 2), and MZSIB and MBI-EE (item set 3). Inter-scale correlations between the target and anchor scales in item sets 1–3 were 0.80, 0.76, and 0.76, respectively. Item sets 1–3 met all other linking assumptions in quantitative analyses (Supplemental Appendix 5).

Crosswalks and Closest Cut-Point Equivalents

Overall, IRT (versus equipercentile) linking produced the most accurate results (Supplemental Appendices 7–9) and was used to create crosswalks mapping raw scores on the PFI-WE, PFI-ID, and MZSIB (target) scales to corresponding t-scores and raw scores on their respective MBI-EE, MBI-DP, and MBI-EE anchor scales (Table 3).
The commonly used raw score cut-point of ≥27 (t-score = 50.70) 29 on the MBI-EE scale corresponded most closely with raw score cut-points of ≥7 and ≥3 on the respective PFI-WE and MZSIB scales (Table 3). The commonly used raw score cut-point of ≥10 (t-score = 53.76) 29 on the MBI-DP scale corresponded most closely with a raw score cut-point of ≥9 on the PFI-ID scale. The raw score cut-point of ≥3 on the MZSIB scale corresponded most closely with a raw score of ≥8 on the PFI-WE scale.

Table 3
Crosswalks Produced from IRT Linking Mapping Raw Scores from the PFI and MZSIB to Corresponding Predicted MBI T-scores and Raw Scores

Item Set 1: PFI Work Exhaustion (PFI-WE) Scale (target scale) linked to MBI Emotional Exhaustion (MBI-EE) Scale (anchor scale) [a]

| PFI-WE raw (total) score | Predicted MBI-EE T-score (SE) | Predicted MBI-EE raw (total) score |
|---|---|---|
| 0 | 30.15 (4.93) | 2.57 |
| 1 | 34.96 (3.79) | 5.86 |
| 2 | 38.09 (3.52) | 8.89 |
| 3 | 40.74 (3.39) | 12.06 |
| 4 | 43.10 (3.33) | 15.24 |
| 5 | 45.34 (3.32) | 18.54 |
| 6 | 47.59 (3.32) | 22.09 |
| 7 | 49.81 (3.30) | 25.73 |
| 8 | 51.98 (3.29) | 29.29 |
| 9 | 54.13 (3.30) | 32.65 |
| 10 | 56.33 (3.31) | 35.80 |
| 11 | 58.55 (3.31) | 38.72 |
| 12 | 60.77 (3.32) | 41.43 |
| 13 | 63.11 (3.38) | 44.04 |
| 14 | 65.71 (3.49) | 46.41 |
| 15 | 68.75 (3.75) | 48.41 |
| 16 | 73.18 (4.70) | 50.36 |

Item Set 2: PFI Interpersonal Disengagement (PFI-ID) Scale (target scale) linked to MBI Depersonalization (MBI-DP) Scale (anchor scale) [a]

| PFI-ID raw (total) score | Predicted MBI-DP T-score (SE) | Predicted MBI-DP raw (total) score |
|---|---|---|
| 0 | 35.46 (5.36) | 1.31 |
| 1 | 40.99 (3.59) | 2.41 |
| 2 | 42.76 (3.58) | 2.99 |
| 3 | 45.06 (3.18) | 3.94 |
| 4 | 46.90 (3.21) | 4.85 |
| 5 | 48.51 (3.10) | 5.74 |
| 6 | 50.19 (3.02) | 6.77 |
| 7 | 51.81 (3.07) | 7.89 |
| 8 | 53.30 (3.10) | 9.01 |
| 9 | 54.91 (3.07) | 10.30 |
| 10 | 56.47 (3.11) | 11.62 |
| 11 | 57.93 (3.07) | 12.88 |
| 12 | 59.47 (3.04) | 14.22 |
| 13 | 60.92 (3.10) | 15.47 |
| 14 | 62.31 (3.10) | 16.64 |
| 15 | 63.85 (3.08) | 17.87 |
| 16 | 65.35 (3.09) | 19.01 |
| 17 | 66.84 (3.02) | 20.07 |
| 18 | 68.40 (2.98) | 21.11 |
| 19 | 69.95 (3.02) | 22.09 |
| 20 | 71.52 (3.05) | 23.01 |
| 21 | 73.28 (3.03) | 23.94 |
| 22 | 75.24 (3.16) | 24.83 |
| 23 | 77.17 (3.21) | 25.58 |
| 24 | 80.61 (3.98) | 26.69 |

Item Set 3: Mini-Z Single-Item Burnout (MZSIB) Scale (target scale) linked to MBI Emotional Exhaustion (MBI-EE) Scale (anchor scale) [a]

| MZSIB item raw score | Predicted MBI-EE T-score (SE) | Predicted MBI-EE raw score |
|---|---|---|
| 1 | 35.44 (6.26) | 6.27 |
| 2 | 44.75 (5.35) | 17.64 |
| 3 | 52.37 (5.05) | 29.92 |
| 4 | 60.09 (5.60) | 40.62 |
| 5 | 69.49 (6.30) | 48.80 |

[a] Bolded values are those that are closest to the mean on the corresponding MBI anchor metric. Crosswalks were generated using item response theory fixed-calibration linking based on MBI item parameter estimates established in prior IRT analysis of MBI data from a 2014 national physician sample.29 Note that item set 2 is not on the same metric as item sets 1 and 3. Therefore, item set 2 cannot be compared with item sets 1 and 3.
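
The cut-point equivalences reported in this section can be checked against the Table 3 crosswalks with a short sketch. The text does not spell out how "closest" was operationalized, so the rule used here is an assumption: pick the target raw score whose predicted MBI raw score is nearest to the MBI raw cut-point. The dictionaries copy the predicted MBI raw scores from Table 3; the function name is hypothetical.

```python
# Predicted MBI raw scores from Table 3 (target raw score -> predicted anchor raw score).
PFI_WE_TO_MBI_EE = {
    0: 2.57, 1: 5.86, 2: 8.89, 3: 12.06, 4: 15.24, 5: 18.54, 6: 22.09,
    7: 25.73, 8: 29.29, 9: 32.65, 10: 35.80, 11: 38.72, 12: 41.43,
    13: 44.04, 14: 46.41, 15: 48.41, 16: 50.36,
}
PFI_ID_TO_MBI_DP = {
    0: 1.31, 1: 2.41, 2: 2.99, 3: 3.94, 4: 4.85, 5: 5.74, 6: 6.77,
    7: 7.89, 8: 9.01, 9: 10.30, 10: 11.62, 11: 12.88, 12: 14.22,
    13: 15.47, 14: 16.64, 15: 17.87, 16: 19.01, 17: 20.07, 18: 21.11,
    19: 22.09, 20: 23.01, 21: 23.94, 22: 24.83, 23: 25.58, 24: 26.69,
}
MZSIB_TO_MBI_EE = {1: 6.27, 2: 17.64, 3: 29.92, 4: 40.62, 5: 48.80}


def closest_target_score(crosswalk, anchor_raw_value):
    """Target raw score whose predicted anchor raw score is nearest to the given value."""
    return min(crosswalk, key=lambda score: abs(crosswalk[score] - anchor_raw_value))


pfi_we_cut = closest_target_score(PFI_WE_TO_MBI_EE, 27)  # MBI-EE cut-point >= 27
mzsib_cut = closest_target_score(MZSIB_TO_MBI_EE, 27)    # MBI-EE cut-point >= 27
pfi_id_cut = closest_target_score(PFI_ID_TO_MBI_DP, 10)  # MBI-DP cut-point >= 10

# MZSIB >= 3 expressed on the PFI-WE scale, via its predicted MBI-EE raw score.
pfi_we_for_mzsib_3 = closest_target_score(PFI_WE_TO_MBI_EE, MZSIB_TO_MBI_EE[3])
```

Under this assumed rule the sketch returns 7, 3, 9, and 8, matching the equivalences stated in the text. A nearest-t-score rule is a different choice and need not agree in every case, so the rule should be stated whenever other cut-points are derived.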

Reliability

Both the MBI-EE and PFI-WE scales demonstrated ≥0.70 reliability to assess a wide range of low and high emotional exhaustion levels on the MBI-EE t-score metric (Fig. 1a). The MZSIB scale showed less than 0.70 reliability to assess emotional exhaustion across the MBI-EE t-score metric. Both the MBI-DP and PFI-ID scales also demonstrated ≥0.70 reliability to assess a range of low and high depersonalization levels on the MBI-DP t-score metric (Fig. 1b). Compared to the PFI-WE scale, the MBI-EE scale possessed ≥0.70 reliability over a wider range of below average emotional exhaustion t-scores, whereas, compared to the MBI-DP scale, the PFI-ID scale possessed ≥0.70 reliability over a wider range of above average depersonalization t-scores.

Associations with Adverse Outcomes

All scales correlated with physician depressive symptoms, physician distress, and physicians’ intent to leave their practice or medicine within 2 years (Table 4). Among measures assessing the same underlying construct (i.e., the MBI-EE, PFI-WE, and MZSIB measures of emotional exhaustion and the MBI-DP and PFI-ID measures of depersonalization), there were no major differences in the magnitude of correlations between each burnout scale and depressive symptom, distress, and intent to leave outcomes (Table 4). The MBI-DP scale showed a modestly lesser correlation with intent to leave compared to the PFI-ID scale.

Table 4
Correlation Analysis of Each Scale’s Raw Scores with Adverse Outcomes

| Outcome | Emotional Exhaustion Measures [a] | | | Depersonalization Measures [a] | |
|---|---|---|---|---|---|
| | MBI-EE | PFI-WE | MZSIB | MBI-DP | PFI-ID |
| Depressive symptoms | 0.63 | 0.64 | 0.59 | 0.53 | 0.54 |
| Distress [b] | 0.71 | 0.71 | 0.70 | 0.60 | 0.60 |
| Intent to leave one’s practice (attending) or medicine (trainee) in two years | 0.18 | 0.21 | 0.21 | 0.14 | 0.20 |

[a] All correlations are Spearman correlations; all correlations are significant at p < 0.05.
[b] Defined by burnout, depression, mental quality of life, physical quality of life, stress, and fatigue.

DISCUSSION

Healthcare organizations across the US are monitoring physician burnout as an indicator of health system performance.9 Common applications of physician burnout measurement as a performance indicator are to make inferences regarding the quality of physicians’ medical practice environments, workforce sustainability, and healthcare quality.9 Yet, comparisons of performance over time, across organizations, or across studies are not possible when different burnout measures have been employed. In this study, we used IRT linking to place common burnout measures—the PFI and MZSIB—onto the metric of the MBI, and created crosswalks that map raw scores on the PFI-WE, PFI-ID, and MZSIB scales to corresponding MBI subscale scores. For scales linked to the same metric, we identified the closest cut-point equivalencies across all linked metrics and compared the reliability across linked outcome metrics.
By linking the PFI, MZSIB, and MBI to the same metric, the crosswalks we produced allow investigators using these measures to make several useful comparisons.25 First, investigators can compare summary sample scores across the PFI, MZSIB, and MBI. That is, using the crosswalks produced in this study, group-level emotional exhaustion scores can be compared across the MBI-EE, PFI-WE, and MZSIB scales, and group-level depersonalization scores can be compared across the MBI-DP or PFI-ID scales.25 Second, investigators can use the crosswalks to calculate emotional exhaustion/depersonalization rates across metrics by substituting respondents’ raw (total) scores on the PFI or MZSIB with the corresponding MBI t-score. The corresponding MBI t-scores can then be used to calculate the percent of physicians scoring at or above a selected MBI cut-point. The substituted MBI scores can be further analyzed in descriptive and inferential analyses.25 The crosswalks can also be used to calculate emotional exhaustion/depersonalization rates across metrics using only aggregated data. In Supplemental Appendix 10, we demonstrate how to calculate emotional exhaustion/depersonalization rates on the MBI metric using frequency tables of physicians’ raw scores on the PFI. The crosswalks can facilitate comparisons of burnout scores/rates across organizations using different measures, within organizations using different measures over time, and to published regional/national benchmarks. The use of our crosswalks to convert burnout scores from different measures to a common metric may also improve comparative effectiveness and meta-analysis research by reducing error associated with the use of different scales across studies.25, 37
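As an illustration of the aggregated-data use described above, the following sketch converts a frequency table of PFI-WE raw scores to the MBI-EE t-score metric using the Table 3 crosswalk, then computes a group mean t-score and the percentage at or above a t-score threshold. It is a simplified sketch, not the procedure in Supplemental Appendix 10, which is not reproduced here and may differ. The frequency table is invented, the function names are hypothetical, and the threshold of 50.70 is the t-score reported in the text for MBI-EE raw ≥27.

```python
# Predicted MBI-EE t-scores for each PFI-WE raw score, copied from Table 3.
PFI_WE_TO_MBI_EE_T = {
    0: 30.15, 1: 34.96, 2: 38.09, 3: 40.74, 4: 43.10, 5: 45.34, 6: 47.59,
    7: 49.81, 8: 51.98, 9: 54.13, 10: 56.33, 11: 58.55, 12: 60.77,
    13: 63.11, 14: 65.71, 15: 68.75, 16: 73.18,
}
MBI_EE_CUT_T = 50.70  # t-score reported for the MBI-EE raw cut-point of >= 27


def group_mean_t(frequencies, crosswalk):
    """Frequency-weighted mean of the crosswalked t-scores."""
    total = sum(frequencies.values())
    return sum(crosswalk[score] * n for score, n in frequencies.items()) / total


def percent_at_or_above(frequencies, crosswalk, cut_t):
    """Percentage of respondents whose crosswalked t-score is >= cut_t."""
    total = sum(frequencies.values())
    above = sum(n for score, n in frequencies.items() if crosswalk[score] >= cut_t)
    return 100 * above / total


# Hypothetical organization: PFI-WE raw score -> number of physicians.
pfi_we_counts = {0: 5, 2: 10, 4: 20, 6: 25, 7: 10, 8: 15, 10: 10, 12: 5}

mean_t = group_mean_t(pfi_we_counts, PFI_WE_TO_MBI_EE_T)
rate = percent_at_or_above(pfi_we_counts, PFI_WE_TO_MBI_EE_T, MBI_EE_CUT_T)
```

Note that a strict threshold at 50.70 counts only PFI-WE raw scores of 8 or more, whereas the closest cut-point equivalence reported in this study is PFI-WE ≥7 (predicted t-score 49.81). The two choices give different rates, so comparisons should state which was used.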
Our reliability assessment provides important information regarding the psychometric performance of each measure, each of which has its own strengths and weaknesses that should be considered within the intended purpose of an organization’s assessment.9 For example, the MBI-EE scale provides >0.90 reliability to assess a wide range of emotional exhaustion levels, but at the cost of additional items. With less than half the items of the MBI-EE, the PFI-WE scale offers >0.80 reliability to assess a similar range of above-average emotional exhaustion levels as the MBI-EE scale, but has less precision at below-average emotional exhaustion levels than the MBI-EE scale. Similarly, with only one item, the MZSIB offers the least response burden but has less precision to assess emotional exhaustion than the MBI-EE and PFI-WE scales (an expected result given the MZSIB was originally designed as a brief screening tool, not an outcome assessment). However, this level of precision may be sufficient, for example, if the intended purpose of assessment is for screening followed by additional assessment, or to predict the risk of occupational outcomes of depression symptoms, distress, or intent to leave one’s practice at the group level. The PFI-ID scale offers the most reliable assessment of depersonalization across the widest range of depersonalization levels, with one additional item compared to the MBI-DP scale. We should note that, to our knowledge, this is the first assessment of the MZSIB’s reliability (as internal consistency reliability is not applicable to single-item scales and test-retest reliability has not yet been investigated for this measure).
All scales showed significant correlations with important, adverse outcomes, including physician depression, distress, and intent to leave. The association between each measure and each adverse outcome underscores the importance of including measures of physician burnout in institutional assessments.
To our knowledge, this is the first study to crosswalk common measures of burnout among US physicians. Strengths of this study include the use of a single-group linking design (permitting the direct comparison of physicians’ actual MBI scores to those predicted by linking to determine the accuracy of our results) and the use and agreement of two different linking methods.
However, this study has several limitations. First, because the MBI-EE and MBI-DP metrics to which the PFI and MZSIB are linked were derived from a prior IRT analysis of 2014 MBI data from the Shanafelt et al. (2015) national physician burnout prevalence study,29, 30 the mean of each MBI anchor scale is fixed to the mean EE and DP scores of US physicians in 2014. Therefore, when interpreting a score on a target scale relative to its SDs above/below the mean score on its MBI anchor scale, it should be known that the comparison is relative to the underlying mean MBI score of US physicians in 2014. Despite this limitation, the crosswalks remain valid assuming that the MBI subscales function equivalently across the 2014 US physician sample and US general physician population. Second, although our findings provide support for the invariance of our crosswalks across early and late responder groups and, therefore, provide potential support for the representativeness of our sample, this support relies on the assumption that late responders are an adequate proxy for non-respondents. Nevertheless, several studies have demonstrated no significant differences in burnout estimates across respondent and non-respondent groups, despite the low response rates that are common in physician survey research.1, 38 Third, we chose to highlight the closest cut-point equivalencies across linked measures using commonly used cut-points on each metric. Because raw scores on each target metric are linked to continuous scores on each anchor metric, the closest cut-point equivalencies across metrics are an approximation. Although we identified the closest cut-point equivalency for scores ≥27 and ≥10 on the respective MBI-EE and MBI-DP scales, investigators can use crosswalks published in Brady et al. 29 in conjunction with the crosswalks presented herein to identify cut-point equivalencies on the PFI and MZSIB at other MBI raw score cut-points.
It is important to note that the crosswalk tables rendered with this research allow reasonable approximate translation of aggregate, group-level scores from one measure of burnout to another. They are not intended to translate individual-level respondent scores from one measure of burnout to another, and attempting to do so would produce unreliable results. In addition, it is important to note that crosswalking scores from one measure of burnout to another is only appropriate across measures that assess the same construct. A measure of emotional exhaustion (such as the MZSIB) cannot be crosswalked to derive an equivalent score on a metric of depersonalization.

CONCLUSIONS

As US healthcare organizations are increasingly measuring physician burnout as an indicator of health system performance, there is a need to compare burnout outcomes across different assessments. Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

Declarations

Conflict of Interest

Dr. Shanafelt is co-inventor of the Well-being Index instruments and the Participatory Management Leadership Index. Mayo Clinic holds the copyright for these instruments and has licensed them for use outside of Mayo Clinic. Dr. Shanafelt receives a portion of any royalties paid to Mayo Clinic. Dr. Linzer is supported in part through grants to Hennepin Healthcare from the AMA, Institute for Healthcare Improvement, the American Board of Internal Medicine Foundation, and the American College of Physicians for research and training in burnout prevention. All other authors report no conflicts of interest.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Literatur
1. Shanafelt TD, West CP, Sinsky C, et al. Changes in burnout and satisfaction with work-life integration in physicians and the general US working population between 2011 and 2017. Mayo Clin Proc 2019;94(9):1681-1694.
2. Maslach C, Jackson SE. The measurement of experienced burnout. J Occup Behav 1981;2(2):99-113.
3. Shanafelt TD, Boone S, Tan L, et al. Burnout and satisfaction with work-life balance among US physicians relative to the general US population. Arch Intern Med 2012;172(18):1377-1385.
4. Dyrbye LN, Shanafelt TD, Sinsky CA, Cipriano PF, Bhatt J, Ommaya A, West CP, Meyers D. Burnout among health care professionals: A call to explore and address this underrecognized threat to safe, high-quality care. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2019; https://doi.org/10.31478/201707b
5. Tawfik DS, Scheid A, Profit J, et al. Evidence Relating Health Care Provider Burnout and Quality of Care: A Systematic Review and Meta-analysis. Ann Intern Med 2019;171(8):555-567.
6. Han S, Shanafelt TD, Sinsky CA, et al. Estimating the Attributable Cost of Physician Burnout in the United States. Ann Intern Med 2019;170(11):784-790.
7. Jha A, Ilif A, Chaoui A. A crisis in health care: a call to action on physician burnout. In: Massachusetts Medical Society. Available at: http://www.massmed.org/Publications/Research,-Studies,-and-Reports/A-Crisis-in-Health-Care--A-Call-to-Action-on--Physician-Burnout/#.X99sCJNKh_k. Accessed June 19, 2020.
8. Dzau VJ, Kirch DG, Nasca TJ. To Care Is Human — Collectively Confronting the Clinician-Burnout Crisis. N Engl J Med 2018;378(4):312-314.
9. Brady KJS, Kazis LE, Sheldrick RC, Ni P, Trockel MT. Selecting Physician Well-Being Measures to Assess Health System Performance and Screen for Distress: Conceptual and Methodological Considerations. Curr Probl Pediatr Adolesc Health Care 2019;49(12):100662.
10. Dyrbye LN, Meyers D, Ripp J, Dalal N, Bird SB, Sen S. A Pragmatic Approach for Organizations to Measure Health Care Professional Well-Being. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2018; https://doi.org/10.31478/201810b
11. Shanafelt TD, Noseworthy JH. Executive leadership and physician well-being: nine organizational strategies to promote engagement and reduce burnout. Mayo Clin Proc 2017;92(1):129-146.
14. National Academies of Sciences, Engineering, and Medicine. Taking Action Against Clinician Burnout: A Systems Approach to Professional Well-Being. The National Academies Press; 2019.
16. Trockel M, Bohman B, Lesure E, et al. A Brief Instrument to Assess Both Burnout and Professional Fulfillment in Physicians: Reliability and Validity, Including Correlation with Self-Reported Medical Errors, in a Sample of Resident and Practicing Physicians. Acad Psychiatry 2017;42(1):11-24.
17. Konrad TR, Williams ES, Linzer M, et al. Measuring physician job satisfaction in a changing workplace and a challenging environment. Med Care 1999;37(11):1174-1182.
18. Kolen MJ, Brennan RL. Test equating, scaling, and linking: Methods and practices. Springer Science & Business Media; 2014.
19. Maslach C, Jackson S, Leiter M. Maslach Burnout Inventory Manual. 3rd ed. Consulting Psychologists Press; 1996.
20. Williams ES, Konrad TR, Linzer M, et al. Refining the measurement of physician job satisfaction: results from the Physician Worklife Survey. SGIM Career Satisfaction Study Group. Society of General Internal Medicine. Med Care 1999;37(11):1140-1154.
21. Pilkonis PA, Choi SW, Reise SP, Stover AM, Riley WT, Cella D. Item Banks for Measuring Emotional Distress From the Patient-Reported Outcomes Measurement Information System (PROMIS®): Depression, Anxiety, and Anger. Assessment 2011;18(3):263-283.
22. Dyrbye LN, Szydlo DW, Downing SM, Sloan JA, Shanafelt TD. Development and preliminary psychometric properties of a well-being index for medical students. BMC Med Educ 2010;10(1):8.
23. Dyrbye LN, Schwartz A, Downing SM, Szydlo DW, Sloan JA, Shanafelt TD. Efficacy of a brief screening tool to identify medical students in distress. Acad Med 2011;86(7):907-914.
24. Dyrbye LN, Satele D, Sloan J, Shanafelt TD. Utility of a brief screening tool to identify physicians in distress. J Gen Intern Med 2013;28(3):421-427.
25. Choi SW, Schalet B, Cook KF, Cella D. Establishing a Common Metric for Depressive Symptoms: Linking the BDI-II, CES-D, and PHQ-9 to PROMIS Depression. Psychol Assess 2014;26(2):513-527.
27. Rotenstein LS, Torre M, Ramos MA, et al. Prevalence of burnout among physicians: A systematic review. JAMA 2018;320(11):1131-1150.
28. Dorans NJ, Holland PW. Population invariance and the equatability of tests: Basic theory and the linear case. J Educ Meas 2000;37(4):281-306.
29. Brady KJS, Ni P, Sheldrick RC, Trockel MT, Shanafelt T, Rowe SG, Schneider JI, Kazis LE. Describing the Emotional Exhaustion, Depersonalization, and Low Personal Accomplishment Symptoms Associated with Maslach Burnout Inventory Subscale Scores in US Physicians. J Patient Rep Outcomes 2020;4(1):1-14.
30. Shanafelt TD, Hasan O, Dyrbye LN, et al. Changes in Burnout and Satisfaction With Work-Life Balance in Physicians and the General US Working Population Between 2011 and 2014. Mayo Clin Proc 2015;90(12):1600-1613.
34. Albano AD. equate: An R package for observed-score linking and equating. J Stat Softw 2016;74(8):1-36.
35. Chalmers P. mirt: A Multidimensional Item Response Theory Package for the R Environment. J Stat Softw 2012;48(6):1-29.
36. Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J Stat Softw 2012;48(2):1-36.
37. Lai J-S, Cella D, Yanez B, Stone A. Linking fatigue measures on a common reporting metric. J Pain Symptom Manag 2014;48(4):639-648.
38. Simonetti JA, Clinton WL, Taylor L, et al. The impact of survey nonresponse on estimates of healthcare employee burnout. Healthcare 2020;8(3):100451.
Metadata
Title
Establishing Crosswalks Between Common Measures of Burnout in US Physicians
Authors
Keri J. S. Brady, PhD, MPH
Pengsheng Ni, MD, MPH
Lindsey Carlasare, MBA
Tait D. Shanafelt, MD
Christine A. Sinsky, MD
Mark Linzer, MD
Martin Stillman, MD, JD
Mickey T. Trockel, MD, PhD
Publication date
31 March 2021
Publisher
Springer International Publishing
Published in
Journal of General Internal Medicine, Issue 4/2022
Print ISSN: 0884-8734
Electronic ISSN: 1525-1497
DOI
https://doi.org/10.1007/s11606-021-06661-4
