nach oben

Erschienen in:

Open Access 31.03.2021 | Original Research

Establishing Crosswalks Between Common Measures of Burnout in US Physicians

verfasst von: Keri J. S. Brady, PhD, MPH, Pengsheng Ni, MD, MPH, Lindsey Carlasare, MBA, Tait D. Shanafelt, MD, Christine A. Sinsky, MD, Mark Linzer, MD, Martin Stillman, MD, JD, Mickey T. Trockel, MD, PhD

Erschienen in: Journal of General Internal Medicine | Ausgabe 4/2022

Abstract

Background

Physician burnout is often assessed by healthcare organizations. Yet, scores from different burnout measures cannot currently be directly compared, limiting the interpretation of results across organizations or studies.

Objective

To link common measures of burnout to a single metric in psychometric analyses such that group-level scores from different assessments can be compared.

Design

Cross-sectional survey.

Setting

US practices.

Participants

A total of 1355 physicians sampled from the American Medical Association Physician Masterfile.

Main Measures

We linked the Stanford Professional Fulfillment Index (PFI) and Mini-Z Single-Item Burnout (MZSIB) scale to the Maslach Burnout Inventory (MBI) in item response theory (IRT) fixed-calibration and equipercentile analyses and created crosswalks mapping PFI and MZSIB scores to corresponding MBI scores. We evaluated the accuracy of the results by comparing physicians’ actual MBI scores to those predicted by linking and described the closest cut-point equivalencies across scales linked to the same MBI subscale using the resulting crosswalks.

Key Results

IRT linking produced the most accurate results and was used to create crosswalks mapping (1) PFI Work Exhaustion (PFI-WE) and MZSIB scores to MBI Emotional Exhaustion (MBI-EE) scores and (2) PFI Interpersonal Disengagement (PFI-ID) scores to MBI Depersonalization (MBI-DP) scores. The commonly used MBI-EE raw score cut-point of ≥27 corresponded most closely with respective PFI-WE and MZSIB raw score cut-points of ≥7 and ≥3. The commonly used MBI-DP raw score cut-point of ≥10 corresponded most closely with a PFI-ID raw score cut-point of ≥9.

Conclusions

Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

ESM 1 (DOCX 80.7 kb)

Supplementary Information

The online version contains supplementary material available at https://doi.org/10.1007/s11606-021-06661-4.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

IRT

Item response theory

MBI

Maslach Burnout Inventory-Human Services Survey for Medical Personnel

MBI-EE

Maslach Burnout Inventory-Human Services Survey for Medical Personnel Emotional Exhaustion scale

MBI-DP

Maslach Burnout Inventory-Human Services Survey for Medical Personnel Depersonalization scale

MZSIB

Mini-Z Single-Item Burnout scale

PFI

Stanford Professional Fulfillment Index

PFI-WE

Stanford Professional Fulfillment Index Work Exhaustion scale

PFI-ID

Stanford Professional Fulfillment Index Interpersonal Disengagement scale

INTRODUCTION

In the US, burnout is more common in physicians than in workers in other fields,¹ and is characterized by work-related feelings of exhaustion and depersonalization or interpersonal disengagement.^{2, 3} Physician burnout is associated with poor physician health outcomes, reduced quality of care, and at least 4.6 billion dollars in excess health system costs annually.^4‐6 In an effort to curb physician burnout,^{7, 8} health systems across the nation are integrating measures of burnout into routine organizational assessments to monitor system functioning and evaluate the effectiveness of practice changes designed to improve physician well-being.^9‐11 This practice is recommended in the National Academy of Medicine’s consensus report on clinician burnout and regarded by healthcare leaders as a basic first step to addressing the problem.^{7, 10‐14}

With the widespread adoption of physician burnout assessment within US healthcare systems has come the problem of comparing outcomes across different burnout measures. With several validated options available that vary in length and cost, a number of different measures are currently in use in the US,^{9, 10} including the Maslach Burnout Inventory-Human Services Survey for Medical Personnel (MBI),¹⁵ Stanford Professional Fulfillment Index (PFI),¹⁶ and the Mini-Z Single-Item Burnout (MZSIB) scale.¹⁷ When two different burnout measures are used across organizations or within an organization over time, the scores are not comparable unless they are placed onto the same metric, or “linked,” in psychometric analyses. To date, no studies to our knowledge have linked common measures of physician burnout onto a single metric, which would allow healthcare organizations to compare burnout scores/rates across different measures.

The primary aim of this study was to link the PFI and MZSIB to the MBI metric and create crosswalks that map scores from the PFI and MZSIB to corresponding scores on the MBI. Using the crosswalks, we aimed to describe the closest cut-point equivalences for scales linked to the same metric. Our secondary aim was to examine the psychometric properties of scales linked to the same metric, including each scale’s reliability and associations with relevant adverse outcomes.

METHODS

Linking refers to the statistical process of placing two or more measures with different content and/or construct severity levels onto the same scale.¹⁸ Through this process, a relationship is established between the linked measures, such that for each score on Burnout Measure A, an equivalent score (within standard error) on Burnout Measure B is established.

Design and Participants

This study used a single-group linking design, whereby items from each burnout instrument were administered in a confidential, cross-sectional survey to all respondents from February to March 2019. To obtain a representative convenience sample, we randomly sampled physicians of all ages, sexes, and specialties from the American Medical Association Physician Masterfile. Physicians were emailed the survey and offered a small financial incentive to participate. The survey was administered in waves until we reached a target sample size of ≥1200 respondents, which was estimated as the minimum sample size needed for item response theory linking analyses. Physicians (including postgraduate trainees) practicing in the US at the time of the survey were eligible for inclusion.

Measures

We measured physician burnout using the MBI 9-item Emotional Exhaustion (MBI-EE) and 5-item Depersonalization (MBI-DP) subscales (0 = never, 1 = a few times a year or less, 2 = once a month or less, 3 = a few times a month, 4 = once a week, 5 = a few times a week, 6 = every day); the PFI 4-item Work Exhaustion (PFI-WE) and 6-item Interpersonal Disengagement (PFI-ID) subscales (0 = not at all, 1 = very little, 2 = moderately, 3 = a lot, 4 = extremely); and the single-item MZSIB (1 = no burnout; 2 = under stress; 3 = have one or more burnout symptom; 4 = burnout won’t go away; 5 = completely burned out; see Supplemental Appendix 1 for the complete MZSIB response options).¹⁷ The sequence in which each instrument was administered was randomized to prevent ordering effects.

The MBI and PFI are outcome measures, whereas the MZSIB scale is a screening measure. Commonly used raw (total) score cut-points for each scale are ≥27, ≥10, and ≥3 on the MBI-EE, MBI-DP, and MZSIB scales, respectively.^{1, 19, 20} The raw (total) score cut-point for the PFI Burnout Composite (PFI-BC) Scale is ≥14.¹⁶ Cut-points for PFI-WE and PFI-ID subscales have not been published and are identified in the current study.¹⁶

We also assessed physicians’ demographics, depressive symptoms (4-item PROMIS depression measure),²¹ distress as measured by the original, 7-item Physician Well-Being Index (WBI),^22‐24 and intent to leave one’s current practice or intent to leave medicine (for attending physicians and postgraduate trainees, respectively) in the next 2 years (1 item). ¹⁷ All measures were scored such that higher scores indicate more of each construct.

Linking Analyses

Our methods were informed by those used in the PROsetta Stone Project.^{25, 26} Scales were linked in item sets, consisting of two scales: a target measure and an anchor measure. In linking analyses, a target measure is linked to an anchor measure, which places the target measure onto the metric of the anchor measure. Because the MBI is historically the most common physician burnout assessment,²⁷ we selected the MBI-EE and MBI-DP scales as anchor measures. Target measures included the PFI-WE, PFI-ID, and MZSIB scales.

Prior to conducting linking analyses, we qualitatively and quantitatively examined the degree to which the scales that we aimed to link assess essentially the same construct, a key assumption of linking.^{18, 28} Scales assessing essentially the same construct were expected to (1) have very similar item content as determined by two independent subject domain expert raters (TS, ML); (2) be highly correlated (inter-scale Pearson’s r of ≥0.75); and (3) be essentially unidimensional as determined in confirmatory factor analyses (CFAs) (see Supplemental Appendix 2 for additional assumption assessment details).²⁵

For each item set, we conducted item response theory (IRT) fixed-calibration linking and equipercentile linking analyses using a fivefold cross validation process (Supplemental Appendix 3). In IRT linking, raw (total) scores on each target measure were linked to t-scores on each MBI anchor scale. A t-score is a standardized score ranging from 0 to 100, with a mean score and standard deviation equal to 50 and 10, respectively. T-scores on each MBI anchor scale were then mapped to corresponding MBI raw scores. In our IRT linking analyses, we derived the MBI-EE and MBI-DP anchor metrics from a prior IRT calibration of the MBI in a 2014 national sample of US physicians.^{29, 30} In equipercentile linking, the MBI metric was derived from the primary survey data collected in this study. We evaluated the accuracy of each linking method for each item set by calculating the correlation, mean difference, and standard deviation (SD) of the difference between physicians’ predicted and actual t-scores on the MBI anchor scale, using pooled predicted and actual t-scores produced from a fivefold cross validation process. The method that yielded the highest correlations, lowest mean differences, and lowest SD of difference across all item sets was used to create a crosswalk mapping raw scores on the target measure to corresponding t-scores and raw scores on the MBI anchor measure. Once each item set was linked, we (1) identified the closest cut-point equivalencies across scales linked to the same metric and (2) described the reliability of scales linked to the same metric (Supplemental Appendix 4).³¹ We used the Brady et al. ²⁹ IRT analysis to identify the t-scores corresponding with (1) each MBI-EE and MBI-DP raw score cut-point and (2) each raw score on the MBI predicted by equipercentile linking.

Finally, we computed correlations between each scale and measures of physician depressive symptoms, distress, and intent to leave to compare the magnitude of each scale’s associations with these outcomes. Analyses were conducted in R (v3.5.1) psych, lavaan, mirt, and equate packages.^32‐36 This study was approved by the University of Illinois at Chicago Institutional Review Board.

RESULTS

Sample

The overall sample included 1355 US physicians (Table 1). The most common demographic characteristics of respondents were White race, male sex, non-primary care specialty, and <44 years of age. Thirty-one percent of respondents were trainees. In subgroup invariance analyses, we found support for the invariance of our linking results across early versus late responders (where late responders were used as a proxy for non-responders; Supplemental Appendix 5, Table 5.5). Overall, mean raw scores on the MBI-EE, PFI-WE, MZSIB, MBI-DP, and PFI-ID scales were 21.82, 6.06, 2.45, 7.86, and 6.63, respectively (Table 2) (see Supplemental Appendix 6 for specialty-level descriptive scale statistics).

Table 1

Overall Sample Characteristics (n = 1355)

Characteristic	n (%)^a
Sex
Male	763 (57)
Female	579 (43)
Missing	13 (0.1)
Age group
<35 years	440 (33)
35–44 years	385 (28)
45–54 years	243 (18)
55–64 years	193 (14)
≥65 years	94 (7)
Race
White/Caucasian	894 (66)
Black/African American	54 (4)
Asian	292 (22)
Other	115 (9)
Trainee status
Trainee (resident/fellow)	420 (31)
Non-trainee	935 (69)
Primary care
Primary care^b	442 (33)
Non-primary care	913 (67)
Specialty
Anesthesiology	97 (7)
Dermatology	24 (2)
Emergency medicine	74 (6)
Family medicine	167 (12)
General surgery	62 (5)
General surgery subspecialty	71 (5)
General internal medicine	184 (14)
General pediatrics	91 (7)
Internal medicine-subspecialty	127 (9)
Neurology	28 (2)
Obstetrics and gynecology	96 (7)
Ophthalmology	30 (2)
Other	81 (6)
Pathology	4 (0.3)
Pediatric subspecialty	63 (5)
Physical medicine	13 (1)
Psychiatry	90 (7)
Radiology	53 (4)
Practice type
Non-governmental hospital	473 (35)
Group practice	404 (30)
City/county/state/federal government hospital	130 (10)
Self-employed solo practice	97 (7)
Other^c	250 (18)
Missing	1 (0.1)

^aPercentages may not add to 100 due to rounding. Missingness is only specified for variables that had missing data. ^bIncludes physicians in general internal medicine, pediatrics, and family medicine specialties. ^cIncludes physicians practicing in/as HMO, locum tenens, medical school, two physician practice (full or part owner), other patient care, city/county/state government non-hospital setting, and no classification

Table 2

Overall Descriptive Scale Statistics by Domain and Measure (n = 1346)

Domain/measure	Statistic^a
Emotional exhaustion
MBI-EE, mean (SD)	21.82 (12.16)
MBI-EE ≥27, n (%)	469 (34.8)
PFI-WE, mean (SD)	6.06 (3.46)
PFI-WE ≥7, n (%)	582 (43.2)
MZSIB, mean (SD)	2.45 (0.92)
MZSIB ≥3, n (%)	589 (43.8)
Depersonalization
MBI-DP, mean (SD)	7.86 (6.41)
MBI-DP ≥10, n (%)	458 (34.0)
PFI-ID, mean (SD)	6.63 (4.77)
PFI-ID ≥9, n (%)	470 (34.9)
Burnout
MBI (EE ≥27 and/or DP ≥10), n (%)	584 (43.4)
PFI-BC^b, mean (SD)^b	12.68 (7.63)
PFI-BC^b ≥14, n (%)^b	599 (44.5)

^aIncludes respondents with ≤1 missing item response for all scales. Cut-points presented are raw total scores on each scale. ^bPFI BC refers to the PFI Burnout Composite Scale, which is scored as the total raw score from both the PFI-WE and PFI-ID scales

Assumption Assessment

In qualitative evaluations of each target and anchor scale’s item content overlap, both raters agreed that the following item sets assess essentially the same underlying construct: PFI-WE and MBI-EE (item set 1), PFI-ID and MBI-DP (item set 2), and MZSIB and MBI-EE (item set 3). Inter-scale correlations between the target and anchor scales in item sets 1–3 were 0.80, 0.76, and 0.76, respectively. Item sets 1–3 met all other linking assumptions in quantitative analyses (Supplemental Appendix 5).

Crosswalks and Closest Cut-Point Equivalents

Overall, IRT (versus equipercentile) linking produced the most accurate results (Supplemental Appendices 7 - 9) and was used to create crosswalks mapping raw scores on the PFI-WE, PFI-ID, and MZSIB (target) scales to corresponding t-scores and raw scores on their respective MBI-EE, MBI-DP, and MBI-EE anchor scales (Table 3).

The commonly used raw score cut-point of ≥27 (t-score = 50.70) ²⁹ on the MBI-EE scale corresponded most closely with raw score cut-points of ≥7 and ≥3 on the respective PFI-WE and MZSIB scales (Table 3). The commonly used raw score cut-point of ≥10 (t-score = 53.76) ²⁹ on the MBI-DP scale corresponded most closely with a raw score cut-point of ≥9 on the PFI-ID scale. The raw score cut-point of ≥3 on the MZSIB scale corresponded most closely with a raw score of ≥8 on the PFI-WE scale.

Table 3

Crosswalks Produced from IRT Linking Mapping Raw Scores from the PFI and MZSIB to Corresponding Predicted MBI T-scores and Raw Scores

Item Set 1: PFI Work Exhaustion (PFI-WE) Scale (target scale) linked to MBI Emotional Exhaustion (MBI-EE) Scale (anchor scale)^a			Item Set 2: PFI Interpersonal Disengagement (PFI-ID) Scale (target scale) linked to MBI Depersonalization (MBI-DP) Scale (anchor scale)^a			Item Set 3: Mini-Z Single-Item Burnout (MZSIB) Scale (target scale) linked to MBI Emotional Exhaustion (MBI-EE) Scale (anchor scale)^a
PFI-WE raw (total) score	Predicted MBI-EE T-score (SE)	Predicted MBI-EE raw (total) score	PFI-ID Scale raw (total) score	Predicted MBI-DP Scale T-score (SE)	Predicted MBI-DP raw (total) score	MZSIB item raw score	Predicted MBI-EE Scale T-score (SE)	Predicted MBI-EE raw score
0	30.15 (4.93)	2.57	0	35.46 (5.36)	1.31	1	35.44 (6.26)	6.27
1	34.96 (3.79)	5.86	1	40.99 (3.59)	2.41	2	44.75 (5.35)	17.64
2	38.09 (3.52)	8.89	2	42.76 (3.58)	2.99	3	52.37 (5.05)	29.92
3	40.74 (3.39)	12.06	3	45.06 (3.18)	3.94	4	60.09 (5.60)	40.62
4	43.10 (3.33)	15.24	4	46.90 (3.21)	4.85	5	69.49 (6.30)	48.80
5	45.34 (3.32)	18.54	5	48.51 (3.10)	5.74
6	47.59 (3.32)	22.09	6	50.19 (3.02)	6.77
7	49.81 (3.30)	25.73	7	51.81 (3.07)	7.89
8	51.98 (3.29)	29.29	8	53.30 (3.10)	9.01
9	54.13 (3.30)	32.65	9	54.91 (3.07)	10.30
10	56.33 (3.31)	35.80	10	56.47 (3.11)	11.62
11	58.55 (3.31)	38.72	11	57.93 (3.07)	12.88
12	60.77 (3.32)	41.43	12	59.47 (3.04)	14.22
13	63.11 (3.38)	44.04	13	60.92 (3.10)	15.47
14	65.71 (3.49)	46.41	14	62.31 (3.10)	16.64
15	68.75 (3.75)	48.41	15	63.85 (3.08)	17.87
16	73.18 (4.70)	50.36	16	65.35 (3.09)	19.01
			17	66.84 (3.02)	20.07
			18	68.40 (2.98)	21.11
			19	69.95 (3.02)	22.09
			20	71.52 (3.05)	23.01
			21	73.28 (3.03)	23.94
			22	75.24 (3.16)	24.83
			23	77.17 (3.21)	25.58
			24	80.61 (3.98)	26.69

^aBolded values are those that are closest to the mean on the corresponding MBI anchor metric. Crosswalks were generated using item response theory fixed-calibration linking based on MBI item parameter estimates established in prior IRT analysis of MBI data from a 2014 national physician sample.²⁹ Note that item set 2 is not on the same metric as item sets 1 and 3. Therefore, item set 2 cannot be compared with item sets 1 and 3

Reliability

Both the MBI-EE and PFI-WE scales demonstrated ≥0.70 reliability to assess a wide range of low and high emotional exhaustion levels on the MBI-EE t-score metric (Fig. 1a). The MZSIB scale showed less than 0.70 reliability to assess emotional exhaustion across the MBI-EE t-score metric. Both the MBI-DP and PFI-ID scales also demonstrated ≥0.70 reliability to assess a range of low and high depersonalization levels on the MBI-DP t-score metric (Fig. 1b). Compared to the PFI-WE scale, the MBI-EE scale possessed ≥0.70 reliability over a wider range of below average emotional exhaustion t-scores, whereas, compared to the MBI-DP scale, the PFI-ID scale possessed ≥0.70 reliability over a wider range of above average depersonalization t-scores.

Associations with Adverse Outcomes

All scales correlated with physician depressive symptoms, physician distress, and physicians’ intent to leave their practice or medicine within 2 years (Table 4). Among measures assessing the same underlying construct (i.e., the MBI-EE, PFI-WE, and MZSIB measures of emotional exhaustion and the MBI-DP and PFI-ID measures of depersonalization), there were no major differences in the magnitude of correlations between each burnout scale and depressive symptom, distress, and intent to leave outcomes (Table 4). The MBI-DP scale showed a modestly lesser correlation with intent to leave compared to the PFI-ID scale.

Table 4

Correlation Analysis of Each Scale’s Raw Scores with Adverse Outcomes

Outcome	Emotional Exhaustion Measures^a			Depersonalization Measures^a
Outcome	MBI-EE	PFI-WE	MZSIB	MBI-DP	PFI-ID
Depressive symptoms	0.63	0.64	0.59	0.53	0.54
Distress^b	0.71	0.71	0.70	0.60	0.60
Intent to leave one’s practice (attending) or medicine (trainee) in two years	0.18	0.21	0.21	0.14	0.20

^aAll correlations are Spearman correlations; all correlations are significant at p < 0.05; ^b defined by burnout, depression, mental quality of life, physical quality of life, stress, and fatigue

DISCUSSION

Healthcare organizations across the US are monitoring physician burnout as an indicator of health system performance.⁹ Common applications of physician burnout measurement as a performance indicator are to make inferences regarding the quality of physicians’ medical practice environments, workforce sustainability, and healthcare quality.⁹ Yet, comparisons of performance over time, across organizations, or across studies are not possible when different burnout measures have been employed. In this study, we used IRT linking to place common burnout measures—the PFI and MZSIB—onto the metric of the MBI, and created crosswalks that map raw scores on the PFI-WE, PFI-ID, and MZSIB scales to corresponding MBI subscale scores. For scales linked to the same metric, we identified the closest cut-point equivalencies across all linked metrics and compared the reliability across linked outcome metrics.

By linking the PFI, MZSIB, and MBI to the same metric, the crosswalks we produced allow investigators using these measures to make several useful comparisons.²⁵ First, investigators can compare summary sample scores across the PFI, MZSIB, and MBI. That is, using the crosswalks produced in this study, group-level emotional exhaustion scores can be compared across the MBI-EE, PFI-WE, and MZSIB scales, and group-level depersonalization scores can be compared across the MBI-DP or PFI-ID scales.²⁵ Second, investigators can use the crosswalks to calculate emotional exhaustion/depersonalization rates across metrics by substituting respondents’ raw (total) scores on the PFI or MZSIB with the corresponding MBI t-score. The corresponding MBI t-scores can then be used to calculate the percent of physicians scoring at or above a selected MBI cut-point. The substituted MBI scores can be further analyzed in descriptive and inferential analyses.²⁵ The crosswalks can also be used to calculate emotional exhaustion/depersonalization rates across metrics using only aggregated data. In Supplemental Appendix 10, we demonstrate how to calculate emotional exhaustion/depersonalization rates on the MBI metric using frequency tables of physicians’ raw scores on the PFI. The crosswalks can facilitate comparisons of burnout scores/rates across organizations using different measures, within organizations using different measures over time, and to published regional/national benchmarks. The use of our crosswalks to convert burnout scores from different measures to a common metric may also improve comparative effectiveness and meta-analysis research by reducing error associated with the use of different scales across studies.^{25, 37}

Our reliability assessment provides important information regarding the psychometric performance of each measure, each of which has its own strengths and weaknesses that should be considered within the intended purpose of an organization’s assessment.⁹ For example, the MBI-EE scale provides >0.90 reliability to assess a wide range of emotional exhaustion levels, but at the cost of additional items. With less than half the items of the MBI-EE, the PFI-WE scale offers >0.80 reliability to assess a similar range of above-average emotional exhaustion levels as the MBI-EE scale, but has less precision at below average emotional exhaustion levels than the MBI-EE scale. Similarly, with only one item, the MZSIB offers the least response burden but has less precision to assess emotional exhaustion than the MBI-EE and PFI-WE scales (an expected result given the MZSIB was originally designed as a brief screening tool, not an outcome assessment). However, this level of precision may be sufficient, for example, if the intended purpose of assessment is for screening followed by additional assessment, or to predict the risk of occupational outcomes of depression symptoms, distress, or intent to leave one’s practice at a group-level. The PFI-ID scale offers the most reliable assessment of depersonalization across the widest range of depersonalization levels, with one additional item compared to MBI-DP scale. We should note that, to our knowledge, this is the first assessment of the MZSIB’s reliability (as internal consistency reliability is not applicable to single-item scales and test-retest reliability has not yet been investigated for this measure).

All scales showed significant correlations with important, adverse outcomes, including physician depression, distress, and intent to leave. The association between each measure and each adverse outcome underscores the importance of including measures of physician burnout in institutional assessments.

To our knowledge, this is the first study to crosswalk common measures of burnout among US physicians. Strengths of this study include the use of a single-group linking design (permitting the direct comparison of physicians’ actual MBI scores to those predicted by linking to determine the accuracy of our results) and the use and agreement of two different linking methods.

However, this study has several limitations. First, because the MBI-EE and MBI-DP metrics to which the PFI and MZSIB are linked were derived from a prior IRT analysis of 2014 MBI data from the Shanafelt et al. (2015) national physician burnout prevalence study,^{29, 30} the mean of each MBI anchor scale is fixed to the mean EE and DP scores of US physicians in 2014. Therefore, when interpreting a score on a target scale relative to its SDs above/below the mean score on its MBI anchor scale, it should be known that the comparison is relative to the underlying mean MBI score of US physicians in 2014. Despite this limitation, the crosswalks remain valid assuming that the MBI subscales function equivalently across the 2014 US physician sample and US general physician population. Second, although our findings provide support for the invariance of our crosswalks across early and late responder groups and, therefore, provide potential support for the representativeness of our sample, this support relies on the assumption that late responders are an adequate proxy for non-respondents. Nevertheless, several studies have demonstrated no significant differences in burnout estimates across respondent and non-respondent groups, despite the low response rates that are common in physician survey research.^{1, 38} Third, we chose to highlight the closest cut-point equivalencies across linked measures using commonly used cut-points on each metric. Because raw scores on each target metric are linked to continuous scores on each anchor metric, the closest cut-point equivalencies across metrics are an approximation. Although we identified the closest cut-point equivalency for scores ≥27 and ≥10 on the respective MBI-EE and MBI-DP scales, investigators can use crosswalks published in Brady et al. ²⁹ in conjunction with the crosswalks presented herein to identify cut-point equivalencies on the PFI and MZSIB at other MBI raw score cut-points.

It is important to note that the crosswalk tables rendered with this research allow reasonable approximate translation of aggregate, group-level scores from one measure of burnout to another. They are not intended to translate individual-level respondent scores from one measure of burnout to another, and attempting to do so would produce unreliable results. In addition, it is important to note that crosswalking scores from one measure of burnout to another is only appropriate across measures that assess the same construct. A measure of emotional exhaustion (such as the MZSIB) cannot be crosswalked to derive an equivalent score on a metric of depersonalization.

CONCLUSIONS

As US healthcare organizations are increasingly measuring physician burnout as an indicator of health system performance, there is a need to compare burnout outcomes across different assessments. Our findings allow healthcare organizations using the PFI or MZSIB to compare group-level scores to historical, regional, or national MBI scores (and vice-versa).

Declarations

Conflict of Interest

Dr. Shanafelt is co-inventor of the Well-being Index instruments and the Participatory Management Leadership Index. Mayo Clinic holds the copyright for these instruments and has licensed them for use outside of Mayo Clinic. Dr. Shanafelt receives a portion of any royalties paid to Mayo Clinic. Dr. Linzer is supported in part through grants to Hennepin Healthcare from the AMA, Institute for Healthcare Improvement, the American Board of Internal Medicine Foundation, and the American College of Physicians for research and training in burnout prevention. All other authors report no conflicts of interest.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Unsere Produktempfehlungen

e.Med Interdisziplinär

Kombi-Abonnement

Jetzt e.Med zum Sonderpreis bestellen!

Für Ihren Erfolg in Klinik und Praxis - Die beste Hilfe in Ihrem Arbeitsalltag

Mit e.Med Interdisziplinär erhalten Sie Zugang zu allen CME-Fortbildungen und Fachzeitschriften auf SpringerMedizin.de.

Jetzt bestellen und 100 € sparen!

Jetzt testen ¹

e.Med Innere Medizin

Kombi-Abonnement

Mit e.Med Innere Medizin erhalten Sie Zugang zu CME-Fortbildungen des Fachgebietes Innere Medizin, den Premium-Inhalten der internistischen Fachzeitschriften, inklusive einer gedruckten internistischen Zeitschrift Ihrer Wahl.

Jetzt bestellen und 100 € sparen!

Jetzt testen ²

e.Med Allgemeinmedizin

Kombi-Abonnement

Mit e.Med Allgemeinmedizin erhalten Sie Zugang zu allen CME-Fortbildungen und Premium-Inhalten der allgemeinmedizinischen Zeitschriften, inklusive einer gedruckten Allgemeinmedizin-Zeitschrift Ihrer Wahl.

Jetzt bestellen und 100 € sparen!

Jetzt testen ³

Supplementary Information

ESM 1 (DOCX 80.7 kb)

Shanafelt TD, West CP, Sinsky C, et al. Changes in burnout and satisfaction with work-life integration in physicians and the general US working population between 2011 and 2017. Mayo Clinic Proceedings 2019;94(9):1681-1694.

Maslach C, Jackson SE. The measurement of experienced burnout. J Occup Behav 1981;2(2):99-113.CrossRef

Shanafelt TD, Boone S, Tan L, et al. Burnout and satisfaction with work-life balance among US physicians relative to the general US population. Arch Intern Med 2012;172(18):1377-1385.CrossRef

Dyrbye LN, T.D. Shanafelt, C.A. Sinsky, P.F. Cipriano, J. Bhatt, A. Ommaya, C.P. West, and D. Meyers. Burnout among health care professionals: A call to explore and address this underrecognized threat to safe, high-quality care. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2019; https://doi.org/10.31478/201707b

Tawfik DS, Scheid A, Profit J, et al. Evidence Relating Health Care Provider Burnout and Quality of Care: A Systematic Review and Meta-analysis. Ann Intern Med 2019;171(8):555-567.CrossRef

Han S, Shanafelt TD, Sinsky CA, et al. Estimating the Attributable Cost of Physician Burnout in the United States. Ann Intern Med 2019;170(11):784-790.CrossRef

Jha A, Ilif A, Chaoui A. A crisis in health care: a call to action on physician burnout. In: Massachusetts Medical Society. Available at: http://www.massmed.org/Publications/Research,-Studies,-and-Reports/A-Crisis-in-Health-Care--A-Call-to-Action-on--Physician-Burnout/#.X99sCJNKh_k. Accessed June 19, 2020.

Dzau VJ, Kirch DG, Nasca TJ. To Care Is Human — Collectively Confronting the Clinician-Burnout Crisis. N Engl J Med 2018;378(4):312-314.CrossRef

Brady KJS, Kazis LE, Sheldrick RC, Ni P, Trockel MT. Selecting Physician Well-Being Measures to Assess Health System Performance and Screen for Distress: Conceptual and Methodological Considerations. Curr Probl Pediatr Adolesc Health Care 2019;49(12):100662.CrossRef

10.

Dyrbye LN, Meyers D, Ripp J, Dalal N, Bird SB, Sen S. A Pragmatic Approach for Organizations to Measure Health Care Professional Well-Being. NAM Perspectives. Discussion Paper, National Academy of Medicine, Washington, DC. 2018; https://doi.org/10.31478/201810b

11.

Shanafelt TD, Noseworthy JH. Executive leadership and physician well-being: nine organizational strategies to promote engagement and reduce burnout. Mayo Clin Proc 2017;92(1):129-146.CrossRef

12.

National Academy of Medicine. Measuring Burnout. Available at: https://nam.edu/clinicianwellbeing/solutions/measuring-burnout/. Accessed June 19, 2020.

13.

National Academy of Medicine. Validated instruments to assess work-related dimensions of well-being. Available at: https://nam.edu/valid-reliable-survey-instruments-measure-burnout-well-work-related-dimensions/. Accessed June 19, 2020.

14.

National Academies of Sciences, Engineering, and Medicine. Taking Action Against Clinician Burnout: A Systems Approach to Professional Well-Being. The National Academies Press; 2019.

15.

Maslach C, Jackson SE, Leiter MP. Maslach Burnout Inventory Manual. 4th ed: Mind Garden, Inc.; 2017. https://www.mindgarden.com/maslach-burnout-inventory-mbi/686-mbi-manualprint.html

16.

Trockel M, Bohman B, Lesure E, et al. A Brief Instrument to Assess Both Burnout and Professional Fulfillment in Physicians: Reliability and Validity, Including Correlation with Self-Reported Medical Errors, in a Sample of Resident and Practicing Physicians. Acad Psychiatry 2017;42(1):11-24.CrossRef

17.

Konrad TR, Williams ES, Linzer M, et al. Measuring physician job satisfaction in a changing workplace and a challenging environment. Med Care 1999;37(11):1174-1182.CrossRef

18.

Kolen MJ, Brennan RL. Test equating, scaling, and linking: Methods and practices. Springer Science & Business Media; 2014.

19.

Maslach C, Jackson S, Leiter M. Maslach Burnout Inventory Manual. 3rd ed. Consulting Psychologists Press; 1996.

20.

Williams ES, Konrad TR, Linzer M, et al. Refining the measurement of physician job satisfaction: results from the Physician Worklife Survey. SGIM Career Satisfaction Study Group. Society of General Internal Medicine. Med Care 1999;37(11):1140-1154.CrossRef

21.

Pilkonis PA, Choi SW, Reise SP, Stover AM, Riley WT, Cella D. Item Banks for Measuring Emotional Distress From the Patient-Reported Outcomes Measurement Information System (PROMIS®): Depression, Anxiety, and Anger. Assessment. 2011;18(3):263-283.CrossRef

22.

Dyrbye LN, Szydlo DW, Downing SM, Sloan JA, Shanafelt TD. Development and preliminary psychometric properties of a well-being index for medical students. BMC Med Educ 2010;10(1):8.CrossRef

23.

Dyrbye LN, Schwartz A, Downing SM, Szydlo DW, Sloan JA, Shanafelt TD. Efficacy of a brief screening tool to identify medical students in distress. Acad Med 2011;86(7):907-914.CrossRef

24.

Dyrbye LN, Satele D, Sloan J, Shanafelt TD. Utility of a brief screening tool to identify physicians in distress. J Gen Intern Med 2013;28(3):421-427.CrossRef

25.

Choi SW, Schalet B, Cook KF, Cella D. Establishing a Common Metric for Depressive Symptoms: Linking the BDI-II, CES-D, and PHQ-9 to PROMIS Depression. Psychol Assess 2014;26(2):513-527.CrossRef

26.

Choi SW, Prodrabsky T, McKinney N, Schalet BD, Cook KF, Cella D. PROSetta Stone Methodology. Available at: http://www.prosettastone.org/Methodology/Documents/PROSetta%20Methodology%20Report.pdf. Accessed June 19, 2020.

27.

Rotenstein LS, Torre M, Ramos MA, et al. Prevalence of burnout among physicians: A systematic review. Jama. 2018;320(11):1131-1150.CrossRef

28.

Dorans NJ, Holland PW. Population invariance and the equatability of tests: Basic theory and the linear case. J Educ Meas 2000;37(4):281-306.CrossRef

29.

Brady KJS NP, Sheldrick RC, Trockel MT, Shanafelt T, Rowe SG, Schneider JI, Kazis LE. Describing the Emotional Exhaustion, Depersonalization, and Low Personal Accomplishment Symptoms Associated with Maslach Burnout Inventory Subscale Scores in US Physicians. J Patient Rep Outcomes 2020;4(1):1-14.CrossRef

30.

Shanafelt TD, Hasan O, Dyrbye LN, et al. Changes in Burnout and Satisfaction With Work-Life Balance in Physicians and the General US Working Population Between 2011 and 2014. Mayo Clin Proc 2015;90(12):1600-1613.CrossRef

31.

HealthMeasures. PROMIS Instrument Development and Scientific Standards Version 2.0. 2013. Available at: https://www.healthmeasures.net/images/PROMIS/PROMISStandards_Vers2.0_Final.pdf. Accessed June 19, 2020.

32.

R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. 2018.

33.

Revelle W. psych: Procedures for Personality and Psychological Research. https://CRAN.R-project.org/package=psych. 2018.

34.

Albano AD. equate: An R package for observed-score linking and equating. J Stat Softw 2016;74(8):1-36.CrossRef

35.

Chalmers P. mirt: A Multidimensional Item Response Theory Package for the R Environment. J Stat Softw 2012;48(6):1-29.CrossRef

36.

Rosseel Y. lavaan: An R Package for Structural Equation Modeling. J Stat Softw 2012;48(2):1-36.CrossRef

37.

Lai J-S, Cella D, Yanez B, Stone A. Linking fatigue measures on a common reporting metric. J Pain Symptom Manag 2014;48(4):639-648.CrossRef

38.

Simonetti JA, Clinton WL, Taylor L, et al. The impact of survey nonresponse on estimates of healthcare employee burnout. Healthcare. 2020;8(3):100451.CrossRef

Titel: Establishing Crosswalks Between Common Measures of Burnout in US Physicians
verfasst von: Keri J. S. Brady, PhD, MPH
Pengsheng Ni, MD, MPH
Lindsey Carlasare, MBA
Tait D. Shanafelt, MD
Christine A. Sinsky, MD
Mark Linzer, MD
Martin Stillman, MD, JD
Mickey T. Trockel, MD, PhD
Publikationsdatum: 31.03.2021
Verlag: Springer International Publishing
Erschienen in: Journal of General Internal Medicine / Ausgabe 4/2022
Print ISSN: 0884-8734
Elektronische ISSN: 1525-1497
DOI: https://doi.org/10.1007/s11606-021-06661-4

Leitlinien kompakt für die Innere Medizin

Mit medbee Pocketcards sicher entscheiden.

^{Seit 2022 gehört die medbee GmbH zum Springer Medizin Verlag}

Kostenlos registrieren

Update Innere Medizin

Bestellen Sie unseren Fach-Newsletter und bleiben Sie gut informiert.

Newsletter bestellen

Live-Webinar "Urologie und Sexualmedizin in der Praxis"

Springer Medizin

Abstract

Background

Objective

Design

Setting

Participants

Main Measures

Key Results

Conclusions

Supplementary Information

Publisher’s Note

INTRODUCTION

METHODS

Design and Participants

Measures

Linking Analyses

RESULTS

Sample

Assumption Assessment

Crosswalks and Closest Cut-Point Equivalents

Reliability

Associations with Adverse Outcomes

DISCUSSION

CONCLUSIONS

Declarations

Conflict of Interest

Publisher’s Note

Unsere Produktempfehlungen

e.Med Interdisziplinär

e.Med Innere Medizin

e.Med Allgemeinmedizin

Supplementary Information

Weitere Artikel der Ausgabe 4/2022

Reply to Letter to the Editor: Re: The Role of Gender in Careers in Medicine: a Systematic Review and Thematic Synthesis of Qualitative Literature

Prevalence and Predictors of Driving Under the Influence of Alcohol Among Older Adults in the United States, 2015–2019

Association of Uninsurance and VA Coverage with the Uptake and Equity of COVID-19 Vaccination: January–March 2021

Addressing COVID-19 Vaccine Acceptance Within a Large Healthcare System: a Population Health Model

On Gender Bias and the Imposter Syndrome

Association of Vitamin D Status and COVID-19-Related Hospitalization and Mortality

Leitlinien kompakt für die Innere Medizin

Neu im Fachgebiet Innere Medizin

Erhebliches Risiko für Kehlkopfkrebs bei mäßiger Dysplasie

Nach Herzinfarkt mit Typ-1-Diabetes schlechtere Karten als mit Typ 2?

15% bedauern gewählte Blasenkrebs-Therapie

Costims – das nächste heiße Ding in der Krebstherapie?

Update Innere Medizin