Type 2 diabetes (T2D) is associated with significant healthcare resource utilization, especially among patients with sub-optimal management, treatment-related adverse events including hypoglycemia, and comorbid health conditions. Value-based initiatives offer a unique solution to this problem, but additional evidence is needed to design and support these initiatives. |
A Bayesian machine learning platform, Reverse Engineering Forward Simulation (REFS™), was applied to administrative claims data to identify predictors of key clinical and economic outcomes in T2D. |
Machine learning models such as REFS have the potential to guide the provision of data-driven, individualized care with these results establishing the importance of ensuring that patients with T2D are appropriately treated with evidence-based interventions to ensure more favorable outcomes as well as control of healthcare resource utilization and costs. |
Introduction
Methods
Data Source
Sample Selection
Outcomes
Statistical Analyses
- Demographics age, gender, race, insurance type, product type, region, and low-income subsidy status.
- Pharmaceutical utilization National Drug Code (NDC) product codes from pharmacy claims aggregated via Generic Product Identifier (GPI) codes [10].
- Procedures ICD-9, ICD-10, CPT, and Healthcare Common Procedure Coding System (HCPCS) codes from procedures in medical claims aggregated via Berenson-Eggers Type of Service (BETOS) codes [11].
- Laboratory Logical Observation Identifiers Names and Codes (LOINC) codes from laboratory data aggregated via the LOINC hierarchies.
- Healthcare resource utilization Acute inpatient admissions, inpatient length of stay, outpatient visits, office visits, visits with an endocrinologist, and emergency department visits, total medical costs, total pharmacy costs (including copay of the index prescription and total out-of-pocket costs), outpatient costs, and emergency department (ED) costs. Visit counts were categorized into 0, 1, 2, 3 or more visits, while costs were discretized into quartiles to account for heavily skewed distributions.
Compliance with Ethics Guidelines
Results
Population Characteristics
Variable | Overall | Hypoglycemic event | Persistent to antidiabetic class | T2D-related inpatient admission | High T2D-related medical costb | HbA1c target attainment | Change from baseline HbA1c |
---|---|---|---|---|---|---|---|
Population size | 453,487 | 453,487 | 453,487 | 453,487 | 453,487 | 221,473 | 36,263 |
With outcome | – | 16,227 (3.6%) | 82,689 (18.2%) | 37,884 (8.4%) | 113,366 (25.0%) | 161,230 (72.8%) | 10,281 (28.4%) |
Age group | |||||||
18–34 | 4891 (1.1%) | 139 (0.9%) | 1693 (2.0%) | 259 (0.7%) | 903 (0.8%) | 1163 (0.7%) | 162 (1.6%) |
35–44 | 20,892 (4.6%) | 438 (2.7%) | 6407 (7.7%) | 873 (2.3%) | 3684 (3.2%) | 4459 (2.8%) | 793 (7.7%) |
45–54 | 58,369 (12.9%) | 1390 (8.6%) | 15,055 (18.2%) | 3128 (8.3%) | 11,774 (10.4%) | 13,336 (8.3%) | 1979 (19.2%) |
55–64 | 96,878 (21.4%) | 2800 (17.3%) | 19,840 (24.0%) | 7065 (18.6%) | 22,506 (19.9%) | 25,166 (15.6%) | 2492 (24.2%) |
65–74 | 155,837 (34.4%) | 5794 (35.7%) | 21,011 (25.4%) | 13,094 (34.6%) | 39,772 (35.1%) | 65,467 (40.6%) | 3028 (29.5%) |
≥ 75 | 116,620 (25.7%) | 5666 (34.9%) | 18,683 (22.6%) | 13,465 (35.5%) | 34,727 (30.6%) | 51,639 (32.0%) | 1827 (17.8%) |
Age, mean (SD) | 66.1 (12.4) | 69.2 (11.9) | 63.1 (13.7) | 69.4 (11.7) | 67.9 (12) | 69.0 (11.4) | 62.4 (12.9) |
Gender | |||||||
Female | 224,498 (49.5%) | 8829 (54.4%) | 41,563 (50.3%) | 19,563 (51.6%) | 59,967 (52.9%) | 83,969 (52.1%) | 4479 (43.6%) |
Male | 228,989 (50.5%) | 7398 (45.6%) | 41,126 (49.7%) | 18,321 (48.4%) | 53,399 (47.1%) | 77,261 (47.9%) | 5802 (56.4%) |
Race | |||||||
Asian | 21,728 (4.8%) | 685 (4.2%) | 4198 (5.1%) | 1015 (2.7%) | 3749 (3.3%) | 9860 (6.1%) | 513 (5.0%) |
Black | 48,148 (10.6%) | 2152 (13.3%) | 10,842 (13.1%) | 4693 (12.4%) | 14,108 (12.4%) | 15,358 (9.5%) | 1075 (10.5%) |
Hispanic | 76,709 (16.9%) | 3106 (19.1%) | 15,967 (19.3%) | 4466 (11.8%) | 18,057 (15.9%) | 31,245 (19.4%) | 2402 (23.4%) |
Unknown | 39,697 (8.8%) | 1413 (8.7%) | 4720 (5.7%) | 3214 (8.5%) | 9487 (8.4%) | 16,432 (10.2%) | 859 (8.4%) |
White | 267,205 (58.9%) | 8871 (54.7%) | 46,962 (56.8%) | 24,496 (64.7%) | 67,965 (60.0%) | 88,335 (54.8%) | 5432 (52.8%) |
Region | |||||||
Midwest | 97,363 (21.5%) | 2436 (15.0%) | 15,985 (19.3%) | 10,189 (26.9%) | 26,710 (23.6%) | 22,874 (14.2%) | 1217 (11.8%) |
Northeast | 50,876 (11.2%) | 1684 (10.4%) | 8411 (10.2%) | 5031 (13.3%) | 13,891 (12.3%) | 17,818 (11.1%) | 888 (8.6%) |
South | 192,002 (42.3%) | 8048 (49.6%) | 37,931 (45.9%) | 16,361 (43.2%) | 49,914 (44.0%) | 69,635 (43.2%) | 5116 (49.8%) |
Unknown | 2374 (0.5%) | 107 (0.7%) | 487 (0.6%) | 137 (0.4%) | 481 (0.4%) | 1046 (0.6%) | 61 (0.6%) |
West | 110,872 (24.4%) | 3952 (24.4%) | 19,875 (24.0%) | 6166 (16.3%) | 22,370 (19.7%) | 49,857 (30.9%) | 2999 (29.2%) |
Insurance type | |||||||
Commercial | 172,317 (38.0%) | 3837 (23.6%) | 42,619 (51.5%) | 9953 (26.3%) | 33,567 (29.6%) | 40,403 (25.1%) | 4770 (46.4%) |
Medicare | 282,054 (62.2%) | 12,423 (76.6%) | 40,250 (48.7%) | 27,978 (73.9%) | 79,992 (70.6%) | 121,198 (75.2%) | 5524 (53.7%) |
Product type | |||||||
EPO | 19,291 (4.3%) | 453 (2.8%) | 5289 (6.4%) | 1064 (2.8%) | 3856 (3.4%) | 5376 (3.3%) | 636 (6.2%) a |
HMO | 131,125 (28.9%) | 6030 (37.2%) | 22,381 (27.1%) | 8173 (21.6%) | 30,937 (27.3%) | 63,956 (39.7%) | 3592 (34.9%) |
IND | 6345 (1.4%) | 160 (1.0%) | 974 (1.2%) | 835 (2.2%) | 1745 (1.5%) | 703 (0.4%) | 16 (0.2%) a |
OTH | 150,765 (33.2%) | 5817 (35.8%) | 19,891 (24.1%) | 18,239 (48.1%) | 46,418 (40.9%) | 55,390 (34.4%) | 2366 (23%) |
POS | 127,665 (28.2%) | 2702 (16.7%) | 31,638 (38.3%) | 7108 (18.8%) | 25,069 (22.1%) | 28,583 (17.7%) | 3428 (33.3%) |
PPO | 28,794 (6.3%) | 1410 (8.7%) | 4636 (5.6%) | 3362 (8.9%) | 8001 (7.1%) | 10,480 (6.5%) | 462 (4.5%) a |
Low income subsidy | |||||||
Yes | 74,392 (16.4%) | 4150 (25.6%) | 11,086 (13.4%) | 9102 (24%) | 25,826 (22.8%) | 27,714 (17.2%) | 1563 (15.2%) |
CCI, mean (SD) | 2.7 (2.2) | 3.9 (2.6) | 2.5 (2.2) | 3.7 (2.6) | 3.4 (2.4) | 3.0 (2.3) | 2.8 (2.2) |
Indexing antidiabetic class | |||||||
Amylin analogue | 10 (0.0%) | 1 (0.0%) | 1 (0.0%) | 2 (0.0%) | 4 (0.0%) | 2 (0.0%) | 1 (0.0%) |
Alpha-glucosidase inhibitor | 563 (0.1%) | 57 (0.4%) | 144 (0.2%) | 36 (0.1%) | 153 (0.1%) | 252 (0.2%) | 8 (0.1%) |
Biguanide (metformin) | 241,678 (53.3%) | 4942 (30.5%) | 48,744 (58.9%) | 17,047 (45.0%) | 52,536 (46.3%) | 95,501 (59.2%) | 4140 (40.3%) |
Antidiabetic combination | 20,876 (4.6%) | 571 (3.5%) | 4588 (5.5%) | 1331 (3.5%) | 4573 (4.0%) | 6851 (4.2%) | 716 (7.0%) |
DPP4 inhibitor | 19,171 (4.2%) | 668 (4.1%) | 3938 (4.8%) | 1951 (5.1%) | 5565 (4.9%) | 6859 (4.3%) | 438 (4.3%) |
Dopamine receptor agonist | 46 (0.0%) | 4 (0.0%) | 16 (0.0%) | 3 (0.0%) | 13 (0.0%) | 15 (0.0%) | 1 (0.0%) |
GLP-1 receptor agonist | 10,399 (2.3%) | 332 (2.0%) | 939 (1.1%) | 707 (1.9%) | 2710 (2.4%) | 2945 (1.8%) | 346 (3.4%) |
Insulin-sensitizing agent | 9513 (2.1%) | 397 (2.4%) | 1568 (1.9%) | 742 (2.0%) | 2281 (2.0%) | 3924 (2.4%) | 318 (3.1%) |
Insulin | 47,722 (10.5%) | 4243 (26.1%) | 4179 (5.1%) | 6563 (17.3%) | 18,091 (16.0%) | 10,287 (6.4%) | 1493 (14.5%) |
Meglitinide analogue | 1495 (0.3%) | 81 (0.5%) | 60 (0.1%) | 171 (0.5%) | 490 (0.4%) | 579 (0.4%) | 23 (0.2%) |
SGLT2 inhibitor | 7376 (1.6%) | 197 (1.2%) | 1894 (2.3%) | 397 (1.0%) | 1579 (1.4%) | 1878 (1.2%) | 300 (2.9%) |
Sulfonylurea | 94,591 (20.9%) | 4719 (29.1%) | 16,618 (20.1%) | 8928 (23.6%) | 25,353 (22.4%) | 32,129 (19.9%) | 2496 (24.3%) |
Outcome Distributions
Covariate Set
Model Performance
Top Predictors
Interaction Effects
Concentration of Risk and Outcome Interactions
Post-index variablea | Highest risk stratum (20th ventile) | 19th ventile | 11–18th ventiles | Lowest risk stratum (1–10th ventiles) | Overall | ||||
---|---|---|---|---|---|---|---|---|---|
Nb | %c | N | % | N | % | N | % | N | |
Number of patients | 22,672 | 5.0 | 22,672 | 5.0 | 181,376 | 40.0 | 226,719 | 50.0 | 453,439 |
Study outcomes | |||||||||
Hypoglycemic event | 3348 | 25.9 | 1797 | 13.9 | 5745 | 44.5 | 2014 | 15.6 | 12,904 |
Antidiabetic persistence | 2532 | 3.8 | 2590 | 3.9 | 23,250 | 35.1 | 37,790 | 57.1 | 66,162 |
HbA1c target attainment | 8082 | 6.3 | 7573 | 5.9 | 56,345 | 43.7 | 56,982 | 44.2 | 128,982 |
HbA1c change from baseline | 1175 | 7.1 | 1177 | 7.2 | 7498 | 45.6 | 6602 | 40.1 | 16,452 |
T2D-related inpatient admission | 2965 | 9.8 | 2501 | 8.3 | 14,328 | 47.3 | 10,469 | 34.6 | 30,263 |
T2D-related total medical cost | $4114 | 8.7 | $3509 | 7.4 | $2691 | 45.6 | $1801 | 38.2 | $2358 |
Visits: all-cause | |||||||||
Inpatient admissions | 5608 | 10.2 | 4590 | 8.4 | 26,082 | 47.5 | 18,644 | 33.9 | 54,924 |
ER visits | 7243 | 7.4 | 6451 | 6.6 | 43,955 | 44.9 | 40,313 | 41.2 | 97,962 |
Outpatient visits | 22,420 | 5.1 | 22,376 | 5.1 | 177,857 | 40.3 | 218,783 | 49.6 | 441,436 |
Visits: hypoglycemia-related | |||||||||
Inpatient admissions | 501 | 26.7 | 254 | 13.5 | 886 | 47.2 | 236 | 12.6 | 1877 |
ER visits | 570 | 24.7 | 323 | 14.0 | 1144 | 49.6 | 271 | 11.7 | 2308 |
Outpatient visits | 2164 | 27.4 | 1192 | 15.1 | 3381 | 42.9 | 1153 | 14.6 | 7890 |
Costs | |||||||||
Healthcare | $36,567 | 10.7 | $30,090 | 8.8 | $19,406 | 45.4 | $12,038 | 35.2 | $17,114 |
Medical | $28,652 | 11.3 | $23,081 | 9.1 | $14,173 | 44.8 | $8825 | 34.8 | $12,669 |
Pharmacy | $7915 | 8.9 | $7009 | 7.9 | $5233 | 47.1 | $3213 | 36.1 | $4446 |
Hypoglycemia-related medical | $395 | 30.1 | $167 | 12.7 | $75 | 45.6 | $15 | 11.6 | $66 |
Discussion
Previous Value-Based Analyses
Utility of “Top Predictors” Selected by Model
Model Accuracy and Ensembling
Diverse Outcomes for Value-Based Contracting
Limitations
- Access to healthcare—proximity to medical facilities, cultural obstacles related to treatment, educational and socioeconomic status
- Bias on the part of the physician—past experiences with a particular treatment, history with a particular patient
- Patients’ underlying disease severity (i.e., confounding by indication)
- Non-random treatment assignment for novel therapies (i.e., channeling bias) [21]
- Healthy adherer bias [22].