Introduction

Osteoporotic fractures are associated with increased subsequent fracture risk, morbidity, and excess mortality, placing a large medical and economic burden on healthcare systems [1]. Subsequent fracture risk is not constant, but fluctuates over time, and is the highest immediately after initial fractures [2]. One-quarter of all subsequent fractures occur within 1 year after a first fracture, and one in two occur within 5 years [3]. Additionally, the majority of deaths following fractures occur within the first year, thereafter the excess mortality gradually declines [4]. Mortality risk in the first 5 years is increased approximately twofold in women and two- to threefold in men [5]. Of note, the absolute impact on mortality is higher for non-hip non-vertebral (NHNV) fractures, since these account for three-quarters of the number of fractures in the population [6].

Despite the availability of various effective pharmacologic interventions and well-established guidelines for fracture prevention, the majority of patients sustaining a fragility fracture do not receive anti-osteoporosis drugs (AOD) [1]. This treatment gap is more pronounced in men than in women, and worsened in recent years [7]. The magnitude of the treatment gap is reported to be highly variable throughout Europe, ranging between 25 and 95% [8]. An Australian study showed that even less than 20% of postmenopausal women with a fracture received specific treatment for osteoporosis in primary care [9]. The low prescription rate of AOD is attributed to inadequate clinical management, including inadequate communication between physicians, disconnected care between healthcare settings, and knowledge gaps by both patients and physicians [10, 11]. These factors represent missed opportunities to actively manage osteoporosis and the prevention of subsequent fractures [12].

In response to this care gap, the International Osteoporosis Foundation (IOF) launched the Capture the Fracture (CTF) Campaign in 2012 to facilitate the implementation of coordinator-based, multi-disciplinary models of care for secondary fracture prevention. Fracture liaison services (FLSs) are nowadays widely advocated as the most appropriate approach to cover all aspects of secondary fracture prevention, including patient identification, education, risk evaluation, treatment, and long-term monitoring. Until November 2020, more than 550 FLSs (registered in CTF) have been implemented, leading to an increasing number of studies investigating the effectiveness of FLS. A previous review [13] including studies reporting the impact of FLS on subsequent fractures up to 2016 concluded that the observed reduction in subsequent fracture risk after the introduction of a FLS should be further quantified in better-designed studies. Especially the follow-up duration and the comparability of groups with or without FLS care were the main methodological issues. As new studies have been conducted recently, and considering the fact that FLS could also have an impact on mortality, it is worthwhile to update the search, summarize results, and critically appraise studies. This systematic review and meta-analysis was therefore designed to summarize the effectiveness of FLS on subsequent fractures and mortality.

Methods

A systematic literature search was undertaken according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guideline to identify eligible studies comparing FLS to no-FLS care with subsequent fractures and/or mortality as outcomes [14].

Literature search

The search was conducted in PubMed and Embase (Ovid) and restricted to English articles published between January 1, 2010, and April 30, 2020. The search strategy was designed to retrieve records addressing the following PICO research question: population (patients with a fracture), intervention (FLS care), comparator (no-FLS care), and outcome (subsequent fractures and/or mortality). Details on the complete search strategy based on the PICO criteria are provided in Supplementary 1.

Study selection

After removing duplicates, titles and abstracts were screened by one reviewer (NL). Then, full-text screening was performed for eligible studies by two independent reviewers (NL, RB), and discrepancies were resolved by consensus with the consultation of additional reviewers (MH and JB). Finally, reference lists and citations of included articles were manually screened for additional relevant studies using Web of Science.

Studies were included if they reported the effectiveness of FLS care in terms of subsequent fractures and/or mortality compared to no-FLS care. Therefore, studies comparing the outcomes of FLS to historical data (post-FLS vs. pre-FLS) or studies comparing the outcomes of a hospital with FLS to a hospital without FLS were included. Studies comparing FLS attenders to non-attenders were excluded. Of note, during study selection, alternative names for FLS included fracture prevention service, orthogeriatric service/care or active osteoporosis care, etc. Non-original articles (e.g., editorials, review) and abstracts were excluded.

Data extraction

Study characteristics were extracted including publication characteristics (author, year of publication), study design (e.g., experimental or (type of) semi-experimental design, prospective or retrospective data collection), population characteristics (country, inclusion and exclusion criteria for FLS and no-FLS populations, number of participants in each group, percentage of female participants, follow-up time, attendance proportion of FLS care), and outcomes (cumulative incidence of subsequent fractures and mortality, and corresponding P-value). Initiation of anti-osteoporosis treatment and bone mineral density (BMD) measurement were extracted as secondary outcomes when reported within the selected studies.

Study quality

Currently available quality assessment/risk of bias tools (such as ROBINS-I, Newcastle–Ottawa scale, and NIH tool) [15,16,17] did not address all potential methodological issues which we pre-identified. Therefore, concepts and items of the available checklists were combined and adjusted forming our quality assessment checklist, which better aligned to our needs. Overall, ten criteria were identified covering the traditional four domains (selection of participants and completeness of follow-up, exposure to post-fracture care, outcome, and statistical accuracy and analyses) for both intervention (FLS) and control (no-FLS) group. Supplementary 2 shows the checklist and indicates the source of the criteria.

Specifically, patients’ selection was considered a key methodological issue in the study of evaluating the outcomes of FLS. All patients with a fracture should be included in the analysis regardless of whether they attended FLS clinic. Failing this principle could result in spurious associations due to large prognostic dissimilarity between groups. Besides, osteoporotic fracture is more prevalent in the geriatric population. In such population, competition between risk of subsequent fracture and risk of death is particularly high, which would hinder or modify the chance that the event of interest (subsequent fractures) occurs.

Each of the final ten criteria was scored using “Yes” (fulfilled the requirement), “No” (not fulfilled the requirement), “Part” (partially fulfilled the requirement), or “Not reported.” To estimate a total quality score, we assigned a score of 1 for “Yes,” 0.5 for “Part,” and 0 for “No.” Two researchers (NL and MO) independently evaluated the eligible studies; discrepancies were resolved by consensus through discussion.

Meta-analysis

A meta-analysis was performed to synthesize the results of included studies. Pooled results of subsequent fractures and mortality between the FLS and the no-FLS group were reported as odds ratio (OR) with associated 95% confidence interval (CI). Of note, in the meta-analysis, crude events data (how many patients had subsequent fracture/mortality) rather than cumulative incidence of subsequent fracture/mortality were entered, and the OR were calculated based on these data. Statistical heterogeneity was assessed using the I2 test. A fixed-effects model was used in case of small heterogeneity (I2<50%), and a random-effects model was applied if the analysis showed to have high heterogeneity (I2≥50%) [18]. In addition, subgroup analysis by study design (pre-post-FLS vs. hospitals with or without FLS care) and by follow-up time (follow-up ≤ 1 year vs. 1 year < follow-up ≤ 2 years vs. follow-up > 2 years) were conducted.

Of note, studies that did not include all patients with a fracture in both FLS and no-FLS cohorts (only inclusion of FLS attenders, or patient selection by consent procedure for both groups) were regarded as very high selection bias and were excluded from the main meta-analysis. However, to investigate the impact of studies with selection bias, these studies (patients’ selection by consent) were additionally included into the model in the sensitivity analysis.

Given the number of studies included in the main meta-analysis for both subsequent fractures and mortality was less than ten, investigation of publication bias through computation of funnel plot is not meaningful.

All statistical analyses were performed in Review Manager (RevMan 5.4; The Nordic Cochrane Centre, Cochrane Collaboration, Copenhagen, Denmark, 2020).

Results

Study selection

From the initial search, 955 records were retrieved (Fig. 1), of which 199 duplicates were removed. Following screening of titles and abstracts, 709 of the remaining 756 studies were excluded since they did not meet inclusion criteria. Upon review of the full text of the remaining 47 studies, 31 articles were excluded for reasons such as non-original articles (n=3), related to FLS organization (n=3), capturing other clinical outcomes (n=5), no control group (n=11), the intervention was not FLS care (n=3), and other reasons (n=6). In total 16 articles were thus eligible for inclusion.

Fig. 1
figure 1

PRISMA flowchart of the study selection process

Characteristics of included studies

The characteristics of the included studies are reported in Table 1. Most studies (n=8) were conducted in Europe (the Netherlands, Sweden, Italy, UK, Ireland, and Spain), followed by Australia (n=3) and Asia (n=3), and the remaining two studies were performed in Canada and the USA. All studies were designed as cohort studies. Data for the FLS cohort were prospectively collected in ten [12, 19,20,21,22, 24,25,26, 28, 30] and retrospectively collected in six studies [1, 23, 27, 29, 31, 32]. The mean and median duration of follow-up for both FLS and no-FLS groups was 1.8 (2) years, varying from 6 months to 4 years. Of note, Inderjeeth et al. [12] presented the outcomes at 3 and 12 months. Considering 3-month follow-up was quite short, we reported the result of 12 months in our study. The sample size of individual studies varied from 47 to 33,152, and all studies included both genders, with 66 to 89% women.

Table 1 Characteristics of included studies that assessed the effect of fracture liaison service care

Twelve studies [1, 19,20,21,22,23,24,25,26,27,28,29] compared the outcomes of FLS to historical data (post-FLS vs. pre-FLS). Two studies [30, 31] compared the outcomes of the FLS with data from a hospital without FLS, and two other studies [12, 32] performed both comparisons (pre-FLS vs. post-FLS, hospital with FLS vs. hospital without FLS).

When stratified by FLS outcome, 14 studies (16 comparisons) [1, 12, 19, 21,22,23,24, 26,27,28,29,30,31,32] reported subsequent fractures, and eleven studies (12 comparisons) [1, 19,20,21,22, 24,25,26, 28, 30, 32] reported mortality. Interestingly, Hawley et al. [23] reported the results from a post-hip care model in 11 hospitals, where each hospital was analyzed separately and acted as its own control in a before-and-after time series design. However, given specific data for both FLS and no-FLS cohorts were not available, this study was therefore excluded from the meta-analysis. In addition, within selected studies, eight studies [1, 12, 20,21,22, 24, 26, 29] reported BMD measurement, and nine studies [1, 12, 20,21,22, 24, 28, 29, 32] reported initiation of anti-osteoporosis treatment as secondary outcome.

When stratified by type of secondary fracture prevention care, 13 studies reported the outcomes of a typical FLS clinic. In these studies, case finding was conducted by an FLS coordinator such as a fracture nurse, secretaries at the emergency department (ED), or a physician champion, followed by BMD assessment, patients’ education, and treatment initiation. The remaining three studies provided care to patients with fractures in the context of orthogeriatric care/service (OG), fracture prevention service (FPS), and active osteoporosis care, which resemble the model of FLS care and were regarded as FLS care [20, 25, 29].

The proportion of patients who attend the FLS defined as the number of patients actually attending the FLS divided by the total number of patients eligible or invited for the FLS (and thus assuming all patients with fractures are invited), which were available in six studies [1, 12, 19, 28, 30, 31] varying from 20 [31] to 86% [28]. The other ten studies did not report the proportion of FLS attenders.

Quality assessment and recommendations

Table 4 presents the results of quality assessment of the included studies. The average score was 5.4 out of 10 (range 3–8.5). Only 50% of studies fulfilled more than half of the criteria, and room for improvement was thus identified for most studies. For patients’ selection, most studies (n=11) made the comparison between all patients in both FLS and no-FLS groups. However, five studies [1, 12, 21, 24, 26] did not include all patients with fractures in the FLS or no-FLS cohort and were regarded with very high selection bias. Specifically, one study [21] compared FLS attenders to all patients with fractures in the no-FLS cohort, and four other studies [1, 12, 24, 26] only included and compared consenting subjects in both FLS and no-FLS groups. In addition, the quality was especially suboptimal for other criteria including “analyses of outcomes account for competing risk of death,” “sample size is described based on power calculation,” “loss to follow-up rate ≤20% in FLS/no-FLS group,” and “at least 50% eligible patients attend FLS.” Recommendations for each criterion were formulated given that they are the most important methodological issues for studies evaluating the outcomes of FLS (Table 4). Except for criteria mentioned in Table 4, the length of follow-up duration was also crucial to capture the effect of FLS care on subsequent fracture and mortality, and future studies should consider a longer duration of follow-up (at least 2 years).

Subsequent fracture

As shown in Table 2, 10 out of 16 comparisons reported that the reduction of subsequent fractures in the FLS group was significant. Excluding five studies with very high selection bias, the mean cumulative incidence of subsequent fractures was 7.7% (SD 3.9%) and 10.9% (SD 6.5%) (median 6.7% and 9.1%) in the FLS versus no-FLS group. Of note, since Wasfie et al. [27] included patients with vertebral fractures (VFs) that were treated with vertebral augmentation, we did not use the data of VFs and only reported the data of other fractures (hip, ribs, and extremities) in our study. The result of meta-analysis on subsequent fractures of nine comparisons (eight studies) is presented in Fig. 2. Overall, FLS care was associated with a significantly lower probability of subsequent fractures (OR: 0.70, 95% CI: 0.52–0.93, P=0.01; heterogeneity: I2=92%).

Table 2 Results from cohort studies reporting cumulative incidence of subsequent fracture
Fig. 2
figure 2

FLS versus no-FLS for subsequent fracture: overall and subgroup analysis by study design. CI, confidence interval; IV, inverse variance; FLS, fracture liaison service. Asterisk indicates comparison between hospitals with and without FLS

The first subgroup analysis by study design (Fig. 2) revealed that the OR of subsequent fractures in post versus pre-FLS group was 0.62 (95% CI: 0.42–0.91, P=0.01; heterogeneity: I2=90%) and the OR for hospitals with versus without FLS care was 0.87 (95% CI: 0.77–0.99, P=0.03; heterogeneity: I2=16%), both indicating a significant lower probability of subsequent fractures with FLS. The second subgroup analysis by follow-up duration (Fig. 3) revealed that in studies with a follow-up > 2 years, a significantly lower probability of subsequent fractures was captured for FLS care (odds ratio: 0.57, 95% CI: 0.34–0.94, P=0.03), while in studies ≤ 2 years, there was no difference in the odds of subsequent fractures.

Fig. 3
figure 3

FLS versus no-FLS for subsequent fracture: subgroup analysis by follow-up duration. CI, confidence interval; IV, inverse variance; FLS, fracture liaison service. Asterisk indicates comparison between hospitals with and without FLS

Sensitivity analyses (Supplementary 3, Figure 1) including studies with very high selection bias also indicated that the FLS care was associated with a lower probability of subsequent fractures (OR: 0.70, 95% CI: 0.54–0.91, P=0.007). Subgroup analyses by study design remained overall similar.

Mortality

As shown in Tables 3,4 out of 12 comparisons indicated a significantly lower cumulative mortality incidence in the FLS group. Excluding five studies with very high selection bias, the mean cumulative incidence of mortality was 15.1% (SD 4.7%) and 22.8% (SD 7.8%) (median 13.8% and 18.4%) in the FLS versus no-FLS group. The result of meta-analysis on mortality of eight comparisons (seven studies) is presented in Fig. 4. Overall, FLS care was not significantly associated with lower mortality (OR: 0.73, 95% CI: 0.49–1.09, P=0.12; heterogeneity: I2=98%).

Table 3 Results from cohort studies reporting cumulative incidence of mortality
Table 4 Quality of included studies assessed using self-designed tool
Fig. 4
figure 4

FLS versus no-FLS for mortality: overall and subgroup analysis by study design. CI, confidence interval; IV, inverse variance; FLS, fracture liaison service. Asterisk indicates comparison between hospitals with and without FLS

The first subgroup analysis by study design (Fig. 4) revealed a lower probability of mortality in the pre- versus post-FLS studies (OR: 0.65, 95% CI: 0.44–0.95, P=0.03; heterogeneity: I2=95%) but not for studies that compared two different hospitals (OR: 1.03, 95% CI: 0.92–1.15, P=0.57; heterogeneity: I2=29%). In the second subgroup analysis by follow-up duration (Fig. 5), we found no significant influence by duration of follow-up.

Fig. 5
figure 5

FLS versus no-FLS for mortality: subgroup analysis by follow-up duration. CI, confidence interval; IV, inverse variance; FLS, fracture liaison service. Asterisk indicates comparison between hospitals with and without FL

Sensitivity analyses (Supplementary 3, Figure 2) including studies with very high selection bias also indicated that the FLS care was not associated with a lower probability of mortality (OR: 0.81, 95% CI: 0.56–1.17, P=0.27). Subgroup analyses showed that the reduced probability of mortality in pre-post studies was not significant (OR: 0.76, 95% CI: 0.52–1.10, P=0.15).

Secondary outcomes

Within selected studies, nine studies (11 comparisons) [1, 12, 20,21,22, 24, 28, 29, 32] reported the initiation of anti-osteoporosis treatment, and 9 out of 11 comparisons showed a significantly higher treatment proportion in post-FLS group. In addition, of the eight studies (9 comparisons) reported BMD measurement [1, 12, 20,21,22, 26, 29], and 8 out of 9 comparisons indicated that FLS was associated with a significant increase of BMD measurement proportion (Supplementary 4).

Discussion

This systematic review and meta-analysis was performed to evaluate and summarize the evidence regarding the effectiveness of the FLS on subsequent fractures and mortality. The pooled overall results indicated that FLS care is associated with a significantly lower probability of subsequent fractures (30%) and mortality although the latter was only found in studies comparing outcomes before and after the introduction of an FLS. Overall, the effects of FLS care on both outcomes were larger in studies with a pre-post design compared to studies addressing hospitals with and without an FLS. Since only two studies were available for the analysis of mortality in hospitals with or without FLS, this may be insufficient to capture a significant impact. It is difficult to conclude that these study designs provide the most valid estimates. Each study design has some potential limitations. For the pre-post study design, changes in patients’ lifestyles or the effectiveness of healthcare could happen over time. For (two) hospitals’ study design, bias could result from differences in content of care and patients groups regarding lifestyle, comorbidities, or other confounders. Of note, high heterogeneity was revealed, especially for pre-post comparisons, even when the random-effects model and subgroup analysis were applied, which may limit the reliability of the analysis and could be recognized as a limitation.

Subgroup analysis by follow-up duration revealed that studies with relatively longer follow-up duration (more than 2 years) were associated with significantly lower probability of subsequent fractures; however, it was not the case for mortality. The potential reason could be that the impact of the FLS intervention on mortality may require a longer follow-up time to capture, while the studies included in the meta-analysis for mortality had a relatively short follow-up time (the longest was 2.2 years). Therefore, future studies should consider a follow-up duration of at least 2 years to adequately capture the effect of FLS care on subsequent fractures and mortality.

For quality appraisal, several methodology issues were identified among the included studies. Firstly, given it was difficult to design randomized controlled trials (RCTs) to evaluate the outcomes of FLS, some patients’ characteristics could be considered potential confounders and available for adjustment through statistical methods (e.g., the multivariable cox regression model). However, due to the retrospective nature of some studies, several potential variables such as family fracture history, smoking/alcohol consumption, and physical activity that might impact the results were unable to be taken into account. Besides, avoiding selection bias during patients’ enrollment is crucial to guarantee the comparability of two cohorts. As indicated by Huntjens et al. [19, 30], patients who were unable or not willing to visit the FLS should be included in the FLS group and in all analyses although the level of health is not known in non-attenders and the effect of FLS care can only be achieved in the attenders. Sensitivity analysis additionally included studies with very high selection bias suggesting that these studies had no impact on overall results of meta-analysis; however, the impact on subgroups (by study design) was revealed. Future studies should avoid selection bias in the process of designing a study. Moreover, we recommend that some other criteria including “sample size is based on power calculation,” “loss to follow-up ≤20%,” and “at least 50% eligible patients attend the FLS” should be taken into account in future studies to provide sufficient statistical power.

Furthermore, when analyzing subsequent fracture risk, competing mortality risk may be an important methodological issue, which may particularly be the case in the geriatric population. Ignoring the competing risk of subsequent fractures and mortality could bias the results of studies on FLS care. Berry et al. [33] performed a simulation study comparing standard survival analysis versus a competing risk approach in a study of second hip fracture, indicating that standard survival analysis overestimated the 5-year risk of second hip fracture by 37% and the 10-year risk by 75% compared with competing risk estimates. Out of the 16 included studies, four reported a competing risk survival regression analysis [23, 24, 31, 32] (Supplementary 5). Three studies [23, 31, 32] used the method of Fine and Gray [34], which deals with the competing risk of mortality by retaining participants in the risk set with a diminishing weight when they die, rather than simply censoring them at the time of death [31]. Similar results were identified in three studies before and after accounting for competing risk of mortality, which allowed to evaluate (partly) the effect of competing risk (of mortality) on subsequent fractures. However, considering especially major fractures are associated with excess mortality [4], competing risk analyses should be taken into account in future studies to accurately estimate cumulative incidence of subsequent fracture.

The findings of this systematic review and meta-analysis is partially consistent with the study of Wu et al. [35], which included studies up to February 2017 suggesting that FLS programs improved outcomes of osteoporosis-related fractures, with significant increases in BMD testing, treatment initiation, and adherence to treatment and reductions in re-fracture incidence. Given more outcomes of interest were investigated and a wider search strategy was applied, more studies (n=159, including studies before CTF) were included in this previous study. By contrast, our study had a specific focus on effectiveness defined as subsequent fractures and mortality, and restricted inclusion of studies comparing FLS to no-FLS. Further, more precise meta-analyses (exclude studies with selection bias) were conducted. Besides, subgroup and sensitivity analysis could also add value to our review. Compared to other previous reviews [13, 36], our study provides a quality assessment, recommendations for patients’ selection, outcome measurement, and statistical analysis provided for future studies, which would guide researchers to design high-quality studies and further help to reduce inter-study heterogeneity, thereby facilitating inter-study comparisons.

This systematic review and meta-analysis has certain limitations. First, we did not conduct a systematic literature search for additional outcomes (initiation of anti-osteoporosis treatment and BMD measurement) since they were not the outcomes of interest in this review. The results of secondary outcomes should thus be interpreted with caution. Second, the quality assessment tool used in our study was generated through combining and modifying available quality assessment tools to fit several methodological issues, and each criterion was treated equally in scoring, the inter-validity of this tool was not verified.

Conclusion

This systematic review and meta-analysis suggests that FLS care is associated with a significantly lower probability of subsequent fractures and mortality although the latter was only found in studies comparing outcomes before and after the introduction of an FLS. The quality assessment revealed that some important methodological issues were unmet in the currently available studies. We therefore provided recommendations to guide researchers to design high-quality studies for evaluation of FLS outcomes in the future.