Background
Non-steroidal anti-inflammatory drugs (NSAID) have been extensively used to treat post-traumatic and surgical musculoskeletal pain because of the excellent anti-inflammatory and analgesic properties of cyclooxygenase (COX) inhibitors [
1‐
5]. However, clinical and biological data suggest that COX inhibition has a negative impact on bone tissue repair. Results from several animal trials have suggested that decreased bone healing, including biomechanical and histomorphometric properties, was associated with NSAID administration after bone fracture [
6‐
12]. While few studies have investigated the effect of NSAIDs on bone healing in humans, their results are contradictory, possibly reflecting methodological limitations [
13‐
16]. Indeed, authors have argued for well-designed, large, multicentric randomized controlled trials with appropriately defined endpoints [
17]. To the best of our knowledge, there are no clinical studies to accurately determine the effect of NSAID on bone healing.
Several animal models including different animal strains, sex, and fracture techniques have been used to test the effect of types and administration durations of NSAID on bone healing outcomes [
10,
18‐
24]. The assessments of bone healing in these studies include integrated multilevel measurements at the organ (e.g., biomechanical to tissue levels using micro-computed tomography (μ-CT) analysis), cellular (e.g., histology, histomorphometric analysis), and gene (e.g., mRNA microarray to explain the associated healing pathway) levels [
25,
26]. Overall, the techniques used to assess bone healing can be divided into four main categories: imaging analyses, biomechanical tests, detection of serologic markers, and clinical examinations [
25]. Among these categories, the biomechanical confirmed by histomorphometric analysis is the best way to determine the success of fracture healing [
27,
28]. These tests are widely used in animal studies at different time points of fracture healing, which are selected according to the biological, physiological, and pathological changes of the fractured bone [
28‐
31].
Although an extensive literature exists on the administration of NSAID on animal bone healing, no systematic review and meta-analysis have yet been conducted on the subject. Such work is important because it can identify the key histomorphometric and biomechanics characteristics during the healing process as well as it provides information on different factors that may affect the healing process after NSAID administration (e.g., NSAID type, animal type and strain, sex of the animal, fracture type, time of assessment). This information may help to design future experimental trials and facilitate knowledge translation. Therefore, we performed a systematic review and meta-analysis of all identified and available animal studies on the effect of NSAID administration after bone fracture on healing outcomes.
Specifically, our objectives were to estimate the extent to which the effect of NSAID administration after bone fracture using animal models: (i) results in less favorable bone-biomechanical and morphometric healing measurements; and (ii) differs by animal type, age and sex, type of NSAID, and length of follow-up.
Methods
The systematic review methodology was specified and documented in advance using SYstematic Review Center for Laboratory animal Experimentation (SYRCLE)’s protocol template for animal studies and registered in the SYRCLE database (Supplementary material Table S
1) [
32,
33]. In our initial protocol, we had proposed to carry out a sensitivity analysis to assess whether the methodological quality of the studies included in the meta-analysis greatly influences the findings of the review. However, most of the studies in our systematic reviewed received low-quality scores; we therefore amended the review protocol to remove this part of the analysis. In addition, we mentioned using Holm-Bonferroni method in our protocol, but we did not apply it in our analysis.
Search strategy and selection of studies
We searched eight databases (Embase, Scopus, MEDLINE, CINAHL, BIOSIS, Cochrane, Central, and DARE) for original articles concerning the effects of NSAID on fractured bone healing in animal models. We considered the period from the starting date of each database to 1 February 2021. The main terms used in the search strategy, developed with the help of the Liaison Librarian for Life Sciences at McGill University, were “anti-inflammatory agents,” “non-steroidal,” “bone,” and “animals” (the complete search strategy is provided in Supplementary material Table S
2). We used a search filter to detect animal studies and exclude human studies. Although we did not impose language restrictions while searching, only English articles were reviewed. Also, we did not include conference abstracts because they do not provide sufficient data to allow for an evaluation. No other restrictions were used. Additionally, we hand searched the reference lists of eligible articles, and screened review articles for relevant references. The first selection was performed based on independent selection by two reviewers (HA, AP) using RAYYAN software (httpp://
rayyan.org, Doha, State of Qatar) [
34]. Any disagreement was solved by discussion or by including the third reviewer (BN). Full-text articles were screened and included when they met the following pre-specified criteria: (1) a controlled NSAID interventional design after bone fracture; and (2) a description of outcome measures related to bone healing (biomechanical characteristics, μ-CT scan measures, radiographic bone assessment, and/or histomorphometric and histological based grading). Papers were excluded if they fulfilled one of the following criteria: not an original article (e.g., review or letter), use of a bone graft or other material, included only outcomes that were not biomechanical or histomorphometric, not NSAID used, not experimental animal studies, and duplicate studies. A list of articles excluded as well as the reasons for exclusion are available from the authors upon request.
The final set of articles was assessed independently by two reviewers who extracted the data using DistillerSR (Evidence Partners, Ottawa, Canada) following a piloted data extraction method. Disagreements were resolved by consensus.
Data retrieved from the articles included characteristics of the animals (e.g., animal model used, weight and sex), test methods (bone fracture [e.g., site, type, number and technique used to perform the fracture, whether there was fixation or not and type of fixation method]), use of medication (e.g., opioid, antibiotics, and NSAID [e.g., type, name, duration of use and route of administration]), and outcome data (time points of outcome measures collection, type of outcome). We grouped the outcomes into main classes: (i) biomechanical (e.g., maximum force or ultimate force load, stiffness, work-to-failure), (ii) μ-CT assessment of healing (volume and density), and (iii) histomorphometric characterization of fracture callus (bone, cartilage, and mineralized tissue).
Raw data or group averages (mean, median), standard deviation (SD), standard error (SE), or ranges and number of animals per group (n) were extracted for all continuous outcome measures. We contacted the authors to obtain original data if results were presented graphically or were incomplete. If data could not be retrieved, the study was excluded from further analysis.
Risk of bias assessment
Two blinded reviewers (HA, AM) assessed the internal validity of the included studies using SYRCLE’s risk of bias tool (Table S
3) [
35]. The tool, an adaptation of the Cochrane risk-of-bias tool, considers aspects of bias specific to animal studies. It contains 10 entries related to 6 types of bias (selection, performance, detection, attrition, reporting, and other bias). The score (yes) indicates a low risk of bias, (no) indicates a high risk of bias, and (?) indicates an unclear risk of bias. We were concerned that many items would be rated as having an unclear risk of bias because of the known poor reporting of experimental designs [
36]. To overcome this problem, we added four entries to the tool, pertaining to randomization, blinding, sample size calculation, and time of day of the NSAID administration or time of day at which surgery was performed [
35]. For these items, ‘yes’ and ‘no’ indicates reported and unreported, respectively.
Data analysis and synthesis
The meta-analysis was conducted according to Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines, and using Comprehensive Meta-Analysis software (version 2.2.064, Biostat Inc., Englewood) when five or more independent comparisons from at least three different studies per outcome category were included (provided that outcome measure assessments were sufficiently comparable). We calculated the standardized mean differences (SMD) through Hedges g effect sizes [
37]. The calculation was (SMD = the mean of the NSAIDs group minus the mean of the control vehicle group divided by the pooled standard deviations of the two groups) to account for the differences in the units of measurements. We used Hedges
g effect to calculate the SMD, Hedge’s
g (which is based on Cohen’s
D but includes a correction factor for small sample size bias) [
38,
39]. Again, the calculations need to take into account the direction of effect.
Despite the anticipated heterogeneity, the individual effect sizes were subsequently pooled to obtain an overall SMD and 95% confidence interval (95%CI) [
38,
40]. We used a random-effects model [
40], which takes into account the precision of individual studies and the variation among them, and weights each study accordingly. If multiple independent experimental groups were compared to the same control group within the meta-analysis, the number of animals in the control group was corrected by dividing it by the number of experimental groups.
Rather than computing a single summary measure, an important objective of meta-analysis is to explore the sources of heterogeneity [
41], a measure of the degree of variability in study results, and assess which variables influence the effect of NSAID on bone healing outcomes. We conducted subgroup analyses according to sex and animal species (mice, rats, and rabbits), type of NSAID (non-selective/COX-2 selective), type of fracture, and period of outcome measurement or data collection (early healing less than 21 days, 21 to 48 days, and more than 48 days). We present below the results for subgroups containing at least 10 comparisons. A minimum of three independent comparisons per subgroup was needed to record the subgroup characteristics. The interpretation of differences between subgroups should be used mainly to construct new hypotheses rather than drawing definite conclusions. Heterogeneity for subgroup analyses was assessed using
I2 and the
Q statistic.
The subgroup analysis was done and generated through three methods for comparing the effect size across subgroups. One method was to use a
Z-test to compare the effect sizes directly. Another method was to use a
Q-test to partition the variance and test the between-subgroups portion of the variance. A third one was to use a
Q-test to assess the dispersion of the summary effect about the combined effect. All the methods used by the software assess the variance across subgroups effects relative to the variance within subgroups. The row tables for each subgroup analysis generated by the comprehensive meta-analysis software were added in the supplementary information file for detailed results based on the previous described methods (Tables S
5 to S27). We assessed publication bias for different healing outcomes by evaluating the possible asymmetry using funnel plots Trim and fill and “Egger’ regression test” if the analysis contained at least 20 comparisons” [
38]. All statistical analysis was conducted using Comprehensive Meta-Analysis Version 3 Software (NJ, USA) [
42]. A
P value less than 0.05 was considered to be statistically significant.
Discussion
This unique systematic review and meta-analysis was designed to answer a specific research question regarding the effect of different types of NSAID on bone fracture healing in animal models (in vivo studies). Three important outcomes commonly used to assess bone healing were analyzed: biomechanical properties (maximum force to break, stiffness, and work to failure), micro CT, and histomorphometric measurements.
NSAID administration had a negative effect on the biomechanical properties in different animal models of the included studies [
10,
12,
18,
19,
48,
51‐
59]. However, the results for histomorphometric assessments did not show a difference (Table
3).
Depending on the type of NSAID, they are known to inhibit both COX isoforms. Both non-selective NSAID and COX-2-selective drugs decrease prostaglandin production, which plays an essential regulatory role in all phases of bone healing, especially the inflammatory phase [
60‐
63]. Thus, it is reasonable to expect that NSAID administration after bone fracture may delay or impair healing outcomes [
12,
53,
62].
As early as the 1970s, many animal trials strongly emphasized the negative effect of NSAID on bone healing [
7,
9,
12,
19,
24,
45,
53,
59,
64]. Most studies that used rodents and rabbits demonstrated that non-selective and selective COX inhibitors impair the bone healing process [
7,
9‐
12,
19,
24,
45,
53,
59,
64]. Conversely, only a few studies indicated that NSAID has little or no effect on fracture healing outcomes [
20,
56,
57,
65]. However, these studies have significant limitations, because they tested only one time point, did not measure clear bone healing outcomes that include mechanical or histomorphometric analysis, or used very low NSAID doses, and some did not perform a proper statistical analysis [
43,
66,
67]. Recently, a review by Huss et al. in 2019 stated the need for more evidence based and weighting the risk and benefit regarding administration of NSAID for orthopedic treatment and the majority of animal experimental data support short term and perioperative administration of NSAID after bone fracture [
68]. Clinically, there are few retrospective studies and even fewer prospective clinical trials [
13‐
16]. The results from the retrospective studies were contradictory. Some of them have confirmed the negative effect of NSAID on bone formation and healing following hip or femoral neck fracture as well as hip arthroplasty [
2,
13], while others have shown no effect on bone healing. One of the few prospective clinical trials reported beneficial effects of NSAID on bone healing in humans, which was among Colles’ fracture patients who were treated with casting and reduction [
14,
16]. Borgeat et al. 2018, demonstrated in their systematic review that results from the available human trials did not show strong evidence that NSAID administration is related to increase non-union after bone fracture. In addition, they emphasised on the need for further randomized clinical trials to support or refuse this hypothesis [
69]. It is widely accepted that trying to understand the effect of NSAID administration on bone healing is extremely challenging in a clinical setting especially from a methodological perspective. For example, controlling the many confounding factors (e.g., smoking, diabetes, obesity) in a prospective manner requires considerable time and planning [
17]. Therefore, it is imperative to translate the evidence from the available in vivo experimental studies. In fact, animal studies helped in understanding the physiological process of bone healing and can also provide important insights on the effect of NSAID on bone healing.
Some methodological issues that might hamper the interpretation of the experimental animal data and their subsequent translation to the clinical setting should be discussed. First, there was substantial heterogeneity among the various animal studies. We performed subgroup analyses to investigate factors that may modify the effect of NSAID on bone healing outcomes (e.g., animal species, sex, type of bone, age of the animal, type of NSAID, and time at which the outcome was measured. More studies are required, especially in mice, because contrary to other models such as rats, mice show no negative effect of NSAID on the stiffness of the harvested bone compared to the control group. Because the mouse genetic map is similar to that of humans, the mouse may be the best available model to study the effect of NSAID on bone healing in different human genetic conditions [
70]. Overall, the pharmacokinetic variations between species, sexes, and ages should be considered, especially regarding drug absorption. Our meta-analysis showed non-significant negative effect of NSAID administration after bone fracture not only in mice compared to other animals, but also in females compared to males, and in younger compared to older animals.
Our results should be interpreted with the limitation of the included studies regarding higher risk of bias, using healthy animal model and the sample size calculation. Experimental studies that compare different sexes and ages within the same experiment are needed for stratification and comparison.
With the above-mentioned limitations, we observed that the negative effect of NSAID administration on biomechanical properties differs between animal species and this may suggest that rodent models (mice and rats) may be more sensitive than others for bone healing outcomes. Additionally, this finding suggests that the negative effect of NSAID on bone healing is species related for certain outcomes, and this need to be taken into consideration for knowledge translation. Other factors should be taking into consideration in selecting animal model for experimental bone regeneration studies which are highlighted in the 2018 systemic review by Peric et al. such as skeletal features of the selected animals, the model that can mimic the clinical scenarios including relevant doses and statistically supported sample size [
71]. The timing of healing outcome measurements also seems to modify the results because they do not show a significant effect of NSAID administration on μ-CT and histomorphometric outcomes early in the healing process compared to the control groups, but results differ when measurements are taken after 28 days. It is important to consider this information during the design of further experimental protocols, and it may be crucial in managing research efforts and reducing the unnecessary use of animals.
Finally, we did not find any studies that used animals of both sexes in their experimental design, which seems to be important for future studies because results differed between males and females for the work-to-failure biomechanical outcome.
Methodological quality of the studies
Our study quality checklist assessed aspects of both internal and external validity, and we observed many studies were generally of low-quality scores and tended to overstate the effect size. The overall quality score accounted for a significant proportion of between-study heterogeneity; however, the correlation between the aggregate quality score and the effect was not clear. The reporting and risk of bias assessments indicate the need for protocol registration or publication, and for the reporting of the elements of randomization, blinding, and allocation concealment [
35]. This concern is shared and addressed by others [
33,
35,
38]. It is crucial that future animal studies improve the reporting of study procedures, allowing others to replicate and build on previously published work. With better reporting, systematic reviews of higher quality will also become feasible.
Limitations
Several limitations of this work should be considered. One important limitation of this study was the risk of bias analysis which showed most of the included animal studies are poorly reported. This lack of reporting important methodological details might lead to increase bias. This limitation needs to be considered during interpretation of our conclusion from the included animal studies. In addition, we observed a level of heterogeneity present among studies reviewed for certain measured outcomes. This is can be expected as animal studies are explorative and heterogenous with respect species, sex, design, and drug administration protocol compared to clinical trials [
35]. Exploring heterogeneity among animal studies is one of the objectives of performing meta-analysis of these studies that might help to inform future study design. The use of random-effects models as we did in this study can help to account for expecting heterogeneity [
35]; however, appropriate caution still needs to be counted when interpreting the results. A fourth limitation is that the small sample size in animal studies which may exaggerate biases like publication bias which can affect the reliability and the validity of the study outcome. To overcome this limitation, one of the approaches we used was to calculate the standardized mean differences (SMD) through Hedges’
g effect sizes which is based on Cohen’s
D but includes a correction factor for small sample size bias [
35]. More specific limitation for this study was regarding the inclusion criteria of the systematic review, and we did not include animal models of disease or pathology, as we want to estimate the effect on healthy animal model, but this can be investigated and tested in future systematic review and meta-analysis with specific research question on the effect of NSAID administration on bone healing in diseased animal models. Other limitation is the meta-analysis of previous experimental studies did not include healing outcomes other than bone fracture (e.g., pain behavior or levels of inflammatory mediators). The measurements of pain behavior are subjective and depend on the interpretation of the examiner, which may lead to high heterogeneity among studies.
Clinical implications
Drug pharmacokinetics vary between species, and this must be considered when extrapolating data from animals to humans. Therefore, it is important to investigate NSAID doses that are equivalent to those used in humans after bone fracture (through interspecies allometric scaling for dose conversion from animal to human studies) and to evaluate effects using different types of animal and bone fracture models. The increasing amount of evidence from animal studies and the results from this systematic review and meta-analysis indicate that caution should be exercised when using NSAID after bone fractures or with specific orthopedic surgical procedures until prospective human clinical studies indicate otherwise.
Conclusions
Our findings provide some guidance for future laboratory and clinical research. First, it is important to test different hypotheses of bone fracture healing in small animals, and mice especially because mice models provide opportunities to examine genetics and create knock-out species. Our results also indicate the need for studies that compare the effect of NSAID administration on bone healing outcomes between male and female animal models. Overall, choosing appropriate animal model to test the effect of NSAID on fracture bone healing should take into consideration the animals’ species, type of the bone, and age of the animal.
Moreover, our results demonstrate it is important to choose the suitable time of sample collection based on what healing outcome to be measured. Histomorphometric outcome measurements require more than 21 days to show results that are comparable to the control group. Second, improvements in internal (study quality) and external (publication bias) validity might provide more information for the translation of the data to clinical trials, and more robust exploration of the efficacy limits in such studies could inform inclusion and exclusion criteria for these trials.
It is increasingly clear that the function of COX and their products are critical for bone healing. Most animal and human studies support the conclusion that NSAID administration can delay or impair bone fracture healing. However, the anti-inflammatory and analgesic effect of NSAID seems to be beneficial in treating post-traumatic pain and edema. The need for a new approach to using NSAID that preserves its pain control properties without affecting bone healing is justifiable and important.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.