Introduction
Knee osteoarthritis (OA) is one of the most common irreversible joint diseases worldwide and is characterized by progressive cartilage loss and joint space narrowing. According to the World Health Organization, approximately 11.8% to 12.7% of the global population is affected by knee OA [1]. The situation is similar in China, where the number of OA patients increased nearly 2.35-fold over the past three decades, and approximately 61.2 million individuals suffered from symptomatic OA in 2017, of whom 47%, 35.9%, and 17.1% had mild, moderate, and severe OA, respectively [2]. Despite the high prevalence of knee OA, effective and permanent interventions to halt or reverse the degenerative progression have not yet been developed [3]. Chronic intra-articular inflammation accompanied by joint pain and dysfunction is the main pathological feature of knee OA and necessitates long-term management. Widely applied pharmacotherapies aimed at reducing inflammation and pain are limited to acetaminophen and non-steroidal anti-inflammatory drugs (NSAIDs) [4]. However, these conventional medications have only a marginal effect on pain and no significant impact on joint function. Adverse events (AEs) that may occur in the digestive and cardiovascular systems also restrict the feasibility of long-term NSAID administration [5–7]. Hence, the search for alternative options with good safety and efficacy profiles for knee OA has turned to traditional herbal medicine [8–10]. Notably, curcumin, extracted from the rhizome of Curcuma longa L. [Zingiberaceae] (CL), is a botanical extract with promising clinical value [11].
Curcumin and its analogues bisdemethoxycurcumin, demethoxycurcumin, and cyclocurcumin are collectively referred to as curcuminoids (CURs) [12]; they constitute the principal ingredients of CL, an herbal plant used in East Asia to alleviate pain and inflammation. CURs are natural polyphenols that have been shown in in vivo and in vitro studies to exert anti-inflammatory and antioxidant effects by downregulating the inflammation-related nuclear factor kappa-B (NF-κB) signaling pathway, scavenging free radicals, and inhibiting enzymes such as cyclooxygenase-2 (COX-2), 5-lipoxygenase (5-LOX), and nitric oxide synthase (NOS), which exacerbate oxidative stress in OA [13–15]. Conventional NSAIDs raise serious safety concerns because they inhibit COX-1 and COX-2 simultaneously in arthritis, whereas CURs preferentially reduce the synthesis of COX-2 [16], which may result in a better safety profile. Furthermore, CURs exhibit chondroprotective properties by stimulating extracellular matrix synthesis and downregulating the synthesis of matrix metalloproteinases (MMPs) [17]. CURs have also been shown to slow the progression of joint contracture by inhibiting the proliferation of myofibroblasts from the joint capsule [18]. Because the pathophysiology of knee OA is characterized by inflammation and degeneration, with pain and dysfunction as the prominent symptoms, alleviating local inflammation and oxidative stress, stimulating cartilage regeneration, and delaying joint contracture may benefit patients; CURs have therefore emerged as an attractive treatment option for knee OA.
Several animal studies have assessed the efficacy of CURs administered via nano-scale drug carriers for knee OA, demonstrating that CURs have potent anti-inflammatory and anti-arthritic activity, both with and without biological materials [19–23]. Although pre-clinical studies have shown promising results, the clinical efficacy, safety, dosage, and treatment duration of CURs for knee OA remain equivocal. The therapeutic effects of CURs for knee OA reported in previous reviews remain uncertain because those reviews broadly included trials of CURs combined with other agents or of CURs-free interventions [8, 24–26], and evidence on the clinical significance of CURs alone for knee OA is insufficient. Consequently, we conducted a systematic review and meta-analysis to summarize the evidence to date on the clinical effectiveness of CURs alone in alleviating pain and dysfunction in knee OA. We hypothesized that CURs are superior to control interventions in relieving pain and improving function.
Methods
The research was performed according to our pre-registered protocol (CRD42021266888, PROSPERO), with some amendments to the selection and assessment of outcomes. We adopted the concept of the minimum clinically important difference (MCID) [27] to assess the clinical significance of CURs for treating knee OA. The study was conducted under the guidance of the Cochrane Handbook for Systematic Reviews of Interventions [28] and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist (Supplementary Table 1) [29].
Literature search
An electronic literature search was conducted in August 2022. The Cochrane Library, Medline via PubMed, Web of Science, Embase, CNKI (China National Knowledge Infrastructure), SinoMed (Chinese BioMedical Literature Service System), Wanfang and VIP databases, and ClinicalTrials.gov (http://ClinicalTrials.gov) were searched for all published randomized controlled trials (RCTs) evaluating the efficacy and safety of CURs alone in treating knee OA, without time or language restrictions. A sample search strategy for PubMed and Embase is shown in Supplementary Table 2.
Study selection
Eligibility criteria
Eligible RCTs were included in this study based on the following criteria: (1) participants: patients diagnosed with knee OA according to the criteria proposed by the American College of Rheumatology (ACR) [30]; (2) intervention: oral CURs; (3) control: oral conventional agents or placebo; (4) outcomes: one or more of visual analog scale (VAS) for pain, Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) total score, WOMAC subscale scores (pain, function, and stiffness scores), withdrawal rate, concomitant rescue medications, OA biomarkers, and adverse events (AEs); and (5) study design: RCTs. Studies were excluded if they met any of the following criteria: (1) CURs were combined with other treatments; (2) essential data were lacking; or (3) the full text was unavailable.
Selection process
To select relevant studies for further assessment, two independent reviewers (F.X. and X.M.) removed duplicate publications using EndNote X9 and classified each citation as eligible, ineligible, or uncertain by screening titles and abstracts. For eligible and uncertain records, the full texts were then assessed to confirm whether the studies were RCTs comparing CURs alone with conventional therapies or placebo in the treatment of knee OA.
Data extraction and data items
All data were extracted by two reviewers (L.L.T. and Z.L.) and recorded in Excel spreadsheets prepared in advance. The following items were extracted: (1) study characteristics; (2) patient demographics; and (3) outcome data. Predefined primary outcomes included VAS for pain, WOMAC pain score, WOMAC function score, and adverse events. All other outcomes were defined as secondary outcomes. When two or more reports originated from the same clinical trial, only the most recent reports providing the requisite outcomes were included, and they were regarded as one study. Attempts were made to obtain missing data by contacting the corresponding author, browsing supplementary files, or consulting relevant data from previous meta-analyses.
Methodological quality assessment
Two reviewers (L.L.T. and Z.L.) applied the recommended Cochrane Risk of Bias Tool 1 [31] to assess the risk of bias of the included studies. Each study was judged as having low, unclear, or high risk of bias in the following assessment domains: random sequence generation (selection bias), allocation concealment (selection bias), blinding of participants and personnel (performance bias), blinding of outcome assessment (detection bias), incomplete outcome data (attrition bias), selective reporting (reporting bias), and other bias.
Statistical analysis
All data were processed and analyzed by J.F. and Z.L. using Stata 14 (StataCorp, College Station, Texas, USA) and RevMan 5.4 (The Cochrane Collaboration, Copenhagen, Denmark). We performed meta-analyses to pool the treatment effects of the CURs and control groups, using a random-effects model because of the substantial variability within and between studies [32]. Continuous outcomes were reported as weighted mean differences (WMDs) with 95% confidence intervals (CIs), and risk ratios (RRs) with 95% CIs were calculated for dichotomous outcomes. A pooled effect size with a p-value < 0.05 was considered statistically significant. The MCID, defined as the smallest change in a subjective outcome needed to achieve clinical efficacy that satisfies patients and clinicians [27], was adopted as the threshold for clinical significance. The MCID threshold for the VAS and WOMAC scores was defined as a 20% change from the baseline of the included studies, according to previous research [33–36], and was calculated as follows: 1.18/10 for VAS for pain, 8.97/96 for WOMAC total score, 2.12/20 for WOMAC pain score, 6.62/68 for WOMAC function score, and 0.76/8 for WOMAC stiffness score. Inter-study heterogeneity was assessed with the χ²-based Q-test and the I² index, and an I² value of 50% was taken as the boundary between low and high heterogeneity. To explore the influence of various factors on the primary pain- and function-related outcomes, we carried out pre-planned subgroup analyses for the placebo-controlled group based on daily dose of CURs (dose < 1,000 mg or dose ≥ 1,000 mg), total dose of CURs (dose < 50 g or dose ≥ 50 g), follow-up duration (time < 12 weeks or time ≥ 12 weeks), type of CURs (bio-optimized or pure extracts), and region (Asia or non-Asia). Publication bias was assessed using funnel plots and Egger’s test for outcomes involving five or more comparisons. The robustness of the quantitative synthesis was tested in a sensitivity analysis by omitting each study in turn. Outcomes that could not be pooled quantitatively were summarized narratively.
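The pooling and threshold steps described above can be illustrated with a minimal sketch. It assumes the DerSimonian-Laird estimator for the random-effects model (a common choice; the text does not state which estimator was used), and all function names and input values are hypothetical illustrations, not data from the included trials. The baseline mean of 5.9 is back-calculated from the reported VAS threshold of 1.18 and is likewise an assumption.

```python
import math


def pool_random_effects(effects, ses):
    """Pool study-level mean differences with a DerSimonian-Laird
    random-effects model (an assumed estimator, see lead-in)."""
    w = [1.0 / se ** 2 for se in ses]            # inverse-variance weights
    sw = sum(w)
    fixed = sum(wi * yi for wi, yi in zip(w, effects)) / sw
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, effects))  # Cochran's Q
    df = len(effects) - 1
    c = sw - sum(wi ** 2 for wi in w) / sw
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0   # between-study variance
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0  # I² in percent
    w_re = [1.0 / (se ** 2 + tau2) for se in ses]
    pooled = sum(wi * yi for wi, yi in zip(w_re, effects)) / sum(w_re)
    se_pooled = math.sqrt(1.0 / sum(w_re))
    ci = (pooled - 1.96 * se_pooled, pooled + 1.96 * se_pooled)
    return {"wmd": pooled, "ci": ci, "q": q, "tau2": tau2, "i2": i2}


def mcid_from_baseline(baseline_mean, fraction=0.20):
    """MCID defined as a fixed fraction (20% here) of the baseline score."""
    return fraction * baseline_mean


def exceeds_mcid(wmd, mcid):
    """Clinical significance: the pooled difference is at least the MCID."""
    return abs(wmd) >= mcid


def leave_one_out(effects, ses):
    """Sensitivity analysis: re-pool after omitting each study in turn."""
    return [
        pool_random_effects(effects[:i] + effects[i + 1:],
                            ses[:i] + ses[i + 1:])["wmd"]
        for i in range(len(effects))
    ]


if __name__ == "__main__":
    # Hypothetical VAS mean differences (CURs minus placebo) and standard errors.
    effects = [-1.6, -0.9, -2.1, -1.2]
    ses = [0.30, 0.25, 0.40, 0.35]
    result = pool_random_effects(effects, ses)
    mcid = mcid_from_baseline(5.9)  # assumed pooled baseline VAS
    print(result)
    print("high heterogeneity:", result["i2"] > 50)
    print("exceeds MCID:", exceeds_mcid(result["wmd"], mcid))
    print("leave-one-out WMDs:", leave_one_out(effects, ses))
```

Keeping the MCID check separate from the pooling step mirrors the distinction drawn in the text between statistical significance (the CI and p-value of the pooled WMD) and clinical significance (the size of the WMD relative to the threshold).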
Evidence evaluation
The quality of evidence was classified using the GRADE system [37] as high, moderate, low, or very low, scored as 4, 3, 2, or 1, respectively. As the included studies were all RCTs, each outcome started at high quality, but confidence in the evidence could be downgraded in the following domains: (1) study limitations; (2) inconsistency of results; (3) indirectness of evidence; (4) imprecision; and (5) publication bias. When evaluating study limitations [38], the quality of evidence could be downgraded from high according to the results of the methodological quality assessment; for example, if a study judged to have an unclear risk of bias was likely to lower confidence in the effect estimate, the quality of the related outcomes was downgraded by one level to moderate. The I² index was used to grade inconsistency [8, 39]: I² ≤ 50% was rated ‘not serious’ (no downgrade); 50% < I² ≤ 75% was rated ‘serious’ (minus 1); and I² > 75% was rated ‘very serious’ (minus 2). We applied the MCID in grading imprecision for VAS and WOMAC scores when the results were not statistically significant: a 95% CI that exceeded the MCID at either the upper or the lower confidence limit was rated ‘serious’ (minus 1), and a 95% CI of the WMD that encompassed the MCID was rated ‘very serious’ (minus 2). Imprecision [40] for RRs was assessed in strict adherence to the GRADE guidelines. As indirectness was addressed by the stringent inclusion and exclusion criteria, reassessment was not necessary. Publication bias was evaluated according to the results of funnel plots and Egger’s test.
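The inconsistency and imprecision rules above can be written as a small scoring sketch. The function names are hypothetical, and two points are our reading rather than something the text states: a CI crossing the MCID on one side only is taken as ‘serious’ and a CI crossing it on both sides as ‘very serious’, and statistically significant results are given no MCID-based imprecision downgrade. Downgrades for risk of bias, indirectness, and publication bias are passed in as a single number because they rest on reviewer judgment.

```python
LEVELS = {4: "high", 3: "moderate", 2: "low", 1: "very low"}


def inconsistency_downgrade(i2):
    """Levels to subtract for inconsistency, from the I² index in percent."""
    if i2 <= 50:
        return 0          # not serious
    if i2 <= 75:
        return 1          # serious
    return 2              # very serious


def imprecision_downgrade(ci_low, ci_high, mcid):
    """Levels to subtract for imprecision of a WMD, using the MCID.

    Assumed interpretation: the rule applies only when the 95% CI includes
    zero; crossing the MCID on one side costs 1 level, on both sides 2.
    """
    significant = ci_low > 0 or ci_high < 0
    if significant:
        return 0          # assumption: rule not applied to significant results
    crosses_lower = ci_low < -mcid
    crosses_upper = ci_high > mcid
    return int(crosses_lower) + int(crosses_upper)


def grade(i2, ci_low, ci_high, mcid, other_downgrades=0):
    """Start at 4 (high, all RCTs) and subtract downgrades, floored at 1."""
    score = (4
             - inconsistency_downgrade(i2)
             - imprecision_downgrade(ci_low, ci_high, mcid)
             - other_downgrades)
    return LEVELS[max(1, score)]


if __name__ == "__main__":
    # Hypothetical outcome: I² = 62%, 95% CI of WMD from -1.5 to 0.3, MCID 1.18.
    print(grade(62, -1.5, 0.3, 1.18))
```

Floored scoring reflects that GRADE has no level below ‘very low’, so several serious concerns on one outcome cannot push it further down.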
Discussion
The principal finding of our study was that CURs were more effective than placebo and not inferior to NSAIDs in reducing pain and improving function in knee OA. The pooled analyses found that CURs were more effective than placebo in improving VAS for pain, WOMAC total score, WOMAC pain score, WOMAC function score, and WOMAC stiffness score, whereas no significant difference was found between CURs and NSAIDs. We used the MCID as a threshold in this meta-analysis to assess the clinical significance of the difference between CURs and the control groups, instead of relying solely on statistical significance. The MCID can be calculated by anchor-based and distribution-based methods; we applied the anchor-based method and set the threshold at 20% based on previous research [27, 33–36]. The test of clinical significance found that only VAS for pain and WOMAC total score achieved clinical significance by exceeding their MCIDs, whereas WOMAC pain score, WOMAC function score, and WOMAC stiffness score did not. We also found that CURs did not increase the incidence of AEs compared with placebo and NSAIDs. The total incidences of AEs in the CURs and control groups were 25.06% and 35.57%, respectively. Diarrhea and/or constipation and stomach pain (5.8% and 8.17%) were the most frequent mild AEs in the CURs and control groups, respectively (Table 4).
Pain and dysfunction are the leading drivers of medical care use and clinical decision making in knee OA [73]. Novel disease-modifying treatments targeting the pathological process of OA are in development to resolve the treatment dilemma posed by symptom-relieving drugs (painkillers or NSAIDs) [74]. Among these, CURs have attracted considerable attention from medical researchers and clinicians [8–10]. CURs have been shown to possess therapeutic effects on knee OA as a result of their anti-inflammatory and antioxidant properties [15]. The regulation of inflammation- and catabolism-related pathways is the main mechanism underlying the anti-inflammatory and chondroprotective properties of CURs [75]. CURs exert anti-apoptotic and antioxidant effects on chondrocytes and induce chondrogenic proliferation of mesenchymal stem cells. Thus far, many pre-clinical and clinical studies [76] have identified CURs as effective for treating knee OA. Despite their highly pleiotropic effects in knee OA, the application of CURs is controversial because of their poor oral bioavailability. Numerous studies have focused on methods to optimize the pharmacokinetics of CURs [77]. Among the included studies, ten used bioavailable CURs, such as nanocurcumin [61, 70, 71] and liposome CURs complexes [58, 67], among others, while the other five used pure extracts from CL. Theoretically, bio-optimized CURs should be superior to pure extracts given their higher absorptivity and lower metabolism [77]. In our study, the effect size in the bio-optimized extracts subgroup exceeded the MCID for WOMAC pain score, whereas the result in the pure extracts subgroup was neither statistically nor clinically significant. In addition, the effect sizes for both VAS for pain and WOMAC function score in the bio-optimized extracts subgroup exceeded their MCIDs, whereas those in the pure extracts subgroup did not. These findings indicate that bio-optimized CURs may have better clinical applicability for knee OA than pure CURs. However, a recent meta-analysis by Wang et al. [24] found no significant difference between enhanced and normal CL extracts in pain- and physical function-related outcomes. Two trials [44, 59] that used normal CURs as adjuvants to NSAIDs were included in their placebo-controlled group for quantitative synthesis, which may explain the divergence regarding the superiority of the enhanced CURs, given that the added effects of NSAIDs were neglected. Because each study in this meta-analysis applied different methods and strategies to improve the bioavailability of CURs, direct comparisons between different CURs products are essential to verify our findings and to identify a cost-effective agent. Bio-optimization techniques that combine more than one approach to overcome the barriers to oral bioavailability (e.g., poor water solubility, rapid metabolism, and instability) could be expected to achieve a significant improvement in the effectiveness of CURs.
The appropriate dosage of CURs for treating knee OA remains uncertain. Previous meta-analyses by Daily et al. [10] and Onakpoya et al. [9] showed that the typical dose of 1,000 mg/day was effective in improving inflammation-related symptoms. In our study, there was no statistically significant difference in the main outcomes between CURs and placebo in any of the dose subgroups (daily dose < or ≥ 1,000 mg and total dose < or ≥ 50 g). Theoretically, the optimal dose of a drug is closely associated with its safety and bioavailability. Various studies of diverse diseases have shown that CURs are effective without major safety concerns even at doses as high as 6 g/day [78], which indicates that the main constraint on the optimal dosage is the bioavailability of CURs. Thus, in theory, optimizing bioavailability may allow lower doses and better compliance without compromising the curative effect in knee OA. Indeed, we found that the effect sizes for VAS for pain, WOMAC pain score, and WOMAC function score in the low-dose subgroup (daily dose < 1,000 mg and total dose < 50 g) exceeded their MCIDs, whereas those in the high-dose subgroup (daily dose ≥ 1,000 mg and total dose ≥ 50 g) did not achieve clinical significance. The observed difference in clinical value between the low- and high-dose subgroups may be due to the limited number of studies in each subgroup or to the use of bio-optimized CURs in all studies in the low-dose subgroup.
A recently published review by Zeng et al. [26] suggested that CURs do not exhibit significant therapeutic effects until they have been administered for more than 12 weeks. Given the simultaneous inclusion of active-controlled [44, 56, 57, 59, 79] and placebo-controlled trials in their subgroup analyses, the effect sizes at different time points may have been weakened by the NSAIDs-controlled arms. We therefore removed the data of active-controlled trials from our subgroup analyses and found that CURs showed favorable improvement in VAS for pain, WOMAC pain score, and WOMAC function score compared with placebo at each time point (follow-up duration < or ≥ 12 weeks). In addition, the effect sizes of all main outcomes exceeded the MCID in the < 12 weeks subgroup, but the effect sizes of the WOMAC pain score and WOMAC function score in the ≥ 12 weeks subgroup did not achieve clinical significance. The difference in clinical value between the short- and long-term subgroups may arise because all three studies [62, 67, 71] in the short-term subgroup applied bio-optimized CURs, whereas two [65, 69] of the three studies [65, 66, 69] in the long-term subgroup used normal CURs. Overall, in terms of alleviating pain and other symptoms, bio-optimized CURs may be sufficiently potent to allow a lower dosage and a shorter treatment course. Moreover, larger effect sizes with clinical significance for both pain reduction and functional improvement were observed in trials performed in Asia than in those performed elsewhere, which is in accordance with the result of a recently published review by Wang et al. [24].
The quality of our findings was evaluated using the GRADE system [37]. All pain- and function-related outcomes were downgraded to moderate to very low quality of evidence due to inconsistency, risk of bias, and publication bias, whereas AEs were rated as high-quality evidence (Supplementary Table 3). The US Food and Drug Administration (FDA) has defined CURs as nutraceuticals under “Generally Recognized as Safe” (GRAS) [80], and good safety and tolerability have been shown by numerous studies at the cellular level, in animals, and even in human subjects [78]. Nevertheless, it is worth noting that nutraceuticals like CURs generally lack a systematic safety assessment before being used for medicinal purposes [81]; the potential dose- and time-dependent side effects of CURs on the human body should therefore be weighed carefully against the current benefits and potential value of broader clinical use of CURs.
Strengths and limitations
In this study, we included the fifteen most recent clinical trials on this topic. Trials with CURs-free or combined interventions were excluded to give a more objective picture of the therapeutic effect of CURs alone for knee OA. In addition, the clinical significance of CURs in alleviating pain and dysfunction in knee OA was evaluated using the MCID of patient-reported outcomes. However, the limitations of our study should be considered when interpreting our findings. Firstly, the quality of the original studies was low and substantial heterogeneity was detected among them; the exact sources of heterogeneity were hard to identify and may stem from variations in dosages, follow-up durations, regions, preparation schemes of CURs, and baseline values. Secondly, obvious heterogeneity was still present after subgroup analyses, which indicates that the heterogeneity did not stem entirely from the subgrouping variables. The quality of the included studies was uneven, the degree of bias was large, and the number of studies in each subgroup was rather small, which could result in large differences in the results of the statistical analyses. Thirdly, follow-up for the main outcomes was limited to 6 months; as a result, the long-term clinical effectiveness of CURs remains equivocal. In addition, although Egger's test did not indicate significant publication bias, the limited number of included studies made the linearity assessment quite uncertain and at risk of being overly influenced by single studies, and the asymmetry of the funnel plots on visual inspection also suggested potential publication bias. Because the symmetry of funnel plots can be influenced by various factors, such as publication bias and/or small-study effects, it was difficult to determine the cause of the asymmetry, especially when the number of studies in each comparison was fewer than ten [82]. For these reasons, further studies are needed to confirm our findings and to perform more comprehensive analyses.
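To make the point about Egger's test and small numbers of studies concrete, the following is a minimal sketch of the regression behind it: each study's standardized effect (effect divided by its standard error) is regressed on its precision (one over the standard error), and an intercept far from zero suggests funnel-plot asymmetry. The function name and the input values are hypothetical, not data from the included trials, and the p-value step (comparing the t statistic with a t distribution on n − 2 degrees of freedom) is left out to keep the example dependency-free.

```python
import math


def egger_intercept(effects, ses):
    """Ordinary least-squares fit of effect/se on 1/se.

    Returns (intercept, standard error of intercept, t statistic).
    The t statistic is NaN when the fit is exact and its SE is zero.
    """
    n = len(effects)
    if n < 3:
        raise ValueError("at least three studies are needed")
    x = [1.0 / se for se in ses]                     # precision
    y = [e / se for e, se in zip(effects, ses)]      # standardized effect
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("standard errors must not all be identical")
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    intercept = my - slope * mx
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    s2 = rss / (n - 2)                               # residual variance
    se_int = math.sqrt(s2 * (1.0 / n + mx ** 2 / sxx))
    t = intercept / se_int if se_int > 0 else float("nan")
    return intercept, se_int, t


if __name__ == "__main__":
    # Hypothetical comparison in which smaller studies report larger effects.
    effects = [-0.2, -0.5, -1.0, -1.5]
    ses = [0.1, 0.2, 0.4, 0.5]
    intercept, se_int, t = egger_intercept(effects, ses)
    print(f"intercept={intercept:.3f}, se={se_int:.3f}, t={t:.3f}, df={len(effects) - 2}")
```

With only a handful of studies the regression has very few degrees of freedom, so a single study can move the intercept substantially, which is the reason a non-significant result should be read with caution.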