Background
Lumbar radiculopathy is a common reason for physician consultations and imaging referrals [
1‐
3]. Typical symptoms are radiating pain, often with numbness, paraesthesia, and/or muscle weakness [
1,
4]. Clinical examination aims to clarify whether there is mechanical impingement of a nerve root [
5]. The most common clinical diagnostic tests are the straight leg raising test, and tests for tendon reflexes, motor weakness, and sensory deficits [
6]. An inaccurate clinical diagnosis may lead to unnecessary imaging and healthcare expenditure, and additional concerns for patients [
7‐
12].
The aim with imaging is to confirm or disprove a clinical suspicion, and to provide a roadmap for planning of surgical or other intervention procedures, if indicated. Mechanical nerve root impingements demonstrated with magnetic resonance imaging (MRI) or computer tomography (CT) is an accepted reference standard [
13].
Systematic reviews on the diagnostic properties of clinical diagnostic tests for lumbar radiculopathy report variable accuracy, with sensitivities ranging from 0.14 to 0.61 for sensory deficits and impaired tendon reflexes [
14,
15], 0.27 to 0.62 for motor weakness [
14,
16], and 0.35 to 0.81 for the straight leg raising test [
17]. Most studies report likelihood ratios (LRs) suggesting negligible differences between pre- and post-test probabilities for presence of nerve root impingement as the target condition, indicating limited value of the tests in clinical decision-making. A recent Cochrane review confirmed poor diagnostic performance of diagnostic tests in 18 studies from specialised care [
13].
This review raised concern that none of the reported studies specifically discriminated between nerve root impingement and just the presence of a disc herniation when using imaging as a reference standard. This could be a major bias, since the prevalence of disc bulging or herniation in unselected populations without radiculopathy symptoms is high [
18].
The aims of this study are to investigate the association between findings at clinical examination and nerve root impingement, to evaluate the accuracy of clinical index tests in a specialised care setting, and to see whether imaging clarifies the cause of chronic radicular pain.
Results
In total, 116 patients with unilateral chronic lumbar radiculopathy were included. Their clinical and demographic characteristics are summarised in Table
1. Mean age was 42.0 (SD 10.3) years, 68 (58.6%) were males, and the mean duration of symptoms on inclusion was 42.0 (SD 99.0) weeks. Figure
1 shows the results of MRI or CT for the included patients. The overall prevalence of disc herniation at any of the studied lumbar levels (L2 to S1) was 77.8%.
Table 1
Clinical and demographic characteristics of 116 patients with chronic lumbar radiculopathy
Smoker | 49 (42.2) |
Body mass index (kg/m2) Mean (SD) | 26.3 (3.8) |
Physically demanding work | 58 (50.0) |
Educational level
| |
Secondary school | 94 (81.0) |
College/University | 24 (19.0) |
Receiving sickness benefit | 53 (45.7) |
VAS Low back pain (0–100) Mean (SD) | 47.6 (24.3) |
VAS Leg pain (0–100) Mean (SD) | 50.6 (24.7) |
Time from referral to inclusion (weeks) Mean (SD) | 6.4 (6.8) |
Table
2 shows the frequencies of positive index tests, the overall clinical evaluation, and the imaging findings. Table
3 shows the diagnostic accuracies for the different index tests for detection of the level and side of the nerve root impingement. None of the individual tests were highly accurate, as both sensitivities and specificities were low with wide CIs. All positive LRs were ≤4.0, and all negative LRs ≥0.4.
Table 2
Incidence of positive index and reference tests in painful leg*
Nerve stretch tests
| | |
Femoral nerve stretch test | 6 | 6.0 |
Straight leg raise test | 62 | 53.4 |
Reflex tests
| | |
Knee reflex | 21 | 18.1 |
Ankle reflex | 47 | 40.5 |
Sensory loss testing
| | |
L3 | 4 | 3.4 |
L4 | 14 | 12.1 |
L5 | 31 | 26.7 |
S1 | 52 | 44.8 |
Motor strength/weakness
| | |
Hip flexion (Iliopsoas L1,L2,L3) | 13 | 11.2 |
Hip extension (Gluteus maximus L5,S1,S2) | 14 | 12.1 |
Hip abduction (Gluteus medius L4,L5,S1) | 9 | 7.7 |
Knee flexion (Hamstrings L5,S1,S2) | 64 | 55.2 |
Knee extension (Quadriceps femoris L2,L3,L4) | 1 | 0.9 |
Ankle dorsiflexion (Tibialis anterior L4,L5) | 37 | 31.9 |
Ankle plantarflexion (Gastro-cnemius and Soleus S1,S2) | 45 | 3.9 |
Ankle eversion (Peronei L5,S1) | 80 | 6.9 |
Big toe extension (Extensor hallucis longus L5,S1) | 25 | 21.5 |
Clinician suspected spinal nerve root impingement
|
L3 | 1 | 0.9 |
L4 | 7 | 6.0 |
L5 | 37 | 31.9 |
S1 | 71 | 61.2 |
MRI or CT proven disc herniation with spinal nerve root impingement
|
L3 | 0 | 0 |
L4 | 3 | 2.6 |
L5 | 30 | 25.9 |
S1 | 27 | 23.3 |
MRI or CT proven disc herniation without spinal nerve root impingement
|
L3 | 0 | 0 |
L4 | 1 | 0.9 |
L5 | 12 | 10.3 |
S1 | 17 | 14.6 |
MRI or CT normal or with minor degenerative changes without spinal nerve root impingement
|
All lumbar spinal levels | 26 | 22.4 |
Table 3
Diagnostic accuracy of individual neurological tests
Femoral nerve stretch test
| * | * | * | * | 0.17 (0.07–0.33) | 0.99 (0.94–1.00) | 14.33 (1.74–117.80) | 0.84 (0.72–0.99) | * | * | * | * |
Straight leg raise test
| * | * | * | * | 0.53 (0.36–0.70) | 0.47 (0.36–0.57) | 1.00 (0.68–1.47) | 1.00 (0.64–1.57) | 0.63 (0.44–0.78) | 0.49 (0.39–0.60) | 1.24 (0.87–1.78) | 0.75 (0.44–1.28) |
Knee reflex
| 0.67 (0.21–0.94) | 0.83 (0.75–0.89) | 3.96 (1.61–9.74) | 0.40 (0.08–1.99) | 0.18 (0.08–0.37) | 0.75 (0.63–0.84) | 0.73 (0.30–1.79) | 1.09 (0.87–1.37) | 0.11 (0.04–0.28) | 0.80 (0.70–0.87) | 0.55 (0.18–1.72) | 1.11 (0.94–1.32) |
Ankle reflex
| 0.67 (0.21–0.94) | 0.60 (0.51–0.69) | 1.67 (0.73–3.84) | 0.55 (0.11–2.76) | 0.27 (0.14–0.44) | 0.55 (0.44–0.65) | 0.59 (0.31–1.11) | 1.34 (1.00–1.79) | 0.44 (0.27–0.639 | 0.61 (0.50–0.70) | 1.13 (0.69–1.85) | 0.92 (0.63–1.33) |
Sensory loss L4
| 0.33 (0.06–0.79) | 0.88 (0.81–0.93) | 2.90 (0.54–15.55) | 0.75 (0.34–1.68) | 0.20 (0.10–0.37) | 0.91 (0.83–0.95) | 2.15 (0.81–5.70) | 0.88 (0.73–1.07) | 0.11 (0.04–0.28) | 0.88 (0.79–0.93) | 0.90 (0.27–2.99) | 1.01 (0.87–1.18) |
Sensory loss L5
| 0.33 (0.06–0.79) | 0.73 (0.65–0.81) | 1.26 (0.25–6.40) | 0.91 (0.40–2.03) | 0.43 (0.27–0.61) | 0.79 (0.69–0.86) | 2.07 (1.16–3.70) | 0.72 (0.51–1.00) | 0.18 (0.08–0.37) | 0.71 (0.61–0.79) | 0.63 (0.27–1.49) | 1.15 (0.92–1.44) |
Sensory loss S1
| * | * | * | * | 0.33 (0.19–0.51) | 0.51 (0.41–0.61) | 0.68 (0.39–1.18) | 1.30 (0.94–1.81) | 0.44 (0.27–0.63) | 0.55 (0.45–0.65) | 0.99 (0.61–1.60) | 1.01 (0.69–1.48) |
Hip flexion
| * | * | * | * | 0.23 (0.12–0.41) | 0.93 (0.86–0.97) | 3.34 (1.22–9.16) | 0.82 (0.67–1.01) | * | * | * | * |
Hip extension
| 0.33 (0.06–0.79) | 0.88 (0.81–0.93) | 2.90 (0.54–15.55) | 0.75 (0.34–1.68) | 0.03 (0.01–0.17) | 0.85 (0.76–0.91) | 0.22 (0.03–1.61) | 1.14 (1.02–1.27) | 0.18 (0.08–0.37) | 0.90 (0.82–0.94) | 1.83 (0.67–5.00) | 0.91 (0.75–1.10) |
Hip abduction
| * | * | * | * | 0.07 (0.02–0.21) | 0.92 (0.84–0.96) | 0.82 (0.18–3.73) | 1.01 (0.91–1.13) | 0.04 (0.01–0.18) | 0.91 (0.83–0.95) | 0.41 (0.05–3.15) | 1.06 (0.96–1.17) |
Knee flexion
| 0.67 (0.21–0.94) | 0.45 (0.36–0.54) | 1.22 (0.54–2.75) | 0.74 (0.15–3.71) | 0.50 (0.33–0.67) | 0.43 (0.33–0.53) | 0.88 (0.59–1.31) | 1.16 (0.75–1.79) | 0.70 (0.51–0.84) | 0.49 (0.39–0.60) | 1.39 (1.01–1.92) | 0.60 (0.32–1.11) |
Knee extension
| * | * | * | * | * | * | * | * | * | * | * | * |
Ankle dorsiflexion
| 0.33 (0.06–0.79) | 0.68 (0.59–0.76) | 1.05 (0.20–5.30) | 0.98 (0.44–2.20) | 0.40 (0.25–0.58) | 0.71 (0.61–0.79) | 1.38 (0.80–2.38) | 0.85 (0.61–1.17) | 0.26 (0.13–0.45) | 0.66 (0.56–0.75) | 0.77 (0.38–1.55) | 1.12 (0.85–1.46) |
Ankle plantarflexion
| 0.67 (0.21–0.94) | 0.62 (0.53–0.70) | 1.75 (0.76–4.03) | 0.54 (0.11–2.68) | 0.27 (0.14–0.44) | 0.57 (0.46–0.67) | 0.62 (0.33–1.18) | 1.29 (0.97–1.71) | 0.44 (0.27–0.63) | 0.63 (0.52–0.72) | 1.20 (0.73–1.98) | 0.88 (0.61–1.28) |
Ankle eversion
| 0.67 (0.21–0.94) | 0.31 (0.23–0.40) | 0.97 (0.43–2.17) | 1.08 (0.21–5.46) | 0.45 (0.27–0.65) | 0.28 (0.19–0.38) | 0.63 (0.39–1.01) | 1.96 (1.17–3.26) | 0.70 (0.51–0.84) | 0.31 (0.23–0.42) | 1.03 (0.77–1.36) | 0.94 (0.49–1.82) |
Big toe extension
| * | * | * | * | 0.33 (0.19–0.51) | 0.83 (0.73–0.89) | 1.91 (0.97–3.79) | 0.81 (0.62–1.06) | 0.15 (0.06–0.32) | 0.76 (0.67–0.84) | 0.63 (0.24–1.67) | 1.11 (0.49–1.82) |
Table
4 shows that the clinicians’ overall evaluations using information from all relevant index tests to predict nerve root impingement were slightly more accurate than each of the individual index tests. ROC analysis of the diagnostic properties of the overall clinical evaluations showed AUCs of 0.95 (95% CI 0.90–1.00) for L4, 0.67 (95% CI 0.56–0.77) for L5, and 0.66 (95% CI 0.54–0.77) for S1 nerve root impingement.
Table 4
Diagnostic accuracy of clinician examination conclusion
Clinician concluded L4 nerve root impingement
| 0.33 (0.06–0.79) | 0.95 (0.89–0.97) | 6.28 (1.06–37.21) | 0.70 (0.32–1.57) | 0.10 (0.03–0.26) | 0.95 (0.89–0.98) | 2.15 (0.51–9.06) | 0.94 (0.83–1.07) | * | * | * | * |
Clinician concluded L5 nerve root impingement
| 0.33 (0.06–0.79) | 0.68 (0.59–0.76) | 1.05 (0.21–5.30) | 0.98 (0.43–2.20) | 0.47 (0.30–0.64) | 0.73 (0.63–0.81) | 1.74 (1.04–2.93) | 0.73 (0.51–1.04) | 0.26 (0.13–0.45) | 0.66 (0.56–0.75) | 0.77 (0.38–1.55) | 1.12 (0.85–1.46) |
Clinician concluded S1 nerve root impingement
| 0.33 (0.06–0.79) | 0.38 (0.30–0.47) | 0.54 (0.11–2.68) | 1.75 (0.76–4.03) | 0.43 (0.27–0.61) | 0.32 (0.23–0.43) | 0.64 (0.42–0.99) | 1.74 (1.12–2.69) | 0.74 (0.55–0.87) | 0.43 (0.33–0.53) | 1.29 (0.97–1.72) | 0.61 (0.31–1.20) |
Discussion
This study included patients with symptoms suggesting lumbar radiculopathy. Patients were recruited by screening and referral from general practitioners, and those with large disc herniation obviously requiring surgery were excluded. The sample emerging from these criteria is typical for the chronic radiculopathy population seen in specialised care. Results from the study are relevant for our understanding of diagnostic accuracy in the common clinical setting where specialists have access to imaging findings prior to the clinical examination, and often are challenged by having to evaluate which of numerous positive imaging findings are to be considered clinically relevant.
The main finding is that individual clinical index tests lack diagnostic accuracy for predicting whether a lumbar nerve root is impinged or not at a specific level in patients with chronic lumbar radiculopathy in specialised care. The overall clinical evaluation, consisting of the specialists’ combined interpretation of the patients’ history and all index tests, was somewhat more accurate. For L5 and S1 nerve root impingement, however, LRs did not reach the levels usually considered necessary to influence post-test probability and thereby clinical decision-making (positive LR >5.0 and negative LR <0.2) [
31]. Accuracy was better (positive LR 6.28, negative LR 0.70) for L4 nerve root impingement. This was probably because L4 nerve root involvement occurred only in 3 (2.6%) cases, and was suspected after the overall clinical evaluation only in 7 (6.0%) cases. This resulted in a high number of true negatives, and thereby high specificity. Clinically, the low pre-test probability for L4 nerve root involvement is well known [
32], and these test properties are therefore not very useful. Accordingly, clinical examination is inaccurate both for predicting the presence or absence of nerve root impingement, and for clarifying the relevant level and side in patients with multiple positive imaging findings.
Our findings are mainly in accordance with other studies of selected populations from specialised care [
13]. Most previous studies have, however, aimed for a generalised understanding of test properties from such selected materials [
13]. This approach is confusing, as the pre-test probability always must be taken into consideration. Recently, a study aimed to specifically investigate the accuracy of clinical index tests from the neurological examination for identification of the level of disc herniation in patients with the target condition already confirmed by MRI [
33]. Unfortunately the study did not find evidence to support this. The results were disappointing, with no single test reaching an AUC >0.75, and only slightly better results (AUC = 0.80) for the neurologists’ overall evaluation.
It has been a weakness of most previous studies that interpretation of the imaging findings has been limited to categorising the target condition (usually a disc herniation) as present or not, without considering whether a nerve root actually was impinged at the relevant spinal level and side [
34]. We therefore improved the study design by specifically addressing findings relevant for clinical decision-making: correspondence between index tests and impingement of specific nerve roots as revealed by MRI [
32]. Disappointingly, this did not improve diagnostic accuracy, neither for individual tests nor for the clinicians’ overall evaluation. AUCs for L5 and S1 nerve root impingement did not reach levels above 0.66, which are even lower than those observed by Hancock et al. in an almost similar specialised care setting [
33]. This could be because we used one or more positive index tests as an inclusion criterion, which probably increased both the proportion of false positives and false negatives. The false negatives increased because the index tests are not independent of each other, implying that inclusion based on one or more positive tests entails an increased proportion of false negatives, since many tests are performed in each patient. We do not consider the selection of patients in our study a methodological weakness, but rather an expression of clinical reality in specialised care. There should, however, be concern about both the definition of the target condition and the reference standard being subjects to bias. First, neuroanatomical overlap between spinal segments influences accuracy when the analysis is done on the level of each single nerve root [
35‐
37]. Patients may have radiculopathy from causes other than ongoing nerve root impingement, and even when an impingement is present, this is not necessarily the cause of the pain. Imaging showed no sign of nerve root impingement in 56 (48.3%) of the included cases despite a clear history and clinical findings suggesting lumbar radiculopathy. This confirms that radiculopathy may have other causes, such as neuropathic and inflammatory conditions, or be mimicked by myofascial pain [
6,
38‐
40]. Moreover, disc herniation without nerve root impingement was demonstrated in 25.9% of the included patients, and in 73.8% of those excluded due to symptoms classified as unspecific low back pain with referred leg pain. This is not surprising, since the prevalence of disc herniation revealed by MRI in the general population is known to be as high as 30% [
3,
18,
41‐
44].
We suggest that our findings reflect clinical reality very well: in a population selected by referral from primary care and exclusion of the most obvious surgical cases, co-morbidity bias and imaging findings not related to the symptoms are common. Diagnostic imaging combined with clinical tests is therefore inaccurate for clarifying the cause of radicular pain. This is probably one of the reasons why these patients are so difficult to treat, and the same inaccuracy may cause significant inclusion bias in clinical trials evaluating treatments for lumbar radiculopathy.
The present study has weaknesses. We did not register inter-tester variability for the clinical tests and image interpretations. However, all clinicians were trained to perform the tests in a standardised manner, and agreement should thus be superior to that achieved between clinicians in daily practice [
22]. MRI was substituted with CT in 7 (6.0%) of the study subjects. A few cases of nerve root impingement may have been missed, but this is unlikely to have influenced the results significantly. Further, the duration of symptoms (average 42 weeks) was relatively long. Development of chronic centralised pain followed by regression of nerve root impingement may have occurred in some patients, and our results may not be generalisable to situations with shorter symptom duration.
Finally, it must be emphasised that the index tests work differently when applied in other settings. In unselected primary care populations, the proportion of false positives will be lower and the specificity of the tests higher. Accordingly, the tests may be useful in primary care to reduce the post-test likelihood of lumbar radiculopathy, and thereby restrict unnecessary referrals for imaging and specialised care. On the other hand, when applied in a highly selected surgical patient population with shorter duration of symptoms and a large disc herniation obviously corresponding with the symptoms, the proportion of true positives will be high and the proportion of false positives low, resulting in high sensitivity and specificity. The results from the present study should therefore not be generalised to unselected patient populations in primary care nor to even more selected surgical populations.
Acknowledgements
We thank Jan Inge Letto, Anne Sofie Broback, Dag Grindheim, Robert Kouwenhoven, Fredrik Granviken, Franz Hintringer, Svetlana Rasic, Helge Hartman, Sigrun Randen, and Einar Vegå for doing the assessments. A special thanks to the Clinical Research Centre at the University Hospital of North Norway and to Bjørn Odvar Eriksen, Inger Sperstad, May Greta Pedersen, Sameline Grimsgaard, Dag Grønvoll, Aslaug Jakobsen, Rolf Salvesen, Dagfinn Thorsvik, Tormod Hagen, Bjørn Skogstad, and all the patients who made this study possible.
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
TI contributed to the study design, data collection, data analysis, interpretation, and writing of the manuscript. TKS, ØN, ToI, TW, and BR contributed to the study design, data analysis, interpretation, and writing of the manuscript. JIB and KW contributed to data analysis, interpretation, and writing of the manuscript. All authors reviewed and approved the final version of the manuscript.