Background
Low back pain (LBP) is one of the most common musculoskeletal problems across the globe with an estimated lifetime incidence of 60-80 % [
1]. The high costs of low back pain are borne by the government, insurance companies, and the general public in the form of medical treatments and impairment compensations [
2,
3]. Range of motion (ROM) amongst other variables has been identified to be an indicator of impairment due to back pain [
4].
A great deal of research has gone into examining different tools for measuring spinal mobility [
5‐
8]. Tools available for clinical use include: plumb line, [
9] goniometry, [
10] fingertip to floor, [
11] flexicurve, [
12] tape measures, [
13] visual-photographs, [
14] and dual inclinometry. [
15] At present, there is no conclusive evidence to advocate one method over another [
16]. Radiographs are considered the gold standard for measuring absolute joint motion. [
4,
7] However, performing a radiograph on every patient with LBP is costly and time-consuming in a clinical setting. Furthermore, range of motion measurements often reflect physical impairment or functional mobility in which case external measures of the ROM are more directly applicable.
One of the methods that has been used to measure joint mobility in research and clinical practice is computerized dual inclinometry (CDI). Previous studies have addressed the validity and reliability of the CDI [
3,
7,
8,
17‐
19]. An examination by Nitschke et al. [
3] showed dual inclinometry had acceptable intra-rater reliability results (flexion ICC = 0.90, extension ICC = 0.71), but fair to poor inter-rater values (flexion ICC = 0.67, extension ICC = 0.35). In a systematic review on the validity of instruments used to measure lumbar ROM, three out of the four studies included were on dual inclinometry [
16]. The authors of that review concluded that there was little evidence to support current measures of lumbar ROM. The poor results were mostly attributed to differences between the raters, reflected by the accuracy and consistency of manual palpation of bony landmarks, and the handling of the inclinometer heads along the subject’s spine [
20]. Nonetheless, dual inclinometry is considered to be more valid tool than modified-modified Schobers (MMS) [
21] because the measurement results are expressed in degrees, which correlate with the angular motion of lumbar flexion and extension [
22].
According to guidelines set out by the American Medical Association (AMA) in its 4th and 5th edition, compensation entitlements for patients with low back pain were based in part, on the impairment of back movement [
23,
24]. It recommended the use of multiple measures including the CDI and the MMS. In the 6th edition however, ROM was removed from the list because of variability in results and lack of strong evidence supporting the validity and reliability of the ROM measures that are currently used [
25]. When making judgments on a patient’s level of impairment, particularly when his/her future financial compensations hang in the balance, reliable methods are essential for determining impairment in spinal motion.
When making routine clinical decisions, a variety of measurement characteristics are important to consider. Ideally tools and the associated procedures to use them are reliable, valid, cost-effective & easy-to-use [
7]. While a number of factors including rater skill, equipment limitations, patient selection and pain status may affect reliability of results, certainly landmarking is one of the major issues that affects the reliability and validity of joint motion measurements. Findings from previous reliability studies of the lumbar ROM measures suggest that palpation inaccuracies were the main source of error [
3,
7,
8,
18,
19,
26,
27]. On the other hand, studies on palpation skills of clinicians in palpating and identifying spinal levels indicate great variability in terms of reproducibility and repeatability between clinicians [
28‐
32]. Hence, one strategy for improving reliability is to study the impact of different landmarking techniques. The utility of a landmarking method will be based on producing accurate measurements with highest reliability with the least difficulty, time and cost. The purpose of this study is to examine the intra-rater and inter-rater reliability of lumbar flexion and extension measurements with a J-Tech dual inclinometer,
1 using three different landmarking methods.
Discussion
The results of the current study indicate that regardless of the landmarking method utilized (methods 1, 2 or 3), dual inclinometry had good intra-rater reliability and fair to good inter-rater reliability for lumbar flexion and extension measurements. This adds on to the credibility of using dual inclinometer because reliable objective measures are fundamental to making data-driven clinical decisions. This study demonstrates reliable methods of measuring lumbar ROM are available. Since, lack of such evidence resulted in ROM being excluded from the measures that are required to calculate impairment due to back pain [
25]. Since previous studies gave mixed messages on reliability of these measures, [
4,
8,
18,
26] we do not suggest these findings are sufficient to change that decision. This study proposed a method to improve reliability of spinal motion measurements and there is value in adopting consistent methods to allow for greater comparability.
Currently, there is no clear evidence about the relevance of lumbar ROM in low back pain. This is commonly seen in routine clinical practice. Symptomatic patients might not show impaired range of motion while symptomatic patients might present substantial impairment of lumbar movements. The following is a classic example of this observation. Quack and colleagues (2007) compared MRI findings to ROM tests of the lumbar spine and found no clear relationship between the changes observed on a MRI to the ROM tests of the lumbar spine [
36]. This is due to our very limited understanding of the relationship between impairment and function low back pain. Hence, studies on the relationship of spinal motion to function are needed to determine if this impairment should be considered important when evaluating impairment.
Nitschke et al. [
3] demonstrated poor inter and intra-rater reliability for measurements of thoracolumbar flexion, extension, lateral flexion and rotation using a J-Tech CDI with respect to reliability. On the contrary, the current study has shown that with appropriate land marking techniques the reliability was fair to good. The differences in reliability reported across studies could be due to positioning. Investigators in the Nitschke et al. study used directions provided in the AMA guide (2nd edition) which does not clearly define the location for the master unit; it defines it as “mid-sacrum” which could bring about substantial variation in identifying this reference point between assessors. On the other hand in the current study we used an easy-to- identify landmark, PSIS which even a novice assessor can identify easily.
An important and common source of error while measuring lumbar range of motion could be to do with the preparation of subjects. Two important aspects of subject preparation are: exposing the lower spine to improve landmarking and considering the flexibility of hamstrings which could reduce the movement in pelvis and consequently in the lumbar spine [
37]. In this study we exposed the lumbar spine by lifting and clipping the clothing. To improve flexibility of the hamstrings we made our subjects perform some practice repetitions which can increase the flexibility of the hamstrings considerably through ‘warm-up’ [
38]. It is also believed that hamstring stretching could have a considerable effect on the lumbar spine because of its pelvic attachment; this might increase pain in patients with LBP and limit the actual lumbar ROM during the actual test. To this end, we did not incorporate a complete hamstring stretching protocol which may have affected spinal motion and also we ensured that the stretches were performed within the limits of pain.
The technique of applying the dual inclinometer also has potential sources of error. Positioning of the MASTER and SLAVE heads of the dual inclinometer may be problematic, including the ability to maintain a constant pressure with the heads against the skin. Measuring lumbar extension can be particularly awkward due to the tendency for the heaping up and folding of skin. Also, with highly flexible subjects, it can be difficult to position the heads to prevent them from colliding. Improvement can be made by moving either one of the measurements heads slightly lateral to prevent collision. This would still provide an accurate measurement without the perfect alignment of the heads because the unit measures the angle between the intersecting planes created at the heads.
Despite the mentioned sources of error, this study has shown dual inclinometry to be a reliable tool for measuring lumbar flexion and extension. Although validity was not tested, dual inclinometry provides its measurements in degrees, which better represents the true spinal angular movement than the linear skin distraction method of MMS. Moreover the criterion validity of dual inclinometer has been tested against the gold standard of radiographic measurements in previous studies [
39,
40]. However, additional research is needed to confirm the validity of the dual inclinometry techniques mentioned in this study for measuring lumbar spinal movement.
There are many different landmarking techniques currently used by clinicians to locate a specific spinal level. However, the reliability between therapists to find the same spinal level tends to be problematic [
27]. Sources of error include the level of training of the therapist, how well the lumbar spine is exposed away from clothing, and the varying distances for individuals of the bony landmarks to the surface of the skin. However, studies have shown that even highly experienced clinicians have low reliability in accurate ascertainment of spinal level, suggesting that measurement strategies based on palpation have inherent limitations [
32]. For this reason and knowing that, most patients in this study fell into the 15 cm distance that is currently used in the MMS. A measurement strategy that utilizes this method may produce more reliable results when one considers the wide spectrum of potential testers that might use inclinometry for spinal evaluation. A limitation of this method is that it does not represent the same portion of the spine for people of different heights. Whether this means the measurement is less valid or less related to function in different populations or across people of different heights, needs further study. A limitation of this current study is its inability to calculate values for absolute reliability which would have made the study even more clinically relevant. We recommend future studies to determine Standard Error of Measurement (SEM) values to determine absolute reliability.
Conclusion
All three landmarking techniques have almost the same reliability. However, from a clinical standpoint, the PSIS to 15 cm cephalad method as used in the modified-modified Schobers test, is recommended, as it is the simplest to perform and aligns with routine clinical practice. Dual inclinometry can produce clinically reliable measurements provided a reliable landmarking strategy is employed.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.
The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Competing interests
None of the authors have any competing interests to declare.
Authors’ contributions
JM, VA, and JV were responsible for manuscript thought, writing and editing. KP and AK helped in the data collection. All authors read and approved the final manuscript.