Introduction

Attaining an acceptable quality of life (QOL) is considered by many to be the ultimate goal of rehabilitation after spinal cord injury (SCI).1, 2 During the past few decades, advances in medical care are enabling persons with SCI to survive the initial injury and to prolong their life expectancy post-SCI.3 The need for outcome measures assessing health and QOL after rehabilitation is, therefore, becoming increasingly important.4, 5, 6, 7, 8, 9 It is clear that simple outcomes-assessing function are insufficient in measuring rehabilitation after SCI7, 9 and in capturing the adaptation of perceptions and values in patients after SCI.7, 9 In fact, it has been suggested that high levels of QOL is synonymous with positive rehabilitation outcomes, and many agree that QOL should be measured in tandem with traditional outcomes assessing functional rehabilitation.9, 10 Such measurements provide different yet complimentary information that aid clinicians in their efforts to help those with SCI. Although an altered life is an inevitable outcome of SCI, literature shows that QOL after SCI is not uniformly worse, but rather a spectrum of recovery outcomes exist that range from QOL well below the general population to QOL that surpasses healthy population averages.9

QOL is a difficult construct to capture. Description of what constitutes the quality of someone's life is an important factor in our ability to assess, measure, and improve treatment outcomes and post-injury lifestyles. A clear definition of QOL has yet to emerge, which is due in part to a lack of consensus on a general definition of QOL.11 As such, our ability to accurately measure QOL among populations is limited. This is especially true with SCI populations.

Dijkers12 proposed a scheme for classifying QOL instruments whereby they are conceptualized from the perspective of an outsider, an ā€˜objectiveā€™ view, or an insider, a ā€˜subjectiveā€™ view. All QOL instruments include reactions to or evaluations of the characteristics of a person's life (achievements) in the context of their expectations about those achievements, either implicitly or explicitly.12 Therefore, the distinction whether the instrument is based on an ā€˜objectiveā€™ or ā€˜subjectiveā€™ view is determined by (1) whose expectations and evaluations are used and (2) which of the three (reactions/evaluations, achievements, expectations) are made explicit.

The objective approach to assessing QOL evaluates characteristics that can be impartially measured by an external appraiser.12 These types of instruments assume (1) that all individuals have the same domains that are important in their lives or same goals and (2) happiness or satisfaction in life is directly proportional to the degree to which an individual achieves these standards or goals.12 Most instruments assessing health status (also known as health-related QOL, HRQOL), are considered to be objective approaches.12 These types of instruments, however, are limited as they tend to overestimate the impact of health and underestimate the other non-medical aspects such as the individual's values and preferences.13, 14

The subjective approach to measuring QOL assumes that QOL can only be determined by the individual.12 Instruments developed using this approach consider an individual's emotions or feelings (happiness/affect) or evaluation (life satisfaction) in the context of their expectations and achievements.12 Life satisfaction can be further differentiated into satisfaction with overall QOL and satisfaction in specific domains (for example job relationships, health, etc.) comprising QOL.15

As improved QOL, be it subjective or objective, is indicative of the success of treatment programs or progress in the life of an SCI patient,9, 16 it should be routinely measured among SCI patients. Our purpose in this review, therefore, is to classify QOL instruments in use among SCI population as either objective or subjective, present evidence from the literature on the measurement properties of various QOL measures, and evaluate the properties against pre-established criteria. Such classification and evaluation will assist both clinicians and researchers select QOL measures appropriate to the context of their rehabilitation programs or research studies.

Materials and methods

Search strategy

A review of the QOL literature from 1986 to April 2009 was conducted. Primary data sources included the Pubmed, CINAHL, Embase, Medline, HaPI, Psycinfo, and Sportdiscus electronic databases.

The key word SCI and its related terms, paraplegia, quadriplegia, or tetraplegia, were used in conjunction with the terms validity, reliability, or responsiveness. To complete the search, these terms were then combined with the names and abbreviations of instruments used to assess the QOL in individuals, including the names of all measures, which met the criteria. After deleting non-relevant and duplicate papers, titles, key words, and objectives were examined by two investigators, and papers that did not specifically assess measurement properties of the outcome measures listed above with an intent to validate their usage, and papers with populations not entirely with SCI were eliminated. Eligibility and the measurement properties of the instruments were evaluated by three team members.

Inclusion criteria

To be included in this review, the instruments had to satisfy several requirements, which included (1) the primary purpose of the paper was to evaluate the measurement properties of a QOL instrument, that is Level one papers based on the classification of Kalpakjian et al.;17 (2) the sample included an SCI population, 18 years of age and older; (3) the data specific to SCI were reported; (4) the paper was published in a peer-reviewed journal; and (5) the paper was written in English.

Classifying the instruments

Instruments were classified as either primarily objective or subjective QOL measures as per Dijkers12 guidelines. Most authors clearly distinguish their instrument or definition of QOL as objective5, 16, 18, 19, 20 or subjective.1, 6, 9, 20, 21 Where no distinction was specified, instruments were classified according to the authors' definition of QOL7 and/or the item content and scoring model.22, 23 For example, instruments addressing HRQOL or the presence, severity, and impact of specific health factors relating to QOL were considered objective, and instruments measuring life satisfaction or using scoring relationships between satisfaction and importance were classified as subjective.

Evaluation criteria

Criteria for assessing the instruments' measurement properties are based on the Outcome Measures chapter in the Spinal Cord Injury Rehabilitation Evidence (SCIRE)24 systematic review, which are adapted from the Desirable Characteristics of Outcomes Research Measures for People with Disabilities defined by Andresen et al.25 Table 1 provides a description of the evaluation criteria.

Table 1 Criteria for instrument evaluation

In addition to the SCIRE and Andresen's criteria and definitions, we have included hypothesis testing to support instruments validity.26 We investigated factorial structure in more detail. When domains were not expected to cover similar constructs, we reversed the Andresen criteria, considering poor correlation scores (rā©½0.30) excellent, moderate correlation scores (0.29ā€“0.59) moderately discriminant, and high correlation scores (rā©¾0.60) poor discriminant evidence. In known-groups validity tests, little is known of these relationships in QOL, so known-group tests could not be assigned A, B, or C based on Andresen's criteria. Groups are defined differently and different statistical methods are used by each study's author. We simply report groups showing statistical differences in mean or total QOL scores.

When more than one paper provided validity or reliability values, a range of scores was assigned. Where more than one measurement was provided for reliability or validity within one domain (for example multiple item scores), a range was provided, and scores were assigned conservatively based on the lowest measured item.

Results

Fourteen Level one studies reporting on 13 QOL instruments have been investigated with SCI populations: eight objective instruments, including the Patient Reported Impact of Spasticity Measure (PRISM),22 the Quality of Well-being Scale (QWB),6 the Qualiveen,23 the Sickness Impact Profile (SIP68),27 the Short Form (SF)-36,28 SF-36V,19 SF-12,29 SF-6D,30 and five subjective instruments, including the Quality of Life Index (QLI),2, 21 the Quality of Life Profile for Adults with Physical Disabilities (QOLP-PD),7 the Satisfaction with Life Scale (SWLS),6, 31 the Sense of Well-being Index (SWBI),9, 32 and the World Health Organization Quality of LIfeā€“BREF scale (WHOQOL-BREF).1, 33

Table 2 presents instrument information, and Table 3 provides information on the studies included in this review. Reliability is presented in Table 4. Validity data is presented in Tables 5 and 6. Table 7 presents a summary of measurement property scores, including results of the factor analyses and hypotheses testing.

Table 2 QOL Instruments
Table 3 Summary of included papers
Table 4 Reliability
Table 5 Construct validityā€”factor analyses, item or subscale correlations
Table 6 Validity: multi-trait multi-method
Table 7 Summarya

Objective QOL instruments

The PRISM measures QOL relative to spasticity,22 with six domains addressing negative consequences of spasticity, and one, the positive impact. Negative PRISM QOL scores worsened in a significant manner with increasing severity of negative spasticity symptoms, as expected by the authors, and persons reporting that benefits of spasticity outweigh problems scored higher on the positive impact scale (P<0.001). However, greater negative interference did not correlate to higher negative impact scores. The PRISM domains correlated moderately to one another (r=0.45ā€“0.73), covering related topics such as need for assistance/positioning and social embarrassment, with the exception of the positive impact scale (r=0.07ā€“0.29). Factor analysis confirmed that the domains are addressing distinct facets of spasticity-related QOL.22

The QWB measures health status and well-being;34 it provides quality adjusted life years for health economic analyses. In persons with SCI, this generic measure did not correlate well with other QOL measures such as the SF-36 (r=0.044ā€“0.29),5 and reliability has not been established in an SCI population.

The Qualiveen measures the impact of urinary disorders on QOL as well as overall ratings of QOL in persons with SCI.23 Qualiveen QOL scores decreased as urinary problems increased. (P<0.0001ā€“0.01) The Qualiveen was originally developed in French for the SCI community. The instrument showed excellent reliability in the SCI population (Ī±=0.80ā€“0.85)23 (Table 4).

The SIP68 is derived from the original 136-question version SIP136.20, 35 The SIP68 showed high correlation with physical measurements such as the Barthel ADL index (r=āˆ’0.91ā€“āˆ’0.41).27, 36 However, Post et al.27 argued that, because SIP68 correlates highly with a measure of life satisfaction, it must be measuring a broader concept than health state and self-care ability. Internal consistency was high for the SIP68 (Ī±=0.8ā€“0.92), but the emotional stability domain showed consistently low reliability in the SCI population (Ī±=0.68).27

The SF-36 is a widely used HRQOL instrument. It has been translated and widely promoted by the International Quality Of Life Assessment project;37 translations have been developed for >60 countries. The SF-36 has been validated in multiple SCI studies.5, 16, 38, 39 Data strongly supports authors' hypotheses that the mental and physical components would correlate to similar domains on other QOL instruments,5, 38 but not to one another.16 Factor analysis has been conducted extensively in other populations; see http://www.sf-36.org. Reliability was moderate to high for the SF-36 (Ī±=0.72ā€“0.98),16, 38 except the general health item (interinterviewer ICC=0.41).The SF-36 was shortened to contain just 12 questions (SF-12) and only provide mental and physical subscores. The SF-12 has not been widely used and validated in the SCI population, but has shown expectedly high correlation with the SF-36 (r=0.78ā€“0.99).5, 38 Reliability was not examined in the SF-12.

The Veterans Health Administration version of SF-36 (SF-36V) modified the physical functioning domain for SCI populations. The modifications generally involved substituting activities more appropriate to SCI clients, such as ā€˜climbingā€¦ wheel chair rampsā€™ rather than stairs, or ā€˜getting up and down from a curb.ā€™19 The SF-36V supported the authors' hypothesis that it more accurately reflects SCI QOL, showing internal validity with the physical, but not the mental component score. It was necessary to alter the wording of the questions based on SCI consumer input more than anticipated, and mobility was rated as if respondents were using their assistive devices. Internal consistency was high (Ī±=0.90).19

A preference-based health measure, the SF-6D, has also been derived from the SF-36 and SF-12.40 Its usefulness was evaluated by assessing its responsiveness to urinary tract infection. It could discriminate between SCI and non-SCI respondents; however, floor effects were seen in the physical domain (37%), and reliability was not examined.18

Subjective QOL instruments

Ferrans' and Powers' QLI attempts to cover all facets of QOL.41, 42 The revised SCI version is comprised of 37 items making up two sub-sections: one measuring satisfaction with various life aspects and the other measuring the importance of those aspects. Although the language of the SCI version of the instrument was well received by SCI clients,21 the domain structure did not fit with subject interpretations.21 Further, the correlations between the QLI total scores and each of the satisfaction (r=0.99)2 and importance (r=0.43)2 sub-section scores were contrary to the authors' hypothesis that significant relationships of equal magnitude between the total score and section scores would exist. Rather, the scoring relationships suggest that the satisfaction and importance ratings contribute to the overall score, but in unequal amounts.

As expected, the QLI scores correlated to both community integration (participation) (r=āˆ’0.65) and self-esteem (r=0.61), but not to body functions and structure or the level of activity. The instrument was unexpectedly not correlated to locus of control (r=āˆ’0.02), and reliability was not examined.2

The QOLP-PD takes an extremely subjective approach to QOL assessment. It holds that QOL elements are common to most human beings, with and without disabilities, but that adults with physical disabilities may address life issues somewhat differently. Reliability (Ī±=0.84ā€“0.98) and construct validity (r=0.63ā€“0.88) are both excellent.7

The SWLS contains five statements about life satisfaction: three set in the present, one in the past, and one in future. It is ā€˜one of the few existing instruments that measure life satisfaction as a global entity, rather than requiring subjects to rate their satisfaction with each of a number of domains of life.ā€™5 In accordance with the authors' hypotheses, impairment (level of injury), number of hospitalizations, and the number of pressure ulcers related to life satisfaction (P<0.05), but completeness of injury did not. Two weeks testā€“retest was moderate to low (ICC=0.39ā€“0.65).5 The SWLS reflects general life satisfaction, but is not detailed as to specific aspects of QOL.

The SWBI was developed to assess QOL in disabled populations in vocational rehabilitation programs. The authors argue that to achieve maximum potential in the rehabilitated worker, rehabilitation programs must assess overall QOL, both subjective and objective, as ā€˜research indicates that there is a dynamic interaction between quality of work life and QOL in general.ā€™9 The authors correctly hypothesized that the revised SWBI for SCI would have similar measurement properties to the original, would correlate well with the WHOQOL-BREF,(r=0.45ā€“0.75), and would show similar patterns of known-group validity to other QOL instrument in the literature.9

The WHOQOL assessment was developed as an international effort to create a cross-cultural, cross-population QOL measure based on a generic theoretical model of QOL.33, 43 The short version, the WHOQOL-BREF, has been assessed by multiple authors.1, 9, 38, 44 Population-specific versions exist, such as the Taiwan/Hong Kong version with two additional questions of local cultural importance.38 Although sometimes considered an HRQOL instrument, many questions rate individual subjective satisfaction, thus covering both objective and subjective components of QOL. The WHOQOL-BREF generally supported authors' hypotheses that it would show item-domain validity (r=0.41ā€“0.77),1 correlate in appropriate domains with other QOL measures such as SF-36 (rs=0.33ā€“0.78),38, 45 and differentiate between sub-groups such as employment, self-care ability, age, marital status, and level of injury.38 Reliability was moderate to high (Ī±=0.74ā€“0.87), with the exception of the social relationships domain, which was consistently lower than the other domains (0.54).1, 38

Discussion

Definition of QOL

It is evident that there are currently as many definitions of QOL as there are instruments measuring it. We have examined existing literature on 13 QOL instruments that have been investigated in SCI populations. By assembling these instruments for this population, conclusions can be drawn on the broad themes found in QOL investigation, such as subjective versus objective measurement, approaches to QOL definition, and lack of data on cause and effect in SCI QOL.

Measurement properties of instruments

Generally, sufficient investigation has been carried out to validate the use of only a few of the QOL instruments with SCI populations. The WHOQOL-BREF and SF-36 have been extensively used and validated. In the SF-36, concern has been expressed in the SCI community regarding the appropriateness of the use of the term ā€˜walkingā€™ in the mobility-related physical questions of the SF-36.5, 19 This concern has been addressed in the SF-36V, which requires further investigation before wide-spread use with SCI populations. The SF-12 and SF-6D also exhibit certain SCI bias and floor effects5, 18 that could easily be removed if they were derived from the SF-36V, as opposed to the original SF-36.

The Qualiveen and PRISM have performed well in all measurement properties,22, 23 likely because of the clarity of results obtained from objective QOL measures, which can be strongly and concretely correlated to health state. Both could use further investigation in SCI populations.

For objective QOL, the SIP68, and for subjective, the QOLP-PD and the SWBI all have very positive supportive evidence and face validity, appropriate for their continued use.7, 9, 20 Again, all three require much more investigation, especially those measuring subjective QOL, which are more difficult to validate, given the theoretical nature of the underlying construct.

Disappointing results were obtained for the QLI, the QWB, and the SWLS.2, 5, 6 The QLI likely requires alteration to the structure of the importance scale scoring, and potentially domain rearrangement before wide-spread use in SCI populations. The QWB preference-weighted scoring approach, weighing certain symptoms more heavily than others, is founded on data from the general population. It measures HRQOL from a ā€˜decision theory approach,ā€™ in which differently weighted symptoms contribute to one final score, which can be applied to any population.46 This is distinct from the approaches of other QOL instruments, in which a profile is generated composed of measurements in several domains; the QWB correlated only moderately to the physical domain of the psychometric SF-36 instrument.5 The QWB tells health-care providers very little about the HRQOL of their individual SCI client, as it is intended to offer inter-population comparison, with less sensitivity to clinical change.46 With respect to the SWLS, it is possible that the brevity and generality of its approach, as well as the wording of several of the questions (for example ā€˜If I could live my life over, I would change almost nothingā€™), weaken its applicability with SCI population.

Subjective QOL issues

In the field of SCI QOL research, there is controversy over the appropriateness of objective QOL measurements. This is due to the assumptions that these measurements impose on individuals with different life circumstances and ability levels, namely, that all individuals prioritize common life domains and goals, especially those related to measureable outcomes such as financial gain or physical strength, and that success and achievement in these domains and goals are directly proportional to happiness and life satisfaction. Not only this, but it had been found that injury-specific variables such as level or severity of injury do not always affect objective QOL.5, 18 As an alternative, subjective QOL purports to measure outcomes through the point of view of the individual that cannot necessarily be broadly generalized. This issue with the subjective instruments is addressed in various ways. In the QOLP-PD development, Renwick et al.6 suggested that all individuals value similar elements of life, but may address or achieve these differently.7 In this and several other subjective QOL instruments, satisfaction with items scores are weighted by scoring the importance of these items to the individual.2, 7 Alternatively, instruments such as the SWLS address QOL without splitting it into domains or questioning specific aspects of life, resulting in a measurement of overall satisfaction with life only.

Although most objective QOL measures have existed for much longer, and thus have much more published data and utilization, many authors now suggest the use of subjective QOL measures as more appropriate in individuals with SCI.6, 7, 9 Scores in objective QOL domains are found to be lower in SCI than in the general population,1, 18 whereas some domains of subjective QOL are equal to the general population, and some are higher.47 Item bias scores for SCI were higher in the subjective instruments, whereas measurement model scores were higher in the objective instruments. These trends would seem to reflect the differences in QOL models, in which subjective instruments are more highly attuned to the individual's situation, and subjective QOL can vary greatly between individuals. Although objective QOL is more easily measured, such instruments have the potential to miss many aspects of the individual's life.

Clearly, objective and subjective QOL instruments are measuring different constructs, and thus have different conceptualizations of QOL. It may be that, such as the mental and physical component scores of the SF-36, both contribute equally, rather than solely, to the overall QOL score. Or, it could be that, similar to satisfaction and importance scores in many of the instruments, one should be weighted against the other. A mixture of subjective and objective approaches could resolve the existing debate on which is the more appropriate approach to QOL research, or, as Fuhrer stated, ā€˜measures of subjective well-being should be viewed as being complementary with objective indices of people's functioning and life status.ā€™10

Conclusions

Overall, there is a wealth of data on SCI QOL. There are numerous promising instruments to measure QOL. Unfortunately, because of a lack of consistent results and definitions, our knowledge pertaining to the QOL among individuals with SCI is still limited. It is important to keep in mind that we are attempting to perform a comparison of different measurement instruments of QOL that are based on different definitions of QOL. A more concrete, universal definition of QOL is required, as is further investigation on the causative and related effects of different aspects of the SCI client's life on QOL. Known-group associations that are examined in a systematic and consistent manner between studies and populations would provide valuable information. With a clear definition of QOL, this type of investigation could be undertaken without the excessive contradictions that exist in the literature at present.

In terms of the instruments included in this review, the disease state-specific HRQOL Qualiveen PRISM, and SIP68 performed very well. The longer, more varied HRQOL SF-36 and subjective WHOQOL-BREF performed moderately well in all areas, with the added bonus of their wide-spread use and the wealth of evidence supporting their outcomes. Several newer subjective QOL instruments also performed very well, but will need further investigation, including the SWBI and the QOLP-PD. The remaining instruments lacked either SCI-appropriate structure, language, or investigation.

Pending further investigation of some of the very promising, but recently, developed instruments, we would recommend the use of the WHOQOL-BREF, as it addresses both objective and subjective QOL, is based on an international effort to clearly define QOL, and has been well studied in SCI populations with acceptable results. The SF-36V or SIP68 as measures of HRQOL, or the QOLP-PD or the SWBI as measures of subjective QOL, also show promise, but require further investigation before using with confidence. In the context of clinical practice or research, investigators must choose tools based on practice/purpose, and explicitly state their concept of the definition of QOL.