Introduction

The frequency of spastic paralysis after spinal cord injury (SCI) has been observed to be 65–78% of individuals with traumatic SCI.1, 2, 3, 4 Greater than a quarter (that is, 28–43%) identify spasticity as a significant problem1, 2, 4, 5 and almost one half (that is, 43–49%) receive pharmacological treatment for this secondary condition.3, 4 Although muscle spasms and/or spasticity may be of some benefit in the maintenance of muscle tone, muscle mass and blood circulation, it is commonly cited as one of the most problematic prevalent secondary conditions after SCI.5, 6, 7 Typically, spasticity can interfere with various body functions such as ambulation and hand or upper limb control as well as bowel and bladder function. It can also limit range of motion, cause pain and/or cause additional stress to muscles and joints.1, 2, 3, 7 It has been reported to significantly impact activities of daily living, limiting workplace participation, as well as adding to the cost of medications and attendant care.1, 3, 7, 8

Spasticity was originally described as ‘a motor disorder, characterized by a velocity-dependent increase in tonic stretch reflexes (muscle tone) with exaggerated tendon jerks, resulting from hyper-excitability of the stretch reflex as one component of the upper motor neurone syndrome’.9 Others have proposed new definitions, including: description of spasticity as ‘disordered sensori-motor control, resulting from an upper motor neurone lesion, presenting as intermittent or sustained involuntary activation of muscle’,10 or ‘characterization of spasticity comprising intrinsic tonic, intrinsic phasic and extrinsic components’.11 These definitions are more inclusive of clinical signs and symptoms of ‘spasticity’ and better reflect its multidimensional nature.12, 13, 14

Other considerations involve understanding the pattern of spasticity and how it affects functional activities, either positive or negative. In addition, changes in patterns of spasticity may indicate hidden and/or developing clinical complications such as urinary tract infections, skin problems (pressure sores) and other medical issues.13 The multidimensional and functional aspects of spasticity are best characterized within a suitable framework such as the International Classification of Functioning, Disability and Health (ICF).15 The ICF is an international, recognized classification system, which considers measurement and other aspects of health according to domains of body structure and function as well as Activity and Participation restrictions within bodily functions, individual activities and societal interactions.15

Finally, it is important, when selecting the best outcome measurement tool, to consider matching the intended purpose of the data with the utility of potential tools. For example, a clinician may be interested in characterizing the nature of the spasticity in a patient to guide treatment decision-making, whereas a researcher may focus on tools that permit the determination of significant differences in a group of patients in response to an intervention. Feinstein et al.16 identified six uses for outcome data including determining compensation, predicting prognosis, planning placement, estimating care requirements, assisting with the choice among specific types of care and determining change in health status secondary to intervention.

The present review is focused on assessing the various outcome measures that have been employed clinically to assess SCI-related spasticity within an ICF framework. Specifically, the psychometric properties (that is, reliability, validity, responsiveness) and clinical utility of spasticity outcome measurement tools were evaluated. The primary objective is to provide a guide for clinicians, with a summary and assessment of the available psychometric evidence, in order to assist the selection of the most appropriate tools for the measurement of spasticity. Thorough spasticity assessment requires an evaluation over the entire range of each joint to be tested. Similarly, selection of the optimal tools to measure spasticity requires consideration of the various issues noted above, each involving a range of alternatives. Secondarily, the summary of available evidence should also assist in the identification of outcome measurement tool gaps that need to be developed to measure more comprehensively the multiple aspects of spasticity secondary to SCI.

Methods

This review targeted clinically applied tools for which there was at minimum one peer-reviewed study that examined psychometric properties (that is, reliability, validity or responsiveness) using an SCI population. Pubmed, CINAHL and EMBASE databases were searched (1986 to January 2007) to locate papers reporting on SCI spasticity outcome measures with psychometric data. Additional articles were identified by hand-searching the references of papers obtained from the electronic search. The key word ‘spinal cord injury’ and variants were used across each of the databases in combination with the following terms: validity, reliability, inter-rater reliability, intra-rater reliability, test–retest reliability, reproducibility, responsiveness, sensitivity to change, evidence-based medicine, outcome measures, clinical assessment tools, scales and measures.

Psychometric data were extracted from papers reporting findings including reliability, validity and responsiveness in addition to information pertaining to clinical utility such as interpretability, acceptability and feasibility. Following this, specific findings from individual articles were evaluated according to the established criteria put forth by the Health Technology Assessment (HTA) programme17 and other sources.18 Additional methodological details are available at http://www.icord.org/scire/pdf/SCIRE_CH22.pdf.19

Results

The present review identified 6 out of 66 SCI-related spasticity outcome measurement tools that had been tested psychometrically.19 These included the Ashworth and Modified Ashworth Scale (AS and MAS) and Penn Spasm Frequency Scale (PSFS), which are applied frequently in routine clinical SCI settings.20 The Visual Analogue Scale (VAS), the Spinal Cord Assessment Tool for Spastic reflexes (SCATS) and the Wartenberg Pendulum Test (WPT) were also included in the analysis as they met the inclusion criteria of having at least one article addressing psychometric properties and having been categorized as clinical tools by various investigators.12, 21, 22, 23 Table 1 outlines the procedural attributes for each tool, including the specific construct being measured, test procedures, scoring methods and the administration time. Further information about the clinical utility for each of these tools is summarized in Table 2.

Table 1 Attributes of SCI spasticity outcome measures—procedural
Table 2 Attributes of SCI spasticity outcome measures—clinical utility

Ashworth and Modified Ashworth Scale

The Ashworth measure of spasticity was developed as a simple clinical classification to assess the antispastic effects of carisoprodol in multiple sclerosis.24 It is a five-point nominal scale focusing on the subjective clinical assessment of tone. For the MAS, an additional grade was added (1+) to enhance sensitivity to accommodate hemiplegic patients who graded typically at the lower end of the scale, more specifically to measure elbow flexor spasticity in patients with multiple sclerosis.25 These measures have since been adopted for assessing spasticity in a variety of indications including SCI,26 although it is important to recognize that there are differences in the characteristics of spasticity with different etiologies.27 The AS and MAS are simple to administer, are well tolerated, require no specialized equipment, but only address the velocity dependent aspect of spasticity across a single joint. The scale determines the amount of resistance felt during the passive displacement of a limb, but it does not account accurately for the dependence of the resistance to the velocity of the stretch, which can be highly variable from examiner to examiner.28 The psychometric properties for 10 AS and 4 MAS studies in SCI are summarized in Table 3. The AS was found to have adequate inter-rater reliability29 for specific lower limb muscles, such as the adductors and flexors. However, inter-rater reliability was found to be poor for the plantar flexors.26

Table 3 SCI articles reviewed for psychometric properties of AS and MAS and PSFS

The AS correlates most strongly with the patellar tendon tap13 and less so with the VAS.30 Higher correlations were found between the MAS and VAS.23 Sherwood et al.31 obtained correlations of the AS and MAS with EMG and suggested that AS does not discriminate well at the low end of spasticity (AS score 0 vs 1) and Skold et al.32 reported increased correlation values when subjects with MAS grades of 0 were excluded. The AS also correlates well with SCATS,12 which was developed to measure, more comprehensively, the different types and severity of spastic reflexes in SCI.

Preliminary evidence of responsiveness has been demonstrated in various interventional studies as pre- and post-AS measurements reflecting declining spasticity progressed in the same direction as was also assessed by the PSFS.33, 34, 35, 36

The AS and MAS were the spasticity outcome measures identified as those most commonly used37 and psychometrically assessed in SCI. Overall, the AS and MAS have good clinical utility and increased standardization and training for test administration is needed to maximize reliability. Target muscle group and severity of spasticity should be considerations in the selection between AS and MAS. Validity for both AS and MAS has only been partially established and even less so for responsiveness.

Penn Spasm Frequency Scale

Penn et al.36 originally defined a five-point spasm frequency scale, which was later modified by Priebe et al.,13 and referred to as the modified PSFS. The modified PSFS is a two-component self-report scale to augment clinical ratings of spasticity and provide a more comprehensive understanding of an individual's spasticity status. The first component is a five-point scale assessing the frequency with which spasms occur ranging from ‘0=no spasms’ to ‘4=spontaneous spasms occurring more than 10 times per hour’. The second component is a three-point scale assessing the severity of spasms (SSS) ranging from ‘1=mild’ to ‘3=severe’. The second component is not answered if the person indicates that they have no spasms in part 1. This measure is simple and appropriate to the clinical setting and does not require specialized equipment to perform (Tables 1 and 2). A total of six studies of PSFS use in SCI were included in this review and their psychometric results are summarized in Table 3.

Reliability has not been established for PSFS in SCI. Validity for the PSFS has been partially established through correlations with other clinical tools, such as the AS13, 33, 35, 36 and the SCATS.12 The PSFS correlated only moderately with routine clinical examination, suggesting that the elements of spasticity evaluated in the physical examination do not represent entirely what is important to persons with SCI spasticity.13 By utilizing a standard time specification for the self-report, Lechner et al.38 established that the ‘present’ spasm severity self-report (that is, rating of spasticity immediately upon Ashworth testing completion) correlated modestly to AS, whereas the ‘in general’ spasm severity self-report (that is, questionnaire rating of effects of daily life activities) correlated poorly to AS. In addition, the PSFS was found to correlate highest with the SCATS clonus measure as compared to the flexor and extensor spasm components of SCATS, suggesting that the role of clonus represents the client's highest perception of spasticity.12

Responsiveness over time has been exhibited in several studies employing the PSFS, within a test battery to assess spasticity-treatment interventions,33, 34, 35 which have been limited to demonstrations of significant differences but not with more robust psychometric testing methods described earlier.

The PSFS has good clinical utility and adds the patient's perspective, provided that standardization of the time of day and time frame (that is, present or general) for the report is established. Each of the psychometric properties requires further investigation with validity and responsiveness having been preliminarily examined.

VAS to assess self-rated quantification of spasticity

Another approach of patient self-reporting involves the VAS, which is a graphical scale that allows the patient to select the degree in which a construct of interest is graded from one extreme to the opposite. In general, VAS is a simple and quick method to establish a baseline and track progress after interventions.23 In the present review, two studies each asked patients to rate their spasticity from ‘no spasticity’ to ‘most imaginable spasticity’ either within the previous hour (VASH)23 or after completion of a specific test activity (VAS SA).38

Convergent validity was demonstrated with significant correlations between VASH and MAS.23 However, it must be noted that this validation cannot be applied generally to all self-reports recorded using VAS. For example, Lechner et al.38 found lower correlations between VASSA and MAS, likely because patients scored their spasticity resulting from a specific activity rather than the general spasticity experienced over the previous hour. Furthermore, time of day for spasticity assessments seems to be more important in patients with cervical lesions due to more pronounced intrinsic diurnal spasticity fluctuations.23 As with the PSFS, the need to standardize the timing of measurements is important.

Spinal cord Assessment Tool for Spastic Reflexes

SCI reflex hyperexcitability is described frequently as including clonus and, flexor and extensor spasms. SCATS was developed by Benz et al.12 to measure SCI spasms and spastic hypertonia. SCATS is simple to administer as it is comprised of elements common to a standard neurological examination of the lower extremities. Despite simplicity, SCATS has yet to be adopted widely and only a single study has been included in this review. Initial validation of this new SCI spasticity outcome measure was accomplished via correlations with kinematic and electromyographic recordings.12 The SCATS extensor spasm score correlated highly with these measures in hip and knee flexors (r=0.98 and 0.88, respectively) but less so for the ankle plantar flexors (r=0.61). The SCATS flexor spasms and clonus scores correlated well with Ashworth scores but only the SCATS clonus score correlated significantly with PSFS (r=0.59). Benz et al.12 noted that the SCATS provided additional information in comparison to the AS and MAS in assessing multijoint spasticity, whereas the AS and MAS are limited to spasticity assessment over a single joint. Further testing for reliability and responsiveness of SCATS is required.

Wartenberg pendulum test

From a simple and reliable clinical test of Parkinson lower limb hypertonia39 to quantification of similar changes secondary to upper motor neuron disorders,40 the pendulum test has been used for many years across indications to assess tone. The pendulum score is calculated during the gravity induced pendulum-like movement of the lower limb, as the ratio of joint angles measured by goniometers or computerized video motion analysis. Although the WPT is well tolerated by patients, it is not used widely in SCI spasticity assessment, likely due to the requirement for specialized equipment and post-assessment analysis. Validity and responsiveness have not been well established for WPT in SCI spasticity other than a single report.22 Although significant correlations between AS and the amplitude of first swing of the WPT were moderate at baseline, the correlation was higher during treatment when tizanidine was shown to decrease significantly muscle tone (P<0.0001) from baseline. Significant treatment effects were found for tizanidine when the AS and the WPT were used as spasticity assessments, which provides preliminary evidence for responsiveness of WPT.22

The technical requirements for the WPT will likely limit widespread use. However, it is interesting to note that the WPT was used alongside the AS to assess spasticity in the only published multicentre RCT that led to the regulatory approval of an antispasmodic for SCI.22 However, if the use of WPT is planned for future trials, further psychometric validation in SCI is recommended.

Discussion

The primary purpose of this review was to summarize and interpret the available psychometric data for SCI spasticity outcome measurement tools in current use. We have identified six available tools referenced in the current SCI spasticity intervention literature37 having partial validation (for example AS, MAS, PSFS, VAS, SCATS and WPT). All six measurements have good clinical utility with the exception of the WPT, which requires additional equipment and data analysis. There is a need to provide further evidence of reliability, validity and responsiveness for all these measures in order that they can be used with confidence. In particular, investigations of responsiveness are universally lacking. The AS and MAS have the largest body of available psychometric data, but the identified limitations, especially with respect to validity and responsiveness for these measures in specific muscle groups, need to be addressed in further studies. SCATS has only been preliminarily assessed in one study, but the advantage of this clinically utilitarian tool warrants further investigation since this one tool will measure three distinct components of lower extremity spasticity.

Further data are also required for the inclusion of the PSFS in a spasticity assessment battery, especially in light of the importance for reflecting the patient's perspective on spasticity. Psychometric properties of PSFS have only been established in part, especially for responsiveness. Stronger evidence of responsiveness would require such techniques as receiver operator characteristic (ROC) curve analysis, the calculation of standardized effect sizes, standardized response means and analysis of change scores or relative efficiencies. Reporting time frame has been identified as an important consideration since immediate (that is, investigator-evoked spasticity) and general (that is, spasticity related to activities of daily living) spasticity do not correlate.38

The simplicity of VAS as a scoring method of the patient's perception of spasticity is useful with respect to clinical utility, but it is important to note the significant differences in responses are highly dependent on the core question posed in each instance of VAS. As with the PSFS, VAS scoring of general spasticity experienced over the previous hour will likely differ from the VAS scoring of spasticity experienced after a specific activity. The flexibility of VAS is offset by the need to validate each unique spasticity-related question that the patient is asked.

Use of WPT is simple to administer, objective and quantifiable.21 Since AS or MAS have limitations at the lower range of spasticity,4, 26, 31 it would seem plausible to suggest WPT as a substitute measure for potential investigation. However, the need for specialized equipment and post-test data analysis, along with the observation that WPT is not effective if the limb is too rigid to produce true oscillations,20 may prevent widespread use of this tool. As well, WPT has limited application in SCI spasticity, as it can only be examined in one muscle group (that is, quadriceps).

As noted previously, when selecting the optimal outcome measurement tools for spasticity assessment, one must consider the range of features of spasticity encompassed by the various definitions of spasticity. It has been suggested that no single outcome measure can capture the multidimensional nature of spasticity reflected over several definitions and represented by velocity dependency, frequency, severity, subclinical conditions, tonic spasticity (tone), phasic spasticity (hyperreflexia, clonus) and involuntary muscle spasms and so on.13, 14 Focusing on one or two spasticity outcome measures may cause authors to over- or under-represent the magnitude of treatment effects.13 Priebe14 has purported that spasticity is measured best by a battery of tests that recognize the many variables of spasticity including conditions, which are subthreshold for detection, as well as the patient's perspective and/or level of fatigue.

The degree to which a tool correlates with other tools that measure, theoretically, the same construct provides evidence for the convergent validity of that tool and several results have shown varying convergent validity depending on the specific tool and the circumstances under which it was administered (for example, body location, time). This suggests that different clinical scales are likely measuring different aspects within the spectrum of spasticity.13 The specific test battery employed by Priebe et al.13, 14 included PSFS and AS, as well as Interference with Function, Painful Spasm and clonus. Priebe et al. also performed neurological tests such as deep tendon reflexes and plantar stimulation response. These measures cover, generally, the clinically important and distinct components of spasticity: tonic spasticity (tone), phasic spasticity (hyperreflexia, clonus) and involuntary muscle spasms.

A second consideration for selecting spasticity outcome measurement tools is they must assess adequately the range of functional activities that spasticity may impact. The ICF classification system provides a suitable framework, which should reflect the range of these functional activities. All of the measures included in the present review, and the vast majority of tools that have been employed in the literature reflect assessment of spasticity under the ICF category of body structure and function. One form of a patient self-report measurement (that is, VASSA) was used to measure spasticity resulting from a specific activity (that is, potentially classified under ICF category of activity) and was only evaluated with respect to convergent validity in relation to MAS. No psychometric data were available for outcome measures targeted at measuring spasticity with respect to an individual's level of societal participation (that is, ICF category of participation). Body structure and function, activity, and participation are all important factors for predicting the success of spasticity interventions and it is apparent that current spasticity measurement tools are focused on the former but are lacking for the latter. In addition to the need to measure the different aspects of functional (body) spasticity, we suggest it is equally important to track the impact of spasticity on the other ICF categories (activity and participation), culminating in the impact on quality of life for the individual living with SCI.

Recent analysis of SCI patient perspective domains (that is, physical activity, emotional, economic, interpersonal, management and community access) underscores the lack of focus on what matters to the patient when assessing the success of spasticity-related interventions.41 The balance of controlling spasticity versus total suppression of spasticity is measured best from the individualized patient perspective. This gap identifies a significant need to develop a valid battery of outcome measures that also assess the impact of spasticity in limiting activity and participation. Priebe et al.13, 14 have reported preliminary psychometric data on Interference with Function and Painful Spasm scales within their recommended test battery. The Interference with Function, along with PSFS and Painful Spasm scales, was found to correlate weakly with clinical assessments and modestly with each other. Further investigation is warranted for the Interference with Function and Painful Spasm scales, as measures of individual activities that are impacted by spasticity after SCI.

Without understanding the impact on activity and participation limitations, SCI spasticity interventions cannot reflect adequately a treatment benefit through clinical assessments at the body function level alone. Although there are certainly some SCI-specific measures that examine functional activity and participation limitations (for example, Craig Handicap Assessment and Recording Technique, CHART and Spinal Cord Independence Measure, SCIM), as well as other generic measures that have been validated partially for SCI, which address activity and participation (for example, FIM, SIP, SF-36), there are no measures that address the specific impact of spasticity in limiting activity or participation. One approach that may fill this void is the recent introduction of the SCI Spasticity Evaluation Tool (SCI-SET).42 At the time of paper submission, information about the SCI-SET was ‘in press’ and therefore was not included in the Results section as done for the other tools. The SCI-SET is a 35-item, 7-day recall self-report scale that has been designed to assess the impact of spasticity on various activities of daily living to issues of social participation. It incorporates a Likert-like scale with anchors of −3 (extremely problematic) to +3 (extremely helpful) in asking patients how spasticity symptoms have affected their lives. Initial developmental testing for the SCI-SET has demonstrated excellent internal consistency, test–retest reliability as well as face and convergent (construct) validity. Average time for completion was 6.8 min, although more testing is required to more fully assess clinical utility, especially given the number of items, the need for post-test analysis and the 7-day recall aspect. The features of the SCI-SET also address the remaining considerations noted earlier and specifically on gaps that exist in the current armament of available tools (that is, assessing range of effects from negative to positive and use as a self-report tool that considers the patients perspective).

As researchers and clinicians, we should aim to assess the influence of new antispasmodic interventions by considering the multidimensional nature of spasticity, thereby making a test battery a suitable choice. Furthermore, an appropriate battery of reliable measures should consider clinical utility, be validated specifically for the construct of interest and include measurements of the impact on activity and participation as well as quality of life. For use in clinical trials as a primary outcome measure, much effort would be required to appropriately configure the tool content and scoring so as to reflect the multidimensional nature of spasticity and satisfy both clinical and regulatory requirements. Currently, the test battery recommended by Priebe et al.14 is a suitable starting point for most evaluations, but this should be augmented by additional patient-focused tools, such as SCI-SET.