Introduction

Spasticity is a long-term complication of spinal cord injuries (SCIs). The commonly accepted definition of spasticity is ‘a motor disorder characterised by a velocity-dependent increase in tonic stretch reflexes with exaggerated tendon jerks, resulting from hyperexcitability of the stretch reflex’ (Lance,1 p 485). However, this definition is too narrow for patients with SCI.2 The Ability Network, which is an international initiative organised to optimise the management of disabling spasticity following SCI, recommends adoption of the definition proposed by Pandyan: ‘disordered sensorimotor control, resulting from an upper motor neuron lesion, presenting as intermittent or sustained involuntary activation of muscles’ (Pandyan et al.,3 p 5; Burns4). Appropriate diagnosis and assessment of spasticity are recommended for the management of disabling spasticity.

The frequency of spasticity observed among people with SCI varies from 65% to 78% in different studies.5, 6, 7 Self-reported data reveal that 53% of participants with SCI report spasticity as a significant problem.8 Spasticity may be of some benefit in the maintenance of muscle tone; however, it can interfere with daily activities, such as transfers, ambulation and stable body position. A proper evaluation of spasticity is necessary for making treatment decisions.

The Ashworth Scale and Modified Ashworth Scale (MAS) are frequently used clinical scales that assess spasticity. Five studies that have assessed the reliability of the MAS of lower extremities among people with SCI9, 10, 11, 12, 13 have presented inconsistent results; so reassessment remains necessary for the MAS for the lower extremity after SCI.

The Tardieu Scale (TS), which quantifies muscle tone at specified velocities, has been suggested as superior to the Ashworth Scale for the assessment of neural versus peripheral contributions of spasticity.14, 15, 16 The Tardieu Scale, developed by Tardieu et al.17 in 1954, was subsequently modified by Held and Peierrot-Deseilligny18 to comprise quantitative joint angle measurements taken at three speeds of passive movements—very slow, a passive fall of the limb under the influence of gravity and as fast as possible. In 1999, Boyd and Graham19 further modified the scale, by standardising the conditions of limb placement and the speeds as slow and fast, to produce the version presently known as the Modified Tardieu Scale (MTS). They recommended an initial speed as slow as possible and a high speed as fast as possible. The relative difference between the slow and fast speeds of passive stretch then determined the dynamic component of the muscle contracture.

A review of the published literature indicates that both the Ashworth Scale and the TS have equivocal reliability, although the TS has superior validity and reliability over the Ashworth Scale.14, 15, 16, 20, 21, 22, 23 The reliability of the MTS has been established by extensive studies conducted in children with cerebral palsy and in adults with spasticity of cerebral origin. Most studies on the reliability of the TS have involved investigations across a range of upper limb muscles. Ben-Shabat et al.23 investigated the intra- and inter-rater reliability of the MTS for the assessment of lower limb spasticity in adults with neurological injuries. They assessed 30 patients but only 3 of those patients had an SCI. No studies have yet utilised a greater number of participants to evaluate the reliability of the MTS for the assessment of lower limb spasticity in SCI.

We aimed to investigate the reliability of the MAS and the MTS for the assessment of lower limb spasticity in people with SCI. We also aimed to compare the reliability of the MAS and the MTS in adults with spasticity due to SCI.

Methods

Data collection

Sixty-five people with SCI, aged 18−88 years, were screened for inclusion from the inpatient rehabilitation clinics of two state hospitals. Six were excluded (3 due to early discharge; 3 due to infection) and 1 declined to be involved, leaving a sample of 58 participants.

People were eligible for inclusion if they had an American Spinal Injury Association (ASIA) Impairment Scale Grades of A–D with spasticity and were at least 6 months after injury. All participants’ medication was stable with no initiation or dose change of oral antispasticity drug treatment within 30 days, and no botulinum toxin injections administered <90 days before the assessment. People were excluded if they had multiple central nervous system lesion sites, significant complications that affect spasticity (such as decubitus ulcers, heterotopic ossification, urinary tract infections and any other infections), known history of joint pain and inability to give informed consent. Informed consent was obtained, and all procedures were conducted in accordance with the Helsinki Declaration of 1975 and approved by the local Institutional Clinical Research Ethics Committee (Fatih Sultan Mehmet Education and Research Hospital). We further certify that all applicable institutional and governmental regulations concerning the ethical use of human volunteers were followed during the course of this research.

Instruments

The MAS is an easy-to-administer tool, which has been adopted for assessing spasticity in a variety of indications, including SCI.9, 24 It is a five-point ordinal scale that subjectively assesses muscle tone. The MAS scores were collected from the right hip adductor and extensor muscles, right knee extensor and flexor muscles and right plantar flexor muscles of the participants. Following the recommendations of Pandyan et al.,25 all participants were assessed while lying supine, with the upper limbs parallel to the trunk in a neutral position and with the lower limbs parallel to one another. One exception was the assessment of knee extensors, which was conducted in the prone position. The rater passively moved each participant’s joints through a full range of motion at a stretching velocity standardised by counting ‘one thousand and one’ and judged the degree of tone.26 The rater performed only one movement cycle in flexion and one movement cycle in extension; repetitive cycles were avoided.

The TS17 was introduced in 1954 and further modified, presently known as MTS.19 The three key components of the test are as follows: the velocity of stretch, the quality of the muscle reaction, and the angle of the muscle reaction. Based on the suggestions of previous researchers,18, 19, 27, 28 the rater first moved the joint as slowly as possible (defined as V1) through its full range of motion. The angle was measured with a universal goniometer by another physiatrist who placed the goniometer near the joints and read the values. The angle of full range of motion was defined as R2. Afterwards, the rater moved the joint as rapidly as possible (defined as V3) in the same direction and through the same full movement arc. The angle of muscle reaction (either a clear catch or clonus) was measured with the goniometer and recorded as R1. The difference between R2 and R1 (R2−R1) reflects the dynamic tone component of spasticity. The larger the spasticity angle, the more spastic the muscle.

The quality of the muscle reaction (the X value) was then rated at V3. A standard 0–5 scale was used for the qualitative rating of reaction to fast movements. A score of 0 represents no resistance throughout the course of the passive movement. A score of 1 represents a slight resistance throughout the course of stretching without a clear catch at any angle. A score of 2 represents a clear catch occurring at a precise angle, interrupting the passive movement and followed by a release. A score of 3 represents a fatigable clonus (<10 s, when maintaining the pressure). A score of 4 represents an infatigable clonus (>10 s, when maintaining the pressure) occurring at a precise angle. A score of 5 represents an immovable joint. Only one stretch was used at each speed.

Testing protocol and reliability

The MAS and MTS scores were collected by two experienced physiatrists from the right hip adductor and extensor muscles, right knee extensor and flexor muscles and right plantar flexor muscles of the participants. These physiatrists each had >5 years of experience in the assessment of spasticity. To ensure optimal standardisation of the MAS and MTS assessment, they undertook a training session before the initiation of the study.

The participants were first measured with the MAS and then 30 min later with the MTS. The raters were blinded to the scores of the examinations. Inter-rater reliability was investigated with a 30-min break between assessments. A third physiatrist performed the goniometric measurements for the MTS. The goniometric measurement protocol for the MTS is shown in Table 1.

Table 1 Goniometer measurement protocol of the MTS

Test–retest reliability was investigated by having the first physiatrist retest each participant positioned in the same way as the first test, at the same time of the day, 1 week apart.

Analyses

Statistical analyses were performed using the IBM SPSS Statistics 22 software package (IBM Turk Limited Company, Istanbul, Turkey). Data are presented as mean±s.d. The inter-rater and test–retest agreement of the MAS and MTS X scores (qualitative spasticity ratings) were determined using weighted kappa statistics.29 We used the criteria of Landis and Koch to interpret kappa values: <0.0=poor, 0.00–0.20=slight, 0.21–0.40=fair, 0.41–0.60=moderate, 0.61–0.80=substantial, and 0.81–1.00=almost perfect.30 For continuous data such as R1, R2 and R2−R1, the test–retest and inter-rater reliability of angle measurements was determined using an intra-class correlation coefficient (ICC).29, 31 The ICC values were interpreted as follows: <0.40=poor; 0.40–0.59=fair; 0.60–0.74=good; and 0.75–1.00=excellent.32 Spearman’s rank correlation coefficient was used to identify the relationship between the MAS and the MTS X scores, and Wald test was used to set statistical differences between kappa values.

Results

The mean (s.d.) age of participants was 44±14 years. Twenty-one were females (36%) and 37 were males (64%) with a mean (s.d.) time since injury of 49±60 months (range: 6–338 months). The characteristics of the study population are given in Table 2. The total mean (s.d.) score for the Functional Independence Measure was 88±26 points.

Table 2 Characteristics of the study population

Inter-rater and test–retest reliability of the MAS

The inter-rater and test–retest kappa values for MAS are shown in Table 3. The test–retest kappa coefficients ranged from 0.636 to 0.716 and demonstrated substantial agreement (P<0.01), except for the hip adductor muscles. The test–retest kappa coefficient for the hip adductor muscles was 0.580, indicating moderate agreement (P<0.01). The inter-rater agreement measures for the MAS were slightly lower than those for test–retest agreement, and the kappa values ranged from 0.531 to 0.774 (P<0.01) for the two physiatrists. The measurements made by the two physiatrists for the hip adductor, hip extensor muscles and knee extensor muscles showed moderate agreement, and the measurements for the knee extensors and ankle plantar flexor muscles showed substantial agreement.

Table 3 Inter-rater and test–retest agreement of the MAS

Inter-rater and test–retest reliability of the MTS

The kappa coefficients for the MTS X scores (qualitative spasticity ratings) were 0.73–0.82, indicating substantial agreement. The results are listed in Table 4.

Table 4 Inter-rater and test–retest agreement of the MTS X scores

The ICCs for the inter-rater and test–retest reliability are shown in Table 5 and Table 6, respectively. The R2−R1 measurements showed excellent inter-rater reliability for all the muscles tested. The R1 measurements of the right hip adductor, hip extensor muscles, right knee flexor and knee extensor muscles also showed excellent inter-rater reliability, but this reliability was only fair for the ankle plantar flexor muscles. The inter-rater reliability for the R2 measurements varied considerably across muscles, ranging from high (ankle plantar flexor) to good (knee flexor) to fair (hip extensor) to poor (hip adductor, knee extensor).

Table 5 Inter-rater reliability of the MTS R1, R2 and R2−R1
Table 6 Test–retest reliability of the MTS R1, R2 and R2−R1

The R2−R1 measurements showed excellent test–retest reliability for all the muscles tested. The R1 measurements of the right hip adductor, hip extensor muscles and right knee flexor and extensor muscles showed excellent test–retest reliability, but the ankle plantar flexor muscles showed only fair reliability. The test–retest reliability for the R2 measurements varied considerably across muscles, ranging from excellent (hip extensor and ankle plantar flexor) to fair (knee flexor) to poor (knee extensor).

Correlations of the MAS with the MTS X scores

Figures 1 and 2 show the percentage of participants with different levels of MAS and MTS X grades, respectively. Significant correlations were noted between the MAS and MTS X grades for all the muscles tested (hip adductor muscles, r: 0.791; hip extensor muscles muscles, r: 0.920; knee extensor muscles, r: 0.539; knee flexor muscles, r: 0.562; and ankle plantar flexor muscles, r: 0.864; all P<0.01). The differences between the kappa values of MAS and MTS X scores revealed significantly higher test–retest reliability for the MTS than for the MAS with respect to the hip adductor and knee flexor muscles (Z>1.96; P<0.05). Inter-rater reliability was significantly higher for the MTS than for the MAS in all the muscle groups (Z>1.96; P<0.05), with the exception of the hip adductor and ankle plantar flexor muscles (Z<1.96; P>0.05).

Figure 1
figure 1

The percentage of participants with scores of 0–4 on the MAS. The figure shows the distribution of the MAS scores based on the location of the muscle group. The vertical axis shows the percentages of the participants. A full colour version of this figure is available at the Spinal Cord journal online.

Figure 2
figure 2

The percentage of participants with scores of 0–5 on the MTS X. The figure shows the different scores of the MTS X with different colours. The vertical axis shows the percentages of the participants. A full colour version of this figure is available at the Spinal Cord journal online.

Discussion

This is the first study to investigate the reliability of the MTS and MAS in a large group of people with SCI. Our results demonstrate substantial test–retest agreement of the MAS for all the muscles tested, except for the hip adductor muscles. The MAS scores for the hip adductor, hip extensor and knee extensor muscles showed moderate inter-rater agreement, while the ankle plantar flexor and knee flexor muscle scores showed substantial inter-rater agreement. The inter-rater and test–retest reliability of the R2−R1 and R1 measurements of the MTS were generally excellent, except for the R1 measurements of the ankle plantar flexor muscles, which were fair. The reliability of R2 measurements of the MTS varied from muscle to muscle (that is, the inter-rater reliability of R2 measurements of the hip adductor muscles was poor, and the inter-rater reliability of R2 measurements of the knee flexor muscles was good). The MTS X scores showed substantial inter-rater and test–retest reliability.

Our results demonstrate significant correlations between the MAS and MTS X scores, with a significantly higher reliability for the MTS than for the MAS in some of the lower limb muscles. These findings somewhat support those of Mehrholz et al.21 and Waninge et al.,22 as these researchers showed a superior reliability of the MTS over the MAS.

Important factors that improve reliability are the use of well-trained, experienced testers and standardised user guidelines. When used by well-trained and experienced testers, the MTS has good-to-very-good test–retest reliability, although it shows only limited inter-rater reliability in patients with severe brain injury and impaired consciousness.21 Gracies et al.,33 who conducted a two-phase pretraining and posttraining study of the TS in patients with cerebral palsy, found significantly improved inter-rater and intra-rater reliability after training when measuring elbow flexor and plantar flexor spasticity in children with cerebral palsy. However, they had difficulty in achieving inter-rater reliability for spasticity angle measurements in the knee, even after training. In our study, we ensured optimal standardisation of the MAS and MTS assessments by having our testers undertake a training session before initiation of the study. Nevertheless, we found moderate-to-fair inter-rater reliability of the MAS and MTS R2 for the tested hip and knee muscles. This could be due to the inherent difficulties maintaining the position of the paralysed lower limbs of participants with SCI, regardless of training or experience. For example, it is difficult to perform rapid knee extension in these individuals. Moreover, MTS might not be as sensitive in the proximal muscle groups of the lower extremities or the clonus might not be as common in the proximal joint muscles of patients with SCI as seen in our study population (Figure 2).

By contrast, we found excellent inter-rater and test–retest reliability of R2−R1 and R1 measurements of the MTS (except R1, measurements of the ankle plantar flexor muscles were fair). These measurements have important clinical implications; for example, the R2−R1 value indicates the level of spasticity in the joint. Differentiation of spasticity and muscle contracture is important before a treatment decision is made. Based on the excellent inter-rater and test–retest reliability of R2−R1 shown by our results, we suggest that MTS may serve as a reliable and useful tool for deciding the treatment plan in patients with SCI.

The MAS is still the most widely used clinical measurement of spasticity in clinical settings.6 However, studies that have examined the reliability of the MAS of lower extremities among patients with SCI have yielded conflicting results.9, 10, 11 For example, Craven and Morris12 found an inadequate reliability for the MAS for determining lower extremity spasticity between raters or over time (inter-session). By contrast, Baunsgaard et al.13 demonstrated an acceptable reliability for the MAS and recommended repeated measures by the same rater when possible. Our test–retest reliability was substantial for the MAS for all the muscles tested, except for the hip adductor muscles. The test–retest reliability of the hip adductor muscles and the inter-rater reliability of the hip adductor, hip extensor and knee extensor muscles were moderate. The knee flexor and ankle plantar flexor muscles showed substantial inter-rater reliability. Therefore, the MAS could be used reliably for the assessment of spasticity in the lower extremity of patients with SCI. We also recommend assessment by the same rater when possible, as the test–retest reliability was higher than the inter-rater reliability.

One limitation of our study was that, although a 30-min break was imposed between assessments of inter-rater reliability, the preceding test could still have affected the subsequent test. Another limitation is that all our testers were experienced physiatrists, so our results may not be generalisable to less experienced testers or different health professionals (therapists, nurses and so on).

The multidimensional nature of spasticity has also been put forward as a consideration when selecting outcome measures for spasticity.9 Patients with SCI often experience exaggerated muscle reflex responses, such as clonus, flexor spasms and extensor spasms. The Spinal Cord Assessment Tool for Spastic Reflexes, developed by Benz et al.,34 is suggested as a valid and reliable tool for assessment of spasm activity in patients with SCI.35 Spasticity intervention trials should also include objective, subjective and functional assessments.36 Hence, a combination of tools is needed for effective evaluation, treatment and management of spasticity in patients with SCI. The MAS is an easy-to-administer tool, and the MTS is a good tool for assessing peripheral contributions of spasticity. Our results also demonstrate that MAS and MTS have adequate reliability for determining lower-extremity spasticity in patients with SCI, indicating that these scales may serve as complementary tools for functional assessments to decide the best treatment options for patients with SCI.

Conclusions

Our study findings indicate moderate-to-substantial inter-rater reliability and test–retest reliability for the MAS. The MAS can therefore be used reliably in spasticity measurements of the lower extremities of people with SCI. Owing to the higher test–retest reliability than the inter-rater reliability, we suggest assessment by the same rater if possible.

Although the MTS R2 showed poor inter-rater reliability for the hip adductor and knee extensor muscles and poor test–retest reliability for the knee extensor muscles, the MTS R2−R1 showed excellent inter-rater reliability and test–retest reliability for all the muscles tested. Thus the MTS may serve as a complementary tool for deciding the treatment plans for people with SCI.

Data archiving

There were no data to deposit.