Introduction

Precise classification of the neurological state of patients with acute spinal cord injury (SCI) is essential in determining the prognosis, therapeutic strategy and outcomes/benefits from the interventions [1,2,3,4,5,6,7,8,9,10]. The American Spinal Injury Association (ASIA) published standards for neurological and functional classification of SCI in 1982 [11]. The International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) was accepted and recommended by the International Medical Society of Paraplegia (IMSOP) in 1992, and the ASIA Impairment Scale (AIS) based on the revised version of ISNCSCI [12, 13] is now used worldwide to determine motor and sensory impairments in patients with SCI. In order to accurately assess AIS, it is mandatory to determine the sensory level, sensory scores, motor level and strength of the key muscles using the ISNCSCI classification [14]. The ISNCSCI method was tested for its inter- and intra-rater reliabilities [15,16,17,18,19,20,21,22,23,24,25,26]. This method is slightly complicated, and several studies illustrated that training and experience may be required for accurate agreement [15, 20, 21, 25, 26]. Importantly, some studies reported that the reliability of sensory and motor components of ISNCSCI in patients with incomplete SCI was lower than that in patients with complete SCI [15, 19, 20, 22, 26]. In the patients with motor incomplete SCI, the most complicated task even after training was to distinguish AIS B from AIS C/D based on the motor sparing of more than three segments below the motor level [20, 24, 27]. Moreover, in the patients classified as AIS D after training, ~15% of those was misclassified [20, 27]. These results indicate that a very careful evaluation was mandatory to correctly assess AIS in patients with incomplete SCI. These patients often exhibit spontaneous motor function recovery; hence, an easy and quick to perform screening tool needs to be developed because a complete ISNCSCI examination in these patients might reach up to an hour, and frequent evaluation using the full ISNCSCI method is not practical in the early stage of injury.

We previously reported that the knee-up test was useful for the early detection of postoperative motor deficit [28] as well as hysterical paralysis [29]. This test is also thought to have a possibility to easily estimate the severity of paralysis. The purpose of this study is to verify the usefulness of estimating paralysis severity using the ‘knee-up test’ in patients with incomplete SCI, especially in distinguishing AIS C from D.

Methods

Study population

A total of 675 patients with cervical SCI (CSCI) were treated at the Department of Orthopaedic Surgery of the Spinal Injuries Centre from June 2005 to June 2016. The inclusion criteria were the following: (1) patients with traumatic CSCI verified using magnetic resonance imaging, (2) patients with motor incomplete CSCI determined according to the ISNSCI classification at the time of admission and (3) ability to adequately communicate. The medical records of 215 consecutive patients with acute motor incomplete CSCI, who were treated within 2 days after injury, were reviewed. Fifteen patients were excluded because some motor functions were not testable due to fracture and/or hypersensitivity of the extremities. The medical records of the remaining 200 patients were then analysed.

Informed consent was obtained from all individual participants included in the study.

Manoeuvre of the knee-up test

Patients were placed in the supine position, and the clinicians asked them to lift up their knees off the bed (Fig. 1a). If patients were able to lift their knees to an upright position, they were considered to have a positive knee-up test result (Fig. 1b). If patients were unable to do so, then the knee-up test had a negative result (Fig. 1c). Patient had a positive single knee-up test if he/she was able to lift at least one of his/her knees, whereas the patient was considered to have a negative single knee-up test if he/she was not able to lift either of his/her legs.

Fig. 1
figure 1

a In the knee-up test, the patient was placed in the supine position. b Clinicians ask patients to lift up their knees off the bed. If patient was able to lift their knees to an upright position, then the patient had a positive knee-up test result (left leg). c If the patient was unable to lift the knees up to the upright position, then the patient had a negative knee-up test result (left leg)

Patients were independently examined at the time of admission by two expert orthopaedic surgeons (I.Y. and T.U.), who had >5 years of experience in using the ISNCSCI method. At the same time, two non-expert residents who had <1 year of experience in using the ISNCSCI method performed the knee-up tests. They were blinded to the results obtained by the other observers.

Key muscles for knee-up positive and relation between the key muscles and upper and lower extremities’ muscle functions

Considering that the physiologic continuity of the spinal cord long tract fibres is broken into CSCI, paralysis of one muscle is often linked to paralysis of the other muscles in the upper and lower extremities. Therefore, we analysed the key leg muscles that contributed positively to the knee-up test and the relationship between the motor scores of all key muscles and key leg muscles associated with the knee-up test.

AIS classification

Regarding AIS, data to be filled into the neurological worksheet were: sensory scores including light touch and pin prick, motor score for each segment, presence of voluntary anal contraction, and results of the knee-up test. Since 1980, non-key muscles, such as shoulder elevation, shoulder abductors, wrist flexors, finger extensors, hip adductors, hip abductors and long toe flexors, were examined and graded using the Manual Muscle Test. All data were collected prospectively. Given that the ISNCSCI methods were improved during our study period, to update the AIS classification, every parameter, including sensory score, motor score for each segment and presence of voluntary anal contraction of each patient, was entered into a personal computer, and the AIS of each patient was evaluated and verified by the computerised algorithm designed by Walden et al. [30]. after completing the data entry.

When a disagreement existed between the results of the single knee-up test and the results of AIS obtained by the two observers, the results were classified as single knee-up test negative and as AIS C.

Statistical analysis

The association between the result of the single knee-up test of a patient and the results of AIS was investigated. Fisher’s exact test was used to compare categorical data. Odds ratio (OR), and 95% confidence interval (CI), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. The kappa statistic was used to assess the inter-observer agreement on the knee-up test.

To clarify the importance of the key leg muscles in maintaining the knee in its lifted position, the relationship between the result of the knee-up test and the strength of each key leg muscle was analysed using a multiple logistic regression model. Correlations between the motor scores of the significant key muscles and those of the other key muscles were analysed using Pearson’s correlation coefficient. The correlation coefficient (r) was interpreted as follows: value of r between −1.0 and −0.7 or 1.0 and 0.7: a strong linear relationship; between −0.7 and −0.5 or 0.5 and 0.7: a moderate linear relationship; and between −0.5 and −0.3 or 0.3 and 0.5: a weak linear relationship [31]. All statistical analyses were performed using the JMP 11 statistical software package (SAS Institute Inc., Cary, NC, USA). A value of p < 0.05 was considered to be statistically significant.

Results

Two hundred eligible patients (166 males, 34 females) were selected for inclusion in the final sample. All injuries were the result of blunt trauma. Mean patients’ age was 62.4 ± 15.6 years (mean ± SD) (range, 16–91 years). The consciousness level of all patients was normal. The mean interval between the day of injury and the day of evaluation was 0.8 ± 0.75 day (mean ± SD) (range, 0–2 days). The neurological status of each patient was evaluated at the time of admission. Neurological level of injury was C2 in 4 patients, C3 in 15, C4 in 104, C5 in 45, C6 in 25, C7 in 6 and C8 in 1 patient. Ninety-five of the 200 patients were classified as AIS C. The mean total upper motor score (TUEMS) was 11.95 ± 1.14 (mean ± SD) and mean total lower motor score (TLEMS) was 14.26 ± 1.08 (mean ± SD). According to Pouw et al. [32], traumatic central cord syndrome (TCCS) was defined as a TUEMS of 10 points less than the TLEMS; 30 in 95 patients (31.58%) had TCCS. One hundred and five of the 200 patients were classified as AIS D. The mean TUEMS was 30.10 ± 1.07 (mean ± SD) and mean TLEMS was 41.78 ± 1.02 (mean ± SD). Fifty-five in the 105 patients (52.38%) had TCCS. Both mean TUEMS and TLEMS in the AIS D group were statistically significantly higher than those in the AIS C group (TUEMS: p < 0.0001, TLEMS: p < 0.0001) and the percentage of TCCS in the AIS C group was statistically significantly higher than that in the AIS D group (p = 0.0041) (Table 1). The kappa coefficient for the AIS between two expert orthopaedic surgeons was 0.99 (p < 0.0001). There was no difference in the knee-up test derived by the two non-expert examiners.

Table 1 Demographic characteristics according to the American Spinal Injury Association impairment scale

One hundred and twenty-six patients exhibited positive results when evaluated using the single knee-up test, whereas 74 patients exhibited negative results. The mean TUEMS was 27.03 ± 1.10 (mean ± SD) and the mean TLEMS was 39.16 ± 0.94 (mean ± SD) in the knee-up positive group, whereas the mean TUEMS was 12.04 ± 1.44 (mean ± SD) and the mean TLEMS was 10.91 ± 1.24 (mean ± SD) in the knee-up negative group. Seventy-one in 126 patients (56.35%) had TCCS in the positive group and 14 in 74 patients (18.92%) had TCCS in the negative group. Both mean TUEMS and TLEMS in the knee-up positive group were statistically significantly higher than those in the knee-up negative group (TUEMS: p < 0.0001, TLEMS: p < 0.0001) and the percentage of TCCS in the negative group was statistically significantly higher than that in the positive group (p < 0.0001) (Table 2). One hundred and four of the patients with positive results were classified as AIS D. Seventy-three patients with negative single knee-up test were classified as AIS C (Fisher’s exact p < 0.0001; OR, 345.1; 95% CI, 45.5–2617.7) (Table 3). The sensitivity, specificity, PPV and NPV of this test for all cases were 99.1, 76.8, 82.5 and 98.7, respectively. Concerning the relationship between the knee-up test and TCCS (Table 4), the sensitivity, specificity, PPV and NPV of this test were 83.5, 52.17, 56.35 and 81.03, respectively.

Table 2 Demographic characteristics according to the result of knee-up test
Table 3 Relationship between the American Spinal Injury Association impairment scale and the knee-up sign findings
Table 4 Relationship between the traumatic central cord syndrome and knee-up test findings

Multiple logistic regression demonstrated that hip flexors (p < 0.0001; OR, 5.87; 95% CI, 2.94–13.6), hip adductors (p = 0.0352; OR, 2.11; 95% CI, 1.04–4.29) and hip abductors (p = 0.0291; OR, 1.86; 95% CI, 1.06–3.49) were statistically significant for the key leg muscles that contributed positively to the knee-up test (Table 5). When the muscles were restricted only to the ASIA key muscles, multiple logistic regression demonstrated that hip flexors (p < 0.0001; OR, 10.9; 95% CI, 5.9–23.4) and ankle dorsiflexors (p = 0.0255; OR, 1.6; 95% CI, 1.1–2.6) were statistically significant for the key leg muscles that contributed positively to the knee-up test (Table 6).

Table 5 Multiple logistic regression by combined muscle functions for the positive knee-up test (with the non-key muscles)
Table 6 Multiple logistic regression by combined muscle functions for the positive knee-up test

The motor score of both hip flexors (Table 7) and ankle dorsiflexors (Table 8) showed significantly strong correlations with that of the other key muscles in the lower extremities, and significantly moderate correlations with that of the elbow extensors, finger flexors and finger abductors. The motor scores of hip flexors and ankle dorsiflexors showed a significantly weak correlation compared to those of the elbow flexors and wrist extensors.

Table 7 Correlation between the motor score of hip flexors and those of the other key muscles
Table 8 Correlation between the motor score of ankle dorsiflexors and those of the other key muscles

Discussion

In the present study, we demonstrated the usefulness of the knee-up test for AIS assessment in patients with motor incomplete CSCI. To correctly assess AIS, several complicated examination steps are required. First, sensory (ie, light touch and pin prick sensation) and motor examinations for the right and left extremities are determined according to the ISNCSCI method. Second, NLI, which is the most caudal segment of the cord with intact sensation and antigravity muscle function strength, should be determined. Third, AIS is evaluated using these sensory and motor findings in a comprehensive manner. Mastery of skills needed for each step in the ISNCSCI method is required for clinicians as described above, indicating that examination skills might influence the result of AIS grading. Both AIS C and D indicate that motor function is preserved below the NLI; however, the distinction between them is not always easy. The function of more than half of the key muscles below the single NLI have a grade of <3 (grades 0–2) is classified as AIS C, whereas the case wherein at least half (half or more) of the key muscles below the NLI have a grade greater or equal to 3 (grades 3–5) is classified as AIS D [14].

In patients with acute CSCI, some merits exist for accurate distinction between AIS C and D. For example, AIS C is a more appropriate neurological condition in validating the efficacy of treatment for neurological improvement [1,2,3,4]. In addition, acute complications were found to develop more frequently in patients with AIS C than in those with AIS D [5, 6]. Therefore, early distinction between AIS C and D is important to evaluate treatment outcomes and complication risk during the early stage of injury, especially the need for tracheostomy [5] in patients with incomplete CSCI.

Several studies reported the inter-rater and intra-rater reliability of ISNCSCI [15,16,17,18,19,20,21,22,23,24,25,26]. Generally, intra-rater and inter-rater agreements among trained or experienced raters were acceptable; [17, 21,22,23] however, even for trained examiners, the agreements of sensory and/or motor examination were less reliable in patients with an incomplete SCI than in those with a complete SCI [19, 22]. In addition, although several studies have reported that training improved accuracy of agreement [15, 16, 18, 20, 24,25,26], efficacy was significantly lower in incomplete than in complete SCI [15, 16, 20, 24,25,26]. For example, Chafetz et al. [20] reported that training improved correct classification of patients with AIS C from 29 to 54% and of those with AIS D from 37 to 84%, and stated clearly that accurate classification of AIS designation remained unacceptably low even after training. These reports suggested that non-experts should receive proper training before using the ISNCSCI in clinical practice [20, 25]. In this study, inter-rater agreements among two expert orthopaedic surgeons who had 5 years or more of experience in using the ISNCSCI method in >400 SCI cases were high enough. This means that to be able to correctly classify patients with motor incomplete CSCI, proper training and greater level of experience are needed.

In contrast, the knee-up test presented in this study does not require special skills when performing. We previously reported that the knee-up test was useful for the early detection of postoperative motor deficit [28] as well as hysterical paralysis [29]. In the current study, no patient data overlapped with the previous studies [28, 29]. In a previous study [28], hip adductors and abductors were important muscles in lifting the knees up (knee-up test positive) and in keeping the knee in this position. However, these muscles were not used to analyse their correlations with the key muscles in the current study, because they were not key muscles of the ISNSCI classification. The knee-up test also provides easy distinction between AIS C and D in patients with motor incomplete CSIC. If the patient could not lift his/her knees in either legs to an upright position (single knee-up test negative), this patient was classified as AIS C with 98.6% probability; on the other hand, if the patient was able to lift his/her knee in at least one leg (single knee-up test positive) to an upright position, this patient was classified as AIC D with 82.5% probability. Although the knee-up test was useful to estimate AIS C or D, the purpose of this test was not to promote shortcuts for clinical evaluations but to screen the severity of motor incomplete CSCI for non-expert examiners using an easy clinical tool. The knee-up test is thought to be useful to estimate TCCS. However, with regard to the relationship between the knee-up test and TCCS, the sensitivity, specificity, PPV and NPV of this test were 83.5, 50.89, 56.35 and 80.26, respectively. These results suggest that the knee-up test is more useful for the distinction between AIS C and D in patients with incomplete CSCI than for the distinction of TCCS.

Even though the single knee-up test was positive in 126 patients, only 22 were classified as AIS C. This might explain the relatively low specificity of the knee-up test. Among these patients, 15 (68%) had a one side positive test result. When a single positive test is divided into two categories: both side positive (73 of 80 cases (91%) were AIS D) and one side positive (31 of 46 cases (67%) were AIS D), the percentage of correctly identified cases with AIS D increased; however, the sensitivity and specificity of the test could not be calculated. Regardless of its relatively low specificity, these correction rates of classification (98.6% and 82.5%) are high enough to be compared to the results of the trained raters [15, 20, 24, 27].

In this study, we analysed the motor score of the key muscles necessary for lifting the knee using a multiple logistic regression model and found that the hip flexors and ankle dorsiflexors were the statistically significantly associated with the knee-up test (Table 2). Notably, the motor score of these muscles had a strongly positive correlation with the score of the key muscles in the lower limbs and a moderately positive correlation with that of the key muscles of upper limbs. These results suggest that a simple examination using the knee-up test yields general information of paralysis in extremities, especially in the legs, with great accuracy. Alternatively, the lower-limb muscles, besides the hip flexors and ankle dorsiflexors, had a moderate correlation with the upper-limb muscles. These results indicate that patients with a positive knee-up test most likely had a substantial motor function in the upper and lower limbs. Given that the distinction between AIS C and D is that whether at least half of the key muscle functions below the NLI have a muscle grade of >3 (5–3) or <3 (2–0), there is a high possibility that patients with positive knee-up test are classified as AIS D. For this reason, this test could estimate the AIS classification with high probability in patients with incomplete CSCI.

There are several possible limitations associated with this study. First, the sample size was relatively small. Second, the knee-up test was evaluated at a single institution. Third, this test is not applicable to patients with pelvic fracture and/or lower-limb fracture who are unable to lift up their knees. Fourth, because only patients in a very acute stage were examined, the results of this study cannot be transferred to patients in the chronic stage because of severe spasticity, joint contractures, pain and heterotopic ossifications. Finally, the knee-up test is not useful for the patient with CSCI who has absent motor function of the lower extremities but sacral sensory sparing with sparing of motor function more than three levels below the motor level for that side of the body; the patient is then classified as AIS C. When the patient had no leg motor function but who has sacral sensory sparing, adequate ISNCSCI examination is mandatory to precisely classify AIS. Despite these limitations, we believe that this test is useful at bedside for non-trained clinicians who are not familiar with the ISNCSCI method. In addition, this test is applicable for the patients who have difficulties with precise communication and neurological evaluation.

In conclusion, the sensitivity, specificity, PPV and NPV of this test for distinction between AIS C and D in patients with motor incomplete CSCI were 99.1, 76.8, 82.5 and 98.7, respectively.