Introduction

For people with spinal cord injury (SCI), the assessment of dorsal column medial lemniscus (DCML) function provides important information regarding sensory status, which may be helpful in guiding its clinical management and for prognosticating the outcomes [1]. Knowledge of DCML function allows for setting reasonable goals in rehabilitation, and DCML function is also predictive of motor outcomes [2] during rehabilitation and neuropathic pain [3,4,5]. DCML function often improves during the subacute phase after SCI [6], and one study found that 39% of people classified as having complete SCI (ASIA-A) had some preserved sensory function below the level of injury [7]. Thus, clinically useful tools with adequate test–retest reliability are needed [7].

A variety of methods designed to assess DCML function exist, including light touch detection, vibration detection, two-point discrimination, joint position sense, graphesthesia, and directional cutaneous kinesthesia (DCK). However, these tests often yield different conclusions [8]. For example, in some patients two-point discrimination is impaired while vibration detection is intact. In fact, most of these somatosensory modalities involve the complex integration of numerous neuronal inputs, which may contribute to the discrepancy in findings between these tests [8]. Additionally, for most of these somatosensory modalities the reliability and clinical utility are adversely affected by a poor standardization of procedures in clinical practice and research. For example, it is difficult for clinicians to gauge the amount of force exerted when applying the apparatus for two-point discrimination, and the amount of force used impacts the results. As a result, there are disparate reports on normative values for two-point discrimination [9,10,11]. Similarly, vibration detection is usually assessed clinically using a tuning fork and pressure consistency on applying the tuning fork, or a lack thereof, influences the findings [12].

Quantitative sensory testing research protocols frequently test vibratory sense to evaluate and quantify DCML function [3, 13] in a standardized and reliable manner. However, simpler protocols for clinical use are needed. It has been contended that graphesthesia and DCK are the only true tests of DCML function, because these measures rely on the input from rapidly adapting cutaneous mechanoreceptors, and only the axons of rapidly adapting cutaneous mechanoreceptors remain in the DCML for their full extent [8]. Thus, graphesthesia and DCK appear to have less redundancy of input and are likely better measures of DCML function than the other tests of DCML function [5, 8, 14, 15]. However, the reliability and validity of methods for assessing graphesthesia and DCK has not been investigated in people with SCI, and there is a poor consistency across studies of people with SCI regarding the methodology of graphesthesia and DCK assessment [5, 8, 16, 17].

DCK, the ability to discern the direction of movement when a stimulus is applied across the skin, and graphesthesia, the ability to recognize letters or numbers written on the skin purely by the sensation of touch, are dependent on inputs from the DCML for adequate performance [5, 8]. DCK is thought to provide the essential input for graphesthesia, while graphesthesia performance requires additional cerebral processing for the identification of numbers and shapes [18, 19]. Since graphesthesia places a greater demand on intact patterning of input than does DCK, it is more sensitive to impairment of DCML function than DCK, as demonstrated by Bender et al. [8] in a study of 558 neurologically compromised patients. Tests for DCK and graphesthesia are also quick and easy to perform, and they do not require any expensive equipment. Therefore, DCK and graphesthesia may be ideal methods for the clinical assessment of DCML function after SCI.

The purpose of this study was to evaluate the psychometric properties of a simple scale for assessing graphesthesia and DCK in people with SCI. Current approaches for assessing graphesthesia and DCK typically have dichotomous outcomes (i.e., impaired or normal) or rely on patient self-report (i.e., additional scoring options are provided for the report of altered sensation), which may confound the results. This scale is a performance-based measure with scaled outcome scores. The scale consists of seven different tasks, with a potential score of 0 or 1 for each (representing incorrect or correct performance, respectively), and it provides a potential sum score range of 0–7. This comprehensive and scaled approach to assessing graphesthesia and DCK may be more sensitive to changes in DCML function than the isolated use of tests with dichotomous outcomes.

Methods

Participants

This study was based on a subset of data from a larger study involving people with SCI and neuropathic pain [7]. Seventy-two consecutive people with SCI and neuropathic pain were enrolled, and five were eliminated from final data analysis due to incomplete datasets. Participants were at least 18 years old and were able to speak and understand English. Participants were excluded if they had cognitive impairment (a score of <20 on the Mini-Mental State Exam), major depression, current (within a year) drug abuse, or significant chronic health conditions other than SCI. The average pain intensity was calculated from the average pain rating (0–10 Numeric Rating Scale) over the past week for up to three pain problems.

Sensory assessments

Graphesthesia and directional cutaneous kinesthesia testing (Graph-DCK), light touch detection, and vibration detection were assessed in areas of cutaneous innervation at the level of injury and below the level of injury. The level of injury was determined according to the International Standards for Neurological Classification of Spinal Cord Injury guidelines and defined as the most rostral single level with normal sensation [20]. Below level of injury was defined as areas at least four dermatomes below the neurological level of injury. Standardized test sites were identified based on anatomical landmarks to ensure that the same site could be accurately located in each person. Light touch detection was also performed according to the International Standards for Neurological Classification of Spinal Cord Injury guidelines. We certify that all applicable institutional and governmental regulations and approvals concerning the ethical use of human volunteers were followed during the course of this research.

a. Quantitative sensory testing

Vibration detection measurements were obtained with the TSA-II Neurosensory Analyzer and accompanying software (Medoc Ltd., Ramat Yishai, Israel). The handheld VSA-3000 probe of the Medoc system was used to measure vibration detection thresholds for a 100 Hz stimulus frequency using the ascending method of limits [3]. Participants were instructed to verbally indicate the “first moment” that they felt the vibration sensation, at which time the experimenter stopped the trial and recorded the threshold. This procedure was repeated three times at each test site, and the mean value across the three trials was calculated at the vibration threshold for each site. Vibration thresholds were then transformed into Z scores based on normative data [7].

b. Graphesthesia and directional cutaneous kinesthesia testing

For assessment of Graph-DCK, the experimenter used the rounded back-end of a pen to lightly trace on the participants’ skin, while their eyes were closed. The following tasks were performed in a random order: identification of the 1) direction (caudal or rostral) and 2) speed (fast or slow) of a 4-inch line drawn, 3) identification of the numbers 2 and 3, and 4) identification of the shapes of a circle and square drawn. For each trial, the participants were told if they needed to identify a number or shape, but they were not aware of which order numbers or shapes were being used in the protocol. All procedures were first performed on the cheek to familiarize the participants to the procedures. The slow line was drawn over a total of 4 s. The fast line was drawn over 1 s, and the pen was held on the skin at the terminus for an additional 3 s so that both the slow and fast trials lasted 4 s. One point was awarded for the identification of the correct direction of each line and an additional point was awarded for the correct identification of the speed of one line relative to the other line, providing a potential total sum score range of 0–3 for the identification of direction and speed of a line drawn. Similarly, numbers (two trials) and shapes (two trials) were drawn in a random order, over a 4 × 4 inch area, and scored as correct or incorrect (0 or 1, respectively) for the identification of the number or shape drawn. The sum score for all the tasks was then calculated (for a potential range of 0–7, with 7 indicating correct performance on all tasks) and recorded as the Graph-DCK Scale score (Table 1).

Table 1 Graph-DCK Scale

Analyses

Internal consistency

Cronbach’s alpha test was used to analyze the internal consistency between scale items and to determine if all the items should be retained in the scale. A good internal consistency was defined as a Cronbach’s alpha score above 0.70 [21].

Test–retest reliability

To assess reliability, we administered a scale to 67 participants with chronic SCI and neuropathic pain. Since these participants were in the chronic stage after injury, changes in DCML function were unlikely. The scale was administered twice, with a two- to four-week period between testing sessions, and Intraclass Correlation Coefficient (ICC) estimates and their 95% confidence intervals were calculated to determine the test–retest reliability of Graph-DCK Scale scores. Good reliability was defined as statistically significant (p < 0.05) ICC values above 0.70 based on a single-rater, consistency, 2-way random model [22].

Convergent and divergent validity

To determine convergent validity, the correlation between mean Graph-DCK Scale scores and normalized vibration detection threshold values, another measure of DCML function, was analyzed. Since the repeated Graph-DCK Scale scores were strongly inter-correlated with initial scores (r = 0.84, p = 0.00 for at level and r = 0.85, p = 0.00 for below level), the mean value of Graph-DCK Scale scores for the two measurement sessions was calculated to provide a more accurate representation of true Graph-DCK Scale performance than a single-session performance. Vibration detection is less specific to DCML function than graphesthesia and DCK [8]. Thus, only a moderate, significant positive correlation between Graph-DCK Scale scores and vibration detection was expected. A statistically significant Pearson Product Correlation Coefficient value greater than 0.30 is indicative of a moderate correlation and would be supportive of convergent validity [23]. Divergent validity was determined by analyzing the correlation between mean scale scores and Beck’s Depression Inventory scores. Depression is associated with pain after SCI [24], and there was a wide spectrum of Beck’s Depression Inventory scores in this cohort (ranging from 0 to 36). However, depression is not mediated by DCML function. Thus, the lack of any statistically significant correlation between the scale and the Beck’s Depression Inventory would be considered supportive of divergent validity. We also used hierarchical linear regression analysis to assess to control for potential confounders such as age, medication use, and pain in the area tested, since these factors may influence the validity and/or reliability of the scale in people with SCI.

Results

Sixty-seven participants with SCI and neuropathic pain were included in this study (Table 2). All participants were tested below the level of injury, and 52 were also tested at the level of injury. More than half of the participants had pain in the area being tested, with 63% reporting pain in the site of testing for at-level Graph-DCK assessment and 55% reporting pain in the below-level test site. The mean pain intensity was 6.1 ± 1.8. Moreover, participants without pain in the at-level test site had significantly better Graph-DCK scores than those with pain in the test site, with a mean difference of 1.6 ± 0.65 (p = 0.02). The mean Graph-DCK score was 6.3 ± 1.5 for participants without pain in the at-level test site and 4.8 ± 2.5 for participants with pain in the at-level test site. There was no difference between participants without and those with pain in the below-level test site, with mean scores of 2.2 ± 2.4 and 2.6 ± 2.5, respectively.

Table 2 Participant background information

As expected, below-level Graph-DCK scores were significantly higher for participants with an incomplete SCI compared to those with a complete SCI, with mean scores of 4.4 and 1.2, respectively. Additionally, at-level Graph-DCK scores were higher than below-level scores for all participants with mean values of 5.0 ± 2.4 and 3.5 ± 2.5, respectively. Approximately half of the participants demonstrated normal graphesthesia (mean Graph-DCK score of 7, n = 29) at the level of injury and approximately half demonstrated near-complete agraphesthesia (mean Graph-DCK score ≤ 1, n = 31) below the level of injury. Similarly, 47% of the participants demonstrated normal light touch sensation at level, and 61% had absent light touch below level. However, among participants with some but not a total impairment of DCML function, there was a relative even distribution of participants across all levels of impairment (Fig. 1a and b).

Fig. 1
figure 1

a and b Distribution of Graph-DCK scores

Internal consistency

The Graph-DCK Scale items demonstrated a high internal consistency, with Cronbach’s alpha values of 0.91 and 0.92 for at-level and below-level assessments, respectively. Moreover, internal consistency was best with all the seven items included. Thus, the Cronbach’s alpha value far exceeded the predetermined standard of 0.70 for a good internal consistency.

Test–retest reliability

Intraclass Correlation Coefficient values were 0.83 (95% confidence interval = 0.72–0.90) and 0.85 (95% confidence interval = 0.77–0.91) for at-level assessment and below-level assessment, respectively. Thus, the scale demonstrated good test–retest reliability for use at both sites.

Validity

The mean Graph-DCK scores were calculated for each participant from the values taken over the two measurement sessions for at-level and below-level assessments. At-level and below-level mean Graph-DCK scores were significantly correlated to normalized vibration detection scores for both at-level assessment (r  = 0.43, p = 0.003) and below-level assessment (r = 0.49, p = .000), respectively (Fig. 2a and b). In addition, mean Graph-DCK scores were not significantly associated with Beck Depression Inventory Scores for at-level (r = 0.10, p = 0.50) or below-level assessment (r = 0.22, p = 0.09). The moderate correlation observed between Graph-DCK scores and vibration detection provides evidence of concurrent validity for the use of the Graph-DCK Scale as a measure of DCML function, while the lack of association between Graph-DCK scores and depression provides evidence of divergent validity.

Fig. 2
figure 2

a and b The association between Graph-DCK scores and normalized vibration detection

Assessment of potential confounders

Using hierarchical linear regression analysis, a model including age, medication use (opioid, antispasticity, and anticonvulsant use were entered as separate variables), and pain in the area tested was not predictive of at-level mean Graph-DCK scores (R2 = 0.07, p = 0.68) or below-level mean Graph-DCK scores (R2 = 0.11, p = 0.19). Furthermore, partial correlations for the variables entered were not significant within either model, with the exception of age in the below-level Graph-DCK model (B = 0.07, p = 0.024) (Fig. 3).

Fig. 3
figure 3

Scatterplot of mean below-level Graph-DCK scores and age

Discussion

The Graph-DCK Scale demonstrated good psychometric properties in a cohort of 67 participants with SCI and neuropathic pain, for use at the level of injury and for use below the level of injury. The scale items had an excellent internal consistency (Cronbach’s alpha values > 0.90), test–retest reliability was good (ICC values > 0.80), and convergent validity was supported by moderate and significant positive correlations to vibration detection (r values > 0.40). Moreover, performance on the Graph-DCK Scale was not significantly correlated to scores on the Beck’s Depression Index, providing evidence of divergent validity, and performance on the scale does not appear to be influenced by medication use. Thus, this scale demonstrates the prerequisite qualities of reliability and validity for its recommended clinical use in people with SCI.

The Graph-DCK Scale described in this study is comprised of items that are all regularly used in clinical practice, and all of the items have been previously described in the literature [5, 16]. However, previous descriptions of these measures either had a binary interpretation of findings (i.e., impaired or normal) or relied on a subjective feedback to produce scaled scores. Wall and Noordenbos [5] were the first to describe the methods for DCK assessment used in this scale, and they provided a descriptive summary of repeated trials in three different patients. Performances on the correct identification of direction and speed were reported separately, and no attempt was made to quantify the impairment of DCML function by combining findings of graphesthesia and DCK tests. Defrin et al. [16] utilized the exact same items described in this study, and they scored items on a four-point scale (0 = complete lack of touch sensation to 3 = normal touch). However, in the Defrin study [16], scores of 1 and 2 represented reports of abnormal sensation during the test rather than the correctness of performance. Although reports of abnormal sensation are important findings, using the report of abnormal sensation in the scoring introduces several potential confounders that may not be related to DCML function. For example, most people with SCI and some impairment of DCML function are likely to have sensations that would be considered abnormal in the absence of SCI. Therefore, the correct or incorrect dichotomous responses are more precise because of the difficulty in interpreting the report of abnormal sensations.

The goal of the Graph-DCK Scale was to optimize the simplicity and utility of test item interpretation by using a binary scoring system for individual items, while also providing a mechanism to quantify the degree of DCML impairment by summing the scores of binary test items to create a scale. The utility of the Graph-DCK Scale is supported by the strong psychometrics of the scale observed in this study and the broad spectrum of scores observed in this cohort. The high internal consistency between test items suggests that all the test items are capturing the same construct (i.e., DCML function) and supports the validity of summing the item scores to create a scale. The broad spectrum of Graph-DCK scores observed in this cohort suggests that the Graph-DCK Scale captures varying degrees of DCML impairment, which supports the potential clinical utility of this scale.

Limitations and future research

The findings in this study should be interpreted with a modest degree of caution. This is the first study to investigate the reliability and validity of the Graph-DCK Scale in people with SCI. In fact, we are unaware of any previous studies on the reliability and validity of assessing graphesthesia and DCK in people with SCI. Therefore, these findings need to be replicated. Additionally, the cohort in this study was a relatively homogeneous group of individuals with chronic neuropathic pain after SCI. Future studies are needed to determine the psychometrics of this tool for people in different stages of chronicity after SCI and in those with SCI that do not have neuropathic pain. Further, longitudinal cohort studies are needed to determine the responsiveness of the Graph-DCK Scale to intra-participant changes in DCML function.

Conclusion

The Graph-DCK Scale provides a reliable and valid method for assessing graphesthesia and DCK in people with SCI. Unlike most methods of quantitative sensory testing or the use of sensory evoked potentials to assess DCML function, this tool can be administered in less than 5 min and it does not require any expensive equipment. Thus, the Graph-DCK scale is a clinically useful method for assessing DCML function in people with SCI.

Data archiving

All the data generated or analyzed during this study are included in this published article [and its supplementary information files].