Introduction

The cost burden on the health care system by the rehabilitation system and management of the related complications of spinal cord injury (SCI) is estimated to be 9.7 million dollars per year in the USA [1]. During the last 30 years, owing to improved care, the percentage of people surviving with complete lesions has risen to 50% and in tetraplegia to 33.3% among all people with traumatic SCI [2]. Given that SCI occurs mostly in those in the productive age groups and that there is a high cost placed on health care, arriving at the best therapeutic or rehabilitative management protocols for individuals with SCI is a priority. It remains undebatable that this goal could be done only with a valid and reliable functional tool [3]. Knowledge about the exact disability impact of SCI on an individual’s life may pave the way for better planning in a national health care system.

As a scale for the functional assessment, the Spinal Cord Independence Measure (SCIM) is a well-known tool worldwide [4, 5]. The latest version, SCIM-III, has been assessed in many multicenter trials to determine its validity and reliability [6, 7]. The comprehensive nature of SCIM-III provides patients and their caregivers with rehabilitation goals to be achieved, and motivates individuals with SCI to improve their level of independence by assessing their achievements that are important medically, socially, or psychologically [4].

In order to use the SCIM-III in a different language, not only do translations have to be performed, but also the tool should be adapted to measure the same concept in different cultural contexts and life styles. This process is known as cultural adaptation [8], and has been performed for the English version of SCIM-III successfully into many languages [9,10,11,12,13]. Traditionally, the Persian language, also called “Farsi”, has two varieties called “Dari” and “Tajik”. It is spoken by a noticeable population within several countries such as Iran, Tajikistan, Afghanistan, Azerbaijan, and many others in the Persian Gulf states like Bahrain, Iraq and the United Arab Emirates [14]. Therefore, developing a valid and reliable version of SCIM-III for Persian individuals with SCI seems to be a necessity. The aim of this study is to (a) develop the Persian version of SCIM-III (P-SCIM-III), and (b) evaluate its validity and consistency for Persian speaking participants with SCI.

Methods

Participants

This cross-sectional study was conducted from April 2015 to January 2017 on participants with SCI referred to our Center. Inclusion criteria were: (1) Persian language as mother tongue, (2) age between 18 and 60 years, and (3) disease chronicity of not < 1 month. Those who had major comorbidities or complications, ASIA Impairement Scale (AIS) E cases, and those unable to speak owing to tracheostomy were excluded from the study. Out of 362 referred cases, a total of 279 participants were included in the study. The ethics committee of Tehran University of Medical Sciences approved the study, and informed consent was obtained from all participants.

Instrument development

SCIM-III was translated separately by two translators fluent in English and native in Persian (forward translation). Their comparison and panel discussion led to the primary Persian version. Two other translators fluent in Persian and native in English, not familiar with the scale, translated the Persian form back to English (backward translation). Another committee comprised of a neurosurgeon, an occupational therapist, two general practitioners, and an English literature professor compared the original English form, and the obtained form, to achieve the best Persian version by rejecting inappropriate words. The committee then ensured that the translation was fully comprehensive. The two forms were compared with ensure semantic, idiomatic, experiential, and conceptual equivalence. Thereafter in a pre-test the Persian form of the questionnaire was given to two general practitioners experienced in evaluating patients with SCI to measure functional status in 30 individuals to assure the fluency and feasibility of the Persian questionnaire. Thereafter, according to the comments by the practitioners, the final version of the P-SCIM was developed (Figure S1).

Procedure

The study was performed during a 2-year period in an outpatient rehabilitation setting admitting individuals with SCI for educational programs. After checking demographic data and neurological status of the clients, they were evaluated for inclusion and exclusion criteria. Two general practitioners (rater 1 and 2) registered the P-SCIM forms for the included cases. They were unaware of each other’s assessment and were not involved in the treatment and management protocols of the participants. An occupational therapist (rater 3) educated in Functional Independence Measure™ (FIM™) assessment, registered the FIM scores for each case separately. Each case was interviewed at least 30–45 min in a silent and relaxed environment. The scores were based on the participant’s report of his/her daily activities. Raters did the measurement of each participant using the final Persian version of the SCIM, based on the most common form of activity during the last week at the site of residence. The results then were entered into SPSS software for Windows version 22, to perform statistical analysis.

Analysis

Analysis was conducted, calculating Inter-rater reliability and validity indices.

Validity indices

The developed P-SCIM was evaluated for convergence and discriminative validity. Convergence validity was obtained by means of the Pearson correlation coefficient between P-SCIM and FIM results. A value of 0.9 or greater was assumed to be statistically significant. Discriminative validity was determined by comparing P-SCIM scores for neurological subcategories, using one way analysis of variance with Tukey method.

Reliability indices

Inter-rater reliability was calculated employing Intraclass Correlation of Coefficience (ICC), total agreement, kappa coefficient, and Bland-Altman limit of agreement. Internal consistency was calculated using Cronbach’s alpha, which was assumed reasonable for values > 0.7 [15]. Kappa coefficient > 0.6 [16], and ICC > 0.75 [15] were assumed to be statistically acceptable. Bland-Altman analysis was performed by MedCalc software (version 8.6). A p value < 0.05 was considered to be statistically significant.

Results

Epidemiologic and neurological findings

Demographic characteristics, neurological level, and severity of SCI in the studied participants are shown in Table 1.

Table 1 Neurological and demographic variables of traumatic SCI

Convergence validity

The Pearson’s correlation of coefficience between P-SCIM scores obtained by rater 1 and 2, and FIM obtained by rater 3, was found to be 0.905 for rater 1 and 0.900 for rate 2 (p < 0.001).

Discriminative validity

As shown in Table 2, there was an incremental trend in P-SCIM scores descending from level C1–C4 to T1–S5 and from AIS A, B, and C to AIS D. All the scores’ differences between the above categories within subscales and total P-SCIM were found to be statistically significant (p < 0.05). An exception was the difference between upper and lower cervical cases within all subscales, which was not significant (p > 0.05). Also, the scores of patients with AIS D and patients with “T1–S5, AIS A, B, C” subgroups, did not show significant differences for “self-care” and “mobility in room and toilet” subscales scores (Table 2, p > 0.05).

Table 2 Validity of P-SCIM III discriminating severity categories

Reliability indices

Within all subscales of P-SCIM, the ICC between raters 1 and 2 was acceptable (>0.75) (Table 3).

Table 3 Intraclass correlation between raters 1 and 2 within P-SCIM III subscales and total scores

The total agreement between raters for each item was found to be acceptable (>80%) in 16 out of 19 items, and unacceptable in three items; “grooming” (55.9%), “sphincter management-bowel” (76.7%), and “use of toilet” (74.2%) (Table 4). Kappa coefficient between raters was statistically significant for all items (kappa > 0.6), except for “grooming” (0.46, Table 4).

Table 4 Total agreement and kappa coefficients between raters on different P-SCIM III’s items

Using Bland-Altman analysis, the mean difference between rater 1 and 2 was 0.74 for the total scores (Table 5). In the same way, the mean differences between rater 1 and 2 in total scores stratified by age (< 26, 26–32, 32–40, and > 40 years), gender, education (< 6, 6–12, and >12 years of formal education), neurological level subgroup, and time since injury (< 6 months, >6 months) were measured. Time stratum (< 6 months since trauma) had the greatest mean difference (2.18 score), whereas those aged 32–40 years showed the least mean score difference (0.03) between raters 1 and 2. The mean difference for other strata (neurological level subgroups, AIS, education years subgroup, and gender) did not show statistical significance (p > 0.05).

Table 5 Bland-Altman mean differences: limit of agreements between rater 1 and 2 within P-SCIM III subscales and total scores

Cronbach’s alpha for internal consistency was calculated to be 0.866 for rater 1, and 0.868 for rater 2. The obtained Cronbach’s alpha decreased by all item deletion except for three items: “respiration”, “mobility in bed”, and “transfers ground to wheelchair”.

Discussion

The SCIM-III has been accepted as the most valid assessment tool for individuals with SCI worldwide [17]. Until now, functional independence of patients measured by SCIM-III has been assessed either by observation of their performance, interviewing them, or self-report method. All the three methods have been valid and reliable options, but the highest agreement belongs to observation [7] and the lowest, to self-report [18, 19]. The observation method has been reliable but time consuming, whereas interview has both acceptable reliability and is less time consuming [20]. Therefore, many outpatient centers perform their assessments by the interview method [21, 22].

The SCIM-III has been adapted for Portuguese [9], Italian [11], Thai [13], Greek [23], Spanish [10], and Turkish [12] languages with favorable results. These adaptations are in line with cultural adaptation and translation of assessment scales, a process previously well described [8]. In the same way these processes have been conducted previously for translation of other SCI-related questionnaires and scales into the Persian language [24,25,26]. Discriminative validity of Turkish [12] and Thai [13] versions of SCIM-III have previously been demonstrated, and the same was shown in our study for P-SCIM. We used the neurological categories as proposed by DeVivo et al. [27] to establish discriminative validity of the P-SCIM. Our findings also reveal that P-SCIM has the validity to discriminate participants based on their neurological severity categories (Table 2).

The obtained Cronbach’s alphas of P-SCIM in our study to evaluate internal consistency of the test were close to the previously reported values for the English SCIM-III [6, 7, 28, 29]. As previously demonstrated in the English SCIM-III [2, 7, 23], the Cronbach’s alpha and internal consistency of the P-SCIM subscale “Respiration and Sphincter Management” increases when the “respiration” item is deleted from the subscale [6, 7, 28]. Regarding the “ Mobility in room and toilet” subscale, we observed that deletion of the “Mobility in bed” item increases the Cronbach’s alpha. These findings have been reported during evaluation of English and Turkish versions [6, 7, 12, 28]. This finding may be because “respiration” and “mobility in bed” item scores may remain constant for many neurological levels below the high cervical area with different independence scores. Studies on Italian and Spanish versions have shown the mentioned feature in subscale “Respiration and Sphincter Management” only after deletion of the “Respiration” item [10, 11]. At the same time, the papers on Thai and Portuguese versions of SCIM, have not reported “alpha if item deleted” analysis at all.

Until now, functional independence of individuals with SCI measured by SCIM-III has been assessed either by observation of their performance, interviewing them, or self-report method.

Regarding the mean of differences (0.7), between rater 1 and 2 for total P-SCIM scores in different cases assessed by Bland-Altman method, it was very small, although statistically significant (p < 0.05, Table 5). Our inter-rater reliability indices [ICCs (Table 3) and kappa coefficients (Table 4) of P-SCIM] were close to the results reported for the English SCIM-III obtained by observation method [6, 7, 28, 29]. Therefore, our rapid interview method results may be comparable with the more time consuming observation method.

The obtained kappa coefficients of P-SCIM (Table 4) had more than acceptable standard values (0.6) for most items. All the corresponding values for the English SCIM-III assessed by interview have been < 0.6 except for the two items: ground/bed to wheelchair. This discrepancy could be due to the method of interview, i.e., in our study, each individual was evaluated by the raters with the same profession, whereas in the Itzkovich 2017 study [19], participants were assessed by raters with different professions (occupational therapist and physiotherapist). Also, our large sample size may be another explanation for this difference (279 vs 19).

Ackerman et al. [30] have reported some SCIM-III items either to have a floor effect: “transfer ground to wheelchair” and “stair management”, or ceiling effect: “respiration”. We have had the same observations; floor effect for the items “stair management (86%); and “transfer ground to wheelchair (74%)”, and ceiling effect for items “respiration (81.4%)” and “feeding (77.4%)”.

Although SCIM-III has sufficient reliability indices, nevertheless many studies also have proposed that developing a ‘training manual’ of SCIM-III can promote its reliability [30, 31] further. Developing a manual for P-SCIM may be a good horizon for future work.

Among the limitations of our study we note that the etiology of SCI in our participants was restricted to those with traumatic injuries, which could limit the generalizability of our results. Most studies have considered a mixed population [6, 7, 10,11,12,13, 29]. Further studies to evaluate P-SCIM in non traumatic SCI may be necessary. Future studies may be necessary to evaluate “sensitivity to functional changes” and “intra-rater reliability” indices further, and to assess the appropriateness of P-SCIM for practical clinical application.

Conclusion

Our study suggests that the P-SCIM (Appendix 1) is a valid and reliable measurement scale for individuals with traumatic SCI. P-SCIM may be useful for Persian speaking health professionals who may not be fluent in English. Further multicenter trials may be necessary to obtain more concrete evidence about the validity and reliability of the questionnaire.