Introduction

Lumbar spinal stenosis (LSS) refers to any type of narrowing of the lumbar spinal canal including central spinal canal or intervertebral foramina1, and it is one of the most common pathologic conditions affecting the spine2. The prevalence of degenerative LSS is reported ranging from 1.7 to 13.1%3,4,5, and it sharply increases with age to 47.2% in the 60 s2. Pain is main cause for seeking treatment in patients with LSS, and most commonly involves lower back, buttock, thigh and leg. Patients with LSS typically presents neurogenic claudication, which shows pain appearing with standing or lumbar extension, aggravated by walking, and relieved by sitting or forward flexion6,7. Most patients are treated conservatively with pain killers, injection physiotherapy, and rehabilitation. However, for patients whose symptom do not improve after conservative treatment, surgical treatment which includes decompression of spinal canal with or without spinal fusion can be performed to improve their clinical symptoms6.

Sleep is essential for restoration of physiologic function in addition to learning, memory, and cognition8,9. Disturbed sleep not only negatively affects quality of life but also causes mental10 and physical11 illness such as cardiovascular diseases, diabetes, and cancer, and eventually increases mortality12. Especially in older populations with chronic medical conditions, quality of sleep is closely linked to their life expectancy as well as quality of life12,13,14,15,16,17,18. However, despite the essential role of sleep in mental and physical health of older populations, quality of sleep is not considered in patients with LSS. The Oswestry disability index, the most commonly used disability index for LSS, simply evaluates the presence of pain-related sleep disturbance19. The short-form 36-item questionnaire (SF-36) and the EuroQol (EQ-5D), the most widely used health-related quality of life measures for patients with LSS, do not measure the quality of sleep at all20.

Cross-sectional studies investigated the quality of sleep in other musculoskeletal diseases such as osteoarthritis of the knee, and rotator cuff tear21,22, and recent prospective case-series measured their changes after surgical treatment23,24. In such joint diseases, pain is generally reduced during sleep, because the gravitational joint force is decreased in the lying position. However, in clinical practice, we frequently observe that symptoms of patients with LSS are aggravated having difficulty in falling asleep. Patients with LSS have a relatively smaller spinal canal during lumbar extension compared to lumbar flexion25. During sleep, patients with LSS remain in a prolonged lying position with lumbar extension, which causes significant narrowing of the spinal canal and aggravates symptoms from LSS26. Therefore, the pattern of sleep disturbance in patients with LSS and its changes after treatment is thought to be significantly different from other joint diseases.

Despite the importance of sleep and the distinct positional intolerance for lying during sleeping in LSS patients, there have been little studies that investigated sleep disturbance in LSS patients. We therefore planned a prospective comparative study to compare the changes in sleep disturbance in patients with LSS who either received conservative or surgical treatment.

Materials and methods

Study patients

A prospective cohort study was performed on patients with LSS who visited our institution’s spine center between May 2019 and October 2019. We excluded the following: patients who had either received surgical treatment for LSS, or a nonsurgical spinal procedure that included pain block within 6 weeks before the enrollment; cauda equina syndrome; malignancy; recent hospitalization; diagnosis of psychosis, sleep apnea, or dementia; dyspnea or visceral pain affecting sleep within 4 weeks before enrollment; recent overseas travel with jet lag and night shift works, and finally the inability to complete the questionnaire (Fig. 1).

Figure 1
figure 1

Patient enrollment.

Measurement of sleep disturbance: Pittsburgh sleep quality index (PSQI)

Sleep disturbance was evaluated with the standardized, local language version of the Pittsburgh sleep quality index (PSQI), which evaluates sleep habits during the last month27,28,29. This self-report questionnaire consists of 19 questions with seven subcategories: sleep quality, latency, duration, and disturbance; habitual sleep efficiency; use of sleep medications, and daytime dysfunction. The sum of the scores from all seven subcategories produces a global score (the PSQI score) ranging from 0 to 21 with higher scores associated with a poorer quality of sleep. A PSQI score over 5 points showed a diagnostic sensitivity of 89.6% and specificity of 86.5% (kappa = 0.75, p < 0.001) in distinguishing good and poor sleepers27, and PSQI of ≥ 6 points is generally accepted as indicating sleep disturbance30.

Covariates

Demographic data and medical history included age, body mass index (BMI), education, precise medication history during the previous 4 weeks before enrollment, and medical comorbidities evaluated by the Charlson comorbidity index. Information on lifestyle associated with sleep included occupation (none, sedentary or physical worker), smoking (within 2 h before bedtime), alcohol (within 2 h before bedtime), and caffeine intake (within 6 h before bedtime). The presence of chronic joint pain in the upper or lower extremities was included. Other questionnaires included the Oswestry disability index (ODI) score, and the Hamilton depression rating scale.

The following predefined radiologic parameters on lumbar stenosis severity were measured with MRI. The dural sac cross-sectional area (mm2) was measured at the most severe level of stenosis. At this level, we evaluated the morphologic grade of central-type stenosis based on the morphology of the dural sac31 and the morphologic grade of foraminal-type stenosis32.

Grouping and propensity score matching

Patients with LSS who had sleep disturbance (PSQI ≥ 6 points) were enrolled for the longitudinal study. The cohort of patients was divided into two groups according to the method of treatment: conservative group and surgical group (Fig. 1). Propensity score matching was used to attenuate potential selection bias between the two groups. Sleep quality at the time of enrollment could be influenced by the method of treatment of LSS during the previous 4 weeks. Therefore, we excluded patients who had received any surgical or nonsurgical procedures within the previous 6 weeks, and propensity score matching was done for the precise medication history, including the use of pregabalin, gabapentin, and opioids during the previous 4 weeks. The propensity score model additionally consisted of the covariates including age, sex, Charlson comorbidity index score, Hamilton depression rating scale, initial ODI score, and initial global PSQI score. Patient characteristics and infection profiles were compared between the two groups before and after the propensity score matching (Tables 1, 2).

Table 1 Comparison of initial patients’ characteristics.
Table 2 Comparison of initial radiologic profile.

Treatment protocol and follow-up measures

The conservative treatment protocol included a nonsteroidal anti-inflammatory agent, physical therapy, education for exercising at home, and pregabalin, gabapentin or opioids if tolerated. Patients were guided to take the provided oral medications with strict adherence for the first 4 weeks and freely thereafter on the basis of their need. Surgical treatment included posterior decompression with or without spinal fusion under general anesthesia. Patients received postoperative intravenous patient-controlled analgesia for 24 h. Additionally, patients received oxycodone extended-release 5–10 mg by mouth during the 2-week postoperative period. Follow-up measurement of ODI scores and PSQI scores was performed at 6 weeks, 3 months, and 6 months after enrollment, together with a detailed history of prescribed medication.

Statistical analysis

Sample size was estimated from the result of our preliminary cohort study which included 20 patients with sleep disturbance. After 6-month follow-up, mean PSQI global score was 6.2 points for surgical group (standard deviation = 2.4) and 7.7 points for conservative group. Considering a power of 0.90 (alpha = 0.05), we calculated a minimum sample size of 27 patients.

The effect of surgical treatment on the improvement of the PSQI score at the 6-month follow-up was evaluated using multivariate logistic regression analysis to adjust for confounding factors. Multivariate adjustment was initially done for the variables included in the propensity score matching (model 1) and also after adding radiologic variables (model 2). According to previous reports, a change in global PSQI score ≥ over three points or more was regarded as a minimal clinically important difference1.

The statistical tests were two-tailed, and a p-value of < 0.05 indicated significance. Analyses were performed using SPSS 24 (SPSS Inc., Chicago, Illinois, USA).

Ethics

This study was designed and conducted using the STROBE format (Strengthening the Reporting of Observational studies in Epidemiology) guidelines33. This study was approved by the institutional review board of Hallym University Sacred Heart Hospital (2019-04-002), and written informed consent was obtained from all patients.

Results

Enrollment and grouping

A total of 239 patients with LSS and sleep disturbance (PSQI ≥ 6 points) were identified for our study (Fig. 1). A 6-month follow-up was only available for 201 patients (84.1%), consisting of 147 patients with conservative treatment and 54 patients with surgical treatment. Using the propensity score, 96 (47.7%) of the 201 patients were finally matched (48 patients in each group; Fig. 1). Among 48 patients of the surgical group, 9 patients underwent only decompressive surgery, and 39 patients underwent additional spinal fusion.

Comparison of initial patients’ characteristics

Before matching, the surgical group used opioids more frequently during the 4 weeks before the enrollment than the conservative group (63% versus 44%, p = 0.018, Table 1). However, this difference was not significantly different after matching (60% versus 50%, p = 0.305, Table 1). The other variables of propensity score matching which include age, sex ratio, Charlson comorbidity index score, Hamilton depression rating scale, initial ODI score, initial global PSQI score, and use of pregabalin or gabapentin during the previous 4 weeks were more balanced after matching (Table 1).

Comparison of initial radiologic profile

Before matching, the surgical group had a statistically smaller dural sac cross-sectional area (45 mm2 versus 57 mm2, p = 0.014, Table 2). However, it was not significantly different after matching (45 mm2 versus 51 mm2, p = 0.323, Table 2), even though matching was not done for the dural sac cross-sectional area. Morphological grades of stenosis at the most severe level, presented by either central-type stenosis or foraminal-type stenosis, were not significantly different between the two groups before and after matching (Table 2).

Comparison of prescribed medications within the matched cohorts

There were no significant differences in prescribed medication use during the previous 4 weeks including NSAIDs, opioids, pregabalin or gabapentin, and sleep medication (Table 3). However, 6 weeks after enrollment, opioid, and pregabalin or gabapentin were less frequently prescribed in the surgical group. At 3 and 6-months after enrollment, NSAIDs, opioids, and pregabalin or gabapentin were also less frequently prescribed in the surgical group. There were no significant differences in the use of sleep medication during the follow-up period between the two groups.

Table 3 Comparison of prescribed medications during the follow-up period.

Comparison of changes in clinical profile within the matched cohorts

There were no differences in the initial ODI score and the initial global PSQI score between the two groups. However, the surgical group had significantly lower ODI scores at 3 and 6-months follow-up, and significantly lower global PSQI scores throughout the follow-up (Table 4, Fig. 2).

Table 4 Comparison of changes in clinical profile within the matched cohorts.
Figure 2
figure 2

Comparison of changes in clinical profile within the matched cohorts (*significant intergroup difference of p < 0.05). (a) Comparison of Oswestry Disability Index (ODI) score. (b) Comparison of Pittsburgh sleep quality index (PSQI) score. Middle line in the box: median value; x mark within the box: mean value; upper and lower edges of box: interquartile range including 50% of the observations; dots outside box: outliers. Whisker lines extend for 1.5 times the interquartile range.

Changes of the global PSQI score and the scores of its seven subcategories from the initial values were compared between the two groups (Fig. 3). As the treatment continued, there were decreasing trends in these values regardless of the treatment group, which means clinically improved sleep. However, the surgical group had lower (clinically improved) values in the global PSQI score (Fig. 3a), sleep latency index (Fig. 3c), sleep disturbance index (Fig. 3f), and daytime dysfunction index (Fig. 3h) than the conservative group at both 6-week and 6-month follow-up. Despite the significant difference in sleep latency (minutes), there was no difference in total sleep time (minutes) between the two groups at 6-months after enrollment (Fig. 4).

Figure 3
figure 3

Changes in global PSQI score and scores of its seven subcategories between the three time points (at 6 weeks, 3 months, and 6 months) and their initial values. (a) Global Pittsburgh sleep quality index (PSQI) score. (b) Sleep quality index. (c) Sleep latency index. (d) Sleep duration index. (e) Sleep efficacy index. (f) Sleep disturbance index. (g) Sleep medication index. (h) Daytime dysfunction index. (*significant intergroup difference of p < 0.05; #significant intergroup difference of p < 0.01).

Figure 4
figure 4

Comparison of sleep latency and total sleep time assessed by actual minutes (*significant intergroup difference of p < 0.05). (a) Comparison of sleep latency assessed by minutes. (b) Comparison of total sleep time assessed by minutes.

Effect of surgical treatment on the improvement of PSQI score at 6-month follow-up: multivariate analysis

At the 6-month follow-up, clinically meaningful improvement in sleep (a decreased global PSQI score ≥ 3 points) was observed in 50% (24 of 48 patients) of the conservative group, and 85.4% (41 of 48 patients) of the surgical group (Table 5). The multivariate logistic regression analysis shows that the surgical group had a significantly greater chance of improved sleep than the conservative group by the 6-month follow-up, in both matched and whole cohort.

Table 5 Effect of surgical treatment on the improvement of PSQI score at 6-month follow-up: multivariate analysis.

Factors associated with failure in sleep improvement: subgroup analysis within the whole conservative cohort

Although significant sleep improvement occurred in 85.2% (46 of 54 patients) of surgical group, sleep improvement occurred only in 45.6% (67 of 147 patients) of conservative group. Factors possibly associated poor outcome after conservative treatment were investigated comparing independent variables between the patients with and without sleep improvement within the whole conservative subgroup (Table 6). After subgroup analysis, patients without significant sleep improvement after conservative treatment showed significantly worse score in Hamilton depression rating scale (p = 0.022), and more severe degree of foraminal-type stenosis (p = 0.026) than patients with sleep improvement (Table 6).

Table 6 Factors associated with failure in sleep improvement: subgroup analysis within the whole conservative cohort.

Discussion

To the best of our knowledge, this is the first study to compare the effect of treatment for LSS on sleep quality, between patients managed conservatively and those treated surgically. Our findings suggest that sleep quality, as measured by the PSQI, was improved after treatment, regardless of the treatment method (Table 3, Fig. 3). However, the surgical group showed more rapid improvement of sleep after treatment. Sleep latency index, sleep disturbance index and index of daytime dysfunction were significantly lower in the surgical than in the conservative group, even 6 weeks after treatment (Fig. 3). In addition, the surgical group showed a consistent improvement in sleep after treatment. At 6 weeks, a clinically significant improvement in sleep (a decrease global PSQI score ≥ 3 points) was shown in both the conservative and the surgical group, but there was no difference between them. However, by the 6-month follow-up, clinical improvement of sleep was shown in 46% (22 of 48 patients) in the conservative group, and 85% (41 of 48 patients) in the surgical group and the difference was significant (p < 0.001, Table 4).

Rapid improvement in sleep seen as early as 6 weeks after surgical treatment for LSS is noteworthy. According to the studies that examined changes in sleep after surgical treatment for osteoarthritis of the knee and rotator cuff tear, sleep worsened 6 weeks after surgery, and significant improvement was only achieved 3 months after surgery23,24. However, in our cohort, global PSQI scores decreased as early as 6 weeks after surgical treatment (12.3–6.7, Table 4). We believe that immediate decompression or widening of the lumbar spinal canal by surgery enables such improvement in sleep. In patients with knee osteoarthritis or rotator cuff disease, gravitational joint force is minimized in a supine lying position for sleep. Therefore, their joint pain generally decreases during sleeping, and postoperative pain improvement during sleeping might not be notable early after surgery. Conversely, in LSS patients, their narrow intervertebral foramen and central spinal canal further decreases during lumbar extension compared to that during lumbar flexion25. During sleeping, LSS patients inevitably remain in a prolonged lying position with lumbar extension, causing significant narrowing of the spinal canal and intensifying LSS symptoms. A recent published study demonstrated that foraminal stenosis, which causes leg pain, is independently associated with sleep disturbance in LSS patients26. Surgical decompression of the spinal canal in LSS patients can immediately and permanently enlarge foraminal size, more effectively improve leg pain and resultantly improve sleep quality. In clinical practice we frequently observe LSS patients who needed to assume an uncomfortable lateral decubitus position with lumbar flexion even after pain block, being able to adopt a comfortable supine position with lumbar extension immediately after surgery. As expected, subgroup analysis of conservative cohort identified that patients without significant sleep improvement after conservative treatment showed more severe degree of foraminal-type stenosis (p = 0.026) than patients with sleep improvement (Table 7).

Table 7 Correlation between changes in pain and disability scale and changes in global PSQI score within the whole cohort.

Another characteristic finding of sleep improvement in the surgical group compared to that in the conservative treatment is its consistent effect on sleep quality. In the surgical group, the ODI and the global PSQI scores were consistently decreased during the follow-up period. Although the use of pain killers was significantly lower in the surgical group (Table 3), the percentage of patients who showed clinically meaningful improvement in the global PSQI score increased from 77 to 85% (Table 4). However, in the conservative group, the ODI score was also consistently decreased during the 6-month follow-up (Table 4), but the percentage of patients who showed clinically meaningful improvement in the global PSQI score rather decreased from 60 to 50% during the same period (Table 4).

The major limitation of our study is that we could not eliminate the risk of bias resulting from unknown confounders because of the non-randomized study design. To reduce such bias, we tried to investigate all possible variables which can affect sleep, including the degree of depression, education level, occupation, presence of degenerative joint disease known to be related to the sleep disturbance, extent of lumbar stenosis, and lifestyle factors such as smoking, alcohol intake, and caffeine intake. Then, we matched such variables between the conservative and the surgical group using propensity score matching (Tables 1, 2). Finally, we performed a multivariate analysis to additionally adjust such possible confounders within the matched cohorts.

Our findings must be carefully interpreted. Several patients in the conservative group were using intermittent pain killers including NSAIDS, opioid and pregabalin or gabapentin for LSS at the time of their enrollment (Table 3). However, 60% of patients (29 of 48 patients, Table 4) exhibited significant improvement in sleep within 6 weeks of enrollment in the conservative group. Such improvement might be attributable to the additional physical therapy or education program provided as part of our treatment. However, possible influence of unknown confounders should not be disregarded.

Our findings indicate that surgical treatment for LSS effectively improves sleep. However, the results should not be overestimated. Despite the superior outcome of surgical treatment over conservative treatment for LSS in terms of global PSQI score, we observed no significant difference between the two groups in terms of actual sleep duration throughout the follow-up (Table 4). Moreover, although surgical treatment eliminates the inconvenience of LSS patients due to positional intolerance while sleeping in, nocturnal leg cramps that disturb their sleep and wake them persisted even after surgical treatment. A study which evaluated the surgical outcomes of patients with LSS and nocturnal leg cramps reported that leg cramp improved after surgery in only 18.2% of patients34. Multiple factors cause sleep disturbance in patients with LSS, and sleep disturbance observed in our cohort might be associated with depression or other lifestyle factors in addition to chronic pain35. The favorable results noticed in such patients cannot be attributed merely to surgical treatment of LSS.

There were no differences in the use of prescribed medication at initial enrollment, whereas analgesics and anticonvulsants were more frequently prescribed in the conservative group after enrollment. Chronic use of such drugs could aggravate sleep disturbance in conservative group. Finally, even after propensity score matching, some independent variables such as dural sac cross-sectional area (mm2) showed differences between the two groups even though they were not statistically significant. However, we additionally performed multivariate adjustment including radiologic variables (model 2 in Table 5) after propensity score matching and surgical treatment still showed superior outcome for sleep disturbance despite severe degree of stenosis.

In conclusion, in patients with LSS, sleep quality measured by PSQI was initially improved after treatment, regardless of the treatment method. Sleep quality in the surgical group was remarkably improved early at 6 weeks after surgery and consistently improved during the follow-up period, despite less use of pain killer. In contrast, in the conservative group, improvement of sleep quality at 6-weeks after treatment was not maintained during the follow-up, although the treatment outcome for LSS measured by ODI was continuously improved. Sleep improvement was closely correlated with symptom improvement, especially improvement of leg pain, and multivariate analysis within propensity score matched cohorts identified surgical treatment for LSS patients with sleep disturbance had a significantly greater chance of improving sleep quality than conservative treatment. Subgroup analysis of conservative cohort identified that failure of sleep improvement in conservative group was significantly associated with depression presented by worse score in Hamilton depression rating scale, and more severe degree of foraminal-type stenosis, which should be carefully considered for conservative treatment of LSS patients with sleep disturbance.