Background

Primary Spinal Intradural Tumours (PSIT) include intramedullary (IMT) and extramedullary (EMT) tumours. IMTs are those that arise from the spinal cord itself and EMTs are those that arise from the structures within the dural sac but not from the spinal cord. Both IMTs and EMTs are rare tumours with an estimated incidence of 0.74 per 100,000 person years for both subtypes combined [1]. EMT tumours represent 70% of all intradural tumours, most of which are benign and amenable to surgical resection. A broad range of pathologies fall under the PSIT label, the commonest being meningiomas, schwannomas, and ependymomas [2]. IMTs in adults are most commonly ependymomas or astrocytomas. These are mostly slow growing, however 10% are more aggressive in their growth [3]. The mainstay of treatment for both EMTs and IMTs is early surgery [4]. However, surgical treatment for IMTs often involves performing a myelotomy with an increased risk of residual or worsened neurological deficit. As such, outcomes for IMTs may be worse than for EMTs.

PSITs may present with almost any neurological deficit affecting the upper or lower limbs, bladder or bowel dysfunction, or axial or radicular pain [5]. They therefore have the potential to substantially impact a patients functioning and quality of life. Patient Reported Outcomes (PROs) are measures coming directly from patients about a given health condition or treatment [6]. This umbrella term encompasses measures of health related quality of life (HRQoL), health status, symptom burden, adherence to treatment, and satisfaction with treatment [7]. Measurement of PROs is becoming increasingly important to planning and delivering clinical effectiveness research, for healthcare organisations, and for monitoring individual patient care [8]. The objective of this systematic review was to summarise all primary research that reports PROs in cohorts of adult patients with PSITs.

Methods

The protocol for this review was registered on PROSPERO (CRD 42022290780) and reporting has been performed in accordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines 2020.

Eligibility criteria

Eligibility criteria for study inclusion are listed in Table 1.

Table 1 Eligibility criteria for study inclusion.

Search strategy

Searches were performed in PubMed and Embase using a comprehensive search strategy (Supplementary Appendix 1). The searches were performed 05/11/2021. This search strategy was developed to identify the full breadth of PRO research involving PSITs. Finally, the reference lists for all included articles of this systematic review were examined for further eligible articles.

Selection process

Removal of duplicate articles was performed using EndNote X9.3.3. Abstract screening was performed on the online systematic review management platform (Rayyan). Abstracts were divided such that each abstract was reviewed by at least two co-authors (MAM, OA, CG, GR, AMSA). Full texts for articles included at the end of the screening process were reviewed against the eligibility criteria. Reasons for exclusion of articles were recorded at the full text review stage. Where studies appeared to be from the same author group and/or based on the same patient population, the study with the latest publication date was used.

Data collection process

Each article was read by two of five reviewers (AMSA, MAM, OA, CG, GR). All data were extracted and independently entered into a Microsoft Excel spreadsheet (v16.34, Microsoft, Washington DC, USA). Summary study details as well as details of PROs reported were extracted from articles. To facilitate analysis of PROs, these were categorised into global, physical, social, and mental health PROs.

Additional data extracted included: demographics and number of patients included: participation rate (the percentage of patients responding to invitations to contribute PROs), tumour histology, IMT/EMT location, treatment provided, method of PRO assessment (questionnaire vs. interview), study type, and timepoint of PRO assessment (e.g. before or after surgery). Where data was not obtainable, this was documented.

Quality of reporting

Quality of reporting in each article was assessed according to a modified version of the International Society for Quality of Life Research (ISOQOL) reporting standards. Although originally designed for use with PROs reported in randomised clinical trials (RCT), the ISOQOL standards have been implemented in systematic reviews assessing a variety of study types [9]. However, the modified version permits the use of these standards for any study type (see Supplementary Appendix 2 for full details) [10]. This was performed by one of four data collectors (MAM, OA, CG, GR) and each data point independently checked by the first author (AMSA).

Study risk of bias assessment

Two different tools were used to assess the risk of bias according to study type. The Joanna Briggs Institute (JBI) checklist was used for cross-sectional studies [11]. The Newcastle-Ottawa Scale (NOS) was used for cohort studies [11] The JBI and NOS tools were applied independently by one of four reviewers (MAM, OA, CG, GR) and each item independently confirmed by the first author (AMSA).

Results

Study selection

The literature search identified 2382 studies following removal of duplicates. Following abstract screening, 56 studies remained for full text review. A total of 11 studies were included in the final analysis. Figure 1 demonstrates this study’s PRISMA flowchart with reasons for exclusion after full text review.

Fig. 1: PRISMA flow diagram of search results.
figure 1

PRISMA flow diagram demonstrating search results, excluded studies, and included studies.

Study characteristics

Nearly half of included studies originated from North America (n = 5). The remainder were from Europe (n = 3), Japan (n = 2), and one study from South America. The total number of PSIT patients represented across these studies was 737 (range: 16–149). Of these, 392 were EMTs and 345 were IMTs. One study contained a control group in the form of brain tumour patients [12]. The commonest pathology was schwannoma (n = 190). The mean participation rate in these studies was 74% (range 43–97%). Where the mean age of the patients was reported, (n = 8 studies), the average age of all patients in this review was 44.6 years. Overall, 51.7% of patients were female.

Four studies were focused on EMT, five studies were on IMT, and two studies included both subtypes. In the two studies reporting on both pathological subtypes, outcomes were not broken down by EMT/IMT status. There were 4 cross sectional and 7 longitudinal cohorts. The longitudinal cohorts all assessed pre- and post-operative PROs. All studies were surgical series, and no study evaluated the impact of chemotherapy/radiotherapy. PRO assessments were collected through several different routes. This included completion by the patient online [12], in clinic [13,14,15,16], by mail [17,18,19], in person asked by an interviewer [20], via phone [21], or by a combination of methods [22]. For a full list with abbreviations of PRO tools used, see Table 2.

Table 2 Summary of study findings.

Risk of bias in studies

The results for risk of bias assessments are presented in Supplementary Appendix 3.

Quality of PRO reporting within studies

All studies included in this review achieved satisfactory scoring on the modified ISOQOL reporting standards. Due to heterogeneous reporting, varied levels of granularity, and varied PRO tools used, quantitative meta-analysis of findings was not possible. A qualitative synthesis of findings was therefore used.

Outcomes reported in EMTs

Global health

For EMTs, numerous studies identified an improvement in overall quality of life post-operatively at 12 months [13, 21] and beyond [22]. Post-operative overall QoL as measured using the Visual Analogue Scale (VAS) component on the EQ-5D was middle to high [13]. Although ongoing presence of pain was associated with worsened QoL, the strongest predictors of this were bladder/bowel symptoms and motor deficits [13]. Presence of sensory deficits alone did not significantly influence overall QoL [13]. Where studies attempted to identify the impact of tumour location, cervical tumours were associated with the greatest improvement post-operatively [22].

Physical health

In all EMT studies, significant improvement in post-operative pain at 12-months follow up were reported across a variety of reporting tools (Table 2) [13, 14, 21, 22]. This improvement occurred as early as <1-month follow-up [14] and was not associated with age, tumour type, or tumour location [13].

Physical functioning also improved post-operatively at 12-months measured using the Oswestry Disability Index (ODI), Neck Disability Index (NDI), or the Physical Component Score (PCS) component of the SF36 [21]. However, unlike the early improvement in pain reported above, an initial worsening in the ODI as well as the mobility and self-care EQ5D domains at <1-month was reported by Viereck et al, followed by improvement at all subsequent follow ups [14]. Assessed using the Brief Pain Index (BPI) and the MD Anderson Symptom Inventory (MDASI), Newman et al also reported significant improvements in physical health, general activity, walking ability, and sleep quality at >12-months [22].

Social health

Social health in EMTs was assessed in two studies. Participation in usual social activities as measured using the EQ5D appeared to significantly improve by 3–12 months (Table 2) [14]. This change was sustained beyond 12 months as measured using the BPI and MDASI [22].

Mental health

Self-reported depression and anxiety was measured using a variety of general tools including the EQ5D, SF36, BPI, and the MDASI (Table 2). In a cohort of 44 EMTs, Viereck et al reported a improvements in the anxiety and depression scores of the EQ5D post-operatively as early as <1-month, and at 1–3 months, 3-12 month, and > 12-month follow up [14]. Likewise, Chotai et al reported a significant improvement in overall mental health in a group of EMTs at the 12 month mark using the Mental Component Score (MCS) of the SF36 [21]. Overall mood and enjoyment of life also demonstrated significant improvement as measured using the BPI and MDASI at long term follow up [22].

Outcomes reported in IMTs

Global health

In contrast to EMTs, a general pattern of worsened overall QoL was reported for IMTs. Compared to national averages, overall SF36 scores were worse at a mean follow of 5.4 years post-operatively [17]. These were significantly associated with the existence of pain, as measured using the Neuropathic Pain Symptom Inventory (NPSI). In another cohort, although the minimal clinically important difference (MCID) in the EQ5D was reached in 28% of IMTs, the group average did not show a statistically significant improvement at a median of 12 months of follow up [16]. Ependymoma pathology was trending towards reaching MCID. In contrast to findings in EMTs by Newman et al, cervical location was associated with worsened outcomes in IMTs [22].

Three studies included ependymoma patients only. In a study of 16 ependymoma patients by Nakanashi et al. no improvement in total SF36 score post-operatively was identified at a follow up of 6–12 months [20]. At a median follow up of 5.4 years, Butenschoen et al noted an overall high disease burden and perceived disability in ependymoma patients as measured on the mean utility EQ5D [19]. A worsened outcome was associated with a worse neurological deficit as measured with the MMG, not being able to return to usual physical activities, and female gender. Acquaye et al compared ependymoma pathology in spinal versus cranial patients. They note a significantly higher symptom burden and symptom frequency in spinal ependymoma patients as well as overall worse SF36 scores as compared to cranial patients.

Physical health

In contast to EMTs, improvements in pain were not as prevelant in IMTs. At a mean follow-up of 5.4 years, 56% of 85 IMTs were found to have remaining moderate to severe pain measured using the NPSI [17]. The paresthesiae/dysesthesia domain of the NPSI was significantly worse than other pain categories (burning, pressing, or paroxysmal). Acquaye et al found the commonest reported symptom in ependymoma patients being numbness/tingling (reported by 59% of 149 patients) as well as pain (reported by 46%) [12]. Tumour pathology was found to potentially influence the type of pain perceived with astrocytomas being associated with burning pain and hemangioblastomas being associated with paroxysmal pain [17]. The worsening pain scores were independent of improvements in functional status, as measured using the Japanese Orthopoedic Association score (JOA).

For overall physical health, at 22 months median follow up, Xiao et al reported no significant improvement in mean Pain Disability Questionnaire (PDQ) scores despite 28% reaching MCID [16]. Improvements were associated with ependymoma pathology and thoracic location, while a worsening of the PDQ was associated with cervical location and the development of cerebrospinal fluid leak post-operatively. Acquaye et al reported that the PCS of the SF36 was worse in spinal ependymoma than cranial ependymoma patients [12]. They also identified specific physical symptoms that were reported frequently by ependymoma patients to include fatigue (46%), weakness (47%), and sexual dysfunction (39%). Butenschoen et al reported the worst SF36 domains for ependymoma patients being vitality and role limitations due to physical constraints [19]. These were significantly worse in patients reporting difficulties returning to any sporting activity.

Social health

Compared to brain ependymoma patients, spinal patients demonstrated reduced return to work and participation in usual activities [12]. There was a general shift from full time employment pre-op to part time or early retirement. Commonest reasons reported for this was persistent pain or physical stress. Individual level participation in sports remained stable, although overall frequency of sporting sessions reduced as well as a reduction in participation in group sports. Commonest reported limitations for participation in sports were pain, coordination problems, motor deficits, and a fear of injury. Poorer participation in sports was associated with poorer MMG post-operatively and more extensive operative approaches ( > 5 levels or laminectomy/laminoplasty vs. unilateral laminotomy).

Mental health

In a mixed group of IMTs [16], mean Patient Health Questionnaire 9 (PHQ-9) scores did not significantly differ between pre- and post-operative assessments (22-months median follow up). This was the only study where a dedicted mental health assessment tool was used. MCID in the PHQ-9 was reached in 16% of patients, but no predictors of improvement were identified. Compared to cranial ependymoma patients, the rates of self-reported depression were similar in spinal patients (19% vs. 15%) [12].

Outcomes reported in combined EMT/IMT cohorts

Global health

Two studies reported outcomes of EMTs and IMTs without reporting the outcomes for each pathology subgroup separately. In a mixed study of EMTs and IMTs, Bellut et al found that overall QoL was significantly improved with surgery at 3- and 12-month follow up [18]. They also identified significant satisfaction with surgery and a patient reported sense of surgery having been a useful intervention. In another combined series, Guirado et al reported a correlation with improved post-operative SF36 scores and post-operative neurological status as measuring using the MMG and the Aminoff-Logue Scale (ALS) [15]. Of note, in both studies where a mixed population is reported, the majority of pathologies were EMTs.

Physical health

At a mean of 20 months follow up, Guirado et al reported a worsened PCS component of the SF36 in a combined EMT/IMT cohort which was associated with myelopathy subscales of the ALS (gait, micturition, and bowel) and with poorer post-operative MMG. The SF36 vitality domain was significantly associated with the bowel domain of the ALS. However, in another combined series, Bellut et al reported improvements in axial and peripheral pain measured using the Core Outcome Measures Index (COMI) as early as 3 months follow up, and this was sustained at 12-months follow up [18].

Social health

In mixed IMT and EMT studies, overall social disability as measured using the COMI improved by 3 months [18], however work disability improvement was only detected at 12 months [18]. Overall improvement in social functioning as measured using the social functioning domain of the SF36 was associated with improved MMG post-operatively [15]. Prescence of bowel symptoms on the ALS was associated with a poorer role functioning score on the SF36 [15].

Mental health

In a mixed cohort of IMTs and EMTs, presence of myelopathic symptoms as measured using the ALS was associated with poorer scores in the MCS of the SF36.

Discussion

Collecting and using PROs in the evaluation of interventions allows for clinical care with a patient centred focus [23]. In this systematic review of 11 studies of PROs in 737 PSIT patients, global and physical health were the main patient reported dimensions. Despite the breadth of symptoms suffered by spinal patients, mental and social health was not reported as frequently. For EMTs, an overall improvement in outcomes after surgery was identified. However, for IMTs, poorer or unchanged outcomes across all PRO domains were found. This review also identified substantial heterogeneity in the PRO reporting in the field. In total, 21 PRO tools were identified. All studies evaluated the impact of surgical intervention. Most studies evaluated PROs up to 12-months post-operatively with few studies extending beyond this time point. The different trajectories of EMTs and IMTs necessitate reporting of outcomes separately for each category.

Overall QoL in EMTs was improved due to surgical interventions [13, 21, 22]. However, a similar pattern was not convincingly found in IMTs with no significant improvement found in mean QoL scores post-operatively [16, 20]. Across both tumour subtypes, predictors of better outcomes included an improved pre-operative, or post-operative neurological status and ependymoma pathology in IMTs [16]. Myelopathic symptoms including bladder or bowel symptoms in particular were identified as predictors of worsened overall QoL [13, 15].

Pain was measured with a variety of tools including the VAS, GRS, NRS, BPI, and MDASI. Across all measures, studies in EMTs demonstrates significant improvement in pain post-operatively that was long-lasting into the 12-month follow-up range [13, 14, 21, 22]. In contrast however, chronic pain pain was more common in IMTs and was a significant contributor to worsened QoL [17]. Interestingly, although the sample sizes were small in this study by Nakamura et al, associations were found between pathology of IMT and the subtype of pain.

HRQoL is a multi-dimensional entity, recognised in the varied domains questioned in commonly utilised tools such as the EQ5D or the SF36. Spinal pathology in particular has the potential to severely affect HRQoL through its impact on motor, sensory, pain, bladder, bowel, or sexual function. With the potential to create such varied deficits, it is pertinent to evaluate the effect of interventions on patient return to work and usual social activities that can improve quality of life such as engagement in family time or sporting activities. Few studies directly evaluated such dimensions. Where assessed, in EMTs a return to usual activities had significantly increased by 3 months [14] and sustained up to 12 months [22]. In contrast, for ependmomas, there was a reduction in return to work [19] and engagement in usual activities compared to brain ependymoma patients [12]. More invasive surgical approaches were associated with a reduction in patient return to sporting activity [19].

Although not often reported, improvements in mental health were sustained up to the 12-month mark in EMTs [14, 21]. The only dedicated depression/anxiety assessment tool used was the PHQ-9 in a cohort of IMTs where no significant change post-operatively was detected [16]. Compared to patients with brain ependymomas, the rates of self-reported depression were not significantly different to spinal patients (19% in brain, 15% in spine ependymoma patients) [12]. It is likely that with the significant symptom burden identified in this review and the impact on physical and social health that a significant mental health burden also exists. Regular collection of such outcomes is therefore pertinent to highlight the broader needs of this patient group.

In total, 21 PRO tools were used across studies. The SF36 was the commonest tool used. While the SF36 has been validated for other neurological [24] and non-neurological [25] conditions, it has not been validated for use in PSITs. However, the use of such global tools as the SF36 and EQ5D allow assessment of a broad range of HRQoL domains and comparison with other conditions. To allow the quantitative comparison of outcomes, a standardised set of PRO tools to be used in future reporting in spinal pathology is needed. In addition, use of tools with validated MCID would allow the identification of variables that lead to this clinically meaningful result. PRO tools tailored to account for the complex and varied deficits in spinal patients must also be used [15].

Acquaye et al compared ependymoma pathology in cranial and spinal patients and identified crucial differences between the two groups [12]. This included higher overall symptom burden in spinal patients, worse overall QoL, vitality scores, and PCS scores in the SF36, poorer return to work and usual activities in spinal ependymoma patients as well as comparable rates of depression and anxiety. These findings highlight the severity of symptoms suffered by patients with IMTs.

Finally, there is an emerging recognition that radiotherapy and chemotherapy have substantial effects on neural tissue as displayed in the reduction in cognitive function for patients receiving radiotherapy to the brain [26] or chemotherapy for any type of cancer [27]. However, no studies reported the effects of any such therapies. It is therefore difficult to disentangle the effects of surgery from those of chemotherapy/radiotherapy. This is particularly true for IMTs, where overall poorer outcomes were found and where chemotherapy or radiotherapy are more likely to be indicated [28]. Evaluating the impact of chemotherapy and radiotherapy is certainly an area for future focus in the field and again calls for the creation of granular registries.

Conclusion

HRQoL as measured by PROs demonstrated significant and long-lasting overall improvement in patients with EMTs following surgical intervention. This improvement extended to improvements in pain, physical health, social function, and mental health. Similar improvements in IMTs were not found. No studies were found that evaluated the impact of chemotherapy or radiotherapy on patients with PSITs. Also, there exists substantial heterogeneity in the literature reporting of PROs in PSITs. This heterogeneity precludes the quantitative evaluation of the effect of tumour pathology, location, and surgical approach – key variables needed in counselling and planning care for patients. This calls for the establishment of a key set of PROs for this patient group and possibly the use of granular registries to allow the pooling of data on these rare pathologies.