Abstract
Although deep brain stimulation (DBS) of the globus pallidus internus (GPi) and the subthalamic nucleus (STN) has become an established treatment for Parkinson’s disease (PD), a recent meta-analysis of outcomes is lacking. To address this gap, we performed a meta-analysis of bilateral STN- and GPi-DBS studies published from 1990-08/2019. Studies with ≥10 subjects reporting Unified Parkinson’s Disease Rating Scale (UPDRS) III motor scores at baseline and 6–12 months follow-up were included. Several outcome variables were analyzed and adverse events (AE) were summarized. 39 STN studies (2035 subjects) and 5 GPi studies (292 subjects) were eligible. UPDRS-II score after surgery in the stimulation-ON/medication-OFF state compared to preoperative medication-OFF state improved by 47% with STN-DBS and 18.5% with GPi-DBS. UPDRS-III score improved by 50.5% with STN-DBS and 29.8% with GPi-DBS. STN-DBS improved dyskinesia by 64%, daily OFF time by 69.1%, and quality of life measured by PDQ-39 by 22.2%, while Levodopa Equivalent Daily Dose (LEDD) was reduced by 50.0%. For GPi-DBS information regarding dyskinesia, OFF time, PDQ-39 and LEDD was insufficient for further analysis. Correlation analysis showed that preoperative L-dopa responsiveness was highly predictive of the STN-DBS motor outcome across all studies. Most common surgery-related AE were infection (5.1%) and intracranial hemorrhage (3.1%). Despite a series of technological advances, outcomes of modern surgery are still comparable with those of the early days of DBS. Recent changes in target selection with a preference of GPi in elderly patients with cognitive deficits and more psychiatric comorbidities require more published data for validation.
Similar content being viewed by others
Introduction
The first implantation of deep brain electrodes for tremor in Parkinson’s disease (PD) by Alim-Louis Benabid in the late 1980s in the thalamic ventral intermediate nucleus1 paved the way for the worldwide application of deep brain stimulation (DBS) in PD. Despite an initial lack of randomized controlled studies, DBS of the subthalamic nucleus (STN) or the internal part of the globus pallidus (GPi) quickly became a well accepted therapy for advanced PD with motor complications due to its convincing effect on motor symptoms, shown by smaller and uncontrolled studies2,3,4,5,6,7,8,9 and a highly quoted and influential meta-analysis by Kleiner-Fisman and colleagues10. Since then, the field of DBS has undergone important technical progress, and the efficacy and safety of bilateral STN- and GPi-DBS has been underlined by large randomized controlled trials (RCT)11,12,13,14,15,16,17,18 and many additional uncontrolled studies, while the choice of the best target remains a matter of controversial debate. This is partly explained by the fact that the results of the existing RCTs are difficult to compare due to their different primary outcomes such as quality of life13,15,17, time in ON without dyskinesia measured with patient diaries14,16, motor symptoms (UPDRS-III)11,12, and functional health measured with a composite score for cognitive, mood, and behavioral effects18. Fortunately, most of the RCTs also use some common scales either as primary or secondary outcomes. Importantly, all studies use the UPDRS, but when comparing UPDRS-III (whether primary or secondary endpoint), outcomes are highly variable across published randomized controlled studies.
Although several meta-analyses have been published in recent years, these focused either only on the outcome of specific symptoms or on RCTs19,20,21 and therefore include only a limited number of patients. Due to the different endpoints of the studies, the inconsistent reporting of symptoms such as dyskinesias or missing important outcome parameters, such as quality of life, these meta-analyses had to be basically limited to an analysis of motor outcome (UPDRS III) as a common secondary endpoint. Ultimately, both RCTs and previous meta-analyses leave many questions unanswered because of the aforementioned limitations. Our meta-analysis represents the first since the work of Kleiner-Fisman et al.10 to include the majority of available STN and GPi studies, allowing not only to add analysis of published outcomes from the last 15 years, but also maximizing the number of subjects included, and analyzing multiple outcome parameters simultaneously.
Results
Literature review
As shown in Supplementary Fig. 1, the literature search identified 256 original citations. Sixty-one articles were selected according to the inclusion criteria. Twenty-seven contributed to both safety and efficacy, 20 contributed to safety analysis only, and 12 contributed to efficacy analysis only. A detailed list of the corresponding articles is provided in Supplementary Table 1. Thirty-four articles met efficacy inclusion criteria, to which five additional articles, identified by P.K. and L.L., were added. These historical papers (for more detail see supplementary material) were added to facilitate the understanding of the existing literature without any relevant change in the statistical results. After this process, 39 STN studies2,3,11,12,13,15,16,17,18,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51 involving 2035 subjects with follow-up data (T1) for 1747 subjects and 5 GPi studies2,11,12,18,30 with 292 subjects (n at T1 = 291) were available for efficacy outcome analysis. For safety, 47 articles reporting on a total of 2818 enrolled subjects met the inclusion criteria.
Patient characteristics
Detailed patient characteristics and demographic information are shown in Supplementary Table 2. Study sample sizes (some papers reporting outcomes of both STN and GPi DBS) ranged from 10 to 299 (median [range] = 33) and follow-up time considered for this analysis spanned from 6 months to 2 years (weighted mean, 13.1 ± 5.9 months). The weighted mean age at surgery was 59.1 ± 2.9 years (range of means, 50.7–66.7). Weighted average disease duration prior to surgery was 12.2 ± 2.1 years (range of means, 7.3–19.0). Compared to the period covered by Kleiner-Fisman et al. (1993–2005)10, subjects of STN studies since 2005 were less affected at baseline based on UPDRS-II and -III scores (Table 1 and Supplementary Table 3).
Analyses of heterogeneity and bias
Cochran’s Q test revealed statistically significant evidence for heterogeneity between STN studies in the datasets collected for UPDRS-II and -III score (P < 0.0001), dyskinesia (P < 0.0013), daily OFF time, LEDD (P < 0.0001), and for improvement of PDQ-39 SI (P = 0.0008). Between the GPi studies, there was no significant evidence of heterogeneity for UPDRS-II (P = 0.5916) and -III (P = 0.4321) scores, possibly due to the small sample size, while the information for dyskinesia, daily OFF time, PDQ-39 SI, and LEDD was insufficient for further analysis.
Funnel plots and Egger’s regression test revealed statistically significant asymmetrical distribution only for the PDQ-39 outcome with STN-DBS (P = 0.0028) (Supplemantary Fig. 2 and Supplementary Table 4), which is due to a variation of the effect observed in smaller studies because of their sample size and might indicate publication bias.
UPDRS-II
Twenty-four STN-DBS studies with a total of 1346 subjects reported an estimated decrease in total UPDRS-II score at follow-up in the stimulation-ON/medication-OFF state compared with the preoperative OFF-medication state of 10.4 points (95% CI: 8.3–12.6; Fig. 1), equivalent to a 47% (95% CI: 37.4–56.7%) reduction. The extent of this decrease in UPDRS-II score showed a dose–response relationship with preoperative response to the L-Dopa challenge (Supplementary Fig. 3). Compared to the period covered by Kleiner-Fisman et al. (1993–2005)10, the STN studies since 2005 have shown a slightly lower response of UPDRS-II to DBS (49.9% vs. 40.9%, Table 1 and Supplementary Table 3).
For GPi, three studies with a total of 227 subjects were reporting a decrease in UPDRS-II scores in medication-OFF condition of 3.6 points (95% CI: 2.4–4.7), equivalent in percentage to a change of 18.5% (95% CI: 12.6–24.4%; difference between postoperative stimulation-ON/medication-OFF state and preoperative OFF-medication state; Table 1 and Fig. 2).
UPDRS-III
Thirty-eight STN studies comprising 1859 subjects reported change in UPDRS-III score and estimate of standard error. The estimated decrease in UPDRS-III score at follow-up compared to baseline was 22.1 points (95% CI: 19.9–24.3; Fig. 3), equivalent to a 50.5% (95% CI: 45.6–55.5%) reduction. The magnitude of decrease in UPDRS-III with STN stimulation showed a dose–response relationship with preoperative response to the L-Dopa challenge (Fig. 4). Since 2005, studies have shown a slightly lower response of UPDRS-III to STN-DBS compared to the period covered by Kleiner-Fisman et al. (1993–2005)10 (Table 1 and Supplementary Table 3). According to our literature search criteria, only five GPi studies with 289 subjects were available, which showed a decrease in UPDRS-III score after surgery of 13.0 points (95% CI: 10.7–15.4; Fig. 5), equivalent to a reduction of 29.8%.
Dyskinesia
Data for dyskinesias were very heterogeneous between studies, due to use of different rating scales. However, conversion of mean study scores to percentages relative to their respective scales allowed for comparison. For this purpose, only scales that provide an overall assessment of dyskinesias such as UPDRS-IV (item 32 to 35)52, the Abnormal Involuntary Movement Scale53, and the Marconi Dyskinesia Rating Scale54 were used. Patient diaries and individual items of the UPDRS IV could not be included as these assess either only the duration or the severity of dyskinesias and therefore do not allow such a conversion. For fourteen STN studies with 950 subjects, the average reduction in dyskinesia at follow-up could be determined and was 64.0% (95% CI: 56.4%–71.5%; Fig. 6A). For GPi-DBS there was a weighted mean improvement of 39.7% per Obeso et al.2 (n = 38) and Odekerken et al.18 (n = 62). Since only one of the included GPi studies2 provided the required data quality with standard deviation, a pooled estimate could not be performed.
Daily OFF time
Only six STN studies with 185 subjects provided information on the change in daily OFF time (item 39 of UPDRS-IV or patient diaries) and the estimation of the standard error. After converting the average study values into percentages relative to their respective scales, the comparison revealed an average decrease of 69.1% (95% CI: 46.8%–91.4%; Fig. 6B), while the information for GPi was not sufficient for further analysis.
Quality of life
Change in quality of life was assessed with the PDQ-39 SI55 in eleven STN studies with 627 subjects. An average improvement of the summary index score of 11.0 points (95% CI: 7.9–14.1; Fig. 7) at follow-up (approx. 22.2% of baseline weighted mean) was observed, while the information for GPi was not sufficient to perform the same analysis.
Levodopa equivalent daily dose (LEDD)
Thirty-one STN studies with 1644 subjects provided information regarding changes in LEDD with an average LEDD reduction after surgery of 50.0% (95% CI: 45.1%–54.8%; Supplementary Fig. 4), while the information for GPi was not sufficient to perform the same analysis.
Adverse events (AE)
Safety data were included from forty-seven STN and four GPi studies. The median proportion of surgery-related intracranial hemorrhage (ICH) was 3.1% including both target structures (3.45% for STN-DBS, 2.2% for GPi-DBS). The median proportion of surgery-related ICH with reported permanent neurological deficits was 1.65%. Some studies additionally distinguished ICH with temporary deficits and asymptomatic ICH with a median proportion of 1.0% and 2.1%, respectively. The median proportion of surgery-related infections was 5.1%. A detailed list of the proportions of surgery-, hardware-, disease-, stimulation-, and therapy-related side effects is provided in Supplementary Table 5.
Prediction of STN-DBS benefit
For thirty-two studies, the change in UPDRS-III score caused by L-dopa challenge at baseline (T0) was comparable to that achieved under STN-DBS only (medication-OFF). Graphical representation of this analysis is presented in Fig. 4. Unfortunately, the data available from our article pool for GPi-DBS did not allow us to conduct the same analysis for this therapeutic target.
To evaluate the possible predictive power of the pre-operative change for post-operative results of STN-DBS, normality of the datasets collected for both variables was evaluated through a Shapiro–Wilk test (P = 0.6673: P = 0.2569). A Pearson correlation test was then performed that revealed a statistically significant correlation between UPDRS-III change at baseline upon L-dopa challenge and UPDRS-III change (medication-OFF) after STN-DBS: correlation coefficient = 0.689958 (P = 0.00001).
Variables affecting prediction of benefit from STN-DBS
To identify factors likely to influence the quality of the prediction made by the change in UPDRS-III score upon pre-operative L-dopa challenge, the ratio of UPDRS-III change post-op/UPDRS-III pre-op L-dopa (“surgical efficiency index“) was used as a proxy for assessing the quality of a prediction (if >1, post-op benefit was underestimated by L-dopa challenge; if <1, post-op benefit was overestimated; if=1, perfect prediction). Correlation between this ratio (calculated for each study) and several variables was evaluated through a Pearson correlation test (if both datasets followed a normal distribution) or Spearman test (if one dataset did not follow a normal distribution).
From our correlation analysis, the average disease duration prior to implantation (R2 «1, Supplementary Fig. 5) appears to be the only measured preoperative variable of statistically significant influence on the predictive power of the pre-operative L-dopa challenge (P = 0.003) (Supplementary Table 6).
No statistically significant correlation was observed between the motor benefit prediction ratio and any other measured individual variable (Supplementary Table 6).
Discussion
The aim of this article was to provide an updated review of the published literature on the outcomes of bilateral STN- and GPi-DBS for PD. According to our inclusion criteria, a majority of studies focused on the STN (n = 39) involving 1747 subjects with follow-up data and, to a much lesser extent, the GPi (n = 5) with 291 subjects with follow-up data. Considering that the question of whether STN- or GPi-DBS is clinically superior is still a matter of controversial debate, and given our efforts to minimize biases introduced by the data retrieval and analysis method (e.g., use of random-effects model, standardized literature search, and data extraction protocol), the striking numerical predominance of the STN studies over the GPi studies is very surprising. GPi is an easier target as far as immediate postoperative management is concerned. Unlike STN-DBS, GPi-DBS does not require fine-tuning between stimulation intensity and medication dosage. Unilateral surgery can be easily proposed for GPi, as there is no deterioration of the non-operated side in relation to postoperative medication reduction56. After the publication of the US Veterans Administration study comparing STN versus GPi-DBS, a triumphant return of pallidal stimulation was predicted57. Indeed, there has been a shift towards implantation of GPi based on understanding of the outcomes of the VA study11. GPi targeting increased and was selected for older patients with poorer cognitive and mood indices58. Our meta-analysis of the literature shows that this new popularity of GPi as a target has not yet translated into publications addressing GPi-DBS and justifying such a gradual shift in patient selection. Future randomized controlled trials are needed to evaluate the impact on quality of life when recruiting patients with such selection bias, as surgical complications increase with age39, depression and cognitive decline are important determinants of quality of life in PD59, and a previous study had shown that in PD patients with preoperatively borderline impaired cognition, STN-DBS did not provide any benefit in terms of quality of life60.
Even 14 years after the meta-analysis by Kleiner-Fisman et al.10, which provided the first comprehensive results to estimate the extent of the effects of STN-DBS in PD, studies reporting the outcomes for the main indication of DBS, namely motor fluctuations and dyskinesias, are still very heterogeneous, due to missing reporting or more importantly, different evaluation tools (UPDRS items in different combinations, different specific rating scales, or patient diaries). As a result, the available and comparable data are limited, especially with regard to dyskinesia, and, at least in the context of our analysis, only allow a comparison of percentage changes in different specific total dyskinesia scores, but not of dyskinesia severity or dyskinesia duration only. This highlights the urgent need for a uniform and more detailed assessment of these motor complications across centers. On average, dyskinesia and daily OFF time were significantly improved by STN stimulation (62.5% decrease in dyskinesia, n = 950; 69.1% decrease of OFF time), whereas the available data for GPi-DBS was not sufficient for a comprehensive analysis, but at least showed a weighted mean improvement of 39.7%. In contrast, change in UPDRS-III scores in the stimulation-ON/medication-OFF condition compared to the baseline medication-OFF condition is available throughout almost all publications and therefore still the most commonly used measure to assess DBS efficacy. On average, there was a convincing improvement in UPDRS-III scores by STN- and GPi-DBS compared to preoperative baseline. Although we have not conducted a comparative analysis of these two targets and the discrepant counts of STN and GPi studies affects the precision of our estimates, the improvement in motor symptoms (UPDRS-III) of 50.5% for STN-DBS versus 29.8% for GPi-DBS suggests that STN-DBS is superior to GPi-DBS in terms of motor outcomes.
Interestingly, despite technical advances, STN studies showed a slightly lower response to DBS in the period 2005–2019 compared with the period 1993–2004 covered by Kleiner-Fisman et al.10, which may be due to the fact that subjects enrolled in the studies since 2005 had a shorter disease duration and were less affected in terms of UPDRS-II and -III. In addition, this may also be explained by the fact, that the technical achievements in imaging, targeting, and intraoperative procedures have not yet translated into a relevant advantage over the reviewed period, as they have not all been consistently used by all centers.
The question of the optimal DBS target in PD may not be sufficiently clarified with the UPDRS-III medication OFF score as the primary endpoint, because it does not take into account non-motor symptoms. Results of the two target structures are also difficult to compare, as both targets require very different postoperative management. A relevant LEDD reduction is only possible with STN-DBS and also mandatory to reduce dyskinesia. This, in turn, can lead to apathy and dopamine withdrawal syndrome on one side, or to an improvement of impulse control disorders on the other side with a major impact on quality of life61. Therefore, quality of life assessed by the PDQ-39 may be more appropriate to assess DBS efficacy, as this outcome measure is affected not only by changes in motor symptoms but also by changes in non-motor symptoms and motor complications. For this reason, QOL measurement was introduced in DBS62 and was chosen as the main outcome criterion in the very first randomized controlled trial on DBS13. Our meta-analysis showed a moderate improvement in quality of life (PDQ-39) of approximately 22% with STN-DBS, while the information for GPi was not sufficient for analysis. Although there were only a few STN studies (n = 11) with high variance of results, this improvement is clinically highly relevant, when considering that patients with best medical treatment tend to worsen their quality of life over the same period61. Activities of daily living (UPDRS-II) as a main determinant of quality of life improved on average by 47% after STN surgery, while the GPi studies showed a lower improvement of 18.5%. However, due to the relatively few GPi studies that qualified for analysis, this finding must be interpreted with caution, in particular considering the fact that three years after DBS surgery there was no difference in long-term results between STN and GPi with regard to quality of life63,64,65.
Overall, the published studies are heterogeneous, especially regarding the (primary) outcome parameters, the assessment tools used, and the selection criteria for DBS, which can certainly be explained in part by our inclusion criteria and the inclusion of small non-randomized open-label studies. However, this heterogeneity also affects the RCTs as there are differences in the choice of primary outcome parameters, partly due to the different requirements of the regulatory authorities in the United States and Europe. Furthermore, patient selection is not identical as far as levodopa-sensitivity is concerned (Fig. 4). Postoperative management can be impacted by differences in healthcare systems, which can determine whether DBS is managed on an outpatient or an in-hospital basis with consequences on potential expert time devoted to an individual patient. Therefore, we believe that this heterogeneity reflects clinical reality. Many questions including target preference (STN vs GPi) could not be solved so far by randomized controlled trials, which did not come to identical conclusions11,18, resulting in DBS targeting (STN vs GPi) based on experience of a team, differences in health care systems, and analysis of the full literature, rather than just evidence-based medicine. Therefore, our meta-analysis highlights the need to identify predictors of motor and non-motor outcomes or to investigate the impact of new techniques, such as the advent of newer imaging and targeting techniques, MRI-guided asleep DBS, current steering, or closed loop DBS. To address such knowledge gaps, the use of prospective registries with targeted data collection in the pre-operative, operative, and post-operative phases of DBS treatment66 may be a good option providing larger data allowing for more detailed analyses67.
Although the efficacy of DBS for the treatment of PD is well recognized, preoperative predictive factors for a favorable outcome of DBS are still not sufficiently known. In accordance with Kleiner-Fisman et al.10, the preoperative L-dopa responsiveness was highly predictive of the motor outcome of STN-DBS (Pearson P = 0.00001). From our correlation analysis, the average disease duration prior to implantation appears to be influencing the quality of the prediction made by pre-operative-L-dopa challenge for the benefit of STN-DBS (P = 0.003). The longer the patient is suffering from PD before the implantation, the more the prediction made by pre-operative L-dopa response will correspond to the possible improvement caused by STN-DBS. Motor fluctuations, as measured by the MDS-UPDRS during the L-dopa challenge, depend on fluctuations in dopamine concentration in the nigro-striatal dopaminergic synapse that increase over time due to progressive degeneration of dopaminergic nigral neurons68. The prominent on–off fluctuations in later disease stages with severe degeneration are good predictors of DBS outcome69,70. In earlier disease stages with partially preserved dopamine buffering function of presynaptic neurons, prediction of potential benefit based on L-dopa challenge might be less reliable because of the more erratic wearing-off. This should be kept in mind when discussing DBS in patients with earlier disease stages and less pronounced motor fluctuations. In these patients, long-acting dopamine agonists may mask the true severity of untreated parkinsonism and should therefore be discontinued several days before the L-dopa challenge. This strategy was successfully used in the EARLYSTIM trial17, and based on this experience, we would recommend such an approach on an individual basis closely monitoring the patients in order to prevent dopamine agonist withdrawal syndrome71. Yet the poor fit of the linear regression shows that average disease duration is not the only factor influencing the prediction quality. It is highly likely that other unmeasured factors or unpublished data such as electrode placement or unmeasurable factors such as comorbidity also influence the quality of the prediction made by pre-operative-L-dopa challenge for the benefit of STN-DBS. Although there may be less certainty about the predictive power of the preoperative levodopa response for the benefit of STN-DBS in patients with early disease stage, the degree of motor and quality of life benefits in these patients is comparable17. For this reason, and as there is no loss of efficacy reported for STN-DBS in the very long term72, we would recommend STN-DBS in the presence of motor complications in early disease stages.
The median proportion of all surgery-related intracranial bleeding (ICH) for both targets together and regardless of whether symptomatic or not, was 3.1% in our review and 1.65% for symptomatic bleeding with permanent consequences, which is within the published ICH incidence range of 1.2–5.0%73. Almost exclusively in STN studies, an additional distinction was made between ICH with transient deficits (1.0%) and asymptomatic (2.1%) ICH, with the latter only documented in a few studies. Surgery-related infection was the most frequent complication of DBS with a median proportion of 5.1%. In the literature, frequencies of infectious complications have been reported in a range between 0 and 15%73,74,75, but this variation is likely due to the different definitions of a postoperative infection across centers.
Overall, the analysis of AE revealed an inconsistent and non-systematic reporting across study centers and highlights the urgent need for a uniform recording with unambiguous categories as already proposed by others74.
A limitation of this review is that due to the substantial numerical discrepancies between STN and GPi studies a comparative analysis between these two targets was not possible. In addition, a detailed analysis of motor complications as well as other individual variables such as surgical techniques, patient selection, and the quality of postoperative management was also not possible due to the lack of reporting or differences in the used assessment tools across centers. Nevertheless, by including also non-randomized open studies we maximized the number of studies and subjects to avoid bias and also enabled the analysis of other variables such as quality of life compared to previous meta-analyses.
In summary, DBS is an established and effective treatment for levodopa-responsive PD. While in the early days of DBS contrast ventriculography and multichannel microelectrode recording (MER) were mandatory for successful targeting, advances in imaging have now made it possible to achieve good clinical results with DBS even without MER76. Although our meta-analysis does not allow us to distinguish between the influences of the many individual variables of surgical techniques, patient selection, and the quality of postoperative management, it is rather surprising that the recent very convincing results of modern surgery with its technological advances77 are still comparable to the efficacy and safety results of the pioneering team from Grenoble3.
To conclude, the numerical predominance of the STN studies compared to GPi studies clearly indicates that the STN has become the preferred target for the treatment of levodopa-responsive PD.
Methods
Literature search and selection of articles
A comprehensive review of the literature from 1990 until August 2019 was conducted using PubMed database. Search terms included “deep brain stimulation, “neurostimulation” “Parkinson’s disease,” “subthalamic nucleus,” “globus pallidus pars interna”. The search string excluded reviews, meta-analyses, and case reports. The search was limited to articles in English language. This systematic review process and meta-analysis was performed as outlined in the PRISMA Statement (Preferred Reporting Items for Systematic Reviews and Meta-Analyses)78. Retrieved abstracts were reviewed, and distinct inclusion criteria were applied to the selection of articles for effectiveness and safety analysis. For the efficacy analysis, abstracts were scrutinized to include only articles with a minimum of 10 subjects reporting Unified Parkinson’s Disease Rating Scale (UPDRS) III scores at baseline and follow-up (between 6 months and 12 months after the implantation of the DBS electrodes). For more detailed information on search and selection criteria see supplementary material. After the initial review process five additional studies3,11,17,26,36 of seminal contribution to the clinical field of the present review were identified by the authors (P.K. and L.L) and added to the analysis, even though they did not fully comply with the search criteria as defined above. For a more detailed specification see supplementary material.
Data extraction
The following key study characteristics and patients’ demographics were extracted: name of first author, publication year, enrollment start, and study sample size, therapeutic target location, mean age at surgery, mean disease duration, and pre-operative levodopa responsiveness. Clinical data reported before and after surgery on following variables were retrieved from the selected articles: UPDRS-II and -III score, Levodopa Equivalent Daily Dose (LEDD), dyskinesia severity, mean daily OFF time, and Parkinson’s Disease Questionnaire summary index (PDQ-39 SI). Preoperative L-dopa responsiveness (% difference between mean preoperative medication-OFF UPDRS-III score and mean medication-ON UPDRS-III score) and postoperative response to DBS (% difference between mean preoperative medications-OFF UPDRS-III score and mean postoperative medication-OFF/stimulation-ON UPDRS-III score) were calculated. The UPDRS-III score from the 6- or 12-month post-operation time points (except for Follett et al.11 and Schuepbach et al.17 for which 24-month time point was selected) was used as the postoperative UPDRS-III score. Data were aggregated regarding the different outcomes. Furthermore, a surgical efficiency index was defined
For the safety analysis, the adverse events (AE) were listed if reported in at least 4 studies and were classified as surgery-related, hardware-related, an interaction of body and hardware, or in subcategories of stimulation-induced effects.
Statistical analysis
Heterogeneity of the datasets to be analyzed was evaluated through both Cochran’s Q79 and Higgins I2 tests80 for the sake of relevance. Based on heterogeneity and the sample size of datasets, restricted maximum likelihood estimates of change in absolute scores/dose or percentage-of-score/dose-at-baseline after surgery (difference between postoperative stimulation-ON/medication-OFF and preoperative medication-OFF condition) were generated using the random-effects model, as suggested by Jackson and colleagues81.
The presence of bias was explored graphically by constructing funnel plots, with estimates of surgery effect per study plotted against the standard errors associated with these estimates. Furthermore, statistically significant evidence of bias was sought using Egger’s regression test82.
For STN-DBS, sample size allowed for the analysis of the influence of available baseline variables such as average disease duration, average age at implantation, UPDRS-II and -III scores, PDQ-39 SI, publication year, enrollment start, LEDD, and L-dopa responsiveness on the predictive power of preoperative L-dopa responsiveness at baseline. Normality of the distribution of datasets collected for the variables was verified through a Shapiro–Wilk test. Correlation of each dataset with the surgical efficiency index was evaluated using Pearson’s test when both datasets followed a normal distribution or Spearman’s test when one of the datasets was not following a normal distribution.
The readxl83 and metafor84 libraries were used within the R project v3.6.185 (extended with RStudio v1.1.463 GUI) for statistical analysis. The Fiji86 software was used for linear estimation of scores on plots of publications.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Benabid, A. L., Pollak, P., Louveau, A., Henry, S. & de Rougemont, J. Combined (thalamotomy and stimulation) stereotactic surgery of the VIM thalamic nucleus for bilateral Parkinson disease. Appl. Neurophysiol. 50, 344–346 (1987).
Deep-Brain Stimulation for Parkinson’s Disease Study, G. et al. Deep-brain stimulation of the subthalamic nucleus or the pars interna of the globus pallidus in Parkinson’s disease. N. Engl. J. Med. 345, 956–963 (2001).
Limousin, P. et al. Electrical stimulation of the subthalamic nucleus in advanced Parkinson’s disease. N. Engl. J. Med. 339, 1105–1111 (1998).
Limousin, P. et al. Effect of parkinsonian signs and symptoms of bilateral subthalamic nucleus stimulation. Lancet 345, 91–95 (1995).
Siegfried, J. & Lippitz, B. Bilateral chronic electrostimulation of ventroposterolateral pallidum: a new therapeutic approach for alleviating all parkinsonian symptoms. Neurosurgery 35, 1126–1129 (1994). discussion 1129-1130.
Ghika, J. et al. Efficiency and safety of bilateral contemporaneous pallidal stimulation (deep brain stimulation) in levodopa-responsive patients with Parkinson’s disease with severe motor fluctuations: a 2-year follow-up review. J. Neurosurg. 89, 713–718 (1998).
Kumar, R. et al. Double-blind evaluation of subthalamic nucleus deep brain stimulation in advanced Parkinson’s disease. Neurology 51, 850–855 (1998).
Pahwa, R. et al. High-frequency stimulation of the globus pallidus for the treatment of Parkinson’s disease. Neurology 49, 249–253 (1997).
Volkmann, J. et al. Bilateral high-frequency stimulation of the internal globus pallidus in advanced Parkinson’s disease. Ann. Neurol. 44, 953–961 (1998).
Kleiner-Fisman, G. et al. Subthalamic nucleus deep brain stimulation: summary and meta-analysis of outcomes. Mov. Disord. 21, S290–S304 (2006). Suppl 14.
Follett, K. A. et al. Pallidal versus subthalamic deep-brain stimulation for Parkinson’s disease. N. Engl. J. Med. 362, 2077–2091 (2010).
Anderson, V. C., Burchiel, K. J., Hogarth, P., Favre, J. & Hammerstad, J. P. Pallidal vs subthalamic nucleus deep brain stimulation in Parkinson disease. Arch. Neurol. 62, 554–560 (2005).
Deuschl, G. et al. A randomized trial of deep-brain stimulation for Parkinson’s disease. N. Engl. J. Med. 355, 896–908 (2006).
Weaver, F. M. et al. Bilateral deep brain stimulation vs best medical therapy for patients with advanced Parkinson disease: a randomized controlled trial. JAMA 301, 63–73 (2009).
Williams, A. et al. Deep brain stimulation plus best medical therapy versus best medical therapy alone for advanced Parkinson’s disease (PD SURG trial): a randomised, open-label trial. Lancet Neurol. 9, 581–591 (2010).
Okun, M. S. et al. Subthalamic deep brain stimulation with a constant-current device in Parkinson’s disease: an open-label randomised controlled trial. Lancet Neurol. 11, 140–149 (2012).
Schuepbach, W. M. et al. Neurostimulation for Parkinson’s disease with early motor complications. N. Engl. J. Med. 368, 610–622 (2013).
Odekerken, V. J. et al. Subthalamic nucleus versus globus pallidus bilateral deep brain stimulation for advanced Parkinson’s disease (NSTAPS study): a randomised controlled trial. Lancet Neurol. 12, 37–44 (2013).
Liu, Y. et al. Meta-analysis comparing deep brain stimulation of the globus pallidus and subthalamic nucleus to treat advanced Parkinson disease. J. Neurosurg. 121, 709–718 (2014).
Mansouri, A. et al. Deep brain stimulation for Parkinson’s disease: meta-analysis of results of randomized trials at varying lengths of follow-up. J. Neurosurg. 128, 1199–1213 (2018).
Sako, W., Miyazaki, Y., Izumi, Y. & Kaji, R. Which target is best for patients with Parkinson’s disease? A meta-analysis of pallidal and subthalamic stimulation. J. Neurol. Neurosurg. Psychiatry 85, 982–986 (2014).
Capecci, M. et al. Functional improvement after subthalamic stimulation in Parkinson’s disease: a non-equivalent controlled study with 12-24 month follow up. J. Neurol. Neurosurg. Psychiatry 76, 769–774 (2005).
Chan, D. T. et al. Subthalamic nucleus deep brain stimulation for Parkinson disease in Hong Kong: a prospective territory-wide 2-year follow-up study. World Neurosurg. 93, 229–236 (2016).
Cilia, R. et al. Brain networks underlining verbal fluency decline during STN-DBS in Parkinson’s disease: an ECD-SPECT study. Parkinsonism Relat. Disord. 13, 290–294 (2007).
Fluchere, F. et al. Controlled general anaesthesia for subthalamic nucleus stimulation in Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 85, 1167–1173 (2014).
Ford, B. et al. Subthalamic nucleus stimulation in advanced Parkinson’s disease: blinded assessments at one year follow up. J. Neurol. Neurosurg. Psychiatry 75, 1255–1259 (2004).
Fraix, V. et al. Clinical and economic results of bilateral subthalamic nucleus stimulation in Parkinson’s disease. J. Neurol. Neurosurg. Psychiatry 77, 443–449 (2006).
Garcia Ruiz, P. J. et al. Evaluation of timed tests in advanced Parkinsonian patients who were candidates for subthalamic stimulation. Clin. Neuropharmacol. 28, 15–17 (2005).
Garcia-Garcia, D. et al. Stimulation sites in the subthalamic nucleus and clinical improvement in Parkinson’s disease: a new approach for active contact localization. J. Neurosurg. 125, 1068–1079 (2016).
Haegelen, C. et al. Functional atlases for analysis of motor and neuropsychological outcomes after medial globus pallidus and subthalamic stimulation. PloS One 13, e0200262 (2018).
Herzog, J. et al. Two-year follow-up of subthalamic deep brain stimulation in Parkinson’s disease. Mov. Disord. 18, 1332–1337 (2003).
Jaggi, J. L. et al. Bilateral stimulation of the subthalamic nucleus in Parkinson’s disease: surgical efficacy and prediction of outcome. Stereotact. Funct. Neurosurg. 82, 104–114 (2004).
Jiang, J. L., Chen, S. Y. & Tsai, S. T. Quality of life in patients with Parkinson’s disease after subthalamic stimulation: an observational cohort study for outcome prediction. Ci ji yi xue za zhi = Tzu-chi Med. J. 31, 107–112 (2019).
Jiang, L. L. et al. Long-term efficacy of subthalamic nucleus deep brain stimulation in Parkinson’s disease: a 5-year follow-up study in China. Chin. Med. J. 128, 2433–2438 (2015).
Kim, R. et al. Long-term effect of subthalamic nucleus deep brain stimulation on freezing of gait in Parkinson’s disease. J. Neurosurg. 131, 1797–1804 (2019).
Krack, P. et al. Five-year follow-up of bilateral stimulation of the subthalamic nucleus in advanced Parkinson’s disease. N. Engl. J. Med. 349, 1925–1934 (2003).
Lefranc, M. et al. Asleep robot-assisted surgery for the implantation of subthalamic electrodes provides the same clinical improvement and therapeutic window as awake surgery. World Neurosurg. 106, 602–608 (2017).
Martinez-Martin, P. et al. Bilateral subthalamic nucleus stimulation and quality of life in advanced Parkinson’s disease. Mov. Disord. 17, 372–377 (2002).
Ory-Magne, F. et al. Does ageing influence deep brain stimulation outcomes in Parkinson’s disease? Mov. Disord. 22, 1457–1463 (2007).
Ostergaard, K. & Aa Sunde, N. Evolution of Parkinson’s disease during 4 years of bilateral deep brain stimulation of the subthalamic nucleus. Mov. Disord. 21, 624–631 (2006).
Pahwa, R., Wilkinson, S. B., Overman, J. & Lyons, K. E. Bilateral subthalamic stimulation in patients with Parkinson disease: long-term follow up. J. Neurosurg. 99, 71–77 (2003).
Portman, A. T. et al. Chronic stimulation of the subthalamic nucleus increases daily on-time without dyskinesia in advanced Parkinson’s disease. Parkinsonism Relat. Disord. 12, 143–148 (2006).
Romito, L. M. et al. Replacement of dopaminergic medication with subthalamic nucleus stimulation in Parkinson’s disease: long-term observation. Mov. Disord. 24, 557–563 (2009).
Schupbach, W. M. et al. Stimulation of the subthalamic nucleus in Parkinson’s disease: a 5 year follow up. J. Neurol. Neurosurg. Psychiatry 76, 1640–1644 (2005).
Thobois, S. et al. Subthalamic nucleus stimulation in Parkinson’s disease: clinical evaluation of 18 patients. J. Neurol. 249, 529–534 (2002).
Timmermann, L. et al. Multiple-source current steering in subthalamic nucleus deep brain stimulation for Parkinson’s disease (the VANTAGE study): a non-randomised, prospective, multicentre, open-label study. Lancet Neurol. 14, 693–701 (2015).
Tir, M. et al. Exhaustive, one-year follow-up of subthalamic nucleus deep brain stimulation in a large, single-center cohort of parkinsonian patients. Neurosurgery 61, 297–304 (2007). discussion 304-295.
Tsai, S. T. et al. Prognostic factors of subthalamic stimulation in Parkinson’s disease: a comparative study between short- and long-term effects. Stereotact. Funct. Neurosurg. 87, 241–248 (2009).
Vingerhoets, F. J. et al. Subthalamic DBS replaces levodopa in Parkinson’s disease: two-year follow-up. Neurology 58, 396–401 (2002).
Yamamoto, T. et al. Long term follow-up on quality of life and its relationship to motor and cognitive functions in Parkinson’s disease after deep brain stimulation. J. Neurol. Sci. 379, 18–21 (2017).
Yang, C. et al. Analysis of contact position for subthalamic nucleus deep brain stimulation-induced hyperhidrosis. Parkinson’s Dis. 2019, 8180123 (2019).
Fahn, S., Elton, R. & Committee, m. o. t. U. D. Recent developments in Parkinson’s disease (eds Fahn S., Marsden C. D., Goldstein M., et al.) (Macmillan Healthcare Information, 1987).
Guy, W. ECDEU Assessment Manual for Psychopharmacology (ed Guy W.) (US Department of Health, Education and Welfare, 1976).
Marconi, R. et al. Levodopa-induced dyskinesias in Parkinson’s disease phenomenology and pathophysiology. Mov. Disord. 9, 2–12 (1994).
Jenkinson, C., Fitzpatrick, R., Peto, V., Greenhall, R. & Hyman, N. The Parkinson’s Disease Questionnaire (PDQ-39): development and validation of a Parkinson’s disease summary index score. Age Ageing 26, 353–357 (1997).
Okun, M. S. Deep-brain stimulation for Parkinson’s disease. N. Engl. J. Med. 367, 1529–1538 (2012).
Okun, M. S. & Foote, K. D. Subthalamic nucleus vs globus pallidus interna deep brain stimulation, the rematch: will pallidal deep brain stimulation make a triumphant return? Arch. Neurol. 62, 533–536 (2005).
Southwell, D. G. et al. Before and after the veterans affairs cooperative program 468 study: deep brain stimulator target selection for treatment of Parkinson’s disease. Parkinsonism Relat. Disord. 48, 40–44 (2018).
Schrag, A. & Quinn, N. What contributes to quality of life in Parkinson’s disease: a re-evaluation. J. Neurol. Neurosurg. Psychiatry 91, 563–565 (2020).
Witt, K. et al. Negative impact of borderline global cognitive scores on quality of life after subthalamic nucleus stimulation in Parkinson’s disease. J. Neurol. Sci. 310, 261–266 (2011).
Martinez-Martin, P. et al. Interpretation of health-related quality of life outcomes in Parkinson’s disease from the EARLYSTIM Study. PloS One 15, e0237498 (2020).
Lagrange, E. et al. Bilateral subthalamic nucleus stimulation improves health-related quality of life in PD. Neurology 59, 1976–1978 (2002).
Odekerken, V. J. et al. GPi vs STN deep brain stimulation for Parkinson disease: three-year follow-up. Neurology 86, 755–761 (2016).
Volkmann, J. et al. Long-term effects of pallidal or subthalamic deep brain stimulation on quality of life in Parkinson’s disease. Mov. Disord. 24, 1154–1161 (2009).
Weaver, F. M. et al. Randomized trial of deep brain stimulation for Parkinson disease: thirty-six-month outcomes. Neurology 79, 55–65 (2012).
Schneider, R. B. et al. Acute readmission following deep brain stimulation surgery for Parkinson’s disease: a nationwide analysis. Parkinsonism Relat. Disord. 70, 96–102 (2020).
Martino, D. et al. The 5 pillars in Tourette syndrome deep brain stimulation patient selection: present and future. Neurology 96, 664–676 (2021).
de la Fuente-Fernandez, R. et al. Biochemical variations in the synaptic level of dopamine precede motor fluctuations in Parkinson’s disease: PET evidence of increased dopamine turnover. Ann. Neurol. 49, 298–303 (2001).
Charles, P. D. et al. Predictors of effective bilateral subthalamic nucleus stimulation for PD. Neurology 59, 932–934 (2002).
Welter, M. L. et al. Clinical predictive factors of subthalamic stimulation in Parkinson’s disease. Brain 125, 575–583 (2002).
Rabinak, C. A. & Nirenberg, M. J. Dopamine agonist withdrawal syndrome in Parkinson disease. Arch. Neurol. 67, 58–63 (2010).
Bove, F. et al. Beyond 15 years of subthalamic deep brain stimulation in Parkinson disease. Neurology in Press (2021).
Voges, J. et al. Thirty days complication rate following surgery performed for deep-brain-stimulation. Mov. Disord. 22, 1486–1489 (2007).
Engel, K. et al. Towards unambiguous reporting of complications related to deep brain stimulation surgery: a retrospective single-center analysis and systematic review of the literature. PloS One 13, e0198529 (2018).
Bhatia, S., Zhang, K., Oh, M., Angle, C. & Whiting, D. Infections and hardware salvage after deep brain stimulation surgery: a single-center study and review of the literature. Stereotact. Funct. Neurosurg. 88, 147–155 (2010).
Foltynie, T. et al. MRI-guided STN DBS in Parkinson’s disease without microelectrode recording: efficacy and safety. J. Neurol. Neurosurg. Psychiatry 82, 358–363 (2011).
Vitek, J. L. et al. Subthalamic nucleus deep brain stimulation with a multiple independent constant current-controlled device in Parkinson’s disease (INTREPID): a multicentre, double-blind, randomised, sham-controlled study. Lancet Neurol. 19, 491–501 (2020).
Moher, D., Liberati, A., Tetzlaff, J. & Altman, D. G., Group, P. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. J. Clin. Epidemiol. 62, 1006–1012 (2009).
Cochran, W. G. The combination of estimates from different experiments. Biometrics 10, 101–129 (1954).
Higgins, J. P., Thompson, S. G., Deeks, J. J. & Altman, D. G. Measuring inconsistency in meta-analyses. Bmj 327, 557–560 (2003).
Jackson, D., Bowden, J. & Baker, R. How does the DerSimonian and Laird procedure for random effects meta-analysiscompare with its more efficient but harder to compute counterparts? J. Stat. Plan. Inference 140, 961–970 (2010).
Egger, M., Davey Smith, G., Schneider, M. & Minder, C. Bias in meta-analysis detected by a simple, graphical test. Bmj 315, 629–634 (1997).
Wickham, H. & Bryan, J. readxl: Read Excel Files. R package version 1.3.1. (2019).
Viechtbauer, W. Conducting meta-analyses in R with the metafor package. J. Stat. Softw. 36, 1–48 (2010).
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2019).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
Author information
Authors and Affiliations
Contributions
Conception of the work: M.L.L., P.K. Acquisition of the data: M.L.L., N.A., M.B., A.D. Data interpretation: M.L.L., M.M., N.A., I.D., J.M., J.A.S, A.N., H.Y., J.P.M., G.D., P.K. Statistical analysis: N.A. Writing the first draft: M.L.L., M.M. Critical Revision and approval of the manuscript: All authors.
Corresponding author
Ethics declarations
Competing interests
MLL reveived reimbursement of traveling expenses to scientific meeting by Medtronic and Teva Pharmaceutical Industries. JPM received royalties from Merz Pharma Germany and Ipsen Pharma Germany. ID received a research grant from Boston Scientific,as well as reimbursement of traveling from Zambon and Boston scientific, all support outside the submitted work. NA and MB are employees of Aleva Neurotherapeutics. AD is an employee and stockholder of Aleva Neurotherapeutics, and holds patents assigned to Aleva Neurotherapeutics in the field of neuromodulation. AN received travel grants from Boston scientific. He received a research grant from the Swiss Parkinson Foundation CP is co-founder of Aleva Neurotherapeutics and received consultancy fees from Boston Scientific. GD has served as a consultant for Boston Scientific, Aleva, Cavion and Functional Neuromodulation. He has received lecture fees from Boston Scientific and royalties from Thieme publishers. He is a government employee and receives funding for his research through his institution from the German Research Council, the German Ministery of Education and Research and Medtronic. PK reports grants from Swiss National Science Foundation, Roger de Spoelberch Foundation, Bertarelli Foundation, Michael J Fox Foundation, Annemarie Opprecht Foundation, Parkinson Schweiz, research grants from Boston Scientific, and Aleva, lecturing fees paid to employing institution from Boston Scientific and Bial, as well as reimbursement of traveling expenses to scientific meeting by Zambon, all support outside the submitted work. The remaining authors have nothing to disclose.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lachenmayer, M.L., Mürset, M., Antih, N. et al. Subthalamic and pallidal deep brain stimulation for Parkinson’s disease—meta-analysis of outcomes. npj Parkinsons Dis. 7, 77 (2021). https://doi.org/10.1038/s41531-021-00223-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41531-021-00223-5