Introduction

Parkinson’s disease (PD) is a progressive neurological disorder that impacts millions of individuals globally. Although current treatments can provide symptomatic relief, there is no therapy available that can effectively slow down the progression of PD over time. Clinically, the course of PD is categorized into distinct stages, starting with the prodromal stage that precedes the formal diagnosis1. In clinical trials, early-stage PD is typically defined by the Hoehn and Yahr stages 1 and 22, while advanced-stage PD is often characterized by the presence of motor fluctuations3. Late-stage PD is primarily identified by dopamine-resistant motor symptoms and non-motor symptoms4. However, the biological underpinnings of these stages remain largely unclear. Interestingly, degeneration in the dopaminergic system reaches its peak within the initial four years after diagnosis5, suggesting that the transitions between subsequent stages are not solely attributed to dopamine loss. PD pathology extends beyond the dopaminergic system, affecting other neurotransmitter systems6,7,8, and Lewy pathology progressively spreads throughout the nervous system9,10. Furthermore, functional decline in PD may arise from the weakening of compensatory mechanisms rather than direct neurodegeneration. For instance, maladaptive plasticity in the striatum is believed to contribute to the development of dyskinesias and motor fluctuations11,12.

Although the course of the disease varies considerably in the early stages and appears stable for a significant length of time, the late stage, in contrast, is characterized by a relatively rapid and uniform clinical deterioration13. Specifically, the occurrence of morbidity milestones, which include visual hallucinations, recurrent falls, the onset of dementia, and admission to a nursing facility, have been associated with poor prognosis heralding the final five years of the disease13. Even in patients with long-term deep brain stimulation (DBS), the same rapid progression was found after the onset of morbidity milestones—in spite of more benign disease trajectories in DBS patients in early to mid-stage disease14. These findings indicate a pathophysiological process, which is inherent to the last phase of the disease or an age-modulated exponential progression culminating in comparable trajectories. Such an accelerated progression or “neurodegenerative storm” in the terminal stage can be illustrated in the form of a hockey-stick effect, which closes a steady linear deterioration with a steep upward curve at the end13,14. The underlying factors for these processes are unknown, however, both models could be caused by a failing stress compensation in already critically degenerated areas where only minor cellular loss acts as the straw breaking the camel’s back. Another possible mechanism is the spread of alpha-synuclein beyond the midbrain structures, including the cortex and the limbic system, which results in a pronounced increase in affected brain volume.

Biomarkers such as serum neurofilament light chain (sNfL) could help to determine the underlying mechanism in this stage of the disease as sNfL reports neuro-axonal damage and is elevated in many neurodegenerative diseases including PD15,16,17,18,19. Although sNfL is not specific for PD induced neurodegeneration but rather reflects any neuro-axonal damage—peripheral or central—there is a solid body of evidence supporting its role as a marker for ongoing neurodegeneration20. For example, sNfL levels were already elevated during the asymptomatic phase in mutation carriers of familial Alzheimer’s disease and sNfL dynamics accelerated in individuals transitioning from the presymptomatic to the symptomatic stage21. In many other neurodegenerative disorders like amyotrophic lateral sclerosis, atypical parkinsonism and PD, higher levels of sNfL were shown to be associated with faster disease progression and higher brain atrophy rates22. Furthermore, in a number of neurological disorders, effective disease-modifying therapies were shown to normalize NfL levels linking NfL dynamics to the clinical efficacy of the treatment23,24,25. Based on this, sNfL can be regarded as a measure of the intensity of ongoing neurodegeneration26.

We hypothesize that the rapid increase in disability after the occurrence of a clinical milestone results not only from functional decompensation but from accelerated neurodegeneration. To test this hypothesis, we measured sNfL in a cohort of patients with PD at least 5 years after diagnosis and compared patients with and without clinical milestones. Furthermore, we investigated the predictive value of sNfL for all-cause mortality. In brief, we find increased sNfL levels in patients with morbidity milestones, suggesting a high rate of ongoing neurodegeneration in late-stage PD, which entails the potential to develop stage-specific therapeutics to prevent the rapid clinical deterioration heralded by the onset of morbidity milestones.

Results

118 patients with PD (disease duration of at least 5 years) were included in the study; demographic and clinical data are summarized in Table 1. Pairwise comparisons between the groups in Table 1 are displayed in Supplementary Table 1. A large proportion of study participants were enrolled in an observational register study for patients with DBS. We have shown previously that there are no differences between sNfL values between patients treated with DBS and patients without DBS when correcting for age and disease duration in the same cohort27. As expected from the use of an advanced therapeutic option, patients treated with DBS had a longer disease duration (16.7 ± 6.5 vs 9.2 ± 5.2, p < 0.0001, Mann–Whitney U test), and it was more likely for the milestones dementia (p = 0.0007, Fisher’s exact test) or recurrent falls (p = 0.0418, Fisher’s exact test) to be present.

Table 1 Demographic and clinical data.

Serum NfL is associated with clinical milestones and discriminates between patients with or without clinical milestones

At baseline, 42 patients showed no clinical milestone, while 76 showed at least one milestone, confirming the advanced stage of our cohort. sNfL was significantly higher in patients with at least one clinical milestone than in patients without a clinical milestone (Fig. 1a), and sNfL correlated positively with the number of clinical milestones (Fig. 1b, Spearman’s ρ = 0.672, p < 0.001).

Fig. 1: sNfL increases with the presence of clinical milestones.
figure 1

Comparison of log(NfL) values for patients with no milestone or at least one clinical milestone (a) and of log(NfL) values stratified by the number of clinical milestones (b). Data are depicted as boxplots (center line—median, box limits—upper and lower quartiles, whiskers—1.5× interquartile range, diamonds—outliers) overlaid with individual markers for each patient (dots). Only significant differences between groups are depicted in the figure. Groups were compared using Mann–Whitney U Test (a) or a posthoc Dunn-Test after a significant Kruskal–Wallis test (p < 0.001) adjusted for false discovery rate using the Benjamini–Hochberg method (b).

Linear models using log(NfL) as the dependent variable revealed that 51.1% of the variance in log(NfL) is explained by the combination of the four milestones (Adjusted R2 = 0.511, p < 0.001). The effects of all milestones on log(NfL) remained significant after adding age, disease duration and sex to the linear model and only age significantly improved the model parameters (Adjusted R2 = 0.581, p < 0.001). When using age-adjusted sNfL Z scores as the dependent variable, the milestones dementia, visual hallucinations and recurrent falls contributed significantly to the linear model, while admission to a nursing facility did not (adjusted R2 = 0.371, p = 0.008, p < 0.001, p = 0.025 and 0.132, respectively). The individual effects of each variable on log(NfL) and age-adjusted sNfL Z scores were further explored with univariate linear regression models (Supplementary Table 2).

Serum NfL predicts death within three years

As mentioned above, clinical milestones typically occur about 5 years prior to death13. The mean follow-up period between the sNfL measurement and the last follow-up chart review was 28.4 ± 8.7 months. In our cohort of 118 patients with advanced PD, 20 patients died during that follow-up period. These patients showed significantly higher age, disease duration and more severe symptoms as reported by Hoehn & Yahr stage, UPDRS III and MoCA scores at the baseline visit (Supplementary Table 1). There were no significant differences in age of onset, sex distribution, or levodopa equivalent daily dose (LEDD).

Surprisingly, the presence of one clinical milestone alone at baseline did not predict death during follow-up in our cohort (p = 0.13, Table 2). However, the presence of at least two clinical milestones increased the risk of death during follow-up by a factor of 21.0 (Table 2), and more milestones were associated with a higher risk of death (HR = 3.86 per additional milestone, Table 2). Among individual milestones, dementia showed the highest hazard ratio (HR = 11.6, Table 2), while recurrent falls showed the lowest hazard ratio and was above the threshold for significance (p = 0.051, Table 2). Other clinical parameters like lower MoCA scores, higher UPDRS III scores and a higher Hoehn & Yahr stage were also significantly associated with a higher risk of death during follow-up (Table 2). Consistent with the hypothesis of accelerated neurodegeneration in the final phase of the disease, higher log(NfL) at baseline proved to be a strong risk factor for death during follow-up independent of age at baseline, disease duration and sex (HR = 12.9 per 1 point increase, Table 2).

Table 2 Hazard ratios associated with sNfL and other risk factors.

To further test the predictive quality of sNfL for death during follow-up and to identify a cutoff value with high sensitivity and specificity, we fitted ROC curves (Fig. 2a). The area under the curve for the prediction of death within three years by (unadjusted) sNfL alone was 0.91 (0.85–0.97). The optimal cutoff value was sNfL = 26.9 pg/ml (determined using the Youden Index), resulting in a sensitivity of 85.0% and a specificity of 85.7%.

Fig. 2: ROC curves for sNfL and age-adjusted sNfL percentiles.
figure 2

ROC curves for the prediction of death during follow-up by sNfL (a, blue curve) and age-adjusted sNfL percentiles (b, blue curve) at baseline. For reference, a ROC curve of a logistic regression model containing age and disease duration was added in both panels (a, b, orange curves). 95% confidence intervals (bootstrap method, 2000 stratified intervals) are depicted around each ROC curve. AUC area under the curve, sNfL serum neurofilament light chain.

Survival differed significantly between patients above and below this cutoff value (p < 0.001 Log-rank test, Fig. 3). Patients with sNfL values ≥ 26.9 pg/ml showed a median survival of only 30.6 months (5th percentile = 16.6 months; 95th percentile not in the range of follow-up). Using a median split and a quartile split as threshold values, groups similarly showed significant differences in survival (p < 0.001 for both, Supplementary Fig. 1).

Fig. 3: Survival curve for death during follow-up.
figure 3

The groups for the Kaplan–Meier estimates were determined by maximizing the Youden index for unadjusted sNfL values (a) and population-based age-adjusted sNfL percentiles (b). Both cutoff values show a highly significant difference in survival for the resulting groups (p < 0.001). Groups below the cutoff are depicted in orange and above the cutoff in blue.

Absolute sNfL values can differ between laboratories. In order to provide a cutoff value that can be easily adapted to other cohorts, we converted the absolute values in our cohort into age-adjusted percentiles using the Serum Neurofilament Light Chain Reference App28. Using the same procedure as described above, we found an AUC of 0.89 (0.83–0.96, Fig. 2b) for the prediction of death during follow-up. The optimal cutoff value based on the Youden index was 90% (Sensitivity = 95.0%, Specificity = 65.3%).

We further evaluated ROC curves for age at baseline and disease duration, the presence of milestones, MoCA score, UPDRS III score or Hoehn & Yahr stage (Table 3). Although there were no significant differences between the AUC of sNfL and MoCA score, UPDRS III score or the number of milestones (p = 0.443; p = 0.183, and p = 0.641, respectively), sNfL was the only predictor of death during follow-up, whose AUC was significantly higher than the combined AUC of age and disease duration (0.91 (0.85–0.97) vs 0.80 (0.69–0.91), p = 0.038) or the AUC of Hoehn & Yahr stage (0.92 (0.87–0.97) vs 0.82 (0.73–0.91), p = 0.010). ROC curves for all baseline clinical characteristics are depicted in Supplementary Fig. 2.

Table 3 Area under ROC curves for death during 3-year follow-up.

Discussion

In the present study, the occurrence of disability milestones correlated with accelerated neurodegeneration as measured by a higher value of the blood-based biomarker sNfL. The more clinical milestones a patient had reached, the higher the levels of sNfL appeared to be. This indicates that late-stage PD, after the onset of disability milestones, is associated with accelerated neurodegeneration, and clinical deterioration not only results from functional decompensation. Furthermore, we identified sNfL as a good predictor of all-cause mortality in our advanced patient cohort.

Several well-made studies across different neurological diseases (including PD29,30,31,32,33) have shown that higher NfL values, measured in blood or cerebrospinal fluid, are consistently associated with clinical and radiological signs of neurodegeneration21,34,35. In PD, accordingly, NfL was shown to predict the onset of morbidity milestones and also the progression to more milestones and all-cause mortality36. Higher NfL is also associated with brain volume reduction, a higher number of neuronal comorbidities and all-cause mortality in the general aging population16,37,38. In our view, however, these studies have not fully appreciated the meaning of this association. NfL particularly does not report cumulative neuronal damage but reports the extent of currently ongoing neuronal injury as it has been demonstrated that NfL levels rise during insults that are associated with acute neuronal damage (e.g., traumatic brain injury, active multiple sclerosis or operations) and more importantly fall off again, after the initial period of damage has subsided27,39,40,41. The well-established correlation between sNfL and neurodegeneration, coupled with its temporal dynamics following acute insults, as well as its association with clinical milestones and mortality as demonstrated in this study, may therefore provide compelling evidence for an accelerated rate of neurodegeneration in advanced stages of PD.

Accelerated neurodegeneration in late-stage PD initially appears contradictory to the major dopaminergic neuron degeneration at early PD5. This apparent conflict can be resolved by considering the progressive spread of alpha-synuclein pathology, initially limited to the brainstem and midbrain but eventually affecting the entire brain9,42. While neuronal somata degeneration is most notable for dopaminergic and other catecholaminergic neurons43, brain-wide loss of neuronal processes and synapses likely occurs in Braak stages 5 and 69. Our findings might therefore be consistent with a more widespread alpha-synuclein pathology-induced neuro-axonal damage in late-stage PD. Moreover, the strongest clinical predictor of death, cognitive decline, may result from extensive cortical damage in advanced Braak stages.

In order to exploit the predictive power of sNfL for patient counseling and trial design, we sought to determine the strongest predictors of all-cause mortality within the observation period, obtaining similar results for log(NfL), the number of clinical milestones, MoCA scores and UPDRS III scores. However, there was a trend for higher predictive accuracy using log(NfL) and it was the only predictive marker that outperformed a combined model of age at baseline, disease duration and Hoehn & Yahr stage. We then corrected for patient age since sNfL levels increase with age, and age is a major risk factor for the incidence of PD, disease progression, and death. The association of sNfL with patient survival was still observed when patient age was corrected for, either by including age in a linear model or by using age-dependent sNfL percentiles (Supplmentary Table 2 and Figs. 2, 3b). Interestingly, models that include sNfL without age correction performed somewhat better than age-corrected models, suggesting that age might play a distinct role in the increase in neurodegeneration. Since aging affects many cellular processes that predispose to neurodegeneration, the accumulation of age-related somatic damage combined with a failure of compensatory mechanisms may contribute to an acceleration of PD with age44. This is also consistent with the hypothesis that the increasing values of sNfL observed in an ageing population are in part caused by age-associated neurodegeneration. Accordingly, sNfL levels have been shown to increase with neuronal comorbidities and brain volume reduction16,37.

Interestingly, we found a higher predictive power for sNfL with respect to death during follow-up than observed in previous studies that have used a similar methodology and comparable or longer periods of follow-up (AUC for death during follow-up predicted by age-adjusted sNfL (95% CI): 0.89 (0.83–0.96) vs 0.68; HR for death during follow-up (95% CI): 13.94 (4.93–39.3) vs 1.94 (1.36–2.76)36,45. These discrepancies could reflect an association between the predictive ability of sNfL and disease duration or disease stage, as the study by Vijiaratnam et al. included only patients with de novo PD (disease duration: 1.3 ± 0.9 years) and patients in the study by Rödström et al. had a shorter disease duration (7.9 ± 5.1 years) compared to our cohort (15 ± 7 years)36,45. Furthermore, in the study by Rödström et al 15.5% of participants were excluded because they died less than 2 years after follow-up, potentially obfuscating the relationship between sNfL and death. The predictive ability of sNfL for adverse outcomes could therefore improve over the course of the disease, which could be connected to accelerated neurodegeneration in advanced disease stages.

Limitations of our study include the reliance on a single center and a single marker for neurodegeneration. In addition, this study was based on a single measurement of sNfL, which potentially has a lower predictive value compared to the longitudinal change of sNfL21. However, previous studies by others have shown that a single measurement of sNfL can be considered a good marker for ongoing neurodegeneration around the time of the measurement (see above). Another limitation of the study is the retrospective character of the follow-up chart review that did not allow a specification for the cause of death. Furthermore, it needs to be considered that our cohort contained a high percentage of PD patients with chronic DBS. Although we have shown in a previous study that DBS treatment does not alter the levels of sNfL in patients with PD, the results presented in this article should be fully validated in patients without DBS.

In conclusion, our findings suggest that the rate of ongoing neurodegeneration is higher in advanced PD as defined by the presence of morbidity milestones compared to earlier disease stages. A better understanding of stage-dependent neurodegeneration dynamics and potential accelerators may be helpful to further characterize the respective pathophysiological bases which in turn could help identify new therapeutic targets. Our results further suggest that disease-modifying therapies may continue to hold significant value even in the late stages of the disease and that particularly debilitating symptoms like dementia or loss of physical independence could possibly be prevented, if the neurodegenerative process is halted in that phase. Consequently, it is imperative that research and therapeutic endeavors also encompass the prevention and management of these crucial aspects. sNfL could therefore help to identify patients at risk for clinical deterioration and in addition serve as an objective outcome measure for neuroprotective substance trials.

Methods

Study population and design

In order to increase the probability of morbidity milestones occurring during the observation period, patients beyond the early-stage (“honeymoon”) were selected, defined for this study by a disease duration of at least 5 years46. The study was approved by the institutional review board of the Technische Universität Dresden (EK533122019, EK487122016) and written informed consent was obtained from all participants before inclusion in the study. Participants were recruited at the Department of Neurology of the Dresden University Hospital. The study included 88 patients on chronic deep brain stimulation (DBS) treatment, which were enrolled in a structured observational register study and 30 patients on best medical treatment, which were subjected to a solely chart-based review of clinical data. Baseline sNfL was measured during a routine clinical visit. At the same time, clinical and demographic data were collected, including the presence of clinical milestones, information about the motor and cognitive status (Hoehn & Yahr stage, UPDRS part III, Montreal cognitive assessment (MoCA)) as well as the presence of disease-related complications. Three years after the inclusion of the first patient, we performed a chart review to determine all-cause mortality and the time point of death.

Serum NfL measurements

Serum samples were stored at −20 °C and sNfL measurements were performed as described previously, using the Advantage NF-Light Singleplex-Kit on a Simoa HD-1 instrument (Quanterix)27,47. Calibrators and diluted serum samples were measured in duplicates. Both the mean intraassay coefficient of variation of duplicates and the mean interassay coefficient of variation were <10%.

Statistical analyses

To normalize the right-skewed distribution of sNfL, natural log-transformation (logNfL) was used in linear models and Cox proportional-hazards models, as described by others19. Age-adjusted sNfL percentiles and Z scores were calculated with the Serum Neurofilament Light Chain Reference App, which uses a reference database derived from over 5000 participants without central nervous system disorders28. All Cox proportional-hazards regression models were adjusted for age at baseline, disease duration and sex. The proportional-hazards assumption was tested based on visual inspection of the scaled Schoenfeld residuals and χ2 test.

Comparisons between groups were carried out using the Mann–Whitney U Test. Correlations were assessed with Spearman’s rank test. For multiple group comparisons, the Kruskal–Wallis test with the posthoc Dunn-test corrected for false discovery rate with the Benjamini-Hochberg method was used.

To compute cutoff values for the prediction of death, we used binary regression models and fitted receiver operator curves. The optimal sensitivity and specificity were calculated based on the Youden Index determined with the R-package cutpointr. The computed Youden Index was used to dichotomize our sample to receive Kaplan–Meier curves depending on the respective sNfL or age-adjusted sNfL percentile cutoff value. Kaplan–Meier curves were compared with log-rank tests and corrected for false discovery rate with the Benjamini–Hochberg method if necessary.

Data are depicted either as mean with standard deviation or median with 95% confidence interval if not otherwise stated. Statistical analyses were performed with R-Studio (packages: pROC, survival, survminer, cutpointr) or Python (packages: scipy, pingouin, statsmodels, kaplan–meier). The sample size was based on the size of our patient cohort and not determined by a prospective sample size estimate.