Data-Driven Subtyping of Parkinson’s Disease Using Longitudinal Clinical Records: A Cohort Study

Zhang, Xi; Chou, Jingyuan; Liang, Jian; Xiao, Cao; Zhao, Yize; Sarva, Harini; Henchcliffe, Claire; Wang, Fei

doi:10.1038/s41598-018-37545-z

Article
Open access
Published: 28 January 2019

Scientific Reports volume 9, Article number: 797 (2019)

Abstract

Parkinson’s disease (PD) is associated with diverse clinical manifestations including motor and non-motor signs and symptoms, and emerging biomarkers. We aimed to reveal the heterogeneity of PD to define subtypes and their progression rates using an automated deep learning algorithm applied to longitudinal clinical records. This study utilizes the data collected from the Parkinson’s Progression Markers Initiative (PPMI), which is a longitudinal cohort study of patients with newly diagnosed Parkinson’s disease. Clinical information including motor and non-motor assessments, biospecimen examinations, and neuroimaging results was used for identification of PD subtypes. A deep learning algorithm, Long Short-Term Memory (LSTM), was used to represent each patient as a multi-dimensional time series for subtype identification. Both visualization and statistical analysis were performed to analyze the obtained PD subtypes. In total, 466 patients with idiopathic PD were investigated and three subtypes were identified. Subtype I (Mild Baseline, Moderate Motor Progression) comprised 43.1% of the participants, with average age 58.79 ± 9.53 years, and was characterized by moderate functional decay in motor ability but stable cognitive ability. Subtype II (Moderate Baseline, Mild Progression) comprised 22.9% of the participants, with average age 61.93 ± 6.56 years, and was characterized by mild functional decay in both motor and non-motor symptoms. Subtype III (Severe Baseline, Rapid Progression) comprised 33.9% of the participants, with average age 65.32 ± 8.86 years, and was characterized by rapid progression of both motor and non-motor symptoms.
These subtypes suggest that when comprehensive clinical and biomarker data are incorporated into a deep learning algorithm, the disease progression rates do not necessarily associate with baseline severities, and the progression rate of non-motor symptoms is not necessarily correlated with the progression rate of motor symptoms.

Introduction

Parkinson’s Disease (PD) is clinically heterogeneous, and identification of subtypes may therefore facilitate further research on underlying etiologies and development of appropriate therapies^1,2,3. However, the disease is associated with a broad spectrum of variable factors including motor, cognitive, and neuropsychiatric signs and symptoms, neuroimaging, genetics, and others⁴. Therefore, accurately defining PD subtypes can be challenging. Moreover, PD is a progressive neurodegenerative disorder with heterogeneity in individual disease trajectories⁵. The rationale behind this study is to utilize the comprehensive data provided by the Parkinson’s Progression Markers Initiative (PPMI)⁶ to discover PD subtypes such that the PD patients within each subtype demonstrate cohesive progression pathways. Here “pathway” refers to a patient’s longitudinal records, and “cohesive” means that the records of patients within a subtype are similar to each other longitudinally. We call such subtypes progression subtypes.

Robust and valuable existing studies^3,7 on PD subtyping have defined patient groups by informative motor and non-motor variables. For instance, we can divide PD into tremor-dominant (TD) and postural instability and gait difficulty (PIGD) subtypes, according to the predefined motor criteria based upon the Unified Parkinson’s Disease Rating Scale (UPDRS). However, these conventional approaches typically just focus on one specific aspect (e.g., motor or cognition) of the patient characteristics. Therefore, we need more comprehensive approaches that can consider different aspects of patient characteristics during the subtyping process. In this case, computational techniques will likely be helpful because of the large number of variables and the complex relationships among them.

From a computational (or data-driven) perspective, patient subtyping is a clustering problem⁸, where the goal is to group patients such that each subtype corresponds to a specific patient cluster. The patients within the same subtype are therefore similar to each other. There are a small number of previous studies^1,4,9,10 that applied data-driven clustering methodologies to identify subtypes without any prior assumptions. These methods (e.g., k-means^11,12 or hierarchical agglomerative clustering¹³) are typically based on static patient representation derived from their baseline assessments. In this paper, we additionally incorporate longitudinal patient information into the subtyping process. This complements the subtypes identified by traditional methods as our approach can derive PD subtypes with common progression patterns.

In order to take into account the course of PD progression, we aimed to identify progression subtypes, where the patients within each subtype are similar to each other longitudinally (in terms of the temporal trends of their records). This has the advantage of potentially providing data that could inform discussion of patient prognosis in the clinic. Quantifying the pairwise similarity between multi-dimensional longitudinal patient records is therefore key to discovering these subtypes. To solve this problem, we first concatenated the multi-source records according to their timestamps to form a temporal sequence for each patient. Then a deep learning model, LSTM¹⁴, was trained to encode the raw record sequences into a series of standardized and dense sequence embeddings. Dynamic Time Warping (DTW)¹⁵, which is a common technique for quantifying the distance between pairs of temporal sequences, was then applied to those embeddings to evaluate the patient similarities. Finally, the subtypes were identified through conventional clustering with the learned patient similarities (see Fig. 1).

Methods

Data

The patient data used in our study were obtained from the Parkinson’s Progression Markers Initiative (PPMI) study⁶. PPMI is an ongoing observational, international, multi-source study that has, for more than six years, collected various potential PD progression markers, including demographics, clinical features, imaging, and biospecimen (cerebrospinal fluid, blood, DNA, RNA) measures. We downloaded the data from the PPMI database on June 21, 2016. The de-identified data contained archives of subjects enrolled from June 1, 2010, to June 1, 2016. The patient features used in this study include clinical evaluation of motor and non-motor features, biospecimen examinations of cerebrospinal fluid (CSF), and neuroimaging of the dopamine transporter using ¹²³I-ioflupane single photon emission computed tomography (SPECT) (DaTScan™). CSF was collected by standardized lumbar puncture procedures. Measurements of cerebrospinal fluid concentration of amyloid-beta1–42 (Aβ_1–42), total Tau protein (t-Tau), and phosphorylated Tau protein at threonine 181 (p-Tau₁₈₁) were taken in each of 102 CSF aliquots at the University of Pennsylvania using the multiplex Luminex xMAP platform (Luminex Corp). Cerebrospinal fluid alpha-synuclein concentration (α-syn) was analyzed at Covance using a commercially available enzyme-linked immunosorbent assay kit (Covance)¹⁶. DaTScan™ is a radiopharmaceutical imaging agent that works by binding to dopamine transporters (DaT) in the brain. All subjects had DaT imaging at baseline, acquired in the striatum using SPECT. The DaT images were centrally reconstructed, attenuation corrected, and analyzed with a standardized volume of interest template on caudate, putamen, and occipital regions (https://www.indd.org/).

The enrolled PD participants were required to (1) be over 30 years old; (2) have a Hoehn and Yahr (H&Y) stage of 1 or 2; (3) have an asymmetric resting tremor, or asymmetric bradykinesia, or two of bradykinesia, resting tremor, and rigidity with recent PD diagnosis; and (4) be untreated by anti-PD medications⁶. Therefore, the PD patients enrolled in this study were early in their disease course, which makes it more likely that a disease progression biomarker can be identified and provides a better population for eventual disease-modifying drug trials.

According to the primary diagnosis from the PPMI data, the subjects with “Idiopathic Parkinson’s Disease” or “No PD or other neurological disorder” were extracted as cases and healthy controls, respectively. In total, the dataset consisted of 15,798 records of 683 subjects including 466 PD patients. On average, each subject had approximately 23 records. We used all subjects, including both cases and controls, to train the LSTM-based embedding; subsequently, only the PD cases were used for subtyping and statistical analysis.

As there were many missing entries in the patient records (for instance, 14.42% of the values for age and 15.29% of the values for disease duration were missing), an imputation procedure based on Multiple Imputation by Chained Equations (MICE)¹⁷ was conducted.

PD Subtyping

We used a deep learning model for pre-processing the patient record sequences. Deep learning methods^18,19 are normally composed of multiple layers of computational units that can perform nonlinear transformations of input features. Empirical results in certain medical applications^20,21 have demonstrated that these learned representations often result in much improved performance compared with traditional approaches. Recently researchers have also started exploring the applications of deep learning in the tasks of learning patient representations from Electronic Health Records (EHR)²².

In this study, we proposed to learn patient representations with the LSTM model, a popular deep learning model for sequence representation learning that has been successfully applied in tasks such as speech analysis and natural language processing^14,23. Before applying LSTM, we first concatenated patient records from different sources into an ordered sequence according to their associated timestamps, as shown in Fig. 1A. Moreover, we split the events in patient records (termed “features”) into two different types: input features and target features. The target features are critical variables that previous clinical studies have shown to be closely related to PD progression⁵. The rest of the features were treated as input features.

For each patient p, given a sequence of his/her input features \({{\bf{x}}}_{t},\,t=1,\,2,\,\cdots ,\,{N}_{p}\) and a sequence of his/her target features \({{\bf{y}}}_{t},\,t=1,\,2,\,\cdots ,\,{N}_{p}\), a new sequential representation \({{\bf{h}}}_{t},\,t=1,\,2,\,\cdots ,\,{N}_{p}\) can be learned with LSTM. Each h_t is a dense vector with values on each dimension standardized to [−1, 1], and such vectors leverage the temporal context around timestamp t. Using this procedure, we could obtain integrated, standardized, and densified multi-dimensional sequential patient representations. The next step was then to evaluate pairwise similarities based on these derived representations.
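The bound on each dimension of h_t is a property of the standard LSTM cell, whose hidden state is an output gate in (0, 1) multiplied by a tanh in (−1, 1). The following is a minimal sketch of that forward pass in plain Python. The source does not give its exact architecture, so the gate layout, the random untrained weights, and the names `lstm_step`, `init_params`, and `encode_sequence` are illustrative assumptions rather than the authors' implementation; training against the target features y_t is omitted.

```python
import math
import random

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def affine(W, b, v):
    """Return W @ v + b for a list-of-rows matrix W."""
    return [sum(w * x for w, x in zip(row, v)) + bi for row, bi in zip(W, b)]

def lstm_step(x, h_prev, c_prev, params):
    """One step of a standard LSTM cell (assumed gates: i, f, o, g)."""
    z = x + h_prev  # concatenation [x_t, h_{t-1}]
    i = [sigmoid(a) for a in affine(params["Wi"], params["bi"], z)]
    f = [sigmoid(a) for a in affine(params["Wf"], params["bf"], z)]
    o = [sigmoid(a) for a in affine(params["Wo"], params["bo"], z)]
    g = [math.tanh(a) for a in affine(params["Wg"], params["bg"], z)]
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    # Hidden state: output gate times tanh(cell state), so |h| <= 1.
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c

def init_params(n_in, n_hidden, seed=0):
    """Random, untrained weights; a trained model would supply these."""
    rng = random.Random(seed)
    params = {}
    for gate in "ifog":
        params["W" + gate] = [
            [rng.uniform(-0.5, 0.5) for _ in range(n_in + n_hidden)]
            for _ in range(n_hidden)
        ]
        params["b" + gate] = [0.0] * n_hidden
    return params

def encode_sequence(xs, params, n_hidden):
    """Map a sequence of input vectors x_t to hidden vectors h_t."""
    h, c = [0.0] * n_hidden, [0.0] * n_hidden
    hs = []
    for x in xs:
        h, c = lstm_step(x, h, c, params)
        hs.append(h)
    return hs

# Toy record sequence: three visits with three (unscaled) input features.
visits = [[0.2, -1.0, 5.0], [0.1, 0.0, 40.0], [3.0, 2.0, -7.0]]
params = init_params(n_in=3, n_hidden=4)
embeddings = encode_sequence(visits, params, n_hidden=4)
```

Whatever the scale of the raw inputs, every entry of `embeddings` lies in [−1, 1], which is what makes the per-visit vectors directly comparable across patients.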

Once the LSTM model was trained, the sequence of hidden layer representations \({{\bf{h}}}_{t},\,t=1,\,2,\,\cdots ,\,{N}_{p}\) that encode the multi-source features was obtained for each patient. Such a sequence of vectors can be treated as a sequence of embeddings. Those embeddings were dense and standardized (values between −1 and 1), which makes it much more convenient to evaluate patient similarities from those sequences. Of note, the problem of patient subtyping is intrinsically the problem of defining proper patient similarities. Using these similarities from patient records, we attempted to discern categories of disease progression.

Dynamic Time Warping (DTW)¹⁵ is a popular technique for measuring the distance (which can be regarded as a dissimilarity) between a pair of temporal sequences. Unlike a straightforward Euclidean distance calculation, DTW first aligns the two sequences using a dynamic programming procedure and then calculates the Euclidean distance between the aligned sequences. In this way, time shifts are taken into account in the evaluation, making the results more accurate and robust. A Gaussian function was employed to transform those DTW distances into similarities²⁴. We evaluated this similarity for each pair of patients and formed an N by N symmetric patient similarity matrix \({\bf{S}}\in {{\mathbb{R}}}^{N\times N}\), whose (i, j)-th entry S_ij is the similarity between patient i and patient j.
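The DTW and Gaussian-kernel steps can be sketched in a few lines of plain Python. This is an illustration under stated assumptions, not the authors' code: it uses the textbook DTW recurrence with a Euclidean local cost, the kernel form exp(−d²/(2σ²)), and a bandwidth `sigma` that the source does not specify; the function names are hypothetical.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def dtw_distance(seq_a, seq_b):
    """Accumulated cost of the best monotone alignment of two sequences."""
    n, m = len(seq_a), len(seq_b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(seq_a[i - 1], seq_b[j - 1])
            # Extend the cheapest of the three admissible predecessors.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

def similarity_matrix(sequences, sigma=1.0):
    """Symmetric N x N matrix S with S_ij = exp(-DTW_ij^2 / (2 sigma^2))."""
    n = len(sequences)
    S = [[1.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            d = dtw_distance(sequences[i], sequences[j])
            S[i][j] = S[j][i] = math.exp(-d * d / (2.0 * sigma * sigma))
    return S

# Toy one-dimensional embeddings: b is a time-shifted copy of a, c is flat.
patient_a = [[0.0], [1.0], [2.0]]
patient_b = [[0.0], [0.0], [1.0], [2.0]]
patient_c = [[1.0], [1.0], [1.0]]
S = similarity_matrix([patient_a, patient_b, patient_c])
```

Because DTW absorbs the repeated first visit of `patient_b`, patients a and b receive similarity 1 even though their sequences differ in length, whereas a pointwise Euclidean distance would not be defined for them at all.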

Student t-Distributed Stochastic Neighbor Embedding (t-SNE)^25,26 was then applied to the similarity matrix to embed the patients into a 2-dimensional space so that the patient similarities could be preserved. The patient subtypes were then identified by clustering the points in this 2-dimensional space with the k-means algorithm¹¹, with the number of clusters determined by Hartigan’s rule²⁷.
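Hartigan’s rule can be illustrated independently of the clustering itself. In its usual formulation, the index H(k) = (W(k)/W(k+1) − 1)(n − k − 1) compares the within-cluster sum of squares W for k and k+1 clusters, and another cluster is added while H(k) exceeds a threshold, conventionally 10. The sketch below assumes that formulation and threshold, which the source does not spell out; the function names and the toy W values are hypothetical.

```python
def within_cluster_ss(points, labels):
    """Total squared distance of each point to its cluster centroid."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    total = 0.0
    for members in clusters.values():
        dim = len(members[0])
        centroid = [sum(p[d] for p in members) / len(members) for d in range(dim)]
        total += sum((p[d] - centroid[d]) ** 2 for p in members for d in range(dim))
    return total

def hartigan_index(w_k, w_next, n, k):
    """H(k) = (W(k) / W(k+1) - 1) * (n - k - 1)."""
    if w_next == 0.0:
        return float("inf") if w_k > 0.0 else 0.0
    return (w_k / w_next - 1.0) * (n - k - 1)

def choose_k(wss_by_k, n, threshold=10.0):
    """Smallest k whose Hartigan index does not exceed the threshold.

    wss_by_k maps k -> W(k) for consecutive k starting at 1, typically
    obtained by running k-means once per candidate k.
    """
    ks = sorted(wss_by_k)
    for k in ks[:-1]:
        if hartigan_index(wss_by_k[k], wss_by_k[k + 1], n, k) <= threshold:
            return k
    return ks[-1]  # no candidate satisfied the rule

# Toy values: W drops sharply up to k = 3 and then flattens.
toy_wss = {1: 1000.0, 2: 400.0, 3: 150.0, 4: 140.0, 5: 135.0}
best_k = choose_k(toy_wss, n=100)
```

With these toy numbers the index is 147 for k = 1, about 162 for k = 2, and about 6.9 for k = 3, so the rule stops at three clusters.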

Model Evaluation

As introduced above, our patient subtyping process includes three steps: (1) representation learning with LSTM; (2) similarity calculation with DTW; and (3) embedding with t-SNE and clustering with k-means. LSTM processing is a key step. To assess its effectiveness, we compared the performance of our method with a baseline procedure without LSTM processing, in which the target feature sequence was used directly as the sequential representation of each patient, followed by steps 2 and 3. We also constructed another baseline with vectorized patient representations, in which each patient is represented by a vector with each dimension corresponding to a summary statistic (e.g., the count for codes such as diagnoses, or the average for continuous values such as laboratory test values) of a specific feature over a certain time period. The patient vectors were further processed by Principal Component Analysis (PCA)²⁸ to reduce the feature dimensionality and redundancy.

In order to train the LSTM model, the data were randomly divided into non-overlapping training, testing, and validation sets with a ratio of 6:2:2. The sequential representations of the patients were obtained from the hidden layers of the trained LSTM. The dimensionality of the hidden state was set to 32.
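A 6:2:2 split with no overlap can be sketched as follows. The source does not say whether the split was made per subject or per record, nor how it was seeded; splitting by subject with a fixed seed is an assumption made here so that no patient contributes to more than one set, and `split_subjects` is a hypothetical helper.

```python
import random

def split_subjects(subject_ids, ratios=(6, 2, 2), seed=0):
    """Shuffle subject IDs and cut them into train/test/validation parts."""
    ids = list(subject_ids)
    random.Random(seed).shuffle(ids)
    total = sum(ratios)
    n_train = len(ids) * ratios[0] // total
    n_test = len(ids) * ratios[1] // total
    train = ids[:n_train]
    test = ids[n_train:n_train + n_test]
    validation = ids[n_train + n_test:]  # remainder absorbs rounding
    return train, test, validation

# 683 subjects, as in the dataset described above.
train_ids, test_ids, val_ids = split_subjects(range(683))
```

For 683 subjects this yields 409, 136, and 138 subjects respectively, with every subject assigned to exactly one set.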

In a recent study identifying clinical subtypes of PD⁵, the overall disease severity and the global composite outcome were defined by a composition of several motor and non-motor variables including Unified Parkinson’s Disease Rating Scale (UPDRS) scores, cognitive assessment, and scales of depression and anxiety. Similar features were therefore selected as target features in our study, consisting of 82 features in total, of which 70 were continuous and 12 were binary. Those features were further integrated into 10 clinical variables shown in Table 1 of the supplemental material (e.g., the variable MoCA includes 28 features)⁶. The rest of the PPMI variables were set as input features, with 319 in total.

Table 1 Group characteristics of patients at the baseline in the three subtypes.

To evaluate the effectiveness of the learned patient representation and similarities, we visualized their embeddings with t-SNE and colored the detected subtypes. We also conducted statistical analysis to identify the features that distinguish the subtypes, for the purpose of interpretation. More concretely, we used the Chi-square test for categorical variables, one-way ANOVA for normally distributed continuous variables, the Kruskal-Wallis test for non-normal continuous variables, and Fisher’s exact test for highly sparse variables. For the tests with a significant p-value, Tukey post hoc analyses were performed on each pair of subtypes to identify specific differences. Based on prior studies^5,29, if the p-value was smaller than 0.05, we considered there to be a significant group effect for the associated variable.

Results

Visualization of patient subtypes

Figure 2 shows the subtyping results with the LSTM representation and the two baselines. The first baseline directly calculated the patient similarities with DTW on the raw target feature sequence. The second collapsed all features into a vector for each patient and then performed PCA on the patient vectors. Compared with Fig. 2(B,C), the three subtypes obtained from the learned LSTM representation in Fig. 2(A) are much more salient, with better separation in the scatterplot (smaller intra-cluster distance and larger inter-cluster distance).

Subtype characteristics

Patient characteristics for each subtype in Fig. 2(A), including demographics, clinical features, imaging, and biospecimen, are summarized in Tables 1 and 2 (baseline and last records). The variables contain disease duration, age, education, medication use, clinical severity measures such as H&Y Stage, MDS-UPDRS (Movement Disorders Society–revised Unified Parkinson’s Disease Rating Scale) Part I-IV, non-motor measures including cognitive impairment, depression, anxiety, sleep disorders, and imaging assessments including DaTScan Striatal Binding Ratio (SBR), as well as key CSF biomarkers (the online implementation of Subtype Characteristics Analysis is provided on https://github.com/sheryl-ai/Subtype-Analysis).

Table 2 Group characteristics of patients at their last records in the three subtypes.

The differences in mean age at baseline for the three patient subtypes are significant, at 58.79 ± 9.53, 61.39 ± 6.56, and 65.32 ± 8.86 years respectively. We therefore performed further multivariate analysis with adjustment to investigate the contribution of the onset age, presented in Supplement Tables 5–10. This, importantly, demonstrated no significant effect of age after adjusting for multiple comparisons (p > 0.05). Tables 1 and 2 show the important variables that contributed to characterizing the subtypes: severity rating assessed by H&Y stage, motor and non-motor assessment for MDS-UPDRS, global cognitive function assessed by Montreal Cognitive Assessment (MoCA), visuospatial abilities assessed by Benton Judgment of Line Orientation (BJLO), daytime sleepiness assessed by Epworth Sleepiness Scale (ESS), executive function/working memory assessed by Letter Number Sequencing (LNS), verbal memory assessed by Hopkins Verbal Learning Test (HVLT), sleep behavior assessed by Rapid Eye Movement sleep behavior disorder (RBD), depression degree assessed by Geriatric Depression Scale (GDS), impulsive-compulsive disorders assessed by Questionnaire for Impulsive-Compulsive Disorders (QUIP), autonomic dysfunction assessed by Scales for Outcomes in Parkinson’s disease-Autonomic symptoms (SCOPA-AUT), semantic testing for semantic fluency, anxiety degree assessed by State Trait Anxiety Inventory (STAI), processing speed/attention assessed by Symbol Digit Modalities Test (SDMT), olfaction measured by University of Pennsylvania Smell Identification Test (UPSIT), cognitive impairment assessed by Mild Cognitive Impairment (MCI), quantified α-syn, Aβ_1–42, t-Tau, and p-Tau₁₈₁ for CSF biomarkers, and DaTScan Striatal Binding Ratios (calculated as (striatal region)/(occipital) − 1 from the 4 h post-injection ¹²³I-ioflupane image)⁶. The specific mean values indicate the severity of the significant manifesting variables in the corresponding subtype.
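The striatal binding ratio defined above is a one-line computation; a small worked example makes the "minus one" explicit. The function name and the toy uptake values are hypothetical, and the inputs are assumed to be mean count densities in the striatal region of interest and in the occipital reference region.

```python
def striatal_binding_ratio(striatal_uptake, occipital_uptake):
    """SBR = (striatal region) / (occipital reference) - 1."""
    if occipital_uptake <= 0:
        raise ValueError("occipital reference uptake must be positive")
    return striatal_uptake / occipital_uptake - 1.0

# Toy values: striatal uptake 2.5 times the occipital reference.
sbr = striatal_binding_ratio(2.5, 1.0)
```

An SBR of 0 therefore means striatal uptake equal to the reference region, that is, no specific binding above background.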

The first subtype (Subtype I) comprised 201 patients and was characterized by mild H&Y stage (mean value 1.81), mild non-motor symptoms (cognitive impairment, depression, anxiety) as reported by patients on MDS-UPDRS Part I, and a significantly lower CSF t-Tau level. The motor severity of the second subtype (Subtype II, 107 patients) was similar to that of Subtype I. However, several measures of non-motor features such as MoCA, GDS, and STAI were more severe in Subtype II than in Subtype I. Of note, Subtype II had the highest CSF Aβ_1–42 concentration, but the lowest BJLO and SCOPA-AUT in independent non-motor domains. Subtype III (158 patients) had the most severe motor and non-motor symptoms.

We also demonstrated the discriminative power of these features through the differences between their mean values within each subtype and their global mean values, using a heatmap presented in Fig. 3. Each column in the figure represents a subtype, while each row represents a feature with p-value < 0.05 in the statistical testing. By comparing the profiles of the subtypes, we can see that the third subtype was older and had more severe motor and non-motor features. The first and second subtypes differed significantly in cognitive factors including MoCA, BJLO, HVLT, LNS, and SDMT, in CSF biomarkers (t-Tau), and in DaTScan SBR (the detailed mean values of these features are shown in Tables 1 and 2).

Disease progression patterns in different subtypes

The availability of PPMI study follow-up data allowed us to examine disease progression patterns for the different subtypes. To identify the features whose values change significantly from baseline to follow-up visits, we conducted statistical testing on their value differences between the two visits. The variables whose value changes were significantly distinct across the three subtypes are shown in Figs 4 and 5, where a greater slope indicates more rapid progression of the specific variable in the subtype, whereas a smaller slope represents a relatively more stable condition. Based on MDS-UPDRS motor and non-motor subscores and H&Y stage, the disease progression of Subtype III was faster than that of Subtype I and Subtype II, while progression in Subtype II was slower than in Subtype I. Non-motor measures of MoCA, LNS (Letter-Number Sequencing), SDMT (Symbol-Digit Modalities Test), and SCOPA-AUT suggested that Subtype III had the most prominent decline in general cognitive ability and autonomic function. In contrast, cognitive abilities were relatively unchanged for Subtype I and decreased slightly for Subtype II. Subtype I had faster progression of autonomic dysfunction than Subtype II. The DaTScan imaging results (see Supplement Fig. 3) for the regions of interest (caudate and putamen) in the three subtypes suggested that the DaTScan SBR value of Subtype III decreased more markedly, which was consistent with the fact that Subtype III was associated with the most severe disease course³⁰.

Comparison with other methods

Characteristics of the subtypes at the subjects’ last study records and progression obtained through four different methods are listed (see Supplement Table 2). We analyzed all the baseline methods by statistical testing (Chi-square test; Fisher exact test; one-way ANOVA test; Kruskal-Wallis H-test) and computed a p-value for each variable. For a fair comparison, DTW, t-SNE, and k-means were used in all the subtyping methods.

In Supplement Table 2, variables with more markers in a column are the more sensitive variables that can be used to interpret the subtyping results. We observe that the proposed method identifies more significant variables than the baselines, which leads to more distinct patient subtypes.

Discussion

Clinical Interpretation of the Identified Subtypes

In this study we have identified three novel PD subtypes based upon incorporation of comprehensive clinical and biomarker data and have summarized their clinical characteristics. Specifically, we can interpret the three subtypes as follows (and we interpret the three subtypes from a more abstract perspective in Supplemental Material Table 4).

Subtype I (Mild Baseline, Moderate Motor Progression)

The patients in this subtype start with relatively mild deficits in both their motor and non-motor capabilities at baseline. However, their motor function decays at a moderate rate over time while their cognitive abilities remain relatively stable.

Subtype II (Moderate Baseline, Mild Progression)

The patients in this subtype begin with moderate deficits in both their motor and non-motor capabilities at baseline (i.e., more severe than Subtype I). Both their motor and non-motor functionalities progress slowly over time.

Subtype III (Severe Baseline, Rapid Progression)

The patients in this subtype begin with more significant deficits in both their motor and non-motor capabilities at baseline (i.e., more severe than Subtype I and II). Both their motor and non-motor functionalities progress rapidly over time.

These analyses therefore demonstrate heterogeneity of PD progression between patient subtypes and also between classes of symptoms. From Subtype I to Subtype III, the subjects’ motor and non-motor symptoms are, overall, progressively more severe at baseline. In particular, we identify a subset of individuals with PD (Subtype III) with more severe motor and non-motor symptoms and a faster disease progression rate. However, more severe onset status in our model does not necessarily lead to faster progression, since the motor symptom decay rate for Subtype II is slower than for Subtype I. Our analyses also suggest dissociation between the progression of non-motor symptoms and motor symptoms in specific subtypes.

Clinical experience with PD has underlined that with progression of motor symptoms, non-motor symptoms commonly worsen. However, our data support that by searching for subtypes based upon phenotypic and possibly biomarker characteristics, it may be possible to dissect out groups of individuals in whom severity of motor and non-motor symptoms does not correlate strongly. Indeed, there is already abundant evidence that non-motor symptoms may associate differentially with “traditional” clinically-based subtypes of tremor-predominant versus PIGD PD³¹. In the clinic, a more nuanced appreciation of the likely future course of a patient with PD could be highly impactful.

Relationship with conventional PD subtypes

Conventionally there are two well-described motor PD subtypes based upon UPDRS scores: (1) Tremor-Dominant PD (TD); and (2) Postural Instability and Gait Difficulty (PIGD)⁷. In PPMI, the motor subtypes can be defined from the ratio of the mean tremor score to the mean PIGD score on the MDS-UPDRS³²: a ratio \(\geqslant \)1.15 is classified as TD and a ratio \(\leqslant \)0.90 as PIGD; if the ratio is between the cutoff scores 0.90 and 1.15, then the patient is classified as indeterminate. We therefore examined the prevalence of TD and PIGD in our subtypes at different time points (Fig. 6A). For Subtype I and II, more patients had TD than PIGD, and Subtype III had the highest prevalence of PIGD. Compared with Subtype I and III, the second subtype had the highest TD prevalence but the lowest PIGD prevalence.
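The ratio-based motor classification can be written as a short function. The sketch follows the usual MDS-UPDRS convention in which a tremor-to-PIGD ratio of at least 1.15 is TD, at most 0.90 is PIGD, and anything in between is indeterminate. The handling of a zero PIGD score and the names used are assumptions for illustration; which MDS-UPDRS items enter each mean score is left to the cited criteria.

```python
def classify_motor_subtype(tremor_mean, pigd_mean, td_cutoff=1.15, pigd_cutoff=0.90):
    """Classify a patient as TD, PIGD, or Indeterminate from mean item scores."""
    if pigd_mean == 0:
        # Ratio undefined: assume TD if any tremor is present, else indeterminate.
        return "TD" if tremor_mean > 0 else "Indeterminate"
    ratio = tremor_mean / pigd_mean
    if ratio >= td_cutoff:
        return "TD"
    if ratio <= pigd_cutoff:
        return "PIGD"
    return "Indeterminate"

# Toy patients: (mean tremor score, mean PIGD score).
examples = [(1.2, 0.4), (0.2, 0.8), (1.0, 1.0), (0.5, 0.0)]
labels = [classify_motor_subtype(t, p) for t, p in examples]
```

Applying the function at each visit gives the per-timepoint prevalence of TD, PIGD, and indeterminate patients within a learned subtype.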

We also studied the longitudinal correlations between the described motor subtypes and our subtypes I-III. We observed that over time there was a larger cohort of patients transitioning from TD to PIGD for Subtype III compared with subtypes I-II. Over 6 years, the prevalence of PIGD in Subtype III increased from 20.8% to 48.7%, whereas for Subtype I, prevalence of PIGD increased from 18.9% at baseline to 32.3% after 6 years, and prevalence change in Subtype II was minimal, from 14.2% to 16.9% at 6 years. From the above comparisons, we can conclude that the three subtypes learned from our method had different compositions of the three known motor subtypes. According to a prior review³, PIGD PD often has poor prognosis with rapid progression while TD PD has a better prognosis with slower progression, which is consistent with the above progression analysis of Subtypes I, II, and III.

Similarly, we also investigated the correlations between the three learned subtypes and the three established cognitive subtypes: (1) no impairment (PD-NC), (2) mild impairment (PD-MCI), and (3) dementia (PDD)^33,34,35. Figure 6B shows the results. For all three learned subtypes, the majority of patients were cognitively normal (PD-NC). Compared with Subtype I and II, the prevalence of PD-MCI in Subtype III was the largest and increased significantly during the 6 years of follow-up. Moreover, Subtype III contained all PDD patients, and their prevalence increased from 0.64% to 7.79% over the 6-year horizon.

Finally, we computed the correlations between the three learned subtypes and mood subtypes. Specifically, we assessed four mood subtypes: (1) Anxiety; (2) Depression; (3) Depression-Anxiety; and (4) Normal. In PPMI, anxiety and depression were measured by STAI and GDS respectively, with higher scores indicating more severe anxiety or depression. A suggested cut-off point used for STAI is 54–55^36,37. The cut-off point for GDS is 5 (patients with GDS \(\geqslant \)5 are “Depressed”; patients with GDS < 5 are “Not Depressed”). Over the 6-year follow-up period (Fig. 6C), we observed the following: for Subtype I, the prevalence of anxiety and mixed depression-anxiety decreased while the prevalence of depression alone increased; in contrast to Subtype I, the number of anxious patients slightly increased in Subtype II while the number of patients with depression or mixed depression-anxiety decreased; in Subtype III, the number of patients with mixed depression-anxiety rose significantly, while the percentage of patients with anxiety and those with depression decreased, indicating a gradual transition from having one mood symptom to multiple mood symptoms. It is worth noting that the prevalence of normal-mood patients in all three learned subtypes grew slightly in the follow-up period, suggesting that some patients had improvements in their mood disorder during the disease course.
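The four mood categories follow from two thresholded scores. The sketch below uses the GDS cut-off of 5 stated above; for STAI the text gives a range of 54–55, so treating a score of 55 or more as anxious is an assumption, exposed as a parameter. The function names and toy scores are hypothetical.

```python
from collections import Counter

def mood_subtype(stai, gds, stai_cutoff=55, gds_cutoff=5):
    """Assign one of the four mood subtypes from STAI and GDS scores."""
    anxious = stai >= stai_cutoff      # assumed reading of the 54-55 cut-off
    depressed = gds >= gds_cutoff      # GDS >= 5 is "Depressed"
    if anxious and depressed:
        return "Depression-Anxiety"
    if anxious:
        return "Anxiety"
    if depressed:
        return "Depression"
    return "Normal"

def mood_prevalence(scores):
    """Fraction of patients in each mood subtype for (stai, gds) pairs."""
    counts = Counter(mood_subtype(stai, gds) for stai, gds in scores)
    return {label: counts[label] / len(scores) for label in counts}

# Toy (STAI, GDS) pairs for four patients at one visit.
visit_scores = [(60, 7), (60, 2), (40, 5), (40, 4)]
prevalence = mood_prevalence(visit_scores)
```

Computing `mood_prevalence` separately for each learned subtype and each visit yields the kind of prevalence trajectories compared in the paragraph above.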

In addition to the above-mentioned conventional subtypes, a more traditional approach to PD subtyping is based solely on patient onset age^38,39,40. These studies suggested that PD patients with older onset ages are usually associated with more severe motor and non-motor symptoms^38,40, and more rapid disease progression rates³⁹. Our study takes a complementary approach: we use longitudinal patient records for subtyping without onset ages. Table 1 shows average ages for the three subtypes. Of note, Subtype III, the group with the most rapid progression, has the oldest average onset age of the three subtypes.

Limitations

This study is an initial attempt at leveraging advanced data analytics to identify PD subtypes from longitudinal and heterogeneous clinical study data. Our approach has demonstrated strong potential for identifying comprehensive progressive PD subtypes. However, there are still some limitations in the current approach: (1) the approach is completely data-driven and does not utilize any clinical domain knowledge; (2) the deep learning (LSTM) procedure cannot be straightforwardly interpreted; (3) our study was conducted only on the PPMI cohort. In the future, we will continue our research along these lines, i.e., combining knowledge-driven and data-driven insights, making deep learning models interpretable, and replicating the findings in more patient cohorts.

While recognizing the limitations of the present analyses, which are based upon a single cohort of de novo PD patients, we suggest the following potential clinical implications. Individuals with milder PD motor and non-motor symptoms and lower CSF t-Tau levels will show moderate progression in motor and autonomic symptoms but are at lower risk of cognitive decline; those with mild motor symptoms but significant cognitive deficits and anxiety, along with high CSF Abeta levels, are at risk of greater cognitive decline in the face of slow motor progression; and those with more severe motor combined with non-motor symptoms at onset are at risk of more rapid decline of motor and non-motor, including cognitive, symptoms. Our data therefore suggest not only a dissociation between the progression of motor and cognitive symptoms, but also a dissociation between the non-motor symptoms of cognition and autonomic function.

Conclusions

We propose a novel deep learning-based patient subtyping method for PD, in which an LSTM is used to standardize and densify the patient records. DTW is then used to calculate patient similarities, from which the PD subtypes are derived. Using this approach, we identified three distinct subtypes in the PPMI cohort, which are heterogeneous in both motor and non-motor characteristics. These subtypes have distinct patterns of progression and, moreover, are associated with specific biomarkers. We examined how these newly discovered subtypes relate to traditional motor, cognitive and mood PD subtypes; while we found some relationships, we suggest that our approach benefits from incorporating substantially more comprehensive data. The subtypes that we identified demonstrate that, in contrast to studies that examine aggregate data, disease progression rates do not necessarily associate with baseline severity, and the progression rate of non-motor symptoms does not have a simple correlation with motor progression but varies by subtype.
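The DTW step mentioned above can be illustrated with a short sketch. It is not the implementation used in the study: it assumes that each patient is represented by a sequence of equal-dimension vectors (for example, per-visit LSTM representations), uses Euclidean distance as the local cost, and applies no warping-window constraint. All function names are hypothetical.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def dtw_distance(a: Sequence[Vector], b: Sequence[Vector]) -> float:
    """Dynamic time warping distance between two multivariate sequences.

    Local cost is the Euclidean distance between visit vectors (an
    assumption); sequences may differ in length but not in dimension.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must contain at least one visit")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(cost[i - 1][j],      # repeat b[j-1]
                                     cost[i][j - 1],      # repeat a[i-1]
                                     cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def pairwise_dtw(patients: Sequence[Sequence[Vector]]) -> List[List[float]]:
    """Symmetric matrix of DTW distances between all patient sequences."""
    k = len(patients)
    dist = [[0.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            dist[i][j] = dist[j][i] = dtw_distance(patients[i], patients[j])
    return dist
```

A distance matrix of this kind could then be passed to a clustering or embedding step to derive subtypes. Which step is appropriate depends on the study design and is not prescribed by this sketch.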

References

1. Post, B. et al. Clinical heterogeneity in newly diagnosed Parkinson’s disease. J. neurology 255, 716–722 (2008).
2. van Rooden, S. M. et al. Clinical subtypes of Parkinson’s disease. Mov. Disord. 26, 51–58 (2011).
3. Thenganatt, M. A. & Jankovic, J. Parkinson disease subtypes. JAMA neurology 71, 499–504 (2014).
4. van Rooden, S. M. et al. The identification of Parkinson’s disease subtypes using cluster analysis: a systematic review. Mov. Disord. 25, 969–978 (2010).
5. Fereshtehnejad, S.-M. et al. New clinical subtypes of Parkinson disease and their longitudinal progression: a prospective cohort comparison with other phenotypes. JAMA neurology 72, 863–873 (2015).
6. Marek, K. et al. The Parkinson progression marker initiative (PPMI). Prog. neurobiology 95, 629–635 (2011).
7. Jankovic, J. et al. Variable expression of Parkinson’s disease: a base-line analysis of the DATATOP cohort. Neurol. 40, 1529–1529 (1990).
8. Jain, A. K. & Dubes, R. C. Algorithms for clustering data (Prentice-Hall, Inc., 1988).
9. Lewis, S. et al. Heterogeneity of Parkinson’s disease in the early clinical stages using a data driven approach. J. Neurol. Neurosurg. & Psychiatry 76, 343–348 (2005).
10. Lawton, M. et al. Parkinson’s disease subtypes in the Oxford Parkinson Disease Centre (OPDC) discovery cohort. J. Park. disease 5, 269–279 (2015).
11. Hartigan, J. A. & Wong, M. A. Algorithm AS 136: A k-means clustering algorithm. J. Royal Stat. Soc. Ser. C Applied Stat. 28, 100–108 (1979).
12. Erro, R. et al. Clinical clusters and dopaminergic dysfunction in de-novo Parkinson disease. Park. related disorders 28, 137–140 (2016).
13. Fereshtehnejad, S.-M., Zeighami, Y., Dagher, A. & Postuma, R. B. Clinical criteria for subtyping Parkinson’s disease: biomarkers and longitudinal progression. Brain 140, 1959–1976 (2017).
14. Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural computation 9, 1735–1780 (1997).
15. Müller, M. Dynamic time warping. Inf. retrieval for music motion 69–84 (2007).
16. Kang, J.-H. et al. Association of cerebrospinal fluid β-amyloid 1-42, t-tau, p-tau181, and α-synuclein levels with clinical features of drug-naive patients with early Parkinson disease. JAMA neurology 70, 1277–1287 (2013).
17. Azur, M. J., Stuart, E. A., Frangakis, C. & Leaf, P. J. Multiple imputation by chained equations: what is it and how does it work? Int. journal methods psychiatric research 20, 40–49 (2011).
18. Bishop, C. M. Neural networks for pattern recognition (Oxford university press, 1995).
19. Sutton, R. S. & Barto, A. G. Introduction to reinforcement learning, vol. 135 (MIT press Cambridge, 1998).
20. Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. Jama 316, 2402–2410 (2016).
21. Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nat. 542, 115 (2017).
22. Miotto, R., Li, L., Kidd, B. A. & Dudley, J. T. Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Sci. reports 6, 26094 (2016).
23. Graves, A. Supervised sequence labelling. In Supervised sequence labelling with recurrent neural networks, 5–13 (Springer, 2012).
24. Zhu, X., Ghahramani, Z. & Lafferty, J. D. Semi-supervised learning using Gaussian fields and harmonic functions. In Proceedings of the 20th International conference on Machine learning (ICML-03), 912–919 (2003).
25. Maaten, L. V. D. & Hinton, G. Visualizing data using t-SNE. J. machine learning research 9, 2579–2605 (2008).
26. Van Der Maaten, L., Postma, E. & Van den Herik, J. Dimensionality reduction: a comparative review. J Mach Learn. Res 10, 66–71 (2009).
27. Hartigan, J. A. Clustering algorithms (Wiley, 1975).
28. Wold, S., Esbensen, K. & Geladi, P. Principal component analysis. Chemom. intelligent laboratory systems 2, 37–52 (1987).
29. Dujardin, K. et al. The spectrum of cognitive disorders in Parkinson’s disease: a data-driven approach. Mov. Disord. 28, 183–189 (2013).
30. Gao, R. et al. CSF biomarkers and its associations with 18F-AV133 cerebral VMAT2 binding in Parkinson’s disease: a preliminary report. PloS one 11, e0164762 (2016).
31. Huang, X. et al. Non-motor symptoms in early Parkinson’s disease with different motor subtypes and their associations with quality of life. Eur. journal neurology (2018).
32. Stebbins, G. T. et al. How to identify tremor dominant and postural instability/gait difficulty groups with the Movement Disorder Society Unified Parkinson’s Disease Rating Scale: comparison with the Unified Parkinson’s Disease Rating Scale. Mov. Disord. 28, 668–670 (2013).
33. Williams-Gray, C., Foltynie, T., Brayne, C., Robbins, T. & Barker, R. Evolution of cognitive dysfunction in an incident Parkinson’s disease cohort. Brain 130, 1787–1798 (2007).
34. Busse, A., Hensel, A., Gühne, U., Angermeyer, M. & Riedel-Heller, S. Mild cognitive impairment: long-term course of four clinical subtypes. Neurol. 67, 2176–2185 (2006).
35. Yaffe, K., Petersen, R. C., Lindquist, K., Kramer, J. & Miller, B. Subtype of mild cognitive impairment and progression to dementia and death. Dementia geriatric cognitive disorders 22, 312–319 (2006).
36. Julian, L. J. Measures of anxiety: State-Trait Anxiety Inventory (STAI), Beck Anxiety Inventory (BAI), and Hospital Anxiety and Depression Scale-Anxiety (HADS-A). Arthritis care & research 63 (2011).
37. Kvaal, K., Ulstein, I., Nordhus, I. H. & Engedal, K. The Spielberger State-Trait Anxiety Inventory (STAI): the state scale in detecting mental disorders in geriatric patients. Int. journal geriatric psychiatry 20, 629–634 (2005).
38. Pagano, G., Ferrara, N., Brooks, D. J. & Pavese, N. Age at onset and Parkinson disease phenotype. Neurol. 10–1212 (2016).
39. Selikhova, M. et al. A clinico-pathological study of subtypes in Parkinson’s disease. Brain 132, 2947–2957 (2009).
40. Wickremaratchi, M. M. et al. The motor phenotype of Parkinson’s disease in relation to age at onset. Mov. Disord. 26, 457–463 (2011).


Acknowledgements

The research is supported by NSF IIS-1716432, NSF IIS-1650723, and Michael J. Fox Foundation grant number 14858. Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (http://www.ppmi-info.org/data). For up-to-date information on the study, visit http://www.ppmi-info.org. PPMI, a public-private partnership, is funded by the Michael J. Fox Foundation for Parkinson’s Research and funding partners, including Abbvie, Avid, Biogen, Bristol-Myers Squibb, Covance, GE, Genentech, GlaxoSmithKline, Lilly, Lundbeck, Merck, Meso Scale Discovery, Pfizer, Piramal, Roche, Sanofi, Servier, TEVA, UCB and Golub Capital.

Author information

Authors and Affiliations

Department of Healthcare Policy and Research, Weill Cornell Medical College, Cornell University, New York, USA
Xi Zhang, Jingyuan Chou, Yize Zhao & Fei Wang
Department of Automation, Tsinghua University, Beijing, China
Jian Liang
AI for Healthcare, IBM Research, Cambridge, USA
Cao Xiao
Department of Neurology, Weill Cornell Medical College, Cornell University, New York, USA
Harini Sarva & Claire Henchcliffe

Authors

Xi Zhang
Jingyuan Chou
Jian Liang
Cao Xiao
Yize Zhao
Harini Sarva
Claire Henchcliffe
Fei Wang

Contributions

X.Z., J.L., C.X. and F.W. designed the approach. X.Z. and F.W. wrote the paper. X.Z. and J.L. conducted the experiments. Y.Z. performed the statistical analysis. H.S. and C.H. provided the clinical interpretations of the identified subtypes. All authors polished the manuscript.

Corresponding author

Correspondence to Fei Wang.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Zhang, X., Chou, J., Liang, J. et al. Data-Driven Subtyping of Parkinson’s Disease Using Longitudinal Clinical Records: A Cohort Study. Sci Rep 9, 797 (2019). https://doi.org/10.1038/s41598-018-37545-z


Received: 14 February 2018
Accepted: 10 December 2018
Published: 28 January 2019
DOI: https://doi.org/10.1038/s41598-018-37545-z
