Introduction

Alzheimer’s disease (AD) dementia is the most common form of dementia1. Previous neuroimaging studies have shown that patients with AD demonstrate characteristic patterns of cortical atrophy at the group level, especially in the medial temporal, temporoparietal, posterior cingulate, and precuneus regions2,3. Recent classification methods also provide a general framework to classify individual subjects, for example using arbitrary features defined on a three-dimensional (3D) cortical surface4,5,6,7 or volumetric features8,9,10. These classification methods have achieved high accuracy in individual-subject analyses. Both group- and individual-level analyses have successfully demonstrated the discriminating power of cortical atrophy patterns for AD diagnosis.

Amnestic mild cognitive impairment (aMCI) refers to a transitional state between normal cognition and dementia. Previous studies have found that individuals with aMCI progress to AD at a rate of approximately 5–25% per year11,12, while about 16–23% per year reverted from aMCI to normal cognition13,14,15. Therefore, it is crucial to develop prediction criteria that can distinguish individuals with aMCI at imminent risk of conversion to AD dementia from those who will remain stable16. In addition, different rates of progression have also been observed among patients with AD6,17,18. The rate of disease progression has important implications in clinical practice, as it has been shown to be an important factor in determining the prognosis of AD19. To date, there have been various neuroimaging studies for predicting AD prognosis at the individual-level20,21,22,23,24. However, these were limited by relatively small sample sizes, and the use of less sophisticated imaging methods such as low-field magnetic resonance imaging (MRI). In addition, although several lines of research attempted to predict conversion to AD in aMCI patients, these studies presented problems related to limited prediction accuracy23,24,25,26.

In this study, we first aimed to develop a new method for measuring AD-specific similarity of cortical atrophy patterns at the individual level by employing an individual-level machine learning algorithm, and then to demonstrate the potential of this similarity measure in predicting the individual-level prognosis on the AD continuum. Our machine learning method, as previously demonstrated4, presents an individual subject classification based on incremental learning for AD diagnosis and prediction for the progression of AD using cortical thickness data. We adopt this method for training the group-level classifier, and then propose a new similarity measure for an individual-level cortical atrophy pattern compared to that of the representative AD patient. This AD-specific atrophy similarity measure represents how similar the cortical atrophy pattern of an individual subject is to that of a representative AD patient defined using a well-defined AD cohort. Specifically, we demonstrated the efficacy of the proposed measure using a large neuroimaging cohort of 869 cognitively normal (CN) individuals and 473 patients with probable AD dementia. We further validated the AD-specific similarity measure using a longitudinal neuroimaging cohort, by comparing this measure between aMCI converters and non-converters and between AD patients with fast and slow degrees of clinical decline. We hypothesized that the proposed individual assessment method would be useful not only for determining the diagnosis of an individual subject at a given time, but also for predicting that subject's future course, including both progression from aMCI to AD (a one-year aMCI follow-up validation) and the prognosis of AD (a five-year AD follow-up validation).

Results

Demographic and clinical characteristics

The demographic and clinical characteristics of the study participants are presented in Table 1. In the cross-sectional cohort, patients with AD had a significantly higher mean age, a lower level of education, and a higher frequency of the apolipoprotein E (APOE) ε4 allele and of hypertension than CN individuals. In the longitudinal cohort, aMCI converters had a significantly higher frequency of the APOE ε4 allele and lower baseline mini-mental state examination (MMSE) scores than aMCI non-converters. There were no significant demographic differences between AD slow- and fast-decliners.

Table 1 Demographic and clinical characteristics of the study participants.

Group classification performance

We assessed classification performance using the 10-fold cross-validation procedure on the cross-sectional cohort. Our classifier showed accuracy, sensitivity and specificity values of 91.1%, 83.5%, and 95.2%, respectively, for discriminating AD patients from CN individuals. Figure 1A shows the discriminating regions of our classification on the atlas surface meshes. The colored regions in the figure were determined by the amount of contribution of each vertex to classification. That is, visualizing the axis that maximally separates the two groups in the linear discriminant analysis (LDA) space shows the contribution of each component to classification4. The entorhinal cortex and precuneus were the most discriminative for AD classification, and the lateral temporal lobe and the prefrontal cortex were also discriminative. In addition, we validated our classification method against a previously proposed method, the support vector machine. The discriminative regions for both classifiers were consistent with each other (Supplementary Figure 1).
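As a quick consistency check on the reported figures, the overall accuracy can be recomputed from the sensitivity, the specificity, and the two group sizes. The sketch below assumes the rates are pooled over all 473 AD patients and 869 CN individuals; the function name is illustrative and is not part of the original pipeline.

```python
def overall_accuracy(sensitivity, specificity, n_pos, n_neg):
    """Accuracy implied by sensitivity, specificity, and the two group sizes."""
    true_pos = sensitivity * n_pos   # correctly classified AD patients
    true_neg = specificity * n_neg   # correctly classified CN individuals
    return (true_pos + true_neg) / (n_pos + n_neg)

# Rates and group sizes as reported in the text (473 AD, 869 CN).
acc = overall_accuracy(0.835, 0.952, n_pos=473, n_neg=869)
print(f"{acc:.1%}")  # 91.1%
```

Under that pooling assumption, the recomputed value agrees with the reported accuracy of 91.1%.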

Figure 1

Discriminating features of our classification. (A) The discriminating regions of our classification on the atlas surface meshes and (B) The discriminative pattern of each patient with aMCI and AD. Color intensities in the figure represent discriminative power in AD classification. aMCI = amnestic mild cognitive impairment; AD = Alzheimer’s disease.

Validation of clinical progression in patients with aMCI and AD

Subjects from the longitudinal aMCI and AD cohorts were used to validate the proposed cortical atrophy pattern analysis. Figure 1B visualizes the cortical atrophy patterns of each AD and aMCI patient group over time. The y-axis in the figure represents the AD-specific cortical atrophy similarity measure compared with the representative cross-sectional AD patient cohort used to develop the classifier. In patients with aMCI, non-converters showed no significant discriminative pattern, while converters demonstrated significant discriminative patterns in the inferior parietal lobule at baseline and in the prefrontal and temporal cortices and inferior parietal lobule at the first-year follow-up visit. From the baseline to the third-year follow-up visits, AD slow-decliners showed discriminative patterns around the prefrontal and temporal cortices, while AD fast-decliners demonstrated significant discriminative patterns in most of the prefrontal, inferior parietal, and temporal cortices.

There were significant differences in the AD-specific atrophy similarity measure at both baseline and first year follow-up visits between aMCI converters and non-converters (Fig. 2A). Specifically, converters showed significantly greater increases of the AD-specific atrophy similarity measure over time than did non-converters on a mixed effects model (β = 3.6, standard error [SE] = 1.6, p = 0.027). In patients with AD, furthermore, there were significant differences between fast- and slow-decliners in the AD-specific atrophy similarity measure at baseline, first year, and third year follow-up visits (Fig. 2B). AD fast-decliners also showed significantly greater increases of the AD-specific atrophy similarity measure than did slow-decliners on a mixed effects model (β = 2.9, SE = 1.3, p = 0.029). Specific details regarding the AD-specific atrophy similarity and neuropsychological performance of both of the longitudinal cohorts by group status can be found in Supplementary Table 1.

Figure 2

Comparisons of the AD-specific atrophy similarity at baseline and follow-up years: (A) non-converters vs. converters in patients with aMCI and (B) slow- and fast-decliners in patients with AD. Mixed effects models of the worsening in AD-specific atrophy similarity over time between the classified groups by clinical progression in patients with aMCI and AD showed significant differences between the groups (p = 0.027 in aMCI cohort and p = 0.029 in AD cohort). aMCI = amnestic mild cognitive impairment; AD = Alzheimer’s disease.

Table 2 shows mixed effects models examining how worsening in neuropsychological test performance over time was related to AD-specific atrophy similarity in patients with aMCI and AD. Significant AD-specific atrophy similarity-by-time interactions were obtained for most neuropsychological tests in the two longitudinal cohorts from baseline to year one or three. Specifically, we found AD-specific atrophy similarity by time interactions in both groups for language function, Seoul Neuropsychological Screening Battery-Dementia version (SNSB-D) total score, MMSE, and Clinical Dementia Rating sum of boxes (CDR-SB), while there were significant interactions for only the AD group in attention, memory, frontal/executive function, and Clinical Dementia Rating (CDR).

Table 2 Mixed effects models of worsening in the neuropsychological test performances over time by AD-specific atrophy similarity in patients with aMCI and AD.

Discussion

In this study, we developed and validated an AD-specific atrophy similarity measure as a novel MRI-based biomarker to provide prospective AD risk prediction on an individual subject level. We found that the AD-specific atrophy similarity measure showed promising results at an individual-level, where it not only supported the early prediction of AD, but also enabled the discrimination of brain and clinical trajectories in patients with AD dementia. The AD-specific atrophy similarity measure, based on cortical thickness analyses we recently developed, was derived from a probabilistic statistical classification model. Our method demonstrated high classification performance in the prediction of AD trajectories and in accurately distinguishing AD patients from normal controls, supporting the discriminative power of our method in both prognosis and diagnosis.

Our conclusion that the AD-specific atrophy similarity measure contributes to prediction of prognosis along the AD continuum is supported by the following observations: (1) in patients with aMCI, converters showed higher AD-specific atrophy similarity than non-converters, with increasing scores, at baseline and one-year follow-up visits; (2) in patients with AD dementia, fast-decliners also showed higher AD-specific atrophy similarity than slow-decliners at all visits over a three-year follow-up. More specifically, the discriminative patterns found in aMCI converters and AD fast-decliners were consistent with previous studies of AD prognosis, which reported that changes in the lateral temporal and inferior parietal cortices were related to AD progression21,22. Furthermore, significant inverse relationships between the AD-specific atrophy similarity measure and cognitive performance over time were observed in patients with aMCI and AD. Our study therefore provides new insight into both the prediction of aMCI to AD conversion and the prediction of accelerated clinical decline in AD dementia. Further follow-up will allow us to examine whether baseline atrophy similarity measurements can predict the specific time-to-conversion at the individual subject level.

While there have been several recent neuroimaging studies on the prediction of conversion from aMCI to AD dementia, most have exhibited limited prediction accuracy and small sample sizes23,24,25,26. We investigated the use of the AD-specific atrophy similarity measure as a means to obtain a sensitive and specific biomarker of AD-like spatial patterns of cortical thinning, and of conversion from aMCI to AD within a large cohort. Some recent studies using a cortical thickness-based clustering method demonstrated that AD patients with a parietal-dominant atrophy pattern showed poor performance in neuropsychological tests as well as aggressive rates of progression6,18. In comparison, a strength of our study is that different rates of disease progression were investigated using the AD-specific atrophy similarity measure at the individual subject level, rather than using cluster or group analyses. In particular, our method has increased statistical power since the AD-specific atrophy similarity measure was derived using machine learning over a large neuroimaging cohort of AD and CN participants. In addition, the current study limited MRI data collection to one scanner with the same scan parameters across waves of data collection, strengthening the consistency of our data and results.

However, some limitations should be considered when interpreting the results. First, pathologic confirmation was not performed in the present study participants. Considering the discrepancy between clinical and neuropathological diagnoses of AD27, and that a certain portion of clinically diagnosed AD patients may show a negative amyloid positron emission tomography scan28, we cannot exclude the possibility that our classification methods might have been affected by AD-mimicking patients. However, this concern is mitigated to some degree by our previous studies showing that about 90% of clinically diagnosed AD patients had a positive amyloid positron emission tomography scan29,30. Second, our classification scheme is based on the assumption that the cortical thickness data can be separated into two categories, such as CN and AD. Because some neural network-based methods can handle non-linearity in the feature data, future studies could employ these recently developed deep learning approaches. Third, there is no consensus regarding the time window during which conversion from aMCI to AD must be evaluated, or regarding specific cut-points for defining fast and slow decline in patients with AD. Fourth, the two longitudinal cohorts used to validate the AD-specific atrophy similarity had relatively small sample sizes. Fifth, the proposed classification and AD-specific atrophy similarity methods are based solely on cortical thickness data; clinical risk factors and neuropsychological scores were not used. Future work should examine how those factors affect the classification performance and the atrophy pattern analysis results. Finally, since other classification methods have used various types of feature data with different datasets, it is difficult to directly compare the classification performance of our method with that of other methods.

In conclusion, we have developed an AD-specific atrophy similarity measure as a novel MRI-based biomarker. This method provides an innovative approach for enabling the prediction of dementia risk, and for evaluating trajectories along the AD continuum on an individual subject level. Furthermore, while further research is still necessary to validate and further develop the AD-specific similarity measure in other populations, this method will facilitate risk stratification not only for prevention trials but also for personalized therapy.

Methods

Study participants

Cross-sectional cohort for development of the AD classifier

A total of 536 patients with probable AD dementia and 912 CN individuals who underwent high-resolution 3T brain MRI with 3D volumetric imaging and detailed neuropsychological testing were recruited from the Memory Disorders Clinic of the Samsung Medical Center (from June 2006 to June 2012). The patients with probable AD dementia fulfilled the National Institute of Neurological and Communicative Disorders and Stroke and Alzheimer's Disease and Related Disorders Association (NINCDS-ADRDA) criteria31. CN individuals had no history of neurologic or psychiatric disorders, and had normal cognitive function determined using neuropsychological tests (above the 16th percentile for age- and education-matched norms).

We excluded 63 AD patients with any of the following conditions: missing education data (N = 9); unreliable cortical thickness measurements due to head motion, blurring of the MRI, inadequate registration to a standardized stereotaxic space, misclassification of tissue type, or inexact surface extraction (N = 31); or severe white matter hyperintensities (WMH) defined as deep WMH ≥ 25 mm and periventricular WMH ≥ 10 mm (N = 40). Since study participants could have more than one exclusion condition, the final sample size of AD patients was 473. In addition, out of 912 CN individuals, we excluded 22 participants with incomplete demographic data. From the remaining 890 participants, we excluded 21 participants with unreliable analyses of cortical thickness, yielding 869 CN individuals for analysis in this study.

Laboratory tests were conducted in all participants to rule out other causes of dementia, and included complete blood counts, vitamin B12 and folate levels, a metabolite profile, thyroid function tests, and syphilis serology. Participants were also excluded if they had a cerebral, cerebellar, or brainstem infarction, hemorrhage, tumors, hydrocephalus, or severe head trauma.

Longitudinal cohort for validation of the AD-specific atrophy similarity measure

A total of 79 aMCI patients were retrospectively recruited from the Memory Disorders Clinic of the Samsung Medical Center (from August 2007 to December 2010). These aMCI patients had completed at least their first year follow-up visit with the same interview and neuropsychological tests as their baseline evaluation, had undergone high-resolution 3T brain MRI with 3D volumetric imaging, and did not have any critical missing data. Patients were diagnosed with aMCI using the Petersen criteria32 with the following modifications, which have been previously described in detail33: (1) a subjective cognitive complaint by the patient or his/her caregiver; (2) normal Activities of Daily Living (ADL) score determined clinically and with the instrumental ADL scale; (3) an objective cognitive decline below the 16th percentile (−1.0 standard deviation [SD]) of age- and education-matched norms in at least one of four cognitive domains (language, visuospatial, memory or frontal-executive function) on neuropsychological tests; and (4) absence of dementia. Patients with aMCI were grouped as non-converters (N = 53) if they were diagnosed with aMCI at baseline and remained so during their first year of follow-up, and as converters if they were diagnosed with aMCI at baseline and diagnosed with AD during their first year of follow-up, without reversion to aMCI or CN (N = 26).
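The equivalence between the 16th percentile and −1.0 SD used in criterion (3) can be checked numerically. The snippet below is only an illustration and assumes that the normative test scores follow a normal distribution.

```python
from statistics import NormalDist

# Fraction of a standard normal distribution lying below -1.0 SD.
cutoff_sd = -1.0
percentile = NormalDist().cdf(cutoff_sd) * 100
print(f"{percentile:.1f}")  # about 15.9, i.e. roughly the 16th percentile
```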

We also included 36 patients with AD who participated in the prospective, five-year longitudinal Alzheimer’s Disease and Positron Emission Tomography (ADAPET) study, and were recruited from March 2006 to December 2006. The patients fit the criteria of the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition34 and the NINCDS-ADRDA criteria for probable AD31. The enrolled patients were eligible if they had early-stage dementia with a CDR score of 0.5 or 1, were cooperative candidates for this longitudinal study, and had a caregiver. None had a family history suggestive of an autosomal dominant disease. Of 36 patients with AD, 27 patients who completed the third year of evaluation were enrolled in the current study. The assessment procedure of the participants has been described in detail elsewhere35,36. Patients with AD were grouped as fast-decliners (N = 13) if their CDR-SB score increased more than five points during the three year follow-up; otherwise, they were labeled as slow-decliners (N = 14).

Standard protocol approvals, registrations, and patient consents

We obtained written informed consent from each patient. This study was approved by the Institutional Review Board at the Samsung Medical Center. In addition, all methods were carried out in accordance with the approved guidelines.

Neuropsychological tests

All participants underwent a standardized neuropsychological battery, the Seoul Neuropsychological Screening Battery (SNSB), which is described in detail elsewhere37. The SNSB consists of tests for verbal and visual memory, attention, language, praxis, four elements of Gerstmann syndrome, visuospatial function, frontal/executive function, the MMSE, the CDR, and the CDR-SB. From the SNSB results, we calculated the SNSB-D score in attention, language, visuospatial, memory, and frontal/executive domains, as previously described37,38.

Image acquisition and preprocessing

3D T1-weighted Turbo Field Echo MRI images were acquired from all participants in this study using the Philips 3T Achieva MRI scanner with the same imaging parameters (sagittal slice thickness 1.0 mm, over contiguous slice acquisition with 50% overlap; no gap; repetition time 9.9 ms; echo time 4.6 ms; flip angle 8°; and matrix size 240 × 240 reconstructed to 480 × 480 over a 240 mm field of view).

For each subject, we performed image preprocessing using FreeSurfer 5.1.0 (Athinoula A. Martinos Center at the Massachusetts General Hospital, Harvard Medical School; http://surfer.nmr.mgh.harvard.edu/). Figure 3A shows the overview of our image preprocessing method. We first constructed the outer and inner cortical surface meshes from the MR volume of each subject. The two meshes are isomorphic with the same vertices and connectivity because the outer surface is constructed by deforming the inner surface. In order to establish inter-subject correspondence, we resampled each subject’s cortical surface to 40,962 vertices for each hemisphere using the previously proposed method4.

Figure 3

Overview of the proposed method. (A) Image preprocessing; (B) Group classifier training; and (C) AD-specific pattern similarity computation. AD = Alzheimer’s disease.

To remove noise from the cortical thickness data, we employed the manifold harmonic transform (MHT) to map the cortical thickness from the surface onto the frequency domain39,40. High-frequency components of the transformed cortical thickness data were regarded as noise and discarded4. Filtering out these high-frequency components both removes noise and reduces the dimensionality of the cortical thickness data.
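The idea of spectral low-pass filtering on a surface can be illustrated with a toy example. The sketch below is not the MHT implementation used in this study: it assumes a closed ring of vertices in place of a cortical mesh, uses a plain graph Laplacian in place of the mesh Laplace-Beltrami operator, and the function names and the cut-off of nine components are chosen only for illustration.

```python
import numpy as np

def ring_laplacian(n):
    """Graph Laplacian of a closed ring of n vertices (toy stand-in for a mesh)."""
    lap = 2.0 * np.eye(n)
    idx = np.arange(n)
    lap[idx, (idx + 1) % n] = -1.0
    lap[idx, (idx - 1) % n] = -1.0
    return lap

def low_pass(signal, laplacian, n_keep):
    """Keep only the n_keep lowest-frequency Laplacian eigenvectors."""
    _, basis = np.linalg.eigh(laplacian)       # eigenvalues ascending: low to high frequency
    coeffs = basis[:, :n_keep].T @ signal      # forward transform, truncated
    return basis[:, :n_keep] @ coeffs, coeffs  # smoothed signal, reduced feature vector

rng = np.random.default_rng(0)
n = 200
angle = 2 * np.pi * np.arange(n) / n
clean = 2.5 + 0.5 * np.cos(angle)              # smooth synthetic "thickness" profile (mm)
noisy = clean + rng.normal(0.0, 0.2, n)        # add vertex-wise noise
smoothed, features = low_pass(noisy, ring_laplacian(n), n_keep=9)

print(features.shape)                          # (9,)
print(np.mean((noisy - clean) ** 2), np.mean((smoothed - clean) ** 2))
```

In this toy setting the 200 vertex values are reduced to nine coefficients, and the reconstruction error relative to the smooth profile is smaller after filtering than before.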

Cortical atrophy pattern analysis

We analyzed the cortical atrophy pattern for each subject based on the preprocessed cortical thickness data. Specifically, cortical atrophy patterns were quantified using Inbrain®, a Korea Food and Drug Administration (KFDA)-cleared software and a registered trademark of MIDAS Information Technology Co., Ltd., which performs fully-automated image analysis of brain structures. The proposed method consists of two steps: training a group classifier (Fig. 3B) and computing an AD-specific pattern similarity (Fig. 3C). The noise-filtered cortical thickness data were converted to w-scores adjusted for age and education level in order to minimize their effects on cortical thickness. For classifier training, we used w-scores as feature vectors and employed principal component analysis (PCA) and LDA41. Specifically, we reduced the dimensionality of the feature vectors with PCA, and found the coordinate axes that maximally separated the groups with LDA. Given feature vectors as input, the classifier was trained by performing PCA and LDA in sequence. We calculated the PCA dimension following the methods of our previous paper4.
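The w-score adjustment can be sketched as follows. The exact formula is not given in the text, so this example assumes the commonly used definition: a linear model of each feature on age and education is fitted in the CN group, and each subject's residual is divided by the standard deviation of the CN residuals. All data are synthetic and all names are illustrative.

```python
import numpy as np

def fit_w_score_model(thickness_cn, covariates_cn):
    """Fit one linear model per feature on the normative (CN) group."""
    design = np.column_stack([np.ones(len(covariates_cn)), covariates_cn])
    beta, *_ = np.linalg.lstsq(design, thickness_cn, rcond=None)
    residuals = thickness_cn - design @ beta
    dof = design.shape[0] - design.shape[1]
    resid_sd = np.sqrt((residuals ** 2).sum(axis=0) / dof)
    return beta, resid_sd

def w_scores(thickness, covariates, beta, resid_sd):
    """(observed - expected for age and education) / SD of CN residuals."""
    design = np.column_stack([np.ones(len(covariates)), covariates])
    return (thickness - design @ beta) / resid_sd

# Synthetic CN group: thickness declines with age and rises slightly with education.
rng = np.random.default_rng(0)
n_cn, n_features = 300, 5
age = rng.uniform(55, 85, n_cn)
education = rng.uniform(0, 18, n_cn)
cov_cn = np.column_stack([age, education])
thick_cn = (3.0 - 0.01 * (age[:, None] - 70) + 0.005 * (education[:, None] - 12)
            + rng.normal(0.0, 0.1, (n_cn, n_features)))

beta, resid_sd = fit_w_score_model(thick_cn, cov_cn)
w_cn = w_scores(thick_cn, cov_cn, beta, resid_sd)

# One hypothetical patient (72 years, 9 years of education) with thin cortex.
w_patient = w_scores(np.full((1, n_features), 2.4), np.array([[72.0, 9.0]]), beta, resid_sd)
print(w_patient.round(1))
```

With this sign convention, negative w-scores indicate cortex thinner than expected for the subject's age and education; some implementations flip the sign so that larger values mean more atrophy.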

After training the group classifier, the AD-specific pattern similarity measure was then calculated on an individual subject basis. As shown in Figure 3C, the noise-filtered cortical thickness data of an individual subject was transformed to PCA space using the pre-trained PCA axes. Similarly, the feature vector in PCA space was also mapped onto a single point in LDA space using the pre-trained LDA matrix. Finally, we measured the AD-specific similarity of the cortical atrophy pattern for an individual subject based on the distance between each subject’s mapped point and the mean value of the AD group in LDA space. A higher AD-specific atrophy similarity measure indicates that a subject’s brain atrophy pattern is more similar to the representative pattern of the AD group (Fig. 4).
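The projection and scoring steps can be sketched on synthetic data. PCA via singular value decomposition and a two-class Fisher discriminant are standard techniques, but the scoring function here is an assumption: the text states only that the measure is based on the distance to the AD group mean in LDA space, so the 0 to 100 scaling below is a hypothetical choice, not the published formula.

```python
import numpy as np

def fit_pca(x, n_components):
    """Return the feature mean and the leading principal axes (features x components)."""
    mean = x.mean(axis=0)
    _, _, vt = np.linalg.svd(x - mean, full_matrices=False)
    return mean, vt[:n_components].T

def fit_lda_axis(z, labels):
    """Two-class Fisher discriminant axis in PCA space (labels: 0 = CN, 1 = AD)."""
    z_cn, z_ad = z[labels == 0], z[labels == 1]
    within = (np.cov(z_cn, rowvar=False) * (len(z_cn) - 1)
              + np.cov(z_ad, rowvar=False) * (len(z_ad) - 1))
    axis = np.linalg.solve(within, z_ad.mean(axis=0) - z_cn.mean(axis=0))
    return axis / np.linalg.norm(axis)

def ad_similarity(x_new, mean, pca_axes, lda_axis, cn_center, ad_center):
    """Hypothetical score: 100 at the AD group mean, falling linearly to 0 once the
    distance from the AD mean reaches the CN-to-AD separation."""
    point = (x_new - mean) @ pca_axes @ lda_axis   # PCA space, then LDA axis
    dist = np.abs(point - ad_center)
    return 100.0 * np.clip(1.0 - dist / abs(ad_center - cn_center), 0.0, 1.0)

# Synthetic stand-in features: AD rows are shifted downward in the first five features.
rng = np.random.default_rng(0)
x_cn = rng.normal(0.0, 1.0, size=(120, 20))
x_ad = rng.normal(0.0, 1.0, size=(80, 20))
x_ad[:, :5] -= 1.5
x = np.vstack([x_cn, x_ad])
labels = np.array([0] * 120 + [1] * 80)

# Train the group classifier: PCA followed by LDA.
mean, pca_axes = fit_pca(x, n_components=5)
z = (x - mean) @ pca_axes
lda_axis = fit_lda_axis(z, labels)
proj = z @ lda_axis
cn_center, ad_center = proj[labels == 0].mean(), proj[labels == 1].mean()

# Score every subject against the AD group mean.
scores = ad_similarity(x, mean, pca_axes, lda_axis, cn_center, ad_center)
print(scores[labels == 0].mean(), scores[labels == 1].mean())
```

On this synthetic data the mean score of the AD rows is higher than that of the CN rows, which is the qualitative behaviour described for the measure.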

Figure 4

Examples of the AD-specific atrophy similarity measure at the individual level. The AD-specific atrophy similarity scores differed between Case #96 - CN (left, 3.7) and Case #1256 - AD (right, 91.6). The standardized value (Z-score) maps were computed to visualize the AD-specific atrophy similarity. Positive Z-scores (red) indicate brain regions that are similar to the AD-specific patterns of atrophy. AD = Alzheimer’s disease; CN = cognitively normal; MMSE = mini-mental state examination.

In order to evaluate group classification performance, we performed a 10-fold cross-validation procedure. In each fold, we randomly partitioned all participants into a training set (90%) and a test set (10%). After training the classifier with the training data, we assessed the accuracy, sensitivity and specificity of each classification with the test data. In addition, we validated the AD-specific atrophy similarity measure in two longitudinal cohorts of patients with aMCI and AD. Specifically, we applied the longitudinal pipeline of FreeSurfer to our longitudinal cohort data. As the FreeSurfer longitudinal pipeline is designed to be unbiased with respect to any particular time point, we did not initialize it with information from a specific time point. Instead, a template was created using information from all available time points. This template can be regarded as an initial guess for segmentation of brain regions and surface reconstruction. The longitudinal pipeline consists of three steps42: cross-sectional image processing, individual template construction, and longitudinal alignment. For the template surface, the FreeSurfer-provided fsaverage was used, and the smoothing process was applied in the same way as in the previous work4. We manually checked the results of every step and corrected any error that occurred during the surface construction step.
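A minimal sketch of the 10-fold partitioning and of the two group-wise metrics is given below. It uses only the Python standard library; the function names and the random seed are illustrative, and the total of 1,342 participants (473 AD and 869 CN) is taken from the text.

```python
import random

def k_fold_indices(n_samples, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs; every sample is tested exactly once."""
    order = list(range(n_samples))
    random.Random(seed).shuffle(order)
    folds = [order[i::k] for i in range(k)]          # k nearly equal folds
    for i, test_idx in enumerate(folds):
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx

def sensitivity_specificity(y_true, y_pred):
    """Labels: 1 = AD, 0 = CN."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

splits = list(k_fold_indices(1342, k=10))
print([len(test) for _, test in splits])             # fold sizes of 134 or 135
print(sensitivity_specificity([1, 1, 0, 0], [1, 0, 0, 0]))
```

Each fold holds out roughly 10% of the participants for testing and trains on the remaining 90%.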

Statistical analyses

Continuous variables were presented as means ± SD and were compared using Student’s t-test. Categorical variables were compared using the Chi-square test or Fisher’s exact test. To examine how longitudinal changes in neuropsychological test performance over time were associated with AD-specific atrophy similarity in patients with aMCI and AD, we performed linear mixed effects modeling within each cohort using AD-specific atrophy similarity, time, and the interaction term between AD-specific atrophy similarity and time (AD-specific atrophy similarity by time) as fixed effects and patient as a random effect. In addition, to determine whether there were significant differences in the AD-specific similarity over time between the groups defined by clinical progression in patients with aMCI and AD, we also performed linear mixed effects modeling within each cohort using group, time, and the interaction term between group and time (group by time) as fixed effects and patient as a random effect. Statistical significance was set at p < 0.05 in two-tailed tests. Statistical analyses were performed using SPSS version 20.0 (SPSS Inc., Chicago, IL, USA).
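For reference, the two mixed effects models described above can be written out explicitly. The notation is introduced here for illustration only, and the random effect is assumed to be a patient-specific random intercept, which the text does not state. For patient i at visit j, let y denote a neuropsychological test score, S the AD-specific atrophy similarity, t the time since baseline, and G the group indicator (converter vs. non-converter, or fast- vs. slow-decliner).

```latex
% Model 1: cognitive decline as a function of atrophy similarity
y_{ij} = \beta_0 + \beta_1 S_{ij} + \beta_2 t_{ij} + \beta_3 \, (S_{ij} \times t_{ij}) + b_i + \varepsilon_{ij}

% Model 2: change in atrophy similarity by clinical progression group
S_{ij} = \gamma_0 + \gamma_1 G_i + \gamma_2 t_{ij} + \gamma_3 \, (G_i \times t_{ij}) + u_i + e_{ij}

% Assumed random intercepts and residual errors
b_i \sim \mathcal{N}(0, \sigma_b^2), \quad u_i \sim \mathcal{N}(0, \sigma_u^2), \quad
\varepsilon_{ij} \sim \mathcal{N}(0, \sigma_\varepsilon^2), \quad e_{ij} \sim \mathcal{N}(0, \sigma_e^2)
```

In this notation, the similarity-by-time interactions correspond to the coefficient of the product term in Model 1, and the group-by-time interactions to the coefficient of the product term in Model 2. Whether S in Model 1 was entered as a baseline or a time-varying value is not specified in the text.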