Predicting Short-term MCI-to-AD Progression Using Imaging, CSF, Genetic Factors, Cognitive Resilience, and Demographics

Varatharajah, Yogatheesan; Ramanan, Vijay K.; Iyer, Ravishankar; Vemuri, Prashanthi

doi:10.1038/s41598-019-38793-3

Download PDF

Article
Open access
Published: 19 February 2019

Predicting Short-term MCI-to-AD Progression Using Imaging, CSF, Genetic Factors, Cognitive Resilience, and Demographics

Yogatheesan Varatharajah¹,
Vijay K. Ramanan²,
Ravishankar Iyer¹ &
Prashanthi Vemuri²
For the Alzheimer’s Disease Neuroimaging Initiative

Scientific Reports volume 9, Article number: 2235 (2019) Cite this article

11k Accesses
50 Citations
25 Altmetric
Metrics details

Subjects

Abstract

In the Alzheimer’s disease (AD) continuum, the prodromal state of mild cognitive impairment (MCI) precedes AD dementia and identifying MCI individuals at risk of progression is important for clinical management. Our goal was to develop generalizable multivariate models that integrate high-dimensional data (multimodal neuroimaging and cerebrospinal fluid biomarkers, genetic factors, and measures of cognitive resilience) for identification of MCI individuals who progress to AD within 3 years. Our main findings were i) we were able to build generalizable models with clinically relevant accuracy (~93%) for identifying MCI individuals who progress to AD within 3 years; ii) markers of AD pathophysiology (amyloid, tau, neuronal injury) accounted for large shares of the variance in predicting progression; iii) our methodology allowed us to discover that expression of CR1 (complement receptor 1), an AD susceptibility gene involved in immune pathways, uniquely added independent predictive value. This work highlights the value of optimized machine learning approaches for analyzing multimodal patient information for making predictive assessments.

Individualized prognosis of cognitive decline and dementia in mild cognitive impairment based on plasma biomarker combinations

Article 30 November 2020

A multipredictor model to predict the conversion of mild cognitive impairment to Alzheimer’s disease by using a predictive nomogram

Article Open access 21 October 2019

Machine learning-based prediction of longitudinal cognitive decline in early Parkinson’s disease using multimodal features

Article Open access 14 August 2023

Introduction

Alzheimer’s disease (AD) affects approximately 5.5 million people in the United States and more than 30 million people around the world, and imposes substantial personal and societal burdens¹. Typically, AD progresses through a preclinical phase with underlying biomarker abnormalities, then a prodromal state of mild cognitive impairment (MCI), and finally frank AD dementia². Annually, 10–15% of patients diagnosed with MCI progress to AD dementia³. Identification of factors contributing to progression from MCI to AD is crucial for clinical prognostication and risk stratification to guide counseling and selection of potential treatments.

In the last decade, biomarkers from cerebrospinal fluid (CSF), positron emission tomography (PET), and magnetic resonance imaging (MRI) have been increasingly used in AD clinical and research studies to assess the degree of AD related pathology. Increased amyloid pathology measured by decreased CSF Aβ42 and increased cerebral amyloid on PET, as well as increased neuronal injury assessed by increased CSF tau, hypometabolism on FDG-PET, and atrophy on structural MRI, are important factors in assessing the degree of brain changes due to AD pathology and as a surrogate for prediction of progression in individuals with MCI². In addition^4,5,6, clinical measures such as Mini-Mental State Examination (MMSE) and Alzheimer’s Disease Assessment Scale-Cognition (ADAS-Cog) which reflect the current level of impairment in individuals have been shown to be useful for prediction of MCI progression^4,5,6.

Two additional important factors in the context of MCI progression to AD are genetic factors and cognitive resilience. AD is a genetically complex disorder, with susceptibility thought to reflect the collective influences of multiple genetic risk and protective factors⁷. Other than the apolipoprotein E (APOE) ε4 allele, individual genetic variants associated with AD have shown modest population-level effect sizes, in keeping with current hypotheses about the genetic architecture of the disease⁸. Although the heritability of AD is thought to be 60–80%, beyond the well-studied effect of the APOE ε4 allele⁹, relatively little is known about the genetic factors specifically related to MCI-to-AD progression¹⁰, particularly regarding their added value above known biomarker profiles. Cognitive resilience represents the ability of an individual to delay the deleterious effects of neurodegenerative pathologies on onset of cognitive symptoms¹¹. While cognitive resilience has been widely used to explain the pathology-cognition disconnect in cognitively unimpaired individuals with AD pathology (and normal cognition), its relative influence specifically on MCI-to-AD progression has not been fully evaluated¹². Furthermore, the complex relationships among the well-known AD biomarkers, cognitive resilience, and genetic factors remain largely unknown and estimating the independent predictive values provided by each of these factors may open the door to alternative strategies to delay or prevent the onset of dementia.

Several machine learning (ML) based approaches have been proposed for predicting MCI-to-AD progression^4,5,13,14 and clinical stage classification¹⁵ utilizing high-dimensional clinical and biomarker data among the potential predictors. A limitation of the ML-based approaches that utilize high-dimensional data is the potential for overfitting, such that the classifiers are so optimally trained to fit the primary dataset that they perform poorly on previously unseen test data, ultimately limiting the generalizability and wider interpretability¹⁵ of the predictive model. This is particularly important in this study because both a) an unbiased estimation of the contributions of cognitive resilience and genetic factors in predicting MCI-to-AD progression and b) the successful clinical translation of this technique, would require a generalizable model. However, prior studies on the predictability of MCI-to-AD progression have not performed satisfactory assessments of generalizability^4,5,13,14. These approaches utilize non-linear classifications methods such as multi-kernel learning and artificial neural networks without establishing the need for non-linear models. Non-linear classification models are more susceptible to overfitting compared to linear classification models in problems involving high-dimensional data since they optimize relatively higher number of model parameters¹⁶. Whereas, a pitfall associated with linear models is that they provide suboptimal prediction performance when the data is not linearly separable. Therefore, our goals in this study with respect to the development of a predictive model are a) to understand whether linear classifiers are sufficient to provide clinically relevant accuracies in predicting MCI-to-AD progression based on a set of features derived from multiple modalities and b) to develop an analytical method that evaluates the generalizability of different classifiers and to aid model selection in clinical settings.

In this study, we used a machine learning-based approach and data from a well-characterized clinical cohort to identify individuals with MCI who rapidly progress to AD versus those with a more protracted course. Specifically, using a set of features derived from the ADNI multimodal biomarker and clinical dataset along with genetic factors and cognitive resilience measures, a linear model-based ML framework was developed for predicting MCI-to-AD progression. Based on a test of linear-separability¹⁷, we show that linear classification models perform just as good as non-linear models in this setting and a support vector machine classifier with a linear kernel provides the most generalizable performance with a test set area under ROC curve (AUC) value of 0.93. Using this framework, we evaluated the predictive values of previously underexplored factors of cognitive resilience and genetic factors and gauged their relative importance compared to commonly used CSF and imaging predictors. We found that, while the markers of AD pathophysiology (amyloid, tau, and neuronal injury) provided very high predictive values, the genetic factors and brain regions associated with cognitive resilience also displayed independent predictive values.

Materials and Methods

Study participants

This report utilized data from the Alzheimer’s Disease Neuroimaging Initiative (ADNI), a multisite longitudinal study of older adults representing clinical stages along the continuum from normal aging to AD¹⁸. All study participants provided written informed consent, and study protocols were approved by each local site’s institutional review board. Further information about ADNI, including full study protocols, complete inclusion and exclusion criteria, and data collection and availability can be found at http://www.adni-info.org/. All methods as stated on the website were performed with the relevant guidelines and regulations. Since all the analyses were performed on de-identified ADNI data which is publically available for download, IRB Review was not required. In addition, all methods were carried out in accordance with the approved guidelines.

In choosing ADNI participants to study, we used these inclusion criteria: i) the person had at least three years of follow-up; ii) the person had all the data modalities of interest (as specified below); and iii) the person was diagnosed with MCI at the baseline evaluation. We identified 135 participants who met those criteria. A total of 39 of the 135 progressed to AD within three years of the MCI diagnosis (and are referred to as the MCI-P group); the remaining 96 progressed to AD after three years or remained in MCI until their last follow-up after three years (and are referred to as the MCI-NP group).

Predictive factors

Figure 1 illustrates potential factors that may impact progression from MCI to AD. Below we describe the specifics of the individual biomarkers utilized in this work.

All the biomarkers utilized in this work were downloaded from the LONI data archive (https://ida.loni.usc.edu/) where the pre-processed ADNI data are hosted.

Cerebrospinal fluid (CSF) biomarkers

CSF levels of amyloid beta (Aβ) and total (T-tau) and phosphorylated (P-tau) tau proteins were assayed by the ADNI Biomarker Core as previously described¹⁹.

Magnetic resonance imaging biomarkers

Structural MRI (SMRI) scans at baseline were downloaded and processed as described previously²⁰. FreeSurfer v5.1 was used to obtain volume and thickness measures for standard regions of interest (ROIs) as surrogates for cerebral atrophy. We scaled the volumes by total intracranial volume. In addition, we also included volumetric measurements of hippocampal subfields.

Positron emission tomography (PET) biomarkers

Fluorodeoxyglucose (FDG) and F-florbetapir PET imaging from the baseline visit were analyzed as surrogates for neuronal injury and amyloid pathology, respectively¹⁸. For FDG-PET, AD-specific ROIs representing the temporal, angular, and posterior cingulate gyri were utilized²¹. For F-florbetapir PET, we included regional amyloid deposition assessed by standardized uptake value ratio (SUVR) in the temporal, parietal, and cingulate cortex, as well as a composite global measure of multiple regions²².

Cognition

Scores on the Mini-Mental State Examination (MMSE)²³ at baseline were utilized as measures of cognitive performance.

Cognitive resilience

The number of errors on the American National Adult Reading Test (ANART) (which is an estimate of pre-morbid verbal IQ) and years of education were utilized as surrogate measures of cognitive resilience.

Genetic factors

Genotype and gene expression data from peripheral blood samples in ADNI were obtained as previously described²⁴. In this study, we specifically analyzed APOE ε4 allele status (carrier vs. non-carrier) as well as expression data for the top genes with validated associations to AD (APOE, BIN1, CLU, ABCA7, CR1, PICALM, MS4A6A, CD33, and CD2AP) as listed in the AlzGene database²⁵.

Demographics

Gender and age at the baseline visit were utilized in the predictive model.

Data aggregation and ML preprocessing

A total of 94 potential predictive factors were included for analysis. A matrix was generated with 135 rows (representing study participants) and 94 columns (representing the potential predictive factors for MCI-to-AD progression). Prior to further analysis, we centered and standardized all data on a feature-by-feature basis by subtracting the mean and then dividing by the standard deviation.

ML-based prediction framework

Figure 2 shows a flow diagram of the prediction framework developed for this study. The framework consists of four major steps: information-theoretic feature selection, classifiers and hyper-parameter optimization, goodness-of-fit evaluation, and generalized performance evaluation. They are explained in the following paragraphs. A unique aspect of this workflow is the ability to specify the number of parameters optimized in the classification model by using an information-theoretic feature selection method. This is particularly useful in assessing the generalizability of a classifier as a function of the number of model parameters.

Feature reduction using joint mutual information (JMI)

Mutual information between two random variables quantifies the amount of information shared between them. Mutual information is a more comprehensive measure of the relationships between random variables than statistical correlation-based approaches, which measure linear relationships only. Mathematically, mutual information is denoted by $I(Q;R)$ and defined for discrete random variables Q and R as shown in Eq. 1, where ${\bf{Q}}$ and ${\boldsymbol{ {\mathcal R} }}$ denote the alphabets of Q and R, respectively.

$$I(Q;R)={{\rm{\Sigma }}}_{q\in {\mathscr{Q}}}{{\rm{\Sigma }}}_{r\in {\mathcal R} }p(q,r)log\frac{p(q,r)}{p(q)p(r)}$$

(1)

When F_k is one of the attributes in a set of attributes $\{{F}_{1},{F}_{2},\ldots {F}_{k}\}$ and Y is an outcome or class that can be predicted by the attributes, mutual-information-based approaches can be used to select the most predictive attributes. One such approach is to treat the attributes as independent random variables, rank them in descending order based on their mutual information with respect to the outcome Y, and select the top n number of attributes. One limitation of that approach is that useful and parsimonious sets of features should be both (i) individually relevant, and (ii) not highly correlated with each other. Joint mutual information shared between $\{{F}_{1},{F}_{2},\ldots {F}_{k}\}$ and Y is defined as shown in Eq. 2, where F_k and Y denote the alphabets of F_k and Y, respectively.

$$\begin{array}{ccc}I({F}_{1},{F}_{2},\ldots {F}_{k};Y) & = & {{\rm{\Sigma }}}_{{f}_{1}\in {{\rm{F}}}_{1}}{{\rm{\Sigma }}}_{{f}_{2}\in {{\rm{F}}}_{2}}\ldots {{\rm{\Sigma }}}_{{f}_{k}\in {{\rm{F}}}_{k}}{{\rm{\Sigma }}}_{y\in {\rm{Y}}}p({f}_{1},{f}_{2},\ldots {f}_{k},y)log\frac{p({f}_{1},{f}_{2},\ldots {f}_{k},y)}{p({f}_{1},{f}_{2},\ldots {f}_{k})p(y)}\end{array}$$

(2)

A JMI-based feature selection method starts with an empty set of attributes and iteratively adds F_is that, when added, provide the maximum increase in the joint mutual information shared between the set of attributes and the outcome²⁶. It has been shown to be the most stable and flexible feature selection method²⁷ among all the information-theoretic feature selection methods developed to date.

Classification methods

We evaluated a number of classifiers in this study to understand the concepts of linear separability and generalizability in the context of predicting MCI-to-AD progression. Support vector machine (SVM), multiple kernel learning (MKL), and generalized linear models (GLM) with elastic-net regularization are the classifiers used in this study. SVM classifier, because it allows the transformation of features using linear and non-linear kernels, provides us the ability to evaluate the suitability of linear and non-linear classifiers for this problem. On the other hand, MKL allows the application of different kernel transformations for features from different modalities while optimizing more model-parameters than an SVM classifier. Although MKL and SVM are similar classification paradigms, MKL facilitates the integration of multiple modalities at the expense of potential overfitting. Finally, GLM classification with elastic-net regularization is an extension of the commonly known logistic regression classifier with additional regularization to minimize overfitting. In this study, we evaluated the overall generalizability of these classifiers that have distinct optimization objectives.

A support vector machine is a binary classifier that finds the maximum margin hyperplane that separates the two classes in the data²⁸. Suppose that the data being classified are denoted by $X\in {{\mathbb{R}}}^{N\times P}$ (N subjects and P features), and that the data from subject i are denoted by ${X}^{(i)}\in {{\mathbb{R}}}^{P}$. Let us also use $Y\in {\{-1,1\}}^{N}$ to denote the class labels for all the subjects (where $-1$ and $1$ are numerical labels for the two classes), and ${Y}^{(i)}\in \{-1,1\}$ to denote the class label for subject i. The optimization problem to find the optimal hyperplane (described by weights $W\in {{\mathbb{R}}}^{P}$ and intercept term $b\in {\mathbb{R}}$) is shown in Eq. 3.

$$\begin{array}{c}{{\rm{\min }}}_{{\rm{w}},{\rm{b}}}\frac{1}{2}\parallel W{\parallel }^{2}\\ {\rm{Subject}}\,{\rm{to}}\,{Y}^{(i)}({W}^{T}{X}^{(i)}+b)\ge 1,\,i\in \{1,\ldots ,N\}\end{array}$$

(3)

Once the optimal hyperplane $[{W}_{opt},{b}_{opt}]$ has been found, the predicted class label for subject i is obtained as the sign of ${W}_{opt}^{T}{X}^{(i)}+{b}_{opt}$. This formulation assumes that the data are fully linearly separable between the two classes. When that is not the case (but there is still a linear classification task), slack variables and a tolerance parameter (box-constraint) can be introduced to obtain separating hyperplanes that tolerate small misclassification errors.

Dual formulation of SVM has received considerable interest due to its ability to use different kernel transformations of the original feature space without altering the optimization task, and due to its advantages in complexity when the data are high-dimensional^29,30. The dual form of the above optimization problem can be written as shown in Eq. 4, where the operation $\langle .\rangle $ denotes an inner product between two vectors.

$$\begin{array}{c}{\max }_{{\rm{\alpha }}}{{\rm{\Sigma }}}_{i}{\alpha }_{i}-\frac{1}{2}{{\rm{\Sigma }}}_{i,j}\,{\alpha }_{i}\,{\alpha }_{j}{Y}^{(i)}{Y}^{(j)}\langle {X}^{(i)},\,{X}^{(j)}\rangle \\ {\rm{subject}}\,{\rm{to}}\,{{\rm{\Sigma }}}_{i}{\alpha }_{i}{Y}^{(i)}=0\,and\,{\alpha }_{i}\ge 0\,{\alpha }_{i}\ge 0\,i\in \{1,\ldots ,N\}\end{array}$$

(4)

Notably, the dual formulation can be expressed simply in terms of the inner product between ${X}^{(i)}$ and ${X}^{(j)}\,$(which is a scalar value). Therefore, any transformation of the features can replace the original features, i.e., $\langle {X}^{(i)},\,{X}^{(j)}\rangle $ can be replaced by $\langle \varphi ({X}^{(i)}),\,\varphi ({X}^{(j)})\rangle $, where $\varphi $ is a feature mapping. Furthermore, for any such feature transformation $\varphi $, we can define a kernel function K such that $\langle \varphi ({X}^{(i)}),\,\varphi ({X}^{(j)})\rangle =K({X}^{(i)},\,{X}^{(j)})$. With the kernel transformation, the cases where the original data are not linearly separable, may be solved as transforming the data to higher dimensions may introduce linear separation in the transformed domain. Linear, radial basis function (RBF), and polynomial kernels are widely used kernels in this context.

Multiple kernel learning is a classification technique that builds upon the above property of SVM and introduces additional variables in order to weight the kernel transformation of³¹ individual features. It does so by replacing the kernel function $K({X}^{(i)},{X}^{(j)})$ with $\sum _{m=1}^{P}{\beta }_{m}{K}_{m}({X}^{(i)}(m),{X}^{(j)}(m))$, where ${K}_{m}\,$is the kernel function of the m^th feature and ${\beta }_{m}$ is its weight. This technique is particularly useful when the data contain different modalities of features that need to be weighted differently. In a standard kernel-SVM, either the kernel transformation is uniformly weighted, or the weights are determined manually. It has been shown that the same dual formulation of SVM can be utilized to find the optimal weightings of the kernel transformation under the constraint that ${{\rm{\Sigma }}}_{m=1}^{{\rm{P}}}\,{\beta }_{m}=1$.

Logistic regression is a subclass of generalized linear models (GLM) and is well-suited for binary classification tasks³². It models the response variable as a binomially distributed random variable whose parameters are described by predictor variables and model parameters. Regularization of the model parameters is utilized to avoid effects related to overfitting. The objective function of a regularized GLM model is

$${\min }_{{\rm{\beta }},{{\rm{\beta }}}_{0}}\frac{1}{N}{{\rm{\Sigma }}}_{I=1}^{{\rm{N}}}[{Y}^{(i)}\,\mathrm{log}\,h({X}^{(i)})+(1-{Y}^{(i)})\mathrm{log}(1-h({X}^{(i)}))]+\lambda {P}_{\alpha }(\beta )$$

(5)

where $h(x)$ is the logistic function defined as $\frac{1}{1+{e}^{-x}}$ and ${P}_{\alpha }(\beta )$ is the regularization term with the elasticity parameter $\alpha $. The regularization term in general might contain both ${L}_{1}$-norm terms and ${L}_{2}$-norm terms, a situation that is commonly referred to as elastic net regularization with ${P}_{\alpha }(\beta )=\frac{1-\alpha }{2}{\Vert \beta \Vert }_{2}^{2}+\alpha {\Vert \beta \Vert }_{1}$³³. The elasticity parameter $\alpha $ ranges in $(0,1]$ with values of α → 0 approaching ridge regression and α → 1 approaching LASSO regression.

Hyper-parameter optimization

The hyper-parameters of the three classifiers are SVM’s kernel, kernel scale σ, and box-constraint $C$, MKL’s kernel, and GLM’s penalty term $\lambda $ and elastic-net parameter $\alpha $. The optimal values of hyper-parameters are chosen by performing a grid-search of parameter values with a k-fold cross-validation within the training dataset.

Goodness-of-fit evaluation and generalized performance metrics

Goodness of fit of a classifier is evaluated by i) predicting the classes of the test dataset by using the classifier that was trained on the training dataset, and ii) comparing the predictions against the true class labels of the test dataset. The comparison is performed using standard performance metrics such as receiver operating characteristics (ROC) curve analysis, area under ROC curve (AUC), sensitivity, specificity, accuracy, precision, recall, and F1-score.

Because of heterogeneity in the data, choosing one partition of training and test datasets is not sufficient to credibly evaluate the performance of a classifier. A common practice to obviate the effect of heterogeneity in the data is to perform an k-fold training-testing cross-validation of the dataset. One run of this procedure is carried out by randomly choosing a subset of the dataset as training data, and testing on the rest of the dataset. That is repeated $n$ different times, and the performance metrics of all $n$ evaluations are averaged. In addition to this, the proportion of the different classes in the randomly chosen training dataset was kept constant in our approach via a stratified data-partitioning to eliminate any variability in the performance of the classifiers due to class imbalances in the data¹⁶. We performed those extra steps to obtain a generalizable set of performance metrics for the analyzed classifier.

Implementation of the prediction framework

The prediction framework was implemented in MATLAB version R2017b and the code is publicly made available at https://gitlab.engr.illinois.edu/varatha2/adni_mci2ad_prediction. We utilized a standard fivefold cross-validation with 80% training data and 20% testing data to evaluate the performance of the classifiers. Training and testing data selection was performed using the stratified data-partitioning approach explained previously. The JMI criterion was used to identify $n\in \{5,10,\ldots ,\,90,\,94\}\,$features based on the training dataset and corresponding labels. Using the reduced dataset, we trained SVM with a linear kernel, SVM with an RBF kernel, MKL with a linear kernel, LR with elastic net regularization, and RF. The highest average AUC obtained with a tenfold cross-validation within the training set was used as the selection criterion for the optimal hyper-parameters. We generated performance metrics for each classifier with the best identified hyper-parameters by predicting the labels of the test set and comparing them against the true class labels of the test set. The performance metrics of all five of the (outer) cross-validation runs were averaged to generate generalized performance metrics.

Test for linear separability

Linear separability of a dataset with two classes signifies that a linear classifier (or a hyperplane) can separate the two classes in the data as well as nonlinear classifiers can. We show that by using a slight modification of the histogram of projections¹⁷ method, which is considered a test for class separability using SVM classification. The histogram of projections method¹⁷, is as follows. An SVM classifier is trained using the training data and a chosen kernel. Then, the test samples are projected on the 1-dimensional line perpendicular to the maximum margin hyperplane learned by the SVM model. Histograms of these projections for the two different classes are compared to evaluate the degree of separation achieved by the classifier. In our approach, we applied a sigmoid transformation to the projections of the test samples to obtain likelihood probabilities, and then, based on the likelihood probabilities, we created histograms of the two classes. This modification was performed in order to eliminate the effects of scaling when different kernel transformations are applied in SVM classification and to maintain the range of histograms within a closed set $[0,1]$.

Results

Our results are organized as follows. Table 1 represents the overall summary statistics of the dataset analyzed in this study with some important demographic factors. First, we present the results of our evaluation of whether linear models are sufficient to classify this dataset. Second, we report the analytic method we developed to evaluate generalizability and the results on the generalizability of the analyzed linear classifiers. Third, we report the relative importance of the biomarkers and clinical variables utilized in this study evaluated using the most generalizable classifier selected based on the previous result. Finally, we report the analysis based on LASSO regression³⁴ and a set of identified features that provide independent information in predicting MCI-to-AD progression.

Table 1 Summary characteristics of participants at baseline.

Full size table

Linear separability

First, we evaluated the linear separability of the data in order to understand whether linear classifiers, which are less susceptible to overfitting, were sufficient to perform prediction of MCI-to-AD progression. We applied the modified histogram of projections method to our dataset by training two SVM classifiers with linear and RBF kernels and by plotting histograms of sigmoid transformations of the projections of test samples.

Figure 3A shows a generic illustration of our approach based on the histogram of projections method. Figure 3B,C show the histograms obtained using our approach for linear and RBF kernels, respectively. The two histograms have similar shapes, and the misclassification errors obtained by choosing an appropriate threshold are also close (20.52% for the linear kernel and 19.78% for the RBF kernel). Figure 3D shows a grouped scatter plot of the probabilities obtained using linear versus RBF kernels for the MCI-P and MCI-NP classes. There is a significantly high correlation between the probabilities obtained using the two kernels ($\rho $ = 0.99, p < 1e-6), indicating that the classification boundaries learned using the linear and nonlinear kernels are similar. This experiment suggests that the dataset being analyzed is linearly separable. Hence, we limited our analyses to linear classification techniques for the rest of this paper.

Generalizability of classifiers

Second, we evaluated the overfitting characteristics of the linear classifiers in order to identify models that were likely to provide highly generalizable performance on this dataset. Three linear classifiers—multiple kernel learning (MKL) with linear kernels, support vector machine (SVM) with linear kernel, and generalized linear model (GLM) with elastic-net regularization—were trained using 80% of the whole dataset as the training set but using only a subset of all the features. We selected the subset (out of a total of 94 features) using the joint mutual information (JMI) criterion, by varying the number of features used each time as {5, 10, 15, …, 85, 90, 94}. We utilized tenfold stratified cross-validation to obtain average training and testing area under ROC curve (AUC) metrics and their respective standard deviations on each occasion. Figure 4 shows plots of the cross-validated AUCs with their standard deviations against the ratio $\frac{{\rm{number}}\,{\rm{of}}\,{\rm{features}}\,{\rm{used}}\,{\rm{in}}\,{\rm{training}}}{{\rm{number}}\,{\rm{of}}\,{\rm{training}}\,{\rm{samples}}}$ for both training (4A) and test (4B) sets. On the training set, the AUCs of all the classifiers showed better performance with increasing numbers of features used in training. But on the test set, only SVM and GLM showed a relatively steady trend in the AUC. MKL, on the other hand, showed a decreasing testing AUC trend with increasing numbers of features used in training. Those data suggest that the MKL classifier may be overfitting the training data when the ratio $\frac{{\rm{number}}\,{\rm{of}}\,{\rm{features}}\,{\rm{used}}\,{\rm{in}}\,{\rm{training}}}{{\rm{number}}\,{\rm{of}}\,{\rm{training}}\,{\rm{samples}}}$ is not small (it is >0.14 in this case).

Those results suggest that both SVM with a linear kernel and GLM with elastic-net regularization have good generalizability (with a consistent AUC value of approximately 0.9) regardless of the ratio of the number of features to the number of training samples, and MKL had good generalizable performance only when this ratio was small. Further, particularly for this dataset, all linear classifiers showed a reasonable test set predictability when the number of features was appropriately chosen during the training phase.

Prediction performance of linear classifiers

Based on the previous experiment, we selected the subset of features that provided the most generalizable predictive performance for each linear classifier. Table 2 lists all the cross-validated goodness-of-fit metrics (AUC, sensitivity, specificity, accuracy, precision, recall, and F1-score) obtained using MKL with linear kernels, SVM with a linear kernel and GLM with elastic-net regularization, with their respective subsets of features. All linear classifiers produced comparable results, while SVM with a linear kernel provided the best result.

Table 2 Cross-validated goodness-of-fit metrics for linear classifiers.

Full size table

Predictive ability of individual modalities

To understand the predictive ability of individual modalities in a statistically impartial manner, we restricted our analysis in this subsection to only one linear classifier. Based on Table 2, we chose the SVM classifier with a linear kernel, as it provided the best performance in terms of the AUC metric. Then, we repeated our classification procedure using a stratified tenfold cross-validation for each of the modalities (using all the variables in the respective modality each time). Figure 5A shows a bar-chart of the average AUCs obtained by individual modalities. CSF proteomic markers (including Aβ, total-Tau, and phosphorylated-Tau) provided the best individual prediction capability, followed by the imaging markers in the order amyloid PET, FDG-PET, and SMRI. Gene expression, clinical, cognitive resilience, and demographic markers showed lower predictive ability than did CSF and imaging-based markers. Figure 5B shows a different bar-chart of the average AUCs obtained by iteratively removing the respective features of the individual modalities in a descending order based on their individual predictive abilities seen in Fig. 5A. A gradual decline in the AUC is observed, with the largest drop occurring when SMRI features were removed.

Best predictors of MCI-to-AD progression

Next, we sought to identify a minimal set of predictors required to model MCI-to-AD progression. This was motivated by an observation about Fig. 5: although CSF features provided the best individual AUC (Fig. 5A), the removal of CSF features from the data (Fig. 5B) did not result in a substantial drop in the AUC (the reduction in AUC is within the error limits). That suggests that other modalities provide overlapping information that may be correlated with the CSF markers, and also indicates that all the modalities might have a notable level of correlation with each other. We used a generalized linear model (GLM) classification method with L1 regularization (known as LASSO regression) to identify a sparse set of features with minimal within-correlation and maximal prediction potential of clinical progression³⁴. We chose to use LASSO regression instead of SVM classification for this task because the feature weights assigned by the SVM classifier with a linear kernel do not represent the independent predictive values of individual features when the features themselves are correlated with each other. However, the L1-norm-based regularization of LASSO regression may enable identification of the smallest possible set of predictors, since it penalizes the classifiers that utilize a large number of features in the resulting predictive function. Table 3 shows the results from this approach, identifying the factors that provided the best spread of independent information to predict MCI-to-AD progression. This model included PET, MRI, and CSF variables in addition to age and expression of CR1 (complement receptor 1) and was able to predict MCI-to-AD progression with an AUC of 0.92.

Table 3 A minimal set of markers that are predictive of MCI-to-AD progression. A generalized linear model (GLM) classifier with L1 regularization was utilized to identify a small set of minimally correlated features that can predict MCI-to-AD progression.

Full size table

Discussion

In this work, we developed an accurate and generalizable machine learning-based methodology for predicting short-term progression of MCI to AD dementia in a well-characterized clinical cohort. A unique aspect of this work was our evaluation of important genetic markers and cognitive resilience markers in addition to the neuroimaging biomarkers and clinical examination. The results suggest that a combination of selected neuroimaging, blood and CSF biomarkers, and demographic traits reflect the underlying pathophysiology and factors that drive clinical progression in the AD spectrum. The methodology allowed us to discover that expression of CR1 and AD-pattern neuropathology and neurodegeneration (MRI measures in the frontal lobes) in brain regions associated with cognitive resilience added independent predictive values in predicting MCI-to-AD progression. We also evaluated the relative importance of individual predictors and identified a minimal set of predictors that are important for predictive modeling of MCI progression to AD.

Predictors of MCI progression to AD

The broad pathogenesis of AD has been well-described conceptually, from initial alterations in molecular and cellular pathology to neurodegeneration and eventually to clinical impairment sufficient to cause dementia². However, it is not yet fully understood what specific factors “shift the curve” to either promote or inhibit the development of dementia. As a result, we initially approached our study in a relatively unbiased fashion, casting a broad net for potential predictive factors out of multidimensional clinical, neuroimaging, and other biomarker data. Through our ML approach, we found that measures of AD neuropathology (CSF amyloid and tau and cerebral amyloid assessed by PET) and neuronal injury (assessed by FDG-PET and structural MRI) explained the most variance in separating fast versus slow progression from MCI to AD dementia. The importance of biomarkers in predicting progression has been studied in the past, but the approach we took allowed us to rank various predictors (including the biomarkers) as well as identify the minimal predictor set that are key to the overall models. When the biomarkers were excluded, predictors such as cognitive performance, cognitive resilience, and expression of selected genes individually explained less of the outcome variance and collectively appeared to offer less new information to the predictive model. Our results support the hypothesis that CSF biomarkers and imaging can be used as surrogates for neuropathology and brain health and can serve as key indicators of future MCI prognosis.

Information added by genetic factors and cognitive resilience

An important aspect of this work is the incorporation of genetic factors and cognitive resilience into our predictive model. Most cases of AD are thought to be genetically complex, with multiple factors presumed to contribute to susceptibility and protection, with the largest known factor being the APOE ε4 allele³⁵. Understanding of the genetic architecture of AD has greatly expanded over the last decade as a result of genome-wide and rare-variant association studies, among other approaches⁷. However, the specific genetic factors that influence progression at various clinical stages of AD are still not well-characterized. Our ML approach identified expression of CR1 as a key factor in predicting fast versus slow progression to AD dementia, which is a novel finding of this study. Interestingly, our final minimal set model included CR1 expression but did not include APOE ε4 allele status, suggesting that the former provided unique information for clinical course prediction while the latter was already represented by surrogate biomarkers, specifically amyloid deposition³⁶. Polymorphisms (genotype variations) in CR1 have been associated with AD status and endophenotypes in numerous large-scale studies^{37,38,39,40,41,42}. CR1 encodes a receptor involved in complement activation, a major immune mechanism with a wide array of functions; it has been proposed that it impacts the clearance of amyloid in AD⁴³. Previous findings on CR1 have been illuminated by a more recent and extensive body of work highlighting immune system pathways as potential cruxes in AD pathophysiology^{44,45,46,47,48}. Our new findings relating CR1 expression to progression from MCI to AD dementia provide further validation of that previous work and argue for greater focus on CR1—and on genetic variation in MCI-to-AD progression more broadly⁴⁹—to enable better understanding of the mechanisms underlying AD and its clinical trajectories.

It is not unexpected that in our ML-based prediction model for MCI to AD progression, CR1 gene expression contributed relatively less to explaining variance than biomarkers of neuropathology and neuronal injury. Surrogates of pathology (fluid biomarker and neuroimaging) reflect the extent of disease related changes and are likely more proximal to clinical manifestations of disease compared to gene expression which may be upstream of these changes and thus possibly modifiable by concomitant forces over time. An example is the relationship between APOE ɛ4 and amyloid: while APOE ɛ4 is a key driver of amyloidosis, the measured effect of amyloid load on cognition is significantly stronger than the impact of APOE ɛ4 on cognition^36,50. Complementary methods to incorporate collective effects of multiple genes, including pathway analyses and polygenic risk scoring^51,52,53,54, could be incorporated into our ML approach in future. In addition, for this study we analyzed microarray gene expression (rather than genotype) data, which offers the conceptual advantage of being a dynamic (rather than static) marker of late-life conditions but which has the disadvantage of representing a downstream effect of influences earlier in life with the potential to be modified by interactions with other heritable and non-heritable factors. Finally, for this study we limited the focus to the AlzGene Top 10 list, but other genes with less well-known population-level effects on case-control status may have larger contributions to late-life clinical progression which could be discovered with an unbiased genome-wide approach. Despite these constraints, our approach serves as a proof of concept that incorporating genetic data can add value to ML-based clinical prediction models and highlights CR1 for further study on its potential effects at the inflection of MCI to AD progression.

Although cognitive resilience has been shown to contribute significantly to delaying the onset of clinical impairment, neither education nor verbal IQ was identified as a key predictor of MCI progression. There are two possible explanations for those results. (i) Preservation of structure or lack of atrophy, as observed via MRI, may be a better surrogate of resilience than are estimates of pre-morbid IQ or educational attainment. It has been shown that cognitive resilience is captured well by greater volume and metabolism, especially in the frontal and cingulate regions⁵⁵. (ii) Cognitive resilience may be more relevant before the onset of cognitive symptoms or impairment¹², and thus may be of less importance in cognitively impaired individuals, who are the focus of this study.

Strengths of the computational approach

We presented a machine learning-based prediction framework for predicting MCI-to-AD progression using state-of-the-art classification techniques under a widely accepted cross-validation setup that accounts for sample bias and unbiased hyper-parameter optimization. Our results indicate that linear classifiers are sufficient to identify patients who have the potential to progress from MCI to AD within 3 years, via the use of multimodal measurements including imaging, CSF, genetic, and clinical data. The reason for this might be that all the features were engineered prior to application of machine learning in such a way that a dichotomy within a feature correlates with disease progression. Linear classifiers are less susceptible to overfitting and thus can be appropriate for clinical translation. In addition, we developed an analytic method for evaluating the generalizability of three different linear classification approaches, namely GLM-ElasticNet, multi-kernel SVM with linear kernels, and standard SVM with a linear kernel. Multi-kernel SVM showed a slightly greater tendency to overfit than the other linear classifiers in our experiments. Because multi-kernel SVM optimizes twice as many model parameters as standard SVM or GLM, it seems that the number of samples used in this study was not enough to train multi-kernel SVM in a generalizable manner. However, as indicated in the experiments, when the number of features used in the classification was small, multi-kernel SVM seemed to provide a more generalizable performance compared to when more features were used in the classification, since there were fewer optimized model parameters in the former scenario. Even so, standard SVM and GLM provided better generalizability and achieved higher classification accuracies with more features. In comparison with previously published predictors of MCI-to-AD progression, our approach achieves superior predictive performance, with a 0.93 area under ROC curve providing a 6% improvement over the current best predictor⁴, which utilized only CSF and imaging modalities for the prediction of short-term MCI-to-AD progression. It is our firm belief that AUCs of test sets are the most reliable measure of a classifier’s general performance, and hence that our evaluation presents a fair assessment of the proposed ML-based predictor.

Limitations and future directions

Our study has several limitations, which may be addressed by future work. We used publicly available data from the ADNI and ultimately analyzed only a modest sample, because we could only use data from patients for whom we had complete data for all the assessed predictive variables. Replication of our methods in similar and larger cohorts would help validate our approach. In addition, application of our final model to an independent dataset could extend our findings by assessing their success in predicting clinical progression from known baseline data. Further, use of alternative surrogate variables—such as verbal episodic memory performance (rather than MMSE scores) as a reflection of cognitive impairment, and lifetime intellectual enrichment (rather than, or in addition to, the AMNART or educational attainment) as a reflection of cognitive resilience—as well as newer biomarkers such as Tau-PET might produce a more optimal model. We also chose to focus on differentiation of faster versus slower progression from MCI. However, it is not yet known whether the factors that are important at that clinical stage are more broadly generalizable to earlier stages of AD at which clinical intervention might have greater impact. Finally, a broader investigation of genetic factors, including genome-wide genotype and expression data, might increase the accuracy of our model and raise additional loci for follow-up.

In conclusion, we used a machine learning approach to integrate data from multiple sources to predict progression from MCI to AD dementia. By analyzing the linear-separability of the dataset, we established that linear classifiers are sufficient for predicting MCI-to-AD progression in the proposed setting. Furthermore, we also developed an analytical method to assess the generalizability of classifiers based on the number of model parameters. We also showed that, the markers of AD pathophysiology (amyloid, tau, and neuronal injury) provided very high predictive values, the genetic factors and brain regions associated with cognitive resilience also displayed independent predictive values. Our findings, particularly in relation to the impact of genetic factors and cognitive resilience in the setting of biomarkers and clinical tests, warrant further investigation using larger datasets and may have downstream benefits leading to improved prognostication and risk stratification in AD.

Data Availability

Data used in preparation of this article were obtained from the Alzheimer’s disease Neuroimaging Initiative (ADNI) database (http://adni.loni.usc.edu). Thus, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data, but did not participate in this analysis or the writing of this report. A complete listing of ADNI investigators can be found at http://adni.loni.usc.edu/wpcontent/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf/. For additional details and up-to-date information, see http://www.adni-info.org.

References

Alzheimer’s Association. 2018 Alzheimer’s disease facts and figures. Alzheimer’s & Dementia 14(3), 367–429, https://doi.org/10.1016/j.jalz.2018.02.001 (2018).
Jack, C. R. Jr. et al. Tracking pathophysiological processes in Alzheimer’s disease: an updated hypothetical model of dynamic biomarkers. The Lancet. Neurology 12, 207–216, https://doi.org/10.1016/S1474-4422(12)70291-0 (2013).
Article CAS PubMed PubMed Central Google Scholar
Farias, S. T., Mungas, D., Reed, B. R., Harvey, D. & DeCarli, C. Progression of mild cognitive impairment to dementia in clinic- vs community-based cohorts. Arch Neurol 66, 1151–1157, https://doi.org/10.1001/archneurol.2009.106 (2009).
Article PubMed PubMed Central Google Scholar
Korolev, I. O., Symonds, L. L. & Bozoki, A. C. Predicting Progression from Mild Cognitive Impairment to Alzheimer’s Dementia Using Clinical, MRI, and Plasma Biomarkers via Probabilistic Pattern Classification. PloS one 11, e0138866, https://doi.org/10.1371/journal.pone.0138866 (2016).
Article CAS PubMed PubMed Central Google Scholar
Roberts, R. O. et al. Higher risk of progression to dementia in mild cognitive impairment cases who revert to normal. Neurology 82, 317–325, https://doi.org/10.1212/wnl.0000000000000055 (2014).
Article PubMed PubMed Central Google Scholar
Tifratene, K., Robert, P., Metelkina, A., Pradier, C. & Dartigues, J. F. Progression of mild cognitive impairment to dementia due to AD in clinical settings. Neurology 85, 331–338, https://doi.org/10.1212/wnl.0000000000001788 (2015).
Article PubMed Google Scholar
Naj, A. C. & Schellenberg, G. D. & Alzheimer’s Disease Genetics, C. Genomic variants, genes, and pathways of Alzheimer’s disease: An overview. Am J Med Genet B Neuropsychiatr Genet 174, 5–26, https://doi.org/10.1002/ajmg.b.32499 (2017).
Article PubMed PubMed Central Google Scholar
Van Cauwenberghe, C., Van Broeckhoven, C. & Sleegers, K. The genetic landscape of Alzheimer disease: clinical implications and perspectives. Genet Med 18, 421–430, https://doi.org/10.1038/gim.2015.117 (2016).
Article PubMed Google Scholar
Elias-Sonnenschein, L. S., Viechtbauer, W., Ramakers, I. H., Verhey, F. R. & Visser, P. J. Predictive value of APOE-epsilon4 allele for progression from MCI to AD-type dementia: a meta-analysis. Journal of neurology, neurosurgery, and psychiatry 82, 1149–1156, https://doi.org/10.1136/jnnp.2010.231555 (2011).
Article PubMed Google Scholar
Reitz, C. & Mayeux, R. Use of genetic variation as biomarkers for mild cognitive impairment and progression of mild cognitive impairment to dementia. Journal of Alzheimer’s disease: JAD 19, 229–251, https://doi.org/10.3233/JAD-2010-1255 (2010).
Article PubMed Google Scholar
Stern, Y. Cognitive reserve in ageing and Alzheimer’s disease. The Lancet. Neurology 11, 1006–1012, https://doi.org/10.1016/S1474-4422(12)70191-6 (2012).
Article PubMed PubMed Central Google Scholar
Vemuri, P. et al. Association of lifetime intellectual enrichment with cognitive decline in the older population. JAMA neurology 71, 1017–1024, https://doi.org/10.1001/jamaneurol.2014.963 (2014).
Article PubMed PubMed Central Google Scholar
Landau, S. M. et al. Comparing predictors of conversion and decline in mild cognitive impairment. Neurology 75, 230–238, https://doi.org/10.1212/WNL.0b013e3181e8e8b8 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, D. & Shen, D. Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PloS one 7, e33182, https://doi.org/10.1371/journal.pone.0033182 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Suk, H. I., Lee, S. W. & Shen, D. & Alzheimer’s Disease Neuroimaging, I. Hierarchical feature representation and multimodal fusion with deep learning for AD/MCI diagnosis. NeuroImage 101, 569–582, https://doi.org/10.1016/j.neuroimage.2014.06.077 (2014).
Article PubMed Google Scholar
Hastie, T., Friedman, J. & Tibshirani, R. In The Elements of Statistical Learning: Data Mining, Inference, and Prediction 193–224 (Springer New York, 2001).
Cherkassky, V. & Dai, W. Y. Empirical Study of the Universum SVM Learning for High-Dimensional Data. Lect Notes Comput Sc 5768, 932–941 (2009).
Article Google Scholar
Weiner, M. W. et al. Impact of the Alzheimer’s Disease Neuroimaging Initiative, 2004 to 2014. Alzheimer’s & Dementia 11, 865–884, https://doi.org/10.1016/j.jalz.2015.04.005 (2015).
Article Google Scholar
Shaw, L. M. et al. Cerebrospinal fluid biomarker signature in Alzheimer’s disease neuroimaging initiative subjects. Ann Neurol 65, 403–413, https://doi.org/10.1002/ana.21610 (2009).
Article CAS PubMed PubMed Central Google Scholar
Jack, C. R. Jr. et al. The Alzheimer’s Disease Neuroimaging Initiative (ADNI): MRI methods. Journal of magnetic resonance imaging: JMRI 27, 685–691, https://doi.org/10.1002/jmri.21049 (2008).
Article PubMed Google Scholar
Jagust, W. J. et al. The Alzheimer’s Disease Neuroimaging Initiative positron emission tomography core. Alzheimer’s & dementia: the journal of the Alzheimer’s Association 6, 221–229, https://doi.org/10.1016/j.jalz.2010.03.003 (2010).
Article Google Scholar
Landau, S. M. et al. Measurement of longitudinal beta-amyloid change with 18F-florbetapir PET and standardized uptake value ratios. J Nucl Med 56, 567–574, https://doi.org/10.2967/jnumed.114.148981 (2015).
Article CAS PubMed Google Scholar
Folstein, M. F., Folstein, S. E. & McHugh, P. R. “Mini-mental state”. A practical method for grading the cognitive state of patients for the clinician. Journal of psychiatric research 12, 189–198 (1975).
Article CAS PubMed Google Scholar
Saykin, A. J. et al. Genetic studies of quantitative MCI and AD phenotypes in ADNI: progress, opportunities, and plan. Alzheimer’s & dementia: the journal of the Alzheimer’s Association (In Press) (2015).
Bertram, L., McQueen, M. B., Mullin, K., Blacker, D. & Tanzi, R. E. Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database. Nature genetics 39, 17–23, https://doi.org/10.1038/ng1934 (2007).
Article CAS PubMed Google Scholar
Yang H. H., M. J. Feature Selection Based on Joint Mutual Information. Proceedings of International ICSC Symposium on Advances in Intelligent Data Analysis, 22–25 (1999).
Brown, G., Pocock, A., Zhao, M. J. & Lujan, M. Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection. J Mach Learn Res 13, 27–66 (2012).
MathSciNet MATH Google Scholar
Cortes, C. & Vapnik, V. Support-Vector Networks. Mach Learn 20, 273–297, https://doi.org/10.1007/Bf00994018 (1995).
Article MATH Google Scholar
Scholkopf, B. & Smola, A. J. A short introduction to learning with kernels. Lect Notes Artif Int 2600, 41–64 (2002).
MATH Google Scholar
Scholkopf, B. et al. Comparing support vector machines with Gaussian kernels to radial basis function classifiers. Ieee T Signal Proces 45, 2758–2765, https://doi.org/10.1109/78.650102 (1997).
Article ADS Google Scholar
Gonen, M. & Alpaydin, E. Multiple Kernel Learning Algorithms. J Mach Learn Res 12, 2211–2268 (2011).
MathSciNet MATH Google Scholar
Hosmer, D. W., Lemeshow, S. & Sturdivant, R. X. Applied Logistic Regression, 3rd Edition. Wiley Ser Probab St, 1–500, https://doi.org/10.1002/9781118548387 (2013).
Zou, H. & Hastie, T. Regularization and variable selection via the elastic net (vol. B 67, pg 301, 2005). J Roy Stat Soc B 67, 768–768, https://doi.org/10.1111/j.1467-9868.2005.00527.x (2005).
Article Google Scholar
Tibshirani, R. Regression shrinkage and selection via the Lasso. J Roy Stat Soc B Met 58, 267–288 (1996).
MathSciNet MATH Google Scholar
Ramanan, V. K. & Saykin, A. J. Pathways to neurodegeneration: mechanistic insights from GWAS in Alzheimer’s disease, Parkinson’s disease, and related disorders. American journal of neurodegenerative disease 2, 145–175 (2013).
PubMed PubMed Central Google Scholar
Vemuri, P. et al. Age, vascular health, and Alzheimer disease biomarkers in an elderly sample. Ann Neurol 82, 706–718, https://doi.org/10.1002/ana.25071 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lambert, J. C. et al. Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer’s disease. Nature genetics 41, 1094–U1068, https://doi.org/10.1038/ng.439 (2009).
Article CAS PubMed Google Scholar
Keenan, B. T. et al. A coding variant in CR1 interacts with APOE-epsilon4 to influence cognitive decline. Human molecular genetics 21, 2377–2388, https://doi.org/10.1093/hmg/dds054 (2012).
Article CAS PubMed PubMed Central Google Scholar
Barral, S. et al. Genotype patterns at PICALM, CR1, BIN1, CLU, and APOE genes are associated with episodic memory. Neurology 78, 1464–1471, https://doi.org/10.1212/WNL.0b013e3182553c48 (2012).
Article CAS PubMed PubMed Central Google Scholar
Chibnik, L. B. et al. CR1 is associated with amyloid plaque burden and age-related cognitive decline. Ann Neurol 69, 560–569, https://doi.org/10.1002/ana.22277 (2011).
Article CAS PubMed PubMed Central Google Scholar
Jun, G. et al. Meta-analysis Confirms CR1, CLU, and PICALM as Alzheimer Disease Risk Loci and Reveals Interactions With APOE Genotypes. Arch Neurol 67, 1473–1484, https://doi.org/10.1001/archneurol.2010.201 (2010).
Article PubMed PubMed Central Google Scholar
Thambisetty, M. et al. Effect of complement CR1 on brain amyloid burden during aging and its modification by APOE genotype. Biol Psychiatry 73, 422–428, https://doi.org/10.1016/j.biopsych.2012.08.015 (2013).
Article CAS PubMed Google Scholar
Karch, C. M. & Goate, A. M. Alzheimer’s disease risk genes and mechanisms of disease pathogenesis. Biol Psychiatry 77, 43–51, https://doi.org/10.1016/j.biopsych.2014.05.006 (2015).
Article CAS PubMed Google Scholar
Sims, R. et al. Rare coding variants in PLCG2, ABI3, and TREM2 implicate microglial-mediated innate immunity in Alzheimer’s disease. Nature genetics 49, 1373–1384, https://doi.org/10.1038/ng.3916 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jones, L. et al. Genetic evidence implicates the immune system and cholesterol metabolism in the aetiology of Alzheimer’s disease. PloS one 5, e13950, https://doi.org/10.1371/journal.pone.0013950 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, B. et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell 153, 707–720, https://doi.org/10.1016/j.cell.2013.03.030 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hamelin, L. et al. Early and protective microglial activation in Alzheimer’s disease: a prospective study using 18F-DPA-714 PET imaging. Brain 139, 1252–1264, https://doi.org/10.1093/brain/aww017 (2016).
Article PubMed Google Scholar
Ramanan, V. K. et al. GWAS of longitudinal amyloid accumulation on 18F-florbetapir PET in Alzheimer’s disease implicates microglial activation gene IL1RAP. Brain 138, 3076–3088, https://doi.org/10.1093/brain/awv231 (2015).
Article PubMed PubMed Central Google Scholar
Lacour, A. et al. Genome-wide significant risk factors for Alzheimer’s disease: role in progression to dementia due to Alzheimer’s disease among subjects with mild cognitive impairment. Mol Psychiatry, https://doi.org/10.1038/mp.2016.18 (2016).
Lim, Y. Y. et al. Stronger effect of amyloid load than APOE genotype on cognitive decline in healthy older adults. Neurology 79, 1645–1652, https://doi.org/10.1212/WNL.0b013e31826e9ae6 (2012).
Article CAS PubMed Google Scholar
Desikan, R. S. et al. Genetic assessment of age-associated Alzheimer disease risk: Development and validation of a polygenic hazard score. PLoS Med 14, e1002258, https://doi.org/10.1371/journal.pmed.1002258 (2017).
Article PubMed PubMed Central Google Scholar
Escott-Price, V., Shoai, M., Pither, R., Williams, J. & Hardy, J. Polygenic score prediction captures nearly all common genetic risk for Alzheimer’s disease. Neurobiology of aging 49, 214 e217–214 e211, https://doi.org/10.1016/j.neurobiolaging.2016.07.018 (2017).
Article Google Scholar
Mormino, E. C. et al. Polygenic risk of Alzheimer disease is associated with early- and late-life processes. Neurology 87, 481–488, https://doi.org/10.1212/WNL.0000000000002922 (2016).
Article CAS PubMed PubMed Central Google Scholar
Ramanan, V. K., Shen, L., Moore, J. H. & Saykin, A. J. Pathway analysis of genomic data: concepts, methods, and prospects for future development. Trends Genet 28, 323–332, https://doi.org/10.1016/j.tig.2012.03.004 (2012).
Article CAS PubMed PubMed Central Google Scholar
Arenaza-Urquijo, E. M. et al. Relationships between years of education and gray matter volume, metabolism and functional connectivity in healthy elders. NeuroImage 83, 450–457, https://doi.org/10.1016/j.neuroimage.2013.06.053 (2013).
Article PubMed Google Scholar

Download references

Acknowledgements

This work was partly supported by National Institute of Health grants (R01 NS097495 and R01 AG056366), National Science Foundation grants (CNS-1337732 and CNS-1624790), Mayo Clinic and Illinois Alliance Fellowship for Technology-based Healthcare Research and an IBM faculty award. We would like to thank Dr. Duygu Tosun for providing clarification regarding the ADNI Freesurfer data. Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
Yogatheesan Varatharajah & Ravishankar Iyer
Mayo Clinic, Rochester, MN, 55905, USA
Vijay K. Ramanan, Prashanthi Vemuri, Ronald Petersen, Clifford R. Jack, Matthew Bernstein, Bret Borowski, Jeff Gunter, Matt Senjem, David Jones, Kejal Kantarci, Chad Ward, Colleen S. Albers, David Knopman & Kris Johnson
University of California, San Francisco, USA
Michael W. Weiner, Norbert Schuff, Howard J. Rosen, Bruce L. Miller & David Perry
University of Southern California, Los Angeles, USA
Paul Aisen, Arthur W. Toga, Karen Crawford, Scott Neu, Lon S. Schneider, Sonia Pawluczyk, Mauricio Beccera, Liberty Teodoro & Bryan M. Spann
Indiana University, Bloomington, USA
Andrew J. Saykin, Tatiana M. Foroud, Kelley Faber, Sungeun Kim, Kwangsik Nho, Sara S. Mason, Martin R. Farlow, Ann Marie Hake, Brandy R. Matthews, Jared R. Brosch & Scott Herring
University of California, Berkeley, Berkeley, USA
William Jagust & Susan Landau
University of Pennsylvania, Philadelphia, USA
John Q. Trojanowki, Leslie M. Shaw, Virginia Lee, Magdalena Korecka, Michal Figurski, Steven E. Arnold, Jason H. Karlawish, David A. Wolk & Christopher M. Clark
University of California, Davis, Davis, USA
Laurel Beckett, Danielle Harvey, Charles DeCArli, Evan Fletcher, Pauline Maillard, John Olichney, Charles DeCarli & Owen Carmichael
Brigham and Women’s Hospital/Harvard Medical School, Boston, USA
Robert C. Green, Reisa A. Sperling, Keith A. Johnson & Gad Marshall
Washington University St. Louis, St. Louis, USA
John Morris, Marc Raichle, David Holtzman, Nigel J. Cairns, Erin Franklin, Lisa Taylor-Reinwald, Beau Ances, Maria Carroll, Mary L. Creech, Erin Franklin, Mark A. Mintun, Stacy Schneider & Angela Oliver
Prevent Alzheimer’s Disease, 2020, Rockville, USA
Zaven Khachaturian
Siemens, Munich, Germany
Greg Sorensen
Alzheimer’s Association, Illinois, USA
Maria Carrillo
University of Pittsburgh, Pennsylvania, USA
Lew Kuller, Chet Mathis, Oscar L. Lopez, Mary Ann Oakley & Donna M. Simpson
Cornell University, NewYork, USA
Steven Paul, Norman Relkin, Gloria Chaing, Michael Lin & Lisa Ravdin
Albert Einstein College of Medicine ofYeshiva University, New York, USA
Peter Davies
AD Drug Discovery Foundation, New York, USA
Howard Fillit
Acumen Pharmaceuticals, California, USA
Franz Hefti
Northwestern University, Illinois, USA
M. Marcel Mesulam, Emily Rogalski, Kristine Lipowski, Sandra Weintraub, Borna Bonakdarpour, Diana Kerwin, Chuang-Kuo Wu & Nancy Johnson
National Institute of Mental Health, Maryland, USA
William Potter
Brown University, Rhode Island, USA
Peter Snyder
Eli Lilly, Indiana, USA
Adam Schwartz
University of Washington, Washington, USA
Tom Montine
University of California, San Diego, California, USA
Ronald G. Thomas, Michael Donohue, Sarah Walter, Devon Gessert, Tamie Sather, Gus Jiminez, Archana B. Balasubramanian, Jennifer Mason, Iris Sim, Leon Thal, James Brewer, Helen Vanderswag & Adam Fleisher
University of London, London, UK
Nick Fox
University of California, Los Angeles, California, USA
Paul Thompson, Liana Apostolova, Kathleen Tingus, Ellen Woo, Daniel H. S. Silverman, Po H. Lu & George Bartzokis
University of Michigan, Michigan, USA
Robert A. Koeppe, Judith L. Heidebrink & Joanne L. Lord
University of Utah, Utah, USA
Norm Foster
Banner Alzheimer’s Institute, Arizona, USA
Eric M. Reiman, Kewei Chen, Pierre Tariot, Anna Burke, Ann Marie Milliken, Nadira Trncic, Adam Fleisher & Stephanie Reeder
University of California, Irvine, California, USA
Steven Potkin, Adrian Preda & Dana Nguyen
National Institute on Aging, Maryland, USA
Neil Buckholtz & John Hsiao
Johns Hopkins University, Maryland, USA
Marilyn Albert, Chiadi Onyike, Daniel D’Agostino & Stephanie Kielb
Richard Frank Consulting, New Hampshire, USA
Richard Frank
Oregon Health and Science University, Oregon, USA
Jeffrey Kaye, Joseph Quinn, Lisa Silbert, Betty Lind, Raina Carter & Sara Dolen
Baylor College of Medicine, Texas, USA
Rachelle S. Doody, Javier Villanueva-Meyer, Valory Pavlik, Victoria Shibley, Munir Chowdhury, Susan Rountree & Mimi Dang
Columbia University Medical Center, New York, USA
Yaakov Stern, Lawrence S. Honig & Karen L. Bell
University of Alabama-Birmingham, Alabama, USA
Daniel Marson, David Geldmacher, Marissa Natelson Love, Randall Griffith, David Clark, John Brockington & Erik Roberson
Mount Sinai School of Medicine, NewYork, USA
Hillel Grossman & Effie Mitsis
Rush University Medical Center, Rush University, Illinois, USA
Raj C. Shah & Leyla deToledo-Morrell
Wien Center, Florida, USA
Ranjan Duara, Maria T. Greig-Custo & Warren Barker
NewYork University, New York, USA
Martin Sadowski, Mohammed O. Sheikh, Anaztasia Ulysse & Mrunalini Gaikwad
Duke University Medical Center, North Carolina, USA
P. Murali Doraiswamy, Jeffrey R. Petrella, Salvador Borges-Neto, Terence Z. Wong & Edward Coleman
University of Kentucky, Kentucky, USA
Charles D. Smith, Greg Jicha, Peter Hardy, Partha Sinha, Elizabeth Oates & Gary Conrad
University of Rochester Medical Center, New York, USA
Anton P. Porsteinsson, Bonnie S. Goldstein, Kim Martin, Kelly M. Makino, M. Saleem Ismail & Connie Brand
University of Texas Southwestern Medical School, Texas, USA
Kyle Womack, Dana Mathews & Mary Quiceno
Emory University, Georgia, USA
Allan I. Levey, James J. Lah & Janet S. Cellar
University of Kansas, Medical Center, Kansas, USA
Jeffrey M. Burns, Russell H. Swerdlow & William M. Brooks
Mayo Clinic, Jacksonville, Florida, USA
Neill R Graff-Radford, Francine Parfitt & Kim Poki-Walker
Yale University School of Medicine, Connecticut, USA
Christopher H. van Dyck, Richard E. Carson, Martha G. MacAvoy & Pradeep Varma
McGill University, Montreal- Jewish General Hospital, Quebec, Canada
Howard Chertkow, Howard Bergman & Chris Hosein
Sunnybrook Health Sciences, Ontario, Canada
Sandra Black, Bojana Stefanovic & Curtis Caldwell
U.B.C. Clinic for AD & Related Disorders, British Columbia, Canada
Ging-Yuek Robin Hsiung, Benita Mudge, Vesna Sossi, Howard Feldman & Michele Assaly
Cognitive Neurology-St. Joseph’s, Ontario, Canada
Elizabeth Finger, Stephen Pasternack, Irina Rachisky, John Rogers, Dick Trost & Andrew Kertesz
Cleveland Clinic Lou Ruvo Center for Brain Health, Ohio, USA
Charles Bernick & Donna Munic
Premiere Research Inst (Palm Beach Neurology), Florida, USA
Carl Sadowsky & Teresa Villena
Georgetown University Medical Center, Washington, D.C, USA
Raymond Scott Turner, Kathleen Johnson & Brigid Reynolds
Stanford University, California, USA
Jerome Yesavage, Joy L. Taylor, Barton Lane, Allyson Rosen & Jared Tinklenberg
Banner Sun Health Research Institute, Arizona, USA
Marwan N. Sabbagh, Christine M. Belden, Sandra A. Jacobson & Sherye A. Sirrel
Boston University, Massachusetts, USA
Neil Kowall, Ronald Killiany, Andrew E. Budson, Alexander Norbash & Patricia Lynn Johnson
Howard University, Washington, D.C, USA
Thomas O. Obisesan, Saba Wolday & Joanne Allard
Case Western Reserve University, Ohio, USA
Alan Lerner, Paula Ogrocki, Curtis Tatsuoka & Parianne Fatica
Neurological Care of CNY, New York, USA
Smita Kittur
Parkwood Hospital, Pennsylvania, USA
Michael Borrie, T.-Y. Lee & Rob Bartha
University of Wisconsin, Wisconsin, USA
Sterling Johnson, Sanjay Asthana & Cynthia M. Carlsson
Dent Neurologic Institute, New York, USA
Vernice Bates, Horacio Capote & Michelle Rainka
Ohio State University, Ohio, USA
Douglas W. Scharre, Maria Kataki & Brendan Kelly
Albany Medical College, New York, USA
Earl A. Zimmerman, Dzintra Celmins & Alice D. Brown
Hartford Hospital, Olin Neuropsychiatry Research Center, Connecticut, USA
Godfrey D. Pearlson, Karen Blank & Karen Anderson
Dartmouth-Hitchcock Medical Center, New Hampshire, USA
Laura A. Flashman, Marc Seltzer, Mary L. Hynes & Robert B. Santulli
Wake Forest University Health Sciences, North Carolina, USA
Kaycee M. Sink, Leslie Gordineer, Jeff D. Williamson, Pradeep Garg & Franklin Watkins
Rhode Island Hospital, Rhode Island, USA
Brian R. Ott, Geoffrey Tremont & Lori A. Daiello
Butler Hospital, Rhode Island, USA
Stephen Salloway, Paul Malloy & Stephen Correia
Medical University South Carolina, Carolina, USA
Jacobo Mintzer, Kenneth Spicer & David Bachman
Nathan Kline Institute, NewYork, USA
Nunzio Pomara, Raymundo Hernando & Antero Sarrael
University of Iowa College of Medicine, Iowa, USA
Susan K. Schultz, Karen Ekstam Smith, Hristina Koleva, Ki Won Nam & Hyungsub Shim
USF Health Byrd Alzheimer’s Institute, University of South Florida, Florida, USA
Amanda Smith, Balebail Ashok Raj & Kristin Fargher

Author notes

A comprehensive list of consortium members appears at the end of the paper

Authors

Yogatheesan Varatharajah
View author publications
You can also search for this author in PubMed Google Scholar
Vijay K. Ramanan
View author publications
You can also search for this author in PubMed Google Scholar
Ravishankar Iyer
View author publications
You can also search for this author in PubMed Google Scholar
Prashanthi Vemuri
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

For the Alzheimer’s Disease Neuroimaging Initiative

Michael W. Weiner
, Paul Aisen
, Ronald Petersen
, Clifford R. Jack
, Andrew J. Saykin
, William Jagust
, John Q. Trojanowki
, Arthur W. Toga
, Laurel Beckett
, Robert C. Green
, John Morris
, Leslie M. Shaw
, Zaven Khachaturian
, Greg Sorensen
, Maria Carrillo
, Lew Kuller
, Marc Raichle
, Steven Paul
, Peter Davies
, Howard Fillit
, Franz Hefti
, David Holtzman
, M. Marcel Mesulam
, William Potter
, Peter Snyder
, Adam Schwartz
, Tom Montine
, Ronald G. Thomas
, Michael Donohue
, Sarah Walter
, Devon Gessert
, Tamie Sather
, Gus Jiminez
, Archana B. Balasubramanian
, Jennifer Mason
, Iris Sim
, Danielle Harvey
, Matthew Bernstein
, Nick Fox
, Paul Thompson
, Norbert Schuff
, Charles DeCArli
, Bret Borowski
, Jeff Gunter
, Matt Senjem
, David Jones
, Kejal Kantarci
, Chad Ward
, Robert A. Koeppe
, Norm Foster
, Eric M. Reiman
, Kewei Chen
, Chet Mathis
, Susan Landau
, Nigel J. Cairns
, Erin Franklin
, Lisa Taylor-Reinwald
, Virginia Lee
, Magdalena Korecka
, Michal Figurski
, Karen Crawford
, Scott Neu
, Tatiana M. Foroud
, Steven Potkin
, Kelley Faber
, Sungeun Kim
, Kwangsik Nho
, Leon Thal
, Neil Buckholtz
, Marilyn Albert
, Richard Frank
, John Hsiao
, Jeffrey Kaye
, Joseph Quinn
, Lisa Silbert
, Betty Lind
, Raina Carter
, Sara Dolen
, Lon S. Schneider
, Sonia Pawluczyk
, Mauricio Beccera
, Liberty Teodoro
, Bryan M. Spann
, James Brewer
, Helen Vanderswag
, Adam Fleisher
, Judith L. Heidebrink
, Joanne L. Lord
, Sara S. Mason
, Colleen S. Albers
, David Knopman
, Kris Johnson
, Rachelle S. Doody
, Javier Villanueva-Meyer
, Valory Pavlik
, Victoria Shibley
, Munir Chowdhury
, Susan Rountree
, Mimi Dang
, Yaakov Stern
, Lawrence S. Honig
, Karen L. Bell
, Beau Ances
, Maria Carroll
, Mary L. Creech
, Erin Franklin
, Mark A. Mintun
, Stacy Schneider
, Angela Oliver
, Daniel Marson
, David Geldmacher
, Marissa Natelson Love
, Randall Griffith
, David Clark
, John Brockington
, Erik Roberson
, Hillel Grossman
, Effie Mitsis
, Raj C. Shah
, Leyla deToledo-Morrell
, Ranjan Duara
, Maria T. Greig-Custo
, Warren Barker
, Chiadi Onyike
, Daniel D’Agostino
, Stephanie Kielb
, Martin Sadowski
, Mohammed O. Sheikh
, Anaztasia Ulysse
, Mrunalini Gaikwad
, P. Murali Doraiswamy
, Jeffrey R. Petrella
, Salvador Borges-Neto
, Terence Z. Wong
, Edward Coleman
, Steven E. Arnold
, Jason H. Karlawish
, David A. Wolk
, Christopher M. Clark
, Charles D. Smith
, Greg Jicha
, Peter Hardy
, Partha Sinha
, Elizabeth Oates
, Gary Conrad
, Oscar L. Lopez
, Mary Ann Oakley
, Donna M. Simpson
, Anton P. Porsteinsson
, Bonnie S. Goldstein
, Kim Martin
, Kelly M. Makino
, M. Saleem Ismail
, Connie Brand
, Adrian Preda
, Dana Nguyen
, Kyle Womack
, Dana Mathews
, Mary Quiceno
, Allan I. Levey
, James J. Lah
, Janet S. Cellar
, Jeffrey M. Burns
, Russell H. Swerdlow
, William M. Brooks
, Liana Apostolova
, Kathleen Tingus
, Ellen Woo
, Daniel H. S. Silverman
, Po H. Lu
, George Bartzokis
, Neill R Graff-Radford
, Francine Parfitt
, Kim Poki-Walker
, Martin R. Farlow
, Ann Marie Hake
, Brandy R. Matthews
, Jared R. Brosch
, Scott Herring
, Christopher H. van Dyck
, Richard E. Carson
, Martha G. MacAvoy
, Pradeep Varma
, Howard Chertkow
, Howard Bergman
, Chris Hosein
, Sandra Black
, Bojana Stefanovic
, Curtis Caldwell
, Ging-Yuek Robin Hsiung
, Benita Mudge
, Vesna Sossi
, Howard Feldman
, Michele Assaly
, Elizabeth Finger
, Stephen Pasternack
, Irina Rachisky
, John Rogers
, Dick Trost
, Andrew Kertesz
, Charles Bernick
, Donna Munic
, Emily Rogalski
, Kristine Lipowski
, Sandra Weintraub
, Borna Bonakdarpour
, Diana Kerwin
, Chuang-Kuo Wu
, Nancy Johnson
, Carl Sadowsky
, Teresa Villena
, Raymond Scott Turner
, Kathleen Johnson
, Brigid Reynolds
, Reisa A. Sperling
, Keith A. Johnson
, Gad Marshall
, Jerome Yesavage
, Joy L. Taylor
, Barton Lane
, Allyson Rosen
, Jared Tinklenberg
, Marwan N. Sabbagh
, Christine M. Belden
, Sandra A. Jacobson
, Sherye A. Sirrel
, Neil Kowall
, Ronald Killiany
, Andrew E. Budson
, Alexander Norbash
, Patricia Lynn Johnson
, Thomas O. Obisesan
, Saba Wolday
, Joanne Allard
, Alan Lerner
, Paula Ogrocki
, Curtis Tatsuoka
, Parianne Fatica
, Evan Fletcher
, Pauline Maillard
, John Olichney
, Charles DeCarli
, Owen Carmichael
, Smita Kittur
, Michael Borrie
, T.-Y. Lee
, Rob Bartha
, Sterling Johnson
, Sanjay Asthana
, Cynthia M. Carlsson
, Pierre Tariot
, Anna Burke
, Ann Marie Milliken
, Nadira Trncic
, Adam Fleisher
, Stephanie Reeder
, Vernice Bates
, Horacio Capote
, Michelle Rainka
, Douglas W. Scharre
, Maria Kataki
, Brendan Kelly
, Earl A. Zimmerman
, Dzintra Celmins
, Alice D. Brown
, Godfrey D. Pearlson
, Karen Blank
, Karen Anderson
, Laura A. Flashman
, Marc Seltzer
, Mary L. Hynes
, Robert B. Santulli
, Kaycee M. Sink
, Leslie Gordineer
, Jeff D. Williamson
, Pradeep Garg
, Franklin Watkins
, Brian R. Ott
, Geoffrey Tremont
, Lori A. Daiello
, Stephen Salloway
, Paul Malloy
, Stephen Correia
, Howard J. Rosen
, Bruce L. Miller
, David Perry
, Jacobo Mintzer
, Kenneth Spicer
, David Bachman
, Nunzio Pomara
, Raymundo Hernando
, Antero Sarrael
, Susan K. Schultz
, Karen Ekstam Smith
, Hristina Koleva
, Ki Won Nam
, Hyungsub Shim
, Norman Relkin
, Gloria Chaing
, Michael Lin
, Lisa Ravdin
, Amanda Smith
, Balebail Ashok Raj
& Kristin Fargher

Contributions

Y.V. and P.V. designed the study. Y.V. developed the methods, conducted the analyses, and drafted the paper. Y.V., V.R., R.I. and P.V. participated in the interpretation of the results and critical revision of the manuscript. All the ADNI consortium authors contributed to the acquisition, processing, and public availability of ADNI data.

Corresponding authors

Correspondence to Yogatheesan Varatharajah or Prashanthi Vemuri.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Varatharajah, Y., Ramanan, V.K., Iyer, R. et al. Predicting Short-term MCI-to-AD Progression Using Imaging, CSF, Genetic Factors, Cognitive Resilience, and Demographics. Sci Rep 9, 2235 (2019). https://doi.org/10.1038/s41598-019-38793-3

Download citation

Received: 23 April 2018
Accepted: 07 January 2019
Published: 19 February 2019
DOI: https://doi.org/10.1038/s41598-019-38793-3

This article is cited by

Anodal HD-tDCS on the dominant anterior temporal lobe and dorsolateral prefrontal cortex: clinical results in patients with mild cognitive impairment
- Soheila Rezakhani
- Mahmood Amiri
- Vahid Sheibani
Alzheimer's Research & Therapy (2024)
Doubly robust estimation and robust empirical likelihood in generalized linear models with missing responses
- Liugen Xue
Statistics and Computing (2024)
Whole Person Modeling: a transdisciplinary approach to mental health research
- Daniel Felsky
- Alyssa Cannitelli
- Jon Pipitone
Discover Mental Health (2023)
A novel machine learning based technique for classification of early-stage Alzheimer’s disease using brain images
- Ruhul Amin Hazarika
- Debdatta Kandar
- Arnab Kumar Maji
Multimedia Tools and Applications (2023)
MRI-based model for MCI conversion using deep zero-shot transfer learning
- Fujia Ren
- Chenhui Yang
- Y. A. Nanehkaran
The Journal of Supercomputing (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

Study participants

Predictive factors

Cerebrospinal fluid (CSF) biomarkers

Magnetic resonance imaging biomarkers

Positron emission tomography (PET) biomarkers

Cognition

Cognitive resilience

Genetic factors

Demographics

Data aggregation and ML preprocessing

ML-based prediction framework

Feature reduction using joint mutual information (JMI)

Classification methods

Hyper-parameter optimization

Goodness-of-fit evaluation and generalized performance metrics

Implementation of the prediction framework

Test for linear separability

Results

Linear separability

Generalizability of classifiers

Prediction performance of linear classifiers

Predictive ability of individual modalities

Best predictors of MCI-to-AD progression

Discussion

Predictors of MCI progression to AD

Information added by genetic factors and cognitive resilience

Strengths of the computational approach

Limitations and future directions

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Author notes

A comprehensive list of consortium members appears at the end of the paper

Consortia

For the Alzheimer’s Disease Neuroimaging Initiative

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links