Forecasting individual progression trajectories in Alzheimer’s disease

Maheux, Etienne; Koval, Igor; Ortholand, Juliette; Birkenbihl, Colin; Archetti, Damiano; Bouteloup, Vincent; Epelbaum, Stéphane; Dufouil, Carole; Hofmann-Apitius, Martin; Durrleman, Stanley

doi:10.1038/s41467-022-35712-5

Download PDF

Article
Open access
Published: 10 February 2023

Forecasting individual progression trajectories in Alzheimer’s disease

Nature Communications volume 14, Article number: 761 (2023) Cite this article

8372 Accesses
6 Citations
30 Altmetric
Metrics details

Subjects

Abstract

The anticipation of progression of Alzheimer’s disease (AD) is crucial for evaluations of secondary prevention measures thought to modify the disease trajectory. However, it is difficult to forecast the natural progression of AD, notably because several functions decline at different ages and different rates in different patients. We evaluate here AD Course Map, a statistical model predicting the progression of neuropsychological assessments and imaging biomarkers for a patient from current medical and radiological data at early disease stages. We tested the method on more than 96,000 cases, with a pool of more than 4,600 patients from four continents. We measured the accuracy of the method for selecting participants displaying a progression of clinical endpoints during a hypothetical trial. We show that enriching the population with the predicted progressors decreases the required sample size by 38% to 50%, depending on trial duration, outcome, and targeted disease stage, from asymptomatic individuals at risk of AD to subjects with early and mild AD. We show that the method introduces no biases regarding sex or geographic locations and is robust to missing data. It performs best at the earliest stages of disease and is therefore highly suitable for use in prevention trials.

A multidimensional ODE-based model of Alzheimer’s disease progression

Article Open access 23 February 2023

Data-driven causal model discovery and personalized prediction in Alzheimer's disease

Article Open access 08 September 2022

AD Course Map charts Alzheimer’s disease progression

Article Open access 13 April 2021

Introduction

The cost of drug development is highest, by far, for neurodegenerative diseases, with unparalleled failure rates¹. In this respect, the controversial approval of aducanumab on 7 June 2021 by the Food and Drug Administration (FDA) represents a turning point in Alzheimer’s disease (AD) drug development². This decision raises the critical issue of demonstrating the clinical benefit of a compound acting on a key biological process, the accumulation of amyloid plaques in the brain³.

It remains unclear why an effective intervention for such a key biological mechanism is only weakly associated with lower levels of cognitive decline. It is likely that the core biological processes and their interactions are not yet fully understood. Another, non-exclusive explanation is that the issue of who and when to treat must be addressed with greater precision to demonstrate clinical efficacy. In 2019, Cummings and coworkers were already stressing the need to improve clinical trials, by targeting the right participant with the right biomarker in the right trial⁴. The motivation, here, is simple: it is not possible to show that a candidate therapy slows down the degradation of the endpoint if this endpoint is not expected to worsen during the trial. The treatment effect size will be larger if one includes participants right before the disease progression would cause a significant change in the endpoint without an intervention. Such a target period depends on the endpoint selected to demonstrate efficacy.

It is particularly difficult to identify the most appropriate time frame for a disease like AD, which progresses over decades, in a non-linear manner, and with different clinical presentations between patients. The thresholds currently used for the main biomarkers and clinical endpoints are not sufficiently effective for the selection of patient populations with homogeneous progression profiles⁵. Disease modeling uses computational and statistical methods to address this question^{6,7,8,9,10,11,12,13,14}. These models learn the variability of disease progression from observational longitudinal cohort data and can then predict the progression of patients from their historical data. They require various clinical or biomarker assessments at one or several time points as input. These techniques are beginning to be evaluated for clinical trial design. For example, a retrospective analysis showed that the effect size of treatment could be increased by targeting participants with a predicted type of progression at trial entry¹⁵. Other studies indicated that predicting the value of endpoints might make it possible to reduce sample sizes in clinical trials^16,17.

We propose here a software tool using a disease progression model for participant selection in clinical trials. The goal is to enrich the selected population of participants likely to display progression during the trial, a concept called prognostic enrichment¹⁸ by the FDA and already applied in some AD trials^19,20. We will use AD Course Map as a disease progression model. It is a non-linear mixed-effect model, which predicts both the dynamics of progression and the clinical presentation of the disease^21,22. This technique outperformed the 56 alternative methods for predicting cognitive decline in the framework of the TADPOLE challenge^6,23. We will compare this model with RNN-AD, which is a recurrent neural network, namely a deep learning method that learns temporal dynamic behavior. In June 2020, it ranked 2nd for the prediction of cognitive decline in the TADPOLE challenge²⁴.

We will first evaluate the ability of the model to predict progression for the main endpoints used as outcomes in current clinical trials. We will use five independent data sets with data from more than 4600 patients spread over four continents. We will analyze the systematic biases of such algorithms, their robustness to missing data, and suitability for generalization across countries, ethnicities, and disease stages. Finally, we will simulate inclusion procedures for clinical trials by varying several key parameters: the chosen outcome, trial duration, and selection criteria. Finally, we will show that participants predicted to be at risk of the outcome worsening constitute a population likely to show a greater and more homogeneous response to treatment.

Results

Characteristics of the study population

We used data from 4687 participants from five longitudinal multicenter cohorts from North America, Australia, Japan, and Europe: the Alzheimer’s disease neuroimaging initiative (ADNI)^{25,26,27,28,29,30,31} (N = 1652), the Australian imaging, biomarker and lifestyle flagship study of aging (AIBL)^32,33 (N = 460), the Japanese Alzheimer’s disease neuroimaging initiative (J-ADNI)^34,35 (N = 470), the PharmaCog cohort^36,37 (N = 111) and the MEMENTO cohort³⁸ (N = 1994). Each study enrolled participants attending memory clinics.

Tables 1 and 2 summarize the characteristics of each data set. These data sets contain diverse patient profiles from different ethnic, genetic, and geographic backgrounds, with follow-up visits at different disease stages. For all these studies, the neuropsychological examinations were performed in accordance with international standards, and the image acquisition procedures were performed in accordance with the protocols established by the ADNI consortium. Together, these data sets, therefore, correspond to a relevant pool of patients for simulating inclusion procedures for a typical large multicenter phase III trial.

Table 1 Characteristics of study participants

Full size table

Table 2 Distribution of endpoints for study participants

Full size table

Disease progression models learn the timing of changes in biomarker levels during disease progression

We train disease progression models using the ADNI participants with confirmed pathological amyloid levels as the training set (N = 866) with baseline and all available follow-up data. We kept the data from the other ADNI participants and the members of the four external cohorts as the validation set (N = 3821). The same protocol for training and validating the models is used for AD Course Map and RNN-AD. See Methods for details.

The two models include the following endpoints: Mini-Mental State Examination (MMSE), Alzheimer’s Disease Assessment Scale—cognitive sub-scale with 13 items (ADAS-Cog13), Clinical Dementia Rating—sum of boxes (CDR-SB), volumes of the left and right hippocampus and lateral ventricles, Aβ_1–42 and p-tau₁₈₁ levels in the cerebrospinal fluid (CSF), standard uptake value ratio (SUVR) for Amyloid PET and Tau PET scans. See Methods for details.

AD Course Map assumes that these endpoints follow a logistic progression curve during disease progression with distinct progression rate and age at the inflexion point^21,22. It learns how this set of logistic curves need to be adjusted to fit individual data by changing the dynamic of progression and disease presentation (i.e., the relative value of the endpoints at a given disease stage). By contrast, RNN-AD learns how the values of the endpoints will change in the next month given the values of the endpoint at a given time-point. The 1-month transition is assumed to be a non-linear function (e.g. a neural network) of the current value of the endpoints and the current diagnosis. Supplementary Table 1 shows the goodness-of-fit on the training set, consistent with the results of our previous studies on AD Course Map⁶ and RNN-AD²⁴. See Methods for details.

Disease progression models forecast cognitive decline

The disease progression models predict the subject-specific trajectory of biomarker changes from data collected from the subject concerned at one or several visits. The predicted trajectory is used to forecast the values of the biomarkers at future time points. Figure 1 illustrates this forecast procedure.

**Fig. 1: Disease progression models forecast the progression of endpoints from historical data of a participant.**

We repeatedly assessed the errors of AD Course Map and RNN-AD for forecasting cognitive endpoints (ADAS-Cog13, MMSE and CDR-SB) for participants in the validation set. We blinded the latest visits of the participants and tried to predict them from the unblinded data (see Supplementary Fig. 1 and Methods for details of the procedure). From 44,435 forecasts for ADAS-Cog13 (96,970 for MMSE and 96,849 for CDR-SB), we determined the absolute difference between predicted and actual results as a function of the characteristics of the participants and the information used for forecasting purposes.

Figure 2 shows the distribution of mean absolute errors (MAE) for AD Course Map and RNN-AD adjusted for co-founding factors. The reported errors are for the reference participant in the reference forecast design: a 75-year-old American woman from the ADNI cohort with an average education level, no APOE-ε4 mutations, and an A + T + N + status with a questionable dementia (CDR = 0.5 noted C~), for whom we forecast neuropsychological assessments in three years’ time, based on two past visits separated by eight months with no missing data. AD Course Map yields a mean absolute error of 5.98 (95% CI = [5.44, 6.48]) on a scale of 85 for ADAS-Cog13, of 2.54 (95% CI = [2.39, 2.71]) on a scale of 30 for the MMSE, and of 1.86 (95% CI = [1.75, 1.99]) on a scale of 18 for the CDR-SB.

**Fig. 2: AD Course Map forecasts cognitive decline better than alternative methods.**

On all occasions, AD Course Map and RNN-AD yielded significantly smaller errors than two alternative methods: no-change prediction (predicting the same value as obtained at the participant’s last visit) and a linear mixed-effects model (p < 0.01 for both, see Supplementary Table 2). These two alternatives were shown to be good predictor of short-term progression, essentially because of the overall slow pace of progression of the disease^6,23. The deep learning method RNN-AD yields intermediate performance with adjusted mean absolute errors of 6.53 (95% CI = [6.02, 7.19]), 2.75 (95% CI = [2.57, 2.92]), and 1.95 (95% CI = [1.81, 2.09]) for the prediction of ADAS-Cog13, MMSE and CDR-SB respectively.

We investigated the change in MAE for ADAS-Cog 13 score for different categories of participants and forecast designs (Fig. 3). For AD Course Map, the number of previous visits considered (1, 2, or 3) did not significantly affect forecasting error. By contrast, for every additional year of time to prediction, MAE for ADAS-Cog13 score increased by 0.80 (95% CI = [0.71, 0.93]). Forecasts were not significantly affected by sex nor APOE genotype but were slightly improved for participants who are older than average and had longer education. On average, the forecasts for the European participants from PharmaCog cohort as well as the Japanese participants from the J-ADNI cohort were better than those for the American participants from the ADNI cohort, by about 1.1 and 0.6 points respectively. Forecasts were robust to missing CSF or Tau PET data, and slightly worsened when MRI or Amyloid PET were missing with differences in MAE of 0.27 (95% CI = [0.00, 0.55]) and 0.54 points (95% CI = [0.15, 1.02]) respectively. The model forecasts better at earlier stages of the AD continuum than at later clinical stages (Fig. 3a). The method was readily generalizable to the included participants with suspected non-amyloid pathology (SNAP) and possible concomitant pathological non-Alzheimer’s changes.

**Fig. 3: Changes in forecast absolute errors depending on covariates.**

Similar conclusions were drawn for predictions of MMSE and CDR-SB (see Supplementary Figs. 2 and 3). AD Course Map performed better on all but one external validation cohort. Errors were robust to changes in the available information used to make the prediction, such as the number of unblinded visits and missing data. This method did not produce biased forecasts for women. Forecasts for those two endpoints however displayed slightly worse results for participants older than average or with an education level that is below the average, and for APOE-ε4 carriers.

Disease progression models select participants displaying progression for trials

We now use disease progression models to identify the participants likely to experience significant cognitive decline during a trial (see Fig. 4). The definition of participants displaying progression depends on the endpoint used to measure the condition and the duration of the trial. We simulated six clinical trials with different primary outcomes, trial durations, and inclusion criteria. These designs were inspired by real phase III trials (see Table 3).

**Fig. 4: Illustration of the prognostic enrichment procedure in a clinical trial.**

Table 3 Description of the simulated trials

Full size table

For each trial, we selected the participants in the validation set who met the inclusion criteria at one of their visits (considered as the baseline visit for the simulated trial) and attended a follow-up visit after a period equal to the theoretical duration of the trial. We split this population into two equal halves: fast and slow progressors, according to whether the outcome considered (e.g. the annual change in endpoint relative to baseline) was above or below the population median value. We aimed to identify the participants in these two groups exclusively on the basis of their baseline data.

We used the disease progression models to forecast the values of the endpoint at the end of the trial from the baseline data for each participant. The predicted outcome was used as a prognostic score. For AD Course Map, Pearson correlations with the true outcome range from 28% to 47% depending on the trial, while for RNN-AD they range from 13% to 36% (see Supplementary Table 3). Participants with a prognostic score above a given threshold were considered to be likely to be fast progressors. We plotted receiver operating characteristics (ROC) curves for the six simulated trials (Fig. 5). The area under the ROC curve (AUC) of the six simulated trials fell within the 65–80% range for AD Course Map and within the 55–80% range for RNN-AD (see Fig. 5).

**Fig. 5: AD Course Map and RNN-AD select participants at risk of experiencing a worsening of the outcome during the trial.**

We compared this prognostic enrichment strategy with two alternative methods: selecting participants at random (bisector of the ROC curve) as currently done in most trials, or selecting participants based on their APOE genotype (gray crosses in Fig. 5). All selection methods were significantly better than random selection, meaning that disease progression models succeed in identifying the progressors compared to the current practice that does make any difference among the participants meeting the inclusion criteria. In all but one case, selections with AD Course Map were significantly better than selection on the basis of APOE genotype. RNN-AD also compares favorably against the two alternatives. Nevertheless, it has significantly worse performance than AD Course Map in two out of six tested scenarios, with a drop of 9% and 14% in the ROC AUC. AD Course Map shows therefore more robust results than RNN-AD when the trial design is varied.

We analyzed whether our assessment of the risk of progression led to an over- or under-selection of certain types of participants relative to the true progressors (see Supplementary Fig. 4). Depending on the design, the group that was selected using AD Course Map displayed slight enrichment in men or women, and tended to be biased towards older participants. The selected participants were often, but not always, enriched in carriers of the APOE-ε4 variant. The presented disease progression models do not use sociodemographic or genetic factors as proxies for the selection of participants displaying progression. They limit therefore the biases of sex, age, or APOE-ε4 carriership, which are the basis of current practices to increase the likelihood that a participant progresses during a trial.

Disease progression models can be used to design more powered clinical trials

The automatic selection of participants displaying progression makes it possible to implement prognostic enrichment strategies in trials (see Fig. 4). For each trial design, we simulated a hypothetical treatment decreasing the outcome value. We calculated the sample size required to show the effect of this treatment for a range of treatment effects (see Methods). We compared the results when all eligible participants were included to those obtained when only participants predicted to be fast progressors at baseline were included.

We plotted sample size against the treatment effect for all six simulated trials (Fig. 6). The selection of participants at risk of progression with AD Course Map allowed a significant reduction in sample size relative to current inclusion criteria alone, across all scenarios tested. For a treatment effect of 25%, the sample size was reduced by 50.2% (±7.1) for participants at risk of the onset of AD, by 40.9% (±4.9) for a trial targeting individuals with preclinical AD and high brain amyloid levels, by between 38.1% (±1.6) and 45.4% (±2.0), depending on the outcome considered, for subjects with early AD and high levels of brain amyloid, by 44.6% (±3.9) for subjects with early AD and high brain levels of tau, and by 43.1% (±0.8) for participants with mild cognitive impairment probably due to AD or mild AD.

**Fig. 6: Enrichment based on AD Course Map significantly decreases the sample size for a hypothetical treatment effect ranging from 20% to 30%.**

For all preclinical and early AD trials, enrichments based on AD Course Map significantly outperformed the selection of APOE-ε4 variant carriers only. For mild cognitive impairment due to AD or a mild AD trial, the performance of enrichment based on AD Course Map was not significantly different from targeting APOE-ε4 carriers. AD Course Map achieved a similar decrease in sample size, but without the need to target a specific genetic profile. In this case, we also found that 49.2% (95% CI = [48.6, 49.9]) of the participants would be selected by AD Course Map, versus 39.1% (95% CI = [38.7, 39.4]) for heterozygous APOE-ε4 carriers, facilitating recruitment with AD Course Map (see Supplementary Table 4).

RNN-AD also allowed a significant reduction of the sample size compared to current practice, from 21% to 42% depending on the tested scenario. Nevertheless, the reduction was never better than with AD Course Map with an increase of 10% and 35% participants to be selected for the two scenarios where RNN-AD yielded a lower AUC (see Supplementary Table 5).

Discussion

We used disease progression models to forecast cognitive decline across all stages of the AD continuum. Using five independent cohorts containing more than 4,600 participants, we show here that AD Course Map provides a fair, robust, and generalizable predictive method. It is fair, in that its predictions are not biased with respect to sex, and are only marginally affected by level of education and the age of the participant. The method is robust to missing CSF or Tau PET biomarkers, but in general better results are achieved when MRI and Amyloid PET data are present. The model was trained on data acquired in North America, but it is readily generalizable to participants from Europe, Asia, and Oceania, with no loss of performance. It performed better at the earliest preclinical stages of the AD continuum than at later disease stages, and is therefore relevant for early-stage interventions.

Disease progression models automatically identify the participants already at risk of experiencing cognitive decline at baseline in a trial. They can therefore be used to enrich the trial population in participants likely to experience a worsening of a given endpoint during the trial. By targeting more homogeneous groups of participants displaying progression, AD Course Map makes it possible to decrease sample size significantly, by 38% up to 50%, at the expense of discarding about half of the screened participants. It shows better and more robust performance than the deep learning method RNN-AD. Disease progression models adapt seamlessly to various clinical trial designs targeting different disease stages with different outcomes and trial durations. They do so without the need to re-train the model for each new trial. In comparison, a recent method based on another prognosis score reported sample size reductions of 20% to 28%¹⁷.

The main limitation of the method is the data used to monitor disease progression. Cognitive assessment displays about 10% inter-rater variability^39,40,41. MRI biomarkers also display a similar degree of variability between two scans acquired on the same day for the same participant, and their reliability is further decreased by possible variations in the processing pipelines⁴². Mapping CSF biomarkers from different immunoassays also limit their reliability⁴³. These factors limit the accuracy of the method for forecasting disease progression. Increasing the reliability of these measurements would improve the performance of the approach described here. In the future, disease progression models such as AD Course Map may also benefit from the inclusion of promising new biomarkers, such as plasma biomarkers, neurofilament light chain⁴⁴, or digital biomarkers⁴⁵.

Given these limitations, it is notable that such large sample size reductions can be achieved with data already available in routine clinical practice. These findings demonstrate the benefits of companion software tools for patient recruitment in trials and for supporting clinicians in the future, enabling them to prescribe the right treatment to the right patient at the right time.

Methods

Participants

We used the data from five longitudinal multicenter cohorts: the ADNI^{25,26,27,28,29,30,31} (N = 1652), the Australian imaging, biomarker, and lifestyle flagship study of aging (AIBL)^32,33 (N = 460), the JJ-ADNI^34,35 (N = 470), the PharmaCog cohort^36,37 (N = 111) and the MEMENTO cohort³⁸ (N = 1994).

The study protocols were approved by the ethical committees of the university of southern California (ADNI), Austin Health, St Vincent’s Health, Hollywook Private Hospital and Edith Cowan University (AIBL), IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli (PharmaCog), Comité de protection des personnes sud-ouest et outre-mer III (MEMENTO), the National Bioscience Database Center Human Database (J-ADNI). Informed consent forms were obtained from research participants. The research has been performed in accordance with the Declaration of Helsinki and relevant guidelines and regulations. Participants were not compensated for the current study.

The five cohorts are longitudinal observational studies with an average observation period ranging from 2.0 years for PHARMACOG to 4.8 years for ADNI, with an average number of visits ranging from 3.7 in AIBL to 6.9 in MEMENTO. We considered all participants with at least one year of follow-up. The sociodemographic, genetic, biological and clinical characteristics of the selected participants are reported in Tables 1 and 2, as well as the proportion of available data in each cohort.

Neuropsychological assessments

In our experiments, we considered the following neuropsychological assessments:

The mini-mental state examination³⁹ (MMSE),
The Alzheimer’s disease assessment scale–cognitive sub-scale with 13 items^40,46 (ADAS-Cog13),
The clinical dementia rating scale^41,47 – sum of the boxes score (CDR-SB).

Structural magnetic resonance imaging/anatomical imaging biomarkers

We extracted cortical and subcortical volumes from three-dimensional T1-weighted magnetization-prepared rapid gradient-echo imaging (MPRAGE) sequences.

For the ADNI study, scans were acquired in the standardized protocol for morphometric analyses (http://adni.loni.usc.edu/methods/documents/mri-protocols/). The ADNI MRI core processed raw scans, using Gradwarp for the correction of geometric distortion due to gradient nonlinearity⁴⁸, B1-correction for the adjustment of image intensity inhomogeneity²⁶, N3 bias field correction for reducing residual intensity inhomogeneity^49,50, and geometric scaling for adjusting scanner- and session-specific calibration errors^26,51. The same MRI protocol was also used in AIBL³², J-ADNI⁵², PharmaCog³⁶, and MEMENTO^38,53.

For all studies, cortical reconstruction and volumetric segmentation were performed with the Freesurfer image analysis suite (http://surfer.nmr.mgh.harvard.edu/). Version 5.3 was used for J-ADNI, MEMENTO, and PharmaCog, and version 6.0 for ADNI and AIBL, operated within Clinica for reproducibility purposes⁵⁴. The cohort effect in the following analyses accounts for possible differences due to different versions of the software.

We calculated the mean volume of the left and right hippocampus, and the total volume of the lateral ventricles (including inferior lateral volume). Hippocampus segmentation with Freesurfer was previously reported to have good reproducibility^55,56. Both volumes were normalized by estimated total intracranial volume (ICV).

Cerebrospinal fluid biomarkers

We used the concentrations in cerebrospinal fluid (CSF) of β-Amyloid 1–42 peptide (Aβ_1–42), Tau protein, phosphorylated at the threonine 181 residue (p-Tau₁₈₁), and total tau protein (t–Tau).

ADNI used the automated Elecsys immunoassay (Roche); AIBL, PharmaCog, and MEMENTO used INNOTEST single-analyte ELISA tests (Innogenetics/Fujirebio NV), and J-ADNI used the multiplex xMAP Luminex platform with the INNO-BIA AlzBio3 immunoassay kit (Innogenetics/Fujirebio NV).

We harmonized the measurements to account for the differences in immunoassays and participants' characteristics across cohorts. Within each cohort, we regressed each biomarker against age, APOE genotype, and CDR global score with a linear mixed model with random intercept. We then linearly transformed the measurements so that the intercept is 0 and the total variance is 1 for all cohorts. Harmonization equations used are listed in Supplementary Table 6 for reproducibility purposes.

Positron emission tomography/functional imaging biomarkers

For ADNI participants, we used regional standardized uptake value ratios (SUVR) extracted from Amyloid PET scans ([¹⁸F]-Florbetapir and [¹⁸F]-Florbetaben radiotracers), and, starting from ADNI 3, Tau PET scans ([¹⁸F]-AV-1451 radiotracer). Each PET scan was registered together with the MRI for the subject performed as close as possible to the PET scan in terms of time.

For Amyloid PET, we used a cortical-summary region consisting of the frontal, anterior/posterior cingulate, lateral parietal, and lateral temporal regions; data were normalized with a composite reference region consisting of the whole cerebellum, brainstem/pons, and eroded subcortical white matter^57,58,59. These PET SUVR values were converted to the centiloid scale (CL)⁶⁰ using equations from the literature⁶¹ listed in Supplementary Table 6. In the AIBL cohort, the processed Amyloid PET SUVR data that correspond to the published centiloid conversion equations were not publicly available. In the MEMENTO cohort, Amyloid PET SUVR data are not directly comparable with ADNI data and equations for centiloid conversion were not available. Therefore, we used Amyloid PET data on these cohorts only to define the Amyloid status of the participants, using pathological thresholds provided by these studies.

For Tau PET, we used a volume-weighted average SUVR value for all anatomical Braak regions of interest (I-VI)⁶², normalized against the inferior cerebellum gray matter⁶³.

A/T/N/C classification

We classified participants with the A(myloid)/T(au)/N(eurodegeneration) classification^64,65, together with a C(ogntion)/C(linical) group based on the Clinical Dementia Rating (CDR) global score (see the Supplementary Table 7 for all thresholds used). Participant category at a given visit was based on the patient’s all-time worst biomarker levels to date. Incomplete A/T/N/C profiles are denoted with a star after any of the biomarkers that could not be determined.

Disease progression models

We trained and tested two disease progression models: AD Course Map and RNN-AD. AD Course Map is built on the principles of a parametric Bayesian non-linear mixed-effects model^21,22. RNN-AD is built on the principles of recurrent artificial neural networks^24,66. The implementation of both models relies on the open-source software that was made publicly available by their respective authors.

Both models use the same set of endpoints as input: MMSE, CDR-SB, ADAS-Cog13, volume of the left and right hippocampus and lateral ventricles, CSF Aβ_1–42 and p-tau₁₈₁ levels, together with cortical-summary SUVR on Amyloid PET and Tau PET scans. They consider these endpoints at one or several visits of a participant, allowing for possible missing data, and predict the value of all these endpoints at any time-point in the future. AD Course Map also takes into account the age of the participant at each visit, while RNN-AD takes into account only the duration between two consecutive visits, irrespective of the age of the participant. In addition, RNN-AD needs the diagnosis of the participant at the corresponding visit, the diagnosis being cognitively normal, mild cognitive impairment, or demented, as defined in the ADNI protocol.

AD Course Map assumes that these endpoints follow a logistic progression curve during disease progression with distinct progression rate and age at the inflexion point^21,22. It learns how this set of logistic curves needs to be adjusted to fit individual data, by changing the dynamic of progression and disease presentation (i.e., the ordering and timing of progression among the endpoints. The shape and position of the reference set of logistic curves are the fixed effects, and the parameters changing these curves to fit individual data are the random effects. The model parameters (fixed effects together with the mean and variance of the random effects) are estimated using a training data set containing the repeated measurements of a multitude of participants. After the training phase, the model is fit to the measurements of one test participant (outside the training test) at one or several visit, using the learnt distribution of the random effects as a regularizer. As a result, the model predicts a subject-specific set of logistic curves, which shows the value of each endpoint at any age of the participant.

By contrast, RNN-AD does not make any assumption on the life-long pattern of progression of the endpoints. It learns instead how the values of the endpoints will change in the next month given the values of the endpoint at a given time-point. The 1-month transition is assumed to be a non-linear function of the current value of the endpoints and the current diagnosis (e.g. artificial neurons). The parameters of this transition function are estimated using a training data set containing the repeated measurements of a multitude of participants. After the training phase, the measurements of one test participant (outside the training set) at one or several visits are used as input of the model. The model then computes the values of all the endpoints at each month in the future.

AD Course Map can be trained and tested with missing data: the likelihood is optimized using the available data only. Model training is robust to missing data⁶, so we did not perform data imputation. By contrast, RNN-AD needs complete data at the baseline visit. We imputed missing data with the mean value of the endpoint in the training set, following authors’ recommendations;²⁴ missing data at subsequent visits are imputed recurrently using model predictions.

Both models also need an internal step of data normalization. For AD Course Map, cognitive assessments were normalized to a 0 to +1 scale according to the theoretical minimum and maximum values of each assessment, 0 representing the theoretical best value (unaffected participants) and +1 the worst possible value. Harmonized amyloid PET data are clipped between 0 and 100 and converted to a (0,1) scale. MRI, tau PET, and Harmonized CSF data were clipped at the first and last centile, and then linearly mapped to a (0,1) scale. For RNN-AD, normalization consists in a z-score transformation estimated from training data.

Regardless of the normalization procedure, the outputs of the models are always converted back to the native scale (and unit) of the measurement before being analyzed (see Fig. 1). Predicted values are therefore comparable with the true, non-normalized data. Forecast errors can be compared across methods that do not use the same normalization procedure.

Validation procedure

We split the data sets in two (see Supplementary Fig. 1). We first considered the ADNI participants who were amyloid-positive according to CSF or PET data on at least one visit (shown in red in Supplementary Fig. 1). We then kept the other ADNI participants and all participants from the four other cohorts as an external validation set (shown in blue in the Supplementary Fig. 1).

We then split the amyloid-positive ADNI participants into five random folds and trained AD Course Map and RNN-AD using all available data of the participants in four out of the five folds, e.g., the training set. We repeated this procedure with another split, so that we ended up with 10 instances of each model. Each participant has been counted twice as a test subject in the left-out fold. Therefore, it can be used twice for evaluating prediction tasks with two different instances of each model. By contrast, each participant in the external validation set can be tested with 10 different instances of each model. In the following, we averaged the prediction made by the 2 instances of the model for the participants in the test sets, and by the 10 instances for the participants in the external validation set.

The test subjects did not contribute to any model selection or hyperparameter tuning neither for AD Course Map nor for RNN-AD. Therefore, we pooled the forecasts of test subjects with the ones in the external validation set.

Forecasting endpoints

We aimed to assess the accuracy of each model to forecast the values of the endpoints of a participant in the test set or the external validation set. The general principle is to blind the latest data of the participant, use the unblinded data as input of the model, and compare the predicted value with the blinded data.

We used a combinatorial procedure to generate prediction tasks, as described in Supplementary Fig. 1. Because we have multiple follow-up visits, we assessed several forecast errors for a single participant: we blinded the data of the participant except at one to three consecutive visits, we predict the individual trajectory using the unblinded data, and forecast the data at the blinded visits after the latest unblinded visit. We required that the participants are between 50 and 90 years old and have a CDR global of at most 2 at the latest unblinded visit to exclude severely demented participants, and that the blinded visits used to assess the forecast fall between 1.4 and 6.6 years after the latest unblinded visit. We computed the forecast error as the absolute difference between this value and the value of the endpoint at the follow-up visit concerned.

Analysis of forecast errors

We analyzed the distribution of mean absolute errors with a mixed-effects model. We corrected the errors for several possible cofounding factors and accounted for the fact that multiple forecasts originated from the same participant. In practice, for a given endpoint and a given model, we performed the following procedure 100 times:

We randomly picked a subset of disjointed prediction tasks, namely predictions not sharing any common visit (neither the blinded visit to forecast, nor the unblinded visits used to forecast);
We fit a multivariate linear mixed-effects model with a random intercept for each individual, using the following categorical explanatory variables: A/T/N/C stage at prediction, cohort, number of APOE-ε4 alleles, sex, level of education, number of unblinded visits, and continuous explanatory variables: actual patient’s age at prediction centered on 75 years and normalized by 7.5 years, years to prediction centered on three years and normalized by one year, mean time between unblinded visits centered on eight months and normalized by three months, percentage of missing data for the unblinded visits per modality.

Education level was classified as low if the subject had followed no more than nine years of formal education and high if the subject had followed at least 16 years of education, in accordance with the guidelines of the international standard classification of education of the United Nations.

We derived the mean and empirical confidence interval for the model intercept (the mean absolute error adjusted for cofounding factors) and regression coefficients (association between the mean absolute errors and each cofounding factor).

Comparison with alternative methods

We also compared AD Course Map with two additional alternative methods. The first, the no-change prediction or last-observation-carried-forward method, forecasts the future value of an endpoint to be the same as it was at the last unblinded visit. The second method, the linear mixed model method, involved generating a linear mixed-effects model for each endpoint, regressing endpoint values against the age of the participant at the successive visits, with a random intercept and a random slope per subject. The model was fitted to an unseen participant with a maximum a posteriori estimator⁶⁷. We used the same validation procedure for all models: AD Course Map, RNN-AD, no-change prediction, and the linear mixed model.

Clinical trial simulation, enrichment evaluation, and sample size calculation

We simulated clinical trials in subjects at risk of developing AD or at an early stage of AD, as described in Table 3. For each trial, we selected all pairs of visits from all participants in the five data sets satisfying the following criteria:

The primary endpoint of the trial was assessed at both visits,
The patient fulfilled the inclusion criteria and had none of the exclusion criteria of the trial at the baseline visit,
Visits were separated by the duration of the trial, with a certain tolerance, depending on the trial.

For each pair of visits, the first was considered to be the baseline visit at inclusion and the second was considered to be the visit at the end of the trial. We did not take into account possible intermediate visits. Supplementary Table 8 summarizes the characteristics of participants included in all the simulated trials.

We first evaluated our prognostic enrichment strategy from a diagnostic test standpoint. For each trial, we forecast the value of the primary endpoint at the follow-up visit from the baseline data only, using the procedure described above. We calculated the median value of the outcome (i.e. the annual rate of change between baseline and follow-up visit). Participants above this threshold were considered to be fast progressors and formed the target population to be identified. A threshold for predicted outcomes was used to split the population into two groups: one considered at high risk of progression and the other at low risk of progression. We let the low-risk vs. high-risk threshold vary and calculated the resulting receiver operator characteristic (ROC) curve. On this curve, we identified the point splitting the population into a low-risk and a high-risk group of equal sizes, which was used as the operating point. We determined confidence intervals by performing our analyses 100 times on half the samples selected at random. Within any given run, any visit of a patient was used no more than once. The regions of confidence around ROC curves were constructed graphically as envelopes of both sensitivity and specificity confidence intervals along thresholds.

We evaluated possible biases in the group at high risk of progression. We used a logistic regression predicting selection status from population covariates (age, sex, education, number of APOE-ε4 alleles), cohort, and missing baseline modalities, together with the true indicator of fast progression. This last binary predictor was included to check for biases emerging in addition to the biases naturally present in the target population.

We then evaluated our prognostic enrichment strategy by calculating statistical power. We used a hypothetical individual treatment model: if the outcome actually worsened between baseline and follow-up for the participant, we changed the annual rate of change by the treatment effect, e.g. a 20% improvement of the annual rate of change. We did not apply a treatment effect if the participant improved between baseline and follow-up. For treatment effects ranging from 20% to 30%, we computed effect size (Cohen’s d) and sample size from a two-independent sample asymptotic t-test, with a 5% bilateral level of significance and 80% statistical power. We compared this sample size for the population selected with the trial inclusion criteria alone, and for the subpopulation identified as at high risk of progression. We reported the total sample size for two arms. We did not account for the drop-out rate in the calculation, as the goal was to compare statistical power with and without enrichment.

In these two experiments, we compared the results obtained with those for a method selecting APOE-ε4 carriers (heterozygous or homozygous) as participants at high risk of progression. We were unable to use this method for the trial targeting participants at risk of the onset of AD since this trial included only APOE-ε4 carriers.

Statistics and reproducibility

No statistical method was used to predetermine the sample size. We considered all available data from all the cohorts and excluded only the data of the participants with less than one year of follow-up. The experiments were not randomized since only observational data were used. The investigators were not blinded to allocation during experiments and outcome assessment since only observational data were used. Simulations of clinical trials included a random unblinded allocation into treated and control arms with assessment of biases in sex, center, level of education, and APOE genotype.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The ADNI and AIBL data used in this study are available in the database of the laboratory of neuroimaging at the University of Southern California under accession code at http://adni.loni.usc.edu. The J-ADNI data used in this study are available in the NBDC Human Database under accession code at http://humandbs.biosciencedbc.jp/en/. The PharmaCog data used in this study are available in the NeuGRID2 platform under access code at https://www.neugrid2.eu/ (https://doi.org/10.17616/R31NJN1E). The MEMENTO data used in this study are available in Dementia Platform UK under accession code at https://portal.dementiasplatform.uk/CohortDirectory/Item?fingerPrintID=MEMENTO. Raw data and patient-level data that were generated in this study are protected and are not available due to data privacy laws and data use agreements. These data can be re-generated using the open-source software Leaspy (see below) by anyone with authorized access to the above third-party data. The data used to compute the statistics in this study are available in a dedicated Zenodo repository⁶⁸. Source data are provided with this paper.

Code availability

The statistical analysis of the forecast errors and the simulation of clinical trials were performed in Python. We used the Leaspy open-source software (https://gitlab.com/icm-institute/aramislab/leaspy) for training and testing AD Course Map, and the corresponding open-source software for RNN-AD https://github.com/ThomasYeoLab/CBIG/tree/master/stable_projects/predict_phenotypes/Nguyen2020_RNNAD. Linear mixed models were trained using the open-source statsmodels package⁶⁹. The frozen versions of the Python libraries that were used to generate the results of this article can be found in a dedicated Zenodo repository⁶⁸.

References

Cummings, J., Reiber, C. & Kumar, P. The price of progress: funding and financing Alzheimer’s disease drug development. Alzheimers Dement. Transl. Res. Clin. Interv. 4, 330–343 (2018).
Article Google Scholar
Schneider, L. A resurrection of aducanumab for Alzheimer’s disease. Lancet Neurol. 19, 111–112 (2020).
Article PubMed Google Scholar
Tolar, M., Abushakra, S., Hey, J. A., Porsteinsson, A. & Sabbagh, M. Aducanumab, gantenerumab, BAN2401, and ALZ-801-the first wave of amyloid-targeting drugs for Alzheimer’s disease with potential for near term approval. Alzheimers Res. Ther. 12, 95 (2020).
Article CAS PubMed PubMed Central Google Scholar
Cummings, J., Feldman, H. H. & Scheltens, P. The “rights” of precision drug development for Alzheimer’s disease. Alzheimers Res. Ther. 11, 76 (2019).
Article PubMed PubMed Central Google Scholar
Jutten, R. J. et al. Finding treatment effects in Alzheimer trials in the face of disease progression heterogeneity. Neurology 96, e2673–e2684 (2021).
Article CAS PubMed PubMed Central Google Scholar
Koval, I. et al. AD course map charts Alzheimer’s disease progression. Sci. Rep. 11, 8020 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Iddi, S. et al. Predicting the course of Alzheimer’s progression. Brain Inform. 6, 6 (2019).
Article PubMed PubMed Central Google Scholar
Bilgel, M. & Jedynak, B. M. Predicting time to dementia using a quantitative template of disease progression. Alzheimers Dement. Diagn. Assess. Dis. Monit. 11, 205–215 (2019).
Google Scholar
Young, A. L. et al. Uncovering the heterogeneity and temporal complexity of neurodegenerative diseases with Subtype and Stage Inference. Nat. Commun. 9, 4273 (2018).
Article ADS PubMed PubMed Central Google Scholar
Archetti, D. et al. Multi-study validation of data-driven disease progression models to characterize evolution of biomarkers in Alzheimer’s disease. NeuroImage Clin. 24, 101954 (2019).
Article PubMed PubMed Central Google Scholar
Khanna, S. et al. Using Multi-Scale Genetic, Neuroimaging and Clinical Data for Predicting Alzheimer’s Disease and Reconstruction of Relevant Biological Mechanisms. Sci. Rep. 8, 11173 (2018).
Article ADS PubMed PubMed Central Google Scholar
Lorenzi, M., Filippone, M., Frisoni, G. B., Alexander, D. C. & Ourselin, S. Probabilistic disease progression modeling to characterize diagnostic uncertainty: application to staging and prediction in Alzheimer’s disease. NeuroImage 190, 56–68 (2019).
Article PubMed Google Scholar
van Maurik, I. S. et al. Biomarker-based prognosis for people with mild cognitive impairment (ABIDE): a modelling study. Lancet Neurol. 18, 1034–1044 (2019).
Article PubMed Google Scholar
Birkenbihl, C., Salimi, Y., Fröhlich, H., Japanese Alzheimer’s Disease Neuroimaging Initiative, & Alzheimer’s Disease Neuroimaging Initiative. Unraveling the heterogeneity in Alzheimer’s disease progression across multiple cohorts and the implications for data-driven disease modeling. Alzheimers Dement. J. Alzheimers Assoc. https://doi.org/10.1002/alz.12387 (2021).
Oxtoby, N. P. et al. Targeted screening for Alzheimer’s disease clinical trials using data-driven disease progression models. medRxiv https://doi.org/10.1101/2021.01.29.21250773 (2021).
Insel, P. S. et al. Biomarkers and cognitive endpoints to optimize trials in Alzheimer’s disease. Ann. Clin. Transl. Neurol. 2, 534–547 (2015).
Article CAS PubMed PubMed Central Google Scholar
Kühnel, L. et al. Personalized prediction of progression in pre-dementia patients based on individual biomarker profile: a development and validation study. Alzheimers Dement. J. Alzheimers Assoc. https://doi.org/10.1002/alz.12363 (2021).
Center for Drug Evaluation and Research & Center for Biologics Evaluation and Research. Enrichment Strategies for Clinical Trials to Support Approval of Human Drugs and Biological Products. U.S. Food and Drug Administration https://www.fda.gov/regulatory-information/search-fda-guidance-documents/enrichment-strategies-clinical-trials-support-approval-human-drugs-and-biological-products (2019).
Burns, D. K. et al. The TOMMORROW study: design of an Alzheimer’s disease delay‐of‐onset clinical trial. Alzheimers Dement. Transl. Res. Clin. Interv. 5, 661–670 (2019).
Article Google Scholar
Burns, D. K. et al. Safety and efficacy of pioglitazone for the delay of cognitive impairment in people at risk of Alzheimer’s disease (TOMMORROW): a prognostic biomarker study and a phase 3, randomised, double-blind, placebo-controlled trial. Lancet Neurol. 20, 537–547 (2021).
Article CAS PubMed Google Scholar
Schiratti, J.-B., Allassonniere, S., Colliot, O. & Durrleman, S. Learning spatiotemporal trajectories from manifold-valued longitudinal data. in Neural Information Processing Systems (2015).
Schiratti, J.-B., Allassonnière, S., Colliot, O. & Durrleman, S. A bayesian mixed-effects model to learn trajectories of changes from repeated manifold-valued observations. J. Mach. Learn. Res. 18, 1–33 (2017).
MathSciNet MATH Google Scholar
Marinescu, R. V. et al. TADPOLE challenge: accurate Alzheimer’s disease prediction through crowdsourced forecasting of future data. in Predictive Intelligence in Medicine (eds. Rekik, I., Adeli, E. & Park, S. H.) vol. 11843, 1–10 (Springer International Publishing, 2019).
Nguyen, M. et al. Predicting Alzheimer’s disease progression using deep recurrent neural networks. NeuroImage 222, 117203 (2020).
Article PubMed Google Scholar
Mueller, S. G. et al. Ways toward an early diagnosis in Alzheimer’s disease: the Alzheimer’s Disease Neuroimaging Initiative (ADNI). Alzheimers Dement. J. Alzheimers Assoc. 1, 55–66 (2005).
Article Google Scholar
Jack, C. R. et al. The Alzheimer’s Disease Neuroimaging Initiative (ADNI): MRI methods. J. Magn. Reson. Imaging JMRI 27, 685–691 (2008).
Article PubMed Google Scholar
Petersen, R. C. et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): clinical characterization. Neurology 74, 201–209 (2010).
Article PubMed PubMed Central Google Scholar
Jack, C. R. et al. Update on the MRI core of the Alzheimer’s disease neuroimaging initiative. Alzheimers Dement. J. Alzheimers Assoc. 6, 212–220 (2010).
Article Google Scholar
Beckett, L. A. et al. The Alzheimer’s disease neuroimaging initiative 2: increasing the length, breadth, and depth of our understanding. Alzheimers Dement. J. Alzheimers Assoc. 11, 823–831 (2015).
Article Google Scholar
Jack, C. R. et al. Magnetic resonance imaging in Alzheimer’s Disease neuroimaging initiative 2. Alzheimers Dement. 11, 740–756 (2015).
Article PubMed PubMed Central Google Scholar
Weiner, M. W. et al. The Alzheimer’s Disease Neuroimaging Initiative 3: continued innovation for clinical trial improvement. Alzheimers Dement. 13, 561–571 (2017).
Article PubMed Google Scholar
Ellis, K. A. et al. The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer’s disease. Int. Psychogeriatr. 21, 672–687 (2009).
Article PubMed Google Scholar
Albrecht, M. A. et al. Longitudinal cognitive decline in the AIBL cohort: the role of APOE ε4 status. Neuropsychologia 75, 411–419 (2015).
Article PubMed Google Scholar
Iwatsubo, T. Japanese Alzheimer’s disease neuroimaging initiative: present status and future. Alzheimers Dement. 6, 297–299 (2010).
Article PubMed Google Scholar
Iwatsubo, T. et al. Japanese and North American Alzheimer’s Disease Neuroimaging Initiative studies: harmonization for international trials. Alzheimers Dement. 14, 1077–1087 (2018).
Article PubMed Google Scholar
Galluzzi, S. et al. Clinical and biomarker profiling of prodromal Alzheimer’s disease in workpackage 5 of the Innovative Medicines Initiative PharmaCog project: a ‘European ADNI study’. J. Intern. Med. 279, 576–591 (2016).
Article CAS PubMed Google Scholar
Albani, D. et al. Plasma Aβ42 as a biomarker of prodromal alzheimer’s disease progression in patients with amnestic mild cognitive impairment: evidence from the PharmaCog/E-ADNI Study. J. Alzheimers Dis. JAD 69, 37–48 (2019).
Article CAS PubMed Google Scholar
Dufouil, C. et al. Cognitive and imaging markers in non-demented subjects attending a memory clinic: study design and baseline findings of the MEMENTO cohort. Alzheimers Res. Ther. 9, 67 (2017).
Folstein, M. F., Folstein, S. E. & McHugh, P. R. “Mini-mental state”: a practical method for grading the cognitive state of patients for the clinician. J. Psychiatr. Res. 12, 189–198 (1975).
Article CAS PubMed Google Scholar
Rosen, W. G., Mohs, R. C. & Davis, K. L. A new rating scale for Alzheimer’s disease. Am. J. Psychiatry 141, 1356–1364 (1984).
Article CAS PubMed Google Scholar
O’Bryant, S. E. et al. Staging dementia using clinical dementia rating scale sum of boxes scores. Arch. Neurol. 65, 1091–1095 (2008).
Article PubMed PubMed Central Google Scholar
Hedges, E. P. et al. Reliability of structural MRI measurements: the effects of scan session, head tilt, inter-scan interval, acquisition sequence, FreeSurfer version and processing stream. NeuroImage https://doi.org/10.1016/j.neuroimage.2021.118751 (2021).
Blennow, K. et al. Clinical utility of cerebrospinal fluid biomarkers in the diagnosis of early Alzheimer’s disease. Alzheimers Dement. 11, 58–69 (2015).
Article PubMed Google Scholar
Cullen, N. C. et al. Individualized prognosis of cognitive decline and dementia in mild cognitive impairment based on plasma biomarker combinations. Nat. Aging 1, 114–123 (2021).
Article Google Scholar
Chen, R. et al. Developing Measures of Cognitive Impairment in the Real World from Consumer-Grade Multimodal Sensor Streams. in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining 2145–2155 (Association for Computing Machinery, 2019). https://doi.org/10.1145/3292500.3330690.
Mohs, R. C. et al. Development of cognitive instruments for use in clinical trials of antidementia drugs: additions to the Alzheimer’s Disease Assessment Scale that broaden its scope. The Alzheimer’s Disease Cooperative Study. Alzheimer Dis. Assoc. Disord. 11, S13–S21 (1997).
Article PubMed Google Scholar
Hughes, C. P., Berg, L., Danziger, W. L., Coben, L. A. & Martin, R. L. A new clinical scale for the staging of dementia. Br. J. Psychiatry 140, 566–572 (1982).
Article CAS PubMed Google Scholar
Jovicich, J. et al. Reliability in multi-site structural MRI studies: effects of gradient non-linearity correction on phantom and human data. NeuroImage 30, 436–443 (2006).
Article PubMed Google Scholar
Sled, J. G., Zijdenbos, A. P. & Evans, A. C. A nonparametric method for automatic correction of intensity nonuniformity in MRI data. IEEE Trans. Med. Imaging 17, 87–97 (1998).
Article CAS PubMed Google Scholar
Boyes, R. G. et al. Intensity non-uniformity correction using N3 on 3-T scanners with multichannel phased array coils. NeuroImage 39, 1752–1762 (2008).
Article PubMed Google Scholar
Chow, N. et al. Comparing 3T and 1.5T MRI for mapping hippocampal atrophy in the Alzheimer’s disease neuroimaging initiative. Am. J. Neuroradiol. 36, 653–660 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fujishima, M. et al. Sample size estimation for Alzheimer’s Disease Trials from Japanese ADNI serial magnetic resonance imaging. J. Alzheimers Dis. JAD 56, 75–88 (2017).
Article PubMed Google Scholar
Operto, G. et al. CATI: a large distributed infrastructure for the neuroimaging of cohorts. Neuroinformatics 14, 253–264 (2016).
Article PubMed Google Scholar
Routier, A. et al. Clinica: an open-source software platform for reproducible clinical neuroscience studies. Front. Neuroinformatics 15, 39 (2021).
Article Google Scholar
Mulder, E. R. et al. Hippocampal volume change measurement: quantitative assessment of the reproducibility of expert manual outlining and the automated methods FreeSurfer and FIRST. NeuroImage 92, 169–181 (2014).
Article PubMed Google Scholar
Marizzoni, M. et al. Longitudinal reproducibility of automatically segmented hippocampal subfields: a multisite European 3T study on healthy elderly. Hum. Brain Mapp. 36, 3516–3527 (2015).
Article PubMed PubMed Central Google Scholar
Landau, S. M. et al. Measurement of longitudinal β-amyloid change with 18F-florbetapir PET and standardized uptake value ratios. J. Nucl. Med. Publ. Soc. Nucl. Med. 56, 567–574 (2015).
Article CAS Google Scholar
Landau, S., Murphy, A., Qie Lee, J., Ward, T. & Jagust, W. Florbetapir (AV45) processing methods. (2021). https://adni.bitbucket.io/reference/docs/UCBERKELEYAV45/UCBERKELEY_AV45_Methods_11.15.2021.pdf.
Landau, S., Murphy, A., Qie Lee, J., Ward, T. & Jagust, W. Florbetaben (FBB) processing methods. (2021). https://adni.bitbucket.io/reference/docs/UCBERKELEYFBB/UCBerkeley_FBB_Methods_11.15.2021.pdf.
Klunk, W. E. et al. The Centiloid Project: standardizing quantitative amyloid plaque estimation by PET. Alzheimers Dement. J. Alzheimers Assoc. 11, 1-15.e1–4 (2015).
Google Scholar
Royse, S. K. et al. Validation of amyloid PET positivity thresholds in centiloids: a multisite PET study approach. Alzheimers Res. Ther. 13, 99 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Braak, H. & Braak, E. Neuropathological stageing of Alzheimer-related changes. Acta Neuropathol. 82, 239–259 (1991).
Article CAS PubMed Google Scholar
Landau, S., Ward, T. J., Murphy, A. & Jagust, W. Flortaucipir (AV-1451) processing methods. (2021). https://adni.bitbucket.io/reference/docs/UCBERKELEYAV1451/UCBERKELEY_AV1451_Methods_2021-01-14.pdf.
Jack, C. R. et al. A/T/N: An unbiased descriptive classification scheme for Alzheimer disease biomarkers. Neurology 87, 539–547 (2016).
Article CAS PubMed PubMed Central Google Scholar
Jack, C. R. et al. NIA-AA research framework: toward a biological definition of Alzheimer’s disease. Alzheimers Dement. 14, 535–562 (2018).
Article PubMed PubMed Central Google Scholar
Nguyen, M., Sun, N., Alexander, D. C., Feng, J. & Yeo, B. T. T. Modeling Alzheimer’s disease progression using deep recurrent neural networks. in 2018 International Workshop on Pattern Recognition in Neuroimaging (PRNI) 1–4 https://doi.org/10.1109/PRNI.2018.8423955 (2018).
Inference for the Random Effects. in Linear Mixed Models for Longitudinal Data (eds. Verbeke, G. & Molenberghs, G.) 77–92 (Springer, 2000). https://doi.org/10.1007/978-0-387-22775-7_7.
Maheux, E. Forecasting individual progression trajectories in Alzheimer’s disease – software and source data. https://doi.org/10.5281/zenodo.7331109 (2022).
Seabold, S. & Perktold, J. Statsmodels: econometric and statistical modeling with python. in 92–96 https://doi.org/10.25080/Majora-92bf1922-011 (2010).

Download references

Acknowledgements

E.M., I.K., J.O., S.E., and S.D. received funding from the European Research Council (ERC) under grant agreement no. 678304, the Agence Nationale de la Recherche (ANR) under the “Investissements d’avenir” program, grants ANR-10-IAIHU-06 (IHU ICM) and ANR-19-P3IA-0001 (PRAIRIE 3IA Institute). E.M., I.K., J.O., C.B., M.H.A., and S.D. received funding from the European Union’s Horizon 2020 research and innovation program under grant agreement no. 826421 (TVB-Cloud). D.A. and S.D. received funding from the joint program in neurodegenerative diseases (JPND) under grant agreement ANR-19-JPW2-000 (E-DADS). S.E. received funding via an APHP-Inria collaboration grant (poste d’accueil) within the ARAMIS project-team. The data used in the preparation of this article were obtained from:

- The Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu),

- The Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL),

- The Japanese Alzheimer’s Disease Neuroimaging Initiative (J-ADNI),

- The PharmaCog Consortium,

- The MEMENTO cohort.

The investigators of these studies contributed to the design and implementation of the corresponding cohorts, provided data, but did not participate in analysis or writing of this report. Complete listings of these investigators can be found at:

- ADNI: https://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf

- AIBL: https://aibl.csiro.au/about/aibl-research-team/

- J-ADNI: https://humandbs.biosciencedbc.jp/en/hum0043-j-adni-authors

- PharmaCog: https://neugrid2.eu/wp-content/uploads/2022/01/pharmacog_investigators_list.pdf,

- MEMENTO: supplementary material.

Data collection and sharing were partly funded by the ADNI (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private-sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuroimaging at the University of Southern California. AIBL core funding was provided by the Commonwealth Scientific and Industrial Research Organization (CSIRO). The cohort was also supported by the University of Melbourne, Neurosciences Australia Ltd, Edith Cowan University, Mental Health Research Institute, Alzheimer’s Australia, National Ageing Research Institute, Austin Health, University of WA, CogState Ltd., Macquarie University, Hollywood Private Hospital, Sir Charles Gairdner Hospital. J-ADNI was supported by the following grants: Translational Research Promotion Project from the New Energy and Industrial Technology Development Organization of Japan; Research on Dementia, Health Labor Sciences Research Grant; Life Science Database Integration Project of Japan Science and Technology Agency; Research Association of Biotechnology (contributed by Astellas Pharma Inc., Bristol-Myers Squibb, Daiichi-Sankyo, Eisai, Eli Lilly and Company, Merck-Banyu, Mitsubishi Tanabe Pharma, Pfizer Inc., Shionogi & Co., Ltd., Sumitomo Dainippon, and Takeda Pharmaceutical Company), Japan, and a grant from an anonymous foundation. The IMI-PharmaCog/E-ADNI project was funded by the European seventh framework program and European Federation of Pharmaceutical Industries and Associations (EFPIA) for the Innovative Medicine Initiative (Grant no. 115009; http://www.pharmacog.org). The MEMENTO cohort was funded through research grants from the Fondation Plan Alzheimer (Alzheimer Plan 2008–2012), and the French Ministry of Higher Education, Research and Innovation (Plan Maladies Neurodégénératives 2014-2019). This work was also supported by CIC1401-EC, Bordeaux University Hospital (CHU Bordeaux, sponsor of the cohort), Inserm, and the University of Bordeaux. The MEMENTO cohort has received funding from AVID, GE Healthcare, and FUJIREBIO through private-public partnerships. This work was undertaken with resources from the Dementias Platform UK (DPUK) Data Portal; the Medical Research Council supports DPUK through grant MR/L023784/2.

Author information

Authors and Affiliations

Sorbonne Université, Institut du Cerveau - Paris Brain Institute – ICM, CNRS, Inria, Inserm, AP-HP, Hôpital Pitié-Salpêtrière, Paris, France
Etienne Maheux, Igor Koval, Juliette Ortholand & Stanley Durrleman
Department of bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
Colin Birkenbihl & Martin Hofmann-Apitius
Bonn-Aachen International Center for IT, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, 53115, Germany
Colin Birkenbihl & Martin Hofmann-Apitius
IRCCS Instituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy
Damiano Archetti
Université de Bordeaux, CNRS UMR 5293, Institut des Maladies Neurodégénératives, Bordeaux, France
Vincent Bouteloup & Carole Dufouil
Centre Hospitalier Universitaire (CHU) de Bordeaux, pôle de neurosciences cliniques, centre mémoire de ressources et de recherche, Bordeaux, France
Vincent Bouteloup & Carole Dufouil
Sorbonne Université, Institut du Cerveau - Paris Brain Institute – ICM, CNRS, Inria, Inserm, AP-HP, Hôpital Pitié-Salpêtrière, Institut de la mémoire et de la maladie d’Alzheimer (IM2A), center of excellence of neurodegenerative diseases (CoEN), department of Neurology, DMU Neurosciences, Paris, France
Stéphane Epelbaum

Authors

Etienne Maheux
View author publications
You can also search for this author in PubMed Google Scholar
Igor Koval
View author publications
You can also search for this author in PubMed Google Scholar
Juliette Ortholand
View author publications
You can also search for this author in PubMed Google Scholar
Colin Birkenbihl
View author publications
You can also search for this author in PubMed Google Scholar
Damiano Archetti
View author publications
You can also search for this author in PubMed Google Scholar
Vincent Bouteloup
View author publications
You can also search for this author in PubMed Google Scholar
Stéphane Epelbaum
View author publications
You can also search for this author in PubMed Google Scholar
Carole Dufouil
View author publications
You can also search for this author in PubMed Google Scholar
Martin Hofmann-Apitius
View author publications
You can also search for this author in PubMed Google Scholar
Stanley Durrleman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.E., C.D., M.H.-A., and S.D. conceived and supervised the research. E.M., I.K., J.O., C.B., D.A., V.B. conducted the research and performed data analysis. All authors contributed to writing the paper and agreed on its content. Editorial support, in the form of medical writing and copyediting, was provided by Julie Sappa of Alex Edelman and Associates.

Corresponding author

Correspondence to Stanley Durrleman.

Ethics declarations

Competing interests

S.E. received personal fees from Biogen, Eisai, Roche, and GE Healthcare for presentations and participation in advisory boards. S.D. is co-inventor of the patent “a method for determining the temporal progression of a biological phenomenon and associated methods and devices” which protects the potential uses of AD Course Map including targeting the time to administer medicine or identifying a biomarker (applicants: Inserm, CNRS, Sorbonne Université, Inria, Ecole Polytechnique, ICM, AP-HP, inventors: Stanley Durrleman, Jean-Baptiste Schiratti, Stéphanie Allassonnière, Olivier Colliot, international application number: PCT/IB2016/052699, granted in the USA (grant number 10832089), published in Europe and Japan. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Esther Bron, Arman Eshaghi and Brian Gordon for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data file

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Maheux, E., Koval, I., Ortholand, J. et al. Forecasting individual progression trajectories in Alzheimer’s disease. Nat Commun 14, 761 (2023). https://doi.org/10.1038/s41467-022-35712-5

Download citation

Received: 14 January 2022
Accepted: 19 December 2022
Published: 10 February 2023
DOI: https://doi.org/10.1038/s41467-022-35712-5

This article is cited by

Data-driven modelling of neurodegenerative disease progression: thinking outside the black box
- Alexandra L. Young
- Neil P. Oxtoby
- Daniel C. Alexander
Nature Reviews Neuroscience (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Characteristics of the study population

Disease progression models learn the timing of changes in biomarker levels during disease progression

Disease progression models forecast cognitive decline

Disease progression models select participants displaying progression for trials

Disease progression models can be used to design more powered clinical trials

Discussion

Methods

Participants

Neuropsychological assessments

Structural magnetic resonance imaging/anatomical imaging biomarkers

Cerebrospinal fluid biomarkers

Positron emission tomography/functional imaging biomarkers

A/T/N/C classification

Disease progression models

Validation procedure

Forecasting endpoints

Analysis of forecast errors

Comparison with alternative methods

Clinical trial simulation, enrichment evaluation, and sample size calculation

Statistics and reproducibility

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links