Background & Summary

Magnetic resonance imaging (MRI) brain templates (i.e. averages of multi-individual images, co-registered in a similar reference space) are widely used in image processing, for example as targets in registration and intensity normalization, as a common standard space enabling individual and population based comparisons in deformation/tensor or voxel based morphometry, and as the basis for segmentation techniques that rely on nonlinear registration1,2,3,4. An example is the MNI-ICBM152, an average based on images from 152 healthy young adults, and one of the most popular templates in current use given its distribution in processing pipelines such as MINC, FSL, and SPM1,2,3 that have been shared more than 45,000 times worldwide (Data from NITRC.org).

A common feature of existing averages such as the MNI-ICBM152 is their reliance on healthy, young brains, in addition to aggregating both sexes in the template generation process. However, in aging and populations with neurodegenerative diseases, ventricle enlargement, extensive levels of cortical and subcortical atrophy, as well as white matter hyperintensities (WMHs) create large degrees of difference between an individual’s MRI and such templates. We have shown in prior work that such differences significantly increase registration errors in some of these well-known image processing tools (e.g. ANTs, Elastix, FSL, MINC, and SPM)5. Ridwan et al. have shown that use of age-appropriate templates allows for higher inter-subject spatial normalization accuracy for older adult data, facilitating detection of smaller inter-group morphometric differences6. A similar reasoning applies to studies of neurodegeneration. Using a dataset consisting of patients with different frontotemporal dementia variants, we have shown that use of age and disease appropriate templates can significantly reduce nonlinear registration errors7. Van Hecke et al. have also shown that improvement in image alignments due to use of population-specific atlases leads to higher sensitivity and specificity in detecting white matter abnormalities in diffusion tensor imaging (DTI) voxel-based analyses8. Therefore, age and disease appropriate templates are necessary to reflect the anatomical characteristics of the populations of interest and increase downstream accuracy and sensitivity of the analyses by reducing potential image processing errors and biases that can occur when using age and pathology inappropriate templates7. An example use case would be the monitoring of a therapy in a specific pathology, with an effect that may be clinically significant but resulting in small image differences. The increased sensitivity brought about by using an appropriate, age-, sex- and disease template would therefore be significant.

Previous work on average brain templates has been mostly based on pediatric, young adult, or healthy aged brains6,9,10,11,12,13. Xiao et al. have developed a multi-contrast template of 15 Parkinson’s disease patients14. We have previously developed average T1w templates of frontotemporal dementia variants (i.e. behavioural, semantic, and progressive non-fluent aphasia) along with age matched healthy templates, showing that use of age and disease appropriate templates improve nonlinear registration performance7. Guo et al. have recently developed a T1w brain template based on a combination of healthy aged adults, individuals with mild cognitive impairment, and Alzheimer’s disease patients, showing that use of disease-specific templates improves sensitivity in voxel-based gray matter volume analyses, enabling for early detection and earlier therapeutic opportunities15. To our knowledge, no prior work has provided multi-sequence average templates of various neurodegenerative disease populations generated consistently using harmonized image acquisition protocols.

Based on data from the Canadian Consortium for Neurodegeneration and Aging (CCNA)16, a flagship study of the Canadian Institutes of Health Research, we present average templates for T1-weighted (T1w), T2-weighted (T2w), T2*-weighted, Proton Density (PD), and FLuid Attenuated Inversion Recovery (FLAIR) sequences in eleven diagnostic groups, including Alzheimer’s Dementia (AD), Fronto-temporal Dementia (FTD), Lewy Body Dementia (LBD), Mild Cognitive Impairment (MCI), cognitively intact and impaired Parkinson’s Disease patients (PD-CIE and PD-CI, respectively), individuals with Subjective Cognitive Impairment (SCI), Vascular Alzheimer’s Dementia (V-AD), Vascular Mild Cognitive Impairment (V-MCI), as well as Cognitively Intact Elderly (CIE) individuals and one human phantom17. These templates can capture the anatomical characteristics for each disease cohort at the regional level. With multiple contrasts available providing different types of information, the various templates can be used to assess different aspects in each disease: i) T1w templates are useful for assessing fine anatomical details and estimating regional and global atrophy levels; ii) T2w/PD sequences are useful for skull segmentation, and assessment of deep gray matter structures, iii) FLAIR images can be used to detect WMHs and infarcts; and iv) T2* images can be used to identify microbleeds as well as hemorrhages.

There are significant sex and gender related differences in the prevalence, clinical outcomes, and response to treatments for these distinct neurodegenerative diseases (e.g. higher prevalence of Alzheimer’s disease in females and higher prevalence of Parkinson’s disease in males)18,19,20. Sex-specific average templates would therefore be useful tools to represent and assess potential anatomical differences in patterns of atrophy in males and females. Thus, in addition to the disease-specific average templates combining male and female participants, we provide separate templates for males and females in each diagnostic category.

Methods

Data

We used data from the Comprehensive Assessment of Neurodegeneration and Dementia (COMPASS-ND) cohort of the CCNA, a national initiative to catalyze research on dementia16. COMPASS-ND includes deeply phenotyped subjects with various forms of dementia and mild memory loss or concerns, along with cognitively intact elderly subjects. Ethical agreements were obtained at all respective sites. Written informed consent was obtained from all participants.

Clinical diagnoses were determined by participating clinicians based on longitudinal clinical, screening, and MRI findings (i.e. diagnosis reappraisal was performed using information from recruitment assessment, screening visit, clinical visit with physician input, and MRI). The diagnostic groups included, AD, CIE, FTD, LBD, MCI, PD-CIE, PD-MCI, PD-Dementia (for this study, PD-MCI and PD-Dementia groups were merged into one PD-CI group), SCI, V-AD, and V-MCI. Diagnosis was performed according to the current guidelines in the field and diagnostic criteria was harmonized across all CCNA sites. However, we acknowledge that due to the inherent heterogeneity and variabilities in such neurodegenerative disease populations, there might be inevitable variabilities across different centers and studies. For details on clinical group ascertainment, see Pieruccini‐Faria et al.21 as well as Dadar et al.22 (section 1 in the supplementary materials). A single cognitively healthy volunteer was also scanned as a human phantom multiple times across different centers for quality assurance purposes (more information on the SIMON human phantom dataset can be found in Duchesne et al.17).

Table 1 summarizes the demographic characteristics of the participants used to generate each template. Note that due to the lower prevalence and challenges in recruitment of participants in certain disease categories (e.g. FTD and LBD), the resulting templates might not be reflective of the entire spectrum of presentation of the pathology. Further work including larger populations is therefore warranted.

Table 1 Demographic characteristics of the participants used to create the average templates.

All participants were scanned using the Canadian Dementia Imaging Protocol, a harmonized MRI protocol designed to reduce inter-scanner variability in multi-centric studies and which included the following sequences23:

  • 3D isotropic T1w scans (voxel size = 1.0 × 1.0 × 1.0 mm3) with an acceleration factor of 2 (Siemens: MP‐RAGE‐PAT: 2; GE: IR‐FSPGR‐ASSET 1.5; Philips: TFE‐Sense: 2)

  • Interleaved proton density/T2‐weighted (PD/T2w) images (voxel size = 0.9 × 0.9 × 3 mm3), fat saturation, and an acceleration factor of 2.

  • Fluid attenuated inversion recovery (T2w‐FLAIR) images (voxel size = 0.9 × 0.9 × 3 mm3), fat saturation, and an acceleration factor of 2.

  • T2* gradient echo images (voxel size = 0.9 × 0.9 × 3 mm3) and acceleration factor of 2.

Table 2 shows the acquisition parameters for each sequence and scanner manufacturer. A detailed description, exam cards, and operators’ manual are publicly available at: www.cdip-pcid.ca.

Table 2 Acquisition parameters of the CDIP protocol.

Preprocessing

All images were pre-processed with image denoising24, intensity non-uniformity correction25, and image intensity normalization into a 0–100 range. The pre-processed images were then linearly5 registered to the pseudo-Talairach space defined by the MNI-ICBM152-2009c template using a 9-parameter registration (three translation, three rotation, and three scaling parameters)26. T2w, PD, FLAIR, and T2* images were also co-registered (rigid registration, 6 parameters) to the T1w images with a mutual information cost function.

Template generation

The method by Fonov et al. was used to generate unbiased templates for each diagnostic group for all participants, as well as each group but separately for males and females12,27 (all except the LBD group in which there were only two female participants). This method has previously been used to generate templates in various studies, including the latest higher resolution version of the MNI-ICBM2009c template (http://nist.mni.mcgill.ca/?p=904)26,28. In short, the pipeline implements a hierarchical nonlinear registration procedure using Automatic Nonlinear Image Matching and Anatomical Labelling (ANIMAL)29, iteratively refining the previous registrations by reducing the step size (20 iterations in total, four iterations at each of the levels of 32, 16, 8, 4, and 2 mm, respectively) until convergence is reached. This process of increasingly refined iterative nonlinear registrations leads to average brains that reflect the anatomical characteristics of the population of interest with higher levels of anatomical detail27. The higher resolution T1w images (isotropic 1mm3) were used to obtain the nonlinear transformations for creating the average templates. T2w, PD, FLAIR, and T2* templates were then created by combining their rigid to-T1w co-registration transformations with the nonlinear transformations based on the T1w images. All final templates were generated at 1mm3 isotropic resolution.

FreeSurfer segmentation

To appreciate differences between templates, we processed all T1w averages using FreeSurfer version 6.0.0 (recon-all -all). FreeSurfer provides a full processing stream for structural T1w data (https://surfer.nmr.mgh.harvard.edu/)30. The final segmentation output (aseg.mgz) was then used to obtain volumetric information for each template based on the FreeSurfer look up table available at https://surfer.nmr.mgh.harvard.edu/fswiki/FsTutorial/AnatomicalROI/FreeSurferColorLUT.

Data Records

For information on COMPASS-ND dataset and to request access, see https://ccna-ccnv.ca/compass-nd-study/. The average template files for all groups and sequences are available in both compressed MINC31,32 and NIfTI formats at G-Node (https://gin.g-node.org/mahsadadar/CDIP_Templates)33 as well as Zenodo 34.

Technical Validation

Quality control

The quality of the registrations, pre-processed images, as well as the volumetric segmentations performed by FreeSurfer was visually assessed by an experience rater (MD). All images passed this quality control step. Note that the provided data was already quality controlled by the CCNA imaging platform for presence of imaging artifacts, and only scans that had passed this quality control step were acquired and used for this study. In terms of qualitative comparison with other atlases in the field6,9,10,14,27, based on visual assessment, the provided atlases have high levels of image sharpness and anatomical detail, clearly delineating the sulci and gyri in the cortex (Fig. 1).

Fig. 1
figure 1

Axial slices of T1w average templates for all diagnostic groups.

Templates

Figures 15 show axial slices of the T1w, T2w, T2star, PD, and FLAIR average templates for all 11 diagnostic groups, covering the brain at different levels. For more detailed figures of each template, see the supplementary materials (Figures S1S11). As expected, CIE, PD-CIE, and MCI groups had smaller ventricles, with lower levels of atrophy compared with the cognitively impaired and dementia groups (Fig. 1). FLAIR images of the vascular cohorts (i.e. Mixed, V-MCI, and V-AD) showed extensive levels of periventricular hyperintensities compared to other groups (Fig. 2), due to the presence of WMHs in the majority of the patients in these populations. This pattern was also visible to a lesser extent as hypointensity in the T1w templates, as well as hyperintensity in the T2w, PD, and T2* templates (Figs. 3 to 5). Presence of WMHs is another factor that necessitates use of age and disease appropriate templates, since they can directly impact intensity normalization results. In fact, we have previously shown that presence of WMHs significantly reduces linear registration accuracy in currently used image processing pipelines such as MINC, FSL, Elastix, SPM, and ANTs when images with high WMH burden are registered to young and healthy adult templates such as MNI-ICBM1525. Similarly, we showed that increased ventricular volume due to aging and presence of atrophy (e.g. in AD populations) reduces registration accuracy when using healthy young adult templates as the registration target5.

Fig. 2
figure 2

Axial slices of FLAIR average templates for all diagnostic groups.

Fig. 3
figure 3

Axial slices of T2w average templates for all diagnostic groups.

Fig. 4
figure 4

Axial slices of PD average templates for all diagnostic groups.

Fig. 5
figure 5

Axial slices of T2* average templates for all diagnostic groups.

Figure 6 shows axial slices of the male and female templates for all diagnostic groups and sequences. Overall, male templates have larger ventricles and greater levels of atrophy than female templates. For more detailed figures of each template, see the supplementary materials (Figures S12S31).

Fig. 6
figure 6

Axial slices of average male and female templates for all sequences and diagnostic groups.

Figure 7 shows axial slices of the templates for the human phantom (SIMON).

Fig. 7
figure 7

Axial slices of human phantom (SIMON) templates for all sequences.

Volumetric comparisons

Tables 35 summarize the grey and white matter (GM, WM) and cerebrospinal fluid (CSF) volumetric information for the templates as segmented by FreeSurfer. Figure 8 compares GM volumes (log transformed) of each template against the CIE template. Data points below the reference line (shown in red) indicate lower values for the template in comparison with the CIE template. As expected, cognitively impaired and dementia templates had lower GM values than the CIE template, whereas both cognitively intact PD-CIE and SCI templates had similar volumes to the CIE template (i.e. data points fall on the reference line).

Table 3 Volumetric GM information (in mm3) for each template based on FreeSurfer segmentations.
Table 4 Volumetric WM information (in mm3) for each template based on FreeSurfer segmentations.
Table 5 Volumetric CSF information (in mm3) for each template based on FreeSurfer segmentations.
Fig. 8
figure 8

FreeSurfer based GM volumes for each diagnostic group versus the CIE template. CIE = Cognitively Intact Elderly. L: Left. R: Right.

Figure 9 compares GM volumes (log transformed) of male versus female templates. Note that since all templates have been linearly registered to the MNI-ICBM2009c template prior to the template creation step, all volumetric values reflect variabilities after accounting for intracranial volume differences and are not caused by potential head size differences between males and females. Data points below the reference line (shown in red) indicate lower values for the male template in comparison with the female template. In the AD and mixed templates, the nucleus accumbens areas bilaterally had lower volumes in the male templates. In the PD-CI template, most regions had slightly lower GM volumes in the male template.

Fig. 9
figure 9

FreeSurfer based GM volumes for male and female templates for each diagnostic group. L: Left. R: Right.

As expected, mixed dementia, vascular MCI, and vascular AD templates had higher WM hypointensity volumes (corresponding to the WMHs on FLAIR and T2w sequences) on T1w templates (Table 4). Male templates for AD, FTD, PD-CI, V-MCI, and V-AD also had greater WM hypointensity volumes than the female templates (Table 4). The mixed template had the largest ventricles (Table 5), followed by V-AD and AD templates. As expected, CIE template had the smallest ventricles, followed by PD-CIE, and SCI. In all diagnostic groups, lateral ventricles were larger for the male templates in comparison with the female templates. This difference was most prominent in the V-AD group, for which the left and right lateral ventricles were 33% and 43% larger respectively for the male template (Table 5). Regarding asymmetry, in the FTD, V-MCI, and mixed templates, the left lateral ventricle was 12%, 13%, and 17% larger than the right lateral ventricle. This difference was more prominent in the male templates for FTD and V-MCI groups, whereas for the mixed group, the female template had greater asymmetry in the ventricles. All of these differences highlight the need for group-specific templates in multi-individual, multi-centric studies.

Using Disease Appropriate Templates to Improve Registration

Use of age and disease appropriate templates can reduce both linear and nonlinear registration errors. We have previously shown that older subjects, those with larger ventricles, and high levels of WMHs have higher levels of linear registration failure rates when using young adult brain templates as the registration target for most widely used registration tools such as FSL, SPM, ANTs, Elastix, and MINC5. Using disease appropriate templates could be the solution to improve both linear and nonlinear registration for aged and diseased populations. Note that since all templates are in the same space (i.e. share a similar alignment to a pseudo-Talairach coordinate system), linear registration to one would be equivalent to linear registration to other templates without additional manipulation. As for nonlinear registration, these templates can be used as intermediate registration targets even in cases where the intended final application is to register all subjects to one healthy or younger average brain. Intermediate templates have been previously used for various registration tasks, particularly when there exists a large difference between source and target templates35,36,37,38. Disease appropriate average templates can be used as intermediate registration targets to improve nonlinear registration, using the following steps:

  1. 1.

    Linearly register patient brain image(s) to the disease appropriate template.

  2. 2.

    Nonlinearly register patient brain image(s) to the disease appropriate template.

  3. 3.

    Concatenate the nonlinear transformation with the precomputed nonlinear transformation between the two average templates.

  4. 4.

    If necessary, the registration can be refined by performing another nonlinear registration between the nonlinearly transformed image and the average template. Concatenate this additional transformation with the previous two.

Figure 10 demonstrates how using a disease appropriate average template can improve nonlinear registration. Panel a shows a nonlinear registration scenario in which the brain of an individual with FTD has been nonlinearly registered directly to the MNI-ICBM152 average template using ANTs diffeomorphic registration tool39. The red contours consistently show the outline of the MNI-ICBM152 brain and can be used to assess the quality of the nonlinear registration. In a perfectly registered image, the contours of MNI-ICBM152 should match the contours of the nonlinearly deformed image (shown in the last columns on the right). The orange arrow shows the areas of gross registration failure, where ANTs has not been able to accurately register the ventricles of the subject to MNI-ICBM152. This is a common occurrence in dementia patients with large ventricles and gross atrophy. Panel b shows registration results for the same individual, which was first nonlinearly registered to a disease appropriate FTD template, and then nonlinearly registered to the MNI-ICBM152. Comparing the two deformed images (last columns on the right), we can see that when the FTD template was used as an intermediate registration target, ANTs was able to accurately register the ventricles.

Fig. 10
figure 10

An example of T1-weighted scan of an individual with frontotemporal dementia (FTD) that was nonlinearly registered to MNI-ICBM152 average template directly (a) and using a disease appropriate template as an intermediate registration target (b). The red contour shows the outline of MNI-ICBM152 template, and can be used to assess registration accuracy. The orange arrow shows the areas of gross registration failure.