ANTsX neuroimaging-derived structural phenotypes of UK Biobank

Tustison, Nicholas J.; Yassa, Michael A.; Rizvi, Batool; Cook, Philip A.; Holbrook, Andrew J.; Sathishkumar, Mithra T.; Tustison, Mia G.; Gee, James C.; Stone, James R.; Avants, Brian B.

doi:10.1038/s41598-024-59440-6

Open access
Published: 17 April 2024

Nicholas J. Tustison^1,2,
Michael A. Yassa²,
Batool Rizvi²,
Philip A. Cook³,
Andrew J. Holbrook⁴,
Mithra T. Sathishkumar²,
Mia G. Tustison⁵,
James C. Gee³,
James R. Stone¹ &
Brian B. Avants¹

Scientific Reports volume 14, Article number: 8848 (2024)

Abstract

UK Biobank is a large-scale epidemiological resource for investigating prospective correlations between various lifestyle, environmental, and genetic factors with health and disease progression. In addition to individual subject information obtained through surveys and physical examinations, a comprehensive neuroimaging battery consisting of multiple modalities provides imaging-derived phenotypes (IDPs) that can serve as biomarkers in neuroscience research. In this study, we augment the existing set of UK Biobank neuroimaging structural IDPs, obtained from well-established software libraries such as FSL and FreeSurfer, with related measurements acquired through the Advanced Normalization Tools Ecosystem. This includes previously established cortical and subcortical measurements defined, in part, based on the Desikan-Killiany-Tourville atlas. Also included are morphological measurements from two recent developments: medial temporal lobe parcellation of hippocampal and extra-hippocampal regions in addition to cerebellum parcellation and thickness based on the Schmahmann anatomical labeling. Through predictive modeling, we assess the clinical utility of these IDP measurements, individually and in combination, using commonly studied phenotypic correlates including age, fluid intelligence, numeric memory, and several other sociodemographic variables. The predictive accuracy of these IDP-based models, in terms of root-mean-squared-error or area-under-the-curve for continuous and categorical variables, respectively, provides comparative insights between software libraries as well as potential clinical interpretability. Results demonstrate varied performance between package-based IDP sets and their combination, emphasizing the need for careful consideration in their selection and utilization.

Introduction

UK Biobank (UKBB) is a unique epidemiological effort which aims to prospectively identify potential relationships between disease and associated risk factors by leveraging comprehensive individualized medical and sociodemographic data. Enrollment began in 2006 and continued for four years, ultimately resulting in a cohort of approximately 500,000 individuals. Volunteer age criteria were limited to birth years between 1934 and 1971, an optimal range for observing the onset of certain diseases and their subsequent progression. Monitoring of a significant subset is expected to continue for at least 30 years, facilitated in part by coordination with the National Health Service of the UK. This has resulted in several studies exploring a wide variety of research topics (e.g., the relationship between age and cognitive decline,¹ the association of polygenic profiles and mental health,² and potential collider bias in COVID-19 assessment³).

An integral component of UKBB is the subset of approximately 50,000 subjects who underwent comprehensive imaging batteries, including neuroimaging,^4,5 specifically structural T1-weighted MPRAGE and T2-FLAIR MRI; diffusion-weighted MRI; resting-state and task-based functional MRI; and susceptibility-weighted MRI. Employing specialized processing pipelines, these raw imaging data are used to generate various quantities, referred to as image-derived phenotypes (IDPs), for use as potential biomarkers. A sampling of resulting image-based research studies evinces insights into such topics as hippocampal volumetric nomograms across age;⁶ population modeling of age, fluid intelligence, and neuroticism;⁷ and brain structural changes associated with COVID-19 and the corresponding cognitive effects.⁸

Facilitating the majority of existing UKBB imaging-related research is the FMRIB Software Library (FSL)⁹ which has been specifically tailored to provide UKBB IDPs.^4,5 For the structural data alone, this includes global and cortical IDPs from FMRIB’s Automated Segmentation Tool (FAST),¹⁰ subcortical IDPs from FMRIB’s Integrated Registration and Segmentation Tool (FIRST),¹¹ and white matter hyperintensity (WMH) load using the Brain Intensity AbNormality Classification Algorithm (BIANCA).¹² UKBB was subsequently augmented with FreeSurfer-based IDPs¹³ which include the standard “aseg” segmentation as well as hippocampal subfield¹⁴ and amygdala nuclei¹⁵ pipeline outputs.

Analogously, the Advanced Normalization Tools Ecosystem (ANTsX) is a collection of interrelated, open-source software libraries for biological and medical image processing and analysis¹⁶ with developmental roots in high-performing medical image registration^17,18 and built on the Insight Toolkit (ITK).¹⁹ ANTsX-based IDPs have demonstrated utility in several studies spanning a variety of organ systems, species, and imaging modalities.^20,21,22 These IDPs include those which have been previously reported, such as global brain tissue volumes²³ and more localized, FreeSurfer-analogous cortical thickness values^16,24,25 averaged over the Desikan-Killiany-Tourville (DKT) regions.²⁶ In addition, recently developed ANTsX functionality includes a medial temporal lobe (MTL) parcellation framework known as “DeepFLASH,” a neural network for segmenting hippocampal subfields and extra-hippocampal regions which extends previous work.²⁷ Newly introduced functionality also includes regional cerebellum measurements based on the Schmahmann atlas²⁸ including cortical thickness.²⁹

Characterizing the respective sets of FSL, FreeSurfer, and ANTsX IDPs and their mutual relationships can guide researchers in their usage, as there are both significant overlap and notable differences between these measures. Although comparison between sets is potentially insightful, a focused, package-wise comparison using UKBB is difficult due to 1) the absence of complete, individual IDP correspondence across packages and 2) the general purpose of UKBB data (in contrast, for example, to the ADNI dataset,³⁰ which focuses on Alzheimer’s disease). Regarding IDP differences, even between identically defined IDPs (e.g., hippocampal volume), observer bias is a possible source of measurement variance.³¹ Here, computational measurement tools are cast as “observers,” and “observer bias” arises from the specific set of choices that results in the final numerical measurement. These choices can include (but are certainly not limited to) modeling considerations, preferences with respect to anatomical definitional ambiguities, and the set of parameters used to run the corresponding software. Note that this variance is not indicative of inaccuracy, per se, such as with instrumentation bias where sub-optimal calibration of software is used as a straw-man for comparative purposes.³² Rather, observer bias is supplemental to conventional signal noise considerations as a potential source of measurement discrepancy which can provide insight when considered within the appropriate context. For example, differing labeling protocols for specific anatomical structures, such as the hippocampal subfields and parahippocampal subregions, can reveal differences, and those differences can motivate and facilitate harmonization.³³

To this end, in addition to the core contribution of providing ANTsX-based UKBB IDPs, we explore the similarities and differences between the respective sets of structural IDPs and their combination using linear modeling. Such modeling potentially has the additional advantage of providing clinical interpretability of individual features. For example, one of the most well-studied neuroimaging structural correlative relationships is that between chronological age and so-called “brain age,” along with their health-dependent divergence.³⁴ Such subject-specific, single values are estimated using a variety of machine learning approaches and IDPs. Although establishing normative values over the human life span has clinical utility, as pointed out in Nyberg,³⁵ the single-valued brain age is at the extreme end of an “optimal balance between integration and diversification” required for neuroimaging studies. A single score or index most likely does not capture the extent of the non-linearity and heterogeneity of age and other effects on brain structure.³⁶ In contrast, the type of feature-based investigation performed here reveals insight into such questions as: “In what ways do the different IDP sets perform in terms of their predictive capabilities?,” “How does this performance vary with different sociodemographic variables?,” and “In what ways are features complementary and can their combined effect improve prediction performance?”

Materials and methods

UK Biobank data description

The study was conducted under UKBB Resource Application ID 63965. The total number of subjects at the time of download was 502,413 with 49,351 T1 and FLAIR images from the baseline assessment. Although follow-up visits were available for many participants, only the T1 and FLAIR images from the baseline visit were used for this study. Prior to this study, and as part of UKBB data repository, the FSL and FreeSurfer packages were used to generate sets of IDPs calculated from these baseline images which are made available as tabulated data as part of the resource application. The UKBB’s strict quality control protocols⁵ and the intersection between FSL and FreeSurfer complete sets of IDPs resulted in a UKBB-derived cohort of 40,898 sets of measurements. Intersection with the final ANTs complete processed IDP set resulted in a total study cohort size of 40,796.
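The cohort construction described above amounts to intersecting the subjects with complete IDP sets across packages. A minimal sketch follows, assuming each package's IDPs are held as a mapping from subject ID to IDP values; the function names, subject IDs, and values are hypothetical and are not taken from the UKBB tables or the authors' scripts.

```python
def complete_case_ids(idp_table):
    """Return the IDs of subjects with no missing IDP values.

    idp_table maps a subject ID to a dict of IDP name -> value (None if missing).
    """
    return {
        subject_id
        for subject_id, idps in idp_table.items()
        if all(value is not None for value in idps.values())
    }


def study_cohort(*idp_tables):
    """Intersect the complete-case subject IDs across all given packages."""
    id_sets = [complete_case_ids(table) for table in idp_tables]
    return set.intersection(*id_sets)


# Toy, made-up tables: three packages and four hypothetical subjects.
fsl = {"s1": {"vol": 1.0}, "s2": {"vol": 2.0}, "s3": {"vol": None}, "s4": {"vol": 4.0}}
freesurfer = {"s1": {"thk": 2.5}, "s2": {"thk": 2.4}, "s3": {"thk": 2.6}}
antsx = {"s1": {"thk": 2.7}, "s2": {"thk": None}, "s4": {"thk": 2.2}}

cohort = study_cohort(fsl, freesurfer, antsx)
print(sorted(cohort))  # ['s1']
```

Only subjects with a complete set of measurements in every package survive, which is why each additional package can only shrink the cohort.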

FSL structural phenotypes

All structural FSL IDPs were included for consideration.³⁷ These included the following categories:

FAST regional grey matter volumes (Category ID: 1101);
FIRST subcortical volumes (Category ID: 1102);
Global brain tissue volumes and related quantities (Field ID: 25000–25010, 25025); and
Total volume of WMH load (Field ID: 25781)

for a total of $139_{FAST} + 14_{FIRST} + 12_{Global} + 1_{WMH} = 166$ IDPs.

FreeSurfer structural phenotypes

Several categories of IDPs are available for FreeSurfer comprising a total of 1242 measurements.³⁷ However, to make the study dataset more computationally tractable and reduce set size differences between packages, we selected the following popular IDP subsets:

ASEG volumetric measurements (Category ID: 190);
DKT volumes and mean thicknesses (Category ID: 196); and
Hippocampal subfields and amygdala nuclei (Category ID: 191)

totaling $56_{ASEG} + 124_{DKT} + 121_{hipp} = 301$ individual IDPs.

ANTsX structural phenotypes

Both sociodemographic and bulk image data were downloaded to the high performance cluster at the University of Virginia for processing. Grad-warped distortion corrected³⁸ T1-weighted and FLAIR image data were used to produce the following ANTsX IDPs:

Deep Atropos brain tissue volumes (i.e., CSF, gray matter, white matter, deep gray matter, brain stem, and cerebellum);
DKT DiReCT cortical thickness and volumes;
DKT-based regional volumes;
DeepFLASH regional volumes;
Cerebellum regional thickness and volumes;
Regional WMH loads

totaling $7_{Deep Atropos} + 88_{DKT reg} + 128_{DKT DiReCT} + 20_{DeepFLASH} + 48_{Cerebellum} + 13_{WMH} = 302$ IDPs which are illustrated in Fig. 1. We have reported previously on the first three categories of ANTsX IDPs¹⁶ but provide a brief description below. We also provide further details concerning both DeepFLASH and the cerebellum morphology algorithms.

Brain tissue volumes

The ANTsXNet deep learning libraries for Python and R (ANTsPyNet and ANTsRNet, respectively) were evaluated in terms of multi-site cortical thickness estimation.¹⁶ This extends previous work^24,25 in replacing key pipeline components with deep learning variants. For example, a trained network, denoted Deep Atropos, replaced the original Atropos algorithm²³ for six-tissue segmentation (CSF, gray matter, white matter, deep gray matter, cerebellum, and brain stem) similar to functionality for whole brain deep learning-based brain extraction.

DKT cortical thickness, regional volumes, and lobar parcellation

As part of the deep learning refactoring of the cortical thickness pipeline mentioned in the previous section, a framework was developed to generate DKT cortical and subcortical regional labels from T1-weighted MRI.¹⁶ This facilitates regional averaging of cortical thickness values over that atlas parcellation as well as being the source of other potentially useful geometry-based IDPs. In terms of network training and development, using multi-site data,²⁴ two separate U-net³⁹ networks were trained for the “inner” (e.g., subcortical, cerebellar) labels and the “outer” cortical labels, respectively. Similar to Deep Atropos, preprocessing includes brain extraction and affine transformation to the space of the MNI152 template⁴⁰ which includes corresponding prior probability maps. These maps are used as separate input channels for both training and prediction—a type of surrogate for network attention gating.⁴¹ Using FreeSurfer’s DKT atlas label-to-lobe mapping,⁴² we use a fast marching approach⁴³ to produce left/right parcellations of the frontal, temporal, parietal, and occipital lobes, as well as left/right divisions of the brain stem and cerebellum. Using the segmentation output from Deep Atropos, the DiReCT algorithm²⁹ generates the subject-specific cortical thickness map which, as previously mentioned, is summarized in terms of IDPs by DKT regional definitions. Given the diffeomorphic and thickness constraints dictated by the DiReCT algorithm, we generate additional DKT regional labels (cortex only) from the non-zero cortical thickness regions to also be used as IDPs.
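The regional summary of a thickness map can be illustrated with a small sketch. It assumes the thickness image and the label image have been flattened to equal-length sequences, that label 0 is background, and that only voxels with non-zero thickness contribute to a region's mean. The function name and label numbers are illustrative and do not reproduce the ANTsX implementation.

```python
from collections import defaultdict


def regional_mean_thickness(thickness, labels):
    """Average thickness per label over voxels with non-zero thickness.

    thickness and labels are equal-length flat sequences (one entry per voxel);
    label 0 is treated as background and skipped.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for value, label in zip(thickness, labels):
        if label != 0 and value > 0:
            sums[label] += value
            counts[label] += 1
    return {label: sums[label] / counts[label] for label in sums}


# Made-up voxel values; the label numbers are illustrative only.
thickness = [0.0, 2.0, 3.0, 0.0, 4.0, 2.0]
labels = [0, 1002, 1002, 1003, 1003, 1003]

means = regional_mean_thickness(thickness, labels)
print(means)  # {1002: 2.5, 1003: 3.0}
```

Excluding zero-thickness voxels keeps labeled voxels that the thickness estimate does not reach from pulling a regional mean toward zero.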

Fused labeling for automated segmentation of the hippocampus and extra-hippocampal regions (DeepFLASH)

A set of IDPs was generated using a deep learning-based framework for hippocampal and extra-hippocampal subfield parcellation which is also publicly available within ANTsXNet (referred to as “DeepFLASH”). This work constitutes an extension of earlier work,²⁷ based on joint label fusion (JLF),⁴⁴ which has been used in a variety of studies.^45,46,47,48,49,50 DeepFLASH comprises both T1/T2 multi-modality and T1-only imaging networks for parcellating the following MTL regions:

Hippocampal subfields
- Dentate gyrus/cornu ammonis 2–4 (DG/CA2/CA3/CA4)
- Cornu ammonis 1 (CA1)
- Subiculum
Extra-hippocampal regions
- Perirhinal
- Parahippocampal
- Antero-lateral entorhinal cortex (aLEC)
- Posteromedial entorhinal cortex (pMEC)

DeepFLASH employs a traditional 3-D U-net model³⁹ consisting of five layers with 32, 64, 96, 128, and 256 filters, respectively. In addition to the multi-region output, three additional binary outputs (the entire medial temporal lobe complex, the whole hippocampus, and the extra-hippocampal cortex) are incorporated as a hierarchical structural output set. Data augmentation employed both randomized shape variations (i.e., linear and deformable geometric perturbations) and intensity variations (i.e., simulated bias fields, added noise, and intensity histogram warping). Further information regarding training and prediction can be found at our ANTsXNet GitHub repositories.^51,52

Cerebellum morphology

ANTsX cerebellum IDPs comprise both regional volumes and cortical thickness averages based on the Schmahmann atlas²⁸ for cerebellar cortical parcellation. Cortical regions include the following left and right hemispherical lobules: I/II, III, IV, V, VI, Crus I, Crus II, VIIB, VIIIA, VIIIB, IX, and X. Cerebellar cortical thickness is quantified using the DiReCT algorithm.²⁹ Both tissue segmentation (CSF, gray matter, and white matter) and regional parcellation are based on a deep learning network similar to that described previously for DeepFLASH. Training data⁵³ were coupled with the previously described data augmentation. In contrast to DeepFLASH, which utilizes a single network with multiple outputs, cerebellum output is derived by first extracting the whole cerebellum and then using it as input to both the tissue segmentation network and the Schmahmann regional atlas network.

White matter hyperintensity segmentation

Although UKBB includes white matter hyperintensity segmentation masks⁵ derived from FMRIB’s BIANCA tool,¹² a recently developed WMH segmentation framework from the “SYSU” team⁵⁴ was imported into the ANTsXNet libraries for WMH segmentation. As discussed elsewhere,⁵⁵ this was the top-performing algorithm in the challenge held at the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) in 2017, in which image data from five sites were used for both training and testing of segmentation algorithms from 20 different teams. Both the architecture and ensemble weights were made publicly available by the SYSU team, which permitted a direct porting into ANTsXNet.

Open-science implementation

Implementations of the previously described pipelines are available in Python and R through our respective ANTsPy/ANTsPyNet and ANTsR/ANTsRNet libraries hosted in the ANTsX ecosystem on GitHub (http://www.github.com/ANTsX/). Assuming the T1-weighted and FLAIR images are stored in the variables t1 and flair, respectively, the specific function invocations to produce the ANTsX UKBB IDPs are as follows:

ANTsPyNet (Python)
- brain_extraction(t1, modality="t1")
- deep_atropos(t1)
- cortical_thickness(t1)
- desikan_killiany_tourville_labeling(t1, do_lobar_parcellation=True)
- deep_flash(t1)
- cerebellum_morphology(t1, compute_thickness_image=True)
- sysu_media_wmh_segmentation(flair, t1)
ANTsRNet (R)
- brainExtraction( t1, modality = "t1" )
- deepAtropos( t1 )
- corticalThickness( t1 )
- desikanKillianyTourvilleLabeling( t1, doLobarParcellation = TRUE )
- deepFlash( t1 )
- cerebellumMorphology( t1, computeThicknessImage = TRUE )
- sysuMediaWmhSegmentation( flair, t1 )

Note that deviation from the default parameters is only used to produce additional output. In addition to the scripts that are in the publicly available GitHub repository associated with this work (https://github.com/ntustison/ANTsXUKBBPublic), self-contained examples (i.e., including data and code snippets) of all listed functionality are available as part of an online ANTsX tutorial also hosted on GitHub as a gist (http://tinyurl.com/antsxtutorial).

Predictive modeling for IDP characterization

Insight into the relationships between neurostructural and phenotypic measures is often possible through predictive modeling of sociodemographic targets and neuroimaging biomarkers. Many strategies for data exploration leverage standardized quantities derived from existing pipelines, which constitutes a form of dimensionality reduction or feature extraction based on clinically established relevance. Such tabulated data has several advantages over direct image use, including being relatively easier to access, store, and manage. Analysis with off-the-shelf statistical packages is also greatly simplified. Additionally, using standardized features in predictive modeling, where feature importance is a component of the analysis, significantly facilitates the clinical interpretability of the modeling process.

Herein, baseline models were built using standard linear regression, with linear dependencies between covariates resolved using findLinearCombos from the caret R package.⁵⁶ Although other modeling approaches were explored (e.g., XGBoost⁵⁷ and TabNet⁵⁸), the linear models were the top performing models in terms of predictive accuracy so, in the interest of simplicity, we only discuss those here and refer the interested reader to the GitHub repository associated with this work for these additional explorations. We selected several target variables for our comparative evaluation (cf. Table 1) and generated models of the form:

$$\begin{aligned} \text{Target} \sim \text{Age} + \text{Genetic Sex} + \sum_{i=1}^{N} \text{IDP}_i \end{aligned} \tag{1}$$

where i indexes over the set of N IDPs for a particular grouping. In the cases where Age or Genetic Sex is the target variable, it is omitted from the right side of the modeling equation.
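The two preparatory steps behind Eq. (1), screening out linearly dependent covariates and omitting the target from the right-hand side, can be sketched in plain Python. This is an illustration under stated assumptions, not the authors' code. The caret function findLinearCombos relies on a QR decomposition, whereas the sketch swaps in a simpler Gram-Schmidt pass that flags any column lying in the span of the columns before it. The column names, the `GeneticSex` spelling, and the toy values are hypothetical, and for brevity the screen is applied to the IDP columns only.

```python
def find_dependent_columns(columns, tol=1e-8):
    """Return indices of columns that are linear combinations of earlier ones.

    columns is a list of equal-length lists (one list per covariate). A column
    whose residual, after projection onto the retained columns, is numerically
    zero is flagged as dependent.
    """
    basis = []       # orthonormal vectors spanning the retained columns
    dependent = []
    for index, column in enumerate(columns):
        residual = [float(x) for x in column]
        scale = sum(x * x for x in residual) ** 0.5
        for q in basis:
            coeff = sum(r * b for r, b in zip(residual, q))
            residual = [r - coeff * b for r, b in zip(residual, q)]
        norm = sum(r * r for r in residual) ** 0.5
        if norm <= tol * max(scale, 1.0):
            dependent.append(index)
        else:
            basis.append([r / norm for r in residual])
    return dependent


def build_formula(target, idp_names, covariates=("Age", "GeneticSex")):
    """Build an R-style formula, dropping the target from the covariates."""
    terms = [name for name in covariates if name != target]
    terms.extend(idp_names)
    return f"{target} ~ " + " + ".join(terms)


# Toy data: the third (hypothetical) IDP is the sum of the first two.
idps = {
    "IDP_left": [1.0, 2.0, 3.0, 4.0],
    "IDP_right": [2.0, 1.0, 0.0, 1.0],
    "IDP_total": [3.0, 3.0, 3.0, 5.0],
}
names = list(idps)
dropped = {names[i] for i in find_dependent_columns([idps[n] for n in names])}
kept = [name for name in names if name not in dropped]

formula_bmi = build_formula("BMI", kept)
formula_age = build_formula("Age", kept)
print(formula_bmi)  # BMI ~ Age + GeneticSex + IDP_left + IDP_right
print(formula_age)  # Age ~ GeneticSex + IDP_left + IDP_right
```

Which member of a dependent set gets dropped depends on column order, so a fixed ordering of IDPs is needed for reproducible models.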

Table 1 Set of UKBB sociodemographic targets for evaluation.

Assessment of the models based on the three individual sets of IDPs and their combination employs standard quality measures: area under the curve (AUC) for classification targets and root-mean-square error (RMSE) for regression targets. We also explored individual IDP importance through the use of model-specific parameter assessment metrics (i.e., the absolute value of the t-statistic).
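For reference, both quality measures can be written in a few lines. The sketch below uses only the standard library and computes the binary AUC through its rank interpretation: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one, with ties counted as one half. The function names are illustrative, and the multi-class AUC used for some targets is not covered.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between observed and predicted values."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def binary_auc(labels, scores):
    """Binary AUC as the fraction of positive/negative pairs ranked correctly."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Made-up values for illustration.
print(rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0]))
print(binary_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))  # 0.75
```

The pairwise loop is quadratic in the number of cases. It is adequate for illustration, but a rank-based computation would be preferable at cohort scale.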

Results

Package-wise group IDP comparison

To compare the groups of IDPs, we used the three IDP sets (FSL, FreeSurfer, ANTsX) and their combination (“All”) to train predictive models using the preselected target sociodemographic variables from Table 1. We first revisit a previous evaluative framework of ANTsX cortical thickness values by comparing their ability to predict Age and Genetic Sex with corresponding FreeSurfer cortical thickness values.¹⁶ Following this initial comparative analysis, ten-fold cross validation, using random training/evaluation sampling sets (90% training/10% evaluation), per IDP set per target variable was used to train and evaluate the models described by Eq. (1).
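The sampling scheme can be sketched as follows. The text leaves open whether the ten evaluations form a strict partition of the cohort or are independent random draws, so this sketch assumes the latter: ten independent random 90%/10% splits. The function name, seed, and cohort size are hypothetical.

```python
import random


def random_splits(n_subjects, n_repeats=10, train_fraction=0.9, seed=0):
    """Yield (train_indices, eval_indices) for repeated random splits."""
    rng = random.Random(seed)
    n_train = int(round(train_fraction * n_subjects))
    for _ in range(n_repeats):
        order = list(range(n_subjects))
        rng.shuffle(order)
        yield order[:n_train], order[n_train:]


# 100 hypothetical subjects: each split holds out 10 of them for evaluation.
splits = list(random_splits(n_subjects=100))
for train_idx, eval_idx in splits:
    assert len(train_idx) == 90 and len(eval_idx) == 10
```

Under this assumption a subject may land in several evaluation sets, or in none. A strict ten-fold partition would instead place each subject in exactly one evaluation set.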

Revisiting ANTs and FreeSurfer cortical thickness comparison

In previous publications,^16,24 IDPs under consideration were limited to ANTsX-based and FreeSurfer cortical thickness measurements averaged over the 62 regions of the DKT parcellation. These IDP sets were specifically compared in terms of the predictive capability vis-à-vis Age and Genetic Sex. With respect to UKBB-derived cortical thickness IDPs, similar analysis demonstrates consistency with prior results (see Fig. 2).

Package IDP comparison via continuous target variables

Table 2 Summary statistics for the selected continuous UKBB sociodemographic target variables.

Predictive models for cohort Age, Fluid Intelligence Score, Neuroticism Score, Numeric Memory, Body Mass Index, and Townsend Deprivation Index were generated and evaluated as described previously. Summary statistics for these variables are provided in Table 2. The resulting accuracies, in terms of RMSE, are provided in Fig. 3. These linear models provide consistently accurate results across the set of continuous target variables with the combined set of IDPs performing well for the majority of cases. All models demonstrate significant correlations across IDP sets (cf. Fig. 4).

Package IDP comparison via categorical target variables

Predictive models for cohort categories associated with Genetic Sex, Hearing Difficulty, Risk Taking, Same Sex Intercourse, Smoking Frequency, and Alcohol Frequency were generated and evaluated as described previously. The resulting accuracies, in terms of binary or multi-class AUC, are provided in Fig. 5. Similar to the continuous variables, the linear models perform well for most of the target variables. Superior performance is seen for predicting Genetic Sex.

Individual IDP comparison

Table 3 Top 10 features for Age, Fluid Intelligence Score, and Neuroticism Score target variables specified for the combined (i.e., All) IDP set.

To compare individual IDPs, for each target variable, we selected the set of results corresponding to the machine learning technique which demonstrated superior performance, in terms of median predictive accuracy, for the combined (All) IDP grouping. The top ten features for the principal continuous variables of Age, Fluid Intelligence Score, and Neuroticism Score are listed in Table 3 and ranked according to variable importance score (specifically, absolute t-statistic value for linear models). The ranked lists are also color-coded by IDP package. For additional insight into individual IDPs, full feature lists with feature importance rankings are available for all target variables in the supplementary material hosted at the corresponding GitHub repository.⁵⁹
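The ranking itself is simple once a linear model has been fitted: each feature's t-statistic is its coefficient divided by its standard error, and features are sorted by absolute value. A minimal sketch follows, assuming the coefficients and standard errors are already available from the fitted model; the feature names and numbers are made up.

```python
def rank_by_abs_t(estimates, top_n=10):
    """Rank features by |t|, where t = coefficient / standard error.

    estimates maps a feature name to a (coefficient, standard_error) pair.
    """
    t_values = {name: coef / se for name, (coef, se) in estimates.items()}
    ranked = sorted(t_values.items(), key=lambda item: abs(item[1]), reverse=True)
    return ranked[:top_n]


# Made-up numbers for three hypothetical features.
estimates = {
    "feature_a": (0.5, 0.125),   # t = 4.0
    "feature_b": (-1.5, 0.25),   # t = -6.0
    "feature_c": (0.25, 0.5),    # t = 0.5
}

top = rank_by_abs_t(estimates, top_n=2)
print(top)  # [('feature_b', -6.0), ('feature_a', 4.0)]
```

Sorting on the absolute value treats strong negative and strong positive associations alike, which is why the sign is kept in the output but ignored in the ordering.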

Discussion

Much UKBB research is made possible through the availability of its characteristic large-scale, subject-specific epidemiological data, including IDPs, and is enhanced by the stringent data acquisition protocols that ensure consistency across sites. In this work, we complement the existing FSL- and FreeSurfer-based UKBB IDPs with the generation and potential distribution of corresponding ANTsX-derived IDPs. These latter IDPs were generated from well-vetted pipelines that have been used in previous research and are publicly available through the ANTsX ecosystem. By providing these IDP-producing utilities within high-level languages, such as Python and R, in a comprehensive, open-source package, we are able to leverage the computational efficiency of deep learning libraries while also leveraging the numerous packages available for the curation, analysis, and visualization of tabulated data.

In addition to the availability of these ANTsX UKBB IDPs, we explored their utility with respect to other package-specific groupings and their combinations. For exploration of these IDP group permutations, we used linear modeling to predict commonly studied sociodemographic variables of current research interest (Table 1). In addition to research presentation in traditional venues, at least two of these target variables, specifically Age and Fluid Intelligence, have been the focus of two recent competitions.

Regarding the former, research concerning brain age estimation from neuroimaging is extensive and growing (cf. recent reviews).^34,60,61 It was also the subject of the recent Predictive Analytics Competition held in 2019 (PAC2019). This competition featured 79 teams leveraging T1-weighted MRI with a variety of quantitative approaches from convolutional neural networks (CNNs) to common machine learning frameworks based on morphological descriptors (i.e., structural IDPs) derived from FreeSurfer.⁶² The winning team,⁶³ using an ensemble of CNNs and pretrained on a UKBB cohort of $N=14,503$ subjects, had a mean absolute error (MAE) of 2.90 years. Related CNN-based deep learning approaches achieved comparable performance levels and simultaneously outperformed more traditional machine learning approaches.

Given that RMSE provides a general upper bound on MAE (i.e., MAE $\le$ RMSE), the accuracy levels yielded by our FSL, FreeSurfer, and ANTsX models can be seen from Fig. 3 to perform comparatively well. The FreeSurfer and ANTsX linear models performed similarly with RMSE prediction values of approximately 4.4 years whereas FSL was a little higher at 4.96 years. However, combining all IDPs resulted in an average RMSE value of 3.8 years. When looking at the top 10 overall linear model features (Table 3) ranked in terms of absolute t-statistic value, all three packages are represented and appear to reflect both global structures (white matter and CSF volumes) and general subcortical structural volumes (ANTsX “deep GM” and both FreeSurfer and ANTsX bi-hemispherical ventral diencephalon volumes). Increases in CSF volume and ventricular spaces are well-known to be associated with brain shrinkage and aging.^64,65,66
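The bound MAE $\le$ RMSE follows from the Cauchy-Schwarz inequality. As a brief derivation, write $e_i$ for the prediction error on the $i$-th of $n$ evaluation subjects (notation introduced here for illustration only):

$$\begin{aligned} \mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n} |e_i| \cdot 1 \le \frac{1}{n}\sqrt{\sum_{i=1}^{n} e_i^2}\,\sqrt{\sum_{i=1}^{n} 1^2} = \sqrt{\frac{1}{n}\sum_{i=1}^{n} e_i^2} = \mathrm{RMSE} \end{aligned}$$

Equality holds only when all absolute errors are equal, so the gap between the two measures grows with the spread of the absolute errors. A reported RMSE is therefore a conservative stand-in when comparing against MAE values from other studies.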

Similarly, the association between brain structure and fluid intelligence has been well-studied⁶⁷ despite potentially problematic philosophical and ethical issues.⁶⁸ With intentions of furthering this research, the ABCD Neurocognitive Prediction Challenge (ABCD-NP-Challenge) was held in 2019, which concerned predicting fluid intelligence scores (using the NIH Toolbox Cognition Battery)⁶⁹ in a population of 9–10-year-old pediatric subjects using T1-weighted MRI. Fluid intelligence scores were residualized from brain volume, acquisition site, age, ethnicity, genetic sex, and parental attributes of income, education, and marriage (additional data processing details are provided in the Data Supplement).⁷⁰

Of the 29 submitting teams, the first place team of the final leaderboard employed kernel ridge regression with voxelwise features based on the T1-weighted-based probabilistic tissue segmentations (specifically, CSF, gray matter, and white matter, in both modulated and unmodulated versions) for a total of six features per subject. In contrast to the winning team’s sparse set of global predictive features, the second place team used 332 total cortical, subcortical, white matter, cerebellar, and CSF volumetric features. Although they explored several machine learning modeling techniques, the authors ultimately used an ensemble of models for prediction which showed improvement over gradient boosted decision trees. From Table 3, most predictive features from our study, regardless of package, are localized measures of gray matter.

The stated primary objective of these competitions, similar to the evaluation strategy used in this work, is superior performance in algorithmic prediction of quantitative sociodemographics. However, outside of the clinical research into brain age estimation, none of these performance metrics reach the level of individual-level prediction. Consequently, these may be more informative as an interpretation of the systems-level relationship between brain structure and behavior. An obvious secondary benefit is the insight gained into the quality and relevance of measurements and modeling techniques used. In this way, these considerations touch on fundamental implications of the No Free Lunch Theorems for search and optimization⁷¹ where prior distributions (i.e., correspondence of measurements and clinical domain for algorithmic modeling) differentiate general performance. Relatedly, although all packages are represented amongst the top-performing IDPs, their relative utility is dependent, expectedly so, on the specific target variable, and, to a lesser extent, on the chosen machine learning technique. Such considerations should be made along with other relevant factors (e.g., computational requirements, open-source availability) for tailored usage.

Conclusion

The UK Biobank is an invaluable resource for large-scale epidemiological research which includes a thorough neuroimaging battery for a significant subset of the study volunteers. For quantitative exploration and inference of population trends from imaging data, well-vetted measurement tools are essential. The primary contribution that we have described is the generation and public availability of the set of UK Biobank neuroimaging structural IDPs generated using the ANTsX ecosystem. These ANTsX IDPs, which include DeepFLASH for hippocampal and extra-hippocampal parcellation, complement the existing sets of FSL and FreeSurfer IDPs. A predictive modeling strategy using a variety of sociodemographic target variables was used to explore IDP viability, importance, and utility via linear modeling.

Data availability

Data is from the UK Biobank (https://www.ukbiobank.ac.uk) under UKBB Resource Application ID 63965. Restrictions apply to the availability of these data, which are available to researchers upon application to the UK Biobank. All remaining supplementary material is available from the first author (N. Tustison) through the public GitHub repository (https://github.com/ntustison/ANTsXUKBBPublic).

References

Cornelis, M. C. et al. Age and cognitive decline in the UK biobank. PLoS One 14, e0213948 (2019).
Hagenaars, S. P. et al. Shared genetic aetiology between cognitive functions and physical and mental health in UK biobank (n=112,151) and 24 GWAS consortia. Mol. Psych. 21, 1624–1632 (2016).
Griffith, G. J. et al. Collider bias undermines our understanding of COVID-19 disease risk and severity. Nat. Commun. 11, 5749 (2020).
Miller, K. L. et al. Multimodal population brain imaging in the UK biobank prospective epidemiological study. Nat. Neurosci. 19, 1523–1536 (2016).
Alfaro-Almagro, F. et al. Image processing and quality control for the first 10,000 brain imaging datasets from UK biobank. Neuroimage 166, 400–424 (2018).
Nobis, L. et al. Hippocampal volume across age: Nomograms derived from over 19,700 people in UK biobank. Neuroimage Clin. 23, 101904 (2019).
Dadi, K. et al. Population modeling with machine learning can enhance measures of mental health. Gigascience 10, (2021).
Douaud, G. et al. SARS-CoV-2 is associated with changes in brain structure in UK biobank. Nature https://doi.org/10.1038/s41586-022-04569-5 (2022).
Jenkinson, M., Beckmann, C. F., Behrens, T. E. J., Woolrich, M. W. & Smith, S. M. FSL. Neuroimage 62, 782–90 (2012).
Zhang, Y., Brady, M. & Smith, S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans. Med. Imaging 20, 45–57 (2001).
Patenaude, B., Smith, S. M., Kennedy, D. N. & Jenkinson, M. A Bayesian model of shape and appearance for subcortical brain segmentation. Neuroimage 56, 907–22 (2011).
Griffanti, L. et al. BIANCA (brain intensity abnormality classification algorithm): A new tool for automated segmentation of white matter hyperintensities. Neuroimage 141, 191–205 (2016).
Fischl, B. FreeSurfer. Neuroimage 62, 774–81 (2012).
Iglesias, J. E. et al. A computational atlas of the hippocampal formation using ex vivo, ultra-high resolution MRI: Application to adaptive segmentation of in vivo MRI. Neuroimage 115, 117–37 (2015).
Saygin, Z. M. et al. High-resolution magnetic resonance imaging reveals nuclei of the human amygdala: Manual segmentation to automatic atlas. Neuroimage 155, 370–382 (2017).
Tustison, N. J. et al. The ANTsX ecosystem for quantitative biological and medical imaging. Sci. Rep. 11, 9068 (2021).
Avants, B. B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage 54, 2033–44 (2011).
Avants, B. B. et al. The Insight ToolKit image registration framework. Front. Neuroinfo. 8, 44 (2014).
Yoo, T. S. & Metaxas, D. N. Open science-combining open data and open source software: Medical image analysis with the insight toolkit. Med. Image Anal. 9, 503–6 (2005).
Ding, A. S. et al. Automated extraction of anatomical measurements from temporal bone CT imaging. Otolaryngol. Head Neck Surg. https://doi.org/10.1177/01945998221076801 (2022).
Diamond, K. M., Rolfe, S. M., Kwon, R. Y. & Maga, A. M. Computational anatomy and geometric shape analysis enables analysis of complex craniofacial phenotypes in zebrafish. Biol. Open 11, (2022).
Kini, L. G. et al. Quantitative [18]FDG PET asymmetry features predict long-term seizure recurrence in refractory epilepsy. Epilepsy Behav. 116, 107714 (2021).
Avants, B. B., Tustison, N. J., Wu, J., Cook, P. A. & Gee, J. C. An open source multivariate framework for n-tissue segmentation with evaluation on public data. Neuroinformatics 9, 381–400 (2011).
Tustison, N. J. et al. Large-scale evaluation of ANTs and FreeSurfer cortical thickness measurements. Neuroimage 99, 166–79 (2014).
Tustison, N. J. et al. Longitudinal mapping of cortical thickness measurements: An Alzheimer’s disease neuroimaging initiative-based evaluation study. J. Alzheimers Dis. https://doi.org/10.3233/JAD-190283 (2019).
Klein, A. & Tourville, J. 101 labeled brain images and a consistent human cortical labeling protocol. Front. Neurosci. 6, 171 (2012).
Reagh, Z. M. et al. Functional imbalance of anterolateral entorhinal cortex and hippocampal dentate/CA3 underlies age-related object pattern separation deficits. Neuron 97, 1187-1198.e4 (2018).
Schmahmann, J. D. et al. Three-dimensional MRI atlas of the human cerebellum in proportional stereotaxic space. Neuroimage 10, 233–60 (1999).
Das, S. R., Avants, B. B., Grossman, M. & Gee, J. C. Registration based cortical thickness measurement. Neuroimage 45, 867–79 (2009).
Weiner, M. W. et al. The Alzheimer’s disease neuroimaging initiative: A review of papers published since its inception. Alzheimers Dement. 8, S1-68 (2012).
Caliskan, A., Bryson, J. J. & Narayanan, A. Semantics derived automatically from language corpora contain human-like biases. Science 356, 183–186 (2017).
Tustison, N. J. et al. Instrumentation bias in the use and evaluation of scientific software: Recommendations for reproducible practices in the computational sciences. Front. Neurosci. 7, 162 (2013).
Yushkevich, P. A. et al. Quantitative comparison of 21 protocols for labeling hippocampal subfields and parahippocampal subregions in in vivo MRI: Towards a harmonized segmentation protocol. Neuroimage 111, 526–41 (2015).
Franke, K. & Gaser, C. Ten years of BrainAGE as a neuroimaging biomarker of brain aging: What insights have we gained? Front. Neurol. 10, 789 (2019).
Nyberg, L. & Wåhlin, A. The many facets of brain aging. Elife 9, (2020).
Smith, S. M. et al. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. Elife 9, (2020).
UKBB.
Glasser, M. F. et al. The minimal preprocessing pipelines for the human connectome project. Neuroimage 80, 105–24 (2013).
Falk, T. et al. U-net: Deep learning for cell counting, detection, and morphometry. Nat. Methods 16, 67–70 (2019).
Fonov, V. S., Evans, A. C., McKinstry, R. C., Almli, C. & Collins, D. L. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. NeuroImage S102, (2009).
Schlemper, J. et al. Attention gated networks: Learning to leverage salient regions in medical images. Med. Image Anal. 53, 197–207 (2019).
Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–80 (2006).
Sethian, J. A. A fast marching level set method for monotonically advancing fronts. Proc. Natl. Acad. Sci. USA 93, 1591–5 (1996).
Wang, H. et al. Multi-atlas segmentation with joint label fusion. IEEE Trans. Pattern Anal. Mach. Intell. 35, 611–23 (2013).
Brown, E. S. et al. A randomized, double-blind, placebo-controlled trial of lamotrigine for prescription corticosteroid effects on the human hippocampus. Eur. Neuropsychopharmacol. 29, 376–383 (2019).
Brown, E. S. et al. A randomized trial of an NMDA receptor antagonist for reversing corticosteroid effects on the human hippocampus. Neuropsychopharmacology 44, 2263–2267 (2019).
Holbrook, A. J. et al. Anterolateral entorhinal cortex thickness as a new biomarker for early detection of Alzheimer’s disease. Alzheimers Dement. (Amst) 12, e12068 (2020).
McMakin, D. L., Kimbler, A., Tustison, N. J., Pettit, J. W. & Mattfeld, A. T. Negative overgeneralization is associated with anxiety and mechanisms of pattern completion in peripubertal youth. Soc. Cogn. Affect Neurosci. https://doi.org/10.1093/scan/nsab089 (2021).
Nguyen, D. M. et al. The relationship between cumulative exogenous corticosteroid exposure and volumes of hippocampal subfields and surrounding structures. J. Clin. Psychopharmacol. 39, 653–657 (2019).
Sinha, N. et al. APOE ε4 status in healthy older African Americans is associated with deficits in pattern separation and hippocampal hyperactivation. Neurobiol. Aging 69, 221–229 (2018).
Tustison, N. J. & Avants, B. B. ANTsRNet GitHub. https://github.com/ANTsX/ANTsRNet
Tustison, N. J. & Avants, B. B. ANTsPyNet GitHub. https://github.com/ANTsX/ANTsPyNet
Park, M. T. M. et al. Derivation of high-resolution MRI atlases of the human cerebellum at 3T and segmentation using multiple automatically generated templates. Neuroimage 95, 217–31 (2014).
Li, H. et al. Fully convolutional network ensembles for white matter hyperintensities segmentation in MR images. Neuroimage 183, 650–665 (2018).
Kuijf, H. J. et al. Standardized assessment of automatic segmentation of white matter hyperintensities and results of the WMH segmentation challenge. IEEE Trans. Med. Imaging 38, 2556–2568 (2019).
Kuhn, M. Building predictive models in R using the caret package. J. Stat. Softw. 28, 1–26 (2008).
Chen, T. & Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining 785–794 (ACM, 2016). https://doi.org/10.1145/2939672.2939785.
Arik, S. Ö. & Pfister, T. TabNet: Attentive interpretable tabular learning. Proc. AAAI Conf. Artif. Intell. 35, 6679–6687 (2021).
Tustison, N. J. ANTsXUKBB GitHub. https://github.com/ntustison/ANTsXUKBB
Mishra, S., Beheshti, I. & Khanna, P. A review of neuroimaging-driven brain age estimation for identification of brain disorders and health conditions. IEEE Rev. Biomed. Eng. PP, (2021).
Baecker, L., Garcia-Dias, R., Vieira, S., Scarpazza, C. & Mechelli, A. Machine learning for brain age prediction: Introduction to methods and clinical applications. EBioMedicine 72, 103600 (2021).
Lombardi, A. et al. Brain age prediction with morphological features using deep neural networks: Results from predictive analytic competition 2019. Front. Psych. 11, 619629 (2020).
Gong, W., Beckmann, C. F., Vedaldi, A., Smith, S. M. & Peng, H. Optimising a simple fully convolutional network for accurate brain age prediction in the PAC 2019 challenge. Front. Psych. 12, 627996 (2021).
Murphy, D. G., DeCarli, C., Schapiro, M. B., Rapoport, S. I. & Horwitz, B. Age-related differences in volumes of subcortical nuclei, brain matter, and cerebrospinal fluid in healthy men as measured with magnetic resonance imaging. Arch. Neurol. 49, 839–45 (1992).
Matsumae, M. et al. Age-related changes in intracranial compartment volumes in normal adults assessed by magnetic resonance imaging. J. Neurosurg. 84, 982–91 (1996).
Scahill, R. I. et al. A longitudinal study of brain volume changes in normal aging using serial registered magnetic resonance imaging. Arch. Neurol. 60, 989–94 (2003).
Vieira, B. H. et al. On the prediction of human intelligence from neuroimaging: A systematic review of methods and reporting. Intelligence 93, 101654 (2022).
Eickhoff, S. B. & Langner, R. Neuroimaging-based prediction of mental traits: Road to Utopia or Orwell?. PLoS Biol. 17, e3000497 (2019).
Weintraub, S. et al. Cognition assessment using the NIH toolbox. Neurology 80, S54-64 (2013).
Pfefferbaum, A. et al. Altered brain developmental trajectories in adolescents after initiating drinking. Am. J. Psych. 175, 370–380 (2018).
Wolpert, D. H. & Macready, W. G. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1, 67–82 (1997).

Acknowledgements

Support for the research reported in this work includes funding from a combined grant from Cohen Veterans Bioscience (CVB-461), Office of Naval Research (N00014-18-1-2440), and the National Institutes of Health R01-EB031722. A.H. was supported by a generous gift from the Karen Toffler Charitable Trust.

Author information

Authors and Affiliations

Department of Radiology and Medical Imaging, University of Virginia, Charlottesville, VA, USA
Nicholas J. Tustison, James R. Stone & Brian B. Avants
Department of Neurobiology and Behavior, University of California, Irvine, CA, USA
Nicholas J. Tustison, Michael A. Yassa, Batool Rizvi & Mithra T. Sathishkumar
Department of Radiology, University of Pennsylvania, Philadelphia, PA, USA
Philip A. Cook & James C. Gee
Department of Biostatistics, University of California, Los Angeles, CA, USA
Andrew J. Holbrook
Santiago High School, Corona, CA, USA
Mia G. Tustison

Contributions

N.T. and B.A. wrote the main manuscript text and figures. N.T., M.Y., B.R., and M.S. developed the MTL segmentation pipeline. N.T., M.T., and B.A. developed the cerebellum morphology pipeline. N.T., P.C., J.G., and B.A. developed various portions of the ANTsX ecosystem. A.H. consulted on the statistical aspects of the analysis. All authors reviewed the manuscript.

Corresponding author

Correspondence to Nicholas J. Tustison.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Tustison, N.J., Yassa, M.A., Rizvi, B. et al. ANTsX neuroimaging-derived structural phenotypes of UK Biobank. Sci Rep 14, 8848 (2024). https://doi.org/10.1038/s41598-024-59440-6

Received: 17 October 2023
Accepted: 10 April 2024
Published: 17 April 2024
DOI: https://doi.org/10.1038/s41598-024-59440-6
