Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Brain charts for the human lifespan

## Abstract

Over the past few decades, neuroimaging has become a ubiquitous tool in basic research and clinical studies of the human brain. However, no reference standards currently exist to quantify individual differences in neuroimaging metrics over time, in contrast to growth charts for anthropometric traits such as height and weight1. Here we assemble an interactive open resource to benchmark brain morphology derived from any current or future sample of MRI data (http://www.brainchart.io/). With the goal of basing these reference charts on the largest and most inclusive dataset available, acknowledging limitations due to known biases of MRI studies relative to the diversity of the global population, we aggregated 123,984 MRI scans, across more than 100 primary studies, from 101,457 human participants between 115 days post-conception to 100 years of age. MRI metrics were quantified by centile scores, relative to non-linear trajectories2 of brain structural changes, and rates of change, over the lifespan. Brain charts identified previously unreported neurodevelopmental milestones3, showed high stability of individuals across longitudinal assessments, and demonstrated robustness to technical and methodological differences between primary studies. Centile scores showed increased heritability compared with non-centiled MRI phenotypes, and provided a standardized measure of atypical brain structure that revealed patterns of neuroanatomical variation across neurological and psychiatric disorders. In summary, brain charts are an essential step towards robust quantification of individual variation benchmarked to normative trajectories in multiple, commonly used neuroimaging phenotypes.

## Main

The simple framework of growth charts to quantify age-related change was first published in the late eighteenth century1 and remains a cornerstone of paediatric healthcare—an enduring example of the utility of standardized norms to benchmark individual trajectories of development. However, growth charts are currently available only for a small set of anthropometric variables, such as height, weight and head circumference, and only for the first decade of life. There are no analogous charts available for quantification of age-related changes in the human brain, although it is known to go through a prolonged and complex maturational program from pregnancy to the third decade4, followed by progressive senescence from approximately the sixth decade5. The lack of tools for standardized assessment of brain development and ageing is particularly relevant to research studies of psychiatric disorders, which are increasingly recognized as a consequence of atypical brain development6, and neurodegenerative diseases that cause pathological brain changes in the context of normative senescence7. Preterm birth and neurogenetic disorders are also associated with marked abnormalities of brain structure8,9 that persist into adult life9,10 and are associated with learning disabilities and mental health disorders. Mental illness and dementia collectively represent the single biggest global health burden11, highlighting the urgent need for normative brain charts as an anchor point for standardized quantification of brain structure over the lifespan12.

Such standards for human brain measurement have not yet materialized from decades of neuroimaging research, probably owing to the challenges of integrating MRI data across multiple, methodologically diverse studies targeting distinct developmental epochs and clinical conditions13. For example, the perinatal period is rarely incorporated in analysis of age-related brain changes, despite evidence that early biophysical and molecular processes powerfully influence life-long neurodevelopmental trajectories14,15 and vulnerability to psychiatric disorders3. Primary case–control studies are usually focused on a single disorder despite evidence of trans-diagnostically shared risk factors and pathogenic mechanisms, especially in psychiatry16,17. Harmonization of MRI data across primary studies to address these and other deficiencies in the extant literature is challenged by methodological and technical heterogeneity. Compared with relatively simple anthropometric measurements such as height or weight, brain morphometrics are known to be highly sensitive to variation in scanner platforms and sequences, data quality control, pre-processing and statistical analysis18, thus severely limiting the generalizability of trajectories estimated from any individual study19. Collaborative initiatives spurring collection of large-scale datasets20,21, recent advances in neuroimaging data processing22,23 and proven statistical frameworks for modelling biological growth curves2,24,25 provide the building blocks for a more comprehensive and generalizable approach to age-normed quantification of MRI phenotypes over the entire lifespan (see Supplementary Information 1 for details and consideration of previous work focused on the related but distinct objective of inferring brain age from MRI data). Here, we demonstrate that these convergent advances now enable the generation of brain charts that (1) robustly define normative processes of sex-stratified, age-related change in multiple MRI-derived phenotypes; (2) identify previously unreported brain growth milestones; (3) increase sensitivity to detect genetic and early life environmental effects on brain structure; and (4) provide standardized effect sizes to quantify neuroanatomical atypicality of brain scans collected across multiple clinical disorders. We do not claim to have yet reached the ultimate goal of quantitatively precise diagnosis of MRI scans from individual patients in clinical practice. However, the present work proves the principle that building normative charts to benchmark individual differences in brain structure is already achievable at global scale and over the entire life-course; and provides a suite of open science resources for the neuroimaging research community to accelerate further progress in the direction of standardized quantitative assessment of MRI data.

## Mapping normative brain growth

We created brain charts for the human lifespan using generalized additive models for location, scale and shape2,24 (GAMLSS), a robust and flexible framework for modelling non-linear growth trajectories recommended by the World Health Organization24. GAMLSS and related statistical frameworks have previously been applied to developmental modelling of brain structural and functional MRI phenotypes in open datasets19,26,27,28,29,30,31. Our approach to GAMLSS modelling leveraged the greater scale of data available to optimize model selection empirically, to estimate non-linear age-related trends (in median and variance) stratified by sex over the entire lifespan, and to account for site- or study-specific ‘batch effects’ on MRI phenotypes in terms of multiple random effect parameters. Specifically, GAMLSS models were fitted to structural MRI data from control subjects for the four main tissue volumes of the cerebrum (total cortical grey matter volume (GMV), total white matter volume (WMV), total subcortical grey matter volume (sGMV) and total ventricular cerebrospinal fluid volume (ventricles or CSF)). Supplementary Tables 1.11.8 present details on acquisition, processing and demographics of the dataset; see Methods, ‘Model generation and specification’ and Supplementary Information 1 for further details regarding GAMLSS model specification and estimation; image quality control, which used a combination of expert visual curation and automated metrics of image quality (Supplementary Information 2); model stability and robustness (Supplementary Information 3, 4); phenotypic validation against non-imaging metrics (Supplementary Information 3 and 5.2); inter-study harmonization (Supplementary Information 5); and assessment of cohort effects (Supplementary Information 6). See Supplementary Information 19 for details on all primary studies contributing to the reference dataset, including multiple publicly available open MRI datasets32,33,34,35,36,37,38,39,40,41,42.

Lifespan curves (Fig. 1, Supplementary Table 2.1) showed an initial strong increase in GMV from mid-gestation onwards, peaking at 5.9 years (95% bootstrap confidence interval (CI) 5.8–6.1), followed by a near-linear decrease. This peak was observed 2 to 3 years later than previous reports relying on smaller, more age-restricted samples43,44. WMV also increased rapidly from mid-gestation to early childhood, peaking at 28.7 years (95% bootstrap CI 28.1–29.2), with subsequent accelerated decline in WMV after 50 years. Subcortical GMV showed an intermediate growth pattern compared with GMV and WMV, peaking in adolescence at 14.4 years (95% bootstrap CI 14.0–14.7). Both the WMV and sGMV peaks are consistent with previous neuroimaging and postmortem reports45,46. By contrast, CSF showed an increase until age 2, followed by a plateau until age 30, and then a slow linear increase that became exponential in the sixth decade of life. Age-related variance (Fig. 1d), explicitly estimated by GAMLSS, formally quantifies developmental changes in between-subject variability. There was an early developmental increase in GMV variability that peaked at 4 years, whereas subcortical volume variability peaked in late adolescence. WMV variability peaked during the fourth decade of life, and CSF was maximally variable at the end of the human lifespan.

## Extended neuroimaging phenotypes

To extend the scope of brain charts beyond the four cerebrum tissue volumes, we generalized the same GAMLSS modelling approach to estimate normative trajectories for additional MRI phenotypes including other morphometric properties at a global scale (mean cortical thickness and total surface area) and regional volume at each of 34 cortical areas47 (Fig. 2, Supplementary Information 79, Supplementary Tables 1, 2). We found, as expected, that total surface area closely tracked the development of total cerebrum volume (TCV) across the lifespan (Fig. 2a), with both metrics peaking at approximately 11–12 years of age (surface area peak at 10.97 years (95% bootstrap CI 10.42–11.51); TCV peak at 12.5 years (95% bootstrap CI 12.14–12.89). By contrast, cortical thickness peaked distinctively early at 1.7 years (95% bootstrap CI 1.3–2.1), which reconciles previous observations that cortical thickness increases during the perinatal period48 and declines during later development49 (Supplementary Information 7).

We also found evidence for regional variability in volumetric neurodevelopmental trajectories. Compared with peak GMV at 5.9 years, the age of peak regional grey matter volume varied considerably—from approximately 2 to 10 years—across 34 cortical areas. Primary sensory regions reached peak volume earliest and showed faster post-peak declines, whereas fronto-temporal association cortical areas peaked later and showed slower post-peak declines (Fig. 2b, Supplementary Information 8.2). Notably, this spatial pattern recapitulated a gradient from sensory-to-association cortex that has been previously associated with multiple aspects of brain structure and function50.

## Developmental milestones

Neuroimaging milestones are defined by inflection points of the tissue-specific volumetric trajectories (Fig. 3, Methods, ‘Defining developmental milestones’). Among the total tissue volumes, only GMV peaked before the typical age at onset of puberty51, with sGMV peaking mid-puberty and WMV peaking in young adulthood (Fig. 3). The rate of growth (velocity) peaked in infancy and early childhood for GMV (5.08 months (95% bootstrap CI 4.85–5.22)), sGMV (5.65 months (95% bootstrap CI 5.75–5.83)) and WMV (2.4 years (95% bootstrap CI 2.2–2.6)). TCV velocity peaked between the maximum velocity for GMV and WMV at approximately 7 months. Two major milestones of TCV and sGMV (peak velocity and size) (Fig. 3) coincided with the early neonatal and adolescent peaks of height and weight velocity52,53. The velocity of mean cortical thickness peaked even earlier, in the prenatal period at −0.38 years (95% bootstrap CI −0.4 to −0.34) (relative to birth), corresponding approximately to mid-gestation. This early peak in cortical thickness velocity has not been reported previously—to our knowledge—in part owing to challenges in acquiring adequate and consistent signal from typical MRI sequences in the perinatal period54. Similarly, normative trajectories revealed an early period of GMV:WMV differentiation, beginning in the first month after birth with the switch from WMV to GMV as the proportionally dominant tissue compartment, and ending when the absolute difference of GMV and WMV peaked around 3 years (Supplementary Information 9). This epoch of GMV:WMV differentiation, which may reflect underlying changes in myelination and synaptic proliferation4,55,56,57,58, has not been demarcated in previous studies45,59. It was probably identified in this study owing to the substantial amount of early developmental MRI data available for analysis in the aggregated dataset (in total across all primary studies, N = 2,571 and N = 1,484 participants aged less than 2 years were available for analysis of cerebrum tissue volumes and extended global MRI phenotypes, respectively). The period of GMV:WMV differentiation encompasses dynamic changes in brain metabolites60 (0–3 months), resting metabolic rate61 (RMR) (minimum = 7 months, maximum = 4.2 years), the typical period of acquisition of motor capabilities and other early paediatric milestones62, and the most rapid change in TCV (Fig. 3).

## Individualized centile scores

We computed individualized centile scores that benchmarked each individual scan in the context of normative age-related trends (Methods, ‘Centile scores and case–control differences’ and Supplementary Information 16 for further details). This approach is conceptually similar to quantile rank mapping, as previously reported26,28,29, where the typicality or atypicality of each phenotype in each scan is quantified by its score on the distribution of phenotypic parameters in the normative or reference sample of scans, with more atypical phenotypes having more extreme centile (or quantile) scores. The clinical diversity of the aggregated dataset enabled us to comprehensively investigate case–control differences in individually specific centile scores across a range of conditions. Relative to the control group (CN), there were highly significant differences in centile scores across large (N > 500) groups of cases diagnosed with multiple disorders (Fig. 4a, Supplementary Information  10), with effect sizes ranging from medium (0.2 < Cohen’s d < 0.8) to large (Cohen’s d > 0.8) (see Supplementary Tables 3, 4 for all false discovery rate (FDR)-corrected P values and effect sizes). Clinical case–control differences in cortical thickness and surface area generally followed the same trend as volume differences (Supplementary Information 10). Alzheimer’s disease showed the greatest overall difference, with a maximum difference localized to grey matter volume in biologically female patients (median centile score = 14%, 36 percentage points difference from CN median, corresponding to Cohen’s d = 0.88; Fig. 4a). In addition, we generated a cumulative deviation metric, the centile Mahalanobis distance (CMD), to summarize a comparative assessment of brain morphology across all global MRI phenotypes relative to the CN group (Fig. 4b, Supplementary Information 1.6). Notably, schizophrenia ranked third overall behind Alzheimer’s disease and mild cognitive impairment (MCI) on the basis of CMD (Fig. 4c). Assessment across diagnostic groups, based on profiles of the multiple centile scores for each MRI phenotype and for CMD, highlighted shared and distinct patterns across clinical conditions (Supplementary Information 10, 11). However, when examining cross-disorder similarity of multivariate centile scores, hierarchical clustering yielded three clusters broadly comprising neurodegenerative, mood and anxiety, and neurodevelopmental disorders (Supplementary Information 11).

Across all major epochs of the lifespan63, the CMD was consistently greater in cases relative to controls, irrespective of diagnostic category. The largest case–control differences across epochs occurred in late adulthood when risk for dementia increases and in adolescence, which is well-recognized as a period of increased incidence of mental health disorders (Supplementary Information 10.3). In five primary studies covering the lifespan, average centile scores across global tissues were related to two metrics of premature birth (gestational age at birth: t = 13.164, P < 2 × 10−16; birth weight: t = 36.395, P < 2 × 10−16; Supplementary Information 12), such that greater gestational age and birth weight were associated with higher average centile scores. Centile scores also showed increased twin-based heritability in two independent studies (total N = 913 twin pairs) compared with non-centiled phenotypes (average increase of 11.8 percentage points in narrow sense heritability (h2) across phenotypes; Fig. 4d, Supplementary Information 13). In summary, centile normalization of brain metrics reproducibly detected case–control differences and genetic effects on brain structure, as well as long-term sequelae of adverse birth outcomes even in the adult brain10.

## Longitudinal centile changes

Owing to the relative paucity of longitudinal imaging data (about 10% of the reference dataset), normative models were estimated from cross-sectional data collected at a single time point. However, the generalizability of cross-sectional models to longitudinal assessment is important for future research. Within-subject variability of centile scores derived from longitudinally repeated scans, measured with the interquartile range (IQR) (Methods, ‘Longitudinal stability’, Supplementary Information 1.7), was low across both clinical and CN groups (all median IQR < 0.05 centile points), indicating that centile scoring of brain structure was generally stable over time, although there was also some evidence of between-study and cross-disorder differences in within-subject variability (Supplementary Information 14). Notably, individuals who changed diagnostic categories—for example, those who progressed from mild cognitive impairment to Alzheimer’s disease over the course of repeated scanning—showed small but significant increases in within-subject variability of centile scores (Supplementary Information 14, Supplementary Tables 5, 6). Within-subject variability was also slightly higher in samples from younger individuals (Supplementary Information 14), which could reflect increased noise due to the technical or data quality challenges associated with scanning younger individuals, but is also consistent with the evidence of increased variability in earlier development observed across other anthropometric traits64.

## Centile scoring of new MRI data

A key challenge for brain charts is the accurate centile scoring of out-of-sample MRI data, not represented in the reference dataset used to estimate normative trajectories. We therefore carefully evaluated the reliability and validity of brain charts for centile scoring of such ‘new’ scans. For each new MRI study, we used maximum likelihood to estimate study-specific statistical offsets from the age-appropriate epoch of the normative trajectory; we then estimated centile scores for each individual in the new study benchmarked against the offset trajectory (Fig. 5, Methods, ‘Data-sharing and out-of-sample estimation’, Supplementary Information 1.8). Extensive jack-knife and leave-one-study-out analyses indicated that a study size of N > 100 scans was sufficient for stable and unbiased estimation of out-of-sample centile scores (Supplementary Information 4). This study size limit is in line with the size of many contemporary brain MRI research studies. However, these results do not immediately support the use of brain charts to generate centile scores from smaller-scale research studies, or from an individual patient’s scan in clinical practice—this remains a goal for future work. Out-of-sample centile scores proved highly reliable in multiple test–retest datasets and were robust to variations in image processing pipelines (Supplementary Information 4).

## Discussion

We have aggregated the largest neuroimaging dataset to date to modernize the concept of growth charts for mapping typical and atypical human brain development and ageing. The approximately 100-year age range enabled the delineation of milestones and critical periods in maturation of the human brain, revealing an early growth epoch across its constituent tissue classes—beginning before 17 post-conception weeks, when the brain is at approximately 10% of its maximum size, and ending by age 3, when the brain is at approximately 80% of the maximum size. Individual centile scores benchmarked by normative neurodevelopmental trajectories were significantly associated with neuropsychiatric disorders as well as with dimensional phenotypes (Supplementary Information 5.2, 12). Furthermore, imaging–genetics studies65 may benefit from the increased heritability of centile scores compared with raw volumetric data (Supplementary Information 13). Perhaps most importantly, GAMLSS modelling enabled harmonization across technically diverse studies (Supplementary Information 5), and thus unlocked the potential value of combining primary MRI studies at scale to generate normative, sex-stratified brain growth charts, and individual centile scores of typicality and atypicality.

The analogy to paediatric growth charts is not meant to imply that brain charts are immediately suitable for benchmarking or quantitative diagnosis of individual patients in clinical practice. Even for traditional anthropometric growth charts (height, weight and BMI), there are still important caveats and nuances concerning their diagnostic interpretation in individual children66; similarly, it is expected that considerable further research will be required to validate the clinical diagnostic utility of brain charts. However, the current results bode well for future progress towards digital diagnosis of atypical brain structure and development67. By providing an age- and sex-normalized metric, centile scores enable trans-diagnostic comparisons between disorders that emerge at different stages of the lifespan (Supplementary Information 10, 11). The generally high stability of centile scores across longitudinal measurements also enabled assessment of brain changes related to diagnostic transition from mild cognitive impairment to Alzheimer’s disease (Supplementary Information 14), which provides one example of how centile scoring could be clinically useful in quantitatively predicting or diagnosing progressive neurodegenerative disorders in the future. Our provision of appropriate normative growth charts and online tools also creates an immediate opportunity to quantify atypical brain structure in clinical research samples, to leverage available legacy neuroimaging datasets, and to enhance ongoing studies.

Several important caveats are worth highlighting. Even this large MRI dataset was biased towards European and North American populations and European ancestry groups within those populations. This bias is unfortunately common in many clinical and scientific references, including anthropometric growth charts and benchmark genetic datasets, representing an inequity that must be addressed by the global scientific community68. In the particular case of brain charts, further increasing ethnic, socioeconomic and demographic diversity in MRI research will enable more population-representative normative trajectories69,70 that can be expected to improve the accuracy and strengthen the interpretation of centile scores in relation to appropriate norms26. The available reference data were also not equally distributed across all ages—for example, foetal, neonatal and mid-adulthood (30–40 years of age) epochs were under-represented (Supplementary Information 1719). Furthermore, although our statistical modelling approach was designed to mitigate study- or site-specific effects on centile scores, it cannot entirely correct for limitations of primary study design, such as ascertainment bias or variability in diagnostic criteria. Our decision to stratify the lifespan models by sex followed the analogous logic of sex-stratified anthropometric growth charts. Males have larger brain-tissue volumes than females in absolute terms (Supplementary Information 16), but this is not indicative of any difference in clinical or cognitive outcomes. Future work would benefit from more detailed and dimensional self-report variables relating to sex and gender71. The use of brain charts also does not circumvent the fundamental requirement for quality control of MRI data. We have shown that GAMLSS modelling of global structural MRI phenotypes is in fact remarkably robust to inclusion of poor-quality scans (Supplementary Information 2), but it should not be assumed that this level of robustness will apply to future brain charts of regional MRI or functional MRI phenotypes; therefore, the importance of quality control remains paramount.

We have focused primarily on global brain phenotypes, which were measurable in the largest achievable sample, aggregated over the widest age range, with the fewest methodological, theoretical and data-sharing constraints. However, we have also provided proof-of-concept brain charts for regional grey matter volumetrics, demonstrating plausible heterochronicity of cortical patterning, and illustrating the potential generalizability of this approach to a diverse range of fine-grained MRI phenotypes (Fig. 2, Supplementary Information 8). As ongoing and future efforts provide increasing amounts of high-quality MRI data, we predict an iterative process of improved brain charts for an increasing number of multimodal72 neuroimaging phenotypes. Such diversification will require the development, implementation and standardization of additional data quality control procedures27 to underpin robust brain chart modelling. To facilitate further research using our reference charts, we have provided interactive tools to explore these statistical models and to derive normalized centile scores for new datasets across the lifespan at www.brainchart.io.

## Methods

### Ethics

The research was reviewed by the Cambridge Psychology Research Ethics Committee (PRE.2020.104) and The Children’s Hospital of Philadelphia’s Institutional Review Board (IRB 20-017874) and deemed not to require PRE or IRB oversight as it consists of secondary analysis of de-identified primary datasets. Informed consent of participants (or their guardians) in primary studies is referenced in Supplementary Information 19 and Supplementary Table 1.

### Model generation and specification

To accurately and comprehensively establish standardized brain reference charts across the lifespan, it is crucial to leverage multiple independent and diverse datasets, especially those spanning prenatal and early postnatal life. Here we sought to chart normative brain development and ageing across the largest age-span and largest aggregated neuroimaging dataset to date using a robust and scalable methodological framework2,24. We used GAMLSS2 to estimate cross-sectional normative age-related trends from 100 studies, comprising a reference dataset of more than 100,000 scans (see Supplementary Tables 1.11.7 for full demographic information and Supplementary Information 19 for dataset descriptions). We optimised GAMLSS model specification and parameterization to estimate non-linear normative growth curves, their confidence intervals and first derivatives, separately for males and females, allowing for random effects on the mean and higher order moments of the outcome distributions.

The reliability of the models was assessed and endorsed by cross-validation and bootstrap resampling procedures (Supplementary Information 3). We leveraged these normative trajectories to benchmark individual scans by centile scores, which were then investigated as age-normed and sex-stratifed measures of diagnostic and longitudinal atypicalities of brain structure across the lifespan.

The GAMLSS approach allowed not only modelling of age-related changes in brain phenotypes but also age related-changes in the variability of phenotypes, and in the form of both linear and nonlinear changes over time, thereby overcoming potential limitations of conventional additive models that only allow additive means to be modelled2. In addition, study-specific offsets (mean and variance) for each brain phenotype were also modelled as random effects. These modelling criteria are particularly important in the context of establishing growth reference charts as recommended by the World Health Organization24, as it is reasonable to assume the distribution of higher order moments (for example, variance) changes with age, sex, site/study and pre-processing pipeline, and it is impossible to circumvent some of these issues by collecting standardized data longitudinally for individuals spanning the approximately 100-year age range. Furthermore, recent studies suggest that changes in between-subject variability might intersect with vulnerability for developing a mental health condition74. The use of data spanning the entire age range is also critical, as data from partial age-windows can bias estimation of growth charts when extrapolated to the whole lifespan. In short, using a sex-stratified approach24, age, preprocessing pipeline and study were each included in the GAMLSS model estimation of first order (μ) and second order (σ) distribution parameters of a generalized gamma distribution using fractional polynomials to model nonlinear trends. See Supplementary Information for more details regarding GAMLSS model specification and estimation (Supplementary Information 1), image quality control (Supplementary Information 2), model stability and robustness (Supplementary Information 3, 4), phenotypic validation against non-imaging metrics (Supplementary Information 3, 5.2), inter-study harmonization (Supplementary Information 5) and assessment of cohort effects (Supplementary Information 6).

More formally, the GAMLSS framework can be specified in the following way:

$$Y\sim F\left(\mu ,\sigma ,\nu ,\tau \right)$$
(1)
$${g}_{\mu }(\mu )={X}_{\mu }{\beta }_{\mu }+{Z}_{\mu }{\gamma }_{\mu }+\sum _{i}{s}_{\mu ,i}({x}_{i})$$
$${g}_{\sigma }(\sigma )={X}_{\sigma }{\beta }_{\sigma }+{Z}_{\sigma }{\gamma }_{\sigma }+\sum _{i}{s}_{\sigma ,i}({x}_{i})$$
$${g}_{\nu }(\nu )={X}_{\nu }{\beta }_{\nu }+{Z}_{\nu }{\gamma }_{\nu }+\sum _{i}{s}_{\nu ,i}({x}_{i})$$
$${g}_{\tau }(\tau )={X}_{\tau }{\beta }_{\tau }+{Z}_{\tau }{\gamma }_{\tau }+\sum _{i}{s}_{\tau ,i}({x}_{i})$$

Here, the outcome vector, $$Y$$, follows a probability distribution $$F$$ parameterized by up to four parameters, $$(\mu ,\sigma ,\nu ,\tau )$$. The four parameters, depending on the parameterization of the probability density function, may correspond to the mean, variance, skewness, and kurtosis—that is, the first four moments. However, for many distributions there is not a direct one-to-one correspondence. Each component is linked to a linear equation through a link-function, $${g}_{\bullet }()$$, and each component equation may include three types of terms: fixed effects, β (with design matrix X); random effects, γ (with design matrix Z); and non-parametric smoothing functions, s.,i applied to the ith covariate for each parameter. The nature of the outcome distribution determines the appropriate link functions and which components are used. In principle any outcome distribution can be used, from well-behaved continuous and discrete outcomes, through to mixtures and truncations.

Here we have used fractional polynomials as a flexible, but not unduly complex, approach to modelling age-related changes in MRI phenotypes. Although non-parametric smoothers are more flexible, they can become unstable and infeasible, especially in the presence of random effects. Hence, the fractional polynomials enter the model within the X terms, with associated coefficients in β. The GAMLSS framework includes the ability to estimate the most appropriate powers of fractional polynomial expansion within the iterative fitting algorithm, searching across the standard set of powers, $$p\in \{-2,-1,-\mathrm{0.5,0,0.5,\; 1,\; 2,\; 3}\},$$where the design matrix includes the covariate (in this case, age) raised to the power, namely, $${x}^{p}$$. Fractional polynomials naturally extend to higher-orders, for example a second-order fractional polynomial of the form, $${x}^{{p}_{1}}+{x}^{{p}_{2}}$$ (see Supplementary Information 1.3 for further details).

There are several options for including random effects within the GAMLSS framework depending on the desired covariance structures. We consider the simplest case, including a factor-level (or group-level) random intercept, where the observations are grouped by the study covariate. The random effects are drawn from a normal distribution with zero mean and variance to be estimated, γ Ν(0,δ2). The ability to include random effects is fundamental to accounting for co-dependence between observations. It is therefore possible to take advantage of the flexibility of ‘standard’ GAMLSS, as typically used to develop growth charts24,62,75, while accounting for co-dependence between observations using random effects. The typical applications of GAMLSS assume independent and identically distributed outcomes; however, in this context it is essential to account for within-study covariance implying the observations are no longer independent.

The resulting models were evaluated using several sensitivity analyses and validation approaches. These models of whole-brain and regional morphometric development were robust to variations in image quality, and cross-validated by non-imaging metrics. However, we expect that several sources of variance, including but not limited to MRI data quality and variability of acquisition protocols, may become increasingly important as brain charting methods are applied to more innovative and/or anatomically fine-grained MRI phenotypes. It will be important for future work to remain vigilant about the potential impact of data quality and other sources of noise on robustness and generalizability of both normative trajectories and the centile scores derived from them.

Based on the model selection criteria, detailed in Supplementary Information 1, the final models for normative trajectories of all MRI phenotypes were specified as illustrated below for GMV:

$$\begin{array}{c}{\rm{G}}{\rm{M}}{\rm{V}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{z}}{\rm{s}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\,\\ \log (\mu )={\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}}{)}^{2}+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )={\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\gamma }_{\sigma {\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu }\end{array}$$
(2)

For each component of the generalized gamma distribution, $$\alpha$$ terms correspond to fixed effects of the intercept, sex (female or male), and software version used for pre-processing (five categories); $$\beta$$ terms correspond to the fixed effects of age, modelled as fractional polynomial functions with the number of terms reflecting the order of the fractional polynomials; and $$\gamma$$ terms correspond to the study-level random effects. Note that we have explicitly included the link-functions for each component of the generalized gamma, namely the natural logarithm for $$\mu$$ and $$\sigma$$ (since these parameters must be positive) and the identity for $$\nu$$.

Similarly for the other global MRI phenotypes:

$$\begin{array}{c}{\rm{W}}{\rm{M}}{\rm{V}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{s}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\\ \log (\mu )={\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )={\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\gamma }_{\sigma ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu },\end{array}$$
(3)
$$\begin{array}{c}{\rm{s}}{\rm{G}}{\rm{M}}{\rm{V}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{s}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\\ \log (\mu )={\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )={\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\gamma }_{\sigma ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu },\end{array}$$
(4)
$$\begin{array}{c}{\rm{V}}{\rm{e}}{\rm{n}}{\rm{t}}{\rm{r}}{\rm{i}}{\rm{c}}{\rm{l}}{\rm{e}}{\rm{s}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{z}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\,\\ \log (\mu )=\,{\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}\,\log {({\rm{a}}{\rm{g}}{\rm{e}})}^{2}+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )=\,{\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\sigma ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log {({\rm{a}}{\rm{g}}{\rm{e}})}^{2}\\ \,\nu ={\alpha }_{\nu },\end{array}$$
(5)
$$\begin{array}{c}{\rm{T}}{\rm{C}}{\rm{V}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{z}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\\ \log (\mu )=\,{\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{3}+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )=\,{\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\sigma ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log {({\rm{a}}{\rm{g}}{\rm{e}})}^{2}+{\gamma }_{\sigma ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu }\end{array}$$
(6)
$$\begin{array}{l}{\rm{S}}{\rm{A}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{s}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\,\\ \log (\mu )=\,{\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\\ \,+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\mu ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log {({\rm{a}}{\rm{g}}{\rm{e}})}^{2}+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )=\,{\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\beta }_{\sigma ,3}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log {({\rm{a}}{\rm{g}}{\rm{e}})}^{2}+{\gamma }_{\sigma ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu },\end{array}$$
(7)
$$\begin{array}{l}{\rm{C}}{\rm{T}}\sim {\rm{G}}{\rm{e}}{\rm{n}}{\rm{e}}{\rm{r}}{\rm{a}}{\rm{l}}{\rm{i}}{\rm{z}}{\rm{e}}{\rm{d}}\,{\rm{G}}{\rm{a}}{\rm{m}}{\rm{m}}{\rm{a}}(\mu ,\sigma ,\nu )\,{\rm{w}}{\rm{i}}{\rm{t}}{\rm{h}}\,\\ \log (\mu )=\,{\alpha }_{\mu }+{\alpha }_{\mu ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\alpha }_{\mu ,{\rm{v}}{\rm{e}}{\rm{r}}}({\rm{v}}{\rm{e}}{\rm{r}})+{\beta }_{\mu ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\\ \,+{\beta }_{\mu ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-2}\,\log ({\rm{a}}{\rm{g}}{\rm{e}})+{\gamma }_{\mu ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \log (\sigma )=\,{\alpha }_{\sigma }+{\alpha }_{\sigma ,{\rm{s}}{\rm{e}}{\rm{x}}}({\rm{s}}{\rm{e}}{\rm{x}})+{\beta }_{\sigma ,1}{({\rm{a}}{\rm{g}}{\rm{e}})}^{-1}+{\beta }_{\sigma ,2}{({\rm{a}}{\rm{g}}{\rm{e}})}^{0.5}+{\gamma }_{\sigma ,{\rm{s}}{\rm{t}}{\rm{u}}{\rm{d}}{\rm{y}}}\\ \,\nu ={\alpha }_{\nu }.\end{array}$$
(8)

No smoothing terms were used in any GAMLSS models implemented in this study, although the fractional polynomials can be regarded as effectively a parametric form of smoothing. Reliably estimating higher order moments requires increasing amounts of data, hence none of our models specified any age-related fixed-effects or random effects in the $$\nu$$ term. However, $${\alpha }_{\nu }$$ was found to be important in terms of model fit and hence we have used a generalized gamma distribution (Supplementary Information 1).

### Defining developmental milestones

GAMLSS modelling also allowed us to leverage the aggregated life-spanning neuroimaging dataset to derive developmental milestones (that is, peaks of trajectories) and compare them to existing literature. The cerebrum tissue classes from 100 studies (Fig. 1, Supplementary Tables 1.11.7, Supplementary Information 18) showed clear, predominantly age-related trends, even prior to any modelling. Comparing these models with multiple non-MRI metrics of brain size demonstrated high correspondence across the lifespan (Supplementary Information 3). Peaks were determined based on the GAMLSS model output (50th centile) for each of the tissue classes and TCV, for both total tissue volumes and rates of change or growth (velocity). A similar series of methodological steps was performed for the set of extended global and regional cortical morphometric phenotypes (Fig. 2, Supplementary Information 7, 8). To further contextualize the neuroimaging trajectories, diagnostic age ranges from previous literature73,76 (blue boxes in Fig. 3) were compared with empirical age ranges of patients with a given diagnosis across the aggregated neuroimaging dataset (black boxes in Fig. 3). Note that age of diagnosis is significantly later than age of symptom onset for many disorders73. Developmental milestones were also compared to published work for brain resting metabolic rate61, from its minimum in infancy to its maximum in early childhood; anthropometric variables (height and weight), which reach a first peak in velocity during infancy and a second peak in velocity in adolescence52; typical acquisition of the six gross motor capabilities62; and pubertal age ranges as defined based on previous reports51,53.

### Centile scores and case–control differences

These normative trajectories of brain development and aging also enabled each individual scan to be quantified in terms of its relative distance from the median of the age-normed and sex-stratified distributions provided by the reference model67,77 (Fig. 4, Supplementary Information 10, 11). Individual centile scores were estimated relative to the reference curves, in a way that is conceptually similar to traditional anthropometric growth charts (Supplementary Information 1). These centiles represent a novel set of population- and age-standardized clinical phenotypes, providing the capacity for cross-phenotype, cross-study and cross-disorder comparison. A single multivariate metric (CMD, Supplementary Information 1.6) was estimated by combining centile scores on multiple MRI phenotypes for each individual (Fig. 4c). Case–control differences in centile scores were analysed with a bootstrapped (500 bootstraps) non-parametric generalization of Welch’s one-way ANOVA. Pairwise, sex stratified, post-hoc comparisons were conducted using non-parametric Monte Carlo permutation tests (10,000 permutations) and thresholded at a Benjamini–Hochberg FDR of q < 0.05.

### Longitudinal stability

To use centile scores in a diagnostically meaningful or predictive way, they need to be stable across multiple measuring points. To assess this intra-individual stability, we calculated the subject-specific IQR of centiles across timepoints for the datasets that included longitudinal scans (N = 9,306, 41 unique studies). Exploratory longitudinal clinical analyses were restricted to clinical groups that had at least 50 subjects with longitudinal data to allow for robust group-wise estimates of longitudinal variability. In addition, there was a subset of individuals with documented clinical progression over the course of longitudinal scans, for instance from mild cognitive impairment to Alzheimer’s disease, where we expected an associated change in centile scored brain structure. To test this hypothesis, we assessed whether these individuals showed longitudinal variation of centile scores (as assessed with IQR) with a direction of change consistent with their clinical progression. See Supplementary Information 14 for further details about the longitudinal stability of centile scores.

### Data sharing and out-of-sample estimation

We have provided an interactive tool (www.brainchart.io) and made our code and models openly available (https://github.com/brainchart/Lifespan). The tool allows the user to visualize the underlying demographics of the primary studies and to explore the normative brain charts in a much more detailed fashion than static images allow. It also provides the opportunity for interactive exploration of case–control differences in centile scores across many diagnostic categories that is beyond the scope of this paper. Perhaps most significantly, the brain chart interactive tool includes an out-of-sample estimator of model parameters for new MRI data that enables the user to compute centile scores for their own datasets without the computational or data-sharing hurdles involved in adding that data to the reference dataset used to estimate normative charts (Fig. 5). Bias and reliability of out-of-sample centile scoring was extensively assessed and endorsed by resampling and cross-validation studies for ‘new’ studies comprising at least 100 scans. Although already based on the largest and most comprehensive neuroimaging dataset to date, and supporting analyses of out-of-sample data, these normative brain charts will continue to be updated as additional data are made available for aggregation with the reference dataset. See Supplementary Information 1.8, 4 for further details about out-of-sample estimation.

### Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

## Data availability

Model parameters and out-of-sample centile scores are available at www.brainchart.io and on https://github.com/brainchart/Lifespan. Summary statistics are available in the Supplementary Tables (Supplementary Tables 18). Links to open datasets are also listed on https://github.com/brainchart/Lifespan. Availability of other MRI datasets aggregated here is through application procedures individually managed at the discretion of each primary study, with additional information provided in Supplementary Table 1.1 and Supplementary Information 19.

## Code availability

All code is available at https://github.com/brainchart/Lifespan.

## References

1. Cole, T. J. The development of growth references and growth charts. Ann. Hum. Biol. 39, 382–394 (2012).

2. Stasinopoulos, D. & Rigby, R. Generalized additive models for location scale and shape (GAMLSS) in R. J. Stat. Softw. 23, 1–46 (2007).

3. Gilmore, J. H., Knickmeyer, R. C. & Gao, W. Imaging structural and functional brain development in early childhood. Nat. Rev. Neurosci. 19, 123–137 (2018).

4. Tau, G. Z. & Peterson, B. S. Normal development of brain circuits. Neuropsychopharmacology 35, 147–168 (2010).

5. Grydeland, H. et al. Waves of maturation and senescence in micro-structural MRI markers of human cortical myelination over the lifespan. Cereb. Cortex 29, 1369–1381 (2019).

6. Paus, T., Keshavan, M. & Giedd, J. N. Why do many psychiatric disorders emerge during adolescence? Nat. Rev. Neurosci. 9, 947–957 (2008).

7. Jack, C. R. Jr et al. Tracking pathophysiological processes in Alzheimer’s disease: an updated hypothetical model of dynamic biomarkers. Lancet Neurol. 12, 207–216 (2013).

8. Volpe, J. J. Brain injury in premature infants: a complex amalgam of destructive and developmental disturbances. Lancet Neurol. 8, 110–124 (2009).

9. Nosarti, C. et al. Preterm birth and psychiatric disorders in young adult life. Arch. Gen. Psychiatry 69, E1–E8 (2012).

10. Wheater, E. et al. Birth weight is associated with brain tissue volumes seven decades later but not with MRI markers of brain ageing. Neuroimage Clin. 31, https://doi.org/10.1016/j.nicl.2021.102776 (2021).

11. Vigo, D., Thornicroft, G. & Atun, R. Estimating the true global burden of mental illness. Lancet Psychiatry 3, 171–178 (2016).

12. Marquand, A. F. et al. Conceptualizing mental disorders as deviations from normative functioning. Mol. Psychiatry 24, 1415–1424 (2019).

13. Brewer, J. B. Fully-automated volumetric MRI with normative ranges: translation to clinical practice. Behav. Neurol. 21, 21–28 (2009).

14. Heuer, K. & Toro, R. Role of mechanical morphogenesis in the development and evolution of the neocortex. Phys. Life Rev. 31, 233–239 (2019).

15. Werling, D. M. et al. Whole-genome and RNA sequencing reveal variation and transcriptomic coordination in the developing human prefrontal cortex. Cell Rep. 31, 107489 (2020).

16. Gandal, M. J., Leppa, V., Won, H., Parikshak, N. N. & Geschwind, D. H. The road to precision psychiatry: translating genetics into disease mechanisms. Nat. Neurosci. 19, 1397–1407 (2016).

17. Opel, N. et al. Cross-disorder analysis of brain structural abnormalities in six major psychiatric disorders: a secondary analysis of mega- and meta-analytical findings from the ENIGMA consortium. Biol. Psychiatry 88, 678–686 (2020).

18. Li, X. et al. Moving beyond processing and analysis-related variation in neuroscience. Preprint at https://doi.org/10.1101/2021.12.01.470790 (2021).

19. Peterson, M. R. et al. Normal childhood brain growth and a universal sex and anthropomorphic relationship to cerebrospinal fluid. J. Neurosurg. Pediatr. 9, 458–468 (2021).

20. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).

21. Casey, B. J. et al. The Adolescent Brain Cognitive Development (ABCD) study: imaging acquisition across 21 sites. Dev. Cogn. Neurosci. 32, 43–54 (2018).

22. Zöllei, L., Iglesias, J. E., Ou, Y., Grant, P. E. & Fischl, B. Infant FreeSurfer: an automated segmentation and surface extraction pipeline for T1-weighted neuroimaging data of infants 0–2 years. Neuroimage 218, 116946 (2020).

23. Kim, H. et al. NEOCIVET: towards accurate morphometry of neonatal gyrification and clinical applications in preterm newborns. Neuroimage 138, 28–42 (2016).

24. Borghi, E. et al. Construction of the World Health Organization child growth standards: selection of methods for attained growth curves. Stat. Med. 25, 247–265 (2006).

25. Pomponio, R. et al. Harmonization of large MRI datasets for the analysis of brain imaging patterns throughout the lifespan. Neuroimage 208, 116450 (2020).

26. Dong, H.-M. et al. Charting brain growth in tandem with brain templates at school age. Sci. Bull. 65, 1924–1934 (2020).

27. Zuo, X.-N. et al. Human connectomics across the life span. Trends Cogn. Sci. 21, 32–45 (2017).

28. Chen, H. et al. Quantile rank maps: a new tool for understanding individual brain development. Neuroimage 111, 454–463 (2015).

29. Frangou, S. et al. Cortical thickness across the lifespan: data from 17,075 healthy individuals aged 3–90 years. Hum. Brain Mapp. 43, 431–451 (2021).

30. Habes, M. et al. The Brain Chart of Aging: machine-learning analytics reveals links between brain aging, white matter disease, amyloid burden, and cognition in the iSTAGING consortium of 10,216 harmonized MR scans. Alzheimers Dement. 17, 89–102 (2021).

31. Zhang, H. et al. Growth charts for individualized evaluation of brain morphometry for preschool children. Preprint at https://doi.org/10.1101/2021.04.08.21255068 (2021).

32. Milham, M., Fair, D., Mennes, M. & Mostofsky, S. The ADHD-200 consortium: a model to advance the translational potential of neuroimaging in clinical neuroscience. Front. Syst. Neurosci. 6, 62 (2012).

33. Di Martino, A. et al. The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol. Psychiatry 19, 659–667 (2014).

34. Snoek, L. et al. AOMIC-PIOP1. OpenNeuro https://doi.org/10.18112/OPENNEURO.DS002785.V2.0.0 (2020).

35. Bilder, R. et al. UCLA Consortium for Neuropsychiatric Phenomics LA5c Study. OpenNeuro https://doi.org/10.18112/OPENNEURO.DS000030.V1.0.0 (2020).

36. Nastase, S. A. et al. Narratives. OpenNeuro https://doi.org/10.18112/OPENNEURO.DS002345.V1.1.4 (2020).

37. Alexander, L. M. et al. An open resource for transdiagnostic research in pediatric mental health and learning disorders. Sci. Data 4, 170181 (2017).

38. Snoek, L. et al. AOMIC-PIOP2. OpenNeuro https://doi.org/10.18112/OPENNEURO.DS002790.V2.0.0 (2020).

39. Richardson, H., Lisandrelli, G., Riobueno-Naylor, A. & Saxe, R. Development of the social brain from age three to twelve years. Nat. Commun. 9, 1027 (2018).

40. Kuklisova-Murgasova, M. et al. A dynamic 4D probabilistic atlas of the developing brain. Neuroimage 54, 2750–2763 (2011).

41. Snoek, L. et al. AOMIC-ID1000. OpenNeuro https://doi.org/10.18112/OPENNEURO.DS003097.V1.2.1 (2021).

42. Reynolds, J. E., Long, X., Paniukov, D., Bagshawe, M. & Lebel, C. Calgary preschool magnetic resonance imaging (MRI) dataset. Data Brief. 29, 105224 (2020).

43. Knickmeyer, R. C. et al. A structural MRI study of human brain development from birth to 2 years. J. Neurosci. 28, 12176–12182 (2008).

44. Gilmore, J. H. et al. Individual variation of human cortical structure is established in the first year of life. Biol. Psychiatry Cogn. Neurosci. Neuroimaging 5, 971–980 (2020).

45. Courchesne, E. et al. Normal brain development and aging: quantitative analysis at in vivo MR imaging in healthy volunteers. Radiology 216, 672–682 (2000).

46. Narvacan, K., Treit, S., Camicioli, R., Martin, W. & Beaulieu, C. Evolution of deep gray matter volume across the human lifespan. Hum. Brain Mapp. 38, 3771–3790 (2017).

47. Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980 (2006).

48. Vasung, L. et al. Association between quantitative MR markers of cortical evolving organization and gene expression during human prenatal brain development. Cereb. Cortex 31, 3610–3621 (2021).

49. Fjell, A. M. et al. Development and aging of cortical thickness correspond to genetic organization patterns. Proc. Natl Acad. Sci. USA 112, 15462–15467 (2015).

50. Sydnor, V. J. et al. Neurodevelopment of the association cortices: patterns, mechanisms, and implications for psychopathology. Neuron 109, 2820–2846 (2021).

51. Tanner, J. M. Growth at Adolescence 2nd edn (Thomas, 1962).

52. Bozzola, M. & Meazza, C. In Handbook of Growth and Growth Monitoring in Health and Disease (ed. Preedy, V. R.) 2999–3011 (2012).

53. Tanner, J. M., Whitehouse, R. H. & Takaishi, M. Standards from birth to maturity for height, weight, height velocity, and weight velocity: British children, 1965. I. Arch. Dis. Child. 41, 454–471 (1966).

54. Dubois, J. et al. The early development of brain white matter: a review of imaging studies in fetuses, newborns and infants. Neuroscience 276, 48–71 (2014).

55. Huttenlocher, P. R. & Dabholkar, A. S. Regional differences in synaptogenesis in human cerebral cortex. J. Comp. Neurol. 387, 167–178 (1997).

56. Mountcastle, V. B. The columnar organization of the neocortex. Brain 120, 701–722 (1997).

57. Petanjek, Z. et al. Extraordinary neoteny of synaptic spines in the human prefrontal cortex. Proc. Natl Acad. Sci. USA 108, 13281–13286 (2011).

58. Miller, D. J. et al. Prolonged myelination in human neocortical evolution. Proc. Natl Acad. Sci. USA 109, 16480–16485 (2012).

59. Holland, D. et al. Structural growth trajectories and rates of change in the first 3 months of infant brain development. JAMA Neurol. 71, 1266–1274 (2014).

60. Blüml, S. et al. Metabolic maturation of the human brain from birth through adolescence: insights from in vivo magnetic resonance spectroscopy. Cereb. Cortex 23, 2944–2955 (2013).

61. Kuzawa, C. W. et al. Metabolic costs and evolutionary implications of human brain development. Proc. Natl Acad. Sci. USA 111, 13010–13015 (2014).

62. WHO Multicentre Growth Reference Study Group. WHO Motor Development Study: windows of achievement for six gross motor development milestones. Acta Paediatr. 95, 86–95 (2007).

63. Kang, H. J. et al. Spatio-temporal transcriptome of the human brain. Nature 478, 483–489 (2011).

64. Sorva, R., Lankinen, S., Tolppanen, E. M. & Perheentupa, J. Variation of growth in height and weight of children. II. After infancy. Acta Paediatr. Scand. 79, 498–506 (1990).

65. Elliott, L. T. et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nat. Neurosci. 562, 210–216 (2018).

66. Hendrickson, M. A. & Pitt, M. B. Three areas where our growth chart conversations fall short—room to grow. JAMA Pediatr. 176, 123–124 (2021).

67. Marquand, A. F., Rezek, I., Buitelaar, J. K. & Beckmann, C. F. Understanding heterogeneity in clinical cohorts using normative models: beyond case–control studies. Biol. Psychiatry 80, 552–561 (2016).

68. Shen, F. X. et al. Emerging ethical issues raised by highly portable MRI research in remote and resource-limited international settings. Neuroimage 238, 118210 (2021).

69. Sharma, E. et al. Consortium on Vulnerability to Externalizing Disorders and Addictions (cVEDA): a developmental cohort study protocol. BMC Psychiatry 20, 2 (2020).

70. Liu, S. et al. Chinese Color Nest Project: an accelerated longitudinal brain-mind cohort. Dev. Cogn. Neurosci. 52, 101020 (2021).

71. Shansky, R. M. & Murphy, A. Z. Considering sex as a biological variable will require a global shift in science culture. Nat. Neurosci. 24, 457–464 (2021).

72. Engemann, D. A. et al. Combining magnetoencephalography with magnetic resonance imaging enhances learning of surrogate-biomarkers. eLife 9, e54055 (2020).

73. Solmi, M. et al. Age at onset of mental disorders worldwide: large-scale meta-analysis of 192 epidemiological studies. Mol. Psychiatry https://doi.org/10.1038/s41380-021-01161-7 (2021).

74. Wierenga, L. M. et al. Greater male than female variability in regional brain structure across the lifespan. Hum. Brain Mapp. 43, 470–499 (2020).

75. Heude, B. et al. A big-data approach to producing descriptive anthropometric references: a feasibility and validation study of paediatric growth charts. Lancet Digit Health 1, e413–e423 (2019).

76. Erkkinen, M. G., Kim, M.-O. & Geschwind, M. D. Clinical neurology and epidemiology of the major neurodegenerative diseases. Cold Spring Harb. Perspect. Biol. 10, a033118 (2018).

77. Bethlehem, R. A. I. et al. A normative modelling approach reveals age-atypical cortical thickness in a subgroup of males with autism spectrum disorder. Commun Biol 3, 486 (2020).

## Author information

Authors

### Contributions

R.A.I.B., J.S., S.R.W., E.T.B. and A.F.A.-B. designed the study, conducted analyses, wrote and edited the manuscript. J.V. and K.M.A. helped to design the study and contributed to data analysis. All other authors made substantial contributions to the conception or design of the work, the acquisition, analysis or interpretation of data, the creation of new software used in the work, or drafted or substantively revised the Article.

### Corresponding authors

Correspondence to R. A. I. Bethlehem or J. Seidlitz.

## Peer review

### Peer review information

Nature thanks Michael Harms, Michael Milham and the other, anonymous, reviewers for their contribution to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Supplementary information

### Supplementary Information

This file includes common nomenclature, Supplementary Methods, sensitivity analyses, supplementary analyses, reference database details, replication/validation datasets, a note on data sharing, affiliations, Acknowledgements and supplementary references.

### Supplementary Table 1

Demographic information. SI1.1, sample size and acquisition parameters for each individual study. SI1.2–SI1.42, demographic information per study and sex for each individual brain imaging phenotype (that is, SI1.2 has demographics for all control subjects included in the GMV model, SI1.3 for all subjects included in the WMV model, etc.). Thus these contain all demographics and sample sizes related to the models presented in Figs. 1, 2.

### Supplementary Table 2

Milestones. SI2.1, information on volumetric peaks and rate of change peaks for the global neuroimaging phenotypes as presented in Figs. 1–3. SI2.2, information on volumetric peaks and rate of change peaks for the regional phenotypes as presented in Fig. 2.

### Supplementary Table 3

Case–control centile comparisons. SI3.1–3.7, statistics and sample size for case–control comparisons presented in Fig. 4a (that is, for clinical groups with N > 500) and SI10. Each table reports the sample sizes, test-statistics, effect sizes and corrected and uncorrected P-values per case–control pair and per sex. Corrected P-values represent P-values adjusted using Benjamini–Hochberg FDR correction for multiple comparisons across all case–control pairs of a specific sex within a specific feature. SI3.8–3.14, same statistics as 3.1–3.7 for all clinical groups with N > 250. SI3.15–3.21, same statistics as 3.1–3.7 for all clinical groups with N > 100. SI3.22–3.28, same statistics as 3.1–3.7 for all clinical groups with N > 50.

### Supplementary Table 4

Centile distributions. SI4.1–4.7, statistics on centile distributions of global neuroimaging phenotypes, presenting the estimated number of peaks and the accompanying Hartigan’s dip-test statistic for unimodality per sex per phenotype.

### Supplementary Table 5

Case–control IQR comparisons. SI5.1–5.7, statistics of the case–control comparisons of the interquartile range for subjects that had longitudinal data. Each table reports the sample sizes, test-statistics, effect sizes and corrected and uncorrected P-values per case–control pair and per sex. Corrected P-values represent P-values adjusted using Benjamini–Hochberg FDR correction for multiple comparisons across all case–control pairs of a specific sex within a specific feature.

### Supplementary Table 6

Case–control IQR comparisons for subjects with diagnostic changes. SI6.1–6.7, statistics of the case–control comparisons of the interquartile range for subjects that had longitudinal data and changed diagnostic labels during the course of their longitudinal assessment. Each table reports the sample sizes, test-statistics, effect sizes and corrected and uncorrected P-values per case–control pair and per sex. Corrected P-values represent P-values adjusted using Benjamini–Hochberg FDR correction for multiple comparisons across all case–control pairs of a specific sex within a specific feature. SI6.8.614, demographics for the groups compared in each phenotype.

### Supplementary Table 7

Leave-one-study-out statistics. SI7.1–7.7, statistics of the per study stability of centiles based on the leave-one-study-out (LOSO) analysis. For each study and each feature the Pearson correlation (and corresponding t-value), uncorrected P-value, confidence intervals and degrees of freedom are listed comparing the centiles estimated within the model versus outside the model (that is, using the out-of-sample estimation method).

### Supplementary Table 8

Centile sex differences. SI8.1–8.7, statistics of linear models assessing the effects of sex on centile scores within the clinical cohorts (patient N > 500), including beta estimates, standard errors, degrees of freedom, t-value and uncorrected P-values for all main effects and interactions.

## Rights and permissions

Reprints and Permissions

Bethlehem, R.A.I., Seidlitz, J., White, S.R. et al. Brain charts for the human lifespan. Nature 604, 525–533 (2022). https://doi.org/10.1038/s41586-022-04554-y

• Accepted:

• Published:

• Issue Date:

• DOI: https://doi.org/10.1038/s41586-022-04554-y

• ### A Chinese multi-modal neuroimaging data release for increasing diversity of human brain mapping

• Peng Gao
• Hao-Ming Dong
• Xi-Nian Zuo

Scientific Data (2022)

• ### Charting brain development across the human lifespan

• Heather Wood

Nature Reviews Neurology (2022)

• ### Lessons from infant learning for unsupervised machine learning

• Tarek R. Besold
• Rhodri Cusack

Nature Machine Intelligence (2022)

• ### The normative modeling framework for computational psychiatry

• Saige Rutherford
• Seyed Mostafa Kia
• Andre F. Marquand

Nature Protocols (2022)

• ### Your brain expands and shrinks over time — these charts show how

• Max Kozlov

Nature (2022)