A canonical trajectory of executive function maturation from adolescence to adulthood

Tervo-Clemmens, Brenden; Calabro, Finnegan J.; Parr, Ashley C.; Fedor, Jennifer; Foran, William; Luna, Beatriz

doi:10.1038/s41467-023-42540-8

Download PDF

Article
Open access
Published: 30 October 2023

A canonical trajectory of executive function maturation from adolescence to adulthood

Nature Communications volume 14, Article number: 6922 (2023) Cite this article

15k Accesses
5 Citations
418 Altmetric
Metrics details

Subjects

Abstract

Theories of human neurobehavioral development suggest executive functions mature from childhood through adolescence, underlying adolescent risk-taking and the emergence of psychopathology. Investigations with relatively small datasets or narrow subsets of measures have identified general executive function development, but the specific maturational timing and independence of potential executive function subcomponents remain unknown. Integrating four independent datasets (N = 10,766; 8–35 years old) with twenty-three measures from seventeen tasks, we provide a precise charting, multi-assessment investigation, and replication of executive function development from adolescence to adulthood. Across assessments and datasets, executive functions follow a canonical non-linear trajectory, with rapid and statistically significant development in late childhood to mid-adolescence (10–15 years old), before stabilizing to adult-levels in late adolescence (18–20 years old). Age effects are well captured by domain-general processes that generate reproducible developmental templates across assessments and datasets. Results provide a canonical trajectory of executive function maturation that demarcates the boundaries of adolescence and can be integrated into future studies.

The developmental trajectories of executive function from adolescence to old age

Article Open access 14 January 2021

The development of structural covariance networks during the transition from childhood to adolescence

Article Open access 04 May 2021

Baseline brain function in the preadolescents of the ABCD Study

Article 07 June 2021

Introduction

Adolescence is a unique period of the lifespan, initiated by puberty and characterized by the maturation of cognitive, affective, and social processes that culminate in a transition to independence and adulthood^1,2,3. Among maturational processes, theories from neuroscience and psychology have placed a particular emphasis on the development of goal-directed cognitive abilities (e.g., response inhibition, working memory, task-switching, and planning behaviors) that are hypothesized to index a common process of executive function or cognitive control^4,5,6. In parallel to socioemotional development and environmental influences, a protracted maturation^7,8 and/or stabilization of executive function⁹ into adulthood has been suggested to contribute to lifespan peaks in risk-taking behaviors (e.g., substance use initiation¹⁰; though see also refs. ^11,12) and increased vulnerability to psychiatric disorders¹³ during adolescence. Ongoing executive function changes during adolescence have been used in colloquial, legal^9,14, and scientific (see⁴ for review) contexts to differentiate adolescents from adults and clarify adolescence as a period of continued development.

Adolescent executive function development has been studied in relatively small (N’s ~200^15,16) independent investigations using a broad range of tasks or in relatively large studies (N’s ~ 1000)^17,18 using very narrow assessments of executive function. No large-scale, multi-assessment, multi-dataset reproducibility investigations of adolescent executive function development have been performed. Further, common analytic approaches do not quantitatively define maturational timing and/or plateaus toward adult-levels of performance. The magnitude of executive function changes during adolescence, the precise timing of when adolescents reach adult-levels, and the potential diversity of processes assessed by varying executive function tasks, thus remains widely debated.

Empirical research suggests that while adolescents can perform complex, goal-directed behaviors that rely on executive functions, their performance is not as accurate or as fast as adults^{5,15,16,19,20,21,22,23,24}. Age-related increases in correct response rates (accuracy) and decreases in the speed of responses (i.e., latency/reaction time) have been observed for a broad range of laboratory-based and neuropsychological executive function tasks (e.g., working memory, response inhibition, switching, planning) during adolescence (see refs. ^1,25 for reviews). Theoretical models built from these observations and related observations in animal studies²⁶, as well as broader²⁷ and historical perspectives of psychological development²⁸, have led to hypotheses suggesting that cognitive development continues through adolescence and may reach maturity in the second decade of life (e.g., by 20 years old^9,29) or later (e.g., ≥ 25 years old^8,27) in humans. A range of methodological, analytic, and data availability challenges, however, have thus far prevented direct and comprehensive testing of the maturational timing of adolescent executive function development and the specific age when executive functions reach adult-levels. Nevertheless, understanding not just whether behaviors are changing with age, but also their shape and form, is fundamental to developmental science^15,30,31,32 and corresponding health policies and intervention/prevention strategies for adolescents. Defining the shape and form of cognitive development likewise has key implications for research on mechanisms of ongoing (potentially critical period) plasticity and factors influencing the opening and closing of the adolescent period³.

There are unique challenges to defining the normative maturational timing of adolescent executive function development that arises from multiple sources, including inter-individual differences among participants and across datasets, difficulties in designing analytic frameworks that directly assess maturational timing³³, and potential variability among the many tasks designed to assess executive function^5,34. The first of these challenges is beginning to be addressed through larger study designs (e.g., Nathan Kline Institute-Rockland Sample³⁵ [NKI], National Consortium on Alcohol and Neurodevelopment in Adolescence³⁶ [NCANDA], Philadelphia Neurodevelopmental Cohort³⁷ [PNC]) and data aggregation techniques, as increasing dataset sizes and the inclusion of multiple datasets can better overcome sampling variability^38,39 to estimate generalizable normative developmental trajectories. Addressing the latter challenges, however, requires conceptual and methodological advancements.

Initial investigations in adolescent research have often relied on a fixed-developmental shape (e.g., linear, inverse linear, quadratic regression models) or categorical comparisons (e.g., adolescents versus children/adults) to identify age-related differences^32,33. While essential to establish that age-related differences in executive function generally occur during adolescence, such fixed-developmental, parametric comparisons prevent the systematic investigation of the relative rate and timing of maturation that is essential for precise developmental science. Such approaches likewise have prevented resolution of foundational theories of adolescent neurobehavioral development, where distinct linear and non-linear shapes have been proposed⁴. Therefore, while prominent theories suggest adolescents may reach adult-levels of executive function between 20- and 25 years old, such plateaus in developmental change have not been investigated in most empirical research and are not testable within commonly used analytic frameworks. This lack of resolution on the maturational timing of adolescent executive function also poses challenges for related lifespan research, where a potentially distinct developmental concept of emerging adulthood (~18–25 years old⁴⁰) has likewise been justified, in part, by potential ongoing cognitive changes. New methods (e.g., general additive models⁴¹) that can quantitatively define the potentially non-linear developmental trajectory of executive function during adolescence, as well as multiple large publicly available datasets, now allow for precise estimation of the maturational timing of executive function.

A further challenge to defining the maturational timing of executive function development arises from the potential variability among the many tasks designed to assess executive function. Empirical studies have often focused on an individual, or a relatively narrow subset, of tasks (see for example discussion in refs. ^5,42). Fewer studies have therefore investigated the developmental similarity among potential subprocesses indexed by the dozens of laboratory-based and neuropsychological executive function measures used in the broader literature. While there is a growing use of standardized tasks (e.g., Delis-Kaplan Executive Function System⁴³ [D-KEFS], Cambridge Neuropsychological Test Automated Battery⁴⁴ [CANTAB], Penn Computerized Neurocognitive Battery⁴⁵ [Penn CNB]), systematic comparisons across these instruments are similarly limited. Many neurodevelopmental and psychological theories^4,7,8,9 emphasize a broad unitary process of executive function development, suggesting the maturation of performance on any one of these tasks may generalize to broader executive function development. However, alternative perspectives have also been proposed. Prior work in adults (both healthy college students⁴², as well as patients with frontal lobe damage⁴⁶), for example, has suggested that executive function tasks support a unity/diversity framework, where commonality and correlation are observed amongst all executive function measures (unity), but individual aspects of executive function maintain a degree of separability (diversity). Owing to the focus on individual functions and tasks or narrow subsets in most adolescent research, it nevertheless remains unclear whether adolescent executive function development is driven by multiple, independent processes (diversity) and/or the maturation of a more common domain-general process (unity).

Here we aggregate four large-scale, independent datasets to construct a comprehensive set of executive function data spanning the entire adolescent period as well as the relative transitional periods of late childhood and early adulthood (total age range: 8–35, total N = 10,766, total visits = 13,817) that includes 23 executive function measures from 17 distinct tasks. In addition to large-scale replication, we directly address prior challenges in defining the maturational timing and domain-generality of adolescent executive function development with multiple large independent cohorts (two longitudinal, two cross-sectional), non-linear modeling approaches that directly define maturational timing, and the inclusion of a broad executive function battery that permit the investigation of both potential unitary and diversity processes. Taken together, this work identifies a canonical non-linear developmental trajectory of executive function maturation that generalizes across datasets and assessments, with rapid age-related change from late childhood to early adolescence (10–15 years old), small but significant changes in mid-adolescence (15–18 years old), before stabilizing to adult-levels in late adolescence (18–20 years old). The similarity in developmental trajectories is well accounted for by domain-general processes consistent with theories of unitary executive function and fluid cognition. The insights and data developed here can inform neuroscientific and psychological theories of the adolescent period and guide future translational research in health and disease.

Results

Executive function development follows a canonical trajectory across datasets and tasks

Participants ranging from 8–35 years old (Supplementary Fig. S1) were drawn from two large longitudinal studies of executive function development, including data collected by our group (Luna, N = 196, total visits = 666) and data collected as part of the National Consortium on Alcohol and Neurodevelopment (NCANDA³⁶, N = 831, total visits = 3412), as well as two large cross-sectional studies, including data collected as part of the Nathan Kline Institute-Rockland Sample (NKI³⁵, N = 588), and data from the Philadelphia Neurodevelopmental Cohort (PNC³⁷, N = 9151). Studies relied on community-based samples from across the United States (see Methods) that were balanced for biological sex at birth and in the aggregate, were consistent with national patterns of race and ethnicity (Supplementary Table S1). Family income varied both within and between datasets, but as in previous reports across behavioral sciences⁴⁷, was generally higher than national averages (Supplementary Table S1). Secondary analyses however, suggested the sample composition of included datasets well approximated broader population patterns for primary results (See Supplementary Methods, Supplementary Fig. S2). Across the studies, participants performed a variety of executive function tasks (twenty-three measures from seventeen distinct EF tasks; Supplementary Table S2), including those designed to measure processes of response inhibition (e.g., Antisaccade, Stroop), working memory (e.g., Spatial Span), planning (e.g., Stockings of Cambridge), as well as those from standard computerized neurocognitive batteries (Penn Computerized Neurocognitive Battery⁴⁵) that include tasks designed to measure executive function (e.g., Conditional Exclusion Test, N-Back Test, Continuous Performance Test³⁴) and a neuropsychological executive function battery (Delis-Kaplan Executive Function System [D-KEFs]: Tower, Trails, Design Fluency, Color-Word Interference). Response types included button presses, eye movements, and experimenter-administered neuropsychological performance (e.g., D-KEFs). For most tasks, both latency (speed of responses) and accuracy (e.g., correct response rate) measures were available (see also Methods).

We first examined the developmental trajectory of each executive function measure independently using non-linear regression models with penalized splines (general additive mixed models (GAMM) for longitudinal data; general additive models (GAM) for cross-sectional data: see Methods). Unlike the fixed-developmental shape approaches that are typically used in adolescent research, this allowed us to estimate flexible, data-driven trajectories and explore the shape of development (functional form of age) for each executive function measure. These analyses revealed that nearly all executive function measures (20/23 measures) had corrected significant (corrected p’s < 0.004, [two-sided], calculated via default procedures of GAM that performs an equality test of parameters of the smoothed term to zero⁴⁸; see Supplementary Table S3 for full statistics as well reproducible variable names from public datasets) age-related differences and followed a similar non-linear developmental trajectory, with rapid development in late childhood to mid-adolescence (10–15 years old), smaller changes through mid-adolescence (15–18 years old), before stabilizing to adult-levels in late adolescence (18–20 years old) (Fig. 1A–D). Critically, age-related differences were observed across nearly all tasks from all four independent datasets, with accuracy measures showing significant age-related increases and latency measures showing parallel significant age-related decreases (Fig. 1A–D). The average total age-related change (max-min of GAM/GAMM fits) was large based on conventional effect size standards (mean across all measures from all datasets in standard deviation [z] units: 1.38; Fig. 1A–D). Overlapping visualization of all measures with significant age-related differences from all datasets further highlights a potential canonical shape of normative adolescent executive function development (Fig. 1E).

**Fig. 1: Age trajectories of executive function measures.**

Executive function significantly develops through late adolescence

To precisely quantify periods of significant developmental differences and estimate when measures reached adult-levels, we next examined the local slope (first derivative) of age-related differences across all ages in 1/10th of year intervals for all non-linear GAMM/GAM models. As in prior developmental research in other domains^49,50, a simulation approach (10,000 iterations) was used to construct confidence intervals for the first derivative of the fitted models to assess statistically significant age-related differences at each age (p < 0.05 [two-sided] via simultaneous confidence intervals⁵¹ to account for multiple tests across ages: see Methods). Age-ranges in which the simultaneous 95% confidence interval of the first derivative of the GAM/GAMM fits did not include zero (p < 0.05, two-sided) were classified as statistically significant. We note that a thresholded 95% confidence interval (an unthreholded version can be viewed in full in Supplementary Fig. S3), instead of for example exact p-values, are provided here as in previous work to highlight age ranges of statistical significance^49,50. Consistent with theoretical models of adolescence, significant (p < 0.05 [two-sided] via simultaneous confidence intervals) age-related changes in executive function accuracy (increases) and latency (decreases) were observed during early to middle adolescent periods (10–15 years old) for nearly all measures (Fig. 2A–D). Effect size benchmarks do not yet exist for short-timescale developmental changes, however given the short span of age examined via the derivative (units scaled to per-year change) and the total age-related changes (Fig. 1A–E), local effect sizes are judged to be large (e.g., mean z unit change from 10–15 years old: .142 per-year [accuracy]; −.175 per-year [latency], Fig. 2E; see Fig. 2A–D for z unit scaling for all measures). From middle to late adolescent periods (15–18 years old), smaller but still statistically significant (p < 0.05 [two-sided] via simultaneous confidence intervals) changes were observed for several measures (Fig. 2A–D). After late adolescence (>18 years old), very few measures exhibited statistically significant (p < 0.05 [two-sided] via simultaneous confidence intervals) age-related change (Fig. 2A–D).

**Fig. 2: Developmental periods with significant age-related change in executive function.**

Aggregate analysis across measures and tasks (three-level pointwise meta-analysis: see Methods) support the inference from individual measures and datasets (Fig. 1A–E; see also Supplementary Fig. S4), with statistically significant (p < 0.05 [two-sided] via simultaneous confidence intervals) age-related differences detected throughout early to late adolescent periods (10–18 years old) for both accuracy and latency measures (Fig. 2E). While statistically significant (p < 0.05 [two-sided] via simultaneous confidence intervals) age-related differences could also be observed in this highly powered aggregate analysis until 20 years old for accuracy measures (Fig. 2E), the absolute magnitude of these effects were very small after 18 years old (mean z unit change in accuracy per-year between 18–20-years old: .023 [~1/5^th the average change observed between 10- and 15 years old]); Fig. 2E). A parallel analysis examining the magnitude of change among those measures with statistically significant overall age effects (corrected p’s < 0.004, [two-sided]; see also Fig. 1, Supplementary Table S3) likewise demonstrates that, on average, over 95.0 and 99.7% of the total detectable age-related change between 8–35 years old occurs prior to 18 years old for accuracy and latency, respectively (Supplementary Fig. S5). These results provide robust and reproducible evidence of statistically significant and developmentally specific changes in executive function during early through mid-adolescence that reach maturity between 18 years old and 20 years old and reinforce that adolescence is a period of ongoing development of goal-directed cognition and executive function. A normative maturational stability towards adult-levels of executive function by late adolescence (18- to 20 years old) is highly consistent with what has been theorized in heuristic models of adolescence (~20 years old), but notably earlier than lifespan accounts suggesting executive function changes continue to occur during emerging adulthood (18–25 years old).

Adolescent executive function development is predominantly domain general

Building from the observation that nearly all executive function measures showed the same developmental trajectory and relative maturational timing, we next examined the potential shared information across measures at the per-participant level using between-person (all datasets) and within-person (Luna, NCANDA) correlations and factor analysis (see Methods). Composite metrics were not used here, as they are by construction (linear sums of original measures) correlated with multiple measures. Consistent with a domain general, unity process of executive function, per-participant scores across nearly all measures were moderately correlated (see Methods) in all datasets in both between-person (cross-sectional) and within-person (longitudinal) analyses (Fig. 3A; mean linear, bivariate correlation from data aggregation (“all measures”) |r| = 0.261; Supplementary Table S4 for correlation matrices). Exploratory factor analysis likewise demonstrated that a single domain general factor (via bifactor rotation) explains over 20% (21.9%) of total executive function variance on average, across datasets (Fig. 3B). There was no systematic evidence that the total executive function variance explained by a single domain general factor varied by age (Supplementary Fig. S6). While certain data-driven thresholds to determine the number of supported latent factors (parallel analysis, optimal coordinate, acceleration factor, and a factor analytic Kaiser rule; see Methods, Supplementary Fig. S7) suggest the inclusion of a second or third factor across datasets (Fig. 3B), these factors account for very small amounts of executive function variance (on average, ~6 and 2% respectively, see Fig. 3B; Supplementary Fig. S7 for individual datasets) after accounting for the domain general factor (via bifactor rotation). Visual inspection of loadings for secondary and tertiary factors demonstrate that these factors tended to capture residual effects from specific, single measures or methods (e.g., eye-tracking) or similar, broad domain general patterns (see Supplementary Fig. S7). Additional factors beyond these (4 or more factors) were not suggested for any dataset, under any data-driven threshold (Fig. 3B; Supplementary Fig. S7). Combined, these results provide evidence across studies for a single domain general factor or unity framework of executive function factor that accounts for variance across tasks (see Fig. 3C), although further work with expanded measures can help clarify potential diversity and domain-specific executive function performance (see Discussion).

**Fig. 3: Correlation and factor structure of executive function measures.**

Beyond the general dimensionality of participant-level, individual differences, a primary goal of the current work was to determine the timing and complexity of age-related differences in executive function from adolescence to adulthood. Therefore, we next tested the extent to which age-related, developmental differences in any one specific executive function measure could be explained by the general executive function processes supported in our previous analyses. Through nested model comparisons (see Methods), we determined the percentage of age-related differences on each specific executive function measure explained by a single domain general composite metric of the accuracy and latency metrics from the remaining tasks in the dataset (“leave one task out” composite metric; see Supplementary Fig. S8 for visualization of this procedure) versus a measure and/or task-specific process. As the broadest test of such a domain-general executive function process and consistent with prior suggestions from related literature in aging^52,53, in datasets (Luna, NKI) where multiple measures had the same putative, primary executive function subdomain (see first listed Domain in Supplementary Table S2), these measures were likewise left out of the composite metric (“leave out all measures from the same domain”). To further maximize comparability across studies and to prevent bias from shared, non-executive function visit effects (e.g., practice effects; see Sensitivity Analyses and Supplementary Fig. S9), analyses here were performed with the larger cross-sectional data, but were consistent with longitudinal data (cf., within-person factor structure in Fig. 3B, D).

Results demonstrated that a general component of executive function (as a single composite metric) often explained more than half of age-related information (via deviance testing in model comparison; see Supplementary Fig. S8) in individual executive function measures, with age effects for several measures nearly fully explained by a general executive function process (Fig. 4A–D). Aggregate analysis (three-level meta-analysis) revealed that on average, close to three-fourths (i.e., 75%) of age-related information in any one executive function measure could be explained by a domain-general executive function process (via a single composite metric of [equally weighted] out-of-domain measures; percentage of explained age-related deviance by common executive function for accuracy measures: 79.3%, latency measures: 70.6%; Fig. 4E). There was, however, notable variability between the proportion of explained variance by common executive function across datasets. One possible explanation for these differences is that the datasets (NCANDA, PNC) with fewer executive function measures have less precision to estimate a domain general executive function process. Consistent with this, the percentage of age-related information explained by a common executive function process decreased and became more variable across measures in Luna and NKI datasets in simulations that used iteratively smaller numbers of variables to estimate an executive function composite (see Supplementary Fig. S10). Combined, these results provide the strongest evidence for a core domain general or unitary process related to observed age-related differences in executive function that is reproducible across measures and datasets. Together with our previous analyses, these results support adolescence as a potentially specific period of the lifespan of ongoing executive function, where a core unitary maturational process may give rise to improvements across related but distinct assessments.

**Fig. 4: Contributions from domain-general versus specific processes to age-related differences in executive function.**

Scaled domain general executive function scores generate reproducible normative maturational templates across datasets and tasks

Having established that executive function measures follow a canonical developmental trajectory during adolescence and age-related changes are well captured by domain-general processes, our final analyses sought to build upon these results to create normative maturational templates applicable across datasets and tasks. That is, if a substantial portion of executive function development follows the same trajectory (Figs. 1, 2) and is driven by a common, domain-general process (Figs. 3, 4), we tested whether a simplified normative template of change would be representative across new datasets and tasks and could be used to quantitatively guide future research.

A standard growth chart⁵⁴ constructs a normative template of developmental change and inter-individual variability (e.g., percentile) for a single assessment with a single scale of measurement (e.g., height in inches). Executive function, however, is assessed with dozens of different measurements⁵ and owing to the potential range of participant ages included in any one developmental dataset, the total extent of observed individual variability may substantially differ across datasets, even if developmental change proceeds according to the shape of the canonical executive function trajectory. In the current datasets, scaling to adult performance (standard deviation units based on performance of 20–30-year-olds in each dataset; see Methods) to approximate a common scale provides a further robust demonstration of the shape of the canonical executive function trajectory for domain general accuracy (Fig. 5A) and latency (Fig. 5B) across datasets and tasks (given each dataset includes different measures; see Supplementary Table S2). Differences in the precise scaling (absolute y values at each age) persist, however, as to be expected by datasets taken from different age ranges with different tasks. Furthermore, such universal scaling to adult performance, while potentially useful for creating a common metric across measures and tasks, would not be possible for future studies that only assessed a narrower age range (e.g., 10–18 years old).

**Fig. 5: Scaled domain general executive function scores generate reproducible adolescent growth charts across datasets and tasks.**

We sought to establish a procedure for constructing normative maturational templates applicable to all relevant ages (8–35 years old) that utilizes a linear scaling of the canonical executive function trajectory to a specific measure (via basis function regression; see Methods). Unlike the GAM/GAMMs used to initially derive the canonical executive function trajectory or alternative, multiparameter non-linear models of age, we tested a procedure that only requires a simple linear transformation of the age variable in each dataset (via linear interpolation to the canonical trajectory [estimated out-of-dataset]; see Methods) and is then fit as a single parameter in a general linear model/general linear mixed effects model. This data-driven basis function process (see analogous ideas in functional brain imaging⁵⁵) is therefore the same as what occurs with standard parametric functional forms of age (e.g., linear, inverse linear age [1/age], quadratic polynomial age [age + age²]), but would have the added benefit of its shape/functional form being directly informed by prior developmental data. We tested this procedure to directly assess whether the insights generated in the current work regarding a canonical executive function trajectory could quantitatively guide future research allowing for simplified modeling approaches that are developmentally informed and computationally efficient. To mirror the use of this approach in future developmental research with new datasets and new measures, we tested the generalizability of this procedure through cross-validation (“leave one dataset out”) and compared performance to standard functional forms of age used in developmental research (linear age, inverse linear age [1/age], quadratic polynomial age [age + age²]) that may otherwise be used to understand age-related executive function change and deviations from normative development.

A canonical executive function trajectory, estimated out-of-sample (“leave one dataset out”) and used as a single parameter basis function (e.g., shape of age model for Luna dataset determined by NCANDA, NKI, and PNC datasets; see Supplementary Fig. S11 for visualization of workflow), generally outperformed standard functional forms of age (linear age, inverse linear age [1/age], quadratic polynomial age [age + age²]) during model comparison testing that aggregated multiple metrics of model fit and complexity (Fig. 5C, D). Following model selection criteria based on all metrics across accuracy and latency measures, the simplified, single parameter basis function was the most selected model (55.6% of the time; compare to quadratic [age + age²]: 37.3%; inverse linear age [1/age]: 7.03%; linear age: 0%), which was significantly higher than all other age models (vs. inverse linear age [1/age]: χ² = 21.6, p < 0.001; vs. linear age χ² = 30.6, p < 0.001; all p values two-sided; chi-square test with Yate’s correction for continuity) other than the quadratic model (vs. age + age² (best model 37.3%), χ² = 2.29, p [two-sided] = 0.130). Results were further unchanged when specifically looking at generalizability between Luna and NKI datasets that do not share any measures (data-driven age basis model was best age model overall 69.2%). Consistent with the strength of the basis function being derived from its developmentally precise shape, offsetting the basis function with respect to age led to lower and more variable model performance (Supplementary Fig. S12). Combined, these results establish a simplified, single parameter data-driven basis function version of the canonical executive function trajectory as an alternative, developmentally informed functional form of age that is superior or highly competitive with standard, parametric functional forms of age when applied to new datasets and new measures. Therefore, we suggest that, along with full, multi-parameter complex spline models (GAM/GAMMs used throughout the rest of the manuscript) and standard functional forms of age (e.g., linear, inverse linear, quadratic), such a simplified, developmentally informed basis function may quantitatively (see Data and Code Availability) inform future research on normative development and deviations from normative development in health and disease (see Discussion).

Sensitivity analyses

Sensitivity analyses demonstrated that primary results concerning the magnitude and timing of executive function accuracy and latency development were consistent across males and females (Supplementary S13). Additional sensitivity analyses demonstrated that our primary results did not change when covarying for socioeconomic indicators (parental education and family income, Supplementary S14, S15) and assessments of culturally acquired knowledge (verbal reasoning and vocabulary, Supplementary S16), and remained consistent across mental health inclusion/exclusion thresholds (Supplementary S17). This suggests that mathematically holding these factors constant did not change the current results that focused on aggregate and average executive function changes during adolescence. Thus, our results do not speak to for example past findings suggesting economic disparities impact cognitive measures, and variability between individuals (cf.,^56,57). However, the tools and insights from the current work can be used for future studies focused on relationships between these factors and executive function in more detail (see Discussion). As in previous longitudinal investigations of computerized and neuropsychological performance⁵⁸, age-independent visit effects (e.g. practice effects) on cognitive testing were observed for many executive function tasks in longitudinal samples (Supplementary S9). However, all longitudinal analyses (Luna, NCANDA samples) covaried for a non-linear effect of visit number (see Methods) and we demonstrate replication to two cross-sectional datasets (NKI, PNC) where visit effects could not have occurred, indicating that our primary results are likewise robust to practice effects on cognitive testing.

Discussion

Defining the adolescent period through a reproducible, canonical trajectory of executive function and significant periods of development

The development of executive function has been studied in relatively small (N’s ~200^15,16) independent investigations using a broad range of tasks or in relatively large studies (N’s ~ 1000^17,18; although still smaller than the total sample used here: N = 10,766, total visits =13,817)) with few, very narrow assessments of executive function in intelligence testing. Collectively, prior work demonstrates significant improvements from childhood through adolescence^{5,15,16,19,20,21,22,23,24}, but the precise magnitude, maturational timing, and significant periods of development in executive function during the transition from adolescence to adulthood has not been defined. With four, large independent datasets, and non-linear modeling techniques to identify specific periods of significant development, we provide reproducible and direct evidence that executive functions continue to develop into late adolescence, which has been widely suggested by theory^4,7,8,9 but has rarely been directly tested in empirical research. Building from prominent neurodevelopmental^4,7,8,9 and psychological^27,28 theories, these results highlight adolescence as an essential period of transition during which individuals reach maturity in goal-directed cognition. This suggests that while adolescents clearly possess complex cognitive abilities, including the ability to inhibit prepotent responses, maintain and update information in memory, and abstractly plan for future events, such abilities do not reach their full potential until 18–20 years old (late adolescence). Adolescent periods prior to this age-range (i.e., early to mid-adolescence ~10–15 years old, and mid to late adolescence ~15–18-years old) are therefore likely critical final stages of this type of cognitive development, where deviations from normative development may lead to poorer outcomes in adulthood. Identifying these sensitive, or even critical³, periods of cognitive development is essential for advancing neurocognitive growth-charting to determine normative development and deviations from this normative development in health and disease^19,45, in designing developmentally informed interventions/preventions for youth^59,60,61,62, and policy concerning adolescents^9,14.

Given the reproducible and converging evidence for adolescence as a distinct period of the lifespan, and one now better conceptualized as a period of normative closure in goal-directed cognitive development prior to the establishment of adult-level trajectories, the current results support a broader understanding of the neurobehavioral basis for the adolescent period. Together with essential additional historical and sociocultural frameworks²⁷, such charting of neurobehavioral processes throughout adolescence emphasize the importance of developmentally relevant considerations for adolescents across research and clinical care. Thus, our identification of the maturational timing of executive function, in combination with similar investigations of affective and social processes^2,63 may guide further discussion on how to define the adolescent period and demarcate its boundaries²⁷, essential for basic and translational developmental research. To assist in this pursuit, we have made available summary data (note participant-level data is also available with necessary data use agreements; see Data availability) for the canonical executive function trajectory, with the goal that subsequent work may utilize and continue to refine empirically defined normative maturational templates in executive function research. While such refinement should include ongoing model comparison of other candidate functional forms of age (cf., Fig. 5), we suggest sharing of reproducible and well-powered adolescent trajectories of executive function can be directly integrated in future analysis (e.g., basis function regression) in methods that mirror the development, refinement, and use of summary statistics in other fields (e.g., polygenic risk scores⁶⁴). As in these and related fields⁶⁵, large-scale reproducible normative templates of change can be leveraged to better understand risk factors or consequences of mental and physical health conditions related to executive function during adolescence and across a range of experimental conditions.

Three of the four datasets used here (NCANDA, NKI, PNC), as community samples, did not exclude participants on the basis of mental health presentations. However, our sensitivity analyses demonstrated that our approach (used in an effort to maximize generalizability; c.f.,^66,67) did not bias our results that focused on aggregate and average executive function changes during adolescence. The tools and insights developed here can support future studies of executive function differences in psychopathology both in new datasets, as well as targeted investigations within the current datasets. Normative templates of age-related differences in executive function derived here may also be useful for future research to disambiguate developmental effects and non-developmental visit effects (e.g., practice effects) that, consistent with prior reports⁵⁸, we observed in longitudinal executive function data. Future work may also use these insights towards optimizing developmental study designs with respect to the number of participants, construct breadth of assessments, and the number of longitudinal time points.

The results of the current work provide support for prior theoretical and quantitative work suggesting non-linear developmental trajectories of cognition during adolescence^15,20,32. Updating theoretical models requires broad conceptual consideration, nevertheless, the clear presence of non-linearity in age-related executive function differences from late childhood through adulthood can directly help refine neurodevelopmental models of adolescence. Our results for example provide less support for linear increases of executive function development throughout adolescence⁸ as well as maturational timing of this process after twenty years old²⁷. Instead, our results clearly support a reproducible, canonical non-linear trajectory of executive function development from adolescence to adulthood. The shape and timing of this canonical trajectory is consistent with prior theories of adolescence⁹ and empirical work with fewer executive function assessments and/or smaller samples that suggest non-linear cognitive development processes^17,32. The robust, large-scale multi-dataset replication here provides key advances towards formalizing such a non-linear trajectory, and through the employed data-driven modeling approaches, explicitly defines significant periods of executive function development that identify the potential closing of the adolescent period for this process between 18–20 years old. Such distinctions on the relative bounds of the adolescent period are not only essential for psychological and neuroscientific theories, but also for clinical care and policy. Our work also sets key areas for future work regarding maturational timing in more fundamental studies of executive function development (e.g., disambiguating age-related change from pubertal development, generalizability to populations outside of the United States, targets for brain imaging, and considerations for affective versus nonaffective executive function tasks: see Considerations for Future Work).

Domain general executive function development

While prior work in adults^52,68 and younger children^69,70,71 has provided evidence for a potential unity/diversity framework of executive function, the relative domain-generality versus specificity of executive function has largely not been examined with respect to changes during adolescence. The strongest evidence across the large-scale data aggregated here suggests that age-related differences and longitudinal changes across executive function tasks are driven predominantly by a domain-general process. This indicates that across executive processes (e.g., inhibitory control, attention, working memory, planning) there is a common system of goal-directed cognition that may lead to developmental improvements across multiple contexts. Such domain-general executive function development may help explain, for example, wide-spread differences across executive function tasks in clinical^72,73 and/or population research (e.g., social determinants of health^71,74,75), as well as the tendency for many executive function tasks to engage common neural circuitry^76,77. Domain-general executive function development during adolescence also provides support for general heuristic perspectives of adolescence that emphasize a core set of cognitive development^4,7,8,9. The current work that focused on multi-assessment and multi-dataset reproducibility of trajectories of adolescent executive function across large-scale cross-sectional and longitudinal data further sets priorities for additional within-person modeling (e.g., multivariate sparse functional principal components analysis⁷⁸, multivariate growth curve modeling⁷⁹) in future targeted investigations.

Although we found a considerable degree of commonality in adolescent executive function development, as in related work from adults⁴², current measures and methods do not rule out additional executive function variance relevant to development (even if such domain/measure-specific variance is less prominent than domain-general processes). Our analyses were generally well accounted for by a domain general perspective of executive function, and further exploring this allowed us to examine multi-assessment multi-dataset estimates towards reproducibility and generalizability. However, as in other reports^42,69,70,71, executive function variance was not entirely captured by a single factor. Future work, including using the tools and insights developed here, may address these questions in multi-dataset reproducibility and generalizability investigations. With respect to potential distinction among other cognitive processes, our sensitivity analyses did however demonstrate that the canonical executive functional trajectory was robust to individual differences in measures of culturally acquired knowledge (see Supplementary S16). The results here nevertheless raise further questions regarding the conceptual distinction of executive function performance and development from that of related domain-general concepts like fluid cognition that are theorized to account for the coherence of performance-based cognitive abilities (and the distinction from culturally acquired knowledge) in the context of general ability testing (see⁸⁰ for additional discussion). Future empirical and theoretical work, to add to existing frameworks, will be required to rectify these related but often historically distinct accounts. From either account, we suggest that commonality across measures, while essential for basic and translational research and practical demarcations of adolescent development, be expanded to consider broader sociocultural and historical perspectives as well. The increasing availability of future large-scale population-level cohorts (e.g., Adolescent Cognitive Brain Development [ABCD] Study⁸¹), together with the methods used and developed in the current work, can facilitate future empirical investigations into these areas.

Common driver of executive function development

Conceptually, the potential cognitive and psychological mechanisms of such domain general executive function development remain somewhat of an open question. As inhibitory control tasks (antisaccade, color-word interference, trail-making-test) often had both the highest loadings on domain-general factors observed here (which is consistent with similar prior work in adults⁶⁸) and amongst the largest developmental effects, it is possible a global inhibitory control process provides the most parsimonious explanation for domain-general/unitary executive function. If, as has been suggested, executive function tasks often fail to solely isolate a specific cognitive process (the so-called task “impurity problem”^5,68,82), global inhibitory control processes may give rise to broad executive function changes through adolescence across diverse tasks, each of which requires some level of global, goal-directed inhibition. Nevertheless, we suggest that future work determining the common driver of executive function changes will benefit most from novel dense longitudinal study designs (e.g., repeated ambulatory smartphone/web-based assessment of cognition⁸³) and/or further multi-method investigations (e.g., fMRI⁷⁶) that provide a means to understand temporal processes and/or correlated neurobiology, respectively. This would help protect against the possible circularity of descriptions of a common driver of executive function that are limited to functions assessed contemporaneously and/or with the same methodology. As demonstrated in the current work, however, even without a clear narrative description of the origins of domain-general executive function, the maturation of domain-general executive function provides a means to qualitatively understand the adolescent period and quantitatively guide future work. In pursuit of these goals, the current results emphasize the utility of research designs that include not just large sample sizes and/or longitudinal data, but also multiple measures within a broader construct (executive function). Our results suggest the utility of shared information and/or the potential utility of convergent validity from multiple executive function indicators in outcome research when such construct depth is available. Even when more domain-specific effects are of interest, our results suggest that the estimation of domain-general executive function via a broad battery is optimal, as developmental differences on nearly all measures had sizeable influences from a more general process.

Considerations for future work

The identification of common adolescent executive function development may guide future translational and multidisciplinary research. For example, our results suggest that neuroimaging research of adolescent executive function may be well-suited by leveraging multiple executive function tasks to examine shared information in association with brain structure/function or to better isolate domain-specific effects. Likewise, as has become increasingly common⁷², translational research aiming to uncover adolescent executive function as a possible predictor or consequence of clinical presentations and/or as a target for intervention, may be best suited to approach executive function from a unitary, domain-general process that follows the canonical executive function trajectory revealed here. Methodologically, common metrics of domain-general executive function, and normative templates of change (even in scaled units: basis functions) may serve to increase reproducibility by facilitating overlap and replication efforts across instruments and datasets.

The current project leveraged multiple large independent datasets, developed methodological improvements permitting the identification of maturational timing of executive function, and investigated both common and specific components of executive function processes, but nevertheless potential limitations and explicit suggestions for future work should be considered. First, although this investigation used a comprehensive approach to characterizing executive function, these analyses focused on the most prominent outcome measures from these tests. This approach had the advantage of aligning the current analyses with predominant practices in the literature and the level of granularity supported by large-scale, public datasets, but future work would benefit from alternative and/or model-based, computational parameterizations of behavioral performance^84,85. Furthermore, the breadth of executive functions indexed by these tasks was not exhaustive, and other domains of individual differences in cognition were not explored. For example, by design, this study, and many of the original datasets, did not examine executive function tasks in the context of affective stimuli. That is, the included measures focused on what have been considered affectively neutral cognitive measures. This allowed us to specifically isolate fundamental properties of executive function development as typically understood, but future work with more diverse cognitive batteries should examine whether affective manipulations likewise follow the canonical executive function trajectory established here. Another potential limitation is that the current work did not try to disambiguate age-related changes from pubertal development, given challenges in independently estimating these effects in the presence of large cross-sectional age effects (cf.,⁸⁶). However, it will be important for future work, particularly when focusing on early periods of adolescence to likewise seek large-scale multi-assessment, multi-dataset reproducibility for the specific role of pubertal status in driving executive function development. A further potential limitation arises from our general focus on the average executive function trajectory during adolescence. While we determined that our results were generally robust to multiple participant-level factors, the results of the current work should be interpreted as a normative template and individual and dataset-level variability is expected. Relatedly, while the aggregated datasets and inferences drawn here appear to approximate population patterns from the United States, further work with multinational and multicultural samples is required to determine the generalizability of these results to other countries and cultures. The tools and data developed here can nevertheless provide resources for additional research on deviations from this normative trajectory, promote improved estimates of uncertainty, and ultimately support potential translational efforts seeking to identify clinically relevant executive function-related processes during adolescence.

Methods

Participants

Data for this project were provided from participants of four existing projects (all with publicly available data). One internal dataset (Luna Dataset) and three external datasets (National Consortium on Alcohol & Neurodevelopment in Adolescence³⁶ [NCANDA], Nathan Kline Institute-Rockland Sample³⁵ [NKI], Philadelphia Neurodevelopmental Cohort³⁷ [PNC]) were included based on (1) their inclusion of executive function tasks performed in a developmental or lifespan dataset spanning the entirety of the adolescent period and (2) to aggregate the largest possible dataset to explore the aims of this project. The primary focus of the current work was on the adolescent period. To explicitly capture transitions into and out of adolescence as well as the entire adolescent period³³, we included participants ranging from late childhood to adulthood (8–35 years old). Lower (8 years old) and upper (35 years old) age ranges were selected to be as inclusive as possible, given the overarching goal of capturing non-linear developmental trajectories, while also ensuring that at least two separate datasets had participants in each age range. This meant that only participants from 8–35 years old from the NKI lifespan dataset were included (Full NKI Rockland Sample Range: 6–85). No participants were excluded based on age from the other datasets (Luna, NCANDA, PNC), which were designed to assess childhood to adolescence/adulthood and fully fell within this age range. In order to maximize generalizability and representation within the datasets (see refs. ^66,67 for relevant discussion concerning neurodevelopmental studies), no other participant-level demographic exclusion criteria were applied to the datasets. Instead, we thoroughly examined the potential impact of such factors in a series of sensitivity analyses (see Supplementary Figs. S2; S13-S17).

One dataset was drawn from Dr. Beatriz Luna’s longitudinal study of neurocognitive development (Luna Dataset). From this dataset, the current project included 196 participants (baseline age-range: 8–30 years old; 101 female participants, 92 male, 2 participants both sexes were reported, 1 participant unknown/not reported) dataset in an accelerated longitudinal/cohort sequential design, with participants completing a range of follow-up visits (total participant visits = 666, median number of visits per-participant =3; range of visits per-participant = 1–10; median months between visits = 13.3; range of months between visits = 5.97–81.73; see Supplementary Fig. S1 for graphical depiction of dataset by visit structure). Exclusion criteria for this dataset were medical conditions or medications known to affect eye movements and a history of psychiatric disorders, developmental cognitive disorders, or learning disabilities, in either the participant or a first-degree relative, and IQ scores at baseline below 80. Participants were recruited from the community surrounding the University of Pittsburgh Medical Center.

The second dataset was drawn from the multi-site, National Consortium on Alcohol & Neurodevelopment in Adolescence (NCANDA) (see ref. ³⁶ for detailed sampling strategy and recruitment information). The current project used data from 831 participants (baseline age-range:12–21 years old, 423 female participants, 408 male) in the first five visits of the accelerated longitudinal design (total participant visits = 3412, median number of visits per-participant = 5; range of visits per-participant = 1–5; median months between visits = 12.17; range of months between visits = 4.98–23.97; see Supplementary Fig. S1 for graphical depiction of dataset age by visit structure). Exclusion criteria for NCANDA were Magnetic Resonance Imaging (MRI) contraindications (e.g., claustrophobia, non-removable metal in the body), head injury with a significant loss of consciousness, psychiatric disorders that might influence study completion (e.g., psychosis), and psychiatric medication (see³⁶). A central goal of the NCANDA study was to examine the transition to significant substance use during adolescence and as a result, approximately 50% of the dataset was recruited based on subclinical factors thought to increase the likelihood of alcohol use disorder (AUD; see ref. ³⁶). The inclusion of participants with psychiatric conditions however was shown to not substantively influence the current projects’ analyses through sensitivity analyses (see Supplementary Fig. S17).

The third dataset was drawn from the lifespan Nathan Kline Institute-Rockland Sample (NKI)(see ref. ³⁵ for detailed sampling strategy and recruitment information). The current project used data from 588 participants (age range of participants within the included dataset [see above for age rationale]: 8–35 years old; 284 female participants, 304 male). The NKI-Rockland Sample includes longitudinal follow-up data (up to two visits) on the included tasks here for a very small number of participants (n = 10) within our specified age range. However, given this represented such a small percentage of participants (<2% of dataset) and only included two visits, the current analyses only included the first visit from these participants and thus this dataset was utilized as cross-sectional (see Supplementary Fig. S1 for histogram of included ages). The NKI-Rockland Sample was recruited to match the ethnic and economic demographics of Rockland County, New York. Consistent with the community sampling approach, a moderate number of participants in the NKI dataset used here (n = 286) met criteria (DSM-IV TR) for at least one lifetime diagnosis of a psychiatric disorder. These factors were shown to not substantively influence the current projects’ analyses that focused on average and aggregate developmental changes in executive function through sensitivity analyses (see Supplementary Fig. S17).

The fourth dataset was drawn from the Philadelphia Neurodevelopmental Cohort (PNC) (see ref. ³⁷ for detailed sampling strategy and recruitment information). The current project utilized data from 9151 participants in the cross-sectional, PNC dataset (age range: 8–22 years old; 4753 female participants, 4365 male, 19 participants both sexes were reported, 14 participants unknown/not reported; see Supplementary Fig. S1 for histogram of ages). Exclusion criteria for PNC were being non-ambulatory and not in stable health, non-proficiency in English, physical and cognitive challenges in participation in interviews and neurocognitive assessment, and the presence of a disorder that impaired cognition or motility (see ref. ³⁷ for detailed inclusion information). Given the large community-based sampling procedure of the PNC, this dataset included participants with psychiatric disorders that may be associated with neurocognitive performance. The current project followed previous work with this dataset⁸⁷ regarding data inclusion (see below) and sensitivity analyses examined the influence of these participants on the current project’s analyses (see Supplementary Fig. S17).

In all four datasets, research protocols were approved by the relevant institutional review boards (Luna Dataset: University of Pittsburgh; NCANDA: Duke University, Oregon Health and Sciences University, SRI International, University of Pittsburgh, University of California San Diego; NKI: Nathan Kline Institute; PNC: The University of Pennsylvania and Children’s Hospital of Philadelphia) and participants over 18 provided informed consent, while participants younger than 18 provided written assent and parental consent. To our knowledge, no participant was involved in more than one of the studies. For the current analyses, no statistical method was used to predetermine the included sample size. All four datasets were included in their entirety, apart from analysis-specific exclusions detailed below (Data Processing). As observational studies, the included experiments were not randomized. Likewise, no blinding procedures were employed.

Executive function measures

Data from Luna, NCANDA, NKI, and PNC datasets were used in the current project based on their inclusion of executive function tasks performed in a developmental or lifespan dataset that spanned the adolescent period. Classification of executive function tasks was based on prior theoretical⁵ and empirical work^15,34,42,88, with a general operationalization of goal-directed cognitive behaviors that encompassed processes of inhibition, attention, working memory, switching, or planning. Where possible, prior work with the included tasks and datasets and/or test authors³⁴ was used to define whether specific tasks indexed executive function. To avoid potential influences of verbal skills potentially related to educational attainment, measures relying heavily on reading and language skills were not included (e.g., DKEFS-Twenty Questions, DKEFS-Proverb Test) as primary executive function assessments, but the influence of culturally acquired knowledge was shown to not influence primary results in a sensitivity analysis (Supplementary Fig. S16). Wherever possible, both accuracy and latency measures were selected, except when precedence from research or clinical assessment was clear on a predominant use of accuracy (e.g., DKEFS Tower) or latency (e.g., DKEFS Trail Making Test) measures owing to nearly universal ceiling/floor performance of the corresponding accuracy/latency measure and/or the corresponding measure was not collected/available. See Supplementary Table S2 for the conceptualized subdomains of the included executive function tasks based on author consensus and original test descriptions. See Supplementary Table S3 for reproducible variable names for public datasets (NCANDA, NKI, PNC).

Based on the above criteria, the Luna dataset included twelve measures from six executive function tasks that were completed at each visit: Antisaccade (ANTI), Memory Guided Saccade (MGS), a mixed (MIX) Antisaccade/Visually Guided Saccade/Fixation task, Cambridge Neuropsychological Test Automated Battery [CANTAB] Delayed Matching to Sample (DMS), CANTAB Spatial Span (SSP), CANTAB Stockings of Cambridge (SOC). Each of these tasks have been described in detail elsewhere (see for example, refs. ^15,44). Scoring procedures and outcome measures were based on previous work from our group and general use in the literature. Briefly, the Antisaccade task required participants to inhibit a proponent response (saccade) to a peripheral stimulus (in four possible locations along the horizontal meridian) and saccade towards the opposite hemifield. Both accuracy (correct response rate across trials) and latency (median speed of antisaccades on correct trials) of the Antisaccade task were examined. A second mixed version of the Antisaccade task was also performed, where participants performed an antisaccade but trials with different task demands were also interleaved. Specifically, in 1/3rd of trials, participants were required to saccade towards the peripheral stimulus (visually guided saccade) or in 1/3rd number of trials, simply maintain fixation. Both accuracy and latency of this mixed version were examined, but only calculated for the antisaccade trials (with the same scoring procedure as above), given the visually guided saccade is not thought to rely on executive function (see ref. ¹⁵) and the number of fixation errors was included in a different measure that captured this performance in a goal-oriented context (see below). The Memory Guided Saccade task required participants to saccade towards a peripheral stimulus (in four possible locations along the horizontal meridian), remember its location during a subsequent fixation period, and then saccade towards the remembered location when no stimulus was presented. Both accuracy (difference in degrees between initial saccade and the most precise saccade the final phase⁸⁵, when no stimulus was presented) and latency (median speed of the initial saccade during the final phase across trials⁸⁵) of the Memory Guided Saccade task were examined. We also calculated the number of fixation breaks (FIX) during the middle phase of the memory guided saccade task as a putative measure of inhibition. In addition to the three eye movement tasks, the Luna dataset also included the Delayed Matching to Sample, Spatial Span, and Stockings of Cambridge tasks from the CANTAB Battery, each of which have been broadly used and whose stimuli can be found online (see www.cambridgecognition.com/cantab/). Standard accuracy (Delayed Matching to Sample: Percent Correct; Spatial Span: Span Length; Stockings of Cambridge: Problems Solved in Minimum Moves) and latency (Delayed Matching to Sampe: Median Correct Latency; Stockings of Cambridge: Mean Initial Thinking Time) measures from each of the three CANTAB tasks were examined. For interpretive consistency across measures in the Luna dataset, the direction of the scoring of two accuracy measures (Memory Guided Saccade inaccuracy [see above]; Number of Fixation Breaks) was multiplied by −1 to ensure that higher scores indexed better performance on all accuracy measures.

The NCANDA, PNC, and NKI datasets used versions of the University of Pennsylvania Computerized Neurocognitive Battery (CNB; https://webcnp.med.upenn.edu/). The current project utilized data from three CNB tasks that met our operationalization of executive function and have been classified as executive by the CNB authors³⁴, the Penn Conditional Exclusion Test (PCET), a Penn N-Back Test (PNBK; NCANDA: Penn Short Fractal N-back Test [PNB-F]; PNC & NKI: Penn Letter N-Back Test [PNB-L]), and the Penn Continuous Performance Test: Number and Letter version (PCPT). Standard outcome measures for each task were included for accuracy (PCET: calculated accuracy measure [PCET ACC2]; PNB-F: true positive [correct] responses for 1-back and 2-back trials; PNB-L: true positive [correct] responses for 1-back and 2-back trials; PCPT: sum of true positives for number and letter trials) and latency (PCET: median response time for correct responses; PNB-F: mean of median response time for 1-back and 2-back trials; PNB-L: mean of median responses for 1-back and 2-back trials, PCPT: median response time for correct response to number trials and letter trials). The NCANDA dataset also included a standard Stroop Test (STRP), where the primary measure of average latency over all correct trials was included. The NKI dataset also included four executive function tasks from the Delis-Kaplan Executive Function System⁴³ (D-KEFS) that were included in the current study: color-word interference (CWI), design fluency (DFL), tower (TOW), and the trail-making test (TMT). Again, standard outcome measures were used for these tasks (CWI latency: average of inhibition and inhibition/switching conditions; correlation amongst these measures: r = 0.806; DFL Accuracy^88,89: switching total correct; TOW: Total Achievement Score Total Raw; TMT: Number-Letter Switching). The DKEFS Sort Task was also available for a small percentage of participant visits within our analytic age range (8–35) for the NKI dataset but was not used because over two-thirds of the visits did not have this measure (66.82%), whereas all other NKI measures included had at maximum <4% missingness.

Data processing

All data processing and statistical analyses were performed in R version 4.1.2 (2021)⁹⁰. Luna dataset eye-tracking data was scored with the same automatic scoring algorithms from our previous work^85,91. Scores for all other tasks were generated through released software from the instrument (e.g., Luna dataset CANTAB) and/or included in official data releases (NCANDA, NKI, PNC datasets).

Aggregated data, either from distributed data releases (NCANDA, NKI, PNC) or our in-house database (Luna dataset) were first screened to ensure each visit (participant at testing session) had a valid age, anonymous id variable, and if longitudinal data, visit (i.e., these variables were not missing and were within the expected range, based on the study design) and included expected data. Data that did not meet these minimum criteria were removed from all analyses. As in our prior work, eye-tracking tasks in the Luna dataset (specific task at specific visit) with more than 30% of trials dropped due to poor eye-tracking or missing (i.e., early session termination; cf.,⁹¹) were also removed from all analyses. Next, data inclusion criteria were used to maximize the included dataset sizes and result generalizability, while also ensuring no considerable outlier (i.e., 4 standard deviations and more extreme than 99.9% of the distribution) biased results. Within these procedures, individual executive function measures were first screened for potential univariate leverage points in the association between age and each specific measure within general additive models (GAM: see below) or general additive mixed models (GAMM: see below). Leverage points were defined as those observations (measure for participant at testing session) with a residual from this model that was four standard deviations above the mean and removed from all subsequent analyses. Second, data were examined for potential multivariate outliers among all included executive function measures within each dataset using Mahalanobis distance within the psych package in R⁹². Sessions (all executive function measures for participant at testing session [i.e., study visit]) with a Mahalanobis distance four standard deviations above the mean were removed from all subsequent analyses.

Data analysis

General additive models

General additive models (cross-sectional data: PNC dataset) and general additive mixed models (longitudinal data: Luna, NCANDA, NKI datasets) with penalized smooth plate regression splines via the mgcv package⁴¹ were used to quantify non-linear associations between age and executive function measures. Primary cross-sectional analyses (NKI, PNC) utilized a simple bivariate model examining the smoothed association between age (the independent variable) and executive function (the outcome measure). Primary longitudinal analyses (Luna, NCANDA) additionally included a smoothed term for visit number to account for potential non-developmental visit effects (e.g., practice: see Supplementary S9) and per-participant random intercepts and age slopes via mgcv GAMM. MGCV defaults were used for all parametrization with the exception that the maximum basis dimension for visit number in the NCANDA dataset was adjusted from 10 (the default) to 5 (given there were maximally five visits in this analysis dataset). Age-related fits from these primary GAM/GAMM models are presented in Fig. 1. Pointwise confidence intervals (displayed in Fig. 5) were generated by multiplying standard error estimates from the mgcv GAM/GAMM predict function by 2 and summing this with the predicted fit estimate. Sensitivity analyses (Supplementary S13–S17) examining sociodemographic and cognitive covariates followed the same procedures, with continuous variables (e.g., parental education) modeled as smooth terms and categorical variables (e.g., biological sex) modeled as parametric terms.

Periods of growth and maturational timing

As in previous developmental research in different domains^49,50,87, periods of significant age-related change (age ranges) were defined by estimating the first derivative (finite differences method) in 1/10^th of a year intervals of GAM fits and performing a posterior simulation based on the GAM/GAMM model coefficients. Simultaneous (used given the multiple testing) confidence intervals (CI) were generated with the gratia package⁵¹ with 10,000 simulations. Age ranges in which the simultaneous 95% CI did not include zero (p < 0.05) were classified as significant. Using this method, raster plots in Fig. 2 display color (red or blue) when the derivative is significant and white when the derivative is not significant. An aggregate analysis, pointwise three-level meta-analysis, incorporating all measures from all datasets was performed using the metafor package⁹³ with effects nested in measure and study. A cross-dataset label was used to nest measures from the same tasks (e.g., Penn CNB) across datasets. As in prior methodological work on point-wise meta-analysis with GAMs⁹⁴, meta-analytic estimates were computed across a common span of the independent variable: here, 1/10th year age bins, following linear interpolation of GAM/GAMM first derivatives. The same pointwise, three-level meta-analytic approach was used in aggregate analysis of GAM/GAMM fits in Fig. 5 and Supplementary Fig. S5. Secondary analyses that used an effect size threshold to define maturation scaled the GAM/GAMM fits from 0 (min) to 1 (max) to determine the percentage of total age-related that had occurred for each age (see Supplementary Fig. S5).

Interdependence of performance across executive function tasks

Cross-sectional and longitudinal correlations (linear, bivariate) were computed among executive function measures in each dataset (Fig. 3A). For longitudinal datasets (Luna, NCANDA), baseline refers to the first visit, longitudinal refers to the pooled within-person correlation via disaggregation with the statsBy function in the psych package in R. This approach was chosen to balance interpretability with model complexity for the accelerated longitudinal designs of Luna and NCANDA datasets. Aggregate analysis (“all measures”) in Fig. 3A utilized a three-level meta-analysis via metafor with correlation pairs nested in task pairs and datasets. Exploratory factor analysis (Fig. 3B) via maximum likelihood method and a bifactor rotation was performed with the psych package in R from between- (Luna and NCANDA baseline and NKI, PNC datasets) and within-person correlation matrices (Luna and NCANDA longitudinal). Multiple data-driven thresholds for the number of extracted factors (Fig. 3C) were examined via parallel analysis and the nScree function in the nFactors R package⁹⁵ (95% CI from parallel analysis, factor analytic Kaiser rule, optimal coordinate, acceleration factor).

Contributions from domain-general versus specific processes to age-related differences in executive function were determined via model comparison that is also presented with the same description as well as additional visualization in Supplementary Fig. S8. To maximize comparability across studies and to prevent bias from shared, non-executive function visit effects (e.g., practice effects; see Sensitivity Analyses and Supplementary S9) analyses here were performed with cross-sectional data, although results are consistent with longitudinal data (cf., within-person factor structure in Fig. 3B, D).

First, three GAM models were fit for each dataset for each measure assessing the relationship between age and the specific measure i from subdomain x (measure_{x_i}): model A, a composite metric created from all measures not in the same putative subdomain as measure_{x_i}: composite metric _{M ∌ x}, where _{M ∌ x} represents the set (M) of executive function measures that do not contain measures from subdomain x: model B, and a model where age is estimated from both measure_{x_i} and composite metric _{M ∌ x}: model C. As in primary analyses, the relationship between age and each measure was modeled with penalized splines. For each model (A–C), the percent of deviance explained in age was extracted (following standard estimation in mgcv GAM model). Next, the incremental deviance of age explained by measure_{x_i} over composite metric _{M ∌ x} was computed. Finally, the resulting measure specific age-related deviance was scaled to the original deviance estimate for the specific measure (model A) to create a percent of the original measure’s age effect. The remaining percentage of model A’s deviance was assigned as the domain-general percentage. To ensure consistent interpretability of the directionality of composite metric _{M ∌ x}, measures from the opposing response type were sign flipped (e.g., latency sign flipped before creating equally weighted composite with accuracy measures). Sensitivity analysis examined the influence of the composite measure’s precision in the estimation of domain-general accounts of age-related differences in executive function (see Supplementary Fig. S10).

Normative maturational templates of age-related differences in executive function

We used basis function regression with cross-validation (“leave one dataset out”) to determine whether normative maturational templates of executive function could improve developmental inferences in new datasets and measures. A diagram of this procedure is likewise presented in Supplementary Fig. S11. In each iteration of the procedure, three (out of four) datasets were used to generate canonical executive function trajectories for accuracy and latency measures (measures aggregated across datasets via a pointwise three-level meta-analysis of GAM/GAMM age fits). The resulting output was then smoothed (via a subsequent GAM model), interpolated to the ages of the test (“left out”) dataset, and fit as a single age parameter to each accuracy and latency measure of the left out dataset and compared to typical age models (age+age², inverse age [1/age], linear age [age]) as well as an intercept only (no age) model. Potential age models were evaluated with multiple metrics of model fit and complexity via the performance package in R⁹⁶ (longitudinal models [Luna, NCANDA]: R², adjusted R², Intraclass Correlation Coefficient [ICC], Root Mean Square Error [RMSE], residual standard deviation [Sigma], Akaike’s Information Criterion [AIC], Bayesian Information Criterion [BIC]); cross-sectional models [NKI, PNC)]: R², adjusted R², RMSE, Sigma, AIC, BIC). An additional sensitivity analysis explored the influence of the exact developmental timing of the developmental function with a similar procedure that offset in years (earlier or later) the canonical executive function trajectory (see Supplementary Fig. S12).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

This project used publicly available data for all analyses. Deidentified data for all datasets used in this project are available in public repositories pending appropriate data use agreements. Luna sample: nda.nih.gov/edit_collection.html?id=2831. NCANDA: ncanda.org (Release 4Y V02). NKI: fcon_1000.projects.nitrc.org/indi/enhanced/. PNC: ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000607.v3.p2. The data supporting the individual figures are provided in the Source Data Files. Summary data for the canonical executive function trajectory have been made available at https://github.com/tervoclemmensb/Executive_Function_Charting. Source data are provided with this paper.

Code availability

Analysis code for the current project is available at https://github.com/tervoclemmensb/Executive_Function_Charting. Tervo-Clemmens, B., A Canonical Trajectory of Executive Function Maturation from Adolescence to Adulthood, Executive Function Charting, https://doi.org/10.5281/zenodo.8302417, 2023.

References

Luna, B., Marek, S., Larsen, B., Tervo-Clemmens, B. & Chahal, R. An integrative model of the maturation of cognitive control. Annu. Rev. Neurosci. 38, 151–170 (2015).
Article CAS PubMed PubMed Central Google Scholar
Blakemore, S.-J. & Mills, K. L. Is adolescence a sensitive period for sociocultural processing? Annu. Rev. Psychol. 65, 187–207 (2014).
Article PubMed Google Scholar
Larsen, B. & Luna, B. Adolescence as a neurobiological critical period for the development of higher-order cognition. Neurosci. Biobehav. Rev. 94, 179–195 (2018).
Article PubMed PubMed Central Google Scholar
Shulman, E. P. et al. The dual systems model: review, reappraisal, and reaffirmation. Dev. Cogn. Neurosci. 17, 103–117 (2016).
Article PubMed Google Scholar
Jurado, M. B. & Rosselli, M. The elusive nature of executive functions: a review of our current understanding. Neuropsychol. Rev. 17, 213–233 (2007).
Article PubMed Google Scholar
Denckla, M. B. A theory and model of executive function: a neuropsychological perspective. In: Attention, memory, and executive function, pp. 263–278 (1996).
Casey, B. J., Getz, S. & Galvan, A. The adolescent brain. Dev. Rev. 28, 62–77 (2008).
Article CAS PubMed PubMed Central Google Scholar
Steinberg, L. A dual systems model of adolescent risk-taking. Dev. Psychobiol. 52, 216–224 (2010).
Article PubMed Google Scholar
Luna, B. & Wright, C. Adolescent brain development: Implications for the juvenile criminal justice system. In: APA handbook of psychology and juvenile justice, pp. 91–116 (2016).
Tervo-Clemmens, B., Musket, C. W., Calabro, F. J. & Luna, B. Adolescent neurocognitive development and cannabis use. In: Factors affecting neurodevelopment, 537–550 (Elsevier, 2021).
Willoughby, T., Heffer, T., Good, M. & Magnacca, C. Is adolescence a time of heightened risk taking? An overview of types of risk-taking behaviors across age groups. Dev. Rev. 61, 100980 (2021).
Article Google Scholar
Tervo-Clemmens, B., Quach, A., Calabro, F. J., Foran, W. & Luna, B. Meta-analysis and review of functional neuroimaging differences underlying adolescent vulnerability to substance use. NeuroImage 209, 116476 (2020).
Article PubMed Google Scholar
Kessler, R. C. et al. Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the National Comorbidity Survey Replication. Arch. Gen. Psychiatry 62, 593–602 (2005).
Article PubMed Google Scholar
Steinberg, L. The influence of neuroscience on US Supreme Court decisions about adolescents’ criminal culpability. Nat. Rev. Neurosci. 14, 513–518 (2013).
Article CAS PubMed Google Scholar
Luna, B., Garver, K. E., Urban, T. A., Lazar, N. A. & Sweeney, J. A. Maturation of cognitive processes from late childhood to adulthood. Child Dev. 75, 1357–1372 (2004).
Article PubMed Google Scholar
Demetriou, A. et al. The development of mental processing: efficiency, working memory, and thinking. Monogr. Soc. Res. Child Dev. 67, 1–55 (2002).
McArdle, J. J., Ferrer-Caja, E., Hamagami, F. & Woodcock, R. W. Comparative longitudinal structural analyses of the growth and decline of multiple intellectual abilities over the life span. Dev. Psychol. 38, 115 (2002).
Article PubMed Google Scholar
Moffitt, T. E. et al. A gradient of childhood self-control predicts health, wealth, and public safety. Proc. Natl Acad. Sci. 108, 2693–2698 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Quach, A. et al. Adolescent development of inhibitory control and substance use vulnerability: a longitudinal neuroimaging study. Dev. Cogn. Neurosci. 42, 100771 (2020).
Article PubMed PubMed Central Google Scholar
Ordaz, S. J., Foran, W., Velanova, K. & Luna, B. Longitudinal growth curves of brain function underlying inhibitory control through adolescence. J. Neurosci. 33, 18109–18124 (2013).
Article CAS PubMed PubMed Central Google Scholar
Anderson, V., Northam, E. & Wrennall, J. Developmental neuropsychology: a clinical approach. (Routledge, 2018).
Passler, M. A., Isaac, W. & Hynd, G. W. Neuropsychological development of behavior attributed to frontal lobe functioning in children. Developmental Neuropsychol. 1, 349–370 (1985).
Article Google Scholar
Cromer, J. A., Schembri, A. J., Harel, B. T. & Maruff, P. The nature and rate of cognitive maturation from late childhood to adulthood. Front. Psychol. 6, 704 (2015).
Article PubMed PubMed Central Google Scholar
Luciana, M., Conklin, H. M., Hooper, C. J. & Yarger, R. S. The development of nonverbal working memory and executive control processes in adolescents. Child Dev. 76, 697–712 (2005).
Article PubMed Google Scholar
Best, J. R. & Miller, P. H. A developmental perspective on executive function. Child Dev. 81, 1641–1660 (2010).
Article PubMed PubMed Central Google Scholar
Spear, L. P. The adolescent brain and age-related behavioral manifestations. Neurosci. Biobehav. Rev. 24, 417–463 (2000).
Article CAS PubMed Google Scholar
Sawyer, S. M., Azzopardi, P. S., Wickremarathne, D. & Patton, G. C. The age of adolescence. Lancet Child Adolesc. Health 2, 223–228 (2018).
Article PubMed Google Scholar
Dahl, R. E. & Hariri, A. R. Lessons from G. Stanley Hall: Connecting new research in biological sciences to the study of adolescent development. J. Res. Adolesc. 15, 367–382 (2005).
Article Google Scholar
Organization, W. H. Young people’s health-a challenge for society: report of a WHO Study Group on Young People and” Health for All by the Year 2000”[meeting held in Geneva from 4 to 8 June 1984]. (World Health Organization, 1986).
Wohlwill, J. F. The age variable in psychological research. Psychol. Rev. 77, 49–64 (1970).
Article Google Scholar
Robinson, K., Schmidt, T. & Teti, D. M. Issues in the use of longitudinal and cross-sectional designs. In: Handbook of research methods in developmental science, pp 1–20 (2005).
Kail, R. V. & Ferrer, E. Processing speed in childhood and adolescence: longitudinal models for examining developmental change. Child Dev. 78, 1760–1770 (2007).
Article PubMed Google Scholar
Luna, B., Tervo-Clemmens, B. & Calabro, F. J. Considerations when characterizing adolescent neurocognitive development. Biol. Psychiatry 89, 96–98 (2021).
Article PubMed Google Scholar
Gur, R. C. et al. Age group and sex differences in performance on a computerized neurocognitive battery in children age 8- 21. Neuropsychology 26, 251 (2012).
Article PubMed PubMed Central Google Scholar
Nooner, K. B. et al. The NKI-Rockland sample: a model for accelerating the pace of discovery science in psychiatry. Front. Neurosci. 6, 152 (2012).
Article PubMed PubMed Central Google Scholar
Brown, S. A. et al. The National Consortium on Alcohol and NeuroDevelopment in Adolescence (NCANDA): a multisite study of adolescent development and substance use. J. Stud. alcohol drugs 76, 895–908 (2015).
Article PubMed PubMed Central Google Scholar
Calkins, M. E. et al. The Philadelphia Neurodevelopmental Cohort: constructing a deep phenotyping collaborative. J. Child Psychol. Psychiatry 56, 1356–1369 (2015).
Article PubMed PubMed Central Google Scholar
Schönbrodt, F. D. & Perugini, M. At what sample size do correlations stabilize? J. Res. Personal. 47, 609–612 (2013).
Article Google Scholar
Marek, S. et al. Reproducible brain-wide association studies require thousands of individuals. Nature 603, 654–660 (2022).
Arnett, J. J. Emerging adulthood: What is it, and what is it good for? Child Dev. Perspect. 1, 68–73 (2007).
Article Google Scholar
Wood, S. mgcv: Mixed GAM computation vehicle with GCV/AIC/REML smoothness estimation. University of BATH (2012).
Miyake, A. et al. The unity and diversity of executive functions and their contributions to complex “frontal lobe” tasks: a latent variable analysis. Cogn. Psychol. 41, 49–100 (2000).
Article CAS PubMed Google Scholar
Delis, D. C., Kaplan, E. & Kramer, J. H. Delis-Kaplan executive function system. (2001).
De Luca, C. R. et al. Normative data from the CANTAB. I: development of executive function over the lifespan. J. Clin. Exp. Neuropsychol. 25, 242–254 (2003).
Article PubMed Google Scholar
Moore, T. M., Reise, S. P., Gur, R. E., Hakonarson, H. & Gur, R. C. Psychometric properties of the penn computerized neurocognitive battery. Neuropsychology 29, 235 (2015).
Article PubMed Google Scholar
Duncan Roger Johnson Michaela Swales Charles Freer, J. Frontal lobe deficits after head injury: unity and diversity of function. Cogn. Neuropsychol. 14, 713–741 (1997).
Article Google Scholar
Henrich, J., Heine, S. J. & Norenzayan, A. The weirdest people in the world? Behav. Brain Sci. 33, 61–83 (2010).
Article PubMed Google Scholar
Wood, S. N. On p-values for smooth components of an extended generalized additive model. Biometrika 100, 221–228 (2013).
Article MathSciNet MATH Google Scholar
Bridgwater, M. et al. Developmental influences on symptom expression in antipsychotic-naïve first-episode psychosis. Psychol. Med. 52, 1698–1709 (2020).
Calabro, F. J., Murty, V. P., Jalbrzikowski, M., Tervo-Clemmens, B. & Luna, B. Development of hippocampal–prefrontal cortex interactions through adolescence. Cereb. Cortex 30, 1548–1558 (2020).
Article PubMed Google Scholar
Simpson, G. L. & Singmann, H. R Package: gratia. Ggplot-based graphics and other useful functions for GAMs fitted using Mgcv, 0.1-0 (Ggplot-based graphics and utility functions for working with GAMs fitted using the mgcv package).[Google Scholar] (2018).
Tucker-Drob, E. M. Global and domain-specific changes in cognition throughout adulthood. Dev. Psychol. 47, 331 (2011).
Article PubMed PubMed Central Google Scholar
Deater-Deckard, K. & Mayr, U. Cognitive change in aging: identifying gene–environment correlation and nonshared environment mechanisms. J. Gerontol. Ser. B: Psychol. Sci. Soc. Sci. 60, 24–31 (2005).
Article Google Scholar
Kuczmarski, R. J. CDC growth charts: United States. (US Department of Health and Human Services, Centers for Disease Control and …, 2000).
Friston, K. J. Models of brain function in neuroimaging. Annu. Rev. Psychol. 56, 57–87 (2005).
Article PubMed Google Scholar
Noble, K. G. et al. Family income, parental education and brain structure in children and adolescents. Nat. Neurosci. 18, 773–778 (2015).
Article CAS PubMed PubMed Central Google Scholar
Engelhardt, L. E., Church, J. A., Paige Harden, K. & Tucker-Drob, E. M. Accounting for the shared environment in cognitive abilities and academic achievement with measured socioecological contexts. Dev. Sci. 22, e12699 (2019).
Article PubMed Google Scholar
Sullivan, E. V. et al. Effects of prior testing lasting a full year in NCANDA adolescents: contributions from age, sex, socioeconomic status, ethnicity, site, family history of alcohol or drug abuse, and baseline performance. Dev. Cogn. Neurosci. 24, 72–83 (2017).
Article PubMed PubMed Central Google Scholar
Romer, D. Adolescent risk taking, impulsivity, and brain development: Implications for prevention. Dev. Psychobiol. 52, 263–276 (2010).
Article PubMed PubMed Central Google Scholar
Dennis, M. et al. The Cannabis Youth Treatment (CYT) Study: main findings from two randomized trials. J. Subst. Abus. Treat. 27, 197–213 (2004).
Article Google Scholar
Skiba, D., Monroe, J. & Wodarski, J. S. Adolescent substance use: reviewing the effectiveness of prevention strategies. Soc. work 49, 343–353 (2004).
Article PubMed Google Scholar
Diamond, A. & Lee, K. Interventions shown to aid executive function development in children 4 to 12 years old. Science 333, 959–964 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Ernst, M. & Fudge, J. L. A developmental neurobiological model of motivated behavior: anatomy, connectivity and ontogeny of the triadic nodes. Neurosci. Biobehav. Rev. 33, 367–382 (2009).
Article PubMed Google Scholar
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS Genet. 9, e1003348 (2013).
Article CAS PubMed PubMed Central Google Scholar
Marquand, A. F. et al. Conceptualizing mental disorders as deviations from normative functioning. Mol. Psychiatry 24, 1415–1424 (2019).
Article PubMed PubMed Central Google Scholar
LeWinn, K. Z., Sheridan, M. A., Keyes, K. M., Hamilton, A. & McLaughlin, K. A. Sample composition alters associations between age and brain structure. Nat. Commun. 8, 1–14 (2017).
Article CAS Google Scholar
Cosgrove, K. T. et al. Limits to the generalizability of resting-state functional magnetic resonance imaging studies of youth: an examination of ABCD Study® baseline data. Brain. Imaging Behav. 16, 1919–1925 (2022).
Miyake, A. & Friedman, N. P. The nature and organization of individual differences in executive functions: four general conclusions. Curr. Dir. Psychol. Sci. 21, 8–14 (2012).
Article PubMed PubMed Central Google Scholar
Cirino, P. T. et al. A framework for executive function in the late elementary years. Neuropsychology 32, 176 (2018).
Article PubMed PubMed Central Google Scholar
Blair, C., Zelazo, P. D. & Greenberg, M. T. The measurement of executive function in early childhood. Dev. Neuropsychol. 28, 561–571 (2005).
Article PubMed Google Scholar
Blair, C. Developmental science and executive function. Curr. Dir. Psychol. Sci. 25, 3–7 (2016).
Article PubMed PubMed Central Google Scholar
Gur, R. C. et al. Neurocognitive growth charting in psychosis spectrum youths. JAMA Psychiatry 71, 366–374 (2014).
Article PubMed Google Scholar
Willcutt, E. G., Doyle, A. E., Nigg, J. T., Faraone, S. V. & Pennington, B. F. Validity of the executive function theory of attention-deficit/hyperactivity disorder: a meta-analytic review. Biol. Psychiatry 57, 1336–1346 (2005).
Article PubMed Google Scholar
Hackman, D. A., Gallop, R., Evans, G. W. & Farah, M. J. Socioeconomic status and executive function: developmental trajectories and mediation. Dev. Sci. 18, 686–702 (2015).
Article PubMed Google Scholar
Lawson, G. M., Hook, C. J. & Farah, M. J. A meta-analysis of the relationship between socioeconomic status and executive function performance among children. Dev. Sci. 21, e12529 (2018).
Article Google Scholar
Zhang, Z. et al. Neural substrates of the executive function construct, age-related changes, and task materials in adolescents and adults: ALE meta-analyses of 408 fMRI studies. Dev. Sci. 24, e13111 (2021).
Article PubMed Google Scholar
Fu, Z. et al. The geometry of domain-general performance monitoring in the human medial frontal cortex. Science 6, eabm9922 (2021).
Jiang, L. et al. Bayesian multivariate sparse functional principal components analysis with application to longitudinal microbiome multiomics data. Ann. Appl. Stat. 16, 2231–2249 (2022).
Article MathSciNet MATH Google Scholar
Tucker-Drob, E. M. et al. A strong dependency between changes in fluid and crystallized abilities in human cognitive aging. Sci. Adv. 8, eabj2422 (2022).
Article ADS PubMed PubMed Central Google Scholar
Friedman, N. P. et al. Not all executive functions are related to intelligence. Psychol. Sci. 17, 172–179 (2006).
Article PubMed Google Scholar
Volkow, N. D. et al. The conception of the ABCD study: from substance use to a broad NIH collaboration. Dev. Cogn. Neurosci. 32, 4–7 (2018).
Article PubMed Google Scholar
Burgess, P. W. Theory and methodology in executive function research. In: Methodology of frontal and executive function 87–121 (Routledge, 2004).
Germine, L., Strong, R. W., Singh, S. & Sliwinski, M. J. Toward dynamic phenotypes and the scalable measurement of human behavior. Neuropsychopharmacology 46, 209–216 (2021).
Article PubMed Google Scholar
Weigard, A. et al. Cognitive modeling informs interpretation of go/no-go task-related neural activations and their links to externalizing psychopathology. Biol. Psychiatry. Cogn. Neurosci. Neuroimaging 5, 530–541 (2020).
PubMed Google Scholar
Montez, D. F., Calabro, F. J. & Luna, B. The expression of established cognitive brain states stabilizes with working memory development. Elife 6, e25606 (2017).
Article PubMed PubMed Central Google Scholar
Wierenga, L. M. et al. Unraveling age, puberty and testosterone effects on subcortical brain development across adolescence. Psychoneuroendocrinology 91, 105–114 (2018).
Article CAS PubMed Google Scholar
Larsen, B. et al. Longitudinal development of brain iron is linked to cognition in youth. J. Neurosci. 40, 1810–1818 (2020).
Article CAS PubMed PubMed Central Google Scholar
Callahan, B. L., Plamondon, A., Gill, S. & Ismail, Z. Contribution of vascular risk factors to the relationship between ADHD symptoms and cognition in adults and seniors. Sci. Rep. 11, 1–11 (2021).
Article Google Scholar
Suchy, Y., Kraybill, M. L. & Larson, J. C. G. Understanding design fluency: Motor and executive contributions. J. Int. Neuropsychol. Soc. 16, 26–37 (2010).
Article PubMed Google Scholar
Team, R. C. R: a language and environment for statistical computing. http://www.R-project.org/ (2013).
Tervo-Clemmens, B. et al. Neural correlates of rewarded response inhibition in youth at risk for problematic alcohol use. Front. Behav. Neurosci. 11, 205 (2017).
Article PubMed PubMed Central Google Scholar
Revelle, W. & Revelle, M. W. Package ‘psych’. Compr. R. Arch. Netw. 337, 338 (2015).
Google Scholar
Viechtbauer, W. & Viechtbauer, M. W. Package ‘metafor’. The Comprehensive R Archive Network. Package ‘metafor’. http://cran.r-project.org/web/packages/metafor/metafor.pdf (2015).
Sørensen, Ø. et al. Meta-analysis of generalized additive models in neuroimaging studies. NeuroImage 224, 117416 (2021).
Article PubMed Google Scholar
Raiche, G., Magis, D. & Raiche, M. G. Package ‘nFactors’. Repository CRAN, 1–58 (2020).
Lüdecke, D., Ben-Shachar, M. S., Patil, I., Waggoner, P. & Makowski, D. performance: An R package for assessment, comparison and testing of statistical models. J. Open Source Softw. 6, 3139 (2021).

Download references

Acknowledgements

This work was supported by the National Institutes of Health: R03MH113090 (Calabro, Luna), R01MH067924 (Luna), an American Psychological Foundation Visionary Grant (Tervo-Clemmens), and the Staunton Farm Foundation (Luna).

Author information

Authors and Affiliations

Department of Psychiatry & Behavioral Sciences, University of Minnesota, Minneapolis, MN, USA
Brenden Tervo-Clemmens
Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Brenden Tervo-Clemmens
Department of Psychology, University of Pittsburgh, Pittsburgh, PA, USA
Brenden Tervo-Clemmens & Beatriz Luna
Department of Psychiatry, University of Pittsburgh, Pittsburgh, PA, USA
Finnegan J. Calabro, Ashley C. Parr, Jennifer Fedor, William Foran & Beatriz Luna
Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA, USA
Finnegan J. Calabro & Beatriz Luna
Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA, USA
Jennifer Fedor

Authors

Brenden Tervo-Clemmens
View author publications
You can also search for this author in PubMed Google Scholar
Finnegan J. Calabro
View author publications
You can also search for this author in PubMed Google Scholar
Ashley C. Parr
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer Fedor
View author publications
You can also search for this author in PubMed Google Scholar
William Foran
View author publications
You can also search for this author in PubMed Google Scholar
Beatriz Luna
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception: B.T.-C., B.L. Design: B.T.-C., F.J.C., A.C.P., B.L. Data acquisition, analysis, and interpretation: B.T.-C., F.J.C., A.C.P., J.F., W.F., B.L. Manuscript writing, revising: B.T.-C., F.J.C., A.C.P., J.F., W.F., B.L.

Corresponding author

Correspondence to Brenden Tervo-Clemmens.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks James Ogilvie and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tervo-Clemmens, B., Calabro, F.J., Parr, A.C. et al. A canonical trajectory of executive function maturation from adolescence to adulthood. Nat Commun 14, 6922 (2023). https://doi.org/10.1038/s41467-023-42540-8

Download citation

Received: 27 January 2023
Accepted: 13 October 2023
Published: 30 October 2023
DOI: https://doi.org/10.1038/s41467-023-42540-8

This article is cited by

Goal-directed learning in adolescence: neurocognitive development and contextual influences
- Linda Wilbrecht
- Juliet Y. Davidow
Nature Reviews Neuroscience (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.