A generative network model of neurodevelopmental diversity in structural brain organization

Akarca, Danyal; Vértes, Petra E.; Bullmore, Edward T.; Astle, Duncan E.

doi:10.1038/s41467-021-24430-z

Download PDF

Article
Open access
Published: 09 July 2021

A generative network model of neurodevelopmental diversity in structural brain organization

Nature Communications volume 12, Article number: 4216 (2021) Cite this article

11k Accesses
23 Citations
45 Altmetric
Metrics details

Subjects

Abstract

The formation of large-scale brain networks, and their continual refinement, represent crucial developmental processes that can drive individual differences in cognition and which are associated with multiple neurodevelopmental conditions. But how does this organization arise, and what mechanisms drive diversity in organization? We use generative network modeling to provide a computational framework for understanding neurodevelopmental diversity. Within this framework macroscopic brain organization, complete with spatial embedding of its organization, is an emergent property of a generative wiring equation that optimizes its connectivity by renegotiating its biological costs and topological values continuously over time. The rules that govern these iterative wiring properties are controlled by a set of tightly framed parameters, with subtle differences in these parameters steering network growth towards different neurodiverse outcomes. Regional expression of genes associated with the simulations converge on biological processes and cellular components predominantly involved in synaptic signaling, neuronal projection, catabolic intracellular processes and protein transport. Together, this provides a unifying computational framework for conceptualizing the mechanisms and diversity in neurodevelopment, capable of integrating different levels of analysis—from genes to cognition.

Neurodevelopmental disorders—high-resolution rethinking of disease modeling

Article Open access 25 November 2022

Konstantin Khodosevich & Carl M. Sellgren

Towards a biologically annotated brain connectome

Article 17 October 2023

Vincent Bazinet, Justine Y. Hansen & Bratislav Misic

Longitudinal development of the human white matter structural connectome and its association with brain transcriptomic and cellular architecture

Article Open access 12 December 2023

Guozheng Feng, Rui Chen, … Ni Shu

Introduction

The human brain is highly organized at multiple scales. At the broadest scale, neuronal populations are structurally connected across large anatomical distances with white-matter fiber bundles, forming a set of interconnected networks. This macroscopic organization can be studied via diffusion-weighted magnetic resonance imaging (MRI), which measures the direction of water diffusion in vivo^1,2,3. During childhood the emergence and continual refinement of these large-scale brain networks allows for increasing functional integration and specialization^4,5. This process is thought crucial for the growth of complex cognitive processes such as language⁶ and executive function^{7,8,9,10,11,12}. However, there are individual differences in the organization of these networks across children, and these differences mirror important developmental outcomes. Indeed, differences in macroscopic networks have been implicated across multiple neurodevelopmental conditions¹³, including ADHD¹⁴, autism^15,16, and language disorders¹⁷.

But what mechanisms drive the diversity of macroscopic brain networks? And how do these mechanisms give rise to individual differences in children’s outcomes? There are numerous descriptive theories^{18,19,20,21,22} that speculate about how different levels of analysis (e.g., genes, brain structure, and function) interact to produce these neurodevelopmental differences. However, to date no theories are sufficiently specified that they can simulate individual-level brain networks. In the absence of computational models, it is difficult to establish mechanistic links between individual differences in observations (e.g., gene expression, biological pathways, system wide organization). This theory gap represents a major limitation for understanding neurodevelopmental diversity. The purpose of this study is to address precisely this gap, by modeling the generative wiring properties of a large sample of children at heightened neurodevelopmental risk of poor outcomes. The computational model we implemented is guided by a simple principle: the brain’s structural organization is shaped by an economic trade-off between minimizing wiring costs and adaptively enhancing valuable topological features²³. We hypothesize that the emergence of whole-brain organization reflects the continual trade-off of these factors over time and that tiny differences in the parameters governing the trade-off can produce the neurodiverse outcomes we observe. Somewhat counterintuitively, tight parameter constraints likely enable macroscopic neurodiversity, because large changes in these parameters would produce networks with configurational states that are not observed in reality. Instead, narrow boundaries reflect parameter conditions within which networks can be different, but still maintain adequate structural properties to be functional.

Our work utilizes generative network modeling^24,25, in which connections within a physically embedded network are formed probabilistically over time according to a wide range of potential mathematical constraints. Varying the parameters and wiring rules that govern network formation provides a way of establishing which statistics likely create real networks—in this case structural brain networks in our large heterogeneous sample of children. Specifically, we: (1) tested which topological features should be valued in the wiring trade-off to produce highly accurate individual child connectomes; (2) tested how small changes in these parameters alter the organizational properties of the resulting networks; (3) established relationships between these different wiring parameters and cognitive outcomes; (4) identified genes with expression profiles that were spatially co-located with those topological features; and (5) established the biological pathways that are enriched in these gene lists. Together, this provides a computational framework that mathematically specifies the formation of a network over time, captures individual differences in brain organization and cognition, and incorporates the genetic and biological pathways that likely constrain network formation.

Results

The generative network model

The generative network model (GNM) can be expressed as a simple wiring equation^24,25 (Fig. 1a). If you imagine a series of locations within the brain, at each moment in time the wiring equation calculates which two locations will become connected. It calculates this wiring probability by trading-off the cost of a connection forming, against the potential value of the connection being formed. The equation can be expressed as:

$${P}_{i,j}\propto {({D}_{i,j})}^{\eta }{({K}_{i,j})}^{\gamma },$$

(1)

where D_i,j represents the Euclidean distance between nodes i and j (i.e., “costs”), and K_i,j reflects the value (i.e., “attractiveness”) in forming a connection. P_i,j represents the wiring probability as a function of the product of the parameterized costs and value. The end result is a wiring probability matrix which updates over time as new connections are added.

**Fig. 1: Updating wiring probabilities within the generative network model iteratively, based on dynamically changing graphical structures.**

D_i,j is parameterized by the scalar η, which changes how node-to-node distances influence their probability of connecting. For example, when η is negative, wiring probabilities decline when distances increase, and this reflects the costliness of forming connections with nodes that are distant. This is traded-off against K_i,j, which represents some relationship between nodes, which can be thought of as a topological value (or “rule”) driving the intention for node i to connect with node j. K_i,j is parameterized by a distinct scalar γ. K_i,j can take a range of different forms and can, in principle, be selected from any non-geometric growth rule used to model social and economic networks^26,27,28. One simple example is the “matching” rule²⁴: nodes form connections with other nodes on the basis of their normalized overlap in neighborhood—i.e., whether nodes are connected to similar nodes to themselves (also termed homophily).

To make this more concrete, imagine the following scenario: a network is growing according to the matching rule, preferentially attaching to nodes which are both similarly connected and spatially proximal. In the wiring equation, this would be represented as η being negative (e.g., η = −1), K_i,j represented as normalized neighborhoods between nodes (i.e., matching) and its parameter γ being positive (e.g., γ = 1). In short, a node being far away makes it less likely that a new connection will be formed, but it having a similar a neighborhood increases the likelihood. Suppose that the right caudal anterior cingulate (Node 2, n²) is going to wire to one of its six nearest neighbors. Initially, due to an absent network topology, spatial proximity has a great influence in the formation of new connections—it will wire to its nearest neighbor (Fig. 1b). However, gradually over time, the network’s developing structural topology means that K_i,j (i.e., the relationships between nodes) may now have a greater influence on wiring probabilities. Indeed, the right caudal anterior cingulate may later wire with a node that, although further away than other available nodes, has a greater value (i.e., matching) than the others (Fig. 1c). As the wiring equation separately parameterize costs and value, the presence of a single connection can heavily influence the topology of the network and thus the future updated wiring probabilities. This is because new connections can lead to entirely new overlapping neighbors, which may include distant nodes. As a result, wiring probabilities can change considerably from moment to moment, despite costs remaining fixed (Fig. 1d).

The GNM simulates this process across the whole brain, until the overall number of connections matches those found in the observed brain network. Subsequently, to test the accuracy of the simulation, an energy function, E, must be defined which measures the dissimilarity between simulated and observed networks^24,25:

$$E={\rm{max}}({\rm{KS}}_{k},{\rm{KS}}_{c},{\rm{KS}}_{b},{\rm{KS}}_{e}),$$

(2)

where KS is the Kolmogorov–Smirnov statistic comparing degree k, clustering coefficient c, betweenness centrality b, and edge length e distributions of simulated and observed networks. Minimizing E finds parameters η and γ which generate networks most closely approximating the observed network.

The four measures in the energy equation are good candidates for evaluating the plausibility of simulated networks. They are critical statistical properties of realistic networks and have featured within the most well-documented simulated network models^29,30,31. Moreover, these statistical properties have been implicated in a number of neuropsychiatric conditions^32,33 in addition to being shown to be heritable³⁴.

Small variations in GNM parameter combinations produce accurate and spatially embedded networks

From a basic seed network common to all participants (for detail, see Supplementary Fig. 1 and Methods), we computed the subject-wise optimal GNM (i.e., network with lowest energy) over a range of 10,000 evenly spaced parameter combinations (−7 ≤ η ≤ 7, −7 ≤ γ ≤ 7) using 13 different generative rules (for rule formulae, see Supplementary Table 1) across our large sample of children (N = 270, 178 males, 92 females, mean age = 9 years 10 months, SD age = 2 years 2 months; full sample details can be found at Holmes et al.³⁵). In each case, we computed energy landscapes to contextualize how they perform (Fig. 2a–d). Mirroring findings in adult samples^24,25,35, we found that models driven by geometry and topology outperform the pure geometric spatial model and homophily-based models achieve the lowest energy for our pediatric sample (Fig. 2e). In other words, when one combines the distance penalty with the “matching rule” we described in our concrete example (as shown in Fig. 1), it produces the most plausible simulated brain networks. This difference between generative rules is extremely robust. A post hoc power calculation revealed that the homophily-based rules could be distinguished from the next best class with near-perfect statistical power (t = −10.210, p = 6.705 × 10⁻²¹; N = 270, power > 0.99), and that this difference could be detected with around 70 participants.

**Fig. 2: Sample-averaged energy landscape visualization and generative rule comparisons.**

It is notable that across the matching energy landscape, these plausible networks exist within an extremely narrow window of parameter solutions. That is, as a proportion of the parameter space, the matching rule (and the other homophily-based model “neighbors”) contain the least number of low-energy networks relative to other rules. But as Fig. 2e shows, these networks are the closest to real networks. Thus, varying homophily-based parameters produces the most realistic networks, yet has the lowest variability in the space (Supplementary Fig. 2).

While small, variability within this narrow matching window determines inter-individual differences in brain network growth. This is because small changes in parameters (i.e., the magnitude and direction in which costs and values influence wiring probabilities) can lead to networks which are diverse yet include basic structural properties common to all subjects. To derive more precise estimations of optimal generative parameter combinations, we subsequently generated a new set of 50,000 evenly spaced simulated networks over this narrow low-energy matching window (−3.606 ≤ η ≤ 0.354, 0.212 ≤ γ ≤ 0.495). Focusing on this energy crevasse allows us to detect individual differences in optimal parameter combinations with much greater specificity. In other words, we resampled the parameter combinations focusing within the low-energy window, to make sure we have the most precise estimate of each individual child’s optimal parameters.

In Fig. 2f, we show the spatial distribution of these top performing parameter combinations and Supplementary Table 2 documents their summary statistics. These finely calibrated networks are even more low energy than in the previous analysis. In Supplementary Fig. 3a–d we detail how KS statistics vary across the same space. Importantly, due to the stochastic nature of GNMs, the energy of optimal parameter combinations varies with an average standard deviation (SD) of 0.045 across the sample (1000 independent runs). Therefore, for the rest of this study, we quote our parameter analyses averaged across a variable number of wiring parameters which achieved networks with the lowest energy in the space: N = 1 (equating to 0.002% of the space) N = 10 (0.02%), N = 100 (0.2%), and N = 500 (1.0%).

The optimal η and γ parameters are significantly negatively correlated with each other, such that subjects with large γ parameters tend to have larger negative η (Best N = 1 network: r = −0.284, p = 2.07 × 10⁻⁶; N = 10 networks: r = −0.403, p = 6.08 × 10⁻¹²; r = −0.460, p = 1.58 × 10⁻¹⁵; N = 100 networks: p = −0.460, p = 1.58 × 10⁻¹⁵, N = 500 networks: r = −0.497, p = 3.21 × 10⁻¹⁸) (Supplementary Fig. 3f). Optimally simulated networks, using this simple wiring equation, are so similar to the actual networks that a support vector machine is unable to distinguish them using the parameters from the energy Eq. (2) (mean accuracy = 50.45%, SD = 2.85%).

Replicating previous work, we find that our simulated networks, optimized via the statistical properties included in the energy Eq. (2) via homophily generative mechanisms, accurately capture these properties in observed networks^24,25,36. But do these capture crucial network properties not included in the energy equation, like their spatial embedding? We next examined if the spatial patterning of these network properties arises simply from the generative model.

Averaged across the sample, optimally performing generative models (i.e., those using the “matching” rule) produce networks which significantly correlate with observed networks in terms of their degree (r = 0.522, p = 4.96 × 10⁻⁵), edge length (r = 0.686, p = 1.11 × 10⁻¹¹), and betweenness centrality (r = 0.304, p = 0.012) but not clustering coefficient (r = −0.054, p = 0.663) (Fig. 3). That is, the spatial embedding of these network properties seemingly emerges, to mirror those of the observed networks, despite this not being specified in the growth process. We extended this analysis to new measures outside of the energy equation (Supplementary Fig. 4). While local efficiency and assortativity cannot be significantly predicted across the sample (r = 0.211, p = 0.084 and r = −0.096, p = 0.116, respectively), optimally performing simulated and observed networks correlate positively in terms of their global number of rich clubs (r = 0.316, p = 1.11 × 10⁻⁷), maximized modularity (r = 0.349, p = 3.84 × 10⁻⁹), and transitivity scalar (r = 0.411, p = 2.11 × 10⁻¹²). In short, despite not being specified in the growth process, the simple homophily rule generates many properties of observed brain networks.

**Fig. 3: Spatial embedding of simulated networks grown via optimized homophily generative mechanisms.**

One criticism of our simulations is that their embedding may be an artifact of the seed network (which is on average 10.8% the density of the observed/simulated networks). In short, if by chance the seed network mirrors the final network, it could be inevitable that spatial embedding would emerge, in terms of node degree, betweenness centrality and edge length. If so, one would expect initial local statistical properties of the seed to be significantly associated with regional accuracy of the simulation. To determine if this is the case, we analyzed the regional accuracy of our homophily simulations by determining their regional generative error (as a mismatch between observed and simulated outcome, depicted in Supplementary Fig. 5). Importantly, the average ranked error (Supplementary Fig. 5e) is not correlated with the seed network’s connectivity (r = −0.0711, p = 0.5643). Furthermore, seed features do not correlate with their own feature’s resultant error (Degree, r = −0.0408, p = 0.7410; Betweenness, r = 0.1833, p = 0.1345; Edge length, r = 0.1114, p = 0.3659).

The large heterogenous sample we chose is ideal for this computational approach to understanding diversity, but it is highly likely that this approach will work for more standard typically developing cohorts of children. In Supplementary Figure 6 we replicate our key findings in an independent sample of N = 140 children recruited from local primary schools in the same area (for more details about the cohort, see “Methods”; “Participants” and Johnson et al.³⁷).

Individual differences in wiring parameters mirror connectome organization, gray matter morphology, and cognitive scores

A critical benefit of a generative modeling approach is that it allows us to probe the underlying mechanisms occurring over the development of the network³⁸. As one explicitly specifies the generative mechanisms involved, the statistical properties which “fall out” of the network can be considered as spandrels³⁹; epiphenomena of the network’s development according to the much simpler economical trade-off (in this case, according to the homophily principle). If the generative model is indeed capturing biologically relevant processes, one would expect the wiring equation—at a minimum—to reduce the network’s dimensionality simply into the two wiring parameters used to construct it. Under this view, individual differences in wiring parameters should (1) map to wide-ranging statistical properties of the observed network, which may be considered spandrels and (2) appropriately reduce dimensionality of the connectome such that, for example, one can equivalently predict cognitive scores from parameters as one would be able to from the network properties.

To explore this, we first examined how wiring parameters reflect observed features of brain organization by quantifying how a subject’s η and γ relate to global measures of their observed connectome. Furthermore, for all 270 subjects, cortical morphology data were available. In Fig. 4, we document how global network and morphological measures (most of which are not included in the energy equation) relate to each other, in addition to their reasonably stable association with a varying number of high performing η and γ wiring parameters (specific results are provided in Supplementary Table 3), and their associations with age. Figure 4b shows that η is significantly associated with age—the network parameters needed to form optimal networks over time need to favor longer distance connections for older participants, relative to younger participants²⁵. To disentangle age-related parameter differences from individual differences, we repeated all of our correlations across measures whilst partialling out age (Supplementary Fig. 7). Associations remain when age has been controlled for, demonstrating that age-related changes in optimal parameters are relatively independent of the individual differences in those parameters. This is not only an important step in demonstrating that these parameters generalize to distinct measures (e.g., morphological observations) not used to train the generative models, but also demonstrates that the generative approach is consistent with the notion that wiring parameters themselves have significant associations with numerous statistical properties in a network.

**Fig. 4: Statistical properties of the connectome and cortical morphology, and their relationships with wiring parameters and age.**

Next, we tested the ability of the wiring equation to reduce the dimensionality of the connectome. Specifically, if wiring parameters are accurate decompositions of an individual’s structural network they should predict cognitive outcomes equivocally to observed features of the connectome. For all 270 subjects we had data from a battery of cognitive tasks, including measures of executive function, phonological awareness, working memory, fluid reasoning and vocabulary (for details of the tasks see “Methods”; “Cognitive and learning assessments”).

We tested the relationship between a subject’s age-standardized cognition and (1) their optimal wiring parameters and (2) global measures of their structural connectome (using measures included in the energy equation). This was done using partial least squares (PLS), a multivariate statistical technique which extracts optimally covarying patterns from two data domains⁴⁰. We undertook two separate PLS analyses, which correlated (1) optimal wiring parameter combinations or (2) global connectome measures across our sample, with cognitive performance in the nine tasks, respectively (Fig. 5a). For both analyses, PLS1 was significant in the amount of explained covariance (p_cov = 0.009 and p_cov = 0.049, respectively). PLS1 score predictions, and their cognitive loadings, are extremely similar between wiring parameters and connectome features (r = 0.191, p = 1.63 × 10⁻³, p_corr = 7 × 10⁻⁴ and r = 0.210, p = 5.29 × 10⁻⁴; each p_corr = 7 × 10⁻⁴ and p_corr 4 × 10⁻⁴, respectively, from 10,000 permutations of scores) (Fig. 5b, c).

**Fig. 5: Covarying patterns of wiring parameters and connectome features with cognitive performance across nine cognitive tasks.**

Variability in neurodevelopmental trajectories arises through value-updating over time

While small generative parameter differences result in differential network properties, we have yet to show how this variability may occur over the development of the networks. That is, how do differences in parameter combinations across subjects manifest themselves when the network is developing? To address this, we examined how between-subject variability in optimal GNMs emerge at the level of cortical nodes and their connections. This is possible by simply decomposing the optimal simulation into its constituent parametrized costs (D_i,j)^η, values (K_i,j)^γ, and wiring probabilities (P_i,j) at each time point, for each subject (Fig. 6a, b). This allows us to quantify growth trajectories and thus establish which aspects of network emergence vary most in the sample.

**Fig. 6: Wiring Eq. (1) decomposition and the subsequent variability across subjects in our heterogeneous sample.**

For each subject, we computed the coefficient of variation (CV, σ/μ) of their parameterized costs, matching values and wiring probabilities to compare subject-specific variability, as it emerges throughout the simulated growth of connectomes. While subjects exhibit some variability in how parameterized costs influence wiring probabilities (mean CV 2.27), this is dwarfed by their parameterized values over time (mean CV 33.02). This is because the matching value is dynamic, changing at each iteration (as in Fig. 1d), unlike relative Euclidean distance between nodes which is static. The result is that significant inter-individual variability arises in the probability of connections forming (mean CV 53.08), leading to the emergence of divergent brain organization (Fig. 6c–f). Furthermore, the regional patterning of costs and values is not random (Fig. 6g). Nodes and edges with high matching values decline in their variability, suggesting a consistency across subjects in highly “attractive” nodal structures and their connections. Across the sample, cheaper regions occupy the medial aspects of the cortex while highly valuable regions generally reside in the temporal cortex.

Genomic patterning of network growth

Underlying these macroscopic changes in brain organization across time are a series of complex molecular mechanisms. These are partly governed by genetically coded processes that vary across individuals. We next tested whether these processes may steer the brain network toward a particular growth trajectory within our GNMs.

Nodal cost and nodal “matching” value patterning alongside regional gene expression profiles of 10,027 genes using human adult brain microarray data^41,42 were integrated into two PLS analyses for each subject. For all analyses, gene expression scores at each node were used as the predictor. For each subject’s first analysis, their parameterized nodal costs (calculated as the subject’s ∑ (D_i,:)^η, as visualized Fig. 6g; fourth row, right) was used as the response variable. For each subject’s second analysis, their mean parameterized values (calculated as the subject’s (∑ (K_i,:)^γ averaged over time, as visualized in Fig. 6g; second row, right) was used as the response variable. Each analysis defined PLS components independently which were linear combinations of the weighted gene expression scores at each node (predictor variables) that were most strongly correlated with the subject’s nodal costs and nodal values of their simulated growth trajectory. To limit the variability across regions in terms of the samples available, only left hemispheric gene data were analyzed⁴².

Across our sample, the first PLS component (PLS1) explained on average 65.0% (SD 1.3%) and 56.9% (SD 9.2%) of the covariance between genetic expression and nodal costs, and nodal values, respectively. The average nodal costs PLS1 score significantly correlates with average nodal costs (r = 0.794, p = 2.07 × 10⁻⁸, p_corr = 2 × 10⁻⁴). Similarly, the average nodal values PLS1 score significantly correlates with average nodal values (r = 0.718, p = 1.71 × 10⁻⁶, p_corr = 1 × 10⁻⁴) (Fig. 7a, b). To then characterize the genetic profiles associated with each PLS analysis, we permuted the response variable 1000 times to form a null distribution for the loading of each gene, across each subject’s PLS1. This provides an estimate of how strong the loading would be by chance, and thus which genes exceed p_corr < 0.05. Across subjects, PLS1 provided an average of 581.5 significant genes (SD 101.4) for nodal costs and 437.6 significant genes (SD 167.4) for nodal values (Supplementary Fig. 8a).

**Fig. 7: Over expressed genes which explain variance in brain wiring across subjects.**

Genes do not act in isolation, but instead converge to govern biological pathways across spatial scales. To move from individual genes to biological processes (BPs) and cellular components (CCs), we performed a pathway enrichment analysis⁴³. Pathway enrichment analysis summarizes large gene sets as a smaller list of more easily interpretable pathways that can be visualized to identify main biological themes. Genes were ordered according to their frequency in being significantly associated with connectome growth across subjects for that component. For example, for nodal values PLS1, top of the list was the gene associated with connectome growth in the most subjects (CHI3L1; significant for 49.4% of our sample), the next was the second most frequent gene (PRKAB2; 36.4% of our sample) and so on. Our list stopped when genes were significant for <10% of the sample. This left the nodal costs PLS1 with a list of 1427 genes and the nodal values PLS1 with a list of 1584 genes ordered in terms of importance, which were submitted to pathway enrichment analysis in g:Profiler (https://biit.cs.ut.ee/gprofiler/gost) (Supplementary Fig. 8b)⁴³. g:Profiler searches a collection of gene sets representing GO terms. In the ordered test, it repeats a modified Fisher’s exact test on incrementally larger sub-lists of the input genes and reports the sub-list with the strongest enrichment. Multiple-test correction is applied to produce an adjusted p value (p_adj) for each enrichment^43,44 (as visualized in Supplementary Fig. 8c, d, which can be accessed via the links presented in Supplementary Table 4).

The genes identified within the subject-wise PLS are not random, but instead converge on particular BPs and CCs. The nodal costs PLS1 was most prominently enriched for genes associated with BPs including catabolic processes and protein localization (32 BPs; all p_adj < 9.58 × 10⁻³), cell projection (14 BPs; all p_adj < 4.39 × 10⁻²), immunological processes (34 BPs; all p_adj < 4.82 × 10⁻²), regulation of metabolic processes (8 BPs; all p_adj < 4.75 × 10⁻²), and regulation of cell development and differentiation (4 BPs; all p_adj < 3.87 × 10⁻²). In terms of CCs, nodal costs PLS1 was enriched for genes associated with the ribosome (14 CCs; all p_adj < 2.15 × 10⁻²), vesicular and endoplasmic membranes (19 CCs; all p_adj < 4.90 × 10⁻²) and intracellular organelles (8 CCs; all p_adj < 4.97 × 10⁻²) (Fig. 7c).

The nodal values PLS1 was most prominently enriched for genes associated with BPs including synaptic signaling (29 BPs; all p_adj < 3.96 × 10⁻²), neuronal projection and development (26 BPs; all p_adj < 4.21 × 10⁻²), and synapse organization (2 BPs; all p_adj < 2.92 × 10⁻²). In terms of CCs, nodal values PLS1 was enriched for genes associated with synaptic membranes (60 CCs; all p_adj < 3.15 × 10⁻²) and ion channel complexes (7 CCs; all p_adj < 1.18 × 10⁻²) (Fig. 7d).

In Supplementary Table 4 we provide links so that readers can run our precise gene ontology (GO) queries within a browser and in Supplementary Fig. 8c, d we show a visualization of these enriched gene sets.

Discussion

Diversity in macroscopic human brain organization can be modeled using a generative network. The generative framework does not include time itself as an explicit parameter, but instead models it as a sequence of processes, optimizing its connectivity by renegotiating its costs and value^24,25 continuously over iterations. Despite the simplicity of this equation, it results in the dynamic updating of wiring probabilities over time, with multiple network properties, like spatial embedding, being an emergent property of this dynamic updating. This resonates with theoretical perspectives that implicate dynamic interactions between brain systems over development in progressive, integrative, specialization⁴⁵. We have formalized this process in the context of neurodevelopmental diversity; offering a new perspective on the formation of organized macroscopic networks, their possible biological underpinnings, and their association with functional outcomes like cognitive performance. This reflects a theoretical step-change in understanding diversity in neurodevelopment, being sufficiently well-specified to generate macroscopic brain networks. In turn, this formalization allows for the unpacking of the computational and/or biological constraints that shape the trajectories of networks. Indeed, we anticipate that GNMs may be a powerful tool to model real and biologically feasible artificial networks across many scales.

Small changes in wiring parameters of the GNM lead to divergent macroscopic brain networks, with systematically different network properties. Within the model, the key factor that drives individual differences in growth trajectory is the dynamic nature of updating preferences over time. Specifically, as nodes form new connections this dynamically changes their neighborhoods, and in turn this quickly changes which nodes become “attractive” for subsequent connections. Importantly, individual differences in this process correspond significantly to independent structural data of the same individuals.

Why do the homophily-based generative rules approximate whole-brain networks so well? We propose that the superordinate goal of any developing brain network is to achieve the optimal computational capacity required of it, given finite biological resources. In this light, we suggest that matching produces the lowest-energetic networks precisely because it provides the closest heuristic estimate (compared to those tested here and in other works^24,25,36) of the genuine dynamic reappraisals that occur over developmental time.

This is because by virtue of preferentially wiring with nodes with shared neighborhoods modular architectures emerge⁴⁶, and this reflects the brain’s overarching structure. The modular architecture of the brain has been well studied, and has numerous properties enabling effective flexible computations likely important for functional integration⁴⁷. By virtue of only requiring knowledge of neighborhood overlap, homophily-based methods may incur less informational costs⁴⁸ relative to other methods which require global information, and therefore may be more biologically plausible. It is of note that homophily is a measure somewhat akin to network communicability—another locally knowable measure containing information that closely relates to the shortest path⁴⁹. Finally, a tentative explanation for homophily at the neuronal level can also be provided in terms of Hebbian-like plasticity^24,50 This opens up the exciting possibility for these generative models to model developing neuronal cultures⁵¹ to explore whether similar wiring principles may operate across scales.

Our current GNMs operate at a whole-brain level—i.e., a global set of rules governing network formation. But with more biologically realistic information about regional differences it is possible that an alternative growth model could be fitted⁵². Unlike a generative model, a growth model captures the graded changes of an established network over time. This would allow for time itself to be incorporated as a parameter within the model, making regionally and temporally sensitive modifications to network growth and therefore encompass multiple longitudinal measurements of the same individual over a biologically meaningful timescale. The only work we know of to have done this is by Nicosia et al.⁵² in which data from Caenorhabditis elegans (with birth times of ~300 neurons) where integrated into a growth model to reproduce the developing C. elegans network and it is bi-phasic growth rate. With sufficient detailed information it may be possible in future to take a similar growth modeling approach in humans. Regional variation in nodal costs and values closely mirrored the expression profiles for different sets of genes, which in turn govern different BPs and CCs. Since the advent of genome-wide association studies (GWAS), a huge number of genes have been implicated in developmental disorders, including schizophrenia⁵³ and autism⁵⁴, but also general cognitive functioning⁵⁵. It has been challenging to interpret the consequences of these individual implicated genes. The enrichment analysis that accompanied our GNM takes a very different approach. As far as we are aware, this is the first study aiming to bridge models of whole brain organizational emergence and genetics in this way (for work utilizing various generative models, see^{24,25,36,50,56} and for work that integrates Allen Human Brain Atlas gene data with functional and structural brain imaging, see^{57,58,59,60,61}). Nodal costs covaried with genes enriched for highly costly metabolic processes, including catabolic processes, protein transport and CCs centered around the ribosome and endoplasmic membranes. On the other hand, nodal values covaried with genes enriched for trans-synaptic signaling, neuronal projection and the synaptic membrane. This aligns with recent findings that synaptic genes also colocalize with highly synergistic regions of the brain, which have been suggested to be crucial for human cognitive evolution⁶¹.

The omnigenic model⁶² suggests that complex traits are driven by genes that do not have direct effects on the trait per se, but instead propagate through regulatory networks on much smaller numbers of core genes, with more direct effects. This model explains the vast number of GWAS hits for complex traits, as “peripheral” genes necessarily outnumber “core” genes and thus the sum of their small effects exceeds the contribution of core genes. We suggest the omnigenic model may apply to some aspects of gene-development relationships. That is, the many genes that contribute to each PLS1 may not directly contribute to developmental processes themselves, but in the regulation of activity and growth within brain areas that are particularly important for neurodevelopment. Crucially there is variability in enriched genes across subjects (Supplementary Fig. 8a).

Our sample is a large mixed cohort of children, the majority of whom were referred from specialists in children’s educational and clinical services. This was the ideal testbed for exploring diverse trajectories. The varied referral routes for the cohort makes its composition more reflective of children at heightened neurodevelopmental risk, relative to a more standard case-control design recruited according to strict diagnostic inclusion and exclusion criteria, via a single referral route⁶³. But it is important to note that the modeling also works well in a more typical sample recruited from classrooms in the same area. So, whilst the CALM cohort is ideal for exploring the mechanisms of heterogeneity, these same mechanisms are likely at play in more typical samples. Indeed, this work presents a challenge to the long history of categorizing neurodevelopment into discrete groupings based on observed cognitive and/or behavioral traits. Instead, we suggest divergent outcomes may arise via slight trajectory changes that fall out of the continual negotiation of brain connectivity optimization. While likely that generative preferences are initialized via an individual’s genetic preprograming, small changes in wiring preferences over time—possibly via complex interactions of their time course, endocrinological exposure, learning and environment—have profound effects on the emergence of the developmental trajectory. What results is a continual interaction between network growth preferences and the dynamically developing brain, leading to neurodiverse outcomes.

Whilst our sample is designed to capture children at risk, the findings generalized to a more typical sample, with an even split of boys and girls, recruited in local schools. This suggests that this computational approach could be a powerful tool for developmental scientists more generally. The advent of larger datasets is allowing the study of the developing brain at unprecedented scale and across multiple levels^64,65. Computational frameworks that allow the integration of different datatypes (e.g., multiple imaging modalities, genetic variability) could provide a valuable tool for building developmental theory that goes beyond correlating different datatypes over time, and fully capitalizes on the scale and complexity of those datasets. In the future, this approach could be used to test different theoretical accounts of developmental change, and to make longitudinal predictions where multiple waves of data are available. To realize these opportunities, a crucial next step is to deploy this type of computational modeling to capture the diversity present in larger population-level neuroimaging datasets, with longitudinal data^64,65,66 spanning multiple sites.

This computational framework has a number of limitations that provide scope for future improvements. Our generative models are limited to the binary connections which are assumed to be anatomical. This is inevitably a gross simplification of the complex weighted structure of the connectome. Devising ways in which network connections can change in a more graded fashion is a necessary next step to modeling more complete developmental processes. In the future we will need to capture both the strengthening and weakening of connections that has been shown to occur in human brain development^67,68. Secondly, we currently use one rule, but it is conceivable that different rules govern growth at different points in the trajectory. It may be possible to accurately approximate the rules governing the remodeling of networks over time, modeled either by changing heuristic estimates (e.g., changing of generative rules over time) or attempting to optimize a superordinate goal (e.g., computational efficiency and/or flexibility). Thirdly, our gene enrichment results are correlational, not causative. There remains an explanatory gap in determining whether and how these specific gene profiles support the sensitivity to connection formation. And crucially, the expression data are derived for a microarray analysis of postmortem tissue samples from human adults^41,42. Moreover, while RNA-seq data were used to cross-validate gene expression measures by removing probes with a correlation <0.2, it was not possible to externally validate the data with an independent dataset. Caution is therefore required when interpreting our enrichment analysis due to this lack of external validation⁶⁹. The next steps will involve validating these findings in large-scale developmental cohorts with available gene data, and forming casual links by applying GNMs to individuals with neurodevelopmental disorders of known genetic origin^58,59,70. Fourthly, we have used a parcellation widely used in developmental studies^{71,72,73,74,75}, which aids the comparability of our findings with the wider literature. Previous work has shown that parcellation choice is not a big determent of optimal wiring parameters²⁵. But within the context of a developmental sample there could be subtle differences in the optimal parcellation across participants, and thus the production of individually optimized parcellations^76,77 would allow us to test whether and how developmental change in the parcellation itself influence wiring properties. Finally, as in previous studies^24,25,36, our models utilize Euclidean distance as measures of cost in connection formation. While a simplification, this selection removes any a priori constraints to the generative model that a more biologically specified cost measure may provide (e.g., fiber lengths, which are sparse and thus limit potential connections). A number of studies have shown relatively inconsistent findings in terms of how much Euclidean distance accounts for fiber length, with findings ranging from 22% (RED, in this work) to 79% (Human Connectome Project (HCP)) of variance explained. There may be cohort, parcellation, and tractography effects influencing these relationships. Whilst interpolated fiber lengths have been shown to perform equivalently to Euclidean distances²⁵ within GNMs, it is important to consider that in our modeling these two measures only partially overlap. A crucial next step is to test whether microstructural informed tractography⁷⁸, which may provide a more direct measure of biological cost, improves model performance.

In conclusion, we provide a unifying computational framework for conceptualizing the emergence of structural brain networks and their diversity. The emergence of brain networks can be understood as occurring via continual renegotiations of costs and values, but individuality emergences from their slightly different parameterization.

Methods

The methodological workflow is summarized in Fig. 8.

**Fig. 8: Schematic of the methodological workflow.**

Participants

The sample were made up of children referred by practitioners working in specialist educational or clinical services in the East of England (UK) to the Centre for Attention Learning and Memory (CALM), a research clinic at the MRC Cognition and Brain Sciences Unit, University of Cambridge (see Holmes et al.³⁵ for the full protocol of assessment, and refs. ^{9,10,11,12,13} for prior work using the same cohort). The composition of this cohort is design to be broadly reflective of children at heightened neurodevelopmental risk for poor developmental outcomes. Consent was obtained from parents and assent was obtained from all youngsters. The study protocol was approved by, and data collection proceeded under the permission of, the local NHS Research Ethics Committee (reference: 13/EE/0157). This cohort of children is intentionally heterogenous. Referrers were asked to identify children with cognitive problems related to learning, with primary referral reasons including difficulties with ongoing problems in “language”, “attention”, “memory”, or “learning/poor school progress”. Exclusion criteria were uncorrected problems in vision or hearing, English as a second language, or a causative genetic diagnosis. Children could have single, multiple, or no formally diagnosed learning difficulty or neurodevelopmental disorder. Most referrals were made from Special Educational Needs Coordinators (57.0%), followed by Pediatricians (24.1%) and Speech and Language Therapists (4.2%) (Supplementary Fig. 9a, b). Subsequently the CALM team supplemented the cohort with a smaller set of unreferred children, recruited from the same schools and neighborhoods, so that the cohort captures the full ability spectrum. The CALM cohort contains n = 967 total children (N = 805 referred; N = 162 unreferred). Of these, N = 299 undertook MRI scanning of which N = 279 had usable MRI data (see “MRI acquisition and preprocessing”). N = 270 of these had cognitive data available (see “Cognitive and learning assessments”) (see Supplementary Fig. 9c for a visualization of the cognitive variability across the cohort). This sample includes 65.9% boys, mean age 117.8 months, age range was 66–223 months and 78 that came from the non-referred comparison sample. The increased ratio of boys to girls is what we would expect from epidemiology studies of children at neurodevelopmental risk of poor learning or clinical outcomes⁷⁹.

To validate our modeling, we also included a second cohort of n = 140 typically developing children, who had been recruited from local schools (the RED cohort; mean age 9.34 years, SD age 1.41 years, range 6.82–12.8 years, 45.7% boys). These data were collected under the permission of the Cambridge Psychology Research Ethics Committee (references: Pre.2013.34; Pre.2015.11; Pre.2018.53). Parents/legal guardians provided written informed consent and all children provided verbal assent. This second dataset was previously reported by Johnson et al.³⁷. In Supplementary Table 5, we provide all demographic information of the cohorts used in the study. Race and ethnicity data for the CALM and RED cohorts are not yet available, but are to be published³⁵.

MRI acquisition and preprocessing

MRI data were acquired at the MRC Cognition and Brain Sciences Unit in Cambridge, on the Siemens 3 T Prisma-fit system (Siemens Healthcare) using a 32‐channel quadrature head coil. T1‐weighted volume scans were acquired using a whole brain coverage 3D Magnetization Prepared Rapid Acquisition Gradient Echo sequence acquired using 1 mm isometric image resolution. Echo time was 2.98 ms, and repetition time was 2250 ms. Diffusion scans were acquired using echo‐planar diffusion‐weighted images with an isotropic set of 68 noncollinear directions, using a weighting factor of b = 1000 s mm⁻², interleaved with 4 T2‐weighted (b = 0) volume. Whole brain coverage was obtained with 60 contiguous axial slices and isometric image resolution of 2 mm. Echo time was 90 ms and repetition time was 8500 ms. Both CALM and RED samples underwent the same scanning protocol. N = 299 CALM and N = 167 RED children underwent MRI scanning. Twenty (CALM) and 27 (RED) scans were not useable due to excessive motion (>3 mm movement during the diffusion sequence estimated through FSL eddy), leaving an MRI sample of N = 279 CALM and N = 140 RED children, respectively.

Connectome construction and cortical morphology

MRI scans were converted from the native DICOM to compressed NIfTI‐1 format. Next, correction for motion, eddy currents, and field inhomogeneities was applied using FSL eddy. Furthermore, we submitted the images to nonlocal means de‐noising⁸⁰ using DiPy v0.11⁸¹ to boost signal‐to‐noise ratio. A constant single angle model was fitted to the 60‐gradient‐direction diffusion‐weighted images using a maximum harmonic order of 8 using DiPy. Whole‐brain probabilistic tractography was performed with 8 seeds on all voxels. The step size was set to 0.5 and the maximum number of crossing fibers per voxel to 2. For ROI definition, T1‐weighted images were submitted to nonlocal means denoising in DiPy, robust brain extraction using ANTs v1.9⁸², and reconstruction in FreeSurfer v5.3 (http://surfer.nmr.mgh.harvard.edu). Regions of interest (ROIs) were based on the Desikan–Killiany parcellation of the MNI template⁸³ with 34 cortical ROIs per hemisphere. FreeSurfer v5.3 was used for tissue classification and anatomical labeling. The technical details of these procedures are described elsewhere^84,85,86. FreeSurfer morphology statistics were computed for each ROI.

To construct the connectivity matrix, the number of streamlines intersecting both ROIs was estimated and transformed into a density map for each pairwise combination of ROIs. A symmetric intersection was used so that streamlines starting and ending in each ROI were averaged. Self-connections were removed. To produce binarized connectomes from the resulting 68-by-68 streamline matrix, we enforced an average connectome density of ρ = 10% (as in Betzel et al.²⁵), resulting in a streamline threshold of 27 streamlines (i.e., a minimum of 27 streamlines must have connected two regions for us to consider the presence of an anatomical connection).

Generative network models

Starting with a sparse seed network (25 bi-directional edges that were common across all N = 270 subjects), edges were added one at a time over a series of steps until a total number of connections were placed that equaled that of the target observed connectome (group level connections, mean = 231.4 and SD = 19.1) (As shown in Supplementary Fig. 1d). The same process was separately done for the validation cohort. Each step allows for the possibility that any pair of unconnected nodes will be connected. Connections are formed probabilistically, where the relative probability of connection formation, between nodes i and j, is given by Eq. (1). We used 13 previously studied non-geometric rules^24,25 to produce energy landscapes. D_i,j was defined as the Euclidean distance between node centroids. (D_i,j)^η was computed as a power-law, as shown previously to have better performance than exponentials²⁴. Euclidean distance accounts for 24 and 22% of the variance in fiber length in the CALM and RED samples respectively. These estimates are relatively low by comparison with other cohorts, with previous studies showing Euclidean distance to account for 32% (Nathan Kline Institute, Rockland, New York), 66% (CHUV; University Hospital Center and University of Lausanne), and 79% (HCP) of the variance in fiber length²⁵. Topological parameters were computed using our own internally developed functions adapted from the Brain Connectivity Toolbox (https://sites.google.com/site/bctnet/)⁸⁷.

To evaluate the fitness of synthetic networks and optimize models, we defined an energy function that measures how dissimilar a synthetic network is to the observed network as defined by Betzel et al.²⁵. This is given in Eq. (2). Initially, we ran simulations across a defined a parameter space of 10,000 evenly spaced combinations (−7 ≤ η ≤ 7, −7 ≤ γ ≤ 7), for each generative rule (Fig. 2a–d). Parameter ranges were based on approximate values that had been evaluated in previous work^24,25,36. This was to capture their respective energy landscapes and to estimate their relative effectiveness at generating plausible networks. We then computed a further a set of 50,000 simulations within a much narrower low-energy window (−3.606 ≤ η ≤ 0.354 and 0.212 ≤ γ ≤ 0.495) of the matching algorithm (Fig. 2f) for all subsequent analysis. This is because the matching algorithm attained the lowest-energy networks and therefore best approximated individual-level connectomes. In Supplementary Fig. 2, we computed the SD of the top n = 500 performing (lowest energy) wiring parameters to determine how variable the solutions are.

Cognitive and learning assessments

A large battery of cognitive, learning, and behavioral measures was administered in the CALM clinic³⁵. N = 9 CALM children did not have available cognitive data (of the N = 279 MRI sample) and were therefore excluded, leaving the final sample N = 270 children. These children had no missing data. All cognitive scores were age-standardized, controlling for age. For full details of the processing of cognitive data, see Siugzdaite et al.¹³.

The following nine measures of fluid and crystallized reasoning were included: Matrix Reasoning, a measure of fluid intelligence⁸⁸ (Wechsler Abbreviated Scale of Intelligence); Peabody Picture Vocabulary Test⁸⁹. Phonological processing was assessed using the Alliteration subtest of the Phonological Awareness Battery⁹⁰. Verbal and visuo‐spatial short‐term and working memory were measured using Digit Recall, Dot Matrix, Backward Digit Recall, and Mr X subtests from the Automated Working Memory Assessment^91,92. Learning measures (literacy and numeracy) were taken from the Wechsler Individual Achievement Test II⁹³ and the Wechsler Objective Numerical Dimensions⁹⁴, apart from 78 of controls for which we used multiple subtests from the Woodcock Johnson for Verbal ability⁹⁵.

Gene expression data

Regional microarray expression data were obtained from six postmortem brains provided by the Allen Human Brain Atlas (http://human.brain-map.org/)^41,42 which, as far as we are aware, is the only publicly available human 3D brain map of the transcriptome which covers the full cortex. Of note, other datasets such as the BrainSpan Atlas for the Developing Human Brain (http://brainspan.org) are available, although currently covering 16 ROI.

The Allen Human Brain Atlas dataset is based on microarray analysis of postmortem tissue samples from six human donors aged between 18 and 68 years with no known history of neuropsychiatric or neurological conditions. Data were imported from Arnatkevičiūtė et al.⁴². Since only two of the six brains included samples from the right hemisphere, analyses were conducted on the left hemisphere only. Probes where expression measures do not exceed the background in >50% samples were removed and genes that did not have a corresponding RNA-seq measure were removed. To improve validity of the gene dataset by cross-validating expression measures, probes with a Spearman’s correlation <0.2 with external RNA-seq data were removed, and a representative probe with the highest correlation to RNA-seq data was selected for each gene. Sample assignment was computed by applying a 2 mm distance threshold. In total, a mean of 37.8 ± 22.5 (SD) samples were assigned to each ROI (min = 5; max = 92)⁴².

The fully preprocessed gene data comprised of a 34 by 10,027 matrix of microarray array gene expression data. These data were used for a subsequent PLS analysis (see “Statistics”; “PLS analysis”) and pathway enrichment analysis (see “Gene enrichment analysis and visualization”).

Statistics

Predictions of spatial embedding

To assess the performance of the optimal matching GNMs to produce networks with spatial embedding of topological characteristics, we averaged across each subject’s best performing simulation (which achieved the lowest energy; descriptive statistics shown in the top row of Supplementary Table 2) to produce a single 68 (ROIs) by one vector for each measure. We did the same for their observed connectomes. Figure 3 shows their linear correlations. In Supplementary Fig. 4a, we run the same process for local efficiency (not included in the energy equation). For Supplementary Fig. 4b–e, we correlate the same networks, but not for spatial embedding as these are global network measures outside of the energy equation. Subsequently, we determined the generative error (i.e., mismatch) for the simulations in terms of each statistical network property within the energy equation at the node-level. This was done as a simple subtraction of the observed from the simulated network (as shown in Supplementary Fig. 5). The absolute mean ranked error was then calculated by taking the modulus of the generative error, averaging across the four statistical properties, and taking the rank order. All network measures were calculated using functions from the Brain Connectivity Toolbox⁸⁷.

Global associations of parameters with graphical and morphological measures

In Fig. 4 and Supplementary Table 3, we present group level correlational analysis between η, γ and observed global graph theory and cortical morphology measures. In each case, the observed connectome and morphological measures were averaged across the whole cortex.

PLS analysis

We used PLS regression to address two distinct aspects of the study. First, we used PLS to determine the latent components of the wiring equation and connectome features which best explain cognitive task performance (Fig. 5a). The p_cov and p_corr significance values of each component were determined by permuting the cognitive data 10,000 times and comparing the observed covariance (p_cov) and coefficient of determinations (p_corr) relative to their null distributions. Figure 5b, c shows the correlation of predictor and response scores and response and predictor loadings of the significant PLS1 component (p_cov = 0.009 and p_cov = 0.049 in terms of covariance explained and p_corr = 7 × 10⁻⁴ and p_corr = 4 × 10⁻⁴ in terms of correlation coefficient) for each analysis, respectively. Second, we used PLS to identify the linear combinations of genes that best predicted average nodal costs and values each subject’s optimal simulation (as outlined previously). For this analysis, subject 162 was removed from as they were the only subject to have an optimally performing γ that was positive, which biased results due to being an outlier (as parametrized values are calculated as (K_i,j)^γ), leaving a sample of N = 269. For each of the N = 269 subjects, two PLS analyses were performed, providing 538 separate PLS analyses. We performed permutations of the correlations between average scores and costs/values in the same way as previously to determine significance of the PLS modeling across the sample. To assess the significance of each gene in terms of its loading, we ran N = 1000 permutations of the response variable for each PLS. This allowed us to compute a gene loading p_corr for each component of the PLS which was collapsed across subjects (as visualized in Supplementary Fig. 8b) for gene enrichment analysis (see “Gene enrichment analysis and visualization”).

Variability in the decomposed wiring equation

To determine where variability arises in the growth of the networks, we decomposed the wiring equation for each subject. This was achieved by first running the optimal wiring equation for each subject and taking their cost (a static Euclidean distance matrix), matching and wiring probability matrices at each step in the network growth model. For each subject, we took all edges that existed within the simulation and computed their mean and SD (Fig. 6c–e) and then determined their CV (Fig. 6f shows their distributions). To then explore within-connectome variability, we performed the same analysis but collapsing across subjects to determine how nodes (summed rows of the matrix) and edges (elements of the matrix) vary (Fig. 6g).

Gene enrichment analysis and visualization

We next aimed to elucidate the BPs and CCs for which our gene lists converged on. A BP is defined as representing a specific objective that the organism is genetically programmed to achieve. A BP is accomplished by a particular set of molecular functions carried out by specific gene products (or macromolecular complexes), often in a highly regulated manner and in a particular temporal sequence (https://www.ebi.ac.uk/QuickGO/term/GO:0008150) On the other hand, a CC is defined as a location, relative to cellular compartments and structures, occupied by a macromolecular machine when it carries out a molecular function. There are two ways in which the GO describes locations of gene products: (1) relative to cellular structures (e.g., cytoplasmic side of plasma membrane) or compartments (e.g., mitochondrion), and (2) the stable macromolecular complexes of which they are parts (e.g., the ribosome) (https://www.ebi.ac.uk/QuickGO/term/GO:0005575).

To elucidate BPs and CCs across the sample, genes with a p_corr < 0.05 following permutation testing on each component were deemed significant. This provided an individual-level vector of genes that were significant for an individual for each of the nodal costs and nodal values PLS1. To collapse across subjects, genes were then ordered according to their frequency in being significantly associated with connectome growth across subjects for that component. The list stopped when genes were significant for <10% of the sample. For each subject, PLS1 provided an average of 581.5 significant genes (SD 101.4) for nodal costs and 437.6 significant genes (SD 167.4) for nodal values (Supplementary Fig. 8a). When collapsed across subjects as described, the nodal costs PLS1 had 1427 genes and the nodal values PLS1 had 1584 genes ordered in terms of importance, which were then submitted to a pathway enrichment analysis.

For all information as to the enrichment and visualization pipeline, please refer to Reimand et al.⁴³. In short, GO annotations are the most commonly used resource for pathway enrichment analysis. g:Profiler⁴⁴ (https://biit.cs.ut.ee/gprofiler/gost) searches a collection of gene sets representing GO terms and, in the ordered test, repeats a modified Fisher’s exact test on incrementally larger sub-lists of the input genes and reports the sub-list with the strongest enrichment. Multiple-test correction is applied to produce an adjusted p value (p_adj)^43,44 (as visualized in Supplementary Fig. 8c, d, which can be accessed via the links presented in Supplementary Table 4). To visualize enriched pathways, we used “EnrichmentMap” within Cytoscape v3.8.0 (http://www.cytoscape.org)^43,96. All default parameters were used. Pathways are shown as nodes (representing enriched BPs) that are connected by edges if the pathways share genes. Nodes are colored by their p_adj and edges are sized on the basis of the number of genes shared by the connected pathways. To then identify clusters of themes, AutoAnnotate v1.3.3 was used before manually curating the suggested theme names to accurately reflect all pathways within each theme.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets supporting the current study have not been deposited in a public repository because of restrictions imposed by NHS ethical approval, but are available from the corresponding author on request. Requests for access can be made by research-based institutions for academic purposes. A response can be expected within 1 week. Unidentifiable simulated data can be found at https://osf.io/h9px4/?view_only=984260dcff444b59819961ece9c724ec.

Code availability

Results were generated using code written in R, Python and MATLAB. All code is available at https://github.com/DanAkarca/generativenetworkmodel⁹⁷.

References

Basser, P. J., Pajevic, S., Pierpaoli, C., Duda, J. & Aldroubi, A. In vivo fiber tractography using DT-MRI data. Magn. Reson. Med. 44, 625–632 (2000).
Article CAS PubMed Google Scholar
Conturom, T. E. et al. Tracking neuronal fiber pathways in the living human brain. Proc. Natl Acad. Sci. USA 96, 10422–10427 (1999).
Article ADS Google Scholar
Mori, S., Crain, B. J., Chacko, V. P. & van Zijl, P. C. Three-dimensional tracking of axonal projections in the brain by magnetic resonance imaging. Ann. Neurol. 45, 265–269 (1999).
Article CAS PubMed Google Scholar
Friston, K. Functional integration and inference in the brain. Prog. Neurobiol. 68, 113–143 (2002).
Article PubMed Google Scholar
Mahon, B. Z. & Cantlon, J. F. The specialization of function: cognitive and neural perspectives. Cogn. Neuropsychol. 28, 147–155 (2011).
Article PubMed PubMed Central Google Scholar
Chai, X. J. et al. Intrinsic functional connectivity in the adult brain and success in second-language learning. J. Neurosci. 36, 755–761 (2016).
Article CAS PubMed PubMed Central Google Scholar
Fiske, A. & Holmboe, K. Neural substates of early executive function development. Dev. Rev. 52, 42–62 (2019).
Article PubMed PubMed Central Google Scholar
Assem, M., Glasser, M. F., Van Essen, D. C. & Duncan, J. A domain-general cognitive core defined in multimodally parcellated human cortex. Cereb. Cortex. 30, 4361–4380 (2020).
Article PubMed PubMed Central Google Scholar
Astle, D. E., Bathelt, J., CALM team & Holmes, J. Remapping the cognitive and neural profiles of children who struggle at school. Developmental. Sci. 22, e12747 (2019).
Article Google Scholar
Bathelt, J. et al. Data-driven subtyping of executive function–related behavioral problems in children. J. Am. Acad. Child Adolesc. Psychiatry 57, 252–262 (2018).
Article PubMed PubMed Central Google Scholar
Bathelt, J., Scerif, G., Nobre, A. C. & Astle, D. E. Whole-brain white matter organization. Intell. Educ. Attain. Trends Neurosci. Educ. 15, 38–47 (2019).
Article CAS Google Scholar
Bathelt, J., Gathercole, S. E., Butterfield, S., CALM team, & Astle, D. E. Children’s academic attainment is linked to the global organization of the white matter connectome. Developmental Sci. 21, e12662 (2018).
Siugzdaite, R., Bathelt, J., Holmes, J. & Astle, D. E. Transdiagnostic brain mapping in developmental disorders. Curr. Biol. 30, 1245–1257 (2020).
Article CAS PubMed PubMed Central Google Scholar
Griffiths, K. et al. Altered gray matter organization in children and adolescents with ADHD: a structural covariance connectome study. Transl. Psychiatry 6, e947 (2016).
Article CAS PubMed PubMed Central Google Scholar
Roine, U. et al. Abnormal wiring of the connectome in adults with high-functioning autism spectrum disorder. Mol. Autism 6, 65 (2015).
Article PubMed PubMed Central CAS Google Scholar
Hong, S. et al. Atypical functional connectome hierarchy in autism. Nat. Commun. 10, 1022 (2019).
Article ADS PubMed PubMed Central CAS Google Scholar
Vydrova, R. et al. Structural alterations of the language connectome in children with specific language impairment. Brain. Lang. 151, 35–41 (2015).
Article PubMed Google Scholar
Castellanos, X. F., Sonuga-Barke, E. J. S., Milham, M. P. & Tannock, R. Characterizing cognition in ADHD: beyond executive dysfunction. Trends Cogn. Sci. 10, 117–123 (2006).
Article PubMed Google Scholar
Heyes, C. M. Where do mirror neurons come from? Neurosci. Biobehav. Rev. 34, 575–583 (2010).
Article PubMed Google Scholar
Vivanti, G. & Rogers, S. J. Autism and the mirror neuron system: Insights from learning and teaching. Philos. Trans. R. Soc. B. 369, 20130184 (2014).
Rogers, S. J. & Williams, J. H. G. Imitation and the social mind: autism and typical development. J. Can. Acad. Child. Adolesc. Psychiatry 17, 91–93 (2006).
Google Scholar
Krishnan, S., Watkins, K. E. & Bishop, D. V. M. Neurobiological basis of language learning difficulties. Trends Cogn. Sci. 20, 701–714 (2016).
Article PubMed PubMed Central Google Scholar
Bullmore, E. & Sporns, O. The economy of brain network organization. Nat. Rev. Neurosci. 13, 336–349 (2012).
Article CAS PubMed Google Scholar
Vértes, P. E. et al. Simple models of human brain functional networks. Proc. Natl Acad. Sci. USA 109, 5868–5873 (2012).
Article ADS PubMed PubMed Central Google Scholar
Betzel, R. F. et al. Generative models of the human connectome. Neuroimage 124, 1054–1064 (2016).
Article PubMed Google Scholar
Kumar, R., Novak. J. & Tomkins, A. Structure and evolution of online social networks. In: Proc. of SIGKDD. ACM, New York. 611–617 (2006).
Borgatti, S. P., Mehra, A., Brass, D. J. & Labianca, G. Network analysis in the social sciences. Science 323, 892–895 (2009).
Article ADS CAS PubMed Google Scholar
Bell, M. et al. Network growth models: a behavioural basis for attachment proportional to fitness. Sci. Rep. 7, 42431 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Barabási, A. L. & Albert, R. Emergence of scaling in random networks. Science 286, 509–512 (1999).
Article ADS MathSciNet PubMed MATH Google Scholar
Watts, D. & Strogatz, S. Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998).
Article ADS CAS PubMed MATH Google Scholar
Albert, R. & Barabási, A. L. Statistical mechanics of complex networks. Rev. Mod. Phys. 74, 47 (2002).
Article ADS MathSciNet MATH Google Scholar
van den Heuvel, M. P. et al. Abnormal rich club organization and functional brain dynamics in schizophrenia. JAMA Psychiat. 70, 783–792 (2013).
Article Google Scholar
Gollo, L. L. et al. Fragility and volatility of structural hubs in the human connectome. Nat. Neurosci. 21, 1107–1116 (2018).
Article CAS PubMed Google Scholar
Bohlken, M. M. et al. Heritability of structural brain network topology: a DTI study of 156 twins. Hum. Brain. Mapp. 35, 5295–5305 (2014).
Article ADS PubMed PubMed Central Google Scholar
Holmes, J., Bryant, A., Gathercole, S. E. & CALM Team. Protocol for a transdiagnostic study of children with problems of attention, learning and memory (CALM). Bmc. Pediatr. 19, 10 (2019).
Article PubMed PubMed Central Google Scholar
Zhang, X. et al. Generative network models of altered structural brain connectivity in schizophrenia. Neuroimage 225, 117510 (2021).
Article PubMed Google Scholar
Johnson, A. et al. Far and wide: associations between childhood socio-economic status and brain connectomics. Dev. Cogn. Neurosci. 48, 100888 (2021).
Article PubMed Google Scholar
Betzel, R. F. & Bassett, D. S. Generative models for network neuroscience: prospects and promise. J. R. Soc. Interface 14, 20170623 (2017).
Rubinov, M. Constraints and spandrels of interareal connectomes. Nat. Commun. 7, 13812 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Abdi, H. Partial least squares regression and projection on latent structure regression (PLS Regression). Wiley Interdiscip. Rev. Comput. Stat. 2, 97–106 (2010).
Article Google Scholar
Hawrylycz, M. J. et al. An anatomically comprehensive atlas of the adult human brain transcriptome. Nature 489, 391–399 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Arnatkevičiūtė, A., Fulcher, B. D. & Fornito, A. A practical guide for lining brain-wide gene expression and neuroimaging data. Neuroimage 189, 353–367 (2019).
Article PubMed Google Scholar
Reimand, J. et al. Pathway enrichment analysis and visualization of omics data using g:Profiler, GSEA, Cytoscape and EnrichmentMap. Nat. Protoc. 14, 482–517 (2019).
Article CAS PubMed PubMed Central Google Scholar
Reimand, J., Kull, M., Peterson, H., Hansen, J. & Vilo, J. g:Profiler—a web-based toolset for functional profiling of gene lists from large-scale experiments. Nucleic Acids Res. 35, W193–W200 (2007).
Article PubMed PubMed Central Google Scholar
Johnson, M. H. Interactive specialization: a domain-general framework for human functional brain development? Dev. Cogn. Neurosci. 1, 7–21 (2011).
Article PubMed Google Scholar
Damicelli, F., Hilgetag, C. C., Hütt, M. T. & Messe, A. Topological reinforcement as a principle of modularity emergence in brain networks. Netw. Neurosci. 3, 589–605 (2019).
Article PubMed PubMed Central Google Scholar
Betzel, R. F. et al. The modular organization of human anatomical brain networks: accounting for the cost of wiring. Netw. Neurosci. 1, 42–68 (2017).
Article PubMed PubMed Central Google Scholar
Avena-Koenigsberger, A. et al. A spectrum of routing strategies for brain networks. PLoS. Comput. Biol. 15, E1006833 (2019).
Article PubMed PubMed Central CAS Google Scholar
Benzi, M. & Klymko, C. Total communicability as a centrality measure. J. Complex Netw. 1, 124–149 (2013).
Article Google Scholar
Goulas, A., Betzel, R. F. & Hilgetag, C. Spatiotemporal ontogeny of brain wiring. Sci. Adv. 5, eaav9694 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Schroeter, M. S., Charlesworth, P., Kitzbichler, M. G., Paulsen, O. & Bullmore, E. T. Emergence of rich-club topology and coordinated dynamics in development of hippocampal functional networks in vitro. J. Neurosci. 35, 5459–5470 (2015).
Article CAS PubMed PubMed Central Google Scholar
Nicosia, V., Vértes, P. E., Schafer, W. R., Latora, V. & Bullmore, E. T. Phase transition in the economically modeled growth of a cellular nervous system. Proc. Natl Acad. Sci. USA 110, 7880–7885 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
Article ADS CAS PubMed Central Google Scholar
Grove, J. et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444 (2019).
Article CAS PubMed PubMed Central Google Scholar
Davies, G. et al. Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function. Nat. Commun. 9, 2098 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Arnatkevičiūtė, A. et al. Genetic influences on hub connectivity of the human connectome. Preprint at https://www.biorxiv.org/content/10.1101/2020.06.21.163915v1 (2019).
Vértes, P. E. et al. Gene transcription profiles associated with inter-modular hubs and connection distance in human functional magnetic resonance imaging networks. Philos. Trans. R. Soc. B. 371, 20150362 (2016).
Bathelt, J., Barnes, J., Raymond, F. L., Baker, K. & Astle, D. Global and local connectivity differences converge with gene expression in a neurodevelopmental disorder of known genetic origin. Cereb. Cortex. 27, 3806–3817 (2017).
Article PubMed Google Scholar
Hawkins, E. et al. Functional network dynamics in a neurodevelopmental disorder of known genetic origin. Hum. Brain Mapp. 41, 530–544 (2020).
Article PubMed Google Scholar
Hansen, J. Y. et al. Molecular signatures of cognition and affect. Preprint at https://www.biorxiv.org/content/10.1101/2020.07.16.203026v1 (2020).
Luppi, A. I. et al. A synergistic core for human brain evolution and cognition. Preprint at https://www.biorxiv.org/content/10.1101/2020.09.22.308981v1 (2020).
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Article CAS PubMed PubMed Central Google Scholar
Astle, D. E. & Fletcher-Watson, S. Beyond the core-deficit hypothesis in developmental disorders. Curr. Dir. Psychol. Sci. 29, 431–437 (2020).
Article PubMed PubMed Central Google Scholar
Casey, B. J. et al. The adolescent brain cognitive development (ABCD) study: imaging acquisition across 21 sites. Dev. Cogn. Neurosci. 32, 43–54 (2018).
Article CAS PubMed PubMed Central Google Scholar
Karcher, N. R. & Barch, D. M. The ABCD study: understanding the development of risk for mental and physical health outcomes. Neuropsychopharmacol 46, 131–142 (2021).
Article Google Scholar
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Tymofiyeva, O., Hess, C. P., Xu, D. & Barkovich, A. J. Structural MRI connectome in development: challenges of the changing brain. Br. J. Radiol. 87, 20140086 (2014).
Vértes, P. E. & Bullmore, E. T. Annual research review: growth connectomics—the organization and reorganization of brain networks during normal and abnormal development. J. Child Psychol. Psychiatry 56, 299–320 (2015).
Article PubMed Google Scholar
Fulcher, B. D., Arnatkevičiūtė, A. & Fornito, A. Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data. Nat. Commun. 12, 1–13 (2021).
Article CAS Google Scholar
Bathelt, J., Astle, D., Barnes, J., Raymond, F. L. & Baker, K. Structural brain abnormalities in a single gene disorder associated with epilepsy, language impairment and intellectual disability. Neuroimage. Clin. 12, 655–665 (2016).
Article PubMed PubMed Central Google Scholar
Tamnes, C. K. et al. Development of the cerebral cortex across adolescence: a multisample study of inter-related longitudinal changes in cortical volume, surface area, and thickness. J. Neurosci. 37, 3402–3412 (2017).
Article CAS PubMed PubMed Central Google Scholar
Martin, R. E. et al. Longitudinal changes in brain structures related to appetitive reactivity and regulation across development. Dev. Cogn. Neurosci. 38, 100675 (2019).
Article PubMed PubMed Central Google Scholar
Bathelt, J., Scerif, G., Nobre, A. C. & Astle, D. E. Whole-brain white matter organization, intelligence, and educational attainment. Trends Neurosci. Educ. 15, 38–47 (2019).
Article CAS PubMed PubMed Central Google Scholar
Alexander, B. et al. Desikan-Killiany-Tourville Atlas compatible version of M-CRIB neonatal parcellated whole brain atlas: the M-CRIB 2.0. Front. Neurosci. 13, 34 (2019).
Article PubMed PubMed Central Google Scholar
Laurent, J. S. et al. Associations among body mass index, cortical thickness, and executive function in children. JAMA Pediatr. 174, 170–177 (2020).
Article PubMed Google Scholar
Eickhoff, S. B., Yeo, B. T. T. & Genon, S. Imaging-based parcellations of the human brain. Nat. Rev. Neurosci. 19, 672–686 (2018).
Article CAS PubMed Google Scholar
Ye Tian, Y., Margulies, D. S., Breakspear, M. & Zalesky, A. Topographic organization of the human subcortex unveiled with functional connectivity gradients. Nat. Neurosci. 23, 1421–1432 (2020).
Article PubMed CAS Google Scholar
Schiavi, S. et al. A new method for accurate in vivo mapping of human brain connections using microstructural and anatomical information. Sci. Adv. 6, eaba8245 (2020).
Russell, G., Rodgers, L. R., Ukoumunne, O. C. & Ford, T. Prevalence of parent-reported ASD and ADHD in the UK: findings from the Millennium Cohort Study. J. Autism Dev. Disord. 44, 31–40 (2014).
Article PubMed Google Scholar
Manjón, J. V. et al. Multicomponent MR image denoising. Int. J. Biomed. Imag. 2009, 756897 (2009).
Garyfallidis, E. et al. Dipy, a library for the analysis of diffusion MRI data. Front. Neuroinform. 8, 8 (2014).
Article PubMed PubMed Central Google Scholar
Avants, B. B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage 54, 2033–2044 (2011).
Article PubMed Google Scholar
Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980 (2006).
Article PubMed Google Scholar
Dale, A. M., Fischl, B. & Sereno, M. I. Cortical surface-based analysis: I. Segmentation and surface reconstruction. Neuroimage 9, 179–194 (1999).
Article CAS PubMed Google Scholar
Fischl, B. & Dale, A. M. Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proc. Natl Acad. Sci. USA 97, 11050–11055 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Fischl, B. et al. Whole brain segmentation: automated labeling of neuroanatomical structures in the human brain. Neuron 33, 341–355 (2002).
Article CAS PubMed Google Scholar
Rubinov, M. & Sporns, O. Complex network measures of brain connectivity: Uses and interpretations. Neuroimage 52, 1059–1069 (2010).
Article PubMed Google Scholar
Wechsler, D. Wechsler Abbreviated Scale of Intelligence 2nd edn (Pearson Assessment, 2011).
Dunn, L. M., & Dunn, D. M. Peabody Picture Vocabulary Test (Pearson Education, 2007).
Frederickson, N., Frith, U. & Reason, R. Phonological Assessment Battery (Manual and Test Materials, 1997).
Alloway, T. Automated Working Memory Assessment (AWMA) (Pearson Assessment, 2007).
Alloway, T. P., Gathercole, S. E., Kirkwood, H. & Elliott, J. Evaluating the validity of the Automated Working Memory Assessment. Educ. Psychol. 28, 725–734 (2008).
Article Google Scholar
Wechsler, D. Wechsler Individual Achievement Test 2nd (UK edn) (Pearson Assessment, 2005).
Wechsler, D. Wechsler Objective Numerical Dimensions (Psychology Corporation, 1996).
Woodcock, R. W., Mather, N., McGrew, K. S. & Wendling, B. J. Woodcock-Johnson III Tests of Cognitive Abilities (Riverside Publishing Company, 2001).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Article CAS PubMed PubMed Central Google Scholar
Akarca, D. et al. A generative network model of neurodevelopmental diversity in structural brain organization, generativenetworkmodel v1.0, https://doi.org/10.5281/zenodo.4762612 (2021).

Download references

Acknowledgements

We thank the whole CALM team for their hard work and significant contribution. We also would like to thank the many professionals working in children’s services in the South-East and East of England for their support, and to the children and their families for giving up their time to visit the clinic. We are particularly grateful for J.B. for his help in guiding us through the construction of the streamline connectomes. We would also like to thank the radiographers who support the excellent pediatric scanning at the MRC Cognition and Brain Sciences Unit. D.A. is supported by the Medical Research Council Doctoral Training Programme and Cambridge Trust Vice Chancellor’s Award Scholarship. D.E.A. is supported by Medical Research Council Program Grant MC-A0606-5PQ41, respectively. Both D.E.A. and D.A. are supported by The James S. McDonnell Foundation Opportunity Award. P.E.V. is a fellow of MQ: Transforming Mental Health (MQF17_24) and of the Alan Turing Institute funded by EPSRC grant EP/N510129/1. E.T.B. is an NIHR Senior Investigator, supported by the NIHR Cambridge Biomedical Research Centre. The views expressed are those of the authors, and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care. All opinions expressed in this publication are those of the authors and do not necessarily reflect the views of the funding agencies.

Author information

Authors and Affiliations

MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
Danyal Akarca, Kate Baker, Susan E. Gathercole, Joni Holmes, Rogier A. Kievit, Tom Manly, Joe Bathelt, Marc Bennett, Giacomo Bignardi, Sarah Bishop, Erica Bottacin, Lara Bridge, Diandra Brkic, Annie Bryant, Sally Butterfield, Elizabeth M. Byrne, Gemma Crickmore, Edwin S. Dalmaijer, Fánchea Daly, Tina Emery, Laura Forde, Grace Franckel, Delia Fuhrmann, Andrew Gadie, Sara Gharooni, Jacalyn Guy, Erin Hawkins, Agnieszka Jaroslawska, Sara Joeghan, Amy Johnson, Jonathan Jones, Silvana Mareva, Elise Ng-Cordell, Sinead O’Brien, Cliodhna O’Leary, Joseph P. Rennie, Ivan Simpson-Kent, Roma Siugzdaite, Tess A. Smith, Stephani Uh, Maria Vedechkina, Francesca Woolgar, Natalia Zdorovtsova, Mengya Zhang & Duncan E. Astle
Department of Psychiatry, University of Cambridge, Cambridge, UK
Petra E. Vértes & Edward T. Bullmore
The Alan Turing Institute, London, UK
Petra E. Vértes
Department of Clinical Neurosciences, Wolfson Brain Imaging Centre, University of Cambridge, Cambridge, UK
Edward T. Bullmore

Authors

Danyal Akarca
View author publications
You can also search for this author in PubMed Google Scholar
Petra E. Vértes
View author publications
You can also search for this author in PubMed Google Scholar
Edward T. Bullmore
View author publications
You can also search for this author in PubMed Google Scholar
Duncan E. Astle
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the CALM team

Duncan E. Astle
, Kate Baker
, Susan E. Gathercole
, Joni Holmes
, Rogier A. Kievit
, Tom Manly
, Danyal Akarca
, Joe Bathelt
, Marc Bennett
, Giacomo Bignardi
, Sarah Bishop
, Erica Bottacin
, Lara Bridge
, Diandra Brkic
, Annie Bryant
, Sally Butterfield
, Elizabeth M. Byrne
, Gemma Crickmore
, Edwin S. Dalmaijer
, Fánchea Daly
, Tina Emery
, Laura Forde
, Grace Franckel
, Delia Fuhrmann
, Andrew Gadie
, Sara Gharooni
, Jacalyn Guy
, Erin Hawkins
, Agnieszka Jaroslawska
, Sara Joeghan
, Amy Johnson
, Jonathan Jones
, Silvana Mareva
, Elise Ng-Cordell
, Sinead O’Brien
, Cliodhna O’Leary
, Joseph P. Rennie
, Ivan Simpson-Kent
, Roma Siugzdaite
, Tess A. Smith
, Stephani Uh
, Maria Vedechkina
, Francesca Woolgar
, Natalia Zdorovtsova
& Mengya Zhang

Contributions

All authors conceived the model and wrote the manuscript. All simulations, analysis, and connectome construction were carried out by D.A.

Corresponding author

Correspondence to Danyal Akarca.

Ethics declarations

Competing interests

E.T.B. serves on the Scientific Advisory Board for Sosei Heptares and as a Consultant for GlaxoSmithKline. Other authors declare no competing financial or non-financial interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Akarca, D., Vértes, P.E., Bullmore, E.T. et al. A generative network model of neurodevelopmental diversity in structural brain organization. Nat Commun 12, 4216 (2021). https://doi.org/10.1038/s41467-021-24430-z

Download citation

Received: 26 November 2020
Accepted: 27 May 2021
Published: 09 July 2021
DOI: https://doi.org/10.1038/s41467-021-24430-z

This article is cited by

Assortative mixing in micro-architecturally annotated brain connectomes
- Vincent Bazinet
- Justine Y. Hansen
- Bratislav Misic
Nature Communications (2023)
Spatially embedded recurrent neural networks reveal widespread links between structural and functional neuroscience findings
- Jascha Achterberg
- Danyal Akarca
- Duncan E. Astle
Nature Machine Intelligence (2023)
Towards a biologically annotated brain connectome
- Vincent Bazinet
- Justine Y. Hansen
- Bratislav Misic
Nature Reviews Neuroscience (2023)
Functional orderly topography of brain networks associated with gene expression heterogeneity
- Wei Liu
- Ling-Li Zeng
- Dewen Hu
Communications Biology (2022)
Null models in network neuroscience
- František Váša
- Bratislav Mišić
Nature Reviews Neuroscience (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.