Deep learning identifies partially overlapping subnetworks in the human social brain

Kiesow, Hannah; Spreng, R. Nathan; Holmes, Avram J.; Chakravarty, M. Mallar; Marquand, Andre F.; Yeo, B. T. Thomas; Bzdok, Danilo

doi:10.1038/s42003-020-01559-z

Download PDF

Article
Open access
Published: 14 January 2021

Deep learning identifies partially overlapping subnetworks in the human social brain

Communications Biology volume 4, Article number: 65 (2021) Cite this article

9414 Accesses
9 Citations
40 Altmetric
Metrics details

Subjects

Abstract

Complex social interplay is a defining property of the human species. In social neuroscience, many experiments have sought to first define and then locate ‘perspective taking’, ‘empathy’, and other psychological concepts to specific brain circuits. Seldom, bottom-up studies were conducted to first identify explanatory patterns of brain variation, which are then related to psychological concepts; perhaps due to a lack of large population datasets. In this spirit, we performed a systematic de-construction of social brain morphology into its elementary building blocks, involving ~10,000 UK Biobank participants. We explored coherent representations of structural co-variation at population scale within a recent social brain atlas, by translating autoencoder neural networks from deep learning. The learned subnetworks revealed essential patterns of structural relationships between social brain regions, with the nucleus accumbens, medial prefrontal cortex, and temporoparietal junction embedded at the core. Some of the uncovered subnetworks contributed to predicting examined social traits in general, while other subnetworks helped predict specific facets of social functioning, such as the experience of social isolation. As a consequence of our population-level evidence, spatially overlapping subsystems of the social brain probably relate to interindividual differences in everyday social life.

White matter connectivity in brain networks supporting social and affective processing predicts real-world social network characteristics

Article Open access 03 October 2022

Neural signatures of social inferences predict the number of real-life social contacts and autism severity

Article Open access 20 July 2023

Using second-person neuroscience to elucidate the mechanisms of social interaction

Article 28 May 2019

Introduction

Social interaction is a central activity to the human species, which has enabled the construction of civilizations by collaboration across and between generations¹. This realization has led many investigators to adopt the social brain hypothesis^2,3. The perspective posits that dimensions of social complexity, like group size^4,5 or the capacity to anticipate other individuals’ ongoing thought⁶, have shaped the evolution of brain structure. To this end, the need to adapt to the increasing demands of social complexity and social challenges has likely played a relevant role in natural selection, thus influencing the course of primate brain evolution^2,3. The importance of social interaction for the human species also becomes apparent in its close relation to mental health. For instance, a lack of regular social interactions is known to escalate the risk for various major psychiatric disorders^7,8,9.

To interrogate the relationship between dimensions of everyday social experience may manifest themselves in the human brain, previous structural brain-imaging studies have established the close relationship between markers of social interaction frequency and quantity and grey matter structure in regions, such as the amygdala¹⁰, nucleus accumbens¹¹, and ventromedial prefrontal cortex^4,5. In addition, neuroscientists also commonly rely on carefully curated experimental tasks, which frequently endorse a select set of psychological constructs like ‘theory of mind’, ‘empathy’, or the ‘mirror neuron system’. These hypothesis-guided social, cognitive and affective neuroscience experiments have proven invaluable for localizing neural activity responses in controlled task environments. For instance, in moral decision making, experimental paradigms involving the trolley dilemma have been used to compare the neural correlates underlying emotional-affective processes against those involved in more rational abstract-perspective taking^12,13. Recent trends towards large-scale aggregation of social neuroscience experiments have opened the door to principled across-study integration by an arsenal of meta-analysis techniques. These new tools have allowed neuroscientists to identify the parts of the human brain that respond most consistently when participants are engaged in a diverse set of social-affective experiences^14,15,16.

Despite the merits of locating hots spots or convergence zones related to the social brain based on aggregate summaries from meta-analyses, the constituent regions may obscure distinct social-affective functional systems when collapsing separate studies into averages. In social neuroscience, a majority of previous brain-imaging studies implicitly assumed that a target region is sufficiently described by a single pattern of neural activity obtained through some subtraction analysis, which results in relative increase or decrease of neural response. This pervasive assumption may obfuscate distinct sources of biological variation – within trusted convergence zones – that factor into specific effects in a brain region across individuals. In other words, many previous social neuroscience studies have investigated non-overlapping neural correlates of a certain psychological construct. Instead, few such studies have assessed how different types of variation in a brain region may be implicated in the same neurocognitive process. In particular, much prior social neuroscientific work has been less sensitive to such mutually overlapping sources of variation in the wider population – perhaps due to the fact that population datasets have only emerged recently in imaging neuroscience. Moreover, adaptive social functioning relies on the dynamic coordination of a host of abilities, ranging from lower-sensory processing of social cues like faces to higher processing such as mental scene construction¹⁷.

Evidence from several previous large-scale neuroimaging studies has revealed converging and diverging sets of brain regions in the human brain. For example, using functional brain-imaging data from 1000 participants, Yeo et al.¹⁸ demonstrated that several regions of the higher association cortex, many of which are linked to social-affective processing, are involved in more than one canonical functional connectivity network. However, the authors also reported that several sensory and motor regions mostly participate in only one network. The study proposes that brain regions that participate in more than one brain network may act as hubs for information processing or communication with other brain networks or regions¹⁸. Similarly, based on analyses of task-related and task-free functional brain-imaging, Najafi et al.¹⁹ argued that a given brain region may belong to several subnetworks, or functionally cohesive sets of brain regions. By carefully quantifying the extent to which each brain region belongs to a certain subnetwork in both the resting and task-engaged brain, the authors made a strong case for wide-ranging degrees of overlap between functional brain systems¹⁹. This study concludes that investigating overlapping brain networks provides a richer source of information mapping between brain regions and their functions, which may not be observed when exclusively investigating disjoint brain systems. As such, several functional brain-imaging studies support the notion that the human brain is organized in a scaffold of intermixed brain region assemblies²⁰. Our study builds on this previous research by investigating brain patterns of co-variation and overlap in human social brain structure.

We adopted a data-driven stance on population neuroscience to dissect and describe separable neural systems of social brain regions that preferentially support social-affective processes. All of our analyses capitalized on the UK Biobank (UKB) – currently the largest, uniformly acquired human brain-imaging dataset in the world – to identify the hidden structural components within the social brain. We further characterized the derived social subnetworks by profiling their predictive role in several social lifestyle markers. Importantly, the overall analytical strategy departs from many previous approaches that assumed each specific brain region underpins a unique element of social functioning. This common a-priori assumption neglects the possible existence of subnetworks that may partially overlap with each other in topography and functional implications across the social brain elements^21,22,23. Moreover, many studies restricted themselves to charting patterns in the social brain by clustering or mixture modeling approaches that strictly assign each brain region to one emerging cluster only. These kinds of modeling approaches are also restricted in exploring new ways to determine the practical and empirical relevance of certain brain regions, such as by linking them to key characteristics of the daily social environment.

For these reasons, autoencoder neural network algorithms are a particularly promising avenue^24,25 to fully appreciate and explicitly model potentially complex variation across known brain locations that were previously shown to be closely related to the human social brain¹⁴. Specifically, we brought to bear autoencoder algorithms to extract the most important patterns of spatially overlapping representations in social brain structure. Using the resulting distributed brain representations, we could rebuild the social brain from elementary building blocks of interregional dependencies. In this way, we aimed to show that a single social brain region may have multiple assignments in a defined set of subnetworks. We could thus show the mixed memberships of social brain regions in communities with continuous degrees of overlap^18,19,20.

In this way, hidden subnetwork representations were directly learned from the brain-imaging data themselves by translating autoencoder network solutions from the deep learning community. This artificial neural network approach for pattern discovery inherently yielded empirical validity by gauging the achieved information compression from structural variation of the social brain. Thus, this methodological approach may prove useful by showing which parts of the social brain are most important at constructing the whole social brain structure from mutually overlapping brain representations. To probe the practical relevance of the candidate subnetworks identified in social brain structure, we tested their predictive value across a repertoire of diverse human social traits.

Results

Neural network algorithms learn coherent subnetworks from social brain variation

We distilled hidden subnetwork representations from structural variation across the social brain atlas in ~10,000 UK Biobank participants (Fig. 1; Table 1; Supplementary Fig. 1; and Supplementary Tables 1 and 2). This goal was achieved by charting several artificial neural networks that implement autoencoder variants. For a given algorithm architecture, the information compression performance was computed by invoking back-projection from each participant’s specific subnetwork embedding expressions to recover volume estimates for all 36 social brain atlas regions²⁴. That is, we computed the difference between the actual volumes of each social brain region as measured in each participant and the volumes reconstructed from the participant-wise hidden subnetwork expressions as a metric of parsimony of the derived candidate representations.

**Fig. 1: Schematic on how autoencoder neural networks learn to decompose the social brain into structural co-dependency patterns.**

Table 1 Social lifestyle markers.

Full size table

We have explored autoencoder neural networks that varied in key properties including the depth of the consecutive processing layers, intricacy of modeled intervariable relationships, and different regularization constraints on model parameter estimation (Table 2). Among the deep non-linear autoencoder architectures with Relu activation function, the six-layered autoencoder achieved an explained variance of 0.22 (SD < 0.1 across data splits), as measured by mean absolute error (MAE). The baseline autoencoder architecture with identity unit activation function and one latent processing layer and without regularization constraints was the simplest architecture, and also achieved an MAE of 0.22 (SD < 0.1 across data splits) (Fig. 2). Hence, the information compression performance of the baseline autoencoder in learning a parsimonious representation of the original social brain regions was not outperformed by other probed autoencoder architectures based on the explained variance (i.e., MAE) or stability (i.e., standard deviation over different data splits) (Fig. 2). As deeper non-linear neural network algorithms did not yield statistically defensible performance improvements on our structural brain data, we focused on baseline autoencoder neural networks with one latent layer for all subsequent analyses.

Table 2 Examined types of autoencoder neural networks.

Full size table

**Fig. 2: Model performance for different autoencoder neural networks from deep learning.**

By means of the baseline autoencoder neural networks with identity unit transformations (instead of Relu activation function), we examined the effect of different types of regularization constraints on compression performance. For the purpose of increasing sparsity, we encouraged exactly-zero parameter values during model estimation, corresponding to region relevances, by imposing l1 regularization (MAE = 0.64, SD = 0.02). To instead constrain the estimation of model parameters towards smaller absolute values, we imposed l2 regularization, which yielded better model performance (MAE = 0.35, SD = 0.01). Finally, constraining the network pattern discovery to discourage mutual correlation between the emerging subnetworks using a covariance penalty term yielded performance (MAE = 0.37, SD = 0.01), which ranked in-between that of l1 and l2 penalized neural network algorithms. Hence, imposing different types of regularization constraints on the baseline autoencoder did not outperform the information compression performance of the overall social brain morphology in unseen data, and, as we elaborate next, also led to similar solutions of hidden subnetwork representations.

In a series of similarity tests, we assessed the robustness of our candidate subnetwork solutions for the social brain. Within each of the subnetworks, we compared the relevance patterns of the social brain regions to corresponding hidden representations emerged from the other autoencoder variants with different regularization constraints. The robustness of all assigned region relevances to the derived hidden representations was suggested by subnetwork-wise Pearson’s correlations across the 36 region relevances that were learned by different autoencoder architectures (Supplementary Figs. 2 and 3A–C). The Pearson correlation coefficients averaged over four (Supplementary Fig. 3A–C) different autoencoder neural networks with identity function was ρ = 0.97 (SD = 0.05). In particular, the most deviant architecture among these autoencoders was with covariance penalty loss, which showed a mean Pearson correlation of ρ = 0.92 (SD = 0.03). Additionally, a similar Pearson correlation of ρ = 0.98 (SD = 0.01) was obtained when comparing a given type of identity-function autoencoder neural network obtained from the training set with the corresponding architecture learned on the independent test set (Supplementary Fig. 3D). Together, these confirmatory analyses ascertained that the region relevances derived by the baseline autoencoder were stable over several alternative neural network architectures. All subsequent analysis steps hence placed focus on the baseline autoencoder with an unconstrained parameter estimation (i.e., without penalty for parameter regularization).

After considering the achieved explained variance and confirming robustness of different types of baseline autoencoders, we directed attention to how much information each separate subnetwork carries about variation in the whole social brain structure. In this series of analyses based on the baseline autoencoder, subnetworks 7, 9, and 15 emerged as most dominant. An elbow-shaped pattern after the third top subnetwork (subnetwork 15) showed a drop in information compression performance (Fig. 3, left panel). Put differently, subnetworks 7, 9, and 15 were highlighted as the three top hidden subnetworks because these specific hidden representations showed the highest importance for encapsulating variation in the complete social brain from only a few hidden structural patterns. It is an important quality of this analytical approach that each social brain region can potentially contribute to multiple subnetworks. This property allowed the set of hidden subnetworks to model several spatially overlapping sources of population variation at the same time (Fig. 4).

**Fig. 3: Most explanatory hidden subnetwork representations learned by the autoencoder neural network.**

**Fig. 4: Top three hidden subnetworks that underpin structural dependence patterns in social brain differences.**

Consequently, the particular set of region volume effects in a specific subnetwork should be interpreted in light of the relevances inside of the other concomitant subnetworks (Supplementary Fig. 4), which compose the overall variation in the human social brain. In particular, the nucleus accumbens (NAC) contributed strongly to all three dominant subnetworks 7, 9, and 15 (Fig. 3, right panel). The bilateral MT/V5 yielded similar region relevances for subnetwork 7. Additionally, the right MT/V5 contributed strongly to subnetwork 9. Moreover, subnetwork 7 allocated region relevance to the temporo-parietal junction (TPJ), and frontal pole (FP), while subnetwork 9 also highlighted the relevance of the bilateral supramarginal gyrus (SMG). Furthermore, the bilateral SMG, bilateral TPJ and dorsomedial prefrontal cortex (dmPFC) also substantially contributed to subnetwork 15.

To functionally annotate the learned hidden subnetwork representations (cf. methods), we then performed a descriptive characterization in the context of the previously established functional clusters in the social brain atlas¹⁴. This previous study grouped the 36 constituent regions into four hierarchically differentiated functional circuits: the visual-sensory, intermediate, limbic, and higher-associative clusters. We computed the aggregated (absolute) relevances of all social brain regions inside of each previously defined cluster for the present social subnetwork representations (Figs. 4 and 5). For subnetwork 7, the highest aggregate relevances were found for the visual-sensory cluster (0.17 on z-scale, cf. methods) and limbic cluster (0.14, Fig. 5). For subnetwork 9, similar aggregate relevances were apparent across all four hierarchical clusters, however the visual-sensory (0.16) and intermediate clusters (0.16) yielded the highest identical aggregate relevances (Fig. 5). For subnetwork 15, the intermediate cluster yielded the highest aggregate relevance (0.16) followed by the higher-associative cluster (0.15).

**Fig. 5: Functional and hierarchical annotation of the learned hidden social subnetworks.**

To summarize the unsupervised analyses on subnetwork discovery, if we only had access to each participant’s volume expression from the three most dominant hidden social subnetworks, we could produce a reliable estimate of the complete social brain morphology across UKB participants. That is, the regions assigned with strongest volume effects in the three most dominant subnetworks are sufficient to explain a considerable amount of the interregional structural dependencies that combine to empirical measures of social brain variation.

The discovered subnetworks forecast diverse facets of everyday social life

In the supervised arm of our study, we finally sought understanding of the predictive profiles of the learned social subnetwork representations for relevant indicators of social lifestyle. For this purpose, we assessed each participants’ individual combination of subnetwork expressions as a basis for classifying social traits that have an impact on interindividual variation in everyday social interactions (Fig. 6). Across the examined social markers, our predictive pattern-learning algorithm distinguished more versus less sociality in males and females. Individuals who were socially less satisfied, had fewer social interactions or indicated a lower quality for a given social marker were assigned to the less social group. Instead, those more socially satisfied or with more opportunities for social interaction were assigned to the more social group. We used a four-class linear classification approach, where each fitted instance of the pattern-learning algorithm predicted one group (e.g., social females) against the three remaining groups (e.g., non-social female, and social or non-social male) given the participant-specific embeddings of subnetwork expressions.

**Fig. 6: Overall predictive role of the hidden subnetwork representations for tracking more versus less social exchange.**

We then tested whether a non-linear classification algorithm could outperform our simpler linear classifier (cf. methods) by leveraging potentially exceedingly complicated patterns in social brain variation at population scale. To this end, we used random forest algorithms as a higher capacity estimator to assess the out-of-sample prediction performance of participants’ social traits based on the participants’ subnetwork expressions. Virtually identical prediction accuracies in new participants were observed for both logistic-loss classifier (classification accuracy = 0.29, SD = 0.02 across data splits) and elaborate random forest classifier (classification accuracy = 0.30, SD = 0.03). Note that both classes of predictive algorithms performed better than the chance level of 0.25 in this four-class scenario. However, given the similarity in out-of-sample performance, its overlapping dispersion, and our goal of direct interpretability of most discriminatory social subnetworks, we embraced the simpler logistic-loss classifier for all subsequent analyses.

Across all examined social traits (Fig. 6), interindividual variation in hidden subnetwork 1 (characterized by high relevance of fusiform gyrus, frontal and dorsomedial prefrontal cortex, and posterior mid-cingulate cortex, cf. Supplementary Fig. 4) was particularly informative for detecting differences in regular social experience (predictive model weight w₁ = 0.04, SD = 0.02 across social traits). Instead, across-participant variation in subnetwork 10 (high relevance of anterior insula, rostral anterior cingulate cortex, and supramarginal gyrus) appeared especially tuned to sex differences based on its predictive contribution to the classifier (w₁₀ = 0.04, SD = 0.02), rather than showing salient trait-specific patterns (Supplementary Fig. 5). In line with our study goal, we therefore focused attention on the hidden subnetworks with predictive roles for different social markers. These were the trait-discriminatory social subnetworks 3 (w₃ = 0.03, SD = 0.01), subnetwork 4 (w₄ = 0.03, SD = 0.02), subnetwork 13 (w₁₃ = 0.02, SD = 0.02), subnetwork 15 (w₁₅ = 0.02, SD = 0.01), subnetwork 11 (w₁₁ = 0.02, SD = 0.01), subnetwork 2 (w₂ = 0.02, SD = 0.01), and subnetwork 7 (w₇ = 0.02, SD = 0.01). Thus, volume variation of these hidden subnetworks was the most useful for accurately predicting interindividual differences in social exchange.

In addition to predicting participants’ overall degree of sociality, we next zoomed in on the hidden subnetworks that were able to best predict specific markers of social life (Fig. 7). To tell apart whether participants were lonely or not lonely, interindividual variation in hidden subnetwork 13 (high relevance of right temporo-parietal junction and left posterior superior temporal sulcus) emerged as most useful (e.g., lonely men: w = −0.09, SD = 0.01, more surrounded men: w = 0.00, SD = 0.01 across data splits), in addition to that of subnetwork 3 (high relevance of right supramarginal gyrus, left temporo-parietal junction, and bilateral posterior superior temporal sulcus) and subnetwork 1 (cf. above). For discriminating participants living alone from participants with richer social interaction at home, subnetworks 10 (cf. above) and 4 (high relevance of left temporo-parietal junction, bilateral temporal pole, bilateral cerebellum) yielded the relatively highest predictive role (e.g., subnetwork 4: women living alone: w = −0.05, SD = 0.01, women living with others: w = −0.01, SD = 0.01). Subnetworks 1 (cf. above) and 10 achieved the highest predictive roles for differentiating the social brain morphology of participants with regular exchange with peers for social support (e.g., subnetwork 10: men without social support: w = −0.06, SD = 0.01, men with social support: w = −0.03, SD < 0.01). Both subnetworks 1 and 10 also showed individual predictive roles for high versus low self-reported satisfaction with friendship circles. For disentangling volume patterns in the social brains of participants with more versus less daily social interaction at work, salient predictive contributes were identified for subnetwork 10 and subnetwork 1 (e.g., women without a social job: w = −0.03, SD < 0.01, women with a social job: w = −0.06, SD < 0.01). Interindividual morphological variation in social brain structure for both subnetwork 1 and subnetwork 3 played the biggest predictive role for participants with more monogamous versus more promiscuous romantic relationships (e.g., subnetwork 1: women with one romantic partner: w = −0.05, SD = 0.01, women with more romantic partners: w = −0.01, SD = 0.01).

**Fig. 7: Specific predictive profile of each hidden subnetwork representation for tracking single social markers.**

In sum, all revealed hidden social subnetworks showed specific predictive roles comparing between the examined social markers. Notably, the hidden subnetworks 1 and 10 most frequently achieved among the highest predictive roles for specific individual social markers. As such, each source of population variation in social brain structure reliably tracked largely distinct dimensions of regular social interaction in the family, during leisure time and at work.

Discussion

We set out to uncover elementary building blocks that underpin social brain differences at the population level. The top three of the delineated network representations hidden in the social brain atlas distilled information from sources of population variation and effectively recapitulated the total social brain structure across individuals from the UK Biobank cohort. Specific social brain regions embedded within each subnetwork emerged as especially informative about cohesive dependencies that describe structural relationships across the entire social brain. As a common denominator across several extracted subnetworks, the NAC, TPJ, and medial PFC emerged as three of the core network regions in explaining configurations of mutual dependence in social brain morphology. We show how these separable brain representations can distinctly predict indicators of everyday social life, such as the subjective experience of loneliness. These signatures of cohesive interregional co-variation became apparent by algorithmically dis-assembling and re-assembling structural features of the social brain using autoencoder neural networks.

Many hypothesis-driven social neuroscience studies relied on a set of canonical cognitive concepts for their analysis and interpretation of neural effects. To flank these theory-guided top-down efforts, the present pattern-learning investigation translated algorithmic techniques from the deep learning community. We empowered pattern discovery in the social brain by autoencoder neural networks^24,25. This under-exploited algorithmic technique unlocked insight from uniformly acquired brain scans of the largest brain-imaging cohort recruited from across the United Kingdom.

In our study, the NAC emerged as one of the drivers in how social brain regions coherently co-vary with each other across thousands of participants, which became apparent in all leading social subnetworks. Traditionally recognized to be implicated in reward-guided decision-making processes, a host of social neuroscience research suggests that the NAC is also one of the core brain regions that are consistently recruited to also support rewarding aspects of social interaction²⁶. For instance, a functional brain-imaging experiment reported striatal activity in response to both receiving monetary rewards and receiving positive feedback about one’s own trustworthiness by unknown others²⁷. The authors suggested that social approval from others, such as feedback about one’s own personal reputation, may share a common neural basis with non-social rewards. In addition, the authors also reported medial prefrontal activity only during the social reward trials. This observation was taken to suggest that the mPFC may be specifically involved in the management of one’s own reputation²⁷. In line with our findings, the NAC and mPFC were together flagged as highly relevant in the dominant hidden subnetworks. Consistently, our dominant subnetworks also showed predictive roles for rewarding aspects of social interaction such as friendship satisfaction and having an occupation with frequent social contact.

Indeed, a functional neuro-imaging study assessed neural activity in response to simulating social interactions with friends versus celebrities in an approach-avoidance experiment²⁸. The study showed neural activity responses in the mPFC, NAC, TPJ, posteromedial cortex, SMG, and occipital-temporal junction extending into the MT/V5, specifically when participants interacted with their friends²⁸. The authors²⁸ interpreted that the encounter with close friends may encourage recruitment of interpersonal processes such as empathy, emotion-regulation and reward, all of which may contribute to mental health and positive well-being in the long run. Thus, these reports from functional brain-imaging experiments are in line with our data-led structural findings, and especially highlight the NAC, mPFC, SMG in co-occurring subnetwork representations that resulted from our social brain decomposition. This set of regions also emerged at the heart of meaningful structural inter-dependencies in our leading hidden subnetworks. Furthermore, our findings revealed these key regions to support prediction of interindividual differences in social lifestyle markers.

Neural activity responses in the NAC are not usually thought to encode differences in intentions of the interaction partner per se^26,29,30. Instead, perspective-taking processes are typically attributed to a set of higher-level cortical regions with prominent involvement of both the TPJ and mPFC. For instance, a previous structural brain-imaging study identified an association between the ability to read the mind of others through the eyes and grey matter volume in the mPFC, posteromedial cortex and TPJ³¹. The authors suggest that these social brain regions may contribute to processes necessary to subserve the ability for mental state inference by reading people’s eyes. We extend these previous findings by invigorating the special combined role of the mPFC, posteromedial cortex, and TPJ in brain circuits related to human social interaction from the present view on the social brain through the lens of subnetworks: The mPFC, posteromedial cortex, and TPJ here explained notable volume effects in the context of major sources of population variation in the social brain, especially in our leading subnetworks 15, 7, and 9. In addition, the TPJ was also highlighted as part of subnetwork 13, which we found to help predict loneliness in UK Biobank participants.

In addition to the TPJ as a region critical for realizing high-level social thoughts like perspective taking, a parallel line of research has instead emphasized the mPFC in many additional forms of social interaction^15,32,33. For instance, a series of brain-imaging studies have linked the relationship between several socially responsive regions including the mPFC and indices of interpersonal phenomena^5,6,11, which are reminiscent of our present findings on friendship satisfaction and social support. For instance, a previous structural brain-imaging study mapped grey matter volume in the mPFC, TPJ, and STS to intentionality ability and social network size, suggesting these brain regions to be key neuroanatomical correlates for social skills⁵. Our current results underscore such findings by showing these social brain nodes to represent major sources of population variation, with overlapping volume effects from several subnetworks. This became apparent in region relevances in the hidden subnetworks 5, 6, 8, and 14, as well as the dominant subnetworks 7 and 15. Additionally, the mPFC and other structurally coupled regions have also been found to be linked to anticipating social feedback. For instance, one functional brain-imaging study reported the mPFC, posteromedial cortex, visual association cortex extending into the MT/V5 and ventral striatum, encompassing the NAC, to show more activity when anticipating positive social feedback from novel peers³⁴. This observation suggests that in concert with the NAC and MT/V5 regions, the mPFC may also play a critical role in navigating salient social encounters³⁴. Thus, our investigation confirms and details the central position of the TPJ, mPFC as key drivers in co-occurring neural substrates that support neurocognitive facets central to social behavior.

Little existing data-driven evidence appears to simultaneously focus on the relevance of the mPFC and TPJ to social cognition, perhaps in part due to the location-by-location logic of most brain-imaging studies on cognitive tasks. As one of few exceptions, Schurz and colleagues conducted a coordinate-based meta analysis of various functional brain-imaging experiments using various psychological paradigms to probe perspective-taking, including social animations, reading the mind in other’s eyes, and trait judgment tasks¹⁵. The authors identified foci of meta-analytically derived hotspots of neural response averages that yielded activity convergences situated in the TPJ and mPFC regions. Different from our approach, Schurz et al.¹⁵ used pre-existing topographically distinct clusters based on structural connectivity from diffusion weighted brain imaging. Such clusters have strict topographical boundaries that are mutually exclusive, which conveys rigid a-priori assumptions about what to expect in brain-imaging data like MRI scans³⁵. Hence, many previous brain-imaging studies may have ignored possibly overlapping biological phenomena; and joint volume effects of a particular region volume on the complete social brain morphology. On the interpretational level, Schurz et al.¹⁵ suggested the mPFC to play a role in maintaining mental representations of another person’s social and emotional vantage point to create a model of another person’s mental life. Our present results allow re-contextualization and provide solid grounding for such localizationist interpretations in mutually overlapping subnetwork representations. These are shown here to vary in distinct ways at the population level and be differently associated with markers of social richness.

Compared to this previous study, a Bayesian latent factor meta-analysis is more closely aligned with our present analysis tactic. Yeo et al.²³ examined mutually overlapping components of neural activity with a topographical focus on the higher association cortex and its relation to a general battery of task responses. The study answered the question which of 83 different experimental paradigms, including the n-back test, Stroop test and anti-saccade tasks, exhibit concomitant neural activity changes according to the identified underlying spatially distributed neural activity components. This previous study singled out one functional activity component (component 10), which turned out to be preferentially linked to social cognition. This neural activity component isolated the mPFC, posteromedial cortex, the SMG, and TPJ, all of which were also highlighted in several extracted social brain subnetworks. We complement this previous investigation of general cognitive domains in the higher association cortex by showing coherent structural configurations from a data-driven decomposition of the whole social brain in a larger participant sample, which aims to closely represent the wider UK population.

More broadly, previous cross-modal brain-imaging research has shown that the regions belonging to the human social brain can be hierarchically grouped into (a) lower sensory, (b) limbic, (c) intermediate, and (d) higher-associative neural systems¹⁴. The described functional compartments were derived under the strict assumption that each social brain region is assigned to only a single group at once. To relax such discrete one-to-one attributions, our analyses explicitly quantified the continuous degrees to which a specific subset of social brain regions are relevant in explaining structural variation of multiple subnetworks. Such degrees of multi-to-multi responsibilities therefore allow for each subnetwork to allocate relevance to several of these neural circuits in the social brain. In addition to the TPJ and mPFC, other examples for such regions include the SMG, which has notable relevance in several of our subnetworks. Despite the prevalence of specific brain regions to be relevant in several subnetworks, other subnetworks allocated region relevances more evenly to different functional compartments. For instance, the previously established visual-sensory circuit of the social brain was here most associated with subnetworks 3, 12, and 13. The specificity of such functional annotations is illustrated by the observation that subnetworks 4 and 14 allocated relevance quite evenly between all subsystems. As such, we were not only able to show the prominence of single functional compartments in specific subnetworks, but also an overlap between these different clusters for some subnetworks.

A similar trend is observed in other functionally coherent assemblies of social brain regions, which are usually examined in disparate literature streams. For instance, the putative mirror neuron system is often thought of and studied as a cohesive neural system that includes regions like the IFG, SMG, SMA, pSTS, and MTV5¹⁴. We found that some of these regions (e.g., the SMA) showed population co-variation with other parts of the social brain. Furthermore, these regions did not always turn out to be similarly relevant in different subnetworks. For instance, subnetwork 6 featured the SMG and SMA as strong contributors together with the FP, a region which is not typically believed to be part of the canonical mirror neuron system. We thus provide evidence that widely assumed neurocognitive systems like the mirror neuron system may not prove robust to the totality of ways to study brain-imaging measurements.

As another core finding that ignites future research, our subnetworks 3 and 13 turned out to have predictive roles for interindividual differences in the experience of social isolation. The subjective feeling of loneliness has one of the greatest influences on some of our societies’ biggest public health concerns³⁶, in particular deep consequences for mental illness^7,8. However, few brain-imaging studies were so far dedicated to the brain basis of perceived social isolation, which we attribute to subnetwork 13, especially the right TPJ. As one rare exception, a structural brain-imaging study found volume variability in the right TPJ to be specifically linked to rich and thin online social networks¹⁰. Based on these findings, the authors interpreted the TPJ as a region that is especially sensitive to other people’s intentions. Additionally, TPJ volume decline was reported in participants who self-identified as lonely³⁷. These hints invite the speculation that scarcity of social interaction at home and in everyday life may reverberate in brain morphology in a way that can be quantitatively measured with common MRI scanners at the population scale.

Taken together, a few seminal studies have been dedicated to deploying clustering or latent factor methods in some areas of social neuroscience. Autoencoder neural networks now open the door to abstract away from clustering methods imposing strict boundaries or component discovery. In other words, at the population level, our pattern-learning technique allowed a single element of the social brain to structurally resonate with several different partner nodes. The thus extracted structural dependencies of population volume variation within our data were distinctly related to differences in social traits.

We have tailored autoencoder neural networks from deep learning to perform a data-guided de-construction of an established definition of the human social brain at population scale. Our fresh look into variation of structural organization suggests the existence of spatially overlapping motifs of co-dependence in these neural cicuits. The uncovered structural constellations of cohesive co-variation featured driving positions for the TPJ, NAC, and medial PFC. These nodes within distinct social subnetworks thus probably relate to multifaceted implementations that anchor human-defining cognitive feats, such as encoding and interrogating others’ mental states, forming social judgments, and estimating the expected value of anticipated encounters and events. Consistently, the hidden subnetwork representations, delineated by the autoencoder learning algorithms, revealed different sets of rich associations with indicators of the participants’ social capital. Many of these neurocognitive facets are traditionally studied in largely disconnected parts of the social neuroscience literature. Additionally, the revealed collection of hidden social subnetworks has potentially been overlooked by analytical approaches in widespread use. Our quantitative evidence strengthens the idea that hidden subnetworks with overlapping sources of population-level structural differentiation bring us closer to the primary biology of the social brain.

Methods

Human population data resource

The UK Biobank is a prospective epidemiological resource that provides rich information including brain-imaging, genetics, and multiple biological and lifestyle measurements. Our study focused on the brain-imaging data from the 10,000 participant UKB release (see Supplementary Table 1 for demographic information), since this sample was homogeneously recruited at the same assessment center. We used high-resolution T1-weighted structural magnetic resonance images (MRI), as these measurements can be used to capture whole-brain grey matter morphology³⁸. These brain scans were submitted to preprocessing and quality-control workflows from Alfaro-Almagro and colleagues, FMRIB, University of Oxford, UK³⁹. Use of this uniform preprocessing pipeline increases the comparability of our findings to other and future UKB studies. Moreover, we examined several key markers of social lifestyle (Table 1 and Supplementary Fig. 1). All participants provided written, informed consent and the study was approved by the Research Ethics Committee (REC number 11/NW/0382). Further information on the consent procedure can be found elsewhere (http://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=200).

Preprocessing of structural brain-imaging data

Structural MRI brain scans (T1-weighted 3D MPRAGE sequence at 1 mm isotropic resolution) were preprocessed using gradient distortion correction, field of view reduction using the Brain Extraction Tool⁴⁰ and FLIRT^41,42, as well as non-linear registration to MNI152 standard space at 1 mm resolution using FNIRT⁴³, all based on the FSL software suit (v6.0). To avoid unnecessary interpolation, all image transformations were estimated, combined, and applied by a single interpolation step. Tissue-type segmentation into cerebrospinal fluid, grey matter and white matter was applied using FAST (FMRIB’s Automated Segmentation Tool⁴⁴ to generate full bias-field-corrected images. SIENAX⁴⁵, in turn, was used to derive volumetric measures normalized for head sizes. The ensuing adjusted volume measurements represented the amount of grey matter corrected for individual brain sizes.

Social brain atlas definition

Our study built on a current best-estimate of social brain topography in humans, which only recently became available¹⁴. This topographical atlas of the human social brain was derived by a quantitative large-scale integration of functional MRI findings from 3972 task experiments involving thousands of individuals. In all, 36 regions of interest were thus previously identified (Supplementary Table 2). These 36 already-established locations were also reported to be connectionally and functionally segregated into four network clusters¹⁴, Fig. 4): (i) a visual-sensory cluster (fusiform gyrus, posterior superior temporal sulcus, MT/V5), (ii) a limbic cluster (amygdala, ventromedial prefrontal cortex, rostral anterior cingulate cortex, hippocampus, nucleus accumbens), (iii) an intermediate cluster (inferior frontal gyrus, anterior insula, anterior mid-cingulate cortex, cerebellum, supplementary motor area, supramarginal gyrus), and (iv) a higher-associative cortical cluster (dorsomedial prefrontal cortex, frontal pole, posterior mid-cingulate cortex, posterior cingulate cortex, precuneus, temporo-parietal junction, middle-temporal gyrus, temporal pole).

Our pattern-learning pipelines were thus anatomically guided by brain volume extraction for the 36 consensus brain regions of interest (each associated with one of the four previously established functional clusters in the social brain). In this way, neurobiologically interpretable measures of grey matter volume were obtained in previously established brain locations from the ~10,000 participant release of the UK Biobank^11,38,46. These values were obtained by summarizing whole-brain structural MRI maps based on the topographical compartments of the social brain. We applied a smoothing kernel of 5 mm FWHM to the participants’ structural brain maps to homogenize local neuroanatomical features⁴⁷. Grey matter volume information of each atlas region (cf. above) was averaged in spheres of 5 mm diameter around the consensus location from the previously established social brain atlas¹⁴, thus averaging the preprocessed, tissue-segmented, and brain-size-adjusted MRI signals (cf. above) across the voxels belonging to a given target region¹¹. This procedure yielded a single representative volume measure for each constituent element of our social brain atlas. Note that using spheres of 2.5 mm or 7.5 mm diameter yielded virtually identical results and led to the same conclusions.

This feature engineering approach yielded 36 neurobiologically meaningful volume measures for each UKB participant. Each of these social brain volumes was z-scored across participants by centering to zero mean and scaling the variance to one. These measures of regional brain volume in social brain networks served as the basis for all subsequent analysis steps. Full information on the social brain locations that provided the basis for this study are available online for transparency and reuse at the data-sharing platform NeuroVault (http://neurovault.org/collections/2462/).

Neural network algorithms to discover subnetworks hidden in social brain variation

To seize the opportunity to provide a richer picture of potential subnetworks underlying variation across the social brain atlas, we leveraged artificial autoencoder neural networks (Fig. 1). This family of deep learning algorithms can naturally extend to modeling architectures with multiple latent layers of consecutive non-linear processing^25,48. These algorithms were deployed to extract spatially distributed patterns dormant in the structural MRI data. The representation learning approach directly addressed the question of how the morphological variation across regions of the entire social brain can be re-expressed in a limited set of elementary network representations. This modeling goal was satisfied by imposing a projection of rich input data to a lower-rank bottleneck (Fig. 1) to automatically derive a useful compression of information from structural brain variation into a collection of atomic network patterns^24,25.

The encode-decode modeling scheme yielded one spatially distributed volumetric pattern for each extracted dimension in the bottleneck latent space (Fig. 1, blue nodes). Each of the derived volumetric representations encapsulated one hidden subnetwork that quantitatively delineated coherent interregional dependencies across the entire social brain atlas. As such, using one, or up to all extracted hidden subnetworks, the autoencoder could rebuild (an estimate of) the regional brain structure that constitutes the human social brain as best as possible. If successful, this modeling agenda can unlock evidence for the subnetworks’ empirically tested ability to parsimoniously recapitulate the brain information wedded into the entire social brain atlas. These artificial neural network algorithms provided an attractive solution for the goal of a comprehensive exploration of hidden sources of variation that collectively track structural variation in social brain atlas.

Autoencoder learning architectures can be automatically optimized to improve the fidelity of the constituent subnetworks that together, combine to the collapsed measures of social brain volumes that were actually captured using MRI. The optimization objective was based on the original participant volume expressions by means of searching through a vast space of candidate hidden subnetwork patterns to converge on an optimal representational solution. The model family is naturally scalable because these pattern-learning algorithms are well-known to abstract across several classical methods for dimensionality reduction^24,48. In line with the primary goal of our study, the elected modeling framework allowed for each location of the social brain atlas to exhibit a different relevance in different subnetworks. We hypothesized that spatially overlapping subnetworks are critical to making progress towards a faithful representation of brain compartments closely linked to social-affective processing capacities. Our study hence endorsed the assumptions that a single target region has a certain association strength with several distinct neurocognitive processes, which accommodates the possibility of mixed membership with continuous degrees of spatial overlap.

To guard against overfitting during model building, we carried out a rigorous cross-validation scheme^49,50. In five (outer) folds of data splitting, structural brain scans from 9,933 participants were randomly divided into a training set (total n = 4966, 2575 females, mean age = 55.41 years, SD = 7.54), and a test set (total n = 4967, 2629 females, mean age = 55.27 years, SD = 7.48). In 10 nested (inner) folds of random data splitting, we used 90% of the training set for model parameter estimation, while 10% of the training set were used for model hyperparameter tuning and model architecture selection. In particular, we charted several architectures of autoencoder neural networks (Table 2) on the volume data centered on the 36 social brain regions. To learn hidden network representations from structural brain scans by means of different autoencoder architectures, we used the RMSprop optimizer⁵¹ and a learning rate of 1e-3 based on a grid search of the hyperparameters (see Supplementary Table 3 for details). We probed autoencoder architectures that differed in the number of latent processing layers (i.e., 6, 4, 1), linear versus non-linear activation functions (identity function versus Relu operation at neuron processing units), tied versus non-tied weights and different penalty terms exerting regularization on the weight matrix of the latent layers inside of the layers (l1, l2 and cross-covariance regularization constraints⁵². The process of building hyperparameter-optimized instances of these different artificial neural networks was exclusively performed on the training set (cf. above). In a subsequent step, we evaluated the autoencoder-based information compression performance on unseen participants from the independent test set.

Prediction of social markers based on participants’ subnetwork expressions

We next examined the predictive role of the discovered social subnetworks for differences in social lifestyle based on their variation in our population sample. For this purpose, we tested the subnetwork generalizability for several markers of everyday social life (Table 1). For this supervised arm of our analysis workflow, we used the identical nested cross-validation procedure (cf. above). That is, inside each of the five outer folds, the particular training set participants were further subdivided into ten splits for the purpose of model selection and model hyperparameter tuning (cf. above). The estimated candidate models were compared against each other on the independent (inner) data splits. This approach allowed identification of the model instance with the best hyperparameter configurations, which was based on the highest achieved relative predictive performance. The built hyperparameter-optimized models were then assessed for their absolute predictive performance on never-seen participant data from the test set (outer loop). To obtain an accurate estimate of the expected prediction performance of the model, the fold-wise model performances were subsequently averaged (i.e., across five separate test accuracies) to a single cross-validated prediction performance, which we expect to hold in other independent or future datasets^50,53.

For the supervised prediction of social lifestyle traits, we charted two classes of predictive algorithms that are complementary in representational capacity and thus theoretically achievable prediction power⁵⁴. As a widely used classifier with linear capacity, we opted for Tikhonov-regularized regression with logistic loss function. The key hyperparameter of this pattern-learning classifier was the coefficient for the l2 penalty term. We set this regularization constraint via grid search, ranging from −3 to +3 in seven logarithmically spaced steps. As a commonly employed classifier with a considerably higher capacity to detect and exploit complex predictive patterns, we opted for random forest algorithms⁵⁵. For hyperparameter search, we tuned the maximum depth (2 or 6), the minimal split of samples (2 or 6), and the minimum samples of leaves (2 or 6). We noticed that fitting 100 decision trees showed saturation in prediction accuracy based on the out-of-bag estimates on training data from unseen UKB participants by a given decision tree^50,56. Our rationale was to test for the existence of exploitable non-linear effects in our brain imaging data for predicting social traits. This consideration informed our decision on whether to commit to a high-capacity predictive algorithm, or to resort to a linear predictive algorithm for our supervised characterization of the identified subnetworks.

We performed prediction of interindividual differences for a given social trait based on the autoencoder-derived latent factor projections (cf. above) of social brain volume measures. To ensure balanced groups, the UKB participants were split into more social versus less social lifestyles. Each examined social marker was ensured to have binary encoding (median-split as appropriate) into more social versus less social categories. Our approach also explicitly acknowledged the wide-ranging sex differentiation of social traits in the human brain that is receiving increasing empirical support from neuroimaging studies^11,57. As such, for the prediction goal, we further split the participants according to sex, which yielded four groups for classification: (1) more social males, (2) less social males, (3) more social females, and (4) less social females. Hence, for each particular index of social richness, our classifiers solved a four-class prediction problem. Moreover, the model accounted for age differences into the analysis pipeline by using participant age as an input source of interindividual variation in all predictive models. To enable comparable handling of the multi-class classification problem with both l2-penalized logistic loss and random forest estimators, we used both prediction algorithms in the widely used one-versus-rest scheme⁵⁰. By default, for each social trait, we examined the standard deviation across cross-validation splits for each hidden social network. This decision to report the standard deviation across cross-validation splits (instead of across participants) is based on longstanding practices in the machine learning literature⁵⁰. In doing so, we obtained parameter weights that indicated the predictive role or contribution for each latent autoencoder embedding of social brain morphology for successfully discriminating UKB participants who live in a more versus less rich social environment.

Replication analysis

To see if our unsupervised and supervised results generalize to independent data, we implemented the same data analysis pipeline (cf. above) in new, independent participant samples. We used the recently available 40,000 participant release from the UK Biobank (Data Access Application: 25163). For the replication analysis, the 40,000 participants were randomly divided into four data splits of 10,000 participants. In the unsupervised portion of the replication analysis, the autoencoder solutions from the original analysis were carried out again on each of the four data splits. The unsupervised results revealed a fairly good replication of the hidden subnetworks (Supplementary Table 4). Pearson correlations between the original analysis and the new replication analysis confirmed the robustness of our original results (Supplementary Table 4). As a next step, we carried out the same supervised analysis pipeline from the original discovery data set for the prediction of social traits in the new replication data splits. Our prediction results revealed good replication of estimates of the predictive models over all hidden subnetworks for the four data splits (Supplementary Table 5). Pearson correlations between the predictive model weights of the original discovery data set and the new replication data splits showed moderately good correlations. Thus, our replication analysis in a new independent sample showcases the stability of the derived hidden subnetworks as well as the prediction of the social traits (Supplementary Tables 4–7).

Statistics and reproducibility

All computations and visualizations were performed in the Python scientific computing engine. For the unsupervised arm of the analysis workflow, we used Keras (version 2.4.0)⁵⁸ to create and train the different types of deep autoencoder neural networks, while the predictive algorithms were used as implemented by state-of-the-art implementations in scikit-learn (version 0.21.3)⁵⁹. To shape and visualize the structural MRI data, we used nilearn (version 0.6.2)⁶⁰ and Pysurfer for 3D brain visualization (https://pysurfer.github.io/, version: 0.10.0). We created all additional figures with Seaborn (https://seaborn.pydata.org/, version: 0.11.0) and Bokeh (version 1.3.4)⁶¹.

The structural brain-imaging data used in this study were obtained from the UK Biobank and obtained under the Data Access Application 23827. The present study used the n = 10,000 participant release. All analyses conducted for the present study are reproducible and the scripts for our analysis pipelines can be found at (https://github.com/hannahkiesow/hidden_social_brain).

Furthermore, we implemented the same data analysis pipeline in new, independent participant samples (cf. Methods). Results from the replication analysis displays the robustness and reproducibility of our findings.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All used data are available to other investigators online (ukbiobank.ac.uk). The source data underlying the figures is provided as Supplementary Data 1.

Code availability

All analysis scripts that reproduce the results of the present study are readily accessible to and open for reuse by the reader: https://www.github.com/hannahkiesow/hidden_social_brain.

References

Tennie, C., Call, J. & Tomasello, M. Ratcheting up the ratchet: on the evolution of cumulative culture. Philos. Trans. R. Soc. Lond. B Biol. Sci. 364, 2405–2415 (2009).
Article PubMed PubMed Central Google Scholar
Byrne R. W., Whiten A. Machiavellian Intelligence: Social Expertise And The Evolution Of Intellect In Monkeys, Apes, And Humans (Clarendon Press, 1990).
Humphrey N. K. T. Growing Points In Ethology (Cambridge University Press, 1976).
Dunbar, R. & Shultz, S. Why are there so many explanations for primate brain evolution? Philos. Trans. R. Soc. B Biol. Sci. 372, 20160244 (2017).
Article Google Scholar
Lewis, P. A., Rezaie, R., Brown, R., Roberts, N. & Dunbar, R. I. Ventromedial prefrontal volume predicts understanding of others and social network size. Neuroimage 57, 1624–1629 (2011).
Article PubMed Google Scholar
Powell, J. L., Lewis, P. A., Dunbar, R. I., Garcia-Finana, M. & Roberts, N. Orbital prefrontal cortex volume correlates with social cognitive competence. Neuropsychologia 48, 3554–3562 (2010).
Article PubMed Google Scholar
Bzdok, D. & Dunbar, R. I. M. The neurobiology of social distance. Trends Cogn. Sci. 24, 717–733 (2020).
Article PubMed PubMed Central Google Scholar
Cacioppo, J. T. & Hawkley, L. C. Perceived social isolation and cognition. Trends Cogn. Sci. 13, 447–454 (2009).
Article PubMed PubMed Central Google Scholar
Tost, H. & Meyer-Lindenberg, A. Puzzling over schizophrenia: schizophrenia, social environment and the brain. Nat. Med. 18, 211–213 (2012).
Article CAS PubMed Google Scholar
Kanai, R., Bahrami, B., Roylance, R. & Rees, G. Online social network size is reflected in human brain structure. Proc. Biol. Sci. 279, 1327–1334 (2012).
CAS PubMed Google Scholar
Kiesow H., et al. 10,000 social brains: sex differentiation in human brain anatomy. Sci. Adv. 6, eaaz1170 (2020).
Bzdok D., Groß D., Eickhoff S. B. Handbook of Neuroethics Heildelberg Nova Iorque, Londres (Springer, 2015).
Sevinc, G. & Spreng, R. N. Contextual and perceptual brain processes underlying moral cognition: a quantitative meta-analysis of moral reasoning and moral emotions. PLoS ONE 9, e87427 (2014).
Article PubMed PubMed Central CAS Google Scholar
Alcala-Lopez, D. et al. Computing the social brain connectome across systems and states. Cereb. Cortex 28, 2207–2232 (2018).
Article PubMed Google Scholar
Schurz, M., Radua, J., Aichhorn, M., Richlan, F. & Perner, J. Fractionating theory of mind: a meta-analysis of functional brain imaging studies. Neurosci. Biobehav. Rev. 42, 9–34 (2014).
Article PubMed Google Scholar
Spreng, R. N., Mar, R. A. & Kim, A. S. The common neural basis of autobiographical memory, prospection, navigation, theory of mind, and the default mode: a quantitative meta-analysis. J. Cogn. Neurosci. 21, 489–510 (2009).
Article PubMed Google Scholar
Mesulam, M. M. From sensation to cognition. Brain 121, 1013–1052 (1998).
Article PubMed Google Scholar
Yeo, B. T., Krienen, F. M., Chee, M. W. & Buckner, R. L. Estimates of segregation and overlap of functional connectivity networks in the human cerebral cortex. Neuroimage 88, 212–227 (2014).
Article PubMed Google Scholar
Najafi, M., McMenamin, B. W., Simon, J. Z. & Pessoa, L. Overlapping communities reveal rich structure in large-scale brain networks during rest and task conditions. Neuroimage 135, 92–106 (2016).
Article PubMed Google Scholar
Palla, G., Derenyi, I., Farkas, I. & Vicsek, T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435, 814–818 (2005).
Article CAS PubMed Google Scholar
Shine, J. M. et al. Human cognition involves the dynamic integration of neural activity and neuromodulatory systems. Nat. Neurosci. 22, 289–296 (2019).
Article CAS PubMed Google Scholar
Spreng, R. N. & Andrews-Hanna, J. R. Brain Mapping: An Encyclopedic Reference. Vol. 1316, (ed. Toga, A. W.) 165–169 (Elsevier, 2015).
Yeo, B. T. et al. Functional specialization and flexibility in human association cortex. Cereb. Cortex 25, 3654–3672 (2015).
Article PubMed Google Scholar
Bzdok D., Eickenberg M., Grisel O., Thirion B. Advances in Neural Information Processing Systems. (2015).
Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504–507 (2006).
Article CAS PubMed Google Scholar
Behrens, T. E., Hunt, L. T. & Rushworth, M. F. The computation of social behavior. Science 324, 1160–1164 (2009).
Article CAS PubMed Google Scholar
Izuma, K., Saito, D. N. & Sadato, N. Processing of social and monetary rewards in the human striatum. Neuron 58, 284–294 (2008).
Article CAS PubMed Google Scholar
Guroglu, B., Haselager, G. J., van Lieshout, C. F., Takashima, A., Rijpkema, M. & Fernandez, G. Why are friends special? Implementing a social interaction simulation task to probe the neural correlates of friendship. Neuroimage 39, 903–910 (2008).
Article PubMed Google Scholar
Bzdok, D. et al. ALE meta-analysis on facial judgments of trustworthiness and attractiveness. Brain Struct. Funct. 215, 209–223 (2011).
Article CAS PubMed Google Scholar
Dohmatob, E., Dumas, G. & Bzdok, D. Dark control: The default mode network as a reinforcement learning agent. Hum. Brain Mapp. 41, 3318–3341 (2020).
Article PubMed PubMed Central Google Scholar
Sato, W. et al. Structural neural substrates of reading the mind in the eyes. Front Hum. Neurosci. 10, 151 (2016).
Article PubMed PubMed Central Google Scholar
Bzdok, D. et al. Segregation of the human medial prefrontal cortex in social cognition. Front. Hum. Neurosci. 7, 232 (2013).
Article PubMed PubMed Central Google Scholar
Eickhoff, S. B., Laird, A. R., Fox, P. T., Bzdok, D. & Hensel, L. Functional segregation of the human dorsomedial prefrontal cortex. Cereb. Cortex 26, 304–321 (2016).
Article PubMed Google Scholar
Powers, K. E., Somerville, L. H., Kelley, W. M. & Heatherton, T. F. Rejection sensitivity polarizes striatal–medial prefrontal activity when anticipating social feedback. J. Cogn. Neurosci. 25, 1887–1895 (2013).
Article PubMed PubMed Central Google Scholar
Bzdok, D., Varoquaux, G., Grisel, O., Eickenberg, M., Poupon, C. & Thirion, B. Formal models of the network co-occurrence underlying mental operations. PLoS Comput Biol. 12, e1004994 (2016).
Article PubMed PubMed Central CAS Google Scholar
Spreng, R. N. et al. The default network of the human brain is associated with perceived social isolation. Nat. Commun. 11, 6393 (2020).
Kanai, R., Bahrami, B., Duchaine, B., Janik, A., Banissy, M. J. & Rees, G. Brain structure links loneliness to social perception. Curr. Biol. 22, 1975–1979 (2012).
Article CAS PubMed PubMed Central Google Scholar
Miller, K. L. et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci. 19, 1523–1536 (2016).
Article CAS PubMed PubMed Central Google Scholar
Alfaro-Almagro, F. et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage 166, 400–424 (2018).
Article PubMed Google Scholar
Smith, S. M. Fast robust automated brain extraction. Hum. Brain Mapp. 17, 143–155 (2002).
Article PubMed PubMed Central Google Scholar
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825–841 (2002).
Article PubMed Google Scholar
Jenkinson, M. & Smith, S. A global optimisation method for robust affine registration of brain images. Med. Image Anal. 5, 143–156 (2001).
Article CAS PubMed Google Scholar
Andersson J. L., Jenkinson M. & Smith S. Non-linear registration aka Spatial normalisation FMRIB Technial Report TR07JA2. FMRIB Analysis Group of the University of Oxford. 1–22 (2007).
Zhang, Y., Brady, M. & Smith, S. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm. IEEE Trans. Med. Imaging 20, 45–57 (2001).
Article CAS PubMed Google Scholar
Smith, S. M. et al. Accurate, robust, and automated longitudinal and cross-sectional brain change analysis. Neuroimage 17, 479–489 (2002).
Article PubMed Google Scholar
Kernbach, J. M. et al. Subspecialization within default mode nodes characterized in 10,000 UK Biobank participants. Proc. Natl Acad. Sci. USA 115, 12295–12300 (2018).
Article CAS PubMed PubMed Central Google Scholar
Frangou, S., Chitins, X. & Williams, S. C. Mapping IQ and gray matter density in healthy young people. Neuroimage 23, 800–805 (2004).
Article PubMed Google Scholar
Goodfellow, I., Bengio, Y., Courville, A. & Bengio, Y. Deep Learning. (MIT press, Cambridge, 2016).
Google Scholar
Bzdok, D. Classical statistics and statistical learning in imaging neuroscience. Front. Neurosci. 11, 543 (2017).
Article PubMed PubMed Central Google Scholar
Hastie T., Tibshirani R., Friedman J. The Elements Of Statistical Learning: Data Mining, Inference, And Prediction (Springer Science & Business Media, 2009).
Hinton G., Srivastava N., Swersky K. Neural networks for machine learning lecture 6a overview of mini-batch gradient descent. Cited on 14, (2012).
Cheung B., Livezey J. A., Bansal A. K., Olshausen B. A. Discovering hidden factors of variation in deep networks. arXiv preprint arXiv:14126583, (2014).
Pereira, F., Mitchell, T. & Botvinick, M. Machine learning classifiers and fMRI: a tutorial overview. Neuroimage 45, S199–S209 (2009).
Article PubMed Google Scholar
Bzdok, D. & Yeo, B. T. T. Inference in the age of big data: Future perspectives on neuroscience. Neuroimage 155, 549–564 (2017).
Article PubMed Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Karrer, T. M. et al. Brain-based ranking of cognitive domains to predict schizophrenia. Hum. Brain Mapp. 40, 4487–4507 (2019).
Article PubMed PubMed Central Google Scholar
Tannenbaum, C., Norris, C. M. & McMurtry, M. S. Sex-specific considerations in guidelines generation and application. Can. J. Cardiol. 35, 598–605 (2019).
Article PubMed Google Scholar
Chollet F. others. 2015. Keras: Deep learning library for theano and tensorflow. https://keras io/k (2015).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Google Scholar
Abraham, A. et al. Machine learning for neuroimaging with scikit-learn. Front. Neuroinform. 8, 14 (2014).
Article PubMed PubMed Central Google Scholar
Team B. D. Bokeh: Python Library For Interactive Visualization (Bokeh Development Team Wichita, KS, 2014).

Download references

Acknowledgements

D.B. was supported by the Healthy Brains Healthy Lives initiative (Canada First Research Excellence fund), the CIFAR Artificial Intelligence Chairs program (Canada Institute for Advanced Research), and Canadian Institute for Health Research (CIHR) project grant 438531, and Google (Research/Teaching Award). D.B. and R.N.S. were also supported by NIH-R01 grant AG068563A.

Author information

Authors and Affiliations

Department of Psychiatry, Psychotherapy, and Psychosomatics, RWTH Aachen University, Aachen, Germany
Hannah Kiesow
Laboratory of Brain and Cognition, Montreal Neurological Institute, Department of Neurology and Neurosurgery, McGill University, Montreal, QC, Canada
R. Nathan Spreng
Department of Psychiatry and Psychology, McConnell Brain Imaging Centre, Montreal Neurological Institute, McGill University, Montreal, QC, Canada
R. Nathan Spreng
Department of Psychology, Yale University, New Haven, CT, USA
Avram J. Holmes
Cerebral Imaging Centre, Douglas Research Centre, Montreal, QC, Canada
M. Mallar Chakravarty
Department of Psychiatry, McGill University, Montreal, QC, Canada
M. Mallar Chakravarty
Department of Biological and Biomedical Engineering, McGill University, Montreal, QC, Canada
M. Mallar Chakravarty
Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud University, Nijmegen, The Netherlands
Andre F. Marquand
Department of Cognitive Neuroscience, Radboud University Medical Centre, Nijmegen, The Netherlands
Andre F. Marquand
Department of Neuroimaging, Centre for Neuroimaging Sciences, Institute of Psychiatry, King’s College London, De Crespigny Park, London, UK
Andre F. Marquand
Department of Electrical and Computer Engineering, Centre for Sleep and Cognition, Clinical Imaging Research Centre, N.1 Institute for Health and Institute for Digital Medicine, National University of Singapore, Singapore, 119077, Singapore
B. T. Thomas Yeo
Department of Biomedical Engineering, McConnell Brain Imaging Centre (BIC), Montreal Neurological Institute (MNI), Faculty of Medicine, School of Computer Science, McGill University, Montreal, Canada
Danilo Bzdok
Mila - Quebec Artificial Intelligence Institute, Montreal, Canada
Danilo Bzdok

Authors

Hannah Kiesow
View author publications
You can also search for this author in PubMed Google Scholar
R. Nathan Spreng
View author publications
You can also search for this author in PubMed Google Scholar
Avram J. Holmes
View author publications
You can also search for this author in PubMed Google Scholar
M. Mallar Chakravarty
View author publications
You can also search for this author in PubMed Google Scholar
Andre F. Marquand
View author publications
You can also search for this author in PubMed Google Scholar
B. T. Thomas Yeo
View author publications
You can also search for this author in PubMed Google Scholar
Danilo Bzdok
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.B. designed the study and the quantitative analysis approach and supervised the project. H.K., and D.B. implemented the analyses, interpreted the results, created the figures and drafted the manuscript. H.K., D.B., R.N.S., A.J.H., M.M.C., A.F.M., and B.T.T.Y. contributed to the interpretation of the data as well as revising the manuscript.

Corresponding author

Correspondence to Danilo Bzdok.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Description of Additional Supplementary Files

Supplementary Data 1

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kiesow, H., Spreng, R.N., Holmes, A.J. et al. Deep learning identifies partially overlapping subnetworks in the human social brain. Commun Biol 4, 65 (2021). https://doi.org/10.1038/s42003-020-01559-z

Download citation

Received: 12 August 2020
Accepted: 03 December 2020
Published: 14 January 2021
DOI: https://doi.org/10.1038/s42003-020-01559-z

This article is cited by

Organization of the social cognition network predicts future depression and interpersonal impairment: a prospective family-based study
- Eyal Abraham
- Yun Wang
- Jonathan Posner
Neuropsychopharmacology (2022)
Dissecting the midlife crisis: disentangling social, personality and demographic determinants in social brain anatomy
- Hannah Kiesow
- Lucina Q. Uddin
- Danilo Bzdok
Communications Biology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.