Psychiatry is undergoing a paradigm shift from the acceptance of distinct diagnoses to a representation of psychiatric illness that crosses diagnostic boundaries. How this transition is supported by a shared neurobiology remains largely unknown. In this study, we first identify single nucleotide polymorphisms (SNPs) associated with psychiatric disorders based on 136 genome-wide association studies. We then conduct a joint analysis of these SNPs and brain structural connectomes in 678 healthy children in the PING study. We discovered a strong, robust, and transdiagnostic mode of genome–connectome covariation which is positively and specifically correlated with genetic risk for psychiatric illness at the level of individual SNPs. Similarly, this mode is also significantly positively correlated with polygenic risk scores for schizophrenia, alcohol use disorder, major depressive disorder, a combined bipolar disorder-schizophrenia phenotype, and a broader cross-disorder phenotype, and significantly negatively correlated with a polygenic risk score for educational attainment. The resulting “vulnerability network” is shown to mediate the influence of genetic risks onto behaviors related to psychiatric vulnerability (e.g., marijuana, alcohol, and caffeine misuse, perceived stress, and impulsive behavior). Its anatomy overlaps with the default-mode network, with a network of cognitive control, and with the occipital cortex. These findings suggest that the brain vulnerability network represents an endophenotype funneling genetic risks for various psychiatric illnesses through a common neurobiological root. It may form part of the neural underpinning of the well-recognized but poorly explained overlap and comorbidity between psychiatric disorders.
The representation of mental illness along dimensions that cross diagnostic boundaries is progressively replacing the acceptance of distinct psychiatric diagnoses . This paradigm shift is supported by the high comorbidity between disorders [2, 3], their clinical overlap , and their shared heritability . This transition holds promise to identify more reliable biomarkers  and develop novel treatments  for psychiatric diseases. A crucial step in this endeavor is to unravel shared biological mechanisms underpinning different psychiatric illnesses. One hypothesis is that psychiatric illness reflects largely common disruptions in brain circuits encoding core dimensions of cognition and affect . In support of this hypothesis, case-control neuroimaging studies have identified brain circuit disruptions shared across psychiatric disorders [9,10,11] and a recent study has shown associations between white matter properties and a transdiagnostic behavioral measurement of psychopathology . However, the etiology and interpretation of these associations between neuroimaging findings and diagnostic categories or behaviors remain unclear. They may reflect shared genetic risks, environmental risks, consequences of the disease or behaviors, attempted adaptive responses to illness, or a combination thereof . Understanding causal mechanisms of cross-disorder disease connectomics therefore calls for study designs that go beyond comparing patients with healthy controls . For specific disorders, a few studies have shown that connectomic alterations are present in at-risk individuals (i.e., individuals with a high clinical risk for the disorder, or with an affected parent) providing initial evidence that some connectomic changes may precede the onset of illness [14, 15]. Whether these findings form part of a more general overarching vulnerability to psychiatric illness across diagnostic categories and whether this vulnerability can be imputed to genetic factors remain unknown.
To begin to address this question, we conducted an integrated analysis of structural connectomes (i.e., comprehensive maps of the white matter connections in the brain) and single nucleotide polymorphisms (SNPs) in healthy children. Our analysis focused on SNPs with an established risk for psychiatric illness or for phenotypes related to brain connectivity (i.e., neurological diseases, behavior-cognition, and brain morphology/function). The aim of this analysis was to assess whether genetic risk factors for psychiatric illness are associated with specific patterns of brain connectivity. We used data from 678 children and adolescents in the Pediatric Imaging, Neurocognition, and Genetics (PING) cohort—one of the largest neuroimaging and genetic datasets in healthy children . Importantly, the use of a healthy population factors out the impact of illness state, its sequelae, and treatments, while the use of a pediatric population limits the impact of different environmental exposures on the brain . We used canonical correlation analysis (CCA), which has been applied previously to identify a rich relationship between functional connectomes, demographics, and behavior . The primary input to CCA were data for 1877 SNPs and whole-brain structural connectomes obtained from diffusion MRI (dMRI). CCA seeks modes of population covariation that goes beyond testing whether a predetermined linear combination of SNPs for a specific disorder correlates with a brain imaging phenotype. To complement this approach, we also explored relationships between psychiatric polygenic risk scores (PRS) and the structural connectome.
Methods and materials
All data for this study come from the PING dataset. Recruitment and exclusion criteria are described elsewhere . Participants and their parents gave their written informed consent or assent to participate in study procedures. The Institutional Review Board of data collection sites approved the study. All analyses were conducted on all participants who, as of April 2017, had a T1- and T2-weighted MRI, a full set of dMRI acquired with the PING research protocol, and genotype data. This resulted in a total of 678 participants (336 female, 342 male, mean [SD] age: 12.8 years [4.87 years]; see demographics details in Supplementary Table 1).
The genetic data acquisition (of 550,000 SNPs) and quality control is described elsewhere . SNPs of interest were selected as follows (see also Supplementary Figs. 1, 2 and Supplementary Information). Phenotypes of interest (Supplementary Table 2) were first identified from the NHGRI-EBI genome-wide association studies (GWAS) Catalog , which references all major GWAS to date. SNPs associated with a phenotype of interest at genome-wide significance (p < 5 × 10−8), as reported in the catalog, were identified. When an identified SNP was not part of the PING data, proxies in linkage disequilibrium with that SNP (with R2 > 0.8) were selected instead, in line with standard practice . This process resulted in data for 1877 SNPs based on 136 GWAS for each of the 678 participants. The same process was repeated for SNPs associated with asthma and Type 2 diabetes; these phenotypes were used to test whether modes of covariation are specific to psychiatry or reflect a more general dimension of health. They were selected as archetypal examples of diseases with an established genetic basis and affecting mostly children and mostly adults, respectively.
Social, emotional, and behavioral data
The PhenX questionnaire comprising social, emotional, and behavioral variables [16, 22] was administered to 117 participants of the PING cohort. This questionnaire was always acquired after the imaging data (mean [SD] delay: 15.6 months [7.3 months]; Supplementary Table 1). Only variables with over ten observations and with at least 5% of observations different from the rest (for dichotomous variables) were kept. Data for these variables were encoded so that higher values relate to increased psychiatric vulnerability.
Details of the MRI acquisition are described elsewhere . The MRI preprocessing is described in the supplementary information.
For each subject, a multi-tensor diffusion model was estimated from the preprocessed dMRI data . This multi-tensor model accounts for crossing white matter fascicles and partial volumes of cerebrospinal fluid. The fractional anisotropy (FA) of each tensor represents the fascicle-specific FA (fFA), rather than the voxel-wise FA which is artifactually low in areas of crossing fascicles.
Whole-brain tractography was then performed using a multi-peak stochastic tractography algorithm. The streamlines were initiated at the white/gray matter interface  to increase the tractography reproducibility . The seeding mask was constructed by computing the intersection between the white matter mask and a 2-voxel dilation of the gray matter mask. Fifteen streamlines were stochastically initiated in each seeding voxel. The streamlines were then propagated throughout the multi-tensor field by following, at each step, the tensor of the sub-voxel-interpolated multi-tensor model  best aligned with the current streamline trajectory incorporating both momentum strategy  and tensor deflection . Streamlines were stopped when they (i) touched the gray matter, (ii) reached a curvature larger than 30°, or (iii) reached an fFA < 0.2.
The nodes of the structural connectome were defined by parcellation of the gray matter into 120 regions [29,30,31] (see Supplementary Information). Connectome matrices (symmetric 120 × 120 matrices) were obtained by computing for each pair of regions, the average fFA of streamlines connecting them . Entries in these matrices are referred to as connection strengths.
Data were recorded as a 678 × 7140 connectomic data matrix containing all 7140 connection strengths for all subjects, and a 678 × 1877 genomic data matrix containing all 1877 SNPs for all subjects. The connectomic data matrix was adjusted for age, sex, scanner site, and genetic ancestry factors (GAFs) to avoid spurious correlations caused by these confounds (see below). To avoid overfitting in the CCA estimation, dimensionality reduction with principal component analysis (keeping 100 principal components as in ) was applied to the genomic and adjusted connectomic data matrices independently. The resulting two 678 × 100 dimensionality-reduced matrices were the inputs to CCA in Matlab 2015a.
CCA results in symmetric linear relationships between two sets of variables . Here, each such relationship is a linear combination of SNPs called genetic canonical score and a linear combination of connection strengths called connectomic canonical score so that each individual is represented by two scores. Each linear relationship identified with CCA represents a mode of genome–connectome covariation in the sense that participants with a high genetic canonical score tend to have a high connectomic canonical score.
We also define canonical strengths which are properties of specific connections and SNPs and denote the extent to which these variables are correlated with the modes of covariation. Canonical genetic strengths are defined as the correlation between a SNP and the genetic canonical score and canonical connection strengths are defined as the correlation between a connection strength and the connectomic canonical score.
The genes that SNPs map to were recorded from the GWAS catalog. We used PANTHER-GO 14.0  with Fisher’s exact test and Bonferroni correction to identify which biological processes, cellular components, and molecular functions were most enriched against 20,996 background human genes.
Polygenic risk scores
The focus on SNPs of interest enables interpretation of modes of genome–connectome covariation in terms of etiological pathways. However, individual SNPs are poor predictors of current psychiatric phenotypes. PRS computed from summary statistics of GWAS incorporate information from the entire genome and are much more predictive at the expense of being less directly interpretable . The two approaches thus complement one another. PRS were used in the present study to assess whether modes of genome–connectome covariation were related to increased genetic risks for different psychiatric phenotypes.
All phenotypes meeting the following criteria were selected for PRS analysis: (i) at least one GWAS has been conducted for that phenotype and its summary statistics made publicly available on the Psychiatric Genomics Consortium repository, (ii) a PRS has been calculated for that phenotype using the same summary statistics, (iii) the PRS explains at least 1% of the variance in that phenotype as validated within at least one external population. This resulted in PRS available for nine psychiatric phenotypes: major depressive disorder , bipolar disorder , schizophrenia , alcohol use disorder , autism spectrum disorders , eating disorder , attention-deficit hyperactivity disorder , a combined bipolar disorder and schizophrenia phenotype , and a broader cross-disorder phenotype . We also added a PRS for educational attainment (which is known to predict well over 1% of the variance in this phenotype ) as this phenotype was also found to be (negatively) associated with the brain vulnerability network at the level of index SNPs.
PRS for each phenotype were calculated using PRSice 2.0 (which uses PLINK 1.9) . The summary statistics of the original study (i.e., GWAS or meta-analysis thereof) were used as the base populations and the PING cohort was used as the target population. High-resolution p-thresholding is used by default in PRSice (this includes p-thresholds from 5 × 10−8 to 1.0 by steps of 5 × 10−5). It was used in this analysis, albeit with an upper bound set to the optimal threshold identified in the original study (since by definition SNPs not meeting this threshold do not contribute predictive power to the phenotype of interest).
The data-driven selection of covariates is described in the Supplementary Information.
To assess whether the modes of covariation were statistically significant, permutation tests with 10,000 permutations were applied. At each permutation, the subject IDs of one input matrix were randomly permuted relative to the other, CCA was applied to the result, and the maximum correlation coefficient achieved (in absolute value) was recorded. Comparison against this null distribution of maximum correlation coefficients therefore controls for multiple comparisons across modes of covariation.
The canonical genetic and connection strengths were also computed within each permutation resulting in p values for these variables. FDR-correction for multiple comparison using the Benjamini–Hochberg method  was applied to all canonical connection strengths together and all canonical genetic strengths within each phenotype.
Because CCA seeks the highest correlations in the data, it results in large correlations even under the null hypothesis. P values obtained from permutation tests account for this. However, to assess effect sizes, we computed the proportion of the variance in connectomes and SNPs explained by the modes of covariation and compared it with that explained under the null, following the same approach as in .
The correlation between the PRS for psychiatric phenotypes and the connectomic canonical score was calculated at each p-threshold using PRSice 2.0 . GAFs were included as covariates in the analysis (even though the connectomic data matrix was already adjusted for these factors) to control for any residual influence of such factors at the polygenic level. This was achieved in PRSice 2.0 by subtracting from the R2 of the full regression model (including PRS and GAFs as independent variables), the R2 of the null model (in which only GAFs are included as independent variables). The two-tailed p value for the association between the PRS for a base phenotype and the connectomic canonical score was obtained using permutation tests (as implemented in PRSice 2.0), which correct for the multiple comparisons resulting from testing associations at different p-thresholds.
Social, emotional, and behavioral variables were adjusted for age and sex via linear regression for continuous variables,
and via logistic regression for dichotomous variables,
To test the hypothesis that canonical brain networks are associated with these adjusted variables, correlation coefficients were first calculated between them and the connectomic canonical scores. The corresponding p values were collectively analyzed using a permutation-based combined probability test  with 1000 permutations. This test accounts for nonindependent p values (since some social, emotional, and behavioral variables are highly correlated). One-tailed p values were used as inputs to the combined probability test to guarantee that a significant result reflects a positive correlation between the connectomic canonical scores and behaviors related to psychiatric vulnerability (and not just any association between the mode of covariation and social, emotional, and behavioral variables, see Supplementary Information).
We applied mediation analysis (using the lavaan R package 0.6–3 ) to test whether significant correlations between connectomic canonical scores and social, emotional, and behavioral variables might reflect a mediation of the effect of genetic risks on behaviors through brain networks . In this analysis, the genetic canonical score was the independent variable, social, emotional, and behavioral variables were dependent variables, and the connectomic canonical score was the mediator. The p values for the mediation of connectomic canonical scores on dependent variables were collectively analyzed using the same permutation-based combined probability test as above .
Robustness of the results was evaluated by reproducing the analysis in six scenarios: (i) in two mutually exclusive subgroups randomly selected from the population, (ii) after restricting the subjects to a more homogenous population of European genetic ancestry, (iii) by adjusting SNPs and brain connections (rather than brain connections alone) to ancestry factors, (iv) after changing the number of principal components retained, (v) by restricting the input SNPs to specific subgroups of psychiatric phenotypes thereby testing whether canonical brain networks are truly transdiagnostic, (vi) after excluding participants exhibiting behaviors related to psychiatric vulnerability. Details of these analyses are given in the Supplementary Information.
Our analysis revealed five main findings. First, there was a single, highly significant, mode of genome–connectome covariation (correlation between genetic and connectomic canonical scores: r = 0.74, permutation test: p < 10−4 corrected for multiple comparisons across all modes estimated). It explains 6.43 times more of the variance in connectomes, and 3.33 times more of the variance in SNPs, than does the null distribution of CCA modes. All subsequent modes were non-significant (p > 0.1).
Second, SNPs related to a range of psychiatric and behavioral-cognitive phenotypes were significantly correlated with the mode of covariation (Fig. 1a; FDR-corrected p value < 0.05). SNPs correlated with the mode of covariation were distributed across the genome (Fig. 1b). The allele that significantly correlated positively with the genetic canonical score for the top SNP of each psychiatric phenotype encodes an increased risk for that phenotype. In other words, high canonical scores were associated with an increased risk for a wide range of psychiatric phenotypes. These SNPs map to a range of loci (Supplementary Table 3), which in turn map to a range of ontologies (Table 1). In contrast to these associations with psychiatric disorders, SNPs related to neurological disorders showed no such correlations (FDR-corrected p value > 0.05) with the notable exception of migraine without aura (r = 0.19, FDR-corrected p = 0.0015), which parallels the observation that this is the only neurological condition to share heritability with psychiatric disorders . In terms of brain morphology and function, SNPs for facial width and white matter hyperintensities (despite the absence of actual hyperintensities in the PING population) were negatively correlated with the CCA mode. The canonical brain network was found to be virtually identical when SNPs related to brain structure, function, and neurological phenotypes were excluded from the analysis (correlation with the original canonical network r = 0.995, p < 0.001) demonstrating that these phenotypes do not drive the results. To further explore whether the identified mode of genome–connectome covariation is specific to psychiatric phenotypes, we computed its correlation with SNPs associated with two nonpsychiatric diseases with a well-established genetic component—asthma and type 2 diabetes—and found no significant association (r = 0.08 and 0.12, FDR-corrected p = 0.84 and 0.066, respectively).
At the polygenic level (Fig. 2 and Supplementary Fig. 3 and Supplementary Table 4), the connectomic canonical score was found to be positively and significantly associated with a PRS for schizophrenia (R2 = 1.1%, uncorrected p = 0.0042, permutation-corrected p = 0.030), major depressive disorder (R2 = 1.2%, uncorrected p = 0.0026, permutation-corrected p = 0.016), alcohol use disorder (R2 = 1.4%, uncorrected p = 0.00088, permutation-corrected p = 0.023), a cross-disorder PRS (R2 = 0.9%, uncorrected p = 0.011, permutation-corrected p = 0.030), and a combined bipolar-schizophrenia PRS (R2 = 2.1%, uncorrected p = 4.8 × 10−5, permutation-corrected p = 0.0010). It was also found to be positively associated with a PRS for bipolar disorder (R2 = 1.1%, uncorrected p = 0.0045, permutation-corrected p = 0.068) and for autism spectrum disorders (R2 = 0.7%, uncorrected p = 0.020, permutation-corrected p = 0.21) but these associations were not statistically significant after correction for multiple p-thresholds. It was not found to be significantly associated with attention-deficit hyperactivity disorder (R2 = 0.2%, uncorrected p = 0.27, permutation-corrected p = 0.91) or eating disorders (R2 = 0.2%, uncorrected p = 0.17, permutation-corrected p = 0.79). These results strongly suggest that the mode of genome–connectome covariation is driven by SNPs associated with psychiatric phenotypes, and that high canonical scores relate to an increased genetic vulnerability to psychiatric illness.
Third, the brain network identified by CCA was made of both positive and negative canonical connection strengths (Fig. 3a, b). By definition, positive (respectively negative) canonical connections tend to be higher (respectively lower) in individuals with high canonical scores. Overall, high canonical scores were negatively correlated with average connection strength (r = −0.17, 95% C.I. [−0.24, −0.09], permutation test p < 10−4) implying that individuals with high canonical scores have on average weaker connectivity. Hubs in the network overlap with three different systems (Fig. 3c; Supplementary Tables 5, 6): the default-mode network (including the angular gyri, posterior cingulate cortices, precuneus, and supramarginal gyri), a network of cognitive control [9, 10] (including the anterior and mid-cingulate cortices, the inferior frontal gyri, and the supplementary motor area), and the occipital cortex. Further details on how the anatomy of the brain vulnerability network overlaps with known systems are provided in the Supplementary Information and Supplementary Fig. 4.
Fourth, the canonical brain network was broadly reproduced when the input SNPs were restricted to specific subgroups of psychiatric phenotypes related to psychosis and autism (correlation with the original canonical network r = 0.80, p < 0.001), mood disorders (r = 0.77, p < 0.001), and addiction (r=0.54, p < 0.001). This finding supports the transdiagnostic nature of the canonical brain network by rejecting the explanation that it is a juxtaposition of subnetworks related to specific categories of psychiatric disorders.
Fifth, connectomic canonical scores were significantly correlated with measurements of social and emotional function and substance exposure, despite these variables being independent from the inputs to CCA (Fig. 4 and Supplementary Fig. 5; permutation test: p = 0.029). Canonical scores were significantly associated with increased marijuana, alcohol, sedatives, and caffeine exposure and misuse, panic disorder score, perceived stress, and impulsive behavior while being negatively associated with academic satisfaction. Mediation analysis revealed that the canonical brain network mediates the influence of genes on these variables (permutation test: p = 0.006). This finding implies that even though the PING cohort is a healthy population, individuals with higher canonical scores exhibit behaviors that are related to psychiatric vulnerability.
We also conducted various analyses to test the robustness of the results. The mode of population covariation was found to be robust when reproduced in two mutually exclusive subgroups of participants randomly sampled from the population (correlation between the original and reproduced canonical genetic strengths: 0.89 and 0.93; 95% C.I. [0.87, 0.91] and [0.92, 0.95], respectively; p < 0.001 for both; and correlation between the original and reproduced canonical connection strengths: 0.65 and 0.68; 95% C.I. [0.62, 0.68] and [0.65, 0.71], respectively; p < 0.001 for both). It was also found to be robust when both genetic and connectomic data (rather than connectomic data alone) were adjusted for admixture genetic variables (correlation between canonical genetic strengths: 0.86; 95% C.I. [0.83, 0.89]; and between canonical connection strengths: 0.95; 95% C.I. [0.95, 0.96]; p < 0.001 for both) and when the dataset was restricted to a more homogenous population of European genetic ancestry (correlation between canonical genetic strengths; 0.78; 95% C.I. [0.74, 0.82]; and between canonical connection strengths: 0.74; 95% C.I. [0.71, 0.76]; p < 0.001 for both). Furthermore, it was found to be robust when participants exhibiting behaviors related to psychiatric vulnerability were excluded from the analysis (correlation between the original and subject-reduced canonical genetic strengths: 0.92, p < 0.001 and between the original and subject-reduced canonical connection strengths: 0.96, p < 0.001) demonstrating that these participants do not drive the results but can rather be considered as extremes on the vulnerability spectrum. A breakdown of the mode of genome–connectome covariation by age, sex, scanner manufacturer, and genetic ancestry is also provided in Supplementary Fig. 6. Finally, it was also robust to the number of principal components (from 50 to 150) used in the preprocessing of the connectomic and genomic data (correlation between canonical genetic strengths: 0.86–0.98; and between canonical connection strengths: 0.85–0.96; p < 0.001 for both; Supplementary Fig. 7).
The findings presented here support the existence of a structural brain network encoding genetic vulnerability to psychiatric illness across diagnostic categories. Meta-analyses of case-control studies linking brain functional connectivity to psychiatric disorders had suggested that neuroimaging findings are in part shared across psychiatric diagnoses . But such studies are irrevocably limited by the confounders of illness state, its treatment, and its sequelae. The emergence of common psychopathology factors [4, 12], their heritability [12, 50], and neuroimaging correlates [12, 51, 52] further suggested that a shared neurobiology may explain the clinical overlap between psychiatric diagnoses. However, the etiology of these factors remains unclear as they are only partially heritable [4, 12], and their interpretation in terms of vulnerability to psychiatric illness is merely postulated since prospective cohort studies are lacking.
By contrast, the present findings are solely driven by SNPs that confer an established risk for psychiatric disorders. This enables interpretation of the resulting brain network as a “vulnerability network” and the corresponding connectomic canonical scores as “brain vulnerability scores.” Unlike functional brain networks, this vulnerability network is made of white matter fascicles connecting cortical and subcortical areas, which are thought to be state-independent —a necessary condition to qualify as an endophenotype [54, 55]. Whether it represents a true endophenotype or is rather a biomarker  will ultimately require longitudinal studies following cohorts with high brain vulnerability scores. But the fact that brain vulnerability scores mediate the impact of genetic vulnerability onto behaviors related to psychiatric illness suggest that the vulnerability network forms part of its pathogenesis . The mechanism by which this shared endophenotype turns into different clinical presentations remains to be discovered. Clinical presentations likely emerge from complex pathophysiological pathways involving interactions between environmental and genetic risks. In such intricate pathways, the vulnerability network might be a central element funneling genetic risks through a common neurobiological root.
The vulnerability network overlaps with a network of cognitive control identified by a meta-analysis of case-control studies across major psychiatric diagnoses  and with the occipital cortex. The latter is consistent with the growing evidence for the involvement of occipital cortices in psychopathology as part of a speculated disruption in the selection and processing of sensory information . These overlaps thereby replicate previous findings whilst helping elucidate their etiology. Conversely, previous transdiagnostic findings which diverge from the vulnerability network may result from nongenetic causes such as environmental factors . For instance, the finding of a relation between decreased fronto-occipital white matter connectivity and a common psychopathology factor  may reflect its known association with adverse childhood experience [56,57,58]. Finally, the vulnerability network presents two important specificities, which had not yet been clearly identified as phenotypes shared between psychiatric diagnoses. First, the association between the mode of covariation and a decreased average connectivity suggests that hypoconnectivity is a shared phenotype, which is, at least to a degree, genetically driven and which confers an increased vulnerability to psychiatric illness. This may explain why meta-analyses of case-control studies of white matter integrity more consistently find hypoconnectivity than hyperconnectivity in schizophrenia , depression , and bipolar disorder . Second, the overlap with the default-mode network may explain why the latter is frequently found to be altered in psychiatric disorders. This overlap supports the suggestion that the default-mode network may also represent a transdiagnostic phenotype  while proposing that its disruption may be present before the onset of symptoms and be attributable in part to genetic factors.
The evolution of the brain vulnerability network through the lifespan and its changes in disease remain to be elucidated. Complex gene–environment interactions may substantially alter the association between the vulnerability network and the genetic architecture of psychiatric disorders in adults (indeed this was a major motivation for conducting the present study in a pediatric population). For instance, it might be that the cumulative effect of environmental risk factors for psychiatric illnesses also results in changes to the brain vulnerability network. In that case, high brain vulnerability scores in adults might occur in the absence of genetic predispositions. Conversely, a child with high brain vulnerability score developing in a favorable environment might reach adulthood without ever receiving a psychiatric diagnosis, perhaps reflecting an effect of the favorable environment upon their white matter microstructure. In an initial attempt to explore this question, we used data from 20,827 participants of the UK Biobank in whom suitable MRI data were available [63, 64] (see Supplementary Information and Supplementary Table 7 for details on the methods). Three imaging-derived phenotypes in UK Biobank overlap with the brain vulnerability network (Supplementary Fig. 8) and were predicted to be negatively associated with the genetic canonical score (if the mode of genome–connectome covariation remains unchanged in adulthood): the FA of the forceps major (most significantly) and the two acoustic radiations. We found that the FA of the forceps major was significantly and negatively associated with the genetic canonical scores (r = −0.02; p = 0.0088, Bonferroni-corrected p = 0.026). Moreover, it was significantly lower among participants who had a self-reported psychiatric diagnosis compared with other UK Biobank participants (mean standardized FA −0.056 in “patients” vs. 0.012 in “controls;” 95% C.I. of the difference [0.034, 0.11]; Bonferroni-corrected p = 0.00042). These results thus replicate the association of lower occipito-occipital connectivity with genetic predisposition to psychiatric illness (and, even more strongly, with actual psychiatric diagnosis). Further analyses of imaging data from well-phenotyped and genotyped population samples will help clarify the temporal stability and/or dynamic nature of the vulnerability network across the lifespan.
In summary, we discovered a pattern of structural brain connectivity in children which robustly encodes a genetic vulnerability to psychiatric illness which crosses diagnostic boundaries. This brain network shows overlap with a network of cognitive control, the default-mode network, and the occipital cortex, and mediates the impact of genetic risk on behaviors related to psychiatric vulnerability. These findings help explain the shared neurobiology underpinning various psychiatric diagnoses and may contribute to understanding the mechanisms underlying the emergence of psychiatric illness and the interventions aimed at preventing it.
All MRI preprocessing and estimation of structural connectomes were completed using the image analysis software distributed as the Computational Radiology Kit version 1.6.3 (available at http://crl.med.harvard.edu).
Insel T, Cuthbert B, Garvey M, Heinssen R, Pine DS, Quinn K, et al. Research domain criteria (RDoC): toward a new classification framework for research on mental disorders. Am J Psychiatry. 2010;167:748–51.
Kessler RC, Chiu WT, Demler O, Merikangas KR, Walters EE. Prevalence, severity, and comorbidity of 12-month DSM-IV disorders in the National Comorbidity Survey Replication. Arch Gen Psychiatry. 2005;62:617–27.
Plana-Ripoll O, Pedersen CB, Holtz Y, Benros ME, Dalsgaard S, de Jonge P, et al. Exploring comorbidity within mental disorders among a Danish national population. JAMA Psychiatry. 2019. https://doi.org/10.1001/jamapsychiatry.2018.3658.
Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. The p factor: one general psychopathology factor in the structure of psychiatric disorders? Clin Psychol Sci. 2014;2:119–37.
Brainstorm Consortium, Anttila V, Bulik-Sullivan B, Finucane HK, Walters RK, Bras J, et al. Analysis of shared heritability in common disorders of the brain. Science. 2018;360:eaap8757.
Kapur S, Phillips AG, Insel TR. Why has it taken so long for biological psychiatry to develop clinical tests and what to do about it? Mol Psychiatry. 2012;17:1174.
Cuthbert BN, Insel TR. Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 2013;11:126.
Buckholtz JW, Meyer-Lindenberg A. Psychopathology and the human connectome: toward a transdiagnostic model of risk for mental illness. Neuron. 2012;74:990–1004.
McTeague LM, Huemer J, Carreon DM, Jiang Y, Eickhoff SB, Etkin A. Identification of common neural circuit disruptions in cognitive control across psychiatric disorders. Am J Psychiatry. 2017;174:676–85.
Goodkind M, Eickhoff SB, Oathes DJ, Jiang Y, Chang A, Jones-Hagata LB, et al. Identification of a common neurobiological substrate for mental illness. JAMA Psychiatry. 2015;72:305–15.
Baker JT, Dillon DG, Patrick LM, Roffman JL, Brady RO Jr, Pizzagalli DA, et al. Functional connectomics of affective and psychotic pathology. Proc Natl Acad Sci USA. 2019;116:9050–9.
Alnæs D, Kaufmann T, Doan NT, Córdova-Palomera A, Wang Y, Bettella F, et al. Association of heritable cognitive ability and psychopathology with white matter properties in children and adolescents. JAMA Psychiatry. 2018;75:287–95.
van den Heuvel MP, Sporns O. A cross-disorder connectome landscape of brain dysconnectivity. Nat Rev Neurosci. 2019;20:435–46.
Collin G, Scholtens LH, Kahn RS, Hillegers MHJ, van den Heuvel MP. Affected anatomical rich club and structural–functional coupling in young offspring of schizophrenia and bipolar disorder patients. Biol Psychiatry. 2017;82:746–55.
Schmidt A, Crossley NA, Harrisberger F, Smieskova R, Lenz C, Riecher-Rössler A, et al. Structural network disorganization in subjects at clinical high risk for psychosis. Schizophr Bull. 2017;43:583–91.
Jernigan TL, Brown TT, Hagler DJ Jr, Akshoomoff N, Bartsch H, Newman E, et al. The pediatric imaging, neurocognition, and genetics (PING) data repository. Neuroimage. 2016;124:1149–54.
Reus LM, Shen X, Gibson J, Wigmore E, Ligthart L, Adams MJ, et al. Association of polygenic risk for major psychiatric illness with subcortical volumes and white matter integrity in UK Biobank. Sci Rep. 2017;7:42140.
Smith SM, Nichols TE, Vidaurre D, Winkler AM, Behrens TEJ, Glasser MF, et al. A positive-negative mode of population covariation links brain connectivity, demographics and behavior. Nat Neurosci. 2015;18:1565–7.
Noble KG, Houston SM, Brito NH, Bartsch H, Kan E, Kuperman JM, et al. Family income, parental education and brain structure in children and adolescents. Nat Neurosci. 2015;18:773–8.
MacArthur J, Bowler E, Cerezo M, Gil L, Hall P, Hastings E, et al. The new NHGRI-EBI Catalog of published genome-wide association studies (GWAS Catalog). Nucleic Acids Res. 2017;45:D896–901.
Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O’Donnell CJ, de Bakker PIW. SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics. 2008;24:2938–9.
Hamilton CM, Strader LC, Pratt JG, Maiese D, Hendershot T, Kwok RK, et al. The PhenX Toolkit: get the most from your measures. Am J Epidemiol. 2011;174:253–60.
Taquet M, Scherrer B, Boumal N, Peters JM, Macq B, Warfield SK. Improved fidelity of brain microstructure mapping from single-shell diffusion MRI. Med Image Anal. 2015;26:268–86.
Girard G, Whittingstall K, Deriche R, Descoteaux M. Towards quantitative connectivity analysis: reducing tractography biases. Neuroimage. 2014;98:266–78.
St-Onge E, Daducci A, Girard G, Descoteaux M. Surface-enhanced tractography (SET). Neuroimage. 2018;169:524–39.
Taquet M, Scherrer B, Commowick O, Peters JM, Sahin M, Macq B, et al. A mathematical framework for the registration and analysis of multi-fascicle models for population studies of the brain microstructure. IEEE Trans Med Imaging. 2014;33:504–17.
Peters JM, Sahin M, Vogel-Farley VK, Jeste SS, Nelson CA 3rd, Gregas MC, et al. Loss of white matter microstructural integrity is associated with adverse neurological outcome in tuberous sclerosis complex. Acad Radiol. 2012;19:17–25.
Lazar M, Weinstein DM, Tsuruda JS, Hasan KM, Arfanakis K, Meyerand ME, et al. White matter tractography using diffusion tensor deflection. Hum Brain Mapp. 2003;18:306–21.
Akhondi-Asl A, Warfield SK. Simultaneous truth and performance level estimation through fusion of probabilistic segmentations. IEEE Trans Med Imaging. 2013;32:1840–52.
Velasco-Annis C, Akhondi-Asl A, Stamm A, Warfield SK. Reproducibility of Brain MRI segmentation algorithms: empirical comparison of local MAP PSTAPLE, FreeSurfer, and FSL-FIRST. J Neuroimaging. 2018;28:162–72.
Fonov V, Evans AC, Botteron K, Almli CR, McKinstry RC, Collins DL, et al. Unbiased average age-appropriate atlases for pediatric studies. Neuroimage. 2011;54:313–27.
Hotelling H. Relations between two sets of variates. Biometrika. 1936;28:321–77.
Mi H, Huang X, Muruganujan A, Tang H, Mills C, Kang D, et al. PANTHER version 11: expanded annotation data from gene ontology and reactome pathways, and data analysis tool enhancements. Nucleic Acids Res. 2017;45:D183–9.
Martin AR, Daly MJ, Robinson EB, Hyman SE, Neale BM. Predicting polygenic risk of psychiatric disorders. Biol Psychiatry. 2019;86:97–109.
Howard DM, Adams MJ, Clarke T-K, Hafferty JD, Gibson J, Shirali M, et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci. 2019;22:343–52.
Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, Trubetskoy V, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat Genet. 2019;51:793–803.
Ripke S, Neale BM, Corvin A, Walters JTR, Farh K-H, Holmans PA, et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–7.
Walters RK, Polimanti R, Johnson EC, McClintick JN, Adams MJ, Adkins AE, et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat Neurosci. 2018;21:1656–69.
Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H, et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet. 2019;51:431–44.
Watson HJ, Yilmaz Z, Thornton LM, Hübel C, Coleman JRI, Gaspar HA, et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat Genet. 2019;51:1207–14.
Demontis D, Walters RK, Martin J, Mattheisen M, Als TD, Agerbo E, et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet. 2019;51:63–75.
Bipolar Disorder and Schizophrenia Working Group of the Psychiatric Genomics Consortium. Genomic dissection of bipolar disorder and schizophrenia, including 28 subphenotypes. Cell. 2018;173:1705–15.e16.
Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet. 2013;381:1371–9.
Lee JJ, Wedow R, Okbay A, Kong E, Maghzian O, Zacher M, et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat Genet. 2018;50:1112–21.
Choi SW, O’Reilly PF. PRSice-2: Polygenic Risk Score software for biobank-scale data. Gigascience. 2019;8:giz082.
Benjamini Y, Hochberg Y. On the adaptive control of the false discovery rate in multiple testing with independent statistics. J Educ Behav Stat. 2000;25:60–83.
Dai H, Leeder JS, Cui Y. A modified generalized Fisher method for combining probabilities from dependent tests. Front Genet. 2014;5:32.
Rosseel Y. lavaan: an R package for structural equation modeling. J Stat Softw. 2012;48:1–36.
Baron RM, Kenny DA. The moderator–mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. J Pers Soc Psychol. 1986;51:1173.
Neumann A, Pappa I, Lahey BB, Verhulst FC, Medina-Gomez C, Jaddoe VW. et al. Single nucleotide polymorphism heritability of a general psychopathology factor in children. J Am Acad Child Adolesc Psychiatry. 2016;55:1038–45.e4.
Elliott ML, Romer A, Knodt AR, Hariri AR. A connectome-wide functional signature of transdiagnostic risk for mental illness. Biol Psychiatry 2018. https://doi.org/10.1016/j.biopsych.2018.03.012.
Romer AL, Knodt AR, Houts R, Brigidi BD, Moffitt TE, Caspi A, et al. Structural alterations within cerebellar circuitry are associated with general liability for common mental disorders. Mol Psychiatry. 2018;23:1084–90.
Park H-J, Friston K. Structural and functional brain networks: from connections to cognition. Science. 2013;342:1238411.
Kendler KS, Neale MC. Endophenotype: a conceptual analysis. Mol Psychiatry. 2010;15:789–97.
Walters JTR, Owen MJ. Endophenotypes in psychiatric genetics. Mol Psychiatry. 2007;12:886–90.
Benedetti F, Bollettini I, Radaelli D, Poletti S, Locatelli C, Falini A, et al. Adverse childhood experiences influence white matter microstructure in patients with bipolar disorder. Psychol Med. 2014;44:3069–82.
Rodrigo MJ, León I, Góngora D, Hernández-Cabrera JA, Byrne S, Bobes MA. Inferior fronto-temporo-occipital connectivity: a missing link between maltreated girls and neglectful mothers. Soc Cogn Affect Neurosci. 2016;11:1658–65.
Hanson JL, Adluru N, Chung MK, Alexander AL, Davidson RJ, Pollak SD. Early neglect is associated with alterations in white matter integrity and cognitive functioning. Child Dev. 2013;84:1566–78.
Kelly S, Jahanshad N, Zalesky A, Kochunov P, Agartz I, Alloza C, et al. Widespread white matter microstructural differences in schizophrenia across 4322 individuals: results from the ENIGMA Schizophrenia DTI Working Group. Mol Psychiatry. 2018;23:1261–9.
Liao Y, Huang X, Wu Q, Yang C, Kuang W, Du M, et al. Is depression a disconnection syndrome? Meta-analysis of diffusion tensor imaging studies in patients with MDD. J Psychiatry Neurosci. 2013;38:49–56.
Wise T, Radua J, Nortje G, Cleare AJ, Young AH, Arnone D. Voxel-based meta-analytical evidence of structural disconnectivity in major depression and bipolar disorder. Biol Psychiatry. 2016;79:293–302.
Gong Q, Hu X, Pettersson-Yeo W, Xu X, Lui S, Crossley N, et al. Network-level dysconnectivity in drug-naïve first-episode psychosis: dissociating transdiagnostic and diagnosis-specific alterations. Neuropsychopharmacology. 2017;42:933–40.
Alfaro-Almagro F, Jenkinson M, Bangerter NK, Andersson JLR, Griffanti L, Douaud G, et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 2018;166:400–24.
Elliott LT, Sharp K, Alfaro-Almagro F, Shi S, Miller KL, Douaud G, et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature. 2018;562:210–6.
The authors thank Hakim Ouaalam who helped with the preprocessing and quality assurance of the PING data. This work was funded by a Wellcome Trust Strategic Award 098369/Z/12/Z, the National Institute for Health Research (NIHR) Oxford Health Biomedical Research Centre, the Oxford University Clinical Academic Graduate School, the Foulkes Foundation, and a Royal College of Psychiatrists Foundation Fellowship. This investigation was also supported in part by NIH grants R01 NS079788, R01 EB019483, BCH IDDRC U54 HD090255, and S10 OD025111, by a NARSAD Distinguished Investigator award from the Brain and Behavior Research Foundation, and by a research grant from the Boston Children’s Hospital Translational Research Program. The views expressed are those of the authors and not necessarily those of the National Health Service, NIHR, or the Department of Health. Data used in preparation of this article were obtained from the controlled access PING dataset (NIH Data Archive project number: 2607) as part of the NIMH-supported Research Domain Criteria database (RDoCdb). RDoCdb is a collaborative informatics system created by the National Institute of Mental Health to store and share data resulting from grants funded through the Research Domain Criteria (RDoC) project. As such, the investigators within the PING study provided data but did not participate in analysis or writing of this report.
Conflict of interest
The authors declare that they have no conflict of interest.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Taquet, M., Smith, S.M., Prohl, A.K. et al. A structural brain network of genetic vulnerability to psychiatric illness. Mol Psychiatry (2020). https://doi.org/10.1038/s41380-020-0723-7