The reliability and heritability of cortical folds and their genetic correlations across hemispheres


Cortical folds help drive the parcellation of the human cortex into functionally specific regions. Variations in the length, depth, width, and surface area of these sulcal landmarks have been associated with disease, and may be genetically mediated. Before estimating the heritability of sulcal variation, the extent to which these metrics can be reliably extracted from in-vivo MRI must be established. Using four independent test-retest datasets, we found high reliability across the brain (intraclass correlation interquartile range: 0.65–0.85). Heritability estimates were derived for three family-based cohorts using variance components analysis and pooled (total N > 3000); the overall sulcal heritability pattern was correlated to that derived for a large population cohort (N > 9000) calculated using genomic complex trait analysis. Overall, sulcal width was the most heritable metric, and earlier forming sulci showed higher heritability. The inter-hemispheric genetic correlations were high, yet select sulci showed incomplete pleiotropy, suggesting hemisphere-specific genetic influences.


Genetic drivers of brain structural and functional differences are important to identify as potential risk factors for heritable brain diseases, and targets for their treatment. Large-scale neuroimaging consortia, including the ENIGMA1 consortium, have identified common genetic variants that have small but significant associations with variations in brain morphology2. Studies have even identified genetic correlations between human brain structure and risk for disease3,4.

Enriched in neuronal cell bodies, the cortical gray matter plays an important role in human cognitive functions and behavior, including sensory perception and motor control5. Macroscale anatomical features of the human cortex can be reliably extracted from structural magnetic resonance imaging (MRI) scans, and among the most common are regional thickness and surface area measures. These MRI-based features show robust alterations in several neurological, neurodevelopmental, and psychiatric disorders6, and are influenced by both environmental and genetic variation7.

Gyrification of the cortical surface occurs in an orchestrated pattern8 during fetal life and into adolescence9, forming sulci (fissures) and gyri (ridges) in the cortical gray matter. The mechanisms of brain folding are not fully understood10,11, but the process is largely preserved among humans and nonhuman primates. The brain sulci delimit cortical areas with specific functionalities and are generally consistent across subjects12,13,14,15. The complexity and intersubject variability of brain gyrification are influenced by developmental, aging, and pathological processes, all of which are genetically and environmentally influenced16,17.

Large-scale neuroimaging studies have begun to discover common and rare genetic variants that contribute to brain variability as estimated using in vivo brain scans, such as MRI18; genome-wide association studies (GWAS) find that, as with other complex traits, individual common variants typically explain <1% of the population variance in MRI derived measures; still, common genetic factors account for a large fraction of the variance in aggregate2,19,20,21. Successful efforts to discover common variants that affect cortical structure require tens of thousands of scans, as well as independent samples for replication and generalization. Large-scale biobanks have amassed tens of thousands of MRI scans22. Even so, to replicate effects and ensure the generalizability of findings to other scanned populations, we must first assess that the brain measures are reliably extracted across a variety of possible MRI scanning paradigms. This reliability is the basis for pooling statistical effects across individual studies in multisite consortia such as ENIGMA1 and CHARGE23.

Sulcal-based morphometry provides in-depth analyses of the cortical fissures, or folds, as seen on MRI. Measures of sulcal morphometry—including length, depth, width, and surface area—among others—have been associated with brain maturation in adolescents24, neurodegenerative changes in the elderly24,25, and neuropsychiatric disorders such as schizophrenia26,27, bipolar disorder28, and autism spectrum disorder29; altered fissuration is also found in several genetic disorders, such as Williams syndrome30,31. Effects on sulcal patterns have been reported as being partially independent of those on cortical thickness or surface area24,32.

Effects on sulcal patterns have been reported being partially independent of those on cortical thickness or surface area24,32. Previous studies have investigated the genetics of cortical folds, but not across the full brain, and without ensuring the reliability of the measures themselves, or the heritability estimates. Kochunov et al.33 analyzed the effects of age on sulcal shape descriptors in a subset of 14 sulci finding wider sulci with older age  in the adult human brain. The central sulcus has been the focus of many earlier publications34,35,36,37. Its depth has been reported to be highly heritable, with the degree of heritability varying along its profile37. In recent works, La Guen et al.38 studied the heritability of sulcal pits in the Human Connectome Project (HCP) and the genetic correlation of sulcal width across ten sulci in the UK Biobank39. The heritability of the depth, length, and surface area of primary sulci has been studied in baboons40. It has been suggested that deeper, earlier forming, sulci have higher heritability41, although this hypothesis has not been confirmed. The reliability of the findings across populations, and the extent to which heritability depends on the reliability of the measures, has not been investigated.

Here we: (a) estimate the reliability of four shape descriptors extracted from sulci across the whole brain; (b) evaluate heritability of these measures across four independent cohorts (three family-based cohorts and one cohort of unrelated participants); (c) determine the extent to which the heritability estimates depend on reliability; and (d) provide insights into the relationship between early forming sulci and higher heritability as well as cortical lateralization.

We performed an extensive reliability (N = 110) and heritability (N = 13,113) analysis. Reliability was estimated from four cohorts, totaling 110 participants (19–61 years of age, 47% females) who underwent two T1-weighted brain MRI scans across different brain imaging sessions. We included data sets for which we would expect minimal or no structural changes between scans, so we limited the analysis to healthy individuals aged 18–65, with an inter-scan interval < 90 days. See Table 1 for more details.

Table 1 Cohorts analyzed for the test–retest study.

We analyzed heritability in four independent cohorts, three with a family-based design and one using single-nucleotide polymorphism (SNP)-based heritability estimates. The cohorts included two twin-based samples (Queensland Twin Imaging study (QTIM) and HCP), one cohort of extended pedigrees (the Genetics of Brain Structure and Function; GOBS), and another of over 9000 largely unrelated individuals (the UK Biobank) (Table 2). Heritability estimates are population specific, but here our aim was to understand the heritability pattern across populations and estimate the degree to which genetic effects are consistently observed. We pooled information from all twim and family-based cohorts to estimate the generalized heritability values using meta- and mega-analytic methods42,43.

Table 2 Genetic analysis: demographics for the four cohorts analyzed in this study.

We estimated reliability and heritability for measures of each sulcus in the left and right hemispheres, separately. As there is limited evidence for genetic lateralization across most of the human brain44,45,46, we also evaluated the heritability estimates of the measures for each sulcus averaged across the two hemispheres. This may lead to more stable measurements and, if the bilateral measures are influenced by similar genetic factors, then more stable measures could lead to better powered genetic studies. We also assessed the genetic correlation between the measures across hemispheres. Sulci with limited genetic correlations between hemispheres may reveal novel insight into the brain’s lateralization and identify key biomarkers for relating lateralized traits, such as language and handedness, to brain structure47.


Measurement reliability and its relationship to heritability

Supplementary Data 1 reports the sulcal nomenclature, including the abbreviation and full name for each sulcus. Reliability estimates may be found in Supplementary Data 24 for intraclass correlation (ICC) and Supplementary Data 58 for the bias evaluation; heritability estimates are reported in Supplementary Data 920 for the univariate analysis and Supplementary Data 2124 for the bivariate analysis. We summarize the results below.

Intraclass correlation (ICC)

The ICC meta-analysis resulted in an ICC interquartile range of 0.59–0.82 Sulcal mean depth, width, and surface area showed similar reliability estimates, while the length showed lowest ICC (Table 3 and Fig. 1). For all descriptors other than mean depth, a higher fraction of sulci had “good” reliability, defined as ICC > 0.7548, after averaging metrics across corresponding left and right hemispheres, for all the descriptors (Fig. 1b) The fraction of sulci reaching ICC > 0.75 went from 24 (before averaging) to 39% (after averaging) for sulcal length, from 37 to 48% for the width, and from 48 to 59% for the surface area; mean depth remained consistant at 57%.

Table 3 Meta-analysis of ICC estimated from four independent cohorts for sulcal length, mean depth, width, and surface area.
Fig. 1: Intraclass correlation reliability estimates for sulcal length, depth, width and surface area.

a Sulcal-based meta-analysis of intraclass correlation (ICC) for bilaterally averaged sulcal measures (N = 110). Sulcal length showed generally “good” reproducibility, although no regions had ICC > 0.959. Mean depth showed “excellent” reproducibility (ICC > 0.9) for: the inferior frontal sulcus (S.F.inf.) and the superior frontal sulcus (S.F.sup.); sulcal width showed “excellent” reproducibility for: intraparietal sulcus (F.I.P.), superior postcentral intraparietal superior sulcus (F.I.P.Po.C.inf.), central sulcus (S.C.), superior postcentral sulcus (S.Po.C.sup.). Surface area showed “excellent” reproducibility for the central sulcus (S.C.), subcallosal sulcus (S.Call.), and the anterior occipito-temporal lateral sulcus ( b The intraclass correlation (ICC) for left, right, and bilaterally averaged sulcal length, mean depth, width, and surface area across the whole brain is plotted for the four test–retest cohorts. KKI showed the highest ICC across sulci.

The meta-analysis of ICC captures the consensus  in the reliability across cohorts for each sulcus. Reliability measures depend to some extent on the cohort examined, or the scanning acquisition parameters. For example, for QTIM, which was collected at 4 T, the ICC is classified as “good” (ICC > 0.75) for the left sulcal surface area of the collateral sulcus (F.Coll.), but “poor or moderate” (ICC < 0.75) in OASIS for the same trait. Figure 1a shows the meta-analysis of ICC across the four cohorts, and highlights patterns for “good” and “excellent” (ICC > 0.9) reliability.

For a detailed breakdown of the ICC for measures of sulci morphometry per cohort, please see Supplementary Fig. 1 for the left hemisphere, Supplementary Fig. 2 for the right, and Supplementary Fig. 3 for bilaterally averaged measures.

For the complete meta-analyzed ICC results, please see Supplementary Figs. 47 for length, depth, width, and surface area respectively, all of which are tabulated in Supplementary Data 4.

For each sulcus, we averaged the reliability estimates across all four sulcal descriptors to find the most reliable sulci overall. The central sulcus (S.C.) gave the most reliable sulcal measures, followed by the median frontal sulcus (S.F.median), the intraparietal sulcus (F.I.P.), the occipito-temporal lateral sulcus (, the Sylvian sulcus (S.C.Sylvian), the sub-parietal sulcus (S.s.P.), the occipital lobe, and the superior temporal sulcus (S.T.s.) (Supplementary Fig. 8).

Bias (b)

We explored test–retest (TRT) consistency in terms of the “bias” (b, Eq. (4)), with Bland–Altman analyses. As in ref. 49, the generally low bias values showed high TRT consistency of sulcal shape measures (Supplementary Data 5). Bias values ≥ 0.1 are considered high, and were noted mainly for length estimates—e.g., for the length of the left and right anterior/posterior sub-central ramus of the lateral fissure (, and the length of the left and right insula (See Supplementary Data 68 for bias estimates across the left, right and bilaterally averaged sulcal metrics). Paralleling the higher ICC in bilaterally averaged measures, lower “bias” estimates were obtained with individual sulcal measures averaged across the left and right hemispheres (Supplementary Data 8).

ICC and bias (b) of bilaterally averaged sulcal metrics were significantly negatively correlated for all metrics except for length, in particular rlength = −0.11 [pval = 0.07], rmean-depth = −0.14 [pval = 0.02], rwidth = −0.25 [pval = 4.6 × 10−5], rsurface-area = −0.25 [pval = 1.2 × 10−5], suggesting, as expected, that a lower bias between test and retest measurements relates to higher reproducibility as estimated by ICC.

Heritability estimates for the cortical folding patterns

Across descriptors and sulci, heritability estimates (h2) showed a similar pattern across the three family-based cohorts, QTIM, HCP, GOBS (Supplementary Figs. 911); the GOBS cohort shows lower heritability, (h2 = 0.3 ± 0.1), compared to QTIM (h2 = 0.4 ± 0.1) and HCP (h2 = 0.4 ± 0.1); GOBS is a cohort with an extended pedigree design and a wide age range (18–85 years of age), while both HCP and QTIM are twin-based cohorts of young adults aged 25–35 years and 20–30 years, respectively.

The generalized heritability profile of cortical folding was obtained by meta-analyzing the estimates across these three independent family-design cohorts, and is highlighted in Fig. 2a. Aggregate heritability estimates were also calculated in a mega-analytic manner, where 3030 subjects from the family-based cohorts (QTIM, HCP, and GOBS) were pooled (after adjusting for covariates within cohort and normalizing across cohorts) before computing heritability estimates as in prior work42,43. As expected, we found similarities between meta- and mega-analysis derived heritability estimates as indicated by a significant Pearson’s correlation between these two approaches (r 0.84, p = 10−3–10−7; see Supplementary Fig. 13 for more details). Individual heritability estimates, standard errors (SE), and p values for bilaterally averaged sulcal length, mean depth, width, and surface area are tabulated in Supplementary Data 918 for each cohort, and in Supplementary Data 1920 for the meta- and mega-analyses.

Fig. 2: Heritability estimates.

Heritability estimates (h2) are mapped, for each bilaterally averaged sulcal descriptor. a The results of the inverse-variance weighted meta-analysis of the heritability estimates across three family-based cohorts QTIM, HCP, and GOBS highlight an overall heritability profile across 3030 individuals. b Heritability estimates (h2) calculated from sulcal features extracted from MRI scans of 10,083 unrelated individuals scanned as part of the UK Biobank were calculated using the genome-wide complex trait analysis (GCTA) package. The regional sulcal metrics that were found to be significantly heritable in the large population sample largely overlap with those found to be most highly heritable across the family-based studies. We highlight only regions that had significant heritability estimates in sulci that had an ICC > 0.75 (see Supplementary Data 24 for sulcal-based values of ICC). Significant regions survived Bonferroni correction for multiple comparisons across all bilateral traits and regions (p < 0.05/(61 × 4)); darker red colors indicate higher heritability estimates. The left hemisphere was used for visualization purposes.

For many sulcal features in the UK Biobank, the SNP-based heritability estimates were ~25% of the estimates derived from the family-based studies (h2 = 0.2 ± 0.1; Fig. 2b). The heritability estimates for the UK Biobank are reported in Supplementary Data 18.

Across the cortex, the global sulcal descriptors were significantly heritable for all cohorts. The patterns of heritability estimates were largely coherent between the family-based and large-scale population studies. The width was the most heritable measurement, while the length was the least, showing significant heritability estimates for only sparse regions of the cortex. The heritability of sulcal length was more frequently significant when not adjusting for ICV; we find minimal differences in the overall h2 estimates for sulcal depth and width before and after covarying for ICV (Supplementary Fig. 14).

The overall meta-analyzed reliability was significantly correlated with the heritability estimates meta-analyzed across the family-based cohorts: r = 0.36 (pval = 1 × 10−7) for sulcal length, r = 0.31 (pval = 4.1 × 10−6) for mean depth, r = 0.26 (pval = 7 × 10−5) for sulcal width, and r = 0.25 (pval = 1 × 10−4) for surface area (Supplementary Fig. 15); the reliability estimates were also correlated with heritability estimates in the UK Biobank for mean depth (r = 0.43, pval = 2 × 10−3) and sulcal width (r = 0.38, pval = 4 × 10−3) (Supplementary Fig. 16).

A few bilaterally averaged sulcal regions and metrics with “poor” reliability (ICC < 0.75) showed significant heritability estimates. These included the length of the parieto-occipital fissure (F.P.O.) [ICC = 0.66, h2 = 0.18 (pval = 1 × 10−5)], the mean depth of the ascending ramus of the lateral fissure (F.C.L.r.asc.) [ICC = 0.74, h2 = 0.2 (pval = 2.2 × 10−6)], the surface area of the anterior inferior frontal sulcus (S.F.inf.ant.) [ICC = 0.65, h2 = 0.17 (pval = 4.7 × 10−6)], and the width of the calloso-marginal ramus of the lateral fissure (F.C.M.ant.) [ICC = 0.63, h2 = 0.34 (pval = 1 × 10−16)] (Supplementary Data 4 and 19). For UK Biobank, the length of S.T.pol. [ICC = 0.70, h2 = 0.14 (pval = 6 × 10−5)], the width and the surface area for the insula [ICC = 0.65, h2 = 0.14 (pval = 2.6 × 10−5)] and [ICC = 0.65, h2 = 0.16 (pval = 3.8 × 10−6)], respectively (Supplementary Data 18).

The heritability estimates for the global measures (i.e., the sum across sulci) of sulcal length, mean depth, width, and surface area (covarying for ICV, age, and sex variables) are also reported in Supplementary Fig. 17. QTIM, HCP, and GOBS showed similar trends across descriptors and hemispheres; only QTIM had generally higher heritability estimates for sulci in the right hemisphere compared to those in the left (paired t-test: pval = 1.5 × 10−10).

Thirty-three percent (36% for mega-analysis) of the total number of bilaterally averaged sulci showed significant h2 for sulcal length, 57% (59% for mega-analysis) for mean depth, 67% (65% for mega-analysis) for width, and 62% (60% for mega-analysis) for the surface area. Six sulci were significantly heritable for only one of the four descriptors (one for mega-analysis). No sulcus show significant heritability for length only. Sulci that were significantly heritable across descriptors included the intraparietal sulcus, occipital lobe, subcallosal sulcus, internal frontal sulcus, orbital sulcus, anterior inferior temporal sulcus, and the polar temporal sulcus, among others; in total 15 sulci were significantly heritable across all four descriptors in the meta-analysis, and 19 for the mega-analysis (see Supplementary Data 1920).

A significant Pearson’s correlation was identified between heritability estimates averaged across sulcal descriptors and the approximate appearance of sulci (in weeks) during development50 (Supplementary Fig. 17, r = −0.62, p = 0.0025).

Genetic correlations between sulcal shape descriptors of the left and right cortical hemispheres

Genetic correlation across the hemispheres: Averaging brain-imaging derived traits across the left and right hemispheres, as above, has been shown to reduce noise due to measurement error in large scale, multi-cohort efforts2,20,51,52. Improvements in the signal-to-noise ratio may be essential for discovering single common genetic variants that explain < 1% of the overall population variability in a trait. However, by assessing left and right separately, we may be able to discover lateralized genetic effects, if they exist.

Bivariate variance components models confirmed that the genetic correlations between the same global sulcal descriptor on the left and right hemispheres of the brain were significant (ρG 0.92 ± 0.10) (Supplementary Data 2124).

Fig. 3: Genetic correlations between sulcal shape descriptors of the left and right cortical hemispheres.

Left: the genetic correlations (ρG) between corresponding sulcal descriptors on the left and right hemispheres were assessed in three family based cohorts and meta-analyzed correlation values are mapped onto the brain. Right: the −log10 of the p value comparing the resulting genetic correlation to a perfect overlap (ρG = 1) are mapped. Significant values here suggest that the genetic components of variance may be partially unique across the left and right homologous sulcal metrics; i.e, despite a genetic correlation between hemispheres, lateralized genetic effects may be detectable. Sulci are mapped to the left hemisphere for visualization purposes.

The genetic correlation (ρG) between left and right homologous regions was computed for sulcal metrics that showed both “good” reliability estimates (ICC > 0.75) and significant univariate heritability estimates for both the left and right metrics; (Fig. 3, Supplementary Data 21–24). The genetic correlations between the left and right sulcal metrics was generally highest for the sulcal width metric. The width of the central sulcus, the inferior frontal sulcus, intermediate frontal sulcus, superior frontal sulcus, posterior lateral sulcus, the superior postcentral intraparietal superior sulcus, and the intraparietal sulcus, and the surface of the occipital lobe showed significant genetic correlations across all tested cohorts.

Two sets of p values are obtained when performing genetic correlations with bivariate variance components models: a more traditional p value comparing the correlation to the null hypothesis of no correlation (ρG = 0), and another p value comparing the genetic correlation to the hypothesis of a perfect overlap (ρG = 1), assessing the difference between the genetic correlation obtained and a correlation of 1. Bonferroni correction for the more traditional p values was conducted by correcting for the number of traits tested  so p < 0.05/[N], where N = 11 heritable sulci with ICC > 0.75 for length, +31 for mean depth +36 for width +37 for surface area, for a total of 115 traits. The second set of p values comparing the genetic correlation (ρG) between hemisphere homologs to the indistinguishable value of 1 are listed in Supplementary Data 24, and the −log10(p values) of those significantly different than 1 are mapped in Fig. 3. For these regions, the 95% confidence interval surrounding the correlation estimate did not contain 1. These sulci represent regions and descriptors that may have diverging genetic influences across hemispheres. The meta-analysis revealed evidence for lateralized genetic effects in sulcal width or surface area of the occipital lobe, the intraparietal lobe, the median frontal sulcus, the intermediate left frontal sulcus, and the collateral sulcus; no evidence for lateralization of genetic influences was detected with either sulcal length or depth metrics.

Phenotypic correlations (ρP) between the left and right indices were on average less than 0.5 in each cohort (Supplementary Fig. 18). Sulcal width showed the highest (ρP = 0.38 ± 0.15) meta-analyzed correlation between left and right homologs compared to the other sulcal descriptors (0.29 ± 0.07 for sulcal length, 0.30 ± 0.11 for mean depth, and 0.33 ± 0.12 for surface area).


Our study has four main findings: (1) many of the sulci common across individuals were reliably extracted across a variety of MRI acquisition parameters, with some sulcal shape descriptors being more reliable than others; (2) cortical folding patterns were highly heritable and sulcal shape descriptors such as sulcal width may be promising phenotypes for genetic analysis of cortical gyrification; (3) the proportion of variance attributed to additive genetic factors  varied regionally, with the earlier forming sulci having higher heritability estimates than later forming sulci; (4) incomplete pleotropy was identified between select left and right sulcal descriptors, suggesting sulcal analyses may provide insights into genetic factors underlying the lateralization of brain structure.

Cortical sulci may serve as prominent landmarks for identifying homologous functional regions across individuals35,36. BrainVISA offers the ability to automatically extract and characterize the sulci at high spatial resolutions, by segmenting and labeling 123 sulci across the cerebral cortex. Here, we analyzed four sulcal shape descriptors: length, mean depth, width, and surface area. Sulcal length has been associated with neurodevelopmental processes29,53,54, while sulcal depth and width have been correlated with aging and neurodegenerative processes33,55,56,57. Sulcal surface area represents a combination of depth, width and length features.

A primary goal of this work was to identify the sulci and the corresponding shape metrics that may be reliably extracted irrespective of the specific MRI scanner or  scan acquisition protocol, to ensure a globally viable trait for disease related biomarker and genetic association analyses. Poor reliability may be attributable to measurement errors, which could lead to a ceiling effect on heritability estimates. This is because highly heritable traits can only be detected if the traits are robustly measured58 and low reliability could lead to an underestimation of the true heritability59. While heritability is a population-specific estimate, one main goal of imaging genetics consortia such as ENIGMA1 and CHARGE23 is to identify genetic variants that affect brain structure and function in populations around the world. Therefore, it is of utmost importance to ensure that measures are reliably extracted across different data sets, and furthermore, are heritable  across different populations. Even beyond imaging genetics, the reliability of the measurements and the reproducibility of any set of results are essential for reproducible science at large.

Here we identified the most reliable sulcal regions using test-retest (TRT) data from four cohorts with independent samples and different scanning protocols to ensure the robustness of results. We assessed bias, a subject-based index of consistency49 as well as ICC, which compares the within-subject variance to the between-subjects variance. ICC may be affected by the homogeneity of the population under study; when variability in the population is low, for example, if age range is limited, then lower ICC values may be expected, while bias would be unaffected. Our results show high consistency between test and retest (“bias” < 0.149 on average). Furthermore, in considering the number of sulci that had ICC estimates  greater than 0.9, sulcal width was the most reliable metric among the descriptors analyzed. Although some visual quality control was conducted on individual sulcal extractions, we did not ensure the anatomical validity of the entire set of sulcal labels for each of the individual MRI scans used in this study.  Our reliability results are therefore more a reflection of methodological consistency, rather than anatomical accuracy.

A study examining the relationship between reproducibility and heritability of different brain structures in the QTIM cohort60 found a correlation between ICC and heritability, with a large percentage of traits showing low reliability (ICC < 0.75)60. Here we showed that most of the reliable sulcal shape descriptors were also highly heritable. This trend might be due to the lower variance across subjects for more robust anatomical regions, such as the central sulcus, which are easier to identify with automated image processing pipelines and less prone to segmentation errors. However, even in regions with “excellent” reliability (ICC > 0.9), we identified a range of heritability estimates, suggesting  that not all reliable traits are necessarily highly heritable59.

Many earlier works have focused exclusively on the central sulcus37,40,61. We have replicated findings of significant heritability in the central sulcus and further, showed that it is indeed the sulcus  with the highest heritability estimate across the entire cortex. However, out of 61 total bilateral cortical sulci, it is only one of 34 that showed significant heritability estimates across all four shape descriptors.

Our results indicated significant heritability estimates for sulcal surface area and width in several medial frontal regions, partially confirming findings in ref. 40. Our results also confirmed prior findings of sulcal heritability in the temporal lobe62 and the corpus callosum area63 and are also in line with studies showing high estimated heritability in prefrontal and temporal lobes for cortical thickness and surface area64,65,66,67,68,69,70, especially for sulcal mean depth and width.

The sulcal descriptors identified as being heritable in this work may serve as phenotypes for large-scale genome-wide association studies, or GWAS, enhancing  our ability to identify specific genomic variants that influence brain structure and disease risk. These reliable and heritable sulcal measures may also serve as biomarkers for understanding genetically mediated brain disorders. The significant correlation identified between heritability estimates averaged across sulcal descriptors and the appearance of sulci (in weeks) during development50 implies that sulci appearing early in brain development71,72, including the central sulcus, Sylvian fissure, parieto-occipital lobes, and superior temporal sulcus50 may be under stronger genetic control. However, some regions including the frontal lobe and the temporal sulcus also had high heritability, even though these regions are reported to develop later72, suggesting more work is needed to identify the developmental role in the regional genetic architecture.

Across three independent family-based cohorts, QTIM—an Australian cohort of young adult twins and siblings—HCP, a North American cohort of twins and siblings, and GOBS—a Mexican-American cohort of extended pedigrees, we found similar patterns of heritability for four descriptors of sulcal morphometry. Globally, we found sulcal heritability estimates of ~0.3–0.4, similar to estimates in other species, including Papio baboons40. Heritability estimates from GOBS were lower than for QTIM or HCP, as may be expected for an extended pedigree design when compared to twin designs73. It has also been proposed that higher image quality, and therefore lower measurement error, could lead to higher heritability estimates74. GOBS and HCP MRI volumes were acquired with a 3T scanner and HCP has higher spatial resolution compared to GOBS. QTIM was acquired with slightly lower spatial resolution but at higher magnetic field strength (4T). Further analyses will be needed to investigate how the signal-to-noise estimates (SNR) vary across cohorts and how this affects heritability estimation74. SNP-based heritability estimated in the UK Biobank showed a similar h2 pattern (Supplementary Fig. 20) across the brain, but with lower h2 values compared to the family-based cohorts. This may be partially due to the “missing heritability” effect in the SNP-based heritability estimation75. We note that recent work has found less discrepancy between twin-based heritability estimates and those derived from large-scale population studies of approximately 20,000 individuals76, therefore, larger population samples may be required to better power our SNP-based heritability estimates and help determine the true extent of the missing heritability.

Apart from work by the ENIGMA Laterality group77, many published ENIGMA studies2,20,43,78 performed analyses on pooled bilateral measures of brain structure, averaging data from the left and right hemispheres. We indeed found that for most sulcal descriptors, averaging the sulcal measures across hemispheres provided more regions with reliable estimates, and more consistent heritability estimates across cohorts. The bivariate genetic analysis used to estimate the genetic correlation between left and right sulcal measures, further confirmed strong and significant genetic correlations between hemispheres.

A genetic correlation between measures across the right and left hemispheres indicates pleiotropy, suggesting that genetic influences underlying the structure and variability in the measures tends to overlap. In family-based studies, bivariate variance components analysis may be used to determine the genetic correlation between traits as in this work. When a significant genetic correlation is identified, the confidence interval around the genetic correlation often includes one, suggesting the underlying genetic influences of the measures were not statistically distinguished from each other. Incomplete pleiotropy is suggested when genetic correlations are significant, but the confidence intervals around the correlations do not include one. While in SNP-based genetic correlation models, incomplete pleiotropy may be suggested over complete pleiotropy in the presence of measurement error, in a bivariate polygenic model, measurement error falls into the environmental component of variance and the environmental correlation, and therefore, does not influence the maximum-likelihood estimate of the genetic correlation; i.e, measurement error makes it more difficult to reject the null hypothesis that the genetic correlation is one. Features that exhibit unique genetic influences in one hemisphere may reveal insights into the biological causes of brain lateralization that may play an important role in neurodevelopmental or psychiatric disorders. Evidence of less genetic control in the left hemisphere has been found in refs. 50,62, where the authors found higher cortical gyrification complexity in the right hemisphere at an early development stage.

Here we found incomplete pleiotropy, or suggested asymmetrical genetic influences, in the frontal lobe (width). This may relate to disorder-specific abnormalities seen in brain folding patterns, for example, as reported in a postmortem study on schizophrenia79. Incomplete pleiotropy was also detected in sulci of the occipital lobe, a highly polygenic region80; structural abnormalities in this region have been associated with Parkinson’s disease81,82, posterior cortical atrophy, a disorder causing visual dysfunction, and logopenic aphasia83.

Some regions that showed this suggested lateralization of genetic effects for sulcal descriptors, showed the same effect for other measures extracted from the cortex; for example, the effect seen with the sulcal surface area of the collateral fissure was also detected with the corresponding gyral  surface area (Supplementary Fig. 21). However, for the occipital lobe, we found evidence for lateralization of genetic effects with sulcal width, but not with either cortical thickness or surface area of corresponding gyri. This suggests that sulcal descriptors may offer additional insights into cortical development and lateralization, beyond more commonly analyzed metrics of gyral morphometry. As a larger than expected portion of our study population was right-handed, our findings may be biased towards right handed individuals and may not be fully representive; the degree of cerebral volume asymmetry has been shown to be lower for non-right-handed twins than right-handed pairs62 and future investigations focusing on the genetics of brain gyrification and lateralization across handedness are needed to confirm these findings.

The genetic influences on brain cortical structure are regionally dependent, and differ according to the metric, or descriptor, being evaluated. For example, the genetic correlation between average cortical thickness and total surface area has been shown to be weak and negative, with largely different genetic compositions32. Different metrics are often used to describe and quantify different biological processes such as those such as length and surface area with potentially more developmental orgins, and others including sulcal width that may capture more degenerative processes. In nonhuman primates, brain cortical folding was also found to be influenced by genetic factors largely independent of those underlying brain volume84,85. Measuring cortical folding through sulcal-based morphometry could therefore highlight brain metrics beyond thickness and surface area, and may complement these more traditional measures to reveal a deeper understanding of the processes underlying variation in human brain structure, its association with disease and the underlying genetic risk factors. Our findings suggest that conducting a GWAS of sulcal features may be particularly informative for the sulcal width—the most heritable of the four tested metrics. Although for most sulci, the genetic components of variances were largely indistinguishable (i.e., highly correlated) across the two hemispheres, our results suggest that conducting a separate GWAS of sulcal measures in select frontal, temporal, and occipital regions may provide added insight into the biological mechanisms that drive hemispheric specialization. The discovery and replication of specific genetic influences on brain structure require very highly powered analyses, achievable through large-scale studies and collaboration. Harmonized imaging and genetic analysis protocols, rigorous quality assurance, reproducibility assessments, along with statistical rigor are vital in the collaborative endeavors such as those proposed by the ENIGMA consortium.  To allow for a variety of such international collaborations, the customized MRI image processing protocol using and extending the BrainVISA toolkit as in this work, has been made freely available at:


Participants and MRI imaging

Queensland Twin Imaging study (QTIM): Brain MRI from 1008 right-handed participants86, 370 females and 638 males, were used in this study. This included 376 dizygotic (DZ) and 528 monozygotic (MZ) twins (one set of DZ triplets) and 104 siblings, with an average age of 22.7 ± 2.7 years [range: 18–30]. T1-weighted images were acquired on a 4T Bruker Medspec scanner with an inversion recovery rapid gradient echo sequence. Acquisition parameters were inversion/repetition/echo time (TI/TR/TE) = 700/1500/3.35 ms; flip angle = 8°; with an acquisition matrix of 256 × 256; voxel size = 0.94 × 0.90 × 0.94 mm3.

Human Connectome Project (HCP): 816 participants87, 362 females and 454 males, average age 29.1 ± 3.5 years [range: 22–36]. These included 412 siblings, 205 DZ and 199 MZ twins, including triplets. T1-weighted images were acquired using a 3T Siemens scanner. MRI parameters: (TI/TR/TE) = 1000/2400/2.14 ms; flip angle = 8°; voxel size = 0.7 mm isotropic; acquisition matrix = 224 × 224. The subset of TRT scans includes all right-handed subjects.

Genetics of Brain Structure and Function (GOBS): A total of 1205 individuals of Mexican-American ancestry from extended pedigrees (71 families, average size 14.9 [1–87] people) were included in the analysis. Sixty-four percent of the participants were female and ranged in age from 18 to 97 (mean ± SD: 47.1 ± 14.2) years. Individuals in this cohort have actively participated in research for over 18 years and were randomly selected from the community with the constraints that they are of Mexican-American ancestry, part of a large family, and live within the San Antonio, Texas region. Imaging data were acquired at the UTHSCSA Research Imaging Center on a Siemens 3T Trio scanner (Siemens, Erlangen, Germany). Isotropic (800 µm) 3D Turbo-flash T1-weighted images were acquired with the following parameters: TE/TR/TI = 3.04/2100/785 ms, flip angle = 13°. Seven images were acquired consecutively using this protocol for each subject and the images were then co-registered and averaged to increase the signal-to-noise ratio and reduce motion artifacts88.

UK Biobank: Analyses were conducted on the 2017 imputed genotypes restricted to variants present in the Haplotype Reference Consortium89,90. UK Biobank bulk imaging data were made available under application #11559 in July 2017. We analyzed 10,083 participants (4807 females), mean age = 62.4 ± 7.3 years [range: 45–79]. Voxel matrix: 1.0 × 1.0 × 1.0 mm—acquisition matrix: 208 × 256 × 256. 3D MP-RAGE, TI/TR = 880/2000 ms, sagittal orientation, in-plane acceleration factor = 2. Raw MRI data were processed using the ENIGMA FreeSurfer and sulcal analysis protocols. Following processing, all images were visually inspected for quality control of FreeSurfer gray/white matter classifications. For all subjects, the central sulcus segmented and labeled by BrainVISA was visually assessed for labeling accuracy.

KKI (Kennedy Krieger Institute—Multi-Modal MRI Reproducibility Resource): 21 healthy volunteers with no history of neurological conditions (10 F, 22–61 years old) were recruited. All data were acquired using a 3T MRI scanner (Achieva, Philips Healthcare, Best, The Netherlands) with body coil excitation and an eight-channel phased array SENSitivity Encoding (SENSE) head-coil for reception. All scans were completed during a 2-week interval. The resulting data set consisted of 42 “1-h” sessions of 21 individuals. MP-RAGE T1-weighted scans were acquired with a 3D inversion recovery sequence: (TR/TE/TI = 6.7/3.1/842 ms) with a 1.0 × 1.0 × 1.2 mm3 resolution over a field of view of 240 × 204 × 256 mm acquired in the sagittal plane. The SENSE acceleration factor was 2 in the right–left direction. Multi-shot fast gradient echo (TFE factor = 240) was used with a 3-s shot interval and the turbo direction being in the slice direction (right–left). The flip angle was 8°. No fat saturation was employed91,

OASIS: This TRT reliability data set contains 20 right-handed subjects (19–34 years old) without dementia imaged on a subsequent visit within 90 days of their initial session. MP-RAGE T1-weighted scans were acquired on a 1.5-T Vision scanner (Siemens, Erlangen, Germany): (TR/TE/TI = 9.7/4.0/20 ms) with an in-plane resolution of 1.0 × 1.0 × mm2 resolution over a FOV of 256 × 256 mm acquired in the sagittal plane. Thickness/gap = 1.25/0 mm; flip angle = 10° (

MRI image processing and sulcal extraction. Anatomical images (T1-weighted) were corrected for intensity inhomogeneities and segmented into gray and white matter tissues using FreeSurfer (; segmentations and regional labels were quality controlled using ENIGMA protocols for outlier detection and visual inspection ( BrainVISA ( was run for sulcal extraction, identification, and sulcal-based morphometry. Morphologist 2015, an image processing pipeline included in BrainVISA, was used to quantify sulcal parameters. Briefly, the Morphologist 2015 segmentation pipeline computes left and right hemisphere masks, performs gray and white matter classification, reconstructs a gray/white surface and a spherical triangulation of the external cortical surface, independently for both hemispheres. Sulcal labeling has been performed using BrainVISA suite which implements the algorithm fully described in the cited paper by Perrot et al.93. It relies on a probabilistic atlas of sulci. The sulcal recognition is made by combining localization and shape information. The atlas is described in detail and freely accessible here: and can be visualized online here:

To improve sulcal extraction and build on current protocols used by hundreds of collaborators within ENIGMA, quality controlled FreeSurfer outputs (orig.mgz, ribbon.mgz, and were directly imported into the pipeline to avoid recomputing several steps, including intensity inhomogeneity correction and gray/white matter classification. Sulci were then automatically labeled according to a predefined anatomical nomenclature of 62 sulcal labels for the left hemisphere and 61 sulcal labels for the right hemisphere94,95. The protocol developed for this work is available at (ENIGMA-Sulci).

Sulci descriptors and quality control. Analyzing the shape of the cortex through sulcal-based morphometry allows us to quantify the geometry of a sulcus in terms of several distinct and complementary descriptors, consisting of length, mean depth, surface area, and width (or fold opening) of all extracted and labeled sulci. Cortical thickness and surface area have both been found to be moderately to highly heritable, yet with largely independent and even negatively correlated genetic influences7,80,96. Cortical thickness, surface area, and folding tend to exhibit different age-related trajectories97,98. In particular, cortical thickness represents the laminar organization of the cerebral cortex, which contains about 14 billion neurons99. Each of the layers forming the cortex100 has a different cellular organization, mostly distinguished on the basis of pyramidal cells in the various laminae100. Surface area may reflect the number of radial columns perpendicular to the pial surface98 and sulcal morphometry may additionally relate to the microstructure of the neuronal sheets and to the local axonal connectivity within a cortical region, which may influence the degree of folding84.

The length of a sulcus is measured in millimeters as the geodesic length of the junction between a sulcus and the hull of the brain. The mean depth corresponds to the average of the depth across all the vertices along the bottom of a sulcus (the depth of a vertex located at the bottom of a sulcus is defined as the geodesic distance along the sulcus to the brain hull). The surface area is the total area of the sulcal surface. The enclosed cerebrospinal fluid (CSF) volume divided by the sulcal surface area gives the width, a gross approximation of the average width of the CSF in the fold61 (see Supplementary Fig. 32 for a representation of sulcal shape descriptors).

To further quality control the extracted sulcal measures and identify subjects whose sulci were not optimally identified, we consider as outliers those subjects showing abnormal values for at least one of the descriptors for each sulcus. That is, for a given sulcus, the z-score across subjects is computed for each descriptor. The set of subjects showing an absolute z-score > 2.5 for one or more descriptors was discarded from further analysis101. Therefore, if the length of the central sulcus for a given subject was an outlier but width, depth, and surface area were not, that subject’s central sulcus was removed from further evaluation; this ensured that the same set of subjects were used for all analyses across descriptors. This led to discarding 3% of subjects for each sulcus.

Statistics and reproducibility

Univariate and bivariate quantitative genetic analyses: The relative influences of genetic and environmental factors on human traits can be estimated by modeling the known genetic relationship between individuals and relating it to observed covariance in measured traits; in twin studies, MZ twin pairs—who typically share all their common genetic variants—are compared to DZ twin pairs, who share, on average, 50%. The same principle can be used for extended pedigrees, in which many individuals have varying degrees of relatedness. Here, we used both twins and extended pedigrees to estimate the heritability of these in-depth cortical sulcal measures. For a given cohort of participants, the narrow-sense heritability (h2) is defined as the proportion of the observed variance in a trait (σ2p) that can be attributed to additive genetic factors (σ2g):

$$h^2 = {{\sigma _g^2} \over {\sigma _p^2}}.$$

Variance components methods, implemented in the Sequential Oligogenic Linkage Analysis Routines (SOLAR) software package102, were used for all genetic analyses. Heritability (h2) is the proportion of total phenotypic variance accounted for by additive genetic factors and is assessed by contrasting the observed phenotypic covariance matrix with the covariance matrix predicted by kinship. High heritability indicates that the covariance of a trait is greater among more closely related (genetically similar) individuals; here, for example, MZ twins as compared to DZ twins and siblings. Using SOLAR-ECLIPSE imaging genetics tools (, we investigated the heritability profile of four sulcal descriptors for sulci across the whole brain: 62 on the left and 61 on the right hemisphere.

Prior to testing for the significance of heritability, sulcal descriptor values for each individual are adjusted for a series of covariates. We estimated the influence of specific variables (additive genetic variation and covariates including intracranial volume, sex, age, age2, age × sex interaction, age2 × sex interaction) to calculate the sulcal trait heritability and its significance (p value) for accounting for a component of each trait’s variance within this population.

The significance threshold for heritability analysis of individual sulci was set to be p ≤ (0.05/m*4), where m = 61 (number of bilateral sulci), and the times 4 corresponding to the number of shape descriptors assessed. We set m = 123 when left and right sulcal heritability were estimated separately. This reduced the probability of Type 1 errors associated with multiple measurements.

For bivariate genetic correlation estimates, classical quantitative genetic models were used to partition the phenotypic correlation (ρP) between the left and the corresponding right sulcal measures into the genetic (ρG), and a unique environmental (ρE) components, for each pair of traits. Just as with the univariate model, the bivariate phenotype of an individual is modeled as a linear function of kinship coefficients that express relatedness among all individuals within the cohort (MZ twins share all their additive genetic information and DZ twins and siblings share on average 50%). The significance of ρG and ρE was estimated from the likelihood ratio test when comparing the model to ones where the correlation components are constrained to be 0102,103,104. This estimates ρG and ρE and their standard error (SE). The significance of these coefficients is determined by a z-test of their difference from 0. If ρG differs significantly from 0, then a significant proportion of the traits’ covariance is influenced by shared genetic factors.

In this case, we tested another model where the genetic correlation factor ρG is fixed to 1. Fixing ρG to 1 suggests that the additive genetic components comprising the two traits overlap completely, and there is no detectable unique genetic composition for the individual traits. Once again, the log-likelihood of this model is compared to one where the parameters are freely optimized. If ρG is not found to significantly differ from 1, then we cannot reject the hypothesis that both heritable traits are driven by the same set of genetic factors. If ρG is significantly different from 0 and significantly different from 1, then the traits share a significant portion of their variance, however, each is also likely to be partially driven by a unique set of genetic factors.

Some considerations should be made regarding the measurement error of the traits analyzed here: ρG is the correlation between the latent genetic effects on the two traits irrespective of the proportion of phenotypic variance these latent effects explain (i.e., heritability). Measurement error, which is uncorrelated between individuals regardless of their relatedness, falls into the environmental component and environmental correlations. Measurement error therefore influences h2, ρE, ρP, but not ρG.

In practice, measurement error does make ρG harder to estimate, because low heritability means that the underlying genetic effects cannot be estimated with precision. This causes the SE of the ρG estimate to increase, but critically, does not change its maximum-likelihood estimate systematically. So measurement error makes it harder to reject the null hypothesis that ρG = 1.

Moreover, the bivariate polygenic model used here to estimate the left–right genetic correlation is a linear function of laterality (LR). Indeed, the genetic variance of LR is:

$$\sigma _g^2\left( L \right) + \sigma _g^2\left( R \right) - 2 \times \rho _g \times \sqrt {\sigma _g^2\left( L \right) \times \sigma _g^2\left( R \right)},$$

where \(\sigma _g^2\left( L \right)\) and \(\sigma _g^2\left( R \right)\) are the genetic variance for the left and right traits. The phenotypic variance is similarly defined so that the heritability of LR can be obtained. But if LR shows significant heritability, it could be because: (1) genetic overlap is incomplete and/or (2) L and R have unequal genetic variances. So studying laterality is not recommended here because (1) and (2) are confounded.

Meta-analysis of additive genetic variance

Meta-analysis calculates weighted mean heritability (h2) and SE estimates based on measurements from individual cohorts42,43. We weighted the heritability estimate from each cohort by the heritability SE, as extracted from the variance component model of SOLAR. The heritability weighted by SE42,43 is:

$$h_{{\rm{MA - SE}}}^2\left( S \right) = \frac{{\mathop {\sum }\nolimits_j se_j^{ - 2} \times h_j^2(S)}}{{\mathop {\sum }\nolimits_j se_j^{ - 2}}},$$

where S = 1 to Ns indexes the sulci and j = 1,2,3 indexes the cohorts.

Mega-analysis of additive genetic variance

While meta-analyses compute first the heritability independently for each cohort and then combine the results, mega-analyses combine first different cohorts and then run a single computation for heritability evaluation. We use a program (polyclass), developed for SOLAR105 for mega-analysis of heritability on sulci descriptors43,106. This function fits the model after combining the pedigrees of QTIM, HCP, and GOBS into a single pedigree (for more details see refs. 42,43).

Meta-analysis of genetic correlation

A meta-analysis of genetic correlation is calculated weighting the genetic correlation computed for each cohort by its sample size:

$$\rho _{G - {\rm{MA}}}\left( S \right) = \frac{{\mathop {\sum }\nolimits_j \rho _{G_j}^2(S) \times N_{\rm{sub}}}}{{\mathop {\sum }\nolimits_j N_{{\rm{sub}}(j)}}},$$

where S = 1 to Ns indexes the sulci, j = 1, 2, 3 indexes the cohorts, and \(N_{{\rm{sub}}(j)}\) is the sample size of cohort j.

To combine p values in a meta-analysis, we used the Edgington’s method that represents a compromise between methods more sensitive to largest p values (e.g., Pearson’s method) and methods more sensitive to smallest p values (e.g., Fisher’s method)107,108:

$${\rm{Meta}}\,p {\hbox{-}} {\rm{value}} = \frac{{S^k}}{{k!}} - \left( {k - 1} \right)C1\frac{{\left( {S - 1} \right)^k}}{{k!}} + \left( {k - 2} \right)C2\frac{{\left( {S - 2} \right)^k}}{{k!}},$$

where S is the sum of o values and k the number of tests (i.e., k = 3 cohorts in our study). The corrective additional terms are used if the number subtracted from S in the numerator is <S. All the p values in the meta-analyses estimated were computed using this method.

SNP-based heritability analysis

We used genome-wide complex trait analysis (GCTA)109 to estimate the heritability from the individual genotypes. Genotypes on the autosomal chromosomes were used to calculate the genetic relationship matrix with GCTA109. Heritability was calculated using a linear mixed model, with age, sex, ICV, and the first four genetic components from multidimensional scaling analysis as fixed covariates. We also covaried for the presence of any diagnosed neurological or psychiatric disorder. In our analysis, we excluded participants with non-European ancestry, missing genotypes, or phenotypes, and mismatched sex information.

Reliability analysis

Sulcal measurement reliability: To evaluate the reliability of the sulcal shape descriptors, we analyzed their variability, or reproducibility error, across the TRT sessions for each of the four TRT cohorts. For each MRI scan there are several sources of variability, including variability from hydration status, variability due to slightly different acquisitions in the two sessions (head position change in the scanner, motion artifacts, scanner instability, etc.), and finally variability due to the imaging processing methods themselves.

There could also be variability in the reliability estimates depending on the type of MRI system used (vendor, model, and acquisition parameters), so it is important to address the issue of reliability across a variety of platforms. We used two indices of reliability: (1) the dimensionless measure of absolute percent bias of descriptor, b (sulcal length, mean depth, width, and surface area) of a sulcus with respect to its average and (2) the ICC coefficient. b is computed as follows:

$$b = 100 \times \frac{{{\rm{test - retest}}}}{{\left( {{\rm{test + retest}}} \right)/2}}.$$

The estimation of the means is more robust than the estimation of the variance from the signed differences, in particular for smaller sets of subjects. The distributions of sulcal measurement differences plotted the mean across sessions were examined with a Bland–Altman analysis110. These plots show the spread of data, the bias (i.e., mean difference), and the limits of agreement (±1.96 SD), and were used to confirm that the distributions were approximately symmetric around 0 and to check for possible outliers. While the ICC estimates the relation between within-subject variance and between-subjects variance, b offers a subject-based index that might be used to find outliers. If scan and rescan are perfectly reliable, b should be equal to 0. The cases where b is >0.1, as in ref. 49, are considered unreliable.

The ICC coefficient was computed to quantify the reproducibility for sulcal-based measurements. ICC is defined as follows:

$${\rm{ICC}} = \frac{{\sigma _{\rm{BS}}^2}}{{\sigma _{\rm{BS}}^2 + \sigma _{\rm{WS}}^2}},$$

providing an adequate relation of within-subject (σ2WS) and between-subject (σ2BS) variability111,112,113.

The ICC estimates the proportion of total variance that is accounted for by the σ2BS. Values below 0.4 are typically classified as “poor” reproducibility, between 0.4 and 0.75 as “fair to good,” and higher values as “excellent” reproducibility48.

Equation (5) was used to estimate the ICC for each sulcal descriptor, independently for each cohort. The four cohorts were then combined into a meta-analysis (ICCMA−SE), similar to Eq. (1), in order to account for intra-site variability end to better estimate the sulcal reliability:

$${{\rm{ICC}}_{{\rm{MA}} - {\rm{SE}}}}(S) = \frac{{\mathop {\sum }\nolimits_j se_j^{ - 2} \times {\rm{ICC}}(S)}}{{\mathop {\sum }\nolimits_j se_j^{ - 2}}},$$

where j = 1, 2, 3, 4 indexes the cohorts. The SE was computed like SE = ICC/Z, where Z is obtained from a normal distribution knowing the p value. \({{\rm{ICC}}_{{\rm{MA}} - {\rm{SE}}}}\) was computed only if the cohort-based ICC computed with Eq. (5) was estimated for at least 3/4 cohorts.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

OASIS: the OASIS data are distributed to the greater scientific community under the Creative Commons Attribution 4.0 license. All data are available via www.oasis-brains.org92. KKI (Kennedy Krieger Institute—Multimodal MRI Reproducibility Resource): open access: QTIM: data from the QTIM cohort used in this paper can be applied for by contacting M.J.W. ( Access to data by qualified investigators are subject to scientific and ethical review. Summary results from cohort QTIM are available as part of the supplementary data52. HCP: family status and other potentially sensitive information are part of the Restricted Data that is available only to qualified investigators after signing the Restricted Data Use Terms. Open access data (all imaging data and most of the behavioral data) are available to those who register and agree to the Open Access Data Use Terms. Restricted data elements that could be potentially used to identify subjects include family structure (twin or non-twin status and number of siblings); birth order; age by year; handedness; ethnicity and race; body height, weight, and BMI; and a number of other categories. Each qualified investigator wanting to use restricted data must apply for access and agree to the Restricted Data Use Terms ( GOBS: data from the GOBS cohort used in this paper can be applied for by contacting D.C.G. ( or J. Blangero ( Access to data by qualified investigators are subject to scientific and ethical review and must comply with the European Union General Data Protection Regulations (GDPR)/all relevant guidelines. The completion of a material transfer agreement (MTA) signed by an institutional official will be required. Summary results from cohort GOBS are available as part of the supplementary data. UK Biobank: access to data from the UK Biobank can be obtained by approved scientists through application with UK Biobank (

Code availability

The image processing protocol developed for this work is available at (ENIGMA-Sulci).


  1. 1.

    Thompson, P. M. et al. ENIGMA and Global Neuroscience: a decade of large-scale studies of the brain in health and disease across more than 40 countries. Transl. Psychiatry 10, 100 (2020).

    PubMed  PubMed Central  Article  Google Scholar 

  2. 2.

    Hibar, D. P. et al. Common genetic variants influence human subcortical brain structures. Nature 520, 224–229 (2015).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  3. 3.

    Lee, P. H. et al. Partitioning heritability analysis reveals a shared genetic basis of brain anatomy and schizophrenia. Mol. Psychiatry 22, 1224 (2017).

    CAS  PubMed  Article  Google Scholar 

  4. 4.

    Chauhan, G. et al. Association of Alzheimer’s disease GWAS loci with MRI markers of brain aging. Neurobiol. Aging 36, 1765.e7–1765.e16 (2015).

    CAS  Article  Google Scholar 

  5. 5.

    Kanai, R. & Rees, G. The structural basis of inter-individual differences in human behaviour and cognition. Nat. Rev. Neurosci. 12, 231–242 (2011).

    CAS  PubMed  Article  Google Scholar 

  6. 6.

    Schmaal, L. et al. Cortical abnormalities in adults and adolescents with major depression based on brain scans from 20 cohorts worldwide in the ENIGMA major depressive disorder working group. Mol. Psychiatry 22, 900–909 (2017).

    CAS  PubMed  Article  Google Scholar 

  7. 7.

    Winkler, A. M. et al. Cortical thickness or grey matter volume? The importance of selecting the phenotype for imaging genetics studies. NeuroImage 53, 1135–1146 (2010).

    PubMed  Article  Google Scholar 

  8. 8.

    Kochunov, P. et al. Mapping primary gyrogenesis during fetal development in primate brains: high-resolution in utero structural MRI of fetal brain development in pregnant baboons. Front. Neurosci. 4, 20 (2010).

    PubMed  PubMed Central  Google Scholar 

  9. 9.

    Tallinen, T. et al. On the growth and form of cortical convolutions. Nat. Phys. 12, 588–593 (2016).

    CAS  Article  Google Scholar 

  10. 10.

    Borrell, V. How cells fold the cerebral cortex. J. Neurosci. 38, 776–783 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  11. 11.

    Kroenke, C. D. & Bayly, P. V. How forces fold the cerebral cortex. J. Neurosci. 38, 767–775 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  12. 12.

    Ono, M., Kubik, S. & Abernathey, C. D. Atlas of the Cerebral Sulci (Thieme, 1990).

  13. 13.

    Brodmann, K. Brodmann’s: Localisation in the Cerebral Cortex (Springer Science & Business Media, 2007).

  14. 14.

    Talairach, J. & Tournoux, P. Co-planar Stereotaxic Atlas of the Human Brain: 3-Dimensional Proportional System: An Approach to Cerebral Imaging. (George Thieme Verlag, 1988).

  15. 15.

    Amiez, C. et al. Sulcal organization in the medial frontal cortex provides insights into primate brain evolution. Nat. Commun. 10, 3437 (2019).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  16. 16.

    Ronan, L. & Fletcher, P. C. From genes to folds: a review of cortical gyrification theory. Brain Struct. Funct. 220, 2475–2483 (2015).

    CAS  PubMed  Article  Google Scholar 

  17. 17.

    Zilles, K., Palomero-Gallagher, N. & Amunts, K. Development of cortical folding during evolution and ontogeny. Trends Neurosci. 36, 275–284 (2013).

    CAS  PubMed  Article  Google Scholar 

  18. 18.

    Sønderby, I. et al. 188. ENIGMA-CNV: unraveling the effects of rare copy number variants on brain structure. Biol. Psychiatry 85, S78 (2019).

    Article  Google Scholar 

  19. 19.

    Adams, H. H. H. et al. Novel genetic loci underlying human intracranial volume identified through genome-wide association. Nat. Neurosci. 19, 1569–1582 (2016).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  20. 20.

    Stein, J. L. et al. Identification of common variants associated with human hippocampal and intracranial volumes. Nat. Genet. 44, 552–561 (2012).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  21. 21.

    Hibar, D. P. et al. Novel genetic loci associated with hippocampal volume. Nat. Commun. 8, 13624 (2017).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  22. 22.

    Elliott, L. T. et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature 562, 210–216 (2018).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  23. 23.

    Psaty, B. M. et al. Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium: design of prospective meta-analyses of genome-wide association studies from 5 cohorts. Circ. Cardiovasc. Genet. 2, 73–80 (2009).

    PubMed  PubMed Central  Article  Google Scholar 

  24. 24.

    Alemán-Gómez, Y. et al. The human cerebral cortex flattens during adolescence. J. Neurosci. 33, 15004–15010 (2013).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  25. 25.

    Im, K. et al. Sulcal morphology changes and their relationship with cortical thickness and gyral white matter volume in mild cognitive impairment and Alzheimer’s disease. Neuroimage 43, 103–113 (2008).

    PubMed  Article  Google Scholar 

  26. 26.

    Cachia, A. et al. Cortical folding abnormalities in schizophrenia patients with resistant auditory hallucinations. Neuroimage 39, 927–935 (2008).

    PubMed  Article  Google Scholar 

  27. 27.

    Penttilä, J. et al. Global and temporal cortical folding in patients with early-onset schizophrenia. J. Am. Acad. Child Adolesc. Psychiatry 47, 1125–1132 (2008).

    PubMed  Article  Google Scholar 

  28. 28.

    Penttilä, J. et al. Cortical folding in patients with bipolar disorder or unipolar depression. J. Psychiatry Neurosci. 34, 127–135 (2009).

    PubMed  PubMed Central  Google Scholar 

  29. 29.

    Auzias, G. et al. Atypical sulcal anatomy in young children with autism spectrum disorder. Neuroimage Clin. 4, 593–603 (2014).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  30. 30.

    Kippenhan, J. S. et al. Genetic contributions to human gyrification: sulcal morphometry in Williams syndrome. J. Neurosci. 25, 7840–7846 (2005).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  31. 31.

    Thompson, P. M. et al. Abnormal cortical complexity and thickness profiles mapped in Williams syndrome. J. Neurosci. 25, 4146–4158 (2005).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  32. 32.

    Panizzon, M. S. et al. Distinct genetic influences on cortical surface area and cortical thickness. Cereb. Cortex 19, 2728–2735 (2009).

    PubMed  PubMed Central  Article  Google Scholar 

  33. 33.

    Kochunov, P. et al. Age-related morphology trends of cortical sulci. Hum. Brain Mapp. 26, 210–220 (2005).

    PubMed  PubMed Central  Article  Google Scholar 

  34. 34.

    Sun, Z. Y. et al. The effect of handedness on the shape of the central sulcus. Neuroimage 60, 332–339 (2012).

    PubMed  Article  Google Scholar 

  35. 35.

    Pizzagalli, F., Auzias, G., Delon-Martin, C. & Dojat, M. Local landmark alignment for high-resolution fMRI group studies: toward a fine cortical investigation of hand movements in human. J. Neurosci. Methods 218, 83–95 (2013).

    CAS  PubMed  Article  Google Scholar 

  36. 36.

    Coulon, O. et al. Two new stable anatomical landmarks on the central sulcus: definition, automatic detection, and their relationship with primary motor functions of the hand. In Conf. Proc. IEEE Engineering in Medicine and Biology Society 7795–7798 (IEEE, Boston, MA, 2011).

  37. 37.

    McKay, D. R. et al. Sulcal depth-position profile is a genetically mediated neuroscientific trait: description and characterization in the central sulcus. J. Neurosci. 33, 15618–15625 (2013).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  38. 38.

    Le Guen, Y. et al. Genetic influence on the sulcal pits: on the origin of the first cortical folds. Cereb. Cortex 28, 1922–1933 (2018).

    PubMed  Article  Google Scholar 

  39. 39.

    Le Guen, Y. et al. eQTL of KCNK2 regionally influences the brain sulcal widening: evidence from 15,597 UK Biobank participants with neuroimaging data. Brain Struct. Funct. 224, 847–857 (2019).

    PubMed  Article  CAS  Google Scholar 

  40. 40.

    Kochunov, P. et al. Genetics of primary cerebral gyrification: heritability of length, depth and area of primary sulci in an extended pedigree of Papio baboons. Neuroimage 53, 1126–1134 (2010).

    CAS  PubMed  Article  Google Scholar 

  41. 41.

    Lohmann, G., von Cramon, D. Y. & Steinmetz, H. Sulcal variability of twins. Cereb. Cortex 9, 754–763 (1999).

    CAS  PubMed  Article  Google Scholar 

  42. 42.

    Jahanshad, N. et al. Multi-site genetic analysis of diffusion images and voxelwise heritability analysis: a pilot project of the ENIGMA–DTI working group. NeuroImage 81, 455–469 (2013).

    PubMed  PubMed Central  Article  Google Scholar 

  43. 43.

    Kochunov, P. et al. Multi-site study of additive genetic effects on fractional anisotropy of cerebral white matter: comparing meta and megaanalytical approaches for data pooling. Neuroimage 95, 136–150 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  44. 44.

    Jahanshad, N. et al. Genetic influences on brain asymmetry: a DTI study of 374 twins and siblings. Neuroimage 52, 455–469 (2010).

    PubMed  PubMed Central  Article  Google Scholar 

  45. 45.

    Eyler, L. T. et al. Conceptual and data-based investigation of genetic influences and brain asymmetry: a twin study of multiple structural phenotypes. J. Cogn. Neurosci. 26, 1100–1117 (2014).

    PubMed  Article  Google Scholar 

  46. 46.

    Peper, J. S., Brouwer, R. M., Boomsma, D. I., Kahn, R. S. & Hulshoff Pol, H. E. Genetic influences on human brain structure: a review of brain imaging studies in twins. Hum. Brain Mapp. 28, 464–473 (2007).

    PubMed  PubMed Central  Article  Google Scholar 

  47. 47.

    Guadalupe, T. et al. Differences in cerebral cortical anatomy of left- and right-handers. Front. Psychol. 5, 261 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  48. 48.

    Landis, J. R. & Koch, G. G. The measurement of observer agreement for categorical data. Biometrics 33, 159–174 (1977).

    CAS  PubMed  Article  Google Scholar 

  49. 49.

    Klein, A. et al. Mindboggling morphometry of human brains. PLoS Comput. Biol. 13, e1005350 (2017).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  50. 50.

    Dubois, J. et al. Mapping the early cortical folding process in the preterm newborn brain. Cereb. Cortex 18, 1444–1454 (2008).

    CAS  PubMed  Article  Google Scholar 

  51. 51.

    van Erp, T. G. M. et al. Subcortical brain volume abnormalities in 2028 individuals with schizophrenia and 2540 healthy controls via the ENIGMA consortium. Mol. Psychiatry 21, 585 (2016).

    PubMed  Article  Google Scholar 

  52. 52.

    de Zubicaray, G. I. et al. Meeting the challenges of neuroimaging genetics. Brain Imaging Behav. 2, 258–263 (2008).

    PubMed  PubMed Central  Article  Google Scholar 

  53. 53.

    Cachia, A. et al. The shape of the ACC contributes to cognitive control efficiency in preschoolers. J. Cogn. Neurosci. 26, 96–106 (2014).

    PubMed  Article  Google Scholar 

  54. 54.

    Muellner, J. et al. Altered structure of cortical sulci in gilles de la Tourette syndrome: further support for abnormal brain development. Mov. Disord. 30, 655–661 (2015).

    PubMed  Article  Google Scholar 

  55. 55.

    Liu, T. et al. Longitudinal changes in sulcal morphology associated with late-life aging and MCI. Neuroimage 74, 337–342 (2013).

    PubMed  Article  Google Scholar 

  56. 56.

    Reiner, P. et al. Sulcal span in Azheimer’s disease, amnestic mild cognitive impairment, and healthy controls. J. Alzheimers Dis. 29, 605–613 (2012).

    PubMed  Article  Google Scholar 

  57. 57.

    Hamelin, L., Bertoux, M., Bottlaender, M. & Corne, H. Sulcal morphology as a new imaging marker for the diagnosis of early onset Alzheimer’s disease. Neurobiol. Aging 36 2932–2939 (2015).

    PubMed  Article  Google Scholar 

  58. 58.

    Ge, T., Holmes, A. J., Buckner, R. L., Smoller, J. W. & Sabuncu, M. R. Heritability analysis with repeat measurements and its application to resting-state functional connectivity. Proc. Natl Acad. Sci. USA 114, 5521–5526 (2017).

    CAS  PubMed  Article  Google Scholar 

  59. 59.

    Elliott, M. L. et al. General functional connectivity: shared features of resting-state and task fMRI drive reliable and heritable individual differences in functional brain networks. Neuroimage 189, 516–532 (2019).

    PubMed  PubMed Central  Article  Google Scholar 

  60. 60.

    Roshchupkin, G. V. et al. Heritability of the shape of subcortical brain structures in the general population. Nat. Commun. 7, 13738 (2016).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  61. 61.

    Pizzagalli, F. et al. Genetic analysis of cortical sulci in 1,009 adults. In 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI) (2016)

  62. 62.

    Geschwind, D. H., Miller, B. L., DeCarli, C. & Carmelli, D. Heritability of lobar brain volumes in twins supports genetic models of cerebral laterality and handedness. Proc. Natl Acad. Sci. USA 99, 3176–3181 (2002).

    CAS  PubMed  Article  Google Scholar 

  63. 63.

    Woldehawariat, G. et al. Corpus callosum size is highly heritable in humans, and may reflect distinct genetic influences on ventral and rostral regions. PLoS ONE 9, e99980 (2014).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  64. 64.

    Pol, H. E. H. et al. Genetic contributions to human brain morphology and intelligence. J. Neurosci. 26, 10235–10242 (2006).

    CAS  Article  Google Scholar 

  65. 65.

    Joshi, A. A. et al. The contribution of genes to cortical thickness and volume. Neuroreport 22, 101–105 (2011).

    PubMed  PubMed Central  Article  Google Scholar 

  66. 66.

    Kremen, W. S. et al. Genetic and environmental influences on the size of specific brain regions in midlife: the VETSA MRI study. Neuroimage 49, 1213–1223 (2010).

    PubMed  Article  Google Scholar 

  67. 67.

    Lenroot, R. K. et al. Differences in genetic and environmental influences on the human cerebral cortex associated with development during childhood and adolescence. Hum. Brain Mapp. 30, 163–174 (2009).

    PubMed  Article  Google Scholar 

  68. 68.

    Rimol, L. M. et al. Cortical thickness is influenced by regionally specific genetic factors. Biol. Psychiatry 67, 493–499 (2010).

    CAS  PubMed  Article  Google Scholar 

  69. 69.

    Thompson, P. M. et al. Genetic influences on brain structure. Nat. Neurosci. 4, 1253–1258 (2001).

    CAS  PubMed  Article  Google Scholar 

  70. 70.

    Wright, I. C., Sham, P., Murray, R. M., Weinberger, D. R. & Bullmore, E. T. Genetic contributions to regional variability in human brain structure: methods and preliminary results. Neuroimage 17, 256–271 (2002).

    CAS  PubMed  Article  Google Scholar 

  71. 71.

    Chi, J. G., Dooling, E. C. & Gilles, F. H. Gyral development of the human brain. Ann. Neurol. 1, 86–93 (1977).

    CAS  PubMed  Article  Google Scholar 

  72. 72.

    White, T., Su, S., Schmidt, M., Kao, C. -Y. & Sapiro, G. The development of gyrification in childhood and adolescence. Brain Cogn. 72, 36–45 (2010).

    PubMed  Article  Google Scholar 

  73. 73.

    Docherty, A. R. et al. Comparison of twin and extended pedigree designs for obtaining heritability estimates. Behav. Genet. 45, 461–466 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  74. 74.

    Kochunov, P. et al. Heritability of fractional anisotropy in human white matter: a comparison of Human Connectome Project and ENIGMA-DTI data. Neuroimage 111, 300–311 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  75. 75.

    Mayhew, A. J. & Meyre, D. Assessing the heritability of complex traits in humans: methodological challenges and opportunities. Curr. Genomics 18, 332–340 (2017).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  76. 76.

    Le Guen, Y. et al. Heritability of surface area and cortical thickness: a comparison between the Human Connectome Project and the UK Biobank dataset. In 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019) 1887–1890 (IEEE, Venice, Italy, 2019).

  77. 77.

    Mesulam, M. -M. et al. Asymmetry and heterogeneity of Alzheimer’s and frontotemporal pathology in primary progressive aphasia. Brain 137, 1176–1192 (2014).

    PubMed  PubMed Central  Article  Google Scholar 

  78. 78.

    Jahanshad, N. et al. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity. Proc. Natl Acad. Sci. USA 110, 4768–4773 (2013).

    CAS  PubMed  Article  Google Scholar 

  79. 79.

    Highley, J. R. et al. Schizophrenia and the frontal lobes: post-mortem stereological study of tissue volume. Br. J. Psychiatry 178, 337–343 (2001).

    CAS  PubMed  Article  Google Scholar 

  80. 80.

    Grasby, K. L. et al. The genetic architecture of the human cerebral cortex. Science 367, eaay6690 (2020).

    CAS  PubMed  Article  Google Scholar 

  81. 81.

    Silbert, L. C. & Kaye, J. Neuroimaging and cognition in Parkinson’s disease dementia. Brain Pathol. 20, 646–653 (2010).

    PubMed  PubMed Central  Article  Google Scholar 

  82. 82.

    Burton, E. J., McKeith, I. G., Burn, D. J., Williams, E. D. & O’Brien, J. T. Cerebral atrophy in Parkinson’s disease with and without dementia: a comparison with Alzheimer’s disease, dementia with Lewy bodies and controls. Brain 127, 791–800 (2004).

    PubMed  Article  Google Scholar 

  83. 83.

    Risacher, S. L. & Saykin, A. J. Neuroimaging biomarkers of neurodegenerative diseases and dementia. Semin. Neurol. 33, 386–416 (2013).

    PubMed  PubMed Central  Article  Google Scholar 

  84. 84.

    Rogers, J. et al. On the genetic architecture of cortical folding and brain volume in primates. Neuroimage 53, 1103–1108 (2010).

    PubMed  PubMed Central  Article  Google Scholar 

  85. 85.

    Atkinson, E. G., Rogers, J., Mahaney, M. C., Cox, L. A. & Cheverud, J. M. Cortical folding of the primate brain: an interdisciplinary examination of the genetic architecture, modularity, and evolvability of a significant neurological trait in pedigreed baboons (genus Papio). Genetics 200, 651–665 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  86. 86.

    Whelan, C. D. et al. Heritability and reliability of automatically segmented human hippocampal formation subregions. Neuroimage 128, 125–137 (2016).

    PubMed  Article  Google Scholar 

  87. 87.

    Van Essen, D. C. et al. The Human Connectome Project: a data acquisition perspective. Neuroimage 62, 2222–2231 (2012).

    PubMed  PubMed Central  Article  Google Scholar 

  88. 88.

    Olvera, R. L. et al. Common genetic influences on depression, alcohol, and substance use disorders in Mexican-American families. Am. J. Med. Genet. B Neuropsychiatr. Genet. 156B, 561–568 (2011).

    CAS  PubMed  Article  Google Scholar 

  89. 89.

    McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet 48, 1279–1283 (2016).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  90. 90.

    Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).

    PubMed  PubMed Central  Article  Google Scholar 

  91. 91.

    Landman, B. A. et al. Multi-parametric neuroimaging reproducibility: a 3-T resource study. Neuroimage 54, 2854–2866 (2011).

    PubMed  Article  Google Scholar 

  92. 92.

    Marcus, D. S. et al. Open Access Series of Imaging Studies (OASIS): cross-sectional MRI data in young, middle aged, nondemented, and demented older adults. J. Cogn. Neurosci. 19, 1498–1507 (2007).

    PubMed  Article  Google Scholar 

  93. 93.

    Perrot, M., Rivière, D. & Mangin, J. -F. Cortical sulci recognition and spatial normalization. Med. Image Anal. 15, 529–550 (2011).

    PubMed  Article  Google Scholar 

  94. 94.

    Rivière, D. et al. Automatic recognition of cortical sulci of the human brain using a congregation of neural networks. Med. Image Anal. 6, 77–92 (2002).

    PubMed  Article  Google Scholar 

  95. 95.

    Mangin, J. F. et al. Object-based morphometry of the cerebral cortex. IEEE Trans. Med. Imaging 23, 968–982 (2004).

    CAS  PubMed  Article  Google Scholar 

  96. 96.

    Strike, L. T. et al. Genetic complexity of cortical structure: differences in genetic and environmental factors influencing cortical surface area and thickness. Cereb. Cortex 29, 952–962 (2019).

    PubMed  Article  Google Scholar 

  97. 97.

    Raznahan, A. et al. How does your cortex grow? J. Neurosci. 31, 7174–7177 (2011).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  98. 98.

    Rakic, P. Evolution of the neocortex: a perspective from developmental biology. Nat. Rev. Neurosci. 10, 724 (2009).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  99. 99.

    Carpenter M. B. Core Text of Neuroanatomy 2nd edn 354 (Williams and Wilkins, Baltimore, 1978).

  100. 100.

    Braak, H. Architectonics as seen by lipofuscin stains. In Cerebral Cortex, vol. 1:Cellular Organization of the Cerebral Cortex (eds Peters, A. & Jones E.G.) 59–104 (Plenum Press, New York, 1984).

  101. 101.

    Pizzagalli, F. et al. The core genetic network underlying sulcal morphometry. In 12th International Symposium on Medical Information Processing and Analysis, Vol. 10160 (International Society for Optics and Photonics, 2017).

  102. 102.

    Almasy, L. & Blangero, J. Multipoint quantitative-trait linkage analysis in general pedigrees. Am. J. Hum. Genet. 62, 1198–1211 (1998).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  103. 103.

    Almasy, L., Dyer, T. D. & Blangero, J. Bivariate quantitative trait linkage analysis: pleiotropy versus co-incident linkages. Genet. Epidemiol. 14, 953–958 (1997).

    CAS  PubMed  Article  Google Scholar 

  104. 104.

    Williams-Blangero, S. & Blangero, J. Quantitative genetic analysis of skin reflectance: a multivariate approach. Hum. Biol. 64, 35–49 (1992).

    CAS  PubMed  Google Scholar 

  105. 105.

    Blangero, J. et al. Quantitative trait nucleotide analysis using Bayesian model selection. 2005. Hum. Biol. 81, 829–847 (2009).

    PubMed  Article  Google Scholar 

  106. 106.

    Blangero, J. et al. Quantitative trait nucleotide analysis using Bayesian model selection. Hum. Biol. 77, 541–559 (2005).

    PubMed  Article  Google Scholar 

  107. 107.

    Edgington, E. S. An additive method for combining probability values from independent experiments. J. Psychol. 80, 351–363 (1972).

    Article  Google Scholar 

  108. 108.

    Heard, N. A. & Rubin-Delanchy, P. Choosing between methods of combining p-values. Biometrika 105, 239–246 (2018).

    Article  Google Scholar 

  109. 109.

    Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565–569 (2010).

    CAS  PubMed  PubMed Central  Article  Google Scholar 

  110. 110.

    Bland, J. M. & Altman, D. G. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1, 307–310 (1986).

    CAS  Article  PubMed  Google Scholar 

  111. 111.

    Andreotti, J. et al. Validation of network communicability metrics for the analysis of brain structural networks. PLoS ONE 9, e115503 (2014).

    PubMed  PubMed Central  Article  CAS  Google Scholar 

  112. 112.

    Lachin, J. M. The role of measurement reliability in clinical trials. Clin. Trials 1, 553–566 (2004).

    PubMed  Article  Google Scholar 

  113. 113.

    Shrout, P. E. & Fleiss, J. L. Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. 86, 420–428 (1979).

    CAS  PubMed  Article  Google Scholar 

Download references


This research was funded in part by NIH ENIGMA Center grant U54 EB020403, supported by the Big Data to Knowledge (BD2K) Centers of Excellence program funded by a cross-NIH initiative. Additional grant support was provided by: R01 AG059874, R01 MH117601, R01 MH121246, and P41 EB015922. QTIM was supported by NIH R01 HD050735 and the NHMRC 486682, Australia; GOBS: financial support for this study was provided by the National Institute of Mental Health grants MH078143 (PI: D.C.G.), MH078111 (PI: J. Blangero), and MH083824 (PI: D.C.G. and J. Blangero); HCP data were provided [in part] by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research and by the McDonnell Center for Systems Neuroscience at Washington University; UK Biobank: this research was conducted using the UK Biobank Resource under Application Number “11559”; BrainVISA’s Morphologist software development received funding from the European Union’s Horizon 2020 Framework Programme for Research and Innovation under Grant Agreement Nos. 720270 and 785907 (Human Brain ProjectSGA1 & SGA2), and by the FRM DIC20161236445. OASIS: Cross-Sectional: Principal Investigators: D. Marcus, R. Buckner, J. Csernansky, J. Morris; P50 AG05681, P01 AG03991, P01 AG026276, R01 AG021910, P20 MH071616, U24 RR021382. KKI was supported by NIH grants NCRR P41 RR015241 (Peter C.M. van Zijl), 1R01NS056307 (Jerry Prince), 1R21NS064534-01A109 (Bennett A. Landman/Jerry L. Prince), 1R03EB012461-01 (Bennett A. Landman). N.J. and P.M.T. are joint principal investigators for a research grant from Biogen, Inc. (Boston, USA) for processing of imaging data, some of which was used in this paper. No funding sources were involved in the design, analysis or outcomes of the study. We thank Anderson M. Winkler for comments and corrections on our initial biorxiv preprint, and Alessandra Griffa for helping to map the FreeSurfer results.

Author information




F.P. and N.J. designed the research, analyzed and reviewed the data and code, and drafted the initial paper; G.A., P.K., S.R.M., D.R., and J.F.M. provided analytical support and expertise; Q.Y., J.D.B., J.F., and A.A. processed and quality controlled the data; D.C.G., J.B., N.G.M., K.L.M., G.I.Z., M.J.W., and P.M.T. collected data and assisted with research design. All authors reviewed, edited, and provided critical feedback on the paper.

Corresponding authors

Correspondence to Fabrizio Pizzagalli or Neda Jahanshad.

Ethics declarations

Competing interests

N.J. and P.M.T. are joint principal investigators for a research grant from Biogen, Inc. (Boston, USA) for processing of imaging data, some of which was used in this paper. Biogen, Inc had no role in the conceptualization, design, data collection, analysis, decision to publish, or preparation of the manuscript. The remaining authors declare no competing financial or non-financial interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Pizzagalli, F., Auzias, G., Yang, Q. et al. The reliability and heritability of cortical folds and their genetic correlations across hemispheres. Commun Biol 3, 510 (2020).

Download citation

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing