Volumetric variations of the human brain are heritable and are associated with many brain-related complex traits. Here we performed genome-wide association studies (GWAS) of 101 brain volumetric phenotypes using the UK Biobank sample including 19,629 participants. GWAS identified 365 independent genetic variants exceeding a significance threshold of 4.9 × 10−10, adjusted for testing multiple phenotypes. A gene-based association study found 157 associated genes (124 new), and functional gene mapping analysis linked 146 additional genes. Many of the discovered genetic variants and genes have previously been implicated in cognitive and mental health traits. Through genome-wide polygenic-risk-score prediction, more than 6% of the phenotypic variance (P = 3.13 × 10−24) in four other independent studies could be explained by the UK Biobank GWAS results. In conclusion, our study identifies many new genetic associations at the variant, locus and gene levels and advances our understanding of the pleiotropy and genetic co-architecture between brain volumes and other traits.
This is a preview of subscription content
Subscribe to Nature+
Get immediate online access to the entire Nature family of 50+ journals
Subscribe to Journal
Get full journal access for 1 year
only $4.92 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Get time limited or full article access on ReadCube.
All prices are NET prices.
The data used in this work were obtained from five publicly available datasets: the UKB study, the HCP study, the PING study, the PNC study and the ADNI study. We used 50 sets of publicly available GWAS summary statistics from several GWAS databases. The data resources are summarized in Supplementary Table 24. All UKB and meta-analysis GWAS summary statistics of 101 ROI volumes can be found at https://github.com/BIG-S2/GWAS.
We made use of publicly available software and tools. All codes used to generate results that are reported in this paper are available upon request.
Ritchie, S. J. et al. Beyond a bigger brain: multivariable structural brain imaging and intelligence. Intelligence 51, 47–56 (2015).
Davies, G. et al. Genome-wide association study of cognitive functions and educational attainment in UK Biobank (N=112 151). Mol. Psychiatry 21, 758–767 (2016).
van der Meer, D. et al. Brain scans from 21,297 individuals reveal the genetic architecture of hippocampal subfield volumes. Mol. Psychiatry https://doi.org/10.1038/s41380-018-0262-7 (2018).
Caldiroli, A. et al. The relationship of IQ and emotional processing with insula volume in schizophrenia. Schizophrenia Res. 202, 141–148 (2018).
Vreeker, A. et al. The relationship between brain volumes and intelligence in bipolar disorder. J. Affect. Disord. 223, 59–64 (2017).
Wigmore, E. M. et al. Do regional brain volumes and major depressive disorder share genetic architecture? A study of Generation Scotland (n=19 762), UK Biobank (n= 24 048) and the English Longitudinal Study of Ageing (n=5766). Transl. Psychiatry 7, e1205 (2017).
Wen, W. et al. Distinct genetic influences on cortical and subcortical brain structures. Sci. Rep. 6, 32760 (2016).
den Braber, A. et al. Heritability of subcortical brain measures: a perspective for future genome-wide association studies. NeuroImage 83, 98–102 (2013).
Eyler, L. T. et al. Conceptual and data-based investigation of genetic influences and brain asymmetry: a twin study of multiple structural phenotypes. J. Cogn. Neurosci. 26, 1100–1117 (2014).
Blokland, G. A., de Zubicaray, G. I., McMahon, K. L. & Wright, M. J. Genetic and environmental influences on neuroimaging phenotypes: a meta-analytical perspective on twin imaging studies. Twin Res. Hum. Genet. 15, 351–371 (2012).
Kremen, W. S. et al. Genetic and environmental influences on the size of specific brain regions in midlife: the VETSA MRI study. NeuroImage 49, 1213–1223 (2010).
Jansen, A. G., Mous, S. E., White, T., Posthuma, D. & Polderman, T. J. What twin studies tell us about the heritability of brain development, morphology, and function: a review. Neuropsychol. Rev. 25, 27–46 (2015).
Zhao, B. et al. Heritability of regional brain volumes in large-scale neuroimaging and genetic studies. Cereb. Cortex 29, 2904–2914 (2018).
Elliott, L. T. et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature 562, 210–216 (2018).
Biton, A. et al. Polygenic architecture of human neuroanatomical diversity. Preprint at bioRxiv https://doi.org/10.1101/592337 (2019).
Toro, R. et al. Genomic architecture of human neuroanatomical diversity. Mol. Psychiatry 20, 1011–1016 (2015).
Hibar, D. P. et al. Common genetic variants influence human subcortical brain structures. Nature 520, 224–229 (2015).
Yang, J. et al. Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565–569 (2010).
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell 169, 1177–1186 (2017).
Timpson, N. J., Greenwood, C. M. T., Soranzo, N., Lawson, D. J. & Richards, J. B. Genetic architecture: the shape of the genetic contribution to human traits and disease. Nat. Rev. Genet. 19, 110–124 (2017).
Hibar, D. P. et al. Novel genetic loci associated with hippocampal volume. Nat. Commun. 8, 13624 (2017).
Franke, B. et al. Genetic influences on schizophrenia and subcortical brain volumes: large-scale proof of concept. Nat. Neurosci. 19, 420–431 (2016).
Guadalupe, T. et al. Human subcortical brain asymmetries in 15,847 people worldwide reveal effects of age and sex. Brain Imaging Behav. 11, 1497–1514 (2017).
Ikram, M. A. et al. Common variants at 6q22 and 17q21 are associated with intracranial volume. Nat. Genet. 44, 539–544 (2012).
Bis, J. C. et al. Common variants at 12q14 and 12q24 are associated with hippocampal volume. Nat. Genet. 44, 545–551 (2012).
Satizabal, C. L. et al. Genetic architecture of subcortical brain structures in over 40,000 individuals worldwide. Preprint at bioRxiv https://doi.org/10.1101/173831 (2017).
Davies, G. et al. Study of 300,486 individuals identifies 148 independent genetic loci influencing general cognitive function. Nat. Commun. 9, 2098 (2018).
Nagel, M. et al. Meta-analysis of genome-wide association studies for neuroticism in 449,484 individuals identifies novel genetic loci and pathways. Nat. Genet. 50, 920–927 (2018).
Savage, J. E. et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat. Genet. 50, 912–919 (2018).
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
Satterthwaite, T. D. et al. Neuroimaging of the Philadelphia neurodevelopmental cohort. Neuroimage 86, 544–553 (2014).
Weiner, M. W. et al. The Alzheimer’s Disease Neuroimaging Initiative: a review of papers published since its inception. Alzheimer’s Dement. 9, e111–e194 (2013).
Jernigan, T. L. et al. The Pediatric Imaging, Neurocognition, and Genetics (PING) data repository. Neuroimage 124, 1149–1154 (2016).
Somerville, L. H. et al. The Lifespan Human Connectome Project in Development: a large-scale study of brain connectivity development in 5–21 year olds. NeuroImage 183, 456–468 (2018).
Avants, B. B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. NeuroImage 54, 2033–2044 (2011).
Tustison, N. J. et al. Large-scale evaluation of ANTs and FreeSurfer cortical thickness measurements. NeuroImage 99, 166–179 (2014).
Yang, J., Zeng, J., Goddard, M. E., Wray, N. R. & Visscher, P. M. Concepts, estimation and interpretation of SNP-based heritability. Nat. Genet. 49, 1304–1310 (2017).
de Leeuw, C. A., Mooij, J. M., Heskes, T. & Posthuma, D. MAGMA: generalized gene-set analysis of GWAS data. PLoS Computational Biol. 11, e1004219 (2015).
Watanabe, K., Taskesen, E., Bochoven, A. & Posthuma, D. Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017).
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015).
1000 Genomes Project Consortium A global reference for human genetic variation. Nature 526, 68–74 (2015).
Buniello, A. et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 47, D1005–D1012 (2018).
Chen, C.-H. et al. Leveraging genome characteristics to improve gene discovery for putamen subcortical brain structure. Sci. Rep. 7, 15736 (2017).
Stein, J. L. et al. Identification of common variants associated with human hippocampal and intracranial volumes. Nat. Genet. 44, 552–561 (2012).
Furney, S. et al. Genome-wide association with MRI atrophy measures as a quantitative trait locus for Alzheimer’s disease. Mol. Psychiatry 16, 1130–1138 (2011).
Baranzini, S. E. et al. Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis. Hum. Mol. Genet. 18, 767–778 (2008).
Fischl, B. FreeSurfer. NeuroImage 62, 774–781 (2012).
Hibar, D. P. et al. Genome-wide association identifies genetic variants associated with lentiform nucleus volume in N=1345 young and elderly subjects. Brain Imaging Behav. 7, 102–115 (2013).
Sprooten, E. et al. White matter integrity as an intermediate phenotype: exploratory genome-wide association analysis in individuals at high risk of bipolar disorder. Psychiatry Res. 206, 223–231 (2013).
Kim, S. & Webster, M. Integrative genome-wide association analysis of cytoarchitectural abnormalities in the prefrontal cortex of psychiatric disorders. Mol. Psychiatry 16, 452–461 (2011).
Klein, M. et al. Genetic markers of ADHD-related variations in intracranial volume. Am. J. Psychiatry 176, 228–238 (2019).
Hill, W. et al. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol. Psychiatry 24, 169–181 (2019).
Jun, G. et al. A novel Alzheimer disease locus located near the gene encoding tau protein. Mol. Psychiatry 21, 108–117 (2016).
Luciano, M. et al. Association analysis in over 329,000 individuals identifies 116 independent variants influencing neuroticism. Nat. Genet. 50, 6–11 (2018).
Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018).
Edwards, T. L. et al. Genome‐wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease. Ann. Hum. Genet. 74, 97–109 (2010).
Turley, P. et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 50, 229–237 (2018).
Okbay, A. et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nat. Genet. 48, 624–633 (2016).
Zheng, H.-F. et al. WNT16 influences bone mineral density, cortical bone thickness, bone strength, and osteoporotic fracture risk. PLoS Genet. 8, e1002745 (2012).
Pardiñas, A. F. et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat. Genet. 50, 381–389 (2018).
Li, Z. et al. Genome-wide association analysis identifies 30 new susceptibility loci for schizophrenia. Nat. Genet. 49, 1576–1583 (2017).
Marioni, R. E. et al. GWAS on family history of Alzheimer’s disease. Transl. Psychiatry 8, 99 (2018).
Linnér, R. K. et al. Genome-wide association analyses of risk tolerance and risky behaviors in over 1 million individuals identify hundreds of loci and shared genetic influences. Nat. Genet. 51, 245–257 (2019).
Kamboh, M. et al. Genome-wide association study of Alzheimer’s disease. Transl. Psychiatry 2, e117 (2012).
Finucane, H. K. et al. Heritability enrichment of specifically expressed genes identifies disease-relevant tissues and cell types. Nat. Genet. 50, 621–629 (2018).
Pers, T. H. et al. Biological interpretation of genome-wide association studies using predicted gene functions. Nat. Commun. 6, 5890 (2015).
Jansen, P. R. et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nat. Genet. 51, 394–403 (2019).
Skol, A. D., Scott, L. J., Abecasis, G. R. & Boehnke, M. Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies. Nat. Genet. 38, 209–213 (2006).
Hanscombe, K. B. et al. Genetic factors influencing coagulation factor XIII B-subunit contribute to risk of ischemic stroke. Stroke 46, 2069–2074 (2015).
Thompson, P. M. et al. The ENIGMA consortium: large-scale collaborative analyses of neuroimaging and genetic data. Brain Imaging Behav. 8, 153–182 (2014).
Nave, G., Jung, W. H., Karlsson Linnér, R., Kable, J. W. & Koellinger, P. D. Are bigger brains smarter? Evidence from a large-scale preregistered study. Psychological Sci. 30, 43–54 (2019).
Walhovd, K. B. & Fjell, A. M. White matter volume predicts reaction time instability. Neuropsychologia 45, 2277–2284 (2007).
Delorme, S. et al. Reaction time is negatively associated with corpus callosum area in the early stages of CADASIL. Am. J. Neuroradiol. 38, 2094–2099 (2017).
The International Schizophrenia Consortium Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature 460, 748–752 (2009).
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
Thompson, P. M. et al. Genetic influences on brain structure. Nat. Neurosci. 4, 1253–1258 (2001).
Peper, J. S., Brouwer, R. M., Boomsma, D. I., Kahn, R. S. & Hulshoff Pol, H. E. Genetic influences on human brain structure: a review of brain imaging studies in twins. Hum. Brain Mapp. 28, 464–473 (2007).
Yoon, U., Perusse, D., Lee, J.-M. & Evans, A. C. Genetic and environmental influences on structural variability of the brain in pediatric twin: deformation based morphometry. Neurosci. Lett. 493, 8–13 (2011).
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Gusev, A. et al. Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nat. Genet. 50, 538–548 (2018).
Miller, K. L. et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci. 19, 1523–1536 (2016).
Smith, S. M. & Nichols, T. E. Statistical challenges in “big data” human neuroimaging. Neuron 97, 263–268 (2018).
Fortin, J.-P. et al. Removing inter-subject technical variability in magnetic resonance imaging studies. NeuroImage 132, 198–212 (2016).
Fortin, J.-P. et al. Harmonization of cortical thickness measurements across scanners and sites. NeuroImage 167, 104–120 (2018).
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Gao, F. et al. XWAS: a software toolset for genetic data analysis and association studies of the X chromosome. J. Heredity 106, 666–671 (2015).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164–e164 (2010).
GTEx Consortium The genotype-tissue expression (GTEx) pilot analysis: multitissue gene regulation in humans. Science 348, 648–660 (2015).
Ramasamy, A. et al. Genetic variability in the regulation of gene expression in ten regions of the human brain. Nat. Neurosci. 17, 1418–1428 (2014).
Fromer, M. et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat. Neurosci. 19, 1442–1453 (2016).
Schmitt, A. D. et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 17, 2042–2059 (2016).
Roadmap Epigenomics Consortium et al. Integrative analysis of 111 reference human epigenomes. Nature 518, 317–330 (2015).
ENCODE Project Consortium An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74 (2012).
Liberzon, A. et al. Molecular signatures database (MSigDB) 3.0. Bioinformatics 27, 1739–1740 (2011).
International HapMap 3 Consortium Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 (2010).
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C. A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
This research was partially supported by US National Institutes of Health (NIH) grants MH086633 (H.Z.) and MH116527 (T.Li), and a grant from the Cancer Prevention Research Institute of Texas (H.Z.). We thank the individuals represented in the UKB, ADNI, HCP, PING and PNC datasets for their participation and the research teams for their work in collecting, processing and disseminating these datasets for analysis. This research has been conducted using the UKB resource (application number 22783), subject to a data transfer agreement. We gratefully acknowledge all of the studies and databases that made GWAS summary data available. Part of data collection and sharing for this project was funded by the ADNI (NIH grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering and through generous contributions from the following: Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica; Biogen Idec; Bristol-Myers Squibb; Eisai; Elan Pharmaceuticals; Eli Lilly and Company; EuroImmun; Roche and its affiliated company Genentech; Fujirebio; GE Healthcare; IXICO; Janssen Alzheimer Immunotherapy Research and Development; Johnson and Johnson Pharmaceutical Research and Development; Medpace.; Merck and Co.; Meso Scale Diagnostics; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer; Piramal Imaging; Servier; Synarc; and Takeda Pharmaceutical. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the NIH (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. Part of the data collection and sharing for this project was funded by the PING study (US NIH grant RC2DA029475). PING is funded by the National Institute on Drug Abuse and the Eunice Kennedy Shriver National Institute of Child Health and Human Development. PING data are disseminated by the PING Coordinating Center at the Center for Human Development, University of California, San Diego. Support for the collection of the PNC datasets was provided by grant RC2MH089983 awarded to R. Gur and RC2MH089924 awarded to H. Hakonarson. All PNC subjects were recruited through the Center for Applied Genomics at The Children’s Hospital in Philadelphia. HCP data were provided by the HCP, WU-Minn Consortium (Principal Investigators: D. Van Essen and K. Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A list of members and affiliations appears in the Supplementary Note.
Supplementary Figs. 1–17 and Note
Supplementary Tables 1–30
GWAS Manhattan plots for all 101 ROI volumes (n = 19,629 subjects).
GWAS quantile–quantile plots for all 101 ROI volumes (n = 19,629 subjects).
About this article
Cite this article
Zhao, B., Luo, T., Li, T. et al. Genome-wide association analysis of 19,629 individuals identifies variants influencing regional brain volumes and refines their genetic co-architecture with cognitive and mental health traits. Nat Genet 51, 1637–1644 (2019). https://doi.org/10.1038/s41588-019-0516-6
Innovative computational approaches shed light on genetic mechanisms underlying cognitive impairment among children born extremely preterm
Journal of Neurodevelopmental Disorders (2022)
Nature Genetics (2022)
Multivariate genome-wide association study on tissue-sensitive diffusion metrics highlights pathways that shape the human brain
Nature Communications (2022)
Molecular Psychiatry (2022)
Molecular Psychiatry (2022)