General cognitive function is substantially heritable across the human life course from adolescence to old age. We investigated the genetic contribution to variation in this important, health- and well-being-related trait in middle-aged and older adults. We conducted a meta-analysis of genome-wide association studies of 31 cohorts (N=53 949) in which the participants had undertaken multiple, diverse cognitive tests. A general cognitive function phenotype was tested for, and created in each cohort by principal component analysis. We report 13 genome-wide significant single-nucleotide polymorphism (SNP) associations in three genomic regions, 6q16.1, 14q12 and 19q13.32 (best SNP and closest gene, respectively: rs10457441, P=3.93 × 10−9, MIR2113; rs17522122, P=2.55 × 10−8, AKAP6; rs10119, P=5.67 × 10−9, APOE/TOMM40). We report one gene-based significant association with the HMGN1 gene located on chromosome 21 (P=1 × 10−6). These genes have previously been associated with neuropsychiatric phenotypes. Meta-analysis results are consistent with a polygenic model of inheritance. To estimate SNP-based heritability, the genome-wide complex trait analysis procedure was applied to two large cohorts, the Atherosclerosis Risk in Communities Study (N=6617) and the Health and Retirement Study (N=5976). The proportion of phenotypic variation accounted for by all genotyped common SNPs was 29% (s.e.=5%) and 28% (s.e.=7%), respectively. Using polygenic prediction analysis, ~1.2% of the variance in general cognitive function was predicted in the Generation Scotland cohort (N=5487; P=1.5 × 10−17). In hypothesis-driven tests, there was significant association between general cognitive function and four genes previously associated with Alzheimer’s disease: TOMM40, APOE, ABCG1 and MEF2C.
The systems with which humans face the challenges of the external and internal environments tend to show deterioration in their mean level of efficiency as people age. For example, the immune system,1 cardiovascular2 and respiratory systems,3 renal function4 and stress responses5 are, on average, not as efficient in old age as they were in young adulthood. When met with a challenge to a system, healthy older people tend to have a reduced reserve capacity compared with younger adults. The nervous system also declines, with sensory function6, 7 and motor strength and co-ordination8 not at their highest mean level in older age.
The brain’s cognitive functions show a heterogeneity of age-related changes. Some capabilities such as vocabulary, some number skills and general knowledge withstand the aging trend and tend (in the absence of neurological disease) to remain intact into older age, although they decline eventually.9, 10, 11 These cognitive functions, which rely on the access to stored information, are called crystallized abilities.12
By contrast, fluid cognitive abilities, which rely on on-the-spot information processing, show more age-related decline in mean levels.9, 10, 11, 12, 13, 14 Through fluid cognitive abilities, the brain equips us to, for example, recognize and recall previously unseen stimuli, and to make associations between previously unrelated stimuli (memory); induce general rules from sets of occurrences and to apply these rules to new situations (reasoning); perform simple, repetitive cognitive tasks accurately and at high speed (processing speed); compute mental two- and three-dimensional transformations of objects and locations (spatial ability); and organize thinking (executive functioning). By comparison with crystallized abilities, tests of fluid cognitive abilities tend to employ unfamiliar and often abstract materials, and draw less on stored knowledge, education and broader enculturation. Each domain of fluid cognitive ability is important and is a major object of research in its own right.15, 16 However, when taken together the cognitive domains show an interesting regularity. If we apply a number of cognitive tests that assess diverse cognitive functions to a large and varied sample, the correlations are universally positive; people who tend to do well on one domain of fluid cognitive ability tend to do well in all of the others, although the associations are far from perfect.17 This means that some of the observed interindividual variation in any single cognitive domain can be attributed to at least four sources: how good they are generally at all fluid cognitive tests, how good they are at that fluid cognitive domain, how good they are at specific tests within any one domain, and error of measurement.18
In the present study, we focus on the variance that crosses a number of different fluid cognitive functions, that is, on general fluid cognitive ability. We examine the genetic contributions to this ability in middle and older age. There are good reasons for doing so. General cognitive ability remains a strong source of cognitive variation in older age, probably accounting for about as much cognitive variation as in young adulthood.19 General cognitive function accounts for a substantial proportion of the age-related variance9, 20and genetic variance21, 22 of people’s cognitive test differences in middle and older age. Across adulthood, and especially in older age, lower fluid cognitive ability and greater decline in it across the life course are associated with earlier mortality.23, 24, 25 It can also be argued that if mean levels of fluid cognitive tests show decline with age, then people’s differences in cognitive functions become more important as they grow older, because the levels of cognitive functioning that remain in older age are nearer to the critical levels that are needed for everyday functions.26
To date, behavior genetic research—using twin, adoption and family designs—shows that general cognitive ability is substantially heritable across the life course, from late childhood to old age.22 The heritability of general cognitive functioning in old age might decrease slightly from its levels in young and middle adulthood. Candidate gene studies have found that variation in APOE genotype is the only reliable individual genetic associate of cognitive function in older age, but that might apply especially to cognitive change rather than cognitive level in old age.27, 28, 29 Using the genome-wide complex trait analysis procedure (GCTA), genome-wide association studies (GWAS) found that ~51% (the s.e. was large, at 11%) of the variation in general fluid cognitive function in late middle age and older age could be accounted for by genetic variation that is tagged by single-nucleotide polymorphisms (SNPs) on the Illumina610-Quadv1 chip.30 That study was conducted in a total discovery sample of 3511 individuals, with replication in 670 independent individuals. It found no genome-wide significant single SNP associations. From other GWAS studies of complex traits, we now know that this sample size is likely to be too small, by an order of magnitude, to detect genome-wide significant SNPs.31
In summary, general fluid cognitive functioning is a key aspect of health in older age. It is important to identify the determinants of its individual differences, both environmental and genetic. To date, studies have been too small to detect the expected small genetic effects. Here we conduct a meta-analysis of GWAS studies of general fluid cognitive functioning in middle and older age from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium, with a total of 53 949 individuals.
Materials and methods
This report includes individuals from 31 population-based cohorts participating in the Cohorts for Heart and Aging Research in Genomic Epidemiology consortium (Supplementary Table S1 and Supplementary Information 1 Section 1). All participants were of European ancestry and aged 45 years or older. Exclusion criteria included prevalent dementia and clinical stroke (including self-reported stroke). The total sample size was 53 949 individuals (Nmen=23 030, Nwomen=30 919).
General fluid cognitive function phenotype
For each of the cohorts, a general fluid cognitive function component phenotype was constructed from a number of cognitive tasks, each testing a different cognitive domain. In order to construct this measure, each cohort was required to have tasks that tested at least three different cognitive domains. Principal component analysis was applied to the cognitive task scores to derive a measure of general cognitive function, which was the score on the first unrotated principal component. Correlations between each test score and the general cognitive function score were calculated for each cohort, to confirm that all cohorts’ general cognitive function scores had been scored in the required direction, that is, with a higher test score indicating higher cognitive function. Further details of the cognitive tasks undertaken and of the phenotype construction in each cohort are provided in Supplementary Information 1 Section 2. In summary, there was a clear single component accounting for between 33.7% and 62.3% (mean=49.6%) of the total cognitive test variance in all cohorts.
Cohorts used different batteries of cognitive tests, which means that there will be phenotypic heterogeneity. However, it has been shown that the individual differences found on the general cognitive component derived from different cognitive test batteries are very similar.32, 33 Here we give an example of the similarity of the scores obtained when using two different sets of tests to derive the general cognitive ability component. This can be illustrated in the Lothian Birth Cohort 1936, because it has such a large battery of cognitive tests.34 Two general fluid-type cognitive function component phenotypes could be derived, each using a different battery of cognitive tests. Of course, only one of these was used in the GWAS study. The first comprised six non-verbal tests from the Wechsler Adult Intelligence Scale-III UK; these were Block Design, Digit Symbol, Symbol Search, Letter-Number Sequencing, Backward Digit Span and Matrix Reasoning. The second contained the Moray House Test, Logical Memory, Spatial Span, Four Choice Reaction Time and Verbal Fluency. These two general cognitive function phenotypes calculated from two non-overlapping batteries of cognitive tests in the Lothian Birth Cohort 1936 had a correlation of r=0.79 (P<0.001). Thus, we use this as a methodological illustration to show that there is substantial overlap between the general fluid cognitive ability components from different sets of tests.
Genotyping and quality control
Each cohort applied quality control (QC) measures based on SNP and sample-based missingness, minor allele frequency, Hardy–Weinberg equilibrium, relatedness and evidence of non-Caucasian descent. Cohort-specific thresholds for these QC measures along with details of genotyping, imputation methods and reference panels are detailed in Supplementary Table S2.
Genotype–phenotype association analyses
Genotype–phenotype association analyses were performed, using an additive model, on imputed SNP dosage scores within each cohort (Supplementary Figures S1 and S2). Adjustments for age, sex and population stratification, if required, were included in the model. Cohort-specific covariates—for example, site or familial relationships—were also fitted as required. The cohort-specific association results were subjected to QC procedures before meta-analysis. SNPs were excluded based on imputation quality (IMPUTE info<0.4, MACH r2<0.3, BIMBAM r2<0.3) and minor allele frequency (<0.5%). Only SNPs that passed these QC criteria in >50% of individuals were included in the meta-analysis (2 478 500 SNPs). A meta-analysis of the 31 cohorts was performed using the METAL package with an inverse variance weighted model implemented and single genomic control applied (http://www.sph.umich.edu/csg/abecasis/Metal). SNP-based results were also compared to published results for educational attainment35 (Social Science Genetic Association Consortium) and general cognitive function in childhood36 (Childhood Intelligence Consortium).
Gene-based tests of association were performed using the VEGAS software.37 In addition, we examined the gene-based findings for association with published candidate genes previously associated with Alzheimer’s disease (AD) or neuropathological features of AD and related dementias,38, 39, 40, 41, 42, 43, 44 educational attainment35 and general cognitive function in childhood.36
In order to perform prediction analyses in Generation Scotland (GS), a meta-analysis was performed, which excluded this cohort. A multi-SNP prediction model was created using the profile scoring method implemented in PLINK.45 This uses the effect sizes estimated in the meta-analysis. The GS best-guess imputed data were used for this analysis, an imputation quality threshold of r2>0.9 was applied, and the remaining SNPs were pruned to remove those in linkage disequilibrium (based on r2>0.25 within a 200-SNP sliding window). The estimated effect sizes from the meta-analysis for each of these SNPs were then used to calculate a prediction score for each GS individual. A series of prediction scores was created based on the inclusion of SNPs with a range of association P-values: all SNPs and SNPs with P<0.5, P<0.1, P<0.05 or P<0.01. Linear or logistic regressions of the prediction score and cognitive phenotypes, and some health outcomes previously associated with cognitive function46 were performed. We calculated the proportion of phenotypic variance that was predicted by adding the prediction score to a ‘null’ model that adjusted for age, sex and population stratification (four principal components). The cognitive phenotypes from GS that were included in the prediction analysis were general cognitive function, general fluid cognitive function, Wechsler Digit Symbol Substitution Task,47 Wechsler Logical Memory Test,48 Verbal Fluency49 and the Mill Hill Vocabulary Scale (junior+senior synonyms).50 The health outcomes included in the prediction analysis were self-reported cardiovascular disease, type 2 diabetes and hypertension. Polygenic prediction analyses were also performed in GS for cognitive phenotypes using published GWAS results for AD39 (International Genomics of Alzheimer’s Project) and educational attainment.35
Estimation of SNP-based heritability using GCTA analysis
The GCTA program51 was used to estimate the proportion of variance explained by all common SNPs for general cognitive function in the Atherosclerosis Risk in Communities Study (ARIC) and Health and Retirement Study (HRS) cohorts. These cohorts were selected to be used for this analysis, because they are two of the largest cohorts in the study. The total numbers of individuals included in these analyses were 6617 for the ARIC cohort and 5976 for the HRS cohort. These totals differ from the single SNP analyses, because close relatives were excluded from this analysis. One individual was excluded from any pair of individuals that had an estimated coefficient of relatedness of >0.025 to ensure that effects due to shared environment were not included. The same covariates were included in the GCTA analyses as for the SNP-based association analyses.
Pathway and functional genomic analyses
The following pathway and functional genomic methods were performed: INRICH52 and Ingenuity Pathway Analysis (Ingenuity Systems, www.ingenuity.com), and reference was made to The Genotype-Tissue Expression Portal (http://www.gtexportal.org), Regulome DB database53 and the Human Brain Transcriptome Project (hbatlas.org).54 Full details are given in Supplementary Methods.
The SNP-based meta-analysis identified 13 SNPs associated with general cognitive function at a genome-wide significance level (P<5 × 10−8) (Figure 1, Figure 2 and Supplementary Figure S3). These SNPs were located in three genomic regions, 6q16.1, 14q12 and 19q13.32. The top SNP in each region and genes contained in the region were, 6q16.1, rs10457441 (P=3.93 × 10−9; MIR2113), 14q12, rs17522122 (P=2.55 × 10−8; AKAP6/NPAS3) and 19q13.32, rs10119 (P=5.67 × 10−9; TOMM40/APOE) (Figure 2). The effect size of rs10119 was significantly correlated with mean age of the cohort (r=−0.424, P=0.022; Figure 3 and Supplementary Figure S4). There was no significant correlation with cohort age for the other two SNPs (rs10457441 and rs17522122) (Supplementary Figures S5 and S6). All 361 SNPs from the meta-analysis with a P-value less than a suggestive significance threshold of P<1 × 10−5 are detailed in Supplementary Table S3.
Gene-based tests of association yielded one genome-wide significant result (P<2.8 × 10−6), for the gene HMGN1 (P=1 × 10−6) located on chromosome 21. All 184 gene-based association results with a P-value below the suggestive threshold (P<1 × 10−3) are detailed in Supplementary Table S4.
Supplementary Table S5 shows the gene-based association results for 29 genes previously reported to be associated with AD or neuropathologic features of AD and related dementias.38, 39, 40, 41, 42, 43, 44 Four of these genes, TOMM40, APOE, MEF2C and ABCG1, were associated with general fluid cognitive function at P<0.01. Owing to linkage disequilibrium, the association P-values for APOE and TOMM40 are not independent. APOE and TOMM40 are located in the region of chromosome 19 identified by the SNP-based meta-analysis.
Supplementary Tables S6A and S7A show the top SNP-based results from published GWAS of educational attainment35 and general cognitive function in childhood.36 It should be noted that these are not independent studies due to sample overlap in some cohorts between the current study and both of these previously published studies (overlaps are: educational attainment N~30 000; general cognitive function in childhood N~1500). Of the 361 suggestively significant SNPs from the current meta-analysis (Supplementary Table S3), 188 and 192 SNPs demonstrated nominal significance (P<0.05) with the educational attainment phenotypes of years of education and college completion, respectively (Supplementary Table S7B). Sixteen SNPs achieved P<1 × 10−6 in the educational attainment analyses35; six of these achieved nominal significance in the current meta-analysis (Supplementary Table S7A). Of the top 100 SNPs in the general cognitive function in childhood GWAS,36 11 reached a nominal level of significance in the current study (Supplementary Table S6A). Comparisons of gene-based results are shown in Supplementary Tables S6B, S7C and S7D. For the educational attainment phenotypes, 17 and 14 of the top 25 genes associated with years of education and college completion,35 respectively, were nominally significant in the current gene-based results. For childhood general cognitive function,36 eight of the top 20 gene-based findings achieve nominal significance in the current study.
Results from the meta-analysis are consistent with a polygenic model of inheritance. This is demonstrated by the clear early deviation from the null hypothesis observed in the QQ plot (λ=1.14) (Figure 1). To investigate further whether the general cognitive phenotype is under polygenic control as is indicated by the association meta-analyses, we applied the analysis within the GCTA set of methods51 that estimates the proportion of phenotypic variance explained by all SNPs in two of the largest single cohorts: ARIC and HRS. For the ARIC and HRS cohorts, respectively, the GCTA method returned SNP heritability estimates of 0.29 (s.e.=0.05; N=6617; P=2.34 × 10−9) and 0.28 (s.e.=0.07; N=5976; P=2.00 × 10−5).
The results from the polygenic prediction analyses are shown in Supplementary Table S8. The maximum proportion of phenotypic variance explained in GS using the prediction set derived from the meta-analysis excluding GS was 1.27% (P=1.5 × 10−17) for general cognitive function when using the P<0.50 SNP set (N=47 322). The proportion of variance explained in other cognitive domains ranged from near-zero values to about 1% (Supplementary Table S8). The polygenic score did not significantly predict cardiovascular disease, hypertension or type 2 diabetes in GS (all P>0.01). Supplementary Table S9 shows the results from the polygenic prediction using the published results for educational attainment35 (years of education and completion of a college degree) and AD.39 For educational attainment, the maximum proportion of phenotypic variance explained in GS for the general cognitive phenotype was 0.54% (P=2.78 × 10−8) when using the P<0.50 SNP set (N=40 239). For AD, significant predictions were observed for only the more inclusive SNP sets (P<0.50 and P<1) and the maximum proportion of phenotypic variance explained was 0.19%. Owing to the interdependency of the cognitive function measures, health outcomes and polygenic scores, no correction for multiple testing was applied to the P-values presented.
Supplementary Table S10 lists Gene Ontology gene sets that showed nominal enrichment before correction for multiple testing in the INRICH analysis. None of the Gene Ontology gene sets remained significant after correction for multiple testing. The highest ranked 70-node network in the Ingenuity Pathway Analysis included 58 molecules with CDK2 as a central hub (Supplementary Figure S7). The second-ranked 70-node network included 38 molecules and had three multi-connected hubs: RHOA, NUPR1 and SRF. The highest ranked 140-node network includes 103 molecules and combines the top two 70-node networks (Supplementary Figure S8). The top function categories associated with this network are cell cycle, cell death and survival, and gene expression. The top canonical pathways were inositol pyrophosphates biosynthesis, tRNA charging, Ga12/13 signaling, IL-15 production and role of NFAT in regulation of the immune response.
Using the Genotype-Tissue Expression Portal database (http://www.gtexportal.org), no cis–expression quantitative trait loci associations were identified for the 13 genome-wide significant SNPs. Supplementary Figure S9 shows differential expression of the top two genes from the VEGAS analysis, HMGN1 and BRWD1, in six brain regions over the human lifecourse (http://hbatlas.org/pages/hbtd). Neither gene demonstrated differential expression across the brain regions shown in middle and older age. For this study, data mining of regulatory elements was restricted to normal brain relevant cell lines/tissues. Supplementary Table S11 demonstrates evidence of regulatory elements associated with 7 of the 13 genome-wide significant SNPs (http://www.regulomedb.org/).
In this genome-wide association study of general cognitive function in middle and older age, with a total N of 53 949 participants, we report 13 genome-wide significant SNP-based associations in the three genomic regions 6q16.1, 14q12 and 19q13.32. There was one gene-based significant association with the HMGN1 gene located on chromosome 21. We observed association of general cognitive function with four genes previously associated with AD or neuropathological features of AD and related dementias (TOMM40, APOE, MEF2C and ABCG1). The results from the meta-analysis are consistent with a polygenic model of inheritance, indicating that many variants of small effect contribute to the additive genetic influences on general cognitive function. Using GCTA, we present consistent estimates of the lower bound of the narrow sense heritability of general fluid cognitive function, 0.29 and 0.28, from two of the largest cohorts (ARIC and HRS). We are able to predict, using only common SNP data to create a polygenic score, ~1.2% of the variance in general cognitive function in an independent sample (GS). Pathway and network analyses did not produce significant findings. None of the 13 SNPs achieving genome-wide significance were coding variants. However, functional annotation provided evidence of regulatory elements for seven SNPs, suggesting that they might have a functional non-protein coding effect.
The 19q13.32 region, which includes the APOE ɛ2/3/4 haplotype and was associated with general cognitive function in this study, has previously been associated with cognitive phenotypes in old age,55, 56, 57, 58 AD42, 59, 60, 61 and non-pathological cognitive aging.28, 29 Here we find that the APOE/TOMM40 region is also associated with general cognitive function in middle and older age. The only published GWAS of general cognitive function in older age, to date, did not report any significant APOE/TOMM40 findings.30 From the data presented here, it is not possible to identify a single SNP or gene within this region that is driving the association, as it is a gene-dense region that is known to exhibit a strong pattern of linkage disequilibrium. Davies et al.28 performed a fine-mapping analysis of this region, which indicated that the observed association with non-pathological cognitive aging was being driven by APOE-based variation.28 A functional analysis of the APOE locus, including TOMM40, found that multiple APOE locus cis-regulatory elements influence both APOE and TOMM40 promoter activity.62 Functional annotation of the top SNP (rs10119) in the present study demonstrated evidence of regulatory elements, indicating regions of active transcription and epigenetic modification. All of these factors suggest that there is a complex transcriptional regulatory structure modulating regional gene expression at the APOE/TOMM40 locus. Drawing on the large number of cohorts in the present paper, we found a significant correlation between mean cohort age and the effect size of rs10119 (located in the APOE/TOMM40 region) on general cognitive function in older age. The effect was near to zero at younger mean ages and larger at older ages. This might help toward resolving some of the uncertainty about age moderation of APOE ɛ4 status associations with cognitive function that are based on single studies.63, 64, 65
The 14q12 region identified by the meta-analysis contains both the AKAP6 and NPAS3 genes. AKAP6 (A kinase (PRKA) anchor protein 6) is highly expressed in various brain regions, and cardiac and skeletal muscle. It is specifically localized to the sarcoplasmic reticulum and nuclear membrane, and is involved in anchoring protein kinase A to the nuclear membrane or sarcoplasmic reticulum.66 SNPs within this gene have tentatively been associated with several diseases/traits by GWAS: rs4296166 has been associated with risk of AD at a suggestive level of significance,67 rs2383378 was suggestively associated with anorexia nervosa,68 rs2300835 was associated with fasting insulin levels in the discovery sample of a GWAS of fasting glycemic traits, but failed to replicate,69 and rs1951681 and rs3784178 were suggestively associated with economic and political preferences (environmentalism).70
NPAS3 (neuronal PAS domain protein 3) encodes a member of the basic helix-loop-helix and PAS domain-containing family of transcription factors that is expressed in the brain. The encoded protein is localized to the nucleus and may regulate genes involved in neurogenesis.71 It has been associated with brain development and potentially human brain evolution.72 A balanced reciprocal translocation t(9,14)(q34.2;q13) that disrupts NPAS3 was identified in a mother and daughter with schizophrenia and schizophrenia co-morbid with mild learning disability, respectively.73, 74 A GWAS has since reported three coding SNPs (rs12434716, c.1654G>C, P=0.009; rs10141940, c.2208C>T, P=0.01; and rs10142034, c.2262C>G, P=0.01) to be associated with schizophrenia.75
The genome-wide significant SNPs in the 6q16.1 region are located ~100 kb downstream of an uncharacterized microRNA gene MIR2113. This region has been previously associated with bipolar disorder (rs12202969, P=1.08 × 10−8)76 and educational attainment.35 Evidence of regulatory elements associated with six of the top SNPs in this region, within normal brain-related tissues and cell lines, was identified using the Regulome DB database. The regulatory elements identified include histone modifications, DNase hypersensitive sites and position weight matrix sites. This evidence suggests that the associated SNPs are in sites of active transcription and could have a regulatory role on transcription.
The significant finding from the gene-based analysis, HMGN1 (high mobility group nucleosome-binding domain 1), encodes a nucleosome-binding protein that is associated with transcriptionally active chromatin.77 HMGN1 negatively regulates methyl CpG-binding protein 2 (MeCP2), a DNA-binding protein that is mutated in the neurodevelopmental disorder Rett syndrome78 and is known to affect neurological functions including X-linked mental retardation, various autism spectrum disorders in humans and the behavior of mice. It is overexpressed in Down syndrome and it has been suggested that epigenetic changes resulting from altered HMGN1 levels could have a role in the etiology of several neurodevelopmental disorders including autism and Down syndrome.79
Four of the 29 genes previously reported to be associated with AD38, 39, 40, 41, 42, 44 or neuropathological features of AD and related dementias43 were associated with general cognitive function (at P<0.01). These were TOMM40, APOE, MEF2C and ABCG1. These results suggest that there is overlap between the genetic contribution to ‘normal’ and ‘pathological’ cognitive variation in older age. A polygenic prediction analysis using a large published GWAS of AD39 significantly predicted only 0.19% of the phenotypic variation in general cognitive function in GS. However, Harris et al.80 reported no significant association of polygenic score for AD with general cognitive ability when using smaller sample sizes for both the creation of the polygenic score and the prediction analysis. All known cases of clinical dementia were removed from the contributing cohorts. Of course, some or all of the effects we found could be driven by the inadvertent inclusion of individuals in a prodromal stage of dementia, and that we have picked up the genetic effects on this. This is an important issue that is impossible to eliminate entirely. Ideally, one would wish to know, prospectively, which people in the current cohorts eventually developed dementia and then re-run the analyses after omitting them. However, some people will die or be lost to contact before such an assessment could be made, thus preventing complete ascertainment. On the other hand, it is possible to envisage a study that tracks individuals and re-analyses those who have kept in contact and those who do not, say over a 10-year period, develop dementia. This could clarify whether the effects we found here were on the normal range of age-related cognitive change. The present study includes individuals within and beyond the ninth decade. It is also important to note that the cognitive phenotype we measured is a composite of people’s stable trait levels and any age-related change that has occurred. Therefore, genetic effects might be contributing to either.
The results of this study (SNP and gene-based) were compared with those of previously published large GWAS of educational attainment35 and general cognitive function in childhood.36 Before discussing these results further, it should be noted that there is sample overlap between both of these studies and the current study (overlaps are: educational attainment, N~30 000; childhood general cognitive function, N~1500). Around 50% of the suggestive SNPs from the current study are nominally significant for educational attainment and the 6q16.1 region is reported to be genome-wide significant for both general cognitive function and educational attainment. The gene-based findings also demonstrated some consistency, with more than half of the top 25 genes for years of education and college completion achieving nominal significance in the current study. Of the top 100 SNPs for childhood cognitive function, 11 achieved nominal significance in the current study along with eight of the top 20 gene-based findings. These findings suggest that there is overlap between the genetic contribution to general cognitive function in late-middle and older age, and both educational attainment and childhood general cognitive function. This has also been explored in a study, which used education as a proxy phenotype for general cognitive function.81 The bivariate heritability of educational attainment and general cognitive function has been previously estimated in GS using both pedigree-based and SNP-based methods (biv h2=0.78, N~20 500 and biv h2=0.59, N~6600, respectively).82 The genetic correlation and bivariate heritability of childhood and older age general cognitive functions have also been previously estimated in a relatively small sample (N~1900), (rg=0.62; biv SNP h2=0.21).83
This study provides further evidence that general cognitive function is heritable and under polygenic control. These findings are consistent with, and add considerably to those from the Cognitive Ageing in Genetics in England and Scotland consortium.30 The early deviation from the expected distribution in the QQ plot could be indicative of two outcomes; first, polygenic effects and, second, population stratification. All of the individual cohorts applied the required number of principal component analysis/multidimensional scaling components to their initial association analyses to adjust for stratification that may be present within the individual cohorts. None of the individual cohorts demonstrate both an early deviation and an inflated λ-value, which would indicate that stratification may be present (Supplementary Figure S2). Single genomic control was implemented in the meta-analysis. The proportion of phenotypic variance explained by all SNPs is an estimate of the lower bound of the narrow sense heritability. The estimates calculated from the ARIC and HRS cohorts suggest that 29% (s.e.=0.05) and 28% (s.e.=0.07), respectively, of the variation in general cognitive function can be attributed to common SNPs that are in linkage disequilibrium with causal variants in these cohorts. Whereas these estimates are lower than a previously-published estimate from the Cognitive Ageing in Genetics in England and Scotland consortium in a smaller sample (51%; s.e.=0.11; N=3511),30 they are slightly higher than an estimate reported from a similar sample size in GS (21%; s.e.=0.05; N=6648).82 To date, these are the only GCTA-based estimates of the general cognitive function phenotype in unrelated early-middle to older age individuals. The sample-size weighted mean of these estimates is 0.30 (Figure 4). It should be noted that these four estimates were not based on the same set of common SNPs, as only directly genotyped SNPs were included, and that the general cognitive function phenotypes in each cohort were constructed using a different set of cognitive tests. Therefore, the mean of 0.30 is likely to be an underestimate.
Whereas the combined sample size in the present report is a strength, the results observed here suggest that an even larger sample size is required in order to seek replication of the findings of the present study and to identify more genome-wide significant findings. For this reason, we chose to present a single discovery meta-analysis of the largest combined sample available. If general cognitive function is similar to other complex traits, the individual effects of common SNPs will be very small. From studies of other polygenic complex traits, it has been observed that the number of discovered variants is strongly correlated with experimental sample size.31 This predicted increase in detectable associations for complex traits, when sample sizes increase, has been observed in GWASs of both height—a study of 183 727 individuals reported 180 significant associations, of which >100 were novel loci compared with previous studies of fewer individuals (N<40 000)—and schizophrenia, in which a study of 21 246 cases and 38 072 controls reported 22 significant associations, of which 13 were novel loci compared with previous studies of fewer individuals (N<18 000 cases).84, 85 This is also demonstrated in the present study when compared with the previously published Cognitive Ageing in Genetics in England and Scotland consortium study (N=3511), which reported no genome-wide significant SNP associations with general cognitive function in older age.30
Phenotypic heterogeneity is a limitation of this study. Each cohort used a different set of cognitive tests to create the general cognitive function phenotype. We demonstrated using the Lothian Birth Cohort 1936 cohort that two general cognitive function phenotypes calculated in the same set of individuals but using a different battery of tests for the principal component analysis are highly (~0.8) but imperfectly correlated. This is consistent with the finding that general cognitive function phenotypes derived from different test batteries are highly correlated and measurement of this phenotype is not dependent on the use of specific cognitive tasks.32, 33 Given, especially, that a relatively small number of tests contributed to the general cognitive component in some of the cohorts, the heterogeneity of the phenotype will be a limitation on the discovery of genetic effects.
In order to dissect the regions of association identified in the present study further, deep sequencing of these regions in a larger sample would be required followed by in-depth functional genomics studies. This type of approach may elucidate the mechanisms underlying the observed associations and identify the causal variants within these associated genomic regions.
In conclusion, we report the largest meta-analysis of GWAS studies, to date, of fluid general cognitive function in middle and older age. We also report results showing that general cognitive function is heritable and highly polygenic, extending findings of previous studies involving general cognitive function in older individuals. We show genome-wide significant SNP-based associations within three genomic regions 6q16.1 (MIR2113), 14q12 (AKAP6/NPAS3 region) and 19q13.32 (TOMM40/APOE region), and a genome-wide significant gene-based association with the HMGN1 gene located on chromosome 21. The 19q13.32 region has long been associated with AD and more recently was associated with non-pathological cognitive aging;28 the 6q16.1, 14q12 and HMGN1 regions contain genes associated with development of the brain,78, 79 neurological function,71, 72, 79 psychiatric disease73, 74, 75, 76 and educational attainment.35
Vallejo AN . Immunological hurdles of ageing: indispensable research of the human model. Ageing Res Rev 2011; 10: 315–318.
Laurent S . Defining vascular aging and cardiovascular risk. J Hypertens 2012; 30: S3–S8.
Vaz Fragoso A, Gill TM . Respiratory impairment and the aging lung: a novel paradigm for assessing pulmonary function. J Gerontol Ser A Biol Sci Med Sci 2012; 67: 264–275.
Bolignano D, Mattace-Raso F, Sijbrands EJG, Zoccali C . The aging kidney revisited: a systematic review. Ageing Res Rev 2014; 14: 65–80.
Ferrari E, Magri F . Role of neuroendocrine pathways in cognitive decline during aging. Ageing Res Rev 2008; 7: 225–233.
Lindenberger U, Baltes PB . Sensory functioning and intelligence in old age: a strong connection. Psychol Aging 1994; 9: 339–355.
Pathai S, Shiels PG, Lawn SD, Cook C, Gilbert C . The eye as a model of ageing in translational research: molecular, epigenetic and clinical aspects. Ageing Res Rev 2013; 12: 490–508.
Hardy R, Cooper R, Aihie Sayer A, Ben-Shlomo Y, Cooper C, Deary IJ et al. Body mass index, muscle strength, and physical performance in older adults from eight cohort studies: the HALCyon programme. PLoS One 2013; 8: e56483.
Salthouse T . Localizing age-related individual differences in a hierarchical structure. Intelligence 2004; 32: 541–561.
Schaie KW . Developmental Influences on Adult Intelligence. Oxford University Press: New York, USA, 2005.
Finkel D, Reynolds CA, McArdle JJ, Pedersen NL . Cohort differences in trajectories of cognitive ageing. J Gerontol Ser B Psychol Sci Soc Sci 2007; 62B: P286–P294.
Horn JL . Theory of fluid and crystallised intelligence. In: Sternberg RJ (ed). Encyclopedia of Human Intelligence. Macmillan: New York, NY, USA, 1994.
Salthouse T . When does age-related cognitive decline begin? Neurobiol Aging 2009; 30: 507–514.
Josefsson M, de Luna X, Pudas S, Nilsson L-G, Nyberg L . Genetic and lifestyle predictors of 15-year longitudinal change in episodic memory. JAGS 2012; 60: 2308–2312.
Craik FIM, Salthouse TA . The Handbook of Aging and Cognition 3rd Edition, Psychology Press: New York, USA, 2008.
Hofer SM, Alwin DF . Handbook of Cognitive Aging. Sage: Los Angeles, USA, 2008.
Carroll JB . Human Cognitive Abilities: A Survey of Factor-Analytic Studies. Cambridge University Press: Cambridge, UK, 1993.
Deary IJ, Penke L, Johnson W . The neuroscience of human intelligence differences. Nat Rev Neurosci 2010; 11: 201–211.
Tucker-Drob E . Differentiation of cognitive abilities across the life span. Dev Psychol 2009; 45: 1097–1118.
Tucker-Drob E . Global and domain-specific changes in cognition throughout adulthood. Dev Psychol 2011; 47: 331–343.
Petrill SA, Plomin R, Berg S, Johansson B, Pedersen NL, Ahern F et al. The genetic and environmental relationship between general and specific cognitive abilities in twins age 80 and older. Psychol Sci 1998; 9: 183–189.
Deary IJ, Johnson W, Houlihan L . Genetic foundations of human intelligence. Hum Genet 2009; 126: 215–232.
Murray C, Pattie A, Starr JM, Deary IJ . Does cognitive ability predict mortality in the ninth decade? The Lothian Birth Cohort 1921. Intelligence 2012; 40: 490–498.
Shipley BA, Der G, Taylor MD, Deary IJ . Association between mortality and cognitive change over 7 years in a large representative sample of UK residents. Psychosom Med 2007; 69: 640–650.
Calvin CM, Deary IJ, Fenton C, Roberts B, Der G, Leckenby N et al. Intelligence in youth and all-cause mortality: systematic review with meta-analysis. Int J Epidemiol 2011; 40: 626–644.
Tucker-Drob E . Neurocognitive functions and everyday functions change together in old age. Neuropsychology 2011; 25: 368–377.
Payton A . The impact of genetic research on our understanding of normal cognitive ageing: 1995 to 2009. Neuropsychol Rev 2009; 19: 451–477.
Davies G, Harris SE, Reynolds CA, Payton A, Knight HM, Liewald DC et al. A genome-wide association study implicates the APOE locus in non-pathological cognitive ageing. Mol Psychiatr 2014; 19: 76–87.
Zhang C, Perce BL . Genetic susceptibility to accelerated cognitive decline in the US Health and Retirement Study. Neurobiol Aging 2013; 35 1512: e18.
Davies G, Tenesa A, Payton A, Yang J, Harris SE, Liewald DC et al. Genome-wide association studies establish that human intelligence is highly heritable and polygenic. Mol Psychiatr 2011; 16: 996–1005.
Visscher PM, Brown MA, McCarthy MI, Yang J . Five years of GWAS discovery. Am J Hum Genet 2012; 90: 7–24.
Johnson W, Bouchard TJ, Krueger RF, McGue M, Gottesman II . Just one g: consistent results from three test batteries. Intelligence 2004; 32: 95–107.
Johnson W, te Nijenhuis J, Bouchard T Jr . Still just 1 g: Consistent results from five test batteries. Intelligence 2008; 36: 81–95.
Deary IJ, Gow AJ, Taylor MD, Corley J, Brett C, Wilson V et al. The Lothian Birth Cohort 1936: a study to examine influences on cognitive ageing from age 11 to age 70 and beyond. BMC Geriatr 2007; 7: 28.
Rietveld CA, Medland SE, Derringer J, Yang J, Esko T, Martin NW et al. GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science 2013; 340: 1467–1471.
Benyamin B, Pourcain B, Davis OS, Davies G, Hansell NK, Brion MJ et al. Childhood intelligence is heritable, highly polygenic and associated with FNBP1L. Mol Psychiatry 2014; 19: 253–258.
Liu JZ, McRae AF, Nyholt DR, Medland SE, Wray NR, Brown KM et al. A versatile gene-based test for genome-wide association studies. Am J Hum Genet 2010; 87: 139–145.
Lambert JC, Heath S, Even G, Campion D, Sleegers K, Hiltunen M et al. Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer's disease. Nat Genet 2009; 41: 1094–1099.
Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease. Nat Genet 2013; 45: 1452–1458.
Seshadri S, Fitzpatrick AL, Ikram MA, DeStefano AL, Gudnason V, Boada M et al. Genome-wide analysis of genetic loci associated with Alzheimer disease. JAMA 2010; 303: 1832–1840.
Hollingworth P, Harold D, Sims R, Gerrish A, Lambert JC, Carrasquillo MM et al. Common variants at ABCA7 MS4A6A/MS4A4E EPHA1 CD33 and CD2AP are associated with Alzheimer's disease. Nat Genet 2011; 43: 429–435.
Harold D, Abraham R, Hollingworth P, Sims R, Gerrish A, Hamshere ML et al. Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's disease. Nat Genet 2009; 41: 1088–1093.
Beecham GW, Hamilton K, Naj AC, Martin ER, Huentelman M, Myers AJ et al. Genome-wide association meta-analysis of neuropathologic features of Alzheimer’s disease and related dementias. PLoS Genet 2014; 10: e1004606.
Bertram L, Lange C, Mullin K, Parkinson M, Hsiao M, Hogan MF et al. Genome-wide association analysis reveals putative Alzheimer’s disease susceptibility loci in addition to APOE. Am J Hum Genet 2008; 83: 623–632.
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet 2007; 81: 559–575.
Deary IJ, Weiss A, Batty GD . Intelligence and personality as predictors of illness and death: How researchers in differential psychology and chronic disease epidemiology are collaborating to understand and address health inequalities. Psychol Sci Publ Interest 2010; 11: 53–79.
Wechsler D . WAIS-IIIUK Administration and Scoring Manual. Psychological Corporation: London, UK, 1998.
Wechsler D . WMS-IIIUK Administration and Scoring Manual. Psychological Corporation: London, UK, 1998.
Lezak M . Neuropsychological Assessment. Oxford University Press: Oxford, UK, 1995.
Raven JC, Court JH, Raven J . Manual for Raven’s Progressive Matrices and Vocabulary Scales. Lewis HK: London, UK, 1977.
Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR et al. Common SNPs explain a large proportion of the heritability for human height. Nat Genet 2010; 42: 565–569.
Lee PH, O'Dushlaine C, Thomas B, Purcell SM . INRICH: interval-based enrichment analysis for genome-wide association studies. Bioinformatics 2012; 28: 2.
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res 2012; 22: 1790–1797.
Kang HJ, Kawasawa YI, Cheng F, Zhu Y, Xu X, Li M et al. Spatio-temporal transcriptome of the human brain. Nature 2011; 478: 483–489.
Wisdom NM, Callahan JL, Hawkins KA . The effects of apolipoprotein E on non-impaired cognitive functioning: a meta-analysis. Neurobiol Aging 2011; 32: 63–74.
De Jager PL, Shulman JM, Chibnik LB, Keenan BT, Raj T, Wilson RS et al. A genome-wide scan for common variants affecting the rate of age-related cognitive decline. Neurobiol Aging 2011; 33: 15.
Small BJ, Rosnick CB, Fratiglioni L, Backman L, Apolipoprotein E . and cognitive performance: a meta-analysis. Psychol Aging 2004; 19: 592–600.
Deary IJ, Whiteman MC, Pattie A, Starr JM, Hayward C, Wright AF et al. Cognitive change and the APOE e4 allele. Nature 2002; 418: 932.
Roses AD, Lutz MW, Amrine-Madsen H, Saunders AM, Crenshaw DG, Sundseth SS et al. A TOMM40 variable-length polymorphism predicts the age of late-onset Alzheimer’s disease. Pharmacogenomics J 2009; 10: 375–384.
Corder EH, Saunders AM, Strittmatter WJ, Schmechel DE, Gaskell PC, Small GA et al. Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer’s disease in late onset families. Science 1993; 261: 921–923.
Strittmatter WJ, Saunders AM, Schmechel D, Pericak-Vance M, Enghild J, Salvesen GS et al. Apolipoprotein E: high avidity binding to beta-amyloid and increased frequency of type 4 allele in late-onset familial Alzheimer’s disease. Proc Natl Acad Sci USA 1993; 90: 1977–1981.
Bekris LM, Lutz F, Yu C . Functional analysis of APOE locus genetic variation implicates regional enhancers in the regulation of both TOMM40 and APOE. J Hum Genet 2011; 57: 18–25.
Bunce D, Bielak AAM, Anstey KJ, Cherbuin N, Batterham PJ, Easteal S . APOE genotype and cognitive change in young, middle-aged and older adults living in the community. J Gerontol A Biol Sci Med Sci 2013; 69: 379–386.
Nilsson LG, Adolfsson R, Bäckman L, Cruts M, Nyberg L, Small BJ et al. The influence of APOE status on episodic and semantic memory: data from a population-based study. Neuropsychology 2006; 20: 645–657.
Blair CK, Folsom AR, Knopman DS, Bray MS, Mosley TH, Boerwinkle E . APOE genotype and cognitive decline in a middle-aged cohort. Neurology 2005; 64: 268–276.
Kapiloff MS, Schillace RV, Westphal AM, Scott JD . mAKAP: an A-kinase anchoring protein targeted to the nuclear membrane of differentiated myocytes. J Cell Sci 1999; 112: 2725–2736.
Seshadri S, Fitzpatrick AL, Ikram MA, DeStefano AL, Gudnason V, Boada M et al. Genome-wide analysis of genetic loci associated with Alzheimer disease. JAMA 2010; 303: 1832–1840.
Wang K, Zhang H, Bloss CS, Duvvuri V, Kaye W, Schork NJ et al. A genome-wide association study on common SNPs and rare CNVs in anorexia nervosa. Mol Psychiatry 2011; 16: 949–959.
Manning AK, Hivert MF, Scott RA, Grimsby JL, Bouatia-Naji N, Chen H et al. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nat Genet 2012; 44: 659–669.
Benjamin DJ, Cesarini D, van der Loos MJHM, Dawes CT, Koellinger PD, Magnusson PKE et al. The genetic architecture of economic and political preferences. Proc Natl Acad Sci USA 2012; 109: 8026–8031.
Ledent V, Paquet O, Vervoort M . Phylogenetic analysis of the human basic helix-loop-helix proteins. Genome Biol 2002; 3: RESEARCH0030.
Kamm GB, Pisciottano F, Kliger R, Franchini LF . The developmental brain gene NPAS3 contains the largest number of accelerated regulatory sequences in the human genome. Mol Biol Evol 2013; 30: 1088–1102.
Kamnasaran D, Muir WJ, Ferguson-Smith MA, Cox DW . Disruption of the neuronal PAS3 gene in a family affected with schizophrenia. J Med Genet 2003; 40: 325–332.
Pickard BS, Malloy MP, Porteous DJ, Blackwood DH, Muir WJ . Disruption of a brain transcription factor, NPAS3, is associated with schizophrenia and learning disability. Am J Med Genet B Neuropsychiatr Genet 2005; 136B: 26–32.
Macintyre G, Alford T, Xiong L, Rouleau GA, Tibbo PG, Cox DW . Association of NPAS3 exonic variation with schizophrenia. Schizophr Res 2010; 120: 143–149.
Mühleisen TW, Leber M, Schulze TG, Strohmaier J, Degenhardt F, Treutlein J et al. Genome-wide association study reveals two new risk loci for bipolar disorder. Nat Commun 2014; 11: 3339.
Postnikov Y, Bustin M . Regulation of chromatin structure and function by HMGN proteins. Biochim Biophys Acta 2010; 1799: 62–68.
Amir R, Van den Veyver IB, Wan M, Tran CQ, Francke U, Zoghbi HY . Rett syndrome is caused by mutations in X-linked MECP2, encoding methyl-CpG-binding protein 2. Nat Genet 1999; 23: 185–188.
Abuhatzira L, Shamir A, Schones DE, Schäffer AA, Bustin M . The chromatin-binding protein HMGN1 regulates the expression of methyl CpG-binding protein 2 (MECP2 and affects the behavior of mice. J Biol Chem 2011; 286: 42051–42062.
Harris SE, Davies G, Luciano M, Payton A, Fox HC, Haggarty P et al. Polygenic risk for Alzheimer’s disease is not associated with cognitive ability or cognitive aging in non-demented older people. J Alzheimers Dis 2014; 39: 565–574.
Rietveld CA, Esko T, Davies G, Pers TH, Turley P, Benyamin B et al. Common genetic variants associated with cognitive performance identified using the proxy-phenotype method. Proc Natl Acad Sci USA 2014; 111: 13790–13794.
Marioni RE, Davies G, Hayward C, Liewald D, Kerr SM, Campbell A et al. Molecular genetic contributions to socioeconomic status and intelligence. Intelligence 2014; 44: 26–32.
Deary IJ, Yang J, Davies G, Harris SE, Tenesa A, Liewald D et al. Genetic contributions to stability and change in intelligence from childhood to old age. Nature 2012; 482: 212–215.
Lango Allen H, Estrada K, Lettre G, Berndt SI, Weedon MN, Rivadeneira F et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height. Nature 2010; 467: 832–838.
Ripke S, O'Dushlaine C, Chambert K, Moran JL, Kähler AK, Akterin S et al. Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nat Genet 2013; 45: 1150–1159.
We thank Peter M Visscher for advice on statistical genetic analyses and for comments on earlier versions of the manuscript, and Stuart J Ritchie for assistance with figures. For cohort acknowledgements, see Supplementary Information 1 Section 3.
Bruce M Psaty serves on a DSMB for a clinical trial of a device funded by the manufacturer (Zoll LifeCor). All other authors declare no conflict of interest.
Supplementary Information accompanies the paper on the Molecular Psychiatry website
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Davies, G., Armstrong, N., Bis, J. et al. Genetic contributions to variation in general cognitive function: a meta-analysis of genome-wide association studies in the CHARGE consortium (N=53 949). Mol Psychiatry 20, 183–192 (2015). https://doi.org/10.1038/mp.2014.188
This article is cited by
Genome-wide association study of brain biochemical phenotypes reveals distinct genetic architecture of Alzheimer’s disease related proteins
Molecular Neurodegeneration (2023)
Cataloging the potential SNPs (single nucleotide polymorphisms) associated with quantitative traits, viz. BMI (body mass index), IQ (intelligence quotient) and BP (blood pressure): an updated review
Egyptian Journal of Medical Human Genetics (2022)
Interplay between polygenic propensity for ageing-related traits and the consumption of fruits and vegetables on future dementia diagnosis
BMC Psychiatry (2022)
Shaking up the silence: consequences of HMGN1 antagonizing PRC2 in the Down syndrome brain
Epigenetics & Chromatin (2022)
A correlation map of genome-wide DNA methylation patterns between paired human brain and buccal samples
Clinical Epigenetics (2022)