Genome-wide association study of school grades identifies genetic overlap between language ability, psychopathology and creativity

Rajagopal, Veera M.; Ganna, Andrea; Coleman, Jonathan R. I.; Allegrini, Andrea; Voloudakis, Georgios; Grove, Jakob; Als, Thomas D.; Horsdal, Henriette T.; Petersen, Liselotte; Appadurai, Vivek; Schork, Andrew; Buil, Alfonso; Bulik, Cynthia M.; Bybjerg-Grauholm, Jonas; Bækvad-Hansen, Marie; Hougaard, David M.; Mors, Ole; Nordentoft, Merete; Werge, Thomas; Mortensen, Preben Bo; Breen, Gerome; Roussos, Panos; Plomin, Robert; Agerbo, Esben; Børglum, Anders D.; Demontis, Ditte

doi:10.1038/s41598-022-26845-0

Download PDF

Article
Open access
Published: 09 January 2023

Genome-wide association study of school grades identifies genetic overlap between language ability, psychopathology and creativity

Veera M. Rajagopal^1,2,3,4,
Andrea Ganna^5,6,7,
Jonathan R. I. Coleman^8,9,
Andrea Allegrini⁸,
Georgios Voloudakis^10,11,12,
Jakob Grove^1,2,3,4,13,
Thomas D. Als^1,2,3,4,
Henriette T. Horsdal^2,14,
Liselotte Petersen^2,14,
Vivek Appadurai^2,15,
Andrew Schork^2,15,16,
Alfonso Buil^2,15,
Cynthia M. Bulik^17,18,19,
Jonas Bybjerg-Grauholm^2,20,
Marie Bækvad-Hansen^2,20,
David M. Hougaard^2,20,
Ole Mors^2,21,
Merete Nordentoft^2,22,23,
Thomas Werge^2,15,23,24,
iPSYCH-Broad Consortium,
Preben Bo Mortensen^2,4,14,25,
Gerome Breen^8,9,
Panos Roussos^10,11,12,
Robert Plomin⁸,
Esben Agerbo^2,3,25,
Anders D. Børglum^1,2,3,4 &
…
Ditte Demontis^1,2,3,4

Scientific Reports volume 13, Article number: 429 (2023) Cite this article

13k Accesses
4 Citations
185 Altmetric
Metrics details

Subjects

Abstract

Cognitive functions of individuals with psychiatric disorders differ from that of the general population. Such cognitive differences often manifest early in life as differential school performance and have a strong genetic basis. Here we measured genetic predictors of school performance in 30,982 individuals in English, Danish and mathematics via a genome-wide association study (GWAS) and studied their relationship with risk for six major psychiatric disorders. When decomposing the school performance into math and language-specific performances, we observed phenotypically and genetically a strong negative correlation between math performance and risk for most psychiatric disorders. But language performance correlated positively with risk for certain disorders, especially schizophrenia, which we replicate in an independent sample (n = 4547). We also found that the genetic variants relating to increased risk for schizophrenia and better language performance are overrepresented in individuals involved in creative professions (n = 2953) compared to the general population (n = 164,622). The findings together suggest that language ability, creativity and psychopathology might stem from overlapping genetic roots.

The impact of exercise on gene regulation in association with complex trait genetics

Article Open access 01 May 2024

Genome-wide association studies

Article 26 August 2021

Sleep quality, duration, and consistency are associated with better academic performance in college students

Article Open access 01 October 2019

Introduction

Psychiatric disorders are common and have a complex aetiology with contributions from both genetic and environmental factors¹. They are typically characterized by an early age of onset². While some disorders emerge during childhood, for example, autism spectrum disorder (ASD) and attention deficit hyperactivity disorder (ADHD), some emerge during adolescence or early adulthood, for example, schizophrenia (SCZ), bipolar disorder (BD) and anorexia nervosa (AN). Even long before the symptoms manifest, individuals often show signs of psychopathology³. Several epidemiological studies have found atypical school performance as a risk factor for psychiatric disorders. Poor school performance has been shown as a risk factor for SCZ³ whereas excellent school performance for BD⁴. Atypical school performance has been reported also in unaffected children and siblings of psychiatric patients suggesting that genetic factors could be involved^5,6.

Large-scale GWASs have been conducted for many major psychiatric disorders to date^{7,8,9,10,11,12}, demonstrating a complex genetic architecture and strong genetic correlations with a broad range of phenotypes, including cognitive phenotypes such as educational attainment¹³ and intelligence¹⁴. We have previously reported a strong negative genetic correlation between ADHD⁷ and educational attainment and a moderate positive genetic correlation between ASD⁸ and educational attainment in the largest GWASs of ADHD⁷ and ASD⁸ respectively. Likewise, a GWAS has reported a strong positive genetic correlation between AN and educational attainment¹².

For most of the psychiatric disorders, the genetic correlations with educational attainment align with the corresponding phenotypic associations with school performance reported in epidemiological studies^15,16,17. For some disorders such as SCZ, however, the genetic correlations contradict phenotypic associations¹⁸. Although clinical and epidemiological studies have documented that individuals with—or at risk for—SCZ perform poorly in school¹⁹ and score low in neurocognitive assessments²⁰, the genetic correlation of SCZ with educational attainment is not negative, but rather positive, albeit weak¹⁸. Similarly, although BD has been shown to be associated with cognitive deficits²¹, it shows a positive genetic correlation with educational attainment¹⁰. Such findings indicate that heterogeneity exists in the genetic overlap between psychiatric disorders and cognitive phenotypes. The current large GWASs of educational attainment¹³ and intelligence¹⁴ do not inform about what causes such heterogeneities. This could be due to that educational attainment and intelligence are composite measures and capture multiple cognitive domains each of which correlates differently with psychiatric disorders (some correlating positively and some negatively). Also, it has been suggested that the cognitive component in educational attainment that correlates positively with disorders such as SCZ and BD might represent creativity²², as creativity has been shown to associate positively with these disorders^21,22. However, it is not clear which cognitive domain specifically corresponds to creativity. Given these backgrounds, performing GWASs of specific cognitive domains and studying their genetic associations with psychiatric disorders and creativity might offer significant insights into the complex relationship between educational attainment, psychiatric risk and creativity.

Compared to educational attainment, school grades offer fine-grained information. For example, grades in language and math subjects might serve as proxies for verbal and numerical cognition respectively. However, no large-scale GWASs of subject-specific school grades have been conducted so far. The existing GWASs of school grades were based on small sample sizes insufficient to study genetic overlap with the psychiatric disorders^23,24.

Here we present the largest GWAS of school grades to date disentangling the polygenic architecture of various domains of school performance and their complex phenotypic and genetic relationships with six major psychiatric disorders (ADHD, ASD, SCZ, BD, major depressive disorder (MDD) and AN). The overall study design is shown in Supplementary Fig. 1. Our discovery sample comes from iPSYCH²⁵ and the Anorexia Nervosa Genetics Initiative (ANGI)²⁶, large population-based Danish cohorts of individuals with and without psychiatric disorders for whom information on school grades was available through the Danish education register²⁷. Using principal component analysis (PCA), we decomposed the school grades in Danish, English and mathematics into orthogonal principal components (hereafter, E-factors) thereby capturing distinct cognitive domains. The GWASs of the E-factors (E1, E2, E3 and E4) identified multiple genome-wide significant loci. Among the E-factors, E1 correlated the most with educational attainment²⁸ and intelligence¹⁴. E2 captured differences between language and math performances and showed positive phenotypic and genetic correlations with almost all the psychiatric disorders, thereby shedding light on the differential relationship of psychiatric disorders with math and language cognitive domains. In addition, E2 showed a positive genetic association with creativity, suggesting a shared genetic basis between language ability and creativity.

Results

Sample characteristics

Totally 30,982 individuals from iPSYCH and ANGI were studied, of which 18,495 (60%) had a diagnosis for at least one of the six psychiatric disorders, and 12,487 (40%) did not have any. The sample characteristics are shown in Table 1. The age of the study individuals (as of Dec 2016) ranged between 15 to 38 years (mean = 24.2; SD = 4.2). There were 14,606 males and 16,376 females. The school grades were from the exit exam (or ninth-level exam or FP9) given at the end of compulsory schooling in Denmark. Age at the time of examination (hereafter, exam age) ranged between 14.5 and 17.5 years (mean = 15.7; SD = 0.42). Grades in three subjects namely Danish, English and mathematics were analyzed. Individuals with psychiatric diagnoses (as of Dec 2016) were considered as cases irrespective of whether the diagnoses were given before or after the exit exam.

Table 1 Sample characteristics.

Full size table

Decomposition of school grades into cognitive domains

For each individual we had information on the following grades: Danish written, oral and grammar, English oral, mathematics written and oral (if sat for the exam before 2007), mathematics problem solving with help and problem solving without help (if sat for the exam after 2006; “Methods”). All the grades individually showed substantial heritability and strongly correlated with each other, both phenotypically and genetically (Supplementary Fig. 2). Individual GWASs of these grades may yield largely similar results and may inform little about the genetic overlap of distinct performance domains with psychiatric disorders. Hence, we decomposed the grades into latent factors (E-factors) that might represent distinct cognitive domains, using a principal component analysis (PCA). Since the mathematics exams were restructured in 2007, we performed PCA separately for grades given during 2002–2006 and 2007–2016 (“Methods”). We identified four informative E-factors that were reproducible between the two PCAs in terms of subject loadings and showed near-perfect genetic correlations (rg ~ 1) between the two datasets (Fig. 1a–c and Supplementary Table 1). After combining the two datasets, the four E-factors together explained 89.5% of the variance in the school grades (E1 = 56%; E2 = 13.5%; E3 = 10.5%; E4 = 8.5%).

We discuss in detail how to interpret the four E-factors based on their subject-specific loadings in the Supplementary Note. Briefly, E1 captured overall school performance (analogous to general cognitive ability factor (g) derived from a battery of cognitive tests²⁹), E2 captured language performance relative to math, E3 captured oral performance relative to written, and E4 captured Danish performance relative to English (Table 2). We also repeated the PCA only in the control individuals (N = 12,487) and found similar subject loadings to that of the main PCA (Supplementary Fig. 3). Hence, the subject loading pattern in the main PCA was not biased due to the inclusion of individuals with psychiatric disorders.

Table 2 Description of E-factors.

Full size table

GWAS of E-factors

We performed GWASs for the four E-factors using a genetically homogenous sample of unrelated Europeans that comprised both individuals with and without psychiatric disorders (“Methods”). Psychiatric diagnoses as well as sex and exam age were included in the covariates as they were all significantly associated with the E-factors (“Methods”; Supplementary Table 2 and Supplementary Note).

We identified seven genome-wide significant loci (P < 5 × 10^–8), of which four were associated with E1, two with E2, one with E3 and none with E4 (Supplementary Table 3; Supplementary Fig. 4; Supplementary Dataset 1). Among these, only three remained strictly significant (P < 1.2 × 10^–8) after adjusting for the four GWASs conducted. In a phenome-wide association study (see “Methods”) of the index variants in the seven loci, six were significantly associated with multiple cognitive phenotypes (Supplementary Table 4; Supplementary Dataset 2). Biological annotations of the seven loci are discussed in the Supplementary Note. The SNP-based heritability estimates were as follows: E1 = 0.29 (SE = 0.01; P < 1.0 × 10^–300), E2 = 0.18 (SE = 0.01; P < 1.0 × 10^–300), E3 = 0.13 (SE = 0.01; P < 1.0 × 10^–300) and E4 = 0.08 (SE = 0.01; P = 1.0 × 10^–13). We observed moderate levels of inflation in the GC lambda values, which were likely due to polygenicity rather than biases such as population stratification and cryptic relatedness as suggested by LD score regression analysis³⁰ (Supplementary Table 5).

Association of E-factors with educational attainment and intelligence

Among the four E-factors, E1 captured overall school performance and showed the strongest genetic correlations with educational attainment²⁸ (r_g = 0.90; SE = 0.03; P = 4.8 × 10^–198) and intelligence¹⁴ (r_g = 0.80; SE = 0.03; P = 3.3 × 10^–128) (Fig. 2a; Supplementary Table 6). We also studied the associations of polygenic scores for educational attainment and intelligence (“Methods”) with the E-factors. The polygenic scores showed strong associations with E1 and explained 8.3% (educational attainment) and 4.9% (intelligence) of the variance in E1 (Fig. 2b,c; Supplementary Table 7). The genetic associations of educational attainment and intelligence with E2, E3, and E4 were only modest (Fig. 2; Supplementary Tables 6, 7; Supplementary Note).

Phenotypic and genetic associations of E-factors with psychiatric disorders

Next, we evaluated the phenotypic and genetic associations of the E-factors with six psychiatric disorders (ADHD⁷, ASD⁸, SCZ³¹, BD³², MDD¹¹ and AN¹²). Phenotypic associations were evaluated by comparing the E-factor scores of each psychiatric disorder group against the controls. Genetic associations were evaluated using two approaches. First, we studied the genetic correlations of the E-factors with the six psychiatric disorders using LD score regression. Second, we constructed polygenic scores for the six psychiatric disorders in the iPSYCH cohort using variant effect sizes from past psychiatric GWASs and studied their associations with the E-factors only in the controls.

E1 (overall school performance) showed strong phenotypic associations with four out of the six psychiatric disorders analyzed (Fig. 3a; Supplementary Table 8). The average E1 scores of ADHD, MDD and SCZ were significantly lower than controls and that of AN was significantly higher than controls. The average E1 scores of ASD and BD did not differ significantly from the controls. E1 showed significant negative genetic correlations with ADHD and MDD, but not with SCZ (Fig. 3b; Supplementary Table 9). Despite having a strong negative phenotypic association with SCZ, E1 showed a weak positive genetic correlation with SCZ, which was not statistically significant. Finally, E1 showed significant positive genetic correlations with ASD, BD and AN. Associations of polygenic scores for the six psychiatric disorders with the E-factors only in the controls recapitulated the genetic correlation findings (Fig. 3c; Supplementary Table 10).

E2 (language performance relative to math) showed significant phenotypic associations with all six psychiatric disorders (Fig. 3a; Supplementary Table 8). The average E2 scores of all six disorder groups were substantially higher than controls (meaning cases performed better in language relative to math compared to controls). Similar to the phenotypic associations, E2 showed positive genetic correlations with all six disorders (Fig. 3b; Supplementary Table 9), though statistical significance was achieved only for SCZ (r_g = 0.20; SE = 0.04; P = 2.1 × 10^–5) and MDD (r_g = 0.28; SE = 0.06; P = 1.1 × 10^–5). Associations of psychiatric polygenic scores only in the controls recapitulated the genetic correlation findings (Fig. 3c; Supplementary Table 10). The results overall suggested that a better performance in language relative to math was seen in all six psychiatric disorders and this differential performance seems to have a strong genetic basis, particularly in SCZ and MDD.

Although E3 (oral performance relative to written) and E4 (Danish performance relative to English) showed significant phenotypic associations with many of the psychiatric disorders, these associations were not supported by the genetic correlation and polygenic score analysis, which could be due to a lack of power as they explained relatively smaller amounts of variances in the school grades (Fig. 3; Supplementary Tables 8, 9, 10; Supplementary Note).

Association of language and math performances with psychiatric disorders

While the interpretation of E1 findings is straightforward, the interpretation of E2 findings is not. Observing higher E2 scores in cases compared to controls does not always imply that language grades in cases are higher than controls and math grades are lower than controls. Even when both the language and math grades are lower in cases than in controls, the E2 scores in cases can be still higher than controls, if the language grades of cases are relatively higher than their math grades. Thus, to gain clarity on the relationship of E2 with the six psychiatric disorders, we compared the actual math and language grades between cases and controls. We calculated the mean across all math grades and the mean across all Danish and English grades for each individual and used them as measures of math and language performances respectively. Using a multiple logistic regression analysis (adjusted for exam age and sex), both math and language grades were included in the same regression model, thereby testing if the language performance differed between cases and controls after controlling for the differences in the math performance and vice versa.

For all six disorders, we observed substantial differences between math and language performances in cases compared to controls (Fig. 4a; Supplementary Table 11). At the phenotypic level, math grades were significantly lower in cases compared to controls for all disorder groups except AN (Fig. 4a; Supplementary Table 11). Notably, among all the disorder groups, the SCZ cases scored on average lowest in math compared to controls (Beta = − 0.44; SE = 0.03; P = 3 × 10^–33). In contrast to math grades, the language grades were significantly higher in cases compared to controls in BD (Beta = 0.24; SE = 0.04; P = 8.5 × 10^–8), ASD (Beta = 0.15; SE = 0.02; P = 2 × 10^–11) and AN (Beta = 0.35; SE = 0.03; P = 7.4 × 10^–25). For ADHD, the language grades (Beta = − 0.18; SE = 0.02; P = 1 × 10^–17) were lower in cases compared to controls, though the difference was only less than half of the math difference (Beta = − 0.52; SE = 0.02; P = 2.4 × 10^–123). For SCZ (Beta = 0.01; SE = 0.03; P = 0.64) and MDD (Beta = 0.04; SE = 0.01; P = 0.006; adjusted P = 0.07), no statistically significant differences were seen in the language grades between cases and controls.

Next, we asked if the impact of psychiatric disorders on language-math performance differences could extend beyond cases to the general population. That is, even in individuals without any psychiatric diagnosis, do genetic risks for psychiatric disorders associate with the language-math performance gap? To test this, we studied the associations of polygenic scores for the six psychiatric disorders with language and math performances only in the controls using multiple linear regression analysis. Both math and language grades were included in the same regression model to measure the correlation of language grades with polygenic risk for psychiatric disorders after controlling for math grades and vice versa.

We found that the polygenic score associations were in the same direction as the corresponding phenotypic associations. The results suggested that individuals who were not diagnosed with psychiatric disorders, but with a higher polygenic risk for psychiatric disorders, performed better in language relative to math (Fig. 4b; Supplementary Table 12). With regard to SCZ and BD, we observed an inverse association pattern between phenotypic analyses and polygenic score analyses (Fig. 4a,b). The SCZ cases had significantly poorer math grades, but not better language grades compared to controls (math: Beta = − 0.44; SE = 0.03; P = 3 × 10^–33; language: Beta = 0.01; SE = 0.03; P = 0.64). However, individuals without SCZ, but with a higher polygenic risk for SCZ had significantly poorer math grades as well as better language grades (math: Beta = − 0.07; SE = 0.01; P = 1.4 × 10^–11; language: Beta = 0.10; SE = 0.01; P = 5.1 × 10^–16). This pattern was inverse for BD. The BD cases had significantly poorer math grades as well as better language grades compared to controls (math: Beta = − 0.37; SE = 0.04; P = 1 × 10^–16; language: Beta = 0.24; SE = 0.04; P = 8.5 × 10^–8). However, individuals without BD, but with a higher polygenic risk for BD had better language grades, but not poorer math grades (math: Beta = − 0.002; SE = 0.01; P = 0.88; language: Beta = 0.12; SE = 0.02; P = 2 × 10^–8). A similar inverse pattern between SCZ and BD has been known with regard to genetic correlations with educational attainment and intelligence. SCZ shows a significant negative genetic correlation with intelligence, but a small positive genetic correlation with educational attainment¹⁸ (also E1) whereas BD shows a significant positive genetic correlation with educational attainment¹⁰ (also E1) but no genetic correlation with intelligence. The results point to a unique inverse relationship of SCZ and BD with cognition, though the molecular mechanisms underlying this relationship are unclear.

Replication in TEDS

To replicate our findings, we analyzed 4547 genotyped individuals from the Twins Early Development Study (TEDS) cohort³³ for whom General Certificate of Secondary Education (GCSE) school grades in English, math and science were available. The profile of the school grades in TEDS was similar to the one in iPSYCH: school grades were from the end of compulsory schooling; individuals were aged 15–16 years at the time of the examinations; the school grades were available in both math and language exams.

PCA of English, math and science grades in TEDS yielded subject loadings that were similar to the subject loadings in the iPSYCH: E1 (first PC) had similar positive loadings from all three subjects; E2 (second PC) had positive loading from English and negative loadings from math and science; importantly, the English (0.81) and math (− 0.50) loadings were higher than science loading (− 0.27) suggesting that E2 captured mainly math and language performances (Fig. 5a). The factors E1 and E2 explained 83.4% and 10.4% of the variance in the school grades in TEDS respectively. Since the grades were not broken down to written and oral exams or a grade in a foreign language that was taken by everybody was not available (unlike Denmark, where the English exam is compulsory and so taken by everybody), we could not derive factors in TEDS equivalent to E3 and E4 in the iPSYCH.

We performed GWASs of E1 and E2 in TEDS and tested their genetic correlations with E1 and E2 in iPSYCH. The factors E1 and E2 in the two cohorts correlated almost completely (E1: r_g = 0.99; SE = 0.13; P = 3.3e−13; E2: r_g = 1.07; SE = 0.77; P = 0.16; Fig. 5b; Supplementary Table 13). The genetic correlation of E2 between the cohorts, however, was not statistically significant due to the relatively smaller sample size of the TEDS. Nevertheless, when we predicted E1 and E2 in TEDS using polygenic scores (constructed using the effect sizes from GWASs of E1 and E2 in iPSYCH), we observed significant associations supporting the genetic correlation analyses (Fig. 5c; Supplementary Table 14).

We also tested the genetic associations of E1 and E2 with the psychiatric disorders in TEDS using polygenic score analysis (Fig. 5d; Supplementary Table 15). The polygenic scores for the six psychiatric disorders were associated with E1 and E2 in TEDS in the same directions as the corresponding associations in iPSYCH. Notably, we observed positive associations between E2 and all six psychiatric disorders, albeit most of the associations except SCZ were only borderline significant; SCZ showed the strongest association with E2 (Beta = 0.06; SE = 0.01; P = 3.6 × 10^–06). Hence, overall, we observed an agreement in the results between iPSYCH and TEDS that the genetic variants associated with E2 (better performance in language relative to math) were also associated with increased risk for psychiatric disorders, especially SCZ.

Association of E2 with creativity

Creativity has been historically believed to relate positively with psychopathology³⁴. Many epidemiological studies^35,36,37 and genetic studies^38,39 have reported supporting findings. Our analyses showed that E2 was associated with increased risks for multiple psychiatric disorders as well as with increased language performance. Hence, we asked if common variants associated with E2 also associate with creativity. To evaluate this we analyzed 167,575 individuals from the Million Veterans Program (MVP) biobank⁴⁰, for whom information on occupation category was available. We classified individuals employed in the category ‘arts, design, entertainment, sports and media’ as creative professionals and the rest as controls. We constructed polygenic scores for all four E-factors using effect sizes from the iPSYCH GWASs and compared the scores between the creative professionals and controls after adjusting for their highest level of education, age and sex along with other covariates (“Methods”). For sensitivity analysis, we also compared the scores of individuals in each of the other occupation categories against the rest. We found that the E2 polygenic scores were significantly higher in creative professionals compared to others. This association was specifically observed between E2 and creative occupation. That is, among the four E-factors, E2 showed the strongest association with creativity, and among 24 occupation categories, creative occupations showed the strongest association with E2. (Fig. 6; Supplementary Table 16). The results suggested that individuals with a higher genetic predisposition to perform better in language relative to math in school (indexed by a higher polygenic score for E2) have higher odds of being employed in a creative occupation later in life.

Discussion

To our knowledge, the results presented here represent the most comprehensive report to date on the phenotypic and genetic differences in subject-specific school performances in individuals with and without psychiatric disorders. The study has been possible due to the unique register-based resources linked to the iPSYCH²⁵ and ANGI²⁶ cohorts, which enabled us to combine genetic information of the study individuals with their school grades in the Danish education register²⁷. Our cohort is the largest available to date for the study of the genetics of school grades. Our study poses notable advantages compared to previous GWASs of educational attainment^13,28. The phenotype, school grades, is fine-grained and hence captures more variance, for example, multiple individuals with the same number of years of education could differ substantially in their school grades. Our phenotype is objectively measured (graded by the teachers) unlike educational attainment, which is mostly self-reported and hence likely to have more imprecision and heterogeneity compared to school grades. Our study participants were all from a single large cohort hence the study variables—school grades and psychiatric diagnoses—are likely to have less heterogeneity compared to meta-analytic studies where different cohorts follow different methods of ascertainment. Our study individuals were identified through nationwide registers and hence not subject to voluntary participation bias that has been shown to affect GWAS results, particularly the genetic correlations⁴¹.

The availability of grades in multiple subjects for the same individuals enabled us to perform a PCA and decompose the school grades into distinct factors (E1, E2, E3, and E4) each representing unique cognitive abilities such as math and language. We replicated the PCs E1 and E2 in an independent cohort, TEDS³³, and demonstrated near-perfect genetic correlations for E1 and E2 between iPSYCH and TEDS, thereby showing that the latent factors E1 and E2 are robust and reproducible.

Factor E1 measured the overall school performance and showed a strong genetic correlation with educational attainment²⁸. Importantly, both E1 and educational attainment showed similar genetic correlations with the six psychiatric disorders. The results suggest that genetic relationships of psychiatric disorders with cognitive function inferred based on phenotypes (educational attainment and fluid intelligence) measured later in life (i.e., timepoints typically beyond the age of onset of most psychiatric disorders) are highly similar to the relationships inferred based on phenotypes (school performance) measured in early in life (i.e., timepoints before the age of onset most psychiatric disorders, except the ones with childhood-onset—ASD and ADHD). Although the results do not inform about causality or its direction, they inform that the observed genetic relationships are not merely a reflection of the effects of psychiatric disorders on cognitive and socioeconomic outcomes later in life.

The main findings of our study are those related to E2 as they offer novel insights into the relationship between psychiatric disorders and cognition. E2 measured the language performance relative to math performance and showed significant positive phenotypic associations with all six psychiatric disorders, which were supported by genetic correlations and/or polygenic score associations. Further analysis of actual language and math grades confirmed that the positive associations of E2 with the six psychiatric disorders are indeed due to increased language and decreased math performances in the cases compared to controls. Replication analysis in TEDS further validated the findings. In the TEDS cohort, the polygenic scores for all six psychiatric disorders showed associations with E2 in the positive direction. However, the association was statistically significant only for SCZ, which is likely due to the relatively smaller sample size in TEDS.

The E2 findings add two important insights. Firstly, the findings suggest that the positive genetic correlations of ASD⁸, BD¹⁰ and AN¹² with educational attainment reported by earlier GWAS were likely driven by language-specific cognition. Though individuals with higher genetic risk for ASD or BD or AN are genetically predisposed to attain higher education as suggested by the positive genetic correlation with educational attainment, we speculate that they do so by choosing a field that requires language skills rather than math skills. GWASs of educational attainment separately in language and math-related fields in the future might help to confirm our hypothesis.

Secondly, the positive genetic association between E2 and risk for psychiatric disorders suggests that the genetic variants associated with increased risk for psychiatric disorders are associated with poor math ability or better language ability or possibly both. It is however unclear if the positive association between psychiatric risk and language ability is a true biological link. If so, this association might have an evolutionary basis as has been suggested previously, particularly in the context of SCZ^42,43. Alternatively, the association of psychiatric risk variants with increased language ability could be simply due to that the individuals with the risk variants compensate for their math deficits by being good in non-math subjects including language.

Our final analysis suggested that individuals genetically predisposed to perform relatively better in language than in math at school more often choose a creative occupation later in life such as writing, acting and music. As we have demonstrated that these individuals were also at risk for psychiatric disorders, the finding aligns with two previous studies that demonstrated that individuals with increased genetic risk for SCZ and BD are more often involved in creative occupations³⁸ or score better in creativity tests compared to the general population³⁹. However, it is unclear if the association with creative occupations indicates that the individuals with the risk variants become creative due to their better language skills or simply choose creative professions as such professions suit well to their relatively poor math skills.

Our study has the following limitations. First, our interpretations assume that school performances in language and mathematics exams capture the cognitive abilities related to language and math domains respectively. However, our assumptions may not be entirely true as other factors including family⁴⁴ and school socioeconomic statuses⁴⁵ influence students’ school performances. Furthermore, like any other GWASs^13,14, the effect sizes that we report are likely to be inflated as we haven’t accounted for the rearing environment⁴⁴. Future studies that use a within-family design⁴⁶ might help address these limitations. Second, although our primary sample, iPSYCH, has a better sampling design and do not suffer from participation bias⁴¹, the individuals included in the current study are only a subset and are not representative of the full iPSYCH cohort. We included only those who were functional enough to go to school and attend the exams. Hence, our study sample is slightly biased towards better-functioning individuals and as a result, many of our findings cannot be generalized to the disorders, for example, findings related to ASD might apply only to the high-functioning subtypes. Third, not all the individuals in the case groups received their diagnoses before the exams. This factor was not accounted for in our analyses since the diagnoses were register-based and it is therefore not possible to confirm if the individuals were asymptomatic before the date when the diagnoses were first registered. Fourth, as our sample is highly enriched for psychiatric disorders, we included psychiatric diagnoses as covariates in the GWASs. Recent studies have shown that including heritable covariates in GWAS might lead to collider bias⁴⁷. However, this is less likely in our case as we have replicated the genetic correlation findings in only controls using polygenic score analysis and in addition, we also replicated the findings in an independent cohort, which is representative of the general population.

In summary, through an extensive analysis of subject-specific school grades in a large sample, we have convincingly demonstrated for the first time that individuals with psychiatric disorders exhibit wide differences in their math and language cognitions. These differences seem to have a genetic basis understanding which is important as the knowledge may help to treat the cognitive deficits associated with these disorders.

Online methods

Study cohorts

Our study individuals come from four cohorts: iPSYCH²⁵, ANGI²⁶, TEDS³³ and MVP⁴⁰. The main analyses were performed in iPSYCH and ANGI cohorts. iPSYCH is a large Danish case cohort established for the study of genetic and environmental risk factors associated with major psychiatric disorders (ADHD, ASD, MDD, SCZ, BD). ANGI is an international collaboration established between scientists in the United States, Australia/New Zealand, Sweden and Denmark to establish a cohort of individuals with and without AN. The ANGI participants involved in our study were those recruited in Denmark along with the iPSYCH participants. Hence, they were processed along with iPSYCH samples. All further descriptions about sample recruitment, genotyping and ethical approvals of the iPSYCH cohort apply to ANGI as well. The phase-1 release²⁵ of the iPSYCH cohort, after QC, comprises 77,639 individuals—51,101 with one or more of the six disorders and 27,605 controls—who were identified from a baseline sample comprising of the entire Danish population (N = 1,472,762) born between 1981 and 2005. The cases were selected based on the psychiatric diagnosis information in the Danish Psychiatric Central Research Register⁴⁸ and Danish National Hospital Register⁴⁹. The controls were randomly sampled from the baseline sample. In the current study, 30,982 genotyped individuals who had school grades information available in the Danish education register²⁷ were included.

TEDS³³ is a large twin cohort comprising more than 16,000 twin pairs who were born either in England or Wales between 1994 and 1996. The twins were around 18 months old at the time of recruitment and were followed up longitudinally. Information on cognitive abilities, educational attainment, behaviour and emotion was collected. Among the TEDS participants, 4547 individuals, comprising one of each of the twin pairs, who were genotyped and also had information on their GCSE school grades were included in the current study.

The MVP⁴⁰ is a large biobank at the Department of Veterans Affairs (VA), USA, established to study genetic and environmental influences on human diseases and traits. The participants being recruited into the study are active users of the US Veterans healthcare system. The MVP v3.0 data release comprised 455,789 genotyped individuals. Information on demographics, health, lifestyle, medical history and family history were collected through questionnaires or through electronic health records in the US Veterans healthcare system. Among those in the MVP v3.0 release, 165,575 individuals, who were identified to be of European ancestries and provided information on occupation and educational attainment, were included in the current study.

Ascertainment of psychiatric disorders in the iPSYCH cohort

The psychiatric diagnoses of the iPSYCH case samples were identified through Danish Psychiatric Central Research Register⁴⁸ and Danish National Hospital Register⁴⁹. The diagnosis codes are based on the International Classification of Diseases, 10th revision. The ICD-10 codes of the six disorders as follows: ADHD—F90.0; ASD—F84.0, F84.1, F84.5, F84.8 or F84.9; MDD—F32-39 [Since 96% of the individuals had either F32 (depressive episode) or F33 (recurrent depressive disorder), we call it as MDD rather than as affective disorders]; BD—F30-31; SCZ—F20; AN—F50.0. Individuals with mental retardation (ICD10 F70-79) were excluded.

School grades in the iPSYCH cohort

The school grades in the iPSYCH cohort were from the exit exam (also called as ninth-level exam or FP9) given at the end of compulsory schooling in Denmark. The exam grades were obtained from Danish education registers²⁷ that maintain school grades from all public schools in Denmark since 2001.

We chose three subjects namely Danish, English and math for the current study. These subjects were chosen because they were compulsory, and so were available for a maximum number of individuals in the cohort. The examination types included written, oral and grammar in Danish, oral in English, and either written and oral or problem-solving with and without aids in math. The data were from the exams conducted between 2002 and 2016.

The grades were on a seven-point scale: − 3 (unacceptable performance), 00 (inadequate performance), 02 (adequate performance), 4 (fair performance), 7 (good performance), 10 (very good performance) and 12 (excellent performance). The seven-point grading system was followed only since 2007. Before 2007, a ten-point grading system was followed. The grades included in our study, from the years 2002–2006, were converted to seven-point grades using the conversion table provided by the Danish Ministry of Education (https://ufm.dk/en/education/the-danish-education-system/grading-system/old-grading-scale). A minimum score of 02 is considered as passing grade. The grade 00 is given if the student has handed over a blank paper or performed extremely poorly. Absentees without reason were graded − 3. Absentees with an acceptable reason, for example, acute illness, are not graded but are given an opportunity to take the exam in the subsequent year.

We applied strict sample-level quality control based on the school grade data. We removed individuals who were either younger than 14.5 years or older than 17.5 years at the time of the examinations. The common age group of the students taking the ninth-level exams in Denmark is 15 and 16 years. When multiple grades were available for a student, we considered only the grade obtained on the first attempt. We included only individuals who had grades in all the examinations in Danish, English and math chosen for the current study. Also, we removed individuals who had grades in different subjects from different years to avoid heterogeneity that may arise if the student was taught by different teachers, was at different schools or had different peers between the years.

School grades in the TEDS cohort

The school grades in the TEDS cohort were from the GCSE exams given at the end of compulsory schooling in UK^33,50. The grades were available in three compulsory subjects namely English, math and science. Unlike iPSYCH data, under each subject only one grade was available. The GCSE grades were self-reported by either the participants or their parents. A validation of the self-reported school grades in TEDS has been performed previously by correlating with the grades extracted from the national pupil database (NPD) (https://data.gov.uk/dataset/9e0a13ef-64c4-4541-a97a-f87cc4032210/national-pupil-database) for a subset of the individuals, which showed that the self-reported grades correlate highly (> 0.95) with NPD grades³³. The GCSE grades ranged between four (G; lowest grade) to 11 (A+; highest grade), with four being the lowest pass grade.

Occupation and educational attainment in the MVP cohort

Information on primary occupation was collected from all MVP participants through a questionnaire. Only the category of occupation was collected. The participants chose one of the 24 categories (Supplementary Table 17) that matched the best with their primary occupation at the time of recruitment. Individuals who chose the category: ‘Arts, Design, Entertainment, Sport and Media’ were considered creative professionals, and the rest as controls. The highest level of education achieved by the participants was also collected through a questionnaire. The participants chose one of the following answers: less than high school; high school diploma/GED; some college credit, but no degree; associate’s degree (e.g., AA, AS); bachelor’s degree (e.g., BA, BS); master’s degree (e.g., MA, MS, MBA); professional or doctoral degree. The categories are then converted to the number of years of education following International Standard Classification of Education (ISCED) 1997 guidelines (Supplementary Table 18).

Genotyping and Imputation in the iPSYCH cohort

The source of DNA for genotyping was dried blood spot—two punches of diameter 3.2 mm equivalent to a volume of 6 µl of blood²⁵. The blood spots of the iPSYCH participants were taken from the Danish neonatal screening biobank, which stores dried blood spots, taken 4–7 days after birth from the heel of the neonate, for all individuals born in Denmark since 1981^51,52. The blood spots were matched with the individuals using the unique identification number that is used across all the registers in Denmark. The extracted DNA was then whole genome amplified in triplicates before genotyping. The genotyping was performed using Illumina Infinium PsychChip v1.0 array. Following standard QC of the genotyped markers (e.g., call rate > 0.98, MAF > 0.01, Hardy–Weinberg equilibrium P value > 1 × 10^–6), phasing and imputation was carried out. Phasing was performed using SHAPEIT3⁵³ and imputation was performed using IMPUTE2⁵⁴ with 1000 genomes phase-3 as the reference panel.

Genotyping and imputation in the TEDS cohort

The Source of DNA for genotyping in the TEDS cohort was saliva collected at the time of recruitment³³. The DNA extracted from the saliva was genotyped using either Illumina HumanOmniExpressExome chip or Affymetrix Gene Chip 6.0. Following standard QC procedures, the genotyped markers are phased using EAGLE-2⁵⁵, followed by imputation using MaCH⁵⁶ with Haplotype reference consortium (release 1.1)⁵⁷ as the reference panel. Both phasing and imputation were performed through the Sanger imputation services⁵⁸. Imputation was performed separately for individuals genotyped using Illumina and those genotyped using Affymetrix. The genotyping chip used was accounted for in the genetic analysis by including a dummy variable for the two chips as a covariate.

Genotyping and imputation in the MVP cohort

The Source of DNA for genotyping in the MVP cohort was peripheral venous blood collected at the time of recruitment or during the follow-up visits^40,59. A genotyping chip, called MVP chip (modified Affymetrix Axiom Biobank array), was specifically designed for the MVP biobank. The MVP chip contains 723,000 markers enriched for exome SNPs, validated tag SNPs for diseases including psychiatric disorders, and variants specific to African American and Hispanic populations. Phasing and imputation were performed using either MaCH⁵⁶ or minimac⁶⁰ and SHAPEIT3⁵³ or IMPUTE2⁵⁴ respectively.

Relatedness and population stratification in the iPSYCH cohort

All the individuals involved in the current study were unrelated and had European ancestries. Related pairs of individuals were identified using identity by descent (IBD) analysis using Plink v1.90⁶¹. One of each related pair (PIHAT > 0.20) was randomly excluded. PCA was performed in the unrelated individuals using approximately 23,000 imputed variants of high quality (imputation info score > 0.90, MAF > 0.05, missing rate < 0.01 and LD independent [r² < 0.1]). Among the study individuals, a subset whose parents and paternal and maternal grandparents were born in Denmark were identified based on the Danish civil register. These individuals were used as a reference group to identify population outliers. The first five PCs of the reference individuals were used to construct a five-dimensional ellipsoid with a diameter of eight standard deviations (calculated from the PCs). Those individuals who fell outside the ellipsoid were considered non-Europeans and excluded from the study.

Relatedness and population stratification in the TEDS cohort

Within each of the genotyped twins, one individual was randomly selected for the current study. Relatedness was estimated among those selected using IBD analysis using Plinkv1.90⁶¹ and one of each related pair (PIHAT > 0.125) was further removed randomly. PCA analysis was performed using EIGENSTRAT⁶² for the unrelated individuals after merging with the 1000 genomes EUR samples. With the 1000 genomes EUR samples as reference, ancestry outliers were removed iteratively based on the first 10 PCs.

Relatedness and population stratification in the MVP cohort

Relatedness between MVP participants was inferred using the kinship coefficient calculated using the software KING⁶³. Related individuals are removed using a kinship coefficient cut-off ≥ 0.088. Individuals of European ancestries were identified using a machine learning algorithm called HARE⁶⁴ that uses information about both self-reported ethnicities as well as principal components derived from PCA of genetic markers. The PCA was performed using EIGENSOFT v.6 (https://www.nature.com/articles/ng1847).

PCA of school grades

PCA of school grades in the iPSYCH cohort was performed in R using the ‘principal’ function from the ‘psych’ R-package. The datasets from the years 1990–2006 (dataset 1) and 2007–2016 (dataset 2) were analyzed separately since the math grade types differed between the two. Both the PCAs yielded six PCs that explained 100% of the variance in the school grades. The PCs were rotated using simplimax algorithm⁶⁵ and then used for the analysis. We focused on the first four PCs that explained ~ 90% of the variance in the school grades. The subject-specific loadings of the four PCs were similar between datasets 1 and 2. Also, the genetic correlations of the four PCs in dataset 1 with the corresponding PCs in dataset 2 were close to 1. Hence, we combined the PCs of both datasets and analyzed them together. The dataset origin of the PC values was coded as a binary variable and included as a covariate in all related analyses. PCA of school grades in the TEDS cohort was performed in R using ‘prcomp’ function from the base R-package.

GWAS

The GWASs in the iPSYCH cohort were performed in Plink v.1.90⁶¹ using linear regression adjusted for age (age at the time of examination), sex, first 10 PCs, genotyping batches, group variable for PCA of school grades and psychiatric diagnoses. Totally 6,391,200 variants with MAF > 0.01 and INFO > 0.80 were included in the final analysis.

The GWASs in the TEDS cohort were performed in Plink v.1.90⁶¹ using linear regression adjusted for age, sex, first 10 PCs and genotyping chips. Totally 5,266,884 variants with MAF > 0.01 and INFO > 0.80 were included in the final analysis.

Phenome-wide association analysis

The Phenome-wide association analysis was performed using GWAS atlas⁶⁶, an online database of GWAS summary statistics. The GWAS atlas database holds summary statistics for 4571 GWASs (the number represents unique studies, but not unique phenotypes). Using variant identifier (RSID) of the index variants in the seven genome-wide loci, we queried the GWAS atlas and obtained all the associations with P value < 0.05. The phenotypes are provided along with category labels such as cognitive and psychiatric. All the associations under the cognitive category are provided in Supplementary Table 5 (educational attainment, though categorized as environmental, is also included in the list).

SNP based heritability

The SNP-based heritability of the school grades and the E-factors was measured using genome-based restricted maximum likelihood (GREML) analysis implemented in the genome-wide complex trait analysis (GCTA) software⁶⁷. A genetic relationship matrix (GRM) was constructed using around 7 million genetic variants (MAF > 0.01; INFO > 0.2; Missing rate < 0.95) for the whole iPSYCH cohort. The SNP-based heritability was then calculated for only the individuals included in this study (N = 30,982) using the GCTA-GREML analysis adjusted for the same covariates as the main GWAS.

Polygenic scores derivation

In the iPSYCH cohort, polygenic scores for ASD⁸, ADHD⁷, SCZ³¹, BD³², MDD¹¹, AN²⁶, educational attainment²⁸ and intelligence¹⁴ were derived using effect sizes from summary statistics of published studies with large sample sizes. The GWASs of ASD and ADHD, however, were based on predominantly the iPSYCH sample. Hence, the polygenic scores for ASD and ADHD were derived in-sample using a leave-one-out approach as described previously^7,8. Briefly, we divided the full iPSYCH sample into ten groups and derived polygenic scores for ASD and ADHD in each group separately. The SNP weights required for polygenic score calculation in each group came from a GWAS of ASD and ADHD performed in the rest of the nine groups combined, thereby ensuring no sample overlap between training and target samples. The summary statistics were LD clumped using the 1000 genome EUR reference panel to identify LD-independent variants. The clumped summary statistics were then used for constructing polygenic scores using Plink v1.90⁶¹. Ten P-value thresholds (S1 = 5 × 10⁻⁸, S2 = 1 × 10⁻⁶, S3 = 1 × 10⁻⁴, S4 = 1 × 10⁻³, S5 = 0.01, S6 = 0.05, S7 = 0.1, S8 = 0.2, S9 = 0.5 and S10 = 1.0) were used, yielding ten polygenic scores for each trait. The polygenic score based on the threshold that gave the best prediction (i.e., explained the maximum variance) was used in the association analysis. We acknowledge that this method of generating polygenic scores, called ‘clumping and thresholding’ (C&T), has been superseded by newer methods that offer better prediction performances^68,69. However, the C&T-based scores that we generated had enough predictive power in most cases to perform statistical association tests and infer the direction of the associations.

In the TEDS cohort, polygenic scores were constructed for six psychiatric disorders using the same summary statistics that were used for the polygenic score construction in the iPSYCH cohort. In addition, polygenic scores were constructed for E-factors using effect sizes from the GWASs of E-factors in iPSYCH. The summary statistics were LD clumped using 1000 genomes EUR reference panel to identify LD-independent variants. The clumped summary statistics were then used for constructing polygenic scores using PRSice v2.2.3⁷⁰.

In the MVP cohort, polygenic scores were constructed for E-factors (E1, E2, E3 and E4) using effect sizes from the iPSYCH GWAS of E-factors. The summary statistics were processed using PRS-CS software⁶⁸ to generate weights (posterior SNP effect sizes). Default settings were used for calculating weights using PRS-CS (γ-γ prior = 1; parameter b in γ-γ prior = 0.5; MCMC iterations = 1000; number of burn-in iterations = 500; thinning of the Markov chain factor = 5). Then, based on the derived weights individual-level polygenic scores for E-factors were calculated using Plink v2.0⁷¹ software.

Polygenic scores analysis

The polygenic score associations were tested using either linear (if analyzing school grades) or logistic regression (if analyzing occupation category). The covariates used in the iPSYCH and TEDS cohorts were the same as the ones used in the corresponding GWASs. In the MVP cohort, the following covariates were used: age at recruitment, sex, first 20 ancestral PCs, genotyping batches and number of years of education. The variance explained by the polygenic scores was interpreted using R² (if continuous outcomes: E-factors) or Nagelkerke pseudo R² (if binary outcomes: occupation). Two regression models were constructed: one with the polygenic score and all covariates included (model 1) and the other with only the covariates included (model 2). The reported R² values were calculated by subtracting the R² of model 2 from R² of model 1.

Genetic correlations

The genetic correlations were calculated using either GCTA bivariate REML⁶⁷ or LD score bivariate regression³⁰. The genetic correlations between individual school grades, between the E-factors from dataset 1 and E-factors from dataset 2, and pairwise genetic correlations between the E-factors (after combing both datasets) were all calculated using GCTA using individual genotypes. The genetic correlations between the E-factors and other traits (years of education, intelligence and the six psychiatric disorders), and between the iPSYCH E-factors and TEDS E-factors were calculated using LD score regression using GWAS summary statistics. We used the precalculated LD scores available from the LD score regression website. Only the HapMap variants (SNP list available from the LD score regression website) were used for the analysis. The reference and effect alleles in the summary statistics are aligned with that of the HapMap list. Then, the summary statistics files were munged using the munge script from LDSC software (with default settings). The munged files were then used to estimate genetic correlations.

Multiple testing corrections

The statistical significance of all the analyses was assessed after multiple testing corrections. The P value threshold in each of the analyses was decided based on the number of unique hypotheses tested. In the analysis of the heritability of E-factors, the P-value threshold was set to 0.01 (0.05/4). In the GWAS analysis, the P value threshold was set to 1.25 × 10^–8 (5 × 10^–8/4). In the analysis of genetic correlations and polygenic score associations of E-factors with educational attainment and intelligence, the P-value threshold was set to 0.006 (0.05/8). In the analysis of phenotypic associations, genetic correlations and polygenic score associations between the E-factors (N = 4) and the psychiatric disorders (N = 6), the P value threshold was set to 0.002 (0.05/24). In the analysis of phenotypic associations and polygenic score associations of math and language (N = 2) with psychiatric disorders (N = 6), the P-value threshold was set to 0.004 (0.05/12). In the analysis of genetic correlations and polygenic score associations of iPSYCH E-factors (N = 4) with TEDS E-factors (N = 2), the P value threshold was set to 0.006 (0.05/8). In the analysis of polygenic score associations of TEDS E-factors (N = 2) with psychiatric disorders (N = 6), the P value threshold was set to 0.004 (0.05/12). In the analysis of polygenic score associations of E2 (N = 1) with occupations (N = 24), the P value threshold was set to 0.002 (0.05/24).

Ethics approval and consent to participate

All the analyses in the iPSYCH data are within the permissions received from the Danish Scientific Ethics Committee, the Danish Health Data Authority, the Danish data protection agency and the Danish Neonatal Screening Biobank Steering Committee²⁵. The iPSYCH project is based on individuals recruited via Danish registers, and obtaining informed consent from the participants has been exempted by the Danish ethical committee in accordance with the Act of Research Ethics Review of Health Research Projects (in Danish: Komitéloven), Section 10(1) (https://ipsych.dk/en/data-security/health-research-and-ethical-approval/). The analyses in the TEDS cohort are within the permissions received from the King’s College London Ethics Committee (reference: PNM/09/10-104)³³. Parental consent was obtained for all the TEDS participants before data collection. All the participants in the MVP cohort have given written informed consent at the time of recruitment. The analysis involving MVP data in the current study is approved by the VA Central Institutional Review Board. We confirm that all the analyses we report in this manuscript were performed in accordance with relevant guidelines/regulations as recommended by the respective ethics committees.

Data availability

The GWAS summary statistics are available for download at https://ipsych.dk/en/research/downloads/.

References

Krystal, J. H. & State, M. W. Psychiatric disorders: Diagnosis to therapy. Cell 157, 201–214 (2014).
Article CAS Google Scholar
Pedersen, C. B. et al. A comprehensive nationwide study of the incidence rate and lifetime risk for treated mental disorders. JAMA Psychiat. 71, 573–581 (2014).
Article Google Scholar
Sandstrom, A., Sahiti, Q., Pavlova, B. & Uher, R. Offspring of parents with schizophrenia, bipolar disorder, and depression: A review of familial high-risk and molecular genetics studies. Psychiatr. Genet. 29, 160–169 (2019).
Article CAS Google Scholar
Vreeker, A. et al. High educational performance is a distinctive feature of bipolar disorder: A study on cognition in bipolar disorder, schizophrenia patients, relatives and controls. Psychol. Med. 46, 807–818 (2016).
Article CAS Google Scholar
Ranning, A. et al. School performance from primary education in the adolescent offspring of parents with schizophrenia and bipolar disorder- a national, register-based study. Psychol. Med. 48, 1993–2000 (2018).
Article Google Scholar
Chien, Y.-L., Tu, E.-N. & Gau, S.S.-F. School functions in unaffected siblings of youths with autism spectrum disorders. J. Autism Dev. Disord. 47, 3059–3071 (2017).
Article Google Scholar
Demontis, D. et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat. Genet. 51, 63–75 (2019).
Article CAS Google Scholar
Grove, J. et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444 (2019).
Article CAS Google Scholar
Trubetskoy, V. et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 604, 502–508 (2022).
Article ADS CAS Google Scholar
Mullins, N. et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat. Genet. 53, 817–829 (2021).
Article CAS Google Scholar
Wray, N. R. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat. Genet. 50, 668–681 (2018).
Article CAS Google Scholar
Watson, H. J. et al. Genome-wide association study identifies eight risk loci and implicates metabo-psychiatric origins for anorexia nervosa. Nat. Genet. 51, 1207–1214 (2019).
Article CAS Google Scholar
Okbay, A. et al. Polygenic prediction of educational attainment within and between families from genome-wide association analyses in 3 million individuals. Nat. Genet. 54, 437–449 (2022).
Article CAS Google Scholar
Savage, J. E. et al. Genome-wide association meta-analysis in 269,867 individuals identifies new genetic and functional links to intelligence. Nat. Genet. 50, 912–919 (2018).
Article CAS Google Scholar
Fröjd, S. A. et al. Depression and school performance in middle adolescent boys and girls. J. Adolesc. 31, 485–498 (2008).
Article Google Scholar
Keen, D., Webster, A. & Ridley, G. How well are children with autism spectrum disorder doing academically at school? An overview of the literature. Autism Int. J. Res. Pract. 20, 276–294 (2016).
Article Google Scholar
Sundquist, J., Ohlsson, H., Winkleby, M. A., Sundquist, K. & Crump, C. School achievement and risk of eating disorders in a Swedish National Cohort. J. Am. Acad. Child Adolesc. Psychiatry 55, 41-46.e1 (2016).
Article Google Scholar
Bansal, V. et al. Genome-wide association study results for educational attainment aid in identifying genetic heterogeneity of schizophrenia. Nat. Commun. 9, 3078 (2018).
Article ADS CAS Google Scholar
MacCabe, J. H. et al. Scholastic achievement at age 16 and risk of schizophrenia and other psychoses: A national cohort study. Psychol. Med. 38, 1133–1140 (2008).
Article CAS Google Scholar
Dickinson, D. Zeroing in on early cognitive development in schizophrenia. Am. J. Psychiatry 171, 9–12 (2014).
Article Google Scholar
Lima, I. M. M., Peckham, A. D. & Johnson, S. L. Cognitive deficits in bipolar disorders: Implications for emotion. Clin. Psychol. Rev. 59, 126–136 (2018).
Article Google Scholar
Escott-Price, V. et al. Genetic liability to schizophrenia is negatively associated with educational attainment in UK Biobank. Mol. Psychiatry 25, 703–705 (2020).
Article Google Scholar
Donati, G., Dumontheil, I., Pain, O., Asbury, K. & Meaburn, E. L. Evidence for specificity of polygenic contributions to attainment in English, Maths and Science during adolescence. Sci. Rep. 11, 3851 (2021).
Article ADS CAS Google Scholar
Davis, O. S. P. et al. The correlation between reading and mathematics ability at age twelve has a substantial genetic component. Nat. Commun. 5, 4204 (2014).
Article Google Scholar
Pedersen, C. B. et al. The iPSYCH2012 case–cohort sample: New directions for unravelling genetic and environmental architectures of severe mental disorders. Mol. Psychiatry 23, 6–14 (2018).
Article CAS Google Scholar
Thornton, L. M. et al. The Anorexia Nervosa Genetics Initiative (ANGI): Overview and methods. Contemp. Clin. Trials 74, 61–69 (2018).
Article Google Scholar
Jensen, V. M. & Rasmussen, A. W. Danish education registers. Scand. J. Public Health 39, 91–94 (2011).
Article Google Scholar
Lee, J. J. et al. Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018).
Article CAS Google Scholar
Deary, I. J., Penke, L. & Johnson, W. The neuroscience of human intelligence differences. Nat. Rev. Neurosci. 11, 201–211 (2010).
Article CAS Google Scholar
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015).
Article CAS Google Scholar
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
Article ADS CAS Google Scholar
Stahl, E. A. et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat. Genet. 51, 793–803 (2019).
Article CAS Google Scholar
Rimfeld, K. et al. Twins early development study: A genetically sensitive investigation into behavioral and cognitive development from infancy to emerging adulthood. Twin Res. Hum. Genet. Off. J. Int. Soc. Twin Stud. 22, 508–513 (2019).
Article Google Scholar
Becker, G. The association of creativity and psychopathology: Its cultural-historical origins. Creat. Res. J. 13, 45–53 (2001).
Article Google Scholar
Kyaga, S. et al. Creativity and mental disorder: Family study of 300,000 people with severe mental disorder. Br. J. Psychiatry J. Ment. Sci. 199, 373–379 (2011).
Article Google Scholar
Andreasen, N. C. Creativity and mental illness: Prevalence rates in writers and their first-degree relatives. Am. J. Psychiatry 144, 1288–1292 (1987).
Article CAS Google Scholar
Post, F. Creativity and psychopathology. A study of 291 world-famous men. Br. J. Psychiatry J. Ment. Sci. 165, 22–34 (1994).
Article CAS Google Scholar
Power, R. A. et al. Polygenic risk scores for schizophrenia and bipolar disorder predict creativity. Nat. Neurosci. 18, 953–955 (2015).
Article CAS Google Scholar
Li, H. et al. Genome-wide association study of creativity reveals genetic overlap with psychiatric disorders, risk tolerance, and risky behaviors. Schizophr. Bull. 46, 1317–1326 (2020).
Article Google Scholar
Gaziano, J. M. et al. Million Veteran Program: A mega-biobank to study genetic influences on health and disease. J. Clin. Epidemiol. 70, 214–223 (2016).
Article Google Scholar
Pirastu, N. et al. Genetic analyses identify widespread sex-differential participation bias. Nat. Genet. 53, 663–671 (2021).
Article CAS Google Scholar
Sikela, J. M. & Searles Quick, V. B. Genomic trade-offs: Are autism and schizophrenia the steep price of the human brain?. Hum. Genet. 137, 1–13 (2018).
Article CAS Google Scholar
Crow, T. J. Schizophrenia as the price that homo sapiens pays for language: A resolution of the central paradox in the origin of the species. Brain Res. Brain Res. Rev. 31, 118–129 (2000).
Article CAS Google Scholar
Morris, T. T., Davies, N. M., Hemani, G. & Smith, G. D. Population phenomena inflate genetic associations of complex social traits. Sci. Adv. 6, eaay0328 (2020).
Article ADS CAS Google Scholar
Belfi, B., Haelermans, C. & De Fraine, B. The long-term differential achievement effects of school socioeconomic composition in primary education: A propensity score matching approach. Br. J. Educ. Psychol. 86, 501–525 (2016).
Article Google Scholar
Selzam, S. et al. Comparing within- and between-family polygenic score prediction. Am. J. Hum. Genet. 105, 351–363 (2019).
Article CAS Google Scholar
Aschard, H., Vilhjálmsson, B. J., Joshi, A. D., Price, A. L. & Kraft, P. Adjusting for heritable covariates can bias effect estimates in genome-wide association studies. Am. J. Hum. Genet. 96, 329–339 (2015).
Article CAS Google Scholar
Mors, O., Perto, G. P. & Mortensen, P. B. The Danish Psychiatric Central Research Register. Scand. J. Public Health 39, 54–57 (2011).
Article Google Scholar
Andersen, T. F., Madsen, M., Jørgensen, J., Mellemkjoer, L. & Olsen, J. H. The Danish National Hospital Register. A valuable source of data for modern health sciences. Dan. Med. Bull. 46, 263–268 (1999).
CAS Google Scholar
Selzam, S. et al. Predicting educational achievement from DNA. Mol. Psychiatry 22, 267–272 (2017).
Article CAS Google Scholar
Nørgaard-Pedersen, B. & Hougaard, D. M. Storage policies and use of the Danish Newborn Screening Biobank. J. Inherit. Metab. Dis. 30, 530–536 (2007).
Article Google Scholar
Hollegaard, M. V. et al. Archived neonatal dried blood spot samples can be used for accurate whole genome and exome-targeted next-generation sequencing. Mol. Genet. Metab. 110, 65–72 (2013).
Article CAS Google Scholar
O’Connell, J. et al. Haplotype estimation for biobank-scale data sets. Nat. Genet. 48, 817–820 (2016).
Article Google Scholar
Howie, B., Marchini, J. & Stephens, M. Genotype imputation with thousands of genomes. G3 Bethesda Md 1, 457–470 (2011).
Article Google Scholar
Loh, P.-R., Palamara, P. F. & Price, A. L. Fast and accurate long-range phasing in a UK Biobank cohort. Nat. Genet. 48, 811–816 (2016).
Article CAS Google Scholar
Li, Y., Willer, C. J., Ding, J., Scheet, P. & Abecasis, G. R. MaCH: Using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet. Epidemiol. 34, 816–834 (2010).
Article Google Scholar
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016).
Article CAS Google Scholar
Das, S. et al. Next-generation genotype imputation service and methods. Nat. Genet. 48, 1284–1287 (2016).
Article CAS Google Scholar
Hunter-Zinck, H. et al. Genotyping array design and data quality control in the million veteran program. Am. J. Hum. Genet. 106, 535–548 (2020).
Article CAS Google Scholar
Fuchsberger, C., Abecasis, G. R. & Hinds, D. A. minimac2: Faster genotype imputation. Bioinformatics 31, 782–784 (2015).
Article CAS Google Scholar
Purcell, S. et al. PLINK: A tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS Google Scholar
Wu, C., DeWan, A., Hoh, J. & Wang, Z. A comparison of association methods correcting for population stratification in case-control studies. Ann. Hum. Genet. 75, 418–427 (2011).
Article Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinform. Oxf. Engl. 26, 2867–2873 (2010).
Article CAS Google Scholar
Fang, H. et al. Harmonizing genetic ancestry and self-identified race/ethnicity in genome-wide association studies. Am. J. Hum. Genet. 105, 763–772 (2019).
Article CAS Google Scholar
Kiers, H. A. L. Simplimax: Oblique rotation to an optimal target with simple structure. Psychometrika 59, 567–579 (1994).
Article MATH Google Scholar
Watanabe, K. et al. A global overview of pleiotropy and genetic architecture in complex traits. Nat. Genet. 51, 1339–1348 (2019).
Article CAS Google Scholar
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: A tool for genome-wide complex trait analysis. Am. J. Hum. Genet. 88, 76–82 (2011).
Article CAS Google Scholar
Ge, T., Chen, C.-Y., Ni, Y., Feng, Y.-C.A. & Smoller, J. W. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019).
Article ADS Google Scholar
LDpred2: better, faster, stronger|Bioinformatics|Oxford Academic. https://academic.oup.com/bioinformatics/article/36/22-23/5424/6039173.
Choi, S. W. & O’Reilly, P. F. PRSice-2: Polygenic Risk Score software for biobank-scale data. GigaScience 8, giz082 (2019).
Article Google Scholar
Chang, C. C. et al. Second-generation PLINK: Rising to the challenge of larger and richer datasets. GigaScience 4, 7 (2015).
Article Google Scholar

Download references

Funding

(1) The iPSYCH project is funded by the Lundbeck Foundation (grant numbers R102-A9118 and R155-2014-1724) and the universities and university hospitals of Aarhus and Copenhagen. The Danish National Biobank resource was supported by the Novo Nordisk Foundation. Data handling and analysis on the GenomeDK HPC facility was supported by NIMH (1U01MH109514-01 to Michael O’Donovan and ADB). High-performance computer capacity for handling and statistical analysis of iPSYCH data on the GenomeDK HPC facility was provided by the Center for Genomics and Personalized Medicine, Aarhus University and Central Region Denmark, and Centre for Integrative Sequencing, iSEQ, Aarhus University (grant to ADB). (2) The Anorexia Nervosa Genetics Initiative (ANGI) was an initiative of the Klarman Family Foundation. (3) The PhD fellowship of V.M.R was fully funded by the Graduate School of Health, Aarhus University, Aarhus, Denmark. (4) G.V is supported by the Leon Levy Foundation (Leon Levy Fellowship in Neuroscience) and by NIH grant R01MH109677. (5) P.R is supported by the National Institutes of Health (R01AG050986 Roussos, R01MH109677 Roussos, U01MH116442 Roussos, R01MH110921 Roussos) and the Veterans Affairs (Merit grant BX004189 and BX002395 Roussos). (6) We gratefully acknowledge the ongoing contribution of the participants in the Twins Early Development Study (TEDS) and their families. TEDS is supported by a programme grant to RP from the UK Medical Research Council (MR/M021475/1 and previously G0901245), with additional support from the US National Institutes of Health (AG046938). The research leading to these results has also received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007- 2013)/grant agreement n° 602768 and ERC grant agreement n° 295366. (7) R.P is supported by a Medical Research Council Professorship award (G19/2). This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no. 721567. (8) A.G.A. has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska‐Curie grant agreement no. 721567. (9) The authors acknowledge use of the research computing facility at King’s College London, Rosalind (https://rosalind.kcl.ac.uk), which is delivered in partnership with the National Institute for Health Research (NIHR) Biomedical Research Centres at South London & Maudsley and Guy’s & St. Thomas’ NHS Foundation Trusts, and part-funded by capital equipment grants from the Maudsley Charity (award 980) and Guy’s & St. Thomas’ Charity (TR130505). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR, King’s College London, or the Department of Health and Social Care.

Author information

A list of authors and their affiliations appears at the end of the paper.

Authors and Affiliations

Department of Biomedicine, Aarhus University, Aarhus, Denmark
Veera M. Rajagopal, Jakob Grove, Thomas D. Als, Manuel Mattheisen, Jonatan Pallesen, Anders D. Børglum & Ditte Demontis
The Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH), Aarhus, Denmark
Veera M. Rajagopal, Jakob Grove, Thomas D. Als, Henriette T. Horsdal, Liselotte Petersen, Vivek Appadurai, Andrew Schork, Alfonso Buil, Jonas Bybjerg-Grauholm, Marie Bækvad-Hansen, David M. Hougaard, Ole Mors, Merete Nordentoft, Thomas Werge, Christine S. Hansen, Manuel Mattheisen, Jonatan Pallesen, Carsten Bcker Pedersen, Marianne Giørtz Pedersen, Wesley K. Thompson, Preben Bo Mortensen, Esben Agerbo, Anders D. Børglum & Ditte Demontis
Center for Genome Analysis and Personalized Medicine, Aarhus, Denmark
Veera M. Rajagopal, Jakob Grove, Thomas D. Als, Manuel Mattheisen, Jonatan Pallesen, Esben Agerbo, Anders D. Børglum & Ditte Demontis
Centre for Integrative Sequencing, iSEQ, Aarhus University, Aarhus, Denmark
Veera M. Rajagopal, Jakob Grove, Thomas D. Als, Manuel Mattheisen, Jonatan Pallesen, Preben Bo Mortensen, Anders D. Børglum & Ditte Demontis
Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
Andrea Ganna & Mark J. Daly
Analytic and Translational Genetics Unit, Massachusetts General Hospital, Harvard Medical School, Boston, USA
Andrea Ganna, Claire Churchhouse, Mark J. Daly, Jacqueline Goldstein, Daniel P. Howrigan, Hailiang Huang, Alicia R. Martin, Benjamin M. Neale, Duncan S. Palmer, Timothy Poterba, Stephan Ripke, F. Kyle Satterstrom, Patrick Turley & Raymond K. Walters
Broad Institute, Cambridge, USA
Andrea Ganna, Claire Churchhouse, Mark J. Daly, Jacqueline Goldstein, Alicia R. Martin, Benjamin M. Neale, Timothy Poterba & F. Kyle Satterstrom
Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology, and Neuroscience, King’s College London, London, UK
Jonathan R. I. Coleman, Andrea Allegrini, Gerome Breen & Robert Plomin
National Institute of Health Research Maudsley Biomedical Research Centre, South London and Maudsley National Health Service Trust, London, UK
Jonathan R. I. Coleman & Gerome Breen
Department of Psychiatry, Pamela Sklar Division of Psychiatric Genomics and Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Georgios Voloudakis & Panos Roussos
Department of Genetics and Genomic Sciences and Institute for Genomics and Multiscale Biology, Icahn School of Medicine at Mount Sinai, New York, NY, USA
Georgios Voloudakis & Panos Roussos
James J. Peters VA Medical Center, Bronx, NY, USA
Georgios Voloudakis & Panos Roussos
Bioinformatics Research Centre, Aarhus University, Aarhus, Denmark
Jakob Grove, Carsten Bcker Pedersen & Marianne Giørtz Pedersen
The National Centre for Register-Based Research (NCRR), Aarhus University, Aarhus, Denmark
Henriette T. Horsdal, Liselotte Petersen, Christine S. Hansen, Wesley K. Thompson & Preben Bo Mortensen
Institute of Biological Psychiatry, Mental Health Services of Copenhagen, Copenhagen, Denmark
Vivek Appadurai, Andrew Schork, Alfonso Buil & Thomas Werge
Neurogenomics Division, The Translational Genomics Research Institute (TGEN), Phoenix, AZ, USA
Andrew Schork
Department of Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, USA
Cynthia M. Bulik & Joanna Martin
Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Cynthia M. Bulik
Department of Nutrition, University of North Carolina at Chapel Hill, Chapel Hill, USA
Cynthia M. Bulik & Christine S. Hansen
Department for Congenital Disorders, Statens Serum Institut, Copenhagen, Denmark
Jonas Bybjerg-Grauholm, Marie Bækvad-Hansen & David M. Hougaard
Psychosis Research Unit, Aarhus University Hospital-Psychiatry, Aarhus, Denmark
Ole Mors
Mental Health Center Copenhagen, Mental Health Services in The Capital Region of Denmark, Copenhagen, Denmark
Merete Nordentoft
Department Clinical Medicine, Faculty of Health Science, University of Copenhagen, Copenhagen, Denmark
Merete Nordentoft & Thomas Werge
Center for GeoGenetics, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
Thomas Werge, Carsten Bcker Pedersen & Marianne Giørtz Pedersen
Centre for Integrated Register-Based Research (CIRRAU), Aarhus University, Aarhus, Denmark
Preben Bo Mortensen & Esben Agerbo
Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA, USA
Rich Belliveau, Felecia Cerrato, Kimberly Chambert, Claire Churchhouse, Mark J. Daly, Ashley Dumont, Jacqueline Goldstein, Daniel P. Howrigan, Hailiang Huang, Julian Maller, Alicia R. Martin, Joanna Martin, Jennifer Moran, Benjamin M. Neale, Duncan S. Palmer, Timothy Poterba, Stephan Ripke, F. Kyle Satterstrom, Patrick Turley & Raymond K. Walters
Department of Psychology, Washington University in St. Louis, St. Louis, MO, 63130, USA
Caitlin E. Carey
Genomics Plc, Oxford, UK
Julian Maller
Vertex Pharmaceuticals, Abingdon, UK
Julian Maller
MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff, University, Cardiff, UK
Joanna Martin
Department of Psychiatry, Psychosomatics and Psychotherapy, University of Würzburg, Würzburg, Germany
Manuel Mattheisen
Department of Clinical Neuroscience, Karolinska Institutet, Stockholm, Sweden
Manuel Mattheisen
Department of Psychiatry and Psychotherapy, Charité-Universitätsmedizin, Berlin, Germany
Stephan Ripke
NORMENT-KG Jebsen Centre for Psychosis Research, University of Oslo, Oslo, Norway
Wesley K. Thompson
Division of Mental Health and Addiction, Oslo University Hospital, Oslo, Norway
Wesley K. Thompson
Behavioral and Health Genomics CenterCenter for Economic and Social Research, University of Southern, California, 635 Downey Way, Los Angeles, CA, 90089, USA
Patrick Turley

Authors

Veera M. Rajagopal
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Ganna
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan R. I. Coleman
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Allegrini
View author publications
You can also search for this author in PubMed Google Scholar
Georgios Voloudakis
View author publications
You can also search for this author in PubMed Google Scholar
Jakob Grove
View author publications
You can also search for this author in PubMed Google Scholar
Thomas D. Als
View author publications
You can also search for this author in PubMed Google Scholar
Henriette T. Horsdal
View author publications
You can also search for this author in PubMed Google Scholar
Liselotte Petersen
View author publications
You can also search for this author in PubMed Google Scholar
Vivek Appadurai
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Schork
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Buil
View author publications
You can also search for this author in PubMed Google Scholar
Cynthia M. Bulik
View author publications
You can also search for this author in PubMed Google Scholar
Jonas Bybjerg-Grauholm
View author publications
You can also search for this author in PubMed Google Scholar
Marie Bækvad-Hansen
View author publications
You can also search for this author in PubMed Google Scholar
David M. Hougaard
View author publications
You can also search for this author in PubMed Google Scholar
Ole Mors
View author publications
You can also search for this author in PubMed Google Scholar
Merete Nordentoft
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Werge
View author publications
You can also search for this author in PubMed Google Scholar
Preben Bo Mortensen
View author publications
You can also search for this author in PubMed Google Scholar
Gerome Breen
View author publications
You can also search for this author in PubMed Google Scholar
Panos Roussos
View author publications
You can also search for this author in PubMed Google Scholar
Robert Plomin
View author publications
You can also search for this author in PubMed Google Scholar
Esben Agerbo
View author publications
You can also search for this author in PubMed Google Scholar
Anders D. Børglum
View author publications
You can also search for this author in PubMed Google Scholar
Ditte Demontis
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

iPSYCH-Broad Consortium

Rich Belliveau
, Caitlin E. Carey
, Felecia Cerrato
, Kimberly Chambert
, Claire Churchhouse
, Mark J. Daly
, Ashley Dumont
, Jacqueline Goldstein
, Christine S. Hansen
, Daniel P. Howrigan
, Hailiang Huang
, Julian Maller
, Alicia R. Martin
, Joanna Martin
, Manuel Mattheisen
, Jennifer Moran
, Benjamin M. Neale
, Jonatan Pallesen
, Duncan S. Palmer
, Carsten Bcker Pedersen
, Marianne Giørtz Pedersen
, Timothy Poterba
, Stephan Ripke
, F. Kyle Satterstrom
, Wesley K. Thompson
, Patrick Turley
& Raymond K. Walters

Contributions

(1) V.R., A.B and D.D designed the study. (2) J.C., A.A., G.V., J.G., T.A., V.A., A.S., A.B., C.B., J.B., M.B., D.H., O.M., M.N., T.W., P.B., G.B., P.R., R.P., E.A. and A.B. provided data or contributed key roles in data generation. (3) V.R., A.G., J.C., A.A., G.V., J.G., T.A., H.H., L.P., V.A., A.S., A.B., J.B. and M.B. contributed to data analysis. (4) V.R., A.G., J.G., T.A., E.A., A.B. and D.D. interpreted the results. (5) V.R. and D.D. wrote the manuscript. (6) All the authors (V.R., A.G., J.C., A.A., G.V., J.G., T.A., H.H., L.P., V.A., A.S., A.B., C.B., J.B., M.B., D.H., O.M., M.N., T.W., P.B., G.B., P.R., R.P., E.A., A.B. and D.D.) reviewed and contributed to revising the manuscript. (7) E.A., A.B. and D.D. supervised the project. Author contributions are also reported in Supplementary Table 19. This manuscript does not include any information or images that could lead to the identification of the participants and require consent for publication.

Corresponding authors

Correspondence to Veera M. Rajagopal or Ditte Demontis.

Ethics declarations

Competing interests

(1) Ditte Demontis has received speaking fee from Takeda. (2) CM Bulik reports: Shire (grant recipient, Scientific Advisory Board member); Idorsia (consultant); Pearson (author, royalty recipient). (3) Other authors have no competing interests to declare.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Tables.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Rajagopal, V.M., Ganna, A., Coleman, J.R.I. et al. Genome-wide association study of school grades identifies genetic overlap between language ability, psychopathology and creativity. Sci Rep 13, 429 (2023). https://doi.org/10.1038/s41598-022-26845-0

Download citation

Received: 31 August 2022
Accepted: 21 December 2022
Published: 09 January 2023
DOI: https://doi.org/10.1038/s41598-022-26845-0

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.