Introduction

Attention-deficit/hyperactivity disorder (ADHD) is a common neurodevelopmental disorder that affects 5–10% of children and is characterized by excessive inattentive, hyperactive, and impulsive symptoms [1]. It is well-established that genetic factors contribute to ADHD liability; twin and family studies consistently estimate the heritability of ADHD at 70–80% [2,3,4,5,6,7]. More recently, the largest genome-wide association study (GWAS) of clinical ADHD to date identified the first genome-wide significant loci associated with ADHD, and estimated the proportion of phenotypic variance explained by measured single nucleotide polymorphisms (SNPs) at 22% (standard error (SE) = 0.01) [8]. Significant SNP-heritability has also been reported by the largest GWAS of ADHD symptoms in population-based samples (5%(SE = 0.06) to 34%(SE = 0.17)) [9], with a near-complete sharing of genetic risks reported across these different ADHD definitions (rg = 0.94(SE = 0.20)) [8].

In addition to evidence of a strong genetic liability, it is well known that ADHD is highly comorbid with other psychiatric conditions [10]. One important question is therefore the degree to which genetic risk implicated in ADHD is disorder-specific. Findings from population-based twin studies have reported moderate to strong genetic correlations between ADHD and other childhood psychiatric conditions, including autistic traits (rg range 0.54–87) [11], learning disabilities (rg range 0.31–41) [12,13,14], oppositional defiant and conduct problems (rg range 0.46–74) [15,16,17], anxiety (rg range 0.45–58), and depression (rg range 0.34–77) [11]. Molecular genetic studies also support a genetic overlap between ADHD and a broad range of psychiatric conditions. For example, ADHD polygenic risk scores (i.e., weighted sum scores of an individual’s estimated total burden of risk alleles associated with a phenotype (PRS)) [18] have been associated with lower educational attainment, cognition, and conduct problems [8, 19,20,21,22]. Mixed findings have been reported for ADHD PRS and associations with autism and depression [23,24,25,26,27,28].

Based on the extensive phenotypic and genetic overlap among psychiatric disorders, it has been suggested that comorbidity may be attributed to a general factor of psychopathology that increases risk for virtually all prevalent psychiatric conditions [29, 30]. Twin and sibling studies have shown that a single latent shared genetic factor can account for on average 45% of variance in childhood externalizing, internalizing, and phobia symptoms [29, 31], 31% of variance in childhood neurodevelopmental symptoms [32], and 22% of disorder liability in several clinical psychiatric diagnoses, including ADHD [33]. Further, the twin-based heritability of a latent general psychopathology factor has been estimated at 43% in one study [31] and the SNP-heritability to 18%(SE = 0.10) [34] and 38%(SE = 0.16) [35] in two separate population-based pediatric samples.

These findings suggest that the co-occurrence of childhood psychiatric conditions is, at least in part, due to shared common genetic risk variants. Given the extensive comorbidity in ADHD and shared genetic risks with other psychopathology, it can thus be hypothesized that a proportion of the genetic risk variants associated with ADHD in recent GWAS [8] might not be disorder-specific, but rather act to increase risk for general childhood psychopathology more broadly. However, we are not aware of any studies addressing this question using molecular genetic data. The aims of the current study were therefore to: (1) examine whether ADHD PRS are associated with a range of neurodevelopmental, externalizing, and internalizing symptom dimensions in a large general population sample, and (2) quantify the extent to which any observed associations between ADHD PRS and the aforementioned symptom dimensions can be attributed to a general childhood psychopathology factor.

Methods

Study population

The Child and Adolescent Twin Study in Sweden (CATSS) is an ongoing study targeting all 9-year-old (born after June 1995) and 12-year-old (born before July 1995) twins born in Sweden since July 1992. Parents were contacted for a telephone interview on the twins’ 9th or 12th birthdays. The overall response rate in CATSS is 80% [36]. Analyses comparing non-responders and responders have shown that participating families generally have higher socio-economic status, lower rates of parental psychiatric illness, and child clinical diagnosis for ADHD, autism spectrum disorder (ASD), and learning disabilities [36]. CATSS was approved by the ethics committee at Karolinska Institutet and all participants gave informed consent. The study has been described in detail elsewhere [36].

Genotyping and imputation

A total of 11,551 CATSS twins were genotyped using the Illumina Infinium PsychArray-24 BeadChip. Prior to analysis, stringent quality control (QC) procedures were performed on the genotyped markers and individuals using standardized procedures. After QC, 561,187 genotyped SNPs and 11,081 samples were retained. Genotypes for another 2495 monozygotic (MZ) twins were imputed from their genotyped co-twin, resulting in a sample size of 13,576 samples with genotype data. Details of the QC protocol, imputation, and principal components extraction are presented in supplementary note 1 and Figure S1. CATSS participants without available genetic data differed to those included in the current sample, in that they showed higher levels of parent-reported ADHD symptoms and clinical ADHD diagnosis were more likely to be male, and have parents with lower education levels (see supplementary Table S1).

Polygenic risk scores

ADHD polygenic risk scores (PRS) were generated in CATSS based on summary statistics from what should theoretically be the most powerful discovery sample available : a meta-analysis of the largest GWAS of clinically diagnosed ADHD (20,183 cases, 35,191 controls) [8] and the largest GWAS of ADHD symptoms (17,666 children from population-based samples). Details of the discovery sample are provided in supplementary note 2 [9]. We calculated standardized betas for each SNP, based on available z scores, effective sample size and allele frequency in the discovery GWAS [37]. After excluding individuals with parent-reported cerebral palsy, Down syndrome, brain injury, and chromosomal abnormalities (supplementary Figure S1), ADHD PRS were derived in CATSS from best-guess imputed genotypes across a range of seven p value thresholds (0.00001 ≤ PT ≤ 1). Indels, multi-allelic and symmetric/ambiguous SNPs were excluded. Autosomal SNPs with a minor allele frequency (MAF) ≥ 0.05 and good imputation quality (INFO score) ≥ 0.8 were clumped (linkage disequilibrium threshold R2 > 0.1, ± 1000 kb) using PLINK.v.1.9 [38]. Retained reference alleles were scored across the set of SNPs in PLINK (applying the command–score no-mean-imputation) using standard procedures [39, 40]. In line with previous publications, we used the PRS including SNPs at a threshold of PT ≤ 0.50 for the main analysis [20, 41]. Sensitivity analyses of PRS associations across other p value thresholds are presented in the supplementary materials.

Childhood psychiatric symptoms

Childhood psychiatric symptoms were assessed using the Autism-Tics, ADHD, and Other Comorbidities inventory (A-TAC). A-TAC is a 96-items questionnaire corresponding to DSM-IV definitions of childhood psychiatric disorders. Questions assess lifetime symptoms in relation to same-age peers [42]. We selected the 62 symptoms items measuring inattention (IA), hyperactivity/impulsivity (H/I), ASD, learning difficulties (LD), oppositional defiant disorder (ODD), conduct disorder (CD), depression (DEP) and anxiety (ANX). The A-TAC neurodevelopmental and externalizing scales have been validated, showing strong internal consistency and moderate to strong predictive validity [42,43,44,45]. A-TAC anxiety and depression items have not been validated and were only assessed in twins born from 1992 to 1997. For twins born after 1997, depression symptoms were instead assessed using the Short Mood and Feelings Questionnaire (SMFQ), a 13-item questionnaire measuring child depressive symptoms experienced in the last 2 weeks [46]. Anxiety symptoms were assessed using the Screen for Child Anxiety Related Emotional Disorders (SCARED), a 41-item questionnaire measuring symptoms experienced in the last three months across five anxiety dimensions: panic disorder (PD), generalized anxiety disorder (GAD), separation anxiety disorder (SAD), school anxiety (SA) and social phobia (SP) [47]. SMFQ and SCARED are validated questionnaires, with strong internal consistency and moderate predictive validity for clinical diagnoses [47,48,49,50,51].

All scales were rated according to three response categories: “no” (coded 0), “yes, to some extent” (coded 1), and “yes” (coded 2). As the A-TAC internalizing scales, SCARED, and SMFQ are not directly comparable measures, we used a split-sample approach based on the available internalizing assessments. The final sample sizes with genotype and phenotype data were 6603 (3483 unrelated individuals) for the A-TAC subsample, and 6854 (3634 unrelated individuals) in the SMFQ/SCARED subsample. Based on previous simulation studies, this sample size is more than adequate for estimating complex structural equation models (SEM) and PRS associations [52, 53].

Statistical analyses

We estimated associations between ADHD PRS and ADHD symptom dimensions (IA, H/I) and related neurodevelopmental (ASD, LD), externalizing (ODD, CD), and internalizing (DEP, ANX) symptom dimensions, using confirmatory factor analysis and regression analyses implemented via SEM. Path diagrams of the models are presented by subsample in Fig. 1.

Fig. 1
figure 1

Path diagram of the general factor model in the A-TAC subsample A and the SMFQ/SCARED subsample B. Path diagram for the general factor models, presented by study subsample. Latent factors are depicted as circles. For clarity, covariates (age, sex and the six principal components) and correlations across latent trait factors are omitted in the above graphical representation. The models consisted of a latent general psychopathology factor (GP) and specific latent trait factors reflecting symptoms dimensions of inattention (IA), hyperactivity/impulsivity (H/I), autism spectrum disorder (ASD), learning difficulties (LD), oppositional defiant disorder (ODD), conduct disorder (CD), depression (DEP), and anxiety (ANX) or panic disorder (PD), generalized anxiety (GAD), separation anxiety (SAD), school anxiety (SA), and social phobia (SP). Variances for all latent factors were fixed at 1. Measured variables are depicted as squares, and include the ADHD PRS and all symptoms items from the Autism-Tics, ADHD, and Other Comorbidities inventory (A-TAC), the Short Mood and Feelings Questionnaire (SMFQ), and the Screen for Child Anxiety Related Emotional Disorders (SCARED). Numbers 1…X indicate the number of symptom items loading onto each specific latent trait factor. β1–βx represent the regression coefficients, regressing each latent variable onto ADHD PRS. Note that the corresponding path diagrams for the correlated factors models are identical, excluding the general psychopathology factor

We first fitted a correlated factors model where symptoms from each subscale were set to load onto a corresponding single latent trait factor. All the latent trait factors were allowed to correlate. In the A-TAC subsample, a correlated factors model with eight latent trait factors was fitted, corresponding to symptoms dimensions of IA, H/I, ASD, LD, ODD, CD, DEP, and ANX. In the SMFQ/SCARED subsample, we fitted a correlated factors model with 12 latent trait factors, including the first six factors outlined above, one DEP factor measured via SMFQ, and five latent anxiety factors corresponding to the SCARED subscales of PD, GAD, SAD, SA, and SP. Previous studies of the SCARED have shown that this five-factor structure has the best psychometric properties for the questionaire [54,55,56].

Second, we fitted a general factor (or bifactor) model, which in addition to the latent trait factors, included a general psychopathology factor. The general factor model quantifies the extent to which covariance among symptom dimensions reflects both a general factor (on which all assessed symptoms load), and a number of specific latent trait factors (on which only a subset of the symptoms load) [57, 58]. Correlations between the specific latent trait factors and the general factor are fixed at zero, whereas correlations between the specific latent trait factors are free to vary.

In both models, the latent factors were regressed on ADHD PRS using SEM, with sex, age and the first six PCs (to account for population stratification) included as covariates. We evaluated whether the models provided a good fit to the underlying data using the comparative fit index, and the root mean square error of approximation (RMSEA) [59]. A likelihood ratio test was used to test whether the exclusion of the general psychopathology factor from the correlated factors model led to a significant decrease in model fit. To account for the non-independence of twin data, family clusters were specified and standard errors were estimated using a sandwich estimator. All models were run using Mplus [60].

Sensitivity analyses

We performed several sensitivity analyses to test the robustness of results from the general factor model (see supplementary note 3 for details). First, to test whether observed associations were driven by ADHD cases, we re-ran analyses excluding children with an ADHD diagnosis (ICD and/or ≥ 8 A-TAC DSM-based symptoms). For completeness, we present the number children with an ICD diagnosis corresponding to the any of the assessed symptoms dimensions in supplementary Table S2. Second, we excluded one twin in every monozygotic pair to confirm that estimates were not inflated by the inclusion of genetically identical individuals. Third, we tested for sex differences in the association between ADHD PRS and the latent factors. Finally, we tested whether ADHD PRS showed similar associations with the latent factors across a range of seven p value thresholds (0.00001 ≤ PT ≤ 1).

Code availability

Computer codes are available upon request from the corresponding author.

Results

Correlated factors model

The correlated factors model fit the data well in both subsamples (CFI > 0.94 and RMSEA < 0.02) [59] (Table 1). All symptoms loaded positively and significantly onto their corresponding latent trait factor. Standardized factor loadings are reported in Table 2. Neurodevelopmental, externalizing, and internalizing latent factors were moderately to strongly correlated in the A-TAC subsample (mean r = 0.64, range = 0.44–83). Similar correlations were found in the SMFQ/SCARED subsample across neurodevelopmental, externalizing and the depression latent factors (mean r = 0.62, range = 0.40–84), whereas correlations with the latent anxiety factors were generally weaker (mean r = 0.40, range = 0.11–65). Notably, SP showed a particularly weak correlation with H/I (r = 0.11), and overall lower correlations with all latent factors (mean r = 0.33, range = 0.11–52). Correlations across the latent trait factors, before and after regression of PRS and covariates, are presented in supplementary Figures S2S3.

Table 1 Model fit for the correlated factors model and the general factor model in the A-TAC subsample and the SMFQ/SCARED subsample
Table 2 Standardized factor loadings from the correlated factors model and the general factor model in the A-TAC subsample and the SMFQ/SCARED subsample

In both subsamples, higher ADHD PRS were statistically significantly associated with higher symptom levels in all latent neurodevelopment, externalizing, and depression factors, after adjusting for covariates. ADHD PRS was not statistically significantly associated with any of the latent anxiety factors, with the exception of the latent PD factor (β = 0.06, p = 0.011). Standardized regression results for PRS p value threshold ≤ 0.5 are reported in Table 3, and results across p value thresholds in supplementary Figures S4S5.

Table 3 Association between ADHD PRS and latent trait factors in the correlated factors model and the general factor model (PRS p value threshold < 0.5)

General factor model

The general factor model also fit the data well in both subsamples (CFI > 0.96, RMSEA < 0.02). Furthermore, omitting the general psychopathology factor resulted in a statistically significant decrease in model fit (Table 1). Standardized factor loadings are presented in Table 2. In both subsamples, all symptoms loaded positively and significantly onto the general psychopathology factor. Mean loadings were strongest for neurodevelopmental symptoms (A-TAC subsample mean loading = 0.64, range = 0.39–91: SMFQ/SCARED subsample mean loading = 0.64, range = 0.40–91), slightly lower for externalizing symptoms (A-TAC subsample mean loading = 0.51, range = 0.32–60: SMFQ/SCARED subsample mean loading = 0.49, range = 0.32–62) and weakest for internalizing symptoms (A-TAC subsample mean loading = 0.41, range = 0.24–61: SMFQ/SCARED subsample mean loading = 0.32, range = 0.09–66). The general psychopathology factor explained 56% of the covariance across traits (explained common variance) in the A-TAC subsample and 40% in the SMFQ/SCARED subsample [61]. Correlations across the specific latent trait factors were attenuated in the general factor model. Notably, correlations between LD and all other factors were attenuated towards the null or showed an inverse association. Further, SP became significantly negatively correlated with IA and H/I (supplementary Figures S2S3).

In both subsamples, higher ADHD PRS were significantly associated with the general psychopathology factor (β = 0.09–10, p < 0.0001), explaining ~ 1% of the variance in the general factor after adjusting for covariates. After accounting for covariance across all symptoms via the general psychopathology factor, the association between ADHD PRS and the specific latent H/I factor remained significant in both subsamples (β = 0.06–8, p < 0.0001), explaining 0.37–69% of the variance in the specific H/I factor after adjusting for covariates. In the SMFQ/SCARED subsample, we also observed a significant negative association between ADHD PRS and the specific latent SP factor (β = −0.05, p = 0.004). Standardized regression results for ADHD PRS p value threshold ≤ 0.5 are reported in Table 3, and across the range of p value thresholds in supplementary figures S4S5.

Sensitivity analyses

After excluding ADHD cases, ADHD PRS remained statistically significantly associated with the general psychopathology factor and the specific H/I factor in both subsamples. Results excluding one MZ twin per pair did not differ markedly from the main analyses. Analyses testing for sex differences in the association between ADHD PRS and the latent factors showed a similar pattern of results as the main analyses, although PRS associations were generally stronger in males (general factor R2 = 1.00–14%: H/I R2 = 0.81%), than in females (general factor R2 = 0.36–81%: H/I R2 = 0.03–1.00%).

Discussion

Results from this study show that common GWAS variants that increase the risk for ADHD as captured by PRS, are not only associated with ADHD symptoms (R2 = 0.83–1.69%) in an independent population-based sample, but also with a range of childhood neurodevelopmental (R2 = 0.40–0.53%), externalizing (R2 = 0.41–1.19%), and to a lesser extent, internalizing symptom dimensions (R2 = 0–0.41%). Importantly, when modeling the shared variance across these symptom dimensions, we found that the associations were largely accounted for by a general childhood psychopathology factor. The significant association between ADHD PRS and a general psychopathology factor (R2 = 0.86–1.06%) suggests that a considerable portion of the genetic variants associated with ADHD, that are captured by PRS and shared with other measures, reflect a non-specific genetic liability toward broad childhood psychopathology.

Beyond the association between ADHD PRS and a general psychopathology factor, results also showed a unique association between ADHD PRS and specific H/I (R2 = 0.37–0.69%). About 2/3 of the association between ADHD PRS and H/I could be attributed to general variance shared across childhood psychopathology symptoms, and ~ 1/3 to variance specific to hyperactivity/impulsivity. These finding were robust across both subsamples and provide important molecular genetic confirmation of results from previous twin studies, showing that whilst a substantial proportion of genetic influences on ADHD symptoms are shared with a general psychopathology factor, there are also ADHD-specific genetic influences [29, 32, 33].

In contrast, there was no significant association between ADHD PRS and specific inattention after accounting for covariance across all symptom dimensions via the general factor. There are several potential reasons for the differential pattern of ADHD PRS association across ADHD symptom dimensions; it is possible that hyperactive/impulsive symptoms are stronger drivers of ADHD diagnosis, leading to an overrepresentation of combined and primarily hyperactive/impulsive ADHD cases in the clinical discovery GWAS. The lack of association between ADHD PRS and specific inattention is likely also explained by the fact that the majority of the inattentive symptoms loaded very strongly onto the general psychopathology factor, leaving little variance in the specific inattention factor. This may suggest that inattention is phenotypically and genetically more closely linked to a general liability for childhood psychopathology, or it may reflect measurement properties of the A-TAC. To disentangle these explanations, results would need to be replicated in a different sample using other measures and/or raters.

In the correlated factors model, we found significant ADHD PRS associations with neurodevelopmental, externalizing, and depression symptom dimensions, but not with anxiety. Anxiety symptoms also showed the weakest loadings onto a general factor, both when measured by A-TAC and SCARED. These results suggest that anxiety may be less genetically associated with ADHD in childhood, as compared with other psychopathology dimensions. Such a conclusion is generally consistent with findings from twin and molecular studies [11]. Nonetheless, it is also possible that parent ratings do not fully capture variation in child anxiety symptoms at this age [51], which may in part explain the observed overall weaker associations seen for anxiety. Somewhat surprisingly, we observed a significant negative association between ADHD PRS and specific SP in the general factor model in the SMFQ/SCARED subsample. SP was also the only latent anxiety factor to show a significant negative association with ADHD symptoms in the general factor model. This suggests that the negative association between ADHD PRS and SP may reflect the previously reported tendency for internalizing and externalizing symptom dimensions to become inversely associated in general factor models [30, 62, 63].

Analyses stratified by sex generally showed stronger PRS associations in males relative to females, particularly for associations with the general psychopathology factor and specific hyperactivity/impulsivity. Although this could reflect sex-specific genetic differences, it is more likely to be explained by lower levels of hyperactivity/impulsivity, neurodevelopmental, and externalizing symptoms in females [64].

Finally, by demonstrating that PRS derived from a GWAS meta-analysis of clinically diagnosed and population trait ADHD are also linked to a wide range of childhood psychopathology problems in the general population, this study further highlights the utility of recently developed multivariate GWAS methods [65,66,67]. Possibilities for joint analysis of GWAS data across psychiatric conditions, and across clinical and population samples, are important not only to boost power, but also to identify genetic factorswhich influence broader psychopathology dimensions [57].

Limitations

Results from this study must be interpreted in the context of the study limitations. First, we relied on parent ratings of childhood psychiatric symptoms, which may inflate cross-trait covariance and possibly lead to overestimation of the general psychopathology factor [62]. Although we cannot exclude this possibility, loadings and variance explained by the general psychophatology factor in this study was generally in line with previous findings using multiple raters [62] or register-based clinical diagnoses [33]. Second, the current study is cross-sectional. Although there is strong evidence for genetic stability in ADHD [23, 68], the phenotypic expression of psychiatric symptoms changes across development, meaning that the pattern and strength of ADHD PRS associations may also differ with age [23, 41]. Third, families who participated in CATSS generally had lower rates of psychiatric disorder compared with non-responders [36]. Further, the current study sample had significantly lower rates of ADHD compared with CATSS participants who did not provide genotype data (supplementary Table S2). It is therefore likely that children with higher levels of psychopathology and genetic load for ADHD were underrepresented in this study, which may have attenuated the estimated associations. Fourth, the trait variance explained by PRS is generally small; in the recent clinical ADHD GWAS, ADHD PRS accounted for only ~ 5.5% of the variance in case–control ADHD status [8]. Genetic cross-disorder overlap identified in secondary PRS studies is typically even smaller (< 1% variance explained), and this study is no exception. Thus, our findings very likely do not reflect the total genetic overlap between ADHD and related childhood psychopathology. As the predictive utility of PRS are largely a function of the power of GWAS discovery samples from which the scores were derived, association testing with PRS and comparison of results across PRS studies will likely improve as GWAS discovery sample sizes increase [40, 53]. Such developments have been seen for schizophrenia and in other areas of medicine [53, 69]. Finally, PRS analyses do not distinguish the specific genetic loci driving the observed PRS-trait associations. We were therefore unable to separate genetic variants underpinning the association with specific hyperactivity/impulsivity, from those associated with a general liability towards childhood psychopathology.

Conclusion

Results from this study indicate that genetic risk variants associated with ADHD and captured by PRS also influence a more a general genetic liability toward broad childhood psychopathology. Beyond contributing to shared genetic liability, ADHD PRS also seem to capture genetic risk with stronger and/or unique effects on hyperactivity/impulsivity. Our findings emphasize the utility of adopting a more dimensional, multivariate framework, and the need to account for the inter-related nature of psychiatric conditions when studying the genetic architecture of childhood psychopathology [57].