Introduction

Attention deficit hyperactivity disorder (ADHD) is one of the most common neurodevelopmental disorders, with a prevalence rate of approximately 7% in children and 4.4% in adults1,2,3. ADHD is associated with inattention, impulsivity, hyperactivity, and an array of cognitive deficits with varying degrees of severity4,5. For instance, as compared to healthy controls, individuals with ADHD showed lower capacity of working memory6,7,8,9, a key cognitive function that is closely associated with fluid intelligence10,11,12. Extensive research has focused on identifying the neural markers of ADHD, and many studies demonstrated gray matter volume (GMV) reduction in cortical and subcortical structures, including the basal ganglia (BG), in adults and children with ADHD13,14.

A number of meta-analyses aimed to investigate the consistency of the volumetric reductions in ADHD. ADHD relative to typically developing children showed global volumetric reductions, with the cerebellum, splenium of the corpus callosum, and right caudate showing the most significant differences15. Other studies identified GMV reduction more specifically in the BG16,17, and the volumetric deficits appeared to normalize with age and treatment with stimulant medications17. A meta-analysis of 11 studies comprising 320 ADHD cases and 288 controls showed GMV reduction in the right globus pallidus and putamen, as well as bilateral caudate in children with ADHD, and GMV reduction in the anterior cingulate cortex (ACC) in adults with ADHD18. A higher percentage of treated relative to non-treated participants were associated with fewer deficits, suggesting a remediating effect of the medications. The ENIGMA ADHD Working Group reported smaller volumes of the accumbens, amygdala, caudate, putamen, and hippocampus in ADHD cases at 4–63 years of age, relative to controls19. In particular, the effect sizes were highest in children, whereas case–control differences were not present in adults. Thus, it appears that regional volume reduction is more prominent in children than in adults with ADHD and neural development and/or chronic treatment with medications may account for the differences.

On the other hand, many other studies have described smaller volumes in the frontal cortex, temporal cortex, insula, and cerebellum in children or adults20,21,22,23,24,25,26,27,28,29 with ADHD but did not necessarily implicate the BG. To confirm BG volumetric reduction as a neural marker of ADHD would require the investigation of a large sample of children within a limited age range with the absence of the effects of prolonged medications.

Previous work has also studied sex differences in ADHD traits and GMVs in association with ADHD. ADHD is more commonly observed in boys than girls30,31. Earlier studies have suggested that boys with ADHD showed lower GMVs in the ACC and BG, whereas girls with ADHD exhibited higher GMV in the ACC and no difference in the BG, as compared to typically developing children32,33,34. Therefore, it would be critical to consider sex differences in examining volumetric reduction in the BG and other cortical and subcortical structures as neural markers of ADHD.

ADHD is highly heritable35, with 60–80% of variance in ADHD risk attributed to genetic effects36,37,38,39,40,41. Previous work has also suggested the heritability of the volumetric markers of ADHD. Alterations in total and BG GMV have been observed not only in individuals with ADHD but also in their unaffected siblings and relatives, suggesting genetic and shared environmental risks42,43. Individuals with ADHD showed reduction in total and regional cortical GMVs, and their unaffected siblings showed intermediate GMVs between individuals with ADHD and healthy controls25,44. Indeed, recent studies have aimed specifically to characterize the molecular genetic bases of the volumetric features of ADHD45,46,47,48. Studies have reported evidence of global pleiotropy for variants affecting ADHD risk for total intracranial and subcortical regional volumes49. Another ENIGMA study also showed shared genetic heritability with brain structural correlates but did not report the exact heritability estimates of ADHD or of the structural correlates50.

In addition, affected twins showed significantly smaller caudate than their unaffected co-twins51, suggesting unique environmental influences. Further, genetic and environmental risk factors may contribute to distinct regional volume losses52, and the sex differences in ADHD symptoms may result from the different genetic and environmental influences on brain structures53.

The present study aimed to examine sex- shared and specific volumetric markers of ADHD traits for the whole brain. Because of the focus of previous studies on the BG, we also performed region-of-interest (ROI) analyses specifically of the caudate, putamen, and pallidum. Using a large data set of children (9–10 years) of the Adolescent Brain Cognition Development (ABCD) study, we employed voxel-based morphometry to identify GMV correlates of ADHD in girls and boys combined as well as separately. We evaluated how the volumetric deficits contribute to cognitive dysfunction, as evaluated in an N-back task, and assessed sex differences in the relationships between volumetric and cognitive deficits. Finally, we estimated the heritability of the neural and behavioral markers and examined sex differences in the heritability estimates. The overall goal was to characterize structural brain alterations during an early and prodromal stage of ADHD, as with many of the ENIGMA studies of ADHD19,54, and to inform longitudinal research of later releases of the ABCD data.

Methods

Dataset

The ABCD Release 2.0 cohort comprised 11,601 children; however, 99 subjects were not included in the current study because of questionable image quality or poor image segmentation (details in the section on Voxel-based morphometry). Thus, the current sample consisted of 11,502 subjects (5,464 girls, age 9–10 years). Of the 11,502 children, 634 and 2,826 were identified as monozygotic twins (MZ) and dizygotic twins/siblings (DZ), respectively. The ABCD data were collected from 21 research sites across the country. Children have been participating at baseline and follow-up assessments, which will continue over a period of 10 years. The consortium workgroups established standardized assessments of physical and mental health, neurocognition, substance use, culture, and environment, as well as multimodal structural and functional brain imaging and bioassay protocols. Structural magnetic resonance imaging (MRI) data were acquired using an optimized protocol for 3 T machines, including Siemens Prisma, GE 750 and Philips, with voxel size 1 mm isotropic55.

Assessments and cognitive test

The children were assessed with the ABCD Parent Child Behavior Checklist (CBCL) from the Achenbach System of Empirically Based Assessment56 for dimensional psychopathology and adaptive functioning. The CBCL is widely used to identify ADHD and other problematic behaviors in children57,58. We retrieved the “cbcl_scr_dsm5_adhd_t” values with the t-score in the DSM-5 scale (ADHD score, hereafter) and normed by sex, age, informant, and ethnicity59. Seven ADHD items in the CBCL DSM-oriented scale were used in the ABCD study: #cbcl_q04_p - Fails to finish things they start; #cbcl_q08_p - Can't concentrate, and can't pay attention for long; #cbcl_q10_p - Can't sit still, restless, or hyperactive; #cbcl_q41_p - Impulsive or acts without thinking; #cbcl_q78_p - Inattentive or easily distracted; #cbcl_q93_p - Talks too much; and #cbcl_q104_p - Unusually loud. Each item’s response was rated on a scale of 0 = Not True, 1 = Somewhat or Sometimes True, and 2 = Very True or Often True. The sum of raw scores was transformed to normalized T score (range 50–70) based on the percentiles of national normative sample of nonreferred populations60. Investigators have used different CBCL cutoff scores for a diagnosis of ADHD58. Here, for instance, with a cutoff of 65 (ADHD T score), 451 (7.5%) boys and 272 (5%) girls would be considered to have a diagnosis of ADHD. With a cutoff of 70 or the 98th percentile of national normative samples,193 (3.2%) boys and 75 (1.4%) girls would be considered to have a diagnosis of ADHD.

Participants performed an N-Back task55,61. Briefly, there were four blocks each with 2- and 0- back conditions in each run, with a total of 2 runs. Participants were required to respond to a set of stimuli of emotional faces or places. Each block consisted of 10 trials (2.5 s each) and 4 fixation trials (15 s each). At each trial, participants were instructed to respond to whether the picture was a “match” or “no-match” of a pre-specified target (0-back) or the stimulus shown two trials back (2-back). Here, we employed the accuracy rate of 2-back trials as a measure of the capacity of working memory.

Customized pediatric template construction

In order to perform voxel-based morphometry with appropriate templates, we constructed customized tissue probability maps (TPMs) and DARTEL templates62, as well as an average T1 anatomical template for visualization63. A cohort of 1000 children (500 girls) was selected from the ABCD dataset according to the following procedure. We generated 10,000 random samples with 1000 children (half girls) and selected the one with age and scan site distributions closest to those of the entire cohort. We used SPM Segment to generate the individual’s tissue maps and the TOM8 Toolbox (http://dbm.neuro.uni-jena.de/software/tom) to create the population TPMs and T1 anatomical template, controlling for the effects of age and sex64. DARTEL templates were constructed using utilities available in SPM. This involved creating gray (rp1) and white (rp2) matter segments after affine registration followed by DARTEL nonlinear image registration, whereby all selected images were iteratively aligned with a template generated from their own mean, and finally normalized to the MNI space (ICBM template).

Voxel-based morphometry (VBM)

We implemented the VBM analysis to quantify regional GMVs with the CAT12 toolbox (http://dbm.neuro.uni-jena.de/vbm/). The details of VBM analysis have been described in our previous studies65,66. The VBM analysis identifies differences in the local composition of brain tissues, accounting for large-scale variation in gross anatomy and location. The analysis includes spatially normalizing individuals’ structural images to the same stereotactic space, segmenting the normalized images into distinct brain tissues, and smoothing the gray matter images. We used the raw images to avoid potential interference with the CAT12 preprocessing pipeline. T1-images were first co-registered to the MNI template using a multiple-stage affine transformation during which the 12 parameters were estimated. Co-registration was performed with a coarse affine registration using mean square differences, followed by a fine affine registration using mutual information. Coefficients of the basis functions that minimized the residual squared difference between the individual image and the template were estimated. Our custom TPMs constructed from 1,000 ABCD children were used in the initial affine transformation. T1 images were then preprocessed with spatial-adaptive non-local means denoising filters67 as well as Markov random fields, corrected for intensity bias field and segmented into cerebrospinal fluid, gray, and white matter68. Segmented and the initially registered tissue class maps were normalized using DARTEL62, a fast diffeomorphic image registration algorithm of SPM. As a high-dimensional non-linear spatial normalization method, DARTEL generates mathematically consistent inverse spatial transformations. We used our custom DARTEL template in MNI space, as constructed from 1,000 ABCD children, to drive DARTEL normalization. Skull-stripping and final clean-up were performed with default parameters in the CAT12 to remove remaining meninges and correct for volume effects in some regions. In particular, skull-stripping was performed by refining the probability maps of SPM using adaptive probability region-growing, and the final clean-up routine consisted of morphological, distance and smoothing operations after the final segmentation. Normalized GM maps were modulated to obtain the absolute volume of GM tissue corrected for individual brain sizes. Finally, the GM maps were smoothed by convolving with an isotropic Gaussian kernel (FWHM = 8 mm).

Quality check of images was performed visually and quantitatively with tools available in the CAT12 toolbox69. One axial slice (z = 0) per subject was plotted and visually checked (option “Display slices”), and outliers were detected by computing the voxel-wise cross-correlation of GM density across subjects (option “Check sample homogeneity”). A total of 47 subjects presented clearly faulty segmentation of brain tissues and were removed from the group analyses. The faulty segmentation likely resulted from poor contrast or artifact of the structural images or abnormal brain shapes. Additionally, for each subject, pairwise correlations were computed voxel-wise between the subject's GMV and all the other subjects' GMV. The mean correlation represented how similar the subject's GMV was to the rest of the sample. Fifty-two subjects with a mean correlation < 0.70, suggesting a higher variance, were also removed.

Group analyses

In group analyses, we first compared age (in months) between girls and boys. We used two-sample t tests to sex differences in ADHD scores and 2-back accuracy rates. We computed correlations between 2-back accuracy rates and ADHD scores in girls and boys separately. Slope tests were used to assess sex differences in the correlations.

For the GMV data, we first examined sex differences in the whole-brain GMV using a two-sample t test with age (in months), total intracranial volume (TIV), ADHD score, study site, and scanner model as covariates. We performed a whole-brain linear regression against the ADHD score in girls and boys combined, as well as in girls and boys separately, with age, TIV, study site, and scanner model as covariates. We also performed the same whole-brain linear regression analyses without including TIV as a covariate. The results were evaluated with a voxel p < 0.05, corrected for family-wise error (FWE) of multiple comparisons, based on Gaussian random field theory as implemented in the SPM. Clusters were overlaid on the custom MRI template obtained from the 1,000 ABCD children. Effect sizes were computed using tools available in the CAT12, by approximating Cohen’s d70 from the t-statistics using the expression \(d=\frac{2t}{\sqrt{df}}\) as employed in the study of Kleber et al.71. The effect sizes of two-sample t tests were computed according to the equivalence \(d=t\sqrt{\frac{1}{{n}_{1}}+\frac{1}{{n}_{2}}}\) given the sample sizes \({n}_{1}\) and \({n}_{2}\) of the two groups72. Customized computations were implemented and verified with the equivalent effect size (Hedge’s g) calculated using the MES toolbox (https://github.com/hhentschke/measures-of-effect-size-toolbox)73.

For the ROIs identified from linear regressions in girls or boys alone, we tested sex differences in the correlation directly with slope tests, with the same set of covariates and showed two-tailed p values74. Note that the analysis did not represent “double-dipping”, as the slope tests may confirm or refute sex differences75,76,77,78,79. This is because the regression maps were identified with a threshold and for example, a cluster showing correlation in boys could show a correlation that just missed the threshold in girls; thus, slope tests were needed to examine whether the correlations were indeed different between the sexes. Sex differences in the extent to which the GMVs correlated with ADHD scores could also be tested in a linear mixed-effects model with an interaction term. We conducted the analyses in SPSS 26.0, with sex, ADHD score, and sex × ADHD score as fixed effects, twin/sibling status as a random effect variable in the model, after controlling for age, race, study site, scanner model, and TIV.

A total of 1,052 children received psychotropic medications (some multiple medications) at the time of the study (psychostimulants: n = 941; antidepressants: n = 176; other psychotropics, including those for epilepsy: n = 25). Children receiving stimulant treatment likely had more significant ADHD symptoms and, as expected, demonstrated significantly higher ADHD scores than unmedicated children (see “Results” Section). We conducted a covariance analysis followed by Tukey’s range test to examine the effects of medication on the BG GMVs. Besides, we performed an additional set of whole-brain regression analyses on ADHD scores in medication-naïve children only.

Heritability of ADHD score and volumetric correlates

Of the current sample, 634 and 2,826 were identified as monozygotic twins (MZ) and dizygotic twins/siblings (DZ), respectively. Thus, we examined whether the ADHD scores, as well as the volumetric correlates, were more significantly related in MZ, as compared to DZ and unrelated pairs (UR), and in DZ as compared to UR. To this end, we computed the Pearson’s correlations each for the ADHD scores, 2-back accuracy rates as well as the volumetric correlates for the MZ and DZ pairs. For the UR, pairs of children were randomly constructed by shuffling and splitting the sample into halves. This procedure was repeated 100 times, and the mean regression lines were computed. For the correlations in MZ and DZ pairs, 95% confidence intervals (CI) were also calculated. We performed slope tests, pairwise, to examine the differences between MZ, DZ and UR. The details of correlation analyses have been described in one of our previous studies65. The analyses of the GMVs were done for the clusters combined of those identified with positive and negative correlations, respectively, from all subjects and of the “girl-specific” and “boy-specific” correlates.

For girls and boys combined and separately, we used Mplus 8 to compute the genetic influence (heritability estimates), shared environmental influence, and unique environmental influence80,81 for the ADHD scores and 2-back accuracy rates based on univariate ACE models82, with age, race, and study site as covariates. We also estimated univariate ACE models for the volumetric correlates, with scanner model and TIV as additional covariates. Only same-sex DZ pairs were included for the analyses of girls and boys separately. The ACE model decomposes the observed variance into additive genetic factors (A), also known as heritability, shared environmental factors (C), and unique environmental factors (E), in addition to measurement errors83. The correlation between the additive genetic variance is fixed to 1.0 for MZ and 0.5 for DZ. The correlation between the shared environmental variance is set to 1.0 for both MZ and the DZ based on the equal environment assumption. The correlation between the non-shared or unique environmental variance is set to zero. The expected variance-covariance matrices within the MZ and DZ are as follows:

$$Cov{\text{MZ }} = \left[ {\begin{array}{*{20}c} {a^{2} + c^{2} + e^{2} } & {a^{2} + c^{2} } \\ {a^{2} + c^{2} } & {a^{2} + c^{2} + e^{2} } \\ \end{array} } \right]$$
$$Cov{\text{DZ }} = \left[ {\begin{array}{*{20}c} {a^{2} + c^{2} + e^{2} } & {0.5a^{2} + c^{2} } \\ {0.5a^{2} + c^{2} } & {a^{2} + c^{2} + e^{2} } \\ \end{array} } \right]$$

where a, c, and e represent the path coefficients for the A, C, and E factors, respectively84. The variance of A, C, and E was estimated using the maximum likelihood method on the variance–covariance matrices and 95% CI of A, C, and E were computed: 95% CI = mean ± 1.96 × standard error, with the assumption that the population standard deviation is a known value. Values of χ2/df, root-mean-square error of approximation (RMSEA), and Tucker-Lewis index (TLI) were used as model fit indices. A χ2/df < 2, an RMSEA < 0.06, or a TLI > 0.95 indicates good fit. Genetic variance (a2) < 0.30, 0.30–0.60, and > 0.60 is considered low, moderate, and high, respectively85. We performed two-sample t tests to compare girls and boys in the genetic variance of the ADHD score, sex-specific volumetric correlates, and 2-back accuracy.

Because of the focus on BG, we also estimated the ACE models of the caudate, putamen and pallidum GMVs with the AAL masks for girls and boys combined and separately, with the same set of covariates. Likewise, we performed two-sample t tests to examine sex differences.

Ethical approval and informed consent

We have obtained permission from the ABCD to use the Open and Restricted Access data for the current study. All methods were performed in accordance with ABCD Bioethics and Medical Oversight Guidelines and Procedures. The ABCD data is publicly available at https://abcdstudy.org/. The ABCD data repository grows and changes over time. The data used in this report (NDA project ID 2573, Release 2.0) came from http://dx.doi.org/10.15154/1503209. All recruitment procedures and informed consent forms, including consent to share de-identified data, were approved by the Institutional Review Board of the University of California San Diego with the study number 160091. Informed consent from parent and assent from child under 18 years old was obtained from all individuals prior to the study.

Results

ADHD scores and 2-back accuracy rates

The distributions of ADHD T and raw scores of girls and boys are shown in Supplementary Fig. S1. Girls and boys were different in age (in months), with boys older than girls (mean ± SD: 119.10 ± 7.50 vs. 118.80 ± 7.40 months, t = 2.20, p = 0.028; two-tailed two-sample t test). Boys showed significantly higher ADHD scores than girls (53.61 ± 5.98 vs. 52.71 ± 5.08; t = 8.66, p < 0.001; two-tailed t test with age, race, twin status, and study site as covariates). However, boys and girls showed the same range in ADHD scores (50 to 80) with comparable variability (coefficient of variation: 0.111 for boys and 0.097 for girls).

Boys showed a significantly higher 2-back accuracy rate than girls (75.60 ± 15.09% vs. 73.67 ± 13.98%; t = 6.13, p < 0.001; two-tailed t test with the same covariates). Two-back accuracy rates were significantly and negatively correlated with ADHD scores in boys (r = − 0.133, p < 0.001) and in girls (r = − 0.116, p < 0.001). A slope test showed that boys and girls did not differ significantly in the correlations (t = 0.399, p = 0.690).

Sex differences in GMVs

We first examined sex differences in GMV with a two-sample t test controlling for age (in months), ADHD score, study site, scanner model, and TIV. As shown in Supplementary Fig. S2, boys relative to girls showed higher GMVs nearly across the entire brain (p < 0.05, FWE corrected). Girls relative to boys showed higher GMVs in relatively small clusters in the left caudate head (x = − 6, y = 13, z = 16, 520 voxels, T = 6.94), bilateral inferior frontal gyri (x = 33, y = 16, z = 24, 1,407 voxels, T = 6.81; x = − 34, y = 13, z = 24, 1,180 voxels, T = 6.15) and right intraparietal sulcus (x = 35, y = − 36, z = 35, 2,377 voxels, T = 7.94). The unthresholded map is shown in Supplementary Fig. S5a.

GMV correlates of ADHD scores

We examined regional volumetric correlates of ADHD scores in linear regressions for all subjects and girls and boys separately, all with age (in months), study site, scanner model, and TIV as covariates. Brain regions with GMVs in correlation with ADHD scores are shown in Fig. 1 and the clusters are summarized in Supplementary Table S1 (p < 0.05, FWE corrected). Overall, in boys and girls combined, ADHD scores were positively correlated with a single cluster in the area of the hypothalamus (x = − 5, y = − 13, z = − 9, voxel Z = 6.88, 689 voxels). The inset of Fig. 1 shows the cluster overlaid on a hypothalamus mask86,87. In contrast, higher ADHD scores were associated with the reduction of GMVs in an extensive array of brain regions, including bilateral caudate and putamen, orbitofrontal cortex (OFC), visual cortex, somatomotor cortex, and temporal cortex. The unthresholded map is shown in Supplementary Fig. S5b.

Figure 1
figure 1

GMV correlates of ADHD scores for girls and boys combined (top panels), and for girls (middle) and boys (bottom), separately. Whole-brain regression, p < 0.05, FWE corrected. Color bars show voxel T values (left, cool colors to indicate negative correlation) and the corresponding effect sizes (right, Cohen’s d). A single cluster in the area of the hypothalamus showed a positive correlation with ADHD scores in girls and boys combined; the inset shows the cluster overlaid on a hypothalamus mask (cyan) with overlapping voxels highlighted in dark green. For the clusters identified of girls and boys separately, we used exclusive masking to highlight those that appeared specific to girls (pink) and to boys (light green). Overall, boys relative to girls showed more widespread and significant GMV reductions in relation to ADHD scores.

Except for the hypothalamus, 2-back accuracy rate (%) were positively correlated with the volumetric correlates identified of boys or girls or of all three BG subregions (r’s ranging from 0.066 to 0.190, all p’s < 0.001) in linear regressions with the same set of covariates. None of these correlations showed a significant sex difference in slope. These results are shown in Supplementary Table S2.

In multiple regressions for boys and girls separately, overall boys relative to girls appeared to show more significant GMV reductions in link with higher ADHD scores, broadly in the frontal (particularly orbitofrontal) and parietal cortical regions as well as subcortical structures, such as the thalamus. In contrast, girls relative to boys appeared to demonstrate more significant GMV reductions in association with higher ADHD scores in the bilateral inferior/middle temporal cortex, frontopolar cortex, medial occipital cortex, and the precuneus. No voxels showed GMVs in positive correlations with ADHD scores in girls or boys alone. These findings are shown in Fig. 1, where we highlighted the voxels that appeared to be specific to girls by masking the findings with boys’ clusters, and vice versa.

As some morphometric studies of ADHD also showed imaging findings without specifically accounting for TIV. Thus, we performed additional analyses without including TIV as a covariate and showed the results evaluated at p < 0.05 FWE corrected in Supplementary Fig. S3. Overall, the findings are similar though more prominent. For instance, in girls and boys combined, the ADHD trait was associated with higher GMV of the hypothalamus and lower GMVs of the OFC and BG. The unthresholded map is shown in Supplementary Fig. S5c.

Sex-specific GMV correlates of ADHD scores

We extracted the GMV estimates of the clusters identified from whole-brain analyses and performed slope tests to examine whether there were sex differences in the linear regressions of the GMVs against ADHD scores with age, race, twin status, study site, scanner model, and TIV as covariates. Slope tests did not reveal significant sex differences in the slopes of the regressions of hypothalamus GMVs vs. ADHD scores (t = 0.250, p = 0.803) (Fig. 2A). We combined the clusters identified each for girls and boys as “girls’ or boys’ joint ROIs”. We did the same for those “sex-specific” voxels as “girls’ or boys’ specific joint ROIs”. The results showed that the association between GMVs and ADHD scores was more negative in boys (r = − 0.112, p < 0.001) than in girls (r = − 0.072, p < 0.001) for boys’ joint ROIs, with a significant slope difference (t = 2.788, p = 0.005; corrected p = 0.05/8 = 0.00625; Fig. 2B). Likewise, the association between GMVs and ADHD scores was more negative in boys (r = − 0.112, p < 0.001) than in girls (r = − 0.068, p < 0.001) for boys’ specific joint ROIs, with a significant slope difference (t = 2.99, p = 0.003; corrected p = 0.05/8 = 0.00625; Fig. 2D). None of the girls’ joint ROIs showed a correlation with ADHD scores significantly different between girls and boys in slope tests (both p’s ≥ 0.248, Fig. 2C, E).

Figure 2
figure 2

Linear correlations between ADHD scores and GMVs for (A) hypothalamus, (B) girls’ joint ROIs, (C) boys’ joint ROIs, (D) girls’ specific ROIs, (E) boys’ specific ROIs, (F) bilateral caudate, (G) bilateral putamen, and (H) bilateral pallidum, plotted separately for girls (in red) and boys (in blue), with age, race, twin status, study site, scanner model, and TIV as covariates. Linear correlation for girls and boys combined is also shown for hypothalamus (in green). Only boys’ joint and specific joint ROIs showed GMVs in negative correlation with ADHD score more significantly in boys than in girls (p’s < 0.05/8 = 0.00625). Note that the residual scores after controlling for covariates are shown here. Dashed lines represent 95% confidence intervals of the mean regressions (solid lines). a.u. = arbitrary unit of GMV estimates; Bilat. = Bilateral. ***p < 0.001, **p < 0.005, and *p < 0.05. See Supplementary Table S3 for statistics.

We also examined sex differences specifically for the BG subregions, using the AAL masks, and the results showed girls and boys did not differ in the correlations of GMVs with ADHD scores for bilateral caudate, putamen, or pallidum (all p’s ≥ 0.159, slope tests, Fig. 2F, G and H).

Supplementary Table S3summarizes the statistics of individual regressions in girls and boys separately and slope tests on sex differences. Linear mixed-effects models showed that the interaction term was significant for boys’ specific joint ROIs (p = 0.034), consistent with the slope tests, but not any other ROIs (all p’s ≥ 0.057). Note that none of the results on sex differences would be significant with correction for multiple comparisons. The results are summarized in Supplementary Table S4.

The effects of medications

A total of 941 (8.2%) children received psychostimulant medications, who relative to unmedicated children showed more significant ADHD symptoms [ADHD score (mean ± SD): 60.70 ± 7.96 versus 52.51 ± 4.78; F = 2,164.32; p < 0.001, covariance analysis with age, sex, race, study site, and twin status as covariates]. The GMVs (mean ± SD) of the caudate were 0.398 ± 0.046 and 0.393 ± 0.049, of the putamen were 0.482 ± 0.049 and 0.481 ± 0.050, and of the pallidum were 0.531 ± 0.060 and 0.524 ± 0.058 for unmedicated and stimulant-treated children, respectively. Covariance analysis with age, sex, race, study site, twin status, scanner model, and TIV as covariates showed a significant effect of psychostimulant for the GMVs of bilateral caudate (F = 41.63, p < 0.001), putamen (F = 51.32, p < 0.001), and pallidum (F = 72.94, p < 0.001). Thus, psychostimulant-treated children showed higher ADHD scores and relatively smaller BG GMVs as compared to medication-naïve children.

In an additional analysis, we excluded all medicated children and performed the same set of regression analyses on ADHD scores (n = 10,450; 5,174 girls). The results showed largely similar volumetric correlates of ADHD scores, though diminished in effect sizes, likely because of the exclusion of the children with more significant ADHD traits (Supplementary Fig. S4 and Table S5, p < 0.05 FWE corrected). The unthresholded map is shown in Supplementary Fig. S5d.

Heritability of ADHD score and volumetric correlates

We computed the Pearson’s correlations between MZ pairs, between DZ pairs and between UR pairs in ADHD scores and volumetric correlates for all and for girls and boys separately. The results are shown in Fig. 3. Overall, for both ADHD scores and the volumetric correlates, the correlations were significantly stronger in MZ pairs than in DZ and UR pairs as well as in DZ pairs than in UR pairs. Supplementary Tables S6 and S7 summarize the statistics of individual sets of correlation and of slope tests for ADHD scores and GMV correlates, respectively.

Figure 3
figure 3

Pearson’s correlations between monozygotic (MZ) pairs, between dizygotic/siblings (DZ) pairs and between unrelated (UR) pairs in ADHD score for (A) all, (B) girls, and (C) boys; and in GMV correlates in (D) positive and (E) negative correlations with ADHD scores for all, as well as (F) “girl-specific” and (G) “boy-specific” GMV correlates for all. S1 and S2: subject 1 and 2 of the pair. The r and p values of the correlations and the results of slope tests are shown in Supplementary Tables S6 and S7, respectively.

Table 1 shows the heritability estimates and shared and non-shared environmental effects of ADHD scores and 2-back accuracy rates with age, sex (only for “all”), race, and study site as covariates as well as volumetric correlates with scanner model and TIV as additional covariates in girls and boys combined and separately. The ACE models showed good model fits for the great majority of measures (Supplementary Table S8). For girls and boys combined and separately, the ADHD scores (a2 = 0.54–0.59) were moderately heritable, GMV correlates (a2 = 0.60–0.79) were highly heritable, and 2-back accuracy rates were near moderately heritable (a2 = 0.30–0.39). The shared-environmental effects on the measures were negligible or weak (c2 = 0.00–0.19). Two-sample t tests showed no significant sex differences in the heritability estimates of ADHD scores, volumetric correlates, or 2-back accuracy rates (all p’s ≥ 0.764).

Table 1 Genetic, shared environmental, and non-shared environmental effects of ADHD scores, GMV correlates, and 2-back%.

The model fits and ACE estimation for the GMVs of BG subregions are shown in Supplementary Table S9a and b, respectively. The ACE models showed good model fits for the great majority of these volumetric measures. The heritability of caudate, putamen and pallidum volumes were in the range of 0.8–0.9, slightly higher than those reported in an earlier study of family-based cohorts88. Two-sample t tests showed that there were no significant differences in the heritability of caudate, putamen, or pallidum GMVs between girls and boys (all p’s ≥ 0.936).

Discussion

We observed negative correlations between the ADHD scores and GMVs in a wide array of brain regions in both boys and girls, in line with previous findings17,89,90,91. More significant and broadly distributed in boys than in girls, the volumetric reductions clearly involved bilateral caudate and putamen. As part of the cognitive control network92,93,94, the BG play important roles in executive functions such as attention, memory, response inhibition, and set-shifting, known to be impaired in ADHD95,96. Indeed, we showed that ADHD scores and bilateral caudate GMVs were negatively and postively, respectively, correlated with 2-back accuracy in both girls and boys. Both ADHD traits and the volumetric markers are heritable. The heritability of ADHD scores, volumetric markers, and working memory were similar between the sexes. Together, the current findings confirm BG volumetric reduction in children with ADHD traits, overall more significant GMV reductions in boys than in girls, and the roles of caudate GMV reduction in working memory dysfunction. We have also observed a cluster in the hypothalamus that showed GMV in positive correlation with ADHD score across all subjects but not in boys or girls alone. With future releases of the ABCD data, it would be of tremendous interest and translational significance to investigate how these structural brain markers predict the development of full-blown ADHD and comorbidities in these children. We discussed the main findings below.

Volumetric correlates of ADHD traits

We observed a cluster in the hypothalamus that showed GMV in positive correlation with ADHD scores across all subjects but not in boys or in girls alone examined at the same threshold. The hypothalamus is known for its role in motivated behavior and stress response. Studies have reported altered activity of the hypothalamus-pituitary axis, as evidenced in the abnormal diurnal rhythm of cortisol secretion and cortisol response to stress, in children with ADHD97,98,99. However, it remains unclear whether the neuroendocrine abnormalities reflect experienced stress or the pathophysiological process of ADHD per se97,100. A recent imaging study implicated hypothalamic dysfunction during post-error slowing in the stop signal task, a deficit correlated with the inattention score, in children with ADHD101. In broad consistency, earlier volumetric studies have associated smaller hypothalamus GMV with behavioral inhibition traits in children65 and young adults102. The functional implication of enlarged hypothalamus in children with higher ADHD traits, if replicated, remains to be investigated.

Most notable among the cortical regions showing significant GMV reductions in link with ADHD scores in boys are bilateral, including both medial and lateral, OFC. The OFC and orbito-frontal-limbic circuit have been implicated in emotional impulsivity27, and volumetric deficits of the OFC appear consistent with affective dysfunction in some patients with ADHD89. On the other hand, few extant VBM studies reported OFC GMV reduction in ADHD27,52,103. Children in the ABCD cohort were of 9 to 10 years of age, younger than the subject population reported in most of the earlier studies, which have suggested age-related changes in the volumetric correlates of ADHD19,104. For instance, machine learning demonstrated that although structural MRI data can significantly separate ADHD from control participants for both children and adults, prediction performance and effect sizes were better for the child than the adult samples105. It thus remains to be seen whether OFC volumetric deficits may perhaps normalize with age but would continue to manifest in individuals with higher emotional impulsivity.

In girls and boys combined as well as in boys alone, the occipital cortex showed diminished GMV in relation to ADHD traits, consistent with earlier reports of occipital cortical GMV reduction in children with ADHD with a mean age of around 10 years106,107 and in adolescents with ADHD24. Other studies showed that the baseline GMV of the occipital cortex predicted inattention symptoms in a 2-year follow-up and was associated with the genetic risk for ADHD108. The GMV of the occipital cortex may predict stimulant treatment response in ADHD109. Another study of college students associated greater impulsivity scores to higher volumes in the occipital cortex specifically in the region that represents the peripheral visual field110. Together, prior literature provides evidence implicating occipital cortical GMV in ADHD traits, in accord with the visual cortical dysfunction in link with attention deficits in ADHD111.

Sex differences in cerebral volumetric reductions in ADHD

The volumetric changes associated with ADHD scores are more remarkable and broadly distributed in boys, consistent with a higher incidence and greater severity of ADHD in boys than in girls23,32,33,112. In contrast, although girls relative to boys appeared to show more significant deficits in the inferior/middle temporal cortex and the precuneus, the differences did not show significantly in slope tests. In boys relative to girls, more widespread deficits were found in the OFC, frontoparietal regions, including the ACC, and subcortical structures such as the thalamus and BG, which are the core structures of the cognitive control network113,114. These sex differences are confirmed by slope tests but not by the interaction effects with correction for multiple comparisons, and the slope tests restricted to the BG nuclei did not demonstrate significant sex differences115,116,117. These findings are broadly consistent with those reported from the ENIGMA studies of ADHD118 and those implicating a link between caudate nucleus and deficits of executive function in ADHD27,111. On the other hand, although the findings together are commensurate with the proposition of frontal-striatal network dysfunction in ADHD, evidence also suggests the roles of other cortical structures, including the occipital cortical regions, in the etiological processes of attention deficits in ADHD111. Our findings enrich the theory by identifying the structural deficits of these regions in ADHD and provide a potential explanation of more significant ADHD symptoms in boys. Note that boys and girls did not demonstrate significant differences in the correlations of ADHD scores or volumetric correlates with 2-back accuracy, or in the heritability of the volumetric markers, ADHD score, or 2-back accuracy. We speculate that these volumetric correlates may more broadly represent neural markers of sex differences in clinical manifestations, including greater impulsivity in boys than in girls with ADHD119,120,121, which should be explored further. Studies investigating the heritability, particularly those involving the computation of polygenic risk scores, of both neural and behavioral markers would help addressing the inter-relationships between gene, brain, and behavior in ADHD122.

Medications and volumetric reductions

As expected, children medicated with stimulants showed higher ADHD scores than those who were medication naïve. We demonstrated in medication-naïve children similar albeit slightly weaker volumetric reductions in association with ADHD traits. In particular, the volumetric reductions in the putamen and pallidum were no longer significant, suggesting that the lentiform nucleus may play a unique role in the pathogenesis of more significant forms of ADHD. Indeed, studies focusing on subcortical structures have typically reported putamen volumetric deficits in ADHD27,123. Further, lesions of the putamen have been associated with the development of secondary ADHD124,125. In contrast, the GMV reduction of the precuneus remains strongly associated with ADHD scores in medication-naïve girls, suggesting potentially distinct volumetric markers of ADHD traits in the female sex.

Heritability of ADHD traits, volumetric markers, and working memory

ADHD traits in children were near highly heritable (a2 = 0.59, all subjects), suggesting that variance in the ADHD symptoms can largely be accounted for by genetic factors126,127,128. In accord with previous findings42, the volumetric correlates were also highly heritable, with a2 = 0.60 for the hypothalamus and a2 ~ 0.70 for the “negative” correlates. On the other hand, studies have also supported the roles of environmental factors in the development of ADHD129,130. For instance, ADHD has been associated with low socio-economic status131. In particular, with e2 at 0.41 across all subjects, 0.46 in girls, and 0.44 in boys, unique environmental factors modestly explained the variance in ADHD symptoms, in accord with previous reports132. An earlier longitudinal study showed that as the ADHD symptoms evolved from childhood to adolescence, the unique environmental influences may result from a socialization process dominated by interpersonal relationships with parents, teachers, and peers36.

The GMVs of the BG subregions (caudate, putamen and globus pallidum) were strongly influenced by genetics (a2 = 0.79–0.90; all subjects), largely in line with earlier reports that genetic variation accounted for 43–85% of the variance in the BG subregional GMVs in children133,134,135. Other studies with ~ 200 twin pairs showed comparable heritability for BG GMVs in young and old adults (a2 = 0.65–0.88)136,137,138. Thus, the heritability of BG GMVs appears to be stable across age. Two-back accuracy rate was moderately heritable (a2 = 0.30; all subjects), though not to the extent observed in young adults (a2 = 0.73; 60 twin pairs)139. The trend of increased heritability for working memory across age has also been observed for other cognitive functions85,140.

We did not observe sex differences in the heritability of ADHD traits, volumetric markers, or working memory, in accord with an earlier work141. Another study likewise found no evidence for sex differences in the genetic influences on externalizing disorders, including ADHD, conduct disorder, and oppositional defiant disorder in adolescents142. Given that ADHD appears to be more prevalent in the male sex, the sex difference may manifest in other behavioral or structural or functional neural markers of ADHD. Identifying the sex-related markers would help in investigating sex-specific genetic polymorphisms of ADHD141,143.

Limitations of the study and conclusions

A number of limitations need to be considered. First, ADHD is known to be comorbid with many behavioral conditions, including depressive144 and conduct disorders145. We did not control for these and many other clinical variables in data analyses; thus, the impact of comorbidities on the current findings remains to be clarified, as demonstrated in the ENIGMA studies of ADHD146,147,148. Second, the correlations between ADHD scores and GMVs are relatively weak, which may reflect the non-clinical populations of the ABCD cohort. On the other hand, it is worth noting that the effect sizes were within the range reported of volumetric markers of ADHD cases19. Third, pubertal maturation may contribute to sex differences in brain development149. The ABCD study includes pubertal survey and provides an opportunity for future work to investigate the effects pubertal age on brain development150. Fourth, we acknowledge the limitations of twin design analyses, including the difficulty in detecting shared environmental effects and inflated estimates of heritability84,151. Fifth, these are cross-sectional findings; with release of more and more follow-up data, investigators would be able to examine how the volumetric deficits evolve along with clinical manifestations of ADHD into late adolescence. Finally, whereas the current work focuses on quantifying GMVs, previous studies have investigated other morphometric measures, including cortical thickness and surface area, to better understand cerebral structural alterations in ADHD32,33,54.

To conclude, we confirmed cerebral, including basal ganglia, volumetric deficits of ADHD traits in children and potential sex differences in these structural alterations in relation to ADHD traits. ADHD traits and their volumetric markers are highly heritable with no evidence of sex differences in heritability. These findings would inform future research of the neural markers of ADHD and of the clinical trajectories and efficacy of treatment as the ABCD children are followed through late adolescence.