Non-coding structural variation differentially impacts attention-deficit hyperactivity disorder (ADHD) gene networks in African American vs Caucasian children


Previous studies of attention-deficit hyperactivity disorder (ADHD) have suggested that structural variants (SVs) play an important role but these were mainly studied in subjects of European ancestry and focused on coding regions. In this study, we sought to address the role of SVs in non-European populations and outside of coding regions. To that end, we generated whole genome sequence (WGS) data on 875 individuals, including 205 ADHD cases and 670 non-ADHD controls. The ADHD cases included 116 African Americans (AA) and 89 of European Ancestry (EA) with SVs in comparison with 408 AA and 262 controls, respectively. Multiple SVs and target genes that associated with ADHD from previous studies were identified or replicated, and novel recurrent ADHD-associated SV loci were discovered. We identified clustering of non-coding SVs around neuroactive ligand-receptor interaction pathways, which are involved in neuronal brain function, and highly relevant to ADHD pathogenesis and regulation of gene expression related to specific ADHD phenotypes. There was little overlap (around 6%) in the genes impacted by SVs between AA and EA. These results suggest that SVs within non-coding regions may play an important role in ADHD development and that WGS could be a powerful discovery tool for studying the molecular mechanisms of ADHD


Attention-deficit hyperactivity disorder (ADHD) has a prevalence of ~ 6–8% in children with male patients outnumbering females by almost double1. Symptoms persist into adulthood in over two thirds of cases, causing significant life-long impairments2,3. In the last decade, multiple studies have attempted to investigate the genetic susceptibility of ADHD, most notably by assessing the enrichment of copy number variations (CNVs)4 and single nucleotide variants (SNVs) from genome-wide association studies (GWAS)5. However, the current understanding of this complex trait is incomplete and attempts to replicate previous studies have been inconsistent, due in part to the highly heterogenous phenotype of ADHD, as well as other factors, such as complicated molecular mechanisms underlying ADHD networks and limitations of genotyping arrays to study structural variations (SV)6. Previous studies also show that the susceptibility of ADHD is more likely to be impacted by biological pathways instead of a particular gene6,7,8, and by structural variations (SVs), such as copy number variations (CNVs), inversions, translocations that may play important roles in the regulation of ADHD gene networks9,10. Most of previously published studies have focused on coding regions and have been carried out primarily in patients of European ancestry while intronic and intergenic regions were often omitted from analyses. However, non-coding genomic structural variations and non-coding DNA sequences have been shown to play important roles in many human diseases, including neurodevelopmental diseases such as autism and intellectual disability11,12. In this regard, the most recent large GWAS studies on 55,374 individuals, including 20,183 ADHD patients, highlighted that variants in non-coding regions, such as non-coding RNA and intergenic region were significantly associated with ADHD susceptibility13. In addition, previous studies have largely focused on European Caucasian populations leaving out studies in African populations and other populations.

To address the limitations of the previous studies of ADHD, in this study we have generated deep whole genome sequencing (WGS) data on 875 individuals, including 205 ADHD patients and 670 non-ADHD controls, in order to explore the impact of SVs, especially SVs within non-coding regions, on the pathogenesis of ADHD. We have also included a significant number of African Americans in the study, including 116 cases and 408 controls, to expand the analysis into another population other than Europeans. The results suggest that SVs within non-coding regions play critical roles in the molecular mechanisms underlying ADHD and that population-specific SVs are present. This information would be useful for future studies of ADHD genetic network regulation and drug development.


Exonic/splicing SVs impact structure of genes related to neurodevelopment procedures

Approximately 160,000 structural variations (SVs) were identified in ADHD patients (Fig. 1b), of those, 0.96% were classified as exonic, 0.59% as splicing, 42.3% as intronic and 56.13% as intergenic. Exonic/splicing usually have more significant impacts since they alter the coding regions and splicing sites directly As expected, they accounted for a small proportion (~ 1.5%) of the total of which 37 were significantly with ADHD threshold 0.05 and 9 with threshold 0.01 (Table 1). In addition to the 37 exonic/splicing SVs that associated with ADHD we identified 451 rare ADHD-associated SVs in AA (Supplementary Table 2a) and 382 in EA (Supplementary Table 2b), 41 SVs are only existed in ADHD cases and were absent from controls (Table 2). A recurrent 320 bp long deletion was identified for three AA ADHD patients at chr5:171723712–171724032, at the splicing site of non-coding RNA LOC100288254.

Figure 1

Patient summary and distribution of structural variations (SVs) for ADHD vs control. (a) represents the number of ADHD patients and non-ADHD controls with race information; (b) distribution of structural variations (SVs) for 206 ADHD patients based on whole genome sequencing (WGS). Intergenic and intronic variations accounted for over 98% of the SVs.

Table 1 ADHD-associated Exonic/splicing SVs that passed the statistical threshold 0.01.
Table 2 Rare recurrent exonic/splicing SVs that were only found in ADHD patients.

SVs within non-coding regions reveal known and possibly novel ADHD-associated genes

Beside exonic/splicing SVs, we also evaluated association of non-coding SVs in ADHD. The novel intronic SVs are listed in Supplementary Tables 3, 4, 5 for AA, EA, and meta-analysis, respectively. The majority of selected ADHD-associated SV-genes were impacted by SVs within non-coding regions (Table 3), furthermore based on the ADHDgene database14, there are no known exonic/splicing SV-genes from previous studies passed the statistic threshold. However, a novel exonic deletion in IQSEC3 passed the ethnicity meta-analysis (p value = 0.0083, Supplementary Table 6). IQSEC3 is a neuronal exchange gene related to speech, i.e. childhood apraxia of speech, and down-regulated in autism and schizophrenia15.

Table 3 Selected SV-associated genes targets based on p value.

An example network pathway of SVs within non-coding regions is neuroactive ligand-receptor interaction, a pathway critical in neuronal brain function, known to be highly relevant to ADHD development and regulation of differential gene expression in different ADHD-related brain regions16,17. Non-coding SVs such as intronic deletion of HTR1F, intronic translocation of CHRNA3, intergenic translocation of GRIN2A, and intronic insertions of GRM5, were found significantly enriched in ADHD patients (Table 4).

Table 4 Non-coding SV-genes in neuroactive ligand-receptor interaction pathway.

Structural variations show differences in two ethnicities

No obvious differences in SV prevalence types between the AA and EA (Supplementary Fig. 1), however, impacted ADHD-associated SV-genes, which reach statistical significance, are different between two ethnicities (Fig. 2). There were 686 ADHD-associated SV-genes for AA based on statistical tests (Supplementary Table 3), and 439 ADHD-associated SV-genes for EA (Supplementary Table 4). Only 34 genes shared between two ethnicities (Supplementary Table 5), which counted 5%/8% for entire SV-gene set. Meta-analysis identified 234 ADHD-associated SV-genes (Supplementary Table 6), and only four ADHD-associated SV-genes were found in previous literatures (Table 5). Actually, genes in meta-analysis results are still impacted by ethnicities, for example MYBPC1 has intergenic SVs with p value 0.017 in meta-analysis, and the p value is 0.79 in AA and 0.0032 in EA, in other words, this meta-significant SV-gene passed through meta-analysis because highly ADHD-associated in EA and not significant at all in AA.

Figure 2

Venn diagram of overlap SV-genes between AA and EA, including all SVs, exonic SVs, intronic SVs, and intergenic SV, respectively. (a) SV-genes which significantly associated with ADHD patients (chi-square p value <  = 0.05); (b) SV-genes supported by previous ADHD studies and significantly associated with ADHD patients (chi-square p value <  = 0.05).

Table 5 Significant ADHD-associated non-coding SV-genes which have previously ADHD literature support based on meta-analysis.


Attention deficit hyperactivity disorder (ADHD) is the most common neurobiological disorder in children, with a prevalence of 6–8%6. In this study, we identified 37 exonic/splicing SVs, several involving genes that have been previously reported in neurological and mental diseases, such as VPS53, which has been previously associated with a neurological conditions and Parkinson disease18. Consequently, we identified 40 novel recurrent SV genes associated with ADHD, where the SVs occurred exclusively in ADHD or have frequency less than 0.5% in non-ADHD controls. Those novel recurrent SVs could be also important in ADHD development. For example, we identified a novel 320 bp deletion at splicing sites of non-coding RNA LOC100288254, which is a recurrent SV in three independent ADHD patients and only seen in ADHD patients. Additional notable recurrent rare SVs included an exonic insertion of a non-coding RNA, MIR137, which has been shown to play a significant role in neural development and neoplastic transformation19, splicing inversion in DRD4 which has previously been implicated in ADHD20, and an exonic translocation of BPTF, which causes expressive language delay and intellectual disability21. BPTF, which exonic translocation was identified in two independent individuals, was considered as a candidate gene in neurodevelopmental disorder based on exome pool-seq22, and believed to be the cause of syndromic developmental, speech delay, postnatal microcephaly, and dysmorphic features in recent study21. We also observed that the non-coding RNA LINC00461, which wasone of the 12 significant loci in the study by Demontis et al.13, had an intronic insertion in six ADHD patients with chi-square p value = 0.02.

In addition, this study reveals that SVs within non-coding regions may be more critical in ADHD biological networks than they used to believe. One typical example is the neuroactive ligand-receptor interaction pathway, a pathway critical in neuronal brain function, known to be highly relevant to ADHD pathogenesis. In this study, 17 SV genes were found significantly different between ADHD patients and controls, including four genes CHRNA3, GRM5, HTR1F, GRIN2A which were supported by previous literature4,23,24,25. All the identified structural variations related to this pathway are either intronic or intergenic. Similar situations were found in other neurodevelopmental pathways, such as MAPK signaling pathway and Axon guidance. Functional role of SVs in non-coding regions in ADHD therefore warrant further investigation. We also show that there is only a small portion of overlap between the two ethnicities of SV impacted genes, and the result was further replicated as we limited the SV-genes to known ADHD genes based on the ADHD gene database14. 25 ADHD-associated SV-genes have been previously studied and reported in the literature (Table 6), and only one gene, AGBL1, with intergenic SVs shows statistically significant difference in both ethnicities. AGBL1 was the top locus in the largest ADHD genome-wide meta-analysis done26 and mutation in this gene showed significant association with learning performance27. Taken together, the results suggest that impacted ADHD-associated genes differ between the two ethnicities, suggesting that ADHD analysis without population information could miss potential disease genes.

Table 6 Significant ADHD-associated non-coding SV-genes which have previously ADHD literature support in AA and EA, respectively.

While the majority of those ADHD-associated SVs are located in non-coding regions, the question is how these SVs impact ADHD pathways in the two ethnicity groups: are the pathways different or do the SVs impact same pathways but different gene modules? In order to explore the answers, we further mapped these SVs within non-coding regions into the highly studied ADHD pathways, including neuroactive ligand-receptor interaction pathway. For the ADHD SV-genes which are significantly different in ADHD and non-ADHD controls (p value ≤ 0.05) or have a trend towards significance (p value ≤ 0.1), 10 of them belong to neuroactive ligand-receptor interaction pathway and five genes are AA SV-genes and the left are EA specific SV-genes (Table 4). The results also suggest that SVs, especially SVs within non-coding regions, impact the same gene families but different gene members, such as intronic SVs of CHRNA3 in AA versus CHRNA4 in EA, and intronic SVs of HTR1F in AA versus HTR2C in EA. Based on the enrichment studies for those ADHD-associated pathways, it suggests that the SVs within non-coding regions impact the same ADHD-associated pathways for both ethnicities, but different genes in the same gene families.

In summary, we have conducted a genomic-level study in ADHD patients using whole genome sequencing that takes non-coding genes/regions and ethnicity factors into consideration. The results show that non-coding region SVs and non-coding genes may play a role in the development and progression of ADHD, and WGS may be a powerful tool to explore ADHD molecular mechanisms. Additionally, our study highlights that genomic-level population differences exist between Caucasian and African American patients, especially for non-coding SVs in neuronal genes and that these variants may influence response to specific medications. For the potential evolutional advantages of ADHD in human history28, the same ADHD-associated pathways though different genes were involved in the adaption to the environmental selection for survival in the two major human populations. On the other hand, we admit that the current study is limited by the sample size because of the significant expense of WGS. The SVs highlighted by our study warrant further study and confirmation.


Patient selection

The patients were randomly selected from the Philadelphia Neurodevelopmental Cohort (PNC), archived in the biobank of the Center for Applied Genomics (CAG) at the Children's Hospital of Philadelphia (CHOP), with full electronic medical record (EMR). Psychopathology of the cohort was assessed using a computerized, structured interview (GOASSESS)29. The diagnosis of ADHD was based on the Diagnostic and Statistical Manual of Mental Disorders-Fourth Edition (DSM-IV) originally, and later confirmed by DSM-V. There were 205 ADHD cases, including 116 African Americans (AA) and 89 European Americans (EA), and 670 controls, including 408 AA and 262 EA (Supplementary Table 1, Fig. 1a).

Structural variations (SVs) detections

The average coverage for WGS data is 30 ×. The structural variations (SVs), including insertions, deletions, duplications, inversions and translocations, were detected by MANTA Structural Variant Caller developed by Illumina30. For quality control, we only included SVs that passed MANTA’s default filters, which required the length of corresponding SVs to be at least 50 bp and rated “PASS” based on MANTA threshold. Passing SVs were stratified into different classes based on their sequence content. Using the hg19 GENCODE reference sequence, if the start and end points of a SV mapped within an exon, it was annotated as “exonic’; if the start and end points of a SV were located within an intronic region and the SV does not spanned an exon, it was annotated as “intronic”; if the SV was located across exon/intron border sites, it was annotated as “splicing”; and the remaining SVs were annotated as “intergenic”.

Exonic, intronic and splicing SVs were annotated with the impacted gene, and intergenic SVs were annotated with their closet gene based on genomic locus. The corresponding annotated gene, either the SVs impacted gene or the closet gene, was named as “SV-gene”. Association of SV-genes in ADHD were calculated for AA and EA independently using Chi-square tests and Fisher’s exact tests. Bonferroni correction was used for correction of multiple testing by the number of tested variations or genes. Significance was set at 0.05 after Bonferroni correction. We only included risk variants in downstream analyses, i.e. SVs that had odd ratios greater than 1. Meta-analysis (combined Chi-square test) was applied when combing two ethnicities together. Pathway analysis is based on DAVID bioinformatics platforms.

Ethic statement

We confirm that all methods were carried out in accordance with relevant guidelines and regulations and all experimental protocols were approved by the Children’s Hospital of Philadelphia (CHOP). Informed consent was obtained from all subjects or, if subjects are under 18, from a parent and/or legal guardian.


  1. 1.

    Polanczyk, G. V., Willcutt, E. G., Salum, G. A., Kieling, C. & Rohde, L. A. ADHD prevalence estimates across three decades: an updated systematic review and meta-regression analysis. Int. J. Epidemiol. 43, 434–442. (2014).

    Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Visser, S. N. et al. Trends in the parent-report of health care provider-diagnosed and medicated attention-deficit/hyperactivity disorder: United States, 2003–2011. J. Am. Acad. Child Adolesc. Psychiatry 53, 34–46. (2014).

    Article  PubMed  Google Scholar 

  3. 3.

    Barbaresi, W. J. et al. Mortality, ADHD, and psychosocial adversity in adults with childhood ADHD: a prospective study. Pediatrics 131, 637–644. (2013).

    Article  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Elia, J. et al. Genome-wide copy number variation study associates metabotropic glutamate receptor gene networks with attention deficit hyperactivity disorder. Nat. Genet. 44, 78–84. (2011).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  5. 5.

    Lantieri, F., Glessner, J. T., Hakonarson, H., Elia, J. & Devoto, M. Analysis of GWAS top hits in ADHD suggests association to two polymorphisms located in genes expressed in the cerebellum. Am. J. Med. Genet. B Neuropsychiatr. Genet. 153B, 1127–1133. (2010).

    Article  PubMed  CAS  Google Scholar 

  6. 6.

    Connolly, J. J., Glessner, J. T., Elia, J. & Hakonarson, H. ADHD & pharmacotherapy: past, present and future: a review of the changing landscape of drug therapy for attention deficit hyperactivity disorder. Ther. Innov. Regul. Sci. 49, 632–642. (2015).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  7. 7.

    Acosta, M. T., Arcos-Burgos, M. & Muenke, M. Attention deficit/hyperactivity disorder (ADHD): complex phenotype, simple genotype?. Genet. Med. 6, 1–15. (2004).

    Article  PubMed  Google Scholar 

  8. 8.

    Akutagava-Martins, G. C., Rohde, L. A. & Hutz, M. H. Genetics of attention-deficit/hyperactivity disorder: an update. Expert. Rev. Neurother. 16, 145–156. (2016).

    Article  PubMed  CAS  Google Scholar 

  9. 9.

    Glessner, J. T. et al. Copy number variation meta-analysis reveals a novel duplication at 9p24 associated with multiple neurodevelopmental disorders. Genome Med. 9, 106. (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  10. 10.

    Glessner, J. T., Connolly, J. J. & Hakonarson, H. Rare genomic deletions and duplications and their role in neurodevelopmental disorders. Curr. Top. Behav. Neurosci. 12, 345–360. (2012).

    Article  PubMed  Google Scholar 

  11. 11.

    Zhang, F. & Lupski, J. R. Non-coding genetic variants in human disease. Hum. Mol. Genet 24, R102-110. (2015).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. 12.

    Bulayeva, K. et al. Genomic structural variants are linked with intellectual disability. J. Neural. Transm (Vienna) 122, 1289–1301. (2015).

    Article  CAS  Google Scholar 

  13. 13.

    Demontis, D. et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat. Genet 51, 63–75. (2019).

    Article  PubMed  CAS  Google Scholar 

  14. 14.

    Zhang, L. et al. ADHDgene: a genetic database for attention deficit hyperactivity disorder. Nucleic Acids Res 40, D1003-1009. (2012).

    Article  PubMed  CAS  Google Scholar 

  15. 15.

    Thevenon, J. et al. 12p13.33 microdeletion including ELKS/ERC1, a new locus associated with childhood apraxia of speech. Eur. J. Hum. Genet 21, 82–88. (2013).

    Article  PubMed  CAS  Google Scholar 

  16. 16.

    Khadka, S. et al. Multivariate Imaging Genetics Study of MRI Gray Matter Volume and SNPs Reveals Biological Pathways Correlated with Brain Structural Differences in Attention Deficit Hyperactivity Disorder. Front Psychiatry 7, 128. (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  17. 17.

    Lima Lde, A. et al. An integrative approach to investigate the respective roles of single-nucleotide variants and copy-number variants in Attention-Deficit/Hyperactivity Disorder. Sci Rep 6, 22851. (2016).

    ADS  Article  PubMed  CAS  Google Scholar 

  18. 18.

    Kun-Rodrigues, C. et al. A systematic screening to identify de novo mutations causing sporadic early-onset Parkinson’s disease. Hum Mol Genet 24, 6711–6720. (2015).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  19. 19.

    Mahmoudi, E. & Cairns, M. J. MiR-137: an important player in neural development and neoplastic transformation. Mol. Psychiatry 22, 44–55. (2017).

    Article  PubMed  CAS  Google Scholar 

  20. 20.

    Tovo-Rodrigues, L. et al. DRD4 rare variants in Attention-Deficit/Hyperactivity Disorder (ADHD): further evidence from a birth cohort study. PLoS ONE 8, e85164. (2013).

    ADS  Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. 21.

    Stankiewicz, P. et al. Haploinsufficiency of the chromatin remodeler BPTF causes syndromic developmental and speech delay, postnatal microcephaly, and dysmorphic features. Am. J. Hum. Genet. 101, 503–515. (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  22. 22.

    Popp, B. et al. Exome Pool-Seq in neurodevelopmental disorders. Eur. J. Hum. Genet. 25, 1364–1376. (2017).

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  23. 23.

    Polina, E. R. et al. ADHD diagnosis may influence the association between polymorphisms in nicotinic acetylcholine receptor genes and tobacco smoking. Neuromol. Med. 16, 389–397. (2014).

    Article  CAS  Google Scholar 

  24. 24.

    Franke, B. et al. The genetics of attention deficit/hyperactivity disorder in adults, a review. Mol Psychiatry 17, 960–987. (2012).

    Article  PubMed  CAS  Google Scholar 

  25. 25.

    Kim, J. I. et al. The GRIN2B and GRIN2A Gene Variants Are Associated With Continuous Performance Test Variables in ADHD. J Atten Disord (2016).

    Article  PubMed  Google Scholar 

  26. 26.

    Neale, B. M. et al. Meta-analysis of genome-wide association studies of attention-deficit/hyperactivity disorder. J Am Acad Child Adolesc Psychiatry 49, 884–897. (2010).

    Article  PubMed  PubMed Central  Google Scholar 

  27. 27.

    Mehta, C. M., Gruen, J. R. & Zhang, H. A method for integrating neuroimaging into genetic models of learning performance. Genet Epidemiol 41, 4–17. (2017).

    Article  PubMed  Google Scholar 

  28. 28.

    Shelley-Tremblay, J. F. & Rosen, L. A. Attention deficit hyperactivity disorder: an evolutionary perspective. J. Genet. Psychol. 157, 443–453 (1996).

    Article  CAS  Google Scholar 

  29. 29.

    Calkins, M. E. et al. The Philadelphia Neurodevelopmental Cohort: constructing a deep phenotyping collaborative. J. Child. Psychol. Psychiatry 56, 1356–1369. (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  30. 30.

    Chen, X. et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32, 1220–1222. (2016).

    Article  PubMed  CAS  Google Scholar 

Download references


We thank the Center for Applied Genomics (CAG) staff and supports from Children’s Hospital of Philadelphia (CHOP).

Author information




Y.L., X.C., P.M.A.S. and H.H. designed the study. L.T., D.L. and H.Q., P.M.A.S. and H.H. acquired the data, L.T., D.L. and H.Q., X.C. and Y.L. analyzed the data. Y.L., X.C., H.Q. and J.G. wrote the article and prepared figures and tables. All authors reviewed the manuscript and approved the final version to be published.

Corresponding authors

Correspondence to Patrick M. A. Sleiman or Hakon Hakonarson.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Liu, Y., Chang, X., Qu, H. et al. Non-coding structural variation differentially impacts attention-deficit hyperactivity disorder (ADHD) gene networks in African American vs Caucasian children. Sci Rep 10, 15252 (2020).

Download citation


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing