Introduction

Attention deficit hyperactivity disorder (ADHD) is a neurobehavioral disorder characterized by inattentiveness, hyperactivity, and impulsivity1. Children with ADHD have difficulties in learning, family relationships, and social interaction2. ADHD affects 5–7% of children and adolescents worldwide3,4. Furthermore, at least 5% of children do not meet the full diagnostic criteria despite exhibiting ADHD symptoms5. ADHD in children often persists into late adolescence and adulthood, which is a risk factor of other mental health issues, including antisocial behaviors, self-harm, and substance misuse5.

Although the etiology of ADHD is largely unknown and complex, it is characterized by numerous gene-environmental interactions6. Preconceptional, gestational, and perinatal conditions have been indicated to affect ADHD6. Maternal nutrition during pregnancy, pesticides and heavy metal exposure during gestation, premature birth have been reported as risk factors of ADHD in children6. Prenatal exposure to air pollution is also a potential risk factor for ADHD7,8. Previous studies have reported an association between children’s behavioral development and prenatal exposure to sulfur dioxide (SO2)9,10, one of the major gaseous pollutants derived from coal-fired power plants, smelters, and industrial emissions11. Although the relation between SO2 exposure and ADHD has not been well investigated, SO2 has been associated with various neurodevelopmental deficits, such as fine motor skills, executive function, and ADHD-related hospital admissions12,13.

Epigenetic processes such as DNA methylation (DNAm) have been proposed to underly the association between environmental exposures and ADHD14. A previous study has demonstrated that methylation differences of the growth factor-independent 1 transcriptional repressor (GFI1) region partially mediated the association between maternal smoking during pregnancy and ADHD symptoms at age 615. Another study found that long-term prenatal exposure to paracetamol (acetaminophen) is associated with DNAm differences in children diagnosed with ADHD16. Recent evidence suggests that air pollution may affect methylation through an oxidative stress pathway17,18. Air pollution-induced reactive oxygen species oxidize 5-hydroxymethylcytosine causing DNA demethylation19. It also leads to the hypomethylation of CpG cytosine residues by inhibiting the activity of methyltransferases via alteration in their sequence alignment to the corresponding base sequences of the DNA20. To date, no study has investigated the mediation effect of DNAm on the association between SO2 and ADHD.

In this study, we aimed to (1) examine the association between prenatal exposure to SO2 and ADHD symptoms at multiple ages during childhood and (2) investigate the effect of DNAm on the association between SO2 exposure and ADHD symptoms. We first analyzed the association between prenatal SO2 and ADHD symptoms at ages 4, 6 and 8, which are critical periods of symptom manifestation. Then, we identified CpG sites (CpGs) associated with both prenatal SO2 exposure and ADHD symptoms during childhood. We utilized both candidate gene analysis approach by targeting CpGs associated with ADHD symptoms, and an epigenome-wide association study. We also aimed to detect co-methylated CpGs from a module correlated with both prenatal SO2 exposure and ADHD symptoms using weighted gene co-metylation network analysis (WGCNA).

Results

General characteristics of study participants

There were significant differences in maternal smoking and SO2 exposure during the 1st trimester between the entire cohort and the sub-cohort (Table 1). Other covariates were not significantly different between the entire cohort and sub-cohort. The mean age of mothers at pregnancy was 31.3 years, most of the mothers were college graduates (70.5–71.7%), while 76.9–79.6% of mothers were exposed to environmental tobacco smoking (ETS) during pregnancy. Mean maternal IQ was 116–118. Mean maternal SO2 exposure during pregnancy was 0.0044–0.0060 ppm depending on the trimester. Among children, 51.0–53.5% were girls, 86.3–90.3% were singleton, 88.0–91.8% were born full-term. Mean postnatal SO2 exposure ranged from 0.052 to 0.058 ppm depending on the ages. Mean ADHD rating scale (ARS) was 5.96–6.96 at age 4, 5.87–6.04 at age 6, and 5.57–6.34 at age 8. In the sub-cohort, prenatal and postnatal SO2 exposure were not significantly different according to the trimester or ages (Fig. 1).

Table 1 General characteristics of study participants.
Figure 1
figure 1

The prenatal SO2 exposure according to the trimester (left) and postnatal SO2 exposure according to the ages of 2, 4 and 6 years (right).

Selection of candidate CpGs

After the systematic review, a total of 22 studies were selected17,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41 (Table S1). A total of 597 CpGs were pooled from these studies (Table S2), and final 375 CpGs were selected for analysis. The detailed result of the systematic review and selection of candidate CpGs is presented in Supplementary Materials and Fig. S1.

Association between prenatal exposure to SO2 and ADHD symptoms

In the entire cohort (n = 329), one interquartile range (IQR) increase in prenatal SO2 exposure during the 1st trimester of pregnancy was significantly associated with 8.38% (95% CI 3.19, 13.83) increase in ARS at age 4; however, such association was not observed at age 6 or 8 (Table 2). The associations between prenatal SO2 exposure during the entire period or 2nd trimester of pregnancy and childhood ADHD symptoms at ages 4, 6 and 8 were not significant. Prenatal SO2 exposure during the 3rd trimester of pregnancy was not associated with ADHD symptoms at ages 4 or 6, but was negatively associated with ADHD symptoms at age 8.

Table 2 Prenatal SO2 exposure and ADHD rating scale at 4, 6 and 8 years of age in the cohort (n = 329).

Association between prenatal exposure to air pollution and DNAm

DNAm at age 2 at the candidate CpGs was not associated with prenatal SO2 exposure at the 1st–3rd, 1st, or 2nd trimester of pregnancy (Tables S3S5). However, DNAm in children at age 2 at cg07583420 (INS-IGF2), cg20296524 (TARBP1), cg15705054 (PBXIP1), cg05075097 (INS-IGF2), and cg25163476 (INS-IGF2) were positively associated with an IQR increase in maternal SO2 exposure during the 3rd trimester of pregnancy (Table 3, Fig. 2A, Table S6). DNAm at cg05951817 (SLC6A4) was negatively associated with SO2 exposure during the 3rd trimester. DNAm at the 375 candidate CpGs in children at age 6 was not associated with prenatal SO2 exposure in any trimester (Tables S7S10). We also conducted the epigenome-wide analysis for 326,898 CpGs to investigate the association between prenatal SO2 exposure and DNAm at age 2. We found that a total of 6,733 CpG sites were associated with prenatal exposure during the 3rd trimester with FDR-corrected p-value < 0.05 (Table S11, Fig. 2B), and the 6 CpGs listed above were included in the 6733 CpGs. Conversely, neither candidate gene analysis nor EWAS yielded significant CpGs within FDR-corrected p-value < 0.05 at age 6 (Fig. 2C).

Table 3 Association between prenatal exposure to sulfur dioxide during the 3rd trimester (per IQR increase) and DNAm at ADHD associated CpGs at 2 years of age* (FDR corrected p-value < 0.05) (n = 51).
Figure 2
figure 2

The association between prenatal SO2 exposure, DNA methylation, and ADHD rating score. (A) The association between prenatal SO2 exopusre and DNA methyhlation. (B) Manhattan plot showing the association between prenatal SO2 exposure during the 3rd trimester and DNA methylation at age 2. The green dots indicate CpGs selected from literature review for a candidate gene analysis. The blue line is the threshold for FDR-corrected p-value < 0.05. The CpGs significantly associated with prenatal SO2 exposure from the candidate gene analysis were above the threshold line (cg07583420 (INS-IGF2), cg20296524 (TARBP1), cg15705054 (PBXIP1), cg05075097 (INS-IGF2), and cg25163476 (INS-IGF2)). (C) Manhattan plot showing the association between prenatal SO2 exposure during the 3rd trimester and DNA methylation at age 6. (D) The association between DNA methylation at the CpGs linked with prenatal SO2 exposure and the ADHD rating scale at ages 4, 6 and 8.

Association between postnatal SO2 exposure and DNAm

The associations between the postnatal SO2 exposures at age of 2 years and DNAm at age 2, or the association between the postnatal SO2 exposure at ages 2, 4 or 6 years and DNAm at age 6 were not significant (Tables S12S15). The associations between the cumulative postnatal SO2 exposures from age 2 to 4 or from 2 to 6 and DNAm at age 6 were not significant (Tables S16,S17).

Association between DNAm and ADHD symptoms

Among the 6 CpGs with significant associations at age 2 with prenatal SO2 exposure, DNAm level at cg07583420 (INS-IGF2) were associated with 37.25% (95% CI 8.12, 74.22), 37.00% (95% CI 4.23, 80.08), and 34.82% (95% CI 5.44, 72.40) increase in ARS at ages 4, 6 and 8, respectively (Fig. 2D, Table S18).

Among 6733 CpGs from EWAS between prenatal SO2 exposure during the 3rd trimester and DNAm, 58 CpGs, 219 CpGs, and 2063 CpGs were associated with ARS at ages 4, 6, and 8 years, respectively (Tables S19S21). Among these, DNAm at 58 CpGs was persistently associated with ARS at ages 4, 6 and 8 years (Fig. S2).

Mediation analysis

The indirect effect of prenatal exposure during the 3rd trimester on ARS in childhood through DNAm at cg07583420 (INS-IGF2) was positive and significant at ages 4 (p-value 0.028) and 6 (p-value 0.004) but not significant at age 8 (Table 4, Fig. 3). Direct effects, from prenatal exposure to childhood ARS, were marginally significant at ages 6 (p-value 0.060) and 8 (p-value < 0.001). Particularly at age 6, indirect effect, direct effect, and total effect (indirect effect + direct effect) were all positive and significant or marginally significant.

Table 4 Mediation analysis of the path from prenatal exposure to SO2 to ADHD rating scale (ARS) at 4, 6 and 8 years of age through DNAm at cg07583420 (INS-IGF2).
Figure 3
figure 3

Mediation analysis showing the association between prenatal exposure to SO2 and ADHD symptoms in children through DNA methylation at cg07583420 (INS-IGF2).

Identifying methylation quantitative trait loci (mQTL)

A total of 60 SNPs were found within 100 kb from either side from cg07583420 (INS-IGF2). Four SNPs showed significant association with the DNA methylation levels at cg07583420 (p-value < 0.05): rs139951739, rs6578246, rs17885785, and rs3213223. However, none was significant when corrected for multiple comparisons using FDR method (Table S22, Fig. S3).

Pathway enrichment analysis

Reactome pathways analysis showed that mostly NOTCH signalling was involved in genes annotated to the 58 CpGs associated with prenatal SO2 exposure during the 3rd trimester and ARS through 4, 6 and 8 years (Table S23, Figure S4).

WGCNA

A total of 375 CpGs were analyzed for co-methylation. Two modules were detected: grey (316 CpGs) and turquoise (59 CpGs) (Fig. 4A,B). The correlation between the grey module and prenatal exposure to SO2 during the 3rd trimester was not significant (Fig. 4C). CpGs from the grey modules were analyzed again for co-methylation, and the turquoise module was correlated with ARS at age 4 (Pearson’s coefficient correlation 0.38, p-value 0.003), and the grey module was correlated with ARS at age 8 (Pearson’s coefficient correlation 0.28, p-value 0.03) (Fig. 4D). CpGs from the final turquoise module were assessed, and the CpGs associated with SO2 exposure during the 3rd trimester of pregnancy including INS-IGF2, PBXIP1, and SLC6A4 (Table 3) were found from the turquoise module (Fig. 4E), suggesting that these CpGs are co-methylated in relation with ARS at age 4.

Figure 4
figure 4

Weighted gene co-methylation network analysis of DNA methylation at CpGs for candidate gene analysis. (A) Dendrogram produced by hierarchical chlustering of samples. Modules of co-methylated CpGs are indicated by different color (grey and turquoise) beneath the dendrogram. (B) Network heatmap plot. Dendrograms from hierarchical clustering are corresponded to the color-coded modules (grey and turquoise). In the heatmap, bright yellow color indicate high co-expression inter-connectedness. (C) Correlating modules with exposure variables (prenatal SO2 exposure by trimester). (D) Correlating modules with outcome variable (ADHD rating scales at age 4, 6 and 8). (E) Network between CpGs from the turquoise module.

Discussion

To the best of our knowledge, this is the first study to report the epigenetic effect on the positive association between prenatal SO2 exposure and ADHD symptoms at ages 4, 6 and 8 years in a mother–child cohort. For a candidate gene analysis, we selected 375 CpGs from a systematic literature review of EWAS of ADHD. In the sub-cohort with available DNAm measurements at age 2, an IQR increase in prenatal SO2 exposure during the 3rd trimester was associated with an increased level of DNAm at CpGs of INS-IGF2, TARBP1, PBXIP1, and SLC6A4. Among these, DNAm level at cg07583420 (INS-IGF2) was positively associated with ARS at ages 4, 6 and 8, persistently (Fig. 5). When significant CpGs (n = 6733) from EWAS were tested for the association with ARS, 58 CpGs were persistently associated with ARS at ages 4, 6 and 8 years, which were involved in NOTCH signalling pathway. DNAm at age 6 was not associated with prenatal SO2 exposure. We further investigated the association between postnatal exposure to SO2 and DNAm by several different approaches, which showed no significant result. This implies that prenatal SO2 exposure may affect DNAm more significantly than postnatal SO2 exposure.

Figure 5
figure 5

The overview of the study design and results (Candidate gene analysis).

Prenatal environmental exposure may affect ADHD development through DNAm, which may lead to long-term phenotype changes42. The fact that DNAm was associated with prenatal SO2 at age 2 but not at age 6 may be related to the trajectory of global DNAm over the developmental stages, which fluctuates at an earlier stage of development and later stabilized43. This implies that a sensitive window period of DNAm may exist in which DNAm at earlier childhood is more susceptible to environmental triggers than later childhood44. For example, DNAm at several CpGs at age 7 was mostly predicted by early childhood adversity before age 345. To investigate the period of clinical manifestation of ADHD with respect to DNAm, we examined ADHD symptoms at ages 4, 6 and 8. Surprisingly, DNAm was associated with ADHD symptoms during the wide range of early childhood from ages 4 to 8 consistently rather than focal stage of childhood, suggesting that the effect of DNAm can extend to a long period.

DNAm is also under genetic control if mQTL is identified adjacent to cg07583420 (INS-IGF2). In the present study, we could not identify statistically significant mQTL, possibly due to the small sample size. Further studies with a larger sample size are therefore warranted to elucidate the complex interplay between genetic, epigenetic, and environmental factors in the etiology of ADHD in children.

Although ADHD is associated with various compounds of air pollution, studies on the relationship between SO2 and neurodevelopmental outcomes, like ADHD, are limited. SO2 exposure prenatally up to 12 months was negatively associated with fine motor scores at 18 months of age12, and SO2 exposure was related to poor executive function in 6–12-year-old children in a cross-sectional study in China12. Another study reported an association between short-term exposure to SO2 and ADHD-related hospital admission in adolescents aged 10–19 years13.

The biological mechanism underlying the association between SO2 exposure and ADHD remains unknown. Therefore, only a few hypothetical mechanisms can be inferred from previous studies. SO2 can increase lipid peroxidation in the brain and generate reactive oxygen species46, as well as various inflammatory cytokines. These reactive oxygen species and inflammatory cytokines can relocate to the central nervous system via systemic circulation, thereby inducing neuroinflammation. SO2 has also been reported to induce neurotoxicity through protein oxidation, DNA protein cross-links, apoptosis, and damage of cell constituting the central nervous systems including cerebral cortex neurons, glial cell and nerve fibers47,48.

Another possible link is the association between SO2 and other regional pollutants. The region close to power plants or industrial facilities using fossil fuel shows higher SO2 level49. A cohort study with 5193 children including 116 patients with ADHD reported that the proximity to an industrial estate in the study area was associated with an increased risk of ADHD, and suggested prenatal exposure to organochlorine compound released from the industrial estate as a potential explanation for the finding50.

Notably, the result of WGCNA showed that the CpGs (cg07583420 (INS-IGF2), cg15705054 (PBXIP1), cg05075097 (INS-IGF2), and cg25163476 (INS-IGF2)) associated with prenatal SO2 exposure were co-methylated, corresponding to the module simultaneously correlated with prenatal SO2 during the 3rd trimester and ARS at age 4. IGF2 plays an integral role in brain development after birth51. IGF2 is associated with developmental abnormalities in the structure and/or function of the cerebellum and the hippocampus52,53, both of which are associated with ADHD. Higher IGF2 methylation can predict ADHD symptoms in youth with conduct disorders33. In mice, IGF2 enhancer deletion disrupted levels of striatal dopamine, which has been suggested to be involved in the pathophysiology of ADHD54.

Notch signalling pathway plays an essential role in embryogenesis and organogenesis55, particularly in regulation of neurogenesis56, and is also known to be involved in schizophrenia and bipolar disorder57. In a mice model, damages in neural stem cells affected cognitive impairment during Pb exposure, which was dependent on the Notch pathway58. Prenatal exposure to ketamine in rats, which increased the expression level of Notch1, inhibited the proliferation and differentiation of neural stem cells in hippocampus and impaired neurocognitive function including learning and memory in adulthood56.

Our study presents certain limitations. First, the small sample size limits the statistical power of the results, hence, further replication of these results in larger population is warranted. ADHD symptoms were assessed by a parent-rated questionnaire on a continuum, rather than analyzed dichotomously according to a formal diagnosis by a specialist. Therefore, the results of this study may differ in a clinical sample. SO2 gaseous exposure could have greater temporal variability, therefore, the results should be interpreted cautiously. In addition, DNAm measured in the peripheral blood may not reflect the methylation signature in the brain, as epigenetic markers are tissue specific. In the absence of gene expression data, direct conclusions about the transcriptional consequences of the DNAm changes could not be made. Despite these limitations, our study also has notable merits. We employed a prospective study design, with prenatal exposure assessment, methylation profiles at age 2 and 6, and longitudinal neurocognitive outcome measurements at ages 4, 6 and 8. Investigation of the effect of methylation provided insight into the mechanism underlying the association between air pollution and ADHD. The longitudinal study design enabled identification of susceptible window period of DNAm changes in association with prenatal environmental exposure, which was earlier childhood in this study. We adjusted for potential covariates related to both the exposure and outcome, including both maternal and childhood characteristics.

Conclusion

In this study, we found that prenatal exposure to SO2 contributed to differential methylation at a CpG site located within INS-IGF2 at age 2, which, in turn, was associated with ADHD symptoms at ages 4, 6 and 8 years. These findings indicate that an epigenetic mechanism involving methylation could underlie the relationship between the toxicity of SO2 and neurodevelopment in children.

Methods

Study population

Study participants were selected from the Environment and Development of Children (EDC) Study, a prospective cohort study that investigated the environmental effects on growth and neurodevelopment59. Mothers participating in the Congenital Anomaly Study cohort were contacted during 2012–2015 after birth, and a total of 726 mother–child pairs were recruited for EDC study, followed by regular follow-up at 2-year intervals (i.e. 2, 4, 6 and 8 years). We collected epidemiological information, anthropometric characteristics, and neurocognitive outcomes biennially. ADHD symptoms were evaluated at ages 4, 6, and 8. Among these children, 329 had information on ADHD symptoms all at ages 4, 6 and 8.

In a sub-cohort, we selected 60 children at age 2 and analyzed DNAm from whole blood at age 2 and 6 repeatedly. Similar to the main EDC cohort, we examined neurocognitive functions including symptoms of ADHD at ages 4, 6 and 8. Among 60 children, 54 at age 4, 60 at age 6, and 57 at age 8 were assessed for ADHD symptoms. Those assessed at these ages were included in the main analysis (n = 51).

Ethical statements

The methods of this study were approved by the Institutional Review Board of Seoul National University College of Medicine (IRB No. 1201-010-392) and was conducted according to the guidelines and regulations of the Declaration of Helsinki. Informed consent was obtained from mothers according to the Institutional Review Board of Seoul National University College of Medicine (IRB No. 1201-010-392).

Measurement of air pollution

We used the levels of air pollution including district-specific monthly SO2 as the main exposure. The levels of SO2 were collected from the publicly available data from Air Korea (http://www.airkorea.or.kr/eng) from the Korea Ministry of Environment, which monitors the hourly air pollution concentration levels at 257 stations nationwide. We assigned air pollution levels to the various pregnancy stages, such as the 1st, 2nd, 3rd trimester, as a proxy of exposure to pregnant women based on the residential address. The SO2 levels were measured from the nearest monitoring station from the residential address of each participant. The nearest monitoring stations was assigned according to the Euclidean distance between each residential address and the closest monitoring station using ArcGIS.

DNAm analysis

Bisulfite sequencing and microarray

The Illumina Infinium HumanMethylation EPIC BeadChip (850 K) was used for the samples from the 2-year-olds, and the Illumina Infinium HumanMethylation 450 K BeadChip was used for the samples of the 6-year-olds (Illumina, San Diego, CA, USA). Detailed experimental procedures are presented in the Supplementary Materials.

Quality control and probe filtering

Array CpG probes which had a detection p-value > 0.05 in more than 25% of samples were filtered out. Filtered data were normalized using the Beta Mixture Quantile (BMIQ) method60 and corrected for batch effect using ComBat package in R. CpGs with at least one “not-available” (NA) values for the normalization were excluded, leaving 865,688 CpGs from the EPIC BeadChip (850 K) at age 2, and 460,960 CpGs from the 450 K BeadChip at age 6. Among these CpGs, we selected the overlapping 430,101 CpGs. After excluding SNP-associated CpGs (SNP distance ≤ 1 or SNP minor allele frequency ≥ 0.05), CpGs corresponded to non-CpG loci or the X or Y chromosomes, and CpGs at cross-reactive probes, and multimodal CpGs, 326,898 CpGs were finally selected for analysis (Fig. S1). We identified multimodal CpGs via the Dip test61 using diptest R package62 (Dip test’s p-value < 0.05).

Systematic review of literature and selection of candidate CpGs

We selected candidate CpGs through a systematic literature review. The detailed procedure of systematic review is shown in the Supplementary Materials.

SNP Genotyping

We used Axiom®2.0 Reagent Kit (Affymetrix Axiom®2.0 Assay User Guide) according to manufacturer’s protocol; the detailed procedure is provided in the Supplementary Materials. We used Korean Chip (K-CHIP) available from the K-CHIP consortium to produce Genotype data. K-CHIP was designed by the Center for Genome Science, Korea National Institute of Health, Korea (4845-301, 3000-3031).

ADHD symptom screening

To evaluate ADHD symptoms in children at ages 4, 6, and 8, the Korean version of the ARS IV (K-ARS) was completed by the parents63,64. K-ARS is a standardized screening tool for ADHD symptoms in Korean children65 and has shown validity and reliability66. K-ARS, which has also been used in relation to environmental exposure67, is composed of 18 questions corresponding to the diagnostic criteria of ADHD according to the Diagnostic and Statistical Manual of Mental Disorders, 4th Edition. Nine questions evaluate inattention, and the other 9 are related to hyperactivity and impulsivity. Each item is rated from 0 to 3, with a total score ranging from 0 to 5468.

Covariates

Covariates were pooled from the literature review69,70,71. Covariates included maternal age at pregnancy, maternal educational level (middle school graduate, high school graduate, college graduate, or graduate school attendance), maternal smoking status during pregnancy (current smoker, ex-smoker, never-smoker), ETS during pregnancy (yes/no), child’s sex, gestational age (weeks), multiple births (singleton or twin/triplet), season of child’s birth, cell type fraction, and maternal IQ evaluated by the short version of the Korean Wechsler Adult Intelligence Scale72. Cell type distribution, defined as the fraction of CD8 + T cells, CD4 + T cells, natural killer (NK) cells, B cells, monocytes, and neutrophils, was estimated by using the adult leukocyte reference panel73 and Minfi R package74.

Statistical analysis

We investigated the association between trimester-specific prenatal exposure to SO2 (ppm) and ARS at ages 4, 6 and 8 in the cohort of 329 mother–child pairs by linear regression analysis by adjusting for maternal age at pregnancy, mother’s education levels, maternal smoking, maternal ETS, and season of child’s birth. Covariates were selected based on directed acyclic graph75 using the publicly available program (http://www.dagitty.net/) (Fig. S5).

In a sub-cohort (n = 51), we investigated the association between prenatal trimester-specific exposure to SO2 (ppm) and DNAm at the CpGs selected from the literature review (n = 375) at ages 2 and 6, using multivariable linear regression. We expressed the changes in DNAm per interquartile range (IQR) increase in SO2. The regression model for the association between prenatal SO2 and DNAm was adjusted for maternal age at pregnancy, mother’s educational level, maternal smoking, ETS during pregnancy, and season of the child’s birth (Fig. S6). To investigate the association between postnatal SO2 exposure and DNA methylation levels, we used a linear regression model for the average SO2 level during age 2 (12 months) and DNAm at age 2, and the average SO2 level during age 2, 4 and 6 each and DNAm at age 6, adjusted for mother’s age, postnatal ETS, postnatal mother’s smoking, and the season of birth. To evaluate the effects of cumulative SO2 exposure, we have analyzed the association between SO2 exposure at ages 2–4 years and 2–4–6 years and DNAm at age 6. SO2 exposure at ages 2–4 years was taken from the average of SO2 exposure at ages 2 and 4 years, and SO2 exposure at 2–4-6 years was taken from the average of SO2 exposures at age 2, 4 and 6 years. Benjamini–Hochberg method76 was used to correct the effects of multiple comparisons by using FDR. We also performed EWAS for 326,898 CpGs at ages 2 and 6 with prenatal SO2 exposure by trimester, adjusted by the same covariates.

Furthermore, we investigated the association between an increase in DNAm at prenatal SO2 exposure-associated CpGs at ages 2 and ARS at ages 4, 6, and 8, after adjusting for mother’s age at pregnancy, mother’s education level, maternal smoking, ETS during pregnancy, season of the child’s birth, maternal IQ, the child’s sex, preterm birth, and multiple births (Fig. S7).

WGCNA was used to detect co-methylated modules (clusters of CpGs) using WGCNA R package77,78. Using beta-value of methylation at the CpGs from candidate gene analysis, weighted co-methylation networks (module) were identified. For each module, hierarchical clustering was performed for all samples, and the dendrogram were grouped into modules. Then the correlation was tested between the modules and trimester-specific SO2 exposure during pregnancy, using Pearson correlation coefficient. We selected the module with a significant correlation with exposure variables, then pooled CpGs corresponding to the module of interest. Next, the methylation levels at these CpGs were used to detect co-methylation module, and the correlation between the modules and ADHD symptom scores at ages 4, 6 and 8 were then analyzed. The networks between CpGs were visualized using Cytoscape ver 3.8.2. (https://cytoscape.org/).

We investigated the mediation effect of DNAm at the significant CpGs on the association between prenatal SO2 and ARS in childhood using Mediation R package79. Indirect effect indicates the effect of prenatal exposure to SO2 affecting ARS through DNAm, whereas direct effect refers to the effect of prenatal SO2 exposure directly affecting ARS, not through DNAm. The total effect is defined as the sum of indirect and direct effects.

To identify mQTL, we searched for SNPs positioned within 100 kb window to either side of the significant CpGs, then analyzed the correlation between DNA methylation levels at the CpGs and SNP genotypes. The p-value for the correlation between genotype and DNAm levels was calculated using the Kruskal–Wallis test. Multiple comparisons were corrected via FDR.

For functional enrichment analysis, Gene Ontology (GO) terms and Reactome cell signaling pathways were identified using topGO R package80 and ReactomePA R package81.