Folate is vital for fetal development. Periconceptional folic acid supplementation and food fortification are recommended to prevent neural tube defects. Mechanisms whereby periconceptional folate influences normal development and disease are poorly understood: epigenetics may be involved. We examine the association between maternal plasma folate during pregnancy and epigenome-wide DNA methylation using Illumina’s HumanMethyl450 Beadchip in 1,988 newborns from two European cohorts. Here we report the combined covariate-adjusted results using meta-analysis and employ pathway and gene expression analyses. Four-hundred forty-three CpGs (320 genes) are significantly associated with maternal plasma folate levels during pregnancy (false discovery rate 5%); 48 are significant after Bonferroni correction. Most genes are not known for folate biology, including APC2, GRM8, SLC16A12, OPCML, PRPH, LHX1, KLK4 and PRSS21. Some relate to birth defects other than neural tube defects, neurological functions or varied aspects of embryonic development. These findings may inform how maternal folate impacts the developing epigenome and health outcomes in offspring.
Folate (vitamin B9) is vital for fetal development. Folic acid supplementation at 0.4 mg per day or higher is recommended worldwide before and in the very early stages of pregnancy to reduce the incidence of neural tube defects (NTDs). Over 50 countries have introduced programs to fortify the food supply with folic acid to increase folate levels in women of childbearing age1. Rates of NTDs have clearly decreased following fortification2 and there is increasing interest in the possibility that higher maternal folate prevents additional birth defects including oral clefts, cardiac defects and others3. A large international trial has been launched of supplementation with 4 mg versus the standard 0.4 mg to attempt to address these questions3.
Other beneficial effects of higher maternal folate levels have been reported in humans. These include reduced risk of low birth weight, pre-term delivery, language delay, leukaemia, childhood brain tumours and autism, although the evidence is inconsistent4,5. In the United States, food fortification has led to an increase in folate intake twice as large as anticipated6, and therefore concern has been raised about possible adverse effects, such as cancer in adults, as a result of this population-wide intervention1. Furthermore, higher folic acid intake during pregnancy has been associated with an increased risk of childhood retinoblastoma and early respiratory illness4.
The mechanisms whereby folic acid prevents NTDs and potentially other birth defects and later health outcomes are poorly understood7 but could involve epigenetic changes. Folate is a critical component in the one-carbon metabolism pathway providing methyl groups for a range of biochemical reactions, including methylation of DNA8. DNA methylation is an important epigenetic determinant of gene expression, and differential methylation has been associated with multiple diseases9. Periconceptional maternal folate levels may alter methylation patterns established in utero that are vital for fetal development, which could impact later health outcomes in the offspring.
In mouse models, in utero dietary methyl donor supplementation has been associated with altered methylation patterns and disease phenotypes4. The brains of human fetuses with NTDs had lower global methylation compared with controls, which was positively correlated with maternal folate levels10. With respect to gene-specific differential methylation, perinatal folate has also been associated with differential methylation in specific imprinted genes, such as IGF2 and H19, in offspring, but reported results are inconsistent11. The only published study using a platform with reasonable genome-wide coverage, the Illlumina HumanMethyl450 Beadchip (450 K), investigated 23 subjects and reported that folic acid supplementation during pregnancy was related to differential methylation upstream of the gene ZFP57, which plays a central role in the regulation and maintenance of imprinting12.
Some countries, such as Norway and the Netherlands, do not fortify the food supply with folic acid. These populations may be particularly useful for examining the biological implications of periconceptional folic acid supplementation on offspring health, as greater variability in the dose and the source of folate may exist compared with fortified populations.
To better understand the biological implications of folate status on the developing fetus, we examine the association between maternal plasma folate during pregnancy and epigenome-wide differential DNA methylation in newborn cord blood using the Illumina HumanMethyl450 (450 K) Beadchip. We include 1,988 newborns from two European pregnancy cohorts of Caucasian ancestry, the Norwegian Mother and Child Cohort Study (MoBa), and the Generation R Study (Generation R). We combine results using meta-analysis. Secondary pathway analyses and gene expression analyses are also explored.
In MoBa participants (N=1,275), maternal plasma folate levels ranged from 1.6 to 53.2 nmol l−1 (mean=11.9). The maternal plasma folate levels in Generation R (N=713) ranged from 4.1 to 45.3 nmol l−1 (mean=20.3; Table 1). The mean maternal age was ∼30 years for both cohorts. Approximately, 15% of MoBa mothers and 25% of Generation R mothers smoked during the pregnancy and over 60% obtained college or more advanced levels of education in both studies (Table 1).
Our meta-analysis of the association between maternal plasma folate levels during pregnancy and differential DNA methylation in newborn cord blood, adjusted for covariates, resulted in 443 false discovery rate (FDR)-significant CpGs (Benjamini and Hochberg FDR-corrected P (PBH)<0.05; Fig. 1). Genes with two or more FDR-significant CpGs, where at least one CpG was within the gene, were prioritized for further discussion (Table 2). Results for all FDR-significant CpGs are shown in Supplementary Data 1 (sorted by the uncorrected P value) and Supplementary Data 2 (sorted by chromosome and position). The vast majority of the FDR-significant CpGs were robust to covariate adjustment as well as adjustment for cell type; coefficients from the unadjusted, covariate-adjusted, and covariate- and cell-type-adjusted models were in the same direction and had a similar magnitude of effect (Supplementary Data 1 and 2). More detailed gene information is provided in Supplementary Table 1. The genomic inflation factor (lambda)13 values for the unadjusted, covariate-adjusted, and covariate- and cell-type-adjusted models were 0.96, 1.07 and 1.16, respectively (Supplementary Figs 1–3). Among the 443 FDR-significant CpGs in the covariate-adjusted meta-analysis model, increasing levels of maternal plasma folate during pregnancy were associated with decreased methylation of 416 (94%) and increased methylation of 27 (6%) CpGs. There were 48 CpGs that also met the strict Bonferroni threshold for statistical significance (P<1.19 × 10−7, correcting for 419,905 tests). The direction of effects for the statistically significant CpGs was largely consistent in the MoBa and Generation R populations (Table 2; Supplementary Data 1 and 2).
We considered whether vitamin B12, a co-factor with folate in one-carbon metabolism, contained in most multivitamins, along with other B vitamins such as B6 and riboflavin, might confound associations between folate and methylation. Vitamin B12 and folate levels were modestly positively correlated (Spearman correlation 0.11 in MoBa, 0.14 in Generation R, P<0.001 for both). When we adjusted for vitamin B12, the coefficients for folate in relation to methylation changed only minimally (median change 4.9%, 25–75th percentile 2.3–8.2%, N=1,933 subjects). In addition to the consistency of effect estimates after adjustment, results remained statistically significant for 376 (85%) at Bonferroni correction for 443 tests, P<1.13 × 10−4, and all 443 CpGs had P<9 × 10−4 (Supplementary Data 3). Thus, vitamin B12 does not confound the folate–methylation associations we observed.
Women with higher folate levels, which largely reflect supplement use, might be more likely to take multivitamins and/or separate supplements such as cod liver or fish oils that are common in Norway. However, vitamin D (total of D2 and D3) levels were modestly correlated with folate levels (Spearman correlation coefficient=0.14 in MoBa, 0.23 in Generation R, P<0.001 for both cohorts). Adjustment for vitamin D only minimally altered effect estimates for folate in relation to methylation (median absolute value of change 7.3%, 25–75th percentile 3.3–12.3%). Despite the reduction in power due to the smaller sample size for these adjusted analyses (N=1,664), 70% of CpGs significantly related to folate in the main model remained Bonferroni significant after adjustment for vitamin D (308 with P<1.13 × 10−4; Supplementary Data 3).
We performed additional analyses adjusting for two single-nucleotide polymorphisms (SNPs) in the MTHFR gene that influence one-carbon metabolism and are correlated with plasma folate: rs1801133 and rs1801131 (refs 14, 15). These SNPs are in moderate linkage disequilibrium with each other (r2=0.20–21 in the two studies). Adjustment for these two SNPs made little difference in the effect estimates compared with the main model; median change in coefficient=3.8% (25–75th percentile=2.0–6.9%) and 85% of CpGs remained statistically significant despite reduction in sample size to 1,880 (P<1.13 × 10−4, correction for 443 tests). Thus, these genetic variants do not confound the relationship between folate and methylation.
Homocysteine, unlike folate or vitamin B12, is not a nutrient that plays a role as a methyl donor or carrier, but is a product formed during transmethylation in the one-carbon metabolism cycle. It could be regarded as an intermediate on the causal pathway between folate and methylation. In addition, like plasma folate, it is an excellent marker of folate status. Homocysteine was strongly correlated with maternal plasma folate in MoBa (Spearman correlation=−0.49, P<0.001) and moderately correlated in Generation R (Spearman correlation=−0.24, P<0.001), making it challenging to estimate independent effects. Given these various factors, inclusion of homocysteine in the model led to a moderate change in the coefficients for folate in relation to methylation (median change 10.7%, 25–75th percentile 5.8–17.2%. N=1,931 subjects) and only 137 (31%) CpGs remained statistically significant (P<1.13 × 10−4, correction for 443 tests).
We also examined whether the associations with methylation seen for maternal folate levels are also seen for newborn folate levels in a subset of 572 subjects in Generation R. Thus, this analysis is not well powered compared with our maternal folate analysis with 1,988 subjects. However, of the 443 FDR-significant findings for maternal folate in the meta-analysis there were 60 (14%) with nominal P values<0.05 for newborn folate which is higher than the 5% expected by chance alone (Kolmogorov P<1.2 × 10−13). This supports the interpretation that some similar loci are differentially methylated in response to infant folate, although we were severely underpowered to address this properly.
Pathway analysis with the FDR-significant CpGs showed strong and consistent enrichment of fundamental development pathways and of neurodevelopmental pathways (Supplementary Tables 2–4). The biological processes implicated from the DAVID pathway analysis included cell development, embryonic morphogenesis, development, regulation of multicellular organismal processes, cell–cell signalling, embryonic development, forebrain development and, notably, neural tube development (Supplementary Table 2). Ingenuity Pathway Analysis (IPA) results indicated pathways related to nervous system development and function, cell–cell signalling and basic developmental processes (Supplementary Table 3). Gene ontology enrichment analysis and visualization tool results included pathways related to the synaptic signalling, cell–cell signalling, regulation of cAMP biosynthetic process, single-organism behaviour, single-organism signalling, signalling, regulation of gastrulation and the regulation of nervous system development (Supplementary Table 4).
Methylation expression analysis
Of the 365 CpGs associated with folate that we were able to match to a gene transcript (±250 kb), 43 CpGs were significantly associated with altered expression of nearby genes (PBH<0.1). For most CpGs, increased methylation was associated with decreased gene expression (Supplementary Table 5).
Our study is the largest to date using the Illumina 450 K epigenome-wide platform to evaluate the impact of maternal plasma folate levels during pregnancy on DNA methylation in newborns. We meta-analysed results from two population-based birth cohort studies in Northern Europeans that measured DNA methylation using the same platform. We observed epigenome-wide FDR-significant associations between maternal plasma folate and DNA methylation in cord blood at 443 CpGs.
It is notable that many of the implicated genes have functional relevance to various developmental pathways. Some are relevant to NTDs, the indication for maternal folic acid supplementation, and others to distinct developmental conditions that have not been previously associated with maternal folate levels. Additional genes we identified have been implicated in conditions where there is some concern about possible adverse effects of higher folate levels, such as breast cancer progression16. Due to the large number of genes significantly differentially methylated in relation to folate (Supplementary Data 1 and 2), we focus this discussion primarily on genes with two or more CpGs at genome-wide significance after FDR correction (PBH<0.05) where at least one CpG is within the gene (Table 2).
We observed the largest number (nine) of statistically significant CpGs mapping to the gene adenomatosis polyposis coli 2 gene (APC2). APC2 is expressed in both human fetal and adult brain17 and in the peripheral nervous system18. It plays a critical role in the brain development in several model systems19. APC2 may also play a role in cancer aetiology. A homologue of the tumour suppressor gene APC20, APC2, is involved in the regulation of the Wnt signalling pathway, which impacts both normal development and tumorigenesis21. Studies in mice have reported associations between periconceptional maternal folate and methylation of APC genes22. In two human breast cancer lines, folate leads to methylation-mediated silencing of APC and other tumour suppressor genes, raising concern about the risk of tumour progression23. Thus, folate-related methylation of APC2 during fetal development could impact both pathways of neurodevelopment and carcinogenesis.
GRM8 encodes a glutamate receptor that interacts with L-glutamate, the major excitatory neurotransmitter in the central nervous system. Glutamatergic neurotransmission is ubiquitous in normal brain function24 and is perturbed in various neuropathologies. In humans, copy-number variations of GRM8 have been associated with neurodevelopmental disorders such as attention-deficit hyperactivity disorder25 and autism spectrum disorder26.
A number of genes we identified as differentially methylated in newborns in relation to maternal folate are known to harbour mutations that have been causally implicated in various developmental abnormalities other than NTDs, the indication for folic acid supplementation in pregnancy. These include several with two or more statistically significant CpGs (Table 2) such as SLC16A12, implicated in juvenile cataracts with microcornea and renal glucosuria27; and KLK4, implicated in the dental malformation amelogenesis imperfecta28. Mutations in LHX1 have been associated with abnormalities in uterine development29, and recent evidence suggests an important role in retinal development30. Several genes with one CpG at genome-wide statistical significance (Supplementary Data 1 and 2) also harbour mutations that are causal for various development malformations. These include IHH involved in skeletal malformations, ROBO3 involved in horizontal gaze palsy with progressive scoliosis, PCSK9 involved in familial hypercholesterolemia, FAM83H related to amelogenesis imperfecta type 3 and GJA3 associated with congenital cataracts. Taken together, these findings suggest a role for periconceptional folate levels in birth defects not previously known to be related to this nutrient.
Our agnostic evaluation of maternal folate levels and DNA methylation in newborns also identified genes related to various neurologic diseases. Genetic variation in OPCML and PRPH has been associated with the neurodegenerative disease amyotrophic lateral sclerosis31,32. In genome-wide association studies, CSMD1 has been associated with schizophrenia and autism33.
Some previous studies of folate and methylation have examined the H19 imprinted region11,34. We identified three significant CpGs located 45–48-Kb upstream of H19 among 77 CpGs on the platform that are within 48 up- or downstream of H19.
The largest number of statistically significant associations at any locus, 31, are on chromosome 12 and, based on our extended annotation, are nearest to ALG10. Two CpGs are 262–573-kb upstream; the other 29 CpGs are 261–573-kb downstream. None are in ALG10. Most are in a CpG island near the centromere and there are no features that suggest functional impact.
In the only previous study using the 450 K platform, Amarasekera et al12 reported differential methylation in relation to maternal folate in a 923-bp region on chromosome 6, 3-kb upstream of ZFP57. Our studies differ in sample sizes, design and analysis methods. However, when we evaluate the 20 CpGs that map to ZFP57, we find 5 with uncorrected P values of 0.05 or smaller—more than would be expected by chance alone. Thus, our data provide support for association at this locus.
From correlation analysis of 450 K methylation data and gene expression in white blood cells in adults, after correction for multiple testing, 43 CpGs that we implicated in relation to maternal folate were also related to expression of nearby genes (Supplementary Table 5). Although correlation of 450 K methylation with gene expression in the same newborn samples would have been preferable, we were only able to examine correlations in a population of Dutch adults. The most statistically significant correlation between methylation and gene expression was observed for the gene PRSS21 (protease serine 21 (testisin)); four CpGs were both significantly associated with maternal folate (Table 2) and expression of this gene (Supplementary Table 5). PRSS21 is a tumour suppressor gene silenced by aberrant methylation in testicular germ cell tumours35. Testicular germ cell tumours are diagnosed in early adulthood and can manifest as early as 15 years of age. Prenatal origin of this tumour has been proposed36; perhaps, methylation in utero, influenced by maternal folate levels, could play a role in this pathogenesis.
Because other important factors in one-carbon metabolism could potentially explain associations between folate levels and DNA methylation in cord blood, we performed various sensitivity analyses (Supplementary Data 3). On the basis of these analyses, vitamin B12 does not confound the folate–methylation association. This lack of confounding by B12 should extend to other B vitamins such as B6 and riboflavin that are present in multivitamins along with B12. We did not have data in both studies on choline, a nutrient that can serve as a source of one-carbon units. However, in MoBa, where choline was measured, there was no correlation with folate levels (Spearman correlation=−0.034, P=0.23) and thus choline should not confound associations between folate and methylation. Vitamin D is not part of the one-carbon metabolism cycle but might impact methylation by other mechanisms37. We performed analyses in a subsample taking vitamin D into account as proxy for intake of other supplements or possibly healthy dietary patterns and observed no major differences in results. Adjustment of the folate–methylation association for homocysteine, a product formed in one-carbon metabolism that is itself an excellent marker of folate status, resulted in a substantial reduction in the number of statistically significant findings. Although caution is required, both because folate and homocysteine are correlated, and because they operate together in a cycle rather than a clear unidirectional pathway, this attenuation could be interpreted as homocysteine, at least in part, mediating some of the associations between folate and methylation.
Given the role of folate as a major provider of methyl groups in the one-carbon metabolism pathway, our finding of reduced methylation with higher folate at the majority of the implicated CpGs may seem counterintuitive. However, methyl groups from the one-carbon metabolism pathway are used in a range of biological processes and the complex interactions of these systems may not necessarily result in linear relationships. Indeed, there is evidence that effects of folate on folate-dependent enzymes may switch directions at the higher intracellular concentrations that may accompany folic acid supplementation38. Folic acid, in vitamin supplements or food fortification, is a synthetic folate with possible effects that differ from those of natural occurring folate species. There is recent evidence that folic acid interferes with the inhibitory effect of S-adenosylmethionine (SAM) on methylenetetrahydrofolate reductase (MTHFR)39 and may inhibit MTHFR activity, thereby reducing the amount of 5-methyl-tetrahydrofolate, SAM and the SAM/S-adenosylhomocysteine (SAH) ratio40. The SAM/SAH ratio has been referred to as the methylation potential; low SAM/SAH ratio may decrease DNA methylation. This may explain the inverse relationship we observe in our study but additional research is needed to more fully explain the complex biochemistry behind these observations. Of note, inverse correlations between prenatal folate status and DNA methylation at differentially methylated loci have been identified in the other population studies including Hoyo et al.34 and Amarasekera et al.12
Although the health outcomes that have been related to folic acid supplementation involve target tissues such as the nervous system, we only had cord blood available for assessment of methylation. We do not know whether differential methylation at the sites that we observed in cord blood would be observed in relevant target tissues. While divergence in epigenetic patterns is critical for cell-type regulation, there is also evidence of similarities in patterns among some tissues41,42,43. We do not have data on methylation at older ages and thus the question of whether the differential methylation at these loci seen at birth in relation to maternal folate persists to later childhood would need to be addressed in future studies.
We measured folate using two different platforms in the two studies. Both are valid methods for the measurements of folate. Levels were reasonably similar although slightly higher in Generation R, which could reflect a difference in the platforms, differences in folate intake or the earlier timing of measurements in Generation R (∼12-week gestation in Generation R versus ∼18-week gestation in MoBa). Nonetheless, the top findings were consistent in both cohorts and thus robust to differences in measurement platforms. This may increase their generalizability to other populations.
One-carbon metabolism is a complex pathway with influences from multiple genetic, hormonal and environmental factors. Despite our attempt to account for other important dietary intake involved in one-carbon metabolism, other supplementation and genetic variants, residual confounding could still be present and influence the observed associations of folate levels in pregnancy with methylation at birth.
The MoBa and Generation R cohorts offer a unique opportunity to study the epigenetic effect of folic acid supplementation in the absence of food supply fortification. It is possible that results may differ in populations exposed to fortification.
We identified multiple novel genes not previously implicated in biological responses to folate. Many of the implicated genes have functional relevance to various developmental pathways, including the nervous system. Some of these are relevant not only to NTDs, the indication for maternal folic acid supplementation, but also to other developmental abnormalities that have not been previously associated with maternal folate levels. The associations between periconceptional folate and these conditions are difficult to study because the abnormalities are rare and both supplementation and fortification are now widespread. Other genes identified are implicated in conditions where concern exists about possible adverse effects of higher folate levels, such as breast cancer progression16. These findings may provide new insights into mechanisms for the associations between maternal folate status and health outcomes in the offspring. Given that food fortification programs have greatly increased the folate status of the population, greater understanding of the biological effects of this nutrient is important. The large number of novel genes identified using our genome-wide methylation approach may shed light on the protean effects of folate on human health.
This analysis included participants of the Norwegian Mother and Child Cohort Study (MoBa)44,45 and participants of the Generation R Study from the Netherlands. The study populations and cohort-specific methods described below are more extensively detailed in the Supplementary Information (Supplementary Note). The MoBa participants were mother–offspring pairs from a substudy measuring maternal plasma folate during pregnancy46. The Generation R Study is a population-based prospective cohort study from fetal life onwards47,48. For this analysis, information on plasma folate and DNA methylation was available for 1,289 mothers and their children from the MoBa study (1,275 with complete covariate data) and 790 Caucasian mothers, and their children from the Generation R Study (713 with complete covariate data).
The MoBa study was approved by the Regional Committee for Ethics in Medical Research, the Norwegian Data Inspectorate and the Institutional Review Board of the National Institute of Environmental Health Sciences, USA, and written informed consent was provided by all mothers participating. The Generation R Study has been approved by the Medical Ethical Committee of the Erasmus MC, University Medical Center Rotterdam, Netherlands and written consent was obtained from participating parents of their children.
Maternal plasma folate measurements
Both cohorts measured maternal plasma folate during pregnancy. For MoBa, maternal blood samples were drawn during pregnancy (median weeks gestation=18 weeks, 25–75th percentile=16–21 weeks) in EDTA-lined tubes, centrifuged within 30 min after collection and stored at 4 °C in the hospital where they were collected. Samples were then shipped overnight to the Biobank of MoBa at the Norwegian Institute of Public Health in Oslo. Upon receipt (1–2 days after blood collection), plasma was aliquoted onto polypropylene microtiter plates, sealed with heat-sealing foil sheets and stored at −80 °C. Plasma folate concentration was measured at Bevital AS (www.bevital.no) by microbiological assay, using a chloramphenicol-resistant strain of Lactobacillus casei49, which measures biologically active folate species, predominantly 5-methyl-tetrahydrofolate. The coefficient of variation (CV) for this assay corresponds to 4% within day and 5% between days, at population median.
For the Generation R cohort, venous blood samples were drawn at enrolment of the mothers in early pregnancy (median weeks gestation=12.9 weeks; 25–75th percentile=12.1–13.9 weeks) and stored at room temperature for a maximum of 3 h. Samples were transported to a laboratory facility of the regional laboratory in Rotterdam, Netherlands (Star-Medisch Diagnostisch Centrum) for additional processing and storage at −80 °C. The samples were analysed at the Department of Clinical Chemistry at the Erasmus MC, University Medical Center Rotterdam, Netherlands. After thawing, folate concentrations were analysed using an immunoelectrochemoluminence assay on the Architect System (Abbott Diagnostics BV). Between-run CVs for plasma measurements were 8.9% at 5.6 nmol l−1, 2.5% at 16.6 nmol l−1 and 1.5% at 33.6 nmol l−1 with an analytic range of 1.8–45.3 nmol l−1 for plasma folate.
Each cohort had information on maternal age, education and parity from questionnaires completed by the mother or from birth registry records. Maternal smoking during pregnancy was ascertained with questionnaires (both cohorts) and cotinine levels (MoBa). Plasma levels of vitamin B12, vitamin D and total homocysteine from samples taken during pregnancy were available for both cohorts. Mothers in both cohorts were genotyped for two SNPs in the (NAD(P)H) MTHFR gene, rs1801131 and rs1801133. Additional detail on these measurements is in the Supplementary Information (Supplementary Note).
DNA methylation measurements
DNA was extracted from cord blood and bisulfite conversion performed (EZ-96 DNA Methylation kit, Zymo Research Corporation, Irvine, USA). Samples were processed with Illumina’s Infinium HumanMethylation450 BeadChip (Illumina Inc., San Diego, USA) followed by cohort-specific laboratory quality control. Each cohort calculated the methylation betas, and normalized the betas using a published method50,51.
Estimation of cell-type proportions
Both the MoBa and Generation R studies estimated cell-type proportion with the Houseman method52 as implemented in the R minfi package53 using the Reinius et al. data set for reference54. Cell-type correction was applied by including the six estimated cell-type proportions as covariates in cohort-specific statistical models.
Cohort-specific statistical analyses
The cohort-specific statistical models were run independently. For each cohort, we used robust linear regression models in R55 to evaluate the association between natural log-transformed maternal plasma folate and cord blood DNA methylation for each probe while accounting for potential heteroskedasticity and/or influential outliers. Models were adjusted for maternal age, education, smoking during pregnancy, parity and for batch effects (adjustment for plate in Generation R, correction using ComBat50 in MoBa). Additional correction for study design was done in MoBa (whether the participant was in the MoBa1 or MoBa2 data set). Sex of the child was not expected to be associated with maternal plasma folate and was therefore not included as a covariate in the analyses. The adjustment variables were selected on a priori considerations and because they were also associated with maternal plasma folate levels at P<0.05.
The probe-specific quality control resulted in 473,731 CpGs in the MoBa cohort and 436,013 CpGs in the Generation R cohort. The meta-analysis was limited to the 425,749 CpGs common to both cohorts. An additional 5,844 CpGs were excluded for having a SNP mapping to the last five nucleotides of the probe sequence and with a minor allele frequency 5% in the CEU (Utah residents with North and Western European ancestry) population, curated by 1000G projects (http://www.1000genomes.org/, 06/2011 release, 87 individuals), HapMap project (http://hapmap.ncbi.nlm.nih.gov/, release 28, 8/2010, 174 individuals) and dbSNP (http://www.ncbi.nlm.nih.gov/projects/SNP/, build 134, 8/2011, 116 individuals). This left 419,905 CpGs for the final meta-analyses.
Fixed-effect meta-analysis weighted by the inverse of the variance was completed using METAL56. Multiple testing was accounted for by using the FDR procedure by Benjamini and Hochberg (BH)57. For each CpG, the resulting BH corrected P values are denoted by PBH. CpGs with PBH<0.05 were considered statistically significant. CpGs that were statistically significant based on the more stringent Bonferroni correction (uncorrected P<1.19 × 10−7 to account for 419,905 tests) were noted. We present the covariate-adjusted model without cell-type adjustment as the primary results. In the Supplementary Information, we present results additionally adjusted for cell type and results without covariate adjustment.
We performed sensitivity analyses to assess whether the associations observed between folate and methylation might be explained by levels of vitamin B12, a dietary co-factor involved in regulating carbon unit bioavailability. Vitamin B12 is generally present in multivitamins that pregnant women in our studies may have taken in addition to, or in lieu of, separate folic acid supplements. Multivitamin supplements containing B12 typically contain other B vitamins including vitamin B6, which is also involved in one-carbon metabolism. Because mothers with higher folate levels may have higher intakes of other vitamin supplements not involved in the one-carbon metabolism pathway, or healthier diets in general, we also performed separate analyses adjusting for maternal plasma vitamin D levels during pregnancy. We also examined two SNPs in MTHFR involved in modulation of one-carbon metabolism: rs1801131 and rs1801133 (refs 14, 15). We evaluated the impact of adjustment for total homocysteine on the association between maternal plasma folate and DNA methylation in newborns. Finally, we examined whether the associations with methylation seen for maternal folate levels are also seen for newborn folate levels in a subset of 572 subjects in Generation R.
To better understand the functional relationships between differentially methylated CpGs, we evaluated the FDR-significant CpGs with pathway analysis using three independent software programs. First, gene ontology analysis was performed using the IPA (www.ingenuity.com) based on the content version of 21249400 (release date: 22 September 2014). For a given category in IPA, Fisher’s exact test was used to measure the probability that the category was randomly associated (P<0.05 defined as significantly enriched). Second, the NIAID’s DAVID Bioinformatics Resources 6.7 (ref. 58) was used to analyse enrichments in main categories: biological process, cellular component, molecular function and KEGG pathway. Third, we used gene ontology enrichment analysis and visualization tool59 to identify the most informative terms that are significantly enriched.
Methylation expression analysis
We evaluated the association between methylation and quantitative levels of gene expression for our top CpGs. We used messenger RNA gene expression and 450 K methylation data both from white blood cells from adults over 45 years of age in the Rotterdam Study, a population-based prospective cohort study in Rotterdam, the Netherlands. Among the 443 FDR-significant CpGs associated with folate, we were able to match 365 CpGs to a gene transcript in our gene expression data set within a region of 250-kb upstream or downstream of the CpG (total region 500 kb). We analysed the associations of these CpGs with expression levels of the corresponding gene transcripts.
Accession codes: The complete genome wide meta-analysis results file has been deposited in the database of genotypes and phenotypes (dbGaP) under the accession number phs001059.v1.p1. Access to individual-level Illumina HumanMethyl450 Beadchip data for the MoBa study dataset is available by application to the Norwegian Institute of Public Health using a form available on the English language portion of their website at http://www.fhi.no/eway/. Specific questions regarding MoBa data access can be directed to Wenche Nystad: Wenche.Nystad@fhi.no. Requests for access to the individual level data for the Generation R study can be directed to Liesbeth Duijts: email@example.com. For both studies the study management teams will verify with their local ethical committees that the applications are consistent with the consent provided. Applicants will need to obtain IRB approval or exemption from their local institutional review boards.
How to cite this article: Joubert, B. R. et al. Maternal plasma folate impacts differential DNA methylation in an epigenome-wide meta-analysis of newborns. Nat. Commun. 7:10577 doi: 10.1038/ncomms10577 (2016).
This research was supported in part by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences (Z01-ES-49019). Additional funding support was provided by the NIH Office of Dietary Supplements. We acknowledge Shuangshuang Dai of Integrative Bioinformatics at the NIEHS and Jianping Jin of Westat for their expert data management and programming assistance. The Norwegian Mother and Child Cohort Study is supported by the Norwegian Ministry of Health and the Ministry of Education and Research, NIH/NIEHS (contract no. N01-ES-75558), NIH/NINDS (grant no.1 UO1 NS 047537-01) and the Norwegian Research Council/FUGE (grant no. 151918/S10). We are grateful to all the participating families in Norway who take part in this ongoing cohort study. E.B. was supported by the Adam J Berry Memorial Scholarship administered by the Australian Academy of Science and the Foundation for the National Institutes of Health. The Generation R Study is conducted by the Erasmus Medical Center in close collaboration with the School of Law and Faculty of Social Sciences of the Erasmus University Rotterdam, the Municipal Health Service Rotterdam area, Rotterdam, the Rotterdam Homecare Foundation, Rotterdam and the Stichting Trombosedienst & Artsenlaboratorium Rijnmond, Rotterdam. We gratefully acknowledge the contribution of children and parents, general practitioners, hospitals, midwives and pharmacies in Rotterdam. The study protocol was approved by the Medical Ethical Committee of the Erasmus Medical Centre, Rotterdam. We thank Mr Michael Verbiest, Ms Mila Jhamai, Ms Sarah Higgins, Mr Marijn Verkerk and Dr Lisette Stolk for their help in creating the EWAS database. The Generation R Study is made possible by financial support from the Erasmus Medical Center, Rotterdam, the Erasmus University Rotterdam and the Netherlands Organization for Health Research and Development. J.F.F. has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 633595. O.F. works in ErasmusAGE, a centre for aging research across the life course funded by Nestlé Nutrition (Nestec Ltd), Metagenics Inc and AXA. Nestlé Nutrition (Nestec Ltd.), Metagenics Inc. and AXA had no role in design and conduct of the study, in the collection, management, analysis and interpretation of the data, or in the preparation, review and approval of the manuscript. A.D. received an additional grant from the Netherlands Organization for Health Research and Development (VENI 916.12.154) and the EUR Fellowship. V.W.V.J. received an additional grant from the Netherlands Organization for Health Research and Development (VIDI 016.136.361) and a Consolidator Grant from the European Research Council (ERC-2014-CoG-64916). L.D. received an additional grant from the Lung Foundation Netherlands (no 3.2.12.089; 2012). The generation and management of the Illumina 450 K methylation array data (EWAS data) for the Generation R Study was executed by the Human Genotyping Facility of the Genetic Laboratory of the Department of Internal Medicine, Erasmus Medical Center, the Netherlands. The EWAS data was funded by a grant to V.W.V.J. from the Netherlands Genomics Initiative (NGI)/Netherlands Organisation for Scientific Research (NWO) Netherlands Consortium for Healthy Aging (NCHA; project nr. 050-060-810), and by funds from the Genetic Laboratory of the Department of Internal Medicine, Erasmus Medical Center.
Meta-analysis of the association between maternal plasma folate during pregnancy and DNA methylation in newborns: CpGs statistically significant after FDR correction in covariate adjusted models, sorted by P value
Meta-analysis of the association between maternal plasma folate during pregnancy and DNA methylation in newborns: CpGs statistically significant after FDR correction in covariate adjusted models, sorted by chromosome and position. For comparison results from unadjusted and cell type adjusted meta-analyses are also presented
Sensitivity analysis results for the FDR-significant CpGs from the main covariate-adjusted model