Multi-omics signatures of the human early life exposome

Maitre, Léa; Bustamante, Mariona; Hernández-Ferrer, Carles; Thiel, Denise; Lau, Chung-Ho E.; Siskos, Alexandros P.; Vives-Usano, Marta; Ruiz-Arenas, Carlos; Pelegrí-Sisó, Dolors; Robinson, Oliver; Mason, Dan; Wright, John; Cadiou, Solène; Slama, Rémy; Heude, Barbara; Casas, Maribel; Sunyer, Jordi; Papadopoulou, Eleni Z.; Gutzkow, Kristine B.; Andrusaityte, Sandra; Grazuleviciene, Regina; Vafeiadi, Marina; Chatzi, Leda; Sakhi, Amrit K.; Thomsen, Cathrine; Tamayo, Ibon; Nieuwenhuijsen, Mark; Urquiza, Jose; Borràs, Eva; Sabidó, Eduard; Quintela, Inés; Carracedo, Ángel; Estivill, Xavier; Coen, Muireann; González, Juan R.; Keun, Hector C.; Vrijheid, Martine

doi:10.1038/s41467-022-34422-2

Article
Open access
Published: 21 November 2022

Nature Communications volume 13, Article number: 7024 (2022)

Abstract

Environmental exposures during early life play a critical role in life-course health, yet the molecular phenotypes underlying environmental effects on health are poorly understood. In the Human Early Life Exposome (HELIX) project, a multi-centre cohort of 1301 mother-child pairs, we associate individual exposomes consisting of >100 chemical, outdoor, social and lifestyle exposures assessed in pregnancy and childhood, with multi-omics profiles (methylome, transcriptome, proteins and metabolites) in childhood. We identify 1170 associations, 249 in pregnancy and 921 in childhood, which reveal potential biological responses and sources of exposure. Pregnancy exposures, including maternal smoking, cadmium and molybdenum, are predominantly associated with child DNA methylation changes. In contrast, childhood exposures are associated with features across all omics layers, most frequently the serum metabolome, revealing signatures for diet, toxic chemical compounds, essential trace elements, and weather conditions, among others. Our comprehensive and unique resource of all associations (https://helixomics.isglobal.org/) will serve to guide future investigation into the biological imprints of the early life exposome.

Introduction

A large proportion of environmental risk factors remains unknown or poorly defined, although the environmental contribution to disease risk is estimated to be 70–90%^1,2. More than a decade ago, the term “exposome” was coined to encompass all environmental factors (i.e. non-genetic factors) to which humans are exposed throughout the life course³. Historically, environmental health studies focused almost exclusively on single exposure factors such as air pollution, lead, or pesticides. The central tenet of the exposome concept is a call for a holistic and systematic approach to assessing the impacts of environment on health. Moreover, the exposome includes not only external exposures, but also the internal biological responses to these exposures through the interrogation of high-dimensional molecular data^3,4,5,6.

Of particular interest is the early detection of physiological changes at the molecular level related to environmental exposures before the manifestation of clinical symptoms in healthy populations. Such information may support the biological plausibility of environment-health associations in population studies, help to understand toxicological mechanisms or elucidate how multiple exposures may be grouped based on their common influence on biological pathways (e.g. inflammation) or their source of exposure (e.g. diet). It can also help to identify exposure biomarkers to predict current and past exposures. Integrative personal omics profiling studies, gathering high-throughput data on multiple molecular layers, have demonstrated that personal molecular profiles may be particularly useful to assess disease risk, detect early preclinical conditions and initiate preventive strategies^7,8,9.

Foetal and childhood development has life-long consequences and is critical for many chronic diseases including obesity, cardiometabolic diseases^10,11,12, attention-deficit and hyperactivity disorders (ADHD)¹³ and lung function¹⁴. Therefore, early life is a particularly important period to study the early biological triggers of disease: exposures during these developmentally vulnerable periods may have pronounced effects at the molecular level that may remain clinically undetectable until adulthood.

The molecular mechanisms through which early-life environmental exposures may impact birth outcomes and long-term health in humans have primarily been studied through the lens of epigenetics. It is thought that the epigenome orchestrates cellular responses to environmental perturbations and provides cell memory and plasticity¹⁵. Among all epigenetic marks, DNA methylation is the most studied in epidemiological settings; and among all exposures, tobacco smoke is the most investigated^16,17,18,19. To a lesser extent, other diverse exposures, from metals and air pollution to socio-economic factors, have been linked to differential methylation and are catalogued in public databases¹⁶ (http://www.ewascatalog.org/). Although epigenetic marks regulate gene transcription and thus the proteome, the relationships between these and the exposome are less studied¹⁷. The metabolome, which can reflect physiological responses and microbiome activity, as well as the direct internalization of exposures, has received particular attention in exposome research^5,18,19,20. However, there is a clear lack of large-scale studies that evaluate multi-omics signatures of a wide range of environmental exposures.

In this work, we aimed to associate the personal early life exposome, measured in 1301 mother–child pairs of the Human Early Life Exposome (HELIX) project, with deep molecular phenotype data assessed in childhood and defined by the blood methylome and transcriptome, plasma proteins, and serum and urinary metabolites²¹. By systematically documenting all associations between the exposome and the molecular phenotypes, we provide a unique resource (https://helixomics.isglobal.org/) for the identification of novel exposure biomarkers and early biological effects during developmentally vulnerable life periods.

Results

Building the early life exposome and the multi-omics phenotypes in HELIX children

We assessed the early life exposome in 1301 mother–child pairs from the HELIX project, a multi-centre longitudinal population-based cohort study in 6 locations in Europe (Spain, UK, France, Lithuania, Norway and Greece) (Fig. 1 and Supplementary Data 1A)²¹. We measured 91 environmental exposures in pregnancy and 116 in childhood, when children were between 6 and 11 years old. Exposures covered 19 families: meteorological factors, natural spaces, indoor and outdoor air pollution, built environment, road traffic, noise, water disinfection by-products, tobacco smoking, lifestyle factors (diet, physical activity), social and economic capital, essential minerals and chemical pollutants (non-essential metals, organochlorines, organophosphate pesticides, polybrominated diphenyl ethers, perfluoroalkyl substances, phenols and phthalates) (Fig. 1). Exposure levels in the HELIX cohorts are described further elsewhere^22,23,24. Correlation patterns among exposure variables adjusted for cohort are shown in Supplementary Data 1B (pregnancy), 1C (childhood) and 1D (for the same exposure variable among the two periods). Exposure assessment tools included mass spectrometry-based measurement of biomarkers of chemical exposure in urine and blood, exposure monitors, remote sensing and geospatial methods, and questionnaire-based interviews.

**Fig. 1: An overview of the early-life exposome and multi-omics signature study.**

For these same children, aged between 6 and 11 years, we performed in-depth multi-omics molecular phenotyping, including measurement of blood DNA methylation (450K, Illumina), blood gene expression (HTA v2.0, Affymetrix), blood miRNA expression (SurePrint Human miRNA rel 21, Agilent), plasma proteins (3 Luminex multiplex assays), serum metabolites (targeted LC-MS/MS metabolomic assay, Biocrates AbsoluteIDQ p180 kit), and urinary metabolites (¹H nuclear magnetic resonance (NMR) spectroscopy) (Fig. 1 and Supplementary Data 1E). While blood DNA methylation and transcriptomics were measured genome-wide with 386,518 CpGs, 58,254 transcript clusters (TCs) and 1117 miRNAs, the other omics followed a semi-targeted or targeted approach. Plasma proteins included a total of 36 cytokines, apolipoproteins and adipokines (Supplementary Data 1F)²⁵. The serum metabolites (N = 177) included amino acids, biogenic amines, acylcarnitines, glycerophospholipids, sphingolipids and sum of hexoses, covering a wide range of analytes and metabolic pathways in one targeted assay (Supplementary Data 1G)²⁶. Urine metabolites (N = 44) mainly included amino acids, organic acids, nicotinamides, amines and gut microbial-derived phenols (Supplementary Data 1H)²⁶. Around 91% of the children had molecular data from at least 4 of the omics platforms. Detailed information on the HELIX participants, exposure assessment and omics measurements can be found in Supplementary Information.

Results of the exposome-omics-wide association study (ExWAS)

We first systematically tested the association between each exposure variable and each molecular feature, successively and independently, through an ExWAS, using an analogous statistical approach to that of Genome-Wide Association Studies (GWAS) (Fig. 1 and Supplementary Information). Overall, we tested >30 M exposure-omics associations (>0.3 M molecular features × ~100 exposures × 2 exposure periods) through linear regression models adjusted for the same set of confounders: cohort, child’s age, sex, z-score body mass index (zBMI), ancestry, maternal education and omics-specific covariates. Results of all these associations can be viewed in the web catalogue: https://helixomics.isglobal.org/ (for genome-wide omics platforms, only results with p values <0.01 are included).
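
The ExWAS loop described above can be sketched as follows. This is a minimal illustration, not the HELIX pipeline: it fits one ordinary least squares model per exposure-feature pair with a shared confounder set and returns only the exposure coefficient (p values are omitted). The function names, the toy data and the single confounder (age) are all hypothetical.

```python
def ols_coefficients(X, y):
    """Solve the normal equations (X'X) b = X'y by Gauss-Jordan elimination."""
    n, p = len(X), len(X[0])
    # Augmented matrix [X'X | X'y], one row per coefficient.
    A = [
        [sum(X[i][r] * X[i][c] for i in range(n)) for c in range(p)]
        + [sum(X[i][r] * y[i] for i in range(n))]
        for r in range(p)
    ]
    for col in range(p):
        # Partial pivoting: bring the largest remaining entry onto the diagonal.
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[r][p] / A[r][r] for r in range(p)]


def exwas(exposures, features, confounders):
    """Fit feature ~ intercept + exposure + confounders for every pair.

    Returns {(exposure_name, feature_name): exposure coefficient}.
    """
    results = {}
    for e_name, e_values in exposures.items():
        X = [[1.0, e] + list(c) for e, c in zip(e_values, confounders)]
        for f_name, f_values in features.items():
            results[(e_name, f_name)] = ols_coefficients(X, f_values)[1]
    return results


# Toy data (hypothetical): one exposure, one confounder (age), two features
# constructed without noise so the true exposure effects are 2.0 and -1.0.
exposure = [0.1, 0.4, 0.3, 0.9, 0.5, 0.7]
age = [6.0, 7.0, 8.0, 9.0, 10.0, 11.0]
exposures = {"exposure_1": exposure}
confounders = [(a,) for a in age]
features = {
    "feature_a": [2.0 * e + 0.5 * a for e, a in zip(exposure, age)],
    "feature_b": [-1.0 * e + 0.1 * a for e, a in zip(exposure, age)],
}
betas = exwas(exposures, features, confounders)
```

Because every model shares the same confounder columns, a production implementation would typically reuse a matrix factorisation across features rather than refitting from scratch; the nested loop here is kept only for readability.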

To identify statistically significant exposure-omics associations, correction for multiple comparisons was applied for each exposure within each omics dataset. We considered as significant those associations with p values below a False Discovery Rate (FDR) of 0.05 for genome-wide omics, and below a modified version of the Bonferroni cut-off for the proteins and metabolites, which consists in dividing the nominal p value threshold by the effective number of tests (ENT) determined from the correlation structure of the omics dataset (Supplementary Data 1I and Supplementary Information). With these criteria, 1170 exposure-omics associations were statistically significant. Associations between the pregnancy exposome and molecular phenotypes totalled 249, including 52 unique exposures and 209 unique molecular features, while the 921 associations with the childhood exposome corresponded to 84 unique exposures and 454 unique molecular features. All 1170 statistically significant associations are shown in Supplementary Data 2.
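
The two significance rules above can be illustrated with a short sketch. The FDR step is implemented here with the Benjamini-Hochberg procedure, which is an assumption (the text does not name the FDR method). The ENT is passed in as a plain number because its derivation from the correlation structure is described only in the Supplementary Information; the value 20 and the nominal threshold 0.05 used below are hypothetical.

```python
def benjamini_hochberg(p_values, fdr=0.05):
    """Return a list of booleans, True where a test passes the BH procedure."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * fdr.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank
    significant = [False] * m
    # Every test ranked at or below the cutoff is declared significant.
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            significant[idx] = True
    return significant


def ent_bonferroni_threshold(alpha, effective_number_of_tests):
    """Bonferroni-style cut-off using the effective number of tests (ENT)."""
    return alpha / effective_number_of_tests


# Hypothetical p values for one exposure against five molecular features.
p_values = [0.039, 0.001, 0.205, 0.008, 0.06]
flags = benjamini_hochberg(p_values, fdr=0.05)

# Hypothetical ENT of 20 for a targeted omics panel.
threshold = ent_bonferroni_threshold(0.05, 20)
```

In this toy example only the two smallest p values pass the FDR rule, and the ENT-based cut-off is 0.0025. Because correlated metabolites or proteins give an ENT smaller than the raw feature count, this cut-off is less conservative than a plain Bonferroni correction.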

Miami plots display exposure-omics associations by family of exposure and molecular layer (Fig. 2A1, B1). The pregnancy exposome was predominantly associated with child DNA methylation (70% of the associations observed) (Fig. 2A2); in contrast, the childhood exposome was associated with all molecular layers, with the serum metabolome showing the highest number of associations (43% of the associations observed) (Fig. 2B2). Pregnancy exposures with the most associations included molybdenum (Mo), cadmium (Cd), cotinine (biomarker of tobacco exposure) and maternal smoking (questionnaire data) (Fig. 2A3). Childhood exposures with the most associations included copper (Cu), organochlorine compounds (PCB 118), perfluoroalkyl substances (PFOS), caesium (Cs) and humidity (Fig. 2B3). Other exposures such as outdoor air pollution, built environment, road traffic, and noise, showed few associations. Among 83 exposures measured in both the pregnancy and childhood periods, 14 exposure-omics pairs were statistically significant in the two periods: 6 CpGs related to tobacco smoking, and several long chain fatty acids related to cotinine, hexachlorobenzene (HCB), perfluoroundecanoate (PFUnDA) and Hg (Supplementary Data 3).

**Fig. 2: Results of the Exposome-omics-Wide Association Study (ExWAS) for the pregnancy and childhood exposomes.**

Robustness of results with respect to ancestry, child zBMI and cohort

For the 1170 significant exposure-omics associations, we conducted several sensitivity analyses. First, HELIX consists of 1171 European ancestry children and the rest from other ancestries, with Pakistani ancestry the second most common (102 children). We repeated the ExWAS in children only of European ancestry, and did not note substantial differences in effect size (i.e. more than doubling) between the two models (Fig. 3A).

**Fig. 3: Robustness of main exposome-omics associations.**

Second, due to the potential influence of child adiposity both on the blood levels of lipophilic pollutants and on some molecular features, we compared the associations with and without adjustment for child zBMI, as a proxy of child adiposity. We observed that 12 associations had more than a doubling in the effect size (Fig. 3B). They included lipophilic chemicals (PCB 170, PCB 153 and PCB 180) and proteins known to be produced by the adipose tissue (IL1beta, leptin and IL6).

Third, we investigated heterogeneity across cohorts by running the 1170 exposure-omics associations by cohort. Around half of all associations presented heterogeneity values (I²) < 0.5, with variations by period and molecular layer (Supplementary Information, Fig. S1). In addition to the I² statistic, which might be overestimated in meta-analyses with a small number of studies²⁷, we also visually inspected the forest plots. While some associations seemed to be very consistent between cohorts even with a high I² (e.g. maternal cadmium and methylation at CpG cg19089201), for others there was more heterogeneity with some cohorts acting as outliers (e.g. child meteorological conditions and serotonin) (Fig. 3C). The forest plots for the 1170 exposure-omics associations are provided in Supplementary Data 11.
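
For reference, the I² statistic can be computed from cohort-specific estimates and standard errors as sketched below. This uses the standard fixed-effect definition based on Cochran's Q, expressed as a fraction between 0 and 1 to match the 0.5 cut-off quoted above. Whether HELIX used exactly this formulation is an assumption, and the cohort estimates are invented for illustration.

```python
def i_squared(estimates, standard_errors):
    """Heterogeneity I^2 = max(0, (Q - df) / Q), with Q being Cochran's Q."""
    # Inverse-variance weights for each cohort.
    weights = [1.0 / se ** 2 for se in standard_errors]
    pooled = sum(w * b for w, b in zip(weights, estimates)) / sum(weights)
    # Cochran's Q: weighted squared deviations from the pooled estimate.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, estimates))
    df = len(estimates) - 1
    if q <= 0:
        return 0.0
    return max(0.0, (q - df) / q)


# Hypothetical effect estimates from six cohorts with equal standard errors.
consistent = i_squared([0.1] * 6, [0.05] * 6)
heterogeneous = i_squared([0.0, 0.2, 0.0, 0.2, 0.0, 0.2], [0.05] * 6)
```

With only six cohorts the degrees of freedom are small, so Q and therefore I² are estimated imprecisely, which is consistent with the authors' decision to inspect forest plots as well.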

Finally, given the correlated nature of the exposome, we ran multi-exposure models for those omics features associated with more than one exposure, when these exposures had a correlation <0.8 and belonged to different exposure families (except individual exposures that belonged to diet, metals or parabens that we considered as separate groups). Results of these analyses are shown in Supplementary Data 4A, B. For prenatal exposures, the strongest effect change was observed for maternal cadmium (Cd) levels, whose association with the molecular trait was reduced by >25% after adjusting for smoking-related variables. For childhood exposures, the strongest effect changes were observed for Hg, As, Se, PFAS and dietary patterns (e.g. fish and KIDMED score), for indoor PM and parental smoking, and for BPA and meteorological variables. They are discussed below in more detail.
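
The effect change between a single-exposure and a multi-exposure model can be expressed as a relative change in the coefficient. The sketch below assumes this simple definition, which the text does not spell out; the coefficients are hypothetical.

```python
def percent_change(beta_single, beta_multi):
    """Relative change (%) of a coefficient after adding co-exposures."""
    return 100.0 * (beta_multi - beta_single) / beta_single


def is_attenuated(beta_single, beta_multi, threshold_pct=25.0):
    """True if the coefficient shrinks towards zero by more than threshold_pct."""
    return percent_change(beta_single, beta_multi) < -threshold_pct


# Hypothetical coefficients for one exposure before and after adjustment.
change = percent_change(0.40, 0.28)
flagged = is_attenuated(0.40, 0.28)
```

Here the coefficient falls from 0.40 to 0.28, a 30% reduction, which would exceed the 25% mark mentioned above.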

Network integration of multi-omics signatures of the exposome

To visualize whether a molecular feature was connected to several exposures, and vice versa, we built period-specific multi-omics exposome networks, based on the 1170 statistically significant exposome-omics associations. The nodes of these networks are the 538 unique molecular features or exposures involved in these associations, and the edges are the 1170 exposure-omics associations.

The pregnancy exposome network, mostly composed of CpGs (70%), was very disconnected, having on average 1.3 connexions per node (i.e. degree) and an average shortest path length of 1.9 (Fig. 4 and Supplementary Data 5A). This number represents the average length (number of nodes) of the shortest path between each node and any other node, 1.9 being a low value. This lack of connectivity can be explained by the wide spacing along the genome of the CpG sites assessed with the 450K array and their relatively low correlation. The pregnancy exposome network contained 3 main connected components (referred to as clusters, and labelled “preg#…”), the largest of which contained less than 30% of all nodes. These 3 clusters varied greatly in their size, their number of exposures and the type of omics data comprising them (Table 1).
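
The network summaries quoted above (average degree, connected components, average shortest path length) can be reproduced on a toy graph as follows. This is a sketch under stated assumptions: path length is counted in edges and averaged over connected node pairs only, whereas the text counts nodes and does not say how disconnected pairs were handled. All node names are hypothetical.

```python
from collections import deque


def build_graph(edges):
    """Undirected adjacency sets from (exposure, feature) association pairs."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def average_degree(graph):
    return sum(len(nbrs) for nbrs in graph.values()) / len(graph)


def bfs_distances(graph, start):
    """Edge-count distance from start to every reachable node."""
    dist = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nbr in graph[node]:
            if nbr not in dist:
                dist[nbr] = dist[node] + 1
                queue.append(nbr)
    return dist


def connected_components(graph):
    seen, components = set(), []
    for node in graph:
        if node not in seen:
            component = set(bfs_distances(graph, node))
            seen |= component
            components.append(component)
    return components


def average_shortest_path_length(graph):
    """Mean distance over all ordered pairs of distinct, connected nodes."""
    total, pairs = 0, 0
    for node in graph:
        for other, d in bfs_distances(graph, node).items():
            if other != node:
                total += d
                pairs += 1
    return total / pairs


# Hypothetical significant associations (exposure, molecular feature).
edges = [
    ("cotinine", "cpg_a"),
    ("cotinine", "cpg_b"),
    ("smoking", "cpg_a"),
    ("Mo", "cpg_c"),
]
graph = build_graph(edges)
degree = average_degree(graph)
components = connected_components(graph)
path_length = average_shortest_path_length(graph)
```

In this toy graph the two tobacco-related exposures share a CpG and therefore fall in one component, while the Mo association forms its own, mirroring how shared molecular features link exposures into clusters.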

**Fig. 4: Network map of the multi-omics signatures of the pregnancy exposome.**

Table 1 Pregnancy and childhood exposome clusters based on associations with multi-omics profiles measured in childhood (N = 1301)

The childhood exposome network was more densely connected, with an average of 1.9 connexions per node and an average shortest path length of 4.3. The biggest connected component contained 90% of all nodes (Fig. 5). This connectivity highlights the correlated nature of the serum and urine metabolome, which represented the majority of the exposure-omics associations of the network (43 and 26% respectively). Within the biggest connected component, we identified 11 interconnected subcomponents (i.e. clusters, named as “childhood#…”) using an unsupervised structural clustering method (Table 1 and Supplementary Data 5B)^28,29.

**Fig. 5: Network map of the multi-omics signatures of the childhood exposome.**

Next, we aimed to evaluate the biological interpretation of the exposure-omics associations included in the 3 pregnancy and 11 childhood clusters. First, we did a systematic search of overlap with the literature on DNA methylation associations with exposures and traits (EWAS Atlas/Catalogue^16,30, Fig. 6A–C and Supplementary Data 6) and on metabolite associations with dietary patterns and pollutants (ExposomeExplorer³¹, Fig. 7). Second, we conducted functional enrichment analyses using several public databases (Fig. 6B–D and Supplementary Data 7). Methodological details can be found in Supplementary Information. Below, we describe the main findings for groups of exposures.

**Fig. 6: Biological interpretation of the exposome-omics associations through literature overlap and functional enrichment.**

**Fig. 7: Metabolite signatures of the childhood exposome and dietary sources.**

Maternal smoking shows robust and long-lasting effects in the child methylome and novel signatures for prenatal cadmium and indoor air pollution are detected

Methylation signatures for maternal smoking at different ages have been well documented³². In HELIX, maternal smoking during pregnancy, assessed using questionnaires and urinary maternal cotinine levels, was associated with 24 unique CpGs (cluster preg#1), representing 9 unique loci (2 Mb) annotated to 8 genes, that largely overlap with smoking-sensitive CpGs described in the EWAS Atlas/Catalogue (Fig. 6A–C and Supplementary Data 6). Child exposure to second-hand smoke also overlapped with existing literature, but to a lesser extent than maternal smoking (cluster childhood#7). Period-specific smoking effects in HELIX have been investigated elsewhere²⁵. Functional enrichment analysis identified the following pathways: axon development, cognition, cholinergic synapse, insulin signalling, and several types of cancer (Fig. 6B–D and Supplementary Data 7, highlighted in yellow).

Prenatal cadmium (Cd), a heavy metal, was associated with child blood methylation, and mapped with maternal smoking in cluster preg#1. The multi-exposure analyses suggested some overlap between these signals (Supplementary Data 4A). This could be partially explained by the fact that Cd is a component of tobacco³³ and in our dataset mothers who smoked showed almost twice the level of Cd compared to non-smokers. However, we identified 14 additional CpGs that were unique to Cd (Fig. 2A, B and Supplementary Data 8A). When restricting our analysis of maternal Cd to non-smoker mothers (N = 998), 51 CpGs (48 loci) were identified (Supplementary Information—Fig. S2C, D and Supplementary Data 8B). These did not overlap with known smoking effects, nor with CpGs associated with urinary Cd in adult blood or with placental Cd in placental tissue^34,35.

We further found several associations for air quality during childhood, which did not overlap between outdoor and indoor pollutants. Among the most interesting, home indoor air pollution exposure to benzene was associated with 9 CpGs (cluster childhood#9), one of them related to PM₂.₅ levels in previous studies (Fig. 6C and Supplementary Data 6). Moreover, home indoor levels of PM₂.₅ absorbance, a marker of black/elemental carbon originating from combustion, were associated with methylation of 9 CpGs, including two in common with tobacco exposure (Fig. 6C and Supplementary Data 6), and with decreased levels of serum branched-chain amino acids (BCAA), C4 acylcarnitine and two sphingolipids (cluster childhood#7). Some of these associations were attenuated after adjusting for parental smoking (Supplementary Data 4B).

The serum and urinary metabolome reveal principal dietary routes of exposure to chemical pollutants

Cluster childhood#3 contained fish intake (information collected through questionnaire), toxic metals (mercury (Hg) and arsenic (As)), the per- and polyfluoroalkyl substances (PFAS), and non-toxic essential elements (selenium (Se) and caesium (Cs)), together with serum lipids containing polyunsaturated fatty acids (PUFA) and urinary trimethylamine N-oxide (TMAO), dimethylamine and homarine (Fig. 7A). Using systematic metabolite-diet associations found in previous population studies archived in ExposomeExplorer³⁶, we confirmed the dietary origin of these exposure-metabolite associations, in this case to fish intake and animal products (Fig. 7C). In addition, multi-exposure models confirmed that most of these associations, in particular those with Hg, As and PFAS, were attenuated after adjusting for diet and other co-exposures. This was not true for the association between TMAO and As, which remained one of the strongest associations even after adjusting for PCB 180, Hg, fish intake and PFUnDA (Supplementary Data 4B).

Similarly, cluster childhood#6 contained 21 out of the 44 urinary metabolites measured, including hippurate, proline betaine and N-methylnicotinic acid which are known biomarkers of fruit and vegetable intake^26,37 (Fig. 7B, C). The cluster also included organophosphate (OP) pesticides measured in urine which suggested a potential route of exposure through dietary intake of fruits and vegetables.

Also in cluster childhood#6, we found the DiNP metabolites, phthalate family members primarily used to produce flexible plastics such as food packaging. In contrast, DEHP metabolites (MEOHP, MEHHP, MECPP, MEHP), also phthalates found in plastics, mapped in cluster childhood#5 and were associated with 13 CpGs, with no clear overlap with reported traits/exposures (Fig. 6C and Supplementary Data 6). MEOHP and MECPP were also negatively associated with a number of serum sphingomyelins (SM C16:0, SM C18:0, SM C18:1, SM C20:2, SM (OH) C14:1 and SM (OH) C16:1). Pregnancy exposure to DEHP metabolites and parabens, synthetic compounds present in personal care products, also showed negative associations with sphingomyelins (SM (OH) C16:1) and valine in children.

Essential trace elements are key components of the exposome

Essential trace elements are required by living organisms to ensure normal development and maintenance of biological functions, but can also be toxic when present in excess. We measured 9 essential elements in whole blood (Co, Cu, Mn, Mo, Na, K, Mg, Zn, Se), and found a remarkable number of exposure-omics associations, mostly with maternal molybdenum (Mo), and child copper (Cu) (Supplementary Data 2).

Maternal Mo was related to the methylation levels of 72 CpGs, representing 63 loci (cluster preg#2). No relevant gene-sets were identified for genes annotated to these 72 CpGs, but 13 of them have previously been related to gestational age (Fig. 6A and Supplementary Data 6). Mo acts as a co-factor of 4 human enzymes which are involved in various key reactions, including the regulation of sulfur-containing amino acids such as methionine³⁸. In our dataset, maternal Mo was associated with higher methionine levels in childhood (Supplementary Data 2).

Child Cu was associated with 89 molecular features, distributed across different omics layers (cluster childhood#2). One of the associations with the lowest p value was with increased levels of the C-reactive protein (CRP), a marker of inflammation. Moreover, Cu-associated CpGs have previously been linked to obesity, type 2 diabetes and rheumatoid arthritis, a chronic inflammatory disorder, among others (Fig. 6C and Supplementary Data 6). Enriched pathways for Cu included: immune response, lipid storage and sequestering of metal ions (Fig. 6D and Supplementary Data 7, highlighted in green). Adjusting for co-exposures (e.g. Pb) did not change substantially these associations (Supplementary Data 4B).

Furthermore, during childhood, other essential trace elements were associated with multiple molecular features with no overlap among them (Supplementary Data 2), as expected due to their intrinsic essential roles. For instance, zinc (Zn) was related to higher transcription of CA1 (Carbonic anhydrase 1), whose expression is known to be influenced by Zn²⁺ availability and which uses Zn²⁺ as a cofactor for its enzymatic activity³⁹.

Weather conditions are associated with signatures in all omics layers

Weather conditions or meteorological factors (temperature, humidity, cloud coverage and atmospheric pressure), in particular when extreme, are strong determinants of health and mortality⁴⁰. However, there are no studies systematically assessing their influence on molecular phenotypes. We estimated weather conditions through geographical information coupled with data from meteorological stations (Supplementary Information). In childhood, weather conditions over the month before the omics measurement were associated with all molecular layers, except for the urinary metabolome (cluster childhood#4). Serum metabolites associated with meteorological variables included taurine, asymmetric dimethylarginine (ADMA), acylcarnitine C5, and serotonin, which have been previously reported as biomarkers of sleep deprivation and circadian rhythm and implicated in the aetiology of depression^41,42,43 (Supplementary Data 2). Weather conditions were also associated with three proteins: adiponectin, MCP1 and HGF. Adiponectin, an essential regulator of thermogenesis^44,45, increased with humidity (higher in winter in Europe) and decreased with ultraviolet radiation (higher in summer) (Supplementary Data 2). This is in line with previous studies showing that exposure to cold temperatures for 2 h increases adiponectin plasma levels⁴⁶. The magnitude of some of these associations (acylcarnitine C5, adiponectin, serotonin) was attenuated by more than 50% after adjusting for exposure to bisphenol A (BPA), which was previously found to reduce adiponectin release⁴⁷ (Supplementary Data 4B). Finally, the CpGs associated with weather conditions overlapped with CpGs reported for infections, among others (Fig. 6C and Supplementary Data 6); and genes related to temperature were enriched for cellular response to type I interferon (Fig. 6D and Supplementary Data 7, highlighted in blue). Infectious diseases follow seasonal patterns and are more prevalent under particular meteorological conditions, as recently shown in the different COVID-19 pandemic waves^48,49.

Persistent organic pollutants (POPs) and multi-omics alterations in children

We found that POPs in children, especially dioxin-like PCB 118 (69 associations), HCB (28) and PCB 138 (14), were associated with DNA methylation, serum metabolites and plasma proteins (IL1B and leptin) grouped in cluster childhood#1 (Fig. 5). CpGs in this cluster have previously been reported to be related to the inflammatory disease rheumatoid arthritis (Supplementary Data 6), and IL1B and leptin are produced by the fat tissue, as noted above. We also observed a unique positive association between PCB 180 and urinary TMAO, without any associations with the other fish-related metabolites described above.

Replication of exposure-omics associations across molecular layers and biological matrices

We investigated whether childhood exposome associations with DNA methylation, gene and miRNA expression, all assessed in blood cells, pointed to the same genes. For each CpG, we identified cis expression quantitative trait methylations (eQTMs), meaning correlations between gene expression and DNA methylation (Supplementary Information). Out of the 187 CpGs associated with the childhood exposome, 9 had eQTMs in a total of 11 genes (Supplementary Data 9A). However, none of these eQTMs was nominally associated with the same exposures as the CpG site. We also searched for targeted genes of the 49 miRNAs associated with the childhood exposome using the miRwalk v3 tool⁵⁰ (Supplementary Data 9B). Seventeen out of the 1267 targeted genes were associated with the same exposure as the original miRNA and in the expected direction (higher miRNA levels − lower gene expression). They encompassed 7 unique exposures (Cd, Cu, K, PFOA, blue spaces and meteorological factors) and 9 unique miRNAs (Supplementary Data 9C).

We also compared the overlap of childhood exposure associations for 12 metabolites (amino acids, glucose, carnitine and creatinine) that were measured in both urine and serum, and whose correlation can be found in Supplementary Data 10A. At nominal significance, 27.3% of the urine associations replicated in serum; and 7% of the serum associations replicated in urine (Supplementary Data 10B, C). Not surprisingly, replicated associations involved metabolites with the highest correlation between matrices (carnitine, glycine and creatinine) (Supplementary Data 10A).
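
The asymmetry between the two replication percentages follows from using a different denominator in each direction, as the sketch below shows. It assumes that "replicated" means nominally significant (p < 0.05) in both matrices for the same exposure-metabolite pair; the threshold, the lack of a direction-of-effect check, and all p values are assumptions for illustration.

```python
def replication_rate(discovery, validation, alpha=0.05):
    """Share of nominally significant discovery pairs also significant in validation."""
    hits = [pair for pair, p in discovery.items() if p < alpha]
    if not hits:
        return 0.0
    replicated = [pair for pair in hits if validation.get(pair, 1.0) < alpha]
    return len(replicated) / len(hits)


# Hypothetical p values for the same (exposure, metabolite) pairs in each matrix.
urine = {
    ("Cu", "glycine"): 0.01,
    ("Hg", "carnitine"): 0.03,
    ("Pb", "creatinine"): 0.20,
    ("As", "alanine"): 0.04,
}
serum = {
    ("Cu", "glycine"): 0.02,
    ("Hg", "carnitine"): 0.50,
    ("Pb", "creatinine"): 0.01,
    ("As", "alanine"): 0.30,
}
urine_in_serum = replication_rate(urine, serum)
serum_in_urine = replication_rate(serum, urine)
```

In this toy example one of three urine associations replicates in serum, while one of two serum associations replicates in urine, so the two rates differ even though the replicated pair is the same.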

Discussion

This is the first exposome study to systematically associate a wide range of environmental exposures during vulnerable early life periods with multi-omics signatures in childhood. We observed 1170 unique associations between exposures and molecular features, 249 relating to pregnancy and 921 to childhood exposures. By partitioning these associations into network clusters for visualization and by conducting systematic biological interpretation, this study reveals potential biological responses and sources of exposure. Our findings confirm persistent methylation changes associated with maternal tobacco smoking in pregnancy⁵¹ and principal sources of exposure to chemical pollutants through diet, based on food-related biomarkers. Furthermore, we identify novel associations notably with essential trace elements, weather conditions, indoor air quality, persistent pollutants, phthalates and parabens. Our comprehensive resource of all associations (https://helixomics.isglobal.org/) is the first of its kind and will serve to guide future investigation on the biological imprints of the early life exposome.

Our web catalogue has several applications: creating biomarkers of exposure, identifying sources of exposures and understanding biological mechanisms. Data generated in this study provide a resource for the development of epigenetic biomarkers of past exposures⁵². For instance, it was generally believed that the essential element molybdenum (Mo) is safe for human health⁵³; however, there is growing evidence that excess of Mo is associated with developmental effects and with adverse health outcomes^20,54,55,56,57,58. In this study, maternal levels of Mo were associated with methylation changes in a remarkable number of CpGs, which were persistent at least until childhood (when we detected them). The methylation in these CpGs could be used to predict prenatal exposure levels.

Our study also demonstrates the ability of metabolomics to accurately reflect dietary sources and potential gut microbial effects of exposures. The strongest, most significant associations among all exposome-omics pairs tested were found for As and Hg with trimethylamine-N-oxide (TMAO) and glycerophospholipids. Most of these associations, except the TMAO–As association, were attenuated after adjusting for fish intake and other fish-related compounds. Indeed, TMAO was previously shown to discriminate high from low fish intake, and homarine (a metabolite found in shellfish muscle) to discriminate high from no shellfish intake, in populations with high seafood intake such as those of Spain and Japan^59,60. The fact that the TMAO–As association remained the strongest association after adjusting for fish-related exposures also suggests an independent role of the gut microbiome. This finding corroborates our previous study in pregnant women from the Spanish INMA cohort⁵⁹. Other evidence indicates that the gut microbiome may alter arsenic metabolism and neurodevelopmental susceptibility to this exposure^61,62. Importantly, we illustrate in this study that many anthropogenic chemicals are delivered to the body through diet (in this case fruit and fish intake), and that their biological effects may be altered by the gut microbiome. This adds to the complexity of metabolomic profiles in human biospecimens and creates an extensive network of nutrient–pollutant interactions that remains vastly unknown and poorly defined by conventional assessment methods⁶³.

Among the novel molecular signatures identified, six groups of exposures highlighted plausible biological mechanisms to disease. First, Cu is an essential trace element required for numerous cellular processes, including mitochondrial respiration, antioxidant defence, neurotransmitter synthesis, and iron metabolism, among others⁶⁴. In previous HELIX studies, Cu has been related to several health outcomes such as poorer lung function⁶⁵, higher BMI^66,67 and blood pressure⁶⁸, and increased ADHD symptomatology⁶⁹, and here we show potential perturbed pathways that may mediate these associations: immune response, lipid storage and sequestering of metal ions. Second, pathways identified for tobacco smoke (axon development, cognition, cholinergic synapse, insulin signalling, and several types of cancer) were similarly in line with the effects of maternal smoking on health outcomes detected in HELIX children (higher blood pressure⁶⁸ and BMI⁶⁶, and increased behavioural problems⁶⁹). We acknowledge that, as DNA methylation was measured in blood, the identification of pathways relevant for other tissues (i.e. brain and axon development) has to be analysed with caution. It could be that DNA methylation marks are maintained across tissues if exposure happens early in development, or that the same genes are involved in different pathways in different tissues. Third, indoor air quality during childhood was associated with metabolic markers (BCAA and acylcarnitines). The HELIX study was the first to find an association between indoor air pollution and child obesity⁶⁶. Dysregulated metabolism of BCAAs and acylcarnitines has been associated with obesity and insulin resistance in numerous studies⁷⁰ and was detected in young obese participants exposed to near-roadway air pollution⁷¹. Altered BCAA and acylcarnitine metabolism may be an important biomarker to study further in relation to air pollution and cardio-metabolic disease risk in later life. 
Fourth, POPs have consistently been associated with adverse health outcomes^72,73. Besides associations likely linked to fat distribution in children, we also observed a positive association between PCB 180 and TMAO, a product of gut microbiota and hepatic flavin-containing monooxygenase 3 (FMO3) enzyme activity. This association was previously reported in animals and humans and appeared independent of potential common dietary sources of PCBs and TMAO, and of BMI⁷⁴. Currently, TMAO is proposed as a causative agent of cardio-vascular disease⁷⁵ but further investigations on the mechanistic link between PCBs, FMO3 activity/expression and cardio-vascular outcomes are needed. Fifth, we found associations with high molecular weight phthalates and parabens, which are synthetic compounds rapidly metabolized in the body and suspected of being endocrine disruptors⁷⁶ and of affecting health in a sex-specific manner²⁴. Exposure to phthalates occurs mostly through diet and dust ingestion, and to a lesser extent through inhalation⁷⁷. Metabolic signatures of phthalates and parabens were not clearly related to dietary patterns but to an endogenous metabolic pathway, the sphingomyelins, which are important structural lipid components of cell membranes involved in signalling and implicated in many disorders^78,79. Intermediates of sphingosine biosynthesis and valine have been reported to be upregulated in pregnant women exposed to phthalates⁸⁰ and parabens⁸¹. Sixth, our results also provide insights into potential mechanisms of action for weather conditions: they appear to have direct effects (e.g. regulating thermogenesis) and indirect effects (e.g. determining other exposures such as virus survival), or they can also represent proxies of other variables (e.g. hours of daylight or dietary changes due to seasonal variation). The investigation of meteorological conditions in larger longitudinal omics datasets covering seasonal patterns will be needed to elucidate the final causal mechanisms.

Our study indicates that the choice of molecular layer and biological matrix is key in the design of exposome studies. Most of the associations we found for the pregnancy exposome involved the methylome (70% of the associations observed). This is in line with previous publications suggesting that the epigenome acts as the main source of cellular ‘memory’ and plasticity^82,83, although it may partially reflect the nature of our study design and omics coverage (i.e. number of markers analysed in each omics layer and their intra-omics correlations). In contrast, recent exposures during childhood were associated with features across all omics layers. Evidence to date suggests that the metabolome in particular is strongly influenced by the immediate environment, and may thus be more sensitive for detecting associations in cross-sectional settings¹⁷. Nevertheless, many cross-sectional associations with the methylome were found and, although fewer, long-term associations with other omics were also found. Moreover, the low correspondence of the methylome and miRNAome with the transcriptome highlights the high complexity of transcriptional regulation and suggests that each molecular layer might capture a window of the effects of the exposome. Our findings also indicate the importance of the biological matrix. Although we could not make a comprehensive comparison of the urinary and serum metabolomes because different platforms were used to assess them, among comparable metabolites only a few showed consistent associations with the exposome in both biological matrices. Thus, both of these matrices, and others, should ideally be explored in exposome studies, as they provide complementary information. Finally, we observed little overlap in associations for the pregnancy and childhood exposome, likely due to the low inter-period correlation of exposures, the differences in the exposure route or dose between periods, and the dynamics of the molecular response (i.e. our study is able to capture long-term responses of the pregnancy exposome but only short-term responses of the childhood exposome). This highlights the importance of the windows of exposure and the choice of life course framework for exposome studies.

Our study has multiple strengths. First, it provides a comprehensive assessment of environmental exposures in two critical developmental time periods, including highly sensitive biomarkers for many chemical exposures and wide-ranging geospatial modelling of the outdoor and built environment. Second, it includes an extensive multi-omics assessment of molecular phenotypes. Third, it has wide geographic coverage and a relatively large sample size for which we were able to measure many exposures and omics features. Finally, we conducted several sensitivity analyses, which confirmed that findings were robust to ancestry and zBMI, with the exception of some lipophilic exposure compounds and particular molecular features.

Our study also has some limitations. First, omics platforms have a coverage bias and biological interpretability issues. For instance, the LC-MS/MS (Biocrates) method has a low coverage and does not give specific fatty acid side-chain composition for lipids, but it is widely used in large cohort studies and provides reproducible measurements with unambiguous annotation, easily comparable to other studies^{84,85,86,87,88}. We note that there are additional molecular layers and omics technologies of interest for future exposome studies, which were not included in our study, such as the gut metagenome, sensitive high-resolution mass spectrometry or single cell methods^89,90,91. Moreover, the effect of genetic variation, alone or in combination with the exposome, was not considered in this study. Second, different exposures are measured with different types and levels of measurement error. For example, urine levels of non-persistent chemicals have a high intra-individual variability and are expected to suffer particularly from classical-type measurement error resulting in an attenuation bias⁹². Repeated sampling strategies and longitudinal designs might help to disentangle the persistent metabolic effects of endocrine disruptors suggested in our study. Exposures measured by models and questionnaires are expected to suffer from other types of measurement errors with less predictable effects⁹³. Moreover, the correlated nature of the exposome makes identification of driving exposures difficult. Here we tried to separate the effects with mutually adjusted or stratified models. For example, by running stratified models in non-smoking mothers, we identified Cd-specific effects. Besides tobacco smoke, Cd might have other origins such as rice, potatoes and wheat, when frequently consumed in large quantities⁹⁴. Mixture or multi-pollutant approaches aim to tackle this more systematically; however, these are not yet suitable for high-dimensional omics datasets such as ours^95,96.
Third, our comparison with previous literature and functional enrichment analyses are limited by existing bias in public databases. Fourth, although the majority of epidemiological studies utilize biological samples that are most readily accessible for the measurement of omics profiles, these may not be the ideal target tissue for the relevant health outcomes. Fifth, some associations presented high heterogeneity across cohorts (e.g. humidity and serotonin). This can be explained by the different exposure levels, the different correlation with confounders, or the relatively small sample size within each cohort. Finally, although our models were adjusted for confounders, residual confounding might still be present and causal links would need to be proven through interventions, Mendelian randomization analyses, cross-contextual studies, or in vivo/in vitro models.

To conclude, this first comprehensive study of the multi-omics signatures of the early life exposome demonstrates that molecular phenotypes can reveal biological responses to or sources of environmental exposures at an early time point in life. Besides the main findings described here, the entire result catalogue is publicly available (https://helixomics.isglobal.org/), enabling exploration of the complete list of exposome-omics relationships. With the rich exposome and molecular information available, we provide a valuable resource to the scientific community for the development and validation of exposure and response biomarkers, to identify dietary sources of exposures, to improve our understanding of disease aetiology, and finally to promote public health policies.

Methods

Local ethical committees approved the studies, which were conducted according to the guidelines laid down in the Declaration of Helsinki. The ethical committees for each cohort were the following: BIB: Bradford Teaching Hospitals NHS Foundation Trust, EDEN: Agence nationale de sécurité du médicament et des produits de santé, INMA: Comité Ético de Investigación Clínica Parc de Salut MAR, KANC: LIETUVOS BIOETIKOS KOMITETAS, MoBa: Regional komité for medisinsk og helsefaglig forskningsetikk, Rhea: Ethical committee of the general university hospital of Heraklion, Crete. Informed consent was obtained from a parent and/or legal guardian of all participants in the study. Participants did not receive any compensation.

Population

Mother–child pairs (N = 1301) from 6 established and ongoing longitudinal population-based birth cohort studies in Europe were included in the HELIX subcohort study: the Born in Bradford (BiB) study in the UK⁹⁷, the Étude des Déterminants pré et postnatals du développement et de la santé de l’Enfant (EDEN) study in France⁹⁸, the INfancia y Medio Ambiente (INMA) cohort in Spain⁹⁹, the Kaunas cohort (KANC) in Lithuania¹⁰⁰, the Norwegian Mother, Father and Child Cohort Study (MoBa)¹⁰¹, and the RHEA Mother Child Cohort study in Crete, Greece¹⁰² (Supplementary Information and Supplementary Data 1A). A follow-up examination of the children between ages 6 and 11 years was carried out with fully standardized protocols across the six cohorts, in order to assess child health outcomes, to fully characterize the pregnancy and childhood exposome, and to measure several molecular phenotypes²¹. During the clinical examination, urine (pooled spot urine samples from before bedtime and first morning void) and blood samples were collected from the children. Urine and blood samples previously collected from mothers during pregnancy were also available for the assessment of biomarkers of chemical exposure.

Exposome measures in pregnancy and childhood

Two main windows of exposure were considered: a prenatal window including the pregnancy period or measures of long-term maternal exposures (e.g. persistent pollutants), and a cross-sectional window including the exposome data of children at the time of omics sampling (childhood). A total of 91 pregnancy and 116 childhood exposures were investigated in the study, including the outdoor exposome (air pollution, built environment, noise, green and blue space, and meteorological data), the chemical exposome (cotinine, metals, POPs, PFAS, phthalates, phenols, and organophosphates), and social and lifestyle factors (exposure to tobacco smoking, diet and physical activity). Details on the exposure assessment methods and exposure factors are given in Supplementary Information. Exposures were either continuous variables or categorical variables with two or more levels. Continuous exposure variables were transformed to achieve linearity or categorized, when needed. Missing data were imputed using a chained equations method¹⁰³ implemented in the mice v3.4.0 R package¹⁰⁴. One imputed dataset was used in this study. Further details on exposure levels can be found elsewhere^22,23,24. Correlations between exposures were estimated as follows: Pearson’s correlation for continuous vs continuous variables; R² of a linear model for continuous vs categorical variables; and Cramér’s V for categorical vs categorical variables. More information can be found in Supplementary Information and Supplementary Data 1C–E.
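The three correlation measures named above can be illustrated with a minimal Python sketch. This is not the study's code (the analyses were run in R); the function names are hypothetical, the R² is computed as the between-group share of variance in a one-way linear model, and Cramér's V is computed without any bias correction, which is an assumption.

```python
import math
from collections import Counter, defaultdict


def pearson_r(x, y):
    """Pearson correlation between two continuous variables."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def r_squared_by_group(values, groups):
    """R^2 of the linear model values ~ groups (between-group SS / total SS)."""
    grand = sum(values) / len(values)
    by_group = defaultdict(list)
    for v, g in zip(values, groups):
        by_group[g].append(v)
    ss_total = sum((v - grand) ** 2 for v in values)
    ss_between = sum(
        len(vs) * (sum(vs) / len(vs) - grand) ** 2 for vs in by_group.values()
    )
    return ss_between / ss_total


def cramers_v(a, b):
    """Cramér's V from the chi-square statistic of the a x b contingency table."""
    n = len(a)
    joint = Counter(zip(a, b))
    row, col = Counter(a), Counter(b)
    chi2 = 0.0
    for r in row:
        for c in col:
            expected = row[r] * col[c] / n
            chi2 += (joint.get((r, c), 0) - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    return math.sqrt(chi2 / (n * k))
```

All three measures lie between 0 and 1 in absolute value, which is what makes them usable side by side in a single exposure correlation matrix.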

Child molecular phenotypes

We used both targeted and untargeted methods to assess child molecular phenotypes. Blood DNA methylation was assessed with the Illumina 450 K array; blood gene expression, with the Affymetrix HTA v2.0 array; blood miRNA expression, with the Agilent SurePrint Human miRNA rel 21 array; plasma proteins, with 3 Luminex multiplex assays; serum metabolites, with the targeted LC-MS/MS metabolomic assay Biocrates AbsoluteIDQ p180 kit; and urinary metabolites, with ¹H nuclear magnetic resonance (NMR) spectroscopy. An extended version of the omics protocols and lists of biomarkers assessed in the targeted assays is available in Supplementary Information and Supplementary Data 1E–H.

Statistical analysis (ExWAS)

We fitted linear regressions between each exposure variable and each molecular feature, adjusting for covariates, using the limma v3.46.0 R package¹⁰⁵ as implemented in omicRexposome v1.12.1¹⁰⁶. The main covariates for all omics were: cohort, child’s sex, child’s age, child’s sex- and age-standardized BMI z-score (zBMI) calculated according to WHO reference curves^107,108, child’s ethnicity defined in three categories (European ancestry; Pakistani or Asian; and other), and self-reported maternal education categorized as low, medium or high. In addition, plasma protein and serum metabolite models were adjusted for time to last meal and hour of blood collection, and urinary metabolite models for sample type (bedtime, morning or pool), and technical batch. Blood methylation and transcriptomics data were corrected by surrogate variables (SVs), which captured both batch effects and blood cell type composition.
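The structure of an ExWAS, one covariate-adjusted linear model per exposure and molecular feature pair, can be sketched as follows. This is an illustrative plain least-squares version in Python, not the limma/omicRexposome implementation used in the study (limma additionally moderates the variance estimates across features). The function names and the input layout are assumptions, and categorical covariates are assumed to be already dummy-coded as numbers.

```python
def ols(X, y):
    """Least-squares coefficients for y ~ X via the normal equations.

    X is a list of rows; each row must already start with 1.0 (intercept).
    """
    p = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [
        [sum(row[i] * row[j] for row in X) for j in range(p)]
        + [sum(row[i] * yi for row, yi in zip(X, y))]
        for i in range(p)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(p):
        pivot = max(range(col, p), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(p):
            if r != col:
                factor = A[r][col] / A[col][col]
                A[r] = [a - factor * b for a, b in zip(A[r], A[col])]
    return [A[i][p] / A[i][i] for i in range(p)]


def exwas(exposures, features, covariates):
    """Fit one adjusted model per (exposure, feature) pair.

    exposures, features: dict mapping a name to a list of values per child.
    covariates: list of numeric covariate rows, one row per child.
    Returns a dict mapping (exposure, feature) to the exposure coefficient.
    """
    results = {}
    for e_name, e_vals in exposures.items():
        X = [[1.0, e] + list(c) for e, c in zip(e_vals, covariates)]
        for f_name, f_vals in features.items():
            results[(e_name, f_name)] = ols(X, f_vals)[1]
    return results
```

The returned coefficient is the change in the molecular feature per unit of exposure with the covariates held fixed; standard errors and p values are omitted to keep the sketch short.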

In all omics, except for methylation, the effect size is reported as a log2 fold change (log2FC) of the molecular phenotype levels between categories of discrete exposure variables or for interquartile range (IQR) of continuous exposure variables. For DNA methylation, the effect size is reported as a difference in methylation levels between categories of discrete exposure variables or for IQR of continuous exposure variables.
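As a worked illustration of this effect-size scale, the sketch below rescales a per-unit regression coefficient to an IQR contrast and converts a log2 fold change back to a ratio. It assumes the molecular feature is already on a log2 scale, and that quartiles follow the "inclusive" interpolation of Python's `statistics.quantiles`; the quantile definition used in the study is not stated here, so that choice is an assumption, as are the function names.

```python
import statistics


def iqr(values):
    """Interquartile range (third quartile minus first quartile)."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q3 - q1


def effect_per_iqr(beta_per_unit, exposure_values):
    """Rescale a per-unit coefficient to an interquartile-range increase."""
    return beta_per_unit * iqr(exposure_values)


def fold_change(log2fc):
    """Convert a log2 fold change into a ratio of feature levels."""
    return 2 ** log2fc
```

For example, a log2FC of 1 per IQR corresponds to a doubling of the feature level between a child at the first quartile and a child at the third quartile of the exposure.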

Multiple testing correction was applied for each exposure and within each omics layer. For methylation, gene expression and miRNAs we used the False Discovery Rate (FDR)–Benjamini–Hochberg (BH) method¹⁰⁹. For the other omics (proteins, urine and serum metabolites), we calculated the effective number of tests (ENT), which is based on the correlation structure of the data¹¹⁰, and used as the significance threshold the nominal p value (0.05) divided by that number. We also calculated a more stringent threshold correcting for all tests performed (across all molecular features from all omics platforms and the full exposome, including both periods), resulting in a p value cut-off of 1E−09. More details can be found in Supplementary Information and Supplementary Data 1I.
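The two correction schemes can be sketched in a few lines of Python. The Benjamini–Hochberg adjustment below is the standard step-up procedure; the ENT threshold simply divides the nominal level by an effective number of tests that is taken as given, because the estimation of ENT from the correlation structure is not reproduced here. Function names are hypothetical.

```python
def benjamini_hochberg(pvalues):
    """Return BH-adjusted p values in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def ent_threshold(effective_tests, alpha=0.05):
    """Per-test significance threshold given an effective number of tests."""
    return alpha / effective_tests
```

A feature would then be declared significant for a given exposure if its BH-adjusted p value is below the chosen FDR level, or, for proteins and metabolites, if its raw p value is below `ent_threshold(ENT)`.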

Sensitivity analyses

A set of sensitivity analyses was conducted. First, analyses were restricted to children of European ancestry (90%). Second, models were run again without adjustment for child zBMI. The difference in effect size between the main and alternative models was calculated as (effect size main model − effect size alternative model)/effect size main model × 100. Third, top hit associations were run by cohort and combined through fixed- and random-effects inverse variance weighted meta-analyses using the meta v4.16-1 R package¹¹¹, and forest plots were visually inspected. I² was used to evaluate heterogeneity in the results across cohorts. Fourth, we performed multi-exposure linear models by period for those omics features associated with more than one exposure, when these exposures had a correlation <0.8 and belonged to different exposure families (except individual exposures that belonged to diet, metals or parabens, which we considered as separate groups).
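The percentage change formula and the cohort-level pooling can be illustrated with the following Python sketch. It covers only the fixed-effect inverse variance weighted estimate and the I² statistic derived from Cochran's Q; the random-effects model also used in the study is omitted. The function names are hypothetical and this is not the meta package implementation.

```python
import math


def percent_change(main_effect, alt_effect):
    """Relative difference (%) between main and alternative model effects."""
    return (main_effect - alt_effect) / main_effect * 100


def fixed_effect_meta(estimates, std_errors):
    """Inverse variance weighted pooled estimate, its SE, and I^2 (%)."""
    weights = [1 / se ** 2 for se in std_errors]
    pooled = sum(w * b for w, b in zip(weights, estimates)) / sum(weights)
    pooled_se = math.sqrt(1 / sum(weights))
    # Cochran's Q measures dispersion of cohort estimates around the pooled one.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, estimates))
    df = len(estimates) - 1
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    return pooled, pooled_se, i2
```

With only six cohorts, I² is itself imprecise, so it is best read alongside the forest plot rather than as a standalone cut-off.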

Exposure-omics network analyses

We conducted network analyses using the list of ExWAS associations passing multiple testing correction (1170). We built a network for each exposure period (period-specific), and each network contained all the molecular layers (multi-omics). Molecular features and exposures were considered as the nodes of the network, and the edges represented omics-exposure associations (based on the ExWAS results). Network visualization was carried out using Cytoscape 3.6.1 (http://cytoscape.org), and networks were automatically arranged using the Cytoscape force-directed layout, which aims to highlight the underlying topology of the graph¹¹². The association effect size was used as the numeric edge attribute that weights the length of the edges. In order to find densely connected regions in the network, clustering of the childhood network was done based on Community Clustering (GLay) using clusterMaker2 v2.0^28,29.
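The network construction described above can be sketched as a weighted undirected graph in Python. The sketch is only an illustration of the data structure: it groups nodes by connected components, which is a much cruder grouping than the GLay community clustering used in the study, and the node names in the usage note are made up.

```python
from collections import defaultdict, deque


def build_network(associations):
    """Build an undirected graph from (exposure, feature, effect_size) triples."""
    graph = defaultdict(dict)
    for exposure, feature, effect in associations:
        graph[exposure][feature] = effect
        graph[feature][exposure] = effect
    return graph


def connected_components(graph):
    """Group nodes that are linked by any chain of associations."""
    seen, components = set(), []
    for start in graph:
        if start in seen:
            continue
        queue, component = deque([start]), set()
        seen.add(start)
        while queue:
            node = queue.popleft()
            component.add(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    queue.append(neighbour)
        components.append(component)
    return components
```

For instance, `connected_components(build_network([("exposure_A", "feature_1", 0.5), ("exposure_B", "feature_1", 0.3), ("exposure_C", "feature_2", 0.2)]))` yields two groups, because the first two hypothetical exposures share a feature and the third does not.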

Comparison with literature

Molecular features of significant associations were checked against previous literature findings based on existing databases reporting associated exposures or traits: the EWAS Catalogue (http://ewascatalog.org/)³⁰, the EWAS Atlas (http://bigd.big.ac.cn/ewas/index)¹⁶ and the Exposome Explorer database (http://exposome-explorer.iarc.fr/)^31,36. More details can be found in Supplementary Information and Supplementary Data 1J.

Functional enrichment analyses

Functional enrichment analyses were restricted to molecular layers with features that could be easily annotated at the gene level: DNA methylation, gene and miRNA expression, and proteins. For exposures with at least one significant association, we retrieved all molecular features associated at p value <1E−03. Then, we annotated these molecular features to genes as described in Supplementary Information, and obtained a unique list of “dysregulated” genes by combining genes detected in any of the molecular layers. The clusterProfiler v3.8.0 R package¹¹³ was used to check whether this list of genes was enriched for gene sets (Gene Ontology (GO) Biological Processes terms, KEGG, Molecular Signatures Database C2 curated gene sets), diseases (DisGeNET), and transcription factor and miRNA binding motifs (Molecular Signatures Database C3 motifs and transcription factor motifs). Multiple testing was corrected with the FDR–BH method within each exposure, and only gene sets with >3 genes are reported.
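Over-representation analyses of this kind are commonly based on a hypergeometric test, and the following Python sketch shows that calculation for a single gene set. It is an illustration under that assumption, not the clusterProfiler code, and the argument names are hypothetical.

```python
from math import comb


def enrichment_p_value(n_universe, n_gene_set, n_selected, n_overlap):
    """Upper-tail hypergeometric probability P(X >= n_overlap).

    n_universe: number of annotated background genes.
    n_gene_set: number of background genes in the gene set.
    n_selected: number of "dysregulated" genes in the background.
    n_overlap:  number of "dysregulated" genes that fall in the gene set.
    """
    upper = min(n_gene_set, n_selected)
    total = comb(n_universe, n_selected)
    tail = sum(
        comb(n_gene_set, k) * comb(n_universe - n_gene_set, n_selected - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total
```

The resulting p values, one per gene set, would then be adjusted with the BH procedure within each exposure.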

Expression quantitative trait methylation (eQTMs) and miRNA gene target prediction

To identify experimentally validated target genes for miRNAs we used miRwalk v3⁵⁰. Expression quantitative trait methylations in cis (cis-eQTMs) were identified using HELIX data. First, we paired each transcript cluster (TC) to all CpGs closer than 500 kb from its transcription start site (TSS), and then, for each CpG-TC pair we fitted a linear regression model between gene expression and methylation levels adjusted for age, sex and cohort. More details on the analyses and the multiple-testing correction can be found in Supplementary Information and elsewhere¹¹⁴.
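The pairing step of the cis-eQTM analysis can be sketched as a simple distance filter. The Python example below assumes genomic positions are given as (chromosome, base-pair position) tuples and uses made-up identifiers; the input layout and function name are hypothetical, and the regression fitted for each pair is not shown.

```python
def pair_cpgs_to_tcs(tss_by_tc, cpg_positions, window=500_000):
    """List (TC, CpG) pairs on the same chromosome with |CpG - TSS| < window.

    tss_by_tc: dict mapping a transcript cluster ID to (chromosome, TSS).
    cpg_positions: dict mapping a CpG ID to (chromosome, position).
    """
    pairs = []
    for tc, (tc_chrom, tss) in tss_by_tc.items():
        for cpg, (cpg_chrom, pos) in cpg_positions.items():
            if cpg_chrom == tc_chrom and abs(pos - tss) < window:
                pairs.append((tc, cpg))
    return pairs
```

This nested loop is quadratic in the number of features; at the scale of a 450K array, sorting positions per chromosome or using an interval index would be the more practical design.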

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The summarized results (exposure, omics biomarker, effect, standard error, p value) generated during this study are available at https://helixomics.isglobal.org/. The raw data supporting the current study are available from the corresponding author on request, subject to ethical and legislative review. The “HELIX Data External Data Request Procedures” are available with the data inventory on this website: http://www.projecthelix.eu/data-inventory. The document describes who can apply for access to the data and how, the timings for approval, and the conditions for data access and publication.

Code availability

The code to test the relationship between the pregnancy and childhood exposomes and molecular features is available through the omicRexposome v1.12.1 R package¹⁰⁶.

References

Lim, S. S. et al. A comparative risk assessment of burden of disease and injury attributable to 67 risk factors and risk factor clusters in 21 regions, 1990-2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet 380, 2224–2260 (2012).
Rappaport, S. M. & Smith, M. T. Epidemiology. Environment and disease risks. Science 330, 460–461 (2010).
Wild, C. P. Complementing the genome with an ‘exposome’: the outstanding challenge of environmental exposure measurement in molecular epidemiology. Cancer Epidemiol. Biomark. Prev. 14, 1847–1850 (2005).
Vermeulen, R., Schymanski, E. L., Barabási, A.-L. & Miller, G. W. The exposome and health: where chemistry meets biology. Science 367, 392–396 (2020).
Niedzwiecki, M. M. et al. The exposome: molecules to populations. Annu. Rev. Pharmacol. Toxicol. 59, 107–127 (2019).
Wild, C. P. The exposome: from concept to utility. Int. J. Epidemiol. 41, 24–32 (2012).
Li-Pook-Than, J. & Snyder, M. IPOP goes the world: Integrated personalized omics profiling and the road toward improved health care. Chem. Biol. 20, 660–666 (2013).
Schüssler-Fiorenza Rose, S. M. et al. A longitudinal big data approach for precision health. Nat. Med. 25, 792–804 (2019).
Contrepois, K. et al. Molecular choreography of acute exercise. Cell 181, 1112.e16–1130.e16 (2020).
Franks, P. W. et al. Childhood obesity, other cardiovascular risk factors, and premature death. N. Engl. J. Med. 362, 485–493 (2010).
Hardy, R., Lawlor, D. A. & Kuh, D. A life course approach to cardiovascular aging. Future Cardiol. 11, 101–113 (2015).
Juonala, M. et al. Childhood adiposity, adult adiposity, and cardiovascular risk factors. N. Engl. J. Med. 365, 1876–1885 (2011).
Arango, C. et al. Preventive strategies for mental health. Lancet Psychiatry 5, 591–604 (2018).
Bui, D. S. et al. Childhood predictors of lung function trajectories and future COPD risk: a prospective cohort study from the first to the sixth decade of life. Lancet Respir. Med. 6, 535–544 (2018).
Cavalli, G. & Heard, E. Advances in epigenetics link genetics to the environment and disease. Nature 571, 489–499 (2019).
Li, M. et al. EWAS Atlas: a curated knowledgebase of epigenome-wide association studies. Nucleic Acids Res. 47, D983–D988 (2019).
Everson, T. M. & Marsit, C. J. Integrating -omics approaches into human population-based studies of prenatal and early-life exposures. Curr. Environ. Health Rep. 5, 328–337 (2018).
Athersuch, T. J. The role of metabolomics in characterizing the human exposome. Bioanalysis 4, 2207–2212 (2012).
Rappaport, S. M., Barupal, D. K., Wishart, D., Vineis, P. & Scalbert, A. The blood exposome and its role in discovering causes of disease. Environ. Health Perspect. 122, 769–774 (2014).
Gauglitz, J. M. et al. Untargeted mass spectrometry-based metabolomics approach unveils molecular changes in raw and processed foods and beverages. Food Chem. 302, 125290 (2020).
Maitre, L. et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open 8, e021311 (2018).
Robinson, O. et al. The urban exposome during pregnancy and its socioeconomic determinants. Environ. Health Perspect. 126, 77005 (2018).
Tamayo-Uria, I. et al. The early-life exposome: description and patterns in six European countries. Environ. Int. 123, 189–200 (2019).
Haug, L. S. et al. In-utero and childhood chemical exposome in six European mother-child cohorts. Environ. Int. 121, 751–763 (2018).
Vives-Usano, M. et al. In utero and childhood exposure to tobacco smoke and multi-layer molecular signatures in children. BMC Med. 18, 243 (2020).
Lau, C.-H. E. et al. Determinants of the urinary and serum metabolome in children from six European populations. BMC Med. 16, 202 (2018).
von Hippel, P. T. The heterogeneity statistic I2 can be biased in small meta-analyses. BMC Med. Res. Methodol. 15, 1–8 (2015).
Newman, M. E. J. & Girvan, M. Finding and evaluating community structure in networks. https://arxiv.org/pdf/cond-mat/0308217.pdf (2003).
Su, G., Kuchinsky, A., Morris, J. H., States, D. J. & Meng, F. GLay: community structure analysis of biological networks. Bioinformatics 26, 3135–3137 (2010).
Battram, T. et al. The EWAS Catalog: a database of epigenome-wide association studies. OSF Prepr. https://doi.org/10.31219/OSF.IO/837WN (2021).
Neveu, V., Nicolas, G., Salek, R. M., Wishart, D. S. & Scalbert, A. Exposome-Explorer 2.0: an update incorporating candidate dietary biomarkers and dietary associations with cancer risk. Nucleic Acids Res. 48, D908–D912 (2020).
Joubert, B. R. et al. Children’s Health 450K epigenome-wide scan identifies differential DNA methylation in newborns related to maternal smoking during pregnancy. Environ. Health Perspect. 120, 1425–1432 (2012).
Satarug, S. Dietary cadmium intake and its effects on kidneys. Toxics 6, 15 (2018).
Everson, T. M. et al. Cadmium-associated differential methylation throughout the placental genome: epigenome-wide association study of two U.S. birth cohorts. Environ. Health Perspect. 126, 017010 (2018).
Domingo-Relloso, A. et al. Cadmium, smoking, and human blood DNA methylation profiles in adults from the Strong Heart Study. Environ. Health Perspect. 128, 67005 (2020).
Neveu, V. et al. Exposome-Explorer: a manually-curated database on biomarkers of exposure to dietary and environmental factors. Nucleic Acids Res. 45, D979–D984 (2017).
Heinzmann, S. S., Holmes, E., Kochhar, S., Nicholson, J. K. & Schmitt-Kopplin, P. 2-Furoylglycine as a candidate biomarker of coffee consumption. J. Agric. Food Chem. 63, 8615–8621 (2015).
Schwarz, G. Molybdenum cofactor and human disease. Curr. Opin. Chem. Biol. https://doi.org/10.1016/j.cbpa.2016.03.016 (2016).
Lionetto, M. G., Caricato, R., Giordano, M. E. & Schettino, T. The complex relationship between metals and carbonic anhydrase: New insights and perspectives. Int. J. Mol. Sci. https://doi.org/10.3390/ijms17010127 (2016).
EEA. Climate change, impacts and vulnerability in Europe 2016 — European Environment Agency. https://www.eea.europa.eu/publications/climate-change-impacts-and-vulnerability-2016 (2017).
Davies, S. K. et al. Effect of sleep deprivation on the human metabolome. Proc. Natl Acad. Sci. USA 111, 10761–10766 (2014).
Selley, M. L. Increased (E)−4-hydroxy-2-nonenal and asymmetric dimethylarginine concentrations and decreased nitric oxide concentrations in the plasma of patients with major depression. J. Affect. Disord. 80, 249–256 (2004).
Nasca, C. et al. Acetyl-L-carnitine deficiency in patients with major depressive disorder. Proc. Natl Acad. Sci. USA 115, 8627–8632 (2018).
Wei, Q. et al. Adiponectin is required for maintaining normal body temperature in a cold environment. BMC Physiol. https://doi.org/10.1186/s12899-017-0034-7 (2017).
Jankovic, A. et al. Endocrine and metabolic signaling in retroperitoneal white adipose tissue remodeling during cold acclimation. J. Obes. https://doi.org/10.1155/2013/937572 (2013).
Imbeault, P., Dépault, I. & Haman, F. Cold exposure increases adiponectin levels in men. Metabolism https://doi.org/10.1016/j.metabol.2008.11.017 (2009).
Hugo, E. R. et al. Bisphenol A at environmentally relevant doses inhibits adiponectin release from human adipose tissue explants and adipocytes. Environ. Health Perspect. 116, 1642–1647 (2008).
Abhimanyu & Coussens, A. K. The role of UV radiation and Vitamin D in the seasonality and outcomes of infectious disease. Photochem. Photobiol. Sci. https://doi.org/10.1039/c6pp00355a (2017).
Fontal, A. et al. Climatic signatures in the different COVID-19 pandemic waves across both hemispheres. Nat. Comput. Sci. 1, 655–665 (2021).
Sticht, C., De La Torre, C., Parveen, A. & Gretz, N. miRWalk: An online resource for prediction of microRNA binding sites. PLoS ONE https://doi.org/10.1371/journal.pone.0206239 (2018).
Rauschert, S. et al. Machine learning-based DNA methylation score for fetal exposure to maternal smoking: development and validation in samples collected from adolescents and adults. Environ. Health Perspect. 128, 1–11 (2020).
Reese, S. E. et al. DNA methylation score as a biomarker in newborns for sustained maternal smoking during pregnancy. Environ. Health Perspect. 125, 760–766 (2017).
Novotny, J. A. & Peterson, C. A. Molybdenum. Adv. Nutr. 9, 272–273 (2018).
Meeker, J. D. et al. Cadmium, lead, and other metals in relation to semen quality: human evidence for molybdenum as a male reproductive toxicant. Environ. Health Perspect. 116, 1473–1479 (2008).
Meeker, J. D. et al. Environmental exposure to metals and male reproductive hormones: circulating testosterone is inversely associated with blood molybdenum. Fertil. Steril. 93, 130–140 (2010).
Zheng, Y. et al. Evaluating associations between early pregnancy trace elements mixture and 2nd trimester gestational glucose levels: a comparison of three statistical approaches. Int. J. Hyg. Environ. Health 224, 113446 (2020).
Yin, S. et al. Essential trace elements in placental tissue and risk for fetal neural tube defects. Environ. Int. 139, 105688 (2020).
Vázquez-Salas, R. A. et al. Prenatal molybdenum exposure and infant neurodevelopment in Mexican children. Nutr. Neurosci. 17, 72–80 (2014).
Maitre, L. et al. Urine metabolic signatures of multiple environmental pollutants in pregnant women: an exposome approach. Environ. Sci. Technol. 52, 13469–13480 (2018).
Gibson, R. et al. The association of fish consumption and its urinary metabolites with cardiovascular risk factors: the International Study of Macro-/Micronutrients and Blood Pressure (INTERMAP). Am. J. Clin. Nutr. 111, 280–290 (2020).
Coryell, M., McAlpine, M., Pinkham, N. V., McDermott, T. R. & Walk, S. T. The gut microbiome is required for full protection against acute arsenic toxicity in mouse models. Nat. Commun. 9, 5424 (2018).
Laue, H. E. et al. Bacterial modification of the association between arsenic and autism-related social behavior scores. Expo. Health https://doi.org/10.1007/S12403-022-00494-0 (2022).
Cano-Sancho, G. & Casas, M. Interactions between environmental pollutants and dietary nutrients: current evidence and implications in epidemiological research. J. Epidemiol. Community Health https://doi.org/10.1136/jech-2020-213789 (2020).
De Bie, P., Muller, P., Wijmenga, C. & Klomp, L. W. J. Molecular pathogenesis of Wilson and Menkes disease: correlation of mutations with molecular defects and disease phenotypes. J. Med. Genet. https://doi.org/10.1136/jmg.2007.052746 (2007).
Agier, L. et al. Early-life exposome and lung function in children in Europe: an analysis of data from the longitudinal, population-based HELIX cohort. Lancet Planet. Health 3, e81–e92 (2019).
Vrijheid, M. et al. Early-life environmental exposures and childhood obesity: an exposome-wide approach. Environ. Health Perspect. 128, 1–14 (2020).
Cadiou, S. et al. Using methylome data to inform exposome-health association studies: an application to the identification of environmental drivers of child body mass index. Environ. Int. 138, 105622 (2020).
Warembourg, C. et al. Early-life environmental exposures and blood pressure in children. J. Am. Coll. Cardiol. 74, 1317–1328 (2019).
Maitre, L. et al. Early-life environmental exposure determinants of child behavior in Europe: a longitudinal, population-based study. Environ. Int. 153, 106523 (2021).
Newgard, C. B. Metabolomics and metabolic diseases: where do we stand? Cell Metab. 25, 43–56 (2017).
Chen, Z. et al. Near-roadway air pollution exposure and altered fatty acid oxidation among adolescents and young adults – the interplay with obesity. Environ. Int. 130, 104935 (2019).
Vrijheid, M., Casas, M., Gascon, M., Valvi, D. & Nieuwenhuijsen, M. Environmental pollutants and child health: a review of recent concerns. Int. J. Hyg. Environ. Health 219, 331–342 (2016).
Güil-Oumrait, N. et al. Prenatal exposure to persistent organic pollutants and markers of obesity and cardiometabolic risk in Spanish adolescents. Environ. Int. 151, (2021).
Petriello, M. C. et al. Relationship between serum trimethylamine N-oxide and exposure to dioxin-like pollutants. Environ. Res. 162, 211–218 (2018).
Zhu, W., Wang, Z., Tang, W. H. W. & Hazen, S. L. Gut microbe-generated trimethylamine N-oxide from dietary choline is prothrombotic in subjects. Circulation 135, 1671–1673 (2017).
Braun, J. M. Early-life exposure to EDCs: role in childhood obesity and neurodevelopment. Nat. Rev. Endocrinol. 13, 161–173 (2017).
Giovanoulis, G. et al. Multi-pathway human exposure assessment of phthalate esters and DINCH. Environ. Int. 112, 115–126 (2018).
Chakinala, R. C., Khatri, A., Gupta, K., Koike, K. & Epelbaum, O. Sphingolipids in COPD. Eur. Respir. Rev. https://doi.org/10.1183/16000617.0047-2019 (2019).
Ono, J. G. et al. Decreased sphingolipid synthesis in children with 17q21 asthma-risk genotypes. J. Clin. Investig. https://doi.org/10.1172/JCI130860 (2020).
Zhou, M. et al. Metabolomic markers of phthalate exposure in plasma and urine of pregnant women. Front. Public Health 6, 298 (2018).
Zhao, H. et al. Paraben exposure related to purine metabolism and other pathways revealed by mass spectrometry-based metabolomics. Environ. Sci. Technol. 54, 3447–3454 (2020).
Cavalli, G. & Heard, E. Advances in epigenetics link genetics to the environment and disease. Nature 571, 489–499 (2019).
Tsai, P.-C. et al. Smoking induces coordinated DNA methylation and gene expression changes in adipose tissue with consequences for metabolic health. Clin. Epigenetics 10, 126 (2018).
Siskos, A. P. et al. Interlaboratory reproducibility of a targeted metabolomics platform for analysis of human serum and plasma. Anal. Chem. 89, 656–665 (2017).
Wong, H. L. et al. Reproducibility and correlations of multiplex cytokine levels in asymptomatic persons. Cancer Epidemiol. Biomark. Prev. 17, 3450–3456 (2008).
Floegel, A. et al. Identification of serum metabolites associated with risk of type 2 diabetes using a targeted metabolomic approach. Diabetes 62, 639–648 (2013).
Illig, T. et al. A genome-wide perspective of genetic variation in human metabolism. Nat. Genet. 42, 137–141 (2010).
Varma, V. R. et al. Brain and blood metabolite signatures of pathology and progression in Alzheimer disease: a targeted metabolomics study. PLoS Med. 15, e1002482 (2018).
Petrick, L. M., Uppal, K. & Funk, W. E. Metabolomics and adductomics of newborn bloodspots to retrospectively assess the early-life exposome. Curr. Opin. Pediatr. 32, 300–307 (2020).
Walker, D. I. et al. The metabolome: a key measure for exposome research in epidemiology. Curr. Epidemiol. Rep. 6, 93–103 (2019).
Jiang, C. et al. Dynamic human environmental exposome revealed by longitudinal personal monitoring. Cell 175, 277.e31–291.e31 (2018).
Casas, M. et al. Variability of urinary concentrations of non-persistent chemicals in pregnant women and school-aged children. Environ. Int. 121, 561–573 (2018).
Nieuwenhuijsen, M. J. et al. Variability in and agreement between modeled and personal continuously measured black carbon levels using novel smartphone and sensor technologies. Environ. Sci. Technol. 49, 2977–2982 (2015).
Järup, L. & Åkesson, A. Current status of cadmium as an environmental health problem. Toxicol. Appl. Pharmacol. 238, 201–208 (2009).
Jain, P. et al. A multivariate approach to investigate the combined biological effects of multiple exposures. J. Epidemiol. Community Health 72, 564–571 (2018).
Park, S. K., Zhao, Z. & Mukherjee, B. Construction of environmental risk score beyond standard linear models using machine learning methods: application to metal mixtures, oxidative stress and cardiovascular disease in NHANES. Environ. Health 16, 102 (2017).
Wright, J. et al. Cohort profile: The Born in Bradford multi-ethnic family cohort study. Int. J. Epidemiol. 42, 978–991 (2013).
Heude, B. et al. Cohort profile: The EDEN mother-child cohort on the prenatal and early postnatal determinants of child health and development. Int. J. Epidemiol. 45, 353–363 (2016).
Guxens, M. et al. Cohort profile: The INMA–INfancia y Medio Ambiente–(Environment and Childhood) Project. Int. J. Epidemiol. 41, 930–940 (2011).
Grazuleviciene, R., Danileviciute, A., Nadisauskiene, R. & Vencloviene, J. Maternal smoking, GSTM1 and GSTT1 polymorphism and susceptibility to adverse pregnancy outcomes. Int. J. Environ. Res. Public Health 6, 1282–1297 (2009).
Magnus, P. et al. Cohort profile update: The Norwegian Mother and Child Cohort Study (MoBa). Int. J. Epidemiol. 45, 382–388 (2016).
Chatzi, L. et al. Cohort profile: The Mother-Child Cohort in Crete, Greece (Rhea Study). Int. J. Epidemiol. 46, 1392–1393k (2017).
White, I. R., Royston, P. & Wood, A. M. Multiple imputation using chained equations: Issues and guidance for practice. Stat. Med. 30, 377–399 (2011).
van Buuren, S. & Groothuis-Oudshoorn, K. mice: Multivariate Imputation by Chained Equations in R. J. Stat. Softw. 45, 1–67 (2011).
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Hernandez-Ferrer, C. et al. Comprehensive study of the exposome and omic data using rexposome Bioconductor packages. Bioinformatics https://doi.org/10.1093/bioinformatics/btz526 (2019).
WHO. BMI-for-Age (5-19 Years) (WHO, 2015).
de Onis, M. et al. Development of a WHO growth reference for school-aged children and adolescents. Bull. World Health Organ. 85, 660–667 (2007).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B 57, 289–300 (1995).
Li, M.-X., Yeung, J. M. Y., Cherny, S. S. & Sham, P. C. Evaluating the effective numbers of independent tests and significant p-value thresholds in commercial genotyping arrays and public imputation reference datasets. Hum. Genet. 131, 747–756 (2012).
Schwarzer, G. Package ‘meta’. R News https://doi.org/10.1007/978-3-319-21416-0 (2007).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Yu, G. clusterProfiler: a universal enrichment tool for functional and comparative study. Innovation https://doi.org/10.1016/j.xinn.2021.100141 (2021).
Ruiz-Arenas, C. et al. Identification of autosomal cis expression quantitative trait methylation (cis eQTMs) in children’s blood. Elife https://doi.org/10.7554/eLife.65310 (2022).

Acknowledgements

We would like to thank all the families for their generous contribution. The study has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 874583 (ATHLETE project). Data were collected as part of the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement No 308333 (HELIX project). BiB received core infrastructure funding from the Wellcome Trust (WT101597MA) and a joint grant from the UK Medical Research Council (MRC) and Economic and Social Science Research Council (ESRC) (MR/N024397/1). INMA data collections were supported by grants from the Instituto de Salud Carlos III, CIBERESP, and the Generalitat de Catalunya-CIRIT. KANC was funded by the grant of the Lithuanian Agency for Science Innovation and Technology (6-04-2014_31V-66). The Norwegian Mother, Father and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. The Rhea project was financially supported by European projects (EU FP6-2003-Food-3-NewGeneris, EU FP6. STREP Hiwate, EU FP7 ENV.2007.1.2.2.2. Project No 211250 Escape, EU FP7-2008-ENV-1.2.1.4 Envirogenomarkers, EU FP7-HEALTH-2009- single stage CHICOS, EU FP7 ENV.2008.1.2.1.6. Proposal No 226285 ENRIECO, EU-FP7-HEALTH-2012 Proposal No 308333 HELIX), and the Greek Ministry of Health (Program of Prevention of obesity and neurodevelopmental disorders in preschool children, in Heraklion district, Crete, Greece: 2011-2014; “Rhea Plus”: Primary Prevention Program of Environmental Risk Factors for Reproductive Health, and Child Health: 2012-15). ISGlobal acknowledges support from the Spanish Ministry of Science and Innovation through the “Centro de Excelencia Severo Ochoa 2019-2023” Program (CEX2018-000806-S), and support from the Generalitat de Catalunya through the CERCA Program. L.M. is funded by a Juan de la Cierva-Incorporación fellowship (IJC2018-035394-I) awarded by the Spanish Ministerio de Economía, Industria y Competitividad. M.V.-U. and C.R.-A. were supported by a FI fellowship from the Catalan Government (FI-DGR 2015 and #016FI_B 00272). M. Casas received funding from Instituto Carlos III (Ministry of Economy and Competitiveness) (CD12/00563 and MS16/00128).

Author information

These authors contributed equally: Léa Maitre, Mariona Bustamante, Juan R. González, Hector C. Keun, Martine Vrijheid.

Authors and Affiliations

Institute for Global Health (ISGlobal), Barcelona, Spain
Léa Maitre, Mariona Bustamante, Carles Hernández-Ferrer, Marta Vives-Usano, Carlos Ruiz-Arenas, Dolors Pelegrí-Sisó, Maribel Casas, Jordi Sunyer, Mark Nieuwenhuijsen, Jose Urquiza, Juan R. González & Martine Vrijheid
Universitat Pompeu Fabra (UPF), Barcelona, Spain
Léa Maitre, Mariona Bustamante, Carles Hernández-Ferrer, Marta Vives-Usano, Carlos Ruiz-Arenas, Dolors Pelegrí-Sisó, Maribel Casas, Jordi Sunyer, Mark Nieuwenhuijsen, Jose Urquiza, Eva Borràs, Eduard Sabidó, Juan R. González & Martine Vrijheid
Consorcio de Investigación Biomédica en Red de Epidemiología y Salud Pública (CIBERESP), Madrid, Spain
Léa Maitre, Mariona Bustamante, Carles Hernández-Ferrer, Marta Vives-Usano, Carlos Ruiz-Arenas, Dolors Pelegrí-Sisó, Maribel Casas, Jordi Sunyer, Mark Nieuwenhuijsen, Jose Urquiza, Juan R. González & Martine Vrijheid
Center for Genomic Regulation (CRG), Barcelona Institute of Science and Technology, Barcelona, Spain
Mariona Bustamante, Marta Vives-Usano, Eva Borràs, Eduard Sabidó & Xavier Estivill
Department of Mathematics, Imperial College London, South Kensington, London, UK
Denise Thiel
MRC Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
Chung-Ho E. Lau & Oliver Robinson
Division of Systems Medicine, Department of Metabolism, Digestion and Reproduction, Imperial College London, London, UK
Chung-Ho E. Lau, Muireann Coen & Hector C. Keun
Cancer Metabolism & Systems Toxicology Group, Division of Cancer, Department of Surgery & Cancer, Imperial College London, London, UK
Alexandros P. Siskos & Hector C. Keun
Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, Bradford, UK
Dan Mason & John Wright
Team of Environmental Epidemiology applied to Reproduction and Respiratory Health, Institute for Advanced Biosciences (IAB), Inserm, CNRS, Université Grenoble Alpes, Grenoble, France
Solène Cadiou & Rémy Slama
Centre for Research in Epidemiology and Statistics (CRESS), Inserm, Université de Paris, Paris, France
Barbara Heude
Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
Jordi Sunyer
Division of Climate and Environmental Health, Norwegian Institute of Public Health, Oslo, Norway
Eleni Z. Papadopoulou, Kristine B. Gutzkow, Amrit K. Sakhi & Cathrine Thomsen
Department of Environmental Sciences, Vytautas Magnus University, Kaunas, Lithuania
Sandra Andrusaityte & Regina Grazuleviciene
Department of Social Medicine, Faculty of Medicine, University of Crete, Heraklion, Greece
Marina Vafeiadi
Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
Leda Chatzi
Computational Biology program, CIMA-University of Navarra, Pamplona, Spain
Ibon Tamayo
Medicine Genomics Group, Centro de Investigación Biomédica en Red Enfermedades Raras (CIBERER), University of Santiago de Compostela, CIMUS, Santiago de Compostela, Spain
Inés Quintela & Ángel Carracedo
Galician Foundation of Genomic Medicine, Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Servicio Gallego de Salud (SERGAS), Santiago de Compostela, Galicia, Spain
Ángel Carracedo
Oncology Safety, Clinical Pharmacology and Safety Sciences, R&D, AstraZeneca, Cambridge, UK
Muireann Coen

Contributions

O.R., L.M., M.B., J.W., R.S., J.S., C.T., M. Coen, J.R.G., H.K. and M. Vrijheid designed the omics study in HELIX. The following authors participated in omics data acquisition and quality control: A.C., I.Q., M.B., C.R.-A. (DNA methylation), X.E., M.B., M.V.-U. (transcriptomics), E.S., E.B., M.B., J.R.G. (proteomics), C.E.L., A.P.S., L.M., H.K., M. Coen (metabolomics). O.R., J.W., D.M., R.S., B.H., S.A., J.S., M. Casas, K.B.G., E.P., R.G., L.C., C.T., M. Vafeiadi and A.K.S. are the PIs of the cohorts or participated in sample and exposure data acquisition. M. Casas, C.T., A.K.S., M.N. and I.T. measured the pregnancy and postnatal exposomes. C.H.-F., L.M., M.B., D.T., S.C., J.U., D.P.-S. and J.R.G. performed statistical analyses and J.R.G. functional enrichment analyses. The HELIX project was coordinated by M. Vrijheid. L.M., M.B. and M. Vrijheid wrote the original draft of the paper and C.H.-F., A.P.S., O.R., J.W., D.M., S.C., K.B.G., E.P., C.T., A.K.S., M.N., J.U., M. Coen and H.K. contributed to reviewing and editing the manuscript. All authors read and approved the manuscript.

Corresponding author

Correspondence to Martine Vrijheid.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Supplementary Dataset 4

Supplementary Dataset 5

Supplementary Dataset 6

Supplementary Dataset 7

Supplementary Dataset 8

Supplementary Dataset 9

Supplementary Dataset 10

Supplementary Dataset 11

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Maitre, L., Bustamante, M., Hernández-Ferrer, C. et al. Multi-omics signatures of the human early life exposome. Nat Commun 13, 7024 (2022). https://doi.org/10.1038/s41467-022-34422-2

Received: 15 April 2021
Accepted: 25 October 2022
Published: 21 November 2022
DOI: https://doi.org/10.1038/s41467-022-34422-2
