Abstract
Genetic evidence suggests glioma risk is altered by leukocyte telomere length, allergic disease (asthma, hay fever or eczema), alcohol consumption, childhood obesity, low-density lipoprotein cholesterol (LDLc) and triglyceride levels. DNA methylation (DNAm) variation influences many of these glioma-related traits and is an established feature of glioma. Yet the causal relationship between DNAm variation with both glioma incidence and glioma risk factors is unknown. We applied a two-step Mendelian randomization (MR) approach and several sensitivity analyses (including colocalization and Steiger filtering) to assess the association of DNAm with glioma risk factors and glioma incidence. We used data from a recently published catalogue of germline genetic variants robustly associated with DNAm variation in blood (32,851 participants) and data from a genome-wide association study of glioma risk (12,488 cases and 18,169 controls, sub-divided into 6191 glioblastoma cases and 6305 non-glioblastoma cases). MR evidence indicated that DNAm at 3 CpG sites (cg01561092, cg05926943, cg01584448) in one genomic region (HEATR3) had a putative association with glioma and glioblastoma risk (False discovery rate [FDR] < 0.05). Steiger filtering provided evidence against reverse causation. Colocalization presented evidence against genetic confounding and suggested that differential DNAm at the 3 CpG sites and glioma were driven by the same genetic variant. MR provided little evidence to suggest that DNAm acts as a mediator on the causal pathway between risk factors previously examined and glioma onset. To our knowledge, this is the first study to use MR to appraise the causal link of DNAm with glioma risk factors and glioma onset. Subsequent analyses are required to improve the robustness of our results and rule out horizontal pleiotropy.
Similar content being viewed by others
Introduction
Brain tumours such as glioma are responsible for the greatest number of years lost to cancer to those under 40 years of age1 despite having age adjusted incidence rates ranging from just 4.67 to 5.73 per 100,0002,3. A serious health burden is posed by glioma due to their poor prognosis, with an overall 5-year survival rate of under 20% and significant morbidity in survivors4,5,6. While there have been many attempts to ascertain risk factors for glioma, evidence has been inconsistent, and the aetiology of glioma remains largely unclear7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23.
Mendelian randomization (MR) studies have provided some evidence to implicate genetically predicted leukocyte telomere length, allergic diseases (asthma, hay fever or eczema), alcohol consumption, childhood extreme obesity, low-density lipoprotein cholesterol (LDLc) and triglyceride levels as causally relevant risk factors for glioma24. The underlying biological mechanisms by which these traits causally relate to glioma risk remains to be established.
One approach to understanding the aetiological pathways influencing glioma onset is to exploit the increasing body of molecular phenotype data to examine epigenetic pathways. Epigenetic changes include chemical modifications that do not change the sequence of DNA but can alter gene expression25. The most commonly measured form of epigenetic mark is DNA methylation (DNAm), whereby a methyl group (–CH3) is either added or subtracted to a cytosine nucleotide adjacent to a guanine nucleotide within the DNA sequence (cytosine-phosphate-guanine [CpG] site)25. One method to examine DNAm variation linked to glioma incidence is to undertake an epigenome-wide association study (EWAS)26,27,28,29,30,31,32,33,34,35,36. However, most EWAS have been limited by very modest sample sizes or have been undertaken using glioma tumour tissue which are potentially biased through confounding by treatment thus restricting any inferences that can be made with respect to disease aetiology.
As recent studies have reported that DNAm influences glioma-related traits including allergic diseases37, telomere length38 childhood obesity39 and glioma risk40, we sought to assess the causal relationship of DNAm with glioma risk factors identified in a prior study24 (Table 1) and glioma incidence using two-step MR41. We used a recently published catalogue of germline genetic variants robustly associated with DNAm variation in blood, namely methylation quantitative trait loci (mQTL)42, as a proxy for DNAm variation in blood, rather than measuring DNAm variation directly. As glioma is a disease with a high degree of heterogeneity, with differing genetic profiles both intra- and inter-tumourally43, we performed a subtype analysis by splitting the glioma outcome data into glioblastoma or non-glioblastoma. An overview of the research questions can be found in Fig. 1.
Results
Does DNAm causally influence both glioma risk and glioma risk factors?
Using the full summary statistics for the 232,476 CpG sites (n = 32,851) reported in GoDMC, instrumental variables (IVs) were constructed (P < 5 × 10−8 and r2 < 0.001) to act as a proxy for 42,659 CpG sites that could be used in a two-sample MR framework.
Two-sample MR was used to investigate the potential causal effect of DNAm variation at 42,659 CpG sites and glioma risk. For glioma risk there was MR evidence for 284 CpG-glioma effects that met the false discovery rate (FDR) correction threshold (< 0.05). MR results that met the FDR threshold can be found in Appendix 1.1. F-statistic calculations indicated that all 284 CpG sites linked to glioma had an F-statistic > 10 (Appendix 1.2) which suggests that the MR estimate was less likely to be affected by weak instrument bias.
As a sensitivity analysis, colocalization was used to establish the probability that DNAm and glioma were driven by the same causal variant at each locus. In the colocalization analyses, we found suggestive evidence (H4 > 70%) that DNAm at 3 of the 284 CpG sites and glioma were driven by the same genetic variant. Next, we examined the directionality of DNAm at the 3 CpG sites and glioma risk using the Steiger filtering method: the 3 CpG sites showed evidence that the direction of effect was methylation influencing glioma risk (Fig. 2). Complete results from both MR and sensitivity analysis are summarised in Table 2.
In the subtype analysis, there were 209 CpG-glioblastoma (F-statistic > 10) MR estimates that met the FDR correction threshold (FDR < 0.05) (Appendix 1.3). 3 CpG-glioblastoma associations showed evidence of colocalization and all 3 CpG sites showed evidence that the direction of effect was methylation influencing glioblastoma risk (Fig. 2). The full MR results and results from each sensitivity analysis is summarised in Table 3.
For the glioma subtypes there were 175 CpG-non-glioblastoma effects (F-statistic > 10) that met the FDR correction threshold (< 0.05) (Appendix 1.4). Of these 175 CpG sites, 0 CpG-non-glioblastoma effects showed strong evidence of colocalization.
The 3 CpG sites that showed MR and colocalization evidence of an association with glioma and glioblastoma are displayed in Fig. 3. In summary, the results indicate that increased levels of DNAm at cg01584448 increases risk of glioma (OR 5.62, 95% CI 3.37–9.36, p-adjusted 1 × 10− 7) and glioblastoma (OR 9.02, 95% CI 4.81–16.91, p-adjusted 1.88 × 10− 8). cg5926943 and cg01561092 were associated with a decrease in the risk of both glioma (OR 0.38, 95% CI 0.28–0.51, p-adjusted 1.16 × 10− 7; OR 0.85, 95% CI 0.79–0.92, p-adjusted 1.48 × 10− 2) and glioblastoma (OR 0.29, 95% CI 0.20–0.41, p-adjusted 1.33 × 10− 8; OR 0.79, 95% CI 0.72–0.87, p-adjusted 1.55 × 10− 3), respectively.
Appraising the causal role of DNA methylation on glioma risk factors
We performed two-sample MR to examine the causal role of DNAm variation at the 3 CpG sites altering risk of glioma or glioma subtypes with glioma risk factors. The results from the extensive analysis are present in Table 4. We identified 5 associations that survived the FDR corrected p-value threshold (p-adjusted < 0.05). Two of these associations were robust to colocalization and Steiger filtering. The results indicate that DNAm variation at cg05926943 and cg01561092 are associated with an increase in telomere length (OR 1.12, 95% CI 1.08–1.15, p-adjusted 3.90 × 10− 11: OR 1.04, 95% CI 1.03–1.06, p-adjusted 8.96 × 10− 7), respectively (Fig. 3).
Overlap with gene expression
DNAm variation at the 3 CpG sites (cg01561092, cg05926943, cg01584448) found to putatively influence glioma and glioblastoma risk were used to investigate hypothesis driven tissue-specific effects. We hypothesised that DNAm that influences glioma and glioblastoma risk may be influenced by gene expression in blood and brain tissue. All 3 CpG sites were annotated to the gene HEATR3 (Ensemble ID ENSG00000155393).
To evaluate the association of gene expression with glioma and glioblastoma risk at HEATR3 in blood tissue, instruments were constructed using eQTLGen Consortium (n = 31,684).
In the MR analysis, we observed evidence that survived the FDR corrected p-value threshold (p-adjusted < 0.05), colocalization and Steiger filtering, that gene expression at HEATR3 was associated with an increase in glioma risk (OR 1.20, 95% CI 1.11–1.29, p-adjusted 7.61 × 10− 6) and an increase in glioblastoma risk (OR 1.28, 95% CI 1.16–1.41, p-adjusted 2.54 × 10− 7) (Table 5).
When comparing the DNAm MR results with the gene expression MR results, the direction of effect estimated for HEATR3 is consistent with cg01584448. The direction of the estimated effect for the two CpG sites (cg01561092, cg05926943) was discordant with gene expression (Fig. 4).
To establish if the associations between the CpG sites and glioma is mediated by changes in gene expression at HEATR3 in blood tissue we applied “moloc”. Moloc assessed the likelihood that DNAm, gene expression and glioma susceptibly are driven by the same causal variant. The results indicated suggestive evidence (PPA > 70%) of colocalization between gene expression and glioma (but not by DNAm at cg01561092). Similarly, colocalization between DNAm and glioma at cg05926943 was observed but not with gene expression. The results provided evidence of two distinct causal variants for methylation and expression at cg01584448 (Table 6).
Next, to establish if there was an association between gene expression and glioma or glioblastoma risk at HEATR3 in brain tissue, instruments were constructed using data from GTEx v8 (n = 1194).
The two associations from the MR analysis survived the FDR corrected p-value threshold, however, neither showed evidence of colocalization suggesting the MR result may be biased by genetic confounding. The results from the extensive analyses are provided Table 7.
Does DNA methylation mediate the effect of risk factors on glioma?
Appraising the causal role of glioma risk factors on DNAm
We performed two-sample MR to investigate the potential causal role of allergic disease, triglycerides, LDLc, alcohol consumption, telomere length and childhood obesity with DNAm variation at 42,659 CpG sites. The MR analysis indicated little evidence of a causal role for any of the glioma related traits on DNAm variation (Bonferroni corrected P value < 0.0083) (Table 8).
Discussion
Extensive analyses were conducted to establish the role of DNAm on the causal pathway leading to glioma onset. MR evidence robust to the FDR p-value threshold and Steiger filtering identified 3 CpG sites (cg01561092, cg05926943, cg01584448) in one genomic region (HEATR3) that have a putative association with glioma and glioblastoma risk. In support of these findings, MR provided evidence that higher levels of gene expression of HEATR3 in blood tissue was associated with an increased risk of glioma and glioblastoma. MR provided little evidence to suggest any CpG sites influenced non-glioblastoma. By examining the role of DNAm variation at these 3 CpG sites with putative glioma related traits (alcohol consumption, allergic disease, childhood obesity, LDL cholesterol, triglycerides, and telomere length), we report evidence that 2 of these CpG sites (cg01561092, cg05926943) influenced telomere length. MR offered little evidence to suggest that DNAm acts as a mediator on the causal pathway between glioma related traits previously examined and glioma onset.
Higher levels of methylation at cg01584448 were associated with an increase in glioma and glioblastoma risk. Whereas higher levels of methylation at cg5926943 and cg01561092 were associated with a lower risk of glioma and glioblastoma. To elucidate the observed putative association, the CpG sites were annotated to their closest gene. As the CpG sites reside in close genomic positions they were mapped to the same gene, a known oncogene, HEATR3, which has been associated with glioma risk in previous studies44,45,46; thus, providing evidence that the genomic region is relevant. Here MR, colocalization and Steiger filtering offered further evidence that differential gene expression of HEATR3 within blood tissue increased the risk of glioma and glioblastoma. A conflicting pattern of DNAm was observed for cg5926943 and cg01561092 as they displayed an opposite correlation with gene expression. A prior study reported an inverse correlation between DNAm and gene expression for various CpGs and their closest gene, in several cancers47. Similarly, Houshdaran et al. reported that DNAm inversely correlated with gene expression in ovarian cancer cell lines48. Thus, it is possible that the inverse correlation indicates co-regulation of DNAm and gene expression with glioma development.
Due to the complex nature of this interaction between DNAm and gene expression, moloc was implemented to establish if glioma, DNAm and gene expression shared a common causal genetic variant, to provide further supporting evidence of an underlying causal association between these traits rather than findings being driven through genetic confounding (e.g., LD between an mQTL and a variant influencing glioma risk). The results from the moloc analysis indicated that gene expression colocalizes with glioma but not with DNAm at cg01561092. Similarly, colocalization between DNAm and glioma at cg05926943 was observed but not with gene expression. There was evidence of two distinct causal variants for methylation and expression at cg01584448. There is evidence of colocalization between two of the traits at each CpG site (gene expression and glioma risk; methylation and glioma risk) thus it is possible that gene expression is under the control of methylation of a region rather than specific CpG sites.
The incidence and mortality of high-grade glioma increases with age, with the median age at diagnosis of 64 years49. The 3 CpG sites putatively associated with glioma risk in this study have been linked to age in previous EWAS50. Age-specific differences in glioma susceptibility could reveal clues about glioma aetiology. Additionally, previous models of age, based on DNAm have demonstrated an ability to predict the risk of both disease and survival in pre-cancerous tissue, including brain tissue51,52,53. These findings provide a rationale to evaluate whether an association exists between these epigenetic markers and age at diagnosis in glioma and subsequently whether DNAm can act as a prognostic marker.
Prior epidemiological studies have reported that longer leukocyte telomere length is linked to an increased risk of glioma24,54. Here, we provide evidence to further elucidate the molecular mechanism between telomere length, DNAm and glioma risk. Contrary to previous studies, we observed evidence that DNAm influencing the CpG sites (cg01561092, cg05926943) decreased glioma risk and increased leukocyte telomere length. The conflicting correlation could be a result of the complexity of the association underlying glioma development. A noteworthy concern is that since methylation was studied in blood tissue, which is unlikely to accurately proxy DNAm in the brain, the associations may be biased by confounding by tissue heterogeneity.
There was little evidence to suggest the glioma related traits influence cancer development through DNAm. These null results could reflect the fact that DNAm is not a causal mediator between these traits and glioma onset, or it could be a consequence of this MR study being underpowered since the variance explained by the IV for the trait was limited. In an attempt to reduce weak instrument bias, we obtained the summary data to proxy the glioma related traits from GWAS with a large sample size to improve the reliability of the causal estimates and we only used SNPs with an F statistic greater than 10.
An important consideration in the interpretation of this analysis is explained in detail by Min JL et al.42. The blood measured mQTL data utilised in this chapter, obtained from the GoDMC data set42, cannot be regarded as mediating the genetic association to the trait even when there is colocalization evidence of a shared genetic variants. Rather, when DNAm shows evidence of colocalizing with a complex trait, such as glioma and telomere length, then this is likely due to common cause. Therefore, despite CpG sites showing evidence of colocalization, it is possible that second instrumental variable assumption has been violated, as there could be a common cause for both DNAm and glioma risk. To establish if the CpG sites identified here are truly implicated in glioma onset more detailed analyses are required to triangulate evidence and to fully understand the mechanistic pathways.
Another limitation of this study is the fact that we used single-instrument MR to examine causal relationships and consequently was not properly able to appraise possible horizontal pleiotropic effects. We took measures to minimise this possibility: instruments were limited to cis-mQTLs as trans-mQTLs are more likely to have effects on methylation and glioma risk via distinct mechanisms; and colocalization techniques were implemented to test whether the putative causal variant is shared by the exposure (e.g., risk factor or DNAm) and the outcome (e.g., glioma or DNAm)55,56,57 thus increasing the probability that the two traits have a shared causal mechanism55,58.
Despite these limitations, this analysis has numerous strengths, including the use of two-sample MR to examine the causal role of DNAm in glioma risk by exploiting a vast epigenetic resource and the largest glioma GWAS. Thus, leading to increased statistical power and precision of effect estimates. Furthermore, to ensure IVs were valid, genetic instruments were constructed using a strict inclusion criteria and quality control steps were undertaken. For example, only cis-variants were included and instrument strength was checked. In addition, the orientation of the causal effect was inferred to reduce the likelihood of reverse causation.
Methods
Reported results from all analyses are MR effect estimates that met either the false discovery rate (FDR) threshold (when DNAm or gene expression is the exposure) or the Bonferroni-corrected p-value threshold (glioma related traits is the exposure), showed evidence of colocalization59 to rule out genetic confounding, and displayed little evidence to suggest reverse causation through Steiger filtering (Fig. 2)60. All MR analyses were conducted using the “TwoSampleMR” package in R studio (version 4.1.0) using the computational facilities of the Advanced Computing Research Centre, University of Bristol (http://www.bristol.ac.uk/acrc/).
When DNAm or gene expression were instrumented as the exposure, we opted to use a more liberal FDR corrected p-value threshold, as we did not expect complete independence of all statistical tests (within overall glioma, glioblastoma, or non-glioblastoma analyses), compared to the Bonferroni p-value threshold used, when a glioma related trait was instrumented as the exposure.
Mendelian randomization estimate
In cases where there was a single nucleotide polymorphism (SNP) to act as a proxy for the exposure of interest (e.g., DNAm), the causal effect estimates from MR were calculated using the Wald ratio (βGD/βGP)61 and standard errors approximated using the delta method62. Where the exposure (e.g., DNAm variation at a CpG site) was instrumented by multiple independent SNPs (r2 < 0.001), causal effect estimates were calculated using the random effects inverse variance weighted (IVW) method to allow overdispersion, where the Wald ratios were combined into a single causal estimate by meta-analysis63.
Colocalization
IV2 violations can occur through genetic confounding if genetic variants are correlated through linkage LD (Fig. 3). Therefore, for associations which met the p-value threshold (FDR < 0.05) we applied pairwise conditional and colocalization (PWCoCo)57 to determine whether the genetic variant associated with the exposure, e.g., DNAm, was the same genetic variant altering the outcome e.g., glioma (i.e., as identified in glioma genome wide association study [GWAS]), thus permitting evaluation of the presence of genetic confounding64. Colocalization requires providing prior probabilities that any random SNP within the genomic region of interest is associated with the exposure, the outcome or both (p1 = 1e−4, p2 = 1e−4, p12 = 1e−5). SNPs from a ± 250KBP window were extracted around the instrumented SNP(s) for each putative causal SNP from the exposure and outcome GWAS. A posterior probability for H4 > 0.8 was designated as “strong” and 0.7 > a posterior probability for H4 < 0.8 as “suggestive” evidence.
Directionality test
To increase the likelihood that MR infers the correct causal direction between the exposure (e.g., DNAm) and the outcome (e.g., glioma), we applied the Steiger filtering method to test for reverse causation60. Steiger filtering removes SNPs that explain more of the variance in the outcome than the exposure and therefore the MR estimate is less likely to biased by misspecification in the MR model. Steiger filtering was performed for the putative causal variants identified in the MR analysis that showed evidence of colocalization.
Hypothesis 1
A summary of the research questions addressed in hypothesis 1 is displayed in Fig. 6.
Step 1: evaluating the relationship between DNA methylation and glioma risk
Instrument selection
Two-sample MR was implemented to ascertain the potential causal effects of circulating DNAm on glioma risk. To create genetic IVs for DNAm as the exposure we used effect estimates for germline cis-SNPs (SNPs within a ± 250KBP window of the CpG site) robustly associated with DNAm at CpG site (mQTL) at genome wide significance (P < 5 × 10–8)42 that had undergone LD clumping (r2 < 0.001) from the mQTL database Genetics of DNA Methylation Consortium (GoDMC) [http://www.godmc.org.uk/] (n = 32,851)42. To measure instrument strength, we examined the variance in DNAm explained by the mQTLs (R2) and the F statistic65.
Outcome selection
For the glioma outcome, summary data were obtained from a GWAS meta-analysis of 12,488 glioma cases and 18,160 controls66. MR analyses were performed to assess the causal impact of DNAm variation on glioma subtypes: glioblastoma (6,183 cases) and non-glioblastoma (5,820 cases).
Mendelian randomization effect estimate and p-value threshold
MR effect estimates are reported as odds ratios (OR) (95% confidence intervals (CI)) per 1 standard deviation (SD) increase in genetically proxied DNAm.
Step 2: evaluating the relationship between DNA methylation and glioma related traits
Instrument selection
As described above, IVs for DNAm were generated (r2 < 0.001, P < 5 × 10–8) for CpG sites associated with either glioma, glioblastoma, and/or non-glioblastoma in step 1 above.
Outcome selection
For the outcome, summary data for the putative glioma related traits24 (genetically predicted leukocyte telomere length, allergic disease, alcohol consumption, childhood extreme obesity, LDLc and triglyceride levels) was obtained from MR-Base (a curated data base that contains complete GWAS results)67 (Table 9).
Follow up tissue-specific Mendelian randomization analysis.
For the CpG sites that showed robust evidence of an effect with glioma risk, we investigated whether variation in tissue-specific gene expression was responsible for the effect with glioma risk. For the analysis we utilised blood tissue by incorporating gene expression data from the eQTLGen Consortium (n = 31,684) (https://www.eqtlgen.org/)68 and brain tissue utilising gene expression data from 13 brain tissues from The Genotype-Tissue Expression project (GTEx) v8 (n = 1194)69.
CpG sites were annotated to genes using the R package meffil70. IVs for genes were constructed using effect estimates for germline cis-SNPs (within a ± 250KBP window) associated with gene expression variation in brain and blood, namely expression quantitative trait loci (eQTLs) at genome wide significance (P < 5 × 10–8)42 that had undergone LD clumping (r2 < 0.001). To measure instrument strength, we examined the variance in gene expression explained by the eQTLs (R2) and the F statistic65.
Multiple trait colocalization
For genes that appeared to overlap with the CpG sites of interest we applied multiple trait colocalization (moloc)71 to investigate whether the same genetic variant influences proximal DNAm, proximal gene expression and glioma risk. Such analyses can provide evidence to support gene expression and DNAm residing on the same causal pathway to glioma onset72. We implemented “moloc” using data from three different data sources: DNAm data from the mQTL database GoDMC [http://www.godmc.org.uk/] (n = 32,851)42, gene expression data from the eQTLGen Consortium (n = 31,684) (https://www.eqtlgen.org/)68 and GWAS meta-analysis data for glioma66. Moloc default prior probabilities were implemented (p1 = 1 × 10–4, p2 = 1 × 10–6 and p3 = 1 × 10–7), p1 was used for one association, p2 for two associations, and p3 for colocalization of all three associations. We examined colocalization with expression of all genes with a ± 250KBP window of the CpG site of interest. At least 50 variants (minor allele frequency [MAF] > 0.05) common to all three datasets were required for the analysis. A posterior probability of greater than 70% was considered suggestive evidence of colocalization. All analyses were undertaken in R version 4.1.0.
Hypothesis 2
A summary of the research questions addressed in hypothesis 2 is displayed in Fig. 7.
Step 1: evaluating the relationship between glioma related trait and DNA methylation
Genetic instruments for the glioma related traits were collated from MR-Base67 or directly from the relevant GWAS (details of studies used to obtain genetic instruments are given in Table 10).
Genetic instruments were created using SNPs with an F statistic equal to or greater than 10, shown to be robustly (P < 5 × 10− 8) and independently (r2 < 0.001) associated with the glioma related trait under examination in individuals of European ancestry.
Outcome selection
For the outcome, summary data were obtained from the mQTL database GoDMC [http://www.godmc.org.uk/] (n = 32,851)42.
Mendelian randomization estimate and p-value threshold
The MR estimate was expressed as SD increase in methylation per unit increase in the glioma related trait. A Bonferroni-corrected p-value threshold, P value < 0.0083 (0.05/6 as there were 6 traits included in the analysis), was used to evaluate the strength of the statistical evidence.
Step 2: evaluating the relationship between DNA methylation associated with glioma related traits and glioma risk
Using IVs for the CpG sites that were influenced by putative glioma related traits, we examined if DNAm variation at these CpG sites had an MR effect on glioma risk using the glioma GWAS (12,488 glioma cases and 18,160 controls)66. MR effect estimates are reported as the OR (95% CI) per 1 SD increase in genetically proxied DNAm.
Ethics approval and consent to participate
Ethical approval was not required for this specific analysis as the entirety of the data was sourced from the summary statistics of a published GWAS and no individual-level data were used.
Data availability
Genetic instrument for DNAm can be obtained from the mQTL database GoDMC [http://www.godmc.org.uk/] (n = 32,851). Genetic instruments used to proxy the six risk factors can be found through MR-Base (http://www.mrbase.org/) or from the individual reference papers. Meta-analysed glioma GWAS data were acquired from the study by Melin et al.66., which is a meta-analysis of eight independent GWAS studies (UK73, French74, German75, MDA44, UCSF-SFAGS44, GliomaScan76, GICC64 and UCSF/Mayo77). Genotype data from the Glioma International Case–Control Consortium Study GWAS are available from the database of Genotypes and Phenotypes (dbGaP) under accession phs001319.v1.p1. Genotypes from the GliomaScan GWAS can be accessed through dbGaP accession phs000652.v1.p1.
Abbreviations
- CI:
-
Confidence interval
- GWAS:
-
Genome-wide association study
- IV:
-
Instrumental variable
- MR:
-
Mendelian randomization
- IVW:
-
Inverse variance weighted
- OR:
-
Odds ratio
- SNP:
-
Single nucleotide polymorphism
- SD:
-
Standard deviation
- LDLc:
-
Low-density lipoprotein cholesterol
- DNAm:
-
DNA methylation
- EWAS:
-
Epigenome-wide association study
- mQTL:
-
Methylation quantitative trait loci
- CpG:
-
Cytosine-phosphate-guanine
- LD:
-
Linkage disequilibrium
- GoDMC:
-
Genetics of DNA methylation consortium
References
Burnet, N. G. et al. Years of life lost (YLL) from cancer is an important measure of population burden–and should be considered when allocating research funds. Br. J. Cancer 92(2), 241–245 (2005).
Larjavaara, S. et al. Incidence of gliomas by anatomic location. Neuro. Oncol. 9(3), 319–325 (2007).
Gousias, K. et al. Descriptive epidemiology of cerebral gliomas in northwest Greece and study of potential predisposing factors, 2005–2007. Neuroepidemiology 33(2), 89–95 (2009).
Ostrom, Q. T. et al. The epidemiology of glioma in adults: A “state of the science” review. Neuro Oncol. 16(7), 896–913 (2014).
Visser, O. et al. Survival of adults with primary malignant brain tumours in Europe; Results of the EUROCARE-5 study. Eur. J. Cancer. 51(15), 2231–2241 (2022).
UK, C.R. Survival for All Types of Brain Tumour. (2016). http://www.cancerresearchuk.org/health-professional/cancer-statistics/statistics-by-cancer-type/brain-other-cns-and-intracranial-tumours/incidence#collapseTen. Accessed 9 April 2018.
Wiedmann, M. K. H. et al. The impact of body mass index and height on the risk for glioblastoma and other glioma subgroups: A large prospective cohort study. Neuro Oncol. 19(7), 976–985 (2017).
Seliger, C. et al. Diabetes, use of antidiabetic drugs, and the risk of glioma. Neuro Oncol. 18(3), 340–349 (2016).
Houben, M. P. W. A. et al. Hypertension as a risk factor for glioma? Evidence from a population-based study of comorbidity in glioma patients. Ann. Oncol. 15(8), 1256–1260 (2004).
Linos, E. et al. Atopy and risk of brain tumors: A meta-analysis. J. Natl. Cancer Inst. 99(20), 1544–1550 (2007).
Holick, C. N. et al. Coffee, tea, caffeine intake, and risk of adult glioma in three prospective cohort studies. Cancer Epidemiol. Biomark. Prev. 19(1), 39–47 (2010).
Preston-Martin, S. & Mack, W. Gliomas and meningiomas in men in Los Angeles County: Investigation of exposures to N-nitroso compounds. IARC Sci. Publ. 105, 197–203 (1991).
Kaplan, S., Novikov, I. & Modan, B. Nutritional factors in the etiology of brain tumors: Potential role of nitrosamines, fat, and cholesterol. Am. J. Epidemiol. 146(10), 832–841 (1997).
Zhao, L., Zheng, Z. & Huang, P. Diabetes mellitus and the risk of glioma: A meta-analysis. Oncotarget 7(4), 4483–4489 (2016).
Seliger, C. et al. Statin use and risk of glioma: Population-based case-control analysis. Eur. J. Epidemiol. 31(9), 947–952 (2016).
Zhou, S. et al. Association between vitamin C intake and glioma risk: Evidence from a meta-analysis. Neuroepidemiology 44(1), 39–44 (2015).
Li, H. X. et al. A meta-analysis of association between pesticides exposure and glioma risk in adults. J. Craniofac. Surg. 26(7), e672–e673 (2015).
Lee, S. T. et al. Interaction of allergy history and antibodies to specific varicella-zoster virus proteins on glioma risk. Int. J. Cancer 134(9), 2199–2210 (2014).
Andersen, L. et al. Hormonal contraceptive use and risk of glioma among younger women: A nationwide case-control study. Br. J. Clin. Pharmacol. 79(4), 677–684 (2015).
Little, R. B. et al. Anthropometric factors in relation to risk of glioma. Cancer Causes Control 24(5), 1025–1031 (2013).
Malerba, S. et al. A meta-analysis of coffee and tea consumption and the risk of glioma in adults. Cancer Causes Control 24(2), 267–276 (2013).
Kabat, G. C. et al. Reproductive factors and exogenous hormone use and risk of adult glioma in women in the NIH-AARP diet and health study. Int. J. Cancer 128(4), 944–950 (2011).
Howell, A. E. et al. Use of mendelian randomization for identifying risk factors for brain tumors. Front. Genet. 9, 525–525 (2018).
Howell, A. E. et al. Testing for causality between systematically identified risk factors and glioma: A Mendelian randomization study. BMC Cancer 20(1), 508 (2020).
Hamilton, J. P. Epigenetics: Principles and practice. Dig. Dis. 29(2), 130–135 (2011).
Etcheverry, A. et al. DNA methylation in glioblastoma: Impact on gene expression and clinical outcome. BMC Genomics 11(1), 701 (2010).
Brat, D. J. et al. Comprehensive, integrative genomic analysis of diffuse lower-grade gliomas. N. Engl. J. Med. 372(26), 2481–2498 (2015).
Noushmehr, H. et al. Identification of a CpG island methylator phenotype that defines a distinct subgroup of glioma. Cancer Cell 17(5), 510–522 (2010).
Mock, A. et al. LOC283731 promoter hypermethylation prognosticates survival after radiochemotherapy in IDH1 wild-type glioblastoma patients. Int. J. Cancer 139(2), 424–432 (2016).
Wiestler, B. et al. Integrated DNA methylation and copy-number profiling identify three clinically and biologically relevant groups of anaplastic glioma. Acta Neuropathol. 128(4), 561–571 (2014).
Lock, E. F. & Dunson, D. B. Bayesian genome- and epigenome-wide association studies with gene level dependence. Biometrics 73(3), 1018–1028 (2017).
Sturm, D. et al. Hotspot mutations in H3F3A and IDH1 define distinct epigenetic and biological subgroups of glioblastoma. Cancer Cell 22(4), 425–437 (2012).
Wiestler, B. et al. Assessing CpG island methylator phenotype, 1p/19q codeletion, and MGMT promoter methylation from epigenome-wide data in the biomarker cohort of the NOA-04 trial. Neuro Oncol. 16(12), 1630–1638 (2014).
Schliesser, M. G. et al. Prognostic relevance of miRNA-155 methylation in anaplastic glioma. Oncotarget 7, 50 (2016).
Sabedot, T. S. et al. A serum-based DNA methylation assay provides accurate detection of glioma. Neuro Oncol. 23(9), 1494–1508 (2021).
Noushmehr, H. et al. Detection of glioma and prognostic subtypes by non-invasive circulating cell-free DNA methylation markers. BioRxiv 2019, 601245 (2019).
Xu, C.-J. et al. Shared DNA methylation signatures in childhood allergy: The MeDALL study. J. Allergy Clin. Immunol. 147(3), 1031–1040 (2021).
Dong, Y. et al. Associations between global DNA methylation and telomere length in healthy adolescents. Sci. Rep. 7(1), 4210 (2017).
He, F. et al. Association between DNA methylation in obesity-related genes and body mass index percentile in adolescents. Sci. Rep. 9(1), 2079 (2019).
Dahlin, A. M. et al. Relation between established glioma risk variants and DNA methylation in the tumor. PLoS ONE 11(10), e0163067 (2016).
Relton, C. L. & Smith, G. D. Two-step epigenetic Mendelian randomization: A strategy for establishing the causal role of epigenetic processes in pathways to disease. Int. J. Epidemiol. 41(1), 161–176 (2012).
Min, J. L. et al. Genomic and phenomic insights from an atlas of genetic effects on DNA methylation. MedRxiv 2020, 20180406 (2020).
Simon, M. et al. Genetic risk profiles identify different molecular etiologies for glioma. Clin. Cancer Res. 16(21), 5252–5259 (2010).
Shete, S. et al. Genome-wide association study identifies five susceptibility loci for glioma. Nat. Genet. 41(8), 899–904 (2009).
Rafnar, T. et al. Sequence variants at the TERT-CLPTM1L locus associate with many cancer types. Nat. Genet. 41(2), 221–227 (2009).
Egan, K. M. et al. Cancer susceptibility variants and the risk of adult glioma in a US case-control study. J. Neurooncol. 104(2), 535–542 (2011).
Spainhour, J. C. et al. Correlation patterns between DNA methylation and gene expression in the cancer genome atlas. Cancer Inform. 18, 1176935119828776 (2019).
Houshdaran, S. et al. DNA methylation profiles of ovarian epithelial carcinoma tumors and cell lines. PLoS ONE 5(2), e9359 (2010).
Young, J. S. et al. Management of glioblastoma in elderly patients. J. Neurol. Sci. 380, 250–255 (2017).
Mulder, R. H. et al. Epigenome-wide change and variation in DNA methylation in childhood: Trajectories from birth to late adolescence. Hum. Mol. Genet. 30(1), 119–134 (2021).
Klutstein, M. et al. Contribution of epigenetic mechanisms to variation in cancer risk among tissues. Proc. Natl. Acad. Sci. USA 114(9), 2230–2234 (2017).
Yang, Z. et al. Correlation of an epigenetic mitotic clock with cancer risk. Genome Biol. 17(1), 205 (2016).
Curtius, K. et al. A molecular clock infers heterogeneous tissue age among patients with Barrett’s esophagus. PLoS Comput. Biol. 12(5), e1004919 (2016).
Haycock, P. C. et al. Association between telomere length and risk of cancer and non-neoplastic diseases: A Mendelian randomization study. JAMA Oncol. 3(5), 636–651 (2017).
Giambartolomei, C. et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10(5), e1004383 (2014).
Robinson, J. W. et al. Transcriptome-wide Mendelian randomization study prioritising novel tissue-dependent genes for glioma susceptibility. Sci. Rep. 11(1), 2329 (2021).
Zheng, J. et al. Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. Nat. Genet. 52(10), 1122–1131 (2020).
Guo, H. et al. Integration of disease association and eQTL data using a Bayesian colocalisation approach highlights six candidate causal genes in immune-mediated diseases. Hum. Mol. Genet. 24(12), 3305–3313 (2015).
Fortune, M. D. et al. Statistical colocalization of genetic risk variants for related autoimmune diseases in the context of common controls. Nat. Genet. 47(7), 839–846 (2015).
Hemani, G., Tilling, K. & Smith, G. D. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. PLOS Genet. 13(11), e1007081 (2017).
Wald, A. The fitting of straight lines if both variables are subject to error. Ann. Math. Stat. 11(3), 284–300 (1940).
Thomas, D. C., Lawlor, D. A. & Thompson, J. R. Re: Estimation of bias in nongenetic observational studies using “Mendelian triangulation” by Bautista et al. Ann. Epidemiol. 17(7), 511–513 (2007).
Burgess, S., Butterworth, A. & Thompson, S. G. Mendelian randomization analysis with multiple genetic variants using summarized data. Genet. Epidemiol. 37(7), 658–665 (2013).
Amirian, E. S. et al. The glioma international case-control study: A report from the genetic epidemiology of glioma international consortium. Am. J. Epidemiol. 183(2), 85–91 (2016).
Burgess, S., Thompson, S. G. & Collaboration, C. C. G. Avoiding bias from weak instruments in Mendelian randomization studies. Int. J. Epidemiol. 40(3), 755–764 (2011).
Melin, B. S. et al. Genome-wide association study of glioma subtypes identifies specific differences in genetic susceptibility to glioblastoma and non-glioblastoma tumors. Nat. Genet. 49(5), 789–794 (2017).
Hemani, G. et al. MR-Base: A platform for systematic causal inference across the phenome using billions of genetic associations. BioRxiv (2016).
Võsa, U. et al. Unraveling the polygenic architecture of complex traits using blood eQTL metaanalysis. Nat. Genet. 2018, 447367 (2018).
Qi, T. et al. Identifying gene targets for brain-related traits using transcriptomic and methylomic data from blood. Nat. Commun. 9(1), 2282 (2018).
Min, J. L. et al. Meffil: Efficient normalization and analysis of very large DNA methylation datasets. Bioinformatics 34(23), 3983–3989 (2018).
Giambartolomei, C. et al. A Bayesian framework for multiple trait colocalization from summary association statistics. Bioinformatics 34(15), 2538–2545 (2018).
Richardson, T. G. et al. Systematic Mendelian randomization framework elucidates hundreds of CpG sites which may mediate the influence of genetic variants on disease. Hum. Mol. Genet. 27(18), 3293–3304 (2018).
Cardis, E. et al. The INTERPHONE study: Design, epidemiological methods, and description of the study population. Eur. J. Epidemiol. 22(9), 647–664 (2007).
Sanson, M. et al. Chromosome 7p11.2 (EGFR) variation influences glioma risk. Hum. Mol. Genet. 20(14), 2897–2904 (2011).
Kinnersley, B. et al. Genome-wide association study identifies multiple susceptibility loci for glioma. Nat. Commun. 6, 8559 (2015).
Rajaraman, P. et al. Genome-wide association study of glioma and meta-analysis. Hum. Genet. 131(12), 1877–1888 (2012).
Wrensch, M. et al. Variants in the CDKN2B and RTEL1 regions are associated with high-grade glioma susceptibility. Nat. Genet. 41(8), 905–908 (2009).
Acknowledgements
We would like to thank the GoDMC and the authors from the meta-analysed glioma GWAS (UK, French, German, MDA, UCSF-SFAGS, GliomaScan, GICC and UCSF/Mayo) for allowing us to access the summary statistics for this analysis.
Funding
AHE and KKM were supported by the Brain Tumour Bank and Research Fund, Southmead hospital charity (Charity Registration Number:1055900). RMM and CR are supported by a Cancer Research UK Programme Grant, the Integrative Cancer Epidemiology Programme (C18281/A29019) and the National Institute for Health Research (NIHR) Bristol Biomedical Research Centre which is funded by the National Institute for Health Research and is a partnership between University Hospitals Bristol NHS Trust, Weston NHS Foundation Trust and the University of Bristol. RMM is a National Institute for Health Research Senior Investigator (NIHR202411). The views expressed in this publication are those of the author(s) and not necessarily those of the NIHR or the UK Department of Health and Social Care.
Author information
Authors and Affiliations
Contributions
K.K.M. and J.Z. managed the project, supported by C.R. and R.M.M. who acted as co-supervisors to A.E.H. A.E.H. drafted the manuscript. A.E.H. and J.Z. performed statistical analyses. A.E.H. acquired and analysed the data. All authors made substantial contributions and revisions to the drafts and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Howell, A.E., Relton, C., Martin, R.M. et al. Role of DNA methylation in the relationship between glioma risk factors and glioma incidence: a two-step Mendelian randomization study. Sci Rep 13, 6590 (2023). https://doi.org/10.1038/s41598-023-33621-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-33621-1