Abstract
To better understand DNA methylation in Alzheimer’s disease (AD) from both mechanistic and biomarker perspectives, we performed an epigenome-wide meta-analysis of blood DNA methylation in two large independent blood-based studies in AD, the ADNI and AIBL studies, and identified 5 CpGs, mapped to the SPIDR, CDH6 genes, and intergenic regions, that are significantly associated with AD diagnosis. A cross-tissue analysis that combined these blood DNA methylation datasets with four brain methylation datasets prioritized 97 CpGs and 10 genomic regions that are significantly associated with both AD neuropathology and AD diagnosis. An out-of-sample validation using the AddNeuroMed dataset showed the best performing logistic regression model includes age, sex, immune cell type proportions, and methylation risk score based on prioritized CpGs in cross-tissue analysis (AUC = 0.696, 95% CI: 0.616 − 0.770, P-value = 2.78 × 10−5). Our study offers new insights into epigenetics in AD and provides a valuable resource for future AD biomarker discovery.
Similar content being viewed by others
Introduction
Alzheimer’s disease (AD) is the most common cause of dementia and affects about 11% of people 65 years and older in the US1. With the rising elderly population, AD has become a major public health concern. In addition to genetics, it has been increasingly appreciated that epigenetic marks such as DNA methylation also play an important role in AD2,3,4,5. Currently, there is a lack of objective, inexpensive, and minimally invasive biomarkers for AD. Methylated DNA is more stable than mRNA and provides an excellent source of biomarkers6. A number of recent studies identified DNA methylation differences in blood samples of AD subjects. Fransquet et al. (2018)7 conducted a systematic review of 36 studies comparing DNA methylation between AD cases and controls in blood samples and found that 67% of these studies reported significant methylation differences between the groups. Most recently, it was shown that DNA methylation changes could be detected in blood at least three years before the onset of dementia symptoms8. In particular, blood methylation levels at several candidate loci9,10,11,12,13, such as the COASY, HOXB6, and APP genes, were significantly different between AD patients and healthy controls.
In this study, we studied DNA methylation differences in AD from both mechanistic and biomarker perspectives. To help improve power, we meta-analyzed two large blood-based AD EWAS measured by the same Illumina Infinium MethylationEPIC BeadChips, and conducted by the Alzheimer’s Disease Neuroimaging Initiative (ADNI)14 and the Australian Imaging, Biomarkers, and Lifestyle (AIBL)15 consortiums recently. In addition, we also performed a cross-tissue meta-analysis by combining the DNA methylation datasets measured on blood samples with four additional DNA methylation datasets measured on over one thousand brain prefrontal cortex samples, to prioritize significant methylation differences associated with both AD diagnosis and AD neuropathology. To understand the functional roles of the methylation differences, we conducted several integrative analyses of methylations with genetic variants, gene expressions, and transcription factor binding sites. Moreover, we evaluated the feasibility of the DNA methylation differences as potential biomarkers for diagnosing AD in an external blood sample dataset. Our analysis results provide a valuable resource for future mechanistic and biomarker studies in AD.
Results
Study datasets
Our meta-analysis of blood samples included a total of 1284 DNA methylation samples (427 AD cases, 857 cognitive normals) from two large independent AD EWAS, the ADNI14 and the AIBL16 studies. The ADNI study is a longitudinal study with DNA methylation samples collected at baseline and multiple follow-up visits ranging from 6 months to 60 months14. The average and median times between the first and last visits for the subjects were 18 months and 24 months, respectively. Similarly, the AIBL study is also a longitudinal study with DNA methylation samples collected at 18-month follow-up. A total of 797 unique individuals older than 65 years of age were included in this study. The mean ages of the subjects were 77.13 (±6.64) years and 73.37 (±5.79) years, and the percentages of females were 46% and 54% in the ADNI and AIBL studies, respectively (Supplementary Data 1).
Meta-analysis identified methylation differences significantly associated with AD at individual CpGs and genomic regions in the blood
After adjusting covariate variables age, sex, batch, and immune cell-type proportions and correcting inflation in each dataset (see details in Methods), the inverse-variance fixed-effects meta-analysis model identified 5 CpGs, mapped to the SPIDR, CDH6 genes, and intergenic regions at 5% false discovery rate (FDR) (Table 1, Supplementary Fig. 1). Among the genes that are associated with these CpGs, the SPIDR gene encodes the scaffolding protein involved in DNA repair, which regulates the specificity of homologous recombination to achieve a high degree of fidelity17. The CDH6 gene encodes a cadherins protein, which regulates synaptogenesis and synaptic plasticity18,19,20. Recently, CDH6 levels in plasma were also shown to be associated with AD in carriers of the APOE ε4 allele21. The most significant probe cg03429569 is located at 3018 bp upstream of the HOXD10 gene, a member of the HOX family, which was recently shown to be involved in the Rho/ROCK signaling pathway, essential for neuronal degeneration22 and regeneration23. An additional 45 CpGs were identified at a more relaxed significance threshold of P-value < 1 × 10−5 (Fig. 1, Supplementary Data 2). For these 50 AD-associated CpGs, the odds ratios ranged from 0.833 to 1.156 in the ADNI dataset and 0.748 to 1.178 in the AIBL dataset. In the meta-analysis, the mean and median effect size of methylation beta values (i.e., parameter estimate for methylation beta values in the logistic regression model) across all CpGs were −0.33 and −0.27, respectively. The mean and median effect size of methylation beta values for the 50 AD-associated CpGs were −3.20 and −6.00, respectively. The majority of the 50 significant CpGs were hypomethylated in AD subjects (35 CpGs), located outside CpG islands or shores (44 CpGs), or were in distal regions located greater than 2k bp from the TSS (41 CpGs). These observations are consistent with the knowledge that during aging, the strongest risk factor for AD, DNA methylation levels at intergenic regions are marked with hypomethylation24. Only a small number (9 CpGs) of these 50 CpGs were located in promoters of genes, which included NR1I2, STMN2, TREML1, FKBP5, EIF2D, PXK, UPK3B, TDGF1, and RCCD1.
Using meta-analysis P-values for individual CpGs as input, comb-p25 software identified 9 differentially methylated regions (DMRs) at 5% Sidak P-value after multiple comparison corrections (Table 2). The number of CpGs in these DMRs ranged from 3 to 9. There was little overlap between the DMRs and the AD-associated CpGs; only 3 DMRs overlapped with the top 50 CpGs. Among the 9 DMRs, the majority (6 out of 9) were hypomethylated in AD. Five DMRs were in the promoter regions of the PM20D1, NNAT, EIF2D/LGTN, and C1orf65 genes. The PM20D1 gene is associated with response to accumulation of amyloid-β in AD brains26,27, the NNAT gene is associated with neurodegeneration28, and the EIF2D gene is critical for adaption to cellular stress in neurodegenerative diseases such as AD29. Also, blood methylation level at cg27202708 in the C1orf65 DMR was one of 14 CpG sites previously shown to be associated with regional brain volumes measured by magnetic resonance imaging30. Interestingly, among these significant CpGs and DMRs, 20 CpGs and 3 DMRs were also located in enhancer regions (Supplementary Data 2, Table 2), which are regulatory DNA sequences that transcription factors bind to activate gene expressions31,32. Taken together, these results demonstrated the results of our meta-analysis are consistent with recent epigenomics literature in brain research. In addition to replicating methylation differences in genes previously implicated in AD (e.g., PM20D1), we also nominated additional genes associated with AD (e.g., C1orf65).
Cross-tissue meta-analysis identified AD-associated DNA methylation differences in both brain and blood
To identify CpGs and genomic regions with DNA methylation differences in both brain and blood samples, we next performed a cross-tissue meta-analysis of six datasets by additionally including four brain prefrontal cortex (PFC) samples datasets, generated by the ROSMAP2, Mt. Sinai4, London3, and Gasparoni33 methylation studies, along with the two blood sample datasets described above. We previously meta-analyzed these four PFC brain datasets and identified a number of CpGs significantly associated with AD Braak stage5, a standardized measure of neurofibrillary tangle burden determined at autopsy34. Supplementary Data 3 includes detailed information (e.g., Braak stage, clinical diagnosis, PMI) for the brain samples. At 5% FDR, our cross-tissue meta-analysis identified 365 CpGs and 40 DMRs. We then prioritized 97 CpGs and 10 DMRs by requiring these CpGs and DMRs to be also FDR-significant (i.e., FDR < 0.05) in our previous brain sample meta-analysis5 and at least nominally significant (i.e., P-value < 0.05) in our current blood meta-analysis (Fig. 2). Among these prioritized CpGs and DMRs, about half of them (52 CpGs and 6 DMRs) were located in enhancer regions31 (Supplementary Data 4, Table 3). Also, the majority (74 CpGs and 8 DMRs) showed opposite directions of changes in the brain and blood. Among the top 10 most significant prioritized CpGs and DMRs, only a few methylation differences showed a consistent direction of change in both tissues: one CpG (cg05157625) and one DMR (chr1:167090618-167090757) were mapped to gene body of the RIN3 gene and intergenic region, respectively. A second DMR (chr17:62009607-62009835) was located in the promoter region of the CD79B gene (Table 3). Beyond PFC, additional brain regions in the cortex, such as temporal gyrus (TG) and entorhinal cortex (EC) are often also affected by neurodegeneration in AD. Smith et al. studied differential methylation in the cortex associated with AD Braak stage and identified 236, 95, and 10 significant CpGs (at 5% Bonferroni adjusted P-value) in the PFC, TG and EC, respectively35. Our cross-tissue analyses of the blood samples with brain regions in the TC and EC nominated significant DNA methylation differences at 8 CpGs and 1 CpG, respectively (Supplementary Figs. 2, 3, Supplementary Data 5). These CpGs and DMRs that are significant in both brain and blood tissues highlighted AD-associated DNA methylation differences in the periphery that are also altered in the brain.
Integrative multi-omics analyses revealed functional implications of the DNA methylation differences in AD
To better understand the functional roles of these significant DMRs and CpGs, we next performed several integrative analyses using matched DNA methylation and gene expression samples measured on the same subjects. More specifically, for blood sample analysis, we analyzed matched blood samples from 265 independent subjects (84 AD cases and 181 controls) in the ADNI study. For brain sample analysis, we analyzed matched samples measured on prefrontal cortex tissues of 529 subjects (428 AD cases and 101 controls) in the ROSMAP study.
Correlation of methylation levels of significant CpGs and DMRs in AD with expressions of nearby genes
We first correlated DNA methylation levels of the significant DMRs or CpGs with the expression levels of genes found in their vicinity. For the 50 CpGs that reached P-value < 10−5 in blood sample meta-analysis, after removing effects of covariate variables in both DNA methylation and gene expression levels separately (Methods), we found 2 CpGs (cg07886485 and cg23963071) located in promoter regions of the PXK and SERPINB9 genes, respectively, were significantly associated with target gene expression levels at 5% FDR (Supplementary Data 6a, Supplementary Fig. 4). One DMR (chr1:205819345-205819464), also located in the promoter region, showed a strong negative association with the gene expression level of the target gene PM20D1 (P-value < 2.22 × 10−16, FDR < 2.22 × 10−16).
Among the 97 CpGs and 10 DMRs that were significant in the cross-tissue analysis of brain and blood samples (Fig. 2), no methylation-gene expression pairs reached 5% FDR in the ADNI dataset. In the ROSMAP36 brain sample dataset, after removing covariate effects in DNA methylation and gene expression separately, 10 CpGs and one DMR were significantly associated with their target genes (Supplementary Data 6b). Among them, DNA methylations at 7 CpGs and 1 DMR, located on MYO1F, RIN3, HOXA7, T, PRAM1, CSDC2, L3MBTL2, and THBS1 genes, showed the same direction of association with target gene expressions in both brain and blood samples. The greater number of methylation to gene expression associations detected in brain samples compared to blood samples could be due to the larger sample size of matched methylation-RNA brain samples available (529 ROSMAP brain samples vs. 265 ADNI blood samples).
In this section, we identified CpGs and DMRs that influence target gene expression directly. In the next section, we discuss identifying CpG methylations that influence target gene expression indirectly by modulating transcription factor activity.
MethReg integrative analysis associated CpGs with putative transcription factors and target genes
Transcription factors (TFs) are proteins that bind to DNA to facilitate transcription. Recent studies demonstrated that CpG methylation-dependent transcriptional regulation is a widespread phenomenon37,38,39. In particular, the binding of TFs onto DNA can be affected by DNA methylation, and DNA methylation can also be altered by proteins associated with the TFs. To better understand the regulatory roles of the AD-associated CpGs, we next performed an integrative analysis of DNA methylation, gene expression, and TF binding data to prioritize CpG-TF-target gene triplets in which regulatory activities of the TFs on target gene expressions are most likely influenced by CpG methylations, using our recently developed MethReg R package40. To estimate TF activities, the R package GSVA41, which assesses expression levels of transcriptional targets of the TF proteins, was used.
At 5% FDR, our analysis of the ADNI blood samples identified a total of 8 CpG-TF-target gene triplets (Supplementary Data 7, Supplementary Figs. 5–7). The methylation-sensitive TFs in these triplets were involved in various biological processes previously implicated in AD, such as TGFβ signaling (SMAD3)42, lipid metabolism (SREBF1)43, inflammatory response (CEBPB, XBP1), and cell cycle control (MYC). Similarly, at 5% FDR, our analysis of ROSMAP brain samples identified 9 CpG-TF-target gene triplets (Supplementary Data 8, Supplementary Figs. 8–11). The methylation-sensitive TFs in these triplets included several critical regulators in AD, such as the estrogen receptor (ESR1)44 and the glucocorticoid receptor (NR3C1)45, as well as factors in biological pathways previously shown to be important in AD pathogenesis, such as antioxidant response (MAFF)46,47, TGFβ signaling (SMAD1)42, Wnt signaling (TCF7L2)48, and hypoxia (H1F1A)49.
One example is the triplet cg16908123-RUNX3-C1orf100 in the analysis of the ADNI blood samples, in which DNA methylation at cg16908123 appeared to attenuate the repression of target gene C1orf100 by TF RUNX3 (Supplementary Fig. 5, Supplementary Data 7c). Notably, in samples with low cg16908123 methylation levels, higher TF activities corresponded to more repression of the target gene (P-value = 1.18 × 10−5). On the other hand, when methylation is high, the target gene is relatively independent of TF activities. Therefore, methylation at cg16908123 and the TF RUNX3 jointly regulate C1orf100 gene expression. In contrast, individually, neither methylation at CpG cg16908123 nor TF activities for RUNX3 were associated with gene expression of the target gene C1orf100.
The transcription factor RUNX3 is a critical regulator for linage specificity of hematopoietic stem and progenitor cells, which ensures a balanced output of peripheral blood cell types. It has been observed that during aging, RUNX3 level decreases, accompanied by shifts in blood cell types50. Previously, several studies also demonstrated RUNX3 activity is primarily regulated by DNA methylation51,52,53. The target gene C1orf100 was recently discovered to be associated with white cell telomere length54. Although the role of C1orf100 is not well understood in AD, given shortening of telomere length is a hallmark of aging, DNA methylation differences and transcription factor activities that affect this gene are particularly relevant for AD. In both ADNI and AIBL studies, methylations at cg16908123 were significantly hyper-methylated in AD samples (Supplementary Data 4), consistent with the MethReg prediction that CpG methylation attenuates the effect of RUNX3 and the previous observations that RUNX3 activity decreases with aging.
Our MethReg analysis revealed a number of additional TFs that interact with AD-associated CpG methylations to jointly regulate target gene expressions (Supplementary Data 7, 8). Importantly, for these methylation-sensitive TFs, the TF-target associations are often only present in a subset of samples with high (or low) methylation levels, thus might be missed by analyses that use all samples (Supplementary Figs. 5–11). Although many of the TFs have previously been implicated to AD, our integrative analysis provided additional information on the specific roles of the TFs in transcriptional regulation, and identified target genes for these TFs in AD, by nominating plausible TF-target gene associations that are mediated by DNA methylations.
Integrative analysis revealed gene expressions associated with DNA methylation differences in the blood and the brain converge in biological pathways
To further understand the functional effects of the significant CpGs as a group, we next performed an additional integrative analysis55 (Supplementary Fig. 12). Covariate effects were first removed from DNA methylation and gene expression data by fitting separate linear models (Methods). We next performed a principal component analysis. For each sample, we summarized covariate-adjusted methylation values in the significant CpGs by the first PC (PC1), which is a weighted linear combination of the methylation values; these are the methylation PC scores (MPS). Then, we tested the association between the MPS and covariate-adjusted genome-wide gene expression levels using linear models, ranked the genes by the absolute value of t-statistics for MPS, and performed pathway analysis using the GSEA method56,57. This analysis was performed for ADNI blood samples and ROSMAP brain samples separately.
At 5% FDR, we identified 1864, 80 genes, and 13 genes that were significantly associated with the methylation PC scores in the brain, blood, or both tissues, respectively (Supplementary Data 9). A substantial number of the 13 genes significant in both tissues included genes involved in the inflammatory response, such as CD79A, LY86, SP100, CD163, CD200, and MS4A1, recapitulating the prominence of immune processes in AD58. For GSEA pathway analysis, at 5% FDR, 830, 12, and 10 Canonical Pathways (CP) were significant in the analyses of the brain, blood, or both tissues (Supplementary Data 10). Similarly, 1895, 52, and 47 Gene Ontology (GO) terms were significant in the analysis of brain, blood, or both tissues (Supplementary Data 11). Notably, the number of significant GO terms and pathways overlapping in analyses of brain and blood was significantly more than expected by chance (CP: P-value = 0.019, GO: P-value = 4.50 × 10−10).
Among the pathways that reached 5% FDR significance in brain or blood analyses, a number of pathways were involved in inflammatory responses in AD, such as neutrophil degranulation, antigen processing and presentation, interferon signaling, and activation of nuclear factor kappa B pathways. Additional significant biological processes included biological processes previously shown to be important in AD, such as glycolysis, antiviral mechanism, endocytosis, mRNA translation59,60, and retrograde transport61. Importantly, some of these biological processes also pointed to potential biomarkers and therapeutic targets for the treatment of AD. For example, the Rho GTPase signaling pathway, the second most significant pathway in brain samples analysis (P-value = 3.62 × 10−21, FDR = 3.28 × 10−18), plays critical roles in regulating synaptic plasticity of the neurons and has been studied as a viable target for AD62,63,64. Also, within the ATM signaling pathway, which was significant in blood sample analysis (P-value = 1.19 × 10−4, FDR = 0.026), the ATM protein is a central regulator for DNA damage response and has been proposed as a promising new target for treating neurodegeneration recently65,66.
A total of 10 canonical pathways and 47 GO terms reached 5% FDR in both brain and blood sample analyses (Supplementary Data 10b, 11c). To further compare the genes that contributed most to the enrichment scores in these pathways56, we next estimated the Jaccard similarity coefficients (percentage of overlapping genes) for leading-edge genes, which most strongly contributed to pathway enrichment, in each pathway in the analyses of brain and blood samples. Among all pathways that were significant in both tissues, the Jaccard coefficients ranged from 0.19 to 0.36 for canonical pathways and 0.22 to 0.42 for GO BP terms (Supplementary Data 10b, 11c), indicating only a moderate percentage of overlapping leading-edge genes in the same pathway across different tissues. Interestingly, neutrophil degranulation ranked as the most significant pathway in both brain and blood sample analyses. However, for this pathway, only 29.5% of leading-edge genes were shared in brain and blood sample analyses (Supplementary Data 10c). These results suggested there might be a convergence in pathways across the brain and blood in gene expression changes associated with the methylation PC scores.
Correlation and overlap with genetic susceptibility loci
To identify methylation quantitative trait loci (mQTLs) for the significant DMRs and CpGs, we next performed look-up analyses using the GoDMC database67 for mQTLs in blood and the xQTL sever68 for mQTLs in the brain. Among the CpGs mapped to the 50 AD-associated CpGs (Supplementary Data 2) or located in AD-associated DMRs in blood samples analysis (Table 2), 39 CpGs had mQTLs in cis, and 7 CpGs had mQTLs in trans. Similarly, among the 97 prioritized CpGs or CpGs located in the 10 prioritized DMRs in our cross-tissue analysis (Fig. 2), 13, 104, and 11 CpGs had mQTLs in the brain, blood, or both, respectively. Among them, 72 CpG – mQTL pairs were significant in both brain and blood sample analysis (Supplementary Data 12). The larger number of mQTLs detected in the blood could be due to the larger sample size of GoDMC, which used a meta-analysis design67, compared to xQTL, which was computed based on a single dataset (ROSMAP)68.
Next, to evaluate if the significant mQTLs in the brain and blood overlapped with genetic risk loci implicated in AD, we compared the mQTLs with the 24 LD blocks of genetic variants reaching genome-wide significance in a recent meta-analysis of AD GWAS69. While no brain mQTLs overlapped with the 24 LD blocks, 3045 blood mQTLs were mapped to the LD region chr6:32395036-32636434, which included genetic variants mapped to the HLA-DRA, HLA-DRB5, HLA-DRB1, HLA-DQA1, and HLA-DQB1 genes (Supplementary Data 13). In addition, we also evaluated if the significant methylation differences overlapped with the genetic risk loci implicated in AD. We found only 3 AD-associated CpGs overlapped with genetic variants mapped to the TREM2, SPI1, and ACE genes (Supplementary Data 14), and no DMRs overlapped with any of the 24 LD blocks. These results are consistent with another recent study that meta-analyzed 11 blood-based EWAS of neurodegenerative disorders, including AD, amyotrophic lateral sclerosis, and Parkinson’s disease, which also did not find evidence for overlap between significant EWAS loci and GWAS loci16. The lack of commonality between genetic and epigenetic loci in AD could be due to a lack of power in EWAS and/or GWAS but could also reflect the relatively independent roles of genetic variants and DNA methylation in influencing AD susceptibility70,71.
Out-of-sample validations of AD-associated DNA methylation differences in an external dataset
To evaluate the feasibility of the identified methylation differences for predicting AD diagnosis, we next performed out-of-sample validations using an external DNA methylation dataset generated by the AddNeuroMed study, which included 83 AD cases and 88 control samples with ages greater than 65 years10. To this end, we computed the methylation risk scores (MRS)72, which were shown to have excellent discrimination of smoking status, and moderate discrimination of obesity, alcohol consumption, HDL cholesterol73, and amyotrophic lateral sclerosis case-control status74 recently.
More specifically, for each sample in the AddNeuroMed dataset, we computed MRS by summing the methylation beta values of the significant CpGs weighted by their estimated effect sizes in the AIBL dataset. Among the 97 significant CpGs in cross-tissue analysis, 91 CpGs are available in the AddNeuroMed dataset and were used for computing MRS. Several logistic regression models were estimated using the AIBL dataset and then tested on the AddNeuroMed dataset. We considered logistic regression models with three sources of variations that might affect the prediction for AD diagnosis: known clinical factors (i.e., age and sex), estimated cell-type proportions for each sample, and MRS. When tested individually, the models that included age and sex, cell types, or MRS alone had AUCs of 0.649. 0.576, and 0.614, respectively (Fig. 3). When combined with estimated cell types or MRS, prediction performance for the model with clinical factors (i.e., age and sex) improved to AUCs of 0.663 and 0.688, respectively. Notably, MRS was computed based on fewer than one hundred CpGs, while the six variables corresponding to cell type proportions were estimated using all CpGs on the array. The best performing model included age, sex, MRS, and cell types (AUC = 0.696, 95% CI: 0.616 − 0.770) (Figs. 3 and 4), significantly more predictive than a random classifier with an AUC of 0.5 (P-value = 2.78 × 10−5).
The same logistic regression model trained using both ADNI and AIBL datasets (instead of AIBL alone) performed slightly worse with an AUC of 0.678, which might be due to batch effect of different training datasets. The model that included MRS based on AD-associated CpGs (instead of prioritized CpGs) also performed worse with an AUC of 0.609, probably because CpGs with cross-tissue differences also leveraged information from additional brain sample datasets. In addition to MRS, we also evaluated the performance of MPS (methylation PC scores) described above, which sums methylation beta values in the testing dataset weighed by loadings of the first principal component in PCA analysis. The best performing logistic regression model involving MPS was also estimated using the AIBL dataset, included variables age, sex, cell types, and MPS computed based on the same 91 prioritized cross-tissue CpGs, and achieved an AUC of 0.662.
Discussion
We performed a comprehensive meta-analysis of two large independent AD blood samples EWAS, which generated DNA methylation profiles using the same Infinium MethylationEPIC BeadChip. After correcting for multiple comparisons, a total of 5 CpGs reached 5% FDR (Table 1), and an additional 45 CpGs reached P-value < 1 × 10−5 (Supplementary Data 2). Among them, two CpGs (cg03546163 and cg14195992), mapped to the FKBP5 and SPIDR genes, also reached genome-wide significance in another large meta-analysis of multiple EWAS (of Amyotrophic lateral sclerosis, Parkinson’s disease, and AD)16, suggesting these two CpGs corresponded to susceptibility loci common in neurodegenerative diseases. In mouse models of AD, FKBP5 was shown to promote tau protein aggregation and is primarily regulated by DNA methylation75. It has been observed that FKBP5 expression levels increase with age and are upregulated in AD brains75. Our results and those from Nabais et al. (2021) provided additional evidence that DNA methylation at cg03546163 is also significantly hypomethylated in the blood of AD subjects16, suggesting FKBP5 methylation might be a viable candidate biomarker for tau pathology. Moreover, genetic variants on FKBP5 have also been implicated in stress-related disorders such as major depressive disorder76,77, suicide behavior77, posttraumatic stress disorder78, and childhood maltreatment79. These results are consistent with the increased dementia risk observed in patients with psychiatric disorders such as depression80.
Our most significant DMR is located on the PM20D1 gene, where highly significant promoter hypomethylation was observed in AD patients (P-value = 5.16 × 10−11, Sidak adjusted P-value = 3.12 × 10−7). In addition, the strong negative methylation-gene expression correlation at this locus was the strongest association we observed in our study (P-value < 2.2 × 10−16, FDR < 2.2 × 10−16). Previously, promoter hyper-methylation of the PM20D1 gene, which is associated with responses to the neurotoxic insults in AD brains, has been consistently observed in multiple AD brain samples EWAS27. Moreover, it was shown that overexpression of PM20D1 is associated with decreased amyloid-β levels and reduces cell death both in vitro and in vivo; thus, it may have a neuroprotective role against AD26,27.
The second most significant DMR is located in the promoter region of the NNAT gene, which encodes the neuronatin protein that aggregates and causes cell death in another neurodegenerative disease called Lafora disease28. The NNAT gene is mainly expressed in the brain during neurogenesis and is critical for maintaining synaptic plasticity of the neurons81. Interestingly, NNAT is an imprinted gene and normally only expresses the paternal allele, while the maternal allele is suppressed by DNA methylation. Changes in neuronatin might be a common downstream effect due to neuronal loss in multiple diseases such as Lafora disease, diabetes, and cancer28. Importantly, DNA methylation at the NNAT gene is diet responsive and can be altered by food enriched with methyl group donors, including folate, choline, methionine, and vitamin B1282. Future studies are needed to evaluate the effect of methyl donor diet on AD.
Consistent with observations from individual EWAS studies3,83, the CpGs and DMRs identified in this meta-analysis of AD blood samples EWAS were mostly distinct from the significant CpGs previously identified in our meta-analysis of AD brain samples EWAS5. Among the AD-associated CpGs and DMRs, only one CpG, cg06357748 mapped to the RAD52 gene, was significant in both studies (i.e., it reached 5% FDR in brain samples meta-analysis5 and P-value < 10−5 in the current blood samples meta-analysis). Interestingly, we also found that a group of cis-mQTLs was strongly associated with cg06357748 in both brain and blood with effects in the same direction (Supplementary Data 12). RAD52 is a critical protein in the RNA-templated recombination repair pathway that mends oxidative damage to DNA in neurons, and it was recently shown that RAD52 activities could be inhibited by high concentrations of amyloid β, which may lead to neuronal genomic instability and neurodegeneration84. Using the same significance threshold (5% FDR), we detected much fewer significant CpGs (5 CpGs) in our current blood samples meta-analysis than in the previous brain samples meta-analysis. This is probably due to the larger between-samples variability in blood samples. For example, in the London dataset3, which had brain and blood DNA methylation levels measured on the same subjects, we observed the average standard deviation for methylation M-values85 across all CpGs in the blood samples is 0.303 in healthy controls, compared to 0.265 for brain samples from the same subjects.
Moreover, in contrast to results from our brain samples meta-analysis5, in which the majority of methylation differences were located in CpG islands and hyper-methylated in AD, we found the majority of methylation differences in our blood samples meta-analysis were located in open seas and hypo-methylated in AD. A number of other recent studies also observed hypomethylated CpGs in the blood of AD patients. For example, Fransquet et al. (2020)8 observed the majority of differentially methylated CpGs between dementia cases at diagnosis and controls in their study also had lower methylation levels in the cases; Madrid et al. (2018)12 discovered hypomethylated CpGs in the B3GALT4 and ZADH2 genes in patients with late-onset AD; Mitsumori et al. (2020)13 identified and replicated lower DNA methylation levels in CpG island shores of CR1, CLU, and PICALM genes in the blood of Japanese AD patients.
Our results are also consistent with previous studies on aging, the strongest risk factor for AD. DNA methylation changes throughout the lifetime, and it has been observed that as people age, methylation decreases in intergenic regions but increases at many promoter-associated CpGs islands regions86. For example, a comparison of DNA methylation in CD4 + T cells of a centenarian with a newborn revealed pervasive hypomethylation across the genome24 in the aged individual. Similarly, Reynolds et al. (2014)87 studied age-related CpGs that are associated with gene expression (age-eMS) in human monocytes and T cells and found age-eMS tended to be hypomethylated as people become older. These previous observations of hypomethylation during aging, combined with our observation of hypomethylation in AD subjects compared to cognitively normal subjects of similar ages (Supplementary Data 1), are consistent with the hypothesis that some age-associated DNA methylation changes are accentuated by AD88,89,90.
While it is difficult to infer whether epigenetic variations are a cause or consequence of the disease process, the DNA methylation differences we observed in this study can provide a useful source of candidate markers associated with neuronal and axonal cell injuries in AD. A number of the differentially methylated genes corresponded to gene expression and protein biomarkers already being explored for diagnosing or tracking progressions in AD. For example, CSF levels of the CDH6 protein (Table 1) were recently shown to be significantly associated with CSF p-tau and t-tau levels as well as amyloid-beta in a large cohort of dementia patients21. Our second most significant CpG, cg14195992, which also reached genome-wide significance in Nabis et al. (2020)16, is mapped to the gene body of the SPIDR gene. Previously, gene expression at the SPIDR gene was among one of 48 selected genes for a classifier that discriminated AD cases from control samples91.
This study has several limitations. First, the methylation levels analyzed here were measured on whole blood, which contains a complex mixture of cell types. To reduce confounding effects due to different cell types, we included estimated cell-type proportions as covariate variables in all our analyses. After multiple comparison corrections using FDR, we only identified a few significant associations between AD-associated CpGs or DMRs with expression levels of nearby genes, this lack of association might be due to larger variability in the samples introduced by cellular heterogeneity. Future studies that utilize single-cell technology for gene expression and DNA methylation might improve power and shed more light on the particular cell types affected by the AD-associated DNA methylation differences discovered in this study. Second, in the analysis of blood samples, we used clinical diagnosis for AD as our endpoint. However, as the pathophysiological process of AD can begin many years before the onset of clinical symptoms92,93, there can be disagreement between neuropathology and clinical phenotypes. Our analysis may have included individuals with disparate brain pathology and clinical diagnosis. These individuals would likely have diluted DNA methylation to AD association signals in our study; therefore, our meta-analysis results may be conservative. Future studies that utilize in vivo neuroimaging endophenotypes that measure amyloid and tau might improve the power of blood-based DNA methylation studies. Third, in the meta-analysis of blood samples, only 5 CpGs reached 5% FDR, we therefore examined 45 additional CpGs at the less stringent significance threshold of P-value < 10−5. Also, to select DNA methylation differences that are significant in the brain, blood, and cross-tissue meta-analyses (Fig. 2), instead of using the FDR-significant CpGs in blood samples meta-analysis, we intersected nominally significant (i.e., P-value < 0.05) CpGs and DMRs in blood sample meta-analysis with FDR-significant DNA methylation differences in brain and cross-tissue meta-analyses. These more relaxed significance thresholds might correspond to higher false-positive rates. To help prioritize the most biologically relevant DNA methylation differences, we performed several integrative analyses of methylation with genetic variants, gene expressions, and transcription factor binding sites. Fourth, we did not consider MCI subjects in this study because there is considerable heterogeneity among MCI subjects, with subjects converting to AD at different trajectories. As ADNI is currently conducting additional phases of the study, future analyses with a larger sample size will make it possible to detect more DNA methylation differences in AD as well as in MCI subjects. Fifth, the MRS-based risk prediction model could also be further improved. Because DNA methylation samples in the testing dataset (AddNeuroMed) were measured by 450k arrays, which are different from the EPIC arrays used by the AIBL study, we only included CpGs that mapped to both arrays in the computation of our MRS. The performance of our MRS-based prediction models can be assessed more accurately using future testing dataset measured by the EPIC arrays. Also, in the out-of-sample validation analysis, we did not include other important factors such as APOE genotype, which might also significantly predict AD diagnosis94 because we did not have access to APOE information in the AIBL and AddNeuroMed datasets. Our internal validation using the ADNI dataset suggested additionally including APOE (ɛ4 allele) into our best-performing logistic regression model, which included MRS, age, sex, and estimated cell types (Figs. 3, 4), could substantially improve prediction performance. More specifically, a 10-fold cross-validation using the ADNI dataset showed the estimated average AUCs for the best performing logistic regression models with and without APOE status were 0.691 and 0.810, respectively (Supplementary Data 16). Finally, the associations we identified do not necessarily reflect causal relationships. Additional studies are needed to establish the causality of the nominated DNA methylation markers.
Although brain tissues are ideal for studying AD, currently, it still is not feasible to obtain methylation levels in brain tissues from living human subjects. On the other hand, because of the relative ease of obtaining blood samples, measuring blood methylation levels is a practical alternative. In this study, we identified a number of DNA methylation differences consistently associated with AD diagnosis in blood samples of two large independent datasets of subjects. Our integrative analysis of DNA methylation differences in the blood with those in the brain, as well as gene expression and TF binding sites information prioritized a number of CpGs, genes, pathways, and regulators that are associated with both neuropathology and/or AD diagnosis, many of which were involved in the inflammatory responses in AD. Consistent with previous studies, we found the patterns of DNA methylation differences in the brain and blood resemble those observed during aging. Given advanced age is the greatest risk factor for AD, our results highlight the need for a better understanding of epigenetic changes during normal aging to design prevention and treatment strategies for AD88. Despite the limited agreement at significant individual CpGs across tissues, we did find that expression changes associated with the combined DNA methylation differences (methylations summarized by principal component score) in brain or blood were enriched in a number of common pathways, suggesting systems biology approaches95 that uncover the pathways disrupted in AD subjects might be a useful strategy for identifying peripheral AD biomarkers that reflect changes in the brain. With respect to genetic variants, consistent with previous studies70,71, we also observed AD-associated DNA methylation differences are mostly independent of genetic effects, suggesting multi-omics models96 that leverage complementary information from both the genome and the epigenome would be helpful for predicting AD risk. Finally, given the relatively modest sample size of our training dataset, the significant discriminatory classification of AD samples with our MRS-based risk prediction model demonstrated DNA methylation might be a predictive biomarker for AD. Future studies that validate our findings in larger and more diverse community-based cohorts are warranted.
Methods
Study datasets
Our meta-analysis included a total of 1284 whole blood samples from two independent datasets, generated by the ADNI14 and AIBL16 studies. The external validation samples included 171 whole blood samples generated by the cross-European AddNeuroMed study10. DNA methylation samples for the ADNI, AIBL, and AddNeuroMed studies were obtained from adni.loni.usc.edu, Gene Expression Omnibus (GEO) (accession: GSE153712 and GSE144858), respectively. Only samples older than 65 years were analyzed.
Preprocessing of DNA methylation data
All DNAm samples were measured by the same Illumina HumanMethylation EPIC beadchip, which included more than 850,000 CpGs97. Supplementary Data 15 shows the number of CpGs and samples removed at each quality control step. More specifically, quality control for CpG probes included several steps: first, we selected probes with a detection P-value < 0.01 for all the samples in the dataset. A small detection P-value corresponds to a significant difference between signals in the probes compared to background noise. Next, using function rmSNPandCH from the DMRcate R package (version 2.4.1), we removed probes that are located on the X and Y chromosomes, are cross-reactive98, located close to single nucleotide polymorphism (SNPs) (i.e., an SNP with minor allele frequency (MAF) ≥ 0.01 was present in the last five base pairs of the probe), or are associated with cigarette smoking99.
Quality control for samples included restricting our analysis to samples with good bisulfite conversion efficiency (i.e., ≥85%). In addition, principal component analysis (PCA) was used to exclude outlier samples. To this end, PCA was performed using the 50,000 most variable CpGs for each dataset, and samples within ±3 standard deviations from the mean of PC1 and PC2 were selected to be included in the final sample set. For the ADNI dataset, we also selected one sample among multiple technical replicate samples and removed samples without slide information or matched clinical data.
The quality-controlled methylation datasets were next subjected to the QN.BMIQ normalization procedure as recommended by a recent systematic study of different normalization methods100. More specifically, we first applied quantile normalization as implemented in the lumi R package (version 2.42.0) to remove systematic effects between samples. Next, we applied the β-mixture quantile normalization (BMIQ) procedure as implemented in the wateRmelon R package (version 1.34.0) to normalize beta values of type 1 and type 2 design probes in the Illumina arrays.
Blood sample meta-analysis
For the AIBL dataset, the association between CpG methylations and diagnosis (dementia vs. cognitive normal or CN) was assessed using logistic regression models with logit (probability of dementia) as the outcome, methylations beta values as the main independent variable, and covariate variables age, sex, methylation plate, and estimated cell-type proportions (B lymphocytes, natural killer cells, CD4 + T lymphocytes, CD8 + T lymphocytes, monocytes, neutrophils). For the analysis of ADNI dataset, which is a longitudinal study with some subjects contributing multiple observations, we applied logistic mixed-effects models that additionally included subjects random effects to account for correlations from multiple observations generated from the same subjects.
More specifically, cell type proportions were estimated using the EpiDISH101 R package (version 2.6.0). For the AIBL dataset, logistic regression models were fitted using the glm() function in R software (version 4.0.3). For the ADNI dataset, logistic mixed-effects models were fitted using Procedure GLMMIX in SAS software (version 9.4). In the AIBL dataset, because age information was not available, sample ages were estimated using the DNAm-based-age-predictor102 (https://github.com/qzhang314/DNAm-based-age-predictor/, elastic net method). For ADNI samples, age was calculated as the difference between the date on which blood was drawn and the birthdate of the subject.
We estimated genomic inflation factors (lambda values) using both the conventional approach103 and the bacon method104, which is specifically proposed for a more accurate assessment of inflation in EWAS. Briefly, the bacon method uses a Bayesian algorithm to estimate a three-component normal mixture given the observed test statistics (e.g., t-statistics corresponding to the effect of DNA methylation in regression models) where one component reflects the null distribution, and two other components correspond to the positive and negative associations in the data. Mean and standard deviations of the estimated (empirical) null distribution correspond to bias and inflation of the test statistics. The lambda values (λ) by the conventional approach were 0.54 and 1.24, and lambdas based on the bacon approach (λ.bacon) were 0.73 and 0.96 for the ADNI and AIBL datasets, respectively.
Next, for more accurate statistical assessment, genomic correction using the bacon method104, as implemented in the bacon R package, was applied to obtain bacon-corrected effect sizes, standard errors, and P-values for each dataset. Efron (2010) showed that in large-scale simultaneous testing situations (e.g., when many CpGs are tested in an analysis), serious defects in the theoretical null distribution may become obvious, while empirical Bayes methods can provide much more realistic null distributions105. By definition, the bacon-corrected test statistics have estimated an inflation factor of 1 because empirical null distributions were used in their estimation. Indeed, after bacon-correction, the estimated inflation factors were λ = 0.97 and 1.02, and λ.bacon = 0.98 and 0.97, for the ADNI and AIBL datasets, respectively.
To meta-analyze individual CpG results across different datasets, we used the inverse-variance weighted fixed-effects model, which was implemented in the meta R package (version 4.18.0). The estimated effect sizes from the meta-analysis of logistic regression model results were then exponentiated to compute odds ratios for a one percent increase in beta values (i.e., increase in beta values by 0.01).
For the region-based meta-analysis, we used the comb-p method25. Briefly, comb-p takes single CpG P-values and locations of CpG sites to scan the genome for regions enriched with a series of adjacent low P-values. In our analysis, we used meta-analysis P-values of the two whole blood sample datasets as input for comb-p. As comb-p uses the Sidak method to account for multiple comparisons, we considered DMRs with Sidak P-values less than 0.05 to be significant. We used the default setting for our comb-p analysis, with parameters -seed 1e-3 and -dist 200, which required a P-value of 10−3 to start a region and extend the region if another P-value was within 200 base pairs.
Cross-tissue meta-analysis Our cross-tissue meta-analysis included the 1284 blood samples from the ADNI (n = 793) and AIBL (n = 491) datasets described above, and an additional 1030 prefrontal cortex brain samples from four independent datasets, which included samples from the ROSMAP (n = 726), Mt. Sinai (n = 141), London (n = 107), and Gasparoni (n = 56) studies that we previously analyzed in our brain samples meta-analysis5. We used Stouffer’s Method106, as implemented in sumz() function of R package metap, to combine weighted z-scores (transformed from P-values) in these six datasets. For each study, weights were specified based on the square root of the total number of subjects in each study107. For region-based analysis, because DMRs identified with comb-p often vary by datasets so cannot be meta-analyzed, the coMethDMR108 R package (https://github.com/TransBioInfoLab/coMethDMR) was used to analyze a common set of genomic regions across datasets. For the cross-tissue meta-analysis of blood samples with other brain regions in the temporal gyrus and entorhinal cortex, we combined dataset-specific P-values of the CpGs in AIBL and ADNI with dataset-specific P-values for the CpGs in Supplementary Data 3, 5 of Smith et al.35 using Stouffer’s Method106 as described above.
Functional annotation of significant methylation differences
The significant methylation differences at individual CpGs and DMRs were annotated using both the Illumina (UCSC) gene annotation and GREAT (Genomic Regions Enrichment of Annotations Tool) software109 that associates genomic regions to target genes. With the default “Basal plus method”, GREAT links each gene to a regulatory region consisting of a basal domain that extends 5 kb upstream and 1 kb downstream from its transcription start site and an extension up to the basal regulatory region of the nearest upstream and downstream genes within 1 Mb. To assess the overlap between our significant CpGs and DMRs (CpG or DMR location + /− 250 bp) with enhancers, we used enhancer–gene maps generated from 131 human cell types and tissues described in Nasser et al. (2021)31, available at https://www.engreitzlab.org/resources/. More specifically, we selected enhancer-gene pairs with “positive” predictions from the ABC model, which included only expressed target genes, does not include promoter elements, and has an ABC score higher than 0.015. In addition, we also required the enhancer-gene pairs were identified in cell lines relevant to this study (https://github.com/TransBioInfoLab/AD-meta-analysis-blood/blob/main/code/annotations/).
Correlations between methylation levels of significant CpGs and DMRs in AD with expressions of nearby genes
For blood sample analysis, to evaluate the DNA methylation effect on the gene expression of nearby genes, we analyzed matched gene expression (Affymetrix Human Genome U 219 array) and DNA methylation (EPIC array) data from 265 independent subjects in the ADNI study. We considered the 97 prioritized CpGs (Supplementary Data 4), the 50 CpGs that achieved P-value < 1 × 10−5 in AD vs. CN analysis (Supplementary Data 2), as well as CpGs located in the 10 prioritized DMRs (Table 3) and 9 significant DMRs in the AD vs. CN comparison (Table 2).
To associate genes with DNA methylation sites, we used the MethReg R package40 and considered CpGs located in the promoter regions and distal regions separately. More specifically, for CpGs located in the promoter region (within ±2 kb around the transcription start sites (TSS)), we tested the association between CpG methylation with expression levels of the target genes. On the other hand, for CpGs in the distal regions (>2 kb from TSS), we tested the association between CpG methylation with expression levels of ten nearest genes upstream and downstream from the CpG. For gene expression data, when multiple probes were mapped to a gene, we used the median gene expression level over all probes mapped to the gene as its gene expression level.
To reduce the effect of potential confounding effects, when testing for methylation to gene expression associations, we first adjusted for age at visit, sex, immune cell-type proportions (for B lymphocytes, natural killer cells, CD4 + T lymphocytes, CD8 + T lymphocytes, monocytes, neutrophils), and batch effects in both DNA methylation and gene expression levels separately and extracted residuals from the linear models. Immune cell-type proportions were estimated using the R packages EpiDISH101 and Xcell110 (https://github.com/dviraran/xCell) for DNA methylation and gene expression data, respectively. A separate robust linear model was then used to test the association between methylation residuals and gene expression residuals, adjusting for AD status. The analysis of DMRs was performed similarly, except by replacing CpG methylation levels with the median methylation level of all CpGs located within the DMR.
For the analysis of brain samples, we considered the 97 prioritized CpGs (Supplementary Data 4) and CpGs within ours 10 prioritized DMRs (Table 3) in cross-tissue analysis, and performed similar analyses using matched RNA-seq and DNA methylation brain samples from 529 independent subjects in the ROSMAP study. First, we removed confounding effects in DNA methylation data by fitting the linear model DNA methylation M value ~ neuron.proportions + batch + sample.plate array + ageAtDeath + sex and extracting residuals from this model, which are the methylation residuals. The proportions of neurons in each sample were estimated using the CETS R package111. Similarly, we also removed potential confounding effects in RNA-seq data by fitting model log2(normalized FPKM values + 1) ~ age at death + sex + markers for cell types and extracting gene expression residuals. The last term, “markers for cell types,” included multiple covariate variables to adjust for the multiple types of cells in the brain samples. More specifically, we used the expression levels of genes that are specific for the five main cell types present in the central nervous system2: ENO2 for neurons, GFAP for astrocytes, CD68 for microglia, OLIG2 for oligodendrocytes, and CD34 for endothelial cells, and included these as variables in the above linear regression model.
We then tested for association between methylation residuals and gene expression residuals, adjusting for the Braak stage using a separate robust linear model residualsexpression ~ residualsDNAm + Braak stage. The analysis of DMRs was performed similarly, except by replacing CpG methylation levels with the median methylation level of all CpGs located within the DMR.
MethReg integrative analysis
To create the CpG-TF-target gene triplets, we first linked a given CpG to transcription factors (TFs) with binding sites within ±250 bp of the CpG, by using information from the ReMap2020 database112, which contains regulatory regions for over one thousand transcriptional regulators obtained using genome-wide DNA-binding experiments such as ChIP-seq. Next, we linked a CpG to a gene if the CpG was within its promoter region; otherwise, we considered the CpG to be in the distal regions (>2k bp from any promoter regions) and linked it to 5 genes upstream and 5 genes downstream of the CpG location. The CpG-TF pairs are then combined with CpG-target gene pairs to create triplets of CpG-TF-target genes. We used the same 265 and 529 matched methylation-RNA samples from ADNI and ROSMAP studies described in the section above for blood and brain samples analysis. Methylation residuals and gene expression residuals were obtained in the same way as described above and used as input for the MethReg analysis. TF activities were estimated using the GSVA41 R package. The MethReg analyses were performed using the MethReg R package40.
Integrative analysis of DNA methylation differences in the brain and blood with transcriptome-wide gene expressions
Covariates adjusted DNA methylation levels (i.e., DNA methylation residuals described above) at AD-associated CpGs in blood samples or Braak-associated CpGs in brain samples were summarized using principal component analysis (PCA) (Supplementary Fig. 12). More specifically, for blood sample analysis, the first principal component (PC1) was computed by performing PCA analysis on the ADNI dataset with the 50 CpGs that achieved P-value < 10−5 in AD vs. CN analysis (Supplementary Data 2); while for the brain sample analysis, PCA was performed on the ROSMAP dataset, and PC1 was computed for the 3751 CpGs identified in our previous brain sample meta-analysis5. For each sample, PC1 was estimated using the prcomp() function in R, and they represented the methylation PC scores (MPS).
Next, we tested the association between log-transformed gene expressions and the methylation PC scores using linear regression models, adjusting for age, sex, batch, and estimated immune cell types in brain and blood samples separately. Gene Set Enrichment Analysis56 was performed using the fgsea R package (version 1.17.1). As a ranking measure for each gene, we used the absolute value of t-statistics corresponding to methylation PC scores in the linear model, which associated gene expression levels with the methylation PC scores. We analyzed gene sets from the Molecular Signatures Database (MSigDB)56, accessed by the msigdbr R package (version 7.4.1.), including the GO Biological Process terms (C5:BP) and Canonical pathways (BIOCARTA, KEGG, REACTOME, WikiPathways) (C2:CP). To evaluate if the number of significant GO terms and pathways in the analyses of brain and blood samples was significantly more than expected by chance, we used Fisher’s exact test.
Correlation and overlap with genetic susceptibility loci
We searched mQTLs using the GoDMC database67 and the xQTL sever68, which were downloaded from http://mqtldb.godmc.org.uk/downloads and http://mostafavilab.stat.ubc.ca/xQTLServe/, respectively. To select significant blood mQTLs in GoDMC, we used the same criteria as the original study67, that is, considering a cis P-value smaller than 10−8 and a trans P-value smaller than 10−14 as significant. The 24 LD blocks of genetic variants reaching genome-wide significance were obtained from Supplementary 8 of Kunkle et al. (2019)69.
Out-of-sample validation
The best-performing risk prediction model was trained using samples from the AIBL dataset and tested on the AddNeuroMed dataset. More specifically, first, we computed the Methylation Risk Scores (MRS) as the sum of methylation beta values for prioritized CpGs in cross-tissue analysis (Supplementary Data 4) weighted by their estimated effect sizes (i.e., parameter estimate for methylation beta values in logistic regression after bacon-correction) in the AIBL dataset72. We included 91 prioritized CpGs that were available in the AddNeuroMed dataset from GEO. Next, the logistic regression model logit (Pr (AD)) ~ MRS + age + sex + B + NK + CD4T + CD8T + Mono + Neutro was fitted to the AIBL dataset using glm() function, and predict.glm() was used to apply the logistic regression model to AddNeuroMed dataset. The last six variables in the logistic regression model correspond to estimated proportions of different blood cell types of B-cells, Natural Killer (NK) cells, CD4 + T-cells, CD8 + T-cells, Monocytes, and Neutrophils, obtained using the EpiDish R package101. The R package pROC was used to estimate receiver operating characteristic curves (ROCs) and area under the ROC curves (AUCs). Similarly, logistic regression models with a subset of the variables in the above model (e.g., only age and sex) were similarly developed using the AIBL dataset and tested on the AddNeuroMed dataset. To determine if a logistic regression model predicted AD diagnosis significantly better than chance, we used the Wilcoxon rank-sum test to compare estimated probabilities for AD cases versus controls113.
Internal validation to assess the impact of APOE genotype
Among the three public datasets (AIBL, AddNeuroMed, ADNI) we analyzed, the APOE genotype was only available for the ADNI dataset. To assess the added prediction accuracy of APOE gene, we performed internal validations (i.e., 10-fold cross-validations) using the ADNI dataset, by comparing our best performing model logit (Pr (AD)) ~ MRS + age + sex + B + NK + CD4T + CD8T + Mono + Neutro with the model that additionally included APOE (ɛ4 allele genotype). To obtain an independent set of samples, only the last visit of each subject in the ADNI dataset was used for this analysis. The function createFolds() in caret R package was used to divide the data into ten folds. Average AUCs over the ten iterations in the 10-fold cross-validations for the models with and without APOE were then estimated and compared.
In all analyses, to account for multiple comparisons, we computed FDR using the method of Benjamini and Hochberg114. Associations with 5% or less FDR were considered to be FDR significant. All analyses were performed using the R (https://www.r-project.org/; version 4.1.0), Python (version 2.7.12), SAS (version 9.4), and GREAT (version 4) software.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All datasets analyzed in this study are publicly available as described in Methods. In particular, ADNI can be accessed from http://adni.loni.usc.edu, AIBL and AddNeuroMed datasets can be accessed from Gene Expression Omnibus (GEO) (accession: GSE153712, GSE144858). The Molecular Signatures Database (MSigDB) can be accessed from https://www.gsea-msigdb.org/gsea/msigdb/. The meta-analysis results generated by this study are available at https://github.com/TransBioInfoLab/AD-meta-analysis-blood.
Code availability
The scripts for the analysis performed in this study can be accessed at https://github.com/TransBioInfoLab/AD-meta-analysis-blood/ and the zenodo repository (https://doi.org/10.5281/zenodo.6787981) .
References
Rajan, K. B. et al. Population estimate of people with clinical Alzheimer’s disease and mild cognitive impairment in the United States (2020-2060). Alzheimers Dement 17, 1966–1975 (2021).
De Jager, P. L. et al. Alzheimer’s disease: early alterations in brain DNA methylation at ANK1, BIN1, RHBDF2 and other loci. Nat. Neurosci. 17, 1156–1163 (2014).
Lunnon, K. et al. Methylomic profiling implicates cortical deregulation of ANK1 in Alzheimer’s disease. Nat. Neurosci. 17, 1164–1170 (2014).
Smith, R. G. et al. Elevated DNA methylation across a 48-kb region spanning the HOXA gene cluster is associated with Alzheimer’s disease neuropathology. Alzheimers Dement 14, 1580–1588 (2018).
Zhang, L. et al. Epigenome-wide meta-analysis of DNA methylation differences in prefrontal cortex implicates the immune processes in Alzheimer’s disease. Nat. Commun. 11, 6114 (2020).
Paziewska, A. et al. DNA methylation status is more reliable than gene expression at detecting cancer in prostate biopsy. Br. J. Cancer 111, 781–789 (2014).
Fransquet, P. D. et al. Blood DNA methylation as a potential biomarker of dementia: A systematic review. Alzheimers Dement 14, 81–103 (2018).
Fransquet, P. D. et al. Blood DNA methylation signatures to detect dementia prior to overt clinical symptoms. Alzheimers Dement (Amst.) 12, e12056 (2020).
Kobayashi, N. et al. Increased blood COASY DNA methylation levels a potential biomarker for early pathology of Alzheimer’s disease. Sci. Rep. 10, 12217 (2020).
Roubroeks, J. A. Y. et al. An epigenome-wide association study of Alzheimer’s disease blood highlights robust DNA hypermethylation in the HOXB6 gene. Neurobiol. Aging 95, 26–45 (2020).
Fransquet, P. D. et al. DNA methylation analysis of candidate genes associated with dementia in peripheral blood. Epigenomics 12, 2109–2123 (2020).
Madrid, A. et al. DNA Hypomethylation in Blood Links B3GALT4 and ZADH2 to Alzheimer’s Disease. J. Alzheimers Dis. 66, 927–934 (2018).
Mitsumori, R. et al. Lower DNA methylation levels in CpG island shores of CR1, CLU, and PICALM in the blood of Japanese Alzheimer’s disease patients. PLoS One 15, e0239196 (2020).
Vasanthakumar, A. et al. Harnessing peripheral DNA methylation differences in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) to reveal novel biomarkers of disease. Clin. Epigenetics 12, 84 (2020).
Ellis, K. A. et al. Enabling a multidisciplinary approach to the study of ageing and Alzheimer’s disease: an update from the Australian Imaging Biomarkers and Lifestyle (AIBL) study. Int Rev. Psychiatry 25, 699–710 (2013).
Nabais, M. F. et al. Meta-analysis of genome-wide DNA methylation identifies shared associations across neurodegenerative disorders. Genome Biol. 22, 90 (2021).
Wan, L. et al. Scaffolding protein SPIDR/KIAA0146 connects the Bloom syndrome helicase with homologous recombination repair. Proc. Natl Acad. Sci. USA 110, 10646–10651 (2013).
Arikkath, J. & Reichardt, L. F. Cadherins and catenins at synapses: roles in synaptogenesis and synaptic plasticity. Trends Neurosci. 31, 487–494 (2008).
Fannon, A. M. & Colman, D. R. A model for central synaptic junctional complex formation based on the differential adhesive specificities of the cadherins. Neuron 17, 423–434 (1996).
Yamagata, M., Duan, X. & Sanes, J. R. Cadherins Interact With Synaptic Organizers to Promote Synaptic Differentiation. Front Mol. Neurosci. 11, 142 (2018).
Ahmad, S. et al. CDH6 and HAGH protein levels in plasma associate with Alzheimer’s disease in APOE epsilon4 carriers. Sci. Rep. 10, 8233 (2020).
Fujimura, M., Usuki, F., Kawamura, M. & Izumo, S. Inhibition of the Rho/ROCK pathway prevents neuronal degeneration in vitro and in vivo following methylmercury exposure. Toxicol. Appl Pharm. 250, 1–9 (2011).
Tan, H. B., Zhong, Y. S., Cheng, Y. & Shen, X. Rho/ROCK pathway and neural regeneration: a potential therapeutic target for central nervous system and optic nerve damage. Int J. Ophthalmol. 4, 652–657 (2011).
Heyn, H. et al. Distinct DNA methylomes of newborns and centenarians. Proc. Natl Acad. Sci. USA 109, 10522–10527 (2012).
Pedersen, B. S., Schwartz, D. A., Yang, I. V. & Kechris, K. J. Comb-p: software for combining, analyzing, grouping and correcting spatially correlated P-values. Bioinformatics 28, 2986–2988 (2012).
Sanchez-Mut, J. V., Glauser, L., Monk, D. & Graff, J. Comprehensive analysis of PM20D1 QTL in Alzheimer’s disease. Clin. Epigenetics 12, 20 (2020).
Sanchez-Mut, J. V. et al. PM20D1 is a quantitative trait locus associated with Alzheimer’s disease. Nat. Med 24, 598–603 (2018).
Joseph, R. M. Neuronatin gene: Imprinted and misfolded: Studies in Lafora disease, diabetes and cancer may implicate NNAT-aggregates as a common downstream participant in neuronal loss. Genomics 103, 183–188 (2014).
Ma, T. et al. Suppression of eIF2alpha kinases alleviates Alzheimer’s disease-related plasticity and memory deficits. Nat. Neurosci. 16, 1299–1305 (2013).
Liu, J. et al. The association of DNA methylation and brain volume in healthy individuals and schizophrenia patients. Schizophr. Res 169, 447–452 (2015).
Nasser, J. et al. Genome-wide enhancer maps link risk variants to disease genes. Nature 593, 238–243 (2021).
Mukherjee, S., Erickson, H. & Bastia, D. Enhancer-origin interaction in plasmid R6K involves a DNA loop mediated by initiator protein. Cell 52, 375–383 (1988).
Gasparoni, G. et al. DNA methylation analysis on purified neurons and glia dissects age and Alzheimer’s disease-specific changes in the human cortex. Epigenetics Chromatin 11, 41 (2018).
Braak, H. & Braak, E. Staging of Alzheimer’s disease-related neurofibrillary changes. Neurobiol. Aging 16, 271–278 (1995). discussion 278-84.
Smith, R. G. et al. A meta-analysis of epigenome-wide association studies in Alzheimer’s disease highlights novel differentially methylated loci across cortex. Nat. Commun. 12, 3517 (2021).
Bennett, D. A. et al. Religious Orders Study and Rush Memory and Aging Project. J. Alzheimers Dis. 64, S161–S189 (2018).
Zhu, H., Wang, G. & Qian, J. Transcription factors as readers and effectors of DNA methylation. Nat. Rev. Genet 17, 551–565 (2016).
Yin, Y. et al. Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science 356, eaaj2239 (2017).
Wang, Z. et al. Complex impact of DNA methylation on transcriptional dysregulation across 22 human cancer types. Nucleic Acids Res 48, 2287–2302 (2020).
Silva, T.C., Young, J.I., Martin, E.R., Chen, X. & Wang, L. MethReg: estimating the regulatory potential of DNA methylation in gene transcription. Nucleic Acids Res. 50, e51 (2022).
Hanzelmann, S., Castelo, R. & Guinney, J. GSVA: gene set variation analysis for microarray and RNA-seq data. BMC Bioinforma. 14, 7 (2013).
von Bernhardi, R., Cornejo, F., Parada, G. E. & Eugenin, J. Role of TGFbeta signaling in the pathogenesis of Alzheimer’s disease. Front Cell Neurosci. 9, 426 (2015).
Bertolio, R. et al. Sterol regulatory element binding protein 1 couples mechanical cues and lipid metabolism. Nat. Commun. 10, 1326 (2019).
Boada, M. et al. Estrogen receptor alpha gene variants are associated with Alzheimer’s disease. Neurobiol. Aging 33, 198 e15–24 (2012).
Canet, G., Chevallier, N., Zussy, C., Desrumaux, C. & Givalois, L. Central Role of Glucocorticoid Receptors in Alzheimer’s Disease and Depression. Front Neurosci 12, 739 (2018).
Wang, Q. et al. Meta-Analysis of Parkinson’s Disease and Alzheimer’s Disease Revealed Commonly Impaired Pathways and Dysregulation of NRF2-Dependent Genes. J. Alzheimers Dis. 56, 1525–1539 (2017).
Wang, X. et al. Responsive Expression of MafF to beta-Amyloid-Induced Oxidative Stress. Dis. Markers 2020, 8861358 (2020).
Blom, E. S. et al. Increased mRNA Levels of TCF7L2 and MYC of the Wnt Pathway in Tg-ArcSwe Mice and Alzheimer’s Disease Brain. Int J. Alzheimers Dis. 2011, 936580 (2010).
Ogunshola, O. O. & Antoniou, X. Contribution of hypoxia to Alzheimer’s disease: is HIF-1alpha a mediator of neurodegeneration? Cell Mol. Life Sci. 66, 3555–3563 (2009).
Balogh, P. et al. RUNX3 levels in human hematopoietic progenitors are regulated by aging and dictate erythroid-myeloid balance. Haematologica 105, 905–913 (2020).
Meng, G., Zhong, X. & Mei, H. A Systematic Investigation into Aging Related Genes in Brain and Their Relationship with Alzheimer’s Disease. PLoS One 11, e0150624 (2016).
So, K. et al. Multiple tumor suppressor genes are increasingly methylated with age in non-neoplastic gastric epithelia. Cancer Sci. 97, 1155–1158 (2006).
Tserel, L. et al. Age-related profiling of DNA methylation in CD8+ T cells reveals changes in immune response and transcriptional regulator genes. Sci. Rep. 5, 13107 (2015).
Mangino, M. et al. A genome-wide association study identifies a novel locus on chromosome 18q12.2 influencing white cell telomere length. J. Med Genet 46, 451–454 (2009).
Ma, Y. et al. Epigenomic features related to microglia are associated with attenuated effect of APOE epsilon4 on Alzheimer’s disease risk in humans. Alzheimers Dement 18, 688–699 (2022).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Korotkevich, G. et al. Fast gene set enrichment analysis. bioRxiv, https://www.biorxiv.org/content/10.1101/060012v3.full.pdf (2021).
Heneka, M. T. et al. Neuroinflammation in Alzheimer’s disease. Lancet Neurol. 14, 388–405 (2015).
Garcia-Esparcia, P. et al. Altered mechanisms of protein synthesis in frontal cortex in Alzheimer disease and a mouse model. Am. J. Neurodegener. Dis. 6, 15–25 (2017).
Hernandez-Ortega, K., Garcia-Esparcia, P., Gil, L., Lucas, J. J. & Ferrer, I. Altered Machinery of Protein Synthesis in Alzheimer’s: From the Nucleolus to the Ribosome. Brain Pathol. 26, 593–605 (2016).
Tammineni, P., Ye, X., Feng, T., Aikal, D. & Cai, Q. Impaired retrograde transport of axonal autophagosomes contributes to autophagic stress in Alzheimer’s disease neurons. Elife 6, e21776 (2017).
Luo, L. Rho GTPases in neuronal morphogenesis. Nat. Rev. Neurosci. 1, 173–180 (2000).
Aguilar, B. J., Zhu, Y. & Lu, Q. Rho GTPases as therapeutic targets in Alzheimer’s disease. Alzheimers Res Ther. 9, 97 (2017).
Guiler, W., Koehler, A., Boykin, C. & Lu, Q. Pharmacological Modulators of Small GTPases of Rho Family in Neurodegenerative Diseases. Front Cell Neurosci. 15, 661612 (2021).
Lu, X. H. et al. Targeting ATM ameliorates mutant Huntingtin toxicity in cell and animal models of Huntington’s disease. Sci. Transl. Med 6, 268ra178 (2014).
Flemming, A. Huntington disease: Banking on ATM. Nat. Rev. Drug Disco. 14, 92 (2015).
Min, J. L. et al. Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation. Nat. Genet 53, 1311–1321 (2021).
Ng, B. et al. An xQTL map integrates the genetic architecture of the human brain’s transcriptome and epigenome. Nat. Neurosci. 20, 1418–1426 (2017).
Kunkle, B. W. et al. Genetic meta-analysis of diagnosed Alzheimer’s disease identifies new risk loci and implicates Abeta, tau, immunity and lipid processing. Nat. Genet 51, 414–430 (2019).
Klein, H. U., Bennett, D. A. & De Jager, P. L. The epigenome in Alzheimer’s disease: current state and approaches for a new path to gene discovery and understanding disease mechanism. Acta Neuropathol. 132, 503–514 (2016).
Chibnik, L. B. et al. Alzheimer’s loci: epigenetic associations and interaction with genetic factors. Ann. Clin. Transl. Neurol. 2, 636–647 (2015).
Huls, A. & Czamara, D. Methodological challenges in constructing DNA methylation risk scores. Epigenetics 15, 1–11 (2020).
McCartney, D. L. et al. Epigenetic prediction of complex traits and death. Genome Biol. 19, 136 (2018).
Nabais, M. F. et al. Significant out-of-sample classification from methylation profile scoring for amyotrophic lateral sclerosis. NPJ Genom. Med 5, 10 (2020).
Blair, L. J. et al. Accelerated neurodegeneration through chaperone-mediated oligomerization of tau. J. Clin. Invest 123, 4158–4169 (2013).
Binder, E. B. et al. Polymorphisms in FKBP5 are associated with increased recurrence of depressive episodes and rapid response to antidepressant treatment. Nat. Genet 36, 1319–1325 (2004).
Hernandez-Diaz, Y. et al. Association between FKBP5 polymorphisms and depressive disorders or suicidal behavior: A systematic review and meta-analysis study. Psychiatry Res 271, 658–668 (2019).
Binder, E. B. et al. Association of FKBP5 polymorphisms and childhood abuse with risk of posttraumatic stress disorder symptoms in adults. JAMA 299, 1291–1305 (2008).
Appel, K. et al. Moderation of adult depression by a polymorphism in the FKBP5 gene and childhood physical abuse in the general population. Neuropsychopharmacology 36, 1982–1991 (2011).
Holmquist, S., Nordstrom, A. & Nordstrom, P. The association of depression with subsequent dementia diagnosis: A Swedish nationwide cohort study from 1964 to 2016. PLoS Med 17, e1003016 (2020).
Oyang, E. L., Davidson, B. C., Lee, W. & Poon, M. M. Functional characterization of the dendritically localized mRNA neuronatin in hippocampal neurons. PLoS One 6, e24879 (2011).
Konycheva, G. et al. Dietary methyl donor deficiency during pregnancy in rats shapes learning and anxiety in offspring. Nutr. Res 31, 790–804 (2011).
Yu, L. et al. Methylation profiles in peripheral blood CD4+ lymphocytes versus brain: The relation to Alzheimer’s disease pathology. Alzheimers Dement 12, 942–951 (2016).
Welty, S. et al. RAD52 is required for RNA-templated recombination repair in post-mitotic neurons. J. Biol. Chem. 293, 1353–1362 (2018).
Du, P. et al. Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis. BMC Bioinforma. 11, 587 (2010).
Jones, M. J., Goodman, S. J. & Kobor, M. S. DNA methylation and healthy human aging. Aging Cell 14, 924–932 (2015).
Reynolds, L. M. et al. Age-related variations in the methylome associated with gene expression in human monocytes and T cells. Nat. Commun. 5, 5366 (2014).
Kennedy, B. K. et al. Geroscience: linking aging to chronic disease. Cell 159, 709–713 (2014).
McKinney, B. C. et al. DNA methylation in the human frontal cortex reveals a putative mechanism for age-by-disease interactions. Transl. Psychiatry 9, 39 (2019).
McKinney, B. C. & Sibille, E. The age-by-disease interaction hypothesis of late-life depression. Am. J. Geriatr. Psychiatry 21, 418–432 (2013).
Lunnon, K. et al. A blood gene expression marker of early Alzheimer’s disease. J. Alzheimers Dis. 33, 737–753 (2013).
Jack, C. R. Jr. et al. Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. Lancet Neurol. 9, 119–128 (2010).
Sperling, R. A. et al. Toward defining the preclinical stages of Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement 7, 280–292 (2011).
Sims, R., Hill, M. & Williams, J. The multiplex model of the genetics of Alzheimer’s disease. Nat. Neurosci. 23, 311–322 (2020).
Castrillo, J. I., Lista, S., Hampel, H. & Ritchie, C. W. Systems Biology Methods for Alzheimer’s Disease Research Toward Molecular Signatures, Subtypes, and Stages and Precision Medicine: Application in Cohort Studies and Trials. Methods Mol. Biol. 1750, 31–66 (2018).
Odom, G. J., Colaprico, A., Silva, T. C., Chen, X. S. & Wang, L. PathwayMultiomics: An R Package for Efficient Integrative Analysis of Multi-Omics Datasets With Matched or Un-matched Samples. Front Genet 12, 783713 (2021).
Pidsley, R. et al. Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol. 17, 208 (2016).
Chen, Y. A. et al. Discovery of cross-reactive probes and polymorphic CpGs in the Illumina Infinium HumanMethylation450 microarray. Epigenetics 8, 203–209 (2013).
Joehanes, R. et al. Epigenetic Signatures of Cigarette Smoking. Circ. Cardiovasc Genet 9, 436–447 (2016).
Wang, T. et al. A systematic study of normalization methods for Infinium 450K methylation data using whole-genome bisulfite sequencing data. Epigenetics 10, 662–669 (2015).
Zheng, S. C., Breeze, C. E., Beck, S. & Teschendorff, A. E. Identification of differentially methylated cell types in epigenome-wide association studies. Nat. Methods 15, 1059–1066 (2018).
Zhang, Q. et al. Improved precision of epigenetic clock estimates across tissues and its implication for biological ageing. Genome Med 11, 54 (2019).
Delvin, B. & Roeder, K. Genomic Control for Association Studies. Biometrics 55, 997–1004 (1999).
van Iterson, M., van Zwet, E. W., Consortium, B. & Heijmans, B. T. Controlling bias and inflation in epigenome- and transcriptome-wide association studies using the empirical null distribution. Genome Biol. 18, 19 (2017).
Efron, B. Correlated z-values and the accuracy of large-scale statistical estimates. J. Am. Stat. Assoc. 105, 1042–1055 (2010).
Whitlock, M. C. Combining probability from independent tests: the weighted Z-method is superior to Fisher’s approach. J. Evol. Biol. 18, 1368–1373 (2005).
Zaykin, D. V. Optimally weighted Z-test is a powerful method for combining probabilities in meta-analysis. J. Evol. Biol. 24, 1836–1841 (2011).
Gomez, L. et al. coMethDMR: accurate identification of co-methylated and differentially methylated regions in epigenome-wide association studies with continuous phenotypes. Nucleic Acids Res 47, e98 (2019).
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220 (2017).
Guintivano, J., Aryee, M. J. & Kaminsky, Z. A. A cell epigenotype specific model for the correction of brain cellular heterogeneity bias and its application to age, brain region and major depression. Epigenetics 8, 290–302 (2013).
Cheneby, J. et al. ReMap 2020: a database of regulatory regions from an integrative analysis of Human and Arabidopsis DNA-binding sequencing experiments. Nucleic Acids Res 48, D180–D188 (2020).
Mason, S. J. & Graham, N. E. Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation. Q. J. R. Meteorological Soc. 128, 2145–2166 (2002).
Benjamini, Y. & Y, H. Controlling the false discovery rate—a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
Acknowledgements
This research was supported by US National Institutes of Health grants R21AG060459 (L.W), R01AG061127 (L.W.), and R01AG062634 (E.R.M, L.W.). The ROSMAP study data were provided by the Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago. Data collection for the ROSMAP dataset was supported through funding by NIA grants P30AG10161, R01AG15819, R01AG17917, R01AG30146, R01AG36836, U01AG32984, U01AG46152, the Illinois Department of Public Health, and the Translational Genomics Research Institute. Data used in preparation of this article were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). As such, the investigators within the ADNI contributed to the design and implementation of ADNI and/or provided data but did not participate in analysis or writing of this report. A complete listing of ADNI investigators can be found at: http://adni.loni.usc.edu/wp-content/uploads/how_to_apply/ADNI_Acknowledgement_List.pdf Data collection and sharing for the ADNI dataset was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer’s Association; Alzheimer’s Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
Author information
Authors and Affiliations
Contributions
L.W., J.Y., E.R.M., T.C.S. designed the computational analysis. T.C.S., L.Z., L.G., M.A.S., A.V., L.W. analyzed the data. L.W., J.Y., E.R.M, T.C.S., X.C. contributed to interpretation of the results. T.C.S, L.W. wrote the paper, and all authors participated in the review and revision of the manuscript. L.W. conceived the original idea and supervised the project.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks Carolina Soriano-Tárraga, Esther Walton and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
C. Silva, T., Young, J.I., Zhang, L. et al. Cross-tissue analysis of blood and brain epigenome-wide association studies in Alzheimer’s disease. Nat Commun 13, 4852 (2022). https://doi.org/10.1038/s41467-022-32475-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-32475-x
This article is cited by
-
Distinct CSF biomarker-associated DNA methylation in Alzheimer’s disease and cognitively normal subjects
Alzheimer's Research & Therapy (2023)
-
Progress in Mechanisms, Pathways and Cohort Studies About the Effects of PM2.5 Exposure on the Central Nervous System
Reviews of Environmental Contamination and Toxicology (2023)
-
Multi-tissue epigenetic analysis identifies distinct associations underlying insulin resistance and Alzheimer’s disease at CPT1A locus
Clinical Epigenetics (2023)