Introduction

Alzheimer disease (AD) is a neurodegenerative disorder characterized neuropathologically by neurofibrillary tangles and amyloid plaques [1]. The apolipoprotein E (APOE) ε4 variant is the strongest genetic risk factor for late-onset AD, while the ε2 variant has been shown to confer protection against AD, in a dose-dependent manner [2,3,4]. Single copies of the ε4 and ε2 alleles are associated with 3 to 4-fold increased and 0.61-fold decreased risk of AD, respectively. Previous studies identified APOE genotype-specific mechanisms including the complement pathway and blood–brain barrier dysfunction [5,6,7].

Large-scale genome-wide association studies (GWAS) have identified contributions to AD risk from more than 75 independent loci, but the large portion of heritability of the disease is unexplained [8]. Emerging omics technologies have prompted investigations of gene expression and epigenetic profiles at the tissue and cellular levels. For example, it has been shown that the degree of methylation of cytosine-phosphate-guanine (CpG) dinucleotides in brain differ between AD cases and controls in novel regions as well as in loci previously associated with AD risk such as BIN1 [9]. Methylation levels at multiple CpG sites assessed in peripheral blood have also been associated with cognitive decline and AD progression [10].

Several CpG sites in the APOE region are differentially methylated in AD cases compared to controls, and distinct methylation patterns have been observed between persons with the ε3/ε3 and ε3/ε4 genotypes [11]. In addition, the APOE region has been shown to be differentially methylated between healthy ε2 and ε4 carriers in blood [12]. However, despite these findings, the effect of APOE genotypes, especially on the genome-wide level for AD risk remains relatively unknown. Here, we analyzed methylation array data from blood and brain tissue in three datasets to discover APOE genotype-dependent genome-wide associations of methylation with AD risk and related traits, as well as co-methylation networks. The primary goals of this study were to determine whether APOE ε4 carrier status affects methylation levels and identify potential functional relationships between methylation levels at CpG sites and AD risk. We hypothesize that distinct epigenetic profiles in APOE ε4 carriers comparing in APOE ε4 non-carriers modulate cognitive performance and neuropathological traits. To understand clinical stage-dependent effects of APOE ε4 carrier status, we conducted differential methylation analysis between APOE ε4 carriers and non-carriers in all, AD, and control subjects using three methylation datasets from blood and brain. Different sources of epigenetic profiles illustrated shared and distinct APOE ε4 dependent differential methylation levels between blood and brain.

Methods

Sources of methylation and phenotypic data

Data were obtained for participants of three cohort studies including the Religious Orders Study and Rush Memory and Aging Project (ROSMAP), Alzheimer’s Disease Neuroimaging Initiative (ADNI), and Framingham Heart Study (FHS). All analyses were conducted separately within each dataset and the results were not combined because of heterogeneity among datasets.

Religious Orders Study and Rush Memory and Aging Project

Clinical, neuropathological information, APOE genotyping, and preprocessed, quality controlled, and normalized brain HumanMethylation450 BeadChip methylation array data derived from dorsolateral prefrontal cortex area tissue of autopsied brains donated by 697 ROSMAP participants (417 autopsy-confirmed AD cases and 280 controls) [13,14,15] were obtained from the CommonMind Consortium portal (http://www.synapse.org) (Supplementary Table 1). AD diagnosis was determined using National Institute of Aging (NIA) Reagan criteria for intermediate or high probability of AD [16]. AD-related traits included Braak staging for neurofibrillary tangles [17] and the Consortium to Establish a Registry for Alzheimer Disease (CERAD) semi-quantitative criteria for measuring neuritic plaques (CERAD Score) [18]. ROS and MAP were both approved by an Institutional Review Board of Rush University Medical Center. All participants signed an informed consent, Anatomic Gift Act, and repository consent.

Alzheimer’s Disease Neuroimaging Initiative

Infinium® MethylationEPIC BeadChip beta values and phenotype data from blood were obtained from the LONI website (http://adni.loni.usc.edu) for 630 ADNI participants including 91 with clinical AD diagnosis, 329 with mild cognitive impairment (MCI), and 210 controls [10]. Methylation array iDAT files were processed and normalized using wateRmelon [19]. Because methylation was measured in DNA extracted from blood specimens obtained at multiple examinations, methylation data from the earliest timepoint were analyzed. Among the available endophenotypic data, analyses included magnetic resonance imaging (MRI) volumetric measures of ventricles, and hippocampus, and entorhinal thickness, as well as neuropsychological test scores consisted of the Alzheimer’s Disease Assessment Scale – 13 item (ADAS13), Clinical Dementia Rating Scale Sum of Boxes (CDRSB), logical memory - delayed recall (LDELTOTAL), Rey Auditory Verbal Learning Test (RAVLT) immediate, RAVLT learning, and RAVLT percent forgetting.

Framingham Heart Study

Cognitive test and normalized HumanMethylation450 BeadChip methylation array data from blood were obtained for 1,391 cognitively healthy participants from the generation 3 cohort in FHS [20]. Cognitive test scores at the same time point of methylation measurement included the Paired Associate Learning - Recognition (PASr) test, Logical Memories – Immediate Recall (LMi) test, Logical Memories – Delayed Recall (LMd) test, Similarities Test (SIM), Visual Reproductions – Delayed Recall (VRd), Trail A (trailsA) test, the animal portion of the Verbal Fluency Test (FAS_animal), and Boston Naming Test (BNT30).

Differential methylation analysis

Differential methylation between AD and control samples was performed in the ADNI and ROSMAP datasets using the LIMMA software [21]. The methylation percentage of each CpG site, defined as the proportion of total signal from the methylation-specific probe, was compared between AD cases and controls using linear regression models including sex, age, and batch as covariates. Genome-wide methylation analyses were conducted in the total sample and separately within APOE ε4 carriers (ε2/ε4, ε3/ε4, and ε4/ε4) and non-carriers (ε2/ε2, ε2/ε3, and ε3/ε3). Genome-wide methylation analysis between APOE ε4 carriers and non-carriers was performed in the ADNI and ROSMAP datasets using LIMMA and regression models including terms for age, sex, and batch in the total sample and separately within AD and control groups. Since the FHS dataset included cognitively unimpaired participants, we did not conduct differential expression analysis between AD and control subjects. Genome-wide methylation analysis between APOE ε4 carriers and non-carriers for each CpG site was conducted in FHS using lmekin package in R and a linear mixed effect model incorporating a genetic relatedness matrix (GRM) as a random effect with sex and age at exam as covariates. The GRM was generated using genetic dosage data and the software GCTA [22] to account for familial relationships among 8,481 FHS participants.

Association of methylation with expression of genes in the APOE region

RNA-sequencing (RNA-seq) data derived from ROSMAP brains were obtained and processed as previously described [7]. Matched RNA-seq and methylation data were available for 510 ROSMAP participants (297 AD cases, 213 controls). Normalized gene-expression microarray data were obtained from the LONI website (http://adni.loni.usc.edu) for 159 ADNI subjects (42 AD cases, 117 controls) who also had matching methylation array data. Significantly methylated CpG sites between APOE ε4-carriers and non-carriers (p = 5.0 × 10−8) in APOE region genes (NECTIN2, APOC1, APOE, and TOMM40) were selected for further analysis. The association of gene expression and methylation levels at the CpG sites in the APOE region was evaluated using linear regression models with covariates including age, sex, RNA integrity number (RIN), RNA batch, and methylation batch. Post-mortem interval (PMI) was included as an additional covariate in analyses of the ROSMAP dataset that had this information. Analyses were performed in the total sample and separately within APOE ε4 carriers and non-carriers.

Association of methylation with quantitative traits

Quantitative or semi-quantitative traits in all three datasets were rank-transformed after adjusting for age and sex as previously described [23]. In the ROSMAP dataset, the association of CpG site methylation with rank-transformed Braak stage and CERAD score was assessed using regression models including batch as a covariate. In the ADNI dataset, the association of CpG site methylation with cognitive test scores and imaging phenotypes were assessed using regression models including covariates of batch and education for cognitive traits and of batch and intracranial volume for imaging phenotypes. In the FHS dataset, we tested the association of CpG site methylation with cognitive test scores using a linear mixed effects model accounting for education and family structure with the GRM as covariates. All association models in each dataset were evaluated in the total sample and separately within groups of APOE ε4 carriers and non-carriers. Significance thresholds were set independently for each dataset using a Bonferroni threshold based on the number of CpG site tests (ROSMAP: 25 CpG, P < 2.0 × 10−3; ADNI: 15 CpG sites. P < 1.4 × 10−3; FHS: 3 CpG sites, P < 1.6 × 10−2).

Co-methylation network analysis

Co-methylation networks were generated with differentially methylated CpG sites (P < 0.05) between APOE ε4 carriers and non-carriers in the ADNI and ROSMAP datasets using the weighted correlation network analysis (WGCNA) program [24]. Analyses in the ADNI dataset also included data from 329 subjects with MCI. We selected four and six soft-power parameters in the ADNI and ROSMAP datasets, respectively, as previously described [5]. CpG percentages were hierarchically clustered using a dissimilatory topological overlap matrix (TOM). Modules with a minimum of 100 CpG sites were created using dynamic tree cutting, networks with similar eigenvalues and a height of 0 were merged using WGCNA’s mergeCloseModules function. The signedKME function assigned fuzzy module membership. We identified networks exhibiting significantly different methylation levels of eigenvalues between APOE ε4 carriers and non-carriers and between AD cases and controls, determined by a Student’s t test, which were selected for subsequent analysis. Biological pathways (MSigDB_Hallmark_2020) and drug perturbations (Drug_Perturbations_from_GEO_2014) for each network were identified using the EnrichR program applied to genes containing CpG sites in significant networks [25]. QIAGEN Ingenuity Pathway Analysis (IPA) software was used to create a biological network containing overlapping genes across modules from the EnrichR analysis.

Results

Differentially methylated CpG sites between AD cases and controls

Methylation levels in 697 brain samples from ROSMAP participants and 301 blood samples from ADNI participants were independently compared between AD cases and controls in the total sample, and within APOE ε4 carrier and non-carrier subgroups (Fig. 1a and Supplementary Table 1). In both datasets, there were no genome-wide significant (P < 5 × 10−8) differentially methylated CpG sites between AD cases and controls in those with or without ε4. However, analysis of the brain data revealed, moderately different (P < 10−5) methylation levels at 3 CpG sites among ε4 carriers and 22 CpG sites among ε4 non-carriers (Table 1 and Supplementary Fig. 1a). Of the 25 CpG sites that were differentially methylated in either APOE genotype subgroup, approximately half (13 CpG sites) were moderately differentially methylated in the total sample (Table 1). Most of the CpG sites that were differentially methylated among ε4 non-carriers (20/25 = 80%) were significantly associated (P < 2.0 × 10−3) with Braak stage and/or CERAD score (Fig. 1b and Supplementary Table 2). Methylation levels of two intergenic CpG sites (cg05731218 and cg12307200) were lower in AD cases compared to controls in the total sample and associated with both Braak stage and CERAD score at a genome-wide significance level (P < 5 × 10−8). The most significant association of methylation CpG sites located within genes were observed for cg10907744 in GPR133 with Braak stage (P = 5.8 × 10−6) and CERAD score (P = 4.6 × 10−6) and for cg19987111 in CHSY1 with Braak stage (P = 8.5 × 10−8).

Fig. 1: Differential methylation between AD cases and controls grouped by APOE ε4 carrier status.
figure 1

a Study Design. Methylation array data were obtained from blood (2 datasets) and frozen brain tissue (1 dataset). Methylation of CpG sites was compared between AD cases and controls as well as between APOE ε4 carriers and non-carriers. Association of methylation at CpG sites with neuropathological, cognitive and imaging traits was also evaluated. CpG sites with nominally significant P values (P < 0.05) between APOE ε4 carriers and non-carriers were incorporated in co-methylation network analyses performed separately for data derived from blood and brain. Finally, biological pathways and drug perturbations were identified from analyses of co-methylation networks. Figure created with biorender.com. b Heatmap showing association of neuropathological traits with methylation at CpG sites that were differentially methylated (P < 10−5) between AD cases and controls. CpG sites whose degree of methylation was significantly (P < 2.0 × 10−3) associated with multiple testing correction with at least one trait are indicated by an asterisk. c Heatmap showing association of cognitive and MRI imaging traits with methylation at CpG sites that were differentially methylated (P < 10−5) between AD cases and controls. Direction of differential methylation between AD and controls in APOE ε4 carriers or non-carriers in blood was shown. CpG sites whose degree of methylation was significantly associated with at least one trait after multiple testing correction (P < 1.4 × 10−3) are indicated by an asterisk.

Table 1 Differentially methylated CpG sites between AD and control brains in the total sample, APOE ε4 carriers, and non-carriers.

Moderately significant differential methylation between AD cases and controls from blood were observed in 21 CpG sites among ε4 carriers and 15 CpG sites in ε4 non-carriers (Supplementary Table 3 and Supplementary Fig. 1b). In contrast to the findings in the brain data, none of these CpG sites improved P values in the total sample (Supplementary Table 3). Methylation of the 15 CpG sites, eleven among ε4 carriers and four among ε4 non-carriers, was significantly (multiple testing correction P < 1.4 × 10−3) associated with performance on global and domain-specific cognitive tests, and/or MRI brain imaging measures (Fig. 1c). Methylation of the CpG site cg09825488 in EXO5 was increased in AD cases compared to controls in ε4 non-carriers (P = 2.9 × 10−6, Supplementary Table 3) and significantly associated with both global cognitive tests and several domain-specific cognitive tests (Supplementary Table 4). Methylation of three CpG sites (cg00198525, cg18714484, and cg25190513) in ε4 carriers was significantly associated with the volume of cortical brain regions (Supplementary Table 4). Methylation at the cg18714484 in CHEK1 was decreased in AD cases compared to controls in ε4 carriers (P = 2.2 × 10−6; Supplementary Table 3) and inversely associated with global cognitive (P = 1.8 × 10−5), memory performance (4.2 × 10−5) and ventricle volume (P = 8.1 × 10−4).

Differentially methylated CpG sites between APOE ε4 carriers and non-carriers

Eight CpG sites were significantly differentially methylated (P < 5 × 10−8) between APOE ε4 carriers and non-carriers in the ADNI, FHS, and ROSMAP datasets, and seven of the 8 CpG sites are located within the APOE region (chr19:45380000-45430000) (Table 2 and Fig. 2a). In brain, cg02613937 located in TOMM40 was the most significant CpG site (hypomethylated in ε4 carriers compared to non-carriers) in the total sample (P = 1.3 × 10−13) and most of the evidence was derived from AD cases (P = 7.0 × 10−13). Methylation at the cg02613937 was not significantly associated with expression of genes in the APOE region (Fig. 2b). In contrast, three CpG sites from the APOE were hypermethylated in APOE ε4 carriers in brain (cg14123992, cg04406254) and blood (cg06750524) compared in ε4 non-carriers (Table 2 and Fig. 2a). Methylation at both APOE CpG sites in brain was nominally associated (P < 0.05) with the APOE expression in ε4 carriers only (Fig. 2b). Three CpG sites from the APOC1 in blood were significantly hypomethylated in ε4 carriers compared to non-carriers in either AD cases (cg07773593) or controls (cg23270113 and cg05644480). Methylation of cg07773593 was nominally significant (p < 0.05) with lower APOC1 expression in the total sample and ε4 non-carriers (Supplementary Fig. 2). The methylation level of CpG site cg07773593 measured at baseline and two successive one-year time intervals was not significantly different (Supplementary Fig. 3).

Table 2 Differentially methylated CpG sites between APOE ε4 carriers and non-carriers in total, AD cases, and controls.
Fig. 2: Differential methylation in the APOE region between APOE ε4 carriers and non-carriers.
figure 2

a Regional plot of the APOE region. Differential methylation between ε4 carriers and non-carriers is shown for the total sample (green line), AD cases (blue line), and controls (pink line) across three datasets. X-axis represents CpG sites that were significantly differentially methylated between APOE ε4 carriers and non-carriers at a genome-wide significance level (P < 5 × 10−8). Y-axis indicates the log10 P-value of hypermethylation (>0) or hypomethylation (<0). b Heatmap showing association of methylation in brain with expression of APOE and adjacent genes. Significant (P < 0.05) associations are indicated by an asterisk. c Heatmap showing association of methylation in blood with cognitive test scores in the FHS dataset. Significant (P < 0.05) associations are indicated by an asterisk.

Among the significant CpG sites from blood in the APOE region between ε4 carriers and non-carriers in the FHS participants (Table 2), increased methylation at cg06750524 from the APOE was associated with poor memory performance measured by the VRd (P = 0.04) test in APOE ε4 carriers (Fig. 2c and Supplementary Table 5). Increased methylation at cg23270113 and cg05644480 from the APOC1 was significantly (P < 1.6 × 10−2) associated with worse performance on trailsA (P = 6.9 × 10−3), PASr (P = 6.7 × 10−3), and BNT30 (P = 0.01) tests in ε4 non-carriers. In APOE ε4 carriers, lower methylation at cg23270113 from the APOC1 was associated with poor memory performance measured by the LMd (P = 0.05) and LMi (P = 0.05) tests, and lower methylation at cg05644480 from the APOC1 was associated with poor performance on the SIM (P = 0.05).

Co-methylation networks

The average methylation level for each of five networks from brain data and three networks from blood data was significantly different between AD and control subjects as well as between ε4 carriers and non-carriers (Table 3 and Supplementary Figs. 4 and 5). Five networks (mod2, mod3, mod4, mod5, and mod8) were significantly enriched for eleven pathways (Fig. 3a). These five networks contained 60 overlapping genes whose expression levels were modified by estradiol (Fig. 3b and Supplementary Table 6). These 60 genes were biologically connected as a subnetwork (Fig. 3c). Four of these networks (excluding mod5) were enriched for the estrogen response early pathway (Table 3 and Fig. 3a). GPR133, a member of mod2 and mod3 networks, was differentially methylated between AD cases and controls lacking ε4 (P = 6.4 × 10−6) and significantly associated with Braak stage (P = 5.8 × 10−6) and CERAD score (P = 4.6 × 10−6) (Table 1 and Supplementary Table 2). Mod5, the only network not enriched for estrogen response early, was uniquely enriched for the E2F target pathway and for seven unique drug perturbation sets (Table 3 and Fig. 3a, b). Mod2 was enriched for estradiol as well as enriched for the complement pathway, mitotic spindles, and TGF-beta signaling (Fig. 3a,b). Mod8 was the only blood network showing significant enrichment for drug perturbations and biological pathways that overlapped significant modules derived from brain (Fig. 3a, b).

Table 3 Summary of co-methylated networks from ROSMAP and ADNI associated with AD and APOE ɛ4 status.
Fig. 3: Co-methylation networks.
figure 3

Co-methylation networks included genes with significantly differentially methylated CpG sites (P < 0.05) between APOE ε4 carriers and non-carriers. Bar plots indicate significant AD and APOE genotype-related co-methylation networks containing genes enriched for (a) biological pathways and (b) drug perturbation gene-sets (i.e., genes whose expression is modified by a drug). Networks with significant pathway or drug gene-set enrichment (adjusted P < 0.05) are shown. c Biological subnetwork including genes from multiple co-methylation networks enriched for genes whose expression is perturbed by estradiol.

Discussion

We identified 25 CpG sites in brain and 36 CpG sites in blood that were differentially methylated in AD cases compared to controls in an APOE genotype-specific manner. Multiple CpG sites in the APOE region were differentially methylated between ε4 carriers and non-carriers in brain or blood. Methylation of several of these CpG sites in blood was significantly associated with performance on cognitive tests in either ε4 carriers or non-carriers. Lastly, we derived eight unique co-methylation networks across blood and brain showing significant differential methylation patterns between AD cases and controls and between ε4 carriers and non-carriers. Five of eight (62.5%) networks included genes enriched for an estradiol drug perturbation gene-set and four of these 5 networks were involved in estrogen response pathway. These findings suggest that AD-related methylation patterns are dependent on APOE genotypes and may be targeted by estrogen modulating drugs. This conclusion is supported by consistent of findings in very differently ascertained datasets for the top loci and in methylation networks enriched for the estrogen pathway.

CpG sites in GPR133 and CHSY1 were hypermethylated in brain from AD cases lacking APOE ε4 and associated with measures of plaque and tangle pathology. GPR133 is a member of the adhesion G protein-coupled receptor family, several of which have been implicated in AD and proposed as potential drug targets for neurological disease [26]. A deletion in CHSY1 causes an increased inflammatory response and hippocampal neurodegeneration in mice [27]. We also observed blood hypomethylation at CpG sites from the CHEK1 gene in AD cases carrying APOE ε4. CHEK1 induces astrogliosis in AD brains and inhibits PP2A which was linked in APOE genotype-specific patterns to AD and AD-related traits, gene expression analysis, and experimental studies [6, 28].

Association of AD with variants in the APOE region has been extensively evaluated [29]. While the link between APOE isoforms and AD risk is well established, independent associations for AD with other genes near the APOE gene including TOMM40 and APOC1 are less conclusive because they often do not replicate across ancestry populations and are confounded by high linkage disequilibrium with APOE variants [30, 31]. However, methylation studies have consistently shown unique and strong differential methylation patterns by APOE genotypes in AD cases and controls [11, 12]. We confirmed decreased methylation on APOC1 in blood and increased methylation on APOE in brain and blood among ε4 carriers compared to non-carriers, while decreased methylation of ε4 carriers on TOMM40 in brain. These results suggest possible distinct contributions of these genes in the APOE region between blood and brain tissues through differential regulation on methylation sites to AD risk. Additionally, we confirmed a recent study that increased methylation in brain at the APOE CpG sites was associated with increased APOE expression only in APOE ε4 carriers [32]. Future studies are necessary to understand the exact mechanisms involved with methylation and AD between blood and brain tissues in an APOE genotype-specific manner.

We identified pathways enriched for genes in APOE genotype and AD-specific co-methylation networks that were derived from differentially methylated CpG sites between ε4 carriers and non-carriers in brain or blood. One of the brain networks was enriched for genes in complement pathway that was previously linked to AD in an APOE genotype-specific manner [5, 6]. Five of the eight APOE ε4 associated networks showed significant enrichment with genes perturbed by a drug, estradiol. Estradiol has been associated with increased cognitive function in both animals and humans [33]. Loss of estrogen in post-menopausal women has been associated with increased AD risk [34] and estrogen replacement therapy has shown to decrease AD risk in post-menopausal women [35], particularly among those under age 64 [36]. Furthermore, the effect of estrogen use on AD risk may be limited to ε4 non-carriers [37]. A recent study showed estrogen decreased amyloid-β accumulation in the hippocampus and cortex in mice lacking ε4 [38].

Our study has several limitations. First, since our datasets used different array platforms to generate genome-wide methylation levels, we were unable to replicate the exact CpG sites. However, these independent datasets from blood and brain enhanced the validity of findings since they were co-localized within the candidate genes. In addition, the importance of blood and the brain together has been important to understanding the whole scope of AD especially due to the blood–brain barrier. Second, none of the FHS participants with methylation data had AD due to their relatively young age. However, our study provided a clue in epigenetic signatures between APOE ε4 carriers and non-carriers among cognitively intact subjects in FHS. Together with findings in AD cases, we can help predict future cognitive decline and neurodegeneration due to distinct epigenetic profiles in different APOE genotype subgroups. Third, pathway enrichment analysis was conducted using genes with network CpG sites under the assumption of each CpG site directly modulating the corresponding gene. This assumption may not hold when the CpG site regulates a long-distant gene. Fourth, phenotype data were not comparable across datasets; in particular, neuropathological measures (i.e., Braak stage and CERAD score) were available for ROSMAP, whereas ADNI and FHS featured cognitive test data. Fifth, since these three cohorts were heterogeneous regarding tissue source, age distribution, ascertainment, and methylation array platform, we were unable to consider these datasets as direct replication sets. However, despite presence of this heterogeneity, we observed similar association patterns nearby CpG sites, which enhanced the validity of findings. In addition, the importance of blood and the brain together has been incredibly important to understanding the whole scope of AD especially due to the blood–brain barrier. Finally, the datasets included in this study were too small to account for sex differences after stratification by APOE genotype or AD status. As a result, we were unable to evaluate sex effect, especially genes involved in estrogen response pathways.

In conclusion, we identified differentially methylated CpG sites in many genes including APOE that were also associated with AD and related traits. Many of these associations were APOE genotype- or tissue-specific. AD and APOE genotype-specific methylation networks were linked to estrogen response and an estrogen replacement therapy, estradiol. Future studies are required to evaluate the contributions of methylation and APOE genotypes to beneficial effects of estrogen as an AD risk-lowering therapy.