Exceptionally long-lived species, including many bats, rarely show overt signs of aging, making it difficult to determine why species differ in lifespan. Here, we use DNA methylation (DNAm) profiles from 712 known-age bats, representing 26 species, to identify epigenetic changes associated with age and longevity. We demonstrate that DNAm accurately predicts chronological age. Across species, longevity is negatively associated with the rate of DNAm change at age-associated sites. Furthermore, analysis of several bat genomes reveals that hypermethylated age- and longevity-associated sites are disproportionately located in promoter regions of key transcription factors (TF) and enriched for histone and chromatin features associated with transcriptional regulation. Predicted TF binding site motifs and enrichment analyses indicate that age-related methylation change is influenced by developmental processes, while longevity-related DNAm change is associated with innate immunity or tumorigenesis genes, suggesting that bat longevity results from augmented immune response and cancer suppression.
DNA methylation (DNAm) influences many processes including development1, gene regulation2, genomic imprinting3, X chromosome inactivation4, transposable element defense5, and cancer6. Over 75% of cytosine-phosphate-guanine (i.e., CpG) sites are typically methylated in mammalian cells, but global DNAm declines with age, which can lead to loss of transcriptional control and either cause or contribute to, deleterious aging effects7. Conversely, DNAm often increases (i.e., shows hypermethylation) at CpG islands, which are CpG-dense regions often found in gene promoter regions near transcription start sites (TSS)8,9,10. Age-related changes in DNAm can be used to predict age in humans11,12 and are beginning to be used to predict age in other species13,14,15,16,17. Given that the aging of wild animals typically requires long-term mark-recapture data or lethal tissue sampling, an accurate, noninvasive aging method would enable the study of age-associated changes in traits critical for survival, such as sensory perception, metabolic regulation, and immunity in a variety of long-lived species.
DNAm has also been used to predict lifespan in humans18,19,20. Intriguingly, interventions known to increase lifespan in some mammals, such as caloric restriction, reduce the rate at which methylation changes21,22. Moreover, a comparison of age-related changes in DNAm across species22,23 suggests that DNAm rate also varies with lifespan. However, comparative studies have so far used different methods on a few primate, rodent, or canid species22,23 making it difficult to determine reasons for methylation differences.
The distribution and function of genomic regions that exhibit age or longevity-related changes in DNAm are not fully understood20,24. In humans, hypermethylated age-associated CpG sites tend to be near genes predicted to be regulated by transcription factors involved in growth and development, whereas hypomethylated sites are near genes from more disparate pathways25. A recent study in dogs also found that age-related hypermethylated sites are near genes that influence developmental processes14,17. Human aging has been associated with modification of histone marks and relocalization of chromatin-modifying factors in a tissue-dependent manner26. Comparative analysis of CpG density in conserved gene promoter regions has revealed that CpG density is positively related to lifespan in mammals27, as well as other vertebrates28, but the genes involved were not enriched for any pathway or biological process.
Bats have great potential for providing insight into mechanisms that reduce deleterious aging effects because species from multiple independent lineages have maximum lifespans more than four times greater than similar-sized mammals29 despite tolerating high viral loads30,31 and showing few signs of aging. Here, we use a custom microarray that assays 37,492 conserved CpG sites to measure DNAm from known-aged individuals of 26 species of bats and address three questions. (1) How accurately can chronological age be estimated in bats? (2) Does an age-related change in DNAm predict maximum lifespan? (3) What genes are nearest the sites where DNAm changes as a function of age or differences in longevity between species? We find that DNAm can predict the age of individual bats with high accuracy. At the species level, the rate of change in DNAm at age-associated sites also predicts maximum lifespan. CpG sites that are informative for age or longevity are more likely to gain methylation and be near promoter regions of transcription factors involved in developmental processes. Longevity-associated sites are, in addition, enriched for genes involved in cancer suppression or immunity.
Predicting individual age using DNAm
DNAm profiles were analyzed from 712 wing tissue biopsies taken either from captive or free-ranging individuals of known age representing 26 species and six families of bats. Probe sequences were mapped in the genomes of nine of these species (Supplementary Table 1) and a total of 35,148 probes were located in at least one bat genome. For the 2340 probes not mapped in any bat, the median DNAm mean (0.496) and coefficient of variation (CV = 0.051%), were nearly identical to the 62 human SNP probes on each array (median mean = 0.500, median CV = 0.032%). In contrast, probes that mapped to at least one bat had DNAm means ranging from 0.006 to 0.995 with a median of 0.634. To predict age, we used sites mapped in one or more species from the taxonomic group of interest, i.e., order, genera or species.
Similar to human epigenetic clocks12,20, elastic-net regression accurately predicted chronological age from a linear combination of DNAm beta values (henceforth DNAmAge) using 162 CpG sites. Leave-one-out (LOO) cross-validation shows that DNAm can predict age with a median absolute error (MAE) of 0.74 years (Fig. 1a). Limiting the analysis to smaller taxonomic groups (species or genera) can improve accuracy if sufficient data are available. For example, the correlation between chronological and DNAmAge in a LOO cross-validation analysis for 40–50 samples from a single species can be 0.96 or higher (Fig. 1b, c; Supplementary Fig. 1); a similar analysis on 176 samples from six Pteropus species gave a correlation of 0.97 (MAE = 0.77 years, Supplementary Fig. 2b). Thus, DNAm from a wing tissue sample for any of these species can reveal the animal’s age at the time of sampling to within a year.
To assess how well DNAm might predict age in a species not represented by our samples, we conducted a second cross-validation analysis in which data for one species was left out (leave-one-species out; LOSO) and ages were predicted for that species using a clock estimated from the remaining 25 species. This analysis (Fig. 1d) resulted in a correlation between observed and predicted age of 0.84 (MAE = 1.41 years). The LOSO analysis also showed that DNAm consistently underestimates age in some species, while overestimating age in others. For example, Desmodus rotundus (sp. 5, Fig. 1d) samples exhibit lower values of DNAmAge (suggesting lower aging rates) than Phyllostomus hastatus (sp. 15, Fig. 1d) samples, consistent with the longer lifespan of D. rotundus29.
Predicting species longevity from DNAm
To determine if the rate of DNAm change predicts variation in maximum lifespan among species, we incorporated a recent bat phylogeny32 into a generalized least squares regression (PGLS) to predict the longevity quotient (LQ)—the ratio of observed to expected maximum lifespan for a mammal of the same body size29. We first identified a common set of age-associated CpG sites for this analysis by conducting a meta-analysis of all age-DNAm correlations by the probe for 19 bat species with 15 or more samples (“Methods”). The top 2000 age-associated sites (henceforth, age differentially methylated positions or age DMPs) consist of 1165 sites that show age-associated hypermethylation and 835 sites exhibiting age-associated hypomethylation. Both mean rates of hypermethylation and hypomethylation predict LQ, such that long-lived species have lower rates of DNAm change (Fig. 2a, b). A PGLS analysis on maximum lifespan with body mass as a covariate gave very similar results (Supplementary Table 2). Assuming that the rate of change in DNAm reflects epigenetic stability, these results suggest that better epigenetic maintenance is associated with a longer maximum lifespan, independent of body size, across bats.
Identifying age and longevity-associated genes
To identify DMPs associated with differences in LQ (henceforth longevity DMPs), we compared relationships between DNAm and age for three long-lived species and two short-lived species (cf. Fig. 2) from four bat families. Longevity DMPs have a significant (BY FDR = 0.05) interaction term between age and longevity type in a linear mixed model with species as a random effect (see Methods, Supplementary Fig. 3). We identified 1491 longevity DMPs, including 694 in which short-lived species gain DNAm faster with age and 797 in which short-lived species lose DNAm faster.
Age and longevity DMPs are widely distributed in the genome but differ in relative abundance across chromosomes (Fig. 3a, b). For example, of the 1077 probes that map to chromosome 1 (syntenic with the human X chromosome) in R. ferrumequinum (a long-lived bat with the most mapped probes, 30,724, Supplementary Table 1) only 12 are age-associated while 46 are longevity-associated. Not surprisingly, 596 of 753 sites (79.2%) that differ in DNAm values between the sexes across species are on the R. ferrumequinum X chromosome. Sex DMPs are independent of age DMPs (6.1% overlap, P = 0.32, Fisher’s Exact Test, FET) and longevity DMPs (5.2% overlap, P = 0.10, FET). When limited to promoter regions, almost all age and longevity DMPs exhibit hypermethylation (Fig. 3c). Change in DNAm with respect to age correlates with a change in DNAm with respect to longevity (r = 0.454, P < 0.0001), which results in significant overlap among longevity and age DMPs (P < 0.0001, FET, Fig. 3d and Supplementary Fig. 5a) and among unique genes near those DMPs (Fig. 3e and Supplementary Fig. 5b).
Even though about 7000 unique CpG probe sequences on the mammalian methylation array are unmapped in a bat genome (Supplementary Table 2), the mapped CpG sites are typically (median = 93%) nearest the same gene in any pair of bats (Supplementary Fig. 4c). Furthermore, genomic regions occupied by age and longevity DMPs are similar among bat species (Supplementary Fig. 4). For example, 68% of 2874 probes that map to a promoter region in the short lifespan species, M. molossus, also map to a promoter region in the distantly related long lifespan species, R. ferrumequinum (Fig. 4b). Promoter regions are enriched for hypermethylating, but not hypomethylating, age, and longevity DMPs in M. molossus (Fig. 4c, d) and other bats (Supplementary Fig. 5c, d). In bat genomes where CpG islands have been identified (e.g., R. ferrumequinum) hypermethylating age DMPs are much more likely than hypomethylating age DMPs to be located in CpG islands (P < 0.0001, FET); the same is true for longevity DMPs (P < 0.0001, FET).
Given that regions near promoters contain more age and longevity DMPs than expected, we evaluated the genes nearest those DMPs for possible functions. In view of the overlap in age and longevity DMPs, not surprisingly, genes with age or longevity DMPs in promoter regions show similar patterns of enrichment among biological process categories, i.e., developmental process, transcription, and regulation of transcription are enriched in M. molossus (Fig. 4e). Genes with age DMPs in promoter regions are further enriched for multicellular organism development. With regard to protein class, gene lists for both age and longevity DMPs are enriched for homeodomain transcription factors containing helix-turn-helix motifs (Fig. 4f). These patterns are characteristic of other bat species too (Supplementary Fig. 6), although the gene list composition varies. For example, 142 hypermethylated age genes were identified across the four bat genomes used for identifying longevity DMPs. Of these genes, 89 exhibited the same DMP-gene association in at least 3 of the 4 genomes.
Comparisons between the age and longevity-related genes and several relevant gene lists provide additional evidence for gene function. For example, hypermethylated age genes in bats strongly overlap hypermethylated age genes recently reported for dogs17 (e.g., 83 of 143 hypermethylated dog genes are also related to age in the short lifespan M. molossus, P = 4.57e−54, FET). In contrast, only 5 of 60 hypomethylated dog genes are related to age in M. molossus (P = 0.21, FET). Molossus molossus age genes are not enriched for immunity genes (P = 0.24, FET) or genes that frequently mutated in cancer (P = 0.21, FET). However, M. molossus longevity genes exhibit significant overlap with genes involved in immunity (P = 0.002, FET) and genes frequently mutated in human tumors (P = 0.016, FET, Fig. 4g). Similar overlap patterns among immunity, longevity, and tumor-mutated genes also exist for long-lived bats (Supplementary Fig. 6).
While methylation in a promoter region can influence transcription, transcription regulation can also result from interactions among DNA-bound proteins that are in proximity due to chromatin folding33. To evaluate the possibility of either short or long-range transcriptional regulation, we used eFORGEv.2.034 to predict how DMPs likely influence regulatory regions. This program first identifies probe sequences as being associated with five core histone marks or 15 predicted chromatin states in prior epigenomic studies using over 100 cell lines from multiple tissue sources, then uses permutation tests against the species genomic background to determine which histone marks or chromatin states occur nonrandomly. Using probes mapped in the long lifespan species Desmodus rotundus as background, we find that age and longevity DMPs exhibiting hypermethylation are enriched for repressive histone H3 trimethylated at lysine27 (H3K27me3) and active H3K4me1 marks in relevant cell lines (Fig. 5a, b). Hypomethylated age DMPs are enriched in all tissues for H3K9me3, while hypomethylated longevity DMPs show no enrichment (Fig. 5a, b). Analysis of predicted chromatin states reveals that hypermethylated age DMPs are enriched in all tissues for repressed polycomb complexes, while hypomethylated age DMPs are enriched for quiescent chromatin states (Fig. 5c). Longevity DMPs, both hypermethylating and hypomethylating, also show enrichment for quiescent states, as well as enrichment for repressive polycomb complexes or enhanced bivalent states in some tissues (Fig. 5d).
Transcription factor (TF) motifs identified in DMP probe sequences that are involved in cell cycle regulation and genome stability are enriched among hypermethylating age sites (Fig. 5e). Several of those transcription factors, including Cut-like homeobox 1 (CUX1), AT-rich interaction domain 3A (ARID3), and E2f transcription factor 1 (E2F) are involved in cell cycle regulation35,36,37, while others, such as Zinc finger protein 161 (ZFP161), are involved in genome stability38. In contrast, hypomethylating age sites only overlap three TF clusters, one of which, IRF7, is a master regulator of the interferon-dependent innate immune response in bats39.
Longevity TF motifs are largely independent of age TF motifs (Fig. 5e), with one exception, c203-AP2/2, a cluster including Transcription factor AP-2 gamma (TFAP2C), which is involved in epidermal cell lineage commitment40 and regulation of tumor progression41. The other longevity TF motifs also have known associations with tumorigenesis. The c221-GCM1/3 transcription factor cluster includes Pleiomorphic adenoma gene-like 1 (PLAGL1), a protein that suppresses cell growth. The gene that encodes this protein is often methylated and silenced in cancer cells42. CNOT3 acts as a tumor suppressor in T-cell acute lymphoblastic leukemia (T-ALL)43 but can also facilitate the development of non-small cell lung cancer44. Finally, HIC1, Hypermethylated in cancer 1 protein, acts as a tumor suppressor and is involved in the regulation of p53 DNA damage responses45. Only a single TF motif, HD/5 in the BARHL2 group46, was associated with hypomethylated longevity DMPs.
Enrichment analyses47 using the age and longevity gene lists for M. molossus identify several key gene regulators that are significantly associated with hypermethylated sites, but none with hypomethylated sites (Fig. 5f and Supplementary Fig. 6c). Orthodenticle homeobox 2 (OTX2) and Re1 silencing transcription factor (REST) are associated with both age and longevity, whereas other predicted TFs largely differ between age and longevity. REST is induced during human aging and represses neuronal genes that promote cell death48. Note that four of nine transcription regulators predicted to be associated with longevity frequently undergo mutations in human tumors and three are involved in innate immunity (Fig. 5f).
As with other species13,14,17,49, age-related changes in DNAm occur throughout bat genomes. While 162 CpG sites are sufficient to predict chronological age, these represent only a small fraction of the sites that correlate with age, because penalized regression excludes highly correlated variables to avoid multi-collinearity. Consequently, we carried out a meta-analysis that correlated methylation at individual CpG sites with age across species to identify age DMPs. At these sites, long-lived species exhibit a lower rate of change in DNAm, while short-lived species exhibit faster increases in DNAm. How those changes contribute to longevity is not entirely clear, but our results suggest several key transcriptional regulators are involved and modulate the rate at which DNAm changes between short and long-lived species.
Our results are consistent with an epigenetic clock theory of aging that connects beneficial developmental and cell maintenance processes to detrimental processes causing tissue dysfunction20. A large body of evidence links age-related hypermethylated sites to genes and genomic regions that influence developmental processes9,10,17. We find that the sites that gain DNAm with age also tend to be in CpG islands, consistent with studies in humans50. In contrast, we find little enrichment for genes associated with hypomethylated sites, and these genes are less likely to be shared across species. We interpret these results to indicate that DNAm loss with age is widespread and not concentrated in particular pathways. DNAm gain with age, on the other hand, occurs predictably near genes involved in many of the same developmental processes in humans, mice, dogs, and bats, consistent with a shared mammalian origin.
Our analyses are based entirely upon wing biopsy samples and the reported DNAm patterns could differ by tissue, as has been frequently observed8. However, bat wing tissue is capable of unusually rapid regeneration51 and consists of multiple tissue types52, making it particularly useful for measuring age-related changes in DNAm. In addition, these non-lethal biopsies are relatively easy to obtain from wild-caught bats, thus allowing for future longitudinal and cross-sectional studies of epigenetic aging.
DNAm of genes suppressed in stem cells is a hallmark of cancer10. Several lines of evidence suggest that bat genes with longevity DMPs are important for cancer suppression and provide enhanced immunity. First, these genes disproportionately include many known to mutate frequently in human cancers or involved in innate immunity. Second, several transcription factors identified by motif analysis act as tumor suppressors, such that if they are silenced by methylation in older individuals, tumor formation should be more likely. Third, among the transcription factors identified from the list of genes with hypermethylated sites in promoter regions, several of them mutate in human cancers. While bats are not immune from cancer53, genetic adaptations for tumor suppression have been described for Myotis brandtii54 and M. myotis55 to help explain the extreme longevity of those species. Bats also have genetic mechanisms that enable strong antiviral immune responses without inducing damaging inflammatory reactions that may enable them to tolerate high levels of viral exposure30,31,56. The results of this study are consistent with the hypothesis that enhanced epigenetic stability, especially associated with innate immunity and cancer suppression genes, facilitates exceptional longevity in bats.
Wing tissue samples
Wing punches were taken from 778 individually marked animals that were either kept in captivity (15 species) or recaptured as part of long-term field studies (11 species). We excluded 42 samples because we did not have independent evidence to confirm minimum age estimates. For 630 samples the individual was marked shortly after birth, so age estimates were exact. For the remainder, age represented a minimum estimate because the individual was not initially banded as a juvenile. We used minimum age estimates when other evidence, such as tooth wear or time since initial capture, indicated that the minimum age estimate was likely to be close to the real age. In the Supplementary Methods, we provide additional information on when and where samples were taken from either captive or free-ranging animals and details of research permits and animal use protocols. We affirm that we have complied with relevant ethical regulations. The study was approved by the University of Maryland Institutional Animal Care and Use Committee (FR-APR-18-16).
After extraction DNA concentration was estimated with a QuBit and samples were concentrated, if necessary, to reach a minimum of 10 ng/µl in 20 µl. To estimate rates of methylation we limit analyses to the 23 species for which we had more than 10 samples from known-aged individuals. The maximum lifespan for each species was obtained from29 or from captivity records and is listed along with the range of ages of individuals sampled in Table 1.
All methylation data were generated using a custom Illumina methylation array (HorvathMammalMethylChip40) based on 37,492 CpG sites57. Out of these 37,492 sites, 1951 CpGs were selected based on their utility for estimating human age in prior human biomarker studies. The remaining 35,541 probes were chosen due to their location in highly conserved 50 bp sequences with a terminal CpG site. The particular subset of species for each probe is provided in the chip manifest file at the NCBI Gene Expression Omnibus (GEO) platform (GPL28271). Five bat genomes, Pteropus vampyrus, P. alecto, Eptesicus fuscus, Myotis davidii and M. lucifugus, were used in the design of the array.
Bisulfite conversion of DNA samples using the Zymo EZ DNA Methylation Kit (ZymoResearch, Orange, CA, USA), as well as subsequent Cy3 and Cy5 labeling, hybridization, and scanning (iScan, Illumina), were performed according to the manufacturers’ protocols by applying standard settings. DNAm levels (β values) were determined by calculating the ratio of intensities between methylated (signal A) and unmethylated (signal B) sites. Specifically, the β value was calculated from the intensity of the methylated (M corresponding to signal A) and unmethylated (U corresponding to signal B) sites, as the ratio of fluorescent signals β = Max(M,0)/[Max(M,0) + Max(U,0) + 100]. Thus, β values range from 0 (completely unmethylated) to 1 (completely methylated). The SeSaMe method58 was used to normalize β values for each probe. A cluster analysis by species identified 24 samples as outliers, likely due to their low DNA concentrations. After excluding these, along with the 42 excluded due to insufficient age information, we retained 712 of the 778 samples for further analysis.
Probe mapping and annotation
We used sequences and annotations for ten bat genomes (Supplementary Table 1), which include six recently published reference assemblies19, to locate each 50 bp probe on the array. The alignment was done using the QuasR package59 with the assumption for bisulfite conversion treatment of the genomic DNA. For each species’ genome sequence, QuasR creates an in-silico-bisulfite-treated version of the genome. The set of nucleotide sequences of the designed probes, which includes degenerate base positions due to the bisulfite conversion, was expanded into a larger set of nucleotide sequences representing every possible combination of degenerate bases. We then ran QuasR (a wrapper for Bowtie2) with parameters −k 2—strata—best −v 3 and bisulfite = undir to align the enlarged set of probe sequences to each prepared genome. From these files, we collected only alignments where the entire length of the probe perfectly matched the genome sequence (i.e., the CIGAR string 50 M and flag XM = 0).
Following the alignment, the CpGs were annotated based on the distance to the closest transcriptional start site using the ChIPseeker package60. A gff file with these was created using these positions, sorted by scaffold and position, and compared to the location of each probe in BAM format using Samtools. We report probes whose variants only mapped to one unique locus in a particular genome. Gene annotations for the ten bat genomes are available at http://hdl.handle.net/1903/26373.
The genomic location of each CpG was categorized as intergenic, 3′ UTR, 5′ UTR, promoter region (minus 10 kb to plus 1000 bp from the nearest TSS), exon, or intron. We identified X-linked probes in bat genomes by comparison to probes mapped to the X for the human genome, HG19. Tests for enrichment among genomic categories were performed with contingency or Fisher’s Exact tests (FET) in JMP Pro v14.1 for the four species used to identify longevity-associated sites, i.e., one short-lived bat, Molossus molossus, and three long-lived bats, Myotis myotis, Desmodus rotundus and Rhinolophus ferrumequinum, representing four different bat families. We did not include Leptonycteris yerbabuenae in these analyses because no genome is available for that species. While most sites map to the same nearest gene, some differences exist. In the text, we present enrichment results for the short-lived species, M. molossus, but provide parallel results in Supplementary Figures for one or more of the long-lived species, R. ferrumequinum, D. rotundus and M. myotis.
Creation of epigenetic clocks using penalized regression
We developed epigenetic clocks for bat wing tissue by regressing chronological age on all CpGs that map to at least one of the ten bat genomes. To improve linear fit we transformed chronological age to sqrt(age + 1). Penalized regression models were created in the R package glmnet61. We investigated models produced by elastic net regression (alpha = 0.5). The optimal penalty parameters in all cases were determined automatically by using a tenfold internal cross-validation (cv.glmnet) on the training set. By definition, the alpha value for the elastic net regression was set to 0.5 (midpoint between Ridge and Lasso-type regression) and was not optimized for model performance. We performed two cross-validation schemes for arriving at unbiased estimates of the accuracy of the different DNAm based age estimators. One type consisted of leaving out a single sample (LOO) from the regression, predicting an age for that sample by regressing an elastic net on the methylation profiles of all other samples and iterating over all samples. We conducted LOO analyses using all samples from all species, using all samples from each species and using all samples from several species in the same genus. The second type consisted of leaving out a single species (LOSO) from the regression, thereby predicting the age of each sample using the data for all other species.
Differentially methylated positions (DMPs) for age and longevity
To identify DMPs associated with age, we used WGCNA and METAL to compute the Pearson correlation coefficient between methylation level (β) and chronological age for each of the 37,492 sites for the 19 species with 15 or more samples (Table 1). The significance of each site across species was then evaluated using Stouffer’s unweighted z-test62. CpG sites were ranked by significance and the top 2000 sites based on the correlation with untransformed age were selected for subsequent analyses and are referred to as age DMPs. For probes with contrasting patterns in different species, methylation direction was assigned based on the most frequent direction across species to ensure mean methylation rates are comprised of the same set of sites in each species. Because we used all sites on the array, some sites do not map to a unique position in one or more bat genomes. Supplementary Table 1 indicates how many sites map to each species.
To identify DMPs associated with longevity we compared methylation rates between three long-lived species (R. ferrumequinum, D. rotundus, and M. myotis) and two short-lived species (M. molossus and L. yerbabuenae). We chose these five species because they represent three independent lineages of increased longevity29 and because high-quality genome assemblies are available for four of them63. We used a linear mixed-effects model (nlme) to fit methylation level (β) as a function of transformed chronological age (sqrt(age + 1)), longevity category, and their interaction, with species included as a random effect. We defined probes as longevity-associated if the p-value of the interaction term was less than 0.05 after Benjamini–Yekutieli (BY) false discovery rate (FDR) correction64. In this analysis, a positive interaction means a steeper positive slope for the short-lived species relative to the long-lived species. If the main effect of age is positive (hypermethylation) and the interaction is positive, then short-lived species are gaining methylation faster. If the main effect is negative and the interaction is negative, then short-lived species are losing methylation faster.
Phylogenetic analysis of bat longevity
Using phylogenetic generalized least squares regression (PGLS) we tested the effect of the mean rate of methylation change on longevity using both the LQ and maximum longevity (log-transformed). LQ is the ratio of the observed species maximum lifespan to the maximum lifespan predicted for a nonflying placental mammal of the same body mass29. We present results for LQ in the text and for a model containing both log(maximum longevity) and log(mass) in Supplementary Table 2. For each species with at least ten known-age samples, we calculated the mean rates of hypermethylation and hypomethylation using the top 2000 age-associated DMPs as described above. Hypermethylation and hypomethylation rates were tested separately. Phylogenetic relationships among bats are based on a recent maximum likelihood tree32. Models were fit via maximum likelihood using the gls function of the nlme R package and assume a Brownian model of trait evolution.
Probe and gene enrichment analyses
To determine how changes in methylation influence age and longevity, we conducted enrichment analyses on the CpG probes and on the genes nearest to them. We used eFORGE 2.034 to test for enrichment among age or longevity DMPs that either increase or decrease in methylation in comparison with five histone marks and 15 chromatin states mapped in cell lines by the Epigenomics Roadmap Consortium (http://www.ncbi.nlm.nih.gov/epigenomics). Bat wing tissue is unusual in that it contains epithelial skin, muscle, blood, and elastin52. Consequently, we limited enrichment analyses to data from cell lines derived either from skin, blood, or muscle. We also restricted the analysis to probes mapped in a bat genome at least 1 kb apart. We used Demodus rotundus to provide a background probe set but obtained very similar results by using other bat genomes, e.g., Eptesicus fuscus or Pteropus vampyrus, available in eFORGE as backgrounds for the mammalian methylation array. We present enrichment values for each DMP set as the −log10 p binomial value and consider those outside the 95th percentile of the binomial distribution after correction for multiple testing64 as significant.
We identified putative transcription factors that could utilize open chromatin and bind to the DNA by testing for enrichment in each DMP set for predicted binding sites among the probes on the mammalian methylation array. Binding sites were included if their FIMO (Find Individual Motif Occurrence) p-value was less than 10e−5. FIMO scans were performed using the MEME suite (v.4.12.0, available at http://meme-suite.org/doc/download.html). Bedtools (v.2.25.0) were used to intersect the mammalian methylation array file and provide probe-to-TF motif annotations. We then used a hypergeometric test (phyper) to evaluate overlap between probe sets and transcription factor motifs obtained from four transcription factor databases: TRANSFAC65, UniPROBE66, HT-Selex67, and JASPAR68. Redundant transcription factor motifs were then consolidated into clusters69 to identify distinct transcription factors. The function was inferred using information derived primarily from studies in mice and humans46.
We used several approaches for determining the type and function of genes associated with age and longevity DMPs. First, we identified the gene (using human orthologs) with the nearest TSS for every mapped probe in each of the four species used to identify longevity DMPs (R. ferrumequinum, Desmodus rotundus, Myotis myotis, and Molossus molossus). We then used the subsequent lists of unique genes for each species as background for enrichment tests. While the number of probes near a given gene varies considerably, each gene was covered on average by five probes. The number of unique genes with an identifiable human ortholog near a probe was 4918 in R. ferrumequinum, 4693 in M. molossus, 4611 in M. myotis, and 4534 in D. rotundus, reflecting the variation in the number of mapped probes (Supplementary Table 1). Given that the probes were designed to align to regions conserved across all mammals, we suspect some of the differences across species in gene associations reflect variation in genome assembly or annotation. In addition, an important caveat to keep in mind is that the CpGs on the array do not randomly sample the genome57. Thus, even when we use mapped probes or the genes near them as background for enrichment tests, there is potential for bias given that the probes are in conserved regions. We assumed a gene was associated with hypermethylated DMPs if it had more hypermethylated than hypomethylated sites nearest its TSS and vice versa. We present results in the text for DMP-gene associations for M. molossus because it was the only short-lived species with a genome, but we summarize the DMP-gene associations for the other three species in Supplementary Fig. 5. Because we anticipated the mechanisms responsible for causing increases in methylation over time likely differs from those causing decreases, we conducted separate enrichment tests for genes with hypermethylated and hypomethylated sites associated with age and longevity using Panther v.1670 in relation to biological process, molecular function, and protein class. We carried out enrichment tests using genes with DMPs in promoter regions because promoter regions showed enrichment for hypermethylated sites. To minimize redundancy due to the hierarchical organization of gene ontologies (GO), we present no more than three significant (after FDR correction) GO terms from each parent–child group. All significant GO terms can be found in the Source Data files. We also used the significant age and longevity gene promoter lists to predict possible transcription factor regulators using BART, Binding Analysis for Regulation of Transcription47, which correlates the cis-regulatory profile derived from a gene set to the genomic binding profiles of 918 transcription regulators using over 7000 human ChIP-seq datasets. We report the Irwin-Hall P-value, which indicates the significance of the rank integrated over three test statistics47.
In addition, we carried out additional analyses to assess gene function using three relevant gene lists. The first utilized a list of 394 genes associated with changes in methylation over the lifespan of dogs17. This study assayed over 50,000 CpG sites for 104 known-aged labrador dogs, and included methylation data from mice and humans, to identify 198 hypermethylated and 196 hypomethylated sites, with most of the hypermethylated sites near genes associated with anatomical development. By comparing gene lists, we identified the number of positive (and negative) methylated genes in the dog list that occur in the genome of each bat, and then used the number of genes in the bat, as well as the number of age-related genes in the bat and the number that overlap to calculate the probability associated with the overlap in each methylation direction. We used the R program phyper to conduct a Fisher’s Exact Test (FET) using the hypergeometric distribution.
The second test utilized a list of 576 genes that have been documented to mutate frequently in over 10,864 human tumor cases. We downloaded v1.25.1 from the Genome Data Center of the National Cancer Institute (https://portal.gdc.cancer.gov). As with the dog age genes above, we calculated the probability of overlap between the cancer genes found in the genomes of each of four bat species and both the bat age and longevity gene lists using a FET.
The third test involved comparing a list of 4723 innate immunity genes downloaded from https://www.innatedb.com (Aug 14, 2020). As with the cancer gene list, we calculated the probability of overlap between the immunity genes found in the genome of the four bat genomes and both the bat age and longevity gene lists using a FET.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
All data used in this study are freely available. Normalized methylation values for each sample, along with sample metadata, are available from NCBI GEO as series GSE164127 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE164127). The design of the Illumina microarray (HorvathMammalMethylChip40) is available from the Gene Expression Omnibus (GEO) at NCBI as platform GPL28271 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GPL28271). Microarray probe annotations for ten bat genomes are available from the Digital Repository at the University of Maryland (DRUM) at http://hdl.handle.net/1903/26373. Coefficients from the penalized regressions used to estimate bat age for different taxonomic groups are available at https://doi.org/10.6084/m9.figshare.c.5257271. Transcription factor databases used in this study are available as follows: TRANSFAC (http://gene-regulation.com/pub/databases.html), UniPROBE (http://thebrain.bwh.harvard.edu/uniprobe/), HT-Selex (https://ccg.epfl.ch/htpselex/) and JASPAR (http://jaspar.genereg.net/downloads/). Source data for figures are provided with this paper.
R code for implementing the analyses described in this paper is available at https://doi.org/10.6084/m9.figshare.c.5257271.
Razin, A. & Riggs, A. D. DNA methylation and gene function. Science 210, 604–610 (1980).
Chen, Y., Breeze, C. E., Zhen, S., Beck, S. & Teschendorff, A. E. Tissue-independent and tissue-specific patterns of DNA methylation alteration in cancer. Epigenet. Chrom. 9, 10 (2016).
Shemer, R. et al. Dynamic methylation adjustment and counting as part of imprinting mechanisms. Proc. Natl Acad. Sci. USA 93, 6371–6376 (1996).
Gartler, S. M. & Riggs, A. D. Mammalian X-chromosome inactivation. Annu. Rev. Genet. 17, 155–190 (1983).
Choi, J., Lyons, D. B., Kim, M. Y., Moore, J. D. & Zilberman, D. DNA methylation and histone H1 jointly repress transposable elements and aberrant intragenic transcripts. Mol. Cell 77, 310–323 e317 (2020).
Klutstein, M., Nejman, D., Greenfield, R. & Cedar, H. DNA methylation in cancer and aging. Cancer Res. 76, 3446–3450 (2016).
Unnikrishnan, A. et al. Revisiting the genomic hypomethylation hypothesis of aging. Ann. N. Y. Acad. Sci. 1418, 69–79 (2018).
Day, K. et al. Differential DNA methylation with age displays both common and dynamic features across human tissues that are influenced by CpG landscape. Genome Biol. 14, R102 (2013).
Rakyan, V. K. et al. Human aging-associated DNA hypermethylation occurs preferentially at bivalent chromatin domains. Genome Res. 20, 434–439 (2010).
Teschendorff, A. E. et al. Age-dependent DNA methylation of genes that are suppressed in stem cells is a hallmark of cancer. Genome Res. 20, 440–446 (2010).
Hannum, G. et al. Genome-wide methylation profiles reveal quantitative views of human aging rates. Mol. Cell 49, 359–367 (2013).
Horvath, S. DNA methylation age of human tissues and cell types. Genome Biol. 14, R115 (2013).
Stubbs, T. M. et al. Multi-tissue DNA methylation age predictor in mouse. Genome Biol. 18, 68 (2017).
Thompson, M. J., vonHoldt, B., Horvath, S. & Pellegrini, M. An epigenetic aging clock for dogs and wolves. Aging 9, 1055–1068 (2017).
Polanowski, A. M., Robbins, J., Chandler, D. & Jarman, S. N. Epigenetic estimation of age in humpback whales. Mol. Ecol. Res. 14, 976–987 (2014).
Wright, P. G. R. et al. Application of a novel molecular method to age free-living wild Bechstein’s bats. Mol. Ecol. Res. 18, 1374–1380 (2018).
Wang, T. et al. Quantitative translation of dog-to-human aging by conserved remodeling of the DNA methylome. Cell Syst. 11, 176–185 (2020).
Chen, B. H. et al. DNA methylation-based measures of biological age: meta-analysis predicting time to death. Aging 8, 1844–1865 (2016).
Marioni, R. E. et al. DNA methylation age of blood predicts all-cause mortality in later life. Genome Biol. 16, 25 (2015).
Horvath, S. & Raj, K. DNA methylation-based biomarkers and the epigenetic clock theory of ageing. Nat. Rev. Genet. 19, 371–384 (2018).
Cole, J. J. et al. Diverse interventions that extend mouse lifespan suppress shared age-associated epigenetic changes at critical gene regulatory regions. Genome Biol. 18, 58 (2017).
Maegawa, S. et al. Caloric restriction delays age-related methylation drift. Nat. Commun. 8, 539 (2017).
Lowe, R. et al. Ageing-associated DNA methylation dynamics are a molecular readout of lifespan variation among mammalian species. Genome Biol. 19, 22 (2018).
Sen, P., Shah, P. P., Nativio, R. & Berger, S. L. Epigenetic mechanisms of longevity and aging. Cell 166, 822–839 (2016).
Marttila, S. et al. Ageing-associated changes in the human DNA methylome: genomic locations and effects on gene expression. BMC Genom. 16, 179–179 (2015).
Kane, A. E. & Sinclair, D. A. Epigenetic changes during aging and their reprogramming potential. Crit. Rev. Biochem. Mol. Biol. 54, 61–83 (2019).
McLain, A. T. & Faulk, C. The evolution of CpG density and lifespan in conserved primate and mammalian promoters. Aging 10, 561–572 (2018).
Mayne, B., Berry, O., Davies, C., Farley, J. & Jarman, S. A genomic predictor of lifespan in vertebrates. Sci. Rep. 9, 17866 (2019).
Wilkinson, G. S. & Adams, D. M. Recurrent evolution of extreme longevity in bats. Biol. Lett. 15, 20180860 (2019).
Ahn, M. et al. Dampened NLRP3-mediated inflammation in bats and implications for a special viral reservoir host. Nat. Microbiol. 4, 789–799 (2019).
Gorbunova, V., Seluanov, A. & Kennedy, B. K. The world goes bats: living longer and tolerating viruses. Cell Metab. 32, 31–43 (2020).
Amador, L. I., Arevalo, R. L. M., Almeida, F. C., Catalano, S. A. & Giannini, N. P. Bat systematics in the light of unconstrained analyses of a comprehensive molecular supermatrix. J. Mamm. Evol. 25, 37–70 (2018).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Breeze, C. E. et al. eFORGE v2.0: updated analysis of cell type-specific signal in epigenomic data. Bioinformatics 35, 4767–4769 (2019).
Arthur, R. K., An, N., Khan, S. & McNerney, M. E. The haploinsufficient tumor suppressor, CUX1, acts as an analog transcriptional regulator that controls target genes through distal enhancers that loop to target promoters. Nucleic Acids Res. 45, 6350–6361 (2017).
Wilsker, D., Patsialou, A., Dallas, P. B. & Moran, E. ARID proteins: a diverse family of DNA binding proteins implicated in the control of cell growth, differentiation, and development. Cell Growth Differ 13, 95–106 (2002).
Engelmann, D. & Putzer, B. M. The dark side of E2F1: in transit beyond apoptosis. Cancer Res. 72, 571–575 (2012).
Kim, W. et al. ZFP161 regulates replication fork stability and maintenance of genomic stability by recruiting the ATR/ATRIP complex. Nat. Commun. 10, 5304 (2019).
Zhou, P. et al. IRF7 in the Australian black flying fox, Pteropus alecto: evidence for a unique expression pattern and functional conservation. PloS ONE 9, e103875 (2014).
Li, L. et al. TFAP2C- and p63-dependent networks sequentially rearrange chromatin landscapes to drive human epidermal lineage commitment. Cell Stem Cell 24, 271–284 e278 (2019).
Orso, F. et al. AP-2alpha and AP-2gamma regulate tumor progression via specific genetic programs. Faseb J. 22, 2702–2714 (2008).
Poulin, H. & Labelle, Y. The PLAGL1 gene is down-regulated in human extraskeletal myxoid chondrosarcoma tumors. Cancer Lett. 227, 185–191 (2005).
De Keersmaecker, K. et al. Exome sequencing identifies mutation in CNOT3 and ribosomal genes RPL5 and RPL10 in T-cell acute lymphoblastic leukemia. Nat. Genet. 45, 186–190 (2013).
Shirai, Y. T. et al. CNOT3 targets negative cell cycle regulators in non-small cell lung cancer development. Oncogene 38, 2580–2594 (2019).
Kumar, S. P53 induction accompanying G2/M arrest upon knockdown of tumor suppressor HIC1 in U87MG glioma cells. Mol. Cell Biochem. 395, 281–290 (2014).
Lambert, S. A. et al. The human transcription factors. Cell 172, 650–665 (2018).
Wang, Z. et al. BART: a transcription factor prediction tool with query gene sets or epigenomic profiles. Bioinformatics 34, 2867–2869 (2018).
Lu, T. et al. REST and stress resistance in ageing and Alzheimer’s disease. Nature 507, 448–454 (2014).
Petkovich, D. A. et al. Using DNA methylation profiling to evaluate biological age and longevity interventions. Cell Metab. 25, 954–960 e956 (2017).
Christensen, B. C. et al. Aging and environmental exposures alter tissue-specific DNA methylation dependent upon CpG island context. PLoS Genet. 5, e1000602 (2009).
Faure, P. A., Re, D. E. & Clare, E. L. Wound healing in the flight membranes of big brown bats. J. Mammal 90, 1148–1156 (2009).
Cheney, J. A., Allen, J. J. & Swartz, S. M. Diversity in the organization of elastin bundles and intramembranous muscles in bat wings. J. Anat. 230, 510–523 (2017).
Olds, J. E. et al. Retrospective evaluation of cases of neoplasia in a captive population of Egyptian fruit bats (Rousettus aegyptiacus). J Zoo Wildl Med 46, 325–332 (2015).
Seim, I. et al. Genome analysis reveals insights into physiology and longevity of the Brandt’s bat Myotis brandtii. Nat. Commun. 4, 2212 (2013).
Huang, Z., Jebb, D. & Teeling, E. C. Blood miRNomes and transcriptomes reveal novel longevity mechanisms in the long-lived bat, Myotis myotis. BMC Genom. 17, 906 (2016).
Banerjee, A. et al. Novel insights into immune systems of bats. Front. Immunol. 11, 26 (2020).
Arneson A., et al. A mammalian methylation array for profiling methylation levels at conserved sequences. Preprint at http://www.biorxiv.org/content/10.1101/2021.01.07.425637v1. (2021).
Zhou, W., Triche, T. J. Jr., Laird, P. W. & Shen, H. SeSAMe: reducing artifactual detection of DNA methylation by Infinium BeadChips in genomic deletions. Nucleic Acids Res. 46, e123 (2018).
Gaidatzis, D., Lerch, A., Hahne, F. & Stadler, M. B. QuasR: quantification and annotation of short reads in R. Bioinformatics 31, 1130–1132 (2015).
Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 (2015).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33, 1–22 (2010).
Stouffer S. A., Suchman E. A., DeVinney L. C., Star S. A., Williams R. M. J. The American Soldier, Vol 1: Adjustment during Army Life. (Princeton University Press, 1949).
Jebb, D. et al. Six reference-quality genomes reveal evolution of bat adaptations. Nature 583, 578–584 (2020).
Benjamini, Y. & Yekutieli, D. The control of the false discovery rate in multiple testing under dependency. Ann. Stat. 29, 1165–1188 (2001).
Matys, V. et al. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 34, D108–D110 (2006).
Hume, M. A., Barrera, L. A., Gisselbrecht, S. S. & Bulyk, M. L. UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res. 43, D117–D122 (2015).
Yin, Y. et al. Impact of cytosine methylation on DNA binding specificities of human transcription factors. Science 356, eaaj2239 (2017).
Fornes, O. et al. JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 48, D87–D92 (2020).
Maurano, M. T. et al. Large-scale identification of sequence variants influencing human transcription factor occupancy in vivo. Nat. Genet. 47, 1393–1401 (2015).
Mi, H. et al. Protocol Update for large-scale genome and gene function analysis with the PANTHER classification system (v.14.0). Nat. Protoc. 14, 703–721 (2019).
This work was supported by a Paul G. Allen Frontiers Group grant to S.H., the University of Maryland, College of Computer, Mathematical and Natural Sciences to G.S.W., an Irish Research Council Consolidator Laureate Award to E.C.T., a UKRI Future Leaders Fellowship (MR/T021985/1) to S.C.V. and a Discovery Grant from the Natural Sciences and Engineering Research Council (NSERC) of Canada to P.A.F. S.C.V. and P.D. were supported by a Max Planck Research Group awarded to S.C.V. by the Max Planck Gesellschaft, and S.C.V. and E.Z.L. were supported by a Human Frontiers Science Program Grant (RGP0058/2016) awarded to S.C.V. L.J.G. was supported by an NSERC PGS-D scholarship. We thank the Neurogenomics Core at UCLA for laboratory assistance, A. Lollar for providing Tadarida samples, M. Brooks for sharing a new maximum recorded lifespan for Pteropus giganteus, K. Bennett for graphical assistance, and to the Banbury Center, Cold Spring Harbor Labs for hosting the workshop that inspired this collaboration.
S.H. is a founder of the non-profit Epigenetic Clock Development Foundation which plans to license several patents from his employer UC Regents. These patents list S.H. as inventor. The other authors declare no competing interests.
Peer review information Nature Communications thanks Gary Churchill, Vera Gorbunova and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wilkinson, G.S., Adams, D.M., Haghani, A. et al. DNA methylation predicts age and provides insight into exceptional longevity of bats. Nat Commun 12, 1615 (2021). https://doi.org/10.1038/s41467-021-21900-2