Epigenetic regulation involves changes in gene expression independent of DNA sequence variation that are inherited through cell division. In addition to a fundamental role in cell differentiation, some epigenetic changes can also be transmitted transgenerationally through meiosis. Epigenetic alterations (epimutations) could thus contribute to heritable variation within populations and be subject to evolutionary processes such as natural selection and drift. However, the rate at which epimutations arise and their typical persistence are unknown, making it difficult to evaluate their potential for evolutionary adaptation. Here, we perform a genome-wide study of epimutations in a metazoan organism. We use experimental evolution to characterize the rate, spectrum and stability of epimutations driven by small silencing RNAs in the model nematode Caenorhabditis elegans. We show that epimutations arise spontaneously at a rate approximately 25 times greater than DNA sequence changes and typically have short half-lives of two to three generations. Nevertheless, some epimutations last at least ten generations. Epimutations mediated by small RNAs may thus contribute to evolutionary processes over a short timescale but are unlikely to bring about long-term divergence in the absence of selection.
Subscribe to Journal
Get full journal access for 1 year
only $9.92 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
High-throughput sequencing data for small RNAs and polyA-selected RNAs have been submitted to the Sequence Read Archive under accession no. PRJNA553063.
The custom code and R markdown documents generated in this project have been uploaded to GitHub (https://github.com/belmaran/epimutations).
Holliday, R. Epigenetics: a historical overview. Epigenetics 1, 76–80 (2006).
Heard, E. & Martienssen, R. A. Transgenerational epigenetic inheritance: myths and mechanisms. Cell 157, 95–109 (2014).
Burggren, W. Epigenetic inheritance and its role in evolutionary biology: re-evaluation and new perspectives. Biology (Basel) 5, 24 (2016).
Shirayama, M. et al. piRNAs initiate an epigenetic memory of nonself RNA in the C. elegans germline. Cell 150, 65–77 (2012).
Ashe, A. et al. piRNAs can trigger a multigenerational epigenetic memory in the germline of C. elegans. Cell 150, 88–99 (2012).
Luteijn, M. J. et al. Extremely stable Piwi-induced gene silencing in Caenorhabditis elegans. EMBO J. 31, 3422–3430 (2012).
Buckley, B. A. et al. A nuclear Argonaute promotes multigenerational epigenetic inheritance and germline immortality. Nature 489, 447–451 (2012).
Sapetschnig, A., Sarkies, P., Lehrbach, N. J. & Miska, E. A. Tertiary siRNAs mediate paramutation in C. elegans. PLoS Genet. 11, e1005078 (2015).
Alcazar, R. M., Lin, R. & Fire, A. Z. Transmission dynamics of heritable silencing induced by double-stranded RNA in Caenorhabditis elegans. Genetics 180, 1275–1288 (2008).
Perales, R. et al. Transgenerational epigenetic inheritance is negatively regulated by the HERI-1 chromodomain protein. Genetics 210, 1287–1299 (2018).
Moore, R. S., Kaletsky, R. & Murphy, C. T. Piwi/PRG-1 Argonaute and TGF-β mediate transgenerational learned pathogenic avoidance. Cell 177, 1827–1841.e12 (2019).
Posner, R. et al. Neuronal small RNAs control behavior transgenerationally. Cell 177, 1814–1826.e15 (2019).
de Albuquerque, B. F. M., Placentino, M. & Ketting, R. F. Maternal piRNAs are essential for germline development following de novo establishment of endo-siRNAs in Caenorhabditis elegans. Dev. Cell 34, 448–456 (2015).
Phillips, C. M., Brown, K. C., Montgomery, B. E., Ruvkun, G. & Montgomery, T. A. piRNAs and piRNA-dependent siRNAs protect conserved and essential C. elegans genes from misrouting into the RNAi pathway. Dev. Cell 34, 457–465 (2015).
Katju, V. & Bergthorsson, U. Old trade, new tricks: insights into the spontaneous mutation process from the partnering of classical mutation accumulation experiments with high-throughput genomic approaches. Genome Biol. Evol. 11, 136–165 (2019).
Konrad, A. et al. Mutational and transcriptional landscape of spontaneous gene duplications and deletions in Caenorhabditis elegans. Proc. Natl Acad. Sci. USA 115, 7386–7391 (2018).
Katju, V., Packard, L. B. & Keightley, P. D. Fitness decline under osmotic stress in Caenorhabditis elegans populations subjected to spontaneous mutation accumulation at varying population sizes. Evolution 72, 1000–1008 (2018).
Konrad, A., Brady, M. J., Bergthorsson, U. & Katju, V. Mutational landscape of spontaneous base substitutions and small indels in experimental Caenorhabditis elegans populations of differing size. Genetics 212, 837–854 (2019).
Bergthorsson, U. et al. Long-term experimental evolution reveals purifying selection on piRNA-mediated control of transposable element expression. Preprint at bioRxiv https://www.biorxiv.org/content/10.1101/666693v2.full (2019).
Tang, W., Tu, S., Lee, H.-C., Weng, Z. & Mello, C. C. The RNase PARN-1 trims piRNA 3′ ends to promote transcriptome surveillance in C. elegans. Cell 164, 974–984 (2016).
Gu, W. et al. Distinct Argonaute-mediated 22G-RNA pathways direct genome surveillance in the C. elegans germline. Mol. Cell 36, 231–244 (2009).
Almeida, M. V. et al. GTSF-1 is required for formation of a functional RNA-dependent RNA polymerase complex in Caenorhabditis elegans. EMBO J. 37, e99325 (2018).
Zhou, X. et al. Nuclear RNAi contributes to the silencing of off-target genes and repetitive sequences in Caenorhabditis elegans. Genetics 197, 121–132 (2014).
Claycomb, J. M. et al. The Argonaute CSR-1 and Its 22G-RNA cofactors are required for holocentric chromosome segregation. Cell 139, 123–134 (2015).
Wedeles, C., Wu, M. Z. & Claycomb, J. M. Protection of germline gene expression by the C. elegans Argonaute CSR-1. Dev. Cell 27, 664–671 (2013).
Seth, M. et al. The C. elegans CSR-1 Argonaute pathway counteracts epigenetic silencing to promote germline gene expression. Dev. Cell 27, 656–663 (2013).
Denver, D. R., Morris, K., Lynch, M. & Thomas, W. K. High mutation rate and predominance of insertions in the Caenorhabditis elegans nuclear genome. Nature 430, 679–682 (2004).
Denver, D. R. et al. A genome-wide view of Caenorhabditis elegans base-substitution mutation processes. Proc. Natl Acad. Sci. USA 106, 16310–16314 (2009).
Meier, B. et al. C. elegans whole-genome sequencing reveals mutational signatures related to carcinogens and DNA repair deficiency. Genome Res. 24, 1624–1636 (2014).
Evans, K. J. et al. Stable Caenorhabditis elegans chromatin domains separate broadly expressed and developmentally regulated genes. Proc. Natl Acad. Sci. USA 113, E7020–E7029 (2016).
Frøkjær-Jensen, C. et al. An abundant class of non-coding DNA can prevent stochastic gene silencing in the C. elegans germline. Cell 166, 343–357 (2016).
McMurchy, A. N. et al. A team of heterochromatin factors collaborates with small RNA pathways to combat repetitive elements and germline stress. eLife 6, e21666 (2017).
Greer, E. L. et al. Transgenerational epigenetic inheritance of longevity in Caenorhabditis elegans. Nature 479, 365–371 (2011).
Cubas, P., Vincent, C. & Coen, E. An epigenetic mutation responsible for natural variation in floral symmetry. Nature 401, 157–161 (1999).
Manning, K. et al. A naturally occurring epigenetic mutation in a gene encoding an SBP-box transcription factor inhibits tomato fruit ripening. Nat. Genet. 38, 948–952 (2006).
Schmitz, R. J. et al. Transgenerational epigenetic instability is a source of novel methylation variants. Science 334, 369–373 (2011).
Becker, C. et al. Spontaneous epigenetic variation in the Arabidopsis thaliana methylome. Nature 480, 245–249 (2011).
van der Graaf, A. et al. Rate, spectrum, and evolutionary dynamics of spontaneous epimutations. Proc. Natl Acad. Sci. USA 112, 6676–6681 (2015).
Hagmann, J. et al. Century-scale methylome stability in a recently diverged Arabidopsis thaliana lineage. PLoS Genet. 11, e1004920 (2015).
Reik, W., Dean, W. & Walter, J. Epigenetic reprogramming in mammalian development. Science 293, 1089–1093 (2001).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Ramírez, F. et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165 (2016).
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Anders, S., Pyl, P. T. & Huber, W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B Stat. Methodol. 57, 289–300 (1995).
Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open v.4.0 (Academic software, 2013–2015).
Emms, D. M. & Kelly, S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 16, 157 (2015).
Visser, I. & Speekenbrink, M. depmixS4: an R package for hidden Markov models. J. Stat. Softw. 36, 1–21 (2010).
Allen, M. A., Hillier, L. W., Waterston, R. H. & Blumenthal, T. A global analysis of C. elegans trans-splicing. Genome Res. 21, 255–264 (2011).
We thank the London Institute of Medical Sciences genomics facility for sequencing support. We thank B. Lehner for critical comments on the manuscript. We thank M. Merkenshlager, T. Warnecke and E. Martinez-Perez for invaluable feedback on the project at various stages. We thank members of the Sarkies and Katju laboratories for helpful discussions. Work in the Sarkies laboratory is funded by the UK Medical Research Council.
The authors declare no competing interests.
Peer review information Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
a, Heatmap depicting all vs all correlation values calculated for the set of epimutated genes in MA25 and MA100 lines, after log2 transformation of the normalised counts matrix. b, Barplot outlining the number of genes epimutated in 1,2,3, up to 22 lines. Epimutated genes were defined as those with a different small RNA state (defined by k-means clustering) compared to the starting population (PeMA).
a, Overlap between genetic mutations detected by high throughput genome sequencing after 400 generations of selfing (MA400 lines, Konrad et. al., 2019), and epimutations detected in MA25 and MA100 lines. b, Representative examples of 22G-RNA counts across lines for epimutable genes with an overlapping mutation in any one line, regardless of their epimutation status. A mutation was considered to overlap with a gene if located within the gene or 1kb flanking regions. Lines with an overlapping mutation are shown in red. c, 22G-RNA counts in the two epimutations in lineage G overlapping with genetic mutations as in b.
Extended Data Fig. 3 Examples of long-lasting epimutations and genes with reduced within-lineage variation.
a, 22G normalised counts across lines for genes showing low p-values in both the variance test (pvar) and the epistates test (pst). N indicates the number of matching states. The red line indicates the threshold separating high and low small RNA states according to k-means clustering. b, 22G signal profile for F52C9.1.2 in the parental line (PeMA), in lineage D (epimutated) and lineage F (no change in 22G-RNAs).
a, b, Comparison of methods to estimate the duration of epimutations. Duration distributions are shown in A, and survival curves in B. c, Genomic features influencing the duration of epimutations. Survival curves estimated from either the k-means or the HMM datasets are shown, showing qualitatively similar trends.
a, Example of a gene displaying bistable 22G-RNA levels (compare with Fig. 3d). b, Method to examine the transition in 22G-RNA levels around the epimutation. Three generations before and after the transition were selected, and a linear model was fit to the data, recovering an r2 that represents the linearity of the change. c, Histogram illustrating the distribution of r2 values obtained from the analysis in b applied to all epimutations. d, Mean normalized small RNA levels within groups spanning different linearity in 22G-RNA alterations. e, Classification into fluctuating, bistable and gradual changes in small RNAs. Grey lines show each individual epimutation and the mean across each group is shown as a thick coloured line. f, Association of different 22G-RNA dynamics with HRDE-1, piRNA and WAGO-1 target genes. The p-value for a difference in proportion between targets and non-targets according to a Fisher’s exact test is shown above each plot. g, The duration of bistable and gradual changes is longer than fluctuating epimutable genes. The p-value for a difference in proportion between <=2 or >2 generations of epimutations is shown above the plot.
a, Test for short-term inheritance. By comparing the intergenerational variance with the overall variance across the lineage, genes with heritable variation in 22G-RNA levels are identified. b, p-value and FDR histograms from the test for short-term inheritance, in lineages A and B. c, Overlap of genes with heritable 22G-RNA levels with small RNA pathway gene targets. d, Distribution duration of runs of consecutive generations with low intergenerational variance. These were defined as consecutive generations with a difference lower than 30% of the standard deviation in 22G-RNA levels across the lineage. e, Overlap of genes with heritable variation in 22G-RNA levels with epimutable genes in MA25 and MA100 lines, and epimutable in short-term lineages A and B. f, Examples of 22G-RNA dynamics at genes with heritable variation in 22G-RNA levels in lineages A and B.
a, Visualisation of changes in 22G-RNA levels against changes at the mRNA level. Genes with significantly different mRNA levels between the high and low small RNA states are shown in green. b, Correlation between the absolute change in 22G-RNA and mRNA levels, for all epimutations (top panel), and for genes with positively correlated (bottom left) and negatively correlated (bottom right) mRNA-22G changes. c, Genes with correlated changes in 22G-RNAs and mRNAs are marginally enriched in the epimutations set (blue line) compared to random subsets of genes with >10 22G-RNA normalised counts. This enrichment is highest for cases where there is a negative correlation between 22G-RNA and mRNA levels. d, e, Example of gene with negatively correlated 22G-RNA and mRNA levels. f, Visualisation of changes in 22G-RNA levels against changes at the mRNA level for genes with significantly different mRNA levels between the high and low small RNA states, coloured by chromatin domain location. g., Comparison of the proportions of genes with positively and negatively correlated changes in 22G-RNAs and mRNAs in active and regulated chromatin domains.
a, b, Variability analysis of piRNAs (A) and miRNAs (B). c, Enrichment of genes with hypervariable 22G-RNAs (HV22Gs) in groups of genes defined according to their evolutionary conservation. d, Epimutations unique to MA100 lines are hypervariable in the MA25 dataset. e, Epimutations unique to MA25 lines are hypervariable in the MA100 dataset. f, Epimutations from MA25 lines do not show increased variation at the mRNA level. g, Epimutations from MA100 lines do not show increased variation at the mRNA level. h, Comparison of mRNA abundance of HV22Gs and non-HV22Gs for all genes, and within small RNA pathway gene targets. Genes with >20 normalised 22G counts were considered for this analysis. Box plot shows interquartile range with a line at median, and whiskers extend to the greatest point no more than 1.5 times the interquartile range. i, j, Correlation analysis between mRNA abundance and variability in 22G-RNAs, for genes with >20 22G normalised counts, separating piRNA, HRDE-1 and WAGO-1 targets (F) and CSR-1 targets (G).
a, Number of neighbouring genes observed in the set of epimutable genes, compared to a null distribution derived from randomly sampled sets of genes of the same size. b, Number of groups of 2,3 and 4 consecutive epimutable genes identified. c, Examples of highly correlated (r>0.75) pairs of epimutable genes. 22G-RNA counts in each line for each gene in the pair are shown. d, Comparison of the odds ratio of enrichment in chromatin domains and small RNA regulatory pathways of the set of clustered epimutable genes (29) and the full set of epimutable genes (442). The red dashed line (slope=1, intercept=0) corresponds to identical enrichment in both sets. Deviation towards the top of the line shows stronger enrichment in the clustered set. e, f, Enrichment of clustered epimutable genes in H3K9me3 (e) and H3K27me3 (f) chromatin, compared to a null distribution derived from randomly sampled sets of epimutable genes of the same size.
a, Summary of epimutations at transposable elements (TEs) and other repeats. TEs are only moderately enriched in the epimutation sets from the MA25 and MA100 lines, and are not enriched in the set of epimutations observed in the consecutive generation experiment. b, Repeat class annotations of epimutated TEs and other repeats. c, Small RNA level states for epimutated TEs and other repeats in MA25 and MA100 lines, showing little correspondence between states. d, Distribution of variance test p-values for TEs and other repeats compared to the rest of genes. No TEs or other repeats had significantly reduced variance within lineages after multiple testing correction. e, f, Variability analysis of 22G-RNAs mapping to TEs and other repeats in MA25 (e) and MA100 (f) lines, showing increased variability in comparison to CSR-1 targets.
About this article
Cite this article
Beltran, T., Shahrezaei, V., Katju, V. et al. Epimutations driven by small RNAs arise frequently but most have limited duration in Caenorhabditis elegans. Nat Ecol Evol 4, 1539–1548 (2020). https://doi.org/10.1038/s41559-020-01293-z