Transposon age and non-CG methylation

Wang, Zhengming; Baulcombe, David C.

doi:10.1038/s41467-020-14995-6

Download PDF

Article
Open access
Published: 06 March 2020

Transposon age and non-CG methylation

Nature Communications volume 11, Article number: 1221 (2020) Cite this article

9938 Accesses
52 Citations
40 Altmetric
Metrics details

Subjects

Abstract

Silencing of transposable elements (TEs) is established by small RNA-directed DNA methylation (RdDM). Maintenance of silencing is then based on a combination of RdDM and RNA-independent mechanisms involving DNA methyltransferase MET1 and chromodomain DNA methyltransferases (CMTs). Involvement of RdDM, according to this model should decrease with TE age but here we show a different pattern in tomato and Arabidopsis. In these species the CMTs silence long terminal repeat (LTR) transposons in the distal chromatin that are younger than those affected by RdDM. To account for these findings we propose that, after establishment of primary RdDM as in the original model, there is an RNA-independent maintenance phase involving CMTs followed by secondary RdDM. This progression of epigenetic silencing in the gene-rich distal chromatin is likely to influence the transcriptome either in cis or in trans depending on whether the mechanisms are RNA-dependent or -independent.

RNA interference-independent reprogramming of DNA methylation in Arabidopsis

Article 30 November 2020

MBD2 couples DNA methylation to transposable element silencing during male gametogenesis

Article Open access 15 January 2024

The chromatin remodeler DDM1 prevents transposon mobility through deposition of histone variant H2A.W

Article 08 April 2021

Introduction

Epigenetic control of genomes is associated with chemical modifications of DNA and histone¹, or both, that are often linked to silencing of transposable elements (TEs) as part of a genome defense system². There are also effects of these modifications on gene expression, chromosome behavior and differentiation of pericentric heterochromatin and distal euchromatin. Full understanding of genomes, therefore, requires knowledge of factors affecting the establishment and maintenance of epigenetic marks. The simplest explanation of these factors invoke DNA sequence elements or structures that are associated with recruitment of DNA- or histone-methyltransferases to chromatin domains through the RNA-directed DNA methylation (RdDM) and other pathways². Once established, DNA methylation can be maintained through cell division by different factors depending on the sequence context^2,3.

To investigate factors influencing epigenetic marks in plants we focus here on tomato in which there are large and well-separated pericentric and distal chromatin domains^4,5. In Arabidopsis, in contrast, the pericentric region is relatively small and largely restricted to narrow domains around the centromere and small knobs on chromosome 4⁶. DNA methylation in tomato, as in other plants, occurs at CG, CHG and CHH sequence contexts (H indicates A, T, or C). Loss of CG methylation in the met1 mutant disrupts growth rate, flowering time, and gametogenesis in Arabidopsis^7,8,9. In contrast the loss of CHG or CHH methylation does not show any obvious phenotype in Arabidopsis¹⁰, presumably because the levels of CHG/CHH methylation in WT is lower than CG methylation^11,12.

To gain insights into the roles of CHG methylation in tomato, we utilized CRISPR to knock out tomato homologs of KYP and CMT3 which are required for maintenance of CHG methylation^13,14. Genome-wide methylation profiles reveal that, as in Arabidopsis, SlKYP and SlCMT3 are required for CHG methylation maintenance, and CHH methylation to a lesser extent. However, unlike Arabidopsis, single slkyp or slcmt3 mutants have pleiotropic phenotypes affecting growth, development and reproduction. Our analysis of these mutants together with the slnrpd1 mutant¹⁵ identifies anti-correlation between CMT3-mediated and RdDM-mediated TE silencing in distal chromatin. In terms of intact long terminal repeats (LTR) transposons, CMT3 tends to target younger elements than RdDM.

Results

CRISPR-based mutation of SlKYP and SlCMT3

Tomato is a good epigenetic model plant because it has a clearly delineated heterochromatic region in the pericentromere. There are mutants and knock down lines of epigenetic pathway genes and a well characterized effect of DNA methylation in fruit ripening^15,16,17,18. To extend our previous analysis of DNA methylation in tomato we used CRISPR small guide RNAs (sgRNAs) to target tomato orthologs of AtCMT3 and AtKYP—the major genes involved in CHG methylation in Arabidopsis^13,14. AtKYP binds methylated CHG, mediates H3K9 di-methylation that, in turn, recruits CMT3^19,20. The SlMET1l (Solyc01g006100) and SlMET3l (Solyc12g100330)^21,22 loci are two tomato orthologs with similarity to AtCMT3 (Supplementary Fig. 1a) and we refer to them as (Sl)CMT3a and (Sl)CMT3b. The tomato ortholog of AtKYP is SlSDG5 (Solyc02g094520) (Supplementary Fig. 1b,c)²³ and we refer to it as (Sl)KYP.

Consistent with the functions of their Arabidopsis orthologs the CMT3a, CMT3b, and KYP mutations (Supplementary Fig. 2) had no effect on CG methylation (Supplementary Figs. 3a and 4a)²⁴. However, also as in Arabidopsis, CHG methylation in cmt3a and kyp was lower than WT with kyp having a stronger effect than cmt3a (Supplementary Fig. 3b). There was also a genome-wide reduction of CHH methylation in kyp (Supplementary Fig. 3c). In cmt3b there was no effect in any context (Supplementary Fig. 4). From these results we conclude that SlCMT3a rather than SlCMT3b is the primary functional homolog of AtCMT3. Most (89%) of the differentially methylated regions (DMRs) (CHG hypomethylation) in kyp (Supplementary Fig. 5a,b) overlapped with cmt3a DMRs (Supplementary Fig. 5c). The CHH kyp and cmt3a hypoDMRs represent subsets of kyp and cmt3a CHG hypoDMRs (Supplementary Fig. 5c).

The kyp and cmt3a but not cmt3b mutants were stunted with wrinkled, asymmetric leaves with yellow spots, few and abnormal flowers, reduced fruit production and with no seeds (Fig. 1a). As DNA methylation may cause transcriptional silencing we predicted that the disrupted growth would be due to CHG hypomethylation in promoter regions leading to gene over-expression. Consistent with that idea, there were 362 upregulated genes and only 31 that were downregulated in kyp (Supplementary Data 1). Similarly, 340 genes were upregulated in cmt3a and 114 were downregulated (Supplementary Data 2). There is an overlap in half of the upregulated but very few of the downregulated genes in kyp and cmt3a (Fig. 1b).

**Fig. 1: SlKYP and SlCMT3a are required for tomato growth, development and reproduction.**

Corresponding to the proposed link of DMRs with CMT- and KYP-mediated effects on gene expression, there was a dramatic reduction of CHG methylation and a slight reduction of CHH methylation in the gene body and flanking region of upregulated genes in kyp and cmt3a compared with random genes (Fig. 1c). Consistent with this pattern, there were more hypoCHG DMRs (both kyp DMRs and cmt3a DMRs) in upregulated genes in mutants compared with random genes (Fig. 1d).

It is likely that the CMT- and KYP-mediated DMRs and differentially expressed genes are related to TEs. The pericentric distribution of CMT-dependent CHG methylation and KYP-dependent CHG and CHH methylation (Supplementary Figs. 5d,e and S6), for example, coincides with the chromosomal distribution of Gypsy and Copia LTR elements identified by RepeatModeler²⁵ and anti-correlates with the distribution of TIR elements and protein-coding genes (Supplementary Fig. 7). In addition, the overexpressed genes were more associated with LTR TEs than the random genes (Fig. 1e and Supplementary Fig. 8).

KYP/CMT3a and RdDM in pericentric and distal chromatin

Therefore, to better understand CMT- and KYP-dependent CHG and CHH methylation, we analyzed the methylation pattern of TEs in the wild type and mutant plants. As the tomato chromosomes are predominantly pericentric heterochromatin we looked separately at the pericentric and distal chromosome regions (Fig. 2 and Supplementary Figs. 6 and 9). For comparison, we include data from an RdDM-defective nrpd1 mutant that has a greater effect on CHH methylation in the distal chromosome arms than in the pericentric region¹⁵.

**Fig. 2: DNA methylation of TEs in pericentric and distal chromatin.**

As in Arabidopsis²⁶, the CHH methylation in distal TEs is primarily due to NRPD1 and KYP in pericentric regions (Supplementary Fig. 9a,b). At CHG NRPD1 has little effect (Supplementary Fig. 9c) but both KYP and CMT3a are influential at both pericentric and distal TEs (Fig. 2b). In the pericentric regions, however, the patterns of CMT3a- and KYP-mediated DNA methylation were not specific for TEs (Fig. 2b). There was a high level of CHG methylation and strong CHG hypomethylation both within these elements and also in the flanking DNA (Fig. 2b). The lack of specificity of cmt3a and kyp CHG hypomethylation in the pericentric region is reinforced by the strong CHG hypomethylation pattern at the transcribed and flanking region of protein-coding genes (Fig. 2b). This pattern is also consistent with the preferential pericentric distribution of upregulated genes in cmt3a and kyp (Supplementary Fig. 10).

In the distal chromatin, however, the TEs were specific foci of CHG and CHH methylation (Fig. 2a, b). NRPD1 mediates CHG and CHH methylation (non-CG methylation) and CMT3a and KYP mediates CHG methylation in the distal chromosomal domains (Fig. 2 and Supplementary Fig. 9) and the cmt3a and kyp CHG DMRs were less dense than in the pericentric region (Supplementary Fig. 5d, e). There were higher levels of CHG and CHH DNA methylation in the distal TEs than their flanking DNA and, with Gypsy, Copia and TIR class 2 type elements, there was a proximity effect: the CHG methylation in the flanking DNA decreased with increasing distance from the transposon (Fig. 2b). In all TEs there was CHG (or non-CG for nrpd1) hypomethylation in the mutants (Fig. 2a, b). In contrast there was very little differential DNA methylation in protein-coding genes in the distal chromatin (Fig. 2a, b). From these findings we conclude that there is a high degree of specificity of CMT3a, KYP and NRPD1 for transposons in the distal but not in the pericentric regions of tomato chromosomes (Fig. 2a, b).

CMT3a effects anti-correlate RdDM at distal TEs

To investigate whether CMT3a-mediated CHG methylation coincides with NRPD1-mediated non-CG methylation in the distal chromatin (Fig. 2), we plotted the change in non-CG methylation due to nrpd1 vs the change in cmt3a-mediated CHG methylation of each TE in each TE family (Fig. 3a and Supplementary Fig. 11a). With most element types (except LINE) there was an inverse relationship of non-CG hypomethylation in nrpd1 with CHG hypomethylation in cmt3a (Fig. 3a and Supplementary Fig. 11a). This inverse relationship was most pronounced with the Gypsy, Copia and TIR elements (Fig. 3a and Supplementary Fig. 11a). The intact LTRs identified by LTRpred^27,28 also show a similar negative correlation between non-CG hypomethylation in nrpd1 and CHG hypomethylation in cmt3a (Fig. 3a).

**Fig. 3: CMT3a-mediated CHG methylation anti-correlates NRPD1-mediated non-CG methylation in distal LTR TEs.**

Not surprisingly, given the biochemistry of plant epigenetics^2,3, the distal nrpd1-hypomethylated elements were less associated with H3K9me2 and more associated with small RNA than the targets of CMT3a (Fig. 3b, c and Supplementary Fig. 11b,c). These trends were similar for distal Gypsy, Copia, TIR and for intact LTRs. With element size, however, there were opposite trends in distal Gypsy, Copia and TIR vs intact LTRs. With Gypsy, Copia and TIR the nrpd1-affected elements tended to be smaller (Fig. 3d and Supplementary Fig. 11d), as reported previously^24,26. However, with the intact LTRs, the CMT3a targets were smaller (Fig. 3d).

To explain these data with the intact LTRs we invoke TE age. The intact LTRs targeted by RdDM tend to be very large (5–10 kb and above) which could reflect secondary transposition events into existing elements²⁹. However, the smaller intact elements of approximately 5 kb may be close homologs of functional transposons that have recently mobilized. According to this age-based hypothesis the NRPD1 targeted intact LTRs in distal chromatin would be older than those affected by CMT3a (Fig. 3d).

Preferential CHG methylation of younger LTRs by CMT3a

Three other features of the intact LTRs are consistent with this proposed age effect. First there was a positive correlation of the cmt3a hypoCHG methylation with 5′ and 3′LTR similarity that is a good indicator of LTR age³⁰ (Fig. 4a). Second there is a correlation of cmt3a hypoCHG methylation with the presence of protein domains characteristic of transposition function (Fig. 4b). These protein domains are likely to be lost with transposon age. The third feature is in the flanking sequences of the intact LTRs that are more similar in Solanum pennelli and Solanum lycopersicum with the less cmt3a hypoCHG methylated elements (Fig. 4c, d). Conservation of flanking sequence is an indicator of older transposition events that occurred before the divergence of these two species.

**Fig. 4: CMT3a-targeted intact LTRs are evolutionarily younger elements.**

This pattern of CMT-dependent silencing of younger LTRs in distal chromatin is characteristic of the Rider family of LTR retrotransposons in tomato²⁸ (Supplementary Fig. 12) and it features with Arabidopsis LTRs: intact LTRs with CMT-dependent CHG methylation in Arabidopsis are larger and also have more similar LTRs than other elements (Supplementary Figs. 13 and 14 and Fig. 4e). CMT3a-mediated CHG methylation plays silencing roles on these young intact LTRs because the mRNA levels of these LTRs were upregulated in cmt3a than in WT (Supplementary Fig. 15).

Discussion

Post-transcriptional gene silencing and RdDM had been previously implicated in retrotransposon silencing during the phases of retrotransposition and copy number increase³¹ (Fig. 5a, b). The RdDM elements are then thought to progress to RNA-independent silencing in which the CMT and MET1 methyltransferases would be involved^3,32 (Fig. 5c). We interpret the KYP/CMT3a-dependent silencing of intact LTR elements (Figs. 3 and 4) as corresponding to that RNA-independent stage.

To account for our observations on H3K9me2, sRNA, transposon size and transposon age (Figs. 3 and 4) we propose that these elements would later undergo rearrangements and diminished transposon functionality. These rearranged elements would still be silenced by KYP/CMT3a (Fig. 5c) but, over time in the oldest elements, we envision that KYP/CMT3a-mediated CHG methylation would decrease and transcriptional silencing would be relaxed (Fig. 5d) so that RdDM would be re-established (Fig. 3).

According to this model there are separate phases of RdDM at two stages in the life and death cycle of retrotransposons. The first RdDM phase (Fig. 5b) would be short lived and observable only first few generations following invasion of a genome by a new transposon by introgression or horizontal gene transfer. This stage has been observed experimentally when the RNA-independent silencing of TEs has been released by mutation of MET1, DDM1, or CMT3^29,33,34. It is likely that there are very few intact elements in wild type genomes undergoing this primary RdDM because RdDM mutants of Arabidopsis do not exhibit transposon mobilization unless other stresses are applied that might release CMT- and MET1-mediated silencing³⁵.

A possible function of the sRNAs from the older elements in the second phase of RdDM is similar to that of piRNAs in animal systems: they would be a reservoir of trans-acting silencing factors that could protect against re-invasion of the genome by TEs with similar sequence identity³⁶. They could also act in trans within the genome to influence the gene expression profile and the elements could also act in cis to affect expression of adjacent genes. The younger elements, in contrast, with CMT-dependent silencing would only have an effect in cis on adjacent genes. Clearly the progression between different modes of LTR silencing will have a profound influence of the ways that these TEs are controlling elements of gene expression in the host genome.

Methods

Plasmid construction

The CRISPR/Cas9 constructs were made following the Golden Gate cloning strategy³⁷. A pair of sgRNAs were designed to target KYP, CMT3a, or CMT3b and amplified using pICH86966:AtU6p:sgRNA_PDS construct as a template. PCR products containing each sgRNA, together with pICSL01009:AtU6p providing the Arabidopsis U6 promoter, were cloned into level 1 constructs pICH47751 and pICH47761 respectively using BsaI (Thermo Fisher) and T4 DNA ligase (NEB). Together with other level 1 constructs (pICH47732:NOSp:NPTII, pICH47742:35S:Cas9) and the linker pICH41766, level 1 sgRNA constructs were assembled into the level 2 vector pAGM4723 using BpiI (Thermo Fisher) and T4 DNA ligase (NEB).

Plant materials

All tomato plants used in this study are Solanum lycopersicum cv M82. A. tumefaciens strain AGL1 containing the CRISPR/Cas9 construct was used to transform tomato. In brief, tomato seeds were surfaced sterilized in 70% ethanol for 2 min followed by 2.2% sodium hypochlorite for 15 min and rinsed in sterilized water five times. The sterilized seeds were transferred to ½ strength Murashige-Skoog (MS) medium with vitamins, 0.8% agar, 1.5% sucrose. Cotyledons from 7-day-old plants were cut in two and submerged in a solution of Agrobacterium in MS liquid medium with 3% sucrose at OD600 = 0.5. The explants were then dried on filter paper and placed on a plate (1X MS medium, 0.6% agar, 0.5 mg/L 2,4-D) under dim light. After two days cultivation, the cotyledon segments were transferred to a selective regeneration medium (1X MS medium, 1X Nitsch vitamins, 0.1 g/L Myo inositol, 2% sucrose, 0.4% agar, 320 mg/L Timentin, 25 mg/L Cefotaxime, 2 mg/L Zeatin riboside, 100 mg/L Kanamycin). The later appearing shoots were transferred to a selective rooting medium (1X MS medium with vitamins, 2% sucrose, 2.25% gelrite, 320 mg/L Timentin, 50 mg/L Kanamycin). Regenerates with shoots and roots were then transferred into soil and genotyped. Primers for sgRNA amplification and mutant genotyping were listed in Supplementary Data 3.

Bisulfite-seq

Genomic DNA was extracted from 100 mg of leaf tissue using Dneasy Plant Mini Kit (Qiagen). Bisulfite-seq library preparation was performed with three biological replicates for each genotype. 1 µg of genomic DNA was sonicated using the E220 Covaris instrument (Covaris Inc., USA) with parameters of incident power = 140 W, duty factor = 10%, cycles/burst = 200, treat time = 120 s. After being purified on XP beads (Ampure, ratio 1.8×), fragmented DNA was end-repaired and A-tailed using T4 DNA polymerase and Klenow Fragment (NEB) and purified again using XP beads (ratio 1.8×). Methylated Illumina Y-shaped adapters for paired-end sequencing were ligated using Quick-Stick ligase (Bioline). Purified (ratio 1×) adapter-ligated DNA was bisulfite-converted using the EZ DNA Methylation-Gold Kit (Zymo research). DNA was barcoded using 15 cycles of PCR amplification with KAPA HiFi HotStart Uracil+Ready Mix (KAPA Biosystems) with forward universal primer and reverse index primers. Pooled libraries were sequenced to a depth of about 12X genome coverage on a NextSeq 500 150PE (Illumina).

Methylation analysis

Bisulfite-seq data generated in this study and Arabidopsis bisulfite-seq data generated by Stroud et al²⁴ were used for further analysis. Tomato sequences from bisulfite-seq were trimmed and filtered by Trim Galore! with default parameters, and then mapped onto the tomato genome (Heinz SL3.0) using Bismark v0.15.0³⁸ with option -N 1. Reads were deduplicated with deduplicate_bismark and methylation calls were extracted using bismark_methylation_extractor with option -ignore_r2 2. Raw data of Stroud et al²⁴ (WT and cmt3) were downloaded from NCBI and processed following the same analyzing pipeline as tomato sequences, with exception of mapping onto the Arabidopsis genome (TAIR10). Differentially Methylated Region (DMR) analysis were performed with segmentSeq v3.8³⁹. DMRs were called for each context separately, with cutoffs of width > 100 bp, FDR < 0.01 and likelihood > 0.99.

Average methylation profiles over genes and TEs were calculated from the cytosine reports with segmentSeq v3.8³⁹ using the function of averageProfiles³⁹. Annotations of tomato coding genes and TEs are based on SOL ITAG3.2. Arabidopsis and tomato intact LTRs were predicted by LTRpred (https://github.com/HajkD/LTRpred)^27,28.

Methylation results of each TE were calculated from the cytosine reports with segmentSeq v3.8³⁹ by averaging the methylation data of all cytosines of the same context within each TE.

sRNA-seq

Total RNA was extracted by following the standard protocol of TRIzol method. 10 µg total RNA was run on a 15% PAGE/7 M Urea gel (Bio-Rad) and the sRNA fraction (18–25 nt) was excised and eluted from the gel, which was subsequently cloned using the NEBNext multiplex small RNA library prep kit (NEB). Libraries were indexed during the PCR step with 12 cycles and gel size-selected and purified. Four biological replicates of libraries for each genotype were constructed. Pooled libraries were sequenced on a NextSeq 500 (Illumina). Sequencing reads were trimmed using Trim Galore! with default parameters and then mapped to the Heinz genome SL3.0 using Bowtie 1.1.1⁴⁰ with specified parameters of -m 1 and -v 0 for unique mapping and no mismatch allowed respectively. The output sam files were converted to bam files by Samtools⁴¹. The uniquely mapped bam files were used for RPKM analysis of sRNA.

RNA-seq

Three micrograms of total RNA was purified using Ribo-Zero rRNA Removal Kit (Illumina). 250 ng purified RNA was used for RNA-seq library constructions following the manufacturer’s protocol of ScriptSeq RNA-Seq Library Kit (Illumina). RNA-seq libraries of three biological replicates for each genotype were sequenced on Illumina NextSeq 500. Sequencing reads were trimmed using Trim Galore! with default parameters and then mapped to the Heinz genome SL3.0 using Tophat v2.1.0⁴² with specified parameters of −N 0 and −r 330. The output sam files were converted to bam files by Samtools⁴¹ which were used for RPKM analysis.Differentially expressed genes were analyzed by DESeq2⁴³ with cutoff of p < 0.01.

ChIP-seq and distal/pericentric chromosome

ChIP-seq was performed following standard protocol⁴⁴. Half gram of new leaves was ground in liquid nitrogen and in vitro cross-linked. The extracts were filtered through Miracloth and centrifuged for 20 min at 4000 rpm at 4 °C. The nuclei pellet was washed by extraction buffer (0.25 M sucrose, 10 mM Tris-HCl pH8.0, 10 mM MgCl₂, 1 mM EDTA, 1% Triton X-100, 0.1 mM PMSF, 1 µM pepstatin, 1X protein inhibitor cocktails, 5 mM β-mercaptoethanol) and re-suspended in 0.6 ml of nuclear lysis buffer (50 mM Tris-HCl pH 8.0, 10 mM EDTA, 1% SDS, 0.1 mM PMSF, 1 µM pepstatin, 1X protein inhibitor cocktails) and sonicated using Bioruptor for 15 min (high, 30 s ON/30 s OFF). The lysate was centrifuged at 13,200 rpm for 10 min at 4 °C and the supernatant was kept as chromatin samples.

Antibodies of H3K9me2 (Abcam ab1220) and H3K9ac (Millipore 07-352) were used to perform ChIP on chromatin extracts. Libraries were constructed following manufacturer’s protocol of TruSeq Library Prep Kit (Illumina) and sequenced on Illumina NextSeq 500. Sequencing reads were trimmed using Trim Galore! with default parameters and then mapped to the Heinz genome SL3.0 using Bowtie2⁴⁰. The output sam files were converted to bam files by Samtools⁴¹ which were used for RPKM analysis.

Normalized H3K9me2 and H3K9ac across each 100 kb window were calculated based on the ChIP-seq mapping files. Ratio of H3K9me2/H3K9ac was used to define distal/pericentric chromatin: H3K9me2/H3K9ac < 0.6, distal; H3K9me2/H3K9ac > 1.2, pericentric. Arabidopsis H3K9me2 ChIP-seq data⁴⁵ was downloaded and processed following the same analysing pipeline as tomato sequences, with exception of mapping onto the Arabidopsis genome (TAIR10). The normalized H3K9me2 in each 100 kb window was calculated as the following formula:

H3K9me2 normalized level = Number of reads in each 100 kb window / (Number of total mapped reads/Number of 100 kb window in Arabidopsis genome)

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All raw sequencing data generated in this study have been deposited into in the NCBI Sequence Read Archive (SRA) under accession number PRJNA516166. The source data underlying Figs. 1a, 1c–e, 2a–b, 3a–d, 4a–c, 4e and Supplementary Figs 1c, 2e, 3a–c, 4a–c, 5d–e, 6, 7, 8, 9a–c, 10, 11a–d, 12, 13, 14, 15 are provided as Source Data file.

References

Bird, A. Perceptions of epigenetics. Nature 447, 396–398 (2007).
Article ADS CAS PubMed Google Scholar
Law, J. A. & Jacobsen, S. E. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat. Rev. Genet. 11, 204–220 (2011).
Article CAS Google Scholar
Kim, M. Y. & Zilberman, D. DNA methylation as a system of plant genomic immunity. Trends Plant Sci. 19, 320–326 (2014).
Article CAS PubMed Google Scholar
The Tomato Genome Consortium. The tomato genome sequence provides insights into fleshy fruit evolution. Nature 485, 635–641 (2012).
Article ADS CAS Google Scholar
Jouffroy, O., Saha, S., Mueller, L., Quesneville, H. & Maumus, F. Comprehensive repeatome annotation reveals strong potential impact of repetitive elements on tomato ripening. BMC Genomics 17, 624 (2016).
Article PubMed PubMed Central CAS Google Scholar
Kaul, S. et al. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796–815 (2000).
Article CAS Google Scholar
Saze, H., Scheid, O. M. & Paszkowski, J. Maintenance of CpG methylation is essential for epigenetic inheritance during plant gametogenesis. Nat. Genet. 34, 65–69 (2003).
Article CAS PubMed Google Scholar
Kankel, M. W. et al. Arabidopsis MET1 cytosine methyltransferase mutants. Genetics 163, 1109–1122 (2003).
CAS PubMed PubMed Central Google Scholar
Reinders, J. et al. Compromised stability of DNA methylation and transposon immobilization in mosaic Arabidopsis epigenomes. Genes Dev. 23, 939–950 (2009).
Article CAS PubMed PubMed Central Google Scholar
Chan, S. W. L. et al. RNAi, DRD1, and histone methylation actively target developmentally important Non-CG DNA methylation in Arabidopsis. PLoS Genet. 2, e83 (2006).
Article PubMed PubMed Central CAS Google Scholar
Cokus, S. J. et al. Shotgun bisulphite sequencing of the Arabidopsis genome reveals DNA methylation patterning. Nature 452, 215–219 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Lister, R. et al. Highly integrated single-base resolution maps of the epigenome in Arabidopsis. Cell 133, 523–536 (2008).
Article CAS PubMed Central PubMed Google Scholar
Lindroth, A. M. et al. Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation. Science 292, 2077–2080 (2001).
Article CAS PubMed Google Scholar
Jackson, J. P., Lindroth, A. M., Cao, X. & Jacobsen, S. E. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature 416, 556–560 (2002).
Article ADS CAS PubMed Google Scholar
Gouil, Q. & Baulcombe, D. C. DNA methylation signatures of the plant chromomethyltransferases. PLoS Genet. 12, e1006526 (2016).
Article PubMed PubMed Central CAS Google Scholar
Zhong, S. et al. Single-base resolution methylomes of tomato fruit development reveal epigenome modifications associated with ripening. Nat. Biotechnol. 31, 154–159 (2013).
Article CAS PubMed Google Scholar
Corem, S. et al. Redistribution of CHH methylation and small interfering RNAs across the genome of tomato ddm1 mutants. Plant Cell 30, 1628–1644 (2018).
Article CAS PubMed Central PubMed Google Scholar
Yang, Y. et al. Critical function of DNA methyltransferase 1 in tomato development and regulation of the DNA methylome and transcriptome. J. Integr. Plant Biol. 61, 1224–1242 (2019).
Article CAS PubMed Google Scholar
Du, J. et al. Mechanism of DNA methylation-directed histone methylation by KRYPTONITE. Mol. Cell 55, 495–504 (2014).
Article CAS PubMed PubMed Central Google Scholar
Du, J. et al. Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151, 167–180 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cao, D. et al. Genome-wide identification of cytosine-5 DNA methyltransferases and demethylases in Solanum lycopersicum. Gene 550, 230–237 (2014).
Article CAS PubMed Google Scholar
Bewick, A. J. et al. The evolution of CHROMOMETHYLASES and gene body DNA methylation in plants. Genome Biol. 18, 65 (2017).
Article PubMed PubMed Central CAS Google Scholar
Aiese Cigliano, R. et al. Genome-wide analysis of histone modifiers in tomato: gaining an insight into their developmental roles. BMC Genomics 14, 57 (2013).
Article PubMed PubMed Central CAS Google Scholar
Stroud, H., Greenberg, M. V. C., Feng, S., Bernatavichute, Y. V. & Jacobsen, S. E. Comprehensive analysis of silencing mutants reveals complex regulation of the Arabidopsis methylome. Cell 152, 352–364 (2013).
Article CAS PubMed PubMed Central Google Scholar
Mehra, M., Gangwar, I. & Shankar, R. A deluge of complex repeats: the Solanum genome. PLoS ONE 10, e0133962 (2015).
Article PubMed PubMed Central CAS Google Scholar
Zemach, A. et al. The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell 153, 193–205 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cho, J. et al. Sensitive detection of pre-integration intermediates of long terminal repeat retrotransposons in crop plants. Nat. Plants 5, 26–33 (2019).
Article CAS PubMed Google Scholar
Benoit, M. et al. Environmental and epigenetic regulation of Rider retrotransposons in tomato. PLoS Genet. 15, e1008370 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lippman, Z. et al. Role of transposable elements in heterochromatin and epigenetic control. Nature 430, 471–476 (2004).
Article ADS CAS PubMed Google Scholar
Wicker, T. & Keller, B. Genome-wide comparative analysis of copia retrotransposons in Triticeae, rice, and Arabidopsis reveals conserved ancient evolutionary lineages and distinct dynamics of individual copia families. Genome Res. 17, 1072–1081 (2007).
Article CAS PubMed PubMed Central Google Scholar
Marí-Ordóñez, A. et al. Reconstructing de novo silencing of an active plant retrotransposon. Nat. Genet. 45, 1029–1039 (2013).
Article CAS PubMed Google Scholar
Bond, D. M. & Baulcombe, D. C. Small RNAs and heritable epigenetic variation in plants. Trends Cell Biol. 24, 100–107 (2014).
Article CAS PubMed Google Scholar
Mirouze, M. et al. Selective epigenetic control of retrotransposition in Arabidopsis. Nature 461, 427–430 (2009).
Article ADS CAS PubMed Google Scholar
Kato, M., Miura, A., Bender, J., Jacobsen, S. E. & Kakutani, T. Role of CG and non-CG methylation in immobilization of transposons in Arabidopsis. Curr. Biol. 13, 421–426 (2003).
Article CAS PubMed Google Scholar
Ito, H. et al. An siRNA pathway prevents transgenerational retrotransposition in plants subjected to stress. Nature 472, 115–119 (2011).
Article ADS CAS PubMed Google Scholar
Czech, B. et al. piRNA-guided genome defense: from biogenesis to silencing. Annu. Rev. Genet. 52, 131–157 (2018).
Article CAS PubMed Google Scholar
Weber, E., Engler, C., Gruetzner, R., Werner, S. & Marillonnet, S. A modular cloning system for standardized assembly of multigene constructs. PLoS ONE 6, e16765 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Krueger, F. & Andrews, S. R. Bismark: a flexible aligner and methylation caller for Bisulfite-Seq applications. Bioinformatics 27, 1571–1572 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hardcastle, T. J. Methods for discovering genomic loci exhibiting complex patterns of differential methylation. BMC Bioinform. 18, 416 (2017).
Article CAS Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Article CAS PubMed PubMed Central Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 1–21 (2014).
Article CAS Google Scholar
Choi, K. et al. Nucleosomes and DNA methylation shape meiotic DSB frequency in Arabidopsis thaliana transposons and gene regulatory regions. Genome Res. 28, 1–15 (2018).
Article CAS Google Scholar
Stroud, H. et al. Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat. Struct. Mol. Biol. 21, 64–72 (2014).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to Dr. Thomas J. Hardcastle for help with bioinformatics and Dr. Nik Cunniffe for help with data analysis. We thank Dr. Hajk-Georg Drost for assistance with LTRpred and communicating unpublished results. We thank Drs. Julie Ahringer and Alex Appert for help with the ChIP-seq experiments. We thank James Barlow, For Fan Chan, Shiyu He, Mel Steer, Shuoya Tang and Wing Hin Yip for technical assistance. We thank Drs. Ian Henderson, Sara Lopez-Gomollon, Claudia dos santos Martinho, Sebastian Müller and Nataliya E. Yelina for helpful comments on the manuscript. This work was supported by European Research Council Advanced Investigator grant ERC-2013-AdG 340642 (Transgressive Inheritance in plant Breeding and Evolution [TRIBE]), the Royal Society (RP170001), the Balzan Foundation and BBSRC grant BB/R018529/1. D.C.B. is the Royal Society Edward Penley Abraham Research Professor.

Author information

Authors and Affiliations

Department of Plant Sciences, University of Cambridge, Cambridge, CB2 3EA, UK
Zhengming Wang & David C. Baulcombe

Authors

Zhengming Wang
View author publications
You can also search for this author in PubMed Google Scholar
David C. Baulcombe
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.W. and D.C.B. designed the experiments. Z.W. performed the experiments. Z.W. and D.C.B. analyzed the data. Z.W. and D.C.B. wrote the paper.

Corresponding author

Correspondence to David C. Baulcombe.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1

Supplementary Data 2

Supplementary Data 3

Reporting Summary

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, Z., Baulcombe, D.C. Transposon age and non-CG methylation. Nat Commun 11, 1221 (2020). https://doi.org/10.1038/s41467-020-14995-6

Download citation

Received: 28 January 2020
Accepted: 11 February 2020
Published: 06 March 2020
DOI: https://doi.org/10.1038/s41467-020-14995-6

This article is cited by

Contribution of RdDM to the ecotype-specific differential methylation on conserved as well as highly variable regions between Arabidopsis ecotypes
- Jaehoon Lee
- Sang-Yoon Shin
- Yeonhee Choi
BMC Genomics (2023)
Multi-omic analyses reveal the unique properties of chia (Salvia hispanica) seed metabolism
- Gerardo Alejo-Jacuinde
- Héctor-Rogelio Nájera-González
- Luis Herrera-Estrella
Communications Biology (2023)
Re-evaluating evidence for adaptive mutation rate variation
- Long Wang
- Alexander T. Ho
- Sihai Yang
Nature (2023)
Technology-driven approaches for meiosis research in tomato and wild relatives
- Sander A. Peters
- Charles J. Underwood
Plant Reproduction (2023)
Interspecific hybridization in tomato influences endogenous viral sRNAs and alters gene expression
- Sara Lopez-Gomollon
- Sebastian Y. Müller
- David C. Baulcombe
Genome Biology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.