Structural basis of asymmetric DNA methylation and ATP-triggered long-range diffusion by EcoP15I

Gupta, Yogesh K.; Chan, Siu-Hong; Xu, Shuang-yong; Aggarwal, Aneel K.

doi:10.1038/ncomms8363

Download PDF

Article
Open access
Published: 12 June 2015

Structural basis of asymmetric DNA methylation and ATP-triggered long-range diffusion by EcoP15I

Yogesh K. Gupta ORCID: orcid.org/0000-0001-6372-5007¹,
Siu-Hong Chan²,
Shuang-yong Xu² &
…
Aneel K. Aggarwal¹

Nature Communications volume 6, Article number: 7363 (2015) Cite this article

5097 Accesses
37 Citations
30 Altmetric
Metrics details

Subjects

Abstract

Type III R–M enzymes were identified >40 years ago and yet there is no structural information on these multisubunit enzymes. Here we report the structure of a Type III R–M system, consisting of the entire EcoP15I complex (Mod₂Res₁) bound to DNA. The structure suggests how ATP hydrolysis is coupled to long-range diffusion of a helicase on DNA, and how a dimeric methyltransferase functions to methylate only one of the two DNA strands. We show that the EcoP15I motor domains are specifically adapted to bind double-stranded DNA and to facilitate DNA sliding via a novel ‘Pin’ domain. We also uncover unexpected ‘division of labour’, where one Mod subunit recognizes DNA, while the other Mod subunit methylates the target adenine—a mechanism that may extend to adenine N6 RNA methylation in mammalian cells. Together the structure sheds new light on the mechanisms of both helicases and methyltransferases in DNA and RNA metabolism.

Extended DNA threading through a dual-engine motor module of the activating signal co-integrator 1 complex

Article Open access 05 April 2023

Short-range translocation by a restriction enzyme motor triggers diffusion along DNA

Article Open access 02 January 2024

DNA translocation mechanism of the MCM complex and implications for replication initiation

Article Open access 15 July 2019

Introduction

We provide here a basis for two prevailing questions in the study of DNA helicases and DNA methyltransferases. First, how is ATP hydrolysis coupled to long-range diffusion of a helicase on DNA? And second, how does a dimeric methyltransferase methylate one DNA strand but not the other? Both of these activities are embodied in the Type III restriction–modification (R–M) systems in bacteria and archaea, which play roles analogous to the innate immune system in higher eukaryotes^1,2,3,4. R–M systems are categorized in four groups (Type I–IV) based on their subunit assembly, cofactor requirements and associated cleavage patterns⁵.

EcoP15I is a prototype of the Type III R–M family and consists of two methylation (Mod) and one (or two) restriction (Res) subunits⁶, resulting in a Mod₂Res₁ or Mod₂Res₂ complex^7,8,9,10. The Mod subunits are responsible for DNA recognition and methylation, while the Res subunits are responsible for ATP hydrolysis and cleavage. Cleavage only occurs if the two recognition sites (CAGCAG) are in an inverted-repeat orientation, arranged either as ‘head-to-head’ or ‘tail-to-tail’^11,12. The sites can be separated by thousands of base pairs but ATP hydrolysis is absolutely required for cleavage⁸.

According to classical helicase mechanisms, motion of EcoP15I should require one ATP per base pair moved on the DNA. However, even though EcoP15I contains the standard helicase motifs found in members of superfamily 2 (SF2) helicases and translocases^13,14, the enzyme communicates over thousands of base pairs by consuming only a few ATPs^15,16. A number of competing models have been proposed for this long-range communication, including classical translocation and three-dimensional DNA looping^17,18. More recently, single-molecule and ensemble fluorescence measurements have shown that EcoP15I undergoes random one-dimensional diffusion along the DNA^16,19. The diffusion coefficient is one of the largest yet measured for a DNA sliding process (D=0.92±0.06 μm² s⁻¹)²⁰, with DNA cleavage only occurring when the freely diffusive EcoP15I collides with a stationary enzyme bound to the second DNA site¹⁹ (Supplementary Fig. 1). Together, these results posit a new functionality for helicases, as molecular switches for long-lived DNA sliding (rather than conventional DNA/RNA unwinding or stepwise translocation)²¹. EcoP15I hydrolyses ∼30 ATP molecules in two steps (a fast consumption of ∼10 ATP molecules followed by a slower consumption of ∼20 ATP molecules), which switches the enzyme into a distinct structural state that can diffuse on DNA over long distances¹⁹. A similar sliding-based mechanism has been proposed for the mismatch repair protein MutS (and its eukaryotic homologue), where nucleotide exchange (rather than hydrolysis) triggers sliding on DNA after mismatch recognition^22,23,24. Altogether, ATP-triggered sliding is an emerging theme in helicase-like enzymes, but questions about the mechanism remain unanswered due to the lack of structural data.

Intriguingly, the EcoP15I Mod subunit is also unusual in that it functions as a dimer²⁵ as compared to monomeric methyltransferases^26,27. In this respect, it is similar to several DNA methyltransferases in mammals (Dnmt3a/3b/3L) and plants (Domains rearranged methyltransferase 2, DRM2) that also function as dimers or other higher order oligomers in processes ranging from de novo DNA methylation to RNA-directed DNA methylation^28,29. Similar to DRM2, for example, EcoP15I methylates one DNA strand but not the other. The EcoP15I mode of action also extends to the methylation of adenine N6 in mammalian mRNAs, mediated by a heterodimer of methyltransferase-like 3 and 14 (METTL3 and METTL14), which has been shown to be critical for cellular homeostasis and stem cell commitment and differentiation^30,31. Like the Mod subunit, METTL3 and METTL14 belong to the β class of amino methyltransferases and may operate in a similar dimer mode as EcoP15I. Despite the availability of several crystal structures of dimeric DNA methyltransferases (all in DNA-free form), the structural basis for asymmetric DNA (and RNA) methylation has remained mysterious.

Although Type III R–M enzymes were identified >40 years ago^1,2,3,4,32, there is still no structural information on these multisubunit enzymes that encompass DNA methylation, DNA translocation and DNA cleavage activities all within the same complex. We report here the structure of the entire EcoP15I complex (Mod₂Res₁) bound to its DNA substrate. The structure is only the second (after S. solfataricus RAD54) of an SF2 helicase with a duplex DNA bound to the motor domains³³. As such, it provides new insights into the mechanisms of DNA translocation and the nature of the conformational change that switches EcoP15I into a long-lived sliding machine. The structure is also the first, to our knowledge, of a dimeric DNA methyltransferase bound to DNA. It reveals a remarkable division of labour, where one Mod subunit recognizes the DNA, while the other Mod subunit methylates the target adenine base. Together these structural features shed new light on the diversity of helicases and methyltransferases in DNA and RNA metabolism.

Results

Structure determination

The EcoP15I holoenzyme was co-crystallized with a 20-mer DNA duplex containing a single EcoP15I recognition site (CAGCAG). The best co-crystals were obtained in the presence of AMP and diffracted to ∼2.6 Å resolution with synchrotron radiation. The co-crystals belong to space group P4₁2₁2 with unit cell dimensions of a=b=101 Å, c=533 Å and α=β=γ=90^o and contain one EcoP15I/DNA/AMP complex in the crystallographic asymmetric unit (Table 1). The structure was determined by the multiple isomorphous replacement with anomalous scattering (MIRAS) method and refined to 2.6 Å resolution (Table 1). The refined model consists of two Mod subunits (ModA, residues 13–644; ModB, residues 2–644), one Res subunit (residues 7–810), 20-mer DNA (nucleotides 1–20 on each strand), one AMP molecule, 3 ions and 103 solvent molecules. Regions of protein with no electron density were omitted, and amino acids with weak side chain densities were modelled as alanines (Supplementary Table 1). The current model lacks the endonuclease portion of the Res subunit due to the lack of electron density for this region.

Table 1 Data collection, phasing and refinement statistics.

Full size table

Overall architecture

The EcoP15I Mod₂Res₁ heterotrimer embraces the DNA duplex and makes extensive protein–DNA contacts. The Mod subunits engage the upstream portion of the DNA duplex that contains the CAGCAG recognition sequence (Fig. 1). The target adenine (CAGCAG) rotates out of the DNA helix and enters the catalytic pocket of one of the two Mods, ModB. The other Mod, ModA, makes the majority of base-specific contacts with the CAGCAG recognition sequence. The Res subunit interacts with the downstream portion of the DNA, approximately one half-turn away from recognition site (Fig. 1). Only the helicase core of the Res subunit is visible in the electron density map; the cleavage domain is disordered and may only become ordered when the enzyme collides with another EcoP15I complex and becomes cleavage competent. AMP lies in a cleft in the helicase core.

**Figure 1: Overall structure of EcoP15I/DNA/AMP complex.**

Each Mod subunit is composed of four domains, an amino-terminal domain (NTD, aa 14–60), a central methyltransferase domain (MTase, aa 62–262, 390–516), a target recognition domain (TRD, aa 263–384) and a carboxy-terminal domain (CTD, aa 539–644; Fig. 2a,b, Supplementary Fig. 2). The MTase domain contains nine motifs (I–VIII and X) characteristic of amino methyltransferases³⁴, and forms the ‘hub’ from which the NTD, TRD and CTD fan outwards (Fig. 2a,b). On the basis of the linear order of the motifs, the Mod belongs to the β class of amino methyltransferases³⁴, wherein the TRD is inserted between the N-terminal (IV–VIII) and the C-terminal (X and I–III) motifs (Supplementary Fig. 2). The TRD is split into two lobes, separated by two antiparallel β-strands that act as a hinge (Fig. 2a). The proximal lobe is mainly helical (aa 264–300) and, in ModA, interacts primarily with the DNA backbone. The distal lobe (aa 319–376) extends >40 Å from the MTase domain and contains a number of loops, which track the DNA major groove in ModA but mediate protein–protein interactions in ModB (Fig. 2a and Supplementary Fig. 3a). The NTD is composed of helices that intertwine (from ModA and ModB) to form part of the Mod₂ dimer interface. The dimeric interface is extensive (∼4,000 Å²) and lends to the stability of the Mod₂ dimer and its ability to act as a standalone methyltransferase that asymmetrically methylates the second adenine of its recognition sequence (5′-CAGCAG-3′; Supplementary Fig. 3a)⁶. A superposition of ModA and ModB shows an ∼67° movement in the TRD and ∼122° movement in NTD (Supplementary Fig. 4a), which preclude the binding of a second DNA molecule to the Mod₂ dimer and the binding of a second Res to ModB, respectively. The CTD has a globular α/β substructure that takes on different roles in ModA and ModB. In ModA, the CTD extends towards the Res subunit and makes extensive protein–protein contacts with it, whereas the CTD in ModB is solvent exposed and limited to a few lattice contacts (Fig. 2a, Supplementary Fig. 3b,c and Supplementary Table 2).

**Figure 2: Arrangement of Mod and Res subunits with respect to DNA.**

The helicase core of the Res subunit is composed of tandem RecA-like domains¹⁴ (Supplementary Fig. 4b), N-terminal RecA1 (aa 7–269) and C-terminal RecA2 (aa 366–594), followed by a helical spacer (aa 620–810; Fig. 2b). The spacer connects to the endonuclease domain (disordered in the structure). Each RecA-like domain consists of a central β sheet of six to seven parallel β-strands sandwiched by helices. AMP binds to the ‘bottom’ side of the cleft at the confluence of two domains, while the DNA duplex is accommodated on the ‘top’ side of the cleft (Fig. 2b). The AMP is highly mobile in the structure (B-factor of 139 Å²). The helicase motifs typically associated with ATP binding/hydrolysis, interdomain communication and DNA/RNA binding are located on loops that line the cleft (Fig. 2b). Altogether, RecA1 contains the classical motifs Q, I (or Walker A), Ia, Ib, Ic, II (or Walker B), IIa and III, whereas RecA2 contains the motifs IV, IVa, V and VI (Fig. 2b and Supplementary Fig. 5).

The specificity of helicases and translocases for different substrates is dictated to a large extent by accessory domains derived from ‘inserts’ in RecA1 or RecA2, or from the N- and C-terminal flanking sequences¹⁴. In DNA and RNA helicases, for example, an accessory domain can act as a ‘wedge’ to disrupt base pairing for the unwinding reaction¹⁴. In EcoP15I, we identify three new substructures, namely a loop after motif Ic (‘Ic- extension’; aa 198–211), a β-hairpin-like ‘Q-arm’, formed by an ∼50aa insertion (aa 28–77) in RecA1, and a more elaborate substructure, ‘Pin’ domain, formed by an ∼77aa insertion (aa 288–365) in RecA2 (Supplementary Table 2). The Pin domain adopts a β-sandwich-like tertiary structure with two overlaid β-sheets that extends towards the ModA TRD and interacts with the translocating strand of the DNA duplex (Figs 1 and 2b). The Pin domain is highly mobile (B-factor of 89 Å²), but we could assign the main chain and some of the side chains.

DNA conformation

The DNA is severely distorted from B-form at two sites along its axis (Fig. 2c and Supplementary Fig. 6). First, the site where the target adenine is ejected from the recognition sequence (CAGCAG), and second, near the ModA–Res interface (Fig. 2c and Supplementary Fig. 6). For convenience, we refer to the DNA strand containing the target adenine as the ‘methylating’ strand, and the opposite strand as the ‘translocating’ strand (which makes the majority of contacts with the motor domains—described later; Fig. 2c). The distortions around target adenine and the recognition sequence are mainly induced by the intrusion of ModA TRD in the DNA major groove (Fig. 1). At the ModA–Res junction the DNA is bent ∼24° towards the minor groove, in the direction of the ModA TRD and the Res Pin domain (Figs 1 and 2c). At the site of bending, the torsion angles ɛ (C3′–O3′) and ζ (O3′-P) are gauche⁻, trans rather than more characteristic trans, gauche⁻ conformation found in B-DNA³⁵. Analogous deviations in torsion angles were observed for inner thymines in DNA bound to BglII, where the DNA experiences an overall bend of ∼23° (ref. 36). Most importantly, the ∼24° bend in the EcoP15I DNA reduces the distance between the ModA TRD and the Res Pin domain to <14 Å and may facilitate an interaction between the two domains when EcoP15I assumes a diffusive or sliding state on DNA (Fig. 1).

Division of labour: DNA recognition and methylation

The EcoP15I structure is the first of a β-class of an amino DNA methyltransferase bound to DNA and it suggests a fundamentally different mechanism of methylation. There is not only a division of labour between two Mod subunits in terms of DNA recognition (ModA) and methylation (ModB), but also the methylating subunit (ModB) binds to DNA in a radically different manner from other methyltransferases.

ModA makes the majority of base-specific contacts, via the bilobed TRD that tracks the DNA major groove and interacts with bases over the entire length of the recognition sequence (Figs 2a,c and 3, Supplementary Fig. 7). In contrast, ModB makes only a few contacts to bases and its role is mainly to methylate the target adenine (CAGCAG). The adenine rotates ∼180° out of DNA helix and enters the ModB catalytic cleft (Fig. 2a). In contrast, the ModA catalytic cleft is empty and lies >30 Å away from the DNA (Fig. 2a). Compared with the other amino methyltransferases²⁷, ModB pivots around the aspartate/asparagine of the conserved D/NPPY catalytic sequence (motif IV) by >140° so that the ‘PPY’ sequence runs along the Watson–Crick edge of the extrahelical adenine base rather than the Hoogsteen edge (Fig. 3b and Supplementary Fig. 8), and the conserved tyrosine (PPY) stacks on opposite face of the base (Supplementary Fig. 8c–e). This unusual mode of DNA docking is a consequence of Mod₂ dimerization and interactions with the Res subunit, whereby if ModB were to assume the same orientation as say in the monomeric M.TaqI/DNA complex²⁷ then ModA would not be in the correct position to recognize the CAGCAG sequence. In addition, the ModA TRD would directly clash with the Res subunit bound to downstream portion of the DNA (Supplementary Fig. 8f).

**Figure 3: Interactions with DNA and AMP.**

The RecA motor domains

EcoP15I is only the second SF2 translocase (after ssRAd54) to be crystallized with a duplex DNA bound to the RecA motor domains. As with ssRAd54 (ref. 33), the EcoP15I motor domains interact predominantly with one strand of the DNA duplex—the translocating strand (Figs 1 and 4). However, the motor domains in ssRAd54 adopt an unusual arrangement, in which RecA2 is flipped 180^o with respect to RecA1, limiting the number of possible interactions with the DNA (Supplementary Fig. 9a)³³. The EcoP15I motor domains adopt a more canonical configuration, which is intermediate between the fully ‘closed’ configuration observed in the SF2 RNA helicase VASA/ssDNA/AMPPNP complex³⁷ and the ‘open’ configuration observed in zebrafish Rad54 (zRad54)³⁸ (Fig. 4) This ‘semi-closed’ configuration (16° outward motion of RecA2 when compared with VASA (Supplementary Fig. 9b), for example, appears to represent an intermediate state, following ATP hydrolysis but before AMP dissociation.

**Figure 4: Interactions between the RecA1/RecA2 motor domains and DNA.**

The position of the translocating DNA strand in the EcoP15I structure overlays with the ssRNA in VASA and NS3 RNA helicase structures^37,39, reinforcing the notion that contacts primarily to one DNA strand is a conserved feature in different subfamilies of SF2 helicases and translocases (Supplementary Fig. 9a)¹⁴. Also, as in the VASA complex³⁷, the motif Q in RecA1 and motif VI in RecA2 of EcoP15I interact directly with the adenine base of the bound nucleotide (Fig. 3c). Specifically, Gln14 (motif Q) makes direct hydrogen bonds with the N6 amino group of adenine, while Arg537 (the second arginine of the ‘arginine fingers’ of motif VI) makes hydrogen bonds with N3 of adenine and O4’ of the ribose sugar. In addition, Asp509 (motif V) makes hydrogen bonds with an oxygen of the sugar. One difference is that whereas Arg579 in VASA (the first arginine of the ‘arginine fingers’ of motif VI) makes a direct hydrogen bond with the γ-phosphate of AMPPNP, the equivalent residue in EcoP15I (Arg534) points away from the bound AMP due to the absence of γ-phosphate (Fig. 3c).

Both RecA1 and RecA2 interact with the DNA duplex (Figs 1 and 4). The RecA1 residues Thr116 and Leu117 of motif Ia, Ser151 of motif Ib, and Asn187, Met190, Ser193 and Lys194 of motif Ic interact with successive phosphates on the translocating DNA strand, while residues Lys235, Lys236 and Thr237 on the switch II region interact with the opposite methylating DNA strand (Fig. 4). RecA2 interacts with the more downstream portion of the translocating DNA strand via amino acids Thr503 and Arg505 of motif V. In particular, the Pin domain in RecA2 is involved in a number of hydrogen bonds with DNA via the main chain amides of Glu354 and Lys356 and the side chain of Ser359, as well as hydrophobic contacts via Gly352 and Ile353 (Fig. 4). Altogether, the EcoP15I motor domains are specifically adapted to bind double-stranded (ds) DNA. Importantly, there is no equivalent of a ‘wedge’ to separate DNA strands of the DNA duplex, but instead the Pin domain in RecA2 augments interactions with the backbone of the translocating DNA strand and which is more apt for diffusion on ds DNA (Fig. 4).

Discussion

The multisubunit Type I and Type III enzymes are exceptional in their dependency on ATP for restriction activity. By contrast, the Type II enzymes do not require ATP and majority of them harbour functionally independent R and M subunits with the exception of a few enzymes like Type IIG BpuSI or Type IIL MmeI^40,41. Type IV enzymes only cleave modified DNA substrates. The Type III R–M enzymes have defied structural interpretation for >40 years. We report here the first structure of a Type III R–M system, consisting of the entire EcoP15I complex (Mod₂Res₁) bound to its DNA substrate. The structure provides unprecedented new insights into the molecular underpinnings of asymmetric DNA methylation and ATP-triggered DNA diffusion.

The early structures of DNA methyltransferases with DNA revealed monomeric enzymes with the ability to both recognize and methylate DNA. This feature extended to both cytosine C5 and adenine N6 methyltransferases^26,27. As such, much of the subsequent data on DNA methyltransferases have been interpreted in a context of a monomer, even in cases where they were observed as dimers^42,43,44. All of the current structural information on dimeric DNA methyltransferases is limited to crystal structures in the absence of DNA. The EcoP15I structure provides a mechanistic basis for the action of β amino methyltransferases, which are observed primarily as dimers in solution or in crystals^42,43,44. Indeed, it is conceivable that this entire subfamily of DNA methyltransferases works in the same manner as EcoP15I, wherein one subunit recognizes the DNA while the other subunit methylates the target base. The β-amino methyltransferases differ from methyltransferases in other subfamilies in how the TRD is positioned with respect to the MTase domain³⁴. In monomeric M.HhaI (α class) and M.TaqI (γ class)^26,27, for example, the TRD is adjacent to the active site cleft and in a direction that permits it to enter the DNA major groove next to the flipped target base. By contrast, in EcoP15I, the TRD lies far off from the active site cleft and in a direction that makes it geometrically impossible for a single Mod subunit to both methylate a target base and recognize the DNA sequence; instead it is reliant on the TRD of the second Mod subunit.

Strikingly, this division of labour may also extend to RNA methylation in mammalian cells³⁰. In particular, adenine N6 methylation is the most prevalent modification in the body of nuclear and cytolasmic RNAs in mammals and is implicated in processes ranging from mRNA splicing to translation regulation^30,45,46. Intriguingly, adenine N6 methylation of mRNA has also been shown recently to be critical for stem cell commitment and differentiation^31,47,48. The modification often occurs in the context of a G(G/A)ACU sequence and the enzyme(s) responsible have recently been identified as the METTL3/METTL14 heterodimer^49,50,51. Intriguingly, both METTL3 and METTL14 contain motifs (X, I–VIII) characteristic of amino methyltransferase⁵², including the equivalent of the ‘DPPY’ sequence in motif IV (DPPW in METTL3 and EPPL in METTL14). The linear order of these motifs suggests that both METTL3 and METTL14 belong to the β-amino class of methyltransferases and—based on the EcoP15I structure—it is conceivable that one methyltransferase (METTL3 or METTL14) plays a more dominant role in recognition of the G(G/A)ACU sequence (and perhaps also the surrounding RNA secondary structure), while the other plays a more central role in adenine methylation.

The EcoP15I methylation mechanism may also extend to other subfamilies of DNA methyltransferases, such as the plant de novo DNA methyltransferase DRM2, which functions as a homodimer²⁸. On the basis of the EcoP15I structure, one can envisage a mechanism where one DRM2 monomer recognizes the DNA sequence context (CG/CHG/CHH, where H=A, T, or C) while the other methylates the target cytosine. The dimeric RNA MTases from SPOUT family display another form of division of labour in which the RNA binds in a cleft between the two monomers, whereas the target RNA base for methylation resides in the catalytic pocket of one monomer^53,54,55. Altogether, a division of labour between two or more methyltransferase subunits appears to be a more general mechanism in DNA and RNA methylation. The EcoP15I structure provides a framework for beginning to understand the interplay between different methyltransferase subunits.

As recently as 1993, helicases were considered as DNA- or RNA-unwinding machines that couple ATP hydrolysis for the unwinding reaction⁵⁶. The discovery that many helicases are actually translocases (especially those in the SF2 superfamily) has changed this view¹⁴; however, even this view has been found wanting with the discovery that several helicases or translocases behave as molecular switches²¹. These molecular switches have also been referred to as pseudo-helicases¹⁹, wherein ATP hydrolysis is coupled to a conformational change in the enzyme for thermally driven diffusion on the DNA or RNA. The EcoP15I structure uncovers a helicase motor that is generally similar to that observed in classical helicases and translocases, composed of tandem RecA-like domains with Walker A and B motifs and an arginine finger, among other classical motifs¹⁴.

What is the nature of the ATP-triggered conformational change for diffusion on DNA? The structure here provides some interesting clues. In particular, proximity of the ModA TRD to the Pin domain in RecA2 suggests a model in which the TRD may switch its location from the DNA major groove to the Pin domain and hence, adopt a ‘nonspecific’ conformation more amenable to DNA sliding¹⁹ (Fig. 5). The TRD is joined to the MTase domain by a flexible linker, and a simple rotation of ∼40° about this linker puts the ModA TRD in direct contact with the Pin domain—sequestered away from the DNA (Fig. 5a). Moreover, the DNA duplex is bent by ∼24° at this precise ModA–Res nexus, which reduces the distance between the TRD and the Pin domain to <14 Å. The Pin domain is highly mobile and may only become fully ordered when it recruits the ModA TRD. The asymmetric nature of ModA and ModB DNA binding seems to ensure that only one TRD (and not both) needs to be drawn away from the DNA. Overall, the structural model is in accord with single-molecule studies, which suggest that the entire Mod₂Res₁ complex diffuses on the DNA (and not just the Res subunit) until it collides with another complex to become cleavage competent (Fig. 5b, Supplementary Fig. 1)¹⁹.

In conclusion, we present here the first structure of a Type III R–M system, consisting of the entire EcoP15I complex (Mod₂Res₁) bound to its DNA substrate. Asymmetric methylation and ATP-triggered DNA diffusion are emerging themes in the study of methyltransferases and helicases but the mechanisms remain unclear. Plant DRM2 homodimer²⁸ and the mammalian METTL3/METTL14 heterodimer^{30,45,46,49,50,51}, for example, may operate in a similar manner to EcoP15I, where one monomer recognizes the DNA or RNA sequence context, while the other methylates the target base. Neither DRM2 nor METTL3/METTL14 possesses helicase activity. Furthermore, an EcoP15I type DNA sliding-based mechanism has also been proposed for the mismatch repair protein MutS (and its eukaryotic homologue), but where nucleotide exchange (rather than hydrolysis) triggers sliding on DNA after mismatch recognition^22,23,24. Similarly, the loading of processivity clamps at replication forks by clamp loaders occurs via a two-step conformational change mediated by ATP binding⁵⁷. Altogether, the EcoP15I structure proffers unprecedented new insights into the molecular underpinnings of asymmetric DNA/RNA methylation and ATP-triggered thermal diffusion in broad array of DNA and RNA metabolism.

Methods

Expression and purification

The genes encoding Res and Mod subunits of EcoP15I were subcloned from a plasmid kindly provided by Dr D.N. Rao (Indian Institute of Science) into an expression vector pRRS⁵⁸. E. coli expression host NEB Express (NEB) was transformed and was grown in LB medium containing 100 μg ml⁻¹ of ampicillin. Protein expression was carried out for 18 h at 30 °C. The harvested cells from 6 l of culture were lysed and the derived cell pellet was suspended in a potassium phosphate buffer (20 mM potassium phosphate, pH 7.0, 50 mM NaCl, 5% Glycerol) and sonicated on ice. The lysate was centrifuged at a maximum r.c.f. of 31,000g for 30 min at 4 °C and the supernatant was loaded onto a heparin column. The bound proteins were eluted using a NaCl gradient. Fractions containing EcoP15I activity were pooled and loaded onto a ceramic hydroxylapatite column (Bio-Rad; 7 ml), followed by elution with a potassium phosphate gradient. Fractions containing EcoP15I activity were pooled and loaded onto a cation exchange column and eluted using a NaCl gradient. Peak fractions were pooled, and concentrated using a Vivaspin 15 concentrator (10 KDa MWCO; Sartorius Stedim Biotech) to a final concentration of >10 mg ml⁻¹.

Crystallization

We co-crystallized EcoP15I complex in presence of a 20-mer DNA duplex and AMP. The crystals were obtained in a hanging drop set up by mixing 1 μl of EcoP15I/DNA/AMP complex with 1 μl of precipitant solution containing 10% PEG 5000 monomethyl ether, 0.1 M HEPES pH 7.5, 0.2M potassium acetate and 15 mM MnCl₂ at 20 °C, and were cryoprotected by serial transfer into mother liquor containing 30% PEG 5000 MME and 10% PEG400 before plunging them into liquid N₂. Neither S-adenosyl methionine (AdoMet) nor its analogue AdoHcy was included during purification or crystallization. The crystals belong to the space group P4₁2₁2 with unit cell dimensions of a=b=101 Å, c=533 Å and α=β=γ=90°. X-ray diffraction data were measured at beamlines NECAT-24IDC at Advanced Photon Source (APS), and X4A, X25 and X29 at NSLS of Brookhaven National Laboratory (BNL; Table 1).

Structure determination

To calculate the experimental phases for structure determination, we used X-ray data from native crystals and seven heavy atom derivatives (Se, Br, I, Ta, Sm, Co, Ho). The phases were calculated by the MIRAS method, using the programme SHARP⁵⁹. The bromine and iodine derivatives were prepared by substituting 7 and 8 thymines (outside of the recognition sequence in the 20-mer DNA duplex) to 5-bromouracils and 5-iodouracils, respectively. The Se-Met-labelled protein was expressed using standard method⁶⁰ and purified with similar protocol as the WT enzyme. The tantalum (Ta), cobalt (Co) and holmium (Ho) derivatives were prepared by soaking native crystals into the mother liquor containing 1 mM hexatantalum tetradecabromide (for 22 h), 15 mM cobalt chloride (for 16 h), 2 mM holmium sulfate (16 h), respectively. The samarium (Sm) derivatives were prepared by co-crystallizing the EcoP15I complex in presence of 0.5 mM samarium acetate. The single wavelength anomalous X-ray data were measured at wavelengths close to the absorption K edge for Se (0.9792 Å), Co (1.60 Å) and Br (0.9197 Å) derivatives, and the L-III edge for the Ta (1.255 Å) and Sm (1.849 Å) derivative. X-ray data for the iodine derivative were measured at wavelength of 1.608 Å. All the data were processed using processed using the programme autoProc⁶¹. The MIRAS phases and solvent-flattened maps were calculated using SHARP⁵⁹, and the model was built manually using programme COOT⁶² and refined using programme BUSTER⁶³. Among all heavy atom derivatives, the Se-Met data set gave the best phases (anomalous phasing power ∼1.0 at 5.85 Å). At a later stage, another X-ray data set on the 5-iodouracil containing crystals was measured at a longer wavelength (2.07 Å) on five of such crystals and processed, merged, and scaled using the XDS programme package⁶⁴ (Table 1). These data were used for molecular replacement-single wavelength anomalous diffraction phasing to generate log-likelihood-gradient maps in programme Phaser⁶⁵ in CCP4 that were used at a late stage of model building. These log-likelihood-gradient maps also confirmed the location of heavy atoms S, I and P. The model was improved through iterative cycles of density modification in presence of model, followed by manual rebuilding and refinement (Supplementary Fig. 10). The final model was refined to 2.6 Å resolution with R_free and R_work values of∼26.4% and 21.9%, respectively (Table 1).

Additional information

Accession codes: Atomic coordinates and structure factors have been deposited in the Protein Data Bank under accession codes 4ZCF.

How to cite this article: Gupta, Y. K. et al. Structural basis of asymmetric DNA methylation and ATP-triggered long-range diffusion by EcoP15I. Nat. Commun. 6:7363 doi: 10.1038/ncomms8363 (2015).

Accession codes

Accessions

Protein Data Bank

4ZCF

References

Haberman, A. The bacteriophage P1 restriction endonuclease. J. Mol. Biol. 89, 545–563 (1974).
Article CAS PubMed Google Scholar
Haberman, A., Heywood, J. & Meselson, M. DNA modification methylase activity of Escherichia coli restriction endonucleases K and P. Proc. Natl Acad. Sci. USA 69, 3138–3141 (1972).
Article ADS CAS PubMed PubMed Central Google Scholar
Meselson, M. & Yuan, R. DNA restriction enzyme from E. coli. Nature 217, 1110–1114 (1968).
Article ADS CAS PubMed Google Scholar
Arber, W. & Wauters-Willems, D. Host specificity of DNA produced by Escherichia coli. XII. The two restriction and modification systems of strain 15T. Mol. Gen. Genet. 108, 203–217 (1970).
Article CAS PubMed Google Scholar
Roberts, R. J., Vincze, T., Posfai, J. & Macelis, D. REBASE—a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res. 43, D298–D299 (2015).
Article CAS PubMed Google Scholar
Raghavendra, N. K., Bheemanaik, S. & Rao, D. N. Mechanistic insights into type III restriction enzymes. Front. Biosci. (Landmark Ed.) 17, 1094–1107 (2012).
Article CAS Google Scholar
Gupta, Y. K. et al. Structural insights into the assembly and shape of Type III restriction-modification (R-M) EcoP15I complex by small-angle X-ray scattering. J. Mol. Biol. 420, 261–268 (2012).
Article CAS PubMed Google Scholar
Janscak, P., Sandmeier, U., Szczelkun, M. D. & Bickle, T. A. Subunit assembly and mode of DNA cleavage of the type III restriction endonucleases EcoP1I and EcoP15I. J. Mol. Biol. 306, 417–431 (2001).
Article CAS PubMed Google Scholar
Wyszomirski, K. H. et al. Type III restriction endonuclease EcoP15I is a heterotrimeric complex containing one Res subunit with several DNA-binding regions and ATPase activity. Nucleic Acids Res. 40, 3610–3622 (2012).
Article CAS PubMed Google Scholar
Butterer, A. et al. Type III restriction endonucleases are heterotrimeric: comprising one helicase-nuclease subunit and a dimeric methyltransferase that binds only one specific DNA. Nucleic Acids Res. 42, 5139–5150 (2014).
Article CAS PubMed PubMed Central Google Scholar
Meisel, A., Bickle, T. A., Kruger, D. H. & Schroeder, C. Type III restriction enzymes need two inversely oriented recognition sites for DNA cleavage. Nature 355, 467–469 (1992).
Article ADS CAS PubMed Google Scholar
van Aelst, K. et al. Type III restriction enzymes cleave DNA by long-range interaction between sites in both head-to-head and tail-to-tail inverted repeat. Proc. Natl Acad. Sci. USA 107, 9123–9128 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Gorbalenya, A. E. & Koonin, E. V. Endonuclease (R) subunits of type-I and type-III restriction-modification enzymes contain a helicase-like domain. FEBS Lett. 291, 277–281 (1991).
Article CAS PubMed Google Scholar
Singleton, M. R., Dillingham, M. S. & Wigley, D. B. Structure and mechanism of helicases and nucleic acid translocases. Annu. Rev. Biochem. 76, 23–50 (2007).
Article CAS PubMed Google Scholar
Reiser, J. & Yuan, R. Purification and properties of the P15 specific restriction endonuclease from Escherichia coli. J. Biol. Chem. 252, 451–456 (1977).
CAS PubMed Google Scholar
Ramanathan, S. P. et al. Type III restriction enzymes communicate in 1D without looping between their target sites. Proc. Natl Acad. Sci. USA 106, 1748–1753 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Meisel, A., Mackeldanz, P., Bickle, T. A., Kruger, D. H. & Schroeder, C. Type III restriction endonucleases translocate DNA in a reaction driven by recognition site-specific ATP hydrolysis. EMBO J. 14, 2958–2966 (1995).
Article CAS PubMed PubMed Central Google Scholar
Crampton, N. et al. DNA looping and translocation provide an optimal cleavage mechanism for the type III restriction enzymes. EMBO J. 26, 3815–3825 (2007).
Article CAS PubMed PubMed Central Google Scholar
Schwarz, F. W. et al. The helicase-like domains of type III restriction enzymes trigger long-range diffusion along DNA. Science 340, 353–356 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Blainey, P. C. et al. Nonspecifically bound proteins spin while diffusing along DNA. Nat. Struct. Mol. Biol. 16, 1224–1229 (2009).
Article CAS PubMed PubMed Central Google Scholar
Szczelkun, M. D. Roles for helicases as ATP-dependent molecular switches. Adv. Exp. Med. Biol. 767, 225–244 (2013).
Article CAS PubMed Google Scholar
Qiu, R. et al. Large conformational changes in MutS during DNA scanning, mismatch recognition and repair signalling. EMBO J. 31, 2528–2540 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gorman, J. et al. Single-molecule imaging reveals target-search mechanisms during DNA mismatch repair. Proc. Natl Acad. Sci. USA 109, E3074–E3083 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Cho, W. K. et al. ATP alters the diffusion mechanics of MutS on mismatched DNA. Structure 20, 1264–1274 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ahmad, I., Krishnamurthy, V. & Rao, D. N. DNA recognition by the EcoP15I and EcoPI modification methyltransferases. Gene 157, 143–147 (1995).
Article CAS PubMed Google Scholar
Klimasauskas, S., Kumar, S., Roberts, R. J. & Cheng, X. HhaI methyltransferase flips its target base out of the DNA helix. Cell 76, 357–369 (1994).
Article CAS PubMed Google Scholar
Goedecke, K., Pignot, M., Goody, R. S., Scheidig, A. J. & Weinhold, E. Structure of the N6-adenine DNA methyltransferase M.TaqI in complex with DNA and a cofactor analog. Nat. Struct. Biol. 8, 121–125 (2001).
Article CAS PubMed Google Scholar
Zhong, X. et al. Molecular mechanism of action of plant DRM de novo DNA methyltransferases. Cell 157, 1050–1060 (2014).
Article CAS PubMed PubMed Central Google Scholar
Jia, D., Jurkowska, R. Z., Zhang, X., Jeltsch, A. & Cheng, X. Structure of Dnmt3a bound to Dnmt3L suggests a model for de novo DNA methylation. Nature 449, 248–251 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Meyer, K. D. & Jaffrey, S. R. The dynamic epitranscriptome: N6-methyladenosine and gene expression control. Nat. Rev. Mol. Cell Biol. 15, 313–326 (2014).
Article CAS PubMed PubMed Central Google Scholar
Stunnenberg, H. G., Vermeulen, M. & Atlasi, Y. Developmental biology. A Me6Age for pluripotency. Science 347, 614–615 (2015).
Article ADS CAS PubMed Google Scholar
Rao, D. N., Dryden, D. T. & Bheemanaik, S. Type III restriction-modification enzymes: a historical perspective. Nucleic Acids Res. 42, 45–55 (2013).
Article PubMed PubMed Central Google Scholar
Durr, H., Korner, C., Muller, M., Hickmann, V. & Hopfner, K. P. X-ray structures of the Sulfolobus solfataricus SWI2/SNF2 ATPase core and its complex with DNA. Cell 121, 363–373 (2005).
Article PubMed Google Scholar
Malone, T., Blumenthal, R. M. & Cheng, X. Structure-guided analysis reveals nine sequence motifs conserved among DNA amino-methyltransferases, and suggests a catalytic mechanism for these enzymes. J. Mol. Biol. 253, 618–632 (1995).
Article CAS PubMed Google Scholar
Schneider, B., Neidle, S. & Berman, H. M. Conformations of the sugar-phosphate backbone in helical DNA crystal structures. Biopolymers 42, 113–124 (1997).
Article CAS PubMed Google Scholar
Lukacs, C. M., Kucera, R., Schildkraut, I. & Aggarwal, A. K. Understanding the immutability of restriction enzymes: crystal structure of BglII and its DNA substrate at 1.5 A resolution [see comments]. Nat. Struct. Biol. 7, 134–140 (2000).
Article CAS PubMed Google Scholar
Sengoku, T., Nureki, O., Nakamura, A., Kobayashi, S. & Yokoyama, S. Structural basis for RNA unwinding by the DEAD-box protein Drosophila Vasa. Cell 125, 287–300 (2006).
Article CAS PubMed Google Scholar
Thoma, N. H. et al. Structure of the SWI2/SNF2 chromatin-remodeling domain of eukaryotic Rad54. Nat. Struct. Mol. Biol. 12, 350–356 (2005).
Article PubMed Google Scholar
Kim, J. L. et al. Hepatitis C virus NS3 RNA helicase domain with a bound oligonucleotide: the crystal structure provides insights into the mode of unwinding. Structure 6, 89–100 (1998).
Article CAS PubMed Google Scholar
Morgan, R. D., Dwinell, E. A., Bhatia, T. K., Lang, E. M. & Luyten, Y. A. The MmeI family: type II restriction-modification enzymes that employ single-strand modification for host protection. Nucleic Acids Res. 37, 5208–5221 (2009).
Article CAS PubMed PubMed Central Google Scholar
Shen, B. W. et al. Characterization and crystal structure of the type IIG restriction endonuclease RM.BpuSI. Nucleic Acids Res. 39, 8223–8236 (2011).
Article CAS PubMed PubMed Central Google Scholar
Scavetta, R. D. et al. Structure of RsrI methyltransferase, a member of the N6-adenine beta class of DNA methyltransferases. Nucleic Acids Res. 28, 3950–3961 (2000).
Article CAS PubMed PubMed Central Google Scholar
Osipiuk, J., Walsh, M. A. & Joachimiak, A. Crystal structure of MboIIA methyltransferase. Nucleic Acids Res. 31, 5440–5448 (2003).
Article CAS PubMed PubMed Central Google Scholar
Thomas, C. B. & Gumport, R. I. Dimerization of the bacterial RsrI N6-adenine DNA methyltransferase. Nucleic Acids Res. 34, 806–815 (2006).
Article CAS PubMed PubMed Central Google Scholar
Meyer, K. D. et al. Comprehensive analysis of mRNA methylation reveals enrichment in 3' UTRs and near stop codons. Cell 149, 1635–1646 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dominissini, D. et al. Topology of the human and mouse m6A RNA methylomes revealed by m6A-seq. Nature 485, 201–206 (2012).
Article ADS CAS PubMed Google Scholar
Batista, P. J. et al. m(6)A RNA modification controls cell fate transition in mammalian embryonic stem cells. Cell Stem Cell 15, 707–719 (2014).
Article CAS PubMed PubMed Central Google Scholar
Geula, S. et al. m6A mRNA methylation facilitates resolution of naive pluripotency toward differentiation. Science 347, 1002–1006 (2015).
Article ADS CAS PubMed Google Scholar
Wang, Y. et al. N6-methyladenosine modification destabilizes developmental regulators in embryonic stem cells. Nat. Cell Biol. 16, 191–198 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ping, X. L. et al. Mammalian WTAP is a regulatory subunit of the RNA N6-methyladenosine methyltransferase. Cell Res. 24, 177–189 (2014).
Article CAS PubMed PubMed Central Google Scholar
Liu, J. et al. A METTL3-METTL14 complex mediates mammalian nuclear RNA N6-adenosine methylation. Nat. Chem. Biol. 10, 93–95 (2014).
Article CAS PubMed Google Scholar
Bujnicki, J. M., Feder, M., Radlinska, M. & Blumenthal, R. M. Structure prediction and phylogenetic analysis of a functionally diverse family of proteins homologous to the MT-A70 subunit of the human mRNA:m(6)A methyltransferase. J. Mol. Evol. 55, 431–444 (2002).
Article ADS CAS PubMed Google Scholar
Michel, G. et al. The structure of the RlmB 23 S rRNA methyltransferase reveals a new methyltransferase fold with a unique knot. Structure 10, 1303–1315 (2002).
Article CAS PubMed Google Scholar
Thomas, S. R., Keller, C. A., Szyk, A., Cannon, J. R. & Laronde-Leblanc, N. A. Structural insights into the functional mechanism of Nep1/Emg1 N1-specific pseudouridine methyltransferase in ribosome biogenesis. Nucleic Acids Res. 39, 2445–2457 (2011).
Article CAS PubMed Google Scholar
Schubert, H. L., Blumenthal, R. M. & Cheng, X. Many paths to methyltransfer: a chronicle of convergence. Trends Biochem Sci. 28, 329–335 (2003).
Article CAS PubMed PubMed Central Google Scholar
Gorbalenya, A. E. & Koonin, E. V. Helicases: amino acid sequence comparisons and structure-function relationships. Curr. Opin. Struct. Biol. 3, 419–429 (1993).
Article CAS Google Scholar
Kelch, B. A., Makino, D. L., O'Donnell, M. & Kuriyan, J. How a DNA polymerase clamp loader opens a sliding clamp. Science 334, 1675–1680 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Skoglund, C. M., Smith, H. O. & Chandrasegaran, S. Construction of an efficient overproducer clone of HinfI restriction endonuclease using the polymerase chain reaction. Gene 88, 1–5 (1990).
Article CAS PubMed Google Scholar
Fortelle, d. L. & Bricogne, G. Maximum-likelihood heavy atom parameter refinement for multiple isomorphous replacement and multiwavelength anomalous diffraction methods. Methods Enzymol. 276, 472–494 (1997).
Article Google Scholar
Hendrickson, W. A. Determination of macromolecular structures from anomalous diffraction of synchrotron radiation. Science 254, 51–58 (1991).
Article ADS CAS PubMed Google Scholar
Vonrhein, C. et al. Data processing and analysis with the autoPROC toolbox. Acta Crystallogr. D Biol. Crystallogr. 67, 293–302 (2011).
Article CAS PubMed PubMed Central Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Article PubMed Google Scholar
Blanc, E. et al. Refinement of severely incomplete structures with maximum likelihood in BUSTER-TNT. Acta Crystallogr. D Biol. Crystallogr. 60, 2210–2221 (2004).
Article CAS PubMed Google Scholar
Kabsch, W. Xds. Acta Crystallogr. D Biol. Crystallogr. 66, 125–132 (2010).
Article CAS PubMed PubMed Central Google Scholar
McCoy, A. J., Grosse-Kunstleve, R. W., Storoni, L. C. & Read, R. J. Likelihood-enhanced fast translation functions. Acta Crystallogr. D Biol. Crystallogr. 61, 458–464 (2005).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank David Hough for expert technical assistance with protein purification and Jim Samuelson for development of the overexpressing strain. We also thank K. Rajashankar (NECAT-24ID, APS) and Qun Liu (X4A, BNL) for facilitating data collection and useful suggestions, and M. Szczelekun and R. Seidel for helpful discussions. We thank E. Vanamee for help during early stages of the project. We are thankful to the staff of NE-CAT 24ID, GM-CAT 23ID, SBC-CAT 19ID (APS, ANL), X25, X29 and X4A (NSLS, BNL), and A1 and F1 beamlines at MacCHESS for the provision of synchrotron beamtime. All structure figures were generated using Pymol. This work was supported by funding from NIH (R01 GM111507) and partially by New England Biolabs Inc.

Author information

Authors and Affiliations

Department of Structural and Chemical Biology, Icahn School of Medicine at Mount Sinai, Box 1677, 1425 Madison Avenue, New York, 10029, New York, USA
Yogesh K. Gupta & Aneel K. Aggarwal
New England Biolabs Inc., 240 County Road, Ipswich, 01938, Massachusetts, USA
Siu-Hong Chan & Shuang-yong Xu

Authors

Yogesh K. Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Siu-Hong Chan
View author publications
You can also search for this author in PubMed Google Scholar
Shuang-yong Xu
View author publications
You can also search for this author in PubMed Google Scholar
Aneel K. Aggarwal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.K.G. and A.K.A. designed the crystallographic studies; Y.K.G. performed the crystallographic studies; S.-H.C. and S.-y.X. performed the cloning and protein purification; A.K.A. and Y.K.G. wrote the paper.

Corresponding author

Correspondence to Aneel K. Aggarwal.

Ethics declarations

Competing interests

S.-H.C. and S.-y.X. are employees and shareholder of New England Biolabs Inc. The remaining authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-10, Supplementary Tables 1-2 and Supplementary References (PDF 14835 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Gupta, Y., Chan, SH., Xu, Sy. et al. Structural basis of asymmetric DNA methylation and ATP-triggered long-range diffusion by EcoP15I. Nat Commun 6, 7363 (2015). https://doi.org/10.1038/ncomms8363

Download citation

Received: 15 November 2014
Accepted: 30 April 2015
Published: 12 June 2015
DOI: https://doi.org/10.1038/ncomms8363

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.