Unwinding of a DNA replication fork by a hexameric viral helicase

Javed, Abid; Major, Balazs; Stead, Jonathan A.; Sanders, Cyril M.; Orlova, Elena V.

doi:10.1038/s41467-021-25843-6

Article
Open access
Published: 20 September 2021

Nature Communications volume 12, Article number: 5535 (2021)

Abstract

Hexameric helicases are motor proteins that unwind double-stranded DNA (dsDNA) during DNA replication, but how they are optimised for strand separation is unclear. Here we present the cryo-EM structure of the full-length E1 helicase from papillomavirus, revealing all arms of a bound DNA replication fork and their interactions with the helicase. The replication fork junction is located at the entrance to the helicase collar ring, which sits above the AAA+ motor assembly. dsDNA is escorted to, and the 5′ single-stranded DNA (ssDNA) away from, the unwinding point by the E1 dsDNA origin binding domains. The 3′ ssDNA interacts with six spirally-arranged β-hairpins and their cyclical top-to-bottom movement pulls the ssDNA through the helicase. Pulling of the replication fork (RF) against the collar ring separates the base-pairs, while modelling of the conformational cycle suggests that an accompanying movement of the collar ring has an auxiliary role, helping to make efficient use of ATP in duplex unwinding.

Introduction

DNA replication is an essential process in all living organisms. It starts at specific sites known as origins of replication (ori), where helicase enzymes begin the unwinding of double-stranded DNA (dsDNA), generating replication forks (RFs) that grow bi-directionally from ori¹. Helicases use the energy of nucleoside triphosphate (NTP) hydrolysis to translocate and unwind DNA, providing the single-stranded template (ssDNA) for accurate copying by DNA polymerase. Four of the six known helicase superfamilies (SF3–6) are enzymes assembled from six subunits arranged as hexameric rings, while those of SF1 and 2 are monomeric but sometimes function as dimers^2,3. In cells, the principal replicative helicase is a hexamer, but despite their crucial function how these helicases unwind dsDNA remains uncertain.

The prokaryotic SF4 and SF5 helicases have a RecA fold NTPase domain and translocate in the 5′–3′ direction on single-stranded nucleic acids^4,5. In contrast to these, the NTPase motor domains of the SF6 and viral SF3 helicases have an AAA+ (ATPase Associated with various Activities) fold^6,7 and translocate in the 3′–5′ direction⁴. Most hexameric helicases including papillomavirus E1, bacteriophage T7 gp4, E. coli DnaB and archaeal MCM are homo-oligomers, while the eukaryotic Mcm2-7 AAA+ hexamer, which forms the core of the CMG (Cdc45-MCM-GINS) helicase complex, is composed of six related but non-identical subunits^4,7,8. These helicases are assumed to operate by a strand (or steric) exclusion mechanism⁵; translocating on the active nucleic acid strand, the helicase moves towards the RF junction (RFJ) and unpairs the DNA bases, while the passive strand is excluded from the complex. Whether a hexameric helicase acts simply as a non-specific wedge or employs a specific separation pin^9,10 or other functional domains¹¹ to optimise base separation, as in SF1 and SF2 helicases, is unclear.

Crystal structures of homo-hexameric helicases with short single-stranded nucleic acid (NA) segments bound in the NTPase motor domain have been obtained for DnaB¹², the RNA helicase Rho¹³, the helicase domain of E1¹⁴, and archaeal MCM¹⁵. These structures show the nucleotides of the NA chain interacting with a “spiral staircase” of binding loops in the protein complex. Accordingly, mechanisms for NA translocation based on the sequential hydrolysis of ATP and a cyclical height-adjusted movement of the NA-binding loops have been suggested. Recent cryo-electron microscopy (cryo-EM) structures of yeast^16,17,18, Drosophila^19,20 and human²¹ MCM complexes bound to fork-like DNA substrates are also consistent with this general translocation mechanism. These structures, however, did not reveal the DNA unwinding point and interactions of the helicases with all arms of the RF in detail, so how DNA strand separation is achieved remained unclear.

The papillomaviruses (PVs) are a large group of human and animal pathogens²² and the PV E1 protein is a model AAA+ SF3 hexameric helicase^6,23. The N-terminal half of PV E1 has a regulatory module and a sequence-specific dsDNA-binding domain (OBD) for PV ori DNA recognition (Fig. 1a, bovine papillomavirus (BPV) E1). A hexamer of the C-terminal helicase domain (E1HD) can function alone to unwind dsDNA in vitro²⁴. The E1HD subunit can be sub-divided into the collar domain and the NTPase motor domain that form rings in the E1HD structure^14,25. Interestingly, E1, like Mcm2-7, can encircle both dsDNA and ssDNA^26,27. Also, the N-terminal domains of these proteins are located in front of the helicase motor, in the vicinity of the RFJ^{16,17,18,19,20,21,28}. In a low-resolution EM structure of the full-length E1 helicase bound to a synthetic DNA RF, the unwinding point was mapped at the entrance of the helicase collar domain²⁹, consistent with the steric exclusion model. However, the structure also showed extensive interactions of the N-terminal domains of the protein with the DNA ahead of the replication fork.

Here we present a near-atomic resolution (3.9 Å) cryo-EM structure of E1 revealing clearly all arms of the RF and the dsDNA unwinding point for an AAA+ hexameric helicase. Previous structures of the eukaryotic CMG helicase complex bound to fork substrates have been determined at resolutions close to 4 Å^{16,17,18,19,20,21}, providing important insights into the mechanism of CMG catalysed DNA unwinding. However, no structure has revealed the replication fork in its entirety so mechanistic understanding is limited. In the E1 helicase, dsDNA is separated at the entrance to the collar domain ring (Fig. 1a, b), as the AAA+ motor pulls one ssDNA strand through the complex and the RFJ against the collar ring. Two of the six E1 OBDs are observed at fixed positions, where one escorts the dsDNA to the RF unwinding point and the other the unwound 5′ssDNA away from the complex. We have also been able to trace the C-terminal acidic tails for all six E1 subunits, explaining their functional role in processive unwinding. The structure also shows deviations in the positions of the collar domains and flexibility of the AAA+ domains of the helicase induced by the presence of the RF, providing evidence that the collar is actively employed in strand separation.

Results

Cryo-EM structure of the E1RF complex

The crystal structures of the E1HD with¹⁴ and without²⁵ ADP and ssDNA bound (PDB 2GXA, 2V9P, respectively) are both asymmetric hexameric assemblies and show similar nucleotide and ssDNA-binding site architecture, despite the presence or absence of ligands. Accordingly, we reasoned that a stalled E1RF assembly could be generated in the absence of nucleotides and without using artificial DNA roadblocks¹⁶ to impede translocation. A hexameric E1-replication fork (E1RF) complex stalled at a RFJ was assembled and purified as previously described (Fig. 1c)²⁹. The DNA replication fork (RF) substrate consisted of 30 base pairs of dsDNA, a 3′ T20-active ssDNA strand (the strand upon which the helicase translocates, or the leading DNA replication strand) and a 13 base 5′ passive (lagging replication) strand (Fig. 1b, see the “Methods” section). The fork substrate used is actively unwound by the helicase, while substrates lacking a 5′ passive strand are less efficient and substrates without an active 3′ strand are not unwound significantly²⁹ (Fig. 1d).

Cryo-EM data of E1RF complexes were collected on a Titan Krios microscope operating at 300 keV using a Gatan K3 camera and were processed using RELION 3.0³⁰ and cryoSPARC v2.1³¹ (Supplementary Table 1 and Supplementary Figs. 1 and 2, see the “Methods” section). The 2D class averages of cryo-EM images of the E1RF showed a distinct, two-tiered structure confirmed to be the collar and AAA+ domains of the E1HD module (Fig. 2a, Supplementary Fig. 1). There is a ~20 Å wide rod of density extending from the E1HD, slightly tilted with respect to its central axis (17°), with a bulk of density on its outer side (Fig. 2). There is also density in the central channel of the E1HD, indicating ssDNA binding and therefore a stable complex with the DNA fork has been formed. Diffuse density visible above the E1HD region in class averages suggests a flexible arrangement of some OBDs and N-terminal sub-domains (Supplementary Figs. 1a and 3).

A cryo-EM E1RF map was obtained at a resolution of 3.9 Å (Fig. 2, Supplementary Figs. 1 and 2). The structure demonstrates unambiguously the positions of the dsDNA, the 3′ssDNA active strand within the entire central channel of the E1HD and the 5′ssDNA passive strand, positioned on the top of the collar ring (Fig. 2). The atomic model of dsDNA could be superposed with the rod of density protruding from the centre of the hexamer. The dsDNA is contiguous with the 3′ ssDNA strand located in the central E1HD ssDNA binding tunnel and the 5′ passive ssDNA strand (Fig. 2). The dsDNA separation point, the RFJ, is located at the entrance to the collar tunnel. Based on the positions of the ssDNA-binding β-hairpins we aligned the E1RF structure with respect to the crystal structures^14,25, as described below. For direct comparison, we labelled the six protein subunits of the E1RF assembly A–F, corresponding to the subunit designation in the E1HD crystal structures.

Notably, the cryo-EM structure has two well-defined bulks of densities in addition to the E1HD and DNA. One is located ~25 Å above the E1HD collar ring and is attached to the dsDNA, while the second, on the opposite side, is at a lower position bound to the 5′ssDNA (Fig. 2a). Density corresponding to the E1 NtDs (Fig. 1a) was not resolved in the EM map suggesting that these parts of the complex are disordered. Significantly, the densities for the C-terminal acidic tail of E1 (C-tT, residues 579–605, Fig. 1a) were defined for all six subunits up to residue 598, but modelled as poly-alanine. This domain is required for hexamer stability and processive DNA unwinding³². The residues of the C-tT, while present in the protein construct used to obtain the nucleotide and DNA free E1HD structure (but absent in the ligand-bound form¹⁴), were not visible in the X-ray map²⁵.

Positions of the OBDs

The two additional bulks of high density correspond very well to the size of the OBD³³ (PDB 1KSX and 1KSY). The orientations of these OBDs, from the B and E subunits (Fig. 2b), were defined by the links between their N-termini and the corresponding C-termini of the E1HD collar domain. OBDs B and E were better defined due to their interactions with DNA (Supplementary Fig. 3). The OBD of subunit B interacts with the dsDNA while the OBD from subunit E interacts with the 5′ passive ssDNA strand and is located close to the side of the E1HD collar ring (Fig. 2). The other OBDs and their associated NtDs of the other subunits were defined less well, to varying degrees, but they surround the dsDNA (Supplementary Fig. 3). Analysis of OBDs A, C, D and F revealed significant flexibility in their positions, with the locations of A and C least defined.

Interaction of E1 OBD B with dsDNA

X-ray structures of the E1 OBD showed that it has two DNA-binding segments, a DNA-binding loop (DBL, Arg180–Asn189) and a DNA-binding helix (DBH, Arg243–Leu254), that recognise dsDNA-binding sites at the PV ori, related to the E1 binding site (E1BS) consensus sequence 5′-ATTGTT^33,34. The DBL makes the major contribution to dsDNA binding through generic hydrophilic and van der Waals contacts, consistent with the relatively low binding specificity and affinity of the interaction. In the E1RF structure, an automated docking (iMODFIT³⁵, see the “Methods” section) of the E1 OBD atomic model (1KSY) into OBD B of the EM map resulted in an RMSD of 3.6 Å (Fig. 3a, b). While the dsDNA sequence of the fork does not contain an E1BS-like sequence, the cryo-EM structure indicates that OBD B interacts with the dsDNA via an interaction between the DBL and the major groove at the sequence TGTGA in the passive DNA strand, 16-20 nucleotides from the RFJ (Fig. 3c). Together, the well-defined OBD B-dsDNA interaction and contacts with the other surrounding, but more loosely positioned, OBDs (Supplementary Fig. 3) are consistent with previous biochemical “footprinting” experiments²⁹. This analysis demonstrated protection of the dsDNA from hydroxyl radical nucleolytic attack, most likely by direct contact with the OBDs and N-terminal segments of E1.
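The position of the TGTGA contact site can be checked directly against the passive-strand oligonucleotide listed in the Methods. The following is a minimal sketch, not part of the authors' analysis: the function name `motif_span_from_junction` is hypothetical, and it assumes that nucleotides are counted from 1 starting at the first base-paired nucleotide next to the RFJ.

```python
from typing import Tuple

# Passive-strand oligonucleotide from the Methods, written 5' to 3' and split
# into its two parts. The split point is the replication fork junction (RFJ).
PASSIVE_OVERHANG = "CCCCCCCCCCGTG"                  # 5' single-stranded arm
PASSIVE_DUPLEX = "CGCGCTGAGGTGCGGTGTGAAATACAAGCC"   # paired with the active strand


def motif_span_from_junction(duplex: str, motif: str) -> Tuple[int, int]:
    """Return the 1-based (first, last) positions of `motif` in `duplex`.

    Assumption (not stated in the paper): position 1 is the duplex
    nucleotide immediately adjacent to the RFJ.
    """
    start = duplex.find(motif)
    if start < 0:
        raise ValueError(f"motif {motif!r} not found in duplex")
    return start + 1, start + len(motif)


if __name__ == "__main__":
    first, last = motif_span_from_junction(PASSIVE_DUPLEX, "TGTGA")
    print(f"TGTGA spans passive-strand nucleotides {first}-{last} from the RFJ")
```

Under this counting convention the motif falls at nucleotides 16 to 20 of the duplex portion of the passive strand, matching the range quoted in the text.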

**Fig. 3: Interaction of OBD B with dsDNA.**

Lysines 183 and 186 in the OBD DBL are conserved in papillomavirus sequences (Supplementary Fig. 4) and have been demonstrated to be critical for dsDNA binding³⁶. To test if dsDNA interactions with the OBD influence helicase activity we generated an E1 protein with alanine at positions 183 and 186 (K183A/K186A). In helicase assays (Fig. 4a), a nearly two-fold reduction in DNA unwinding was observed for the altered protein (Fig. 4b, Supplementary Fig. 5a), demonstrating that the OBD domain has an auxiliary role in DNA unwinding.

**Fig. 4: Unwinding activity of E1 mutants.**

The collar domain ring

In the E1RF structure, the subunits of the collar ring are arranged with nearly six-fold rotational symmetry and three nucleotides of ssDNA appear stretched through its channel (Fig. 2a). The collar ring of E1RF is rigid and superposition with the crystal structure of E1HD/ssDNA/ADP (PDB 2GXA)¹⁴ indicates a low overall structural deviation (RMSD 0.65). In the E1RF structure, the conserved positively charged residues Lys356 and Lys359 project their side chains into the E1HD channel but their ε-amino groups are at least 4 Å from the 5′ ssDNA phosphate backbone (Fig. 5a, b). As such, strong electrostatic interactions between protein and ssDNA are unlikely, as supported by observations that substitution of these residues has no significant effect on ssDNA binding and unwinding³⁷.
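For readers unfamiliar with the RMSD values quoted for structural superpositions, the quantity can be sketched as follows. This is the general definition only, not the procedure of any fitting program used in the study: it assumes the two coordinate sets are already superposed and paired atom by atom, and the function name `rmsd` and the `Point` alias are illustrative.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def rmsd(coords_a: Sequence[Point], coords_b: Sequence[Point]) -> float:
    """Root-mean-square deviation between two paired coordinate sets.

    Assumes both sets are already superposed in the same frame and that
    coords_a[i] and coords_b[i] refer to the same atom. The result is in
    the same unit as the input coordinates.
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate sets must be non-empty and equal in length")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


if __name__ == "__main__":
    # Toy coordinates, not taken from any structure: every atom shifted by 1 unit.
    reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    shifted = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0)]
    print(rmsd(reference, shifted))
```

A low value therefore means that equivalent atoms in the two models sit close together on average, which is the sense in which the collar ring is described as rigid.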

**Fig. 5: The exit path of the 5′ passive ssDNA strand.**

Exit of the 5′ passive ssDNA strand

In the E1RF cryo-EM structure, the 5′ ssDNA is diverted from the separation point at an angle of 95° relative to the dsDNA axis (Fig. 2b), passing in a groove between E1 subunits D and E (Fig. 5a). The elements of the collar domains that cradle the RFJ are the loop residues ³⁵¹ThrAsnSer³⁵³ (TNS loop) from an α3–α4-hairpin turn in E1 collar domain subunit D. DNA footprinting experiments show that the DNA at the RFJ is protected from nucleolytic attack, implying close protein–DNA contacts²⁹. However, there is no evidence that these hairpins are involved directly in dsDNA unwinding, as this takes place above the hairpins (~4 Å) and the distance between the RFJ and the α–α-hairpin turn is therefore rather large.

Four distinct features mark the route taken by the 5′ ssDNA strand. First, the ssDNA is close to Lys310 in the inter-domain linker between the OBD and E1HD (distance ~2.3 Å, subunit E), which makes a contact with the ssDNA likely, thus fixing the path of the ssDNA (Fig. 5b, c, Supplementary Fig. 5b). While Lys310 is not conserved in the papillomavirus sequences (Supplementary Fig. 4) the K310A mutation shows a significant reduction (up to ~50%) in helicase activity, supporting its functional role in BPV E1 DNA unwinding (Fig. 4a). Second, the TNS loop of subunit D is interacting with and displacing the ssDNA upward (Fig. 5a–c). Alignment of nearly 100 papillomavirus sequences from the databases reveals an overwhelming preference for polar residues in this segment; only in a few cases is Asn352 substituted with serine, threonine and very rarely alanine. A range of amino acid substitutions tested at position 352 all showed reduced DNA unwinding activity (Supplementary Fig. 5c–e). Notably, glycine and lysine substitutions showed more than 50% and a nearly four-fold reduction of unwinding, respectively, indicating that relatively weak interactions with DNA by the polar Asn352 may be required. Together, therefore, Lys310 and Asn352 may be optimal for guiding the exit path of the 5′ ssDNA. Third, we traced as poly-alanine the E1 C-terminal tails (C-tT). They start from the AAA+ domain and form loops at the subunit interfaces (Fig. 2a), ending within a cleft between collar domain subunits. This puts the acidic portion of the tail (amino acids 584–594) ~8 Å below the ssDNA path (Fig. 2a, left panel, and Supplementary Fig. 5b). The proximity of this segment to the ssDNA would induce repulsion between these electronegative elements, thus ensuring unimpeded passage of the ssDNA away from the helicase domain subassembly. Finally, the OBD from subunit E interacts with the 5′ passive ssDNA strand. 
The linker (residues 303–314) anchoring OBD E to the E1HD allowed us to define its approximate orientation, which was refined by an automated docking of the X-ray atomic model (1KSY) (Fig. 5d–f, see the “Methods” section). The fitting indicates that Lys168 of the N-terminal helix (α1) and Lys279 from helix α5 of the OBD are close to the ssDNA (3 and 4 Å, respectively), thus presenting a different binding surface to DNA compared to OBD B. Lys168 is conserved in all papillomavirus E1 sequences except for rare substitutions with arginine, while the majority of PV sequences have Lys or Arg at the position corresponding to Lys279 in BPV E1 (Supplementary Fig. 4). In helicase assays, the variant E1 proteins K168A and K279A show ~15% and 30% reductions in DNA unwinding, respectively (Fig. 4a, Supplementary Fig. 5a). Accordingly, Lys168 and Lys279 of OBD E may play a role in guiding the emerging 5′ ssDNA strand away from the helicase motor.

Interaction of the 3′ ssDNA with the E1HD

In both E1HD crystal structures (PDB 2GXA with¹⁴ and 2V9P without²⁵ cofactors bound) the ssDNA-binding segments of the subunits are positioned in a helical array along the axis of the ssDNA-binding tunnel. In E1HD/ssDNA/ADP (PDB 2GXA), interactions with ssDNA are mediated via the conserved DNA-binding β-hairpin (500–514 aa) residues Lys506 and His507^14,38. The 2GXA structure has a large gap between subunits A and F and the β-hairpin of subunit A is positioned at the top of the staircase. In the E1RF complex, as in E1HD/ssDNA/ADP, six nucleotides of the 3′ ssDNA strand form a right-handed helix with the β-hairpins of the AAA+ domains, before it exits from the channel (Figs. 2a, 6a). In E1RF, the DNA-binding β-hairpins of subunits C–E are in contact with the DNA and align well with the 2GXA structure; the β-hairpin of F also contacts DNA in E1RF, but it sits slightly below the corresponding β-hairpin in 2GXA. However, the β-hairpins of the A and B subunits differ in conformation from their counterparts in the crystal structure. While in both structures the β-hairpin of subunit A does not make direct contact with the 3′ ssDNA, in E1RF the β-hairpin of A is positioned 11 Å below the corresponding β-hairpin in the crystal structure (Fig. 6b). The β-hairpin of subunit B does not contact the 3′ ssDNA either, since its His507 is turned away from ssDNA compared to the crystal structure (Fig. 6a, b). Our observations, therefore, appear to be consistent with the coordinated escort mechanism for ssDNA translocation¹⁴, although a different conformational state appears to be captured in the cryo-EM structure, where the subunit A β-hairpin has disengaged from ssDNA and has not yet migrated back to the top of the complex to re-engage with ssDNA.

**Fig. 6: Interactions of the DNA-binding β-hairpins with the 3′-ssDNA.**

Biochemical analysis of E1RF–DNA interactions

We analysed E1RF–DNA interactions using a footprinting assay, where close protein–DNA contacts are revealed by the protection of the DNA from hydroxyl radical (OH•) nucleolytic attack²⁹. Hexameric helicase complexes were assembled with ³²P end-labeled substrates, complete DNA binding was confirmed by gel-shift analysis, while the remainder of the reaction was exposed to the OH• (see the “Methods” section). The wild type E1RF complex was compared to assemblies with E1 K183A/K186A to probe OBD B interaction with the dsDNA and E1 K310A/K168A/N352G (targeting residues in the inter-domain linker, OBD E and collar domain, respectively; Figs. 4, 5 and Supplementary Fig. 5f) to probe the interaction with the 5′ ssDNA component of the RF substrate. Importantly, the homohexameric nature of E1RF does not allow distinct single-subunit interactions to be probed biochemically.

The E1RF OH• footprints (Supplementary Fig. 6), visualised and quantitated by phosphorimaging, show moderate and incomplete protection throughout the DNA, as would be expected for interactions that are extensive, but weak and transient. When the 3′ active strand of wild-type and E1 K183A/K186A RF complexes are compared (Supplementary Fig. 6b, d), the ssDNA nucleotides close to the unwinding point show very similar levels of protection, while all other 3′ ssDNA nucleotides show increases in peak height (susceptibility to OH• cleavage) of up to ~12%. However, enhanced susceptibility to OH• cleavage is seen in the dsDNA of E1 K183A/K186A-RF, increasing with the distance from the RFJ (up to ~30% increase in peak height), implying weaker contacts between protein and dsDNA. On the 5′ passive DNA strand, the peak heights for the ssDNA cleavage products are nearly identical. However, again, peak heights increase by up to ~30% in the dsDNA region 10–25 nucleotides from the RFJ for E1 K183A/K186A compared to wild-type RF complexes. The diminished protection of the dsDNA in E1 K183A/K186A-RF appears most pronounced in the region ~15–20 bases from the RFJ, observed to interact with OBD B in E1RF (Fig. 3). These observations support the structural data showing that OBDs A–D and F surround the dsDNA (Supplementary Fig. 3), with B forming more stable contacts with dsDNA (Fig. 3).

For E1 K310A/K168A/N352G-RF the OH• cleavage pattern is different to K183A/K186A and wild-type RF complexes. For the 3′ active DNA strand (Supplementary Fig. 6b, e), the peak heights for cleavage products in the dsDNA nucleotides 6–25 positions from the RFJ are near-equivalent between variant and wild-type E1RF. However, peak heights decrease, implying tighter contacts with DNA, in the five dsDNA nucleotides close to the RFJ and up to nine 3′ ssDNA nucleotides closest to the RFJ in E1 K310A/K168A/N352G-RF. It could be suggested that the positioning of the RFJ has been perturbed in this variant. In the 5′ passive DNA strand, cleavage of all dsDNA nucleotides and the three 5′ ssDNA nucleotides close to the RFJ is near-equivalent (Supplementary Fig. 6e) for mutant compared to wild-type complexes. However, the 5′ ssDNA nucleotides at positions 4–7 from the RFJ show a subtle increase in protection for E1 K310A/K168A/N352G-RF. These observations suggest that residues Lys310, Lys168 and Asn352 minimise stabilising contacts between protein and the 5′ ssDNA, but are necessary for chaperoning the 5′ ssDNA away from the protein complex during unwinding. Furthermore, the path taken by the 5′ ssDNA across the collar is predominantly neutral in character (Supplementary Fig. 5b), suggesting that there are in general no strong 5′ ssDNA interactions with the collar ring.

Conformational changes in E1HD

Although the collar ring is rigid, the alignment between the EM structure and the E1HD/ssDNA/ADP (PDB 2GXA) crystal structure shows changes in the positions of the collar domain subunits, where D and E are moved by ~3 Å up towards the 5′ ssDNA (Fig. 7a, and Supplementary Fig. 7). As such, the collar ring is tilted by 3° as a rigid body relative to its position observed in both X-ray structures^14,25. In the AAA+ domains, however, translational shifts of some segments are significant, particularly in the A, B and F subunits, with Cα deviations between E1RF and the crystal structure 2GXA of up to 8 Å. These differences, observed mainly as shifts in the β-layers and α-helices at the periphery of the complex (Fig. 7, Supplementary Fig. 8), are likely to be observable due to the bound RF and natural non-crystallographic environment of E1RF in cryo-EM. In the coordinated escort model¹⁴, the six AAA+ domains and their associated ssDNA-binding segments follow a conformational wave around the complex during ssDNA translocation (Fig. 7b). Interestingly, the α-5 helix of the E1HD appears to act as a main ‘hinge’ between subunit collar and AAA+ domains. In the E1RF structure, the α-5 ‘hinge’ appears to move up to 6 Å in a wave-like trajectory around the hexamer subunits A–F (Fig. 7c). As such, these observations imply that the α-5 hinge motion would coordinate the movement of both the collar and AAA+ domains during translocation on ssDNA (Fig. 7d and Supplementary Movie 1), with the tilt and elevation of the collar ring following the wave-like motion around the subunits to push up against the RFJ. Accordingly, ssDNA translocation and base-pair separation by the E1 helicase are coupled. We propose that as each of the six subunits of the E1 hexamer completes a cycle of ATP hydrolysis, pulling six ssDNA nucleotides through the AAA+ motor, there is an integral power stroke pushing against the RFJ, equivalent to the displacement of one base pair (~3 Å). 
We propose that the E1 helicase collar can be viewed as an active mechanical separation wedge, governed by nucleotide binding and hydrolysis events, rather than a simple obstacle for strand displacement.
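The bookkeeping behind this proposal can be illustrated with a toy model. The sketch below is not the authors' conformational modelling: the class `EscortModel` is hypothetical, it assumes one nucleotide is translocated per ATP hydrolysed (as in the six-ATP, six-nucleotide cycle described above), and the order in which subunits reset (F, E, D, C, B, A) is an arbitrary choice made for illustration.

```python
from dataclasses import dataclass, field
from typing import List

SUBUNITS = "ABCDEF"


@dataclass
class EscortModel:
    """Toy staircase of six beta-hairpins; step 0 is the top, step 5 the bottom."""

    # heights[i] is the staircase step occupied by the hairpin of SUBUNITS[i].
    # Subunit A starts at the top, as in the 2GXA crystal structure.
    heights: List[int] = field(default_factory=lambda: list(range(6)))
    atp_hydrolysed: int = 0
    nucleotides_translocated: int = 0

    def step(self) -> str:
        """Apply one ATP hydrolysis event.

        Every hairpin moves down one step with its nucleotide; the hairpin
        that was at the bottom releases and returns to the top. Returns the
        name of the subunit that resets.
        """
        resetting = self.heights.index(5)
        self.heights = [(h + 1) % 6 for h in self.heights]
        self.atp_hydrolysed += 1
        self.nucleotides_translocated += 1  # assumption: 1 nt per ATP
        return SUBUNITS[resetting]

    @property
    def collar_strokes(self) -> int:
        """Proposed collar power strokes: one per complete six-subunit cycle."""
        return self.atp_hydrolysed // 6


if __name__ == "__main__":
    model = EscortModel()
    for _ in range(6):
        print("hairpin reset:", model.step(), "heights:", model.heights)
    print("nucleotides:", model.nucleotides_translocated,
          "collar strokes:", model.collar_strokes)
```

After six steps every hairpin is back at its starting height, six nucleotides have passed through the motor, and the model has counted a single collar stroke, which is the once-per-revolution relationship proposed in the text.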

**Fig. 7: Positional variation of the subunit domains in E1RF.**

Discussion

The crystal structure of E1HD bound to ssDNA and ADP¹⁴ first provided a hypothesis for how hexameric helicases can translocate on single-stranded nucleic acids^{12,13,14,15,16,17,18,19,20,21}. The E1RF cryo-EM structure now shows how E1 separates DNA base pairs and is optimised as a DNA unwinding machine (Fig. 8, Supplementary Movie 2).

**Fig. 8: Cartoon model for E1 DNA unwinding.**

Remarkably, the conformation of the RF DNA in E1RF is very similar to that observed in other DNA unwinding machines^39,40, suggesting its organisation is optimal for base separation. The conformation of the DNA fork in E1RF is maintained in several ways. First, the E1 subunit B OBD tracks the major groove while additional OBDs encircle the dsDNA as it approaches the collar ring (Figs. 2a, 3 and Supplementary Fig. 3). Second, the 5′ ssDNA is trapped in a groove between collar subunits D and E where the TNS loop of subunit D and Lys310 in the inter-domain linker of subunit E escort the unwound 5′ ssDNA from the complex (Fig. 5). Third, the positions of adjacent OBDs D and E above the collar ring (Supplementary Fig. 3) would prevent the 5′ ssDNA from skipping to an alternative channel at the subunit interfaces. Finally, interactions with the OBD of subunit E further stabilise the path of the 5′ ssDNA. Importantly, our observations of the OBD–RF interactions are consistent with single-molecule fluorescence energy transfer (smFRET) experiments, where DNA unwinding by E1HD alone is significantly less smooth than the process catalysed by E1, where the presence of the E1 OBDs helps to prevent backward slippage on ssDNA and rewinding of duplex DNA²⁸. Furthermore, variant proteins with substitutions of residues implicated in DNA interactions demonstrated measurable defects in dsDNA unwinding, while probing in a footprinting assay also suggests that DNA contacts are altered in these variants (Fig. 4, and Supplementary Figs. 5, 6). Therefore, simple strand exclusion mechanisms may be sub-optimal in hexameric helicases, without fork stabilisation and DNA strand escorting mechanisms that enhance the strand separation process.

Interactions with the fork dsDNA have been observed primarily in the monomeric SF1 and SF2 helicases, including RecB of the RecBCD-type helicase-nucleases^41,42, PcrA⁹, Hel308¹⁰ and UvrD⁴³ where they are proposed to have direct mechanical roles in base pair destabilisation in an ATP-dependent power stroke. In contrast, the role of the E1 OBD B-dsDNA interaction in unwinding is indirect, by assisting in the positioning of the RF to prevent reversal of the helicase²⁸.

Recently obtained structures of the yeast and human CMG helicases show a short stretch of dsDNA entering the complex and a possible exit path for the 3′ ssDNA^17,18, suggesting that it is trapped between specific protein segments and the fork position is also fixed during unwinding. The structure of the yeast CMG helicase (Cdc45, MCM and GINS) bound to the fork protection complex (Csm3/Tof1 and Mrc1) shows Csm3/Tof1 located on the N-terminal tier face of MCM, ‘gripping’ the dsDNA. While Csm3/Tof1 is required for efficient replication in a reconstituted cell-free system, dsDNA-binding mutants showed no or minimal defects in in vitro DNA replication¹⁸. Although the phylogenetic similarity between E1 and Mcm2–7 is limited to the AAA+ motor domain, the E1 OBDs may perform a similar function to the fork protection complex. Together, the data suggest that the correct positioning of the dsDNA and relatively loose protein contacts that guide its path are critical for optimal DNA unwinding.

Our cryo-EM E1RF structure revealed the C-terminal acidic tails. They terminate in a groove at the interface between collar domain subunits, with the acidic portion positioned below the 5′ ssDNA. Here, they play an important role in processive unwinding by stabilising the E1 hexameric assembly³². Moreover, the position of the electronegative segment, now visualised in E1RF, may also be important for 5′ ssDNA escorting. The exit route of the 5′ ssDNA across the top of the collar domain is predominantly neutral (Supplementary Fig. 5b), while only specific positively charged points (Lys310 in the interdomain linker, and lysines on the surface of OBD E) act to fix the path. The acidic (electronegative) portion of the C-terminal tail of subunit E may also help to direct the path of the 5′ ssDNA by repulsion. The C-tT is conserved in the related SF3 helicase T-antigen and similar acidic segments are also found in other helicases including the T7 gp4 helicase-primase and TWINKLE⁴⁴. Although the function is likely to be conserved in T-antigen³², in T7 gp4 the acidic tails are involved in local tethering of the polymerase, which immediately replicates the unwound DNA⁴⁰. Appropriate escorting of all arms of the RF would ensure that the ssDNA strands are separated to prevent re-annealing and facilitate coupling to the DNA replicating apparatus.

To date, only two other hexameric helicase structures, the AAA+-type yeast MCM^17,18 and the RecA-type T7 gp4 helicase-primase⁴⁰, have been obtained with the DNA strand separation point (the RFJ) observed at near-atomic resolution. MCM and T7 gp4 both use a planar aromatic residue to stack against a base at the RFJ and mechanically assist in unpairing DNA, although for T7 gp4 the separation pin is provided by the polymerase subunit of the replisome. In E1, base pair separation does not employ a specific functional residue but occurs by steric exclusion at the entrance to the collar ring. Using the E1RF structure we modelled an entire conformational cycle of the helicase (Fig. 7, Supplementary Movie 1) and this analysis showed that the tilt and elevation of the collar ring follow a wave-like motion around the subunits, providing an auxiliary push against the RFJ. We suggest that base pair separation could be assisted by a once-per-revolution power-stroke directly coupled to the ATPase cycle, allowing E1 to make efficient use of the energy of ATP hydrolysis for translocation-coupled DNA unwinding. Although the E1RF structure was determined without ATP or analogues, X-ray structures of the E1HD without²⁵ and with¹⁴ ssDNA and ADP bound are nearly identical. In particular, the architecture of the nucleotide-binding sites, defined as ATP, ADP and apo type, is maintained and tightly linked to the positions of the DNA-binding hairpins. In E1RF the positions of the β-hairpins correspond well with those in the E1HD/ssDNA/ADP structure (Fig. 6), indicating that the nucleotide-binding site architecture will also be the same and consistent with the coordinated escort model of ssDNA translocation¹⁴.

Our data are in full accord with previous structural models^14,25,29, biochemical data^29,32,37 and smFRET observations²⁸. The E1 protein participates in the replication process, using both the E1HD and OBD domains for dsDNA ori binding, melting^37,38,45 and processive DNA unwinding²⁸ (Supplementary Movie 2). PV E1 demonstrates how viruses have borrowed functional segments from eukaryotic cells (e.g. the AAA+ domain) and have mimicked the operating principles of the host cell replication initiation apparatus (e.g. the CMG/fork protection complex¹⁸) to generate a minimalistic but highly streamlined replication machine. Understanding of these viral proteins will help to improve our knowledge of the more complex cellular replication machines and how viruses could be targeted therapeutically when they emerge as threats.

Methods

Assembly and analysis of E1 helicase complexes

Wild-type and variant full-length E1 proteins were purified as described previously³⁸. Briefly, the protein was expressed as a GST fusion protein and first purified on glutathione sepharose. Following cleavage of the GST tag with thrombin, the protein was purified free from the tag by cation exchange (25 mM sodium phosphate pH 7.1, 5 mM DTT, 10% glycerol, 1 mM PMSF, 1 mM EDTA buffer, 50–400 mM NaCl gradient) followed by anion exchange (25 mM Tris–HCl pH 8.4, 5 mM DTT, 10% glycerol, 1 mM PMSF, 1 mM EDTA buffer, 100–400 mM NaCl gradient) chromatography. The E1–RF helicase complex was assembled and purified by gel filtration chromatography (Superdex S200 HR 10/300 GL, GE Healthcare) as previously described²⁹. The oligonucleotides 5′-GGCTTGTATTTCACACCGCACCTCAGCGCG(T)₂₀ (active strand) and 5′-CCCCCCCCCCGTGCGCGCTGAGGTGCGGTGTGAAATACAAGCC (passive strand) were annealed to generate the RF substrate (30 base pair dsDNA component underlined). Complexes were assembled with 60 μM E1 and 10 μM fork DNA and the gel filtration buffer used was 10 mM Tris–Cl pH 8.0, 225 mM NaCl, 2 mM DTT, 0.1 mM PMSF and 1 mM EDTA. The hexameric peak fractions were concentrated to ~5 mg/ml, snap-frozen in liquid nitrogen and stored at −80 °C for cryo-EM.
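The geometry of the fork substrate can be checked directly from the two oligonucleotide sequences given above. The following is a minimal sketch, assuming only Watson–Crick pairing between the 5′ end of the active strand and the 3′ end of the passive strand; the function and variable names are illustrative and are not part of the original analysis.

```python
# Oligonucleotide sequences as given in the text; (T)20 is expanded to 20 thymines.
ACTIVE = "GGCTTGTATTTCACACCGCACCTCAGCGCG" + "T" * 20
PASSIVE = "CCCCCCCCCCGTGCGCGCTGAGGTGCGGTGTGAAATACAAGCC"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def duplex_length(active, passive):
    """Count matching bases from the 3' end of the passive strand against the
    reverse complement of the active strand (longest common suffix)."""
    rc = reverse_complement(active)
    n = 0
    while n < min(len(rc), len(passive)) and rc[-1 - n] == passive[-1 - n]:
        n += 1
    return n


duplex_bp = duplex_length(ACTIVE, PASSIVE)
three_prime_arm = len(ACTIVE) - duplex_bp   # unpaired 3' tail of the active strand
five_prime_arm = len(PASSIVE) - duplex_bp   # unpaired 5' tail of the passive strand
print(duplex_bp, three_prime_arm, five_prime_arm)  # prints: 30 20 13
```

Under these assumptions the sequences give a 30 base pair duplex, a 20-nucleotide 3′ arm on the active strand and a 13-nucleotide 5′ arm on the passive strand.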

Oligonucleotides for helicase assays were 5′ end-labeled with polynucleotide kinase and [γ³²P]-ATP (7000 Ci/mmol). The substrate used had the same sequence as for RF assembly, given above, or variants with and without the 5′ and 3′ ssDNA arms. Helicase assays²⁹ with radiolabelled substrates (0.1 nM) were performed in 20 mM HEPES pH 7.2, 135 mM NaCl, 1 mM DTT, 0.1 mg/ml BSA, 0.1% NP40, 3 mM MgCl₂, 1 mM ATP. Reactions were incubated for 60 min at 22 °C and terminated by adjusting the reactions to 20 mM EDTA, 0.1% SDS, 10% glycerol, 0.13% w/v bromophenol blue. Products were separated on an 8% polyacrylamide/TBE gel containing 0.05% w/v SDS, and gels were exposed to phosphorimager plates (Fujifilm) for imaging and quantification (Fuji FLA3000, Image Gauge V3.3 software)²⁹.

For hydroxyl radical footprinting the sequence of the RF substrate was as above. The active strand was 5′ end-labeled (as above) and the passive strand 3′ end-labeled using [α³²P]-dCTP (3000 Ci/mmol) and Klenow exo^- (NEB), followed by a chase with excess unlabelled dCTP. In the latter case, an oligonucleotide lacking the two 3′ C residues was annealed to the active strand to achieve labelling. The substrates were purified by PAGE before assembling 50 μl binding reactions (20 mM Na phosphate pH 7.2, 135 mM NaCl, 0.1% NP40, 0.1 mg ml⁻¹ BSA, 1 mM PMSF, 1 mM DTT) with 16 μM E1 proteins and 2.4 μM RF DNA. After 20 min incubation, a 10 μl sample of each was analysed on an agarose gel (TAE running buffer) to confirm complete DNA binding by gel-shift. The remaining reaction was treated with the hydroxyl radical according to the general guidelines of Dixon et al. 1991⁴⁶. Reactions were diluted with an equal volume of 10 mM Tris–Cl pH 8, 0.1 mM EDTA, 100 mM NaCl and extracted twice with an equal volume of phenol/chloroform/isoamyl alcohol (25:24:1). An equal volume of the reaction was mixed with 98% formamide loading buffer and products resolved on a 15% denaturing urea sequencing gel. Gels were imaged using a phosphor imager (Fuji) and analysed using the lane profiling tool in the image analysis software (Fujifilm, Image Reader V1.8E), generating density traces for the DNA cleavage ladders with peaks proportional to the radioactive signal of the labelled DNA. Wild-type E1RF was compared to variant E1–RF complexes by overlaying the densitometry traces.
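The protein and DNA concentrations quoted for complex assembly and for footprinting can be expressed as E1 monomers per fork. The short calculation below is only an illustration of that arithmetic, assuming one hexamer (six monomers) per fork as the reference stoichiometry; the names are hypothetical.

```python
HEXAMER_SIZE = 6  # E1 subunits per hexameric helicase

# (E1 concentration, fork DNA concentration) in micromolar, as stated in the text.
conditions = {
    "gel filtration assembly": (60.0, 10.0),
    "hydroxyl radical footprinting": (16.0, 2.4),
}

# Molar ratio of E1 monomers to fork DNA for each condition.
ratios = {name: e1 / dna for name, (e1, dna) in conditions.items()}

for name, ratio in ratios.items():
    hexamer_equivalents = ratio / HEXAMER_SIZE
    print(f"{name}: {ratio:.2f} E1 per fork, "
          f"{hexamer_equivalents:.2f} hexamer equivalents per fork")
```

Both conditions correspond to roughly one hexamer equivalent per fork, with a slight protein excess in the footprinting reactions.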

ATPase activity was determined in the helicase buffer but with 8.5 mM MgCl₂ and 7.5 mM ATP. The released phosphate was determined over time using the charcoal-binding assay of Iggo and Lane^38,47.

Cryo-EM data collection

Purified E1RF complex at ~0.05 mg/ml was applied to lacey carbon grids with a continuous carbon support film (EM Sciences). 3 μl of sample was applied and blotted for 20 s before the grids were vitrified by plunge-freezing using a Vitrobot Mark IV (ThermoFisher™) at 100% humidity and 8 °C. Data for the E1RF complex were collected using EPU software (ThermoFisher™) on a Titan Krios electron microscope (ThermoFisher™) operating at 300 kV and equipped with a K3 Summit direct electron detector (Gatan Inc.) at the eBIC Diamond Light Source facility (Harwell, Oxfordshire, UK) and Birkbeck College, London. For the E1RF complex samples, movies (45 frames per movie) were collected with a dose of 1.12 e⁻/Å² per frame and a calibrated pixel size of 1.085 Å/pixel. Images were collected over a defocus range of −1.2 to −2.5 μm.
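Two quantities follow directly from the collection parameters above: the accumulated dose per movie and the Nyquist limit set by the pixel size. The snippet below is a simple worked calculation using only the values stated in the text; the variable names are illustrative.

```python
FRAMES_PER_MOVIE = 45
DOSE_PER_FRAME = 1.12   # electrons per square angstrom per frame
PIXEL_SIZE = 1.085      # angstrom per pixel (calibrated)

# Accumulated exposure over all frames of one movie.
total_dose = FRAMES_PER_MOVIE * DOSE_PER_FRAME

# Nyquist limit: the finest resolvable spacing is two pixels.
nyquist_limit = 2 * PIXEL_SIZE

print(f"Total dose per movie: {total_dose:.1f} e-/A^2")   # 50.4
print(f"Nyquist limit: {nyquist_limit:.2f} A")            # 2.17
```

The reported map resolution of 3.89 Å is therefore well within the sampling limit of the data.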

Electron microscopy data processing

11,200 movies were aligned using MotionCor2⁴⁸. CTFFIND4⁴⁹ was used to determine defocus values. Micrographs were screened manually to assess CTF quality and selected for further processing based on the presence of high-resolution Thon rings extending to at least 4 Å. For particle picking we used crYOLO v1.3.6⁵⁰ with the following procedure: particles were picked manually from a set of 50 randomly selected micrographs, and these particle images were used to train the crYOLO picking model. This model was optimised over several iterations and tested initially on a sub-set of 100 micrographs for picking ability. The optimised model was then used to pick particles from the entire dataset. RELION 3.0³⁰ was used to extract selected particle images for the E1RF complex with a box size of 300 × 300 pixels, giving ~560,000 particle images in total. The extracted particle images were then subjected to two-dimensional (2D) classification in RELION 3.0 and the subset of images that comprised the best classes, showing secondary structural features, was subsequently exported to cryoSPARC v2.9.0³¹. All following steps in image processing were carried out in cryoSPARC. A set of ~180,000 particle images was selected, based on choosing side views with a few end/tilt views in order to avoid preferred orientation effects on 3D reconstruction. This set of particles was subjected to ab-initio 3D classification implemented in cryoSPARC, run in multiple rounds with six K seeds in each round. 3D maps with clear density for the helicase domain and dsDNA were grouped and used for homogeneous 3D refinement in cryoSPARC, using the 3D map of the E1RF complex obtained during the first step of 3D classification. The final 3D map was obtained at a resolution of 3.89 Å at the 0.143 FSC threshold (and 4.5 Å at the 0.5 FSC threshold). For the fitting, the map was sharpened using the AutoSharpen option in PHENIX v1.14⁵¹ (Supplementary Figs. 1 and 2).
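The two resolution values quoted above correspond to the spatial frequencies at which the Fourier shell correlation (FSC) curve falls below 0.143 and 0.5. As a hedged illustration of how such values are read from a curve, the sketch below interpolates the threshold crossing on a made-up FSC curve; the data points and the function name are hypothetical and do not reproduce the E1RF curve or the cryoSPARC implementation.

```python
def resolution_at_threshold(freqs, fsc, threshold):
    """Return the resolution (in angstrom) at which the FSC curve first drops
    below `threshold`, using linear interpolation between neighbouring shells.
    `freqs` are spatial frequencies in 1/angstrom, in ascending order."""
    for i in range(1, len(fsc)):
        if fsc[i] < threshold <= fsc[i - 1]:
            f0, f1 = freqs[i - 1], freqs[i]
            y0, y1 = fsc[i - 1], fsc[i]
            f_cross = f0 + (y0 - threshold) * (f1 - f0) / (y0 - y1)
            return 1.0 / f_cross
    return None  # the curve never crosses the threshold


# Hypothetical FSC curve, for illustration only.
example_freqs = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30]
example_fsc = [1.00, 0.95, 0.80, 0.55, 0.20, 0.05]

res_0143 = resolution_at_threshold(example_freqs, example_fsc, 0.143)
res_05 = resolution_at_threshold(example_freqs, example_fsc, 0.5)
print(f"FSC = 0.143: {res_0143:.2f} A; FSC = 0.5: {res_05:.2f} A")
```

Because the FSC decreases with spatial frequency, the 0.143 criterion always reports a numerically smaller (better) resolution than the 0.5 criterion for the same curve, as seen in the 3.89 Å and 4.5 Å values reported here.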

Local refinements were performed for individual domains of the E1RF complex using masks with soft edges around selected areas of the helicase domains, DNA fork with the collar domains, and OBDs B and E. Small improvements in resolution were observed with the focused refinement. Later, the overall refined 3D map was used to analyse the OBD flexibility. Focused classification of maps within areas of the OBDs and DNA fork junction was carried out using the 3D variability option in cryoSPARC, using the first three principal component modes and generating six clusters. These six maps were analysed for the distribution of densities and the results are shown in Supplementary Fig. 3.

Model building and validation

Fitting into the final cryo-EM E1RF map was done using the X-ray structure of the E1 helicase domain with ssDNA and bound ADP (PDB 2GXA)¹⁴ as a starting model. Firstly, the correspondence of subunits to the X-ray atomic model was determined by rotating the X-ray structure by ~60° (rigid-body fitting), refining the local fit and assessing the cross-correlation with the EM map. The position with the highest cross-correlation was used as the starting point for the subsequent flexible fit of the hexameric model using normal-mode analysis in iMODFIT v.1.44³⁵. Then, the model was refined and validated using PHENIX v1.14 real-space refinement⁵¹. The quality of the model was assessed using COOT v0.8.9.1⁵². The initial model of the DNA fork was built using COOT and its fit into the EM density was refined using the ISOLDE package⁵³ and the real-space refinement option in PHENIX⁵¹. Fitting of OBD-B and -E was done using the X-ray structure (PDB 1KSY)³³ as the initial model, fitted as a rigid body into each OBD block of density within the EM map using Chimera^54,55. These fits were further locally refined using iMODFIT³⁵.
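The subunit-assignment step described above amounts to a register search: each of the six possible ~60° rotations of the hexameric model is scored against the map and the best-scoring one is kept. The toy example below illustrates the idea on one-dimensional per-subunit profiles scored with a Pearson correlation. It is a conceptual sketch only; the profiles are invented, and the actual fitting used full three-dimensional densities in the programs cited in the text.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def best_register(model_profile, map_profile):
    """Try every cyclic shift of the model (one shift = one subunit, i.e. 60 degrees
    for a hexamer) and return (best_correlation, best_shift)."""
    scores = []
    for shift in range(len(model_profile)):
        rotated = model_profile[shift:] + model_profile[:shift]
        scores.append((pearson(rotated, map_profile), shift))
    return max(scores)


# Hypothetical per-subunit feature (e.g. hairpin height along the staircase).
model_profile = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
# Hypothetical noisy measurement from the map, offset by two subunits.
map_profile = [2.1, 2.9, 4.2, 4.8, 0.1, 1.0]

best_score, best_shift = best_register(model_profile, map_profile)
print(f"best shift: {best_shift} subunits ({best_shift * 60} degrees), "
      f"correlation {best_score:.3f}")
```

An asymmetric feature such as the β-hairpin staircase is what makes the six registers distinguishable; a perfectly six-fold symmetric profile would score identically at every shift.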

A final round of real-space refinement using PHENIX v1.14 with secondary structure restraints was run using the E1RF atomic model based on the independently refined fittings of the helicase domains, the DNA fork and OBD B and E into the cryo-EM map. MOLPROBITY v4.440⁵⁶ was used to evaluate the quality of the structures. All data and model statistics are reported in the Supplementary Table.

All figures and movies were produced using UCSF CHIMERA v1.14 and CHIMERAX v1^54,55.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The E1RF map and atomic model are deposited in the EMDB and PDB databases under accession codes EMD-11852 and 7APD [https://doi.org/10.2210/pdb7APD/pdb], respectively. Source data are provided with this paper.

References

1. Parker, M. W., Botchan, M. R. & Berger, J. M. Mechanisms and regulation of DNA replication initiation in eukaryotes. Crit. Rev. Biochem. Mol. Biol. 52, 107–144 (2017).
2. Huen, J. et al. Structural insights into a unique dimeric DEAD-Box helicase CshA that promotes RNA decay. Structure 25, 469–481 (2017).
3. Raney, K. D., Byrd, A. K. & Aarattuthodiyi, S. Structure and mechanisms of SF1 DNA helicases. Adv. Exp. Med. Biol. 767, 17–46 (2013).
4. Singleton, M. R., Dillingham, M. S. & Wigley, D. B. Structure and mechanism of helicases and nucleic acid translocases. Annu. Rev. Biochem. 76, 23–50 (2007).
5. Patel, S. S. & Picha, K. M. Structure and function of hexameric helicases. Annu. Rev. Biochem. 69, 651–697 (2000).
6. Koonin, E. V. A common set of conserved motifs in a vast variety of putative nucleic acid-dependent ATPases including MCM proteins involved in the initiation of eukaryotic DNA replication. Nucleic Acids Res. 21, 2541–2547 (1993).
7. Neuwald, A. F., Aravind, L., Spouge, J. L. & Koonin, E. V. AAA+: A class of chaperone-like ATPases associated with the assembly, operation, and disassembly of protein complexes. Genome Res. 9, 27–43 (1999).
8. Abid Ali, F. & Costa, A. The MCM helicase motor of the eukaryotic replisome. J. Mol. Biol. 428, 1822–1832 (2016).
9. Velankar, S. S., Soultanas, P., Dillingham, M. S., Subramanya, H. S. & Wigley, D. B. Crystal structures of complexes of PcrA DNA helicase with a DNA substrate indicate an inchworm mechanism. Cell 97, 75–84 (1999).
10. Buttner, K., Nehring, S. & Hopfner, K.-P. Structural basis for DNA duplex separation by a superfamily-2 helicase. Nat. Struct. Mol. Biol. 14, 647–652 (2007).
11. Manthei, K. A., Hill, M. C., Burke, J. E., Butcher, S. E. & Keck, J. L. Structural mechanisms of DNA binding and unwinding in bacterial RecQ helicases. Proc. Natl Acad. Sci. USA 112, 4292–4297 (2015).
12. Itsathitphaisarn, O., Wing, R. A., Eliason, W. K., Wang, J. & Steitz, T. A. The hexameric helicase DnaB adopts a nonplanar conformation during translocation. Cell 151, 267–277 (2012).
13. Thomsen, N. D. & Berger, J. M. Running in reverse: the structural basis for translocation polarity in hexameric helicases. Cell 139, 523–534 (2009).
14. Enemark, E. J. & Joshua-Tor, L. Mechanism of DNA translocation in a replicative hexameric helicase. Nature 442, 270–275 (2006).
15. Meagher, M., Epling, L. B. & Enemark, E. J. DNA translocation mechanism of the MCM complex and implications for replication initiation. Nat. Commun. 10, 3117 (2019).
16. Georgescu, R. et al. Structure of eukaryotic CMG helicase at a replication fork and implications to replisome architecture and origin initiation. Proc. Natl Acad. Sci. USA 114, E697–E706 (2017).
17. Yuan, Z. et al. DNA unwinding mechanism of a eukaryotic replicative CMG helicase. Nat. Commun. 11, 688 (2020).
18. Baretić, D. et al. Cryo-EM structure of the fork progression complex bound to CMG at a replication fork. Mol. Cell 78, 926–940 (2020).
19. Abid Ali, F. et al. Cryo-EM structures of the eukaryotic replicative helicase bound to a translocation substrate. Nat. Commun. 7, 10708 (2016).
20. Eickhoff, P. et al. Molecular basis for ATP-hydrolysis-driven DNA translocation by the CMG helicase of the eukaryotic replisome. Cell Rep. 28, 2673–2688 (2019).
21. Rzechorzek, N. J., Hardwick, S. W., Jatikusumo, V. A., Chirgadze, D. Y. & Pellegrini, L. CryoEM structures of human CMG–ATPγS–DNA and CMG–AND-1 complexes. Nucleic Acids Res. 48, 6980–6995 (2020).
22. Doorbar, J., Egawa, N., Griffin, H., Kranjec, C. & Murakami, I. Human papillomavirus molecular biology and disease association. Rev. Med. Virol. 25, 2–23 (2015).
23. Sedman, J. & Stenlund, A. The papillomavirus E1 protein forms a DNA-dependent hexameric complex with ATPase and DNA helicase activities. J. Virol. 72, 6893–6897 (1998).
24. Castella, S., Burgin, D. & Sanders, C. M. Role of ATP hydrolysis in the DNA translocase activity of the bovine papillomavirus (BPV-1) E1 helicase. Nucleic Acids Res. 34, 3731–3741 (2006).
25. Sanders, C. M. et al. Papillomavirus E1 helicase assembly maintains an asymmetric state in the absence of DNA and nucleotide cofactors. Nucleic Acids Res. 35, 6451–6457 (2007).
26. Fouts, E. T., Yu, X., Egelman, E. H. & Botchan, M. R. Biochemical and electron microscopic image analysis of the hexameric E1 helicase. J. Biol. Chem. 274, 4447–4458 (1999).
27. Wasserman, M. R., Schauer, G. D., O’Donnell, M. E. & Liu, S. Replication fork activation is enabled by a single-stranded DNA gate in CMG helicase. Cell 178, 600–611e616 (2019).
28. Lee, S.-J. et al. Dynamic look at DNA unwinding by a replicative helicase. Proc. Natl Acad. Sci. USA 111, E827–E835 (2014).
29. Chaban, Y. et al. Structural basis for DNA strand separation by a hexameric replicative helicase. Nucleic Acids Res. 43, 8551–8563 (2015).
30. Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
31. Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
32. Whelan, F. et al. A flexible brace maintains the assembly of a hexameric replicative helicase during DNA unwinding. Nucleic Acids Res. 40, 2271–2283 (2012).
33. Enemark, E. J., Stenlund, A. & Joshua-Tor, L. Crystal structures of two intermediates in the assembly of the papillomavirus replication initiation complex. EMBO J. 21, 1487–1496 (2002).
34. Chen, G. & Stenlund, A. The E1 initiator recognizes multiple overlapping sites in the papillomavirus origin of DNA replication. J. Virol. 75, 292–302 (2001).
35. Lopéz-Blanco, J. R. & Chacón, P. iMODFIT: efficient and robust flexible fitting based on vibrational analysis in internal coordinates. J. Struct. Biol. 184, 261–270 (2013).
36. Gonzalez, A., Bazaldua-Hernandez, C., West, M., Woytek, K. & Wilson, V. G. Identification of a short, hydrophilic amino acid sequence critical for origin recognition by the bovine papillomavirus E1 protein. J. Virol. 74, 245–253 (2000).
37. Sanders, C. M. A DNA binding activity in BPV initiator protein E1 required for melting duplex ori DNA but not processive helicase activity initiated on partially single-stranded DNA. Nucleic Acids Res. 36, 1891–1899 (2008).
38. Castella, S., Bingham, G. & Sanders, C. M. Common determinants in DNA melting and helicase-catalysed DNA unwinding by papillomavirus replication protein E1. Nucleic Acids Res. 34, 3008–3019 (2006).
39. Cheng, K., Wilkinson, M., Chaban, Y. & Wigley, D. B. A conformational switch in response to Chi converts RecBCD from phage destruction to DNA repair. Nat. Struct. Mol. Biol. 27, 71–77 (2020).
40. Gao, Y. et al. Structures and operating principles of the replisome. Science 363, 6429:eaav7003 (2019).
41. Singleton, M. R., Dillingham, M. S., Gaudier, M., Kowalczykowski, S. C. & Wigley, D. B. Crystal structure of RecBCD enzyme reveals a machine for processing DNA breaks. Nature 432, 187–193 (2004).
42. Krajewski, W. W. et al. Structural basis for translocation by AddAB helicase–nuclease and its arrest at χ sites. Nature 508, 416–419 (2014).
43. Lee, J. Y. & Yang, W. UvrD helicase unwinds DNA one base pair at a time by a two-part power stroke. Cell 127, 1349–1360 (2006).
44. Bradley, P. & Falkenberg, M. TWINKLE and other human mitochondrial DNA helicases: structure, function and disease. Genes 11, 408 (2020).
45. Schuck, S. & Stenlund, A. Mechanistic analysis of local ori melting and helicase assembly by the papillomavirus E1 protein. Mol. Cell 43, 776–787 (2011).
46. Dixon, W. J. et al. Hydroxyl radical footprinting. Methods Enzymol. 208, 380–413 (1991).
47. Iggo, R. & Lane, D. Nuclear protein p68 is an RNA-dependent ATPase. EMBO J. 8, 1827–1831 (1989).
48. Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
49. Rohou, A. & Grigorieff, N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015).
50. Wagner, T. et al. SPHIRE-crYOLO is a fast and accurate fully automated particle picker for cryo-EM. Commun. Biol. 2, 218 (2019).
51. Afonine, P. V. et al. New tools for the analysis and validation of cryo-EM maps and atomic models. Acta Crystallogr. D 74, 814–840 (2018).
52. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
53. Croll, T. I. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr. D 74, 519–530 (2018).
54. Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
55. Goddard, T. D. et al. UCSF ChimeraX: meeting modern challenges in visualization and analysis. Protein Sci. 27, 14–25 (2018).
56. Chen, W. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D 66, 12–21 (2010).

Acknowledgements

We acknowledge Diamond Light Source for access to and support of the cryo-EM facilities at the UK national Electron Bio-Imaging Centre (eBIC) (proposal EM14704), funded by the Wellcome Trust, the Medical Research Council (MRC), and the Biotechnology and Biological Sciences Research Council (BBSRC). Part of the cryo-EM data for this investigation was collected at the ISMB EM facility at Birkbeck College, University of London, with financial support from the Wellcome Trust (202679/Z/16/Z and 206166/Z/17/Z). We thank Y. Chaban with D. Clare (eBIC), and N. Lukoyanova with S. Chen (Birkbeck) for their help with the data collection, and D. Houldershaw for computer support at Birkbeck throughout the duration of the project. We thank F. Coscia, Y. Chaban and K. Ryzhenkova for the initial steps in the analysis of this complex and S. Dehghani-Tafti for cloning E1 mutants. This work was supported by BBSRC grants to E.V.O. (BB/R002622/1) and C.M.S. (BB/R001685/1). We thank all reviewers for constructive suggestions during the review of the manuscript, which helped to improve its clarity.

Author information

These authors contributed equally: Abid Javed, Balazs Major, Jonathan A. Stead.

Authors and Affiliations

Department of Biological Sciences, Birkbeck College, Institute of Structural and Molecular Biology, Malet Street, London, WC1E 7HX, UK
Abid Javed & Elena V. Orlova
Academic Unit of Molecular Oncology, Department of Oncology and Metabolism, University of Sheffield, Medical School, Beech Hill Rd., Sheffield, S10 2RX, UK
Balazs Major, Jonathan A. Stead & Cyril M. Sanders

Authors

Abid Javed
Balazs Major
Jonathan A. Stead
Cyril M. Sanders
Elena V. Orlova

Contributions

E.V.O. and C.M.S. designed the research. J.A.S., B.M. and C.M.S. expressed and purified E1RF and tested the activity of E1 mutants. A.J. prepared the EM grids and collected EM data. A.J. and E.V.O. analysed the data and performed modelling. A.J., E.V.O. and C.M.S. wrote the manuscript, and all authors contributed to and approved the final manuscript.

Corresponding authors

Correspondence to Cyril M. Sanders or Elena V. Orlova.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Javed, A., Major, B., Stead, J.A. et al. Unwinding of a DNA replication fork by a hexameric viral helicase. Nat Commun 12, 5535 (2021). https://doi.org/10.1038/s41467-021-25843-6

Received: 23 October 2020
Accepted: 31 August 2021
Published: 20 September 2021
DOI: https://doi.org/10.1038/s41467-021-25843-6
