Complete architecture of the archaeal RNA polymerase open complex from single-molecule FRET and NPS

Nagy, Julia; Grohmann, Dina; Cheung, Alan C.M.; Schulz, Sarah; Smollett, Katherine; Werner, Finn; Michaelis, Jens

doi:10.1038/ncomms7161

Article
Published: 30 January 2015

Complete architecture of the archaeal RNA polymerase open complex from single-molecule FRET and NPS

Julia Nagy¹,
Dina Grohmann²,
Alan C.M. Cheung³,
Sarah Schulz²,
Katherine Smollett³,
Finn Werner³ &
…
Jens Michaelis¹

Nature Communications volume 6, Article number: 6161 (2015) Cite this article

5653 Accesses
47 Citations
2 Altmetric
Metrics details

Subjects

Abstract

The molecular architecture of RNAP II-like transcription initiation complexes remains opaque due to its conformational flexibility and size. Here we report the three-dimensional architecture of the complete open complex (OC) composed of the promoter DNA, TATA box-binding protein (TBP), transcription factor B (TFB), transcription factor E (TFE) and the 12-subunit RNA polymerase (RNAP) from Methanocaldococcus jannaschii. By combining single-molecule Förster resonance energy transfer and the Bayesian parameter estimation-based Nano-Positioning System analysis, we model the entire archaeal OC, which elucidates the path of the non-template DNA (ntDNA) strand and interaction sites of the transcription factors with the RNAP. Compared with models of the eukaryotic OC, the TATA DNA region with TBP and TFB is positioned closer to the surface of the RNAP, likely providing the mechanism by which DNA melting can occur in a minimal factor configuration, without the dedicated translocase/helicase encoding factor TFIIH.

You have full access to this article via your institution.

Download PDF

DNA origami-based single-molecule force spectroscopy elucidates RNA Polymerase III pre-initiation complex stability

Article Open access 05 June 2020

Structures of transcription preinitiation complex engaged with the +1 nucleosome

Article Open access 21 November 2022

Structures of mammalian RNA polymerase II pre-initiation complexes

Article 26 April 2021

Introduction

Transcription of all cellular genomes is carried out by evolutionary related multisubunit RNA polymerases (RNAPs). In contrast to eukaryotes, where different types of RNAPs exist, archaea utilize only one RNAP to transcribe their genes, but its subunit composition, structure and utilization of general transcription factors is strikingly similar to the eukaryotic RNAP II (Pol II) system^1,2.

Eukaryotic Pol II involves the interplay of a large set of transcription factors to initiate transcription, most importantly general transcription factors TFIIA, B, D, E, F and H. However, not all factors are strictly required; in particular TFIIA is only necessary to alleviate the repressive effects of negative regulators such as NC1. Moreover, using strong promoters and negatively supercoiled DNA templates only two factors TATA box-binding protein (TBP) and TFIIB suffice to direct start site specific transcription initiation by Pol II in vitro³. TBP and TFIIB assemble at the promoter⁴ and recruit Pol II as well as other factors to form the preinitiation complex (PIC). This complex is referred to as the closed complex (CC), which subsequently undergoes large conformational rearrangements during which the DNA strands are separated and the template DNA strand (tDNA) is loaded into the RNAP active site to form the open complex (OC). In eukaryotes, this process is greatly enhanced by the helicase activities encoded by TFIIH.

The large size, heterogeneous composition and conformationally dynamic nature of eukaryotic PICs have made their structural and functional analysis problematic. Recent advances in the field have improved our understanding of the overall structural organization of the eukaryotic PIC. X-ray structures of PIC sub-complexes (encompassing Pol II and TFIIB fragments) at high resolution have provided hints of mechanistic aspects of the CC-to-OC transition during transcription initiation. Thus the crystal structure of the Pol II–TFIIB complex allowed for modelling of the CC and OC^5,6,7. Several crosslinking studies have yielded information about the location of transcription factors TFIIB⁸, TFIIF, TFIIE^9,10 and TFIIH¹¹. Recent cryo-electron microscopy (cryo-EM) studies have provided overall structures of both the eukaryotic and archaeal PICs at low to intermediate resolution^12,13,14. However, all of these studies have failed to fully resolve the course of the DNA within the OC, likely due to the flexibility of the transcription bubble.

The archaeal transcription apparatus is an excellent model system for the eukaryotic Pol II system¹⁵ as its RNAP and associated basal transcription factors are homologous, and because the entire system from hyperthermophilic archaea can be reconstituted from recombinant proteins¹⁶. This enables us to site specifically introduce mutations or molecular probes such as fluorescent dyes for single-molecule fluorescence analysis^17,18. The factors TBP and transcription factor B (TFB; homologous to TFIIB) are necessary and sufficient for promoter-directed start site-specific transcription initiation of the archaeal RNAP, which mirrors the minimal factor requirements for Pol II (refs 3, 16, 19). A third factor, TFE (homologous to TFIIEα) interacts with RNAP and stimulates OC formation, also comparable to the Pol II system^18,20,21,22. The pivotal difference between the two systems is the apparent ease at which the OC is formed in archaea, whereas in eukaryotes the additional helicase/translocase activity of the general factor TFIIH is required in vivo, which is not conserved in any archaeal species.

Single-molecule techniques have shown great potential to resolve the dynamics of transcription processes because they allow for the direct and real-time observation of transcription, one molecule at a time²³.

To obtain quantitative structural and dynamic information about transcription complexes during various phases of transcription, the Nano-Positioning System (NPS) was developed²⁴. The NPS combines data from single-molecule Förster resonance energy transfer (smFRET) measurements with existing structural information and a rigorous analysis using Bayesian parameter estimation. As a result, three-dimensional probability density functions for dye molecules (‘antennas’) attached to positions in unknown, flexible regions of the complex of interest can be calculated. For visualization, the smallest volume enclosing a certain probability of the computed density, the credible volume, can be displayed together with the known structure. The recorded smFRET data are measured between the antennas and dye molecule (‘satellites’) attached to known positions from the crystal structure²⁴. This method has been used in Pol II transcription elongation complexes (ECs) to study the position of the exiting RNA²⁵, the influence of transcription factor TFIIB on the position of the nascent RNA²⁴ and the position of non-template DNA (ntDNA) and upstream DNA²⁶. Moreover, the architecture of a minimal Pol II OC²⁷ and the position of transcription factor TFE in the archaeal PIC¹⁸ have been determined. NPS has been further extended to a global analysis where a complete data set of all measured information about all the antenna-satellite pairs is used as a network to simultaneously infer the position of all the antennas²⁸.

Here, we used smFRET, global NPS analysis and fluorescently labelled components of the transcription machinery from the hyperthermophilic archaeon Methanocaldococcus jannaschii (Mj) to determine the molecular architecture of the complete archaeal OC consisting of RNAP, promoter DNA, TBP, TFB and TFE. We determined the smFRET efficiencies between unknown ‘antenna’ dye molecules and several known ‘satellite’ dye molecules incorporated at one of five reference sites in the RNAP, whose position can be inferred from crystallographic structures. The ‘antenna’ dyes are attached to either one of several positions on the upstream and downstream ntDNA strand to TBP or to TFB. The probability densities generated from the NPS calculation allow us to build a model of the complete archaeal OC, which provides valuable insights into the mechanism of transcription initiation. We find the two factors TBP and TFB to be located closer to the RNAP surface in archaeal complexes compared with the Pol II system. This can provide an answer to the question why the CC-to-OC transition readily occurs in archaea but necessitates TFIIH in the Pol II system, thus illustrating how during evolution of the eukaryotic domain of life subtle changes in the architecture of the initiation complex renders DNA melting largely dependent on TFIIH.

Results

Assembly of well defined OCs for smFRET experiments

Complete archaeal OCs were assembled using M. jannaschii TBP (MjTBP), TFB, TFE and RNAP (Methods) on the strong SSV T6 promoter as DNA template. To ensure that the complexes were in the open state, a non-complementary four nucleotide ‘mismatch’ was introduced in the promoter^16,29. To perform smFRET experiments, the complexes were labelled with a fluorescent donor and acceptor at desired locations. Fluorescently labelled DNA oligonucleotides, TBP, TFB, TFE and RNAP were combined to yield a large network of ~70 differently labelled complexes, each with a single smFRET pair at a desired location (Fig. 1a,b). Previous biochemical studies had established the formation of stable OCs from these components, capable of promoter-specific transcription initiation in vitro, forming RNA templates from a precise starting point (+1; ref. 22). Complex formation of this in vitro OC was also verified by electrophoretic mobility shift assays¹⁸.

**Figure 1: Schematic representation of the global FRET network.**

While smFRET experiments can reveal both the structure and dynamics of macromolecular complexes, they only focus on one smFRET value of one dye pair at a time. Therefore, it is important to ensure that complexes are formed properly and that the information obtained from the measurement is indicative of the desired complex. To this end, we performed a number of control experiments (Supplementary Methods). We found that stable OCs were formed in a factor-dependent manner (Supplementary Fig. 1a,b) but that transcription factor TFE did not exert an influence on the architecture of the OC (Supplementary Fig. 2a–d). Also, we found that the choice of the dye on the DNA strand had no effect on the distance information obtained from smFRET experiments (Supplementary Fig. 2e).

OCs with the tDNA strand at the active centre

In contrast to the ntDNA strand, the position of the tDNA strand of the melted region in the OC could be inferred from the crystal structure of yeast Pol II using a tailed DNA template³⁰. The position of the tDNA strand resembles that of the respective EC, even in the absence of RNA³¹. To build the model of the eukaryotic OC, DNA opening was assumed to commence 20 bp downstream of TATA³², yielding a DNA melted in a region between positions (+2) to (−13) (ref. 5).

To ascertain that the extent of the melted region in the M. jannaschii OC was comparable to the published data of the Pyrococcus system^33,34, we used KMnO₄ footprinting (Supplementary Fig. 3 and Methods). This method detects thymidine nucleobases in single-stranded DNA regions. The SSV T6 promoter template contains a mismatch region from (−3) to (+1) and thus contains an obligate single-stranded T at register (−1). This residue serves as a positive control and is detectable in the free promoter probe. Addition of TBP, TFB and RNAP leads to novel signals at register (−5) and (−7), reflecting the opening of the promoter by the transcription complex. Since the next T residue occurs at (−12), we conclude that the transcription bubble starts at (−1) extends to at least (−7) and importantly not beyond (−12), showing that the size of the melted region in our OC is in good agreement with the published results for the Pyrococcus system³³. Since pre-opened promoter templates were used, the transcription factor TFE did not significantly alter DNA melting and did not exert an influence on the architecture of the OC (Supplementary Figs 2 and 3).

Furthermore, we tested whether the tDNA strand was properly loaded into the active centre cleft. Therefore, we used NPS to localize two dye positions on the tDNA strand, namely tDNA(+3) and tDNA(−9) (Methods, Fig. 2a). For each position, we performed a set of smFRET measurements with a second dye molecule attached to one of the five reference sites on the RNAP: residue 257 of Rpo1′, residue 373 of Rpo2″, residue 11 of Rpo5, residues 49 and 65 of Rpo7 (Fig. 2a, Supplementary Fig. 4 and Methods). Exemplary histograms are shown for position tDNA(−9) (Fig. 2b–f) and the extracted data are summarized in Supplementary Table 1.

**Figure 2: Localization of two positions on the tDNA strand in the archaeal OC.**

Many of the observed smFRET histograms showed a secondary peak with a relative intensity varying between 5–30%. However, there was no evidence for dynamic interconversion between the two peaks. Instead, the side peak is likely caused by a different static population. Comparison of NPS analysis of side peaks and main peaks showed only minor changes in credible volume position (Supplementary Fig. 5). In the following, we restrict ourselves to the discussion of the main peaks; however, one should note that, while the side peaks would lead to small Angstrom level alterations of the model, the general conclusions of this work are not affected.

The use of Bayesian parameter estimation allows the computation of the most likely position and the three-dimensional uncertainty of the position of the fluorescent dye attached to the unknown position²⁴. For this, the uncertainties due to the presence of flexible linkers between the dye and the known positions on the RNAP were computed first (Supplementary Fig. 4 and Methods). Moreover, for each dye pair, we experimentally determined the fluorescence anisotropies and the isotropic Förster radii (Methods and Supplementary Table 2). Three-dimensional probability densities were then calculated using the respective linker lengths and the sizes of the dye molecules (Supplementary Table 3) and credible volumes were calculated and displayed in comparison with the crystal structure of the RNAP (Fig. 2g,h). The size of all credible volumes presented in this study corresponds to 68% credibility, representing the smallest volume, which encloses a probability of 68%. The credible volume of the dye attached to tDNA(+3) is located inside the cleft, in good agreement with the eukaryotic OC models^5,27. Also tDNA(−9) localizes at a position consistent to the eukaryotic OC models and the position is distinct from the one it would adopt in a CC conformation⁵ (Fig. 2h). In the case of tDNA(−9), the displayed volume is split into two distinct sub-volumes. One should note that this is not originating from the dynamic movement between these positions (our model is a static model) but represents the positioning uncertainty of this DNA position in the calculation at 68% confidence level. In fact, if drawn at higher confidence the two volumes merge.

The path of the ntDNA strand within the archaeal OC

To determine the path of the ntDNA strand, we assembled a variety of OCs, where a fluorescent donor dye was attached to ntDNA(+7), (−1), (−5), (−7), (−10), (−12) or (−14) (Methods). For each of these ntDNA fluorescent donor positions, a fluorescent acceptor was attached to one of the five different reference sites on the RNAP, generating five unique complexes per labelled donor position. Exemplary histograms are shown (Supplementary Fig. 6b–f) and the extracted data are summarized in Supplementary Table 4. In the global NPS analysis²⁸, the complete data set consisting of mean FRET efficiencies, dye attachment information (position, length of linker and size of dye molecule), steady state fluorescence anisotropies and isotropic Förster distances of all antenna-satellite pairs and the uncertainty in position of the satellite dyes (due to linker length and dye molecule size) were used to simultaneously infer the positions of all antennas within the RNAP coordinate system (Fig. 1 and Methods). As a result of Bayesian parameter estimation, we obtained the three-dimensional probability density of each antenna, which represents the position of the dye attached to the DNA base (Fig. 3a–c).

**Figure 3: Localization of the ntDNA strand in the archaeal OC.**

The furthest downstream position was ntDNA(+7) in the downstream duplex region. The NPS credible volume is located close to the clamp head region of subunit Rpo1′ but outside of the cleft (Fig. 3c, black). NtDNA(−1) lies within the single-stranded region of the ntDNA strand and its position (Fig. 3c, brown) is split into two distinct sub-volumes located at the edge of the cleft, proximal to the lobe domain of subunit Rpo2″. As for tDNA(−9), these sub-volumes are not originating from dynamic movement but represent the uncertainty in position at 68% confidence level; if drawn at higher confidence the two volumes would merge. The positions of the dye molecules attached to the next upstream bases ntDNA(−5) (Fig. 3c, red) and ntDNA(−7) (Fig. 3c, orange) occupy a similar region within the cleft, between the clamp core of subunit Rpo1′ and the lobe domain of Rpo2″. The position of ntDNA(−10) (Fig. 3c, yellow) is located closer to the clamp coiled-coil region of subunit Rpo1′. The volumes for ntDNA(−12) (Fig. 3c, green) and ntDNA (−14) (Fig. 3c, dark green, hardly visible) are largely overlapping and remain at the same side of the clamp coiled-coil region as ntDNA(−10).

We repeated the NPS calculation with a slightly modified crystal structure of the archaeal RNAP, where we moved the clamp core region by 8 Å to mimic an open clamp polymerase structure such as observed in cryo-EM studies of the eukaryotic OC¹³. The position of the calculated credible volumes for all our antenna dyes remained largely unchanged by this alteration, that is, changes were small compared with the size of the credible volumes and therefore all further discussion is based on the closed clamp state of the polymerase in accordance with single-molecule experiments on the bacterial OC³⁵.

Position of the transcription factors and DNA in the archaeal OC

To determine the positions of TBP, TFB and the upstream TATA DNA in the OC, we assembled complexes where a fluorescent donor was attached to positions on or around the predicted binding region of TBP, namely ntDNA(−18), (−24), (−30) or (−37) to residue S71 of TBP and to residue G262 of TFB (Fig. 1, Methods). Fluorescent acceptors were attached to one of the four reference sites on the RNAP, namely residues Rpo1′-G257, Rpo2″-Q373, Rpo7-V49 or Rpo7-S65, as before (Supplementary Table 5). Residue K11 of Rpo5 was situated too far away to yield information for the localization process and was therefore left out from the analysis. The position of residues G44 in the winged-helix domain and G133 in the zinc ribbon domain of TFE had previously been determined by NPS¹⁸, and we included this smFRET data into the global NPS calculation to yield an accurate model of the complete archaeal OC (Fig. 1).

To maximize the resolution of the NPS, we used a valuable feature of the global analysis, which allows inclusion of FRET measurements between two unknown positions. We therefore included smFRET measurements from all the TATA DNA positions to both TBP and TFB and also smFRET measurements between TBP and TFB (Fig. 1a dotted red lines and Supplementary Table 5). This procedure greatly increased the accuracy of all the determined dye positions. Corresponding histograms are shown (Supplementary Figs 6g–l and 7). As a result of Bayesian parameter estimation, we obtained the three-dimensional probability density for the position of each antenna dye (Fig. 4a–d).

**Figure 4: Localization of the TATA DNA and transcription factors in the archaeal OC.**

The credible volume for ntDNA(−18) (Fig. 4d, dark cyan) is adjacent to the protrusion domain, and defines the path of the double-stranded ntDNA strand when compared with the position of the more downstream ntDNA(−14) and ntDNA(−12) (Fig. 3c), which are located further away towards the clamp domain. The first credible volume describing the position of the TATA box, ntDNA(−24) (Fig. 4d, dark blue), is situated closer to the RNAP wall but further away from the RNAP surface compared with the position of ntDNA(−18). Together with the credible volume of the second TATA box position, ntDNA(−30) (Fig. 4a,b, magenta), the bend in the DNA caused by TBP (centred at positions −26/−27) can be visualized. The credible volume of the last localized position on the ntDNA strand, ntDNA(−37) (Fig. 4a,b, gold), is located adjacent to ntDNA(−30), indicating the upstream path of double-stranded DNA leading away from TBP. The position of residue S71 of TBP (Fig. 4d, purple) is located between the credible volumes of ntDNA positions (−24) and (−30) and in proximity to the RNAP wall and subunit Rpo12 and is consistent with crystal structures of TBP in complex with DNA. The credible volume of TFB-G262 (Fig. 4d, olive) is located further away from the protrusion domain than TBP and positioned closer to RNAP subunit Rpo12. For the localization of TFE, we used the previously published smFRET data in our global analysis. The global NPS localization for the two residues of TFE yields very similar positions than those determined previously¹⁸ but the credible volumes are smaller due to the increased accuracy of the global NPS calculation (Fig. 4c, yellow and green).

This location of the TATA box in the archaeal OC is distinct from that previously determined in our group using NPS for a minimal eukaryotic OC²⁷. In these studies a different promoter DNA sequence had been used (together with endogenous yeast Pol II and recombinant yeast transcription factors) and thus the question whether the particular conformation in an OC depends on the respective sequence arises³⁶. We performed control measurements with a different DNA scaffold and concluded that the observed conformation is independent of the underlying promoter sequence, and as such our structural conclusions about the archaeal OC have general value (Supplementary Fig. 8a and Supplementary Methods), and that the differences compared with the earlier studies are due to the difference in the OC structure between yeast and archaea.

Model of the complete archaeal open promoter complex

To build a model of the complete archaeal RNAP OC, we started with the RNAP structure from Sulfolobus shibatae (PDB 2WAQ; refs 37, 38) and used the calculated probability densities of the antenna dye attachment points on the ntDNA strand, TBP, TFB and TFE to position these elements and the tDNA onto this RNAP structure (Fig. 5). To arrive at a unique structural model, we made some structural assumptions such as the size of the melted region, or the point of melting and re-annealing, all based on published data (see Methods for details).

**Figure 5: Model of the complete archaeal OC.**

To estimate how well our new model fits the NPS probability densities, we calculated the accessible volume of the antenna dyes using the coordinates from our model of the archaeal OC. We found that in all but one case, the accessible volumes overlap with the corresponding NPS probability densities, showing that the built model is accurate (for more details see Methods).

Discussion

The smFRET and global NPS data presented here reveal the complete architecture of the open promoter complex in archaea including the paths of the ntDNA and tDNA strands, and the location of the three transcription initiation factors TBP, TFB and TFE. The resulting model provides a framework for understanding the molecular mechanisms of transcription initiation in the archaea, as well as allowing a comparison to the mechanism in the eukaryotic OC and providing insights into the evolution of the transcription machinery following the divergence of the archaeal and eukaryotic lineages.

The formation of complete archaeal OCs was strictly dependent on TBP and TFB, and the overall architecture and in particular the path of the DNA was independent of its sequence. The DNA strands are melted and the tDNA strand has been loaded into the active centre cleft directly comparable to eukaryotic OCs^13,30.

In our model of the archaeal OC, the double-stranded downstream DNA enters the archaeal polymerase at a similar angle to that previously shown for structures in yeast, bacteria and archaea containing short duplex DNA^30,38,39. In this position the downstream DNA can be stabilized by the proximal lysine-rich region of the jaw domain of Rpo1″ (residues 189–239), whereas the corresponding eukaryotic jaw domain would need a rotation inwards to superimpose with the archaeal counterpart, a movement hindered by the eukaryotic subunit Rpb9 and the N-terminal domain of the eukaryotic subunit Rpb5 that are not conserved in the archaeal RNAP³⁸. The archaeal subunit Rpo5, which lacks the N-terminal domain of Rpb5, is required for the formation of stable OCs⁴⁰ and has been shown to photo-crosslink to the downstream DNA⁴¹, which is perfectly consistent with our model.

Previously, we observed a dynamic switching of the downstream DNA into and out of the cleft in single-molecule studies of a minimal eukaryotic OC²⁷. Cryo-EM data show that TFIIF appears to facilitate this transition in eukaryotes¹³. In contrast, in the archaeal OC, the smFRET data for ntDNA(+7) in the downstream double-stranded region showed no evidence of a dynamic movement of the DNA. Presumably, the transcription factor TFE, which is known to stabilize the DNA in the OC, renders the downstream DNA in a stable conformation. Note that the studies on the minimal eukaryotic OC were performed in absence of TFIIE. Thus it would be interesting to see whether the eukaryotic TFIIE has a similar function and would lead to a stabilization of the loaded state. Functional transcription assays using the Pol II system demonstrate that TFIIE stimulates OC formation and transcription on negatively supercoiled templates independent of TFIIH, which is in good agreement with our data and validates the use of archaeal transcription systems as bona fide model systems for eukaryotic Pol II (refs 42, 43).

In our model the downstream DNA strands are separated at register ntDNA(+2) in proximity to fork loop 2 (subunit Rpo2″, residues 436–445), and close to the highly conserved residue R446, which corresponds to Rpb2 residue R504 in Pol II. A point mutation of this arginine in the Pyrococcus RNAP (R445) to alanine leads to elongation deficiency in vitro²¹. Our model is also in agreement with the additional function of fork loop 2 of sterically blocking the duplex binding of the DNA and thus preventing re-association of the separated strands⁴⁴. Therefore, at these positions, both tDNA and ntDNA strand conformations are very similar to those observed in the EC²⁶.

Further upstream, between registers ntDNA(−1) and (−7), the ntDNA strand runs adjacent to the fork loop 1 element (residues 404–410) and lobe domain of Rpo2″, the rudder of the Rpo1′ clamp (residues 278–297) and the linker region of TFB. Here the path of the ntDNA of the archaeal PIC diverges from that in the eukaryotic EC, as the TFB-linker region is situated at a position where it would clash with the ntDNA strand of the EC. NtDNA registers (−1) to (−3) pass close to the rudder, fork loop 1 and the TFB linker, whereas registers (−4) to (−7) are closer to the lobe. These protein interactions with the middle of the transcription bubble are highly likely to play a role in bubble melting and/or maintenance. Previous studies have shown the essential role of the Rpo1′ rudder in DNA strand separation, where mutants lacking this loop could not separate or maintain melted DNA²¹. In addition, yeast nuclear extracts containing temperature sensitive TFIIB were transcriptionally inactive in vitro and rescued only by adding recombinant wtTFIIB and not with TFIIB-containing mutations in the linker region⁵. Similarly, in vitro transcription assays with Pyrococcus furiosus RNAP and its initiation factors showed that point mutations or deletions in the TFB linker region allowed the formation of PICs but were inhibited for transcription⁵, and subsequent footprinting studies showed that these PICs were incapable of opening the promoter DNA. Thus, the B-linker region and Rpo1′ rudder are essential for promoter opening and OC stabilization. Our new model of the archaeal OC gives a mechanistic reason for these observations since the single-stranded ntDNA is positioned adjacent to all of these elements (Fig. 6a,b) and their interaction is likely to influence the stability and formation of the transcription bubble.

**Figure 6: Open complex model has implications for the melting of DNA in the CC to OC transition.**

As a consequence of the displacement of the ntDNA to the outside of the cleft relative to its path in the EC, its path comes close to the edge of the clamp core region and in particular to the clamp helix-coil-helix motif at register nt(−12) where the upstream end of the bubble lies. Previously, we have localized the binding position of the winged-helix domain of TFE¹⁸ to the tip of the helix-coil-helix motif and the global NPS analysis presented in this paper also confirms this observation. Moreover, cryo-EM as well as crosslinking data show that eukaryotic TFIIE contacts the RNAP at a similar binding site^9,13,14. Our model describes an interaction at this point between ntDNA at the upstream end of the bubble, TFE and the RNAP helix-coil-helix motif. These interactions are likely stabilizing the upstream end of the transcription bubble to prevent its collapse.

The point of DNA re-annealing at register ntDNA(−12) lies above a tunnel formed by the N-terminal domain of the TFB core, the TFB-linker helix region, the rudder, the protrusion and TFE. A comparison to the position of the upstream DNA in the EC²⁶ shows that a rearrangement of the complete upstream double-stranded region, including a release of TFB core from the RNAP surface and movement of the upstream DNA to a position in between Rpo1′ helix α8 (residues 235–251) and Rpo2″ helix α11 (residues 349–373), is required during the initiation to elongation transition, presumably leading to a release of the transcription initiation factors and bubble collapse.

Interestingly, the probability density for position ntDNA(−24), together with those for ntDNA(−30), ntDNA(−37), TBP-S71 and TFB-G262 define the pathway of the DNA strand around the TATA box in close proximity to the surface of the polymerase. Previous Far-Western blotting studies showed the strongest protein–protein interactions of TBP and TFB with subunits Rpo12, Rpo10 and Rpo2″ (ref. 45), which are all very close to the positions of the transcription factors in this OC model. Compared with the eukaryotic system, we find that the position of the TFB core domain in the archaeal OC has shifted and the position of TBP has changed substantially by ≈45 Å (Fig. 6c,d and Supplementary Movie 1)^5,13. While the N-terminal cyclin fold of the TFB core is only slightly tilted, the C-terminal cyclin fold of the TFB core is shifted and is localized closer to the DNA strand but still remains in proximity to the wall of the polymerase. Thereby, the helix-turn-helix motif consisting of helices TFB-H 4′ and TFB-H 5′ (residues K1265–K1292) is facing the ntDNA strand at registers ntDNA(−31) to ntDNA(−36), which form the purine-rich B-recognition element BRE.

Mechanistically, transcription initiation in archaea is ancestral and streamlined compared with the eukaryotic Pol II system. Archaeal genomes do not encode homologues of TFIIA, TFIIF, TBP-associated factors and TFIIH. In particular the latter two could make important contributions to the OC formation since the TBP-associated factors make contact with the promoter DNA around the transcription start site, and TFIIH because of the ATP-dependent helicase/translocase activity is crucial for DNA melting on the majority of promoters tested in vitro and probably all transcription initiation in vivo¹¹. However, using negatively supercoiled DNA and strong promoter templates, TBP and TFIIB suffice for initiation of eukaryotic Pol II (ref. 3), which demonstrates that the same ancestral mechanisms are able to facilitate OC formation in eukaryotes and archaea. Why are additional factors required by Pol II provided that the basic mechanisms are conserved? Our model of the complete archaeal OC provides a structural hypothesis for this apparent ease of DNA melting in archaea (Fig. 7). Since the archaeal RNAP pulls the promoter-bound factors TBP and TFB much closer to its surface than Pol II (Fig. 6c,d) and the downstream promoter DNA is bound between the RNAP jaws, this topology likely induces a torsional strain in the DNA that lowers the local melting temperature of the promoter DNA. Interactions between the tDNA and residues on the inside of the DNA binding channel subsequently facilitate a swift loading of the tDNA strand into the RNAP active site. But why has this process evolved to become ATP energy dependent in the Pol II system while remaining spontaneous in archaea? Neither Pol I, nor Pol III, nor the bacterial sigma70 holo-RNAP requires energy for OC formation, which indicates that Pol II could be exceptional in this regard. Since the complexity of the Pol II transcriptome is higher than of any other RNAP system mentioned above, the energy dependence could reflect an additional layer of regulation of Pol II transcription. Support for this concept is provided by a recent report about the global regulation of OC formation in naïve lymphocytes⁴⁶, which on activation undergo a transcriptome amplification that is regulated by TFIIH.

**Figure 7: Mechanisms of the closed to open complex transition in archaea and eukaryotes.**

In conclusion, the presented data provides a structural model for the organization of the archaeal OC. Given this model, a mechanism by which DNA melting could occur without transcription factor TFIIH becomes apparent.

Methods

Recombinant protein preparation and labelling

RNAP subunits from the hyperthermophilic archaeal model system M. jannaschii were expressed in recombinant form in Escherichia coli and purified¹⁶. For the smFRET experiments, five differently labelled RNAPs were prepared. Therefore, either single cysteine residues were introduced into the RNAP at position K11 of Rpo5 or positions V49 and S65 of Rpo7 and the subunits were purified and labelled with the dye Alexa647 (ref. 17). Or an unnatural amino acid (p-Azido-L-phenylalanine) was introduced at position G257 of Rpo1′ and position Q373 of Rpo2″ and labelled with the dye DyLight650 by Staudinger ligation⁴⁷. The fluorescently labelled subunits were directly introduced into RNAP reconstitution reactions following known protocols¹⁶.

Unlabelled transcription factors TBP, TFB and TFE were expressed in recombinant form in E.coli^22,48. TFE was labelled with either the dye Cy3B attached to position G44 or the dye DyLight550 attached to position G133 by Staudinger ligation^18,47.

Preparation of fluorescently labelled TBP derivative

TBP was labelled with an Alexa647 or Alexa555 fluorophore via a cysteine–maleimide coupling strategy. To introduce a unique cysteine residue, the native cysteines at positions C48 and C67 that are buried inside the protein have been substituted by serine residues and a single cysteine residue has been introduced at position S71. The mutations have been introduced into the TBP gene using either the QuikChange II site-directed mutagenesis kit (Agilent) or the splice by overlap extension PCR strategy. Recombinant TBP-S71C was expressed from a pET21a(+) vector in BL21(DE3)/Rosetta cells and expression of TBP was induced in exponentially growing cultures with 1 mM isopropyl β-D-1-thiogalactopyranoside at an optical density of ~0.6 to 0.8 in rich medium for 4 h at 37 °C. Bacterial cells were harvested, resuspended and extracted in P300 buffer (200 mM Tris/acetate pH 7.9, 100 mM MgAc, 0.1 mM ZnSO₄, 300 mM KAc, 10% glycerine). Cells were lysed using sonification. Recombinant and heat stable MjTBP could further be pre-purified using a heat denaturation step (65 °C for 20 min). The heat stable fraction contained MjTBP and the protein was precipitated with saturating amounts of ammonium sulfate. After pelleting the precipitated protein fraction, the pellet was resuspended in 5 ml P300 with 0.05% beta-ME and further purified by size exclusion chromatography (HiPrep-Sephacryl, S100 16/60, GE Healthcare). MjTBP containing fractions were combined, subsequently further purified and β-mercaptoethanol was removed by ion exchange chromatography (MonoQ 4.6/100 PE, GE Healthcare) using a gradient from 100 to 1,000 mM potassium acetate. The labelling reaction was carried out using a 10-fold molar excess of dye at 4 °C for 16 h. Labelled protein was separated from excessive free dye using a NAP-5-column and P100 buffer (200 mM Tris/acetate pH 7.9, 100 mM MgAc, 0.1 mM ZnSO₄, 100 mM KAc, 10% glycerine) with 0.05% beta-ME.

Preparation of fluorescently labelled TFB derivative

For the production of fluorescently labelled TFB variants a nonsense-suppressor strategy was chosen, which allows the specific labelling of the protein via a unique unnatural amino acid (p-Azido-L-phenylalanine)⁴⁹. An amber mutation (TAG) was introduced at position G262 into the TFB gene using the QuikChange II site-directed mutagenesis kit. The mutated protein was expressed from a pET21a(+) plasmid, which allowed purification of the full-length protein via a C-terminal His-tag. The recombinant protein was produced in BL21/DE3 cells that additionally carried the arabinose-inducible pEvolv-pAzF plasmid encoding multiple copies of an amber-suppressor tRNA (tRNA_CUA) and an engineered tyrosyl-tRNA synthetase⁵⁰. Bacterial cultures were grown in rich medium containing 100 μg ml⁻¹ ampicillin and 25 μg ml⁻¹ chloramphenicol. p-Azido-L-phenylalanine (1 mM, Chem-Impex International Inc.) and 0.02% arabinose were added to the culture at an optical density of 0.3–0.4. TFB expression was induced with 1 mM isopropyl β-D-1-thiogalactopyranoside at an optical density of 0.5–0.6 and cells were harvested after 3 h. After harvesting the cells by centrifugation (5,000g, 15 min), the cells were resuspended in N500 buffer (200 mM Tris/acetate pH 7.9, 100 mM MgAc, 0.1 mM ZnSO₄, 500 mM NaCl, 10% glycerine) containing 0.5% Triton. Cells were lysed by sonification and the soluble protein fraction was extracted. The cell lysate was removed from cell debris and unsoluble fractions by a centrifugation step (15,000g, 30 min) and the supernatant was further purified by affinity chromatography (HisTrap FF 1 ml, GE Healthcare). Following labelling with 10-fold molar excess of either DyLight550 or DyLight650 via Staudinger ligation⁴⁷ overnight at 4 °C, the excess of free dye was removed by affinity chromatography (HisTrap FF 1 ml).

KMnO₄ footprinting

The DNA template encoding the SSV T6 promoter was prepared by annealing 5′, ³²P-labelled ntDNA strand (5′-GATTGATAGAGTAAAGTTTAAATACTTATATAGAT AGAGTATAGATAGAGGGTTCAAAAAATGGTT-3′) and unlabelled tDNA strand (5′-AACCATTTTTTGAACCCTCCGCTTATACTCTATCTATATAAGTATTTAAACTTTACTCTATCTATC-3′). For the footprinting reactions, the components were combined in 23 μl reactions containing 1 × HNME buffer (40 mM HEPES (pH 7.3), 250 mM NaCl, 2.5 mM MgCl₂, 0.02 mM EDTA, 1% glycerol and 2 mM DTT), 8.3 nM tDNA, 0.6 μM RNAP, 8.7 μM TBP, 0.5 μM TFB and 740 nM TFE. The reaction was incubated at 65 °C for 15 min followed by a 2-min incubation with 2 μl KMnO₄ (at 4, 8 or 16 mM) and stopped with 1.5 μl 14 M β-ME. Protein was digested by addition of 0.25% SDS and 1 mg ml⁻¹ proteinase K and incubation at 65 °C for 1 h. DNA was ethanol precipitated prior to treatment with 5% piperidine at 90 °C for 30 min, followed by one round of chloroform extraction and subsequent ethanol precipitation. To prepare the A+G ladder, the DNA was treated with formic acid for 5 min prior to DNA precipitation and piperidine treatment as described above. DNA was separated on 10% urea PAGE, exposed to a phosphor storage screen and visualized on a Typhoon FLA 9500 bioimager.

Archaeal OC preparation for single-molecule FRET experiments

The OCs were assembled freshly before each smFRET experiment by adding 1 μl each of nucleic acid scaffold (2 μM), TBP (10 μM), TFB (10 μM), RNAP ΔRpo4/7 (2 μM) and Rpo4/7 (10 μM) to 10 μl HNME buffer. The mixture was incubated at 60 °C for 10 min. Heparin (final concentration 0.5 mg ml⁻¹) was added to reduce non-specific binding of the RNAP to nucleic acids. Unbound transcription factors and nucleic acids were removed using Amicon Ultra centrifugal filters (Millipore) by washing two times with 450 μl HNME buffer. All smFRET experiments were done in presence of TFE (12 μM), which was added to the purified complexes and incubated for 10 min at 60 °C. The complexes were then diluted 1,000-fold in HNME buffer and loaded into the sample chamber of the TIRF microscope. For surface immobilization of the complexes, the ntDNA strand had biotin attached at the 5′-end via a C6-amino linker.

The DNA single-strands were purchased from IBA (Göttingen, Germany) and annealed as described before²⁵. The viral SSV T6 promoter DNA⁵¹ was used for all the smFRET experiments as it is known to form very stable PICs in promoter-directed transcription in vitro¹⁶. Our promoter DNA constructs consists of a 66-nucleotide long double-stranded DNA with tDNA and ntDNA strands containing a 4 nucleotide heteroduplex region around the transcription start site (−3 to +1) that stabilized the PIC by forming the OC (Fig. 1).

For the determination of the course of the ntDNA within the OC, the ntDNA strand was purchased with either Cy3B at position (+7), (−1), (−5), (−7), (−10), (−12), (−14), (−18), (−30) and (−37) or 6-TAMRA at position (−24) (Fig. 1b). For the question of the conformation of the tDNA in the OC, the tDNA strand was purchased with 6-TAMRA at position (+3) and Cy3B at position (−9).

Experimental set up for smFRET and data analysis

All smFRET experiments were performed on a custom-built prism-based total internal reflection fluorescence microscope¹⁸. Briefly, a frequency-doubled Nd:YAG laser (532 nm, Spectra-Physics) was used for the excitation of donor molecules and a diode laser (643 nm, Toptica) for the direct excitation of the acceptor molecules. Fluorescence intensity was collected through a water immersion objective (Plan Apo × 60, numerical aperture 1.2, Nikon) and directed to an EMCCD camera (iXon, Andor). OCs were immobilized onto the surface of a microfluidic chamber surface via PEGBiotin–Neutravidin–Biotin²⁵. The acquired data was analyzed using custom-written MATLAB software. We used a fully automated routine to find FRET pairs, calculating and subtracting the local background and computing the fluorescence trajectories²⁵. The correction factors were determined individually for every FRET pair. The resulting histograms were computed for every time point (frame-wise histogram). Data from at least three individual smFRET measurements were used for each pair of labelling sites. The FRET efficiencies from all molecules of all measurements were plotted in histograms. The peaks were fitted with one (or two) Gaussian function(s) to extract the mean FRET efficiencies (Supplementary Tables S1, S4 and S5). A s.d. of 2% for the FRET efficiencies was included into the calculation. These results were then used for further analysis with NPS²⁴ or global NPS²⁸, as indicated.

Determination of the probability densities using NPS

The X-ray structure of the archaeal RNAP of S. shibatae (pdb file: 2WAQ; ref. 37) was used as a reference frame for the position calculation. Moreover, the volume occupied in the crystal structure was used as a restriction for the possible positions of the dye molecules. We assumed zero probability density within an already occupied volume, which was the volume of the protein shrunk by 5 Å to account for uncertainties in the X-ray structure and equal probability density elsewhere to calculate the ADM prior.

The global NPS method²⁸, software freely available at http://www.uni-ulm.de/nawi/nawi-biophys/software.html) was then applied using the available X-ray structures, the measured FRET efficiencies and Bayesian parameter estimation. As a result, we obtained the three-dimensional probability density function for the positions (+7), (−1), (−5), (−7), (−10), (−12), (−14), (−18), (−24), (−30) and (−37) on the ntDNA strand, as well as the positions of residue S71 of TBP and residue G262 of TFB. From this we calculated the smallest volumes that enclose a certain probability, so-called credible volumes. The surface of the credible volumes was displayed by using the interactive visualization programme UCSF Chimera, which was also used for displaying all structural data⁵². All credible volumes shown in the paper are calculated at 68% probability. For more details see Methods.

Determination of isotropic Förster radii and anisotropies

For each donor–acceptor pair, the isotropic Förster radius R₀^iso was determined using standard procedures⁵³. First, the quantum yield (QY) of the donor sample was determined using Rhodamine 101 dissolved in ethanol as a standard (QY=91.5%)⁵⁴ (Supplementary Table 2).

The ntDNA positions (+7), (−1), (−5), (−7), (−10), (−12), (−14), (−18), (−24), (−30) and (−37) were labelled either with the donor dye Cy3B or 6-TAMRA, residue S71 of TBP was labelled with the dye Alexa555 and residue G262 of TFB was labelled with the dye DyLight550.

Second, overlap integrals were calculated from recorded donor emission spectra (528–700 nm with an excitation wavelength of 523 nm) and acceptor absorption spectra (400–700 nm). Together with the refractive index (n=1.35) and the orientation factor (κ²=2/3), the isotropic Förster radii R₀^iso were determined for all the different donors and Alexa647 as acceptor (Supplementary Table 2).

To account for uncertainties in the Förster distance due to orientation effects, we then measured the steady state fluorescence anisotropies of the donor and acceptor dyes for all attachment sites using a steady state fluorescence spectrometer (Edinburgh Instruments F900) (Supplementary Table 2). Both, isotropic Förster distances as well as fluorescence anisotropies were used as prior information in the global NPS analysis²⁸.

Uncertainty in the position of satellite dye molecules

Satellite dye molecules (SDMs) were attached to known positions within the archaeal polymerase using flexible linkers. While the attachment point is known from the X-ray structure of the archaeal polymerase of S. shibatae (pdb file: 2WAQ; ref. 37), the precise location of the dye molecule is not. For the NPS analysis, we therefore calculated the volume that is sterically accessible to the dye molecules, given the point of attachment, size of the dye molecule and the linker length²⁴. To this end, the SDMs were approximated by a sphere of diameter d_dye and linked to the protein complexes by flexible linkers of dimensions L_linker and d_linker (Supplementary Table 3). We assume that each SDM position within this accessible volume is equally probable (Supplementary Fig. 4).

Calculation of model-based prior volumes

For the dyes attached to the double-stranded DNA region, the coordinates of the C7 atom of the base were used as attachment point and the linker length corresponded to 12 C-atoms. In case of the single-stranded ntDNA region, the base orientations were left out of the model, since in a single strand of nucleic acids base stacking energies are small and as a result any base is relatively free to rotate about the backbone. Therefore, a 17 C-atom linker and an attachment point on the backbone C1′ atom of the DNA was used for single-stranded regions. The sequence alignment of the proteins TBP and TFB from M. jannaschii that were used in all our experiments with the corresponding proteins from P. woesei contained in the crystal structure of the TBP/TFB/DNA sub-complex used for the modelling (pdb file: 1D3U; ref. 51) resulted in the definition of the analogous residues S72 for TBP and E1223 for TFB in the model. Exemplary Figures of the comparisons can be found in Supplementary Fig. 8b–e.

Modelling

A number of structural assumptions were made to arrive at a unique model: (1) The extent of the single-stranded transcription bubble is between positions (−11) and (+1), corresponding to the permanganate footprinting results and published literature^5,32,33. (2) The tDNA strand position (+1) would be positioned at the active site for base pairing with the first NTP of the RNA transcript. (3) The DNA conforms to a B-form duplex outside of the melted region, and the downstream duplex occupies a similar position to that of the eukaryotic/bacterial EC^44,55, the OC-mimic of Pol II (ref. 30) and the archaeal RNAP-DNA binary complex³⁸. (4) The structure of archaeal TBP/TATA/TFB from P. woesei (PDB 1D3U; ref. 51) containing the C-terminal cyclin core of TFB and a bent TATA box DNA fragment was used as the template for TBP–TATA–TFB in this OC model and would not change in conformation when bound to RNAP. (5) The path of the TFB N-terminal regions within the RNAP cleft would follow the same approximate path as observed in the structure of eukaryotic Pol II in complex with TFIIB (PDB 4BBR; ref. 7). (6) The position of the N- and C-terminal domains of TFE would also be consistent with the cryo-EM density observed for the eukaryotic Pol II PIC containing TFIIE (ref. 13).

Probability densities were visualized in Coot⁵⁶ and USCF Chimera⁵². Template models assembled into the complete OC model were based on PDB entries 2WAQ, 1D3U, 4BBR, 1Q1H and 1VD4. Models were manipulated to fit the probability densities using the same programmes as for visualization, and geometry was regularized using phenix.refine⁵⁷. Model coordinates are given in a Supplementary File.

To estimate how well the model fits the NPS densities, we calculated the accessible volume priors of the dyes attached to the respective positions in the model (Methods) and compared them with our resulting probability densities. It should be noted that a perfect overlap is not expected given the nature of the model-based prior and the NPS posterior. The prior volume encompassing the complete volume of the dye molecule could be sitting, given its size and the length of the linker and the position of the anchor point defined by the model of the OC. Therefore, its size is simply a measure of the uncertainty before the measurement. The posterior volume, in contrast, represents the probability for the dye position and its size is a measure of the uncertainty after the measurement. As long as there is overlap between prior and posterior, the model is in accordance with the data.

The model and the derived accessible volumes fit the obtained credible volumes from the NPS calculation drawn at 68% credibility in all but one case (Supplementary Fig. 8b,c), the exception being the ntDNA(−14) position, where the calculated accessible volume is not overlapping with the computed NPS volume drawn at 68% credibility. The clamp coiled-coil region of the RNAP is situated exactly in between the accessible volume and the NPS credible volume of ntDNA(−14). If the NPS credible volume of ntDNA(−14) is displayed at 90% confidence level, the prior and posterior overlap (Supplementary Fig. 8d,e). One should note that TFE also binds to the clamp coiled-coil region as determined by previous cryo-EM, crosslinking and NPS studies^9,13,18. However, we did not assign a particular volume for TFE in the NPS analysis. Thus, it is quite likely that large parts of the NPS determined credible volumes for the dye attached to ntDNA(−14) are in fact excluded by TFE, preventing overlap between the model and the NPS result for ntDNA(−14).

Additional information

How to cite this article: Nagy, J. et al. Complete architecture of the archaeal RNA polymerase open complex from single-molecule FRET and NPS. Nat. Commun. 6:6171 doi: 10.1038/ncomms7161 (2015).

References

Hirata, A., Klein, B. J. & Murakami, K. S. The X-ray crystal structure of RNA polymerase from Archaea. Nature 451, 851–854 (2008).
Article ADS CAS Google Scholar
Woese, C., Kandler, O. & Wheelis, M. Towards a natural system of organisms: Proposal for the domains Archaea, Bacteria, and Eukarya. Proc. Natl Acad. Sci. USA 87, 4576–4579 (1990).
Article ADS CAS Google Scholar
Parvin, J. D. & Sharp, P. A. DNA topology and a minimal set of basal factors for transcription by RNA polymerase II. Cell 73, 533–540 (1993).
Article CAS Google Scholar
Gietl, A. et al. Eukaryotic and archaeal TBP and TFB/TF(II)B follow different promoter DNA bending pathways. Nucleic Acids Res. 42, 6219–6231 (2014).
Article CAS Google Scholar
Kostrewa, D. et al. RNA polymerase II–TFIIB structure and mechanism of transcription initiation. Nature 462, 323–330 (2009).
Article ADS CAS Google Scholar
Liu, X., Bushnell, D. A., Wang, D., Calero, G. & Kornberg, R. D. Structure of an RNA polymerase II-TFIIB complex and the transcription initiation mechanism. Science 327, 206–209 (2010).
Article ADS CAS Google Scholar
Sainsbury, S., Niesser, J. & Cramer, P. Structure and function of the initially transcribing RNA polymerase II–TFIIB complex. Nature 493, 437–440 (2012).
Article ADS Google Scholar
Chen, H. & Hahn, S. Mapping the location of TFIIB within the RNA polymerase II transcription preinitiation complex: a model for the structure of the PIC. Cell 119, 169–180 (2004).
Article CAS Google Scholar
Chen, H., Warfield, L. & Hahn, S. The positions of TFIIF and TFIIE in the RNA polymerase II transcription preinitiation complex. Nat. Struct. Mol. Biol. 14, 696–703 (2007).
Article CAS Google Scholar
Chen, Z. A. et al. Architecture of the RNA polymerase II–TFIIF complex revealed by cross-linking and mass spectrometry. EMBO J. 29, 717–726 (2010).
Article CAS Google Scholar
Grünberg, S., Warfield, L. & Hahn, S. Architecture of the RNA polymerase II preinitiation complex and mechanism of ATP-dependent promoter opening. Nat. Struct. Mol. Biol. 19, 788–796 (2012).
Article Google Scholar
Carlo, S., Lin, S., Taatjes, D. & Hoenger, A. Molecular basis of transcription initiation in Archaea. Transcription 1, 103–111 (2010).
Article Google Scholar
He, Y., Fang, J., Taatjes, D. J. & Nogales, E. Structural visualization of key steps in human transcription initiation. Nature 495, 481–486 (2013).
Article ADS CAS Google Scholar
Murakami, K. et al. Architecture of an RNA polymerase II transcription pre-initiation complex. Science 342, 12387241–12387247 (2013).
Article Google Scholar
Werner, F. & Grohmann, D. Evolution of multisubunit RNA polymerases in the three domains of life. Nat. Rev. Microbiol. 9, 85–98 (2011).
Article CAS Google Scholar
Werner, F. & Weinzierl, R. A recombinant RNA polymerase II-like enzyme capable of promoter-specific transcription. Mol. Cell 10, 635–646 (2002).
Article CAS Google Scholar
Grohmann, D., Hirtreiter, A. & Werner, F. RNAP subunits F/E (RPB4/7) are stably associated with archaeal RNA polymerase: using fluorescence anisotropy to monitor RNAP assembly. Biochem. J. 421, 339–343 (2009).
Article CAS Google Scholar
Grohmann, D. et al. The initiation factor TFE and the elongation factor Spt4/5 compete for the RNAP clamp during transcription initiation and elongation. Mol. Cell 43, 263–274 (2011).
Article CAS Google Scholar
Qureshi, S. A., Bell, S. D. & Jackson, S. P. Factor requirements for transcription in the Archaeon Sulfolobus shibatae. EMBO J. 16, 2927–2936 (1997).
Article CAS Google Scholar
Forget, D., Langelier, M., Therien, C., Trinh, V. & Coulombe, B. Photo-cross-linking of a purified preinitiation complex reveals central roles for the RNA polymerase II mobile clamp and TFIIE in initiation mechanisms. Mol. Cell Biol. 24, 1122–1131 (2004).
Article CAS Google Scholar
Naji, S., Bertero, M. G., Spitalny, P., Cramer, P. & Thomm, M. Structure-function analysis of the RNA polymerase cleft loops elucidates initial transcription, DNA unwinding and RNA displacement. Nucleic Acids Res. 36, 676–687 (2008).
Article CAS Google Scholar
Werner, F. & Weinzierl, R. Direct modulation of RNA polymerase core functions by basal transcription factors. Mol. Cell Biol. 25, 8344–8355 (2005).
Article CAS Google Scholar
Michaelis, J. & Treutlein, B. Single-molecule studies of RNA polymerases. Chem. Rev. 113, 8377–8399 (2013).
Article CAS Google Scholar
Muschielok, A. et al. A nano-positioning system for macromolecular structural analysis. Nat. Methods 5, 965–971 (2008).
Article CAS Google Scholar
Andrecka, J. et al. Single-molecule tracking of mRNA exiting from RNA polymerase II. Proc. Natl Acad. Sci. USA 105, 135–140 (2008).
Article ADS CAS Google Scholar
Andrecka, J. et al. Nano positioning system reveals the course of upstream and nontemplate DNA within the RNA polymerase II elongation complex. Nucleic Acids Res. 37, 5803–5809 (2009).
Article CAS Google Scholar
Treutlein, B. et al. Dynamic architecture of a minimal RNA polymerase II open promoter complex. Mol. Cell 46, 136–146 (2012).
Article CAS Google Scholar
Muschielok, A. & Michaelis, J. Application of the nano-positioning system to the analysis of fluorescence resonance energy transfer networks. J. Phys. Chem. B 115, 11927–11937 (2011).
Article CAS Google Scholar
Bell, S., Kosa, P., Sigler, P. & Jackson, S. Orientation if the transcription preinitiation complex in Archaea. Proc. Natl Acad. Sci. USA 96, 13662–13667 (1999).
Article ADS CAS Google Scholar
Cheung, A. C. M., Sainsbury, S. & Cramer, P. Structural basis of initial RNA polymerase II transcription. EMBO J. 30, 4755–4763 (2011).
Article CAS Google Scholar
Brueckner, F. & Cramer, P. Structural basis of transcription inhibition by α-amanitin and implications for RNA polymerase II translocation. Nat. Struct. Mol. Biol. 15, 811–818 (2008).
Article CAS Google Scholar
Giardina, C. & Lis, J. T. DNA melting on yeast RNA polymerase II promoters. Science 261, 759–762 (1993).
Article ADS CAS Google Scholar
Spitalny, P. & Thomm, M. Analysis of the open region and of DNA-protein contacts of archaeal RNA polymerase transcription complexes during transition from initiation to elongation. J. Biol. Chem. 278, 30497–30505 (2003).
Article CAS Google Scholar
Naji, S., Grünberg, S. & Thomm, M. The RPB7 orthologue E' is required for transcriptional activity of a reconstituted archaeal core enzyme at low temperatures and stimulates open complex formation. J. Biol. Chem. 282, 11047–11057 (2007).
Article CAS Google Scholar
Chakraborty, A. et al. Opening and closing of the bacterial RNA polymerase clamp. Science 337, 591–595 (2012).
Article ADS CAS Google Scholar
Fishburn, J. & Hahn, S. Architecture of the yeast RNA polymerase II open complex and regulation of activity by TFIIF. Mol. Cell Biol. 32, 12–25 (2011).
Article Google Scholar
Korkhin, Y. et al. Evolution of complex RNA polymerases: the complete archaeal RNA polymerase structure. PLoS Biol. 7, 1–10 (2009).
Article Google Scholar
Wojtas, M. N., Mogni, M., Millet, O., Bell, S. D. & Abrescia, N. G. Structural and functional analyses of the interaction of archaeal RNA polymerase with DNA. Nucleic Acids Res. 40, 9941–9952 (2012).
Article CAS Google Scholar
Zhang, Y. et al. Structural basis of transcription initiation. Science 338, 1076–1080 (2012).
Article ADS CAS Google Scholar
Grünberg, S., Reich, C., Zeller, M. E., Bartlett, M. S. & Thomm, M. Rearrangement of the RNA polymerase subunit H and the lower jaw in archaeal elongation complexes. Nucleic Acids Res. 38, 1950–1963 (2010).
Article Google Scholar
Bartlett, M., Thomm, M. & Geiduschek, P. Topography of the euryarchaeal transcription initiation complex. J. Biol. Chem. 279, 5894–5903 (2004).
Article CAS Google Scholar
Holstege, F. C., Tantin, D., Carey, M., van der Vliet, P. C. & Timmers, H. T. The requirement for the basal transcription factor IIE is determined by the helical stability of promoter DNA. EMBO J. 14, 810–819 (1995).
Article CAS Google Scholar
Holstege, F. C., van der Vliet, P. C. & Timmers, H. T. Opening of an RNA polymerase II promoter occurs in two distinct steps and requires the basal transcription factors IIE and IIH. EMBO J. 15, 1666–1677 (1996).
Article CAS Google Scholar
Kettenberger, H., Armache, K. & Cramer, P. Complete RNA polymerase II elongation complex structure and its interactions with NTP and TFIIS. Mol. Cell 16, 955–965 (2004).
Article CAS Google Scholar
Goede, B., Naji, S., Kampen, O., Ilg, K. & Thomm, M. Protein-protein interactions in the archaeal transcriptional machinery: binding studies of isolated RNA polymerase subunits and transcription factors. J. Biol. Chem. 281, 30581–30592 (2006).
Article CAS Google Scholar
Kouzine, F. et al. Global regulation of promoter melting in naive lymphocytes. Cell 153, 988–999 (2013).
Article CAS Google Scholar
Chakraborty, A., Wang, D., Ebright, Y. W. & Ebright, R. H. Chapter 2—azide-specific labeling of biomolecules by Staudinger-Bertozzi Ligation: phosphine derivatives of fluorescent probes suitable for single-molecule fluorescence spectroscopy. Methods Enzymol. 472, 19–30 (2010).
Article CAS Google Scholar
Hirtreiter, A. et al. Spt4/5 stimulates transcription elongation through the RNA polymerase clamp coiled-coil motif. Nucleic Acids Res. 38, 4040–4051 (2010).
Article CAS Google Scholar
Chin, J. W. et al. Addition of p-azido-L-phenylalanine to the genetic code of Escherichia coli. J. Am. Chem. Soc. 124, 9026–9027 (2002).
Article CAS Google Scholar
Young, T. S., Ahmad, I., Yin, J. A. & Schultz, P. G. An enhanced system for unnatural amino acid mutagenesis in E. coli. J. Mol. Biol. 395, 361–374 (2010).
Article CAS Google Scholar
Littlefield, O., Korkhin, Y. & Sigler, P. The structural basis for the oriented assembly of a TBP/TFB/promoter complex. Proc. Natl Acad. Sci. USA 96, 13668–13673 (1999).
Article ADS CAS Google Scholar
Pettersen, E. F. et al. UCSF Chimera—A visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS Google Scholar
Vámosi, G., Gohlke, C. & Clegg, R. M. Fluorescence characteristics of 5-carboxytetramethylrhodamine linked covalently to the 5' end of oligonucleotides: multiple conformers of single-stranded and double-stranded dye-DNA complexes. Biophys. J. 71, 972–994 (1996).
Article ADS Google Scholar
Würth, C., Grabolle, M., Pauli, J., Spieles, M. & Resch-Genger, U. Relative and absolute determination of fluorescence quantum yields of transparent samples. Nat. Protoc. 8, 1535–1550 (2013).
Article Google Scholar
Vassylyev, D. G. et al. Structural basis for substrate loading in bacterial RNA polymerase. Nature 448, 163–168 (2007).
Article ADS CAS Google Scholar
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Biol. Crystallogr. 66, 486–501 (2010).
Article CAS Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010).
Article CAS Google Scholar

Download references

Acknowledgements

We thank Peter Schultz for plasmids. J.M. was supported by the European Union through the ERC starting grant remodelling. D.G. acknowledges financial support from the German Israel Foundation (Young Scientist Program 2292-2264.13/2011).

Author information

Authors and Affiliations

Biophysics Institute, Ulm University, Albert-Einstein-Allee 11, Ulm, 89069, Germany
Julia Nagy & Jens Michaelis
Institut für Physikalische und Theoretische Chemie—NanoBioSciences, Technische Universität Braunschweig, Hans-Sommer-Straße 10, Braunschweig, 38106, Germany
Dina Grohmann & Sarah Schulz
Division of Biosciences, Institute for Structural and Molecular Biology, University College London, Gower Street, London, WC1E 6BT, UK
Alan C.M. Cheung, Katherine Smollett & Finn Werner

Authors

Julia Nagy
View author publications
You can also search for this author in PubMed Google Scholar
Dina Grohmann
View author publications
You can also search for this author in PubMed Google Scholar
Alan C.M. Cheung
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Schulz
View author publications
You can also search for this author in PubMed Google Scholar
Katherine Smollett
View author publications
You can also search for this author in PubMed Google Scholar
Finn Werner
View author publications
You can also search for this author in PubMed Google Scholar
Jens Michaelis
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.M. and F.W. designed the experiments. D.G. and S.S. expressed, purified and labelled all the proteins. J.N. performed all the smFRET experiments, data analysis and NPS calculation. K.S. did the footprinting experiments. A.C.M.C. built the model and prepared the movie. J.N., D.G., A.C.M.C., F.W. and J.M. wrote the manuscript.

Corresponding author

Correspondence to Jens Michaelis.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Figures, Supplementary Tables, Supplementary Methods and Supplementary References.

Supplementary Figures 1-8, Supplementary Tables 1-5, Supplementary Methods and Supplementary References. (PDF 1366 kb)

Supplementary Data 1

Coordinates of the structural model of the archaeal open complex (TXT 4644 kb)

Supplementary Movie 1

Comparison of the archaeal open complex model to the eukaryotic closed complex (MOV 24078 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nagy, J., Grohmann, D., Cheung, A. et al. Complete architecture of the archaeal RNA polymerase open complex from single-molecule FRET and NPS. Nat Commun 6, 6161 (2015). https://doi.org/10.1038/ncomms7161

Download citation

Received: 18 August 2014
Accepted: 21 December 2014
Published: 30 January 2015
DOI: https://doi.org/10.1038/ncomms7161

This article is cited by

Promoter-proximal elongation regulates transcription in archaea
- Fabian Blombach
- Thomas Fouqueau
- Finn Werner
Nature Communications (2021)
FttA is a CPSF73 homologue that terminates transcription in Archaea
- Travis J. Sanders
- Breanna R. Wenck
- Thomas J. Santangelo
Nature Microbiology (2020)
The smfBox is an open-source platform for single-molecule FRET
- Benjamin Ambrose
- James M. Baxter
- Timothy D. Craggs
Nature Communications (2020)
Prokaryotic sigma factors and their transcriptional counterparts in Archaea and Eukarya
- Ana G. Abril
- Jose Luis R. Rama
- Tomás G. Villa
Applied Microbiology and Biotechnology (2020)
A global analysis of transcription reveals two modes of Spt4/5 recruitment to archaeal RNA polymerase
- Katherine Smollett
- Fabian Blombach
- Finn Werner
Nature Microbiology (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.