Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Transcription initiation complex structures elucidate DNA opening

Abstract

Transcription of eukaryotic protein-coding genes begins with assembly of the RNA polymerase (Pol) II initiation complex and promoter DNA opening. Here we report cryo-electron microscopy (cryo-EM) structures of yeast initiation complexes containing closed and open DNA at resolutions of 8.8 Å and 3.6 Å, respectively. DNA is positioned and retained over the Pol II cleft by a network of interactions between the TATA-box-binding protein TBP and transcription factors TFIIA, TFIIB, TFIIE, and TFIIF. DNA opening occurs around the tip of the Pol II clamp and the TFIIE ‘extended winged helix’ domain, and can occur in the absence of TFIIH. Loading of the DNA template strand into the active centre may be facilitated by movements of obstructing protein elements triggered by allosteric binding of the TFIIE ‘E-ribbon’ domain. The results suggest a unified model for transcription initiation with a key event, the trapping of open promoter DNA by extended protein–protein and protein–DNA contacts.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Figure 1: Open complex structure at 3.6 Å resolution.
Figure 2: Basal factors position and retain DNA.
Figure 3: TFIIE architecture and interactions.
Figure 4: Closed complex cryo-EM structure.
Figure 5: Cleft clearance and DNA template loading.
Figure 6: Model for DNA opening during transcription initiation.

Similar content being viewed by others

Accession codes

Primary accessions

Electron Microscopy Data Bank

Protein Data Bank

Data deposits

Three-dimensional cryo-EM density maps of OC1, OC2, OC2-focused, OC3, OC3-focused, OC4, OC4-focused, OC5, and CC have been deposited in the Electron Microscopy Data Bank under the accession numbers EMD-3375, EMD-3376, EMD-3377, EMD-3378, EMD-3379, EMD-3380, EMD-3381, EMD-3382, and EMD-3383, respectively. Coordinate files of the OC and CC have been deposited in the Protein Data Bank under accession numbers 5FYW and 5FZ5. Coordinates and structure factors of the Pol II-Tfg1 peptide and the Pol II-TFIIF crystals have been deposited at the Protein Data Bank under the accession numbers 5IP7 and 5IP9.

References

  1. Buratowski, S., Hahn, S., Guarente, L. & Sharp, P. A. Five intermediate complexes in transcription initiation by RNA polymerase II. Cell 56, 549–561 (1989)

    Article  CAS  PubMed  Google Scholar 

  2. Roeder, R. G. The role of general initiation factors in transcription by RNA polymerase II. Trends Biochem. Sci. 21, 327–335 (1996)

    Article  CAS  PubMed  Google Scholar 

  3. Grünberg, S. & Hahn, S. Structural insights into transcription initiation by RNA polymerase II. Trends Biochem. Sci. 38, 603–611 (2013)

    Article  CAS  PubMed  Google Scholar 

  4. Sainsbury, S., Bernecky, C. & Cramer, P. Structural basis of transcription initiation by RNA polymerase II. Nat. Rev. Mol. Cell Biol. 16, 129–143 (2015)

    Article  CAS  PubMed  Google Scholar 

  5. Giardina, C. & Lis, J. T. DNA melting on yeast RNA polymerase II promoters. Science 261, 759–762 (1993)

    Article  ADS  CAS  PubMed  Google Scholar 

  6. Ohkuma, Y. & Roeder, R. G. Regulation of TFIIH ATPase and kinase activities by TFIIE during active initiation complex formation. Nature 368, 160–163 (1994)

    Article  ADS  CAS  PubMed  Google Scholar 

  7. Maxon, M. E., Goodrich, J. A. & Tjian, R. Transcription factor IIE binds preferentially to RNA polymerase IIa and recruits TFIIH: a model for promoter clearance. Genes Dev. 8, 515–524 (1994)

    Article  CAS  PubMed  Google Scholar 

  8. Holstege, F. C., van der Vliet, P. C. & Timmers, H. T. Opening of an RNA polymerase II promoter occurs in two distinct steps and requires the basal transcription factors IIE and IIH. EMBO J. 15, 1666–1677 (1996)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Parvin, J. D., Timmers, H. T. & Sharp, P. A. Promoter specificity of basal transcription factors. Cell 68, 1135–1144 (1992)

    Article  CAS  PubMed  Google Scholar 

  10. Pan, G. & Greenblatt, J. Initiation of transcription by RNA polymerase II is limited by melting of the promoter DNA in the region immediately upstream of the initiation site. J. Biol. Chem. 269, 30101–30104 (1994)

    CAS  PubMed  Google Scholar 

  11. Buratowski, S., Sopta, M., Greenblatt, J. & Sharp, P. A. RNA polymerase II-associated proteins are required for a DNA conformation change in the transcription initiation complex. Proc. Natl Acad. Sci. USA 88, 7509–7513 (1991)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  12. Holstege, F. C., Tantin, D., Carey, M., van der Vliet, P. C. & Timmers, H. T. The requirement for the basal transcription factor IIE is determined by the helical stability of promoter DNA. EMBO J. 14, 810–819 (1995)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Kostrewa, D. et al. RNA polymerase II-TFIIB structure and mechanism of transcription initiation. Nature 462, 323–330 (2009)

    Article  ADS  CAS  PubMed  Google Scholar 

  14. Liu, X., Bushnell, D. A., Wang, D., Calero, G. & Kornberg, R. D. Structure of an RNA polymerase II-TFIIB complex and the transcription initiation mechanism. Science 327, 206–209 (2010)

    Article  ADS  CAS  PubMed  Google Scholar 

  15. Sainsbury, S., Niesser, J. & Cramer, P. Structure and function of the initially transcribing RNA polymerase II–TFIIB complex. Nature 493, 437–440 (2013)

    Article  ADS  CAS  PubMed  Google Scholar 

  16. Grünberg, S., Warfield, L. & Hahn, S. Architecture of the RNA polymerase II preinitiation complex and mechanism of ATP-dependent promoter opening. Nat. Struct. Mol. Biol. 19, 788–796 (2012)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Murakami, K. et al. Architecture of an RNA polymerase II transcription pre-initiation complex. Science 342, 1238724 (2013)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Mühlbacher, W. et al. Conserved architecture of the core RNA polymerase II initiation complex. Nat. Commun. 5, 4310 (2014)

    Article  ADS  CAS  PubMed  Google Scholar 

  19. Plaschka, C. et al. Architecture of the RNA polymerase II-Mediator core initiation complex. Nature 518, 376–380 (2015)

    Article  ADS  CAS  PubMed  Google Scholar 

  20. He, Y., Fang, J., Taatjes, D. J. & Nogales, E. Structural visualization of key steps in human transcription initiation. Nature 495, 481–486 (2013)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  21. Murakami, K. et al. Structure of an RNA polymerase II preinitiation complex. Proc. Natl Acad. of Sci. USA 112, 13543–13548 (2015)

    Article  ADS  CAS  Google Scholar 

  22. Nikolov, D. B. et al. Crystal structure of a TFIIB–TBP–TATA-element ternary complex. Nature 377, 119–128 (1995)

    Article  ADS  CAS  PubMed  Google Scholar 

  23. Geiger, J. H., Hahn, S., Lee, S. & Sigler, P. B. Crystal structure of the yeast TFIIA/TBP/DNA complex. Science 272, 830–836 (1996)

    Article  ADS  CAS  PubMed  Google Scholar 

  24. Tan, S., Hunziker, Y., Sargent, D. F. & Richmond, T. J. Crystal structure of a yeast TFIIA/TBP/DNA complex. Nature 381, 127–134 (1996)

    Article  ADS  CAS  PubMed  Google Scholar 

  25. Eichner, J., Chen, H.-T. T., Warfield, L. & Hahn, S. Position of the general transcription factor TFIIF within the RNA polymerase II transcription preinitiation complex. EMBO J. 29, 706–716 (2010)

    Article  CAS  PubMed  Google Scholar 

  26. Deng, W. & Roberts, S. G. E. TFIIB and the regulation of transcription by RNA polymerase II. Chromosoma 116, 417–429 (2007)

    Article  CAS  PubMed  Google Scholar 

  27. Fishburn, J. & Hahn, S. Architecture of the yeast RNA polymerase II open complex and regulation of activity by TFIIF. Mol. Cell. Biol. 32, 12–25 (2012)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  28. Čabart, P., Újvári, A., Pal, M. & Luse, D. S. Transcription factor TFIIF is not required for initiation by RNA polymerase II, but it is essential to stabilize transcription factor TFIIB in early elongation complexes. Proc. Natl Acad. Sci. USA 108, 15786–15791 (2011)

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  29. Robert, F., Forget, D., Li, J., Greenblatt, J. & Coulombe, B. Localization of subunits of transcription factors IIE and IIF immediately upstream of the transcriptional initiation site of the adenovirus major late promoter. J. Biol. Chem. 271, 8517–8520 (1996)

    Article  CAS  PubMed  Google Scholar 

  30. Forget, D., Langelier, M.-F. F., Thérien, C., Trinh, V. & Coulombe, B. Photo-cross-linking of a purified preinitiation complex reveals central roles for the RNA polymerase II mobile clamp and TFIIE in initiation mechanisms. Mol. Cell. Biol. 24, 1122–1131 (2004)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Chen, H.-T. T., Warfield, L. & Hahn, S. The positions of TFIIF and TFIIE in the RNA polymerase II transcription preinitiation complex. Nat. Struct. Mol. Biol. 14, 696–703 (2007)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Grohmann, D. et al. The initiation factor TFE and the elongation factor Spt4/5 compete for the RNAP clamp during transcription initiation and elongation. Mol. Cell 43, 263–274 (2011)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Meinhart, A., Blobel, J. & Cramer, P. An extended winged helix domain in general transcription factor E/IIE alpha. J. Biol. Chem. 278, 48267–48274 (2003)

    Article  CAS  PubMed  Google Scholar 

  34. Okuda, M. et al. A novel zinc finger structure in the large subunit of human general transcription factor TFIIE. J. Biol. Chem. 279, 51395–51403 (2004)

    Article  CAS  PubMed  Google Scholar 

  35. Okuda, M. et al. Structural insight into the TFIIE-TFIIH interaction: TFIIE and p53 share the binding region on TFIIH. EMBO J. 27, 1161–1171 (2008)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Okuda, M. et al. Structure of the central core domain of TFIIEβ with a novel double-stranded DNA-binding surface. EMBO J. 19, 1346–1356 (2000)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Okamoto, T. et al. Analysis of the role of TFIIE in transcriptional regulation through structure-function studies of the TFIIEβ subunit. J. Biol. Chem. 273, 19866–19876 (1998)

    Article  CAS  PubMed  Google Scholar 

  38. Blombach, F. et al. Archaeal TFEα/β is a hybrid of TFIIE and the RNA polymerase III subcomplex hRPC62/39. eLife 4, e08378 (2015)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Martinez-Rucobo, F. W., Sainsbury, S., Cheung, A. C. M. & Cramer, P. Architecture of the RNA polymerase–Spt4/5 complex and basis of universal transcription processivity. EMBO J. 30, 1302–1310 (2011)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Kim, T. K., Ebright, R. H. & Reinberg, D. Mechanism of ATP-dependent promoter melting by transcription factor IIH. Science 288, 1418–1421 (2000)

    Article  ADS  CAS  PubMed  Google Scholar 

  41. Miller, G. & Hahn, S. A DNA-tethered cleavage probe reveals the path for promoter DNA in the yeast preinitiation complex. Nat. Struct. Mol. Biol. 13, 603–610 (2006)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Vannini, A. & Cramer, P. Conservation between the RNA polymerase I, II, and III transcription initiation machineries. Mol. Cell 45, 439–446 (2012)

    Article  CAS  PubMed  Google Scholar 

  43. Chakraborty, A. et al. Opening and closing of the bacterial RNA polymerase clamp. Science 337, 591–595 (2012)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  44. Feklistov, A. & Darst, S. A. Structural basis for promoter-10 element recognition by the bacterial RNA polymerase σ subunit. Cell 147, 1257–1269 (2011)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Zhang, Y. et al. Structural basis of transcription initiation. Science 338, 1076–1080 (2012)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  46. Zuo, Y. & Steitz, T. A. Crystal structures of the E. coli transcription initiation complexes with a complete bubble. Mol. Cell 58, 534–540 (2015)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. Bae, B., Feklistov, A., Lass-Napiorkowska, A., Landick, R. & Darst, S. A. Structure of a bacterial RNA polymerase holoenzyme open promoter complex. eLife 4, (2015)

  48. Yang, Y. et al. Structures of the RNA polymerase-σ54 reveal new and conserved regulatory strategies. Science 349, 882–885 (2015)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  49. Kouzine, F. et al. Global regulation of promoter melting in naive lymphocytes. Cell 153, 988–999 (2013)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Cramer, P., Bushnell, D. A. & Kornberg, R. D. Structural basis of transcription: RNA polymerase II at 2.8 angstrom resolution. Science 292, 1863–1876 (2001)

    Article  ADS  CAS  PubMed  Google Scholar 

  51. Baumli, S., Hoeppner, S. & Cramer, P. A conserved mediator hinge revealed in the structure of the MED7.MED21 (Med7.Srb7) heterodimer. J. Biol. Chem. 280, 18171–18178 (2005)

    Article  CAS  PubMed  Google Scholar 

  52. Seizl, M. et al. A conserved GA element in TATA-less RNA polymerase II promoters. PLoS One 6, e27595 (2011)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  53. Sydow, J. F. et al. Structural basis of transcription: mismatch-specific fidelity mechanisms and paused RNA polymerase II with frayed RNA. Mol. Cell 34, 710–721 (2009)

    Article  CAS  PubMed  Google Scholar 

  54. Korinek, A., Beck, F., Baumeister, W., Nickell, S. & Plitzko, J. M. Computer controlled cryo-electron microscopy – TOM2 a software package for high-throughput applications. J. Struct. Biol. 175, 394–405 (2011)

    Article  PubMed  Google Scholar 

  55. Li, X., Zheng, S. Q., Egami, K., Agard, D. A. & Cheng, Y. Influence of electron dose rate on electron counting images recorded with the K2 camera. J. Struct. Biol. 184, 251–260 (2013)

    Article  CAS  PubMed  Google Scholar 

  56. Tang, G. et al. EMAN2: an extensible image processing suite for electron microscopy. J. Struct. Biol. 157, 38–46 (2007)

    Article  CAS  PubMed  Google Scholar 

  57. Rohou, A. & Grigorieff, N. CTFFIND4: Fast and accurate defocus estimation from electron micrographs. J. Struct. Biol. 192, 216–221 (2015)

    Article  PubMed  PubMed Central  Google Scholar 

  58. Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Chen, S. et al. High-resolution noise substitution to measure overfitting and validate resolution in 3D structure determination by single particle electron cryomicroscopy. Ultramicroscopy 135, 24–35 (2013)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Scheres, S. H. Semi-automated selection of cryo-EM particles in RELION-1.3. J. Struct. Biol. 189, 114–122 (2015)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  61. Pettersen, E. F. et al. UCSF Chimera--a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004)

    Article  CAS  PubMed  Google Scholar 

  62. Nguyen, T. H. et al. The architecture of the spliceosomal U4/U6.U5 tri-snRNP. Nature 523, 47–52 (2015)

    Article  ADS  CAS  PubMed  PubMed Central  Google Scholar 

  63. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004)

    Article  CAS  PubMed  Google Scholar 

  64. Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 213–221 (2010)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  65. Eswar, N. et al. Comparative protein structure modeling using Modeller. Current Protoc. Bioinformatics http://dx.doi.org/10.1002/0471250953.bi0506s15 (2006)

  66. Kim, Y., Geiger, J. H., Hahn, S. & Sigler, P. B. Crystal structure of a yeast TBP/TATA-box complex. Nature 365, 512–520 (1993)

    Article  ADS  CAS  PubMed  Google Scholar 

  67. Yang, J. et al. The I-TASSER Suite: protein structure and function prediction. Nat. Methods 12, 7–8 (2015)

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Kilpatrick, A. M., Koharudin, L. M., Calero, G. A. & Gronenborn, A. M. Structural and binding studies of the C-terminal domains of yeast TFIIF subunits Tfg1 and Tfg2. Proteins 80, 519–529 (2012)

    Article  CAS  PubMed  Google Scholar 

  69. Shen, Y. et al. Consistent blind protein structure generation from NMR chemical shift data. Proc. Natl Acad. Sci. USA 105, 4685–4690 (2008)

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  70. Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D 66, 12–21 (2010)

    Article  CAS  PubMed  Google Scholar 

  71. Wriggers, W. Using Situs for the integration of multi-resolution structures. Biophys. Rev. 2, 21–27 (2010)

    Article  PubMed  PubMed Central  Google Scholar 

  72. Kinkelin, K. et al. Structures of RNA polymerase II complexes with Bye1, a chromatin-binding PHF3/DIDO homologue. Proc. Natl Acad. Sci. USA 110, 15277–15282 (2013)

    Article  ADS  PubMed  PubMed Central  Google Scholar 

  73. Kabsch, W. XDS. Acta Crystallogr. D 66, 125–132 (2010)

    Article  CAS  PubMed  Google Scholar 

  74. Blanc, E. et al. Refinement of severely incomplete structures with maximum likelihood in BUSTER-TNT. Acta Crystallogr. D 60, 2210–2221 (2004)

    Article  CAS  PubMed  Google Scholar 

  75. Gaiser, F., Tan, S. & Richmond, T. J. Novel dimerization fold of RAP30/RAP74 in human TFIIF at 1.7 A resolution. J. Mol. Biol. 302, 1119–1127 (2000)

    Article  CAS  PubMed  Google Scholar 

  76. Ha, I. et al. Multiple functional domains of human transcription factor IIB: distinct interactions with two general transcription factors and RNA polymerase II. Genes Dev. 7, 1021–1032 (1993)

    Article  CAS  PubMed  Google Scholar 

  77. Harami, G. M., Gyimesi, M. & Kovács, M. From keys to bulldozers: expanding roles for winged helix domains in nucleic-acid-binding proteins. Trends Biochem. Sci. 38, 364–371 (2013)

    Article  CAS  PubMed  Google Scholar 

  78. Esnault, C. et al. Mediator-dependent recruitment of TFIIH modules in preinitiation complex. Mol. Cell 31, 337–346 (2008)

    Article  CAS  PubMed  Google Scholar 

  79. Čabart, P. & Luse, D. S. Inactivated RNA polymerase II open complexes can be reactivated with TFIIE. J. Biol. Chem. 287, 961–967 (2012)

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We thank C. Bernecky, W. Mühlbacher, S. Neyer, S. Sainsbury, and D. Tegunov for help and discussions; L. Larivière and L. Wenzeck for cloning and initial purification of TFIIE; W. Mühlbacher for initial cloning of TFIIA; S. Bilakovic for the modified pET-DUET-1 vector; J. Mahamid for help with data collection for the CC; K. Maier for help with yeast growth assays; M. Raabe and H. Urlaub for protein identification; K. Kinkelin for initial Pol II–TFIIF co-crystallization; and S. Hahn for providing the TFA1 yeast strain and the shuffle plasmid pSH810. C.P. (SFB860), M.H. (GRK1721), and P.C. were supported by the Deutsche Forschungsgemeinschaft, the Advanced Grant TRANSIT of the European Research Council, and the Volkswagen Foundation.

Author information

Authors and Affiliations

Authors

Contributions

C.P. designed and carried out high-resolution cryo-EM structure determinations of OC1–OC4. M.H. designed and carried out Pol II-TFIIF crystallographic analysis, and cryo-EM structure determinations of OC5 and CC. C.P. and M.H. designed and carried out functional assays. C.D. cloned and purified full-length TBP and TFIIA. C.D. and C.B. assisted with protein purification. J.P. supervised electron microscopy data collection. P.C. designed and supervised research. C.P., M.H., and P.C. prepared the manuscript.

Corresponding author

Correspondence to P. Cramer.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Extended data figures and tables

Extended Data Figure 1 Modelling of open complex cryo-EM densities.

a, SDS–PAGE analysis of OC–cMed–Med1 complex after size-exclusion chromatography. Protein colours as in Fig. 1. Although core Mediator was required for stable association of TFIIE, it largely dissociated under cryo-EM conditions as observed previously19. Some remaining core Mediator was flexible and located as described previously19, but could not be included in further high-resolution analysis. b, Cryo-EM micrograph of the OC–cMed–Med1 complex. Scale bar, 50 nm. c, Ten representative reference-free 2D class averages of OC–cMed–Med1 reveal flexibility of the upstream DNA assembly including TFIIE (green arrow) and very weak density for core Mediator (orange arrow). Compare Extended Data Fig. 7a, c. d, Composite cryo-EM density of the OC shown in front and top views50. Colours indicate the cryo-EM densities used for modelling of the open complex (OC1, grey; OC2, green; OC2-focused, yellow; OC3, salmon; OC3-focused, blue; OC4 purple; OC4 round 2 class 2, light blue). Shown are the unsharpened cryo-EM densities. The percentage of particles from the full set of 257,259 that was used for the respective reconstruction is indicated. e, Composite cryo-EM density of the OC superimposed on a ribbon model of the OC, coloured as in Fig. 1. The composite cryo-EM density enabled modelling of the initiation factors and DNA. Our structure also enabled correction of the revised yeast initiation complex model obtained by Murakami et al. from cryo-EM at 6 Å resolution21, and we note the following differences between the structures, superimposed on Rpb1: (1) The TFIIF Tfg2 WH domain is rotated by ~180°, which is further inconsistent with nuclear magnetic resonance (NMR) data on the TFg2 WH–DNA interface68 and fits comparatively worse to protein–protein crosslinking data between the Tfg2 WH and Tfa2 WH1 (ref. 17). (2) Domains of TFIIE, except Tfa2 WH1, were placed incorrectly: Tfa1 eWH (rotation and translation into the E-linker density; 17 Å distance for helix α3 in our CC), Tfa1 E-ribbon (rotation and translation into E-linker density; 35 Å distance between the Zn atoms), and Tfa2 WH2 (~180° rotation). Further, the Tfa2 E-tether region was incorrectly assigned to density belonging to the Tfa1 eWH. The Tfa1 E-linker was not modelled. (3) The TFIIF Tfg1 arm was modelled into an empty space lacking density, and the Tfg1 helix α0 was absent. Our models of the TFIIF dimerization domain, Tfg2 linker, Tfg1 N terminus, and Tfg1 arm fit into densities from a recent study21, indicating the electron microscopic reconstruction is correct, but that the modelling was premature at the available resolution. f, Ribbon model of the OC coloured according to how different parts of the OC were modelled into the OC cryo-EM densities (see d). Regions with atomic (light blue) and backbone models (orange), and DNA (dark blue) are indicated. Views as in d. g, Representative regions of the sharpened cryo-EM densities OC1 (3.6 Å), OC2 (4.0 Å), and OC4 (3.9 Å) are shown with the underlying refined coordinate model. The OC1 density shows clear side-chain features for Rpb1 clamp helices α8 and α9 and Rpb2 β33, the OC4 density for Tfg1 β2 that is part of the dimerization domain, and the OC2 density for part of the Tfg2 linker. For OC nomenclature, see Extended Data Fig. 7. h, Fit of the TBP crystal structure (PDB: 1YTB)66 to the OC2 cryo-EM density, shown in a Pol II side view50. i, Fit of TFIIB N- and C-terminal cyclin domains, B-linker and B-reader, and B-ribbon elements to OC1 and OC2 cryo-EM densities. The B-linker element displays weak density, and the B-reader is not observed. j, Fit of the TFIIA crystal structure (PDB: 1YTF)23 to OC2-focused cryo-EM density in a Pol II top view50 (left). The four-helix bundle undergoes a minor rotation towards the β-barrel, and is apparently flexible (compare Extended Data Fig. 5e). Toa1 (middle) and Toa2 (right) subunit structures are shown. A large non-conserved insertion in Toa1 (Δ95–209), lacking in recombinant TFIIA (Methods), may affect the relative positioning of the four-helix bundle to the β-barrel. k, Fit of the TFIIF model to OC cryo-EM densities viewed from the top50. TFIIF dimerization domain and Tfg1 N-terminal region, arm, and charged helix elements are superimposed on the OC4 cryo-EM density. Tfg2 linker and WH domains are superimposed on OC2 and OC3-focused cryo-EM densities, respectively. Subunit architectures for Tfg1 (middle) and Tfg2 (right) subunits are shown, indicating disordered regions. Secondary structure elements were labelled according to the crystallographic model of the human RAP30–RAP74 heterodimer75. l, Fit of the TFIIE model to OC cryo-EM densities shown from the front50 (left). Models for Tfa1 eWH, E-linker and E-ribbon are superimposed onto OC1 and OC3 densities. Models for Tfa2 WH1 domain, Tfa2 WH2 and E-tether were fitted into OC3-focused and OC3 densities. Tfa1 (middle) and Tfa2 (right) subunits are shown, indicating disordered regions. The connectivity of the E-tether helices remains uncertain. m, Fit of promoter DNA to OC cryo-EM densities is shown in a side view50. A weak density for single-stranded template DNA contacts the Pol II fork loop 1, and is indicated by a blue arrow. Upstream and downstream DNA models are superimposed with OC3 and OC1 densities, respectively. The location of the Pol II active site magnesium ion is indicated.

Extended Data Figure 2 Details of TFIIF and the upstream DNA assembly.

a, View of the open complex from the side50. Pol II elements external 2 (dark green), lobe (yellow), protrusion (orange), Pol II subunit Rpb12 (dark blue) and basal factors TBP, TFIIB, and TFIIF are coloured as in Fig. 1. The remainder of the open complex is transparent. Green and purple boxes indicate the locations of TFIIB C-terminal cyclin and TFIIF dimerization domains, respectively. b, Interactions of TFIIB C-terminal cyclin domain with Pol II protrusion, Rpb12, Tfg2 linker and DNA. Colours as in a. c, Details of TFIIF dimerization domain interactions with Pol II external 2 and lobe50. d, Crystallographic analysis of the yeast-specific Tfg1 N-terminal region. Weak density for the Tfg1 N-terminal region was observed by cryo-EM (OC4 round 2 class 2) at low contour level (0.0155) close to Pol II elements external 1 and the hybrid binding region50 (left). X-ray analysis (right) of the corresponding peptide (Tfg1 F21–R35) enabled modelling and assignment of residue M27 (indicated with asterisk) owing to the anomalous signal. The Fo − Fc electron density map (grey, contour level 2.5σ), seleno-methionine anomalous difference Fourier (yellow, contour level 5σ), and final model in ribbon presentation (purple) are shown. The sequence of the synthetic peptide used for soaking into Pol II crystals is shown below. The modified methionine residue and predicted secondary structure are indicated. e, The Fo − Fc electron density maps obtained from soaking Pol II crystals with TFIIF (purple) and seleno-methionine labelled peptide (grey), respectively, show similar density in the same location on Pol II. f, The putative Tfg2 C terminus contacts TBP. Viewed from the side50. A tubular cryo-EM density from the OC3 map, low-pass filtered to 8 Å, emanates from the TFIIF Tfg2 WH–TFIIE Tfa2 WH1 density, and was tentatively assigned to the Tfg2 C-terminal region. The putative Tfg2 density reaches the TBP subunit, consistent with their suggested interaction29,76.

Extended Data Figure 3 Structure–function analysis of TFIIE and its interactions in the open complex.

a, The architectural model of TFIIE contains all regions required for viability in yeast16. A domain schematic (top) indicates the good overlap between modelled (dashed line) and functionally essential regions. Essential (grey), partially redundant Tfa2 WH1 and WH2 domains (blue), and non-essential elements (cyan) are indicated on the TFIIE model, shown in previously defined front and top views50 of Pol II. b, TFIIE sequence conservation. The sequence conservation among Saccharomyces cerevisiae, Schizosaccharomyces pombe, Drosophila melanogaster, Gallus gallus, and Homo sapiens was mapped onto a ribbon representation of the TFIIE model. Highly, strongly, weakly and non-conserved residues are coloured in green, yellow, white, and grey, respectively. The location of a non-modelled helical density in the OC3 cryo-EM map, which may correspond to Tfa1 helix α7, is indicated. Views as in a. c, An additional density (green) in the OC3 cryo-EM map on top of the Tfa1 E-wing was tentatively assigned to Tfa1 helix α7 and this may stabilize the long β-hairpin. A front view is shown50. d, Tfa1–FeBABE cleavage sites in TFIIE16 are consistent with the TFIIE architecture. e, Tfa1– and Tfa2–FeBABE cleavage sites in the Pol II clamp16 and a protein–protein crosslink between Rpb1 K212 (Pol II clamp)–Tfa2 K277 (TFIIE E-tether)17 are consistent with the location of eWH and E-tether. f, Tfa2–Tfg2 protein–protein crosslinks17 are consistent with the Tfg2WH–Tfa2 WH1 architecture. g, The TFIIE mutations used for functional characterization were mapped onto a domain schematic and the model of TFIIE, shown in a front view50. h, Pulldown assays with recombinant TFIIE variants carrying mutations at the TFIIE–CC interface revealed that the E-ribbon is essential for TFIIE recruitment. For details of the TFIIE mutants, see g. Pulldowns were analysed by SDS–PAGE (Coomassie staining). To confirm the integrity of the purified TFIIE variants, 2 μg were analysed (left). Some minor contaminant and degradation bands of TFIIE are indicated by an asterisk. The bead elution from the pulldown assay is shown (middle), providing negative (no TFIIE) and positive (TFIIE) controls in the two leftmost lanes. The binding of all TFIIE variants to the CC was impaired compared to the wild-type protein, with the exception of the Tfa1(ΔE-wing) mutant, suggesting that all other interfaces contribute to TFIIE binding affinity. The most severe binding defect was observed upon mutation of three residues in the E-ribbon (Tfa1(L134/V137/L140)) to glutamate or alanine. This suggests that the E-ribbon is largely responsible for recruitment of TFIIE to the CC. The bead-only control (right) indicated that TFIIE and TFIIE variants did not show unspecific binding to the beads. i, Western blot analysis of the 3×Flag-tagged Tfa1 confirms specific immune-depletion of Tfa1 in the nuclear extract (NE), whereas levels of Pol II (Rpb3), TFIIB, and Histone H3 were unaffected. j, Yeast complementation assays were performed in triplicate experiments with wild-type TFA1, an empty vector, and TFA1 variants with mutations in the TFIIE eWH domain (N50E/K51E/T52E, N50A/K51A/T52A, and P56A/A59E/R62E in eWH helix α3, and ΔE-wing), or the E-ribbon(L134E/V137E/L140E) (see Methods). k, The long E-wing in the TFIIE subunit Tfa1 eWH is characteristic of WH domains involved in DNA strand separation77. The upstream edge of the transcription bubble and eWH domain are shown in a front view50 rotated by ~20° in the horizontal axis. Corresponding regions of human (Hs) Werner syndrome ATP-dependent helicase (WRN) WH (PDB: 2WWY) and RecQ1 WH (PDB: 3AAF) domains are shown.

Extended Data Figure 4 Closed complex and spontaneously formed open complex.

a, SDS–PAGE analysis of CC–cMed–Med1 complex after size-exclusion chromatography. Protein colours as in Fig. 1. b, Cryo-EM micrograph of the CC–cMed–Med1 complex. Scale bar, 50 nm. c, Ten representative reference-free 2D class averages of CC–cMed–Med1 reveal flexibility for the upstream complex. Core Mediator was not retained during cryo-EM analysis. d. Detailed view of the Pol II funnel helices in the CC (top) and OC5 (bottom) densities. e, Promoter sequences and differences in protein–DNA interactions are shown for the two distinct nucleic acid scaffolds used for preparation of closed and open complexes (compare Fig. 1d). Coloured bars indicate DNA–protein interaction. Solid, shaded, and empty circles respectively represent nucleotides included in the structure, excluded owing to weak cryo-EM density, or excluded owing to absence of cryo-EM density. Analogous yeast (black) and human (grey) numbering of promoter DNA is shown. The TATA-box sequence (red box) and HIS4-promoter sequence absent in the modified OC nucleic acid scaffold19 (grey box) are indicated. Protein–DNA interactions in the region covered by the light grey box are unchanged between CC and OC, and shown only for the OC for clarity. Unique and altered interactions are shown for each complex. DNA–TFIIEα photo-crosslinks, indicated by black asterisks, were observed in a closed but not open promoter state40 and are consistent with the CC model. f, Fit of TFIIE, Tfg2 WH and downstream DNA into CC density. Two rigid bodies were used for fitting: (i) Tfg2 WH and Tfa2 WH1 and (ii) Tfa2 WH2, eWH, E-linker and E-tether helices. Although the overall fit reflects density well, the eWH domain and its E-wing may be rotated further away from promoter DNA. g, Details on the location of downstream DNA (template, blue; non-template, light blue), Tfg2 WH, and Tfa2 WH1 and WH2 in the closed (dark colours) and open (light colours) complexes in the same view as in f. h, Cryo-EM density of OC5 and the OC ribbon model are shown in a front view50. The OC5 map shows weak density in regions of upstream assembly, TFIIE, and DNA that may be caused by increased flexibility owing to the heterogeneous population of spontaneously opened DNA. Colours as in Fig. 1. i, Fit of promoter DNA to cryo-EM densities of CC and OC5, shown in a side view50.

Extended Data Figure 5 Pol II cleft clearance, structural flexibility and rearrangements in the OC.

a, Pol II lid and fork loop 1 assume new conformations in the OC, clearing the Pol II cleft for loading of single-stranded template DNA. Arrows indicate the direction of movement of the two Pol II elements, and the template DNA loading path. The lid (dark red) in the open complex is moved in comparison to the lid of a Pol II–TFIIB ITC crystallographic study (PDB: 4BBS). Yellow and red boxes indicate zoomed-in regions of b and c, respectively. b, The movement in the Pol II lid leads to a steric clash with the TFIIB B-reader, observed in a Pol II–TFIIB ITC crystal (PDB: 4BBS), and facilitates its withdrawal in the open complex. In particular the lid residue F252 clashes with W63 and S67 of the B-reader. The OC1 cryo-EM density is shown for both lid and B-reader elements. c, The cryo-EM density of the OC1 reveals an ‘open’ Pol II fork loop 1 and a stably associated fragment of putative template DNA. The ‘open’ state of fork loop 1 provides additional space for loading of single-stranded template DNA past the Pol II rudder, towards the active site cleft. d, The position of the TFIIB N-terminal cyclin domain (light green) is altered in comparison to a Pol II–TFIIB ITC crystal structure15 (dark grey), but similar to its location in a cITC19 (light grey), probably owing to the presence of DNA. e, Flexibility of the upstream DNA assembly. The cryo-EM data of the OC was sorted on the basis of structural differences using an upstream assembly mask that included upstream DNA, TFIIA, TBP, and TFIIB cyclin domains (OC2 round 1, compare Extended Data Fig. 7c). Four of five resultant classes revealed different positions of the upstream complex, indicated here by fitted ribbon models of the OC. Previously defined front and side views50 are shown. Class 2 (middle) revealed the TFIIA four-helix bundle rotated by 85°, consistent with a high degree of flexibility. Class 4 represents the largest fraction of the data (31%), and gave a more defined density for the upstream complex, which was improved by further classification (Extended Data Fig. 7c). Class 5 presented with no density for the upstream complex or the Tfg2 linker, but did show density for the TFIIB B-ribbon and the TFIIF dimerization domain, suggesting that TFIIB and TFIIF remained bound to the complex. This is consistent with TFIIF-dependent association of the TFIIB-core domain with the Pol II wall27, and this apparently requires an ordered Tfg2 linker. f, The Rpb4–Rpb7 stalk adopts different positions in cITC, cITC-cMed, and OC. This suggests that Mediator and TFIIE may bind co-operatively. This is consistent with previous findings78 and with pulldowns (Extended Data Fig. 3h), which suggest that the TFIIE E-ribbon–stalk interface, which is important for TFIIE recruitment, is stabilized in the presence of Mediator.

Extended Data Figure 6 Pol II clamp positions and TFIIB B-reader mobility during DNA opening.

a, The yeast CC is shown from a side view50, indicating the path of DNA and location of TFIIE. The eye symbol (grey) indicates the point of view in b. b, The Pol II clamp may undergo transitions during DNA opening as indicated. The OC model of the Pol II clamp is shown superimposed on yeast CC (this study), and yeast OC (this study). The OC model Pol II clamp was rigid-body fitted to the human CC cryo-EM density20 (EMD-2306) and is superimposed. The view is from the front50. c, The TFIIB B-reader element shows strong density only in the ITC state, suggesting that its mobility in earlier states may be important for maintaining a cleared path for template DNA loading into the Pol II cleft. Ordering of the B-reader may further lead to stabilization of the upstream promoter assembly that is flexible in the OC (Extended Data Figs 5e, 7c). Cryo-EM densities for yeast CC (this work), OC5 (this work), OC (this work), and ITC (EMD-2785) complexes are superimposed on the TFIIB model (PDB: 4BBS for the B-linker and B-reader). As secondary structure elements could not be resolved in the human CC20, we excluded this cryo-EM density from comparison.

Extended Data Figure 7 Three-dimensional classification of cryo-EM data.

a, Three-dimensional image classification of the cryo-EM data set into eight classes using an initial OC reconstruction as the reference model, revealed heterogeneity. The percentage of single particles contributing to each class is provided. To help visualize structural differences, 3D reconstructions of the OC are coloured according to mobile regions: Pol II core, TFIIB B-ribbon (grey); upstream DNA, TFIIA, TBP, TFIIB cyclin domains, Tfg2 linker (green); TFIIF dimerization domain (purple); TFIIE except E-ribbon, Tfg2 WH (magenta); Pol II Rpb4–Rpb7 stalk and E-ribbon (blue); cMed–Med1 (yellow). b, Focused classification into five classes using a mask covering the Pol II stalk and E-ribbon. The resultant class 1 (OC1) was subsequently refined to 3.58 Å resolution (grey box) and revealed the location of the TFIIE E-ribbon. Colours as in a. c, Improvement of densities for Tfg2 linker, TFIIB, and TFIIE, through rounds of focused 3D classification using various masks. First, heterogeneity due to flexibility of upstream DNA and associated factors was overcome by applying a mask around this region (round 1). Focused refinement of the upstream DNA assembly of the resultant class 4 of round 1 (OC2-focused), improved the density quality for TFIIA (Extended Data Fig. 1j). Classification of the OC2-focused density revealed the upstream DNA complex (OC2) at 3.97 Å resolution (green box). Separate classification of class 4 of round 1 using OC, Pol II stalk and TFIIE E-ribbon, and TFIIE masks yielded class 1 of round 4 (OC3, magenta box) that contained a complete TFIIE density at a nominal resolution of 4.35 Å after 3D refinement (see Extended Data Fig. 8c). The small fraction of stably bound TFIIE is consistent with its reduced affinity to the pre-initiation complex79. Focused refinement of OC3 with a TFIIE–stalk mask (OC3-focused) improved density for Tfg2 WH and Tfa2 WH1 domains. Colours as in a. d, To improve the density of TFIIF dimerization domain and the Tfg1 arm, three rounds of classification using a TFIIF, TFIIF dimerization domain, and OC mask were employed. Class 2 of round 2 (cyan box) enabled fitting of the Tfg1 N-terminal peptide, which was resolved by X-ray analysis (Extended Data Fig. 2d, e). This class was further refined locally using a mask covering the TFIIF dimerization domain, and then classified with an OC mask, revealing class 6 of round 3 at 3.89 Å resolution after 3D refinement (purple box). Colours as in a. e, 3D classification of the CC cryo-EM data set into four classes, using an initial CC reconstruction as the reference model, revealed heterogeneity. Mobile regions in the reconstructions are highlighted: promoter DNA (blue), TFIIE (except E-ribbon), and Tfg2 WH (orange). Classifying the most populated classes from round 1 into three classes unexpectedly revealed open and closed promoter DNA states in the data set: CC (round 2, class 1) and OC5 (round 2, class 3). Class 3 of round 2 (OC5) was refined to 6.1 Å resolution (blue box). Class 1 from round 2 was further classified into three classes. The resultant class 3 of round 3 revealed density for closed downstream promoter DNA above the Pol II cleft, and TFIIE. The cryo-EM density for downstream DNA and TFIIE was improved by focused classification using two soft-edged masks. A mask covering the Pol II Rpb4–Rpb7 stalk yielded a class with better occupancy for the stalk (round 4, class 3), which was further sorted using a mask covering TFIIE and Tfg2 WH to improve their densities. Class 1 of round 5 was refined to 8.8 Å resolution (CC, orange box).

Extended Data Figure 8 Resolution of cryo-EM reconstructions.

a, Gold-standard FSC (left) of the OC1 cryo-EM single particle reconstruction (FSC = 0.143). Orientation distribution plot of all particles that contribute to the OC1 reconstruction (middle). The OC1 cryo-EM map is shown (right) from a front view50 and a central slice through the reconstruction, which are coloured by local resolution as described19. b, As in a but for the OC2 reconstruction. The gold-standard FSC for the density obtained from focused refinement (OC2-focused) with a soft mask around the upstream DNA assembly is indicated in grey (see Methods). The region masked for focused refinement is indicated with a grey outline on the cyro-EM map coloured by local resolution (right). c, As in b, but for the OC3 and OC3 focus-refined reconstructions. d, As in a, but for the OC4 and OC4 focus-refined reconstructions. e, As in a, but for the CC reconstruction. f, As in a, but for the OC5 reconstruction.

Extended Data Figure 9 Data collection, refinement statistics, and structure validation.

a, Cryo-EM data collection and refinement statistics of the OC structure. Different regions of the composite OC structure were refined into OC1, OC2, and OC4 maps as described (see Methods) to obtain an atomic model for 90% of the structure. b, Gold-standard FSC between the respective coordinate models and local regions of the OC1, OC2, and OC4 cryo-EM maps used for model refinement and between overall OC and CC models compared to OC3 (best TFIIE density) and CC cryo-EM maps. c, X-ray crystallographic data collection and refinement statistics.

PowerPoint slides

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Plaschka, C., Hantsche, M., Dienemann, C. et al. Transcription initiation complex structures elucidate DNA opening. Nature 533, 353–358 (2016). https://doi.org/10.1038/nature17990

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/nature17990

This article is cited by

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing