The influenza virus polymerase transcribes or replicates the segmented RNA genome (viral RNA) into viral messenger RNA or full-length copies. To initiate RNA synthesis, the polymerase binds to the conserved 3′ and 5′ extremities of the viral RNA. Here we present the crystal structure of the heterotrimeric bat influenza A polymerase, comprising subunits PA, PB1 and PB2, bound to its viral RNA promoter. PB1 contains a canonical RNA polymerase fold that is stabilized by large interfaces with PA and PB2. The PA endonuclease and the PB2 cap-binding domain, involved in transcription by cap-snatching, form protrusions facing each other across a solvent channel. The 5′ extremity of the promoter folds into a compact hook that is bound in a pocket formed by PB1 and PA close to the polymerase active site. This structure lays the basis for an atomic-level mechanistic understanding of the many functions of influenza polymerase, and opens new opportunities for anti-influenza drug design.
Influenza A virus (FluA) mainly infects waterfowl and domestic fowl, although some strains cause disease in mammals such as humans, pigs, horses, seals and bats. The viral genome, composed of eight segments of negative-sense single-stranded RNA packaged in separate ribonucleoprotein particles, is transcribed and replicated by the heterotrimeric viral RNA (vRNA)-dependent RNA polymerase (RdRp), which comprises subunits PA, PB1 and PB2. The high mutation rate of the polymerase and the generation of novel viruses through reassortment of genome segments between different strains ensure rapid evolution of the virus with resultant seasonal epidemics and occasional, potentially devastating, pandemics. Although the polymerase has been studied extensively since the late 1960s, detailed understanding of its many functions both in vitro and in the context of the infected cell remains elusive (reviewed in refs 1 and 2), largely owing to the lack of atomic resolution structural information on the full-length polymerase. Nevertheless, in recent years, several crystal structures of fragments of the polymerase subunits have yielded important insights (reviewed in refs 1 and 3). These include the two domains involved in the unique cap-snatching mechanism of transcription used by the virus4, namely the PA amino-terminal endonuclease domain (PA-Nter)5,6 and the central PB2 cap-binding domain7; these structures have contributed to a renaissance in anti-influenza drug design targeting the polymerase8,9. In addition, structures are available of the inter-subunit interfaces between the PA carboxy-terminal domain (PA-C) and PB1-Nter (refs 10,11), between PB1-Cter and PB2-Nter (ref. 12), and of the PB2 C-terminal double 627-NLS domain13, which carries both the host-specific PB2 residue 627 (Lys in human and Glu in avian strains; reviewed in ref. 14) and the PB2 nuclear localization signal (NLS)15.
Here we describe the crystal structure of the complete heterotrimeric FluA polymerase bound to the vRNA promoter. To bypass difficulties in expression of recombinant human or avian polymerases, we used polymerase from the recently discovered bat-specific influenza virus (bat FluA)16, which is evolutionarily close to human/avian A strains with 70.0 (78.2), 79.5 (87.7) and 68.0 (78.6) per cent identity (similarity) for PA, PB1 and PB2, respectively (Supplementary Fig. 1). Bat polymerase can replicate efficiently in human cells16 and vice versa17, suggesting that the bat structure will be a good model for all FluA polymerases. Here we describe the overall architecture of the polymerase, the structure of each subunit and their interfaces, and how the conserved 3′ and 5′ sequences of the vRNA promoter are bound. In the accompanying manuscript18, using two additional crystal structures of influenza B polymerase, implications of the structures for the mechanisms of de novo vRNA replication and cap-dependent transcription are presented.
Structure determination and overall architecture
Heterotrimeric influenza polymerase from A/little yellow-shouldered bat/Guatemala/060/2010(H17N10) was expressed in insect cells as a self-cleaving polyprotein and purified in milligram quantities to homogeneity (Extended Data Fig. 1). Using short templates, such as a 39-nucleotide vRNA mini-panhandle containing the conserved extremities or separated 3′ (template) and 5′ (activator) sequences, the recombinant bat polymerase is active in cap-dependent transcription as well as ApG-primed and, less efficiently, unprimed replication assays (Extended Data Fig. 2) without the need for the viral nucleoprotein, consistent with previous work19. Co-crystals of FluA polymerase were obtained with nucleotides 1–16 from the vRNA 5′ end (5′-pAGUAGUAACAAGAGGG-3′), and nucleotides 1–18 or 3–18 from the 3′ end (3′OH-UCGUCUUCGUCUCCAUAU-5′OH). The structure was solved by molecular replacement at 2.65 Å resolution using the structure of FluB polymerase18 (Extended Data Table 1). The FluA polymerase structure is 97.8% complete with 699 out of 714 (for PA), 750 out of 756 (for PB1), and 733 out of 760 (for PB2) residues modelled (2,182 out of 2,230 total).
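The completeness figure quoted above follows directly from the per-subunit residue counts. As a minimal sketch (the variable and function names are illustrative and not part of the original work), the arithmetic can be reproduced as follows:

```python
# Residues modelled / residues in the subunit, as quoted in the text.
modelled = {"PA": (699, 714), "PB1": (750, 756), "PB2": (733, 760)}


def completeness(counts):
    """Return (built, total, percent) summed over all subunits."""
    built = sum(b for b, _ in counts.values())
    total = sum(t for _, t in counts.values())
    return built, total, 100.0 * built / total


built, total, pct = completeness(modelled)
print(f"{built} of {total} residues modelled ({pct:.1f}%)")

# Per-subunit completeness, derived from the same counts.
for name, (b, t) in modelled.items():
    print(f"{name}: {100.0 * b / t:.1f}%")
```

The summed counts give 2,182 of 2,230 residues, in agreement with the overall value stated in the text.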
The FluA polymerase has a U-shaped structure, with approximate height, width and depth of 115 × 100 × 75 Å, respectively (Fig. 1, Extended Data Fig. 3 and Supplementary Videos 1 and 2). The two protruding arms are formed by the PA-Nter endonuclease and PB2 cap-binding domains, which face each other across a solvent channel. The bottom of the U is formed by the large PA-C domain and one of the sides by the C-terminal two-thirds of PB2 (PB2-C) including the cap-binding domain. The body of the trimer is formed by PB1, decorated on one side by the N-terminal third of PB2 (PB2-N) (Fig. 1a, b) and on the other side by the linker (PA-linker) that connects the PA endonuclease (PA-Nter) with PA-C (Fig. 1b). Previous studies have revealed crucial but limited tail (Cter) to head (Nter) interactions between PA and PB1 (refs 10 and 11) and PB1 and PB2 (refs 12, 20 and 21). The actual inter-subunit interactions are much more extensive than this owing to an extremely complex intertwining of the subunits. The total buried surface area between PB1 and PA is 17,330 Å2 and between PB1 and PB2 is around 14,100 Å2, whereas the area between PA and PB2 is only 2,880 Å2, confirming the central scaffolding role of PB1. The trimer contains a large, internal, catalytic and RNA-binding cavity formed by PB1 and PB2-N that is partially open at the top to the solvent channel between the PA endonuclease and PB2 cap-binding domains (putative template/product exit channel), as well as being accessible via two narrow side tunnels, the putative NTP and template entrance channels (see below). For sequence alignments of bat and human FluA polymerase and secondary structure assignments, see Supplementary Fig. 1. A schematic of each subunit domain structure is given in Fig. 1d.
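The scaffolding role of PB1 can be made concrete by asking what fraction of the total inter-subunit buried surface involves each subunit. The sketch below uses only the three areas quoted above (the PB1–PB2 value is approximate in the text); the helper name is illustrative.

```python
# Buried surface areas between subunit pairs, in square angstroms (from the text).
interfaces = {
    ("PB1", "PA"): 17330.0,
    ("PB1", "PB2"): 14100.0,  # quoted as "around 14,100" in the text
    ("PA", "PB2"): 2880.0,
}


def share_involving(subunit, areas):
    """Fraction of the total buried inter-subunit area that involves `subunit`."""
    total = sum(areas.values())
    involved = sum(a for pair, a in areas.items() if subunit in pair)
    return involved / total


for name in ("PB1", "PA", "PB2"):
    print(f"{name}: {100.0 * share_involving(name, interfaces):.1f}% of buried area")
```

On these numbers PB1 participates in roughly nine-tenths of the buried inter-subunit surface, whereas the direct PA–PB2 contact accounts for less than a tenth.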
Apart from the 15 N-terminal and 80 C-terminal residues, which form tight inter-subunit contacts with PA-C (refs 10, 11) and PB2-N (ref. 12), respectively, the detailed structure of the PB1 subunit has until now been completely unknown. However, sequence analysis revealed the presence of motifs pre-A (also known as F) and A–E characteristic of RNA-dependent RNA polymerases22,23,24 and correspondingly PB1 contains in its central region (residues 21–669) a typical right-handed RdRp fold, comprising fingers, fingertips, palm and thumb domains (Fig. 2a, b). A three-dimensional similarity search shows that hepatitis C virus (HCV) polymerase is structurally most like the polymerase region of PB1 (Fig. 2c), but many other RNA virus polymerases are also similar. Structural analysis has shown that Flaviviridae polymerases (for example, HCV, Dengue virus, West Nile virus)25,26,27 as well as bacteriophage Φ6 (ref. 28) contain a ‘priming loop’ to promote initiation of unprimed RNA synthesis29. In PB1, residues 641–657 form a conserved anti-parallel β-loop (Fig. 2b) structurally analogous to the HCV priming loop (Fig. 2d), which could be involved in unprimed genome or anti-genome replication by influenza polymerase.
There are several idiosyncratic features of PB1. First, there are the N- and C-terminal extensions (N-ext and C-ext; Fig. 1d) that make inter-subunit contacts with PA and PB2, respectively. Second, there is an unusually long (∼55 Å), solvent-exposed, flexibly hinged β-ribbon (strands β6 and β7, residues 177–212) (Fig. 2a, b). Interestingly, this element contains the PB1-NLS motifs, two separated basic patches (NLS1, 187-Lys/Arg-Lys-Lys/Arg-Arg-190 (bat/human) on β6; NLS2, 207-Lys-Lys-Arg/Lys-Val/Gln-Lys/Arg-211 on β7; Fig. 2a) that have been shown to be important for binding RanBP5, the PA–PB1 heterodimer nuclear import factor30. A third special feature of PB1 is a β-hairpin insertion (strands β12 and β13, residues 352–360; Fig. 2a) in the fingers domain, which, notably, passes through an extended loop in PA (the ‘PA-arch’; Fig. 3a). Both structures form an integral part of the 5′ vRNA-binding site (see below). The C-terminal extension of PB1 after the putative priming loop is involved in direct 3′-template binding (residues 671–676, see below).
PA and PB2 subunits
The two structurally known domains of PA, the PA-Nter endonuclease domain (residues 1–195) and the large PA-C domain (258–714), are on opposite sides of the molecule, connected by the previously uncharacterized PA-linker (196–257) (Figs 1b and 3a), which wraps around the external face of the PB1 fingers and palm domain. In particular, residues 201–257, which include three helical segments (α7–α9), lie across the surface of PB1 making numerous, often conserved, inter-subunit contacts that are both hydrophobic and polar in nature (Extended Data Fig. 4a). The endonuclease domain is anchored to the rest of the polymerase through contacts with the same helical region of PB1-Cter that interacts with PB2-Nter, so that all three subunits are involved in positioning the endonuclease (Fig. 1a, b). The main contacts are via the packing of endonuclease helix α4 against both the penultimate PB1 helix α21 and the PB2 ‘170-loop’ (169–174), and via the endonuclease insertion (67–74) with the last PB1 helix α22 (Extended Data Fig. 4b). The endonuclease active site is solvent-exposed and facing the cap-binding domain (Fig. 1a, b), as discussed elsewhere in relation to the mechanism of cap-snatching18.
The PB2 subunit is divided into the N-terminal third (PB2-N, residues 1–247) and the C-terminal two-thirds (PB2-C, residues 248–760), each formed by several folded subdomains (Figs 1d and 3b, c). PB2-N comprises a series of linked modules that wrap around one edge and face of PB1, interacting mainly with the PB1 C-terminal extension and the polymerase thumb domain, opposite to where the PA linker binds (Figs 1 and 3b). After the well-characterized helical bundle interface with PB1-Cter, residues 35–54 of PB2-Nter are in an extended conformation followed by helix α4 that interacts with the template as it enters the polymerase active site (see below). Residues 55–103 (β1, α5, β2, β3 and α6) form a more compact subdomain (PB2-N1) that buttresses the PB1 thumb domain (for example, PB2 helix α6 packs parallel against PB1 helix α17). Another linker leads to the PB2-N2 subdomain (residues 110–247), which has an extended shape (Fig. 3b). At one end a helical bundle (α9–α11, residues 160–212) is inserted, denoted the PB2 helical lid. This includes the 170-loop (around 169–174), which contacts the endonuclease (Extended Data Fig. 4b), and the projecting helix α10, the N terminus (residue Asp 180) of which closely approaches the cap-binding domain. At the other extremity of the N2 domain are two anti-parallel β-ribbons (β4–β7 and β5–β6) with a helix inserted between them (α12–α13). These make hydrophobic contacts with PA-Cter and with the thumb and palm domains of PB1.
PB2-C (residues 248–736) forms a single, arc-shaped unit (Fig. 3c), divided into five sub-domains, which constitutes one arm of the polymerase U-shape (Fig. 1). At one end of the arc is the cap-binding domain (319–481), and, at the other end, is the NLS domain (685–760), which is disordered beyond the NLS1 motif (736-Lys-Arg-Lys-Arg)15. The NLS domain is juxtaposed to the 627-domain (539–675) as observed in crystal structures of the isolated double 627-NLS domain13,31. The loop carrying the host-specific residue 627, normally lysine in human and glutamate in avian strains but serine in bat, is in a solvent-exposed position remote from the PB1 active site. A possible role of the 627-domain is discussed elsewhere18 (see also Supplementary Information). The central part of the PB2-C arc is composed of two disconnected but interacting sub-domains: the PB2 mid-domain (248–319) that directly precedes the cap-binding domain, and the cap-627 linker (483–538). The mid-domain is a four helix bundle with one of the inter-helical linkers containing a short β-strand (β8) that makes a stabilizing two-stranded parallel sheet with the cap-627 linker (β24) (Fig. 3b). The bat cap-binding domain is very similar to that of human or avian FluA32, but Phe 357 forms one side of the methylated base sandwich rather than a histidine (Supplementary Fig. 1). The cap-627 linker proceeds from the C terminus of the cap-binding domain into a small three-stranded β-sheet (495–515, β21–β23) that packs on the last helix (α17) of the PB2 mid-domain. This sheet has a distinctly concave, solvent-facing surface that could be involved in protein–protein interactions. The mid, cap and cap-627 linker domains do not make extensive interfaces with other polymerase subunits.
PB1 functional regions
The catalytic centre responsible for template-directed nucleotide addition is located in the PB1 internal cavity and formed mainly by the highly conserved RdRp motifs pre-A/F and A–E. Comparison with known polymerase structures allows modelling of the template, substrate RNA and incoming NTPs into the PB1 active site, and deduction of the roles of certain key conserved residues (Fig. 4 and Extended Data Fig. 5). Motif pre-A/F is partly contained in the fingertips, a loop (residues 222–246) that extends from the fingers towards the thumb domain and the tip of which is stabilized by contacts with PA helix α20 (Fig. 2b and Extended Data Fig. 5a). Whereas HCV and Norwalk virus polymerases have two fingertip loops (one corresponding to motif F and the other closer to the polymerase N terminus) (Fig. 2c, d), influenza polymerase PB1-Nter is analogous to the second loop with residues 24–38 crossing from thumb to fingers in intimate association with the fingertips. Several conserved basic residues from motif pre-A/F are likely to be involved in template binding, and NTP channelling and binding33 (Fig. 4a). Motif A contains the conserved active site Asp 305, which, together with Asp 445 and Asp 446 on motif C, coordinates two divalent metal ions (Fig. 4a) and promotes catalysis33. These residues have been shown to be essential for PB1 activity23. Motif B has a characteristic methionine-rich loop in PB1 (406-GMMMGMF), and is probably involved in stabilizing the base pair between the incoming NTP and the template. Motif D contains conserved Lys 480 and Lys 481 residues (involved in NTP binding) and is stabilized by contacts with PA helix α20 (656–663) and the PA peptide 671–684. Motif E forms another β-hairpin containing conserved residues thought to stabilize the position of the substrate/priming NTP (Fig. 4a).
As in other polymerases, a narrow tunnel, lined with positively charged residues, connects the internal cavity to the outside and this is presumed to attract and channel NTPs into the active site electrostatically (Extended Data Fig. 5a, b). In PB1, this putative NTP tunnel directly leads to the tip of the putative priming loop and involves highly conserved PB1 basic residues Arg 45, Lys 235, Lys 237 and Arg 239 (motif F3), Lys 308 (motif A), and Lys 480 and Lys 481 (motif D). A second tunnel constitutes the putative template entrance channel that is lined by conserved residues from all three subunits (Extended Data Fig. 5c, d).
For initiation of RNA synthesis, the influenza polymerase needs to be bound to a promoter that comprises both conserved extremities of the pseudo-circularized vRNA or complementary RNA (cRNA)34,35. The pyrimidine-rich 3′ (template) and purine-rich 5′ (activator) extremities are partially complementary and can form a non-canonical double helix, usually referred to as the panhandle36. However, they are thought to bind the polymerase in a partially single-stranded conformation35, either as a ‘corkscrew’37,38 or a ‘fork’39,40, or as a combination of both41. These models concur on the presence of a distal base-paired region between nucleotides 11–14 of the 5′ and 10–13 of the 3′ ends, but differ in whether the individual proximal strands have internal structure or not. The polymerase–promoter crystal structure shows that the distal region is indeed base-paired, and that nucleotides 1–10 of the 5′ end form a compact stem–loop (hook) structure (Fig. 4b).
The hook structure, formed by nucleotides 1–10 of the 5′ vRNA (5′-pAGUAGUAACA), has two central canonical base pairs (G2–C9 and U3–A8) flanked by mismatch base pairs A1–A10 and A4–A7 (Fig. 5a). The stem is capped by G5, which stacks antiparallel on A4, and by U6, whose base faces outward. The sequence characteristics of the 5′ hook are conserved in all known influenza virus vRNAs and cRNAs, the only variations, reflecting the imperfect complementarity of the two extremities, being the nature of the 2–9 and 3–8 Watson–Crick base pairs (G–C and A–U in vRNA, and G–C and C–G in cRNA, respectively) and the loop nucleotides 5 (usually a G) and 6 (usually an A). This hook structure is also likely to be conserved in orthomyxoviruses of the Thogoto lineage, except that G4–A7 would replace the A4–A7 mismatch42.
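The pairing pattern of the hook can be checked against the crystallized 5′ sequence. The following is a minimal sketch that only classifies each stated pair as Watson–Crick or mismatch from the sequence; it says nothing about the actual hydrogen-bonding geometry, and the function name is illustrative.

```python
# Nucleotides 1-10 of the 5' vRNA end, written 5'->3' (from the text).
HOOK_5P = "AGUAGUAACA"
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}

# Stem positions (1-based) as described for the hook.
STEM_PAIRS = [(1, 10), (2, 9), (3, 8), (4, 7)]


def classify(seq, i, j):
    """Label the pair between 1-based positions i and j of seq."""
    a, b = seq[i - 1], seq[j - 1]
    kind = "canonical" if (a, b) in WATSON_CRICK else "mismatch"
    return f"{a}{i}-{b}{j}", kind


pairs = [classify(HOOK_5P, i, j) for i, j in STEM_PAIRS]
loop = [f"{HOOK_5P[k - 1]}{k}" for k in (5, 6)]

for label, kind in pairs:
    print(label, kind)
print("loop:", ", ".join(loop))
```

The classification reproduces the description above: two central canonical pairs flanked by two A–A mismatches, with G5 and U6 forming the loop.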
The 5′ hook is sandwiched in a pocket formed on one side by strands β17–β18 and β20 of the main β-sheet of PA, and on the other by the PA-arch (366–397) and the PB1 β-hairpin (353–370) that inserts through the arch (Fig. 5b). The buried surface area of the 5′ end totals 4,044 Å2 (60% with PA, 40% with PB1). Numerous polar interactions to the backbone (Extended Data Table 2) sense the shape of the stem–loop, including contacts to all phosphates (except 6–7) as well as to several ribose 2′ OHs. Base contacts are made to invariant 5′ residues G2, A7, A10 and A11 as well as to G5 and U6. Key interacting and highly conserved residues from PA are His 326, the peptide 366–370, 388-Tyr-Lys, 503-Arg-Leu-His, Lys 534, Arg 561 and Lys 569. From PB1 they include His 32, Thr 34 and Tyr 38 (conserved in all influenza strains) and 356-Met-Phe-Glu (Fig. 5c, d and Extended Data Fig. 6). An especially dense series of interactions binds and stabilizes the sharp turn between 5′ A10–A11 (Fig. 5c). The PA-arch motif 366-Gly-Glu-Gly-Gln-Ala-370 forms a phosphate-binding loop, which interacts tightly with the backbone of A10–A11. His 505 (His 510 in human/avian strains) stacks on base A11 and hydrogen bonds to unpaired G9 of the 3′ strand, which in turn stacks on PA Met 472. This histidine has previously been shown to be a crucial residue in regulating transcription43. PA Arg 503 and PB1 Arg 365 make multivalent interactions with the RNA backbone (Fig. 5c). Conserved PB1 residues His 32 and Tyr 38 contact the phosphates of G5 and U6 and the double prolines 392-Pro-Pro in the PA-arch stack on the bases of these same nucleotides (Fig. 5d).
There are five base pairs in the duplex region of the promoter, 3′ 10-UCUCC-14 with 5′ 11-AGAGG-15, which projects away from the polymerase (Fig. 1c). The self-complementary four-nucleotide overhang 15-AUAU-18 of the crystallized 3′ end base-pairs with a crystal symmetry-related equivalent, thus forming a pseudo-continuous double-stranded RNA of 14 base pairs between two two-fold-related polymerases (Extended Data Fig. 7). The duplex region of the promoter is contacted by the central section of the long PB1 β-ribbon and by residues 672–676 of PB1-Cter (Extended Data Fig. 6). The PA peptide 503-Arg-Leu-His, reinforced by 466–475, forms a wedge that separates the 5′ and 3′ strands into binding pockets (Extended Data Fig. 6). Only the proximal single-stranded 3′ nucleotides 6-UUCG-9 are visible in the structure, and these are directed towards the polymerase template entry tunnel before turning away towards the solvent. There is a sharp turn between unpaired 3′ end nucleotides G9 and C8 (Extended Data Fig. 6). Residues, very highly conserved in all influenza strains, from all three subunits (PA 505–509 and Lys 567, PB1-Cter 671–676 and PB2 36–49) are involved in binding the 3′ nucleotides 6-UUCG-9 (Extended Data Fig. 6). At the apex of the sharp turn, the phosphate of 3′ C8 is bound by PA Lys 567 and PB2 Arg 46, the latter being positioned by salt bridges with PA Asp 509 and PB2 Glu 40. PA Arg 507 and PB1 C-terminal extension residues Asn 671, Arg 672 and Ser 673 interact extensively with the backbone of 3′ U7 and U10.
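The base-pair counts in this paragraph can be verified from the two crystallized strands. The sketch below assumes the register given in the text (5′ nucleotide i pairs with 3′ nucleotide i − 1 for i = 11–15) and treats the symmetry-related contact simply as an antiparallel pairing of the overhang with a copy of itself; all names are illustrative.

```python
FIVE_PRIME = "AGUAGUAACAAGAGGG"     # 5' end, nt 1-16, written 5'->3'
THREE_PRIME = "UCGUCUUCGUCUCCAUAU"  # 3' end, nt 1-18, numbered from the 3' terminus
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def nt(strand, pos):
    """Nucleotide at 1-based position pos."""
    return strand[pos - 1]


# Duplex register from the text: 5' 11-15 with 3' 10-14.
duplex = [(i, i - 1) for i in range(11, 16)]
all_watson_crick = all(
    COMPLEMENT[nt(FIVE_PRIME, i)] == nt(THREE_PRIME, j) for i, j in duplex
)

# 3' overhang nt 15-18, which pairs with a symmetry-related copy of itself.
overhang = THREE_PRIME[14:18]


def is_self_complementary(seq):
    """True if seq equals its own reverse complement."""
    return seq == "".join(COMPLEMENT[b] for b in reversed(seq))


# Two promoter duplexes joined through one shared overhang duplex.
total_bp = 2 * len(duplex) + len(overhang)
print(all_watson_crick, overhang, is_self_complementary(overhang), total_bp)
```

Five pairs from each polymerase plus the four overhang pairs give the 14 base pairs of the pseudo-continuous duplex.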
The structure of influenza polymerase, the first from any negative-strand RNA virus, reveals the enormous complexity of the molecule and highlights the fact that all three subunits are intricately involved in many of the most important functional regions. This undoubtedly explains why 40 years of polymerase biochemistry has often led to confusing and contradictory results. For instance, numerous studies have tried to identify the vRNA 3′- and 5′-end binding sites by crosslinking and/or mutagenesis44,45,46,47 but have failed to reveal the critical residues (see Supplementary Information). Conversely, the vRNA promoter structure itself is essentially as predicted41, although the A–A mismatches in the 5′-end hook were not foreseen. Indeed, the hook, tightly bound in a pocket formed by PA and PB1, is an integral part of the polymerase structure and this binding is required to enhance or activate polymerase functions48,49,50 (Extended Data Fig. 2). Without an apo-structure, this cannot be fully rationalized yet, but it is likely that without the stabilization promoted by 5′-end binding the nearby polymerase active site will be disorganized. Whereas, in the bat polymerase structure, the 3′ end of the template is not completely visible, in the FluB polymerase structure the complete 3′ strand is well ordered18. However, rather than being directed into the PB1 active site, the vRNA 3′ end seems to have an alternative, but specific, binding site lying on the surface of the polymerase in the vicinity of the long PB1 β-ribbon. This is discussed further in the accompanying paper, along with other insights into polymerase function derived from the structure18.
There is considerable interest in understanding the exact role of polymerase residues that have been implicated in host adaptation, notably between avian and human influenza A strains14. Such mutations, identified by analysis of natural sequences or serial adaptation of viruses to mice, typically have a neutral effect in avian cells but enhance polymerase activity in mammalian cells. Because the positions of implicated residues can henceforth be mapped onto the full polymerase structure, an initial distinction can now be made between those residues that are more likely, because of their internal location, to affect the intrinsic rate of polymerase functions (which could be important for species-dependent physiological reasons), and others, which, because of their surface location, possibly act through direct interaction with other viral or cellular factors. Some initial observations are made in the Supplementary Information, but further structural studies of the polymerase in different functional conformations and eventually with bound host factors are required to determine the exact role of these putative host-specific residues.
Finally, the unexpectedly good resolution of this crystal structure gives hope that structure-based drug design targeting the PB1 active site, vRNA binding or numerous potential allosteric sites, will soon become possible.
The influenza A/little yellow-shouldered bat/Guatemala/060/2010(H17N10) polymerase heterotrimer was expressed as a self-cleaving polyprotein (Extended Data Fig. 1a). A codon-optimized synthetic construct (DNA2.0) with the composition GNH[BstEII] GSGSENLYFQ[TEV]GSHHHHHHHH[8×His-tag] GSGS-PA (GenBank ID AFC35437.1) GSGSGENLYFQ[TEV]GSGSGSGSG-PB1 (GenBank ID AFC35436) GSGSGENLYFQ[TEV]GSGSGSGSG-PB2 (GenBank ID AFC35435.1) GWSHPQFEK[Strep-tag] GGGSGGGSGGSAWSHPQFEK[Strep-tag] GRSG[RsrII] was cloned via BstEII and RsrII sites into the vector pKL-PBac51, which also contains coding sequences for tobacco etch virus (TEV) protease (5′) and cyan fluorescent protein (CFP) (3′). (Bracketed labels follow the elements they name: restriction sites, TEV site, His-tag and Strep-tag.)
Expression and purification
The bat FluA polymerase was produced in HighFive insect cells using the baculovirus expression system. Cells were collected by centrifugation, re-suspended in buffer A (50 mM Tris-HCl, 500 mM NaCl, 10% (v/v) glycerol and 5 mM β-mercaptoethanol, pH 8) supplemented with protease inhibitors (Roche, complete mini, EDTA-free), and lysed by sonication. Cell debris was spun off (30 min, 4 °C, 35,000g) and ammonium sulphate added to the clarified supernatant (0.5 g ml−1) to force the protein out of solution. The precipitated protein was collected by centrifugation (30 min, 4 °C, 70,000g) and re-suspended in buffer A. After a final centrifugation step (30 min, 4 °C, 70,000g) the polymerase was purified from the fraction of soluble proteins via immobilized metal ion affinity chromatography and a strep-tactin resin (IBA, Superflow), using buffer A as running buffer in both cases. Fractions containing the target protein were pooled and diluted with an equal volume of buffer B (50 mM HEPES/NaOH, 10% (v/v) glycerol and 2 mM TCEP, pH 7.5) before loading on a heparin column (HiPrep Heparin HP, GE Healthcare). Polymerase was eluted by a gradient of buffer B supplemented with 1 M NaCl, concentrated, and subjected to size-exclusion chromatography (S200, GE Healthcare) in buffer C (50 mM HEPES/NaOH, 500 mM NaCl, 5% (v/v) glycerol and 2 mM TCEP, pH 7.5). Monomeric and RNA-free polymerase was concentrated, flash-frozen and stored at −80 °C. The typical yield of pure heterotrimer is about 1 mg l−1 of insect cells.
Crystallization, data collection and structure solution
Polymerase protein in buffer C was adjusted to a concentration of 10 mg ml−1 and mixed in a 1:1 ratio with vRNA, which was an equimolar mixture of nucleotides 1–16 from the 5′ end (5′-pAGUAGUAACAAGAGGG-3′) and nucleotides 1–18 or 3–18 from the 3′ end (3′OH-UCGUCUUCGUCUCCAUAU-5′OH) (IBA). Crystallization trials were performed by vapour diffusion at 4 °C using a Cartesian robot. The best crystals grew in mother liquor containing 0.7–1.5 M sodium/potassium phosphate at pH 5.0. For data collection, crystals were flash-frozen in well solution supplemented with 25% glycerol. Diffraction data were collected at 100 K with an X-ray wavelength of 0.9763 Å on beamline ID23-1 of the European Synchrotron Radiation Facility equipped with a Pilatus 6M-F detector, and were integrated and scaled with XDS52. Initial phases were obtained by molecular replacement with the structure of the influenza B polymerase18. The model was improved by making use of the five known high-resolution structures of FluA polymerase fragments (endonuclease53; PA-Cter/PB1-Nter, PDB codes 2ZN1 and 3CM8; PB1-Cter/PB2-Nter, PDB code 3A1G; and the PB2-cap and 627-NLS domains, PDB code 2VY6). Refinement was performed with Refmac54. A putative zinc ion is found bound between PB1 His 562 and PA Asp 421. Figures were drawn with PyMOL55. The vRNA and most protein regions have very good electron density apart from a few connecting peptides and the PA endonuclease domain, which has poor density except where it contacts the rest of the polymerase. Ramachandran statistics, as calculated by MolProbity56, are 94.2% favoured and 0.7% disallowed.
Polymerase activity assays
Either a T7-transcribed 39-nucleotide mini-panhandle or an equimolar mixture of separated synthetic 3′ and 5′ ends was used as vRNA (Extended Data Fig. 2a, b).
For the ApG-primed replication assay, 0.5 μM protein, 0.5 μM vRNA, 0.5 mM ApG, 0.4 mM GTP/CTP, 1 mM ATP, 0.04 mM UTP, 32P-UTP and 0.8 U μl−1 Ribolock, in buffer (150 mM NaCl, 50 mM HEPES, pH 7.5, 5 mM MgCl2 and 2 mM TCEP) were mixed and incubated at 30 °C for 2 h.
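For readers setting up a comparable reaction, pipetting volumes follow from the final concentrations above by simple dilution (C1V1 = C2V2). The sketch below is illustrative only: the stock concentrations and the 20 µl reaction volume are hypothetical assumptions and are not given in the original protocol.

```python
# Final concentrations from the assay description
# (polymerase and vRNA in uM; ApG and NTPs in mM).
FINAL = {"polymerase": 0.5, "vRNA": 0.5, "ApG": 0.5,
         "GTP": 0.4, "CTP": 0.4, "ATP": 1.0, "UTP": 0.04}

# HYPOTHETICAL stock concentrations, same unit as the matching FINAL entry.
STOCK = {"polymerase": 5.0, "vRNA": 10.0, "ApG": 10.0,
         "GTP": 10.0, "CTP": 10.0, "ATP": 10.0, "UTP": 1.0}

REACTION_UL = 20.0  # HYPOTHETICAL total reaction volume in microlitres


def pipetting_volumes(final, stock, total_ul):
    """Volume of each stock (ul) needed to reach its final concentration."""
    volumes = {}
    for name, c_final in final.items():
        c_stock = stock[name]
        if c_stock < c_final:
            raise ValueError(f"stock of {name} is too dilute")
        volumes[name] = total_ul * c_final / c_stock
    return volumes


volumes = pipetting_volumes(FINAL, STOCK, REACTION_UL)
# Volume left for buffer, labelled UTP, RNase inhibitor and water.
remainder_ul = REACTION_UL - sum(volumes.values())

for name, v in volumes.items():
    print(f"{name}: {v:.2f} ul")
print(f"remaining volume: {remainder_ul:.2f} ul")
```

The remaining volume has to accommodate the reaction buffer, the 32P-UTP and the Ribolock; if it is negative, the chosen stocks are too dilute for the chosen reaction volume.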
For the cap-dependent transcription assay, 0.5 μM protein, 0.5 μM vRNA, 0.4 mM GTP/CTP/UTP, 1 mM ATP and 32P-labelled capped RNA in the same buffer (150 mM NaCl, 50 mM HEPES, pH 7.5, 5 mM MgCl2 and 2 mM TCEP) were mixed and incubated at 30 °C for 2 h. For this purpose, a 5′ diphosphate synthetic 20-base RNA, 5′-ppAAUCUAUAAUAGCAUUAUCC-3′ (Chemgenes), was capped by incubating with vaccinia virus capping enzyme (purified in house following ref. 57) and 20 µM SAM, 32P-GTP, 50 mM Tris, pH 8.0, 6 mM KCl, 1.25 mM MgCl2 and 0.8 U μl−1 Ribolock.
For the endonuclease assay, transcription mix without any NTPs was incubated at 30 °C for 2 h. Samples were separated on a 7 M urea, 20% acrylamide gel in TBE buffer, exposed on a storage phosphor screen and read with a Typhoon scanner.
For the time course of unprimed and ApG-primed vRNA replication, 0.5 μM bat FluA polymerase was mixed with 1 μM 39-nucleotide vRNA mini-panhandle template, NTPs (1 mM ATP, 0.4 mM GTP, 0.4 mM CTP and 0.04 mM UTP) and 0.12 μCi μl−1 32P-UTP, in the absence or presence of 0.5 mM ApG. Reactions were incubated at 30 °C and samples were analysed on a 20% acrylamide, 7 M urea denaturing gel after 0, 2, 5, 10, 15, 20, 30, 40 and 50 min, and after 1, 2 and 3 h.
Resa-Infante, P., Jorba, N., Coloma, R. & Ortin, J. The influenza virus RNA synthesis machine: advances in its structure and function. RNA Biol. 8, 207–215 (2011)
Fodor, E. The RNA polymerase of influenza A virus: mechanisms of viral transcription and replication. Acta Virol. 57, 113–122 (2013)
Ruigrok, R. W., Crepin, T., Hart, D. J. & Cusack, S. Towards an atomic resolution understanding of the influenza virus replication machinery. Curr. Opin. Struct. Biol. 20, 104–113 (2010)
Plotch, S. J., Bouloy, M., Ulmanen, I. & Krug, R. M. A unique cap(m7GpppXm)-dependent influenza virion endonuclease cleaves capped RNAs to generate the primers that initiate viral RNA transcription. Cell 23, 847–858 (1981)
Dias, A. et al. The cap-snatching endonuclease of influenza virus polymerase resides in the PA subunit. Nature 458, 914–918 (2009)
Yuan, P. et al. Crystal structure of an avian influenza polymerase PAN reveals an endonuclease active site. Nature 458, 909–913 (2009)
Guilligay, D. et al. The structural basis for cap binding by influenza virus polymerase subunit PB2. Nature Struct. Mol. Biol. 15, 500–506 (2008)
Kowalinski, E. et al. Structural analysis of specific metal chelating inhibitor binding to the endonuclease domain of influenza pH1N1 (2009) polymerase. PLoS Pathog. 8, e1002831 (2012)
Clark, M. P. et al. Discovery of a novel, first-in-class, orally bioavailable azaindole inhibitor (VX-787) of influenza PB2. J. Med. Chem. 57, 6668–6678 (2014)
He, X. et al. Crystal structure of the polymerase PAC–PB1N complex from an avian influenza H5N1 virus. Nature 454, 1123–1126 (2008)
Obayashi, E. et al. The structural basis for an essential subunit interaction in influenza virus RNA polymerase. Nature 454, 1127–1131 (2008)
Sugiyama, K. et al. Structural insight into the essential PB1–PB2 subunit contact of the influenza virus RNA polymerase. EMBO J. 28, 1803–1811 (2009)
Tarendeau, F. et al. Host determinant residue lysine 627 lies on the surface of a discrete, folded domain of influenza virus polymerase PB2 subunit. PLoS Pathog. 4, e1000136 (2008)
Cauldwell, A. V., Long, J. S., Moncorge, O. & Barclay, W. S. Viral determinants of influenza A virus host range. J. Gen. Virol. 95, 1193–1210 (2014)
Tarendeau, F. et al. Structure and nuclear import function of the C-terminal domain of influenza virus polymerase PB2 subunit. Nature Struct. Mol. Biol. 14, 229–233 (2007)
Tong, S. et al. A distinct lineage of influenza A virus from bats. Proc. Natl Acad. Sci. USA 109, 4269–4274 (2012)
Poole, D. S. et al. Influenza A virus polymerase is a site for adaptive changes during experimental evolution in bat cells. J. Virol. 88, 12572–12585 (2014)
Reich, S. et al. Structural insights into cap-snatching and RNA synthesis by influenza virus polymerase. Nature http://dx.doi.org/10.1038/nature14009 (this issue)
Turrell, L., Lyall, J. W., Tiley, L. S., Fodor, E. & Vreede, F. T. The role and assembly mechanism of nucleoprotein in influenza A virus ribonucleoprotein complexes. Nature Commun. 4, 1591 (2013)
González, S., Zurcher, T. & Ortin, J. Identification of two separate domains in the influenza virus PB1 protein involved in the interaction with the PB2 and PA subunits: a model for the viral RNA polymerase structure. Nucleic Acids Res. 24, 4456–4463 (1996)
Poole, E. L., Medcalf, L., Elton, D. & Digard, P. Evidence that the C-terminal PB2-binding region of the influenza A virus PB1 protein is a discrete α-helical domain. FEBS Lett. 581, 5300–5306 (2007)
Müller, R., Poch, O., Delarue, M., Bishop, D. H. & Bouloy, M. Rift Valley fever virus L segment: correction of the sequence and possible functional role of newly identified regions conserved in RNA-dependent polymerases. J. Gen. Virol. 75, 1345–1352 (1994)
Biswas, S. K. & Nayak, D. P. Mutational analysis of the conserved motifs of influenza A virus polymerase basic protein 1. J. Virol. 68, 1819–1826 (1994)
Bruenn, J. A. A structural and primary sequence comparison of the viral RNA-dependent RNA polymerases. Nucleic Acids Res. 31, 1821–1829 (2003)
Yap, T. L. et al. Crystal structure of the dengue virus RNA-dependent RNA polymerase catalytic domain at 1.85-angstrom resolution. J. Virol. 81, 4753–4765 (2007)
Lesburg, C. A. et al. Crystal structure of the RNA-dependent RNA polymerase from hepatitis C virus reveals a fully encircled active site. Nature Struct. Biol. 6, 937–943 (1999)
Bressanelli, S. et al. Crystal structure of the RNA-dependent RNA polymerase of hepatitis C virus. Proc. Natl Acad. Sci. USA 96, 13034–13039 (1999)
Butcher, S. J., Grimes, J. M., Makeyev, E. V., Bamford, D. H. & Stuart, D. I. A mechanism for initiating RNA-dependent RNA polymerization. Nature 410, 235–240 (2001)
Caillet-Saguy, C., Lim, S. P., Shi, P. Y., Lescar, J. & Bressanelli, S. Polymerases of hepatitis C viruses and flaviviruses: structural and mechanistic insights and drug development. Antiviral Res. 105, 8–16 (2014)
Hutchinson, E. C., Orr, O. E., Man Liu, S., Engelhardt, O. G. & Fodor, E. Characterization of the interaction between the influenza A virus polymerase subunit PB1 and the host nuclear import factor Ran-binding protein 5. J. Gen. Virol. 92, 1859–1869 (2011)
Kuzuhara, T. et al. Structural basis of the influenza A virus RNA polymerase PB2 RNA-binding domain containing the pathogenicity-determinant lysine 627 residue. J. Biol. Chem. 284, 6855–6860 (2009)
Pautus, S. et al. New 7-methylguanine derivatives targeting the influenza polymerase PB2 cap-binding domain. J. Med. Chem. 56, 8915–8930 (2013)
Zamyatkin, D. F. et al. Structural insights into mechanisms of catalysis and inhibition in Norwalk virus polymerase. J. Biol. Chem. 283, 7705–7712 (2008)
Fodor, E., Pritlove, D. C. & Brownlee, G. G. The influenza virus panhandle is involved in the initiation of transcription. J. Virol. 68, 4092–4096 (1994)
Tiley, L. S., Hagen, M., Matthews, J. T. & Krystal, M. Sequence-specific binding of the influenza virus RNA polymerase to sequences located at the 5′ ends of the viral RNAs. J. Virol. 68, 5108–5116 (1994)
Hsu, M. T., Parvin, J. D., Gupta, S., Krystal, M. & Palese, P. Genomic RNAs of influenza viruses are held in a circular conformation in virions and in infected cells by a terminal panhandle. Proc. Natl Acad. Sci. USA 84, 8140–8144 (1987)
Neumann, G. & Hobom, G. Mutational analysis of influenza virus promoter elements in vivo. J. Gen. Virol. 76, 1709–1717 (1995)
Flick, R., Neumann, G., Hoffmann, E., Neumeier, E. & Hobom, G. Promoter elements in the influenza vRNA terminal structure. RNA 2, 1046–1057 (1996)
Fodor, E., Pritlove, D. C. & Brownlee, G. G. Characterization of the RNA-fork model of virion RNA in the initiation of transcription in influenza A virus. J. Virol. 69, 4012–4019 (1995)
Kim, H. J., Fodor, E., Brownlee, G. G. & Seong, B. L. Mutational analysis of the RNA-fork model of the influenza A virus vRNA promoter in vivo. J. Gen. Virol. 78, 353–357 (1997)
Pritlove, D. C., Poon, L. L., Devenish, L. J., Leahy, M. B. & Brownlee, G. G. A hairpin loop at the 5′ end of influenza A virus virion RNA is required for synthesis of poly(A)+ mRNA in vitro. J. Virol. 73, 2109–2114 (1999)
Briese, T. et al. Upolu virus and Aransas Bay virus, two presumptive bunyaviruses, are novel members of the family Orthomyxoviridae. J. Virol. 88, 5298–5309 (2014)
Fodor, E. et al. A single amino acid mutation in the PA subunit of the influenza virus RNA polymerase inhibits endonucleolytic cleavage of capped RNAs. J. Virol. 76, 8989–9001 (2002)
González, S. & Ortin, J. Characterization of influenza virus PB1 protein binding to viral RNA: two separate regions of the protein contribute to the interaction domain. J. Virol. 73, 631–637 (1999)
Li, M. L., Ramirez, B. C. & Krug, R. M. RNA-dependent activation of primer RNA production by influenza virus polymerase: different regions of the same protein subunit constitute the two required RNA-binding sites. EMBO J. 17, 5844–5852 (1998)
Jung, T. E. & Brownlee, G. G. A new promoter-binding site in the PB1 subunit of the influenza A virus polymerase. J. Gen. Virol. 87, 679–688 (2006)
Kerry, P. S., Willsher, N. & Fodor, E. A cluster of conserved basic amino acids near the C-terminus of the PB1 subunit of the influenza virus RNA polymerase is involved in the regulation of viral transcription. Virology 373, 202–210 (2008)
Leahy, M. B., Pritlove, D. C., Poon, L. L. & Brownlee, G. G. Mutagenic analysis of the 5′ arm of the influenza A virus virion RNA promoter defines the sequence requirements for endonuclease activity. J. Virol. 75, 134–142 (2001)
Rao, P., Yuan, W. & Krug, R. M. Crucial role of CA cleavage sites in the cap-snatching mechanism for initiating viral mRNA synthesis. EMBO J. 22, 1188–1198 (2003)
Poon, L. L., Pritlove, D. C., Sharps, J. & Brownlee, G. G. The RNA polymerase of influenza virus, bound to the 5′ end of virion RNA, acts in cis to polyadenylate mRNA. J. Virol. 72, 8214–8219 (1998)
Nie, Y., Bellon-Echeverria, I., Trowitzsch, S., Bieniossek, C. & Berger, I. Multiprotein complex production in insect cells by using polyproteins. Methods Mol. Biol. 1091, 131–141 (2014)
Kabsch, W. Integration, scaling, space-group assignment and post-refinement. Acta Crystallogr. D 66, 133–144 (2010)
Tefsen, B. et al. The N-terminal domain of PA from bat-derived influenza-like virus H17N10 has endonuclease activity. J. Virol. 88, 1935–1941 (2014)
Murshudov, G. N. Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallogr. D 53, 240–255 (1997)
DeLano, W. L. The PyMOL Molecular Graphics System; http://www.pymol.sourceforge.net (Schrödinger, LLC, 2002)
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D 66, 12–21 (2010)
De la Peña, M., Kyrieleis, O. J. & Cusack, S. Structural insights into the mechanism and evolution of the vaccinia virus mRNA cap N7 methyl-transferase. EMBO J. 26, 4913–4925 (2007)
We thank members of the ESRF-EMBL Joint Structural Biology Group for access to European Synchrotron Radiation Facility (ESRF) beamlines, staff of the European Molecular Biology Laboratory (EMBL) eukaryotic expression and high-throughput crystallization facilities within the Partnership for Structural Biology (PSB), D. Hart for help with construct design, and H. Malet for electron microscopy. This work was supported by ERC Advanced Grant V-RNA (322586) to S.C.
The authors declare no competing financial interests.
Extended data figures and tables
a, The heterotrimeric bat polymerase was recombinantly expressed in insect cells as a self-cleaving polyprotein. At its N terminus the polyprotein encodes the tobacco etch virus (TEV) protease, which cleaves C-terminal to the amino-acid sequence ENLYFQ (in italics) and releases N-terminally His-tagged PA, PB1, C-terminally strep-tagged PB2 and cyan fluorescent protein (CFP), the last of which facilitates monitoring of expression. Arrows indicate the N-to-C-terminal direction and the termini of each mature protein. The histidine and streptavidin tags are underlined. b, After ammonium sulphate precipitation, immobilized metal ion affinity chromatography, engineered streptavidin (strep-tactin) affinity chromatography and heparin chromatography, the final purification step consisted of size-exclusion chromatography. The elution profile (monitored by the absorbance at 280 nm), with a single and nearly symmetric peak, suggests a homogeneous and monomeric polymerase complex. mAU, milli-absorption unit. c, Fractions of the final size-exclusion chromatography were subjected to 10% SDS–PAGE followed by Coomassie blue staining. Lane 1 contains the molecular mass markers and lanes 2–7 the eluate with PA (85.4 kilodaltons (kDa)), PB1 (87.8 kDa) and PB2 (91.0 kDa). d, Recombinant bat FluA polymerase was visualized by electron microscopy following negative staining with sodium silico-tungstate of a 0.02 mg ml−1 protein sample. The image shows that the sample is homogeneous and monodisperse, with a V- or doughnut-like shape and a central cavity.
Extended Data Figure 2 Endonuclease, RNA transcription and RNA replication activities of recombinant FluA polymerase.
a, Mini-panhandle vRNA: 5′-pppAGUAGUAACAAGAGGGUAUUGUAUACCUCUGCUUCUGCU-3′. b, Separate 5′ and 3′ ends: 5′ end, 5′-pAGUAGUAACAAGAGGGUA-3′; 3′ end, 5′-UAUACCUCUGCUUCUGCU-3′. c, Endonuclease, cap-dependent transcription and ApG-primed replication assays. Cleavage of the cap donor is visible in lanes 2–6. Capped transcripts are visible in lanes 10 (from the vRNA panhandle template) and 13 (from the separated 5′ and 3′ vRNA ends), and cRNA products are visible in lanes 17 and 20. Markers, with sizes shown on the left, are RNA ladders labelled with 32P-pCp nucleotide. d, e, Time course of unprimed (d) and ApG-primed (e) vRNA replication by bat influenza A polymerase. The products of replication (cRNA) are indicated with an arrow. Ladders (lanes L) are 32P-pCp nucleotide-labelled RNA oligomers. ApG-primed replication is more efficient than unprimed replication.
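The relationship between the templates in panels a and b can be checked directly from the printed sequences. The short Python sketch below is only a string comparison, not part of the published analysis: it tests whether the separate 5′ and 3′ oligonucleotides flank the mini-panhandle and reports whatever lies between them. The helper name `split_panhandle` is hypothetical, and the terminal phosphates (ppp, p) are left out of the strings.

```python
# Sequences copied verbatim from panels a and b of the legend (written 5'->3').
# Terminal phosphate groups (ppp / p) are not represented in these strings.
MINI_PANHANDLE = "AGUAGUAACAAGAGGGUAUUGUAUACCUCUGCUUCUGCU"
FIVE_PRIME_END = "AGUAGUAACAAGAGGGUA"
THREE_PRIME_END = "UAUACCUCUGCUUCUGCU"


def split_panhandle(panhandle: str, five_arm: str, three_arm: str):
    """Return the linker between the two arms, or None if they do not flank the panhandle.

    Hypothetical helper for illustration only; it compares strings and says
    nothing about RNA secondary structure.
    """
    arms_fit = len(five_arm) + len(three_arm) <= len(panhandle)
    if not (arms_fit and panhandle.startswith(five_arm) and panhandle.endswith(three_arm)):
        return None
    # Everything between the end of the 5' arm and the start of the 3' arm.
    return panhandle[len(five_arm):len(panhandle) - len(three_arm)]


linker = split_panhandle(MINI_PANHANDLE, FIVE_PRIME_END, THREE_PRIME_END)
print(len(MINI_PANHANDLE), len(FIVE_PRIME_END), len(THREE_PRIME_END), linker)
```

With the sequences as printed, the two 18-nucleotide arms account for all but three nucleotides (UUG) of the 39-nucleotide mini-panhandle.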
a–d, Four surface views at roughly 0° (a), 180° (b), 110° (c) and 290° (d) rotations with PA, PB1 and PB2 uniformly green, cyan and red, respectively. Major subdomains are labelled. The vRNA 5′ and 3′ ends are pink and yellow, respectively.
a, Interactions of the PA-linker (green tube) with the outer surface of the fingers (pale cyan) and palm (pale salmon) domains of PB1. Contacts are mediated by both highly conserved hydrophobic residues (for example, PA residues Phe 205, Phe 211, Leu 214, Pro 220, Tyr 226, Phe 229, Tyr 232, Val 233, Ile 242, Leu 246, Met 249 and Val 253) and polar interactions (for example, PA Glu 203, Lys 230, Glu 243 and Lys 245 to PB1 Arg 162, Glu 331, His 465 and Asp 86, respectively). b, Transparent surface diagram showing the anchoring of the PA endonuclease domain (forest green) onto the PB1-Cter–PB2-Nter interface region (cyan/red) and its position relative to the PB2 cap-binding domain (orange). The nuclease helix α4 packs parallel to the penultimate PB1 helix α21 through both hydrophobic interactions (for example, PA Ile 86, Ile 90 and Ile 94 with PB1 Ser 720, Ile 724 and Ile 728, respectively) and polar interactions (for example, PA Glu 77 with PB1 Arg 727). Other contacts include the PB2 170-loop, which interacts with the same PA helix α4 in the vicinity of Trp 88. The endonuclease insertion (PA 70-loop, residues 67–74) also packs on the first part of the last PB1 helix α22. The total buried surface area between the endonuclease and PB1/PB2 is 2,265 Å².
a, View straight along the putative NTP entrance tunnel towards the putative priming loop (magenta) in the internal cavity. The NTP channel is lined with basic residues from the fingertips (Lys 235, Lys 237 and Arg 239, blue), fingers (Arg 45, cyan) and palm (Lys 308, Lys 480 and Lys 481, red) that are absolutely conserved in all influenza strains. The fingertips are in close proximity to PA helices α20 and α21 and to the loop of the 5′ hook. b, Surface view as in a showing that the putative priming loop in the interior cavity is visible through the NTP tunnel. c, View straight along the template entrance tunnel towards the priming loop (magenta) in the internal cavity. The tunnel is lined by residues conserved in all influenza strains and from all three subunits, Arg 507 and Asp 509 from PA (green), Tyr 30, Arg 126, Met 227, Lys 229 and Asp 230 from PB1 (cyan), and Arg 38, Lys 41 and Asn 42 from PB2 (red). d, Surface view as in c showing that the internal priming loop is visible through the template tunnel.
Protein interactions of the distal 3′ end showing the role of PB2-Nter (red). PB2 residues Arg 46 and Trp 49 and PA residue Lys 567 stabilize the sharp turn between 3′ nucleotides C8 and G9. PB2 Arg 38 and PB1-Cter residues Asn 671, Arg 672 and Asn 676 also bind the 3′ end. In the accompanying paper18, Fig. 2a shows the interactions with the complete 3′ end as observed in the FluB vRNA complex.
Simplified diagram showing the vRNA sequence and secondary structure in the bat FluA crystals, including the vRNA-mediated crystal contact (inverted sequences) that forms an extended duplex. Crystals were grown with 3′-end nucleotides 1–18 or 3–18, but only nucleotides 6–18 were visible (hence 1–5 are in italics).
This file contains Supplementary Figure 1, Supplementary Discussions and Supplementary References. (PDF 1975 kb)
View and colouring as in Fig. 1. (AVI 16280 kb)
View and colouring as in Fig. 1. (AVI 24293 kb)
Pflug, A., Guilligay, D., Reich, S. et al. Structure of influenza A polymerase bound to the viral RNA promoter. Nature 516, 355–360 (2014). https://doi.org/10.1038/nature14008