Influenza virus types A, B and C are human pathogens that cause mild to deadly respiratory infections. For expression of influenza virus proteins during infection, the viral RNA polymerase needs to transcribe the segmented genome of the influenza virus into mRNAs containing a 5′ cap-1 structure and a 3′ poly(A) tail1,2. However, unlike those of other RNA viruses3, the genomes of influenza viruses do not encode molecules with capping activities. Instead, the influenza virus RNA polymerase starts viral transcription using capped primers derived from host-cell pre-mRNAs4. Kouba, Drncova and Cusack now reveal how the influenza B virus RNA polymerase starts the process of turning ‘snatched primers’ into influenza virus mRNAs5.

The influenza virus RNA polymerase is composed of the viral proteins PB1, PB2 and PA. PB1, the N terminus of PB2 and the C-terminal domain of PA form the core of the polymerase (Fig. 1, stage 1). They are responsible for interacting with the terminal ends of viral promoters (Fig. 1, stage 2) and for catalyzing the nucleotidyl-transfer reaction. The core also binds the Ser5-phosphorylated C-terminal domain of initiating cellular polymerase II complexes6,7 in order to bring the polymerase into the vicinity of a host pre-mRNA (Fig. 1, stage 3). Capped primers are generated by the flexible PB2 C-terminal domain and PA N-terminal domain, which bind capped pre-mRNAs and cleave capped pre-mRNAs, respectively8 (Fig. 1, stage 3).

Fig. 1: Stages of influenza virus transcription initiation.
figure 1

(1) The influenza virus RNA polymerase consists of a central core (blue), the flexible PA endonuclease (PA endo; green), the PB2 cap-binding domain (PB2 cap; pink) and the PB2 627 domain (PB2 627; pink). Channels lead from the solvent to the active site of the polymerase to facilitate the entry or exit of templates, NTPs and products. The priming loop (PL; blue dotted line) occupies part of the active-site cavity. The flexible domains can adopt at least two orientations, but for clarity the apoenzyme is shown in the conformation that is compatible with transcription. (2) The segments that make up the influenza virus genome contain 3′ (orange) and 5′ (gray) promoter elements that bind to the top of the RNA polymerase. The location of the 3′ terminus is dynamic and varies between binding at the top of the RNA polymerase or threading through the template entry channel. (3) Binding of viral RNA polymerase to Ser5-phosphorylated (P; yellow) C-terminal domain (CTD) residues of initiating eukaryotic polymerase II (Pol II) complexes brings the PB2 cap-binding domain in close proximity to emerging, capped cellular pre-mRNAs (red). The PA endonuclease cleaves these pre-mRNAs 10–14 nucleotides downstream of the cap to generate a capped primer. (4) Reorientation of the PB2 cap-binding domain threads the capped primer into the product exit channel. The capped primer then forms base pairs with the first, second or third base (uridine, cytosine or guanosine, respectively) of the 3′ terminus of the template. (5) In the mixed pre-initiation complex, the extended template (black) forms extra base pairs with a 14-nucleotide-long primer that binds to the second base of the 3′ terminus of the natural template. (6) Addition of CTP (red) to the pre-initiation complex reveals a density in the +1 binding site. (7) The extended template forms 5 base pairs with a 15-nucleotide-long primer. The primer binds to the third base of the 3′ terminus of the natural template. (8) Extension of the capped primer in the presence of only ATP and GTP leads to the formation of a stalled complex, which forms a 9–base pair duplex in the active-site cavity. The duplex is separated at PB2 Thr207 (T). Several rearrangements in the polymerase (dotted arrows) widen the exit channels to allow product and template egress. Further extension will involve translocation of the main part of the viral RNA template (dotted dashed line) and the 5′ promoter.

Typical influenza virus primers are 10–14 nucleotides long and form 1– to 3–base pair interactions with the 3′ terminus of the viral genome segment, depending on the sequence of the primer9 (Fig. 1, stage 4). Previously determined crystal structures8,10 show how the template and a capped primer are threaded into the core of the RNA polymerase (Fig. 1, stage 4). However, owing to the flexibility of the 3′ terminus of the viral promoter11,12 (Fig. 1, stage 2) and limited base pairing between template and capped primer, transcription-initiation complexes have proven refractory to crystallization10.

Kouba, Drncova and Cusack cleverly overcome that problem by extending the 3′ end of the template by three bases5 and thus increasing the number of base pairs that a capped primer and the viral template can form. Using this approach, they capture how the 3′ end of a 14-nucleotide-long primer is coordinated by the RNA polymerase and bound to the –1 base of the template (Fig. 1, stage 5). This puts the primer in a position that is ready for extension. Indeed, addition of CTP, which is complementary to the +1 base of the template, to the pre-initiation crystal yields a density opposite the +1 guanosine (Fig. 1, step 6). The structure is, however, in a pre-catalysis state, as only one magnesium ion of the requisite two is present.

To mimic the addition of CTP, Kouba, Drncova and Cusack use a 15-nucleotide-long primer that ends in a cytosine5 (Fig. 1, stage 7). This cytosine binds to the +1 guanosine and is accompanied by a pyrophosphate in the location of the triphosphate of the pre-catalysis CTP. Since the template has not translocated relative to stage 5, it is likely that pyrophosphate release is coupled to active-site rearrangements that are induced by translocation. Comparison of stage 7 (a pre-translocation state) and stage 5 (equivalent to a post-translocation state) also reveals residues that select and coordinate the incoming NTP. Studying these residues may afford a better understanding of the error rate of influenza virus RNA polymerase13 or how mutations in this active site confer resistance to nucleotide analogs, such as favipiravir14.

In the three structures discussed above, additional primer–template base pairs form downstream of the initiation site due to extension of the template (Fig. 1, stages 5–7). These extra base pairs are artificial and induce changes that probably do not occur during normal transcription initiation. The complexes are thus in a mixed state. However, the structural rearrangements are nevertheless informative. The most striking example is a remodeling and partial extrusion of the priming loop, which is a structure that reduces the size of the active-site cavity (Fig. 1, stages 1–3) and has a key role in replication initiation15. Published experiments with RNA polymerases of influenza virus and other viruses have shown that truncation of the priming loop stimulates elongation15,16,17. The crystals generated by Kouba, Drncova and Cusack5 are consistent with those observations and suggest that the priming loop would be displaced to accommodate the natural product-template duplex, which would occupy the same location after one to two rounds of NTP incorporation and translocation.

Addition of ATP and GTP to the mixed post-initiation complex (Fig. 1, stage 7) allows the polymerase to extend the capped primer by five bases, before it pauses when a guanosine in the template enters the +1 site and CTP is required (Fig. 1, stage 8). A 3.2 Å cryo-electron microscopy structure of this stalled complex reveals that elongation requires unwinding of the promoter duplex at the top of the polymerase and translocation of the template down the template entry channel5. The structure also beautifully shows that the template and growing influenza virus mRNA are separated 9 base pairs downstream of the active site. In particular, PB2 residue Thr207 seems to serve a key role in duplex unwinding, because it stacks onto the last template base of the duplex and prevents formation of a tenth base pair (Fig. 1, stage 8). It is possible that this residue acts like a gate, allowing one template base to pass into the template exit channel before closing again onto the next template base. In addition, it is tempting to speculate that the 9-base duplex confers processivity and limits early dissociation of influenza virus RNA products, which can result in early termination or transcriptional realignment9,18.

Elongation also involves conformational changes in the polymerase. Most notable is the complete extrusion of the priming loop, which generates more space in the active-site cavity and ensures that unwound template base(s) can enter the exit channel (Fig. 1, stage 8). In addition, repositioning of the thumb subdomain and of PB2 and PA residues helps widen the active-site cavity and polymerase exit channels. These changes all aid a transition to progressive elongation, which will eventually push the template through the exit channel, while the growing mRNA will bulge out of the product exit channel before being released by the PB2 cap-binding domain19 and exported to the cytoplasm for translation.

In summary, the work by Kouba, Drncova and Cusack5 constitutes a considerable advance in the understanding of influenza virus transcription. Not only do the new structures and observed conformational changes offer plenty of targets for structure-guided mutagenesis studies and the design of new antivirals, they will probably also aid studies aimed at investigating the dynamics of the polymerase or understanding how aberrant RNA molecules that stimulate the immune response20 are produced. Moreover, the methodology used may help further structural studies of influenza virus RNA synthesis.