Structural insights into RNA polymerases of negative-sense RNA viruses

te Velthuis, Aartjan J. W.; Grimes, Jonathan M.; Fodor, Ervin

doi:10.1038/s41579-020-00501-8

Download PDF

Review Article
Published: 25 January 2021

Structural insights into RNA polymerases of negative-sense RNA viruses

Nature Reviews Microbiology volume 19, pages 303–318 (2021)Cite this article

12k Accesses
54 Citations
39 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 02 February 2021

This article has been updated

Abstract

RNA viruses include many important human and animal pathogens, such as the influenza viruses, respiratory syncytial virus, Ebola virus, measles virus and rabies virus. The genomes of these viruses consist of single or multiple RNA segments that assemble with oligomeric viral nucleoprotein into ribonucleoprotein complexes. Replication and transcription of the viral genome is performed by ~250–450 kDa viral RNA-dependent RNA polymerases that also contain capping or cap-snatching activity. In this Review, we compare recent high-resolution X-ray and cryoelectron microscopy structures of RNA polymerases of negative-sense RNA viruses with segmented and non-segmented genomes, including orthomyxoviruses, peribunyaviruses, phenuiviruses, arenaviruses, rhabdoviruses, pneumoviruses and paramyxoviruses. In addition, we discuss how structural insights into these enzymes contribute to our understanding of the molecular mechanisms of viral transcription and replication, and how we can use these insights to identify targets for antiviral drug design.

Improving prime editing with an endogenous small RNA-binding protein

Article Open access 03 April 2024

Jun Yan, Paul Oyler-Castrillo, … Britt Adamson

Engineered virus-like particles for transient delivery of prime editor ribonucleoprotein complexes in vivo

Article Open access 08 January 2024

Meirui An, Aditya Raguram, … David R. Liu

Expedient production of site specifically nucleobase-labelled or hypermodified RNA with engineered thermophilic DNA polymerases

Article Open access 09 April 2024

Mária Brunderová, Vojtěch Havlíček, … Michal Hocek

Introduction

Negative-sense RNA viruses (NSVs) include many highly prevalent human pathogens that can be responsible for respiratory infections, haemorrhagic fever and encephalitis. NSVs can also cause disease in livestock, arthropods and plants and are considered a substantial economic burden worldwide. In humans, NSVs are responsible for frequent epidemics, such as those caused by human respiratory syncytial virus (HRSV), human parainfluenza viruses, measles virus, mumps virus and the influenza A virus (IAV) and influenza B virus (IBV). NSVs that exist in animal reservoirs can cause occasional zoonotic outbreaks that are often associated with high mortality and morbidity, such as avian IAVs, Ebola virus (EBOV), Lassa mammarenavirus (LASV) and rabies lyssavirus (RABV). A better understanding of the molecular mechanisms underlying NSV replication and the development of new antiviral drugs against NSV infections are therefore essential to combat the impact of these viruses.

The genome of NSVs consists of one or more single-stranded, negative-sense RNA molecules that are assembled with multiple copies of viral nucleoprotein into megadalton-sized viral ribonucleoprotein (vRNP) complexes with a helical configuration¹ (Box 1). The nucleoproteins are the main determinants of this helical configuration and oligomerize to form a scaffold for the genomic viral RNA (vRNA). In non-segmented NSVs (nsNSV), these vRNP structures are highly symmetrical and relatively rigid^1,2. In segmented NSVs (sNSVs), each individual segment is contained in a distinct vRNP complex that is relatively flexible^3,4. In these vRNPs, the vRNA segment forms a pseudocircle through non-covalent RNA–RNA interactions between the 5′ and 3′ termini (Box 1). These 5′ and 3′ termini are typically partially complementary and highly conserved within the same group of viruses.

All NSVs encode an RNA polymerase that contains an RNA-dependent RNA polymerase (RdRP) domain and replicates and transcribes the vRNA genome in the context of vRNPs^5,6. Replication of the genome produces a positive-sense antigenome known as a complementary RNA (cRNA) for sNSVs (Box 2), which is encapsidated by nucleoprotein into a complementary RNP (cRNP). The antigenome serves as the template for genome synthesis. The vRNA genome is used as a template for transcription of positive-stranded mRNAs (Box 2). Some NSVs, such as arenaviruses, use an ambisense coding strategy, which involves transcription of the negative-sense genome as well as the cRNA (Box 2). RNA polymerases of NSVs produce mRNAs that contain a 5′ cap structure and, in most virus families, a 3′ polyadenosine (poly(A)) tail (Box 2). In most NSVs, replication and transcription of the genome occurs in the cytoplasm, whereas in orthomyxovirus family NSVs these processes occur in the nucleus.

The RNA polymerases of nsNSVs are capable of de novo synthesis of 5′ cap structures using their capping and methyltransferase (MT) domains^7,8. The RNA polymerases of sNSVs lack intrinsic capping and MT activities and instead use a cap-snatching mechanism⁹. This mechanism involves a cap-binding (CapB) domain and an endonuclease (Endo) domain of the RNA polymerase. The stolen cellular capped RNA fragments are used as primers by the RNA polymerase to initiate viral transcription to generate viral mRNAs with a heterologous host RNA-derived sequence at their 5′ end¹⁰. In addition to ensuring efficient translation of viral mRNAs, the host-derived capped RNA primers provide access to alternative open reading frames and thus expand the viral proteome¹¹.

NSVs encode proteins that act as cofactors of the viral RNA polymerase. These include the phosphoprotein (P protein) in paramyxoviruses, pneumoviruses and rhabdoviruses, M2-1 in pneumoviruses and VP24, VP30 and VP35 in filoviruses^{12,13,14,15,16}. These cofactors have been implicated in tethering the polymerase to the nucleoprotein-coated RNA template and assisting in nucleoprotein recruitment to nascent RNA by forming nucleoprotein–P protein complexes to prevent premature oligomerization of nucleoprotein and nonspecific binding of nucleoprotein to RNA^17,18,19.

Structures of NSV RNA polymerases and complexes of these polymerases with their cofactors have been difficult to obtain owing to difficulties in expressing recombinant forms of these proteins and cofactors and in their crystallization. Recent developments in the structural analysis of proteins by cryo-electron microscopy (cryoEM), as well as advances in X-ray crystallography and protein expression technologies, have led to reports of high-resolution structures of RNA polymerases of the orthomyxoviruses IAV, IBV, influenza C virus (ICV) and influenza D virus (IDV)^{20,21,22,23,24}, the peribunyavirus La Crosse virus (LACV)^25,26, the phenuivirus severe fever with thrombocytopenia syndrome virus (SFTSF)^27,28, the arenaviruses LASV and Machupo virus (MACV)²⁹, the rhabdoviruses vesicular stomatitis virus (VSV) and RABV^7,30,31, the pneumoviruses human metapneumovirus (HMPV) and HRSV^32,33,34 and the paramyxovirus human parainfluenza virus type 5 (HPIV)³⁵. Combined with advances in our understanding of the enzymatic activities of RNA polymerases, these structures provide unprecedented structural insights into the replication and transcription machinery of NSVs. An emerging picture is that these multifunctional molecular machines use a modular architecture consisting of a central RdRP domain that is linked to multiple functional domains through flexible linkers. The ability to rearrange these domains enables NSV RNA polymerases to perform and coordinate a set of complex catalytic reactions to produce the different types of RNA that NSVs require to complete an infection cycle.

Box 1 Ribonucleoprotein complexes of negative-sense RNA viruses

The genome and antigenome of negative-sense RNA viruses (NSVs) are encapsidated by oligomeric nucleoprotein. By coating the genome and antigenome, nucleoprotein protects the RNA from degradation and prevents double-stranded RNA (dsRNA) from forming between RNAs of opposite polarity. All NSV nucleoproteins are composed of two globular domains, a head and a body domain, as in the influenza A virus (IAV) nucleoprotein⁸⁷, or an N-terminal lobe (NT lobe) and C-terminal lobe (CT lobe), as in the Hantaan virus nucleoprotein⁸⁸, or an N-terminal domain (NTD) and C-terminal domain (CTD), as in the measles virus nucleoprotein⁸⁹ (see the figure; schematics of domain organization in part a). The RNA is bound in a positively charged groove located between the domains or lobes. Oligomerization of the nucleoproteins is facilitated by various extensions, such as an internal tail-loop (as in the IAV), flexible N-terminal and/or C-terminal arms (as in the Hantaan and measles virus nucleoproteins), or other mobile subdomains (see the figure; surface and cartoon representations in part b). Ribonucleoproteins (RNPs) of non-segmented NSVs (nsNSVs) form regular helical structures, whereas the RNPs of segmented NSVs (sNSVs) generally form irregular, flexible, supercoiled structures^{4,88,89,90,91,92} (see the figure; surface representations of nucleoprotein oligomer in part c). These differences can be explained by the intrinsic differences between the nucleoproteins of different virus families. In the RNPs of nsNSVs, the arms that mediate nucleoprotein oligomerization have limited flexibility, such that each nucleoprotein forms identical contacts with each of its neighbours and thus automatically generates a repeated, helical structure (as in the measles virus RNP⁸⁹). In the RNPs of sNSVs, the arms that mediate formation of nucleoprotein oligomers have flexible hinges that have the ability to form assemblies with very variable geometries. The IAV^4,93,94 and Hantaan virus⁸⁸ RNPs appear as flexible rods comprising two anti-parallel superhelical strands of RNA-bound nucleoproteins with a loop at one end and the polymerase holding together the viral RNA extremities at the other end. The docking of the nucleoprotein atomic structure in cryo-electron microscopy reconstructions of influenza virus viral RNPs (vRNPs) showed that the nucleoproteins in the strands could adopt a wide range of positions⁴. In addition, nucleoproteins could assume different degrees of rotation relative to the strand axis. Owing to this intrinsic flexibility, only low-resolution structures have been obtained for IAV vRNPs. The RNA-binding groove region generally shows the lowest resolution of the structure, pointing to the RNA as one of the most flexible parts of the complex. This observed flexibility of IAV vRNPs is fully consistent with chemical mappings of the RNA structure within vRNPs, which showed that the RNA can accommodate secondary structures and nucleoprotein is not evenly distributed on the RNA^95,96,97.

Box 2 Genome and RNA synthesis products of negative-sense RNA viruses

The genomes of negative-sense RNA viruses (NSVs) vary in length and segmentation. The non-segmented NSVs (nsNSVs) contain a single negative-sense RNA molecule that encodes several genes, as in the vesicular stomatitis virus (VSV) genome (see the figure). Replication of the genome involves formation of a replication intermediate termed the antigenome, which is used by the viral RNA polymerase to produce more genome copies. Expression of the genes in the genome occurs by a single-entry transcription mechanism, in which the RNA polymerase engages with the genome at the 3′ terminus and initiates or terminates transcription at each gene start or termination site, respectively. All viral mRNAs contain a 5′ cap structure and 3′ polyadenosine (A_n) tail. In contrast to nsNSVs, the genome of segmented NSVs (sNSVs) contains two to eight segments (see the figure). Furthermore, whereas replication occurs through a replication intermediate, expression of the encoded genes can occur in a number of ways. The La Crosse virus (LACV) segments each encode one gene, and these genes are expressed as viral mRNAs that contain a 5′ cap structure but no polyadenosine tail. The genome segments of Machupo virus (MACV) contain genes in the negative and positive orientation, which are transcribed by the viral RNA polymerase by transcription initiation at the 3′ terminus of the genome or the antigenome, respectively. MACV mRNAs contain a 5′ cap structure but no 3′ polyadenosine tail. Replication of the influenza A virus (IAV) genome segments also involves an antigenome intermediate, referred to as the complementary RNA (cRNA). Transcription starts at the 3′ terminus of the genome and terminates at a stretch of U residues close to the 5′ terminus, resulting in 5′ capped and 3′ polyadenylated products. Transcripts of the NS and M segments are spliced by the host splicing machinery.

Structure of the RNA polymerase

The RNA polymerases of NSVs have a molecular mass of 250–450 kDa and are expressed as a single polypeptide, the L protein, or as three polypeptides that together form a heterotrimeric RNA polymerase complex. All NSV polymerases comprise an RdRP domain, showing the right hand-like fold with thumb, fingers and palm subdomains that are typical of polymerase domains³⁶, and several other functional domains.

RNA polymerases of sNSVs

The heterotrimeric RNA polymerase encoded by influenza viruses consists of the polymerase basic 1 (PB1), PB2 and polymerase acidic (PA; known as P3 in ICVs and IDVs) proteins (Fig. 1a). The PB1 subunit is at the centre of the polymerase complex and contains the RdRP domain with the catalytic residues for RNA synthesis^20,21,23,37. The PA subunit consists of two domains, the N-terminal Endo domain and the PA C-terminal domain (PA-C), which is deeply integrated into the thumb subdomain of the RdRP domain and connected to the Endo domain by a long linker that wraps around the RdRP domain (Fig. 1a). The PB2 subunit is a modular protein composed of several domains. The N-terminal third of PB2 (PB2-N) is tightly associated with the RdRP domain, while the C-terminal two-thirds of PB2 consists of several discrete domains, including the CapB domain, the middle and linker regions flanking CapB that form a rigid unit denoted as the Mid-link domain, the 627 domain (named for amino acid residue 627 in PB2 that determines host range³⁸) and a C-terminal nuclear localization signal (NLS) domain, all separated by flexible linkers. The Endo domain of PA and the domains in the C-terminal portion of PB2 are linked to the RNA polymerase core by flexible linkers and can be arranged in different configurations relative to the RNA polymerase core, depending on association of the heterotrimeric complex with RNA and with viral and cellular proteins. In the structure of the human IAV RNA polymerase bound to vRNA and capped RNA primer³⁷ (Fig. 1a), the 627 domain packs against the palm and thumb subdomains of the RdRP domain. The CapB domain faces the Endo domain and packs on the Mid-link domain, which is positioned between the 627 domain and the CapB domain. The NLS associates with the 627 domain.

**Fig. 1: Domain organization and overall structure of NSV RNA polymerases.**

The L proteins of sNSVs, such as the LACV L protein (Fig. 1b), are single polypeptide RNA polymerases comprising domains that are similar to those in the trimeric RNA polymerase. All sNSV L proteins consist of a central RdRP domain that is flanked by an N-terminal PA-C-like domain and a C-terminal PB2-N-like domain^25,26,27,39, forming a globular RNA polymerase core. This globular core is linked to an N-terminal Endo domain and a C-terminal extension that, in the LACV L protein, is composed of several distinct domains, including a CapB domain, a middle (Mid) domain (composed of two regions of the L polypeptide flanking CapB) and a zinc-binding domain (ZBD)²⁵ (Fig. 1b). The Endo domain is linked to the RNA polymerase core by a long flexible linker that wraps around the RdRP domain, and is positioned between the RNA polymerase core and the protruding C-terminal extension of the L protein (that is, the PB2-like region), making extensive contacts with multiple regions of the RNA polymerase core on one side and the CapB domain on the other. The C-terminal extension protrudes away from the core and forms an elongated arc-shaped structure that includes the Mid domain, the CapB domain and the ZBD (Fig. 1b). Although the CapB domain is conserved, the other domains of the C-terminal extension vary considerably in sequence and structure between L proteins of different sNSVs. In the SFTSV L protein²⁷, the ZBD is replaced by a lariat domain, whereas the LASV and MACV L proteins²⁹ contain a 627-like domain that is unresolved in current structures (see below). As in the influenza virus polymerases, the Endo domain and domains in the C-terminal extension are linked to the RNA polymerase core by flexible linkers and can be arranged in different configurations relative to the RNA polymerase core²⁵ (see below).

In sNSVs, the RdRP, Endo and CapB domains are the structurally most conserved parts of the RNA polymerase^9,39. The Endo domain shows a conserved fold across all sNSV polymerases and has the structural characteristics of the PD-(D/E)-XK superfamily of nucleases that use divalent metal ions for nucleic acid cleavage. In the IAV Endo domain, H41, P107, D108, E119 and K134 constitute the active centre, and analogous amino acid residues exist in the Endo domains of other sNSV polymerases. However, in the Endo domain of arenaviruses, the histidine upstream of the conserved PD-(D/E)-XK motif is replaced by an acidic amino acid residue, resulting in a difference in coordination of divalent metal ions in the active site^40,41. This difference correlates with the inactivity of the isolated domain, suggesting that a rearrangement of the domain or contribution from the rest of the RNA polymerase is needed for endonuclease activation, potentially providing arenaviruses with a transcription ‘on/off’ switch⁴⁰. The CapB domains of sNSV polymerases show little homology at the amino acid level but fold into a similar structure, which is composed of a β-sheet formed by five to seven antiparallel β-strands surrounding a single long α-helix or a bundle of several α-helices³⁹. All solved sNSV CapB domains employ the same mechanism of cap binding whereby the methylated guanine of the cap structure is sandwiched between two aromatic amino acid residues by stacking interactions. In the IAV polymerase, H357 and F404 in PB2 form the aromatic sandwich. A histidine performing this role seems to be unique to human and avian IAV whereas phenylalanine, tryptophan or tyrosine perform this role in the cap-binding proteins of other sNSVs described to date.

RNA polymerases of nsNSVs

The L proteins of nsNSVs, such as that of VSV (Fig. 1c), are also single polypeptide RNA polymerases and fold into three catalytic and two structural domains^7,31. The core of the nsNSV L protein structure is formed by the N-terminal RdRP domain and the capping (Cap) domain, which possesses polyribonucleotidyl transferase (PRNTase) activity. The Cap domain adopts a kidney-shaped fold and interacts with the RdRP domain over a large interface, forming the RdRP–Cap module^7,31 (Fig. 1c). The RdRP–Cap module is linked through the connector domain (CD) to the third catalytic domain of the L protein, the dual-specificity MT domain, possessing both guanine-N7-methyltransferase and nucleoside-2′-O-methyltransferase activities, which is followed by the C-terminal domain (CTD). In the VSV L–P complex, the CD, MT domain and CTD form three globular domains that closely associate with each other as well as with the RdRP–Cap module, forming an overall compact structure.

The CD is only weakly conserved at the amino acid level across different nsNSV families and is believed to have a structural role. The CTD is not conserved at the sequence or structural level, and substantial differences in fold and size have been observed between the CTDs of the pneumovirus and rhabdovirus L proteins^7,32. By contrast, the Cap and MT domains are relatively well-conserved and participate in synthesizing a 5′ cap structure. The Cap domain catalyses the formation of GTP-capped pre-mRNA by a mechanism that differs from eukaryotic capping⁴². The nascent RNA transcript with a 5′ triphosphate is first covalently linked to a catalytic histidine residue (H1241 in the VSV L protein) located in a conserved histidine–arginine (HR) motif, in a reaction that leads to formation of a monophosphate RNA–L protein intermediate. This linkage is attacked by a GTP molecule, resulting in addition of GDP to the monophosphate RNA to form a GTP-capped pre-mRNA. The HR motif and a GxxT motif, which are believed to be involved in GTP binding, are conserved across the Cap domains of nsNSVs. The MT domain is structurally well-conserved across nsNSVs and encodes a dual-function enzyme that methylates the GTP cap of viral mRNAs, first at the 2′-O and then at the N7 position. A GxGxG motif forms the binding site for the methyl donor, S-adenosylmethionine (SAM), and a conserved set of charged residues (K-D-K-E) forms the catalytic tetrad for methyl group addition.

The RNA polymerase cofactor P protein consists of three domains: an N-terminal domain (NTD), a central oligomerization domain (OD) and a C-terminal X domain (XD)¹⁸ (Fig. 1c). The NTD is generally unstructured and binds to a nascent nucleoprotein monomer, forming a nucleoprotein–P protein complex to prevent premature oligomerization of nucleoprotein and its nonspecific binding to RNA⁴³. The XD is also intrinsically disordered in solution but associates with nucleoprotein in the context of vRNPs⁴⁴. The OD mediates the formation of P protein homodimers or homotetramers that interact directly with the L protein, tethering the polymerase to the nucleoprotein-coated RNA template⁴⁵.

The P protein can associate with the RNA polymerase in several ways. The structure of the VSV L protein has been solved in complex with an N-terminal fragment of P protein in which three motifs have been resolved³¹ (Fig. 1c). These motifs make clear interactions with the CTD, CD and RdRP domains, locking the CD, MT domain and CTD in a fixed arrangement with respect to the large RdRP–Cap module³¹. The overall structure of the RABV L–P complex is very similar to that of VSV L–P³⁰, whereas in the HPIV L–P complex (which contains four copies of P protein), the CD, MT domain and CTD have been observed in different positions relative to the RdRP–CAP module³⁵. These differences in L–P interactions most likely reflect different functional states of the L proteins in which they were captured rather than intrinsic differences in L–P interactions between different viruses (see below). In the HRSV and HMPV L–P complexes, which also contain four copies of P protein, the CD, MT domain and CTD remain entirely unresolved, owing to their flexibility^32,33,34 (see below).

Structure of the RdRP domain

The RdRP domain is the only domain that is shared by the RNA polymerases of sNSVs and nsNSVs (Fig. 1). Four channels lead to and from the active site of the RdRP domain and facilitate template RNA entry and exit, nucleoside triphosphate (NTP) entry and product RNA exit^26,46 (Fig. 1). In the heterotrimeric RNA polymerase, the three subunits are integrated such that they all contribute to the formation of these channels⁴⁶.

The RdRP domain folds into a right hand-like shape with thumb, fingers (including fingertips) and palm subdomains that are characteristic of many RdRPs³⁶. The thumb subdomain forms the ‘right-side wall’ of the active site, whereas the palm subdomain forms the ‘floor’ with a central four-stranded β-sheet. The fingers subdomain forms the ‘roof’ and ‘left-side wall’ of the active site. The active site chamber of the RdRP domain is formed by the conserved RNA polymerase motifs A–F, the majority of which are located in the palm subdomain³⁶ (Fig. 1). Specifically, conserved motifs A and C in the palm subdomain contain the aspartate residues that bind to the two metal ions that coordinate nucleotide condensation⁴⁷. The active site is structurally conserved, whereas a clear divergence at the amino acid sequence level is present in motifs A and C of sNSV and nsNSV RNA polymerases³⁶.

A priming loop protrudes into the active site cavity and is involved in the formation of the first dinucleotide during de novo initiation^32,48,49. In sNSV RNA polymerases, the priming loop emerges from the thumb subdomain as a flexible β-hairpin^22,25,29 (Fig. 1a,b), whereas in the L protein of nsNSVs, the priming loop is a simple flexible loop that resides in the Cap domain next to the GxxG motif, from which it inserts into the RdRP domain^7,32,35 (Fig. 1c).

Binding to promoter RNA

An important feature of all sNSV polymerases is the ability to associate with the conserved, partially complementary termini of the genome and antigenome segments, which represent the vRNA and cRNA promoters, respectively. In influenza virus RNA polymerase complexes, the 5′ end of the vRNA inserts into a pocket formed by the PA and PB1 subunits near the template entry channel^{21,22,23,37,50} (Fig. 1a). The same binding pocket is used by the 5′ end of the cRNA. Both the vRNA and cRNA 5′ termini assume a ‘hook’ conformation and bind to the RNA polymerase in a sequence-specific manner. In the LACV RNA polymerase, the vRNA 5′ terminus binds in a similar binding site in an equivalent position close to the template entry channel (Fig. 1b). In influenza virus RNA polymerases, binding of the vRNA 3′ terminus has been observed at two different sites near the polymerase surface, either in a groove near the fingers subdomain²³, called the A-site, or at a site formed by residues of the PB1 and PA subunits, called the B-site^21,24,37 (Fig. 1a). In contrast to the vRNA 3′ terminus, the cRNA 3′ terminus binds exclusively to the B-site. This B-site is equivalent to the 3′ terminus-binding site observed in the LACV and MACV RNA polymerases^26,29 (Fig. 1b).

In influenza viruses, the initiation of replication on the vRNA 3′ terminus or cleavage of capped RNA prior to transcription initiation cannot occur without binding of the vRNA 5′ terminus, consistent with the idea that the vRNA 5′ terminus aids in recruitment of the vRNA 3′ terminus and activates the RNA polymerase. Indeed, the apo form of IAV and ICV polymerases harbours a highly unstable fingertip and priming loop^20,37, whereas the same regions in structures of vRNA 5′ terminus-bound polymerases of IAV and IBV are more ordered^22,23, suggesting that binding of the vRNA 5′ terminus contributes to the stabilization of the RdRP domain. Similar observations have been made for the LACV, LASV and MACV RNA polymerases^25,28,29,51. Although the fingertip and the associated finger extension loops in the RdRP domain of the LASV and MACV RNA polymerases are highly ordered, even in the absence of binding of the RNA 5′ end²⁹, the vRNA 5′ terminus boosts primer-independent and dinucleotide-primed RNA synthesis activity of these polymerases, in line with observations for the influenza virus and LACV polymerases^28,29,51. Intriguingly, the vRNA 5′ terminus significantly inhibits capped-RNA-primed transcription for both LASV and MACV polymerases, suggesting that in arenaviruses the vRNA 5′ terminus may serve a regulatory role in the replication and transcription processes²⁹.

No RNA binding has been observed in the currently available structures of nsNSV RNA polymerases, and it remains unclear how these polymerases recognize promoter elements in the genome and antigenome (Fig. 1c). It has been proposed that the N-terminal residues of the VSV and RABV L proteins, which are unresolved in current structures, as well as the CTD of the P protein, might be involved in the binding of template RNA and/or nucleoprotein, owing to their position near the template entry channel^30,31,42.

Conformational rearrangements

The NSV RNA polymerases display great conformational flexibility, which is essential for the different modes of initiation that the RNA polymerases use to replicate and transcribe the viral genomic RNA.

RNA polymerases of sNSVs

In the heterotrimeric RNA polymerase of influenza viruses, the Endo domain of PA and the C-terminal two-thirds of PB2 are linked to the RNA polymerase core by flexible linkers and can assume different positions relative to the RNA polymerase core (Fig. 2). Several fundamentally distinct conformations of the heterotrimeric RNA polymerase have been reported. In the structure of the influenza virus RNA polymerase apo form, the CapB domain packs against the palm subdomain, whereas the 627 domain is located near the nascent RNA exit channel and the Endo domain^20,50 (Fig. 2a). In this conformation, a part of the Mid-link domain inserts into the cap-binding pocket of the CapB domain, preventing cap binding. Furthermore, the Endo domain and CapB domain face away from each other and, consequently, this conformation of the polymerase is unable to perform cap-snatching⁵. In the active site, the priming loop is disordered. By contrast, where structures have been captured at different stages of transcription, the 627 domain and CapB domain are rearranged through large conformational changes in the Mid-link domain, which repositions the 627 domain next to the palm–thumb subdomain interface and the CapB domain opposite the Endo domain^23,24,47 (Fig. 2b–e). In the vRNA-template-bound ‘pre-initiation structure’, the priming loop is ordered, and the cap-binding pocket of the CapB domain is free to bind capped RNA and faces the Endo domain to facilitate cap-snatching (Fig. 2b). In the ‘post-cap-snatching’ structure, the CapB domain is rotated and the cap-binding pocket faces the product exit channel, consistent with a movement that would facilitate insertion of the cleaved capped RNA primer into the RdRP catalytic centre through the product exit channel²³ (Fig. 2c). Further conformational rearrangements, including changes in the RdRP domain, take place as the polymerase transits from pre-initiation and initiation to elongation and then to termination^24,47 (Fig. 2d,e). Specifically, the priming loop is displaced from the active site into the solvent as a disordered loop⁴⁷ (Fig. 2b–e). An entirely distinct polymerase conformation has been observed in an asymmetrical dimer of the heterotrimeric polymerase bound by the cellular factor ANP32A⁵² (Fig. 2f). In this dimer, one of the polymerases has vRNA bound and was proposed to function as a replicase, whereas the second polymerase acts as an encapsidating polymerase that is involved in the assembly of the nascent RNA into an RNP. The conformation of the Endo and CapB domains in the vRNA-bound replicating polymerase is similar to that observed in the apo form of this polymerase (Fig. 2a), and in the active site the priming loop is also disordered. The conformation of the encapsidating polymerase is entirely distinct from that of the replicating polymerase form as well as the cap-snatching-competent vRNA-bound pre-initiation state of the polymerase. Specifically, in the encapsidating polymerase, the PB2 CapB and Mid-link domains pack against the palm and thumb subdomains of the polymerase core and are separated from the 627 and NLS domains, which primarily make contacts with PA-C. The Endo domain of PA and parts of PB2-N and NLS of the encapsidating polymerase are disordered in the structure (Fig. 2f).

**Fig. 2: Structures of influenza virus RNA polymerase conformations.**

In the L proteins of sNSVs, the positions of the Endo domain, CapB domain and ZBD or 627-like domains also vary greatly and can adopt different configurations relative to the RNA polymerase core^{25,26,27,28,29} (Fig. 3). In the apo structure of the SFTSV L protein, the configuration of domains is similar to that observed in the apo and replicating polymerase structures of influenza virus. The CapB and Endo domain are packed against each other and are oriented such that the Endo active site faces away from the CapB domain^27,28 (Fig. 3a). In addition, the cap-binding pocket of the CapB domain is occupied by the blocker motif, similar to how the Mid-link domain blocks the CapB domain in the apo and replicating polymerase of influenza virus. Consequently, the captured SFTSV L protein structure represents a cap-snatching-incompetent conformation^27,28. The LACV L protein has been captured in conformations reflecting pre-initiation and elongation states²⁵ (Fig. 3b,c). In the pre-initiation structure with both the 5′ and 3′ termini of vRNA bound, the Endo domain is rotated by 180° relative to its position in the SFTSV L protein structure and faces the CapB domain to allow cap-snatching^25,26 (Fig. 3b). Transition from the pre-initiation to the elongation state with double-stranded RNA (dsRNA) bound in the active site is accompanied by large coordinated rotations of the Endo domain and C-terminal region that are made possible by the conformationally stable Mid domain, which acts as a central hub that mediates contacts between the RNA polymerase core, the CapB domain and the ZBD²⁵. Further movements take place in the RdRP domain, including the extrusion of the priming loop from the active site. In the structures of the apo LASV L protein and the MACV L protein with the vRNA 3′ terminus bound in the B-site²⁹, the Endo domain sits at the top of the fingers subdomain, close to the product exit channel, whereas the CTDs, including the CapB domain, remain unresolved in both structures (Fig. 3d,e).

**Fig. 3: Structures of sNSV L protein conformations.**

RNA polymerases of nsNSVs

Although no structures of nsNSV L proteins bound to template RNA are available, several conformations of apo L–P complexes have been captured that suggest that conformational rearrangements take place when the polymerase transits from the pre-initiation to the elongation state (Fig. 4). In the VSV and RABV L–P complexes, the Cap domain interacts with a large surface of the RdRP domain and blocks the nascent RNA exit channel^7,30,31 (Fig. 4a,b). In the same structures, the CD and CTD pack against the Cap domain, keeping the MT domain away from the nascent RNA exit channel. This conformation is, thus, likely an initiation conformation that must undergo a rearrangement before capping can take place. Such a different state is captured in the apo structure of the HPIV L protein with the CTD moving closer to the Cap domain³⁵ (Fig. 4c). In addition, the priming loop of the CAP domain is organized differently and forms a wider platform in the active site, suggesting that this conformation could reflect a post-initiation structure. Further changes relative to the VSV pre-initiation state can be observed in the apo structures of HRSV and HMPV L–P complexes (Fig. 4d,e). Here, the Cap domain movement exposes the nascent RNA exit channel, whereas the CD, MT domain and CTD are no longer resolved in the EM maps^32,33,34. Furthermore, in these structures, the priming loop of the Cap domain is fully retracted and collapsed onto the Cap domain. The different conformations of the priming loop in the VSV and HRSV L–P complexes suggest that these rearrangements may be coupled to a transition from initiation to elongation^32,34.

**Fig. 4: Structures of nsNSV L–P complex conformations.**

The observed conformations are affected by the binding of the cofactor P protein. In the VSV and RABV L–P complexes, which contain an N-terminal fragment of the P protein, the P protein interacts with and locks the CD, MT domain and CTD in a fixed arrangement with respect to the large RdRP–Cap module^30,31 (Fig. 4a,b). In the HIPV, HMPV and HRSV L–P complexes, the P protein forms a tetramer through the central OD^32,33,34,35 (Fig. 4c–e). Strikingly, there is a large degree of variation in the conformation of each of the four P monomers (P1, P2, P3 and P4). These different conformations allow the P protein to wrap around the RdRP domain of the L protein in a tentacular fashion. Although P1 and P4 make extensive contacts with L, P3 makes minimal contacts with L, and P2 interacts almost exclusively with the other P monomers. The L protein of HRSV primarily uses the fingers motif to interact with the OD and XD of P protein, but the palm motif of the RdRP domain is also involved. Interestingly, these L–P interactions do not fix the positions of the CD, the MT domain and CTD, which remain unresolved in the HMPV and HRSV L–P structures.

Transcription and replication

The heterotrimeric RNA polymerase of influenza virus has been captured at different stages of the transcription cycle and is currently the best understood in terms of how the RNA polymerase coordinates the different modes of initiation and termination during transcription and replication. Our current understanding of these processes in the L proteins of sNSVs and nsNSVs is largely based on our knowledge of the influenza virus RNA polymerase.

sNSV RNA polymerases

Transcription by the influenza virus RNA polymerase generates capped and polyadenylated mRNAs that are transcribed from the vRNA segments. Transcription is initiated by binding of the viral polymerase in the context of vRNPs to the serine-5-phosphorylated CTD of host RNA polymerase II (Pol II) via residues in the PA-C and the 627 domain and NLS domain of PB2^53,54. Pol II CTD binding stabilizes the cap-snatching-competent, pre-initiation conformation of the viral polymerase⁵⁴, which allows the CapB domain of the viral RNA polymerase to capture the cap structure of nascent host-capped RNA (Fig. 5a). After cleavage of the nascent host transcript by the Endo domain, the 3′ end of the capped primer is transferred into the product exit channel by a 70° rotation of the CapB domain, and the primer is stabilized by interactions with the Mid-link domain and the 3′ end of the vRNA template in the active site^23,24,47. The stability of this complex is dependent on both the length of the primer and its 3′ sequence and, consequently, by the number of bases that can pair between the primer and the template^55,56. Extension of the primer occurs in a template-dependent manner, although short duplications of nucleotides at the vRNA 3′ terminus have been observed in several sNSV mRNAs, indicative of a prime-and-realign mechanism^10,55,56,57. In the transition from initiation to elongation, the priming loop is expelled from the RdRP active site, opening the template exit channel, and stays extruded during steady-state elongation^24,47. During elongation, a 9–10 bp nascent strand–template duplex is maintained in the active site before the template and nascent RNA are separated and directed down their respective exit channels. Evidence from the IAV and IBV RNA polymerases shows that this separation occurs on a conserved aromatic residue of the lid subdomain of the PB2-N domain, at the end of the active site cavity^24,47. Other sNSV RNA polymerases also contain a lid subdomain in the PB2-N-like domain, suggesting that this mechanism of strand separation is conserved²⁵. The capped RNA product exits through the product exit channel without being encapsidated. Instead, it is bound by cellular mRNA-binding proteins, such as the nuclear cap-binding complex⁵⁸. After exiting through the template exit channel, the 3′ terminus of the template is guided along the exterior of the thumb domain to the B-site 3′ binding site^24,59, which likely ensures the integrity of the vRNP complex during elongation. During termination, the RNA polymerase stutters on an oligo(U) motif starting at position 17 from the 5′ terminus, involving U17 flipping in and out of the +1 active site position, to generate a poly(A) tail⁶⁰. The 5′ terminus of the template is not copied and remains bound to its binding site^61,62, while the RNA polymerase releases the viral transcript and recycles the vRNA template²⁴.

**Fig. 5: Mechanisms of RNA synthesis by NSV RNA polymerases.**

Replication by the influenza virus RNA polymerase is a two-step process with the vRNA template first being copied into a cRNA, which next acts as template for vRNA synthesis. cRNA synthesis is initiated de novo, without the assistance of a primer, and starts with the formation of a dinucleotide opposite the terminal 3′ residues of the vRNA segment (Fig. 5b). The replication initiation process is dependent on support by the priming loop and in particular by a conserved proline at the tip of the priming loop⁴⁸. During synthesis of a new genome from an antigenome template by the influenza virus RNA polymerase, initiation begins at positions 4 and 5 of the cRNA 3′ terminus^48,63 (Fig. 5c). The resulting dinucleotide is next realigned to the terminal 3′ residues of the antigenome by backtracking of the template that is mediated by a conformational change in the priming loop^37,64. This conformational change has been proposed to be dependent on the binding of a second, regulatory polymerase^37,65,66. The regulatory and replicating polymerases form a symmetrical dimer through PA-C, the thumbs subdomain of PB1 and an N-terminal subdomain of PB2 (refs^37,67). In the observed dimer, both polymerases are in a conformation similar to that described for the apo form of the polymerase (Fig. 2a). Mutation of residues in the dimer interface or its disruption with a nanobody reduces the efficiency of vRNA synthesis, likely owing to the prevention of template backtracking³⁷. Interestingly, symmetrical dimerization of the polymerase is mutually exclusive with binding of the 3′ termini of the cRNA and vRNA in the B-site, leading to the suggestion that dimer formation might regulate the location of the 3′ end of the template^21,37.

During replication, unlike during transcription, the nascent RNA is co-replicatively encapsidated by the binding of viral polymerase to the 5′ end and the recruitment of nucleoprotein to the body of the RNA to form cRNPs and vRNPs. This process is likely mediated by another type of dimerization of the heterotrimeric polymerase, as observed in the complex with the cellular factor ANP32A⁵² (Fig. 2f). In this asymmetrical polymerase dimer, the encapsidating polymerase is ideally positioned to capture the 5′ end of the nascent RNA product emerging from the active site of the replicating polymerase. ANP32A, an essential factor for the replication of the influenza virus genome, bridges the two polymerase heterotrimers, stabilizing their interaction^68,69. The N-terminal leucine-rich repeat (LRR) domain of ANP32A acts as the bridge while most of the C-terminal low complexity acidic region (LCAR) could not be fully resolved in the structures. It has been proposed that the disordered LCAR could act as a molecular whip⁷⁰ that recruits nucleoprotein in a manner analogous to that proposed for P protein during the replication by nsNSV L proteins.

Currently, it is not known which conformational rearrangements take place during replication and at which point the symmetrical and asymmetrical polymerase dimers form and dissociate. In the proposed replicating polymerase of the asymmetrical polymerase dimer, the 3′ end of the RNA template is unresolved in the active site, and the priming loop is disordered. Further rearrangements must take place upon insertion of the RNA template into the active site before replication can proceed. Furthermore, it is likely that some of the structural rearrangements described for transcription are also relevant for replication. Thus, the priming loop is most likely extruded from the active site of the replicating polymerase, and the already copied 3′ end of the RNA template binds to the B-site as it emerges from the active site through the template exit channel, to maintain RNP integrity during replication.

In comparison with the heterotrimeric polymerase of influenza viruses, much less is known about how L proteins of sNSVs carry out transcription and replication. However, considering that the vRNAs of L-protein-encoding sNSVs are assembled into vRNPs akin to those of influenza virus, and that the L protein associates with the vRNA 5′ and 3′ termini at equivalent binding sites, it is highly likely that transcription and replication also proceed in a manner similar to that of influenza virus.

nsNSV RNA polymerases

To transcribe the genome, the L proteins of nsNSVs initiate transcription at a promoter at the 3′ end of the genome, sequentially transcribing the leader region and five internal genes (N, P, M, G and L) into the leader RNA and five monocistronic mRNAs with a 5′ cap structure and a 3′ poly(A) tail, by using a stop–start transcription mechanism. This mechanism results in the generation of a gradient of mRNA abundance in relation to proximity to the promoter, with the nucleoprotein-encoding mRNA being the most abundant and L mRNA being the least abundant^6,42. Initiation and termination are regulated by cis-acting start–stop signals that flank each gene. To replicate the genome, the L protein initiates at the 3′ promoter and produces a full-length copy of the genome, ignoring the internal start–stop signals. The resulting antigenome is neither capped nor polyadenylated and is co-replicatively assembled into an RNP complex by association with nucleoprotein. The replicative intermediate antigenome acts as a template for further rounds of replication to generate genomic RNA for progeny virions.

Although no high-resolution structures have been obtained of the nsNSV RNA polymerases bound to RNA and no pre-initiation or elongation complexes have yet been reported, the currently available apo L–P complexes likely reflect different stages of the RNA synthesis process (Fig. 5d). In the VSV and RABV L–P complexes^7,30,31, the priming loop that emerges from the Cap domain protrudes into the active site of the RdRP domain, where it could stabilize the formation of the first dinucleotide using an aromatic residue and/or proline for de novo initiation^32,49, suggesting that the captured structures represent the polymerase in a pre-initiation state. This is in contrast to the HPIV, RSV and HMPV L–P structures, in which this loop is retracted from the RdRP active site and is folded into the Cap domain^32,33,34,35. The retraction of the priming loop is consistent with elongation during RNA synthesis, suggesting that these structures represent the polymerase in elongation mode. Retraction of the priming loop also creates a continuous cavity shared by the RdRP and Cap catalytic sites, which allows the nascent transcript to enter the catalytic cavity of the Cap domain. Capping occurs by an unconventional capping mechanism that involves the covalent attachment of the monophosphorylated 5′ terminus of the nascent transcript to a histidine side chain in the Cap active site, followed by the transfer of the 5′ monophosphate of the nascent transcript onto a GDP acceptor⁴². The covalent attachment of the 5′ end of the nascent transcript to the Cap active site cavity during capping and the force generated as a result of nascent RNA filling the cavity could result in the release of the CD–MT domain–CTD module from the RdRP–Cap core module. The freeing of the module would expose the MT catalytic site for the subsequent N7 and 2′-O methylation of the 5′ cap after the 5′ end of the nascent transcript has been transferred onto a GDP and released from the Cap domain. During replication, the RNA product likely exits at a site proximal to the N-terminal end of P protein, which is known to associate with monomeric nucleoprotein. Thus, the proximity of this site to the exit for replication products could facilitate the delivery of nucleoprotein to the growing RNA chain, ensuring co-replicative encapsidation of the nascent genome. Interestingly, the phosphorylation status of the N-terminal end of the P protein also has a role in the switch from transcription to replication, although the mechanism by which this occurs is currently unclear⁴². To reveal the detailed movements of polymerase domains it will be necessary to obtain further snapshots of transcribing and replicating polymerases with bound template RNA and nascent product RNA, as has recently been reported for influenza viruses^24,47.

Therapeutic applications

Nucleoside analogues

Several nucleoside analogues are able to inhibit NSV RNA polymerases. Favipiravir, ribavirin and remdesivir are broad-spectrum nucleoside analogues that can inhibit influenza virus, EBOV and HRSV RNA synthesis^71,72,73. These NTPs act by competing with purine incorporation, resulting in the introduction of mutations in the viral genome or leading to polymerase stalling^71,74. Resistance to favipiravir can emerge through mutations in thumb and fingers subdomains of RdRP⁷⁵. The nucleoside analogue N⁴-hydroxycytidine (NHC) was more recently identified and was found to inhibit IAV, IBV and HRSV infections⁷⁶ by inducing lethal mutagenesis; no resistance to NHC has been observed to date⁷⁷. The analogues ALS-8176 and ALS-8112 triphosphate inhibit HMPV and HRSV infections in nonhuman primates, although resistance to these drugs has been observed. The resistance mutations map to homomorph A (a structurally conserved extension of motif A^36,78) of the palm subdomain³² and likely alter the binding of the nucleoside analogues in the active site and prevent active site closure when a nucleoside is bound³².

Non-nucleoside inhibitors

Non-nucleoside inhibitors target the auxiliary enzymatic domains, inter-domain interactions or inter-subunit binding of the NSV RNA polymerases. For example, peptides have been designed that can compete with the inter-subunit interactions of the influenza virus heterotrimeric RNA polymerase^79,80. The IAV inhibitors baloxavir marboxil (BXA) and pimodivir (VX-787) target the PA Endo and PB2 CapB domains, respectively, of the IAV RNA polymerase and act as transcription inhibitors^{81,82,83,84,85}. BXA can also inhibit the Endo domains of other sNSVs⁴¹, including SFTSV and Heartland virus (HRTV), owing to the structural similarity of endonucleases among sNSV polymerases⁸⁴. However, resistance to BXA and VX-787 rapidly emerges in IAV infections through mutations near the active site of the Endo domain and cap-binding pocket of the CapB domain, respectively⁸⁶. In an example of using the different capping mechanism of nsNSVs compared with that of the host as a drug target, the HRSV inhibitor BI-compound D (BID) disrupts the capping process of HRSV nascent mRNAs. However, escape mutants resistant to this compound also arise quickly by changes in a pocket near the PRNTase active site³⁴.

Concluding remarks

The past seven years have witnessed unprecedented progress in the structural and functional understanding of the NSV RNA polymerases. High-resolution structures of numerous RNA polymerases of both sNSVs and nsNSVs have been solved. These structures have vastly increased our understanding of the transcription and replication mechanisms used by these viruses. In particular, the work on the influenza virus RNA polymerase has led to a detailed understanding of the molecular mechanisms of viral transcription initiation, elongation and termination, as well as replication initiation. This new mechanistic understanding of RNA synthesis opens avenues to the development of new antiviral agents that target the RNA polymerase of NSVs. This growth in our understanding of the biology of sNSV polymerases has not been replicated for the nsNSV RNA polymerases, and a number of fundamental questions remain. How do nsNSV RNA polymerases bind to the genomic or antigenomic RNAs, and what conformational rearrangements take place when the polymerase proceeds from initiation to elongation to termination? How are the nascent genomic and antigenomic RNAs encapsidated into an RNP? What factors determine whether the RNA polymerase transcribes the genome into mRNA or replicates it into an antigenome? Which host factors have fundamental roles in viral transcription and replication by supporting RNA polymerase function? Given the progress in our understanding of sNSV polymerases, we look forward to further advancements in our knowledge of NSV polymerases in the future.

Change history

02 February 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41579-021-00524-9

References

Luo, M., Terrell, J. R. & McManus, S. A. Nucleocapsid structure of negative strand RNA virus. Viruses 12, 835 https://doi.org/10.3390/v12080835 (2020).
Article CAS PubMed Central Google Scholar
Ruigrok, R. W., Crepin, T. & Kolakofsky, D. Nucleoproteins and nucleocapsids of negative-strand RNA viruses. Curr. Opin. Microbiol. 14, 504–510 (2011).
Article CAS PubMed Google Scholar
Gallagher, J. R., Torian, U., McCraw, D. M. & Harris, A. K. Structural studies of influenza virus RNPs by electron microscopy indicate molecular contortions within NP supra-structures. J. Struct. Biol. 197, 294–307 (2017).
Article CAS PubMed Google Scholar
Coloma, R. et al. Structural insights into influenza A virus ribonucleoproteins reveal a processive helical track as transcription mechanism. Nat. Microbiol. 5, 727–734 (2020).
Article CAS PubMed Google Scholar
Te Velthuis, A. J. & Fodor, E. Influenza virus RNA polymerase: insights into the mechanisms of viral RNA synthesis. Nat. Rev. Microbiol. 14, 479–493 (2016).
Article CAS Google Scholar
Fearns, R. & Plemper, R. K. Polymerases of paramyxoviruses and pneumoviruses. Virus Res. 234, 87–102 (2017).
Article CAS PubMed PubMed Central Google Scholar
Liang, B. et al. Structure of the L protein of vesicular stomatitis virus from electron cryomicroscopy. Cell 162, 314–327 (2015). The first reported structure of a nsNSV RNA polymerase.
Article CAS PubMed PubMed Central Google Scholar
Morin, B., Rahmeh, A. A. & Whelan, S. P. Mechanism of RNA synthesis initiation by the vesicular stomatitis virus polymerase. EMBO J. 31, 1320–1329 (2012).
Article CAS PubMed PubMed Central Google Scholar
Olschewski, S., Cusack, S. & Rosenthal, M. The cap-snatching mechanism of bunyaviruses. Trends Microbiol. 28, 293–303 (2020).
Article CAS PubMed Google Scholar
Koppstein, D., Ashour, J. & Bartel, D. P. Sequencing the cap-snatching repertoire of H1N1 influenza provides insight into the mechanism of viral transcription initiation. Nucleic Acids Res. 43, 5052–5064 (2015).
Article CAS PubMed PubMed Central Google Scholar
Ho, J. S. Y. et al. Hybrid gene origination creates human-virus chimeric proteins during infection. Cell 181, 1502–1517 e1523 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sutherland, K. A., Collins, P. L. & Peeples, M. E. Synergistic effects of gene-end signal mutations and the M2-1 protein on transcription termination by respiratory syncytial virus. Virology 288, 295–307 (2001).
Article CAS PubMed Google Scholar
Fearns, R. & Collins, P. L. Role of the M2-1 transcription antitermination protein of respiratory syncytial virus in sequential transcription. J. Virol. 73, 5852–5864 (1999).
Article CAS PubMed PubMed Central Google Scholar
Rahmeh, A. A. et al. Critical phosphoprotein elements that regulate polymerase architecture and function in vesicular stomatitis virus. Proc. Natl Acad. Sci. USA 109, 14628–14633 (2012).
Article CAS PubMed PubMed Central Google Scholar
Groseth, A. et al. The Ebola virus ribonucleoprotein complex: a novel VP30-L interaction identified. Virus Res. 140, 8–14 (2009).
Article CAS PubMed Google Scholar
Xu, W. et al. Ebola virus VP30 and nucleoprotein interactions modulate viral RNA synthesis. Nat. Commun. 8, 15576 (2017).
Article CAS PubMed PubMed Central Google Scholar
Emerson, S. U. & Schubert, M. Location of the binding domains for the RNA polymerase L and the ribonucleocapsid template within different halves of the NS phosphoprotein of vesicular stomatitis virus. Proc. Natl Acad. Sci. USA 84, 5655–5659 (1987).
Article CAS PubMed PubMed Central Google Scholar
Guseva, S., Milles, S., Blackledge, M. & Ruigrok, R. W. H. The nucleoprotein and phosphoprotein of measles virus. Front. Microbiol. 10, 1832 (2019).
Article PubMed PubMed Central Google Scholar
Nishio, M. et al. Mapping of domains on the human parainfluenza virus type 2 nucleocapsid protein (NP) required for NP-phosphoprotein or NP-NP interaction. J. Gen. Virol. 80, 2017–2022 (1999).
Article CAS PubMed Google Scholar
Hengrung, N. et al. Crystal structure of the RNA-dependent RNA polymerase from influenza C virus. Nature 527, 114–117 (2015). The first reported structure of ICV RNA polymerase apo complex, which highlights the dynamics of the flexible domains.
Article CAS PubMed PubMed Central Google Scholar
Peng, Q. et al. Structural insight into RNA synthesis by influenza D polymerase. Nat. Microbiol. 4, 1750–1759 (2019).
Article CAS PubMed Google Scholar
Pflug, A., Guilligay, D., Reich, S. & Cusack, S. Structure of influenza A polymerase bound to the viral RNA promoter. Nature 516, 355–360 (2014). The first reported structure of the IAV RNA polymerase bound to viral RNA.
Article CAS PubMed Google Scholar
Reich, S. et al. Structural insight into cap-snatching and RNA synthesis by influenza polymerase. Nature 516, 361–366 (2014). The first reported structures of the IBV RNA polymerase revealing the movement of the CapB domain during cap-snatching.
Article CAS PubMed Google Scholar
Wandzik, J. M. et al. A structure-based model for the complete transcription cycle of influenza polymerase. Cell 181, 877–893 e21 (2020). This article reports the structures of the influenza virus RNA polymerase at key stages of the transcription process.
Article CAS PubMed Google Scholar
Arragain, B. et al. Pre-initiation and elongation structures of full-length La Crosse virus polymerase reveal functionally important conformational changes. Nat. Commun. 11, 3590 (2020).
Article PubMed PubMed Central CAS Google Scholar
Gerlach, P., Malet, H., Cusack, S. & Reguera, J. Structural insights into bunyavirus replication and its regulation by the vRNA promoter. Cell 161, 1267–1279 (2015). The first reported structure of a sNSV L protein.
Article CAS PubMed PubMed Central Google Scholar
Wang, P. et al. Structure of severe fever with thrombocytopenia syndrome virus L protein elucidates the mechanisms of viral transcription initiation. Nat. Microbiol. 5, 864–871 (2020).
Article CAS PubMed Google Scholar
Vogel, D. et al. Structural and functional characterization of the severe fever with thrombocytopenia syndrome virus L protein. Nucleic Acids Res. 48, 5749–5765 (2020).
Article CAS PubMed PubMed Central Google Scholar
Peng, R. et al. Structural insight into arenavirus replication machinery. Nature 579, 615–619 (2020). The first reported structures of arenavirus RNA polymerases.
Article CAS PubMed Google Scholar
Horwitz, J. A., Jenni, S., Harrison, S. C. & Whelan, S. P. J. Structure of a rabies virus polymerase complex from electron cryo-microscopy. Proc. Natl Acad. Sci. USA 117, 2099–2107 (2020). The first reported structure of the rabies virus L–P complex.
Article CAS PubMed PubMed Central Google Scholar
Jenni, S. et al. Structure of the vesicular stomatitis virus L protein in complex with its phosphoprotein cofactor. Cell Rep. 30, 53–60 e55 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pan, J. et al. Structure of the human metapneumovirus polymerase phosphoprotein complex. Nature 577, 275–279 (2020). The first reported structure of the HMPV RNA polymerase bound to the the viral phosphoprotein.
Article CAS PubMed Google Scholar
Cao, D. et al. Cryo-EM structure of the respiratory syncytial virus RNA polymerase. Nat. Commun. 11, 368 (2020).
Article CAS PubMed PubMed Central Google Scholar
Gilman, M. S. A. et al. Structure of the respiratory syncytial virus polymerase complex. Cell 179, 193–204 e114 (2019). The first reported structure of the HRSV RNA polymerase bound to the viral phosphoprotein.
Article CAS PubMed PubMed Central Google Scholar
Abdella, R., Aggarwal, M., Okura, T., Lamb, R. A. & He, Y. Structure of a paramyxovirus polymerase complex reveals a unique methyltransferase-CTD conformation. Proc. Natl Acad. Sci. USA 117, 4931–4941 (2020). The first reported structure of the HPIV RNA polymerase with complete auxilliary domains and unique P-tetramer conformation.
Article CAS PubMed PubMed Central Google Scholar
te Velthuis, A. J. Common and unique features of viral RNA-dependent polymerases. Cell Mol. Life Sci. 71, 4403–4420 (2014).
Article CAS Google Scholar
Fan, H. et al. Structures of influenza A virus RNA polymerase offer insight into viral genome replication. Nature 573, 287–290 (2019). The first reported structures of human and avian IAV RNA polymerases, demonstrating the role of regulatory dimer formation in vRNA synthesis.
Article CAS PubMed PubMed Central Google Scholar
Subbarao, E. K., London, W. & Murphy, B. R. A single amino-acid in the PB2 gene of influenza-A virus is a determinant of host range. J. Virol. 67, 1761–1764 (1993).
Article CAS PubMed PubMed Central Google Scholar
Gogrefe, N., Reindl, S., Gunther, S. & Rosenthal, M. Structure of a functional cap-binding domain in Rift Valley fever virus L protein. PLoS Pathog. 15, e1007829 (2019).
Article CAS PubMed PubMed Central Google Scholar
Reguera, J. et al. Comparative structural and functional analysis of bunyavirus and arenavirus cap-snatching endonucleases. PLoS Pathog. 12, e1005636 (2016).
Article PubMed PubMed Central CAS Google Scholar
Reguera, J., Weber, F. & Cusack, S. Bunyaviridae RNA polymerases (L-protein) have an N-terminal, influenza-like endonuclease domain, essential for viral cap-dependent transcription. PLoS Pathog. 6, e1001101 (2010).
Article PubMed PubMed Central CAS Google Scholar
Ogino, T. & Green, T. J. RNA synthesis and capping by non-segmented negative strand RNA viral polymerases: lessons from a prototypic virus. Front. Microbiol. 10, 1490 (2019).
Article PubMed PubMed Central Google Scholar
Guryanov, S. G., Liljeroos, L., Kasaragod, P., Kajander, T. & Butcher, S. J. Crystal structure of the measles virus nucleoprotein core in complex with an N-terminal region of phosphoprotein. J. Virol. 90, 2849–2857 (2015).
Article PubMed CAS Google Scholar
Longhi, S. et al. The C-terminal domain of the measles virus nucleoprotein is intrinsically disordered and folds upon binding to the C-terminal moiety of the phosphoprotein. J. Biol. Chem. 278, 18638–18648 (2003).
Article CAS PubMed Google Scholar
Ivanov, I., Crepin, T., Jamin, M. & Ruigrok, R. W. Structure of the dimerization domain of the rabies virus phosphoprotein. J. Virol. 84, 3707–3710 (2010).
Article CAS PubMed PubMed Central Google Scholar
Reguera, J., Gerlach, P. & Cusack, S. Towards a structural understanding of RNA synthesis by negative strand RNA viral polymerases. Curr. Opin. Struct. Biol. 36, 75–84 (2016).
Article CAS PubMed Google Scholar
Kouba, T., Drncova, P. & Cusack, S. Structural snapshots of actively transcribing influenza polymerase. Nat. Struct. Mol. Biol. 26, 460–470 (2019).
Article CAS PubMed PubMed Central Google Scholar
te Velthuis, A. J. W., Robb, N. C., Kapanidis, A. N. & Fodor, E. The role of the priming loop in influenza A virus RNA synthesis. Nat. Microbiol. 1, 16029 (2016).
Article CAS Google Scholar
Ogino, M., Gupta, N., Green, T. J. & Ogino, T. A dual-functional priming-capping loop of rhabdoviral RNA polymerases directs terminal de novo initiation and capping intermediate formation. Nucleic Acids Res. 47, 299–309 (2019).
Article CAS PubMed Google Scholar
Thierry, E. et al. Influenza polymerase can adopt an alternative configuration involving a radical repacking of PB2 domains. Mol. Cell 61, 125–137 (2016).
Article CAS PubMed PubMed Central Google Scholar
Pyle, J. D. & Whelan, S. P. J. RNA ligands activate the Machupo virus polymerase and guide promoter usage. Proc. Natl Acad. Sci. USA 116, 10518–10524 (2019).
Article CAS PubMed PubMed Central Google Scholar
Carrique, L. et al. Host ANP32 mediates the assembly of the influenza virus replicase. Nature 587, 638–643 (2020). The first reported structure of a putative replicating-encapsidating ICV polymerase dimer bound by host factor ANP32A.
Article PubMed PubMed Central CAS Google Scholar
Lukarska, M. et al. Structural basis of an essential interaction between influenza polymerase and Pol II CTD. Nature 541, 117–121 (2017). The first reported structures of the IAV and IBV RNA polymerases bound to a peptide mimicking the CTD of host Pol II.
Article CAS PubMed Google Scholar
Serna Martin, I. et al. A mechanism for the activation of the influenza virus transcriptase. Mol. Cell 70, 1101–1110 e1104 (2018). The first reported structure of the ICV RNA polymerase bound to a peptide mimicking the CTD of host Pol II, which revealed the activation of viral transcription.
Article CAS PubMed PubMed Central Google Scholar
Te Velthuis, A. J. W. & Oymans, J. Initiation, elongation, and realignment during influenza virus mRNA synthesis. J. Virol. 92, e01775-17 (2018).
Article Google Scholar
De Vlugt, C., Sikora, D., Rocheleau, L. & Pelchat, M. Priming and realignment by the influenza a virus RdRp is dependent on the length of the host primers and the extent of base pairing to viral RNA. Virology 536, 91–100 (2019).
Article PubMed CAS Google Scholar
Yao, M. et al. Repetitive prime-and-realign mechanism converts short capped RNA leaders into longer ones that may be more suitable for elongation during rice stripe virus transcription initiation. J. Gen. Virol. 93, 194–202 (2012).
Article CAS PubMed Google Scholar
Bier, K., York, A. & Fodor, E. Cellular cap-binding proteins associate with influenza virus mRNAs. J. Gen. Virol. 92, 1627–1634 (2011).
Article CAS PubMed Google Scholar
Walker, A. P., Sharps, J. & Fodor, E. Mutation of an influenza virus polymerase 3′ RNA promoter binding site inhibits transcription elongation. J. Virol. 94, e00498–00420 (2020).
Article CAS PubMed PubMed Central Google Scholar
Poon, L. L., Pritlove, D. C., Fodor, E. & Brownlee, G. G. Direct evidence that the poly(A) tail of influenza A virus mRNA is synthesized by reiterative copying of a U track in the virion RNA template. J. Virol. 73, 3473–3476 (1999).
Article CAS PubMed PubMed Central Google Scholar
Fodor, E., Pritlove, D. C. & Brownlee, G. G. The influenza virus panhandle is involved in the initiation of transcription. J. Virol. 68, 4092–4096 (1994).
Article CAS PubMed PubMed Central Google Scholar
Tiley, L. S., Hagen, M., Matthews, J. T. & Krystal, M. Sequence-specific binding of the influenza virus RNA polymerase to sequences located at the 5′ ends of the viral RNAs. J. Virol. 68, 5108–5116 (1994).
Article CAS PubMed PubMed Central Google Scholar
Deng, T., Vreede, F. T. & Brownlee, G. G. Different de novo initiation strategies are used by influenza virus RNA polymerase on its cRNA and viral RNA promoters during viral RNA replication. J. Virol. 80, 2337–2348 (2006).
Article CAS PubMed PubMed Central Google Scholar
Oymans, J. & Te Velthuis, A. J. W. A mechanism for priming and realignment during influenza A virus replication. J. Virol. 92, e01773-17 (2018).
Article PubMed PubMed Central Google Scholar
Jorba, N., Coloma, R. & Ortin, J. Genetic trans-complementation establishes a new model for influenza virus RNA transcription and replication. PLoS Pathog. 5, e1000462 (2009).
Article PubMed PubMed Central CAS Google Scholar
Jorba, N., Area, E. & Ortin, J. Oligomerization of the influenza virus polymerase complex in vivo. J. Gen. Virol. 89, 520–524 (2008).
Article CAS PubMed Google Scholar
Chang, S. et al. Cryo-EM structure of influenza virus RNA polymerase complex at 4.3Å resolution. Mol. Cell 57, 925–935 (2015).
Article CAS PubMed Google Scholar
Long, J. S. et al. Species difference in ANP32A underlies influenza A virus polymerase host restriction. Nature 529, 101–104 (2016).
Article CAS PubMed PubMed Central Google Scholar
Long, J. S. et al. Species specific differences in use of ANP32 proteins by influenza A virus. eLife https://doi.org/10.7554/eLife.45066 (2019).
Article PubMed PubMed Central Google Scholar
Reilly, P. T., Yu, Y., Hamiche, A. & Wang, L. Cracking the ANP32 whips: important functions, unequal requirement, and hints at disease implications. Bioessays 36, 1062–1071 (2014).
Article CAS PubMed PubMed Central Google Scholar
Tchesnokov, E. P., Feng, J. Y., Porter, D. P. & Gotte, M. Mechanism of inhibition of ebola virus RNA-dependent RNA polymerase by remdesivir. Viruses 11, 326 (2019).
Article CAS PubMed Central Google Scholar
Delang, L., Abdelnabi, R. & Neyts, J. Favipiravir as a potential countermeasure against neglected and emerging RNA viruses. Antivir. Res. 153, 85–94 (2018).
Article CAS PubMed Google Scholar
Graci, J. D. & Cameron, C. E. Mechanisms of action of ribavirin against distinct viruses. Rev. Med. Virol. 16, 37–48 (2006).
Article CAS PubMed Google Scholar
Furuta, Y. et al. Favipiravir (T-705), a novel viral RNA polymerase inhibitor. Antivir. Res. 100, 446–454 (2013).
Article CAS PubMed Google Scholar
Goldhill, D. H. et al. The mechanism of resistance to favipiravir in influenza. Proc. Natl Acad. Sci. USA 115, 11613–11618 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yoon, J. J. et al. Orally efficacious broad-spectrum ribonucleoside analog inhibitor of influenza and respiratory syncytial viruses. Antimicrob. Agents Chemother. 62, e00766-18 (2018).
Article PubMed PubMed Central Google Scholar
Toots, M. et al. Characterization of orally efficacious influenza drug with high resistance barrier in ferrets and human airway epithelia. Sci. Transl Med. 11, eaax5866 (2019).
Article CAS PubMed PubMed Central Google Scholar
Lang, D. M., Zemla, A. T. & Zhou, C. L. Highly similar structural frames link the template tunnel and NTP entry tunnel to the exterior surface in RNA-dependent RNA polymerases. Nucleic Acids Res. 41, 1464–1482 (2013).
Article CAS PubMed Google Scholar
Li, C. et al. A peptide derived from the C-terminus of PB1 inhibits influenza virus replication by interfering with viral polymerase assembly. FEBS J. 280, 1139–1149 (2013).
Article CAS PubMed Google Scholar
Ghanem, A. et al. Peptide-mediated interference with influenza A virus polymerase. J. Virol. 81, 7801–7804 (2007).
Article CAS PubMed PubMed Central Google Scholar
Yamada, K. et al. Identification of a novel compound with antiviral activity against influenza A virus depending on PA subunit of viral RNA polymerase. Microbes Infect. 14, 740–747 (2012).
Article CAS PubMed Google Scholar
Clark, M. P. et al. Discovery of a novel, first-in-class, orally bioavailable azaindole inhibitor (VX-787) of influenza PB2. J. Med. Chem. 57, 6668–6678 (2014).
Article CAS PubMed Google Scholar
Pflug, A. et al. Capped RNA primer binding to influenza polymerase and implications for the mechanism of cap-binding inhibitors. Nucleic Acids Res. 46, 956–971 (2018).
Article CAS PubMed Google Scholar
Wang, W. et al. The cap-snatching SFTSV endonuclease domain is an antiviral target. Cell Rep. 30, 153–163 e155 (2020).
Article CAS PubMed PubMed Central Google Scholar
McGowan, D. C. et al. Design, synthesis, and biological evaluation of novel indoles targeting the influenza PB2 cap binding region. J. Med. Chem. 62, 9680–9690 (2019).
Article CAS PubMed PubMed Central Google Scholar
Omoto, S. et al. Characterization of influenza virus variants induced by treatment with the endonuclease inhibitor baloxavir marboxil. Sci. Rep. 8, 9633 (2018).
Article PubMed PubMed Central CAS Google Scholar
Ye, Q., Krug, R. M. & Tao, Y. J. The mechanism by which influenza A virus nucleoprotein forms oligomers and binds RNA. Nature 444, 1078–1082 (2006).
Article CAS PubMed Google Scholar
Arragain, B. et al. High resolution cryo-EM structure of the helical RNA-bound Hantaan virus nucleocapsid reveals its assembly mechanisms. eLife 8, e43075 (2019).
Article PubMed PubMed Central Google Scholar
Gutsche, I. et al. Structural virology. Near-atomic cryo-EM structure of the helical measles virus nucleocapsid. Science 348, 704–707 (2015).
Article CAS PubMed Google Scholar
Sugita, Y., Matsunami, H., Kawaoka, Y., Noda, T. & Wolf, M. Cryo-EM structure of the Ebola virus nucleoprotein-RNA complex at 3.6 Å resolution. Nature 563, 137–140 (2018).
Article CAS PubMed Google Scholar
Su, Z. et al. Electron cryo-microscopy structure of Ebola virus nucleoprotein reveals a mechanism for nucleocapsid-like assembly. Cell 172, 966–978 e912 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wan, W. et al. Structure and assembly of the Ebola virus nucleocapsid. Nature 551, 394–397 (2017).
Article CAS PubMed PubMed Central Google Scholar
Arranz, R. et al. The structure of native influenza virion ribonucleoproteins. Science 338, 1634–1637 (2012).
Article CAS PubMed Google Scholar
Moeller, A., Kirchdoerfer, R. N., Potter, C. S., Carragher, B. & Wilson, I. A. Organization of the influenza virus replication machinery. Science 338, 1631–1634 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dadonaite, B. et al. The structure of the influenza A virus genome. Nat. Microbiol. 4, 1781–1789 (2019).
Article CAS PubMed PubMed Central Google Scholar
Williams, G. D. et al. Nucleotide resolution mapping of influenza A virus nucleoprotein-RNA interactions reveals RNA features required for replication. Nat. Commun. 9, 465 (2018).
Article PubMed PubMed Central CAS Google Scholar
Lee, N. et al. Genome-wide analysis of influenza viral RNA and nucleoprotein association. Nucleic Acids Res. 45, 8968–8977 (2017).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

A.J.W.t.V. is supported by joint Wellcome Trust and Royal Society grant 206579/Z/17/Z and the National Institutes of Health grant R21AI147172. J.M.G. is supported by Wellcome Investigator Award 200835/Z/16/Z. E.F. is supported by UK Medical Research Council programme grant MR/R009945/1.

Author information

Aartjan J. W. te Velthuis
Present address: Lewis Thomas Laboratory, Department of Molecular Biology, Princeton University, Princeton, NJ, USA

Authors and Affiliations

Division of Virology, Department of Pathology, University of Cambridge, Cambridge, UK
Aartjan J. W. te Velthuis
Division of Structural Biology, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
Jonathan M. Grimes
Sir William Dunn School of Pathology, University of Oxford, Oxford, UK
Ervin Fodor

Authors

Aartjan J. W. te Velthuis
View author publications
You can also search for this author in PubMed Google Scholar
Jonathan M. Grimes
View author publications
You can also search for this author in PubMed Google Scholar
Ervin Fodor
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors contributed equally to all aspects of the article.

Corresponding authors

Correspondence to Aartjan J. W. te Velthuis, Jonathan M. Grimes or Ervin Fodor.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information

Nature Reviews Microbiology thanks R. Fearns, M. Rosenthal and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Glossary

Negative-sense RNA viruses: (NSVs). Viruses with a negative-sense, single-stranded RNA genome.
Negative-sense RNA: An RNA that must be transcribed by a virally encoded RNA-dependent RNA polymerase into complementary mRNA before viral protein production can occur.
Nucleoprotein: A protein that binds to the genomic and antigenomic RNA in the context of viral ribonucleoprotein complexes.
Viral ribonucleoprotein: (vRNP). A complex consisting of viral RNA and nucleoprotein.
Antigenome: Positive-sense replicative RNA intermediate of viral replication, which is referred to as the complementary RNA for some viruses, such as the influenza viruses.
Cap-snatching: A process by which the 5′ cap structure is cleaved from cellular capped RNAs by endonuclease activity and subsequently used to prime viral transcription.
Priming loop: A protein structure that interacts with the first nucleotide of the nascent RNA or terminal base of the template and lowers the activation energy for de novo initiation.
De novo initiation: The initiation step of nucleic acid synthesis that occurs in the absence of a primer.
Apo form: A protein in the absence of a ligand.
Prime-and-realign: A mechanism in which the nascent RNA is separated from the template RNA and reannealed at an identical sequence elsewhere in the template.

Rights and permissions

Reprints and permissions

About this article

Cite this article

te Velthuis, A.J.W., Grimes, J.M. & Fodor, E. Structural insights into RNA polymerases of negative-sense RNA viruses. Nat Rev Microbiol 19, 303–318 (2021). https://doi.org/10.1038/s41579-020-00501-8

Download citation

Accepted: 07 December 2020
Published: 25 January 2021
Issue Date: May 2021
DOI: https://doi.org/10.1038/s41579-020-00501-8

Subjects

Abstract

Similar content being viewed by others

Improving prime editing with an endogenous small RNA-binding protein

Engineered virus-like particles for transient delivery of prime editor ribonucleoprotein complexes in vivo

Expedient production of site specifically nucleobase-labelled or hypermodified RNA with engineered thermophilic DNA polymerases

Introduction

Structure of the RNA polymerase

RNA polymerases of sNSVs

RNA polymerases of nsNSVs

Structure of the RdRP domain

Binding to promoter RNA

Conformational rearrangements

RNA polymerases of sNSVs

RNA polymerases of nsNSVs

Transcription and replication

sNSV RNA polymerases

nsNSV RNA polymerases

Therapeutic applications

Nucleoside analogues

Non-nucleoside inhibitors

Concluding remarks

Change history

02 February 2021

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Peer review information

Publisher’s note

Glossary

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links