Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has killed millions of people and continues to cause massive global upheaval. Coronaviruses are positive-strand RNA viruses with an unusually large genome of ~30 kb. They express an RNA-dependent RNA polymerase and a cohort of other replication enzymes and supporting factors to transcribe and replicate their genomes. The proteins performing these essential processes are prime antiviral drug targets, but drug discovery is hindered by our incomplete understanding of coronavirus RNA synthesis and processing. In infected cells, the RNA-dependent RNA polymerase must coordinate with other viral and host factors to produce both viral mRNAs and new genomes. Recent research aiming to decipher and contextualize the structures, functions and interplay of the subunits of the SARS-CoV-2 replication and transcription complex proteins has burgeoned. In this Review, we discuss recent advancements in our understanding of the molecular basis and complexity of the coronavirus RNA-synthesizing machinery. Specifically, we outline the mechanisms and regulation of RNA translation, replication and transcription. We also discuss the composition of the replication and transcription complexes and their suitability as targets for antiviral therapy.
Introduction
The devastation caused by the COVID-19 pandemic has led to an extraordinary expansion of research focused on the causative agent, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This virus has been classified within the species Severe acute respiratory syndrome-related coronavirus, which belongs to the genus Betacoronavirus of the family Coronaviridae1. The coronavirus family belongs to the order Nidovirales, which includes a rapidly expanding and diverse group of enveloped viruses with a single-stranded RNA genome of positive polarity. Different nidoviruses use similar strategies to organize, express and replicate their genomes. They constitute a monophyletic virus cluster that is characterized by the universal conservation of seven domains in their large replicase gene, which encodes the functions required for viral RNA replication and transcription in the infected cell2. Much of the foundational research on this replicase addressed viruses in related nidovirus families such as the Arteriviridae and, more relevantly, other members of the family Coronaviridae. The features unique and common to viruses within and outside the order Nidovirales have been described in several comprehensive reviews3,4,5. Using the first available genome sequences, early studies unravelled the genome organization and expression strategy used by coronaviruses and other nidoviruses. These studies were followed by pioneering bioinformatics, biochemical and genetic studies that established or confirmed the essential functions of many of the replicase proteins, thereby laying a road map for the currently ongoing SARS-CoV-2 research.
Betacoronaviruses have caused previous deadly epidemics, including the 2003 SARS outbreak and the ongoing Middle East respiratory syndrome (MERS) epidemic, which was first detected in 2012 (ref.6). These zoonotic events inspired earlier efforts to develop coronavirus-specific antiviral drugs6,7,8,9. The onset of the COVID-19 pandemic spurred scientists around the globe to apply their respective expertise to address how SARS-CoV-2 infects humans, avoids or delays host immune responses, copies its genome and expresses its proteins to make new virions. As successful replication directly depends on efficient synthesis of viral RNA, the replicase proteins responsible for this process are obvious antiviral drug targets.
Coronaviruses use an unusually large collection of RNA-synthesizing and RNA-processing enzymes to express and replicate a genome that is two to three times larger than that of most other RNA viruses. The central enzyme of transcription and replication is the RNA-dependent RNA polymerase (RdRp), which synthesizes all viral RNA and is a proven target for antiviral drugs. Successful replication and transcription also entails the use of specific RNA recognition signals to initiate RNA synthesis, sustaining RdRp processivity and fidelity, viral mRNA capping to ensure translation by host ribosomes, and the spatial and temporal regulation of the viral cycle within the infected cell. The non-structural proteins that assist the RdRp to perform these functions (Table 1) constitute additional targets for antiviral drug development. Like the RdRp, some of these non-structural proteins are conserved across most RNA viruses, whereas others are unique to coronaviruses3,4,5.
In this Review, we discuss recent advances in deciphering the molecular mechanisms of coronavirus gene expression and RNA replication, with special focus on SARS-CoV-2. We aim to contextualize the more recent studies with the foundational work, to provide a coherent view of our current understanding of the successive steps in SARS-CoV-2 replication and transcription. We focus on the processes required for viral gene expression and replication: translation, replication organelle formation, and production and capping of genomic RNA (gRNA) and subgenomic RNAs (sgRNAs). We also describe the known and proposed functions of the viral nucleic acid-metabolizing proteins, as revealed by new biochemical, structural and virological studies. We then discuss two well-studied examples of antiviral nucleoside analogues that target the RdRp; owing to space constraints, we do not address therapeutics designed to target viral proteins other than the RdRp. Likewise, we do not extensively cover the mechanisms of viral protein synthesis, or the (proposed) involvement of a substantial number of host factors in coronavirus replication and transcription. We conclude the Review by describing the remaining gaps in our knowledge to help guide new research.
The SARS-CoV-2 infection cycle
To infect a cell, coronaviruses use multiple host factors, whose expression patterns therefore co-determine viral tropism. Delivery into the cell and translation of the large RNA genome launches a cytoplasmic replication cycle that integrates a remarkable variety of strategies to fine-tune viral gene expression on both the translational level and the transcriptional level. The successive steps that ultimately lead to the release of viral progeny are coordinated temporally and spatially, and rely extensively on the infrastructure and metabolism of the host cell. In this section, we outline the key steps of the SARS-CoV-2 infection cycle.
Entry into the host cell
SARS-CoV-2 entry into the cell depends on several host attachment and entry factors, chief among them the receptor angiotensin-converting enzyme 2 (ACE2), to which the SARS-CoV-2 spike (S) glycoprotein binds (reviewed in refs10,11) (Fig. 1). The fusogenic spike protein consists of two parts, S1 and S2, which mediate attachment to and fusion with the cell membrane, respectively. Cellular proteases such as transmembrane serine protease 2 cleave the spike protein, a step required to prime its membrane fusion activity12,13.
Following membrane fusion, the SARS-CoV-2 gRNA is released into the cytosol. The genome is a 5′-capped, single-stranded RNA of 29,870 bases with a 3′ poly(A) tail of variable length14,15. It encodes at least 13 recognized open reading frames (ORFs), organized largely linearly from the 5′ end to the 3′ end (Fig. 2a). The coding part of the gRNA is flanked by a 5′ untranslated region (UTR) and a 3′ UTR of 265 and 337 nucleotides (excluding the poly(A) tail), respectively. The gRNA possesses a number of regulatory sequences and higher-order RNA structures (discussed later) that are involved in its translation, replication and transcription16,17,18,19.
Gearing up for RNA synthesis
Similarly to all positive-strand RNA viruses of eukaryotes, the replication of SARS-CoV-2 occurs entirely in the cytoplasm. The SARS-CoV-2 gRNA first recruits host ribosomes and serves as mRNA for translation of the two large replicase ORFs ORF1a and ORF1b, which constitute about three quarters of the genome15 (Fig. 2a). The resulting, amino-terminally (N-terminally) collinear replicase polyproteins pp1a and pp1ab are 4,405 and 7,096 amino acids long, respectively. Production of pp1ab depends on the occurrence of a −1 programmed ribosomal frameshift (PRF) just upstream of the ORF1a termination codon, thus extending pp1a with the ORF1b-encoded polyprotein. The estimated frameshifting efficiency is 45–70%, resulting in a 1.5–2-fold overexpression of ORF1a-encoded proteins relative to ORF1b-encoded proteins20. Sixteen mature non-structural proteins are released from pp1a and pp1ab following 15 proteolytic cleavages performed by the virus-encoded papain-like protease (PLpro) in non-structural protein 3 (nsp3) and chymotrypsin-like or main protease (Mpro) in nsp5 (Table 1). In this manner, pp1a yields nsp1 to nsp11, whereas pp1ab is cleaved into nsp1 to nsp10 and nsp12 to nsp16 (refs21,22) (Fig. 2a).
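The link between frameshifting efficiency and the relative expression of ORF1a-encoded and ORF1b-encoded proteins can be made explicit with a short calculation. The sketch below assumes a deliberately simplified model in which every ribosome that initiates on the gRNA translates ORF1a and a fraction equal to the frameshifting efficiency continues into ORF1b; the function name is illustrative and does not come from the cited studies.

```python
def orf1a_to_orf1b_ratio(frameshift_efficiency: float) -> float:
    """Relative expression of ORF1a- versus ORF1b-encoded proteins.

    Simplifying assumption: every initiating ribosome translates ORF1a,
    and a fraction `frameshift_efficiency` goes on to translate ORF1b.
    """
    if not 0.0 < frameshift_efficiency <= 1.0:
        raise ValueError("efficiency must be in the interval (0, 1]")
    return 1.0 / frameshift_efficiency


# Frameshifting efficiencies quoted in the text (45-70%).
for eff in (0.45, 0.70):
    print(f"efficiency {eff:.0%}: ORF1a/ORF1b = {orf1a_to_orf1b_ratio(eff):.2f}")

# Polyprotein lengths quoted in the text.
PP1A_LEN, PP1AB_LEN = 4405, 7096
print("pp1ab is longer than pp1a by", PP1AB_LEN - PP1A_LEN, "residues")
```

Under this assumption the ratio falls between roughly 1.4 and 2.2, in line with the approximately 1.5–2-fold overexpression stated above.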
The rapidly released nsp1 mediates the shutdown of the translation of host mRNAs23,24,25, while the other non-structural proteins form protein complexes, yet to be definitively determined, that engage in viral RNA synthesis and are referred to as the replication–transcription complexes (RTCs). Replication and transcription are driven primarily by the enzymes contained in nsp12, nsp13, nsp14 and nsp16 (Table 1). Nsp12, the subunit containing the RdRp domain, catalyses RNA synthesis with the assistance of nsp7 and nsp8, together forming the holoenzyme RdRp (holo-RdRp). Other RTC subunits have supporting roles in the RTC, modulate the host’s innate immune responses or remodel cell membranes into peculiar double-membrane structures known as ‘replication organelles’, which accommodate viral RNA synthesis26,27. The formation of replication organelles typically precedes the exponential phase of viral RNA synthesis and is discussed in Box 1.
RNA synthesis and virion assembly
Using the gRNA template, RNA synthesis by the RTC starts with producing both a full-length genome complement (the anti-genome) and a set of minus-strand sgRNAs, which are derived from the gRNA region downstream of ORF1a and ORF1b (the replicase gene). Whereas the anti-genome serves as a template to produce new gRNA, the minus-strand sgRNAs direct the synthesis of a nested set of subgenomic mRNAs (sg-mRNAs) (discussed later). Although transcription is defined principally as the synthesis of RNA from a DNA template, in this Review we use the term to describe the synthesis of sg-mRNAs from RNA templates, to conform with the terminology used in the coronavirus literature.
The sg-mRNAs are crucial for the production of the four coronavirus structural proteins, which are required for virion assembly and egress. New virions were recently reported to leave the cell via lysosomal trafficking rather than the biosynthetic secretory pathway used by many other enveloped viruses28. A number of the sg-mRNAs are used to express so-called accessory proteins, many of which have been implicated in modulating cellular innate immune responses20,29,30 (Fig. 1).
Regulation of SARS-CoV-2 translation
Expression of the SARS-CoV-2 proteins in infected cells depends primarily on the translation of gRNA and the eight ‘canonical’ sg-mRNAs20,29,30. The viral genes fall into three groups (Fig. 2a): replicase ORF1a and ORF1b, which are translated from gRNA with ORF1b expression depending on −1 PRF; ORFs encoding the four ‘universal’ coronavirus structural proteins (the spike, membrane, envelope and nucleocapsid proteins) (Fig. 1), which are translated from sg-mRNAs; and ORFs encoding accessory proteins, which are translated from the remaining sg-mRNAs, but differ widely between various coronavirus lineages31 (Fig. 2a). In addition, small (putative) ORFs that overlap with several of the ORFs outlined above were identified by theoretical and experimental approaches. These small ORFs were also the subject of nomenclature confusion32, and their expression and biological relevance continue to be investigated32,33,34,35. In this light, we have included only ORF3c and ORF9b of the small ORFs in the current map of the SARS-CoV-2 genome (Fig. 2a).
Like many RNA viruses, coronaviruses use non-canonical translation mechanisms to expand their coding capacity and fine-tune the expression levels of particular viral proteins36. Specifically, PRF (for ORF1b) and ‘leaky ribosomal scanning’ (for some ORFs in sg-mRNAs) co-regulate SARS-CoV-2 genome expression. In leaky ribosomal scanning, ribosomes load onto the 5′ end of the viral sg-mRNAs, but initiate translation from a more downstream, internal start codon. The use of leaky scanning has been suggested or demonstrated for SARS-CoV-2 ORF3c33,34, and for ORF7b20,22,37 and ORF9b38 of both SARS-CoV and SARS-CoV-2. In these cases, leaky scanning and expression of the more downstream ORF appear to be promoted by the suboptimal nature of upstream translation initiation signals.
Expression of ORF1b from gRNA depends on −1 PRF occurring just upstream of the ORF1a stop codon39, a highly conserved feature among coronaviruses and other nidoviruses. Termination of SARS-CoV-2 ORF1a translation yields the 4,405-residue-long pp1a, and −1 PRF results in extension to yield the 7,096-residue-long pp1ab. The ORF1a–ORF1b PRF mechanism directs the expression of the key RNA metabolism enzymes of the RTC and regulates the relative expression levels of the proteins encoded by ORF1a and ORF1b. Frameshifting occurs on a specific ‘slippery sequence’ (5′-U UUA AAC-3′, nucleotides 13,462–13,468 in GenBank genome entry MN908947.3; all genome reference numbers in this Review refer to this sequence), which is followed by GGG in the case of SARS-CoV-2. Ribosomes translating the UUA and AAC codons of ORF1a can shift one nucleotide backwards, and translation then continues with a CGG codon in ORF1b40,41 (Fig. 2b,c). The importance of maintaining the optimal ratio between ORF1a expression and ORF1b expression was demonstrated experimentally with use of SARS-CoV mutants with altered PRF levels, which were found to be dramatically crippled41.
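The effect of the −1 shift on the reading frame can be traced on the ten nucleotides given above. The following is a minimal sketch that only re-reads the sequence in two frames; the helper name and the zero-based indexing are choices made for this illustration, and the model ignores the pseudoknot and all other regulatory elements.

```python
def codons(rna: str, start: int, count: int) -> list[str]:
    """Return `count` consecutive codons read from 0-based position `start`."""
    return [rna[i:i + 3] for i in range(start, start + 3 * count, 3)]


# Slippery sequence U UUA AAC followed by GGG, as given in the text.
# The leading U is the last nucleotide of the preceding ORF1a codon.
site = "UUUAAACGGG"

# Zero frame (ORF1a): codons begin at index 1.
zero_frame = codons(site, 1, 3)

# After UUA and AAC have been decoded, the ribosome slips back by one
# nucleotide, so the next codon begins at index 6 rather than index 7.
first_orf1b_codon = codons(site, 6, 1)

print("ORF1a frame:", zero_frame)
print("first codon after the -1 shift:", first_orf1b_codon)
```

Without the shift the ribosome would read GGG after AAC; with the shift it reads CGG, the first ORF1b codon named in the text.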
Regulation of PRF efficiency is achieved by the formation of several RNA structures and through interactions of the nascent protein chain with the ribosome. The key PRF-stimulating element is a three-stemmed RNA pseudoknot structure located downstream of the slippery sequence39,40,41,42 (Fig. 2b). This element interacts with the ribosome at the entry of the mRNA channel of the 40S ribosomal subunit and induces translational pausing before −1 PRF; complete unfolding of this tertiary RNA structure is slow and thought to promote ribosomal frameshifting on the viral mRNA40. The position of the ORF1a stop codon, five codons downstream of the frameshift site, may also regulate PRF levels by allowing the pseudoknot to refold, thus preventing trailing ribosomes from continuing along the viral RNA that was unfolded by the leading ribosome40. PRF frequency is further modulated by a translation-attenuating RNA loop upstream of the slippery sequence40, which may either directly inhibit frameshifting42 or force elongating ribosomes to dissociate before reaching the PRF site41. Finally, interactions between specific residues in the nascent viral polyproteins and the ribosome exit tunnel are thought to co-determine PRF efficiency40.
RNA replication and transcription
As described earlier, ORF1a and ORF1b are translated into the pp1a and pp1ab precursors that give rise to 16 non-structural proteins1,2,3,4. Fourteen of these replicase subunits have been ascribed some type of function in coronavirus replication, several with manifold activities, as listed in Table 1. These non-structural proteins either have been directly implicated in nucleic acid metabolism or enable or promote the activity of the catalytic non-structural proteins, to stimulate RNA synthesis and processing or participate in the formation of replication organelles. In this Review, we discuss in detail the molecular and biochemical features of these proteins, focusing mostly on nsp7, nsp8, nsp9, nsp10, nsp11, nsp12, nsp13, nsp14 and nsp16, but do not delve into the details of the role of nsp15, which includes a unique uridylate-specific endoribonuclease43 that is conserved in most vertebrate nidoviruses. Nsp15 has been implicated in innate immunity evasion44,45, possibly by shortening the poly(U) stretches that are present at the 5′ end of viral minus-strand RNAs45. In coronaviruses, uridylate-specific endoribonuclease activity is required for efficient replication, but that requirement can be bypassed in host cells with depressed type I interferon sensing or production44,45. In the following subsections we summarize our current understanding of the mechanisms of coronavirus gRNA and sg-mRNA synthesis, and the involvement of specific viral proteins in controlling these processes. We note that some non-structural proteins have other functions, which we do not mention because they are outside the scope of this Review. For a more in-depth summary of the literature on the functions of each non-structural protein, see Table 1 and references therein.
Continuous and discontinuous RNA synthesis in replication and gene expression
Following gRNA translation and proteolytic maturation of the replicase polyproteins, a relatively complex programme of SARS-CoV-2 RNA synthesis and gene expression is initiated, which depends on the interplay between viral RNA and non-structural proteins on the one hand (Fig. 3), and host-cell proteins and membranes on the other hand (Box 1). A variety of RNA sequences and structural elements in the terminal regions of the coronavirus genome have been implicated in the specific recognition of RNA templates by the coronavirus RTC16,17,18 (Fig. 3a). Long-range RNA–RNA interactions may be important for replication and transcription, although in many cases direct experimental support for their biological relevance remains to be obtained.
As outlined earlier, SARS-CoV-2 RNA synthesis can be divided into genome replication and sg-mRNA transcription (Fig. 3b). Replication yields full-length viral plus-strand gRNA, which can be translated into additional replicase polyproteins, serve as a template for additional minus-strand RNA synthesis or be packaged into progeny virions. Transcription produces the nested set of sg-mRNAs used to express the structural and accessory proteins. Replication and transcription both require dedicated minus-strand RNA templates; full-length minus strands serve as a template for gRNA replication, whereas a nested set of minus-strand sgRNAs serve as templates for transcription, as first proposed about a quarter of a century ago46. The sg-mRNAs have the same 3′-terminal sequence, and carry a common 5′ leader sequence that is identical to the 5′-terminal 75 nucleotides of the gRNA. The leader derives from a discontinuous step (that is, from template switching47 during minus-strand sgRNA synthesis), which occurs when the RTC stalls at the 3′-proximal quarter of the gRNA template (Fig. 3b). This interruption mediates RTC detachment and relocation to a position near the 5′ end of the gRNA template (discussed later), where minus-strand synthesis resumes. This process yields a set of nested minus-strand sgRNAs with common 5′-terminal and 3′-terminal sequences, which serve as templates for the synthesis of a complementary set of sg-mRNAs47,48.
The presence of the common 5′ leader sequence in coronavirus mRNAs may offer several advantages. Its complement (the anti-leader sequence) offers a conserved starting point for plus-strand RNA synthesis, which can be used to initiate the synthesis of both gRNA and all sg-mRNAs. Although not studied in detail thus far, the common 5′ leader sequence may also serve as a recognition signal for the viral mRNA capping machinery (Box 2). Furthermore, the nsp1 proteins of SARS-CoV, MERS-CoV and SARS-CoV-2 all mediate a translation shut-off in the infected cell49, which is based on their ability to block the ribosomal mRNA entry channel24,25 and induce endonucleolytic cleavage of host mRNAs23. The common 5′ leader sequence present in all coronavirus mRNAs allows escape from translation shut-off50, by yet unknown mechanisms, resulting in simultaneous viral mRNA translation and impairment of host-cell gene expression, including of genes mediating the early responses to virus infection.
Regulation of template switching
The template switching required to extend the ‘body’ of the nascent minus-strand sgRNA with the anti-leader is primarily guided by the body transcription regulatory sequence (TRS-B) elements. These short sequences are found just upstream of the ORFs that encode structural and accessory proteins (except for those expressed through leaky scanning). After copying of a TRS-B sequence, minus-strand RNA synthesis stalls and the 3′ end of the nascent RNA strand is translocated to reinitiate RNA synthesis at the leader TRS (TRS-L) near the 5′ end of the gRNA template. This step is strongly facilitated by a base pairing interaction between the TRS-B complement at the 3′ end of the nascent minus strand (anti-TRS-B) and the TRS-L sequence in the gRNA template, as demonstrated previously in related viruses by site-directed mutagenesis studies51,52,53,54 (Fig. 3b). Coronavirus TRSs comprise a conserved core sequence (5′-ACGAAC-3′ in the case of SARS-CoV and SARS-CoV-2) that is flanked by sequences of variable length that may also contribute to the base pairing interaction with the TRS-L region20,29,30. In addition to the strength of the RNA duplex that is formed with the TRS-L region, other factors may co-determine the relative activity of a TRS-B element, and consequently the level at which the corresponding sg-mRNA is produced. These factors include the relative position of a TRS-B with respect to the 3′ end of the gRNA template, flanking RNA sequences51 and the local or overall RNA structure of the gRNA template16. Genome cyclization driven by long-distance RNA–RNA interactions (Fig. 3a) was recently proposed to expose the TRS-L for base pairing during discontinuous minus-strand synthesis16, similarly to what was previously postulated for arteriviruses55.
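The net outcome of template switching, a set of transcripts that share the leader and the 3′ end but differ in where the body begins, can be illustrated with a toy string model. The sketch below is a simplification under stated assumptions: it uses only exact matches to the core sequence, treats the first match as TRS-L, skips the minus-strand intermediates and builds the plus-strand products directly. The function name and the toy genome are hypothetical.

```python
TRS_CORE = "ACGAAC"  # core sequence given in the text for SARS-CoV and SARS-CoV-2


def subgenomic_mrnas(genome: str, trs: str = TRS_CORE) -> list[str]:
    """Model plus-strand sg-mRNAs as a leader fused to a body at a shared TRS.

    Simplifications: only exact core-sequence matches are considered, the
    first match is treated as TRS-L, and minus-strand synthesis is not modelled.
    """
    sites = []
    pos = genome.find(trs)
    while pos != -1:
        sites.append(pos)
        pos = genome.find(trs, pos + 1)
    if not sites:
        return []
    trs_l, trs_bs = sites[0], sites[1:]
    leader = genome[:trs_l]  # sequence upstream of TRS-L
    # Each transcript keeps the leader, one TRS copy and everything 3' of that TRS-B.
    return [leader + genome[b:] for b in trs_bs]


# Hypothetical toy genome; the filler strings are placeholders, not viral sequence.
toy = "gguu" + TRS_CORE + "orfONE" + TRS_CORE + "orfTWO" + TRS_CORE + "orfTHREE"
for mrna in subgenomic_mrnas(toy):
    print(mrna)
```

Each printed transcript starts with the same leader and ends with the same 3′ sequence, and the ORF immediately downstream of the fused TRS sits at the 5′-proximal position.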
The series of ‘stop-or-go decisions’ at the consecutive TRS-B elements encountered by the minus strand-transcribing RTC is thought to fine-tune the relative abundances of the various sg-mRNAs, which remain largely similar throughout the course of infection56. The TRS-L is effectively ‘merged’ with one of the TRS-B elements in each of the sg-mRNAs, thus positioning the ORF downstream of that TRS-B at the 5′-proximal position in the sg-mRNA and allowing it to be accessed by host ribosomes. Thus, coronavirus sg-mRNAs are nested and, except for the smallest species, polycistronic. However, they are presumed to be functionally monocistronic, with translation being restricted to the ORF most proximal to the 5′ end of the RNA, except in the case of sg-mRNAs on which leaky ribosomal scanning occurs to access a second ORF.
The canonical 5′-ACGAAC-3′ TRS core sequence occurs only nine times in the SARS-CoV-2 genome (in TRS-L and eight TRS-Bs), coordinating the production of eight sg-mRNA species (RNAs 2–9)20,29,30,57 (Fig. 2a). The smallest of these (mRNA 9) encodes the nucleocapsid protein and is by far the most abundant transcript30,56. For most sg-mRNAs, transcript abundance strongly correlates with ribosome footprint densities, indicating that they are translated with similar efficiencies, in line with the fact that their 5′ UTRs starting with the 75-nucleotide common leader sequence are largely identical (Fig. 2a). The detection of a separate sg-mRNA to express ORF7b was reported, derived from template switching at a TRS-B-like sequence (5′-AAGAAC-3′) located just upstream of ORF7b. Although the effective contribution of this TRS-B-like sequence to ORF7b expression may be limited20, this exemplifies how the (low-frequency) use of TRS-B-resembling sequences may yield additional subgenomic transcripts.
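The distinction between the canonical core sequence and TRS-B-like sequences such as 5′-AAGAAC-3′ amounts to allowing a small number of mismatches when scanning a sequence. The sketch below shows one simple way to do this with a sliding window; the function name, the one-mismatch threshold and the test sequence are assumptions made for illustration and do not reproduce the methods of the cited transcriptome studies.

```python
CANONICAL_TRS = "ACGAAC"


def trs_like_sites(seq: str, core: str = CANONICAL_TRS, max_mismatches: int = 1):
    """Yield (position, motif, mismatches) for windows within `max_mismatches` of `core`."""
    k = len(core)
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        mismatches = sum(a != b for a, b in zip(window, core))
        if mismatches <= max_mismatches:
            yield i, window, mismatches


# Hypothetical test sequence with one canonical and one TRS-B-like motif.
example = "UUUACGAACUUUUAAGAACUUU"
for pos, motif, mm in trs_like_sites(example):
    label = "canonical" if mm == 0 else "TRS-like"
    print(pos, motif, label)
```

Setting `max_mismatches=0` restricts the scan to the canonical core sequence.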
SARS-CoV-2 in-depth transcriptomics
Recently, the use of different highly sensitive techniques to study the SARS-CoV-2 transcriptome has identified numerous ‘non-canonical’ subgenomic transcripts20,29,30,57. These derive from TRS-L-dependent transcription, with the TRS-L, for example, being fused to downstream TRS-B-like sequences located in the middle of known ORFs; from large or local deletions generated without the apparent involvement of TRSs; or from the generation of (possibly) defective RNAs that may interfere with replication of the full-length genome by competing for the viral RdRp and other crucial replication factors, as described in several other coronaviruses58. These RNA species may in part derive from RNA recombination, which occurs at high frequency in coronaviruses59,60,61. The most accepted model for recombination in RNA viruses, similarity-assisted copy-choice RNA recombination, bears strong resemblance to the mechanism of coronavirus discontinuous minus-strand sgRNA synthesis47. Recombination involving host RNAs has also been invoked to explain gene acquisition during the evolution of coronaviruses and other nidoviruses3. It was hypothesized that TRS-B elements serve as recombination hotspots60,61, and that RNA secondary structures promote template switching in a TRS-independent manner61. Together, in-depth transcriptomics and ribosome-profiling experiments have revealed a complex landscape of SARS-CoV-2 RNAs and (potential) proteins, which extends well beyond the ‘canonical’ gene expression programme based on translation of the gRNA and canonical sg-mRNA20,29,30,57. Similar observations were made in other coronaviruses62,63. The additional transcripts may serve to express previously unknown small ORFs, truncated proteins or fused (partial) gene products, but their potential roles in SARS-CoV-2 replication and pathogenesis remain to be thoroughly investigated.
Balancing replication and transcription
It is unknown whether the composition of RTCs engaging in synthesis of minus-strand gRNA versus minus-strand sgRNAs is identical. Interactions with specific protein factors may govern the balance between replication and transcription. Two examples of transcription-specific protein functions have been documented in arteriviruses, which are distant coronavirus relatives in the order Nidovirales that also use discontinuous RNA synthesis to generate a nested set of sg-mRNAs64. The N-terminal subunit of the arterivirus replicase, nsp1, controls the accumulation of gRNA and sg-mRNAs by determining the levels at which their respective minus-strand templates are produced65. Specifically, mutagenesis of the N-terminal zinc-finger domain of nsp1 fully abrogated sg-mRNA synthesis, whereas gRNA production by such mutants increased 2.5–3-fold66. A serendipitous mutation just downstream of the zinc-binding domain of the helicase subunit also decreased arterivirus transcription and increased replication67,68, a finding that may be relevant to a recent hypothesis69,70 postulating that helicase-induced RTC backtracking contributes to the interruption of minus-strand RNA synthesis and/or to template switching (discussed later).
The role of the nucleocapsid protein in RNA synthesis
Although the primary role of the coronavirus nucleocapsid protein is gRNA encapsidation, it has also been implicated in a variety of other functions and interactions, including in regulating or modulating viral replication and transcription, although the interpretation of the supporting evidence is often complicated by the generally strong affinity of the nucleocapsid protein for RNA71,72,73. Both nonspecific RNA binding and binding to specific RNA sequences, including the TRS, have been reported, but often based on in vitro assays using purified nucleocapsid protein (reviewed in ref.71). A human coronavirus 229E RNA replicon lacking the nucleocapsid-encoding gene (and all other structural protein genes) retained the capability to replicate itself and synthesize sg-mRNAs74. This finding, and the fact that nucleocapsid-protein expression promotes viral replication75,76, suggests that the nucleocapsid protein has a modulatory rather than an essential role in coronavirus RNA synthesis. In line with this notion, the launching of coronavirus replication from in vitro-generated gRNA can be enhanced by co-expression of the nucleocapsid protein71. The nucleocapsid proteins of different coronaviruses interact with numerous other proteins, including with coronavirus replicase subunits such as nsp3 (ref.77) and host cell factors such as the RNA helicase DDX1 (ref.78). In the latter case, a complex formed between DDX1 and phosphorylated nucleocapsid protein was proposed to control the balance between replication and transcription by modulating the level of template switching at the successive TRS-B elements encountered by the RTC. The SARS-CoV-2 nucleocapsid protein was also shown to promote the cooperative association of the nsp7–nsp8–nsp12 complex with poly(U) RNA in vitro, thereby possibly facilitating initiation and/or elongation of viral RNA synthesis79.
The replication–transcription complex
The nsp7–nsp8–nsp12 holo-RdRp is the central component of the coronavirus RTC, and investigating the molecular basis of its RNA-synthesizing activity will facilitate rational drug design. Akin to other polynucleotide polymerases, the RdRp catalyses the incorporation of ribose nucleoside triphosphates (NTPs) into a nascent ‘product’ RNA using the information provided by the template RNA. However, maintaining the integrity of the template poses significant challenges to coronaviruses as their genomes are approximately 30 kb long, which is a burden considering the so-called error threshold that demarcates the genome size above which the long-term survival of an RNA virus species would be tenuous80,81. Thus, to preserve the integrity of their genetic information, coronaviruses have evolved mechanisms to mitigate the impact of nucleotide misincorporations during RNA synthesis3. This section highlights recent biochemical and structural studies of how coronaviruses orchestrate their replication and transcription, with the added aim of enhancing our understanding of the druggable SARS-CoV-2 proteome.
Molecular mechanism of RNA synthesis
Structural and bioinformatics work classified the coronavirus polymerase subunit (nsp12) into three domains: an N-terminal nidovirus RdRp-associated nucleotidyltransferase (NiRAN) domain (residues 1–250), an interface region between the NiRAN domain and the RdRp domain (residues 251–398) and the core RdRp domain (residues 399–932)82,83 (Fig. 4). The core RdRp domain assumes an architecture analogous to a cupped right hand composed of three subdomains: the fingers, palm and thumb84 (Fig. 4b). Within the core RdRp domain, the active site is further subdivided into seven functional features, known as motifs A–G, which are highly conserved across positive-stranded RNA viruses84 (Fig. 4d).
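The domain boundaries listed above can be captured in a small lookup, which is convenient when mapping a residue of interest onto the nsp12 architecture. This is a minimal sketch; the table simply restates the residue ranges from the text, and the function name is hypothetical.

```python
# nsp12 domain boundaries as stated in the text (1-based, inclusive).
NSP12_DOMAINS = [
    ("NiRAN", 1, 250),
    ("interface", 251, 398),
    ("RdRp core", 399, 932),
]


def nsp12_domain(residue: int) -> str:
    """Return the name of the nsp12 domain that contains `residue`."""
    for name, start, end in NSP12_DOMAINS:
        if start <= residue <= end:
            return name
    raise ValueError(f"residue {residue} is outside nsp12 (1-932)")


for res in (100, 300, 700):
    print(res, nsp12_domain(res))
```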
Studies of smaller RdRps, amenable to X-ray crystallography, have provided a wealth of information on the role of these conserved structural motifs during the nucleotide addition cycle84,85. Initial nucleotide recognition is mediated by nonspecific charge–charge interactions of the nucleotide substrate with a series of positively charged Lys and Arg residues in motifs D and F of the nsp12 RdRp domain86. Molecular dynamics simulations of the hepatitis C virus RdRp indicate that the nucleotide diffuses into the central cleft through the NTP entry channel, until it reaches the active site located in the main channel86. The nucleotide ribose and base moieties subsequently flip into the active site through stabilizing hydrogen-bonding interactions with residues in motifs A and B and base-specific interactions with residues in motif F to form a Watson–Crick base pair with the template nucleotide86,87,88 (Fig. 4d). Correct positioning of the incoming nucleotide stabilizes the two catalytic Mg2+ ions that are necessary for the condensation reaction via interactions with the α and β phosphates of the incoming nucleotide, the product RNA 3′-hydroxy group and the catalytic Asp residues of motif C87,89. Closure of the RdRp active site via the stabilizing interactions of motifs A and B with the base enables the acid–base chemistry that drives the attack of the deprotonated product RNA 3′-hydroxy on the α-phosphorus atom of the incoming NTP, resulting in nucleotide addition and the release of pyrophosphate87,89.
The conformational state immediately after catalysis, in which the product RNA 3′ base occupies the incoming-nucleotide site, is referred to as the ‘pre-translocated state’ (Fig. 5). The conversion into the ‘post-translocated’ state mandates the release of pyrophosphate via opening of the active site through a subtle rotation of motif A84. Entry into the post-translocated state resets the active site for the next nucleotide addition cycle. A generalized kinetic scheme indicates that forward translocation is driven by the high nucleotide concentrations in the cellular milieu, which saturate the incoming-nucleotide site (Fig. 5). Failure to translocate would arrest the RTC and lead to the termination of RNA synthesis unless the translocation impediment is cleared. As we discuss later, one mechanism of action of remdesivir, an RdRp inhibitor used for COVID-19 treatment, can be summarized as hindering RdRp translocation through steric effects of its base in the active site.
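The nucleotide addition cycle described above can be pictured as a loop through a small number of states. The sketch below is a deliberately simplified illustration under stated assumptions: the four state names and the function `run_cycles` are hypothetical labels chosen here, the cycle is treated as strictly sequential, and no rates or concentrations are modelled. It only shows that a block at the translocation step halts the cycle in the pre-translocated state.

```python
from enum import Enum


class State(Enum):
    POST_TRANSLOCATED = "post-translocated, incoming-nucleotide site empty"
    NTP_BOUND = "NTP bound, active site open"
    CLOSED = "active site closed, catalysis"
    PRE_TRANSLOCATED = "pre-translocated, product 3' base in NTP site"


# Assumed strictly sequential order of the cycle (a simplification).
NEXT = {
    State.POST_TRANSLOCATED: State.NTP_BOUND,
    State.NTP_BOUND: State.CLOSED,
    State.CLOSED: State.PRE_TRANSLOCATED,
    State.PRE_TRANSLOCATED: State.POST_TRANSLOCATED,
}


def run_cycles(product_length, n_cycles, translocation_blocked=False):
    """Step through the cycle; return (final product length, final state)."""
    state = State.POST_TRANSLOCATED
    completed = 0
    while completed < n_cycles:
        if state is State.PRE_TRANSLOCATED and translocation_blocked:
            break  # arrested: translocation cannot occur
        if state is State.CLOSED:
            product_length += 1  # catalysis adds one nucleotide
        state = NEXT[state]
        if state is State.POST_TRANSLOCATED:
            completed += 1  # active site reset for the next cycle
    return product_length, state


if __name__ == "__main__":
    print(run_cycles(10, 3))
    print(run_cycles(10, 3, translocation_blocked=True))
```

In the blocked case the product still gains one nucleotide, because the block acts after catalysis, which mirrors the idea that an impediment to translocation arrests the complex rather than preventing the preceding addition.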
The enigmatic NiRAN domain of coronavirus nsp12
The NiRAN domain (Fig. 4b) attracted considerable interest following revelations that it is an essential enzymatic domain that is absent in RNA viruses outside the order Nidovirales82. Early investigations revealed that the NiRAN domain of the RdRp of the arterivirus equine arteritis virus possesses a self-nucleotidylating activity (NMPylation), which may prime the domain to transfer a nucleoside monophosphate (NMP) to another protein or nucleic acid substrate82. This activity is essential for nidovirus replication, as mutations that catalytically inactivate the NiRAN domains of equine arteritis virus and SARS-CoV were lethal82. Recent studies of the coronavirus NiRAN domain suggest that it might function as an RNA ligase, serve as a guanylyltransferase that catalyses the transfer of guanosine monophosphate (GMP) during mRNA capping (Box 2) or transfer an NMP to another viral protein to serve in protein-primed initiation of RNA synthesis82. It is possible that the NiRAN domain performs several or all of these activities, depending on the substrate and context. The possible role of the NiRAN domain in the capping mechanism is discussed in Box 2, whereas its RNA ligase activity has not been demonstrated so far.
The NiRAN domain’s possible role in protein-primed RNA synthesis warrants further exploration. Members of another clade of positive-strand RNA viruses, the order Picornavirales, initiate RNA synthesis using the protein primer viral protein genome-linked (VPg). A dinucleotide (UpU) that is covalently linked to a hydroxy group of a tyrosine, serine or threonine residue in VPg serves to prime RNA synthesis on the poly(A)-tailed template90. Consistent with the earlier work on equine arteritis virus82, the NiRAN domains of SARS-CoV, human coronavirus 229E and SARS-CoV-2 display higher specificity in vitro for UTP over GTP (UMPylation activity over GMPylation activity)91,92. Given this UTP specificity, it was posited that NiRAN could facilitate the UMPylation of a priming protein to initiate minus-strand RNA synthesis at the 3′ poly(A) tail of the gRNA template92. Recent in vitro evidence indicates that nsp9, a single-stranded RNA-binding protein, is a substrate for UMPylation at or near its N terminus92. Consistent with this finding, mutagenesis of the N-terminal residues of nsp9 severely affected viral replication. These results, combined with a structure of nsp9 bound to the NiRAN domain in an inhibited state93, implicate nsp9 in the initiation of RNA synthesis. Given observations that other RTC components and purification contaminants (such as proteins from bacterial expression systems) can be NMPylated in vitro91,92, further studies are needed to define the biologically relevant repertoire of substrates targeted by NiRAN’s NMPylation activity.
The expanded replication machinery
Replication and transcription are assumed to be executed by various subcomplexes, which include the holo-RdRp associated with other non-structural protein enzymes and accessory subunits. These viral enzymes are required to promote the fidelity of RNA synthesis, equip viral mRNAs with a 5′ cap structure and orchestrate the template switching needed for sgRNA synthesis. The interplay between these subunits and their interactions with regulatory viral RNA elements coordinate the timely replication and expression of the coronavirus genome, and provide a platform for continuous coronavirus evolution.
The holo-RdRp
To faithfully replicate the coronavirus genome, an arsenal of factors is needed to enhance the processivity of the RTC and repair errors in RNA synthesis4. Early biochemical experiments revealed that the primer-extension processivity of SARS-CoV RdRp (nsp12) is greatly increased in the presence of nsp7 and nsp8, cementing their role as essential subunits of the holo-RdRp complex94. Advancements in cryo-electron microscopy led to the seminal structure of the SARS-CoV apo-holo-RdRp complex83 and more recently to several structures of the SARS-CoV-2 RTC95,96,97,98. In the presence of an RNA duplex, the N termini of two nsp8 subunits form ordered helices that have nonspecific ionic interactions with the RNA backbone, illuminating the importance of nsp8 for enhanced RdRp processivity95 (Fig. 4a,c). In addition to enhancing processivity, nsp8 interacts in vitro with various other non-structural proteins thought to assist the RTC99,100. Thus, nsp8 is posited to be important for forming higher-order RTCs that couple replication and transcription with template unwinding (nsp13), proofreading (nsp10, nsp12, nsp13 and nsp14) and RNA capping (nsp10, nsp13, nsp14 and nsp16).
The exoribonuclease activity of nsp14
Coronaviruses encode a unique proofreading activity that is not found in other RNA viruses, including in nidoviruses with small genomes such as arteriviruses3,22,101. This activity is encoded in the N-terminal exoribonuclease (ExoN) domain of nsp14, which together with nsp10 forms an RNA proofreading complex102,103,104 that is presumed to promote faithful replication of large nidovirus genomes22. In some coronaviruses, such as murine hepatitis virus (MHV) and SARS-CoV, nsp14-ExoN knockout yielded crippled but viable viruses exhibiting a mutator phenotype105,106,107. Remarkably, equivalent ExoN-inactivating substitutions completely abolish MERS-CoV and SARS-CoV-2 replication, suggesting a function for ExoN in primary RNA synthesis108. Because of its role in enhancing the fidelity of genome replication, nsp14 ExoN can promote antiviral drug resistance. For example, in an in vitro assay, its 3′-5′ exoribonuclease activity can efficiently cleave ribavirin, an antiviral nucleoside analogue, from the 3′ end of an RNA substrate, indicating that the enzyme may promote high-level resistance to certain nucleoside analogue antiviral drugs109. Recent cryo-electron microscopy structures have revealed the molecular basis for how the ExoN domain of the nsp10–nsp14 complex interacts with double-stranded RNA containing a 5′ overhang and a one-nucleotide mismatch at the 3′ end110. The mismatched base enters the shallow ExoN active site and interacts with conserved catalytic residues via its 3′-hydroxy and 2′-hydroxy groups. In addition, the double-stranded portion of the RNA interacts with both the nsp10 N terminus and nsp14-ExoN residues outside the catalytic site. These structures provide direct visualization of recognition by ExoN of its preferred mismatched RNA substrate110.
Colliding motors: how the helicase induces RdRp backtracking
Nsp13 is an SF1B-family RNA helicase that is essential for coronavirus replication111,112,113,114,115. Biochemical assays revealed that its 5′-3′ nucleic acid unwinding activity is enhanced twofold in the presence of nsp12 (ref.116). Nsp13 makes stable interactions with the SARS-CoV-2 RTC, which enabled the structural determination of nsp13 bound to the RTC (nsp13₂–RTC)69,117. Single-particle image classification revealed that the major particle class consists of two molecules of nsp13 bound to the RTC. Both nsp13 molecules (nsp13F (fingers) and nsp13T (thumb)) have extensive interactions with the holo-RdRp, whereas only one copy (nsp13T) is bound to the RNA scaffold at the 5′ end of the template RNA (Fig. 4a,c). Nsp13T, bound to the template strand downstream of the RdRp active site, is positioned to translocate in the opposite direction relative to the RdRp69 (Fig. 5). The opposing directionalities of nsp13 and the RdRp are hypothesized to trigger a translocation conflict. Forward translocation of nsp13T on the template strand was proposed to lead to the reverse threading of the RdRp on the product RNA strand69. The reverse movement of polymerases, relative to their nucleic acid substrate, is well known for all cellular RNA polymerases and is termed ‘backtracking’118 (Fig. 5). The role of the second nsp13, nsp13F, is poorly understood but it has been proposed to regulate the unwinding activity of the substrate-bound nsp13T (ref.117).
A perspective on the role of backtracking in proofreading by the RdRp
Similarly to its role in cellular RNA polymerases, backtracking may be essential for excision of misincorporated nucleotides in coronavirus RNA synthesis69,70,119,120. Indeed, behaviours consistent with backtracking have been observed in single-molecule magnetic tweezer experiments for the SARS-CoV-2 RdRp and RdRps from the Φ6 bacteriophage and poliovirus, illuminating the potentially widespread nature of backtracking in the viral realm119,120,121. Furthermore, recent evidence indicates that the NTP entry channel can accommodate a single-stranded product RNA 3′ overhang, which mirrors the backtracking product70. Molecular dynamics simulations further showed that entry into the backtracking state occurs when a misincorporated RNA base flips from the pre-translocated state towards the mouth of the NTP entry channel70. Subsequently, the engagement of nsp13 with the template RNA would enhance the backtracking activity and offer a means to control entry into a long-lived backtracked state (Fig. 5).
Backtracking may grant nsp14 ExoN access to any misincorporated nucleotide at the 3′ end of nascent product RNAs, thereby coupling proofreading with RNA synthesis69,70. Alternatively, as suggested by the recent cryo-electron microscopy structures of nsp10–nsp14 bound to a double-stranded RNA substrate with a 5′ overhang and a 3′ nucleotide mismatch110, misincorporation events may lead the RdRp to release the mismatch-containing RNA duplex, thereby granting ExoN access for proofreading. Additionally, it is envisaged that backtracking could have a role in discontinuous RNA synthesis by exposing the anti-TRS-B at the 3′ end of the nascent minus-strand sgRNAs and mediating template switching69,70 (Fig. 3b). This hypothesis is supported by a mutation in the arterivirus helicase that does not affect genome replication but abolishes all sgRNA transcription68,122.
Coupling of backtracking with nsp14-ExoN activity could also explain how coronaviruses excise non-natural nucleotides from their nascent product RNA. Genetic loss of function experiments indicated that nsp14 ExoN mitigates the effect of nucleoside and base analogues such as ribavirin and fluorouracil, respectively, in the betacoronavirus MHV102,105,106. These inhibitors are ineffective for treating SARS-CoV, MERS-CoV and SARS-CoV-2 infections, highlighting that the protection conferred by nsp14 ExoN is of clinical concern in the hunt for promising nucleoside analogues106,123. Combination therapy approaches may yield fruitful outcomes given the likely synergy between nsp14-ExoN and nsp12 (RdRp) inhibition. Therefore, it is pertinent to better understand how nsp14 is recruited for excision and repair124. In addition, shedding light on why some nucleoside analogues, such as remdesivir and molnupiravir (discussed in the next section), are effective could provide valuable insight into the design of novel antiviral nucleoside analogues for monotherapy.
A rational design of RdRp inhibitors
Although vaccines against SARS-CoV-2 have shown remarkable efficacy, COVID-19 continues to spread and affect communities globally. The reasons for this are multiple, and include vaccine shortages, public vaccine hesitancy, reduced vaccine effectiveness in immunosuppressed people and the emergence of new virus variants. It is therefore anticipated that SARS-CoV-2 will become endemic125, potentially evolving in the human host and leading to gradual or more sudden reductions of vaccine efficacy. Given such concerns, the search for drugs against SARS-CoV-2 and related viruses remains a priority in the research community. In this section, we discuss the mechanisms of action of two RdRp inhibitors, remdesivir and molnupiravir, that show clinical benefit in treating COVID-19.
Mechanisms of action of remdesivir and molnupiravir
Nucleoside analogues, which can target the RdRp, are common antiviral therapeutics125. Currently, remdesivir and molnupiravir are two antiviral drugs used to treat COVID-19 (ref.126). Studies indicate that treatment with remdesivir decreases the duration of the infection in hospitalized patients127. Biochemical evidence demonstrates that the SARS-CoV-2 RdRp preferably incorporates remdesivir (Fig. 6a) rather than its natural analogue adenosine and can incorporate molnupiravir (Fig. 6b) rather than its natural analogue cytidine126,128,129,130,131. Once incorporated, neither inhibitor induces immediate pausing of RNA synthesis, in contrast to classical chain terminators126,129,131 (Fig. 6c–e). Initial studies had suggested that remdesivir inhibits RNA synthesis via a delayed chain termination mechanism126,130,132,133. Delayed chain termination occurs when remdesivir impedes RdRp translocation following a steric clash between its nitrile group and nsp12 Ser861, which occurs when remdesivir reaches the fourth position from the 3′ end of the product RNA126,132,133 (Fig. 6f). This steric inhibition is surmounted in vitro in the presence of subphysiological concentrations of NTPs, indicating that it is unlikely to be the major inhibitory hurdle for viral replication in living cells. Instead, recent data suggest that remdesivir may impair replication when incorporated into the template strand following an initial round of viral RNA synthesis134. In the template strand, remdesivir hinders the incorporation of the incoming nucleotide. This mode of activity has been termed ‘template-dependent inhibition’134 (Fig. 6d). Following the eventual incorporation of this incoming nucleotide, a second potential checkpoint was proposed, in which remdesivir would bias the RdRp towards the pre-translocated state, although direct evidence for this is lacking134.
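The positional logic of delayed chain termination described above (a stall once remdesivir occupies the fourth position from the 3′ end of the product) can be illustrated with a toy extension loop. This is a sketch under loud assumptions: the function `extend_until_stall` and the use of the letter `R` to denote incorporated remdesivir are hypothetical conventions adopted here, and the stall is treated as absolute, whereas the text notes that it can be overcome in vitro by NTPs.

```python
# From the text: the clash occurs when the analogue reaches the fourth
# position from the 3' end of the product RNA.
STALL_POSITION_FROM_3_END = 4


def extend_until_stall(product: str, incoming: str, analogue: str = "R") -> str:
    """Append incoming nucleotides to the product's 3' end until a stall.

    'R' is a hypothetical symbol for incorporated remdesivir.
    The stall is modelled as absolute, which is a simplification.
    """
    chain = list(product)
    for nucleotide in incoming:
        at_stall_site = (
            len(chain) >= STALL_POSITION_FROM_3_END
            and chain[-STALL_POSITION_FROM_3_END] == analogue
        )
        if at_stall_site:
            break  # translocation impeded; no further addition
        chain.append(nucleotide)
    return "".join(chain)


if __name__ == "__main__":
    # Remdesivir is incorporated, then three more nucleotides are added.
    print(extend_until_stall("ACGU", "RACGUAC"))
```

In this toy model, synthesis continues for exactly three nucleotides after the analogue is incorporated, which is why the termination is described as delayed rather than immediate.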
Like remdesivir, molnupiravir is a prodrug that is converted in cells into its triphosphate form, thereby serving as a nucleotide analogue. Molnupiravir inhibits replication through lethal mutagenesis of the genomes of multiple viruses, including SARS-CoV-2 (refs135,136,137,138). Molnupiravir treatment presents a high barrier to resistance in cell culture assays135,136. Importantly, like remdesivir, molnupiravir escapes from the coronavirus nsp14-ExoN proofreading activity136. Unlike remdesivir, molnupiravir is delivered orally, which, combined with its high barrier to resistance and potent antiviral activity, led to its pursuit as an alternative therapeutic for COVID-19 (ref.139). Molnupiravir triphosphate is a cytidine analogue that exerts its effect by indiscriminately serving as a template for the incorporation of either adenine or guanine, thus explaining the observation of the transition mutations G>A and C>U in coronaviruses exposed to molnupiravir129 (Fig. 6e). Two recently resolved cryo-electron microscopy structures of molnupiravir base-paired with adenine or guanine revealed the structural basis of molnupiravir-mediated lethal mutagenesis131.
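The templating ambiguity described above is enough to account for both transition types, as the following enumeration illustrates. It is a minimal sketch with hypothetical names (`COMPLEMENT`, `M_PAIRS`, `copy_strand`), the letter `M` stands for incorporated molnupiravir, and strands are written in template register rather than reverse-complemented for readability. The only rule taken from the text is that M in a template directs incorporation of either A or G.

```python
from itertools import product as cartesian

COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}
M_PAIRS = ("A", "G")  # from the text: M templates either A or G


def copy_strand(template: str) -> list[str]:
    """Return all possible copies of a template, position by position.

    Copies are written in template register (not reversed), a
    simplification made for readability.
    """
    choices = [M_PAIRS if base == "M" else (COMPLEMENT[base],) for base in template]
    return sorted("".join(combo) for combo in cartesian(*choices))


if __name__ == "__main__":
    # Case 1: plus-strand G is copied with M (in place of C) in the minus
    # strand; copying that M back yields plus-strand G or A (G>A).
    print("G ->", copy_strand("M"))

    # Case 2: M replaces C in a new plus strand; the minus strand receives
    # A or G, and the next plus strand receives U or C (C>U).
    minus_options = copy_strand("M")
    plus_options = sorted({p for m in minus_options for p in copy_strand(m)})
    print("C ->", plus_options)
```

Because each incorporated M doubles the number of possible copies, the number of variant sequences grows quickly with the number of incorporation events, which is one way to picture lethal mutagenesis.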
The RdRp possesses high selectivity for remdesivir and molnupiravir due to their excellent mimicry of natural nucleotides. Therefore, these compounds do not significantly affect the initial round of RNA synthesis after incorporation, a feature which likely reduces their recognition and excision by nsp14 ExoN surveying the fidelity of RTCs.
Overcoming the proofreading barrier in antiviral drug design
Designing nucleoside analogues that escape the proofreading activity of nsp14 ExoN is a trial-and-error endeavour since it is challenging to pinpoint chemical properties that would lead to nucleotide mimicry. A more rational approach could entail targeting the enzymatic activity of ExoN or its interfaces with nsp10 or the RTC. The ExoN activity is essential for SARS-CoV-2 and MERS-CoV replication, but it is not vital for viral propagation across the betacoronavirus clade108. Inactivation of nsp14 ExoN in MHV and SARS-CoV, although not lethal, enhances the susceptibility of the virus to nucleoside analogues, highlighting the benefits of dual-inhibition strategies in coronaviruses105. One concern is the potential for off-target effects when the ExoN active site is being targeted with a small-molecule inhibitor, due to structural similarities to other cellular DEDD-family exonucleases. Designing inhibitors against the interface of nsp10 with nsp14 ExoN and nsp16 has attracted interest given that viral replication is abrogated in interface mutants140,141.
A long-standing research interest is the characterization of how the proofreading complex, nsp10–nsp14, interacts with the RTC. Pull-down experiments using a series of truncated proteins indicated that nsp12 interacts with nsp14 and its subdomains109. Recent structural work showed that SARS-CoV-2 nsp10–nsp14 can be recruited to the RTC by forming a covalent link with nsp9, which is bound to the nsp12 NiRAN domain124. Prior observations in MHV inspired the rationale for using an nsp9–nsp10 fusion protein in that study, as ablation of the protease cleavage site between nsp9 and nsp10 in MHV maintains a viable phenotype. This nsp9–nsp10 protease cleavage mutant, however, experienced a pronounced overall defect in RNA synthesis, and the propagation of this mutant was severely compromised142. Given the crippled phenotype of the MHV nsp9–nsp10 cleavage-site mutant142, it remains to be shown whether the same interaction with nsp10–nsp14 can occur when nsp9 and nsp10 are separated. The recent structural analysis of the SARS-CoV-2 nsp12–nsp9–nsp10–nsp14 complex did not reveal any features that would suggest that the incorporation of an nsp9–nsp10 fusion protein affects RTC assembly124. Probing the role of nsp10–nsp14 in greater detail will benefit from single-molecule experiments using reconstituted RTC components. This approach could also test whether nsp14 ExoN alleviates backtracking as proposed69,70. Furthermore, both the engagement of proteins with RTCs and the proposed proofreading activity of nsp14 ExoN will have to be demonstrated in vivo.
Future perspectives
Although the surge of new coronavirus research has expanded our understanding of the molecular mechanisms of SARS-CoV-2 replication and gene expression, the foundation of our knowledge of these processes is built primarily on previous research on other coronaviruses and the distantly related arteriviruses. Corroborating our understanding will likely reveal properties and processes shared between viral species, which are of potential value for the design of pan-coronavirus inhibitors. More specifically, it is pertinent to unravel poorly understood intricacies of spatiotemporal regulation of RNA synthesis in coronaviruses. Still unknown is the complete repertoire of host-cell factors involved in assisting the coronavirus infection cycle and how these factors may, for example, be subverted to assist in the formation of replication organelles or contribute to the formation of RTCs. The spatial segregation of coronavirus replication in virus-induced membranous organelles appears to be a requisite for successful virus propagation, a feature shared among positive-strand RNA viruses, although it remains to be elucidated where in the cell RNA synthesis occurs during the earliest phase of infection.
Downstream of replication organelle formation, key questions include how RNA synthesis is primed on its templates and how regulation of the two major synthesis pathways is orchestrated. Such regulation must be achieved by the concerted action of the replicase proteins and may be further assisted by host factors, whose role in these pathways is relatively unexplored. Regulating RNA synthesis necessitates the faithful maintenance of the encoded genetic information, as unwanted mutations can alter the RNA elements required for processes such as template switching and lead to the production of nonsense transcripts. Yet to be worked out in detail is how the coronavirus proofreading complex coordinates its activity with the polymerase, leading to the excision of misincorporated RNA nucleotides and nucleotide analogues. Understanding how mutations accumulate during replication and how they are corrected can inform us on the evolution of drug resistance mutations and aid the design of inhibitors that directly target the replicase complex. We hope that such considerations may guide research that will shape our response to future deadly outbreaks of coronaviruses, a consequence of the encroaching footprint of humanity on the natural world.
References
Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 5, 536–544 (2020).
Gulyaeva, A. A. & Gorbalenya, A. E. A nidovirus perspective on SARS-CoV-2. Biochem. Biophys. Res. Commun. 538, 24–34 (2021).
Gorbalenya, A. E., Enjuanes, L., Ziebuhr, J. & Snijder, E. J. Nidovirales: evolving the largest RNA virus genome. Virus Res. 117, 17–37 (2006).
Snijder, E. J., Decroly, E. & Ziebuhr, J. in Advances in Virus Research Vol. 96 (ed. Ziebuhr, J.) 59–126 (Elsevier, 2016).
Saberi, A., Gulyaeva, A. A., Brubacher, J. L., Newmark, P. A. & Gorbalenya, A. E. A planarian nidovirus expands the limits of RNA genome size. PLOS Pathog. 14, e1007314 (2018).
Hilgenfeld, R. & Peiris, M. From SARS to MERS: 10 years of research on highly pathogenic human coronaviruses. Antivir. Res. 100, 286–295 (2013).
Subissi, L. et al. SARS-CoV ORF1b-encoded nonstructural proteins 12-16: replicative enzymes as antiviral targets. Antivir. Res. 101, 122–130 (2014).
Zumla, A., Chan, J. F. W., Azhar, E. I., Hui, D. S. C. & Yuen, K.-Y. Coronaviruses - drug discovery and therapeutic options. Nat. Rev. Drug Discov. 15, 327–347 (2016).
Pruijssers, A. J. & Denison, M. R. Nucleoside analogues for the treatment of coronavirus infections. Curr. Opin. Virol. 35, 57–62 (2019).
V’kovski, P., Kratzel, A., Steiner, S., Stalder, H. & Thiel, V. Coronavirus biology and replication: implications for SARS-CoV-2. Nat. Rev. Microbiol. 19, 155–170 (2021).
Whittaker, G. R., Daniel, S. & Millet, J. K. Coronavirus entry: how we arrived at SARS-CoV-2. Curr. Opin. Virol. 47, 113–120 (2021).
Shang, J. et al. Cell entry mechanisms of SARS-CoV-2. Proc. Natl Acad. Sci. USA 117, 11727–11734 (2020).
Ou, X. et al. Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nat. Commun. 11, 1620 (2020).
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 265–269 (2020).
Ziv, O. et al. The short- and long-range RNA-RNA interactome of SARS-CoV-2. Mol. Cell 80, 1067–1077.e5 (2020).
Manfredonia, I. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
Rangan, R. et al. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res. 49, 3092–3108 (2021).
Huston, N. C. et al. Comprehensive in vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms. Mol. Cell 81, 584–598.e5 (2021).
Finkel, Y. et al. The coding capacity of SARS-CoV-2. Nature 589, 125–130 (2021).
Ziebuhr, J., Snijder, E. J. & Gorbalenya, A. E. Virus-encoded proteinases and proteolytic processing in the Nidovirales. J. Gen. Virol. 81, 853–879 (2000).
Snijder, E. J. et al. Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 Lineage. J. Mol. Biol. 331, 991–1004 (2003).
Lokugamage, K. G., Narayanan, K., Huang, C. & Makino, S. Severe acute respiratory syndrome coronavirus protein nsp1 is a novel eukaryotic translation inhibitor that represses multiple steps of translation initiation. J. Virol. 86, 13598–13608 (2012).
Thoms, M. et al. Structural basis for translational shutdown and immune evasion by the Nsp1 protein of SARS-CoV-2. Science 369, 1249 (2020).
Schubert, K. et al. SARS-CoV-2 Nsp1 binds the ribosomal mRNA channel to inhibit translation. Nat. Struct. Mol. Biol. 27, 959–966 (2020).
Cortese, M. et al. Integrative imaging reveals SARS-CoV-2-induced reshaping of subcellular morphologies. Cell Host Microbe 28, 853–866.e5 (2020).
Snijder, E. J. et al. A unifying structural and functional model of the coronavirus replication organelle: tracking down RNA synthesis. PLoS Biol. 18, e3000715 (2020).
Ghosh, S. et al. β-Coronaviruses use lysosomes for egress instead of the biosynthetic secretory pathway. Cell 183, 1520–1535.e14 (2020).
Wang, D. et al. The SARS-CoV-2 subgenome landscape and its novel regulatory features. Mol. Cell 81, 2135–2147.e5 (2021).
Kim, D. et al. The architecture of SARS-CoV-2 transcriptome. Cell 181, 914–921.e10 (2020).
Masters, P. S. & Perlman, S. in Fields Virology (eds Knipe, D. M. & Howley, P. M.) 825–858 (Lippincott, Williams & Wilkins, 2013).
Jungreis, I. et al. Conflicting and ambiguous names of overlapping ORFs in the SARS-CoV-2 genome: a homology-based resolution. Virology 558, 145–151 (2021).
Jungreis, I., Sealfon, R. & Kellis, M. SARS-CoV-2 gene content and COVID-19 mutation impact by comparing 44 Sarbecovirus genomes. Nat. Commun. 12, 2642 (2021).
Firth, A. E. A putative new SARS-CoV protein, 3c, encoded in an ORF overlapping ORF3a. J. Gen. Virol. 101, 1085–1089 (2020).
Bojkova, D. et al. Proteomics of SARS-CoV-2-infected host cells reveals therapy targets. Nature 583, 469–472 (2020).
Firth, A. E. & Brierley, I. Non-canonical translation in RNA viruses. J. Gen. Virol. 93, 1385–1409 (2012).
Schaecher, S. R., Mackenzie, J. M. & Pekosz, A. The ORF7b protein of severe acute respiratory syndrome coronavirus (SARS-CoV) is expressed in virus-infected cells and incorporated into SARS-CoV particles. J. Virol. 81, 718–731 (2007).
Xu, K. et al. Severe acute respiratory syndrome coronavirus accessory protein 9b is a virion-associated protein. Virology 388, 279–285 (2009).
Brierley, I., Digard, P. & Inglis, S. C. Characterization of an efficient coronavirus ribosomal frameshifting signal: requirement for an RNA pseudoknot. Cell 57, 537–547 (1989).
Bhatt, P. R. et al. Structural basis of ribosomal frameshifting during translation of the SARS-CoV-2 RNA genome. Science 372, 1306–1313 (2021).
Plant, E. P., Rakauskaite, R., Taylor, D. R. & Dinman, J. D. Achieving a golden mean: mechanisms by which coronaviruses ensure synthesis of the correct stoichiometric ratios of viral proteins. J. Virol. 84, 4330–4340 (2010).
Su, M.-C., Chang, C.-T., Chu, C.-H., Tsai, C.-H. & Chang, K.-Y. An atypical RNA pseudoknot stimulator and an upstream attenuation signal for -1 ribosomal frameshifting of SARS coronavirus. Nucleic Acids Res. 33, 4265–4275 (2005).
Ivanov, K. A. et al. Major genetic marker of nidoviruses encodes a replicative endoribonuclease. Proc. Natl Acad. Sci. USA 101, 12694–12699 (2004).
Kindler, E. et al. Early endonuclease-mediated evasion of RNA sensing ensures efficient coronavirus replication. PLoS Pathog. 13, e1006195 (2017).
Hackbart, M., Deng, X. & Baker, S. C. Coronavirus endoribonuclease targets viral polyuridine sequences to evade activating host sensors. Proc. Natl Acad. Sci. USA 117, 8094–8103 (2020).
Sawicki, S. G. & Sawicki, D. L. Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands. Adv. Exp. Med. Biol. 380, 499–506 (1995).
Sawicki, S. G., Sawicki, D. L. & Siddell, S. G. A contemporary view of coronavirus transcription. J. Virol. 81, 20–29 (2007).
Pasternak, A. O., Spaan, W. J. M. & Snijder, E. J. Nidovirus transcription: how to make sense…? J. Gen. Virol. 87, 1403–1421 (2006).
Nakagawa, K. & Makino, S. Mechanisms of coronavirus nsp1-mediated control of host and viral gene expression. Cells 10, 300 (2021).
Banerjee, A. K. et al. SARS-CoV-2 disrupts splicing, translation, and protein trafficking to suppress host defenses. Cell 183, 1325–1339.e21 (2020).
Sola, I., Moreno, J. L., Zúñiga, S., Alonso, S. & Enjuanes, L. Role of nucleotides immediately flanking the transcription-regulating sequence core in coronavirus subgenomic mRNA synthesis. J. Virol. 79, 2506–2516 (2005).
Yount, B., Roberts, R. S., Lindesmith, L. & Baric, R. S. Rewiring the severe acute respiratory syndrome coronavirus (SARS-CoV) transcription circuit: engineering a recombination-resistant genome. Proc. Natl Acad. Sci. USA 103, 12546–12551 (2006).
Sola, I., Almazán, F., Zúñiga, S. & Enjuanes, L. Continuous and discontinuous RNA synthesis in coronaviruses. Annu. Rev. Virol. 2, 265–288 (2015).
Pasternak, A. O., van den Born, E., Spaan, W. J. & Snijder, E. J. Sequence requirements for RNA strand transfer during nidovirus discontinuous subgenomic RNA synthesis. EMBO J. 20, 7220–7228 (2001).
van den Born, E., Posthuma, C. C., Gultyaev, A. P. & Snijder, E. J. Discontinuous subgenomic RNA synthesis in arteriviruses is guided by an RNA hairpin structure located in the genomic leader region. J. Virol. 79, 6312–6324 (2005).
Ogando, N. S. et al. SARS-coronavirus-2 replication in Vero E6 cells: replication kinetics, rapid adaptation and cytopathology. J. Gen. Virol. 101, 925–940 (2020).
Davidson, A. D. et al. Characterisation of the transcriptome and proteome of SARS-CoV-2 reveals a cell passage induced in-frame deletion of the furin-like cleavage site from the spike glycoprotein. Genome Med. 12, 68 (2020).
Brian, D. A. & Baric, R. S. Coronavirus genome structure and replication. Curr. Top. Microbiol. Immunol. 287, 1–30 (2005).
Masters, P. S. & Rottier, P. J. M. Coronavirus reverse genetics by targeted RNA recombination. Curr. Top. Microbiol. Immunol. 287, 133–159 (2005).
Yang, Y., Yan, W., Hall, A. B. & Jiang, X. Characterizing transcriptional regulatory sequences in coronaviruses and their role in recombination. Mol. Biol. Evol. 38, 1241–1248 (2021).
Chrisman, B. S. et al. Indels in SARS-CoV-2 occur at template-switching hotspots. BioData Min. 14, 20 (2021).
Irigoyen, N. et al. High-resolution analysis of coronavirus gene expression by RNA sequencing and ribosome profiling. PLoS Pathog. 12, e1005473 (2016).
Viehweger, A. et al. Direct RNA nanopore sequencing of full-length coronavirus genomes provides novel insights into structural variants and enables modification analysis. Genome Res. 29, 1545–1554 (2019).
Snijder, E. J., Kikkert, M. & Fang, Y. Arterivirus molecular biology and pathogenesis. J. Gen. Virol. 94, 2141–2163 (2013).
Nedialkova, D. D., Gorbalenya, A. E. & Snijder, E. J. Arterivirus Nsp1 modulates the accumulation of minus-strand templates to control the relative abundance of viral mRNAs. PLoS Pathog. 6, e1000772 (2010).
Tijms, M. A., Nedialkova, D. D., Zevenhoven-Dobbe, J. C., Gorbalenya, A. E. & Snijder, E. J. Arterivirus subgenomic mRNA synthesis and virion biogenesis depend on the multifunctional nsp1 autoprotease. J. Virol. 81, 10496–10505 (2007).
van Dinten, L. C., den Boon, J. A., Wassenaar, A. L., Spaan, W. J. & Snijder, E. J. An infectious arterivirus cDNA clone: identification of a replicase point mutation that abolishes discontinuous mRNA transcription. Proc. Natl Acad. Sci. USA 94, 991–996 (1997).
van Marle, G., van Dinten, L. C., Spaan, W. J. M., Luytjes, W. & Snijder, E. J. Characterization of an equine arteritis virus replicase mutant defective in subgenomic mRNA synthesis. J. Virol. 73, 5274 (1999).
Chen, J. et al. Structural basis for helicase-polymerase coupling in the SARS-CoV-2 replication-transcription complex. Cell 182, 1560–1573.e13 (2020).
Malone, B. et al. Structural basis for backtracking by the SARS-CoV-2 replication–transcription complex. Proc. Natl Acad. Sci. USA 118, e2102516118 (2021).
Masters, P. S. The molecular biology of coronaviruses. Adv. Virus Res. 66, 193–292 (2006).
Chang, C., Hou, M.-H., Chang, C.-F., Hsiao, C.-D. & Huang, T. The SARS coronavirus nucleocapsid protein — forms and functions. Antivir. Res. 103, 39–50 (2014).
Bai, Z., Cao, Y., Liu, W. & Li, J. The SARS-CoV-2 nucleocapsid protein and its role in viral structure, biological functions, and a potential target for drug or vaccine mitigation. Viruses 13, 1115 (2021).
Thiel, V., Herold, J., Schelle, B. & Siddell, S. G. Viral replicase gene products suffice for coronavirus discontinuous transcription. J. Virol. 75, 6676–6681 (2001).
Schelle, B., Karl, N., Ludewig, B., Siddell, S. G. & Thiel, V. Selective replication of coronavirus genomes that express nucleocapsid protein. J. Virol. 79, 6620–6630 (2005).
Zúñiga, S. et al. Coronavirus nucleocapsid protein facilitates template switching and is required for efficient transcription. J. Virol. 84, 2169–2175 (2010).
Cong, Y. et al. Nucleocapsid protein recruitment to replication-transcription complexes plays a crucial role in coronaviral life cycle. J. Virol. 94, e01925-19 (2020).
Wu, C.-H., Chen, P.-J. & Yeh, S.-H. Nucleocapsid phosphorylation and RNA helicase DDX1 recruitment enables coronavirus transition from discontinuous to continuous transcription. Cell Host Microbe 16, 462–472 (2014).
Savastano, A., Ibáñez de Opakua, A., Rankovic, M. & Zweckstetter, M. Nucleocapsid protein of SARS-CoV-2 phase separates into RNA-rich polymerase-containing condensates. Nat. Commun. 11, 6041 (2020).
Biebricher, C. K. & Eigen, M. What is a quasispecies? Curr. Top. Microbiol. Immunol. 299, 1–31 (2006).
Domingo, E., Sheldon, J. & Perales, C. Viral quasispecies evolution. Microbiol. Mol. Biol. Rev. 76, 159–216 (2012).
Lehmann, K. C. et al. Discovery of an essential nucleotidylating activity associated with a newly delineated conserved domain in the RNA polymerase-containing protein of all nidoviruses. Nucleic Acids Res. 43, 8416–8434 (2015).
Kirchdoerfer, R. N. & Ward, A. B. Structure of the SARS-CoV nsp12 polymerase bound to nsp7 and nsp8 co-factors. Nat. Commun. 10, 2342 (2019).
Peersen, O. B. Picornaviral polymerase structure, function, and fidelity modulation. Virus Res. 234, 4–20 (2017).
Jia, H. & Gong, P. A structure-function diversity survey of the RNA-dependent RNA polymerases from the positive-strand RNA viruses. Front. Microbiol. 10, 1945 (2019).
Ben Ouirane, K., Boulard, Y. & Bressanelli, S. The hepatitis C virus RNA-dependent RNA polymerase directs incoming nucleotides to its active site through magnesium-dependent dynamics within its F motif. J. Biol. Chem. 294, 7573–7587 (2019).
Shu, B. & Gong, P. Structural basis of viral RNA-dependent RNA polymerase catalysis and translocation. Proc. Natl Acad. Sci. USA 113, E4005–E4014 (2016).
Gong, P. & Peersen, O. B. Structural basis for active site closure by the poliovirus RNA-dependent RNA polymerase. Proc. Natl Acad. Sci. USA 107, 22505–22510 (2010).
Steitz, T. A. A mechanism for all polymerases. Nature 391, 231–232 (1998).
Paul, A. V. & Wimmer, E. Initiation of protein-primed picornavirus RNA synthesis. Virus Res. 206, 12–26 (2015).
Conti, B. J., Leicht, A. S., Kirchdoerfer, R. N. & Sussman, M. R. Mass spectrometric based detection of protein nucleotidylation in the RNA polymerase of SARS-CoV-2. Commun. Chem. 4, 41 (2021).
Slanina, H. et al. Coronavirus replication–transcription complex: Vital and selective NMPylation of a conserved site in nsp9 by the NiRAN-RdRp subunit. Proc. Natl Acad. Sci. USA 118, e2022310118 (2021).
Yan, L. et al. Cryo-EM structure of an extended SARS-CoV-2 replication and transcription complex reveals an intermediate state in cap synthesis. Cell 184, 184–193.e10 (2021).
Subissi, L. et al. One severe acute respiratory syndrome coronavirus protein complex integrates processive RNA polymerase and exonuclease activities. Proc. Natl Acad. Sci. USA 111, E3900–E3909 (2014).
Hillen, H. S. et al. Structure of replicating SARS-CoV-2 polymerase. Nature 584, 154–156 (2020).
Yin, W. et al. Structural basis for inhibition of the RNA-dependent RNA polymerase from SARS-CoV-2 by remdesivir. Science 368, 1499–1504 (2020).
Gao, Y. et al. Structure of the RNA-dependent RNA polymerase from COVID-19 virus. Science 368, 779–782 (2020).
Wang, Q. et al. Structural basis for RNA replication by the SARS-CoV-2 polymerase. Cell 182, 417–428.e13 (2020).
von Brunn, A. et al. Analysis of intraviral protein-protein interactions of the SARS coronavirus ORFeome. PLoS ONE 2, e459 (2007).
Imbert, I. et al. The SARS-Coronavirus PLnc domain of nsp3 as a replication/transcription scaffolding protein. Virus Res. 133, 136–148 (2008).
Minskaia, E. et al. Discovery of an RNA virus 3′→5′ exoribonuclease that is critically involved in coronavirus RNA synthesis. Proc. Natl Acad. Sci. USA 103, 5108 (2006).
Denison, M. R., Graham, R. L., Donaldson, E. F., Eckerle, L. D. & Baric, R. S. Coronaviruses: an RNA proofreading machine regulates replication fidelity and diversity. RNA Biol. 8, 270–279 (2011).
Bouvet, M. et al. RNA 3′-end mismatch excision by the severe acute respiratory syndrome coronavirus nonstructural protein nsp10/nsp14 exoribonuclease complex. Proc. Natl Acad. Sci. USA 109, 9372–9377 (2012).
Smith, E. C., Sexton, N. R. & Denison, M. R. Thinking outside the triangle: replication fidelity of the largest RNA viruses. Annu. Rev. Virol. 1, 111–132 (2014).
Eckerle, L. D., Lu, X., Sperry, S. M., Choi, L. & Denison, M. R. High fidelity of murine hepatitis virus replication is decreased in nsp14 exoribonuclease mutants. J. Virol. 81, 12135–12144 (2007).
Smith, E. C., Blanc, H., Vignuzzi, M. & Denison, M. R. Coronaviruses lacking exoribonuclease activity are susceptible to lethal mutagenesis: evidence for proofreading and potential therapeutics. PLoS Pathog. 9, e1003565 (2013).
Eckerle, L. D. et al. Infidelity of SARS-CoV Nsp14-exonuclease mutant virus replication is revealed by complete genome sequencing. PLoS Pathog. 6, e1000896 (2010).
Ogando, N. S. et al. The enzymatic activity of the nsp14 exoribonuclease is critical for replication of MERS-CoV and SARS-CoV-2. J. Virol. 94, e01246-20 (2020).
Ferron, F. et al. Structural and molecular basis of mismatch correction and ribavirin excision from coronavirus RNA. Proc. Natl Acad. Sci. USA 115, E162 (2018).
Liu, C. et al. Structural basis of mismatch recognition by a SARS-CoV-2 proofreading enzyme. Science 373, 1142–1146 (2021).
Seybert, A., Hegyi, A., Siddell, S. G. & Ziebuhr, J. The human coronavirus 229E superfamily 1 helicase has RNA and DNA duplex-unwinding activities with 5′-to-3′ polarity. RNA 6, 1056–1068 (2000).
Seybert, A. et al. A complex zinc finger controls the enzymatic activities of nidovirus helicases. J. Virol. 79, 696 (2005).
Tanner, J. A. et al. The severe acute respiratory syndrome (SARS) coronavirus NTPase/helicase belongs to a distinct class of 5′ to 3′ viral helicases. J. Biol. Chem. 278, 39578–39582 (2003).
Lee, N.-R. et al. Cooperative translocation enhances the unwinding of duplex DNA by SARS coronavirus helicase nsP13. Nucleic Acids Res. 38, 7626–7636 (2010).
Ivanov, K. A. et al. Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase. J. Virol. 78, 5619–5632 (2004).
Adedeji, A. O. et al. Mechanism of nucleic acid unwinding by SARS-CoV helicase. PLoS ONE 7, e36521 (2012).
Yan, L. et al. Architecture of a SARS-CoV-2 mini replication and transcription complex. Nat. Commun. 11, 5874 (2020).
Nudler, E. RNA polymerase backtracking in gene regulation and genome instability. Cell 149, 1438–1445 (2012).
Dulin, D. et al. Signatures of nucleotide analog incorporation by an RNA-dependent RNA polymerase revealed using high-throughput magnetic tweezers. Cell Rep. 21, 1063–1076 (2017).
Dulin, D. et al. Backtracking behavior in viral RNA-dependent RNA polymerase provides the basis for a second initiation site. Nucleic Acids Res. 43, 10421–10429 (2015).
Dulin, D. et al. Elongation-competent pauses govern the fidelity of a viral RNA-dependent RNA polymerase. Cell Rep. 10, 983–992 (2015).
van Dinten, L. C., Wassenaar, A. L., Gorbalenya, A. E., Spaan, W. J. & Snijder, E. J. Processing of the equine arteritis virus replicase ORF1b protein: identification of cleavage products containing the putative viral polymerase and helicase domains. J. Virol. 70, 6625 (1996).
Agostini, M. L. et al. Coronavirus susceptibility to the antiviral remdesivir (GS-5734) is mediated by the viral polymerase and the proofreading exoribonuclease. mBio 9, e00221-18 (2018).
Yan, L. et al. Coupling of N7-methyltransferase and 3′-5′ exoribonuclease with SARS-CoV-2 polymerase reveals mechanisms for capping and proofreading. Cell 184, 3474–3485.e11 (2021).
Hall, M. D. et al. Report of the National Institutes of Health SARS-CoV-2 antiviral therapeutics summit. J. Infect. Dis. 224, S1–S21 (2021).
Gordon, C. J. et al. Remdesivir is a direct-acting antiviral that inhibits RNA-dependent RNA polymerase from severe acute respiratory syndrome coronavirus 2 with high potency. J. Biol. Chem. 295, 6785–6797 (2020).
Beigel, J. H. et al. Remdesivir for the treatment of Covid-19 — final report. N. Engl. J. Med. 383, 1813–1826 (2020).
Gordon, C. J., Tchesnokov, E. P., Feng, J. Y., Porter, D. P. & Götte, M. The antiviral compound remdesivir potently inhibits RNA-dependent RNA polymerase from Middle East respiratory syndrome coronavirus. J. Biol. Chem. 295, 4773–4779 (2020).
Gordon, C. J., Tchesnokov, E. P., Schinazi, R. F. & Götte, M. Molnupiravir promotes SARS-CoV-2 mutagenesis via the RNA template. J. Biol. Chem. 297, 100770 (2021).
Dangerfield, T. L., Huang, N. Z. & Johnson, K. A. Remdesivir is effective in combating COVID-19 because it is a better substrate than ATP for the viral RNA-dependent RNA polymerase. iScience 23, 101849 (2020).
Kabinger, F. et al. Mechanism of molnupiravir-induced SARS-CoV-2 mutagenesis. Nat. Struct. Mol. Biol. 28, 740–746 (2021).
Bravo, J. P. K., Dangerfield, T. L., Taylor, D. W. & Johnson, K. A. Remdesivir is a delayed translocation inhibitor of SARS-CoV-2 replication. Mol. Cell 81, 1548–1552.e4 (2021).
Kokic, G. et al. Mechanism of SARS-CoV-2 polymerase stalling by remdesivir. Nat. Commun. 12, 279 (2021).
Tchesnokov, E. P. et al. Template-dependent inhibition of coronavirus RNA-dependent RNA polymerase by remdesivir reveals a second mechanism of action. J. Biol. Chem. 295, 16156–16165 (2020).
Yoon, J.-J. et al. Orally efficacious broad-spectrum ribonucleoside analog inhibitor of influenza and respiratory syncytial viruses. Antimicrob. Agents Chemother. 62, e00766-18 (2018).
Agostini, M. L. et al. Small-molecule antiviral β-d-N4-hydroxycytidine inhibits a proofreading-intact coronavirus with a high genetic barrier to resistance. J. Virol. 93, e01348-19 (2019).
Sheahan, T. P. et al. An orally bioavailable broad-spectrum antiviral inhibits SARS-CoV-2 in human airway epithelial cell cultures and multiple coronaviruses in mice. Sci. Transl Med. 12, eabb5883 (2020).
Urakova, N. et al. β-d-N4-Hydroxycytidine is a potent anti-alphavirus compound that induces a high level of mutations in the viral genome. J. Virol. 92, e01965-17 (2018).
Businesswire. Interim results from phase 2/3 studies of molnupiravir, an investigational oral antiviral therapeutic for mild to moderate COVID-19, presented at the European Congress of Clinical Microbiology & Infectious Diseases (ECCMID). businesswire https://www.businesswire.com/news/home/20210712005251/en/ (2021).
Saramago, M. et al. New targets for drug design: importance of nsp14/nsp10 complex formation for the 3′-5′ exoribonucleolytic activity on SARS-CoV-2. FEBS J. 288, 5130–5147 (2021).
Bouvet, M. et al. Coronavirus Nsp10, a critical co-factor for activation of multiple replicative enzymes. J. Biol. Chem. 289, 25783–25796 (2014).
Deming, D. J., Graham, R. L., Denison, M. R. & Baric, R. S. Processing of open reading frame 1a replicase proteins nsp7 to nsp10 in murine hepatitis virus strain A59 replication. J. Virol. 81, 10280–10291 (2007).
Huang, C. et al. SARS coronavirus nsp1 protein induces template-dependent endonucleolytic cleavage of mRNAs: viral mRNAs are resistant to nsp1-induced RNA cleavage. PLoS Pathog. 7, e1002433 (2011).
Serrano, P. et al. Nuclear magnetic resonance structure of the N-terminal domain of nonstructural protein 3 from the severe acute respiratory syndrome coronavirus. J. Virol. 81, 12049 (2007).
Putics, Á., Filipowicz, W., Hall, J., Gorbalenya, A. E. & Ziebuhr, J. ADP-ribose-1″-monophosphatase: a conserved coronavirus enzyme that is dispensable for viral replication in tissue culture. J. Virol. 79, 12721 (2005).
Alhammad, Y. M. O. et al. The SARS-CoV-2 conserved macrodomain is a mono-ADP-ribosylhydrolase. J. Virol. 95, e01969-20 (2021).
Lee, H. J. et al. The complete sequence (22 kilobases) of murine coronavirus gene 1 encoding the putative proteases and RNA polymerase. Virology 180, 567–582 (1991).
Oudshoorn, D. et al. Expression and cleavage of Middle East respiratory syndrome coronavirus nsp3-4 polyprotein induce the formation of double-membrane vesicles that mimic those associated with coronaviral RNA replication. mBio 8, e01658-17 (2017).
Angelini, M. M., Akhlaghpour, M., Neuman, B. W. & Buchmeier, M. J. Severe acute respiratory syndrome coronavirus nonstructural proteins 3, 4, and 6 induce double-membrane vesicles. mBio 4, e00524-13 (2013).
Wolff, G. et al. A molecular pore spans the double membrane of the coronavirus replication organelle. Science 369, 1395 (2020).
Gorbalenya, A. E., Koonin, E. V., Donchenko, A. P. & Blinov, V. M. Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis. Nucleic Acids Res. 17, 4847–4861 (1989).
Anand, K., Ziebuhr, J., Wadhwani, P., Mesters, J. R. & Hilgenfeld, R. Coronavirus main proteinase (3CLpro) structure: basis for design of anti-SARS drugs. Science 300, 1763 (2003).
Imbert, I. et al. A second, non-canonical RNA-dependent RNA polymerase in SARS coronavirus. EMBO J. 25, 4933–4942 (2006).
Tvarogová, J. et al. Identification and characterization of a human coronavirus 229E nonstructural protein 8-associated RNA 3′-terminal adenylyltransferase activity. J. Virol. 93, e00291-19 (2019).
Egloff, M.-P. et al. The severe acute respiratory syndrome-coronavirus replicative protein nsp9 is a single-stranded RNA-binding subunit unique in the RNA virus world. Proc. Natl Acad. Sci. USA 101, 3792–3796 (2004).
Decroly, E. et al. Crystal structure and functional analysis of the SARS-coronavirus RNA cap 2′-O-methyltransferase nsp10/nsp16 complex. PLoS Pathog. 7, e1002059 (2011).
Ma, Y. et al. Structural basis and functional analysis of the SARS coronavirus nsp14–nsp10 complex. Proc. Natl Acad. Sci. USA 112, 9436–9441 (2015).
Bouvet, M. et al. In vitro reconstitution of SARS-coronavirus mRNA cap methylation. PLoS Pathog. 6, e1000863 (2010).
Chen, Y. et al. Functional screen reveals SARS coronavirus nonstructural protein nsp14 as a novel cap N7 methyltransferase. Proc. Natl Acad. Sci. USA 106, 3484–3489 (2009).
Decroly, E. et al. Coronavirus nonstructural protein 16 is a cap-0 binding enzyme possessing (nucleoside-2′O)-methyltransferase activity. J. Virol. 82, 8071 (2008).
Hartenian, E. et al. The molecular virology of coronaviruses. J. Biol. Chem. 295, 12910–12934 (2020).
Appleby, T. C. et al. Viral replication. Structural basis for RNA replication by the hepatitis C virus polymerase. Science 347, 771–775 (2015).
Gosert, R., Kanjanahaluethai, A., Egger, D., Bienz, K. & Baker, S. C. RNA replication of mouse hepatitis virus takes place at double-membrane vesicles. J. Virol. 76, 3697–3708 (2002).
Knoops, K. et al. SARS-coronavirus replication is supported by a reticulovesicular network of modified endoplasmic reticulum. PLoS Biol. 6, e226 (2008).
Maier, H. J. et al. Infectious bronchitis virus generates spherules from zippered endoplasmic reticulum membranes. mBio 4, e00801-13 (2013).
Klein, S. et al. SARS-CoV-2 structure and replication characterized by in situ cryo-electron tomography. Nat. Commun. 11, 5885 (2020).
Ramanathan, A., Robb, G. B. & Chan, S.-H. mRNA capping: biological functions and applications. Nucleic Acids Res. 44, 7511–7526 (2016).
Ferron, F., Decroly, E., Selisko, B. & Canard, B. The viral RNA capping machinery as a target for antiviral drugs. Antivir. Res. 96, 21–31 (2012).
Ivanov, K. A. & Ziebuhr, J. Human coronavirus 229E nonstructural protein 13: characterization of duplex-unwinding, nucleoside triphosphatase, and RNA 5′-triphosphatase activities. J. Virol. 78, 7833–7838 (2004).
Züst, R. et al. Ribose 2′-O-methylation provides a molecular signature for the distinction of self and non-self mRNA dependent on the RNA sensor Mda5. Nat. Immunol. 12, 137–143 (2011).
Acknowledgements
The authors apologize to colleagues whose work could not be cited due to scope and space limits of the manuscript. Figures 1–3 and the figure in Box 2 were created with BioRender. N.U. was supported by grant LSHM20047 from the Foundation Life Sciences Health-TKI. E.A.C. is grateful for support from NIH grants 2-R01 GM114450 and 1-R01 AI161278. E.J.S. was supported by the #wakeuptocorona crowdfunding initiative of the Leiden University Fund and Leiden University Medical Center Bontius Foundation.
Author information
Contributions
The authors contributed equally to all aspects of the article.
Ethics declarations
Competing interests
E.A.C. has received funding from Gilead Sciences for research on remdesivir’s incorporation into the RNA-dependent RNA polymerase. The other authors declare no competing interests.
Additional information
Peer review information
Nature Reviews Molecular Cell Biology thanks Hauke Hillen, Kenneth Johnson and Yi Shi for their contribution to the peer review of this work.
Related links
GenBank entry MN908947.3: https://www.ncbi.nlm.nih.gov/genome/?term=MN908947.3
Glossary
- RNA genome of positive polarity: An RNA genome that has mRNA polarity and, when released from viral particles, can be used directly by host ribosomes to produce viral proteins.
- Replication and transcription: Process of amplification of the viral genome through a full-length minus-strand intermediate serving as template and synthesis of subgenomic mRNAs through a set of subgenomic minus-strand templates.
- −1 programmed ribosomal frameshift (−1 PRF): A regulated switch of the translating ribosome to an alternative open reading frame by shifting one nucleotide backwards on the mRNA.
- Nucleocapsid: Complex of genomic RNA and nucleocapsid proteins that forms the core of a coronavirus particle.
- Pseudoknot: A structural motif that arises as a result of base pairing between the loop of an RNA hairpin and a complementary single-stranded (unpaired) region within RNA.
- Similarity-assisted copy-choice RNA recombination: A process of template switching during replication of an RNA virus genome that is guided by local sequence complementarity between the nascent strand and an alternative template, resulting in progeny genomes of mixed ancestry.
- RNA replicon: A self-replicating RNA molecule, derived from a viral genome, that contains the replicase gene but lacks at least one essential structural protein gene and thus is unable to produce infectious progeny.
- Error threshold: The size limit of a viral genome, above which too many mutations accumulate to sustain long-term viability of the virus.
About this article
Cite this article
Malone, B., Urakova, N., Snijder, E.J. et al. Structures and functions of coronavirus replication–transcription complexes and their relevance for SARS-CoV-2 drug design. Nat Rev Mol Cell Biol 23, 21–39 (2022). https://doi.org/10.1038/s41580-021-00432-z
DOI: https://doi.org/10.1038/s41580-021-00432-z