Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression

Yan, Shuting; Zhu, Qiyao; Jain, Swati; Schlick, Tamar

doi:10.1038/s41467-022-31353-w

Download PDF

Article
Open access
Published: 25 July 2022

Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression

Shuting Yan¹^na1,
Qiyao Zhu²^na1,
Swati Jain¹ &
…
Tamar Schlick ORCID: orcid.org/0000-0002-2392-2062^1,2,3,4

Nature Communications volume 13, Article number: 4284 (2022) Cite this article

3054 Accesses
13 Citations
4 Altmetric
Metrics details

Subjects

Abstract

The SARS-CoV-2 frameshifting element (FSE), a highly conserved mRNA region required for correct translation of viral polyproteins, defines an excellent therapeutic target against Covid-19. As discovered by our prior graph-theory analysis with SHAPE experiments, the FSE adopts a heterogeneous, length-dependent conformational landscape consisting of an assumed 3-stem H-type pseudoknot (graph motif 3_6), and two alternative motifs (3_3 and 3_5). Here, for the first time, we build and simulate, by microsecond molecular dynamics, 30 models for all three motifs plus motif-stabilizing mutants at different lengths. Our 3_6 pseudoknot systems, which agree with experimental structures, reveal interconvertible L and linear conformations likely related to ribosomal pausing and frameshifting. The 3_6 mutant inhibits this transformation and could hamper frameshifting. Our 3_3 systems exhibit length-dependent stem interactions that point to a potential transition pathway connecting the three motifs during ribosomal elongation. Together, our observations provide new insights into frameshifting mechanisms and anti-viral strategies.

Structural dynamics of single SARS-CoV-2 pseudoknot molecules reveal topologically distinct conformers

Article Open access 06 August 2021

The short isoform of the host antiviral protein ZAP acts as an inhibitor of SARS-CoV-2 programmed ribosomal frameshifting

Article Open access 10 December 2021

Secondary structural ensembles of the SARS-CoV-2 RNA genome in infected cells

Article Open access 02 March 2022

Introduction

In less than three years, COVID-19 through its novel infectious agent SARS-CoV-2 has already caused more than 566 million infections and 6 million deaths worldwide. Although the development of multiple vaccines has provided hope for a post-pandemic world, new virus variants with higher infectivity and increased ability to evade the immune system require us to maintain vigilance. Thus, the identification of novel anti-viral therapeutic targets and development of drugs against them remains a priority.

The single stranded SARS-CoV-2 RNA genome of 29,891 nucleotides includes two overlapping and frame shifted open reading frames ORF1a and 1b, which encode for viral polyproteins that begin the viral protein production. To correctly translate both polypeptides, the virus utilizes programmed –1 ribosomal frameshifting (–1 PRF) to stall and backtrack the ribosome by one nucleotide to bypass the stop codon near the start site of ORF1b.

First discovered in the Rous sarcoma virus in 1985¹, the –1 PRF stalling of the ribosome is associated with a small (<100-nt) RNA frameshifting element². SARS-CoV-2 similarly employs such a frameshifting element (FSE) located at the ORF1a/1b junction. This FSE consists of a 7-nt slippery site and a downstream 77-nt stimulatory region, which typically folds into an H-type pseudoknot (Fig. 1). The functional importance and high conservation of the FSE make it a promising candidate for anti-viral drugs and gene therapy; for example, in the latest Omicron variant, there are 31 new mutations in the spike gene region with respect to the previous variants, but no changes in the FSE (Supplementary Fig. 1)^3,4,5,6. Whether frameshifting is orchestrated by the FSE acting as a road blocker or through more complex conformational switches remains unknown^{7,8,9,10,11,12,13}. Hence, exploring the secondary (2D) and tertiary (3D) structural dynamics of the FSE during translation is essential for both untangling the frameshifting mechanism and developing anti-viral strategies.

**Fig. 1: Secondary structures of the three FSE motifs.**

Unlike the stem-loop structure for HIV-1 FSE¹⁴ or the 2-stem pseudoknot for IBV FSE¹⁵, the assumed structure for SARS-CoV-2 FSE is a 3-stem H-type pseudoknot, where the Stem 1 loop binds the 3\({}^{\prime}\) end to form Stem 2, and Stem 3 lies between them (Fig. 1). This motif has been reported by chemical probing, Cryo-EM, NMR, crystallography^{3,16,17,18,19,20,21,22}, and molecular dynamics (MD)^23,24,25. Using our coarse-grained RNA-As-Graphs (RAG) representation as dual graphs^26,27,28,29, we assign this pseudoknot motif dual graph 3_6 (Fig. 1)^3,24. RAG translates double-stranded stems to vertices and single-stranded loops to edges. We use RAG to identify key RNA motifs, design novel RNA motifs from building blocks, and perform inverse folding to transform one RNA motif into another^{30,31,32,33,34,35}. Recent applications of RAG explored the FSE conformational landscape, including RNA mutations to alter the FSE motif^3,24.

Further studies of the SARS-CoV-2 FSE have revealed a complex conformational landscape, with alternative pseudoknots^3,19,36,37 as well as unknotted structures^{3,18,19,38,39,40,41} (see³ for a detailed comparison). In particular, our prior modeling and SHAPE chemical reactivity experiments reveal an alternative 3-stem H-type pseudoknot where the Stem 1 loop binds with the 5\({}^{\prime}\) end to form a different Stem 2 (3_3 dual graph), and a 3-way junction where the 5\({}^{\prime}\) and 3\({}^{\prime}\) ends pair (3_5 dual graph)³. The three motifs (3_6, 3_3, and 3_5) have common Stems 1 and 3 (though stem lengths vary) but competing Stem 2 (see Fig. 1).

Moreover, our graph and SHAPE studies have emphasized the length dependence of the FSE motifs: for short lengths such as 77-nt without the slippery site, the 3_6 pseudoknot is the dominant motif, and the 3_5 junction is minor; for longer lengths such as 87-nt and 144-nt, conformations containing the 3_6 pseudoknot become minor, while those containing the 3_3 pseudoknot become dominant³. We calculated these FSE length/motif landscapes using partition functions of 2D structures predicted with SHAPE reactivity restraints, where we term a particular conformation 3_6, 3_3, or 3_5 according to the central 77-nt FSE fold motif³. As in other positive-sense RNA viruses^42,43,44, structural transitions among these three (and other possible) motifs likely exist and play an important role in frameshifting.

Here we employ several computational structure prediction programs to build candidate FSE 3D models and analyze microsecond MD trajectories of the three motifs at three lengths: 77, 87, and 144-nt (Fig. 1). We consider the 19 wildtype trajectories studied here as pieces of the heterogeneous FSE landscape during ribosomal translation. We also study our motif-strengthening mutants (experimentally validated in³) that stabilize each motif over the others (11 trajectories). All starting models and 30 MD trajectories are analyzed with respect to known SHAPE data and available experimental structures.

From the 19 wildtype and 11 mutant FSE trajectories, we identify critical structural features and motions. For the 3_6 pseudoknot, we capture both the L and the linear shapes observed by Cryo-EM and crystallography studies^19,20,22,45, with pseudoknot-stabilizing hydrogen bonds. A threaded ring conformation and a structural switch between the L and linear shapes may play a role in ribosomal pausing and frameshifting. Importantly, we can suppress this transition in our sextuple mutant. From the alternative motifs, especially the 3_3 pseudoknot, we find length-dependent stem interactions that suggest a potential FSE transition pathway during ribosomal translation. All these mechanistic insights help suggest frameshifting mechanisms and open new avenues for anti-viral therapy. Namely, small molecules or gene editing mutations in these key regions could hamper frameshifting: 3_6 threading (3\({}^{\prime}\) helix end of Stem 1), structural switch (Stem 2/3 junction), and pseudoknot-stabilizing interactions (hydrogen-bonded triplets near Stem 2).

Results

MD model validation and selection

Using five 3D prediction programs, we create 26 initial models compatible with our 3 wildtype FSE motifs³ (3_6, 3_3, and 3_5) at 3 relevant lengths (77, 87, and 144-nt), as listed in Table 1. An initial check for consistent 2D structures with SHAPE data left 23 viable candidates (Supplementary Table 1), which were then subjected to microsecond MD simulations (see “Methods”). MD trajectory convergence and structure validation tests excluded four more cases, with 19 viable trajectories remaining (numbered by superscripts in Table 1). See detailed analyses in SI (Supplementary Figs. 2-8, and Supplementary Tables 2-4).

Table 1 FSE systems studied in this work. For each motif (3_6, 3_3, and 3_5) and length (77, 87, and 144-nt) combination, we use five 3D prediction programs (R for RNAComposer, S for SimRNA, I for iFoldRNA, V for Vfold3D, and F for Farfar2) to generate starting models.

Full size table

We consider all these trajectories as parts of the heterogeneous FSE conformational landscape, relevant with length-dependent ribosomal interactions. To simplify our presentation below, we select representative systems (marked with asterisks in Table 1) based on multi-trajectory clustering (Fig. 2) and structure validation (Supplementary Table 3). This results in L-shape trajectories 1 and 5 and linear shape trajectories 3 and 6 for 3_6, trajectories 11 and 13 for 3_3, and compact shape trajectory 18 and elongated shape trajectory 19 for 3_5.

**Fig. 2: Clustering analysis and representative structures for the three FSE motifs.**

For the motif-strengthening mutants, we only consider trajectories from prediction programs that correspond to our 11 validated wildtype systems, and representative systems are chosen based on Stem 2 lengths (Table 1, see details in Supplementary Tables 5-6 and Supplementary Figs. 9-14).

Overview of the three fold motifs

In the following sections, we focus on these representative systems (Table 1). For the 3_6 pseudoknot, we highlight a ring formed by Stem 1 strand and the junctions, and relate the 5\({}^{\prime}\) end threading that may impact ribosomal pausing to the L shape model (Fig. 3). We identify hydrogen-bond networks that stabilize the pseudoknot complex (Fig. 3), and compare our models to the experimental structures (Fig. 4)^19,20,22,45.

**Fig. 3: Threaded L and non-threaded linear 3_6 pseudoknot ring conformations.**

**Fig. 4: MD 3_6 structures compared to four experimental structures.**

For the alternative 3_3 pseudoknot and 3_5 junction, we discuss length-dependent flanking stem or triplet formation (3_3) and the Stem 2/3 interactions (3_5) that provide insights into FSE transitions (Fig. 5).

**Fig. 5: Alternative 3_3 and 3_5 motifs.**

Inherent motions and motif-strengthening mutants are discussed in Figs. 6 and 7. Notably, a key structural switch between the L and linear shapes for 3_6 that may send frameshifting signals to the ribosome is suppressed in our mutants.

**Fig. 6: Dynamic analysis of the wildtype 3_6, 3_3, and 3_5 systems.**

**Fig. 7: Comparison of the motif-strengthening mutants with the wildtype systems.**

The combined insights suggest target regions for small-molecule binding and CRISPR gene-editing, as well as a transition pathway connecting the three motifs during ribosomal translation (Fig. 8).

**Fig. 8: Implications of the unraveled structures and motions to anti-viral therapeutics and frameshifting mechanisms.**

L-shaped and linear 3_6 conformations

In two recent Cryo-EM structures, the 3_6 pseudoknot exhibits an L shape with a bent Stem 3 from the co-axial plane of Stems 1 and 2 (Fig. 3)^19,20. Moreover, a ring forms by linking the 3\({}^{\prime}\) strand of Stem 1, and the Stem 1/3 and 2/3 junctions, with the 5\({}^{\prime}\) end threading through the ring hole, which may hamper ribosomal unwinding^19,46.

Here, we observe this threaded L shape in our 77 and 87-nt 3_6 models (Fig. 3, Supplementary Table 4, and Supplementary Fig. 8), stabilized by hydrogen-bond networks. At 77-nt, the Stem 1 loop and the Stem 2/3 junction form a quadruplet to seal the ring top. At 87-nt, another triplet forms between the threaded 5\({}^{\prime}\) end and Stem 1 at the ring bottom. Moreover, the flexible 5\({}^{\prime}\) end folds into a small helix, which is also seen in our SHAPE probing³ and in the Cryo-EM structure¹⁹.

In contrast to this L shape, we also capture linear shape models observed by crystallography^22,45 at all FSE lengths, where the 3 stems stack vertically (Fig. 3). A similar ring forms, but the hole is narrower and the 5\({}^{\prime}\) end prefers being non-threaded by winding around the structure. Other than ring-stabilizing hydrogen bonds, interactions between the 5\({}^{\prime}\) end and Stem 3 help stiffen the junction to maintain the linear shape.

Although 5\({}^{\prime}\) end threading is also captured in our linear models and the crystal structure (PDB: 7MLX)²², it is more prevalent in the L shape (Supplementary Table 4). Such threading likely forms before ring closure, as experiments for the 3_6 pseudoknot suggest that Stem 1 forms first and Stem 2 last during FSE folding⁴⁶. Therefore, it is likely that Stem 3 bending (L shape) is associated with threading to avoid steric clashes.

In both the L and the linear shapes, multiple hydrogen bonds act to stabilize the 3_6 pseudoknot complex (Fig. 3). In the 77-nt L shape, the Stem 1/2 and 2/3 junction residues interact to form a short triplex, which is further extended by binding with Stem 2. This triplex stabilizes the loose junctions and links the 3\({}^{\prime}\) end tightly near the Stem 1 loop to maintain the pseudoknot.

In the 77-nt linear shape, Stems 1 and 2 are longer than those in the L shape, but similar triplets are found at the Stem 2 junctions (Fig. 3). These triplets are also seen in the crystal structures^22,45, indicating their importance in stabilizing the pseudoknot. Another triplet forms between Stem 2 and the 3\({}^{\prime}\) end, anchoring the flexible 3\({}^{\prime}\) end.

Comparisons to experimental structures

Comparing our L shape models to the two Cryo-EM structures^19,20, we see global structural similarity in Stems 1 and 2 stacking (Fig. 4). The 88-nt Cryo-EM structure (PDB: 6XRZ, resolution 6.9 Å)¹⁹ has a wider ring than our 87-nt model, possibly due to a more bent Stem 3. Its 5\({}^{\prime}\) end helix is shifted and shorter, and is further away from Stem 3. The 77-nt Cryo-EM mRNA-ribosome complex (PDB: 7O7Z, resolution 5-7 Å)²⁰ has slightly longer Stems 1 and 3 than our 77-nt model, and its 5\({}^{\prime}\) end is pulled by the ribosome.

The two crystal structures (PDB: 7LYJ and 7MLX, resolution 2.1 Å)^22,45 align well with our 77-nt linear shape model (3-4 Å RMSD). Stem 3 loop is more stretched in the crystal structures, probably because associated residues were mutated to avoid dimerization. Evidence of threading exists in the Roman et al. crystal structure (PDB: 7MLX) but not in the Jones et al. structure (PDB: 7LYJ). Consistent with our comments above, a wider ring hole accompanies the threading to avoid steric clashes.

Overall, our independently developed yet well aligned 3_6 MD structures provide credibility for the following alternative structure modeling.

FSE transitions suggested by length-dependent 3_3 pseudoknot interactions

In our SHAPE experiments, the dominant motif in the FSE landscape shifts from 3_6 to 3_3 pseudoknot when the sequence length increases from 77-nt to 87 and 144-nt³. This alternative 3_3 pseudoknot contains a different Stem 2 formed by the Stem 1 loop and the 5\({}^{\prime}\) end. At 77-nt, Stem 2 is short with 3 base pairs; at 87 and 144-nt, upstream residues form 2 additional base pairs for Stem 2, and also a flanking stem SF with the 3\({}^{\prime}\) end to further seal the conformation (Fig. 5, more details in Supplementary Figs. 8 and 15).

Consistently, a clear jump occurs for 3_3 Stem 2 hydrogen bond number, when the length increases from 77 to 87-nt, resulting in a stronger Stem 2 of 3_3 than that of 3_6 (Supplementary Fig. 16). A similar trend is observed from the stem interaction energy (Supplementary Fig. 17).

These length-dependent interactions suggest potential motif transitions during ribosomal translation and RNA refolding. Indeed, our 77-nt 3_3 model is identified as an intermediate structure, in which the 3\({}^{\prime}\) end residues U74 and U75 form two triplets with 3_3 Stem 2 (Fig. 5). In 3_6, the same 3\({}^{\prime}\) end residues pair with Stem 1 loop (A20) to form Stem 2; in 3_5, they pair with the 5\({}^{\prime}\) end (G2 and G1) to form Stem 2. Hence, Stem 2 interactions in all three motifs co-exist in our 77-nt 3_3 model, potentially making this state a starting conformation for a transition from 3_3 to 3_6 or 3_5.

For the 87-nt 3_3 systems, the flanking stem SF formed by the 5\({}^{\prime}\) and 3\({}^{\prime}\) ends blocks Stem 2 of 3_6 and 3_5, and the hydrogen bonding between residue U86 and the Stem 3 base pair C72-G49 keeps the 3\({}^{\prime}\) end away from Stem 2 (Fig. 5). In our 144-nt models, additional stems form to avoid the mixed Stem 2 triplets (Supplementary Figs. 8 and 15). Hence, all these interactions, especially stem SF, must be unwound by the ribosome before the 3\({}^{\prime}\) end is free to form Stem 2 of 3_6 or 3_5.

Together, these insights suggest the following transition pathway during ribosomal elongation: when the ribosome is far away from the FSE 5\({}^{\prime}\) end, the flanking stem SF favors 3_3, but as the ribosome moves to occlude the slippery site, SF is unwound to allow formation of alternative Stem 2, and a transition to 3_6 or 3_5 occurs.

Elongated and compact 3-way 3_5 shapes

The 3_5 3-way junction is a minor motif seen in our SHAPE experiments at 77-nt³, where the 5\({}^{\prime}\) and 3\({}^{\prime}\) ends base pair to form Stem 2. In our MD simulations, we capture both an elongated and a compact 3_5 conformations (Fig. 5).

In the elongated model, Stems 1 and 2 are co-axially stacked, and Stem 3 is longer. A hydrogen-bond network is formed by 5 residues at the 3-way junction to stabilize the helical arrangements, so that Stem 2 would remain around Stem 3 and not interact with Stem 1 loop to form alternatives.

In the compact model, Stems 1 and 3 are stacked, and the Stem 3 loop bends towards the groove of Stem 2. Similar hydrogen bonds at the junction secure the Stem 2 orientation away from Stem 1. These hydrogen bonds must be broken to allow transition to another motif.

Dominant motions in 3_6: L to linear shape transitions

Using principal component analysis (PCA), we find that the linear 3_6 models are rather stable, while the L models switch between the two shapes, via bending of Stem 3 (Fig. 6, Supplementary Fig. 18). As the two Cryo-EM structures both capture the L shape and the two crystal structures both exhibit the linear shape, we speculate that the FSE is highly dynamic and may switch between these conformations during frameshifting.

Meanwhile, the pseudoknot complex (Stems 1 and 2) and the threaded ring conformation are maintained throughout this motion, so does the ring-holding triplet at bottom.

Consistent with the above motions, we see a peak in the 3_6 root mean square fluctuations (RMSF) in the Stem 3 loop region for all lengths (Fig. 6). The RMSF, the average number of hydrogen bonds (H-bond), and the interaction energies all indicate that Stem 1 is the strongest, followed by Stem 3, and lastly Stem 2 (Supplementary Figs. 16 and 17).

Stretching/bending motions in 3_3

The 3_3 pseudoknot’s dominant motion is a combined contraction and stretching caused by the bending of 3\({}^{\prime}\) end and Stem 3 loop (Fig. 6, Supplementary Fig. 18). In this motion, Stems 1 and 2, especially triplets that contain interactions from all three Stem 2 (purple and red residues in Fig. 6), are intact and move in unison. That these triplets are not transient suggests that this may be a stable intermediate during the FSE translation pathway directed by the elongating ribosome.

Comparing to 3_6, we see a higher RMSF peak value in the 3_3 Stem 3 loop region, and more fluctuations in 3_3 Stem 1 region due to the pseudoknot bending.

Stem 3 twisting in 3_5

For the 3_5 junction, both the elongated and the compact models experience bending of Stem 1 loop and twisting of Stem 3 (Fig. 6, Supplementary Fig. 18). As a result, the structure becomes more compact, and Stem 2 is closer to Stem 3. All the hydrogen bonds that lock the Stem 2 orientation (Fig. 5) are maintained, so that a stable 3_5 motif remains. Peak RMSF in the loop regions, and low values in the 5\({}^{\prime}\) and 3\({}^{\prime}\) ends are notable.

Overall, all three conformations have stable Stem 1, flexible Stem 3 loop, and relatively stable Stem 2 regions. The triplets and hydrogen bonds are mostly maintained throughout the simulations, and this helps stabilize key features such as the ring of 3_6 and the combined Stem 2 interactions in 3_3.

Minimal mutations to stabilize the 3_6 linear shape

Our predicted mutations confirmed by SHAPE probing in our previous study were designed to suppress conformational transitions and stabilize specific motifs over all alternatives, for the 77 and 144-nt 3_6 pseudoknot, 77-nt 3_3 pseudoknot, and 77-nt 3_5 junction (Table 1)^3,24. Our dynamics analyses below of these mutants compared to the wildtype trajectories help interrogate the mechanisms and consequences of structural stability.

The 6 mutations in the 77-nt 3_6 pseudoknot-strengthening mutant (PSM) include 4 mutations ([G18A, C19A, C68A, A69C]) that lengthen Stem 2 by up to 4 base pairs (Table 2) and 2 mutations at the 5\({}^{\prime}\) end to exclude alternative 3_3 and 3_5 Stem 2. Comparing the mutant with longest Stem 2 (9 base pairs) to its corresponding wildtype model, we observe a dramatic transformation from L shape (wildtype) to linear shape (Fig. 7). Indeed, all 3_6 mutant systems adopt this linear shape (Supplementary Table 6, Supplementary Fig. 14), and the structural switch between the two shapes has been suppressed in most systems (Supplementary Fig. 19).

Table 2 Comparison of the motif-strengthening mutants and the wildtype systems.

Full size table

For the 144-nt 3_6 PSM, one additional mutation in the downstream region suppresses formation of competing stems³. The central 3_6 pseudoknot region aligns well between the wildtype and mutant systems, both adopting the linear shape (Fig. 7). The major difference occurs in the upstream region: in the wildtype, upstream and downstream stems form on the same side of the central 3_6 pseudoknot; in the mutant, they are on different sides, due to our [G40U, U41A] mutations. From PCA, we see a relatively stable central 3_6 pseudoknot, while quite flexible upstream and downstream stems in the mutant (Supplementary Fig. 19).

As both our 77 and 144-nt 3_6 mutants adopt linear conformations, we hypothesize that this may be a more stable conformation, by separating the 5\({}^{\prime}\) and 3\({}^{\prime}\) ends further away from each other to avoid alternative 3_3 and 3_5 Stem 2.

3_3 mutant to eliminate alternative Stem 2 interactions

In our 77-nt 3_3 PSM, a large increase of Stem 2 length from 3 to 7 base pairs is induced by mere three mutations [U4C, G71A, G72U] (Table 2, Supplementary Fig. 14). The first mutation enhances the 3_3 Stem 2 and the others avoid alternative 3_6 and 3_5 motifs. The main structural changes are a vertical 5\({}^{\prime}\) end between the Stem 1 loop and helix instead of staying horizontal below, compact Stems 1 and 2, and elimination of triplets formed by the 3\({}^{\prime}\) end with Stem 2 (Fig. 7). Hence, our mutations stabilize the 3_3 motif. The dominant motion occurs in the Stem 3 region (Supplementary Fig. 19).

Elongated 3_5 mutant

Our 77-nt 3_5 mutant with only 2 mutations [G72C, U74C] also enjoys a considerable enhancement of Stem 2 from 3-4 base pairs to 6-7 (Table 2). Though Stem 2 remains close to Stem 3, the loop region of Stem 3 now stretches out, leading to an elongated conformation (Fig. 7). Moreover, the co-axial stacking changes from Stems 1 and 3 in the wildtype to Stems 1 and 2 in the mutant. Indeed, all four 3_5 mutant models adopt a stacking of Stems 1 and 2, and all resemble the elongated conformation except one (Supplementary Fig. 14). The dominant motion is bending of Stem 1 and 3 loops (Supplementary Fig. 19).

Overall, our enhanced Stem 2 in the three mutants leads to dramatic structural changes, especially for the 77-nt 3_6 and 3_5 systems. PCA analysis reveals stabilization of the linear shape in 3_6 PSM. For the 77-nt 3_3 mutant, triplets associated with possible structural transitions are eliminated.

Discussion

From our 30 molecular dynamics trajectory ensemble of SARS-CoV-2 FSE systems, we can begin to piece together aspects of the complex conformational landscape. In particular, our simulations extend beyond 3D structure models for the prevalent 3_6 pseudoknot in the literature^{3,16,17,18,19,20,21,23,24,37,47}, by providing the first viable 3D models for the alternative 3_3 pseudoknot and 3_5 junction, as well as the motif-strengthening mutants³. Moreover, guided by our prior SHAPE probing, our analyses of the 3 FSE motifs at different lengths help examine the conformational changes that might occur during ribosomal translation.

Our 3_6 models exhibit two distinct conformations: a threaded L shape that resembles the Cryo-EM structures^19,20, and a non-threaded linear shape that aligns well with the crystal structures^22,45 (Figs. 3 and 4). An interconversion between the L and linear shapes can be achieved by bending of Stem 3, as revealed by our PCA motion analysis (Fig. 6). Moreover, our mutants stabilize the linear shape and suppress the interconversion (Fig. 7). Importantly, our 3_3 models show length-dependent stem interactions: an intermediate structure containing base triplets from all three alternate Stem 2 at 77-nt, while a classic 3_3 stabilized by flanking stem SF at 87 and 144-nt (Fig. 5). Our 3_3 triple mutant successfully lengthens Stem 2 and eliminates these triplets.

The combined insights suggest the following FSE structural transition pathway relevant to ribosomal translation (Fig. 8). When the ribosome is far away from the FSE region, the dominant conformation is a 3_3 with stem SF. As the ribosome approaches and occludes the slippery site, stem SF is unwound, and the 3\({}^{\prime}\) end moves to the 3_3 Stem 2 region to form triplets, initiating the structural transition to 3_6 or 3_5. In this step, the ring starts to close at the Stem 1/3 and 2/3 junctions, and the 5\({}^{\prime}\) end can either thread through the ring hole or wind around Stem 3. When the ribosome further elongates, the 5\({}^{\prime}\) end becomes completely occluded, and only 3_6 remains viable. If the 5\({}^{\prime}\) end is threaded, to avoid steric clashes, Stem 3 would bend to widen the ring hole; if non-threaded, the 5\({}^{\prime}\) end would interact with Stem 3 to stiffen the junction and hold the linear shape (Fig. 3).

Such length-varying considerations of RNA are relevant to ribosomal translation and co-transcription in general, where the RNA can fold into different transient structures to accomplish various functions^48,49. For SARS-CoV-2, this FSE transition pathway may be associated with regulatory functions. The timescale at which the transitions occur depends on the scale of conformational rearrangements. Base pairing or tertiary structure changes occur on microsecond to second range. Major interconversions between secondary structures occur on millisecond and longer⁵⁰. Given that the ribosome pauses ~2.8s between translocations⁵¹, this time allows for the structural switches and transitions discussed here to occur. Enhanced sampling simulations are required to probe such transitions.

The biophysical insights from our work also suggest three general therapeutic approaches using small-molecule binding and CRISPR gene-editing (Fig. 8). The first anti-viral strategy is to alter the 3_6 pseudoknot plasticity. By mutating residues that form the pseudoknot-stabilizing hydrogen bonds (Stem 1/2 and 2/3 junctions, Fig. 3), we can further strengthen or destroy the pseudoknot. Since conformational plasticity has a large impact on frameshifting efficiency⁹, this should interrupt the frameshifting process. Indeed, Bhatt et al. achieve a significant reduction in frameshifting efficiency by mutating these junctions²⁰. In our prior SHAPE probing, mutations in this region modify the conformational landscape to 100% 3_6³. Both studies underscore the sensitivity of the 3_6 pseudoknot and its associated frameshifting to these junctions, which define good targets for CRISPR gene-editing.

The second approach is to strengthen the 5\({}^{\prime}\) end threading in the 3_6 ring conformation. A higher unfolding force is required when threading exists⁴⁶, so strengthening the threading may increase the mechanical barrier for translation. Recently, two alkaloids (emetine and cephaeline) predicted to bind the threading initiation site were found to inhibit SARS-CoV-2 viral replication⁵². Hence, the 3\({}^{\prime}\) helix end of Stem 1, which we find to seal the ring and initiate threading, defines a target binding region to impede ribosomal translation (Fig. 8).

The third approach is to target the 3_6 pseudoknot structural switch between the L and linear shapes. In the mRNA-ribosome Cryo-EM structure captured during translation²⁰, the L shape 3_6 wedges at the mRNA entry channel and resists unwinding by the helicase, which generates tension on the upstream mRNA²⁰. This structural switch might then enhance fluctuations of this tension and send frameshifting signals to the ribosome. When switching from the L to linear shape, residues in the Stem 2/3 junction are exposed (Fig. 6); small molecules like MTDB^10,53 can thus block the switch and hamper frameshifting. Another option is to deploy our 3_6 mutant, which assures a stabilized linear shape (Fig. 7).

In sum, by analyzing the hydrogen bonding interactions and motions of different 3_6 systems, we offer three strategic anti-viral targeting regions: the 3\({}^{\prime}\) helix end of Stem 1 and Stem 1/2 and 2/3 junction residues (Fig. 8). Although several drugs/small molecules have been shown to inhibit SARS-CoV-2 frameshifting, including MTDB^17,53,54, alkaloids⁵², Merafloxacin⁵⁵, Ivacaftor, and Huperzine A⁵⁶, they are mainly found by high-throughput drug screening, so the underlying inhibition mechanism is unexplained and, in some cases, the binding regions are unknown. Our targeting regions above emerged from mechanistic considerations.

Of course, like every computational approach, there are inherent limitations and approximations in the modeling and simulations performed here. The SHAPE experiments used to deduce the 3 conformers cannot directly reveal the 2D structure, but only provide nucleotide reactivities as restraints for guiding the 2D structure predictions. Our MD trajectories are started from predicted 3D models, but each model is anchored to SHAPE, Cryo-EM, and/or crystallographic data, rendering the results credible (see Fig. 4 and SI). Though microsecond simulations are near the state-of-the-art, each trajectory provides only a local sampling of the complex, multidimensional thermodynamic space. Enhanced sampling simulations could be used to further unravel the complex rugged landscape of the FSE. While we used simple ionic environments to avoid biases, a more extensive investigation with other ions like Mg²⁺ and K⁺ is warranted. Nevertheless, taken together, the 30 trajectories here contribute to an emerging view of the FSE conformational landscape as the ribosome elongates.

Our methods and analyses also extend to other viruses and diseases more broadly. mRNA-based therapeutics have already demonstrated success in vaccines and drugs to treat viral diseases and cancers, with the advantage of fast production and flexible design^57,58,59. Highly-conserved functional RNAs like the frameshifting element define good drug targets. With our continuously evolving computational and experimental toolkit for investigating RNA systems at increasing complexity, biophysical approaches will continue to contribute to disease diagnostics and treatment.

Methods

RAG notation and mutations

In our RNA-As-Graphs (RAG) framework, RNA secondary structures containing pseudoknots are represented as dual graphs²⁶. Each stem (≥2 base pairs) denotes a vertex, and every single strand or loop is an edge (hairpins are self-loops; 1-nt bulges, internal loops with two 1-nt strands, and dangling ends are ignored). Every non-isomorphic dual graph is assigned an identifier V_n, where V is the vertex number and n is a unique motif identifier. Our dual graph library consists of over 100,000 unique dual graphs for 2-9 vertices²⁹.

To design RNAs with minimal mutations that make the FSE fold in silico onto a target dual graph, we developed our inverse folding program for dual graphs Dual-RAG-IF^24,35. For manually selected mutation regions and a target 2D structure, Dual-RAG-IF uses a genetic algorithm to generate a pool of candidate RNA sequences with mutations. These candidates are screened by 2D prediction programs to ensure the correct graph folding, and are optimized for minimal mutations. Detailed design of the mutants is described in^3,24.

FSE lengths and conformations

We model the FSE structure at three sequence lengths: 77-nt without the 7-nt slippery site, 87-nt with the slippery site plus 3 additional residues at the \({5}^{\prime}\) end, and 144-nt with the slippery site plus 30 additional residues at each end. We perform MD simulations for all three conformations for the 77-nt FSE. (Even though the 3_3 pseudoknot was not observed at this length, we study it for comparison with other lengths.) For 87 and 144-nt, we model the 3_6 and the 3_3 conformations, with additional stems formed by the upstream and downstream nucleotides (Fig. 1).

Besides wildtype FSEs, we also model four motif-strengthening mutants predicted previously^3,24: 77-nt 3_6 PSM with 6 mutations [G3U, U4A, G18A, C19A, C68A, A69C], 144-nt 3_6 PSM with an additional mutation C137A, 77-nt 3_3 PSM with 3 mutations [U4C, G71A, G72U], and 77-nt 3_5 mutant with 2 mutations [G72C, U74C].

2D and 3D FSE structures

The 2D structure of the wildtype 77-nt 3_6 pseudoknot is predicted by PKNOTS⁶⁰, and all other 2D conformations are modeled by ShapeKnots from RNAstructure package with SHAPE reactivities incorporated^3,61.

Corresponding 3D structures are predicted, with the sequences and the 2D structures as input using RNAComposer⁶², Vfold3D⁶³, SimRNA⁶⁴, and iFoldRNA⁶⁵ for 77 and 87-nt, and RNAComposer, iFoldRNA, and Farfar2⁶⁶ for 144-nt, as SimRNA and Vfold3D failed to produce models for this length (see Table 1). For 3D structure prediction programs that give multiple structures as output, the first structure that retained the correct motif is selected as the initial 3D model.

Initial 3D model validation

We extract 2D structures of the initial predicted models using 3DNA-DSSR⁶⁷. If the central 77-nt FSE region does not fold into the correct motif (3_6, 3_3, or 3_5), the model is rejected. We also calculate the Hamming distance between each model’s 2D structure and the input (SHAPE) 2D structure. Models with Hamming distances >10 are rejected.

Molecular dynamics details

We use Gromacs 2020.3 and 2020.4⁶⁸, with the Amber OL3 forcefield⁶⁹. The systems are solvated in the cubic box with TIP3P water model, with a buffer of 10 Å from the RNA molecule⁷⁰. The systems are first neutralized with sodium ions and set to a 0.1M NaCl bulk concentration with additional Na⁺ and Cl⁻ ions. The systems are energy minimized via steepest descent and equilibrated under NVT (300 K) and NPT (1 bar and 300 K) ensembles for 100 ps each. Simulations are run with a timestep of 2 fs and a SHAKE-like LINCS algorithm⁷¹ with constraints on all atom bonds. The Particle Mesh Ewald method⁷² is used to treat long-range electrostatics. Production runs are performed for 1 ~ 1.5 μs under NPT to ensure stable RMSD. Structures from the last 500 ns of each simulation are used for analysis.

Clustering is performed on frames every 200 ps for RNA non-H backbone atoms, using the Gromos clustering method with 2, 2.5, 3, and 3.5 Å cutoffs. The largest cluster center structures (cutoff of 2.5 Å for 77-nt and 87-nt systems or 3.5 Å for 144-nt systems) are extracted from MD simulations to show and analyze in Results and Supplementary Information. The cutoffs are chosen to ensure that all simulations for the same dual graph topology produce a feasible number of clusters with outlier structures excluded. See Supplementary Fig. 7 for more details.

PCA motion analysis (on structures every 250 ps), whole system density, calculations of Rg, RMSF, interaction energy (sum of short-term Lennard-Jones and Coulomb interactions) between the two strands within each stem, and the number of hydrogen bonds in each stem are performed via Gromacs 2020.3⁶⁸. The 2D structures are extracted using 3DNA-DSSR⁶⁷. To compare different MD trajectories of 77-nt 3_6 or 3_5, multi-trajectory clustering is conducted using structures extracted from the last 500 ns simulations by R package bio3d⁷³.

All microsecond MD simulations were conducted on the Prince or Greene supercomputer clusters at the New York University High Performance Computing facilities. Each compute node in the Prince cluster is equipped with two Intel Xeon E5-2690v4 2.6 GHz CPUs (Broadwell, 14 cores/socket, 28 cores/node) and 125 GB memory. Each simulation is performed with seven to eight dedicated nodes (i.e., 196-224 cores), so the simulations complete in 7-10 days. Each compute node in the Greene clusters is equipped with two Intel Xeon Platinum 8268 24C 205W 2.9 GHz CPUs with 48 cores/node and 192 GB memory. Each simulation is performed with 30 nodes using 32 cores each, so that the simulations complete in 2-4 days.

MD trajectory validation

To monitor convergence of the MD trajectories, the system density is examined for NPT ensemble. Next, all simulations are examined for a steady state RMSD⁷⁴ plateau, and those with large RMSD fluctuations are extended until reaching a steady state over the last 500 ns. To further evaluate structural convergence, time series of Rg and eRMSD are calculated via Gromacs⁶⁸ and Barnaba⁷⁵, respectively. The evolution of hydrogen bonds over MD simulations is also used to assess structural stability every 100 ns. The cumulative number of hydrogen bonds is plotted against residue number. It adopts a mountain-like shape, where the number increases as new hydrogen bonds form in the 5\({}^{\prime}\) strand of the stem and decreases in the 3\({}^{\prime}\) strand.

All stable MD trajectories are further validated for 2D and 3D structures. For wildtype systems, we monitor graph motifs, 2D structures, and 3D clashes. Clashscores are calculated as the number of serious steric clashes per 1000 atoms using MolProbity⁷⁶. Models having wrong motifs are rejected, and those having Hamming distances >10 or clashscores >5 are tagged. In addition, all the 3_6 models are aligned to the available 4 experimental structures using PyMol⁷⁷align, with RMSD calculated. To validate the mutant systems, we check both the graph motifs and the 2D structures.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Complete data are present in the paper and/or associated Supplementary Materials. Additional information including initial 3D prediction models and validated trajectory cluster centers are shared in Zenodo (https://doi.org/10.5281/zenodo.6625172).

Experimental RNA structures analysed in this paper are available in the Protein Data Bank under accession codes 6XRZ, 7O7Z, 7LYJ, and 7MLX.

Code availability

The codes used to discover motif-strengthening mutations are available in the GitHub Schlicklab repository (https://github.com/Schlicklab/Dual-RAG-IF).

References

Jacks, T. & Varmus, H. Expression of the rous sarcoma virus pol gene by ribosomal frameshifting. Science 230, 1237–1242 (1985).
Article ADS CAS PubMed Google Scholar
Brierley, I. et al. An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV. EMBO J. 6, 3779–3785 (1987).
Article CAS PubMed PubMed Central Google Scholar
Schlick, T. et al. To knot or not to knot: Multiple conformations of the SARS-CoV-2 frameshifting RNA element. J. Amer. Chem. Soc. 143, 11404–11422 (2021).
Article CAS Google Scholar
Kelly, J., Woodside, M. & Dinman, J. Programmed − 1 ribosomal frameshifting in coronaviruses: A therapeutic target. Virology 554, 75–82 (2021).
Article CAS PubMed Google Scholar
Dinman, J., Ruiz-Echevarria, M., Czaplinski, K. & Peltz, S. Peptidyl-transferase inhibitors have antiviral properties by altering programmed − 1 ribosomal frameshifting efficiencies: Development of model systems. Proc. Nat. Acad. Sci., USA 94, 6606–6611 (1997).
Article ADS CAS Google Scholar
Kinzy, T. et al. New targets for antivirals: The ribosomal A-site and the factors that interact with it. Virology 300, 60–70 (2002).
Article CAS Google Scholar
Lopinski, J., Dinman, J. & Bruenn, J. Kinetics of ribosomal pausing during programmed − 1 translational frameshifting. Mol. Cell. Biol. 20, 1095–1103 (2000).
Article CAS PubMed PubMed Central Google Scholar
Namy, O., Moran, S., Stuart, D., Gilbert, R. & Brierley, I. A mechanical explanation of rna pseudoknot function in programmed ribosomal frameshifting. Nature 441, 244–247 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Ritchie, D., Foster, D. & Woodside, M. Programmed − 1 frameshifting efficiency correlates with RNA pseudoknot conformational plasticity, not resistance to mechanical unfolding. Proc. Nat. Acad. Sci., USA 109, 16167–16172 (2012).
Article ADS CAS Google Scholar
Ritchie, D., Soong, J., Sikkema, W. & Woodside, M. Anti-frameshifting ligand reduces the conformational plasticity of the SARS virus pseudoknot. J. Amer. Chem. Soc. 136, 2196–2199 (2014).
Article CAS Google Scholar
Kim, H. et al. A frameshifting stimulatory stem loop destabilizes the hybrid state and impedes ribosomal translocation. Proc. Nat. Acad. Sci. USA 111, 5538–5543 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Chen, J. et al. Dynamic pathways of − 1 translational frameshifting. Nature 512, 328–332 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Caliskan, N., Katunin, V., Belardinelli, R., Peske, F. & Rodnina, M. Programmed − 1 frameshifting by kinetic partitioning during impeded translocation. Cell 157, 1619–1631 (2014).
Article CAS PubMed PubMed Central Google Scholar
Parkin, N., Chamorro, M. & Varmus, H. Human immunodeficiency virus type 1 gag-pol frameshifting is dependent on downstream mRNA secondary structure: demonstration by expression in vivo. Virol. J. 66, 5147–5151 (1992).
Article CAS Google Scholar
Brierley, I., Digard, P. & Inglis, S. Characterization of an efficient coronavirus ribosomal frameshifting signal: Requirement for an RNA pseudoknot. Cell 57, 537–547 (1989).
Article CAS PubMed PubMed Central Google Scholar
Wacker, A. et al. Secondary structure determination of conserved SARS-CoV-2 RNA elements by NMR spectroscopy. Nucleic Acids Res. 48, 12415–12435 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kelly, J. et al. Structural and functional conservation of the programmed − 1 ribosomal frameshift signal of SARS coronavirus 2 (SARS-CoV-2). J. Biol. Chem. 295, 10741–10748 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lan, T. et al. Secondary structural ensembles of the Sars-Cov-2 RNA genome in infected cells. Nat. Commun. 13, 1128–1128 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Zhang, K., Zheludev, I., Hagey, R. & Haslecker, R. et al. Cryo-EM and antisense targeting of the 28-kDa frameshift stimulation element from the SARS-CoV-2 RNA genome. Nat. Struct. Mol. Biol. 28, 747–754 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bhatt, P. et al. Structural basis of ribosomal frameshifting during translation of the SARS-CoV-2 RNA genome. Science 372, 1306–1313 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Ziv, O. et al. The short- and long-range RNA-RNA interactome of SARS-CoV-2. Mol. Cell 80, 1067–1077.e5 (2020).
Article PubMed PubMed Central CAS Google Scholar
Roman, C., Lewicka, A., Koirala, D., Li, N. & Piccirilli, J. The SARS-CoV-2 programmed − 1 ribosomal frameshifting element crystal structure solved to 2.09 Å using chaperone-assisted RNA crystallography. ACS Chem. Biol. 16, 1469–1481 (2021).
Article CAS PubMed PubMed Central Google Scholar
Omar, S. et al. Modeling the structure of the frameshift-stimulatory pseudoknot in SARS-CoV-2 reveals multiple possible conformers. PLOS Comput. Biol. 17, e1008603 (2021).
Article CAS PubMed PubMed Central Google Scholar
Schlick, T., Zhu, Q., Jain, S. & Yan, S. Structure-altering mutations of the SARS-CoV-2 frameshifting RNA element. Biophys. J. 120, 1040–1053 (2021).
Article ADS CAS PubMed Google Scholar
Rangan, R. et al. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res. 49, 3092–3108 (2021).
Article CAS PubMed PubMed Central Google Scholar
Gan, H. et al. RAG: RNA-As-Graphs database–concepts, analysis, and features. Bioinformatics 20, 1285–1291 (2004).
Article CAS PubMed Google Scholar
Zahran, M., Bayrak, C., Elmetwaly, S. & Schlick, T. RAG-3D: a search tool for RNA 3D substructures. Nucleic Acids Res. 43, 9474–9488 (2015).
Article CAS PubMed PubMed Central Google Scholar
Baba, N., Elmetwaly, S., Kim, N. & Schlick, T. Predicting large RNA-like topologies by a knowledge-based clustering approach. J. Mol. Biol 428, 811–821 (2016).
Article CAS PubMed Google Scholar
Jain, S., Saju, S., Petingi, L. & Schlick, T. An extended dual graph library and partitioning algorithm applicable to pseudoknotted rna structures. Methods 162, 74–84 (2019).
Article PubMed CAS Google Scholar
Jain, S., Bayrak, C., Petingi, L. & Schlick, T. Dual graph partitioning highlights a small group of pseudoknot-containing RNA submotifs. Genes 9, 371 (2018).
Article PubMed Central CAS Google Scholar
Schlick, T. Adventures with RNA graphs. Methods 143, 16–33 (2018).
Article CAS PubMed PubMed Central Google Scholar
Jain, S. & Schlick, T. F-RAG: Generating atomic models from RNA graphs using fragment assembly. J. Mol. Biol. 429, 3587–3605 (2017).
Article CAS PubMed PubMed Central Google Scholar
Jain, S., Laederach, A., Ramos, S. & Schlick, T. A pipeline for computational design of novel RNA-like topologies. Nucleic Acids Res. 46, 7040–7051 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zhu, Q. & Schlick, T. A Fiedler vector scoring approach for novel RNA motif selection. J. Phys. Chem. 125, 1144–1155 (2021).
Article CAS Google Scholar
Jain, S., Tao, Y. & Schlick, T. Inverse folding with RNA-as-graphs produces a large pool of candidate sequences with target topologies. J. Struct. Biol. 209, 107438 (2020).
Article CAS PubMed Google Scholar
Huston, N. et al. Comprehensive in vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms. Mol. Cell 81, 584–598.e5 (2021).
Article PubMed PubMed Central CAS Google Scholar
Trinity, L., Wark, I., Lansing, L., Jabbari, H. & Stege, U. Shapify: Pathways to SARS-CoV-2 frameshifting pseudoknot. Research Square, doi: 10.21203/rs.3.rs-1370718/v1, preprint posted March 2022 .
Manfredonia, I. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ahmed, F. et al. A comprehensive analysis of cis-acting RNA elements in the SARS-CoV-2 genome by a bioinformatics approach. Front. Genet. 11, 1385 (2020).
Article CAS Google Scholar
Andrews, R. et al. A map of the SARS-CoV-2 RNA structurome. NAR Genom. Bioinform. 3, lqab043 (2021).
Article PubMed PubMed Central CAS Google Scholar
Iserman, C. et al. Genomic RNA elements drive phase separation of the SARS-CoV-2 nucleocapsid. Mol. Cell 80, 1078–1091 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kuhlmann, M., Chattopadhyay, M., Stupina, V., Gao, F. & Simon, A. An RNA element that facilitates programmed ribosomal readthrough in Turnip Crinkle Virus adopts multiple conformations. Virol. J. 90, 8575–8591 (2016).
Article CAS Google Scholar
Moomau, C., Musalgaonkar, S., Khan, Y., Jones, J. & Dinman, J. Structural and functional characterization of programmed ribosomal frameshift signals in West Nile virus strains reveals high structural plasticity among cis-acting RNA elements. J. Biol. Chem. 291, 15788–15795 (2016).
Article CAS PubMed PubMed Central Google Scholar
Houck-Loomis, B., Durney, M. & Salguero, C. et al. An equilibrium-dependent retroviral mRNA switch regulates translational recoding. Nature 480, 561–564 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Jones, C. & Ferre-D’amare, A. Crystal structure of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) frameshifting pseudoknot. RNA 28, 239–249 (2022).
Article CAS PubMed Google Scholar
Neupane, K., Zhao, M. & Lyons, A. et al. Structural dynamics of single SARS-CoV-2 pseudoknot molecules reveal topologically distinct conformers. Nat. Commun. 12, 4749 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Rangan, R. et al. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA 26, 937–959 (2020).
Article CAS PubMed PubMed Central Google Scholar
Lai, D., Proctor, J. & Meyer, I. On the importance of cotranscriptional RNA structure formation. RNA 19, 1461–1473 (2013).
Article CAS PubMed PubMed Central Google Scholar
Feng, S. et al. Alternate rRNA secondary structures as regulators of translation. Nat. Struct. Mol. Biol. 18, 169–176 (2011).
Article CAS PubMed Google Scholar
Mustoe, A., Brooks, C. & Al-Hashimi, H. Hierarchy of RNA Functional Dynamics. Annu. Rev. Biochem. 83, 441–466 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wen, J. et al. Following translation by single ribosomes one codon at a time. Nature 452, 598–603 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Ren, P., Shang, W. & Yin, W. et al. A multi-targeting drug design strategy for identifying potent anti-SARS-CoV-2 inhibitors. Acta Pharmacol. Sin. 43, 483–493 (2022).
Article CAS PubMed Google Scholar
Park, S., Kim, Y. & Park, H. Identification of RNA pseudoknot-binding ligand that inhibits the − 1 ribosomal frameshifting of SARS-coronavirus by structure-based virtual screening. J. Amer. Chem. Soc. 133, 10094–10100 (2011).
Article CAS Google Scholar
Neupane, K. et al. Anti-frameshifting ligand active against SARS coronavirus-2 is resistant to natural mutations of the frameshift-stimulatory pseudoknot. J. Mol. Biol. 432, 5843–5847 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sun, Y. et al. Restriction of SARS-CoV-2 replication by targeting programmed − 1 ribosomal frameshifting. Proc. Natl. Acad. Sci. USA 118, e2023051118 (2021).
Article CAS PubMed PubMed Central Google Scholar
Chen, Y. et al. A drug screening toolkit based on the − 1 ribosomal frameshifting of SARS-CoV-2. Heliyon 6, e04793 (2020).
Article PubMed PubMed Central Google Scholar
Desterro, J., Bak-Gordon, P. & Carmo-Fonseca, M. Targeting mRNA processing as an anticancer strategy. Nat. Rev. Drug Discov. 19, 112–129 (2020).
Article CAS PubMed Google Scholar
Fiedler, K., Lazzaro, S., Lutz, J., Rauch, S. & Heidenreich, R. mRNA cancer vaccines. Recent Results Cancer Res. 209, 61–85 (2016).
Article CAS PubMed Google Scholar
Sahin, U., Karikó, K. & Töreci, Ö. mRNA-based therapeutics — developing a new class of drugs. Nat. Rev. Drug Discov. 13, 759–780 (2014).
Article CAS PubMed Google Scholar
Rivas, E. & Eddy, S. A dynamic programming algorithm for RNA structure prediction including pseudoknots. J. Mol. Biol. 285, 2053–2068 (1999).
Article CAS PubMed Google Scholar
Hajdin, C. et al. Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots. Proc. Natl. Acad. Sci. USA 110, 5498–5503 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Biesiada, M., Purzycka, K., Szachniuk, M., Blazewicz, J. & Adamiak, R. Automated RNA 3D Structure Prediction with RNAComposer. Methods Mol. Biol. 1490, 199–215 (2016).
Article CAS PubMed Google Scholar
Xu, X. & Chen, S. Hierarchical assembly of RNA three-dimensional structures based on loop templates. J. Phys. Chem. B 122, 5327–5335 (2018).
Article CAS PubMed PubMed Central Google Scholar
Boniecki, M. et al. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction. Nucleic Acids Res. 44, e63–e63 (2016).
Article PubMed CAS Google Scholar
Krokhotin, A., Houlihan, K. & Dokholyan, N. iFoldRNA v2: folding RNA with constraints. Bioinformatics 31, 2891–2893 (2015).
Article CAS PubMed PubMed Central Google Scholar
Watkins, A., Rangan, R. & Das, R. FARFAR2: improved de novo rosetta prediction of complex global RNA folds. Structure 28, 963–976.e6 (2020).
Article PubMed PubMed Central CAS Google Scholar
Lu, X., Bussemaker, H. & Olson, W. DSSR: an integrated software tool for dissecting the spatial structure of RNA. Nucleic Acids Res. 43, e142–e142 (2015).
Article PubMed PubMed Central CAS Google Scholar
Abraham, M., Murtola, T. & Schulz, R. et al. Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1-2, 19–25 (2015).
Article ADS Google Scholar
Zgarbová, M. et al. Refinement of the Cornell et al. nucleic acids force field based on reference quantum chemical calculations of glycosidic torsion profiles. J. Chem. 7, 2886–2902 (2011).
Google Scholar
Jorgensen, W., Chandrasekhar, J., Madura, J., Impey, R. & Klein, M. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
Article ADS CAS Google Scholar
Hess, B., Bekker, H., Berendsen, H. & Fraaije, J. Lincs: A linear constraint solver for molecular simulations. J. Comput. Chem. 18, 1463–1472 (1997).
Article CAS Google Scholar
Essmann, U. et al. A smooth particle mesh ewald method. J. Chem. Phys. 103, 8577–8593 (1995).
Article ADS CAS Google Scholar
Grant, B., Rodrigues, A., ElSawy, K., McCammon, J. & Caves, L. Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics 22, 2695–2696 (2006).
Article CAS PubMed Google Scholar
Bottaro, S., Di Palma, F. & Bussi, G. The role of nucleobase interactions in RNA structure and dynamics. Nucleic Acids Res. 42, 13306–13314 (2014).
Article CAS PubMed PubMed Central Google Scholar
Bottaro, S. et al. Barnaba: software for analysis of nucleic acid structures and trajectories. RNA 25, 219–231 (2019).
Article CAS PubMed PubMed Central Google Scholar
Williams, C., Headd, J., Moriarty, N. & Prisant, M. et al. Molprobity: More and better reference data for improved all-atom structure validation. Protein Sci. 27, 293–315 (2018).
Article CAS PubMed Google Scholar
Schrödinger, LLC. The PyMOL molecular graphics system, version 1.8 (2015) .
Brierley, I., Pennell, S. & Gilbert, R. Viral RNA pseudoknots: versatile motifs in gene expression and replication. Nat. Rev. Microbiol. 5, 598–610 (2007).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Shereef Elmetwaly for technical assistance and David Ackerman, Stratos Efstathiadis, and Shenglong Wang from the NYU High-Performance Computing facilities for providing our group dedicated resources to perform this work.

We gratefully acknowledge funding from the National Science Foundation RAPID Award 2030377 from the Division of Mathematical Science and the Division of Chemistry, National Science Foundation Award DMS-2151777 from the Division of Mathematical Sciences, National Institutes of Health R35GM122562 Award from the National Institute of General Medical Sciences, and Philip-Morris International to T. Schlick.

Author information

These authors contributed equally: Shuting Yan, Qiyao Zhu.

Authors and Affiliations

Department of Chemistry, New York University, 100 Washington Square E, New York, 10003, NY, USA
Shuting Yan, Swati Jain & Tamar Schlick
Courant Institute of Mathematical Sciences, New York University, 251 Mercer St, New York, 10012, NY, USA
Qiyao Zhu & Tamar Schlick
NYU-ECNU Center for Computational Chemistry, NYU Shanghai, 3663 North Zhongshan Road, Shanghai, 200062, China
Tamar Schlick
Simons Center for Computational Physical Chemistry, New York University, 24 Waverly Place, New York, 10003, NY, USA
Tamar Schlick

Authors

Shuting Yan
View author publications
You can also search for this author in PubMed Google Scholar
Qiyao Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Swati Jain
View author publications
You can also search for this author in PubMed Google Scholar
Tamar Schlick
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.S. conceived the project and supervised the study. Q.Z. predicted FSE 2D structures, S.J. predicted initial 3D models, S.J. and S.Y. performed molecular dynamics simulations, and S.Y. and Q.Z. validated 3D models and MD trajectories. All authors analyzed the data and wrote the manuscript, and Q.Z. and S.Y. prepared the figures.

Corresponding author

Correspondence to Tamar Schlick.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks Karissa Sanbonmatsu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yan, S., Zhu, Q., Jain, S. et al. Length-dependent motions of SARS-CoV-2 frameshifting RNA pseudoknot and alternative conformations suggest avenues for frameshifting suppression. Nat Commun 13, 4284 (2022). https://doi.org/10.1038/s41467-022-31353-w

Download citation

Received: 10 December 2021
Accepted: 10 June 2022
Published: 25 July 2022
DOI: https://doi.org/10.1038/s41467-022-31353-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.