Abstract
The RNA pseudoknot that stimulates programmed ribosomal frameshifting in SARS-CoV-2 is a possible drug target. To understand how it responds to mechanical tension applied by ribosomes, thought to play a key role during frameshifting, we probe its structural dynamics using optical tweezers. We find that it forms multiple structures: two pseudoknotted conformers with different stability and barriers, and alternative stem-loop structures. The pseudoknotted conformers have distinct topologies, one threading the 5′ end through a 3-helix junction to create a knot-like fold, the other with unthreaded 5′ end, consistent with structures observed via cryo-EM and simulations. Refolding of the pseudoknotted conformers starts with stem 1, followed by stem 3 and lastly stem 2; Mg2+ ions are not required, but increase pseudoknot mechanical rigidity and favor formation of the knot-like conformer. These results resolve the SARS-CoV-2 frameshift signal folding mechanism and highlight its conformational heterogeneity, with important implications for structure-based drug-discovery efforts.
Introduction
Like most coronaviruses, the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) causing the COVID-19 pandemic makes use of −1 programmed ribosomal frameshifting (−1 PRF) to express proteins that are essential for viral replication1. In −1 PRF, a shift in the reading frame of the ribosome at a specific location in the RNA message is stimulated by a structure in the mRNA located 5–7 nt downstream of the slippery sequence where the reading frameshift occurs, thereby generating alternate gene products2,3. Previous work on viruses including HIV-1 and SARS-CoV showed that mutations modulating the level of −1 PRF can significantly attenuate viral propagation in cell culture4,5,6. As a result, the structures stimulating −1 PRF are potential targets for anti-viral drugs7,8,9, motivating efforts to find ligands active against −1 PRF in SARS-CoV-2 that could be used to treat COVID-1910,11,12,13,14.
The pseudoknot stimulating −1 PRF in SARS-CoV-2 has a three-stem architecture1,10,15,16 (Fig. 1a) that is characteristic of coronaviruses, in contrast to the more common two-stem architecture of most viral frameshift-stimulatory pseudoknots17. Cryo-EM imaging10,15 and computational modeling18 both suggest that the SARS-CoV-2 pseudoknot can take on several different conformers (Fig. 1b, c). Some of these conformers involve knot-like fold topologies that have not previously been observed in frameshift-stimulatory pseudoknots, specifically conformers with the 5′ end threaded through the junction between the three helices to generate what we term a ring-knot10,15,18. Such a 5′-end threaded ring-knot fold has only previously been observed in viral exoribonuclease-resistant RNAs19,20,21. Intriguingly, the co-existence of multiple conformers in the SARS-CoV-2 pseudoknot is consistent with evidence from studies of various stimulatory structures, both pseudoknots and hairpins22,23,24,25, as well as from studies of the effects of anti-frameshifting ligands26, showing that the stimulation of −1 PRF is linked to conformational heterogeneity in the stimulatory structure. In particular, −1 PRF is linked to the conformational heterogeneity under tension27 in the range of forces applied by the ribosome during translation28,29. However, the dynamic ensemble of conformers populated by the SARS-CoV-2 pseudoknot has not yet been explored experimentally, and the folding mechanism of this pseudoknot—especially its unusual ring-knotted conformer—remains unknown.
Here we examine the conformational dynamics of the SARS-CoV-2 pseudoknot in the single-molecule regime. We study it under tension in optical tweezers in order to mimic the situation seen during −1 PRF, where the force applied by the ribosome is ramped up and down as the ribosome attempts to resolve the mRNA structure before shifting reading frame30. Such force spectroscopy measurements are also a powerful tool for characterizing folding mechanisms31 and the energy landscapes that govern folding dynamics32. We find that the SARS-CoV-2 frameshift signal indeed forms at least two distinct pseudoknotted conformers, one involving threading of the 5′-end to form a ring-knot and the other without any threading. Stem 1 usually folds first, followed by stem 3 and lastly stem 2, but sometimes alternate stem-loops form that displace the pseudoknotted structures; Mg2+ rigidifies the pseudoknot structures and favors the formation of the threaded conformer. The existence of multiple conformers of this frameshift signal has important implications for structure-based efforts to find small-molecule therapeutics targeting −1 PRF.
Results
To probe the conformations formed by the SARS-CoV-2 pseudoknot, their folding pathways, and the dynamics under tension, we annealed a single RNA molecule containing the sequence of the pseudoknot flanked by handle regions to DNA handles that were attached to beads held in optical traps (Fig. 2a). We then moved the traps apart to ramp up the force and unfold the RNA, and brought them back together to ramp down the force and refold the RNA. Force-extension curves (FECs) measured during unfolding in near-physiological ionic conditions (130 mM K+, 4 mM Mg2+) showed one or more characteristic transitions in which the extension abruptly increased and force simultaneously decreased when part or all of the structure unfolded cooperatively (Fig. 2b). Unfolding events were observed over a range of forces from ~5 to 50 pN; similar transitions were seen in refolding FECs, but at forces below ~15 pN (Fig. 2c).
Examining the unfolding FECs in more detail, we found two qualitatively different behaviors distinguished by different length changes. Measuring the amount of RNA unfolded by fitting the FECs before and after the transition to worm-like chain (WLC) polymer elasticity models (Eq. (1), Methods) for the handles and unfolded RNA (Fig. 2b, dashed lines), we found that ~80% of FECs (Fig. 2b, black and magenta) showed a contour length change of ΔLc = 35.6 ± 0.4 nm for complete unfolding (Supplementary Table 1). This result was consistent with the value expected for full unfolding of the pseudoknot, 34.7–36.5 nm, based on cryo-EM reconstructions10,15 and computational modeling of likely structures18. Sometimes these curves contained an intermediate state, I1 (Fig. 2b, magenta), which unfolded with a length corresponding to stem 1 (ΔLc = 17.5 ± 0.4 nm), but most of the time they showed a single cooperative unfolding transition (Fig. 2b, black). The remaining ~20% of the FECs (Fig. 2b, green) unfolded with a smaller total length change of ΔLc = 25 ± 1 nm, indicating an alternate structure that was incompletely folded (denoted as Alt), at forces of ~10–20 pN characteristic of hairpins33 (Supplementary Fig. 1). Such alternative structures have been observed previously for many frameshift signals22,34. This second class of FECs also often contained at least one unfolding intermediate.
There is a characteristic shape expected for the distribution of unfolding forces, p(Fu), for unfolding across a single barrier35, hence p(Fu) can reveal the presence of distinct initial states during the unfolding36,37. For the population of FECs with full-length unfolding, two peaks were seen in p(Fu): a minor peak near 16 pN and a larger peak near 30 pN (Fig. 2d, black). The double peak indicates the presence of at least two distinct initial conformers, which despite sharing the same total length change nevertheless unfold over different barriers, leading to different shapes for their unfolding force distributions. Such behavior has been seen previously in ligand-bound riboswitches38 and proteins36. By fitting p(Fu) to a kinetic model for barrier crossing (Eq. 2, Methods), the shape of the energy barrier can be characterized through its height (ΔG‡) and distance from the folded state (Δx‡), reporting on the nature of the interactions that hold the structure together32. We found that p(Fu) fit better to the distribution expected for two initial states (Fig. 2d, red; fit parameters listed in Supplementary Table 2) than to the distribution expected for a single initial state (Fig. 2d, blue), as assessed by the Akaike information criterion (AIC) (see Methods)39. The results for ΔG‡ were similar within error for the two initial states, respectively, 31 ± 4 kJ/mol for the higher-force state (denoted N) and 33 ± 5 kJ/mol for the lower-force state (denoted N′), but Δx‡ was notably smaller for N: 0.7 ± 0.1 nm, compared to 2.1 ± 0.3 nm for N′, implying a more rigid structure for N. In both cases, however, the value for Δx‡ was consistent with the range characteristic of pseudoknots22,40 and other structures containing tertiary contacts41,42,43, but too short for structures consisting only of stem-loops33. The great majority of the unfolding FECs showing full-length ΔLc started in state N (91 ± 2%), with only a small minority (9 ± 2%) starting in state N′.
Given that the SARS-CoV-2 pseudoknot is predicted to form different fold topologies, such as the 5′-threaded and -unthreaded conformers seen in simulations18, such different conformers would be expected to give rise to sub-populations with different mechanical properties, because 5′-threaded folds are generally more mechanically resistant than unthreaded folds21. To test if the high-force population involved threading of the 5′ end, we explored if the proportions of the high-force and low-force populations could be modulated by changing the proximity of the duplex handle to the 5′ end of stem 1: steric hindrance from a bulky duplex that is too close to the stem 1/stem 3 junction where 5′-end threading takes place would be expected to reduce the likelihood of threading. In the construct measured in Fig. 2, the duplex handle was separated from the end of stem 1 by a 6-nt single-stranded spacer, to minimize potential interactions with the duplex during folding. We first re-measured the FECs after reducing the length of the spacer to only 1 nt (Fig. 3a). We found the same contour length changes as before (Supplementary Table 1), but now the lower-force peak in p(Fu) was increased from a small shoulder to a prominent peak (Fig. 3b), with the fraction of FECs showing full-length ΔLc attributed to N′ doubling to 20 ± 3% of the FECs, while the fraction attributed to N decreased correspondingly to 80 ± 3%. To induce even stronger interference of the duplex handle with the pseudoknot folding, we extended the handle past the 5′ end of the pseudoknot so that it paired with the first two nucleotides in stem 1 (Fig. 3c, inset). The FECs measured using this construct (Supplementary Fig. 2) revealed an unfolding force distribution with an even greater increase in the occurrence of N′, more than doubling again to 45 ± 4% of the curves with full-length ΔLc, and a corresponding decrease in N to 55 ± 4% (Fig. 3c). 
We also confirmed that a 6-nt spacer was sufficiently long to avoid interference of the handle with the pseudoknot folding by measuring a construct with a 12-nt spacer, finding that the proportions of N and N′ were unchanged (Supplementary Fig. 3). Extending the handle duplex closer to the 5′ end thus produced a clear trend, suppressing N but enhancing N′ (Fig. 3d and Supplementary Table 3). However, the occupancy of Alt was effectively unchanged with spacer length (Fig. 3d, brown, and Supplementary Table 3), and the landscape parameters obtained from fitting the two populations (Fig. 3b, c, red) also remained the same within error (Supplementary Table 2). The fact that the only significant effect of changing the handles was to rebalance the N:N′ ratio supports the conclusion that N is a 5′-threaded conformer, whereas N′ is an unthreaded conformer.
Turning to the refolding FECs, we found that the pseudoknotted conformers always refolded through one or more intermediate states. The first refolding transition, which was in the force range ~10–15 pN, had ΔLc = 17.9 ± 0.6 nm (Fig. 2c), consistent with the value of 16.7 nm expected for folding of stem 1 in both the threaded and unthreaded models18 (Supplementary Table 1 and Supplementary Fig. 4). After forming stem 1, often the rest of the pseudoknot appeared to refold all at once (Fig. 2c, red), without detectable intermediates, but other times it was seen to refold through an additional intermediate (Fig. 2c, blue). The cumulative length changes from the unfolded state for these subsequent transitions were ΔLc = 29.2 ± 0.6 nm and 35.3 ± 0.6 nm, consistent with expectations respectively for folding stem 3 in addition to stem 1 (29.8 nm), and then for folding stem 2 as well to form the complete pseudoknot (matching the value seen for complete unfolding). Stem 2 thus folded last, after stem 3. Intriguingly, this order is precisely what is needed to form a 5′-threaded fold topology: stem 2 must form last, after the 5′ end is threaded across the junction between stems 1 and 3. To confirm the identification of these structures in the intermediates, we repeated the measurements using anti-sense oligos to block the formation either of stem 1 (oligo 1) or stem 2 and part of stem 3 (oligo 2), as shown in Fig. 4a. The initial refolding transitions with oligo 2 present (Fig. 4b) showed effectively the same p(Fr) (Fig. 4c, red) as without the oligo (Fig. 4c, black), and close to the same ΔLc, too, albeit elongated by an extra ~2 base-pairs formed with the part of stem 3 liberated by oligo 2 (Supplementary Table 1 and Supplementary Fig. 4e), confirming that stem 1 was first to refold. FECs with oligo 1 (Fig. 4d), on the other hand, showed refolding at notably lower force than stem 1 (Fig. 4c, blue); unfolding proceeded via an intermediate with a length corresponding to the lower half of stem 3, ΔLc = 8 ± 1 nm (Supplementary Table 1 and Supplementary Fig. 4d).
We also tested the importance of Mg2+ for the folding and stability of the pseudoknot by re-measuring FECs in the absence of Mg2+ (Fig. 4e). We found that almost all (97%) of the curves showed the length change for pseudoknot unfolding, although ΔLc was ~1 nm shorter than previously (Supplementary Table 1), indicating that the absence of Mg2+ disfavored Alt. Two populations were still present in p(Fu) (Fig. 4f), but the high-force population (N) was greatly reduced from what was observed with Mg2+ present (Fig. 2d), down to only 40 ± 8% of the FECs showing full-length unfolding, and it peaked at lower forces. Mg2+ was therefore not required for folding of the pseudoknot, but it played a key role in promoting the formation of the higher-force population attributed to the ring-knot fold topology. Fitting p(Fu) to characterize changes in the landscape (Fig. 4f, red), we found that ΔG‡ was little changed, but Δx‡ was significantly higher, rising to 4.0 ± 0.6 nm for N′ and 2.8 ± 0.7 nm for N (Supplementary Table 2). The pseudoknots were thus much less rigid without Mg2+.
Finally, we examined the thermodynamic stability of N and N′ by using the Jarzynski equality44 to estimate the free-energy change relative to the unfolded state based on the non-equilibrium work done during unfolding45, while accounting for non-equilibrium populations of N and N′46. The stability of the threaded conformer N in the presence of Mg2+ was estimated as ΔGN = 61 ± 7 kBT. This value was nominally somewhat higher than the stability of the unthreaded conformer, estimated as ΔGN′ = 55 ± 6 kBT, but similar within the error, which was relatively large because the unfolding was not near equilibrium. Repeating the analysis for the FECs measured without Mg2+, we found stabilities of 55 ± 2 kBT and 53 ± 2 kBT, respectively, for N and N′, suggesting that the threading does not significantly change the thermodynamic stability of the pseudoknot, even though it does change the mechanical stability.
Discussion
These results confirm the suggestion from simulations and cryo-EM imaging that the SARS-CoV-2 frameshift signal can form a variety of different structures. The state N, which was by far the most common conformation under physiological-like conditions (handle far from stimulatory structure, with Mg2+ but without oligos), unfolded through the full length of the pseudoknot at moderately high force. This conformation was suppressed significantly by occlusion of the 5′ end by the duplex handle, precisely as would be expected for a 5′-end threaded structure such as those seen in cryo-EM images on and off the ribosome10,15 or predicted from simulations18. In contrast, the conformation unfolding at lower force, N′, while occurring ten-fold less frequently than N under normal conditions, increasingly replaced N as occlusion of the 5′ end suppressed the occupancy of N, as would be expected for a conformation in which the 5′ end remains unthreaded. Unthreaded conformers have been predicted computationally18 but not yet characterized structurally in experiments, although some individual cryo-EM images show the straight morphology expected for unthreaded conformers, in contrast to the bent shape of threaded conformers10.
The picture of the pseudoknot folding and unfolding that emerges from this work is illustrated in Fig. 5. Stem 1 always folds first, followed by sequential folding of stem 3 and then stem 2. The orientation of the 5′ end at the moment of stem 2 formation leads to two distinct fold topologies that cannot interconvert: 5′-threaded or -unthreaded. These two topologies give rise to distinct unfolding behaviors: higher forces for the threaded fold, lower forces for the unthreaded fold. The partitioning of the folding at the point when stem 2 forms—depending on whether or not the 5′ end is lying across the stem 1/stem 3 junction, as required for threading—ensures the presence of both threaded and unthreaded conformers, with the minority unthreaded state populated at some finite level, similar to what was seen in the folding of the Zika exoribonuclease-resistant RNA (xrRNA)21. This folding mechanism is dependent on stem 2 folding last; as it happens, stem 2 is also predicted by mfold47 to be the least stable thermodynamically, whereas stem 1 is expected to be the most stable, so that the folding is ordered by the relative stabilities of the stems as seen previously for two-stem pseudoknots34,48. Intriguingly, this order is also the same one in which the stems would refold co-translationally as the ribosome leaves the frameshift signal: stem 1 is at the 5′ end and would be expected to refold first, whereas stem 2 is at the 3′ end and would not be able to refold until after the whole pseudoknot emerged from the ribosome. Notably absent from this picture of the folding, however, is a third fold that was predicted computationally with the 3′ threaded through the stem 2/stem 3 junction18, which makes sense given that this fold requires stem 1 to form last instead of first.
The energy landscape parameters for unfolding states N and N′ further support this picture, in particular Δx‡, which reports on the mechanical rigidity of the RNA structure. Smaller Δx‡ implies a rigid structure that is less sensitive to tension and hence more likely to rupture in a brittle manner at high force, whereas larger Δx‡ implies a compliant structure that is more sensitive to tension and ruptures in a lower, narrower range of forces. State N was much more mechanically rigid than N′, with Δx‡ roughly three times smaller, which makes sense in terms of the mechanical effects of threading: threading of the 5′ end should rigidify the fold via interactions between the 5′ end and the helical junction in the pseudoknot18 that constrain the motion of the terminus in response to tension applied to the 5′ end, compared to the unthreaded fold. Indeed, the Δx‡ value of only 0.7 nm for state N is among the smallest reported for any pseudoknot, signifying the particular rigidity of this unusual fold topology; in contrast, the Δx‡ value for N′ is comparable to the range of values more typically reported for other pseudoknots, ~1.5–2 nm22,40.
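The link between Δx‡ and sensitivity to tension can be made concrete with a minimal sketch that evaluates the force-dependent rate expression k(F)/k0 given in the Methods, using the values of Δx‡ and ΔG‡ reported above for N and N′ in the presence of Mg2+. The thermal energy (4.11 pN·nm, roughly room temperature), the unit-conversion constant, and the function name `rate_enhancement` are assumptions introduced for illustration; k0 is not needed because only the ratio k(F)/k0 is computed.

```python
import math

KBT_PN_NM = 4.11  # thermal energy in pN*nm (assumed, ~298 K)
# 1 kJ/mol expressed in pN*nm per molecule (1 J = 1e21 pN*nm)
KJ_PER_MOL_TO_PN_NM = 1e3 / 6.02214076e23 * 1e21


def rate_enhancement(force_pn, dx_nm, dg_kj_mol, kbt=KBT_PN_NM):
    """Return k(F)/k0 for the barrier-crossing rate quoted in the Methods.

    force_pn  : applied force in pN
    dx_nm     : distance to the barrier, in nm
    dg_kj_mol : barrier height, in kJ/mol
    """
    dg = dg_kj_mol * KJ_PER_MOL_TO_PN_NM
    s = 1.0 - 2.0 * dx_nm * force_pn / (3.0 * dg)
    if s <= 0.0:
        # Beyond this force the barrier has vanished and the expression is invalid
        raise ValueError("force exceeds the validity range of the model")
    return math.sqrt(s) * math.exp(dg / kbt * (1.0 - s ** 1.5))


# Parameters quoted in the Results (with Mg2+)
state_n = {"dx_nm": 0.7, "dg_kj_mol": 31.0}
state_n_prime = {"dx_nm": 2.1, "dg_kj_mol": 33.0}

for f in (5.0, 10.0, 15.0):
    print(f, rate_enhancement(f, **state_n), rate_enhancement(f, **state_n_prime))
```

At any given force below the validity limit, the rate enhancement is larger for N′ than for N, which is the sense in which the state with the larger Δx‡ is more compliant and more sensitive to tension.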
Even though the observation of state N in the absence of Mg2+ shows that Mg2+ is not absolutely required for the pseudoknot folding, consistent with results from NMR16, Mg2+ clearly plays an important role. Given that Δx‡ increased significantly for both N and N′ in the absence of Mg2+, the latter must be essential for stabilizing tertiary contacts that rigidify both the threaded and unthreaded folds. Structural studies have not yet resolved coordinated Mg2+ ions or determined how Mg2+ affects the pseudoknot structure. However, comparing NMR results to computational modeling16,18 suggests that Mg2+-mediated tertiary interactions may be especially important in the stem 2/loop 1 region, possibly explaining why both N and N′ are more compliant in the absence of Mg2+. Mg2+ must also be important in stabilizing threading of the 5′ end into the stem 1/stem 3 junction, since removing Mg2+ greatly reduced the incidence of the threaded conformer. Presumably, Mg2+ binds near the stem 1/stem 3 junction to help coordinate the threading, consistent with suggestions from simulations showing Mg2+-mediated interactions between the 5′ end and stem 1 that make the three-helix junction more compact18. Such contacts with the threaded end in N might explain why the rigidification is twice as large for N as it is for N′ (four-fold decrease in Δx‡ instead of two-fold).
Comparing the SARS-CoV-2 pseudoknot to the Zika virus xrRNA, the only other RNA forming a similar ring-knot whose folding has been studied, reveals some interesting differences. The ring-knot in the Zika virus xrRNA unfolds at a much higher force, above 60 pN, acting as a mechanical road-block to digestion of the viral RNA by host exoribonucleases21. Such extreme mechanical resistance would be functionally counterproductive for the SARS-CoV-2 pseudoknot, however, because although the latter acts to induce a frameshift, it nevertheless must not prevent ribosomal translocation. In addition, these two RNAs differ in the importance of Mg2+ for threading of the 5′ end before closure of the pseudoknot: the absence of Mg2+ abolishes threading entirely for the Zika virus xrRNA, leading to the formation of different structures, but it has a much less dramatic effect on the SARS-CoV-2 pseudoknot, partially inhibiting rather than abolishing the threading so as to rebalance the proportions of N and N′. We speculate that the reduced role of Mg2+ in 5′-threading in the SARS-CoV-2 pseudoknot may help to reduce the mechanical stability of the ring-knot sufficiently to allow the ribosome to unfold it during −1 PRF.
Turning to the non-pseudoknotted conformer, Alt, the FECs provide several clues to its identity. Its low unfolding force and the relatively large distance to the barrier found from fitting the force distribution (Supplementary Fig. 1, red), Δx‡ = 4.5 ± 0.8 nm, indicate that it involves secondary structure only. Moreover, the length change reveals that 46 ± 2 nts are folded in Alt, and the fact that Alt does not rapidly convert into N or N′ implies that it involves stems that differ from those in the pseudoknotted conformers. The most likely structure consistent with these results is the hairpin with multiple bulges shown in Supplementary Fig. 4c. Because Alt is much less stable than N/N′, it would be expected eventually to convert into a pseudoknot, given enough time. Indeed, analyzing only the first FEC measured for each molecule (for which the RNA had much more time to find the minimum-energy state than during repeated unfolding/refolding cycles) supports this picture: the fraction of FECs starting in Alt was reduced significantly, by roughly a factor of 3, to 6 ± 4%. Moreover, this conversion was occasionally seen directly, in rare examples where the RNA folded into Alt but converted to N/N′ during the subsequent unfolding FEC (Supplementary Fig. 5). Alt was almost eliminated in the absence of Mg2+, suggesting that Mg2+ stabilizes it and helps to trap the RNA in Alt kinetically49.
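As a rough illustration of how a contour length change maps onto a number of folded nucleotides, the sketch below uses the contour length per nucleotide given in the Methods (0.59 nm/nt) together with a commonly used relation, ΔLc = n × 0.59 nm − d, where d is the distance between the ends of the folded structure. Both the relation and the value d ≈ 2.2 nm (roughly the width of an A-form helix, appropriate for a hairpin) are assumptions made here for illustration and are not stated in the text; the function name is likewise hypothetical.

```python
NM_PER_NT = 0.59  # contour length per nucleotide of unfolded RNA (from the Methods)


def nucleotides_folded(delta_lc_nm, end_to_end_nm):
    """Estimate the number of nucleotides released on unfolding.

    delta_lc_nm   : measured contour length change, in nm
    end_to_end_nm : assumed distance between the termini of the folded
                    structure, in nm (an illustrative assumption)
    """
    return (delta_lc_nm + end_to_end_nm) / NM_PER_NT


# Alt unfolds with a total length change of about 25 nm; assume a hairpin-like
# structure whose termini are separated by roughly one helix width.
n_alt = nucleotides_folded(25.0, 2.2)
print(round(n_alt))
```

Under these assumptions, the estimate is consistent with the 46 ± 2 nts quoted above for Alt.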
The significant heterogeneity seen here for the SARS-CoV-2 frameshift signal is entirely consistent with the direct correlation between conformational heterogeneity and −1 PRF efficiency found in recent work27, given the relatively high level of −1 PRF in SARS-CoV-2 observed in functional assays1,11, underscoring the functional relevance of understanding the force-dependent conformational dynamics. We note that the single-molecule force spectroscopy (SMFS) assay mimics several physiologically important features of −1 PRF: the stimulatory structure is indeed under tension applied directly by the ribosome28, this tension is ramped up and down as the ribosome attempts to unfold the RNA during −1 PRF30, and the stimulatory structure undergoes repeated unfolding/refolding cycles as multiple ribosomes translocate through it, sometimes in rapid bursts50. However, SMFS does not perfectly recapitulate the circumstances in the cell. For example, the ribosome only applies force to the 5′ end of the RNA (not both ends as in the tweezers), and the force profile over time is more complex, including periods of sustained tension on the RNA while the ribosome is paused at the frameshift site28, in addition to ramps up and down. Specific contacts between the SARS-CoV-2 pseudoknot and the ribosome are also proposed to play a role in −1 PRF15. As a result, the forces needed to unfold the RNA in SMFS may differ from those involved in unfolding it in the cell. Although the duplex handles in the SMFS assay are not present in the cell, the fact that no change is seen when moving the handle from 6 to 12 nt away from the pseudoknot suggests that a 6-nt spacer is sufficient for the handle to have little to no effect on the folding.
Finally, we note that the existence of distinct fold topologies has important implications for structure-based drug-discovery efforts targeting the SARS-CoV-2 frameshift signal, because the structure of the junction between the helices in the pseudoknot, which is the locus of the most likely binding pockets for small molecules14,51, is strongly affected by whether the 5′ end is threaded or not18. Combining this observation with the previous result showing that −1 PRF efficiency varies linearly with the conformational heterogeneity as measured by the Shannon entropy27 suggests a strategy for developing small-molecule modulators of −1 PRF in SARS-CoV-2: ligands that stabilize the 5′-threaded conformer (thereby decreasing the heterogeneity) should be sought for inhibiting −1 PRF, whereas ligands that stabilize the unthreaded conformer (thereby increasing the heterogeneity) should be sought for enhancing −1 PRF. Future work characterizing the interactions of the SARS-CoV-2 frameshift signal with ligands that have been found to modulate −1 PRF should help clarify the molecular mechanisms of action.
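The idea that stabilizing one conformer lowers the heterogeneity can be sketched by computing a Shannon entropy over conformer occupancies. The example below is illustrative only: the choice of natural logarithm, the set of states counted (N, N′, Alt), and the "ligand-stabilized" occupancies are assumptions, and the exact definition used in ref. 27 may differ. The baseline occupancies are derived from the approximate fractions reported in the Results (about 80% pseudoknot, split 91:9 between N and N′, and about 20% Alt).

```python
import math


def shannon_entropy(occupancies):
    """Shannon entropy (natural log) of a set of state occupancies.

    Occupancies are normalized to sum to 1; empty states contribute nothing.
    """
    total = sum(occupancies)
    probs = [p / total for p in occupancies if p > 0]
    return -sum(p * math.log(p) for p in probs)


# Approximate occupancies of N, N', and Alt with Mg2+, derived from the Results
baseline = [0.80 * 0.91, 0.80 * 0.09, 0.20]

# Hypothetical occupancies if a ligand strongly favored the threaded conformer N
# (illustrative numbers, not measured values)
stabilized_n = [0.95, 0.03, 0.02]

print(shannon_entropy(baseline), shannon_entropy(stabilized_n))
```

In this sketch, shifting the population toward a single conformer reduces the entropy, which is the direction of change proposed above for ligands intended to inhibit −1 PRF.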
Methods
Sample preparation
Samples consisting of a single RNA strand linked at each end to double-stranded handles were prepared in two ways. (1) An RNA strand containing the SARS-CoV-2 pseudoknot and spacer sequences (Fig. 1a and Supplementary Table 4) flanked by long handle sequences was annealed to single-stranded (ss) DNA complementary to the handle sequences22. The DNA fragment corresponding to the sequence in Fig. 1a was cloned into the pMLuc-1 plasmid between the BamHI and SpeI sites. A 2749-bp DNA transcription template was amplified by PCR from this plasmid, containing a T7 promoter in the upstream primer, followed by a 1882-bp handle sequence, the pseudoknot in the middle, and then a 798-bp handle sequence downstream; RNA was transcribed from this template in vitro. Two ssDNA handles complementary to the upstream and downstream handle sequences in the RNA were created by asymmetric PCR. The 3′ end of the 1882-nt DNA handle was labeled with dig-ddUTP using terminal transferase (Roche), and the 798-nt DNA handle of the transcript was functionalized with biotin on the 5′ end of the PCR primer. The RNA transcript was then annealed to the DNA handles, completing the construct. (2) In the second method, a shorter RNA strand was annealed to a 300-nt ssDNA handle on one end, and to the overhang on a 2094-bp double-stranded (ds) DNA handle on the other end. The dsDNA handle, labeled with digoxigenin via the upstream primer, was prepared by digesting a 2075-bp PCR product amplified from the plasmid pUC19 with PspGI, and then ligating a 56-nt DNA oligo to the digest sticky end to create a 36-nt 3′ overhang. RNA was transcribed in vitro from a 406-bp DNA template containing a T7 promoter, the 36-nt sequence complementary to the ssDNA overhang, the pseudoknot and spacer sequences (Fig. 1a), and a 300-nt handle sequence; this transcription template was made by cloning the required DNA sequences into a modified pMLuc-1 vector between the XhoI and SpeI sites.
A 300-nt ssDNA handle complementary to the handle region of the RNA transcript was made by asymmetric PCR and annealed to the RNA, as above. The resulting DNA-RNA complex was then annealed to the overhang of the dsDNA handle, completing the construct. The sequences of the primers used in the two construct designs are listed in Supplementary Table 5.
The RNA/handle constructs were readied for measurement by diluting them to ~160 pM, mixing them with equal volumes of 600- and 820-nm diameter polystyrene beads (coated respectively with avidin DN and anti-digoxigenin) at concentrations of ~250 pM, and incubating the mixture for ~1 h at room temperature to create RNA-bead dumbbells. The incubation was then diluted ~100-fold into RNase-free measuring buffer: 50 mM MOPS pH 7.5, 130 mM KCl, 4 mM MgCl2, and 200 U/mL RNase inhibitor (SUPERase•In, Ambion). An oxygen scavenging system consisting of 40 U/mL glucose oxidase, 185 U/mL catalase, and 250 mM D-glucose was also included in the buffer. The diluted dumbbells were placed in a flow chamber prepared on a microscope slide and inserted into the optical trap.
FEC measurements and data analysis
Measurements were made using custom-built optical traps described previously38 controlled by LabVIEW 2018.0.1. Traps were calibrated for position detection for each dumbbell prior to measurement following standard methods52. FECs were measured by moving the traps apart at a constant speed of ~160 nm/s to increase the force up to ~50 pN and unfold the RNA, bringing them back together at the same speed to ramp the force back down to ~0 pN, waiting 5–10 s to allow refolding, and then repeating the cycle. Trap stiffnesses were 0.45–0.62 pN/nm. Data were sampled at 20 kHz and filtered online at the Nyquist frequency. For measurements with anti-sense oligos, oligo 1 or oligo 2 was added to the measuring buffer at a final concentration of 10 μM. For measurements in the absence of Mg2+, MgCl2 was removed from the measuring buffer and EDTA was added to a final concentration of 1 mM.
Each of the branches of the FECs separated by “rips” representing unfolding/refolding transitions was fit to an extensible WLC model relating the applied force, F, and molecular extension, x:
\(F(x)=\frac{k_{\mathrm{B}}T}{L_{\mathrm{p}}}\left[\frac{1}{4}{\left(1-\frac{x}{L_{\mathrm{c}}}+\frac{F}{K}\right)}^{-2}-\frac{1}{4}+\frac{x}{L_{\mathrm{c}}}-\frac{F}{K}\right]\)  (1)
where Lp is the persistence length, Lc the contour length, and K the enthalpic elasticity53. Two WLCs in series were used for the fitting, one to describe the duplex handles and the other for the unfolded RNA38. The WLC parameters for the handles, found from fitting the folded state of the FECs, were typically Lp ~ 40 nm, Lc ~ 850 nm for the shorter construct and ~950 nm for the longer one, and K ~ 1000 pN. The parameters for the unfolded RNA were fixed at Lp = 1 nm, Lc = 0.59 nm/nt, and K = 1500 pN, so that the only free parameter was the number of nucleotides unfolded38.
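A minimal sketch of how two extensible WLCs in series can be evaluated is given below. It assumes the standard extensible WLC interpolation formula, F = (kBT/Lp)[1/(4(1 − x/Lc + F/K)²) − 1/4 + x/Lc − F/K], and solves it for the extension at a given force by bisection. The thermal energy of 4.11 pN·nm, the function names, and the neglect of the end-to-end distance of the folded structure are simplifying assumptions; this is not the fitting code used for the measurements.

```python
KBT_PN_NM = 4.11  # thermal energy in pN*nm (assumed, ~298 K)


def reduced_force(u):
    """Force in units of kBT/Lp as a function of u = x/Lc - F/K."""
    return 1.0 / (4.0 * (1.0 - u) ** 2) - 0.25 + u


def wlc_extension(force, lp, lc, k, kbt=KBT_PN_NM):
    """Extension (nm) of an extensible WLC at a given force (pN).

    lp: persistence length (nm), lc: contour length (nm),
    k: enthalpic elasticity (pN).
    """
    if force <= 0.0 or lc <= 0.0:
        return 0.0
    target = force * lp / kbt
    lo, hi = 0.0, 1.0 - 1e-12
    # reduced_force is monotonically increasing on [0, 1), so bisection works
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if reduced_force(mid) < target:
            lo = mid
        else:
            hi = mid
    u = 0.5 * (lo + hi)
    return lc * (u + force / k)


def construct_extension(force, n_unfolded_nt, handle_lc=850.0):
    """Total extension of handles plus unfolded RNA, treated as WLCs in series."""
    handles = wlc_extension(force, lp=40.0, lc=handle_lc, k=1000.0)
    rna = wlc_extension(force, lp=1.0, lc=0.59 * n_unfolded_nt, k=1500.0)
    return handles + rna


# Extension gained at 10 pN when 60 nt of RNA (an illustrative number) unfold
print(construct_extension(10.0, 60) - construct_extension(10.0, 0))
```

In a fit of this kind, the handle parameters would be held at the values found from the folded branch, leaving the number of unfolded nucleotides as the only free parameter for each subsequent branch.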
Force distributions were fit to the theory of Dudko et al.35, using
\(p(F)\propto \frac{k(F)}{r}\exp \left\{\frac{k_{0}}{\beta \Delta x^{\ddagger}r}-\frac{k(F)}{\beta \Delta x^{\ddagger}r}{\left(1-\frac{2\Delta x^{\ddagger}F}{3\Delta G^{\ddagger}}\right)}^{-1/2}\right\}\)  (2)
where \(k(F)=k_{0}{\left(1-\frac{2\Delta x^{\ddagger}F}{3\Delta G^{\ddagger}}\right)}^{1/2}\exp \left\{\beta \Delta G^{\ddagger}\left[1-{\left(1-\frac{2\Delta x^{\ddagger}F}{3\Delta G^{\ddagger}}\right)}^{3/2}\right]\right\}\), r is the loading rate, k0 is the unfolding rate at zero force, Δx‡ is the distance from the folded state to the barrier, ΔG‡ is the barrier height, and 1/β = kBT is the thermal energy. Distributions of the unfolding forces were fit by a sum of two independent distributions defined by Eq. 2, representing the contributions from unfolding the independent and non-interconverting states N and N′. For the distributions measured from the constructs with 6- and 12-nt single-stranded spacers, where two peaks in the distribution were not immediately obvious, we verified that two-population fits of p(Fu) were justified using the AIC39 to judge the relative likelihood of the one- and two-population models and the Wald–Wolfowitz runs test to assess the randomness of the fit residuals. These tests rejected the single-population fits in favor of the double-population fits in each case: the difference in AIC values was 11.1 with the 6-nt spacer (0.4% likelihood one-population model is better) and 9.3 with the 12-nt spacer (0.9% likelihood one-population model is better); the Wald–Wolfowitz test statistic values were 1.22 (6-nt spacer) and 1.81 (12-nt spacer), both lower than the critical value of 1.96 (95% confidence level). Errors for the fitting parameters were found from bootstrapping analysis.
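The two model-comparison statistics mentioned above can be sketched as follows. The relative likelihood of the less-favored model is taken as exp(−ΔAIC/2), the usual conversion from an AIC difference, and the runs-test statistic uses the standard normal approximation for the number of sign runs in the fit residuals. The function names are hypothetical, and the residuals in the usage lines are made up for illustration.

```python
import math


def relative_likelihood(delta_aic):
    """Relative likelihood of the model with the higher AIC, exp(-dAIC/2)."""
    return math.exp(-delta_aic / 2.0)


def runs_test_z(residuals):
    """Wald-Wolfowitz runs test z statistic for the signs of fit residuals.

    Residuals equal to zero are ignored. Values of |z| below 1.96 are
    consistent with randomly ordered signs at the 95% confidence level.
    """
    signs = [r > 0 for r in residuals if r != 0]
    n_pos = sum(signs)
    n_neg = len(signs) - n_pos
    if n_pos == 0 or n_neg == 0 or len(signs) < 3:
        raise ValueError("need residuals of both signs")
    # A new run starts wherever consecutive residuals change sign
    runs = 1 + sum(1 for a, b in zip(signs, signs[1:]) if a != b)
    n = n_pos + n_neg
    mean = 2.0 * n_pos * n_neg / n + 1.0
    var = 2.0 * n_pos * n_neg * (2.0 * n_pos * n_neg - n) / (n ** 2 * (n - 1))
    return (runs - mean) / math.sqrt(var)


# AIC differences quoted above for the 6-nt and 12-nt spacer constructs
print(relative_likelihood(11.1), relative_likelihood(9.3))

# Illustrative (made-up) residuals with no obvious sign pattern
print(runs_test_z([0.3, 0.1, -0.2, -0.4, 0.2, -0.1, 0.5, 0.2, -0.3, -0.1]))
```

The AIC differences of 11.1 and 9.3 give relative likelihoods of roughly 0.4% and just under 1%, in line with the percentages quoted in the text.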
The thermodynamic stabilities of N and N′ were determined by calculating distributions of work done during unfolding, found by integrating the fitted unfolding FECs from the extension corresponding to F = 2 pN up to the point where the unfolded state was reached, while subtracting the work done to stretch out the unfolded RNA (found from integrating the WLC for the unfolded state between the same two end-points)42. Because N and N′ have very similar ΔLc and their unfolding force distributions overlap significantly, it was not possible to assign definitively the initial state of any given FEC to N or N′, as needed to build the work distribution for each state. Instead, we used a probabilistic approach, determining the relative likelihood that a given FEC unfolded from N or N′ based on its unfolding force, using the fits of the unfolding force distributions to Eq. 2: the likelihood that a curve with unfolding force F0 started in state N (or N′) was given by pN(or N′)(F0)/[pN(F0) + pN′(F0)]. We used this likelihood function to assign each curve to N or N′ while sampling a number of curves equal to the total number measured, with replacement, thereby generating the unfolding work distribution for each state for this sampling. We then calculated the free energy for unfolding from the Jarzynski equality44, ΔG = −kBT ln〈exp(−W/kBT)〉, where W is the work done. We corrected for the bias in the Jarzynski estimate54, using the weighted sum of the free energy values to calculate the dissipated work. We also corrected for the non-equilibrium populations of N and N′46, adding an additional energy −kBT ln(1/ϕ), where ϕ is the fraction of refolding curves that end in the state (N or N′) whose stability is being calculated. This procedure was then repeated 5000 times while resampling the curves and recalculating their assignments to N or N′, yielding the average values and standard deviation for ΔG reported for N and N′. All analysis was done using Igor Pro 7.08.
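The resampling procedure can be outlined in Python as below. This is a minimal sketch under stated assumptions, not the Igor Pro analysis used in this work: it takes precomputed work values and per-curve likelihoods pN(F0)/[pN(F0) + pN′(F0)] as inputs, assumes kBT = 4.11 pN nm, uses hypothetical function names, and omits the bias correction of the Jarzynski estimate54.

```python
import math
import random
import statistics

KBT = 4.11  # thermal energy in pN*nm (assumed value)

def jarzynski(works):
    """Jarzynski estimate -kBT ln<exp(-W/kBT)>, shifted by min(W) for stability."""
    m = min(works)
    acc = sum(math.exp(-(w - m) / KBT) for w in works)
    return m - KBT * math.log(acc / len(works))

def population_correction(phi):
    """Additional energy -kBT ln(1/phi) for a state reached in a fraction phi of refolding curves."""
    return -KBT * math.log(1.0 / phi)

def bootstrap_dG(works, prob_N, n_boot=5000, seed=0):
    """Resample curves with replacement and assign each draw to N or N'.

    prob_N[i] is the likelihood that curve i started in state N.
    Returns (mean, standard deviation) per state, or None if a state
    never received any curves.
    """
    rng = random.Random(seed)
    n = len(works)
    estimates = {"N": [], "N'": []}
    for _ in range(n_boot):
        sample = {"N": [], "N'": []}
        for _ in range(n):
            i = rng.randrange(n)
            state = "N" if rng.random() < prob_N[i] else "N'"
            sample[state].append(works[i])
        for state, w in sample.items():
            if w:
                estimates[state].append(jarzynski(w))
    return {state: (statistics.mean(v), statistics.pstdev(v)) if v else None
            for state, v in estimates.items()}
```

Because the exponential average is dominated by the lowest work values, the estimate for each state is sensitive to how the low-force curves are assigned, which is why the assignment is redrawn on every resampling.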
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The data supporting the findings of this study are available from the corresponding authors upon reasonable request. The raw data of this work55 have been deposited in Figshare (https://doi.org/10.6084/m9.figshare.14614176).
References
Kelly, J. A. et al. Structural and functional conservation of the programmed −1 ribosomal frameshift signal of SARS coronavirus 2 (SARS-CoV-2). J. Biol. Chem. 295, 10741–10748 (2020).
Brierley, I., Gilbert, R. J. C. & Pennell, S. Pseudoknot-dependent programmed −1 ribosomal frameshifting: structures, mechanisms and models. Recoding: Expansion Decoding Rules Enriches Gene Expr. 24, 149–174 (2009).
Atkins, J. F., Loughran, G., Bhatt, P. R., Firth, A. E. & Baranov, P. V. Ribosomal frameshifting and transcriptional slippage: from genetic steganography and cryptography to adventitious use. Nucleic Acids Res. 44, 7007–7078 (2016).
Dulude, D., Berchiche, Y. A., Gendron, K., Brakier-Gingras, L. & Heveker, N. Decreasing the frameshift efficiency translates into an equivalent reduction of the replication of the human immunodeficiency virus type 1. Virology 345, 127–136 (2006).
Plant, E. P., Rakauskaite, R., Taylor, D. R. & Dinman, J. D. Achieving a golden mean: mechanisms by which coronaviruses ensure synthesis of the correct stoichiometric ratios of viral proteins. J. Virol. 84, 4330–4340 (2010).
Plant, E. P., Sims, A. C., Baric, R. S., Dinman, J. D. & Taylor, D. R. Altering SARS Coronavirus frameshift efficiency affects genomic and subgenomic RNA production. Viruses 5, 279–294 (2013).
Belew, A. T. & Dinman, J. D. Cell cycle control (and more) by programmed −1 ribosomal frameshifting: implications for disease and therapeutics. Cell Cycle 14, 172–178 (2015).
Park, S.-J., Kim, Y.-G. & Park, H.-J. Identification of RNA pseudoknot-binding ligand that inhibits the −1 ribosomal frameshifting of SARS-Coronavirus by structure-based virtual screening. J. Am. Chem. Soc. 133, 10094–10100 (2011).
Hilimire, T. A. et al. HIV-1 frameshift RNA-targeted triazoles inhibit propagation of replication-competent and multi-drug-resistant HIV in human cells. ACS Chem. Biol. 12, 1674–1682 (2017).
Zhang, K. et al. Cryo-electron microscopy and exploratory antisense targeting of the 28-kDa frameshift stimulation element from the SARS-CoV-2 RNA genome. bioRxiv 2020.07.18.209270. https://doi.org/10.1101/2020.07.18.209270 (2020).
Neupane, K. et al. Anti-frameshifting ligand active against SARS Coronavirus-2 is resistant to natural mutations of the frameshift-stimulatory pseudoknot. J. Mol. Biol. 432, 5843–5847 (2020).
Chen, Y. et al. A drug screening toolkit based on the –1 ribosomal frameshifting of SARS-CoV-2. Heliyon 6, e04793 (2020).
Sun, Y., Abriola, L., Surovtseva, Y. V., Lindenbach, B. D. & Guo, J. U. Restriction of SARS-CoV-2 replication by targeting programmed −1 ribosomal frameshifting. Proc. Natl Acad. Sci. U.S.A. 118, e2023051118 (2021).
Kelly, J. A., Woodside, M. T. & Dinman, J. D. Programmed −1 ribosomal frameshifting in coronaviruses: a therapeutic target. Virology 554, 75–82 (2021).
Bhatt, P. R. et al. Structural basis of ribosomal frameshifting during translation of the SARS-CoV-2 RNA genome. Science 372, 1306–1313 (2021).
Wacker, A. et al. Secondary structure determination of conserved SARS-CoV-2 RNA elements by NMR spectroscopy. Nucleic Acids Res. 48, 12415–12435 (2020).
Plant, E. P. & Dinman, J. D. The role of programmed −1 ribosomal frameshifting in coronavirus propagation. Front. Biosci. 13, 4873–4881 (2008).
Omar, S. I. et al. Modeling the structure of the frameshift-stimulatory pseudoknot in SARS-CoV-2 reveals multiple possible conformers. PLoS Comput. Biol. 17, e1008603 (2021).
Akiyama, B. M. et al. Zika virus produces noncoding RNAs using a multi-pseudoknot structure that confounds a cellular exonuclease. Science 354, 1148–1152 (2016).
Steckelberg, A.-L. et al. A folded viral noncoding RNA blocks host cell exoribonucleases through a conformationally dynamic RNA structure. Proc. Natl Acad. Sci. U.S.A. 115, 6404–6409 (2018).
Zhao, M. & Woodside, M. T. Mechanical strength of RNA knot in Zika virus protects against cellular defenses. Nat. Chem. Biol. https://doi.org/10.1038/s41589-021-00829-z (2021). in press.
Ritchie, D. B., Foster, D. A. N. & Woodside, M. T. Programmed −1 frameshifting efficiency correlates with RNA pseudoknot conformational plasticity, not resistance to mechanical unfolding. Proc. Natl Acad. Sci. U.S.A. 109, 16167–16172 (2012).
de Messieres, M. et al. Single-molecule measurements of the CCR5 mRNA unfolding pathways. Biophys. J. 106, 244–252 (2014).
Ritchie, D. B. et al. Conformational dynamics of the frameshift stimulatory structure in HIV-1. RNA 23, 1376–1384 (2017).
Halma, M. T. J., Ritchie, D. B., Cappellano, T. R., Neupane, K. & Woodside, M. T. Complex dynamics under tension in a high-efficiency frameshift stimulatory structure. Proc. Natl Acad. Sci. U.S.A. 116, 19500–19505 (2019).
Ritchie, D. B., Soong, J., Sikkema, W. K. A. & Woodside, M. T. Anti-frameshifting ligand reduces the conformational plasticity of the SARS virus pseudoknot. J. Am. Chem. Soc. 136, 2196–2199 (2014).
Halma, M. T. J., Ritchie, D. B. & Woodside, M. T. Conformational Shannon entropy of mRNA structures from force spectroscopy measurements predicts the efficiency of −1 programmed ribosomal frameshift stimulation. Phys. Rev. Lett. 126, 038102 (2021).
Qu, X. et al. The ribosome uses two active mechanisms to unwind messenger RNA during translation. Nature 475, 118–121 (2011).
Liu, T. et al. Direct measurement of the mechanical work during translocation by the ribosome. eLife 3, e03406 (2014).
Yan, S., Wen, J.-D., Bustamante, C. & Tinoco, I. Ribosome excursions during mRNA translocation mediate broad branching of frameshift pathways. Cell 160, 870–881 (2015).
Ritchie, D. B. & Woodside, M. T. Probing the structural dynamics of proteins and nucleic acids with optical tweezers. Curr. Opin. Struct. Biol. 34, 43–51 (2015).
Woodside, M. T. & Block, S. M. Reconstructing folding energy landscapes by single-molecule force spectroscopy. Annu. Rev. Biophys. 43, 19–39 (2014).
Woodside, M. T. et al. Nanomechanical measurements of the sequence-dependent folding landscapes of single nucleic acid hairpins. Proc. Natl Acad. Sci. U.S.A. 103, 6190–6195 (2006).
Chen, G., Chang, K.-Y., Chou, M.-Y., Bustamante, C. & Tinoco, I. Triplex structures in an RNA pseudoknot enhance mechanical stability and increase efficiency of –1 ribosomal frameshifting. Proc. Natl Acad. Sci. U.S.A. 106, 12706–12711 (2009).
Dudko, O. K., Hummer, G. & Szabo, A. Intrinsic rates and activation free energies from single-molecule pulling experiments. Phys. Rev. Lett. 96, 108101–108104 (2006).
Gupta, A. N. et al. Pharmacological chaperone reshapes the energy landscape for folding and aggregation of the prion protein. Nat. Commun. 7, 12058 (2016).
Pierse, C. A. & Dudko, O. K. Distinguishing signatures of multipathway conformational transitions. Phys. Rev. Lett. 118, 088101 (2017).
Neupane, K., Yu, H., Foster, D. A. N., Wang, F. & Woodside, M. T. Single-molecule force spectroscopy of the add adenine riboswitch relates folding to regulatory mechanism. Nucleic Acids Res. 39, 7677–7687 (2011).
Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Contr. 19, 716–723 (1974).
Chen, G., Wen, J.-D. & Tinoco, I. Single-molecule mechanical unfolding and folding of a pseudoknot in human telomerase RNA. RNA 13, 2175–2188 (2007).
Liphardt, J., Onoa, B., Smith, S. B., Tinoco, I. Jr. & Bustamante, C. Reversible unfolding of single RNA molecules by mechanical force. Science 292, 733–737 (2001).
Greenleaf, W. J., Frieda, K. L., Foster, D. A. N., Woodside, M. T. & Block, S. M. Direct observation of hierarchical folding in single riboswitch aptamers. Science 319, 630–633 (2008).
Li, P. T. X., Bustamante, C. & Tinoco, I. Unusual mechanical stability of a minimal RNA kissing complex. Proc. Natl Acad. Sci. U.S.A. 103, 15847–15852 (2006).
Jarzynski, C. Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 78, 2690–2693 (1997).
Liphardt, J., Dumont, S., Smith, S. B., Tinoco, I. Jr. & Bustamante, C. Equilibrium information from nonequilibrium measurements in an experimental test of Jarzynski’s equality. Science 296, 1832–1835 (2002).
Alemany, A., Mossa, A., Junier, I. & Ritort, F. Experimental free-energy measurements of kinetic molecular states using fluctuation theorems. Nat. Phys. 8, 688–694 (2012).
Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31, 3406–3415 (2003).
Roca, J. et al. Monovalent ions modulate the flux through multiple folding pathways of an RNA pseudoknot. Proc. Natl Acad. Sci. U.S.A. 115, E7313–E7322 (2018).
Rook, M. S., Treiber, D. K. & Williamson, J. R. An optimal Mg2+ concentration for kinetic folding of the Tetrahymena ribozyme. Proc. Natl Acad. Sci. U.S.A. 96, 12471–12476 (1999).
Lyon, K., Aguilera, L. U., Morisaki, T., Munsky, B. & Stasevich, T. J. Live-cell single RNA imaging reveals bursts of translational frameshifting. Mol. Cell 75, 172–183.e9 (2019).
Manfredonia, I. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
Neuman, K. C. & Block, S. M. Optical trapping. Rev. Sci. Instrum. 75, 2787–2809 (2004).
Wang, M. D., Yin, H., Landick, R., Gelles, J. & Block, S. M. Stretching DNA with optical tweezers. Biophys. J. 72, 1335–1346 (1997).
Gore, J., Ritort, F. & Bustamante, C. Bias and error in estimates of equilibrium free-energy differences from nonequilibrium measurements. Proc. Natl Acad. Sci. U.S.A. 100, 12564–12569 (2003).
Neupane, K. et al. Single-molecule force spectroscopy of unfolding and refolding of the frameshift-stimulatory RNA pseudoknot from SARS-CoV-2. Figshare https://doi.org/10.6084/m9.figshare.14614176.
Acknowledgements
This work was supported by the Canadian Institutes of Health Research grant reference number OV3-170709, Alberta Innovates, and National Research Council Canada.
Author information
Contributions
K.N., M.Z., and M.T.W. designed the research; K.N., S.M., D.B.R., and S.M.I. provided reagents; K.N., M.Z., A.L., N.Q.H., and A.N. performed experiments; K.N., M.Z., A.L., and M.T.W. analyzed data; K.N., M.Z., and M.T.W. wrote the manuscript; and all authors edited the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Nature Communications thanks Felix Ritort and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Neupane, K., Zhao, M., Lyons, A. et al. Structural dynamics of single SARS-CoV-2 pseudoknot molecules reveal topologically distinct conformers. Nat Commun 12, 4749 (2021). https://doi.org/10.1038/s41467-021-25085-6
DOI: https://doi.org/10.1038/s41467-021-25085-6