Systematic detection of tertiary structural modules in large RNAs and RNP interfaces by Tb-seq

Patel, Shivali; Sexton, Alec N.; Strine, Madison S.; Wilen, Craig B.; Simon, Matthew D.; Pyle, Anna Marie

doi:10.1038/s41467-023-38623-1

Download PDF

Article
Open access
Published: 09 June 2023

Systematic detection of tertiary structural modules in large RNAs and RNP interfaces by Tb-seq

Nature Communications volume 14, Article number: 3426 (2023) Cite this article

2139 Accesses
3 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Compact RNA structural motifs control many aspects of gene expression, but we lack methods for finding these structures in the vast expanse of multi-kilobase RNAs. To adopt specific 3-D shapes, many RNA modules must compress their RNA backbones together, bringing negatively charged phosphates into close proximity. This is often accomplished by recruiting multivalent cations (usually Mg²⁺), which stabilize these sites and neutralize regions of local negative charge. Coordinated lanthanide ions, such as terbium (III) (Tb³⁺), can also be recruited to these sites, where they induce efficient RNA cleavage, thereby revealing compact RNA 3-D modules. Until now, Tb³⁺ cleavage sites were monitored via low-throughput biochemical methods only applicable to small RNAs. Here we present Tb-seq, a high-throughput sequencing method for detecting compact tertiary structures in large RNAs. Tb-seq detects sharp backbone turns found in RNA tertiary structures and RNP interfaces, providing a way to scan transcriptomes for stable structural modules and potential riboregulatory motifs.

Global in situ profiling of RNA-RNA spatial interactions with RIC-seq

Article 21 May 2021

Changchang Cao, Zhaokui Cai, … Yuanchao Xue

RNA structure probing uncovers RNA structure-dependent biological functions

Article 25 June 2021

Xi-Wen Wang, Chu-Xiao Liu, … Qiangfeng Cliff Zhang

Keth-seq for transcriptome-wide RNA structure mapping

Article 03 February 2020

Xiaocheng Weng, Jing Gong, … Chuan He

Introduction

RNAs can adopt complex folded motifs and higher-order 3-D structures that are essential across a variety of specific cellular processes^1,2,3,4,5. It has recently become clear that many types of multi-kilobase RNA transcripts contain regions of tertiary structure that, either alone or in concert with protein partners, carry out biological function^6,7. However, identifying these regions of complex RNA structure remains challenging. Current structure prediction methods on long RNAs are unable to pinpoint regions containing stable RNA tertiary structure modules or complex protein binding sites from sequence alone⁸. While biophysical techniques such as NMR⁹, x-ray crystallography¹⁰ and cryo-EM¹¹ are invaluable tools for the observation of RNA structure, they are time-consuming and difficult to perform on a multikilobase-length RNA that contains a mixture of both structured and flexible regions. As our understanding of their biological functions becomes increasingly important, and interest in small molecule targeting of RNAs grows, it is vital to develop tools for identifying regions of tertiary structure in long RNA molecules.

In recent years, chemical probing has become a powerful tool for studying RNA structure. Many important advances have improved our ability to identify single- versus double-stranded nucleotides in RNA^{12,13,14,15,16}, and these data have primarily been used to infer secondary but not tertiary structures of RNA. Fewer methods have been developed to detect higher order structure and these protocols are limited to an assessment of solvent accessible regions^17,18,19 or identification of long-range RNA-RNA base-pairs by cross-linking methods^20,21 or statistical correlations in mutational profiling^22,23. The field would benefit from a readily adaptable, high throughput approach for identifying regions of local tertiary structure, which are often hallmarks of functional RNA motifs and riboregulatory elements.

High-resolution RNA structures show that regions of tightly packed tertiary structure often contain phosphate backbones that are packed in close proximity, within the same strand or on adjacent strands. These local regions of intense negative electrostatic potential act as sinks for multivalent ion coordination^24,25 (Fig. 1a). One way to probe these electrostatically negative reservoirs is to monitor the cleavage patterns catalyzed by coordinated metal ions²⁶. When nucleotides in such regions adopt an “in-line geometry”, which aligns an upstream 2’-OH with the downstream 3’-OR group of a phosphodiester linkage²⁷, adjacent metal hydroxide ions can behave as a general base, deprotonating the 2’-OH group and producing a 2’ oxyanion nucleophile that attacks the adjacent phosphate and causes strand scission²⁷. While this type of Mg²⁺-catalyzed cleavage (known as in-line probing²⁸) normally occurs on a slow timescale that ranges from hours to days²⁸, the same phenomenon is greatly accelerated by lanthanide ions such as terbium (Tb³⁺) and europium (Eu³⁺)^26,29. Tb³⁺ and Mg²⁺ share similar ionic radii (0.92 Å and 0.72 Å, respectively)²⁶ and coordination geometry preferences for oxygen³⁰, but lanthanide ions have an additional positive charge and the pKa of coordinated water molecules is much lower for ions such as Tb³⁺, (pKa ~7.9 for Tb³⁺-H₂O versus ~11.4 for Mg²⁺-H₂O)^30,31. Therefore, at low ion concentrations and neutral pH, Tb³⁺ coordinates with structured RNA binding sites in a manner that is similar to that of Mg²⁺ ^32,33,34,35_, but the more potent Tb³⁺ general base rapidly facilitates RNA backbone cleavage at sites of metal ion binding³¹ (Fig. 1b). Tb³⁺ probing of RNA has been used extensively in the past, but until now, it was a low-throughput method that relied on electrophoretic quantification^30,32,36.

**Fig. 1: Developing a sequencing-based approach to detect Tb³⁺ cleavage sites.**

Here we present Tb-seq, a sequencing-based approach that employs Tb³⁺ to detect regions of tertiary structure in long RNAs. To demonstrate the efficacy of this technique, we first apply the Tb-seq pipeline to identify tertiary structural motifs within structurally well-characterized RNA molecules. We then apply it to probe known and unknown RNA structures in a cellular context to investigate RNA motifs and protein binding sites. These studies show that Tb-seq detects regions of RNA involved in RNA tertiary structure motifs and within RNP complexes, thereby providing a powerful approach for pinpointing regions of complex RNA structure that are potentially associated with RNA functional elements.

Results

Developing a high throughput sequencing-based approach to detect Tb³⁺ cleavage sites

To precisely identify tertiary RNA structural elements in a high throughput manner, we adapted a previously established Tb³⁺ RNA cleavage assay for accurate single nucleotide detection in an RNA of interest. In the classical version of this experiment, the RNA of interest is end-labeled with ³²P, probed with Tb³⁺ and the sites of hydrolysis are visualized after electrophoresis of the RNA^30,32,33,36. To adapt this assay to a sequencing readout, we first determined if the expected Tb³⁺cleavage sites could be detected as termination events upon reverse transcription (RT) with a processive reverse transcriptase, MarathonRT³⁷. We used the D135 ribozyme derived from yeast group II intron aI5γ, which has been extensively characterized using the classical version of Tb³⁺ probing³⁶. We found that reverse transcription stops (Fig. 1c) recapitulate the previously-published Tb³⁺ cleavage pattern (Fig. 1d), thereby validating RT as a tool to detect Tb³⁺-induced cleavage. We then adapted this approach for NGS sequencing. Specifically, a Tb³⁺ cleaved RNA or an untreated RNA is reverse transcribed with a gene-specific RT primer containing a 5’ adapter handle. The resulting cDNA is 3’ adapter-ligated and PCR amplified to add Illumina multiplex handles. We then implemented a previously-developed pipeline for assessing RT termination events³⁸ to quantify termination (Fig. 1e). This sequencing and analysis approach, Tb-seq, recapitulated the previously published D135 Tb³⁺ cut sites. (Supplementary Fig. 1). To better understand whether Tb-seq could be used as a discovery tool for assessing higher-order RNA structure in a variety of RNA types, we next applied this method to evaluate the patterns of cleavage in RNAs with well-determined tertiary structures.

Tb-seq reveals well-folded RNA tertiary elements

To benchmark Tb-seq on RNAs that have never been analyzed with Tb³⁺ cleavage before, we probed in vitro transcribed RNAs that contain both well-folded RNA tertiary elements and known metal sites. We chose a group II intron from Oceanobacillus iheyensis (O.i.) that has been well characterized biochemically and crystallographically³⁹. First, reagent concentrations and reaction times were optimized to obtain an ideal reactivity signal and ensure the RNA is not over-cleaved (Supplementary Fig. 2). We performed Tb-seq using a range of Tb³⁺ concentrations from 0.01 mM - 2 mM for 10 min in order to evaluate the intensity and location of cleavage patterns (Supplementary Fig. 3). We observe that the cleavage signal is abolished if the O.i. intron is denatured prior to probing, supporting our interpretation that cleavage signals are indicators of RNA structure. To determine whether secondary structure alone is sufficient to produce the cleavage pattern, the intron was folded only in the presence of monovalent ions, under conditions lacking the magnesium ions known to promote its characteristic tertiary structure⁴⁰ (Supplementary Fig. 4A). We found that secondary structure was insufficient to establish the signals, supporting our interpretation that Tb-seq signals correspond to sites of tertiary structure. Instead, we found that at certain Tb³⁺ concentrations (0.5 mM), non-specific cleavage is observed (Supplementary Fig. 4B). These results demonstrate that a correctly folded intron-containing well-defined tertiary elements is required for Tb³⁺ coordination and site-specific RNA cleavage.

Next, we established a three-point criteria set for selecting nucleotide stop sites that are likely to result from specific, site-bound Tb³⁺-dependent cleavage, which we will call “strong Tb signal”. First, a reactivity value of >0.5 was established for detecting strong sites of cleavage and maximizing probe specificity (see Methods). Second, these sites must be observed in two independent replicates to demonstrate reproducibility. Third, selected sites must show a dependence of signal on Tb³⁺ concentration to ensure that stop signals are not due to spontaneous RT termination events. Nucleotide sites that satisfy these criteria are highlighted in red in the secondary structure diagram of the O.i. intron (Fig. 2b). Upon initial inspection, we observed that the strongest Tb³⁺ sites are in short-loop regions within the RNA secondary structure. Upon close inspection, it became clear that these cleavage sites fall within or are adjacent to the most evolutionarily conserved long-range RNA tertiary interactions that are essential for correctly folding the ribozyme (annotated by Greek letters, Fig. 2b)⁴¹. To further understand the conformation of these sites in 3-D space, the Tb³⁺ signal was visualized on the crystal structure of O.i. intron (Fig. 2a). We found that Tb³⁺ causes backbone cleavage at regions where the phosphate backbone compresses together to form sharp, stable turns. These turns are all components of RNA tertiary motifs required for the correct folding of the active ribozyme.

**Fig. 2: Tb-seq of *O.i*. intron detects long-range, evolutionarily conserved RNA-RNA interactions.**

To examine the sites of cleavage in greater detail, we focus on two regions that are specifically recognized by Tb³⁺. The first is the ζ-ζ’ tetraloop-receptor interaction, which is among the best-characterized and most important interactions for positioning catalytic intron domain 5 (D5)⁴¹. Here, a single G236 residue in D1 flips out of a sharp backbone turn and base-stacks with A370 in D5 (Fig. 2b top insert). We observe strong Tb-seq signals for the nucleotides in this bulge (234-237) that mediates the ζ-ζ’ tetraloop interaction. The second motif, λ-λ’ is within the z-anchor, a module that forms multiple higher-order structures and serves as a scaffold for properly positioning the 5’ splice site. Notably, strong Tb-seq signal is observed in A106 in D1, which forms a minor groove base triple with nucleotides C267 and G374 in D5 (Fig. 2b bottom insert). These results demonstrate that Tb³⁺ detects functionally important interactions in group II introns where RNA phosphate backbones come into very close proximity, thereby allowing for multi-helix base stacking and long-range interactions.

To further test and expand Tb-seq, we probed another class of RNA that contains a well-defined tertiary structure. For this we chose the Hepatitis C. Virus (HCV) internal ribosome entry site (IRES), specifically focusing on domain II, which has well-characterized structural features identified by both cryoEM⁴² and NMR⁴³. Implementing the criteria described above, we observed strong Tb³⁺ signal clustering in two regions. The first is a loop region containing nucleotides 92-95, where the phosphate backbones kink and come into close proximity (Fig. 3). The second region is near nucleotides 52-54, where the phosphate backbone forms a nearly 90˚ bend in the RNA (Fig. 3). This bend is implicated in the positioning of the downstream terminal loop near the 40 S E site of the ribosome, which allows for translation of viral proteins^42,44. Interestingly, this region has been targeted by functional inhibition studies where multiple small molecules bind and structurally extend the bend into an elongated conformation, inhibiting viral translation^45,46. Together these results indicate terbium probing can detect functionally important structures in RNAs, allowing it to be used as a screening tool for identifying regions that are likely to contain compact motifs.

**Fig. 3: Tb-seq of HCV IRES detects conserved L-shaped bend in stem loop II.**

Tb-seq detects key RNA-protein interactions in a cellular context

Having established the versatility of Tb-seq on RNAs that have been in-vitro transcribed, we sought to extend it to cellular contexts, where RNA can fold together with proteins, small molecule ligands and other nucleic acids⁷. We decided to conduct the first experiments on a structurally well-defined cellular RNA with known protein binding sites. To this end, we probed human RNase P in order to understand how terbium can be used to reveal higher-order RNA structural motifs in that stable RNP. To circumvent the issues of introducing lanthanide ions into cells, we developed an approach for gently lysing mammalian cells in a way that maintains intact RNA-Protein (RNP) complexes (Supplementary Fig. 5A). We then treated the resulting extract with Tb³⁺ and implemented the Tb-seq pipeline, using the criteria we established for identifying strong sites of specific Tb³⁺ cleavage (Supplementary Fig. 5B). By comparing the Tb-seq signal with the cryo-EM structure of human RNase P H1 RNA⁴⁷, we observe that the strongest cleavage sites are found in regions where the RNA backbone bends sharply, notably at the top and bottom of the H1 RNA (nt 47–50 and 169–173; Fig. 4a).

**Fig. 4: Probing RNA-Protein interactions in human RNase P.**

Human RNase P consists of ten protein components that wrap around and bind the H1 RNA at multiple regions, presumably stabilizing its elongated conformation⁴⁷ (Fig. 4a). While a number of sites are observed, here we highlight two examples where Tb-seq reveals regions containing critical RNA-protein interactions. The first is a backbone turn located in the loop of stem P9 (Fig. 4a, bottom insert). The bases of nucleotides C125 and U126 form hydrogen bonding interactions with the side chains of the essential core protein, Rpp29. This protein makes multiple contacts with stem P9 and P1, bringing them together in close proximity and stabilizing the downstream helical core of the H1 RNA, which recognizes the 5’ end of pre-tRNA for cleavage. The second site of strong Tb-seq signal is observed in the loop region of stem P3. Here, the backbone, bases, and sugars of the nucleotides targeted by Tb³⁺ (C61, C63, A64, U65), form networks of hydrogen bonds with proteins Rpp20 and Rpp30b (Fig. 4a, top insert). In this context, Tb-seq signals correspond to exposed regions of the RNA, which form structural motifs that are stabilized by protein interactions within RNase P.

To further explore the ability of Tb-seq to reveal RNP interactions and to understand the role of the protein in Tb³⁺ detection at these sites, we used Tb³⁺ to probe human RNase P in the absence of proteins. To this end, Tb³⁺ cleavage was conducted on cell lysates that were treated with a proteolytic enzyme (Proteinase-K), which strips proteins from RNA. (Supplementary Fig. 6A). As in studies with other chemical probes⁴⁸, we then performed a differential reactivity comparison, termed ∆ Tb, to compare changes in H1 RNA structure in the presence and absence of proteins (Fig. 4b and Supplementary Fig. 6B). Consistent with a disruption of a stabilizing protein interaction, the two regions described above become less reactive in the absence of proteins (show a loss in terbium reactivity). By contrast, other nucleotides become more reactive after proteinase K treatment (see stem P3, Fig. 4b), which may result from conformational rearrangement that occurs in the absence of proteins. These data suggest that ∆ Tb detects modules of protein-stabilized RNA structures within RNase P, thereby broadening the applicability of this method to probing of RNP interfaces.

Tb-seq reveals modules of higher-order structure in viral RNAs

Having validated Tb-seq as an RNA tertiary structure probe, we sought to apply it to discover previously unknown RNA structures in multi-kilobase RNAs, such as long viral RNA genomes. Numerous studies have demonstrated that viral RNA genomes contain secondary and tertiary structures both in the UTRs and coding regions that are important for function^49,50,51. Indeed, we utilized Tb-seq to detect functional RNA structures within the HCV IRES (Fig. 3). Given the urgency of detecting functional RNA elements within SARS-CoV-2 RNA^49,52 and the limited tools available to detect them, we performed cell lysate Tb-seq in SARS-CoV-2 infected cells. We specifically examined the 5’-terminal 1400nt of the RNA genome, which contains the 5’UTR, the coding region of Nsp1 and part of the Nsp2 ORF.

Inspection of the Tb-seq signal profile reveals a distinct cleavage pattern that is characterized by clusters of consecutive cleaved nucleotides (Supplementary Fig. 7). This signal profile resembles that obtained when probing ribozymes, suggesting a high degree of 3-D structure in the genome. Overlaying these sites onto the predicted secondary structure⁴⁹, we observe strong Tb³⁺ signals in both the UTR and coding region of the genome. Upon closer inspection, we find the majority of Tb-seq signals in small stem-loop/bulge regions, implicating these regions as modules of compact RNA structure (Fig. 5).

**Fig. 5: Tb-seq identifies structural modules in SARS-CoV-2.**

To further understand the role of protein occupancy on this structured genome and to narrow down sites of potentially functional RNA modules, we probed in the absence of proteins and implemented the ∆ Tb pipeline. Numerous changes are observed in the absence of protein, indicating a global conformational change in the architecture of the genome (Supplementary Fig. 8). At some sites, the reactivity signal increases, implicating a conformational change in RNA tertiary structure or new backbone accessibility in the absence of proteins. By contrast, there are other sites that become less reactive upon the release of proteins (Fig. 5 inserts). Given our findings with probing RNase P, these sites are likely to represent structural modules containing a sharp backbone bend that is stabilized by protein components. The limited proteomic information on the SARS-CoV-2 genome makes it difficult to assess specific interaction partners. Nevertheless, together these data underscore the utility of combinatorial Tb-seq for narrowing down structural modules and providing a course-grained roadmap of candidate functional elements within a viral genome.

Discussion

As biologists explore the growing landscape of biologically important multi-kilobase RNAs, such as viral genomes, unprocessed mRNAs, primary miRNAs and long noncoding RNAs, tools are needed that will enable researchers to focus their attention on specific regions of RNA for detailed functional analysis. The Tb-seq pipeline presented here provides one such filter, yielding valuable information about structurally compact local RNA motifs that differs from the information reflected in other probes of secondary and tertiary structure. In addition, by using the ∆ Tb probing strategy, and probing in the presence and absence of protein components, one can narrow down tertiary structures that undergo protein-dependent conformational differences. Ultimately, integrating Tb-seq with orthogonal chemical probes, pull-down methods^53,54, cross-linking agents^20,21 and functional assays will allow for a comprehensive mechanistic understanding of individual RNA molecules.

With recent technological advances, it is now possible to determine high-resolution structures of large RNAs⁵⁵. However, multi-kilobase RNAs cannot be visualized in their entirety using these approaches. Most biologically relevant transcripts contain modules of compact structure along with regions that are conformationally flexible^56,57,58. For this reason, most RNAs are amenable to high-resolution structure determination only after careful study of their overall structural landscape. This requires a methodical approach for identifying RNA regions and RNP substructures that can be visualized with powerful tools such as cryo-EM and SAXS. In addition, there are many cases where one must rationally design or isolate stable motifs of RNA and/or RNP complexes. Here we provide a way to identify the most structurally compact regions of a large RNA and, in tandem with other long-range probing methods, choose the best regions for high-resolution investigation.

Performing Tb-seq on RNAs with known structures provided a useful starting point for assessing the types of RNA motifs that are recognized and cleaved by Tb³⁺. We initially attempted to correlate Tb-seq signals with specific torsion angles and with atomic distance vectors between different base and backbone atoms, but no clear correlation emerged. In order to develop a less fine-grained structural correlation metric, we visualized the structures and noted that most Tb³⁺ cleavage sites occur in regions where multiple phosphate backbone residues pinch together in close proximity. To reflect this, we computed a metric for assessing the “sharpness” of turns in the RNA backbone at Tb³⁺ cleavage sites, deriving our values from high-resolution structures of the O.i. intron³⁹. Specifically, we measured the backbone phosphate distances between nucleotide n to nucleotide n + 2 at sites displaying strong Tb-seq signals (P_n -> P_n+2, or every other phosphate) and compared these data to the corresponding distances in a simple helical structure within domain 4 of the intron (Supplementary Fig. 9). We found that RNA regions with strong Tb-seq signals tend to have very small P_n -> P_n+2 values (5.5 - 9.1 Å) relative to the same distances calculated from a simple helix (9.6-12.5 Å), indicating local compression of the RNA backbone. We speculate that additional data will allow for the refinement of these parameters. For example, while some Tb³⁺ cleavage sites, such as those in region 7, are not characterized by small P_n -> P_n+2 values, visual inspection of the structure shows that these same nucleotides are part of a larger motif in 3-D space that contains adjacent pinched backbones that are characterized by strong Tb-seq signatures and small P_n -> P_n+2 values (in region 5). Therefore, the same bound metal ion may be catalyzing both cleavage events. Additional data and analysis will enable a more quantitative description of specific structural features that are recognized and cleaved by Tb³⁺.

There are limitations in our ability to interpret the structural significance of Tb³⁺ cleavage signals in RNA probing studies. First of all, metal ions like Tb³⁺ cannot stimulate the cleavage of an RNA backbone unless the 2’-hydroxyl nucleophile adopts a precisely in-line geometry capable of phosphodiester cleavage. Certain architectural environments may constrain local structure such that in-line attack is not possible, and it is therefore likely that some Tb³⁺-bound sites remain uncleaved by the probe, resulting in false-negative data. Furthermore, it is not possible to interpret a Tb³⁺ cleavage site as reflective of a particular type of RNA structural motif. This is because there is simply not enough information on site-bound metal ions in the available RNA structural database. Although it is hoped that this may grow with time, the emergence of RNA cryo-EM structure determination (which cannot unambiguously assign metal ion sites) and the decline in the prevalence of crystallographically-determined structures (where metals can be explicitly assigned), indicates that progress will be slow, making it all the more important that additional orthogonal information on metal sites in RNA is obtained using other methods. As discussed in the previous paragraph, additional data will be necessary to understand the precise structural features that result in cleavage by Tb³⁺. That said, it is reasonable to claim that Tb³⁺ cleavages reflect metal ion binding to sites of high electronegative potential that result from the close-packed RNA backbone atoms that are prevalent within RNA tertiary structures and RNP complexes.

The human transcriptome contains a vast set of large, complex RNA molecules, and until recently, we have lacked the tools to assess their 3-D structural content. However, the biochemical methods that were initially developed to study tRNAs, riboswitches and ribozymes are being gradually being adapted to explore the growing repertoire of multi-kilobase RNAs that are central to gene expression and pathogenicity. Here we present a much-needed expansion of the RNA probing toolbox that allows investigators to rapidly pinpoint candidate RNA tertiary structures efficiently and precisely, paving the way for downstream mechanistic study and therapeutic targeting.

Methods

In vitro transcription and purification

The in vitro transcriptions of aI5γ D135⁵⁹ and Oceanobacillus iheyensis (O.i.) group II intron D1-5³⁹, RNA were carried using T7 RNA polymerase⁶⁰ in a buffer containing 12 mM MgCl₂, 40 mM Tris-Cl pH8, 2 mM Spermidine, 10 mM NaCl, 0.01% Triton X-100, 10 mM DTT, 5 μl SUPERase-In and 3.6 mM of each NTP, in a procedure adapted from previous work⁶¹. The reactions were incubated at 37 °C for 2 hours. Thereafter, 4U of TURBO DNase was added and the mixture was incubated at 37 °C for 30 min. To chelate excess divalent ions, 5 μl of 0.5 M EDTA was added. Transcription products were gel purified on a denaturing 5% polyacrylamide gel and eluted overnight at 4 °C in a gel elution buffer (10 mM MOPS-NaOH pH 6.0, 300 mM NaCl and 1 mM EDTA). The RNA was ethanol precipitated and resuspended in ME buffer (6 mM K-MES pH 6.0, 0.1 mM EDTA). The in vitro transcription of the full-length HCV genome (JC1)^51,62 was performed as described above. The transcribed RNA was buffer exchanged into a filtration buffer (50 mM HEPES-KOH pH7.2 and 150 mM KCl using 50-kDa Amicon Ultra filtration columns. The RNA was purified by size exclusion chromatography at room temperature using a self-packed 24 ml Sephacryl S-1000 column equilibrated with filtration buffer. RNA from the peak fraction was used for subsequent folding and probing.

RNA folding and Tb³⁺ probing

For D135, Tb³⁺ cleavage was performed using two approaches. The first was direct visualization of Tb³⁺ mediated RNA cleavage by electrophoresis³⁶. In-vitro transcribed D135 was dephosphorylated using Antarctic phosphatase and 5′ end-labeled with [γ-³²P] ATP using T4 polynucleotide kinase according to manufacturer’s instructions followed by purification on a denaturing 5% polyacrylamide gel. Thereafter, 3 nM of ³²P-labeled RNA and 1 μg of unlabeled RNA were mixed in a monovalent buffer containing 50 mM MOPS pH7 and 500 mM KCl to a final volume of 18 μl. For visualization of Tb³⁺ mediated cleavage sites by reverse transcription and sequencing 1 μg of unlabeled RNA was used. For all reactions, the mixture was heated up to 90 °C for 1 min and cooled at room temperature for 2 min. Thereafter, 2 μl of 1 M MgCl₂ (final concentration 100 mM) was added and folded at 37 °C for 30 min. Subsequently, probing was performed by incubating 18 μl of the folded RNA with 10X TbCl₃ stocks prepared in the monovalent buffer (final 1x concentration from 0.01mM-2mM TbCl₃) or 2 μl of monovalent buffer (negative control) for 40 min at 25 °C. For the time-course experiments, probing was performed at the indicated times. All reactions were quenched with the addition of 3 μl of 50 mM EDTA pH 8 and precipitated by adding 1/10 volume of Na-Acetate (3 M, pH 5.2), 0.5 μl of glycogen (Invitrogen) and three volumes of ethanol. RNAs were resuspended in 4 μl of loading buffer (82 % (v/v) deionized formamide, 0.16 % (w/v) xylene cyanol (XC), 0.16 % (w/v) bromophenol blue (BB), 10 mM EDTA, pH 8.0) and resolved on a denaturing 5% polyacrylamide gel. The gel was dried, exposed to phosphor screens overnight and scanned using a Typhoon FLA9500 phosphorimager (GE Healthcare) or Typhoon RGB Biomolecular imager (Cytiva).

For Tb³⁺ probing of O.i. and HCV, 1 μg of RNA was diluted in their respective monovalent ion buffers (50 mM HEPES pH7 and 150 mM KCl for O.i. or 50 mM HEPES pH7.2 and 150 mM KCl for HCV) to a final volume of 18 μl. Thereafter, 2 μl of 100 mM MgCl₂ (final concentration 10 mM) was added and incubated at 37 °C for 30 min. Subsequently, probing was performed by incubating 18 μl of the folded RNA with 2 μl of 10X TbCl₃ stocks prepared in their respective monovalent ion buffers (final 1X concentration from 0.01mM-2mM) or 2 μl of respective monovalent buffer (negative control) for 10 min at 25 °C. Reactivities were compared under conditions where 0.5 mM TbCl₃ was employed and used in all figures unless indicated otherwise. All reactions were quenched with the addition of 3 μl of 50 mM EDTA pH 8. For the denaturing control, RNA was folded as described above but afterward, deionized formamide was added to a final concentration of 50%. The denatured RNA was probed with a final concentration of 0.5 mM TbCl₃. For the secondary structure control, the RNA was incubated in a monovalent buffer in the absence of MgCl₂ and probing was carried out at the indicated TbCl₃ concentrations. All RNA samples were cleaned up using a Zymo RNA clean and concentrator column according to the manufacturer’s instructions.

Native gel electrophoresis

Radiolabeled transcripts⁴⁰ were prepared as described above except, 1 mM UTP and 50 μCi of [α-³²P-UTP] were added to the transcription. The reaction was incubated at 37 °C for 2 h followed by purification on a denaturing 5% polyacrylamide gel. 5 nM of the radiolabeled transcript was spiked into the folding reaction described above. Reactions were mixed with 10% (v/v) glycerol containing, 0.16 % (w/v) xylene cyanol, 0.16 % (w/v) bromophenol blue and loaded onto a native 5% polyacrylamide gel.

Cell culture of human RNase P and SARS-CoV-2 infection

For in-cell studies of RNase P RNA structure, Huh7.5 cells (Cells were a gift from Brett Lindenbach. Cell line was generated as described in{Blight, 2002 #112}) were cultured in Dulbecco’s Modified Eagle Medium (DMEM w/o sodium pyruvate) that was supplemented with 10% heat-inactivated fetal bovine serum (FBS) and 1 mM non-essential amino acids. Cells were cultured to ~80% confluency (~5 × 10⁶ cells) in a 150 cm tissue culture-treated dish.

For studies of SARS-CoV-2 RNA, Huh7.5 cells were cultured in DMEM supplemented with 10% FBS and 1% Penn/Strep. Approximately 5 × 10⁶ cells were plated in each of the T150 tissue culture-treated flasks and incubated overnight at 37 °C/5% CO₂. The next day, media was removed and 5 × 10⁵ PFU (MOI ~ 0.1) of SARS-Related Coronavirus 2 Isolate USA/WA2020 (BEI Resources #NR-52281) was added to each flask in fresh media. Cells were incubated with virus inocula until three days post-infection (dpi).

Cell lysis probing

For all flasks the media was aspirated, cells were washed once with cold wash buffer (50 mM HEPES-KOH pH7.2, 150 mM NaCl, 3 mM KCl), and then dislodged in 2 ml of cold wash buffer with a cell scraper. The cells were collected and centrifuged at 200 g x 5 min at 4 °C. The supernatant was removed and the cells were resuspended in 2 ml lysis buffer (1% TritonX-100 50 mM HEPES-KOH, pH7.2, 150 mM KCl, 18 mM NaCl, 1 mM MgCl₂, 1 mM CaCl₂, 30ul SUPERase-In (20U/μl) and 1x cOmplete Protease Inhibitor Cocktail EDTA-free. To 250 μl of resuspended cells, 50/μl of Turbo DNase (2U/μl) was added and the mixture was incubated at 37 °C for 20 min. For cell lysis+Proteinase-K probing experiments, cells were prepared, lysed and DNase digested as described above, but the lysis buffer did not contain protease inhibitor. Subsequently, 25 μl of 20 mg/ml Proteinase-K was added to each 250 μl of lysed cells and the mixture was incubated at 37 °C for an additional 20 min.

All reactions were centrifuged at 200 g x 15 sec. Probing was performed by incubating 225 μl of supernatant with 25 μl of freshly made 10x TbCl₃ (final 1x concentrations from 0mM-5mM, prepared in wash buffer). The reactions were immediately placed on a rocker and allowed to incubate at 25 °C for 10 min before quenching with 20 μl of 0.1 M EDTA. RNA was extracted using Trizol according to the manufacturer’s instructions. For experiments involving RNase P, total RNA was ribosome depleted using a Ribominus kit that was used according to the manufacturer’s protocol with the following exception: the ribodepleted supernatant was purified using a Zymo RNA clean and a concentrator to retain RNAs that are greater than 17 nucleotides in size. For experiments involving SARS-CoV-2, total RNA was cleaned using a Zymo RNA clean and concentrator column.

Reverse transcription (RT)

For each probing condition, 1-4 μg of in vitro transcribed or cellular RNA was mixed with 1-2pmol of gene-specific primers (Supplementary Table 1) and brought to a volume of 7 μl. To anneal primers, the mixture was heated at 90 °C for 1 min followed by 30 °C for 2 min. To initiate reverse transcription, 2ul of Marathon RT³⁷ (can be obtained from Kerafast), 10 μl of 2x MarathonRT buffer (100 mM Tris-HCl pH 8.3, 400 mM KCl, 4 mM MgCl2, 10 mM DTT and 40% glycerol), 1 μl of 10 mM dNTP mix (NEB) were added and incubated at 42 °C for 30 min. RNA was degraded with the addition of 1 μl of 3 M KOH, heated to 95 °C for 5 min and snap cooled to 4 °C for 5 min. Thereafter, 1 μl of 3 M HCl was added to neutralize the reaction. For primer extension reactions that would be visualized using electrophoresis, reverse transcription was carried out as described, but using a ³²P-labeled primer. The primer was labeled at the 5-end using T4 PNK according to manufactures instructions and purified on a denaturing 12% polyacrylamide gel. After reverse transcription, the cDNA was ethanol precipitated at −20 °C overnight. The cDNA pellets were dissolved in 5 μl of loading buffer (82% deionized formamide, 10 mM EDTA pH 8, 0.2% xylene cyanol and bromophenol blue) and resolved on a denaturing 5% polyacrylamide gel. The gel was dried, exposed to phosphor screens overnight and scanned using a Typhoon RGB Biomolecular imager (Cytiva). For ladder generation, RT was carried out using a Thermo Sequenase cycling kit according to the manufacturer’s instructions with an input of 500 ng of the template.

Sequencing library preparation

The cDNA products from reverse transcription were purified using AMPure XP beads by adding a 1.2x bead to sample ratio and incubating at room temperature for 10 min. The beads were captured using a magnetic rack for 5 min and washed 3 times with 180 μl of fresh 80% ETOH. The beads were air-dried for 5 min and resuspended in 12 μl of water to elute the cDNA. Thereafter, 3’ adaptor ligation was performed by mixing 8 μl of purified cDNA with 0.2 μl of 50 μM 3’ adaptor (Supplementary Table 1), 1 μl of T4 RNA ligase (NEB), 1 μl of 10 mM ATP, 2 μl of T4 RNA Ligase buffer and8μl of 50% PEG 8000. To reduce ligation bias and barcode the RNA, the ligating adapter contained a random hexamer (NNNNNN) at the 5’ end. The mixture was incubated at 25 °C for 16 h, followed by enzyme deactivation at 65 °C for 15 min. Ligated products were purified with AMPure XP beads using a 1.2x bead to sample ratio. The products were PCR amplified 4-12 cycles with Q5 HF DNA polymerase using Illumina TruSeq forward primer and indexed reverse primers (NEB Next Multiplex Oligos) (Supplementary Table 1), with cycle times of 98 °C for 10 sec, 62 °C for 45 sec, and 72 °C for 60 sec. PCR products were purified with 1.2x volume of AMPure XP beads. Library concentrations were determined using a Qubit dsDNA HS Assay Kit and a BioAnalyzer High Sensitivity DNA Analysis. Libraries were diluted, pooled and sequenced using a NextSeq 500/550 or NextSeq 2000 platform.

Tb-seq data analysis

All FASTQ files were processed using Cutadapt (v1.9.1) to remove Illumina adapter sequences and then aligned to the respective RNA sequence using HISAT2 (v2.10). Stop information was extracted using RTEventsCounter.py script³⁸. The probability of stop per nucleotide was calculated as the number of stops divided by the sum of the total number of read-through events plus the number of stops (Eq. (1)). Probabilities were background subtracted against a no-probe control (Eq. (2)). Only nucleotides that contained more than 10,000 read-throughs were considered. To better compare probing experiments conducted in different contexts, including in vitro and in cell conditions where efficiencies of cleavage might differ, values were normalized to the top 10^th percentile of stop rates, then scaled from 1-8 (termed “reactivity,” below, based on⁶³).

$$P({stop})=\frac{{n}_{{stop}}}{{n}_{{stop}}+{n}_{{read}-{through}}}$$

(1)

$${Reactivity}={P({stop})}_{{treated}}-{P({stop})}_{{untreated}}$$

(2)

Reactivities were compared under conditions where 0.5 mM TbCl₃ (in-vitro transcribed RNAs) or 1 mM TbCl₃ (cell lysate RNAs) was employed and used in all figures unless indicated otherwise. For the Δ Tb analysis, the reactivity obtained from cell lysate + Proteinase-K probing experiments was subtracted from the reactivity obtained from cell lysate probing experiments (Eq. (3)). In order to take a conservative approach, a stringent cutoff of +/−0.5 was implemented to detect strong differences in reactivities.

$$\Delta {{{{{\rm{Tb}}}}}}={{Reactivity}}_{{Cell}{lysate}+{Proteinase}-K}-{{Reactivity}}_{{Cell}{lysate}}$$

(3)

Structure and graphical display

All secondary structures were visualized and drawn using Structure Editor, v1.0^64,65. All three-dimensional structure renderings were done using PyMOL Molecular Graphics System, v1.2r3pre, Schrödinger, LLC. Graphical displays were made using GraphPad Prism 8 or RStudio Version 1.2.5001. All gels were visualized using ImageJ 1.52a or ImageQuant TL v8.2.0.0.

Statistics and reproducibility

Sample size of as n = 2 was chosen for most experiments. A sample size in this case corresponds to biological replicates undergoing of full chemical probing data on intact RNAs. No statistical method was used to predetermine sample size. No data were excluded from the analyses. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All FASTQ files generated in this study have been deposited in the SRA database under accession code “PRJNA966800”. The processed data generated in this study are provided in the Supplementary Information/Source Data file. The structures used in this study can be found under the accession codes “4E8M”, “5A2Q” and “6AHR”. Source data are provided with this paper.

Code availability

All source codes used in this study are available at a GitHub repository: https://github.com/pylelab/Tb-seq.

References

Brunel, C., Marquet, R., Romby, P. & Ehresmann, C. RNA loop–loop interactions as dynamic functional motifs. Biochimie 84, 925–944 (2002).
Article CAS PubMed Google Scholar
Staple, D. W. & Butcher, S. E. Pseudoknots: RNA structures with diverse functions. PLoS Biol. 3, e213 (2005).
Article PubMed PubMed Central Google Scholar
Zhao, C. & Pyle, A. M. Structural insights into the mechanism of group II Intron splicing. Trends Biochem Sci. 42, 470–482 (2017).
Article CAS PubMed PubMed Central Google Scholar
Serganov, A. & Nudler, E. A decade of riboswitches. Cell 152, 17–24 (2013).
Article CAS PubMed PubMed Central Google Scholar
Wilkinson, M. E., Charenton, C. & Nagai, K. RNA splicing by the spliceosome. Annu. Rev. Biochem. 89, 359–388 (2020).
Article CAS PubMed Google Scholar
Guttman, M. & Rinn, J. L. Modular regulatory principles of large non-coding RNAs. Nature 482, 339–346 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Cech, T. R. & Steitz, J. A. The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones. Cell 157, 77–94 (2014).
Article CAS PubMed Google Scholar
Šponer, J. et al. RNA structural dynamics as captured by molecular simulations: a comprehensive overview. Chem. Rev. 118, 4177–4338 (2018).
Article PubMed PubMed Central Google Scholar
Fürtig, B., Richter, C., Wöhnert, J. & Schwalbe, H. NMR spectroscopy of RNA. ChemBioChem 4, 936–962 (2003).
Article PubMed Google Scholar
Reyes, F. E., Garst, A. D. & Batey, R. T. Strategies in RNA crystallography. Methods Enzymol. 469, 119–139 (2009).
Article CAS PubMed Google Scholar
Fernandez-Leiro, R. & Scheres, S. H. Unravelling biological macromolecules with cryo-electron microscopy. Nature 537, 339–346 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Merino, E. J., Wilkinson, K. A., Coughlan, J. L. & Weeks, K. M. RNA structure analysis at single nucleotide resolution by selective 2‘-hydroxyl acylation and primer extension (SHAPE). J. Am. Chem. Soc. 127, 4223–4231 (2005).
Article CAS PubMed Google Scholar
Wang, P. Y., Sexton, A. N., Culligan, W. J. & Simon, M. D. Carbodiimide reagents for the chemical probing of RNA structure in cells. RNA 25, 135–146 (2019).
Article CAS PubMed PubMed Central Google Scholar
Kertesz, M. et al. Genome-wide measurement of RNA secondary structure in yeast. Nature 467, 103–107 (2010).
Article ADS CAS PubMed Google Scholar
Underwood, J. G. et al. FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing. Nat. Methods 7, 995–1001 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ding, Y. et al. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features. Nature 505, 696–700 (2014).
Article ADS CAS PubMed Google Scholar
Zinshteyn, B. et al. Assaying RNA structure with LASER-seq. Nucleic Acids Res 47, 43–55 (2019).
Article CAS PubMed Google Scholar
Tullius, T. D. & Greenbaum, J. A. Mapping nucleic acid structure by hydroxyl radical cleavage. Curr. Opin. Chem. Biol. 9, 127–134 (2005).
Article CAS PubMed Google Scholar
Adilakshmi, T., Soper, S. F. C. & Woodson, S. A. In Methods Enzymol. Vol. 468 239-258 (Academic Press, 2009).
Lu, Z. et al. RNA duplex map in living cells reveals higher-order transcriptome structure. Cell 165, 1267–1279 (2016).
Article CAS PubMed PubMed Central Google Scholar
Aw, J. G. et al. In Vivo Mapping of Eukaryotic RNA Interactomes Reveals Principles of Higher-Order Organization and Regulation. Mol. Cell 62, 603–617 (2016).
Article CAS PubMed Google Scholar
Tomezsko, P. J. et al. Determination of RNA structural diversity and its role in HIV-1 RNA splicing. Nature 582, 438–442 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Homan, P. J. et al. Single-molecule correlated chemical probing of RNA. Proc. Natl Acad. Sci. 111, 13858–13863 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Pyle, A. M. Metal ions in the structure and function of RNA. J. Biol. Inorg. Chem. 7, 679–690 (2002).
Article CAS PubMed Google Scholar
Draper, D. E. A guide to ions and RNA structure. RNA 10, 335–343 (2004).
Article CAS PubMed PubMed Central Google Scholar
Forconi, M. & Herschlag, D. Metal ion-based RNA cleavage as a structural probe Ch. 5. Methods Enzymol. 468, 91–106 (2009).
Article CAS PubMed Google Scholar
Soukup, G. A. & Breaker, R. R. Relationship between internucleotide linkage geometry and the stability of RNA. RNA 5, 1308–1325 (1999).
Article CAS PubMed PubMed Central Google Scholar
Regulski, E. E. & Breaker, R. R. In-line probing analysis of riboswitches. Methods Mol. Biol. 419, 53–67 (2008).
Article CAS PubMed Google Scholar
CIiesiloka, J., Marciniec, T. & Krzyzosiak, W. J. Probing the environment of lanthanide binding sites in yeast tRNAPhe by specific metal‐ion‐promoted cleavages. Eur. J. Biochem. 182, 445–450 (1989).
Article Google Scholar
Harris, D. A., Tinsley, R. A. & Walter, N. G. Terbium-mediated footprinting probes a catalytic conformational switch in the antigenomic hepatitis delta virus ribozyme. J. Mol. Biol. 341, 389–403 (2004).
Article CAS PubMed Google Scholar
Harris, D. A. & Walter, N. G. In Handbook of RNA Biochemistry 205–213 (Weinheim, 2005).
Hargittai, M. R. & Musier-Forsyth, K. Use of terbium as a probe of tRNA tertiary structure and folding. RNA 6, 1672–1680 (2000).
Article CAS PubMed PubMed Central Google Scholar
Choudhary, P. K., Gallo, S. & Sigel, R. K. O. In RNA Folding: Methods and Protocols (ed Christina Waldsich) 143-158 (Humana Press, Totowa, NJ, 2014).
Feig, A. L., Scott, W. G. & Uhlenbeck, O. C. Inhibition of the Hammerhead Ribozyme Cleavage Reaction by Site-Specific Binding of Tb(III). Science 279, 81–84 (1998).
Article ADS CAS PubMed Google Scholar
Rangan, P. & Woodson, S. A. Structural requirement for Mg2+ binding in the group I intron core. J. Mol. Biol. 329, 229–238 (2003).
Article CAS PubMed Google Scholar
Sigel, R. K. O., Vaidya, A. & Pyle, A. M. Metal ion binding sites in a group II intron core. Nat. Struct. Biol. 7, 1111–1116 (2000).
Article CAS PubMed Google Scholar
Guo, L.-T. et al. Sequencing and structure probing of long RNAs using MarathonRT: A next-generation reverse transcriptase. J. Mol. Biol. 432, 3338–3352 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sexton, A. N., Wang, P. Y., Rutenberg-Schoenberg, M. & Simon, M. D. Interpreting reverse transcriptase termination and mutation events for greater insight into the chemical probing of RNA. Biochemistry 56, 4713–4721 (2017).
Article CAS PubMed Google Scholar
Marcia, M. & Pyle, A. M. Visualizing group II Intron catalysis through the stages of splicing. Cell 151, 497–507 (2012).
Article CAS PubMed PubMed Central Google Scholar
Su, L. J., Waldsich, C. & Pyle, A. M. An obligate intermediate along the slow folding pathway of a group II intron ribozyme. Nucleic Acids Res. 33, 6674–6687 (2005).
Article CAS PubMed PubMed Central Google Scholar
Pyle, A. M. The tertiary structure of group II introns: implications for biological function and evolution. Crit. Rev. Biochem. Mol. Biol. 45, 215–232 (2010).
Article CAS PubMed PubMed Central Google Scholar
Quade, N., Boehringer, D., Leibundgut, M., Van Den Heuvel, J. & Ban, N. Cryo-EM structure of Hepatitis C virus IRES bound to the human ribosome at 3.9-Å resolution. Nat. Commun. 6, 1–9 (2015).
Article Google Scholar
Lukavsky, P. J., Kim, I., Otto, G. A. & Puglisi, J. D. Structure of HCV IRES domain II determined by NMR. Nat. Struct. Mol. Biol. 10, 1033–1038 (2003).
Article CAS Google Scholar
Yokoyama, T. et al. HCV IRES captures an actively translating 80S ribosome. Mol. Cell 74, 1205–1214.e1208 (2019).
Article CAS PubMed Google Scholar
Paulsen, R. B. et al. Inhibitor-induced structural change in the HCV IRES domain IIa RNA. Proc. Natl Acad. Sci. 107, 7263–7268 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Dibrov, S. M. et al. Structure of a hepatitis C virus RNA domain in complex with a translation inhibitor reveals a binding mode reminiscent of riboswitches. Proc. Natl Acad. Sci. 109, 5223–5228 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Wu, J. et al. Cryo-EM structure of the human ribonuclease P holoenzyme. Cell 175, 1393–1404.e1311 (2018).
Article CAS PubMed Google Scholar
Smola, M. J., Calabrese, J. M. & Weeks, K. M. Detection of RNA–Protein interactions in living cells with SHAPE. Biochemistry 54, 6867–6875 (2015).
Article CAS PubMed Google Scholar
Huston, N. C. et al. Comprehensive in vivo secondary structure of the SARS-CoV-2 genome reveals novel regulatory motifs and mechanisms. Mol. cell 81, 584–598.e585 (2021).
Article CAS PubMed PubMed Central Google Scholar
Siegfried, N. A., Busan, S., Rice, G. M., Nelson, J. A. E. & Weeks, K. M. RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP). Nat. Methods 11, 959–965 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wan, H., Adams, R. L., Lindenbach, B. D. & Pyle, A. M. The In Vivo and In Vitro Architecture of the Hepatitis C Virus RNA Genome Uncovers Functional RNA Secondary and Tertiary Structures. J. Virol. 96, e0194621 (2022).
Article PubMed Google Scholar
Ziv, O. et al. The short- and long-range RNA-RNA Interactome of SARS-CoV-2. Mol. Cell 80, 1067–1077.e1065 (2020).
Article CAS PubMed PubMed Central Google Scholar
McHugh, C. A. & Guttman, M. RAP-MS: a method to identify proteins that interact directly with a specific RNA molecule in cells. Methods Mol. Biol. 1649, 473–488 (2018).
Article CAS PubMed Google Scholar
Licatalosi, D. D. et al. HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature 456, 464–469 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Ma, H., Jia, X., Zhang, K. & Su, Z. Cryo-EM advances in RNA structure determination. Signal Transduct. Target. Ther. 7, 58 (2022).
Article CAS PubMed PubMed Central Google Scholar
Spokoini-Stern, R. et al. Visualizing the structure and motion of the long noncoding RNA HOTAIR. RNA 26, 629–636 (2020).
Article CAS PubMed PubMed Central Google Scholar
Uroda, T. et al. Conserved pseudoknots in lncRNA MEG3 are essential for stimulation of the p53 pathway. Mol. Cell 75, 982–995 e989 (2019).
Article CAS PubMed PubMed Central Google Scholar
Aguilar, R. et al. Targeting Xist with compounds that disrupt RNA structure and X inactivation. Nature 604, 160–166 (2022).
Article ADS CAS PubMed Google Scholar
Swisher, J., Duarte, C. M., Su, L. J. & Pyle, A. M. Visualizing the solvent‐inaccessible core of a group II intron ribozyme. EMBO J. 20, 2051 (2001).
Article CAS PubMed PubMed Central Google Scholar
Tang, G.-Q. et al. Relaxed rotational and scrunching changes in P266L mutant of T7 RNA polymerase reduce short abortive RNAs while delaying transition into elongation. PloS one 9, e91859 (2014).
Article ADS PubMed PubMed Central Google Scholar
Qin, P. Z. & Pyle, A. M. Stopped-flow fluorescence spectroscopy of a group II Intron ribozyme reveals that domain 1 is an independent folding unit with a requirement for specific Mg2+ ions in the tertiary structure. Biochemistry 36, 4718–4730 (1997).
Article CAS PubMed Google Scholar
Pirakitikulr, N., Kohlway, A., Lindenbach, B. D. & Pyle, A. M. The coding region of the HCV genome contains a network of regulatory RNA structures. Mol. Cell 62, 111–120 (2016).
Article CAS PubMed PubMed Central Google Scholar
Smola, M. J., Rice, G. M., Busan, S., Siegfried, N. A. & Weeks, K. M. Selective 2’-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis. Nat. Protoc. 10, 1643–1669 (2015).
Article CAS PubMed PubMed Central Google Scholar
Reuter, J. S. & Mathews, D. H. RNAstructure: software for RNA secondary structure prediction and analysis. BMC Bioinform 11, 129 (2010).
Article Google Scholar
Kleinjung, J. & Fraternali, F. POPSCOMP: an automated interaction analysis of biomolecular complexes. Nucleic Acids Res. 33, W342–W346 (2005).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to thank Dr. Li-Tao Guo for preparing and sharing MarathonRT enzyme and Dr. Ananth Kumar, Dr. Rafael Tavares and Dr. Chengxin Zhang (Pyle lab, Yale University) for thoughtful comments on the manuscript. We would like to thank Brett Lindenbach for providing the cells. This work was supported by the Howard Hughes Medical Institute, the National Institutes of Health (grant T32GM007223-45to S.P.), the National Science Foundation (fellowship DGE1752134 to M.S.S.), Burroughs Welcome Fund and Smith Family Foundation for C.B.W. A.M.P. is an Investigator in the Howard Hughes Medical Institute.

Author information

Authors and Affiliations

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, USA
Shivali Patel, Alec N. Sexton & Matthew D. Simon
Department of Laboratory Medicine, Yale School of Medicine, New Haven, CT, USA
Madison S. Strine & Craig B. Wilen
Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
Madison S. Strine & Craig B. Wilen
Chemical Biology Institute, Yale University, West Haven, CT, USA
Matthew D. Simon
Howard Hughes Medical Institute, Chevy Chase, MD, USA
Anna Marie Pyle
Department of Chemistry, Yale University, New Haven, CT, USA
Anna Marie Pyle
Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT, USA
Anna Marie Pyle

Authors

Shivali Patel
View author publications
You can also search for this author in PubMed Google Scholar
Alec N. Sexton
View author publications
You can also search for this author in PubMed Google Scholar
Madison S. Strine
View author publications
You can also search for this author in PubMed Google Scholar
Craig B. Wilen
View author publications
You can also search for this author in PubMed Google Scholar
Matthew D. Simon
View author publications
You can also search for this author in PubMed Google Scholar
Anna Marie Pyle
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.P., A.N.S., and M.S.S. conducted experiments. S.P., A.N.S., M.D.S., C.B.W., and A.M.P. designed experiments. S.P. and A.M.P. wrote the paper.

Corresponding author

Correspondence to Anna Marie Pyle.

Ethics declarations

Competing interests

A patent application on MarathonRT has been filed by Yale University. Yale university has submitted a patent application pertaining to work outlined in this study. Inventors include A.M.P, S.P and M.D.S. C.B.W is a consultant for Exscientia. The remaining authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Patel, S., Sexton, A.N., Strine, M.S. et al. Systematic detection of tertiary structural modules in large RNAs and RNP interfaces by Tb-seq. Nat Commun 14, 3426 (2023). https://doi.org/10.1038/s41467-023-38623-1

Download citation

Received: 12 September 2022
Accepted: 09 May 2023
Published: 09 June 2023
DOI: https://doi.org/10.1038/s41467-023-38623-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.