RNA structure determination by solid-state NMR spectroscopy

Marchanka, Alexander; Simon, Bernd; Althoff-Ospelt, Gerhard; Carlomagno, Teresa

doi:10.1038/ncomms8024

Download PDF

Article
Open access
Published: 11 May 2015

RNA structure determination by solid-state NMR spectroscopy

Alexander Marchanka¹,
Bernd Simon¹,
Gerhard Althoff-Ospelt² &
…
Teresa Carlomagno^1,3

Nature Communications volume 6, Article number: 7024 (2015) Cite this article

7114 Accesses
61 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Knowledge of the RNA three-dimensional structure, either in isolation or as part of RNP complexes, is fundamental to understand the mechanism of numerous cellular processes. Because of its flexibility, RNA represents a challenge for crystallization, while the large size of cellular complexes brings solution-state NMR to its limits. Here, we demonstrate an alternative approach on the basis of solid-state NMR spectroscopy. We develop a suite of experiments and RNA labeling schemes and demonstrate for the first time that ssNMR can yield a RNA structure at high-resolution. This methodology allows structural analysis of segmentally labelled RNA stretches in high-molecular weight cellular machines—independent of their ability to crystallize— and opens the way to mechanistic studies of currently difficult-to-access RNA-protein assemblies.

Bioorthogonal masked acylating agents for proximity-dependent RNA labelling

Article 09 April 2024

Shubhashree Pani, Tian Qiu, … Bryan C. Dickinson

Digital colloid-enhanced Raman spectroscopy by single-molecule counting

Article 17 April 2024

Xinyuan Bi, Daniel M. Czajkowsky, … Jian Ye

Bridging structural and cell biology with cryo-electron microscopy

Article 03 April 2024

Eva Nogales & Julia Mahamid

Introduction

In gene expression regulation, stress response and pathogens infection, a multitude of non-coding RNAs and ribonucleoprotein complexes accomplish their function cycling through transient intermolecular contacts and related conformational changes. Taking influence on these processes requires a mechanistic understanding of the intermolecular interactions, which, in turn, necessitates structural information. Both naked RNAs and RNPs represent a challenge for structural biology. The conformational plasticity of the RNA restricts application of X-ray crystallography, while the high-molecular weight of the RNA (or RNP) of interest pushes solution-state NMR to its limits. Lately, solid-state NMR (ssNMR) spectroscopy, which is applicable to macromolecules of any size in non-crystalline form, has emerged as a powerful alternative to study the structure of amyloid fibrils^1,2, membrane proteins^3,4, and large protein–protein assemblies⁵. Despite these successes, ssNMR has been rarely applied to nucleic acids, and the methodology for RNA structure determination is still lacking⁶. Here, we present the first de novo structure determination of RNA by ssNMR, together with the experimental methods we developed for it. We demonstrate that RNA structure is accessible at high resolution by ssNMR using a few, easy to prepare, nucleotide-type selectively labeled samples. This methodology opens the way to the structure of RNA stretches in large RNA-protein assemblies, independent of their ability to crystallize, and thus to mechanistic studies of yet inaccessible cellular machines.

Results

The Box C/D RNA bound to L7Ae

In this study, we solve the structure of the 26mer Box C/D RNA from Pyrococcus Furiosus (Pf) as part of the complex with the protein L7Ae (Fig. 1a and Supplementary Fig. 1). ¹³C and ¹⁵N line widths of 0.4–0.5 and 0.6–0.8 p.p.m., respectively, allow determining the structure by ssNMR data with a precision of 0.8 Å. We choose to study this RNA for the following reasons. First, we were unable to crystallize the L7Ae-Box C/D RNA complex with the RNA sequence of Fig. 1a, despite the existence of crystallographic structures of homologues^7,8. This demonstrates that the crystallization of RNA-protein complexes can be unexpectedly challenging, in dependence of the RNA sequence. Second, the RNA of Fig. 1a contains the conserved Box C and Box D sequences, which build the so-called k-turn motif. The geometry of the k-turn is measured by an angle φ⁹, which is variable in the free RNA and depends on the concentration of magnesium^10,11. Upon protein binding, the k-turn parameter φ adopts a value close to 23° for all k-turn motifs investigated to date, independent of the experimental method, the exact sequence of the RNA or the species it belongs to ref. 12. The conservation of this structural motif offers the opportunity to verify the accuracy of the structure obtained by ssNMR, beyond the differences to crystallographic reference structures expected as a consequence of packing forces and RNA-RNA contacts in the crystals.

**Figure 1: Sequence of the Pf Box C/D RNA and magnetization transfer schemes.**

Resonance assignment and measurement of distance restraints are the key steps in structure determination by ssNMR. In contrast to proteins, where homonuclear ¹³C,¹³C correlations are sufficient for resonance assignment, the poor chemical shift dispersion of ribose resonances in RNA requires additional heteronuclear editing. We find that three-dimensional pulse schemes yield low signal-to-noise within our experimental set-up, while the quality of two-dimensional spectra allows for both assignment and quantification of cross-peaks. Therefore, our strategy does mainly without three-dimensional experiments and resolves spectral overlaps by selective labeling. To make the method accessible to a broad community, we abstain from using atom-selective labelling and employ only RNAs that can be produced with commercially available building blocks by in vitro transcription. In this study, we designed eight combinations of double-nucleotide-type selective-labelled RNAs (Supplementary Fig. 1), to accomplish both resonance assignment and measurement of structural parameters.

Sequence-specific assignment

First, we assigned the spin-systems of individual nucleotides in a non-site-specific manner¹³. For the 26mer Box C/D RNA, we found six adenosines, seven guanosines, three cytosines and four uridines spin-systems. Analysis of canonical coordinates of ribose shifts¹⁴ (Supplementary Fig. 2) suggests that two adenosines, three guanosines and one uridine are not located in regular A-form helices. This allowed us to attribute the uridine spin-system to U20, which was used as starting point for sequential assignment.

Seventeen out of the 26 nucleotides were assigned site-specifically using correlations between the C1’ or C6/C8 atoms of nucleotide i to the carbons of neighbouring nucleotides i±1, as well as nucleotides of the opposite strand (Fig. 1b, Supplementary Table 1). To improve resolution, before the ¹³C,¹³C transfer, the magnetization of C1’ or C6/C8 was correlated to the respective N1/N9 via TEDOR (Transferred-Echo-DOuble-Resonance)¹⁵, yielding a two-dimensional (2D) ¹⁵N,¹³C correlation (Fig. 2a,b, Methods). This allows clear distinction of purine and pyrimidines in double-nucleotide-type selective-labelled samples (for example, G,U^lab- or A,U^lab-RNAs). For the long-range carbon–carbon transfer, we tested different mixing sequences and finally settled on the PDSD (Proton-Driven-Spin-Diffusion) scheme due to its superior sensitivity¹⁶.

**Figure 2: ssNMR spectra for the sequential assignment and measurement of structural restraints.**

¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD (Supplementary Fig. 3a) was applied with a mixing time of 700 ms to six selective-labeled RNAs (Supplementary Fig. 1) and yielded several inter-nucleotide contacts up to a distance of 9–10 Å (Fig. 2a,b, Supplementary Figs 4 and 5). As an example of sequential assignment, G,U^lab- and A,U^lab-RNAs yielded multiple correlations between the C1’,C6 of U20 and both a guanosine and an adenosine spin-system; the latter correlates further with another adenosine of the A,U^lab-RNA. These cross-peaks are compatible with either an AAUG or a GUAA stretch, and identify unambiguously the spin-system A18-A19-U20-G21. This strategy yielded sequential assignment of 17 out of 19 nucleotides in structured regions, excluding the tetra-loop and the terminal ends. The remaining two nucleotides (G14, A15) were assigned by substitution of the tetra-loop sequence GAAA with UUCG. This alleviated the overlap of G14 and A15 with the resonances of the GAAA loop, allowing their assignment, as well as the identification of G10 and of two adenosines of the A11–A13 stretch. The poor intensity of the GAAA tetra-loop resonances is indicative of conformational heterogeneity; likewise, the terminal G1, U25 and C26 spin-systems are not visible in any of the spectra and were not considered in the structure calculation.

Next, we tested the performance of a ¹³C,³¹P correlation, which, with a TEDOR mixing time of 3.2 ms, should provide sequential C2’_i/P_i+1 and C3’_i/P_i+1 contacts. The mixing time was optimized for sensitivity of transfer over two to three bonds, ranging up to 4–5 Å distance (Fig. 1c). As expected, the ³¹P resonances are poorly resolved in helices and the spectrum provided information only for non A-form structural elements (Fig. 2c, Supplementary Fig. 6a,b).

Finally, we could sequentially assign >90% of all carbon resonances of the Pf Box C/D RNA in the stretches 2–10 and 14–24 (81% for both carbons and nitrogens).

Structural determination of the RNA by ssNMR

The determination of RNA secondary structure requires the identification of base pairs. To this end, we used a ¹⁵N,¹⁵N through-space correlation (RFDR, Radio-Frequency-Driven-Recoupling)^17,18 to reveal the spatial proximity of either A-N1 and U-N3 or G-N1 and C-N3 in Watson–Crick base pairs (Fig. 1d). The presence of three G:C base pairs (Fig. 2d) defined the C-stem. The G24:C2 base pair was not found due to the absence of the G24-N1 resonance in intra-nucleotide correlations, probably as a consequence of conformational heterogeneity at the helix ends.

Secondary structure prediction suggests one U·U and two A·G base pairs (Fig. 1a). Initially, to verify the presence and determine the topology of these non-canonical base pairs, we measured NHHN spectra^19,20; in this experiment, magnetization is transferred between close-by ¹⁵N nuclei exploiting the spatial proximity of their attached protons (Fig. 1e). This strategy failed, due to severe overlap of the involved nitrogen resonances. Next, we recorded NHHC spectra (Fig. 1e) on three selectively labelled RNA samples (Supplementary Fig. 7); (G,U)^lab-RNA yielded weak N2_G21/C1’_G4 correlations, while (A,G)^lab-RNA yielded strong N6_A22/C1’_G4 and N6_A5/C1’_G21 signals. The last two correlations were also detected in a ¹³C-band-selective, ¹⁵N-TEDOR spectrum (Supplementary Fig. 6c). This pattern of cross-peaks, together with the anti conformation of the glycosidic angle χ for all four G and A nucleotides (vide infra), is exclusively compatible with two N7-amino, N3-imino base pairs, which are typical of k-turn motifs.

The U3·U23 base pair might be detected from the proximity of the two H3 atoms in a NHHN correlation. In our case, the chemical shift difference of only 1 p.p.m. between the U3-N3 and U23-N3 hindered the resolution of the weak cross-peak from the intense diagonal. Therefore, we resorted to the analysis of chemical shifts (CS), as indicators of secondary structure. The CS of U23-C2 (151 p.p.m.) and both U3- and U23-C4 (165.6 and 167.6 p.p.m., respectively) deviate from the values of non-stacked disordered nucleotides (154.0 and 168.5 p.p.m.) as well as from the values of A-form helices (152.9 and 169.2 p.p.m.; ref. 21). The low CS of U23-C2 and U3-C4 indicate stacking on both sides, while for U3-C2 and U23-C4 the up-field shift induced by stacking is compensated by the down-field shift of carbonyl acceptors of H-bonds. All in all, CS analysis predicts that U3 and U23 form a U3·U23 2-carbonyl-N3, 4-carbonyl-N3 base pair²².

Distance restraints

Next, we obtained distance restraints from four different correlation experiments: ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD recorded at multiple mixing times provided carbon–carbon distances; ¹³C,³¹P-TEDOR and ¹³C-band-selective, ¹⁵N-TEDOR yielded a few carbon–phosphorus (17) and carbon–nitrogen (6) distances, respectively; CHHC and NHHC experiments yielded distances between protons (Supplementary Table 2). In this context, we proved the applicability of more sophisticated and selective transfer schemes, such as PAR (Proton-Assisted-Recoupling) and PAIN (Proton-Assisted-Insensitive-Nuclei)^23,24. However, the sensitivity of these experiments remained too low, especially in combination with heteronuclear filtering.

The mixing sequence PDSD does not permit the quantitative measurement of distance restraints^25,26; however, when recorded at multiple mixing times, it provided information on (C1’,C8/C6)_i–(C_x)_j distance ranges. A total of 91 inter-nucleotide cross-peaks were obtained from the ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD experiments, which were all incorporated in structure calculations, in addition to 46 intra-nucleotide restraints over ≥3 bonds (Supplementary Table 2).

Next, we attempted to obtain base–base C_i–N_i±1 cross-peaks through a ¹³C-band-selective,¹⁵N-TEDOR experiment¹⁵ recorded for samples with ¹³C-labelling of one nucleotide type and ¹⁵N-labelling of another nucleotide type (Fig. 1f). Our efforts were unsuccessful, due to low signal-to-noise. However, when recording a (¹³C1’,¹³C4’)-band-selective,¹⁵N-TEDOR, we obtained six inter-nucleotide cross-peaks from both the k-turn and helical regions (Supplementary Fig. 6c,d and Supplementary Table 2).

Finally, 2D NHHC and CHHC spectra^19,27 yielded 17 and 10 inter-nucleotide contacts (Supplementary Figs 7 and 8), respectively, in addition to 21 intra-nucleotide correlations over ≥3 bonds (Supplementary Table 2).

In addition to distance restraints, we obtained dihedral angles from analysis of ribose chemical shifts (Methods and Supplementary Fig. 2) and from CHHC experiments at short mixing times. Similarly to solution-state NMR, the χ angle was restrained to syn in the presence of a strong C1’-C8/C6 cross-peak (short H1’-H8/H6 distance) and to anti in the other cases. Only A19 displayed a χ angle in the syn conformation, in agreement with other k-turn RNA structures¹¹.

Structure calculations

Distance and dihedral angle restraints, as well as base pair restraints were used in ARIA²⁸ to calculate the structure of the Box C/D RNA from ssNMR data. Out of 300 calculated structures, the first 60 converged to a well-defined minimum with precision of 0.9 Å (root-mean-square-deviation (r.m.s.d.) of all heavy atoms of nucleotides 2–9 and 14–24 of the first 20 structures; Fig. 3a and Table 1). The distances derived from the ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD spectra had the highest impact on the precision of the structure, followed by those derived from the CHHC and NHHC experiments (Supplementary Fig. 9). As for structural calculation from solution-state NMR data, the definition of the RNA secondary structure (topology of base pairs) was essential. The stem regions were defined by eight distance and nine angular restraints per residue, while the geometry of the k-turn required 21 distances per nucleotide. The structure determination method was validated by removal of random fractions of restraints. The structures bundles were consistent upon random removal of up to 20% of the total restraints.

**Figure 3: ssNMR structure of the Pf Box C/D RNA.**

Table 1 Structural statistics (20 structures out of 300 calculated, PDB code 2n0r).

Full size table

Discussion

The 26mer Box C/D RNA used in this study does not crystallize in complex with L7Ae; however, the crystallographic structure of two orthologous complexes from Archaeoglobus fulgidus^7,29 (Af, PDB code 1RLG and 4BW0), one orthologous complex from Solpholobus solfataricus³⁰ (Ss, PDB code 3PLA) and another L7Ae-Box C/D RNA from Pf with a different RNA sequence⁸ (PDB code 3NMU) let us evaluate the accuracy of the ssNMR structure in the critical k-turn region (Fig. 3b,c). The φ angle of 23° that defines the k-turn geometry of the ssNMR structure is in very good agreement with the φ angles of the reference structures (1RLG, 23°; 4BW0, 22°; 3PLA, 24°; 3NMU, 24°).

Next, we analysed the backbone and glycoside torsion angles of our structures bundle and compared them with the corresponding torsion angles of the four reference structures (Supplementary Fig. 10). We choose to compare torsion angles rather than r.m.s.d. values to better visualize the variability of both the crystallographic structures and our bundle at each nucleotide position. The δ torsion describes the ribose pucker and is defined by the chemical shift analysis of Supplementary Fig. 2. The values fit nicely to those of the reference structures, with the exception of 5, 19 and 20 of 1RLG, which adopt the C3’-endo conformation. Our NMR data indicate that the conformation of these riboses is C2’-endo, in agreement with the other three crystallographic structures. Similarly, the ɛ and ζ angles of the same nucleotides of 1RLG deviate from the values of both our structures bundle and the other three crystallographic structures. The β, ɛ and ζ torsion angles are not directly determined by any NMR parameter, but rather restrained loosely by data base values (see Methods), ³¹P–¹³C and ¹³C–¹³C distances. Nevertheless, the distribution of these angles in the ssNMR bundle is quite narrow and in good agreement with the reference structures. The α and γ torsion angles are the least well defined by the NMR distance restrains in the stretch 18–21 of the Box C sequence. Interestingly, high variability is observed for these torsion angles among the four reference structures as well, indicating that the k-turn geometry is tolerant to different values (Supplementary Fig. 10). The only clear discrepancy between the ssNMR structures bundle and the four references structures is observed for A5-α,γ. The A5-³¹P chemical shift value (Supplementary Table 1) does not allow to restrict the A5-α to the gauche±conformations³¹, as observed in the four reference structures. However, despite this local difference, the k-turn geometry of the ssNMR bundle agrees very well with that of the reference conformations, with an average backbone r.m.s.d. for k-turn nucleotides 4–6 and 17–22 of 1.3 Å to the four crystallographic structures (Fig. 3c). Finally, the glycosidic torsion angles χ are determined from the intensities of the C1’-C8/C6 cross-peaks and nicely agree with those of the reference structures.

In summary, we demonstrate that the structure of RNA is accessible by ssNMR with excellent precision and accuracy, despite the difficulties caused by broad line widths and resonance overlap. We present a straightforward, manageable strategy that uses easy-to-produce nucleotide-type selective-labelled RNAs and sensitive magnetization transfer schemes. Our results make the folding of short RNAs and selectively labelled RNA stretches, as well as their interaction with proteins, accessible at high resolution in the context of large RNAs and RNP particles. We anticipate that our method will have a considerable impact in various fields of RNA processing and small RNA regulation (siRNA, miRNA, piRNA), where the dynamic nature of the molecular complexes represents an obstacle to crystallization.

Methods

Sample preparation

The L7Ae–Box C/D RNA complex was assembled from protein and RNA in 1:1 ratio and purified by size exclusion chromatography. L7Ae was expressed in Escherichia coli (LB medium) and purified over a Ni-Nta column. Nucleotide-type ¹³C, ¹⁵N selective-labeled Box C/D RNA was prepared by in vitro transcription with T7 polymerase produced in house. Labeling patterns of the RNA were obtained using NTP mixtures where only one or two nucleotide types were either ¹⁵N or ¹³C, or double ¹³C,¹⁵N labeled.

Sequential resonance assignment and measurement of structural restraints used eight samples with different labeling patterns. Six samples consisted of double ¹³C,¹⁵N nucleotide-type selective-labelled RNAs: A,C^lab-RNA, A,G^lab-RNA, A,U^lab-RNA, C,U^lab-RNA, G,C^lab-RNA and G,U^lab-RNA (Supplementary Fig. 1a–f); two samples contained single ¹³C or ¹⁵N labelled nucleotide pairs: (G-¹³C, A-¹⁵N)^lab-RNA and (G-¹³C, U-¹⁵N)^lab-RNA (Supplementary Fig. 1g,h). Next to these RNAs, an additional RNA construct was used to facilitate assignment, where the GAAA tetra-loop is substituted with the UUCG tetra-loop.

The L7Ae–Box C/D RNA complex was concentrated to 20 mg ml⁻¹ in buffer containing 25 mM HEPES and 120 mM sodium chloride at pH 7.5, and subsequently mixed with equal amount of precipitation solution (100 mM sodium acetate, 30% PEG 400 in 100 mM HEPES, pH 7.5), as reported previously^13,32,33. The sample was micro-crystallized by slow precipitation using a SpeedVac concentrator at room temperature for ∼2.5 h. The complex precipitated at half volume. The precipitate was packed in the ssNMR rotor by centrifugation. The final sample contained ∼4 mg of RNA and 6 mg of L7Ae.

NMR spectroscopy

Solid-state NMR experiments were performed on a 700 MHz SB Bruker Avance III spectrometer equipped with 3.2 mm MAS ¹H/¹³C/¹⁵N probehead. ¹³C,³¹P TEDOR experiments were acquired at 600 MHz with a WB Bruker Avance III spectrometer equipped with a tunable ¹H/X/Y probehead at Bruker Biospin in Rheinstetten. The temperature of all experiments was 260 K. ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD, ¹³C,³¹P-TEDOR, ¹³C,¹⁵N-TEDOR and ¹⁵N,¹⁵N-RFDR experiments were performed at 16 kHz MAS, while proton diffusion-based CHHC and NHHC experiments were performed at 13 kHz MAS.

¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD

In the ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD experiment (Supplementary Fig. 3a) ¹³C magnetization was prepared by standard ¹H–¹³C cross polarization (mixing time, 200 μs). The ¹³C–¹⁵N dipolar coupling was reintroduced in a short TEDOR mixing time (1.5–2 ms), during which magnetization was transferred to nearby ¹⁵N nuclei, and then, after t₁, back to the ¹³C. In t₁, we recorded the frequency of nitrogens close to carbons, as for example that of N1/N9 directly bound to C1’ and C8/C6. The following, long ¹³C,¹³C-PDSD step (mixing time, 200–700 ms) transferred the ¹⁵N-chemical shift labelled ¹³C magnetization to nearby carbons. Finally, ¹³C magnetization was detected during t₂. The ambiguity on the carbon from which the magnetization originates in the PDSD step, either C1’ or C8/C6, was lifted in three-dimensional experiments, where the ¹³C frequency was recorded before the PDSD mixing. Alternatively, we evaluated the efficiency of the ¹⁵N1/N9-¹³C transfer, which in several instances was found to be better towards the C1’ than towards C8/C6. As a third alternative, a ¹³C-band-selective-TEDOR transfer, with selectivity either on C1’ or on C6/C8, can be used to resolve the ambiguity. Cross-peaks were evaluated and translated into distance restraints. Distance ranges (d) were applied for inter-nucleotide restraints as 3.5<d<9 Å, according to several previous studies^34,35,36; the ranges for intra-nucleotide base–ribose restraints, 3<d<6 Å, and intra-nucleotide ribose–ribose restraints, 3<d<4 Å, were determined from the nucleotides’ geometry.

¹³C,³¹P-TEDOR

In the ¹³C,³¹P-TEDOR experiment (Supplementary Fig. 3b), after initial preparation of ¹³C magnetization, the ¹³C,³¹P dipolar coupling was re-introduced in a TEDOR mixing time of 3.2 ms; the frequency of ³¹P was monitored in t₁, while ¹³C magnetization was detected during t₂. Optionally, a short ¹³C,¹³C-PDSD step (50–100 ms) can be applied after TEDOR to transfer the ¹³C magnetization to further carbon spins, such as C1’. This experiment was useful to identify the ribose spin systems through the better-resolved C1’ chemical shift. ¹³C,³¹P-TEDOR spectra were recorded for A^lab-RNA and G,U^lab-RNA; due to the limited signal-to-noise, only one TEDOR mixing time was recorded (3.2 ms). The spectra yielded 17 non-trivial restraints, which were classified as 3<d<5 Å, as appropriate for a mixing time of 3.2 ms.

¹³C-band-selective, ¹⁵N-TEDOR

In the ¹³C-band-selective,¹⁵N-TEDOR experiment⁹ (Supplementary Fig. 3c), after initial preparation of ¹³C magnetization, the ¹³C–³¹N dipolar coupling was reintroduced in a TEDOR mixing time of 6–15 ms with band-selective ¹³C inversion pulses; the long mixing allows transferring magnetization between carbons and nitrogens as far as 5–6 Å. The ¹⁵N and ¹³C frequencies were recorded during t₁ and t₂, respectively. With a ¹³C,¹⁵N-TEDOR that was selective for C1’ and C4’, we obtained four G-C1’,C4’/A-N6,N9 cross-peaks from the (G-¹³C,A-¹⁵N)^lab-RNA and two G-C1’/U-N1,N3 cross-peaks from the (G-¹³C,U-¹⁵N)^lab-RNA. Also in this case, we did not acquire multiple TEDOR mixing times, due to limited signal-to-noise. Distance ranges 3<d<5 Å and 3<d<7 Å were attributed to the strong and weak peaks, respectively, at a mixing time of 12 ms.

¹⁵N,¹⁵N-RFDR

In the ¹⁵N,¹⁵N-RFDR experiment (Supplementary Fig. 3d), ¹⁵N magnetization was prepared through a 300 μs cross-polarization step and its frequency was recorded during t₁; subsequently, the magnetization was transferred to nearby nitrogen atoms via an RFDR mixing step of 20 ms and finally detected during t₂.

CHHC and NHHC

In the CHHC and NHHC proton spin diffusion-based experiments (Supplementary Fig. 3e,f, respectively), ¹³C or ¹⁵N magnetization was prepared through a short cross-polarization mixing time of 100–200 μs, followed by t₁ evolution on either ¹³C (CHHC) or ¹⁵N (NHHC). Next, the magnetization was transferred back to protons, from where, after a short proton mixing of 100–200 μs, it was transferred to nearby carbons with a 100-μs cross polarization step; finally the frequency of ¹³C was recorded in t₂. Inter-proton distances 2<d<4 and 2<d<5 Å were attributed to the strong and weak signals, respectively, following previous studies^19,27.

In all the experiments, protons were decoupled in the indirect and direct acquisition times using high-power SPINAL-64 (ref. 37) decoupling at 85–95 kHz. Chemical shifts were referenced as described by Morcombe and Zilm³⁸. The spectra were processed with NMRPipe³⁹ and visualized with NMRviewJ⁴⁰.

Structural calculation protocol

Structures were calculated using the Aria 1.2/CNS 1.1 set-up^28,41 following a similar protocol as for structural calculations of RNA by solution-state NMR data^11,42,43. Both canonical and non-canonical base-pairs were incorporated in the structure calculation as distance restraints. Planarity was enforced through weak planarity restraints (5 kcal mol⁻¹ Å⁻²) for canonical base pairs and non-canonical base pair U3·U23. Flexible planarity was introduced for the base pairs A5·G21 and G4·A22 by defining the plane that involves one atom of the acceptor and four atoms of the donor base to allow for propeller twist and tilt, as described in ref. 44.

The ribose conformation was restrained through the analysis of ribose chemical shifts¹⁴ (Supplementary Fig. 2). The riboses of nucleotides G4, A5, A19, U20, G21 were given an S-type conformation, while the remaining nucleotides, except for G6 and A11-A13, were restrained to the N-type conformation. The dihedral angles α, β, ɛ and ζ were restrained to the range typical for A-form helix (300°±30°, 180°±30°, −135°±30° and 300°±30°, respectively) for nucleotides 2, 6–9, 14–17, 24, which are involved in canonical base pairs; the α, β, ɛ and ζ angles of the remaining nucleotides were loosely restrained to the allowed ranges (180°±150°, 180°±110°, −125°±75° and 180°±150°, respectively). Dihedral angles α and ζ of nucleotides G4, G6, A18, U20 and G21 were additionally restrained to 0°±120° based on ³¹P chemical shifts³¹. The dihedral angle γ was restrained to the gauche+ conformation for nucleotides involved in base pairs.

Three hundred structures were calculated in one iteration without the automated assignment or the distance calibration options of Aria 1.2 using an assigned distance list. Before minimization, we randomized all backbone dihedral angles. The minimization protocol used the force-field DNA-RNA-allatom-hj-opls.top and the following parameters in the four steps of simulated annealing (SA), together with the PROLSQ nonbonded parameters⁴³: (i) the SA protocol started with a high-temperature torsion angle simulated annealing phase of 100,000 steps at 20,000 K (time step of 22.5 fs); (ii) this was followed by a torsion angle dynamic cooling phase from 20,000 to 1,000 K in 100,000 steps and by two cartesian dynamic cooling phases with a time step of 2.5 fs ((iii) from 2,000 to 1,000 K in 100,000 steps and (iv) from 1,000 to 50 K in 80,000 steps, respectively. Finally, 20 low energy structures were refined in water (TIP3P) with OPLS nonbonded parameters⁴⁵. Standard ARIA force constants were used for the different restraint types (for example, distances—50 kcal mol⁻¹, and dihedrals—200 kcal mol⁻¹, in the final cooling step.)²⁸.

The final structures were analysed using MolMol⁴⁶ and Chimera⁴⁷. Figures were prepared with Chimera.

Analysis of the mutant Box C/D RNA with the UUCG tetra-loop

We measured two samples of the mutant Box C/D RNA containing the stable UUCG tetra-loop (UUCG-RNA) instead of the GAAA tetra-loop to aid and confirm the assignment. The 2D ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD of the (A,U)^lab-UUCG-RNA allowed identifying two adenosines of the GAAA tetra-loop of the wild-type RNA, which disappear in the mutant spectrum. In addition, we could confirm the assignment of A15, which does not overlap with any other spin-system in the UUCG-RNA. The 2D ¹³C,¹⁵N-TEDOR-¹³C,¹³C-PDSD spectrum of G,C^lab-UUCG-RNA allowed the assignment of G10, whose spin-system is not present in the spectrum of the mutant RNA. In addition, the resonances of G14 shift slightly in the mutant with respect to the wild-type RNA, due to the different structure of the UUCG tetra-loop. This fact confirmed the assignment of the G14 spin-system.

Additional information

Accession codes: The atomic coordinates of 10 lowest energy structures have been deposited in the Protein Data Bank under accession number 2N0R. The NMR chemical shifts have been deposited in the Biological Magnetic Resonance Data Bank, entry 25534.

How to cite this article: Marchanka, A. et al. RNA structure determination by solid-state NMR spectroscopy. Nat. Commun. 6:7024 doi: 10.1038/ncomms8024 (2015).

Accession codes

Accessions

Biological Magnetic Resonance Data Bank

25534

Protein Data Bank

2N0R

References

Petkova, A. T. et al. A structural model for Alzheimer's beta-amyloid fibrils based on experimental constraints from solid state NMR. Proc. Natl Acad. Sci. USA 99, 16742–16747 (2002).
Article CAS ADS Google Scholar
Jaroniec, C. P. et al. High-resolution molecular structure of a peptide in an amyloid fibril determined by magic angle spinning NMR spectroscopy. Proc. Natl Acad. Sci. USA 101, 711–716 (2004).
Article CAS ADS Google Scholar
de Groot, H. J. M. Solid-state NMR spectroscopy applied to membrane proteins. Curr. Opin. Struct. Biol. 10, 593–600 (2000).
Article CAS Google Scholar
Opella, S. J. & Marassi, F. M. Structure determination of membrane proteins by NMR spectroscopy. Chem. Rev. 104, 3587–3606 (2004).
Article CAS Google Scholar
Shi, L. C. et al. Conformation of a seven-helical transmembrane photosensor in the lipid environment. Angew. Chem. Int. Ed. 50, 1302–1305 (2011).
Article CAS Google Scholar
Marchanka, A. & Carlomagno, T. Solid-state NMR and RNA structure: a new partnership? eMagRes 3, 119–128 (2014).
Article Google Scholar
Moore, T., Zhang, Y. M., Fenley, M. O. & Li, H. Molecular basis of box C/D RNA-protein interactions: cocrystal structure of archaeal L7Ae and a box C/D RNA. Structure 12, 807–818 (2004).
Article CAS Google Scholar
Xue, S. et al. Structural basis for substrate placement by an archaeal Box C/D ribonucleoprotein particle. Mol. Cell 39, 939–949 (2010).
Article CAS Google Scholar
Cojocaru, V., Nottrott, S., Klement, R. & Jovin, T. M. The snRNP 15.5K protein folds its cognate K-turn RNA: a combined theoretical and biochemical study. RNA 11, 197–209 (2005).
Article CAS Google Scholar
Goody, T. A., Melcher, S. E., Norman, D. G. & Lilley, D. M. J. The kink-turn motif in RNA is dimorphic, and metal ion-dependent. RNA 10, 254–264 (2004).
Article CAS Google Scholar
Falb, M. et al. Structure of the K-turn U4 RNA: a combined NMR and SANS study. Nucleic Acids Res. 38, 6274–6285 (2010).
Article CAS Google Scholar
Schroeder, K. T., McPhee, S. A., Ouellet, J. & Lilley, D. M. A structural database for k-turn motifs in RNA. RNA 16, 1463–1468 (2010).
Article CAS Google Scholar
Marchanka, A., Simon, B. & Carlomagno, T. A suite of solid-state NMR experiments for RNA resonance assignment in a 21 kDa protein-RNA complex. Angew. Chem. Int. Ed. 52, 9996–10001 (2013).
Article CAS Google Scholar
Cherepanov, A. V., Glaubitz, C. & Schwalbe, H. High-resolution studies of uniformly C-13,N-15-labeled RNA by solid-state NMR spectroscopy. Angew. Chem. Int. Ed. 49, 4747–4750 (2010).
Article CAS Google Scholar
Jaroniec, C. P., Filip, C. & Griffin, R. G. 3D TEDOR NMR experiments for the simultaneous measurement of multiple carbon-nitrogen distances in uniformly C-13, N-15-labeled solids. J. Am. Chem. Soc. 124, 10728–10742 (2002).
Article CAS Google Scholar
Szeverenyi, N. M., Sullivan, M. J. & Maciel, G. E. Observation of spin exchange by two-dimensional Fourier-Transform C-13 cross polarization-magic-angle spinning. J. Magn. Reson. 47, 462–475 (1982).
CAS ADS Google Scholar
Bennett, A. E., Ok, J. H., Griffin, R. G. & Vega, S. Chemical-shift correlation spectroscopy in rotating solids—radio frequency-driven dipolar recoupling and longitudinal exchange. J. Chem. Phys. 96, 8624–8627 (1992).
Article CAS ADS Google Scholar
Leppert, J. et al. Identification of NH...N hydrogen bonds by magic angle spinning solid state NMR in a double-stranded RNA associated with myotonic dystrophy. Nucleic Acids Res. 32, 1177–1183 (2004).
Article CAS Google Scholar
Lange, A., Luca, S. & Baldus, M. Structural constraints from proton-mediated rare-spin correlation spectroscopy in rotating solids. J. Am. Chem. Soc. 124, 9704–9705 (2002).
Article CAS Google Scholar
Riedel, K. et al. Characterisation of hydrogen bonding networks in RNAs via magic angle spinning solid state NMR spectroscopy. J. Biomol. NMR 31, 331–336 (2005).
Article CAS Google Scholar
Farès, C., Amata, I. & Carlomagno, T. C-13-Detection in RNA bases: revealing structure-chemical shift relationships. J. Am. Chem. Soc. 129, 15814–15823 (2007).
Article Google Scholar
Ohlenschläger, O. et al. The structure of the stemloop D subdomain of coxsackievirus B3 cloverleaf RNA and its interaction with the proteinase 3C. Structure 12, 237–248 (2004).
Article Google Scholar
Lewandowski, J. R., De Paëpe, G. & Griffin, R. G. Proton assisted insensitive nuclei cross polarization. J. Am. Chem. Soc. 129, 728–729 (2007).
Article CAS Google Scholar
De Paëpe, G. et al. Proton assisted recoupling and protein structure determination. J. Chem. Phys. 129, 245101 (2008).
Article ADS Google Scholar
Loquet, A. et al. 3D structure determination of the Crh protein from highly ambiguous solid-state NMR restraints. J. Am. Chem. Soc. 130, 3579–3589 (2008).
Article CAS Google Scholar
Manolikas, T., Herrmann, T. & Meier, B. H. Protein structure determination from C-13 spin-diffusion solid-state NMR spectroscopy. J. Am. Chem. Soc. 130, 3959–3966 (2008).
Article CAS Google Scholar
Lange, A. et al. A concept for rapid protein-structure determination by solid-state NMR spectroscopy. Angew. Chem. Int. Ed. 44, 2089–2092 (2005).
Article CAS Google Scholar
Linge, J. P., Habeck, M., Rieping, W. & Nilges, M. ARIA: automated NOE assignment and NMR structure calculation. Bioinformatics 19, 315–316 (2003).
Article CAS Google Scholar
Huang, L. & Lilley, D. M. The molecular recognition of kink-turn structure by the L7Ae class of proteins. RNA 19, 1703–1710 (2013).
Article CAS Google Scholar
Lin, J. et al. Structural basis for site-specific ribose methylation by box C/D RNA protein complexes. Nature 469, 559–563 (2011).
Article CAS ADS Google Scholar
Varani, G., Aboul-ela, F. & Allain, F. H. T. NMR investigation of RNA structure. Prog. Nucl. Magn. Reson. Spectrosc. 29, 51–127 (1996).
Article CAS Google Scholar
Jehle, S. et al. Intermolecular protein-RNA interactions revealed by 2D P-31-N-15 magic angle spinning solid-state NMR spectroscopy. J. Am. Chem. Soc. 132, 3842–3846 (2010).
Article CAS Google Scholar
Asami, S., Rakwalska-Bange, M., Carlomagno, T. & Reif, B. Protein–RNA interfaces probed by 1H-detected MAS solid-state NMR spectroscopy. Angew. Chem. Int. Ed. 52, 2345–2349 (2013).
Article CAS Google Scholar
Wasmer, C. et al. Amyloid fibrils of the HET-s(218-289) prion form a beta solenoid with a triangular hydrophobic core. Science 319, 1523–1526 (2008).
Article CAS ADS Google Scholar
Bertini, I. et al. High-resolution solid-state NMR structure of a 17.6 kDa protein. J. Am. Chem. Soc. 132, 1032–1040 (2010).
Article CAS Google Scholar
Demers, J. P. et al. High-resolution structure of the Shigella type-III secretion needle by solid-state NMR and cryo-electron microscopy. Nat. Commun. 5, 4976 (2014).
Article CAS Google Scholar
Fung, B. M., Khitrin, A. K. & Ermolaev, K. An improved broadband decoupling sequence for liquid crystals and solids. J. Magn. Reson. 142, 97–101 (2000).
Article CAS ADS Google Scholar
Morcombe, C. R. & Zilm, K. W. Chemical shift referencing in MAS solid state NMR. J. Magn. Reson. 162, 479–486 (2003).
Article CAS ADS Google Scholar
Delaglio, F. et al. Nmrpipe—A multidimensional spectral processing system based on Unix pipes. J. Biomol. NMR 6, 277–293 (1995).
Article CAS Google Scholar
Johnson, B. A. & Blevins, R. A. NMRView: a computer program for the visualization and analysis of NMR data. J. Biomol. NMR 4, 603–614 (1994).
Article CAS Google Scholar
Brunger, A. T. et al. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. Sect. D 54, 905–921 (1998).
Article CAS Google Scholar
Carlomagno, T. et al. Structural principles of RNA catalysis in a 2'-5' lariat-forming ribozyme. J. Am. Chem. Soc. 135, 4403–4411 (2013).
Article CAS Google Scholar
Nozinovic, S. et al. High-resolution NMR structure of an RNA model system: the 14-mer cUUCGg tetraloop hairpin RNA. Nucleic Acids Res. 38, 683–694 (2010).
Article CAS Google Scholar
Zanier, K. et al. Structure of the histone mRNA hairpin required for cell cycle regulation of histone gene expression. RNA 8, 29–46 (2002).
Article CAS Google Scholar
Linge, J. P. et al. Refinement of protein structures in explicit solvent. Proteins 50, 496–506 (2003).
Article CAS Google Scholar
Koradi, R., Billeter, M. & Wüthrich, K. MOLMOL: a program for display and analysis of macromolecular structures. J. Mol. Graphics 14, 51–55 (1996).
Article CAS Google Scholar
Pettersen, E. F. et al. UCSF chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article CAS Google Scholar

Download references

Acknowledgements

This work has been supported by the BMBF through grant 0315870B to T.C.

Author information

Authors and Affiliations

Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117, Germany
Alexander Marchanka, Bernd Simon & Teresa Carlomagno
Bruker BioSpin, Silberstreifen 4, Rheinstetten, 76287, Germany
Gerhard Althoff-Ospelt
Helmholtz Zentrum für Infektionsforschung, Inhoffenstrasse 7, Braunschweig, 38124, Germany
Teresa Carlomagno

Authors

Alexander Marchanka
View author publications
You can also search for this author in PubMed Google Scholar
Bernd Simon
View author publications
You can also search for this author in PubMed Google Scholar
Gerhard Althoff-Ospelt
View author publications
You can also search for this author in PubMed Google Scholar
Teresa Carlomagno
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.M. produced samples, developed and performed both NMR experiments and structure calculations, interpreted the results and wrote the manuscript together with T.C. B.S. contributed to the development of pulse programs and structural calculation protocols. G.A. helped in the acquisition of NMR data. T.C. designed the project, contributed to the development of pulse programs and structural calculation protocols, interpreted the results and wrote the manuscript together with A.M. All the authors discussed the results reported in the manuscript.

Corresponding authors

Correspondence to Alexander Marchanka or Teresa Carlomagno.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-10, Supplementary Tables 1-2 and Supplementary References (PDF 2535 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Marchanka, A., Simon, B., Althoff-Ospelt, G. et al. RNA structure determination by solid-state NMR spectroscopy. Nat Commun 6, 7024 (2015). https://doi.org/10.1038/ncomms8024

Download citation

Received: 22 January 2015
Accepted: 25 March 2015
Published: 11 May 2015
DOI: https://doi.org/10.1038/ncomms8024

This article is cited by

Structural characterization of a protein adsorbed on aluminum hydroxide adjuvant in vaccine formulation
- Linda Cerofolini
- Stefano Giuntini
- Marco Fragai
npj Vaccines (2019)
High-resolution small RNA structures from exact nuclear Overhauser enhancement measurements without additional restraints
- Parker J. Nichols
- Morkos A. Henen
- Beat Vögeli
Communications Biology (2018)
Isotope labeling for studying RNA by solid-state NMR spectroscopy
- Alexander Marchanka
- Christoph Kreutz
- Teresa Carlomagno
Journal of Biomolecular NMR (2018)
Gd3+-chelated lipid accelerates solid-state NMR spectroscopy of seven-transmembrane proteins
- Chang Liu
- Jing Liu
- Shenlin Wang
Journal of Biomolecular NMR (2017)
RNA structure by solid-state NMR

Nature Methods (2015)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.