Visualizing group II intron dynamics between the first and second steps of splicing

Manigrasso, Jacopo; Chillón, Isabel; Genna, Vito; Vidossich, Pietro; Somarowthu, Srinivas; Pyle, Anna Marie; De Vivo, Marco; Marcia, Marco

doi:10.1038/s41467-020-16741-4

Published: 05 June 2020

Nature Communications volume 11, Article number: 2837 (2020)

An Author Correction to this article was published on 04 January 2022

This article has been updated

Abstract

Group II introns are ubiquitous self-splicing ribozymes and retrotransposable elements evolutionarily and chemically related to the eukaryotic spliceosome, with potential applications as gene-editing tools. Recent biochemical and structural data have captured the intron in multiple conformations at different stages of catalysis. Here, we employ enzymatic assays, X-ray crystallography, and molecular simulations to resolve the spatiotemporal location and function of conformational changes occurring between the first and the second step of splicing. We show that the first residue of the highly conserved catalytic triad is protonated upon 5′-splice-site scission, promoting a reversible structural rearrangement of the active site (toggling). Protonation and active site dynamics induced by the first step of splicing facilitate the progression to the second step. Our insights into the mechanism of group II intron splicing parallel functional data on the spliceosome, thus reinforcing the notion that these evolutionarily related molecular machines share the same enzymatic strategy.

Introduction

Self-splicing group II intron ribozymes are essential regulators of gene expression in all domains of life, and they share evolutionary origins and enzymatic properties with the spliceosome, the eukaryotic machinery that catalyzes nuclear splicing of mRNA precursors^1,2. Spliced group II introns are active retrotransposable elements that contribute to genomic diversification, with potential applications in medicine and gene editing^3,4. Therefore, elucidating the mechanism of group II intron catalysis is crucial for understanding key steps in gene expression and RNA maturation, and for developing therapeutic and biotechnological tools.

The current understanding of the group II intron self-splicing mechanism derives from biochemical and cell biology studies^5,6,7,8 and from 3D structures of introns from various phylogenetic classes^9,10,11,12,13,14. These studies have provided detailed molecular insights into intron folding¹⁵ and high-resolution molecular snapshots of the Oceanobacillus iheyensis group IIC intron trapped in various conformations throughout the catalytic cycle^16,17,18,19,20.

The intron catalytic site comprises a highly conserved triple helix formed by nucleotides of the so-called catalytic triad (in domain D5, C358-G359-C360), two-nucleotide bulge (D5, A376-C377), and J2/3 junction (between D2 and D3, A287-G288-C289, all numbering from the crystallized form of the O. iheyensis intron, i.e. PDB id: 4FAQ; Supplementary Fig. 1a, b). The site also harbors a metal-ion cluster formed by two divalent (M1–M2) and two monovalent (K1–K2) ions (Supplementary Fig. 1a). These ions participate directly in catalysis^16,19, which occurs via a series of nucleophilic S_N2 reactions (Fig. 1a). In the first step of splicing, depending on whether the intron follows a hydrolytic or a transesterification mechanism, respectively²¹, a water molecule or the 2′-OH group of a bulged adenosine in D6, activated by M2 and by the triple helix, attacks the 5′-splice junction of the precursor (5e-I-3e), forming an intron/3′-exon intermediate (I-3e), in which the scissile phosphate (SP) is coordinated by K2. In the second step of splicing, the 5′-exon (5e), activated by M1, performs a nucleophilic attack on the 3′-splice junction, releasing ligated exons (5e-3e) and a linear or lariat form of the excised intron (I; Fig. 1a). The latter can then further reverse splice into cognate or non-cognate genomic DNA, in processes known as retrohoming or retrotransposition^22,23. Crystal structures of the pre- and post-hydrolytic states are available for the first and second steps of splicing, allowing precise localization of reactants^13,16, and computational studies have elucidated energetics and dynamics of the related reaction chemistry^24,25.

However, a key aspect of the group II intron splicing cycle that remains largely uncharacterized is the transition between the splicing steps, when the intron must release products of the first reaction and recruit substrates of the second splicing event. Biochemical and structural studies suggest that, after the first step of splicing, the intron rearranges at the K1-binding site, transiently adopting a specific inactive conformation (aka the toggled conformation), in which G288 (in the J2/3 junction) and C377 (in the two-nucleotide bulge) disengage from their triple helix with the catalytic triad of nucleotides in D5, thereby disrupting the catalytic metal center^16,26 (Supplementary Fig. 1a). Parallel studies also suggest that group II intron conformational changes may be triggered by protonation of active site nucleotides during the splicing cycle²⁷. Specifically, the N1 atom of adenosines (N1A) and the N3 atom of cytosines (N3C) can undergo large pK_A shifts in folded DNA or RNA and thereby serve as proton donors/acceptors, much like histidine residues in proteins²⁸. Consistent with this, functional studies on the spliceosome suggest that protonation within the U6 intramolecular stem-loop (ISL), which is analogous to the group II intron two-nucleotide bulge and catalytic triad, antagonizes binding of catalytic metal ions and induces transient base-flipping during splicing^29,30.

To understand the transition between the first and second step of splicing, here we probe the group II intron active site by mutagenesis, enzymatic assays, crystallography, and molecular dynamics (MD) modeling. We find that, immediately after the first step of splicing, protonation of a conserved nucleobase within the catalytic triad promotes the spontaneous release of K1 and induces intron toggling. Consistent with this finding, intron mutants that cannot be protonated have defects in the second step of splicing. Our group II intron data have parallels with functional studies on the nuclear spliceosome, suggesting that protonation and toggling are common mechanistic strategies that are adopted by both these splicing machines.

Results

A catalytic residue may become protonated during splicing

Because crystal structures of distinct states of the O. iheyensis group II intron are available, we first analyzed these structures using continuum electrostatics to obtain an initial qualitative approximation of the pK_A values of active site nucleotides (Supplementary Table 1). Using nonlinear Poisson–Boltzmann calculations, we noted that the pK_A value of most residues remains unchanged (Supplementary Table 1). By contrast, the computed pK_A value of C358 (catalytic triad) shifts between the pre-hydrolytic state (pK_A ~ 4.5 in PDB id: 4FAQ) and the so-called toggled state that forms after the first step of splicing¹⁶ (pK_A ~ 7.2 in PDB id: 4FAU). Although these values are qualitative due to the influence of geometrical changes and uncertainties in the definition of the grid and dielectric constants, the Poisson–Boltzmann calculations suggest that C358 has different protonation states along the splicing trajectory (Supplementary Fig. 1c). Consistent with these findings, nucleotide position 358 in other introns can be occupied by an adenine or a cytosine, i.e., bases that can be protonated, but this same position never varies to guanine or uracil, i.e., bases that cannot be protonated³¹.
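To give a feel for what a pK_A shift of this size implies, the minimal sketch below applies the Henderson–Hasselbalch relation to the two computed values. The solution pH of 7.0, the treatment of N3^C358 as a single independent titratable site, and the function name are assumptions made for illustration only; because the computed pK_A values are themselves qualitative, the resulting fractions should be read the same way.

```python
def protonated_fraction(pka: float, ph: float) -> float:
    """Fraction of a single, independent titratable site that is protonated
    at a given pH (Henderson-Hasselbalch relation)."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


PH = 7.0  # assumed solution pH, not a value taken from the article

# Computed pK_A values of C358 quoted in the text for the two crystal structures.
for label, pka in (("pre-hydrolytic (4FAQ)", 4.5), ("toggled (4FAU)", 7.2)):
    fraction = protonated_fraction(pka, PH)
    print(f"{label}: pKa ~ {pka}, protonated fraction ~ {fraction:.3f}")
```

Under these assumptions, C358 would be almost entirely deprotonated in the pre-hydrolytic state and protonated more than half of the time in the toggled state.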

Computational studies on the O. iheyensis group II intron immediately after 5e hydrolysis have identified proton transfer pathways from the reaction nucleophile into the bulk solvent involving up to five water molecules (corresponding to a migration distance of ~15 Å)²⁴. Although less efficient than direct proton transfer, such chains of water molecules enable a proton to shuttle from the nucleophile to the N3 atom of C358 (N3^C358), which is exposed within the same solvent-filled cavity at a distance of 9.8 Å in the structure of the pre-hydrolytic state (PDB id: 4FAQ)³² (Supplementary Fig. 2a). Moreover, our hybrid quantum (DFT/BLYP)/classical simulations show that, once a proton is positioned at the N3 atom, C358 remains stably protonated for over 15 ps (see “Methods” section and Supplementary Fig. 2b, c).

Taken together, our observations from continuum electrostatics and quantum mechanical (QM) simulations, the specific evolutionary conservation pattern of C358, and its key structural role in the pre-hydrolytic state suggested that C358 plays a direct role in group II intron catalysis.

Non-protonatable mutants show second step splicing defects

To explore the functional role of C358 in reaction chemistry, we created O. iheyensis splicing precursor constructs¹⁶ in which C358 was replaced with A, G, or U. In addition, to maintain the structural integrity of the catalytic triple helix, we isosterically replaced the two partners of C358, i.e., its Watson–Crick pairing partner (position 385) and its J2/3 triple-helical partner (position 289) (Supplementary Fig. 1d). After incorporating the resulting triple base mutations (C289A/C358A/G385U, aka the A-mutant; C289G/C358G/G385C, aka the G-mutant; and C289U/C358U/G385A, aka the U-mutant), we monitored effects on splicing kinetics.

We found that the A-mutant—which can be protonated at position 358—splices at rates comparable to wild type, whereas the G and U mutants—which cannot be protonated at position 358—have splicing defects. Specifically, in the presence of near-physiological potassium and magnesium concentrations, the first splicing step of the G-mutant is ~12-fold slower and that of the U-mutant ~7-fold slower than in wild type. Moreover, the second splicing step of the G-mutant is ~48-fold slower and that of the U-mutant ~8-fold slower than in wild type (Fig. 1b, c and Supplementary Table 2). Most remarkably, both G and U mutants show accumulation of linear I-3e intermediate, which indicates that these mutants stall after the first step of splicing and have difficulty progressing into the second step (Fig. 1c, middle panel). These defects are comparable to those of other intron mutants designed to perturb the catalytic triad, such as ai5γ intron double mutants that carry G or U mutations at the nucleotide position analogous to O. iheyensis residue 358 and compensatory mutations of its corresponding Watson–Crick pair³³. Finally, the splicing defects of our triple mutants are comparable to those of other O. iheyensis group II intron mutants designed to impair toggling, such as the C377G mutant reported in previous studies¹⁶. In this way, our enzymatic data connect defects in the transition between the two steps of splicing to specific active site mutations that prevent protonation on C358.

The mutants are structurally intact but do not toggle

To understand the splicing defects of the G and U mutants at the molecular level, we inserted the corresponding mutations into the previously described Oi5eD1-5 construct¹⁶ and visualized the mutant active site by X-ray crystallography.

First, we determined crystal structures of the G and U mutants in the presence of potassium and magnesium at 3.4 and 3.6 Å resolution, respectively (Table 1). Both mutants have a folded structure similar to that of the post-hydrolytic state of the wild-type intron after the first step of splicing (PDB id.: 4FAR; root mean square deviation (RMSD)_4FAR-Gmutant = 0.49 Å, RMSD_4FAR-Umutant = 0.43 Å; Fig. 2a, d). Importantly, both mutant structures adopt the triple-helical configuration that corresponds to that of the wild-type intron structure (Fig. 2a and Supplementary Fig. 1d). The F_o − F_c simulated-annealing electron density omit maps calculated by omitting the J2/3 residues and the catalytic metal cluster reveal strong electron density signal for the triple helix conformer, as in wild type (total peak height for the nucleobase of G288 = 8.9 σ and 6.7 σ for the G and U mutants, respectively; maximum peak height for the metals = 9.5 σ and 6.6 σ for the G and U mutants, respectively; Fig. 2b). Moreover, the F_o − F_c maps calculated by omitting the first intron nucleotide (G1) show that the 5′-splice junction has undergone cleavage in both mutants during the crystallization process (Fig. 2c). In summary, the similarity of these mutant structures to that of wild type suggests that, despite some reductions in rate, the first step of splicing is structurally and mechanistically unaffected by the G and U mutations.

Table 1 Data collection and refinement statistics (molecular replacement).

**Fig. 2: Crystal structures of the intron in potassium and magnesium.**

We then determined the crystal structures of the G and U mutants in the presence of sodium and magnesium at 3.2 and 3.3 Å resolution, respectively (Table 1). In this case, both mutants adopt overall structures similar to wild type (PDB id.: 4FAX; RMSD_4FAX-Gmutant = 3.9 Å, RMSD_4FAX-Umutant = 0.75 Å; Fig. 3a). However, the detailed architecture of the active site differs significantly from wild type under sodium conditions. For wild type, these conditions induce a rotation of the backbone in the J2/3 region, which breaks the triple helix structure and generates the so-called toggled conformation that is implicated in the transition between the first and the second step of splicing¹⁶ (Fig. 3b, c). By contrast with wild type, the G and U mutants in sodium maintain the triple helix configuration, as revealed by the F_o − F_c maps calculated by omitting the J2/3 residues and the catalytic metals (total peak height for the triple helix conformer of the G288 nucleobase = 7.5 σ in the G-mutant and = 9.3 σ in the U-mutant; Fig. 3b, c). Therefore, these structures show that the G and U mutants are unable to adopt the toggled conformation, which may explain their tendency to stall after the first step of splicing.

**Fig. 3: Crystal structures of the intron in sodium and magnesium.**

Taken together, the enzymatic and structural data suggest that C358 protonation and active site toggling facilitate the rearrangement of the intron active site between the two steps of splicing.

Scission of the 5e disrupts the catalytic metal cluster

To establish how C358 protonation and active site toggling are mechanistically connected, and to understand the chain of events that regulate active site rearrangement, we performed force-field-based MD simulations. We used a flexible nonbonded approach for the metal center (see “Methods” section), followed by comparative analyses of multiple systems built using the published structures of the wild-type O. iheyensis group II intron captured at different stages of catalysis^16,20 and the structures of the G and U mutants. These structures represent the highest resolution crystallographic data available for group II introns and display the most detailed architecture of an intron active site, including all metals and first splicing step reactants^16,20.

We initially investigated the dynamics of the wild-type intron in the pre-hydrolytic state (PDB id: 4FAQ; two classical MD simulations for ~600 ns and ~1.2 μs, respectively). We observed that, shortly after equilibration (~25 ns), K1 shifted closer to the N7 atom of G288 (N7^G288), which was concomitant with the weakening of the K1 interaction with O5′^G359 observed in the crystal structure (d_K1-N7G288 = 2.98 ± 0.27 Å in the simulations, d_K1-N7G288 = 4.3 Å in PDB id: 4FAQ, Fig. 4 and Supplementary Fig. 3). In both simulations, the system was structurally stable, especially nucleotides within the active site (domain D5 and junction J2/3). This was reflected in the average RMSD = 1.95 ± 0.27 Å (Supplementary Fig. 3) and the fact that catalytic triad residues maintained positions observed in the crystal structure (d_M1–M2 = 4.24 ± 0.04 Å in the simulations, d_M1–M2 = 4.3 Å in PDB id: 4FAQ). These simulations suggest that the pre-hydrolytic configuration does not have a tendency to undergo structural rearrangements.

**Fig. 4: Importance of the K1 interaction with N7^G288.**

We then investigated the dynamics of the wild-type intron after 5e hydrolysis, thus considering the post-hydrolytic state (PDB id: 4FAR) in protonated (three simulations, ~350 ns each) and non-protonated (six simulations, ~750 ns each) configurations (Supplementary Fig. 4). In these simulations, the overall structural fold was stably maintained, with an averaged RMSD of 4.72 ± 0.65 Å. As in the simulations of the pre-hydrolytic state, K1 shifted closer to the N7^G288 after equilibration (~25 ns, d_K1-N7G288 = 2.82 ± 0.15 Å in the simulations, d_K1-N7G288 = 4.4 Å in PDB id: 4FAR). However, none of the post-hydrolytic systems were able to release the products of the first step of splicing. For example, the SP appears locked by the K2 ion in the proximity of the active site and the nucleobase of G1 remains stably coordinated to M1–M2 (Supplementary Fig. 4). These observations suggest that the post-hydrolytic crystal structures used in these simulations may represent an unproductive low energy configuration of the intron that is not directly relevant to the pre-second step splicing configuration.

To address this issue, we modeled an active site state of the wild-type intron that would provide an improved starting point for simulations. We started with the structure of the pre-hydrolytic state (PDB id: 4FAQ), broke the scissile bond, and inverted the stereochemistry of the SP (further modeling details in “Methods” section and in Supplementary Fig. 5). This state represents the intron immediately after the first step of splicing, where the SP has just been cleaved but is still coordinated by M1 and M2 (Supplementary Fig. 5). For this ‘cleaved’ state, too, we simulated both protonated and non-protonated forms of C358 (two simulations per system, ~600 ns per simulation). In all cases, the system showed considerable stability, with an overall RMSD of 4.61 ± 0.81 Å. During these simulations, the K1–N7^G288 interaction was formed and initially preserved. Moreover, the SP was not sequestered by K2 outside the active site. In other words, the distance between the SP and M2 was constantly maintained at d_SP-M2 = 3.23 ± 0.10 Å (Supplementary Fig. 5). Intriguingly, in the protonated state, after ~20 ns of simulation, a water molecule bridged O6^G288 and M2, such that these two atoms became closer to each other (d_M2-O6 = 5.71 Å in PDB id: 4FAQ; d_M2-O6 = 4.75 ± 0.23 Å in the simulations; Fig. 5 and Supplementary Fig. 5). Concomitantly, the value of d_M1–M2 increased from 4.31 to 5.05 ± 0.13 Å (Fig. 5 and Supplementary Fig. 5). Importantly, at this point, the coordination shell of K1 was perturbed, and the K1–N7^G288 interaction broke, leading to the spontaneous release of K1 from the active site into the bulk solvent after just an additional ~30 ns (Fig. 5 and Supplementary Fig. 5). Notably, these events also occurred in the non-protonated state, although less promptly. In this case, the initial conformational changes occurred after ~200 ns, with K1 released soon after, at ~250 ns.

Interestingly, in all cases, the conformational ensemble of the active site after the release of K1 differed from the characteristic triple helix configuration. To specifically monitor the triple helix geometry, we used the following two geometrical parameters: (1) the distance between the O2 atom of C289 (J2/3) and the N4 atom of C358 (D5, catalytic triad) (d_289–358), which adopts values ≤ 3 Å in the triple helix configuration and >3 Å when the triple helix is disrupted; and (2) the angle α between the nucleobase planes of C358 and its Watson–Crick pair G385, which adopts values ≤ 0.35 rad in the triple helix configuration and >0.35 rad when the triple helix is disrupted (Supplementary Fig. 6). Indeed, d_289–358 = 2.7 Å and α = 0.17 rad in the crystallized pre-hydrolytic state (PDB id: 4FAQ), which harbors K1 and adopts the triple helix configuration. Notably, though, after K1 release in our MD simulations, d_289–358 reached average values of ~4.88 ± 1.05 Å and α reached average values of ~0.63 ± 0.14 rad in the protonated state (~3.07 ± 0.14 Å and ~0.47 ± 0.10 rad in the non-protonated state, respectively), suggesting that the triple helix is destabilized and the active site may toggle under these conditions (Supplementary Fig. 6).
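The two geometrical criteria can be sketched in a few lines of code. The example below is only an illustration under stated assumptions: each nucleobase plane is defined by three ring atoms, the interplanar angle is taken as the angle between the plane normals folded into the range 0 to π/2, and all function names are hypothetical. The article does not specify how α was computed, so this is one plausible reading and not the authors' procedure.

```python
import math
from typing import Sequence

Point = Sequence[float]  # (x, y, z) coordinates in angstroms


def _sub(a: Point, b: Point) -> tuple:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _cross(a: Point, b: Point) -> tuple:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def distance(a: Point, b: Point) -> float:
    """Euclidean distance between two atoms."""
    return math.sqrt(sum(c * c for c in _sub(a, b)))


def plane_normal(p1: Point, p2: Point, p3: Point) -> tuple:
    """Unit normal of the plane through three non-collinear atoms."""
    n = _cross(_sub(p2, p1), _sub(p3, p1))
    length = math.sqrt(sum(c * c for c in n))
    return tuple(c / length for c in n)


def interplanar_angle(base1: Sequence[Point], base2: Sequence[Point]) -> float:
    """Angle (rad) between two base planes, each given as three atoms.
    The absolute value removes the arbitrary sign of the normals."""
    n1 = plane_normal(*base1)
    n2 = plane_normal(*base2)
    cos_alpha = abs(sum(x * y for x, y in zip(n1, n2)))
    return math.acos(min(1.0, cos_alpha))


def triple_helix_intact(d_289_358: float, alpha: float,
                        d_max: float = 3.0, alpha_max: float = 0.35) -> bool:
    """Apply the two thresholds quoted in the text (3 A and 0.35 rad)."""
    return d_289_358 <= d_max and alpha <= alpha_max


# Values quoted in the text
print(triple_helix_intact(2.7, 0.17))   # crystallized pre-hydrolytic state
print(triple_helix_intact(4.88, 0.63))  # protonated state after K1 release
```

With the values reported above, the crystallized pre-hydrolytic state satisfies both criteria, whereas the averages after K1 release fail them in the protonated and in the non-protonated state.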

**Fig. 5: Protonation of N3^C358 favors K1 release.**

Finally, we also simulated the crystallized G and U mutants in the cleaved and post-hydrolytic states (eight simulations, ~600 ns each; Supplementary Figs. 7 and 8). We noted that the K1–N7^G288 interaction was not stably formed in the mutants, preventing K1 release (Supplementary Figs. 7 and 8). Thus, the triple helix was stabilized in its crystallographic conformation. For example, in the simulations, d_289–358 = 2.67 ± 0.24 Å and α = 0.19 ± 0.11 rad for the G-mutant (α = 0.15 rad in PDB id: 6T3K) and d_289–358 = 1.95 ± 0.23 Å and α = 0.33 ± 0.13 rad for the U-mutant (α = 0.31 rad in PDB id: 6T3R) (Supplementary Figs. 7 and 8). These simulations suggest that the G and U mutants are unlikely to toggle in their cleaved form.

Taken together, these data suggest that K1 is stably bound to the active site in the pre-hydrolytic state of the wild-type intron, but is spontaneously released from the active site immediately after 5e hydrolysis. The release of K1 breaks the catalytic triple helix, and the intron begins sampling the toggled conformation. Such a rearrangement is significantly favored by protonation of N3^C358, and it does not occur in the G and U mutants, which cannot be protonated.

The K1–N7^G288 interaction stabilizes the intron active site

Interestingly, in the simulations of the wild-type intron described above, but not in the simulations of the mutants, K1 establishes a stable interaction with N7^G288 within a very short time after equilibration (Figs. 4, 5 and Supplementary Figs. 4–8). Moreover, simulations of the cleaved state immediately after 5e hydrolysis showed that interaction with N7^G288 is a necessary step for releasing K1 from the active site (Fig. 5 and Supplementary Fig. 5). Importantly, an N7-deaza mutation at position G288 was shown to impair the first step of splicing³⁴. These observations suggest that the K1–N7^G288 interaction may be structurally and functionally important for splicing.

To test this hypothesis, we modeled the N7-deaza mutation at G288 in the pre-hydrolytic state (PDB id: 4FAQ), and we tested the importance of the K1–N7^G288 interaction for the proper folding of the active site. Three classical MD simulations of these in silico mutants (~200 ns each) showed that the loss of the K1–N7^G288 interaction irreversibly destabilized the triple helix, causing separation of M1–M2 (averaged d_M1–M2 = 5.28 ± 0.12 Å, Fig. 4b) and eventually leading to the unfolding of the active site.

These data suggest that the K1–N7^G288 interaction plays a crucial role in preventing premature release of K1 and consequent disruption of the triple helix.

Toggling energetics agree with catalytic rate constants

To appropriately sample and semi-quantitatively evaluate the energetics associated with intron toggling, we used path-metadynamics (MtD)³⁵. We performed enhanced sampling MtD simulations starting from either the cleaved protonated or non-protonated wild-type models and terminating at the toggled state (referred to as the cH⁺→T and the c→T transitions, respectively; see details in “Methods” section). The reference path involves exclusively the J2/3 junction, which rearranges as defined from structural data¹⁶, and employs two collective variables that trace (1) the progress of the system along the reference path (variable S), and (2) the distance of the sampled conformations from the reference path (variable Z). In this way, MtD simulations sample the conformational space to find the lowest energy path for the conformational change under investigation. Notably, the non-bonded metal cluster M1–M2–K1–K2 and its extended coordination shell at the catalytic site can freely explore conformational space during these simulations.
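For readers unfamiliar with path collective variables, the sketch below shows one widely used definition of S and Z, in which a conformation is compared with N reference frames through its squared deviations d_i: S = Σ i·exp(−λd_i) / Σ exp(−λd_i) and Z = −(1/λ)·ln Σ exp(−λd_i). This is an assumption about the general form of such variables, not a statement of the exact implementation or parameters used here. The function name, the smoothing parameter `lam`, and the example distances are hypothetical.

```python
import math
from typing import Sequence, Tuple


def path_cvs(sq_dists: Sequence[float], lam: float) -> Tuple[float, float]:
    """Return (S, Z) for one conformation.

    sq_dists[i] is the squared deviation of the conformation from
    reference frame i + 1 of the path; lam is a smoothing parameter.
    """
    d_min = min(sq_dists)
    # Shift by the smallest distance so the exponentials cannot underflow to zero.
    weights = [math.exp(-lam * (d - d_min)) for d in sq_dists]
    total = sum(weights)
    # S: weighted average of the frame index (progress along the path).
    s = sum(i * w for i, w in enumerate(weights, start=1)) / total
    # Z: soft minimum of the distances (deviation from the path).
    z = d_min - math.log(total) / lam
    return s, z


# Hypothetical conformation lying closest to the second of three frames.
s, z = path_cvs([1.0, 0.0, 1.0], lam=1.0)
print(f"S = {s:.2f}, Z = {z:.2f}")
```

In this form, S increases smoothly from 1 to N as the system advances from the first to the last reference frame, while a large Z flags conformations that stray from the reference path, which is what lets the sampling explore alternative routes.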

Mechanistically, in simulations where C358 was protonated, the system first sampled a large, deep free energy minimum that contained multiple isoenergetic conformational states. While A287 freely explored the conformational space, C358 protonation disrupted the canonical WC base pairing with G385, leading to C358 rotation (d_289–358 = 5.85 ± 1.94 Å and α = 0.34 ± 0.08 rad, state A, Fig. 6). This spontaneous rearrangement promoted hydration of the K1-binding site, with consequent prompt release of K1 to the bulk solvent, disruption of the hydrogen-bond contacts between C358 and C289, and further separation of these two residues (d_289–358 = 10.33 ± 1.75 Å and α = 1.24 ± 0.18 rad, state B, Fig. 6). In this conformation, the flexibility of J2/3 nucleotides was enhanced, allowing G288 and C289 to rotate out of the active site and to stack with A287, thus enabling the disruption of the C377–C360 base pair (Toggled state, Fig. 6). The computed energetic barrier for this overall transition (ΔG^‡_cH⁺-T) was ~20 kcal mol⁻¹, while the final metastable toggled state had a value of about +5 kcal mol⁻¹ relative to the triple helix conformer (Fig. 6).

**Fig. 6: Energetics associated with intron toggling in the protonated state.**

Nucleotides within J2/3 also rearranged in the non-protonated configuration, albeit with different dynamics and higher energy barriers. Indeed, with the spontaneous rotation of A287, the intron rearranged into the first intermediate state (d_289–358 = 2.97 ± 0.21 Å and α = 0.17 ± 0.08 rad, state A′, Supplementary Fig. 9, which is the lowest free energy minimum), in which K1 is more exposed to the bulk water. K1 is then released simultaneously with the partial rotation of G288. This led to the disruption of the triple helix and the formation of a second intermediate state (d_289–358 = 7.97 ± 1.19 Å and α = 0.31 ± 0.16 rad, state B′, Supplementary Fig. 9). Finally, the stacking of A287 and C289 with G288, together with the rotation of C377, completed the conformational rearrangement and formed the final toggled state (Supplementary Fig. 9). The computed free energy barrier for this transition (ΔG^‡_c-T) was ~25 kcal mol⁻¹, which is therefore less favorable than that of the protonated intron (ΔG^‡_cH⁺-T ~20 kcal mol⁻¹). Importantly, the final metastable toggled state had a value of about +5 kcal mol⁻¹ relative to the lowest free energy minimum (state A′, Supplementary Fig. 9). These computed activation barriers are a good match with empirical values calculated using the experimental splicing rate constants inserted into the Eyring–Polanyi equation^36,37 (k₁ = 0.031 ± 0.003 min⁻¹ → ΔG^‡₁ = 22.8 kcal mol⁻¹; k₂ = 0.026 ± 0.003 min⁻¹ → ΔG^‡₂ = 22.9 kcal mol⁻¹). This result corroborates our proposed toggling mechanism, indicating that conformational rearrangements of the intron active site between the catalytically active triple helix configuration and the toggled structure captured crystallographically¹⁶ are compatible with catalysis.
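The conversion from rate constants to activation free energies can be reproduced with a short calculation. The sketch below uses the Eyring–Polanyi equation in the form ΔG^‡ = −RT·ln(k·h / (k_B·T)). It assumes a transmission coefficient of 1 and a temperature of 310 K, neither of which is stated in this section; the function name is likewise illustrative.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J*s
R = 1.987204e-3      # gas constant, kcal/(mol*K)


def eyring_barrier(k_per_min: float, temperature_k: float = 310.0) -> float:
    """Activation free energy (kcal/mol) for a first-order rate constant
    given in min^-1, assuming a transmission coefficient of 1.
    The default temperature of 310 K is an assumption, not a value
    taken from the article."""
    k_per_s = k_per_min / 60.0
    return -R * temperature_k * math.log(k_per_s * H / (K_B * temperature_k))


# Experimental splicing rate constants quoted in the text
for name, k in (("first step, k1", 0.031), ("second step, k2", 0.026)):
    print(f"{name} = {k} min^-1 -> dG ~ {eyring_barrier(k):.1f} kcal/mol")
```

Under these assumptions the results fall close to the reported 22.8 and 22.9 kcal mol⁻¹. Because the barrier depends only logarithmically on the rate constant, the ±0.003 min⁻¹ experimental uncertainties shift it by less than 0.1 kcal mol⁻¹.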

Discussion

By combining structural, enzymatic, and computational methods, we have elucidated the molecular mechanism for the transition between the two steps of group II intron splicing and we have described the dynamic behavior of the intron active site as it moves through the splicing process (Fig. 7 and Supplementary Movie 1).

**Fig. 7: Revised group II intron splicing cycle.**

In the pre-hydrolytic state, the group II intron adopts the triple helix conformation, which coordinates the heteronuclear metal cluster M1–M2–K1–K2^16,19. M2 and the phosphate backbone of C358 deprotonate the reaction nucleophile to catalyze the scission of the 5′-splice site. At this stage, the proton released into bulk solvent by the reaction nucleophile²⁴ is transferred to the N3 atom on the C358 nucleobase, either via specific proton transfer pathways, as previously proposed²⁴ (an illustration of one possible transfer pathway is reported in Supplementary Fig. 2a) or via simple diffusion through the solvent. Independent of the exact proton transfer mechanism, hybrid quantum-classical simulations suggest that C358 remains stably protonated on N3, never exchanging its proton with surrounding water in the quantum region (Supplementary Fig. 2b, c). Indeed, it is remarkable that position 358 is occupied by an adenosine (which is readily protonated) in the majority of group II introns and in the spliceosome, but it never varies to G or U (which are nucleobases that cannot be protonated). C358 protonation thus emerges as a previously unrecognized event that stimulates the progression of the intron toward the second step of splicing. Indeed, when we experimentally replaced C358 with adenosine, splicing was unaffected. But when we replaced C358 with either a G or a U, the mutated intron accumulated linear I-3e intermediate, indicating a defect in the progression to the second step of splicing. Notably, and in line with the MD simulations of protonated and non-protonated wild-type intron in the cleaved post-hydrolytic state (see below), splicing is not completely inhibited in the mutants, suggesting that protonation accelerates splicing but is not essential. These differences in kinetics likely constitute a phenotypic advantage for the intron, which has preserved protonatable residues at position 358 throughout evolution.

When position 358 is occupied by a G or U residue, steric or electrostatic perturbations may also contribute to the observed splicing defects. The extent of such perturbations may be qualitatively inferred from the behavior of the A-mutant, which, despite being protonatable at position 358, contains a bulkier purine substitute. Remarkably, splicing defects of the A-mutant are minimal (approximately twofold, Fig. 1b, c and Supplementary Table 2) and, crucially, they do not lead to accumulation of an I-3e intermediate (Fig. 1b, c and Supplementary Table 2), suggesting that the defects of the G and U mutants predominantly derive from their inability to become protonated at position 358. The crystal structures of these mutants additionally confirm that these constructs preserve triple helix architecture, so any perturbation of their active site must be minimal. Finally, the crystallographic data suggest that the G and U mutants do not sample the toggled conformation, thus impairing a critical rearrangement of the intron active site between the two steps of splicing.

MD simulations enabled us to dissect the precise sequence of events that lead from 5′-splice site cleavage to toggling. Most importantly, we observed that 5e hydrolysis induces a spontaneous and prompt release of K1 from the active site, as previously hypothesized¹⁶. This key event, which happens in the post-hydrolytic but not in the pre-hydrolytic state, is strongly favored by protonation of C358, which induces fast K1 release (after just ~50 ns when protonated, as compared to ~250 ns in the non-protonated state, with an energetic barrier of ΔG^‡_cH⁺-T ~ 20 kcal mol⁻¹ and ΔG^‡_c-T ~ 25 kcal mol⁻¹, respectively). These observations reveal that K1 is a highly dynamic ion, despite its tight coordination to nearly all active site residues in the catalytic triple helix configuration, and that it plays a crucial role during splicing. The interaction of K1 with N7^G288, in the J2/3 junction, is particularly important for stabilizing the intron active site in the catalytically competent configuration and for controlling the binding and release of K1 within the active site along the splicing cycle. These results explain why N7^G288-deaza mutants are defective in splicing³⁴.

As a result of K1 release, the triple helix conformation becomes unstable. Under such circumstances, G288 toggles out of the active site, undergoing backbone rotations that expose the Watson–Crick face of guanosine to functional groups in D3 and the Hoogsteen face to a cavity that is likely occupied by D6 and by 3′-splice junction residues^9,11,13,16. In this conformation, G288 is thus optimally placed to promote key interactions that facilitate the second step of splicing (see below). MtD simulations show that the energy required for such conformational toggling is compatible with catalytic rate constants. Importantly, mutants that are defective for toggling, either because they cannot be protonated or because their triple helix is stable even under conditions where the wild-type toggles (i.e. the G and U mutants described here, and the C377G mutant studied previously¹⁶), fail to progress onto the second step of splicing.
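To make the comparison between free energy barriers and rate constants concrete, the following minimal sketch converts the two barriers quoted above (~20 and ~25 kcal mol⁻¹) into rate constants with the Eyring equation. It assumes a transmission coefficient of 1 and a temperature of 310 K (the temperature of the classical MD runs); both are assumptions of this illustration, and the function name is hypothetical.

```python
import math

# Physical constants
K_B = 1.380649e-23      # Boltzmann constant, J/K
H = 6.62607015e-34      # Planck constant, J*s
R_KCAL = 1.987204e-3    # gas constant, kcal/(mol*K)


def eyring_rate(dg_kcal_mol: float, temperature_k: float = 310.0) -> float:
    """Rate constant (s^-1) for a given activation free energy (kcal/mol).

    Uses the Eyring equation with a transmission coefficient of 1,
    which is an assumption of this sketch.
    """
    prefactor = K_B * temperature_k / H
    return prefactor * math.exp(-dg_kcal_mol / (R_KCAL * temperature_k))


# Barriers quoted in the text for the protonated and non-protonated pathways.
k_protonated = eyring_rate(20.0)
k_neutral = eyring_rate(25.0)
speed_up = k_protonated / k_neutral

print(f"k(cH+ -> T) ~ {k_protonated * 60:.2g} min^-1")
print(f"k(c -> T)   ~ {k_neutral * 60:.2g} min^-1")
print(f"rate enhancement from protonation ~ {speed_up:.0f}-fold")
```

Under these assumptions, a 5 kcal mol⁻¹ difference in barrier height corresponds to a rate difference of roughly three orders of magnitude.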

Toggling of the J2/3 junction and progression to the second step of splicing is also likely to involve A287 (nucleotide γ). In our simulations, A287 establishes a canonical WC interaction with the second nucleobase of the intron (U2; d_U2-A287 = 2.13 ± 0.32 Å; Supplementary Fig. 10), which was maintained as long as K1 remained in the active site, but which was broken when K1 left the active site and the intron toggled. Such findings suggest that G288 toggling may be needed to release A287 from U2. This process would ensure the formation of the essential γ–γ′ interaction, in which A287 pairs with its partner nucleotide γ′ in D6 during the second step of splicing^9,34,38. After recruiting D6 via A287, the toggled intron would then re-establish the catalytic triple helix conformation by reverse toggling, explaining how the first and second steps of splicing are mechanistically connected. Based on the simulations, the toggled state is ~5 kcal mol⁻¹ higher in free energy compared to the triple helix state, suggesting that reverse toggling is energetically inexpensive. It is therefore tempting to speculate that protonation and toggling also occur at the end of the second step, which might favor the release of the splicing product, and reduce the chances of spliced exons reopening^22,23. In either case, these processes may be further facilitated by participation of the intron-encoded maturase protein.
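As a rough illustration of what a ~5 kcal mol⁻¹ free energy difference implies, the sketch below computes the equilibrium population ratio of the toggled and triple helix states. It assumes a simple two-state Boltzmann model at 310 K, which is a simplification introduced here and not part of the simulations themselves.

```python
import math

R_KCAL = 1.987204e-3  # gas constant, kcal/(mol*K)


def population_ratio(delta_g_kcal_mol: float, temperature_k: float = 310.0) -> float:
    """Population of the higher-energy state relative to the lower-energy one."""
    return math.exp(-delta_g_kcal_mol / (R_KCAL * temperature_k))


# Free energy of the toggled state relative to the triple helix state (from the text).
ratio = population_ratio(5.0)
fraction_toggled = ratio / (1.0 + ratio)

print(f"toggled : triple helix ~ {ratio:.1e}")
print(f"fraction toggled at equilibrium ~ {fraction_toggled:.1e}")
```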

The idea of a rearrangement involving J2/3 and formation of a transiently inactive intermediate is compatible with the mechanism of splicing via the branching pathway. Indeed, biochemical data and recent crystal structures of lariat introns show that the hydrolytic and transesterification pathways occur at the same active site, involve positioning of the reaction nucleophile (the proton donor) in the exact same structural position compared to nucleotide 358, and follow the same reaction chemistry^9,11,13,20,21. Consistently, G- and U-mutations at the 358-equivalent position of the lariat-forming ai5γ intron from Saccharomyces cerevisiae (A816G and A816U) display splicing defects similar to our G and U mutants³³. Moreover, a protonation-dependent structural rearrangement mechanism is strongly supported by functional data obtained on the spliceosome, which is evolutionarily and chemically analogous to the group II intron^1,2. In the spliceosome, the last G (G52 in yeast) of the conserved ACAGAGA box in U6 snRNA corresponds to intron G288³¹. This residue is in close proximity to the branch site³⁹, it interacts with the 5′-splice site, and it undergoes a rearrangement between the splicing steps⁴⁰ in a process that is modulated by protein subunits (i.e. Prp8, Prp16)^41,42 and potassium ions^43,44. Such reorganization of G52 facilitates the release of the 5′-end of spliceosomal introns from the active site after the first splicing step, while also favoring the recruitment of the 3′-splice junction into the active site for the second step of splicing⁴⁰. These rearrangements could be induced by protonation of nucleotides of the U6 ISL, which are analogous to the group II intron two-nucleotide bulge and catalytic triad because their protonation antagonizes binding of the catalytic metal ions to the spliceosome and induces a transient base-flipping conformational change^29,30.
Furthermore, G52 mutations in the spliceosome have an inhibitory effect on the second step of splicing⁴⁵, similar to the effects we described for G288 in the group II intron in this and in previous work^22,46. Finally, during the splicing cycle, the spliceosome adopts transiently inactive states, possibly similar to the group II intron inactive toggled conformation, to avoid processing non-ideal pre-mRNA substrates⁴⁷. In the light of these structural and functional analogies between the intron and the spliceosome, it seems plausible that conformational toggling and dynamics of catalytic metal ions in the active site may regulate spliceosomal activation, too.

In summary, through the integration of four X-ray structures of active site mutants and in vitro splicing assays with multi-microsecond classical molecular simulations and free energy calculations, we have elucidated the dynamical behavior and determined the functional role of structural rearrangements within the group II intron active site, showing how they contribute to the mechanism of RNA splicing. We have determined that critical dynamic processes are triggered by protonation of a highly-conserved catalytic residue, thereby promoting the transition between the first and the second steps of splicing. Importantly, the resulting mechanism explains the apparent paradox of how and why a tightly bound metal ion cluster can be broken and reformed during the catalytic cycle, thereby promoting a directional sequence of coordinated chemical reactions. These findings may help in future engineering of complex, RNA-based enzymes for use as biotechnological tools and gene-specific therapeutics^4,48.

Methods

Cloning and mutagenesis

The O. iheyensis group II intron constructs used in this work are the pOiA wild type and the OiD1-5 crystallization constructs¹⁶. All mutagenesis experiments were performed using the PfuUltra II Hotstart PCR Master Mix (Agilent). The restriction enzymes ClaI and BamHI used for template linearization were purchased from NEB. All constructs were confirmed by DNA sequencing (W. M. Keck Foundation Biotechnology Resource Laboratory, Yale University, and Eurofins).

In vitro transcription and purification

Following restriction with the appropriate endonucleases at 37 °C overnight, the intron was transcribed in vitro using T7 polymerase¹⁶. For crystallization purposes^17,18, it was then purified under non-denaturing conditions⁴⁹, re-buffered, and concentrated to 80 µM in 10 mM MgCl₂ and 5 mM sodium cacodylate pH 6.5. For splicing studies, the intron was radiolabeled during transcription, purified in a denatured state¹⁶, and subsequently refolded.

Splicing assays

Purified radiolabeled intron precursor was refolded by denaturation at 95 °C for 1 min in the presence of 40 mM Na-MOPS pH 7.5, and cooled at room temperature for 2 min. Subsequently, the appropriate monovalent ions were added to a final concentration of 150 mM. Finally, MgCl₂ was added to a final concentration of 5 mM to start the splicing reaction. The refolded precursor samples were incubated at 37 °C. One-microliter aliquots of the splicing reactions taken at specific time points were quenched by the addition of 20 μL gel loading solution containing urea and chilled on ice. The samples were analyzed on a denaturing 5% (w/v) polyacrylamide gel. The kinetic rate constants were calculated using the Prism 6 package (GraphPad Software).
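For readers unfamiliar with how a rate constant is extracted from a splicing time course, the following sketch fits a first-order decay to precursor fractions. It is only an illustration: the kinetic model is not specified in this section, the data points are synthetic, the function name is hypothetical, and a log-linear least-squares fit is used here in place of the nonlinear regression that Prism performs.

```python
import math


def fit_first_order_rate(times, fractions):
    """Estimate k in fraction(t) = A * exp(-k * t) by linear regression of ln(fraction) on t."""
    xs = list(times)
    ys = [math.log(f) for f in fractions]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return -sxy / sxx  # slope of the log-linear fit is -k


# Synthetic, noise-free example data (NOT measurements from this study).
K_TRUE = 0.05  # min^-1, arbitrary value chosen for illustration
time_points_min = [0, 5, 10, 20, 40, 60]
precursor_fraction = [math.exp(-K_TRUE * t) for t in time_points_min]

k_obs = fit_first_order_rate(time_points_min, precursor_fraction)
print(f"fitted k_obs = {k_obs:.3f} min^-1")
```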

Crystallization

The natively purified intron was mixed with a 0.5 mM spermine solution in 10 mM MgCl₂ and 5 mM sodium cacodylate pH 6.5, and with the crystallization buffer in a 1:1:1 volume ratio¹⁶. Crystals were grown at 30 °C by the hanging drop vapor diffusion method using 2 μL sample drops and 300 μL crystallization solution in a sealed chamber (EasyXtal 15-Well Tool, Qiagen). Crystals were harvested after 2–3 weeks. Crystals were cryo-protected in a solution containing the corresponding crystallization buffers supplemented with 25% ethylene glycol (EG) and immediately flash frozen in liquid nitrogen. The crystallization solutions used to solve the structures of the excised intron presented in this work were composed of: (1) 50 mM Na-HEPES pH 7.0, 100 mM magnesium acetate, 150 mM potassium chloride, 10 mM lithium chloride, 4% PEG 8000 for the G-mutant in potassium and magnesium (PDB id: 6T3K), (2) 50 mM Na-HEPES pH 7.0, 100 mM magnesium acetate, 150 mM potassium chloride, 10 mM lithium chloride, 4% PEG 8000 for the U-mutant in potassium and magnesium (PDB id: 6T3R), (3) 50 mM Na-HEPES pH 7.0, 100 mM magnesium acetate, 150 mM sodium chloride, 4% PEG 8000 for the G-mutant in sodium and magnesium (PDB id: 6T3N), and (4) 50 mM Na-HEPES pH 7.0, 100 mM magnesium acetate, 150 mM sodium chloride, 4% PEG 8000 for the U-mutant in sodium and magnesium (PDB id: 6T3S).

Structure determination

Diffraction data were collected with an X-ray beam wavelength of 0.979 Å and at a temperature of 100 K at beamlines 24ID-C and E (NE-CAT) at the Advanced Photon Source (APS), Argonne, IL, and processed with the Rapid Automated Processing of Data (RAPD) software package (https://rapd.nec.aps.anl.gov/rapd/) and with the XDS suite⁵⁰. The structures were solved by molecular replacement using Phaser in CCP4⁵¹ and the RNA coordinates of PDB entry 4FAR (without solvent atoms) as the initial model^16,17,18. The models were improved automatically in Phenix⁵² and Refmac5⁵¹, and manually in Coot⁵³, and finally evaluated by MolProbity⁵⁴. The figures depicting the structures were drawn using PyMOL⁵⁵. Stereo images of selected regions of the electron density are reported in Supplementary Fig. 11.

pK_A calculations

We used continuum electrostatics calculations based on the nonlinear Poisson–Boltzmann equation to estimate the pK_A of C358 in the pre-hydrolytic state (PDB id. 4FAQ) and in the toggled state (PDB id. 4FAU). Calculations were performed with DelPhiPKa⁵⁶, using a pH range from 0 to 14 with a pH interval of 0.5, a dielectric constant for RNA ε_RNA = 4, and a dielectric constant for solvent ε_solvent = 80. Metals were not considered in the calculations.
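The relationship between a titration scan and a pK_A estimate can be sketched as follows. The example builds the same pH grid (0 to 14 in steps of 0.5), evaluates a Henderson–Hasselbalch titration curve for a single site, and recovers the pK_A as the pH at which the site is half protonated. The pK_A value used is a placeholder chosen for illustration only, and the sketch does not reproduce the Poisson–Boltzmann calculations performed by DelPhiPKa.

```python
def protonated_fraction(ph: float, pka: float) -> float:
    """Henderson-Hasselbalch fraction of protonated species for one titratable site."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


def midpoint_pka(ph_values, fractions):
    """pH at which the protonated fraction crosses 0.5, by linear interpolation."""
    for i in range(len(ph_values) - 1):
        f_lo, f_hi = fractions[i], fractions[i + 1]
        if f_lo >= 0.5 >= f_hi:
            if f_lo == f_hi:
                return ph_values[i]
            step = ph_values[i + 1] - ph_values[i]
            return ph_values[i] + step * (f_lo - 0.5) / (f_lo - f_hi)
    raise ValueError("titration curve does not cross 0.5 on this pH grid")


# pH grid described in the text: 0 to 14 with an interval of 0.5.
ph_grid = [0.5 * i for i in range(29)]

HYPOTHETICAL_PKA = 6.3  # placeholder value, NOT a result from this study
curve = [protonated_fraction(ph, HYPOTHETICAL_PKA) for ph in ph_grid]
estimated_pka = midpoint_pka(ph_grid, curve)

print(f"estimated pKa from the 0.5-unit grid: {estimated_pka:.2f}")
```

Because the grid is coarse, the interpolated midpoint is only approximately equal to the input pK_A.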

Structural models for MD simulations

We have used ten systems for MD simulations: (1) The pre-hydrolytic state, a wild-type system modeled on PDB id: 4FAQ¹⁶; (2) The N7-deaza state, a pre-hydrolytic state in which N7^G288 was replaced by a carbon atom; (3) The cleaved state, a pre-hydrolytic state in which the phosphodiester bond between the intron and the 5e was broken by introducing an oxygen atom and inverting the stereochemical configuration of the SP, and in which Ca²⁺ ions were replaced with Mg²⁺ ions; (4) The cleaved-H⁺ state, a cleaved state protonated at N3^C358; (5) The post-hydrolytic state, a wild-type system modeled on PDB id: 4FAR¹⁶; (6) The post-hydrolytic H⁺ state, a post-hydrolytic state protonated at N3^C358; (7) The post-hydrolytic G-mutant, modeled on the structure of the G-mutant in potassium and magnesium; (8) The cleaved G-mutant, a cleaved state carrying the C289G/C358G/G385C triple mutations; (9) The post-hydrolytic U-mutant, modeled on the structure of the U-mutant in potassium and magnesium; (10) The cleaved U-mutant, a cleaved state carrying the C289U/C358U/G385A triple mutations. Each system was hydrated with a 12-Å layer of TIP3P⁵⁷ water molecules, and the ion concentration was set to match that used for crystallization¹⁶. All the crystallized ions and water molecules were considered for model building. The final models are thus enclosed in a box of ~145·125·144 Å³, containing ~80,000 water molecules, resulting in a total of ~250,000 atoms for each system.
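As a back-of-the-envelope illustration of how a target ion concentration translates into a number of ions in a periodic box, the sketch below uses the approximate box dimensions quoted above together with the 150 mM potassium and 100 mM magnesium concentrations used in the simulations. It treats the whole box volume as solvent, ignoring the volume occupied by the RNA, so the counts are rough upper estimates and not the ion numbers actually present in the simulated systems.

```python
AVOGADRO = 6.02214076e23          # particles per mole
LITERS_PER_CUBIC_ANGSTROM = 1e-27  # 1 A^3 = 1e-27 L


def ions_for_concentration(molar: float, volume_a3: float) -> float:
    """Number of ions giving the requested molar concentration in a volume in A^3."""
    return molar * volume_a3 * LITERS_PER_CUBIC_ANGSTROM * AVOGADRO


# Approximate box edges from the text, in Angstrom.
box_edges = (145.0, 125.0, 144.0)
box_volume = box_edges[0] * box_edges[1] * box_edges[2]

n_potassium = ions_for_concentration(0.150, box_volume)
n_magnesium = ions_for_concentration(0.100, box_volume)

print(f"box volume ~ {box_volume:.3g} A^3")
print(f"K+ ions for 150 mM  ~ {n_potassium:.0f}")
print(f"Mg2+ ions for 100 mM ~ {n_magnesium:.0f}")
```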

MD simulation set up

The AMBER-ff12SB force field (ff99 + bsc0 + χOL3)⁵⁸ was used for the parametrization of the RNA. Nucleotide G288 in the N7-deaza model, nucleotide C358 in the cleaved-H⁺ and post-hydrolytic H⁺ models, and both 5′- and 3′-terminal nucleotides in all models were parametrized with the general AMBER force field (GAFF)⁵⁹, and their atomic charges were derived with the RESP procedure⁶⁰. We used the Joung–Cheatham parameters⁶¹ for the monovalent metal ions, while the divalent metal ions were parametrized according to Li et al.⁶². In the simulations, we have used ionic concentrations of 100 mM for magnesium ions and 150 mM for potassium ions, in line with the crystallization conditions of the intron (see above). The two catalytic metal ions were modeled using a flexible nonbonded approach based on the atoms in molecules partitioning scheme^63,64,65. All MD simulations were performed with Gromacs 5.1.4⁶⁶. The integration time step was set to 2 fs, while the lengths of all covalent bonds were constrained with the P-LINCS algorithm⁶⁷. A temperature of 310 K was imposed using a velocity-rescaling thermostat⁶⁸ with a relaxation time τ = 0.1 ps, while pressure control was achieved with a Parrinello–Rahman barostat⁶⁹ at a reference pressure of 1 atm with τ = 2 ps. Periodic boundary conditions in the three directions of Cartesian space were applied. A particle mesh Ewald method, with a Fourier grid spacing of 1.6 Å, was used to treat long-range electrostatics. All the systems were subjected to the same simulation protocol. To relax the water molecules and the ions, energy minimization was carried out. At this stage, active core ions (M1, M2, K1, K2, K4¹⁹) along with the RNA backbone were kept fixed with harmonic positional restraints of 500 kcal mol⁻¹ Å⁻². Subsequently, the systems were heated up from 0 to 310 K with an NVT simulation of ~1 ns with the same positional restraints used in the energy minimization.
A second NVT simulation of ~1 ns was then performed at a fixed temperature (310 K), halving the positional restraints. In addition, ~1 ns of NPT simulation was performed with 100 kcal mol⁻¹ Å⁻² residual restraints on the backbone and the core ions to allow partial backbone relaxation. Finally, different production runs were performed in the NPT ensemble for each system. Overall, we collected more than 15 µs of MD trajectories, specifically: (1) ~1.8 µs for the pre-hydrolytic system, two replicas; (2) ~600 ns for the N7-deaza system, three replicas; (3) ~1.2 µs for the cleaved system, two replicas; (4) ~1.2 µs for the cleaved-H⁺ system, two replicas; (5) ~4.5 µs for the post-hydrolytic system, six replicas; (6) ~1 µs for the post-hydrolytic H⁺ system, three replicas; (7) ~1.2 µs for the post-hydrolytic G-mutant, two replicas; (8) ~1.2 µs for the cleaved G-mutant, two replicas; (9) ~1.2 µs for the post-hydrolytic U-mutant, two replicas; (10) ~1.2 µs for the cleaved U-mutant, two replicas. For each system, statistics were collected after the system reached equilibration (i.e., stabilization of the RMSD of the nucleic acid backbone), thus discarding the first 25 ns of the trajectories.
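The aggregate sampling can be tallied with a few lines of code. The sketch below sums the per-system simulation times listed above, assuming that each quoted duration is the total over all replicas of that system (an interpretation made here, since the text does not state whether the times are per replica or per system). The dictionary keys are labels chosen for this illustration.

```python
# Approximate production time per system in microseconds, as listed in the text.
production_time_us = {
    "pre-hydrolytic": 1.8,
    "N7-deaza": 0.6,
    "cleaved": 1.2,
    "cleaved-H+": 1.2,
    "post-hydrolytic": 4.5,
    "post-hydrolytic H+": 1.0,
    "post-hydrolytic G-mutant": 1.2,
    "cleaved G-mutant": 1.2,
    "post-hydrolytic U-mutant": 1.2,
    "cleaved U-mutant": 1.2,
}

total_us = sum(production_time_us.values())
n_systems = len(production_time_us)

print(f"{n_systems} systems, ~{total_us:.1f} microseconds in total")
```

Under this reading, the ten systems add up to slightly more than 15 µs, in agreement with the overall figure given in the text.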

MtD simulations

The reference path was built upon the different conformations of the nucleotides U285 to A290 in the pre-hydrolytic and toggled states (PDB id: 4FAQ and 4FAX, respectively¹⁶). The two structures were used to generate 30 interpolated intermediates through the MolMov morphing server⁷⁰. Each intermediate was subjected to energy minimization, and 16 snapshots were chosen to build the path. The nodes of the path (i.e., the intermediate structures) are equally spaced, with a distance of ~0.32 Å between consecutive nodes. Following Branduardi et al.⁷¹, we defined two path collective variables: (1) S, which defines the progress along the reference path; (2) Z, which measures the distance from the reference path. To sample the free energy landscape, we used adaptive-width MtD as implemented in Plumed^35,72, in which the width of the Gaussian was determined by the fluctuation of S and Z over a time interval of 1 ps. A lower-bound limit for the width of the Gaussian was set to 0.03 in the appropriate unit for each coordinate. The height of the Gaussian was set to 0.3 kJ/mol, with one Gaussian added every 1 ps. By considering the distance between the nodes of the path, we set λ = 23.66 Å⁻². We collected: (1) 350 ns for the transition from the cleaved state to the toggled state (referred to as c→T); (2) 200 ns for the transition from the cleaved-H⁺ state to the toggled state (referred to as cH⁺→T).
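The two path collective variables can be written compactly in code. The sketch below evaluates S and Z in the standard form introduced by Branduardi et al., S = Σ i·exp(−λ d_i²) / Σ exp(−λ d_i²) and Z = −(1/λ) ln Σ exp(−λ d_i²), where d_i² is the squared distance from the current configuration to node i. It uses the λ value and the number of nodes given above, but replaces the real RMSD-based distances with a one-dimensional toy path (nodes 0.32 Å apart on a line); that toy geometry and all function names are assumptions made purely for illustration.

```python
import math

LAMBDA = 23.66       # A^-2, value given in the text
N_NODES = 16         # number of path nodes given in the text
NODE_SPACING = 0.32  # A, approximate spacing between nodes given in the text


def path_cvs(squared_distances, lam=LAMBDA):
    """Return (S, Z) from the squared distances (A^2) to each node of the path."""
    d_min = min(squared_distances)
    # Shift by the smallest distance so the exponentials stay well conditioned.
    weights = [math.exp(-lam * (d - d_min)) for d in squared_distances]
    norm = sum(weights)
    s = sum((i + 1) * w for i, w in enumerate(weights)) / norm  # nodes numbered 1..N
    z = d_min - math.log(norm) / lam
    return s, z


def toy_squared_distances(position):
    """Squared distances to nodes of a 1D toy path (illustrative stand-in for RMSD^2)."""
    return [(position - i * NODE_SPACING) ** 2 for i in range(N_NODES)]


# A configuration sitting exactly on node 5, and one halfway between nodes 5 and 6.
s_on_node, z_on_node = path_cvs(toy_squared_distances(4 * NODE_SPACING))
s_between, z_between = path_cvs(toy_squared_distances(4.5 * NODE_SPACING))

print(f"on node 5:         S = {s_on_node:.3f}, Z = {z_on_node:.4f} A^2")
print(f"between 5 and 6:   S = {s_between:.3f}, Z = {z_between:.4f} A^2")
```

With this definition, S varies smoothly from 1 to 16 along the path, while Z stays close to zero for configurations lying on the path and grows as the system departs from it.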

Hybrid quantum mechanical/molecular mechanical (QM/MM) simulations

QM/MM simulations were performed on the structure of the cleaved-H⁺ state with the CP2K molecular dynamics engine⁷³ to explore the stability of the protonated form of N3^C358. The AMBER force field was used for the MM subsystem, whereas density functional theory (DFT) was used to describe the QM atoms. The BLYP functional^74,75 supplemented by a dispersion correction⁷⁶ was employed. The Quickstep algorithm was used to solve the electronic structure problem⁷⁷, employing a double zeta plus polarization basis set⁷⁸ to represent the valence orbitals and plane waves for the electron density (320 Ry cutoff). Goedecker–Teter–Hutter type pseudopotentials were used for valence–core interactions⁷⁹. Wavefunction optimization was achieved through an orbital transformation method⁸⁰ using a threshold of 5·10⁻⁷ on the electronic gradient as a convergence criterion. The QM/MM coupling follows the protocol proposed by Laino et al.⁸¹. Simulations were performed in the NVT ensemble (300 K), employing a velocity rescaling thermostat⁶⁸. After about 4.3 ps, a second water molecule was included in the QM region, and the simulation was restarted to collect 15 ps of simulation time. N3^C358 remained stably protonated throughout the entire simulation.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Data supporting all other findings of this manuscript, including MD simulation trajectories, are available from the corresponding authors upon request. Coordinates and structure factors have been deposited in the Protein Data Bank under accession codes PDB 6T3K, PDB 6T3R, PDB 6T3N, and PDB 6T3S. The source data underlying Fig. 1b, c and Supplementary Table 2 are provided as a Source Data file. Source data are provided with this paper.

Change history

04 January 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41467-021-27699-2

References

Pyle, A. M. & Lambowitz, A. M. Group II introns: ribozymes that splice RNA and invade DNA. In The RNA World (eds. Gesteland, R. F., Cech, T. R. & Atkins, J. F.) 469–505 (Cold Spring Harbor Press, Cold Spring Harbor, 2006).
Galej, W. P., Toor, N., Newman, A. J. & Nagai, K. Molecular mechanism and evolution of nuclear pre-mRNA and group II intron splicing: insights from cryo-electron microscopy structures. Chem. Rev. 118, 4156–4176 (2018).
Garcia-Rodriguez, F. M., Barrientos-Duran, A., Diaz-Prado, V., Fernandez-Lopez, M. & Toro, N. Use of RmInt1, a group IIB intron lacking the intron-encoded protein endonuclease domain, in gene targeting. Appl Environ. Microbiol. 77, 854–861 (2011).
Guo, H. et al. Group II introns designed to insert into therapeutically relevant DNA target sites in human cells. Science 289, 452–457 (2000).
Boudvillain, M., de Lencastre, A. & Pyle, A. M. A tertiary interaction that links active-site domains to the 5’ splice site of a group II intron. Nature 406, 315–318 (2000).
Chillon, I., Martinez-Abarca, F. & Toro, N. Splicing of the Sinorhizobium meliloti RmInt1 group II intron provides evidence of retroelement behavior. Nucleic Acids Res. 39, 1095–1104 (2011).
Chillon, I. et al. In vitro characterization of the splicing efficiency and fidelity of the RmInt1 group II intron as a means of controlling the dispersion of its host mobile element. RNA 20, 2000–2010 (2014).
de Lencastre, A., Hamill, S. & Pyle, A. M. A single active-site region for a group II intron. Nat. Struct. Mol. Biol. 12, 626–627 (2005).
Costa, M., Walbott, H., Monachello, D., Westhof, E. & Michel, F. Crystal structures of a group II intron lariat primed for reverse splicing. Science 354, aaf9258 (2016).
Qu, G. et al. Structure of a group II intron in complex with its reverse transcriptase. Nat. Struct. Mol. Biol. 23, 549–557 (2016).
Robart, A. R., Chan, R. T., Peters, J. K., Rajashankar, K. R. & Toor, N. Crystal structure of a eukaryotic group II intron lariat. Nature 514, 193–197, https://doi.org/10.1038/nature13790 (2014).
Toor, N., Keating, K. S., Taylor, S. D. & Pyle, A. M. Crystal structure of a self-spliced group II intron. Science 320, 77–82 (2008).
Chan, R. T. et al. Structural basis for the second step of group II intron splicing. Nat. Commun. 9, 4676 (2018).
Chan, R. T., Robart, A. R., Rajashankar, K. R., Pyle, A. M. & Toor, N. Crystal structure of a group II intron in the pre-catalytic state. Nat. Struct. Mol. Biol. 19, 555–557 (2012).
Zhao, C., Rajashankar, K. R., Marcia, M. & Pyle, A. M. Crystal structure of group II intron domain 1 reveals a template for RNA assembly. Nat. Chem. Biol. 11, 967–972 (2015).
Marcia, M. & Pyle, A. M. Visualizing group II intron catalysis through the stages of splicing. Cell 151, 497–507 (2012).
Marcia, M. Using molecular replacement phasing to study the structure and function of RNA. Methods Mol. Biol. 1320, 233–257 (2016).
Marcia, M. et al. Solving nucleic acid structures by molecular replacement: examples from group II intron studies. Acta Crystallogr D. Biol. Crystallogr 69, 2174–2185 (2013).
Marcia, M. & Pyle, A. M. Principles of ion recognition in RNA: insights from the group II intron structures. RNA 20, 516–527 (2014).
Marcia, M., Somarowthu, S. & Pyle, A. M. Now on display: a gallery of group II intron structures at different stages of catalysis. Mob. DNA 4, 14–26 (2013).
Pyle, A. M. The tertiary structure of group II introns: implications for biological function and evolution. Crit. Rev. Biochem Mol. Biol. 45, 215–232 (2010).
Mikheeva, S., Murray, H. L., Zhou, H., Turczyk, B. M. & Jarrell, K. A. Deletion of a conserved dinucleotide inhibits the second step of group II intron splicing. RNA 6, 1509–1515 (2000).
Podar, M., Perlman, P. S. & Padgett, R. A. Stereochemical selectivity of group II intron splicing, reverse splicing, and hydrolysis reactions. Mol. Cell Biol. 15, 4466–4478 (1995).
Casalino, L., Palermo, G., Rothlisberger, U. & Magistrato, A. Who activates the nucleophile in ribozyme catalysis? An answer from the splicing mechanism of group II introns. J. Am. Chem. Soc. 138, 10374–10377 (2016).
Palermo, G., Casalino, L., Magistrato, A. & Andrew McCammon, J. Understanding the mechanistic basis of non-coding RNA through molecular dynamics simulations. J. Struct. Biol. 206, 267–279 (2019).
Dayie, K. T. & Padgett, R. A. A glimpse into the active site of a group II intron and maybe the spliceosome, too. RNA 14, 1697–1703, https://doi.org/10.1261/rna.1154408 (2008).
Pechlaner, M., Donghi, D., Zelenay, V. & Sigel, R. K. Protonation-dependent base flipping at neutral pH in the catalytic triad of a self-splicing bacterial group II intron. Angew. Chem. Int Ed. Engl. 54, 9687–9690 (2015).
Nakano, S., Chadalavada, D. M. & Bevilacqua, P. C. General acid-base catalysis in the mechanism of a hepatitis delta virus ribozyme. Science 287, 1493–1497 (2000).
Huppler, A., Nikstad, L. J., Allmann, A. M., Brow, D. A. & Butcher, S. E. Metal binding and base ionization in the U6 RNA intramolecular stem-loop structure. Nat. Struct. Biol. 9, 431–435 (2002).
Reiter, N. J., Blad, H., Abildgaard, F. & Butcher, S. E. Dynamics in the U6 RNA intramolecular stem-loop: a base flipping conformational change. Biochemistry 43, 13739–13747 (2004).
Keating, K. S., Toor, N., Perlman, P. S. & Pyle, A. M. A structural analysis of the group II intron active site and implications for the spliceosome. RNA 16, 1–9 (2010).
Roitzsch, M., Fedorova, O. & Pyle, A. M. The 2’-OH group at the group II intron terminus acts as a proton shuttle. Nat. Chem. Biol. 6, 218–224 (2010).
Peebles, C. L., Zhang, M., Perlman, P. S. & Franzen, J. S. Catalytically critical nucleotide in domain 5 of a group II intron. Proc. Natl Acad. Sci. USA 92, 4422–4426 (1995).
de Lencastre, A. & Pyle, A. M. Three essential and conserved regions of the group II intron are proximal to the 5’-splice site. RNA 14, 11–24 (2008).
Branduardi, D., Bussi, G. & Parrinello, M. Metadynamics with adaptive Gaussians. J. Chem. Theory Comput. 8, 2247–2254, https://doi.org/10.1021/ct3002464 (2012).
Evans, M. G. & Polanyi, M. Some applications of the transition state method to the calculation of reaction velocities, especially in solution. Trans. Faraday Soc. 31, 875, https://doi.org/10.1039/tf9353100875 (1935).
Eyring, H. The activated complex in chemical reactions. J. Chem. Phys. 3, 107–115, https://doi.org/10.1063/1.1749604 (1935).
Jacquier, A. & Michel, F. Base-pairing interactions involving the 5’ and 3’-terminal nucleotides of group-II self-splicing introns. J. Mol. Biol. 213, 437–447, https://doi.org/10.1016/S0022-2836(05)80206-2 (1990).
Madhani, H. D. & Guthrie, C. Randomization-selection analysis of snRNAs in vivo: evidence for a tertiary interaction in the spliceosome. Genes Dev. 8, 1071–1086 (1994).
Konarska, M. M., Vilardell, J. & Query, C. C. Repositioning of the reaction intermediate within the catalytic center of the spliceosome. Mol. Cell 21, 543–553, https://doi.org/10.1016/j.molcel.2006.01.017 (2006).
Query, C. C. & Konarska, M. M. Suppression of multiple substrate mutations by spliceosomal prp8 alleles suggests functional correlations with ribosomal ambiguity mutants. Mol. Cell 14, 343–354 (2004).
Schwer, B. & Guthrie, C. A conformational rearrangement in the spliceosome is dependent on PRP16 and ATP hydrolysis. EMBO J. 11, 5033–5039 (1992).
Hardy, S. F., Grabowski, P. J., Padgett, R. A. & Sharp, P. A. Cofactor requirements of splicing of purified messenger RNA precursors. Nature 308, 375–377 (1984).
Genna, V., Colombo, M., De Vivo, M. & Marcia, M. Second-shell basic residues expand the two-metal-ion architecture of DNA and RNA processing enzymes. Structure 26, 40–50 e2, https://doi.org/10.1016/j.str.2017.11.008 (2018).
Fabrizio, P. & Abelson, J. Two domains of yeast U6 small nuclear RNA required for both steps of nuclear precursor messenger RNA splicing. Science 250, 404–409 (1990).
Ho Faix, P. Conserved nucleotides in the joining segment between domains 2 and 3 are important for group II intron splicing. PhD thesis. (University of Pittsburgh, Pittsburgh, 1998).
Smith, D. J., Query, C. C. & Konarska, M. M. “Nought may endure but mutability”: spliceosome dynamics and the regulation of splicing. Mol. Cell 30, 657–666 (2008).
Riccardi, L., Genna, V. & De Vivo, M. Metal-ligand interactions in drug design. Nat. Rev. Chem. 2, 100–112 (2018).
Chillon, I. et al. Native purification and analysis of long RNAs. Methods Enzymol. 558, 3–37 (2015).
Kabsch, W. Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. J. Appl Crystallogr. 26, 795–800 (1993).
Collaborative computational project number 4. The CCP4 suite: programs for protein crystallography. Acta Crystallogr D. Biol. Crystallogr. 50, 760–763, https://doi.org/10.1107/S0907444994003112 (1994).
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr D. Biol. Crystallogr. 66, 213–221 (2010).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr D. Biol. Crystallogr. 60, 2126–2132 (2004).
Davis, I. W. et al. MolProbity: all-atom contacts and structure validation for proteins and nucleic acids. Nucleic Acids Res. 35, W375–W383 (2007).
Schrodinger, LLC. The PyMOL Molecular Graphics System, Version 1.3r1. (2010).
Wang, L., Zhang, M. & Alexov, E. DelPhiPKa web server: predicting pKa of proteins, RNAs and DNAs. Bioinformatics 32, 614–615 (2016).
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
Perez, A. et al. Refinement of the AMBER force field for nucleic acids: improving the description of alpha/gamma conformers. Biophys. J. 92, 3817–3829 (2007).
Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A. & Case, D. A. Development and testing of a general amber force field. J. Comput Chem. 25, 1157–1174, https://doi.org/10.1002/jcc.20035 (2004).
Besler, B. H., Merz, K. M. Jr & Kollman, P. A. Atomic charges derived from semiempirical methods. J. Computational Chem. 11, 431–439 (1990).
Joung, I. S. & Cheatham, T. E. Determination of alkali and halide monovalent ion parameters for use in explicitly solvated biomolecular simulations. J. Phys. Chem. B 112, 9020–9041 (2008).
Li, P., Roberts, B. P., Chakravorty, D. K. & Merz, K. M. Rational design of particle mesh Ewald compatible Lennard-Jones parameters for +2 metal cations in explicit solvent. J. Chem. Theory Comput. 9, 2733–2748 (2013).
Dal Peraro, M. et al. Modeling the charge distribution at metal sites in proteins for molecular dynamics simulations. J. Struct. Biol. 157, 444–453 (2007).
Genna, V., Carloni, P. & De Vivo, M. A strategically located Arg/Lys residue promotes correct base paring during nucleic acid biosynthesis in polymerases. J. Am. Chem. Soc. 140, 3312–3321 (2018).
Genna, V., Vidossich, P., Ippoliti, E., Carloni, P. & De Vivo, M. A self-activated mechanism for nucleic acid polymerization catalyzed by DNA/RNA polymerases. J. Am. Chem. Soc. 138, 14592–14598, https://doi.org/10.1021/jacs.6b05475 (2016).
Abraham, M. J. et al. GROMACS: high performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1-2, 19–25 (2015).
Hess, B. P-LINCS: a parallel linear constraint solver for molecular simulation. J. Chem. Theory Comput. 4, 116–122 (2008).
Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 126, 014101 (2007).
Parrinello, M. & Rahman, A. Polymorphic transitions in single crystals: a new molecular dynamics method. J. Appl. Phys. 52, 7182–7190, https://doi.org/10.1063/1.328693 (1981).
Krebs, W. G. & Gerstein, M. The morph server: a standardized system for analyzing and visualizing macromolecular motions in a database framework. Nucleic Acids Res. 28, 1665–1675 (2000).
Branduardi, D., Gervasio, F. L. & Parrinello, M. From A to B in free energy space. J. Chem. Phys. 126, 054103 (2007).
Bonomi, M. et al. PLUMED: a portable plugin for free-energy calculations with molecular dynamics. Computer Phys. Commun. 180, 1961–1972 (2009).
Hutter, J., Iannuzzi, M., Schiffmann, F. & VandeVondele, J. cp2k: atomistic simulations of condensed matter systems. WIREs Comput. Mol. Sci. 4, 15–25 (2014).
Lee, C., Yang, W. & Parr, R. G. Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density. Phys. Rev. B 37, 785–789 (1988).
Becke, A. D. Density-functional exchange-energy approximation with correct asymptotic behavior. Phys. Rev. A 38, 3098–3100 (1988).
Grimme, S., Antony, J., Ehrlich, S. & Krieg, H. A consistent and accurate ab initio parametrization of density functional dispersion correction (DFT-D) for the 94 elements H-Pu. J. Chem. Phys. 132, 154104 (2010).
Lippert, G., Hutter, J. & Parrinello, M. A hybrid Gaussian and plane wave density functional scheme. Mol. Phys. 92, 477–488 (1997).
VandeVondele, J. & Hutter, J. Gaussian basis sets for accurate calculations on molecular systems in gas and condensed phases. J. Chem. Phys. 127, 114105, https://doi.org/10.1063/1.2770708 (2007).
Goedecker, S., Teter, M. & Hutter, J. Separable dual-space Gaussian pseudopotentials. Phys. Rev. B 54, 1703–1710 (1996).
VandeVondele, J. & Hutter, J. An efficient orbital transformation method for electronic structure calculations. J. Chem. Phys. 118, 4365–4369 (2003).
Laino, T., Mohamed, F., Laio, A. & Parrinello, M. An efficient real space multigrid QM/MM electrostatic coupling. J. Chem. Theory Comput. 1, 1176–1184, https://doi.org/10.1021/ct050123f (2005).

Acknowledgements

This work is based upon research conducted at the Northeastern Collaborative Access Team beamlines, which are funded by the National Institute of General Medical Sciences from the National Institutes of Health (P30 GM124165). The Eiger 16M detector on 24-ID-E is funded by an NIH-ORIP HEI grant (S10OD021527). This research used resources of the Advanced Photon Source, a U.S. Department of Energy (DOE) Office of Science User Facility operated for the DOE Office of Science by Argonne National Laboratory under Contract No. DE-AC02-06CH11357. We would also like to thank Dr. Laura Murray for help in cloning and crystallizing the G and U mutants, Gabriele Drews for technical assistance, and Dr. Olga Fedorova for critical reading of the manuscript. We also thank all members of the Marcia, Pyle, and De Vivo labs for helpful discussion. Work in the Marcia lab is partly funded by the Agence Nationale de la Recherche (ANR-15-CE11-0003-01), by the Agence Nationale de Recherche sur le Sida et les hépatites virales (ANRS) (ECTZ18552), by ITMO Cancer (18CN047-00), and by the Fondation ARC pour la recherche sur le cancer (PJA20191209284). The Marcia lab uses the platforms of the Grenoble Instruct Center (ISBG: UMS 3518 CNRS-CEA-UJF-EMBL) with support from FRISBI (ANR-10-INSB-05-02) and GRAL (ANR-10-LABX-49-01) within the Grenoble Partnership for Structural Biology (PSB). MDV thanks the Italian Association for Cancer Research (AIRC) for financial support (IG 23679). AMP is a Howard Hughes Medical Institute investigator.

Author information

These authors contributed equally: Jacopo Manigrasso, Isabel Chillón.

Authors and Affiliations

Laboratory of Molecular Modelling & Drug Discovery, Istituto Italiano di Tecnologia, Via Morego 30, 16163, Genoa, Italy
Jacopo Manigrasso, Pietro Vidossich & Marco De Vivo
European Molecular Biology Laboratory (EMBL) Grenoble, 71 Avenue des Martyrs, Grenoble, 38042, France
Isabel Chillón & Marco Marcia
Department of Structural and Computational Biology, Institute for Research in Biomedicine (IRB), Parc Científic de Barcelona, C/ Baldiri Reixac 10-12, 08028, Barcelona, Spain
Vito Genna
Department of Biochemistry & Molecular Biology, Drexel University College of Medicine, Philadelphia, PA, USA
Srinivas Somarowthu
Department of Molecular, Cellular and Developmental Biology, New Haven, CT, 06511, USA
Anna Marie Pyle
Department of Chemistry, Yale University, New Haven, CT, 06511, USA
Anna Marie Pyle
Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
Anna Marie Pyle

Contributions

MM, AMP, and MDV have conceived and designed the work; SS performed initial pKa calculations; JM, IC, MM, VG, and PV have acquired the data; all authors have interpreted the data and drafted the manuscript. MM and MDV contributed equally.

Corresponding authors

Correspondence to Marco De Vivo or Marco Marcia.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Marlene Belfort and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Description of Additional Supplementary Files

Supplementary Movie 1

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Manigrasso, J., Chillón, I., Genna, V. et al. Visualizing group II intron dynamics between the first and second steps of splicing. Nat Commun 11, 2837 (2020). https://doi.org/10.1038/s41467-020-16741-4

Received: 22 October 2019
Accepted: 18 May 2020
Published: 05 June 2020
DOI: https://doi.org/10.1038/s41467-020-16741-4
