Unraveling RNA dynamical behavior of TPP riboswitches: a comparison between Escherichia coli and Arabidopsis thaliana

Antunes, Deborah; Jorge, Natasha Andressa Nogueira; Garcia de Souza Costa, Mauricio; Passetti, Fabio; Caffarena, Ernesto Raul

doi:10.1038/s41598-019-40875-1

Download PDF

Article
Open access
Published: 12 March 2019

Unraveling RNA dynamical behavior of TPP riboswitches: a comparison between Escherichia coli and Arabidopsis thaliana

Deborah Antunes ORCID: orcid.org/0000-0002-2927-0800¹,
Natasha Andressa Nogueira Jorge²,
Mauricio Garcia de Souza Costa¹,
Fabio Passetti^2,3 &
…
Ernesto Raul Caffarena¹

Scientific Reports volume 9, Article number: 4197 (2019) Cite this article

4266 Accesses
16 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Riboswitches are RNA sensors that affect post-transcriptional processes through their ability to bind to small molecules. Thiamine pyrophosphate (TPP) riboswitch class is the most widespread riboswitch occurring in all three domains of life. Even though it controls different genes involved in the synthesis or transport of thiamine and its phosphorylated derivatives in bacteria, archaea, fungi, and plants, the TPP aptamer has a conserved structure. In this study, we aimed at understanding differences in the structural dynamics of TPP riboswitches from Escherichia coli and Arabidopsis thaliana, based on their crystallographic structures (TPPsw^ec and TPPsw^at, respectively) and dynamics in aqueous solution, both in apo and holo states. A combination of Molecular Dynamics Simulations and Network Analysis empowered to find out slight differences in the dynamical behavior of TPP riboswitches, although relevant for their dynamics in bacteria and plants species. Our results suggest that distinct interactions in the microenvironment surrounding nucleotide U36 of TPPsw^ec (and U35 in TPPsw^at) are related to different responses to TPP. The network analysis showed that minor structural differences in the aptamer enable enhanced intramolecular communication in the presence of TPP in TPPsw^ec, but not in TPPsw^at. TPP riboswitches of plants present subtler and slower regulation mechanisms than bacteria do.

The roles of structural dynamics in the cellular functions of RNAs

Article 10 June 2019

Laura R. Ganser, Megan L. Kelly, … Hashim M. Al-Hashimi

Structural basis for control of bacterial RNA polymerase pausing by a riboswitch and its ligand

Article 01 June 2023

Adrien Chauvier, Jason C. Porta, … Nils G. Walter

Human PNPase causes RNA stabilization and accumulation of R-loops in the Escherichia coli model system

Article Open access 21 July 2023

Federica A. Falchi, Francesca Forti, … Federica Briani

Introduction

Riboswitches are natural ribonucleic acid (RNA) sensors that affect post-transcriptional processes through their ability to bind to small molecules such as vitamins, amino acids and nucleotides^1,2. Riboswitches are highly conserved and structured elements located in the untranslated regions (UTRs) or introns of pre-mRNAs and can be found in all the three domains of life^3,4,5.

Riboswitches have been shown to modulate gene expression by influencing transcription, translation, alternative splicing and RNA stability^6,7,8. Typically, genes regulated by a given riboswitch are involved in biosynthesis, catabolism, signaling or transport of a metabolite that binds to the riboswitch, creating feedback regulation mechanisms to control its adequate levels⁹. When the threshold of a metabolic pathway increases, binding of the metabolite to the Riboswitch leads to a negative feedback mechanism resulting in the suppression of the involved genes. Riboswitches display high specificity for the substrate, conferring efficiency to carry out their activity even in the presence of other similar metabolites¹⁰.

The structure of riboswitches consists of a two-domain set: the sensory and the regulatory domains. The sensory domain is the aptamer, whose sequence and structure are highly conserved. It acts as a receptor for particular metabolites, whereas the binding of small molecules are transduced to genetic regulatory signals by the expression platform localized adjacent to the aptamer¹⁰. This is achieved by a structural rearrangement, resulting in an immediate RNA conformational change that alters mRNA translation.

To date, the only riboswitch described in eukaryotes was the TPP riboswitch, since most of the studies have been carried out in prokaryotic organisms¹¹. In bacteria, such as Escherichia coli, two functionally different TPP riboswitches were reported. The first one is located in the 5′UTR region of the thiM gene, where it controls gene expression at the translation level^11,12, while the second guides both translation and transcription of 5′ UTR region of the thiC gene¹¹.

TPP riboswitches have been found in 5′ UTRs regions of genes encoding thiamine biosynthetic enzymes in fungi^13,14, and algae¹⁵, in which they promote alternatively spliced transcripts. Conversely, in all species of plants previously studied, the TPP riboswitch resides in the 3′ UTRs region of the thiC gene. This difference in mRNA localization suggests a unique mode of action for plant riboswitches^8,16.

The TPP aptamer has sequence and structures highly conserved, independently if the gene is involved at different stages of the synthesis or transport processes of thiamine in bacteria, archaea, fungi, and plants. Rfam database¹⁷ has a total of 9180 TPP riboswitch sequences (Rfam accession RF00059) from distinct organisms, and the consensus secondary structure of these entries is highly conserved. Structurally, the TPP aptamer consists of five stems. The P1 stem is responsible for connecting the aptamer domain to the expression platform. Stems P2 and P3 are involved in binding of the TPP pyrimidine ring while stems P4 and P5 bind to the pyrophosphate group.

Most of the observed differences among species are found in the P3 stem region. For instance, bacteria and archaea commonly have a P3a stem¹⁸ not observed in eukaryotic riboswitches, whereas in eukaryotic organisms, the P3 stem is significantly variable in length, sequence, and base pairings¹⁹.

Despite the fact that both prokaryotic and eukaryotic TPP aptamers are structurally similar and play roles in the regulation of genes involved in thiamine biosynthesis, smaller structural differences may affect the way TPP binds and influences riboswitches in species of bacteria and plants. To analyze these differences, we performed a computational study using molecular dynamics simulations and correlation network analysis to compare the TPP riboswitch dynamical behavior in two of theses organisms. Our results suggest that distinct interactions in the microenvironment surrounding nucleotide U36 of TPPsw^ec (and U35 in TPPsw^at) are related to different responses to TPP. We also showed that minor structural differences in TPP riboswitch made the regulatory mechanism of A. thaliana subtler and slower than E. coli.

Results

RNA content analysis

Nucleotide sequence alignment between of Escherichia coli (TPPsw^ec) and Arabidopsis thaliana (TPPsw^at) revealed they have 68% of identity (Fig. 1A), along with highly conserved secondary structures (Fig. 1B). According to the SimTree server²⁰, the secondary structure analysis indicated a normalized score of 0.68 in a scale from 0 to 1, where 1 indicates a perfect match and 0 no match at all. Both sequence and secondary structure share similar conservation.

A detailed inspection of the secondary structure content showed that the aptamers domains display identical J3-2 junctions and an equal number of pairings in the P1, P4, and P5 helices. On the other hand, the P3 helix was the least conserved substructure, presenting four nucleotides and an additional base pair in TPPsw^ec. It is noteworthy that although the number of nucleotides in the P1 helix of TPPsw^ec is smaller than in TPPsw^at, the number of base pairing remains the same, being four for each case. Differences can also be found in the J2-4 junction and P2 helix, in which an additional pairing in TPPsw^ec is formed. Altogether, the 2D riboswitch structures are highly conserved between species, although minor differences are observed mainly concerning the content of base pairings, being TPPsw^ec two pairings longer than TPPsw^at.

Both prokaryotic and eukaryotic TPP aptamers share a notable common 3D structure and organization. The superposition between TPPsw^ec and TPPsw^at crystal structures resulted in heavy atoms root-mean-square deviation (RMSD) of approximately 0.63 Å (Fig. 2C,D). The insertion and relative position of TPP is also similar presenting a 0.69 Å RMSD. The aptamer consists of a switch helix (P1) and two sensor arms (P2/P3 and P4/P5), forming a long-range tertiary rearrangement capable of stabilizing the interaction between the L5 loop and the P3 stem (Fig. 2A,B). In both crystal structures, the P2/P3 arm helps accommodate the TPP pyrimidine ring while residues from the P4/P5 arm interact with the TPP pyrophosphate group (Fig. 2E,F).

Global and local stability of the aptamer domain

The structural stability of riboswitches in aqueous solution, in both apo and holo configurations, was evaluated by comparing the average RMSD (Table 1 and Supplementary Fig. S1) and root-mean-square fluctuation (RMSF) (Fig. 3) values calculated over the trajectory production stage, taking as references the structures obtained after equilibration.

Table 1 Root mean square deviations (Å) of free and bound states of TPPsw^ec and TPPsw^at as a whole and for substructures.

Full size table

During Molecular Dynamics (MD) simulations, both apo and holo forms presented deviations around 3 Å. No noticeable differences have been observed between the RMSD values obtained for both TPPsw^ec and TPPsw^at upon TPP complexation. Indeed, both average RMSD values per region and the overall pattern of fluctuations were similar for all systems. These values are consistent with RMSD values seen in other studies reporting simulations of small RNA aptamers^21,22,23,24 which ranged between 3–4 Å approximately.

The inspection at some particular regions of the RNA aptamer evidenced slight differences in their dynamical behavior. For example, the P1 region presented higher RMSD values than that of all other substructures in both species. Also, this region was more flexible in the holo forms. It is worth mentioning that the nucleotides composing the P1 helix are located in the terminal regions of the aptamer, showing broader movements during MD simulations. Unlike the P1 segment, the P3 helix displayed higher RMSD values in the apo forms. In addition, the comparison of the fluctuations between apo and holo riboswitches evidenced that TPP binding resulted in the stabilization of the P3 helix but contributed at the same time to the disruption of the P1 helix, according to the RMSF analysis (Fig. 3). Interestingly, the nucleotide U36, located in the P2 helix of TPPsw^ec, displayed more significant fluctuations in the holo form than in the apo form (holo: 5.4 Å; apo: 2.3 Å) (Fig. 3). No significant changes were observed in the P2 substructure in apo and holo form of TPPsw^at.

Monitoring P3-L5 interaction

Despite being remotely located from the TPP binding site, the interaction between P3-L5 substructures is essential for metabolite binding^25,26. The conformation of the P3-L5 region is kept in formation via two non-Watson-Crick (WC) base pairs. Nucleotides A59(57) and A60(58) from loop L5 connect with residues G27(25) and C28(26) from the P3 minor groove (Fig. 2A,B)²⁷. Numbers in parenthesis indicate the nucleotide position in the TPPsw^at systems.

By monitoring the formation of non-WC hydrogen bonds between P3 and L5 we observed that, notably, the G27(25)·A59(57) interaction in apo TPPsw^ec was the less stable with 68% occupancy during the simulation (Table 2). Upon TPP binding, the occupancy increased to 74% while no corresponding trend was observed upon comparison of the apo and holo TPPsw^at systems (presenting ~81% occupancy in both cases). Also, the C28(26)·A60(58) was more stable and presented occupancies higher than 84% in all systems. In addition, the base pair G27·A59 of TPPsw^ec appears to be influenced by the presence of the ligand, which contributed to its stabilization.

Table 2 Occupancy of hydrogen bonds between non-Watson-Crick pairs involved in the P3-L5 interaction of free and bound states of TPPsw^ec and TPPsw^at.

Full size table

TPP-RNA interaction

To verify the stability of TPP-RNA complexes during MD simulations, we calculated the H-bonds occupancy and the average distances between pairs of atoms involved in key interactions (Table 3). We confirmed the existence of similar interactions in all systems. The aminopyrimidine ring of TPP formed hydrogen bonds with N2 and N3 of G30(28) and the 2-OH′ of G9(11). Direct contacts to non-bridging oxygens of TPP β-phosphate were also formed via N4 of C67(65) and N1 of G68(66). All other pyrophosphate-RNA contacts were mediated through the two Mg²⁺ ions^26,27 (Fig. 4).

Table 3 Occupancy and distance of RNA-TPP interaction. The number in parenthesis indicates the distance standard deviation. “—” no RMSD values.

Full size table

Analysis of the distributions of the total number of RNA-TPP hydrogen bonds revealed that most TPPsw^ec formed more interactions than TPPsw^at (Supplementary Fig. S2). The number of sampled structures presenting four or more hydrogen bonds was consistently higher in the TPPsw^ec system than in TPPsw^at. The occupancy of the two hydrogen bonds formed between G30(28) and TPP was similar for both systems. Curiously, the occupancy of G9(11) for TPPsw^at was less than a half (26%) than for TPPsw^ec (56%).

In the holo systems, the Mg²⁺(1) ion remained closer than 1.90 Å from O3 β-phosphate of TPP, interacting inespecifically with RNA. In TPPsw^at, the magnesium atom Mg²⁺(2) got closer to Mg²⁺(1) (3.35 ± 1.13 Å) and interacted strongly with O1 oxygen atom from TPP (1.90 ± 0.06 Å), leaving the O5 oxygen atom from β-phosphate available to interact with RNA, with consequent formation of a bifurcated H-bond. The O5 oxygen atom from TPP interacted with C65 e G66, with hydrogen bond occupancies of 35% and 26%, respectively.

Each TPPsw^ec nucleotide involved in pyrophosphate recognition formed more than one hydrogen bond because they interacted with other oxygen atoms from TPP, being C67·TPP–O1 and G68·TPP–O5 examples of these interactions, with occupancies of 41% and 30%, respectively (Table 3).

Principal component analysis suggests different dominant motions in apo and holo states

To identify statistically relevant motions of TPP riboswitches in solution, we performed principal component analysis (PCA) on the snapshots obtained from the MD trajectories. Overall, the first three components (named PC1, PC2, and PC3) captured the dominant motions, presenting the highest contributions to total fluctuations. The first three PCs accounted for 54.9% and 52.6% of the overall variance in apo TPPsw^ec and TPPsw^at, respectively. In the holo states, the contribution of the first three PCs was slightly higher: 61.7% and 58.7% for TPPsw^ec and TPPsw^at, respectively.

We compared the projections of the trajectories onto the subspace spanned by the first three principal components. RMSFs and structural projections along the three PCs showed that the four systems displayed a uniform and overlapping PC subspace (Fig. 5).

Inspection of the atomic fluctuations along PC1 revealed substantial higher flexibility of the P3 helix in the apo systems, confirming its stabilization by ligand binding. However, this region presented higher fluctuations along PC2 in the holo state in the TPPsw^ec system, but not in TPPsw^at. Therefore, the stabilization of the most statistical relevant motions of P3 helix inducted by ligand binding was undoubtedly more pronounced in TPPswat, as lower fluctuations were noticed along both principal components. Furthermore, TPP binding resulted in the augmented flexibility of the P1 helix. In the holo TPPsw^ec, the ligand promoted an increase in the flexibility of nucleotides 34–37 of the P2 helix that was unseen in TPPsw^at.

Correlation network analysis reveals distinct responses to ligand binding in TPPsw^ec and TPPsw^at

We analyzed the correlations between pairs of nucleotides to investigate how TPP binding affects the dynamic couplings in TPPsw^ec and TPPsw^a. We calculated the dynamic cross-correlation matrices (DCCM) for each simulated system, as described in the methods section. A similar cross-correlation pattern was observed for both systems in their apo states. However, both the extent of regions displaying anticorrelations and their magnitudes were greater in TPPsw^ec, mainly in the P3–P4-P5–L5 region (Fig. 6).

It was observed that TPP binding stabilized the P3-L5 interaction in TPPsw^ec, as evidenced by weaker anticorrelations, which are likely to be associated with the separation of these regions in the apo state. This result is in line with PCA revealing increased stability of P3 motions in the holo state (Fig. 5). Interestingly, no similar trend was observed in TPPsw^at. In TPPsw^at system, TPP binding resulted in increased anticorrelations at the P1-P2 helices, indicating a possible destabilization of interactions that ultimately resulted in a larger separation between them. In contrast, the dynamic coupling pattern in this region was not altered in TPPsw^ec.

Next, we performed a correlation network analysis by constructing weighted graphs in which each residue was represented by a single node, and the weight of the connection between pairs of nodes was proportional to their respective previously calculated correlation coefficients. To quantify the relative importance of each residue in the network, we computed the betweenness centrality per nucleotide for each simulated system (Fig. 7). This metric is used to identify critical communication nodes over the network. Residues presenting high betweenness values are considered “bottlenecks” of information as they are found in the shortest communication paths²⁸.

We calculated the square inner product (SIP) to compare the overall similarity of the betweenness centrality profiles calculated for the apo and holo states. According to this analysis, high SIP values are associated with weak modulation of intramolecular communication as a consequence of the TPP binding. We obtained a higher SIP for TPPsw^at (0.89) than for TPPsw^ec (0.65), thus reinforcing more noticeable TPP related effects in TPPsw^ec systems. In both holo systems, we noted increased centrality values at the P4-P5 helices. Interestingly, the nucleotides G27, C28, and A59 were critical for P3–L5 interaction and displayed higher betweenness values in holo TPPsw^ec, indicating that ligand binding favors an efficient communication through these nucleotides (Fig. 7). This feature was not observed in TPPsw^at, in which C26 and A58 centrality values were higher in the apo state.

Communication pathways between P3-L5

To get a deeper understanding about the critical residues governing P3-L5 interactions, we computed the 1000 shortest paths between G27(25) [located in P3] and A59(57) [located in L5] (Fig. 8). The normalized node degeneracy metric reveals the percentage of paths accessing each node. We observed for all systems conserved critical residues for communication with degeneracy values > 0.35. While the majority of these residues belongs to L5 (U58(56), A60(58) and G61(59)), two of them (C12(14) and C28(26)) are located in P3 (Fig. 8A).

The distribution of node degeneracies obtained for TPPsw^ec was narrower at the P3 region in the holo state, indicating that TPP binding restricted the presence of a few residues in the shortest paths. Whereas a larger number of residues are accessed in the apo state, the shortest paths were mostly formed by nucleotides 59–61 in the holo state. In contrast, the distributions obtained for both TPPsw^at states were very similar (Fig. 8A).

We calculated the number of nodes per path to further evaluate and characterize the influence of TPP binding on P3-L5 interactions (Fig. 8B). This analysis was based on the hypothesis that communication involving fewer nodes along the pathway is likely to be more efficient. Indeed, in holo TPPsw^ec, the P3-L5 communication required fewer nodes (Fig. 8B). Interestingly, while TPP binding did not modify the global distributions of betweenness centralities obtained for both species (Fig. 8C left boxes), opposing trends were perceived concerning the average betweenness centrality calculated for the residues participating in shortest paths (Fig. 8C right boxes). However, TPPsw^ec TPP binding resulted in increased centralities for the residues involved in shortest paths, leading to a slight decrease in average betweenness in TPPsw^at.

To support this analysis, we computed the eigenvector centralities for the overall network and the shortest paths (Fig. 8C lower boxes). Again, TPP binding resulted in higher centrality in the shortest paths only for TPPsw^ec. The selection of specific P3 residues imposed by ligand binding in TPPsw^ec resulted in a stronger communication along pathways accessing a selection of neighboring residues with high eigenvector centrality. In agreement with our previous analysis (Figs 6–8), a corresponding effect was not observed in TPPsw^at, which strongly suggests the weaker influence of TPP for an effective communication.

Discussion

Plant and bacterial TPP aptamers share similar core structures and bind to the same ligand. However, minor structural and dynamical differences between them could be found, specially concerning the behavior of P3 helix. Bacteria and archaea commonly have a P3a stem¹⁸, which is not observed in eukaryotic riboswitches. The eukaryotic P3 stem is significantly variable in length, sequence, and base pairings¹⁹. Particularly in plants, the length of distal P3 extension varies among TPP aptamer representatives of the same species, as observed in Physcomitrella patens⁸. The P3 distal portion is not required for ligand binding of L5-P3 interaction^5,29, but might act as an anchor for the aptamer as it was already pointed out by Anthony et al.³⁰. Also, the authors claim that the correct folding could help in the competition with other RNA structures with different regulation mechanisms.

However, despite that the P3 stem is significantly variable in length in plants, the TPP aptamer is structurally stable. This stability might lead to slower arm movement than the observed in the helix arm of E. coli TPP riboswitch³¹. Cross-correlation analysis corroborated this hypothesis because stronger negative correlations were noticed in the apo TPPsw^ec involving substructures P3–P4-P5–L5 (Fig. 6). Our findings also suggest that communication pathways between P3-L5 may be different in E. coli and A. thaliana. The communication between P3-L5 in TPPsw^ec can be very efficient in the holo state, while in TPPsw^at the corresponding effect was weakened, thus suggesting a slower response to TPP binding in plants than in bacteria (Fig. 8).

The A. thaliana crystallographic structure used as starting point for our simulations contains a shortened P3 stem formed by 14 nucleotides. On the other hand, the corresponding structure in E. coli is composed of 18 nucleotides. The P3 helix of TPPsw^at, although smaller than the one in TPPsw^ec, showed no significant modifications in the presence of the ligand, indicating that the size of P3 can be oblivious to plants and its influence about slow folding can be negligible.

Guedich et al. wondered whether the slow TPPsw^at folding would be related to a single nucleotide. The authors concluded that U35, located on the P2 helix, is crucial for shaping a TPP-binding competent riboswitch³². In our analysis, the equivalent pyrimidine nucleotide in TPPsw^ec is U36 (Fig. 1B). The magnitude of the fluctuations at this position was 2-fold higher in the holo state than in the apo TPPsw^ec state (Fig. 3). In contrast, in the TPPsw^at system, similar fluctuations were perceived regardless of a ligand binding. Furthermore, PCA data also supported these outcomes by showing that the segment 34–37 of the holo TPPsw^ec displayed the most significant motion amplitude along PC1 (Fig. 5).

Grounded on these findings, we hypothesize that different interactions found in the microenvironment surrounding nucleotide U36 of TPPsw^ec (and U35 in TPPsw^at) are related to different TPP responses. In TPPsw^ec, this nucleotide is neighbored at 3′ by a non-canonical A37-G9 base pair. A similar context is observed for U35 in TPPsw^at, which is delimited by a non-canonical G34-G11 base pair but on 5′ instead. Nucleotides G9 and G11 of TPPsw^ec and TPPsw^at, respectively, form hydrogen bonds with the aminopyrimidine ring of TPP. Interestingly, our simulations have shown that hydrogen bond occupancy between G9(11) and N2 of TPP was less than a half for TPPsw^at (25.67%) than for TPPsw^ec (55.58%). This suggests that slight differences in the environment may directly interfere the TPP-aptamer interaction stability.

Finally, TPP riboswitches of Arabidopsis thaliana present subtler and slower regulation mechanisms than Escherichia coli^30,31,32. Here, we have shown through molecular dynamics simulations and networking analysis that minor structural differences in the aptamer enable enhanced intramolecular communication in the presence of TPP in TPPsw^ec, but not in TPPsw^at. Weaker responses to changes in the TPP concentration may be related to the autotrophic mode of nutrition, which demands the endogenous synthesis of thiamine. Unlike in plants, bacteria can grow under thiamine-rich conditions allowing them to satisfy their full demand for compounds like thiamine exogenously⁸. Taken together, our results provide new insights into RNA behavior of TPP riboswitch, which may have adapted in a different way to the different metabolic demands of each group of organisms to accomplish distinct TPP binding modulation.

Materials and Methods

Analysis of crystallographic structures

For this study, TPP riboswitch 3D-structures of Escherichia coli (TPPsw^ec) and Arabidopsis thaliana (TPPsw^at) obtained by X-ray crystallography were selected from the Protein Data Bank (PDB)³³. Both structures are bound to TPP and present high-resolution crystal, of 2.05 Å (PDB ID: 2GDI²⁷) and 2.25 Å (PDB ID: 3D2G²⁹), for TPPsw^ec and TPPsw^at, respectively. Corresponding sequence and secondary structures information were also taken from the PDB files and analyzed with 3DNA software suite³⁴. TPPsw^ec and TPPsw^at sequences were then aligned using the SARA-Coffee mode of T-Coffee program³⁵, and figures of sequence alignment were rendered using ALINE³⁶. The secondary structure calculated on the basis of the PDB files of the crystallographic structures was compared through the SimTree server²⁰. SimTree returns a similarity score and a mapping between the similar regions of the two structures in dot-bracket notation. Graphical representation of 2D and 3D structures were generated using VARNA³⁷ and UCSF Chimera, respectively.

Molecular dynamics simulations

Molecular dynamics (MD) simulations were carried out using the GROMACS version 5.1.2 package³⁸, and RNA interactions were represented using the amber 14sb³⁹ force field with parmbsc0 + chiOL3⁴⁰. Bonded and Lennard-Jones molecular parameters for TPP have been obtained using the Generalized Amber force field (GAFF)⁴¹ and AM1-BCC⁴² tools while atomic partial charges were added using ANTECHAMBER⁴³. ACPYPE⁴⁴ program was employed to create a GROMACS compatible topology file. Electrostatic interactions were treated using the particle mesh Ewald (PME) algorithm with a cut-off of 10 Å.

MD trajectories were monitored to investigate possible differences in the dynamical behavior between apo and holo TPPsw^ec and TPPsw^at. In the apo systems, TPP was removed from the X-ray crystal structure and replaced with solvent water. Two initially positioned magnesium ions in the crystal structure were kept in both the apo and holo systems and also contributed to neutralize the systems. These ions are essential for ligand binding and confer stability to the riboswitch as well⁴⁵. Each system was simulated under periodic boundary conditions in a triclinic box whose dimensions were automatically defined considering a distance of 1 nm from the outermost RNA atoms in all cartesian directions. The simulation box was filled with TIP3P water molecules⁴⁶.

Simulations were performed in three stages: (i) Energy minimization, (ii) thermalization and equilibration, and (iii) trajectory production.

Energy minimization procedure was performed through 5000 steps and a gradient tolerance <1.0 kJ mol⁻¹ nm⁻¹ of the steepest descent and conjugate-gradient algorithms. These steps were carried out with heavy atom restraints by applying a harmonic potential with a force constant of 1000 kJ mol⁻¹ nm⁻² for the steepest descent algorithm. Applications of the conjugate-gradient algorithm do not allow the application of restraints.

In the second phase, starting atomic velocities were assigned to all atoms of the system using a Maxwell-Boltzmann distribution, corresponding to an initial temperature of 20 K. Then, the systems were gradually heated up to 300 K over 500 picoseconds (ps) utilizing the Langevin thermostat. During this stage, all heavy atoms were harmonically restrained by applying a constant force of 1000 kJ mol⁻¹ nm⁻².

Systems were subsequently equilibrated during twenty successive 100 ps long equilibration simulations where position restraints progressively approached zero. After this period, the systems were simulated with no restraints all at 300 K for 1 µs (trajectory production). All simulations were performed in the NPT ensemble. The V-rescale thermostat and Berendsen barostat were used for temperature (300 K) and pressure control (1 atm), respectively.

Trajectory analysis

As in the trajectory analysis, we were interested only in the structural aspects of the systems, regardless the temporal correlation. Two independent MD simulations were concatenated, and trajectory analyses were conducted for crystal structures systems.

To investigate structural changes of the TPP aptamers, root-mean-square deviation (RMSD) values were calculated separately for the whole RNA and its substructures after fitting to their respective parts, taking the initial structure of the production dynamics as a reference. Hydrogen bond formation was defined using a geometric criterion with VMD software. It was considered a hit when the distance between two polar heavy atoms, with at least one hydrogen atom attached, was less than 3.5 Å using a D-Ĥ-A angle cutoff of 30°. Motif Identifier for Nucleic acids Trajectory (MINT)⁴⁷ program was used to evaluate the number of Watson–Crick (WC)-edge and non-WC-edge hydrogen bonds (and their sum) per nucleotide throughout the simulations.

Principal components analysis

The study of large-scale domain motions is essential for characterizing the conformational dynamics of macromolecules. Functional motions are usually described by a few numbers of degrees of freedom that can be calculated using principal component analysis (PCA)^48,49. PCA analysis was carried out for all systems using Bio3D⁵⁰ library as implemented in R⁵¹. Rotational and translational motions were removed before calculation of the covariance matrix by least-squares superposition to the corresponding average structures. The 3 N × 3 N covariance matrices of C5′ atomic positions (Cartesian coordinates) were then calculated for each state. The conformations explored during the MD simulations were applied using hierarchical clustering in R (hclust) with the complete linkage method based on the PC1-PC2 subspace, where PC1 and PC2 denote the projections onto the two first eigenvectors.

Correlation network analysis

The cross-correlation and network analyses were carried out using the Bio3D and the igraph R packages⁵². Initially, the dynamic cross-correlation matrices (DCCM) were calculated separately for each simulation using as inputs the corresponding MD trajectory superimposed onto the initial structure. Then, each group of two matrices per riboswitch state was utilized to obtain a consensus matrix. A proximity/contact map filter was applied in the construction of the correlation network for residues that remained within 4.5 Å from one another for at least 75% of simulation time. Briefly, graphs were obtained considering C5′ atoms as nodes and the connection between nodes i and j were weighted using the absolute values of cross-correlations (C_(i,j)) coefficients (Equation 1):

$${w}_{(i,j)}=-\,\mathrm{log}(|{C}_{(i,j)}|).$$

(1)

We also calculated the relative importance of each node for communication using centrality measures. According to the definition of the betweenness centrality (Equation 2), the relevance of a given node is defined by its presence in shortest communication paths connecting nodes over the entire network⁵³.

$${c}_{B}(n)=\sum _{i\ne j\ne k\in N}\frac{\sigma (i,j|n)}{\sigma (i,j)}$$

(2)

According to the above equation, the betweenness centrality of a node depends on the total number of the shortest paths between nodes i and j that pass-through n ($\sigma (i,j|n)$ and $\sigma (i,j)$, which is the total number of shortest paths between nodes i and j (regardless of whether they cross or not through n).

Another measure of centrality can be given by the eigenvector (Equation 3) that accounts for the global relevance of each residue based on the connections with neighboring nodes. In other words, nodes with high eigenvector centrality are those connected to other central residues⁵⁴.

$${c}_{Ei}(v)=\alpha \sum _{\{u,v\}\in E}{c}_{Ei}(u)$$

(3)

The vector ${c}_{Ei}={({c}_{Ei}(1),\ldots ,{c}_{Ei}({N}_{v}))}^{T}$ is obtained as a solution to the eigenvalue problem $A{c}_{Ei}={\alpha }^{-1}{c}_{Ei}$, where A is the adjacency matrix for the network graph G. More mathematical details can be found at Kolaczyk⁵⁴.

The Square inner product (SIP) (Equation 4) was used to compare the overall similarity of the centrality profiles calculated for the systems. It varies between 0 and 1 and is defined as

$$SIP=\frac{{({w}_{A}^{T}{w}_{B})}^{2}}{({w}_{A}^{T}{w}_{A})({w}_{B}^{T}{w}_{B})}$$

(4)

where w_A and w_B are N-length vectors containing the fluctuation value for each atom in proteins A and B, respectively⁵⁵.

The Yen’s algorithm⁵⁶ was used to calculate the shortest pathways connecting two nodes in the network. Path lengths are defined as the sum of the edge weights connecting a pair of nodes in a given pathway. The first 1000 shortest paths were collected and employed to calculate the node degeneracy value, which represents the percentage of pathways from the overall ensemble in which a given node is present.

References

Edwards, A. L. & Batey, R. T. Riboswitches: A common RNA regulatory element. Nat. Educ. 3, 9 (2010).
Google Scholar
Breaker, R. R. Riboswitches and the RNA world. Cold Spring Harb. Perspect. Biol. 4, a003566 (2012).
Article Google Scholar
Miranda-Ríos, J., Navarro, M. & Soberón, M. A conserved RNA structure (thi box) is involved in regulation of thiamin biosynthetic gene expression in bacteria. Proc. Natl. Acad. Sci. USA 98, 9736–41 (2001).
Article ADS Google Scholar
Nudler, E. & Mironov, A. S. The riboswitch control of bacterial metabolism. Trends Biochem. Sci. 29, 11–17 (2004).
Article CAS Google Scholar
Thore, S., Leibundgut, M. & Ban, N. Structure of the eukaryotic thiamine pyrophosphate riboswitch with its regulatory ligand. Science (80-.). 312, 1208–1211 (2006).
Article ADS CAS Google Scholar
Mandal, M. & Breaker, R. R. Adenine riboswitches and gene activation by disruption of a transcription terminator. Nat. Struct. Mol. Biol. 11, 29–35 (2004).
Article CAS Google Scholar
Fuchs, R. T., Grundy, F. J. & Henkin, T. M. The SMK box is a new SAM-binding RNA for translational regulation of SAM synthetase. Nat. Struct. Mol. Biol. 13, 226–233 (2006).
Article CAS Google Scholar
Wachter, A. et al. Riboswitch control of gene expression in plants by splicing and alternative 3′ end processing of mRNAs. Plant Cell Online 19, 3437–3450 (2007).
Article CAS Google Scholar
Mandal, M. & Breaker, R. R. Gene regulation by riboswitches. Nat. Rev. Mol. Cell Biol. 5, 451–463 (2004).
Article CAS Google Scholar
Tucker, B. J. & Breaker, R. R. Riboswitches as versatile gene control elements. Curr. Opin. Struct. Biol. 15, 342–348 (2005).
Article CAS Google Scholar
Winkler, W., Nahvi, A. & Breaker, R. R. Thiamine derivatives bind messenger RNAs directly to regulate bacterial gene expression. Nature 419, 952–956 (2002).
Article ADS CAS Google Scholar
Ontiveros-Palacios, N. et al. Molecular basis of gene regulation by the THI-box riboswitch. Mol. Microbiol. 67, 793–803 (2008).
Article CAS Google Scholar
Kubodera, T. et al. Thiamine-regulated gene expression of Aspergillus oryzae thiA requires splicing of the intron containing a riboswitch-like domain in the 5′-UTR. FEBS Lett. 555, 516–520 (2003).
Article CAS Google Scholar
Li, S. & Breaker, R. R. Eukaryotic TPP riboswitch regulation of alternative splicing involving long-distance base pairing. Nucleic Acids Res. 41, 3022–3031 (2013).
Article CAS Google Scholar
Croft, M. T., Moulin, M., Webb, M. E. & Smith, A. G. Thiamine biosynthesis in algae is regulated by riboswitches. Proc. Natl. Acad. Sci. 104, 20770–20775 (2007).
Article ADS CAS Google Scholar
Bocobza, S. et al. Riboswitch-dependent gene regulation and its evolution in the plant kingdom Riboswitch-dependent gene regulation and its evolution in the plant kingdom. 2874–2879, https://doi.org/10.1101/gad.443907 (2007).
Nawrocki, E. P. et al. Rfam 12.0: Updates to the RNA families database. Nucleic Acids Res. 43, D130–D137 (2015).
Article CAS Google Scholar
Rodionov, D. A., Vitreschak, A. G., Mironov, A. A. & Gelfand, M. S. Comparative genomics of thiamin biosynthesis in procaryotes. New genes and regulatory mechanisms. J. Biol. Chem. 277, 48949–48959 (2002).
Article CAS Google Scholar
Sudarsan, N., Barrick, J. E. & Breaker, R. R. Metabolite-binding RNA domains are present in the genes of eukaryotes. 644–647, https://doi.org/10.1261/rna.5090103.644 (2003).
Eden, E., Wallach, I. & Yakhini, Z. SimTree: A Tool for Computing Similarity Between RNA Secondary Structures, http://bioinfo.cs.technion.ac.il/SimTree/ (2005).
Gong, Z., Zhao, Y., Chen, C., Duan, Y. & Xiao, Y. Insights into ligand binding to preQ. 1 riboswitch aptamer from molecular dynamics simulations. PLoS One 9, 1–12 (2014).
Google Scholar
Aytenfisu, A. H., Liberman, J. A., Wedekind, J. E. & Mathews, D. H. Molecular mechanism for preQ. 1 -II riboswitch function revealed by molecular dynamics. Rna 21, 1898–1907 (2015).
Article CAS Google Scholar
Wang, W. et al. Dynamics Correlation Network for Allosteric Switching of PreQ. 1 Riboswitch. Sci. Rep. 6, 31005 (2016).
Article ADS CAS Google Scholar
Hanke, C. A. & Gohlke, H. Ligand-mediated and tertiary interactions cooperatively stabilize the P1 region in the guanine-sensing riboswitch. PLoS One 12, 1–29 (2017).
Article Google Scholar
Lang, K., Rieder, R. & Micura, R. Ligand-induced folding of the thiM TPP riboswitch investigated by a structure-based fluorescence spectroscopic approach. Nucleic Acids Res. 35, 5370–5378 (2007).
Article CAS Google Scholar
Kulshina, N., Edwards, T. E. & Ferré-D’Amaré, A. R. Thermodynamic analysis of ligand binding and ligand binding-induced tertiary structure formation by the thiamine pyrophosphate riboswitch. RNA 16, 186–196 (2010).
Article CAS Google Scholar
Serganov, A., Polonskaia, A., Phan, A. T., Breaker, R. R. & Patel, D. J. Structural basis for gene regulation by a thiamine pyrophosphate-sensing riboswitch. Nature 441, 1167–1171 (2006).
Article ADS CAS Google Scholar
Sethi, A., Eargle, J., Black, A. A. & Luthey-Schulten, Z. Dynamical networks in tRNA:protein complexes. Proc. Natl. Acad. Sci. 106, 6620–6625 (2009).
Article ADS CAS Google Scholar
Thore, S., Frick, C. & Ban, N. Structural basis of thiamine pyrophosphate analogues binding to the eukaryotic riboswitch. J. Am. Chem. Soc. 130, 8116–8117 (2008).
Article CAS Google Scholar
Anthony, P. C., Perez, C. F., Garcia-Garcia, C. & Block, S. M. Folding energy landscape of the thiamine pyrophosphate riboswitch aptamer. Proc. Natl. Acad. Sci. 109, 1485–1489 (2012).
Article ADS CAS Google Scholar
Duesterberg, V. K., Fischer-Hwang, I. T., Perez, C. F., Hogan, D. W. & Block, S. M. Observation of long-range tertiary interactions during ligand binding by the TPP riboswitch aptamer. Elife 4, 1–17 (2015).
Article Google Scholar
Guedich, S. et al. Quantitative and predictive model of kinetic regulation by E. coli TPP riboswitches. RNA Biol. 13, 373–390 (2016).
Article Google Scholar
Berman, H. M. et al. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Article ADS CAS Google Scholar
Lu, X.-J. & Olson, W. K. 3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures. Nat. Protoc. 3, 1213–1227 (2008).
Article CAS Google Scholar
Di Tommaso, P. et al. SARA-Coffee web server, a tool for the computation of RNA sequence and structure multiple alignments. Nucleic Acids Res. 42, W356–60 (2014).
Article Google Scholar
Bond, C. S. & Schüttelkopf, A. W. ALINE: A WYSIWYG protein-sequence alignment editor for publication-quality alignments. Acta Crystallogr. Sect. D Biol. Crystallogr. 65, 510–512 (2009).
Article CAS Google Scholar
Blin, G., Denise, A., Dulucq, S., Herrbach, C. & Touzet, H. VARNA: Interactive drawing and editing of the RNA secondary structure. IEEE/ACM Trans. Comput. Biol. Bioinforma. 7, 309–322 (2010).
Article CAS Google Scholar
Abraham, M. J. et al. Gromacs: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015).
Article ADS Google Scholar
Maier, J. A. et al. ff14SB: Improving the Accuracy of Protein Side Chain and Backbone Parameters from ff99SB. J. Chem. Theory Comput. https://doi.org/10.1021/acs.jctc.5b00255 (2015).
Pérez, A. et al. Refinement of the AMBER force field for nucleic acids: Improving the description of α/γ conformers. Biophys. J. https://doi.org/10.1529/biophysj.106.097782 (2007).
Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A. & Case, D. A. Development and testing of a general amber force field. J. Comput. Chem. 25, 1157–1174 (2004).
Article CAS Google Scholar
Jakalian, A., Jack, D. B. & Bayly, C. I. Fast, efficient generation of high-quality atomic charges. AM1-BCC model: II. Parameterization and validation. J. Comput. Chem. 23, 1623–1641 (2002).
Article CAS Google Scholar
Wang, J., Wang, W., Kollman, P. A. & Case, D. A. Automatic atom type and bond type perception in molecular mechanical calculations. J. Mol. Graph. Model. 25, 247–260 (2006).
Article ADS Google Scholar
Sousa da Silva, A. W. & Vranken, W. F. ACPYPE - AnteChamber PYthon Parser interfacE. BMC Res. Notes 5, 367 (2012).
Article Google Scholar
Leipply, D. & Draper, D. E. Effects of Mg²⁺ on the Free Energy Landscape for Folding a Purine Riboswitch RNA. Biochemistry 50, 2790–2799 (2011).
Article CAS Google Scholar
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
Article ADS CAS Google Scholar
Górska, A., Jasiński, M. & Trylska, J. MINT: Software to identify motifs and short-range interactions in trajectories of nucleic acids. Nucleic Acids Res. 43(17), e114 (2015).
Article Google Scholar
Wold, S., Esbensen, K. & Geladi, P. Principal component analysis. Chemom. Intell. Lab. Syst. 2, 37–52 (1987).
Article CAS Google Scholar
Abdi, H. & Williams, L. J. Principal component analysis. Wiley Interdisciplinary Reviews: Computational Statistics 2, 433–459 (2010).
Article Google Scholar
Grant, B. J., Rodrigues, A. P. C., ElSawy, K. M., McCammon, J. A. & Caves, L. S. D. Bio3d: an R package for the comparative analysis of protein structures. Bioinformatics 22, 2695–2696 (2006).
Article CAS Google Scholar
R Development Core Team. R: A Language and Environment for Statistical Computing. R Found. Stat. Comput. Vienna, Austria 0, {ISBN} 3-900051-07-0 (2016).
Csárdi, G. & Nepusz, T. The igraph software package for complex network research. InterJournal, Complex Systems. 1695, 1–9 (2006).
Google Scholar
Kolaczyk, E. D. & Csárdi, G. Statistical Analysis of Network Data with R., https://doi.org/10.1007/978-0-387-88146-1 (2014).
Kolaczyk, E. D. Statistical Analysis of Network Data. Learning 26 (2009).
Skjærven, L., Yao, X.-Q., Scarabelli, G. & Grant, B. J. Integrating protein structural dynamics and evolutionary analysis with Bio3D. BMC Bioinformatics 15, 399 (2014).
Article Google Scholar
Yen, J. Y. Finding the K Shortest Loopless Paths in a Network. Manage. Sci. 17, 712–716 (1971).
Article MathSciNet Google Scholar
Laskowski, R. A. & Swindells, M. B. LigPlot+: Multiple ligand-protein interaction diagrams for drug discovery. J. Chem. Inf. Model. https://doi.org/10.1021/ci200227u (2011).

Download references

Acknowledgements

This work was supported by the Fundação Carlos Chagas Filho de Amparo à Pesquisado Estado do Rio de Janeiro (FAPERJ) [E-26/111.401/2013 and E-26/203.428/2015 to E.R.C., E-26/203.229/2016 to F.P.); Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) [Papes VI 407741/2012-7 to E.R.C.]. This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001 [Ph.D. scholarships to D.A. and N.A.N.J.].

Author information

Authors and Affiliations

Computational Biophysics and Molecular Modeling Group. Scientific Computing Program (PROCC), Fundação Oswaldo Cruz, Rio de Janeiro, 21040-222, Brazil
Deborah Antunes, Mauricio Garcia de Souza Costa & Ernesto Raul Caffarena
Laboratory of Functional Genomics and Bioinformatics, Oswaldo Cruz Institute, Fundação Oswaldo Cruz, Rio de Janeiro, 21040-360, Brazil
Natasha Andressa Nogueira Jorge & Fabio Passetti
Laboratory of Gene Expression Regulation, Carlos Chagas Institute, Fundação Oswaldo Cruz, Curitiba, 81350-010, Brazil
Fabio Passetti

Authors

Deborah Antunes
View author publications
You can also search for this author in PubMed Google Scholar
Natasha Andressa Nogueira Jorge
View author publications
You can also search for this author in PubMed Google Scholar
Mauricio Garcia de Souza Costa
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Passetti
View author publications
You can also search for this author in PubMed Google Scholar
Ernesto Raul Caffarena
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D.A. participated in the design research, performed calculations, analysed the results and wrote the article; N.A.N.J. contributed in the discussed results; M.G.S.C. analysed the results and helped writing the article; F.P. and E.R.C. participated in the design research, discussed results, helped writing the article. All authors reviewed the manuscript.

Corresponding author

Correspondence to Ernesto Raul Caffarena.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Figures

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Antunes, D., Jorge, N.A.N., Garcia de Souza Costa, M. et al. Unraveling RNA dynamical behavior of TPP riboswitches: a comparison between Escherichia coli and Arabidopsis thaliana. Sci Rep 9, 4197 (2019). https://doi.org/10.1038/s41598-019-40875-1

Download citation

Received: 21 September 2018
Accepted: 19 February 2019
Published: 12 March 2019
DOI: https://doi.org/10.1038/s41598-019-40875-1

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.