Abstract
The 5′-untranslated region of the hepatitis C virus genome contains an internal ribosome entry site (IRES) that initiates cap-independent translation of the viral RNA. Until now, the structural characterization of the entire (IRES) remained limited to cryo-electron microscopy reconstructions of the (IRES) bound to different cellular partners. Here we report an atomic model of free full-length hepatitis C virus (IRES) refined by selection against small-angle X-ray scattering data that incorporates the known structures of different fragments. We found that an ensemble of conformers reproduces small-angle X-ray scattering data better than a single structure suggesting in combination with molecular dynamics simulations that the hepatitis C virus (IRES) is an articulated molecule made of rigid parts that move relative to each other. Principal component analysis on an ensemble of physically accessible conformers of hepatitis C virus (IRES) revealed dominant collective motions in the molecule, which may underlie the conformational changes occurring in the (IRES) molecule upon formation of the initiation complex.
Similar content being viewed by others
Introduction
Hepatitis C virus (HCV) chronically infects over 170 million people worldwide and kills >350,000 people every year1. Its positive-strand RNA genome serves as a template for synthesis of a single precursor polypeptide that is subsequently processed by viral and cellular proteases1. The initiation of protein synthesis by the host–cell translation machinery occurs by a cap-independent mechanism2. The 5′- untranslated region of the viral RNA genome contains an internal ribosome entry site (IRES), encompassing nucleotides 39 to 3713,4,5.
This IRES specifically recruits the 40S ribosomal subunit to the start AUG codon without scanning6,7,8. Next, the IRES-40S complex interacts with the eukaryotic multi-subunit initiation factor 3 (eIF3) and the ternary Met-transfer RNA (tRNAi)-eIF2-GTP complex, forming a 48S IRES-preinitiation complex. GTP hydrolysis triggers the release of the initiation factors and the binding of the 60S ribosomal subunit, leading to the assembly of an elongation-competent IRES-80S initiation complex9,10,11,12.
The IRES RNA is predicted to consist of three structural domains forming two long stem loops (domains II and III) and a spur (domain IV) (Supplementary Fig. S1)13. The molecule contains tertiary fold motifs, the IIIabc four-way junction and the IIIef/IV pseudoknot4,14,15, and adopts a defined ion-dependent fold under physiological conditions16, although it does not form a compact, globular structure13,16,17. The structure at atomic resolution has been determined for several autonomous folding regions, including domain II, the IIIabc junction and, more recently, the IIIef/IV pseudoknot18,19,20,21,22,23,24. The presence of several hairpin loops and regions predicted to be single-stranded likely confers dynamic flexibility to the entire IRES molecule13,16,23 and has so far hampered structure determination of the entire molecule by X-ray crystallography or electron microscopy. As the IRES is essential for viral replication and its sequence is well conserved among all HCV genotypes, its structure represents a novel target for drug design25.
To date, the only structural information available about the architecture of the entire IRES molecule comes from cryo-electron microscopy (cryo-EM) reconstructions of the IRES in complexes with cellular partners26,27,28. In these low-resolution structures, the HCV IRES adopts similar elongated conformations26,27,28. Upon formation of the 48S IRES-preinitiation complex, the basal part of domain III binds to the solvent-exposed side of the ribosomal 40S subunit, whereas the apical part binds to eIF39,12,26 (Supplementary Fig. S1).
Little is known about the structure of the entire IRES in solution13 and an atomic structure of this molecule in its free form would help to rationalize the conformational changes that occur during the assembly of the initiation complex. Characterizing the structure of large flexible macromolecules remains, however, a challenge. Recent advances in small-angle scattering of X-rays (SAXS) or neutrons have turned these methods into powerful tools for characterizing the shape and accessible conformational space of flexible macromolecules and macromolecular assemblies29,30. Combining small-angle scattering data with other structural and biophysical experimental observations or with computational approaches led to the characterization of various proteins, RNAs and multi-molecular complexes29.
Previous biochemical studies and structural characterizations of different RNA fragments suggest that the free HCV IRES is an articulated molecule in which long structured parts are linked by flexible joints16,18,28. Here we built structural models of the entire isolated HCV IRES molecule and characterized its dynamics by SAXS, biomolecular simulations and computational tools. We provide experimental evidence supporting this model and found that the HCV IRES is best represented by an ensemble of conformers and identified collective motions at the origin of this conformational heterogeneity.
Results
SAXS experiments
Recombinant RNA molecules corresponding to the entire IRES region of the HCV genome (nucleotides 39–371) and to four fragments were produced by in vitro transcription and purified in large amount (>10 mg) (Supplementary Fig. S1). The different RNA molecules migrated as single bands on denaturing gels and behaved as monodispersed samples in dynamic light scattering experiments indicating the homogeneity of the preparations (Supplementary Fig. S1).
SAXS intensity was measured as a function of the scattering vector, Q. Direct analysis of these data provides additional sample quality controls and information about the degree of compactness of the particle. In the presence of 2.5 mM Mg2+, the Guinier plot at low Q values, in the range where Q.Rg is lower than 1.5, was linear and independent of RNA concentration indicating monodisperse samples (Fig. 1a). Provided that concentration is known, the molecular mass of the scattering particle can be calculated from the intercept, I(0). The radius of gyration, Rg, which describes the mass distribution of the particle about its centre of gravity, can be calculated from the slope. The molecular masses calculated from I(0) agree with the expected molecular masses calculated from the nucleotide sequences within experimental errors committed on concentration measurements (Supplementary Table S1).
For full-length IRES, the Guinier plot at different concentration was rather linear, but the plot of the residuals showed a slight deviation from linearity (Fig. 1a). Deviations from linearity or concentration dependence of Rg indicate aggregation or intermolecular interactions, but here the relative amplitude of the deviation is about ±0.002 (Fig. 1a), suggesting the absence of significant aggregation.
The radius of gyration of full-length HCV IRES increased with MgCl2 concentration, while I(0) remained constant indicating an ion-dependent conformational change and confirming the requirement of 2.5 mM Mg2+ for proper refolding of the molecule16 (Supplementary Fig. S1). Also, in the presence of 2.5 mM MgCl2, full-length IRES eluted as single peak on a size exclusion chromatography column and migrated as a single band on native gel indicating the absence of aggregation or of alternative folding (Supplementary Fig. S1). The Kratky plot was bell shaped and the Porod-Debye plot tended asymptotically to a constant value indicating that the RNA molecule is compact particles and contain no long disordered regions (Fig. 1b) (31,32,33). Finally, the pair distance distribution functions, P(r), obtained by Fourier transformed of the SAXS profile, is asymmetric with maximum frequency at a distance shorter than half the maximum linear distance (Dmax) between two scattering centres, indicating that the molecules is rather elongated (Fig. 1d).
Assembly and refinement of an atomic structure of HCV IRES
In a first approach, we constructed low-resolution models of the entire IRES and of its different fragments with the programme DAMMIN34,35 (Supplementary Fig. 2). This programme models a macromolecule as an assembly of scattering beads arranged in space such that the calculated scattering curve reproduces the experimental curve. The shape of these optimized bead models were sufficiently irregular for allowing the docking of the known atomic structures (Supplementary Note 1).
In a second approach, we manually constructed an atomic model of the entire IRES that incorporates the known atomic structures (PDB accession codes 1p5o.pdb - domain II; 1kp7.pdb - domain IIIb; 1kh6.pdb - junction IIIabc; 3t4b.pdb - pseudo-knot IIIef/IV; 1f84.pdb - domain IIId, Fig. 2a), and to refine this model, we used a multi-step procedure that generated an ensemble of 8,000 different physically realistic conformations accessible to the RNA molecule in solution. To optimize the diversity of this ensemble, we combined molecular dynamics (MD) simulations in explicit solvent (ESMD), which probes local structural fluctuations, and normal mode analysis (NMA), which explores concerted, large scale motions (Supplementary Fig. S3). We then searched in this pool of conformers the unique model that minimized the discrepancy between the experimental and calculated SAXS curves. A χ-value of 10 was obtained when the calculated SAXS curve for the initial model was compared with the experimental curve, whereas a χ-value of 2.8 was obtained for the best conformer selected in the pool of 8,000 conformers (Fig. 2b). In this selected conformer, domains II and III extend in opposite directions along the main axis of the molecule, whereas subdomain IIId and domain IV protrude on each side of the molecule (Fig. 2c). Along the main axis of the molecule, domain II roughly forms a right angle with domain III (Fig. 2d).
This structural model is compatible with footprint experiments preformed with chemical or enzymatic probes16,36. The accessibility of the backbone phosphorus atoms to a probe of 0.3 nm (37) is in agreement with the pattern of hydroxyl radical footprinting cleavage, predicting protections around the junction IIIabc and the pseudo-knot IIIef/IV, but also in the internal loops of domain II (16) (Fig. 3a). With a larger probe (0.45 nm) that mimics the approach of a nuclease active site, the protection was predicted to be more extensive with remaining accessibility in loops IIb, IIIa, IIIb and IIId, as well as in domain IV, particularly at the location of the AUG codon (Fig. 3b). Also, as observed experimentally, the single-stranded region that connects domain II to domain III was predicted to be protected against both probes.
HCV IRES is an articulated molecule in solution
The HCV IRES contains two regions of tertiary fold corresponding to the four-way junction IIIabc and the pseudo-knot IIIef/IV. Previous data suggested that the isolated four-way junction exists in a dynamic equilibrium between different conformations38, but ESMD simulations showed that the four-way junction is stabilized in the context of the entire domain III and that these folded elements are not major sources of flexibility in the entire IRES molecule (Supplementary Note 2, Supplementary Figs S4, S5 and S6).
To localize the regions of highest flexibility in the entire IRES molecule, we ran 25 ns ESMD simulations (Fig. 4a) and determined the average root-mean square fluctuations over these MD trajectories (Fig. 4b). According to these results, the junction IIIabc, the stem IIIabc/IIId stem, the pseudo-knot IIIef/IV and the basal part of domain II form a rigid backbone scaffold. The apical part of domain II, subdomain IIIa, the apical loop of subdomain IIIb, subdomain IIId, the hairpin loop of domain IV and the terminal single-stranded region of domain IV appear to be the most mobile parts of the structure. The single-stranded region connecting domains II and III exhibits a moderate mobility on this time scale, likely owing to an important frictional drag exerted by elongated domains II and III.
Collective motions in macromolecules often dominate functional dynamics39. In order to identify collective motions in the IRES molecule, we applied principal component analysis to the pool of 8,000 conformations. Principal component analysis is a standard statistical method based on the analysis of covariance that is used in multiple fields for detecting correlations in large data sets. It can be used to analyse MD simulations or ensembles of conformers in order to identify collective motions of atoms in biological macromolecules39. Firstly, the conformers are aligned in the three-dimensional space. Secondly, covariance between atomic displacements is calculated between the different conformers, and analysing these covariance values allows identifying motions involving large numbers of atoms. Generally, a small set of collective motions, the first few principal components (PCs), capture the majority of fluctuations in biological macromolecules, and often these dominant motions have been found to be relevant for biological activities40,41. In a third step, the conformational changes corresponding to motions along each PC can be back calculated and visualized as a series of snapshots taken along the trajectories.
For the 8,000 conformers of HCV IRES, the first two PCs contributed 52% of the motions within the ensemble, whereas the first six PC contributed 83% (Fig. 5a). The first PC corresponds to a motion of closure of the molecule, in which domains II and III remain in the same plane, acting like jaws that move towards and away from each other (Fig. 5b). The second component corresponds to swinging motions, in which domain II and subdomain IIIabc move out of their initial plane, undergoing torsions in opposite directions relative to the central pseudo-knot IIIef/IV (Fig. 5c). The lowest and second lowest frequency modes calculated by normal mode analysis42 corresponded to similar motions in the molecule than the first and second PC, as typically found with other biomolecules43.
Ensemble structure of HCV IRES
On the basis of the small-angle scattering data, the entire molecule was modelled as an ensemble of conformers that accounted for the conformational diversity of the isolated IRES RNA and the coexistence of multiple exchanging conformations in solution44. Starting from the pool of 8,000 physically accessible conformers, we selected sub-ensembles of conformers that collectively reproduced the experimental scattering profile44 (Fig. 6a). The quality of the fit between calculated and experimental scattering curves significantly improved when the number of conformers in the selected ensemble was increased from 1 model to 5 models showing that the IRES molecule is better represented by an ensemble of at least five conformers (χ=1.9) than by a single conformer (χ=2.8) (Fig. 6b). The Rg value of the selected conformers spanned almost the same range (from 6.0 to 8.8 nm) than the initial pool of models (Fig. 6c and Supplementary Fig. S7), suggesting that the IRES RNA samples a large conformational space in solution. Selecting ensembles of >5 conformers provided no significant improvement to the quality of the fit (Fig. 6b).
In order to compare the models constituting one selected ensemble, we used the programme SUPCOMB and we calculated normalized spatial discrepancy (NSD) values. The NSD parameter measure the proximity of two objects in the three-dimensional space and is analogous to the error-weighted χ-value used to characterized deviations within one-dimensional data sets45. Structural alignment can be performed by minimizing this parameter, and the NSD calculated between aligned structures provides a quantitative estimate of the similarity between the models. Typically, an NSD value lower than 0.7 is taken as evidence that the objects are similar. The average NSD value was always larger than 2.0, indicating that the conformers were significantly different (data not shown). Figure 6d show a representative ensemble of five models that highlights the conformational diversity of full-length HCV IRES in solution. The selected conformers of this ensemble and of other independently selected ensembles were predominantly localized in the lower part of the two-dimensional plot showing the distribution of conformers along the first and second PC for the conformational ensemble of 8,000 models (Fig. 5d). This suggests that motions along the first PC dominate the exchanges between the conformers of the RNA molecule in solution and that motions along the second PC are less frequent.
Conformational changes in the IRES upon binding the ribosomal 40S subunit
To compare the IRES structure in solution with that bound to eIF3 and the ribosomal 40S subunit, we searched within the pool of 8,000 physically accessible conformers the structure that best fitted the cryo-EM difference map (Fig. 7). One model could be found for which most of domain III perfectly fitted into the cryo-EM map, but subdomain IIId and domain II were not properly oriented (Fig. 7a). However, by applying simple rotations around two pivot points at the base of domain II and of domain IIId, it was possible to slightly rearrange the IRES atomic structure such that it docked into the EM density of the bound IRES (Fig. 7a). The back calculated scattering curve for this conformer was significantly different from the experimental curve showing that the structure adopted by the HCV IRES in the complex with eIF3 and the 40S subunit is different from the averaged population of conformers in solution (Supplementary Fig. S8). On the two-dimensional plot of PC amplitudes, the model that best matched the bound IRES is localized in the upper left corner (Fig. 5d), suggesting that conformational changes involving motions along the second PC mode with a reorientation of domain II relative to domain III must occur upon binding to its partners.
Discussion
On the basis of SAXS experiments, we built bead models, as well as unique and ensemble atomic models in which the IRES RNA appears to be non-globular. Domain II and domain III form independent long cylindrical stems that extend in different directions from the central IIIef/IV pseudo-knot and constitute the dorsal spine of a structure from which subdomains IIIa, IIIc and IIId and domain IV protrude on the sides. This structural architecture is in agreement with electron microscopy pictures of free HCV IRES13. In addition, the atomic model is in agreement with available biochemical data, in particular, with experimental accessibilities measured for chemical and enzymatic probes16. A comparison of the different modelling approaches highlights the dynamic character of the molecule. Ensemble structures of five or more conformers provide better fits to the experimental data than a single atomic structure suggesting a static or dynamic heterogeneity of the structure, and raising the question of why an ensemble of five conformers is necessary and sufficient to reproduce the experimental curve. Our results argue against static heterogeneity in solution. Indeed, different ensembles of conformers can reproduce the experimental data as well, and if large ensembles were selected no conformer was statistically overrepresented in the population. The small size of the ensemble required for reproducing the scattering curve more likely results from the low information content of SAXS data, reporting only on the pair distance distribution between scattering centres. Because of this low structural resolution, an ensemble of five conformers is sufficient for representing the conformational diversity of a molecule that is continuously sampling an enormous number of conformations under thermal motions44 (Fig. 7a). In consequence, the conformers of one peculiar selected ensemble should not be taken as actual structures of free HCV IRES, but rather as mere examples of structures that collectively represent the molecule in solution.
Molecular dynamics simulations, principal component analysis and ensemble modelling of SAXS data reveal motions in the RNA molecules that may potentially be relevant to the rearrangement of the IRES molecule occurring during the assembly of the 48S preinitiation complex and initiation complex46,47. The selected models that represent the IRES in solution have values of the first component distributed along the entire range sampled in the initial ensemble, whereas they have values of the second component falling in a more limited range of the initial ensemble (Fig. 5d). This suggests that the isolated IRES molecule undergoes motions that mainly correspond to the first PC. Upon binding to its cellular partners, the conformational flexibility of the IRES seems to be significantly reduced leading to the visualization of the molecule in the cryo-EM reconstructions of the different complexes, although the shape and width of the electron density suggest that the IRES RNA conserves some flexibility in its bound state27,28 (Fig. 7c). principal component analysis suggests that the conversion of the conformers most frequently present in solution into the bound conformer requires motions through the second PC (Fig. 7a).
Two limiting mechanisms, conformational selection or conformational change after binding, can be considered for the assembly of the flexible IRES molecule with its partners during the formation of the initiation complex. In the first mechanism, only IRES molecules adopting the bound conformation in solution can bind to their partners. As the bound conformation is accessible to the molecule in solution through its main concerted motions, there is an inherent probability that IRES will adopt this conformation and, therefore, bind to its partners. The bound conformer appears to be lowly populated in solution (Fig. 5d), however, it should be noted that our structural characterization of HCV IRES was carried out in dilute solutions. It is possible that the complex cellular environment affects the conformational properties of the molecule and enhances the population of preorganized conformers capable of binding. In the second mechanism, various conformers of the IRES would bind to the partners and would then undergo conformational changes to adapt their shape to the binding site. It is difficult to distinguish between these two scenarios48, and there is always the possibility that a combination of both mechanisms is involved in the recruitment of the 40S ribosomal subunit and of eIF3 by HCV IRES.
As domains II and IV are required for efficient translation in vitro and in vivo7,26,36 but provide no significant stabilization to the IRES-preinitation complex7,9,12 and that domain III has a slightly higher affinity for the ribosomal 40S subunit than for eIF37,12, it is tempting to speculate that domain III binds first onto the back of the ribosomal 40S subunit, and subsequently to eIF3, and that conformational changes occur in the complex (Fig. 7a). In particular, motions along the second PC may take place in order to correctly position domain II within the tRNA exit (E) site and the mRNA binding cleft of the ribosome and to trigger conformational changes in the 40S subunit and in the IRES26,28,46,47 (Fig. 7a). This would commit the complex to progress towards the conformational reorganization of domain IV that contains the start AUG codon and the correct positioning of this domain into the ribosome decoding centre12. The concerted motions of the IRES revealed in this study, in particular those according to the second PC, could explain this long range communication within the IRES molecule.
Conformational dynamics thus appears as an inherent property of the HCV IRES molecule that may be biologically relevant. The IRES sequence is highly conserved among the genotypes of HCV, and the localization of the variations in our structural model suggests that the dynamics of the molecule is conserved among the different genotypes. Indeed, variations in loops unlikely affect the dynamics and variations in stem regions are always complemented such that base pairing is conserved.
IRES have been found in the RNA genomes of different viruses and in some cellular mRNA49. They have been divided in four groups on the basis of their predicted secondary structure and on their requirements for initiation factors and other cellular factors50. A common feature of all IRES is their inherent flexibility, which seems to increase in correlation with the need for more initiation factors. HCV IRES belongs to class II IRES and requires eIF3, eIF2 and initiator met-tRNA to assemble ribosomes10. We showed here that it is an articulated molecule in which structured elements undergo large reorientations relative to each other16,18,28. By comparison, IRES of group I found in the intergenic region of Dicistroviridae assemble ribosome in the absence of canonical initiation factors and Met-tRNA and undergo only local rearrangements upon formation of the initiation complex51,52,53. The structure of group I IRES is composed of two well-structured regions for which atomic structures have been obtained54,55. Groups III and IV, which require most of the canonical initiation factors, as well as some IRES trans-acting factors for assembling ribosome, exhibit no protection against enzymatic or chemical probes in their free form suggesting that they are more dynamics than HCV IRES50.
Only IRES of groups I and II bind directly to the 40S ribosomal subunit, whereas IRES of groups III and IV interact with the ribosomal subunits only in the form of ribonucleoprotein complexes, and currently, structural data are only available for groups I and II IRES. Despite their differences in structural architecture and dynamics, IRES of groups I and II share similarities in the way they recruit and manipulate ribosomal subunits. The structure of the IRES of both classes is grossly composed of two parts that have similar functions in ribosome assembly. A first part is responsible for tight binding to the ribosomal 40S subunit, although the binding site is localized in different regions of the 40S subunit. Domain II of intergenic region IRES binds to the evolutionary conserved region of the 40S subunit that interacts with the 60S subunit, whereas domain III of HCV binds to the less conserved solvent-exposed backside of the 40S subunit. The second part tinkers with the decoding centre of the ribosome but provides no additional stabilization to the complex with the 40S subunit. Domain III of the intergenic region IRES mimics tRNA in a hybrid state54 and binds to the A site, whereas domain II of HCV IRES binds close to the E site and domain IV inserts into the P site47. In conclusion, both intergenic region IRES and HCV IRES alter the conformation of the 40S subunit upon binding and change of conformation upon binding the 60S subunit. The inherent flexibility of HCV IRES as highlighted in this study may thus have an important role in the assembly and manipulation of the ribosomal subunits and, in the case of HCV, may be controlled by the binding of the initiation factors.
Methods
RNA synthesis by in vitro transcription
cDNA (HCV genotype 1b – NCBI nucleotide accession number AJ238799) corresponding to the different constructs were amplified by PCR. The forward oligonucleotide included an EcoRI site and the T7 RNA polymerase promoter followed by a triple G sequence in order to increase transcriptional activity, and the reverse oligonucleotide included a HindIII site. Each fragment was cloned into pUC18 between the EcoRI and HindIII restriction sites.
RNA corresponding to HCV IRES and its different fragments were prepared by in vitro transcription from linearized plasmids using T7 RNA polymerase as described in Supplementary methods online.
Samples for SAXS experiments were prepared by dissolving 1 mg of RNA in 50 μl of water and incubating the solution for 10 min at room temperature without shaking. Samples were heated for 3 min at 95 °C and incubated for 5 min at room temperature before adding 50 μl of refolding buffer (40 mM Tris–HCl buffer at pH 7.5 containing 200 mM KOAc, 400 mM KCl, 5 mM MgCl2 and 1 mM dithiothreitol). The samples were then incubated for 10 min at room temperature. Concentration was measured by absorbance at 260 nm and was adjusted to 10 mg ml−1.
Small-angle X-ray scattering experiments
SAXS data were collected at the European Synchrotron Radiation Facility (Grenoble, France) on beamlines ID2 and ID14-3 (see the Supplementary methods online for additional details).
Generation of an atomic model of the full-length HCV IRES
An initial atomic model of HCV IRES was reconstructed by using base pairing information from the secondary structure map and the known atomic structures of fragments. The non-natural nucleotides present in some atomic structures were replaced by natural nucleotides. The regions of unknown structure were constructed with COOT56 as stems or loops in accordance with the secondary structure map. The different fragments were progressively assembled; one by one they were correctly positioned in space using the bead model of full-length IRES as a guide and they were connected in VEGA ZZ57. After the assembly of each fragment, local energy minimization was performed in VEGA ZZ to remove potential interatomic clashes between the pre-existing and newly added parts of the structure57. The model of the entire IRES was globally energy-minimized in GROMACS using the AMBER99 force field. This model was then refined by a multiple step protocol, described below, that combines ESMD and NMA to identify a model that minimize the discrepancy between the experimental and calculated scattering curve using the χ-value:
where K is the number of points in the experimental curve, σ is the s.d. and μ is a scaling factor.
Explicit solvent molecular dynamics simulations
All molecular dynamics simulations were performed with the GROMACS 4 software package using different force fields. For the equilibration process and production runs used for refining the model, we used an ensemble of harmonic restraints between the 63 GC base pairs present in the initial model in order to enforce native local secondary structure. The system was equilibrated for 250 ps or 1 ns, and we verified that both potential and kinetic energies had reached equilibrium at the end of this equilibration. Independent production simulations with or without harmonic restraints were obtained by using different initial velocities. The resulting trajectories were analysed using GROMACS routines to extract root mean square deviation (RMSD), root-mean square fluctuations and Rg information (see Supplementary methods online for details and references).
Structure refinement
Starting from the manually assembled structure, an ensemble of models was generated by a multi-step procedure in which we combined ESMD, NMA and molecular modelling (Supplementary Fig. S3). In a first step, starting from the energy-minimized all-atom model, we ran six independent ~20 ns ESMD simulations with the 63 harmonic restraints using the AMBER99 force field. In each MD trajectory, after a rapid rise in the first 2 ns, the RMSD evolved towards a stable value near 2.5 nm and the Rg value decreases. Two-thousand four-hundred conformers were obtained as snapshots taken at regular interval (50 ps) along the six first MD simulations, and the theoretical scattering profiles were calculated from the structures using the programme CRYSOL58 and compared with the experimental data. The best agreement with the experimental curve yielded a χ-value of 6, down from a value of 10 calculated for the initial structure. A summary of the conformational sampling performed during this first refinement step is shown in the form of plots of Rg versus χ-value (Supplementary Fig. S3).
In a second step, in order to expand the sampled conformational space, the best fitting model of each MD simulation was submitted to normal mode calculation (NMA) using the elNémo web-server42. For each of these 6 models, 40 structures were extracted (using an amplitude of motion equal to ±1000 and a step of 50) from each of the five lowest frequency normal modes, yielding 1,200 additional models (Supplementary Fig. S3). The theoretical scattering profiles were compared with the experimental profile. The best agreement with the experimental profile yielded a χ-value of 4.3. To further increase the conformational sampling, the best model selected after NMA was energy-minimized and submitted to four additional ~20 ns ESMD simulations using AMBER99 and the harmonic restraints (Supplementary Fig. S3). One-thousand six-hundred additional conformers were obtained by taking at regular interval (50 ps) along these trajectories and the calculated scattering profile was compared with the experimental data, but no improvement in the fitting was obtained with these models (best agreement yielded a χ-value of 4.0).
Finally, we noticed that reorientations of domain II relative to domain III occurred during the ESMD simulations, but that this type of concerted motion was slow and was thus infrequent on the simulated timescale. To improve the conformation sampling generated by this motion, starting from the best model selected after the last four ESMD with AMBER99, we used the elNémo web-server42 with a large amplitude of motion (±2,000) and selected five snapshots from the lowest frequency mode of motion (4 × 5=20 models). These models were energy-minimized in GROMACS and were used to generate additional models by randomly rotating domain II around a pivot point located in a single-stranded region at the intersection between domains II and III (nucleotides 119–124). For each of the 20 energy-minimized models, 140 models were generated with the torsion-based random conformational search functionality of Another Molecular Mechanics Program (AMMP) implemented in VEGA ZZ, leading to 2,800 additional conformers. A model with a χ-value of 2.8 between the experimental and calculated curve was finally selected using CRYSOL58(Fig. 2).
Six additional ~25 ns ESMD simulations using AMBER99 parmbsc0 were run with or without applying harmonic restraints on the 63 GC base pairs. Three-thousand additional conformers were obtained by taking snapshots at regular interval (50 ps) along these trajectories and their calculated scattering profiles were compared with the experimental data, but no improvement in the fitting was obtained with these models. Although, no major difference was found between the trajectories calculated with the two force fields, only the three trajectories calculated without restraints were used to calculate the root-mean square fluctuations values.
Ensemble optimization
All models generated during the refinement process were pooled into an ensemble of 8,000 models. This pool was used for selecting sub-ensemble that collectively reproduce the SAXS profile using the genetic algorithm GAJOE from ensemble optimization method59. The size of the selected ensemble was varied from 1 to 50 models in order to test the effect of the size of the ensemble on the quality of the fit. The number of generation of the genetic algorithm was adjusted to optimize the χ-value. Models of the optimized ensembles were aligned using the pseudo-knot domain as a reference using GROMACS routines. The quality of the structures constituting the selected ensembles was checked with the MolProbity webserver60. The pdb files of the five conformers forming the ensemble presented in Fig. 6 are available as Supplementary Data 1.
Additional information
How to cite this article: Pérard, J. et al. Structure of the full-length HCV IRES in solution. Nat. Commun. 4:1612 doi: 10.1038/ncomms2611 (2012).
References
Lindenbach, B. D. & Rice, C. M. Unravelling hepatitis C virus replication from genome to function. Nature 436, 933–938 (2005).
Fraser, C. S. & Doudna, J. A. Structural and mechanistic insights into hepatitis C viral translation initiation. Nat. Rev. Microbiol. 5, 29–38 (2007).
Tsukiyama-Kohara, K., Iizuka, N., Kohara, M. & Nomoto, A. Internal ribosome entry site within hepatitis C virus RNA. J. Virol. 66, 1476–1483 (1992).
Honda, M., Beard, M. R., Ping, L. H. & Lemon, S. M. A phylogenetically conserved stem-loop structure at the 5′ border of the internal ribosome entry site of hepatitis C virus is required for cap-independent viral translation. J. Virol. 73, 1165–1174 (1999).
Wang, C., Sarnow, P. & Siddiqui, A. Translation of human hepatitis C virus RNA in cultured cells is mediated by an internal ribosome-binding mechanism. J. Virol. 67, 3338–3344 (1993).
Pestova, T. V. et al. Molecular mechanisms of translation initiation in eukaryotes. Proc. Natl Acad. Sci. USA 98, 7029–7036 (2001).
Otto, G. A. & Puglisi, J. D. The pathway of HCV IRES-mediated translation initiation. Cell 119, 369–380 (2004).
Lytle, J. R., Wu, L. & Robertson, H. D. Domains on the hepatitis C virus internal ribosome entry site for 40s subunit binding. RNA 8, 1045–1055 (2002).
Kieft, J. S., Zhou, K., Jubin, R. & Doudna, J. A. Mechanism of ribosome recruitment by hepatitis C IRES RNA. RNA 7, 194–206 (2001).
Pestova, T. V., Shatsky, I. N., Fletcher, S. P., Jackson, R. J. & Hellen, C. U. A prokaryotic-like mode of cytoplasmic eukaryotic ribosome binding to the initiation codon during internal translation initiation of hepatitis C and classical swine fever virus RNAs. Genes Dev. 12, 67–83 (1998).
Sizova, D. V., Kolupaeva, V. G., Pestova, T. V., Shatsky, I. N. & Hellen, C. U. Specific interaction of eukaryotic translation initiation factor 3 with the 5′ nontranslated regions of hepatitis C virus and classical swine fever virus RNAs. J. Virol. 72, 4775–4782 (1998).
Fraser, C. S., Hershey, J. W. & Doudna, J. A. The pathway of hepatitis C virus mRNA recruitment to the human ribosome. Nat. Struct. Mol. Biol. 16, 397–404 (2009).
Beales, L. P., Rowlands, D. J. & Holzenburg, A. The internal ribosome entry site (IRES) of hepatitis C virus visualized by electron microscopy. RNA 7, 661–670 (2007).
Brown, E. A., Zhang, H., Ping, L. H. & Lemon, S. M. Secondary structure of the 5′ nontranslated regions of hepatitis C virus and pestivirus genomic RNAs. Nucleic Acids Res. 20, 5041–5045 (1992).
Wang, C., Le, S. Y., Ali, N. & Siddiqui, A. An RNA pseudoknot is an essential structural element of the internal ribosome entry site located within the hepatitis C virus 5′ noncoding region. RNA 1, 526–537 (1995).
Kieft, J. S. et al. The hepatitis C virus internal ribosome entry site adopts an ion-dependent tertiary fold. J. Mol. Biol. 292, 513–529 (1999).
Kim, I., Lukavsky, P. J. & Puglisi, J. D. NMR study of 100 kDa HCV IRES RNA using segmental isotope labeling. J. Am. Chem. Soc. 124, 9338–9339 (2002).
Berry, K. E., Waghray, S., Mortimer, S. A., Bai, Y. & Doudna, J. A. Crystal structure of the HCV IRES central domain reveals strategy for start-codon positioning. Structure 19, 1456–1466 (2011).
Zhao, Q., Han, Q., Kissinger, C. R., Hermann, T. & Thompson, P. A. Structure of hepatitis C virus IRES subdomain IIa. Acta. Crystallogr. D Biol. Crystallogr. 64, 436–443 (2008).
Rijnbrand, R., Thiviyanathan, V., Kaluarachchi, K., Lemon, S. M. & Gorenstein, D. G. Mutational and structural analysis of stem-loop IIIC of the hepatitis C virus and GB virus B internal ribosome entry sites. J. Mol. Biol. 343, 805–817 (2004).
Collier, A. J. et al. A conserved RNA structure within the HCV IRES eIF3-binding site. Nat. Struct. Biol. 9, 375–380 (2002).
Kieft, J. S., Zhou, K., Grech, A., Jubin, R. & Doudna, J. A. Crystal structure of an RNA tertiary domain essential to HCV IRES-mediated translation initiation. Nat. Struct. Biol. 9, 370–374 (2002).
Lukavsky, P. J., Kim, I., Otto, G. A. & Puglisi, J. D. Structure of HCV IRES domain II determined by NMR. Nat. Struct. Biol. 10, 1033–1038 (2003).
Lukavsky, P. J., Otto, G. A., Lancaster, A. M., Sarnow, P. & Puglisi, J. D. Structures of two RNA domains essential for hepatitis C virus internal ribosome entry site function. Nat. Struct. Biol. 7, 1105–1110 (2000).
Davis, D. R. & Seth, P. P. Therapeutic targeting of HCV internal ribosomal entry site RNA. Antivir. Chem. Chemother. 21, 117–128 (2011).
Spahn, C. M. et al. Hepatitis C virus IRES RNA-induced changes in the conformation of the 40s ribosomal subunit. Science 291, 1959–1962 (2001).
Siridechadilok, B., Fraser, C. S., Hall, R. J., Doudna, J. A. & Nogales, E. Structural roles for human translation factor eIF3 in initiation of protein synthesis. Science 310, 1513–1515 (2005).
Boehringer, D., Thermann, R., Ostareck-Lederer, A., Lewis, J. D. & Stark, H. Structure of the hepatitis C virus IRES bound to the human 80S ribosome: remodeling of the HCV IRES. Structure 13, 1695–1706 (2005).
Rambo, R. P. & Tainer, J. A. Bridging the solution divide: comprehensive structural analyses of dynamic RNA, DNA, and protein assemblies by small-angle X-ray scattering. Curr. Opin. Struct. Biol. 20, 128–137 (2010).
Hura, G. L. et al. Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS). Nat. Methods 6, 606–612 (2009).
Putnam, C. D., Hammel, M., Hura, G. L. & Tainer, J. A. X-ray solution scattering (SAXS) combined with crystallography and computation: defining accurate macromolecular structures, conformations and assemblies in solution. Q. Rev. Biophys. 40, 191–285 (2007).
Glatter, O. & Kratky, O. Small Angle X-ray Scattering Academic Press (1982).
Rambo, R. P. & Tainer, J. A. Characterizing flexible and intrinsically unstructured biological macromolecules by SAS using the Porod-Debye law. Biopolymers 95, 559–571 (2011).
Svergun, D. I. Restoring low resolution structure of biological macromolecules from solution scattering using simulated annealing. Biophys. J. 76, 2879–2886 (1999).
Lipfert, J., Chu, V. B., Bai, Y., Herschlag, D. & Doniach, S. Low-resolution models for nucleic acids from small-angle X-ray scattering with applications to electrostatic modeling. J. Appl. Cryst. 40, S229–S234 (2007).
Kolupaeva, V. G., Pestova, T. V. & Hellen, C. U. An enzymatic footprinting analysis of the interaction of 40S ribosomal subunits with the internal ribosomal entry site of hepatitis C virus. J. Virol. 74, 6242–6250 (2000).
Hubbard, S. J., Campbell, S. F. & Thornton, J. M. Molecular recognition. Conformational analysis of limited proteolytic sites and serine proteinase protein inhibitors. J. Mol. Biol. 220, 507–530 (1991).
Melcher, S. E., Wilson, T. J. & Lilley, D. M. The dynamic nature of the four-way junction of the hepatitis C virus IRES. RNA 9, 809–820 (2003).
Amadei, A., Linssen, A. B. & Berendsen, H. J. Essential dynamics of proteins. Proteins 17, 412–425 (1993).
Kitao, A., Hirata, F. & Go, N. The effects of solvent on the conformation and collective motions of protein - normal mode analysis and molecular dynamics simulations of melittin in water and in vacuum. Chem. Phys. 158, 447–472 (1991).
Ichiye, T. & Karplus, M. Collective motions in proteins: a covariance analysis of atomic fluctuations in molecular dynamics and normal mode simulations. Proteins 11, 205–217 (1991).
Suhre, K. & Sanejouand, Y. H. ElNemo: a normal mode web server for protein movement analysis and the generation of templates for molecular replacement. Nucleic Acids Res. 32, W610–W614 (2004).
Yang, L., Song, G., Carriquiry, A. & Jernigan, R. L. Close correspondence between the motions from principal component analysis of multiple HIV-1 protease structures and elastic network modes. Structure 16, 321–330 (2008).
Bernado, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. Structural characterization of flexible proteins using small-angle X-ray scattering. J. Am. Chem. Soc. 129, 5656–5664 (2007).
Kozin, M. B. & Svergun, D. I. Automated matching of high- and low-resolution structural models. J. Appl. Cryst. 34, 33–41 (2001).
Hellen, C. U. IRES-induced conformational changes in the ribosome and the mechanism of translation initiation by internal ribosomal entry. Biochim. Biophys. Acta. 1789, 558–570 (2009).
Filbin, M. E. & Kieft, J. S. HCV IRES domain IIb affects the configuration of coding RNA in the 40S subunit's decoding groove. RNA 17, 1258–1273 (2011).
Boehr, D. D., Nussinov, R. & Wright, P. E. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 5, 789–796 (2009).
Mokrejs, M. et al. IRESite--a tool for the examination of viral and cellular internal ribosome entry sites. Nucleic Acids Res. 38, D131–D136 (2010).
Kieft, J. S. Viral IRES RNA Structures and ribosome interactions. Trends Biochem. Sci. 33, 274–283 (2008).
Pestova, T. V. & Hellen, C. U. Coupled folding during translation initiation. Cell 115, 650–652 (2003).
Sasaki, J. & Nakashima, N. Methionine-independent initiation of translation in the capsid protein of an insect RNA virus. Proc. Natl Acad. Sci. USA 97, 1512–1515 (2000).
Schuler, M. et al. Structure of the ribosome-bound cricket paralysis virus IRES RNA. Nat. Struct. Mol. Biol. 13, 1092–1096 (2006).
Costantino, D. A., Pfingsten, J. S., Rambo, R. P. & Kieft, J. S. tRNA-mRNA mimicry drives translation initiation from a viral IRES. Nat. Struct. Mol. Biol. 15, 57–64 (2008).
Pfingsten, J. S., Costantino, D. A. & Kieft, J. S. Structural basis for ribosome recruitment and manipulation by a viral IRES RNA. Science 314, 1450–1454 (2006).
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta. Crystallogr. D Biol. Crystallogr. 60, 2126–2132 (2004).
Pedretti, A., Villa, L. & Vistoli, G. VEGA-an open platform to develop chemo-bio-informatics applications, using plug-in architecture and script programming. J. Comput. Aided. Mol. Des. 18, 167–173 (2004).
Svergun, D., Barberato, C. & Koch, M. H. CRYSOL - a program to evaluate X-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl. Cryst. 28, 768–773 (1995).
Bernado, P., Mylonas, E., Petoukhov, M. V., Blackledge, M. & Svergun, D. I. Structural characterization of flexible proteins using small-angle X-ray scattering. J. Am. Chem. Soc. 129, 5656–5664 (2007).
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta. Crystallogr. D Biol. Crystallogr. 66, 12–21 (2010).
Acknowledgements
We thank Dr J. Franck for providing the electron microscopy map of the IRES-eIF3 complex and Dr G. Schoehn for his help in preparing Fig. 7. We thank Dr. A. Round and Dr. L. Zerrad for their help on ID14-3 and BM29 SAXS beamlines at the ESRF. We thank the Partnership for Structural Biology for the access to biochemical and biophysical plateforms. This work was supported by the Agence Nationale de la Recherche sur le Sida et les hepatitis virales through ANRS 06378-CSS4 – 2006 and ANRS AO 2008-2-CSS4 to E.D., J.P. and C.L. were supported by a MENRT fellowship from the French government.
Author information
Authors and Affiliations
Contributions
J.P., E.D. and M.J. designed research; J.P., C.L. and F.B. performed experiments; J.P., C.L., F.B., E.D. and M.J. analysed data; E.D. and M.J. wrote the paper.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Supplementary information
Supplementary Figures, Tables, Notes, Methods and References
Supplementary Figures S1-S8, Supplementary Tables S1-S3, Supplementary Notes 1-2, Supplementary Methods and Supplementary References (PDF 2665 kb)
Suplementary Data 1
Structural coordinates of five IRES conformers presented in Figure 6. (ZIP 557 kb)
Rights and permissions
About this article
Cite this article
Pérard, J., Leyrat, C., Baudin, F. et al. Structure of the full-length HCV IRES in solution. Nat Commun 4, 1612 (2013). https://doi.org/10.1038/ncomms2611
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/ncomms2611
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.