Archaeal translation initiation occurs within a macromolecular complex containing the small ribosomal subunit (30S) bound to mRNA, initiation factors aIF1, aIF1A and the ternary complex aIF2:GDPNP:Met-tRNAiMet. Here, we determine the cryo-EM structure of a 30S:mRNA:aIF1A:aIF2:GTP:Met-tRNAiMet complex from Pyrococcus abyssi at 3.2 Å resolution. It highlights archaeal features in ribosomal proteins and rRNA modifications. We find an aS21 protein, at the location of eS21 in eukaryotic ribosomes. Moreover, we identify an N-terminal extension of archaeal eL41 contacting the P site. We characterize 34 N4-acetylcytidines distributed throughout 16S rRNA, likely contributing to hyperthermostability. Without aIF1, the 30S head is stabilized and initiator tRNA is tightly bound to the P site. A network of interactions involving tRNA, mRNA, rRNA modified nucleotides and C-terminal tails of uS9, uS13 and uS19 is observed. Universal features and domain-specific idiosyncrasies of translation initiation are discussed in light of ribosomal structures from representatives of each domain of life.
Translation initiation universally occurs with accurate selection of the start codon that defines the reading frame on the mRNA. The mechanism involves a macromolecular complex composed of the small ribosomal subunit, the mRNA, a specialized methionylated initiator tRNA and initiation factors (IFs). Even if the process is universal, the molecular mechanisms are different in the three domains of life. In bacteria, the ribosome generally binds in the vicinity of the AUG start codon through interaction of 16S rRNA with the Shine-Dalgarno (SD) sequence on mRNA. The initiator tRNA is formylated and only three IFs, IF1–IF3 assist the start codon selection mechanism1. In eukaryotes, translation initiation is more complex with maturated mRNAs and many IFs. The canonical mechanism involves a pre-initiation complex (PIC) comprising the ternary complex eIF2:GTP:Met-tRNAiMet (TC), the two small IFs eIF1 and eIF1A, as well as eIF5 and eIF3. In the presence of factors belonging to the eIF4 family, the PIC is recruited near the 5′-capped end and scans the mRNA until an AUG codon in a correct context (Kozak motif) is found. AUG recognition stops scanning, provokes factor release and the assembly of an elongation-proficient 80S complex through large subunit joining, with the help of eIF5B and eIF1A. eIF1 and eIF1A are key factors of the scanning process favoring a POUT conformation of the TC, where tRNA is not fully accommodated in the P site. The multimeric factor eIF3 also stimulates attachment of the PIC to the mRNA and scanning. Finally, eIF5 is the guanine activating protein of eIF2 that stimulates GTP hydrolysis during scanning of the mRNA2,3,4.
In archaea, genomic analyses have shown that three IFs homologous to their eukaryotic counterparts, aIF1, aIF1A, and aIF2 are found5,6. Moreover, archaeal ribosomal proteins are either universal or shared with eukaryotes showing the proximity of the two ribosomes7,8,9. However, there is no long-range scanning because mRNAs have SD sequences or very short 5′ untranslated regions (UTR) that allow pre-positioning of the IC in the vicinity of the start codon. Despite this, a local search of the mRNA by the IC (termed as “local scanning” in ref. 10) is necessary to allow precise positioning of the start codon in the P site. Thus, even if the recruitment of the PIC on the mRNA is different in eukaryotes and archaea, start codon selection is achieved within a common structural core made up of the small ribosomal subunit, the mRNA, the methionylated initiator tRNA (Met-tRNAiMet) and the three IFs e/aIF1, e/aIF1A and e/aIF210. e/aIF2 is a specific eukaryotic and archaeal heterotrimeric protein that binds the initiator tRNA in the presence of GTP11. Previous biochemical and structural studies of a full IC from Pyrococcus abyssi (30S:mRNA:TC:aIF1:aIF1A) identified two conformations with the initiator tRNA either in a remote position (IC0-PREMOTE) or bound to the P site (IC1-PIN). This led us to propose that conformational changes of the TC may participate in start codon selection12,13. In our model (named “spring force model” in ref. 11), interaction of aIF2 with h44 of the 30S would counteract accommodation of the tRNA in the P site. However, formation of correct codon–anticodon pairing in the P site would compensate for the restoring force exerted by aIF2 on the tRNA. This would allow a longer stay of the initiator tRNA in the P site and trigger further events, including aIF1 departure because of steric hindrance and release of aIF2 in its GDP bound form. The role of aIF1-induced dynamics of the IC in the start codon selection was supported by toeprinting experiments14. In the absence of aIF1, the IC becomes more stable, as observed by a restricted toeprinting signal. However, no structural view of an archaeal IC illustrating a state following aIF1 departure has been described to date.
In the present study, we determine the cryo-EM structure of an archaeal IC (IC2, 30S:mRNA:aIF1A:aIF2:GTP:Met-tRNAiMet) from P. abyssi devoid of aIF1 at an overall resolution of 3.2 Å. Full reconstruction of an atomic model of the small ribosomal subunit highlights archaeal features in ribosomal proteins and rRNA modifications. We find a previously unidentified archaeal ribosomal protein aS21, at the location of eS21 in eukaryotic ribosomes. Moreover, a previously unobserved N-terminal extension of eL41 contacts the P site. We also identify a set of 34 N4-acetylcytidines distributed throughout the 16S rRNA. These base modifications likely participate in the hyperthermostability of this ribosome. In the absence of aIF1, the 30S head is no longer mobile and the initiator tRNA becomes stably bound to the P site. A network of interactions involving rRNA modified nucleotides and the C-terminal tails of three universal ribosomal proteins, uS9, uS13, and uS19 is observed. Universal features and domain-specific idiosyncrasies of translation initiation are discussed in light of ribosomal structures from representatives of each domain of life.
Overview of the IC2 cryo-EM structure
In order to study the impact of aIF1 departure during translation initiation, we prepared an initiation complex (IC2) without this factor. IC2 contains archaeal 30S subunits from P. abyssi (Pa-30S), Pa-aIF1A, the ternary complex (Pa-aIF2:GDPNP:Met-tRNAiMetA1-U72) and a synthetic 26 nucleotide-long mRNA. Cryo-EM images were collected on a Titan Krios microscope (Table 1). After image processing, 218 k particles were used for refinement without classification. A density map to 3.2 Å resolution was obtained, showing a very good structural homogeneity of the complex. Density subtractions of the head or the body parts of the 30S further improved map quality (Supplementary Fig. 1). The high resolution of the electron density map allowed complete reconstruction of the 30S, as described below. After density subtraction and classification in RELION15, one class showed very weak electron density for aIF1A and was therefore not further refined (IC2C, see Methods). The other classes showed two conformations of the initiation complex, named IC2A (34k particles, 4.2 Å resolution, Fig. 1a) and IC2B (142k particles, 3.3 Å resolution, Fig. 1b). The corresponding models were refined in PHENIX16 (Supplementary Figs. 1–3, Table 1 and Supplementary Table 1). In the two conformations, the initiator tRNA and the mRNA are firmly bound to the 30S. Moreover, the position of the 30S head does not detectably change as compared to IC1-PIN12. As already observed for ribosomal initiation complexes, local resolution of the electron density is higher for the ribosome core than for the peripheral IFs (Supplementary Fig. 3). As a consequence, aIF2 subunits and aIF1A were placed in the density by rigid body fitting of crystallographic structures (Table 1 and Methods). The two mobile wings of aIF2 (aIF2β core domain and aIF2α domain 1–2) are poorly defined in the two conformations and their positions were only tentatively modeled. As observed in IC0 and IC112, aIF1A is located in the A site. Its position corresponds to that observed for eukaryotic eIF1A in PIC17,18,19. aIF2 is bound to the 3′ aminoacylated end of the initiator tRNA. However, whereas domain III of aIF2γ (aIF2γDIII) contacts h44 in IC2A, the contact is lost in IC2B. Departure of aIF2γ from h44 is accompanied by a local movement of the rRNA helix (Supplementary Fig. 4). Because the non-hydrolyzable GTP analog, GDPNP, has been used in complex preparation, aIF2 is not fully released (Fig. 1).
Identification of archaeal specificities of Pa-30S
Complete building of the Pa-30S subunit was performed in the 3.2 Å resolution cryo-EM map using the medium resolution archaeal 30S structures as guides12,20 (Fig. 2a). Statistics of the refined structure and final secondary structure diagram of Pa-16S rRNA are shown in Table 1 and in Supplementary Fig. 5, respectively. Overall, the structure is similar to that of Pyrococcus furiosus 30S20. However, many additions have been rendered possible by the higher resolution of the electron density maps. Two peculiarities concerning the ribosomal proteins deserve special attention. The first one concerns archaeal eL41 (the 2014 system for naming ribosomal proteins is used throughout9). As previously noted from the yeast 80S structure21, eL41 is more strongly associated to the small ribosomal subunit than to the large one. Consistent with this observation, eL41 was indeed found in the structures of several 40S17,22. In the present archaeal structure, eL41 is also found in the small ribosomal subunit, as in12. However, in contrast with previous annotations of the genomes of P. furiosus or P. abyssi, the electron density shows that eL41 has an N-terminal extension (Fig. 2c). The protein contains 37 amino acids with an N-terminal peptide, buried in a cavity lined by h27, h44, and h45, followed by a 20 residue long helix (Fig. 2c, d and Supplementary Fig. 6a). A protein of this length has indeed been annotated as eL41 in several archaeal genomes, arguing in favor of a conservation of the N-terminal extension in this domain of life (Supplementary Fig. 6b, c). In eukaryotes, eL41 is most frequently annotated as a 25 aminoacid protein only containing the α-helix part. The functional implications related to eL41 variability will be discussed further below. Secondly, a protein found at the position corresponding to eS21 in eukaryotic ribosomes was built and identified among translation-related proteins in the annotated P. abyssi genome (accession number WP_010867153.1, Fig. 2e and Supplementary Fig. 7a). The structural topology of this protein resembles that of eS21. The two proteins superimpose with an RSMD of 2.1 Å for 47 aligned residues. However, sequence identities in the structurally superimposed regions are limited to five residues (Fig. 2e). In particular, the archaeal protein contains two zinc knuckles that are not observed in the eukaryotic version. Overall, a common ancestor for the archaeal and eukaryotic versions of S21 is rather difficult to envisage. In this sense, aS21 might be an archaeal-specific ribosomal protein. The widespread conservation of the protein in archaeal genomes argues in favor of this idea (Supplementary Data 1).
In 16S rRNA, we identified 44 rRNA modifications (Tables 2 and 3, Fig. 2a and Supplementary Figs. 8 and 9). Some of these rRNA modifications have already been classified as universally conserved and are clustered in the P site17,23,24,25,26 (Fig. 3a, b). They correspond to m3U1467 and the two dimethyladenosines m26A1487, m26A1488 (Table 4; P. abyssi numbering is used unless otherwise stated). Dimethyladenosines at the corresponding positions have been identified in Haloferax volcanii 16S rRNA27 and the KsgA/Dim1 family of enzymes responsible for the modifications is conserved throughout evolution28 with very few exceptions29. Some bacterial specific modifications are not observed but, in contrast, some modifications observed in human and S. cerevisiae are seen (Table 4). Overall, the pattern of P site modifications in the euryarchaeal ribosome described here appears closer to that of the eukaryotic ribosome than to the bacterial one. Importantly, we confirmed the presence of all rRNA modifications introduced in the model (Tables 2 and 3) using liquid chromatography/high-resolution mass spectrometry (LC-HRMS) (Supplementary Fig. 10, Supplementary Table 2 and Methods). The only exception is hm5C, tentatively modeled at position 1378 (Supplementary Fig. 8 and Table 3).
Beside rRNA modifications already characterized in other domains of life, the electron density showed the presence of many N4-acetylcytidines (ac4C; Table 2, Supplementary Figs. 5 and 9) distributed throughout the 16S rRNA. A total of 34 N4-acetylcytidines were identified. The high level of posttranscriptional modifications and the presence of N4-actetylcytidines observed here is reminiscent of previous studies concerning two hyperthermophilic crenarchaea (Sulfolobus solfataricus and Pyrodictium occultum)30,31. In order to confirm the positions of ac4C in the 16S rRNA sequence, we performed reverse transcription mapping on borohydride-reduced 16S rRNA as described32 (Fig. 2b and Supplementary Fig. 11). N4-acetylcytidine modifications systematically target the second cytosine of a 5′ CCG 3′ sequence inside or at extremities of 16S rRNA helices (Supplementary Fig. 5). Remarkably, this is also true for previously identified N4-acetylcytidines in eukaryotic 18S rRNA24,33,34. However the CCG motif was not highlighted at this time because too few modified sites were available. Ac4C have been modeled in their preferred proximal conformation as observed in the ac4C nucleoside crystal structure35 and also as calculated using quantum chemistry methods36. This conformation allows canonical Watson-Crick base pairing interactions. Moreover, the acetyl group reinforces π–π stacking with adjacent bases that increases stability of the base pair. Therefore, this modification distributed throughout the 16S rRNA might largely contribute to the hyperthermostability. Some ac4C residues also interact with ribosomal proteins. One notable interaction involves the eukaryotic-conserved ac4C1479 and R15 of eL41 (Fig. 2d). This position corresponds to ac4C1773 and ac4C1842 previously characterized in S. cerevisiae and in human ribosomes, respectively24,33,34. An archaeal orthologue of the eukaryotic Nat10/Kre33 enzyme could be responsible for these modifications. Taking into account the small RNA guided mechanism of Nat10/Kre33 modifications of 18S rRNA, we hypothesized that small RNA guides would assist Nat10/Kre33 in archaea. Small RNAs would also explain the non-systematic modification of CCG sequences. Finally, because conservation of the corresponding CCG sequences in archaeal 16S rRNA sequences is not obvious (http://www.rna.icmb.utexas.edu/), the presence of N4-acetylations in other archaea has to be studied in each case.
Structure of the mRNA
An mRNA corresponding to the natural start region of the highly expressed elongation factor aEF1A from P. abyssi12,14 was used in IC2. It contains a strong SD sequence with a spacing of 10 nucleotides to the AUG start codon (Fig. 4a and ref. 37). The SD duplex is extended to 9 nucleotides and involves the 5′AUCACCUCC 3′ sequence of the 3′-end of the 16S rRNA (Fig. 4 and Supplementary Fig. 7b). The SD helix is positioned in the exit chamber delineated by uS11, eS3, and h26 on the one side and by uS7, eS28, h28, and h37, on the other side (Fig. 4b). Interactions of uS11 with eS28 and uS7 connect the platform to the head and form the SD duplex channel. uS2 is located at the end of the chamber. Downstream from the SD duplex, the mRNA goes towards the E site. A single unpaired base provides the junction (Fig. 4a). It is stabilized by the tip of the β-hairpin of uS7, as already observed in bacteria and eukaryotes17,38,39. This part of uS7 can serve as a gate to the E site. Downstream from U−4, the mRNA makes a sharp turn and the bases of the E, P, and A codons are pre-positioned in triplets with a kink between the adjacent codons (Fig. 4 and Supplementary Fig. 7b). During translation elongation, the mRNA kink would be important for reading frame maintenance and to prevent mRNA slippage40,41. Finally, the structure of the mRNA remains identical in the two IC2 conformations.
The initiator tRNA is stably bound to the P site
In IC2A and B, the initiator tRNA is stably bound to the P site and base paired with the mRNA AUG start codon. The interactions of the initiator tRNA within the P site will hereafter be described from the IC2B structure because of its higher resolution.
A set of rRNA modifications contribute to the stabilization of the codon:anticodon duplex (Fig. 3a, b) as previously observed in bacteria25,26,42,43 and eukaryotes17,23,24. The base pair C34tRNA:G3mRNA is stabilized by two types of stacking interactions. On the one hand, C1374 (C1400 in E. coli) is stacked onto the base pair and, on the other hand, m1ψ938 (m2G966 in E. coli) is stacked onto the ribose of C34. Interestingly, a U is systematically found at the latter position in the sequences of small ribosomal subunit (SSU) rRNA of archaea and eukaryotes (http://www.rna.icmb.utexas.edu/44), whereas a G is encountered in bacteria. In eukaryotes, this uridine is hypermodified to m1acp3ψ17,24,45,46. Here, we modeled an m1ψ taking into account its presence at this position in M. jannaschii47 and the occurrence in the P. abyssi genome of an orthologue of the corresponding modification enzyme Nep146 (Supplementary Fig. 8). Even if the presence of the 3-amino-3-carboxypropyl in position 3 of the m1ψ938 has been shown in H. volcanii27,48, this chemical group is not visible in the density and no ortholog of the modification enzyme Tsr3 could be found in P. abyssi. This reflects some variability of the set of rRNA modifications within the archaeal phyla31. Notably, the correct orientation of m1ψ938 is due to stacking onto m5C939 (Fig. 3b). Methylation of C939 is further supported by the identification in the P. abyssi genome of an ortholog of the RsmB enzyme responsible for the corresponding modification in bacteria (Accession number WP_010868369, NCBI database, see also49). On the side of the codon, the ribose groups of A1 and U2 are stacked against m3U1467 (m3U1498 in E. coli). The G1 phosphate group is held in place through interaction with the m6A1469-Cm1376 pair. The codon–anticodon helix is also stabilized by two magnesium ions, one bridging the phosphate group of A37 with Cm32 and A38 and the other bridging the phosphate groups of A and U bases of the start codon. Magnesium ions were also observed at similar positions in the bacterial 70S50. A second layer of rRNA modifications made up of m62A1487, m62A1488, hm5C1378, stabilizes the first layer. Notably, the N-terminal part of eL41, embedded between h27, h44, and h45 rRNA helices, contacts several modified bases linked to the P site (m62A1488, hm5C1378, m6A1469). Moreover, ac4C1479 interacts with R15 from eL41 (Figs. 2d and 3a). This interaction, conserved in eukaryotes, also contributes to stabilize eL41 in the cavity. Finally, the interaction of the C-terminal part of eL41 with aIF2γDIII may provide a physical link between the P site and the aIF2γ:h44 contact region (Supplementary Fig. 4). In the anticodon stem, type II and type I A-minor interactions involving the GC base pairs 29–41 and 30–40 with G1312, A1313 (G1338, A1339 in E. coli) are observed. On the other side, the pocket is delineated by A757 (A790 in E. coli) from h24 loop. These interactions (Fig. 3a, b) are conserved in bacteria40,51 and eukaryotes17 and are therefore universal.
The C-terminal tails of the three universal proteins, uS9, uS13, and uS19 contact the initiator tRNA. The universally conserved C-terminal arginine of uS9 is hydrogen bonded to the phosphate groups of Cm32, U33, A35, like in eukaryotes17 and bacteria25,26,40,51. Here, the position of the C-terminal arginine uS9-R135 is further stabilized by hydrogen bonds between the carboxylate group and uS19-R124 (Supplementary Fig. 7c). We modeled the C-terminal tail of uS13 entering the major groove of the tRNA anticodon stem, with the guanidinium group of uS13-R145 facing the Hoogsteen edge of G30. The electron density for the three terminal lysine residues of uS13 is missing. Interestingly, the guanidinium group of uS13-R145 occupies a position corresponding to that of A37 in its “unstacked” conformation observed in the anticodon loop of free initiator tRNA13,52. One may imagine that during tRNA accommodation, R145-uS13 selects G30-C40 in the 3 G-C pairs major groove and facilitates the motion of A37 towards the anticodon loop where it is stacked against U36 and A38. On the same side of the tRNA molecule, just below uS13, is the uS19 C-terminal extremity. Side chains of uS19-T123 and uS19-S125 interact with the phosphate group of G30 and uS19-R124 is stacked against the phosphate backbone from tRNA-G29 to tRNA-G30 (Fig. 3b and Supplementary Fig. 7c). In uS19, a short α-helix (residues 125–129) follows R124 (Fig. 3a). Finally, faint density for the three uS19 C-terminal residues (130–132) suggests their positioning at the edge between the P and A site codons, close to W60 of aIF1A (Fig. 3a and Supplementary Fig. 7c).
In a previous study, we determined the cryo-EM structure of an archaeal 30S initiation complex containing all factors involved in start codon selection (aIF1, aIF1A, and aIF2). In the major conformation, IC0-PREMOTE, the anticodon stem–loop of the tRNA is out of the P site (Fig. 5). The ternary complex structure is similar to that of the free one, showing that it is not constrained by the ribosome. aIF2γ is bound to h44 and interacts with aIF1. In the second conformation, called IC1-PIN, the anticodon stem–loop of the initiator tRNA is bound to the P site, while the position of aIF2γ on h44 has not changed. These two positions are in equilibrium and the transition from one position to the other, accompanied by a 30S head motion, reflects the dynamics of the PIC during testing for the presence of a start codon in the P site12,14. As observed for eukaryotic PIC, stabilization of the tRNA in the P site is impaired by aIF153,54. Consistent with this idea, IC2, made in the absence of aIF1, shows a more homogeneous conformation of the tRNA in agreement with its enhanced stability suggested by toeprinting experiments14.
The IC2A and IC2B conformations were compared to IC0-PREMOTE and IC1-PIN12 by superimposing the 30S bodies of the four structures. In IC2, aIF1A is bound to the A site, though its position is less buried than that observed in IC0 and IC1. In IC2, the conformation of h44 in the vicinity of the binding sites of aIF1 and aIF1A has changed (Supplementary Fig. 12). In IC2A, the conformation of the TC resembles that seen in IC1 (Fig. 5). However, in IC2A, the initiator tRNA is slightly displaced toward the “70S P site” position26 and aIF2γ is slightly repositioned on the side of the vacant aIF1 binding site. aIF2γDIII still contacts the G1391-A1392 h44 bulge. In IC2B, TC is detached from h44 and its conformation is relaxed, close to the conformation observed in the crystal structure55 and in IC0-PREMOTE.
In eukaryotes, it was shown that full release of eIF2 is linked to the release of Pi coming from GTP hydrolysis on eIF256. In archaea, we previously described possible contacts between the N-terminal domain of aIF1 and the switch regions controlling the nucleotide state of aIF2γ12. These contacts may link aIF1 release to Pi release. Because we used GDPNP in IC2 complex preparation, aIF2 is not fully released and contacts between aIF2 and the acceptor stem of the initiator tRNA still exist. Motion of h44 close to the P site may be caused by aIF1 release (Supplementary Fig. 12), as previously proposed in eukaryotes18,19. Readjustments of the position of h44 in the bulge region could explain the relaxation of the contacts between this helix and γDIII. Both these movements and the release of contacts between aIF1 and aIF2γ observed in IC0 and IC1 would explain how aIF2 is detached from the ribosome after start codon recognition and aIF1 release. Interestingly, contacts between h44 and other translation factors were observed, such as those involving ABCE1 during recycling57. Therefore, conformational adjustments of h44 may be a general mechanism controlling binding and release of factors during the translation cycle.
In IC2A and B, the anticodon stem of the tRNA is tightly bound to the P site partly thanks to interactions with the C-terminal tails of the universal proteins uS9, uS13 and uS19. The role of uS9 C-tail in fidelity was previously shown by studies with bacterial58,59,60,61 and yeast systems62,63. Moreover, in eukaryotes, uS9 was shown to favor the recruitment of the TC on the ribosome62. The present archaeal structure suggests that uS9 tail is universally related to fidelity. The C-tail of uS13 was only modeled in a bacterial IC51 and the C-tail of uS19 was, to our knowledge, never observed in initiation complexes before this study. In the present structure, uS13 and uS19 interact with G30 of the second base pair of the almost universally conserved three GC base pairs in the initiator tRNA anticodon stem (Fig. 3a, b and Supplementary Fig. 7c). The present observations in the archaeal system agree with previous studies in E. coli showing that the central GC pair and particularly base G30 was the most crucial nucleotide for translation fidelity64,65,66.
Interestingly, several studies in yeast identified allosteric information pathways connecting functional centers in the large ribosomal subunit (LSU) to the decoding center in the SSU through the B1a and B1b/c intersubunit bridges21,67,68. In eukaryotes and archaea, uS13 participates in B1b/c bridge and uS19 is part of the B1a bridge. One can therefore imagine that some molecular information is relayed through these two proteins to facilitate LSU joining after the accommodated state of the initiator tRNA in the P site has been sensed by the C-tails of uS13 and uS19.
Around the P site, we observed a series of rRNA modifications. In bacteria, m2G966 and m5C967 have been shown to participate in fine-tuning of initiation58,69. The variations in the three domains of life of the residue corresponding to E. coli 16S rRNA-G966 and of its modifications (Table 4) are likely related to evolution of translation initiation. eL41 is connected to the network of rRNA modifications close to the P site by the bulge loop between h44 and h45. Importantly, the C-terminus of eL41 also contacts h44 and aIF2γDIII. eL41 may therefore relay some structural information from the P site to the γDIII binding site and participate in the control of aIF2 release after start codon recognition. The question of the phylogenetic conservation of eL41 is rather puzzling. As evidenced here, archaeal representatives contain 37 residues (Supplementary Fig. 6c). However, eL41 has not been identified in all archaeal genomes8. One possibility is that its identification has been hampered by the reduced size of the protein. In higher eukaryotes, eL41 has been annotated as a 25 aminoacid protein and no supplementary N domain is observed in eukaryotic cryo-EM structures17,24. Notably, in some lower eukaryotes such as Plasmodium falciparum70, eL41 is longer. In bacteria, no orthologue of eL41 has been identified and the corresponding site is vacant on the ribosome. However, a recent cryo-EM study identified a 33 aminoacid residues protein named bS22 conserved in actinobacteria located at the eL41 binding site71. Finally, in human and yeast mitochondria, the N-terminal part of the mitochondria-specific mS38 protein occupies this position72,73,74. mS38 is proposed to be related to translation of mitochondrial mRNAs lacking typical SD sequences74. Overall, the whole data indicate that variability of eL41 might be related to evolution of translation initiation.
The present structure was also compared to the structure of a Kluyveromyces lactis initiation complex in which the N-terminal domain of eIF5 is found at the location vacated by eIF1, in front of the anticodon stem of the initiator tRNA17. This Kl-PIC structure represents a step of initiation occurring after start codon recognition. The initiator tRNA is stabilized by interactions with eIF5-NTD and the N-terminal extension of eIF1A. As compared to the present structure, the initiator tRNA adopts a position slightly more tilted towards the head of the SSU. Interestingly, eIF1A is more deeply bound to the A site and its N-terminal extension occupies the position observed here for the C-terminal tail of uS19. Moreover in Kl-PIC, the C-terminal tail of uS13 is not visible (Fig. 3c, d). According to what is observed in IC2, a step following the one illustrated by the Kl-PIC structure17 may be a repositioning of eIF1A and a relocation of the C-terminal tails of uS13 and uS19 to further stabilize the initiator tRNA in its accommodated state preceding eIF5B loading.
In bacteria and in some archaea, it has been demonstrated that the SD sequence played an important role in the formation of the IC by base-pairing with the anti-SD sequence at the 3′ end of 16S rRNA37,75,76,77. In eukaryotes however, canonical translation involves scanning of the PIC searching for AUG in an optimal context78,79. The current structure highlights similarities and differences in the exit mRNA channel in the three domains of life (Fig. 4). Overall, the euryarchaeal exit tunnel observed here is of the eukaryotic type (Fig. 4b, d). uS11, eS3, and h26 are located on one side of the cavity whereas uS7, eS28, h37, and h28 are on the opposite side. Two notable differences are, however, observed. First, in eukaryotes, eS17 has a long C-terminal extension contacting the mRNA. Second, eS26 stabilizes the 3′ end of the mRNA17. Interestingly, eS26 was proposed to be involved in recognition of Kozak sequence elements80. Moreover, IFs are involved in stabilization of the mRNA in the exit channel in eukaryotes. Indeed, Kozak consensus nucleotides are recognized in the E site by domain 1 of eIF2α. In addition, eIF3a subunit would also stabilize the mRNA at the exit channel pore17. These differences illustrate how eukaryotic and euryarchaeal ribosomes evolved different binding modes of the mRNA in the exit pocket, in relation with the canonical eukaryotic scanning mode vs. the SD-assisted AUG recognition mode occurring in many genes in the archaeal domain. In this view, it is notable that eS26 is absent in euryarchaotes but present in crenarchaeota/lokiarchaeota genomes7,81. Because the archaeal version of the exit chamber is a simplified version of the eukaryotic one, this argues in favor of the controversial hypothesis that eukaryotic ribosomes have evolved from within the archaeal version10,82.
When compared to all available bacterial ribosomal complexes, the present position of the SD duplex in the chamber corresponds to the down position observed in83 where an mRNA designed to allow a “free choice” of the start codon, with a 12 nucleotide spacing, was used (Fig. 4 b, c) or to the SDin (“stand-by”) position defined in ref. 84 where an mRNA GAAAGA lacking the upstream region was used. In contrast, when the classical model mRNA, based on the phage T4 gene 32 mRNA, chosen for its high stability, was used, the SD duplex adopted the “up” tense position in the chamber38,41,51 (Fig. 4b, d and Supplementary Table 3). Notably, this model mRNA has a spacing of seven nucleotides instead of ten in the mRNA used in this study. Hence, the structure of the mRNA observed here would represent a relaxed state favorable to translation initiation efficiency, as expected for an abundantly translated mRNA such as the aEF1A one. The distance between the chamber and the start codon may act as a ruler leading to translation initiation regulation according to the spacing between the SD and the start codon. Comparison of the archaeal exit channel with the bacterial one shows that bS6 and bS18 are found in place of eS3. eS28 is absent in bacteria and uS2 possesses a supplementary C-terminal domain, at the location of eS17. Therefore, archaeal and bacterial exit channels appear as two structural solutions for binding the SD duplex. The spacing between the AUG codon and the SD sequence changes the position of the duplex in the chamber, probably explaining how it influences translation initiation efficiency.
The present study of an archaeal initiation complex fills a gap in high-resolution structures of ribosomes representative of the three domains of life. It provides new structural information useful in support of evolution models based on sequence and experimental data. Universal elements located in the SSU common core at the decoding center ensure stabilization of universal features of the initiator tRNA. The mRNA exit chamber that does not make part of the SSU core85 indeed shows evolution related to domain-specificities for mRNA binding. On the other hand, the present study also demonstrates the occurrence of a large number of N4-acetylcytidines in the rRNA of a hyperthermophilic organism. N4-acetylation of cytidines is of increasing interest since its discovery in mRNA coding sequences where it promotes stability and translation efficiency86,87. Our study shows that the targeted sequence in the 16S rRNA from P. abyssi is systematically Cac4CG. This rule also appears to apply for the ac4C identified to date in 18S rRNA. In this view, our findings also bring information important to understand a possible general mechanism of ac4C modification.
IC2 complex preparation and cryo-EM analysis
The strategy used to prepare the IC2 complex was adapted from a previously described one12. In brief, archaeal 30S subunits from P. abyssi (Pa-30S) were purified and mixed with IFs Pa-aIF1A, the ternary complex Pa-aIF2:GDPNP:Met-tRNAiMetA1-U72 and a synthetic 26 nucleotide-long mRNA corresponding to the natural start region of the mRNA encoding the elongation factor aEF1A from P. abyssi (A(−17)UUUGGAGGUGAUUUAAA(+1)UGCCAAAG(+9))13. The complex was purified by affinity chromatography (TALON, Clontech) using N-terminally tagged versions of Pa-aIF2β and Pa-aIF2α. An excess of mRNA, aIF1A and TC was added before dilution and spotting onto Quantifoil R2/2 grids with an extra 3 nm carbon layer (Quantifoil, Inc) for cryo-EM analysis. Cryo-EM images were collected on an FEI Titan Krios microscope operated at 300 kV at the eBIC center, Diamond Light Source, England (Table 1).
Image processing was performed with RELION 2.188. A total of 2174 images was selected for the autopicking tool. Several rounds of 2D classification and an initial 3D refinement with 330 K particles gave a 3.3 Å resolution density map. While the density for the 30S subunit showed high-resolution details, the density allocated to aIF2 and aIF1A was less resolved. Several rounds of 3D classification, 3D refinement and particle polishing gave an intermediate density at 3.2 Å resolution with 218 K particles but the bound factors remained poorly resolved. Density subtraction of different parts of the complex gave final maps of the 30S head alone, the 30S head with associated factors and 30S body alone at 3.1, 3.7, and 3.2 Å resolution, respectively (Supplementary Fig. 1). Further 3D classification on the aIF2 binding region gave 7.2, 3.4, and 5.6 Å resolution. The three classes obtained correspond to different positions of aIF2 relative to the 30S and were called IC2A, IC2B, and IC2C (Supplementary Fig. 1). Because very weak electron density for aIF1A was observed in IC2C, this class was not considered as representative of an IC2 conformation and was therefore not further refined. Finally, the particles belonging to the IC2A and IC2B classes were re-extracted from raw images and reprocessed to obtain final density maps for IC2A and IC2B complexes at 4.2 Å (34 K particles) and 3.3 Å resolution (142 K particles), respectively, according to the 0.143 FSC gold-standard criterion (Supplementary Fig. 2). Local resolution estimation was carried out using the program RESMAP89.
30S model building
A full atomic model of the P. abyssi 30S has been built using an iterative approach including sequence alignment with RNA and proteins of known structure (PDBs: 4V6U20, 5JB312, Table 1) rigid body fitting in CHIMERA90, real space refinement and geometry regularization in COOT91 combined with real space refinement in PHENIX16. All model components (rRNA, ribosomal proteins) were first fitted in the map densities of the 30S head or body alone with simultaneous optimization of stereochemical properties and correction of intra- and intermolecular steric clashes. Secondary structure restraints for further refinement were determined directly from the model using the web interface of 3DNA-DSSR92 and recalculated at each round of refinement. For the three RNAs present in the models, hydrogen-bonding, base-pair and stacking restraints were applied. Adequate restraint files were constructed for modified nucleotides. The high-resolution limit was set during refinement to match the nominal resolution obtained by postprocessing in RELION. The initial structure obtained was then subjected to rigid body refinement as implemented in phenix.real-space-refine in the IC2B density map and further refined. The crystallographic structure of aIF1A from P. abyssi (PDB: 4MNO12) was manually rigid-body fitted in the cryo-EM map with COOT. Moreover, because the quality of the electron density of aIF2 did not allow side-chain fitting, we used the crystallographic structure of aIF2 from S. solfataricus (PDBs: 4RD493, 3V1155, 2QMU94) to perform rigid body fitting even if aIF2 from P. abyssi was used during complex preparation. The model of the initiator tRNA was fully reconstructed. The anticodon stem–loop region was very well defined. In contrast, according to the higher mobility of the solvent side of the aIF2:tRNA complex, the electron density of the acceptor helix was weaker. The same strategy was applied to account for the electron density of IFs in the IC2A density map. The full IC2A and IC2B models were then subjected to cycles of systematic inspection and manual corrections using COOT followed by refinement in PHENIX. Finally, the entire structure was validated using MOLPROBITY95 as implemented in PHENIX16 and correlation coefficients were calculated (Supplementary Table 1). Various programs from CCP-EM96 were also used throughout the study. The final refinement statistics are provided in Table 1 and in Supplementary Table 1.
rRNA modifications and ribosomal proteins
The full atomic model of the P. abyssi 30S allowed us to identify the N-terminal extension of eL41 and an archaeal version of eS21, as described in the Results section. rRNA modifications visible in the cryo-EM maps have been modeled taking into account information coming from high-resolution cryo-EM and X-ray structures24,25,26, RNA modifications databases (http://mods.rna.albany.edu/, http://modomics.genesilico.pl/), and literature. The presence of the rRNA modifications was then verified using mass spectrometry as well as primer extension analysis for N4-acetylcytidines. A total of 44 rRNA modifications have been identified including 34 N4-acetylcytidines (Tables 2 and 3).
Primer extension analysis of N4-acetylcytidines
16S rRNA was prepared from purified P. abyssi 30S subunits12 using a standard phenol–ether extraction protocol. 16S rRNA reduction was performed as described32. Totally, 3 µg of rRNA was incubated with either sodium borohydride (100 mM in H2O) or water (control sample) in a final reaction volume of 50 µL. Samples were incubated 60 min at 37 °C, quenched with 7.5 µL of 1 M HCl and neutralized with 7.5 µL of 1 M Tris-HCl pH 8.0. Reactions were then ethanol precipitated and rRNA was resuspended in water at a final concentration of 140 ng/µL. Reverse transcription reactions were then performed as follows: 0.28 picomoles of either reduced, nonreduced (control sample) or intact 16S rRNA (140 ng) were mixed to 4 picomoles of 5′ fluorescently labeled primer (see Supplementary Table 4), 1 unit RNAse OUT inhibitor (Invitrogen) and 1 µL of 10× annealing buffer (600 mM NH4Cl, 100 mM Hepes pH 7.5, 70 mM 2-mercaptoethanol) in a final volume of 10 µL. Reactions were then heated 5 min at 65 °C and cooled directly on ice. After annealing, 1.3 µL of 100 mM Mg Acetate and 0.75 µL of dNTP (3.75 mM each) were added and reverse transcription was started by adding 1 µL of AMV reverse transcriptase (3 U/µL, Promega). After 15 min incubation at 51 °C, the reaction was stopped with 3 µL of a solution containing 95% formamide and blue dextran. Reverse transcripts were analyzed on a Licor 4200 DNA sequencer. Sequencing reactions made from a PCR amplified sample of the 16S rRNA and the RT primer were loaded in order to identify the stop positions. Typical experiments are shown in Fig. 2b and in Supplementary Fig. 11.
Mass spectrometry analysis of hydrolyzed 16S rRNA
Fully hydrolyzed 16S rRNA suitable for mass spectrometry analysis was prepared as described97. Samples were then diluted in 0.1% formic acid (FA) prior to analysis. Chromatographic grade solvents (99.99% purity), acetonitrile (MeOH) and FA, were purchased from Sigma Aldrich. LC-HRMS analyses were performed with the timsTOF mass spectrometer coupled with an Elute HPLC system (Bruker Daltonics, Bremen, Germany). The sample (10 µL, 2.5 µg digested 16S rRNA) was injected and separated on an Atlantis T3 column (3 μm, 150 × 2.1 mm; Waters, Saint Quentin, France). The effluent was introduced at a flow rate of 0.2 mL min−1 into the interface with a gradient increasing from 10% of solvent B to 50% in 6 min to achieve 70% at 8 min (A: water with 0.1% FA; B: methanol with 0.1% FA). From 8 min to 12 min, the percentage of solvent increased up to 90% of B. The flow was then set at 10% of B for the last 6 min. Electrospray ionization was operated in the positive ion mode. Capillary and end plate voltages were set at −4.5 and −0.5 kV, respectively. Nitrogen was used as the nebulizer and drying gas at 2 bar and 8 L/min, respectively, with a drying temperature of 220 °C. In MS/MS experiments, the precursor ion was selected with an isolation window of 1 Da and the collision-induced dissociation was performed using collision energies (Ecol) ranging from 7 to 25 eV. Tuning mix (Agilent, France) was used for calibration. The elemental compositions of all ions were determined with the instrument software Data analysis, the precision of mass measurement was better than 3 ppm. All nucleosides have been characterized by both molecular (MH+) and fragment ions (BH2+) (Supplementary Table 2). The only exception was ac4Cm for which only the MH+ ion was identified.
Statistics and reproducibility
The cryo-EM data reported here come from a single experiment and a single grid was used for data collection. Individual images with either bad ice, as shown by visual inspection, or too much motion or astigmatism, as shown by power spectra, were excluded from the dataset. Data collection, processing and refinement statistics (see Table 1) were calculated in RELION88, RESMAP89, and PHENIX16.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The EMDataBank accession numbers for the EM maps reported in this paper are EMD-10320 (IC2A), EMD-10322 (IC2B), EMD-10323 (IC2 body), and EMD-10324 (IC2 head). The coordinates of the models fitted in the maps have been deposited in the Protein Data Bank (PDB: 6SW9, IC2A, PDB: 6SWC, IC2B, PDB: 6SWD, IC2 body, PDB: 6SWE, IC2 head).
Rodnina, M. V. Translation in prokaryotes. Cold Spring Harb. Perspect. Biol. 10, a032664 (2018).
Hershey, J. W., Sonenberg, N. & Mathews, M. B. Principles of translational control: an overview. Cold Spring Harb. Perspect. Biol. 4, a011528 (2012).
Hinnebusch, A. G. & Lorsch, J. R. The mechanism of eukaryotic translation initiation: new insights and challenges. Cold Spring Harb. Perspect. Biol. 4, a011544 (2012).
Pestova, T. V. et al. Molecular mechanisms of translation initiation in eukaryotes. Proc. Natl Acad. Sci. USA 98, 7029–7036 (2001).
Kyrpides, N. C. & Woese, C. R. Universally conserved translation initiation factors. Proc. Natl Acad. Sci. USA 95, 224–228 (1998).
Kyrpides, N. C. & Woese, C. R. Archaeal translation initiation revisited: the initiation factor 2 and eukaryotic initiation factor 2B alpha-beta-delta subunit families. Proc. Natl Acad. Sci. USA 95, 3726–3730 (1998).
Lecompte, O., Ripp, R., Thierry, J. C., Moras, D. & Poch, O. Comparative analysis of ribosomal proteins in complete genomes: an example of reductive evolution at the domain scale. Nucleic Acids Res. 30, 5382–5390 (2002).
Yutin, N., Puigbo, P., Koonin, E. V. & Wolf, Y. I. Phylogenomics of prokaryotic ribosomal proteins. PLoS One 7, e36972 (2012).
Ban, N. et al. A new system for naming ribosomal proteins. Curr. Opin. Struct. Biol. 24, 165–169 (2014).
Schmitt, E., Coureux, P. D., Monestier, A., Dubiez, E. & Mechulam, Y. Start codon recognition in eukaryotic and archaeal translation initiation: a common structural core. Int. J. Mol. Sci. 20, 939 (2019).
Schmitt, E., Naveau, M. & Mechulam, Y. Eukaryotic and archaeal translation initiation factor 2: a heterotrimeric tRNA carrier. FEBS Lett. 584, 405–412 (2010).
Coureux, P. D. et al. Cryo-EM study of start codon selection during archaeal translation initiation. Nat. Commun. 7, 13366 (2016).
Monestier, A. et al. The structure of an E. coli tRNAfMet A1-U72 variant shows an unusual conformation of the A1-U72 base pair. RNA 23, 673–682 (2017).
Monestier, A., Lazennec-Schurdevin, C., Coureux, P. D., Mechulam, Y. & Schmitt, E. Role of aIF1 in Pyrococcus abyssi translation initiation. Nucleic Acids Res. 46, 11061–11074 (2018).
Zivanov, J. et al. New tools for automated high-resolution cryo-EM structure determination in RELION-3. eLife 7, e42166 (2018).
Afonine, P. V. et al. Towards automated crystallographic structure refinement with phenix.refine. Acta Crystallogr D Biol. Crystallogr. 68, 352–367 (2012).
Llacer, J. L. et al. Translational initiation factor eIF5 replaces eIF1 on the 40S ribosomal subunit to promote start-codon recognition. Elife 7, e39273 (2018).
Lomakin, I. B. & Steitz, T. A. The initiation of mammalian protein synthesis and mRNA scanning mechanism. Nature 500, 307–311 (2013).
Weisser, M., Voigts-Hoffmann, F., Rabl, J., Leibundgut, M. & Ban, N. The crystal structure of the eukaryotic 40S ribosomal subunit in complex with eIF1 and eIF1A. Nat. Struct. Mol. Biol. 20, 1015–1017 (2013).
Armache, J. P. et al. Promiscuous behaviour of archaeal ribosomal proteins: implications for eukaryotic ribosome evolution. Nucleic Acids Res. 41, 1284–1293 (2013).
Ben-Shem, A. et al. The structure of the eukaryotic ribosome at 3.0 A resolution. Science 334, 1524–1529 (2011).
Acosta-Reyes, F., Neupane, R., Frank, J. A. Ohoo & Fernandez, I. S. A. Ohoo The Israeli acute paralysis virus IRES captures host ribosomes by mimicking a ribosomal state with hybrid tRNAs. EMBO J. 38, e102226 (2019).
Sloan, K. E. et al. Tuning the ribosome: The influence of rRNA modification on eukaryotic ribosome biogenesis and function. RNA Biol. 14, 1138–1152 (2017).
Natchiar, S. K., Myasnikov, A. G., Kratzat, H., Hazemann, I. & Klaholz, B. P. Visualization of chemical modifications in the human 80S ribosome structure. Nature 551, 472–477 (2017).
Fischer, N. et al. Structure of the E. coli ribosome-EF-Tu complex at <3 A resolution by Cs-corrected cryo-EM. Nature 520, 567–570 (2015).
Polikanov, Y. S., Melnikov, S. V., Soll, D. & Steitz, T. A. Structural insights into the role of rRNA modifications in protein synthesis and ribosome assembly. Nat. Struct. Mol. Biol. 22, 342–344 (2015).
Kowalak, J. A., Bruenger, E., Crain, P. F. & McCloskey, J. A. Identities and phylogenetic comparisons of posttranscriptional modifications in 16 S ribosomal RNA from Haloferax volcanii. J. Biol. Chem. 275, 24484–24489 (2000).
O’Farrell, H. C., Pulicherla, N., Desai, P. M. & Rife, J. P. Recognition of a complex substrate by the KsgA/Dim1 family of enzymes has been conserved throughout evolution. RNA 12, 725–733 (2006).
Seistrup, K. H. et al. Bypassing rRNA methylation by RsmA/Dim1during ribosome maturation in the hyperthermophilic archaeon Nanoarchaeum equitans. Nucleic Acids Res. 45, 2007–2015 (2017).
Bruenger, E. et al. 5S rRNA modification in the hyperthermophilic archaea Sulfolobus solfataricus and Pyrodictium occultum. FASEB J. 1, 196–200 (1993).
Noon, K. R., Bruenger, E. & McCloskey, J. A. Posttranscriptional modifications in 16S and 23S rRNAs of the archaeal hyperthermophile Sulfolobus solfataricus. J. Bacteriol. 180, 2883–2888 (1998).
Thomas, J. M. et al. A chemical signature for cytidine acetylation in RNA. J. Am. Chem. Soc. 140, 12667–12670 (2018).
Sharma, S. et al. Specialized box C/D snoRNPs act as antisense guides to target RNA base acetylation. PLoS Genet. 13, e1006804 (2017).
Sharma, S. et al. Yeast Kre33 and human NAT10 are conserved 18S rRNA cytosine acetyltransferases that modify tRNAs assisted by the adaptor Tan1/THUMPD1. Nucleic Acids Res. 43, 2242–2258 (2015).
Parthasarathy, R., Ginell, S. L., De, N. C. & Chheda, G. B. Conformation of N4-acetylcytidine, a modified nucleoside of tRNA, and stereochemistry of codon-anticodon interaction. Biochem. Biophys. Res. Commun. 83, 657–663 (1978).
Kumbhar, B. V., Kamble, A. D. & Sonawane, K. D. Conformational preferences of modified nucleoside N(4)-acetylcytidine, ac4C occur at “wobble” 34th position in the anticodon loop of tRNA. Cell Biochem. Biophys. 66, 797–816 (2013).
Ma, J., Campbell, A. & Karlin, S. Correlations between Shine-Dalgarno sequences and gene features such as predicted expression levels and operon structures. J. Bacteriol. 184, 5733–5745 (2002).
Yusupova, G. Z., Yusupov, M. M., Cate, J. H. & Noller, H. F. The path of messenger RNA through the ribosome. Cell 106, 233–241 (2001).
Visweswaraiah, J. & Hinnebusch, A. G. Interface between 40S exit channel protein uS7/Rps5 and eIF2alpha modulates start codon recognition in vivo. Elife 6, e22572 (2017).
Selmer, M. et al. Structure of the 70S ribosome complexed with mRNA and tRNA. Science 313, 1935–1942 (2006).
Jenner, L. B., Demeshkina, N., Yusupova, G. & Yusupov, M. Structural aspects of messenger RNA reading frame maintenance by the ribosome. Nat. Struct. Mol. Biol. 17, 555–560 (2010).
Das, G. et al. Role of 16S ribosomal RNA methylations in translation initiation in Escherichia coli. EMBO J. 27, 840–851 (2008).
Schuwirth, B. S. et al. Structural analysis of kasugamycin inhibition of translation. Nat. Struct. Mol. Biol. 13, 879–886 (2006).
Cannone, J. J. et al. R3D-2-MSA: the RNA 3D structure-to-multiple sequence alignment server. Nucleic Acids Res. 43, W15–W23 (2015).
Maden, B. E., Forbes, J., de Jonge, P. & Klootwijk, J. Presence of a hypermodified nucleotide in HeLa cell 18 S and Saccharomyces carlsbergensis 17 S ribosomal RNAs. FEBS Lett. 59, 60–63 (1975).
Meyer, B. et al. The Bowen-Conradi syndrome protein Nep1 (Emg1) has a dual role in eukaryotic ribosome biogenesis, as an essential assembly factor and in the methylation of Psi1191 in yeast 18S rRNA. Nucleic Acids Res. 39, 1526–1537 (2011).
Jones, W. J., Leigh, J. A., Mayer, F., Woese, C. R. & Wolfe, R. S. Methanococcus jannaschii sp. nov., an extremely thermophilic methanogen from a submarine hydrothermal vent. Arch. Microbiol. 136, 254–261 (1983).
Grosjean, H., Gaspin, C., Marck, C., Decatur, W. A. & de Crécy-Lagard, V. RNomics and modomics in the halophilic archaea Haloferax volcanii: identification of RNA modification genes. BMC Genomics 9, 470 (2008).
Hikida, Y., Kuratani, M., Bessho, Y., Sekine, S. I. & Yokoyama, S. Structure of an archaeal homologue of the bacterial Fmu/RsmB/RrmB rRNA cytosine 5-methyltransferase. Acta Crystallogr. D Biol. Crystallogr. 66, 1301–1307 (2010).
Rozov, A. et al. Importance of potassium ions for ribosome structure and function revealed by long-wavelength X-ray diffraction. Nat. Commun. 10, 2519 (2019).
Hussain, T., Llacer, J. L., Wimberly, B. T., Kieft, J. S. & Ramakrishnan, V. Large-scale movements of IF3 and tRNA during bacterial translation initiation. Cell 167, 133–144 (2016).
Barraud, P., Schmitt, E., Mechulam, Y., Dardel, F. & Tisne, C. A unique conformation of the anticodon stem-loop is associated with the capacity of tRNAfMet to initiate protein synthesis. Nucleic Acids Res. 36, 4894–4901 (2008).
Hussain, T. et al. Structural changes enable start codon recognition by the eukaryotic translation initiation complex. Cell 159, 597–607 (2014).
Llacer, J. L. et al. Conformational differences between open and closed states of the eukaryotic translation initiation complex. Mol. Cell 59, 399–412 (2015).
Schmitt, E. et al. Structure of the ternary initiation complex aIF2-GDPNP-methionylated initiator tRNA. Nat. Struct. Mol. Biol. 19, 450–454 (2012).
Algire, M. A., Maag, D. & Lorsch, J. R. Pi release from eIF2, not GTP hydrolysis, is the step controlled by start-site selection during eukaryotic translation initiation. Mol. Cell 20, 251–262 (2005).
Gerovac, M. & Tampe, R. Control of mRNA translation by versatile ATP-driven machines. Trends Biochem. Sci. 2, 167–180 (2019).
Arora, S. et al. Distinctive contributions of the ribosomal P-site elements m2G966, m5C967 and the C-terminal tail of the S9 protein in the fidelity of initiation of translation in Escherichia coli. Nucleic Acids Res. 41, 4963–4975 (2013).
Arora, S., Bhamidimarri, S. P., Weber, M. H. & Varshney, U. Role of the ribosomal P-site elements of m(2)G966, m(5)C967, and the S9 C-terminal tail in maintenance of the reading frame during translational elongation in Escherichia coli. J. Bacteriol. 195, 3524–3530 (2013).
Hoang, L., Fredrick, K. & Noller, H. F. Creating ribosomes with an all-RNA 30S subunit P site. Proc. Natl Acad. Sci. USA 101, 12439–12443 (2004).
Noller, H. F., Hoang, L. & Fredrick, K. The 30S ribosomal P site: a function of 16S rRNA. FEBS Lett. 579, 855–858 (2005).
Jindal, S., Ghosh, A., Ismail, A., Singh, N. & Komar, A. A. Role of the uS9/yS16 C-terminal tail in translation initiation and elongation in Saccharomyces cerevisiae. Nucleic Acids Res. 47, 806–823 (2019).
Ghosh, A., Jindal, S., Bentley, A. A., Hinnebusch, A. G. & Komar, A. A. Rps5-Rps16 communication is essential for efficient translation initiation in yeast S. cerevisiae. Nucleic Acids Res. 42, 8537–8555 (2014).
Shetty, S., Shah, R. A., Chembazhi, U. V., Sah, S. & Varshney, U. Two highly conserved features of bacterial initiator tRNAs license them to pass through distinct checkpoints in translation initiation. Nucleic Acids Res. 45, 2040–2050 (2017).
Ayyub, S. A. et al. Coevolution of the translational machinery optimizes initiation with unusual initiator tRNAs and initiation codons in mycoplasmas. RNA Biol. 15, 70–80 (2018).
Samhita, L., Shetty, S. & Varshney, U. Unconventional initiator tRNAs sustain Escherichia coli. Proc. Natl Acad. Sci. USA 109, 13058–13063 (2012).
Bowen, A. M. et al. Ribosomal protein uS19 mutants reveal its role in coordinating ribosome structure and function. Translation 3, e1117703 (2015).
Rhodin, M. H. & Dinman, J. D. An extensive network of information flow through the B1b/c intersubunit bridge of the yeast ribosome. PLoS One 6, e20048 (2011).
Burakovsky, D. E. et al. Impact of methylations of m2G966/m5C967 in 16S rRNA on bacterial fitness and translation initiation. Nucleic Acids Res. 40, 7885–7895 (2012).
Wong, W. et al. Cryo-EM structure of the Plasmodium falciparum 80S ribosome bound to the anti-protozoan drug emetine. Elife 3, e03080 (2014).
Hentschel, J. et al. The complete structure of the Mycobacterium smegmatis 70S ribosome. Cell Rep. 20, 149–160 (2017).
Amunts, A., Brown, A., Toots, J., Scheres, S. H. W. & Ramakrishnan, V. Ribosome. The structure of the human mitochondrial ribosome. Science 348, 95–98 (2015).
Greber, B. J. et al. Ribosome. The complete structure of the 55S mammalian mitochondrial ribosome. Science 348, 303–308 (2015).
Mays, J. N. et al. The mitoribosome-specific protein mS38 is preferentially required for synthesis of cytochrome c oxidase subunits. Nucleic Acids Res. 47, 5746–5760 (2019).
Condo, I., Ciammaruconi, A., Benelli, D., Ruggero, D. & Londei, P. Cis-acting signals controlling translational initiation in the thermophilic archaeon Sulfolobus solfataricus. Mol. Microbiol. 34, 377–384 (1999).
Osada, Y., Saito, R. & Tomita, M. Analysis of base-pairing potentials between 16S rRNA and 5’ UTR for translation initiation in various prokaryotes. Bioinformatics 15, 578–581 (1999).
Saito, R. & Tomita, M. Computer analyses of complete genomes suggest that some archaebacteria employ both eukaryotic and eubacterial mechanisms in translation initiation. Gene 238, 79–83 (1999).
Kozak, M. Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell 44, 283–292 (1986).
Archer, S. K., Shirokikh, N. E., Beilharz, T. H. & Preiss, T. Dynamics of ribosome scanning and recycling revealed by translation complex profiling. Nature 535, 570–574 (2016).
Ferretti, M. B., Ghalei, H., Ward, E. A., Potts, E. L. & Karbstein, K. Rps26 directs mRNA-specific translation by recognition of Kozak sequence elements. Nat. Struct. Mol. Biol. 24, 700–707 (2017).
Schutz, S. et al. Molecular basis for disassembly of an importin:ribosomal protein complex by the escortin Tsr2. Nat. Commun. 9, 3669 (2018).
Zaremba-Niedzwiedzka, K. et al. Asgard archaea illuminate the origin of eukaryotic cellular complexity. Nature 541, 353–358 (2017).
Yusupova, G., Jenner, L., Rees, B., Moras, D. & Yusupov, M. Structural basis for messenger RNA movement on the ribosome. Nature 444, 391–394 (2006).
Kaminishi, T. et al. A snapshot of the 30S ribosomal subunit capturing mRNA via the Shine-Dalgarno interaction. Structure 15, 289–297 (2007).
Bernier, C. R., Petrov, A. S., Kovacs, N. A., Penev, P. I. & Williams, L. D. Translation: the universal structural core of life. Mol. Biol. Evol. 35, 2065–2076 (2018).
Arango, D. et al. Acetylation of cytidine in mRNA promotes translation efficiency. Cell 175, 1872–1886 (2018).
Dominissini, D. & Rechavi, G. N(4)-acetylation of cytidine in mRNA by NAT10 regulates stability and translation. Cell 175, 1725–1727 (2018).
Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Kucukelbir, A., Sigworth, F. J. & Tagare, H. D. Quantifying the local resolution of cryo-EM density maps. Nat. Methods 11, 63 (2013).
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. Features and development of Coot. Acta Crystallogr. D66, 486–501 (2010).
Zheng, G., Lu, X.-J. & Olson, W. Web 3DNA—a web server for the analysis, reconstruction, and visualization of three-dimensional nucleic-acid structures. Nucleic Acids Res 37, W240–W246 (2009).
Dubiez, E., Aleksandrov, A., Lazennec-Schurdevin, C., Mechulam, Y. & Schmitt, E. Identification of a second GTP-bound magnesium ion in archaeal initiation factor 2. Nucleic Acids Res. 43, 2946–2957 (2015).
Yatime, L., Mechulam, Y., Blanquet, S. & Schmitt, E. Structure of an archaeal heterotrimeric initiation factor 2 reveals a nucleotide state between the GTP and the GDP states. Proc. Natl Acad. Sci. USA 104, 18445–18450 (2007).
Chen, V. B. et al. MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D Biol. Crystallogr. 66, 12–21 (2010).
Burnley, T., Palmer, C. M. & Winn, M. Recent developments in the CCP-EM software suite. Acta Crystallogr. D Biol. Crystallogr. 73, 469–477 (2017).
Crain, P. F. Preparation and enzymatic hydrolysis of DNA and RNA for mass spectrometry. Methods Enzymol. 193, 782–790 (1990).
Schrodinger, L. The PyMOL Molecular Graphics System, Version 2.0.0. (2017).
Taoka, M. et al. The complete chemical structure of Saccharomyces cerevisiae rRNA: partial pseudouridylation of U2345 in 25S rRNA by snoRNA snR9. Nucleic Acids Res. 44, 8951–8961 (2016).
This work was supported by grants from the Centre National de la Recherche Scientifique and Ecole polytechnique to Unité Mixte de Recherche n°7654 and by a grant from the Agence Nationale de la Recherche (ANR-17-CE11–0037). We thank Alistair Siebert for data collection at the electron biological cryo-imaging facility (eBIC, Diamond Light Source, UK). We thank Thomas Gaillard for helpful discussions.
The authors declare no competing interests.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Coureux, PD., Lazennec-Schurdevin, C., Bourcier, S. et al. Cryo-EM study of an archaeal 30S initiation complex gives insights into evolution of translation initiation. Commun Biol 3, 58 (2020). https://doi.org/10.1038/s42003-020-0780-0