Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Efficient genetic code expansion without host genome modifications

Abstract

Supplementing translation with noncanonical amino acids (ncAAs) can yield protein sequences with new-to-nature functions but existing ncAA incorporation strategies suffer from low efficiency and context dependence. We uncover codon usage as a previously unrecognized contributor to efficient genetic code expansion using non-native codons. Relying only on conventional Escherichia coli strains with native ribosomes, we develop a plasmid-based codon compression strategy that minimizes context dependence and improves ncAA incorporation at quadruplet codons. We confirm that this strategy is compatible with all known genetic code expansion resources, which allowed us to identify 12 mutually orthogonal transfer RNA (tRNA)–synthetase pairs. Enabled by these findings, we evolved and optimized five tRNA–synthetase pairs to incorporate a broad repertoire of ncAAs at orthogonal quadruplet codons. Lastly, we extend these resources to an in vivo biosynthesis platform that can readily create >100 new-to-nature peptide macrocycles bearing up to three unique ncAAs. Our approach will accelerate innovations in multiplexed genetic code expansion and the discovery of chemically diverse biomolecules.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Local and remote codons influence apparent quadruplet decoding.
Fig. 2: Twelve mutually orthogonal tRNA–synthetase pairs and validation in recoded circuits.
Fig. 3: Directed evolution of highly efficient qtRNAs for ncAA incorporation.
Fig. 4: Validation of synthetases with broad, nonoverlapping ncAA substrate scope.
Fig. 5: Intracellular biosynthesis of ncAA-encoding peptide macrocycles through optimized quadruplet decoding.
Fig. 6: Biosynthesis of peptide macrocycles encoding multiple ncAAs.

Similar content being viewed by others

Data availability

All data supporting the findings of this study are available within the article and its Supplementary Information. Select representative plasmids and strains were deposited to Addgene. NGS data were uploaded to the National Center for Biotechnology Information Sequence Read Archive (PRJNA1111233). Source data are provided with this paper.

References

  1. Chin, J. W. Expanding and reprogramming the genetic code of cells and animals. Annu. Rev. Biochem. 83, 379–408 (2014).

    Article  CAS  PubMed  Google Scholar 

  2. Noren, C. J., Anthony-Cahill, S. J., Griffith, M. C. & Schultz, P. G. A general method for site-specific incorporation of unnatural amino acids into proteins. Science 244, 182–188 (1989).

    Article  CAS  PubMed  Google Scholar 

  3. Wang, L., Brock, A., Herberich, B. & Schultz, P. G. Expanding the genetic code of Escherichia coli. Science 292, 498–500 (2001).

    Article  CAS  PubMed  Google Scholar 

  4. Chin, J. W., Martin, A. B., King, D. S., Wang, L. & Schultz, P. G. Addition of a photocrosslinking amino acid to the genetic code of Escherichia coli. Proc. Natl Acad. Sci. USA 99, 11020–11024 (2002).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Young, T. S., Ahmad, I., Yin, J. A. & Schultz, P. G. An enhanced system for unnatural amino acid mutagenesis in E. coli. J. Mol. Biol. 395, 361–374 (2010).

    Article  CAS  PubMed  Google Scholar 

  6. Johnson, D. B. et al. RF1 knockout allows ribosomal incorporation of unnatural amino acids at multiple sites. Nat. Chem. Biol. 7, 779–786 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Chatterjee, A., Sun, S. B., Furman, J. L., Xiao, H. & Schultz, P. G. A versatile platform for single- and multiple-unnatural amino acid mutagenesis in Escherichia coli. Biochemistry 52, 1828–1837 (2013).

    Article  CAS  PubMed  Google Scholar 

  8. Chin, J. W. Expanding and reprogramming the genetic code. Nature 550, 53–60 (2017).

    Article  CAS  PubMed  Google Scholar 

  9. Dumas, A., Lercher, L., Spicer, C. D. & Davis, B. G. Designing logical codon reassignment—expanding the chemistry in biology. Chem. Sci. 6, 50–69 (2015).

    Article  CAS  PubMed  Google Scholar 

  10. Ostrov, N. et al. Synthetic genomes with altered genetic codes. Curr. Opin. Syst. Biol. 24, 32–40 (2020).

    Article  Google Scholar 

  11. Fredens, J. et al. Total synthesis of Escherichia coli with a recoded genome. Nature 569, 514–518 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Zurcher, J. F. et al. Continuous synthesis of E. coli genome sections and Mb-scale human DNA assembly. Nature 619, 555–562 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Zhang, Y. et al. A semi-synthetic organism that stores and retrieves increased genetic information. Nature 551, 644–647 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Goto, Y. & Suga, H. The RaPID platform for the discovery of pseudo-natural macrocyclic peptides. Acc. Chem. Res. 54, 3604–3617 (2021).

    Article  CAS  PubMed  Google Scholar 

  15. Anderson, J. C. et al. An expanded genetic code with a functional quadruplet codon. Proc. Natl Acad. Sci. USA 101, 7566–7571 (2004).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Rackham, O. & Chin, J. W. A network of orthogonal ribosomemRNA pairs. Nat. Chem. Biol. 1, 159–166 (2005).

    Article  CAS  PubMed  Google Scholar 

  17. Ohtsuki, T., Yamamoto, H., Doi, Y. & Sisido, M. Use of EF-Tu mutants for determining and improving aminoacylation efficiency and for purifying aminoacyl tRNAs with non-natural amino acids. J. Biochem. 148, 239–246 (2010).

    Article  CAS  PubMed  Google Scholar 

  18. Neumann, H., Wang, K., Davis, L., Garcia-Alai, M. & Chin, J. W. Encoding multiple unnatural amino acids via evolution of a quadruplet-decoding ribosome. Nature 464, 441–444 (2010).

    Article  CAS  PubMed  Google Scholar 

  19. Agarwal, D., Kamath, D., Gregory, S. T., O’Connor, M. & Gourse, R. L. Modulation of decoding fidelity by ribosomal proteins S4 and S5. J. Bacteriol. 197, 1017–1025 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Schmied, W. H. et al. Controlling orthogonal ribosome subunit interactions enables evolution of new function. Nature 564, 444–448 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Aleksashin, N. A. et al. A fully orthogonal system for protein synthesis in bacterial cells. Nat. Commun. 11, 1858 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Debenedictis, E. A., Carver, G. D., Chung, C. Z., Söll, D. & Badran, A. H. Multiplex suppression of four quadruplet codons via tRNA directed evolution. Nat. Commun. 12, 5706 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Kim, D. S. et al. Three-dimensional structure-guided evolution of a ribosome with tethered subunits. Nat. Chem. Biol. 18, 990–998 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Gamper, H., Masuda, I. & Hou, Y. M. Genome expansion by tRNA +1 frameshifting at quadruplet codons. J. Mol. Biol. 434, 167440 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Hohsaka, T., Ashizuka, Y., Taira, H., Murakami, H. & Sisido, M. Incorporation of nonnatural amino acids into proteins by using various four-base codons in an Escherichia coli in vitro translation system. Biochemistry 40, 11060–11064 (2001).

    Article  CAS  PubMed  Google Scholar 

  26. Chatterjee, A., Xiao, H. & Schultz, P. G. Evolution of multiple, mutually orthogonal prolyl-tRNA synthetase/tRNA pairs for unnatural amino acid mutagenesis in Escherichia coli. Proc. Natl Acad. Sci. USA 109, 14841–14846 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Tuller, T. et al. An evolutionarily conserved mechanism for controlling the efficiency of protein translation. Cell 141, 344–354 (2010).

    Article  CAS  PubMed  Google Scholar 

  28. Hooper, S. D. & Berg, O. G. Gradients in nucleotide and codon usage along Escherichia coli genes. Nucleic Acids Res. 28, 3517–3523 (2000).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Gamble, C. E., Brule, C. E., Dean, K. M., Fields, S. & Grayhack, E. J. Adjacent codons act in concert to modulate translation efficiency in yeast. Cell 166, 679–690 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Schinn, S. M. et al. Rapid in vitro screening for the location-dependent effects of unnatural amino acids on protein expression and activity. Biotechnol. Bioeng. 114, 2412–2417 (2017).

    Article  CAS  PubMed  Google Scholar 

  31. Xu, H. et al. Re-exploration of the codon context effect on amber codon-guided incorporation of noncanonical amino acids in Escherichia coli by the blue–white screening assay. ChemBioChem 17, 1250–1256 (2016).

    Article  CAS  PubMed  Google Scholar 

  32. Pott, M., Schmidt, M. J. & Summerer, D. Evolved sequence contexts for highly efficient amber suppression with noncanonical amino acids. ACS Chem. Biol. 9, 2815–2822 (2014).

    Article  CAS  PubMed  Google Scholar 

  33. Bartoschek, M. D. et al. Identification of permissive amber suppression sites for efficient non-canonical amino acid incorporation in mammalian cells. Nucleic Acids Res. 49, e62 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Dunkelmann, D. L., Oehm, S. B., Beattie, A. T. & Chin, J. W. A 68-codon genetic code to incorporate four distinct non-canonical amino acids enabled by automated orthogonal mRNA design. Nat. Chem. 13, 1110–1117 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  35. Fluitt, A., Pienaar, E. & Viljoen, H. Ribosome kinetics and aa-tRNA competition determine rate and fidelity of peptide synthesis. Comput. Biol. Chem. 31, 335–346 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Bryson, D. I. et al. Continuous directed evolution of aminoacyl-tRNA synthetases. Nat. Chem. Biol. 13, 1253–1260 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Melnikov, S. V. & Söll, D.Aminoacyl-tRNA synthetases and tRNAs for an expanded genetic code: what makes them orthogonal? Int. J. Mol. Sci. 20, 1929 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Liu, F., Bratulic, S., Costello, A., Miettinen, T. P. & Badran, A. H. Directed evolution of rRNA improves translation kinetics and recombinant protein yield. Nat. Commun. 12, 5638 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Willis, J. C. W. & Chin, J. W. Mutually orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs. Nat. Chem. 10, 831–837 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Chatterjee, A., Lajoie, M. J., Xiao, H., Church, G. M. & Schultz, P. G. A bacterial strain with a unique quadruplet codon specifying non‐native amino acids. ChemBioChem 15, 1782–1786 (2014).

    Article  CAS  PubMed  Google Scholar 

  41. Dunkelmann, D. L., Willis, J. C. W., Beattie, A. T. & Chin, J. W. Engineered triply orthogonal pyrrolysyl-tRNA synthetase/tRNA pairs enable the genetic encoding of three distinct non-canonical amino acids. Nat. Chem. 12, 535–544 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Ikeda-Boku, A. et al. A simple system for expression of proteins containing 3-azidotyrosine at a pre-determined site in Escherichia coli. J. Biochem. 153, 317–326 (2013).

    Article  CAS  PubMed  Google Scholar 

  43. Chatterjee, A., Xiao, H., Yang, P. Y., Soundararajan, G. & Schultz, P. G. A tryptophanyl‐tRNA synthetase/tRNA pair for unnatural amino acid mutagenesis in E. coli. Angew. Chem. Int. Ed. Engl. 52, 5106–5109 (2013).

    Article  CAS  PubMed  Google Scholar 

  44. Hughes, R. A. & Ellington, A. D. Rational design of an orthogonal tryptophanyl nonsense suppressor tRNA. Nucleic Acids Res. 38, 6813–6830 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Cervettini, D. et al. Rapid discovery and evolution of orthogonal aminoacyl-tRNA synthetase–tRNA pairs. Nat. Biotechnol. 38, 989–999 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Kwon, I., Wang, P. & Tirrell, D. A. Design of a bacterial host for site-specific incorporation of p-bromophenylalanine into recombinant proteins. J. Am. Chem. Soc. 128, 11778–11783 (2006).

    Article  CAS  PubMed  Google Scholar 

  47. Park, H.-S. et al. Expanding the genetic code of Escherichia coli with phosphoserine. Science 333, 1151–1154 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Rogerson, D. T. et al. Efficient genetic encoding of phosphoserine and its nonhydrolyzable analog. Nat. Chem. Biol. 11, 496–503 (2015).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Anderson, J. C. & Schultz, P. G. Adaptation of an orthogonal archaeal leucyl-tRNA and synthetase pair for four-base, amber, and opal suppression. Biochemistry 42, 9598–9608 (2003).

    Article  CAS  PubMed  Google Scholar 

  50. Liu, D. R. & Schultz, P. G. Progress toward the evolution of an organism with an expanded genetic code. Proc. Natl Acad. Sci. USA 96, 4780–4785 (1999).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Zambaldo, C. et al. An orthogonal seryl-tRNA synthetase/tRNA pair for noncanonical amino acid mutagenesis in Escherichia coli. Bioorg. Med. Chem. 28, 115662 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Makino, Y. et al. An archaeal ADP-dependent serine kinase involved in cysteine biosynthesis and serine metabolism. Nat. Commun. 7, 13446 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  53. Wang, K. et al. Optimized orthogonal translation of unnatural amino acids enables spontaneous protein double-labelling and FRET. Nat. Chem. 6, 393–403 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Jaric, J. & Budisa, N. in Hydrocarbon and Lipid Microbiology Protocols (eds McGenity, T. et al.) 71–82 (Springer, 2015).

  55. Wang, J. et al. A biosynthetic route to photoclick chemistry on proteins. J. Am. Chem. Soc. 132, 14812–14818 (2010).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Abdelkader, E. H. et al. Genetic encoding of cyanopyridylalanine for in-cell protein macrocyclization by the nitrile–aminothiol click reaction. Angew. Chem. Int. Ed. Engl. 61, e202114154 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Xiao, H. et al. Genetic incorporation of histidine derivatives using an engineered pyrrolysyl-tRNA synthetase. ACS Chem. Biol. 9, 1092–1096 (2014).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Yanagisawa, T. et al. Structural basis for genetic-code expansion with bulky lysine derivatives by an engineered pyrrolysyl-tRNA synthetase. Cell Chem. Biol. 26, 936–949 (2019).

    Article  CAS  PubMed  Google Scholar 

  59. Robertson, W. E. et al. Sense codon reassignment enables viral resistance and encoded polymer synthesis. Science 372, 1057–1062 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Tianero, M. D., Donia, M. S., Young, T. S., Schultz, P. G. & Schmidt, E. W. Ribosomal route to small-molecule diversity. J. Am. Chem. Soc. 134, 418–425 (2012).

    Article  CAS  PubMed  Google Scholar 

  61. Young, T. S. et al. Evolution of cyclic peptide protease inhibitors. Proc. Natl Acad. Sci. USA 108, 11052–11056 (2011).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Vamisetti, G. B. et al. Selective macrocyclic peptide modulators of Lys63-linked ubiquitin chains disrupt DNA damage repair. Nat. Commun. 13, 6174 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  63. Katoh, T. & Suga, H. In vitro selection of foldamer-like macrocyclic peptides containing 2-aminobenzoic acid and 3-aminothiophene-2-carboxylic acid. J. Am. Chem. Soc. 144, 2069–2072 (2022).

    Article  CAS  PubMed  Google Scholar 

  64. Spinck, M. et al. Genetically programmed cell-based synthesis of non-natural peptide and depsipeptide macrocycles. Nat. Chem. 15, 61–69 (2023).

    Article  CAS  PubMed  Google Scholar 

  65. Townend, J. E. & Tavassoli, A. Traceless production of cyclic peptide libraries in E. coli. ACS Chem. Biol. 11, 1624–1630 (2016).

    Article  CAS  PubMed  Google Scholar 

  66. Miranda, E. et al. A cyclic peptide inhibitor of HIF-1 heterodimerization that inhibits hypoxia signaling in cancer cells. J. Am. Chem. Soc. 135, 10418–10425 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Cheriyan, M., Pedamallu, C. S., Tori, K. & Perler, F. Faster protein splicing with the Nostoc punctiforme DnaE intein using non-native extein residues. J. Biol. Chem. 288, 6202–6211 (2013).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Stevens, A. J. et al. A promiscuous split intein with expanded protein engineering applications. Proc. Natl Acad. Sci. USA 114, 8538–8543 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Kolber, N. S., Fattal, R., Bratulic, S., Carver, G. D. & Badran, A. H. Orthogonal translation enables heterologous ribosome engineering in E. coli. Nat. Commun. 12, 599 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Geu-Flores, F., Nour-Eldin, H. H., Nielsen, M. T. & Halkier, B. A. USER fusion: a rapid and efficient method for simultaneous fusion and cloning of multiple PCR products. Nucleic Acids Res. 35, e55 (2007).

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We thank fellow Badran Lab members for their helpful discussions. We gratefully acknowledge H. Li for assistance with NGS analyses, M. L. Bulos for assistance with immunoblotting, B. Seegers, B. Monteverde and A. Owirka of the Scripps Research Flow Cytometry Core for their assistance with cell sorting and Q. Nguyen Wong, B. Sanchez, J. Lee and J. Chen of the Scripps Research Institute Automated Synthesis Facility for their assistance with mass spectral analysis. This work was supported by the Scripps Research Institute, the National Institutes of Health Director’s Early Independence Award (DP5-OD024590 to A.H.B.), the Research Corporation for Science Advancement and Sloan Foundation (G-2023-19625 to A.H.B.), the Thomas Daniel Innovation Fund (627163_1 to A.H.B.), the Abdul Latif Jameel Water and Food Systems Lab Grand Challenge Award (GR000141-S6241 to A.H.B.), the Breakthrough Energy Explorer Grant (GR000056 to A.H.B.), the Foundation for Food and Agriculture Research New Innovator Award (28-000578 to A.H.B.), the Homeworld Collective Garden Grant (GR000129 to A.H.B.) and the Army Research Office Young Investigator Award (81341-BB-ECP to A.H.B.). A.A.P. is a Hope Funds for Cancer Research Fellow supported by the Hope Funds for Cancer Research Fellowship (HFCR-23-03-01). D.L.L. is supported by a Skaggs-Oxford Scholarship and a Fletcher Jones Foundation Fellowship.

Author information

Authors and Affiliations

Authors

Contributions

A.C. designed the study, led the experimental work and analyzed the results. A.A.P. designed and led all macrocycle production studies. D.L.L. contributed the orthogonal aaRS–tRNA matrix and codon–anticodon discovery efforts. Z.L. investigated RP origin copy number changes to optimize quadruplet decoding. G.D.C. designed the mRNA synonymous codon libraries. A.H.B. conceptualized and designed the study, performed the experiments, analyzed the results and supervised the research. A.C. and A.H.B. wrote the paper with input from all authors.

Corresponding author

Correspondence to Ahmed H. Badran.

Ethics declarations

Competing interests

A.H.B. and A.C. have filed a provisional patent application through The Scripps Research Institute on the sequences and activities of tRNAs, proteins, enzymes and bacterial strains described in this paper. The other authors declare no competing interests.

Peer review

Peer review information

Nature Biotechnology thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Circuit Architecture Optimization for Efficient Quadruplet Decoding.

a) Schematic representation and data of three-plasmid circuit architecture (PS3). aaRS and tRNA genes are encoded on isolated plasmids and used in combination to decode a quadruplet codon in the reporter plasmid. Y151 substituted sfGFP quadruplet decoding by Ma tRNAPyl(8)AUAG, Af tRNATyr(A01)CUAG, Int tRNAPylA17VB03AGGA, and Spe tRNAPylUAGA in PS3 circuit architectures. ncAAs are 3-cyano-L-phenylalanine, 4-iodo-L-phenylalanine, 3-methyl-L-histidine, and N6-Boc-L-Lysine respectively, (n = 4 biological replicates, error shows standard deviation). b) Circuit architecture was refined to two plasmids (PS2) by moving the tRNA expression cassette onto the reporter plasmid. Y151 substituted sfGFP quadruplet decoding by Ma tRNAPyl(8)AUAG, Af tRNATyr(A01)CUAG, Int tRNAPylA17VB03AGGA, and Spe tRNAPylUAGA in PS2 circuit architectures. ncAAs are 3-cyano-L-phenylalanine, 4-iodo-L-phenylalanine, 3-methyl-L-histidine, and N6-Boc-L-Lysine respectively, (n = 4 biological replicates, error shows standard deviation).

Source data

Extended Data Fig. 2 Directed Evolution of Improved Quadruplet Decoding Using of G1PylRS–Ma qtRNAPylAUAG.

a) The starting Ma tRNAPylAGUA variants (8) and (17)38 show poor AGUA decoding at Y151 in sfGFP using a two-plasmid system alongside G1PylRS and 1 mM H-Lys(Z)-OH (CbzK), four biological replicates, (n = 4 biological replicates, error shows standard deviation). b) Both qtRNA anticodon stems were randomized using degenerate oligonucleotides and subjected to positive selection on chloramphenicol agar plates. Single clones were assayed for resistance to 16 µg/mL chloramphenicol with and without 1 mM CbzK, yielding variants that catalyzed CbzK-dependent AGUA decoding (n = 1). c) Top chlor-resistant clones were further validated through AGUA decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the (8) variant, (n = 4 biological replicates, error shows standard deviation). Most clones showed a reduction in CbzK-independent decoding rather than an improvement in CbzK-dependent decoding. Subsequent selections and randomization at other positions did not further improve dynamic range (not shown). d) Alternative codons were tested through transplantation into the Ma tRNAPylAGUA variant (8) scaffold. Cognate codon decoding was monitored using a dedicated sfGFP reporter at Y151 with and without 1 mM CbzK in each case (n = 1). AUAG and CGAA codons were prioritized due to low background. e) Ma tRNAPylAUAG and Ma tRNAPylCGAA anticodon stems and loops were randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Single clones were assayed for decoding at Y151 in sfGFP, showing robust CbzK dependence (n = 1). f) Top chlor-resistant clones were further validated through AUAG and CGAA decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the (8) variant, (n = 4 biological replicates, error shows standard deviation). AUAG decoding qtRNAs were prioritized due to their lower background without CbzK. g) An anticodon loop library for Ma tRNAPylCGAA variant MB11 returned input sequence despite robust library coverage. h) Ma tRNAPyl AUAG variant MB11 acceptor stem was randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Single clones were assayed for decoding at Y151 in sfGFP, showing improved decoding and CbzK dependence (n = 1). i) Top chlor-resistant clones were further validated through AUAG decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the (8) variant, (n = 4 biological replicates, error shows standard deviation). Despite the improvements in dynamic range of these qtRNAs, mutant B11 (MB11) was designated as the best variant due its higher signal with 1 mM CbzK in (f) and used for all subsequent assays.

Source data

Extended Data Fig. 3 Directed Evolution of Improved Quadruplet Decoding Using AfTyrRS–AfqtRNATyrCUAG.

a) The starting Af qtRNATyr(A01)CUAG variant42 shows moderate CUAG decoding at Y151 in sfGFP using a two-plasmid system alongside AfTyrRS and 1 mM 4-iodo-L-phenylalanine, (n = 4 biological replicates, error shows standard deviation). b) The qtRNA anticodon stem was randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Evolved single clones showed robust chloramphenicol above the MIC using the starting qtRNA (n = 1). c) Top chlor-resistant clones were further validated through CUAG decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the Af qtRNATyr(A01)CUAG variant, (n = 4 biological replicates, error shows standard deviation). Mutant 9 (M9) was designated as the best variant and used for all subsequent assays.

Source data

Extended Data Fig. 4 Directed Evolution of Improved Quadruplet Decoding Using Mlum1RS–IntqtRNAPylAGGA.

(a) The starting Int qtRNAPylAGGA variant A17,VB0338 shows poor AGGA decoding at Y151 in sfGFP using a three-plasmid system alongside Mlum1RS and 1 mM 3-methyl-L-histidine (NmH), (n = 4 biological replicates, error shows standard deviation). (b) The qtRNA anticodon stem was randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Evolved single clones showed robust chloramphenicol above the MIC using the starting qtRNA (n = 1). (c) Top chlor-resistant clones were further validated through AGGA decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the A17,VB03 variant, (n = 4 biological replicates, error shows standard deviation). (d) The top mutant (M5) was used as a scaffold for further randomized using targeting the anticodon loop, and subjected to another round of positive selection on chloramphenicol agar plates. Evolved single clones showed an even greater MIC to chloramphenicol as compared to the starting qtRNA (n = 1). (e) Top chlor-resistant clones were further validated through AGGA decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the A17,VB03 variant, (n = 4 biological replicates, error shows standard deviation). Mutant 5.3 (M5.3) was designated as the best variant and used for all subsequent assays.

Source data

Extended Data Fig. 5 Directed Evolution of Improved Quadruplet Decoding Using MmPylRS–SpetRNAPyl.

a) The starting Spe qtRNAPylUAGA38 shows poor UAGA decoding at Y151 in sfGFP using a two-plasmid system alongside both MbPylRS and MmPylRS and 1 mM N6-Boc-L-Lysine (BocK), (n = 4 biological replicates, error shows standard deviation). b) The qtRNA anticodon stem was randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Evolved single clones showed robust chloramphenicol above the MIC using the starting qtRNA (n = 1). Related qtRNA genes Mb qtRNAPylUAGA15 and Vul qtRNAPylUAG38 engineered to make Vul qtRNAPylUAGA, were subjected to the identical selection but did not result in chloramphenicol survival (not shown). c) Top chlor-resistant clones were further validated through UAGA decoding at Y151 in sfGFP using both MbPylRS and MmPylRS. Start line indicates the highest activity achieved with the initial Spe qtRNAPylUAGA variant, (n = 4 biological replicates, error shows standard deviation). d) The starting Spe qtRNAPylUAGA, mutants A01 (MA01) and G02 (MG02) were used as scaffolds for further randomization targeting the anticodon loop, Ψ-arm, D-arm, and D-loop. In all cases, the selection returned Spe qtRNAPylUAGA MG02 input suggesting maximal fitness. e) Alternative codons AGAN, AUAN, CCCN, CGAN, and AGUN were tested through transplantation into the Spe qtRNAPylUAGA MG02 scaffold in a tRNA-mRNA selection using 4 µg/ mL chloramphenicol plates. Cognate codon decoding was validated by using dedicated sfGFP reporter at Y151 with and without 1 mM BocK. In each case, sixteen colonies tested for each codon-anticodon library, (n = 16). All variants were ultimately abandoned due to lower signal than Spe qtRNAPylUAGA MG02. f) Spe qtRNAPylUAGA variant MG02 acceptor stem was randomized using degenerate oligonucleotides, and subjected to positive selection on chloramphenicol agar plates. Top chlor-resistant clones were further validated through UAGA decoding at Y151 in sfGFP. Start line indicates the highest activity achieved with the initial Spe qtRNAPylUAGA variant, (n = 4 biological replicates, error shows standard deviation). Mutant C07 (MC07) was designated as the best variant and used for all subsequent assays.

Source data

Extended Data Fig. 6 Directed Evolution of Improved Quadruplet Decoding Using ScTrpRS–SctRNATrpCGGA.

a) To define an appropriate quadruplet codon–anticodon pair, all possible pairs were evaluated using a library-cross-library approach. Position Y151 of sfGFP and the Sc tRNATrp(M13) anticodon were randomized using degenerate oligonucleotides and co-transformed into S3489 E. coli cells carrying ScTrpRS. Transformants were streaked on agar plates supplemented with 1 mM 5-hydroxy-L-tryptophan (5hW). Single colonies were then picked based on green fluorescence and evaluated with and without ncAA (n = 1). All 5hW-dependent clones carried CGGN codons and qtRNAs with the corresponding anticodons, including mismatches at the 4th position. b) Evaluation of all possible CGGN codon–anticodon pairs at position Y151 in sfGFP with and without 1 mM 5hW. Sc tRNATrp(M13)CGGA and Sc tRNATrp(M13)CGGC were prioritized in subsequent studies. c) Confirmation of ScTrpRS- and 5hW-dependent quadruplet decoding by comparison to a non-cognate tRNA, AlaTGCA (Ec tRNAAlaCTRL). We note a high 5hW-independent background translation. d) The starting Sc tRNATrp(M13)CGGA and Sc tRNATrp(M13)CGGC were randomized at the anticodon stem or acceptor stem, subjected to negative selection to eliminate aminoacylation by host synthetases, then subjected to positive selection on chloramphenicol agar plates. Evolved single clones showed robust 5hW-dependent growth at 2 µg/mL chloramphenicol (n = 1). e) Top chlor-resistant clones were further validated through CGGA or CGGC decoding at Y151 in sfGFP using ScTrpRS. Start line indicates the highest activity achieved with the initial Sc tRNATrp(M13)CGGA variant, (n = 4 biological replicates, error shows standard deviation). Mutant A11 (MA11) was designated as the best variant and used for all subsequent assays.

Source data

Extended Data Fig. 7 Rational Engineering of a qtRNA Operon to Include Sc tRNATrp(M13)CGGA.

a) Schematic representation of the genetic circuit for 5’ placement of Sc qtRNATrp(M13)CGGA. Sc qtRNATrp(M13)CGGA is encoded 5’ of Int qtRNAPylAGGA, where the two qtRNAs are separated by E. coli derived inter-tRNA sequences (navy highlight). Functional Sc qtRNATrp(M13)CGGA production will yield green fluorescence when tested alongside ScTrpRS and the cognate ncAA. b) Sc qtRNATrp(M13)CGGA production is dependent on the 5’ inter-qtRNA sequence as determined by CGGA decoding at Y151 in sfGFP (n = 1). OD600 values following overnight growth in the presence of 1 mM 5-hydroxy-L-tryptophan (5hW) are shown on the right (n = 1). c) Schematic representation of the genetic circuit for 3’ placement of Sc qtRNATrp(M13)CGGA. Sc qtRNATrp(M13)CGGA is encoded 5’ of Af qtRNATyrCUAG, where the two qtRNAs are separated by E. coli derived inter-tRNA sequences (navy highlight). Functional Sc qtRNATrp(M13)CGGA production will yield green fluorescence when tested alongside ScTrpRS and the cognate ncAA. d) Sc tRNATrp(M13)CGGA production is dependent on the 3’ inter-qtRNA sequence as determined by CGGA decoding at Y151 in sfGFP (n = 1). OD600 values following overnight growth in the presence of 1 mM 5hW are shown on the right (n = 1). Due to its lower error and tolerated cell growth, glnV-glnX inter-tRNA sequence was chosen for the 3’ spacer sequence, making Sc qtRNATrp(M13)CGGA the final qtRNA in the engineered operon.

Source data

Extended Data Fig. 8 Optimization of a Macrocycle Biosynthesis Platform for cyclo-CLLFVY.

a) The split-Npu gene cassette driven by the pBAD promoter was cloned on plasmids bearing different origins of replication with variable copy number. Following expression, cells are lysed and organic soluble components isolated for analysis by LC-MS. Straight line is the logarithmic regression of these findings, and the grey area described the standard deviation of the regression. b) Arginine (R) substitutions and additions were made to the cyclo-CLLFVY macrocyclic peptide scaffold to explore tolerance to insertions and mutations. Following expression, cells were lysed and organic soluble components isolated for analysis by LC-MS.

Source data

Extended Data Fig. 9 Validation of LC-MS Analysis of E. coli Crude Lysate for Macrocycle Biosynthesis.

Following expression, lysis, and organic solvent extraction of soluble components, LC-MS analysis was used to validate expression of macrocycle products. a) Mass spectrum for C[3OmeF]LFVY. The primary mass is M + H when analysing the peak indicated on the XIC trace from Fig. 5c. b) Characteristic isotopic pattern (M + 1 and M + 3) of bromine in the mass spectrum of this macrocycle product.

Source data

Extended Data Fig. 10 Quantifying Biosynthesis of Npu-Generated Macrocycles.

Standard curves were derived using macrocycles generated by solid-phase peptide synthesis (SPPS standard) to quantify the biosynthetic yield of our platform. For each synthetase – tRNA – quadruplet codon set, we used a single representative macrocycle. We interpolated the Area (µV*sec) of SPPS standards over a serial dilution from 10 – 0.3 µM but changes depending on macrocycle (see Source Data File) and used this to calculate the yield of macrocycles generated by intracellular decoding of (a) AUGA with 4-nitro-L-phenylalanine, (b) CUAG with 4-iodo-L-phenylalanine, (c) AGGA with 3-pyridyl-L-alanine, (d) UAGA with N6-alloc-L-lysine, and (e) CGGA with 3-(1-naphthyl)-L-alanine. All SPPS and biosynthesized macrocycle samples were tested in biological triplicate (n = 3). LC traces and m/z spectra for all concentrations of SSPS synthesized samples and biosynthesized macrocycles can be found in (Supplemental Information).

Source Data File.

Supplementary information

Supplementary Information

Supplementary Figs. 1–18, Tables 1–14, Data 1–5 and references.

Reporting Summary

Supplementary Data 1

Statistical source data for supplementary figures.

Source data

Source Data

Statistical source data.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Costello, A., Peterson, A.A., Lanster, D.L. et al. Efficient genetic code expansion without host genome modifications. Nat Biotechnol (2024). https://doi.org/10.1038/s41587-024-02385-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1038/s41587-024-02385-y

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing