Organisms are defined by the information encoded in their genomes, and since the origin of life this information has been encoded using a two-base-pair genetic alphabet (A–T and G–C). In vitro, the alphabet has been expanded to include several unnatural base pairs (UBPs)1,2,3. We have developed a class of UBPs formed between nucleotides bearing hydrophobic nucleobases, exemplified by the pair formed between d5SICS and dNaM (d5SICS–dNaM), which is efficiently PCR-amplified1 and transcribed4,5 in vitro, and whose unique mechanism of replication has been characterized6,7. However, expansion of an organism’s genetic alphabet presents new and unprecedented challenges: the unnatural nucleoside triphosphates must be available inside the cell; endogenous polymerases must be able to use the unnatural triphosphates to faithfully replicate DNA containing the UBP within the complex cellular milieu; and finally, the UBP must be stable in the presence of pathways that maintain the integrity of DNA. Here we show that an exogenously expressed algal nucleotide triphosphate transporter efficiently imports the triphosphates of both d5SICS and dNaM (d5SICSTP and dNaMTP) into Escherichia coli, and that the endogenous replication machinery uses them to accurately replicate a plasmid containing d5SICS–dNaM. Neither the presence of the unnatural triphosphates nor the replication of the UBP introduces a notable growth burden. Lastly, we find that the UBP is not efficiently excised by DNA repair pathways. Thus, the resulting bacterium is the first organism to propagate stably an expanded genetic alphabet.
Subscribe to Journal
Get full journal access for 1 year
only $3.90 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
Malyshev, D. A. et al. Efficient and sequence-independent replication of DNA containing a third base pair establishes a functional six-letter genetic alphabet. Proc. Natl Acad. Sci. USA 109, 12005–12010 (2012)
Yang, Z., Chen, F., Alvarado, J. B. & Benner, S. A. Amplification, mutation, and sequencing of a six-letter synthetic genetic system. J. Am. Chem. Soc. 133, 15105–15112 (2011)
Yamashige, R. et al. Highly specific unnatural base pair systems as a third base pair for PCR amplification. Nucleic Acids Res. 40, 2793–2806 (2012)
Seo, Y. J., Malyshev, D. A., Lavergne, T., Ordoukhanian, P. & Romesberg, F. E. Site-specific labeling of DNA and RNA using an efficiently replicated and transcribed class of unnatural base pairs. J. Am. Chem. Soc. 133, 19878–19888 (2011)
Seo, Y. J., Matsuda, S. & Romesberg, F. E. Transcription of an expanded genetic alphabet. J. Am. Chem. Soc. 131, 5046–5047 (2009)
Betz, K. et al. Structural insights into DNA replication without hydrogen bonds. J. Am. Chem. Soc. 135, 18637–18643 (2013)
Betz, K. et al. KlenTaq polymerase replicates unnatural base pairs by inducing a Watson–Crick geometry. Nature Chem. Biol. 8, 612–614 (2012)
Wu, Y., Fa, M., Tae, E. L., Schultz, P. G. & Romesberg, F. E. Enzymatic phosphorylation of unnatural nucleosides. J. Am. Chem. Soc. 124, 14626–14630 (2002)
Yan, H. & Tsai, M. D. Nucleoside monophosphate kinases: structure, mechanism, and substrate specificity. Adv. Enzymol. 73, 103–134 (1999)
Winkler, H. H. & Neuhaus, H. E. Non-mitochondrial ATP transport. Trends Biochem. Sci. 24, 64–68 (1999)
Amiri, H., Karlberg, O. & Andersson, S. G. Deep origin of plastid/parasite ATP/ADP translocases. J. Mol. Evol. 56, 137–150 (2003)
Hatch, T. P., Al-Hossainy, E. & Silverman, J. A. Adenine nucleotide and lysine transport in Chlamydia psittaci. J. Bacteriol. 150, 662–670 (1982)
Winkler, H. H. Rickettsial permeability: an ADP-ATP transport system. J. Biol. Chem. 251, 389–396 (1976)
Horn, M. & Wagner, M. Bacterial endosymbionts of free-living amoebae. J. Eukaryot. Microbiol. 51, 509–514 (2004)
Haferkamp, I. et al. Tapping the nucleotide pool of the host: novel nucleotide carrier proteins of Protochlamydia amoebophila. Mol. Microbiol. 60, 1534–1545 (2006)
Miroux, B. & Walker, J. E. Over-production of proteins in Escherichia coli: mutant hosts that allow synthesis of some membrane proteins and globular proteins at high levels. J. Mol. Biol. 260, 289–298 (1996)
Haferkamp, I. & Linka, N. Functional expression and characterisation of membrane transport proteins. Plant Biol. 14, 675–690 (2012)
Ast, M. et al. Diatom plastids depend on nucleotide import from the cytosol. Proc. Natl Acad. Sci. USA 106, 3621–3626 (2009)
Baba, T. et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 2006.0008 (2006)
Lavergne, T., Malyshev, D. A. & Romesberg, F. E. Major groove substituents and polymerase recognition of a class of predominantly hydrophobic unnatural base pairs. Chemistry 18, 1231–1239 (2012)
Seo, Y. J., Hwang, G. T., Ordoukhanian, P. & Romesberg, F. E. Optimization of an unnatural base pair toward natural-like replication. J. Am. Chem. Soc. 131, 3246–3252 (2009)
Li, L. et al. Natural-like replication of an unnatural base pair for the expansion of the genetic alphabet and biotechnology applications. J. Am. Chem. Soc. 136, 826–829 (2014)
Tomizawa, J. & Selzer, G. Initiation of DNA synthesis in Escherichia coli. Annu. Rev. Biochem. 48, 999–1034 (1979)
Allen, J. M. et al. Roles of DNA polymerase I in leading and lagging-strand replication defined by a high-resolution mutation footprint of ColE1 plasmid replication. Nucleic Acids Res. 39, 7020–7033 (2011)
Hashimoto, H. et al. Structure of a Naegleria Tet-like dioxygenase in complex with 5-methylcytosine DNA. Nature 506, 391–395 (2013)
Malyshev, D. A., Seo, Y. J., Ordoukhanian, P. & Romesberg, F. E. PCR with an expanded genetic alphabet. J. Am. Chem. Soc. 131, 14620–14621 (2009)
Hirao, I. et al. An unnatural hydrophobic base pair system: site-specific incorporation of nucleotide analogs into DNA and RNA. Nature Methods 3, 729–735 (2006)
Goodman, M. F. Error-prone repair DNA polymerases in prokaryotes and eukaryotes. Annu. Rev. Biochem. 71, 17–50 (2002)
Quan, J. & Tian, J. Circular polymerase extension cloning for high-throughput cloning of complex and combinatorial DNA libraries. Nature Protocols 6, 242–251 (2011)
Ludwig, J. & Eckstein, F. Rapid and efficient synthesis of nucleoside 5′-0-(1-thiotriphosphates), 5′-triphosphates and 2',3′-cyclophosphorothioates using 2-chloro-4H-1,3,2-benzodioxaphosphorin-4-one. J. Org. Chem. 54, 631–635 (1989)
Alpert, A. & Shukla, A. Precipitation of Large, High-Abundance Proteins from Serum with Organic Solvents in ABRF 2003: Translating Biology using Proteomics and Functional Genomics Poster no. P111-W http://www.abrf.org/Other/ABRFMeetings/ABRF2003/Alpert.pdf (2003)
Kubitschek, H. E. & Friske, J. A. Determination of bacterial cell volume with the Coulter Counter. J. Bacteriol. 168, 1466–1467 (1986)
Yanes, O., Tautenhahn, R., Patti, G. J. & Siuzdak, G. Expanding coverage of the metabolome for global metabolite profiling. Anal. Chem. 83, 2152–2161 (2011)
Seidman, C. E., Struhl, K., Sheen, J. & Jessen, T. Introduction of plasmid DNA into cells. Curr. Prot. Mol. Biol Chapter 1, Unit 1.8 (2001)
Knab, S., Mushak, T. M., Schmitz-Esser, S., Horn, M. & Haferkamp, I. Nucleotide parasitism by Simkania negevensis (Chlamydiae). J. Bacteriol. 193, 225–235 (2011)
Audia, J. P. & Winkler, H. H. Study of the five Rickettsia prowazekii proteins annotated as ATP/ADP translocases (Tlc): Only Tlc1 transports ATP/ADP, while Tlc4 and Tlc5 transport other ribonucleotides. J. Bacteriol. 188, 6261–6268 (2006)
Hofer, A., Ekanem, J. T. & Thelander, L. Allosteric regulation of Trypanosoma brucei ribonucleotide reductase studied in vitro and in vivo. J. Biol. Chem. 273, 34098–34104 (1998)
Reijenga, J. C., Wes, J. H. & van Dongen, C. A. M. Comparison of methanol and perchloric acid extraction procedures for analysis of nucleotides by isotachophoresis. J. Chromatogr. B Biomed. Sci. Appl. 374, 162–169 (1986)
We thank I. Haferkamp and J. Audia for kindly providing the NTT plasmids and helpful discussions, and P. Ordoukhanian for providing access to the Center for Protein and Nucleic Acid Research at TSRI. This work was supported by the US National Institutes of Health (NIH) (GM 060005).
F.E.R. and D.A.M. have filed a patent application based on the use of NTTs for biotechnological applications. F.E.R. D.A.M., T.L. and K.D. have shares in Synthorx Inc., a company that has commercial interests in the UBP. D.A.M. and K.D. are currently employed by Synthorx Inc. The other authors declare no competing financial interests.
Extended data figures and tables
a, Survey of reported substrate specificity (KM, μM) of the NTTs assayed in this study. b, PtNTT2 is significantly more active in the uptake of [α-32P]-dATP compared to other nucleotide transporters. Raw (left) and processed (right) data are shown. Relative radioactivity corresponds to the total number of counts produced by each sample. Interestingly, both PamNTT2 and PamNTT5 exhibit a measurable uptake of dATP although this activity was not reported before. This can possibly be explained by the fact that substrate specificity was only characterized using competition experiments, and assay sensitivity might not have been adequate to detect this activity15. References 35, 36 are cited in this figure.
Unnatural triphosphates (3P) of dNaM and d5SICS are degraded to diphosphates (2P), monophosphates (1P) and nucleosides (0P) in the growing bacterial culture. Potassium phosphate (KPi) significantly slows down the dephosphorylation of both unnatural triphosphates. a, Representative HPLC traces (for the region between ∼20 and 24 min). dNaM and d5SICS nucleosides are eluted at approximately 40 min and not shown. b, Composition profiles.
a, KPi inhibits the uptake of [α-32P]-dATP at concentrations above 100 mM. Raw (left) and processed (right) data are shown. The NTT from Rickettsia prowazekii (RpNTT2) does not mediate the uptake of any of the dNTPs and was used as a negative control: its background signal was subtracted from those of PtNTT2 (black bars) and TpNTT2 (white bars). Relative radioactivity corresponds to the total number of counts produced by each sample. b, KPi (50 mM) significantly stabilizes [α-32P]-dATP in the media. Triphosphate stability in the media is not significantly affected by the nature of the NTT expressed. 3P, 2P and 1P correspond to triphosphate, diphosphate and monophosphate states, respectively. Error bars represent s.d. of the mean, n = 3.
Extended Data Figure 4 dATP uptake and growth of cells expressing PtNTT2 as a function of inducer (IPTG) concentration.
Growth curves and [α-32P]-dATP uptake by bacterial cells transformed with pCDF-1b-PtNTT2 (pACS) plasmid as a function of IPTG concentration. a, Total uptake of radioactive substrate (left) and total intracellular triphosphate content (right) are shown at two different time points. Relative radioactivity corresponds to the total number of counts produced by each sample. b, A stationary phase culture of C41(DE3) pACS cells was diluted 100-fold into fresh 2 × YT media containing 50 mM KPi, streptomycin, and IPTG at the indicated concentrations and were grown at 37°C. Error bars represent s.d. of the mean, n = 3.
Composition of [α-32P]-dATP in the media (left) and cytoplasmic fraction (right) as a function of time. TLC images and their quantifications are shown at the bottom and the top of each of the panels, respectively. 3P, 2P and 1P correspond to nucleoside triphosphate, diphosphate and monophosphate, respectively. M refers to a mixture of all three compounds that was used as a TLC standard. The position labelled ‘Start’ corresponds to the position of sample spotting on the TLC plate.
a, The SAS is plotted as a function of the fraction of template containing the UBP. Error bars represent s.d. of the mean, n = 3. b, Representative data. SA, streptavidin.
Extended Data Figure 7 Decomposition of unnatural triphosphates, pINF quantification, and retention of the UBP with extended cell growth.
a, Dephosphorylation of the unnatural nucleoside triphosphate. 3P, 2P, 1P and 0P correspond to triphosphate, diphosphate, monophosphate and nucleoside states, respectively. The composition at the end of the 1 h recovery is shown at the right. b, Restriction analysis of pINF and pACS plasmids purified from E. coli, linearized with NdeI restriction endonuclease and separated on a 1% agarose gel (assembled from independent gel images). Molar ratios of pINF/pACS plasmids are shown at the top of each lane. For each time point, triplicate data are shown in three lanes with the untransformed control shown in the fourth, rightmost lane (see Methods). c, Number of pINF doublings as a function of time. The decrease starting at approximately 50 h is due to the loss of the pINF plasmid that also results in increased error. See the section on pINF replication in E. coli in the Methods for details. d, UBP retention (%) as a function of growth as determined by gel shift (data shown in Fig. 3) and Sanger sequencing (sequencing traces are available as Supplementary Data). In a, c and d, error shown is the s.d. of mean, n = 3.
The file contains the sequences of oligonucleotides used in this study, an example calculation of plasmid amplification, and the sequence of the pACS plasmid. (PDF 243 kb)
This file contains raw sequencing traces for PCR fragments generated from pINF plasmid propagated in E. coli at different time points (n=3). The position of the unnatural nucleotide is indicated with a red arrow. (XLSX 363 kb)
About this article
Cite this article
Malyshev, D., Dhami, K., Lavergne, T. et al. A semi-synthetic organism with an expanded genetic alphabet. Nature 509, 385–388 (2014). https://doi.org/10.1038/nature13314
Nature Reviews Genetics (2021)
Nature Communications (2021)
Nature Chemical Biology (2021)
Theory in Biosciences (2021)
Nature Reviews Chemistry (2020)