Abstract
Genetic memory enables the recording of information in the DNA of living cells. Memory can record a transient environmental signal or cell state that is then recalled at a later time. Permanent memory is implemented using irreversible recombinases that invert the orientation of a unit of DNA, corresponding to the [0,1] state of a bit. To expand the memory capacity, we have applied bioinformatics to identify 34 phage integrases (and their cognate attB and attP recognition sites), from which we build 11 memory switches that are perfectly orthogonal to each other and the FimE and HbiF bacterial invertases. Using these switches, a memory array is constructed in Escherichia coli that can record 1.375 bytes of information. It is demonstrated that the recombinases can be layered and used to permanently record the transient state of a transcriptional logic gate.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$259.00 per year
only $21.58 per issue
Buy this article
- Purchase on Springer Link
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Burrill, D.R. & Silver, P.A. Making cellular memories. Cell 140, 13–18 (2010).
Yamanishi, M. & Matsuyama, T. A modified Cre-lox genetic switch to dynamically control metabolic flow in Saccharomyces cerevisiae. ACS Synth. Biol. 1, 172–180 (2012).
Ham, T.S., Lee, S.K., Keasling, J.D. & Arkin, A.P. A tightly regulated inducible expression system utilizing the fim inversion recombination switch. Biotechnol. Bioeng. 94, 1–4 (2006).
Kawashima, T. et al. Functional labeling of neurons and their projections using the synthetic activity-dependent promoter E-SARE. Nat. Methods 10, 889–895 (2013).
Zariwala, H.A. et al. A Cre-dependent GCaMP3 reporter mouse for neuronal imaging in vivo. J. Neurosci. 32, 3131–3141 (2012).
Kotula, J.W. et al. Programmable bacteria detect and record an environmental signal in the mammalian gut. Proc. Natl. Acad. Sci. USA 111, 4838–4843 (2014).
Archer, E.J., Robinson, A.B. & Suel, G.M. Engineered E. coli that detect and respond to gut inflammation through nitric oxide sensing. ACS Synth. Biol. 1, 451–457 (2012).
Ingolia, N.T. & Murray, A.W. Positive-feedback loops as a flexible biological module. Curr. Biol. 17, 668–677 (2007).
Gardner, T.S., Cantor, C.R. & Collins, J.J. Construction of a genetic toggle switch in Escherichia coli. Nature 403, 339–342 (2000).
Ajo-Franklin, C.M. et al. Rational design of memory in eukaryotic cells. Genes Dev. 21, 2271–2276 (2007).
Burrill, D.R., Inniss, M.C., Boyle, P.M. & Silver, P.A. Synthetic memory circuits for tracking human cell fate. Genes Dev. 26, 1486–1497 (2012).
Greber, D., El-Baba, M.D. & Fussenegger, M. Intronically encoded siRNAs improve dynamic range of mammalian gene regulation systems and toggle switch. Nucleic Acids Res. 36, e101 (2008).
Ham, T.S., Lee, S.K., Keasling, J.D. & Arkin, A.P. Design and construction of a double inversion recombination switch for heritable sequential genetic memory. PLoS ONE 3, e2815 (2008).
Moon, T.S. et al. Construction of a genetic multiplexer to toggle between chemosensory pathways in Escherichia coli. J. Mol. Biol. 406, 215–227 (2011).
Bonnet, J., Subsoontorn, P. & Endy, D. Rewritable digital data storage in live cells via engineered control of recombination directionality. Proc. Natl. Acad. Sci. USA 109, 8884–8889 (2012).
Bonnet, J., Yin, P., Ortiz, M.E., Subsoontorn, P. & Endy, D. Amplifying genetic logic gates. Science 340, 599–603 (2013).
Friedland, A.E. et al. Synthetic gene networks that count. Science 324, 1199–1202 (2009).
Nielsen, A.A., Segall-Shapiro, T.H. & Voigt, C.A. Advances in genetic circuit design: novel biochemistries, deep part mining, and precision gene expression. Curr. Opin. Chem. Biol. 17, 878–892 (2013).
Brown, W.R., Lee, N.C., Xu, Z. & Smith, M.C. Serine recombinases as tools for genome engineering. Methods 53, 372–379 (2011).
Smith, M.C. & Thorpe, H.M. Diversity in the serine recombinases. Mol. Microbiol. 44, 299–307 (2002).
Smith, M.C., Brown, W.R., McEwan, A.R. & Rowley, P.A. Site-specific recombination by phiC31 integrase and other large serine recombinases. Biochem. Soc. Trans. 38, 388–394 (2010).
Marchler-Bauer, A. & Bryant, S.H. CD-Search: protein domain annotations on the fly. Nucleic Acids Res. 32, W327–W331 (2004).
Li, W. et al. Structure of a synaptic γδ resolvase tetramer covalently linked to two cleaved DNAs. Science 309, 1210–1215 (2005).
Rutherford, K., Yuan, P., Perry, K., Sharp, R. & Van Duyne, G.D. Attachment site recognition and regulation of directionality by the serine integrases. Nucleic Acids Res. 41, 8341–8356 (2013).
Zhou, Y., Liang, Y., Lynch, K.H., Dennis, J.J. & Wishart, D.S. PHAST: a fast phage search tool. Nucleic Acids Res. 39, W347–W352 (2011).
Canchaya, C. et al. Genome analysis of an inducible prophage and prophage remnants integrated in the Streptococcus pyogenes strain SF370. Virology 302, 245–258 (2002).
Brenciani, A. et al. Phim46.1, the main Streptococcus pyogenes element carrying mef(A) and tet(O) genes. Antimicrob. Agents Chemother. 54, 221–229 (2010).
Deuschle, U., Kammerer, W., Gentz, R. & Bujard, H. Promoters of Escherichia coli: a hierarchy of in vivo strength indicates alternate structures. EMBO J. 5, 2987–2994 (1986).
Lou, C., Stanton, B., Chen, Y.J., Munsky, B. & Voigt, C.A. Ribozyme-based insulator parts buffer synthetic circuits from genetic context. Nat. Biotechnol. 30, 1137–1142 (2012).
Salis, H.M., Mirsky, E.A. & Voigt, C.A. Automated design of synthetic ribosome binding sites to control protein expression. Nat. Biotechnol. 27, 946–950 (2009).
Farasat, I. et al. Efficient search, mapping, and optimization of multi-protein genetic systems in diverse bacteria. Mol. Syst. Biol. 10, 731 (2014).
Klemm, P. Two regulatory fim genes, fimB and fimE, control the phase variation of type 1 fimbriae in Escherichia coli. EMBO J. 5, 1389–1393 (1986).
Xie, Y., Yao, Y., Kolisnychenko, V., Teng, C.H. & Kim, K.S. HbiF regulates type 1 fimbriation independently of FimB and FimE. Infect. Immun. 74, 4039–4047 (2006).
Clarke, E.J. & Voigt, C.A. Characterization of combinatorial patterns generated by multiple two-component sensors in E. coli that respond to many stimuli. Biotechnol. Bioeng. 108, 666–675 (2011).
Stanton, B.C. et al. Genomic mining of prokaryotic repressors for orthogonal logic gates. Nat. Chem. Biol. 10, 99–105 (2014).
Hooshangi, S., Thiberge, S. & Weiss, R. Ultrasensitivity and noise propagation in a synthetic transcriptional cascade. Proc. Natl. Acad. Sci. USA 102, 3581–3586 (2005).
Moon, T.S., Lou, C., Tamsir, A., Stanton, B.C. & Voigt, C.A. Genetic programs constructed from layered logic gates in single cells. Nature 491, 249–253 (2012).
Tamsir, A., Tabor, J.J. & Voigt, C.A. Robust multicellular computing using genetically encoded NOR gates and chemical ′wires′. Nature 469, 212–215 (2011).
Costello, E.K., Stagaman, K., Dethlefsen, L., Bohannan, B.J. & Relman, D.A. The application of ecological theory toward an understanding of the human microbiome. Science 336, 1255–1262 (2012).
Rhodius, V.A. et al. Design of orthogonal genetic switches based on a crosstalk map of σs, anti-σs, and promoters. Mol. Syst. Biol. 9, 702 (2013).
Khalil, A.S. et al. A synthetic biology framework for programming eukaryotic transcription functions. Cell 150, 647–658 (2012).
Durfee, T. et al. The complete genome sequence of Escherichia coli DH10B: insights into the biology of a laboratory workhorse. J. Bacteriol. 190, 2597–2606 (2008).
Cormack, B.P., Valdivia, R.H. & Falkow, S. FACS-optimized mutants of the green fluorescent protein (GFP). Gene 173, 33–38 (1996).
Campbell, R.E. et al. A monomeric red fluorescent protein. Proc. Natl. Acad. Sci. USA 99, 7877–7882 (2002).
Gibson, D.G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345 (2009).
Stothard, P. The sequence manipulation suite: JavaScript programs for analyzing and formatting protein and DNA sequences. Biotechniques 28, 1104 (2000).
Engler, C., Gruetzner, R., Kandzia, R. & Marillonnet, S. Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS ONE 4, e5553 (2009).
Acknowledgements
C.A.V., T.K.L. and L.Y. are supported by the US Defense Advanced Research Projects Agency (DARPA CLIO N66001-12-C-4016). C.A.V. and A.A.K.N. are supported by DARPA CLIO N66001-12-C-4018. C.A.V., M.T.L., A.A.K.N. and J.F.-R. are supported by the Office of Naval Research Multidisciplinary University Research Initiative (N00014-13-1-0074; Boston University MURI award 4500000552). C.A.V. is also supported by US National Institutes of Health (GM095765), the US National Institute of General Medical Sciences (P50 GMO98792) and the US National Science Foundation Synthetic Biology Engineering Research Center (SynBERC EEC0540879). A.A.K.N. receives government support FA9550-11-C-0028 and is supported by the National Defense Science and Engineering Graduate Fellowship 32 CFR 168a from the US Department of Defense Air Force Office of Scientific Research.
Author information
Authors and Affiliations
Contributions
C.A.V. and L.Y. conceived of the study and designed the experiments. L.Y., A.A.K.N., C.J.M. and J.F.-R. performed the experiments and analyzed the data. C.A.V., L.Y., A.A.K.N. and C.J.M. wrote the manuscript. C.A.V., T.K.L. and M.T.L. managed the project.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Integrated supplementary information
Supplementary Figure 1 The alignment used to identify the core sequence from a conserved protein split by a prophage.
The yellow line indicates the prophage genome and blue lines indicate the regions in bacterial genome that flank the prophage.
Supplementary Figure 2 The phylogenetic tree for the complete set of 34 integrases.
The Genbank ID of the integrases and the attB/P sequences are provided in Supplementary Table 1. The < symbols mark the 13 integrases used to build memory switches.
Supplementary Figure 3 Additional data corresponding to the memory switch characterization.
The data corresponds to the switches presented in Figure 2. (A) The geometric average of GFP fluorescence measured at the uninduced state (0mM arabinose, 0.5% glucose, blank, –) and induced (1mM arabinose, grey, +) states is shown. (B) The fractions of the GFP ON cell population at uninduced (blank, -) and induced (grey, +) state are listed. Asterisks indicate values lower than 0.5%. The ON population is defined as having fluorescence values between 102 and 105 au. The average and standard deviation were calculated from three independent experiments performed on different days. (C) PCR bands amplified from cell cultures before (–) and after (+) arabinose induction using the primer 1 and 2 shown in part a. The expected band size is 0.8 kb. (D) The fraction of cells that are ON is shown versus time. Cells are induced at t = 0. The average and standard deviation is shown for three independent experiments.
Supplementary Figure 4 The attP site for Int13 has constitutive promoter activity.
Prior to flipping, GFP is expressed (grey line). By swapping the attP and attB locations, the expression is eliminated (black line). This data is for E. coli DH10b containing only the reporter plasmid (no integrase plasmid) in LB media supplemented with 34 mg/ml Cm grown for 16 hours. The dashed line marks the threshold of 102 used to differentiate between OFF and ON cells. The figure represents three independent replicates.
Supplementary Figure 5 Cytometry data corresponding to the orthogonality matrix.
Cytometry data corresponding to the orthogonality matrix in Figure 2e is shown. Cognate interactions between recombinases and recognition sites are shown on the diagonal in black. Off-target combinations are shown in grey. The dashed line at 102 a.u. shows the threshold used to differentiate on and off cells. This figure represents three experiments conducted on different days.
Supplementary Figure 6 Orthogonality matrix shown in GFP fluorescence
Orthogonality matrix of recombinases and their recognition sites shown in GFP fluorescence corresponding to Figure 2e is demonstrated. The data represent the average of three biological replicates conducted in different days. The average values and standard deviations are shown in Supplementary Table 6.
Supplementary Figure 7 The function and sensitivity of the memory array.
At the induced state [2mM arabinose] the memory array was able to genetically record the expression of each recombines independently (same as Figure 3b diagonal bands). At the uninduced state (0mM arabinose, 0.4% glucose) there is negligible background switching.
Supplementary Figure 8 Fluorescence histograms corresponding to the [1,1] input state for the genetic circuit time courses in Figure 3d.
Inducers were added directly after the 0 hr timepoint, and removed after the 7 h timepoint. (A) Population fluorescence timecourse for the transcriptional AND gate driving YFP, without integrase. (B) Population fluorescence timecourse for the transcriptional AND gate driving Int2, causing the GFP gene to be flipped and expressed.
Supplementary Figure 9 The two-integrase cascade.
(A) A schematic of the recombinase cascade is shown. The plasmid map is shown in Supplementary Figure 13 (pCas_5+7_gfp and pInt5). (B) The cells were uninduced (white) or induced with 1mM arabinose for 8 h (grey). The dashed line at 102 a.u. shows the threshold used to differentiate ON and OFF cells.
Supplementary Figure 10 The impact of single- or multiple-integrase on cell growth.
Cells containing genetic circuits of single (Int2, Int5, Int7 or Int8) or multiple (Int2/5, Int2/5/7 or Int 2/5/7/8) were compared at different concentrations of arabinose. The cells were incubated in at 37 °C for 12 hours with various concentrations of arabinose. The final OD (600nm) decreased slightly in cells containing single integrases at high concentration of arabinose for cells expressing single integrases. However we did not observe additive effects when multiple integrases are incorporated, suggesting that using multiple integrases in parallel is not constrained by growth defect. The average and standard deviations of three independent experiments is shown. Yellow, pInt2; Red, Int5; Blue, Int7; Green, Int8; White, Cis_2+5+7+8; Light grey, Cas_2+5; Dark grey, Cas_2+5+7.
Supplementary Figure 11 Plasmids used to construct AND gate.
The plasmids pAND_Int2 and pAND_reporter were used to conduct memory logic. The plasmid pAND-yfp and pSpec (containing PTet-mRFP1 which was unused in these experiments) were used to generate an AND gate without memory (output YFP). Genetic parts are labeled above or below construct. Promoters are indicated with a bent arrow, genes with a thick arrow, insulators with a circle and a dashed line, RBSs with a solid half-circle and terminators with a T.
Supplementary Figure 12 Plasmids used to construct cascade of integrases.
The 2-integrase cascade of Supplementary Figure 9 was constructed using pInt5 and pCas_5+7_gfp. The 3-integrase cascade of Figure 3e and f was built with pCas_2+5 and pCas_5+7_gfp. Genetic parts are labeled above or below construct. Promoters are indicated with a bent arrow, genes with a thick arrow, insulators with a circle and a dashed line, RBSs with a solid half-circle and terminators with a T.
Supplementary Figure 13 Plasmids used to write 2, 3 or 4 bits of information on the memory array.
pCis_2+5, pCis_7+8, pCis_7+8+10 and pCis_2+7+8+5 were used to write 2, 2, 3, and 4 bits information on the memory array, respectively (Fig. 3g). These plasmids and the memory array plasmid (Fig. 3a) were used for Figure 3g. Genetic parts are labeled above or below construct. Promoters are indicated with a bent arrow, genes with a thick arrow, insulators with a circle and a dashed line, RBSs with a solid half-circle and terminators with a T.
Supplementary information
Supplementary Text and Figures
Supplementary Figures 1–13, Supplementary Tables 1–9 and Supplementary Notes 1–3 (PDF 2229 kb)
Rights and permissions
About this article
Cite this article
Yang, L., Nielsen, A., Fernandez-Rodriguez, J. et al. Permanent genetic memory with >1-byte capacity. Nat Methods 11, 1261–1266 (2014). https://doi.org/10.1038/nmeth.3147
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/nmeth.3147
This article is cited by
-
Engineering intelligent chassis cells via recombinase-based MEMORY circuits
Nature Communications (2024)
-
Addressable and adaptable intercellular communication via DNA messaging
Nature Communications (2023)
-
Next generation synthetic memory via intercepting recombinase function
Nature Communications (2023)
-
Customizing cellular signal processing by synthetic multi-level regulatory circuits
Nature Communications (2023)
-
Temporally resolved transcriptional recording in E. coli DNA using a Retro-Cascorder
Nature Protocols (2023)