Synthetic biology is focused on the rational construction of biological systems based on engineering principles. During the field's first decade of development, significant progress has been made in designing biological parts and assembling them into genetic circuits to achieve basic functionalities. These circuits have been used to construct proof-of-principle systems with promising results in industrial and medical applications. However, advances in synthetic biology have been limited by a lack of interoperable parts, techniques for dynamically probing biological systems and frameworks for the reliable construction and operation of complex, higher-order networks. As these challenges are addressed, synthetic biologists will be able to construct useful next-generation synthetic gene networks with real-world applications in medicine, biotechnology, bioremediation and bioenergy.
Ten years since the introduction of the field's inaugural devices—the genetic toggle switch (J.J.C. and colleagues)1 and repressilator2—synthetic biologists have successfully engineered a wide range of functionality into artificial gene circuits, creating switches1,3,4,5,6,7,8,9, oscillators2,10,11,12, digital logic evaluators13,14, filters15,16,17, sensors18,19,20 and cell-cell communicators15,19. Some of these engineered gene networks have been applied to perform useful tasks such as population control21, decision making for whole-cell biosensors19, genetic timing for fermentation processes (J.J.C. and colleagues)22 and image processing23,24,25. Synthetic biologists have even begun to address important medical and industrial problems with engineered organisms, such as bacteria that invade cancer cells26, bacteriophages with enhanced abilities to treat infectious diseases (T.K.L. and J.J.C.)27,28, and yeast with synthetic microbial pathways that enable the production of antimalarial drug precursors29. However, in most application-driven cases, engineered organisms contain only simple gene circuits that do not fully exploit the potential of synthetic biology. There remains a fundamental disconnect between low-level genetic circuitry and the promise of assembling these circuits into more complex gene networks that exhibit robust, predictable behaviors.
Thus, despite all of its successes, many more challenges remain in advancing synthetic biology to the realm of higher-order networks with programmable functionality and real-world applicability. Here, instead of reviewing the progress that has been made in synthetic biology, we present challenges and goals for next-generation synthetic gene networks, and describe some of the more compelling circuits to be developed and application areas to be considered.
Synthetic gene networks: what have we learned and what do we need?
The engineering of mechanical, electrical and chemical systems is enabled by well-established frameworks for handling complexity, reliable means of probing and manipulating system states and the use of testing platforms—tools that are largely lacking in the engineering of biology. Developing properly functioning biological circuits can involve complicated protocols for DNA construction, rudimentary model-guided and rational design, and repeated rounds of trial and error followed by fine-tuning. Limitations in characterizing kinetic processes and interactions between synthetic components and other unknown constituents in vivo make troubleshooting and modeling frustrating and prohibitively time consuming. As a result, the design cycle for engineering synthetic gene networks remains slow and error prone.
Fortunately, advances are being made in streamlining the physical construction of artificial biological systems, in the form of resources and methods for building larger engineered DNA systems from smaller defined parts22,30,31,32. Additionally, large-scale DNA sequencing and synthesis technologies are gradually enabling researchers to directly program whole genes, genetic circuits and even genomes, as well as to re-encode DNA sequences with optimal codons and minimal restriction sites (see review33).
Despite these advances in molecular construction, the task of building synthetic gene networks that function as desired remains extremely challenging. Accelerated, large-scale diversification34 and the use of characterized component libraries in conjunction with in silico models for a priori design22 are proving useful in helping to fine-tune network performance toward desired outputs. Even so, in general, synthetic biologists are often fundamentally limited by a dearth of interoperable and modular biological parts, predictive computational modeling capabilities, reliable means of characterizing information flow through engineered gene networks and test platforms for rapidly designing and constructing synthetic circuits.
In the following subsections, we discuss four important research efforts that will improve and accelerate the design cycle for next-generation synthetic gene networks: first, advancing and expanding the toolkit of available parts and modules; second, modeling and fine-tuning the behavior of synthetic circuits; third, developing probes for reliably quantifying state values for synthetic (and natural) biomolecular systems; and fourth, creating test platforms for characterizing component interactions within engineered gene networks, designing gene circuits with increasing complexity and developing complex circuits for use in higher organisms. These advances will allow synthetic biologists to realize higher-order networks with desired functionalities for satisfying real-world applications.
Interoperable parts and modules for synthetic gene networks. Although there has been no shortage of novel circuit topologies to construct, limitations in the number of interoperable and well-characterized parts have constrained the development of more complex biological systems22,31,35,36. The situation is complicated by the fact that many potential interactions between biological parts, which are derived from a variety of sources within different cellular backgrounds, are not well understood or characterized. As a result, the majority of synthetic circuits are still constructed ad hoc from a small number of commonly used components (e.g., LacI, TetR and lambda repressor proteins and regulated promoters) with a significant amount of trial and error. There is a pressing need to expand the synthetic biology toolkit of available parts and modules. Because physical interconnections cannot be made in biological systems to the same extent as electrical and mechanical systems, interoperability must be derived from chemical specificity between parts and their desired targets. This limits our ability to construct truly modular parts and highlights the need for rigorous characterization of component interactions so that detrimental interactions can be minimized and factored into computational models.
Engineered zinc fingers constitute a flexible system for targeting specific DNA sequences, one which could significantly expand the available synthetic biology toolkit for performing targeted recombination, controlling transcriptional activity and making circuit interconnections. Zinc-finger technology has primarily been used to design zinc-finger nucleases that generate targeted double-strand breaks for genomic modifications37. These engineered nucleases may be used to enhance recombination in large-scale genome engineering techniques34. A second and potentially very promising use of engineered zinc fingers is as a source of interoperable transcription factors, which would greatly expand the current and limited repertoire of useful activators and repressors. In fact, zinc fingers have already been harnessed to create artificial transcription factors by fusing zinc-finger proteins with activation or repression domains38,39. Libraries of externally controllable transcriptional activators or repressors could be created by engineering protein or RNA ligand-responsive regulators, which control the transcription or translation of zinc finger–based artificial transcription factors themselves18. These libraries would enable the construction of basic circuits, such as genetic switches1, as well as more complex gene networks. In fact, several of the higher-order networks we describe below rely on having multiple reliable and interoperable transcriptional activators and repressors for proper functioning.
Even so, these engineered transcription factors have not yet been fully characterized, and if they are to be used as building blocks for complex gene networks, then knowledge of their in vivo kinetics and input-output transfer functions would be beneficial. In addition, much of the rich dynamics associated with small, synthetic gene networks is attributable to the cooperative binding or multimerization of transcription factors, and it is not yet clear what further engineering is required to endow zinc-finger transcription factors with such features.
Nucleic acid–based parts, such as RNAs, are also promising candidates for libraries of interoperable parts because they can be rationally programmed based on sequence specificity7,40,41. Novel circuit interconnections could be established using small interfering RNAs (siRNAs) to control the expression of specific components. Recombinases, which target specific DNA recombinase-recognition sites, also represent a fruitful, underutilized source of interoperable parts. Recombinases have been used in the context of synthetic biology to create memory elements and genetic counters9. However, more than 100 natural recombinases are known, and these can be engineered by mutagenesis and directed evolution for greater diversity and sequence specificity42,43,44,45.
Libraries of well-characterized, interoperable parts, such as transcription factors and recombinases, would vastly enhance the ability of synthetic biologists to build more complex gene networks with greater reliability and real-world applicability. In addition to libraries of individual parts, it would be of great value to have well-characterized and interoperable modules (e.g., switches, oscillators and interfaces) that could be used in a plug-and-play fashion to create higher-order networks and programmable cells. As the number of parts and modules expands, high-throughput, combinatorial efforts for quantifying the levels of interference and cross-talk between multiple components within cells will be increasingly important as guides for choosing the most appropriate components for network assembly.
Modeling and fine-tuning synthetic gene networks. Integrated efforts for modeling and fine-tuning synthetic gene circuits are useful for ensuring that assembled networks operate as intended. Such approaches will be increasingly important as more complex circuits are constructed along with the expanded development of interoperable parts. Although studies have shown that in some cases, component properties alone are sufficient for predicting network behavior22,31,46, others have demonstrated the need for modeling and fine-tuning networks after their basic topologies have been established1,22. A multi-step design cycle that involves creating diverse component libraries, constructing, characterizing and modeling representative network topologies, and assembling and fine-tuning desired circuits, followed by subsequent refinement cycles22, will be crucial for the successful design and construction of next-generation synthetic gene networks.
The fine-tuning of biomolecular parts and networks can be achieved by developing diverse component libraries through mutagenesis followed by in-depth characterization and modeling22,47,48,49,50,51. Significant progress has been made in tuning gene expression by altering transcriptional, translational and degradation activities. For example, promoter libraries with a range of transcriptional activities can be created and characterized, plugged into in silico models and then used to develop synthetic gene networks with defined outputs, without significant post-hoc adjustments22,47,48,49,50,51. Alternatively, synthetic ribosome binding site (RBS) sequences can be used to optimize protein expression levels. Recently, Salis et al.52 have developed a thermodynamic model for predicting the relative translational initiation rates for a protein with different upstream RBS sequences, a model that can also be used to rationally forward-engineer RBS sequences to give desired protein expression. In addition, protein degradation can be controlled by tagging proteins with degradation-targeting peptides that impart different degradation dynamics53.
By automating the construction and characterization of biomolecular components, extensive libraries could be created for the rapid design and construction of complex gene networks. These efforts, coupled with in silico modeling, would serve to fast-track synthetic biology (more detailed discussions of modeling techniques for synthetic biology are found in refs. 22,31,54,55,56,57). However, to build reliable models of biomolecular parts and networks, new methods for probing and acquiring detailed in vitro and in vivo measurements are needed, which we discuss below.
Probes for characterizing synthetic gene networks. Significant advances have been made in the development of new technologies for manipulating biological systems and probing their internal states. At the single-molecule level, for instance, optical tweezers and atomic force microscopes provide new, direct ways to probe the biophysical states of single DNA, RNA and protein molecules as they undergo conformational changes and other dynamical processes58,59,60,61,62. However, we lack similar tools for tracking the in vivo operation of synthetic gene circuits in a high-throughput fashion. Ideally, making dynamical measurements of biological networks would involve placing sensors at multiple internal nodes, akin to how current and voltage are measured in electrical systems. Furthermore, external manipulation of synthetic biomolecular systems is typically accomplished by the addition of chemical inducers, which can suffer from cross-talk63, be difficult to remove and be consumed over time. As a result, inputs are often troublesome to control dynamically.
Microfluidic devices have been coupled to single-cell microscopy and image processing techniques to enable increasingly precise manipulation and measurement of cells, especially since inputs can be modulated over time64,65. These systems allow the rapid addition and removal of chemical inducers, enabling more sophisticated, time-dependent inputs than conventional step functions, while also enabling researchers to track and quantify single cells for long periods of time. These developments make possible the wider use of well-established engineering approaches for analyzing circuits and other systems in synthetic biology. For example, frequency-domain analysis, a technique used commonly in electrical engineering66,67, can be employed with microfluidics to characterize the transfer functions and noise behaviors of synthetic biological circuits66,67,68. Additionally, small-signal linearization of nonlinear gene circuits can be achieved by applying oscillatory perturbations with microfluidics and measuring responses at the single-cell level67,68.
Indeed, microfluidics provides a useful platform for perturbing synthetic gene circuits with well-controlled inputs and observing the outputs in high-resolution fashion. Without the proper 'sensors' (that is, for quantitatively and simultaneously probing all the internal nodes of a given gene circuit), however, this technology alone is not sufficient to bring full, engineering-like characterization to synthetic gene networks.
Thus far, probes enabling quantitative measurements of synthetic gene circuits have primarily focused on the use of fluorescent proteins for in vivo quantification of promoter activity or protein expression. With the advent of novel mass spectrometry–based methods that provide global, absolute protein concentrations in cells69, quantitative transcriptome data can now be merged with proteome data, improving our ability to characterize and model the dynamics of synthetic gene networks. Global proteomic data may also assist synthetic biologists in understanding the metabolic burden that artificial circuits place on host cells. Further efforts to devise fluorescent-based and other types of reporters for the simultaneous monitoring of transcriptome and proteome dynamics in vivo are needed to close the loop on full-circuit accounting. Some promising tools under development include tracking protein function by incorporating unnatural amino acids that exhibit fluorescence70,71, quantum dots72 and radiofrequency-controlled nanoparticles73.
As the field awaits entire-circuit probes, there are, in the meantime, several potentially accessible technologies for increasing the throughput and pace of piecewise gene-circuit characterization. Recent advances in engineering light-inducible biological parts and systems23,24,74 have unlocked the potential for optical-based circuit characterization, expanding the number and type of tunable knobs available to synthetic biologists. For instance, by coupling a synthetic gene network of interest to a biological light/dark sensor as well as to fluorescent protein outputs, one could potentially measure the network's input/output transfer function in a high-throughput fashion using spectrophotometric microplate readers, without having to add varying concentrations of chemical inducers. In essence, both control and monitoring of biomolecular systems would be accomplished using reliable and high-speed optics that are typically associated with fluorescence readouts and microscopy. This is an exciting prospect, particularly in the context of microfluidic devices, which would facilitate the focusing of optical inputs and readouts to single cells.
Using electrical signals, in lieu of chemical or optical signals, for control and monitoring of biological systems would also present high-speed advantages. Recently, advances have been made in integrating silicon electronics with lipid bilayers containing transmembrane pores to perform electronic signal conduction75. This technology may eventually allow direct communication and control between engineered cells and electronic circuits by means of ionic flow. The incorporation of these and other technologies to perturb and monitor the in vivo performance of synthetic gene networks will enable us to achieve desired functionality faster and more reliably.
Test platforms for engineering complex gene circuits. Increasing complexity—whether assembling larger synthetic gene networks from smaller ones or engineering circuits into higher organisms—dramatically increases the number of potential failure modes. In the former case, combining multiple individually functioning genetic circuits into a single cellular background can lead to unintended interactions among the synthetic components or with host factors, and these various failure modes are often difficult to pinpoint and isolate from one another. In the latter case, engineering synthetic networks for mammalian systems poses additional challenges beyond engineering circuits for bacterial and yeast strains, which have comparatively well-characterized genomes, transcriptomes, proteomes and metabolomes. Mammalian systems are much more complex and possess substantially less well-characterized components for engineering76, but for these and other reasons, constitute fertile ground for new applications and genetic parts.
The development of test platforms where engineered gene circuits can be designed and validated before being deployed in other or more complex cellular backgrounds would mitigate failure-prone jumps in complexity. These platforms could be used to verify or debug circuit topology and basic functionality in well-controlled environments. For example, cells optimized for testing may be engineered to have minimal genomes to decrease the risk of pleiotropic or uncharacterized interactions between the host and the synthetic networks77,78,79,80,81. The use of orthogonal parts that are decoupled from host cells may enable the dedication of defined cellular resources to engineered functions, which can simplify the construction and troubleshooting of gene circuits. For example, nucleic acid–based parts can be designed to function orthogonally to the wild-type cellular machinery82,83,84. Artificial codons and unnatural amino acids, which have enabled new methods for studying existing proteins and the realization of proteins with novel functions, could also be used to produce synthetic circuits that function orthogonally to host cells85. Simplifying backgrounds would additionally enable more accurate computational modeling of complex circuits before they are deployed into their ultimate environments. Furthermore, minimal cells could themselves contain synthetic circuits that provide useful testing functionalities, such as multiplexed transcriptional and translational controls and output probes.
Lower organisms can also be useful for the construction and characterization of synthetic gene networks before such systems are extended and deployed into higher organisms. In fact, several synthetic circuits, such as clocks and switches, were initially developed in bacteria and later translated into mammalian counterparts using analogous design principles3,7,12. Additionally, lower-organism test platforms could be endowed with certain features of interest from desired higher-organism hosts. For example, RNA interference–based circuits could be built first in Saccharomyces cerevisiae before being used in mammalian cells86. In one case, mitochondrial DNA was engineered into Escherichia coli before retransplantation into mammalian hosts87. Other biomolecular systems and components that are ripe for engineering in lower organisms include chromatin, ubiquitins and proteosomes.
The introduction of synthetic gene networks into higher organisms also runs the risk of compromising natural networks, which have evolved to maintain cellular robustness. Accordingly, methods for simplifying organisms for designing and testing synthetic circuits could be extended to engineer final deployment hosts, making them more conducive to synthetic gene circuits. Ultimately, in vivo directed evolutionary methods, based on repeated rounds of mutagenesis and selection within final cellular backgrounds, could be used to identify the optimal performance conditions of synthetic gene networks after their basic functionalities have been validated in earlier test platforms34.
Next-generation gene networks
Advancing synthetic gene circuits into the realm of higher-order networks with programmable functionality is one of the ultimate goals of synthetic biology. Useful next-generation gene networks should attempt to satisfy at least one of the following criteria: first, yield insights into the principles that guide the operation of natural biological systems; second, highlight design principles and/or provide modules that can be applied to the construction of other useful synthetic circuits; third, advance the tools available for novel scientific experiments; and fourth, enable real-world applications in medicine, industry and/or agriculture. Below, we describe several next-generation gene circuits and discuss their potential utility in the context of the above criteria.
Tunable filters and noise generators. Fine-tuning the performance of a synthetic gene network typically means reengineering its components, be it by replacing or mutating its parts. Networks whose responses can be tuned without the reengineering of its parts, such as the biological version of a tunable electronic filter, would enable more sophisticated cellular-based signal processing. Synthetic transcriptional cascades can exhibit low-pass filter characteristics16, and artificial gene circuits with negative autoregulation are capable of pushing the noise spectra of their outputs to higher frequencies, where it can be filtered by the low-pass characteristics of a downstream gene cascade88. Tunable genetic filters with respect to time could be implemented by tuning RNA and/or protein degradation in autoregulated negative-feedback circuits66,89,90,91 (Fig. 1). Such circuits would be useful in studying and shaping noise spectra to optimize the performance of artificial gene networks.
Recently, an externally tunable, bacterial bandpass-filter has been described17 that uses low-pass and high-pass filters in series to derive bandpass activity with respect to enzymes and inducer molecules. These types of filters, when coupled to quorum-sensing modules, can be used for spatial patterning applications15,17. They could also be readily extended to complex multicellular pattern formation by engineering a suite of different cells, each carrying filters that respond to different inputs. Synthetic gene circuits based on tunable filters may also make useful platforms for studying cellular differentiation and development, as artificial pattern generation is a model for how natural systems form complex structures15,17.
Along similar lines, recent developments in stem cell biology have unlocked important potential roles for synthetic gene networks92. For example, it has been shown that stochastic fluctuations in protein expression in embryonic stem cells are important for determining differentiation fates93. Indeed, stochasticity might be harnessed in differentiation to force population-wide heterogeneity and provide system robustness, though it may also be detrimental if it causes uncontrollable differentiation.
The effects of stochasticity in stem cell differentiation could be studied with synthetic gene circuits that act as tunable noise generators. Lu et al., for instance, considered two such designs for modulating the noise profile of an output protein94. This showed that the mean value and variance of the output can be effectively tuned with two external signals, one for regulating transcription and the other for regulating translation, and to a greater extent with three external signals, the third for regulating DNA copy number94. By varying noise levels while keeping mean expression levels constant, the thresholds at which gene expression noise yields beneficial versus detrimental effects on stem cell differentiation can be elucidated (J.J.C. and colleagues)95.
Furthermore, the discovery of induced pluripotent stem cells (iPSCs), based on the controlled expression of four transcription factors (OCT4, SOX2, KLF2 and MYC) in adult fibroblasts, has created a source of patient-specific progenitor cells for engineering92. Genetic noise generators and basic control circuits could be used to dissect the mechanism for inducing pluripotency in differentiated adult cells by controlling the expression levels of the four iPSC-dependent transcription factors. Ultimately, these efforts could lead to the development of timing circuits22 for higher-efficiency stem cell reprogramming.
Lineage commitment to trophectoderm, ectoderm, mesoderm and endoderm pathways are controlled by distinct sets of genes93, and many interacting factors, including growth factors, extracellular matrices and mechanical forces, play important roles in cellular differentiation96. As differentiation pathways become better understood, synthetic gene cascades may be used to program cellular commitment with increased fidelity for applications in biotechnology and regenerative medicine.
Analog-to-digital and digital-to-analog converters. Electrical engineers have used digital processing to achieve reliability and flexibility, even though the world in which digital circuits operate is inherently analog. Although synthetic biological circuits are unlikely to match the computing power of digital electronics, simple circuits inspired by digital and analog electronics may significantly increase the reliability and programmability of biological behaviors.
For example, biological analog-to-digital converters could translate external analog inputs, such as inducer concentrations or exposure times, into internal digital representations for biological processing. Consider, for instance, a bank of genetic switches with adjustable thresholds (Fig. 2a). These switches could be made out of libraries of artificial transcription factors, as described above. This design would perform discretization of analog inputs into levels of digital output. Depending on the level of analog inputs, different genetic pathways could be activated. Cells possessing analog-to-digital converters would be useful as biosensors in medical and environmental settings. For example, whole-cell biosensors19, resident in the gut, may be engineered to generate different reporter molecules that could be measured in stool depending on the detected level of gastrointestinal bleeding. Expressing different reporter molecules rather than a continuous gradient of a single reporter molecule would yield more reliable and easily interpretable outputs.
Digital-to-analog converters, on the other hand, would translate digital representations back into analog outputs (Fig. 2b); such systems could be used to reliably set internal system states. For example, instead of fine-tuning transcriptional activity with varying amounts of chemical inducers, a digital-to-analog converter, composed of a bank of genetic switches, each of which is sensitive to a different inducer, might provide better control. If each activated switch enabled transcription from promoters of varying strengths (Poutput,3 > Poutput,2 > Poutput,1), then digital combinations of inducers could be used to program defined levels of transcriptional activities (Fig. 2b). Such a circuit might be useful in biotechnology applications, where reliable expression of different pathways is needed for programming different modes of operation in engineered cells. In addition, digital-to-analog converters may be useful in providing a multiplexed method for probing synthetic circuits. For example, because each analog level is associated with a distinct digital state, a single analog output can allow one to infer the internal digital state of a synthetic gene network (Fig. 2b).
Adaptive learning networks. Synthetic gene networks that can learn or adapt to exogenous conditions could provide insight into natural networks and be useful for applications where adaptation to external stimuli may be advantageous, such as autonomous whole-cell biosensors97,98. Endogenous biomolecular networks in bacteria can exhibit anticipatory behavior for related perturbations in environmental stimuli99,100. This type of behavior and the associated underlying design principles could, in principle, be harnessed to endow transcriptional networks with the ability to learn97, much like synaptic interconnections between neurons. A basic design that would enable this functionality involves two transcriptional activators (Activator A and Activator B), each of which is expressed in the presence of a different stimulus (Fig. 3a). Suppose that both transcriptional activators drive the expression of effector proteins (Effector A and Effector B), which control distinct genetic pathways. When both transcriptional factors are active, indicating the simultaneous presence of the two stimuli, a toggle switch is flipped ON. This creates an associative memory. Subsequently, if either of the transcription factors is activated, AND logic between the ON toggle switch and one transcriptional activator produces the effector protein that controls the pathways of the other activators. On the basis of this design, cells could be programmed to associate simultaneous inputs and exhibit anticipatory behavior by activating the pathways of associated stimuli, even in the presence of only one of the stimuli.
In another example of a learning network, one could design bacteria that could be taught 'winner-take-all' behavior in detecting stimuli, similar to cortical neural processing101. In this example, bacteria could be exposed to different types of chemical stimuli (Inducers A–C; Fig. 3b). An exogenously added inducer (Inducer 'Learn') acts as a trigger for learning and serves as one input into multiple, independent transcriptional AND gates, which possess secondary inputs for detecting the presence of each of the different chemical stimuli. Each gate drives an individual toggle switch that, when flipped, suppresses the flipping of the other switches. This creates a winner-take-all system in which the presence of the most abundant chemical stimuli is recorded. Furthermore, the toggle switch outputs could be fed as inputs into transcriptional AND gates, which once again possess secondary inputs for detecting the presence of the different stimuli. If these gates drive different fluorescent reporters when activated, then the overall system will associate only a single type of stimuli with the learning trigger and respond with an output only in the presence of the single type of stimuli in the future. This system could potentially be adapted to create chemotactic bacteria that 'remember' a particular location or landmark and only respond to the gradient of one chemoattractant.
In more complicated instances of learning networks, it is conceivable that synthetic gene circuits could be designed to adapt on their own, that is, without external mutagenesis or exogenous nucleic acids. For example, transcription-based interconnections could be dynamically reconfigured based on the expression of DNA recombinases9. Another design could involve error-prone RNA polymerases, which create mutant RNAs that could be reverse-transcribed and joined back into the genome based on double-stranded breaks created by zinc-finger nucleases. Specificity for where the mutations would occur could be achieved by using promoters that are uniquely read by the error-prone RNA polymerases, such as T7 promoters with a T7 error-prone RNA polymerase, and zinc-finger nucleases that define where homologous recombination can occur102. In this design, enhanced mutagenesis frequencies could be targeted to specific regions of the genome.
Protein-based computational circuits. Beyond DNA- and RNA-based circuits, protein-based synthetic systems have the potential to enable flexible and fast computation through post-translational mechanisms103,104,105. Protein-based circuits are advantageous in that they can be designed to target synthetic activities to subcellular locations24. In this way, different sites within the same cell could have different protein circuit states rather than relying solely on shared cellular promoter states, thereby enabling researchers to explore the functional dynamics and consequences of cellular localization. Protein-based designs can also operate on much shorter time scales than genetic circuits because their operation is independent of the transcription and translation machinery106. Accordingly, it would be exciting to develop protein-based circuits that can act as rapidly responding logic gates, smart sensors or memory elements.
With regards to this last application, synthetic amyloids could serve as novel components for epigenetic memory circuits. By fusing a yeast prion determinant from Sup35 to the rat glucocorticoid receptor, a transcription factor regulated by steroid hormone, Li and Lindquist107 demonstrated that the state of transcriptional activity from the fused protein could be affected and inherited stably in an epigenetic fashion. Given the increasing number of identified prionogenic proteins108, there is an opportunity to create amyloid-based memory systems that transmit functionality from one generation to the next (Fig. 4). In these systems, aggregation could be induced by the transient expression of the prionogenic domain (PD), whereas disaggregation could be achieved by expressing protein remodeling factors, such as chaperones (heat shock protein 104). Though this system relies on the transcription and translation of prionogenic and disaggregating factors, it may enable the control of protein effectors that can operate on shorter time scales. For example, enzymes fused to a prionogenic domain may exhibit different activity levels depending on whether they are attached to an amyloid core.
Because genetic circuits and proteins function on different time scales, it would also be worthwhile to develop synthetic networks that couple both modalities. For example, the output of protein-based computation could be stored in recombinase-based memory elements5,6,9. It would also be conceivable to couple the two types of networks to harness their varied filtering capabilities. For example, the mitogen-activated protein kinase cascade contains both positive-feedback and negative-feedback loops that enable rapid activation followed by deactivation109, thus acting like a high-pass filter. On the other hand, transcription- and translation-based gene networks operate on longer time scales rendering them effective low-pass filters. Thus, synthetic kinase/phosphatase circuits that in turn drive gene-based networks could be used to create bandstop filters, which could be coupled with other bandpass filters and used for complex patterning applications.
Intercell signaling circuits and pulse-based processing for genetic oscillators. Robust genetic oscillators with tunable periods have been developed through a combination of experimental and computational efforts11,12,110. In addition to shedding light on the design principles guiding the evolution of naturally occurring biological clocks and circadian rhythms, these synthetic oscillators may also have significant utility in biotechnology applications, such as in the synthesis and delivery of biologic drugs. Glucocorticoid secretion, for instance, has a circadian and ultradian pattern of release, resulting in transcriptional pulsing in cells that contain glucocorticoid receptors111. Therefore, pulsatile administration of hormones may have therapeutic benefit compared with synthetic hormones applied in a non-ultradian schedule.
An alternative to device-based periodic drug delivery systems could be engineered bacteria that reside in the human gut and synthesize an active drug at fixed time intervals. To realize such an application, one would need to develop and implement intercell signaling circuits for synchronizing and entraining synthetic genetic oscillators112,113. Such circuits could be based, for example, on modular components from bacterial quorum sensing systems. Along similar lines, one could engineer light-sensitive23,24 entrainment circuits for synchronizing mammalian synthetic genetic oscillators. This may help in the construction of oscillators that can faithfully follow circadian rhythms.
Spike- or pulse-based processing is present in neurons and has been adapted for use in hybrid computation in electrical systems, where interspike times are viewed as analog parameters and spike counts are viewed as digital parameters114. In synthetic gene circuits, pulse-based processing may open up exciting new methods for encoding information in engineered cells. For example, instead of transmitting information between cells by means of absolute levels of quorum-sensing molecules, the frequency of a robust genetic oscillator could be modulated. This might be useful in delivering information over longer distances, as frequency information may be less susceptible to decay over distance than absolute molecule levels. Representing signals in this fashion is analogous to frequency modulation encoding in electrical engineering.
Engineered circuits for biological containment. Biological containment, which refers to efforts for ensuring that genetically modified organisms do not spread throughout the natural environment, can be achieved by passive or active techniques. In passive containment, cells are engineered to be dependent on exogenous supplementation to compensate for gene defects, whereas in active containment, cells are engineered to directly express toxic compounds when located outside their target environments115. Synthetic genetic counters or timers for programmed cell death could be used as an active containment tool. Counting circuits could, for example, be designed to trigger cell suicide after a defined number of cell cycles or a sequence of events. Recently, we have developed two designs for synthetic counters—a recombinase-based cascade of memory units and a riboregulated transcriptional cascade—that could be adapted for this purpose9. In each case, one could incorporate into the counters promoters that are cell cycle–dependent and replace the output reporter proteins with toxic proteins (Fig. 5). Circuits of this sort would enable cells to be programmed to have limited, prescribed lifetimes.
Redundant circuits that implement digital logic allowing for the conditional survival of engineered cells only within their desired environments would also potentially reduce the failure rate of biological containment. If a broad set of interoperable parts were developed, multiple layers of control circuits could be built for increased reliability. As in electrical and mechanical engineering, quantitative analysis of failure rates in biological systems would enable improved systems-level design and robustness of synthetic gene networks. This could be accomplished, for example, by subjecting synthetic containment circuits to a variety of stressful conditions that would lead to increased mutation rates and thus improper functioning. Rational and directed evolutionary methods to engineer cells with decreased mutation rates or the application of redundant circuits could then be employed to minimize failure rates.
Whole-cell biosensors and response systems. Programmable cells that act as whole-cell biosensors have been created by interfacing engineered gene networks with the cell's natural regulatory circuitry19 or with other biological components, such as light-responsive elements23,24. The development of novel or reengineered sensory modalities and components would expand the range of applications that programmable cells could address. This could involve engineering proteins or RNAs to detect a range of small molecules116,117, or designing protein-based synthetic signaling cascades by rationally rewiring the protein-protein interactions and output responses of prokaryotic two-component signal transduction systems118.
The detection of electrical signals or production of biological energy (e.g., mimicking the operation of electrical electrocytes119) could also be enabled by incorporating natural or synthetic ion channels into engineered cells. In addition, magneto-responsive bacteria could play useful roles in environmental and medical applications120. Synthetic bacteria, designed to form magnetosomes and seek out cancer cells, could be used to enhance imaging, and magnetic bacteria could be engineered to interact with nanoparticles to enhance the targeting of cancer cells. Moreover, the introduction of mechanosensitive ion channels (e.g., MscL from Mycobacterium tuberculosis and MscS from E. coli) could endow designer cells with the ability to detect mechanical forces121. Such cells may be useful in vivo sensors for studying cellular differentiation signals or the effects of external stresses on the body.
Ultimately, programmable cells possessing novel sensory modules could be integrated with mechanical, electrical and chemical systems to detect, process and respond to external stimuli, and exploited for a variety of environmental and biomedical applications. For example, bacteria could be engineered to seek out hazardous chemicals or heavy metals in the environment, perform cleanup and return to their origin to report on the number of hazardous sites encountered via analysis by microfluidic devices. To eventually achieve such complex tasks, an intermediate goal might involve programming chemotactic bacteria to swim from waypoint to waypoint. A dish containing gradients of several chemoattractants would constitute the navigational course (Fig. 6a).
At the core of this design could be a synthetic gene network made up of a series of sequential toggle switches that control the expression of receptors needed for bacterial chemotaxis toward chemoattractants122 (Fig. 6b). The programmable cells would initially express only a single chemoattractant receptor, and therefore would migrate up only one of the chemoattractant gradients122. To determine that a waypoint has been achieved, a threshold-based toggle switch would be turned ON upon reaching a sufficiently high concentration of the chemoattractant. When the first toggle switch is ON, production of the first chemoattractant receptor would be suppressed and production of a second receptor allowed, resulting in cells swimming up the second chemical gradient. The ON switch would additionally prime the next toggle switch in the series to be switched ON when the second waypoint is reached. When that second toggle flips ON, the previous switch would be flipped OFF to ensure that only one chemoattractant is being followed at a time. The final chemoattractant would lead the bacteria back to its origin so that the engineered cells would complete a multi-stop round trip.
Designer circuits and systems for microbiome engineering. The human microbiome is fertile ground for the application of engineered organisms as scientific tools and therapeutic agents. There are unique bacterial populations residing in distinct locations in the human body that are perturbed in disease states123,124. Each represents an exciting opportunity for reengineering the human microbiome and designing targeted therapeutics for a range of conditions, including dermatologic, genitourinary, gastrointestinal, metabolic and immunologic diseases125,126,127.
Recently, bacteria have been engineered to infiltrate cellular communities for the purposes of delivering probes, gene circuits or chemicals128,129. In a similar fashion, bacteriophages carrying synthetic gene circuits could transform existing microbiome bacteria with new functionalities. For instance, given that anaerobic bacteria are known to migrate to hypoxic and necrotic regions of solid tumors130, bacteriophages could be designed to infect cancer-targeting bacteria. These bacteriophages could encode conditional expression of chemotherapeutic agents using synthetic logic gates or switches that are coupled to environmental sensors.
Bhatia and colleagues131 recently have developed nanoparticles that perform Boolean logic based on proteolytic activity. Viruses that infect tumor cells or bacteria could carry synthetic gene circuits that regulate in a programmable fashion the expression of enzymes that trigger nanoparticle activity. In these ways, one could develop targeted therapies against cancer or infectious diseases that exploit the human microbiome and synthetic gene networks.
Switchboard for dynamically controlling the expression of multiple genes. Engineered cells have long been used to produce recombinant proteins and chemicals for the biotechnology industry, and one of the major applications of synthetic biology to date has been in enhancing microbial production of biofuels132 and biomaterials133,134,135,136. Improving production from cells involves numerous engineering decisions related to the entire organism, including codon optimization, choosing whether or not to export recombinant proteins137, rational or evolutionary methods for improving metabolic yields138,139, and optimization of growth conditions. Often some or all of the genes required for production are non-optimal for bacterial expression and contain repetitive sequences that are unstable in bacterial hosts. Whole-gene synthesis techniques are increasingly being used to optimize coding sequences for recombinant production136.
These innovative approaches, as well as more traditional knockout techniques, introduce hard-wired changes into the genomes of interest. However, for many industrial and bioprocess applications, there is a need to dynamically modulate and control the expression of multiple genes, depending upon the state of the bioreactor. These situations would benefit from the development of a synthetic switchboard, one that could tune the expression of many different genes simultaneously and independently. Such a switchboard could be made up of a series of adjustable threshold genetic switches, riboregulators or riboswitches, and designed to respond to different environmental and intracellular variables, such as pH, light intensity and the metabolic state of the cell. The switchboard design, which would integrate novel sensory modalities with tunable, interoperable genetic circuits, would have broad functionality. It could be programmed, for example, to shift carbon flux between different pathways depending upon cellular conditions, thereby optimizing the production of biofuels, specialty chemicals and other materials.
The past decade has witnessed the power of intelligently applying engineering principles to biology in the development of many exciting, artificial gene circuits and biomolecular systems. We are convinced that next-generation synthetic gene networks will advance understanding of natural systems, provide new biological modules and create new tools that will enable the construction of even more complex systems. Most importantly, if the current pace of progress in synthetic biology continues, real-world applications in fields such as medicine, biotechnology, bioremediation and bioenergy will be realized.
We would like to thank the Howard Hughes Medical Institute and the National Institutes of Health Director's Pioneer Award Program for their financial support. We also thank the reviewers for their insights and suggestions.