Logic Synthesis of Recombinase-Based Genetic Circuits

Chiu, Tai-Yin; Jiang, Jie-Hong R.

doi:10.1038/s41598-017-07386-3

Download PDF

Article
Open access
Published: 09 October 2017

Logic Synthesis of Recombinase-Based Genetic Circuits

Scientific Reports volume 7, Article number: 12873 (2017) Cite this article

4974 Accesses
10 Citations
6 Altmetric
Metrics details

Subjects

Abstract

A synthetic approach to biology is a promising technique for various applications. Recent advancements have demonstrated the feasibility of constructing synthetic two-input logic gates in Escherichia coli cells with long-term memory based on DNA inversion induced by recombinases. Moreover, recent evidences indicate that DNA inversion mediated by genome editing tools is possible. Powerful genome editing technologies, such as CRISPR-Cas9 systems, have great potential to be exploited to implement large-scale recombinase-based circuits. What remains unclear is how to construct arbitrary Boolean functions based on these emerging technologies. In this paper, we lay the theoretical foundation formalizing the connection between recombinase-based genetic circuits and Boolean functions. It enables systematic construction of any given Boolean function using recombinase-based logic gates. We further develop a methodology leveraging existing electronic design automation (EDA) tools to automate the synthesis of complex recombinase-based genetic circuits with respect to area and delay optimization. In silico experimental results demonstrate the applicability of our proposed methods as a useful tool for recombinase-based genetic circuit synthesis and optimization.

Rational programming of history-dependent logic in cellular populations

Article Open access 21 September 2020

Genetic circuit design automation with Cello 2.0

Article 23 February 2022

Scalable recombinase-based gene expression cascades

Article Open access 11 May 2021

Introduction

The development of synthetic biology shows the feasibility to implement computing devices with DNA genetic circuits in living cells. Synthetic cellular designs often intended to implement certain functions that make cells respond to specific environmental stimuli or even change their growth and cellular development. For instance, synthetic toggle switches¹ and genetic oscillators^2,3,4,5 can be used to control cell metabolism, synthetic counters⁶ can be potentially applied to the regulation of telomere length and cell aggregation, and genetic logic gates^7,8,9,10 can achieve digital computation in response to stimulus input signals. In addition to these transcription-based DNA circuits, new emerging translational mRNA circuits¹¹ are likely to have impact on mammalian regenerative medicine and gene therapy. Through the genetic engineering, synthetic cellular circuits are potentially useful to perform therapeutic and diagnostic functions.

For some situations where noxious chemical stimuli exist for many cell generations, the computational results from the synthetic circuits in parent cells are required to be propagated to their daughter cells so that the daughter cells can save time to respond to the environmental stimuli. To achieve this transgenerational memory, one possible method is to store the computational results in separate synthetic memory devices which can be duplicated in cell divisions. In the recent work of Siuti et al.¹², a more efficient scheme for constructing synthetic cellular circuits with integrated logic and memory was proposed, where the computational result was automatically stored in the computing circuit configuration and the changes of configuration can be propagated to its descendant cells. The so-implemented circuits were built based on recombinases and tested in Escherichia coli cells and they showed a long-term memory for at least 90 cell generations. More recently, recombinase-based logic circuits has been applied in clinical uses. For instance, in recent work¹³ the authors demonstrate that biosensor made of recombinase-based logic gates can be used to detect pathological glycosuria in urine from diabetic patients. The ability to build complex recombinase-based logic circuits is an important step to enable widespread biomedical applications.

Specifically, the synthetic cellular circuits proposed by Siuti et al.¹² used serine recombinases Bxb1 and phiC31 to implement various two-input logic gates. A serine recombinase targeting a pair of non-identical recognition sites known as attB (attachment site bacteria) and attP (attachment site phage) is able to induce irreversible DNA inversion. As illustrated in Fig. 1(a), since the inversion makes the recognition sites become hybrid sites called attR and attL which cannot be targeted by the recombinase, no further inversion is allowed afterwards.

We illustrate how recombinases take part in the implementation of two-input logic gates with the two-input AND gate example shown in Fig. 1(b). (As a convention, in this paper we read a DNA sequence from left to right assuming the 5′-to-3′ direction of the coding strand). Let molecules AHL and aTc be the stimulus inputs to a cell and act as inducers activating the expressions of recombinases Bxb1 and phiC31, respectively. These recombinases when activated will irreversibly invert (flip) the DNA sequences flanked by their recognition sites (denoted by the colored triangle pairs). The DNA sequences being flanked can be a promoter, a transcription terminator, or a reporter, e.g., a green fluorescent protein (GFP). Inverting these DNA sequences will alter the output gene expression. In Fig. 1(b), two terminators were flanked by the recognition sites of recombinases Bxb1 and phiC31, and the output green fluorescent reporter is highly expressed only when both inducers AHL and aTc are in high concentration to activate BxB1 and phiC31 which together further flip and disable both terminators (denoted by letter “T”). Therefore, the circuit of Fig. 1(b) effectively implements a two-input AND gate. Note that such DNA sequence changes will survive through cell divisions and can be inherited to descendant cells in different generations. Hence the so-implemented logic function can achieve a long-term transgeneration memory.

Motivated by the viability and applicability of recombinase-based circuits, in this paper we formalize the construction of a general multi-input logic gate with its DNA sequence composed of series of promoters and transcription terminators targeted by multiple recombinases. We further characterize the set of Boolean functions realizable under such logic gates. In addition, we show a design flow for arbitrary Boolean function construction with cascaded recombinase-based logic gates. This automated design methodology is demonstrated by leveraging synthesis tool ABC¹⁴, an electronic design automation (EDA) tool developed at UC Berkeley, to synthesize cascaded multi-level recombinase-based circuits.

Methods and Results

To formalize the general multi-input gate construction, we use the three-input logic gates in Fig. 2(a–h) as examples to illustrate. Figure 2(a) shows a realization of a 3-input AND gate using three recombinases R ₁, R ₂, and R ₃, where molecule I _i is a stimulus input that activates the expression of recombinase R _i, for i = 1, 2, 3. Then R _i’s induce the inversions of their corresponding DNA sequence fragments. In order to express GFP in this gate, first we require R ₁ to invert the inverted promoter so that the RNA polymerase can bind to it and begin the transcription of the downstream DNA sequence in which the GFP gene resides. Second, R ₂ is needed to flip the terminator to avoid the termination of transcription before reaching the GFP gene. Third, R ₃ is demanded to upright the GFP gene for the RNA polymerase to initiate GFP production. Collectively, to have GFP highly expressed all R _i’s must exist, and thus this circuit implements a 3-input AND gate. Note that this 3-input AND gate, where the promoter and the reporter gene GFP can be flipped by recombinases, is designed in a different fashion from the 2-input AND gate in Fig. 1(b), where only transcription terminators are inverted by recombinases. The additional choice of flipping the DNA fragments of promoter and GFP gives more flexibility for logic gate construction.

In Fig. 2(b–h) we present seven other basic 3-input gates implemented with recombinases. Special implementations with nested targeting sites are applied on the XOR gate in (g) and the XNOR gate in (h). In the XOR gate in (g), the existence of one or three recombinases results in one or three times of GFP gene flipping and thus making the upside-down gene become upright, while the existence of two recombinases makes the GFP gene flip twice and remain upside down. Similar situations happen in the XNOR gate in (h).

Since the implementations of multi-input gates are possible, we are not constrained to using only 3-input gates and basic gate types, such as AND, OR, NAND, NOR, XOR, and XNOR gates. Rather, we can construct complex logic gates with more inputs. Figure 2(i) shows an example of a 4-input logic circuit

$$O=({R}_{1}+\overline{{R}_{2}}\oplus {R}_{3})\overline{{R}_{4}},$$

which can be directly realized by a single 4-input complex logic gate, instead of cascading multiple two-input gates.

Formalism of Recombinase-Based Logic Gates

Syntax of well-formed sequences

We define the following syntax to formalize the DNA sequences of logic gates constructed with recombinases. Here the basic elements composing a legal DNA sequence of a recombinase-based logic gate are “atomic terms”, including (inverted/non-inverted) transcription factors, (inverted/non-inverted) promoters, (inverted/non-inverted) genes, and targeting sites of recombinases. The syntax of DNA sequence forming a legal recombinase-based logic gate can be defined as follows.

Definition 1 An atomic term in a DNA sequence is a transcription terminator T, a promoter P, a gene G, an inverted transcription terminator , an inverted promoter , or an inverted gene . The syntax of an atomic term can be expressed in Backus-Naur Form as

(1)

Let the targeting sites attP and attB of recombinase r in a DNA sequence be denoted as “{_r” and “}_r”, respectively. In the sequel, the subscripts of {_r and }_r may be omitted for brevity when they are clear from the context or immaterial to the discussion. Note that targeting sites “{” and “}” of a recombinase must appear in a pair.

Definition 2 The syntax of a well-formed sequence (wfs) is recursively defined as follows.

$$\begin{array}{l}\langle wfs\rangle \,::=\,\langle atomic\,term\rangle \,|\,{\{\langle wfs\rangle \}}_{{r}_{i}}\,|\,\langle wfs\rangle \langle wfs\rangle .\end{array}$$

(2)

In this paper we concentrate on the special case of one-gene wfs (1g-wfs), where only one gene G, which is neither inverted nor sandwiched by targeting sites, appears at the end of the wfs and serves as the output. For example, , and are 1g-wfs’s. Notice that under the 1g-wfs setting, the logic gate has a single output and the gene can only be transcribed in one direction from left to right.

A pair of targeting sites of a recombinase is called basic if it only flanks an atomic term. Otherwise, it is called non-basic. We call a 1g-wfs basic if it contains only basic pairs of targeting sites, and non-basic if it contains some non-basic pair of targeting sites. For example, is a basic 1g-wfs. In contrast, and are non-basic 1g-wfs’s.

Furthermore, a non-basic pair of targeting sites can be nested. That is, a non-basic pair of targeting sites can be flanked by another pair of targeting sites. For instance, has nested two pairs of targeting sites targeted by the recombinases r ₃ and r ₄.

We discuss the logic functions induced by basic and non-basic 1g-wfs’s in the following.

Semantics of well-formed sequences – Basic well-formed sequences

We first study some reduction rules of basic 1g-wfs’s. Let σ be the DNA sequence of a basic 1g-wfs excluding the output gene, that is, σ is a basic wfs without any gene. We denote a wfs without any gene as 0g-wfs. Because σ is made of components ${\{T\}}_{{r}_{i}}$, and for any component C in σ the sequence σ can be decomposed into

$$\sigma ={\sigma }_{1}C{\sigma }_{2},$$

where σ ₁ and σ ₂ are two 0g-wfs’s, if non-empty. We show that the logic gate induced by the 1g-wfs σG can be further reduced to an equivalent form according to the type of the component C.

When C is a transcription terminator T, then σ equals

$${\sigma }_{1}T{\sigma }_{2}G\equiv {\sigma }_{2}G.$$

(3)

This equivalence holds because any transcription that starts from σ ₁ to gene G is always blocked by the transcription terminator T in the middle, making σ ₁ T a don’t-care and thus removable.

When C is an inverted terminator , then σ equals

(4)

This equivalence holds because the inverted terminator never blocks the transcription and is thus removable.

When C is a promoter P, then σ equals

$${\sigma }_{1}P{\sigma }_{2}G\equiv P{\sigma }_{2}G.$$

(5)

This equivalence holds because no matter whether there is a transcription that starts from σ ₁ to G or not, a transcription can always start from the promoter P. Therefore, σ ₁ is a don’t-care and thus removable.

When C is an inverted promoter, then σ equals

(6)

This equivalence holds because the transcription that begins at proceeds across σ ₁ in the direction from right to left, it does not pass through G. As a result, the expression of G can not be initiated by and thus can be removed from the sequence.

When C is since an atomic term A is equivalent to {A}_r for recombinase r being in low concentration (denoted R = 0 by treating r as a Boolean variable R of value 0) or for recombinase r being in high concentration (denoted R = 1 by treating r as a Boolean variable R of value 1), the reduction rules for C can be easily extended from the previous rules as summarized below.

$${\sigma }_{1}{\{T\}}_{r}{\sigma }_{2}G\equiv \{\begin{array}{ll}{\sigma }_{2}G, & {\rm{for}}\,R=0\\ {\sigma }_{1}{\sigma }_{2}G, & {\rm{for}}\,R=1\end{array}$$

(7)

(8)

$${\sigma }_{1}{\{P\}}_{r}{\sigma }_{2}G\equiv \{\begin{array}{cc}P{\sigma }_{2}G, & {\rm{for}}\,R=0\\ {\sigma }_{1}{\sigma }_{2}G, & {\rm{for}}\,R=1\end{array}$$

(9)

(10)

with the above analysis, we can derive the corresponding Boolean function of a given 1g-wfs. Consider the 1g-wfs σG with the sequence σ targeted by recombinases r _i, $i=1,...,n$. Activating the expression of gene G requires the recombinases r _i’s have adequate (high or low) concentrations so that the 1g-wfs σG effectively reduces to PG. The Boolean function induced by σG is determined through a series of decisions made by r _i’s. In essence, it corresponds to a decision list¹⁵. To illustrate, consider the example The decision list induced by the 1g-wfs σG is shown in Fig. 3. Note that given a sequence without non-basic targeting sites, the decisions always start from the rightmost to the leftmost components because a component closer to the gene may overwrite the effects imposed by the components on its left and thus it is of higher priority. Therefore, the Boolean function of σG is determined starting from R ₁ to R ₅. In order to reduce σ to P to express gene G, first we must require R ₁ to be 1. Otherwise if R ₁ = 0, σ becomes equivalent to a null sequence no matter what other R _i’s are. Next, if we let R ₂ be 1, we can have an equivalent sequence equal to P as wished. Otherwise we can let R ₂ be 0 and look for other possibilities for the reduction to P. If R ₂ = 0, we can easily tell that the only possibility occurs when R ₃ and R ₄ are both 0 and that the logic of R ₅ never affects the reduction. Collectively, the logic function of the gate σG is derived as ${R}_{1}\cdot ({R}_{2}+\overline{{R}_{3}}\cdot \overline{{R}_{4}})$, where symbol “+” denotes Boolean disjunction, symbol “·” denotes Boolean conjunction, and symbol “−” or “!” denotes Boolean negation. In the sequel, we sometimes omit the conjunction symbol “·” in a Boolean expression.

In general, we can systematically convert any basic 1g-wfs to its corresponding logic function. To achieve this conversion, the operator Ω over a 1g-wfs is defined in Table 1. For an empty sequence ⊥, we define Ω[⊥] = 0. For example, the Boolean function of the 1g-wfs is derived by

Table 1 Operators for parsing basic 1g-wfs σCG, with (non-empty) 0g-wfs σ, component C, and gene G, to logic function.

Full size table

Semantics of well-formed sequences – Non-basic well-formed sequences

We extend the above derivation of Boolean function to non-basic 1g-wfs’s by having the operator Ω over a 0g-wfs {σ}_r (which can be basic or non-basic) defined as

(11)

where is the inverted sequence of σ. To understand equation (11), consider a 1g-wfs σG with only one pair of non-basic targeting sites. Suppose σ = {σ ₁}_r, where σ ₁ is a basic 0g-wfs. Then σ is equal to σ ₁ when R = 0 and to , the inverted sequence of σ ₁, when R = 1. For example, the logic function for can be obtained by

For a 1g-wfs with multiple (possibly nested) non-basic pairs of targeting sites, its logic function can also be directly derived by the Ω operator. For example, the logic function for can be obtained by

Non-basic pairs of targeting sites can be exploited to efficiently construct special Boolean functions. One of such special functions is the parity function. An n-input odd parity function can be realized by the 1g-wfs

When there is an odd number of R _i’s equal to 1, the 1g-wfs reduces to sequence PG and gene G can be expressed. Otherwise it reduces to sequence G and gene G cannot be expressed. On the other hand, the n-input even parity function can be realized by the 1g-wfs

$$\mathop{\underbrace{\{\cdots \{}}\limits_{n}P{\}}_{{r}_{1}}\cdots {\}}_{{r}_{n}}G.$$

Construction of Multi-level Recombinase-Based Logic Circuits

With the recombinase-based logic gates built from 1g-wfs’s, we can cascade them to implement arbitrary complex multi-level circuits. For example, the logic function Z = (A + B)(A ⊕ B) can be implemented with the two-level circuit shown in Fig. 4(a), which is composed of an OR-gate, an XOR-gate, and an AND-gate. One possible DNA implementation of Z with cascade can be derived by converting each gate to their 1g-wfs realizations as shown in Fig. 4(b). The 1g-wfs’s that encode the genes R ₁, R ₂, and Z correspond to the OR, XOR and AND gates, respectively. The recombinases r ₁ and r ₂ as the inputs to the AND gate are the intermediate signals.

Because the basic 1g-wfs gates can implement decision list functions, they form a functionally complete set of primitive logic gates that can be composed to implement any Boolean function. Therefore the 1g-wfs gates can be collected as a library for the synthesis of complex logic circuits. By leveraging conventional logic synthesis tools in electronic design automation (EDA), recombinase-based logic circuits can be synthesized with the flow shown in Fig. 5(a). Given a Boolean function or circuit netlist as the input, it is first optimized by technology-independent techniques for circuit simplification. The simplified circuit is further optimized by technology-dependent techniques for technology mapping using the primitive gates in the given standard cell library. To achieve recombinase-based logic circuit synthesis, the main task is to provide the library while all other optimization tasks can be done using existing logic synthesis tools.

In this work, we adopt ABC¹⁴, an industrial-strength logic synthesis tool developed at UC Berkeley, for circuit synthesis and optimization. Given a circuit netlist, we first apply ABC to perform technology-independent optimization on the netlist, e.g., Boolean minimization to minimize the number of product terms and literals. We then use ABC to perform technology mapping to implement the area or performance optimized netlist using the 1g-wfs gates in the library.

To illustrate the synthesis flow, we consider implementing ISCAS benchmark circuit c17 shown in Fig. 5(b) with recombinase-based genetic circuit realization. The circuit consists of five inputs A, B, C, D, and E, and two outputs Y and Z with functions

$$\{\begin{array}{l}Y=AB+\overline{(BC)}D\\ Z=\overline{(BC)}D+\overline{(BC)}E.\end{array}$$

(12)

For area-driven synthesis of benchmark c17, there are 44 DNA gates defined by their 1g-wfs’s with up to three recombinase inputs. They are collected as the library as shown in Fig. 5(c). According to the experiment in the previous work¹², where the promoters and transcription terminators used are roughly of the same length, we treat the area cost of both promoter and transcription terminator as unity. Therefore, the area cost of a DNA gate is defined as the number of atomic terms, excluding the output gene, that appear in the 1g-wfs of the gate. For example, the gate c3_1 corresponding to a 3-input OR gate has three inverted promoters as shown in Fig. 2(d). Hence, the area cost of c3_1 is counted as 3 units. By providing the c17 netlist and the library to ABC, the tool can perform optimization and technology mapping to find an area-optimized circuit composed of DNA gates of the library. Note that area minimization of a recombinase-based circuit effectively reduces the number of used promoters and terminators on the DNA strand implementation. Therefore, less effort is required to synthesize the intended DNA strand via DNA assembly methods, e.g., Gibson assembly¹⁶. More importantly, a shorter DNA sequence is more likely to succeed in vector insertion to deploy the genetic circuit into the host cell to conduct the intended computation.

Figure 6(a) shows the result described in Verilog language of the synthesized c17 recombinase-based circuit using library gates listed in Fig. 5(c). The synthesized circuit comprises gates c2_4, c2_5, c3_14, and c3_25, and the total area cost is 10 units. Note that the naive DNA circuit implementation of c17 circuit by converting the digital logic gates in Fig. 5(b) to the corresponding DNA gates results in a total area cost of 12 units. Compared to the naive implementation, the area cost of the circuit synthesized by ABC technology mapping decreases. The logic functions of Y and Z in the synthesized circuit can be easily verified to be consistent with equation (12), implying the correctness of the synthesis result. The DNA circuit of module c17 in Fig. 6(a) is plotted in Fig. 6(c), where the symbols A, B, C, D, E, n7, and n8 represent some serine recombinases. In practice, to have recombinases achieve site-specific recombination in a synthetic genetic circuit, recombinases that have been reported to function outside their native hosts may be used. For example, well-reported recombinases^{17,18,19,20,21,22,23,24,25,26,27,28,29}, such as ϕC31, ϕBT1, R4, BxB1, TP901-1, RV, SPBc, TG1, ϕFC1, MR11, ϕ370, ϕK38, A118, W β, and BL3 integrase, can be plausible molecular parts for realization of the recombinase signals in Fig. 6(c).

Note that there can be more than one area-optimized circuit of a logic function. For comparison, in Fig. 6(b) we show another manually designed DNA implementation of c17 circuit whose area cost is 10 units as well. The corresponding DNA circuit is plotted in Fig. 6(d). Notice that the two circuits in Fig. 6 differ not only in their constituent logic gates, but also in their logic depths. The circuit of Fig. 6(c) is of two logic levels, whereas that of Fig. 6(d) is of three logic levels. There are six longest paths in the former circuit:

$$\{\begin{array}{l}A\to n7\to Y,\\ B\to n7\to Y,\\ B\to n8\to Y,\\ B\to n8\to Z,\\ C\to n8\to Y,\\ C\to n8\to Z.\end{array}$$

They involve a cascade of two logic gates. On the other hand, there are two longest paths in the latter circuit:

$$\{\begin{array}{l}B\to n7\to n8\to Y,\\ C\to n7\to n8\to Y.\end{array}$$

They involve a cascade of three logic gates. In digital electronic circuits, a longer circuit path often corresponds to a longer propagation delay between circuit input and output signals. Similarly in biological circuits, a longer circuit path involves more transcription and translation cascades, resulting in a longer response time of output gene expression to input stimuli. Here, the former and latter circuits involve two (n7 and Y) and three (n7, n8, and Y) gene expression cascades, respectively. Therefore, although these two circuits have the same area cost, the circuit of Fig. 6(c) is preferred due to its better performance, i.e., shorter input-to-output response time. In addition, we will detail in Section Discussion that the delay optimization may present fewer foreign genes and thus impose less metabolic burden on the host cell. In the in silico experiments, we will synthesize circuits with area or performance optimized.

To demonstrate the feasibility of the proposed synthesis flow, we conduct in silico experiment on other 67 ISCAS benchmark circuits using recombinase-based DNA gates. We expanded the library such that it includes all 684 DNA gates with decision list functions up to five inputs. In the library, the area cost of a gate is determined by the number of atomic terms, excluding the output gene, appearing in its corresponding 1g-wfs. To reduce the number of gene expression cascades, we simply assume each logic gate is of the same unit delay. By specifying a unit delay for each gate in the library, the delay of a synthesized circuit equals the logic level, which equals the number of gene expression cascades in the longest path in the circuit. Consequently, under the unit delay model the performance-driven logic synthesis minimizes the delay time between input stimuli and output response in the synthesized recombinase-based circuit. Note that this simple unit delay model is not meant to reflect the timing behavior of actual biological systems, but to facilitate the logic synthesis algorithm to perform circuit logic level minimization.

The experimental results of 54 (out of the 67) circuits are shown in Table 2. The numbers of primary inputs/outputs, the number of inverters, and the number of logic gates (with the number of included buffers, if non-zero, reported in parentheses) are listed Columns 2, 3, and 4, respectively. The circuits were synthesized under two optimization settings: one for area optimization and the other for delay optimization. The results of area optimization are reported in Columns 5–7 and those of delay optimization are reported in Columns 8–10. For each synthesized circuit, its number of DNA gates, total area, and gate level are shown. In the naive implementations of benchmark circuits by simply converting the digital logic gates to the corresponding DNA gates, the total area of a DNA circuit can be roughly calculated as “#inverter” + 2 × “#gate”. Compared to the naive implementation, the circuits synthesized by ABC have much less area cost. Taking circuit b18 for example, we observe that the total area of the naive implementation is about 202110 which is much larger compared to the area 101870 of the area-optimized implementation and 105328 of the delay-optimized implementation. On the other hand, comparing area and delay optimized b18 circuits, delay optimization reduces the number of gate levels from 137 to 51 at cost of increasing area by 3500 units.

Table 2 Results of technology mapping of ISCAS benchmark circuits.

Full size table

Discussion

Area vs. Delay Optimization

To pursue area or delay optimization in genetic circuit synthesis is a matter of tradeoff, and may depend on the intended application and/or biological feasibility. Nevertheless, Table 2 reveals that when the library of recombinase-based logic gates is used in ABC for logic synthesis, delay optimization often achieves effective reduction (62% on average) in logic level, or circuit depth, with a slight increase (7% on average) in circuit area compared to area optimization. Taking the largest circuit b18 benchmark for example, from area to delay optimization, the area cost increases by 3.39% while the logic level decreases by 62.77%. Particularly, in practice since we are limited by the biotechnology and the metabolic burden, circuits to be synthesized cannot be as large as b18 benchmark, which only serves as a proof of concept. Instead, small circuits, such as b06, are more likely to be implemented. For benchmark b06, the area cost increases by 10.71% (56 to 62) and the logic level decreases by 50% (6 to 3) from area to delay optimization. Moreover, the delay optimization helps reduce metabolic burden (to be discussed below). These facts imply that delay-driven optimization may often be a proper objective for logic synthesis of recombinase-based genetic circuits.

Metabolic Burden

One of the advantages of recombinase-based genetic circuits is its low metabolic burden imposed on the host cell³⁰. Unlike a classic genetic circuit requiring continuous production of and action by activators or repressors to maintain the output gene expression, the output gene expression in a recombinase-based genetic circuit is determined by its DNA configuration, which is changed by DNA inversion or excision by recombinases; no further continuous recombinase supply and action is needed afterwards. This permanent configuration change is understood as a long-term (nonvolatile) memory, leading to the advantage of a lower metabolic burden on the host cell. This advantage may allow more complex genetic circuit implementation using recombinases. For example, recombinase-based finite state machines have been implemented in E. coli cells³¹. Moreover, a 6-input AND gate, a 2-data-input 4-select-input Boolean logic look-up table, a full adder, a full subtractor, and a half adder-subtractor were implemented in human embryonic kidney and Jurkat T cells³². Furthermore, we have shown recombinase-based logic gates can be adopted in the conventional logic synthesis flow for efficient circuit optimization. Because an efficient design can reduce metabolic burden and outperform an inferior counterpart even with the same functionality³³, complex circuit implementation may benefit from the automation and optimization method proposed in this report.

Even with recombinase based construction, implementing a large circuit in a living cell may still be challenging due to the increase of metabolic burden³⁴ caused by two major effects. First, a larger synthetic circuit requires more cellular energy to maintain its presence in the host cell³⁵. Second, a large number of introduced genes will compete for the transcriptional and translational resources, resulting in resource redistribution³⁶ and unexpected coupling among seemingly unconnected modules³⁷, and thus leading to cell growth defects and poorly predictable circuit behavior. One approach to address these issues is to separate the target circuit into sub-circuits and implement the circuit across a consortium of host cells^7,38,39,40. In particular, the consortium is divided into colonies of the same number of the sub-circuits. Each colony is composed of a strain implementing one of the sub-circuits. The sub-circuits are connected through cell-cell communication by wiring molecules (for example, quorum-sensing molecules and yeast pheromones) or metabolites like benzoic acid. Collectively, the whole cell population implements the target circuit. This distributed strategy may also apply to a large recombinase-based circuit implementation. For instance, the c17 circuit in Fig. 6(a) may be implemented by distributing the gates g0, g1, g2, and g3 into four strains of cells.

We note from Table 2 that when using recombinase-based logic gates as the library for a target circuit synthesis, the option of delay optimization introduces fewer DNA gates, each of which contains a gene, than the option of area optimization. Hence, delay optimization is preferred over area optimization due to a lower metabolic burden imposed by fewer foreign genes in the delay-optimized circuit.

Experimental Steps for Circuit Realization

Given a target Boolean function to be implemented as a genetic circuit, our method can be applied as the first step to build the blueprint for the wet-lab construction by using the logic synthesis tool ABC to derive the area or delay-optimized circuit. The next task is to associate the abstract signals of the synthesized netlist with concrete biochemical parts, including promoters, recombinases, and genes, for wet-lab implementation. After this association step, the DNA molecule of the genetic circuit is readily to be constructed by Gibson assembly¹⁶, Unique Nucleotide Sequence (UNS) Guided assembly⁴¹, or other assembly methods. Note that the promoters used here should have the ability to strongly promote transcription. After the assembly, the DNA constructs are transformed/transfected into cells using a standard protocol, such as the polyethylenimine (PEI) protocol. The cells should be kept and maintained in custom or standard media, such as Luria-Bertani (LB) medium and Dulbecco’s Modified Eagle’s medium (DMEM), and grown for one to two days in a stimuli-free medium. To test the synthetic circuit, cells have to be exposed to stimuli and grown for several hours, and then the fluorescence response from cells is measured by a flow cytometer. For each sample of the measurement, the same number of cells should be used for consistency. After creating a gate using forward scatter (FSC) and side scatter (SSC) and applying a proper fluorescence threshold on each fluorescent protein channel, the percentage of cells in an ON state is determined by flow cytometry analysis.

Alternative Genetic Circuit Construction with CRISPR/Cas9 Systems

Cas9 nucleases⁴² may possibly be exploited to achieve gene expression effects equivalent to what recombinases can achieve. For example, Cas9 nucleases are able to induce DNA deletion^43,44, defective Cas9 nucleases (dCas9) can repress transcription by blocking transcriptional initiation or elongation⁴⁵, and dCas9 fused with a transcriptional activator is capable of activating gene expression⁴⁶. Specifically, DNA deletion of P and T may achieve an effect equivalent to inverting P and T, respectivley; transcription repression may achieve an effect equivalent to inverting P and ; transcription activation may achieve an effect equivalent to inverting and T. These mechanisms allow CRISPR/Cas9 systems to be utilized as recombinase replacements for the implementation of decision list logic functions.

Conclusion

In this paper, we generalized the two-input recombinase-based DNA logic gates to multi-input cases. We formalized the syntax of recombinase-based logic gate construction, and obtained the Boolean function semantics of well-defined DNA sequences of recombinase-based logic gates. We also showed how to synthesize multi-level recombinase-based logic circuits using existing logic synthesis tools. In silico experimental results demonstrate the feasibility and efficiency of our proposed methods as a tool for recombinase-based genetic circuit minimization. As recombinase-based logic circuits have been used in clinical biomarker detection and tested in human cells, our tool can be useful to automate complex recombinase-based circuit construction for biologists to implement advanced biomedical applications.

References

Gardner, T. S., Cantor, C. R. & Collins, J. J. Construction of a genetic toggle switch in Escherichia coli. Nature 403, 339–342 (2000).
Article ADS CAS PubMed Google Scholar
Elowitz, M. B. & Leibler, S. A synthetic oscillatory network of transcriptional regulators. Nature 403, 335–338 (2000).
Article ADS CAS PubMed Google Scholar
Hasty, J., Dolnik, M., Rottschäfer, V. & Collins, J. J. Synthetic gene network for entraining and amplifying cellular oscillations. Physical Review Letters 88, 148101 (2002).
Article ADS PubMed Google Scholar
Fung, E. et al. A synthetic gene–metabolic oscillator. Nature 435, 118–122 (2005).
Article ADS CAS PubMed Google Scholar
Stricker, J. et al. A fast, robust and tunable synthetic gene oscillator. Nature 456, 516–519 (2008).
Article ADS CAS PubMed Google Scholar
Friedland, A. E. et al. Synthetic gene networks that count. Science 324, 1199–1202 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Tamsir, A., Tabor, J. J. & Voigt, C. A. Robust multicellular computing using genetically encoded NOR gates and chemical ‘wires’. Nature 469, 212–215 (2011).
Article ADS CAS PubMed Google Scholar
Wang, B., Kitney, R. I., Joly, N. & Buck, M. Engineering modular and orthogonal genetic logic gates for robust digital-like synthetic biology. Nature communications 2, 508 (2011).
Article ADS PubMed PubMed Central Google Scholar
Moon, T. S., Lou, C., Tamsir, A., Stanton, B. C. & Voigt, C. A. Genetic programs constructed from layered logic gates in single cells. Nature 491, 249–253 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Bonnet, J., Yin, P., Ortiz, M. E., Subsoontorn, P. & Endy, D. Amplifying genetic logic gates. Science 340, 599–603 (2013).
Article ADS CAS PubMed Google Scholar
Kopniczky, M. B., Moore, S. J. & Freemont, P. S. Multilevel regulation and translational switches in synthetic biology. IEEE transactions on biomedical circuits and systems 9, 485–496 (2015).
Article PubMed Google Scholar
Siuti, P., Yazbek, J. & Lu, T. K. Synthetic circuits integrating logic and memory in living cells. Nature biotechnology 31, 448–452 (2013).
Article CAS PubMed Google Scholar
Courbet, A., Endy, D., Renard, E., Molina, F. & Bonnet, J. Detection of pathological biomarkers in human clinical samples via amplifying genetic switches and logic gates. Science Translational Medicine 7, 289ra83 (2015).
Brayton, R. & Mishchenko, A. ABC: An academic industrial-strength verification tool. In Proceedings of International Conference on Computer Aided Verification, 24–40 (Springer, 2010).
Rivest, R. R. Learning decision lists. Machine Learning 2, 229–246 (1987).
Google Scholar
Gibson, D. G. et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nature methods 6, 343–345 (2009).
Article CAS PubMed Google Scholar
Brown, W. R., Lee, N. C., Xu, Z. & Smith, M. C. Serine recombinases as tools for genome engineering. Methods 53, 372–379 (2011).
Article CAS PubMed Google Scholar
Thorpe, H. M. & Smith, M. C. In vitro site-specific integration of bacteriophage DNA catalyzed by a recombinase of the resolvase/invertase family. Proceedings of the National Academy of Sciences 95, 5505–5510 (1998).
Article ADS CAS Google Scholar
Bibb, L. A., Hancox, M. I. & Hatfull, G. F. Integration and excision by the large serine recombinase φRv1 integrase. Molecular microbiology 55, 1896–1910 (2005).
Article CAS PubMed Google Scholar
Ghosh, P., Kim, A. I. & Hatfull, G. F. The orientation of mycobacteriophage Bxb1 integration is solely dependent on the central dinucleotide of attP and attB. Molecular cell 12, 1101–1111 (2003).
Article CAS PubMed Google Scholar
Xu, Z. et al. Site-specific recombination in Schizosaccharomyces pombe and systematic assembly of a 400 kb transgene array in mammalian cells using the integrase of Streptomyces phage ϕBT1. Nucleic acids research 36, e9–e9 (2008).
Keravala, A. et al. A diversity of serine phage integrases mediate site-specific recombination in mammalian cells. Molecular Genetics and Genomics 276, 135 (2006).
Article CAS PubMed Google Scholar
Olivares, E. C., Hollis, R. P. & Calos, M. P. Phage R4 integrase mediates site-specific integration in human cells. Gene 278, 167–176 (2001).
Article CAS PubMed Google Scholar
Breüner, A., Brøndsted, L. & Hammer, K. Resolvase-like recombination performed by the TP901-1 integrase. Microbiology 147, 2051–2063 (2001).
Article PubMed Google Scholar
Lazarevic, V. et al. Nucleotide sequence of the Bacillus subtilis temperate bacteriophage SPβc2. Microbiology 145, 1055–1067 (1999).
Article CAS PubMed Google Scholar
Morita, K. et al. In vitro characterization of the site-specific recombination system based on actinophage TG1 integrase. Molecular Genetics and Genomics 282, 607–616 (2009).
Article CAS PubMed Google Scholar
Yang, H.-Y., Kim, Y.-W. & Chang, H.-I. Construction of an integration-proficient vector based on the site-specific recombination mechanism of Enterococcal temperate phage φFC1. Journal of bacteriology 184, 1859–1864 (2002).
Article CAS PubMed PubMed Central Google Scholar
Rashel, M. et al. A novel site-specific recombination system derived from bacteriophage ϕMR11. Biochemical and biophysical research communications 368, 192–198 (2008).
Article CAS PubMed Google Scholar
Canchaya, C. et al. Genome analysis of an inducible prophage and prophage remnants integrated in the Streptococcus pyogenes strain SF370. Virology 302, 245–258 (2002).
Article CAS PubMed Google Scholar
Jusiak, B. et al. Synthetic gene circuits. Reviews in Cell Biology and Molecular Medicine (2014).
Roquet, N., Soleimany, A. P., Ferris, A. C., Aaronson, S. & Lu, T. K. Synthetic recombinase-based state machines in living cells. Science 353, aad8559 (2016).
Weinberg, B. H. et al. Large-scale design of robust genetic circuits with multiple inputs and outputs for mammalian cells. Nature Biotechnology (2017).
Ceroni, F., Algar, R., Stan, G.-B. & Ellis, T. Quantifying cellular capacity identifies gene expression designs with reduced burden. Nature methods 12, 415–418 (2015).
Article CAS PubMed Google Scholar
Gorochowski, T. E., Van Den Berg, E., Kerkman, R., Roubos, J. A. & Bovenberg, R. A. Using synthetic biological parts and microbioreactors to explore the protein expression characteristics of Escherichia coli. ACS synthetic biology 3, 129–139 (2013).
Article PubMed Google Scholar
Glick, B. R. Metabolic load and heterologous gene expression. Biotechnology advances 13, 247–261 (1995).
Article CAS PubMed Google Scholar
Gorochowski, T. E., Avcilar-Kucukgoze, I., Bovenberg, R. A., Roubos, J. A. & Ignatova, Z. A minimal model of ribosome allocation dynamics captures trade-offs in expression between endogenous and synthetic genes. ACS synthetic biology 5, 710–720 (2016).
Article CAS PubMed Google Scholar
Gyorgy, A. et al. Isocost lines describe the cellular economy of genetic circuits. Biophysical journal 109, 639–646 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Regot, S. et al. Distributed biological computation with multicellular engineered networks. Nature 469, 207–211 (2011).
Article ADS CAS PubMed Google Scholar
Chen, Y., Kim, J. K., Hirning, A. J., Josić, K. & Bennett, M. R. Emergent genetic oscillations in a synthetic microbial consortium. Science 349, 986–989 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Silva-Rocha, R. & de Lorenzo, V. Engineering multicellular logic in bacteria with metabolic wires. ACS synthetic biology 3, 204–209 (2013).
Article PubMed Google Scholar
Torella, J. P. et al. Unique nucleotide sequence–guided assembly of repetitive DNA parts for synthetic biology applications. Nature protocols 9, 2075–2089 (2014).
Article CAS PubMed PubMed Central Google Scholar
Doudna, J. A. & Charpentier, E. The new frontier of genome engineering with CRISPR-Cas9. Science 346, 1258096 (2014).
Article PubMed Google Scholar
Cong, L. et al. Multiplex genome engineering using CRISPR/Cas systems. Science 339, 819–823 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ran, F. A. et al. Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity. Cell 154, 1380–1389 (2013).
Article CAS PubMed PubMed Central Google Scholar
Qi, L. S. et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173–1183 (2013).
Article CAS PubMed PubMed Central Google Scholar
Gilbert, L. A. et al. CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442–451 (2013).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported in part by the Ministry of Science and Technology of Taiwan under grant MOST 106-2923-E-002-002-MY3.

Author information

Authors and Affiliations

Graduate Institute of Electronics Engineering, National Taiwan University, Taipei, 10617, Taiwan
Tai-Yin Chiu & Jie-Hong R. Jiang
Department of Electrical Engineering, National Taiwan University, Taipei, 10617, Taiwan
Jie-Hong R. Jiang
Genome and Systems Biology Degree Program, National Taiwan University, Taipei, 10617, Taiwan
Jie-Hong R. Jiang

Authors

Tai-Yin Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Jie-Hong R. Jiang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.Y.C. and J.H.J. conceived and designed the experiments. T.Y.C. performed the experiments. T.Y.C. and J.H.J. analyzed the data. T.Y.C. and J.H.J. contributed reagents/materials/analysis tools. T.Y.C. and J.H.J. wrote the paper.

Corresponding author

Correspondence to Jie-Hong R. Jiang.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chiu, TY., Jiang, JH.R. Logic Synthesis of Recombinase-Based Genetic Circuits. Sci Rep 7, 12873 (2017). https://doi.org/10.1038/s41598-017-07386-3

Download citation

Received: 16 January 2017
Accepted: 23 June 2017
Published: 09 October 2017
DOI: https://doi.org/10.1038/s41598-017-07386-3

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.