Towards an engineering theory of evolution

Castle, Simeon D.; Grierson, Claire S.; Gorochowski, Thomas E.

doi:10.1038/s41467-021-23573-3

Download PDF

Perspective
Open access
Published: 07 June 2021

Towards an engineering theory of evolution

Nature Communications volume 12, Article number: 3326 (2021) Cite this article

25k Accesses
27 Citations
147 Altmetric
Metrics details

Subjects

Abstract

Biological technologies are fundamentally unlike any other because biology evolves. Bioengineering therefore requires novel design methodologies with evolution at their core. Knowledge about evolution is currently applied to the design of biosystems ad hoc. Unless we have an engineering theory of evolution, we will neither be able to meet evolution’s potential as an engineering tool, nor understand or limit its unintended consequences for our biological designs. Here, we propose the evotype as a helpful concept for engineering the evolutionary potential of biosystems, or other self-adaptive technologies, potentially beyond the realm of biology.

A biological perspective on evolutionary computation

Article 18 January 2021

Risto Miikkulainen & Stephanie Forrest

The developing toolkit of continuous directed evolution

Article 22 May 2020

Mary S. Morrison, Christopher J. Podracky & David R. Liu

Steering and controlling evolution — from bioengineering to fighting pathogens

Article 03 July 2023

Michael Lässig, Ville Mustonen & Armita Nourmohammad

Introduction

The past few decades have seen a revolution in our ability to engineer biology and create living systems with novel functions¹. Yet, several hurdles still hinder our capability to harness biology’s full potential². These stem from the fact that you cannot engineer the stuff of life without engineering its properties too and life’s most fundamental property is that it evolves. Evolution makes engineering living systems a radically different challenge to engineering other mediums. To be effective, we cannot simply apply traditional engineering design principles to biology and deal with evolution as a secondary thought. If nothing in biology makes sense except in the light of evolution³, then evolution must be a central part of an engineering theory of biology.

Evolution poses both a challenge and an opportunity when designing biosystems. On one hand, it is a detrimental force that can unpick the meticulous plans of an engineer through genetic variation⁴. Designed biosystems cannot escape evolution when used and loss of function is a particular concern for engineers, especially as there are often selection pressures working against the design’s function^5,6. It is essential that we learn to build evolutionarily stable biosystems that can continue to operate under unavoidable evolutionary forces.

On the other hand, evolution is an extremely effective problem solver and engineers have exploited this fact for decades^7,8,9,10. For example, directed evolution can be used to optimise or even generate completely novel traits in proteins^8,11 or cells¹². However, these methods rely on the ability of evolution to find solutions in a reasonable length of time. For most systems, the search space is so vast that the starting point in this process must have the potential to generate useful phenotypes relatively quickly.

Evolution may even be employed as a feature of the system during operation. For example, adaptive systems that evolve in response to environmental cues or evolvable genetic circuits that can be designed with specific classes of phenotype that are reached as necessary through evolutionary change. To create such systems, it is critical that the biological design is specifically evolvable. This means it must have the potential to generate the types of phenotypes desired by the engineer from a single starting point in a reasonable time frame.

Even more critical is our moral obligation to develop a deeper understanding of how synthetic biosystems will continue to evolve if deployed into our bodies or the wider environment¹³. The field has rightly made efforts to develop tools to reduce and mitigate evolution¹⁴, with fail-safes such as kill switches¹⁵ or metabolic dependencies¹⁶. However, without a good theoretical understanding of how synthetic biosystems might continue to evolve once deployed, we risk these technologies developing unexpected faults with dire, but avoidable, consequences. Even breeding has at times had dire consequences. Notably, the inadvertent creation of the hyper-aggressive Africanised bee, which has had a severe impact on humans and ecology¹⁷. As we develop technologies capable of even more rapid genetic change, such as gene drives¹⁸, these concerns will become even more salient.

Central to many of these issues is the view in traditional engineering disciplines that the engineered artefact is a final destination in the design process. This view breaks for biology. Instead, we believe that a new perspective is needed for a truly effective engineering of biology; one that sees a designed biosystem as a starting point in a lineage of possibilities. Although much of evolutionary biology has concerned itself with organisms’ histories¹⁹, bioengineers must consider the future and, specifically, how a biosystem will continue to evolve when used²⁰. Here we describe a framework that enables this transition and offers a way to specify, test and conceive the properties of biosystems in terms of their evolutionary potential and not just their phenotype (Fig. 1). This provides a way to re-imagine biological engineering so that it works in harmony with life’s ability to evolve.

**Fig. 1: The evotype and its key properties.**

The design type and the evotype

To better harness the capabilities of biology, we need a way of thinking about the evolutionary properties of engineered biosystems. We must design for potential evolutionary change and not just the immediate functionalities displayed by a system (i.e., its phenotypic traits). Although these are properties of populations yet to exist, they can still be considered in the context of an individual biosystem. We consider the ‘design type’ as a system that has been engineered, consisting of a single genotype. The design type could be any biosystem capable of evolution: a protein, genetic circuit, virus, cell, animal, plant or even an ecosystem. We introduce the concept of the ‘evotype’ to capture the evolutionary properties of that system. The evotype is a set of evolutionary dispositions of the design type, analogous to genotype and phenotype being sets of genes and traits, respectively (Table 1). Unlike a trait, a disposition is not a directly observable property, rather it is a potential property of the system. For example, a protein may have the evolutionary disposition of instability where its structure may change dramatically when mutated. Designing the dispositions of the evotype is a challenge fundamental to engineering biology.

For all but the very simplest biosystems, it is impractical to enumerate every potential evolutionary disposition, just as it is impossible to consider every trait of the phenotype. Instead, an appropriate sample of the evotype must be used for the purpose at hand, just as samples of traits are used when describing the phenotype. How we take this sample, and thus the scope of the evotype covered, should be determined by knowledge of the design type, its intended function and the context in which it will be used. This could include the size of population, environment and required number of generations over which the system must operate reliably.

Broadly speaking, we may wish to seek one of two goals when designing the evotype: the first is that of evolutionary stability, where a system changes its function as little as possible, as it evolves during use; the second is specific evolvability, where the system can easily evolve new phenotypes of a specific class (i.e., the classes of function specified by the engineer) or adapt to changes in the environment (i.e., continuing to produce a desired chemical product). Specific evolvability requires an element of robustness: core functions of the phenotype must remain unperturbed throughout sequence space so that new phenotypes can be explored. This is analogous to natural evolvability²¹, where the ability to generate novel phenotypes alone is insufficient, as they must also be adaptive. It also relates to the concept of plasticity, which is the ability to generate new features without total loss of function²². The relationship between robustness and evolvability in natural evolution has been explored in detail in prior literature²³. For example, a genetic circuit may have been specified to produce an OR logic function in response to two input chemicals. That is, it expresses an output protein if either one of the two input chemicals are present. A population with an evolutionarily stable version of this circuit is likely to maintain an OR function during use. A specifically evolvable version of the circuit on the other hand might be designed to readily produce other logic functions when evolved (e.g., AND, NOR, and NOT), without simply destroying existing functionality or causing lethality to the host cell.

Whether evolutionary stability or specific evolvability is the goal, it can be achieved through engineering genetic variation, the production of function from genotype and both natural and artificial selection. How these processes interact to constrain and bias evolution can be understood by describing a landscape surrounding the design type in sequence space. We term this landscape the evotype, which extends and generalises the fitness landscape concept as applied to natural systems²⁴ by accounting for the roles of variation, production of function and selection (both natural and artificial) in engineered biosystems. The bioengineer’s goal is to sculpt the evotype’s landscape to their specification, to ensure it has a structure in line with their requirements.

Engineering genetic variation

The processes of genetic mutation and recombination are often considered to be random in nature. However, the types of variation that can occur and their associated probabilities are often heavily biased and constrained by the biochemistry of the biosystem itself, limiting the paths accessible to evolution²⁵. As these constraints are partly determined by the biosystems genotype, genetic variation is something that can, in theory, be genetically engineered. For example, not all point mutations are equally likely; transversions and transitions differ in their likelihoods²⁶, and methylation²⁷, genomic context²⁸ and species²⁹ all influence local and global mutation rates. Furthermore, algorithmic mutations³⁰ may occur. These are mutations that result in changes of several nucleotides in one event (thus, an algorithm can describe the change) and can be thought of as shortcuts through sequence space (Fig. 1b, left). The likelihood of an algorithmic mutation may be much greater than the summed likelihoods of the equivalent sequence of individual point mutations. For example, the chance of an insertion of the two-base motif ‘AC’ into a tandem repeat region due to slipped-strand mispairing may be more likely than two insertion events of ‘A’ and ‘C’ occurring independently³¹. Recombination³² and mobile genetic elements³³ are other examples of biological processes capable of producing algorithmic mutations.

Sequence space is therefore not explored in a uniformly random way, even discounting for the role of selection. Instead, the paths evolution can take are determined by the ‘variation operator set’, which defines all the different point and algorithmic mutations that can occur in the system. Each variation operator in this set has an associated probability distribution that represents the likelihood of arriving at a given sequence from another (i.e., by this operator acting on the design type). The distributions of the variation operator set combine to produce the ‘variation probability distribution’. This describes the chance of arriving at any given sequence from the design type due to all the biochemical and physical processes capable of causing genetic variation that are present in the system (Fig. 1b, right). The variation operator set defines the rate and the likely directions in sequence space a design will explore during evolution. As a design type evolves, the variation probability distribution changes, as further dispositions become available.

The variation operator set depends on the specifics of the biosystem being engineered and the set to be applied in practice is dependent on available knowledge of the system. For example, the variation operator set of a design-type biosystem may be said to include transition mutations, transversion mutations and recombinations, each associated with a unique probability that varies across the design type’s sequence. A sample population can be generated by applying the operator set to the design type. This population, with the design type at its centre, may be named a quasispecies, as is used for the related concept in viral evolution³⁴.

The variation probability distribution can be considered at all stages of the design process: from specifying mutation rates of specific parts, designing new biochemical mechanisms capable of specific forms of genetic variation and thinking of genetic variation as a feature of a system that can be designed and built. Such integration would allow for global and local mutation rates to be specified as part of the design and standardised mutation rates could even be listed in part datasheets³⁵. It is likely that improvements in the prediction of mutation probabilities will be made with the increasing availability of sequence data and associated computational methods. Furthermore, some design rules for influencing local genetic variability are already known (e.g., avoiding the reusing of parts and repetitive sequences to reduce homologous recombination and indel mutations)^36,37,38, and global mutation rates can also be rationally engineered and manipulated^12,39.

A large toolkit for controlling genetic variation has already been created by bioengineers, which could be used to improve evolutionary stability or increase specific evolvability (i.e., the ability of the biosystem’s evolution to be directed as the designer intended). New tools will doubtless be developed from the diverse mechanisms that generate genetic variation in nature. The variation probability distribution of the design type can be modified by either adding or removing variation operators (e.g., by adding or removing DNA modifying enzymes) or by modulating existing operators in the system across the genotype. This may be through altering DNA sequence properties (e.g., avoiding simple sequence repeats to reduce the chance of indels through slipped-strand mispairing³⁶). Variation operators can be highly targeted like the DNA methylation of specific bases to increase likelihood of mutation through spontaneous deamination²⁷ or may have a global effect such as the removal of error-prone polymerases from a host organism⁴⁰. Orthogonal mutation systems that modulate genetic variation of a specific plasmid or region of DNA can be used to overcome genomic error thresholds, increasing the potential for directed evolution⁴¹.

Larger-scale genetic variation can be achieved through mechanisms such as site-specific recombination, which can be used for inserting, removing, duplicating, inverting or shuffling large segments of DNA, exemplified by the SCRaMbLE system used in the synthetic yeast Sc2.0⁴². Finally, acquisition of foreign DNA either from other organisms in the population through sex, horizontal gene transfer or from free oligonucleotides in the environment¹² may also be engineered. The recombinant approaches of genetic engineering can be thought of as a highly orchestrated form of horizontal gene transfer, which is also increasingly being acknowledged as a source of innovation in natural evolution. For example, it is a major mechanism used by bacteria to acquire antibiotic resistance⁴³. As with sexual recombination, it enables large jumps through sequence space. This increases the breadth of search and potentially enables the crossing of valleys in the evolutionary landscape to access peaks that would otherwise be inaccessible.

By combining these and other biochemical tools, it may eventually be possible to precisely design the variation operator set to produce complex combinations of genetic variation. For example, the variation operator set of a genetic circuit may be engineered by avoiding repeated parts (removing the homologous recombination operator), using a host with a high-fidelity DNA polymerase (globally reducing probability of point mutations), and by incorporating DNA recombination sites (adding an operator for specific DNA recombination, perhaps to be used for future directed evolution). Table 2 provides some examples of methods for controlling variation operators that have been developed so far.

Table 1 Role of genotype, phenotype and evotype when describing biological systems.

Full size table

Engineering the production of function

Genotypes produce phenotypes via the processes of gene expression, growth and development. However, due to the constraints and biases of these processes⁴⁴, phenotypes are not necessarily distributed evenly throughout sequence space and not all conceivable phenotypes may be possible. Furthermore, in the same way that multiple genotypes can achieve the same phenotype, a population of cells with identical genotypes can also potentially display many different phenotypes due to the stochastic nature of the underlying processes⁴⁵ or their sensitivity to environmental fluctuations (e.g., displaying chaotic dynamics⁴⁶). Many systems have shown similar properties in the structure of their mapping from genotype to phenotype. Namely, redundancy (there are many more genotypes than phenotypes) and bias (a small fraction of phenotypes are over-represented). This has been shown both through simulation⁴⁷ and experimentally in RNA and protein structures⁴⁷, and DNA-binding sites⁴⁸. How these principles apply to more complex biosystems is a major challenge due to their vast genotype spaces. Nevertheless, there will be a statistical structure in the mapping of genotype to phenotype. If this structure is sufficiently well understood, it could offer a powerful way of engineering the evotype.

Engineered biosystems have phenotypic traits that influence reproductive fitness and traits that influence ‘function’—the behaviour or properties specified by the designer (although these are not mutually exclusive). The structure of the mapping from genotype space to function space is therefore a key part of the evotype. Function space may be discrete or it may be continuous (Fig. 1c). In complex systems, such as biochemical networks, even continuous variation of parameters during evolution can result in the production of identical functions or cause phase changes where qualitatively different functions arise⁴⁹. Designed functions could be literal mathematical functions, physical characteristics such as colour or size, or combinations of several properties.

Any designed system has a degree of ‘utility’—the extent to which the system fulfils the specified function. The sole goal of a traditional engineering design process is to maximise the utility of the design type. However, the topology of the function landscape surrounding the design type is also important. It may be rugged and highly variable with the function rapidly changing across sequence space or it may be smooth and have large neutral regions where function changes little or remains constant. Whether the goal is to evolve novel functions or to tune the parameters of an existing one, these properties are a key design consideration: what is the functional range to be covered by the design type’s evotype? Should the variation be large for increased evolvability, or limited, for evolutionary stability? Most likely, the function landscape should be smooth and predictable, but how is this best achieved? Which regions of function space must be avoided, and which can be tolerated? For example, it may be necessary to reduce irrelevant or harmful functions as much as possible in a diagnostic application where regions of function space cause false negatives, whereas regions causing false positives can be tolerated.

Designs may have identical functionality but occupy regions of function space with very different topological properties. If a system is designed without considering its surrounding function landscape, a design with an undesirable evotype may be a likely outcome. Systems with identical phenotypes, yet differing function landscapes, were demonstrated by Schaerli et al.⁵⁰, who designed two genetic circuits, both with the same strip-generating function. It was found that each produced a different spectrum of new phenotypes when mutated due to differences in the regulatory mechanisms used⁵⁰. We are only just beginning to understand what influences the structure of the mapping from genotype to phenotype. However, there are some general principles, which seem to hold across scales and contexts. Fortunately, many of these principles are already familiar to engineering (Table 3).

Prevalent phenotypes

Phenotypes that are more prevalent in sequence space can be both more robust (as they are more likely to be in genotype networks sharing the same phenotype), and more evolvable (as this allows a wider search of genotype space, increasing access to more novel phenotypes)²³. Therefore, choosing prevalent components may aid both evolutionary stability and specific evolvability. For example, if designing a protein, the codon chosen may influence evolvability: for a leucine residue, if UAA is chosen, its 1-mutant neighbourhood has a lower prevalence for leucine than any other codon (two vs. four, respectively). Therefore, UAA may have lower evolutionary stability but higher specific evolvability than other codons (as it is able to generate a wider range of non-polar amino acids). Remapping the genetic code itself has been suggested as a way of altering its evolvability⁵¹. Other examples of applying phenotypic prevalence include choosing RNA or protein structures that are highly represented in sequence space^52,53. An interesting question is how the phenotypic prevalence of a system’s parts relates to the overall phenotype compared to higher-order properties? Is the robustness of a genetic circuit’s parts or its network topology a greater determinant of its overall robustness?

Redundancy

Redundancy is used in classical engineering and by evolution. It can add robustness by allowing variation of parts of a system without overall loss of function and can aid in evolvability by enabling redundant parts to mutate and thus explore new regions of function space. This can be seen in serial homology, where repeated parts such as the limbs or teeth enable evolution of specialised functions⁵⁴, in gene duplications⁵⁵ and in the scale-free structure of genetic networks where most nodes can be removed without altering the overall function⁵⁶. It is noteworthy that redundant parts may either be repeats of the same element or different elements that can produce the same function (often termed degeneracy⁵⁷).

Modularity, regularity and hierarchy

The organisational properties of a biosystem are a major influence on its evotype. These can be summarised by modularity, regularity and hierarchy⁵⁸. Modularity is the division of a system into subsystems (modules), where each has a high degree of internal connectivity, but little interdependency between subsystems. Examples of this can be seen in the connectivity of protein and regulatory networks, in RNA structures and in limb development⁵⁹. Regularity is the use of patterns, repetitions and symmetries (e.g., serial homology and animal body plans). Hierarchy is the recursive arrangement of a system into subsystems (that are themselves composed of subsystems, etc.)⁶⁰. For example, an organism is composed of organs, which is in turn composed of tissues, cells, etc. Hierarchy is also seen in gene regulatory networks. For example, only nine proteins regulate half of all genes of Escherichia coli⁴¹.

These principles are distinct but often work together. For example, identical modules are often repeated in regular patterns and modules are arranged in a hierarchical structure. These principles may each promote evolvability in different ways. Modularity allows parts of a system to mutate and change function with a reduced negative impact on the rest of the system. Efforts to improve the modularity of genetic systems have been made by synthetic biologists by standardising and increasing orthogonality between parts. Regularity reduces the information required to describe the system (e.g., its genotype), essentially reducing the size of the search space. Hierarchy allows the progressive increase in the complexity of a system from the bottom up⁶⁰. Although the widespread use of these principles in both biology and technology clearly demonstrates their importance, how and where these principles should be applied is context specific. This can be illustrated with an imagined example.

Consider two biosensor circuits that each use red, green and blue (RGB) fluorescent proteins to produce a white output. In circuit A, the overall output of RGB should be as high as possible when the input is positive (e.g., high sensitivity is required), the whiteness of the signal is less critical. In circuit B, it is important that the positive signal remains precisely white (e.g., other colours represent other input conditions) and the overall output level is less critical. Circuit A would benefit from a modular arrangement of RGB, because a mutation in any one of these genes does not affect the other two, thus reducing the impact on overall output. However, for circuit B, a less modular design would be preferable: although a mutation would have triple the effect on overall output, all colours would be impacted equally conserving the overall hue. The nature of the design problem therefore relates to how modularity should be used. In fact, Kashtan and Alon⁶¹ showed that modular architectures evolve in gene regulatory networks, in response to modular environmental selection pressures, and themselves prove to be more evolvable⁶¹. Similar relationships between how hierarchy and regularity should relate to the design problem, no doubt, exist. However, we are far from having design principles for their application, in particular for more complex problems.

Environmental robustness

Principles that improve robustness to environmental change or noise may also improve robustness to genetic change. For example, if a genetic circuit is robust to noise in the concentration of a regulatory protein, it may also be more robust to mutations that change the promoter’s expression level⁶². Similarly, proteins that are more thermodynamically stable may also be more evolvable⁶³. Systems could be buffered against environmental and genetic perturbations through the use of a negative feedback^64,65,66, tunable genetic parts⁶⁷, stringent multi-level regulation⁶⁸ or the application of other control engineering principles⁶⁹.

Designing parameter space

The structure of the parameter space of a system plays a large role in how function changes under genetic variation. If the behaviour of a system can be modelled or inferred against the variation of key parameters, this can provide information about which functions may be accessible and most likely throughout sequence space. For example, by modelling the regulatory mechanisms of two genetic circuits, Schaerli et al.⁵⁰ explain why they produce different distributions of functions when subjected to point mutations. Similarly, with a simple mathematical model of equilibrium binding, Mayo et al.²² showed that the cis-regulatory region of the lac operon in E. coli is incapable of accessing some input functions via point mutation. Parameter spaces are analogous to the morpho-spaces of evolutionary-developmental biology, which provide constraints on organismal form⁷⁰.

Other principles

System-specific principles may also provide design rules for the evotype. For example, RNA gene regulation may be less evolvable than transcriptional regulation⁷¹ and so could determine whether the regulation is applied at the transcriptional or translational level. Physical and chemical processes of self-organisation may even be able to reduce a function’s dependency on the genotype. Perhaps ideas from developmental biology and morphogenesis could be recast into engineering terms, such as concepts from the theory of facilitated variation⁷², in particular as bioengineering progresses to multicellular organisms. Metaheuristic design approaches will also no doubt become an increasingly powerful tool: machine learning approaches may be able to predict the evolvability of biological networks⁷³ and genetic algorithms have been used to evolve more robust genetic networks in silico⁷⁴.

Engineering natural and artificial selection

Selection is the force that gives the otherwise random (but constrained) processes of genetic variation a ‘direction’ by driving a population up the slopes of the adaptive landscape⁷⁵. Uniquely, an engineered biosystem is a result of two forms of selection: natural selection and the design process. Natural selection acts on reproductive fitness of the biosystem and the design process can be thought of as a sophisticated form of artificial selection acting on its utility. Fitness and utility both form part of the evotype and understanding the interplay between these two processes is critical for effective evotype design, as there is often a tension between the two (Fig. 1d). If fitness and utility are uncorrelated, then natural selection is likely to undo the work of the engineer. However, if fitness and utility are highly correlated, then natural selection will also increase utility⁷⁶. For example, one might design a cell in a bioreactor or a plant crop to produce a chemical product. Perhaps, it uses control circuitry to maintain optimal metabolic fluxes to maximise yield in fluctuating environmental conditions, thus resulting in high utility. However, this will inevitably have a fitness effect on the organism (e.g., due to the metabolic burden of the circuit or toxicity of the product) and, thus, natural selection will favour mutants where this functionality is repressed. It should be noted that natural selection here is meant as the process that acts on the reproductive ability of the biosystem. Neither the environment nor biosystem need to be natural (e.g., the organisms could be engineered to make use of non-canonical amino acids and grown within a bioreactor). The critical distinction is that natural selection acts on survival of the biosystem without the input of the engineer.

The aim of a bioengineer then, is to maximise ‘fitneity’—defined as a function that combines the derivatives of utility and fitness (Fig. 1d). Ultimately, evotype engineering is controlling how fitneity changes throughout sequence space: it is the sculpting of the fitneity landscape. Exactly what form the fitneity function should take and the best way to mathematically describe the fitneity landscape to effectively capture the interaction between these two forms of selection are not yet clear. Defining, modelling and characterising the fitneity landscapes of designed biological systems is a future avenue of research ripe with potential. Nevertheless, the concept can already help in thinking about how to improve the fitneity of designs on an intuitive level.

To design for evolutionary stability, it is sufficient to limit or neutralise the impact of natural selection. This can be achieved in one of the following three ways. First, the fitness of the design type and its immediate neighbours can be increased to create a local peak or plateau. This could be done through adaptive evolution^77,78 after the design phase, reducing or dynamically controlling burden⁶⁵, or by reducing toxicity of the associated function. Second, the fitness of neighbouring genotypes can be decreased to flatten the surrounding fitness landscape, e.g., by using organisms with a reduced genome that may be less fit than wild-type organisms⁷⁹, but with freed-up metabolic resources⁸⁰. Third, the utility landscape can be flattened so that even if there is a natural selection pressure away from the design type, it is less likely to impact the design’s function. Approaches for doing this have been outlined in the previous section. To ensure a specifically evolvable evotype, fitness and utility must correlate: both fitness and utility must slope in the same direction. An engineer could do this by coupling function to survival, perhaps through a toxin–anti-toxin system⁸¹ or by coupling function to growth (e.g., by having the product of a system aid in metabolism of an energy source). Alternatively, an artificial environmental pressure, such as repeated screening, could be used to ensure utility and fitness correlate. Common methods to engineer selection are shown in Table 4.

Toward evotype engineering

The evotype is a new way to think about the properties of engineered biosystems and how they relate to each other (Table 3). It is a framework for thinking about an important but often overlooked property: the role the biosystem itself plays in its future evolution. This is especially critical due to the impact of an intervention (e.g., a new mutational method) being closely linked to the composition of the system itself. For example, in a simple case, an identical protein could be encoded by very different sequences and so be impacted by a targeted mutating element in different ways. This is quite different to how engineers normally view systems. As engineered biosystems are the result of both human creativity and natural adaptation, a holistic consideration of both the roles of design and evolution is necessary. The evotype helps us do this by explicitly considering the intertwined effects that genetic variation, production of function and multiple forms of selection will have on a design (Fig. 1).

Table 2 Methods to engineer genetic variation.

Full size table

Table 3 Methods to engineer the production of function.

Full size table

Table 4 Methods to engineer natural and artificial selection.

Full size table

We can now design and build genotypes with great precision, but we must account for the inevitable processes of genetic variation that will follow. The statistical structure of variation is unique to each biosystem and something we have control over. Yet, understanding the details of genetic variation is insufficient if we do not understand how this will manifest in changes of the designed function of the biosystem as well. Even a system with low mutation rates can be evolutionarily unstable if function changes wildly with small sequence alterations. Similarly, directed evolution will not be successful, despite the mutation strategy, if desired functions are simply not accessible from the starting point. If the biosystem’s utility (i.e., its success as a design) and its fitness (i.e., its success as a biological replicator) are at odds, well-designed dispositions for variation or function might not save the design from the pressure of natural selection. This must also be understood as a conflict between utility and fitness landscapes across sequence space surrounding the original design type. It is clear then that all three of the aspects of the evotype must be considered together and all offer significant scope for engineering. For instance, imagine a large genetic circuit that places an unavoidably high metabolic burden on the host cell. If it is crucial that the function of the circuit is maintained over long periods of time, then redundancy could be used to accommodate unavoidable mutations. However, if the dent to reproductive fitness is severe, this may still not be enough. Therefore, combining redundancy in the design with a hyper-stable host cell (e.g., one where all mobile genetic elements have been deleted and efficient DNA repair mechanisms are present⁸²) might be the only way to achieve the desired goal for the system.

Designing biosystems with evolution in mind is a vital step towards a more complete engineering theory of biology. However, to be practical, supporting tools must exist that can provide key information regarding the genetic variation, genotype-function map and selective pressures within a biosystem. Advances in sequencing offer a means to quantitatively measure millions of genotypes in parallel⁸³ and when combined with high-throughput techniques, such as fluorescence-activated cell sorting, make it possible to infer simplified genotype-function maps^84,85. The local function landscapes of the green fluorescent protein⁸⁶ and transcription factor-binding sites⁴⁸ have already been characterised experimentally with such methods. Detailed measurements of fitness in large populations of cells are also possible^87,88,89. By combining sequencing with expression and growth measurements, genetic variation, function and fitness could be characterised simultaneously to provide a complete picture of the evotype.

Even so, the vastness of evotype landscapes and the need for functions calculated from many outputs of a system mean that new methods with greater throughputs are also necessary^85,90. There is a particular need for methods able to measure many characteristics of each cell simultaneously (e.g., via automated high-content microscopy⁹¹ or high-throughput Raman spectroscopy⁹²). Parallel to these experimental methods, a promising direction to bypass the need to directly measure these properties are the development of sufficiently comprehensive computational models (e.g., encompassing whole cells⁹³) to allow for a mechanistic understanding of the biases in processes related to variation and reproductive rate. In these cases, if they are sufficiently accurate, the evotype could be predicted and used within computer-aided design workflows⁹⁴ to reduce the need to physically build every possible design.

Nevertheless, for systems of even moderate complexity, the evotype landscapes are much too vast to be exhaustively characterised or even modelled. It will therefore be of great importance to understand how they should be sampled⁹⁵, how large a region of the landscape needs to be characterised and to what extent local landscape properties can be extrapolated. The use of machine learning is another method that holds great promise for increasing the ability to estimate the evotype landscape, albeit at the cost of mechanistic knowledge of the system.

Epistasis poses a particular challenge to the prediction and the engineering of evotypes, as it means even a small number of mutations can have large effects that are difficult to predict⁹⁶. In these cases, the engineer may have little choice but to limit the likelihood of such point mutations occurring and to use more constrained variation operators that act at a structural level (thus, smoothing and reducing the dimensionality of the evolutionary search space), such as the recombination of insulated parts. However, some evidence suggests that biologically relevant fitness landscapes may in fact occupy a low dimension of total sequence space⁹⁷. This offers hope that as least in some contexts, evotypes can be characterised, predicted, and designed with some accuracy. The predictability of evolution is one of the most important and challenging unanswered questions in the study of natural biological systems⁹⁸. However, engineers have the advantage of being able to design systems to suit their needs. One way to do this is to design systems to maximise forms of evolution that can be predicted and minimise those that cannot.

In addition to characterising evotypes, tools for bioengineers to directly sculpt their landscapes must also be available (Fig. 2). Here we have touched upon the numerous repurposed biomolecular components that can alter the types of possible variation (Table 2). However, there is a spectacular diversity of molecular machines dedicated to manipulating genetic information in the natural world, suggesting a need for an even larger toolkit to precisely modify genetic variation as needed. Likewise, principles for constraining and biasing the production of function (Table 3) and for controlling selection pressures (Table 4) have been suggested, but they are still poorly understood and have barely been applied rigorously in an engineering setting. It is also important to recognise that engineers may not always be in a position to influence all aspects of the evotype. For example, if function and survival cannot be linked effectively (i.e., selection cannot be engineered in a necessary way), then only variation and/or function are available to the designer. Thus, the practical constraints of a given design problem will often determine which evolutionary design methods are available or are most appropriate.

**Fig. 2: Engineering evotypes by sculpting their landscapes.**

We have been careful throughout this work to clarify the differences between natural evolution based on natural selection (i.e., fitness) and artificial evolution based on our own forms of selection. However, there are cases where their differences become blurred. For example, when an engineered biological system has reproductive success coupled to utility, is this system naturally or artificially evolving? Much of these confusions stem from semantics of the framework used to interpret the system and here we have explicitly shown that in engineered biology the term “evolution” is almost always a mix of both natural and artificial contributions. Moving forward, ensuring that the terminology we use is consistent and clear will be crucial for supporting the robust development of an engineering theory encompassing all forms of evolution.

Traditional engineering disciplines have developed various methods that are somewhat analogous to some of the evolutionary principles outlined in this work. For example, the use of modular parts and hierarchical designs, fault tolerance, factors of safety and redundancy to improve robustness, the reuse of parts and building in tolerance to variation. However, there are clearly differences in exactly how and where these principles are applied in biological vs. engineered systems. A deeper understanding of the relationship between engineering principles and their evolutionary counterparts is needed. Better awareness of how evolution applies these principles will improve our ability to engineer all types of complex systems, in particular those that evolve.

It is also crucial to recognise that evolution is not the only challenge faced when engineering biology. Unlike many of the substrates we commonly build with, biology is highly complex even at its simplest level (e.g., single cells), with changing and growing components that can deform and exhibit intricate phase transitions due to the many nonlinear interactions present. Although our focus here has been solely on evolution, an ability to effectively engineer living systems will require a holistic approach that considers and integrates these other aspects and goes far beyond current working practices.

Another area of growing importance in biological engineering is the development and adoption of standards to facilitate improved exchange and reuse of engineered biological parts and systems⁹⁹, as well as data associated with these¹⁰⁰. Standards are also pervasive in natural biology with an example being the use of a (mostly) common genetic code that aids the exchange and reuse of genetic material in the wild. To date, evolution has not featured prominently in standardisation efforts, but could be key to the collection of information about biological parts and systems (e.g., in terms of mutation rates, operator sets, function sets, selection strengths and pressures, evolutionary stability, etc.), which will support the future engineering of evolution.

The lens of engineering offers a fresh perspective on evolutionary theory. It is also a new way of thinking about what it is that engineers do and what the design process is in the context of bioengineering. The concept of the evotype, with some modifications, may also find use in evolutionary science, where it offers a framework for considering the mechanistic constraints of evolution and a way of talking about the evolutionary characteristics of organisms. It may also be applied beyond biological engineering fields to create new self-adaptive technologies. In that context, the framework could be applied to ask how we design technologies to evolve and not just how to engineer systems that already do.

References

Andrianantoandro, E. et al. Synthetic biology: new engineering rules for an emerging discipline. Mol. Syst. Biol. 2, 2006.0028 (2006).
Article PubMed PubMed Central Google Scholar
Grozinger, L. et al. Pathways to cellular supremacy in biocomputing. Nat. Commun. 10, 1–11 (2019).
Article CAS Google Scholar
Dobzhansky, T. Nothing in biology makes sense except in the light of evolution. Am. Biol. Teach. 35, 125–129 (1973).
Article Google Scholar
Renda, B. A., Hammerling, M. J. & Barrick, J. E. Engineering reduced evolutionary potential for synthetic biology. Mol. BioSyst. 10, 1668–1678 (2014).
Article CAS PubMed PubMed Central Google Scholar
Ellis, T. Predicting how evolution will beat us. Microb. Biotechnol. 12, 41–43 (2019).
Article PubMed Google Scholar
Fernandez-Rodriguez, J., Yang, L., Gorochowski, T. E., Gordon, D. B. & Voigt, C. A. Memory and combinatorial logic based on DNA inversions: dynamics and evolutionary stability. ACS Synth. Biol. 4, 1361–1372 (2015).
Article CAS PubMed Google Scholar
Yokobayashi, Y., Weiss, R. & Arnold, F. H. Directed evolution of a genetic circuit. Proc. Natl Acad. Sci. USA 99, 16587–16591 (2002).
Giver, L., Gershenson, A., Freskgard, P.-O. & Arnold, F. H. Directed evolution of a thermostable esterase. Proc. Natl Acad. Sci. USA 95, 12809–12813 (1998). A landmark work on the directed evolution of proteins – here used to improve the thermal stability of an enzyme.
Article ADS CAS PubMed PubMed Central Google Scholar
Boder, E. T., Midelfort, K. S. & Wittrup, K. D. Directed evolution of antibody fragments with monovalent femtomolar antigen-binding affinity. Proc. Natl Acad. Sci. USA 97, 10701–10705 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Esvelt, K. M., Carlson, J. C. & Liu, D. R. A system for the continuous directed evolution of biomolecules. Nature 472, 499–503 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Guntas, G., Mansell, T. J., Kim, J. R. & Ostermeier, M. Directed evolution of protein switches and their application to the creation of ligand-binding proteins. Proc. Natl Acad. Sci. USA 102, 11224–11229 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, H. H. et al. Programming cells by multiplex genome engineering and accelerated evolution. Nature 460, 894–898 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Anderson, J. et al. Engineering and ethical perspectives in synthetic biology. Rigorous, robust and predictable designs, public engagement and a modern ethical framework are vital to the continued success of synthetic biology. EMBO Rep. 13, 584–590 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wright, O., Stan, G.-B. & Ellis, T. Building-in biosafety for synthetic biology. Microbiology (Reading) 159, 1221–1235 (2013).
Article CAS Google Scholar
Chan, C., Lee, J., Cameron, E., Bashor, C. & Collins, J. ‘Deadman’ and ‘Passcode’ microbial kill switches for bacterial containment. Nat. Chem. Biol. 12, 82–86 (2015).
Article PubMed PubMed Central CAS Google Scholar
Mandell, D. J. et al. Biocontainment of genetically modified organisms by synthetic protein design. Nature 518, 55–60 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Winston, M. L. The biology and management of Africanized honey bees. Annu. Rev. Entomol. 37, 173–193 (1992).
Article Google Scholar
Oye, K. A. et al. Regulating gene drives. Science 345, 626–628 (2014).
Article ADS CAS PubMed Google Scholar
Pigliucci, M. Are ecology and evolutionary biology “soft” sciences? Ann. Zool. Fennici 39, 87–98 (2002).
Google Scholar
Bartley, B. A., Kim, K., Medley, J. K. & Sauro, H. M. Synthetic biology: engineering living systems from biophysical principles. Biophys. J. 112, 1050–1058 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Kirschner, M. & Gerhart, J. Evolvability. Proc. Natl Acad. Sci. USA 95, 8420–8427 (1998).
Article ADS CAS PubMed PubMed Central Google Scholar
Mayo, A. E., Setty, Y., Shavit, S., Zaslaver, A. & Alon, U. Plasticity of the cis-regulatory input function of a gene. PLoS Biol. 4, 555–561 (2006). This work experimentally demonstrates specific evolvability by showing that mutations in a gene regulatory region can change its function without destroying it.
Article CAS Google Scholar
Wagner, A. Robustness and evolvability: a paradox resolved. Proc. R. Soc. B Biol. Sci. 275, 91–100 (2008).
Article Google Scholar
Wright, S. The Roles of Mutation, Inbreeding, Crossbreeding, and Selection in Evolution. Vol. 1, 355–366 (na, 1932).
Cano, A. V. & Payne, J. L. Mutation bias interacts with composition bias to influence adaptive evolution. PLoS Comput. Biol. 16, e1008296 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Stoltzfus, A. & Norris, R. W. On the causes of evolutionary transition:transversion bias. Mol. Biol. Evol. 33, 595–602 (2016).
Article CAS PubMed Google Scholar
Jones, P. A., Rideout, W. M., Shen, J. C., Spruck, C. H. & Tsai, Y. C. Methylation, mutation and cancer. Bioessays 14, 33–36 (1992).
Article CAS PubMed Google Scholar
Zhu, Y. O., Siegal, M. L., Hall, D. W. & Petrov, D. A. Precise estimates of mutation rate and spectrum in yeast. Proc. Natl Acad. Sci. USA 111, E2310–E2318 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Drake, J., Charlseworth, B., Charlseworth, D. & Crow, J. Rates of spontaneous mutation. Genetics 148, 1667–1686 (1998).
Article CAS PubMed PubMed Central Google Scholar
Chaitin, G. Proving Darwin: Making Biology Mathematical (Vintage, 2013).
Levinson, G. & Gutman, G. A. Slipped-strand mispairing: a major mechanism for DNA sequence evolution. Mol. Biol. Evol. 4, 203–221 (1987).
CAS PubMed Google Scholar
Vos, M. Why do bacteria engage in homologous recombination? Trends Microbiol. 17, 226–232 (2009).
Article CAS PubMed Google Scholar
Frost, L. S., Leplae, R., Summers, A. O. & Toussaint, A. Mobile genetic elements: the agents of open source evolution. Nat. Rev. Microbiol. 3, 722–732 (2005).
Article CAS PubMed Google Scholar
Eigen, M. On the nature of virus quasispecies. Trends Microbiol. 4, 216–218 (1996).
Article CAS PubMed Google Scholar
Canton, B., Labno, A. & Endy, D. Refinement and standardization of synthetic biological parts and devices. Nat. Biotechnol. 26, 787 (2008).
Article CAS PubMed Google Scholar
Jack, B. R. et al. Predicting the genetic stability of engineered DNA sequences with the EFM calculator. ACS Synth. Biol. 4, 939–943 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sleight, S. C. & Sauro, H. M. Visualization of evolutionary stability dynamics and competitive fitness of Escherichia coli engineered with randomized multigene circuits. ACS Synth. Biol. 2, 519–528 (2013). This work experimentally uncovered design principles for improving evolutionarily stability in synthetic genetic circuits in vivo.
Article CAS PubMed Google Scholar
Hossain, A. et al. Automated design of thousands of nonrepetitive parts for engineering stable genetic systems. Nat. Biotechnol. 38, 1466–1475 (2020).
Article PubMed CAS Google Scholar
Geng, P., Leonard, S. P., Mishler, D. M. & Barrick, J. E. Synthetic genome defenses against selfish DNA elements stabilize engineered bacteria against evolutionary failure. ACS Synth. Biol. 8, 521–531 (2019).
Article CAS PubMed Google Scholar
Csörgő, B., Fehér, T., Tímár, E., Blattner, F. R. & Pósfai, G. Low-mutation-rate, reduced-genome Escherichia coli: an improved host for faithful maintenance of engineered genetic constructs. Microb. Cell Factories 11, 11 (2012). This work is an example of engineering the host organism’s genome to reduce global mutation rates.
Article CAS Google Scholar
Ravikumar, A., Arzumanyan, G. A., Obadi, M. K. A., Javanpour, A. A. & Liu, C. C. Scalable, continuous evolution of genes at mutation rates above genomic error thresholds. Cell 175, 1946–1957 (2018). An orthogonal plasmid mutation system for directed evolution at elevated error rates.
Article CAS PubMed PubMed Central Google Scholar
Dymond, J. & Boeke, J. The Saccharomyces cerevisiae SCRaMbLE system and genome minimization. Bioeng. Bugs 3, 168–171 (2012). An inducible evolution system based on large-scale genomic shuffling in the synthetic yeast project Sc2.0.
PubMed PubMed Central Google Scholar
Koonin, E. V., Makarova, K. S. & Aravind, L. Horizontal gene transfer in prokaryotes: quantification and classification. Annu. Rev. Microbiol. 55, 709–742 (2001).
Article CAS PubMed PubMed Central Google Scholar
Ahnert, S. E. Structural properties of genotype– phenotype maps. J. R. Soc. Interface 14, 20170275 (2017).
Article PubMed PubMed Central CAS Google Scholar
Vogt, G. Stochastic developmental variation, an epigenetic source of phenotypic diversity with far-reaching biological consequences. J. Biosci. 40, 159–204 (2015).
Article PubMed Google Scholar
Strogatz, S. Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering (Westview, 2014).
Ferrada, E. & Wagner, A. A comparison of genotype-phenotype maps for RNA and proteins. Biophys. J. 102, 1916–1925 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Aguilar-Rodríguez, J., Payne, J. L. & Wagner, A. A thousand empirical adaptive landscapes and their navigability. Nat. Ecol. Evol. 1, 0045 (2017).
Article Google Scholar
Savageau, M. A., Coelho, P. M. B. M., Fasani, R. A., Tolla, D. A. & Salvador, A. Phenotypes and tolerances in the design space of biochemical systems. Proc. Natl Acad. Sci. USA 106, 6435–6440 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Schaerli, Y. et al. Synthetic circuits reveal how mechanisms of gene regulatory networks constrain evolution. Mol. Syst. Biol. 14, 1–18 (2018). This work experimentally demonstrates how genetic circuits with identical phenotypes can differ in their phenotype landscapes.
Article CAS Google Scholar
Pines, G., Winkler, J. D., Pines, A. & Gill, R. T. Refactoring the genetic code for increased evolvability. mBio 8, e01654–17.mBio.01654-17 (2017).
Article CAS PubMed PubMed Central Google Scholar
Schaper, S. & Louis, A. A. The arrival of the frequent: how bias in genotype-phenotype maps can steer populations to local optima. PLoS ONE 9, e86635 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Li, H., Helling, R., Tang, C. & Wingreen, N. Emergence of preferred structures In a simple model of protein folding. Science 273, 666–669 (1996).
Article ADS CAS PubMed Google Scholar
Carroll, S. In Endless Forms Most Beautiful 29–36 (Weidenfeld & Nicolson, 2006).
Zhang, J. Evolution by gene duplication: an update. Trends Ecol. Evol. 18, 292–298 (2003).
Article Google Scholar
Albert, R., Jeong, H. & Barabási, A.-L. Error and attack tolerance of complex networks nature. Nature 406, 268–382 (2001).
Google Scholar
Tononi, G., Sporns, O. & Edelman, G. M. Measures of degeneracy and redundancy in biological networks. Proc. Natl Acad. Sci. USA 96, 3257–3262 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Lipson, H. Principles of modularity, regularity, and hierarchy for scalable systems. J. Biol. Phys. Chem. 2007, 125–128 (2007).
Article Google Scholar
Wagner, G., Pavlicev, M. & Cheverud, J. The road to modularity. Focus Evo-Devo 8, 921–931 (2007).
CAS Google Scholar
Simon, H. A. In Facets of Systems Science (ed. Klir, G. J.) 457–476 (Springer, 1991).
Kashtan, N. & Alon, U. Spontaneous evolution of modularity and network motifs. PNAS 102, 13773–13778 (2005). This work uses computational models to show how evolvable modular architectures can evolve in response to modularly varying selection pressures.
Article ADS CAS PubMed PubMed Central Google Scholar
Kaneko, K. Evolution of robustness to noise and mutation in gene expression dynamics. PLoS ONE 2, e434 (2007).
Bloom, J. D., Labthavikul, S. T., Otey, C. R. & Arnold, F. H. Protein stability promotes evolvability. Proc. Natl Acad. Sci. USA 103, 5869–5874 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Afroz, T. & Beisel, C. L. Understanding and exploiting feedback in synthetic biology. Chem. Eng. Sci. 103, 79–90 (2013).
Article CAS Google Scholar
Ceroni, F. et al. Burden-driven feedback control of gene expression. Nat. Methods 15, 387–393 (2018).
Article CAS PubMed Google Scholar
Kelly, C. L. et al. Synthetic negative feedback circuits using engineered small RNAs. Nucleic Acids Res. 46, 9875–9889 (2018).
Article CAS PubMed PubMed Central Google Scholar
Bartoli, V., Meaker, G. A., di Bernardo, M. & Gorochowski, T. E. Tunable genetic devices through simultaneous control of transcription and translation. Nat. Commun. 11, 2095 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Greco, F. V., Pandi, A., Erb, T. J., Grierson, C. S. & Gorochowski, T. E. Harnessing the central dogma for stringent multi-level control of gene expression. Nat. Commun. 12, 1738 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Aoki, S. K. et al. A universal biomolecular integral feedback controller for robust perfect adaptation. Nature 570, 533–537 (2019).
Article CAS PubMed Google Scholar
Brakefield, P. M. Evo-devo and constraints on selection. Trends Ecol. Evol. 21, 362–368 (2006).
Article PubMed Google Scholar
Payne, J. L., Khalid, F. & Wagner, A. RNA-mediated gene regulation is less evolvable than transcriptional regulation. Proc. Natl Acad. Sci. USA 115, E3481–E3490 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gerthart, J. & Kirschner, M. The theory of facilitated variation | PNAS. Proc. Natl Acad. Sci. USA 104, 8582–8589 (2007).
Article ADS CAS Google Scholar
Kim, H., Muñoz, S., Osuna, P. & Gershenson, C. Antifragility predicts the robustness and evolvability of biological networks through multi-class classification with a convolutional neural network. Entropy 22, 986 (2020).
Article ADS MathSciNet PubMed Central Google Scholar
Noman, N., Monjo, T., Moscato, P. & Iba, H. Evolving Robust Gene Regulatory Networks. PLoS One 10, e0116258 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kauffman, S. A. In The Origins of Order: Self-Organization and Selection in Evolution 33–120 (Oxford Univ., USA, 1993). A seminal work on NK fitness landscapes that illustrates how the statistical properties of fitness landscapes can constrain evolution.
Dekel, E. & Alon, U. Optimality and evolutionary tuning of the expression level of a protein. Nature 436, 588–592 (2005).
Article ADS CAS PubMed Google Scholar
Wannier, T. M. et al. Adaptive evolution of genomically recoded Escherichia coli. Proc. Natl Acad. Sci. USA 115, 3090–3095 (2018).
Article CAS PubMed PubMed Central Google Scholar
Springman, R., Molineux, I. J., Duong, C., Bull, R. J. & Bull, J. J. Evolutionary stability of a refactored phage genome. ACS Synth. Biol. 1, 425–430 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kurokawa, M., Seno, S., Matsuda, H. & Ying, B.-W. Correlation between genome reduction and bacterial growth. DNA Res. 23, 517–525 (2016).
Article CAS PubMed PubMed Central Google Scholar
Martínez-García, E., Nikel, P. I., Aparicio, T. & de Lorenzo, V. Pseudomonas 2.0: genetic upgrading of P. putida KT2440 as an enhanced host for heterologous gene expression. Micro. Cell Fact. 13, 159 (2014).
Article CAS Google Scholar
Stieber, D., Gabant, P. & Szpirer, C. Y. The art of selective killing: plasmid toxin/antitoxin systems and their technological applications. BioTechniques 45, 344–346 (2008).
Article CAS PubMed Google Scholar
Umenhoffer, K. et al. Genome-wide abolishment of mobile genetic elements using genome shuffling and CRISPR/Cas-assisted MAGE allows the efficient stabilization of a bacterial chassis. ACS Synth. Biol. 6, 1471–1483 (2017).
Article CAS PubMed Google Scholar
Reuter, J. A., Spacek, D. V. & Snyder, M. P. High-throughput sequencing technologies. Mol. Cell 58, 586–597 (2015).
Article CAS PubMed PubMed Central Google Scholar
Sharon, E. et al. Inferring gene regulatory logic from high-throughput measurements of thousands of systematically designed promoters. Nat. Biotechnol. 30, 521–530 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gilliot, P.-A. & Gorochowski, T. E. Sequencing enabling design and learning in synthetic biology. Curr. Opin. Chem. Biol. 58, 54–62 (2020).
Article CAS PubMed Google Scholar
Sarkisyan, K. S. et al. Local fitness landscape of the green fluorescent protein. Nature 533, 397–401 (2016). The first high-throughput experimental characterisation of the (partial) fitness landscape of a protein.
Article ADS CAS PubMed PubMed Central Google Scholar
Nevozhay, D., Adams, R. M., Itallie, E. V., Bennett, M. R. & Balázsi, G. Mapping the environmental fitness landscape of a synthetic gene circuit. PLOS Comput. Biol. 8, e1002480 (2012).
Article CAS PubMed PubMed Central Google Scholar
Cira, N. J., Pearce, M. T. & Quake, S. R. Neutral and selective dynamics in a synthetic microbial community. Proc. Natl Acad. Sci. USA 115, E9842–E9848 (2018).
Article CAS PubMed PubMed Central Google Scholar
van Opijnen, T., Bodi, K. L. & Camilli, A. Tn-seq: high-throughput parallel sequencing for fitness and genetic interaction studies in microorganisms. Nat. Methods 6, 767–772 (2009).
Article PubMed PubMed Central CAS Google Scholar
Gorochowski, T. E. & Ellis, T. Designing efficient translation. Nat. Biotechnol. 36, 934–935 (2018).
Article CAS PubMed Google Scholar
Pepperkok, R. & Ellenberg, J. High-throughput fluorescence microscopy for systems biology. Nat. Rev. Mol. Cell Biol. 7, 690–696 (2006).
Article CAS PubMed Google Scholar
Goda, K. et al. High-throughput single-microparticle imaging flow analyzer. Proc. Natl Acad. Sci. USA 109, 11630–11635 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Marucci, L. et al. Computer-aided whole-cell design: taking a holistic approach by integrating synthetic with systems biology. Front. Bioeng. Biotechnol. 8, 942 (2020).
Article PubMed PubMed Central Google Scholar
Nielsen, A. A. K. et al. Genetic circuit design automation. Science 352, aac7341–aac7341 (2016).
Article PubMed CAS Google Scholar
du Plessis, L., Leventhal, G. E. & Bonhoeffer, S. How good are statistical models at approximating complex fitness landscapes? Mol. Biol. Evol. 33, 2454–2468 (2016).
Article PubMed PubMed Central CAS Google Scholar
Palmer, A. C. et al. Delayed commitment to evolutionary fate in antibiotic resistance fitness landscapes. Nat. Commun. 6, 7385 (2015).
Article ADS CAS PubMed Google Scholar
Henningsson, R., Moratorio, G., Bordería, A. V., Vignuzzi, M. & Fontes, M. DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics†. Virus Evol. 5, 1–14 (2019).
Article Google Scholar
De Visser, J. A. G. M. & Krug, J. Empirical fitness landscapes and the predictability of evolution. Nat. Rev. Genet. 15, 480–490 (2014).
Article PubMed CAS Google Scholar
Beal, J. et al. The long journey towards standards for engineering biosystems. EMBO Rep. 21, e50521 (2020).
Article CAS PubMed PubMed Central Google Scholar
Schreiber, F. et al. Specifications of standards in systems and synthetic biology: status and developments in 2020. J. Integr. Bioinform. 17, 20200022 (2020).
Mozhaev, V. V. & Martinek, K. Structure-stability relationships in proteins: new approaches to stabilizing enzymes. Enzym. Microb. Technol. 6, 50–59 (1984).
Article CAS Google Scholar
Archetti, M. Genetic robustness and selection at the protein level for synonymous codons. J. Evolut. Biol. 19, 353–365 (2006).
Article CAS Google Scholar
McDonald, J. I. et al. Reprogrammable CRISPR/Cas9-based system for inducing site-specific DNA methylation. Biol. Open 5, 866–874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Nivina, A. et al. Structure-specific DNA recombination sites: Design, validation, and machine learning–based refinement. Sci. Adv. 6, eaay2922 (2020).
Romanini, D. W., Peralta-Yahya, P., Mondol, V. & Cornish, V. W. A heritable recombination system for synthetic Darwinian evolution in yeast. ACS Synth. Biol. 1, 602–609 (2012).
Article CAS PubMed PubMed Central Google Scholar
Reis, A. C. et al. Simultaneous repression of multiple bacterial genes using nonrepetitive extra-long sgRNA arrays. Nat. Biotechnol. 37, 1294–1301 (2019).
Article CAS PubMed Google Scholar
Umenhoffer, K. et al. Reduced evolvability of Escherichia coli MDS42, an IS-less cellular chassis for molecular and synthetic biology applications. Microb. Cell Factories 9, 38 (2010).
Article CAS Google Scholar
Nyerges, Á. et al. CRISPR-interference-based modulation of mobile genetic elements in bacteria. Synth. Biol. 4, ysz008 (2019).
Article CAS Google Scholar
Le Breton, Y., Mohapatra, N. P. & Haldenwang, W. G. In vivo random mutagenesis of Bacillus subtilis by use of TnYLB-1, a mariner-based transposon. Appl. Environ. Microbiol. 72, 327–333 (2006).
Article PubMed PubMed Central CAS Google Scholar
Greener, A., Callahan, M. & Jerpseth, B. An efficient random mutagenesis technique using an E. coli mutator strain. Methods Mol. Biol. 57, 375–385 (1996).
CAS PubMed Google Scholar
Badran, A. H. & Liu, D. R. Development of potent in vivo mutagenesis plasmids with broad mutational spectra. Nat. Commun. 6, 8425 (2015).
Article ADS CAS PubMed Google Scholar
Halperin, S. O. et al. CRISPR-guided DNA polymerases enable diversification of all nucleotides in a tunable window. Nature 560, 248–252 (2018).
Article ADS CAS PubMed Google Scholar
Camps, M., Naukkarinen, J., Johnson, B. P. & Loeb, L. A. Targeted gene evolution in Escherichia coli using a highly error-prone DNA polymerase I. Proc. Natl Acad. Sci. USA 100, 9727–9732 (2003).
Article ADS CAS PubMed PubMed Central Google Scholar
Vojta, A. et al. Repurposing the CRISPR-Cas9 system for targeted DNA methylation. Nucleic Acids Res. 44, 5615–5628 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hess, G., Frésard, L., Han, K., Lee, C. & Bassik, M. Directed evolution using dCas9-targeted somatic hypermutation in mammalian cells. Nat. Methods 13, 1036–1042 (2016).
Article CAS PubMed PubMed Central Google Scholar
Tyo, K. E. J., Ajikumar, P. K. & Stephanopoulos, G. Stabilized gene duplication enables long-term selection-free heterologous pathway expression. Nat. Biotechnol. 27, 760–765 (2009).
Article CAS PubMed Google Scholar
Albert, R. Scale-free networks in cell biology. J. Cell Sci. 118, 4947–4957 (2005).
Article CAS PubMed Google Scholar
Park, Y., Espah Borujeni, A., Gorochowski, T. E., Shin, J. & Voigt, C. A. Precision design of stable genetic circuits carried in highly-insulated E. coli genomic landing pads. Mol. Syst. Biol. 16, e9584 (2020).
Article CAS PubMed PubMed Central Google Scholar
Meyer, A. J., Ellefson, J. W. & Ellington, A. D. Directed evolution of a panel of orthogonal T7 RNA polymerase variants for in vivo or in vitro synthetic circuitry. ACS Synth. Biol. 4, 1070–1076 (2015).
Article CAS PubMed Google Scholar
Kylilis, N., Tuza, Z. A., Stan, G.-B. & Polizzi, K. M. Tools for engineering coordinated system behaviour in synthetic microbial consortia. Nat. Commun. 9, 2677 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
Wei, S.-P. et al. Formation and functionalization of membraneless compartments in Escherichia coli. Nat. Chem. Biol. 16, 1143–1148 (2020).
Article CAS PubMed Google Scholar
Xiang, N. et al. Using synthetic biology to overcome barriers to stable expression of nitrogenase in eukaryotic organelles. Proc. Natl Acad. Sci. USA 117, 16537–16545 (2020).
Article CAS PubMed PubMed Central Google Scholar
Richardson, S. M. et al. Design of a synthetic yeast genome. Science 355, 1040–1044 (2017).
Article ADS CAS PubMed Google Scholar
Steel, H. & Papachristodoulou, A. Low-burden biological feedback controllers for near-perfect adaptation. ACS Synth. Biol. 8, 2212–2219 (2019).
Article CAS PubMed Google Scholar
Gorochowski, T. E., Avcilar-Kucukgoze, I., Bovenberg, R. A. L., Roubos, J. A. & Ignatova, Z. A minimal model of ribosome allocation dynamics captures trade-offs in expression between endogenous and synthetic genes. ACS Synth. Biol. 5, 710–720 (2016).
Article CAS PubMed Google Scholar
Gorochowski, T. E., Van Den Berg, E., Kerkman, R., Roubos, J. A. & Bovenberg, R. A. L. Using synthetic biological parts and microbioreactors to explore the protein expression characteristics of Escherichia coli. ACS Synth. Biol. 3, 129–139 (2014).
Article CAS PubMed Google Scholar
Mittal, P., Brindle, J., Stephen, J., Plotkin, J. B. & Kudla, G. Codon usage influences fitness through RNA toxicity. Proc. Natl Acad. Sci. USA 115, 8639–8644 (2018).
Article CAS PubMed PubMed Central Google Scholar
Abil, Z., Ellefson, J. W., Gollihar, J. D., Watkins, E. & Ellington, A. D. Compartmentalized partnered replication for the directed evolution of genetic parts and circuits. Nat. Protoc. 12, 2493–2512 (2017).
Article CAS PubMed PubMed Central Google Scholar
Yang, G. & Withers, S. G. Ultrahigh-throughput FACS-based screening for directed enzyme evolution. ChemBioChem 10, 2704–2715 (2009).
Article CAS PubMed Google Scholar
Smith, G. P. & Petrenko, V. A. Phage display. Chem. Rev. 97, 391–410 (1997).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

S.D.C. was supported by the EPSRC/BBSRC Centre for Doctoral Training in Synthetic Biology grant EP/L016494/1. T.E.G. was supported by a Royal Society University Research Fellowship grant UF160357. T.E.G. and C.S.G. were supported by BrisSynBio, a BBSRC/EPSRC Synthetic Biology Research Centre grant BB/L01386X/1. This study did not involve any underlying data.

Author information

Authors and Affiliations

School of Biological Sciences, University of Bristol, Bristol, UK
Simeon D. Castle, Claire S. Grierson & Thomas E. Gorochowski
BrisSynBio, University of Bristol, Bristol, UK
Claire S. Grierson & Thomas E. Gorochowski

Authors

Simeon D. Castle
View author publications
You can also search for this author in PubMed Google Scholar
Claire S. Grierson
View author publications
You can also search for this author in PubMed Google Scholar
Thomas E. Gorochowski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Although T.E.G. and C.S.G. framed questions, suggested figures and tables, and provided direction, the core concepts and new terminology here are mostly the work of S.D.C., who also wrote the first draft. S.D.C. and T.E.G. developed the figures with input from C.S.G. T.E.G. and C.S.G edited the manuscript and supervised the work.

Corresponding author

Correspondence to Thomas E. Gorochowski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Harrison Steel and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Castle, S.D., Grierson, C.S. & Gorochowski, T.E. Towards an engineering theory of evolution. Nat Commun 12, 3326 (2021). https://doi.org/10.1038/s41467-021-23573-3

Download citation

Received: 05 January 2021
Accepted: 04 May 2021
Published: 07 June 2021
DOI: https://doi.org/10.1038/s41467-021-23573-3

This article is cited by

Modelling genetic stability in engineered cell populations
- Duncan Ingram
- Guy-Bart Stan
Nature Communications (2023)
Competition and evolutionary selection among core regulatory motifs in gene expression control
- Andras Gyorgy
Nature Communications (2023)
Steering and controlling evolution — from bioengineering to fighting pathogens
- Michael Lässig
- Ville Mustonen
- Armita Nourmohammad
Nature Reviews Genetics (2023)
Massively parallel characterization of engineered transcript isoforms using direct RNA sequencing
- Matthew J. Tarnowski
- Thomas E. Gorochowski
Nature Communications (2022)
Experimental exploration of a ribozyme neutral network using evolutionary algorithm and deep learning
- Rachapun Rotrattanadumrong
- Yohei Yokobayashi
Nature Communications (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.