Abstract
Digital organisms are computer programs that self-replicate, mutate and adapt by natural selection1, 2, 3. They offer an opportunity to test generalizations about living systems that may extend beyond the organic life that biologists usually study. Here we have generated two classes of digital organism: simple programs selected solely for rapid replication, and complex programs selected to perform mathematical operations that accelerate replication through a set of defined 'metabolic' rewards. To examine the differences in their genetic architecture, we introduced millions of single and multiple mutations into each organism and measured the effects on the organism's fitness. The complex organisms are more robust than the simple ones with respect to the average effects of single mutations. Interactions among mutations are common and usually yield higher fitness than predicted from the component mutations assuming multiplicative effects; such interactions are especially important in the complex organisms. Frequent interactions among mutations have also been seen in bacteria, fungi and fruitflies4, 5, 6. Our findings support the view that interactions are a general feature of genetic systems7, 8, 9.
Many fundamental questions in biology are difficult to address, as a consequence of the high dimensionality of genomes7, 8, 9 as well as the practical difficulties of manipulating numerous genotypes and analysing their resulting phenotypic properties. Progress has been made using microorganisms4,5,10, 11, 12, 13, 14, 15, 16, 17, 18, but these problems remain daunting. An alternative approach involves studying artificial life, in particular certain computer programs — or digital organisms — that share with real organisms the properties of self-replication, mutation, competition and evolution, as well as genomes with high dimensionality and hence indeterminate evolutionary trajectories. The use of digital organisms to address biological questions is controversial, but it can be justified on several grounds. First, artificial life allows us to seek generalizations beyond the organic forms that biologists have studied to date, all of which derive from a common ancestor and share the same basic chemistry of DNA, RNA and proteins. Maynard Smith makes the case thus19: "So far, we have been able to study only one evolving system and we cannot wait for interstellar flight to provide us with a second. If we want to discover generalizations about evolving systems, we will have to look at artificial ones." Second, digital organisms allow us to perform experiments on a scale that is unattainable with real organisms. Here we test billions of different genotypes to measure mutational effects and interactions; a recent experiment with the bacterium Escherichia coli did so using a few hundred genotypes4. Moreover, the performance of digital organisms can be measured exactly, whereas such data are subject to error, and hence loss of statistical power, in any real biological system. Third, there is growing interest in using programs that can evolve to solve complex computational problems20, 21, 22, 23. Knowing how mutations affect performance and interact with one another has important implications for setting parameters such as mutation and recombination rates, just as mutational effects and interactions can influence the evolution of these parameters in real organisms4,6,24, 25, 26, 27, 28.
Our experiments were performed using Avida, a flexible platform for research on artificial life3. Briefly, digital organisms are self-replicating computer programs that compete for central processing unit (CPU) time, which is the fuel needed for their replication. The programs mutate at random and evolve in a defined computational environment. Each digital organism has a genome length measured as the number of sequential instructions in its program. The sequence of instructions can change by mutation, including insertion and deletion events as well as point mutations that change one instruction to another. There are 28 different instructions, which can be thought of as analogous to the 20 different amino acids strung together in proteins. There is no imposed limit to the genome length of these digital organisms.
Starting from a short ancestral program, we generated 87 different 'complex' organisms by allowing replicated populations to evolve in an environment in which: (1) the baseline allocation of CPU time is proportional to genome size; and (2) certain mathematical operations, which require novel combinations of instructions, are rewarded with additional CPU time. For example, digital organisms may be rewarded for performing an 'XOR' operation ('exclusive-or' in which A or B is true, but not both) on 32-bit inputs using a series of 'NAND' operations ('not-and' in which A and B are not simultaneously true). 'XOR' is among the more complicated logical operations; 'NAND' is the only logical operator given as an instruction in Avida. In essence, these operations are a kind of metabolism that allows the digital organisms to acquire the CPU time needed for their replication. Starting from each complex organism, we derived a 'simple' organism that evolved in an environment that favoured faster replication and nothing else: (1) the allocation of CPU time is independent of genome length; and (2) mathematical operations are not rewarded. Thus, we define simple and complex organisms by the different environments in which they evolved. However, we cannot exclude the possibility that aspects other than functional complexity might contribute to differences between the two classes; for example, relaxed selection for genome size in complex organisms may promote genetic redundancy, with attendant consequences for mutational effects.
All 87 simple digital organisms have shorter genomes than their paired complex progenitors, with mean lengths of 19.8 and 91.3 instructions, respectively (Table 1). In every case, both simple and complex digital organisms became more fit than their progenitors in their respective environments, where fitness is simply an organism's replication rate. Using all 174 complex and simple organisms, we then performed two experiments to investigate the effects of mutations and their interactions with respect to fitness. In the first experiment — the 'decay test' — we generated, for each organism, every possible mutant program that contained exactly one point mutation, plus a million (or more) programs with between two and ten random mutations. For each mutant, we measured fitness relative to its unmutated parent in the parent's selective environment. Then, for each parent, we regressed the average mutant fitness W versus mutation number M using a power function: log10W = -
M
. In biological terms,
describes the rate at which fitness decays under a multiplicative hypothesis, whereas
describes the form of 'epistasis'. If
= 1, mutational effects are multiplicative; if
> 1, each successive mutation tends to reduce fitness more than previous ones (synergistic epistasis); and if
< 1, each additional mutation is progressively less damaging on average (antagonistic epistasis). The parameter
is measured without error because W = 1 when M = 0 (by definition) and every possible mutant with M = 1 is tested, whereas
is estimated from sample data. The power function fits these data very well; unlike a quadratic function4, it never predicts that the decay curve bends upwards at high M.
Table 1: Comparisons between complex and simple digital organisms of genome size and several mutational-effect parameters
Simple organisms are much more fragile than their complex counterparts with respect to the fitness effects of single mutations, as indicated by higher values of
(P < 0.0001; Table 1). This difference occurs because more mutations are lethal (prevent self-replication) in smaller genomes than in larger genomes (Fig. 1); on average, 92% and 53% of single mutations were lethal to simple and complex organisms, respectively. Partly offsetting this effect, non-lethal mutations are less damaging to the simple digital organisms; one non-lethal mutation reduces fitness by 11%, on average, in simple organisms, but by 44% in complex ones. The simple organisms do nothing except self-replicate; mutations that disrupt self-replication tend to be lethal, whereas most others are fairly harmless. The complex organisms perform mathematical functions in addition to self-replicating; most mutations that impact these functions hinder performance but are not lethal.
Figure 1: Proportion of single point mutations that are lethal for digital organisms.

Shown as a function of log10-transformed genome length. Circles, complex organisms; triangles, simple organisms.
High resolution image and legend (11K)Simple and complex digital organisms also differ in terms of
, which indicates the average form of interaction (P = 0.0011; Table1). In complex organisms, successive mutations tend to reduce fitness less than would be expected if effects were independent (
< 1), that is, complex organisms are also robust to the cumulative effect of multiple mutations. By contrast, successive mutations do not deviate significantly from multiplicative effects (
= 1) in simple organisms (P = 0.2360, Wilcoxon signed-ranks test). Figure 2 shows the difference between simple and complex digital organisms in their fitness decay curves, including the effects of both
and
.
Figure 2: Log10-transformed mean fitness as a function of number of point mutations for simple and complex classes of digital organisms.

Solid curve C shows the fitness function calculated using the average parameter values from the complex organisms. Dashed line Cm is their fitness function expected under the multiplicative model (obtained using the average
and setting
= 1). Solid curve S is the fitness function for the average simple organism. Dashed line Sm gives the corresponding function expected under the multiplicative model.
The fact that simple organisms appear to show multiplicative effects of mutations on average fitness may mean either that there is little epistasis or that epistasis is widespread but different sets of mutations interact in opposite ways, obscuring the overall signal4,27. We ran a second experiment — the 'pair test' — to distinguish between these two possibilities and gain further insight into the differences between simple and complex organisms. Instead of measuring fitness as progressively more mutations are added, as in the decay test, we examined numerous pairs of mutations by comparing the actual fitness of each double mutant with the expected fitness assuming multiplicative effects of the component mutations. The pair test shows that epistasis is more common in complex organisms than in their simple counterparts (P < 0.0001; Table 1), with 19% of all mutation pairs deviating from multiplicative effects in the average complex organism (Fig. 3a) compared with <5% in the average simple organism (Fig. 3b). Frequencies of epistatic interactions are much higher if we exclude lethal pairs, in which actual and expected fitnesses are both zero (Fig. 3c, d). Excluding lethal pairs, which are much more common in simple organisms, there is no difference between classes in the prevalence of epistasis (P = 0.4374; Table 1). In both classes of digital organisms, epistatic interactions include a mixture of synergistic and antagonistic effects, and antagonistic effects are more common than synergy. Complex organisms are more prone to synergistic effects when expressed as a percentage of epistatic interactions (P < 0.0001; Table 1), but the overall excess of antagonism is greater in complex organisms. The failure of the decay test to find a significant deviation from multiplicative effects in the simple organisms evidently reflects a combination of two factors: epistasis is infrequent, and synergistic and antagonistic interactions oppose one another.
Figure 3: Proportions of mutational pairs classified according to their interaction.

L, lethal: at least one mutation is lethal alone, as is the double mutant. M, multiplicative: neither mutation is lethal, and the relative fitness of the double mutant is exactly equal to the product of the relative fitnesses of the two single mutations. E, epistatic: the double mutant's fitness is unequal to the multiplicative expectation. S, synergistic: double mutant less fit than expected. A, antagonistic: the double mutant is more fit than expected. Average distributions are for complex organisms (a), simple organisms (b), complex excluding lethals (c), and simple excluding lethals (d).
High resolution image and legend (5K)In summary, mutations in digital organisms frequently exhibit epistasis, including a diverse mixture of synergistic and antagonistic interactions. Such interactions are especially pronounced in the complex digital organisms that evolved large genomes and are rewarded for mathematical operations beyond self-replication. Frequent epistasis, including a mix of synergistic and antagonistic effects, also exists in a variety of real organisms4, 5, 6. Thus, digital organisms experience complicated responses to genetic perturbations that appear similar to those seen in real organisms. The genomes and performances of digital organisms can be studied with far greater replication and precision than can be achieved with any real organism. Digital organisms may therefore offer a useful tool for addressing other biological questions in which complexity is both a barrier to understanding and an essential feature of the whole living system. Moreover, digital organisms allow us to test general hypotheses with a system that is built upon an artificial chemistry completely different from that used by real organisms.
Methods
Evolution of digital organisms. Experiments were performed using version 1.3 of Avida, which can be obtained from http://www.krl.caltech.edu/avida/pubs/nature99. We used default settings unless otherwise indicated. Our first step was to generate pairs of complex and simple organisms. All evolution experiments began with the population at its carrying capacity (3,600 individuals). The probability of point mutation was 0.0075 per instruction copied; the probabilities of insertion and deletion mutations were each 0.05 per genome divide. Time is measured in arbitrary units called updates; every update represents the execution of an average of 30 instructions per individual in the population. (A typical generation is 5–10 updates, depending on genome size and execution.) Starting with the default ancestor (genome length 20), a population of complex organisms was propagated for 50,000 updates in an environment in which the baseline allocation of CPU time was proportional to genome size and extra CPU time was obtained by performing mathematical operations in the default task set. The former condition eliminates selection for smaller genomes, and the latter condition imposes selection for functional complexity. The rewards for performing operations are also specified in the default task set; they are scaled by their approximate difficulty and combined in a multiplicative fashion with one another and with the baseline CPU time. (Note that multiplicative scaling of phenotypic rewards does not imply multiplicative effects of mutations.) Using random number seeds, 87 complex populations were derived; subsequent experiments used the numerically dominant genotype from each population. Starting with each complex organism as progenitor, a population of simple organisms evolved for 25,000 updates by allocating the same CPU time to all organisms. There was selection for smaller genome size to promote faster replication and no selection for mathematical operations. Subsequent experiments used the most abundant genotype from each population.
Mutational analyses. We developed three genetic tools to analyse the effects of point mutations on the performance of digital organisms. In all cases, the fitness (replication rate) of each mutant was calculated in the same environment in which its simple or complex parent evolved, and the mutant's fitness is expressed relative to the parent. The first tool makes every possible one-step point mutant for a particular parent. The default set includes 28 different instructions; given a parent of genome length 80, for example, there are 80
(28 - 1) = 2,160 different one-step point mutants. The mean fitness of these mutants permits exact calculation of
in the decay test. The second tool produces a random sample of progeny that differ from their parent by two or more point mutations. For each parent, we generated between 105 and 107 progeny with two mutations, three mutations and so on, up to ten mutations. The third tool produces and analyses pairs of point mutations alone and in combination; for each two-step mutant, we have both corresponding one-step mutants. Having the single mutants allows us to compare a double mutant's actual fitness with the exact value expected under the hypothesis that the mutations interact in a multiplicative manner. We ran the pair test on 104 and 105 mutational pairs for each complex and simple organism, respectively.
Statistical methods. We performed the Wilcoxon signed-ranks test on the difference scores for all comparisons between complex and simple organisms29. This test reflects the evolutionary relationship between pairs of organisms; it is also non-parametric and thus insensitive to deviations from a normal distribution. To estimate
in the decay tests, we minimized the sum of squared deviations around the log-transformed mean fitness values. We excluded samples with fewer than 100 viable mutants, in which case log mean fitness was poorly estimated. By increasing sample size to 108, we can obtain additional viable mutants; the exclusion of some values because of insufficient sampling appears to have no systematic effect on estimation of
.

6. Genetics 151, 921–927 (1999). | 
