## Introduction

In the absence of recombination, evolution is constrained by genetic linkage. That is, selection on an allele at one genomic locus can interfere with the evolution of simultaneously present alleles throughout the genome. Interference interactions between loci include background selection (the spread of a beneficial allele is impeded by linked deleterious alleles), hitchhiking or genetic draft (a neutral or deleterious allele is driven to fixation by a linked beneficial allele), and clonal interference between beneficial alleles originating in disjoint genetic clades (only one of which can reach fixation). These interactions and their consequences for genome evolution have been studied extensively in laboratory experiments1,2 and in natural populations3,4. Recent theory5,6,7,8,9,10,11,12,13 has quantified two broad interference effects in asexual evolution. First, interference selection rather than genetic drift constrains the genetic diversity in large populations, which, in turn, limits the efficacy of selection10,13,14,15. Second, interference reduces the speed of evolution7,8,9,11,12,13; this has been observed in laboratory evolution experiments16,17,18,19. The resulting fitness cost of interference, which has also been been observed in microbial laboratory evolution20,21,22,23, is the center piece of classic arguments for the evolutionary advantage of sex24,25,26,27,28.

Much less clear is how interference affects the evolution of molecular phenotypes, such as protein stabilities and affinities governing gene regulation and cellular metabolism. The systems-biological consequences of interference evolution are the topic of this paper. Our analysis is based on biophysical models of molecular evolution29,30,31,32,33,34,35,36. In a minimal model, each gene of an organism carries a single quantitative trait G, the stability of its protein fold. A fitness landscape f(G) quantifies the effect of protein stability on reproductive success. This landscape is a sigmoid function with a high-fitness plateau corresponding to stable proteins and a low-fitness plateau corresponding to unfolded proteins (Fig. 1a). We also discuss a stability-affinity protein model with a two-dimensional fitness landscape f(G, E); this model includes enzymatic or regulatory functions of genes, specifically the protein binding affinity E to a molecular target. From the perspective of molecular evolution, these landscapes provide a generic biophysical model of local fitness epistasis, which couples all sequence sites contributing to a stability or affinity trait in the same gene. Importantly, local epistasis in protein-coding sequence operates independently of fitness interactions across genes. Beyond proteins, local epistasis occurs ubiquitously in quantitative molecular traits associated with binding interactions. This form of epistasis is an important building block of our model that is not covered by the standard theory of asexual evolution5,6,7,8,9,10,11,12,13.

The system-wide evolution of molecular quantitative traits under genetic linkage defines a particular mode of phenotypic interference, which occurs broadly under conditions of typical microbial systems. This mode couples global and local evolution in a specific way: the global pace of evolution sets the average selection coefficient of local trait changes. In the first part of the paper, we develop the theory of phenotypic interference and derive a key quantitative result: in a system of g genes, the steady-state fitness cost of interference increases quadratically with g. This super-linear cost reflects a specific evolutionary mechanism: each additional gene degrades stability and function of all other genes by increasing the accumulation of deleterious mutations. We then turn to biological implications of phenotypic interference. We show that the interference cost can outweigh the metabolic cost of genes37,38 and generate long-term impact on systems biology: it strongly constrains genome complexity in viable, asexually reproducing organisms and drives the loss of non-essential genes. On the time scales of laboratory evolution experiments, phenotypic interference reduces fitness through the attrition of molecular traits; we compare this prediction to experimental data20,21,22,23. Finally, phenotypic interference provides a surprisingly simple pathway for the evolution of sex. We show that facultative recombination at low rates R can evolve near neutrality yet, once R exceeds a threshold R*, provides a large competitive advantage against competing non-recombining lineages. The predicted threshold R* is of order of the mutation rate, which is consistent with observed recombination rates.

## Results

### Housekeeping evolution under phenotypic interference

Here we analyze the evolution of genetically linked systems in a conservative environment, where populations maintain the functionality of molecular traits in the presence of deleterious mutations but there is no adaptive pressure on these traits. This scenario defines a system-wide mutation-selection steady state that we call housekeeping evolution (here, housekeeping does not refer to a particular class of genes or metabolic processes). It builds on the assumption that over long time scales, selection acts primarily to repair the deleterious effects of mutations, because these processes are continuous and affect the entire genome. In contrast, adaptive processes are often environment-dependent and transient, and they affect only specific genes. In Methods section, we extend our analysis to scenarios of adaptive evolution and show that these do not affect the conclusions of the paper.

Figure 1 illustrates the ingredients of phenotypic interference in the housekeeping state (and can serve as a shortcut through theory for readers primarily interested in the biological implications). First, local quantitative traits of a given gene are in an evolutionary equilibrium, where the long-term average of the trait value and its position on the fitness landscape are determined by the uphill force of selection and the downhill force of mutations (Fig. 1a and Supplementary Fig. 1a). Second, global genome evolution takes place in a so-called fitness wave; that is, genetic and phenotypic variants in multiple genes co-exist in a population and generate a broad distribution of fitness values7,8,9,11,12,13 (Fig. 1b and Supplementary Fig. 1b). These levels are linked by a common evolutionary parameter, the coalescence rate $$\tilde \sigma$$, or equivalently by the effective population size $$N_e = (2\tilde \sigma )^{ - 1}$$ (Supplementary Table 1 lists all mathematical symbols). The joint solution of the local and global evolutionary dynamics identifies a broad regime of phenotypic interference, which is marked by a system-wide genetic load depending quadratically on genome size (Fig. 1c).

### Evolution of a quantitative trait under interference selection

In the framework of the minimal biophysical model, we study the housekeeping evolution of genome-wide protein fold stability. The stability trait G of a given gene is defined as the free energy difference between the unfolded and the folded state (and usually denoted by ΔG; we abbreviate this notation to avoid confusion with the variance measures defined below). The trait G evolves in a fitness landscape f(G) of sigmoid form (Fig. 1a, see Methods section).

The mutation-selection equilibrium on a flank of the landscape f(G) can be characterized by the equilibrium values of its population mean trait, $${\mathrm{\Gamma }} \equiv \overline G$$, and the trait diversity or genetically heritable trait variance, $${\mathrm{\Delta }}_G \equiv \overline {G^2} - {\mathrm{\Gamma }}^2$$ (overbars denote averages within a population39). First, the diversity ΔG takes the simple, effectively neutral equilibrium form

$${\mathrm{\Delta }}_G = \frac{{u{\kern 1pt} \epsilon _G^2}}{{2\tilde \sigma }},$$
(1)

which is proportional to the total mutation rate u and the mean square stability effect $$\epsilon _G^2$$ of the relevant sequence sites, and to the effective population size $$N_e = (2\tilde \sigma )^{ - 1}$$. This form extends previous results on neutral sequence diversity14,40,41,42 and on quantitative trait diversity under genetic drift43,44,45. In Methods section, we derive Eq. 1 for quantitative traits in a fitness landscape f(G) by showing that stabilizing selection on ΔG can be neglected throughout the phenotypic interference regime; this scaling is confirmed by simulations (Supplementary Fig. 2a). In a fitness wave, the parameter $$\tilde \sigma$$ couples each individual trait to the global evolutionary dynamics of all genetically linked genes (Fig. 1a, b). In contrast, an independently evolving trait would depend on an effective population size Ne set by genetic drift. Next, we compute the equilibrium point for the mean trait $${\mathrm{\Gamma }}$$ by equating the rate of stability increase by selection with the rate of stability degradation by mutations,

$${\mathrm{\Delta }}_G{\kern 1pt} f{\prime}({\mathrm{\Gamma }}) = u\epsilon _G;$$
(2)

details are given in Methods section. This mutation-selection equilibrium depends on the effective population size, in contrast to protein evolution models in the infinite population limit31. By inserting Eq. 1, we can express the mean square selection coefficient at trait sites, $$s^2 = \epsilon _G^2f{\prime}^2({\mathrm{\Gamma }})$$, and the fitness variance $${\mathrm{\Delta }}_f \approx {\mathrm{\Delta }}_Gf{\prime}^2({\mathrm{\Gamma }})$$ in terms of the coalescence rate,

$$s^2 = 4\tilde \sigma ^2,\quad {\mathrm{\Delta }}_f = 2u\tilde \sigma ;$$
(3)

a similar relation for s2 under genetic drift has been derived in refs. 46,47. These equations describe stable trait equilibria on the downward-curved shoulder of the fitness landscape f(G), which is a non-linear trait interval with $$f{\prime}{\prime}(G) < 0$$. They express universal characteristics of these equilibria, which do not depend on details of the fitness landscape and of the trait effect distribution of sequence sites. Their validity is confirmed by numerical simulations (Supplementary Fig. 2). The above derivation neglects fluctuations of $${\mathrm{\Gamma }}$$ by genetic drift and genetic draft; cf. Supplementary Fig. 1a. However, Eq. 3 remain exactly valid in the full mutation-selection-coalescence dynamics (Supplementary Methods 1 and Supplementary Fig. 3).

A salient feature of selection on quantitative traits becomes apparent from Eq. 3: the selection coefficients of new genetic variants are not fixed a priori, but are an emergent property of the global evolutionary process. A faster pace of evolution, i.e., an increase in coalescence rate $$\tilde \sigma$$, reduces the efficacy of selection10,11,14. On the downward curved shoulder of the fitness landscape, this drives the population to an equilibrium point of lower fitness and higher fitness gradients. In other words, trait-changing mutations are under ubiquitous negative epistasis: the combined (log) fitness effect of two deleterious trait changes is larger in magnitude, the combined effect of two beneficial mutations is smaller than the sum of the individual effects. This epistasis tunes typical selection coefficients to marginal relevance, where mean allele sojourn times between low and high frequencies, 1/s, are of the order of the coalescence time $$2N_{\mathrm{e}} = 1/\tilde \sigma$$. That point marks the crossover between effective neutrality ($$s \ll \tilde \sigma$$) and strong selection ($$s \gg \tilde \sigma$$)10; consistently, most but not all trait sites carry their beneficial allele.

### Interference of multiple traits

We now obtain a closed solution of housekeeping evolution under phenotypic interference by matching the individual trait equilibria given by Eq. 3 with a fitness wave model for global evolution. First, the total fitness variance σ2 is simply the sum of the fitness variances Δf of the individual genes (Supplementary Fig. 4). Using Eq. 3, this sum rule takes the form $$\sigma ^2 = g{\mathrm{\Delta }}_f = 2ug\tilde \sigma$$, which relates the scales of global selection and coalescence, σ and $$\tilde \sigma$$. Second, given a sufficient supply of non-neutral mutations, global evolution proceeds in a fitness wave (the condition for wave occurrence will be made precise below). General fitness wave theory then provides another relation between global selection and coalescence,

$$c \equiv \frac{{\sigma ^2}}{{\tilde \sigma ^2}} = c_0{\kern 1pt} {\mathrm{log}}(N\sigma ),$$
(4)

where N is the population size and c0 is a model-dependent prefactor12,13 (Methods). Combining these relations, we obtain the global fitness wave of phenotypic interference,

$$\sigma ^2 = 4\frac{{u^2g^2}}{c},\quad \tilde \sigma = 2\frac{{ug}}{c}.$$
(5)

Equations 3 then determine the corresponding characteristics of individual traits,

$$\frac{{{\mathrm{\Delta }}_G}}{{\epsilon _G^2}} = \frac{c}{{4g}},\quad s^2 = 4\tilde \sigma ^2 = 16\frac{{u^2g^2}}{{c^2}}.$$
(6)

Equations 5 and 6 involve the fitness wave parameter defined in Eq. 4,

$$c \simeq c_0{\kern 1pt} {\mathrm{log}}\frac{{2Nug}}{{c_0}},$$
(7)

which depends only weakly on the evolutionary parameters and provides corrections to the scaling. This parameter estimates the complexity of the fitness wave, that is, the average number of genes with simultaneously segregating beneficial genetic variants destined for fixation (Fig. 1 and see Methods section). A wave pattern with temporally stable fitness polymorphism of approximately Gaussian form occurs whenever the mutation rate exceeds the average site selection coefficient, $$ug \, \gtrsim \, s$$15. This regime underlies the closure of Eqs. 5, 6; cf. Supplementary Fig. 1b. As shown in Methods section, it applies to gene numbers above a threshold g0 given by the condition

$$g_0 = \frac{c}{4}.$$
(8)

These relations are the centerpiece of phenotypic interference theory. They show that the collective evolution of molecular quantitative traits under genetic linkage depends strongly on the number of genes that encode these traits. The dependence is generated by a feedback between the global fitness variation, σ2, and mean square local site selection coefficients, s2. This feedback also tunes the evolutionary process to the crossover point between independently evolving genomic sites and strongly correlated fitness waves composed of multiple small-effect mutations (Supplementary Methods 2). Remarkably, local and global characteristics of phenotypic interference are strongly universal: they depend only on the parameters g, u, and c but decouple from details of gene fitness landscapes and site effect distributions.

The scaling of phenotypic interference is confirmed by extensive numerical simulations of Fisher-Wright populations, which are detailed in Methods section. Figure 2 shows the global observables σ2, $$\tilde \sigma ^2$$ and the local observables ΔG, s2 as functions of g. The data display a crossover from a weak-interference regime of independently evolving genes at low values of g (brown dashed lines) to the phenotypic interference scaling given by Eqs. 57 (red dashed lines); this crossover occurs around a modest gene number $$g_0 \sim 100$$. The calibration between theory and data involves the fitting of a single model-dependent amplitude c0; the calibrated theory matches the data for realistic gene numbers (g ~ 103 − 104) without additional fit parameters. The data also show the universality of the leading scaling behavior; gene selection coefficients f0 varying by more than three orders of magnitude introduce only small corrections to scaling. Supplementary Fig. 1 displays the separation of diversity scaling between predominantly monomorphic individual traits and standing fitness variation, as detailed in Eqs. 19, 20 of Methods section. The underlying near-linear relation between global fitness variance σ2 and coalescence rate $$\tilde \sigma ^2$$, which is a general property of fitness waves, is checked in Supplementary Fig. 2d.

### Interference selection against complexity

The evolutionary cost of deleterious mutations is quantified by the genetic load, which is defined as the mean fitness of a population compared to the fitness maximum. In the biophysical fitness landscape f(G) of the minimal model, the load of a given gene takes the approximate form $$f_0 - f({\mathrm{\Gamma }})$$, where $${\mathrm{\Gamma }}$$ denotes the population mean stability and f0 is the fitness of a fully stable gene ($$G \gg 0$$); see Fig. 1a and Eq. 13 in Methods section. We now compute the genetic load under phenotypic interference for stable and functional genes, which are located in the concave part of the minimal model landscape f(G). This part can be approximated by its exponential tail, where the load is proportional to the slope $${\cal{L}} = f{\prime}({\mathrm{\Gamma }})/k_{\mathrm{B}}T$$. Equation 6, $$s = \epsilon f{\prime}({\mathrm{\Gamma }}) = 2\tilde \sigma$$, then predicts a load $$sk_{\mathrm{B}}T/\epsilon _G \approx 2\tilde \sigma$$ per gene, where we have used that typical reduced effect sizes $$\epsilon _G/k_{\mathrm{B}}T$$ are of order 1 (see Methods section). With Eq. 5, we obtain a quadratic scaling of the total equilibrium genetic load,

$${\cal{L}}_{{\mathrm{int}}}(g) \approx 2g\tilde \sigma = \frac{{4ug^2}}{c},$$
(9)

which sets on at a small gene number g0 given by Eqs. 7, 8 (Fig. 1c; numerical simulations are shown in Fig. 3). The superlinearity of the load is the most important biological consequence of phenotypic interference and the main difference to previous results on protein evolution31. It is generated by the evolutionary feedback between global and local selection discussed in Fig. 1: increasing the number of genes reduces the coalescence time $$\tilde \sigma ^{ - 1}$$ and, thus, the efficacy of selection on every single gene.

In Supplementary Methods 3 and Supplementary Fig. 5, we discuss phenotypic interference in extended biophysical models. These include active protein degradation at the cellular level, a ubiquitous process that drives the thermodynamics of folding out of equilibrium48. Another example is the stability-affinity model, which has two quantitative traits per gene that evolve in a two-dimensional sigmoid fitness landscape f(G, E)35,49. Under reasonable biophysical assumptions, evolution in the stability-affinity model produces a 2-fold higher interference load than the minimal model, $${\cal{L}}_{{\mathrm{int}}}(g) \approx 8ug^2/c$$. Alternative models with a quadratic single-peak fitness landscape describe, for example, gene expression levels under stabilizing selection50. Such landscapes generate an even stronger load nonlinearity, $${\cal{L}}_{{\mathrm{int}}}(g) \sim g^3$$. In contrast, a discrete model with a fitness effect f0 of each gene shows a linear load up to a characteristic gene number $$g_{\mathrm{m}} = (f_0/u){\kern 1pt} {\mathrm{log}}(N{\kern 1pt} f_0)$$ associated with the onset of mutational meltdown by Muller’s ratchet8,51,52. These examples suggest that superlinear scaling of the genetic load holds quite generally, given a sufficient number of quantitative traits evolving under genetic linkage and in fitness landscapes with negative epistasis. This type of landscape is ubiquitous in biophysical models.

The equilibrium load $${\cal{L}}_{{\mathrm{int}}}$$ generates strong long-term selection against genome complexity: the fitness cost for each additional gene, $${\cal{L}}_{\mathrm{int}}' (g)$$, can take sizeable values even at moderate genome size. For example, in a “standard” microbe of the complexity of E. coli, a 10% increase in gene number may incur an additional load $${\mathrm{\Delta }}{\cal{L}} \approx 3 \, \times \, 10^{ - 2}$$ under the stability-affinity model (with parameters $$g = 5000$$, $$u = 10^{ - 6}$$, $$N = 10^8$$). This estimate should be regarded as a lower bound, which is based only on core protein functions but ignores, for example, regulatory functions encoded in intergenic DNA. In comparison, the discrete model leads to a much smaller value $${\mathrm{\Delta }}{\cal{L}} = 5 \times 10^{ - 4}$$ for the same parameters.

### Genetic load can exceed metabolic fitness cost

We can compare the interference load $${\cal{L}}_{{\mathrm{int}}}' (g) = 8ug/c$$ of an extra gene with its physiological fitness cost $${\cal{L}}_{{\mathrm{phys}}}' (g)$$, which is generated primarily by the synthesis of additional proteins (and is part of the fitness amplitude f0). Metabolic theory shows that spurious expression leads to a re-allocation of metabolic resources in the cell and a reduced growth rate, $$\lambda = \lambda _0(1 - \phi _{\mathrm{U}}/\phi _{{\mathrm{max}}})$$, where $$\phi _U$$ is the proteome fraction of unnecessary genes and $$\phi _{{\mathrm{max}}}$$ is the total proteome fraction available for growth ($$\phi _{{\mathrm{max}}} \approx 0.5$$ for E. coli in exponential growth)37,38. A single gene with average expression level encodes a proteome fraction $$\phi _U \sim 1/g$$; this leads to a metabolic cost $${\cal{L}}_{{\mathrm{phys}}}' (g) = (\lambda _0 - \lambda )/\lambda _0 \sim 1/(g\phi _{{\mathrm{max}}})$$ per generation. Similarly, the energetic cost of a gene is of order 1/g53. While the precise form of these cost components depends on details of cell metabolism, we expect generically $${\cal{L}}_{{\mathrm{phys}}}' (g) \approx 1/g$$. For evolution under phenotypic interference, this implies $${\cal{L}}_{\mathrm{phys}}' (g) \lesssim \tilde \sigma$$ for $$g \, \gtrsim \, 5000$$, which is similar to the interference load per gene in a standard microbe but becomes subleading in larger genomes.

The physiological cost per gene acts as a selective force on changes of genome size within a coalescence interval $$\tilde \sigma ^{ - 1}$$. The inequality $${\cal{L}}_{{\mathrm{phys}}}' (g) \lesssim \tilde \sigma$$ says that such changes are weakly selected and suggests a two-scale evolution of genome sizes. On short time scales, the dynamics of gene numbers is permissive and allows the rapid acquisition of adaptive genes. On longer time scales (of order $$\tau$$; see Eq. 11 below), the interference load prunes marginally relevant genes in a more stringent way, for example, by invasion of strains with more compact genomes.

### Interference drives gene loss

The near-neutral dynamics of genome size extends to gene losses, which become likely when a gene gets close to the inflection point of the sigmoid fitness landscape and the stability condition underlying Eq. 2 no longer holds (Fig. 4a). The relevant threshold gene selection, $$f_0^{\mathrm{c}}$$, is

$$f_0^{\mathrm{c}} \sim 2\tilde \sigma = \frac{{4ug}}{c}$$
(10)

in the minimal model; see Eq. 5. Strongly selected genes ($$f_0 \gg f_0^{\mathrm{c}} \sim 2\tilde \sigma$$) have equilibrium trait values firmly on the concave part of the landscape, resulting in small loss rates of order $$u{\kern 1pt} {\mathrm{exp}}( - f_0/2\tilde \sigma )$$10; these genes can be maintained over extended evolutionary periods. Marginally selected genes ($$f_0 \, \lesssim \, f_0^{\mathrm{c}} \sim 2\tilde \sigma$$) have near-neutral loss rates of order u10, generating a continuous turnover of genes. According to Eq. 10, the threshold $$f_0^{\mathrm{c}}$$ for gene loss increases with genome size, which expresses again the evolutionary constraint on genome complexity. The dependence of the gene loss rate on f0 and $$\tilde \sigma$$ is confirmed by simulations (Fig. 4b). The housekeeping coalescence rate $$\tilde \sigma = 2ug/c$$ sets a lower bound for $$f_0^{\mathrm{c}}$$, adaptive evolution can lead to much larger values of $$\tilde \sigma$$ and $$f_0^{\mathrm{c}}$$.

### Load accumulation in evolution experiments

After a change in gene number or other systems parameters, the evolutionary process reaches a new steady state. Because (additional) deleterious trait changes are only marginally selected (i.e., have selection coefficients of magnitude $$|s| \, \lesssim \, \tilde \sigma$$), the relaxation time $$\tau$$ is of the order of the inverse mutation rate per trait,

$$\tau \sim \frac{1}{u} = \frac{{2g}}{c}\frac{1}{{\tilde \sigma }},$$
(11)

where we have used Eq. 5. This time scale exceeds the coalescence time $$\tilde \sigma ^{ - 1}$$ and is of order 106 generations for a standard microbe. Hence, interference selection against complexity is a potent evolutionary force affecting natural populations but is beyond the time scales of laboratory evolution experiments.

Nevertheless, the phenotypic interference model makes testable predictions on load accumulation in laboratory populations. Consider a standard microbe that has an initial housekeeping interference load $${\cal{L}}_{{\mathrm{int}}}/g = 2\tilde \sigma$$ per gene and is subject to strong adaptive pressure in the experiment, generating an increased coalescence rate $$\tilde \sigma _{{\mathrm{ad}}} \gg \tilde \sigma$$. Equations 6, 11 then predict a lower bound for the genome-wide rate of load increase, $${\cal{L}}_{{\mathrm{int}}} \, \gtrsim \, 2ug\tilde \sigma$$ per generation. This loss reflects the system-wide collateral degradation of protein stability, which is caused by deleterious hitchhiker mutations of the adaptive process.

A collateral fitness decline of this type and magnitude has been observed in E. coli populations from long-term evolution experiments20,21,22,23. While the decline is masked in the original long-term experiments by a larger adaptive fitness gain21, it has been revealed by fitness measurements of the evolved strains on other substrates20. A substantial part of the fitness loss can be rescued in fitness assays at lower temperature, suggesting a link to protein stability20. The phenotypic interference model supports this interpretation. Protein stability G, as well as quantitative protein function traits, provides a large, genome-wide supply of weakly selected mutations prone to hitchhiking ($$s \, \lesssim \, \tilde \sigma$$). Moreover, the biophysical fitness landscapes of protein stability and affinity are explicitly temperature-dependent, which explains why fitness losses by deleterious mutations can be compensated by temperature reduction. We obtain a lower bound on the fitness loss related to the genome-wide attrition of these biophysical traits, $$\dot {\cal{L}} \sim 10^{ - 5}$$ per generation, by evaluating the temperature-rescuable part of the fitness decline in mutator lines (see Methods section). Nonsynonymous substitutions have been observed at a genome-wide rate $$ug \sim 10^{ - 2}$$ per generation in these lines, and a large part appears to be effectively neutral hitchhikers22. Associating these substitutions with quantitative traits, the phenotypic interference model provides a lower-bound estimate $$\dot {\cal{L}}_{{\mathrm{int}}} \sim 2 \times 10^{ - 6}$$ per generation (see Methods section), which is consistent with the observed loss rate.

### The pathway to sexual evolution

Recombination reshuffles genome segments at a rate R per genome and per generation (R is also called the genetic map length). Evolutionary models show that recombination generates linkage blocks that are units of selection. A block contains an average number $$\xi$$ of genes, such that there is one recombination event per block and per coalescence time, as given by the relation $$R\xi /(g\tilde \sigma (\xi )) = 1$$13,15,54,55. Depending on R, these models predict a regime of asexual evolution, where selection acts on entire genotypes ($$\xi \sim g$$), and a distinct regime of sexual evolution with selection acting on individual alleles ($$\xi \ll g$$). Here we focus on the evolution of the recombination rate itself and establish a selective avenue for the transition from asexual to sexual evolution.

With the phenotypic interference scaling $$\tilde \sigma (\xi ) = 2u\xi /c$$ for $$\xi \, \gtrsim \, c$$, as given by Eq. 5, our minimal model produces an instability at a threshold recombination rate

$$R^ \ast = \tilde \sigma = \frac{{2ug}}{c},$$
(12)

signaling a first-order phase transition with the genetic load as order parameter. For $$R \, < \, R^{\ast}$$, the population is in the asexual mode of evolution ($$\xi \sim g$$), where phenotypic interference produces a superlinear load $${\cal{L}}_{{\mathrm{int}}} = 2ug^2/c$$. For $$R \, > \, R^ \ast$$, efficient sexual evolution generates much smaller block sizes ($$\xi \sim c$$). In this regime, the load drops to the linear form $${\cal{L}}_0 = ug \ll {\cal{L}}_{{\mathrm{int}}}$$ providing a net long-term evolutionary fitness gain $${\mathrm{\Delta }}{\cal{L}} = {\cal{L}}_{{\mathrm{int}}} - {\cal{L}}_0 \simeq {\cal{L}}_{{\mathrm{int}}}$$. The first-order transition is a specific consequence of phenotypic interference. Because recombination rate and coalescence rate in a linkage block are both proportional to the block size $$\xi$$, the recombination-coalescence balance criterion takes the $$\xi$$-independent form $$R^ \ast /g = 2u/c$$. That is, linkage blocks cover either the entire genome ($$\xi \sim g$$) or just small genome segments ($$\xi \sim c$$). The resulting drop of $${\cal{L}}$$ in recombining populations close to R* is confirmed by simulations (Fig. 5a). The process of recombination comes with a direct, short-term cost $${\cal{L}}_{{\mathrm{rec}}}$$ per generation, which includes mating costs, physiological costs, and deleterious reshuffling costs, and can potentially prevent the evolution of recombination. The classical factor 2 scenario of obligately sexual populations says that this cost is of order 1 per recombination event; that is, $${\cal{L}}_{{\mathrm{rec}}} = R$$28,56,57. For early isogamous populations without the full machinery of sexual reproduction, $${\cal{L}}$$ is likely to be smaller58. Importantly, in the phenotypic interference mode, this cost remains always marginal. Even the upper-bound assumption $${\cal{L}}_{{\mathrm{rec}}} = R$$ leads to a cost $${\cal{L}}_{{\mathrm{rec}}} \, \lesssim \, R^ \ast = \tilde \sigma$$ at the transition, which implies only weak negative selection.

Together, the theory of phenotypic interference suggests a specific selective pathway for the evolution of recombination (Fig. 5b). First, given that the evolution of recombination at a rate of order R* is near-neutral, a recombining sub-lineage with $$R \sim R^ \ast$$ arising in an asexual background population can fix by genetic drift and draft. Second, a recombining strain with $$R \, > \, R^{\ast}$$ can eliminate the interference load by the parallel fixation of beneficial mutations in unlinked genome segments. This leads to a long-term benefit $${\mathrm{\Delta }}{\cal{L}} \sim gR^ \ast = g\tilde \sigma$$ over non-recombining but otherwise equivalent strains; by Eq. 9, this benefit is of order 1 for a standard microbe. Hence, the evolved recombining strain can readily outcompete related non-recombining strains in the same ecological niche. The threshold recombination rate R* is of the order of the genome-wide mutation rate ug, so even rare facultative recombination provides a robust pathway to sexual evolution. This pathway builds on a separation of selection scales: the near-neutral establishment of recombination is followed by the buildup of a large benefit. We can compare observed recombination rates in natural populations with the predicted threshold rates R* (Supplementary Table 2). Consistently, genome-wide average rates for species in different parts of the tree of life are always well above R*; a high-resolution recombination map of the Drosophila genome shows low-recombining regions with values above but of order R*59,60.

The phenotypic interference pathway to recombination has highly universal characteristics: its long-term benefit of recombination is g-fold higher than the upper-bound cost, independently of details of the genome-wide selection and mutation landscape. In particular, this pathway does not require any of the strong assumptions of previous models for the evolution of recombination, which include direct benefits of recombination28,58,61,62, strong and continual adaptation61,63,64,65, and genome-wide epistasis between mutations28,65,66,67,68. It builds instead on local diminishing-return epistasis for functional traits of individual proteins, which is a natural consequence of their underlying biophysical mechanism. Recent fitness-wave models, which have an interference dynamics qualitatively similar to ours, quantify the difference in adaptive speed between clonally evolving and recombining populations7,8,9,11,12,13, but a direct cost-benefit balance of recombination based on genetic load has not been attempted. We note that these models assume mutations with a fixed distribution of selection coefficients and no local epistasis, which creates important quantitative differences to phenotypic interference. First, strongly deleterious effects of asexual evolution, which are associated with the onset of Muller’s ratchet, set in at larger genome sizes8 than under phenotypic interference (Fig. 1c). Second, the crossover to sexual evolution, which has been studied in the context of adaptive fitness waves, takes place at a larger recombination rate R15,69 and, hence, a larger recombination cost. A more detailed model comparison is given in Supplementary Discussion.

## Discussion

Here we have developed the evolutionary genetics of multiple biophysical traits in non-recombining populations. Our approach combines quantitative trait theory with fitness wave theory. We find a specific evolutionary mode of phenotypic interference, which is characterized by a feedback between global and local selection. The system-wide genetic variation of the traits generates fitness variance, which, in turn, determines the scale of selection at local genomic sites encoding the traits. This feedback generates highly universal features, which do not depend on system details. These include the complexity of the evolutionary process and the scaling of coalescence rate and genetic load with the gene number, as given by Eqs. 59. A similar destructive feedback generating a superlinear cost has been identified in crosstalk of gene regulation70. Importantly, phenotypic interference also generates universal local selection. By Eq. 3, the average selective amplitude of trait-changing mutations decouples from the total fitness effect of the trait. That is, the spectrum of site selection coefficients is not a fixed input, but a dynamical output of the evolutionary process. This selection filter is the main difference of our approach to previous population-genetic models of asexual evolution5,6,7,8,9,10,11,12,13. We argue it is a relevant step towards biological realism.

Phenotypic interference depends on two prerequisites: selection is globally clonal and its local genomic units are broadly epistatic. The clonality of selection is a generic consequence of low recombination rates; broad fitness epistasis is a ubiquitous feature of biophysical gene traits, including protein stabilities and activities. Such traits have non-linear fitness landscapes, in which the selection on trait changes depends on the trait value (Fig. 1a).

We have shown that phenotypic interference produces systems-biological effects on different evolutionary time scales. In clonal adaptation experiments, it predicts a system-wide functional and fitness degradation in line with observations20,21,22,23. On macro-evolutionary scales, it generates strong selection against genome complexity in clonally reproducing populations. The underlying genetic load originates from the interference of phenotypic variants within a population and accumulates with a time delay beyond the coalescence time, as given by Eq. 11. Interference load acts as an evolutionary force in an ecological context: microbial strains with shorter genomes can outcompete otherwise similar strains with longer genomes that are in the same ecological niche. We have shown that this force, which arises naturally from a systems perspective of multiple biophysical traits, provides a robust eco-evolutionary pathway for the transition to recombination. Its selective input is local fitness epistasis, which occurs ubiquitously in quantitative molecular traits. Therefore, unlike previous models based on global epistasis28,65,66,67,68, this pathway does not require ad-hoc assumptions on the form of selection.

The target of phenotypic interference is molecular complexity, which can be regarded as a key systems-biological observable. In our simple biophysical models, we measure complexity by number of stability and affinity traits in a proteome. This is clearly just a starting point towards a broader systems-biological approach that includes regulatory, signaling, and metabolic networks. These define additional landscapes of biophysical interactions, but the key evolutionary mechanisms of phenotypic interference—globally clonal selection and tuned, epistatic selection on system components—are expected to play out in a similar way. In a systems model, we can define complexity as the number of (approximately) independent molecular quantitative traits, which includes network contributions that scale in a nonlinear way with genome size. Interference selection affects the complexity and architecture of all of these networks, establishing new links between evolutionary and systems biology to be explored in future work.

## Methods

### Biophysical fitness models

In thermodynamic equilibrium at temperature T, a protein is folded with probability $$p_ + (G) = 1/[1 + {\mathrm{exp}}( - G/k_{\mathrm{B}}T)]$$, where G is the Gibbs free energy difference between the unfolded and the folded state and kB is Boltzmann’s constant. A minimal biophysical fitness model for proteins takes the form

$$f(G) = f_0{\kern 1pt} p_ + (G) + C = \frac{{f_0}}{{1 + {\mathrm{exp}}( - G/k_{\mathrm{B}}T)}} + C$$
(13)

with a single selection coefficient capturing functional benefits of folded proteins and metabolic costs of misfolding32,33,34. The constant C is irrelevant for the computation of fitness differences (selection coefficients). This model describes the effect of a protein on Malthusian (logarithmic) fitness, depending on its free energy of folding. Similar fitness models based on binding affinity have been derived for transcriptional regulation29,30,71,72; the rationale of biophysical fitness models has been reviewed in refs. 36,73. Equation 13 applies to genes with individually small fitness effects ($$f_0 \ll 1$$). An appropriate extension to essential genes is a landscape describing zero growth (lethality) at a finite stability threshold G0, which corresponds to a singularity of the Malthusian fitness, f(G) → −∞ for $$G \to G_0$$. An example is the landscape $$f(G) = {\mathrm{log}}[f_0/(1 + {\mathrm{exp}}( - G/k_{\mathrm{B}}T)) + (1 - f_0)]$$, which has a threshold G0 given by $$p_ + (G_0) = 1 - 1/f_0$$ for $$f_0 > 1$$; alternative models for essential genes are described in refs. 31,32. However, the extended fitness landscape retains the form Eq. 13 in the regime of stable folding ($$G/k_{\mathrm{B}}T \, \gtrsim \, 1$$), which implies that our conclusions remain unaffected. In particular, the load per gene remains independent of the selection amplitude f0, as given by Eq. 9 and confirmed by simulations (Fig. 3). In Supplementary Methods 3, we introduce further alternative fitness landscapes for proteins and show that our results depend only on broad characteristics of these landscapes.

The minimal global fitness landscape for a system of g genes with traits $$G_1, \ldots ,G_g$$ and selection coefficients $$f_{0,1}, \ldots ,f_{0,g}$$ is taken to be additive, i.e., without epistasis between genes,

$$f(G_1, \ldots ,G_g) = \mathop {\sum}\limits_{i = 1}^g \frac{{f_{0,i}}}{{[1 + {\mathrm{exp}}( - G_i/k_{\mathrm{B}}T)]}}.$$
(14)

### Evolutionary model

We characterize the population genetics of an individual trait G by its population mean $${\mathrm{\Gamma }}$$ and its expected variance ΔG. These follow the stochastic evolution Equations45

$${\dot{\mathrm{\Gamma }}} = - u\kappa \epsilon _G + {\mathrm{\Delta }}_Gf{\prime}({\mathrm{\Gamma }}) + \chi _{\mathrm{\Gamma }}(t),$$
(15)
$${\dot{\mathrm{\Delta }}}_G = \left( {u\epsilon _0^2 - N_{\mathrm{e}}^{ - 1}{\mathrm{\Delta }}_G} \right){\mathrm{\Delta }}_G + {\mathrm{\Delta }}_G^2f{\prime}{\prime}({\mathrm{\Gamma }}) + \chi _{\mathrm{\Delta }}(t).$$
(16)

These equations contain white noise $$\chi _{\mathrm{\Gamma }}(t)$$ of mean $$\langle \chi _{\mathrm{\Gamma }}(t)\rangle = 0$$ and variance $$\langle \chi _{\mathrm{\Gamma }}(t)\chi _{\mathrm{\Gamma }}(t{\prime})\rangle = ({\mathrm{\Delta }}_G/2N_{\mathrm{e}}){\kern 1pt} \delta (t - t{\prime})$$ and $$\chi _{\mathrm{\Delta }}(t)$$ of mean $$\langle \chi _{\mathrm{\Delta }}(t)\rangle = 0$$ and variance $$\langle \chi _{\mathrm{\Delta }}(t)\chi _{\mathrm{\Delta }}(t{\prime})\rangle = (2{\mathrm{\Delta }}_G^2/2N_{\mathrm{e}}){\kern 1pt} \delta (t - t{\prime})$$ with an effective population size $$N_{\mathrm{e}} = 1/2\tilde \sigma$$ generated by genetic draft. This dynamics is characterized by the rate u, the mean effect (−κ)$$\epsilon _G$$, and the mean square effect $$\epsilon _G^2$$ of trait-changing mutations. We use effects $$\epsilon _G \approx 1$$ − 3kBT, which have been measured for fold stability31,74 and for molecular binding traits29,75,76. Furthermore, we approximate the mutational bias $$\kappa ({\mathrm{\Gamma }})$$ by a constant $$\kappa = 1$$, which reflects the observation that most mutations affecting a functional trait are deleterious.

### Evolutionary equilibria for individual traits

We now derive the equilibrium conditions of the model given by Eqs. 15, 16, which are used in the main text. This involves three steps. First, the deterministic term in Eq. 16 determines the average trait diversity ΔG as given in Eq. 1, if we neglect the selection component (this will be justified in step three below). That is, ΔG follows from a mutation-coalescence balance: the trait gains a heritable variance ΔG by new mutations at a speed $$u\epsilon _G^2$$, and it loses variation by coalescence at a rate $$2\tilde \sigma$$. Equation 1 is consistent with well-known results for the average sequence diversity Δ, indicating that diversity expectation values do not depend on details of the coalescence process. These results include the relation $${\mathrm{\Delta }} = 4uN_{\mathrm{e}}$$ in the standard theory of neutral evolution, where Ne is proportional to the actual population size40. The same relation is obtained for the sequence diversity of neutral genomic sites in models of genetic draft41 and in fitness wave models, where $$N_e = (2\tilde \sigma )^{ - 1}$$ is determined by selection14,42. To obtain the equivalent form for a quantitative trait G, we simply rescale the sequence diversity by the mean square effect $$\epsilon _G^2$$44,45, which leads to Eq. 1.

Second, the equilibrium point of the mean trait $${\mathrm{\Gamma }}$$ follows from a mutation-selection balance, as given by Eq. 2. The rate of stability increase by selection, $$(\partial {\mathrm{\Gamma }}/\partial t)_{{\mathrm{sel}}.} = {\mathrm{\Delta }}_Gf{\prime}({\mathrm{\Gamma }})$$, is essentially a statement of Fisher’s theorem; the corresponding rate of fitness increase reads

$$\left( {\frac{{\partial f}}{{\partial t}}} \right)_{{\mathrm{sel}}.} = f{\prime}({\mathrm{\Gamma }})\left( {\frac{{\partial {\mathrm{\Gamma }}}}{{\partial t}}} \right)_{{\mathrm{sel}}.} = {\mathrm{\Delta }}_Gf{\prime}^2({\mathrm{\Gamma }}) = {\mathrm{\Delta }}_f({\mathrm{\Gamma }}).$$
(17)

The rate of stability decrease by mutations is the product of the total mutation rate per trait, u, and the mean effect per mutation (−κ)$$\epsilon _G$$ with the approximation $$\kappa = 1$$ as discussed above. In Supplementary Methods 1 and Supplementary Fig. 3, we derive the equilibrium of the mean trait $${\mathrm{\Gamma }}$$ in a fully stochastic calculus. We also note that the weakness of stabilizing selection on the trait diversity is consistent with finite directional selection on the population mean trait45.

Third, we can check a posteriori that the selection term in Eq. 16 can be self-consistently neglected. For stable genes, our biophysical traits live on the downward-curved shoulder of the fitness landscape (where $$f{\prime}{\prime}(G) < 0$$). The neutral relation (1) remains approximately valid for these traits if the resulting stabilizing selection on the trait diversity is negligible. This condition can be written in terms of the diversity load $${\cal{L}}_{\mathrm{\Delta }} \equiv f({\mathrm{\Gamma }}) - \bar f$$,

$$\frac{{{\cal{L}}_{\mathrm{\Delta }}}}{{\tilde \sigma }} \simeq \frac{{{\mathrm{\Delta }}_G|f{\prime}{\prime}({\mathrm{\Gamma }})|}}{{\tilde \sigma }} \lesssim 1;$$
(18)

see ref. 45. We now show that this condition is self-consistently fulfilled throughout the phenotypic interference regime. Evaluating the expected fitness curvature in the high-fitness part of the minimal fitness landscape, Eq. 13, where $$f{\prime}{\prime}({\mathrm{\Gamma }}) = - f{\prime}({\mathrm{\Gamma }})/k_{\mathrm{B}}T$$, and in the mutation-coalescence equilibrium given by Eq. 1, we obtain $$f{\prime}{\prime} = - 2\tilde \sigma /(\epsilon _gk_{\mathrm{B}}T)$$. By Eqs. 6, 18 then reduces to

$$\frac{{{\mathrm{\Delta }}_G}}{{\epsilon _G^2}} = \frac{c}{{4g}} \lesssim 1,$$
(19)

which is identical to the condition for phenotypic interference, Eq. 8. We conclude that Eq. 1 is a valid approximation for the trait diversity throughout the phenotypic interference regime. This is confirmed by our simulation results (Supplementary Fig. 2a).

### Housekeeping equilibrium and fitness waves of phenotypic interference

The deterministic equilibrium solution ($${\dot{\mathrm{\Gamma }}} = 0$$, $$\chi = 0$$) of Eq. 15 determines the dependence of ΔG and the associated fitness variance $${\mathrm{\Delta }}_f = {\mathrm{\Delta }}_Gf{\prime}^2(G)$$ on $$\tilde \sigma$$, as given by Eq. 3; the same scaling follows from the full stochastic equation (Supplementary Methods 1). The derivation of the global housekeeping steady state, Eqs. 57, uses two additional inputs: the additivity of the fitness variance, $$\sigma ^2 = g{\mathrm{\Delta }}_f$$, which is confirmed by our simulations (Supplementary Fig. 4), and the universal relation Eq. 4 in a fitness wave12,13. This relation is obtained by evaluating the total fitness span, $$\hat \sigma \equiv f_{{\mathrm{max}}} - f_0$$ in a population of finite census size N. Here fmax is the fitness maximum in the set of established mutations (i.e., mutations that have overcome genetic drift), which requires a mutant clone frequency $$x \, \gtrsim \, 1/(N(f - f_0))$$. Given a Gaussian bulk fitness distribution $$\rho (f) = (2\pi \sigma ^2)^{ - 1/2}{\kern 1pt} {\mathrm{exp}}[ - (f - f_0)^2/2\sigma ^2]$$, the tail condition for established mutations, $${\int}_{f_{{\mathrm{max}}}}^\infty \rho (f){\kern 1pt} df \sim 1/(N\hat \sigma )$$, produces $$\hat \sigma ^2/\sigma ^2 \sim {\mathrm{log}}(N\sigma )$$. Equation 4 then follows via the kinematic relation $$\tilde \sigma = \sigma ^2/\hat \sigma$$ given by Fisher’s theorem. The prefactor c0 is model-dependent and known only in the infinitesimal fitness wave limit, e.g., $$c_0 \sim 100$$ in the model of refs. 12,13. Here we treat c0 as a fit parameter in simulations. The wave parameter c has a double interpretation in generic fitness wave models: it relates the total fitness span and the coalescence time to the fitness variance, $$\hat \sigma ^2 = c\sigma ^2$$ and $$N_{\mathrm{e}}^2 = \tilde \sigma ^{ - 2} = c\sigma ^{ - 2}$$. The dependence of c on genome size under phenotypic interference, Eq. 7, is obtained by inserting Eqs. 5 into  4 and neglecting subleading terms $${\cal{O}}({\mathrm{log}}{\kern 1pt} {\mathrm{log}}(Nug))$$. It is important to note that the housekeeping fitness wave describes a genome-wide mutation-selection steady state of constant mean fitness and without adaptive changes12,77, which is consistent with the equilibria of deleterious and beneficial substitutions in each gene30.

### Local and global diversity scaling under phenotypic interference

Equation 19 expresses an important scaling property of the phenotypic interference regime: individual traits evolve in the low-mutation regime and are monomorphic at most times. In contrast, the cumulative variance of all traits defines a polymorphic fitness wave,

$$\frac{{4g{\mathrm{\Delta }}_G}}{{\epsilon _G^2}} = c \, \gtrsim \, g_0,$$
(20)

where we used Eq. 19. A related measure is the complexity of the fitness wave, defined as the average number of beneficial substitutions per coalescence time, $$g\langle v_ + \rangle /\tilde \sigma = (g/\tilde \sigma ){\int}_0^\infty \nu (s){\kern 1pt} v_ + (s){\kern 1pt} ds$$. Here $$\nu (s)$$ is the spectrum of site selection coefficients, which has the average $$2\tilde \sigma$$ by Eq. 3, and v+(s) is the equilibrium beneficial substitution rate at a site of selection coefficient s, which has a near-neutral regime $$v_ + (s) \, \simeq \, u/2$$ for $$s \, \lesssim \, \tilde \sigma$$ and rapidly decreases for $$s \, \gtrsim \, \tilde \sigma$$. Hence, we obtain a wave complexity

$$\frac{{g\langle v_ + \rangle }}{{\tilde \sigma }} \approx \frac{{ug}}{{2\tilde \sigma }} = \frac{c}{4}$$
(21)

with a prefactor of order 1; here we have used Eq. 5. By Eq. 7, the fitness wave measures Eqs. 20, 21 depend only weakly on g.

### Onset of phenotypic interference

Interference effects on quantitative traits can be read off from the scaling of the genetic load, which has the linear form $${\cal{L}} = ug$$ for independently evolving genes and is given by Eq. 9 in the phenotypic interference regime. Equating these relations identifies an onset gene number g0 given by

$$ug_0 = \frac{{4ug_0^2}}{c},$$
(22)

or equivalently by Eq. 8.

### Evolutionary equilibria of stable genes

Equilibrium traits of genes with $$f_0 \gg \tilde \sigma$$ are located in the high-fitness part of the minimal fitness landscape, $$f \simeq f_0[1 - {\mathrm{exp}}( - G/k_{\mathrm{B}}T)]$$. These genes have an average fitness slope

$$f{\prime} = \left( {\frac{{{\mathrm{\Delta }}_f}}{{{\mathrm{\Delta }}_G}}} \right)^{1/2} = \frac{{2\tilde \sigma }}{{\epsilon _G}},$$
(23)

an average trait $${\mathrm{\Gamma }} = k_{\mathrm{B}}T{\kern 1pt} {\mathrm{log}}(f_0\epsilon _G/2\tilde \sigma k_{\mathrm{B}}T) > 0$$, and an average load $${\cal{L}}_{{\mathrm{int}}}(g)$$ given by Eq. 9. This is in accordance with well-known population data of protein stability in microbial populations34: typical genes balance a few kBT above the melting point $$G = 0$$, which corresponds to the shoulder of the fitness landscape above the inflection point (Fig. 1a). The average stability has only a log-dependence on evolutionary rates.

### Phenotypic interference in adaptive evolution

Here we show that the phenotypic interference scaling extends to simple models of adaptive evolution. In the minimal biophysical model, we assume that protein stabilities are still at local evolutionary equilibria of the universal form given by Eq. 3, generating a combined housekeeping component of the fitness variance, $$\sigma _{{\mathrm{hk}}}^2 = g{\mathrm{\Delta }}_f = 2gu\tilde \sigma$$. The global fitness variance acquires an additional contribution from adaptive evolution of other system functions,

$$\sigma ^2 = c\tilde \sigma ^2 = \sigma _{{\mathrm{hk}}}^2 + \phi ,$$
(24)

where $$\phi$$ is the adaptive fitness flux or rate of adaptive fitness gain78. This term quantifies the deviations of the adaptive evolutionary process from housekeeping evolution. Closure of the modified dynamics leads to an increased coalescence rate

$$\tilde \sigma = \frac{{ug}}{c}\left( {1 + \sqrt {1 + \frac{{c\phi }}{{g^2u^2}}} } \right)$$
(25)

$${\cal{L}}_{{\mathrm{int}}} = 2g\tilde \sigma = \frac{{2ug^2}}{c}\left( {1 + \sqrt {1 + \frac{{c\phi }}{{g^2u^2}}} } \right).$$
(26)

Hence, the load retains the leading nonlinearity generated by housekeeping evolution, as given by Eq. 9; this is true even if we assume that $$\phi$$ is proportional to g. At high fitness flux ($$\phi \,\, \gtrsim \,\, g^2u^2/c$$), coalescence becomes dominated by adaptation, leading to a further substantial decrease in the efficacy of selection. This is the likely regime of the laboratory evolution experiments discussed in the main text.

### Fitness loss in evolution experiments

Bacterial lineages from the long-term evolution experiment of ref. 21 have been subject to fitness measurements in diverse environments20. These measurements show heterogeneous combinations of environment-specific fitness gains and losses compared to the ancestor strain. In mutator lines evolved over 50,000 generations a higher average growth rate λ at temperature 30 °C than at temperature 37 °C. To extract a bona fide order-of magnitude estimate of the fitness loss due to attrition of quantitative traits,we evaluate the population-average difference in log growth rate, $${\mathrm{\Delta }}L = \langle {\mathrm{log}}(\lambda _{30^\circ }/\lambda _{37^\circ })\rangle = 0.47/50\,{\mathrm{k}}$$ generations, using the data provided in ref. 79. The observed average number of fixations per stable population clade is about 500/50 k generations22. These data provide the estimates $$\dot {\cal{L}} \approx 10^{ - 5}$$ and $$ug \approx 10^{ - 2}$$ used in the main text, and they inform the model estimate $$\dot {\cal{L}}_{{\mathrm{int}}} \sim 2ug\tilde \sigma$$ with the standard microbe housekeeping value $$\tilde \sigma \sim 10^{ - 4}$$. We note two additional consistency checks: (a) The inferred average deleterious fitness effect per substitution, $$s = \dot {\cal{L}}/(ug) \approx 10^{ - 3}$$ is of order of the observed inverse coalescence time, supporting the conclusion that a large fraction of these changes is effectively neutral22. (b) Non-mutator lines, which have a 100-fold lower mutation rate, do not show evidence of a large proportion of effectively neutral fixations and have significantly lower ΔL.

### Numerical simulations of phenotypic interference

We use a Wright-Fisher process to simulate the evolution of stability traits in a population. A population consists of N individuals with genomes $${\mathbf{a}}^{(1)}, \ldots ,{\mathbf{a}}^{(N)}$$. A genotype $${\mathbf{a}} = ({\mathbf{a}}_1, \ldots ,{\mathbf{a}}_g)$$ consists of g segments; each segment is a subsequence $${\mathbf{a}}_i = (a_{i,1}, \ldots ,a_{i,\ell })$$ with binary alleles $$a_{j,k} = 0,1$$ ($$i = 1, \ldots ,g$$; $$k = 1, \ldots ,\ell$$). A segment a defines a stability trait $$G({\mathbf{a}}) = \mathop {\sum}\nolimits_{k = 1}^\ell {\kern 1pt} {\cal{E}}_ka_k + G_0$$, where G0 is the minimum trait value. The resulting effect distribution of point mutations has as a second moment $$\epsilon _G^2 = \mathop {\sum}\nolimits_{k = 1}^\ell {\kern 1pt} {\cal{E}}_k^2/\ell$$ and a first moment $$\kappa _0\epsilon _G = \mathop {\sum}\nolimits_{k = 1}^\ell {\kern 1pt} {\cal{E}}_k(1 - 2\langle a_k\rangle )/\ell$$, where $$\langle a_k\rangle$$ is the state-dependent probability of a mutation at site k being beneficial and brackets $$\langle .\rangle$$ denote averaging across parallel simulations or time. The genomic fitness is $$f({\mathbf{a}}) = \mathop {\sum}\nolimits_{i = 1}^g {\kern 1pt} f(G({\mathbf{a}}_i);f_{0,i})$$ with f(G) given by Eq. 13 and gene-specific amplitudes f0,i. In each generation, the sequences undergo point mutations with probability $$\mu \tau _0$$ for each site, where $$\tau _0$$ is the generation time, and the sequences of the next generation are drawn by multinomial sampling with a probabilities proportional to $$1 + \tau _0f({\mathbf{a}})$$.

Simulations are performed with parameters $$N = 1000$$, $$N\mu = 0.0125$$, each trait with genomic base of size $$\ell = 100$$, and each site with equal effect $$E_k = 1$$. The population size N is smaller than in natural populations; this is compensated by an increased mutation rate to keep the product at a realistic value. The quantitative trait dynamics is insensitive to the form of the effect distribution45,80. To increase the performance of the simulations, we do not keep track of the full genome. We only store the number of deleterious alleles $$n_i = \mathop {\sum}\nolimits_{k = 1}^\ell a_{i,k}$$ for each trait, we draw mutations with rate $$u = \mu \ell$$, and we assign to each mutation a beneficial change $${\cal{E}}$$ with probability $$n_i/\ell$$ and a deleterious change −$${\cal{E}}$$ otherwise. This procedure produces the correct genome statistics for bi-allelic sites with uniform trait effects $${\cal{E}}_i = {\cal{E}}$$. Simulation data are shown with theory curves for $$\kappa = 1$$, which provide a good fit to all amplitudes; the input $$\kappa _0$$ is different by a factor of order 1 which includes fluctuation effects (Supplementary Methods 1).

Simulations run to reach a stationary state and then have 2000–128,000 consecutive measurements (for largest $$g = 4096$$ to smallest $$g = 4$$) every 400 generations. These intervals exceed the correlation time of the coalescence process. Therefore, measurements of the global observables σ2, $$\tilde \sigma$$, and $${\cal{L}}$$, as well as the local variance δg, decorrelate. Measurements of the other local variables s2, the loss rate, and Δf are averaged over all g genes.

For the simulations of housekeeping evolution in Figs. 2, 3, where we are not explicitly interested in the loss of genes, we use an exponential approximation of the stable regime of the stability fitness landscape. The reason is a limited accessible parameter range in simulations constraining the values of f0 and $$\tilde \sigma$$ due to finite N. We checked that the exponential approximation gives the same results as the full model in the regime $$f_0/\tilde \sigma \gg 1$$, where the gene loss rate in the biophysical landscape is negligible.

For the loss rate measurements of Fig. 4b, a long-term stationary population is maintained by evolving 70% of the traits in a biophysical fitness landscape with selection f0; the remaining 30% of the traits are modeled to be essential with selection 10f0. Gene loss is defined by the condition $$G \, < \, - 3.5k_{\mathrm{B}}T$$. To maintain a constant number of genes, lost genes are replaced immediately with an input trait value $$G \, > \, 0$$.

For simulations with recombination (Fig. 5a), we draw recombination events with rate NR for the whole population from a Poisson distribution. Each recombination event is implemented as one crossover between the genomes of two individuals at a random, uniformly distributed position of the genomes.

### Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.