Introduction

Fitness landscapes have been a very successful metaphor to study evolution. Most simply, the idea of Sewall Wright (1932) to view evolution as a hill-climbing process proved to be appealing and inspired a vast amount of theoretical work in phenotypic and molecular evolution (Orr 2005; de Visser and Krug 2014).

In evolutionary biology, fitness landscapes have been used to study adaptation. In the classical metaphor, an evolving population is abstracted into a particle that navigates in the landscape (Orr 2005). In a strong selection weak mutation regime (Gillespie 1983), populations are restricted to follow evolutionary paths of increasing fitness, which can sometimes even be completely deterministic.

The smoothness or ruggedness of the fitness landscape affect many fundamental features of adaptation and evolution in the landscape, and especially the repeatability of the adaptive process (e.g. Kauffman 1993; Colegrave and Buckling 2005; Chevin et al. 2010; Salverda et al. 2011). Consequently, it is now clear that several aspects of evolutionary processes directly depend on the structure of the fitness landscapes on which the organisms are evolving. Furthermore, Wright’s idea of genotype-fitness landscapes moved from a metaphor to an object of experimental studies, as several fitness landscapes were experimentally resolved (Malcolm et al. 1990; Whitlock and Bourguet 2000; Lunzer et al. 2005; O’maille et al. 2008; Bridgham et al. 2009; Lozovsky et al. 2009; de Visser et al. 2009; Hall et al. 2010; da Silva et al. 2010; Tan et al. 2011; Chou et al. 2011; Khan et al. 2011; Flynn et al. 2013; Jiang et al. 2013; Meini et al. 2015; Mira et al. 2015; Palmer et al. 2015; Bank et al. 2016), including the seminal work of Weinreich et al. (2006) on antibiotic resistant mutations (see de Visser and Krug 2014, for a review). In that regard, characterizing the structure of experimental and model fitness landscapes is a key step in our ability to decipher evolution at the finest scale.

Experimental fitness landscapes make it possible to test predicted properties of evolutionary trajectories through evolution experiments, together with sequencing (Achaz et al. 2014). Currently available fitness landscapes are based on mutations in a few loci, typically 4–10 (Szendro et al. 2013; Weinreich et al. 2013). Since the number of genotypes scales as the product of the number of alleles at all loci, testing all the combinations of mutations in an organism (or even in a protein) is beyond the reach of any reasonable future experiment. However, small landscapes have been resolved and analyzed. An exciting opportunity raised by the recent release of these experimental landscapes is to find the adequate model(s) that is (are) able to generate synthetic model landscapes that share similar features with the observed ones. In that regard, characterizing the structure of small fitness landscapes is a key step for our understanding of evolution in the presence of biological interactions among mutations. These epistatic interactions determine the shape of fitness landscapes.

Epistasis is one of the most relevant features of fitness landscapes. By definition, the (Malthusian) fitness effect of a combination of multiple non-interacting mutations corresponds to the sum of fitness effects of the individual mutations. Hence, epistasis can be defined as any departure from additivity of the fitness effects. Additive landscapes, i.e. landscapes where the fitness effects of all mutations are additive, are the only non-epistatic landscapes. In these landscapes evolution is unconstrained, i.e. populations are always able to reach the highest fitness peak through a sequence of beneficial mutations; exchanging the order of these mutations simply results in another fitness-increasing path to the peak. Other landscapes show a variable amount and structure of epistatic interactions, which may act as evolutionary constraints.

As for other genomic data, the high dimensionality of fitness landscapes make them almost impossible to visualize (although some attempts were proposed, see McCandlish 2011; Brouillet et al. 2015). As a consequence, the analyses of fitness landscapes will mostly rely on measures that capture important features of their structures. Several measures were proposed previously and used to analyze experimental fitness landscapes (reviewed in Szendro et al. 2013; Stadler 2002). The most natural one that was historically used to appreciate the ruggedness of a landscape is its number of peaks (Weinberger 1991). Intriguingly, although ruggedness is more adequately represented by both types of extrema, only little attention has been payed to the number of sinks (genotypes with only fitter neighbors). We therefore suggest that peaks and sinks are both adequate measures of the landscape ruggedness (Fig. 1a). Most models generate synthetic landscapes with the same mean number of peaks and sinks; however, small landscapes can have a different number of peaks and sinks due to random sampling. Other measures such as r/s ratio (ratio of the roughness over additive fitness, see Szendro et al. 2013), fraction of sign epistasis and correlation of fitness effects (Ferretti et al. 2016), as well as path-based measures, such as the number of accessible paths to the highest peak (Weinreich et al. 2006), were also proposed to characterize the structure of fitness landscapes. As all these represent direct or indirect measures of epistasis, all these measures were shown to be pairwise correlated in experimental (Szendro et al. 2013) and model fitness landscapes (Ferretti et al. 2016). In that regard, other measures related to evolution but somehow uncorrelated to the amount of epistasis would be welcome to investigate the nature and the structure of the interactions in the landscapes.

Fig. 1
figure 1

Measures for fitness landscapes. We depict, on an illustrative fitness landscape with three loci of two alleles, the three sets of measures used in this study. a Peaks, here in green, are genotypes with no fitter neighbors whereas sinks, here in red, are genotypes with only fitter neighbors. In this landscape, γ (the correlation between fitness effects of mutations in neighboring genotypes) is 0.53. b there are three direct accessible paths ending at the global peak and starting at its antipode. All paths pass through four genotypes and share the start and endpoint; moreover, two paths share another genotype as well, hence the average fraction of shared genotypes is (2/4 + 2/4 + 3/4)/3 = 0.58. c The two chain trees are composed of four steps (genotypes with a single fitter neighbor), three origins (two for one chain, one for the other) and have a maximal depth (the largest number of steps chained together) of two

In this study, we propose two new sets of measures, related to the short-term predictability of evolutionary outcomes.

The first set looks at the relation of accessible paths. In fact, while the number of accessible paths is recognized as a very important measure of epistasis and constraints (Weinreich et al. 2006), it contains no information on the similarity between these paths (Lobkovsky et al. 2011). The distance between paths is relevant for short-term predictability. In fact, evolution is more predictable when different paths tend to visit the same genotypes. For this reason, one natural measure of path similarity is the average fraction of genotypes visited by both paths (Fig. 1b).

The second set of measures is related to successions (chains) of obligatory beneficial mutations, that are paths going through genotypes with a single fitter neighbor. Their aim is to quantify the amount of evolutionary constraints in the landscapes. In the limit of strong selection and low mutation rates, when a hill climbing evolutionary path reaches a genotype that has a single fitter neighbor, the next step of the path is essentially deterministic. In this case, only one of the mutations available in the landscape can improve the fitness. If several genotypes with a single way out are connected together they form a chain of “obligatory” mutational events that often ends on a peak (Fig. 1c), but could also end on an intermediate genotype.

Chains form a tree structure of obligatory fitness-increasing paths, already noticed in Lobkovsky et al. (2011). We describe several measures on this tree that can be used to characterize the structure of fitness landscapes (Fig. 1c). These measures have been chosen for two reasons. First, they have an immediate interpretation in biology in terms of evolutionary constraints. Second, for small landscapes, these chain measures are only weakly correlated with epistasis, hence they can bring independent information on the structure of the interactions.

In the following, we will define these evolutionary measures in detail, then present analytical and numerical results for experimental and model fitness landscapes and finally discuss their relations with the existing measures of epistasis.

Definitions

Similarity between accessible paths

Many fitness landscape measures of epistasis are correlated across experimental (Szendro et al. 2013) and model (Ferretti et al. 2016) fitness landscapes with the amount and strength of epistasis. However, one could envision that landscapes with similar measures of epistasis could still be quite different in terms of their structure of epistatic interactions, as well as their evolutionary properties. New measures should be sensitive to these differences in landscape structure and the corresponding differences in evolution, while being possibly uncorrelated with changes in fitness effects across different backgrounds or other direct measures of epistasis. Measures that are directly related to properties of evolutionary trajectories are the most promising candidates in this respect.

Here, we focus on a class of statistics that measure the amount of constraints on the possible evolutionary paths. These constraints can take different forms. A well-known example is the reduction in the number of selectively accessible (i.e. fitness-increasing) paths to the highest peak from its antipode in the landscape. The number of accessible paths and the number of accessible direct paths (without any reversion) are classical measures of epistatic constraints, since all the direct paths are accessible in additive landscapes, and their number decreases with epistasis (Weinreich et al. 2006). Interestingly, the probability of finding a direct path has been analytically studied in some models (Franke and Krug 2012; Schmiegelt and Krug 2014; Hegarty et al. 2014; Berestycki et al. 2016; Hwang et al. 2017).

However, the number of accessible paths may not tell the full story. Indeed, one could imagine that epistatic constraints decimate paths at random or, on the contrary, only paths traversing subregions of the genotype space. In the latter case, the constraints would not only reduce the number of accessible paths, but also concentrate the remaining paths across specific genotypes, hence increasing predictability of the evolution.

For biallelic landscapes, we propose a simple measure of these constraints, namely the average pairwise fraction of shared genotypes between two direct accessible paths to the highest peak from its antipode (Fig. 1b). This is an intuitive measure of constraints in the landscape as two accessible paths that often pass through the same genotypes will tend to explore the same regions of the genotype space. The larger the identity among selectively accessible trajectories, the better our ability to predict the steps of an adaptive walk. This measure is reminiscent of previous approaches based on path divergence (Lobkovsky et al. 2011) and on path distances/entropies (Manhart and Morozov 2014), but it is more focused on local constraints rather than on global constraints.

This measure can be generalized to multiallelic landscapes by taking the average of the biallelic measure over all biallelic sublandscapes containing the highest peak. Note that the biallelic measure is undefined when there is either no path or a single path to the highest peak.

Chains of obligatory steps

Another natural measure of evolutionary constraints is given by the abundance and structure of maximally constrained paths in the whole landscape. To characterize these paths in a simple and effective way, we focus on genotypes with only a single beneficial mutation.

All fitness-increasing paths that pass through such genotype will share this mutation. Some landscape have “chains” of consecutive mutations with this property (see Fig. 1c). We now examine the abundance and size of these chains as measures of evolutionary constraints.

We define a chain step as a mutation g → g′ that is the only possible fitness-increasing mutation from the genotype g. Chain steps can occur one after another, forming a linear path of obligatory mutational steps, that we call a chain. Several chain steps can lead to the same genotype, but a genotype can have at most one outgoing chain step. For this reason, chains can form tree-like structures, that we call a chain tree. A chain tree is formally the set of all genotypes, which are forced to evolve along obligatory paths up to a common final genotype, i.e. a maximal set of connected chain steps. This definition implicitly assumes a strong selection regime (Gillespie 1983), as there is no fixation for deleterious or neutral mutations. In an additive landscape, there is a single chain tree containing only those genotypes one mutation away from the (single) peak.

Note that other landscapes can contain more than one chain tree (or none). The set of all chain trees correspond to the “tree component” of the landscape, i.e. the component with deterministic evolution, discussed sporadically in the literature (Lobkovsky et al. 2011; Kauffman and Weinberger 1989). We compute also the number of origins, that correspond to all genotypes that are initial points of a chain (obviously excluding intermediate steps). Finally, we compute also the maximal depth of all chains in the landscape, that is, the maximum number of consecutive steps. In an additive landscape, the depth of the only chain tree is 1.

Model landscapes

Extreme models of epistatic interactions are additive landscapes, corresponding to the case of no epistasis, and the HoC model (Kingman 1978), which corresponds to the case of strong epistasis. The latter model assigns random uncorrelated values to the fitnesses of the genotypes; these values are extracted from a given distribution, e.g. a Gaussian with variance \(\sigma _{{\mathrm {HoC}}}^2\) and mean 0.

We compute the measures on accessible paths and tree chains on two different biallelic model landscapes:

  • The Rough Mount Fuji (RMF) model is a mixture of an additive component and a random contribution from a HoC model. The additive component has fitness effects randomly chosen from a Gaussian distribution of mean μa and variance \(\sigma _{\mathrm a}^2\), while the HoC component is Gaussian with variance \(\sigma _{{\mathrm {HoC}}}^2\). This RMF model is often used to interpolate between a complete additive landscape and a complete random one (Aita et al. 2000). We consider a fixed strength of the random “House of Cards” component; hence, the higher the mean additive effect of mutations μa, the more additive is the model; consequently, when μa → ∞, the model becomes additive, whereas it converges to an House of Cards model when μa → 0.

  • The Ising Mount Fuji (IMF) model (Ferretti et al. 2016) was also introduced to mix an additive component (also with mean μa and variance \(\sigma _{\mathrm a}^2\)) with a fixed Ising-like set of interactions in which each pairs of interacting loci implies a fitness cost if the two loci in the pair do not have the same allele. This cost is assumed to be Gaussian distributed with mean μc and variance \(\sigma _{\mathrm c}^2\) and it could be interpreted biologically as a genetic incompatibility between interacting alleles. In this model, all loci are located along a sequence and each locus interacts only with its neighboring loci.

We compute the mean value for the number of peaks, the number of direct paths and their identity, as well as chain steps and depth for RMF and IMF models, varying the additive contribution of the two models. Results are reported in Fig. 2. We computed also the γ measure of epistasis (Ferretti et al. 2016), which is defined as the correlation of fitness effects of mutations before and after a single change in the genetic background. γ = 1 corresponds to no epistasis while γ = 0 corresponds to strong epistasis.

Fig. 2
figure 2

Epistasis and chain measures for different landscape models with L = 5. The landscapes are build from 104 simulations of an additive component with mean μa and variance σa = μa/10 and an epistatic component. The epistatic component is: (RMF) an HoC model with \(\sigma _{{\mathrm {HoC}}}^2 = 1\); (IMF) an Ising model with mean incompatibility cost μc = 1 and variance \(\sigma _{\mathrm c}^2 = 0.1\). We plot the amount of epistasis as measured by γ and peaks, as well as the number and similarity of direct accessible paths plus the number of chain steps and their maximum depth

Accessible paths

In both models, the number of direct accessible paths is close to zero for strong random interactions (μa → 0: HoC model), but it increases with increasing additive component μa (i.e. decreasing epistasis) as expected, with a steep slope where the fitness variance due to the additive component is comparable to the epistatic one (see Fig. 2).

The similarity of accessible paths increases with the weight of random interactions (i.e. with epistasis) in both models. Since the number of direct paths follows the opposite trend, this suggests an inverse relation between number and similarity of paths. However, this relation is not necessarily informative on the structure of the epistatic interactions, rather it appears to be a common feature of small landscapes.

Chains

In an additive landscape, all the L genotypes around the peak are origins of a chain tree that ends at the peak. This starlike chain tree of depth 1 around the peak is the only chain tree in these landscapes. In contrast, in a HoC model, there are many small and slightly deeper chain trees; their number is of the order of 2L/(L + 1) (see proof in the Appendix). Therefore, it would be tempting to conclude that the number of chain trees (as well as other measures of chains) are correlated to the amount of epistasis. However, this is not the case.

For example, in an RMF model with equal additive contributions, it is possible to derive exact and approximate theoretical results for the mean of some chain measures—most relevant, the number of chain trees, chain steps and origins. Equations are presented in Appendix.

The results are illustrated in Fig. 3, where the mean of several chains measures are reported as a fraction of additive component in the RMF model. Furthermore, even though chain measures for the IMF model cannot be computed analytically, they can be retrieved from simulations (Fig. 2). Interestingly, several of the chain measures appear to be non-monotonic with respect to epistasis in the RMF model and even more so in the IMF model. This non-monotonicity and the weak correlations suggest that these statistics depend weakly on epistasis.

Fig. 3
figure 3

Chain statistics versus epistasis in Rough Mount Fuji landscapes. Expected values for chain measures (number of steps, number of chains, number of origins) as a function of the additive fitness effect μa for a RMF landscape with \(\sigma _{\mathrm a}^2 = 0\), \(\sigma _{{\mathrm {HoC}}}^2 = 1\). Lines represent the analytical mean values, while dots represent the average over 104 simulations

Both the number of steps and the chain depth tend to have a maximum for intermediate values of epistasis, when the contributions of the additive component and of the interactions are comparable (Figs. 2 and 3). Interestingly the type of interaction plays a important role in the structure of the chains, as only short chains are observed for the RMF model, whereas long chains are observed for the IMF model. It is important to notice that the size of the landscape changes the dependence between epistasis and chain measures (depth and abundance) (Fig. 3).

Experimental landscapes

We computed the accessible direct paths (number and similarity) and the chain measure (number of chains, steps and origins, as well as maximum depth) on 38 experimentally biallelic resolved landscapes (Malcolm et al. 1990; Whitlock and Bourguet 2000; Lunzer et al. 2005; Weinreich et al. 2006; O’maille et al. 2008; Bridgham et al. 2009; Lozovsky et al. 2009; de Visser et al. 2009; Hall et al. 2010; da Silva et al. 2010; Tan et al. 2011; Chou et al. 2011; Khan et al. 2011; Flynn et al. 2013; Jiang et al. 2013; Meini et al. 2015; Mira et al. 2015; Palmer et al. 2015). They differ on their number of loci: 1 landscape has 3 loci, 20 have 4 loci, 9 have 5 loci, and 8 have 6 loci and their general features are summarized in Weinreich et al. (2017). Results can be seen in Fig. 4.

Fig. 4
figure 4

Relation between landscape measures in several experimental biallelic landscapes with different number of loci. Top: Similarity between accessible direct paths (computed as path similarity minus its minimum possible value for the landscape) as a function of the fraction accessible direct paths and of epistasis (defined as 1 − γ). Middle and bottom: Number of chain trees, origins, and maximum chain depth versus the number of chain steps or 1 − γ

We also illustrate explicitly these statistics for two landscapes. The first landscape is the landscape of antibiotic (cefotaxim) resistance of β-lactamase mutations in an Escherichia coli plasmid from Weinreich et al. (2006) (Fig. 5, left). The five mutations have a very strong effect that together give a 4 × 104 increase in antibiotic resistance and were therefore selected together. Given the huge selective advantage of the combined mutations, this landscape is single-peaked, where the peak corresponds to the five-point mutant. Furthermore, it also has a single sink, that interestingly does not correspond to the wild type, and is weakly epistatic (γ = 0.85).

Fig. 5
figure 5

Epistasis measures and their corresponding illustrations for two experimental fitness landscapes. The pictures and the statistics were computed using MAGELLAN (Brouillet et al. 2015). The left column represents different views from the Weinreich et al. (2006) landscapes, whereas the right one concerns the csI from de Visser et al. (1997). a On the top panel, we report the general aspect of the landscapes, as well as previous measures of epistasis. b On the middle panel, we depict and report the number of direct accessible paths and their average pairwise similarity. c Finally, we report four measures concerning chains of obligatory beneficial mutations, on a sky view of the landscapes

The second is a landscape of deleterious mutations in Aspergillus niger from Franke et al. (2011) (Fig. 5, right). It is one of the four L = 5 complete sublandscape (csI) of a larger landscape with L = 8 loci (de Visser et al.1997). This landscape is a combination of unrelated deleterious mutations where epistatic interactions were not filtered by natural selection. This landscape has 4 peaks and 2 sinks; in fact, at present it is one of the most rugged among the completely resolved landscapes (γ = 0.33).

Accessible paths

The results in Fig. 4 show a clear negative trend between the number of the paths and their similarity, which is observed across different numbers of loci and both across direct and indirect paths (not shown). This trend is predicted by landscape models and suggests that in small landscapes, epistatic interactions do not block evolutionary trajectories at random; instead, they constrain whole groups of similar trajectories in a clustered way, hence reducing the accessibility of the landscape but someway increasing the predictability of evolutionary trajectories.

The landscape by Weinreich et al. (2006) has 18 accessible direct paths to the highest peak (and a further 9 indirect accessible paths), out of 120 accessible paths for an additive landscape of the same size. Direct paths share slightly more than half of their genotypes (0.504 or 0.533 considering indirect paths as well), versus a minimum of 0.333 (i.e., only the extremes). Similarly, the landscape by de Visser et al. (1997) has 25 accessible paths (all are direct ones) that share on average 0.542 of their genotypes. Both landscapes appear to follow the same pattern outlined in Fig. 4 for the other empirical landscapes.

Chains

The number of chain steps is unrelated to measures of local epistasis, such as γ (Fig. 4), supporting the arguments for a weak dependence of chain statistics on epistasis. Interestingly enough, however, this measure of the total size of chains in the landscape is related to several other chain measures, like the total number of chains, origins and the maximum chain depth, as shown in Fig. 4 (see also Table 1). For a fixed size of the landscape, these measures tend to increase slightly with the number of chain steps, meaning that landscapes with larger chain trees tend to have slightly more chains, and deeper and wider chains as well.

Table 1 Correlation coefficients between landscape measures

The landscape by de Visser et al. (1997) has a single chain of depth 2 with 5 origins and 7 steps, i.e. a slightly larger chain than expected according to the RMF model and to other empirical landscapes. This is not so surprising considering the amount of epistasis in this landscape, which could increase. On the other hand, the landscape by Weinreich et al. (2006) has a relatively small overall amount of epistasis, but it contains two chains with a maximum depth of 3, 6 origins, and 9 steps, revealing one of the largest chain structures observed in empirical landscapes.

Correlations between measures

In this section, we discuss the relations existing between the newly proposed measures and the existing ones.

From a theoretical perspective, the number of chain steps is a close relative of other measures, such as the number of peaks and sinks, since they correspond to three different components of the distribution of beneficial mutations. This stems from the theory of fitness graphs. Fitness graphs correspond to a network of genotypes, where each pair of genotypes related by a beneficial mutation is connected by a directed link (Weinreich et al. 2005; Crona 2014). The out-degree of a node is defined as the number of outgoing links, i.e. the number of beneficial mutations from the corresponding genotype. The out-degree distribution (or equivalently, the distribution of the number of beneficial mutations from a genotype) is simply the distribution of the out-degrees of all nodes. Since there are no fitness-increasing mutations from a peak, peaks are simply nodes with out-degree 0. Therefore, the number of peaks is actually the first bin of the out-degree distribution. Similarly, in a biallelic landscape, the number of sinks correspond to the number of genotypes with out-degree L (the last bin of the out-degree distribution). Chain steps correspond simply to nodes of out-degree 1. Hence, the number of chain steps is the second bin of the out-degree distribution of the fitness graph, and is therefore a natural step further in the characterization of the out-degree distribution and the properties of fitness graphs.

However, this does not mean that number of peaks and of chain steps are necessarily correlated. In fact, we suggest that there is no direct relation between the amount of epistasis and the number of chains.

To evaluate in a more systematic way the relations between these and other measures, we perform a correlation analysis for the 38 experimental landscapes, as well as for 104 random replicates of RMF and IMF model landscapes each. We select the number of peaks, the number of sinks, γ, as well as four of the measures discussed in this article: the similarity of direct accessible paths, plus the number of chain steps and the maximum chain depth. We compute the Spearman correlation coefficients of all pairs of measures in the experimental as well as in the RMF and the IMF landscapes varying the model parameters (in particular, the ruggedness).

The pairwise correlations (Table 1) confirm the intuition that most measures are all strongly correlated, including the similarity between paths. However, the correlation between most measures is notably low with both chain measures. This shows that the chain measures quantify some landscape properties that are not simply correlated to the amount of epistasis.

Interestingly, the lack of correlation of chains with other measures of epistasis is apparent when we compare them across different models of epistatic interactions. In Fig. 2, we depict how the different epistasis and chain measures depend on the amount of epistasis and the underlying epistatic model (HoC vs. Ising). The comparison between measures of epistasis and chains shows clearly that there is no simple relation between them, as these models of interactions have different behavior. The size of chains in the landscape depends strongly on the nature of the epistatic interactions and shows a non-monotonic behavior with the amount and strength of epistasis, as larger chains are more probable for intermediate levels of epistasis. Hence chain measures could provide useful information beyond the existing measures of epistasis.

Discussion

In this work, we presented two new sets of landscape measures, which have a simple interpretation and cover a range of potential applications. Both these measures are based on the sign of the fitness effects, i.e. on the fitness graph, hence they are well suited for landscapes where only fitness ranks or the beneficial/deleterious nature of the fitness effects can be experimentally determined. These measures and the others discussed here have been implemented in MAGELLAN, a graphical tool to explore small fitness landscapes (Brouillet et al. 2015).

The first set of measures—the fraction of genotypes shared between paths—is an interesting and revealing measure, but has several limitations. The first is precisely the strong anti-correlation between the number of accessible paths and their similarity, which reduces the usefulness of the latter as an independent measure, at least for small landscapes. The second is the impossibility to define path similarity in any meaningful way in cases where epistasis is so strong that there are no accessible paths, or just one. Finally, this measure is restricted to biallelic landscapes or sublandscapes.

The second set of measures is a more fine-grained measure of evolutionary constraints, based on genotypes with a single beneficial mutation available. These genotypes represent steps in “obligate” fitness-increasing paths on the landscape, funnelling different evolutionary paths through the same mutation. Sequences of such beneficial mutations represent chains of obligatory fitness-increasing mutations.

Chains are a natural choice for a measure of evolutionary constraints in the framework of fitness graphs. In particular, chain steps are strongly related to the distribution of beneficial mutations from a genotype, that is the out-degree distribution of the fitness graph corresponding to the landscape. In fact, as discussed before, there is a one-to-one correspondence between chain steps and genotypes with a single fitness-increasing mutation. The number of fitness-increasing mutations from a given genotype is the out-degree of the genotype in the fitness graph. Therefore, chain steps are simply nodes of out-degree 1. In this respect, this statistics is a close relative of one of the most well-studied measures of fitness landscapes, namely the number of peaks, which correspond to genotypes without beneficial mutations (i.e. nodes of out-degree 0).

Both the number of steps and related measures like the depth of chains have an immediate evolutionary interpretation, in terms of evolvability and constraints, yet they show peculiar properties. The most relevant one is that they are often non-monotonic or weakly correlated with the amount of epistasis (i.e. measured by the number of peaks, sinks or γ), as we have shown using simulations and analytically. The lack of correlation between epistasis and chains shows that these new statistics can be used to obtain independent information about the nature of the interaction, instead of their abundance.

Interestingly, the number of chain steps, the chain depth (Fig. 2) and even the total number of accessible paths (Szendro et al. 2013) seem to be peaked at intermediate weight of epistatic interactions. All these measures are related both to evolvability—deep chains show that fitness-increasing paths are open—and to constraints—chains represent obligatory paths in evolution. This suggests an evolutionary interpretation of these peaks in terms of the tradeoff between evolvability (higher at low epistasis) and constraints (stronger with high epistasis). Previous measures that were shown numerically and by some analytical approximations to be non-monotonic include the total number of accessible paths (Szendro et al. 2013) and the number of exceedances, i.e. the number of available fitness-increasing mutations after an evolutionary step (Neidhart et al. 2014), which are also related to evolvability and constraints. It is worth mentioning that chains and exceedance are both related to the out-degree distribution after one step of increasing fitness. This suggests that beyond the out-degree distribution, it is perhaps worth characterizing the sequences of out-degrees, e.g. the out-degree distribution along evolutionary paths.

Interestingly, many real landscapes tend to have longer chains than expected according to theoretical landscape models with random epistasis. This is the case for both experimental landscapes from Weinreich et al. (2006) and de Visser et al. (2009). This is especially apparent and interesting in the former landscape, since it implies some highly non-random structure of epistatic interactions for this set of mutations. This result has been found independently by randomization tests (Weinreich et al. 2013) and cannot be seen in complex measures of epistasis like the Fourier spectrum (Stadler 1996; Szendro et al. 2013) or the decay of correlations γd (Ferretti et al. 2016), however our chain measures were able to capture this signal. A possible explanation for this is the heterogeneity between the functional and selective effect of different mutations in real landscapes.

Chains are natural measures both from the evolutionary point of view and from the mathematical point of view. For most small empirical landscapes currently available (biallelic and with L ≈ 4–6 loci), peaks and chains summarize most of the information about local evolutionary constraints and especially about the distribution of beneficial mutations. However, for larger landscapes or multi-allelic ones, other measures could be more informative about local and global constraints. This issue is discussed in details in the first two sections of the Appendix: in the first one we explain the reasons for the weak dependence of the number of chain steps on epistasis in terms of the behavior of the distribution of the number of beneficial mutations, while in the second we employ this framework to provide a guide to the choice of measures weakly related to epistasis in large multi-allelic landscapes. Global constraints can be also summarized by generalized chains, which are sets of “obligate” genotypes along evolutionary paths. These generalized chains describe non-local constraints and represent an interesting and potentially new source of information on the evolutionary dynamics in larger landscapes.

We are still far from predicting evolution on real landscapes based on their measures, partly because of the incomplete knowledge of the structure of real landscapes, and partly because of the lack of measures with a natural evolutionary interpretation. In the future, we expect to witness a strong increase in the number of publications of new experimentally resolved empirical landscapes. The measures that we propose here will therefore find applications in the understanding and classification of these landscapes, as well as in studies of model landscapes. The new chain measures are able to reveal information that is uncorrelated with the strength or amount of epistasis in the landscape, hence they will represent useful tools to discriminate and classify fitness landscapes. Chains also highlight the interplay of constraints and evolvability that influence evolution on complex landscapes.