Computation and Simulation of Evolutionary Game Dynamics in Finite Populations

Hindersin, Laura; Wu, Bin; Traulsen, Arne; García, Julian

doi:10.1038/s41598-019-43102-z

Article
Open access
Published: 06 May 2019

Laura Hindersin¹^na1,
Bin Wu²^na1,
Arne Traulsen ORCID: orcid.org/0000-0002-0669-5267¹^na1 &
…
Julian García³^na1

Scientific Reports volume 9, Article number: 6946 (2019)

Abstract

The study of evolutionary dynamics increasingly relies on computational methods, as more and more cases outside the range of analytical tractability are explored. The computational methods for simulation and numerical approximation of the relevant quantities are diverging without being compared for accuracy and performance. We thoroughly investigate these algorithms in order to propose a reliable standard. For expositional clarity we focus on symmetric 2 × 2 games leading to one-dimensional processes, noting that extensions can be straightforward and lessons will often carry over to more complex cases. We provide time-complexity analysis and systematically compare three families of methods to compute fixation probabilities, fixation times and long-term stationary distributions for the popular Moran process. We provide efficient implementations that substantially improve wall times over naive or immediate implementations. Implications are also discussed for the Wright-Fisher process, as well as structured populations and multiple types.

Introduction

Theoretical models of evolutionary games in finite populations typically require numerical procedures or simulations^1,2,3,4,5. This is even the case when analytical results exist, as these are often difficult to interpret or confined to specific limits^{6,7,8,9,10,11,12,13}. Simulations as well as numerical approximations are therefore common in the field, but far from being standardised. There are different computational methods to assess the key quantities in evolutionary game dynamics. Here we focus on studying the popular Moran process⁶. The purpose of this paper is to give an overview of such computational methods and to compare their limitations and scalability. We provide algorithms in pseudo-code as well as the source code for all the procedures that we study.

The Moran process¹⁴ and the Wright-Fisher process¹⁵ have become popular models to describe how phenotypes change over time by evolution. Both processes have their roots in population genetics. Only recently, they were introduced to evolutionary game dynamics in finite populations^6,16,17. In each time step of the Moran process, an individual is selected proportional to its fitness and produces an identical offspring. Subsequently, another randomly chosen individual is removed from the population. In the Wright-Fisher process, all individuals produce a large number of identical offspring based on their fitness. Then, N of the offspring individuals are selected randomly to become the next generation population. We are focusing on computations for the Moran process here. Other processes are considered as possible extensions in the discussion. We also focus on discrete-time processes. Continuous-time processes require different simulation techniques (e.g. based on the Gillespie algorithm¹⁸) that are beyond our scope, see¹⁹ for a systematic comparison of these processes.

In evolutionary game dynamics, interactions between types are defined by a payoff matrix. We consider a population of N individuals with two types or strategies, A and B. The payoff matrix is given by $(\begin{array}{cc}a & b\\ c & d\end{array})$ and describes the payoff that each type gets from interaction with its own and the other type respectively. If two A’s interact, they both get payoff a. If an A meets a B, A gets b, whereas B gets c. If two B’s interact, they both get payoff d.

A key quantity is the fitness, which measures how successfully a type (e.g. phenotype/strategy) reproduces. In the context of evolutionary dynamics in finite populations, it has a direct interpretation in terms of relative birth and death rates²⁰. In the Moran process, the selection mechanism can be thought of as a roulette wheel, in which every field represents one individual and the higher its fitness, the larger the field on the wheel²¹. In classic population genetics, the fitness usually depends only on the focal individual's type. In evolutionary game theory, however, it is often partitioned into two parts f = f₀ + βπ: a constant background fitness which is independent of other individuals, e.g. f₀ = 1, and a payoff which is dependent on others, π.

The selection intensity β represents how strongly fitness depends on the game. For strong selection, $\beta N\gg 1$, the evolutionary game dominates the dynamics. In a weak selection regime on the other hand, $\beta N\ll 1$, the dynamics are mostly stochastic^22,23.

In general, any payoff-to-fitness mapping f is assumed to avoid negative fitness in games where payoffs can be negative. Additionally, it should be an increasing function of the payoff^23,24. For the linear payoff-to-fitness mapping, β has to be bounded in order to keep the fitness positive. By using an exponential payoff-to-fitness mapping, f = exp(βπ)^23,25, this bound on β is not necessary. We will focus on this mapping here. It is standard in a range of applications^{26,27,28,29,30} and analytically convenient because it turns a product into a sum in the exponent, namely the product of transition ratios that appears when calculating the fixation probability. At the same time, the exponential mapping approximates the results of the simple linear payoff-to-fitness mapping when β is sufficiently small.

The Moran process and the Wright-Fisher process share a lot of similarities: (a) they are both represented by absorbing Markov chains, (b) they keep the population size constant, (c) they have the same absorbing states where every individual has the same strategy (either all A or all B). In particular, because of (c), it is interesting to ask for the probability that each of these absorbing states is reached, given a certain initial condition. We focus on the probability that a single mutant takes over the population of wild-type individuals, i.e. the fixation probability in a population of two types.

Besides the fixation probability, the time it takes a mutant to take over a population is of interest. The average unconditional fixation time is defined as the number of time steps it takes starting from one mutant until extinction or fixation of the mutants. As the population is assumed to be finite, the process hits one of the absorbing boundaries after finite time with probability 1. Another interesting quantity of the process is the conditional fixation time. It is given by the time it takes one mutant to take over the population, given that it does succeed. For simulating the conditional fixation time, this means only keeping track of the time steps of realisations where the mutant takes over and discarding the runs where the mutants go extinct.
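The three quantities just defined can be estimated from repeated realisations, as in the following minimal Python sketch. It assumes the exponential payoff-to-fitness mapping and payoffs averaged over the other N − 1 individuals. The function names `simulate_from_single_mutant` and `estimate_quantities` are illustrative and are not taken from the paper's source code.

```python
import math
import random

def simulate_from_single_mutant(N, beta, a, b, c, d, rng):
    """Run one Moran realisation from a single A mutant.

    Returns (True if A fixed, number of time steps until absorption).
    """
    i, steps = 1, 0
    while 0 < i < N:
        # Assumed payoffs: average over the other N - 1 individuals.
        pi_A = (a * (i - 1) + b * (N - i)) / (N - 1)
        pi_B = (c * i + d * (N - i - 1)) / (N - 1)
        f_A, f_B = math.exp(beta * pi_A), math.exp(beta * pi_B)
        # Birth: one individual is chosen proportional to fitness.
        birth_is_A = rng.random() < i * f_A / (i * f_A + (N - i) * f_B)
        # Death: one individual is chosen uniformly at random.
        death_is_A = rng.random() < i / N
        i += int(birth_is_A) - int(death_is_A)
        steps += 1
    return i == N, steps

def estimate_quantities(N, beta, a, b, c, d, runs, seed=0):
    """Estimate fixation probability, unconditional and conditional fixation time."""
    rng = random.Random(seed)
    all_times, fixed_times = [], []
    for _ in range(runs):
        fixed, steps = simulate_from_single_mutant(N, beta, a, b, c, d, rng)
        all_times.append(steps)
        if fixed:
            # Only successful runs count towards the conditional time.
            fixed_times.append(steps)
    phi = len(fixed_times) / runs
    tau = sum(all_times) / runs
    tau_A = sum(fixed_times) / len(fixed_times) if fixed_times else float("nan")
    return phi, tau, tau_A
```

Note that the conditional estimate uses only the subset of runs in which the mutant fixes, so its statistical error can be much larger than that of the unconditional estimate when fixation is rare.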

When we introduce mutations, the homogeneous population states are not absorbing anymore. In that case, we are interested in the stationary distribution of the process. For every state of the population, the stationary probability distribution gives the probability that the process is at that state in the long run.

Another process we will mention is the pairwise comparison process with the Fermi function^23,31,32. Instead of letting an individual reproduce based on fitness, a pair of a focal individual and a role model are randomly chosen in each time step. The focal individual evaluates its payoff difference using an imitation function. This determines the probability that the focal individual adopts the strategy of the role model. As this process is a simple birth-death process, it shares the same complexity as the Moran process for computing the above mentioned quantities.

It is important to note that there are also alternative approaches to evolutionary dynamics in finite populations, other than the ones we discuss here. In particular, stochastic differential equations are useful to derive mean-field predictions from individual based models if the population size is finite, but large^33,34,35. These alternative methods may be particularly useful when the population size is large enough that it renders the methods we discuss unfeasible due to computational complexity, or when specific features such as spatial structure combine with large population sizes^36,37,38. Note, however, that as the population size becomes very large the stochastic effects we are concerned with become less important.

Thinking about an evolutionary process in a computational way can deliver insights into the details of the process. This becomes apparent, for example, when thinking about the wall time required to simulate a process in order to reach a target precision. The wall time is the actual time that elapses between the start and the end of a program. When simulating an evolutionary process, the wall time is composed of the number of realisations and the time each realisation takes before the process hits an absorbing state, see the conceptual Fig. 1. A very high fixation probability requires few realisations (see Section When to stop the simulation?). However, there are situations where high fixation probability occurs together with high fixation time, which entails that it takes longer to simulate each realisation. Understanding this tradeoff, in which a high fixation probability calls for few realisations while a high fixation time makes each realisation long, can be insightful and useful.

Methods

We discuss three methods to calculate the fixation probabilities, the fixation times, and the stationary distribution. These three main methods, which also define the underlying structure in this paper, are:

(i) a direct analytical solution
(ii) a numerical approach based on the transition matrix of the associated Markov chains
(iii) Monte Carlo simulations.

As our results are intimately connected to the details of implementation, further details are given in the results sections.

Analytical solutions are usually the most elegant, but they are often convoluted in practice and only limiting cases, for example arising from small intensity of selection β, can be interpreted easily. The naive implementations of the full analytical results are sometimes inefficient and can be computationally more expensive than smart simulations.

Alternatively, the numerical approach based on the transition matrix of the Markov chain can be useful and can feel natural when thinking about the process in terms of transition probabilities. However, as the transition matrix size grows quadratically with population size, this computational approach becomes unfeasible for large populations in terms of memory³⁹. This happens even sooner for graph-structured populations^40,41, where the transition matrix can be of size 2^N × 2^N. Making use of sparse solvers for banded matrices, however, leads to a computation time that scales linearly with population size in the case without population structure.

To discuss these methods, we focus mostly on the Moran process, mentioning the alternative Wright-Fisher process occasionally as an extension.

The source code and demo notebooks can be downloaded from http://bit.ly/finite_computation_ed.

Results

Direct analytical calculation

Fixation probability

The direct analytical calculation is based on the solution of a recursive equation to obtain the desired quantities. Let us show this by using the Moran process with two strategies, A and B, as an example. The payoff matrix is given by $(\begin{array}{cc}a & b\\ c & d\end{array})$. If two A’s interact, they both get payoff a. If an A meets a B, A gets b, whereas B gets c. If two B’s interact, they both get payoff d. Let i be the number of strategy A individuals in a population of size N. For the Moran process, in every time step, i can only increase or decrease by one or stay the same. Let us denote Tⁱ⁺ as the probability that i increases by one and Tⁱ⁻ as the probability that i decreases by one.

Here, we are interested in the probability ${\varphi }_{A}^{i}$ that the population reaches fixation of A when initially there are i strategy A individuals in the population. Without mutations, the boundary conditions are given by ${\varphi }_{A}^{0}=0$ and ${\varphi }_{A}^{N}=1$: If there are only B-strategists, the probability that the A-strategists take over is zero. Similarly, if the population consists of only type A, their fixation probability is one. Based on the backward Kolmogorov equation^8,21, we have

$${\varphi }_{A}^{i}={T}^{i-}{\varphi }_{A}^{i-1}+(1-{T}^{i-}-{T}^{i+}){\varphi }_{A}^{i}+{T}^{i+}{\varphi }_{A}^{i+1}.$$

(1)

Solving the recursion yields the fixation probability of a single type A individual invading a population^6,21,42

$${\varphi }_{A}^{1}={(\sum _{k=0}^{N-1}\prod _{i=1}^{k}{\gamma }^{i})}^{-1},$$

(2)

where γⁱ = Tⁱ⁻/Tⁱ⁺ and where the empty product is defined as 1.

For the Moran process with a payoff-to-fitness mapping $f={e}^{\beta {\pi }^{i}}$, let us denote ${\pi }_{A}^{i}$ and ${\pi }_{B}^{i}$ as the payoff for a single strategy A and B individual when there are i individuals playing strategy A. These payoffs determine the transition probabilities via the fitness^6,25,

$$\begin{array}{rcl}{T}^{i+} & = & \frac{i{e}^{\beta {\pi }_{A}^{i}}}{i{e}^{\beta {\pi }_{A}^{i}}+(N-i){e}^{\beta {\pi }_{B}^{i}}}\frac{N-i}{N},\\ {T}^{i-} & = & \frac{(N-i){e}^{\beta {\pi }_{B}^{i}}}{i{e}^{\beta {\pi }_{A}^{i}}+(N-i){e}^{\beta {\pi }_{B}^{i}}}\frac{i}{N}.\end{array}$$

(3)

This leads to ${\gamma }^{i}=\exp [\beta ({\pi }_{B}^{i}-{\pi }_{A}^{i})]$.
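As a rough illustration of equation (3), the following Python sketch computes the two transition probabilities and checks the ratio numerically. The payoff expressions assume that individuals do not interact with themselves, which matches the expressions the paper gives for TRANSITION-RATIO. The function names are illustrative.

```python
import math

def payoffs(N, a, b, c, d, i):
    """Average payoffs (pi_A, pi_B) when i individuals play A (no self-interaction)."""
    pi_A = (a * (i - 1) + b * (N - i)) / (N - 1)
    pi_B = (c * i + d * (N - i - 1)) / (N - 1)
    return pi_A, pi_B

def transition_probabilities(N, beta, a, b, c, d, i):
    """Return (T^{i+}, T^{i-}) from equation (3) with fitness exp(beta * payoff)."""
    pi_A, pi_B = payoffs(N, a, b, c, d, i)
    f_A, f_B = math.exp(beta * pi_A), math.exp(beta * pi_B)
    total = i * f_A + (N - i) * f_B
    t_plus = (i * f_A / total) * ((N - i) / N)    # A reproduces, B dies
    t_minus = ((N - i) * f_B / total) * (i / N)   # B reproduces, A dies
    return t_plus, t_minus

def transition_ratio(N, beta, a, b, c, d, i):
    """gamma^i = T^{i-} / T^{i+} = exp(beta * (pi_B - pi_A))."""
    pi_A, pi_B = payoffs(N, a, b, c, d, i)
    return math.exp(beta * (pi_B - pi_A))
```

The common factors i(N − i)/N and the normalisation of the fitness cancel in the ratio, which is why only the payoff difference remains in the exponent.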

It is of common interest to ask for which selection intensity the fixation probability is greater than that of the neutral case, where we have ${\varphi }_{A}^{1}(\beta =0)=1/N$. Theoretical insights are difficult to obtain based on equation (2). This is because the equation ${\varphi }_{A}^{1}(\beta )=1/N$ is typically transcendental for non-linear payoff-to-fitness mapping. Even for the linear payoff-to-fitness mapping, the equation contains a polynomial of order N in the denominator. Weak selection, i.e. $\beta \ll 1$, can provide substantial further insight^{6,24,28,43,44} because it usually simplifies analytical calculations.

The fixation probability can then be approximated by Taylor expansion

$${\varphi }_{A}^{1}(\beta )\approx \frac{1}{N}+\frac{\beta }{{N}^{2}}\,\underbrace{f^{\prime}(0)}_{\ge 0}\,\underbrace{\sum _{k=0}^{N-1}\,\sum _{i=1}^{k}\,({\pi }_{A}^{i}-{\pi }_{B}^{i})}_{D}.$$

(4)

When D > 0, ${\varphi }_{A}^{1}(\beta ) > 1/N$, such that selection favors the invasion of strategy A under weak selection. An alternative approximation is to replace the sum and the product in equation (2) by integrals, but the resulting expression is still difficult to interpret²². However, if we are interested in exact results for general selection intensities and population sizes, we need to resort to numerical techniques.
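The sign criterion can be checked numerically. The sketch below computes the double sum D from equation (4) in a single pass and compares its sign with the deviation of the exact fixation probability from 1/N at a small β. It assumes the exponential mapping and payoffs without self-interaction; the function names are illustrative.

```python
import math

def payoff_difference(N, a, b, c, d, i):
    """pi_A^i - pi_B^i with i A-players, assuming no self-interaction."""
    pi_A = (a * (i - 1) + b * (N - i)) / (N - 1)
    pi_B = (c * i + d * (N - i - 1)) / (N - 1)
    return pi_A - pi_B

def weak_selection_D(N, a, b, c, d):
    """D = sum_{k=0}^{N-1} sum_{i=1}^{k} (pi_A^i - pi_B^i), computed in one pass."""
    D, inner = 0.0, 0.0
    for k in range(1, N):          # the k = 0 term is an empty sum
        inner += payoff_difference(N, a, b, c, d, k)
        D += inner
    return D

def fixation_probability(N, beta, a, b, c, d):
    """Exact fixation probability of a single A mutant, equation (2)."""
    total, product = 1.0, 1.0      # the k = 0 term is the empty product
    for k in range(1, N):
        product *= math.exp(-beta * payoff_difference(N, a, b, c, d, k))
        total += product
    return 1.0 / total
```

For a game in which A always earns more than B, such as a = 3, b = 2, c = 1, d = 1, D is positive and the exact fixation probability at small β lies above 1/N.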

Having the formula at hand, we transform equation (2) into Algorithm 1 to compute this quantity.

Here, the function TRANSITION-RATIO(N, β, a, b, c, d, k) implements the formula ${e}^{\beta ({\pi }_{B}-{\pi }_{A})}$ with ${\pi }_{A}=\frac{a(k-1)+b(N-k)}{N-1}$ and ${\pi }_{B}=\frac{ck+d(N-k-1)}{N-1}$, the payoffs of type A and B, respectively. This naive implementation results in two nested loops. Note that we can store the product (line 6), such that we can reduce to a single loop. A pseudo-code that avoids a second loop is given by Algorithm 2.

Computing the ratio of transition probabilities in line 4 of DIRECT-FIXATION-PROBABILITY() does not depend on N, thus we obtain a scaling in ${\mathscr{O}}(1)$.

The loop is entered N − 1 times. Thus, the time-complexity of the whole computation is of order ${\mathscr{O}}(N)$.

Note that the naive implementation in Algorithm 1, with its two nested loops, computes the transition ratio γⁱ a total of N(N − 1)/2 times, which makes the computation quadratic in N and therefore less efficient.
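The difference between the two variants can be sketched in Python as follows. This is a hedged reading of the two pseudo-code algorithms based on the description in the text, not a copy of the paper's code; the function names are illustrative.

```python
import math

def transition_ratio(N, beta, a, b, c, d, k):
    """gamma^k = exp(beta * (pi_B - pi_A)) with k A-players."""
    pi_A = (a * (k - 1) + b * (N - k)) / (N - 1)
    pi_B = (c * k + d * (N - k - 1)) / (N - 1)
    return math.exp(beta * (pi_B - pi_A))

def fixation_probability_naive(N, beta, a, b, c, d):
    """Equation (2) with two nested loops: N(N - 1)/2 ratio evaluations."""
    total = 0.0
    for k in range(0, N):
        product = 1.0              # empty product for k = 0
        for i in range(1, k + 1):
            product *= transition_ratio(N, beta, a, b, c, d, i)
        total += product
    return 1.0 / total

def fixation_probability(N, beta, a, b, c, d):
    """Equation (2) with a running product: N - 1 ratio evaluations."""
    total, product = 1.0, 1.0      # k = 0 term
    for k in range(1, N):
        product *= transition_ratio(N, beta, a, b, c, d, k)  # reuse previous product
        total += product
    return 1.0 / total
```

Both functions return the same value up to rounding; only the number of ratio evaluations differs.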

The above computation works for arbitrary intensity of selection β. Weak selection is often used as it leads to closed formulas as shown in equation (4), but if numerics are required, the term D in equation (4) will still lead to a linear time complexity computation. The weak selection approximation can be theoretically insightful, in particular when the sums can be solved analytically (such as for two-player matrix games or multiplayer games⁴⁵), but it is in general not computationally more efficient than the case of general β.

Unconditional fixation time

We can also use a direct analytical computation for computing the average number of steps required for fixation. The expected unconditional fixation time τⁱ, starting from i individuals of type A, can be recursively calculated from^8,21

$${\tau }^{i}=1+{T}^{i-}{\tau }^{i-1}+(1-{T}^{i-}-{T}^{i+}){\tau }^{i}+{T}^{i+}{\tau }^{i+1},$$

(5)

where the transition probabilities Tⁱ⁻ and Tⁱ⁺ are given by equation (3). The boundary conditions are τ⁰ = τ^N = 0. Solving the recursion, one obtains the expected unconditional fixation time τ¹, starting from a single individual²¹

$${\tau }^{1}={\varphi }_{A}^{1}\,\sum _{k=1}^{N-1}\,\sum _{l=1}^{k}\,\frac{1}{{T}^{l+}}\prod _{m=l+1}^{k}\,\frac{{T}^{m-}}{{T}^{m+}},$$

(6)

where ${\varphi }_{A}^{1}$ is given by (2). Again one can obtain additional insights from a weak selection approximation of this quantity^24,46,47,48.

For computational reasons (explained in Supplementary Method Calculating the unconditional fixation time), we rewrite the above equation as

$${\tau }^{1}={\varphi }_{A}^{1}\,\sum _{l=1}^{N-1}\,\frac{{R}^{l}}{{T}^{(N-l)+}}.$$

(7)

where R^l can be calculated recursively from

$${R}^{l+1}=1+{\gamma }^{N-l}{R}^{l},$$

(8)

with R¹ = 1 (see Supplementary Method Calculating the unconditional fixation time). This simplification holds for general selection intensity β.

Using equations (7) and (8), the computation is simplified and can be executed as presented in Algorithm 3.

Algorithm 3 uses the function TRANSITION-UP(N, β, a, b, c, d, l), which implements the probability that the number of A-strategists increases by one in one time step. This is given by T⁺ as follows:

$${T}^{+}=\frac{l\,{f}_{A}}{l\,{f}_{A}+(N-l){f}_{B}}\frac{N-l}{N}$$

where fitnesses are ${f}_{A}={e}^{\beta {\pi }_{A}}$ and ${f}_{B}={e}^{\beta {\pi }_{B}}$, with payoffs

${\pi }_{A}=\frac{a(l-1)+b(N-l)}{N-1}$ and ${\pi }_{B}=\frac{cl+d(N-l-1)}{N-1}$.

The modules TRANSITION-RATIO() and DIRECT-FIXATION-PROBABILITY() are defined as in Algorithm 2. The complexity of calculating the fixation probability in line 3 of UNCONDITIONAL-FIXATION-TIME() is of order N. The computation time of the transition ratio γ does not depend on N, so it has constant time complexity. The summation loop is entered N − 1 times. Therefore, the time-complexity of the whole calculation is of the order ${\mathscr{O}}(N)$.
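A minimal Python sketch of the linear-time computation via equations (7) and (8) is given here, together with a direct evaluation of equation (6) for comparison. It is an illustrative reading of the pseudo-code, and the function names are assumptions.

```python
import math

def payoffs(N, a, b, c, d, l):
    pi_A = (a * (l - 1) + b * (N - l)) / (N - 1)
    pi_B = (c * l + d * (N - l - 1)) / (N - 1)
    return pi_A, pi_B

def transition_ratio(N, beta, a, b, c, d, l):
    pi_A, pi_B = payoffs(N, a, b, c, d, l)
    return math.exp(beta * (pi_B - pi_A))

def transition_up(N, beta, a, b, c, d, l):
    pi_A, pi_B = payoffs(N, a, b, c, d, l)
    f_A, f_B = math.exp(beta * pi_A), math.exp(beta * pi_B)
    return l * f_A / (l * f_A + (N - l) * f_B) * (N - l) / N

def fixation_probability(N, beta, a, b, c, d):
    total, product = 1.0, 1.0
    for k in range(1, N):
        product *= transition_ratio(N, beta, a, b, c, d, k)
        total += product
    return 1.0 / total

def unconditional_fixation_time(N, beta, a, b, c, d):
    """Equations (7) and (8): a single loop over l = 1, ..., N - 1."""
    phi = fixation_probability(N, beta, a, b, c, d)
    R, total = 1.0, 0.0            # R^1 = 1
    for l in range(1, N):
        total += R / transition_up(N, beta, a, b, c, d, N - l)
        R = 1.0 + transition_ratio(N, beta, a, b, c, d, N - l) * R  # R^{l+1}
    return phi * total

def unconditional_fixation_time_naive(N, beta, a, b, c, d):
    """Equation (6) evaluated term by term, for checking only."""
    phi = fixation_probability(N, beta, a, b, c, d)
    total = 0.0
    for k in range(1, N):
        for l in range(1, k + 1):
            product = 1.0
            for m in range(l + 1, k + 1):
                product *= transition_ratio(N, beta, a, b, c, d, m)
            total += product / transition_up(N, beta, a, b, c, d, l)
    return phi * total
```

In the neutral case β = 0 the result reduces to N times the harmonic number of N − 1, which follows from inserting Tˡ⁺ = l(N − l)/N² and γ = 1 into equation (6) and can serve as a quick sanity check.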

Conditional fixation time

A Master equation for the expected conditional fixation time ${\tau }_{A}^{i}$, starting in state i and fixating in state N, is given by^8,21,49

$${\varphi }_{A}^{i}{\tau }_{A}^{i}=(1-{T}^{i+}-{T}^{i-}){\varphi }_{A}^{i}{\tau }_{A}^{i}+{T}^{i-}{\varphi }_{A}^{i-1}({\tau }_{A}^{i-1}+1)+{T}^{i+}{\varphi }_{A}^{i+1}({\tau }_{A}^{i+1}+1),$$

(9)

where ${\tau }_{A}^{0}=0$ and ${\tau }_{A}^{N}=0$. Solving the recursion yields^8,21,42,50

$${\tau }_{A}^{1}=\sum _{k=1}^{N-1}\,\sum _{l=1}^{k}\,\frac{{\varphi }_{A}^{l}}{{T}^{l+}}\,\prod _{m=l+1}^{k}\,\frac{{T}^{m-}}{{T}^{m+}}.$$

(10)

The above equation can be rewritten as

$${\tau }_{A}^{1}=\sum _{l=1}^{N-1}\,\frac{{\psi }_{A}^{l}}{{T}^{(N-l)+}}{R}^{l}$$

(11)

for general β, where the following recursions hold

$$\begin{array}{ll}{R}^{l+1}=1+{\gamma }^{N-l}{R}^{l}, & \text{with}\ {R}^{1}=1,\\ {\psi }_{A}^{h}={\psi }_{A}^{h-1}-{\varphi }_{A}^{1}\left(\prod _{m=1}^{N-h}{\gamma }^{m}\right), & \text{with}\ {\psi }_{A}^{1}={\varphi }_{A}^{N-1}\end{array}$$

(12)

see Supplementary Method Calculating the conditional fixation time. This is expressed in Algorithm 4.

The modules DIRECT-FIXATION-PROBABILITY(), TRANSITION-RATIO() and TRANSITION-UP() are defined as in Algorithm 2 and in Algorithm 3.

The complexity of calculating the fixation probability is of order ${\mathscr{O}}(N)$ (line 2). Again, the computation time of the transition ratio does not depend on N. We have to use memoisation to store the product of transition ratios γ. Therefore we have two loops that are each entered N − 1 times. The time-complexity of the whole calculation is still ${\mathscr{O}}(N)$. Note that it takes much longer if we calculate the conditional fixation time directly based on equation (10), where one would naively come up with an algorithm that implements each sum and product separately, leading to ${\mathscr{O}}({N}^{3})$.
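The following Python sketch illustrates equations (11) and (12) with stored prefix products, next to a term-by-term evaluation of equation (10) for checking. It is an illustrative reading rather than the paper's implementation. The starting value ψ¹ = φ^{N−1} is obtained here as 1 − φ¹ ∏_{m=1}^{N−1} γᵐ, which assumes the standard expression of φʲ as φ¹ times a partial sum of the products in equation (2).

```python
import math

def payoffs(N, a, b, c, d, l):
    pi_A = (a * (l - 1) + b * (N - l)) / (N - 1)
    pi_B = (c * l + d * (N - l - 1)) / (N - 1)
    return pi_A, pi_B

def transition_ratio(N, beta, a, b, c, d, l):
    pi_A, pi_B = payoffs(N, a, b, c, d, l)
    return math.exp(beta * (pi_B - pi_A))

def transition_up(N, beta, a, b, c, d, l):
    pi_A, pi_B = payoffs(N, a, b, c, d, l)
    f_A, f_B = math.exp(beta * pi_A), math.exp(beta * pi_B)
    return l * f_A / (l * f_A + (N - l) * f_B) * (N - l) / N

def prefix_products(N, beta, a, b, c, d):
    """prefix[k] = prod_{m=1}^{k} gamma^m for k = 0, ..., N - 1 (prefix[0] = 1)."""
    prefix = [1.0]
    for k in range(1, N):
        prefix.append(prefix[-1] * transition_ratio(N, beta, a, b, c, d, k))
    return prefix

def conditional_fixation_time(N, beta, a, b, c, d):
    """Equations (11) and (12) with memoised products: linear in N."""
    prefix = prefix_products(N, beta, a, b, c, d)
    phi_1 = 1.0 / sum(prefix)                  # equation (2)
    psi = 1.0 - phi_1 * prefix[N - 1]          # psi^1 = phi^{N-1}
    R, total = 1.0, 0.0                        # R^1 = 1
    for l in range(1, N):
        total += psi * R / transition_up(N, beta, a, b, c, d, N - l)
        R = 1.0 + transition_ratio(N, beta, a, b, c, d, N - l) * R
        psi -= phi_1 * prefix[N - l - 1]       # psi^{l+1}
    return total

def conditional_fixation_time_naive(N, beta, a, b, c, d):
    """Equation (10) evaluated term by term, for checking only."""
    prefix = prefix_products(N, beta, a, b, c, d)
    norm = sum(prefix)
    phi = [sum(prefix[:l]) / norm for l in range(N + 1)]   # phi[l] = phi_A^l
    total = 0.0
    for k in range(1, N):
        for l in range(1, k + 1):
            product = 1.0
            for m in range(l + 1, k + 1):
                product *= transition_ratio(N, beta, a, b, c, d, m)
            total += phi[l] / transition_up(N, beta, a, b, c, d, l) * product
    return total
```

For β = 0 equation (10) reduces to N(N − 1), which gives a simple check of either function. The running subtraction used for ψ can lose precision under strong selection, so this sketch is meant for moderate β only.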

An alternative approach to calculate fixation times can be implemented via the sojourn times^15,51,52,53. The average sojourn time in a transient state i ∈ {1, …, N − 1} gives the average number of time steps the process spends in that state before absorption. Summing up the sojourn times of all transient states gives the average fixation time. While this calculation can lead to additional insight⁵¹, that approach does not lead to a further reduction in computation time.

Stationary distribution

So far, we have considered the fixation of either of the two types. The process will eventually hit one of the absorbing boundaries with probability 1 in the absence of mutations. In the presence of mutations, however, the types can no longer fixate in the population. Instead of the fixation probability and time, we then study the stationary probability distribution. The stationary probability distribution gives the fraction of time the process spends in each state in the long run³⁹. Mutations are implemented as in⁵⁴ with mutation rate μ. The transition probabilities including mutation are⁵⁴

$${T}^{k+}=\frac{k\,{f}_{A}}{k\,{f}_{A}+(N-k){f}_{B}}\frac{(N-k)}{N}(1-\mu )+\frac{(N-k){f}_{B}}{k\,{f}_{A}+(N-k){f}_{B}}\frac{(N-k)}{N}\mu $$

(13)

$${T}^{k-}=\frac{(N-k){f}_{B}}{k\,{f}_{A}+(N-k){f}_{B}}\frac{k}{N}(1-\mu )+\frac{k\,{f}_{A}}{k\,{f}_{A}+(N-k){f}_{B}}\frac{k}{N}\mu $$

(14)

where the first part in T^k+ (T^k−) corresponds to choosing a mutant (wild-type) for birth, a wild-type (mutant) for death and no mutation happening. The second part describes the probability of choosing the same types for birth and death, but a mutation happening.

For general mutation rate μ and selection strength β, the stationary probability distribution p^k can be calculated from detailed balance⁵⁵. It is given by^54,56,57

$${p}^{k}={p}^{k-1}\frac{{T}^{(k-1)+}}{{T}^{k-}}={p}^{0}\prod _{i=0}^{k-1}\,\frac{{T}^{i+}}{{T}^{(i+1)-}},$$

(15)

where p⁰ can be obtained from normalisation, ${\sum }_{k=0}^{N}\,{p}^{k}=1$.

The pseudo-code for computing the stationary distribution is given by Algorithm 5, which uses the functions TRANSITION-UP(N, β, a, b, c, d, k) and TRANSITION-DOWN(N, β, a, b, c, d, k), implementing the probability that the number of A-strategists increases or decreases by one, respectively. These are given by equations (13) and (14).

The computation of the stationary distribution with this algorithm scales in ${\mathscr{O}}(N)$.
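A minimal Python sketch of this computation, based on equations (13) to (15), could look as follows. Passing the mutation rate `mu` as an explicit argument and the function names are assumptions made for illustration; payoffs again exclude self-interaction.

```python
import math

def fitnesses(N, beta, a, b, c, d, k):
    """Fitness of A and B with k A-players (exponential mapping)."""
    pi_A = (a * (k - 1) + b * (N - k)) / (N - 1)
    pi_B = (c * k + d * (N - k - 1)) / (N - 1)
    return math.exp(beta * pi_A), math.exp(beta * pi_B)

def transition_up(N, beta, mu, a, b, c, d, k):
    """Equation (13): probability that k increases by one."""
    f_A, f_B = fitnesses(N, beta, a, b, c, d, k)
    total = k * f_A + (N - k) * f_B
    birth_A, birth_B = k * f_A / total, (N - k) * f_B / total
    return (birth_A * (1 - mu) + birth_B * mu) * (N - k) / N

def transition_down(N, beta, mu, a, b, c, d, k):
    """Equation (14): probability that k decreases by one."""
    f_A, f_B = fitnesses(N, beta, a, b, c, d, k)
    total = k * f_A + (N - k) * f_B
    birth_A, birth_B = k * f_A / total, (N - k) * f_B / total
    return (birth_B * (1 - mu) + birth_A * mu) * k / N

def stationary_distribution(N, beta, mu, a, b, c, d):
    """Equation (15): unnormalised weights by recursion, then normalisation."""
    weights = [1.0]                               # p^0 up to a constant
    for k in range(1, N + 1):
        up = transition_up(N, beta, mu, a, b, c, d, k - 1)
        down = transition_down(N, beta, mu, a, b, c, d, k)
        weights.append(weights[-1] * up / down)   # detailed balance
    norm = sum(weights)
    return [w / norm for w in weights]
```

The recursion requires μ > 0, because otherwise the transition probability out of the homogeneous states vanishes and the ratio in equation (15) is no longer defined for all k.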

Limitations and scalability

The fixation probability in equation (2) is valid for any one-dimensional birth-death process. Thus, in particular it also applies to imitation processes^23,24 as well as to general multiplayer games^45,58, where the payoff depends on the state of the population in a polynomial way. The Moran process on a cycle-graph also reduces to a one-dimensional birth-death process and thus falls into that category^53,59. It is important to see that these applications do not change the order of the time-complexity, and only affect the computational time by constant factors that do not depend on population size.

This method does not work for the Wright-Fisher process, because the Wright-Fisher process is not a birth-death process. In one time step the number of A individuals can change by more than one. However, the diffusion approximation provides a very powerful way to approximate the fixation probability accurately^15,17,60. This typically involves two nested integrals, which implies the same computational complexity as our nested double sums, assuming that the discretisation of the integrals uses 1/N as a step size.

Calculating equation (2) can lead to computational inaccuracies in some specific cases. Summing up numbers in floating point representation will carry truncation errors that are no longer negligible if summing up many numbers. Thus, if the population size is large, this issue needs to be addressed. A number of algorithms can be used to alleviate the problem. A discussion of those can be found in^61,62. Issues may also arise when γⁱ values are either too small or too large (leading to numerical underflow/overflow). These often appear when computing fixation probabilities for strong selection. For example, for payoff matrix $(\begin{array}{cc}1 & 2\\ 3 & 4\end{array})$, with population size N = 20, γ¹⁰(β) = exp[β(π_B − π_A)] ≈ 10⁸⁴ for strong selection, β = 100.
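One common way to sidestep overflow in equation (2), sketched here in Python, is to accumulate the logarithms of the products and to combine them with the log-sum-exp trick. This technique is a generic numerical device and is not claimed to be the approach of the cited references; the function names are illustrative.

```python
import math

def log_transition_ratio(N, beta, a, b, c, d, i):
    """log(gamma^i) = beta * (pi_B - pi_A), without calling exp."""
    pi_A = (a * (i - 1) + b * (N - i)) / (N - 1)
    pi_B = (c * i + d * (N - i - 1)) / (N - 1)
    return beta * (pi_B - pi_A)

def log_fixation_probability(N, beta, a, b, c, d):
    """Return log(phi_A^1) from equation (2), computed in log space."""
    log_terms = [0.0]              # k = 0: log of the empty product
    cumulative = 0.0
    for k in range(1, N):
        cumulative += log_transition_ratio(N, beta, a, b, c, d, k)
        log_terms.append(cumulative)
    # log-sum-exp: factor out the largest term so every exp() argument is <= 0
    largest = max(log_terms)
    log_sum = largest + math.log(sum(math.exp(t - largest) for t in log_terms))
    return -log_sum
```

For the example above (payoff entries 1, 2, 3, 4, N = 20, β = 100) the direct product of the γⁱ exceeds the range of double-precision floats, while the logarithm of the fixation probability remains a finite number that can be reported or compared.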

In summary, a naive implementation of the direct calculation will lead to quadratic complexity in fixation probability. This directly affects computations of fixation time that rely on the fixation probability. But all quantities of interest here can be computed in linear time with the appropriate implementation, cf. Fig. 2.

Numerical matrix-based approach

In this section, we will again make use of recursions to estimate the same three important quantities in evolutionary theory: fixation probability, fixation times and stationary distribution. For all the three quantities, the main idea is the same, i.e., to make use of the Kolmogorov backward equation to get a linear difference equation³⁹. Yet for different quantities, the recursive equation comes with different boundary conditions. Consequently, a standard method based on matrix algebra would facilitate obtaining the quantities analytically.