Stochastic Evolution Dynamic of the Rock–Scissors–Paper Game Based on a Quasi Birth and Death Process

Yu, Qian; Fang, Debin; Zhang, Xiaoling; Jin, Chen; Ren, Qiyu

doi:10.1038/srep28585

Download PDF

Article
Open access
Published: 27 June 2016

Stochastic Evolution Dynamic of the Rock–Scissors–Paper Game Based on a Quasi Birth and Death Process

Qian Yu¹,
Debin Fang²,
Xiaoling Zhang³,
Chen Jin⁴ &
…
Qiyu Ren²

Scientific Reports volume 6, Article number: 28585 (2016) Cite this article

3757 Accesses
11 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Stochasticity plays an important role in the evolutionary dynamic of cyclic dominance within a finite population. To investigate the stochastic evolution process of the behaviour of bounded rational individuals, we model the Rock-Scissors-Paper (RSP) game as a finite, state dependent Quasi Birth and Death (QBD) process. We assume that bounded rational players can adjust their strategies by imitating the successful strategy according to the payoffs of the last round of the game and then analyse the limiting distribution of the QBD process for the game stochastic evolutionary dynamic. The numerical experiments results are exhibited as pseudo colour ternary heat maps. Comparisons of these diagrams shows that the convergence property of long run equilibrium of the RSP game in populations depends on population size and the parameter of the payoff matrix and noise factor. The long run equilibrium is asymptotically stable, neutrally stable and unstable respectively according to the normalised parameters in the payoff matrix. Moreover, the results show that the distribution probability becomes more concentrated with a larger population size. This indicates that increasing the population size also increases the convergence speed of the stochastic evolution process while simultaneously reducing the influence of the noise factor.

Non-Hermitian topology in rock–paper–scissors games

Article Open access 12 January 2022

Computation and Simulation of Evolutionary Game Dynamics in Finite Populations

Article Open access 06 May 2019

The effect of territorial awareness in a three-species cyclic predator–prey model

Article Open access 02 February 2022

Introduction

Cyclic competition is a very common phenomenon in nature and society, with the simplest and most studied model being the Rock-Scissors-Paper (RSP) game¹. This can be characterised by three strategies: R (rock), S (scissors) and P (paper), where R excludes S, S excludes P and P excludes R. Because of their widespread existence, RSP games have attracted the attention of many researchers in related fields – being widely applied in biological systems, such as biological diversity^2,3 and species interaction⁴.

Schreiber and Killingback⁵, for example, have studied the effect of a general metacommunity structure on the coexistence of the strategies in the RSP game to show that dispersal in spatially heterogeneous environments can alter dynamical outcomes; Jeppe⁶ quantitatively describes how spatial clustering slows down the dynamics and stabilizes the spatial system of the game; and Kerr and Riley⁷ and Kirkup and Riley⁸ show that three strands of E. colibacteria compete in a RSP fashion to reach an evolutionary stable distribution. The RSP game can also be applied to economic and social systems as well as biological systems. In the Edgeworth price cycle, for example, when not all consumers are fully informed with pricing information, oligopolistic pricing can be regarded as a RSP game⁹. Also, the RSP cycle arises in public goods games^10,11, where the populations are separated into three groups: co-operators, defectors and loners.

Various evolution properties of population behaviour for the RSP game have been analysed based on evolutionary dynamics such as the replicator dynamic and the Smith dynamic, the BNN dynamic and the BR dynamic, etc. after evolutionary game theory was developed. Around the 1990 s, Hofbauer¹² studied the evolutionary stability of the Nash equilibrium under the replicate dynamic and proved that the Nash equilibrium of the RSP game is just the fixed point of the replicate dynamic. Meanwhile, Weibull’s¹³ analysis showed that any evolutionary stable profile of the RSP game is not only a fixed point under the replicator dynamic but is also asymptotically stable. Furthermore, Gaunersdorfer and Hofbauer¹⁴ found that the limiting behaviour of fictitious play and the time-averaged replicator dynamics coincide in the RSP game, while Loertscher¹⁵ proved it has an evolutionary stable strategy (ESS) when the discount factor of the game is less than 1.

However, deterministic models can no longer be relied upon for finite populations. In this case, the random fluctuations due to sampling effects, for instance, have to be taken into account and stochastic processes, such as the Moran process and the Wright-Fisher process, must be employed to analyse the equilibrium of RSP^16,17 instead of ordinary differential equations.

Studies of the impact of stochasticity on the RSP game indicate various important factors influencing the cyclic behaviour of populations, such as noise, alliance-specific heterogeneous invasion rate, mutations, group interaction and protection spillovers except evolutionary dynamics. Perc and Szolnoki^18,19, for example, have studied noise-guided evolution in a six-species predator-prey model within cyclical interaction and cyclical interaction with alliance-specific heterogeneous invasion rates; Mobilia²⁰ studied the oscillatory dynamics in generic three-species RSP games with mutations; and Szolnoki^21,22 considers group interaction in RSP games and studied the vortices to determine the dynamics of biodiversity in cyclical interactions with protection spillovers. Szabó and Fáth²³ discuss the evolution dynamic of the RSP game in a lattice grid and depict heat maps of the evolutionary dynamic based on the Monte-Carlo simulation method. The limit cycles have also been found for the RSP game by Ochea²⁴ under logit dynamics.

Although these studies have been very successful and well verified in the evolutionary biology field, the RSP game played by the individuals with bounded rationality in a social and economic system still needs to be investigated, because the learning dynamic in such situations may differ from that based on genetics. Motivated by this question, research attention turned to focus on the evolutionary RSP game under learning dynamics, such as imitation²⁵ and best-response²⁶. Sato et al.²⁷, for instance, investigated the RSP game based on the reinforcement learning dynamic, finding the learning trajectory can be simple or complex depending on initial conditions and that the learning process displays Hamiltonian chaos for the zero sum game; while Platkowski and Zakrzewski²⁸ found an asymptotically stable polymorphism for the standard RSP game and the limit cycles for the general RSP game under complex personality profiles of the players and the imitation dynamics. In 2014, Szolnoki²¹ reviewed the main results in well-mixed populations with the focus on oscillatory dynamics and firstly on the important role of the topology of the interaction networks²⁹. Hoffman et al.³⁰ experimentally investigated three RSP games and found population distributions further from the centre are more frequent when the NE section is not stable, which is consistent with the predictions of non-deterministic evolutionary dynamics models.

To analyse an evolutionary model with a finite number of players and with noise or mutations, Kandori³¹ firstly considered stochastic characteristics in the learning dynamics and introduced the Markov process model to describe the learning dynamic in evolutionary games. Amir and Berninghaus³² extended this to a continuous time model³³ and modelled the evolutionary process of the evolutionary normal-form game with a finite number of players as a homogeneous Markov process with finite state space. Tadj and Touzene³⁴ then proposed the state dependent quasi-birth-and-death process model to describe the learning dynamic for more complicated stochastic evolution. Compared with general 3^*3 game described in Tadj and Touzene³⁴ that emphasises computational efficiency, this paper is more focused on the impact of stochasticity on the evolutionary dynamic of the RSP game, which considers population size, the payoff matrix and noise factor. In addition, according to the unique characteristics of the RSP game, we consider that the states above the diagonal can be transferred to each other.

To study the stochastic evolutionary dynamic of the RSP game in finite populations and analyse the effect factors of the stochastic evolutionary dynamic, we assume that the bounded rational individuals in a finite population can adjust their strategies by imitating the successful strategy according to payoffs from the last round of the game and model the stochastic evolutionary process involved as a finite, state dependent, quasi birth and death process. The long run equilibrium of the RSP game played by bounded individuals in a finite population can be explained by the limiting distribution of the QBD process of the game evolutionary dynamic. With the limiting distribution, the probability of a stable state of the RSP game evolution dynamic can be more profoundly predicted and interpreted. Furthermore, comparisons of the numerical experimental results plotted as pseudo colour ternary heat maps shows the effect of the population size and parameters in the RSP payoff matrix and noise factor.

Results

Nash equilibrium of the general RSP game

A symmetrical Rock-Scissors-Paper game can be characterised by three strategies R, S and P. Each strategy dominates the next one in a cyclical way.

The standard RSP game can be depicted by the constant-sum payoff matrix

It is easily verified that this game has no Nash equilibrium in pure strategies and precisely one Nash equilibrium in mixed strategies, namely the strategy pair in which both players randomise uniformly, i.e., .

Furthermore, the payoff matrix of a general Rock-Scissors-Paper game can be normalised as:

It is easy to see that there always exists a unique Nash equilibrium that is

where Σ is a normalising constant.

Under the replicator dynamic, the unique Nash equilibrium p, is asymptotically stable when , the Nash equilibrium is neutrally stable when and the Nash equilibrium is unstable when ³⁵.

With a special transformation, we can obtain a simple payoff matrix for the general RSP game G′ as:

Then, the Nash equilibrium becomes . Coding with Matlab 7, the phase portraits of the RSP game are shown in Fig. 1.

Numerical experiments for the symmetrical RSP game

We can model the stochastic evolutionary dynamic of the RSP game as a QBD process and carry out numerical simulations to verify the dynamic stability of long run equilibrium for the RSP game in populations. The limiting distributions of the QBD process can be solved for the RSP game stochastic evolutionary dynamic (see Methods section below).

For example, by setting N = 20, δ = 5 and ε = 0.01, the limiting distributions of the QBD process of the general RSP game G″ can be obtained. Then the profiles, whose appearance probability is larger than 0.01, are listed in Table 1. For instance, there are 7, 6 and 7 individuals in the population in row 1 who chose strategy R, S and P respectively and 0.1461 is the appearance probability of this state in the QBD process. As shown in Fig. 2, the main probability in the limiting distribution of the QBD process is concentrated in the states around (7,6,7), (7,7,6) and (6,7,7). The sum of these probabilities is 0.8876, indicating that (1/3, 1/3, 1/3) can be regarded as the long run equilibrium of the stochastic evolutionary RSP game.

Table 1 Main probabilities in the limiting distribution.

Full size table

To analyse the results more intuitively and compare the effect of population size and the payoff on the QBD process, we carried out numerical experiments with sizes 10, 20 and 100 and set the payoff parameter δ as −0.5, 0 and 1 respectively. The noise factor was set at 0.01. The limiting distribution of QBD was then obtained and plotted as pseudo colour ternary heat maps (Fig. 3), where the vertical distances of each point to the three edges in the simplex represents the probability of choosing either the R, S or P strategy respectively. Therefore, the top, bottom left and bottom right vertices of the simplex correspond to the three pure strategies (1, 0, 0), (0, 1, 0) and (0, 0, 1) respectively and other interior points correspond to other mixed strategies. Figure 3 shows the results, where the hue value of each point, ranging from red to blue, indicates the probability of the corresponding strategy in the limiting distribution of the QBD process.

Comparison of these results indicates that:

1
Observing Fig. (3a,3d,3g), we can see when δ = 1 or δ > 0, the heat maps appear as a target, the red points representing the states with higher distribution probability gathered around the centre of this simplex regardless of population size. Most of the remaining points appear as blue, which means the distribution probabilities of the states corresponding to those points approach zero. These features confirm that the mixed strategy profile (1/3, 1/3, 1/3) is a long run Nash equilibrium of the RSP game and also an asymptotically stable equilibrium when δ > 0.
2
Observing Fig. (3b,3e,3h), the colour images appear as a nebula with three spiral arms, with the colour of the unscrewing arms from the centre to the edges of the simplex gradually turning red. The impression is that the interior points in the simplex are simultaneously attracted by forces from the centre point and centrifugal force from the edges of simplex. The change of colour shows that the distribution probabilities of states corresponding to the points are increasing along the spiral arms of the nebula. This corroborates that, when δ = 0, the Nash equilibrium (1/3, 1/3, 1/3) is merely neutrally stable and the states corresponding to the points on the central sections of the edges occur with higher distribution probability.
3
Observing Fig. (3c,f,i) when δ = −0.5 or δ < 0, the red points representing those states with higher distribution probability are gathered around the area near the centre sections of the edges in the simplex. The rest of the parts in the simplex are almost blue, which means the states corresponding to these points have distribution probabilities approaching zero. This indicates an unstable evolutionary Nash equilibrium with condition δ < 0. The long run equilibrium will be a mixed-strategy on the boundary of simplex, such as (2/5, 3/5, 0), (0, 2/5, 3/5) and (3/5, 0, 2/5) in the simulation experiment.
4
Vertically comparing diagrams from (3a) to (3g), (3b) to (3h) and (3c) to (3i), indicates larger populations have more concentrated distribution probabilities. This indicates, along with increasing population size, the convergence speed of stochastic evolution process will be faster and the influence of the perturbations will be weaker.

Numerical experiments for the asymmetrical RSP game

To avoid artificial symmetries structure, we also investigated the asymmetrical RSP game as³⁶:

Under the replicator dynamic, G′″ has a saddle node fixed point at the vertices of the simplex and an interior fixed point at (1/2, 1/3, 1/6), independent of γ. However, the stability of the fixed point will be affected by γ. For γ > 1, (1/2, 1/3, 1/6) is a stable focus and an unstable focus for γ < 1. The figure will exhibit closed orbits when γ = 1.

Based on the QBD model of RSP game, we set the population to 20 and 50 and the parameter γ in the payoff matrix as 0.1, 0.3, 0.5, 0.8, 1 and 2 to inspect the effect of γ. The perturbation factor was set to ε = 0.01 to represent noise. The resulting pseudo colour ternary heat maps are shown in Fig. 4.

From these, we find that:

(1) Observing (4a–f), when γ < 1, the red points representing the states with higher distribution probability are gathered at the left bottom edge in the simplex at the beginning and the vortex appears gradually and screwed to the point near (1/2,1/3,1/6) with increasing γ. The rest of the parts in the simplex are almost blue, indicating the states corresponding to the points have distribution probabilities approaching zero. Therefore, the long run equilibrium will be a mixed-strategy equilibrium on the boundary near the left bottom corner in the simplex when γ is small and the Nash equilibrium will become the long run equilibrium eventually when γ ≥ 1. The Nash equilibrium at (1/2,1/3,1/6) with the condition γ < 1 is an unstable evolutionary equilibrium, will become neutrally stable when γ = 1 and will be an asymptotically stable equilibrium finally when γ > 1.

(2) Vertically comparing from (4d) to (4i), increasing the population size strengthens the vortex. This indicates that, as with the symmetric situation in Fig. 3, as the population size increases, the convergence speed of stochastic evolutionary process is faster and the influence of noise is weaker.

Discussion

In this paper, we established a QBD process for the stochastic evolutionary dynamic for the RSP game. For the general RSP game G″, the pseudo colour ternary heat maps of numerical experiments show that the limiting distribution will converge to the centre of the simplex when δ > 0, diverge as spiral nebula when δ = 0 and diverge to the boundary of the simplex when δ < 0. For the asymmetrical RSP game G′″, the limiting distribution will converge to the Nash equilibrium (1/2, 1/3, 1/6) gradually when γ > 1, become neutrally stable when γ = 1 and diverge to the boundary of the simplex when γ < 1. The numerical experimental results are consistent with the conclusions proposed in the literature^12,13 based on replicator dynamics.

Moreover, the study provides a deeper understanding of the RSP game in three aspects. Firstly, the stochastic evolutionary dynamic of the RSP game here was built based on the assumption of bounded rationality. This assumption makes the model more suitable for describing an individual’s behaviour in social environments and gives the study more practical significance. In fact, as Zhijian Wang³⁷ recently revealed in a laboratory experiment of the discrete-time iterated RSP game, a microscopic model of a win-lose-tie conditional response can offer higher payoffs to individual players in comparison with the NE mixed strategy, suggesting that high social efficiency is achievable through an optimised conditional response. Based on the assumption of bounded rationality, the numerical experiment results in Fig. 3 are very similar to the laboratory experimental evidence observed by Moshe³⁰.

Secondly, the probability interpretation for the long run equilibrium of the RSP game can be provided by the limiting distribution of the QBD process. The limiting probabilities of the states in the QBD process can be used to predict how likely the equilibrium of the RSP game will be.

Finally, the noise factor introduced in our model can be used for detecting the stochastic stability of the long run equilibrium in evolutionary dynamics, which could not be done by previous research with replicator dynamics.

In summary, our study provides a new means of interpretation for the long-term equilibrium of the RSP game played by bounded rational individuals, based on learning dynamics. Further research is needed to generalise these results. For example, RSP games that take place between players from heterogeneous populations can be taken into account. The learning dynamic contained in the QBD process of our model can also be further generalised.

Methods

Without loss of generality, we supposed that the general RSP games are repeatedly played between individuals in a finite population with a size of N and payoff matrix G′.

First, denote the three strategies R, S and P in the RSP game as 1, 2 and 3. Then we can define a stochastic process as: the state at time, t, is a two-dimensional stochastic variable , where are the number of players in the population who choose strategies R and S, while the number of players in the population who choose strategy P is . Hence, the profile and the average payoffs of players are decided by and at time t.

At time t, if there are individuals choosing strategy R, individuals choosing strategy S and individuals choosing strategy P, then any individual in the population adopting strategy i will obtain an average payoff , where :

if the individual chooses strategy R,

if the individual chooses strategy S,

and if the individual chooses strategy P,

The next time, t + 1, the individuals in the population will switch from one strategy to another at a rate depending on the average payoff of each strategy at time t.

According to the three hypotheses of bounded rationality proposed by Kandori³¹, the individuals in the population are inertia, myopia and mutation. That means that not all players react instantaneously to their environment; the players are not taking into account the long run implication of their strategy choices; and the players will change their strategies at random with a small probability.

According to the profile and the average payoffs at time t we can suppose only one individual can change his strategy within a short time. That is, at time t + 1, can only shift to an adjacent state. In this case, the strategy distribution of the population satisfies the inertia hypothesis. The myopia hypothesis will also be satisfied because only the profile and the average payoffs at time t will be considered by the individuals when they make the choice at time t + 1. Furthermore, the mutation feature is introduced into the model by a perturbation factor ε ≥ 0, which allows the players to deviate from the selected strategy with a small probability.

Assume and , then the individuals in the population will switch from strategy 1 to strategy at the rate , where and , where is defined for an arbitrary real function f.

This means the individuals choosing strategy 1 will switch to strategy k at rate , if and the transition rate will be 0, if the number of individuals adopting the same strategy in the population reach N, the size of the population.

The stochastic evolutionary process of the strategy distribution of the RSP game then becomes a two-dimension Markov stochastic process that can be described as a QBD process.

Denote the state space of as , where denotes the number of individuals choosing strategy i in the population (i.e. strategy distribution at time t). Obviously the number of states is . Given , k < 1 and , the evolutionary process of strategy distribution is a finite, state dependent QBD process, better known as a generalised QBD process (GQBD).

The switches between states in this GQBD can be as depicted in Fig. 5.

The Q-matrix of the GQBD is then:

Using a numerical algorithm based on the block Gauss-Seidel iteration method, proposed by Stewart³⁸, the steady-state probability of the GQBD can be obtained if the GQBD is recurrent. The solution is the steady distribution of the GQBD in the long run, which is a limiting distribution of strategies of the RSP game. In fact, it can be considered as the long run equilibrium of the stochastic evolution RSP game in the population.

Additional Information

How to cite this article: Yu, Q. et al. Stochastic Evolution Dynamic of the Rock–Scissors–Paper Game Based on a Quasi Birth and Death Process. Sci. Rep. 6, 28585; doi: 10.1038/srep28585 (2016).

References

McCannon, B. C. Rock paper scissors. Journal of Economics 92(1), 67–68 (2007).
Article Google Scholar
Frean, M. & Abraham, E. R. Rock-scissors-paper and the survival of the weakest. Proceedings of the Royal Society B Biological Sciences 268, 1323–1327 (2001).
Article CAS Google Scholar
Reichenbach, T., Mobilia, M. & Frey, E. Mobility promotes and jeopardizes biodiversity in rock-paper-scissors games. Nature 448(7157), 1046–1049 (2007).
Article CAS ADS Google Scholar
Cheng, H. et al. Mesoscopic interactions and species coexistence in evolutionary game dynamics of cyclic competitions. Scientific Reports 4, 7486 (2014).
Article CAS Google Scholar
Schreiber, S. J. & Killingback, T. P. Spatial heterogeneity promotes coexistence of rock–paper–scissors metacommunities. Theoretical Population Biology 86, 1–11 (2013).
Article Google Scholar
Jeppe, J., Kim, S. & Joachim, M. Labyrinthine clustering in a spatial rock-paper-scissors ecosystem. Physical Review E Statistical Nonlinear & Soft Matter Physics 87, 187 (2013).
Google Scholar
Benjamin, K., Riley, M. A., Feldman, M. W. & Bohannan, B. J. M. Local dispersal promotes biodiversity in a real-life game of rock-paper-scissors. Nature 418, 171 (2002).
Article ADS Google Scholar
Kirkup, B. C. & Riley, M. A. Antibiotic-mediated antagonism leads to a bacterial game of rock-paper-scissors in vivo. Nature 428, 412 (2004).
Article CAS ADS Google Scholar
Maskin, E. & Tirole, J. A theory of dynamic oligopoly, II: price competition, kinked demand curves and Edgeworth cycles. Econometrica 56, 571 (1988).
Article MathSciNet Google Scholar
Hauert, C., De Monte, S., Hofbauer, J. & Sigmund, K. Volunteering as Red Queen mechanism for cooperation in public goods games. Science 296, 1129 (2002).
Article CAS ADS Google Scholar
Szolnoki, A. et al. Cyclic dominance in evolutionary games: a review. Journal of the Royal Society Interface 11, 20140735 (2014).
Article Google Scholar
Hauer, J. & Sigmund K., Evolutionary games and population dynamics. Vol. 2 Ch. 7, 67–85 (Cambridge University Press, 1998).
Weibull, J. W. Evolutionary game theory. Southern Economic Journal 22, 43 (1995).
MathSciNet MATH Google Scholar
Gaunersdorfer, A. & Hofbauer, J. Fictitious play, Shapley polygons and the replicator equation. Games & Economic Behavior 11, 279 (1995).
Article MathSciNet Google Scholar
Loertscher, S. Rock–Scissors–Paper and evolutionarily stable strategies. Economics Letters 118, 473 (2013).
Article MathSciNet Google Scholar
Sigmund, K., Calculus of Selfishness Ch. 2, 25–45 (Princeton University. Press, 2010).
Taylor, C., Fudenberg, D., Sasaki, A. & Nowak, M. A. Evolutionary game dynamics in finite populations. Bulletin of Mathematical Biology 66, 1621 (2004).
Article MathSciNet Google Scholar
Perc, M. & Szolnoki, A. Noise-guided evolution within cyclical interactions. New Journal of Physics 9, 126 (2007).
Article Google Scholar
Perc, M., Szolnoki, A. & Gyrgy, S. Cyclical interactions with alliance-specific heterogeneous invasion rates. Physical Review E Statistical Nonlinear & Soft Matter Physics 75, 490 (2007).
Google Scholar
Mobilia, M. Oscillatory dynamics in rock-paper-scissors games with mutations. Journal of Theoretical Biology 264, 1 (2010).
Article MathSciNet Google Scholar
Szolnoki, A., Vukov, J. & Perc, M. From pairwise to group interactions in games of cyclic dominance. Physical Review E Statistical Nonlinear & Soft Matter Physics 89, 062125 (2014).
Article Google Scholar
Szolnoki, A. & Perc, M. Vortices determine the dynamics of biodiversity in cyclical interactions with protection spillovers. New Journal of Physics 17, 113033 (2015).
Article ADS Google Scholar
Szabó, G. & Fáth, G. Evolutionary games on graphs. Physics Reports 446, 97 (2007).
Article ADS MathSciNet Google Scholar
Ochea, M. I. Limit cycles and multiple attractors in logit dynamics. Econometrics 58, 4 (2008).
Google Scholar
Duersch P., Oechssler, J. & Schipper, B. C. Unbeatable imitation. Games & Economic Behavior 76, 88 (2012).
Article MathSciNet Google Scholar
Banerjee, B. & Peng J. Strategic best-response learning in multiagent systems. Journal of Experimental & Theoretical Artificial Intelligence 24, 139 (2012).
Article Google Scholar
Yuzuru, S., Eizo, A. & Doyne, F. J. Chaos in learning a simple two-person game. Proceedings of the National Academy of Sciences of the United States of America 99, 4748 (2002).
Article MathSciNet Google Scholar
Platkowski, T. & Zakrzewski, J. Asymptotically stable equilibrium and limit cycles in the Rock–Paper–Scissors game in a population of players with complex personalities. Physica A Statistical Mechanics & Its Applications 390, 4219 (2011).
Article ADS MathSciNet Google Scholar
Szolnoki, A. & György, S. Phase transitions for rock-scissors-paper game on different networks. Physical Physical Review E. 70, 037102 (2004).
Article ADS Google Scholar
Hoffman, M., Suetens, S., Gneezy, U. & Nowak, M. A. An experimental investigation of evolutionary dynamics in the Rock-Paper-Scissors game. Scientific Reports 5 (2015).
Kandori, M. & Rob, R. Learning, mutation and long run equilibria in games. Econometrica 61, 29 (1993).
Article MathSciNet Google Scholar
Amir, M. & Berninghaus, S. K. Another approach to mutation and learning in games. Games & Economic Behavior 14, 19 (1996).
Article MathSciNet Google Scholar
Cason, T. N., Friedman, D. & Hopkins, E. D. Cycles and instability in a Rock-Paper-Scissors population game: a continuous time experiment. Review of Economic Studies 81, 112 (2012).
Article MathSciNet Google Scholar
Tadj, L. & Touzene, A. A QBD approach to evolutionary game theory. Applied Mathematical Modelling 27, 913 (2003).
Article Google Scholar
Zeeman, E. C. Population dynamics from game theory. Vol. 819 Ch. 4, Global Theory of Dynamical System 471–497 (Springer Berlin Heidelberg, 1980).
Chapter Google Scholar
Traulsen, A., Claussen, J. C. & Hauert, C. Stochastic differential equations for evolutionary dynamics with demographic noise and mutations. Physical Review E Statistical Nonlinear & Soft Matter Physics 85, 154 (2012).
Article Google Scholar
Wang, Z., Xu, B. & Zhou, H. J. Social cycling and conditional responses in the Rock-Paper-Scissors game. Scientific Reports 4, 5830 (2014).
Article CAS Google Scholar
Stewart, W. J. Introduction to the Numerical Solution of Markov Chains. Vol. 3, Iterative Methods 121–176 (Princeton University Press, 2015).

Download references

Acknowledgements

This research was funded in part by the National Natural Science Foundation of China (No. 71373198, 71231007, 71103135) and the Fundamental Research Funds for the Central Universities of China (WUT: 2015VI004).

Author information

Authors and Affiliations

School of Economics, Wuhan University of Technology, Wuhan, 430070, China
Qian Yu
School of Economics and Management, Wuhan University, Wuhan, 430072, China
Debin Fang & Qiyu Ren
Department of Public Policy, City University of Hong Kong, Hong Kong, China
Xiaoling Zhang
Department of Economics, The George Washington University, Washington, 20052, DC, United States of America
Chen Jin

Authors

Qian Yu
View author publications
You can also search for this author in PubMed Google Scholar
Debin Fang
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoling Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Chen Jin
View author publications
You can also search for this author in PubMed Google Scholar
Qiyu Ren
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Q.Y. and D.F. designed the model, simulation experiment and the main manuscript text. C.J. and Q.R. prepared the computation and figures. X.Z. wrote the part manuscript text. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Yu, Q., Fang, D., Zhang, X. et al. Stochastic Evolution Dynamic of the Rock–Scissors–Paper Game Based on a Quasi Birth and Death Process. Sci Rep 6, 28585 (2016). https://doi.org/10.1038/srep28585

Download citation

Received: 11 January 2016
Accepted: 07 June 2016
Published: 27 June 2016
DOI: https://doi.org/10.1038/srep28585

This article is cited by

Noise-Induced Quasi-Heteroclinic Cycle in a Rock–Paper–Scissors Game with Random Payoffs
- Tian-Jiao Feng
- Jie Mei
- Xiu-Deng Zheng
Dynamic Games and Applications (2022)
Multi-AI competing and winning against humans in iterated Rock-Paper-Scissors game
- Lei Wang
- Wenbin Huang
- Sailing He
Scientific Reports (2020)
Research on Evolutionary Model for Trust of Nodes Based on the Fuzzy Correlation Measures
- Lei Zhu
- Lei Wang
- Changhua Yao
Wireless Personal Communications (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.