A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game

Nowak, Martin; Sigmund, Karl

doi:10.1038/364056a0

Letter
Published: 01 July 1993

Martin Nowak & Karl Sigmund

Nature volume 364, pages 56–58 (1993)

Abstract

The Prisoner's Dilemma is the leading metaphor for the evolution of cooperative behaviour in populations of selfish agents, especially since the well-known computer tournaments of Axelrod¹ and their application to biological communities²,³. In Axelrod's simulations, the simple strategy tit-for-tat did outstandingly well and subsequently became the major paradigm for reciprocal altruism⁴⁻¹². Here we present extended evolutionary simulations of heterogeneous ensembles of probabilistic strategies including mutation and selection, and report the unexpected success of another protagonist: Pavlov. This strategy is as simple as tit-for-tat and embodies the fundamental behavioural mechanism win-stay, lose-shift, which seems to be a widespread rule¹³. Pavlov's success is based on two important advantages over tit-for-tat: it can correct occasional mistakes and exploit unconditional cooperators. This second feature prevents Pavlov populations from being undermined by unconditional cooperators, which in turn invite defectors. Pavlov seems to be more robust than tit-for-tat, suggesting that cooperative behaviour in natural situations may often be based on win-stay, lose-shift.
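
The two advantages named in the abstract can be illustrated with a minimal deterministic sketch. It is not the authors' evolutionary simulation, which uses probabilistic strategies with mutation and selection. Everything here is an assumption made for illustration: the function names, the opening move (both players cooperate), and the single forced error in round 2. Win-stay, lose-shift is encoded as "cooperate only if both players made the same move last round", which follows from treating any round in which the opponent cooperated as a win and any round in which the opponent defected as a loss.

```python
# Moves: "C" = cooperate, "D" = defect.
# Each strategy maps (own last move, opponent's last move) to the next move.

def pavlov(my_last, opp_last):
    """Win-stay, lose-shift: cooperate only after both made the same move."""
    return "C" if my_last == opp_last else "D"

def tit_for_tat(my_last, opp_last):
    """Repeat whatever the opponent did in the previous round."""
    return opp_last

def always_cooperate(my_last, opp_last):
    """Unconditional cooperator."""
    return "C"

def play(strategy_a, strategy_b, rounds, errors_a=()):
    """Return a list of (move_a, move_b) pairs.

    Assumption: both players open with "C". In every round listed in
    errors_a, player A's intended move is flipped (a one-off mistake).
    """
    flip = {"C": "D", "D": "C"}
    history = []
    a, b = "C", "C"
    for t in range(rounds):
        if t > 0:
            prev_a, prev_b = history[-1]
            a = strategy_a(prev_a, prev_b)
            b = strategy_b(prev_b, prev_a)
        if t in errors_a:
            a = flip[a]
        history.append((a, b))
    return history

def show(history):
    """Format a history as space-separated move pairs, A's move first."""
    return " ".join(a + b for a, b in history)

# Error correction: one mistaken defection in round 2.
print(show(play(pavlov, pavlov, 6, errors_a={2})))            # CC CC DC DD CC CC
print(show(play(tit_for_tat, tit_for_tat, 6, errors_a={2})))  # CC CC DC CD DC CD

# Exploiting an unconditional cooperator after the same mistake.
print(show(play(pavlov, always_cooperate, 6, errors_a={2})))       # CC CC DC DC DC DC
print(show(play(tit_for_tat, always_cooperate, 6, errors_a={2})))  # CC CC DC CC CC CC
```

In this sketch two Pavlov players return to mutual cooperation two rounds after the mistake, whereas two tit-for-tat players lock into alternating defections. Against an unconditional cooperator, Pavlov keeps defecting once a mistake reveals that defection goes unpunished, while tit-for-tat resumes cooperating.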

References

1. Axelrod, R. The Evolution of Cooperation (Basic Books, New York, 1984).
2. Axelrod, R. & Hamilton, W. D. Science 211, 1390–1396 (1981).
3. Axelrod, R. & Dion, D. Science 242, 1385–1390 (1988).
4. Wilkinson, G. Nature 308, 181–184 (1984).
5. Lombardo, M. P. Science 227, 1363–1365 (1985).
6. Milinski, M. Nature 325, 433–435 (1987).
7. May, R. M. Nature 327, 15–17 (1987).
8. Dugatkin, L. A. Behav. Ecol. Sociobiol. 25, 395–397 (1988).
9. Nowak, M. & Sigmund, K. Nature 355, 250–253 (1992).
10. Krebs, J. R. & Davies, N. B. An Introduction to Behavioural Ecology (Sinauer, MA, 1981).
11. Dawkins, R. The Selfish Gene (Oxford Univ. Press, Oxford, 1988).
12. Sigmund, K. Games of Life (Oxford Univ. Press, Oxford, 1993).
13. Domjan, M. & Burkhard, B. The Principles of Learning and Behaviour (Brooks/Cole, Monterey, 1986).
14. Nowak, M. A. & May, R. M. Nature 359, 826–829 (1992).
15. Selten, R. & Hammerstein, P. Th. Behav. Brain Sci. 7, 115–142 (1984).
16. Boyd, R. & Lorberbaum, J. P. Nature 327, 58–59 (1987).
17. Nowak, M. & Sigmund, K. Proc. natn. Acad. Sci. U.S.A. 90, 5091–5094 (1993).
18. Kraines, D. & Kraines, V. Theory and Decision 26, 47–63 (1988).
19. Rapoport, A. & Chammah, A. M. Prisoner's Dilemma (Univ. of Michigan Press, Ann Arbor, 1965).
20. Nowak, M. & Sigmund, K. Acta appl. Math. 20, 247–265 (1990).
21. Boyd, R. J. theor. Biol. 136, 47–56 (1989).
22. Maynard Smith, J. Evolution and the Theory of Games (Cambridge Univ. Press, Cambridge, 1982).
23. Hofbauer, J. & Sigmund, K. The Theory of Evolution and Dynamical Systems (Cambridge Univ. Press, Cambridge, 1988).
24. Maynard Smith, J. Th. Behav. Brain Sci. 7, 95–101 (1984).
25. Axelrod, R. in Genetic Algorithms and Simulated Annealing (ed. Davis, D.) (Pitman, London, 1987).
26. Lindgren, K. in Artificial Life II (eds Farmer, D. et al.) (Proc. Santa Fe Inst. Stud., Addison-Wesley, 1991).
27. Reboreda, J. C. & Kacelnik, A. J. exp. Animal Behav. 60, 176–193 (1993).

Author information

Authors and Affiliations

Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK
Martin Nowak

Institut für Mathematik, Universität Wien, Strudlhofgasse 4, A-1090, Vienna, Austria
Karl Sigmund

About this article

Cite this article

Nowak, M., Sigmund, K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game. Nature 364, 56–58 (1993). https://doi.org/10.1038/364056a0

Received: 01 February 1993
Accepted: 15 April 1993
Issue Date: 01 July 1993
DOI: https://doi.org/10.1038/364056a0

This article is cited by

Strategy inference using the maximum likelihood estimation in the iterated prisoner’s dilemma game
- Minjae Kim
Journal of the Korean Physical Society (2024)
Effect of reciprocity mechanisms on evolutionary dynamics in feedback-evolving games
- Xiaojian Ma
- Ji Quan
- Xianjia Wang
Nonlinear Dynamics (2024)
New memory-one strategies of the Iterated Prisoner’s Dilemma: a new framework to programmed human-AI interaction
- Katharine Padilha de Paulo
- Carlos Alberto Estombelo-Montesco
- Julian Tejada
Discover Psychology (2024)
The effect of environmental information on evolution of cooperation in stochastic games
- Maria Kleshnina
- Christian Hilbe
- Martin A. Nowak
Nature Communications (2023)
Intrinsic fluctuations of reinforcement learning promote cooperation
- Wolfram Barfuss
- Janusz M. Meylahn
Scientific Reports (2023)
