THE Prisoner's Dilemma is the leading metaphor for the evolution of cooperative behaviour in populations of selfish agents, especially since the well-known computer tournaments of Axelrod1 and their application to biological communities2,3. In Axelrod's simulations, the simple strategy tit-for-tat did outstandingly well and subsequently became the major paradigm for reciprocal altruism4 12. Here we present extended evolutionary simulations of heterogeneous ensembles of probabilistic strategies including mutation and selection, and report the unexpected success of another protagonist: Pavlov. This strategy is as simple as tit-for-tat and embodies the fundamental behavioural mechanism win-stay, lose-shift, which seems to be a widespread rule13. Pavlov's success is based on two important advantages over tit-for-tat: it can correct occasional mistakes and exploit unconditional cooperators. This second feature prevents Pavlov populations from being undermined by unconditional cooperators, which in turn invite defectors. Pavlov seems to be more robust than tit-for-tat, suggesting that cooperative behaviour in natural situations may often be based on win-stay, lose-shift.
This is a preview of subscription content
Subscribe to Journal
Get full journal access for 1 year
only $3.90 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Tax calculation will be finalised during checkout.
Get time limited or full article access on ReadCube.
All prices are NET prices.
Axelrod, R. The Evolution of Cooperation (Basic Books, New York, 1984).
Axelrod, R. & Hamilton, W. D. Science 211, 1390–1396 (1981).
Axelrod, R. & Dion, D. Science 242, 1385–1390 (1988).
Wilkinson, G. Nature 308, 181–184 (1984).
Lombardo, M. P. Science 227, 1363–1365 (1985).
Milinski, M. Nature 325, 433–435 (1987).
May, R. M. Nature 327, 15–17 (1987).
Dugatkin, L. A. Behav. Ecol. Sociobiol. 25, 395–397 (1988).
Nowak, M. & Sigmund, K. Nature 355, 250–253 (1992).
Krebs, J. R. & Davies N. B. An Introduction to Behavioural Ecology (Sinauer, MA, 1981).
Dawkins, R. The Selfish Gene (Oxford Univ. Press, Oxford, 1988).
Sigmund, K. Games of Life (Oxford Univ. Press, Oxford, 1993).
Domjan, M. & Burkhard, B. The Principles of Learning and Behaviour (Brooks/Cole, Monterey, 1986).
Nowak, M. A. & May, R. M. Nature 359, 826–829 (1992).
Selten, R. & Hammerstein, P. Th. Behav. Brain Sci. 7, 115–142 (1984).
Boyd, R. & Lorberbaum, J. P. Nature 327, 58–59 (1987).
Nowak, M. & Sigmund, K. Proc. natn. Acad. Sci. U.S.A. 90, 5091–5094 (1993).
Kraines, D. & Kraines, V. Theory and Decision 26, 47–63 (1988).
Rapoport, A. & Chammah, A. M. Prisoner's Dilemma (Univ. of Michigan Press, Ann Arbor, 1965).
Nowak, M. & Sigmund, K. Acta appl. Math. 20, 247–265 (1990).
Boyd, R. J. theor. Biol. 136, 47–56 (1989).
Maynard Smith, J. Evolution and the Theory of Games (Cambridge Univ. Press, Cambridge, 1982).
Hofbauer, J. & Sigmund, K. The Theory of Evolution and Dynamical Systems (Cambridge Univ. Press, Cambridge, 1988).
Maynard Smith, J. Th. Behav. Brain Sci. 7, 95–101 (1984).
Axelrod, R. in Genetic Algorithms and Simulated Annealing (ed. Davis, D.) (Pitman, London, 1987).
Lindgren, K. in Artificial Life II (eds Farmer, D. et al.) (Proc. Santa Fe Inst. Stud., Addison Welsey, 1991).
Reboreda, J. C. & Kacelnik, A. J. exp. Animal Behav. 60, 176–193 (1993).
About this article
Cite this article
Nowak, M., Sigmund, K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner's Dilemma game. Nature 364, 56–58 (1993). https://doi.org/10.1038/364056a0
Nature Communications (2022)
Dysregulated affective arousal regulates reward-based decision making in patients with schizophrenia: an integrated study
Journal of the Operations Research Society of China (2022)
Neuroscience Bulletin (2022)
Reconsidering Meaningful Learning in a Bandit Experiment on Weighted Voting: Subjects’ Search Behavior
The Review of Socionetwork Strategies (2022)