Extortion subdues human players but is finally punished in the prisoner’s dilemma

Hilbe, Christian; Röhl, Torsten; Milinski, Manfred

doi:10.1038/ncomms4976

Download PDF

Article
Open access
Published: 29 May 2014

Extortion subdues human players but is finally punished in the prisoner’s dilemma

Christian Hilbe^1,2,
Torsten Röhl¹ &
Manfred Milinski³

Nature Communications volume 5, Article number: 3976 (2014) Cite this article

4587 Accesses
66 Citations
36 Altmetric
Metrics details

Subjects

Social evolution

Abstract

Extortion is the practice of obtaining advantages through explicit forces and threats. Recently, it was demonstrated that even the repeated prisoner’s dilemma, one of the key models to explain mutual cooperation, allows for implicit forms of extortion. According to the theory, extortioners demand and receive an excessive share of any surplus, which allows them to outperform any adapting co-player. To explore the performance of such strategies against humans, we have designed an economic experiment in which participants were matched either with an extortioner or with a generous co-player. Although extortioners succeeded against each of their human opponents, extortion resulted in lower payoffs than generosity. Human subjects showed a strong concern for fairness: they punished extortion by refusing to fully cooperate, thereby reducing their own, and even more so, the extortioner’s gains. Thus, the prospects of extorting others in social relationships seem limited; in the long run, generosity is more profitable.

Improving microbial phylogeny with citizen science within a mass-market video game

Article Open access 15 April 2024

Anger is eliminated with the disposal of a paper written because of provocation

Article Open access 09 April 2024

Persistent interaction patterns across social media platforms and over time

Article Open access 20 March 2024

Introduction

The repeated prisoner’s dilemma has a long tradition of serving as a key model to explore the evolution of cooperation^1,2,3,4,5,6. The rules of this stylized game are simple: in each round, two subjects simultaneously decide whether to cooperate or to defect. When both subjects cooperate they each receive a payoff R, which exceeds the payoff P for mutual defection. However, when a cooperating subject encounters a defector, the defector gets the highest possible payoff T, whereas the cooperator obtains the lowest payoff S. Although mutual defection is inefficient, it is the unique equilibrium if the prisoner’s dilemma is only played for a single round. However, if subjects have the option to reciprocate past actions in future encounters, a considerable body of evidence suggests that mutual cooperation becomes feasible^{7,8,9,10,11,12}, and that it is in fact favoured by evolutionary forces^{13,14,15,16,17,18}.

Recently, the conclusion that repetition naturally promotes mutual cooperation has been challenged. With an elegant mathematical proof, Press and Dyson¹⁹ have demonstrated that the repeated prisoner’s dilemma also contains sophisticated strategies that aim to dominate the co-player. Such extortionate strategies have three remarkable properties: (i) they enforce a linear relationship between the player’s own payoff and the opponent’s payoff (strategies with this property were called zero-determinant strategies or ZD strategies); (ii) they prescribe to cooperate sufficiently often, such that the opponent’s best response is to be fully cooperative; (iii) at the same time, extortioners aim to cooperate less often than their opponent, to gain higher payoffs. As a result, extortioners are unbeatable: in a pairwise encounter, they cannot be outperformed by any opponent. These surprising findings have attracted considerable attention²⁰, as they suggest that sophisticated players aware of such strategies are able to manipulate and exploit their partners, which should result in an evolutionary advantage.

Despite this relative strength, extortioners have problems to succeed in evolving populations^21,22,23. Extortion is unstable: as a homogeneous population of extortioners ends up with the mutual defection payoff P, more cooperative strategies can easily invade and take over the population. Eventually, this dynamics may even promote the emergence of generous ZD strategies, which may be considered as the more benevolent counterpart to extortioners²⁴. Generous ZD strategies share the first two properties of extortioners: they enforce a linear relationship between the payoffs of the two players, and they provide incentives for the opponent to cooperate. However, as opppsed to extortioners who aim to outcompete their opponents, the payoff of generous players never exceeds the payoff of the co-player. Although generous strategies seem to be too modest to succeed, they evolve under a wide range of conditions^25,26,27. Extortionate strategies, on the other hand, require specific assumptions to be successful: extortioners either need to be stubborn and to stick to their strategy¹⁹, or they need to adopt new strategies at a slower rate than their co-players^21,28,29,30.

Although these previous theoretical studies offer a fascinating new perspective on direct reciprocity and repeated games, they raise great expectations for studying how the two strategy classes, extortion and generosity, perform against real subjects. To this end, we have designed an economic experiment with four different treatments (see Table 1 and Methods). In each treatment, human subjects played 60 rounds of the prisoner’s dilemma against a predefined computer programme (subjects did not receive any information about the length of the game or the nature of their opponent). The four treatments differed in the implemented ZD strategy of the computer programme, which was either strongly extortionate (ES), mildly extortionate (EM), mildly generous (GM) or strongly generous (GS).

Table 1 Overview of the experimental design.

Full size table

For all treatments, theory predicts that humans maximize their expected payoff by cooperating in every round. In that case, extortioners do not only outperform their human opponents, but they are also expected to receive higher average payoffs than the generous ZD strategies. In the experiment, however, we find that although extortionate strategies indeed dominate their human co-players, this success comes at a cost. Humans are significantly less cooperative against extortioners. As a result, generosity is the more profitable strategy.

Results

Performance of ZD strategies against humans

Figure 1 shows the resulting average payoffs over all 60 rounds of the game, across the 4 treatments. These results confirm that the two extortionate ZD strategies indeed gain higher payoffs than their human co-players. For example, in the strong extortion treatment, the computer programme obtained an average payoff of π_ES=[euro]0.192 per round, whereas the human subjects earned on average (Wilcoxon matched-pairs signed-rank test, n_ES=16 human co-players, Z=3.523, P<0.001). Similarly, the mildly extortionate ZD strategy received a payoff of π_EM=[euro]0.208, which clearly exceeds the mean payoff of the human opponents, (Wilcoxon matched-pairs signed-rank test, n_EM=14, Z=3.181, P=0.001).

**Figure 1: Average payoffs across the four treatments for humans (empty bars) and the ZD strategies implemented by the computer programme (filled bars).**

Conversely, in the two generosity treatments human subjects had the upper hand, as expected. In the mild generosity treatment, the ZD strategy earned π_GM= [euro]0.235, as compared with the human subjects’ mean payoff (Wilcoxon matched-pairs signed-rank test, n_GM=14, Z=−2.527, P=0.012). Lastly, the strong generosity treatment resulted in an average payoff of π_GS=[euro]0.237 for the ZD strategy and for the human co-players (Wilcoxon matched-pairs signed-rank test, n_GS=16, Z=−2.521, P=0.012). Thus, extortionate strategies dominated their respective co-players, whereas generous strategies let their co-players succeed. These results are in line with the theory of ZD strategies, which in fact makes virtually no assumptions about human play¹⁹. In addition, the relationship between the payoffs of the ZD strategist and the human co-player fits reasonably to the linear prediction, as illustrated by Fig. 2, despite the fact that the experimental game is only played for a finite number of rounds (see Methods).

**Figure 2: Comparison of experimental results to the theoretical prediction.**

Comparison of the performance of different ZD strategies

Surprisingly, however, both extortionate ZD strategies yielded a lower payoff than their two generous counterparts. Indeed, when we pool the two extortionate treatments and the two generous treatments, we find that generosity resulted in a >18% increase in payoffs (Mann–Whitney U-test, n_E=n_G=30, Z=−2.544, P=0.011). Against an extortionate ZD strategy, the mean cooperation rate of the human co-players was 34.2%, which is only half of the cooperation rate against generous ZD strategies, 67.7% (Mann–Whitney U-test, n_E=n_G=30, Z=−3.625, P<0.001). This gap comes unexpected, as the different ZD strategies provide similar incentives for their human co-players to cooperate (as indicated by the matching slope values in Table 1). However, a comparison of the human decisions over the course of the game suggests that the treatments followed a different dynamical pattern (Fig. 3). Generous ZD strategies were more successful in motivating their human co-players towards more cooperation: in the two generous treatments, humans had a cooperation rate of 53.0% during the first ten rounds, as compared with 76.0% during the last ten rounds (Wilcoxon matched-pairs signed-rank test, n_G=30, Z=3.161, P=0.002). In contrast, when paired with an extortionate ZD strategy, the cooperation rate of human subjects only slightly increased from 30.3% during the first ten rounds to 39.7% during the last ten rounds (this increase was not significant, Wilcoxon matched-pairs signed-rank test, n_E=30, Z=1.131, P=0.258).

**Figure 3: Human cooperation rates over the course of the game.**

These results suggest that humans were somewhat reluctant to cooperate against extortioners. In fact, in the extortion treatments <14% of the human co-players were fully cooperative during the last ten rounds of the game, as compared with >63% in the generosity treatments (see Supplementary Fig. 1). On the other extreme, a third of the human subjects refused to cooperate against an extortionate co-player during the last ten rounds of the game, whereas only 1 out of 30 subjects did so in the generosity treatments. Thus, although the different treatments provided similar monetary incentives for cooperation, subjects were more hesitant to cooperate against an extortionate co-player. Withholding cooperation against these ZD strategies can be considered as a form of costly punishment (Fig. 4 and Supplementary Fig. 2). For example, reducing one’s cooperation rate by 10% against strong extortioners decreased the opponent’s mean payoff per round by [euro] 0.029, but it also diminished the own payoff by [euro] 0.011. The resulting fine-to-cost ratio for punishment, 0.029/0.011≈2.6, is close to typical values used in experiments on costly punishment³². Being less cooperative thus led to a strong reduction in the co-player’s payoff, but it also turned out to be costly for the punishing individual itself.

**Figure 4: Withholding cooperation as a form of costly punishment.**

Discussion

Repeated games, and in particular the repeated prisoner’s dilemma, are model cases to explore the tension between cooperation and conflict in long-term social relations³³. Although repetition was previously thought to promote cooperation, it has recently been suggested that iterated games may open the door for the systematic manipulation of opponents¹⁹. The newly discovered ZD strategies are surprisingly simple: they do not require to take the whole history of the game into account—it is sufficient to consider the last round only. Although previous literature on ZD strategies has focused on infinitely repeated games, social relationships in reality (and also our experiment) have a finite though fuzzy horizon. However, as we show in the Methods, this does not notably diminish the power of ZD strategies; if there is a sufficient number of rounds, ZD strategists have a similar amount of control as in the infinitely repeated game.

Two subclasses of ZD strategies have received particular attention: extortioners, as they are able to outcompete their direct opponents¹⁹, and generous ZD strategies, as they allow for stable mutual cooperation^25,26. Herein, we have investigated the performance of these two strategy classes against human subjects. Our results confirm that extortioners dominated their direct opponents, but unexpectedly generosity turned out to be the more profitable strategy. In a way, extortion meant to ‘win each battle, but at the expense of losing the war’. These findings are superficially in line with previous evolutionary studies, which suggested that natural selection in well-mixed populations favours generous ZD strategies^26,27. However, in these theoretical studies the success of generosity was based on a different argument; in an evolving population extortion does not prevail because mutual extortion is unstable, which leads extortioners to change their strategy²¹. In our experiment, the strategy of the extortioners was fixed, but extortioners were unable to motivate their co-players to cooperate fully, despite setting up appropriate incentives.

There are two possible explanations why humans were reluctant to cooperate against extortioners. On the one hand, subjects may have strived for high payoffs, but they did not have enough time to learn that they need to fully cooperate to reach this aim. This seems to be especially relevant as their opponents’ strategies were stochastic and thus not straightforward to predict. However, this argument does not explain why generous ZD players were more successful to catalyse cooperation than extortioners—after all, the implemented ZD strategies were equally complex and they provided comparable monetary incentives to promote cooperation. Instead, our results suggest that the subjects were not only driven by monetary considerations, but that they were willing to apply reciprocal strategies to oppose extortionate behaviours. In fact, several behavioural studies have reported that a large fraction of humans can be described as conditional cooperators^34,35,36. In line with this hypothesis, we find that humans were almost four times more likely to cooperate in a given round if their co-player did so in the previous round (human cooperation rates were 81.1% if the co-player cooperated in the previous round, and 22.0% otherwise, see Supplementary Table 2). Reciprocal behaviours in turn can have various behavioural roots, such as conformism, or the wish to enforce fair outcomes^37,38,39. In our generosity treatments the two possible objectives, payoff-maximization and fairness, were perfectly aligned; by maximizing their expected payoffs humans also ensured equal outcomes. In contrast, in the extortion treatments there was a trade-off; humans that aimed to maximize their payoffs had to accept the most unfair outcome. As more than half of the participants declared in the post-experiment questionnaire that equality motives affected their decisions, the wish to ensure fair outcomes may have been an important reason for the downfall of extortion.

However, unlike in other strategic situations as in the ultimatum game⁴⁰, unfairness was not straightforward to detect in our behavioural experiment. It is not a single selfish decision that makes an opponent behaving extortionate. Rather, it is the systematic interplay of selfishness and cooperation, which only unfolds itself over the course of the game. At first sight, the extortionate strategies described by Press and Dyson¹⁹ look rather inconspicuous (which may be one of the reasons why these strategies were discovered only recently). Extortioners apply a simple, conditionally cooperative strategy—with a slight bias to their own advantage. Although this more implicit form of selfishness seems to be more difficult to detect, humans have evolved mechanisms such as conditional cooperation that prevent them from being exploited.

Although our experiment did not entail an explicit punishment option, we found that by withholding contributions, subjects applied an implicit form of costly punishment. Such an effect has not been reported previously. In fact, it seems difficult to show such an effect with a conventional experiment, in which two human subjects play against each other. One would have to demonstrate that withholding cooperation is indeed individually costly. However, this seems almost impossible, as long as the co-player’s strategy is unknown (for example, against an unconditional defector, withholding cooperation is the best response and hence no instance of costly punishment). Overall, our results thus suggest that sufficient monetary incentives alone are not enough to induce cooperation in long-term social relationships. Instead, humans take additional motives such as individual intentions and fairness considerations into account, and they are ready to fight back when they feel exploited.

Methods

Experimental design

Experiments were conducted in November and December 2013 at the universities of Kiel and Hamburg, Germany, with subjects recruited from a first-year course in biology. All participants gave their informed consent to participate. For each of ten experimental sessions, we invited six volunteers to participate in a game. To ensure the subjects’ anonymity, participants were separated by opaque partitions, they were playing under a neutral pseudonym and they were not allowed to talk to each other during or after the experiment. All experimental decisions were made on a computer screen using the experimental software Z-Tree⁴¹. As we were interested in the relative performance of extortionate and generous strategies, participants were not playing against each other, but against a randomly determined computer strategy (out of the four alternatives ES, EM, GM or GS, as outlined in Table 1). The subjects’ instructions were kept in a neutral way, that is, subjects were neither told that they would interact with a computer opponent nor that they would play against each other (see Supplementary Methods for a translation of the experiment’s instructions). The game consisted of 60 rounds of the prisoner’s dilemma (subjects were not informed about the exact duration of the game, but rather that they would play over many rounds). The experiment took ~1 h. Including the show-up fee of [euro] 10, individual earnings were on average between [euro] 17.65 (in the strong extortion treatment) and [euro] 26.78 (in the strong generosity treatment).

Theoretical predictions

The extortionate and generous strategies used for the experiment are instances of a more general strategy class, the class of ZD strategies. In an infinitely repeated prisoner’s dilemma, a ZD strategist can unilaterally enforce a linear relation between his own payoff π and the co-player’s payoff . That is, payoffs obey a linear relation of the form^19,21,25,26

where l and s are characteristic properties of the applied ZD strategy²⁷. The baseline payoff l can be interpreted as the payoff of a ZD strategy against itself (for the two extortionate strategies l=P, and for the two generous strategies l=R). The slope s determines how strongly the payoffs of the two players are correlated (for the two mild treatments, we have used s=2/3, corresponding to a rather high correlation; for the two strong treatments, we have used s=1/3). If the prisoner’s dilemma is only repeated for a finite number of rounds M, equation (1) does not need to be satisfied any longer. Nevertheless, one can derive the following estimate for the players’ expected payoffs (see Supplementary Methods),

where p₀ is the probability that the ZD-strategist cooperates in the first round and φ is a constant. In Fig. 2, the expected payoff range according to equation (2) is depicted as a thin black area. See Supplementary Methods for further details.

Additional information

How to cite this article: Hilbe, C. et al. Extortion subdues human players but is finally punished in the prisoner’s dilemma. Nat. Commun. 5:3976 doi: 10.1038/ncomms4976 (2014).

References

Rapoport, A. & Chammah, A. M. Prisoner’s Dilemma University of Michigan Press (1965).
Trivers, R. L. The evolution of reciprocal altruism. Q. Rev. Biol. 46, 35–57 (1971).
Article Google Scholar
Axelrod, R. & Hamilton, W. D. The evolution of cooperation. Science 211, 1390–1396 (1981).
Article CAS ADS MathSciNet Google Scholar
Boyd, R. & Lorberbaum, J. No pure strategy is evolutionary stable in the iterated prisoner’s dilemma game. Nature 327, 58–59 (1987).
Article ADS Google Scholar
Nowak, M. A. Five rules for the evolution of cooperation. Science 314, 1560–1563 (2006).
Article CAS ADS Google Scholar
Sigmund, K. The Calculus of Selfishness Princeton University Press (2010).
Friedman, J. A non-cooperative equilibrium for supergames. Rev. Econ. Stud. 38, 1–12 (1971).
Article ADS Google Scholar
Aumann, R. J. Survey of Repeated Games Wissenschaftsverlag: Mannheim, (1981).
Fudenberg, D. & Maskin, E. Evolution and cooperation in noisy repeated games. Am. Econ. Rev. 80, 274–279 (1990).
Google Scholar
Wedekind, C. & Milinski, M. Human cooperation in the simultaneous and the alternating prisoner’s dilemma: Pavlov versus generous tit-for-tat. Proc. Natl Acad. Sci. USA 93, 2686–2689 (1996).
Article CAS ADS Google Scholar
Milinski, M. & Wedekind, C. Working memory constrains human cooperation in the prisoner’s dilemma. Proc. Natl Acad. Sci. USA 95, 13755–13758 (1998).
Article CAS ADS Google Scholar
Dal Bó, P. & Fréchette, G. R. The evolution of cooperation in infinitely repeated games: experimental evidence. Am. Econ. Rev. 101, 412–429 (2011).
Article Google Scholar
Molander, P. The optimal level of generosity in a selfish, uncertain environment. J. Conflict. Resol. 29, 611–618 (1985).
Article Google Scholar
Nowak, M. A. & Sigmund, K. Tit for tat in heterogeneous populations. Nature 355, 250–253 (1992).
Article ADS Google Scholar
Stephens, D. W., McLinn, C. M. & Stevens, J. R. Discounting and reciprocity in an iterated prisoner’s dilemma. Science 298, 2216–2218 (2002).
Article CAS ADS Google Scholar
Nowak, M. A., Sasaki, A., Taylor, C. & Fudenberg, D. Emergence of cooperation and evolutionary stability in finite populations. Nature 428, 646–650 (2004).
Article CAS ADS Google Scholar
St. Pierre, A., Larose, K. & Dubois, F. Long-term social bonds promote cooperation in the iterated prisoner’s dilemma. Proc. R. Soc. B Biol. Sci. 276, 4223–4228 (2009).
Article Google Scholar
Fischer, I. et al. Fusing enacted and expected mimicry generates a winning strategy that promotes the evolution of cooperation. Proc. Natl Acad. Sci. USA 110, 10229–10233 (2013).
Article CAS ADS Google Scholar
Press, W. H. & Dyson, F. D. Iterated prisoner’s dilemma contains strategies that dominate any evolutionary opponent. Proc. Natl Acad. Sci. USA 109, 10409–10413 (2012).
Article CAS ADS Google Scholar
Ball, P. Physicists suggest selfishness can pay. Nature doi:10.1038/nature.2012.11254 (2012).
Hilbe, C., Nowak, M. A. & Sigmund, K. The evolution of extortion in iterated prisoner’s dilemma games. Proc. Natl Acad. Sci. USA 110, 6913–6918 (2013).
Article CAS ADS MathSciNet Google Scholar
Adami, C. & Hintze, A. Evolutionary instability of zero-determinant strategies demonstrates that winning is not everything. Nat. Commun. 4, 2193 (2013).
Article ADS Google Scholar
Szolnoki, A. & Perc, M. Evolution of extortion in structured populations. Phys. Rev. E 89, 022804 (2014).
Article ADS Google Scholar
Stewart, A. J. & Plotkin, J. B. Extortion and cooperation in the prisoner’s dilemma. Proc. Natl Acad. Sci. USA 109, 10134–10135 (2012).
Article CAS ADS Google Scholar
Akin, E. Stable cooperative solutions for the iterated prisoner’s dilemma. Preprint at http://arxiv.org/abs/1211.0969 (2013).
Stewart, A. J. & Plotkin, J. B. From extortion to generosity, evolution in the iterated prisoner’s dilemma. Proc. Natl Acad. Sci. USA 110, 15348–15353 (2013).
Article CAS ADS MathSciNet Google Scholar
Hilbe, C., Nowak, M. A. & Traulsen, A. Adaptive dynamics of exortion and compliance. PLoS One 8, e77886 (2013).
Article CAS ADS Google Scholar
Bergstrom, C. T. & Lachmann, M. The Red King Effect: when the slowest runner wins the coevolutionary race. Proc. Natl Acad. Sci. USA 100, 593–598 (2003).
Article CAS ADS Google Scholar
Frean, M. R. & Abraham, E. R. Adaptation and enslavement in endosymbiont-host associations. Phys. Rev. E 69, 051913 (2004).
Article ADS Google Scholar
Damore, J. A. & Gore, J. A slowly evolving host moves first in symbiotic interactions. Evolution 65, 2391–2398 (2011).
Article Google Scholar
Nowak, M. A. & Sigmund, K. A strategy of win-stay, lose-shift that outperforms tit-for-tat in the Prisoner’s Dilemma game. Nature 364, 56–58 (1993).
Article CAS ADS Google Scholar
Fehr, E. & Gächter, S. Cooperation and punishment in public goods experiments. Am. Econ. Rev. 90, 980–994 (2000).
Article Google Scholar
Colman, A. M. Game Theory and its Applications in the Social and Biological Sciences Butterworth-Heinemann (1995).
Keser, C. & van Winden, F. Conditional cooperation and voluntary contributions to public goods. Scand. J. Econ. 102, 23–39 (2000).
Article Google Scholar
Fischbacher, U., Gächter, S. & Fehr, E. Are people conditionally cooperative? Evidence from a public goods experiment. Econ. Lett. 71, 397–404 (2001).
Article Google Scholar
Fischbacher, U. & Gächter, S. Social preferences, beliefs, and the dynamics of free riding in public goods experiments. Am. Econ. Rev. 100, 541–556 (2010).
Article Google Scholar
Fehr, E. & Schmidt, K. A theory of fairness, competition, and cooperation. Q. J. Econ. 114, 817–868 (1999).
Article Google Scholar
Brosnan, S. F. & de Waal, F. B. M. Monkeys reject unequal pay. Nature 425, 294–297 (2003).
Article ADS Google Scholar
Oechssler, J. Finitely repeated games with social preferences. Exper. Econ. 16, 222–231 (2013).
Article Google Scholar
Güth, W., Schmittberger, R. & Schwarze, B. An experimental analysis of ultimatum bargaining. J. Econ. Behav. Organ. 3, 376–388 (1982).
Article Google Scholar
Fischbacher, U. z-tree: Zurich toolbox for ready-made economic experiments. Exper. Econ. 10, 171–178 (2007).
Article Google Scholar

Download references

Acknowledgements

We are grateful to A. Traulsen for his insightful comments. Moreover, we thank K. Hagel, H. Brendelberger and S. Dobler for support in performing the experiment and the 60 students for their participation. C.H. acknowledges generous funding from the excellence scholarship of the University of Vienna and from the Schrödinger scholarship of the Austrian Science Fund (FWF) J3475.

Author information

Authors and Affiliations

Evolutionary Theory Group, Max-Planck-Institute for Evolutionary Biology, August-Thienemann-Strasse 2, 24306, Plön, Germany
Christian Hilbe & Torsten Röhl
Program for Evolutionary Dynamics, Harvard University, One Brattle Square, Cambridge, 02138, Massachusetts, USA
Christian Hilbe
Department of Evolutionary Ecology, Max-Planck-Institute for Evolutionary Biology, August-Thienemann-Strasse 2, 24306, Plön, Germany
Manfred Milinski

Authors

Christian Hilbe
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Röhl
View author publications
You can also search for this author in PubMed Google Scholar
Manfred Milinski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.H., T.R. and M.M. designed the research; C.H. and M.M. performed the experiment and wrote the paper.

Corresponding author

Correspondence to Christian Hilbe.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

Supplementary Figures 1-2, Supplementary Tables 1-2, Supplementary Methods and Supplementary References (PDF 213 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 3.0 Unported License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/3.0/

Reprints and permissions

About this article

Cite this article

Hilbe, C., Röhl, T. & Milinski, M. Extortion subdues human players but is finally punished in the prisoner’s dilemma. Nat Commun 5, 3976 (2014). https://doi.org/10.1038/ncomms4976

Download citation

Received: 14 February 2014
Accepted: 25 April 2014
Published: 29 May 2014
DOI: https://doi.org/10.1038/ncomms4976

This article is cited by

Game Theory and the Evolution of Cooperation
- Bo-Yu Zhang
- Shan Pei
Journal of the Operations Research Society of China (2022)
Controlling Conditional Expectations by Zero-Determinant Strategies
- Masahiko Ueda
Operations Research Forum (2022)
Human players manage to extort more than the mutual cooperation payoff in repeated social dilemmas
- Chiara D’Arcangelo
- Luciano Andreozzi
- Marco Faillo
Scientific Reports (2021)
Five rules for friendly rivalry in direct reciprocity
- Yohsuke Murase
- Seung Ki Baek
Scientific Reports (2020)
The interplay of emotion expressions and strategy in promoting cooperation in the iterated prisoner’s dilemma
- Celso M. de Melo
- Kazunori Terada
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.