An optimal strategy to solve the Prisoner’s Dilemma

Bravetti, Alessandro; Padilla, Pablo

doi:10.1038/s41598-018-20426-w

Download PDF

Article
Open access
Published: 31 January 2018

An optimal strategy to solve the Prisoner’s Dilemma

Alessandro Bravetti¹^na1 &
Pablo Padilla^1,2^na1

Scientific Reports volume 8, Article number: 1948 (2018) Cite this article

35k Accesses
10 Citations
17 Altmetric
Metrics details

Subjects

Abstract

Cooperation is a central mechanism for evolution. It consists of an individual paying a cost in order to benefit another individual. However, natural selection describes individuals as being selfish and in competition among themselves. Therefore explaining the origin of cooperation within the context of natural selection is a problem that has been puzzling researchers for a long time. In the paradigmatic case of the Prisoner’s Dilemma (PD), several schemes for the evolution of cooperation have been proposed. Here we introduce an extension of the Replicator Equation (RE), called the Optimal Replicator Equation (ORE), motivated by the fact that evolution acts not only at the level of individuals of a population, but also among competing populations, and we show that this new model for natural selection directly leads to a simple and natural rule for the emergence of cooperation in the most basic version of the PD. Contrary to common belief, our results reveal that cooperation can emerge among selfish individuals because of selfishness itself: if the final reward for being part of a society is sufficiently appealing, players spontaneously decide to cooperate.

Self-regulation versus social influence for promoting cooperation on networks

Article Open access 16 March 2020

Evolution of cooperation and consistent personalities in public goods games

Article Open access 09 December 2021

The conditional defector strategies can violate the most crucial supporting mechanisms of cooperation

Article Open access 07 September 2022

Introduction

Cooperation is so important that it has been suggested as a fundamental principle of evolution, besides reproduction, mutation and selection^{1,2,3,4,5,6,7,8,9,10,11,12,13}. However, the situation regarding the underlying mechanisms responsible for the emergence of cooperation among individuals who try to maximize their own fitness and who are competing among each other is not clear. To try to solve such complicated puzzle, several mechanisms have been proposed during the last decades, including kin selection^1,6, direct reciprocity^2,3,14, indirect reciprocity^15,16, network reciprocity^17,18, group selection^9,19, green beards^5,20, optional participation^21,22, punishment and reward^23,24, pre-commitments^25,26,27 and others^28,29,30,31. All these situations reflect some specific important aspects of real social and biological interactions. However, none of them can really provide a solution to the most basic version of the paradigmatic example of the Prisoner’s Dilemma (PD): here one imagines that two people that are suspected of having committed a joint crime are caught by the police and confined into different rooms, without the possibility to communicate. Each of them is offered the possibility to confess the crime and defect his partner in exchange for a reduced sentence. If only one defects, the other will get the full sentence. If they defect, both will have the sentence reduced. If they cooperate between themselves and do not confess, then they will immediately be freed. The situation can be exemplified in the following payoff matrix

$$\begin{array}{cc} & \begin{array}{cc}C & D\end{array}\\ \begin{array}{c}C\\ D\end{array} & (\begin{array}{ll}R & S\\ T & P\end{array})\end{array},$$

(1)

where C stands for “cooperation”, D for “defection”, and the entries denote the payoff for the row player. Thus, if they both cooperate, they get R points each (the “reward” for cooperation). If only one cooperates, then he gets S points (the “sucker’s” payoff), while the defector gets T (the “temptation” to defect). If they both defect, they get P points each (the “punishment” for defection).

As it is well-known, in the PD the payoffs satisfy T > R > P > S. In this situation the best option for each player is to defect, no matter what the other player does³². Thus both players defect and get P points each. However, this is less than the R points they would get if they had collaborated. This is the essence of the PD: mutual cooperation leads to a higher payoff than mutual defection, but it is not a “safe” strategy, exposing the player to exploitation by a defector, and therefore each player, in an attempt to maximize his own payoff, chooses to defect. Thus apparently there is no room for cooperation between such rational agents.

We remark also that in this one-shot formulation of the PD, the game is not repeated (and thus direct and indirect reciprocity do not apply), there is no link between the two prisoners (neither a genetic one as in kin selection, nor a “social” one, as in the cases of network reciprocity and group selection), nor is there a special tag that the prisoners can use (thus ruling out green beards and tag-based donation) and finally, the two prisoners cannot decide whether to play or not (and hence optional participation cannot be used). Therefore our best candidate mechanisms for the emergence of a cooperative strategy between the two prisoners do not apply in this paradigmatic example and we are left with the fundamental question: is there any other reason that can lead the two individuals to cooperate in this case?

A related unsatisfactory aspect in the formulation of evolutionary biology in terms of replicator-type dynamics with a fixed fitness landscape is the lack of adaptability: since the payoff structure is set from the beginning, e.g. (1) for the PD, and it completely determines the fitness of each strategy, the model is not flexible enough in order to incorporate changes in the payoffs which might happen over time. To overcome such difficulty, dynamical models for the coevolution of the existing species and their fitness landscape have been proposed^33,34,35,36. This subject is also fundamentally linked to the fact that evolution is similar to an optimization process: a population evolves in order to adapt as much as possible to some external conditions (which in turn may change in time). Some works from different perspectives have been recently proposed in this context^37,38,39. In particular, in³⁸ it has been argued that living systems adapt to the environment by performing an optimal control.

The aim of this work is to present a dynamical mechanism based on optimal control theory that has two main features: on the one side it generalizes the standard replicator-type dynamics to an evolution which can automatically adapt to changes in the fitness landscape by exerting a dynamical control on the fitness itself and on the other side it provides a simple and natural rule for the emergence of cooperation in the basic formulation of the PD presented above. Remarkably, this rule seems quite reasonable for the two prisoners as well as in biological and social interactions.

Methods

To analyze the PD from a dynamical point of view, we need to switch the perspective from game theory to evolutionary dynamics. Here the different strategies correspond to different types of individuals in a population competing by natural selection and the payoffs correspond to each type’s ability to reproduce, which is called the fitness. For simplicity, we consider a population with only two possible types (this is all we need in order to address the PD; including more types is straightforward in principle, but the calculations get more involved very quickly). The fundamental equation of evolutionary dynamics is the Replicator Equation (RE)

$${\dot{x}}_{a}={x}_{a}({f}_{a}({\bf{x}})-\langle {\bf{f}}\rangle )\,\quad a=1,2,$$

(2)

where x_a is the relative abundance (frequency) of individuals of type a, f_a(x) is the (frequency-dependent) fitness of type a, $\langle {\bf{f}}\,\rangle ={x}_{1}\,{f}_{1}+{x}_{2}\,{f}_{2}$ is the average fitness of the population and the overdot denotes time derivative.

For the PD, there are only two possible strategies: C or D. Thus we can label the frequency of the population adopting each strategy as x_C and x_D respectively. Moreover, the fitness of each strategy is obtained from the payoff matrix (1) by combining each payoff with the probability that the opponent chooses the corresponding strategy, thus obtaining

$${f}_{C}({\bf{x}})=R{x}_{C}+S{x}_{D}\,,\quad {f}_{D}({\bf{x}})=T{x}_{C}+P{x}_{D}.$$

(3)

Without loss of generality, we assume S = 0. Using the fact that x_C + x_D = 1, after some algebra we can rewrite the RE and the fitnesses in terms of x_C only, obtaining the following evolution equation for the frequency of cooperators

$${\dot{x}}_{C}=-{x}_{C}\mathrm{(1}-{x}_{C})[(T-R){x}_{C}+P\mathrm{(1}-{x}_{C})],$$

(4)

From which it is easy to deduce that there are only two fixed points x_C = 0,1, and that only x_C = 0 is stable, that is, cooperators are dominated by defectors (cf. Fig. 1).

Thus both game theory and evolutionary dynamics agree on the fact that the best strategy in the PD is to defect. Remarkably, the standard RE always favours defectors over cooperators whenever T > R > P > S, independently of their relative values. Notwithstanding, one would expect that in real situations there should be a difference between e.g. the case T = 100, R = 10, P = 9, S = 0 and the case T = 5, R = 4, P = 1, S = 0. Notice also that in this classical treatment of the PD, the average fitness of the population decreases over time (see Fig. 2), meaning that the optimal strategy (defection) is optimal only with respect to the local payoff of each player, but it is not optimal with respect to the global payoff of the entire population. This is again the essence of the PD: by trying to maximize only their individual fitness, the prisoners do not achieve the best available fitness. Interestingly, this is unstable whenever the population is not isolated, but it is under evolutionary pressure by some other population, meaning that whenever selection acts both at the level of single individuals in a given population and at the level of competing populations, a strategy considering only the former aspect is destined to be suppressed by one considering both features (this is the main reason why group selection achieves cooperation^9,19).

Let us now use this observation in order to construct a mechanism that boosts the emergence of collaboration among competing individuals. Motivated by the fact that being part of a society provides in many situations a concrete evolutionary advantage for each organism, we assume that natural evolution acts by two mechanisms: on the one side it selects those individuals inside a population with higher fitness at any given time, and on the other side it selects those populations with the higher final average fitness (we can see this process as due to the existence of two time scales, a faster one, in which selection operates at the level of individuals in a population, and a slower scale, in which selection operates among populations). We require the evolution in the faster scale to be dictated locally in time by the standard RE. Finally, in order to obtain a model sensitive to changes in the environment, we suppose that the fitness of each strategy can be regulated in response to the present state of the population (this is similar to the coevolution mechanisms proposed e.g. in refs^33,34).

The above assumptions can be translated into the mathematical problem of maximizing a functional which consists of two terms (one corresponding to selection at the level of individuals, and one corresponding to selection at the level of populations) subject to a dynamical constraint set by the standard RE. In order to find the best strategy to realize this task, we need to solve a maximization problem (similar ideas have been put forward in^37,38,39). This is the arena of Optimal Control Theory (OCT)^40,41,42,43. By applying OCT we obtain (see Supplementary Information) an enlarged system of equations that extends the RE (2) and describes the dynamical equations for the coevolution of the frequencies x in the RE and the fitnesses f. For a = C, D, as in the case of the PD, this system reads

$${f}_{a}=\frac{1}{2}\,{p}_{a},$$

(5)

$${\dot{x}}_{a}=\frac{{x}_{a}}{2}({p}_{a}-\langle {\bf{p}}\rangle ),$$

(6)

$${\dot{p}}_{a}=\frac{{p}_{a}}{2}(\langle {\bf{p}}\rangle -\frac{{p}_{a}}{2})\mathrm{.}$$

(7)

The additional variables p_C and p_D are usually called the co-states (see Supplementary Information). We call the system (5)–(7), together with the initial conditions ${x}_{a}\mathrm{(0)}={x}_{a}^{0}$ and the terminal conditions

$${p}_{a}(\tau )=\frac{\partial g({\bf{x}}(\tau ))}{\partial {x}_{a}},\quad \quad g({\bf{x}})\,:=\langle {\bf{f}}\rangle $$

(8)

the Optimal Replicator Equation (ORE). In the following we show that the ORE leads directly to a simple rule for the emergence of cooperation in the PD.

Results

Let us consider now the PD in terms of the ORE. As usual, we use x_C + x_D = 1 to simplify (6),(7) by eliminating the equation for x_D. Indeed, one can use ${\dot{x}}_{D}=-{\dot{x}}_{C}$ to rewrite the two equations in (6) as a single equation for x_C, which reads

$${\dot{x}}_{C}={x}_{C}\mathrm{(1}-{x}_{C})\frac{{p}_{C}-{p}_{D}}{2}.$$

(9)

Thus we see that we have obtained already a simple condition for the emergence of cooperation, that is, ${p}_{C}-{p}_{D} > 0$. Since by the optimal control strategy (5), p_C and p_D correspond to the fitness of cooperators and defectors respectively, the above condition seems almost trivial at first sight: cooperation emerges whenever f_C > f_D. Nevertheless, this condition is not trivial in this case, at least for two reasons: firstly, it is obtained as the result of an optimal strategy; secondly, and most importantly, because the two variables p_C and p_D are dynamical, with dynamical equations (7) and terminal conditions determined by the choice of the final average fitness according to (8). This aspect is crucial in the solution of the PD.

Let us return to the case of our two prisoners, with payoff matrix as in (1). Now we consider the payoffs in (1) as a final payoff given at time t = τ and use the ORE with the final fitness for each strategy given by (3). As before, we assume S = 0, thus the final average fitness is

$$g({\bf{x}}(\tau ))=R{x}_{C}{(\tau )}^{2}+T{x}_{C}(\tau ){x}_{D}(\tau )+P{x}_{D}{(\tau )}^{2},$$

(10)

and the terminal conditions (8) read

$${p}_{C}(\tau )=(2R-T){x}_{C}(\tau )+T,$$

(11)

$${p}_{D}(\tau )=(T-2P){x}_{C}(\tau )+2P.$$

(12)

the system of equations (7) and (9), together with the initial conditions ${x}_{C}\mathrm{(0)}={x}_{C}^{0}$ and ${x}_{D}\mathrm{(0})=1-{x}_{C}^{0}$ and the terminal conditions (11),(12) is our dynamical model for the PD. Using (9) and (11),(12) one can prove that for

$$2P\le T\le 2R,$$

(13)

the right hand side of (9) at t = τ is always greater than or equal to zero, with equality only for x_C = 0, 1. This means that this evolution admits only two equilibria, namely x_C = 0,1 and that only x_C = 1 is asymptotically stable. Therefore (13) is the condition for the emergence of cooperation in the PD using the ORE. Typical numerical solutions are given in Fig. 1. As we see, contrary to the standard RE, according to the ORE cooperators take over the population.

Another difference with respect to the standard RE is that the average fitness of the population increases with the ORE: while the standard evolution dictated by the RE predicts that the average fitness of the population decreases, approaching the payoff of mutual defection (see Fig. 2), the ORE predicts that selfish individuals cooperate in order to maximize the final average fitness of the population, because this entails an advantage for themselves.

The reason for the cooperative behavior in the ORE is simple: using the ORE we have extended the RE to a dynamics which considers the important evolutionary advantage for each individual deriving from being in a population that has a large final average fitness. Knowing that in case of collaboration they will be able to share an important final payoff, naturally induces the two prisoners to collaborate. Interestingly, collaboration only appears whenever R is “high enough”, that is, whenever 2R ≥ T, and also whenever P is “low enough”, that is, whenever 2P ≥ T. In both cases the prisoners do not find advantageous to defect, and prefer to take the risk to be exposed to exploitation rather than taking a lower payoff. They consider that in such cases the risk is worth the price (cf.⁴⁴).

In biology we can safely assume that in many situations being part of a society guarantees a much higher probability of survival, for instance because it provides better strategies for the collection of food, or for reproduction, or for defense against predators. In this biological setting, we argue that the effect of cooperation can only be assessed at the level of the average fitness of the whole population. So, even if a tendency to cooperate might be inherited, its evolutionary advantage can only be evaluated collectively a posteriori and therefore cannot be included in a local term for the RE, but rather as a final condition. A striking example is provided by bees: in bees society, usually only queens can reproduce. Therefore an evolution based only on the ability to reproduce would lead very quickly to the disappearance of any worker bee. However, workers play an important role for finding food and defending the hive. Cooperation between queens and workers means that the former guarantee reproduction for the latter, while the latter work for the former. This leads to a huge final reward, that is, the conservation of the species after each generation.

Finally, let us stress that while in the case of the PD the two prisoners can be aware of the final reward for collaboration and therefore they can (consciously) decide to cooperate under the appropriate conditions, in a biological setting we can no longer give a similar interpretation. However, as it is usual in evolutionary dynamics, we suppose that the different individuals in the population inherit the possible traits randomly and that evolution favours only those individuals with the best traits, so that the optimization process is a consequence of natural selection and not a conscious decision in such case.

Discussion

To summarize, we have proposed a modified version of the Replicator Equation (RE), the equation governing natural selection, called the Optimal Replicator Equation (ORE), which stems from the assumption that evolution is an optimization process that on the one side selects at any given time those individuals with higher fitness and on the other side favours those populations with higher average fitness. The main motivations for the introduction of such model are the facts that the standard RE cannot account for selection on the two levels of individuals and populations and that it fails to reproduce observed situations, such as the emergence of cooperation in the Prisoner’s Dilemma. Interestingly, by implementing our model (which by definition takes into account the two levels of selection among individuals and populations) to the case of the PD, we have shown that the corresponding dynamics naturally favours cooperation in the case of the basic Prisoner’s Dilemma under some reasonable conditions (cf. (¹³)). Our results thus open the door for an investigation of evolution and social dilemmas in terms of optimization by using the reproduction coefficients – i.e. the fitness – as control parameters. In particular, it would be interesting for future work to compare the condition for the emergence of cooperation obtained here with data from various experiments on the PD and with conditions derived from similar models^44,45,46. Moreover, one can enlarge the study of the optimal strategies deriving from the ORE by considering different social dilemmas beside the PD. We expect to find results on the emergence of cooperation similar to the ones presented here, thus enforcing the idea that the ORE can be a good dynamical model for explaining the emergence of cooperation in a competitive framework.

References

Hamilton, W. D. The genetical evolution of social behaviour. II. Journal of theoretical biology 7(1), 17–52 (1964).
Article CAS PubMed Google Scholar
Trivers, R. L. The evolution of reciprocal altruism. The Quarterly review of biology 46(1), 35–57 (1971).
Article Google Scholar
Axelrod, R. The emergence of cooperation among egoists. American political science review 75(2), 306–318 (1981).
Article Google Scholar
Nowak, M. A. & Sigmund, K. Tit for tat in heterogenous populations. Nature 355(6357), 250 (1992).
Article ADS Google Scholar
Riolo, R. L., Michael, D. C. & Axelrod, R. Evolution of cooperation without reciprocity. Nature 414(6862), 441 (2001).
Article ADS CAS PubMed Google Scholar
West, S. A., Pen, I. & Griffin, A. S. Cooperation and competition between relatives. Science 296(5565), 72–75 (2002).
Article ADS CAS PubMed Google Scholar
Nowak, M. A. et al. Emergence of cooperation and evolutionary stability in finite populations. Nature 428(6983), 646 (2004).
Article ADS CAS PubMed Google Scholar
Nowak, M A. & Sigmund, K. Evolution of indirect reciprocity. (2005).
Traulsen, A. & Martin, A. N. Evolution of cooperation by multilevel selection. Proceedings of the National Academy of Sciences 103(29), 10952–10955 (2006).
Article ADS CAS Google Scholar
Axelrod, R M. The evolution of cooperation: revised edition. Basic books (2006).
Nowak, M. A. Five rules for the evolution of cooperation. science 314(5805), 1560–1563 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Taylor, C. & Martin, A. N. Transforming the dilemma. Evolution 61(10), 2281–2292 (2007).
Article PubMed PubMed Central Google Scholar
Hagel, K. et al. Which risk scenarios can drive the emergence of costly cooperation? Scientific reports 6, 19269 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Milinski, M. Tit for tat in sticklebacks and the evolution of cooperation. nature 325(6103), 433–435 (1987).
Article ADS CAS PubMed Google Scholar
Nowak, M. A. & Sigmund, K. Evolution of indirect reciprocity by image scoring/the dynamics of indirect reciprocity (1998).
Manfred, M., Semmann, D. & Krambeck, H.-J. Reputation helps solve the ‘tragedy of the commons’. Nature 415(6870), 424–426 (2002).
Article ADS Google Scholar
Nowak, M. A. & Robert, M. M. Evolutionary games and spatial chaos. Nature 359(6398), 826–829 (1992).
Article ADS Google Scholar
Ohtsuki, H. et al. A simple rule for the evolution of cooperation on graphs. Nature 441(7092), 502 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Perc, M. et al. Evolutionary dynamics of group interactions on structured populations: a review. Journal of the royal society interface 10(80), 20120997 (2013).
Article PubMed Central Google Scholar
Jansen, V. A. A. & Baalen, M. V. Altruism through beard chromodynamics. Nature 440(7084), 663 (2006).
Article ADS CAS PubMed Google Scholar
Hauert, C. et al. Volunteering as red queen mechanism for cooperation in public goods games. Science 296(5570), 1129–1132 (2002).
Article ADS CAS PubMed Google Scholar
Ghang, W. & Martin, A. N. Indirect reciprocity with optional interactions. Journal of theoretical biology 365, 1–11 (2015).
Article MathSciNet PubMed MATH Google Scholar
Szolnoki, A & Perc, M. Antisocial pool rewarding does not deter public cooperation. Proc. R. Soc. B. Vol. 282. No. 1816. The Royal Society (2015).
Chen, X., Szolnoki, A. & Perc, M. Competition and cooperation among different punishing strategies in the spatial public goods game. Physical Review E 92(1), 012819 (2015).
Article ADS MathSciNet Google Scholar
Han, T. A. et al. Good agreements make good friends. Scientific reports 3 (2013).
Pereira, L. M. & Lenaerts, T. Avoiding or restricting defectors in public goods games? Journal of the Royal Society Interface 12(103), 20141203 (2015).
PubMed Central Google Scholar
Sasaki, T. et al. Commitment to cooperation and peer punishment: Its evolution. Games 6(4), 574–587 (2015).
Article MathSciNet Google Scholar
Chen, X. et al. First carrot, then stick: how the adaptive hybridization of incentives promotes cooperation. Journal of The Royal Society Interface 12(102), 20140935 (2015).
Article PubMed Central Google Scholar
Szolnoki, A. & Chen, X. Benefits of tolerance in public goods games. Physical Review E 92(4), 042813 (2015).
Article ADS Google Scholar
Szolnoki, A. & Chen, X. Cooperation driven by success-driven group formation. Physical Review E 94(4), 042311 (2016).
Article ADS PubMed Google Scholar
Chen, X. & Szolnoki, A. Individual wealth-based selection supports cooperation in spatial public goods games. Scientific reports 6 (2016).
Nowak, M. A. Evolutionary dynamics. Harvard University Press (2006).
Nilsson, M. & Snoad, N. Error thresholds for quasispecies on dynamic fitness landscapes. Physical Review Letters 84(1), 191 (2000).
Article ADS CAS PubMed Google Scholar
Wilke, C. O., Ronnewinkel, C. & Martinetz, T. Dynamic fitness landscapes in molecular evolution. Physics Reports 349(5), 395–446 (2001).
Article ADS MathSciNet CAS MATH Google Scholar
Klimek, P., Thurner, S. & Hanel, R. Evolutionary dynamics from a variational principle. Physical Review E 82(1), 011901 (2010).
Article ADS MathSciNet Google Scholar
Karev, G. P. On mathematical theory of selection: continuous time population dynamics. Journal of mathematical biology 60(1), 107–129 (2010).
Article MathSciNet PubMed MATH Google Scholar
Traulsen, A., Iwasa, Y. & Martin, A. N. The fastest evolutionary trajectory. Journal of theoretical biology 249(3), 617–623 (2007).
Article MathSciNet PubMed PubMed Central Google Scholar
Chakrabarti, R. et al. Mutagenic evidence for the optimal control of evolutionary dynamics. Physical review letters 100(25), 258103 (2008).
Article ADS PubMed Google Scholar
Saakian, D. B., Makar, H. G. & Hu, C.-K. Punctuated equilibrium and shock waves in molecular models of biological evolution. Physical Review E 90(2), 022712 (2014).
Article ADS Google Scholar
Geering, H P. Optimal control with engineering applications. Berlin Heidelberg (2007).
Lenhart, S. & J. T. Workman Optimal control applied to biological models. Crc Press, 2007.
Evans, L. C. An introduction to mathematical optimal control theory. Lecture Notes, University of California, Department of Mathematics, Berkeley (2005).
Fleming, W. H. and Raymond W. Rishel. Deterministic and stochastic optimal control. Vol. 1. Springer Science & Business Media, 2012.
Engel, C. & Zhurakhovska, L. When is the risk of cooperation worth taking? The prisoner’s dilemma as a game of multiple motives. Applied Economics Letters 23(16), 1157–1161 (2016).
Article Google Scholar
Capraro, V. A model of human cooperation in social dilemmas. PLoS One 8(8), e72427 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Capraro, V., Jillian J. J. & David G. R. Heuristics guide the implementation of social preferences in one-shot Prisoner’s Dilemma experiments. Scientific reports 4 (2014).

Download references

Acknowledgements

The authors would like to thank Diego Tapias and Cecilia Salinas for insightful discussions. AB is funded by a DGAPA–UNAM postdoctoral fellowship. PP is grateful to the Fitzwilliam College for hospitality during his sabbatical leave.

Author information

Alessandro Bravetti and Pablo Padilla contributed equally to this work.

Authors and Affiliations

Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, México City, 04510, Mexico
Alessandro Bravetti & Pablo Padilla
Fitzwilliam College, University of Cambridge, Storey’s Way, CB3 ODG, UK
Pablo Padilla

Authors

Alessandro Bravetti
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Padilla
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.B. and P.P. contributed to all aspects of this work.

Corresponding author

Correspondence to Alessandro Bravetti.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bravetti, A., Padilla, P. An optimal strategy to solve the Prisoner’s Dilemma. Sci Rep 8, 1948 (2018). https://doi.org/10.1038/s41598-018-20426-w

Download citation

Received: 04 September 2017
Accepted: 06 November 2017
Published: 31 January 2018
DOI: https://doi.org/10.1038/s41598-018-20426-w

This article is cited by

Explaining human altruism
- Michael Vlerick
Synthese (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.