Rapid environmental change in games: complications and counter-intuitive outcomes

Human-induced rapid environmental change (HIREC) has recently led to alterations in the fitness and behavior of many organisms. Game theory is an important tool of behavioral ecology for analyzing evolutionary situations involving multiple individuals. However, game theory bypasses the details by which behavioral phenotypes are determined, taking the functional perspective straight from expected payoffs to predicted frequencies of behaviors. In contrast with optimization approaches, we identify that to use existing game theoretic models to predict HIREC effects, additional mechanistic details (or assumptions) will often be required. We illustrate this in relation to the hawk-dove game by showing that three different mechanisms, each of which support the same ESS prior to HIREC (fixed polymorphism, probabilistic choice, or cue dependency), can have a substantial effect on behavior (and success) following HIREC. Surprisingly, an increase in the value of resources can lead to a reduction in payoffs (and vice versa), both in the immediate- and long-term following HIREC. An increase in expected costs also increases expected payoffs. Along with these counter-intuitive findings, this work shows that simply understanding the behavioral payoffs of existing games is insufficient to make predictions about the effects of HIREC.

the approach that we adopt assumes that individuals exhibit behavioral plasticity guided by previously adaptive cue-response systems, but that the pattern of plasticity can be maladaptive due to evolutionary mismatch.
While the above approach allows decision-making of an individual to be analyzed both before and after HIREC, it ignores the game theory aspect (where expected payoffs depend on the actions of others) that can be important for many interactive (social or ecological) situations. In the HIREC context, game-theoretic models differ from the non-game approach outlined above in a key, under-appreciated manner. Rather than specifying an underlying mechanism (e.g., cue-response system) by which behavior is chosen, game theory typically specifies the expected payoffs (dependent on the behavior of the focal actor and the actions of others). These payoffs are then used to predict the frequency of each type of behavior, assuming that each individual is using a strategy that maximises expected payoff. Although this game-theoretic approach allows the evolutionarily stable strategies (ESSs) to be calculated 11 , we identify that in many cases it will not allow the immediate effect of an environmental change to be predicted, because the decision mechanism is not specified. In the HIREC context, for mixed evolutionarily stable strategies, it is critically important whether the population consists of, for instance: (1) an ESS proportion of individuals with fixed strategies, or (2) individuals that each play the previously adaptive ESS proportion of behaviors, or (3) individuals that use a cue-based, optimally plastic behavioral mechanism to attain the ESS. We show this in relation to the simple and well-studied hawk-dove game 12 where playing hawk means being aggressive, while playing dove is docile. Although the hawk-dove game is clearly an abstraction of real biological systems, the three situations listed above represent useful simplifications of reality. With regard to situation 1 (with fixed hawks and doves), while real animals typically exhibit behavioral plasticity, the animal personality literature shows that some animals are indeed consistently more aggressive while others are consistently more docile. For example, Kokko et al. 13 show that the hawk-dove game can explain a colour polymorphism in a particular bird species 14 , and Lichtenstein & Pruitt 15 identify that 'docile' and 'aggressive' personality types can co-exist in social spiders (discussed later). In relation to situation 2 (individuals choose options probabilistically), a growing literature emphasizes the existence and importance of intra-individual variability, the tendency (in many contexts, not just aggressive contests), for individuals to exhibit unpredictable, apparently random variation in behavior 16 . Finally, situation 3 represents the standard behavioral ecology assumption that animals adjust their behavior depending on cues and social context. Here, we ask how these scenarios differ in their response to HIREC.
In this game context, we find that strikingly counter-intuitive results emerge, but for sensible reasons. First, if all individuals in a population exhibit optimal plasticity (i.e., if they adjust their strategy to fit the new ESS after HIREC), they can have lower mean payoffs than populations that exhibit less plastic strategies that do not adjust to match the new ESS. In addition, in a game context, when individuals adjust optimally to HIREC, changes that increase benefits (i.e. resource value) can decrease expected payoffs to all individuals. Conversely, HIREC that increases contest costs can increase expected payoffs. Below, we show when these counter-intuitive phenomena do or do not occur, and explain why they occur.

Analysis
The hawk-dove game. The hawk-dove game (also known in other contexts as Chicken, or the snowdrift game) assumes that when competing for a resource, some individuals play the 'hawk' strategy (of aggressively competing for the resource) whilst others play 'dove' (taking the resource if it is not contested; so, on average, splitting the resource with other doves). When a hawk meets a dove, the hawk gains the resource (assumed to be of value V); however, when a hawk meets another hawk, there is assumed to be a cost, C, paid during the ensuing competition. The expected payoffs are shown in Table 1.
Assuming that C > V (e.g., if competition is likely to result in serious injury), then in a population that consisted only of hawks, it would be better to play dove. Similarly, in a population consisting only of doves, a hawk would do extremely well. Thus there is an optimal frequency at which the strategies should occur; this evolutionarily stable strategy (ESS) occurs where the two strategies have the same expected gain in reproductive value. Denoting the probability of meeting a hawk by h, the expected payoff to a hawk is: (1 − h)V + h(V − C)/2, and the expected payoff to a dove is: (1 − h)V/2. Setting these equal provides the probability of hawk, h = V/C, which is the ESS when V < C 12 .
Consequences of mechanisms which determine phenotype post HIREC. It is well-known that the balance of frequencies between behaviors in the hawk-dove game could come about in numerous ways; e.g., via a genetic polymorphism where each individual is either a highly aggressive hawk or an unaggressive dove (i.e., with an ESS proportion being born as hawks), or through each individual being behaviorally variable 17 . However, the effect of these possibilities may be very different following HIREC. Assume for instance that before HIREC, the reward for an uncontested resource was V = 1, and the cost of a contest was C = 2. This means that half the individuals should behave as hawks (before the environmental change), with an expected payoff (to any individual) of 0.25 before HIREC (the calculations supporting this value and others in this text are detailed in Supplementary Information 1). We now consider the effect of HIREC decreasing the value of rewards from V = 1 to 0.5. This could happen, for example, if human disturbance reduces the quality of a contested territory or food resource in ways that are not transparent to the competitors. What would be the effect?

Hawk Dove
Hawk www.nature.com/scientificreports www.nature.com/scientificreports/ If the behavioral differences were the result of genetic polymorphisms then, immediately after HIREC (i.e., before the frequency of hawks and doves adjusts), the hawks would get a negative payoff following HIREC, of −0.125, whereas the doves would still get a positive payoff, of 0.125. Consequently, only the doves would, on average, gain reproductive value in interactions.
If, instead, the same balance (of h = 0.5) had ancestrally come about by each individual randomizing their choice in each encounter then, following HIREC, each individual would get the same mean payoff of zero.
A third possibility is that individuals may have evolved in circumstances where their likelihood of acting as hawk had evolved in relation to the strength of food (or habitat) cue that they received (i.e., letting h for each encounter depend on the value V). Under these circumstances, the individuals would display a reaction norm to the food rewards, increasing the probability of playing hawk as the perceived reward increased. Under such circumstances, the optimal reaction norm is to play hawk with probability h = V/C, as described in the previous section; this would still achieve the same expected payoff pre-HIREC (of 0.25) but, immediately after HIREC, they would adopt the best possible behavior of now playing hawk on only 25% of occasions, resulting in a positive expected payoff of 0.1875. Although the rate of reproductive success would have decreased due to HIREC, each member of the population would still tend to accrue reproductive value with numerous interactions -and would do even better than the doves in the genetic polymorphism case (discussed above, where hawks did very badly after HIREC). Figure 1 shows the effect of modifying V whilst C remains fixed, for each of the three mechanisms of choice. Although the reaction norm is the ESS response to the conditions by each individual in the population, and does best if the environmental change decreases V, if the value of resources were to increase from the baseline value (of 1), the reaction norm would result in a lower expected payoff than under the other strategies.
The outcome of this plastic response (where they shift their behavior adaptively in response to HIREC changes in V) is perhaps surprising. The expected payoff to doves is (1 − h)V/2, and the ESS dictates that h = V/C, with both hawks and doves getting the same expected payoff of (1 − V/C)V/2. Differentiating this payoff with respect to V gives (C − 2V)/(2C), which is positive for V < C/2 and negative for V > C/2. Therefore, if V starts below C/2, then small changes in V have intuitive effects on expected payoffs, but if V was greater than C/2 before HIREC, then altering V will have a counter-intuitive effect: increasing resource value will lead to a reduction in expected payoffs and vice versa. The reason is that increased resource value results in a higher tendency to play hawk, resulting in more fights and a reduction in expected payoff. Conversely, reduced resource value reduces the tendency to play hawk, resulting in fewer fights and an increase in expected payoff. Figure 2 shows that if HIREC increases the cost of conflict, C, then the expected payoff increases under the ESS (reaction norm) strategy. This result is general; as discussed above, under this strategy, both hawks and doves get the same expected payoff of (1 − V/C)V/2. Differentiating this payoff with respect to C gives V 2 /(2C 2 ), which is always positive. Thus, as the cost of conflict, C, increases, the optimal payoff increases (no matter what the original value of V or C).
In many intra-specific contexts, the mechanism must be specified (sometimes at several additional layers below that of the game) to carry out the analysis. This requirement to specify the mechanism of behavioral choice (as well as the functional payoffs), that we have shown in relation to the hawk-dove game, applies across a great many games. In Supplementary Information 2, we show that the mechanism would also need to be specified to www.nature.com/scientificreports www.nature.com/scientificreports/ consider the effects of HIREC in a producer-scrounger game 18 . The mechanism supporting the old ESS behavior (e.g., whether traits are fixed, probabilistically selected or dependent on cues) will affect subsequent population dynamics and timescales to reach a new equilibrium. The hawk-dove and producer-scrounger games have negative frequency dependence in payoffs: hawks do better than doves when there are more doves than the ESS (and vice versa); similarly with producers and scroungers. This is not a necessary condition though; we also identify in Supplementary Information 2 that a mechanism of decision would be required to predict outcomes following HIREC in an iterated Prisoner's Dilemma game, where negative frequency-dependence does not apply (cooperators don't do better when there are more defectors).

Discussion
We have shown that altering the value of resources can have counter-intuitive effects. When individuals use a flexible rule that conforms to the ESS for the current conditions (i.e., the reaction norm, that shifts with conditions), then a decrease in resource value (e.g., a reduction in food magnitude) can increase expected payoffs, and vice versa. This effect occurs when the value of resources was previously at least half the expected cost of competition. Furthermore, increasing the cost of conflict in the hawk-dove game always increases the expected payoffs to individuals under the ESS strategy. In each case, the counter-intuitive results occur in populations using a reaction norm that follows the ESS, and the fact that this decrease in fitness occurs with increased resources (or decreased costs) can seem particularly counter-intuitive in the light that it occurs for individuals that would not be able to improve their strategy, even if they were able to choose a different strategy. This occurs because we are assuming that all individuals are using the same strategy; just like the tragedy of the commons 19 , if only one individual were to adapt their behavior to the current conditions then that individual would do better than the others. But because all individuals are using the same strategy, an increased direct benefit (of the resource value increasing) can have indirect costs (through more individuals choosing the hawk behavior) that outweigh the direct benefits.
Why does the expected payoff increase and then decrease with as the value of the resource, V, increases? In cases where the resource value is low, relatively few individuals are behaving as hawks, and increasing V increases the direct benefits more than the indirect costs. This is because the probability of meeting a hawk increases linearly with V (for V between 0 and the cost of a contest, C). However, when V > C/2, more than half the population are already hawks, so an increase in V means that the corresponding linear increase in the number of hawks imposes more indirect costs than the direct benefits of the increase. In the case of increasing C, the indirect benefits always outweigh the direct costs.
The so-called 'paradox of enrichment 20 ' shows that increases in resources can increase the risk of a population going extinct -but that paradox is caused by the increase making the system unstable; in contrast, our result of decreasing payoffs as resources increase is a point of Nash-equilibrium. Counter-intuitive effects in games are not uncommon-which is one of the reasons that models of game situations are so useful. For instance, Braess's paradox 21 and the well-known Prisoner's Dilemma provide examples where the Nash equilibrium that is reached by individuals does not achieve the maximum possible payoff to individuals. Nevertheless, the ease with which these effects occur in a situation as simple as the hawk-dove game suggests that they may not be uncommon in real situations. Furthermore, the different proximate details (about how the behavioral ESS is supported) show that the details of mechanisms can even result in differences over whether payoffs will increase or decrease as the value of resources, or the cost of conflict, increase.
The extent to which such situations occur in the real world is an interesting question. If group-housed quail are individually (artificially) selected according to how many eggs they produce, then the resulting new groups do significantly less well than when whole groups are selected, because the individual selection has also selected for aggression 22 . We would argue that this is somewhat like selecting for hawks (in the case of individual selection) www.nature.com/scientificreports www.nature.com/scientificreports/ and doves (in the case of group selection). Muir 22 also identifies that such a process may apply in forest trees (which also compete for resources), and similar findings may apply in various farm-kept species (such as chickens and pigs) 23 . Much like the results with quail, Lichtenstein & Pruitt 15 found that in social spiders, housing numerous 'aggressive' spiders together resulted in them putting on less weight than when mixed with 'docile' spiders (these personality types were found to be stable over time). Moreover, the level of aggression appears to be lower than would be predicted by the standard hawk-dove game (for short-term individual maximisation); this result may relate to longer-term inclusive fitness effects (i.e., of colonies, rather than individuals).
We have also shown that to calculate the immediate effects of a change in a game-theoretic situation, additional assumptions will typically need to be introduced (on top of the assumptions of the original model and those of the environmental change), to know the degree of adaptive plasticity in the evolved organism. This contrasts with recent work on HIREC scenarios that do not involve game theory, which have used existing optimality models to set behavioral parameters before identifying how the system would respond following an environmental change (e.g. 6,7 , with the general approach specified in 10 ).
In both game-theoretic and non-game scenarios, the mechanism being used pre-HIREC is fundamental to what will occur following HIREC -but the mechanism is often not specified (or known) in game theoretic scenarios. Simply knowing the pre-HIREC evolutionarily stable balance does not allow us to identify outcomes following HIREC without additional assumptions (or inferences) about the proximate drivers which maintained the pre-HIREC behavioral choices (this is even without the complications of learning and feedbacks -for instance of hunger potentially driving more hawk-like behavior -which would seem bound to further complicate what would occur following HIREC). There is a simple answer about mechanism in some circumstances though: when different species are involved, it can often be assumed that the differences are species-specific. For instance 24 , puts forward a model for how different bee species may respond to HIREC, in relation to competition for nesting space, where larger individuals have a competitive advantage. Because this is done on the basis of the individuals being of different species (like having genetic hawks and doves, rather than individuals choosing which type to play), the mechanism of difference -and thus decision -is clear.
The effect applies in both negative-and positive-frequency dependent cases. There are other games which are not frequency-dependent in either way. The war of attrition 11 is one well-known case, where the ESS is a mixed strategy for how long to persist in competition. It is easy to see in such a game also, the effect of HIREC will depend on additional assumptions that relate not to how the organism has evolved to respond to environmental cues. If an organism has evolved a strategy that does not pay attention to the payoff size (e.g., in a world where payoffs are relatively static) then their immediate behavior following HIREC will not be affected by the reward sizes having altered. If, instead, their strategy is dependent on the reward magnitude, then their response will immediately shift with the environmental change. It is also worth noting that in the war-of-attrition, under the ESS strategy (following the optimal reaction norm to food value), HIREC would result in no change to expected payoff, of zero.
We have analysed cases as though the mechanisms of choice are thoroughly distinct but behaviors will typically be partially genetic, partially learned/ontogenetic, and partially cue-based 25 . The long-term outcomes in each case are similar (still governed by the functional payoffs of the game) if we assume that evolution will optimize the behavior (the 'behavioral gambit 26 '), but the behavioral adjustments and success of species following an environmental change can be quite different. The behavioral outcome immediately after an environmental change will depend on the evolutionary history of problems faced by the focal species and the range of responses they have evolved to cope with these problems. This raises an important question of the extent to which we can make reliable predictions about HIREC effects if we only know the balance between behavioral types, rather than the driving causes of those behaviors. Game theory is useful for understanding evolved (ESS) behaviors in an ancestral environment. We have shown that knowing only the probabilities of behaviors, without additional knowledge (or assumptions) of their causal mechanism, it is not possible to identify the effect of HIREC even in the initial interactions that immediately follow the environmental change. This even applies to whether environmental changes in simple scenarios (such as the hawk-dove game) will have positive or negative effects on populations. This in turn shows that the functional perspective that is taken in game theory is not sufficient to make predictions about the immediate effects of HIREC. Thus, to make worthwhile predictions about the effects of HIREC, researchers will also need to focus on the proximate pillars identified by Tinbergen 27 , of mechanism and ontogeny.