Spite is contagious in dynamic networks

Fulker, Zachary; Forber, Patrick; Smead, Rory; Riedl, Christoph

doi:10.1038/s41467-020-20436-1

Article
Open access
Published: 11 January 2021

Nature Communications volume 12, Article number: 260 (2021)

Abstract

Spite, costly behavior that harms others, presents an evolutionary puzzle: given that both the actor and recipient do worse, how could it emerge? We show that dynamically evolving interaction networks provide a novel explanation for the evolution of costly harm. Previous work has shown that anti-correlated interaction (e.g., negative assortment or negative relatedness) among behavioral strategies in populations can lead to the evolution of costly harm. We show that these approaches are blind to important features of interaction brought about by a co-evolution of network and behavior and that these features enable the emergence of spite. We analyze a new model in which agents can inflict harm on others at a cost to themselves, and simultaneously learn how to behave and with whom to interact. We find spite emerges reliably under a wide range of conditions. Our model reveals that when interactions occur in dynamic networks the population can exhibit correlated and anti-correlated behavioral interactions simultaneously, something not possible in standard models. In dynamic networks spite evolves due to transient and partial anti-correlated interaction, even when other behaviors are positively correlated and average degree of correlated interaction in the population is low.

Introduction

Costly behavior that harms others, sometimes known as spite^1,2,3, is among the most basic of anti-social behaviors. However, it presents an evolutionary puzzle. If there are no benefits, how could it have emerged? There nevertheless are cases of costly harm in both humans^4,5,6 and non-humans^7,8,9,10. For instance, in human evolution there is a case to be made that the killing of individuals who exhibit high degrees of reactive aggression, clearly a dangerous harmful endeavor, played an important role in the self-domestication of human ancestors¹¹. This has prompted a number of evolutionary models to explain the emergence of such costly harm. We can represent this sort of interaction abstractly with a simple game. Suppose two individuals interact and each has the opportunity to pay a cost to inflict a harm on the other. Let b be the benefit derived from normal social interaction and let −c be the cost an agent can pay to take away that baseline benefit. This game is the Prisoner’s Delight¹², which contrasts with the Prisoner’s Dilemma in that acting anti-socially, rather than pro-socially, is costly (Fig. 1). In this game, unlike in the classic dilemma, harmful behavior is strictly dominated and generally we should not expect it to evolve.

**Fig. 1: Game play and updating mechanism.**

Standard models assume large randomly mixing populations and, in such models, spiteful behavior is invariably eliminated. Allowing non-random interaction due to assortment of strategies opens new possibilities. It is well known that costly altruism can evolve with non-random interactions. In particular, altruism can evolve if interactions are correlated, that is, if altruists interact disproportionately with other altruists^13,14,15. A similar, but inverse, result has been shown regarding spite¹⁶. If a given strategy or type interacts disproportionately often with different types (that is, if interactions are anti-correlated), spite can emerge by generating relative advantages. To illustrate this, consider a population where individuals can play one of two possible strategies: ‘Social’ or ‘Spiteful’. ‘Spiteful’ agents choose to pay a cost to inflict harm on their interaction partner, while ‘Social’ agents do not pay a cost and do not inflict harm. Note that the strategy label ‘Social’ is used minimally to refer to non-spiteful behavior. Let r denote the degree of anti-correlated behavior in the population: with probability r individuals interact with those using different strategies and with probability (1 − r) interact randomly. Using the payoffs from Fig. 1, we can deduce that ‘Spiteful’ behavior will outperform ‘Social’ behavior whenever the degree of anti-correlated interaction is larger than the cost-to-harm ratio: r > c/b (see SI). This rule mirrors Hamilton’s rule for when altruism will be favored, and was also derived by Hamilton in the context of negative relatedness^16,17. However, anti-correlated interactions need not be generated by genetic relatedness. They can be realized in any number of ways including green beard effects⁸, neutral markers¹⁸, spatial structure¹⁹, and small populations²⁰. The exogenous parameter r represents correlated interactions abstractly, without reference to a specific mechanism. We demonstrate that dynamic networks can endogenously produce the anti-correlated interactions necessary for spite, and that dynamic networks do so in a way that is not captured with the traditional analysis.
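The threshold r > c/b can be recovered with a short calculation. The following is only a sketch under assumptions that are not spelled out in this section: every interaction yields a baseline payoff B (its value cancels), a ‘Spiteful’ agent always pays c, an agent loses b whenever its partner is ‘Spiteful’, and p denotes the current fraction of ‘Spiteful’ agents (B and p are illustrative symbols introduced here). A ‘Spiteful’ agent meets another ‘Spiteful’ agent only through random mixing, with probability (1 − r)p, whereas a ‘Social’ agent meets a ‘Spiteful’ agent with probability r + (1 − r)p:

$$\pi_{\mathrm{Spiteful}} = B - c - b\,(1-r)\,p, \qquad \pi_{\mathrm{Social}} = B - b\,\bigl[r + (1-r)\,p\bigr],$$

$$\pi_{\mathrm{Spiteful}} - \pi_{\mathrm{Social}} = b\,r - c > 0 \iff r > \frac{c}{b}.$$

Under these assumptions the difference does not depend on p, so the same condition governs both invasion and stability.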

Dynamic networks capture the fact that in many human interactions we choose the people we wish to interact with, and how often, such as when we join social or religious groups²¹. It has been shown that dynamic networks allow for endogenous correlated interactions and thus for the emergence of altruism in the classic Prisoner’s Dilemma^22,23,24,25,26. We show that dynamic networks can also do the opposite: produce anti-correlated interactions which allow for the spread of spite. Specifically, spite will spread in dynamic networks with adaptive link weights, where these weights are based on reinforcement learning. This happens because both correlated and anti-correlated interactions occur simultaneously, enabling spite to spread through the network via imitation. Reinforcement learning has been identified by cognitive scientists and economists as a biologically plausible way to describe human choice in a variety of iterative decision-making settings^27,28,29. We use reinforcement learning^22,30,31 to dynamically update network ties, and find that correlated and anti-correlated interactions emerge endogenously in a population, which impacts the evolution of cooperation and social conventions. Our approach contrasts with the class of dynamic network models that represent network links as discrete and where the dynamics describe patterns of link breakage and formation^24,26,32. Real social network ties are rarely discrete; therefore, modeling network connections with reinforcement learning adds an important element of realism^33,34,35.

Here we employ these weighted dynamic networks to study the evolution of costly harmful behavior in the Prisoner’s Delight. We track the emergent levels of correlated and anti-correlated interactions among the strategies over time and allow the strategies to co-evolve over time via imitation. ‘Social’ agents become correlated with one another and ‘Spiteful’ agents become anti-correlated. Despite the advantages of pro-social behavior in this game and the ability to correlate social interactions, spite nevertheless evolves in a wide range of conditions.

Results

We use agent-based computer simulations³⁶ to explore the co-evolution of network structure and behavioral strategies. Results are averaged across 200 randomly seeded simulations with 1 million time-steps and a population of 50 with an imitation rate of 0.01, unless otherwise noted. Each time-step every agent selects one other agent to visit and both agents play the game. Agents then update their likelihood to visit others based on the payoffs received and have a chance to imitate the strategies of other agents (Fig. 1). Because playing ‘Social’ is a dominant strategy in the Prisoner’s Delight, it will be expected to fixate in a randomly mixing population. Any agent playing ‘Spiteful’ in a given interaction will be worse off than if they had played ‘Social’ instead. However, when b > c, ‘Spiteful’ agents inflict greater harm on their partner than they pay in cost. This difference allows for the possibility of the ‘Spiteful’ strategy to spread via imitation in non-random networks. Setting b + c = 1 normalizes the payoffs and allows us to analyze the effects of payoff differences through the ratio b/c.

Spite spreads on dynamic networks

Our key question is whether reinforcement learning on a dynamic network can produce anti-correlated interactions leading to the emergence of spiteful behavior: the answer is yes. All populations converge to a uniformity of social behavior or spiteful behavior. Spite becomes the norm in our model over a range of population sizes (Fig. 2a), as well as imitation rates (Fig. 2b). This outcome occurs when the ratio of the harm done to the cost of spite is sufficiently large, i.e., a high enough b/c value. As each agent reinforces their network weights based on the payoffs they receive, over time both ‘Social’ and ‘Spiteful’ agents learn to visit ‘Social’ agents. This learning process changes the share of all interactions that each of the four possible ordered visitor–host couplings represents (Fig. 2c). The result is that ‘Spiteful’ agents begin to participate in fewer interactions as other agents learn to avoid visiting them. In each round, however, every agent has their own opportunity to select a partner to visit. It becomes likely that a ‘Spiteful’ agent will select a ‘Social’ agent to visit, and therefore receive a payoff of b. Conversely, ‘Social’ agents are visited more frequently as other agents learn to target them. These interactions will occur with both ‘Social’ and ‘Spiteful’ agents, providing a mixture of the maximum payoff, 1, and numerous much smaller payoffs, c. As the ratio of b/c grows larger, the smaller value of c will slow the rate of growth in the average payoff received by ‘Social’ agents. At the same time, the increased value of b and decreasing visits from other ‘Spiteful’ agents will increase average payoff received by ‘Spiteful’ agents until it surpasses that of the ‘Social’ agents (Fig. 2d). This means when ‘Social’ agents select a ‘Spiteful’ agent to consider imitating, they will choose to adopt the ‘Spiteful’ strategy.
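The payoffs referred to in this section can be collected in a small lookup table. The following is a minimal sketch, assuming the normalization b + c = 1 and the values named in the text (1 for two ‘Social’ agents, c for a ‘Social’ agent harmed by a ‘Spiteful’ partner, b for a ‘Spiteful’ agent paired with a ‘Social’ one). The payoff of 0 for two ‘Spiteful’ agents is inferred from the normalization rather than stated here, and the function name `payoffs` is hypothetical.

```python
def payoffs(b):
    """Payoff table for the Prisoner's Delight, assuming b + c = 1.

    Keys are (own strategy, partner strategy); values are the own payoff.
    """
    c = 1.0 - b
    return {
        ("Social", "Social"): 1.0,      # baseline benefit, nobody pays or harms
        ("Social", "Spiteful"): c,      # partner removes b from the baseline
        ("Spiteful", "Social"): b,      # own cost c is subtracted from the baseline
        ("Spiteful", "Spiteful"): 0.0,  # assumption: both the cost and the harm apply
    }


table = payoffs(0.8)
for partner in ("Social", "Spiteful"):
    # 'Social' earns more than 'Spiteful' against either partner (dominance).
    print(partner, table[("Social", partner)] > table[("Spiteful", partner)])
```

The table makes the puzzle concrete: within any single interaction ‘Social’ does better, so any advantage for ‘Spiteful’ has to come from who interacts with whom.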

**Fig. 2: Dynamic networks produce (anti-)correlated interactions that lead to spite.**

These central results are robust with respect to starting population frequencies: spite is able to spread reliably to the whole population from a single ‘Spiteful’ individual (Fig. S2). Similar results are also derivable from a simplified analytic model assuming discrete ties formed by choosing best-responses (see ‘Methods’). In the simplified model, a single ‘Spiteful’ individual will invade and spread through the whole population provided 3b − c > 2, even if overall average interactions are positively correlated when invasion occurs. Just as in our simulation results the invasion of ‘Spiteful’ occurs because ‘Social’ agents make up a larger proportion of the ‘Spiteful’ agents’ interactions than they do for other ‘Social’ agents. This is possible because ‘Social’ agents gain incoming ‘Spiteful’ links, reducing their share of ‘Social’ interactions. Simultaneously, ‘Spiteful’ agents shed harmful links with other ‘Spiteful’ agents, increasing their share of ‘Social’ interactions.
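To relate the invasion condition of the simplified model to the b/c ratio used in the simulations, one can substitute the normalization b + c = 1. This is only an illustrative calculation, and it assumes that the simplified model uses the same normalization, which is not stated in this section:

$$3b - c > 2 \;\Longrightarrow\; 3b - (1 - b) > 2 \iff b > \tfrac{3}{4} \iff c < \tfrac{1}{4} \iff \frac{b}{c} > 3.$$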

The endogenous partner choices of the agents in response to payoffs are essential. These choices allow ‘Spiteful’ agents to disproportionately target ‘Social’ agents, even as the ‘Social’ agents are disproportionately interacting with one another. The simultaneous independent formation of anti-correlated interactions for ‘Spiteful’ agents as well as correlated interactions for ‘Social’ agents is made possible by the use of asymmetric link updating in the model. This allows each agent to form reinforced preferences over the other agents they can choose to visit without being influenced by who approaches them. This would not be possible under symmetric link updating: then one agent’s preference for frequently visiting another agent would cause the visited agent to add reinforcing link weight to the visiting agent. Since both kinds of correlated interaction can appear simultaneously, we represent the mean degree of correlated interaction as $\bar{a}$. If $\bar{a} > 0$, mean interactions are positively correlated. If $\bar{a} < 0$, mean interactions are anti-correlated. The classic condition for the invasion and stability of spite in a population corresponds to an overall degree of correlated interaction below −c/b (see SI). Note that although we can compare the results, $\bar{a}$ is distinct from the classic exogenous r-parameter as $\bar{a}$ is merely a descriptive statistic of the network structure resulting from the local learning process of agents. Agents gradually adapt their behavior based on past payoffs and everyone learns to avoid ‘Spiteful’ agents. This tends to produce correlated interactions among agents engaged in ‘Social’ interactions, and anti-correlated interactions between agents engaged in ‘Spiteful’ interactions (Fig. 2e). This outcome highlights a frequently overlooked detail of previous modeling approaches: it is important to consider and measure the degree of correlated interaction for each strategy type individually, in addition to the overall value. In our model, the overall average degree of correlated interaction stays relatively stable and neutral over time because of the offsetting correlated interactions of each strategy type. A notable consequence of this is that spite can reliably spread through a population even with relatively neutral correlated interaction ($\bar{a} > -c/b$; Fig. 2f). This result suggests a significant limitation of the exogenous global methods of modeling correlated interactions and that the well established r > c/b condition for the evolution of harmful behavior does not generalize.

Effect of variations in learning

Network learning speed influences the evolution of network weights and the resulting degree of correlated interaction. The rate of learning can be adjusted by multiplying the payoffs received in the reinforcement process. Stronger adjustments of the relative weight of agents’ outgoing network weights speed up the process of finding partners that they perform better against. Thus, faster network learning causes the correlated and anti-correlated interactions of ‘Social’ and ‘Spiteful’ agents to emerge earlier, leading the ‘Spiteful’ strategy to become dominant after fewer time steps. Given a non-negligible network learning speed and a sufficient number of time steps, however, spite regularly emerges in the population provided a large enough b/c value (Fig. 3a). Network learning speed also affects the rate at which global network patterns appear and transition throughout the simulation. The overall network displays four key stages during the network evolution process (Fig. 3b). Populations begin with behavior akin to random mixing, due to the fact that agents are initialized with uniform network weights. Structure in the population begins to emerge as agents learn to target ‘Social’ agents in their interactions. ‘Social’ agents begin to receive most incoming interactions from both ‘Social’ agents and ‘Spiteful’ agents. This process continues and results in a core–periphery structure with ‘Social’ agents forming the core, and ‘Spiteful’ agents generally in the periphery receiving relatively few visitors. Despite this, the ‘Spiteful’ strategy begins to spread by imitation due to comparison of relative payoffs between individuals. Finally, the ‘Spiteful’ strategy comes to dominate and the network slowly trends back toward random mixing as there are no more ‘Social’ agents to target.

**Fig. 3: Network processes drive model results.**

Another parameter that influences network evolution is the discount rate in learning, which controls the rate at which weights diminish over time. Network discounting determines how significant recent payoffs are relative to total past payoffs in determining network weights. This can be interpreted as an abstract representation of recency bias or fading memory of past payoffs. This is important both as a psychologically realistic aspect of learning and since discounting is known to impact results of reinforcement learning rules^22,27,28,29.

Network discounting allows agents to unlearn network link weights connected to agents who have switched from the ‘Social’ to ‘Spiteful’ strategy. This is implemented at the end of every round by multiplying the weight of each network connection by a constant (1 − δ) where δ (0 ≤ δ ≤ 1) represents the degree to which past payoffs are discounted. When δ ≈ 0, learning speed slows down over time and agents will not adapt to changes in the strategies of other players. If a frequently visited agent changes strategy from ‘Social’ to ‘Spiteful’, the lack of new reinforcement combined with the discount factor allows agents to unlearn their previous reinforced behavior. In practice, this means that when the network discount is small, it dampens the levels of correlation and anti-correlation in the model (Fig. 3c). When the network discount value (δ) is set to 0.01, or larger, spite emerges over a large space of parameter combinations. But when the network discount is set to 0.001, spite does not become the dominant strategy even for very high values of the network learning speed.
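A rough worked illustration shows what the two discount values imply for forgetting. It is a sketch that considers only a link receiving no further reinforcement, so that its weight is simply multiplied by (1 − δ) each round; the half-life $t_{1/2}$ is an illustrative quantity introduced here, not one used in the model description:

$$w_{ij}(t) = (1-\delta)^{t}\, w_{ij}(0), \qquad t_{1/2} = \frac{\ln 2}{-\ln (1-\delta)} \approx \frac{\ln 2}{\delta}.$$

For δ = 0.01 an unreinforced weight halves in roughly 69 rounds, whereas for δ = 0.001 it takes roughly 693 rounds, so old preferences persist about ten times longer.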

Thus far, we have only considered one imitation mechanism, but we can easily implement others in the model. For example, if the mechanism imitated behaviors based on total payoffs rather than average payoff per interaction, spite rapidly dies out and populations always converge on the ‘Social’ strategy. The opposite is true of cooperation in the Prisoner’s Dilemma^14,22 (see Supplementary Note 4 for comparison). Furthermore, we can implement alternative imitation mechanisms that mirror biological reproduction, copying other strategies with a probability proportional to their success such as the Moran process^37,38. Adapting the Moran process to our model shows that spite can still emerge with considerable frequency, albeit with less regularity (see Supplementary Note 6 for more detail).

Discussion

Our model shows that costly harm can spread through a population, structured by preferential interaction, via imitation. Such harm can emerge even without a significant overall degree of anti-correlated interactions in the population. It is well known that costly harm will emerge in populations with a sufficient degree of anti-correlated interaction^8,12,16,18,19,20. What our model demonstrates is that the degree of anti-correlated interaction can be partial, transient, strategy specific, and that it can coexist with correlated interactions among other strategies. These results highlight the importance of evaluating patterns of correlated interaction not just at a system level, but also at the level of strategy type. Consequently, traditional global-parameter methods of representing such assortment in populations cannot adequately capture the evolutionary consequences of dynamic assortment on evolving networks. Thus, our results reveal a novel evolutionary pathway for the emergence of costly harmful behavior in any system where individual agents are capable of learning by reinforcement and imitation. It has been recognized that dynamic networks can provide a mechanism for the spread of cooperation^22,23,24,25,26, and our results reveal they can also cause the spread of spite. Interestingly, recent empirical work shows that among humans, anti-social behaviors spread more readily among peers than do pro-social behaviors³⁹.

There are two connections between our model and other formal evolutionary models of social behavior that help clarify the generality of our results. First, while we have constructed our model using learning dynamics, the abstract nature of learning models allows for a more general application. Learning models tend to be associated with within-generation cultural evolutionary interpretations⁴⁰. Yet there are formal results that show that models of imitation learning map onto biological reproduction^41,42, and models of reinforcement learning map onto standard Darwinian evolutionary dynamics⁴³. This allows for biological interpretations of the model in addition to the cultural interpretations. Second, while we have framed our research in terms of correlated interaction, there are many important formal results connecting the notion of correlation to the notions of relatedness and inclusive fitness in Hamilton’s work^8,15,17,44,45. We find that our model has similar results when using a strategy update rule more amenable to a biological interpretation (specifically a modified Moran process; see SI). This further supports the view that the overall effect we observe should be relevant to any species that is capable of preferential interaction. A biological interpretation of our model would then have implications for whether we need to re-evaluate the traditional global-parameter approaches to kin-selection and inclusive fitness.

Our results also have significant implications for open questions in social science and the evolution of social behavior. Human (and human ancestor) populations are clearly candidates for the operation of this evolutionary mechanism. Humans often choose their interaction partners on the basis of past experience and imitate the social behavior of others. Humans are also tuned to the success of their behavioral strategies and can update their behavior accordingly. That said, our model focuses only on the costs and benefits of behavioral strategies and does not include motivational states, sophisticated cognition, or many other psychological mechanisms. A full account of human social behavior would require considering these factors explicitly. Applying these models to humans, or any cognitively complex organism, raises a host of questions about how psychological mechanisms relate to behavior, e.g., about how motivations connect to evolutionary models of behavioral change. One avenue for further research is to pursue the connections between formal models, such as ours, and comparative psychological studies on altruism and spite^4,46,47,48.

The origin and evolution of punishment is another area of ongoing debate and research where our results are relevant. The use of punishment has been well documented in humans and broader ecological systems^49,50,51. Punishment, broadly construed, is costly harm inflicted conditional on some behavioral response. An example of such behavior occurs in laboratory experiments of iterative public goods games in labor settings with profit sharing⁵². In these settings, humans choose to reduce the payoff of a non-contributing free-rider even when they are charged a cost in order to do so⁵³. Punishment can influence social interaction in many possible ways. Enforcing norms through punishment can stabilize cooperation (or any other behavior)⁵⁴, or create complicated interactions between reciprocity and retaliation^55,56. There are also a number of hypotheses on the evolution of punishment⁵⁷. One influential approach treats punishment as an altruistic behavior that stabilizes cooperative norms⁵⁸. More recent studies have shown surprising complexity to punishing behavior, including anti-social punishment^6,59, where individuals punish cooperating members and can destabilize cooperation⁶⁰. Our study, by uncovering a new pathway for the evolution of costly harm, suggests a new hypothesis for the origin of punishment. Rather than evolving to stabilize a cooperative social environment by enforcing some beneficial norm, costly harm may have emerged independently as a way of targeting competitors. Then it need only be directed toward enforcement of some other behavior to become true punishment^3,61.

Methods

We model the co-evolution of interaction structure and social behavior^62,63. Agents are assigned initial behaviors in the Prisoner’s Delight game: ‘Social’ or ‘Spiteful’. Agents choose their interaction partner, update who they are likely to choose on the basis of payoffs received, and periodically imitate others who are receiving higher average payoffs.

Simulation model

Network updating occurs via Roth-Erev reinforcement learning. Each round, agents select one other agent to visit proportional to their outgoing network weights, interact, and receive a payoff. Network weights are updated according to Roth-Erev reinforcement learning on the basis of the payoff received: the larger the payoff, the more weight is added to that agent’s outgoing network link. Network weights are updated asymmetrically, meaning only the agent who initiated the interaction (the visitor) updates their outgoing weight based on the received payoff. This asymmetry represents a situation where individuals can control who they choose to approach, but not who approaches them. It also allows some individuals, who receive more visitors than others, to have more interactions per round (however, each agent is guaranteed one interaction as visitor). Our network model also represents links in a continuous manner. This approach avoids the need to exogenously set the number of network connections, which can affect results⁶⁴. Instead, all learning and evolution of network weights occurs endogenously. Network weights are initialized as uniform, representing random initial interaction. Structured interaction emerges as agents learn and payoffs accumulate.

Strategy learning occurs via imitation. After each round is complete, agents have a chance to consider imitating the strategy of another agent. When imitating, an agent selects another agent with a probability proportional to how often they interact, and if the selected agent received a higher average payoff per interaction in the previous round, imitates that agent’s strategy. This captures the importance of social learning in many animal behaviors, while also allowing individuals to select their interactions based on their own experiences^22,65,66. Imitation is a form of social learning that is used in a variety of contexts to model cultural evolution^40,67 and also has important formal relationships to models of biological evolution⁴¹. Initial strategies are determined by randomly assigning half of the agents to be ‘Social’ and the other half to be ‘Spiteful’ (Supplementary Note 5 also examines the case of a single random ‘Spiteful’ agent).

More precisely, we model a set of N agents in pairwise games of cooperation across a set number of rounds (1 million). In each round, every agent selects another agent to play against. During every interaction, an agent plays their pure strategy independently of the other agent’s strategy, and receives a payoff accordingly. Next, the agent who initiated the interaction reinforces their outgoing network connection by the payoff received. An agent can be a player in a maximum of N interactions per round (1 as visitor, and N − 1 as host). At the end of each round, every agent independently and simultaneously considers imitating another agent with a set probability proportional to the imitation rate (γ). If an agent is selected to consider imitation, then they randomly select another agent with a probability proportional to the weight of their outgoing and incoming links with other agents. This represents the overall frequency of interaction with each other agent, so individuals are more likely to consider agents with which they have more interactions. The selected agent’s strategy is imitated if and only if that agent received a strictly greater average payoff per interaction than the imitating agent. We also include an error chance (e = 0.01) when agents consider imitation, in which a random strategy is chosen rather than imitating another agent.

Each agent i has a pure strategy (‘Social’ or ‘Spiteful’) and a vector representing their reinforcement weights for the choice of which player to visit: (w_i1, w_i2, . . . , w_iN), where w_ij represents the weight related to player i visiting player j. Self-visits are not allowed (w_ii = 0). At the start of each simulation, half of the agents are ‘Social’ and half of the agents are ‘Spiteful’ (results are robust to different starting proportions; Fig. S2). Initial network partner weights are set uniformly to L/(N − 1), where L is a parameter determining initial learning weights. We use the convention L = 9, so that network partner weights start at w_ij = 1 for the smallest population we study (N = 10). To ensure similar reinforcement learning speed relative to total initial weights across different population sizes, we kept L constant over all simulations. Changing L varies network learning speed because larger L values reduce the relative size of each payoff in comparison to the initial uniform network weights.

The model has two parameters that affect the reinforcement of network weights: discounting (δ) and error (ϵ). Discounting reduces past learning weights as more reinforcement occurs, gradually allowing agents to forget old network connections. Errors represent mistakes, noise, or mutations where an agent selects an interaction partner at random rather than according to their network weights. Both components have been shown to impact the stability and long-run behavior of reinforcement learning.

When selecting an interaction partner for a given round, the probability of choosing agent j is proportional to the current network weights:

$${\mathrm{Pr}}(j)=(1-\epsilon )\frac{{w}_{ij}}{{\sum }_{k}{w}_{ik}}+\epsilon \frac{1}{| N| },$$

(1)

where ϵ is the error rate, N is the set of agents, j ∈ N, and k ∈ N.
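The initialization and the choice rule in Eq. 1 can be sketched in a few lines. This is an illustrative sketch rather than the simulation code used for the results: the function names are hypothetical, weights are stored as a nested list `w[i][j]`, and Eq. 1 is followed literally, so with ϵ > 0 the agent’s own index receives probability ϵ/|N|. How such self-draws are handled is not specified in this section; a caller could, for example, redraw.

```python
import random


def initial_weights(n, L=9.0):
    """Uniform outgoing weights L/(n - 1), with zero weight on self-visits."""
    return [[0.0 if i == j else L / (n - 1) for j in range(n)] for i in range(n)]


def visit_probabilities(w_i, epsilon):
    """Eq. 1 applied to one agent's outgoing weight vector w_i."""
    n = len(w_i)
    total = sum(w_i)
    return [(1 - epsilon) * w / total + epsilon / n for w in w_i]


def choose_partner(w_i, epsilon, rng):
    """Draw the index of the agent to visit according to Eq. 1."""
    probs = visit_probabilities(w_i, epsilon)
    return rng.choices(range(len(w_i)), weights=probs)[0]


rng = random.Random(0)            # seeded generator for reproducibility
w = initial_weights(10)           # N = 10 gives w_ij = 1 for every j != i
partner = choose_partner(w[0], epsilon=0.0, rng=rng)
```

With ϵ = 0 and uniform initial weights every other agent is equally likely to be visited, which corresponds to the random-mixing starting point described above.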

After each round of interactions, link weights for the selected outgoing links are updated by discounting the prior weight by a factor (δ) and adding the received payoff (π):

$${w}_{ij}^{\prime}=(1-\delta ){w}_{ij}+R{\pi }_{i},$$

(2)

where ${w}_{ij}^{\prime}$ is the link weight after updating and π_i is the sum of i’s most recent payoffs. R is the rate of network learning, set to R = 1 by default; higher values result in faster learning (results are robust to both small and large learning rates; Fig. S3). All link weights are updated simultaneously at the end of each round, reflecting the outcome of every interaction an agent was a part of during the round.
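The update in Eq. 2 is a one-line recurrence, and iterating it shows how discounting bounds the weights. The sketch below uses hypothetical function names; the fixed point Rπ/δ is not stated in the text but follows directly from Eq. 2 if the same payoff π arrives every round and δ > 0.

```python
def update_weight(w_ij, payoff_sum, delta, R=1.0):
    """Eq. 2: discount the old weight, then add the scaled payoff."""
    return (1 - delta) * w_ij + R * payoff_sum


def steady_state_weight(payoff_sum, delta, R=1.0):
    """Fixed point of Eq. 2 for a constant per-round payoff (requires delta > 0)."""
    return R * payoff_sum / delta


w = 1.0
for _ in range(5000):
    # The same link is reinforced with payoff 0.5 in every round.
    w = update_weight(w, payoff_sum=0.5, delta=0.01)
print(w, steady_state_weight(0.5, 0.01))
```

A link that is not reinforced (payoff sum 0) simply decays geometrically, while a link that is reinforced every round saturates near Rπ/δ, so smaller δ values allow larger accumulated weights and slower adaptation.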

After link weights have been updated at the end of a round, every agent independently considers imitating another agent’s strategy proportional to the imitation rate (γ). If an agent is randomly selected to consider imitation, they select a possible imitation partner proportional to their outgoing and incoming link weights associated with every other agent:

$${\mathrm{Pr}}(j)=\frac{{w}_{ij}+{w}_{ji}}{{\sum }_{k}({w}_{ik}+{w}_{ki})}.$$

(3)

If the selected imitation partner has received a higher average payoff over all interactions in the previous round, then the agent considering imitation will adopt the selected agent’s strategy.
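The imitation step (Eq. 3 followed by the payoff comparison) might be sketched as follows. This is an illustration under stated assumptions, not the original implementation: the function names are hypothetical, `avg_payoff[k]` is assumed to hold agent k’s average payoff per interaction in the previous round, the comparison is strict as described in the simulation model, and the imitation error (e) is omitted for brevity.

```python
import random


def imitation_probabilities(w, i):
    """Eq. 3: probability that agent i considers each agent k as a model."""
    n = len(w)
    scores = [w[i][k] + w[k][i] for k in range(n)]  # outgoing plus incoming weight
    total = sum(scores)
    return [s / total for s in scores]


def maybe_imitate(i, strategies, avg_payoff, w, gamma, rng):
    """Return agent i's strategy after one imitation opportunity."""
    if rng.random() >= gamma:
        return strategies[i]              # not selected to consider imitation
    probs = imitation_probabilities(w, i)
    j = rng.choices(range(len(w)), weights=probs)[0]
    if avg_payoff[j] > avg_payoff[i]:     # strictly higher average payoff
        return strategies[j]
    return strategies[i]


w = [[0, 1, 0], [1, 0, 0], [0, 0, 0]]     # agents 0 and 1 only interact with each other
strategies = ["Social", "Spiteful", "Social"]
new_strategy = maybe_imitate(0, strategies, [0.5, 0.8, 0.1], w, gamma=1.0,
                             rng=random.Random(0))
```

Because w_ii = 0, an agent never selects itself as a model, and agents it never interacts with are never considered.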

We measure the degree of correlated interaction for each agent (a_i) as a function of the total proportion of the incoming and outgoing link weights of that agent for all connected agents of the same strategy type. Let s_i denote the strategy type of agent i, and let Same_i denote the set of agents j that have the same strategy as i (s_i = s_j). We calculate the difference between the proportion of same-strategy interactions and the proportion of same-strategy interactions expected under random interaction:

$${a}_{i}=\frac{{\sum }_{j\in {\mathrm{Same}}_{i}}({w}_{ij}+{w}_{ji})}{{\sum }_{k}({w}_{ik}+{w}_{ki})}-\frac{| {\mathrm{Same}}_{i}| }{| N| }.$$

(4)

If a_i > 0, agent i interacts with agents of its own type more than it would under random interaction, so its interactions are positively correlated; if a_i < 0, they are anti-correlated. This allows us to represent correlated and anti-correlated interaction simultaneously in different sub-populations. The degree of correlated interactions for ‘Social’ agents is the mean measure of correlated interaction of all agents using the ‘Social’ strategy; the degree for ‘Spiteful’ agents is the mean measure for all agents using the ‘Spiteful’ strategy. The overall measure of correlated interaction for the population $(\bar{a})$ is the mean degree of correlation for all agents in the population regardless of strategy type. Note that in an unstructured population with uniform correlated interaction rates, the classic inequality r > c/b corresponds to an $\bar{a}$-value of −c/b with respect to the stability and invasion conditions for ‘Spiteful’ behavior (see SI). Also note that the measure of correlated interaction for an extinct strategy type is undefined because it is the mean of an empty set; we plot these values as 0 because such types can be re-introduced by imitation error.
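A short Python sketch of Eq. (4) may help make the measure concrete. It rests on two assumptions that should be read as such: agents have no self-links, and |Same_i| counts agent i itself (this reading is inferred from the value a_i = −1/N reported for a lone ‘Spiteful’ agent in the analytic model). The function name and the toy network are hypothetical, and the sketch requires that agent i has at least one link with positive weight.

```python
def correlated_interaction(w, strategies, i):
    """Eq. (4): same-strategy share of i's link weight minus the random baseline."""
    n = len(w)
    others = [k for k in range(n) if k != i]
    total = sum(w[i][k] + w[k][i] for k in others)
    same_weight = sum(w[i][k] + w[k][i] for k in others
                      if strategies[k] == strategies[i])
    # Baseline: fraction of the population sharing i's strategy (i included).
    same_count = sum(1 for s in strategies if s == strategies[i])
    return same_weight / total - same_count / n


# Toy network: agent 0 is 'Spiteful' and links only to a 'Social' agent.
strategies = ["Spiteful", "Social", "Social", "Social"]
w = [[0, 1, 0, 0],
     [0, 0, 1, 0],
     [0, 1, 0, 0],
     [0, 0, 1, 0]]
values = [correlated_interaction(w, strategies, i) for i in range(4)]
print(values, sum(values) / len(values))
```

In this toy network the single ‘Spiteful’ agent is anti-correlated while some ‘Social’ agents are positively correlated, which is the kind of mixed pattern the measure is designed to expose.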

All model parameters are summarized in Table 1.

Table 1 Parameters and baseline values.

Analytic model

A simplified model enables the derivation of mathematical results that illustrate and further support the key insights of our study. This simplified model uses discrete network links formed by a best-response rule, while strategy updating occurs by imitation, as in the central model. Analytic results from the simplified model further support the simulation results of the central model: endogenous partner choice in dynamic networks allows spite to spread via imitation.

Suppose there are N agents, each with a strategy s_i ∈ {‘Spiteful’, ‘Social’}. Each agent i at each time t has exactly one outgoing link to one other agent j ≠ i, represented as ${l}_{ij}^{t}\in N\times N$ (a temporal adjacency matrix); the number of incoming links is limited only by the number of agents. At a given time t, the interaction set ${I}_{i}^{t}$ for an agent i is the set of all their interaction partners: ${I}_{i}^{t}=\{j| {l}_{ij}^{t}\,{\rm{or}}\,{l}_{ji}^{t}\}$. Let u(s_i, s_j) be the payoff of i’s strategy played against j’s strategy.

At the beginning of each time-step, agents select their outgoing link using a myopic best-response rule: link to another agent such that payoffs are maximized if all strategies remain constant. Precisely, each agent forms one link ${l}_{ij}^{t}$ for time-step t by choosing randomly (with a uniform distribution) from the set of optimal links L_i:

$${L}_{i}=\{{l}_{ij}| u({s}_{i}^{t},{s}_{j}^{t})\ge u({s}_{i}^{t},{s}_{k}^{t})\,\,{\text{for}}\; {\text{all}}\,\,k\in N\}.$$

(5)

After links are formed, agents play the game in Fig. 1 with every agent in their interaction set. An agent’s average payoff per interaction at time t is:

$${\bar{U}}_{i}^{t}=\sum _{j\in {I}_{i}^{t}}u({s}_{i}^{t},{s}_{j}^{t})/| {I}_{i}^{t}| .$$

(6)

After interacting, agents update their strategies by imitation. Let ${B}_{i}^{t}$ represent the equal or better-performing agents in i’s interaction set (including i):

$${B}_{i}^{t}=\{j\in \{{I}_{i}^{t}\cup i\}| {\bar{U}}_{j}^{t}\ge {\bar{U}}_{i}^{t}\}.$$

(7)

During imitation i selects a random $j\in {B}_{i}^{t}$ (with a uniform distribution) and adopts that agent’s strategy: ${s}_{i}^{t+1}={s}_{j}^{t}$. After imitation, links are updated as above and the process repeats.
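One time-step of this simplified model (Eqs. (5)–(7)) can be sketched in Python as below. The payoff table is passed in by the caller; the example uses only the three entries that the analytic argument in this section relies on (1 for ‘Social’ against ‘Social’, c for ‘Social’ against ‘Spiteful’, b for ‘Spiteful’ against ‘Social’), with illustrative values b = 0.9 and c = 0.2. The ‘Spiteful’ against ‘Spiteful’ payoff is not needed for a single invader and would have to be supplied from Fig. 1 for other populations. All function names are hypothetical.

```python
import random


def best_response_links(strategies, u, rng):
    """Eq. (5): each agent links to one uniformly chosen payoff-maximizing partner."""
    n = len(strategies)
    links = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        best = max(u[(strategies[i], strategies[j])] for j in others)
        optimal = [j for j in others if u[(strategies[i], strategies[j])] == best]
        links.append(rng.choice(optimal))
    return links


def interaction_sets(links):
    """I_i: everyone i links to or is linked from (mutual links count once)."""
    sets = [set() for _ in links]
    for i, j in enumerate(links):
        sets[i].add(j)
        sets[j].add(i)
    return sets


def average_payoffs(strategies, sets, u):
    """Eq. (6): mean payoff per interaction partner."""
    return [sum(u[(strategies[i], strategies[j])] for j in partners) / len(partners)
            for i, partners in enumerate(sets)]


def imitate(strategies, sets, payoffs, rng):
    """Eq. (7): copy a uniformly chosen equal-or-better agent from I_i plus i."""
    new = []
    for i, partners in enumerate(sets):
        better = sorted(j for j in partners | {i} if payoffs[j] >= payoffs[i])
        new.append(strategies[rng.choice(better)])
    return new


b, c = 0.9, 0.2  # illustrative values only
u = {("Social", "Social"): 1.0, ("Social", "Spiteful"): c, ("Spiteful", "Social"): b}
strategies = ["Spiteful"] + ["Social"] * 4
rng = random.Random(1)
links = best_response_links(strategies, u, rng)
sets = interaction_sets(links)
payoffs = average_payoffs(strategies, sets, u)
print(imitate(strategies, sets, payoffs, rng))
```

Running the step repeatedly (with a complete payoff table) gives a direct way to explore the invasion conditions derived in the next subsection.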

Conditions for invasion and spread of spite

Note that because the game is dominance solvable, all agents regardless of their own strategies will form links with agents playing ‘Social’. Thus, in any mixed population (N > x > 0, where x is the number of ‘Social’ agents), ‘Spiteful’ agents will only have ‘Social’ interaction partners whereas ‘Social’ agents may have a mix of partners. Consequently, the proportion of ‘Social’ interactions for a ‘Spiteful’ agent will always exceed the proportion of ‘Social’ interactions for anyone they visit. This difference, when b > c, allows spite to spread through the population. Given endogenous partner choice, whether spite is expected to spread through imitation depends on the relevant mean payoffs of the agents interacting with ‘Spiteful’ individuals. To determine this condition, suppose there is a single ‘Spiteful’ agent i. This agent will visit a single ‘Social’ agent j, and not be visited by anyone. Thus, ${\bar{U}}_{i}^{t}=b$.

The ‘Social’ agent j will be visited by i (${l}_{ij}^{t}$), will visit some other ‘Social’ agent k (${l}_{jk}^{t}$), and will be visited by z other ‘Social’ agents (where 0 ≤ z ≤ N − 2). Thus, ${\bar{U}}_{j}^{t}=(c+1+z)/(z+2)$ and j has a chance to imitate i if and only if

$$b\ge \frac{z+1+c}{z+2}.$$

(8)

If the above inequality is strict, i will never imitate j and j will eventually imitate i’s ‘Spiteful’ strategy.

Since c < 1, there will be some c, b, and z such that ‘Spiteful’ will spread by imitation. To determine at what payoff values we can expect spite to begin to spread from a single ‘Spiteful’ agent, note that all agents form a single link. Hence, the expected value of z is 1 and the expected mean payoff for j is ${\mathrm{Exp}}({\bar{U}}_{j}^{t})=(c+2)/3$. Thus, spite will be expected to spread by imitation whenever b > (c + 2)/3 or:

$$3b-c{\,}> {\,}2.$$

(9)
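The two conditions above are simple enough to check numerically. The Python sketch below encodes the right-hand side of Eq. (8) and the expected-case condition of Eq. (9); the function names and the sample values of b and c are illustrative assumptions.

```python
def social_payoff(c, z):
    """Mean payoff of the visited 'Social' agent j with z other 'Social' visitors."""
    return (z + 1 + c) / (z + 2)


def can_imitate_spite(b, c, z):
    """Eq. (8): j has a chance to imitate i when b is at least j's mean payoff."""
    return b >= social_payoff(c, z)


def spite_expected_to_spread(b, c):
    """Eq. (9): the expected-case condition, obtained by setting z = 1."""
    return 3 * b - c > 2


for b, c in [(0.9, 0.2), (0.7, 0.2)]:
    print(b, c, spite_expected_to_spread(b, c),
          [can_imitate_spite(b, c, z) for z in range(4)])
```

Since c < 1, the mean payoff (z + 1 + c)/(z + 2) = 1 − (1 − c)/(z + 2) rises with z, so a ‘Social’ agent with many ‘Social’ visitors is harder to convert than one with few.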

Generalizing this, if there are x ‘Social’ agents and y ‘Spiteful’ agents, a ‘Social’ agent j who is interacting with at least one ‘Spiteful’ individual is expected to be visited by (y − 1)/x other ‘Spiteful’ individuals and 1 other ‘Social’ individual. Thus, the expected mean payoff for a ‘Social’ agent j who is interacting with at least one ‘Spiteful’ agent is

$${\mathrm{Exp}}({\bar{U}}_{j}^{t})=\frac{2+c\left(1+\frac{y-1}{x}\right)}{3+\frac{y-1}{x}}.$$

(10)

Agent j is expected to imitate a ‘Spiteful’ individual whenever ${\mathrm{Exp}}({\bar{U}}_{j}^{t})\,<\,b$ or

$$b\left(3+\frac{y-1}{x}\right)-c\left(1+\frac{y-1}{x}\right)> \,2.$$

(11)

Comparing the generalized inequality to the case where y = 1 reveals that the chance of imitating ‘Spiteful’ becomes strictly greater the more ‘Spiteful’ individuals are present in the population. Once a single ‘Spiteful’ individual begins to spread, it is expected that they will continue to spread through the entire population.
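The monotonicity claim can be checked directly from Eq. (10). The Python sketch below uses exact rational arithmetic; the population size N = 10 and the value c = 1/5 are illustrative assumptions, and the function name is hypothetical.

```python
from fractions import Fraction


def expected_social_payoff(c, x, y):
    """Eq. (10): expected mean payoff of a 'Social' agent meeting a 'Spiteful' one."""
    extra = Fraction(y - 1, x)  # expected number of additional 'Spiteful' visitors
    return (2 + c * (1 + extra)) / (3 + extra)


N = 10
c = Fraction(1, 5)
payoffs = [expected_social_payoff(c, N - y, y) for y in range(1, N)]
for y, p in enumerate(payoffs, start=1):
    print(y, p, float(p))
```

With y = 1 the expression reduces to (c + 2)/3, matching the single-invader case. Writing e = (y − 1)/x, the derivative of Eq. (10) with respect to e is (2c − 2)/(3 + e)², which is negative whenever c < 1, so the threshold that b must exceed falls as ‘Spiteful’ agents become more common.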

Measure of correlated interaction during invasion

We can apply our measure of correlated interaction to this analytic model as well. Suppose there is a single ‘Spiteful’ individual i in a population of size N, and that the condition for the spread of spite is met (3b − c > 2). After network links are formed, we can calculate the expected a-value of the population. Note that a_i = −1/N for the single ‘Spiteful’ individual, a_j = 2/3 − (N − 1)/N (expected) for the j that i visits, and a_k = 1 − (N − 1)/N (expected) for the remaining N − 2 individuals. The aggregate correlated interaction is then

$$\bar{a}=(1/N)\left({a}_{i}+{a}_{j}+\sum _{k}{a}_{k}\right),$$

(12)

which reduces to

$$\bar{a}=\frac{1}{N}(2/3-2/N).$$

(13)

Using this equation we can see that $\bar{a}\,> \, 0$ whenever N > 3. Therefore, when spite begins to invade and spread in a population of 4 or more, strategies are (on average) positively correlated. This finding shows clearly that the general conditions derived in classic models (e.g., r > c/b) do not generalize to dynamic networks. Indeed, even the very weak condition that $\bar{a}\,<\,0$ is not necessary for spite to invade and spread. This reinforces the important lesson that traditional population-level statistics do not adequately describe social change in dynamic networks.
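The reduction from Eq. (12) to Eq. (13) and the N > 3 threshold can be verified with exact arithmetic. The following Python sketch does so for a range of population sizes; the function name is hypothetical and the range of N is arbitrary.

```python
from fractions import Fraction


def mean_correlation_single_invader(N):
    """Eq. (12) with the expected a-values for one 'Spiteful' invader."""
    a_i = Fraction(-1, N)                       # the lone 'Spiteful' agent
    a_j = Fraction(2, 3) - Fraction(N - 1, N)   # the 'Social' agent it visits
    a_k = 1 - Fraction(N - 1, N)                # each remaining 'Social' agent
    return (a_i + a_j + (N - 2) * a_k) / N


for N in range(3, 9):
    a_bar = mean_correlation_single_invader(N)
    closed_form = (Fraction(2, 3) - Fraction(2, N)) / N  # Eq. (13)
    print(N, a_bar, a_bar == closed_form, a_bar > 0)
```

The two a_k and a_j baseline terms combine to −1/N, so the sum inside Eq. (12) is 2/3 − 2/N, which is zero at N = 3 and positive for every larger population.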

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

All figures and data can be recreated using our code which has been made publicly available. Simulations were written in C++ and run for 10⁶ rounds of play. Data aggregation and network plots were created using Python.

Code availability

Replication code available on GitHub: https://doi.org/10.5281/zenodo.3962292.

Acknowledgements

We would like to thank Michael Foley who originally created much of the code used in our simulations and provided advice throughout the project.

Author information

Authors and Affiliations

Network Science Institute, Northeastern University, Boston, MA, USA
Zachary Fulker & Christoph Riedl
Department of Philosophy, Tufts University, Medford, MA, USA
Patrick Forber
Department of Philosophy and Religion, Northeastern University, Boston, MA, USA
Rory Smead
D’Amore-McKim School of Business, Northeastern University, Boston, MA, USA
Christoph Riedl
Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
Christoph Riedl
IMT Lucca, Piazza S. Ponziano, 6, 55100, Lucca, Italy
Christoph Riedl

Contributions

Z.F., P.F., R.S., and C.R. conceived the study, Z.F. conducted the experiment, Z.F. analyzed the results. All authors wrote and reviewed the manuscript.

Corresponding author

Correspondence to Christoph Riedl.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Bin Wu and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Fulker, Z., Forber, P., Smead, R. et al. Spite is contagious in dynamic networks. Nat Commun 12, 260 (2021). https://doi.org/10.1038/s41467-020-20436-1

Received: 09 April 2020
Accepted: 01 December 2020
Published: 11 January 2021
DOI: https://doi.org/10.1038/s41467-020-20436-1
