Much of the research on human cooperation has focused on the free-rider problem: how to maintain cooperation when individuals’ interests are opposed to those of the group. However, individual interests are often aligned with those of the group, and these mutualistic scenarios may be equally important in understanding human social evolution1,2,3,4. Even without incentives for individuals to defect, mutualism provides a different dilemma. When individuals differ in preferences or norms, it is harder to efficiently coordinate. The formation of reliable expectations of partner behavior that make coordination possible is therefore essential for the evolution of mutualism5.

Consider, for example, a couple planning their Saturday. Chris wants to go to the opera; Pat wants to go to the monster truck rally. Each would rather do something with their partner than go it alone, but each has a different preference6. If such mismatches are sufficiently frequent, Chris and Pat might be better off finding new partners with better-aligned interests. Successful cooperation requires resolution of this clash of preferences. Human societies are replete with dilemmas of this kind7, and the need to efficiently coordinate extends to many forms of collective action8. Institutions like punishment convert other social dilemmas into coordination dilemmas, expanding their importance. If individuals could assort on preferences and norms, cooperative payoffs may be increased. But often these traits are impossible to directly observe. When preferences are consciously held, individuals can merely signal them. But often individuals are not conscious of their preferences or realize their relevance too late to signal them.

One solution is the evolution of ethnic marking. Anthropologists have long argued that ethnic markers may signal group membership and improve cooperative outcomes9. An extensive formal literature has developed exploring how arbitrary signals can facilitate assortment on unconscious norms and preferences10,11,12,13,14,15. Language can also serve as a marker for social coordination15, as can visible purchasing and fashion choices16,17.

Communication is implicated in all these solutions. However, much communication is ambiguous. Is this ambiguity merely the result of constraints on the accuracy of communication? It may naïvely appear that communication should have clarity as its goal. However, purposeful ambiguity may allow signalers flexibility and plausible deniability18,19,20. Previous work has illustrated how leaders may use ambiguous language to rally diverse followers18, politicians may use vague platforms to avoid committing to specific policies21, and suitors may mask their flirtations to be viewed innocuously (or at least to provide plausible deniability) if their affections are unreciprocated19,22.

We propose more broadly that ambiguity may enable coordination and thereby enhance cooperation. Consider again the use of signals for assortment on cooperative norms. Overt signals like ethnic markers are useful in some contexts, where the adaptive problem is to delimit a set of partners who subscribe to the same broad behavioral norms and to categorically avoid interaction with those who do not. However, the “all-or-nothing” character of such signals can also foreclose valuable partnerships in different contexts. Signals that communicate similarity can also communicate difference, and this can be damaging for within-group cooperation. Individuals may benefit from not foreclosing relationships with less similar group members, so as to successfully cooperate with them in other contexts. Although any two individuals within a group can cooperate when it is mutually beneficial, pairs who are more similar can cooperate more effectively, generating larger benefits23,24,25,26,27. To be clear, we focus on those aspects of individual variation for which similarity enhances cooperation—these include norms, values, and identity28,29. Similarity on these dimensions matter even when the cooperative benefit is maximal for diverse rather than homogenous groups30. Scenarios in which individuals are unable to effectively assort on norms or attitudes are common, especially in complex societies (for example in business or education settings), but also in smaller societies. A signaling system that enables group members to communicate relative similarity only when similarity is high while retaining a shroud of ambiguity when similarity is low is likely to have been advantageous for much of human history.

Covert signaling is the transmission of information that is accurately received by its intended audience but obscured when perceived by others. A common example is “dog-whistling,” in which statements have one meaning for the public at large and a more specialized meaning for others31. Such language attempts to transmit a coded message while alienating the fewest listeners possible. A possibly much more common form of covert signaling is humor. According to the encryption model of humor32,33, a necessary component of humorous production is the presence of multiple, divergent understandings of speaker meaning, some of which are dependent on access to implicit information. Only listeners who share access to this information can “decrypt” the implicit understandings and understand the joke. Because the successful production of a joke requires access to that implicit information, humor behaves in manner similar to “digital signatures” in computer cryptography, verifying the speaker’s access to that information without explicitly stating it. Laughter may also serve as an honest signal of a shared sense of humor34, and thus of similarity on a variety of traits. While not all humor has this form, a substantial amount of spontaneous, natural humor does32,33. We allow that other types of identity signals16 may be also be covert.

We propose that covert signaling serves an important function in facilitating effective cooperation within groups by allowing individuals to assort on norms when possible while avoiding conflict with dissimilar individuals when their assistance is necessary. In the remainder of this paper, we define the logic of covert signaling. We analyze the conditions for selection to favor covert signaling over overt signaling, in which information about an individual’s traits is more transparent. Using a formal model, we show that covert signaling can be favored. It sacrifices transparency for the sake of maintaining working relationships with dissimilar individuals. Although covert signals are less accurate than overt signals, we show that the increased ambiguity can in some cases be advantageous. This characterization of covert signaling in terms of cooperative assortment may therefore help to explain forms of communication and coordination such as coded speech and humor, as well as for the flexibility of human sociality more generally. Nevertheless, covert signaling is not always advantageous. For example, if it is possible to freely choose cooperative partners from a large pool, overt signaling may be more advantageous, as individuals will be better able avoid dissimilar partners. Our model yields specific predictions about default attitudes toward strangers in the absence of clear signals, with implications for understanding differences between contemporary political affiliations.

Model Description

We consider a large population of individuals who have already solved the first-order cooperation problem of suppressing free riders, and can instead focus on maximizing the benefit generated by cooperation. Although individuals all belong to the same group, they also vary along many trait dimensions and thus share more in common with some individuals than others. Pairs of individuals whose trait profiles overlap to some threshold degree are deemed similar. Otherwise they are deemed dissimilar. Pairs of similar individuals can more effectively coordinate and obtain higher payoffs. The probability that two randomly selected individuals have similar trait profiles is given by s.

Our model proceeds in discrete generations, with each generation subdivided into two stages. In the first stage, individuals signal information about their trait profiles to the other members of the group, and those who receive signals form attitudes about their senders. In the second stage, individuals interact in one of two ways and receive payoffs conditional upon similarity as well as attitudes formed in the first stage.

Stage 1: Signaling

Individuals produce either an overt or covert signal of their underlying traits. We study a family of continuous signaling strategies in which covert signals are produced a fraction p of the time. Overt signals are received by a fraction R of the population and explicitly signal similarity or dissimilarity. Covert signals in contrast are received by a fraction r < R of the population and have content contingent upon the similarity of the sender and receiver. See Fig. 1. When the sender and receiver are similar, covert signals are received as signaling similarity. Otherwise, the receiver does not notice the signal and acts as if no signal was received at all.

Figure 1
figure 1

Illustration of the signal transmission and attitude formation components of the model. Left: A proportion s of the population is similar. Overt signals are received by a proportion R of the total population, while covert signals are received by a proportion r < R of similar individuals only. Right: Generous receivers default to a neutral attitude in the absence of a signal, while churlish receivers dislike the signaler unless they know he or she is similar.

Receivers have a default attitude towards all individuals in the population and update this attitude upon receiving a signal. Receiver strategy maps three signal states—similar, dissimilar, no signal—to an attitude. We consider three discrete attitudes: like, dislike, and neutral. These correspond to the hypothesis that covert signals help individuals to avoid being disliked while also achieving sufficient positive assortment by type. Two attitudes would be too few, because it would force agents to adopt either like or dislike as a default attitude, removing any incentive for covert signals. Three is the minimum required to model the hypothesis. We allow a continuous family of receiver strategies. Each strategy parameter aXY indicates the probability of mapping signal X {Similar, None, Dissimilar} to attitude Y {Like, Neutral, Dislike}. The total receiver strategy can be represented by a table (Table 1).

Table 1 Matrix of receiver strategies.

The three parameters in each row are constrained to sum to one. Although receiver strategies can vary continuously, we can think of the space of possible strategies as bounded by two extremes. Generous receivers default to a neutral attitude in the absence of a signal, while churlish receivers dislike the signaler unless they receive a signal of similarity. As we will see, these default receiver strategies are very important.

Stage 2: Interaction

After attitudes are established, pairs of individuals interact. There are two interaction contexts, each with its own mode of dyad formation. In a free choice scenario, dyads form conditional on the attitudes of both individuals. This is akin to those circumstances in which individuals can choose their partners based on pre-existing knowledge or established relationships. In contrast, in a forced choice scenario, an individual must seek help from whomever happens to be around and dyads are not conditional upon shared attitudes. Such scenarios are common in contemporary industrialized societies, as when two students are paired by their teacher to work on a jointly-graded project. Under these circumstances, it may be important not to have burned bridges, since this will limit the likelihood of effective coordination.

In the free choice context, dyads form based on joint attitudes, but attitudes do not directly influence payoffs. Instead, payoffs are determined by the underlying similarity of the dyad. Specifically, similar dyads receive an average payoff of 1, establishing a baseline measurement scale. Dissimilar dyads receive a payoff of zero. An individual in a dyad who likes the other individual increases the proportional odds of that dyad forming by a factor wL > 1. For each individual who dislikes the other, the proportional odds of the dyad forming are reduced by a factor wD < 1. This implies five possible kinds of dyads that might interact: LL, LN, NN, ND, and DD. The proportional odds of each, relative to random assortment, are: \({w}_{{\rm{L}}}^{2}\), wL, 1, wD, and \({w}_{D}^{2}\). These parameters are fixed features of the social environment, not aspects of strategy. This prevents strategy dynamics from generating perfect assortment. Receiver strategies that assign attitudes in ways that make good use of signal information will achieve better assortment and receive higher payoffs, conditional on the assortment constraints determined by wL and wD.

In the forced choice context, dyads form at random with respect to attitudes, but attitudes directly influence payoffs. This context entails a baseline payoff of 1 for both individuals. However, attitudes adjust payoffs, because negative attitudes make it harder to interact. When one individual dislikes the other, he makes the interaction more difficult than it must be and thereby imposes a cost −d on the other individual. When both individuals dislike one another, their difficulties act synergistically, inducing an additional cost −δ on each. This cost could result from spite or from uncontrollable inefficiency, a negative consequence of second-order common knowledge (sensu Chwe35). As we show later, these synergistic costs are very important to the overall signaling dynamics.

Let q be the relative importance of the free choice context and 1−q the relative importance of the forced choice context. These two contexts are starkly different, presenting the clearest investigation of the hypothesis that covert signals trade worse performance in assortment contexts, in which norms influence payoffs, for better performance in forced contexts in which attitudes influence payoffs. Real contexts are some mix of these extremes, and the parameter q allows us to explore the range of mixes. We note also that payoffs depend on similarity and attitudes, which result in part due to sender and receiver strategies, but the payoffs are not directly derived from those strategies. When dyads form for cooperation, both parties are senders and receivers, each having had the opportunity to signal to and receive from one another. While we can imagine cases of sender-receiver asymmetries in terms of cooperative payoffs, we have chosen not to model these in this particular analysis.

Payoff expression

With the assumptions above, we can define a general payoff expression for a rare individual with signal strategy p′ and receiver strategy matrix a′ in a population with common-type strategy {p, a}. The expected payoff to this individual is:

$$W(p^{\prime} ,{\bf{a}}^{\prime} )={\rm{\Omega }}+q{\rm{\Pr }}({\rm{similar}}|p^{\prime} ,{\bf{a}}^{\prime} )+\mathrm{(1}-q\mathrm{)(1}-{\rm{\Pr }}({\rm{disliked}}|p^{\prime} ,{\bf{a}}^{\prime} )(d+{\rm{\Pr }}({\rm{dislike}}|p^{\prime} ,{\bf{a}}^{\prime} )\delta ))$$

where Ω is an expected payoff due to other activities. The work lies in defining the probabilities Pr(similar|p′, a′), Pr(dislike|p′, a′), and Pr(disliked|p′, a′). In the mathematical appendix, we show how to define these probabilities, using the assumptions above. The resulting general payoff expression is very complicated. In the following section, however, we are nevertheless able to analyze it by considering invasion and stability of relevant combinations of signaling and receiver strategies.

Analysis and Results

The motivating hypothesis is that covert signals can proliferate because they allow sufficient assortment in the free choice context and also reduce being disliked in the forced choice context. To evaluate the logic of this idea, we proceed by asking when covert signals can be stable, when they can invade, and which receiver strategies are necessary for their stability or invasion. The following conditions favor covert signals.

  1. 1.

    Covert signals require a sufficient proportion of receivers to default to neutral attitudes. If everyone defaults to disliking, then covert signals can produce no benefit. Defaulting to neutral is favored under a wide range of conditions, provided that covert signals are sufficiently hard to receive (r is not too large) and avoidance of disliked individuals is not too efficient (wD is not too small).

  2. 2.

    Covert signals require that the cost of being disliked in the forced choice context be sufficiently high. This also means that baseline similarity in the population (s) must be sufficiently low, because this creates the risk of being disliked by dissimilar individuals.

  3. 3.

    Overt signalers cannot have too large an advantage in the free choice context. This requires that assortment with liked individuals not be too accurate. The accuracy of assortment is influenced by the reception probabilities of both signal types, R and r, as well as the proportional odds assortment factors, wL and wD.

In the remainder of this section, we derive these results and provide intuition for why they hold. First we derive simple evolutionary dynamics for these payoffs. This allows us to submit the model to invasion and stability analysis, asking both when covert signals can be stable and when they may invade a population of overt signals. Then we proceed by considering the dynamics within each interaction context—the forced choice context and the free choice context—separately. Then we summarize the joint dynamics of the full model with both contexts.

Evolutionary dynamics

We generate evolutionary dynamics for the strategy space by assuming that rare invading strategies increase in frequency when they achieve higher payoffs than a common-type strategy. We define selection gradients for both p and the attitude parameters. The gradient for signaling is defined by:

$$g(p)={\frac{\partial W(p^{\prime} ,{\bf{a}}{\boldsymbol{^{\prime} }})}{\partial p^{\prime} }|}_{p^{\prime} =p,{\bf{a}}^{\prime} ={\bf{a}}}$$

The gradient for each receiver parameter is defined similarly. A number of different mechanisms can generate such dynamics. For example, an individual could acquire its strategy from successful individuals through learning. Genetically coded strategies that influence biological fitness would also generate this dynamic. We remain agnostic about inheritance and transmission mechanism, because the point of our modeling exercise is to explore the design aspects of covert signals. This is best achieved by a form of analysis that abstracts away from transmission details, even though of course in any real system such details will turn out to influence which strategies are possible and how they evolve36. We also note that if the mechanism of transmission is cultural, replicators are not strictly necessary but can serve as meaningful approximations of lower-fidelity transmission channels37.

The potential space of receiver strategies is very large. However, the relevant space of strategies is fairly small. In the mathematical appendix, we show that payoff dynamics always favor mapping similar signals to like attitudes, implying aSL = 1. The reason is that maximizing the probability of assortment for similar individuals also maximizes payoffs, and the like attitude maximizes the probability of interacting in the free choice context. On the other hand, payoff dynamics do not always favor mapping dissimilar signals to dislike attitudes. The reason is that the forced choice context disfavors disliking whenever δ > 0. We therefore constrain further analysis to the relevant situations in which the penalty for mutual dislike, δ, is small enough that assortment incentives favor mapping dissimilar signals to dislike attitudes. We reemphasize this constraint in the discussion, because constraints of this sort help in producing predictions. Finally, with respect to default attitudes, formed when no signal is received, payoff dynamics never favor assigning like, because this erodes the value of assigning like to similar signals.

The remaining default receiver parameters are free to evolve. Therefore, for most of the analysis to follow, we assume that aSL = 1, aDD = 1, aNL = 0, aNN = 1 − α, and aND = α. This allows us to use the gradient on α, defined by:

$$g(\alpha )={\frac{\partial W(p^{\prime} ,\alpha ^{\prime} )}{\partial \alpha ^{\prime} }|}_{p^{\prime} =p,\alpha ^{\prime} =\alpha }$$

to ask when evolution favors assigning dislike to no signal, α > 0, effectively defaulting to disliking everyone. As noted earlier, it will be convenient to refer to α = 0 as the generous receiver strategy and α = 1 as the churlish receiver strategy.

Dynamics of the forced choice context

In this context, incentives favor covert signals, because such signals are better at avoiding being disliked. However, this advantage depends upon incentives favoring generous receiver strategies that do not dislike by default. That said, receiver incentives in this context always favor generous receiver strategies as long as there is any negative synergy, δ > 0. Therefore the forced choice context favors generous receivers, α = 0, which in turn favor covert signals, p = 1.

The gradients in this context are:

$${g(p)|}_{q=0}=(\alpha rs+R\mathrm{(1}-\alpha -s))(d+\delta (\alpha \mathrm{(1}-prs)+R\mathrm{(1}-p\mathrm{)(1}-\alpha -s)))$$
$${g(\alpha )|}_{q=0}=-\delta \mathrm{(1}-\mathrm{(1}-p)R-prs)(\alpha \mathrm{(1}-prs)+\mathrm{(1}-p)R\mathrm{(1}-\alpha -s))$$

These expressions seem complex at first, but produce fairly simple dynamics. First let’s ask when p can increase. When α = 0, the generous receiver strategy is common, and covert signals can increase when:

$$d+\delta \mathrm{(1}-s\mathrm{)(1}-p)R > 0$$

This is satisfied for any allowable values of the parameters. Note also that it does not require both a direct cost of being disliked, d, and a synergistic cost of mutual dislike, δ. Either one is sufficient to favor covert signals, as long as α is small. Next consider when α = 1, the churlish receiver strategy is common. Then covert signals can increase when:

$$d+\delta \mathrm{(1}-s(pr+\mathrm{(1}-p)R)) < 0$$

And this is never satisfied, for any p. Therefore covert signals are favored when 1 − α, the amount of generous receiver behavior, is sufficiently high. The threshold value is found where g(p) = 0:

$$\hat{\alpha }=\frac{1-s}{1-\frac{r}{R}s}$$

When α is above this value, overt signals are favored. When it is below it, covert signals are favored. Why? When receivers are relatively generous, and there is sufficient dissimilarity in the population, covert signals reduce costs by avoiding being disliked. If generous receivers are relatively rare, however, then covert signals can actually do worse than overt signals, because they are received less often than overt signals, r < R. If r/R is sufficiently small, overt signals are favored for a wide range of values of α. If however r = R and covert signals have no disadvantage in audience size, then overt signals are never favored in this context, no matter the amount of similarity s.

Now the crucial question is when α will fall below this threshold \(\hat{\alpha }\). The condition for payoff dynamics to favor smaller values of α, in the forced choice context only, is just δ > 0. Therefore the forced choice context always favors smaller values of α, provided there is any negative synergy between disliking and being disliked. Otherwise α is neutral and does not move at all, based on payoff dynamics. Why does this context always favor generous receivers? There is no advantage to be had in disliking people in this context, because assortment does not depend upon attitudes. Payoffs depend upon attitudes, however, and mutual dislike results in poor payoffs. Therefore, it pays to be generous in attitudes towards those one has no information about.

Dynamics of the free choice context

In the free choice context, attitudes influence assortment but do not directly influence payoffs. Instead, hidden norm similarity influences payoffs. The free choice context favors overt signals over covert signals, because overt signals increase assortment—such signals are easier to receive and more effectively discriminate similarity. The churlish receiver strategy, α = 1, is favored by this context, because it also increases assortment with similar individuals. Therefore this context is hostile to covert signals and to the receiver strategy that favors them.

To support the above statements, we demonstrate that the gradient for p in this context is always negative and that the gradient for α in this context is always positive. The gradients in this context are:

$$\begin{array}{rcl}{g(p)|}_{q=1} & = & (-1)\frac{s(1-s)}{{Z}^{2}}(1-(\alpha +(1-\alpha )(1-p)R){v}_{{\rm{D}}})\\ & & ((1-\alpha {v}_{{\rm{D}}})+(R(1-p)+pr)(\alpha {v}_{{\rm{D}}}+{v}_{{\rm{L}}}))\\ & & (R(1-\alpha {v}_{{\rm{D}}})({v}_{{\rm{D}}}+{v}_{{\rm{L}}})-r(1-{v}_{{\rm{D}}}(\alpha (1-R)+R))(\alpha {v}_{{\rm{D}}}+{v}_{{\rm{L}}}))\end{array}$$
$$\begin{array}{rcl}{g(\alpha )|}_{q=1} & = & \frac{s(1-s)}{{Z}^{2}}(1-{w}_{{\rm{D}}})(1-(1-{w}_{{\rm{D}}})(\alpha +(1-\alpha )(1-p)R))\\ & & ((1-p)R({w}_{{\rm{L}}}-{w}_{{\rm{D}}})(1-pr-(1-p)R)+pr{w}_{{\rm{L}}})\\ & & ((1-(1-p)R-pr)(1-\alpha (1-{w}_{{\rm{D}}}))+(R(1-p)+pr){w}_{{\rm{L}}})\end{array}$$

where Z is a normalizing term and the symbols vD = 1 − wD and vL = wL − 1 are used for compactness of notation. By inspection, every term after the leading (−1) in g(p) is positive, for all allowed values of the variables, and so the gradient is always negative. Similarly, every term in g(α) is positive, and so the gradient is always positive. Therefore payoff incentives in the free choice context never favor covert signals and always favor churlish receivers.

While this context always favors overt signalers and churlish receivers, the strength of the incentives may vary. First, both gradients are proportional to the variance in similarity, s(1 − s). This indicates that intermediate similarity more strongly favors overt signals, unlike the situation in the forced choice context, in which high similarity favored overt signals. The reason that the variance matters now is that payoffs depend directly upon similarity, not upon attitudes. The more variance in similarity in the population, the greater the advantage of efficient assortment.

Overt signals have the advantage in this context, because they are better at assortment. Therefore, any change in variables that reduces the accuracy of assortment overall will reduce overt signalers’ advantage. The important variables are R, the probability an overt signal is received, and the assortment proportional odds wD and wL. Reducing R reduces overt signalers’ advantage, because it makes signals less valuable overall. Making either wD or wL closer to 1 makes assortment, based on attitudes, less effective. This also reduces overt signalers’ advantage.

Joint dynamics: When do covert signals evolve?

When both contexts matter, the joint dynamics take on one of three characteristic regimes. First, covert signals both invade and are evolutionarily stable. Second, overt signals both invade and are evolutionary stable. Third, a mixed equilibrium exists at which covert and overt signals coexist in the population. Figure 2 illustrates these three regimes. These three examples all weigh the forced choice and free choice contexts equally, q = 0.5. The other parameters then shift the strength of incentives in each context to influence overall dynamics. For other values of q, the strength of incentives would have to shift as well to overcome weight given to each context.

Figure 2
figure 2

The three dynamic regimes that arise from the joint dynamics. In each plot, the paths show the evolutionary trajectories in each region of the phase space defined by the probability of covert signals (p, horizontal) and the probability of churlish receivers (α, vertical). The red points show equilibria. In all three plots: d = 0.1, δ = 0.01, q = 0.5. Left: s = 0.1, r/R = 0.25, wL = 1.1, wD = 0.9. Middle: Same as left, but wD = 0.6. Right: s = 0.2, r/R = 0.75, wL = 1.25, wD = 0.8.

When covert signals are sufficiently noisy (r/R low), similarity is sufficiently rare (s low), and assortment (wL, wD) not too efficient, covert signals can both invade and are an ESS. This situation is shown in the lefthand plot. While dynamics do not favor covert signals when α is large, near the top of the phase space, dynamics in that region favor smaller values of α. Eventually, α becomes small enough to allow covert signals to invade and reach fixation. In many cases, a small amount of churlish receiver strategy, α > 0, persists.

When the conditions outlined above are not met, incentives favor instead overt signals. The middle plot illustrates a case essentially the opposite of the one on the left. Here, wD = 0.6, making assortment efficient. When assortment is efficient, it may pay to dislike by default. This sets up a dynamic that eventually favors overt signals. While covert signals are still favored when α is low, the fact that larger values of α are favored everywhere leads eventually to invasion and fixation of overt signals.

Finally, the plot on the right shows an intermediate case, in which conditions favor both signaling strategies. Here s = 0.2, r/R = 0.75, wL = 1.25, and wD = 0.8. In this regime, the conditions that favor covert signals also favor more churlish receivers. Similarly, the conditions that favor overt signals also favor fewer churlish receivers. In total, the population comes to rest with a mixture of signaling and receiving strategies.

A more general view of the dynamics is available by considering the boundary conditions that make covert signaling an ESS. Recall that q is the relative importance of free choice scenarios. Define a threshold \(\hat{q}\) as the largest value of q for which covert signals can resist invasion by overt signals. This is defined by values of \(q=\hat{q}\) and \(\alpha =\hat{\alpha }\) that satisfy g(p)|p=1 = 0 and g(α)|p=1 = 0. These cannot in general be solved analytically. So we solve the system numerically, in order to illustrate the range of joint dynamics. Values of q less than \(\hat{q}\) make covert signals evolutionarily stable–overt signals cannot invade. Values of q greater than \(\hat{q}\) allow overt signals to invade, though we note that covert signals may nevertheless remain in the population, at an internal stable value p. Therefore \(\hat{q}\) provides a useful metric of how strongly a parameter configuration favors covert signals.

We use \(\hat{q}\) to summarize the tradeoffs in the signaling model. Recall that the cost of being disliked, d, is needed to favor covert signals. Therefore increasing d makes it easier for covert signals to be an ESS. However, the rate of similarity, s, favors overt signals. It is of value to note that d cannot compensate for s—if s is large, then steeply increasing costs d will not favor covert signals. We show this relationship in Fig. 3, lefthand plot. Each curve in this plot is a threshold \(\hat{q}\), below which covert signaling is an ESS. For each level of s, the impact of increasing d diminishes rapidly. Therefore some cost d is necessary for covert signals to evolve and be stable, but these costs cannot easily compensate when similarity is sufficiently common.

Figure 3
figure 3

Plots of the largest value of q, \(\hat{q}\), that allows covert signaling to be an ESS. Each curve represents a set of parameter values. Points below each curve make covert signals uninvadvable by overt signals. Points above each curve allow overt signaling to invade. Left: The cost of being disliked, d, for four values of the baseline rate of similarity, s. r/R = 0.5, wL = 1.1, wD = 0.9, and δ = 0.01. Right: The ratio of covert transmission rate, r, to the rate of overt transmission rate, R, for four values of wL = 1/wD. s = 0.1, R = 0.5, d = 0.1, and δ = 0.01.

Consider another important pair of dimensions: the ratio of transmission rates r/R in covert/overt signals and the efficiency of assortment, as measured by wL and wD. Figure 3, righthand side, shows \(\hat{q}\) curves for four values of wL = 1/wD, as functions of r/R. Covert signals are favored when r/R is small, as explained in the previous sections. But when assortment is very efficient, such as wL = 2 near the bottom of the plot, it requires very low values of r/R to compensate in favor of overt signaling.


The dynamics of cooperation are more complicated than implied by models in which maximal benefits accrue to those who can simply avoid free riders. Not all cooperators are equal. Individuals vary, making assortment among cooperators important. Circumstances also vary. When individuals must occasionally collaborate with those outside their circles of friends, it can be critical to avoid burning bridges with dissimilar members of one’s group. Covert signaling makes this possible, and this may be why phenomena like humor are observed in all human societies, at both small and large scales38,39.

We have shown that covert signaling is favored when forced choice scenarios are common, when similarity is low, when the cost of being disliked is high, and when covert signals are sufficiently noisy to make the meaning of a signal’s absence ambiguous. We emphasize that covert signaling can be favored even though it is less effective than overt signaling at communicating similarity, because it simultaneously avoids communicating dissimilarity. Although we have focused our attention on the initial establishment of cooperative relationships via signaling, we also note that people can change over time; they may grow more similar to one another or further apart. Covert signals may be important for the continued maintenance of a relationship, or for its reestablishment after prolonged absence.

Our model points to interesting transitions from inter- to intra-group assortment dynamics. As noted, overt signaling systems are favored when the ability to assort on attitudes is high, and when being disliked by dissimilar individuals carries little risk. This is precisely the kind of situation that is assumed to obtain in inter-group assortment, where overt signals such as ethnic markers are used to discriminate between similar and dissimilar individuals. In these between-group contexts, the difference between similar and dissimilar individuals is so great that attempting to coordinate with dissimilar others is not worth the effort, and one can afford to burn bridges with them in order to ensure that similar others are aware of their similarity13. In fact, it might be argued that burning bridges with dissimilar out-group members is as much a goal of overt signals like ethnic markers as is attracting similar in-group members.

Intra-group assortment, however, is not simply a matter of scaling down inter-group dynamics. Rather, we must already presume some baseline level of similarity resulting from inter-group assortment; for there to be a group within which to assort, some degree of similarity should already be in place that defines that group, such as the shared interaction norms, communication systems, etc. that ethnic markers are thought to ensure. The benefits of further assorting on the basis of more nuanced similarity are therefore likely to be marginal relative to random assortment within the group. When such benefits are small but the costs of being disliked are high, covert signaling is favored.

Relatedly, we emphasize that the probability of similarity, s, need not reflect some number of discrete types in the population, but can instead refer to a level of selectivity in how much a given pair of individuals must have in common to be considered “similar.” That is, s refers to the proportion of the population that could be considered similar to a focal individual in a given context, with higher values indicating a looser concept of “similar” than lower ones. Changes to s can have a significant impact on the overall dynamics of the system. When s is large, the focus is on avoiding rare dissimilar individuals, and overt signals will be favored. As s decreases, the criteria for considering a potential partner sufficiently similar to reap the benefits of enhanced coordination become stricter, and the utility of covert signals increases.

An interesting direction for future exploration is how these dynamics might respond to increased social complexity. In the larger and more complex societies associated with the development of agriculture, and particularly in the last few centuries, interactions with strangers are more frequent and occur across many contexts, necessitating strategies for temporary assortment28,29,40. Consequently, expected similarity will be lower, signal fidelity will be noisier, and assortment on attitudes will be less efficient. These are precisely the conditions in our model associated with the evolution of covert signaling. In large, diverse populations, covert signaling may sustain social cohesion and prevent burning bridges between individuals or groups that must occasionally collaborate. That said, covert signals are not necessarily rare in small-scale societies. Our own experiences in the field and conversations with other researchers indicate that they occur with some regularity. Our model can help to identify contexts in which covert signaling should or should not be expected.

Identity signaling, whether overt social markers or more covert communication, can be used by individuals looking to find others similar to themselves and to avoid being mistaken for something they are not16,28,29. If the need to cooperate with dissimilar individuals is unlikely or if similar individuals are common, then overt declarations of identity should be expected. On the other hand, if burning bridges is both costly and likely given an overt signaling strategy, we should expect identity to be signaled much more subtly. In reality, increasing levels of specificity may be signaled in increasingly covert ways, and without all received signals actively inducing a change in disposition toward the sender. A related signaling strategy, not covered by our model, might facilitate liking between similar individuals but only indifference otherwise. Casual, coarse-grain identity signaling may often take this form, as in cases of fashion adoption or pop culture allegiances. It would be interesting to investigate how common these “semi-covert” signals are in small-scale communities, as they seem pervasive in complex industrialized societies.

Our model additionally helps make sense of findings from political psychology suggesting that people in the industrialized West who identify as conservative or right-leaning tend to view ambiguous people as hostile, while those identifying as liberal or left-leaning tend view ambiguous people as neutral41,42,43. In our model, a default attitude to dislike was linked with overt signaling, which we in turn associate with the preservation of strong between-group boundaries. In contrast, a default attitude of neutral was associated with covert signaling, and with the avoidance of burned bridges to facilitate more widespread within-group cooperation. As a broad generalization, our analysis suggests that conservatives may be operating under the assumptions of stronger ingroup/outgroup boundaries, increased expectations of similarity toward those they signal, and lower costs to being disliked by dissimilar individuals. In contrast, liberals may be operating under the assumptions of a more broadly defined ingroup, limited expectations for similarity toward those they signal, and higher costs to being disliked by dissimilar individuals. Lending modest support to this idea is the finding that conservatives appear to have a stronger “need for cognitive closure” (reviewed in ref.42), which is associated with, among other things, a distaste for uncertainty and ambiguity. The modeling framework we present in this paper may thus be useful in understanding patterns of differences between groups, including but not limited to political affiliation.

Ours is the first formal model of covert signaling. As such, it necessarily involves simplifying assumptions concerning the nature of signaling and cooperative assortment. For example, while we have allowed for covert signaling errors in the form of failed transmission to similar individuals, we have not included the converse form of error, where dissimilar individuals are able to detect the signal some of the time, and therefore update their disposition to disliking the covert signaler. Adding an additional parameter to account for this possibility does not qualitatively change our analysis. But it may create conditions where a non-signaling “quiet” strategy could invade. In addition, we ignore the possibility of strategic action on the part of the receiver to either improve coordination or to avoid partnering with dissimilar individuals entirely. We assumed that a pairing of dissimilar partners would simply lead to an unsuccessful collaboration, but such a pairing might instead lead each individual to pursue more individualistic interests. At the population level, we assumed that all individuals had an equal probability of encountering similar individuals, and that all similar and dissimilar individuals were equivalent. In reality, some individuals may be more or less likely to encounter similar individuals, perhaps related to differences in the tendency to be conformity- versus distinctiveness-seeking44, or reflecting minority-majority dynamics45,46. Exploration of this variation opens the door to evaluating signaling and assortment strategies in stratified groups. All of these limitations provide avenues for future research that will build upon the central findings reported here.

In a population where individuals vary and burning bridges is costly, overtly announcing precisely where one stands entails venturing into a zone of danger. Covert signaling, as in the case of humor or otherwise encrypted language, allows individuals to effectively assort when possible while avoiding burned bridges when the situation calls for partnerships of necessity.