Introduction

As is well known, confidence leads to success in a wide range of domains including business, job performance and competitions for champions1,2,3. Ironically, when confidence is relatively high, the whole human beings fall victim to overconfidence, which is considered to be the most “pervasive and potentially catastrophic” of all the cognitive biases by some psychologists4,5. Such exaggerated confidence is often blamed for labor strikes, wars and stock market bubbles6,7,8. But like a coin has two sides, some other authors hold completely different opinions, insisting on its advantages in boosting morale, ambition, resolve and persistence so as to increase the probability of success9,10,11,12. Regardless of different views, the prevalence of overconfidence is an undeniable fact. For example, by asking people how confident they are of specific beliefs, researchers find that confidence systematically exceeds accuracy, implying people are more sure that they are correct than they deserve to be13,14. Further observations also show that individuals incline to form overly positive judgment of their abilities, which is called “better-than-average effect”15.

Overconfidence is a fascinating area of research, since it touches upon many different disciplines ranging from sociology to economics. Mounting effort have been devoted to unveiling the reasons for the pervasiveness of this faulty psychology. Prior work have mostly offered three explanations. The first posits a motivated bias: Individuals are driven to be overconfident because it provides them with psychological benefits16. The second explanation highlights the cognitive processes that may sometimes produce directional biases17. An additional possibility is that overconfidence enhances the person's social status18. Apart from the above results, Johnson and Fowler have presented an evolution model, demonstrating that overconfidence often prevails over accurate assessment as long as benefits from contested resources are sufficiently large compared with the cost of competition12. In fact, Johnson and his coauthors used standard assumptions about evolution dynamics19, under which whoever get larger fitness are more likely to survive and better strategies will be imitated.

Treating whether to be overconfident or not as a strategy, we can associate the study of this psychology with evolution game theory which was originally introduced as a tool for studying animal behavior20,21, but has become a general approach that transcends almost every aspect of evolutionary biology22. The traditional approach of evolutionary game theory uses deterministic dynamics to describe infinitely large, well-mixed populations23,24. To understand evolutionary game dynamics in finite-sized populations, a stochastic approach is developed25,26,27,28. Particularly, network has been found to be an important ingredient affecting evolutionary game dynamics. While pioneering works mainly focus on spatial games on regular lattices27,29,30,31,32,33,34,35,36,37, more studies have been expanded to general structured populations, such as small-world38,39,40,41,42,scale-free networks43,44,45,46,47 and random networks48,49. Taking into account the conspicuous effect of population structure in evolutionary dynamics, one may then ask how psychologies or emotions transmit between interacting individuals who site on specific networks and whether the transmission has profound significance. In fact, by reconsidering the concept of imitation, Szolnoki et al. find that copying emotions such as goodwill and envy instead of pure strategies from more successful players can even resolve social dilemmas on structured populations, additionally pointing out that the emotional profile depends not only on the payoffs but also on the heterogeneity of the interaction network50,51.

Inspired by all the previous works above, we propose a spatial resource competition model, simple but general, aiming at dealing with the puzzle how overconfidence spreads and fixes in structured populations. Square lattice prescribes that each individual competes with his 4 nearest neighbors to collect available resources. Then players are more likely to learn the psychology of the successful, despite their real capabilities. Different from the model in Ref12, bluffing behavior has been incorporated into our model. Although bluffing is often argued to be unstable in natural selection9,12, it is usually not reliably distinguished from the true ability52. In particular, successful disguises (bluffing) and real capacity need a much longer time scale to evolve than that of overconfidence, so they are treated as constant indices in our model. We will show in this study that, moderate values of resource-to-cost ratio r/c create a more favorable environment for the evolution of overconfidence in structured population. More importantly, the intricate relationship between bluffing and overconfidence may be a potential driving force for the high level of overconfidence. The simulation and theoretical analysis details are described in the Methods.

Results

We begin by presenting overconfidence level fO as a function of resource-to-cost ratio r/c while initial fractions of different types of individuals are equal (see Fig. 1(a)). It is shown that moderate values, rather than high values of r/c, boost fO effectively in structured populations. Interestingly, regardless of the values of α and β, the plateau of high overconfidence level sharply drops to a lower platform once r/c exceeds the critical value 2.0. This can be attributed to the fact that considerable amount of resource stimulates everyone to make a claim actively, leading to intensive conflicts that help one to recognize his real capability. Moreover, the existence of bluffing plays a key role on the prosperity of overconfidence, especially when r/c is high. As is shown in Fig. 1(a), without bluffing (β = 0), fO is stabilized at a much lower level than that when bluffing exists for r/c > 2.0. Players seem to be more apt to be overconfident when facing opponents who are not as powerful as they display, which accords with common beliefs. However, excessive overconfidence intensity might inhibit its spreading among the population. For instance, when α ≥ 0.5 one individual overestimates himself so seriously that he can even challenge opponents from higher rank of real capability (the rank gap is 0.5) and this choice never pays off.

Figure 1
figure 1

(a) Stable fraction of overconfident individuals fO in structured population and (b) equilibrium frequency of overconfident individuals x* in infinite well-mixed population as a function of resource-to-cost ratio r/c for different values of α and β. Data presented in Panel (a) are obtained by means of Monte Carlo simulations, while in Panel (b) are obtained by means of extended replicator dynamic (see Methods for details). Hollow symbols in Panel (a) and dash line in Panel (b) correspond to cases where bluffing does not exist (β = 0). Other parameters: fIO = 0.5, fB = 0.5.

As a comparison, we also consider the evolution of overconfidence in well-mixed populations (see Fig. 1(b)), by means of extended replicator dynamic (see Methods for details). It is found that network structure works as a “double edged sword”: When r/c is relatively small, it plays a role as “catalyst” to impel the spreading of overconfidence; When resource is rich (larger r/c), it acts as “depressant” to cool down the atmosphere. The primary cause lies in the fact that network produces a local “celebrity effect”53,54. Specifically, individuals with the highest real capability get more payoff in each game round and thus do not incline to change their overconfidence states. In networks, they act as hubs to influence their neighbors, forming non-overconfidence clusters to resist invasions (detailed explanations will be discussed later). Therefore no matter how large r/c is, the existing unbiased elites ensure that overconfidence cannot dominate the whole population. In contrast, in well-mixed populations individuals always imitate the “fittest” strategy globally, thereby x* is more sensitive to the values of r/c than fO does and x* will finally reach a fully overconfident state if resource is rich enough (r/c > 4.0) when bluffing exists. However, common traits in both structured and unstructured population exists that bluffing promotes overconfidence level (see Fig. 1 (a) and (b)).

We next evaluate the impact of overconfidence intensity α and bluffing intensity β on overconfidence level fO, as presented in Fig. 2. When r/c = 1.5 (see Fig. 2(a)), fO sustains a high level over the whole parameter plane of α and β. This confirms that moderate values of resource-to-cost ratio elevate overconfidence, which is robust to changes in overconfidence intensity and bluffing intensity. However, when r/c = 3 (a large resource-to-cost ratio, see Fig. 2(b)), distinction in fOs with respect to α and β becomes more conspicuous. The maximum fO is reached within the region where large α is accompanied by large β. Note that for any fixed value of β, fO is always larger within the region above diagonal (i.e., α > β) than below (i.e., α < β). In other words, an overconfidence intensity slightly larger than bluffing intensity induces higher overconfidence level. In this sense, overconfidence is evolved as a tool to conquer the negative effect of others' bluffing. One should believe himself more than believing others9.

Figure 2
figure 2

The 3-D plots showing the stationary fraction of overconfident individuals fO in dependence on both α and β for (a) r/c = 1.5 and (b) r/c = 3.

The minimum fO is about 0.77 in Panel (a) showing that overconfidence is always favored under moderate values of r/c. When r/c is high, fO becomes more strongly dependent on the combination of α and β. The maximum fO is reached when both α and β are high, indicating that higher values of overconfidence is evolved in response to higher bluffing activities.

We have further studied the impact of initial frequencies of overconfident individuals fIO and bluffing individuals fB on the final overconfidence level fO, as presented in Fig. 3. It can be observed that at any given value of fIO, increasing fB will elevate fO. A larger fB means that more individuals display an exaggerated capability. When surrounded by more bluffing neighbors, one is more likely to be profitable if he is overconfident. Meanwhile, for any given fB, fO also monotonously increases with fIO. Unlike the influence of initial frequency on the evolution of cooperation in structured population55, we find that the system always converges to a higher overconfidence level compared to the initial one (i.e., fO > fIO), showing that overconfidence is constantly favored by natural selection once bluffing exists. Our investigation shows that the prosperity of bluffing stimulates overconfidence to prevail, given that α > β and r/c > 2.0.

Figure 3
figure 3

Influence of initial overconfidence level fIO and initial bluffing level fB on stationary fraction of overconfident individuals fO in the contour form.

Parameters are: α = 0.4, β = 0.3 and r/c = 2.5. Similar results can be obtained whenever α > β and r/c > 2.0. Overall, larger fIO corresponds to higher fO. For any given value of fIO, increasing fB further promotes fO, showing that overconfidence is favored when bluffing exists.

For better understanding the influence of network structure on the evolution of overconfidence, we present in Fig. 4 some typical snapshots of strategy patterns in time evolution. It can be observed that, regardless of the different initial distributions, overconfidence spreads quickly and flourishes. Individuals who have high real ability (H), considered as “elites” here, play decisive roles in guiding social mentality and thus they become the core of the clusters. As evolution proceeds, the number of overconfident elites (blue) increases, indicating that they have an advantage over the unbiased elites (yellow). However, the latter can also take advantage of the network structure to form some solid clusters (yellow + red) to resist invasion, which explains why the unbiased ones will not die out in the long run in structured populations. Notably, the overconfidence clusters (blue + green) tend to connect with each other, forming larger ones to expand their territory, especially when initial fraction of overconfidence is relatively high (fIO = 0.3). Note that besides network reciprocity29,56,57, local “celebrity effect” also plays important roles. Individuals are susceptible to neighbor elites' psychology. These followers of overconfident or unbiased elites work as “buffer” in between, preventing the direct confrontation of two types of elites.

Figure 4
figure 4

Typical snapshots of spatial patterns formed by different types of individuals under initial overconfidence levels fIO = 0.1 ((a)–(c)) and fIO = 0.3 ((d)–(f)) at different time steps.

Overconfident ones are colored blue (high real ability) and green (middle and low real ability). Unbiased ones are colored yellow (high real ability) and red (middle and low real ability). The stationary fractions of overconfident ones are 42.5% for fIO = 0.1 and 62.6% for fIO = 0.3. In structured population, high real ability but unbiased individuals form clusters to resist the invasion by overconfident ones, explaining the survival of unbiased ones even with high r/c. The size of the square lattices is 100 × 100. Other parameters: α = 0.4, β = 0.3 and r/c = 3.

Moreover, who benefit more from such a resource competition game draws our attention and thus we compare the average payoffs of different types of individuals in Fig. 5. Firstly, real ability is fatal in deciding one's faith in resource competition. The higher one's real ability is, the higher average payoff one acquires. Just as illustrated in Fig. 5 (a), there exist clear-cut gaps between income hierarchies and the lower level can never outcompete the upper one, no matter how the initial frequencies of bluffing and overconfident individuals are. Secondly, whether bluffing ones perform much better than unbiased ones also depends on their real abilities. The low capability bluffing ones have the whip hand over those who do not bluff, especially when bluffing and overconfidence are rare initially (see the left panel of Fig. 5(b)). While for middle and high ability, the advantage of bluffing is relatively low (see the middle and right panel of Fig. 5 (b)). Common for individuals of any level of real ability, bluffing is more effective when the others do not bluff and meanwhile do not develop the overconfidence, the useful tool countervailing bluffing.

Figure 5
figure 5

Comparison of the payoffs of individuals with different real abilities.

(a) The 3-D plot showing the normalized average accumulated payoffs of individuals with high (H, crimson), middle (M, light green) and low(L, blue) real abilities respectively, as a function of fIO and fB. H ranges from 0.9350 to 1; M ranges from 0.4859 to 0.5521; L ranges from 0.0443 to 0.1065. (b) Average payoff ratio between the individuals who bluff and those who do not (R = Payof fbluff/Payof fnon–bluff) as a function of fIO and fB, when their real abilities are low (RL), middle (RM) and high(RH) respectively. RL ranges from 1.03 to 8.62; RM ranges from 1.01 to 1.634; RH ranges from 1.002 to 1.322. Data point is obtained by averaging over a period of 10000 time steps and we have checked that longer time period does not qualitatively change the results. Other parameters: α = 0.4, β = 0.3 and r/c = 3.

Furthermore, it is worth studying the situation not only overconfidence state (OS), but also bluffing state (BS) could be subject to evolution process. In other words, we consider an extended model that individuals can mimic any feature of others except the real ability. Different from the case with fixed bluffing level (see Fig. 1), where fO suddenly drops once r/c > 2.0, here fO monotonously increases with r/c (see Fig. 6). And in the latter case, fO is higher than the former. Meanwhile, fB evolves to a higher value relative to the initial one (0.5) when r/c > 2. And fB is further promoted with larger bluffing intensity β (see Fig. 6(b)). This may provide an explanation why fO evolves to a higher level and does not decrease when r/c > 2. Just like what we have shown in the scenario with fixed fB (see Fig. 3), a larger percentage of bluffing individuals induces higher overconfidence level. It seems that overconfidence may gain more popularity if the skill of bluffing can be learned easily.

Figure 6
figure 6

Stable fractions of overconfident individuals fO and bluffing individuals fB as a function of r/c for different values of α and β, when players can learn both the overconfidence state and the bluffing state of others.

In Panel (a), α = 0.4 and β = 0.3. Bluffing prevails once r/c exceeds the critical value 2.0. Overconfidence does not drop to lower level with increasing r/c and remains above 0.89. In Panel (b), α = 0.7 and β = 0.6. Bluffing almost dominate the whole range of r/c. Overconfidence stays above 0.98 for r/c > 0.5. Other parameters: fIO = 0.5 and fIB = 0.5.

Discussion

In summary, we have investigated the impact of spatial structure and bluffing on the evolution of overconfidence in the resource competition game. In contrast to previous results in well-mixed populations12, the stable state is no longer composed of monomorphic overconfident individuals when the resource-to-cost ratio r/c is sufficiently large. Overconfidence never dominates the whole population and is more adaptive for moderate r/c. This is partially ascribed to the local celebrity effect caused by the population structure and real ability hierarchies. Furthermore, a potential benefit far exceeding the cost of competition stimulates more conflicts12,58, which in turn depresses the psychology of self-deception, i.e., the overconfidence.

The existence of bluffing behavior usually leads to ambiguity in one's perception about other's real ability. Our study shows that bluffing promotes overconfidence. Overconfidence is beneficial for individuals in an environment where bluffing is prevalent, because it makes them dare claim and compete for the resource, which would not happen without such “erroneous self-deception”. Therefore overconfidence is just like a tool to conquer the negative effect of bluffing. This conclusion is robust to initial distributions or intensity of overconfidence and bluffing. Furthermore, the advantage of bluffing is more significant within communities of relatively low real abilities. If bluffing can be learned by others, it will become popular among individuals, further promoting overconfidence level.

All in all, the spreading of overconfidence is investigated in a more realistic model and we propose that the existence of bluffing is the key factor for the prosperity of the bias towards exaggerated personal qualities and capabilities. We hope that these observations could provide some insight into understanding this widespread erroneous psychology in human society.

Methods

Mathematical model

Within this work, the basic model is a resource competition game (RCG). Without loss of generality, an individual i in our model is characterized by real capability θi, overconfidence intensity α and bluffing intensity β, where θi Θ = {1, 1.5, 2}, corresponding to low (L), middle (M) and high (H) real capabilities, respectively. The value of α represents the perception error of a player about his own ability when he is overconfident, while β represents how much stronger a player displays than he really is. Whoever dare compete for resource is at least confident, so we assume no self-abasement in this model. In a RCG, i belives he owns a “self-perceived capability” ki as:

where ui = 1 if i is overconfident and 0 otherwise. He also exhibits his “displaying capability” mi as:

where vi = 1 if i bluffs and 0 otherwise. Supposing a resource r is potentially available to individuals that claim it and two players i and j both can claim this resource, a RCG takes place. If neither individual claims the resource, then no fitness is gained. If only one individual makes a claim, then it acquires the resource and gains fitness r and the other individual gains nothing. If both individuals claim the resource, then each individual pays a cost c due to the conflict between them and the results depend on their real capabilities. Therein player i faced with j gains a payoff Pij, which can be calculated as:

  • (1) If ki > mj and kj < mi, i claims but j does not and thus Pij = r.

  • (2) If ki < mj, i will not claim and Pij = 0.

  • (3) If ki > mj and kj > mi, a conflict happens between i and j, whose real capabilities determine what they can get: If θi > θj, Pij = rc; If θi < θj, Pij = −c; If θi = θj, Pij = r/2 − c, i.e., r is equally distributed between them after a battle.

  • (4) If ki = mj, the relation between r and c should be considered: If r ≤ 2c, i will not claim because the upcoming conflict promises no positive benefit and thus Pij = 0; If r > 2c and kj < mi, i gains the whole resource without any payout and Pij = r; If r > 2c and kjmi, the clash is inevitable and therefore Pij can be calculated as in case (3).

Players are located on the vertices of L × L square lattices with periodic boundary conditions. Capability rank and bluffing state (BS) are considered to be natural instincts and thus are constant throughout the evolution. Individuals can only imitate overconfidence states (OS) of others. In an extended model we also explore the case where individuals can mimic both OS and BS of others. fIB denotes the fraction of individuals that bluff. Monte Carlo simulations of the game are carried out comprising two elementary steps. Firstly, each player i collects its payoff Pi by interacting with its four nearest neighbors through RCG. The payoff of player i can be calculated as:

where Ω(i) represents players in i's neighborhood. Secondly, players consider changing their OS synchronously. Specifically, player i adopts the OS of the randomly selected neighbor j with the probability

where K characterizes the level of uncertainty in strategy adoption59. Without loss of generality we use K = 0.1 and we have confirmed that different values of K do not qualitatively change the simulation results.

Extended replicator dynamic

For comparison, evolutionary dynamics of overconfidence in well-mixed population is also investigated. In replicator dynamics, a differential equation governs the evolution of the densities of different strategies, lending to the notion of evolutionary stability23,60,61. Further, enlightened by Ref62, we can consider the whole population to be specially community-structured63,64,65,66,67,68, depending on different character profiles of individuals. There are 6 character profiles, specified by a 2-bit feature name [a1a2] where a1 [L, M, H] indicates the capability level of this individual is low, middle, or high, while a2 [B, N] indicates whether he bluffs or not, indeed forming 6 communities. Players imitate overconfidence states of others but cannot change their characters and thus the size of each community is constant and the members cannot migrate among communities. Strategies A (or B) indicates that an individual is overconfident (or not), while xi and yi (i = LB, LN, MB, MN, HB, HN) denote the fractions of individuals adopting strategy A and B in community i respectively. Here xi + yi = 1. We use ρi to denote the fraction of community i members in the whole population and then . Hence any individual interacts with a player from community i with probability ρi according to the well-mixed assumption. In each community, the standard replicator equation takes the form of:

where ηAi and ηBi are the fitness of overconfident and unbiased people in community i respectively and is the average fitness of individuals in community i. By solving above equations, we can obtain , the stable fixed state of overconfidence (SFSO) of community i. Hence we can derive the SFSO of the entire system as

Assume that individuals are randomly distributed among the communities, i.e., for each community i(i = LB, LN, MB, MN, HB, HN), ρi = 1/6 and that each one has equal probability 0.5 to be overconfident or not. Thus initial overconfidence level for each community is . Taking α = 0.4 and β = 0.3 as an example, we can obtain the dynamic equations as:

The stable equilibrium point of the above differential equations, i.e., the SFSO, is:

Different SFSOs with respect to other overconfidence and bluffing intensities are shown in the main text.