Main

On 6 January 2021, the US Congress assembled in the Capitol for the electoral vote count that would formalize Joe Biden’s election as the new president. Outside the Capitol building, thousands of Americans participated in a riot motivated by claims that the election was fraudulent. Many of them broke into the Capitol building, endangering members of Congress and forcing an evacuation. These events were preceded, and followed, by widespread attempts to debunk beliefs in election fraud with substantial contradictory evidence. These attempts largely failed.

How do people update beliefs in light of new information, and why do false beliefs often persist despite countervailing evidence? An epistemically rational agent makes evidence-based inferences guided by the pursuit of accuracy1. However, belief updating can exhibit desirability effects, such that, given evidence, beliefs are more often updated towards a desired state. For example, people update their beliefs about an expected election winner less following polls that show a projected loss (versus win) for their preferred candidate, discounting undesired outcomes2. In politics, desirability effects can take the form of partisan asymmetries, with members of opposing groups interpreting evidence in a way that favours their party. Such effects can lead to belief polarization, with partisan groups increasingly diverging in their beliefs3,4,5, even if they are exposed to similar evidence6,7,8. Beyond politics, this dynamic extends to many of the most important issues of our time, from vaccine uptake to climate change9,10,11,12,13.

According to current theories, these desirability effects are the result of irrational, biased reasoning (‘directional motivated reasoning’14,15): beliefs are selectively updated to support desired conclusions. Often the conclusions are those that help maintain positive emotions, self-esteem and social identities4,16. Put simply, people ‘believe what they want to believe’. If this is the case, the role of evidence in combatting false beliefs is unclear.

When studying beliefs in isolation, as in most previous work on belief updating, observations of updating in opposing directions given the same evidence17 or preferential updating of desired beliefs15 seem irrational. We consider an alternative possibility based on a systems perspective, in which multiple beliefs combine or compete to explain observed evidence18,19,20. Desirability effects might emerge from the dynamics of rational updating over this system. Belief systems are typically modelled in a Bayesian framework (for example, ‘belief nets’21). As Bayesian belief updating strictly follows normative rules of probabilistic inference, Bayesian models describe the behaviour of rational agents free of bias towards a preferred belief (that is, unbiased belief updating).

Surprisingly, under some conditions, Bayesian belief updating can produce desirability effects as an emergent property. Consider a simple belief system in which an official election outcome is attributed to two possible causes: (1) winning the true (fraudless) vote or (2) winning by fraud. Upon observing the outcome, a rational Bayesian agent will update both causal beliefs in proportion to how diagnostic the observed outcome is about each causal variable (for mathematical details, see Methods). This process entails two predictions that are perhaps surprising when considering beliefs in isolation: (1) beliefs (for example, in fraud) can be updated in the absence of direct evidence for or against them and (2) different people can update a belief in opposing directions given the same evidence, because updates depend not only on the prior belief and outcome, but also on other beliefs20. For example, the more strongly a losing candidate was expected to win beforehand, the more likely an alternative explanation such as fraud becomes. Therefore, if prior beliefs are polarized in a partisan fashion (for example, both sides expect their candidate to win the true vote), desirability effects in fraud beliefs could emerge as a result of rational, Bayesian updating.
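For intuition, consider a minimal numerical sketch (our illustration, not the authors' code; the closed-form posterior is derived in Methods). Here v is the prior probability that one's preferred candidate wins the true vote, f the prior probability of substantial fraud and c the probability that fraud, if present, favours the preferred candidate:

```python
# Minimal sketch (ours) of the closed-form posterior derived in Methods.
# v = prior probability the preferred candidate wins the true vote,
# f = prior probability of substantial fraud,
# c = probability that fraud (if present) favours the preferred candidate.

def posterior_fraud_after_loss(v, f, c):
    """Pr[F = 1 | preferred candidate officially loses]."""
    return f * (1 - c) / (1 - v + f * (v - c))

# Two observers see the SAME outcome (their candidate loses) with the same
# prior fraud belief, but differ in how strongly they expected a true-vote win:
print(posterior_fraud_after_loss(v=0.9, f=0.3, c=0.2))  # ~0.77: surprising loss
print(posterior_fraud_after_loss(v=0.5, f=0.3, c=0.2))  # ~0.41: unsurprising loss
```

The more unexpected the loss (higher v), the more posterior probability shifts to the alternative explanation, fraud.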

Distinguishing between these two possible mechanisms—biased updating arising from directional motivated reasoning and rational updating of polarized prior beliefs—has important implications for real-world policies and campaigns for combatting false beliefs across domains. In particular, they point to different strategies, both of which go beyond straightforward attempts to debunk false beliefs with counter-evidence. If people are biased to believe what they want to believe, the value of new evidence is limited, and interventions might best focus on incentives or on inducing rational thinking. Alternatively, if updating is rational but depends on a system of interrelated beliefs, then one needs to identify the key beliefs and target them simultaneously to elicit change.

In this Article, we examine the dynamics of belief polarization in a high-stakes, real-world setting: fraud beliefs in the context of the 2020 US presidential election. We address three key questions. First, are fraud beliefs affected simply by presentation of electoral outcomes, in the absence of any evidence for or against fraud per se? Second, are beliefs about fraud in the 2020 election subject to desirability effects—that is, partisan asymmetry in how the election outcome is interpreted? Third, if these two effects are observed, are they better explained by directional motivated reasoning (believing in fraud to protect a desired belief about the true winner) or by rational updating of polarized priors (believing in fraud to explain an otherwise unexpected result)?

To address these questions, we surveyed a large online sample of 1,642 Americans from Amazon’s Mechanical Turk (MTurk) via Cloud Research22 (‘Use of MTurk for data collection’ in Supplementary Information) during the 2020 US presidential election on 4–5 November, while the winner was not yet determined and votes were still being counted in key states. The sample included participants from every state, with a distribution across states that strongly correlated with the US population distribution (on the basis of the 2020 US census: Pearson’s r = 0.938, 95% confidence interval (CI) 0.893 to 0.964, P < 0.001) and ages ranging from 18 to 84 years (mean 41.6 (s.d. 13.1) years; Extended Data Fig. 1). Participants reported their preference for president (Biden or Trump) and strength of preference, the probability of a win by their preferred candidate (prior win belief), and the probability that election fraud would play a significant role in the outcome (prior fraud belief; Supplementary Table 1). We then presented each participant with a hypothetical map showing the winner in each state, randomized to indicate either a Republican (Trump) or Democratic (Biden) victory (Fig. 1a,b). Participants then reported the probability that fraud played a significant role in the outcome (posterior fraud belief). Comparing posterior and prior fraud beliefs allowed us to test for desirability effects on belief updating. Approximately 11 weeks later, a subsample of the same participants (N = 828) completed a follow-up survey reporting their beliefs about the true vote winner and fraud beneficiary (Supplementary Tables 2 and 3). This allowed us to formulate a Bayesian model under which participants postulate two possible causes of the election outcome (a true win or fraud) and to test whether it accurately predicted individual differences in fraud belief updates.

Fig. 1: Empirical fraud belief update given hypothetical election outcome maps.

a,b, Participants were randomly presented with either a map showing a Republican win (Trump, red) (a) or a map showing a Democratic win (Biden, blue) (b). The bottom scale in each map shows the distribution of electoral votes (the party receiving more than 270 electoral votes wins the presidential election). Maps were created by flipping four out of the six states that were considered critical at the time of the survey with votes still being counted, using the election tool provided by the fivethirtyeight.com website: https://projects.fivethirtyeight.com/trump-biden-election-map/. The maps presented here are an illustration of the maps used in the experiment. c, Prior and posterior (following presentation of the hypothetical maps) fraud beliefs are shown separately for Democrats (top, blue, N = 1,032 participants) and Republicans (bottom, red, N = 610 participants), for each scenario. Points represent single participants, and error bars represent s.e.m. across participants. Fraud beliefs increased following a hypothetical loss and decreased following a win, except for Republicans following a loss, potentially because many Republican participants’ beliefs were already maximal before the hypothetical outcome (that is, a ceiling effect). d, Fraud belief updates, aggregated across partisan groups, as a function of strength of preference towards the preferred candidate (x axis), separately for preferred candidate loss (green) and win (purple). Increases in fraud beliefs following a loss and decreases following a win (as shown in c) both scale with preference strength: updates are close to 0 with weak preferences and increase in magnitude with stronger preferences (neutral, 0; the crossover at the left is an artefact of linearity constraints). e, Fraud belief updates as a function of the prior win belief (probability of the preferred candidate’s win; equal probability, 50). Updates are larger when the outcome is less expected (that is, when the prior win belief is higher but the preferred candidate loses, or when the prior win belief is lower but the preferred candidate wins). Points represent single participants. The lines represent the linear fit, and the grey shading represents 95% CIs of the linear fit.

We found an asymmetric updating of fraud beliefs (Fig. 1) depending on the partisan group (direction desirability effect), which was stronger in those with stronger preferences (magnitude desirability effect). The Bayesian model accurately predicted these patterns (Fig. 2) and outperformed models that either lacked the full system of beliefs or included an extra Bayesian motivated reasoning mechanism (Fig. 3). Together, these results suggest that strong partisan asymmetries in how the election outcome is interpreted are consistent with a rational updating process operating on politically polarized prior beliefs.

Fig. 2: Bayesian model, priors and predictions.

a, The generative model. The observer begins with a prior belief state comprising the probability v that candidate A (the participant’s preferred candidate) will win the true vote, the probability f of substantial fraud and the probability c that fraud (if present) favours candidate A. After observing the official election outcome, these beliefs are updated according to Bayes’ rule (Methods). b,c, Histograms of participants’ empirically measured prior beliefs (from the follow-up sample) regarding the true (fraudless) winner (v) (b) and the candidate benefiting more from fraud (c) (c). Both beliefs strongly diverge on the basis of partisan preferences. Dem, Democratic; Rep, Republican. d, Heat map of the model’s predicted difference in posterior fraud belief between loss and win scenarios, as a function of v and c, when prior fraud belief f = 0.5. Larger values of v indicate a stronger belief that candidate A will win the true vote. Larger values of c indicate a stronger belief that fraud favours candidate A. Values of 0.5 reflect no fraud, or fraud from both sides with neither candidate favoured. We define A as the preferred candidate of each partisan group for convenience, a choice of notation that does not affect the model’s structure or predictions. Thus, high v and low c (shaded in green) indicate prior beliefs that the preferred candidate will win the true vote but the opposing side will cheat. In this part of the prior belief space, fraud beliefs increase for a loss and decrease for a win (a direction desirability effect). Most participants are located in this space, and this was especially true for participants with stronger preferences (a magnitude desirability effect; Extended Data Fig. 7). Coloured points represent single participants based on empirically measured priors (binned and then jittered within each bin for visibility).

Fig. 3: Predicted fraud belief update as a function of preference strength across models.

Paralleling the empirical results from Fig. 1d, the model-based predictions of the fraud belief update for each participant are presented as a function of the empirical strength of preference towards the preferred candidate. Dashed lines show linear fits to models’ predictions, with 95% confidence regions. For comparison, solid lines show the linear fits of the observed empirical patterns (from Fig. 1d). The original Bayesian model accounts for all desirability effects: increase of fraud belief following a loss, decrease following a win and scaling of both with preference. The outcome desirability model improved the fit by tempering the over-prediction of fraud belief increase following a loss. The fraud-only model does not account for the increase of fraud belief following a loss, and the random beneficiary model does not account for the decrease of fraud belief following a win. Note that the plot for the Bayesian model is also the plot for the hypothesis desirability model, because fitting the latter to the data yields α = 0.

Results

Prior fraud and win beliefs

Participants’ preferences for one candidate were strong across the sample, with 52.7% indicating the strongest preference and only 4.8% in the lower half of the scale. Preferences were stronger among Biden’s supporters than among Trump’s (henceforth labelled Democrats and Republicans, respectively; Democrats: mean 89.6 (s.d. 18.4) on a 100-point scale; Republicans: mean 85.8 (s.d. 19.6); two-sample t-test: 95% CI 1.915 to 5.689, t(1640) = 3.95, P < 0.001; Extended Data Fig. 2a). The prior win beliefs indicated that most participants believed their preferred candidate would win the election, with stronger beliefs among Biden supporters than among Trump supporters, in line with pre-election polls as well as the partial results from some states available at the time of the survey (estimated probability of preferred candidate’s win: Democrats, mean 77.5% (s.d. 15.5%); Republicans, mean 50.3% (s.d. 26.4%); Welch two-sample t-test, 95% CI 24.956 to 29.551, t(861.49) = 23.28, P < 0.001; Extended Data Fig. 2b). Before the outcome presentation, fraud beliefs were higher for Republicans than for Democrats, as was expected on the basis of allegations made by Republican politicians before election day (estimated probability of fraud playing a significant role in the election outcome: Democrats, mean 19.9% (s.d. 24.4%); Republicans, mean 63.9% (s.d. 30.7%); Welch two-sample t-test 95% CI 41.068 to 46.794, t(1,061.6) = −30.11, P < 0.001; Extended Data Fig. 2c).

Desirability effects on fraud belief updates

The hypotheses and analysis plan were pre-registered after data collection but before inspection of the belief data (https://osf.io/kucsw; for more details, see Methods). As expected, we observed strong partisan asymmetry in fraud belief updating. Participants’ fraud beliefs significantly increased after a preferred candidate loss (one-sample t-test: mean 12.97 (s.d. 31.11), 95% CI 10.844 to 15.098, t(823) = 11.97, P < 0.001) and significantly decreased after a preferred candidate win (one-sample t-test: mean −17.35 (s.d. 29.41), 95% CI −19.365 to −15.327, t(817) = −16.87, P < 0.001; Fig. 1c). That is, for either hypothetical outcome (Biden wins or Trump wins), Democrats and Republicans updated their fraud beliefs in opposite directions. The directional update was weaker in the ‘Biden wins’ scenario (Biden wins, mean 11.37 (s.d. 16.91); Trump wins, mean 30.22 (s.d. 31.28); Welch two-sample t-test, 95% CI −21.282 to −16.408, t(1255) = −15.17, P < 0.001), potentially because many Republicans had maximal prior fraud beliefs that could not increase given a Republican loss, creating a ceiling effect, and many Democrats had minimal prior fraud beliefs that could not decrease given a Democratic win, creating a floor effect.

These directional updates in fraud beliefs were significantly associated with participants’ strength of preference for one candidate and prior win beliefs (each controlling for the other in a multiple regression model; Fig. 1d,e). Following a preferred candidate loss, increases in fraud beliefs were greater for those with stronger partisan preferences (β = 0.10, 95% CI 0.04 to 0.15, t(800) = 3.20, P = 0.001), and when the outcome was less expected (β = 0.16, 95% CI 0.09 to 0.23, t(800) = 4.31, P < 0.001; Supplementary Table 4). Following a preferred candidate win, fraud beliefs decreased more strongly with stronger preferences (β = −0.11, 95% CI −0.16 to −0.07, t(799) = −5.05, P < 0.001), but updates were not significantly related to prior win belief (β = −0.03, 95% CI −0.08 to 0.02, t(799) = −1.11, P = 0.265; Supplementary Table 5). Results were robust with respect to model variants and partisan groups (‘Partisan asymmetry in belief updating’ and ‘Update of fraud belief: robustness across model variants and covariates’ in Supplementary Information, and Extended Data Fig. 2e). Furthermore, the empirical desirability effects remained significant when limiting the analysis to the follow-up subsample, except for the scaling of the fraud belief update with preferences following a loss (‘Update of fraud belief, restricted to the follow-up sub-sample’ in Supplementary Information).

In sum, we observed three empirical effects: (1) participants believed in fraud less following a win, and more following a loss (a ‘direction desirability effect’). These updates were larger (2) when the preference was stronger (a ‘magnitude desirability effect’) and (3) when the outcome was less expected, at least for the loss scenario (an ‘expectancy effect’). These effects generalized to members of both political parties.

A Bayesian model predicts the observed desirability effects

At first glance, these results suggest directionally motivated reasoning, that is, biased updating in favour of preferred beliefs. However, as explained in the introduction, desirability effects might emerge from rational belief updating as a result of how fraud beliefs interact with other beliefs1,18,19,23,24. As Jern et al.20 previously demonstrated, using a model structurally identical to the one we report below, Bayesian updating can yield different conclusions from the same evidence if people hold divergent prior beliefs. In a belief systems framework, fraud beliefs result from abductive inferences (‘inference to the best explanation’) about which of two potential causal explanations (true votes or fraud) is the most likely explanation for the outcome. If both sides expect their candidate to win, then normative updating could explain the observed desirability effects.

To test this proposition, we formulated a Bayesian attribution model (not pre-registered), which formalizes the abductive inference concept by describing a Bayesian observer of an election outcome with two candidates (A and B) jointly determined by three binary causal variables (Fig. 2a): winner of the true votes (V: election outcome given no fraud), presence of substantial fraud that could affect the outcome (F) and the side perpetrating any such fraud (C). Thus, V represents a causal explanation for the observed outcome, and the combination of F and C together represent an alternative causal explanation.

The observer’s prior belief constitutes a probability distribution over these variables, taken to be independently distributed (within each individual) with subjective probabilities v, the probability that candidate A will win the most true votes; f, the probability of substantial fraud; and c, the probability that fraud (if present) favours candidate A. The candidate winning the true vote will win the election unless substantial fraud exists and favours the other candidate, in which case the latter wins. The model provides quantitative predictions by simultaneously updating beliefs in the three binary variables (true vote winner, fraud and fraud beneficiary), as dictated by the mathematics of Bayesian inference. The posterior fraud belief given a win by candidate A (W = A) or by candidate B (W = B) as predicted by the model is as follows (for full derivation and explanations, see Methods, and for visualizations, see Extended Data Fig. 3):

$$\Pr[F = 1 \mid W = \mathrm{A}] = \frac{\Pr[F = 1]\,\Pr[W = \mathrm{A} \mid F = 1]}{\Pr[W = \mathrm{A}]} = \frac{fc}{v + f(c - v)}$$
$$\Pr[F = 1 \mid W = \mathrm{B}] = \frac{\Pr[F = 1]\,\Pr[W = \mathrm{B} \mid F = 1]}{\Pr[W = \mathrm{B}]} = \frac{f(1 - c)}{1 - v + f(v - c)}$$

The posterior true vote belief is given by (Extended Data Fig. 4):

$$\Pr[V = \mathrm{A} \mid W = \mathrm{A}] = \frac{\Pr[V = \mathrm{A}]\,\Pr[W = \mathrm{A} \mid V = \mathrm{A}]}{\Pr[W = \mathrm{A}]} = \frac{v(1 - f) + vfc}{v + f(c - v)}$$
$$\Pr[V = \mathrm{A} \mid W = \mathrm{B}] = \frac{\Pr[V = \mathrm{A}]\,\Pr[W = \mathrm{B} \mid V = \mathrm{A}]}{\Pr[W = \mathrm{B}]} = \frac{vf(1 - c)}{1 - v + f(v - c)}$$
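As an illustrative sketch (our code, not the authors’), the following computes both posteriors and shows how the fraud and true-vote beliefs move together under a typical partisan prior:

```python
# Sketch (ours) of the posterior equations above, for candidates A and B.

def posteriors(v, f, c, winner):
    """Return (Pr[F = 1 | W], Pr[V = A | W]) for winner 'A' or 'B'."""
    if winner == "A":
        z = v + f * (c - v)                       # Pr[W = A]
        return f * c / z, (v * (1 - f) + v * f * c) / z
    z = 1 - v + f * (v - c)                       # Pr[W = B]
    return f * (1 - c) / z, v * f * (1 - c) / z

# Typical partisan prior: A should win the true vote (high v), and any fraud
# would favour the other side (low c).
v, f, c = 0.9, 0.5, 0.1
for winner in ("A", "B"):
    post_f, post_v = posteriors(v, f, c, winner)
    print(f"W = {winner}: fraud {f:.2f} -> {post_f:.2f}, "
          f"true-vote belief {v:.2f} -> {post_v:.2f}")
# W = A: fraud 0.50 -> 0.10, true-vote belief 0.90 -> 0.99
# W = B: fraud 0.50 -> 0.90, true-vote belief 0.90 -> 0.81
```

The same prior thus yields a decrease in fraud belief after a win and an increase after a loss, the direction desirability effect analysed below.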

For convenience, we define A as the preferred candidate, a purely notational choice that aligns interpretation of v and c across partisan groups. We used the empirically obtained v and c from the follow-up survey (subsample, N = 828, not pre-registered) to obtain model predictions. The model accurately predicted individual participants’ fraud belief updates in the original survey, with no free parameters. The correlation between predicted and empirical fraud belief updates was r = 0.63 (95% CI 0.586 to 0.669, t(826) = 23.27, P < 0.001, N = 828, root mean squared error (RMSE) 0.336; Extended Data Fig. 5). Democratic and older participants from the original sample were more likely to complete the follow-up survey (logistic regression, preferred candidate: log odds ratio −0.462, 95% CI −0.75 to −0.18, z = −3.21, P < 0.001; age: log odds ratio 0.03, 95% CI 0.02 to 0.03, z = 6.15, P < 0.001). However, sensitivity analyses revealed that the model fits similarly well for both partisan groups and that the quality of the fit was insensitive to age and preference strength (‘Sensitivity analyses of the Bayesian model’ in Supplementary Information), indicating robustness to the sample characteristics available in our data.

Importantly, although the model did not have access to participants’ preferences, it successfully captured the desirability effects we observed (Fig. 3 and Extended Data Fig. 6): model predictions on the basis of individual participants’ beliefs show increased fraud beliefs after a loss (N = 398, mean 38.59, 95% CI 35.89 to 41.28, t(397) = 28.16, P < 0.001), decreased fraud beliefs after a win (N = 430, mean −18.54, 95% CI −21.10 to −15.99, t(429) = −14.27, P < 0.001), and scaling of both effects with the strength of partisan preferences (loss, β = 0.10, 95% CI 0.06 to 0.15, t(384) = 4.19, P < 0.001; win, β = −0.12, 95% CI −0.17 to −0.07, t(417) = −4.96, P < 0.001). The model captures these effects and predicts individual differences in belief updating because participants’ desires and preferences are correlated with their beliefs in the true winner (v) and fraud beneficiary (c).

More specifically, the model predicts that a rational agent observing a win by their preferred candidate will decrease fraud beliefs if v > c and increase fraud beliefs if c > v, and vice versa for a win by the dispreferred candidate (Fig. 2d and Extended Data Fig. 3a). Thus, a direction desirability effect arises in a rational observer when the observer believes their preferred candidate is more likely to win the true votes (v) than to benefit from fraud (c). Indeed, almost all participants from both parties (Fig. 2b–d and Extended Data Fig. 7a,b) believed that (1) their candidate would have been the true winner in the absence of fraud (v ≈ 1) and (2) if there was fraud, it favoured the dispreferred candidate (c ≤ 0.5). Thus, nearly all participants occupied a portion of the (v,c) parameter space that, under rational updating, would lead to a direction desirability effect (that is, partisan asymmetry), as we observed empirically (‘Follow-up survey: divergent beliefs’ in Supplementary Information). Moreover, the desirability effect becomes stronger as v increases and c decreases (Fig. 2d), and indeed participants with stronger preferences reported significantly higher v and lower c values, accounting for the magnitude desirability effect (‘Prediction of magnitude desirability effects’ in Supplementary Information).

In sum, the Bayesian model provides a potential mechanism for the desirability effects observed in the empirical data (Fig. 1c,d), which comprises a rational belief-updating process operating on polarized prior beliefs that co-vary with preference strength, rather than motivated updating. While Democrats expect Biden to win the true vote and fraud (if any) to favour Trump or favour both sides equally, Republicans expect Trump to win the true vote and fraud to favour Biden (Fig. 2b,c). These priors are clearly biased, since both groups cannot be correct. As for the expectancy effect (Fig. 1e), the relationship between prior win belief and fraud update is indirect according to the Bayesian model, and depends on how prior win belief co-varies with v, f and c. Nevertheless, the Bayesian model also accounts for the observed expectancy patterns (Extended Data Fig. 8). The one notable discrepancy, addressed further below, is that the model over-predicts the direction desirability effect by producing greater increases in fraud belief in the loss scenario than were empirically observed. Thus, if participants were optimal Bayesian observers, they would have been expected to increase fraud beliefs following a loss more than they actually did.

Comparisons with competing models

To test whether rational belief updating operating on polarized priors provides a better explanation than motivated updating, we considered two kinds of irrational updating biases. We formulated extended models that included each of these two non-Bayesian biases in addition to the attribution mechanism in the core Bayesian model (Methods). One is a hypothesis desirability bias, under which people update more in favour of beliefs they prefer (that is, people believe what they want to be true), in line with the notion that beliefs have value beyond their epistemic content16. In the ‘hypothesis desirability’ model, probabilities of hypotheses (joint values of V, F and C) are weighted by their desirability, such that posterior beliefs are biased towards desired hypotheses. The second is outcome desirability bias, under which people update more on the basis of preferred outcomes, and information from undesired outcomes is discounted. In the ‘outcome desirability’ model, likelihoods for undesired outcomes are under-weighted and thus elicit reduced belief updates compared with more desired outcomes15,25,26.

The difference between these two models is that, in the hypothesis desirability model, desired hypotheses receive a greater (more positive or less negative) update than do undesired hypotheses, regardless of what evidence is observed. By contrast, in the outcome desirability model, all hypotheses receive a stronger update if the evidence (that is, election outcome) is desired and a weaker update (closer to zero) if the outcome is undesired. In both of these models, the strength of the bias is determined by a free parameter, α, and the models reduce to the original Bayesian model when α = 0.
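The hypothesis desirability model is specified formally in Methods. The outcome desirability mechanism can be sketched roughly as follows; the interpolation towards the prior used here is our illustrative assumption, not necessarily the paper’s exact parameterization:

```python
# Rough sketch of outcome-desirability discounting (our assumption for
# illustration; the paper's exact parameterization may differ). At alpha = 0
# this reduces to standard Bayesian updating, as stated above.

def outcome_desirability_update(prior, likelihood, outcome_desired, alpha):
    """Posterior over hypotheses; updates from undesired outcomes are
    shrunk back towards the prior in proportion to alpha."""
    joint = [p * l for p, l in zip(prior, likelihood)]
    z = sum(joint)
    posterior = [j / z for j in joint]            # standard Bayes
    if outcome_desired:
        return posterior
    return [(1 - alpha) * q + alpha * p for q, p in zip(posterior, prior)]
```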

The hypothesis desirability model did not improve the fit compared with the original Bayesian model, resulting in a best-fitting α of zero (equivalent to the original Bayesian model). Positive α values, corresponding to biased inference towards desired hypotheses, further increased the over-prediction of observed desirability effects in the loss scenario. This finding supports the conclusion that the empirical desirability effects arise from rational updating. That is, we find not only that the data are predicted reasonably well by the Bayesian model but also that incorporating motivated reasoning worsens the fit. Therefore, we did not consider the hypothesis desirability model further.

In contrast, the outcome desirability model fits better than the original Bayesian model, with an estimated α of 0.3 (RMSE 0.293; bootstrapped 95% CI 0.188 ≤ α ≤ 0.470; F-test, F(1,827) = 364.37, P < 0.001; for model comparison statistics, see Supplementary Table 13). Importantly, however, the biased updating in this model did not drive the observed desirability effects. Instead, it mitigated the over-prediction of fraud updates present in the original model by reducing the strength of the desirability effects in the loss scenario (that is, it tempers rather than explains the desirability effects produced by the Bayesian attribution process; Fig. 3). Rational attribution over polarized prior beliefs remained the best explanation of the observed desirability effects. We discuss alternative explanations for the over-prediction of desirability effects in the original Bayesian model below (Discussion).

To test whether all three prior beliefs (f, v and c) in our modelling framework are necessary to account for the empirically observed desirability effects, we formulated two reduced Bayesian models (Methods, Fig. 3 and Supplementary Table 13). The fraud-only model, which includes beliefs only about fraud (F and C) and not about the true vote (V), fits the data relatively well but does not account for the increase of fraud beliefs following an undesired loss. The random beneficiary model, which includes V and F but omits C, fits the data worse than the other models and does not account for the decrease in fraud belief following a desired win. Thus, all three elements of the belief system appear essential to the models’ explanation of the observed desirability effects.

Update of beliefs in the legitimate winner

The goal of an election is to convince the population to accept the winner as their legitimate leader. Our models also predict how people update their belief in the legitimate winner after observing the outcome (Extended Data Fig. 4). Bayesian observers update their beliefs about V and F (and C) simultaneously upon observing an outcome, following the rules of Bayesian inference.

In our sample, both the outcome desirability model and the original Bayesian model predict that 36% of the participants will be ‘election proof’, in the sense that more than 90% of the overall update following an official loss by their preferred candidate will be to increase fraud beliefs rather than to change belief in the true winner. The models both classify 29% of Democrats and 51% of Republicans in this way. Importantly, our sample is not strictly representative of the entire US population, which is necessary for quantitatively estimating the share of the population that holds specific beliefs or is ‘election proof’. Therefore, these specific estimates, which also rely on certain assumptions about reports of extreme probabilities (Methods), should be treated only as qualitative approximations.

Nevertheless, they suggest that a considerable proportion of the population will give essentially no credence to official election results, even if their reasoning is epistemically rational, based on their prior beliefs. Moreover, our model-based estimate is corroborated by both our follow-up data (39% of Republicans in our sample were still certain that Trump was the true winner weeks after Biden was declared as the official winner) and national polls that consistently find that about one-third of Americans continue to believe that Biden won only due to fraud even months after the election (for example, https://www.monmouth.edu/polling-institute/reports/monmouthpoll_us_062121/ and https://www.cnn.com/2021/04/30/politics/cnn-poll-voting-rights/index.html).
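Under our reading of the criterion (quantifying the ‘overall update’ as the summed absolute change across the two beliefs, which is our assumption), the classification follows directly from the closed-form posteriors:

```python
# Sketch of the 'election proof' classification (our reading: the share of
# total absolute belief change that falls on the fraud belief after a loss).

def election_proof(v, f, c, threshold=0.9):
    z = 1 - v + f * (v - c)                  # Pr[W = B]: preferred candidate loses
    d_fraud = abs(f * (1 - c) / z - f)       # |update to Pr[F = 1]|
    d_true = abs(v * f * (1 - c) / z - v)    # |update to Pr[V = A]|
    return d_fraud / (d_fraud + d_true) > threshold

# Near-certain priors that fraud favours the opponent leave the true-winner
# belief almost untouched by an official loss:
print(election_proof(v=0.95, f=0.9, c=0.05))   # True
```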

Discussion

Our findings illustrate how specific combinations of beliefs can prevent rational people from accepting the results of democratic elections. Beliefs in election fraud have played a substantial role in undermining democratic governments worldwide27, and have grown and remained strikingly prevalent in the United States28,29. Belief in fraud undermines both motivation to vote and acceptance of election results, which bear directly on the viability of an elected government.

Our results suggest that fraud beliefs and other beliefs do not exist in isolation, but are part of systems of beliefs that interact to guide how new evidence is interpreted. These beliefs can withstand countervailing evidence, even if the belief-updating process is rational, depending on other prior beliefs that determine how credit for observed evidence (for example, an election winner) is distributed across potential causes. Real-world evidence that directly bears on these causal beliefs is difficult at best for individuals to obtain. For example, direct evidence about V requires a fraudless election, and the ‘gold standard’ evidence about F (and C) is the result of an unbiased investigation. Rhetoric that increases belief in fraud or challenges the credibility of the investigation can thus create a ‘short circuit’: if one side is believed to be cheating disproportionately (C), then an election outcome (W) might not be taken as evidence regarding the true winner (V), and might instead serve almost solely to update fraud beliefs (F and C). Our model-based analysis suggests this occurred in a considerable proportion of our sample.

The belief systems framework has important implications for efforts to counteract fraud beliefs in the United States and beyond. Such efforts often focus on directly ‘debunking’ them (providing direct evidence against fraud). However, these debunking efforts largely fail30. Our model affords alternative ways of thinking about how to change beliefs, in accordance with other perspectives emphasizing the worldview of the person holding the beliefs and the importance of alternative causal explanations31,32,33. Specifically, to reduce belief in election fraud, in addition to (1) directly targeting them with evidence that fraud is very rare (decreasing f in our model), it is advantageous to simultaneously provide evidence that (2) Americans have diverse preferences and one’s dispreferred candidate truly is supported by many (decreasing v), and (3) to the extent fraud does occur, its benefits are distributed across both candidates (increasing c). Furthermore, our model could be useful for distributing resources across interventions targeting these three prior beliefs, by predicting how each would impact people’s willingness to update fraud beliefs (for a formal proof of concept, see ‘Comparing among interventions targeting prior beliefs’ in Supplementary Information).
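To illustrate this idea (a toy calculation under the model, not the Supplementary proof of concept itself), one can compare how nudging each prior belief shifts the model’s predicted posterior fraud belief after a preferred-candidate loss:

```python
# Toy comparison (ours) of interventions on f, v and c for one representative
# prior, using the model's posterior fraud belief after a loss.

def posterior_fraud_after_loss(v, f, c):
    return f * (1 - c) / (1 - v + f * (v - c))

base = dict(v=0.9, f=0.7, c=0.1)
print("baseline:", round(posterior_fraud_after_loss(**base), 3))   # 0.955
for label, nudged in [("decrease f", dict(base, f=0.6)),
                      ("decrease v", dict(base, v=0.8)),
                      ("increase c", dict(base, c=0.2))]:
    print(label, round(posterior_fraud_after_loss(**nudged), 3))
# decrease f: 0.931 | decrease v: 0.913 | increase c: 0.949
```

For this particular prior, the largest single-step reduction comes from lowering v, but the ordering depends on where a person sits in the prior belief space, which is what a model-guided allocation of intervention resources would exploit.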

Real-world political campaigns already consider multiple beliefs in this fashion to varying degrees, though perhaps without the explicit formulation embodied in our model. For example, the Trump campaign simultaneously targeted these same three beliefs before and during the 2020 election34: (1) ‘It’s rigged’: election fraud is rampant (increase f); (2) ‘We’re winning’: Trump is popular and enjoys the true support of most Americans (increase v for Trump) and (3) ‘The Democrats are crooked’: fraud overwhelmingly favours the opposition (decrease c). Although these messages may initially seem unrelated, our model shows that they combine to form a coordinated strategy to create evidence-resistant beliefs about the true winner. Moreover, the campaign also cast suspicion on the unprecedented use of mail-in ballots in the 2020 election (due to the coronavirus disease 2019 pandemic), which probably made fraud more plausible as an alternative explanation. While some political actors already seem able to intuitively apply this approach in effective campaigns, a formal understanding such as we offer here could be beneficial to studying and preventing misinformation campaigns, and promoting belief change for the sake of society.

A few caveats and limitations deserve additional mention. First, our sample was not strictly representative of the US population. Thus, it is not suited for quantifying population-average beliefs (or belief updates) across the population as election polls intend to do, and does not precisely estimate prevalence of fraud beliefs across the entire US population. Moreover, Democratic and older participants were more likely to complete the follow-up survey, which could potentially bias numerical estimates. However, the presence of desirability effects and the model’s goodness of fit did not vary across partisan groups and age, suggesting that our qualitative conclusions are robust to these sample characteristics.

Second, there are at least two reasons that participants’ reported prior beliefs might have been more extreme than their actual priors during the main survey: (1) reports of v and c were collected from the follow-up survey and thus may have been impacted by intervening events (for example, further claims by the Trump campaign that he was the true winner but lost because of fraud, and consensus among Democrats that Biden won in the absence of fraud). Indeed, fraud beliefs after the 2020 election have been shown to become more polarized with time35. (2) Participants might have reported more extreme prior beliefs than they actually held because they were motivated to communicate partisanship (expressive responding36). If reported prior beliefs were artificially polarized for either of these reasons, this could undermine the conclusion that participants updated their beliefs rationally. That is, our Bayesian model holds that the desirability effects in participants’ posterior beliefs are rooted in the asymmetries in their priors, and that argument could fail if the asymmetries were not genuine. However, it is unlikely that the model’s predictive success was artificially inflated in this way. First, recent studies, including a comprehensive analysis of fraud beliefs in the context of the 2020 US election, found little or no support for expressive responding in reported political beliefs37,38. Second, we conducted an additional analysis that assumed participants’ prior beliefs were less extreme than reported. This analysis adjusted the reported priors by shrinking them towards the undesired state before passing them to the models to calculate predictions (‘Simulations of extreme priors’ in Supplementary Information). This improved, rather than worsened, the Bayesian model’s fit, by reducing over-predictions of fraud belief increases in the loss scenario. While these results are consistent with some degree of expressive responding or polarization over time, they show that the model’s ability to capture the desirability effects in our primary analysis was despite such effects, not because of them.

Third, the present models are relatively simple and do not explicitly incorporate many beliefs, such as confidence in election polls or in news sources. Interestingly, beliefs about many such additional latent variables are not expected to change the models’ predictions regarding fraud belief, because they are not directly causally related to the official winner (‘Additional beliefs not included in the models’ in Supplementary Information).

Finally, the unique setting of our data collection may have elicited pre-treatment effects, whereby participants may appear to be insensitive to experimentally presented information because they have already been exposed to similar information. However, we compared participants who were given contrasting observations (rather than certain information versus no information), and the results clearly show that participants did update their beliefs. Furthermore, a strength of our Bayesian approach is that updates in Bayesian models depend on both the prior beliefs and the new evidence, and thus they account for pre-treatment effects that would arise when those two are aligned (that is, when priors are already strongly in the direction of the new evidence). Nevertheless, future studies should examine whether our findings generalize to other situations, and particularly to situations with milder prior beliefs.

In sum, our findings indicate that the desirability effects in fraud beliefs we observed during the 2020 US presidential election are best attributed to rational updating from polarized prior beliefs rather than to biased updating. This suggests that partisan groups tend to stereotype opposing groups as irrational because they do not know (or disregard) their prior beliefs. Accurately taking them into account might help bring people closer together by showing they are in fact often rational. Such polarized prior beliefs could be driven by rational and/or irrational processes, including many social, technological and psychological processes beyond the scope of our model and study. For example, they could result from differential exposure to evidence driven by social media, which has been shown to reinforce belief polarization5 and create ‘filter bubbles’ and ‘echo chambers’39,40. In this case, the partisan differences we found in people’s priors may be rational, in that individuals are simply drawing on the information available to them. However, the process of information selection may itself be irrationally biased, and indeed people have been found to select information that reinforces desired beliefs41,42,43. Prior beliefs are also shaped powerfully by social norms and group identity4,44,45,46. This could be interpreted as an instrumental bias (agreeing with in-group members has value) or potentially as rational, if people place more trust in their in-group to deliver accurate information. Our model does not attempt to explain these varied and powerful processes, but it helps to understand the dynamics of their influence.

More broadly, our findings highlight the importance of a systems perspective in understanding human beliefs across domains, and in particular the critical role of attribution. The dynamics we observed here are probably at play in multiple areas crucial for human wellbeing, including beliefs about the self and others that shape individual mental health47, beliefs in the efficacy and safety of vaccines48, and beliefs in the need for action to address climate change49,50. In each of these domains, evidence must be attributed across multiple latent causes, shaping which beliefs are reinforced by observed evidence. Our approach provides a blueprint that can be applied to beliefs in these diverse domains, identifying key beliefs and making quantitative predictions about how they will interact.

Methods

Participants

A large online sample of 1,760 American citizens was collected from MTurk (Amazon) via Cloud Research22. The sample size was chosen on the basis of available funds aiming for a large sample across the United States. Participants were 18 years old or older, provided their consent for participation online before accessing the survey questions and were paid $1 for their participation. The study was approved by Dartmouth College’s Committee for the Protection of Human Subjects.

Thirty-two participants were excluded due to failure to correctly answer a simple attention check included in the survey and one due to not meeting the participation requirement of American citizenship or permanent residency. Eighty-five additional participants were excluded from analyses because they chose ‘Other’ for their preferred Presidential candidate (and not ‘Donald Trump’ or ‘Joe Biden’). Analyses included 1,642 participants. Using G*Power version 3.1 (ref. 51), we found that the minimal detectable effect size for either predictor (preference strength or prior win belief) in our main regression analysis of fraud belief update, based on α = 0.05, power of 0.9 and N = 818 (the number of included participants who were randomly assigned to the loss scenario, which is lower than the 824 participants in the win scenario), is a small effect size of Cohen’s f2 = 0.016 (ref. 52).

Out of the 1,642 included participants, 1,032 preferred Joe Biden and 610 preferred Donald Trump. Partisan affiliation was generally, but not perfectly, aligned with the preferred candidate. We use Democrat and Republican to refer to participants according to their preferred candidates (Biden and Trump, respectively). The sample included 1–138 participants from each of the 50 US states and the District of Columbia. The age of participants ranged from 18 to 84 years old (mean 41.6, s.d. 13.1 years) and Republicans were older than Democrats on average (Democrats, mean 39.9, s.d. 12.8 years; Republicans, mean 44.5, s.d. 13.1 years).

Materials and procedures

Each participant completed a survey distributed on the Qualtrics platform (Qualtrics). The US election took place on 3 November 2020. The survey was available online from 4 November at 20:00 until 5 November at 10:30 Eastern Time. During that time, votes were still being counted and there was no conclusive winner. Six states were considered critical ‘swing states’ with no clear winner and votes still being counted: Nevada, Arizona, Michigan, Pennsylvania, North Carolina and Georgia. We created two hypothetical maps that differed by flipping four of these six states: Nevada, Arizona, Michigan and Pennsylvania, using the election tool provided by fivethirtyeight.com: https://projects.fivethirtyeight.com/trump-biden-election-map/. Each map provided a graphical representation of the states’ final results (red for Republican victory and blue for Democratic victory), sum of electoral votes for Republicans and Democrats and the winner: one map with a Republican (Donald Trump) win and one with a Democratic (Joe Biden) win (for an illustration, see Fig. 1a,b).

We collected participants’ demographic information (age and state of residency), political affiliation (Republican, Democrat, Independent or Other), candidate preference (Donald Trump, Joe Biden or Other), preference strength (continuous scale from 0 (don’t care either way) to 100 (extremely strong preference)), prior subjective probability of the preferred candidate’s win (prior win belief; continuous scale from 0 (no chance) to 100 (definite win)) and prior subjective probability of fraud playing a significant role in the election outcome (prior fraud belief; continuous scale from 0 (not at all) to 100 (extremely); for full description of the survey questions, see Supplementary Table 1). Then, each participant was randomly presented with one of the two maps described above and was asked to indicate their belief about fraud affecting the election outcome if it were to turn out as in the map shown to them.

Analysis

All statistical tests reported in the manuscript and Supplementary Information were two sided.

Pre-registration

We pre-registered our analysis plan and predictions on the Open Science Framework (https://osf.io/kucsw) after data collection but before inspection of beliefs-related data. We did examine some portions of the data, such as the number of participants supporting each candidate, as well as demographic information. These procedures did not inform us about the effects of interest. Rather, they were meant to monitor data quality and sample characteristics.

Deviations from pre-registration

We report a different multiple linear regression model than the one we pre-registered for the update of fraud belief, to simplify the model and interpretations. The differences between the two models, along with the results of the pre-registered model, which are in line with those of the revised one, are described in ‘Updates in fraud belief: pre-registered analysis’ in Supplementary Information.

The pre-registration was based mainly on the biased belief updating hypothesis and focused on the empirical data from the original survey. After observing the data of the original survey, we developed the Bayesian model and associated variants. A follow-up survey was then collected to test the model and expand the findings from the original survey. The follow-up survey and models were not pre-registered.

Updating of fraud belief

At the time of the pre-registration, we predicted, on the basis of the desirability bias perspective and learning theories, that the update of fraud belief (posterior minus prior subjective probability of fraud playing a significant role in the election outcome) would be affected by two factors: (1) participants’ preferences, such that fraud belief would increase when the hypothetical results were undesired, with a larger update for stronger preferences and (2) participants’ prior win belief, such that fraud belief would increase when the hypothetical results were less expected.

We tested our predictions with multiple linear regression predicting update of fraud belief as a function of the preference strength and the prior win belief, separately for the loss scenario (that is, following a map in which the preferred candidate lost) and for the win scenario (that is, following a map in which the preferred candidate won). The prior fraud belief, preferred candidate (Biden/Trump), participant’s age and order of survey submission during data collection were included in both models as covariates. The participant’s state of residence was not included as it did not significantly reduce residual variance while substantially increasing model complexity. All numeric variables included in the model (dependent and independent) were z scored. The same models were also tested separately for each partisan subgroup, where the preferred candidate covariate was omitted (‘Partisan asymmetry in belief updating’ in Supplementary Information). Additionally, a one-sample two-sided t-test was used to test whether the mean change in fraud belief was significantly non-zero in each scenario.
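For concreteness, a sketch of this specification follows (our code; the DataFrame and column names are assumptions, not the authors’ variable names):

```python
# Hedged sketch of the regression described above, assuming a pandas DataFrame
# `df` with one row per participant and these (hypothetical) column names.
import statsmodels.formula.api as smf

NUMERIC = ["fraud_update", "pref_strength", "prior_win_belief",
           "prior_fraud_belief", "age", "submission_order"]

def fit_update_model(df, scenario):
    d = df[df["scenario"] == scenario].copy()             # 'loss' or 'win'
    d[NUMERIC] = (d[NUMERIC] - d[NUMERIC].mean()) / d[NUMERIC].std()  # z-score
    formula = ("fraud_update ~ pref_strength + prior_win_belief"
               " + prior_fraud_belief + C(preferred_candidate)"
               " + age + submission_order")
    return smf.ols(formula, data=d).fit()

# fit_update_model(df, "loss").summary()
```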

Bayesian model

To test whether the general patterns of belief update we found can reflect rational reasoning from possibly polarized prior beliefs, we formulated a Bayesian model that assumes participants attribute the election outcome to the combination of the true votes (that is, the election outcome given no fraud) and fraud (Fig. 2a). The candidates are labelled A and B, with A representing the participant’s preferred candidate. Let W, V ∈ {A, B} be the official and true winners, respectively. Let F ∈ {0, 1} indicate whether there was substantial fraud that could overturn the outcome, and let C ∈ {A, B} indicate which side committed fraud (only meaningful when F = 1). Therefore, the official winner will match the true winner unless there was substantial fraud by the opposition (F = 1 and C ≠ V):

$$\Pr[W = V] = \begin{cases} 0 & F = 1 \text{ and } C \ne V \\ 1 & \text{otherwise} \end{cases}$$

We assume the participant has an independent prior on V, F and C, and denote Pr[V = A] = v, Pr[F = 1] = f, Pr[C = A] = c. For convenience, we define R ∈ {∅, A, B} by R = ∅ when F = 0 and R = C when F = 1; that is, R indicates who committed substantial fraud, with ∅ denoting no one. Then the joint prior on V, R is given by

$$\Pr[V, R] = \begin{cases} v(1 - f) & V = \mathrm{A},\; R = \varnothing \\ vfc & V = \mathrm{A},\; R = \mathrm{A} \\ vf(1 - c) & V = \mathrm{A},\; R = \mathrm{B} \\ (1 - v)(1 - f) & V = \mathrm{B},\; R = \varnothing \\ (1 - v)fc & V = \mathrm{B},\; R = \mathrm{A} \\ (1 - v)f(1 - c) & V = \mathrm{B},\; R = \mathrm{B} \end{cases}$$

The likelihoods, as described above, are given by

$$\Pr[W = \mathrm{A} \mid V, R] = \begin{cases} 1 & (V = \mathrm{A} \text{ and } R \ne \mathrm{B}) \text{ or } (V = \mathrm{B} \text{ and } R = \mathrm{A}) \\ 0 & \text{otherwise} \end{cases}$$

with Pr[W = B | V, R] = 1 − Pr[W = A | V, R]. The joint posteriors are then given by

$$\Pr[V, R \mid W = \mathrm{A}] = \frac{1}{Z_{\mathrm{A}}} \begin{cases} v(1 - f) & V = \mathrm{A},\; R = \varnothing \\ vfc & V = \mathrm{A},\; R = \mathrm{A} \\ (1 - v)fc & V = \mathrm{B},\; R = \mathrm{A} \\ 0 & \text{otherwise} \end{cases}$$

and

$$\Pr[V, R \mid W = \mathrm{B}] = \frac{1}{Z_{\mathrm{B}}} \begin{cases} (1 - v)(1 - f) & V = \mathrm{B},\; R = \varnothing \\ (1 - v)f(1 - c) & V = \mathrm{B},\; R = \mathrm{B} \\ vf(1 - c) & V = \mathrm{A},\; R = \mathrm{B} \\ 0 & \text{otherwise} \end{cases}$$

with partition functions

$$Z_{\mathrm{A}} = \Pr[W = \mathrm{A}] = v + f(c - v)$$
$$Z_{\mathrm{B}} = \Pr[W = \mathrm{B}] = 1 - v + f(v - c)$$

Marginalizing the joint posteriors gives the posteriors on F:

$$\Pr[F = 1 \mid W = \mathrm{A}] = \frac{fc}{v + f(c - v)} = \frac{c}{c + (1 - f)(v - c)}\,\Pr[F = 1]$$
$$\Pr[F = 1 \mid W = \mathrm{B}] = \frac{f(1 - c)}{1 - v + f(v - c)} = \frac{1 - c}{1 - c + (1 - f)(c - v)}\,\Pr[F = 1]$$

Therefore, fraud belief increases if the official winner is more likely to be the cheater than to be the true winner (that is, if c > v for W = A, and if c < v for W = B).

We also derived the posteriors for V, which are given by

$$\Pr[V = \mathrm{A} \mid W = \mathrm{A}] = \frac{v(1 - f) + vfc}{v + f(c - v)}$$

and

$$\Pr[V = \mathrm{A} \mid W = \mathrm{B}] = \frac{vf(1 - c)}{1 - v + f(v - c)}$$

Importantly, as can be seen above, the model simultaneously updates beliefs about V, F and C, as dictated by Bayes’ rule. For example, the posterior probability that candidate A is the true vote winner given that A is the official winner (Pr[V = A | W = A]) depends on the prior probability of V (Pr[V = A], which is v), the likelihood of A being the official winner given that A is the true vote winner (Pr[W = A | V = A], which depends on f and c) and the marginal probability of A being the official winner (Pr[W = A], which depends on v, f and c). The belief systems perspective comes into play primarily in the term Pr[W = A | V = A]. This likelihood term masks hidden complexity, because it depends on other causal explanations. In our model, its value depends on f and c: for example, Pr[W = B | V = A] = f(1 − c). Thus, the updates regarding V, F and C are simultaneous and inter-dependent. Updates about V depend on f and c (in addition to v), and updates about F depend on v and c (in addition to f). In some cases, depending on the prior beliefs, it is rational not to take the election outcome as diagnostic of the true winner. For example, if an agent’s prior beliefs are f = 1 and c = 0, the agent is certain that substantial fraud is committed in favour of candidate B, and therefore that B will be the official winner regardless of who wins the true vote. In such a case, a win by candidate B is not diagnostic of V, and therefore a Bayesian (rational) agent will not update beliefs about V.
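As a sanity check on these derivations, the closed-form expressions can be reproduced by brute-force enumeration of the joint hypotheses (V, R) (a sketch of ours, not the study’s code):

```python
# Brute-force check (ours) of the closed-form results above: enumerate the six
# joint hypotheses (V, R), apply the deterministic likelihood and marginalize.

def enumerated_posteriors(v, f, c, winner):
    prior = {("A", "none"): v * (1 - f),        ("B", "none"): (1 - v) * (1 - f),
             ("A", "A"):    v * f * c,          ("B", "A"):    (1 - v) * f * c,
             ("A", "B"):    v * f * (1 - c),    ("B", "B"):    (1 - v) * f * (1 - c)}

    def official(true_winner, r):               # W = V unless the opposition cheats
        return r if r not in ("none", true_winner) else true_winner

    joint = {h: p for h, p in prior.items() if official(*h) == winner}
    z = sum(joint.values())                     # partition function Z_A or Z_B
    post_fraud = sum(p for (tv, r), p in joint.items() if r != "none") / z
    post_true_a = sum(p for (tv, r), p in joint.items() if tv == "A") / z
    return z, post_fraud, post_true_a

v, f, c = 0.7, 0.4, 0.2
z, post_fraud, post_true_a = enumerated_posteriors(v, f, c, "A")
assert abs(z - (v + f * (c - v))) < 1e-12                        # Z_A
assert abs(post_fraud - f * c / z) < 1e-12                       # Pr[F = 1 | W = A]
assert abs(post_true_a - (v * (1 - f) + v * f * c) / z) < 1e-12  # Pr[V = A | W = A]
```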

Follow-up survey

To assess the participants’ beliefs about the true vote winner and who commits fraud, we collected a follow-up survey, about 11 weeks after the original survey. The follow-up survey was again distributed on Qualtrics via Cloud Research and was available to all participants who completed the original survey. Although participants did not originally sign up for a multi-session study, and the follow-up survey was available for less than 24 h, more than half of the original sample completed it (N = 937; nine additional participants were excluded for failing the attention check). Participants were notified about the follow-up survey via email (sent via Cloud Research) and were paid an additional $1 upon completion of the follow-up survey. For the purpose of testing the models and assessing the priors, we further excluded 44 participants who indicated ‘Other’ as their preferred candidate during the original survey, 22 participants who indicated ‘Other’ as their preferred candidate during the follow-up survey and 43 additional participants who changed their preferences between the surveys (all switched from preferring ‘Trump’ to preferring ‘Biden’). We obtained the priors and applied the models on the remaining N = 828 participants.

Participants included in the final follow-up sample were 18–77 years old with at least one participant from each state. The follow-up survey was accessible between the evening of 19 January 2021 and the inauguration of Joe Biden (around 12:00 Eastern Time on 20 January 2021), and included questions about the participants’ preferences (both at the time of the follow-up survey completion and during the election), their beliefs about the winner of the election, their beliefs about fraud and the activities they considered when answering the questions about fraud (for all survey questions, see Supplementary Table 2). Participants’ values for v were computed from their answer to the question ‘Who would have won in a fraudless election?’ along with their reported confidence in that answer (continuous scale 0–100). The confidence level was multiplied by −1 for participants who indicated the opponent candidate as the fraudless winner, and v was set as the confidence level after rescaling to a range of [0, 1]. For participants who answered ‘Don’t know’ with regard to the fraudless winner, we set their v to 0.5, indicating uncertainty. Participants’ values for c were computed from their answer to the question ‘Who benefited most from fraudulent voting activity?’ (continuous scale 0 (Trump/Biden) to 100 (Biden/Trump); the labels’ order was counterbalanced across participants). The value of c was set as the reported probability of fraud benefiting their preferred candidate, divided by 100 for a range of [0, 1]. The correlations between the preference strength and v, and between the preference strength and c, were computed with Pearson correlation. Prior fraud belief values (f) from the original survey were also scaled to a range of [0, 1] to be used in the models.
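In code, these transformations amount to the following sketch (ours; the argument names are assumptions):

```python
# Sketch (ours) of the mapping from follow-up survey answers to model priors.

def compute_v(fraudless_winner, preferred, confidence):
    """v from 'Who would have won in a fraudless election?' plus 0-100 confidence."""
    if fraudless_winner == "Don't know":
        return 0.5                               # uncertainty
    signed = confidence if fraudless_winner == preferred else -confidence
    return (signed + 100) / 200                  # rescale [-100, 100] to [0, 1]

def compute_c(prob_fraud_benefits_preferred):
    """c from the reported 0-100 probability that fraud benefited candidate A."""
    return prob_fraud_benefits_preferred / 100
```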

We derived the models’ predicted posteriors and magnitudes of updating of fraud beliefs on the basis of the empirically measured priors for the 828 participants who successfully completed the follow-up survey, preferred either Trump or Biden (that is, not ‘Other’) and did not change their preference between the two surveys. Many participants reported extreme prior beliefs (f, v and/or c of 0 or 1), which are unlikely to indicate true mathematical certainty and instead may result from a lack of granularity in reports, biases towards anchors or use of the scale endpoints to communicate partisanship or support36,53,54. Furthermore, for some participants, these extreme prior beliefs led to undefined posterior beliefs in the model, because they implied zero probability for the hypothetical outcome the participant observed (for example, viewing a win by the preferred candidate after having a 100% belief that the opposition would reverse the election through fraud). Therefore, when computing the models’ predictions, we regularized extreme priors to the range [0.01, 0.99] by converting values lower than 0.01 to 0.01 and values higher than 0.99 to 0.99. Following this step, we could derive predictions for all 828 follow-up participants. Importantly, we verified that the choice of regularization value (no regularization, regularization to [0.01, 0.99] or regularization to [0.05, 0.95]) did not affect the study’s conclusions (‘Regularization of extreme prior beliefs’ in Supplementary Information). To assess the model fit, we calculated the root mean squared error (RMSE) and the Pearson correlation between the empirical and predicted values of fraud belief updates.
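
The regularization and fit assessment amount to a clip and two standard error metrics; a brief sketch (our code, using NumPy):

```python
import numpy as np

def regularize(priors, lo=0.01, hi=0.99):
    """Clip extreme priors into [lo, hi] so that every hypothetical
    outcome has nonzero prior probability and posteriors are defined."""
    return np.clip(priors, lo, hi)

def fit_metrics(empirical, predicted):
    """RMSE and Pearson r between empirical and model-predicted
    fraud-belief updates."""
    empirical, predicted = np.asarray(empirical), np.asarray(predicted)
    rmse = np.sqrt(np.mean((empirical - predicted) ** 2))
    r = np.corrcoef(empirical, predicted)[0, 1]
    return rmse, r
```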

Non-Bayesian models

To examine potential deviations from the original Bayesian model, we formulated and tested two extended models, each adding a particular form of non-Bayesian desirability bias to the model’s belief updating rule. The Bayesian model uses the standard Bayes’ rule

$$\Pr \left[ h \mid W \right] \propto \Pr \left[ h \right] \Pr \left[ W \mid h \right]$$

where h is the hypothesis (a combination of V, F and C in our models), W is the observed outcome and \(\propto\) stands for proportionality. As an analogue of the correct probabilities \(\Pr\left[ \cdot \right]\) in true Bayesian models, we introduce the notation \({\Bbb B}\left[ \cdot \right]\) to stand for beliefs in the non-Bayesian models.

Hypothesis desirability model

One possible form of bias in belief updating comes from desirability of hypotheses, whereby hypotheses are weighted according to their subjective utility when determining posterior beliefs. We implemented this by assuming each hypothesis h is weighted by a linear function \(1 + \alpha U\left( h \right),\) where U denotes subjective utility and α is a free parameter determining the strength of the desirability bias:

$${\Bbb B}\left[ h \mid W \right] \propto \left( 1 + \alpha U\left( h \right) \right) \Pr \left[ h \right] \Pr \left[ W \mid h \right]$$

When \(\alpha = 0\), the hypothesis desirability model reduces to normative Bayesian inference. As \(\alpha \to \infty\), the desirability bias becomes maximal in that every hypothesis is weighted exactly according to its utility (that is, the \(\alpha U(h)\) term dominates the 1 term, and then α cancels out due to normalization).

Under the assumption that participants care about the true vote winner and not about fraud per se, we defined the utility of each hypothesis by the utility of the corresponding true vote winner, \(U\left( h \right) = U\left( V \right)\). These utilities were then taken as \(U\left( {\mathrm{A}} \right) = u\) and \(U\left( {\mathrm{B}} \right) = 1 - u\), where u denotes the participant’s reported preference strength towards their preferred candidate (A), scaled to lie in \(\left[ \frac{1}{2}, 1 \right]\). In other words, \(U\left( h \right) = u\) for all hypotheses for which \(V = {\mathrm{A}}\), and \(U\left( h \right) = 1 - u\) for all hypotheses for which \(V = {\mathrm{B}}\). Negative values of α would lead to down-weighting of desired hypotheses relative to undesired ones, and therefore only values of 0 or greater were considered.

These assumptions lead to the following posterior beliefs:

$${\Bbb B}\left[ h \mid W = {\mathrm{A}} \right] = \frac{\left( 1 + \alpha U\left( h \right) \right) \Pr \left[ h \right] \Pr \left[ W = {\mathrm{A}} \mid h \right]}{Z_{{{\mathrm{A}}}}}$$

and

$${\Bbb B}\left[ h \mid W = {\mathrm{B}} \right] = \frac{\left( 1 + \alpha U\left( h \right) \right) \Pr \left[ h \right] \Pr \left[ W = {\mathrm{B}} \mid h \right]}{Z_{{{\mathrm{B}}}}}$$

with partition functions

$$Z_{{{\mathrm{A}}}} = v - vf + fc + \alpha \left( {uv + fc - uvf - ufc - vfc + 2uvfc} \right)$$
$$Z_{{{\mathrm{B}}}} = 1 - v + vf - fc + \alpha \left( {1 - u - v + uv - fc + uvf + ufc + vfc - 2uvfc} \right)$$

Marginalizing the joint posteriors gives the posteriors on F:

$${\Bbb B}\left[ F = 1 \mid W = {\mathrm{A}} \right] = \frac{fc + \alpha \left( fc - ufc - vfc + 2uvfc \right)}{v - vf + fc + \alpha \left( uv + fc - uvf - ufc - vfc + 2uvfc \right)}$$
$${\Bbb B}\left[ F = 1 \mid W = {\mathrm{B}} \right] = \frac{f\left( 1 - c \right) + \alpha \left( f - uf - vf - fc + 2uvf + ufc + vfc - 2uvfc \right)}{1 - v + vf - fc + \alpha \left( 1 - u - v + uv - fc + uvf + ufc + vfc - 2uvfc \right)}$$
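
Equivalently, these closed-form posteriors can be obtained by direct enumeration. A sketch of the hypothesis desirability update (our illustration; names are ours):

```python
import itertools

def fraud_posterior_hypothesis_desirability(v, f, c, u, alpha, w):
    """Weight each joint hypothesis (V, F, C) by 1 + alpha * U(h),
    where U(h) = u if V = 'A' (the preferred candidate) and 1 - u
    otherwise, then normalize. Returns B[F = 1 | W = w]."""
    num = Z = 0.0
    for V, F, C in itertools.product('AB', (0, 1), 'AB'):
        prior = ((v if V == 'A' else 1 - v)
                 * (f if F == 1 else 1 - f)
                 * (c if C == 'A' else 1 - c))
        likelihood = 1.0 if (C if F == 1 else V) == w else 0.0
        weight = 1 + alpha * (u if V == 'A' else 1 - u)
        term = weight * prior * likelihood
        Z += term
        if F == 1:
            num += term
    return num / Z
```

Setting alpha = 0 reproduces the original Bayesian posteriors; positive alpha inflates the posterior weight of hypotheses in which the preferred candidate wins the true vote, matching the closed-form expressions above.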

Outcome desirability model

Another possible form of bias in belief updating comes from the desirability of outcomes. Specifically, people have been shown to discount evidence from undesired outcomes1. We implemented discounting of evidence by shrinking the effective likelihood of the outcome towards a uniform distribution (\(\Pr\left[ {W = {\mathrm{A}}|h} \right] = \Pr\left[ {W = {\mathrm{B}}|h} \right] = 1/2\) for all h), to a degree determined by a parameter \(\lambda \in \left[ 0, 1 \right]\):

$${\Bbb B}\left[ h \mid W \right] \propto \Pr \left[ h \right] \left( \left( 1 - \lambda \right) \Pr \left[ W \mid h \right] + \lambda /2 \right)$$

When \(\lambda = 0\), this update rule reduces to standard Bayesian inference, \({\Bbb B}\left[ {h{{{\mathrm{|}}}}W} \right] \propto \Pr \left[ h \right] \Pr \left[ {W{{{\mathrm{|}}}}h} \right]\). When \(\lambda = 1\), the outcome is completely disregarded, and the posterior belief is the same as the prior (the ½ factor is absorbed into normalization).

Desirability bias is implemented by assuming the degree of discounting depends negatively on the desirability of the observed outcome:

$$\lambda = \alpha (1 - U(W))$$

We define the utility of the official winner (W) in the same way as we define that of the true vote winner in the hypothesis desirability model: \(U\left( {\mathrm{A}} \right) = u\) and \(U\left( {\mathrm{B}} \right) = 1 - u\), where u is the participant’s preference strength towards their preferred candidate (A), scaled to lie in \(\left[ \frac{1}{2}, 1 \right]\). (Note that the uses of u in the two models do not together imply that participants’ utility functions for the true vote winner and the official winner are the same; these are two separate candidate models, embodying different proposals for how participants’ prior beliefs and preferences are related to their posterior beliefs.) As before, α determines the strength of the bias. When \(\alpha = 0\), any outcome leads to \(\lambda = 0\), and the outcome desirability model reduces to the original Bayesian model. When \(\alpha = 1\), maximally undesired outcomes (for example, a loss observed by a participant for whom \(u = 1\)) are completely disregarded (\(\lambda = 1\)). Negative values of α lead to hyper-weighting of evidence (\(\lambda < 0\)) that is stronger for undesired outcomes, and can also produce nonsensical negative beliefs (\({\Bbb B}\left[ {h \mid W} \right] < 0\)). Values of α greater than 1 lead to reverse weighting of undesired outcomes (\(\lambda > 1\)), meaning the posterior belief favours hypotheses that are inconsistent with the evidence. Therefore, only values of α in [0, 1] were considered.

The preceding assumptions lead to the following posterior beliefs (writing \(\lambda_{{{\mathrm{A}}}} = \alpha \left( 1 - u \right)\) and \(\lambda_{{{\mathrm{B}}}} = \alpha u\) for the discounting applied after each outcome):

$${\Bbb B}\left[ h \mid W = {\mathrm{A}} \right] = \frac{2 \Pr \left[ h \right] \left( \left( 1 - \lambda_{{{\mathrm{A}}}} \right) \Pr \left[ W = {\mathrm{A}} \mid h \right] + \lambda_{{{\mathrm{A}}}}/2 \right)}{Z_{{{\mathrm{A}}}}}$$

and

$${\Bbb B}\left[ h \mid W = {\mathrm{B}} \right] = \frac{2 \Pr \left[ h \right] \left( \left( 1 - \lambda_{{{\mathrm{B}}}} \right) \Pr \left[ W = {\mathrm{B}} \mid h \right] + \lambda_{{{\mathrm{B}}}}/2 \right)}{Z_{{{\mathrm{B}}}}}$$

with partition functions:

$$Z_{{{\mathrm{A}}}} = 2\left( {v - vf + fc} \right) + \alpha \left( {1 - u} \right)\left( {1 - 2v + 2vf - 2fc} \right)$$
$$Z_{{{\mathrm{B}}}} = 2\left( {1 - v + vf - fc} \right) - \alpha u\left( {1 - 2v + 2vf - 2fc} \right)$$

Marginalizing the joint posteriors gives the posteriors on F:

$${\Bbb B}\left[ F = 1 \mid W = {\mathrm{A}} \right] = \frac{2fc + \alpha \left( 1 - u \right) f \left( 1 - 2c \right)}{2\left( v - vf + fc \right) + \alpha \left( 1 - u \right) \left( 1 - 2v + 2vf - 2fc \right)}$$
$${\Bbb B}\left[ F = 1 \mid W = {\mathrm{B}} \right] = \frac{2f\left( 1 - c \right) - \alpha u f \left( 1 - 2c \right)}{2\left( 1 - v + vf - fc \right) - \alpha u \left( 1 - 2v + 2vf - 2fc \right)}$$

Similarly, the posteriors on V are:

$${\Bbb B}\left[ V = {\mathrm{A}} \mid W = {\mathrm{A}} \right] = \frac{2v\left( 1 - f + fc \right) - \alpha \left( 1 - u \right) v \left( 1 - 2f + 2fc \right)}{2\left( v - vf + fc \right) + \alpha \left( 1 - u \right) \left( 1 - 2v + 2vf - 2fc \right)}$$
$${\Bbb B}\left[ V = {\mathrm{A}} \mid W = {\mathrm{B}} \right] = \frac{2vf\left( 1 - c \right) + \alpha u v \left( 1 - 2f + 2fc \right)}{2\left( 1 - v + vf - fc \right) - \alpha u \left( 1 - 2v + 2vf - 2fc \right)}$$
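
As with the previous model, these posteriors can be computed by direct enumeration. A sketch of the outcome desirability update (our illustration; names are ours):

```python
import itertools

def fraud_posterior_outcome_desirability(v, f, c, u, alpha, w):
    """Shrink the likelihood toward the uniform 1/2 by
    lambda = alpha * (1 - U(W)), so less desired outcomes are
    discounted more. Returns B[F = 1 | W = w]; alpha = 0 recovers
    the original Bayesian model."""
    lam = alpha * (1 - (u if w == 'A' else 1 - u))
    num = Z = 0.0
    for V, F, C in itertools.product('AB', (0, 1), 'AB'):
        prior = ((v if V == 'A' else 1 - v)
                 * (f if F == 1 else 1 - f)
                 * (c if C == 'A' else 1 - c))
        likelihood = 1.0 if (C if F == 1 else V) == w else 0.0
        # Discounted likelihood: mixture of the true likelihood and 1/2.
        term = prior * ((1 - lam) * likelihood + lam / 2)
        Z += term
        if F == 1:
            num += term
    return num / Z
```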

Reduced Bayesian models

To test whether all component beliefs of the Bayesian model are needed to account for the four qualitative aspects of the direction and magnitude desirability effects (increase in fraud beliefs following a loss, decrease in fraud beliefs following a win, and a correlation with preference strength for each), we formulated two versions of reduced Bayesian models.

Fraud only

The first reduced model includes only the beliefs about fraud (F) and the side committing fraud (C), without considering the true winner (V). In this model, if fraud is committed, the side committing it wins; if no fraud is committed, the winner is determined randomly. The model is equivalent to the original Bayesian model with a fixed value of v = 0.5 (that is, maximal uncertainty regarding the true winner). The posterior fraud beliefs are thus:

$$\Pr \left[ F = 1 \mid W = {\mathrm{A}} \right] = \frac{2fc}{1 - f + 2fc}$$
$$\Pr \left[ F = 1 \mid W = {\mathrm{B}} \right] = \frac{2f\left( 1 - c \right)}{1 + f - 2fc}$$

Random beneficiary

The second reduced model includes only the beliefs about fraud (F) and the true winner (V), without considering the fraud beneficiary (C). In this model, the candidate benefiting from fraud (if any) is determined randomly. The model is equivalent to the original Bayesian model with a fixed value of c = 0.5. The posterior fraud beliefs are thus:

$$\Pr \left[ F = 1 \mid W = {\mathrm{A}} \right] = \frac{f}{2v - 2vf + f}$$
$$\Pr \left[ F = 1 \mid W = {\mathrm{B}} \right] = \frac{f}{2 - 2v + 2vf - f}$$
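
Both reduced models have simple closed forms; a sketch implementing the four expressions above (our code, with the same variable coding as the earlier sketches):

```python
def fraud_posterior_fraud_only(f, c, w):
    """Fraud-only reduced model: the full Bayesian model with v
    fixed at 0.5 (no beliefs about the true winner)."""
    if w == 'A':
        return 2 * f * c / (1 - f + 2 * f * c)
    return 2 * f * (1 - c) / (1 + f - 2 * f * c)

def fraud_posterior_random_beneficiary(v, f, w):
    """Random-beneficiary reduced model: the full Bayesian model
    with c fixed at 0.5 (no beliefs about who benefits from fraud)."""
    if w == 'A':
        return f / (2 * v - 2 * v * f + f)
    return f / (2 - 2 * v + 2 * v * f - f)
```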

Model comparison

The free parameter of the hypothesis desirability and outcome desirability models, α, was optimized for each model on the basis of the sum of squared errors (SSE) between the empirical and predicted fraud belief updates across all participants in the follow-up sample. All models were compared on their ability to reproduce the qualitative observed desirability effects (increase in fraud beliefs following a loss, decrease following a win and scaling of both with preference strength) and on their SSE (transformed to RMSE in Supplementary Table 13). Because its optimized α was different from 0, we further tested whether the outcome desirability model significantly improves the fit compared with the Bayesian model, in two ways. First, we performed bootstrapping with 10,000 resamples of the follow-up sample and optimized α, as explained above, for each resample. We then computed the 95% CI of the bootstrapped α with the percentile method55 to test whether the value 0, which represents the Bayesian model, is included in the CI. Second, we used a nested-model F-test to compare the explained variance of the two models.
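
A sketch of this fitting and bootstrap procedure (our code; it assumes a caller-supplied function mapping α to per-participant predicted updates, and uses SciPy’s bounded scalar minimizer; the bounds should be set per model, for example [0, 1] for the outcome desirability model):

```python
import numpy as np
from scipy.optimize import minimize_scalar

def optimize_alpha(predicted_fn, empirical, idx=None, bounds=(0.0, 1.0)):
    """Fit alpha by minimizing the SSE between empirical and predicted
    fraud-belief updates over the participants in idx (all by default).
    predicted_fn(alpha) returns one predicted update per participant."""
    if idx is None:
        idx = np.arange(len(empirical))
    sse = lambda a: np.sum((empirical[idx] - predicted_fn(a)[idx]) ** 2)
    return minimize_scalar(sse, bounds=bounds, method='bounded').x

def bootstrap_alpha_ci(predicted_fn, empirical, n_boot=10_000, seed=0):
    """Percentile 95% CI for alpha from resampling participants with
    replacement. If 0 falls outside the CI, the desirability bias
    significantly improves on the pure Bayesian model (alpha = 0)."""
    rng = np.random.default_rng(seed)
    n = len(empirical)
    alphas = [optimize_alpha(predicted_fn, empirical, rng.integers(0, n, n))
              for _ in range(n_boot)]
    return np.percentile(alphas, [2.5, 97.5])
```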

Update of beliefs in the legitimate winner

To estimate the proportion of participants who are considered ‘election-proof’, we calculated for each participant the predicted updates of beliefs about V and F in the loss scenario. We then computed how much of the overall update (the sum of the absolute updates regarding V and F) was allocated to each of these two beliefs, and report the percentage of participants for whom more than 90% of the overall update was attributed to fraud rather than to the true vote. The estimates from the outcome desirability model and the original Bayesian model are identical. Note that the prior beliefs regularized to [0.01, 0.99] were used, as in all other analyses with the models.
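
A sketch of this criterion (our code; dv and df denote a participant’s predicted updates of the true-vote and fraud beliefs in the loss scenario):

```python
def election_proof(dv, df, threshold=0.9):
    """A participant counts as 'election-proof' if more than 90% of
    the total absolute predicted update (|dv| + |df|) in the loss
    scenario falls on the fraud belief rather than the true-vote
    belief."""
    total = abs(dv) + abs(df)
    return total > 0 and abs(df) / total > threshold
```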

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.