Trusting a stranger is a prerequisite for initiating any kinds of social relationships, especially those with potential mating partners1,2,3. Besides its obvious benefits for our personal well-being4, this particularly intimate sort of human social bond involves significant risks and interpersonal dependency, as the partner might choose to disappoint expectations or abuse one’s own investments in the relationship (e.g., emotions, time, resources5,6). To minimize such risks, it is necessary that we accurately and quickly infer another’s trustworthiness. In doing so, humans have evolved brain modules specialized in processing facial features, which are one of the richest sources of information in signaling social intentions7,8. Two facial features crucially affecting the decision whether or not to trust other-sex individuals are attractiveness and threat9,10,11,12,13. As sexual-selection forces operate differentially across sexes14,15,16,17,18, trust behavior in a mating context might likewise be differentially impacted by attractiveness and threat in men and women. We find the shortage of empirical research on investigating sex-specific effects in actual social behavior involving financial consequences to participants (e.g.,19,20,21,22,23,24,25,26) surprising (for examples from other domains, see27,28,29,30,31). We therefore decided in this study to systematically investigate the influence of attractiveness and threat in single men and women by applying a refined, incentivized trust paradigm with facially manipulated trustees of the other sex.

Attractiveness is one of the most important factors when it comes to selecting a potential mating partner32,33. Across sexes, people exhibit more prosocial behavior towards attractive than unattractive individuals (e.g., trust34; cooperation35; cooperation and punishment36). However, there is also evidence of sex-specific effects, with stronger attractiveness effects in other- than same-sex interactions, and in males than females37; but see also32. Evolutionary accounts like the “mate selection theory” state that men are particularly apt to approach attractive women, as attractiveness signals information about youth and good reproductive fitness, while women pay more attention to other “mating characteristics” signaling the potential ability to provide resources for future offspring38,39. So far, however, there is no evidence whether such sex-specific effects of attractiveness extend to the trust-behavior domain within a potential mating context. In the present study, we try to close this gap by investigating whether the facial attractiveness of other-sex individuals may modulate trust behavior displayed towards them.

Another facial feature that is key to assessing a potential partner’s trustworthiness is threat. Both men and women demonstrate lower trust levels towards individuals possessing threatening facial features, probably because threatening faces signal both harmful intentions and the capability of causing harm (i.e., dominance11,40). Research on the neurobiological foundations of trust further corroborates the key role of threat cues in trust behavior. Brain regions involved in detecting threat (e.g., amygdala, insula) are crucially involved in assessing another person’s trustworthiness41,42, and intranasal administration of the hormone oxytocin increases trust in men by inhibiting threat-related brain activity43,44,45. There seem to be interesting sex differences in these neurohormonal effects46,47,48 as well as in the general sensitivity to threat cues. Women display greater attentional bias towards threat cues, which may contribute to their higher risk for developing an anxiety disorder49,50,51,52. From an evolutionary perspective, abusing trust is accompanied by even higher relative costs for women due to their significantly higher parenting investments (i.e., women possess relatively few ovules and invest more time in gestation and lactation53 and their weaker physical strength makes them potential victims of male aggressive behavior54,55,56,57). Consequently, threat cues might be relatively more important for women than for men. In sum, whereas both sexes seem to lose trust in response to threatening faces, we lack empirical research on potential sex differences in these effects in other-sex interactions.

In this study, we investigated potential sex-specific effects of both attractiveness and threat on trust behavior in 93 heterosexual participants (46 women). To control for confounding changes in females’ preferences for male faces’ characteristics across the menstrual cycle58,59, we tested all women during their luteal phase and excluded women taking oral contraceptives. In addition to the solid evidence that women in the luteal phase are most similar to men in their stress response60, there is evidence for the similar processing of emotional faces of men and women during the luteal phase61. Participants took part in an incentivized trust game with individuals of the other sex represented by photos revealing the low- or high-intensity facial features attractiveness and threat, respectively. By using these face stimuli, our research approach aims to expand upon previous research studying incentivized social behavior in interactions between anonymous individuals (e.g.,62,63,64, and also see65). In line with previous findings, we hypothesized that both female and male participants would transfer more money to attractive than to unattractive individuals (= more trust), and to unthreatening compared to threatening individuals. Moreover, we aimed to examine unexplained sex differences in the relative importance of facial attractiveness and facial threat with regard to trust behavior. On the basis of the evolutionary biological assumption that sexual selection has shaped distinct strategies for assessing attractiveness and threat in males and females, we further hypothesized that threat cues would carry higher relative importance than attractiveness cues in women compared to men.


Facial phenotypes and trust

First, we analyzed whether female and male participants trusted attractive other-sex trustees more often than unattractive ones. As expected, we detected a significant main effect of facial attractiveness (F1, 91 = 75.854, p < 0.001, ηp2 = 0.455), indicating that participants generally trusted attractive faces more often. Second, we tested whether female and male participants trusted threatening faces less often than unthreatening faces. The significant main effect of facial threat (F1, 91 = 128.810, p < 0.001, ηp2 = 0.586) confirmed this hypothesis. There was no significant interaction between facial threat and facial attractiveness (F1, 91 = 2.636, p = 0.108, ηp2 = 0.028). Thus, both facial features, attractiveness and threat, revealed a significant influence on trusting behavior towards potential other-sex partners in men and women.

Sex, facial phenotypes, and trust

There was no significant main effect of sex on either trust across phenotypes (F1, 91 = 0.276, p = 0.601, ηp2 = 0.003), or on trust for the four different phenotypes (high facial attractiveness and low facial threat: t91 = 1.415, p = 0.161; low facial attractiveness and low facial threat: t91 = 0.955, p = 0.342; high facial attractiveness and high facial threat: t91 =  − 0.706, p = 0.482; low facial attractiveness and high facial threat: t91 = 0.456, p = 0.649; see Fig. 1). To analyze sex differences in the effects of attractiveness and threat on trust behavior, we next looked at those factors’ interaction effects, and found no significant interactions between participants’ sex and facial attractiveness (F1, 91 = 1.133, p = 0.290, ηp2 = 0.012), or between participants’ sex and facial threat (F1, 91 = 3.878, p = 0.052, ηp2 = 0.041). However, we observed a significant interaction between facial attractiveness × facial threat × participants’ sex (F1, 91 = 4.562, p = 0.035, ηp2 = 0.048). Separate ANOVAs for men and women showed that the interaction effect of facial attractiveness × facial threat was significant in women (F1, 45 = 6.097, p = 0.017, ηp2 = 0.119), but not in men (F1, 46 = 0.155, p = 0.696, ηp2 = 0.003). Following up on this interaction, we found that the difference in trust decisions between the dimensions low and high facial attractiveness was greater for low threat (∆M = 0.142, SD = 0.176, p < 0.001), than high threat (∆M = 0.064, SD = 0.164, p = 0.011). Comparing the main effects in both ANOVAs in women, we found that the influence (i.e., the effect size) of facial threat (F1, 45 = 79.695, p < 0.001, ηp= 0.639) was stronger than that of facial attractiveness (F1, 45 = 28.124, p < 0.001, ηp= 0.385; statistical comparison of effect sizes: Cohen’s q = 0.371; medium effect). In men, on the other hand, we found that there was no difference between the influence of facial threat (F1, 46 = 49.352, p < 0.001, ηp= 0.518) and facial attractiveness (F1, 46 = 49.646, p < 0.001, ηp= 0.519; statistical comparison of effect sizes: Cohen’s q = 0.001; no effect). Hence, in men, attractiveness and threat equally affected trust, while in women, threat had a stronger effect on trust than attractiveness.

Figure 1
figure 1

Effects of attractiveness and threat on trust behavior. Phenotypes from left to right are: ‘high attractiveness and low threat’, ‘low attractiveness and low threat’, ‘high attractiveness and high threat’, ‘low attractiveness and high threat’. Left: Trust decisions in percent for female participants. All differences are statistically significant (all p < 0.011). Right: Trust decisions in percent for male participants. All differences are statistically significant (all p < 0.001), except for the difference between the faces of the phenotypes ‘low attractiveness and low threat’ and ‘high attractiveness and high threat’. Please note that the symbolic faces in this figure represent schematic representations of the actual stimuli that were derived from photos of real human faces (for details, see Fig. 2). Bars represent standard errors; dots represent the mean trust decision in percent for each participant and each phenotype. For mean values and standard deviations of trust decisions, rating values and response times, see Tables S2S4, respectively, in the Supplementary Material.

Sex, facial phenotypes, and stimulus perception

To ensure that the sex difference in the trust decisions between the phenotypes was not caused by different stimulus perceptions across female and male participants, we analyzed the ratings of attractiveness perception and threat perception. As the stimuli were rated separately for attractiveness and threat, we conducted two separate ANOVAs: one with the mean ratings for attractiveness perception as dependent variable, and the other with the mean ratings for threat perception as dependent variable. We identified no interaction between participants’ sex × facial attractiveness × facial threat in either ANOVA (ANOVA for rating attractiveness: F1, 91 = 1.804, p = 0.183, ηp2 = 0.019; ANOVA for rating threat: F1, 91 = 0.421, p = 0.518, ηp2 = 0.005; for details see Supplementary Material). In sum, there was no interaction effect of attractiveness, threat, and sex in stimulus perception, suggesting that the observed difference in trust behavior between female and male participants was not caused by different attractiveness or threat perceptions in the two groups.


In this study, we investigated the sex-specific effects of the facial features attractiveness and threat on trusting individuals of the other sex. Using a trust game with actual financial consequences for participants, we systematically varied the intensities of facial attractiveness and facial threat of other-sex trustees. We found that both male and female participants exhibited more trust towards attractive and unthreatening-looking potential partners. However, the relative importance of these two facial features differed in men and women: compared to men’s behavior, women’s trust behavior was more strongly influenced by a threatening-looking than by an attractive-looking individual. Importantly, this difference in social behavior was not driven by a sex difference in social perception, suggesting that women value threat more than attractiveness in their trust decisions than do men, even though their perception of these features is similar.

By experimentally modulating both facial attractiveness and threat, our results expand upon earlier studies that investigated the effects of these features in social interactions separately from one another. In line with previous research, we detected main effects of attractiveness (e.g.34) and threat (e.g.11) on trust in both sexes. We failed to observe that these effects differed in strength across sexes (as some empirical and theoretical research has suggested, e.g.37,50). However, a sex difference did emerge in the relative importance of both features, with women in their luteal phase placing more emphasis on threat than on attractiveness, and men valuing both features equally. More specifically, women trusted attractive but threatening-looking trustees less than unattractive but unthreatening ones, whereas men demonstrated indistinguishable levels of trust towards both phenotypes. In other words, a partner’s high attractiveness could “compensate for” high threat in male but not female participants. From an evolutionary perspective, being relatively more wary of potential threat than vulnerable to attractiveness cues in a mating context might be a highly adaptive mechanism for women, because having one’s trust abused is more costly, and approaching attractive men of “good health” is less beneficial for them66,67,68. Moreover, empirically, men (but not women) given face cues related to threat (i.e., greater facial width) abuse trust more often3 and (young) men perceived as more attractive by women reveal lower degrees of prosocial behavior69,70. Although the accuracy of social inferences from facial cues remains equivocal and people certainly refine their quick facial assessments of another’s trustworthiness on the basis of other cues and experiences during an interaction71, these findings suggest that it benefits women to prioritize threat over attractiveness in trust decisions that rely on the partner’s facial appearance.

When searching for explanations for this sex difference we identified in the relative importance of attractiveness and threat on trust, it is important to keep in mind that none of the women in this study were taking hormonal contraceptives, and that they were participating in this experiment during their menstrual cycle’s luteal phase. This phase, which starts after ovulation, is characterized by high levels of progesterone and to a lesser extent estradiol72,73. Interestingly, high levels of both hormones are known to be associated with enhanced sensitivity to threat-related facial cues in women74,75. As the highest levels of these hormones are observed during the mid-luteal phase when, potentially, conception has already taken place, and during pregnancy76, one could speculate that these steroids trigger behavior that protects fetal development (for a similar argument; see46). In a similar vein, the “dual sexuality” hypothesis argues that women may prefer sexually attractive men on days before ovulation when they are looking for conception partners, whereas they prefer men who appear unthreatening during their luteal phase when they are seeking high-investing partners with father qualities77,78,79. Indeed, our study provides evidence that women in their luteal phase are more cautious than men, or less willing, to trust potentially threatening but attractive partners, perhaps because the price of having one’s trust abused is particularly high during this phase of the menstrual cycle when pregnancy is possible. Following the dual sexuality hypothesis, one could speculate that facial attractiveness might become more important for women when making trust decisions during the days before ovulation. Future research could systematically test this hypothesis (see80, for example).

So far, we have interpreted the present study’s findings as evidence demonstrating that male and female participants evaluate attractiveness and threat differentially in trusting other-sex individuals (= participant’s sex effect). However, one could also argue that our findings demonstrate that male and female faces are evaluated differently (= face stimulus’s sex effect). As the present study investigates approach behavior in other-sex interactions, female participants always interacted with male-face stimuli, and male participants always interacted with female-face stimuli, rendering it difficult to disentangle the participant’s sex from the stimulus’ sex effect. Interestingly, research on trait judgments from faces has revealed that judgments of female in comparison to male faces are less differentiated with regard to the effects of distinct facial characteristics and more based on a general impression of the face’s valence81. Transferring these findings to the present study, male participants did not distinguish between faces signaling ambiguous information about trustworthiness (trust level: unattractive and unthreatening = attractive and threatening), because they might evaluate ambiguous female faces less differentiatedly and assign intermediate valence to such faces. In contrast, female participants, who distinguished between faces offering ambiguous information (trust level: unattractive and unthreatening > attractive and threatening), might evaluate facial attractiveness and threat independently from one another, leading to a more differentiated assessment of trustworthiness towards ambiguous faces. To better distinguish the effects of participants’ and stimulus’ sex, future studies could investigate both same-sex and other-sex interactions.

In sum, the present study provides first evidence of a sex-specific interacting effect of a potential partner’s facial attractiveness and threat on trust behavior. From an evolutionary perspective, the fact that women in their luteal phase prioritize threat over attractiveness might be explained by their higher parental investment and the high price of being abused when they might get pregnant82. Future studies might include women in their pre-ovulatory phase (validated by blood assays to measure gonadal steroid) to observe whether women do indeed prioritize attractiveness over threat during this menstrual phase, thereby contributing to the ongoing debate about whether women’s preferences for their partner’s attractiveness and threat change across the menstrual cycle83,84. Furthermore, we expect our findings to stimulate research refining such explanations by illuminating the possible role of non-evolutionary explanations for sex differences (e.g., sex stereotypes85, societal influences86), or by revealing the role of individual differences in traits (e.g., general dispositional trust87; attachment styles88) or states (e.g., recent trust-related experiences such as trust violations89) and their interaction with sex effects in affecting trust in other-sex interactions. It could also prove fascinating to investigate the effects of attractiveness and threat in couples in both intra- and extra-couple interactions and in homosexual male and female couples (e.g.90), as well as non-mating contexts like approaching new potential friends. Overall, this research may help us better understand which factors impact trust as a fundamental ingredient in intimate social relationships.



We recruited a sample of 93 healthy participants (47 men, 46 women; age: M = 22.44, SD = 3.85, range: 18–34). All participants were single (i.e., not in a couple relationship91) and attracted to the other sex (rated five or higher on a 7-point Likert scale from ‘not at all’ [1] to ‘absolutely’ [7]). Exclusion criteria were alcohol, nicotine or drug abuse, studying or having a degree in psychology or economics, current or previous history of psychiatric disorders, or insufficient fluency in the German language. All female participants took part in the experiment during their menstrual cycle’s luteal phase59,60 and were free of hormonal contraceptives. The luteal phase was determined by self-report: the earliest testing day was determined by adding half of the maximum cycle length duration, plus a 2-day buffer to the first day of menstruation, and to determine the latest testing day we added the minimal cycle length duration to the first day of menstruation (see Supplementary Material for further details). The Ethics Committee of the University of Freiburg approved this study, which was conducted according to the principles expressed in the Declaration of Helsinki.


We recruited participants using flyers and bulletins. Participants had to fill out an online screening to assess exclusion criteria92. We telephoned eligible participants to offer appointments for the experimental session, which took place in a group laboratory with up to 12 participants taking part simultaneously. Participants provided written informed consent to participate in the study and have their photo taken. For the picture displayed in Fig. 3B, we furthermore obtained informed consent of that participant for publication of identifying images. To reduce demand characteristics, we included a cover story in the study. We told participants that their photos would be utilized in the trust game paradigm (see below) to ensure that participants believed that they would later be interacting with real participants. Importantly, none of the participants expressed any disbelief about the genuineness of the trust game after the experiment. To keep the photos taken in our laboratory comparable to the final set of face stimuli taken from the Basel Face Database93, participants were sent into another room set up for professional photography. All participants wore a black T-Shirt and were instructed to look into the camera with a neutral expression. Afterwards, participants received instructions for the trust game and performed practice trials. Participants played 88 rounds of the trust game, each with a different other-sex individual represented by a color photo, followed by a stimuli perception paradigm, in which participants rated the photos they saw in the trust game for attractiveness and threat. Both paradigms took 10–15 min. At the end of the study, participants received their reimbursement of 15 Euros basic payment plus on average 7.03 Euros additional payoff for the trust game (SD = 0.566, range = 5.02–8.70; for details, see below).

Face stimuli

We created two sets of stimuli: one with female faces that were presented to male participants, and one with male faces presented to female participants. We strictly used neutral faces not demonstrating any emotional states. Faces were manipulated to reveal two features of low or high intensity, namely attractiveness and threat (e.g., ‘high threat and low attractiveness’), resulting in four phenotypes (see Fig. 2). Faces were taken from the Basel Face Database93, as well as from pictures shot by our own photographer. Face stimuli were generated by applying a data-driven, computational approach that captured the variance in facial structure that signals specific social attributions42,94. This approach enabled us to parametrically manipulate the values of facial attractiveness and threat based on previously collected other-sex ratings. Based on those ratings, vectors were created and applied to the photographed faces with a neutral facial expression, which caused faces to have high or low values on attractiveness and high or low values threat (e.g., ‘high threat’ and ‘low attractiveness’; also see95,96). These newly-created faces were then rated by 53 participants (20 women, 33 men, age: M = 24.17, SD = 3.57) in a pilot study. Afterwards, we selected the 22 distinct faces most characteristic of each phenotype, resulting in 88 faces with subtle attractiveness and threat manipulations per stimulus set so that the phenotype should not have been apparent to the participants. As required for the purpose of this experiment, all t-tests comparing threat and attractiveness ratings of faces with the same intensity (i.e., low vs. low or high vs. high) were not significant (p > 0.078), whereas all t-tests comparing threat and attractiveness ratings of faces with different intensity (i.e., low vs. high) were significant (p < 0.001; for details, see Supplementary Material).

Figure 2
figure 2

Drawings of representative male (left) and female stimulus set (right). There were four different phenotypes with 22 faces each revealing either low- or high-intensity ‘attractiveness’ and ‘threat’. In total, every set contained 88 faces, each generated from a unique face identity. The symbolic faces in this figure represent prototypes by which to illustrate the face phenotypes, facial symmetry codes attractiveness (asymmetric = low attractiveness), and mouth codes threat (showing teeth = high threat). Please note that this figure illustrates schematic representations of the stimulus categories for visualization purposes, and that the actual stimuli were derived from photos of real human faces exhibiting neutral expressions (i.e., displaying no teeth or other emotionally-loaded expressions; see Fig. 3B for a stimulus example).

Trust game paradigm

Participants took part in a slightly modified version of the trust game97,98 (for details on screen presentation durations, see Fig. 3). Participants were assigned the role of the investor, and, in each round, saw a face from the other sex’s stimulus set representing the trustee. Both investor and trustee received an initial endowment to ensure that no social preference other than trust (e.g., inequality aversion, altruism) would influence the participant’s decision about whether to transfer money to the trustee. Participants could then decide whether to keep their endowment, in which case both the investor and trustee received 14 MU, or to transfer their endowment. If they transferred their endowment, the trustee could either keep everything for him/herself (‘not trustworthy’), in which case the investor received 0 MU and the trustee received 60 MU, or transfer back half of it (‘trustworthy’), in which case both the investor and trustee received 30 MU. We chose a payoff structure (see Fig. 3) which triggered trust-decision rates of approximately 50% in previous studies without face stimuli99, thus enabling both the up- and down-modulation of trust decision rates by varying facial features. At the end of the study, participants received their payoff according to the following exchange ratio: 100 MU = 0.50€. Participants were told that a subgroup of participants had been randomly selected as trustees who had already made their decision whether to return money. In fact, to determine payments, we randomly drew trustee decisions from a distribution of decisions collected during our laboratory’s previous studies43. Participants learned about trustees’ decisions at the end of the experiment to avoid any influence of reciprocal or reputational considerations of previous interactions on trust behavior. This paradigm was programmed using Presentation software (Version 18.0, Neurobehavioral Systems, Inc., Berkeley, CA).

Figure 3
figure 3

Modified version of the trust game. (A) Payoff structure. Participants were assigned the role of the investor. If the investor decided not to trust the trustee, both investor and trustee received 14 MUs. If the investor decided to trust, the trustee could either keep everything for him/herself (0/60) or transfer half of the points back to the investor (30/30). (B) Example trial with screen durations in milliseconds (ms). First, participants saw a blank screen for 400–600 ms, followed by fixation cross for 1000–1500 ms. Then, participants saw a photo of the trustee until they made their decision, which was then displayed for 1000–1250 ms.

Stimuli perception paradigm

Participants rated all 88 faces for attractiveness and threat, respectively, on a 7-point Likert scale from ‘very unattractive’/‘very unthreatening’ to ‘very attractive’/‘very threatening’. The presentation order of the photos was randomized. In total, there were four blocks with 44 photos, respectively. At the beginning of each block, participants learned which facial feature they should rate (attractiveness or threat). Whether participants rated attractiveness or threat in block one and three was also randomized, and followed by the other feature in the subsequent block (i.e., block two and four, respectively).

Statistical analysis

All analyses were conducted with the software SPSS (27th edition). We first calculated the mean trust for each phenotype by dividing the number of decisions to trust a specific phenotype by 22, i.e., the total number of trust decisions for each phenotype. To test the influence of the facial features attractiveness and threat on mean trust, and whether there were any differences between men and women in the trust decisions, we conducted a repeated-measures ANOVA with participants’ sex (women and men) as a between-subject factor, and facial attractiveness (low vs. high) and facial threat (low vs. high) as within-subject factor. We then compared the effect sizes of the main effects facial attractiveness and facial threat to investigate any sex differences in the importance of both features. For that purpose, we transformed the ηp2 effect size of each main effect to the effect size r and then compared those within the sexes100,101.

The Stimuli Perception Paradigm was used to show that any differences in the trust game decisions between sexes were not due to differing attractiveness or threat perceptions of the distinct other-sex stimulus sets. We first calculated mean ratings for facial attractiveness perception and facial threat perception by averaging the ratings of the 22 faces of each phenotype. Then, we conducted two repeated measures ANOVAs, one for the mean ratings of attractiveness perception and one for the mean ratings of threat perception as dependent variable. We included participants’ sex (women and men) as a between-subject factor, and facial attractiveness (low vs. high) and facial threat (low vs. high) as within-subject factors.