Gaze direction and face orientation modulate perceptual sensitivity to faces under interocular suppression

Lanfranco, Renzo C.; Stein, Timo; Rabagliati, Hugh; Carmel, David

doi:10.1038/s41598-022-11717-4

Download PDF

Article
Open access
Published: 10 May 2022

Gaze direction and face orientation modulate perceptual sensitivity to faces under interocular suppression

Renzo C. Lanfranco^1,2,
Timo Stein³,
Hugh Rabagliati¹ &
…
David Carmel^1,4

Scientific Reports volume 12, Article number: 7640 (2022) Cite this article

2040 Accesses
3 Citations
8 Altmetric
Metrics details

Subjects

Abstract

Faces convey information essential for social interaction. Their importance has prompted suggestions that some facial features may be processed unconsciously. Although some studies have provided empirical support for this idea, it remains unclear whether these findings were due to perceptual processing or to post-perceptual decisional factors. Evidence for unconscious processing of facial features has predominantly come from the Breaking Continuous Flash Suppression (b-CFS) paradigm, which measures the time it takes different stimuli to overcome interocular suppression. For example, previous studies have found that upright faces are reported faster than inverted faces, and direct-gaze faces are reported faster than averted-gaze faces. However, this procedure suffers from important problems: observers can decide how much information they receive before committing to a report, so their detection responses may be influenced by differences in decision criteria and by stimulus identification. Here, we developed a new procedure that uses predefined exposure durations, enabling independent measurement of perceptual sensitivity and decision criteria. We found higher detection sensitivity to both upright and direct-gaze (compared to inverted and averted-gaze) faces, with no effects on decisional factors. For identification, we found both greater sensitivity and more liberal criteria for upright faces. Our findings demonstrate that face orientation and gaze direction influence perceptual sensitivity, indicating that these facial features may be processed unconsciously.

No influence of eye gaze on emotional face processing in the absence of conscious awareness

Article Open access 07 November 2019

Holistic processing of gaze cues during interocular suppression

Article Open access 11 May 2022

Guilt-inducing interaction with others modulates subsequent attentional orienting via their gaze

Article Open access 01 April 2023

Introduction

Facial features provide essential information about others’ mental states and intentions, and are remarkably effective at capturing attention¹ even from early infancy^2,3. A number of reports have even claimed that some facial features can be processed unconsciously^{4,5,6,7,8,9,10}, with the implication that faces might be special stimuli, whose processing is prioritised to the extent that it does not require awareness. However, concerns about these findings have been raised both in terms of their replicability and interpretation^{11,12,13,14,15,16,17,18,19}. In particular, and as explained in detail below, even the findings that replicate may not in fact reflect detection sensitivity to facial features, but instead could reflect differences in the biases and criteria that participants use during face processing tasks. This latter concern is particularly acute because the most popular recent method used to study unconscious face processing, the Breaking Continuous Flash Suppression technique (b-CFS), is unable to distinguish sensitivity from criterion and response bias. Do the configural features of a face and its gaze direction affect how faces gain access to awareness or just post-perceptual factors such as decision criteria? Here, we address this issue by focusing on two specific claims about facial features—first, that upright faces reach awareness faster than inverted faces, and second, that faces with direct gaze reach awareness faster than faces with averted gaze.

We test these claims using a more comprehensive method, which replaces response times (RTs) with measures based on signal-detection theory; to do so, we combine interocular suppression with the psychophysical method of constant stimuli, avoiding the problems inherent in b-CFS and allowing us to assess how face orientation and gaze direction modulate perceptual sensitivity to faces initially suppressed from awareness. If face configuration and gaze direction affected perceptual sensitivity, this would indicate in our method that these facial features affect basic perception, perhaps mediated by unconscious processing. However, if face configuration and gaze direction influenced decisional criteria only, this would imply that these effects emerged at later processing stages, thus requiring conscious awareness.

A rich body of studies has claimed that facial features such as gaze direction^9,20, emotional expression^10,21,22,23, familiarity⁷, and attractiveness²⁴ can be processed unconsciously. To render images invisible, these studies have employed Continuous Flash Suppression (CFS), a strong interocular suppression procedure²⁵, in which a stimulus presented to one eye is suppressed from awareness by Mondrian-like masks flashed to the other eye. In the b-CFS variant, participants are asked to provide a response as soon as the invisible stimulus breaks through suppression into awareness²⁶, with the assumption that stimuli which are processed with higher priority will break through into awareness faster²⁷. Previous work using this procedure has found that faces break through suppression faster when shown in upright orientation than in inverted orientation⁸, when expressing fear compared to a neutral expression¹⁰, or when making eye contact compared to looking away⁹.

Although the b-CFS paradigm has been widely used to provide evidence for differential access of visual features to awareness, its reliance on RTs raises some concerns. Importantly, RTs are a measure of overall processing speed, encompassing the many processes that go into producing a speeded (not just correct) response. RTs are not an isolated measurement of perceptual sensitivity, and thus using them precludes conclusions that make specific claims about perceptual sensitivity to suppressed stimuli. A crucial concern is that differences in detection times could reflect differences in decision criteria rather than differences in perceptual sensitivity. When suppressed stimuli break into awareness they often do so gradually, which means that participants have to make a decision as to whether—and when—to report a partially-perceived stimulus. Their criteria for making these decisions may vary by stimulus category. For example, even if perceptual sensitivity for upright and inverted faces was identical, upright faces might be reported faster simply because they are associated with a more liberal criterion for the decision to press a key, perhaps because they look more familiar, resulting in greater confidence. Similarly, even if perceptual sensitivity for direct-gaze and averted-gaze faces was identical, direct-gaze faces might be reported faster simply because they are associated with a more liberal criterion for the decision to press a key, perhaps due to their personal social relevance rather than because they are more visible. Alternatively, participants may be inclined to visually explore a certain stimulus category more exhaustively than another before deciding to commit to a response, thus leading to a more conservative criterion and thereby to a slower response. The implication of this is that differences in breakthrough times may not be due to differential sensitivity to stimulus categories but rather to differential decision criteria (i.e. the willingness to report a signal). This potential confounding effect of decision criteria could have major theoretical implications—if the face-inversion effect and/or the eye-contact effect are due to differences in decision criteria rather than perceptual sensitivity, it would suggest that social cognitive processes that rely on face processing may require some degree of conscious awareness to unfold. While differences in decision criteria may inform about implicit preferences or expectations, only differences in perceptual sensitivity can tell us about the ability of different stimulus categories to overcome suppression from awareness.

We are not the first to note that criterion issues are a concern in b-CFS studies, and indeed some b-CFS studies have tried to control for this problem. For instance, some researchers have included a non-rivalrous control condition (where the target stimuli are shown binocularly or monocularly on top of the flashing CFS masks) with the assumption that post-perceptual effects, such as differences in decision criteria, should have similar effects on suppressed and visible stimuli^{8,19,28,29,30,31,32,33,34}. The underlying reasoning is that if a non-rivalrous condition emulates all processes that are not CFS-specific but contribute to differences in RTs, any larger differences between stimulus categories found in the rivalrous b-CFS condition (compared to the visible control condition) should index unconscious processing differences. However, non-rivalrous conditions do not effectively control for decision criteria. For example, targets in non-rivalrous control conditions are more easily discernible from the mask³⁵, meaning there is less uncertainty about them; and the level of uncertainty is known to affect decision criteria³⁶ and may do so differentially for different stimulus categories. Visible conditions therefore differ in a substantive way from CFS conditions, meaning they are not valid controls.

Another proposed method for controlling for differences in decision criteria is to ask participants to perform an orthogonal task, such as reporting a stimulus feature that is irrelevant to the experimental manipulation (e.g. Gayet et al.³⁷; Salomon et al.³⁸). This approach assumes that if participants do not need to identify or make decisions about the experimentally critical but task-irrelevant feature, their RTs will reflect processing that is unaffected by differences in identification performance or decision criteria. However, this assumption is unjustified: Participants may still perceive (and thus make decisions about) the task-irrelevant feature, and their choice of how long to accumulate information on each trial may still be affected by their internal criterion for responding to that feature, or their ability to identify it, irrespective of its relevance for the task. Crucially, we cannot tell what factors will affect participants’ decision in any paradigm where they can freely choose how much perceptual evidence to gather (i.e. how long to look at the stimulus in a trial) before responding.

To assess perceptual sensitivity independently of decision criterion and dissociate detection from identification, we must use a method that does not rely on RTs (a measure of participants’ willingness to commit to a response), but rather on measures collected under conditions where perceptual evidence (e.g. exposure duration in a trial) is controlled by the experimenter. Here, we developed and tested a method that combines CFS with the method of constant stimuli, and thus does not suffer from the above problems. We used this method to test two well-established b-CFS findings that have been successfully replicated: the face-inversion effect and the eye-contact effect.

Even without suppression, upright faces are easier to recognise than inverted faces^{39,40,41,42,43}. In line with this, the first published b-CFS study found that upright faces overcome suppression faster than inverted faces⁸. This face-inversion effect has been repeatedly replicated with b-CFS procedures^{9,14,28,44,45} and has been interpreted as evidence of unconscious holistic face processing. Similarly, faces that make eye contact appear to be processed in a special way. Without suppression, for example, eye contact draws attention towards the face, whereas averted gaze draws attention towards the gaze’s direction^46,47,48,49. Multiple studies have shown that eye contact also promotes social learning from a very young age^50,51,52,53. Using b-CFS, Stein et al.⁹ reported that suppressed human faces with direct gaze were detected faster than faces with averted gaze, suggesting a processing advantage driven by eye contact (the same study also replicated the aforementioned face-inversion effect). Subsequently, a number of other studies have supported the idea that direct gaze faces are (unconsciously) prioritised either by measuring breakthrough times directly^28,54,55 or by measuring neural markers before the faces overcome suppression^20,31,56.

In some of these studies, the task—to report stimulus location (on the left or right side of the screen)—was orthogonal to the hypothesis-relevant stimulus category (e.g. direct/averted gaze; Chen and Yeh²⁰; Stein et al.⁹). However, as detailed above, shorter breakthrough times to direct-gaze faces do not necessarily reflect higher sensitivity, but could instead be due to a more liberal decision criterion: observers may simply require less evidence (and thus less time) for deciding to report that they have seen a face when its gaze is direct rather than averted. Thus, it is still unclear whether upright and direct-gaze faces break suppression faster. To ascertain this, it is necessary to demonstrate greater perceptual sensitivity to CFS-suppressed upright (compared to inverted) and direct-gaze (compared to averted) faces, under conditions that limit the influence of criteria over participants’ decisions.

To accomplish this, we presented CFS-suppressed stimuli for a range of predefined durations. On each trial, participants saw a face with direct or averted gaze that was presented in upright or inverted orientation. Following each display, participants reported the face’s location (left or right of fixation) and its identity (direct or averted gaze), as accurately as possible, with no speed pressure. We used signal detection analyses to establish how stimulus duration and type affected sensitivity and decision criteria for both of these reports. A similar stimulus-presentation approach was employed by Stein et al.³⁵ (Experiment 3), who used four predetermined exposure durations and found that participants showed higher accuracy in reporting the location of upright versus inverted faces at all durations. Notably, however, they only measured accuracy; they did not use signal-detection measures to directly assess perceptual sensitivity. Furthermore, they did not account for identification processes that might affect accuracy, or for criterion differences in such identification processes.

First, in Experiment 1, we verified the robustness of previous b-CFS findings and the suitability of our stimuli and setup, by conducting a direct replication of Stein et al.’s⁹ second experiment, a b-CFS study that demonstrated faster RTs to upright than to inverted faces, and was the first to demonstrate faster responses to direct than to averted gaze faces. In Experiment 2 (pre-registered at https://aspredicted.org/qj4wf.pdf), we used our new method to acquire signal-detection measures for both face location (left/right side of the screen) and identification (direct/averted gaze) at each of seven exposure durations, ranging from 500 to 5695 ms. If face orientation and gaze direction modulate perceptual sensitivity under suppression, as suggested by previous b-CFS findings, we should find greater sensitivity for direct-gaze versus averted-gaze faces and for upright versus inverted faces. Data and materials are publicly available on the Open Science Framework (https://osf.io/uepgt/).

Experiment 1

Experiment 1 was an exact replication of Experiment 2 reported by Stein et al.⁹, testing whether upright faces break through suppression faster than inverted faces, whether faces making eye contact break through suppression faster than averted gaze faces, and whether the factors of face orientation and gaze direction interact. We used the same Matlab scripts and stimuli as the original study but employed a larger sample (32 instead of 14 participants). The original study found a processing advantage for faces making eye contact. Additionally, upright faces broke through suppression faster than inverted faces. There was no interaction between these two effects.

Methods

Participants

Thirty-two University of Edinburgh students (21 female; 4 left-handed; mean age 23.8, SD_age = 4.1) provided informed consent and were paid £3 for participation. All had normal or corrected-to-normal vision and reported no history of neurological or psychiatric disorders. Both experiments reported here were approved by the University of Edinburgh Psychology Research Ethics Committee. All participants provided informed consent in accordance with the Declaration of Helsinki.

Originally, Stein et al.⁹ employed only 14 participants in each of their experiments. Because concerns have been expressed regarding power limitations in psychophysical studies^57,58, we more than doubled the number of participants to 32. We note that our sample size, which was ~ 2.3 times larger than the original, provided 99% power to detect an effect of size \({\eta {\mathrm{p}}}{2}=0.4\), which corresponds to the effect size reported in another replication of Stein et al.’s⁹ experiment by Akechi et al.²⁸; although publication bias and other factors may inflate effect sizes in the published record, a power estimate of 0.99 indicates that our sample size provided sufficient power to detect even a much smaller effect.

For copyright reasons, the illustrative faces shown in Figs. 1 and 3 were not among those used in either experiment. The model in these figures provided informed consent and permission to publish her face images and did not participate in either experiment. The stimuli used in the experiments and datasets analysed can be found in the Open Science Framework (OSF) repository: https://osf.io/uepgt/.

Stimuli

In both experiments reported here, stimuli were presented on a 19-inch CRT monitor in a dimly lit room. The monitor was connected to a computer running Matlab 2014a (Mathworks, Inc) using the Cogent 2000 toolbox (http://www.vislab.ucl.ac.uk/cogent.php). A chin rest and mirror stereoscope were positioned 57 cm from the monitor, with a vertical divider splitting the display so each eye only saw half of the screen.

Figure 1a illustrates the display and stimuli. Two red frames containing binocular alignment contours (random noise pixels around the inside border of the frame; squares measuring 10.6° × 10.6°, width 0.8° × 0.8°) appeared side by side on the screen, supporting binocular alignment through the mirror stereoscope such that only a single frame was perceived. A red fixation dot (0.7° × 0.7°) was presented in the centre of each frame. Rectangular multicoloured Mondrian-like masks differing in size, rotation, and position were flashed at 10 Hz to one eye while a face stimulus was presented to the other eye.

We employed the same twelve face stimuli used by Stein et al.⁹ and other b-CFS studies^28,31,55; these face stimuli were previously used^49,59,60,61 and perceived gaze direction was validated⁶¹ in earlier non-CFS gaze direction studies. In these images, the face is laterally averted either to the left or to the right, and the eyes are also averted to either the left or right, giving the impression of either averted or direct gaze, depending on whether gaze direction matches head direction. For instance, from the viewer’s perspective, in the case of faces averted to the right, eyes directed to the left were classified as direct gaze and eyes directed to the right were classified as averted gaze, which ensures that eye symmetry is the same in direct-gaze faces and indirect-gaze faces (see Senju and Hasegawa⁴⁹, for details of stimulus creation). Stimuli were cropped to oval shapes (3.3° × 4.6°), equalised for contrast and luminance and the edges were blurred into the grey background. Inverted faces were created by turning upright faces 180°.

Procedure

Participants were instructed to focus on the fixation dot with both eyes open, avoid blinking as much as possible, and not look elsewhere.

The procedure on each trial is shown in Fig. 1a. The red frames and binocular alignment contours were continuously present during the experiment. At the start of each trial, fixation dots were presented binocularly for 1 s. Then, one eye was shown the CFS mask—Mondrian-like patterns changing at 10 Hz—and a face was introduced to the other eye. The face’s contrast ramped up linearly from 0 to 100% over 1 s and then remained constant until either the participant responded, or 10 s passed, at which point the face, fixation dots, and mask disappeared during a 1.5 s intertrial interval (ITI). The eye receiving the mask was the same throughout the study but varied randomly between participants.

Face stimuli were presented either to the left or to the right of the fixation dot (horizontal fixation-to-centre distance 2.7°; Fig. 1b) at a random vertical position (maximum centre-to-horizontal-midline distance 2.1°). Participants were instructed to press the left or right arrow key on the keyboard to indicate the location of the face as soon as they became aware of its presence (Fig. 1c).

The experiment consisted of 192 randomly ordered trials, which were evenly distributed over the two crossed experimental factors (gaze direction and face inversion), with the face appearing on each side of the visual field on half of the trials. A 5-min break was given halfway through the experiment. There were no practice trials. Half of the participants viewed a version of the faces with the head averted to the left and the other half viewed a version of the faces with the head averted to the right. The full experiment took around 20 min to complete.

Analysis and results

We calculated mean RTs based on trials with correct responses (98.8% of all trials). Trials with no response were treated as missing data (< 5% for each participant). A preliminary mixed analysis of variance (ANOVA) on mean RTs, which included the factors of gaze direction (direct or averted) and face orientation (upright or inverted) as within-subject factors, and head direction (left or right) as a between-subjects factor, showed no main effect of head direction nor any interaction of this factor with any other factor (all relevant p-values > 0.1), so this factor was collapsed in further analyses.

To examine whether upright and direct-gaze faces elicit faster breakthrough reports than inverted and averted-gaze faces, as Stein et al.⁹ found, we entered RTs into a 2 (gaze direction: direct, averted) × 2 (orientation: upright, inverted) repeated-measures ANOVA (Fig. 2). Critically, there was a main effect of gaze direction, with faster RTs for direct-gaze faces (M = 3016.8 [SD = 962.9]) than for averted-gaze faces (M = 3436 [1020.3]), \(\left(F{(1, 31)}=54.14,p<.001,\eta{\mathrm{ p}}{2}=0.636\right).\) There was also a main effect of orientation, with faster RTs for upright faces (M = 2996.1 [950.5]) than for inverted faces (M = 3456.8 [1030.9]), \(\left(F{(1, 31)}=75.72,p<.001,\eta{\mathrm{ p}}{2}=0.710\right)\). Finally, and similar to Stein et al.⁹, although the difference between direct and averted gaze was numerically larger for upright (M_difference = 535 ms [900]) than for inverted faces (M_difference = 303.4 ms [1008.8]), and each of these simple effects was significant \(\left({t}_{upright}\left(61.5\right)= -6.35, p<.001, d=-1.122; {t}_{inverted}\left(61.5\right)=-3.59, p=.004, d=-0.635\right)\), they did not differ significantly from each other, as indicated by the finding that the interaction between gaze direction and face orientation did not reach significance \(\left(F{(1, 31)}=3.49, p=.071, \mathrm{\eta p}{2}=0.101\right)\). These results replicate all aspects of Stein et al.’s findings⁹.

We further examined the non-significant interaction with a Bayes factor analysis, using JASP⁶² (version 0.12.2), in which we ran a Bayesian repeated-measures ANOVA with a standard r-scale prior of width 0.5 (for fixed effects), with a Cauchy prior scale parameter for covariates of 0.354 (this default prior was used in all subsequent Bayes factor analyses). This provided a value of \(BF{01}=1.338\) for the interaction, indicating that given the data, the null is only slightly more likely than the alternative hypothesis model (anecdotal evidence). Thus, these data are not strongly informative as to whether or not the eye-contact effect is smaller for inverted faces.

Discussion

Experiment 1 replicated Stein et al.’s⁹ findings: direct-gaze faces broke through CFS faster than averted-gaze faces (eye-contact effect), and upright faces broke suppression faster than inverted faces (face-inversion effect; see also^{6,7,8,34,35,63,64,65}. As in the original study, we did not find a significant interaction between these effects, which may have implications for the possible mechanisms underlying the eye-contact effect in b-CFS; we return to this issue in the General Discussion. However, while faster breakthrough times have previously been interpreted as suggesting prioritised unconscious processing, such findings do not rule out the potential influence of differential criteria. Therefore, we next examined whether eye contact and face inversion affect perceptual sensitivity when the duration of exposure to the stimulus is controlled.

Experiment 2

To measure perceptual sensitivity independently of decision criteria, we used the same stimuli as in the b-CFS paradigm but presented them, on each trial, for one of seven fixed durations. After each stimulus presentation, participants judged both where on the screen the masked stimulus was shown (left or right; location task), and what that stimulus was (direct or averted gaze; identification task), with no speed pressure. We used signal detection analyses to assess sensitivity to both stimulus location and stimulus identity, as well as bias/criterion measures for making these judgments.