The Flashed Face Distortion Effect Does Not Depend on Face-Specific Mechanisms

Balas, Benjamin; Pearson, Hannah

doi:10.1038/s41598-018-37991-9

Download PDF

Article
Open access
Published: 07 February 2019

The Flashed Face Distortion Effect Does Not Depend on Face-Specific Mechanisms

Benjamin Balas^1,2 &
Hannah Pearson¹

Scientific Reports volume 9, Article number: 1612 (2019) Cite this article

20k Accesses
3 Citations
43 Altmetric
Metrics details

Subjects

Abstract

When normal faces are rapidly presented in the visual periphery, they are perceived as grotesque and distorted. This phenomenon, “The flashed-face distortion effect” (FFDE) is a powerful illusion that may reveal important properties of how faces are coded in peripheral vision. Despite the strength of the illusion (and its popularity), there has been almost no follow-up work to examine what governs the strength of the illusion or to develop a clear account of its phenomenology. Presently, our goal was to address this by manipulating aspects of facial appearance and spatial/temporal properties of the flashed-face stimulus to determine what factors modulate the illusion’s strength. In three experiments, we investigated the extent to which local contrast (operationalized by the presence or absence of makeup), image eccentricity, image size, face inversion, and presentation rate of images within the sequence each contributed to the strength of the FFDE. We found that some of these factors (eccentricity and presentation rate) mattered a great deal, while others (makeup, face inversion and image size) made little contribution to the strength of the FFDE. We discuss the implications of these results for a mechanistic account of the FFDE, and suggest several avenues for future research based on this compelling visual illusion.

Gaze direction and face orientation modulate perceptual sensitivity to faces under interocular suppression

Article Open access 10 May 2022

Prioritization of emotional faces is not driven by emotional content

Article Open access 11 January 2023

The effect of processing partial information in dynamic face perception

Article Open access 29 April 2024

Introduction

The “Flashed Face Distortion Effect” (or FFDE) refers to a striking visual illusion in which faces presented sequentially in peripheral vision begin to look increasingly grotesque after just a few faces have been presented¹. Most observers report large shape distortions, such that faces appear to have strange proportions, as well as distortions of color appearance that lead to faces that look too red, purple, or green. Despite the popularity of the illusion, there is as yet no clear consensus regarding its basis. In the original article describing the illusion, the authors suggest a number of candidate mechanisms that may drive the illusory percept, but to our knowledge there has been very little work to explore the nature of this powerful visual effect. Our main goal, therefore, is to explore the FFDE in more detail, with an emphasis on understanding how the strength of the illusion depends on aspects of face appearance (local contrast of facial features, face inversion) and spatial and temporal factors of stimulus presentation (presentation rate, image size and eccentricity). We continue by discussing the possible relationship between the FFDE and two potential contributing mechanisms: Face adaptation and the limited fidelity of peripheral vision. In each case, we highlight features of these mechanisms that suggest specific manipulations of spatial and temporal parameters that may affect the strength of the FFDE. In the experiments that follow, we implement these manipulations to determine which factors do affect the perceived distortion of faces presented in these sequences, and which factors appear to have a lesser impact.

Because distortion of facial appearance is the key outcome of the FFDE, the literature describing face distortion aftereffects² is a natural place to start trying to understand the mechanisms supporting the phenomenon. Could the FFDE simply be a special case of a face distortion aftereffect (FDAE)? If so, we would expect the strength of the effect to be modulated by spatial and temporal parameters in the same manner as FDAEs. These aftereffects are typically obtained by asking observers to adapt to an image of a face that has been altered to have unrealistic feature proportions (e.g. an elongated nose), warped so that all facial features are compressed or expanded, or had facial features moved within the external contour into unrealistic locations (eyes placed very high in the head, e.g.). After viewing such a stimulus for several seconds or longer, observers typically report that unaltered faces appear distorted in an opposing manner, such that adapting to a compressed face will make a typical face look expanded. These aftereffects have been an important means of studying the underlying nature of population coding for facial appearance^3,4 and also have yielded insights regarding the category structure of face representation in terms of race, age, and species^5,6.

There are some challenges to an explanatory account of the FFDE that is based solely on known properties of FDAEs. First, it is unclear from the face adaptation literature whether or not we should expect distortion effects as large as we see in the FFDE following the short presentation times per face that are used in demonstrations of the effect. Usually observers are asked to adapt to a distorted face for at least several seconds before aftereffects are measured, though non-retinotopic figural face aftereffects are measurable in short-duration faces after one second of adaptation⁷. However, the FFDE does not actually incorporate any faces that are truly distorted – instead, the effect must depend on differences in natural appearance between faces in the sequence. It thus seems difficult to adopt this as a primary explanation of the effect. Second, the authors note in the original report that inserting a short blank period between images appears to abolish or at least substantially reduce the effect, which to our knowledge is also not easily related to the adaptation/aftereffect literature. Finally, it also seems problematic that the effect persists (and for some observers appears to increase) as the sequence continues. Observers are capable of estimating the average appearance of a sequence of faces over some interval⁸, and to the extent that prolonged viewing of the sequence is similar to adaptation to the ensemble representation of face appearance, prolonged viewing should lead to adaptation to a face that is fairly average (which is to say, not distorted at all) as the appearance differences between individuals are averaged out. As a result, while distortion aftereffects may be a good place to start in trying to understand the illusion, we also do not think they are a place where we can stop.

The second feature of the FFDE that we chose to consider is the presentation of the images in peripheral vision. Could it be the case that the limited fidelity of peripheral vision plays a key role in the effect? Peripheral vision differs from central vision in a number of ways, and perhaps some of these differences offer a means of understanding the illusory percept, or help address some of the issues facing an account of the effect based on face distortion aftereffects.

We chose to focus on three key differences between peripheral vision and central vision to understand how peripheral viewing may result in the distortions observed in the FFDE. First, visual acuity is substantially poorer in peripheral vision⁹, and both color and contrast sensitivity are worse than in central vision¹⁰. Second, peripheral vision is also subject to visual crowding, in which the recognition of a target in peripheral vision can be severely impaired by the presence of flanking items within a critical region that scales with target eccentricity^11,12. Finally, peripheral vision is more sensitive to flicker than central vision, due to the faster responses of rods relative to cones¹³.

Given that the FFDE is not readily observed when the face sequence is viewed centrally, can these characteristics of peripheral vision help us understand why the effect happens? Regarding visual acuity, face images will necessarily look blurrier, etc. as a result of peripheral viewing. It’s not obvious that this should lead to the FFDE, but we also know of no results that make it clear we should rule this possibility out. With regard to visual crowding, the holistic processing of faces makes it difficult to conceive of crowding effects in terms of specific targets and flankers¹⁴. After all, faces comprising an FFDE sequence are presented in isolation, thus lacking the flanking items that are part of standard crowding tasks. However, recent models of visual crowding that suggest peripheral vision is characterized by a lower-fidelity texture-like representation of visual structure^15,16,17 could offer one means by which crowding could contribute to the FFDE. Specifically, in these models of crowding, the hypothesis is that peripheral vision entails a description of stimulus appearance in terms of texture statistics, which only partially constrain the set of possible stimuli that may be present. This ambiguity may lead to some of the distortions we observe in the FFDE as the rapid presentation of each image leaves little time for more than a weak inference regarding the appearance of the stimuli, and the application of ambiguous texture statistics may mean that some distorted faces are equally good candidates for the appearance of each stimulus. Both of these properties of peripheral vision (reduced acuity and increased visual crowding) could therefore contribute to the FFDE, and if they do, manipulating the extent to which images in FFDE sequences are subject to these factors could affect the strength of the effect.

Considering the manner in which face distortion aftereffects and peripheral viewing of FFDE sequences may contribute to the effect leads us to several simple manipulations of the basic FFDE paradigm that have the potential to modulate the strength of the effect. Our goal across the experiments presented here was to implement a set of these manipulations to examine which factors matter and which do not, hopefully leading to a clearer account of why the effect happens in the first place. Specifically, we developed experiments based on the following predictions: (1) If image blur or visual crowding plays a key role in the FFDE, then the size and eccentricity of the faces in peripheral vision should affect the strength of the FFDE. Specifically, increased eccentricity should increase the strength of the illusion and increased size should weaken it. (2) If the temporal properties of face adaptation and aftereffects are an important contributor to the FFDE, than changing the presentation rate of faces within a sequence should also affect the FFDE, and (3) If the reduced contrast sensitivity of peripheral vision is relevant to the FFDE, then manipulating local contrast should also affect the strength of the FFDE. Specifically, increased contrast should weaken the strength of the effect. Finally, (4) If face-specific mechanisms are an important contributor to the FFDE, we would also predict that inverting face images may weaken the effect.

In the experiments that follow, we investigate each of these predictions in turn to better understand the necessary conditions to observe the FFDE. In Experiment 1, we use make-up as a tool for manipulating the local contrast of face images, and examine how this affects the strength of the illusion. We also use inverted face images in this experiment to examine how specific the phenomenon is to upright faces. In Experiment 2, we examine the influence of temporal factors on the FFDE by manipulating the presentation rate of faces within the sequence. Finally, in Experiment 3, we examine how the eccentricity and size of face images in peripheral vision affect the FFDE. Considered together, our results offer novel insights on the nature of the Flashed Face Distortion Effect and also hint at some important avenues for face recognition research based on the striking phenomenology of this illusion.

Experiment 1

In our first experiment, we investigated the impact of two aspects of facial appearance, local contrast and face orientation, on the strength of the FFDE. Given the reduced contrast sensitivity of peripheral vision, we hypothesized that increasing local contrast in face images might reduce the strength of the illusion. To manipulate the contrast of face images, we chose to present observers with stimuli depicting faces with and without cosmetics. Typical applications of cosmetic products like lipstick, eye liner, and mascara tend to increase the contrast between the eye and mouth regions and the remainder of the face¹⁸. These contrast relationships are particularly important for a number of face recognition judgments, including attractiveness judgments and sex categorization¹⁹. Moreover, manipulating local constrast via the presence or absence of makeup has both good ecological validity and also saves us from potential artifacts that arise from globally manipulating image contrast by altering the intensity histogram of a starting image. Besides manipulating contrast, we also hypothesized that if the FFDE depends on face-specific processing, face inversion might also reduce the strength of the effect. Inverting face stimuli tends to reduce performance in a wide range of recognition tasks²⁰ and to the extent that the FFDE depends on a contribution from face-specific mechanisms, interfering with those mechanisms by presenting face stimuli upside-down should weaken the distortion effect.

Methods

Participants

We recruited a total of 28 participants (16 female) from the NDSU Undergraduate Psychology Study Pool. All participants were between the ages of 18–24 and self-reported normal or corrected-to-normal vision. None of the participants were familiar with the Flashed Face Distortion Effect prior to participating in the study.

Stimuli

To execute our manipulation of local contrast, we chose to use stimuli drawn from the VMU Face Database^21,22, which is comprised of full-color images drawn from YouTube makeup tutorial videos. We selected 96 pairs of images such that each pair depicted a unique female face without makeup and with makeup applied. The original images were 130 × 150 pixels in size and most images included at least some portion of the external face contour. All images were aligned with respect to the eyes, which may enhance the distortion effect¹.

Procedure

After obtaining informed consent from each participant, the experimenter first explained the nature of the Flashed Face Distortion Effect by showing the participant a version of the illusion that is available on YouTube (http://www.youtube.com/watch?v=wM6IGNhPujE). After confirming that the participant did experience the illusion while watching the video and observed less distortion when the faces were fixated, the experimenter explained that the participant would be shown a series of short image sequences designed to elicit the effect and should rate the strength of the illusion of a 1–7 Likert scale, with “7” indicating very strong distortion and “1” indicating little or no distortion of the faces. Participants were instructed to maintain fixation on a small cross drawn at the center of the display during the task and also told that they could take breaks as necessary by simply withholding their response to the prior stimulus until they were ready to continue.

All stimulus sequences were presented to participants on a 2560 × 1440 pixel LCD display with a refresh rate of 100 Hz. Participants were seated approximately 40 cm from this display, though viewing distance varied somewhat across participants. On each trial, participants were presented with two sequences of faces presented simultaneously to the left and right of the fixation cross. Each sequence was comprised of a random set of 8 face images drawn on that trial from the larger set of 96 stimuli. These images were each presented 3 times for a total of 24 images in the entire sequence presented on each trial. The order of images within these sequences was shuffled independently for the left and right sequence presented on each trial to ensure that the left and right stimuli were not identical. Each image was displayed for approximately 150 ms with no blank period between consecutive images. Each image was scaled to subtend approximately 4 degrees of visual angle onscreen, and each sequence was presented at an eccentricity of approximately 6–8 degrees of visual angle (Fig. 1). Participants were given unlimited time to rate the perceived distortion of faces in the sequences presented on each trial.

The orientation of the images within sequences (upright or inverted) and the presence or absence of makeup was pseudo-randomized across trials within the design. Participants completed 96 trials per condition for a total of 384 trials in the full testing session. All stimulus presentation and response collection routines were controlled by custom software written using the Psychophysics Toolbox 3 extensions for Matlab^23,24,25.

In this experiment (and all that follow), all procedures used in all experiments were approved by the NDSU IRB, in accordance with the guidelines established in the Declaration of Helsinki. Informed consent was obtained from all participants in this and all the following experiments.

Results

For each participant, we calculated the average rating across all trials in each condition (Fig. 2). We analyzed these values using a Bayesian Repeated-Measures ANOVA implemented in JASP²⁶. Table 1 includes the model comparison data obtained from considering models with each main effect included singly, both main effects included, and finally, both main effects and the interaction term included.

Table 1 Model comparison output for the results of Experiment 1.

Full size table

This analysis reveals that there is little evidence in support of a main effect of makeup on perceived distortion. Indeed, the observed Bayes Factor of ~0.20 indicates that there is 5 times more evidence for the null hypothesis than for the alternative hypothesis in this case. With regard to the main effect of orientation, our results are inconclusive: A Bayes Factor of 1.48 indicates weak evidence in favor of the alternative hypothesis (in this case, an effect of orientation on distortion), but this amount of evidence is not typically taken as sufficient to conclude that there is a meaningful effect²⁷. Finally, to consider the evidence in support of an interaction between the two factors, we examine the ratio between the Bayes Factor for the full model and the Bayes Factor for the model that includes both main effects²⁸. This yields a value of approximately 0.3, which indicates that there is substantial evidence in favor of the null hypothesis in this case.

Discussion

The results of Experiment 1 suggest that neither the orientation of the faces, nor the presence or absence of makeup impacted the magnitude of the FFDE in this experiment. Regarding makeup, which is our proxy for contrast in this experiment, the results suggest strong support for the null hypothesis (no effect of makeup). We conclude, therefore, that the FFDE probably does not result from the reduced contrast sensitivity of peripheral vision. If this were a key contributor to the FFDE, we would expect that increasing the contrast of our stimuli should have reduced the effect. What about face-specific mechanisms and their contribution to the FFDE? Our results regarding the effect of orientation on perceived distortion are not as conclusive. The Bayes Factor that we obtained for this main effect is in the range of “anecdotal” or “weak” evidence²⁷ in favor of the alternative hypothesis (a main effect of orientation on perceived distortion), which means that we cannot draw a strong conclusion regarding either the presence or absence of an effect. We suggest that this means that any effects of inversion on the FFDE are likely rather small, as an inconclusive Bayes Factor often indicates a lack of sufficient power to accept or reject the null hypothesis (no effect of the target manipulation). A more conclusive result may therefore require a larger sample, but presently we conjecture that the potentially small effect size associated with face inversion in this task implies that face-specific mechanisms at least are unlikely to make a substantial contribution to the FFDE, which further suggests that the observed distortion depends on more general properties of spatial vision in the periphery. We continue by examining temporal properties of the FFDE in Experiment 2.

Experiment 2

In our second experiment, we chose to examine how the duration of images presented within an FFDE sequence impacted the strength of the illusion. Characterizing the effects of image duration on the perceived distortion of faces in FFDE sequences is an important way to examine the effect in the context of known properties of face adaptation (and associated aftereffects), which we suggest is an important candidate mechanism that may contribute to the phenomenon. The strength of face aftereffects depends on both the duration of the adapting stimulus and the duration of the test stimulus. Identity aftereffects grow logarithmically stronger with adaptation duration, and decay exponentially as test duration increases²⁹. Distortion aftereffects, which are most relevant to the FFDE, have similar properties⁷. What do these temporal properties imply for the FFDE if adaptation is indeed a contributor to the effect? In terms of face adaptation, a typical FFDE image sequence is somewhat like an ongoing experiment with adaptation and test images being presented in rapid succession, with no interval between them. Increasing the image duration should thus have both positive effects (more distortion) based on the increased duration of each image as an adapting stimulus, but also may have negative effects (less distortion) based on the increased duration of each image as a test stimulus. Despite these opposing effects, we hypothesized that for relatively short image presentation times (on the order of 100–200 ms) the effect of increased adaptation time should be stronger than the effect of increased test image time, leading to more perceived distortion as image duration increases.