Facial identity matching ability varies widely, ranging from prosopagnosic individuals (who exhibit profound impairments in face cognition/processing) to so-called super-recognizers (SRs), possessing exceptional capacities. Yet, despite the often consequential nature of face matching decisions—such as identity verification in security critical settings—ability assessments tendentially rely on simple performance metrics on a handful of heterogeneously related subprocesses, or in some cases only a single measured subprocess. Unfortunately, methodologies of this ilk leave contributions of stimulus information to observed variations in ability largely un(der)specified. Moreover, they are inadequate for addressing the qualitative or quantitative nature of differences between SRs’ abilities and those of the general population. Here, therefore, we sought to investigate individual differences—among SRs identified using a novel conservative diagnostic framework, and neurotypical controls—by systematically varying retinal availability, bandwidth, and orientation of faces’ spatial frequency content in two face matching experiments. Psychophysical evaluations of these parameters’ contributions to ability reveal that SRs more consistently exploit the same spatial frequency information, rather than suggesting qualitatively different profiles between control observers and SRs. These findings stress the importance of optimizing procedures for SR identification, for example by including measures quantifying the consistency of individuals’ behavior.
Ability in matching images of unfamiliar face identities varies widely between neurotypical adults1,2,3,4,5. At the upper end of this spectrum are super-recognizers (SRs), originally reported by Russell et al.6 as possessing exceptional facial identity processing capabilities across a range of sub-processes, including face matching, recognition, and identification. Presently, though, there is limited empirical evidence concerning the factors underlying SRs’ extreme abilities7,8.
To date, two aspects ubiquitously present in previous studies have hindered genuine progress in understanding the mechanisms underlying SRs’ superior abilities: (i) the use of inconsistent or inappropriate diagnostic criteria for assessing such skills; (ii) absence of studies systematically varying the information conveyed by face stimuli themselves. Here, with the goal of addressing these aspects, we studied a sample of SRs drawn from a large cohort of 70 individuals identified with a recently proposed conservative “diagnostic framework” for SR identification8. Two experiments systematically varied the spatial frequency content of face stimuli (in terms of its retinal availability and orientation), thus providing a greater level of granularity than previous studies. Moreover, they provide sufficient granularity at the group level to investigate whether previously reported performance differences between SRs and control observers are of a qualitative or quantitative nature. Operationally, this can be framed as asking whether, relative to neurotypical control observers, SRs exploit the same type of information more efficiently, or utilize markedly different types of information.
Determining superior face processing abilities
In previous work, identification of individuals as SRs has involved application of simple performance criteria on laboratory-based tests of face identity processing (which we refer to as face cognition or processing in the present work). As mentioned, Russell et al.’s6 seminal study reported a small number of SRs who achieved superior performance on three tests of face cognition measuring perceptual matching, old/new recognition and identification. Naturally, such extreme cases are, per definitionem, a relative rarity; so, more thorough multifactorial examination of their performance was critical to the success of their approach. In particular, identifying such individuals among the general population would require scrutiny of the factors that reliably distinguish them.
Yet, in the intervening time, departing from this multi-test procedure, the majority of studies since Russell et al.6 have adopted less stringent criteria. Assessments of ability that qualify observers as SRs have for the most part hinged on only a single test and/or process measured: Some have identified SRs based on performance surpassing two standard deviations above the mean of “typical” controls9,10,11 on the long (challenging) version of the Cambridge Face Memory Test (CFMT+; Russell et al.6). Others identified SRs based on a high score on “any [test reported] in the super-recognizer literature”12.
Such a single-criterion approach to SR diagnosis is problematic for at least two reasons. First, some tests that have been used as a criterion for SR identification are either inappropriate, or often insufficiently sensitive due to relatively low task difficulty. This applies to the short (72-item) version of the CFMT, which was developed for the diagnosis of developmental prosopagnosia13, and extends to the CFPT14 and GFMT15—tests for which individuals with developmental prosopagnosia can even achieve normal performance accuracy6,16,17. Tests intended to be sensitive to individual performance differences at the lower end of the performance spectrum lack sensitivity to individual differences at the upper end of the performance spectrum and vice versa, owing to ceiling and floor effects, respectively17. Since even the originators of the CFMT+ and CFPT conclude that SRs “are about as good (at face recognition) as many developmental prosopagnosics are bad” (p. 256), it is important that any criteria for SR assessment reflect individual differences at least as large at the upper end of the performance spectrum. Given that the above tests cannot do so alone, multidimensional assessments would be theoretically better suited to this end8.
Second, it is well-established that at the individual level, superior performance on one face cognition task/subprocess generalizes poorly, if at all, to others4,5,9,18,19,20. Consequently, performance on any single task/sub-process should not be considered as sufficient evidence of processing superiority to warrant generalizing across tasks/sub-processes. Therefore, minimally, multiple measures should be applied for SR identification, to ensure that any evidence of superior performance within an individual is consistent for various tasks/subprocesses, and across the most challenging of available tests. That is, if SRs are considered to display superior face processing abilities, generally, then their assessment should ideally also encompass their consistency relative to typical control observers for any such task.
Of note, practical concerns potentially used to justify simplified and varied diagnostic assessments (e.g., time constraints, desire to recruit larger SR samples, etc.) would inherently trade off against replicability. Heterogeneous criteria between studies produce concomitantly heterogeneous data. Meanwhile, relatively lax criteria and/or insufficiently difficult tests would lead to increased rates of individuals incorrectly identified as SRs (i.e., false positives), which would in turn contribute to the heterogeneity reported in the SR literature7,8. Therefore, reduction of false positives—and by extension the advancement of our understanding of SRs’ abilities—requires a more conservative and widely agreed upon framework for their diagnosis7,8,21. Moreover, these criteria should be evaluated within the context of cross-task consistency within observers, for a variety of tasks/processes4,5,7, in order to determine whether SRs indeed live up to their namesake in distinguishing themselves from the general population.
In the sense that diagnoses are thought to reflect identification based on agreed upon criteria with high sensitivity, which are lacking across studies of SRs, some researchers may deem it premature for any publication to report diagnosis of SRs. Others may consider the term “diagnosis” appropriate exclusively in seeking to identify the presence of an impairment or disease. Of note, numerous studies “diagnose” developmental prosopagnosia, despite the still lacking consensus regarding diagnostic criteria. Additionally, even criteria that have once been generally accepted change historically, while the diagnoses as such prevail. Therefore, acknowledging its origin, we conceptualize “diagnosis” as the systematic investigation conducted with the aim of increasing knowledge by distinction and delineation from other possible phenomena or manifestations. Previous work (including our own) has thus collectively attempted to assess performance for the sake of identifying or diagnosing SRs.
Psychophysical descriptions of observers’ information exploitation
Diagnostic issues aside, previous empirical studies of SRs would generally benefit from addressing the two issues raised earlier: a general lack of systematic variations of any parameters across the full-face stimulus, and consideration of only simple performance measures rather than stimulus information-dependent performance profiles. Consequently, these studies are left hard-pressed to answer the currently open question of whether control observers and SRs differ qualitatively, i.e. due to exploitation of distinct types of information22,23,24, or quantitatively, e.g. due to differential exploitation of the same local, featural information available to control observers25.
In general, observers’ performance on tasks employing systematic variations of stimulus parameters inherently derives from the information content they can exploit26. For instance, increased viewing distance shifts the bandwidth of spatial frequency information conveyed to the retina27,28 (Fig. 1a), until it falls beyond the retina’s contrast sensitivity, at which point even personally familiar face identification performance deteriorates to chance29. Crucially, as the spatial frequency content shifts in this way, the identity information (or signal) it conveys is gradually lost following a psychometric function. Psychophysically, the maximum slope of the psychometric function directly relates to the standard deviation of the probability density function: the greater the cumulative distribution’s slope, the less variability is present in its corresponding probability distribution30. Operationally, those with steeper psychometric slopes more consistently exploit and utilise the available stimulus information in their judgments. Thus, psychophysical assessment of SRs provides a potentially useful window into the consistency with which they exploit stimulus information relative to others, beyond their simple ability (typically measured by the proportion of correct responses alone).
Tardif et al.25 recently provided the first experimental psychophysical evidence of differential spatial frequency information exploitation in face stimuli between SRs and control observers. They reported that SRs’ superior performance was related to exploitation of the same local facial feature information as control observers25. However, this could also be explained by more consistent (i.e., less variable) information exploitation among SRs. Moreover, due to the ‘Bubbles’ response classification method23 used by Tardif et al.25, (which involves provision of only piecemeal local featural information) usage of spatial frequency information as would be available to the retina under normal viewing conditions28 was not probed. Previous studies with typical observers have shown familiarity-dependent enhancement of selectivity for retinally available horizontal, but not vertical, spatial frequency information across the whole face stimulus (Fig. 1b) for upright identity matching31. However, this effect has never been assessed in SRs.
In Tardif et al.’25 study, both groups of observers had equal access to the information preserved after applying Bubbles filters to stimuli, and were thus limited to use the same information content in making their judgments on each trial. Since SRs generated more hits under the same information constraints, they had to have exploited the same information that controls did, but more effectively. The authors reported that SRs’ performance could be predicted from the same information as controls’, without loss of generality in their model. This further suggests that the difference between the two groups was purely quantitative—SRs exploited the same information as controls while generating more hits. Since adding the same information produced greater gains in performance among SRs, this implies that they would have had steeper psychometric slopes (had those been measured), and would have meant that they were more consistent, according to our definition.
Therefore, the present study aims to psychometrically assess individual differences in the consistency of full-face spatial frequency and orientation information exploitation, across two experiments. In the first, we test whether consistent exploitation of retinally available spatial frequency information (across simulated viewing distances)27 during unfamiliar face matching differentiates SRs from a sample of control observers. In the second, testing the same two groups of observers, we assess whether such (potential) differences extend to increased consistency in horizontal versus vertical spatial frequency structure as shown in previous research31,32,33,34,35.
Acknowledging the expressed need to “investigate systematic, functional relationships as they are manifested at the individual participant level…us[ing] methods that are optimized to identify relationships of this kind”36 we provide a systematic description of SRs as a unique group of observers. We sought to determine whether these SRs, previously identified using novel conservative criteria8, are more sensitive to retinally available spatial frequency information across orientations, as well as whether they more consistently use information diagnostic for facial identity, exploiting it to enhance their judgments of facial identity.
All procedures and protocols were approved by the University of Fribourg’s Ethics Committee (approval number 473) and conducted in accordance with both their guidelines, as well as those set forth in the Declaration of Helsinki. All participants were healthy volunteers, provided informed written consent and were not financially compensated for their participation.
Thirty-one students in the Department of Psychology and the University of Fribourg (21 Females; mean age of 27 years; age range of 20–47 years) participated in two experimental sessions in exchange for course credit. SRs who participated in this study were recruited from a larger cohort of recently reported super-recognizer individuals8. In short, they were identified as disposing of exceptional face processing ability if they achieved high performance (relative to previously tested normative samples)4,5 on at least two of three challenging tests of face cognition: the YBT (long form)4,5,37, the FICST4,5,38, and the CFMT+6. As previous work has shown that high performance on one test does not predict high performance on another test4,5, the requirement of surpassing typical performance on at least two of the three extremely challenging tests represents a more conservative diagnostic approach relative to most previous selection methods8. No participants in either group were personally familiar with any of the identities of the face stimuli presented throughout the experimental sessions, and all were eye disease-free, with normal or corrected-to-normal visual acuity, as assessed by qCSF39,40 (see Supplemental Material, Table S1).
Psychophysical experiments were designed and implemented using the PsychToolbox41,42,43 in the Matlab (Mathworks Inc.) programming environment. Two different monitors were beta calibrated and employed for stimulus presentation throughout experimental sessions: a VIEWPixx/3D—22005C monitor (1980 × 1020 pixel resolution; viewing distance: 68.5 cm; 120 Hz refresh rate; average luminance of ca. 110 cd/m2), and a Samsung SyncMaster 2233RZ—3D LCD monitor (1680 × 1050 pixel resolution; viewing distance: 62.2 cm; 60 Hz refresh rate; average luminance of ca. 110 cd/m2)44.
In both experiments, participants were instructed to complete a 10AFC sequential face matching task as employed previously31. On each trial throughout the task, they had to match one initially presented left- or right-facing filtered target identity to one of 10 unfiltered front-facing potential probe identities. The beginning of each trial was signaled by the appearance of a central fixation point (1 s duration), whose disappearance (for 100 ms) in turn signaled the appearance of the target identity (250 ms duration). This sequence was immediately followed by the aforementioned array of 10 potential probe identities, in two horizontal rows of 5 (with males above and females below fixation). Participants were given unlimited time to select one probe that (best) matched the target’s identity, by hovering the mouse cursor over it and pressing the left mouse button to confirm their selection. They were further instructed to prioritize responding correctly over quickly. A matching identity was always presented among the 10 potential probes (i.e., the experiment did not include target-absent trials), while four other same-sex and five opposite-sex identities were randomly drawn from the remaining face identities in the stimulus set to make up the array of potential probes. A fully counterbalanced and randomized sequence of images was presented to each participant. Each session of Experiment 1 consisted of 480 trials, lasting roughly 45 min total, including five mandatory break intervals (of participant-determined duration) to alleviate fatigue. All procedures in Experiment 2 were identical, with the exception that the fully counterbalanced and randomized sequence of stimuli comprised 350 trials and 4 break intervals.
Twenty-five male and 25 female faces, sampled from a larger database of images taken from students at Université catholique de Louvain (Belgium) who provided written consent for their image usage, served as the experimental stimulus set45. For each of these face models, three images were taken from different viewpoints, then converted to grayscale and cropped to exclude external identifying features (e.g. ears, hair) (Adobe Photoshop, Adobe Inc.), centered in a 512 × 512 pixels array delimiting all stimuli to 5.4° of visual angle. One front-facing, one left-rotated and one right-rotated viewpoint were generated per face identity, yielding 60 unique images. Each was then isotropically reduced in scale to each of 7 different pixel sizes (8, 16, 32, 64 and 512 pixels across) as a manipulation of viewing distance, using the Laplacian pyramid model27,28. This simulates the retinally available information at each size-correspondent viewing distance by effectively removing high spatial frequency content at larger viewing distances (see Fig. 1a). All stimuli were finally adjusted to a root-mean-squared contrast of 0.2.
Participants were shown the same 10 upright male and 10 upright female identities as used previously31, including 3 different viewpoint images of each identity, as in Experiment 1. Here, however, rather than reducing the images’ scale as in Experiment 1, we systematically varied the amount of horizontal and vertical information present in these images. All were band-passed using sharp-edged orientation filters centered on 0° (for horizontal information filtering) or 90° (for vertical information filtering) (Fig. 1b). Then, all were filtered at each of 12 bandwidths in steps of 15° (15°, 30°, 45°, 60°, 75°, 90°, 105°, 120°, 135°, 150°, 165°, 180°), such that 180° filters passed all horizontal and vertical information (unfiltered stimuli), while 90° was the largest bandwidth at which horizontal and vertical information was independently isolated31.
Logistic regression analysis
In order to assess individual and group level differences in psychophysical performance, we employed mixed effects (fixed and random) logistic regression models using Matlab’s (Mathworks) nlmefit function. These were constructed incorporating the experimental parameters (e.g. bandwidth, image size) of interest, separately for each experiment, as predictors of correct responses. As a first step, we selected parameters for the base model (e.g. not accounting for group) based on minimization of the Akaike Information Criteria (AIC) that evaluate model parsimony (quality of fit, given the number of parameters required to achieve it). We subsequently compared the Bayesian Information Criteria (BIC) between optimally parameterized models hierarchically, as an estimate of the Bayes factor, in order to determine the effect of each predictor’s inclusion on the relative goodness of fit of the competing models. In general, this means that we evaluated each predictor’s effect by fitting one model that included it, and compared it with the model excluding it, but that was otherwise identical. Each model fitted a logistic function to the averaged data as an approximation of the psychometric function, and each parameter φijk could include a fixed effect for the jth group as well as a random effect for the ith observer:
where k = 1 corresponded to the scaling factor, k = 2 to the mean, and k = 3 to the slope of the psychometric function.
In order to assess contributions of the latent grouping variable, each parameter was subdivided to take on nonzero values only when fitting the ith jth group by letting φi1k = 1 for SRs, and 0 otherwise, and similarly letting φi2k = 1 for control observers and 0 otherwise. Thus, the full model’s logistic regression function had the general form:
Similar parameterization was employed for filter orientation in Experiment 2 (though at the subject level, rather than group level).
In order to assess whether SRs exploit available SF information differently from control observers, as well as to generate psychometric functional fits to individual observers’ data (see Supplemental Material, Fig. S1, left panels for model fits to individual observers’ data), we fitted three mixed effects logistic regression models to the trial-level data (see Table 1 for descriptions of models and their comparative fits to the data). Here, image size and group were taken as predictors of correct responses. In general, we evaluated each effect by fitting one model including it, and compared it with a model excluding it, but that was otherwise identical.
The first model included regression coefficients for fixed and random effects at the subject level, though not the group level, and thus assessed the effect of retinally available SF information on performance at the subject level, irrespective of group. Including both a slope parameter and a scaling parameter (Model 2) resulted in a more parsimonious model fit than including only a scaling parameter (Model 1) (ΔAIC21 = − 913.9), suggesting that individual differences in performance were best accounted for by changes in slope (BF21 = 2.49 × 1023; BF k > 10). The third model therefore included parameters for the fixed and random effects of group, for both slope and scale. Model comparison revealed that inclusion of group-level effects in Model 3 more accurately predicted performance than model 2 (BF32 = 739.5), which was blind to group effects.
Taken together, these results suggest that SRs’ performance differs from controls. To determine whether there were any consistent differences in slope between groups, we performed an independent-samples post-hoc t-test on the slope parameters obtained for both groups, and found that SRs had significantly steeper psychometric slopes than controls (t(40) = 2.10, p = 0.04).
Overall, the fact that models tended to reduce the AIC when adding parameters suggests that we were not ‘overfitting’ the data with their addition. That is, the addition of parameters tended to produce better fits, and not at the expense of providing parsimonious explanations of the data.
In order to assess whether psychophysical performance (the probability of correctly responding as a function of filter bandwidth) varied as a function of slope or scaling factor, we fitted two mixed effects logistic regression models to observers’ trial-level data, as in Experiment 1 (see Table 2 for descriptions of models and their comparative fits to the data). Model 2, accounting for both slope and scaling factor, predicted correct responding more parsimoniously than Model 1 (ΔAIC21 = − 7.4, BF21 = 7.0), which accounted only for the scaling factor. We therefore retained predictors for slope and scaling factors in subsequent models assessing horizontal selectivity and group effects.
The third model assessed the effect of horizontal selectivity (Fig. 2, center & right panels), including orientation as a predictor of performance, by comparison against Model 2 (Fig. 2, left panel). These two models were identical except in the respect that Model 3 accounted for filter orientation. Model 3 provided a better fit to the data than model 2 (ΔAIC32 = − 631.5, BF32 = 7.39 × 10134), suggesting that filter orientation is an important predictor of psychophysical performance, irrespective of group, decisively supporting the effect of horizontal selectivity (see Supplemental Material, Fig. S2 for a visualization more directly replicating previous work31).
Next, we fitted a model (Model 5, c.f. Table 2) that additionally included group-level coefficients in order to distinguish SRs from controls (Fig. 3). The only difference between Models 3 and 5 was the addition of group-level coefficients in Model 5, such that any change in fit between models would have had to arise from between-group differences in horizontal selectivity beyond any variance already explained by the fixed and random effects of horizontal selectivity in the overall sample. Comparison of the models’ fits suggests that Model 3 provided a better overall fit to the data (Fig. 3), decisively suggesting that SRs’ horizontal selectivity is no different from that of control observers (ΔAIC32 = − 3.6, BF43 = 2.647 × 10–4).
Although comparison of models 5 and 3 suggested that SRs did not exhibit differential horizontal selectivity relative to controls, they could show generally increased psychometric slopes relative to control observers, irrespective of orientation. Therefore, we fitted a model that accounted only for group (c.f. Model 4, Table 2), and not for filter orientation (Fig. 4). Comparison of Model 5 (accounting for group and filter bandwidth) against Model 4 (accounting only for bandwidth) revealed that the grouping factor strongly improved the model fit (ΔAIC65 = − 633.8; BF54 = 1.3 × 10133). A post-hoc t-test of fitted slopes between groups (from model 5) reveals that SRs’ were steeper than control observers’ (t(40) = 3.62, p < 0.0001). Overall, these results suggest that while SRs do not display enhanced horizontal selectivity relative to controls (i.e. evidence favors the null hypothesis that there is no effect of filter orientation), they do tend to exhibit more consistent performance overall (see Supplemental Material, Fig. S1, right panels, for model fits to individual observers’ data).
As was the case in Experiment 1, addition of predictors and parameters to our models tended to reduce the AIC. So, their addition seems not only to produce tighter fits to the data, but to do so without sacrificing parsimony of the model.
In this study we sought to address the currently open question of whether SRs—individuals with exceptional face processing skills6,7,8—exploit spatial frequency information in a quantitatively or qualitatively different manner than typical individuals. To this end, we systematically varied retinal availability, bandwidth, and orientation of faces’ spatial frequency content across two face matching experiments. Together the results reveal that SRs utilize the same information, albeit more consistently than control observers.
The results of Experiment 1 demonstrate that SRs show higher consistency (as evidenced by steeper psychometric slopes) in face matching than controls (Fig. 2), over a similar range of Laplacian Pyramid-simulated viewing distances27,28,29. This suggests that—among the factors examined in the current experiments—SRs more readily incorporate retinally available spatial frequency information in their matching judgments of facial identity. However, absolute identity discrimination thresholds were low enough in both groups of observers to imply that they likely approached the face detection threshold, rendering qualitative differences in the range of spatial frequency information utilization between groups difficult to observe.
Meanwhile, in our second experiment, we find horizontal selectivity commensurate with previous studies31,32,34,35, as suggested by improved model fits when adding orientation as a factor (Figs. 2, 3). More specifically, horizontal spatial frequency information is more diagnostic for facial identity matching than vertical, among both groups of observers (Supplemental Material, Figs. S1, S2). SRs’ performance was also quite different from control observers’—across filter bandwidths and orientations—suggesting that they have generally higher sensitivity to spatial frequency content when matching facial identity. This is indicated by the group-level differences in psychometric slopes, irrespective of orientation (Fig. 2). And critically, in terms of our initial hypotheses, it implies that SRs do not display differential horizontal selectivity from controls. We also find no evidence to suggest that SRs employ qualitatively different spatial frequency bands than control observers (see Supplemental Material, Fig. S1 for individual observers’ profiles), though we do not specifically find evidence against this possibility either. So, while we do find evidence of horizontal selectivity as reported by previous research, it is neither particularly predictive of superior ability for face identity matching, nor does it reveal any qualitative differences between groups.
On quantitative versus qualitative differences in face processing
To our knowledge, the only previous psychophysical study assessing the qualitative versus quantitative nature of SRs’ distinguishing characteristics25 used the ‘Bubbles’ response classification method23,46. This method applies random spatial frequency bands to random facial locations in a piecemeal fashion, over a large number of trials, reconstructing the classification image based solely on the contribution of un/available local information. These particular stimuli substantially distort the distribution of spatial frequency information available under normal viewing conditions28. Despite using manipulations of information applied uniformly across the full face, our results nonetheless agree with those previously reported25, in that both SRs and typical observers utilized roughly the same range of spatial frequency information. However, here we add explicitly that SRs exploit this information more consistently.
This agrees with the individual differences in psychometric functional fits that we find across experiments (c.f. Figs. 3, 5). While both groups (control observers and SRs) show comparable matching thresholds for retinally available spatial frequency content across spatial scales, as well as for horizontally and vertically oriented spatial frequency content, we find that consistency of retinally available spatial frequency exploitation (i.e., psychometric slope) predicts SR status (Fig. 5), irrespective of the spatial frequency content’s spatial orientation (Fig. 4).
Overall, we surmise that within-observer consistency constitutes an important aspect of performance, in that it is predictive of superior face processing abilities (Figs. 3, 4, 5), but alone is likely not sufficient for SR identification. The improved model fits upon inclusion of scaling factors generally suggest that some portion of psychophysical performance is not accounted for with slope (and thus consistency) alone. For instance, SRs seem to reach greater asymptotic performance levels than controls, on average, which warrants an investigation into selectivity for information at different spatial scales. Potentially, application of similar psychophysical techniques over a wider array of stimulus properties, spanning the gamut of potential differences in information exploitation between typical observers and SRs, could provide more and clearer diagnostic criteria, and further, possibly reveal qualitative differences unavailable to scrutiny here.
Consistency as a potentially novel means for super-recognizer classification
Given the paucity of psychophysical studies of SRs’ abilities, we propose that future studies adopt such systematic approaches to provide a more detailed description of their unique skill(s). While we find consistency to be predictive of superior ability (as also reported recently in typical cohorts47), we would advocate deploying measures of consistency in addition to more traditional performance metrics and not as a substitute for them. That is, they ought to be integrated to extend the granularity of current8 multifactorial frameworks of SR diagnostic tests, as originally proposed6. Moreover, systematic variations of different kinds of stimulus information hold the potential to reveal other important factors that contribute to overall ability. Recognizing that such procedures are comparatively more cumbersome, we consider their inclusion a crucial step in addressing the stated needs for appropriately stringent diagnostic tools and reduction of false positive SR identification7,48,49,50.
Over two psychophysical experiments, we tested proficiency for matching of unfamiliar facial identity in a group of 11 conservatively identified SRs and 31 neurotypical observers. Following parametric manipulations of spatial frequency content and orientation across the full face, we find relatively quantitative differences between SRs and control observers. However, it should be emphasized that we cannot rule out the potential for qualitative differences beyond the scope of the parametric stimulus manipulations used here. But, based on the currently available psychophysical findings, SRs should not be described as a special class51, but rather as experts among experts. Rather than accepting this notion, we emphasize that further work using complementary methods is needed to ascertain whether SRs process facial information in just quantitatively, or also qualitatively, different ways than typical psychophysical observers. Indeed, it is plausible that qualitative differences—similar to those reported for developmental prosopagnosia52,53,54—may be found in larger cohorts of carefully described SR cases identified using the same multi-factorial diagnostic framework8.
Accompanying data can be downloaded from the Open Science Framework (https://osf.io/5h3rj/).
Fox, E. & Bindemann, M. Individual differences in visual acuity and face matching ability. Appl. Cogn. Psychol. 34, 1034–1046. https://doi.org/10.1002/acp.3682 (2020).
Estudillo, A. J. & Bindemann, M. Generalization across view in face memory and face matching. i-Perception 5, 589–601. https://doi.org/10.1068/i0669 (2014).
Fysh, M. C. & Bindemann, M. Forensic face matching: A review. In Face Processing Systems, Disorders and Cultural Differences (ed. Bindemann, M.) 1–20 (Nova Science Publishing, 2017).
Stacchi, L., Huguenin-Elie, E., Caldara, R. & Ramon, M. Normative data for two challenging tests of face matching under ecological conditions. Cogn. Res. Princ. Implic. https://doi.org/10.1186/s41235-019-0205-0 (2020).
Fysh, M. C., Stacchi, L. & Ramon, M. Differences between and within individuals, and subprocesses of face cognition: Implications for theory, research and personnel selection. R. Soc. Open Sci. https://doi.org/10.1098/rsos.200233 (2020).
Russell, R., Duchaine, B. & Nakayama, K. Super-recognizers: People with extraordinary face recognition ability. Psychon. Bull. Rev. 16, 252–257. https://doi.org/10.3758/PBR.16.2.252 (2009).
Ramon, M., Bobak, A. K. & White, D. Super-recognizers: From the lab to the world and back again. Br. J. Psychol. 110, 461–479. https://doi.org/10.1111/bjop.12368 (2019).
Ramon, M. Super-recognizers—A novel diagnostic framework, 40 cases, and guidelines for future work. Neuropsychologia. https://doi.org/10.1016/j.neuropsychologia.2021.107809 (2021).
Bobak, A. K., Hancock, P. J. B. & Bate, S. Super-recognizers in action: Evidence from face-matching and face memory tasks. Appl. Cogn. Psychol. 91, 81–91. https://doi.org/10.1002/acp.3170 (2016).
Bobak, A. K., Pampoulov, P. & Bate, S. Detecting superior face recognition skills in a large sample of young British adults. Front. Psychol. 7, 1–11. https://doi.org/10.3389/fpsyg.2016.01378 (2016).
Bobak, A. K., Dowsett, A. J. & Bate, S. Solving the border control problem: Evidence of enhanced face matching in individuals with extraordinary face recognition skills. PLoS ONE 11, 1–13. https://doi.org/10.1371/journal.pone.0148148 (2016).
Phillips, P. J. et al. Face recognition accuracy of forensic examiners, superrecognizers, and face recognition algorithms. PNAS 115, 6171–6176. https://doi.org/10.1073/pnas.1721355115 (2018).
Duchaine, B. & Nakayama, K. Dissociations of face and object recognition in developmental prosopagnosia. J. Cogn. Neurosci. 17, 249–261. https://doi.org/10.1162/0898929053124857 (2005).
Duchaine, B. & Nakayama, K. The Cambridge face memory test: Results for neurologically intact individuals and an investigation of its validity using inverted face stimuli and prosopagnosic participants. Neuropsychologia 44, 576–585. https://doi.org/10.1016/j.neuropsychologia.2005.07.001 (2006).
Burton, M. A., White, D. & McNeill, A. The Glasgow face matching test. Behav. Res. Methods 42, 286–291 (2010).
White, D., Rivolta, D., Burton, A. M., Al-Janabi, S. & Palermo, R. Face matching impairment in developmental prosopagnosia. Q. J. Exp. Psychol. 70, 287–297. https://doi.org/10.1080/17470218.2016.1173076 (2017).
Fysh, M.C. & Ramon, M. (submitted). It’s about time. Testing tests of normal face matching with prosopagnosia.
Bate, S. et al. Applied screening tests for the detection of superior face recognition. Cogn. Res. Princ. Implic. 3, 19 (2018).
Lander, K., Bruce, V. & Bindemann, M. Use-inspired basic research on individual differences in face identification: Implications for criminal investigation and security. Cogn. Res. 3, 1–13. https://doi.org/10.1186/s41235-018-0115-6 (2018).
Fysh, M. C. Individual differences in the detection, matching and memory of faces. Cogn. Res. Princ. Implic. https://doi.org/10.1186/s41235-018-0111-x (2018).
de Haas, B. What’s a super-recogniser? Neuropsychologia. https://doi.org/10.1016/j.neuropsychologia.2021.107805 (2021).
Schyns, P. G. Diagnostic recognition: Task constraints, object information, and their interactions. Cognition 67, 147–179. https://doi.org/10.1016/S0010-0277(98)00016-X (1998).
Gosselin, F. & Schyns, P. G. Bubbles: A technique to reveal the use of information in recognition tasks. Vis. Res. 41, 2261–2271. https://doi.org/10.1016/S0042-6989(01)00097-9 (2001).
Goffaux, V., Jemel, B., Jacques, C., Rossion, B. & Schyns, P. G. ERP evidence for task modulations on face perceptual processing at different spatial scales. Cogn. Sci. 27, 313–325. https://doi.org/10.1016/S0364-0213(03)00002-8 (2003).
Tardif, J. et al. Use of face information varies systematically from developmental prosopagnosics to super-recognizers. Psychol. Sci. 30, 300–308. https://doi.org/10.1177/0956797618811338 (2019).
Murray, R. F., Bennett, P. J. & Sekuler, A. B. Classification images predict absolute efficiency. J. Vis. 5, 139–149. https://doi.org/10.1167/5.2.5 (2005).
Burt, P. J. & Adelson, E. H. The Laplacian pyramid as a compact image code. Fund. Pap. Wavelet Theory 4, 28–36. https://doi.org/10.1515/9781400827268.28 (1983).
Sowden, P. T. & Schyns, P. G. Channel surfing in the visual brain. Trends Cogn. Sci. 10, 538–545. https://doi.org/10.1016/j.tics.2006.10.007 (2006).
Ramon, M. Perception of global facial geometry is modulated through experience. PeerJ 3, e850. https://doi.org/10.7717/peerj.850 (2015).
Klein, S. A. Measuring, estimating, and understanding the psychometric function: A commentary. Percept. Psychophys. 63, 1421–1455. https://doi.org/10.3758/BF03194552 (2001).
Pachai, M. V., Sekuler, A. B., Bennett, P. J., Schyns, P. G. & Ramon, M. Personal familiarity enhances sensitivity to horizontal structure during processing of face identity. J. Vis. 17, 1–11. https://doi.org/10.1167/17.6.5 (2017).
Goffaux, V., Poncin, A. & Schiltz, C. Selectivity of face perception to horizontal information over lifespan (from 6 to 74 year old). PLoS ONE 10, 1–17. https://doi.org/10.1371/journal.pone.0138812 (2015).
Taubert, J., Goffaux, V., Van Belle, G., Vanduffel, W. & Vogels, R. The impact of orientation filtering on face-selective neurons in monkey inferior temporal cortex. Sci. Rep. 6, 1–12 (2016).
Goffaux, V. & Greenwood, J. A. The orientation selectivity of face identification. Sci. Rep. 6, 34204. https://doi.org/10.1038/super-recognizerep34204 (2016).
Jacobs, C., Petras, K., Moors, P. & Goffaux, V. Contrast versus identity encoding in the face image follow distinct orientation selectivity profiles. PLoS ONE 15, 1–22. https://doi.org/10.1371/journal.pone.0229185 (2020).
Smith, P. L. & Little, D. R. Small is beautiful: In defense of the small-N design. Psychon. Bull. Rev. 25, 2083–2101. https://doi.org/10.3758/s13423-018-1451-8 (2018).
Bruck, M., Cavanagh, P. & Ceci, S. J. Fortysomething: Recognizing faces at one’s 25th reunion. Mem. Cognit. 19, 221–228. https://doi.org/10.3758/BF03211146 (1991).
Jenkins, R., White, D., Van Montfort, X. & Mike Burton, A. Variability in photos of the same face. Cognition 121, 313–323. https://doi.org/10.1016/j.cognition.2011.08.001 (2011).
Lesmes, L. A., Lu, Z. L., Baek, J. & Albright, T. D. Bayesian adaptive estimation of the contrast sensitivity function: The quick Cspatial frequency method. J. Vis. 10, 1–21. https://doi.org/10.1167/10.3.17 (2010).
Canare, D., Ni, R. & Lu, T. An open-source implementation of the Quick Cspatial frequency method. J. Vis. 19, 86b (2019).
Brainard, D. H. The psychophysics toolbox. Spat. Vis. 10, 433–436 (1997).
Pelli, D. G. The VideoToolbox software for visual psychophysics: Transpatial frequencyorming numbers into movies. Spat. Vis. 10, 437–442 (1997).
Kleiner, M., Brainard, D. & Pelli, D. What’s new in Psychtoolbox-3? Perception 36, 1–16 (2007).
Wang, P. & Nikolić, D. An LCD monitor with sufficiently precise timing for research in vision. Front. Hum. Neurosci. 5, 1–10. https://doi.org/10.3389/fnhum.2011.00085 (2011).
Laguesse, R., Dormal, G., Biervoye, A., Kuefner, D. & Rossion, B. Extensive visual training in adulthood significantly reduces the face inversion effect. J. Vis. 12, 1–13. https://doi.org/10.1167/12.10.14 (2012).
Haig, N. D. How faces differ—A new comparative technique. Perception 14, 601–615. https://doi.org/10.1068/p140601 (1985).
Kramer, R. S., Jones, A. L. & Gous, G. Individual differences in face and voice matching abilities: The relationship between accuracy and consistency. Appl. Cogn. Psychol. 35, 192–202. https://doi.org/10.1002/acp.3754 (2021).
Devue, C., Wride, A. & Grimshaw, G. M. New insights on real-world human face recognition. J. Exp. Psychol. Gen. 148, 994–1007. https://doi.org/10.1037/xge0000493 (2019).
Moreton, R., Pike, G. & Havard, C. A task- and role-based perspective on super-recognizers: Commentary on ‘Super-recognisers: From the lab to the world and back again’. Br. J. Psychol. 110, 486–488. https://doi.org/10.1111/bjop.12394 (2019).
Robertson, D. J. & Bindemann, M. Consolidation, wider reflection, and policy: Response to ‘super-recognisers: From the lab to the world and back again’. Br. J. Psychol. 110, 489–491. https://doi.org/10.1111/bjop.12393 (2019).
Young, A. W. & Noyes, E. We need to talk about super-recognizers Invited commentary on: Ramon, M., Bobak, A. K., & White., D Super-recognizers: From the lab to the world and back again. Br. J. Psychol. 110, 492–494. https://doi.org/10.1111/bjop.12395 (2019).
Duchaine, B. C., Yovel, G., Butterworth, E. J. & Nakayama, K. Prosopagnosia as an impairment to face-specific mechanisms: Elimination of the alternative hypotheses in a developmental case. Cogn. Neuropsychol. 23, 714–747. https://doi.org/10.1080/02643290500441296 (2006).
Bobak, A. K., Parris, B. A., Gregory, N. J., Bennetts, R. J. & Bate, S. Eye-movement strategies in developmental prosopagnosia and “super” face recognition. Q. J. Exp. Psychol. 70, 201–217. https://doi.org/10.1080/17470218.2016.1161059 (2017).
Behrmann, M., Scherf, K. S. & Avidan, G. Neural mechanisms of face perception, their emergence over development, and their breakdown. Wiley Interdiscipl. Rev. Cogn. Sci. 7, 247–263. https://doi.org/10.1002/wcs.1388 (2016).
We thank two anonymous reviewers for their comments on a previous version of the manuscript, in particular Reviewer 2 for their insightful suggestions which motivated improvements to the statistical analyses reported in its final form. MR is supported by a Swiss National Science Foundation PRIMA (Promoting Women in Academia) Grant (PR00P1_179872).
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Nador, J.D., Zoia, M., Pachai, M.V. et al. Psychophysical profiles in super-recognizers. Sci Rep 11, 13184 (2021). https://doi.org/10.1038/s41598-021-92549-6