The neural representation of human versus nonhuman bipeds and quadrupeds

Papeo, Liuba; Wurm, Moritz F.; Oosterhof, Nikolaas N.; Caramazza, Alfonso

doi:10.1038/s41598-017-14424-7

Download PDF

Article
Open access
Published: 25 October 2017

The neural representation of human versus nonhuman bipeds and quadrupeds

Liuba Papeo^1,2,
Moritz F. Wurm^1,3,
Nikolaas N. Oosterhof¹ &
…
Alfonso Caramazza^1,3

Scientific Reports volume 7, Article number: 14040 (2017) Cite this article

2707 Accesses
11 Citations
1 Altmetric
Metrics details

Subjects

Abstract

How do humans recognize humans among other creatures? Recent studies suggest that a preference for conspecifics may emerge already in perceptual processing, in regions such as the right posterior superior temporal sulcus (pSTS), implicated in visual perception of biological motion. In the current functional MRI study, participants viewed point-light displays of human and nonhuman creatures moving in their typical bipedal (man and chicken) or quadrupedal mode (crawling-baby and cat). Stronger activity for man and chicken versus baby and cat was found in the right pSTS responsive to biological motion. The novel effect of pedalism suggests that, if right pSTS contributes to recognizing of conspecifics, it does so by detecting perceptual features (e.g. bipedal motion) that reliably correlate with their appearance. A searchlight multivariate pattern analysis could decode humans and nonhumans across pedalism in the left pSTS and bilateral posterior cingulate cortex. This result implies a categorical human-nonhuman distinction, independent from within-category physical/perceptual variation. Thus, recognizing conspecifics involves visual classification based on perceptual features that most frequently co-occur with humans, such as bipedalism, and retrieval of information that determines category membership above and beyond visual appearance. The current findings show that these processes are at work in separate brain networks.

The language network as a natural kind within the broader landscape of the human brain

Article 12 April 2024

Evelina Fedorenko, Anna A. Ivanova & Tamar I. Regev

EEG is better left alone

Article Open access 09 February 2023

Arnaud Delorme

Perceptography unveils the causal contribution of inferior temporal cortex to visual perception

Article Open access 18 April 2024

Elia Shahbazi, Timothy Ma, … Arash Afraz

Introduction

A requirement for an animal that has evolved to live in a social environment is to recognize entities with which it can engage in social interactions. Humans are very efficient at recognizing conspecifics among other entities in the environment: very early in life, they can visually distinguish conspecifics from other objects^1,2 and, across the whole life span, they process conspecifics faster and better than other animals^3,4,5. How is this tuning for conspecifics realized?

A preference for humans over other biological creatures may emerge already in the visual perceptual system. In various current models, the visual system is tuned for the analysis of socially relevant information; and the recognition of conspecifics through detection of specific perceptual cues is the initial stage that triggers inferential processes culminating in the representation of others’ intentions and mental states^6,7,8,9,10. An important neural structure for that initial stage would be a region for biological motion perception, in the right posterior superior temporal sulcus (pSTS). Spatially disposed to integrate signals from category-specific regions, the right pSTS has been implicated in forming a representation of human motion and action¹¹, and it is regarded as the main entry into the social brain network^6,7,8,9,10.

Stronger activity in the right pSTS during processing of biological motion versus nonbiological motion has been largely documented in neuroimaging studies^12,13. The relevance of this activity for biological motion perception is supported by research with brain stimulation¹⁴ and brain-damaged patients¹⁵. In the current literature, the terms “biological motion” and “nonbiological motion” refer to displays of human motion and object motion, respectively, somehow implying that the visual system encodes all biological motions similarly. Only recently, research has focused on the relationship between human and nonhuman motion in the right pSTS¹⁶, following reports of behavioural differences in visual perception of moving humans versus nonhuman animals.

In particular, it has been shown that, under visual noise, individuals are better at detecting the gait of a man than the gait of a horse, depicted by point light displays (PLDs)¹⁷. The technique based on PLDs, pioneered by Johansson¹⁸, is used to depict biological movements by means of few isolated points of light in correspondence with the major joints of the moving body, without visual attributes such as color, form, and texture. The increased tolerance to noise in the case of PLDs of human gait has been taken as evidence of greater visual sensitivity to human (versus nonhuman) motion, by virtue of the human’s higher social value. In keeping with the behavioural difference, stronger right-pSTS activity has been reported during visual perception of PLDs of walking men versus walking dogs¹⁹.

In the current functional MRI (fMRI) study, we investigated whether the stronger response to humans versus other familiar biological entities such as dogs, in the biological-motion perception pSTS, reflects a genuine preference for the category of conspecifics or, rather, the effect of perceptual features that differ between a walking man and a walking dog (or a horse). A prominent perceptual difference between a walking man and a walking dog, which could strongly affect a region sensitive to visual motion, is the mode of motion: bipedal versus quadrupedal.

It has been shown that the posture in which a body appears – bipedal or quadrupedal – affects the way in which the body is processed. In particular, in a behavioural study, participants exhibited a body-part compatibility effect (faster hand response to vision of upper limbs and faster foot response to lower limbs) comparable to the effect elicited by human bodies, for nonhuman animals in bipedal but not in quadrupedal posture²⁰. This finding suggests that nonhuman animals are coded with reference to the human body schema, when presented in a bipedal posture (e.g., a bear standing on its posterior paws). Moreover, it has been shown that visual recognition of biological motion involves an initial stage where the structure of limbs is recovered; this information appears to be necessary to reconstruct, in a second stage, the whole agent^21,22. This processing of biological motion would be compatible with a recognition-by-parts model²¹, whereby limbs’ structure and motion are particularly diagnostic for visual classification, by virtue of their consistent co-occurrence with a specific category of objects (e.g., bipedalism for prototypical humans).

Following these observations, during fMRI, we presented PLDs of human and nonhuman characters moving in their typical bipedal (man and chicken) or quadrupedal mode (crawling-baby and cat). We also presented scrambled versions of those stimuli, in which dots moved in the same way as in the original PLDs, but their relative positions were varied, so that they formed abstract meaningless shapes. Participants were instructed to report whether two identical video-clips were shown in a row (repetition detection task). We examined whether pSTS responded more strongly to human (i.e., man and crawling-baby) than nonhuman characters (i.e., chicken and cat), irrespective of the within-category variation in the mode of motion. To identify other brain regions sensitive to a perceptually invariant human versus nonhuman distinction, we conducted additional multivariate whole brain analyses. Using multivariate pattern analysis (MVPA), we sought to reveal finer-grained neural distinctions captured by the local spatial patterns of activation associated with each character^23,24.

The results showed stronger right-pSTS activity for bipeds than for quadrupeds. This effect implies that the right pSTS does differentiate among different biological motions, and that this distinction is tied to perceptual features (bipedal versus quadrupedal mode of motion), as opposed to more abstract properties that may determine membership to the category of conspecifics. While no brain region showed stronger response to human than nonhuman stimuli (or vice versa), a distinction between the two categories, reflected in distinct neural patterns, was found in a network encompassing the bilateral posterior cingulate cortex (PCC) and the left pSTS.

Methods

Participants

Twenty healthy volunteers with normal or corrected-to-normal vision participated in the fMRI study (7 female, mean age 26.9 years ± 5.41 SD). The Human Research Ethics Committee of the University of Trento approved all procedures, in compliance with the Declaration of Helsinki. Informed consent was obtained from all participants, in written. All methods were performed in accordance with the relevant guidelines and regulations.

Stimuli and Procedures

Stimuli were 2-s video-clips of point-light displays (PLDs) of four characters walking in their typical manner (lateral view) – a walking man, a walking chicken, a crawling baby and a walking cat – and their scrambled versions (see Supplementary Information for examples of video-clips). Original PLDs were obtained from fully illuminated video-clips modified using Adobe After Effects video processing software (Adobe Systems Inc.), so that the final result was a pattern of 14 moving white dots on a black background. From the original video-clip showing walking cycles of a character for 8–10 s, three fragments of 1.5, 2, or 2.5 s were extracted starting from each of two different frames. The speed of the 1.5-s fragments and of the 2.5-s fragments was accelerated and decelerated, respectively, so that each lasted 2 s. Each video was flipped so that the character faced leftward in six videos and rightward in other six videos. Finally, each video was displayed in 4 different sizes (480 × 360, 393 × 294, 305 × 230, 218 × 164 pixels, corresponding to 100, 140, 180 and 220% of original video-clips, respectively) for a total of 48 versions of a video-clip (2 starting-frames × 3 speeds × 2 orientations × 4 sizes) for each of the four conditions. Scrambled versions of the videos were created, where dots moved in the same way as in the above PLDs but their configuration (i.e., relative positioning) was scrambled, so that the character was no longer recognizable. For each scrambled-PLD, 48 versions were created just like for the PLDs. The variation of speed, size, starting frame and orientation, and the inclusion of scrambled versions of PLDs served to control for the effect on neural activations, of low- and mid-level visual information respectively.

Stimuli were presented during six functional runs (6.64 min each). For every subject, the first three runs included scrambled-PLDs and the last three, PLDs. The fix order was chosen so that the presentation of scrambled-PLDs, introduced as abstract stimuli, was not influenced by prior exposure to PLDs (i.e., to prevent participants from attempting an interpretation of scrambled-PLDs). PLDs were introduced at the end of the third run. In a brief familiarization phase, three video-clips for each character were presented. In this phase, the experimenter named the character in each clip.

Each run comprised: a warm-up phase (10 s), 16 blocks of video-clips (four blocks per category, 12 s each) alternating with 15 baseline phases (12 s each), and a cool-down phase (16 s). In each block, four 2-s video-clips of the same condition were shown, separated by 1 s of fixation. Video-clips of the same condition could vary for speed, size, starting frame or orientation. Participants were instructed to report whether two video-clips, identical with respect to every dimension (i.e., speed, size, etc.), were shown in the same block of four video-clips (repetition detection task). They had to provide the response through yes-or-no keypress at the end of a block, when a question mark appeared on the screen. Stimuli were back-projected onto a screen by a liquid crystal projector (frame rate: 60 Hz; screen resolution: 1024 × 768 pixels). Participants viewed the stimuli binocularly through a mirror above the head coil. The screen was visible as a rectangular aperture of 17.8° × 13°. Presentation of PLDs and scrambled-PLDs did not exceed an aperture of 4.7° × 3.2°. At the end of the scanning section, we checked with debriefing questions, that the participants did not recognize anything/anyone in the first set of video-clips (scrambled-PLDs) and that they could easily recognize each of the four characters in the last set of video-clips (PLDs).

fMRI data acquisition

Functional images were acquired using echo planar T2-weighted scans (BioSpin MedSpec 4 T Bruker). A total of 1062 volumes of 33 anterior/posterior-commissure aligned slices were acquired over six runs (field of view = 192 × 192 mm; repetition time 2250 ms; echo time 30 ms; flip angle 76°; voxel resolution 3 × 3 × 3 mm³; gap 0.45 mm). An additional high-resolution T1-weighted MPRAGE sequence was acquired (176 slices; field of view 256 × 224 mm²; repetition time 2700 ms; inversion time 1020 ms, flip angle 7°; voxel resolution 1 × 1 × 1 mm³; generalized autocalibrating partially parallel acquisitions with acceleration factor of 2; duration 5.36 min).

fMRI data preprocessing and analyses

fMRI data pre-processing and statistical analysis were performed with BrainVoyagerQX 2.8.0 (BrainInnovation, Maastricht, NL) and MATLAB (Mathworks Inc., Natick, MA). The first four volumes were discarded prior to image processing. Pre-processing included: spatial realignment and motion correction of the images using the first volume of the first run of PLDs as reference; slice timing correction; removing of low-frequency drifts with a temporal high-pass filter (cutoff frequency 3 cycles per run); spatial smoothing with a Gaussian kernel of 8-mm FWHM for univariate analyses, and of 3-mm FWHM for multivariate analyses; and transformation of data into Talairach space. Subject-specific β-weights were derived through a general linear model (GLM) including the eight conditions (PLDs and scrambled-PLDs of human and nonhuman bipeds and quadrupeds) and the question as regressors of interest and six motion parameters as regressors of no interest. All regressors were convolved with a standard hemodynamic response function. Multivariate analysis (MVPA with cross-validated classification) was performed using the CoSMoMVPA Toolbox²⁵ through MATLAB. In the following analyses, statistical significance for second-level (group) analyses was determined using an initial voxelwise threshold of P < 0.001 and a cluster-level threshold of P = 0.05 using Monte Carlo simulations (10,000 iterations), as implemented in the CoSMoMVPA Toolbox.

fMRI univariate analyses

We used the whole-brain contrast PLDs > scrambled-PLDs to identify the bilateral pSTS region responsive to biological motion (right: 44, −68, 15; left: −49, −74, 24; P < 0.001, uncorrected; for other clusters from the same contrast, see Table 1). To define the individual region-of-interest (ROI) in pSTS, we performed the same contrast for each individual’s GLM, identified the peak activity (Ps ≤ 0.05, uncorrected) within 20 mm from the group-average activity-peak along the three axes (x, y and z), and created a sphere of 6-mm around the individual peak coordinates (see Supplementary Table S1 for individual peak coordinates). From the right and left ROIs of each participant, we extracted the β-weights relative to each condition (man, chicken, baby and cat) and subtracted the activity (β-weights) relative to the corresponding scrambled video-clips to minimize the differences across conditions due to lower- and mid-level visual features. Two separate repeated-measures ANOVAs were performed for the left and right pSTS-ROI, respectively, with the factors Pedalism (bipedal, quadrupedal) and Category (human, nonhuman).

Table 1 Location and significance of clusters showing stronger activity for PLDs relative to scrambled-PLDs. The results are cluster-based corrected using Monte Carlo simulations (10.000 iterations).

Full size table

MVPA with cross-validated classification

In order to increase sensitivity to the neural distinction between human and nonhuman stimuli, we used MVPA with linear support vector machine (SVM) classifiers²⁶ (LIBSVM, http://www.csie.ntu.edu.tw/~cjlin/libsvm), to obtain the neural patterns of response to each of the experimental stimuli, and test the similarity among them²⁷. In this analysis, for each participant, 12 multivariate β-patterns per condition (one for each of four blocks per run) were estimated in each 3-voxel-radius sphere centred in each voxel in the brain (searchlight MVPA). A classifier was trained and tested on individual subject data, using a leave-one-out cross-validation strategy. In the current design, a region that distinguishes between humans and nonhumans should show distinct neural patterns for all the pairs that instantiate that distinction: man versus cat, man versus chicken, baby versus cat and baby versus chicken. This was tested as follows. In the first cross-validation scheme, the classifier was trained on 22 patterns (11 from the man- and 11 for chicken-specific patterns) and tested on its accuracy at classifying two patterns from the baby- and the cat-condition, and vice versa (i.e., training on baby versus cat and test on man versus chicken). This procedure was performed in 24 iterations. Classification accuracies from all iterations were averaged to give the mean classification accuracy for each participant. The same approach was repeated in a second cross-validation scheme, where the classifier was trained on man versus cat and tested on baby versus chicken, and vice versa (i.e., training on baby versus chicken and test on man versus cat).

For each classification scheme, classification accuracies were summarized at the centre voxel (“summary voxel”) of each sphere of the searchlight, yielding maps of classification accuracy values. Individual maps were averaged to obtain mean accuracy maps. In addition, individual maps were entered into a one-sample t tests, separately for each scheme. A conjunction of the two statistical maps was computed based on the lower common t value per voxel to identify the brain structures with above-chance classification accuracy for both schemes²⁸.

To test whether putative effects could be due to low-level visual characteristics of the PLDs like single dot movement trajectories, speed, or accelerations, we performed the same classification procedure using the scrambled versions of the PLDs. For the classification of scrambled PLDs we computed mean accuracy and statistical maps. In addition, we subtracted the scrambled from the nonscrambled maps for each participant.

In sum, to emphasize perceptually invariant human-nonhuman distinction and reduce the decoding of merely perceptual information, we implemented two strategies. First, we increased the variability within each condition by varying speed, size, gait direction (leftward or rightward) and starting-frame. Second, we included two different instances for each relevant condition (i.e., baby and man for humans, and chicken and cat for nonhumans). The analysis on scrambled PLDs provided further control for the effect of human-nonhuman discrimination.

The dataset generated and analysed during the current study is available from the corresponding author on request.

Results

Behavioural data

Participants performed better with PLDs than with scrambled-PLDs (RTs: F(19) = 6.003, P = 0.024; accuracy: F(19) = 21.140, P < 0.001) although the accuracy was high for both types of stimuli (means: 86% PLDs; 81% scrambled-PLDs). Accuracy rates and RTs were comparable across the four critical conditions (man, chicken, baby and cat; all Ps ≥ 0.309).

fMRI data

Representation of humans and nonhumans in the biological-motion processing pSTS

The analysis of variance with factors Pedalism (bipedal and quadrupedal) and Category (human and nonhuman) identified an effect of Pedalism in the right pSTS, F(1,19) = 7.759, P = 0.012, with greater activity for bipeds than quadrupeds, but no effect of Category, F(1,19) = 0.746, P = 0.398, or interaction, F(1,19) = 0.470, P = 0.501 (Fig. 1). Activations in left pSTS showed no significant effect, although there was a trend for the effect of Pedalism, congruent with the effect found in right pSTS (Pedalism, F(1,19) = 3.190, P = 0.090; Category, F(1,19) = 0.128, P = 0.724; interaction, F(1,19) = 0.348, P = 0.562).

No stronger activity for humans over nonhumans was found in the ROI analysis focusing on pSTS or in the univariate whole-brain analysis. This motivated further investigation with multivariate analyses.

The human-nonhuman distinction throughout the brain

As reported above, we found no univariate effect of Category (i.e., stronger activity for humans than nonhuman, or vice versa) in the ROIs or across the whole brain. It is therefore possible that human and nonhuman representations are encoded in one and the same brain region, but in distinct subpopulations of that common region. Using searchlight MVPA, we aimed at identifying brain regions that are sensitive to the categorical human-nonhuman distinction, independently from the mode of biological motion that characterizes a character (bipedal or quadrupedal). With the two cross-validation schemes described above, we decoded humans vs. nonhumans across pedalism, reasoning that a brain region that differentiates between human and nonhumans should show significant above chance accuracies in both schemes. For both schemes, we found effects overlapping in left pSTS and in left and right PCC (Fig. 2 and Table 2). While only for the second scheme the effects in both regions survived the correction for multiple comparisons, classification accuracies were not statistically different between the two decoding schemes. In particular, a whole-brain contrast using the statistical maps obtained from the two decoding schemes showed no significant clusters in the left pSTS and the PCC, with a voxelwise threshold of P = 0.001, or more liberal thresholds of P = 0.005 and P = 0.01.

Table 2 Clusters for human-nonhuman distinction identified with the searchlight MVPA using a cross-validation approach for decoding scheme 1 and 2 (chance accuracy is 50%) and the conjunction of the maps obtained by the two schemes.

Full size table

The same analysis using the scrambled version of the four PLD types revealed that for both schemes classification accuracies in left pSTS and PCC were at chance and no other region revealed significant above chance accuracies (Figure S1 of Supplementary Information). This suggests that low-level features of PLDs, which were retained in the scrambled videos (i.e., movement trajectories or the speed/acceleration profiles of points), did not account for the decoding of humans versus nonhumans.

Discussion

How do humans recognize conspecifics? We interpreted this fundamental question as asking what it means to recognize humans. This process could involve processing of perceptual features characteristic of conspecifics and/or the retrieval of more abstract information that determines category membership (human or nonhuman). Here, we identified both processes, at work in separate brain networks, contributing to distinguishing between humans and nonhumans. In particular, we found evidence for detection of perceptual features in right pSTS and category membership assignment in left pSTS and bilateral PCC.

Reports of higher activity for humans, relative to familiar, nonhuman animals such as dogs, in the circuitry for biological-motion perception centred in the right pSTS¹⁸, have suggested that the human perceptual system is tuned to conspecifics possibly because of their highest social value^7,8,9,10. Stronger right pSTS activity for moving humans versus moving nonhuman characters has suggested sensitivity to differences within the large category of biological motion. However, that effect can reflect a tuning for the category of conspecifics, as well as a tuning for visual perceptual features that are characteristic of the typical conspecific, but not of biological entities such as dogs and horses. The current results speak to this question.

In keeping with previous findings, we found that the right pSTS, functionally localized during biological motion perception, showed stronger response to PLDs of a human adult (i.e., a man) than to cat-PLDs. However, the right-pSTS activity was also stronger for chicken-PLDs than for baby-PLDs and comparable for chicken-PLDs and man-PLDs (and for baby- and cat-PLDs). This pattern of response revealed the novel effect of pedalism, whereby the two instances of biological bipedal motion in the current design (walking man and walking chicken) elicited stronger right-pSTS activity relative to two instances of biological quadrupedal motion (crawling-baby and walking cat). This effect suggests representation of a perceptual feature, bipedalism, which is statistically associated with the prototypical human adult, as opposed to representation of the abstract category of humans.

Anecdotal reports of PLDs of bipedal animals being misclassified as humans^29,30 have suggested that bipedal motion is an important perceptual cue for recognizing conspecifics. Moreover, retrieving the structure of limbs has been described as a critical stage in the process that leads to recognition of a biological entity from its motion^22,30,31. This is compatible with a recognition-by-parts model²¹, involving an initial stage where the structure of limbs is recovered from the local motion signals, and a second stage where information about limbs is used to reconstruct the whole agent²². In this model, limbs are defined as features of intermediate complexity³², which are particularly informative for visual classification because of their systematic co-occurrence with a specific category of objects (e.g., bipedalism for prototypical humans). In this framework, our results show that the human visual system attaches specific significance to limbs, in that limbs define a feature (i.e., the mode of motion) that can guide visual classification within the large variety of biological entities. Differential activity for bipedal versus quadrupedal motion in the biological motion-perception right pSTS implies sensitivity to distinctions based on visual features.

The tuning for bipedal motion in the right pSTS could reflect the pressure of our phylogenetic and/or ontogenetic history on the perceptual systems for recognizing conspecifics, although, at this point in the brain network, there is not (yet) a representation of, or a preference for, the abstract category of conspecifics or the encoding of human-nonhuman distinction.

Such distinction is computed outside the biological motion-perception pSTS. Despite physical dissimilarity, the neural patterns for man- and baby-PLDs were consistently classified together, and as distinct from the patterns for chicken- and cat-PLDs, in a network encompassing the bilateral PCC and the left pSTS. Neural patterns in those regions distinguished between humans and nonhumans across all instances that represented that distinction in the current design (man versus cat, man versus chicken, baby versus cat and baby versus chicken). Moreover, given the demand to attend to low-level features of the stimuli (size, movement trajectories, or speed of PLDs) for repetition detection, the access to non-perceptual information that determines the human-nonhuman distinction occurred automatically.

The current findings add to previous fMRI research addressing the cortical representation of object-categories above and beyond distinctions based on physical/perceptual properties or on input modality. This research has shown that neural responses to objects in the PCC and posterior temporal cortex respect the supramodal criterion (i.e., they are independent from the modality through which objects are presented), suggesting abstract representation of semantic properties³³.

We shall highlight that the human-nonhuman distinction survived the cluster-level correction for one of the two classification schemes only. Thus, the current results should be treated with caution. However, at the very least, they emphasize neural representations of biological entities based on dimensions, independent of low-level features such as movement trajectories and speed/acceleration profiles (see analyses of scrambled stimuli), and independent of – and more abstract than – the type of biological motion (bipedal or quadrupedal) seen in the right pSTS.

The PCC has been proved selective to the representation of people knowledge^34,35,36; while the posterior middle/superior temporal cortex has been implicated in the representation of abstract properties of motion and action^37,38,39. The present study suggests that information in both regions contributes to determine membership in the category of conspecifics/humans above and beyond their physical appearance. Further research should address the specific information that determines the human-nonhuman distinction in left pSTS and PCC. That information could be highlighted more strongly in task settings in which processing of higher-level properties of the stimuli is less shallow than it was in the current study, where participants’ attention was driven toward low-level visual features of the stimuli.

Other brain sites may contribute to characterize humans. For example, the right dorsolateral prefrontal cortex has been shown to discriminate between the face of a man versus the face of a dog or a monkey⁴⁰. Different stimuli (static faces versus point-light displays) may emphasize different aspects of biological characters, yielding different results. A critical element may be the inclusion of the baby-condition in the current design, which was not present in the previous one. This novel condition may have been crucial to isolated neural information that assigns man and baby to the same taxonomic category, irrespective of perceptual differences or differences in abstract, psychological features between the prototypical human (the adult human) and every other creature, including the baby. For example, dimensions of mind perception, such as agency and the ability to think, plan and work toward goals, are assigned to a uniquely maximal degree to the adult human, and to various but lesser degrees to other biological creatures such as baby, cat, chicken, monkey and dog⁴¹.

In conclusion, recognition of conspecifics is achieved through the collective activity of separate brain networks to detect the perceptual/physical features that reliably correlate with the appearance of a conspecific (e.g., bipedal motion in the right pSTS), and to retrieve abstract information that determines species membership (i.e., human-nonhuman in the left pSTS and PCC).

References

Bonatti, L., Frot, E., Zangl, R. & Mehler, J. The human first hypothesis: identification of conspecifics and individuation of objects in the young infant. Cognit. Psychol. 44, 388–426 (2002).
Article PubMed Google Scholar
Pascalis, O., de Haan, M. & Nelson, C. A. Is face processing species-specific during the first year of life? Science 296, 1321–1323 (2002).
Article ADS CAS PubMed Google Scholar
New, J., Cosmides, L. & Tooby, J. Category-specific attention for animals reflects ancestral priorities, not expertise. Proc. Natl Acad. Sci. USA 104, 16598–16603 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Papeo, L., Stein, T. & Soto-Faraco, S. The two-body inversion effect. Psychol. Sci. 28, 369–379 (2017).
Article PubMed Google Scholar
Stein, T., Sterzer, P. & Peelen, M. V. Privileged detection of conspecifics: evidence from inversion effects during continuous flash suppression. Cognition 125, 64–79 (2012).
Article PubMed Google Scholar
Allison, T., Puce, A. & McCarthy, G. Social perception from visual cues: Role of the STS region. Trends Cogn. Sci. 4, 267–278 (2000).
Article CAS PubMed Google Scholar
Blakemore, S. J. et al. The detection of contingency and animacy from simple animations in the human brain. Cerebral Cortex 13, 837–844 (2003).
Article PubMed Google Scholar
Graziano, M. S. & Kastner, S. Human consciousness and its relationship to social neuroscience: A novel hypothesis. Cogn. Neurosci. 2, 98–113 (2011).
Article PubMed PubMed Central Google Scholar
Hein, G. & Knight, R. T. Superior temporal sulcus-It’s my area: or is it? J. Cogn. Neurosci. 20, 2125–2136 (2008).
Article PubMed Google Scholar
Pelphrey, K. A., Morris, J. P. & McCarthy, G. Grasping the intentions of others: the perceived intentionality of an action influences activity in the superior temporal sulcus during social perception. J. Cogn. Neurosci. 16, 1706–1716 (2004).
Article PubMed Google Scholar
Grosbras, M. H., Beaton, S. & Eickhoff, S. B. Brain regions involved in human movement perception: a quantitative voxel-based meta-analysis. Hum. Brain Mapp. 33, 431–54 (2012).
Article PubMed Google Scholar
Beauchamp, M. S., Lee, K. E., Haxby, J. V. & Martin, A. FMRI responses to video and point-light displays of moving humans and manipulable objects. J. Cogn. Neurosci. 15, 991–1001 (2003).
Article PubMed Google Scholar
Pyles, J. A., Garcia, J. O., Hoffman, D. D. & Grossman, E. D. Visual perception and neural correlates of novel ‘biological motion’. Vision res. 47, 2786–2797 (2007).
Article PubMed Google Scholar
Grossman, E. D., Battelli, L. & Pascual-Leone, A. Repetitive TMS over posterior STS disrupts perception of biological motion. Vision res. 45, 2847–2853 (2005).
Article PubMed Google Scholar
Saygin, A. P. Superior temporal and premotor brain areas necessary for biological motion perception. Brain 130, 2452–2461 (2007).
Article PubMed Google Scholar
Han, Z. et al. Distinct regions of right temporal cortex are associated with biological and human–agent motion: functional magnetic resonance imaging and neuropsychological evidence. J. Neurosci. 33, 15442–15453 (2013).
Article CAS PubMed Google Scholar
Pinto, J. & Shiffrar, M. The visual perception of human and animal motion in point-light displays. Soc. Neurosci. 4, 332–346 (2009).
Article PubMed Google Scholar
Johansson, G. Visual perception of biological motion and a model for its analysis. Percept. & Psychophys. 14, 201–211 (1973).
Article Google Scholar
Kaiser, M. D., Shiffrar, M. & Pelphrey, K. A. Socially tuned: brain responses differentiating human and animal motion. Soc. Neurosci. 7, 301–310 (2012).
Article PubMed Google Scholar
Welsh, T. N., McDougall, L. & Paulson, S. The personification of animals: coding of human and nonhuman body parts based on posture and function. Cognition 132, 398–415 (2014).
Article PubMed Google Scholar
Marr, D. & Vaina, L. Representation and recognition of the movements of shapes. Proc. R. Soc. London B 214, 501–524 (1982).
Article ADS CAS Google Scholar
Neri, P. Wholes and subparts in visual processing of human agency. Proc. R. Soc. London B 276, 861–869 (2009).
Article Google Scholar
Edelman, S., Grill-Spector, K., Kushnir, T. & Malach, R. Toward direct visualization of the internal shape space by fMRI. Psychobiology 26, 309–321 (1998).
Google Scholar
Haxby, J. V. et al. Distributed and overlapping representations of faces and objects in ventral temporal cortex. Science 293, 2425–2430 (2001).
Article ADS CAS PubMed Google Scholar
Oosterhof, N. N., Connolly, A. C. & Haxby, J. V. CoSMoMVPA: multi-modal multivariate pattern analysis of neuroimaging data in Matlab/GNU Octave. Front. Neuroinform. 10(27), https://doi.org/10.3389/fninf.2016.00027, pmid:27499741 (2016).
Vapnik, V. N. Methods of pattern recognition in The nature of statistical learning theory, 123–180 (Springer New York, 2000).
Misaki, M., Kim, Y., Bandettini, P. A. & Kriegeskorte, N. Comparison of multivariate classifiers and response normalizations for pattern-information fMRI. NeuroImage 53, 103–118 (2010).
Article PubMed PubMed Central Google Scholar
Nichols, T., Brett, M., Andersson, J., Wager, T. & Poline, J. B. Valid conjunction inference with the minimum statistic. Neuroimage 25, 653–660 (2005).
Article PubMed Google Scholar
Chouchourelou, A., Jacobs, A., Shiffrar, M., & Chouchourelou, A. What does “biological motion” really mean? Differentiating visual percepts of human, animal, and non-biological motions in: People watching: social, perceptual, and neurophysiological studies of body perception (ed. Johnson, K., Shiffrar, M.) 63 (New York, Oxford UP, 2013).
Mather, G. & West, S. Recognition of animal locomotion from dynamic point-light displays. Perception 22, 759–766 (1993).
Article CAS PubMed Google Scholar
Mather, G., Radford, K. & West, S. Low-level visual processing of biological motion. Proc. R. Soc. London B 249, 149–155 (1992).
Article ADS CAS Google Scholar
Ullman, S., Vidal-Naquet, M. & Sali, E. Visual features of intermediate complexity and their use in classification. Nat. Neurosci. 5, 682–687 (2002).
CAS PubMed Google Scholar
Fairhall, S. L. & Caramazza, A. Brain regions that represent amodal conceptual knowledge. J. Neurosci. 33, 10552–10558 (2013).
Article CAS PubMed Google Scholar
Fairhall, S. L., Anzellotti, S., Ubaldi, S. & Caramazza, A. Person- and place-selective neural substrates for entity-specific semantic access. Cereb. Cortex 24, 1687–1696 (2014).
Article PubMed Google Scholar
Gobbini, M. I. & Haxby, J. V. Neural systems for recognition of familiar faces. Neuropsychologia 45, 32–41 (2007).
Article PubMed Google Scholar
Sugiura, M. et al. Anatomical segregation of representations of personally familiar and famous people in the temporal and parietal cortices. J. Cogn. Neurosci. 21, 1855–1868 (2009).
Article PubMed Google Scholar
Lingnau, A. & Downing, P. E. The lateral occipitotemporal cortex in action. Trends Cogn. Sci. 19, 268–277 (2015).
Article PubMed Google Scholar
Papeo, L. & Lingnau, A. First-person and third-person verbs in visual motion-perception regions. Brain lang. 141, 135–141 (2015).
Article PubMed Google Scholar
Wurm, M. F. & Lingnau, A. Decoding actions at different levels of abstraction. J. Neurosci. 35, 7727–7735 (2015).
Article CAS PubMed Google Scholar
Anzellotti, S. & Caramazza, A. Individuating the neural bases for the recognition of conspecifics with MVPA. Neuroimage 89, 165–170 (2014).
Article PubMed Google Scholar
Gray, H. M., Gray, K. & Wegner, D. M. Dimensions of mind perception. Science 315, 619–619 (2007).
Article ADS CAS PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to Cristiano Maifrè for making the experimental stimuli and to Jean-Rèmy Hochmann and Nicholas A. Peatfield for their comments on an early version of the manuscript. The research was supported in part by SMC-Fondazione Cassa di Risparmio di Trento e Rovereto to AC.

Author information

Authors and Affiliations

Center for Mind/Brain Sciences, University of Trento, Corso Bettini, 31, 38068, Rovereto, TN, Italy
Liuba Papeo, Moritz F. Wurm, Nikolaas N. Oosterhof & Alfonso Caramazza
CNRS – Institut des Sciences Cognitives Marc Jeannerod – UMR 5304, Univ Lyon, 67 Boulevard Pinel, 69675, Bron, France
Liuba Papeo
Department of Psychology, Harvard University, 33 Kirkland Street, Cambridge, MA, 02138, USA
Moritz F. Wurm & Alfonso Caramazza

Authors

Liuba Papeo
View author publications
You can also search for this author in PubMed Google Scholar
Moritz F. Wurm
View author publications
You can also search for this author in PubMed Google Scholar
Nikolaas N. Oosterhof
View author publications
You can also search for this author in PubMed Google Scholar
Alfonso Caramazza
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.P. developed the study concept. L.P. and M.F.W. contributed to the study design. Testing, data collection, and analysis were performed by L.P., M.F.W. and N.N.O. L.P. drafted the manuscript, and M.F.W., N.N.O. and A.C. provided critical revisions. All authors approved the final version of the manuscript for submission.

Corresponding author

Correspondence to Liuba Papeo.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Supplementary video 1

Supplementary video 2

Supplementary video 3

Supplementary video 4

Supplementary video 5

Supplementary video 6

Supplementary video 7

Supplementary video 8

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Papeo, L., Wurm, M.F., Oosterhof, N.N. et al. The neural representation of human versus nonhuman bipeds and quadrupeds. Sci Rep 7, 14040 (2017). https://doi.org/10.1038/s41598-017-14424-7

Download citation

Received: 31 May 2017
Accepted: 10 October 2017
Published: 25 October 2017
DOI: https://doi.org/10.1038/s41598-017-14424-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.