Introduction

Our brain is constantly facing a plethora of sensory stimuli that need to be properly sampled and organized to construct a meaningful perceptual experience. Since seminal studies on point-light displays (PLD) on the perception of distinct types of motion patterns (e.g., walking, running and dancing1), the robust tuning of the human visual system to biological motion (BM) has represented an intriguing challenge for scientists2,3,4,5. Compared to full body motion, stimuli created with PLD permit to profitably separate motion processing from other features such as color or shape, representing an essential tool to investigate the principles behind our perceptual tuning to BM. In light of its implications for cognitive science, developmental and clinical neuroscience, the study of BM processing raised considerable attention in recent years6,7. In particular, its putative implications in social cognition have been widely debated in the context of Autism Spectrum Disorder (ASD), raising the hypothesis that anomalies in BM processing could be considered a marker or an intermediate phenotype of ASD5. The present work has multiple aims. First, to test whether (and eventually to what extent) the processing of BM, in its most low-level and controlled form, namely PLD stimuli, is effectively anomalous in ASD. Second, to consider a different theoretical and conceptual model that deconstructs BM into distinct levels of processing and according to distinct perceptual manipulations. Third, to present a quantitative meta-analysis of putative BM processing anomalies in ASD that follows this alternative conceptualization.

Being characterized by early onset, lifelong and debilitating behavioral symptomatology, ASD represents a serious concern for families, clinicians and generally for societies8,9. To date, the clinical diagnosis of ASD is based on behavioral symptoms and according to the DSM 5, it is characterized by restricted and repetitive patterns of behaviors, interests or activities, and difficulties in the social communication/interaction domain10. Hyper- or hypo-reactivity to sensory stimuli or unusual interest for specific sensory aspects are also considered important features of the disorder11. Some of the major concerns in dealing with ASD are the heterogeneity in severity and behavioral manifestations as well as the challenges in its early detection, when behavioral signs are less evident12. These aspects significantly impact on the clinicians’ ability to provide early and specific interventions13. Although the growing body of studies targeting neural structures implicated in the pathophysiology of ASD has improved our insights on the condition14,15,16,17, the neurobiology of ASD is far from being clarified12,18,19,20,21. Similarly, neither brain functional and structural connectivity approaches22,23 nor genetics24,25 have definitely solved the puzzle. Efforts in understanding ASD are hampered by substantial disconnection between neurobiological findings aiming to characterize the neural/genetic components associated with the condition, and its core behavioral phenotype. Thus, research investigating endophenotypes of ASD, i.e. measurable and quantifiable links between neurobiological underpinnings and behavioral symptoms26,27, represents a critical challenge for the future28,29. In recent years, a growing body of studies on endophenotypes of ASD has been performed exploring complex functions such as visual attention30,31,32,33,34,35, visual perception36,37,38,39, motor cognition40,41,42,43,44,45 and sensory processing11,46,47,48,49,50.

A significant interest in the literature has been directed to the idea that individuals with ASD have a different perceptual experience of the world and, even more critical for our aims, that sensory/perceptual anomalies in ASD are not just a secondary effect of reduced social interactions. Accordingly, sensory/perceptual anomalies would represent a key and primary component both in terms of symptom development and pathophysiology11. A specific focus has been oriented to BM processing in ASD. Following the first report of BM impairment in ASD51, the number of studies investigating this question has constantly increased over the years (see Fig. S1 in Supplementary Information). However, factors such as heterogeneous experimental designs, weak understanding of potential confounds and inconsistency among the classes of information conveyed by BM may impact on the results reliability. Despite a general agreement on the supposed BM anomalies in ASD, awareness for both the epistemological nature of these anomalies and their quantification is to date lacking5,52,53.

Biological motion processing: an early emerging ability

We may expect that BM processing, being a phylogenetically highly preserved mechanism, would show an early developmental progression during ontogenesis54. However, it still remains to be clarified whether, and eventually to what extent, being tuned to BM is dependent on experience. A seminal study indicates that at 3 months of age infants are able to discriminate BM stimuli vs. their scrambled counterparts55. This trend is observed also at 4–5 months of age55,56. More recently, a convincing argument in favor of the hypothesis that BM processing is largely independent of experience was provided by a study testing the ability of newborns to discriminate BM stimuli4. The sensitivity to BM was tested in 2-day-old babies using hen-walking animations as compared to non-BM stimuli (i.e., the same pattern of elements moving in a random manner) and inverted (upside -down) hen-walking stimuli. The use of hen stimuli allowed to exclude any possible (although remote) learning mechanisms taking place in the first 2 days of life. The results indicate that newborns manifest a spontaneous preference for BM stimuli, supporting the idea that this processing is largely (or even completely) independent from experience4. Although the authors could not rule out the possible role of fetal experience in shaping BM predispositions (see for example5), these findings support a certain degree of innate or nearly innate tuning of the human visual system to BM.

The developmental trajectory of the sensitivity to BM stimuli has also been extensively explored in comparison to global motion57. Testing different groups of children (aged 6–8, 9–11 and 12–14 years, respectively) and adults, Hadad and colleagues57 claimed that developmental trajectories for coherent motion and BM are similar. In both cases, the authors found that thresholds achieved adult-like profiles around the age of 14, following a sort of monotonic developmental trajectory of improvement from childhood. In contrast, testing after surgery the BM and global motion processing in dense bilateral congenital cataract patients that suffered from early visual deprivation, showed that marked deficits in global motion are not accompanied by similar difficulties in BM processing58. Importantly, a recent EEG study showed that the BM perception is not only preserved in these patients at the behavioral level, but also that the same neural processing is involved. Indeed, in patients with congenital cataracts, the N1 component of event-related potentials was modulated post-surgery by the processing of BM as in the control group59. Spared abilities in the processing of BM after early visual deprivation are surprising in light of the data reporting impairments in similar cohorts of patients both in global motion60 and in holistic face processing61,62.

Taken together, these data indicate a composite picture in which clear similarities57 and marked differences58 between basic and more complex mechanisms (e.g., global motion and BM processing) coexist. Such coexistence should stimulate further studies aiming at characterizing shared developmental patterns vs. neuroplastic mechanisms. For shared patterns, a classical approach refers to the well-established neuroconstructivist view supporting the idea that very early not-detailed, domain-general functions impact on the development of more complex, domain-specific, and even “social” functions63,64. For compensatory mechanisms, examples from heterogeneous clinical pictures may provide intriguing insights58 (for example, related to other functions, see also65,66). This overview of BM ontogenesis suggests that oversimplifying this construct, as if it were a monolithic process, can be misleading. For this reason, we believe that a paradigm shift in approaching the understanding of BM is needed.

Motor, spatio-temporal and social aspects of biological motion processing

It has been hypothesized that the ability to detect, recognize and interpret BM is related to the expertise in performing similar movements or actions and, to some extent, also to one’s social abilities5. Beyond the attractiveness of this view, whether and how the ability to process BM is linked tout court to the expertise in producing similar movements or actions requires a cautious interpretation. Recent experimental and theoretical investigations on mirror mechanisms clearly indicate that humans benefit from their own motor representations to understand the actions of others directly (“motorically”, “from the inside”67). Notably, mirroring phenomena may occur at distinct levels of abstractness, as if we are able to represent distinct components of an action in a motor way67,68. Such mechanisms have been hypothesized to be anomalous in ASD43,69.

Two studies reporting seemingly diverging results may help in furnishing further insights on this point. First, adolescents born preterm and suffering from periventricular leukomalacia, a clinical condition characterized by white matter lesions near the lateral ventricles, were tested in their ability to process BM5,70,71. These participants varied in their motor ability, ranging from complete walking disability to typical or near-typical motor patterns70. Impairment in the execution of actions with the lower and upper limbs correlated with the lateral extent of periventricular lesions but did not correlate with the performance at the BM task. Interestingly, sensitivity to the processing of BM negatively correlated with the volume of the lesions over parieto-occipital areas, whereas no correlation over the frontal and temporal regions was found70. Second, the link between action perception and action execution was explored by Casile and Giese72, who tested whether the acquisition of novel motor behaviors could improve the processing of BM stimuli. The authors demonstrated that nonvisual motor training selectively improves the visual recognition of novel BM patterns, thus suggesting that motor learning, independently of visual feedback (i.e., the participants were blindfolded), impacts on BM recognition73. Taken together, these two studies outline a more articulated picture. The hypothesis that BM recognition is linked to some extent to the ability in producing specific motor patterns is fascinating, yet simplistic73. At the same time, BM processing seems to benefit from more efficient and unexpected compensatory mechanisms, in turn suggesting that functional networks supporting BM processing are more complex70,74.

Another source of debate in the literature is how spatio-temporal features of BM stimuli impact on the BM processing itself. BM processing entails both spatial and temporal dynamics, but the exact contribution of these components and individual tolerance to their perturbation have not yet been clarified in ASD57,75. This may offer insights toward the understanding of apparently controversial findings reported in the ASD literature, given that being tolerant to spatial and/or temporal perturbations is supported by potentially distinct neural mechanisms. Recent evidence refined traditional views on timing by attributing a key role to the cerebellum76. Nowadays, cerebellar contributions to non-motor functions, such as sensation and perception, are well-established77,78. Interestingly, the cerebellum also contributes to the pathophysiology of ASD15 and – more specifically – to the sensory/perceptual anomalies recently explored in ASD14,79.

The complexity of networks supporting BM processing and the neuroplasticity of the compensatory mechanisms may be considered cues of the functional relevance of BM. A classical view suggests that the perception of BM is tightly linked with individual abilities in the social domain, leading back this hypothesis to the evolutionary benefit in distinguishing efficiently biological from non-BM motion. This “social” interpretation of BM posits that performance in BM tasks may serve as benchmark for unveiling difficulties in social functioning, for example in neurodevelopmental conditions, such as ASD5,7. Beyond the considerable success of this view80, refined theoretical models and robust quantitative approaches (i.e., meta-analysis) are limited and whether and how BM processing is linked to social functioning is not clear. One possibility is to interpret BM processing as a building block of more complex social abilities in the context of the so-called “social brain”. Poor performance in BM discrimination in ASD would then be a marker of their social functioning difficulties5,80. The exact nature of the “social brain” construct30,81,82 and of other related constructs such as “theory of mind” or “mindreading”83,84,85, which gained considerable attention in the last decades in a wide range of disciplines, remains however often vague and inconsistent among studies. The methodologies employed for exploring these constructs are often heterogeneous and the associated neurobiological underpinnings are unclear86, suggesting the need for a perspective shift.

Aim

As outlined in the previous sections, BM perception (i) is phylo- and onto-genetically robust, (ii) has been hypothesized to be a putative marker for ASD5,28,87, and (iii) seems to be preserved, at least to a certain degree, when the BM stimuli are perturbed (or scrambled), spatially and/or temporally75.

Despite the fact that a rigorous selection of comparable studies for conducting a quantitative meta-analysis has the disadvantage of losing some information on the characterization of the BM phenomenon at large, such an approach can help systematizing the heterogeneity of the findings reported so far. In our meta-analysis, we selected only studies that assessed the BM processing in ASD using PLD stimuli. As stated above, this kind of stimuli allows to investigate the essence of BM perception without the interference of other higher-level visual features that cannot be easily controlled for. We believe that focusing on low-level features is critical in our attempt to shed light on possible anomalies of BM processing in ASD. Moreover, this choice is consistent with other theoretical proposals which focused on findings from PLD stimuli to disentangle BM processing in the neurotypical population72,88. To better understand BM processing anomalies in ASD, it is critical to investigate the impact of both the type of processing involved in different BM tasks, and the type of stimuli that are used to assess this ability. These critical points will be addressed in the following subsections.

Deconstructing biological motion processing: a multi-level conceptual model

Given the high variability of the BM tasks employed with ASD individuals, we propose a conceptual classification of these tasks that aims at deconstructing the complex notion of BM in distinct (sub)components. The need for such deconstruction has been previously proposed also for other complex constructs such as “theory of mind”89,90,91, “social brain”82,92 or “imitation”65,93,94,95. Indeed, when these constructs are treated as monolithic ones, we may run into misleading consequences. From a conceptual and epistemological perspective, they may convey different meanings and could translate into highly heterogeneous experimental protocols, which in turn can threat the reproducibility of findings and the reliability of results86.

In the literature, others have attempted to identify distinct (sub)components that play a role in BM processing. Troje88, for example, proposed a hierarchical model with four levels of processing with different complexity: (1) life detector, a cue of an animate object moving on the ground driven by the local motion, (2) structure-from-motion driven by the global motion, (3) action recognition and (4) style recognition, such as the recognition of emotional states. The author pointed out that the BM is a complex phenomenon characterized by a local spatio-temporal, ballistic-velocity profile due to the gravity force, which can convey high-level information. Also, Casile and Giese72 proposed an interesting model that tries to explain the phylo- and onto-genetically highly preserved inclinations in processing BM. The authors hypothesized a hierarchy of neural detectors that extract motion features with different BM complexities: (1) local motion energy detectors, (2) detectors for horizontal and vertical opponent motion, (3) detectors for complex global optic flow patterns and (4) detectors for complete biological motion patterns. Both these models highlighted a clear hierarchical organization in BM processing and have the merit of focusing on the need of deconstructing BM. However, such models use categories that are less informative to our specific aim, namely better understanding BM anomalies in ASD. For that reason, we hereby propose a new clusterization across different levels of BM processing according to task complexity. We hypothesize that these distinct levels of BM processing could be differently affected in participants with ASD. Specifically, we propose a three-level model that distinguishes between: (I) first-order processing of BM, (II) direct processing of BM, and (III) instrumental processing of BM.

  1. (I)

    First-order processing of BM refers to the implicit detection of BM that implicates a behavioral response, which does not entail any explicit recognition or categorization of BM. Experimenters infer that participants have detected a BM stimulus by simply evaluating their spontaneous behavioral response (e.g., looking time in preferential looking paradigms).

  2. (II)

    Direct processing of BM involves an explicit discrimination between distinct features of the stimuli, using tasks with different types of BM stimuli (e.g. rightward vs. leftward walker), or an explicit detection and/or recognition of BM when both BM and non-BM stimuli are employed (e.g., human vs. vehicle; biological vs. its scrambled version).

  3. (III)

    Instrumental processing of BM implicates that the observer takes advantage of the processing of a BM stimulus for a secondary purpose, i.e. to disclose distinct and potentially more complex information, such as inferring about other’s intentionality, emotional states or actions.

A meta-analysis based on this three-level model allows to disentangle whether the richness of information conveyed by PLD-based BM could be, at least partially, the source of the variability of the findings of BM processing reported in ASD.

Do different features of BM stimuli imply different performance

Another possible source of variability in BM processing in the ASD population may stem from the fact that BM is characterized by a moving structure that requires the accurate decoding of both spatial and temporal information. Interestingly, Casile and Giese72 found that local motion seems to be an important feature supporting BM processing. Indeed, they showed that even if the spatial arrangement of the PLD dots is inconsistent with the human skeleton, a plausible local optic flow is enough to suggest the presence of a BM stimulus. This suggests that low-level features, such as spatio-temporal patterns, are extremely relevant in the processing of BM. Moreover, given the evidence of anomalies in low-level features processing in individuals with ASD11, we think that it could be extremely useful to test if the heterogeneity showed by ASD individuals in BM processing can be driven by the manipulation of different stimulus features. In other words, perceptual anomalies that are typically associated with ASD could potentially impact on BM processing more strongly when certain types of stimuli are employed. Therefore, the present work aims to preliminarily assess whether the difference between the performance of ASD and typical developing (TD) individuals is modulated by the type of low-level manipulation (spatial vs. temporal) of scrambled PLD stimuli used as comparison. Moreover, to assess whether spatio-temporal processing anomalies are specific to BM stimuli or whether they are generalized to other type of stimuli, we tested if there is a difference between the performance of ASD and TD individuals with non-BM stimuli.

Altogether, the present meta-analysis will compare the performance of ASD and TD groups according to the following four steps:

  1. 1.

    A general analysis exploring all studies using PLD to investigate BM in ASD.

    Biological motion in ASD vs. TD [1 = All BM studies];

  2. 2.

    An analysis testing BM processing in ASD according to our three-levels model (i.e., first-order, direct and instrumental BM processing).

    Level of processing of BM in ASD vs. TD [2 = Level of processing];

  3. 3.

    An analysis investigating the contribution of spatial and temporal features in how individuals with ASD process BM.

    Impact of low-level features of BM scrambled stimuli in ASD vs. TD [3 = Low-level features];

  4. 4.

    An analysis testing ASD performance in non-BM stimuli.

    Non-biological motion in ASD vs. TD [4 = Non-BM].

Methods

Literature search

Following the PRISMA-P guidelines96, we have established a protocol detailing the a priori rationale, the methodological and analytical approaches to which we adhered during the preparation of the present meta-analysis. This protocol does not represent an update of any other reports and it is not part of a systematic review on this topic.

An automatic literature search began on September 2017 and the last search was performed in 14th January 2020. We searched for all studies published in English in both PubMed and PsycInfo, by performing a search on titles, abstracts and keywords. For direct replication, we include the three Boolean queries used to the search the PubMed database: (1) ((“biology”[MeSH Terms] OR “biology”[All Fields] OR “biological”[All Fields]) AND (“motion”[MeSH Terms] OR “motion”[All Fields])) AND (“autistic disorder”[MeSH Terms] OR (“autistic”[All Fields] AND “disorder”[All Fields]) OR “autistic disorder”[All Fields] OR “autism”[All Fields] OR “asd”[All Fields] OR “autism spectrum disorder”[All Fields]); (2) “Human motion”[All Fields] AND (“autistic disorder”[MeSH Terms] OR (“autistic”[All Fields] AND “disorder”[All Fields]) OR “autistic disorder”[All Fields] OR “autism”[All Fields OR “asd”[All Fields] OR “autism spectrum disorder”[All Fields]); (3) point-light[All Fields] AND (“autistic disorder”[MeSH Terms] OR (“autistic”[All Fields] AND “disorder”[All Fields]) OR “autistic disorder”[All Fields] OR “autism”[All Fields] OR “asd”[All Fields] OR “autism spectrum disorder”[All Fields]). Moreover, in order to reduce the possibility of publication bias, we repeated the same searches in two preprint servers relevant for the field (PsyArXiv and BioRxiv) to identify unpublished articles on the topic.

In addition, we also performed a manual literature search which encompassed a search of the reference lists of the review articles and the original research papers retrieved via the Boolean queries. One paper was retrieved by using this method. Altogether the automatic and the manual literature searches resulted in 146 unduplicated published articles. No reviews, book chapters, master and doctoral theses or conference presentations were hereby included.

Screening process

The studies retrieved from the automatic and manual literature searches were independently assessed and screened by two authors (A.F. and L.Ra.). At the end of the first round of screening, they evaluated together the differences emerged from the two separate searches and discussed with the other authors the final set of papers to be included, based on the inclusion criteria as detailed below. The n reported represents the number of studies excluded at each step. Please, also refer to Fig. 1.

  1. (1)

    Screening: from the analysis of title and abstract, the empirical studies that did not include a task in which the processing of BM was assessed via PLD in both an ASD and a TD group were excluded from the initial paper count (n = 78).

  2. (2)

    Eligibility: the full text of the screened articles was read, and following this step, studies were further excluded if they did not actually meet the same criteria of the previous screening step (n = 20).

  3. (3)

    Qualitative assessment: from reading the whole publication, the articles were further excluded if they did not meet the following criteria:

    • If the study sample was the subset of a bigger sample presented in a different paper, the study with the smaller sample was discarded (n = 297,98).

    • If the study did not investigate behavioral measures such as accuracy, sensitivity (d’), threshold estimation, reaction times (RTs), or percentage of preferential looking (or first orienting) in a BM task. Thus, all the studies that use a passive view task in order to investigate BM perception using EEG or fMRI were discarded (n = 8).

    • If the dependent variables and/or task employed were not comparable to the majority of the studies included. Specifically, these studies were removed for the following reasons: (i) Krüger and colleagues99 used a confidence rating as the dependent variable, no accuracy or RTs were reported; (ii) Van Boxtel and colleagues100 used as the dependent variable the difference in the point of subjective equality (PSE) between the walking and running conditions, and thus the data related to BM performance could not be retrieved, for similar reasons, we also excluded the study conducted by Karaminis and colleagues101; (iii) Swettenham and colleagues102 used an attentional cuing paradigm in which cues were BM stimuli and reported RTs for valid and invalid conditions, which do not primarily map onto BM processes (n = 4).

    • If the study did not have sufficient statistical information (either present in the paper, Supplementary Materials or provided after contacting the authors) (n = 4).

Figure 1
figure 1

Flow diagram of the selection of studies included in the current meta-analysis. The number of studies included or excluded at each step of the evaluation process is indicated in each of the diagram boxes.

Coding and reliability

The final sample of studies were coded based on the following rules:

  1. 1.

    Level of BM processing, which led to the creation of three main categories: (1) first (2) direct, and (3) instrumental processing of BM;

  2. 2.

    Type of scrambled stimulus used in tasks that assess the perception of BM vs. its scrambled version: (1) no scramble; (2) spatial scramble; (3) temporal scramble; (4) spatial and temporal scramble (see definition below for the details of both spatial and temporal scramble labels).

  3. 3.

    Type of motion stimuli presented: (1) BM, when the data refer to the performance following the presentation of solely BM stimuli; (2) BM vs. non-BM, when the dependent variable refers to a task in which the perception of BM stimuli is contrasted with the perception of non-BM stimuli; (3) non-BM, when it is possible to separately extract data concerning only the performance of non-BM stimuli (i.e., the scrambled version of the BM stimulus or the motion of an object, such as a truck or a bicycle). Disagreement and inconsistencies between the two raters (A.F. and L.Ra., n = 7/114, 6% of all assessments) were questioned with all authors to gather consensus.

In addition, for all the studies we collected:

  1. 1.

    the relevant sample sizes of both ASD and the TD groups;

  2. 2.

    the percentage of male participants in each of the two groups;

  3. 3.

    the mean age of each of the two groups;

  4. 4.

    the test employed to assess general cognitive level of participants, and the relative mean score for each of the two groups;

  5. 5.

    the type of behavioral measure collected: accuracy rates (including error rates) or percentage of looking time. Since RTs were present only in 5 of the studies included in this meta-analysis, we preferred to exclude them to increase the homogeneity and the comparability of measures;

  6. 6.

    whether behavioral tasks are performed during an EEG or fMRI acquisition;

  7. 7.

    a brief qualitative description of the task and the displayed stimuli.

All this relevant information about the studies to be included in the final analyses is reported in Table S1 as Supplementary Information. Moreover, we used the descriptive statistics or the associated test statistics of the performance from both the ASD and TD groups to calculate the Cohen’s ds. If further groups were tested in the study, the statistics of the other groups were not considered.

Effect size calculation

For each observation, using the descriptive or test statistics present in the papers selected, we calculated Cohen’s d as a measure of the size of the group effect (ASD vs. TD) on the variable of interest. Only two studies explicitly reported Cohen’s d. For all other studies, we extracted Cohen’s d from mean and standard deviations (or standard errors), or from the t-test value (or the associated p-value) of the reported statistical analyses. The mean and the standard deviation were available in most of the studies, therefore we used the following equation (1) to compute the Cohen’s d (see103,104):

$$d=\frac{{M}_{1}-{M}_{2}}{\sqrt{\frac{({n}_{1}-1)S{D}_{1}^{2}+({n}_{2}-1)S{D}_{2}^{2}}{{n}_{1}+{n}_{2}-2}}}$$

where M1 is the mean of the dependent variable for the first group (e.g., ASD), M2 is the mean of the dependent variable for the second group (e.g., TD), SD1 and SD2 are respectively the standard deviations of the mean for the first and the second group, n1 and n2 refer to the number of participants in the first and the second group, respectively. We arbitrarily set the direction as positive indicating a better performance for the TD group as compared to the performance of the ASD group (e.g., lower error rate). According to the guidelines by Cohen105, an absolute effect size of 0.2–0.3 is considered a small effect, ~0.5 a medium effect and >0.8 a large effect. The variance associated to each Cohen’s d was computed following equation (2)103:

$$Var(d)=\frac{{n}_{1}+{n}_{2}}{{n}_{1}{n}_{2}}+\frac{{d}^{2}}{2({n}_{1}+{n}_{2})}$$

In some cases, two or more Cohen’s ds referring to the same study were included in the same meta-analysis. When more than one task was included in the same (sub)group of the meta-analysis, we pooled those Cohen’s ds in order to compensate for the possible underestimation of the variance of the mean effect size resulting from the inclusion of multiple effect sizes from a single study in the same meta-analysis106,107. The Cohen’s ds referring to the same study were pooled using the formulas reported by Borenstein and colleagues106, p. 230 (equations 3 and 4):

$$\bar{d}=\frac{1}{m}(\mathop{\sum }\limits_{j}^{m}{d}_{j})$$
$$Var(\bar{d})={(\frac{1}{m})}^{2}(\mathop{\sum }\limits_{j}^{m}Var({d}_{j})+\mathop{\sum }\limits_{j\ne k}^{m}({r}_{jk}\sqrt{Var({d}_{j})}\sqrt{Var({d}_{k})}))$$

where \(\bar{d}\) and \(Var(\bar{d})\) are the average Cohen’s d and its variance, m is the number of Cohen’s ds that were pooled, \({d}_{j}\) and \({d}_{k}\) are the Cohen’s ds referring to measures j and k, and \({r}_{jk}\) is the correlation between measures j and k, which was arbitrarily set to 0.5. The maximum number of Cohen’s ds that were pooled is 10. This number varies depending on the meta-analysis performed.

In each of the four meta-analyses, multiple Cohen’s ds from the same study were pooled when they referred to tasks that were coded in the same manner. For instance, in Annaz and colleagues study108 there are two tasks, one measuring the sensitivity (d’) for the perception of BM in normal PLD vs. scrambled stimuli, and the other establishing the thresholds for the detection of a PLD walker in noise. Since both tasks were coded as direct processing, the two Cohen’s d were always pooled together. This was also the case of the “action”, “subjective state”, and “emotion” conditions in Hubert and colleagues109, that were all coded as tasks of instrumental processing. In some cases, multiple Cohen’s ds from the same study could be kept separate or pooled together depending on whether the categorical distinction between the tasks was relevant or irrelevant for the meta-analysis at hand. For instance, in Nackaerts and colleagues110 the Cohen’s ds referring to the task categorized as direct processing and the task categorized as instrumental processing were pooled in the first general meta-analysis [1 = All BM studies], but they were kept separate in the meta-analysis testing the effects of the level of BM processing [2 = Level of processing].

Mixed effects models

Statistical analyses were conducted in R, using the package metafor111. The meta-analysis performed was analyzed using a mixed effects model111, estimating the regression coefficients for each moderator included as well as the error between experiments. No within error estimate was included in the model, because we only included one Cohen’s d per study. The rationale for this choice was that the majority of data from the studies presented a single effect size and a traditional random-effect model would suffice106. A simple model with one moderator variable is given in equation (5):

$${y}_{i}={b}_{0}+{b}_{1}{x}_{i}+{e}_{i}$$

for i {1,…,n}

yi is the Cohen’s d for the ith study

b0 is the fixed intercept for the regression model

b1 is the fixed slope for the regression model

xi is the predictor for the ith study

ei N(0, σ) is a Gaussian error term.

The variance components of the model including the predictor variables refer to the unexplained variance. To answer specific questions, we run four separate meta-analyses:

  1. 1.

    Biological motion in ASD vs. TD [1 = All BM studies]. We included all the studies that were selected, and, for each of them, a single Cohen’s d was inserted (n = 29). Since cognitive functioning was assessed with different tests across studies (see Table S1 in Supplementary Information) we computed a Cohen’s d per study to have a comparable measure of the cognitive level to be used as a possible moderator. A positive Cohen’s d indicates that the cognitive level of the TD group is greater than the cognitive level of the ASD group. We included this moderator in the meta-analytic models as a way to assess if the ASD diagnosis has a specific effect on the processing of BM, which cannot simply be attributed to a reduction in cognitive level. Moreover, we insert also the mean age of ASD and of TD groups as moderators in order to control for the possible effect of age.

  2. 2.

    Level of processing of biological motion in ASD vs. TD [2 = Level of processing]. We included one Cohen’s d per study in each of the three levels of BM processing identified (n = 7 first-order, n = 16 direct, n = 11 instrumental). ‘Level of processing’ was included in the meta-regression analysis as a categorical predictor, and no further moderating effects were considered here, due to the limited number of studies included in each group.

  3. 3.

    Impact of low-level features of BM scrambled stimuli in ASD vs. TD [3 = Low-level features]. We included one Cohen’s d for each study that used the scrambled version of the BM stimulus (n = 7 spatial, n = 6 temporal). We considered two types of scrambled stimuli according to the manipulation, spatial or temporal, employed to generate them. To this aim, this meta-analysis includes a categorical moderator, ‘type of scramble’, which consists of two levels: (1) spatial scramble, the difference between the scrambled and the BM stimulus is limited to a difference in the spatial configuration. Specifically, in the scrambled version, each dot of the PLD was randomly displaced in a different spatial location; (2) temporal scramble, the difference between the scrambled and the BM stimulus is limited to a difference in the temporal phase. Specifically, each dot in the PLD scrambled stimulus was presented in the same spatial position but the motion of each single dot was temporally out of phase as compared to that of the BM stimulus. Studies with both spatial and temporal manipulations were excluded since they were not numerous enough to create a stand-alone group (n = 2). Considering the limited number of studies per level, we refrained from including other moderators.

  4. 4.

    Non-biological motion in ASD vs. TD [4 = Non-BM]. This meta-analysis is limited to the studies that test, together with the perception of a BM stimulus, also the processing of another stimulus that can be defined as a non-BM stimulus (i.e., an object as a bike or a truck presented in PLD109,112 or of a scrambled version of the BM stimulus39,113,114 (n = 5). It is important to note that we do not consider this analysis to be ultimately conclusive, because an extensive analysis of this type should include also other types of non-BM tasks such as coherent motion, form-from-motion, etc. However, we still believe that this aspect is worth investigating to provide an initial characterization of possible dissociations between BM and non-BM processing in ASD vs. TD.

All tests were conducted with a significance level of 5%. The results are presented including the following measures: weighted mean effect sizes (Cohen’s d), 95% confidence intervals, I2 heterogeneity values, and p values. All error bars in forest plots are 95% confidence intervals and were generated by applying customized R scripts.

Moderator correlations

Being aware of the need to control for confounding moderators115, we opted to use ANOVAs (performed in R with the aov function) in order to evaluate possible systematic relationships between the categorical predictor (i.e. ‘level of processing’, ‘type of scramble’) and the quantitative predictor (i.e. ‘cognitive level’, ‘mean age of ASD’, ‘mean age of TD’). This procedure allowed us to define whether the cognitive level or the group age differed systematically across the levels of processing and type of scramble.

Publication bias

To determine the presence of publication bias for the main effect of each meta-analysis, we plot each study’s effect size (Cohen’s d ASD vs. TD) against its own standard error in a funnel plot. Publication bias is computed as a negative association between the sample size and the effect size across studies, since a result coming from a small sample must have a large effect size to be significant. More specifically, publication bias occurs when studies with small samples are more likely to be published when they report statistically significant results (i.e., a large effect size), as compared to when they do not (i.e., a small effect size). The publication bias implies an overestimation of the true effect size, for which appropriate correction methods are available.

Results

Biological motion in ASD vs. TD [1 = All BM studies]

The analysis including all papers contrasting the processing of BM in ASD vs. TD revealed a medium effect of the difference in performance between the two groups, Cohen’s d = 0.60, SE = 0.13, CI 95% = [0.36, 0.85], z = 4.77, p < 0.0001. In other words, there is a medium effect size indicating that individuals with ASD have more difficulties in BM processing than TD individuals (Fig. 2A). However, the I² coefficient reveals that 76% of the variability of the effect sizes can be attributed to the heterogeneity among the true effects (Test for Heterogeneity: Q(28) = 110.67, p < 0.0001). Cognitive level, mean age of ASD group and mean age of TD group proved to have no statistically significant effect on Cohen’s d (cognitive level: b = 0.19, SE = 0.17, CI 95% = [−0.14, 0.52], z = 1.13, p = 0.26; mean age of ASD group: b = 0.003, SE = 0.01, CI 95% = [−0.02, 0.02], z = 0.28, p = 0.78; mean age of TD group: b = 0.001, SE = 0.01, CI 95% = [−0.02, 0.02], z = 0.09, p = 0.93).

Figure 2
figure 2

(A) Forest plot of the differences in the BM performance between the ASD and TD groups. Positive values indicate a better performance for the TD group as compared to the performance of the ASD group. (B) Funnel plot indicating the publication bias in studies of BM processing in ASD vs. TD. White circles represent the missing studies identified by the trim and fill test.

A visual inspection of the funnel plot resulting from this meta-analysis suggests a quite symmetrical distribution of the studies (Fig. 2B). To confirm whether or not a publication bias is present in this case, we computed a trim and fill test116. Such test estimates the absence of four studies on the left side (SE = 3.58) and none on the right side. When computing a new average effect size including the effect sizes of the three studies estimated to be absent, the average effect size was still significantly different from zero, Cohen’s d = 0.47, SE = 0.13, CI 95% = [0.22, 0.72], z = 3.63, p = 0.0003. Thus, since the effect size was still medium and significant even after the inclusion of the estimated missed studies, the difference in the processing of BM between ASD and TD groups cannot be considered dependent on publication bias.

Level of processing of biological motion in ASD vs. TD [2 = Level of processing]

To investigate the role of the level of processing needed to accurately perceive BM in ASD vs. TD, we computed a mixed effect model with ‘level of processing’ as a moderator. This moderator has three levels: (i) first-order processing; (ii) direct processing; (iii) instrumental processing, indicating the three subgroups in which tasks were divided. The I2 coefficient indicates that the heterogeneity among the true effects is ~71% (Test for Residual Heterogeneity: QE(31) = 103.85, p < 0.0001). The results reveal a significant effect of the moderator (QM(3) = 37.94, p < 0.0001). Thus, to further explore each level of the moderator we calculated separate meta-analyses for each level of processing. When analyzing the first-order processing level, we obtained Cohen’s d = 0.55, SE = 0.27, CI 95% = [0.02, 1.09], z = 2.04, p = 0.04, and the heterogeneity was comparable to that of the full model (I2 = 79%, Q(6) = 27.04, p < 0.0001, Fig. 3A). When analyzing the direct processing level, we obtained Cohen’s d = 0.40, SE = 0.18, CI 95% = [0.05, 0.74], z = 2.24, p = 0.03, and the heterogeneity was comparable to that of the full model (I2 = 76%, Q(15) = 61.29, p < 0.0001, Fig. 3B). Instead, the instrumental processing level is the one showing the greatest difference in the processing of BM between ASD and TD, Cohen’s d = 0.98, SE = 0.13, CI 95% = [0.73, 1.23], z = 7.56, p < 0.0001, and the heterogeneity was reduced as compared to that of the full model and not significant anymore (I2 = 34%, Q(10) = 15.52, p = 0.11, Fig. 3C).

Figure 3
figure 3

(A) Forest plot of studies investigating the difference in first-order level of BM processing between ASD and TD group. (B) Forest plot of studies investigating the difference in direct level of BM processing between ASD and TD group. (C) Forest plot of studies investigating the difference in instrumental level of BM processing between ASD and TD group. In all plots, positive values indicate a better performance for the TD group as compared to the performance of the ASD group.

Moreover, in order to test if there is a significant difference between the magnitude of these estimated effect sizes, we performed three z-tests between the Cohen’s ds of the different levels of processing using the Bonferroni correction method. Only the difference between the direct and instrumental levels was statistically significant (direct vs. instrumental: z = −2.81, p = 0.015; first-order vs direct: z = −0.46, p ~ 1; first-order vs. instrumental: z = −1.45, p = 0.45).

To evaluate possible systematic differences in cognitive level across the levels of BM processing, we performed a one-way ANOVA on the Cohen’s ds referring to the cognitive level with ‘level of processing’ as a factor. No statistically significant differences were retrieved (F(1, 27) = 0.09, p = 0.76, ηp2 = 0.003). Moreover, two one-way ANOVAs showed that neither the mean age of the ASD groups nor that of the TD groups differed systematically across levels of BM processing (F(1, 31) = 1.71, p = 0.20, ηp2 = 0.05, and F(1, 32) = 1.94, p = 0.17, ηp2 = 0.06, respectively). In sum, the effects of level of processing on Cohen’s d cannot be explained in terms of systematic differences between cognitive level, mean age of the ASD groups, or mean age of the TD groups.

Impact of low-level features of BM scrambled stimuli in ASD vs. TD [3 = Low-level features]

With this third meta-analysis, we aim at further characterizing the low-level perceptual aspects potentially affecting the processing of BM in ASD vs. TD. Here, we focus only on studies that include both BM and non-BM scrambled stimuli. In other words, with this meta-analysis we inquire whether there are specific spatial or temporal features of the stimuli in motion that affect ASD and TD individuals differently.

The I2 coefficient indicates that the heterogeneity among the true effects is ~58% (Test for Residual Heterogeneity: QE(11) = 25.95, p = 0.007). Interestingly, results indicate that the moderator is significant (QM(2) = 12.02, p = 0.003). To further investigate this finding, we calculated separate meta-analyses for each moderator level. When considering only the studies with a BM task that includes a spatially scrambled stimulus, results indicated no significant difference in the performance of ASD vs. TD (Cohen’s d = 0.38, SE = 0.23, CI 95% = [−0.07, 0.83], z = 1.67, p = 0.10, Fig. 4A). The confidence interval includes 0, suggesting that we cannot exclude that the effect size for this subgroup of studies is null, and the heterogeneity is still moderate and significantly different from zero (I2 = 60%, Q(6) = 14.84, p = 0.02).

Figure 4
figure 4

(A) Forest plot of studies investigating the difference in BM performance between ASD and TD group that employed spatial scramble stimuli. (B) Forest plot of studies investigating the difference in BM performance between ASD and TD group that employed temporal scramble stimuli. In both plots, positive values indicate a better performance for the TD group as compared to the performance of the ASD group.

On the contrary, the model including the temporally scrambled stimulus showed a significant medium-large effect size (Cohen’s d = 0.67, SE = 0.21, CI 95% = [0.26, 1.08], z = 3.20, p = 0.001, Fig. 4B), and the included studies can be considered a homogenous group (I2 = 55%, Q(5) = 11.11, p = 0.05). These differences in the processing of scrambled stimuli were not confounded by the cognitive level, as it was shown by a one-way ANOVA on Cohen’s ds referring to cognitive level with ‘type of scramble’ as a factor (F(1, 9) = 0.009, p = 0.93, ηp2 = 0.001). Similarly, differences in the processing of scrambled stimuli were not confounded by the mean age of ASD groups (F(1, 12) = 2.29, p = 0.16, ηp2 = 0.16) nor by the mean age of TD groups (F(1, 13) = 4.29, p = 0.06, ηp2 = 0.25).

Non-biological motion in ASD vs. TD [4 = Non-BM]

We tested whether the deficit shown by individuals with ASD is selective to BM stimuli or it also generalizes to non-BM stimuli. Results indicate that no significant difference between the performance of individuals with ASD and TD is evident, (Cohen’s d = 0.26, SE = 0.29, CI 95% = [−0.31, 0.83], z = 0.89, p = 0.38, Fig. 5). The total heterogeneity estimated with I2 is ~75% (Q(4) = 19.80, p = 0.0005).

Figure 5
figure 5

Forest plot of studies investigating the difference in the processing of non-BM stimuli between ASD and TD group. Positive values indicate a better performance for the TD group as compared to the performance of the ASD group.

For a summary of the main results, please refer to Table 1 below.

Table 1 Each row reports the main result for each of the four meta-analyses performed.

Discussion

In this article, we conducted an original meta-analytic investigation examining the abilities of individuals with ASD in the processing of BM stimuli. The results of the general meta-analysis [1=All BM studies] showed that individuals with ASD present a moderate (d = 0.60) deficit in BM processing as compared to TD individuals. This deficit was not influenced either by the cognitive level or by the mean age of participants. Moreover, it was not due to publication bias. Nonetheless, we observed a high heterogeneity among studies (I² = 76%), and this is an interesting aspect because it suggests that possible (underestimated) sources of variability are present in the group of studies included. We believe that this result supports the idea that the perception of BM is a complex ability, which cannot always be reduced to a monolithic process.

The need for well-controlled meta-analytic approaches in evaluating BM processing in ASD is witnessed by the fact that, following our pre-print deposit on BioRxiv (January 24, 2019), two other groups have pursued similar investigations. Consistently with our results, both of these two meta-analyses have shown a general significative impairment in ASD to process BM117,118. However, behind the general deficit, the relevant question becomes which are the component(s) that can better explain the differences in behavioral performance between TD and ASD participants? Our work provides preliminary insights on this matter. By challenging the high heterogeneity reported in our general meta-analysis, we proposed meta-analytic investigations focusing on specific moderators.

Thus, we considered the distinct levels of processing defined in our three-level model (i.e., first-order, direct and instrumental BM processing), the type of manipulation used to create the comparison scrambled stimulus (spatial vs. temporal), and the selectivity of the impairment for biological as opposed to non-biological motion.

Is BM perception in ASD anomalous across all levels of processing?

A more precise analysis testing BM processing in ASD according to a three-level model that distinguished among first-order, direct and instrumental processing of BM confirmed that such a deficit was present independently of the specific level of information processing required in the BM task [2 = level of processing]. Nonetheless, this moderator revealed a significant effect on the observed effect size of the impairment in BM processing, suggesting that the magnitude of this impairment depends on the level of processing involved in the specific type of task. Individuals with ASD showed the largest deficit as compared to TD participants when the experimental task required the instrumental recognition of the BM (d = 0.98). Instead, the deficit was moderate (d = 0.55) for tasks requiring the first-order processing of BM and only small-to-moderate (d = 0.40) when tasks required the direct recognition of BM. It is worth pointing out two observations regarding the first- order level of processing: (i) first-order BM usually includes samples of younger children, which performed preferential looking tasks; (ii) the deficit for first-order BM is moderate, with a single study119 reporting a negative effect size. Such study tested preferential looking in children at the age of 6, rather than in younger children or infants. Post-hoc tests revealed that the effect sizes for direct and instrumental processing were significantly different. The differences between the ASD and TD groups across all levels of BM processing were not ascribable to differences in cognitive level or mean age.

The study of BM anomalies in ASD has so far been accumulating a conspicuous body of evidence, but often without putting the necessary effort in trying to understand the different sub-components required to efficiently process the distinct types of BM tasks. This meta-analysis shows that the most severe deficit in participants with ASD emerges when the recognition and categorization of a BM stimulus is serving a secondary purpose, such as when inferring someone else’s intentions, actions or emotional states; in other words, when correctly perceiving BM is a pre-requisite to disclose distinct (and possibly more complex) information. On the contrary, the least severe deficit is observable in participants with ASD when a more basic recognition of BM is tested, the level that we referred to as direct recognition of BM (e.g., BM vs. non-BM/scrambled stimulus; BM stimulus “A” vs. BM stimulus “B”, leftward vs. rightward moving stimulus, etc.). This less severe deficit could be due, at least in part, to compensatory mechanisms that can help participants with ASD to correctly process BM, when the task is not excessively complex. Interestingly, a recent study in which morphed PLD stimuli were created by BM prototypical action, found that individuals with ASD showed intact abilities to recognize actions, but weakened adaptation to BM100. This difference in the implicit processing of BM despite an intact explicit BM recognition suggests that individuals with ASD could develop compensatory mechanisms to perceive BM stimuli. Such compensation becomes more evident with increasing age. Accordingly, Rutherford and Troje120 have shown that, despite no group differences in sensitivity to BM or in the ability to identify the direction of motion were found between ASD and TD groups, only in the ASD group the IQ scores were significantly predictive of their performance. One possibility is that when participants with ASD are processing BM they use different strategies to compensate for their perceptual anomalies.

Overall, these results stress the need for a careful evaluation of what is at the core of BM processing and what are instead additional, secondary and higher-level information that can be extracted from a PLD depicting a stimulus moving in a biologically-plausible fashion. Persevering with such an ambiguity would undermine the study of this construct as a putative endophenotype for ASD with negative clinical repercussions. For this reason, to assess how different low-level features can affect BM perception in ASD is pivotal.

What are the low-level perceptual features linked to biological motion anomalies in ASD

After assessing the role of different levels of BM processing in influencing the strength of the BM processing deficit in individuals with ASD, we assessed another fundamental aspect of BM that is the role of low-level perceptual features [3 = Low-level features]. Indeed, in many studies, BM stimuli are compared to scrambled stimuli obtained by varying the spatial and/or temporal properties of the PLD. Experiments using a spatial scramble use comparison stimuli which maintain the original temporal phase, while all or part of the dots constituting the target PLD change spatial location. Experiments using a temporal scramble, instead, use comparison stimuli which maintain the original relative spatial position, while all or part of the dots constituting the target PLD change temporal phase. By testing the role of the type of scrambled stimuli used as a comparison, our aim was to inquire whether the deficits observable in individuals with ASD are dependent on the specific spatial or temporal properties of the target stimulus that are experimentally manipulated. We found that such a moderator had a significant influence in the effect size of the impaired BM processing in ASD. Studies employing a spatial scramble as a comparison stimulus revealed no significant difference in the performance between individuals with ASD and TD controls. On the contrary, a significant impaired performance was evident for studies employing a temporal scramble as a comparison stimulus, with a moderate-to-large effect size (d = 0.67). Although the sample size for each of this meta-analysis was limited (n = 7 spatial; n = 6 temporal), they furnish an initial interesting insight on the deficit underlying the perception of BM in ASD.

It is increasingly evident that simple spatio-temporal visual dynamics are essential building blocks for social behavior. A recent study in juvenile zebrafish showed that collective behavior, like shoaling, can be driven by simple specific motion cues that are constituted by black dots that mimic the precise zebrafish motion, while other naturalistic cues such as fish-like shape, pigmentation pattern, or non-visual sensory modalities are not required121. Anomalies in the processing stages connected to the tracking of low-level motion dynamics might be disruptive for detection, recognition and categorization of BM stimuli. Our results suggest that a candidate core mechanism playing a role in BM processing anomalies in ASD is linked to the temporal properties of moving dots embedded in a PLD. This is consistent with several experimental pieces of evidence and theoretical proposals suggesting that the temporal binding of unimodal and multisensory information is one of the core anomaly of individuals with ASD122,123,124,125. Additional evidence supports the idea that the aberration of simple spatio-temporal visual dynamics would be a common endophenotype for neurodevelopmental disabilities other than ASD124,125. Specifically, some evidence indicates that individuals with ASD show an extended temporal binding window (TBW; an epoch of time within which stimuli from different sensory modalities are highly likely to be bound), which would impair the temporal processing of sensory stimuli125. The majority of this evidence comes from audio-visual multisensory paradigms (for reviews see123,124,125), where researchers assess the temporal precision required to perceive auditory and visual stimuli as synchronous. Nonetheless, initial evidence for temporal processing anomalies in unisensory perception has been documented in individuals with ASD in both auditory48,126 and visual127,128 perception, suggesting a common underlying deficit in the fundamental coordination of brain networks that are responsible for the timing of sensory information processing129. It is also worth noting that the extension of TBWs has a high degree of malleability and they can shrink or expand as a function of the complexity and the low-level statistics of the incoming sensory information. TBWs change as a function of spatial and temporal relationship among stimuli. For example, when information from different sensory modalities originates from the same spatial position130 or can be temporally-predictable (e.g., rhythmic)131 is integrated across a larger temporal window (see also132). TBWs can shrink or expand also as a result of synchronization of brain rhythms (i.e. entrainment)133,134, or after few days of training in perceptual learning135,136,137. Intriguingly, the hypothesis of a temporal processing anomaly as one of the key aspects responsible for BM deficits observed in ASD would be consistent with the involvement of the cerebellum and its bidirectional connectivity with the right posterior superior temporal sulcus (STS), which represents an important part of the network involved in the processing of BM138,139,140,141,142. For more details, please refer to the subsection on the neural and oscillatory correlates of BM processing below.

Is the impairment observed in ASD specific to biological motion stimuli

We conducted a meta-analysis of the subgroup of studies that tested performance in tasks with BM vs. non-BM stimuli [4 = Non-BM]. The aim of this analysis was to assess whether the deficit shown by participants with ASD was selective for BM stimuli or generalized also to non-BM stimuli. We observed no evidence of impairment in the processing of non-BM stimuli in individuals with ASD. This seems prima facie suggest that suggests that the deficit is selective for BM stimuli and in particular, to their specific spatio-temporal pattern, whereas other types of complex stimuli in motion do not elicit a similar impairment. Among the non-BM tasks tested, we can find recognition/categorization of an object as a bike or a truck presented in PLD109,112 but also recognition/categorization of a scrambled version of the BM stimulus39,113,114. It is clear however that testing the scrambled version of BM stimuli is not ideal to claim that the deficit observed in individuals with ASD is specific to stimuli moving in a biological fashion. Consistently, there is evidence showing that only when neurotypical adults cannot extract a form from a non-BM stimuli, they find non-BM stimuli harder to discriminate than BM stimuli. In other words, the presence of a structure that can be extracted from non-BM stimuli seems to be a crucial aspect when comparing the recognition/categorization of BM and non-BM stimuli143. This would mean that BM is not generally easier to detect than equally-structured non-BM. Therefore, to claim that individuals with ASD show a deficit which is specific to BM, the processing of non-BM stimuli should involve a similar extraction and interpretation of a form-from-motion. In the present meta-analysis there were only two studies contrasting BM stimuli with structured non-BM stimuli109,112, and no clear evidence of impairment in participants with ASD was found, independently of the nature of the moving stimuli (BM vs. non-BM) employed.

We conclude that from the evidence available at the time of the present meta-analysis, it is still an open question whether anomalies observed in individuals with ASD are specific to BM. Thus, although interesting, the comparison between the processing of BM and non-BM stimuli needs a more careful investigation in future studies.

Neural and oscillatory correlates of BM processing in individuals with ASD

Support to the idea that BM is hardly conceivable as a monolithic process comes from the available literature on the neural correlates of BM processing in primates, neurotypical humans, and in individuals with ASD. Such literature highlights the brain networks responsible for BM processing are highly complex and probably not unitary. Neuroimaging studies in the typical population show that the activity of the posterior STS (pSTS) is specifically associated to BM stimuli presented as PLD. The pSTS involvement is often lateralized in the right hemisphere144,145,146,147. While this region is activated for the observation of a wide range of human movements148,149, it has not been revealed to be involved in studies examining the perception of static bodies or body parts, suggesting that pSTS is involved specifically in extracting features of human body motion rather than forms per se147. Congruently, studies in non-human primates have shown the presence of neurons that selectively respond to various biological actions (e.g. walking, turning of the head, bending of the torso, moving of the arms, etc150.). However, these neurons receive convergent information from the dorsal visual stream areas, middle temporal (MT) and medial superior temporal (MST), and also from the ventral stream area in the inferior temporal (IT) regions147. The homologue of pSTS in the primate brain is also connected with the inferior parietal cortex151. A recent meta-analytic investigation of human brain regions activated in different types of BM processing (i.e. whole body, hand and face movements) confirmed common convergent activations in the right pSTS region, independent of the specific motion category. However, the pattern of activations included also bilateral regions at the junction between middle temporal and lateral occipital gyri147.

Other evidence in favor of a highly complex network of brain regions supporting the processing of BM that goes well beyond pSTS comes from functional connectivity studies. For example, a study by Sokolov and colleagues139 that used fMRI to test healthy participants during a BM task, in combination with functional connectivity analysis and dynamic causal modeling, showed that the left lateral cerebellum (lobules Crus I and VIIB) exhibited increased activation and connectivity with right pSTS during the task.

More recently, Sokolov and colleagues142 claimed that BM processing is organized in a parallel, rather than in a hierarchical manner. Benefiting from innovative approaches that combine structural and effective connectivity, the authors confirmed the integrative role of the right STS, but they also observed that this area is not the single main gate that integrates the activity of the occipito-temporal and frontal regions. Also, other regions such as the fusiform gyrus and middle temporal cortex were found to be directly connected to right inferior frontal gyrus and insula during processing of BM stimuli. Moreover, both STS and insula have shown significant BM-specific effective connectivity with the cerebellum (cerebellar lobule Crus I). According to their conclusions, this parallel processing, dependent on the activity of different cortico-cortical and cortico-cerebellar pathways, may help to explain why BM processing is rather resilient to focal brain damage, whereas it is frequently impaired in neuropsychiatric conditions with distributed network alterations (e.g., ASD).

In addition, it has been suggested that the cerebellum uses a common computational algorithm not only upon motor but also non-motor functions, such as perceptual ones45,152. The idea that the cerebellum may play a critical role as internal “timing device” for motor and non-motor functions has been widely explored in the literature76. Notably, this idea may fit with the evidence indicating that the cerebellum is implicated in the pathophysiology of ASD14,15 and may also fit with our present results showing a specific role for temporal scrambled stimuli in explaining the anomalies in BM processing observed in individuals with ASD.

Although it is not within the scope of the present article to provide an extensive review of the neuroimaging literature that investigates the neural correlates of impaired BM processing in ASD, it is interesting to notice that consistent reports in the literature show that during the processing of BM stimuli individuals with ASD show hypoactivation in the pSTS153,154,155,156 and in regions of the prefrontal cortex98,156. However, only few studies so far have tested the effective connectivity among the key regions of the cortical and subcortical (i.e. cerebellum) network responsible of the processing of BM141,157,158. These studies suggest that a defective temporo-parietal158 and cerebellar-pSTS connectivity141 could contribute to the anomalies in the processing of BM reported in ASD and reviewed in the present meta-analysis. Although these studies provide initial important evidence, the results of the present meta-analysis suggest that distinct levels of BM processing can be differentially affected in participants with ASD, and that the manipulation of low-level perceptual features may result in markedly distinctive behavioral outcomes. These aspects should be taken into account in future neuroimaging studies that will address the neural correlates of the anomalies in the processing of BM in ASD. More generally, even from a neural perspective, BM should not be considered as a monolithic process, but as constituted by different components, that are likely supported by different parallel neural routes.

Limitations and future directions

Although not conclusive, we believe that the model and the cumulative findings reported in the present meta-analysis move the conceptualization of BM anomalies in ASD a step forward. To improve the explanatory power of this and other models proposed in the field of BM, we call for future studies to provide appropriate quantitative data to be summarized with metanalytic approaches. Specifically, we deem relevant to investigate whether the potential anomalies in BM processing in ASD are extended to non-BM stimuli, and whether they are selectively tied to temporal binding anomalies. To this end, testing multiple BM and non-BM stimuli (e.g., BM, non-BM, spatial scramble, temporal scramble) within the same ASD sample would be of paramount importance (please refer to meta-analysis 3 = Low-level features and 4 = Non-BM).

Although we would find extremely interesting to produce evidence on the specificity of the ASD impairment in BM vs. other motion tasks (e.g., coherent motion or form-from motion tasks), only two of the studies included in our meta-analysis used coherent motion as a control task98,159. Results indicated that in the ASD group the performance to BM and coherent motion tasks are positively correlated. Future studies on contrasting the performance to BM, coherent motion and form-from motion tasks will aid the clarification of whether the deficit in BM processing showed by ASD individuals is specific to BM or instead is common to other complex motion tasks. In accordance with the idea of general perceptual anomalies in ASD that are not specific to BM, a recent study160 has shown that infants that are less attracted to multisensory BM stimuli were diagnosed with ASD at 2 years of age. Future work in this direction will clarify how anomalies in BM processing in ASD, if any, should be appropriately conceived for clinical aims.

Conclusions

Taken together, this work represents a new assessment of the putative impairment in the processing of BM in individuals with ASD, broken down by specific sub-processes and sub-components (see Table 1). Despite some of our analyses are probably influenced by the limited sample of studies available and by the high heterogeneity of participants’ characteristics, we made an effort toward interpreting the multifaceted aspects of the vast literature that uses BM as a proxy for social perception deficits in ASD. While on the one hand a significant impairment in BM processing emerged in individuals with ASD, on the other hand the idea that this deficit is selective for BM and does not extend to non-BM stimuli is, at present, not properly supported by the literature. Moreover, we observed a high heterogeneity due to the different experimental protocols employed. A major source of heterogeneity can be individuated in the distinct levels (first-order, direct, instrumental) of BM processing, given that the most severe deficit in participants with ASD emerges when processing of a BM stimulus is serving a secondary purpose (e.g. inferring other’s intentionality, action or emotional state). How more severe difficulties reported in the instrumental recognition of BM may represent just a byproduct of more basic anomalies should be clarified in future studies. Finally, an initial result points toward the importance of low-level temporal perceptual features in determining the deficit in the perception of BM in ASD.