Disentangling predictive processing in the brain: a meta-analytic study in favour of a predictive network

Ficco, Linda; Mancuso, Lorenzo; Manuello, Jordi; Teneggi, Alessia; Liloia, Donato; Duca, Sergio; Costa, Tommaso; Kovacs, Gyula Zoltán; Cauda, Franco

doi:10.1038/s41598-021-95603-5

Download PDF

Article
Open access
Published: 10 August 2021

Disentangling predictive processing in the brain: a meta-analytic study in favour of a predictive network

Linda Ficco^1,2,4,
Lorenzo Mancuso^1,2,
Jordi Manuello^1,2,
Alessia Teneggi^1,2,
Donato Liloia^1,2,
Sergio Duca²,
Tommaso Costa^1,2,
Gyula Zoltán Kovacs³ &
…
Franco Cauda^1,2

Scientific Reports volume 11, Article number: 16258 (2021) Cite this article

13k Accesses
20 Citations
23 Altmetric
Metrics details

Subjects

Abstract

According to the predictive coding (PC) theory, the brain is constantly engaged in predicting its upcoming states and refining these predictions through error signals. Despite extensive research investigating the neural bases of this theory, to date no previous study has systematically attempted to define the neural mechanisms of predictive coding across studies and sensory channels, focussing on functional connectivity. In this study, we employ a coordinate-based meta-analytical approach to address this issue. We first use the Activation Likelihood Estimation (ALE) algorithm to detect spatial convergence across studies, related to prediction error and encoding. Overall, our ALE results suggest the ultimate role of the left inferior frontal gyrus and left insula in both processes. Moreover, we employ a meta-analytic connectivity method (Seed-Voxel Correlations Consensus). This technique reveals a large, bilateral predictive network, which resembles large-scale networks involved in task-driven attention and execution. In sum, we find that: (i) predictive processing seems to occur more in certain brain regions than others, when considering different sensory modalities at a time; (ii) there is no evidence, at the network level, for a distinction between error and prediction processing.

Meta-analysis of human prediction error for incentives, perception, cognition, and action

Article 11 January 2022

Forms of prediction in the nervous system

Article 10 March 2020

Movement errors during skilled motor performance engage distinct prediction error mechanisms

Article Open access 11 December 2020

Introduction

According to the theory of predictive coding (PC)^1,2,3,4,5, our brain constantly attempts to model the probability of its own future states, with the goal of minimizing uncertainty⁴. More specifically, the brain is considered a hierarchically organized system where, at each level of processing, higher layers try to predict the latent causes of the sensory input coming from lower layers^6,7. Thus, neurons at higher levels encode predictions about the upcoming signal, which is continuously compared with the effective signal received from lower levels. Through this comparison, the brain either reinforces existing predictions or it updates them, if these do not match the incoming signal⁸. When predictions are violated, a prediction error signal^5,9,10 is fed back to the neurons encoding predictions. These recursive loops of predictions and error signals ultimately allow the individual to maintain up-to-date representations about its own internal states¹¹ and the surrounding external stimuli. Over the past two decades, PC theory has received extensive support from a vast range of theoretical and experimental studies, both in relation to primary sensory processes^5,12,13,14 and higher level cognitive processes^15,16, such as decision making and naturalistic speech comprehension^14,17,18. Moreover, evidence has been obtained with a variety of methods, mostly with functional magnetic resonance imaging (fMRI), but also electroencephalography^19,20,21, computational simulations²², transcranial magnetic stimulation²³, and physiological recordings of single neurons (for a review, see²⁴).

Since 1999, when Rao and Ballard published their seminal simulation work on predictive coding in the visual cortex⁵, there has been a proliferation of attempts to implement PC in the human brain. Initially, it was argued that predictive processing occurs at the cellular level²⁵, where the activity of neural populations is modulated by higher-order predictions and units signalling precision of those predictions. According to Bastos and colleagues²⁶, PC is a typical property of the human cerebral neocortex because its structure suits a hierarchical signal exchange between cortical layers. In particular, error signals seem to be computed in the granular layers (especially layer IV), while predictions would be encoded in layers II and III²⁶. These mechanisms have been identified in a large set of brain areas, including the primary sensory and motor cortices, motor association cortices, dorsal and ventral prefrontal cortices, parietal cortex, anterior cingulate cortex, insula, hippocampus, amygdala, basal ganglia, thalamus, hypothalamus, cerebellum and the superior colliculus^27,28. However, in all these regions, neuronal units producing error signals and those encoding predictions of future states are functionally separated²⁹. This separation has been found empirically through computational modelling, where neurons encoding predictions were found to be located in cortical layers II/III and prediction error neurons in layer IV²². Based on these considerations and a previous study³⁰, in this work we consider the functions of prediction encoding and violation in two separate conditions.

Current views conceive brain functions as a product of the co-activity of distributed brain networks^31,32. This shift has also influenced implementations of PC in the brain. Specifically, while earlier formulations ground PC mechanisms in different layers of the human cortex, more recent models of PC attribute functions of error computation and prediction encoding to discrete brain regions and their long-range interconnections^33,34,35. However, such models describe PC-related networks in isolated domains, such as face processing³³. Thus, the question of whether the same network structure exists for the encoding of predictions and transmission of error signal across sensory modalities, domains and experimental paradigms remains open. To our knowledge, no previous study has addressed the existence of a predictive network with meta-analytic functional connectivity methods. In addition, compared to previous studies (e.g.³⁰), we aim to include a wider variety of experimental paradigms and sensory modalities. First, we performed a coordinate-based meta-analysis, then we calculated the functional connectivity of the regions which were activated in the original studies. We formulated the following hypotheses:

(a)
At least some of the regions involved in predictive activity might be functionally connected, revealing a spatially defined network. Moreover, given the dense interchange of prediction and error signals in the human cortex^26,29, and the heterogeneous nature of our datasets, our network might involve mostly higher order regions.
(b)
As regards the brain areas generally involved in predictive processing, we might only partially replicate the results from a recent meta-analysis³⁰, principally due to the diversity of selection criteria. While Siman-Tov and colleagues³⁰ include studies pertaining to three specific domains, we include a wider range of effects and sensory modalities.
(c)
As for the areas involved in prediction error computation, a recent meta-analysis⁹ highlighted the role of striatum, insula, thalamus and fronto-medial structures, while others³⁶ reported other regions (the bilateral ventral striatum, the thalamus, the left frontal operculum, the left caudate and the left IFG). We therefore aimed to verify whether, with different selection and categorization criteria, these results on prediction error computation could be replicated.

Results

Selection of studies

Following the criteria (a–f) described in “Selection of studies”, 106 articles were collected (see Fig. 1). Data from these articles were classified in a table specifying the study identification code, year of publication, first author’s last name, title, scientific journal, number of experimental subjects, experimental task, sensory modality investigated, experimental contrast, type of stimuli. A further selection based on criteria (g) and (h) led to 70 articles. All the peak coordinates listed for the experimental contrast, which were classified as Prediction Encoding or Prediction Violation, were reported in a separate table. The classification of each reported contrast in the two conditions can be found in the Supplementary Tables S1 and S2. When necessary, we converted the peak coordinates to the Montreal Neurological Institute (MNI) space, using the icbm_spm2tal transform on GingerALE^37,38 (http://www.brainmap.org/icbm2tal/).

Activation likelihood estimation

A detailed explanation of how each contrast was classified as reflecting the effect of prediction encoding and violation is reported in “Selection of studies”. As a first step, we performed ALE meta-analyses singularly on the two main conditions: Prediction Violation (45 experiments, 511 foci and 939 subjects) and Prediction Encoding (39 experiments, 444 foci, 750 subjects). Afterwards, we run an ALE meta-analysis on a unified dataset, derived by pooling the coordinates relative to the two conditions (70 experiments, 930 foci of activation and 1419 participants). We refer to these analyses as General Prediction. Figure 2 shows the results of the ALE analyses at FWE, p < 0.05. Further details of the ALE results are reported in Table 1.

Table 1 Activation likelihood estimation (ALE) results.

Full size table

Prediction violation

Two significant clusters were related to the violation of predictions (Fig. 2, red color). The larger cluster included the left inferior frontal gyrus, while a smaller cluster was found over the left anterior insular cortex, partially overlapping with the claustrum.

Prediction encoding

No significant cluster was found for Prediction Encoding at the typically applied, conservative threshold of FWE, p < 0.05. Lowering the threshold to FDR, p < 0.01 still did not produce any significant convergence. However, at an exploratory level, we report the results obtained at a more liberal threshold (Uncorrected, p < 0.0005). At this threshold, fourteen clusters emerged. These included the right superior and left inferior parietal lobules, the right superior, right middle and left inferior frontal gyri, the bilateral fusiform gyri and the right amygdala, and a few clusters with a size inferior to 200 mm³ (including the left amygdala, the left precuneus and the right cuneus, the right insula and the right superior temporal gyrus).

General prediction

Overall, the ALE analysis of the whole dataset returned a set of cortical regions in the frontal and parietal lobes (Fig. 2, green). These include the left inferior frontal gyrus, the insulae bilaterally, the right superior frontal gyrus, the bilateral inferior parietal lobules, and the left precuneus.

Seed-voxel correlations consensus

This technique highlights the regions showing correlated activity with those that were active during the tasks tapping into predictive processing (see “Seed-voxel correlations Consensus”). Overall, the results from all the three conditions are remarkably similar, thus we focus on the results of the General Prediction condition (${SIM}_{General/Encoding}=0.78$; ${SIM}_{General/Violation}=0.93$; ${SIM}_{Encoding/Violation}=0.68$). Peaks are located in the left inferior frontal gyrus, the superior temporal gyrus bilaterally, the left thalamus, the left hippocampus and the left cerebellum. Significant voxels are shown in warm colors. The network emerging from negative correlations (cold colors) includes the right cerebellum (uvula), the left precentral gyrus and the post-central gyri bilaterally, and the right middle occipital gyrus (Fig. 3; see Table S2 for more details). Finally, the network relative to Encoding is substantially overlapping with that of the other two conditions, although the map of positive values appeared to be less extended. The major regions of differential connectivity between this and the map of Prediction Encoding include the left insula, the left middle frontal gyrus, the left anterior cingulate gyrus and the inferior frontal gyrus bilaterally (Fig. S3).

Although the SVC Consensus maps indicate regions that are significantly connected to the activation foci reported in the literature, the relation between these foci and such maps needs to be clarified. In fact, on one hand it is possible that only a few foci are responsible for the connectivity maps. On the other hand, these maps might show areas that are not reported in the literature (thus are not primarily considered to be involved in predictive processing), but are systematically connected to the predictive regions, possibly providing input or output to them. To investigate the relation between the foci and their connectivity, we overlapped the SVC Consensus of the General Prediction condition to the corresponding unthresholded ALE map. Here, the unthresholded map can be seen as an indicator of all the activated regions in the literature. We found out that there is a substantial overlap between the two maps (Fig. 4), suggesting that the activated areas tend to be interconnected and to form a coherent functional network ($SIM = 0.56$). Lastly, to exclude the possibility of bias due to local connectivity, the SVC Consensus analysis was repeated excluding proximal connections between close areas from the SVC maps. The resulting maps were extremely similar to those obtained with the original SVC Consensus maps, suggesting that activation foci are connected not only to the spatially closer areas, but also to the more distal ones (Fig. S1).

Fail-safe technique

To test the impact of potential selection bias, we performed a fail-safe analysis³⁹ (“Fail-safe technique”). In the General Prediction condition, the analysis shows that at least one of the clusters remains significant up to the introduction of 250% random data, suggesting their robustness against selection bias (Fig. 5). The analysis of the Prediction Violation dataset suggests an even greater robustness (the clusters remain significant up to the inclusion of 425% random data). In general, both fail-safe tests suggest the robustness to bias of the two clusters that are in common for the two conditions, i.e. the left IFG/precentral gyrus and the left insula/claustrum.

Leave-N-out

This analysis tests the heterogeneity of a dataset, or whether all the studies in a dataset contribute to the results similarly (Section 4.5). It was performed on the condition of Prediction Encoding due to the lack of convergence. Figure 6 shows the results from the leave-N-out analysis. The y axis indicates the number of papers, while the x axis the energy (1-quadratic error/total N experiments). The diagrams show the distribution of energy obtained by removing 3, 5, 7, 9 and 11 articles at each run separately. When removing less than 7 random articles at a time, no important changes are visible in the distribution. Since 7/39 (removed articles/total) equals to 18% of the included experiments, this suggests that the condition of Prediction Encoding is mostly homogeneous. Thus, it is unlikely that the absence of convergence is due to the heterogeneity in the experiments. Instead, it may be due to the spatial distributedness of the activation coordinates per se.

Discussion

In this study, we partially replicate convergence results from previous meta-analyses. In addition, we provide evidence for the existence of a network involved in PC across sensory modalities and a variety of tasks. This spatially extended and bilateral network overlaps with known large-scale networks supporting attention and task execution. Finally, although the separation between error, weighting and encoding units is supported by our ALE results and previous work, our findings suggest that the regions that are engaged during prediction violation and encoding tend to be functionally connected with the same network.

The ALE results show convergence across tasks targeting predictive processing in a set of cortical regions, in both the Violation and the General condition. However, we did not find convergence in the Encoding condition, even at more liberal thresholds. As suggested by the results of the leave-N-out analysis, this spatial heterogeneity does not seem to be due to the disproportionate contribution of few studies, but rather to the large variability in the localization of foci. A qualitative inspection further indicates that this spatial distributedness is not due to a larger heterogeneity in the selected tasks either. Rather, it likely reflects a wider range of effects, due to our definition of prediction encoding. In fact, while the Violation condition mainly includes effects of surprise and stimulus randomness, that of Encoding reflects effects of item repetition, habituation/adaptation, belief updating, memory, high probability and, in some cases, an explicit effort to predict upcoming stimuli. However, we cannot exclude that the spatial heterogeneity is an intrinsic property of the process itself. For instance, it was found that the effect of belief updating, one of the effects we included in the Encoding dataset, did not replicate between different methods⁴⁰. If other effects suffered from the same issue, this could perhaps explain their wide spatial distributedness. Another potential reason for the lack of convergence is that one of the typical effects we included is that of repetition suppression, which is defined as a decrease in brain activity in task-related areas⁴¹. Therefore, this effect is probably at least partially more task-dependent in nature, thus leading to reduced convergence. It is challenging to compare this result to that by Siman-Tov et al., since only results from a subtraction between violation and formation of prediction are reported³⁰. Nevertheless, we compared a meta-analysis on visual repetition suppression⁴² with the results of a subgroup analysis on a subset of studies employing visual stimuli from the Encoding condition. While Kim et al. report a network including visual cortices, frontal and parietal regions and the caudate, we again obtained no significant convergence, at the same threshold. This further strengthens the idea that the effects included in the Prediction Encoding condition show strong task-dependency. Although we preferred not to compare such effects systematically due to low power, future meta-analyses may quantitatively compare different aspects of prediction encoding.

One expected site of convergence that we have not found is the cerebellum, especially when analysing the violations of predictions. In fact, this structure has been reported in previous works as an important hub processing the comparison between an internal model and the current sensory input, and as a region that supports procedural and perceptual learning mechanisms^43,44,45. However, there are two main reasons why few neuroimaging meta-analyses are able to detect convergence in the cerebellum⁴⁶. First, there could be technical difficulties associated with the detection of BOLD signal from the cerebellum, principally dependent on experiments targeting climbing fibres, which are poorly coupled to this signal⁴⁶. Second, some experimental paradigms tend to promote rapid habituation in this area, which in turn produces lower neural responses⁴⁶.

Overall, our analyses of the Prediction Violation condition confirm previous findings^9,30,36 regarding the insula and the IFG, while the involvement of others, such as the striatum and the thalamus, are not replicated. Both the insula and the IFG have been related to the violation of predictions in previous works. In particular, the anterior insulae have been related to the processing of bodily sensations and awareness of subjective feelings⁴⁷. Moreover, insular regions compute prediction error signals, especially in the interoceptive modality^35,48,49. Considering that only a small number of included studies explicitly targeted bodily sensations, this finding deserves special attention. A tentative interpretation is that, regardless of the nature of the specific expectations that are violated in each task, these tend to produce an error broadly related to the self⁵⁰. The insular cluster also extends to include the left claustrum, which produces prediction error signals in Pavlovian classical conditioning paradigms⁵¹. Notably, these involve a component of automatic learning, which probably most of the tasks we included contain, to some degree. The second cluster was located in the IFG, which is involved in risk aversion⁵², and in detecting a mismatch between expectations and decisions⁵³. Recent studies indicate that this region plays a role in the violation of expectations. For instance, a correlation was found between both IFG and insula activity with a prediction error model during bi-stable perception, which is a paradigm inducing strong violations of visual expectations⁵⁴. Moreover, ERP components in both which are associated with surprise were larger than those related to belief updating the right IFG and the bilateral insulae⁵⁵. Lastly, intrinsic connectivity between the IFG and the insula was predicted by the degree of intolerance of uncertainty, which further indicates their sensitivity to error signals⁵⁶. Together, these findings and our results suggest that the insula and the IFG, and their connectivity during prediction violation across modalities, are a worthy avenue for further research.

The General Prediction condition was designed to tap into the general effects of predictive processing, that arise from the mere fact of performing a task eliciting predictions or prediction errors. Thus, we expected the brain regions emerging here to be related either to one or the other process, or to both. First, convergence was found in two larger clusters, one in the inferior frontal gyrus/precentral gyrus, and the other bilaterally in the insula. Strikingly, both regions also emerged in the meta-analysis of Siman-Tov et al., who similarly pooled together the effects of prediction encoding and violation³⁰. This strengthens the plausibility of this result in other domains than those of music, action and language perception. Further supporting the double role of both the IFG and the insula in PC, activity in these areas represents the building of an expectation, analysing the conjunction across somatosensory, visual and auditory stimulus modality⁵⁷. However, evidence about the IFG is somewhat more mixed. In fact, its activity does not always seem to depend on the predictability of a situation⁵⁸. Moreover, whereas some authors describe it as an area involved in the processing of “expectancy input”⁵⁹, others report increased activity in the IFG when the stimulus probability is low, leading to larger prediction error signal⁶⁰. As regards the anterior insulae, notably these are an important hub of the salience network⁶¹. It is plausible that this hub is more activated in surprising situations driving attention⁶², which also involve an increased gain in error signal computation (the relationship between predictive and attentional processes is extensively discussed in the following paragraphs). Lastly, the role of precuneus in the General Prediction condition is more difficult to relate to the existing literature. In general, it is involved in self-related cognition, episodic memory and mental imagery⁶³. Interestingly, this region was responsive to deviant stimuli even during sleep⁶⁴, which may indicate a selective sensitivity to prediction error during different states of consciousness. However, belief updating modulates activity in this region as well⁴⁰. Overall, these studies support our findings, and suggest that the IFG and the insula might be involved in both the encoding and the violation of predictions. More evidence on prediction violation than encoding exists in both cases, and the sensitivity of the IFG (and the less discussed precuneus) to stimulus probability may more strongly depend on specific task characteristics.

The SVC Consensus analysis was conducted to highlight the brain regions that tend to provide input or output to those involved in prediction violation and encoding. Since the resulting network is largely similar between conditions, we focus on the General Prediction condition.

An important aspect is that the maps relative to the Prediction Violation and Prediction Encoding conditions are highly similar ($SIM=0.68$, Fig. S2). This means that the regions involved in prediction cross-modally tend to exchange information with the same, broad set of areas during task execution. This finding is surprising for several reasons. First, as previously discussed, the Prediction Encoding condition reflects a greater diversity of effects than that of Violation. Second, a study examining functional connectivity during temporal and spatial predictions found that prediction violation and fulfilment modulate connectivity in distinct networks⁶⁵. Lastly, this finding seems to contradict the fact that prediction violation and encoding are functionally separated^6,26,29. Our ALE results further support this functional separation. Notably, all the regions differentially connected during the violation of predictions (Prediction Violation > Prediction Encoding) have been reported in a meta-analysis on prediction error during reinforcement learning⁵¹, an experimental paradigm designed to provoke a strong error signal. Still, despite minor differences between these two maps (Figs. S2 and S3), our results strongly suggest that the same network supports both functions.

A key feature of the network that we obtained in all conditions is its remarkable similarity to the so-called task-positive network (TPN⁶⁶). The TPN is a set of areas involved in task execution, and is usually divided into three large-scale brain networks related to salience processing⁶¹ and the dorsal and the ventral attentional networks^67,68.

The fact that the regions which are more involved in prediction are also part of attentional networks replicates the findings by Siman-Tov and colleagues³⁰ and is of key theoretical importance. An increasing body of research considers prediction and attention as dissociable but strongly interdependent processes (for empirical evidences, see^69,70,71; for further readings see^72,73,74,75). Attention adjusts the computational weight (precision) of prediction error units via synaptic gain enhancement^7,76,77, leading to increased error signals. While attention enhances the processing of relevant information and regulates the overall cortical responsiveness^78,79,80, prediction allows the brain to take prior information into account⁸¹. Moreover, prediction “anchors” attentional processing, meaning that computing predictions is necessary to subsequent attention orientation⁷¹. Thus, the overlap between our map and the TPN could be interpreted in several ways. First, despite being distinct processes^82,83,84,85, they may share a common neural territory. Considering that this network relies on the original coordinates of brain activations during task performance, it is clearly possible that both attention to the actual stimuli and the prediction of future stimuli were working simultaneously. Second, when multiple modalities are considered together, the activity of prediction and error computation might specifically involve attentional networks more than other brain regions. This has never been observed before because, for obvious practical reasons, only a limited set of modalities are investigated at a time, often in a rather constrained experimental setting. Finally, a third possibility is that the TPN emerged from our analyses merely for the effect of participants’ engagement in any attention-demanding task, and the selected contrasts do not reflect predictive processing at all. It is difficult to rule this possibility out completely, as we did not analyse an arguably non-predictive neutral control condition⁸⁶. However, as we performed a careful selection of neuroimaging contrasts targeting the effects of interest, the overlap with the canonical attentional networks might imply some relationship between the two processes.

Finally, our predictive network appears to be negatively correlated with the default mode network (DMN; for example, see Fig. 3, negative values). Activity patterns of the DMN and those of the TPN are anticorrelated⁸⁷, and possibly involved in different forms of cognition. In particular, the DMN is typically reported to be more activated during rest and mind-wandering⁸⁸. Considering recent work suggesting that this network creates and updates internal predictive models about the self⁸⁹, and that it is engaged when stimuli are temporally predictable⁹⁰, the lack of connectivity within its key hubs is rather unexpected, especially in the Encoding condition (cf.⁴⁰). Moreover, since the DMN is located at the extreme end of a continuum of integration and hetero-modal functioning within the human connectome^91,92, it is even more surprising that it did not result from our functionally heterogeneous dataset. A possible reason for its absence could be that, under the hypothesis that the DMN is responsible for the integration of predictions in prior internal models—acting in a sort of “autopilot mode”^93,94—almost no included experimental paradigm tested this kind of automatic activity. High temporal resolution techniques, computational, and meta-analytic approaches to functional neuroimaging data can be valuable tools to investigate the role of the DMN in predictive processing in the future.

While we have highlighted the theoretical relevance of our results in the previous paragraphs, the finding of a predictive network could also be meaningful in the clinical context. Debilitating clinical conditions seem to stem from alterations in the production of prediction error signals (e.g. schizophrenia, anxiety^95,96,97,98), hyper-rigidity of prior information or inflexible precision of prediction error (e.g. autism^99,100,101). Since psychopathology tends to spread in the brain exploiting existing functional connectivity patterns^{102,103,104,105}, and our network is thought to reflect the connectivity between regions involved in cross-modal prediction, investigating this network in different psychiatric samples should be fruitful.

One potential limitation of the current work is that the selection, classification and coding of the articles was conducted manually by one author only. However, the coded dataset was independently cross-checked by another author. Moreover, a section of notes was included in the database with the aim to make the interpretation and selection processes more transparent, as suggested by recent guidelines¹⁰⁶. Another potential flaw is caused by the heterogeneity of the definition of “prediction” in the literature. In fact, the concepts of prediction, anticipation and expectation are often used interchangeably¹⁰⁷ and how they are operationalized in each study can potentially lead to confusion with other processes^81,108. Lastly, the strong presence of studies employing visual or audio-visual tasks may have also limited the validity of the current results (Table S4). However, the absence of early visual areas in both the ALE and the SVC Consensus results may suggest that the impact of this imbalance is nevertheless limited.