Imaging object-scene relations processing in visible and invisible natural scenes

Faivre, Nathan; Dubois, Julien; Schwartz, Naama; Mudrik, Liad

doi:10.1038/s41598-019-38654-z

Download PDF

Article
Open access
Published: 14 March 2019

Imaging object-scene relations processing in visible and invisible natural scenes

Scientific Reports volume 9, Article number: 4567 (2019) Cite this article

3476 Accesses
20 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Integrating objects with their context is a key step in interpreting complex visual scenes. Here, we used functional Magnetic Resonance Imaging (fMRI) while participants viewed visual scenes depicting a person performing an action with an object that was either congruent or incongruent with the scene. Univariate and multivariate analyses revealed different activity for congruent vs. incongruent scenes in the lateral occipital complex, inferior temporal cortex, parahippocampal cortex, and prefrontal cortex. Importantly, and in contrast to previous studies, these activations could not be explained by task-induced conflict. A secondary goal of this study was to examine whether processing of object-context relations could occur in the absence of awareness. We found no evidence for brain activity differentiating between congruent and incongruent invisible masked scenes, which might reflect a genuine lack of activation, or stem from the limitations of our study. Overall, our results provide novel support for the roles of parahippocampal cortex and frontal areas in conscious processing of object-context relations, which cannot be explained by either low-level differences or task demands. Yet they further suggest that brain activity is decreased by visual masking to the point of becoming undetectable with our fMRI protocol.

Cortical and subcortical signatures of conscious object recognition

Article Open access 18 May 2021

Task-dependent functional organizations of the visual ventral stream

Article Open access 27 June 2019

A functional dissociation of face-, body- and scene-selective brain areas based on their response to moving and static stimuli

Article Open access 03 June 2019

Introduction

A very short glimpse of a visual scene often suffices to identify objects, and understand their relations with one another, as well as with the context in which they appear. The observed co-occurrences of objects and their relations within specific scenes or contexts¹ shape our expectations. When these expectations are violated², object and scene processing are impaired - both with respect to speed^3,4,5 and to accuracy ^6,7,8, suggesting that contextual expectations may have an important role in scene and object processing^1,9,10, though see¹¹.

Yet the mechanisms of contextual facilitation for object/scene comprehension are still unclear. Several studies have tried to track the neural substrates of contextual processing and gist extraction¹⁰, suggesting parallel yet interacting streams for object and scene processing¹², and emphasizing the role of expectations in visual perception¹³. Specifically for processing the relations between objects and scenes, an interplay between frontal and temporal visual areas has been suggested¹(see also^13,14,15), so that after identifying the scene’s gist, high-level contextual expectations about scene-congruent objects are formed. The activated representations of such objects are compared with upcoming visual information about objects’ features, until a match is found and the objects are identified (for electrophysiological support, see^{9,16,17,18,19},though see²⁰).

However, these suggestions are mostly based on studies that did not directly examine the processing of objects embedded in scenes, but rather used other ways to probe contextual processing (e.g., comparing objects that evoke strong vs. weak contextual associations²¹, or manipulating the relations between two isolated objects^22,23). Critically, the few papers that did focus on objects embedded in real life scenes^24,25,26 report conflicting findings about the role of frontotemporal regions - more specifically the prefrontal cortex (PFC) and the medial temporal lobe (MTL). Several areas within the prefrontal cortex have been implicated in semantic processing (e.g., left Inferior PFC²⁷; bilateral Inferior PFC, bilateral Superior Frontal Gyrus, right Middle Frontal Sulcus, Cingulate²²; medial PFC^21,28), including a direct manipulation of object-scene relations (right Middle Frontal Sulcus and Inferior PFC²⁴). However, it has been suggested that these frontal activations may stem from task-induced conflict rather than from the actual processing of object-context relations²⁴.

Likewise, the literature is divided about the involvement of the MTL in such processing. In particular, for the parahippocampal cortex (PHC), some have suggested a functional dissociation²⁹ whereby anterior parts process contextual associations while posterior parts process spatial layouts^24,30,31,32 (though Rémy and colleagues reported semantic effects on both posterior and anterior PHC; see also³³, who reported PHC and hippocampal activations for contextual binding between objects and scenes, and³⁴ that gave an account of PHC activity in terms of statistical learning of object co-occurrences). Others argue that the PHC is solely dedicated to processing spatial layouts (the spatial layout hypothesis^35,36; see also³⁷ and³⁸ that support the spatial account, yet interestingly find effects of both the scene and its constituents on PHC activity), or representations of three-dimensional local spaces, even of a single object³⁹, irrespective of contextual associations⁴⁰.

Other regions participating in scene processing have also been suggested: namely, the retrosplenial cortex (RSC) and Occipital Place Area (OPA) were both implicated in scene perception and in contextual processing²⁹. Yet RSC involvement in scene processing is commonly held to be related to navigation^41,42, without being affected by the objects in the scene³⁸. And OPA activity is considered to reflect more perceptual-level processing of features characteristic of scenes⁴³. Thus, it is not clear whether these areas should also be involved in processing object-scene relations.

As outlined above, the neural mechanisms underlying the processing of object-context relations are still under investigation; and so are the cognitive characteristics of this phenomenon. For instance, whether conscious perception is necessary for the processing of object-context relations is still unresolved. Using behavioral measures, two previous studies suggested that integration of object and scene can occur even when subjects are unaware of both objects and the scenes in which they appear^44,45 (for prioritized access to awareness of interacting vs. non-interacting objects, see⁴⁶). This result is in line with the unconscious binding hypothesis⁴⁷, according to which the brain can associate, group or bind certain features in invisible scenes, especially when these features are dominant (for a discussion of conscious vs. unconscious integration, see⁴⁸). However, two recent attempts to replicate these findings have failed^49,50. This absence of an effect accords with theories that tie integration with consciousness (Global Workspace Theory^51,52; Integrated Information Theory^53,54). While the jury is still out on this question, our study aimed at measuring the brain activity mediating the processing of unconscious object-context relations - if indeed such processing is possible in the absence of awareness.

The goals of the current study were thus twofold; first, we aimed at identifying the neural substrates of the processing of object-context relationships, and specifically at testing whether frontal activations indeed reflect contextual processing rather than task-related conflict²⁴. Second, we looked for evidence of unconscious processing of object-context relationships while carefully controlling visibility, with the goal of identifying the neural substrates of such processing in the absence of awareness. Subjects were thus scanned as they were presented with masked visual scenes depicting a person performing an action with a congruent (e.g., a man drinking from a bottle) or an incongruent (e.g., a man drinking from a flashlight) object. The experiment had two conditions: one in which scenes were clearly visible (visible condition), and one in which they were not (invisible condition) (see Fig. 1). Stimulus visibility was manipulated by changing masking parameters while keeping stimulus energy constant. Participants rated stimulus visibility on each trial; they did not perform any object-context congruency or object-identification judgments, to ensure that the measured brain activations could be attributed to object-context integration per se, and not to task-induced conflict²⁴.

Methods

Participants

Eighteen participants (eight females, mean age = 25.1 years, SD = 4.32 years) from the student population of the California Institute of Technology took part in this study for payment ($50 per hour). All participants were right-handed, with normal or corrected-to-normal vision, and no psychiatric or neurological history. The experiment was approved by the Institutional Review Board committee of the California Institute of Technology, and all methods were performed in accordance with the corresponding guidelines and regulations. Informed consent was obtained after the experimental procedures were explained to the subjects. Two additional participants had too few invisible trials in the invisible condition (less than 70% of trials), and were excluded from the analysis.

Apparatus and stimuli

Stimuli were back-projected onto a screen that was visible to subjects via a mirror attached to the head coil, using a video projector (refresh rate 60 Hz, resolution 1280 × 1024). Stimuli were controlled from a PC computer running Matlab with the Psychophysics Toolbox version 3^55,56,57. Responses were collected with two fiber optic response devices (Current Designs, Philadelphia, USA). Target images were 6.38° by 4.63° (369 × 268 pixels) color pictures of a person performing an action with an object. In the congruent condition, the object was congruent with the performed action (e.g., a woman drinking from a cup), while in the incongruent condition, it was not (e.g., a woman drinking from a plant; see^45,58 for details). In both types of images, the critical object was pasted onto the scene. Low-level differences in saliency, chromaticity, and spatial frequency were controlled for during the creation of the stimulus set⁹, and tested using an objective perceptual model⁵⁹. Visual masks were generated from a different set of scenes, by cutting each scene image into 5 × 6 tiles and then randomly shuffling the tiles.

Procedure

The experiment was run over two separate one-hour scanning sessions on two consecutive days. The first session included a Calibration condition (which was conducted during the anatomical scan), four runs of the Invisible condition, and two post-experiment visibility tests. The second session included four runs of the Visible condition, followed by two runs of an Unmasked condition in which scenes were presented in blocks for longer durations, and with no masks (Fig. 1). Note that the visible and unmasked conditions were conducted in the second session, so the results in the invisible condition would not be biased by previous conscious exposure. Thus, the visible condition was aimed at achieving the first goal of this research, which was to identify the neural activations underlying object-scene relations processing. The unmasked condition was added in case the mere presence of masks in the visible condition hindered such processing. The invisible condition was aimed at achieving the second goal of this research – to look for neural evidence for object-scene relations processing in the absence of awareness.

Calibration condition

The calibration was designed to individually adjust the contrasts of the masks and targets, thus ensuring a comparable depth of suppression across subjects in the invisible condition. 72 images (half congruent, half incongruent; all different from the ones used in the main experiment) were presented, either upright or inverted (pseudo-randomly intermixed, with the constraint that the same image orientation was never presented in four consecutive trials). Subjects indicated for each trial whether the image was upright or inverted, and rated its visibility subjectively using the Perceptual Awareness Scale (PAS⁶⁰), where 1 signifies “I didn’t see anything,” 2 stands for “I had a vague perception of something,” 3 represents “I saw a clear part of the image,” and 4 is “I saw the entire image clearly.” Subjects were instructed to guess the orientation if they did not see the image. Initial mask (Michelson) contrast was 0.85, and initial prime contrast was 0.7. Following correct responses (i.e., correct classification of the image orientation as upright or inverted), mask contrast was increased by 0.05, and following incorrect responses, it was decreased by 0.05 (i.e., 1-up, 1-down staircase procedure⁶¹). If mask contrast reached 1, target contrast was decreased by steps of 0.05, stopping at the minimum allowable contrast of 0.15. In the main experiment, mask contrast was then set to the second highest level reached during the calibration session (i.e., if the highest mask contrast during the calibration reached a value of 0.95, it was set to 0.90). If mask contrast reached 1, it was set to 1, and target contrast was then set to the second lowest level reached (i.e., if it reached a value of 0.4, it was set to 0.45). Mask contrast reached 1 for all subjects. Average target contrast was 0.39 ± 0.05 (here and elsewhere, ± denotes 95% confidence interval). The same contrasts were used in both visible and invisible conditions in the main experiment, so stimulus energy entering the system would be matched.

Main experimental conditions

The invisible and visible conditions were each divided into four runs of 90 trials, of which 72 contained either congruent or incongruent target images, and 18 had no stimuli (i.e., “catch trials”), serving as baseline. Thus, since we had 144 pairs of images, each image repeated two times in each condition. The order of congruent and incongruent trials within a run was optimized using a genetic algorithm⁶² with the constraint that a run could not start with a baseline trial, and that two baseline trials could not occur in succession. Each trial started with a fixation cross presented for 200 ms, followed by three repeats of a sequence of target images and masks (the sequence was repeated three times to allow for better processing of the masked scene and increase its signal⁶³), and then a judgment of image visibility using the PAS. The sequence started with two forward masks (each presented for 50 ms, with a 17 ms blank interval), followed by the target image (33 ms), and two backward masks (50 ms each, 17 ms blank interval). The only difference between the invisible condition (first session) and the visible condition (second session) lied in the duration of the intervals immediately preceding and immediately following the target image: a 17 ms gap was used in the invisible condition, while a 50 to 200 ms gap (randomly selected on each trial from a uniform distribution) was used in the visible condition. To equate the overall energy of a trial across conditions, the final fixation in the sequence lasted between 100 and 400 ms in the invisible condition. A random number of masks (0–4) was presented between repetitions of the sequence in each trial, to minimize predictability of the onset of the target image (Fig. 1). A random inter-trial interval (uniform distribution between 1 and 3 s) was enforced between trials, so that on average a whole trial lasted 4.5 s.

Post-experiment visibility tests

At the end of the first session, following the invisible condition, two objective performance tasks were administered to behaviorally validate that the masking procedure was effective in suppressing the scenes from awareness. First a congruency task performed in the scanner, in which subjects were asked to determine if a scene was congruent or not (2-AFC, 90 trials). Second, an orientation task performed outside the scanner, in which half the images were upright and half were inverted, and subjects were asked to determine their orientation (2-AFC, 72 trials). In both tasks, the same trial structure and the same images as the ones used in the main experiment were used; subjects were instructed to guess if they did not know the answer. Subjects also rated image visibility using the PAS, after each trial.

Unmasked condition

At the end of the second session, following the visible condition, subjects participated in two runs of a block design paradigm to localize brain regions that respond differentially to congruent and incongruent scenes, in the absence of visual masks. This session served as an alternative to the visible condition, in case the short presentation of the stimuli and the presence of masks might evoke too weak responses. Each run consisted of 18 blocks of 12 images, which were either all congruent or all incongruent scenes – the same scenes which were presented in the main experimental conditions. Blocks started with a 5–7 s fixation cross. Then, the 12 images were presented successively for 830 ms each, with a 190 ms blank between images. Subjects had to detect when an image was repeated, which occurred once per block (1-back task). When analyzing the data, we verified that regions responding differently to congruent and incongruent scenes were consistent across the unmasked and the visible conditions.

Behavioral data analysis

All analyses were performed with R (2016), using the BayesFactor⁶⁴, and ggplot2⁶⁵ packages.

MRI data acquisition and preprocessing

All images were acquired using a 3 Tesla whole-body MRI system (Magnetom Tim Trio, Siemens Medical Solutions) with a 32-channel head receive array, at the Caltech Brain Imaging Center. Functional blood oxygen level dependent (BOLD) images were acquired with T2*-weighted gradient-echo echo-planar imaging (EPI) (TR/TE = 2500/30 ms, flip angle = 80°, 3 mm isotropic voxels and 46 slices acquired in an interleaved fashion, covering the whole brain). Anatomical reference images were acquired using a high-resolution T1-weighted sequence (MPRAGE, TR/TE/TI = 1500/2.74/800 ms, flip angle = 10°, 1 mm isotropic voxels).

The functional images were processed using the SPM8 toolbox (Wellcome Department of Cognitive Neurology, London, UK) for Matlab. The first three volumes of each run were discarded to eliminate nonequilibrium effects of magnetization. Preprocessing steps included temporal high-pass filtering (1/128 Hz), rigid-body motion correction and slice-timing correction (middle reference slice). Functional images were co-registered to the subject’s own T1-weighted anatomical image. The T1-weighted anatomical image was segmented into gray matter, white matter and cerebrospinal fluid, and nonlinearly registered to the standard Montreal Neurological Institute space distributed with SPM8 (MNI152). The same spatial normalization parameters were applied to the functional images, followed by spatial smoothing (using a Gaussian kernel with 12 mm full-width at half maximum) for group analysis. Scans with large signal variations (i.e., more than 1.5% difference from the mean global intensity) or scans with more than 0.5 mm/TR scan-to-scan motion were repaired by interpolating the nearest non-repaired scans using the ArtRepair toolbox⁶⁶.

fMRI analysis

Statistical analyses relied on the classical general linear model (GLM) framework. For univariate analyses, models included one regressor for congruent, one regressor for incongruent, and one regressor for baseline trials in each run. Each regressor consisted in delta functions corresponding to trial onset times, convolved with the double gamma canonical hemodynamic response function (HRF). Time and dispersion derivatives were added to account for variability of the HRF across brain areas and subjects. Motion parameters from the rigid-body realignment were added as covariates of no interest (6 regressors). Individual-level analyses investigated the contrast between the congruent vs. the incongruent condition. Group-level statistics were derived by submitting individual contrasts to a one sample two-tailed t-test. We adopted a cluster-level thresholding with p < 0.05 after FWE correction, or uncorrected threshold with p < 0.001 and a minimal cluster extent of 10 voxels. Note that our results did not resist correction for multiple comparisons after threshold free cluster enhancement (TFCE⁶⁷). The unmasked session followed the same preprocessing and analysis steps as the main experimental runs. For multivariate analyses, each run was subdivided into four mini-runs, each containing nine congruent and nine incongruent trials. One regressor was defined for congruent and incongruent trials in each mini-run. No time or dispersion derivative was used for these models, and no spatial smoothing was applied. The individual beta estimates of each mini-run were used for classification. Beta estimates of each voxel within a given region of interest (ROI) were extracted and used to train a linear Support Vector Machine (using the libsvm toolbox for Matlab, http://www.csie.ntu.edu.tw/cjlin/libsvm) to classify mini-runs into those with congruent and incongruent object-context relations. A leave-one-run-out cross-validation scheme was used. Statistical significance of group-level classification performances was assessed using Bayes Factor and permutation-based information prevalence inference to compare global vs. majority null hypotheses⁶⁸. ROIs were defined based on the contrast in the unmasked condition, by centering spheres of 12 mm radius on the peak activity of each cluster of more than five contiguous voxels (voxel-wise threshold p < 0.001, uncorrected). Note that the inter-individual variability did not allow defining ROIs at the individual level.

Results

Behavioral data

In the visible condition, subjects’ visibility ratings indicated that target images were partly or clearly perceived (percentage of total trials: visibility 1: 3.5% ± 3.0%; visibility 2: 18.5% ± 7.5%; visibility 3: 37.8% ± 5.4%; visibility 4: 45.4% ± 9.4%). Only trials with visibility ratings of 3 or 4 were kept for further analyses. By contrast, in the invisible condition, subjective visibility of target images was dramatically reduced due to masking (visibility 1: 56.6% ± 12.2%; visibility 2: 31.2% ± 7.1%; visibility 3: 14.3% ± 7.8%; visibility 4: 0.1% ± 0.04%) (Fig. 2a). Only trials with visibility ratings of 1 or 2 were kept for further analyses (for a similar approach, see⁶⁹). Objective performance for discriminating scene congruency in corresponding visibility ratings in the invisible condition did not significantly differ from chance-level (51.6% ± 2.5%, t(15) = 1.24, p = 0.23, BF = 0.49, indicating that H₀ was two times more likely than H₁) (Fig. 2b). However, objective performance for discriminating upright vs. inverted scenes was slightly above chance-level (56.0% ± 4.4%, t(16) = 2.64, p = 0.02, BF = 3.3, indicating that H₁ was around three times more likely than H₀; regrettably, two subjects did not complete the objective tasks due to technical issues). This suggests some level of partial awareness, in which some participants could discriminate low-level properties of the natural scene such as its vertical orientation, but not its semantic content^70,71. In all following results, unconscious processing is therefore defined with respect to scene congruency, and not to the target images themselves of which subjects might have been partially aware, at least in some trials.

Imaging data

Univariate analyses

The set of brain regions involved in the processing of scene congruency was first identified using the unmasked condition, contrasting activity elicited by congruent vs. incongruent blocks of target images (no masking, see Methods). At the group level, this contrast revealed a set of regions in which activity was weaker for congruent than incongruent images (here and elsewhere, no activations in the other direction – weaker for incongruent images – were found). Significant activations (for cluster-level thresholding with p < 0.05 after FWE correction, or uncorrected threshold with p < 0.001 and a minimal cluster extent of 10 voxels, for exploratory purposes) in the temporal lobe comprised the right inferior temporal and fusiform gyri extending to the posterior parahippocampal gyrus and the left middle temporal gyrus (see Table 1). In the parietal lobe, activations were detected in the bilateral cunei, the right parietal lobule, and the left paracentral lobule. In the frontal lobe, significant clusters were observed in the bilateral inferior frontal gyri, right middle frontal gyrus, left superior medial frontal gyrus, and left cingulate gyrus.

Table 1 Cluster extent (CE), statistical values, and MNI stereotaxic brain atlas coordinates for the brain regions more activated by incongruent vs. congruent target images during the unmasked condition.

Full size table

We then used results from the unmasked condition as an inclusive mask when contrasting activity elicited by congruent vs. incongruent target images during the visible condition in the main experimental runs (see Fig. 3 and Table 2). Significant activations were found bilaterally in the fusiform (including the posterior parahippocampal gyrus), cingulate, middle and inferior frontal gyri, inferior parietal lobules, precunei, insular cortices, and caudate bodies; as well as activations in the left hemisphere, including the left inferior and superior temporal gyri, and left medial frontal gyrus (see Table 2).

Table 2 Cluster extent (CE), statistical values, and MNI stereotaxic brain atlas coordinates for the brain regions more activated by incongruent vs. congruent target images during the main task in the visible condition.

Full size table

When contrasting activity elicited by congruent vs. incongruent target images in the invisible condition, no significant result was found with the same uncorrected threshold of p < 0.001. We verified that the images did elicit a response in visual cortices irrespective of their congruency, by contrasting both congruent and incongruent trials with the baseline condition in which no target image was shown (see fig. SI1). Thus, masking was not so strong as to completely prevent low-level visual processing of the target images.

Multivariate analyses

Although congruent and incongruent scenes did not elicit different magnitudes of activity in the invisible condition, one possibility is that they elicited different patterns of activity⁷². To test this possibility, we investigated whether the activity patterns extracted from spheres centered on the clusters identified in the unmasked condition conveyed additional information regarding scene congruency. In the visible condition, we could decode scene congruency from the spheres located in the right fusiform gyrus, right superior parietal lobule, middle and inferior frontal gyrus, left precuneus, left superior medial and inferior frontal gyrus, and left anterior cingulate cortex. (Fig. 4, Table 3). In the invisible condition, on the other hand, we could not decode congruency above chance in any of the spherical ROIs, with a statistical significance threshold of p < 0.05 for the global and majority null hypotheses (see Methods). Bayesian analyses revealed that our results in the invisible condition were inconclusive: evidence was lacking in our data to support the null hypothesis (i.e., log(1/3) < log(BF) > log(3), see Fig. 4).

Table 3 Average decoding performance, p-values for the global and majority null hypotheses (permutation-based information prevalence inference), logarithm of Bayes factors in the visible condition in the main task, and MNI stereotaxic brain atlas coordinates for the spheres centered on brain regions defined in the unmasked condition.

Full size table

Discussion

The current study aimed at determining the neural substrates for the processing of object-context relations that are recruited irrespective of a specific task or instruction, and assessing the existence of such processing even when subjects are unaware of the presented scenes. While for the first question an extensive network of areas was identified – overcoming possible previous confounds induced by task demands or low-level features of the stimuli - the data was inconclusive with respect to answering the second question. We were unable to detect any neural activity which differentiated between congruent and incongruent scenes in the invisible condition, though as we explain below, this might stem from limitations of our study.

Neural correlates of object-context integration

In line with previous studies, several areas emerged when contrasting activations induced by congruent vs. incongruent images. A prominent group of visual areas were identified: the inferior temporal gyrus, suggested by Bar¹ (see also¹⁵) as the locus of contextually-guided object recognition, and the fusiform gyrus, previously implicated also in associative processing^73,74. In line with this finding, the Lateral Occipital Complex (LOC), which comprises the posterior part of the fusiform gyrus, was previously reported to differentiate between associatively related and unrelated objects^22,74,75. Interestingly, in a recent study which also presented congruent and incongruent scenes²⁴, the LOC did not show stronger activity for incongruent images but only responded to animal vs. non-animal objects, irrespective of the scene in which they appeared. Possibly, this discrepancy could be explained by a difference in stimuli. All of our scenes portray a person performing an action with an object, so that object identity is strongly constrained by the scene (the number of possible objects which are congruent with the scene is very small; e.g., a person can only drink from a limited set of objects). In most previous studies, including that of Rémy and colleagues²⁴, the scene is a location (e.g., a street, the beach, a living room), in which many objects could potentially appear. Arguably, the stronger manipulation of congruency induced by our stimuli elicited wider activations.

Our results also suggest that the posterior – and not solely the anterior – parahippocampal cortex (PHC), most likely corresponding to the Parahippocampal place area (PPA), is involved in processing contextual associations^{22,24,30,31,32,76,77}, in line with a previous study which also found posterior activations in response to incongruent scenes²⁴. Critically, these activations cannot be explained by spatial layouts^35,36,38,39, since the scenes were identical in that respect, so that the only difference between them is the congruent/incongruent object that was pasted onto them. Furthermore, our study overcomes another, more specific possible confound²⁴, which relates to spatial frequencies. The latter were found to modulate PHC activity^78,79, implying that its object-related activation might be at least partially explained by different spatial properties of these stimuli, rather than by their semantic content. In the current study, all scenes were empirically tested for low-level visual differences between the conditions, including spatial frequencies, and no systematic difference was found between congruent and incongruent images⁹, suggesting that the scenes differ mainly - if not exclusively - in contextual relations between the object and the scene in which it appears. Thus, our results go against low-level accounts of PHC activity, and suggest that it more likely reflects associative processes. This is in line with previous findings of PHC activations in response to objects which entail strong, as opposed to weak, contextual associations^21,31; thus, in both cases the PHC was recruited when associative processing was evoked, either by objects that entail strong associations or by scenes in which these associations are defied (note again though that our findings were more focused on posterior PHC, like in Rémy et al.²⁴, which might be explained by the usage of scenes vs. isolated objects).

Finally, our results strengthen the claim that different frontal areas (the inferior frontal gyri²⁴, the medial frontal gyrus²⁸ (notably, there a very different paradigm was used, and this area showed elevated activity for retrieval of previously learned congruent associations) and the cingulate cortex²² are indeed involved in the processing of object-scene relations (in fact, these areas elicited the most conclusive MVPA results). Some have suggested that such frontal activations recorded during the processing of incongruent scenes are task-dependent²⁴. Arguably, these activations could arise because subjects need to inhibit the competing response induced by the scene’s gist (indeed, right inferior frontal cortex activations are found in response inhibition tasks ^80,81,82), need to exert higher cognitive control⁸³, or engage in online monitoring processes⁸⁴. Yet in our experiment, these explanations are less plausible, since subjects were not asked to give any content-related responses, but simply to freely view the stimulus sequence and indicate how visible it was. In this passive viewing condition, we still found frontal activations - mainly in the inferior and middle frontal gyri (IFG and MFG, respectively) - bilaterally, while response inhibition is held to be mediated by the right IFG⁸². Our results accord with previous studies which implicated these areas in different types of associative processing, for related and unrelated objects²², for objects which have strong contextual associations²¹, for related and unrelated words²⁷ and for sentence endings that were either unrelated^85,86 or defied world-knowledge expectations⁸⁷. In addition, increased mPFC connectivity with sensory areas has been found for congruent scenes which were better remembered than incongruent ones²⁸, suggesting that this area mediates contextual facilitation of congruent object processing, and may be involved in extracting regularities across episodic experiences^88,89.

Taken together, our findings seem to support the model suggested by Bar (2004) for scene processing. According to this model, during normal scene perception the visual cortex projects a blurred, low spatial-frequency representation early and rapidly to the prefrontal cortex (PFC; note that the model remains general about the exact areas within the PFC which are involved in the process, and that we specifically found inferior and medial frontal activations) and parahippocampal cortex (PHC). This projection is considerably faster than the detailed parvocellular analysis, and presumably takes place in the magnocellular pathway^90,91. In the PHC, this coarse information activates an experience-based prediction about the scene’s context or its gist. Indeed, the gist of visual scenes can be extracted even with very short presentation durations of 13 ms⁹² (see also⁹³, though see⁹⁴), and such fast processing is held to be based on global analysis of low-level features²⁹. Then, this schema is projected to the inferior temporal cortex (ITC), where a set of schema-congruent representations is activated so that representations of objects that are more strongly related to the schema are more activated, hereby facilitating their future identification. In parallel, the upcoming visual information of the target object selected by foveal vision and attention activates information in the PFC that subsequently sensitizes the most likely candidate interpretations of that individual object, irrespective of context³¹. In the ITC, the intersection between the schema-congruent representations and the candidate interpretations of the target object results in the reliable selection of a single identity. For example, if the most likely object representations in PFC include a computer, a television set, and a microwave, and the most likely contextual representation in the PHC correspond to informatics, the computer alternative is selected in the ITC, and all other candidates are suppressed. Then, further inspection allows refinement of this identity (for example, from a “computer” to a “laptop”).

Based on this model, we can now speculate about the mechanisms of incongruent scenes processing: in this case, the process should be prolonged and require additional analysis - as implied by the elevated activations in all three areas in our study in the incongruent condition. There, since the visual properties of the incongruent object generate different guesses about object-identities in a network of frontal areas than the schema-congruent representations subsequently activated in the ITC, the intersection fails to yield an identification of the object, requiring further inspection and a re-evaluation of both the extracted gist (PHC) and the possible object-guesses. This further inspection might lead to attentional engagement, as might be implied by the increased activations in the right superior parietal lobule⁹⁵. Indeed, we previously showed that while attention is not captured by incongruent scenes, it is nevertheless engaged by them⁵⁸. The difficulty to reach a match between the upcoming visual information and the activated guesses can explain the widely-reported disadvantage in identifying incongruent objects, both with respect to accuracy^6,7,8 and to reaction times^3,4,5. It is further strengthened by EEG findings, showing that the waveforms induced by congruent and incongruent scenes start to diverge in the N300 time window (200–300 ms after the scene had been presented) - if not earlier⁹⁶, the time at which these matching procedures presumably take place, prior to object identification^9,16,18 (though see²⁰ and¹⁹). Note however, that this model holds the ITC as the locus of object-context integration, at which the information from the PHC and frontal areas converge. It also focuses on the perceptual aspect of object-context integration, to explain how scene gist affects object identification. Yet frontal activations (more specifically, IFG activations) were found also for verbal stimuli that were either semantically anomalous or defied world-knowledge expectations⁹⁷, suggesting that (a) these frontal areas may also be involved in the integrative process itself, rather than only in generating possible guesses about object identities irrespective of context and (b) that they may mediate a more general, amodal mechanism of integration and evaluation.

The role of consciousness in object-context relations processing

In this study, we found no neural evidence for processing of object-context relations in the absence of awareness. Using only trials in which subjects reported seeing nothing or only a glimpse of the stimulus, and were at chance in discriminating target image congruency (though slightly above chance in discriminating target image orientation), the BOLD activations found in the visible and unmasked conditions became undetectable in our setup. Following previous studies which failed to find univariate effects during unconscious processing but managed to show significant decoding⁷² we used multivariate pattern analysis (MVPA⁹⁸) as a more sensitive way to detect neural activations which may subserve unconscious object-context integration; however, we did not find significant decoding in the invisible condition.

How should this null result be interpreted? One possibility is that it reflects the brain’s inability to process the relations between an object and a scene when both are invisible. This finding goes against our original behavioral finding of differential suppression durations for congruent vs. incongruent scenes⁴⁴,see also⁴⁵; however, a recent study failed to replicate⁴⁹ this original finding, and we were also not able to replicate behavioral evidence for unconscious scene-object integration⁵⁰. Furthermore, another study which focused on the processing of implied motion in real-life scenes also failed to find evidence of unconscious processing⁹⁹. In the same vein, the findings of another study which investigated high-level contextual unconscious integration of words into congruent and incongruent sentences¹⁰⁰ was recently criticized^101,102 (for a review of studies showing different types of unconscious integration, see⁴⁸). Taken together with the null result in our study, this could imply that high-level integration may actually not be possible in the well-controlled absence of awareness. This interpretation is in line with the prominent theories of consciousness which consider consciousness and integration to be intimately related^51,52, if not equivalent to each other^53,54.

On the other hand, one should be cautious in interpreting the absence of evidence as evidence of absence. The observable correlates of unconscious object-scene relations processing may be so weak that we missed them; our study was likely underpowered both with respect to number of subjects and number of trials, and fMRI may simply not be a sensitive enough methodology to detect the weak effects of unconscious higher-level processing. The Bayesian analyses we performed suggest that our data in the invisible condition was indeed inconclusive, and did not support the null hypothesis. Many previous fMRI studies either showed substantially reduced or no activations to invisible stimuli^103,104,105, or effects that were significant, yet weaker and more focused compared with conscious processing e.g.^106,107,108. Together, this raises the possibility that in some cases behavioral measures may be more sensitive than imaging results; in a recent study, for example, invisible reward cues improved subjects’ performance to the same extent as visible ones - yet while the latter evoked activations in several brain regions (namely, motor and premotor cortex and inferior parietal lobe), the former did not¹⁰⁹. Critically, in that study subjects were performing a task while scanned. Our study, on the other hand, included no task in order to make sure that the observed activations were not task-induced, but rather represented object-context relations processing per se. Thus, while our findings cannot rule out the possibility that such processing indeed does not depend on conscious perception; they surely do not support this claim.

Conclusions

While finding no evidence for unconscious processing of object-context relations, the present study contributes to our understanding of the underlying mechanisms during conscious processing. We found enhanced LOC, ITC, PHC and frontal (MFG, IFG, Cingulate) activations for incongruent scenes, irrespective of task requirements. Our results cannot be explained by low-level differences between the images, including spatial frequencies, which were suggested as a possible confound in previous studies²⁴. The use of stimuli that depict people performing real-life actions with different objects, rather than non-ecological stimuli (e.g., line drawings⁷⁵; isolated, floating objects^22,23), or stimuli with which subjects have less hands-on, everyday experience (animals or objects presented in natural vs. urban sceneries^24,25,26), enables us to better track real-life object-context relations processing which also occurs outside the laboratory. Arguably, in such real-life processes, incoming visual information about object features is compared with scene-congruent representations evoked by the scene gist, in an interplay between the abovementioned areas. This interplay - usually leading to contextual facilitation of object processing - is disrupted when incongruent scenes are presented, resulting in additional neural processing to resolve these incongruencies. Thus, our results go beyond previous studies by overcoming potential design limitations and providing further evidence for frontotemporal mechanisms of object-scene relations processing.

References

Bar, M. Visual objects in context. Nature Reviews Neuroscience 5, 617–629, http://www.nature.com/nrn/journal/v5/n8/abs/nrn1476.html (2004).
Biederman, I., Mezzanotte, R. J. & Rabinowitz, J. C. Scene perception: detecting and judging objects undergoing relational violations. Cognitive Psychology 14, 143–177 (1982).
Article CAS PubMed Google Scholar
Davenport, J. L. & Potter, M. C. Scene consistency in object and background perception. Psychological Science 15, 559–564 (2004).
Article PubMed Google Scholar
Palmer, S. E. Effects of Contextual Scenes on Identification of Objects. Memory and Cognition 3, 519–526 (1975).
Article CAS PubMed Google Scholar
Rieger, J. W., Kochy, N., Schalk, F., Gruschow, M. & Heinze, H. J. Speed limits: Orientation and semantic context interactions constrain natural scene discrimination dynamics. Journal of Experimental Psychology-Human Perception and Performance 34, 56–76 (2008).
Article PubMed Google Scholar
Biederman, I., Rabinowitz, J. C., Glass, A. L. & Stacy, E. W. Information Extracted from a Glance at a Scene. Journal of Experimental Psychology 103, 597–600 (1974).
Article CAS PubMed Google Scholar
Boyce, S. J., Pollatsek, A. & Rayner, K. Effect of Background Information on Object Identification. Journal of Experimental Psychology-Human Perception and Performance 15, 556–566 (1989).
Article CAS PubMed Google Scholar
Underwood, G. In Cognitive Processes in Eye Guidance (ed. Underwood, G.) 163–187 (Oxford University Press, 2005).
Mudrik, L., Lamy, D. & Deouell, L. Y. ERP evidence for context congruity effects during simultaneous object-scene processing. Neuropsychologia 48, 507–517, https://doi.org/10.1016/j.neuropsychologia.2009.10.011 (2010).
Article PubMed Google Scholar
Oliva, A. & Torralba, A. The role of context in object recognition. Trends in Cognitive Sciences 11, 527 (2007).
Article Google Scholar
Henderson, J. M. & Hollingworth, A. High-level scene perception. Annual Review of Psychology 50, 243–271 (1999).
Article CAS PubMed Google Scholar
Oliva, A. & Torralba, A. Building the gist of a scene: the role of global image features in recognition. Progress in Brain Research 155, 23–36 (2006).
Article PubMed Google Scholar
Summerfield, C. & Egner, T. Expectation (and attention) in visual cognition. Trends in cognitive sciences 13, 403–409 (2009).
Article PubMed Google Scholar
Bar, M. A cortical mechanism for triggering top-down facilitation in visual object recognition. Journal of Cognitive Neuroscience 15, 600–609 (2003).
Article PubMed Google Scholar
Trapp, S. & Bar, M. Prediction, context, and competition in visual recognition. Annals of the New York Academy of Sciences 1339, 190–198 (2015).
Article ADS PubMed Google Scholar
Mudrik, L., Lamy, D., Shalgi, S. & Deouell, L. Y. Synchronous contextual irregularities affect early scene processing: replication and extension. Neuropsychologia 56, 447–458 (2014).
Article PubMed Google Scholar
Dyck, M. & Brodeur, M. B. ERP evidence for the influence of scene context on the recognition of ambiguous and unambiguous objects. Neuropsychologia 72, 43–51 (2015).
Article PubMed Google Scholar
Võ, M. L.-H. & Wolfe, J. M. Differential electrophysiological signatures of semantic and syntactic scene processing. Psychological science 24, 1816–1823 (2013).
Article PubMed PubMed Central Google Scholar
Demiral, Ş. B., Malcolm, G. L. & Henderson, J. M. ERP correlates of spatially incongruent object identification during scene viewing: Contextual expectancy versus simultaneous processing. Neuropsychologia 50, 1271–1285 (2012).
Article PubMed Google Scholar
Ganis, G. & Kutas, M. An electrophysiological study of scene effects on object identification. Cognitive Brain Research 16, 123–144 (2003).
Article PubMed Google Scholar
Kveraga, K. et al. Early onset of neural synchronization in the contextual associations network. Proceedings of the National Academy of Sciences 108, 3389–3394 (2011).
Article ADS CAS Google Scholar
Gronau, N., Neta, M. & Bar, M. Integrated contextual representation for objects’ identities and their locations. Journal of Cognitive Neuroscience 20, 371–388 (2008).
Article PubMed Google Scholar
Kaiser, D., Stein, T. & Peelen, M. V. Object grouping based on real-world regularities facilitates perception by reducing competitive interactions in visual cortex. Proceedings of the National Academy of Sciences 111, 11217–11222 (2014).
Article ADS CAS Google Scholar
Rémy, F., Vayssière, N., Saint-Aubert, L., Barbeau, E. & Fabre-Thorpe, M. The anterior parahippocampal cortex processes contextual incongruence in a scene. Journal of Vision 13, 1064–1064 (2014).
Article Google Scholar
Jenkins, L. J., Yang, Y. J., Goh, J., Hong, Y. Y. & Park, D. C. Cultural differences in the lateral occipital complex while viewing incongruent scenes. SCAN 5, 236–241 (2010).
PubMed Google Scholar
Kirk, U. The Neural Basis of Object-Context Relationships on Aesthetic Judgment. PLoS One 3, e3754 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
Gold, B. T. et al. Dissociation of automatic and strategic lexical-semantics: functional magnetic resonance imaging evidence for differing roles of multiple frontotemporal regions. Journal of Neuroscience 26, 6523–6532 (2006).
Article CAS PubMed Google Scholar
van Kesteren, M. T. R., Rijpkema, M., Ruiter, D. J. & Fernández, G. Retrieval of associative information congruent with prior knowledge is related to increased medial prefrontal activity and connectivity. Journal of Neuroscience 30, 15888–15894 (2010).
Article PubMed CAS Google Scholar
Malcolm, G. L., Groen, I. A. & Baker, C. I. Making sense of real-world scenes. Trends in Cognitive Sciences 20, 843–856 (2016).
Article PubMed PubMed Central Google Scholar
Aminoff, E. M., Kveraga, K. & Bar, M. The role of the parahippocampal cortex in cognition. Trends in cognitive sciences 17, 379–390 (2013).
Article PubMed PubMed Central Google Scholar
Bar, M. & Aminoff, E. Cortical analysis of visual context. Neuron 38, 347–358 (2003).
Article CAS PubMed Google Scholar
Montaldi, D. et al. Associative encoding of pictures activates the medial temporal lobes. Human brain mapping 6, 85–104 (1998).
Article CAS PubMed Google Scholar
Goh, J. O. S. et al. Cortical areas involved in object, background, and object background processing revealed with functional magnetic resonance adaptation. Journal of Neuroscience 24, 10223–10228 (2004).
Article CAS PubMed Google Scholar
Stansbury, D. E., Naselaris, T. & Gallant, J. L. Natural scene statistics account for the representation of scene categories in human visual cortex. Neuron 79, 1025–1034 (2013).
Article CAS PubMed PubMed Central Google Scholar
Epstein, R. A. Parahippocampal and retrosplenial contributions to human spatial navigation. Trends in cognitive sciences 12, 388–396 (2008).
Article PubMed PubMed Central Google Scholar
Epstein, R. A. & Ward, E. J. How reliable are visual context effects in the parahippocampal place area? Cerebral Cortex 20, 294–303 (2009).
Article PubMed Google Scholar
MacEvoy, S. P. & Epstein, R. A. Constructing scenes from objects in human occipitotemporal cortex. Nature neuroscience 14, 1323–1329 (2011).
Article CAS PubMed Google Scholar
Harel, A., Kravitz, D. J. & Baker, C. I. Deconstructing visual scenes in cortex: gradients of object and spatial layout information. Cerebral Cortex 23, 947–957 (2013).
Article PubMed Google Scholar
Mullally, S. L. & Maguire, E. A. A new role for the parahippocampal cortex in representing space. Journal of Neuroscience 31, 7441–7449 (2011).
Article CAS PubMed Google Scholar
Howard, L. R., Kumaran, D., Ólafsdóttir, H. F. & Spiers, H. J. Double dissociation between hippocampal and parahippocampal responses to object–background context and scene novelty. Journal of Neuroscience 31, 5253–5261 (2011).
Article CAS PubMed Google Scholar
Epstein, R. A., Parker, W. E. & Feiler, A. M. Where am I now? Distinct roles for parahippocampal and retrosplenial cortices in place recognition. Journal of Neuroscience 27, 6141–6149 (2007).
Article CAS PubMed Google Scholar
Spiers, H. J. & Maguire, E. A. Thoughts, behaviour, and brain dynamics during navigation in the real world. Neuroimage 31, 1826–1840 (2006).
Article PubMed Google Scholar
Marchette, S. A., Vass, L. K., Ryan, J. & Epstein, R. A. Outside looking in: Landmark generalization in the human navigational system. Journal of Neuroscience 35, 14896–14908 (2015).
Article CAS PubMed Google Scholar
Mudrik, L., Breska, A., Lamy, D. & Deouell, L. Y. Integration without awareness: expanding the limits of unconscious processing. Psychological Science 22, 764–770 (2011).
Article PubMed Google Scholar
Mudrik, L. & Koch, C. Differential processing of invisible congruent and incongruent scenes: A case for unconscious integration. Journal of Vision 13, 24 (2013).
Article PubMed Google Scholar
Stein, T., Kaiser, D. & Peelen, M. V. Interobject grouping facilitates visual awareness. Journal of Vision 15, 10–10 (2015).
Article PubMed Google Scholar
Lin, Z. C. & He, S. Seeing the invisible: The scope and limits of unconscious processing in binocular rivalry. Progress in Neurobiology 87, 195–211, https://doi.org/10.1016/j.pneurobio.2008.09.002 (2009).
Article ADS PubMed Google Scholar
Mudrik, L., Faivre, N. & Koch, C. Information integration without awareness. Trends in Cognitive Sciences 18, 488–496 (2014).
Article PubMed Google Scholar
Moors, P., Boelens, D., van Overwalle, J. & Wagemans, J. Scene integration without awareness: No conclusive evidence for processing scene congruency during continuous flash suppression. Psychological science 27, 945–956 (2016).
Article PubMed Google Scholar
Biderman, N. & Mudrik, L. Evidence for Implicit—But Not Unconscious—Processing of Object-Scene Relations. Psychological Science, 1–12 (2017).
Dehaene, S. & Changeux, J. P. Experimental and theoretical approaches to conscious processing. Neuron 70, 200–227 (2011).
Article CAS PubMed Google Scholar
Dehaene, S. & Naccache, L. Towards a cognitive neuroscience of consciousness: basic evidence and a workspace framework. Cognition 79, 1–37 (2001).
Article CAS PubMed Google Scholar
Tononi, G., Boly, M., Massimini, M. & Koch, C. Integrated information theory: from consciousness to its physical substrate. Nature Reviews Neuroscience 17, 450–461 (2016).
Article CAS PubMed Google Scholar
Tononi, G. & Edelman, G. M. Consciousness and Complexity. Science 282, 1846 (1998).
Article CAS PubMed Google Scholar
Brainard, D. The psychophysics toolbox. Spatial Vision 10, 433–436, https://doi.org/10.1163/156856897X00357 (1997).
Article CAS PubMed Google Scholar
Pelli, D. G. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision 10, 437–442 (1997).
Article CAS PubMed Google Scholar
Kleiner, M. et al. What’s new in Psychtoolbox-3. Perception 36, 1 (2007).
Google Scholar
Mudrik, L., Deouell, L. Y. & Lamy, D. Scene congruency biases binocular rivalry. Consciousness and Cognition 20, 756–767 (2011).
Article PubMed Google Scholar
Neumann, D. & Gegenfurtner, K. Image retrieval and perceptual similarity. ACM Transactions on Applied Perception (TAP) 3, 31–47 (2006).
Article Google Scholar
Ramsøy, T. Z. & Overgaard, M. Introspection and subliminal perception. Phenomenology and the Cognitive Sciences 3, 1–23 (2004).
Article Google Scholar
Levitt, H. Transformed Up-Down Methods in Psychoacoustics. The Journal of the Acoustical Society of America 49, 467 (1971).
Article ADS Google Scholar
Wager, T. D. & Nichols, T. E. Optimization of experimental design in fMRI: a general framework using a genetic algorithm. Neuroimage 18, 293–309 (2003).
Article PubMed Google Scholar
Macknik, S. L. & Livingstone, M. S. Neuronal correlates of visibility and invisibility in the primate visual system. Nature Neuroscience 1, 144–149 (1998).
Article CAS PubMed Google Scholar
Morey, R. D. & Rouder, J. N. BayesFactor: Computation of Bayes Factors for Common Designs. R package version 0.9.10-2 (2015).
Wickham, H. ggplot2: elegant graphics for data analysis. Springer New York 1, 3 (2009).
Mazaika, P. K., Whitfield, S. & Cooper, J. C. Detection and repair of transient artifacts in fMRI data. Neuroimage 26, S36 (2005).
Article Google Scholar
Smith, S. M. & Nichols, T. E. Threshold-free cluster enhancement: addressing problems of smoothing, threshold dependence and localisation in cluster inference. Neuroimage 44, 83–98 (2009).
Article PubMed Google Scholar
Allefeld, C., Görgen, K. & Haynes, J. D. Valid population inference for information-based imaging: From the second-level t-test to prevalence inference. NeuroImage 141, 378–392 (2016).
Article PubMed Google Scholar
Melloni, L., Schwiedrzik, C. M., Müller, N., Rodriguez, E. & Singer, W. Expectations change the signatures and timing of electrophysiological correlates of perceptual awareness. Journal of Neuroscience 31, 1386–1396 (2011).
Article CAS PubMed Google Scholar
Gelbard-Sagiv, H., Faivre, N., Mudrik, L. & Koch, C. Low-level awareness accompanies “unconscious” high-level processing during continuous flash suppression. Journal of vision 16, 3 (2016).
Article PubMed Google Scholar
Kouider, S. & Dupoux, E. Partial awareness creates the “illusion” of subliminal semantic priming. Psychological Science 15, 75–81 (2004).
Article PubMed Google Scholar
Sterzer, P., Haynes, J. D. & Rees, G. Fine-scale activity patterns in high-level visual areas encode the category of invisible objects. Journal of Vision 8, 10 (2008).
Article PubMed Google Scholar
Hawco, C. & Lepage, M. Overlapping patterns of neural activity for different forms of novelty in fMRI. Frontiers in human neuroscience 8 (2014).
Roberts, K. L. & Humphreys, G. W. Action relationships concatenate representations of separate objects in the ventral visual system. Neuroimage 52, 1541–1548 (2010).
Article PubMed Google Scholar
Kim, J. G. & Biederman, I. Where do objects become scenes? Cerebral Cortex, bhq240 (2010).
Baumann, O. & Mattingley, J. B. Functional organization of the parahippocampal cortex: dissociable roles for context representations and the perception of visual scenes. Journal of Neuroscience 36, 2536–2542 (2016).
Article CAS PubMed Google Scholar
Aminoff, E. M., Gronau, N. & Bar, M. The Parahippocampal Cortex Mediates Spatial and Nonspatial Associations. Cerebral Cortex (2006).
Andrews, T. J., Clarke, A., Pell, P. & Hartley, T. Selectivity for low-level features of objects in the human ventral stream. Neuroimage 49, 703–711 (2010).
Article PubMed Google Scholar
Rajimehr, R., Devaney, K. J., Bilenko, N. Y., Young, J. C. & Tootell, R. B. H. The “parahippocampal place area” responds preferentially to high spatial frequencies in humans and monkeys. PLoS Biol 9, e1000608 (2011).
Article CAS PubMed PubMed Central Google Scholar
Aron, A. R., Robbins, T. W. & Poldrack, R. A. Inhibition and the right inferior frontal cortex. Trends in cognitive sciences 8, 170–177 (2004).
Article PubMed Google Scholar
Garavan, H., Ross, T. J., Murphy, K., Roche, R. A. P. & Stein, E. A. Dissociable executive functions in the dynamic control of behavior: inhibition, error detection, and correction. Neuroimage 17, 1820–1829 (2002).
Article CAS PubMed Google Scholar
Garavan, H., Ross, T. J. & Stein, E. A. Right hemispheric dominance of inhibitory control: an event-related functional MRI study. Proceedings of the National Academy of Sciences 96, 8301–8306 (1999).
Article ADS CAS Google Scholar
Miller, E. K. The prefontral cortex and cognitive control. Nature reviews neuroscience 1, 59–65 (2000).
Article CAS PubMed Google Scholar
Kerns, J. G. et al. Anterior cingulate conflict monitoring and adjustments in control. Science 303, 1023–1026 (2004).
Article ADS CAS PubMed Google Scholar
Baumgaertner, A., Weiller, C. & Büchel, C. Event-related fMRI reveals cortical sites involved in contextual sentence integration. Neuroimage 16, 736–745 (2002).
Article CAS PubMed Google Scholar
Kiehl, K. A., Laurens, K. R. & Liddle, P. F. Reading anomalous sentences: an event-related fMRI study of semantic processing. Neuroimage 17, 842–850 (2002).
Article PubMed Google Scholar
Hagoort, P., Hald, L., Bastiaansen, M. & Petersson, K. M. Integration of word meaning and world knowledge in language comprehension. science 304, 438–441 (2004).
Article ADS CAS PubMed Google Scholar
Kroes, M. C. W. & Fernández, G. Dynamic neural systems enable adaptive, flexible memories. Neuroscience & Biobehavioral Reviews 36, 1646–1666 (2012).
Article Google Scholar
Schlichting, M. L. & Preston, A. R. Memory integration: neural mechanisms and implications for behavior. Current opinion in behavioral sciences 1, 1–8 (2015).
Article PubMed PubMed Central Google Scholar
Graboi, D. & Lisman, J. Recognition by top-down and bottom-up processing in cortex: The control of selective attention. Journal of Neurophysiology 90, 798–810, https://doi.org/10.1152/jn.00777.2002 (2003).
Article PubMed Google Scholar
Merigan, W. H. & Maunsell, J. H. R. How Parallel Are the Primate Visual Pathways. Annual Review of Neuroscience 16, 369–402 (1993).
Article CAS PubMed Google Scholar
Potter, M. C., Wyble, B., Hagmann, C. E. & McCourt, E. S. Detecting meaning in RSVP at 13 ms per pictur. e. Attention, Perception, & Psychophysics 76, 270–279 (2014).
Article Google Scholar
Joubert, O. R., Fize, D., Rousselet, G. A. & Fabre-Thorpe, M. Categorisation of natural scene: Global context is extracted as fast as objects. Perception 34, ECVP Abstract Supplement (2005).
Maguire, J. F. & Howe, P. D. L. Failure to detect meaning in RSVP at 27 ms per picture. Attention, Perception, & Psychophysics 78, 1405–1413 (2016).
Article Google Scholar
Corbetta, M., Shulman, G. L., Miezin, F. M. & Petersen, S. E. Superior parietal cortex activation during spatial attention shifts and visual feature conjunction. SCIENCE-NEW YORK THEN WASHINGTON-, 802–802 (1995).
Guillaume, F., Tinard, S., Baier, S. & Dufau, S. An ERP Investigation of Object-Scene Incongruity. Journal of Psychophysiology, 1–10 (2016).
Hagoort, P., Baggio, G. & Willems, R. M. Semantic unification. The cognitive neurosciences 4, 819–836 (2009).
Google Scholar
Norman, K. A., Polyn, S. M., Detre, G. J. & Haxby, J. V. Beyond mind-reading: multi-voxel pattern analysis of fMRI data. Trends in cognitive sciences 10, 424–430 (2006).
Article PubMed Google Scholar
Faivre, N. & Koch, C. Inferring the direction of implied motion depends on visual awareness. Journal of vision 14, 4 (2014).
Article PubMed PubMed Central Google Scholar
Sklar, A. Y. et al. Reading and doing arithmetic nonconsciously. Proceedings of the National Academy of Sciences 109, 19614–19619 (2012).
Article ADS CAS Google Scholar
Moors, P. & Hesselmann, G. A critical reexamination of doing arithmetic nonconsciously. Psychonomic Bulletin & Review, 1–10 (2017).
Shanks, D. R. Regressive research: The pitfalls of post hoc data selection in the study of unconscious mental processes. Psychonomic Bulletin & Review, 1–24 (2016).
Bahmani, H., Murayama, Y., Logothetis, N. K. & Keliris, G. A. Binocular flash suppression in the primary visual cortex of anesthetized and awake macaques. PloS one 9, e107628 (2014).
Article PubMed PubMed Central CAS Google Scholar
Hesselmann, G. & Malach, R. The link between fMRI-BOLD activation and perceptual awareness is “stream-invariant” in the human visual system. Cerebral Cortex 21, 2829–2837 (2011).
Article PubMed Google Scholar
Yuval-Greenberg, S. & Heeger, D. J. Continuous flash suppression modulates cortical activity in early visual cortex. The Journal of Neuroscience 33, 9635–9643 (2013).
Article CAS PubMed PubMed Central Google Scholar
Dehaene, S. et al. Cerebral mechanisms of word masking and unconscious repetition priming. Nature Neuroscience 4, 752–758 (2001).
Article CAS PubMed Google Scholar
Sadaghiani, S. et al. Intrinsic connectivity networks, alpha oscillations, and tonic alertness: a simultaneous electroencephalography/functional magnetic resonance imaging study. Journal of Neuroscience 30, 10243–10250 (2010).
Article CAS PubMed Google Scholar
Van Gaal, S., Lamme, V. A. F., Fahrenfort, J. J. & Ridderinkhof, K. R. Dissociable brain mechanisms underlying the conscious and unconscious control of behavior. Journal of cognitive neuroscience 23, 91–105 (2011).
Article PubMed Google Scholar
Bijleveld, E. et al. Distinct neural responses to conscious versus unconscious monetary reward cues. Human brain mapping 35, 5578–5586 (2014).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Christof Koch and Ralph Adolphs for constructive suggestions and helpful discussion, and Hagar Gelbard Sagiv and Uri Maoz for their help with the design. We further thank Galit Yovel and Nurit Gronau for their insightful comments on the manuscript. LM was supported by the Israel Science Foundation (grant No. 1847/16) and the Marie Skłodowska-Curie Individual Fellowships (Grant No. 659765- MSCA-IF-EF-ST). NF was supported by the Fyssen and the Philippe foundations.

Author information

Authors and Affiliations

Division of Biology, California Institute of Technology, Pasadena, CA, 91125, USA
Nathan Faivre, Naama Schwartz & Liad Mudrik
Laboratory of Cognitive Neuroscience, Brain Mind Institute, Faculty of Life Sciences, Swiss Federal Institute of Technology (EPFL), Geneva, Switzerland
Nathan Faivre
Centre d’Economie de la Sorbonne, CNRS UMR 8174, Paris, France
Nathan Faivre
Division of the Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
Julien Dubois
Department of Neurosurgery, Cedars Sinai Medical Center, Los Angeles, CA, USA
Julien Dubois
School of Psychological sciences, Tel Aviv University, Tel Aviv, Israel
Naama Schwartz & Liad Mudrik
Sagol school of Neuroscience, Tel Aviv University, Tel Aviv, Israel
Liad Mudrik

Authors

Nathan Faivre
View author publications
You can also search for this author in PubMed Google Scholar
Julien Dubois
View author publications
You can also search for this author in PubMed Google Scholar
Naama Schwartz
View author publications
You can also search for this author in PubMed Google Scholar
Liad Mudrik
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.M. developed the study concept and all authors contributed to the study design and data collection. N.F. performed data analysis. N.F. and L.M. drafted the paper; all authors provided critical revisions and approved the final version of the paper for submission.

Corresponding authors

Correspondence to Nathan Faivre or Liad Mudrik.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file S1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Faivre, N., Dubois, J., Schwartz, N. et al. Imaging object-scene relations processing in visible and invisible natural scenes. Sci Rep 9, 4567 (2019). https://doi.org/10.1038/s41598-019-38654-z

Download citation

Received: 23 January 2018
Accepted: 13 December 2018
Published: 14 March 2019
DOI: https://doi.org/10.1038/s41598-019-38654-z

This article is cited by

Predictive processing of scenes and objects
- Marius V. Peelen
- Eva Berlot
- Floris P. de Lange
Nature Reviews Psychology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Cortical and subcortical signatures of conscious object recognition

Task-dependent functional organizations of the visual ventral stream

A functional dissociation of face-, body- and scene-selective brain areas based on their response to moving and static stimuli

Introduction

Methods

Participants

Apparatus and stimuli

Procedure

Calibration condition

Main experimental conditions

Post-experiment visibility tests

Unmasked condition

Behavioral data analysis

MRI data acquisition and preprocessing

fMRI analysis

Results

Behavioral data

Imaging data

Univariate analyses

Multivariate analyses

Discussion

Neural correlates of object-context integration

The role of consciousness in object-context relations processing

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Supplementary information

Supplementary file S1

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Predictive processing of scenes and objects

Comments

Search

Quick links