Dense sampling reveals behavioral oscillations in rapid visual categorization

Drewes, Jan; Zhu, Weina; Wutz, Andreas; Melcher, David

doi:10.1038/srep16290

Published: 06 November 2015

Scientific Reports volume 5, Article number: 16290 (2015)

Abstract

Perceptual systems must create discrete objects and events out of a continuous flow of sensory information. Previous studies have demonstrated oscillatory effects in the behavioral outcome of low-level visual tasks, suggesting a cyclic nature of visual processing as the solution. To investigate whether these effects extend to more complex tasks, a stream of “neutral” photographic images (not containing targets) was rapidly presented (20 ms/image). Embedded were one or two presentations of a randomly selected target image (vehicles and animals). Subjects reported the perceived target category. On dual-presentation trials, the ISI varied systematically from 0 to 600 ms. At randomized timing before first target presentation, the screen was flashed with the intent of creating a phase reset in the visual system. Sorting trials by temporal distance between flash and first target presentation revealed strong oscillations in behavioral performance, peaking at 5 Hz. On dual-target trials, longer ISIs led to reduced performance, implying a temporal integration window for object category discrimination. The “animal” trials exhibited a significant oscillatory component around 5 Hz. Our results indicate that oscillatory effects are not mere fringe effects relevant only with simple stimuli, but are resultant from the core mechanisms of visual processing and may well extend into real-life scenarios.

Introduction

Our visual system is capable of rapidly extracting high-level meaning from a visual scene. Studies have shown that humans are able to discriminate visual category for scenes presented as briefly as 20 ms^1,2 and to initiate actions contingent on target category in as little as 100 ms^2,3. This ability to grasp the gist of a briefly presented visual scene can also occur when multiple images are presented in succession⁴, even at high rates of presentation⁵.

These findings raise the critical question of how the visual system is able to quickly categorize a new, unexpected stimulus. It is commonly assumed that the ultra-fast mode of visual target identification is based on feed-forward processing^3,6,7. However, other studies have shown evidence that recurrent processes also play an important role in object and scene processing and that categorization can involve temporal integration^8,9,10,11. Moreover, it has been argued that recurrent processes may be required for stimuli to be consciously perceived^12,13.

The goal of the present study was to investigate three critical issues regarding rapid visual categorization. The first was the temporal window during which information is combined in making an initial judgment of visual category. Previous studies have focused on the minimum duration necessary for categorization, but a full understanding of the underlying mechanisms would include also the temporal integration window (TW) during which additional information can affect perception. Second, we examined the role of recurrent processing in rapid visual categorization. One hallmark of recurrent processing is oscillations in behavioral performance, reflecting the interaction between the stimulus and the state of ongoing brain activity¹⁴. The perceptual system undergoes periods of optimal and sub-optimal performance, as shown in various perception tasks such as target detection^15,16 and also in motor responses^17,18,19,20. Finally, this study examined the claim that rapid categorization depends on target category, such that certain target categories, such as animals, are afforded a privileged kind of processing^{21,22,23,24,25,26}.

To test these questions, we presented target images (animal or vehicle) in an RSVP sequence of natural scenes not containing any target objects (20 ms/50 Hz). We added two novel manipulations in order to investigate the mechanisms of rapid categorization. First, we inserted a bright flash into the rapid image sequence at a controlled, but randomized time (see methods below). This flash should induce a phase reset in the visual system, creating a degree of phase coherence across both trials and subjects^27,28. By varying the onset of the target relative to the flash, it is possible to align and systematically sample the time course of performance in order to find signs of behavioral oscillations^{16,29,30,31,32}.

In addition, we included trials in which there were two targets in the RSVP, separated by a varying ISI. We expected that presenting two targets in immediate succession, within the temporal integration window, would lead to the highest levels of performance and that lengthening the ISI would eventually eliminate most of the benefit of having two targets. We investigated the time course of the interaction between the two stimuli to look for evidence of recurrent processing. After a certain delay congruent with the duration of the feedback loop, a second presentation of the target should reinforce processing of the original stimulus and lead to an increase in target recognition/discrimination performance.

Methods

Participants

A total of 38 participants completed the experiment. All subjects reported having normal or corrected-to-normal vision and gave informed written consent. Experiments were conducted in accordance with the Declaration of Helsinki and approved by the University of Trento ethical committee.

Stimuli

Photographs of natural scenes were displayed in a rapid serial visual presentation (RSVP) paradigm. The images for use in this study were selected from the Corel Stock Photo Library and divided into 3 different categories. The first category was the “background/neutral” category. These images contained neither text, nor vehicles, nor animals, but otherwise any natural scene was deemed acceptable, resulting in a set of 3973 images. The other two categories were target images containing either an animal or vehicle (300 images each, see also³³). As color information has been shown not to be critical for ultra-fast visual processing^33,34,35, all images were converted to gray-scale. Sample images may be seen in Fig. 1. All images were square, measuring 300 pixels width and height, in 8 bit gray scales. To reduce low-level differences both within and across image categories, all images were histogram-equalized using the SHINE toolbox³⁶.

Experimental paradigm

A random stream of “background/neutral” images was displayed on a 20” CRT screen at 1024 × 768 pixels spatial resolution, with a vertical refresh rate of 100 Hz. Every image was shown for 2 refresh cycles, resulting in 50 Hz image frequency. Background/neutral images were displayed in the center of the screen, spanning approximately 10.8 deg of visual field, at 50% contrast surrounded by a black background. For the duration of one image (20 ms), at a randomized time ranging from 700 to 1200 ms after image sequence onset, the background of the screen was flashed from black to white. At a random time interval ranging from 200 to 740 ms (ten steps of 60 ms) thereafter (Flash/Stimulus onset interval), either one or two presentations of the same target chosen randomly from the animal or vehicle image categories were displayed (20 ms each), replacing the background/neutral image at the corresponding time point(s). When the target image was shown twice, the inter-stimulus interval (ISI) between the two presentations was sampled systematically between 0 and 600 ms (31 steps of 20 ms), in randomized order. Overall trial duration was fixed at 2.7 seconds. A flowchart of the paradigm can be seen in Fig. 2. After each trial, subjects were asked to indicate whether the target had been an animal or a vehicle by pressing one of two keys on the keyboard. Each block consisted of 16 repetitions of the single presentation condition and 4 repetitions of each of the 31 ISIs in the double presentation condition, with a total of 140 trials per block. Each experimental session consisted of 4 blocks, with a total of 560 trials. Prior to each block, the contrast of only the target images was adapted by an adaptive staircase procedure (QUEST³⁷) to level out subject performance at 57% correct based on those trials with a single target presentation. The experiment was programmed in Matlab, using the Psychtoolbox³⁸.
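The trial counts above can be checked with a short sketch. This is an illustration in Python, not the authors' Matlab code: the function name `make_block`, the dictionary keys, the 20 ms granularity of the flash time and the independent random draw of FSOA and category on every trial are all assumptions made for the example.

```python
import random

FSOA_MS = [200 + 60 * i for i in range(10)]   # 200..740 ms, ten steps of 60 ms
ISI_MS = [20 * i for i in range(31)]          # 0..600 ms, 31 steps of 20 ms
CATEGORIES = ("animal", "vehicle")
IMAGE_MS = 20                                 # one image = 2 refreshes at 100 Hz


def make_block(rng):
    """Return one shuffled block: 16 single-target + 4 x 31 dual-target trials."""
    conditions = [None] * 16                  # None marks a single presentation
    conditions += [isi for isi in ISI_MS for _ in range(4)]
    trials = []
    for isi in conditions:
        trials.append({
            "isi_ms": isi,
            # ASSUMPTION: flash onset is drawn on the 20 ms image grid.
            "flash_ms": rng.randrange(700, 1201, IMAGE_MS),
            # ASSUMPTION: FSOA and category are drawn independently per trial.
            "fsoa_ms": rng.choice(FSOA_MS),
            "category": rng.choice(CATEGORIES),
        })
    rng.shuffle(trials)
    return trials


def last_target_offset_ms(trial):
    """Time at which the last target presentation ends, from sequence onset."""
    end = trial["flash_ms"] + trial["fsoa_ms"] + IMAGE_MS
    if trial["isi_ms"] is not None:
        end += trial["isi_ms"] + IMAGE_MS
    return end


rng = random.Random(0)
session = [trial for _ in range(4) for trial in make_block(rng)]
print(len(session), max(last_target_offset_ms(t) for t in session))
```

Under these assumptions the latest possible target offset (1200 + 740 + 20 + 600 + 20 = 2580 ms) stays inside the fixed 2.7 s trial duration.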

Results

On average, the QUEST-determined contrast for the target images was 71.1%, with a standard deviation across subjects of 7.2% and a standard error of 1.2%. Target discrimination performance in the single-presentation condition reached an average of 56.0%.

Single target trials: Flash/Stimulus onset asynchrony (FSOA) analysis

Firstly, all trials were sorted by the temporal distance between the flash and the first (or only) stimulus presentation (Flash/Stimulus onset asynchrony, FSOA). Average target discrimination performance across all conditions ranged from 60.0 to 64.5%. When separating trials into single and double presentation, the average performance of the single presentation trials ranged from 50.2 to 60.0%, while the double presentation trials ranged from 62.0 to 65.0% (see Fig. 3A). The difference was significant (paired two-tailed t-test, p < 0.0001). To identify any periodic components relative to the flash onset in the time course of the single-presentation data, the individual means of each subject were centered and the data were Fourier-transformed with the application of a Hamming window and zero-padding. Of the resulting Fourier spectra, the amplitude information was averaged across subjects, while the phase information was discarded (for a similar approach, see^{16,29,30,31,32}). The maximum of the resulting average spectrum was located at 5.03 Hz (see Fig. 3C). A null distribution was then generated by randomly exchanging the individual time points of the subjects (permutation analysis, N = 100,000), with subsequent processing as before. Under the null hypothesis that no periodic components exist, this should not change the average of the Fourier spectrum in a significant way. After sorting, the significance margin was determined by the percentage of null distribution samples under the real averaged amplitude spectrum, as shown in Fig. 3C. The main peak in the spectrum was found to be significant (p < 0.05: 4.7–5.2 Hz, p_min = 0.0244 (5.03 Hz), Bonferroni corrected). Separating data by target category revealed no significant effect for either animal or vehicle stimuli (see Fig. 3D). When pooling both single and double presentation trials, a similar trend emerged, but did not reach significance. Also, no significant result was found in the double presentation trials alone, which we attribute to the additional variance from the variable ISI (see below).
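The spectral analysis and permutation test described above can be sketched as follows. This is a minimal standard-library Python illustration on synthetic data, not the authors' Matlab implementation. The padded length `N_PAD = 1024`, all function names, the synthetic 5 Hz signal and the choice to test only at the peak frequency (without the Bonferroni correction across frequencies) are assumptions made for the example.

```python
import cmath
import math
import random

DT = 0.060       # FSOA sampling step in seconds (60 ms)
N_POINTS = 10    # ten FSOA steps
N_PAD = 1024     # ASSUMPTION: zero-padded length; not stated in the text


def amplitude(series, freq_hz):
    """Amplitude at freq_hz of a mean-centered, Hamming-windowed series."""
    n = len(series)
    mean = sum(series) / n
    total = 0j
    for i, x in enumerate(series):
        w = 0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1))  # Hamming weight
        total += (x - mean) * w * cmath.exp(-2j * math.pi * freq_hz * i * DT)
    return abs(total)


def mean_amplitude(subjects, freq_hz):
    """Average the amplitudes across subjects; phase is discarded."""
    return sum(amplitude(s, freq_hz) for s in subjects) / len(subjects)


def peak_frequency(subjects):
    """Frequency (Hz) with the largest averaged amplitude, excluding 0 Hz.

    Evaluating the transform on the grid k * fs / N_PAD is equivalent to
    zero-padding each series to N_PAD samples before a DFT.
    """
    fs = 1.0 / DT
    freqs = [k * fs / N_PAD for k in range(1, N_PAD // 2 + 1)]
    return max(freqs, key=lambda f: mean_amplitude(subjects, f))


def permutation_p(subjects, freq_hz, n_perm, rng):
    """Share of time-shuffled data sets that reach the observed amplitude."""
    observed = mean_amplitude(subjects, freq_hz)
    hits = 0
    for _ in range(n_perm):
        shuffled = [rng.sample(s, len(s)) for s in subjects]
        if mean_amplitude(shuffled, freq_hz) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Synthetic data: 8 hypothetical subjects with a 5 Hz modulation of accuracy.
rng = random.Random(1)
subjects = []
for _ in range(8):
    phase = rng.uniform(0, 2 * math.pi)
    subjects.append([
        0.6 + 0.05 * math.sin(2 * math.pi * 5.0 * i * DT + phase)
        + rng.gauss(0, 0.01)
        for i in range(N_POINTS)
    ])

peak = peak_frequency(subjects)
p_value = permutation_p(subjects, peak, 500, rng)
print(round(peak, 2), p_value)
```

Because amplitudes are averaged after the transform, subjects with different phases do not cancel each other, which is why the phase information can be discarded in this approach.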

ISI Analysis

When sorting the double presentation trials by ISI, the average subject performance was best at short ISIs (maximum at 0 ms ISI, 76.5% correct) and then decayed for longer ISIs (see Fig. 4, top row). Performance appears to decay and converge after around 120–160 ms, consistent with a temporal integration window of around 100 ms as has been found in other tasks (for review, see³⁹). Nonetheless, the performance level remained significantly higher than in the single presentation condition, most likely due to probability summation (two independent chances to detect at least one target). Average performance with two targets beyond the temporal integration window (convergence level), as determined by the average over the range of 400–600 ms, was 64.8%.
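As a rough worked example of probability summation, the sketch below converts the single-presentation accuracy into a guessing-corrected "seen" probability and combines two independent chances. The high-threshold guessing model and the function name `two_chance_accuracy` are assumptions chosen for illustration; the source does not state which model, if any, the authors had in mind.

```python
def two_chance_accuracy(single_accuracy, guess_rate=0.5):
    """Predicted 2AFC accuracy given two independent looks at the target.

    ASSUMPTION: high-threshold model, where a target is either seen
    (correct answer) or not seen (guess with probability guess_rate).
    """
    seen_once = (single_accuracy - guess_rate) / (1.0 - guess_rate)
    seen_any = 1.0 - (1.0 - seen_once) ** 2
    return guess_rate + (1.0 - guess_rate) * seen_any


predicted = two_chance_accuracy(0.56)
print(round(predicted, 4))
```

With the reported single-presentation accuracy of 56.0%, this simple model predicts about 61.3% for two independent presentations, which is in the neighborhood of the 64.8% convergence level but should not be read as a fit to the data.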

The decay in performance for longer ISIs was found in both animal and vehicle trials. However, the overall level of discrimination performance differed between the two categories. The single presentation trials for the animal stimuli averaged 51.0% correct, while reaching 61.7% correct with the vehicle stimuli. For the dual presentation trials, the maximal performance for animal stimuli reached 72.5%, while reaching 81.5% with the vehicle stimuli. The apparent convergence level was determined by averaging the last 200 ms of the measurement interval, resulting in a convergence performance of 56.5% for the animal stimuli and 67.1% for the vehicle stimuli. To identify whether the convergence characteristics differed between stimulus classes, a decay function was fitted by least squares, with t being the ISI and k being the exponent determining the decay. Under the assumption that the performance indeed converges to a fixed level at longer ISIs, all data was first centered by subtracting the average over the last 200 ms interval, separately for animal and vehicle trials. To achieve robustness against noise, a representative distribution was generated by repeatedly (N = 10000) sampling full sets (N = 38) of random subjects and the decay function was fitted to the averages of the re-sampled sets. To minimize bias from compression, the resampled data was scaled to the common interval [0..1] prior to fitting. From this, distributions of exponents k were collected for both animal and vehicle trials. A paired t-test then confirmed that the distributions were significantly different, with the vehicle distribution reaching convergence approximately 35% faster (mean and 95% confidence intervals: animals: 1.078 [0.766 1.542], vehicles: 1.242 [0.861 1.859], t(9999) = 47.77, p < 0.0001). When arbitrarily defining the convergence threshold (the point where convergence is considered to be achieved) at 10% (5%) distance from the convergence level, the time of convergence was determined to be on average 125 ms (309 ms) for animal stimuli and 81 ms (199 ms) for vehicle stimuli. This provides some tentative evidence that the integration window for animal stimuli might be longer than that for vehicle stimuli.
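The resampling and fitting steps can be sketched in Python as below. The exact decay formula is not available in this text, so the exponential form `exp(-k * t)` with t in seconds is purely an assumption, as are the function names, the golden-section search (which presumes a single minimum of the squared error in k) and the synthetic subject data. The values of k produced here are therefore not comparable to the reported ones.

```python
import math
import random

ISI_S = [0.02 * i for i in range(31)]   # 0..0.6 s in 20 ms steps
TAIL = slice(20, 31)                    # ISIs of 400..600 ms


def decay(t, k):
    # ASSUMPTION: exponential decay; the source's formula is not available.
    return math.exp(-k * t)


def fit_k(curve, lo=0.0, hi=100.0, iters=60):
    """Least-squares estimate of k by golden-section search on [lo, hi]."""
    def sse(k):
        return sum((y - decay(t, k)) ** 2 for t, y in zip(ISI_S, curve))

    g = (math.sqrt(5) - 1) / 2
    a, b = lo, hi
    for _ in range(iters):
        c, d = b - g * (b - a), a + g * (b - a)
        if sse(c) < sse(d):
            b = d
        else:
            a = c
    return (a + b) / 2


def bootstrap_k(subject_curves, n_boot, rng):
    """Resample subjects with replacement and fit k to each resampled mean."""
    n = len(subject_curves)
    ks = []
    for _ in range(n_boot):
        sample = [rng.choice(subject_curves) for _ in range(n)]
        mean = [sum(col) / n for col in zip(*sample)]
        tail = mean[TAIL]
        level = sum(tail) / len(tail)            # convergence level
        centered = [m - level for m in mean]
        low, high = min(centered), max(centered)
        scaled = [(c - low) / (high - low) for c in centered]  # to [0, 1]
        ks.append(fit_k(scaled))
    return ks


# Synthetic data: 38 hypothetical subjects, true decay constant 10 per second.
rng = random.Random(2)
curves = [
    [0.65 + 0.12 * math.exp(-10.0 * t) + rng.gauss(0, 0.01) for t in ISI_S]
    for _ in range(38)
]
ks = bootstrap_k(curves, 200, rng)
mean_k = sum(ks) / len(ks)
print(round(mean_k, 2))
```

Running the same procedure separately on animal and vehicle trials would yield two distributions of k that can then be compared, as described in the text.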

To identify possible oscillatory components in the ISI time course, any FSOA-related oscillation was removed from the ISI data by subtracting the average of the centered FSOA results. The average centered FSOA time course of the data was computed for each subject and the result was subtracted from the individual trials depending on the respective FSOA timing. The data from each subject was then individually centered by subtracting the mean of the last 200 ms interval of the ISI time course. Afterwards, the data was fitted with a decay function (see above), which was then subtracted from the data to minimize spectral artifacts induced by the decay. The data was then considered centered (see Fig. 4, bottom row). Subsequently, the data were Fourier transformed, the amplitude spectra averaged and the peak of the average determined. At the location of the peak, a permutation test was performed, similar to the one above (see Fig. 5). On the pooled data, the peak did not achieve significance. However, when separating trials by target category, a significant peak was identified with the animal trials at 4.88 Hz (p = 0.0074). No such peak was found with the vehicle stimuli; in fact a small trend in the opposite direction (trough, rather than peak) emerged at this frequency.
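The two time courses are sampled at different step sizes, which limits the frequencies each analysis can resolve. The sketch below derives the frequency axes from the step sizes given in the Methods. The function name `frequency_axis` and the padded length of 1024 are assumptions, since the padding length is not stated in the text.

```python
def frequency_axis(step_s, n_pad):
    """Non-negative DFT frequencies (Hz) for samples step_s apart, padded to n_pad."""
    fs = 1.0 / step_s
    return [k * fs / n_pad for k in range(n_pad // 2 + 1)]


N_PAD = 1024                                # ASSUMPTION: hypothetical pad length
isi_axis = frequency_axis(0.020, N_PAD)     # ISI sampled every 20 ms
fsoa_axis = frequency_axis(0.060, N_PAD)    # FSOA sampled every 60 ms

print(isi_axis[-1], round(fsoa_axis[-1], 2))    # highest resolvable frequencies
print(round(isi_axis[100], 2), round(fsoa_axis[309], 2))
```

The ISI course (31 points, 20 ms apart) reaches up to 25 Hz with a native resolution of about 1.6 Hz, while the FSOA course (10 points, 60 ms apart) reaches only about 8.3 Hz with a native resolution of about 1.7 Hz. Zero-padding interpolates the spectrum on a finer grid but does not add resolution. With this hypothetical pad length, grid points fall at about 4.88 Hz (ISI axis) and 5.03 Hz (FSOA axis), which coincide with the reported peak locations, although the source does not confirm the padding actually used.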

Assuming that the oscillation is stimulus-driven, the oscillatory amplitude associated with each stimulus image should be dependent on the average detection performance of that stimulus. If a stimulus is detected correctly on every trial (100% hit ratio), there will be no remaining variability to exhibit an oscillatory pattern; the oscillation would be squashed against the ceiling. In the opposite case, if an individual stimulus is detected only with chance performance, the oscillation would be squeezed against the floor. Consequently, oscillatory amplitude should be strongest with stimuli that are detected with medium performance (about 75% hit ratio). To verify this, we first computed the average hit ratio across subjects for each animal stimulus. Each stimulus was then sorted into one of two equally sized groups (N = 150), depending on the difference of the average hit ratio achieved from the theoretical oscillatory optimum of 75% (see Fig. 6A). According to our hypothesis, the stimulus group with hit ratios close to 75% should result in larger Fourier amplitudes (Expected-High) than those with hit ratios close to 100% or 50% (Expected-Low). The mean of the resulting Fourier amplitudes of all 38 subjects, computed on each of the two stimulus groups separately, confirmed the hypothesis, as shown in Fig. 6B (Fourier amplitudes at 4.88 Hz: Expected-High, 1.22 ± 0.07; Expected-Low, 0.91 ± 0.08; mean ± 1 s.e.m.; paired t-test: t(37) = 2.87, p = 0.0067).
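The grouping step can be illustrated with a few lines of Python. The function name `split_by_distance`, the image identifiers and the hit ratios are hypothetical; only the rule (rank stimuli by their distance from 75% and split into two equal halves) comes from the description above.

```python
def split_by_distance(hit_ratios, optimum=0.75):
    """Split stimulus ids into the half nearest to and the half farthest from optimum."""
    ranked = sorted(hit_ratios, key=lambda s: abs(hit_ratios[s] - optimum))
    half = len(ranked) // 2
    return ranked[:half], ranked[half:]


# Hypothetical per-stimulus hit ratios, averaged across subjects.
hits = {"img_a": 0.74, "img_b": 0.98, "img_c": 0.80, "img_d": 0.52}
expected_high, expected_low = split_by_distance(hits)
print(expected_high, expected_low)
```

With the 300 animal images of the study, the same rule yields two groups of 150 stimuli each, on which the Fourier amplitudes can then be computed separately.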

In general, discrimination performance was higher with the vehicle stimuli than with the animal stimuli (81.5% vs. 72.5% at maximum, 67.1% vs. 56.5% at convergence). It may be that the vehicle stimuli were in fact easier to detect in the context of the neutral background image sequence; however, in a Continuous Flash Suppression (CFS) paradigm comparing these identical vehicle and animal images, no such difference was found in target detectability between classes³³. The difference in performance may therefore reflect a decision bias for this specific task.

Discussion

We employed a novel high-resolution RSVP-based dual-presentation paradigm to analyze the temporal dynamics of rapid visual object discrimination. Humans are known to be able to recognize the content of very briefly presented natural scenes², even when images are presented in very rapid succession^4,5. While these very rapidly presented scenes may not usually be consolidated in long-term memory⁴⁰, our results do suggest that, at least for a brief period, the stimulus is represented in a brief (iconic/sensory) memory, as the detection/discrimination performance is significantly higher during the first 120–160 ms of ISI, allowing for temporal integration. At a later time, this boost in performance dissipates, possibly because the intense masking effect of the ongoing background RSVP causes the iconic/sensory memory to decay. At longer time intervals, the performance converged on a level that may represent simple probability summation: at larger ISIs the performance increase relative to the single presentation trials may be the result of two independent chances to perceive the presented target.

The main finding of the present study is that we were able to identify two oscillatory signatures in behavioral performance for natural scenes. First, for single-target trials, we found a 5 Hz oscillation that was time locked to the Flash-Stimulus onset asynchrony (FSOA). A regular oscillation in perceptual threshold (independent of target category) may explain this peak in the Fourier spectrum, as has been reported with other visual paradigms at varying frequencies from 4 to 11 Hz^14,15,16,29. Our results show that the timing of this effect was aligned with the flash, which indicates that the flash displayed during our paradigm was capable of phase-resetting relevant rhythms in the brain.

Second, for double-target trials, we report a second 5 Hz oscillation in the dense sampling of performance for two animal targets as a function of the ISI between those two targets. In other words, participants were best at the task when the temporal separation of the first and second presentation of the target was in a multiple of around 200 ms, consistent with the idea of a “perceptual moment” in which information is combined⁴¹. The reason why we were successful in elucidating these oscillatory effects is to be found in the design of our paradigm. Firstly, we introduce a phase reset at a known time, which enables us to sort trials by their temporal distance from the phase reset. Secondly, we used a comparatively fine timescale in a dense sampling approach^{16,29,30,31,32}. Lastly, the tuning of the task difficulty to almost (but not quite) breaking point adjusted most trials to an optimal point which made the presence of oscillations in performance manifest.

It is interesting to note that the first 5 Hz (5.03 Hz) oscillatory signature (linked to the flash) was found only in the pooled data, while the second 5 Hz (4.88 Hz) oscillatory activity (linked to the first presentation of the target) was found only with the animal stimuli (see Figs 3 and 5). There are at least two possible reasons for this difference. The first, less theoretically interesting, possibility is a decision bias. In a 2AFC task, subjects are asked to decide between two alternatives (“was it A or was it B?”). This question can however be solved with a proxy task, by making a binary decision on just one of the two alternatives (“was it A or not? For if it was not A, it must have been B.”). For example, our subjects could perform the task “Did I see an animal or not?” rather than “Did I see an animal or a vehicle?”. If most subjects also employed a conservative strategy, then this would create a decision bias towards the vehicle stimuli, consistent with the different average hit ratios of the two stimulus groups (Fig. 4). More importantly, our subjects would have been comparing the visual impressions to only one internal decision criterion (“animal”) rather than two.

A second explanation would be that animal images are indeed special to the human visual system, as has been previously suggested^{21,22,23,24,25,26}. In this case, it would be possible for the feedback-driven oscillatory effect to be much stronger for the animal stimuli because a dedicated neural mechanism for vehicle stimuli simply may not exist, or a mechanism exists only in a more general, more variable or otherwise different fashion that was not revealed in our analysis.

Recurrent processing would be one plausible neural mechanism to explain this pattern of results. The first presentation of the animal target is processed along the ventral pathway. At some point along this processing path, feedback is generated and sent back to the earlier visual processing stages. If this feedback information arrives (for example at V1/V2) at precisely the same time as the new visual information resulting from the second target presentation, this second wave of information may optimally combine with the feedback from the first wave, resulting in an increased chance of successful target recognition. If the relevant feedback was generated only if the first target was already regarded by the visual system as a potential animal target (selective feedback), then such a feedback-driven oscillatory signature would only manifest itself with animal stimuli, not with vehicle stimuli. This temporally selective increase in task performance may then result in an oscillation of behavioral performance, as revealed in the time course of the behavioral performance recorded from our subjects. Such recurrent processing would be consistent with previous findings^11,12, including scene processing^8,10,42,43. This interpretation is also consistent with the idea of “perceptual echoes”, in which the presentation of a stimulus shows effects at regular intervals in later time periods in an oscillatory fashion⁴⁴.

Indeed, the proposal that perceptual cycles play a critical role in temporal integration of multiple samples of a stimulus, due to recurrent processing, provides a theoretical motivation for behavioral and neural oscillations in the theta range in humans. Each new sample of the world involves feedforward and re-entrant processing. In natural viewing, we sample the world via eye movements⁴⁵, hand movements⁴⁶ or shifts in attention^29,30,31,32 with an overt sampling of the world at a rate of 3–5 times per second^3,39,45. Using a flash to reset and align these oscillations in the laboratory setting (as in the flash condition here) is a useful methodology to uncover these fluctuations, but in natural viewing oscillations would more likely be tied to perceptual sampling, via feedforward and re-entrant processing. In the case of the dual stimulus paradigm used here, performance on a given trial would vary depending on whether the first stimulus evoked a change in oscillatory activity (related to a phase reset) or whether the two stimuli fell into the same perceptual cycle (without a phase reset).

In terms of alternative explanations for this data, we can exclude the role of an “attentional blink” or priming. Behavioral performance when two targets must be independently reported has been shown to exhibit an attentional blink approximately 180–450 ms after a first stimulus presentation^47,48, during which detection/identification of a second target is severely impaired. This effect appears to be most pronounced when the presented targets are spatially and featurally similar⁴⁹. In our paradigm however, we would not expect to find an attentional blink, since the second target was identical to the first and discrimination between the two target presentations was never required. To the contrary, most likely the temporal integration of the first and second target presentation was responsible for the initial increase in discrimination performance during the first 120–160 ms.

The overall pattern of results also speaks against a more general priming effect as the underlying mechanism. While oscillatory activity in a compatible frequency range has been shown in priming paradigms³⁰, our results differ in that we also obtained an oscillatory signature with the single-presentation condition, in which the flash could only serve as a neutral prime. However, Huang et al. found no oscillatory activity with neutral primes. Additionally, there would be no obvious reason why such priming-induced oscillations should only appear with animal stimuli, but not vehicles.

In summary, these results suggest oscillatory dynamics in the detection of real-world stimuli. Perceptual systems must solve the problem of how to create discrete objects and events out of a continuous flow of sensory information (for review, see³⁹). Our findings are consistent with the idea that perceptual systems solve this problem by discretizing sensory input into perceptual units or cycles, alternating between states of higher and lower sensitivity to new input and allowing for recurrent processing. The current findings indicate that such oscillatory effects are not mere fringe effects relevant only with simple stimuli, but instead are resultant from the core mechanisms of visual processing and may well be manifest even in real-life scenarios with natural scenes.

Additional Information

How to cite this article: Drewes, J. et al. Dense sampling reveals behavioral oscillations in rapid visual categorization. Sci. Rep. 5, 16290; doi: 10.1038/srep16290 (2015).

References

Bacon-Macé, N., Macé, M. J.-M., Fabre-Thorpe, M. & Thorpe, S. J. The time course of visual processing: backward masking and natural scene categorisation. Vision Res 45, 1459–1469 (2005).
Article Google Scholar
Thorpe, S., Fize, D. & Marlot, C. Speed of processing in the human visual system. Nature 381, 520–522 (1996).
Article CAS ADS Google Scholar
Crouzet, S. M., Kirchner, H. & Thorpe, S. J. Fast saccades toward faces: face detection in just 100 ms. J Vis 10, 16, 1–17 (2010).
Article Google Scholar
Potter, M. C. Short-term conceptual memory for pictures. Journal of Experimental Psychology: Human Learning and Memory 2, 509–522 (1976).
CAS Google Scholar
Potter, M. C., Wyble, B., Hagmann, C. E. & McCourt, E. S. Detecting meaning in RSVP at 13 ms per picture. Atten Percept Psychophys 1–10 (2014). 10.3758/s13414-013-0605-z
Serre, T., Oliva, A. & Poggio, T. A feedforward architecture accounts for rapid categorization. Proc Natl Acad Sci USA 104, 6424–6429 (2007).
Article CAS ADS Google Scholar
VanRullen, R. & Thorpe, S. J. Is it a bird? Is it a plane? Ultra-rapid visual categorisation of natural and artifactual objects. Perception 30, 655–668 (2001).
Article CAS Google Scholar
Camprodon, J. A., Zohary, E., Brodbeck, V. & Pascual-Leone, A. Two Phases of V1 Activity for Visual Recognition of Natural Images. J Cogn Neurosci 22, 1262–1269 (2010).
Article Google Scholar
Koivisto, M., Kastrati, G. & Revonsuo, A. Recurrent processing enhances visual awareness but is not necessary for fast categorization of natural scenes. J Cogn Neurosci 26, 223–231 (2014).
Article Google Scholar
Koivisto, M., Railo, H., Revonsuo, A., Vanni, S. & Salminen-Vaparanta, N. Recurrent Processing in V1/V2 Contributes to Categorization of Natural Scenes. J. Neurosci. 31, 2488–2492 (2011).
Article CAS Google Scholar
Lamme, V. A. & Roelfsema, P. R. The distinct modes of vision offered by feedforward and recurrent processing. Trends Neurosci 23, 571–579 (2000).
Article CAS Google Scholar
Boehler, C. N., Schoenfeld, M. A., Heinze, H.-J. & Hopf, J.-M. Rapid recurrent processing gates awareness in primary visual cortex. Proc. Natl. Acad. Sci. USA 105, 8742–8747 (2008).
Article CAS ADS Google Scholar
Pascual-Leone, A. & Walsh, V. Fast backprojections from the motion to the primary visual area necessary for visual awareness. Science 292, 510–512 (2001).
Article CAS ADS Google Scholar
VanRullen, R., Busch, N., Drewes, J. & Dubois, J. Ongoing EEG phase as a trial-by-trial predictor of perceptual and attentional variability. Front. Psychology 2, 60 (2011).
CAS Google Scholar
Busch, N. A., Dubois, J. & VanRullen, R. The Phase of Ongoing EEG Oscillations Predicts Visual Perception. J Neurosci 29, 7869–7876 (2009).
Article CAS Google Scholar
Fiebelkorn, I. C. et al. Ready, Set, Reset: Stimulus-Locked Periodicity in Behavioral Performance Demonstrates the Consequences of Cross-Sensory Phase Reset. J Neurosci 31, 9971–9981 (2011).
Article CAS Google Scholar
Callaway, E. & Yeager, C. L. Relationship between reaction time and electroencephalographic alpha phase. Science 132, 1765–1766 (1960).
Article ADS Google Scholar
Drewes, J. & VanRullen, R. This Is the Rhythm of Your Eyes: The Phase of Ongoing Electroencephalogram Oscillations Modulates Saccadic Reaction Time. J Neurosci 31, 4698–4708 (2011).
Article CAS Google Scholar
Dustman, R. E. & Beck, E. C. Phase of alpha brain waves, reaction time and visually evoked potentials. Electroencephalogr Clin Neurophysiol 18, 433–440 (1965).
Article CAS Google Scholar
Lansing, R. W. Relation of brain and tremor rhythms to visual reaction time. Electroencephalogr Clin Neurophysiol 9, 497–504 (1957).
Article CAS Google Scholar
Crouzet, S. M., Joubert, O. R., Thorpe, S. J. & Fabre-Thorpe, M. Animal detection precedes access to scene category. PLoS ONE 7, e51471 (2012).
Article CAS ADS Google Scholar
Mahon, B. Z., Anzellotti, S., Schwarzbach, J., Zampini, M. & Caramazza, A. Category-Specific Organization in the Human Brain Does Not Require Visual Experience. Neuron 63, 397–405 (2009).
Article CAS Google Scholar
Mormann, F. et al. A category-specific response to animals in the right human amygdala. Nat Neurosci 14, 1247–1249 (2011).
Article CAS Google Scholar
New, J., Cosmides, L. & Tooby, J. Category-specific attention for animals reflects ancestral priorities, not expertise. PNAS 104, 16598–16603 (2007).
Article CAS ADS Google Scholar
Öhman, A. Has evolution primed humans to ‘beware the beast’? PNAS 104, 16396–16397 (2007).
Article ADS Google Scholar
Yang, J. et al. Distinct processing for pictures of animals and objects: Evidence from eye movements. Emotion 12, 540–551 (2012).
Article CAS Google Scholar
Brandt, M. E. Visual and auditory evoked phase resetting of the alpha EEG. International Journal of Psychophysiology 26, 285–298 (1997).
Tass, P. A. Desynchronization of brain rhythms with soft phase-resetting techniques. Biol Cybern 87, 102–115 (2002).
Landau, A. N. & Fries, P. Attention samples stimuli rhythmically. Curr. Biol. 22, 1000–1004 (2012).
Huang, Y., Chen, L. & Luo, H. Behavioral Oscillation in Priming: Competing Perceptual Predictions Conveyed in Alternating Theta-Band Rhythms. J. Neurosci. 35, 2830–2837 (2015).
Fiebelkorn, I. C., Saalmann, Y. B. & Kastner, S. Rhythmic Sampling within and between Objects despite Sustained Attention at a Cued Location. Current Biology 23, 2553–2558 (2013).
Song, K., Meng, M., Chen, L., Zhou, K. & Luo, H. Behavioral Oscillations in Attention: Rhythmic α Pulses Mediated through θ Band. J. Neurosci. 34, 4837–4844 (2014).
Zhu, W., Drewes, J. & Gegenfurtner, K. R. Animal Detection in Natural Images: Effects of Color and Image Database. PLoS ONE 8, e75816 (2013).
Delorme, A., Richard, G. & Fabre-Thorpe, M. Ultra-rapid categorisation of natural scenes does not rely on colour cues: A study in monkeys and humans. Vision Res 40, 2187–2200 (2000).
Macé, M. J.-M., Thorpe, S. J. & Fabre-Thorpe, M. Rapid categorization of achromatic natural scenes: how robust at very low contrasts? Eur. J. Neurosci 21, 2007–2018 (2005).
Willenbockel, V. et al. The SHINE toolbox for controlling low-level image properties. J Vis 10, 653–653 (2010).
Watson, A. B. & Pelli, D. G. QUEST: A Bayesian adaptive psychometric method. Percept Psychophys 33, 113–120 (1983).
Brainard, D. H. The Psychophysics Toolbox. Spatial Vision 10, 433–436 (1997).
Wutz, A. & Melcher, D. The temporal window of individuation limits visual capacity. Front. Psychol. 5, 952 (2014).
Subramaniam, S., Biederman, I. & Madigan, S. Accurate identification but no priming and chance recognition memory for pictures in RSVP sequences. Visual Cognition 7, 511–535 (2000).
VanRullen, R. & Koch, C. Is perception discrete or continuous? Trends Cogn. Sci. (Regul. Ed.) 7, 207–213 (2003).
Wokke, M. E., Sligte, I. G., Steven Scholte, H. & Lamme, V. A. F. Two critical periods in early visual cortex during figure-ground segregation. Brain Behav 2, 763–777 (2012).
Wokke, M. E., Vandenbroucke, A. R. E., Scholte, H. S. & Lamme, V. A. F. Confuse Your Illusion: Feedback to Early Visual Cortex Contributes to Perceptual Completion. Psychological Science 24, 63–71 (2013).
VanRullen, R. & Macdonald, J. S. P. Perceptual Echoes at 10 Hz in the Human Brain. Curr Biol (2012). doi:10.1016/j.cub.2012.03.050.
Wutz, A., Muschter, E., van Koningsbruggen, M. & Melcher, D. Saccades reset temporal integration windows. Journal of Vision 14, 584–584 (2014).
Tomassini, A., Spinelli, D., Jacono, M., Sandini, G. & Morrone, M. C. Rhythmic Oscillations of Visual Contrast Sensitivity Synchronized with Action. J. Neurosci. 35, 7019–7029 (2015).
Raymond, J. E., Shapiro, K. L. & Arnell, K. M. Temporary suppression of visual processing in an RSVP task: an attentional blink? J Exp Psychol Hum Percept Perform 18, 849–860 (1992).
Shapiro, K. L., Raymond, J. E. & Arnell, K. M. The attentional blink. Trends in Cognitive Sciences 1, 291–296 (1997).
Raymond, J. E., Shapiro, K. L. & Arnell, K. M. Similarity determines the attentional blink. J Exp Psychol Hum Percept Perform 21, 653–662 (1995).

Acknowledgements

We thank Lorilei Alley and Maddalena Costanzo for their help with data collection. This research was supported by a European Research Council (ERC) grant (grant agreement no. 313658) to DM. WZ was supported by National Natural Science Foundation of China grants (62263042, 61005087) and a China Scholarship Council grant. This collaboration was also supported by the Chinese State Administration of Foreign Experts Affairs (GDT20155300084) to JD and DM.

Author information

Jan Drewes and Weina Zhu contributed equally to this work.

Authors and Affiliations

Center for Mind/Brain Sciences (CIMeC), University of Trento, Corso Bettini 31, Rovereto TN, 38068, Italy
Jan Drewes, Andreas Wutz & David Melcher
School of Information Science, Yunnan University, Cuihu Beilu, Kunming, 650091, China
Weina Zhu
Kunming Institute of Zoology, Chinese Academy of Sciences, 32 Jiaochang Donglu, Kunming, 650223, China
Weina Zhu

Authors

Jan Drewes
Weina Zhu
Andreas Wutz
David Melcher

Contributions

All authors designed the research. J.D. and W.Z. performed the research. J.D., W.Z. and A.W. analyzed the data. J.D. prepared the figures. J.D. and D.M. wrote the main manuscript text. All authors reviewed the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Cite this article

Drewes, J., Zhu, W., Wutz, A. et al. Dense sampling reveals behavioral oscillations in rapid visual categorization. Sci Rep 5, 16290 (2015). https://doi.org/10.1038/srep16290

Received: 09 July 2015
Accepted: 06 October 2015
Published: 06 November 2015
DOI: https://doi.org/10.1038/srep16290
