Training improves visual processing speed and generalizes to untrained functions

Lev, Maria; Ludwig, Karin; Gilaie-Dotan, Sharon; Voss, Stephanie; Sterzer, Philipp; Hesselmann, Guido; Polat, Uri

doi:10.1038/srep07251

Download PDF

Article
Open access
Published: 28 November 2014

Training improves visual processing speed and generalizes to untrained functions

Maria Lev¹^na1,
Karin Ludwig^2,3^na1,
Sharon Gilaie-Dotan⁴^na1,
Stephanie Voss²^na1,
Philipp Sterzer²^na1,
Guido Hesselmann²^na1 &
…
Uri Polat¹^na1

Scientific Reports volume 4, Article number: 7251 (2014) Cite this article

8693 Accesses
26 Citations
56 Altmetric
Metrics details

Subjects

Abstract

Studies show that manipulating certain training features in perceptual learning determines the specificity of the improvement. The improvement in abnormal visual processing following training and its generalization to visual acuity, as measured on static clinical charts, can be explained by improved sensitivity or processing speed. Crowding, the inability to recognize objects in a clutter, fundamentally limits conscious visual perception. Although it was largely considered absent in the fovea, earlier studies report foveal crowding upon very brief exposures or following spatial manipulations. Here we used GlassesOff's application for iDevices to train foveal vision of young participants. The training was performed at reading distance based on contrast detection tasks under different spatial and temporal constraints using Gabor patches aimed at testing improvement of processing speed. We found several significant improvements in spatio-temporal visual functions including near and also non-trained far distances. A remarkable transfer to visual acuity measured under crowded conditions resulted in reduced processing time of 81 ms, in order to achieve 6/6 acuity. Despite a subtle change in contrast sensitivity, a robust increase in processing speed was found. Thus, enhanced processing speed may lead to overcoming foveal crowding and might be the enabling factor for generalization to other visual functions.

Visual perceptual learning modulates microsaccade rate and directionality

Article Open access 02 October 2023

Shao-Chin Hung, Antoine Barbot & Marisa Carrasco

Feature-based attention enables robust, long-lasting location transfer in human perceptual learning

Article Open access 06 July 2021

Shao-Chin Hung & Marisa Carrasco

Specificity and retention of visual perceptual learning in young children with low vision

Article Open access 01 June 2020

Bianca Huurneman, F. Nienke Boonstra & Jeroen Goossens

Introduction

Contextual modulation is a general phenomenon that relates to changes in the perceived appearance of targets or objects when they are presented within the context of other targets or objects. Some well-known types of contextual modulations are visual masking (including center-surround), crowding, grouping and several types of contextual illusions. However, most research interest has focused on masking (spatial and spatio-temporal) and crowding; both phenomena refer to reduced performance on a target stimulus when the mask stimulus is presented within a small spatio-temporal window^{1,2,3,4,5,6,7,8}.

Perceptual learning has a major influence on our understanding of the development and plasticity of visual processes such as masking and crowding. It is considered to be highly specific to the particular characteristics of the stimuli used during training (e.g., the location in the visual field and orientation), which is thought to reflect encoding in early visual areas^{9,10,11,12,13}. However, recent studies show that learning and transfer may depend on several training properties such as the task, attention, difficulty and the paradigm's manipulations such as training on two tasks simultaneously, the sequence of stimulus presentation (roving vs. fixed stimuli), among others^{11,14,15,16,17,18,19,20,21,22,23,24,25}. Some insight into the mechanism underlying learning comes from lateral masking experiments²⁶. In such experiments, when participants are trained to detect a low-contrast Gabor target embedded between two similar Gabor flankers, higher sensitivity to the target in the presence of flankers compared with that of the target alone (termed the facilitation effect) and an expansion of the target-flanker distance that induced facilitation are observed. These effects are found only when the target and flankers have the same orientation and are positioned along a collinear direction^27,28,29. The lateral facilitation effect is largely explained and modeled in terms of spatial processing such as a) the propagation of lateral excitation from the flankers through the horizontal connections in the primary visual cortex^{1,26,27,29,30,31}, b) contrast integration of the flankers and the target within large simple³², or c) complex³³ receptive fields. Quantitative models suggest that the flanker effects are multiplicative terms applied to both the excitatory and inhibitory terms of a divisive inhibition response function^34,35. Top-down modulation of the target response was also considered^36,37,38. Another study shows that similar training shortens the processing time needed for target detection³⁹. More specifically, it suggests that practice involving targeting the improvement of the spatial and temporal lateral interactions increases the efficacy of the lateral interactions between neighboring neurons and improves the processing speed; hence, it enables the practice-based improvement to be generalized to other untrained visual functions^16,40.

Studies have shown that training effects on lateral interactions can be generalized to non-trained visual functions such as visual acuity^41,42, contrast sensitivity^16,41,42,43, contrast discrimination⁴⁰ and reading speed⁴⁰. However, most of these results were obtained for impaired vision following abnormal visual development such as amblyopia^41,42,43, developmental visual form agnosia⁴⁴, or in the case of blurred retinal inputs in the aging eye (presbyopia)⁴⁰. It was shown that the extent of the improvement is proportional to the initial level of the visual function^42,45. Thus, these remarkable improvements may be found only in cases of impaired visual functions that lead to initial sub-normal vision. In addition, the generalization of these effects might critically depend on the initial (pre-training) sub-normal vision. A similar procedure, when applied to young participants with normal vision, resulted in reduced backward masking effects, shortened reaction times and shortened latencies of an EEG component that is thought to reflect visual integration³⁹.

Visual information processing takes time, whether for simple tasks such as target detection or for more complex tasks such as reading, searching, or object tracking. Thus, in order to enable appropriate behavior, processing at all stages must be coordinated in time and completed within a limited time window⁴⁶. It was shown that categorization of visual images involves several stages, with increasing time needed to process the information, e.g., fast for the early detection processing stage and longer for the later identification stage⁴⁷. Visual information processing may be compromised if any of the processing stages are inefficient, for example, due to noisy retinal input⁴⁰, slow neural processing¹⁶, masking^2,48, or crowding^3,4,5,49. Thus, improved processing speed through perceptual learning may enable a processing gain within the limited time window and lead to the observed generalization of the training effect to many untrained visual functions^16,40 including the transfer from contrast detection (masking) to letter identification (crowding).

The relationship between masking (spatial and spatio-temporal) and crowding (letter acuity)

Both masking and crowding include a situation in which the reaction to a target stimulus is deteriorated by other stimuli, called masks. In crowding the surrounding masks are usually presented simultaneously with the central stimulus and in the case of masking, the mask can appear before the stimulus (forward masking), after the stimulus (backward masking), or also simultaneously, as in crowding.

The literature on masking distinguishes between pattern masking (the mask and target presented at the same retinal location) and lateral masking (the mask location does not overlap with the target location)^1,5. Likewise, the crowding effect is measured when the target and flankers are not overlapping; thus it parallels the lateral masking measurements. Since both crowding and lateral masking share similar properties such as dependency on the distance between the target and flankers (spacing) and an increase of the effect with increasing eccentricity, some studies suggest that masking and crowding are related^1,50,51,52 and some even view crowding as a type of masking^1,49,52,53.

On the other hand, visual crowding extends throughout large parts of the visual field^3,4,54,55 (mostly found in peripheral vision but in some studies it has been found in the normal fovea^56,57,58 and in the foveal region of people with strabismic amblyopia^3,59) and – compared to lateral masking – up to longer distances between the target and flankers. Furthermore, since masking is assumed to affect the detection level (the stimulus is rendered invisible) and crowding is assumed to affect the identification level (the stimulus can be detected but not identified), the general view, supported by many studies, considers crowding to be a different process than ordinary masking, especially in the periphery^3,4,5,55,60.

Recently it was shown that young adults with normal foveal vision exhibit crowding for very short presentation times or when the availability of the stimulus is limited by backward masking⁴⁹, indicating that processing of targets under crowding conditions requires a longer processing time. Therefore, here we hypothesize that increasing the processing speed can lead to reduced crowding effects. In this study we investigated how perceptual learning affects the visual processing of healthy young people using the GlasseOff application, which is used to improve vision in presbyopia⁴⁰. In a study with presbyopes, using this technique, it was shown that training, which focused on improving spatio-temporal processing by strengthening lateral interactions, resulted in improved visual performance. More specifically, it enabled the participants to read smaller font sizes and to increase their reading speed and thus to overcome and/or delay some disabilities imposed by the aging eye. This improvement was achieved without changing the optical characteristics of the eye. It was shown that visual acuity deteriorates when the presentation time is shortened⁶¹. In the current study we determined whether the training on contrast detection of a Gabor target, under conditions that pose limitations on the processing time, leads to generalization and hence to an improvement in spatial and temporal visual functions such as letter recognition under crowding conditions with a short presentation time.

Our second aim in this study was to determine whether training on near distance will transfer to improvement in visual functions tested at far distances. It is generally thought that perception is invariant to the viewing distance if the retinal image size is the same (retinal spatial frequency). However, this notion of distance invariance is surprising, given that early and recent studies^62,63 have consistently shown lower visual resolution for near rather than for far viewing and that this difference is related to the difference in the accommodation power needed for fixation from far to near viewing. This is further supported by a study that contradicts the basic assumption of distance-invariant perception and shows that perception of retinal spatial frequency might be affected by the context⁶⁴.

We believe that investigating this issue will provide very useful information for future experiments, for example, about the appropriateness of collecting data using near presentations and hand-held devices. Thus, here we examined whether training on tasks involving fixation for near viewing (hence, involving accommodation) transfers to visual tasks involving far viewing and whether the same visual mechanisms process these different tasks. This transfer of improvements between the two domains is not trivial and has not been previously reported.

Results

Spatial processing: contrast detection and lateral masking (Gabor targets)

We measured several distance visual functions on a PC screen (with a viewing distance of 150 cm) before and after near vision training (detecting Gabor targets, 1.3 to 8 cycles per degree, [cpd]) from a 40 cm viewing distance using personal iDevices (iPhones or iPods) to determine whether training from near viewing transfers to distant visual functions.

We found that distant contrast sensitivity, i.e. the ability to detect a target at low contrasts, significantly improved after training, as displayed in Figure 1c. A 2-way ANOVA with factors training (pre vs. post) and spatial frequency (5, 6.5, 8.5 and 13 cpd) revealed a significant main effect of training (F(1,13) = 9.215, p = 0.0096) driven by improvement at spatial frequencies of 5, 6.5 and 13 cpd (post-hoc paired 2-tailed t-tests for 5, 6.5 and 13 cpd, respectively: t(13) = 4.19, p* = .0011; t(13) = 2.735, p = .017; t(13) = 2.198, p = .047; *significant at Bonferroni corrected alpha level = .0125).

Previous studies showed that practice increases the range of the lateral interactions (the distance up to which the presence of flankers modulates the target detection threshold), but only when the flankers are collinear with the target^26,65. This finding suggests that practice increases the efficacy between neighboring neurons along the collinear direction, an effect that enables connectivity with remote neurons via a cascade of local interactions. Previous studies also show that training does not improve the sensitivity to the target alone⁶⁵ when the training is limited to one spatial frequency. Here we investigated how lateral interactions at distant vision (from 1.5 m) are modulated by near vision training (from 40 cm) when the training included spatial frequencies between 2 and 8 cpd and target-flanker separations of 1.5, 2, 3 and 4 wavelengths (λ) during the training. We tested lateral interactions before and after training at a spatial frequency of 6.5 cpd, which was identical for the near vision training and for the far distance pre and post training testing sessions; this is a frequency at which performance is typically neither at floor nor at ceiling levels. We found that the sensitivity to detect a distant target (from 1.5 m) when it is embedded in collinear flankers increased significantly following the near vision training (see Figure 1d). A 2-way ANOVA with factors training (pre vs. post) and target-flanker separations (4, 3, 2 and 1.5 λ) revealed a significant main effect of training (F(1,13) = 8.25, p = .0131), an expected main effect of separation (F(3,39) = 58.9; p < 10⁻⁴) and no interaction (F(3,39) < 1; p = 0.54). Post-hoc t-tests revealed that the improvement following training resulted from a significant improvement in the 4λ target-flanker separation (2-tailed paired t(13) = 3.15, p = .0076, with Bonferroni corrected alpha level = .0125), a trend for improvement in 3λ (t(13) = 1.702, p = 0.11), whereas the other target-flanker separation showed no significant improvements (all t's < 1.43, p's > 0.17).

The results presented in Figure 1e are in line with the typical effects of target detection modulation, namely, collinear facilitation (the presence of collinear flankers improves target detection, above the y = 0 line) at 3 and 4λ as well as collinear suppression (reduced target detection in the presence of collinear flankers, below the y = 0 line) at 1.5λ¹. However, after training, unlike previous findings²⁶, there was no significant change in the modulation effects. The lack of a significant change in the modulation effects is due to a parallel improvement in the sensitivities to the target alone (Figure 1c) and the target within the collinear configuration (Figure 1d). Here the participants were trained on the tested parameters (spatial frequency, orientation and target-mask separations) for a very limited number of trials (1–2 blocks) and sessions (only 2) before they moved on to the next parameters, whereas in the previous studies the participants were extensively trained at the same spatial frequency and orientation²⁶. This short training per stimulus feature may prevent deterioration within a session⁶⁶ and enable transfer between different tasks. Moreover, here we show improvement in the target-alone condition in parallel with improvement under the lateral masking condition. This effect may be due to training on a wide range of spatial frequencies and orientations⁴⁰, whereas the previous studies used only one spatial frequency and orientation^26,65. However, here, owing to the parallel increase in the sensitivity to the target under both conditions, we did not observe an appreciable effect of enhanced facilitation.

Temporal processing: backward masking (Gabor targets)

Previous studies showed that presenting collinear masks after the collinear flankers and the target (lateral masking) abolishes the facilitation effect^1,67. Consistent with these results, Figure 2 shows that the effect of suppression by backward masking is larger for short inter-stimulus intervals (ISIs) and that it decreases with longer ISIs. Figure 2b shows the reduced thresholds of target detection with training (pre vs. post) as a function of increasing the length of the ISIs (60, 90, 120 and 150 ms). A 2-way ANOVA with the factors training (pre vs. post) and ISI revealed a significant main effect of training (F(1,13) = 11.4, p = .0049), a main effect of ISI (F(3,39) = 5.88, p = .0021) and a significant interaction (F(3,39) = 3.803, p = .0175), resulting from the significant improvement in the short ISIs (post-hoc t-tests for 60 ms: t(13) = 3.84, p = .002, 90 ms: t(13) = 2.38, p = .03). Figures 2b and 2c show the effect of training on the threshold change in the target that was presented with the two flankers (lateral masking), followed by backward masking of the two flankers. Figure 2b presents the unnormalized data (contrast detection thresholds (log units)) and Figure 2c shows the data as threshold elevations (normalized to the contrast detection threshold without backward masking (but with lateral flankers)). After training (blue line, filled circles), the backward masking effect was significantly reduced only for the short ISIs. A 2-way ANOVA with training and ISI as factors revealed no main effect of training (F(1,13) = 1.704, p = .214), a main effect of ISI (F(3,39) = 5.885, p = .0021) and a significant interaction (F(3,39) = 3.8, p = .0175). Here too, this effect was revealed due to the large reduction for the shortest ISI (from 0.4 to 0.15 log units, 78%, ISI = 60 ms, 2-tailed, p = 0.0029; for all other ISIs, 2-tailed, p > .18), reaching almost a “flat” level across ISIs. Before training, the backward masking effect for short ISIs of 60 and 90 ms was significantly different from the one resulting from longer ISIs of 120 and 150 ms (2-tailed p < 0.0223). However, after training the performance for the shorter ISIs improved and became as good as for the longer ISIs (not significantly different from longer ISIs). The results, presented in Figure 2, show that the slope after training has changed, indicating that after training the participants were able to process the information much faster and could overcome backward masking effects. This result supports our hypothesis that our training leads to improved processing speed.

Temporal processing: visual acuity under temporal crowded conditions (E letters)

Before and after training we also measured crowding (the crowded condition) as a function of presentation time (30, 60, 120 and 240 ms) using E letters on an iPod from a distance of 40 cm. The results, presented in Figure 3b, show that the effect of crowding by E letters is similar to the effect of temporal masking by Gabors (cf. with Figures 2b and 2c). To measure crowding, an E target is embedded in a matrix of randomly oriented E letters, with 0.4 inter-letter spacing. An adaptive method was used for measuring the smallest E for which the direction in which it is facing can be identified. The y axis denotes visual acuity in LogMAR (the minimal angle of resolution) units, where 0 denotes a visual acuity of 6/6 (a log minimal angle of 1). Before training, the results showed significant crowding for short presentation times of 30 and 60 ms (p = 0.022), which decreased with increasing presentation time. The crowding was significantly reduced for stimulus durations of 120, 60 and 30 ms. A 2-way ANOVA with the factors training (pre vs. post) and stimulus durations (30, 60, 120 and 240 ms) showed a main effect of training (F(1,13) = 24.342, p = .0003) and a main effect of duration ((F(3,39) = 30.098 p < .0001). For the 30 ms presentation time, the crowding was reduced from 0.26 to 0.09 log units (41%), for 60 ms, from 0.2 to 0.04 (45%) and for 120 ms, from 0.1 1 to 0.01 (26%). The interaction was marginally significant (F(3,39) = 2.75, p = .056). Note also that after training, the participants achieved a better than normal vision level of 6/6 at about 240 ms. Very interestingly, the participants were able to isolate the target faster: as Figure 3b shows, before training they were able to identify a crowded letter equivalent to a visual acuity of 6/6 (0 LogMar) in about 240 ms, whereas after training they almost reached this level in about 120 ms. When this effect was calculated for each participant, the change was from 204 to 123 ms (Figure 3c). This slope change supports the notion that the training led to an improvement in the processing speed and not to an improvement in sensitivity per se.

To test whether this improvement was merely due to a test-retest effect, we tested a control group (n = 19) on this task in two sessions spaced apart as the duration of the training. A 3-way ANOVA on temporal visual acuity with the between-subject factor group (training, control), the within-subject factors testing session (pretest, posttest) and the duration revealed an unsurprisingly significant effect of duration (F(3,90) = 74.64, p < .001), a significant effect of group (F(1,30) = 40.49, p < .001), a significant effect of testing session (F(1,30) = 10.74, p = .003), and, importantly, a significant interaction between the group and the testing session (F(1,30) = 17.52, p < .001). These results, presented in Figure 3, show that there was no significant learning effect in the control group because their scores on this temporal visual acuity test did not change from the first to the second session (2-way ANOVA on the control group's temporal visual acuity with the duration and testing sessions revealed no significant effect of testing session: F(1,17) < 1, p = .498 and no interaction between duration and testing session: F(3,51) < 1, p = .721). Thus, we contend that the significant improvements reported here are due to the training (a significant effect of the testing session in the group receiving training: F(1,13) = 24.34, p < .001) and not due to test-retest effects.

Spatial processing; Improvement of static visual acuity as measured on ETDRS clinical charts

Previous studies reported that the near visual acuity is significantly worse than the far visual acuity^62,63. Here we measured the visual acuity of all participants on near (40 cm) and far (3 meters) ETDRS clinical charts, for the training group, before and after the training and for the control group in the first and second testing sessions (see above). In the first testing session, the average near visual acuity of all participants (N = 33, −0.1 ± 0.01 (SE) LogMAR (1 line better than 6/6)) was significantly worse than their far visual acuity (−0.15 ± 0.01 (1.5 lines better than 6/6); far vs. near 2-tailed paired t-test: t(31) = 2.95, p < .006) by 12%. This effect further supports our study's aim to explore whether the mechanisms processing near and far vision are the same. Thus, our results suggest that the distance-invariant notion is more complex than the received view and that vision and visual acuity may be affected not only by the physical image present on the retina but also by the distance of the image.

In order to examine the effect of training on visual acuity, we ran a 3-way ANOVA with the between-subject factor group (training, control), the within-subject factors testing session (pre, post) and the VA-measurement distance (near, far). Interestingly, we found a significant effect of VA distance (F(1,31) = 11.44, p = .002) and a tendency towards a three-way interaction (F(1.31) = 2.36, p = .071). Post-hoc analysis revealed that prior to training the visual acuities of the two groups were not significantly different (far: t(31) = 1.16, p = .256; near: t(31) = 1.57, p = .127; two sample t-test). However, after training, the near visual acuity of the trained group improved slightly but significantly (7%, t(13) = 3.85, p = .002, paired t-test), whereas that of the control group did not (t(18) < 1, p > .7). The far visual acuity of both groups remained unchanged (t's < 0.91, p's > 0.38). The significant difference between the far and near VA that was evident in the trained group prior to training was no longer present following training (t(13) = 0.78, p > .44).

This effect of specific improvement in near visual acuity (40 cm), which did not transfer to far visual acuity (3 meters), may suggest that the improvement is due to the training on near visual tasks that did not transfer to far visual acuity, suggesting that the visual processing involved in the spatial processing of letter resolution for near is different from that of far visual acuity. However, we noted that the training for near (40 cm) did transfer to improved detection of far Gabor targets, as measured on a PC (1.5 m). Thus, a conclusion regarding the transfer of improvement between distances may be confounded by the possibility that the far visual acuity was already very good and may have reached nearly the best level (the ceiling effect), thus not enabling further improvement. It was shown that the extent of improvement, in particular, visual functions is proportional to the initial level of these visual functions before training^42,45. Thus, further studies may consider designing a study in which this issue is tested with populations with reduced far visual acuity to enable improvement.

Discussion

Here we trained young adults with normal or corrected to normal vision using a visual paradigm that combined spatial and temporal Gabor detection tasks at near vision. We found that visual improvements were not specific to the trained tasks and that they generalized to other non-trained visual functions such as detection under crowded conditions and importantly, to far vision (1.5 meters). Although these results are consistent with previous results in atypical vision showing generalization of improvements^41,44, this is the first study to show generalization of improvements in normal young adult vision, including several novel effects of perceptual learning that are discussed next.

Faster temporal processing for detection (Gabors) and identification (letter crowding)

A previous study suggested that visual improvements following perceptual learning may result from improved contrast sensitivity and/or processing speed³⁹. Here we directly tested the improvement in spatial and temporal processing. We found robust temporal improvements (a gain of 81 ms in the processing of letter acuity) despite only subtle improvement in contrast sensitivity. Therefore, our results provide evidence favoring an alternative explanation, namely, that the improvements following visual training are due to faster processing of visual information, together with a reduction of crowding and masking effects. Recently, it was emphasized that crowding is an essential bottleneck in perceptual and perceptual processing^3,4. Since the processing of visual information takes time, in order to mediate relevant behavior, the processing must be completed within a limited time window. Thus, the gain in temporal processing speed may enable one to overcome the bottleneck of crowding and may provide a better stream of visual information for perceptual processing; thus, it may improve cognitive functions such as decision-making³⁹ and reading⁴⁰. A previous study showed that visual recognition, as measured by letter size (visual acuity), takes more time with decreasing letter size⁶¹. Here we show that before training the participants needed 204 ms to recognize a letter (of a size that leads to 6/6), but that they were able to do so within only 123 ms following training. Interestingly, following training, at 240 ms they reached a better than normal vision level. This result further supports our hypothesis that improved processing speed may underlie the generalization of improvement in many visual functions⁴⁰.

There has been some controversy about the nature, size and even the existence of crowding in the fovea^{3,4,5,57,58,68}. Here we show robust foveal crowding for short presentation times. This result is consistent with earlier studies showing that Vernier acuity (measured at the fovea) is affected by crowded displays and by their distinctiveness from the targets⁵⁷ or at very short exposures⁵⁸ (<100 ms). Moreover, a recent study⁴⁹ showed that both target identification and reaction time are affected when a foveal target is presented for a short time, or when the processing time is limited by backward masking. These findings suggest that extra processing time is required to overcome foveal crowding^49,58. These results are consistent with our current findings showing improved contrast detection under backward masking conditions and improved letter identification (visual acuity) under crowded conditions with short presentation times. All together, the results suggest that the improved processing is achieved in stages, where an early detection stage is followed by a later identification stage⁶⁹. After training, the detection task was accomplished within a shorter time period, suggesting that the overall processing was much faster. This could be attributed to faster processing of either the first (detection) or the second (identification) stage.

A few possible neural changes at different levels of the visual processing hierarchy may underlie improved performance, leading to improved processing speed. One possibility is that neurons at the early processing levels (e.g. in V1) may improve their sensitivity⁴⁷, resulting from sharpening of the orientation tuning curves⁷⁰ or a reduction in the receptive field size⁷¹. Other possibilities are related to the retuning of internal templates⁷², or to noise reduction^72,73. A previous study showed that increasing contrast is associated with increased neural responses and decreased neural latencies of single neurons in the primary visual cortex⁷⁴. It was also shown that training reduces the internal noise in human visual processing⁷³ and thereby improves sensitivity. However, neurons in the visual cortex are extensively connected to other neurons, enabling them to integrate lateral inputs (which are noisy as well). Thus, noisy responses may also result from lateral influences. Moreover, imbalanced excitation-inhibition inputs may contribute to noisy activity. It has been suggested that reduced inhibition in the visual cortex underlies increased noise⁷⁵ or reduced processing speed⁷⁶. It was shown that collinear facilitation reduces the noise of neural responses⁷⁷, that similar training shortens the response latency in young participants³⁹ and that collinear facilitation expedites the brain's processing⁷⁸. Thus, we can conclude that our training, which attempts to improve the efficiency of the spatial and temporal interactions at early visual areas, might improve processing speed directly by changing the excitation-inhibition balance, or possibly indirectly by reducing internal noise and improving neural sensitivity.

Generalization of improvement: Transfer between tasks

An important result of our study is the transfer of improvement following training on contrast detection of Gabor patches to improvement in a letter visual acuity task (visual acuity under crowded conditions presented for short times). Although transfer of improvements were shown previously with clinical populations (amblyopia^42,43, presbyopia⁴⁰ and developmental visual agnosia⁴⁴), our results are novel since a) we found that the improvement is greater for shorter presentation times and when measured under crowded conditions, whereas previous studies showed improvements in visual acuity using static clinical charts; b) we provide data for young participants with normal/corrected to normal vision and not in impaired vision (clinical cases); c) we showed transfer from near vision training to both improved near visual acuity and to far temporal processing, while previous studies showed transfer of visual functions only for a trained viewing distance (either far visual acuity improvement following far visual training in amblyopia, or near visual acuity improvement following near visual training in presbyopia). We found that following training, static near visual acuity improved, whereas static distance/far visual acuity did not improve. This may suggest that near and far visual acuity, as measured by static charts, do not rely on joint mechanisms. However, since the spatial visual acuity for distance vision, as measured on the static ETDRS chart, may have reached a ceiling performance in the training group, our results did not allow us to reach such a conclusion.

One can claim that the improvements reported here may be due to a retest effect, i.e. very fast learning taking place already during the pretest. It has been established that many perceptual learning studies show improvement in learning just after a few sessions^9,10,14,79 mainly if the effects are not robust. However, to date, no study has shown rapid, remarkable improvement in visual acuity. Moreover, previous training studies that have used similar methods found no improvement in lateral facilitation, contrast sensitivity and backward masking for the control group just by retesting or placebo training for 50 hours^80,81. No improvement in contrast sensitivity was found even after 10 sessions of training³⁹. An appreciable improvement in collinear facilitation requires many sessions of training at the same orientation and spatial frequency²⁶. Furthermore, we show (Figure 3) the results from a control group, tested on the novel temporal visual acuity task under the crowded condition task. This group did not undergo training and was retested after the same time period as the training group (~2 months). The results showed that for the control group there was no improvement at any of the presentation times. It is also worth noting that studies show that the magnitude of the improvement is related to the initial level of the participants' performance, being maximal for worse vision and minimal for good vision^42,45. In our study, the initial level of the spatial processing of the young participants was very good and therefore, it is expected that the improvement will not be robust, as found for the improvement of contrast sensitivity or for the static visual acuity for distance. Moreover, the main novel result of our study is an improvement in temporal processing. Indeed, the initial vision for short presentation times was reduced and it improved remarkably after the training. The control group (Figure 3) that was retested for the same task after ~2 months showed no improvement at all the presentation times. Therefore, we contend that the training on Gabor patches is transferred to spatio-temporal gains of letter resolution and crowding.

Using iDevices for training

Here we show for the first time conclusive evidence showing the significant effects of training on hand-held iDevices using the GlassesOff application. The results provide encouraging news for future research in the field of perceptual learning. Training on hand-held devices may increase training efficiency, simplify future research and make the training much easier for potential users. Such training can be effectively used for testing and training children and for special populations and also bypass transportation limitations.

The relationship between spatio-temporal masking and crowding

We recently showed⁵¹ that masking and crowding behave similarly in the fovea and in the periphery for a particular range of spatio-temporal parameters. Those results suggest that a joint mechanism might exist and that it may mediate these masking and crowding effects. Both masking and crowding may be related to the size of the human perceptive fields in the fovea^{1,82,83,84,85} as well as in the periphery^76,85. Participants with larger perceptive fields exhibit greater effects of masking and crowding and vice versa. However, the mere correlation between masking and crowding does not necessarily suggest that they operate by mutual processes.

Accumulating evidence suggests that multi-dimensional parameters and multiple factors may affect the relationships between masking and crowding. Thus, masking and crowding may be determined by multiple sources of interference operating at several levels of cortical processing^51,86 and each of them might affect the task. Among these factors are a) the proximity between the target and the flankers, which depends on the eccentricity^3,4,85, b) the duration for which the target is visible [in the fovea longer presentation times reduce crowding and masking such that at presentation times longer than 120 ms there are no crowding effects^49,51; in the periphery, presentation times longer than 250 ms do not affect crowding⁸⁸ even though such elongated presentation times can involve eye movements that potentially increase crowding⁸⁷ and whether presentation times shorter than 250 ms affect peripheral crowding is still unclear, c) the temporal order (dynamics) of the presentation (backward, simultaneous, or forward masking^49,58,89), d) the global configuration and grouping between the mask and the target elements, where collinear configuration seems to produce the maximal effect^57,86,90, e) contrast – where higher crowding is found with a higher contrast threshold and f) attention⁹¹. Thus, crowding and masking may or may not be correlated, depending on the particular spatial-temporal parameters chosen in the study.

Conclusions

Since the processing of visual information takes time, in order to mediate relevant behavior, the processing must be completed within a limited time window. Thus, the gain in temporal processing speed may enable one to overcome the bottleneck of crowding and may provide a better stream of visual information for perceptual processing and thus may improve perceptual functions such as contrast detection, identification and object recognition and cognitive functions such as decision-making³⁹ and reading⁴⁰. The results of our current study show that improved processing speed also improves the temporal processing of both crowding, using letters and masking, using Gabors, suggesting that the two phenomena are at least partly related^49,51. Thus, processing speed may lead to overcoming foveal crowding and might be the enabling factor for generalizing to other visual functions.

Methods

The paradigm used in this study is similar to the paradigm used in our earlier studies in presbyopic [aging eye] participants⁴⁰ in terms of behavioral tasks and temporal conditions. Visual acuity, spatial contrast sensitivity, crowding and backward masking were tested before (pretest) and after (posttest) the treatment using a PC at a distance of 150 cm in the laboratory.

Participants

Twenty-three young participants with no neurological conditions and with normal or corrected-to-normal vision in both eyes volunteered to participate in the training study. Fourteen of them (aged 24 ± 5 years old, mean ± STD) completed the training and returned for the posttest. Twenty additional participants enrolled in a control group and completed the pretest. Nineteen of them (aged 24 ± 5 years old, mean ± STD) returned for the posttest after the same time as the group undergoing training but without any training. The procedures were approved by the ethics committee of the Charité and all participants gave informed written consent to participate in the study. They were paid for participation in pre- and posttests and voluntarily completed the training phase. The study was performed at the Visual Perception Laboratory, Charité – Universitätsmedizin Berlin, Germany. All experimental protocols were performed in accordance with the guidelines provided by the committee approving the experiments.

Apparatus

Pretest and posttest were measured at the lab on a Samtron 98PDF 19″ CRT screen (1024 × 768 pixels at a 100 Hz refresh rate; the effective screen diagonal was 43.6 cm) controlled by a PC.

Visual acuity before and after training

We measured near (40 cm) and far (3 meters) visual acuity with an ETDRS chart. Far visual acuity was measured from a viewing distance of 3 meters using a wall-mounted ETDRS chart (Precision Vision) and near vision was measured using a hand-held chart from 40 cm.

Psychophysical measurements before and after training: stimuli and paradigms

PC test -The stimuli were vertically oriented localized gray-level gratings (Gabor patches, GPs) with an equal luminance distribution (STD, σ, allowing a minimum of 2 cycles in the GP) and the viewing distance was 150 cm. A 2AFC paradigm was used and participants were asked to report which interval contained the target. Target detection contrast threshold was determined for each condition, using a separate adaptive method for each block that converged to 79% percent correct. Participants started each trial by pressing the middle mouse button. A visible fixation circle was presented in the center of the screen until the participants pressed the button again to start the intervals. The two intervals were 60 ms each with an 800 ms gap between them. The first interval was preceded by a 300 ms blank period with a temporal jitter of 500 ms on average. The target GP was presented in only one of the intervals (the order was randomized). Participants were asked to report which interval contained the target by pressing a mouse button (left for the first interval and right for the second). Across trials, the target presentation was equally distributed between the two intervals. Participants were instructed to maintain their fixation at the center of the monitor and to avoid eye movements during the trials.

Psychophysical measurements included the following: 1) contrast sensitivity: The task was to detect a single Gabor patch target with a spatial frequency of 5, 6.5, 8.5, or 13 cycles per degree (cpd) presented for 60 ms; 2) lateral masking (LM): Detection of a Gabor target masked by two high-contrast (60%) collinear Gabor flankers with a target-flanker distance of 1.5, 2, 3 and 4 wavelengths (λ) (presented for 60 ms) with a spatial frequency of 6.5 cpd occupying 0.31 degrees of visual angles; 3)temporal masking: Backward masking following lateral masking, composed of LM followed by another mask, identical to the two flanking collinear GPs used in LM, presented at the same location but with varied time intervals (inter-stimulus interval, ISI) after LM. The ISIs were 60, 90, 120 and 150 ms. The target-flanker distance was 2λ and the target and flankers had a spatial frequency of 6.5 cpd.

Training on iDevices using the GlassesOff application

The paradigm is a structured perceptual learning training method originally developed for improving visual functions in presbyopia (GlassesOff applications for iDevices). The results of each session are sent via the internet to a remote server that analyzes the results. The training difficulty for the next session is adapted individually for each user according to the user's performance in the previous session. Thus, the pace of the progress is determined according to the individual's results. The initial number of sessions is individually set after an initial evaluation of the temporal visual acuity⁹² (see below) and is continuously updated throughout the training, based on the user's performance. The participants were instructed to perform at least 3 sessions per week and completed 24 ± 3 sessions (mean ± STD, range 20–33); one participant performed 33 and 2 participants performed 20 sessions on different days not including the days of the pretest and posttest.

Training on iDevices

Recent technology enabled the use of high-resolution screens on iDevices known as retina displays. The pixel size of the retina display is 0.078 × 0.078 mm, about 4 times smaller than the standard pixel size of PC monitors. This provides the advantage of presenting high spatial frequencies viewed from short distances. In this study we were able to train the participants using high spatial frequencies up to 8 cpd. We recently showed⁹³ that the contrast sensitivity measured on a retina display is much better than that measured on PC monitors and that this improves the visual functions of presbyopes. The screen resolution of the iPod and iPhones was 960 × 640 pixels at a 60 Hz refresh rate, whereas the effective training area from 40 cm was a circle with a diameter of 4.9 cm.

To avoid variability among the resolution, screen size and luminance values that exist between the different iDevices, the pre and post testing of the temporal acuity were performed on the same device for all participants: an iPod (retina display) in a controlled environment at the lab. The training was performed using the participants' personal devices, which all had retina displays, except for one user who used an iPhone 3 (pixel size 0.156 × 0.156 mm², better than a PC). Nevertheless, the application sets the overall luminance and the image size of the Gabor patches (by compensating for the known pixel size of each device) at the beginning of each session to be the same among the different devices.

The luminance of the screen was controlled throughout the training by automatically setting it to its maximal value (120 cd/m²) at the beginning of each session and returning it back to the user's preferences at the end of the session. The participants were instructed to train at home in a dark environment from 40 cm with both eyes open in a dark environment at their convenience. Each participant was provided with a ribbon of this length so that they could easily adjust the distance from the device to their eyes at home.

Training paradigm

Participants were trained on contrast detection of Gabor targets under lateral and backward masking conditions, by posing spatial and temporal constraints on the visual processing. The training covered a range of spatial frequencies (2–8 cpd; the size of the Gabor patches ranged from 0.18 to 1 deg) and included 4 orientations (0, 45, 90 and 135 deg) that were modified in accordance with the improved performance. Each session included 6 blocks that included the target alone and 5 blocks composed of two of the above-described 4 conditions (contrast sensitivity, lateral masking, spatial masking (crowded configuration), temporal masking) and a fifth condition: pedestals: contrast discrimination of the target while the two flankers served as pedestals either at a) 1.5 λ or 0 λ. The selection of the conditions was determined by an automated algorithm that advanced the conditions, the difficulty level (spatial frequency, orientation and target contrast) and ISI according to the participant's performance. Each condition was repeated twice during different successive sessions on different days.

The ISIs were 60, 90, 120, 150, 180, 210, or 240 ms. A 2AFC paradigm was used, identical to the one used in the pretest and posttest and the participants were asked to report which interval contained the target. Auditory and visual feedback were provided. ISI, the duration of the presentation of target and flanking Gabors, as well as their orientation and spatial frequency were modified between sessions, one parameter at a time, according to the performance in the preceding session. The duration of the stimulus presentation was 60 ms. The spatial distance between the target and the flankers varied from 0 to 4 λ. The orientation of the Gabor patches was always the same for the target and masking GPs (i.e., collinear, side-by-side or cross: ‘collinear + side-by-side’).

Visual acuity under temporal crowded conditions using E letters on an iPod (at the lab)

We applied here, on iPods, the same paradigm that we used before^49,59 in order to investigate the crowding (letter resolution) at different presentation times. This method accurately predicts the near visual acuity, as measured on near ETDRS charts⁹². It is a LogMAR chart equivalent, monitor-based paradigm that uses E-patterns presented for presentation times ranging from 30–240 ms. Five rows of five E-patterns each, facing one of four directions, with a 0.1-log unit size difference between the rows were presented. These stimuli correspond to a subset of the LogMAR chart, with a baseline pattern size corresponding to baseline (i.e. 6/6 vision) of the LogMAR chart. The central pattern (the center of the middle row) was always the target for identification. The patterns were dark gray on a gray background and the viewing distance was 40 cm. For each trial the task was to determine the direction of the central E (the target) presented for durations ranging from 30 to 240 ms. An adaptive procedure in which the pattern size and spacing were modified in 0.1 log unit steps was used to determine the size for 50% correct (the chance level was 25%). A different auditory feedback was given for correct and incorrect responses. To determine crowding, we used a crowded condition (0.4 letter spacing)⁴⁹ for each presentation time. We recently showed that the results revealed from this procedure are highly correlated with near visual acuity, as measured on an ETDRS chart⁹². This measure was used twice in the lab using the same iPod in a controlled environment. The second measure (posttest) took place immediately after the training period for the group undergoing training and after the same time period but without intervening training for the control group.

References

Polat, U. & Sagi, D. Lateral interactions between spatial channels: suppression and facilitation revealed by lateral masking experiments. Vision Res 33, 993–999 (1993).
Article CAS PubMed Google Scholar
Breitmeyer, B. G. Visual masking: an integrative approach. Vol. 4 (Oxford University Press, 1984).
Google Scholar
Whitney, D. & Levi, D. M. Visual crowding: a fundamental limit on conscious perception and object recognition. Trends Cogn Sci 15, 160–168 (2011).
Article PubMed PubMed Central Google Scholar
Levi, D. M. Crowding-an essential bottleneck for object recognition: a mini-review. Vision Res 48, 635–654 (2008).
Article ADS PubMed PubMed Central Google Scholar
Pelli, D. G., Palomares, M. & Majaj, N. J. Crowding is unlike ordinary masking: distinguishing feature integration from detection. J Vis 4, 1136–1169 (2004).
PubMed Google Scholar
Enns, J. T. & Di Lollo, V. What's new in visual masking? Trends Cogn Sci 4, 345–352 (2000).
Article CAS PubMed Google Scholar
Breitmeyer, B. G. & Ogmen, H. Recent models and findings in visual backward masking: a comparison, review and update. Percept Psychophys 62, 1572–1595 (2000).
Article CAS PubMed Google Scholar
Francis, G. Quantitative theories of metacontrast masking. Psycholog Rev 107, 768–785 (2000).
Article CAS Google Scholar
Fahle, M. & Poggio, T. Perceptual Learning (MIT Press, Cambridge, Masssachusetts, 2002).
Sagi, D. Perceptual learning in Vision Research. Vision Res 51, 1552–1566 (2011).
Article PubMed Google Scholar
Sasaki, Y., Nanez, J. E. & Watanabe, T. Advances in visual perceptual learning and plasticity. Nat Rev Neurosci 11, 53–60 (2010).
Article CAS PubMed Google Scholar
Gilbert, C. D. Early perceptual learning. Proc Natl Acad Sci U S A 91, 1195–1197 (1994).
Article CAS ADS PubMed PubMed Central Google Scholar
Crist, R. E., Kapadia, M. K., Westheimer, G. & Gilbert, C. D. Perceptual learning of spatial localization: specificity for orientation, position and context. J Neurophysiol 78, 2889–2894 (1997).
Article CAS PubMed Google Scholar
Fahle, M. Perceptual learning: specificity versus generalization. Curr Opin Neurobiol 15, 154–160 (2005).
Article CAS PubMed Google Scholar
Harris, H., Gliksberg, M. & Sagi, D. Generalized perceptual learning in the absence of sensory adaptation. Curr Biol 22, 1813–1817 (2012).
Article CAS PubMed Google Scholar
Polat, U. Making perceptual learning practical to improve visual functions. Vision Res 49, 2566–2573 (2009).
Article PubMed Google Scholar
Xiao, L. Q. et al. Complete transfer of perceptual learning across retinal locations enabled by double training. Curr Biol 18, 1922–1926 (2008).
Article CAS PubMed PubMed Central Google Scholar
Zhang, J. Y. et al. Rule-based learning explains visual perceptual learning and its specificity and transfer. J Neurosci 30, 12323–12328 (2010).
Article CAS PubMed PubMed Central Google Scholar
Zhang, T., Xiao, L. Q., Klein, S. A., Levi, D. M. & Yu, C. Decoupling location specificity from perceptual learning of orientation discrimination. Vision Res 50, 368–374 (2010).
Article PubMed Google Scholar
Watanabe, T., Nanez, J. E. & Sasaki, Y. Perceptual learning without perception. Nature 413, 844–848 (2001).
Article CAS ADS PubMed Google Scholar
Ahissar, M. & Hochstein, S. The reverse hierarchy theory of visual perceptual learning. Trends Cogn Sci 8, 457–464 (2004).
Article PubMed Google Scholar
Ahissar, M. & Hochstein, S. Attentional control of early perceptual learning. Proc Natl Acad Sci U S A 90, 5718–5722 (1993).
Article CAS ADS PubMed PubMed Central Google Scholar
Zhang, J. Y. et al. Stimulus coding rules for perceptual learning. PLoS Biol 6, e197 (2008).
Article CAS PubMed PubMed Central Google Scholar
Tartaglia, E. M., Aberg, K. C. & Herzog, M. H. Perceptual learning and roving: Stimulus types and overlapping neural populations. Vision Res 49, 1420–1427 (2009).
Article PubMed Google Scholar
Herzog, M. H., Ewald, K. R., Hermens, F. & Fahle, M. Reverse feedback induces position and orientation specific changes. Vision Res 46, 3761–3770 (2006).
Article PubMed Google Scholar
Polat, U. & Sagi, D. Spatial interactions in human vision: from near to far via experience- dependent cascades of connections. Proc Natl Acad Sci U S A 91, 1206–1209 (1994).
Article CAS ADS PubMed PubMed Central Google Scholar
Polat, U. Functional architecture of long-range perceptual interactions. Spatial vision 12, 143–162 (1999).
Article CAS PubMed Google Scholar
Polat, U. & Tyler, C. W. What pattern the eye sees best. Vision Res 39, 887–895 (1999).
Article CAS PubMed Google Scholar
Polat, U. & Sagi, D. The architecture of perceptual spatial interactions. Vision Res 34, 73–78 (1994).
Article CAS PubMed Google Scholar
Adini, Y. & Sagi, D. Recurrent networks in human visual cortex: psychophysical evidence. J Opt Soc AM 18, 2228–2236 (2001).
Article CAS ADS Google Scholar
Adini, Y., Sagi, D. & Tsodyks, M. Excitatory-inhibitory network in the visual cortex: psychophysical evidence. Proc Natl Acad Sci U S A 94, 10426–10431 (1997).
Article CAS ADS PubMed PubMed Central Google Scholar
Solomon, J. A., Watson, A. B. & Morgan, M. J. Transducer model produces facilitation from opposite-sign flanks. Vision Res 39, 987–992 (1999).
Article CAS PubMed Google Scholar
Solomon, J. A. & Morgan, M. J. Facilitation from collinear flanks is cancelled by non-collinear flanks. Vision Res 40, 279–286 (2000).
Article CAS PubMed Google Scholar
Chen, C. C. & Tyler, C. W. Lateral modulation of contrast discrimination: flanker orientation effects. J Vis 2, 520–530 (2002).
PubMed Google Scholar
Chen, C. C. & Tyler, C. W. Excitatory and inhibitory interaction fields of flankers revealed by contrast-masking functions. J Vis 8, 10 11–14 (2008).
Article Google Scholar
Gilbert, C., Ito, M., Kapadia, M. & Westheimer, G. Interactions between attention, context and learning in primary visual cortex. Vision Res 40, 1217–1226 (2000).
Article CAS PubMed Google Scholar
Freeman, E., Driver, J., Sagi, D. & Zhaoping, L. Top-down modulation of lateral interactions in early vision: does attention affect integration of the whole or just perception of the parts? Curr Biol 13, 985–989 (2003).
Article CAS PubMed Google Scholar
Freeman, E., Sagi, D. & Driver, J. Lateral interactions between targets and flankers in low-level vision depend on attention to the flankers. Nature Neurosci 4, 1032–1036 (2001).
Article CAS PubMed Google Scholar
Sterkin, A., Yehezkel, O. & Polat, U. Learning to be fast: Gain accuracy with speed. Vision Res 61, 115–124 (2012).
Article PubMed Google Scholar
Polat, U. et al. Training the brain to overcome the effect of aging on the human eye. Sci Rep 2, 278 (2012).
Article CAS PubMed PubMed Central Google Scholar
Polat, U., Ma-Naim, T., Belkin, M. & Sagi, D. Improving vision in adult amblyopia by perceptual learning. Proc Natl Acad Sci U S A 101, 6692–6697 (2004).
Article CAS ADS PubMed PubMed Central Google Scholar
Polat, U. Restoration of underdeveloped cortical functions: evidence from treatment of adult amblyopia. Restor Neurol Neuros 26, 413–424 (2008).
Google Scholar
Polat, U., Ma-Naim, T. & Spierer, A. Treatment of children with amblyopia by perceptual learning. Vision Res 49, 2599–2603 (2009).
Article PubMed Google Scholar
Lev, M. et al. Training-induced recovery of low-level vision followed by high-level perceptual improvements in an adult with developmental object and face agnosia. Dev Sci Apr 4. 1–15 10.1111/desc.12178. (2014).
Astle, A. T., Li, R. W., Webb, B. S., Levi, D. M. & McGraw, P. V. A Weber-like law for perceptual learning. Sci Rep 3, 1158 (2013).
Article CAS ADS PubMed PubMed Central Google Scholar
Fell, J. & Axmacher, N. The role of phase synchronization in memory processes. Nat Rev Neurosci 12, 105–118 (2011).
Article CAS PubMed Google Scholar
Thorpe, S., Fize, D. & Marlot, C. Speed of processing in the human visual system. Nature 381, 520–522 (1996).
Article CAS ADS PubMed Google Scholar
Polat, U. & Sagi, D. Temporal asymmetry of collinear lateral interactions. Vision Res 46, 953–960 (2006).
Article PubMed Google Scholar
Lev, M., Yehezkel, O. & Polat, U. Uncovering foveal crowding? Sci Rep 4, 4067 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Petrov, Y. & McKee, S. P. The effect of spatial configuration on surround suppression of contrast sensitivity. J Vis 6, 224–238 (2006).
PubMed Google Scholar
Lev, M. & Polat, U. When masking is like crowding. J Vis 12, 333 (2012).
Article Google Scholar
Polat, U., Sterkin, A. & Yehezkel, O. Spatio-temporal low-level neural networks account for visual masking. Adv Cogn Psychol 3, 153–165 (2007).
Article Google Scholar
Chung, S. T., Levi, D. M. & Legge, G. E. Spatial-frequency and contrast properties of crowding. Vision Res 41, 1833–1850 (2001).
Article CAS PubMed Google Scholar
Pelli, D. G. Crowding: a cortical constraint on object recognition. Curr Opin Neurobiol 18, 445–451, 10.1016/j.conb.2008.09.008 (2008).
Article CAS PubMed PubMed Central Google Scholar
Pelli, D. G. & Tillman, K. A. The uncrowded window of object recognition. Nature Neurosci 11, 1129–1135 (2008).
Article CAS PubMed Google Scholar
Manassi, M., Sayim, B. & Herzog, M. H. Grouping, pooling and when bigger is better in visual crowding. J Vis 12, 13 (2012).
Article PubMed Google Scholar
Malania, M., Herzog, M. H. & Westheimer, G. Grouping of contextual elements that affect vernier thresholds. J Vis 7, 1 1–7 (2007).
Article Google Scholar
Westheimer, G. & Hauske, G. Temporal and spatial interference with vernier acuity. Vision Res 15, 119–1141 (1975).
Article Google Scholar
Bonneh, Y. S., Sagi, D. & Polat, U. Spatial and temporal crowding in amblyopia. Vision Res 47, 1950–1962 (2007).
Article PubMed Google Scholar
Chakravarthi, R. & Cavanagh, P. Recovery of a crowded object by masking the flankers: determining the locus of feature integration. J Vis 9, 4 1–9 (2009).
Article Google Scholar
Baron, W. S. & Westheimer, G. Visual acuity as a function of exposure duration. J Opt Soc AM 63, 212–219 (1973).
Article CAS ADS PubMed Google Scholar
Dong, L. M., Hawkins, B. S. & Marsh, M. J. Consistency between visual acuity scores obtained at different test distances: theory vs observations in multiple studies. Arc Ophthalmol 120, 1523–1533 (2002).
Article Google Scholar
Giese, W. J. The interrelationship of visual acuity at different distances. J Appl Psychol 30, 91–106 (1946).
Article CAS PubMed Google Scholar
Burbeck, C. A. Locus of spatial-frequency discrimination. J Opt Soc AM 4, 1807–1813 (1987).
Article CAS ADS Google Scholar
Polat, U. & Sagi, D. Plasticity of spatial interactions in early vision. in Maturational Windows and Adult Cortical Plasticity Vol. XXIV (eds Julesz, B. & Kovacs, I.) 1–15 (Addison-Wesley, 1995).
Google Scholar
Censor, N., Karni, A. & Sagi, D. A link between perceptual learning, adaptation and sleep. Vision Res 46, 4071–4074 (2006).
Article PubMed Google Scholar
Sterkin, A., Yehezkel, O., Bonneh, Y. S., Norcia, A. & Polat, U. Backward masking suppresses collinear facilitation in the visual cortex. Vision Res 49, 1784–1794 (2009).
Article PubMed Google Scholar
Levi, D. M. & Carney, T. Crowding in peripheral vision: why bigger is better. Curr Biol 19, 1988–1993 (2009).
Article CAS PubMed PubMed Central Google Scholar
Neri, P. & Heeger, D. J. Spatiotemporal mechanisms for detecting and identifying image features in human vision. Nature Neurosci 5, 812–816 (2002).
Article CAS PubMed Google Scholar
Teich, A. F. & Qian, N. Learning and adaptation in a recurrent model of V1 orientation selectivity. J Neurophys 89, 2086–2100 (2003).
Article Google Scholar
Lev, M., Yehezkel, O., Sterkin, A. & Polat, U. Perceptual learning can reduce the size of the perceptive field leading to reduced impact of masking and crowding. Society for Neuroscience abstract 842.18/VV18 http://www.abstractsonline.com/plan/ViewAbstract.aspx?cKey=b69b038d-27ee-4907-b3c5-d0ffb7237b36&mID=3236&mKey=8d2a5bec-4825-4cd6-9439-b42bb151d1cf&sKey=3f29bcb0-a247-49c1-bd28-dd16a01a84ca (2013) Date of access: 13/11/2013.
Lu, Z. L. & Dosher, B. A. Perceptual learning retunes the perceptual template in foveal orientation identification. J Vis 4, 44–56 (2004).
PubMed Google Scholar
Dosher, B. A. & Lu, Z. L. Perceptual learning reflects external noise filtering and internal noise reduction through channel reweighting. Proc Natl Acad Sci U S A 95, 13988–13993 (1998).
Article CAS ADS PubMed PubMed Central Google Scholar
Albrecht, D. G., Geisler, W. S., Frazor, R. A. & Crane, A. M. Visual cortex neurons of monkeys and cats: temporal dynamics of the contrast response function. J Neurophysiol 88, 888–913 (2002).
Article PubMed Google Scholar
Leventhal, A. G., Wang, Y., Pu, M., Zhou, Y. & Ma, Y. GABA and its agonists improved visual cortical function in senescent monkeys. Science 300, 812–815 (2003).
Article CAS ADS PubMed Google Scholar
Kail, R. & Salthouse, T. A. Processing speed as a mental capacity. Acta Psycholog 86, 199–225 (1994).
Article CAS Google Scholar
Kasamatsu, T., Polat, U., Pettet, M. W. & Norcia, A. M. Colinear facilitation promotes reliability of single-cell responses in cat striate cortex. Exp Brain Res. 138, 163–172 (2001).
Article CAS PubMed Google Scholar
Paradis, A.-L., Morel, S., Seriès, P. & Lorenceau, J. Speeding up the brain: when spatial facilitation translates into latency shortening. Front Hum Neurosci 6, December 19 10.3389/fnhum.2012.00330 (2012).
Poggio, T., Fahle, M. & Edelman, S. Fast perceptual learning in visual hyperacuity. Science 256, 1018–1021 (1992).
Article CAS ADS PubMed Google Scholar
Li, R., Polat, U., Makous, W. & Bavelier, D. Enhancing the contrast sensitivity function through action video game training. Nature Neurosci 12, 549–551, 10.1038/nn.2296 (2009).
Article CAS PubMed Google Scholar
Li, R., Polat, U., Scalzo, F. & Bavelier, D. Reducing backward masking through action game training. J Vis 10, Decemeber 28, 10.1167/10.14.33 (2010).
Neri, P. & Levi, D. M. Receptive versus perceptive fields from the reverse-correlation viewpoint. Vision Res 46, 2465–2474 (2006).
Article PubMed Google Scholar
Watson, A. B. Summation of grating patches indicates many types of detector at one retinal location. Vision Res 22, 17–25 (1982).
Article CAS PubMed Google Scholar
Watson, A. B., Barlow, H. B. & Robson, J. G. What does the eye see best? Nature 302, 419–422 (1983).
Article CAS ADS PubMed Google Scholar
Lev, M. & Polat, U. Collinear facilitation and suppression at the periphery. Vision Res 51, 2488–2498 (2011).
Article PubMed Google Scholar
Levi, D. M., Hariharan, S. & Klein, S. A. Suppressive and facilitatory spatial interactions in peripheral vision: peripheral crowding is neither size invariant nor simple contrast masking. J Vis 2, 167–177 (2002).
PubMed Google Scholar
Nandy, A. S. & Tjan, B. S. Saccade-confounded image statistics explain visual crowding. Nature Neurosci 15, 463–469, S461–462 (2012).
Article CAS PubMed Google Scholar
Wallace, J. M., Chiu, M. K., Nandy, A. S. & Tjan, B. S. Crowding during restricted and free viewing. Vision Res 84, 50–59 (2013).
Article PubMed PubMed Central Google Scholar
Chung, S. & Patel, S. Temporal Dynamics of the Crowding Mechanism. J Vis 11, 1143 (2011).
Article Google Scholar
Herzog, M. H. & Fahle, M. Effects of grouping in contextual modulation. Nature 415, 433–436 (2002).
Article CAS ADS PubMed Google Scholar
He, S., Cavanagh, P. & Intriligator, J. Attentional resolution and the locus of visual awareness. Nature 383, 334–337 (1996).
Article CAS ADS PubMed Google Scholar
Yehezkel, O., Sterkin, A., Lev, M. & Polat, U. Digital precise remote near visual acuity evaluation using mobile devices. The Association for Research in Vision and Ophthalmology abstract https://arvo2013.abstractcentral.com/s1agxt/com.scholarone.s1agxt.s1agxt/S1A.html?&a=2662&b=1593989&c=19971&d=17&e=21482456&f=17&g=null&h=BROWSE_THE_PROGRAM&i=N&j=N&k=N&l=Y&m=MG45pCCkFdVJDUxbhJmux4zanI&n=0&o=1406742935032&q=Y&p=https://arvo2013.abstractcentral.com (2013) 582 - C0193, Date of access: 05/05/2013.
Ma-Naim, T., Polat, U., Lev, M., Yehezkel, O. & Sterkin, A. Perceptual Training on Mobile Devices Is Effective for Overcoming the Effects of Aging on the Human Eye. American Academy of Opthalmology (AAO) http://www.nxtbook.com/tristar/aao/final_program2012/index.php#/264 (2012) 155, Date of access: 11/11/2012.

Download references

Acknowledgements

This study was supported by grants for U.P. from the Israel Science Foundation (ISF188/2010) and GlassesOff, Inc. M.L. and K.L. each received a Short-Term Research Grant by the Minerva Foundation of the Max Planck Society in support of this project. G.H., K.L. and P.S. were supported by the German Research Foundation (grants HE 6244/1-1 and STE 1430/2-1).

Author information

Lev Maria, Ludwig Karin and Gilaie-Dotan Sharon contributed equally to this work.

Authors and Affiliations

Faculty of Medicine, Goldschleger Eye Research Institute, Tel Aviv University, Israel
Maria Lev & Uri Polat
Department of Psychiatry and Psychotherapy, Visual Perception Laboratory, Charité – Universitätsmedizin Berlin, Germany
Karin Ludwig, Stephanie Voss, Philipp Sterzer & Guido Hesselmann
Department of Psychology, Humboldt-Universität zu Berlin, Germany
Karin Ludwig
UCL Institute of Cognitive Neuroscience, London, UK
Sharon Gilaie-Dotan

Authors

Maria Lev
View author publications
You can also search for this author in PubMed Google Scholar
Karin Ludwig
View author publications
You can also search for this author in PubMed Google Scholar
Sharon Gilaie-Dotan
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Voss
View author publications
You can also search for this author in PubMed Google Scholar
Philipp Sterzer
View author publications
You can also search for this author in PubMed Google Scholar
Guido Hesselmann
View author publications
You can also search for this author in PubMed Google Scholar
Uri Polat
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.L., K.L., S.G.-D., G.H., S.V., P.S. and U.P. were involved in designing the study, as well as in writing and editing the manuscript. K.L., M.L., S.G.-D. and S.V. collected and analyzed the data. All authors reviewed the manuscript.

Ethics declarations

Competing interests

U.P.'s work has been funded by GlassesOff, Inc. He has received compensation as a consultant and as amember of the scientific advisory board and owns stock in the company. M.L., K.L., S.G.-D., S.V., G.H. and P.S. declare no competing financial interest.

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/

Reprints and permissions

About this article

Cite this article

Lev, M., Ludwig, K., Gilaie-Dotan, S. et al. Training improves visual processing speed and generalizes to untrained functions. Sci Rep 4, 7251 (2014). https://doi.org/10.1038/srep07251

Download citation

Received: 01 April 2014
Accepted: 13 November 2014
Published: 28 November 2014
DOI: https://doi.org/10.1038/srep07251

This article is cited by

Perceptual learning based on a temporal stimulus enhances visual function in adult amblyopic subjects
- Auria Eisen-Enosh
- Nairouz Farah
- Yossi Mandel
Scientific Reports (2023)
Testing the efficacy of vision training for presbyopia: alternating-distance training does not facilitate vision improvement compared to fixed-distance training
- Suraiya Jahan Liza
- Seonggyu Choe
- Oh-Sang Kwon
Graefe's Archive for Clinical and Experimental Ophthalmology (2022)
Investigating face and house discrimination at foveal to parafoveal locations reveals category-specific characteristics
- Olga Kreichman
- Yoram S. Bonneh
- Sharon Gilaie-Dotan
Scientific Reports (2020)
Contextual influences in the peripheral retina of patients with macular degeneration
- Giulio Contemori
- Luca Battaglini
- Clara Casco
Scientific Reports (2019)
Evaluation of Critical Flicker-Fusion Frequency Measurement Methods for the Investigation of Visual Temporal Resolution
- Auria Eisen-Enosh
- Nairouz Farah
- Yossi Mandel
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.