Stimulus- and goal-oriented frameworks for understanding natural vision

Turner, Maxwell H.; Sanchez Giraldo, Luis Gonzalo; Schwartz, Odelia; Rieke, Fred

doi:10.1038/s41593-018-0284-0

Review Article
Published: 10 December 2018

Nature Neuroscience volume 22, pages 15–24 (2019)

Abstract

Our knowledge of sensory processing has advanced dramatically in the last few decades, but this understanding remains far from complete, especially for stimuli with the large dynamic range and strong temporal and spatial correlations characteristic of natural visual inputs. Here we describe some of the issues that make understanding the encoding of natural images a challenge. We highlight two broad strategies for approaching this problem: a stimulus-oriented framework and a goal-oriented one. Different contexts can call for one framework or the other. Looking forward, recent advances, particularly those based in machine learning, show promise in borrowing key strengths of both frameworks and by doing so illuminating a path to a more comprehensive understanding of the encoding of natural stimuli.

**Fig. 1: Texture synthesis based on deep convolutional neural networks.**

**Fig. 2: Efficient coding strategies rely on self-generated movement.**

**Fig. 3: Beyond-pairwise statistics contribute to complex structure in natural images.**

**Fig. 4: Motion-sensitive neurons encode self-movement across the animal kingdom.**

**Fig. 5: DNNs reflect some, but not all, architectural and computational motifs found in neural circuits.**

Acknowledgements

We thank H. Krapp, D. Pospisil, and J. Shlens for helpful feedback on an earlier version of this review. H. Krapp very generously provided the data and schematic shown in Fig. 4a,b. This work was supported by NIH grants F31-EY026288 (to M.H.T.), EY028542 (to F.R.), and a National Science Foundation Grant 1715475 (to O.S.).

Author information

These authors contributed equally: Maxwell H. Turner, Luis Gonzalo Sanchez Giraldo.
These authors jointly supervised this work: Odelia Schwartz, Fred Rieke.

Authors and Affiliations

Department of Physiology and Biophysics, University of Washington, Seattle, WA, USA
Maxwell H. Turner & Fred Rieke
Graduate Program in Neuroscience, University of Washington, Seattle, WA, USA
Maxwell H. Turner
Department of Computer Science, University of Miami, Coral Gables, FL, USA
Luis Gonzalo Sanchez Giraldo & Odelia Schwartz

Authors

Maxwell H. Turner
Luis Gonzalo Sanchez Giraldo
Odelia Schwartz
Fred Rieke

Corresponding author

Correspondence to Fred Rieke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Turner, M.H., Sanchez Giraldo, L.G., Schwartz, O. et al. Stimulus- and goal-oriented frameworks for understanding natural vision. Nat Neurosci 22, 15–24 (2019). https://doi.org/10.1038/s41593-018-0284-0

Received: 09 March 2018
Accepted: 22 October 2018
Published: 10 December 2018
Issue Date: January 2019
DOI: https://doi.org/10.1038/s41593-018-0284-0
