Neurons in the pigeon visual network discriminate between faces, scrambled faces, and sine grating images

Clark, William; Chilcott, Matthew; Azizi, Amir; Pusch, Roland; Perry, Kate; Colombo, Michael

doi:10.1038/s41598-021-04559-z

Download PDF

Article
Open access
Published: 12 January 2022

Neurons in the pigeon visual network discriminate between faces, scrambled faces, and sine grating images

William Clark¹,
Matthew Chilcott²,
Amir Azizi³,
Roland Pusch⁴,
Kate Perry¹ &
…
Michael Colombo¹

Scientific Reports volume 12, Article number: 589 (2022) Cite this article

2716 Accesses
8 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Discriminating between object categories (e.g., conspecifics, food, potential predators) is a critical function of the primate and bird visual systems. We examined whether a similar hierarchical organization in the ventral stream that operates for processing faces in monkeys also exists in the avian visual system. We performed electrophysiological recordings from the pigeon Wulst of the thalamofugal pathway, in addition to the entopallium (ENTO) and mesopallium ventrolaterale (MVL) of the tectofugal pathway, while pigeons viewed images of faces, scrambled controls, and sine gratings. A greater proportion of MVL neurons fired to the stimuli, and linear discriminant analysis revealed that the population response of MVL neurons distinguished between the stimuli with greater capacity than ENTO and Wulst neurons. While MVL neurons displayed the greatest response selectivity, in contrast to the primate system no neurons were strongly face-selective and some responded best to the scrambled images. These findings suggest that MVL is primarily involved in processing the local features of images, much like the early visual cortex.

Single neuron responses underlying face recognition in the human midfusiform face-selective cortex

Article Open access 13 September 2023

Rodrigo Quian Quiroga, Marta Boscaglia, … Bruno Rossion

Local features drive identity responses in macaque anterior face patches

Article Open access 23 September 2022

Elena N. Waidmann, Kenji W. Koyano, … David A. Leopold

Hierarchical and nonhierarchical features of the mouse visual cortical network

Article Open access 26 January 2022

Rinaldo D. D’Souza, Quanxin Wang, … Andreas Burkhalter

Introduction

The ability to recognise visual objects belonging to different categories is the foundation for object-dependent behaviour across the animal kingdom. The structures that mediate object recognition are most well understood in the primate brain. Ascending visual information from the retina is progressively transformed into a more readable form at each stage of the primate ventral stream, with increasingly complex and viewpoint-invariant representations of faces and other objects emerging at the level of inferior temporal (IT) cortex^1,2,3. The discovery of a general purpose circuitry underlying face perception in IT cortex raises the question of whether similar networks have evolved in the visual systems of organisms distantly related to primates.

Similar to mammals, birds have two ascending visual pathways. The thalamofugal pathway is composed of the thalamic dorsolateral geniculate nucleus, which projects to the visual Wulst⁴. The Wulst is possibly homologous with the mammalian primary visual cortex, and forms a retinotopic map of the visual field^5,6,7. The visual Wulst is heavily involved in visually guided behaviour, and may participate mainly in pattern vision for small and distant targets in the fovea of the lateral visual field^8,9,10. The tectofugal pathway consists of the midbrain visual tectum and its projections via the thalamic nucleus rotundus to the pallial entopallium (ENTO)¹¹. ENTO is possibly analogous to parts of extrastriate cortex^{12, 13}, containing neurons with large receptive fields well suited for object identification over large areas of the visual field¹⁴. The entire anterior–posterior extent of ENTO forms a topographic and reciprocal connection with the above-positioned layers of the nidopallium and the mesopallium¹⁵. The mesopallium ventrolaterale (MVL) is one of the mesopallial visual nuclei of the dorsal ventricular ridge (DVR) in the avian brain, and receives input from both the ENTO as well the intermediate nidopallial layers^13,16. The tectofugal pathway is thought to be primarily involved in identification of objects in the area dorsalis (a second fovea region) of the frontal visual field^{17, 18}.

Neurons in the pigeon visual association regions discriminate between basic stimulus parameters such as pattern, color, amplitude, and spatial frequency¹⁹. A recent study using linear discriminant analysis (LDA)²⁰ also demonstrated that a small population of MVL neurons can discriminate between the features of animate and inanimate objects with greater capacity than at the level of ENTO. The static features of the avian face-region holds ethological relevance for pigeons^21,22,23, suggesting that neural specialization related to social aspects of vision may exist in the avian brain. Face-selectivity at the single-cell level outside of primates has only been confirmed in sheep²⁴. The purpose of the current study was to assess whether neurons in the pigeon visual system might show selectivity for faces, despite the evolutionary separation and differences in brain organisation from mammals.

Methods

Subjects

Fourteen experimentally naive pigeons (Columba livia) served as subjects and were housed individually in wire mesh cages in a colony room maintained at 20 °C. The birds had ad libitum access to grit and water, and were fed a blend of wheat, peas, and corn. The pigeons were maintained at 85% of their free feeding weight during the experiment. All experimental procedures were approved by the University of Otago Animal Ethics Committee and conducted in accordance with the University of Otago’s Code of Ethical Conduct for the Manipulation of Animals and the ARRIVE guidelines for the care and use of laboratory animals.

Apparatus

The equipment was similar to that used in Clark et al.²⁵. Training and testing of the pigeons was performed using standard operant chambers with dimensions of 32.5 cm (length), 36 cm (width) and 34.5 cm (height). A 17-inch screen (resolution: 1284 × 1024) was used to present stimuli. A Carroll Touch infrared touch frame (EloTouch, baud rate 9600, transmission time 20 ms) was placed directly in front of the screen and registered the XY coordinates of pecks. A transparent plexiglass panel with a single square response key (2.5 × 2.5 cm) was also situated in front of the screen and prevented accidental responses from the pigeon’s body from being registered. Grain reward was delivered via a food hopper 20 cm below the square response key, and was illuminated when raised.

Stimuli

Twenty images were used as visual stimuli, consisting of five different stimulus groupings, with four examples in each stimulus grouping (Fig. 1a). The five stimulus grouping were images depicting: human faces, scrambled human faces, pigeon faces, scrambled pigeon faces, and sine gratings of four different spatial frequencies. The human face images were obtained from the FEI face database available at (https://fei.edu.br/~cet/facedatabase.html). The images of pigeon faces were taken by the lead author (W.C.) using a Cannon DS126291 digital camera. Human face and pigeon face scrambled controls were created by dividing the face images into 15 × 32 square segments and then randomly shuffling the position and orientation of the tiles using open-source Webmorph software (https://webmorph.org/#P).

Behavioural task

Pigeons were initially trained to respond with a single peck to a white dot to receive a grain reward. When pigeons were responding reliably to the white dot, they were then trained on a response inhibition task (Fig. 1b) during which they were required to withhold responses while a visual stimulus was displayed. Experimental sessions consisted of 160 trials, taking approximately 1 h to complete. Each image was presented 8 times, and all the stimuli were presented in a random order on each session. The procedure on a typical trial was as follows. At the end of a 6 s intertrial interval (ITI) period, a white dot was displayed in the centre of the response key during the ready period. Any pecks elicited during the pause period extended the pause period by 2 s. Two pecks to the white dot turned it off and initiated a pause period of a random time between 2 and 4 s. Any pecks in the first 0.5 s of the pause period were ignored to prevent pecks directed towards the ready stimulus from extending the duration of the pause period. Following the pause period, a stimulus period started during which an image belonging to one of the five stimulus groupings (human faces, scrambled human faces, pigeon faces, scrambled pigeon faces, or sine gratings) was displayed within the response window for a random duration between 1.5 to 3 s. Pecks during the stimulus period immediately turned off the stimulus and initiated a correction repeat of the same trial from the start of the ITI period. Following the stimulus period, a Go cue (grey square) appeared in place of the stimulus, letting the bird know that it was required to respond with a single peck to the Go cue. A peck to the Go cue turned it off and resulted in the start of the reward period with access to grain from the hopper for 1.75 s, accompanied by a 1000-Hz tone and the illumination of the hopper. To proceed to the next trial, the bird was required to peck the Go cue to deliver reward and initiate the ITI following the delivery of reward. Any pecks in the response window extended the ITI by 2 s.

Surgery

Once the pigeons were reliably completing the task, stereotaxic surgery was performed to install a movable microdrive into the target brain areas²⁶. A mixture of Ketamine (30 mg/kg) and Xylazine (6 mg/kg) was injected into the pigeon’s legs as an anaesthetic. The feathers on the head were then removed. The pigeons were placed in a Revzin stereotaxic adapter²⁷ to immobilise the head and a topical anaesthetic (10% Xylocaine) was applied to the scalp. The skin overlying the skull was retracted exposing the skull, and six stainless steel screws were inserted into the skull. One of these screws served as the ground screw. A hole was drilled above the targeted area and the dura was removed. A microdrive housing the electrodes was lowered into the hole until the tips of the electrodes were positioned above either MVL, ENTO, or Wulst (Fig. 2a). Ten pigeons (X9, X11, X16, X17, X20, X22, X23, X29, X32 and X39) had microdrives installed at positions AP ± 10.5 mm, and ML ± 6.0 mm, corresponding to the location of anterior MVL and ENTO. Four pigeons (X1, X5, X40, and LV3) had microdrives installed at positions AP ± 11.0 mm, and ML ± 6.0 mm, corresponding to the location of the Wulst. The microdrive was then secured to the skull using dental acrylic, and the wound was sutured closed. Xylocaine was applied again before the pigeons were placed into a padded and heated recovery cage. The pigeon remained in the recovery cage until it had returned to an active state, and was then returned to their home cage where they were given another 7 days to recover before experimental sessions began.

Neuronal recording

The microdrives housed eight 25 μm Formvar-coated nichrome wires (California Fine Wire, Grover Beach, CA, USA) used to measure single neuron activity²⁶. For each experimental session we searched for activity on any one of the eight wires and used one of the remaining wires as the indifferent. The signals were amplified using a Grass P511K amplifier (Grass Instruments, Quincy, MA, USA) and 50 Hz noise was eliminated using a notch filter. A CED (Cambridge Electronic Design, Cambridge, UK) electrophysiology system with Spike2 software stored and analyzed the data. Cells were isolated using CED’s template matching capacity (thereby eliminating artefacts) sampling at a rate of 20,000 Hz. The selection criterion was that the isolated neuron had a signal-to-noise ratio of no less than 2:1. A separate computer controlled the behavioural task and sent codes to the CED system to align key task events. Following each recording session, the electrodes were advanced approximately 40 μm before the pigeon was returned to their home cage. If we did not record from any neural activity the electrodes were moved approximately 20 μm, and the animal was returned to its cage. For the eight birds that were implanted in MVL, it was possible to subsequently record from ENTO due to its position directly ventral to MVL in the pigeon brain²⁷. After advancing the electrode through the entire extent of MVL (2000 μm), the electrode was then advanced another 500 μm into ENTO, and subsequent recordings were performed through the extent of ENTO (3000 μm). For two birds (X9 and X29) we recorded directly from ENTO to balance the number of recorded neurons across MVL and ENTO. Recording sessions took approximately 1 h to complete. Pigeons completed one session daily for 5 days a week.

Histology and electrode track reconstruction

At the end of the experiment, a 9 V potential was sent through each electrode for 10 s to create an electrolytic lesion marking the recording position of each electrode at the termination point in ENTO, MVL and Wulst. The pigeons were then euthanized using carbon dioxide gas, and were perfused with physiological saline and 10% formalin. The brains were removed from the skull and kept in 10% formalin for at least 5 days, followed by sucrose formalin (10% formalin, 30% sucrose). The brains were frozen and sliced into 40 µm sections and stained with thionin. Track reconstructions were made using the position of the electrolytic lesion and depth records. All electrode tracks were within the borders of the targeted ENTO, MVL and Wulst regions²⁷ (Fig. 2b, and see Supplementary Table S1 for coordinates of electrode positions).

Results

Response dynamics of Wulst, ENTO and MVL single neurons

We analyzed neuronal responses on all 160 trials that the bird successfully inhibited responses to images until the grey square appeared, and discarded correction trials data from the analysis. To determine whether the neurons were visually responsive, each recorded neuron’s firing rates were first compared during 500 ms window post stimulus onset with a 500 ms window in the baseline ITI period using a paired t-test (p < 0.05). In Wulst we recorded from a total of 96 neurons of which 51 (53%) were visually responsive. In ENTO we recorded from a total of 140 neurons of which 88 (62%) were visually responsive. In MVL we recorded from a total of 120 neurons of which 77 (64%) were visually responsive. The proportion of neurons that were visually responsive was similar between the three regions (χ² (2) = 3.18, p = 0.2).

There were differences in the proportions of visually-responsive neurons that were excitatory and inhibitory between the three regions (Fig. 3a,b). Wulst displayed a similar proportion of 26 excitatory (51%) and 25 inhibitory (49%) neurons. In ENTO we found a greater proportion of 53 inhibitory (61%) compared with 35 excitatory (39%) neurons, whereas we found a greater proportion of 58 excitatory (75%) compared with 19 inhibitory (25%) neurons in MVL. There was a significant difference in the proportions of excitatory and inhibitory neurons between ENTO and MVL (χ² (1) = 21.1, p = 0.00001), but not ENTO and Wulst (χ² (1) = 1.64, p = 0.19).

The behavioural task required that the birds actively inhibited responses during the randomised pause period prior to visual stimuli appearing on the screen. Differences in neural activity between the three regions were observed during the pause period for the visually-responsive neurons (Fig. 3b). In Wulst, 26 of the 51 visually-responsive neurons (51%) also showed excitatory or inhibitory responses in the pause period. In ENTO, 57 of the 88 visually-responsive neurons (65%) responded in an excitatory or inhibitory manner during the pause period. In MVL, 34 of the 77 visually-responsive neurons (44%) responded with excitatory or inhibitory activity during the pause period. The relative proportion of visually-responsive neurons that became active during the pause period was substantially greater in ENTO relative to MVL (χ² (1) = 7.05, p = 0.007), but not in ENTO relative to Wulst (χ² (1) = 2.55, p = 0.11). The increased responsivity of ENTO during the pause period suggests that visually-responsive neurons may be modulated to a greater extent by attentional processes in anticipation of the upcoming visual stimulus in ENTO than at the level of MVL.

Single-unit analysis of selectivity in Wulst, ENTO and MVL

To determine if a visually-responsive neuron was sensitive to a particular grouping of stimuli, we compared the responses to each of the five stimulus groupings using a one-way AVOVA (p < 0.05). Neurons with a significant effect of stimulus grouping were further assessed using a Tukey Honest Significant Difference post-hoc comparison test (p < 0.05) in order to determine to which stimuli the neuron was responding, and a selectivity index (SI) was calculated to determine the magnitude of selectivity of the response (Fig. 4). The SI expresses the ratio of the average excitatory or inhibitory response to the preferred stimulus grouping of the neuron relative to the responses for the other stimulus groupings (see Supplementary Materials for single-unit data analysis). The classification system was previously used to map neuronal selectivity inside and outside of fMRI identified patches in macaque IT cortex²⁹. Note that the classification of stimulus-selective neurons does not imply that a given neuron is exclusively “selective” for that particular grouping of stimuli, merely that of the five stimulus groupings tested, the preferred stimulus grouping produced the strongest response.

Of the 51 visually-responsive neurons in Wulst, 4 (8%) displayed a significant effect of stimulus grouping. The Wulst stimulus-selective neurons responded best to scrambled pigeon faces (n = 2: 50%) with a SI of 0.62 ± 0.22, human faces (n = 1: 25%) with a SI of 0.28, and sine gratings (n = 1: 25%) with a SI of 0.43. In ENTO, 10 of the 88 visually-responsive neurons (11%) displayed a significant effect of stimulus grouping. The ten ENTO stimulus-selective neurons responded best to sine gratings (n = 2: 20%) with an average SI of 0.64 ± 0.2, scrambled pigeon faces (n = 3: 30%) with an average SI of 0.46 ± 0.15, scrambled human faces (n = 3: 30%) with an average SI of 0.37 ± 0.17, and human faces (n = 2: 20%) with a SI of 0.33 ± 0.03.

MVL displayed the greatest proportion of visually-responsive neurons sensitive to particular stimulus groupings, with 22 of the 77 visually-responsive neurons (29%) displaying a significant effect of stimulus grouping (Fig. 4). Three of the stimulus-selective MVL neurons responded best to sine gratings (n = 3: 14%) with an average SI of 0.5 ± 0.17. A greater number of MVL stimulus-selective neurons showed strong selectivity for scrambled pigeon faces (n = 5: 23%) with an average SI of 0.42 ± 0.11, and scrambled human faces (n = 6: 27%: see Fig. 5 for an example cell) with an average SI of 0.37 ± 0.16. Other MVL stimulus-selective neurons responded best to pigeon faces (n = 5: 22%: see Fig. 5 for an example cell) with an average SI of 0.25 ± 0.09, and human faces (n = 3: 14%) with an average SI of 0.2 ± 0.04. There were significant differences in the proportions of visually-responsive neurons in MVL that displayed a significant effect of stimulus grouping relative to ENTO (χ² (1) = 7.77, p = 0.005) and Wulst (χ² (1) = 8.14, p = 0.004).

Next, we verified that the number of neurons identified as stimulus selective in MVL was above expected chance level by generating simulated data of randomised firing rates for each stimulus grouping between the maximum and minimum values displayed by the real neurons during the stimulus period trials. We performed a one-way AVOVA (p < 0.05) comparing the responses between the five stimulus groupings for each of the 77 simulated visually-responsive neurons. Of the 77 simulated visually-responsive neurons, 4 (5%) displayed a significant effect of stimulus grouping, verifying that the high proportion of 22 out of 77 visually-responsive neurons (29%) that were stimulus selective was significantly greater relative to chance level (χ² (1) = 14.99, p = 0.0001).

Given that some neurons responded to the scrambled pictures, we next examined whether it was potentially the high spatial frequency information (corresponding to fine details and sharp edges/corners), or the low spatial frequency information (more global shape and broad swaths of luminance), to explain the selective responses of MVL neurons. To quantify the feature information of each stimulus grouping, we performed an analysis of the images’ Fourier amplitude spectrum (see Supplementary Materials). The spectral analysis showed that the grid-scrambling procedure resulted in greater high spatial frequency information for the scrambled images in comparison with the unscrambled images (Fig. 6a). Moreover, the scrambled images’ spectral information differed from the human faces and pigeon faces, which were highly correlated for low spatial frequencies (Fig. 6b). While we did not match the images’ luminance before performing image scrambling, cells responsive to scrambled images didn’t also respond to the unscrambled human and pigeon versions that shared the same luminance, and vice versa. The selective responses to scrambled or face images are therefore attributable to the differences in the features of the images, rather than differences in luminance.

Population-level analysis of selectivity in Wulst, ENTO and MVL

Our single-unit findings indicated that MVL displayed a greater proportion of stimulus-selective neurons than ENTO and Wulst. That said, it is clear that coding object information is achieved mainly by a population of neurons^{2, 20, 30}. We therefore next examined whether at the population level, MVL also discriminated better among the stimulus groupings. We evaluated the stimulus-discrimination capacity of all the visually-responsive neurons sampled from each region using a LDA with permutation resampling (see Supplementary Materials for population data analysis). As each neuron was recorded on sequential days and one cannot associate the responses of the single-trial firing rates to form true multivariate observations, the LDA procedure was performed 1200 times while shuffling the data lists before associating vectors with stimulus grouping labels (permutation resampling), to generate different sets of vectors from the same data. Others have used LDA with permutation resampling to extract information from single-trail responses to objects in monkeys³⁰.

For the Wulst population, the distribution of the receiver operating characteristic (ROC) for each stimulus grouping did not deviate significantly from chance performance under the null hypothesis distribution (Fig. 7), indicating that very little stimulus feature information was accessible from the population code of the thalamofugal visual pathway. The population response of ENTO distinguished between most of the stimulus groupings with classification performance higher than > 99% of the samples of the estimated null hypothesis distribution, with the exception of scrambled pigeon faces with > 90% of the samples higher than the null hypothesis distribution (Fig. 7). The MVL population response, however, discriminated between all of the stimulus groupings tested with > 99% classification performance compared with the null hypothesis distribution, displaying greater stimulus feature information than the Wulst and ENTO populations (Fig. 7). We verified that the population responses of the LDA classifier for the three regions generalised to held-out images for each stimulus grouping (see Fig. 1 in Supplementary Materials).

We also compared the average response to the original images versus the scrambled images for all of the visually-responsive neurons from each region to determine whether they displayed a preference for the scrambled images over the originals. There were no significant differences in the average responses for the scrambled images (mean: 6.88 spikes/s) compared with the original face images (mean: 6.39 spikes/s), (paired t-test, t(76) = 5.56, p = 0.07) for the MVL visually responsive neurons. There were also no significant differences between the average responses to the scrambled images (mean: 9.01 spikes/s) compared with the original images (mean: 9.14 spikes/s) for the 88 visually-responsive ENTO neurons (paired t-test, t(87) = 1.5, p = 0.39). Likewise, there were no significant differences between the population average responses to the scrambled images (mean: 4.68 spikes/s) compared with the original images (mean: 4.77 spikes/s) for the 51 visually-responsive Wulst neurons (paired t-test, t(50) = − 0.95, p = 0.52).

Discussion

We investigated the response selectivity of three visual forebrain areas of the pigeon brain at the single cell and population level, and determined the coding principles of these regions, with a special emphasis on face perception. We found that the pigeon MVL displays a greater proportion of stimulus-selective neurons than ENTO and Wulst. None of the stimulus-selective neurons identified were truly face-selective, consistent with past studies of the pigeon^{25, 31} and crow³² visual system.

Our finding of single neurons responsive to scrambled images in MVL was likely due to the overall increase in power across all spatial frequencies when compared with the original images. Sensitivity to additional power introduced by image scrambling is well documented in mammals, where it is mostly observed in early visual cortex of macaque monkeys³³, rats³⁴, and also the early layers of computational models of object recognition³⁵. ENTO and MVL are part of separate layers in the DVR, with ENTO receiving its primary sensory input from the thalamus¹⁵. Since both the mesopallium and nidopallium layers are both heavily reciprocally connected with ENTO^{13, 15}, MVL is a natural candidate for displaying sensitivity to more complex visual form than at the level of ENTO. We observed in the present study that while the numbers of stimulus selective cells is greater in MVL relative to ENTO, the selectivity may be driven by low-level image features, as would be expected in early visual cortex.

Beyond the single-unit analysis, the population response of the visually-responsive neurons sampled from the three regions indicates that information associated with stimulus features is recoded in a more readable form between ENTO and MVL. A recent study using LDA and adopting the same stimulus set used in studies of macaque² and human³⁶ IT cortex showed that MVL responses distinguished between the features of animate and inanimate object categories with greater accuracy than an ENTO population²⁰. The differences in population responses that we observed between ENTO and MVL suggests that neurons in separate layers of the DVR display different sensitivity to visual form within the avian canonical circuitry. However, these differences are not consistent with a progression in selectivity for objects over scrambled stimuli equivalent to that observed between early visual and extrastriate cortex. Consistent with divergent strategies for object recognition in birds compared with primates, pigeons more readily attend to the local details of stimuli rather than their global configuration^{37, 38}. For example, pigeons strongly rely on the high spatial frequency components of images viewed in the frontal visual field as the most diagnostic level of information during picture memorization³⁹. While it is possible to train pigeons to report information at the global scale by directing attention to features shared across images belonging to the same category^{40, 41}, they are predisposed to attend to stimuli at the local level.

Some of the stimulus-selective MVL neurons did also respond best to images of faces. The average selectivity index values for faces displayed by MVL neurons, however, are lower than the minimum selectivity index value of 0.33 (corresponding to a 2:1 ratio of face-to-non-face category response) required for classification of face-selective neurons in macaque face-patches (average SI for faces of 0.87)⁴². It is possible that the lack of strong selectivity for faces is related to fundamental differences between birds and primates in holistic face processing⁴³. Unlike humans and non-human primates, pigeons’ memory performance for images of primate faces is unimpaired by inversion of the faces⁴⁴, and discriminations between faces are based primarily on an additive integration of local features⁴⁵. The absence of strong selectivity displayed to the human and pigeon faces suggests that pigeons do not possess circuitry dedicated to face-perception analogous to the face-patch system, and these selective responses may reflect sensitivity to the general similarity in low spatial frequency content shared across the natural images depicting faces. It is also important to note that future studies with the aim of assessing face-selectivity in the avian brain will need to also include images of non-face objects to disentangle face-selective responses from a general selectivity for natural images over scrambled stimuli.

Understanding how information is processed among the different visual regions of the avian brain is still very much in its infancy. On the basis of the findings from previous studies and the current study, we tentatively propose that object categorisation in the pigeon brain may not depend on a stage of holistic/viewpoint-invariant representation comparable to higher stages of the primate ventral stream. We cannot rule out the possibility that the visual nidopallium¹⁶ and the associative nidopallium caudolaterale (the avian equivalent of prefrontal cortex^{46, 47}) may be involved in categorical representation of objects. It is also possible that the thalamofugal pathway integrates representations of object features at a more global spatial scale when presented laterally at distance in comparison with the tectofugal pathway. The low number of stimulus-selective neurons we found in Wulst may be because the thalamofugal pathway participates mainly in lateral object vision^8,9,10. As the Wulst also displays small receptive field sizes^{6, 7} future studies in freely moving pigeons could use search stimuli⁴⁸ to map the receptive fields of neurons so that image size and position in the visual field is adjusted according to a given neuron’s preference.

In summary, we found evidence that MVL displays greater selectivity to visual stimuli in comparison with ENTO. In comparison with the tectofugal pathway, the Wulst of the thalamofugal pathway is less involved in object feature analysis in the frontal visual field, and is likely to be specialised for lateral object vision. Further electrophysiological studies are required to determine how the transformation of information between different layers of the sensory DVR and Wulst constructs representations of objects in the avian brain.

Data availability

All data that support the findings of this study are available from the corresponding author upon reasonable request.

Code availability

The analysis code supporting the findings of this study are available from the corresponding author upon reasonable request.

References

Lafer-Sousa, R. & Conway, B. R. Parallel, multi-stage processing of colors, faces and shapes in macaque inferior temporal cortex. Nat. Neurosci. 16, 1870. https://doi.org/10.1038/nn.3555 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kiani, R., Esteky, H., Mirpour, K. & Tanaka, K. Object category structure in response patterns of neuronal population in monkey inferior temporal cortex. J. Neurophysiol. 97, 4296–4309. https://doi.org/10.1152/jn.00024.2007 (2007).
Article PubMed Google Scholar
Bao, P., She, L., McGill, M. & Tsao, D. Y. A map of object space in primate inferotemporal cortex. Nature 583, 103–108. https://doi.org/10.1038/s41586-020-2350-5 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Miceli, D., Marchand, L., Repérant, J. & Rio, J. P. Projections of the dorsolateral anterior complex and adjacent thalamic nuclei upon the visual Wulst in the pigeon. Brain Res. 518, 317–323. https://doi.org/10.1016/0006-8993(90)90990-S (1990).
Article CAS PubMed Google Scholar
Karten, H. J., Hodos, W., Nauta, W. J. & Revzin, A. M. Neural connections of the “visual wulst” of the avian telencephalon. Experimental studies in the pigeon (Columba livia) and owl (Speotyto cunicularia). J. Comp. Neurol. 150, 253–277. https://doi.org/10.1002/cne.901500303 (1973).
Article CAS PubMed Google Scholar
Pettigrew, J. D. & Konishi, M. Neurons selective for orientation and binocular disparity in the visual Wulst of the barn owl (Tyto alba). Science 193, 675–678. https://doi.org/10.1126/science.948741 (1976).
Article CAS PubMed ADS Google Scholar
Ng, B. S. W., Grabska-Barwińska, A., Güntürkün, O. & Jancke, D. Dominant vertical orientation processing without clustered maps: Early visual brain dynamics imaged with voltage-sensitive dye in the pigeon visual Wulst. J. Neurosci. 30, 6713–6725. https://doi.org/10.1523/JNEUROSCI.4078-09.2010 (2010).
Article CAS PubMed Google Scholar
Hahmann, U. & Güntürkün, O. The visual acuity for the lateral visual field of the pigeon (Columba livia). Vis. Res. 33, 1659–1664. https://doi.org/10.1016/0042-6989(93)90031-Q (1993).
Article CAS PubMed Google Scholar
Budzynski, C. A. & Bingman, V. P. Participation of the thalamofugal visual pathway in a coarse pattern discrimination task in an open arena. Behav. Brain Res. 153, 543–556. https://doi.org/10.1016/j.bbr.2004.01.011 (2004).
Article PubMed Google Scholar
Güntürkün, O. & Hahmann, U. Functional subdivisions of the ascending visual pathways in the pigeon. Behav. Brain Res. 98, 193–201. https://doi.org/10.1016/S0166-4328(98)00084-9 (1999).
Article PubMed Google Scholar
Karten, H. J. & Hodos, W. Telencephalic projections of the nucleus rotundus in the pigeon (Columba livia). J. Comp. Neurol. 140, 35–51. https://doi.org/10.1002/cne.901400103 (1970).
Article CAS PubMed Google Scholar
Nguyen, A. P. et al. A dissociation of motion and spatial-pattern vision in the avian telencephalon: Implications for the evolution of “visual streams”. J. Neurosci. 24, 4962–4970. https://doi.org/10.1523/JNEUROSCI.0146-04.2004 (2004).
Article CAS PubMed PubMed Central Google Scholar
Krützfeldt, N. O. & Wild, J. M. Definition and novel connections of the entopallium in the pigeon (Columba livia). J. Comp. Neurol. 490, 40–56. https://doi.org/10.1002/cne.20627 (2005).
Article PubMed Google Scholar
Gu, Y., Wang, Y., Zhang, T. & Wang, S. R. Stimulus size selectivity and receptive field organization of ectostriatal neurons in the pigeon. J. Comp. Physiol. 188, 173–178. https://doi.org/10.1007/s00359-002-0290-1 (2002).
Article Google Scholar
Stacho, M. et al. A cortex-like canonical circuit in the avian forebrain. Science 369, 5534. https://doi.org/10.1126/science.abc5534 (2020).
Article CAS Google Scholar
Stacho, M., Ströckens, F., Xiao, Q. & Güntürkün, O. Functional organization of telencephalic visual association fields in pigeons. Behav. Brain Res. 303, 93–102. https://doi.org/10.1016/j.bbr.2016.01.045 (2016).
Article PubMed Google Scholar
Hellmann, B. & Güntürkün, O. Visual-field-specific heterogeneity within the tecto-rotundal projection of the pigeon. Eur. J. Neurosci. 11, 2635–2650. https://doi.org/10.1046/j.1460-9568.1999.00681.x (1999).
Article CAS PubMed Google Scholar
Remy, M. & Güntürkün, O. Retinal afferents to the tectum opticum and the nucleus opticus principalis thalami in the pigeon. J. Comp. Neurol. 305, 57–70 (1991).
Article CAS Google Scholar
Koenen, C., Pusch, R., Bröker, F., Thiele, S. & Güntürkün, O. Categories in the pigeon brain: A reverse engineering approach. J. Exp. Anal. Behav. 105, 111–122. https://doi.org/10.1002/jeab.179 (2016).
Article PubMed Google Scholar
Azizi, A. H. et al. Emerging category representation in the visual forebrain hierarchy of pigeons (Columba livia). Behav. Brain Res. 356, 423–434. https://doi.org/10.1016/j.bbr.2018.05.014 (2019).
Article PubMed Google Scholar
Shimizu, T. Conspecific recognition in pigeons (Columba livia) using dynamic video images. Behaviour 135, 43–54. https://doi.org/10.1163/156853998793066429 (1998).
Article Google Scholar
Patton, T. B., Szafranski, G. & Shimizu, T. Male pigeons react differentially to altered facial features of female pigeons. Behaviour 147, 757–773. https://doi.org/10.1159/000314283 (2010).
Article Google Scholar
Watanabe, S. & Troje, N. F. Towards a “virtual pigeon”: A new technique for investigating avian social perception. Anim. Cogn. 9, 271–279. https://doi.org/10.1007/s10071-006-0048-1 (2006).
Article PubMed Google Scholar
Kendrick, K. M. & Baldwin, B. A. Cells in temporal cortex of conscious sheep can respond preferentially to the sight of faces. Science 236, 448–450. https://doi.org/10.1126/science.3563521 (1987).
Article CAS PubMed ADS Google Scholar
Clark, W. J., Porter, B. & Colombo, M. Searching for face-category representation in the avian visual forebrain. Front. Physiol. 10, 140. https://doi.org/10.3389/fphys.2019.00140 (2019).
Article PubMed PubMed Central Google Scholar
Bilkey, D. K., Russell, N. & Colombo, M. A lightweight microdrive for single-unit recording in freely moving rats and pigeons. Methods 30, 152–158. https://doi.org/10.1016/S1046-2023(03)00076-8 (2003).
Article CAS PubMed Google Scholar
Karten, H. J. & Hodos, W. Stereotaxic Atlas of the Brain of the Pigeon (Columba livia) (The Johns Hophins Press, 1967).
Google Scholar
Reiner, A. et al. Revised nomenclature for avian telencephalon and some related brainstem nuclei. J. Comp. Neurol. 473, 377–414. https://doi.org/10.1002/cne.20118 (2004).
Article PubMed PubMed Central Google Scholar
Bell, A. H. et al. Relationship between functional magnetic resonance imaging-identified regions and neuronal category selectivity. J. Neurosci. 31, 12229–12240. https://doi.org/10.1523/JNEUROSCI.5865-10.2011 (2011).
Article CAS PubMed PubMed Central Google Scholar
Gochin, P. M., Colombo, M., Dorfman, G. A., Gerstein, G. L. & Gross, C. G. Neural ensemble coding in inferior temporal cortex. J. Neurophysiol. 71, 2325–2337. https://doi.org/10.1152/jn.1994.71.6.2325 (1994).
Article CAS PubMed Google Scholar
Scarf, D., Stuart, M., Johnston, M. & Colombo, M. Visual response properties of neurons in four areas of the avian pallium. J. Comp. Physiol. 202, 235–245. https://doi.org/10.1007/s00359-016-1071-6 (2016).
Article Google Scholar
Veit, L., Hartmann, K. & Nieder, A. Neuronal correlates of visual working memory in the corvid endbrain. J. Neurosci. 34, 7778–7786. https://doi.org/10.1523/JNEUROSCI.0612-14.2014 (2014).
Article CAS PubMed PubMed Central Google Scholar
Rainer, G., Augath, M., Trinath, T. & Logothetis, N. K. The effect of image scrambling on visual cortical BOLD activity in the anesthetized monkey. Neuroimage 16, 607–616. https://doi.org/10.1006/nimg.2002.1086 (2002).
Article PubMed Google Scholar
Vinken, K., Van den Bergh, G., Vermaercke, B. & Op de Beeck, H. P. Neural representations of natural and scrambled movies progressively change from rat striate to temporal cortex. Cereb. Cortex 26, 3310–3322. https://doi.org/10.1093/cercor/bhw111 (2016).
Article PubMed PubMed Central Google Scholar
Stojanoski, B. & Cusack, R. Time to wave good-bye to phase scrambling: Creating controlled scrambled images using diffeomorphic transformations. J. Vis. 14, 6–6. https://doi.org/10.1167/14.12.6 (2014).
Article PubMed Google Scholar
Kriegeskorte, N. et al. Matching categorical object representations in inferior temporal cortex of man and monkey. Neuron 60, 1126–1141. https://doi.org/10.1016/j.neuron.2008.10.043 (2008).
Article CAS PubMed PubMed Central Google Scholar
Cavoto, K. K. & Cook, R. G. Cognitive precedence for local information in hierarchical stimulus processing by pigeons. J. Exp. Psychol. Anim. Behav. Process. 27, 3. https://doi.org/10.1037/0097-7403.27.1.3 (2001).
Article CAS PubMed Google Scholar
Aust, U. & Braunöder, E. Transfer between local and global processing levels by pigeons (Columba livia) and humans (Homo sapiens) in exemplar-and rule-based categorization tasks. J. Comp. Psychol. 129, 1. https://doi.org/10.1037/a0037691 (2015).
Article PubMed Google Scholar
Murphy, M. S., Brooks, D. I. & Cook, R. G. Pigeons use high spatial frequencies when memorizing pictures. Exp. Psychol. Anim. Learn. Cogn. 41, 277. https://doi.org/10.1037/xan0000055 (2015).
Article Google Scholar
Cook, R. G., Goto, K. & Brooks, D. I. Avian detection and identification of perceptual organization in random noise. Behav. Process. 69, 79–95. https://doi.org/10.1016/j.beproc.2005.01.006 (2005).
Article Google Scholar
Lea, S. E., De Filippo, G., Dakin, R. & Meier, C. Pigeons use low rather than high spatial frequency information to make visual category discriminations. J. Exp. Psychol. Anim. Behav. Process. 39, 377. https://doi.org/10.1037/a0033104 (2013).
Article PubMed Google Scholar
Tsao, D. Y., Freiwald, W. A., Tootell, R. B. & Livingstone, M. S. A cortical region consisting entirely of face-selective cells. Science 311, 670–674. https://doi.org/10.1126/science.1119983 (2006).
Article CAS PubMed PubMed Central ADS Google Scholar
Brecht, K. F., Wagener, L., Ostojić, L., Clayton, N. S. & Nieder, A. Comparing the face inversion effect in crows and humans. J. Comp. Physiol. 203, 1017–1027. https://doi.org/10.1007/s00359-017-1211-7 (2017).
Article Google Scholar
Phelps, M. T. & Roberts, W. A. Memory for pictures of upright and inverted primate faces in humans (Homo sapiens), squirrel monkeys (Saimiri sciureus), and pigeons (Columba livia). J. Comp. Psychol. 108, 114. https://doi.org/10.1037/0735-7036.108.2.114 (1994).
Article CAS PubMed Google Scholar
Jitsumori, M. & Yoshihara, M. Categorical discrimination of human facial expressions by pigeons: A test of the linear feature model. Q. J. Exp. Psychol. B 50, 253–268. https://doi.org/10.1080/713932657 (1997).
Article Google Scholar
Güntürkün, O. The avian ‘prefrontal cortex’ and cognition. Curr. Opin. Neurobiol. 15, 686–693. https://doi.org/10.1016/j.conb.2005.10.003 (2005).
Article CAS PubMed Google Scholar
Rose, J. & Colombo, M. Neural correlates of executive control in the avian brain. PLoS Biol. 3, 190. https://doi.org/10.1371/journal.pbio.0030190 (2005).
Article CAS Google Scholar
Nieder, A. & Wagner, H. Perception and neuronal coding of subjective contours in the owl. Nat. Neurosci. 2, 660–663. https://doi.org/10.1038/10217 (1999).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by a Royal Society of New Zealand Marsden Fund grant 19-UOO-162 to Michael Colombo, and Roland Pusch was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—Project number 430157321—within the SPP 2205. We thank Hayley Chapman for performing surgeries and data collection, and Adam Bartoníček for advice on modelling.

Author information

Authors and Affiliations

Department of Psychology, University of Otago, Dunedin, New Zealand
William Clark, Kate Perry & Michael Colombo
Department of Physics, University of Otago, Dunedin, New Zealand
Matthew Chilcott
Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran (ABRII), Karaj, Iran
Amir Azizi
Department of Biopsychology, Institute of Cognitive Neuroscience, Faculty of Psychology, Ruhr University Bochum, Bochum, Germany
Roland Pusch

Authors

William Clark
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Chilcott
View author publications
You can also search for this author in PubMed Google Scholar
Amir Azizi
View author publications
You can also search for this author in PubMed Google Scholar
Roland Pusch
View author publications
You can also search for this author in PubMed Google Scholar
Kate Perry
View author publications
You can also search for this author in PubMed Google Scholar
Michael Colombo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.C. and Mi.C. designed the experiments. Mi.C. wrote all behavioural testing programs. W.C. and K.P. collected the data. W.C. analyzed the single-unit data, Ma.C. and A.A. analyzed the population data. W.C., Mi.C., A.A., R.P., K.P., and Ma.C. interpreted the data. W.C. and Mi.C. wrote the paper.

Corresponding author

Correspondence to William Clark.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figure 1.

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Clark, W., Chilcott, M., Azizi, A. et al. Neurons in the pigeon visual network discriminate between faces, scrambled faces, and sine grating images. Sci Rep 12, 589 (2022). https://doi.org/10.1038/s41598-021-04559-z

Download citation

Received: 14 August 2021
Accepted: 24 December 2021
Published: 12 January 2022
DOI: https://doi.org/10.1038/s41598-021-04559-z

This article is cited by

Figure-ground segmentation based on motion in the archerfish
- Svetlana Volotsky
- Ronen Segev
Animal Cognition (2024)
Gamma-band-based dynamic functional connectivity in pigeon entopallium during sample presentation in a delayed color matching task
- Xiaoke Niu
- Yanyan Peng
- Li Shi
Cognitive Neurodynamics (2024)
Visual categories and concepts in the avian brain
- Roland Pusch
- William Clark
- Onur Güntürkün
Animal Cognition (2023)
The effect of progressive image scrambling on neuronal responses at three stations of the pigeon tectofugal pathway
- William Clark
- Matthew Chilcott
- Michael Colombo
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Subjects

Apparatus

Stimuli

Behavioural task

Surgery

Neuronal recording

Histology and electrode track reconstruction

Results

Response dynamics of Wulst, ENTO and MVL single neurons

Single-unit analysis of selectivity in Wulst, ENTO and MVL

Population-level analysis of selectivity in Wulst, ENTO and MVL

Discussion

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links