Introduction

The processing of shape and form begins with the encoding of local orientation information in the visual scene by arrays of neural mechanisms selective for different orientations. For luminance vision, numerous psychophysical studies have demonstrated the presence of orientation tuning in human vision, based on measurements of masking1,2,3,4,5, adaptation6,7,8, orientation discrimination9, or subthreshold summation5,10. Far fewer psychophysical studies have investigated orientation selectivity for colour vision and have used adaptation11, masking12,13 or orientation discrimination14.

Curiously, studies using the classic method of overlay masking in colour vision, which employ high contrast sinewave masks, have found orientation tuning functions that are flat12, or broadly tuned13. These relatively flat masking functions occur even though there is good evidence from other sources to support the presence of orientation tuning in colour vision14,15,16. It has been proposed that high contrast masks flatten the measured orientation tuning functions because they activate a mechanism of gain control, which suppresses sensitivity evenly across all orientations. This effect becomes confounded with the orientation tuning of the underlying sensory detectors3,17 making it difficult to separate the two. In comparison, a relative advantage of the psychophysical method of subthreshold summation is that it uses low, near threshold contrasts and therefore avoids the suppressive effects of contrast gain control, potentially allowing more specific estimates of the underlying detector bandwidths.

Our recent experiments using subthreshold summation have found evidence for two different types of colour mechanism revealed at detection threshold18. At low spatial frequencies, significant levels of subthreshold summation were found between superimposed orthogonal chromatic gratings, supporting the presence of very broadly tuned colour mechanisms, whereas at higher spatial frequencies, the cross orientation summation was lost, indicating the presence of orientation tuning. On the basis of these results, Gheiratmand, et al.18 proposed that colour vision has an isotropic mechanism that determines threshold at low spatial frequencies particularly for monocular stimuli, switching to an orientation-tuned response at higher spatial frequencies.

Here, we further develop the subthreshold summation technique to measure full orientation tuning curves for human colour vision at two different spatial frequencies and for luminance vision under equivalent conditions. We derive estimates of the orientation bandwidth of the underlying chromatic responses based on measurements of how the amount of subthreshold summation declines as the orientation difference between two superimposed, sinusoidal gratings increases from 0 to 90 degs. Although it was initially thought that the tuning of this response is determined directly by the orientation bandwidths of the underlying sensory detectors19,20, we now know that the response tuning is also influenced by the orientation bandwidths of the visual stimuli and the summation across space of the neural responses to the stimuli5,9,21,22. Thus in order to extract the bandwidths of the underlying neural detectors from the response tuning functions, we use a modeling approach to take into account the effect of stimulus bandwidth and spatial summation.

In this paper, we first measure subthreshold summation as a function of the increasing orientation difference between two superimposed sinusoidal gratings (the response tuning functions), using the stimuli illustrated in Figure 1. We obtain these functions for luminance and colour contrast at two different stimulus spatial frequencies and fit them with von Mises functions to quantify their shapes and bandwidths. We then develop a model of the response functions to determine estimates of the orientation bandwidths of the underlying detectors using a spatial probability summation rule between outputs of a bank of orientation tuned filters. In particular, we aim to determine whether there is significant broadening in orientation tuning for lower spatial frequencies. In addition to a model of orientation tuning, we also employ a simple isotropic model, which lacks all orientation tuning and relies on a nonlinear neural summation.

Figure 1
figure 1

Illustration of example stimuli.

Examples of the red-green and achromatic component gratings and plaid stimuli at (a) 1.5 c/deg and (b) 0.375 c/deg, shown at high contrasts for illustrative purposes. Gratings are shown at 50% contrast.

At a mid spatial frequency (1.5 c/deg), we find evidence for orientation-tuned detectors that have similar bandwidths for isoluminant colour and achromatic stimuli. At a low spatial frequency (0.375 c/deg), however, distinctly different results were obtained: the orientation tuning curves for chromatic stimuli become very broad and are compatible with isotropic detectors, whereas for achromatic stimuli, orientation tuning is preserved with no change in bandwidth of the underlying detectors. These results support the presence of two types of spatial detector in colour vision: an orientation-tuned detector that can extract information about chromatic edges and contours in the visual scene and a non-oriented detector, relatively more sensitive at low spatial frequencies, better suited to the representation of chromatic ‘blobs’ or larger areas of colour.

Results

Orientation tuning for colour and luminance contrast: mid versus low spatial frequencies

Figure 2 shows orientation tuning curves for two spatial frequencies, a mid spatial frequency of 1.5 c/deg (Fig. 2a) and a low spatial frequency of 0.375 c/deg (Fig. 2b), both viewed monocularly. Results are shown for chromatic (red circles) and achromatic stimuli (black triangles). The different panels show the results for three different subjects with the average results of all three subjects given in the top panel. When the two gratings are co-oriented, or very closely oriented, the summation ratio (SR) is highest, at around a factor of 2 (6 dB), which is as expected from the physical combination of two identical co-oriented stimuli. As the orientation difference increases, however, the summation ratio falls to reveal the orientation tuning of the function.

Figure 2
figure 2

Orientation tuning data for four experiment conditions.

Orientation tuning curves for red-green (red circles) and achromatic (black triangles) stimuli at (a) 1.5 c/deg and (b) 0.375 c/deg for three subjects and their average. Summation ratio (right axis) also expressed in dB (left axis), is plotted as a function of relative orientation between the two gratings. Error bars show ± s.e.m. for the averaged data. Sample error bars on individual data are the average s.d.'s across orientation, calculated using Gaussian error propagation.

A three-way mixed design ANOVA (2 (chromaticity) × 6 (orientation) × 2 (spatial frequency)) showed a significant main effect of orientation: F(5, 20) = 77.563, p < 0.001. Average summation ratio decreases as relative orientation increases. There are clear differences in the results for colour and achromatic stimuli that depend on the spatial frequency of the stimuli. The interaction between chromaticity and spatial frequency was significant: F(1, 4) = 8.62, p = 0.043. The average summation ratio is similar for colour and achromatic stimuli at 1.5 c/deg, but higher for colour stimuli compared to achromatic at 0.375 c/deg. The main effect of spatial frequency was significant: F(1,4) = 18.10, p = 0.013. A Bonferroni posthoc test showed that the average summation ratio at 0.375 c/deg is significantly higher than 1.5 c/deg (p = 0.013). The main effect of chromaticity was also significant: F(1,4) = 13.62, p = 0.021, with the average summation ratio being higher for the colour condition compared to achromatic (Bonferroni posthoc test: p = 0.021). Both spatial frequency and chromaticity effects are driven by the higher summation ratios found for the colour compared to achromatic stimuli at 0.375 c/deg. The interaction between spatial frequency and orientation was significant: F(5, 20) = 5.873, p = 0.002, showing that summation ratio is affected differently by orientation for the two different spatial frequency conditions.

At 1.5 c/deg (Fig. 2a), summation decreases for both colour and achromatic stimuli as the orientation difference between the two component gratings increases, dropping to approximately half by 15 degs and reaching minimal values at larger orientation differences (>30 degs). This indicates the presence of orientation-tuned detectors with constrained bandwidths that generate little or no response to stimuli of widely separated and orthogonal orientations. At this spatial frequency, the chromatic and achromatic tuning curves are very similar for all subjects (AB, MG and DK).

At 0.375 c/deg (Fig. 2b), the colour and achromatic tuning curves are substantially different. For the achromatic stimuli, summation ratios decline as the orientation difference increases, reaching relatively low values (<1 dB) by 45 degs. This suggests the presence of orientation-tuned detectors with constrained bandwidths. In contrast, for the chromatic stimuli, the orientation tuning curves are flatter and maintain a relatively high summation ratio (a factor of 1.4 or 3 dB) even for large orientation differences (45–90 degs), suggesting that both component gratings are producing a response within a common neural mechanism with a very broad orientation bandwidth.

An earlier report measured subthreshold summation between pairs of orthogonal red-green gratings across two spatial frequencies in six subjects, with each subject completing both conditions18. A significant effect of spatial frequency was found, with higher summation ratios at lower spatial frequencies. This effect is compatible with the very broad orientation tuning seen for the chromatic, low spatial frequency stimuli in Figure 2.

In order to quantify and compare these orientation-tuning curves we fitted the individual and averaged functions with a one-dimensional von Mises function7,23,24. A von Mises function is the circular version of the Gaussian function and is appropriate for use with orientation-based data. It is also thought to be the best function to fit to neurophysiological orientation tuning data25. The range of the SR data varies between 1 and 2, so we use equation (1) to match the range of the experimental data:

where A is the scaling factor and VM is the von Mises function:

where I0(k) is a modified Bessel function kind one of order of zero and k is the concentration factor that determines the response bandwidth. The larger the k value, the narrower is the bandwidth of the tuned curve. k is a free parameter and is optimized using fminsearch routine in Matlab. A is fixed to 1 for monocular conditions, because the maximum of monocular data is SR = 2 at 0 deg. The response bandwidth, Half Width at Half Height (HWHH), is calculated from the fitted k parameter according to equation (3):

Figure 3a and b shows the von Mises fits to averaged colour (red circles) and achromatic (black triangles) data for 1.5 c/deg and 0.375 c/deg stimuli. The k parameter, the bandwidth calculated from equation (3) and the goodness of fit measures (sum of squared errors, SSE and R-squared) are shown in the inset of each panel. Fitted response bandwidths are: 15 degs and 12 degs for the chromatic and achromatic data at 1.5 c/deg, respectively and 47 degs and 19 degs for average chromatic and achromatic data at 0.375 c/deg, respectively (see Table 1). The fits to the data of individual subjects are available in the Supplementary Material (Fig. S1 and Table S1). Figure 3c plots the response bandwidths for the individual fits of the three subjects for each stimulus condition. At 0.375 c/deg, response bandwidth for the colour stimulus is wider than for the achromatic by about 30 degs, whereas bandwidths at 1.5 c/deg are similar for colour and luminance vision with colour only 3 degs wider than luminance (Table 1).

Table 1 Response bandwidth and neural detector bandwidth estimates for different stimulus conditions, found from the descriptive Von Mises function fits and the tuned model fits respectively. Fits are found for each individual subject (average of the bandwidths ± s.d. is presented for each condition) and for the average of the three subjects data and the results are shown in different rows
Figure 3
figure 3

Fits to the response functions.

Von Mises (equation (1)) fits to averaged colour (red circles) and achromatic (black triangles) data at (a) 1.5 c/deg and (b) 0.375 c/deg. Estimate of the free parameter, k, response bandwidth and goodness of fit measures (SSE and R2) are shown in each panel. As bandwidth narrows, k increases. Error bars show ± s.e.m. R2 indicates the proportion of variance in the data accounted for by the model fit, varying from 0 to 1. (c) Response bandwidths for three subjects for each stimulus condition and the average (solid line). (See Figure S1 and Table S1 in Supplementary Material for individual fits).

Model estimates of neural detector orientation bandwidth

The shape of the orientation tuning response functions depends on three key factors: the orientation bandwidths of the visual stimuli, the orientation bandwidths of the underlying neural detectors and the summation across space of the neural responses to the stimuli. In order to extract the bandwidths of the underlying neural detectors, we require a modeling approach to take into account the effect on the response tuning functions of stimulus bandwidth and spatial probability summation.

At 1.5 c/deg for both colour and achromatic contrast and at 0.375 c/deg for achromatic contrast, we find very little response summation between gratings with orientation differences ≥45 degs (Fig. 2a and 2b achromatic), which indicates the presence of independent orientation-tuned filters for the detection of each oriented grating. This supports the use of a model in which outputs of the independent filters20 are combined based on a probabilistic [minimal] combination rule, a ‘probability summation’ model22,26,27,28,29,30,31. On the other hand, the wide tuning curves found for the chromatic stimulus at the low spatial frequency, with significant summation between orthogonal gratings, implies the presence of isotropic or very broadband detector mechanisms that respond to a wide range of stimulus orientations. Hence, a model capable of representing these data needs to encompass a generous combination rule that has to occur within a neural mechanism, referred to as a ‘neural combination’ model18,32. Both models are discussed in detail in the following subsections. Their fits to our averaged data and estimates of bandwidths are shown in Figures 4 and 5.

Figure 4
figure 4

Fits of the orientation-tuned model.

Fits of the probability summation model to averaged colour (red circles) and achromatic (black triangles) data at (a) 1.5 c/deg and (b) 0.375 c/deg. Orientation bandwidth estimates (HWHH in degs) from the model fit are shown in each panel. (c) Neural detector bandwidths for three subjects with the average (solid line) for each stimulus condition as marked. (See Figure S2 and Table S2 in Supplementary Material for individual fits).

Figure 5
figure 5

Fit of the Isotropic model.

Fit of the isotropic model to the average colour data at 0.375 c/deg, with model scheme illustrated above. The contrast nonlinearity exponent, m, estimated from the isotropic model and goodness of fit are shown in each panel. Error bars show ± s.e.m. (See Figure S3 and Table S3 in Supplementary Material for individual fits.) The isotropic filter box in the schematic applies no spatial frequency or orientation constraints.

a) Tuned model

We use a spatial probability summation model modified after Wilson and Bergen31 and Phillips and Wilson5 to find estimates of neural detectors bandwidth for each stimulus condition. The model uses a bank of Cartesian separable log Gabor filters5,17,33 centered at the same spatial frequency as the stimulus and oriented from 0 to 179 degs in steps of 10 degs. The response of the model is a probabilistic combination of the filters outputs over space and orientation, which is approximated by a Minkowski summation:

where C is component grating's contrast, S(x,y) is the stimulus spatial pattern (grating or plaid), M = 267 is the stimulus dimension in pixels, Fi(x,y) is the ith filter in the bank and n = 18 (180 degs/10 degs) is the number of filters in the bank, * indicates a 2-dimensional convolution and m is the order of the Minkowski summation. m is fixed at 4, which has been consistently used in the literature for approximation of the probability summation rule27,31,34,35. For narrowly tuned filters, Minkowski summation becomes similar to a MAX operator for large m values.

The spatial frequency bandwidth of the filters is set to 1.6 octaves for SF = 1.5 c/deg and 2.2 octaves for SF = 0.375 c/deg, full width at half height, based on previous literature36,37 and is similar for colour and luminance mechanisms11,38,39. The orientation bandwidth of the filters is the free parameter. We ascertained that steps finer than 10 degs for the filter bank were redundant. From equation (4) contrast at threshold (Cthresh) is found for the grating and plaid stimuli by setting R to an arbitrary value, R = 1:

and summation ratio is calculated according to equation (9) in the Methods section.

The dashed curves in Figure 4a and b show the model fits to the averaged data for colour (red circles) and achromatic stimuli (black triangles) at 1.5 c/deg and 0.375 c/deg, respectively. The bandwidth estimate, the fit error (SSE) and R2 goodness of fit measure are included in the inset of each graph. Linear SR data were used for fitting and the best fit was chosen from a series of fits with filter orientation bandwidths varying from 5 to 62 degs in steps of 1 deg, based on the minimum root mean squared (RMS) error and highest R2.

At 1.5 c/deg, orientation bandwidth (HWHH) estimates for the averaged data are similar for colour and achromatic contrast at 16 and 12 degs, respectively (Table 1). At 0.375 c/deg, the fitted bandwidths were 9 degs for achromatic data averaged over three subjects and 62 degs for colour vision. (We note that 62 degs is the broadest possible bandwidth for our log Gabor filters). Figure 4c shows the filter bandwidths for the three subjects for different stimulus conditions, with averages given in Table 1. (Figure S2 and Table S2 of the Supplementary Material show the fits for individual subjects.)

The results show that the bandwidths of the colour detectors at 0.375 c/deg are extremely broad and considerably greater than the other three conditions and the filters overlap, even at orthogonal orientations, to generate an SR of 1.4 (~3 dB). This means there is considerable summation within a single neural mechanism, which responds to both component gratings. In the light of this, in the next section, we apply a different version of the model to the low spatial frequency chromatic data, which takes into account neural summation within a single isotropic detector (‘neural summation model’). The same model configuration is obtainable if we replace the tuned filters in the probability summation model with isotropic filters. The resulting curve is shown by the thin dashed line in Supplementary Figure S3.

b) Isotropic Model

The significant summation at large plaid angles of 45 to 90 degs for the low spatial frequency colour data (Fig. 2b, red circles) signifies the presence of very broadband or isotropic mechanisms that respond to orthogonal orientations. We test the hypothesis of an isotropic neural detector by applying an ‘isotropic model’ (modified after Meese and Summers40, Meese33 and Gheiratmand, et al.18). Similar to an energy model41,42, this model involves an isotropic filter, full-wave rectification, non-linear contrast transduction and spatial summation as illustrated in Figure 5:

where m is the contrast exponent order and the other parameters are as for equation (4). The isotropic filter has no spatial tuning and is transparent. We can find the threshold of the gratings and the plaids by setting R equal to an arbitrary value of 1:

The exponent order, m, is the only free parameter in the model, which is optimized using fminsearch routine in Matlab.

Figure 5 shows the model fit to the averaged colour data at 0.375 c/deg with the fit parameters included in the inset. The contrast exponent m equals 3.08 for the average fit (3.26 ± 2.02 s.d. averaged over three subjects), which is compatible with the range found for contrast nonlinearity in many other studies and the energy model29,33,40,41,42. The m value is also compatible with Pelli43 approximation of contrast exponent, m′ = β/1.25, where β is the slope of the Weibull psychometric function fit to data. The slope averaged over all stimulus conditions and all subjects is β = 2.95 ± 0.14 SE and from there m′ = 2.36. The use of an exponent determined directly from the measurements of the slopes of our psychometric functions (m = 2.36) also provides a good model fit to the data (Fig. S3), although we have used the best fitting exponent of m = 3.08. The isotropic model provides a reasonable fit to colour data at 0.375 c/deg (RMSE = 0.05 and R2 = 0.93 for the fit to the averaged data), supporting the plausibility of non-oriented colour mechanisms at low spatial frequencies. (Fits for individual subjects and parameter estimates are given in Figure S3 and Table S3 of the supplementary material.) We also fitted the isotropic model to the equivalent achromatic data at 0.375 c/deg for comparison purposes, but rejected it because it was a worse fit than the tuned model and failed to capture the key characteristics of the data, specifically the very low summation ratios for large plaid angles.

Note on response vs. neural detector bandwidths

It is interesting to note that stimulus-based effects may influence the response tuning, particularly for the low spatial frequencies that have fewer number of spatial cycles. For example for the chromatic stimuli at 0.375 c/deg, we see some tuning of the response even in the absence of any tuning of the underlying neural detectors (Fig. 5). This reflects the decline of stimulus energy for different plaid patterns at small angles, where beating occurs between overlaid grating components and affects the result of spatial summation. The model fit to the empirical data is reflecting the fact that the psychophysical response is dependent on the stimulus energy. For the low spatial frequency achromatic stimuli (Fig. 4b), the response tuning is broader than the fitted underlying detectors (by about 10 degs), again reflecting the influence of the stimulus-based effects. For the higher spatial frequency, the response bandwidths are very similar to the neural detector bandwidths. These stimuli have more spatial cycles and narrower orientation tuning (in relation to the detector bandwidth) and the stimulus effects have less influence on the response tuning. Thus explicit models are required to relate the response tuning to the underlying detector bandwidths, given the stimuli used.

Discussion

We have used the classical psychophysical method of subthreshold summation to measure orientation tuning of the visual detectors underlying human colour vision at different spatial frequencies. We fit the response functions using a von Mises function since this function provides a good fit and a measurement of bandwidth using the fewest parameters. However, this function, or any other, is simply a description of the data and provides little insight into the visual mechanisms involved in generating the response. In order to determine the orientation tuning of the underlying sensory detectors, we use a model involving arrays of orientation and spatial frequency tuned filters whose outputs are combined across the visual extent of the stimulus using a Minkowski summation rule. This method has been used previously to determine orientation5 and spatial frequency tuning26,31 for achromatic stimuli. By fitting this model, we obtain an estimate of the orientation bandwidths of the underlying detectors. Since, in some cases the best fitting orientation tuning was very broad and was at the maximum that could be achieved, we also used an isotropic version of the model with no orientation tuning.

At low spatial frequencies, we find that the chromatic response function is much broader than the achromatic. We found that an isotropic model, which assumes no orientation tuning of the underlying detectors, provided a good fit to the chromatic response function. The orientation-tuned model could only provide a reasonable account of the data if we used very broad detector bandwidths of 62 degs (HWHH), which was the broadest possible bandwidth yielded from our log Gabor filters. Effectively, a very broadly tuned and an isotropic model are similar in that both predict significant summation between orthogonal gratings. There are differences, however, in the shapes of the underlying spatial filters, as illustrated in Figure 6. Figure 6c illustrates the spatial filter obtained from the tuned model fit to the chromatic data. The filter is highly elongated along the horizontal axis in order to render the broad orientation tuning. There is no physiological evidence, however, for the elongated detector of the type. Figure 6e and f illustrates the filters that may underlie the isotropic model, for which the constraints of orientation and spatial frequency tuning have been removed. Crucially, these filters are circularly symmetric and may be low pass for chromatic spatial frequency (Fig. 6e), corresponding to type 1 or type 2 receptive fields as originally defined by Wiesel and Hubel44, or tuned to chromatic spatial frequency (Fig. 6f), so resembling a circularly symmetric dual opponent receptive field. There is plentiful evidence for these types of isotropic detectors in the primate cortex, as discussed further below (and see Shapley and Hawken45 for a review)46,47,48,49. Hence we conclude that an isotropic detector model provides more plausible account of the chromatic response at low spatial frequencies than a tuned model that requires an extreme filter bandwidth.

Figure 6
figure 6

Illustrations of the filter shapes underlying threshold detection.

Representation of the optimal Log Gabor filter estimated from the model fits for the average (a) colour (red-green) and (b) achromatic (grey) data at the mid spatial frequency. (c–d) optimal Log Gabor filter estimates for the colour and achromatic data at the low spatial frequency, respectively. (e–f) possible neural detectors underlying the isotropic model: (e) an isotropic (circularly symmetric) single opponent detector, low-pass for chromatic spatial frequency and (f) a band pass isotropic dual opponent detector tuned to chromatic spatial frequency. Red and green parts represent the stimulus color that the detector best responds to within its receptive field.

For the achromatic stimuli, the response functions are best fitted by a model with orientation-tuned detectors and yield bandwidths of 9 degs, as illustrated in Figure 6d. The isotropic model could not properly fit the achromatic response tuning functions.

At the mid spatial frequency of 1.5 c/deg, both chromatic and achromatic response functions show marked orientation tuning with the model fit indicating orientation-tuned detectors with only a small difference in bandwidth (16 degs for chromatic and 12 degs for achromatic). The presence of orientation tuning in colour vision, slightly broader than for achromatic contrast, is in agreement with previous studies using other methods11,13,14. Thus there is a clear effect of spatial frequency on the orientation tuning of colour contrast, with a switch from orientation tuned to isotropic detectors at low spatial frequencies, but for achromatic contrast we found no evidence for any increase in bandwidth as spatial frequency declined.

Previous masking studies using achromatic stimuli have reported that orientation bandwidth increases as spatial frequency decreases1,5,50, something that we do not find in our study. Meese and Holmes17 and Cass, et al.3 have argued that the suppressive effect of contrast normalization, which is thought to increase at lower spatial frequencies, may account for this effect and argued for constant underlying detector bandwidths across spatial frequency of around 20 degs. However, Cass, et al.3 only measured response tuning, providing no model of the detector tuning and so estimated bandwidths are likely to be broader due to stimulus-based effects.

The use of very low contrasts in the subthreshold summation method allows us to measure the orientation tuning of the neural detectors more directly by avoiding the suppressive effect of contrast normalization inherent in the use of high contrast masking or adaptation methods. Contrast normalization, revealed as the perceptual effect called “cross orientation masking”, elevates thresholds across all orientations, broadening the orientation tuning of the masking3,17. The effect of cross orientation masking is stronger in colour than luminance vision12,51 and acts to obscure the orientation tuning of the underlying colour detectors in masking experiments; for example, Medina and Mullen12 found no orientation tuning in the chromatic response functions using masking. For both chromatic and achromatic contrast, our bandwidth estimates at 1.5 c/deg (16 and 12 degs, for colour and luminance respectively) are narrower than those found previously from masking experiments, which are in the range of 26–30 degs for colour and 22–24 degs for luminance contrast at a mid-spatial frequency5,13,14. This difference may be accounted for by the isotropic effects of contrast normalization in masking, as well as by other differences in the stimuli or models used.

Our results reveal the presence of non-oriented detectors sensitive to low spatial frequencies in human colour vision for monocular viewing conditions. They support our earlier report based on measuring subthreshold summation between pairs of orthogonal gratings18. While the previous study suggested the presence of isotropic detectors at low spatial frequencies switching to tuned detectors at higher spatial frequencies, the experiments described here yield the complete tuning functions and provide models for orientation tuned versus isotropic detection that allow bandwidth estimates for each condition.

The isotropic detectors that we find are likely to reflect the activity of non-oriented, circularly symmetric chromatic neurons in primate cortical areas V1 and V2. In colour vision, the most commonly reported non-oriented cortical neurons are type 2, which have circularly symmetric receptive fields with single cone opponency. These neurons are spatially low pass for colour contrast, responding best to low spatial frequencies and are thought to form around 10% of the population in primate V145,47,52. They have been called “area responsive” because they will produce strong responses to the interior regions of coloured objects or figures and are not edge sensitive. Hence, they can potentially transmit crucial information about the colour of a surface. Circularly symmetric dual opponent neurons have also been reported in V146. Such cells are unselective for orientation, but have spatial selectivity and have been typically investigated using “blobs” of colour on a contrasting background. These neurons, if tuned to a low spatial frequency, could also underlie the isotropic response to our low spatial frequency chromatic stimuli and provide information about larger regions of colour against a background. The presence of isotropic dual opponent neurons is controversial, however45 and it has been suggested that they may in fact be the type 2 neurons above. Most dual opponent neurons are orientation tuned and edge-responsive and form around 30% of the population in V1, making them ideal candidates for underlying the orientation-tuned chromatic response that determines threshold at the mid-spatial frequency45,47,49,53. Although it is unknown at what stage psychophysical threshold is determined, the colour selective non-oriented and orientation-tuned neural populations in V1 have sufficient sensitivity to be plausible candidates for psychophysical threshold54. In conclusion, we suggest that the psychophysical, non-oriented chromatic detector that we find is likely to be involved in providing information about the colour of a larger area or surface, whereas the orientation-tuned detectors, which are selective for colour edges and contours, are likely to be involved in delineating shape or form. In further support of this distinction of roles, evidence is emerging for distinct visual areas found in extra striate (occipitotemporal) cortex that respond differentially to the shape of an object as opposed to its surface properties (colour and texture)55, with colour vision providing input into both these types of function.

Methods

Apparatus

Stimuli were generated using a ViSaGe video-graphics card (Cambridge Research Systems, Kent, UK) with 14-bit contrast resolution, programmed in Matlab and were displayed on a CRT colour monitor (Iiyama Vision Master Pro 513, Iiyama Corporation) with a resolution of 1024 × 768 and a frame refresh rate of 120 Hz. The monitor was gamma corrected using an OptiCal photometer (CRS) and the VSG Desktop software. The spectral radiances of the red, green and blue phosphors of the monitor were calibrated using a PR-645 Spectrascan spectroradiometer (Photo Research Inc, Chatsworth, CA, USA). The CIE 1931 x-y chromaticity coordinates for the red, green and blue phosphors outputs were (x = 0.624, y = 0.335), (x = 0.293, y = 0.608) and (x = 0.147, y = 0.073), respectively. The background was achromatic with a mean luminance of 43 cd/m2 at the screen centre. All stimuli were viewed through a mirror stereoscope in a dimly lit room with an optical distance of 58 cm.

Observers

Six observers participated in the study, the two authors (MG and KTM) and four naïve subjects with previous training in psychophysics (AB, DK, SK and RSE). Each subject completed at least two of the four stimulus conditions, colour and achromatic contrast at one of the two spatial frequencies. All subjects had normal or corrected-to-normal visual acuity and normal colour vision, assessed with the Farnsworth-Munsell 100 Hue test. The experiments were performed in accordance with the Declaration of Helsinki and approved by the institutional ethics committee of McGill University Health Centre. An information consent form was signed by each of the subjects.

Stimuli

Stimuli were achromatic or isoluminant red-green sine-wave gratings (phase = 0) presented alone or as pairs of gratings of the same spatial frequency overlaid to form a plaid, as illustrated in Figure 1. Stimuli were sinewave gratings of 0.375 or 1.5 c/deg and were displayed in a circular patch (10 degs in diameter) with edges contrast enveloped with a spatial raised cosine (2.5 degs). Stimuli were static and presented in a temporal Gaussian envelop with a sigma of 125 ms. The two component gratings were generated independently and were combined in alternative monitor frames at 120 Hz using frame interleaving with a frame-cycling option in software. In the experiments, the orientation difference (θ) between the grating pairs forming the plaid was varied between 0 and 90 degs, as illustrated in Figure 1. Component gratings were positioned ±θ/2 relative to the vertical axis to avoid any potential oblique effect56. Stimuli were viewed monocularly with right eye and the left eye viewed a uniform field of the same mean luminance.

Colour space

Stimuli are represented in a three-dimensional cone contrast colour space57,58 in which each axis is defined by the incremental stimulus intensity for each cone type to a given stimulus normalized by the respective intensity of the fixed adapting white background. Cone excitations for the L-, M- and S-cones were calculated using the Smith and Pokorny cone fundamentals. A linear transform was calculated to specify the required phosphor contrasts of the monitor for given cone contrasts.

Stimulus contrast is defined as the vector length in cone contrast units (CC):

where Lc, Mc and Sc represent the L, M and S cone-contrast for the L, M and S cones. This metric differs by a factor of √3 from the Michelson luminance contrast. Chromatic stimuli were isoluminant and activated the L/M cone opponent pathway. The stimulus vector direction was (L-αM), where α is a numerical constant obtained at isoluminance. Achromatic stimuli had the vector direction (L + M + S). For each subject, the isoluminant point was determined using a minimum motion experiment as previously described18. The subject viewed a drifting Gabor whose spatial frequency matched that used in the main experiment and a method of adjustment was used to determine the L:M cone ratio (value of α, above) at which the subject perceived motion to be at a minimum. This was repeated 10–20 times for each eye and binocularly and the average was taken as isoluminance. We did not find a difference between the two eyes in any subject. The α value at isoluminance was: 4, 4, 11.4, 2.1, 1 and 3.4 for subjects DK, AB, MG, RSE, SK and KTM respectively.

Protocols

For each summation experiment, detection thresholds were obtained for a left oblique grating, a right oblique grating and the plaid composed of the superimposition of the two gratings. We used 9 orientation differences for the pairs of component gratings and their corresponding plaids (±θ/2) (see Stimuli): 0, ±5, ±8, ±11.25, ±16, ±22.5, ±30, ±35.5 and ±45 degs. We measured detection thresholds using a temporal two-alternative-forced choice (2AFC) method of constant stimuli, in which one interval contained the stimulus and the other was blank (mean luminance). Each stimulus interval was 500 ms in duration, cued with a beep, with an inter-stimulus interval of 400 ms. Subjects had to choose in which interval, first or second, they detected the stimulus. Auditory feedback was provided. Detection thresholds were derived from fits of the psychometric functions plotting per cent correct for stimulus detection as a function of cone contrast (achromatic or colour), based on measurements for at least six different contrasts with 90 to 120 trials per contrast. Data collection was made over several sittings with presentations of the two gratings and their combination ordered in a balanced block design. Psychometric functions were fitted with a standard Weibull function with the chance level set to 0.5, the lapse rate constrained to 0.02 and the parameters for contrast detection threshold corresponding to 81.6% correct detection and the slope of the function were extracted. Fitting was done using the psignifit toolbox for Matlab59. The variability of threshold and slope parameter estimates was found using the bootstrap method based on 4,999 simulations60.

We did not find systematic differences between the detection thresholds for the component gratings of different orientations. Typically, we combined the psychometric data for all component-grating orientations, yielding 110–480 trials per contrast level of the psychometric function. In one subject (SK), however, full psychometric functions were collected on each component grating for the colour condition and data was only combined across the two components, yielding at least 80 trials per contrast level. In another subject (KTM), we pooled the grating data for grating pairs of similar orientations so increasing the number of trials per threshold estimate (at least 140 trials per contrast level). Psychometric functions for the plaids were based on 90 to 100 trials per contrast level, yielding a total of 540–600 trials per psychometric function.

Analysis

To quantify the amount of visual response summation between the two combined component gratings we calculated a summation ratio, which is defined as the ratio of the contrast at detection threshold for a component grating presented alone (ThreshGrat) to its contrast in the plaid when plaid is at detection threshold (ThreshPlaid):

Both component gratings composing a plaid have equal contrast. An SR of 2 shows a linear summation between responses to two component gratings and is expected for the case of monocular plaid presentation, where the two component gratings interleaving in successive frames are co-oriented. A ratio of near 1 indicates the absence of neural summation and is found, for instance, for a luminance orthogonal plaid at a high spatial frequency18. In this case, while physical input to the visual system has been increased by the combination of two gratings into a plaid compared to a single grating, visual sensitivity does not improve. This indicates involvement of independent orientation tuned mechanisms in the detection of the compound stimulus21. Notwithstanding, a small improvement is still expected even for orthogonal stimuli and observed because of higher probability of detection in presence of two stimuli compared with one (classically referred to as probability summation). A summation ratio between 1 and 2 signifies some sort of neural summation that can be assessed by employment of a model.

Orientation tuning curves

By measuring summation ratio as a function of plaid angle from 0 to 90 degs we reveal the orientation tuning of the response. These response curves are measured for four different stimulus conditions: colour and achromatic contrast at low (0.375 c/deg) and mid (1.5 c/deg) spatial frequencies presented monocularly. The bandwidths of the response tuning are found by fitting a von Mises function to the orientation tuning curves (see Results). To find estimates of the bandwidths of the underlying neural detectors we employ a modeling approach, as described in the Results.