Variability in neuronal response latency has been typically considered caused by random noise. Previous studies of single cells and large neuronal populations have shown that the temporal variability tends to increase along the visual pathway. Inspired by these previous studies, we hypothesized that functional areas at later stages in the visual pathway of face processing would have larger variability in the response latency. To test this hypothesis, we used magnetoencephalographic data collected when subjects were presented with images of human faces. Faces are known to elicit a sequence of activity from the primary visual cortex to the fusiform gyrus. Our results revealed that the fusiform gyrus showed larger variability in the response latency compared to the calcarine fissure. Dynamic and spectral analyses of the latency variability indicated that the response latency in the fusiform gyrus was more variable than in the calcarine fissure between 70 ms and 200 ms after the stimulus onset and between 4 Hz and 40 Hz, respectively. The sequential processing of face information from the calcarine sulcus to the fusiform sulcus was more reliably detected based on sizes of the response variability than instants of the maximal response peaks. With two areas in the ventral visual pathway, we show that the variability in response latency across brain areas can be used to infer the sequence of cortical activity.
Neural variability in response latency is usually considered to be caused by random noise. Based on the assumption that the measurement at each trial consists of an invariant response waveform and uncorrelated noise1, evoked responses are typically obtained via averaging across trials. The process of averaging effectively suppresses noise and increases the signal-to-noise ratio of the measurements. Because of averaging, response variability is often ignored.
The sequence of brain activity across areas has been typically inferred by the relative size of a specific feature of the evoked responses. Temporal features, such as onset or time-to-peak, have been widely used to estimate the progression of cortical activity. With magnetoencephalographic (MEG) measures, peak latencies from different cortical areas have been used to infer the sequence of activity. This strategy has been used to disclose sequential brain activity in studies of face perception2, reading3,4, online imitation of lip movements5,6, and somatosensory processing7,8. Peak latencies are typically defined as the interval between the stimulus onset and the time at which responses reach the maximal amplitude. However, measurements of peak latencies can be unstable and prone to noise contamination. Specifically, researchers have to decide to measure the latency of either positive or negative peaks, which may have similar magnitudes. Moreover, cortical responses may not always show clearly defined peaks. For example, multiple peaks are commonly present in the time courses in a given cortical area.
It has been shown that neural responses differ across trials when the identical stimulus is presented repeatedly: at the neuronal level, single-digit unit recordings suggested that neuronal responses to the identical visual stimuli varied considerably across trials in cat and monkey visual cortex9,10,11. It has also been reported that along the mammalian visual pathway, higher-order neurons, or neurons further away from sensory inputs, usually showed greater response variability in terms of spike timing12,13. The variability of neuronal responses could be due to noise from sensory or cellular levels, such as synaptic responses10,14. In addition, small amount of latency variability could accumulate as neural activity propagates or as information flows from lower to higher-level areas12,13,14. When large neuronal populations were recorded by non-invasive technique in healthy human brains, similar patterns of variability were found. For example, with a visuomotor task, Tang et al.15 found that the early MEG responses in the primary visual cortex showed smaller latency variability than the late responses.
Based on micro- and macro-scale neuronal findings in the visual cortex, we hypothesized that functional areas at later processing stages would have larger variability in response latency compared to areas at earlier processing stages. To test this hypothesis of increasing latency variability along the functional pathway, we examined if the size of regional latency variability would match the known sequence of cortical processing. Here, we used the ventral pathway of face processing to test this hypothesis.
Face-selective event-related fields or potentials measured by MEG or EEG (M170 or N170) have been reported previously2,16,17. The M170 or N170 response has been consistently localized to the fusiform gyrus17,18,19,20. Moreover, MEG studies have reported sequential activity from the calcarine fissure to the fusiform gyrus. Specifically, face-related cortical activity was found in the calcarine fissure with peak latencies around 110 ms followed by activity in the fusiform gyrus with peak latencies around 170 ms using either dipole models17 or distributed source estimates21. Taken together, the consensus is that the primary visual cortex functions as the initial stage to process general visual information, followed by the fusiform gyrus, which is specifically responsive to face detection and categorization2,22.
Based on the previous findings of sequential visual processing from the primary visual cortex to the fusiform gyrus, the goal of our present study was to characterize the variability of response latency during face processing. We specifically examined the variability of response latency in the primary visual area (the calcarine fissure) and the fusiform gyrus. Neural processing in these two areas was taken to reflect the sequence from lower-order primary sensory cortices to higher-order association areas. Using MEG measurements and based on our hypothesis, we expected that the variability of response latency in the calcarine fissure would be smaller than that in the fusiform gyrus. If true, the relative positions of cortical areas in the functional ventral pathway of face processing could be inferred by the size of response latency variability. Furthermore, we estimated the sequence of functional processing across brain areas in both the time and frequency domains by using dynamic and spectral decompositions of response latency variability. Dynamic estimates of response latency variability can reveal specific intervals of clear sequential processing, while spectral decomposition of response latency variability may disclose higher functional specificity as the neural activity at distinct oscillatory frequencies is related to different functions.
Evoked responses to images of faces were observed in the calcarine fissure and fusiform gyrus in both the left and right hemispheres. Based on anatomically defined regions of interest (Fig. 1), the group average time courses showed two response peaks in the left calcarine fissure within the first 200 ms after the stimulus onset: the first peak at 121.8 ms and the second peak at 168.7 ms. The left fusiform gyrus showed a response peak at 169.7 ms. In the right hemisphere, two prominent response peaks were found at 121.8 ms and 167.7 ms in the calcarine fissure and one peak at 164.7 ms in the fusiform gyrus. Because no significant differences were observed between the time courses in the left and right calcarine fissure and we assumed no functional differences between the left and right calcarine fissures, time courses from the left and right calcarine fissures were combined for further analyses. Time courses combing the left and right calcarine fissure showed two peaks within the first 200 ms after the stimulus onset: at 121.8 ms and 168.7 ms.
To compare peak latencies in the fusiform gyrus and the calcarine fissure, Wilcoxon signed rank test was used. By identifying the maximum peak latency within the pre-defined time window (70 ms–200 ms) from the average waveform in each cortical area from each subject separately (Fig. 2), we found that the peak latency in the calcarine fissure (150.0 ms+/−33.4 ms) was significantly earlier compared to that in the right fusiform gyrus (168.8 ms+/−22.0 ms; Z = 2.262, p = 0.024). No significant difference was found between the peak latency in the calcarine fissure and the left fusiform gyrus (161.0 ms+/−28.4 ms; Z = 1.083, p = 0.279). As shown in Fig. 2, nine out of 14 subjects showed earlier maximum peak latencies in the calcarine fissure than in the left fusiform gyrus. Ten out of 14 subjects showed earlier maximum peak latencies in the calcarine fissure than in the right fusiform gyrus. With regard to the response amplitude, the peak amplitude found in the calcarine fissure (5.4+/−2.4) was significantly stronger compared to that in the left fusiform gyrus (3.6+/−1.2; Z = −2.982, p = 0.003), but not the right fusiform gyrus (4.5+/−1.4; Z = −1.538, p = 0.124).
The left panel in Fig. 3 presents average waveforms from three representative subjects with the maximum peaks in the calcarine sulcus before 130 ms. Time courses from three other subjects are displayed in the right panel of Fig. 3, showing the maximum peaks in the calcarine sulcus after 130 ms. In the calcarine fissure, subjects with the maximum peak latency later than 130 ms contributed to the more delayed responses in our study than the responses reported in previous studies2,21,23. These results raised concerns about inferring the sequence of brain activity by the maximum peak latencies in related areas, which can vary across subjects considerably.
Static estimates of latency variability
Noise levels during the pre-stimulus baseline period were significantly larger in the left fusiform area compared to the calcarine fissure (Z = 3.045, p = 0.002). Noise levels were also significantly higher in the right fusiform gyrus than the calcarine fissure (Z = 3.233, p = 0.001) (Fig. 4, left). The difference in noise levels across regions could become a potential confounding factor in estimating latency variability. Thus, we added noise to equalize noise levels across areas (see Methods for details). As a result, the noise levels did not differ significantly between the left fusiform gyrus and the calcarine fissure (Z = −1.726, p = 0.084), nor between the right fusiform gyrus and the calcarine fissure (Z = −1.350, p = 0.177) (Fig. 4, right).
Because the signal-to-noise ratio of a single epoch is usually low, we used bootstrap samples to obtain a more robust estimate of relative latency. To ensure that our boostrap samples have a sufficient signal to noise ratio, we plotted evoked responses from each bootstrap sample in each region of interest. Figure 5 provides representative data from two individual subjects. A consistent pattern of stronger activity was observed between 100 and 200 ms after the stimulus onset across bootstrap samples.
For the static estimates of latency variability, we calculated the standard deviation of relative latencies across bootstrap samples. This latency variability was estimated for each subject and each area separately. We found that when face images were presented, there was a trend of larger latency variability in the left fusiform gyrus (31.8 ms+/−16.1 ms) than the calcarine fissure (28.0 ms+/−19.1 ms; Z = 1.790, p = 0.074). Greater latency variability in the left fusiform gyrus compared to the calcarine fissure was found in 11 out of 14 subjects (Fig. 6). In addition, we found 8 out of 14 subjects showed larger latency variability in the right fusiform gyrus than in the calcarine fissure (Fig. 6). The latency variability of the right fusiform gyrus (30.5 ms+/−20.9 ms) was not significantly different from that of the calcarine fissure (28.0 ms+/−19.1 ms; Z = 0.471, p = 0.638) across subjects.
To test if the regional latency variability can be inferred from the maximum peak latency in regional responses, we correlated between these two quantities. No significant correlation was found in either the calcarine fissure, left fusiform gyrus, or right fusiform gyrus (Fig. 7). This result indicates that the latency variability in regional responses carries independent physiological information other than the maximum peak latency.
Dynamic estimates of latency variability
The latency variability was also estimated dynamically using a moving window approach. Dynamic estimates of latency variability are displayed in Fig. 8. Statistical comparisons were performed in the latency range between 70 ms and 200 ms. Significantly larger response latency variability was observed in the left fusiform gyrus than the calcarine fissure between 70 ms and 130 ms as well as between 170 ms and 180 ms (FDR p < 0.05). The right fusiform gyrus showed significantly larger response latency variability than the calcarine fissure between 70 ms and 200 ms (FDR p < 0.05).
To evaluate the sensitivity of different measures, we compared the number of subjects showing consistent patterns across regions of interest in term of the maximum peak latency and latency variability (Table 1). An earlier peak latency in the calcarine fissure than the fusiform gyrus would suggest a sequence of activity from the calcarine fissure to the fusiform gyrus. Smaller latency variability in the calcarine fissure compared to the fusiform gyrus would also suggest a sequence of activity from the calcarine fissure to the fusiform gyrus. However, differences between the tradition measures of maximum peak latency and the static measures of latency variability did not reach statistical significance (calcarine < left fusiform, Z = 0.837, p = 0.403; calcarine < right fusiform, Z = −0.789, p = 0.430). Differences between the traditional measures and the dynamic measures were also not statistically significant (calcarine < left fusiform, Z = 1.309, p = 0.190; calcarine < right fusiform, Z = 1.480, p = 0.139).
Percentages of subjects showing the above sequence of activity were also calculated. Averaging across the left and right fusiform gyrus, we found 68% of the tested subjects with earlier peak latencies in the calcarine fissure than the fusiform gyrus using the traditional measures of peak latency. Using the static estimates of latency variability, the percentage of subjects exhibiting smaller variability in the calcarine fissure compared to the fusiform gyrus was also 68%. Using the dynamic estimates of latency variability, 89% of the subjects showed smaller variability in the calcarine fissure compared to the fusiform gyrus. Overall, the dynamic estimates of latency variability showed a non-significant trend of higher sensitivity in detecting the specified sequence of activity, compared to the traditional measures of peak latency activity (Z = 1.382, p = 0.167).
Spectral estimates of latency variability
In the frequency domain, latency variability was estimated using spectrally decomposed time courses24. Estimated latency variability in the frequency domain is plotted in Fig. 9. Significantly larger variability was found in the left fusiform gyrus than the calcarine fissure at 4 to 40 Hz (FDR p < 0.05). The right fusiform gyrus showed significantly greater variability compared to the calcarine fissure at 4 to 40 Hz (FDR p < 0.05). Taken together, we found that most significant differences in the response variability were spectrally distributed between 4 and 40 Hz.
Our MEG results show that latency variability changes across cortical regions along the ventral pathway of face processing. The differential variability among regions was observed in both the temporal and spectral domains. Specifically, we found (1) significantly larger latency variability in the fusiform gyrus than the calcarine fissure between 70 to 130 ms and 170 to 180 ms in both hemispheres (Fig. 8). (2) In the frequency domain, significantly greater variability was observed in the fusiform gyrus than the calcarine fissure mostly between 4 and 40 Hz in both hemispheres (Fig. 9). In short, differences in the variability between functional areas studied here were found to reflect their relative positions in the ventral visual pathway: the earlier the processing stage, the smaller the response latency variability.
In our study, ROIs included the primary visual cortex as the region responsible for early visual processing and the fusiform gyrus as the region responsible for later face detection and identification. From the static estimates of latency variability, we found the fusiform gyrus, an area responsive to faces, showed a trend of greater latency variability than the calcarine fissure (Fig. 6).
While the static estimates of variability provide an overall measure of latency variability, the dynamic and spectral estimates of variability provide more temporally and spectrally specific analysis on latency variability by decomposing time series data into finer intervals and frequency bands. From the estimated latency variability in the temporal domain (Fig. 8), we found larger latency variability in the fusiform gyrus compared to the calcarine fissure. The increased latency variability from the calcarine fissure to the fusiform gyrus seems to reflect the temporal sequence of activity from the calcarine fissure to the fusiform gyrus as identified in previous studies2,17,21. These previous studies identified the maximum amplitudes from the sensor data or source data estimated using dipole models or distributed source estimates. Therefore, results from the present study support our postulate that the variability in regional response latency can serve to identify the sequence of activity.
Specifically, in the dynamic estimates, significantly larger variability was found in the fusiform gyrus compared to the calcarine fissure in the time window between 70 and 130 ms and 170 to 180 ms after the stimulus onset in both hemispheres. This latency range appears to be earlier than the typically reported face-selective evoked response around 170 ms17,21. Nevertheless, several studies have also reported face-selective responses start as early as 100 ms2,25,26,27,28. Taken together, these findings suggest that differences in variability in this early latency range can be associated with extracting stimulus information required to categorize faces or non-faces.
During the latency range between 70–200 ms, latency variability was relatively small compared to latency variability observed later than 200 ms. This pattern of response was observed in both the calcarine fissure and fusiform gyrus. Previous EEG studies have related increased signal variability to greater information transmission29,30. Hence, relatively large variability observed after 200 ms in our present study could indicate multiple sources of information exchange (e.g., feedforward and feeback signals), whereas small latency variability could be related to a simpler pattern of information inputs (e.g., feedforward signals only). When comparing latency variability across regions, significantly larger latency variability was found in the fusiform gyrus than the calcarine fissure in the latency range between 70–200 ms, but not after 200 ms. This result could reflect feedforward inputs or bottom-up effects from the calcarine fissure to the fusiform gyrus before 200 ms. However, in latency ranges later than 200 ms, both the feedforward and feedback signals may work at the same time. We speculate that such bidirectional information flows between the calcarine fissure and the fusiform gyrus could account for our finding that there were no dominant directions of influence and no significant differences in latency variability between the calcarine and fusiform gyri.
Our results also indicate that dynamic measures of latency variability could be more sensitive than static measures of latency variability (Table 1). By calculating the percentage of the tested subjects showing larger latency variability in the fusiform gyrus than the calcarine fissure, we found an improvement of about 20% using dynamic estimates compared to using static estimates of latency variability. Dynamic estimates of latency variability showed fluctuations of variability in the temporal domain. Because the feedforward signals seem transient, dynamic estimates using moving time windows could better characterize these transient changes and disclose sequential activity across cortical areas.
In the spectral domain, significantly different variability was found between 4 and 40 Hz (Fig. 9). Specifically, the latency variability was greater in the fusiform gyrus than the calcarine fissure across the theta, alpha, beta, and gamma bands. When comparing faces to non-face objects, previous ERP studies reported responses to faces in the theta, alpha, and gamma bands31,32 over posterior electrode positions. Using intracranial EEG, responses to faces were found in a wider frequency range, including the theta, alpha, beta, and gamma bands33. With MEG measures, significantly stronger responses in the right occipitotemporal area were reported between 4 and 25 Hz when comparing responses to faces and inverted faces34. These findings suggest that oscillatory activity across frequency bands could be associated with different aspects of face processing, such as information integration, face detection, or face identification. Taken together, the latency variability in frequency bands that have been associated with face processing is greater in the fusiform gyrus in both hemispheres than the calcarine fissure.
The latency variability observed here can be attributed to intrinsic variability of neural processing, such as synaptic transmission or neurotransmitter functions10,14. The intra-subject variability examined in the present study also suggests the possibility of intervening processes in local processing. These intervening processes could include fluctuations in top-down processes of attention or degrees of task engagement. To further validate our hypothesis of increasing latency variability as information flows across cortical regions, future studies should be conducted to test whether latency shifts remain constant along different functional pathways, or increase as the cortical area gets more distant from primary sensory processing.
The results from our present study indicate that there are no significant correlations between the static estimates of latency variability and the traditional measures of maximum peak latency (Fig. 7). In addition, the number of subjects showing larger peak latency or latency variability in the fusiform gyrus than the calcarine fissure tends to increase from the traditional measure to the dynamic measures (Table 1). The dynamic measures of latency variability are more sensitive to detect the sequence of activity than peak latencies defined within pre-determined time windows, suggesting that our method is more generalized in analyzing sequential processing in other experiments involving areas subserving different functions.
Our results also reveal that peak latency might not be the optimal index to estimate the order of regional activity, particularly when the variability of regional response latency is large, as exemplified by our analysis (Fig. 3). Averaging regional response waveforms smears out peaks and reduces the peak amplitude. Large regional response variability may thus result in no explicit response peaks in the average waveform. Moreover, when prior knowledge of response time courses is not well known, the dynamic and spectral estimates of latency variability without specifying a time window can provide more sensitive measures than using peak latencies within pre-defined time windows as we demonstrated here (Table 1). When response time courses are unknown, selecting a wider time window can be the first analysis approach. However, by using a wide latency window, peak latencies may be shifted to later latencies when multiple peaks are present in the time course as shown in our data. For estimating the processing pipeline in the brain, methods other than finding peak latencies, such as finding relative latencies via the cross-correlation method implemented in our study, may be more robust.
One approach of detecting regional information flow is Granger causality35,36,37. However, Granger causality assumes the time series should be stationary. This assumption is seriously challenged in evoked brain responses. For example, our dynamic estimates of variability indicate latency variability does not remain constant over time. It should be noted that while our approach does not require the stationary assumption and can be used to estimate the relative positions of regional processing in a functional pathway, we did not infer any causal influence across brain regions.
In our analysis, the cross-correlation procedure requires a template waveform. Here, we used the averaged waveform across trials as the template. Because the template is defined for each subject and each ROI separately, variations between individuals do not exert influence in our analysis. However, variations within subjects or across trials still remains. Furthermore, our bootstrap samples were not single-trial responses, but averaged across a subsample of trials. The number of trials chosen to generate one bootstrap sample can affect our estimates of variability and the signal-to-noise ratio. We found that the sensitivity of detecting the specified sequence of activity across subjects was stable when using between 20 to 40 trials to form one bootstrap sample (see Supplementary Information for details). Cautions should be taken to determine the number of trials in generating the bootstrap samples in order to minimize the noise fluctuation and to detect the response variability at the same time.
Using images of human faces to elicit neuromagnetic responses in brain areas subserving visual processing, we found that the order of brain areas demonstrating progressively larger response latency variability matched the expected sequence of information transfer across brain areas. Our present study suggests that regional response latency variability can provide meaningful information about the underlying sequence of neural processes. Differential variability between cortical areas can reflect their relative positions in a functional pathway. The close correspondence between temporal variability and the sequence of cortical activity in other functional processing pathways should be further tested with other complex stimuli and tasks.
Materials and Methods
Fourteen subjects (mean age: 27.9+/−6.2 years; 8 females and 6 males) participated in this study. All subjects provided written inform consent prior to participation and the experimental protocols were approved by the Institute of Review Board at the National Taiwan University Hospital. All experiments were carried out in accordance with the approved guidelines. All subjects had normal or corrected to normal vision.
Stimuli and Tasks
Visual stimuli of neutral faces from eight individuals38,39 and the phase-scrambled versions of the same stimuli were presented randomly to subjects. Images of faces or scrambled faces were presented for 500 ms in duration with a random inter-stimulus interval ranging between 1800 and 2200 ms. Stimuli were presented using Presentation (Version 18.0, Neurobehavioral Systems, Inc.). To ensure that subjects were attending to the visual stimuli, subjects were instructed to press a button when two sequential images were identical. Four runs (about 9 minutes per run) of data were collected from each subject. About 200 trials of faces and scrambled faces were shown to each subject in each run.
MEG data acquisition and preprocessing
MEG data were recorded using a 306-channel whole-head MEG system (VectorView, Elekta-NeuroMag Oy, Helsinki, Finland). MEG signals were band-pass filtered between 0.03–200 Hz and digitized at 1000 Hz. Temporal signal space separation40,41 was not used. Epochs with EOG exceeding 150 uV and MEG exceeding 3 pT/cm (gradiometers) and 4 pT (magnetometers) were removed from offline averages to suppress artifacts related to eye movements and system instability.
MRI data acquisition and processing
Anatomical data were measured on a 3 T MR scanner (Tim Trio, Siemens, Erlangen, Germany) with a 32-channel head coil array. Three-dimensional T1-weighted structural images were collected (MP-RAGE sequence: TR = 2530 ms, TE = 3.03 ms, TI = 1100 ms, flip angle = 7°, slice thickness = 1 mm, image matrix = 256 × 224, voxel size = 1 × 1 × 1 mm3). For visualization and localizing anatomical landmarks, cortical surfaces were reconstructed using FreeSurfer (version 5.1.0; http://surfer.nmr.mgh.harvard.edu).
MEG source analysis
MEG sources were estimated using anatomically constrained minimum-norm estimates (MNE)42,43 on the cortical surface. The MEG source spaces had about 7,000 vertices on each hemisphere. The forward models were calculated using Boundary Element Method (BEM) based on T1-weighted MRI data44. MEG noise covariance matrices were calculated using the 200 ms baseline prior to the stimulus presentation. Noise-normalized MNE, also called dynamic statistical parametric maps (dSPM)42, was calculated at each time point to map the extracranial MEG data onto cortical surfaces. Before averaging across subjects, source estimates from each individual subject were morphed to an average brain provided by FreeSurfer (subject “fsaverage”)45,46.
Four regions of interest (ROIs) were defined by anatomical labels47. These four ROIs were the calcarine fissure and fusiform gyrus in the left and right hemispheres. In this study, the calcarine fissure is taken as the low-order visual area, processing sensory information before the information is sent to higher-level areas. The fusiform gyrus, part of the core system of face processing, is taken as the high-order visual areas for detecting and identifying faces22,48.
For comparisons, we also followed the conventional method of using peak latencies across regions to estimate the sequence of activity. In previous MEG studies when conditions of face and non-face were compared17,21,23, the maximum peak latency was defined as the time when the average response amplitude reached its maximum within the time intervals of interest. For example, the time intervals of interest were defined between +/−30 ms of the M100 responses in the calcarine fissure and M170 responses in the fusiform gyrus; that is, between 70 ms and 130 ms for the calcarine fissure and between 140 ms and 200 ms for the fusiform gyrus. However, these definitions already assumed an earlier response window for activity in the calcarine fissure compared to the fusiform gyrus. Defining the time windows of interest differently for the two regions of interest would defeat our purpose of identifying sequences of activity across regions. Thus, we used the same time window of interest for both ROIs. Our time window of interest was defined between 70 ms and 200 ms, wide enough to cover the activity in both the calcarine fissure and the fusiform gyrus.
Static estimates of latency variability
To estimate relative latency, we first calculated template waveforms separately for each subject in each ROI (the calcarine fissure and fusiform gyrus bilaterally). The template waveform was obtained by averaging dSPM source waveforms across trials and across vertices within a given ROI. To cope with the low signal-to-noise ratio from single-epoch waveform, we estimated evoked responses using a bootstrap approach. Specifically, we obtained one bootstrap sample by randomly selecting 30 single-trial source estimates (approximately 15% of the trials) and averaging over these trials. The bootstrap procedure was repeated 100 times for each condition, ROI, and subject. These responses were then low-pass filtered at 40 Hz.
We also compared noise levels across ROIs. Noise levels were estimated by the standard deviation over the pre-stimulus or baseline period (−200 to 0 ms) and averaged across bootstrap samples and across subjects. To avoid potential confounds caused by regional differences in noise levels, we equalized noise levels across regions by adding Gaussian random noise (mean = 0 and standard deviation = differences in noise levels between regions) to the bootstrap samples across time.
Then, we calculated cross-correlations between the template waveform and each bootstrap sample in each ROI. The template waveform was temporally shifted (−200 and +200 ms in steps of 10 ms). The shift corresponding to the maximum correlation coefficient was defined as the relative latency. With 100 relative latencies across bootstrap samples, we calculated intra-subject latency variability or the standard deviation of relative latencies across bootstrap samples. Intra-subject latency variability in the calcarine fissure and fusiform gyrus was compared using Wilcoxon signed-rank test.
To further understand the relationship between the static estimates of latency variability and the traditional measures of peak latency, Pearson’s correlation coefficients were calculated between the static estimates of latency variability and the peak latency obtained using the traditional method as described in the previous section.
Dynamic estimates of latency variability
Dynamic latency variability was calculated by first convolving the template waveform and each bootstrap sample with a Gaussian window with standard deviation of 30 ms. The window center was temporally shifted across the entire epoch, from 100 ms before the stimulus onset to 1000 ms after the stimulus onset in increments of 10 ms. Relative latencies were calculated using the same cross-correlation method as described above. Intra-subject latency variability was obtained by calculating the standard deviation of relative latencies across bootstrap samples within each time window. Within the pre-defined time intervals from 70 to 200 ms, the Wilcoxon signed-rank test was then used to compare intra-subject latency variability in the calcarine fissure and fusiform gyrus in each time window. Multiple tests over latency windows were controlled by the false discovery rate (FDR)49.
To further compare the sensitivity of different measures (i.e., the traditional measures of peak latency, static estimates, and dynamic estimates of latency variability), we counted the number of subjects showing a sequence of activity from the calcarine fissure to the fusiform gyrus from each measure. With the traditional measures of peak latency, a smaller peak latency in the calcarine fissure compared to the fusiform gyrus would suggest a sequence of activity from the calcarine fissure to the fusiform gyrus. With the static and dynamic estimates of latency variability, greater latency variability in the fusiform gyrus compared to the calcarine fissure would indicate a sequence of activity from the calcarine fissure to the fusiform gyrus. For the dynamic estimates of latency variability, the number of subjects showing the above sequence of activity was averaged across time windows and then rounded up or down to the nearest integer. To test whether the proportions or numbers of subjects differ between measures, Z-test was used.
Spectral estimates of latency variability
In the frequency domain, latency variability was obtained by first calculating the Temporal Spectral Evolution (TSE)24 for the template and bootstrap samples. TSE was calculated by convolving time series data with a Morlet wavelet50 with width = 5. The center frequency of wavelet varied from 4 Hz to 40 Hz in steps of 1 Hz. The spectral power time series was then used for calculating relative latency and latency variability using the cross-correlation method described above. Intra-subject variability in latency between the calcarine fissure and the fusiform gyrus was compared using the Wilcoxon signed-rank test at each frequency. Multiple comparisons across frequencies were controlled by FDR49. All calculations were done using Matlab (Mathworks, Natick, MA, USA).
Dawson, G. D. A multiple scalp electrode for plotting evoked potentials. Electroencephalogr Clin Neurophysiol 6, 153–154 (1954).
Liu, J., Harris, A. & Kanwisher, N. Stages of processing in face perception: an MEG study. Nat Neurosci 5, 910–916 (2002).
Salmelin, R., Helenius, P. & Service, E. Neurophysiology of fluent and impaired reading: a magnetoencephalographic approach. J Clin Neurophysiol 17, 163–174 (2000).
Vartiainen, J., Liljestrom, M., Koskinen, M., Renvall, H. & Salmelin, R. Functional magnetic resonance imaging blood oxygenation level-dependent signal and magnetoencephalography evoked responses yield different neural functionality in reading. J Neurosci 31, 1048–1058 (2011).
Nishitani, N., Avikainen, S. & Hari, R. Abnormal imitation-related cortical activation sequences in Asperger’s syndrome. Ann Neurol 55, 558–562 (2004).
Nishitani, N. & Hari, R. Viewing lip forms: cortical dynamics. Neuron 36, 1211–1220 (2002).
Korvenoja, A. et al. Activation of multiple cortical areas in response to somatosensory stimulation: combined magnetoencephalographic and functional magnetic resonance imaging. Hum Brain Mapp 8, 13–27 (1999).
Ioannides, A. A. et al. Timing and connectivity in the human somatosensory cortex from single trial mass electrical activity. Hum Brain Mapp 15, 231–246 (2002).
Tomko, G. J. & Crapper, D. R. Neuronal variability: non-stationary responses to identical visual stimuli. Brain Res 79, 405–418 (1974).
McAdams, C. J. & Maunsell, J. H. Effects of attention on the reliability of individual neurons in monkey visual cortex. Neuron 23, 765–773 (1999).
Tolhurst, D. J., Movshon, J. A. & Dean, A. F. The statistical reliability of signals in single neurons in cat and monkey visual cortex. Vision Res 23, 775–785 (1983).
Bair, W. Spike timing in the mammalian visual system. Curr Opin Neurobiol 9, 447–453 (1999).
Tiesinga, P., Fellous, J. M. & Sejnowski, T. J. Regulation of spike timing in visual cortical circuits. Nat Rev Neurosci 9, 97–107 (2008).
Faisal, A. A., Selen, L. P. & Wolpert, D. M. Noise in the nervous system. Nat Rev Neurosci 9, 292–303 (2008).
Tang, A. C., Pearlmutter, B. A., Zibulevsky, M., Hely, T. A. & Weisend, M. P. An MEG study of response latency and variability in the human visual system during a visual-motor integration task. Advances in Neural Information Processing Systems., 185–191 (1999).
Bentin, S., Allison, T., Puce, A., Perez, E. & McCarthy, G. Electrophysiological Studies of Face Perception in Humans. J Cogn Neurosci 8, 551–565 (1996).
Halgren, E., Raij, T., Marinkovic, K., Jousmaki, V. & Hari, R. Cognitive response profile of the human fusiform face area as determined by MEG. Cereb Cortex 10, 69–81 (2000).
Deffke, I. et al. MEG/EEG sources of the 170-ms response to faces are co-localized in the fusiform gyrus. Neuroimage 35, 1495–1501 (2007).
Linkenkaer-Hansen, K. et al. Face-selective processing in human extrastriate cortex around 120 ms after stimulus onset revealed by magneto- and electroencephalography. Neurosci Lett 253, 147–150 (1998).
Watanabe, S., Miki, K. & Kakigi, R. Mechanisms of face perception in humans: a magneto- and electro-encephalographic study. Neuropathology 25, 8–20 (2005).
Marinkovic, K., Courtney, M. G., Witzel, T., Dale, A. M. & Halgren, E. Spatio-temporal dynamics and laterality effects of face inversion, feature presence and configuration, and face outline. Front Hum Neurosci 8, 868 (2014).
Grill-Spector, K., Knouf, N. & Kanwisher, N. The fusiform face area subserves face perception, not generic within-category identification. Nat Neurosci 7, 555–562 (2004).
Sams, M., Hietanen, J. K., Hari, R., Ilmoniemi, R. J. & Lounasmaa, O. V. Face-specific responses from the human inferior occipito-temporal cortex. Neuroscience 77, 49–55 (1997).
Salmelin, R. & Hari, R. Spatiotemporal characteristics of sensorimotor neuromagnetic rhythms related to thumb movement. Neuroscience 60, 537–550 (1994).
Dering, B., Martin, C. D., Moro, S., Pegna, A. J. & Thierry, G. Face-sensitive processes one hundred milliseconds after picture onset. Front Hum Neurosci 5, 93 (2011).
Herrmann, M. J., Ehlis, A. C., Ellgring, H. & Fallgatter, A. J. Early stages (P100) of face perception in humans as measured with event-related potentials (ERPs). J. Neural Transm 112, 1073–1081 (2005).
Itier, R. J. & Taylor, M. J. N170 or N1? Spatiotemporal differences between object and face processing using ERPs. Cereb Cortex 14, 132–142 (2004).
Colombatto, C. & McCarthy, G. The Effects of Face Inversion and Face Race on the P100 ERP. J Cogn Neurosci 29, 664–676 (2017).
Misic, B., Vakorin, V. A., Paus, T. & McIntosh, A. R. Functional embedding predicts the variability of neural activity. Front Syst Neurosci 5, 90 (2011).
Vakorin, V. A., Lippe, S. & McIntosh, A. R. Variability of brain signals processed locally transforms into higher connectivity with brain development. J Neurosci 31, 6405–6413 (2011).
Rousselet, G. A., Husk, J. S., Bennett, P. J. & Sekuler, A. B. Single-trial EEG dynamics of object and face visual processing. Neuroimage 36, 843–862 (2007).
Zion-Golumbic, E. & Bentin, S. Dissociated neural mechanisms for face detection and configural encoding: evidence from N170 and induced gamma-band oscillation effects. Cereb Cortex 17, 1741–1749 (2007).
Klopp, J., Halgren, E., Marinkovic, K. & Nenov, V. Face-selective spectral changes in the human fusiform gyrus. Clin Neurophysiol 110, 676–682 (1999).
Hsiao, F. J. et al. Oscillatory characteristics of face-evoked neuromagnetic responses. Int J Psychophysiol 61, 113–120 (2006).
Deshpande, G., Sathian, K. & Hu, X. Effect of hemodynamic variability on Granger causality analysis of fMRI. Neuroimage 52, 884–896 (2010).
Granger, C. W. J. Investigating causal relations by econometric models and cross-spectral methods. Econometrica 37, 424–438 (1969).
Roebroeck, A., Formisano, E. & Goebel, R. Mapping directed influence over the brain using Granger causality and fMRI. Neuroimage 25, 230–242 (2005).
Ekman, P. Are there basic emotions? Psychol Rev 99, 550–553 (1992).
Woods, D. L., Yund, E. W. & Kang, X. J. Unified functional/anatomical maps of human auditory cortex. Archives of Neurobehavioral Experiments and Stimuli, 46 (2003).
Taulu, S. & Simola, J. Spatiotemporal signal space separation method for rejecting nearby interference in MEG measurements. Phys Med Biol 51, 1759–1768 (2006).
Taulu, S. & Hari, R. Removal of magnetoencephalographic artifacts with temporal signal-space separation: demonstration with single-trial auditory-evoked responses. Hum Brain Mapp 30, 1524–1534 (2009).
Dale, A. M. et al. Dynamic statistical parametric mapping: combining fMRI and MEG for high-resolution imaging of cortical activity. Neuron 26, 55–67 (2000).
Dale, A. M. & Sereno, M. I. Improved localization of cortical activity by combining EEG and MEG with MRI cortical surface reconstruction: a linear approach. J Cogn Neurosci 5, 162–176 (1993).
Hämäläinen, M. S. & Sarvas, J. Realistic conductivity geometry model of the human head for interpretation of neuromagnetic data. IEEE Trans Biomed Eng 36, 165–171 (1989).
Dale, A. M., Fischl, B. & Sereno, M. I. Cortical surface-based analysis. I. Segmentation and surface reconstruction. Neuroimage 9, 179–194 (1999).
Fischl, B., Sereno, M. I. & Dale, A. M. Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system. Neuroimage 9, 195–207 (1999).
Destrieux, C., Fischl, B., Dale, A. & Halgren, E. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. NeuroImage 53, 1–15 (2010).
Haxby, J. V., Hoffman, E. A. & Gobbini, M. I. The distributed human neural system for face perception. Trends Cogn Sci 4, 223–233 (2000).
Nichols, T. & Hayasaka, S. Controlling the familywise error rate in functional neuroimaging: a comparative review. Stat Methods Med Res 12, 419–446 (2003).
Tallon-Baudry, C. & Bertrand, O. Oscillatory gamma activity in humans and its role in object representation. Trends Cogn Sci 3, 151–162 (1999).
This work was partially supported by Ministry of Science and Technology, Taiwan (103–2628-B-002-002-MY3, 105-2221-E-002-104), National Health Research Institute, Taiwan (NHRI-EX107-10727EI) and the Academy of Finland (No. 298131).
The authors declare no competing interests.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
About this article
Cite this article
Lin, JF.L., Silva-Pereyra, J., Chou, CC. et al. The sequence of cortical activity inferred by response latency variability in the human ventral pathway of face processing. Sci Rep 8, 5836 (2018). https://doi.org/10.1038/s41598-018-23942-x