Introduction

The detection of changes in the sensory environment is an important function of the nervous system. When the EEG is recorded during an oddball sequence, in which an unexpected discriminably different stimulus (deviant, DEV) interrupts a train of regular stimuli (standards, STD), the EEG signal exhibits a negative event-related potential (ERP) peak referred to as mismatch negativity (MMN)1. MMN is a pre-attentive signal commonly quantified as the difference between the amplitude of the DEV- and STD ERPs, and relies particularly on intact right lateral temporal and frontal top-down feedback circuits2.

The predictive coding framework has emerged as an appealing model of MMN3 and of how sensory information is processed. According to this model, the brain constantly generates top-down predictions that are compared with sensory bottom-up signals. Neural responses to stimuli that match predictions are suppressed, whereas unexpected stimuli discrepant with the prediction generate an enhanced error signal4,5,6. There are two mechanisms underlying the MMN difference signal according to the predictive coding model. First, MMN could reflect repetition suppression. When the same stimulus is repeatedly presented, neuronal populations sensitive to that stimulus undergo adaptation and neural responses decrease7. MMN could also reflect a process of prediction error, where the sensory memory of regular stimuli that establishes a predictive model is violated upon the presentation of an unexpected DEV stimulus. This violation results in an enhanced neural response, allowing detection of the environmental change and an update of the predictive model. Prediction error has been observed in human and rodent surface recordings when suitable control conditions have been included in the design of sound sequences8,9,10,11. Essentially, there is evidence in both humans and rodents that MMN receives contributions from both prediction error and repetition suppression at various levels of the auditory system11, 12.

MMN reduction occurs in healthy volunteers administered an NMDA-R antagonist13 and has been reported in over 100 separate reports in individuals diagnosed with schizophrenia14, leading to assertions that MMN indexes the functional state of NMDA-R neurotransmission15. The schizophrenia findings fit with current views that NMDA-R hypofunction contributes to the neuropathology of the disorder16, 17. Mismatch-like responses in animal models have shown a similar sensitivity to NMDA-R antagonism17,18,19,20,21, though it has also been reported that low doses of NMDA-R antagonist can increase rat mismatch responses22. MK-801 is a NMDA-R antagonist that has been widely used23. Interestingly, the use of low doses of MK-801 have physiological effects as previously demonstrated in different systems other than the auditory system, such as the acquisition of associative learning in the classical conditioning of eyeblink responses24,25,26. It is generally accepted that NMDA-R antagonists reduce MMN14. However, recent data has demonstrated that the link between NMDA-R and MMN is not as clear as previously thought27.

Our primary interest here is to test whether MK-801 differentially affects repetition suppression and prediction error at the single-unit level at both the thalamic level and cortex. While previous studies have demonstrated that MMN-like responses in rodents are altered by NMDA-R antagonists21, 28,29,30, only one report has examined the impact on prediction error component of the MMN in surface recordings22. There are no published data on the effects of NMDA-R antagonists on repetition suppression. There are no reports that have examined the impact of NMDA-R antagonists on single-unit activity and local field potential recordings from the thalamus and auditory cortex. More importantly, because the thalamus and cortex have different microcircuits that involve excitatory and inhibitory neurons and different bottom-up and top-down interconnections31, it is likely that the effects of MK-801 will not be the same at these two levels of the auditory hierarchy.

Thus, it is unknown (i) whether there are differential effects of NMDA-R antagonism on prediction error as opposed to repetition suppression at the single unit or local field potential level, and (ii) the regional specificity of where effects of NMDA-R antagonists occur: in the lemniscal (L) vs. non-lemniscal (NL) auditory areas, or the thalamus vs. cortex. Therefore, we used an acute exposure to a systemic low dose of MK-801 to examine the impact of NMDA-R antagonism on individual responses of lemniscal and non-lemniscal thalamocortical neurons while auditory oddball, many standards and cascade control sequences were presented (Fig. 1a,b). Systemic administration of the drug was chosen in order to mimic as closely as possible human research that found that infusion of ketamine, an NMDA-R antagonist, via the antecubital vein reduced MMN13. Importantly, the dose of 0.1 mg/kg MK-801 was chosen because it is sufficient to induce behavioral changes and changes to NMDA-R subunit expression in young adult female rats32, 33, similar to those used in this study. This is also the dose in which peak locomotor activation is seen in female rats34. Male animals require a higher MK-801 dose (0.3 mg/kg) to induce behavioral changes33. This design allowed us to delineate effects on repetition suppression vs. prediction error (Fig. 1c)11, 19, 35, 36. Unexpectedly, we found that MK-801 increased repetition suppression in the thalamic regions, enhanced prediction error in the cortex, and increased neuronal mismatch at both locations.

Figure 1
figure 1

Experimental design. (a) Frequency response area example, with a representation of the ten selected tones to build the experimental paradigms. (b) Stimulation sequences, the same tone could be presented in different experimental paradigms, thus we can compare same tone in different contexts to control adaptation and deviance detection; and conform the indices of neuronal mismatch (iMM), prediction error (iPE) and repetition suppression (iRS). Note that ascending and descending tones will be compared to the control ascending or descending, respectively. (c) Sketch of summary results of mismatch responses for healthy and schizophrenia subjects under the classical analysis of mismatch. Second row decomposition of neuronal mismatch, under the assumption of predictive coding framework in healthy subjects, and the hypothetical decomposition of neuronal mismatch into prediction error and repetition suppression in schizophrenia.

Material and methods

Experiments were performed on 48 (control = 25; MK-801 = 23) adult, female Long-Evans rats with body weights between 200 and 250 g (aged 9 to 15 weeks). All experimental procedures were performed at the University of Salamanca, and all procedures and experimental protocols were in accordance with the guidelines of the European Communities Directive (86/609/EEC, 2003/65/EC and 2010/63/EU) and the RD 53/2013 Spanish legislation for the use and care of animals. All the details of the study were approved by the Bioethics Committee of the University of Salamanca (ref# USAL-ID-195).

Surgical procedures

Anesthesia was induced and maintained with urethane (1.5 g/kg, i.p), with supplementary doses (0.5 g/kg, i.p.) given as needed. Dexamethasone (0.25 mg/kg) and atropine (0.1 mg/kg) were administered at the beginning of the surgery to reduce brain edema and bronchial secretions, respectively. Isotonic glucosaline solution was administered periodically (5–10 ml every 6–8 h, s.c.) to avoid dehydration. During all experimental procedures, animals were artificially ventilated, and CO2 and temperature monitored37,38,39.

The initial procedure was the same in each case, and the subsequent procedures differed only in the craniotomy location, and the placement/orientation for the recording electrode (animals per group/location: control MGB = 16, AC = 9; MK-801 MGB = 15, AC = 8). For MGB recordings, a craniotomy (~ 2 × 2 mm, from − 5 to − 6.5 mm Bregma and − 3.5 mm lateral) was performed in the left parietal bone, the dura was removed, and the electrode advanced in a vertical direction40. For AC recordings, the skin and muscle over the left temporal bone was retracted and a 6 × 5 mm craniotomy was performed (between − 2 and − 6 from Bregma) over the temporal bone41 dura was removed and the area was covered with a thin, transparent layer of agar to prevent desiccation and stabilize recordings. Electrodes for AC recording were inserted using a triple-axis micromanipulator (Sensapex), forming a 30º angle with the horizontal plane, to penetrate through all cortical layers of the same cortical column.

For this study, animals in MK-801-treated group receive a systemic intraperitoneal injection (0.1 mg/kg) of a noncompetitive NMDA-R antagonist (MK-801 hydrogen maleate, M107 Sigma-Aldrich). Control animals did not receive any injection.

Electrophysiological recording procedures

During all procedures, animals were placed in a stereotaxic frame fixed with hollow specula ear bars that housed the sound delivery system. One single neuron and local field potential (LFP) was recorded at a time, using the same tungsten electrode (1–4 MΩ) inserted into a single auditory station (MGB or AC) in each individual animal. The signal recorded was pre-amplified (1000×) and band-pass filtered (1–3000 Hz) with a medusa preamplifier (TDT). This analog signal was digitalized 12 k sampling rate and further band-pass filtered (TDT-RX6) separately for spikes (500 Hz–3 kHz) and LFP (3–50 Hz). We used short trains of white noise bursts (30 ms, 5 ms rise-fall ramps) to search for neuronal activity. To prevent neuronal adaptation during the search, some parameters (frequency and intensity) and stimulus type (white noise, pure tone) were manually varied. Once a single neuron was isolated a frequency–response area (FRA) of the response magnitude for each frequency/intensity combination was first computed (Fig. 1a). A randomized sequence of pure tones (from 1 to 44 kHz) was presented at a rate of 4 Hz, with varying frequency and intensity, and with 3 repetitions of all tones.

For each animal treated with MK-801 the first single neuron was recorded ~ 15 min after the drug injection42. Ten evenly-spaced pure tones (0.5 octaves separation) at a fixed sound intensity (usually 20–30 dB above the threshold) were selected to each neuron recorded to create the control sequences, cascade and many-standard11, 36, and additionally, adjacent pairs of them were used to present various oddball sequences (Fig. 1b). All sequences were 400 tones in length (75 ms duration, 5 ms rise-fall ramp and 250 ms interstimulus interval), each tone in the control sequences was played 40 times, with the same overall presentation rate as deviants in the oddball sequence.

Oddball sequences were used to test the specific contribution of deviant tones in an adaptation context. An oddball sequence consisted of a repetitive tone (standard 90% probability), occasionally replaced by a tone of a different frequency (deviant 10% probability), in a pseudorandom manner. We used two types of control sequences: the many-standard and cascade sequences. Both contained the same 10 frequencies but differing in the order of presentation. In the many-standard control, the 10 frequencies were randomly presented, mimicking the presentation rate and the unpredictability of the deviant tones, while cascades were played always in the same presentation order, ascending or descending in frequency. Hence the cascade contains a regularity and mimics the presentation rate of deviant sounds but in a predictable context and consequently does not violate a regularity whereas the many-standard control mimics the presentation rate of the deviant sounds but contained no regularity that could be modelled nor violated. These four conditions, and by extension responses to them, will be denoted as deviant (DEV), standard (STD), cascade (CAS) and many-standard (MSC) (Fig. 1b). Finally, if the neuron could be held for long enough, the same protocol was repeated for different frequencies and/or intensity.

Anatomical location

For MGB recording localization, at the end of each tract and experiment, two electrolytic lesions were made to mark the end and the beginning of the auditory signal (Fig. 2a). Then, animals were given a lethal dose of sodium pentobarbital and perfused transcardially with phosphate-buffered saline (0.5% NaNO3 in Phosphate Buffered Saline) followed by a fixative mix of 1% paraformaldehyde and 1% glutaraldehyde. After fixation and dissection, the brain was cryoprotected in 30% sucrose and sectioned into 40 μm slices. Sections were Nissl stained with 0.1% cresyl violet. Recording sites were marked on images from an adult rat brain atlas43 and neurons that were recorded from were assigned to one of the main divisions of the MGB (dorsal, medial or ventral). This information was complemented and confirmed by the stereotaxic coordinates as well as the depth of the neuron within a tract.

Figure 2
figure 2

Anatomical recordings location. (a) Photomicrography sample of a MGB Nissl stained slice (10x), red arrows point the two electrolytic lesions. (b) Example of localization all recordings made in the AC of one rat, each colored dot represents the characteristic frequency of each performed tract. A1: Primary Auditory Field; AAF: Anterior Auditory Field; VAF: Ventral Auditory Field; PAF: Posterior Auditory Field and SRAF: Suprarhinal Auditory Field.

For the AC experiments, a magnified picture (25×) of the exposed cortex and the Bregma references was taken at the end of the surgery with a digital single-lens reflex camera (D5100, Nikon) coupled to the surgical microscope (ZEISS). The picture was overlapped to guide and mark each electrode placement into a micrometric grid (250–500 μm spacing; Fig. 2b). Then we performed several tracts recording multi-unit activity frequency response area (FRA), the characteristic frequency arising from each FRA was placed over the picture, resulting in a characteristic frequency map of each animal. Boundaries were identified following the changes in the tonotopic gradient: high-frequency reversal between the ventral and anterior auditory fields (rostrally), low-frequency reversal between primary and posterior auditory field (dorsocaudally) and high-frequency reversal between ventral and suprarhinal auditory field (ventrally)41. Then, each recording was located in one of these five fields.

Statistical analysis

All the data analyses were performed with MATLAB software, using the built-in functions, the Statistics and Machine Learning toolbox, or custom scripts and functions developed in our laboratory. To test for significant excitatory responses to tones we used a Monte Carlo approach, simulating 1,000 peri-stimulus time histogram (PSTH) using a Poisson model with a constant firing rate equal to the spontaneous firing rate (SFR). A null distribution of baseline-corrected spike counts was generated from this collection of PSTH. Lastly, the p-value of the baseline-corrected spike count was empirically computed as p = (g + 1)/(N + 1), where g is the count of null measures greater than or equal to baseline-corrected spike count, and N = 1,000 is the size of the null sample. Finally, we only included in the analysis neuron/frequency combinations with significant excitatory response (p > 0.05) after the baseline-corrected spike count to at least one of the conditions (DEV, STD, CAS). PSTH was used to estimate the spike-density function (SDF) over the time, showing action potential density over time (in action potentials per second) from − 75 to 250 ms around stimulus onset, for the 40 trials available for each tone and condition (DEV, STD, CAS), smoothed with a 6 ms gaussian kernel (“ksdensity” function in Matlab) in 1 ms steps. The baseline SFR was determined as the average firing rate during the 75 ms preceding stimulus onset.

The excitatory response was measured as the area below the SDF and above the baseline SFR, between 0 and 180 ms after stimulus onset (positive area patches only, to avoid negative response values). This measure will be referred to as “baseline-corrected spike count”.

Baseline-corrected spike count responses of a neuron to the same tone in the three conditions (DEV, STD, CAS) were normalized using the formulas:

$${\text{DEV}}_{{{\text{Normalized}}}} = {\text{ DEV}}/{\text{N}};$$
$${\text{STD}}_{{{\text{Normalized}}}} = {\text{ STD}}/{\text{N}};$$
$${\text{CAS}}_{{{\text{Normalized}}}} = {\text{ CAS}}/{\text{N}};$$

where \(N = \sqrt {DEV^{2} + STD^{2} + CAS^{2} }\), is the Euclidean norm of the vector (DEV, STD, CAS) defined by the three responses. Similar indices were calculated using the many-standard control sequence, but the outcomes were identical with one exception as described in results and therefore the reported analyses focused on CAS. Normalized values were the coordinates of a 3D unit vector (DEVNormalized, STDNormalized, CASNormalized) with the same direction of the original vector (DEV, STD, CAS), and thus the same proportions between the three response measures. This normalization procedure always results in a value ranging from 0 to 1, and has a straightforward geometrical interpretation.

From these normalized responses, indices of neuronal mismatch (iMM), repetition suppression (iRS), and prediction error (iPE) were computed as:

$${\text{iMM }} = {\text{ DEV}}_{{{\text{Normalized}}}} {-}{\text{ STD}}_{{{\text{Normalized}}}} ;$$
$${\text{iPE }} = {\text{ DEV}}_{{{\text{Normalized}}}} - {\text{ CAS}}_{{{\text{Normalized}}}} ;$$
$${\text{iRS }} = {\text{ CAS}}_{{{\text{Normalize}}}} - {\text{ STD}}_{{{\text{Normalized}}}} ;$$

These indices, consequently, always range between − 1 and 1, and provide the following quantitative decomposition of neuronal mismatch into repetition suppression and prediction error: iMM = iRS + iPE. To test these indices over time, we divided the whole response into 12-time windows, 20 ms width, from − 50 to 190 ms with respect to the stimulus onset. Then, we compared each time window against zero using a sign-rank test, false discovery rate (FDR = 0.1) corrected for the 12 windows.

For the analysis of the LFP signal, we aligned the recorded wave to the onset of the stimulus for every trial, and computed the average LFP for every recording site and stimulus condition (DEV-LFP, STD-LFP and CAS-LFP), as well as the differences between them, resulting in the three LFP-indices: “neuronal mismatch” (MM-LFP = DEV-LFP – STD-LFP), “prediction error” (PE-LFP = DEV-LFP – CAS-LFP) and “repetition suppression” (RS-LFP = CAS-LFP – STD-LFP). Then, grand-averages were computed for all conditions and auditory station separately. The p-value of the grand-averaged for the three LFP-indices (MM-LFP, PE-LFP and RS-LFP) was determined for every time point with a two-tailed t-test (FDR corrected).

Our data set was not normally distributed, so we used distribution-free (non-parametric) tests. These included the Wilcoxon signed-rank test and Friedman test (for baseline-corrected spike counts, normalized responses, indices of neuronal mismatch, repetition suppression and prediction error). Only the difference wave for the LFPs was tested using a t-test, since each LFP trace is itself an average of 40 waves. For multiple comparison tests, p values were FDR corrected using the Benjamini–Hochberg method. Linear models were used to test for significant average iMM, iPE and iRS within each auditory station. Significant effects of station, pathway, and interactions between them were fitted using the ‘fitlm’ function in Matlab, with robust options. To estimate final sample sizes required for the observed effects after the initial exploratory experiments, we used the ‘sampsizepwr’ function in Matlab adjusted for the iPE for each region, to obtain a statistical power of 0.8 for this index. Sample sizes were enlarged with additional experiments until they were just greater than the minimum required (number of points recorded, and the minimum required for each station; see Table 1).

Table 1 Spike population analysis1.

To analyze the time course of adaptation, we computed an averaged time course for all the standard stimuli presented41, 44. Then, we fitted a power-law function with a three-parameters model, y(t) = a·tb + c, where a indicates the response’s beginning or the first spike strength; b the sensitivity to repetitive stimuli, or the adaptation velocity, and c the steady-state response. R2 values indicated that the model fits very well for standard responses in both groups, explaining between 60 and 78% of the response variability within all regions.

To analyze spikes differences between MK-801 and control group, we computed the median values for each condition tested (DEV, STD and CAS) and their differences (iMM, iRS and iPE) and calculated a ranksum test. To compare each time window between groups a two-sample t-test (from 0 to 200 ms, Bonferroni corrected for 200 comparisons with family-wise error rate FWER < 0.05) was used for the SDF and LFPs to each stimulus condition and indices, using the ‘ttest2’ function in Matlab, for every time point.

Results

We recorded a total of 290 well-isolated neurons, 143 from the control group (25 animals) and 147 from the MK-801-treated group (23 animals). One single neuron and local field potential (LFP) was simultaneously recorded at a time, using the same tungsten electrode. Recordings were performed in the medial geniculate body (MGB) and in the auditory cortex (AC) while playing oddball and control sequences (many-standards and cascade: CAS) in anesthetized rats. Thus, the same tone was played as DEV, STD, CAS and many-standards and using an ascending or descending sequence for the cascade control. This methodological configuration allowed us to calculate indices that conform to the classical mismatch index controlling for tone characteristics in different contexts (Fig. 1b). To confirm the exact recording location, we mapped the AC surface and perform electrolytic lesions in MGB (Fig. 2). Since we found no statistically significant differences when we compare mean baseline-corrected spike counts between the use of the cascade and many-standards sequences for the control group and MK-801 group, except for the MGBNL from the MK-801 group (Table 1: Cascade vs. many-standard controls analysis, p > 0.05 within all fields and groups, except for the MGBNL from the MK-801 group where p = 0.0286), the CAS sequence was chosen to control for repetition effects. This is because the CAS paradigm not only controlled for the presentation rate of the deviant stimuli but also the frequency difference (ascending or descending, Fig. 1b) between standards and deviants in the oddball sequences45.

Effects of MK-801 on the neuronal firing rate

MK-801 injection significantly reduced the responses to STD tones within all regions. By contrast, for responses to the DEV tones, we observed a significant increment in responses in AC but not for the MGB. Table 2 shows a comparative analysis of the firing rate to each stimulus condition for each group. When the firing rate of the cascade sequence was considered, MK-801 differentially affected the AC and MGB such that CAS responses were significantly increased in the MGBNL but decreased in AC (Table 2). This suggests that MK-801 may have different effects in the thalamus and cortex depending on the context of the stimulus. Furthermore, in order to test whether MK-801 affected spontaneous activity, we computed a comparative analysis for SFR. MK-801 decreased the SFR for DEV and STD tones in the AC, but not in the MGB (Table 3). Figure 3a, illustrates a boxplot showing the median normalized responses to each stimulus condition and group. The statistical significance of between group comparisons reveals that MK-801 reduced the firing rate to repetitive stimuli within all fields, while increased the responses to unexpected sounds in the AC. These results reveal a differential effect of MK-801 on the refractoriness and salience of infrequent events at the single neuron level.

Table 2 Firing rate comparisons1.
Table 3 Spontaneous firing rate comparisons1.
Figure 3
figure 3

 Single neuron spikes population analysis. Results for firing rate analysis and their computed differences along the thalamocortical axis. (a) Boxplot of median normalized responses for deviants (red), cascade (green) and standard (blue) for each group, control (light colors) and MK801 (bright colors), within each station and the statistical significance between groups (Wilcoxon signed-rank test, *p < 0.05, **p < 0.001, ***p < 0.000). (bd) Indices histograms displayed in a mirror-like manner for the two groups (controls upper and in light colors; MK801 under and in bright colors), showing the distribution of the three indexes for each neuronal response (ranging between -1 and +1, dotted lines indicate index = 0). Vertical solid lines indicate their medians and the significant difference between groups is noted at the right of each histogram block. (e) Median indices of Prediction Error (orange) and Repetition Suppression (blue), represented with respect to the baseline set by the cascade control (green line). Thereby, iPE upwards-positive while iRS is downwards-positive. Each median index corresponds to differences between normalized responses in (a). Asterisks inside bars denote statistically significance of these indices against zero (Friedman test), while asterisks outside bars denote statistically significance between groups (Wilcoxon signed-rank test, *p < 0.05, **p < 0.001, ***p < 0.000).

Effects of MK-801 on neuronal mismatch and its components

Next, we analyzed the differences between these normalized responses and computed three indexes: (1) the iMM, similar to the typical SSA index used in previous single neurons studies; (2) the iPE, that shows the relative enhancement of DEV tones compared with CAS tones and (3) iRS that reflects the level of response suppression due to the repetition effect.

The analysis of the iMM after the injection of MK-801 demonstrated that iMM values are significantly different from zero for all recording sites (Fig. 3b, Table 1: Friedman test). But when comparisons between groups were considered, the analysis revealed that MK-801 increased the neuronal iMM (Fig. 3b-iMM; Table 2). As described above, these changes are mainly due to a reduced response to STD tones in all recording locations and an enhanced response to DEV in the AC.

Since iMM = iRS + iPE, an essential advantage of these metrics is that we can determine how much of the mismatch index is due to the regularity of the context (RS) and/or to the occurrence of an infrequent event (PE). Thus, to determine which of these two components of the iMM is affected by MK-801, we computed the indices of iPE and iRS separately.

Interestingly, MGB neurons in the MK-801 group did not show any sign of genuine deviance detection, as iPE values were negative and close to zero. While both AC showed a significant positive iPE (Fig. 3c; iPE values in Table 1). When the comparison between groups was analyzed, an increased iPE for the MK-801 group in the AC was found, and even a further decreased iPE for the MGBNL in the MK-801 group (Fig. 3c,e light and bright oranges; iPE in Table 2). These data suggest that the MK-801 produces an augmentation of saliency for novel stimuli processed in the AC.

Yet, the detection of rare or novel stimuli requires the establishment of a regular context or pattern. Therefore, we were also interested in finding out if the refractoriness due to regularity was altered by MK-801. We calculated the iRS by assessing the response of the same tone when it was presented as CAS, with a 10% probability in a regular pattern and presented as STD with a probability of 90%, within an oddball paradigm, so it is in a much more regular context8, 36. In both cases, we assume some level of regularity adaptation, but only a genuine repetition suppression can be determined if the responses to STD tones are lower than responses to CAS. Our results demonstrate that there is a significant repetition suppression effect in the MK-801 group along the thalamocortical pathway (Fig. 3d bright blue; iRS in Table 1). The analysis also revealed that MK-801 produced a significant increase in repetition suppression at thalamic level but did not affect repetition suppression in the AC when compared with controls (Fig. 3d,e light and bright blues; results in Table 2).

At this juncture, it is interesting to mention that although we used as a general rule the CAS, the response to MSC and CAS within the MGB_NL are statistically different (median values to CAS = 0.7889 and MSC = 0.9966, p < 0.05 Wilcoxon signed-rank; Table 1, cascade vs. many-standard controls analysis), showing more reduced or adapted response to CAS than the MSC. Nevertheless, we also performed the analysis of the indices using the MSC as the control paradigm. Results using MSC to calculate the indices (iPE = DEV-MSC and iRS = MSC-STD) show statistical differences for iRS within all conditions, while when the MSC was used to calculate the iPE results do not show statistical differences within the MGB_NL in both groups (see Table 1, indices using many-standard control). These outcomes suggest that MGB_NL is a much complex region that we originally anticipated, and MK-801 is especially affecting their responses for the iPE as we can see in Fig. 3e “MGB_NL” where we noticed that MK-801 produced a significant reduction of the iPE. Similar results using MSC in anesthetized rats were already observed in our previous study11; so it seems a robust and reliable result. A plausible explanation is that MK-801 differentially affects the neuronal responses in the dorsal and the medial divisions of the MGB, i.e., the two areas that include MGB_NL. Indeed, previous studies has suggested that these two regions show anatomically and physiologically different properties46,47,48.

These results, in general show that the auditory thalamus and cortex differ in the way repetition effects and prediction errors are processed. To confirm this hypothesis and considering that we have previously found an increase in the level of iPE along the thalamocortical hierarchy in awake and anesthetized animals11, we fitted a linear model to determine the effect of MK-801 on the degree of increase in iPE along the auditory hierarchy relative to the control (urethane) condition. Using station (S: MGB vs. AC), drug (D: Control vs. MK-801) and pathway (P: Lemniscal vs. Non-lemniscal) and their interactions as factors, with control MGBL as reference level for these factors, the fitted model is as follows:

$$\begin{aligned} \text{iPE } & = - 0.099 - 0.052\text{D} + 0.16\text{P} + 0.216\text{S} - 0.032\text{DP} \\ & \quad + 0.297\text{DS} + 0.003\text{SP} + 0.1490\text{DPS}. \end{aligned}$$

Next, we applied an ANOVA to this model and found a significant effect of station (F = 209.24, p = 1.21 × 10–44), drug (F = 10.82, p = 0.001), pathway (F = 42.93, p = 7.66 × 10–11) and for the interaction drug × station (F = 37.9, p = 9.41 × 10–10) but not for the interactions of drug × pathway (F = 0.45, p = 0.5011) nor station × pathway (F = 0.004, p = 0.9488), nor the three-way interaction (F = 2.37, p = 0.1236). That is, there is a significant station effect (iPE is larger overall at AC than MGB), pathway (iPE is larger overall in the non-lemniscal than the lemniscal pathway), and drug (iPE is increased overall with MK-801 vs. control) and a drug × station interaction indicating that while iPE at AC is increased overall relative to MGB, the degree of iPE increase is moderated by MK-801. Inspection of Fig. 3e indicates that the significant station difference is in the direction of being larger under MK-801 than urethane. These results indicate that, indeed, the sensitivity to detect novel stimuli increases significantly more along the thalamocortical axis in the MK-801 than the control group (Fig. 3e, iPE in orange).

We also fitted a similar linear model to iRS. The resulting model was:

$$\begin{aligned} \text{iRS } & = 0.467 + 0.167\text{D} - 0.118\text{P} - 0.125\text{S} + 0.01\text{DP} \\ & \quad - 0.159\text{DS} + 0.0405\text{SP} - 0.0424\text{DPS}. \end{aligned}$$

ANOVA demonstrated a significant effect for drug (F = 26.42, p = 3.08 × 10–7), station (F = 80.52, p = 7.89 × 10–19), pathway (F = 24.65, p = 7.6 × 10–7) and the interaction of drug × station (F = 17.61, p = 2.84 × 10–5), but not for the interactions drug × pathway (F = 0.078, p = 0.7790), station × pathway (F = 1.45, p = 0.2281) and the three-way interaction (F = 0.31, p = 0.5759). Once again, these results indicate that while iRS was more marked at MGB than AC in general, this station difference was exacerbated with MK-801.

In summary, the changes described above demonstrate that NMDA-R antagonism has distinct effects on auditory scene analysis, as measured by the iPE and iRS, at different levels of the thalamocortical hierarchy.

Effect of MK-801 on spike-density function and indexes

Next, we sought to identify how MK-801 affected the temporal responses to auditory stimuli (DEV, STD and CAS) by comparing spike-density functions (SDF) to each condition between groups. The analysis revealed the latency of the main peak for the SDF to DEV tones was mostly unaffected by MK-801 in the MGB, but it was clearly delayed by 40 and 60 ms in the ACL and ACNL, respectively. Presumably, the spike count builds up more slowly following stimulus onset and then persists for longer. It is also very likely that MK-801 interact with other neuromodulatory systems including cholinergic, dopaminergic and serotonergic systems (see Discussion). Furthermore, the magnitude of the SDF was altered at the AC and MGBNL, with the early component being reduced and the later sustained component being enhanced (Fig. 4a, horizontal white line for significant differences at p < 0.05). When the STD tones were considered, we observed a distinct and significant decrease of the SDF, mostly at the AC and only marginally at the subcortical levels (Fig. 4b). Finally, MK-801 affected mostly the initial responses to cascade tones in all regions, being reduced in the auditory cortex but earlier and increased in MGBL (Fig. 4c). The sustained portion of the SDF was only significantly increased in the MGBNL. Results show that MK-801 has a profound effect on the spike-density functions to DEV, STD and cascade stimuli.

Figure 4
figure 4

Spike Density Function. (a-c) Averaged firing rate profiles for each condition as normalized spike-density function (light colors for control and bright color for MK801 group), and their respective differences (white dotted lines). Solid horizontal white lines represent the time in which the difference between groups is significant (two-sample t test p < 0.05, Bonferroni corrected). (d-f) Indices over time computed for 12 intervals (from -50 to 190ms) compared against zero (signed-rank test and FDR corrected for 12 comparisons; *p < 0.01) for each group (light colors for control and bright color for MK801 group). Solid white lines denote differences between groups across time intervals (two-sample t test for each of the 12-time windows, p < 0.05).

Next, we studied where and when the MK-801 effect on the neuronal indices of iMM, iPE and iRS was significantly different from control. Thus, we examined whether in each group independently (MK-810 and control) these indices are different from zero, i.e., is there a significant iMM, iPE or iRS at each time point. Figure 4d–f highlights the significant time windows (p < 0.01) with white and black asterisks for control and MK-801, respectively. The analysis revealed that under MK-801, there was a significant iMM along the thalamocortical axis (between 20 and 40 ms for MGBL, 20–80 ms in MGBNL and from 20 to 190 ms in both AC; Fig. 4d, bright purple lines) and a significant iPE between 20 and 180 ms in both AC, and a late iPE in the lemniscal thalamus between 60–80 ms and 140–190 ms (Fig. 4e, bright orange lines). We also found significant thalamocortical iRS (Fig. 4f, bright cyan lines; between 20 and 40 ms for MGBL, 0–100 ms in MGBNL, from 20 to 120 ms in ACL and between 40 and 100 ms in ACNL).

When we compared the two groups, the analysis revealed that MK-801 produced a significant enhancement of iMM and iPE at both AC subdivisions (p < 0.000 for iMM between 60 and 190 ms in both AC; and p < 0.05 for iPE ranging between 100 and 190 ms in ACL and between 60 and 190 ms in ACNL; white horizontal lines in Fig. 4d–f). By contrast, iRS was affected more in the MGB (p < 0.000 between 5 and 35 ms in MGBL; p < 0.000 between 40 and 110 ms in MGBNL; p < 0.05 between 60 and 130 ms in ACL; and p < 0.05 at 80 ms in ACNL; white horizontal lines in Fig. 4f). Thus, MK-801 produces an increase of iMM and iPE mostly in the late time window in AC, while iRS is much affected in the MGB.

MK-801 affects the dynamics of adaptation

Since MK-801 lowered and flattened responses to STD tones across the response window, we sought to assess the dynamics and the time course of adaptation (Fig. 5a). Results show that the control group (light gray arrows) exhibit a hierarchical adaptation, becoming faster in higher-order areas (from top to down, responses reach the half of the initial values at the fourth, ninth, twelfth and fourteenth standard tone, respectively). By contrast, results from the MK-801 group exhibited much faster adaptation dynamics (Fig. 5b; 50% of the initial response occurred at the third and second standard tones in MGB and AC, respectively; b values for control group: MGBL = − 0.1769, MGBNL = − 0.4174, ACL = − 0.6824 and ACNL = − 1.175; and for MK-801 group: MGBL = − 0.8499, MGBNL = − 0.8853, ACL = − 1.712 and ACNL = − 1.418).

Figure 5
figure 5

Time course for dynamical thalamocortical adaptation. (a) Averaged time course for the stimulus played in relation to the time elapsed from the beginning of the sequence. (b) The first fifteen standard stimuli showing the three parameters of the power low fitted: a = initial average response; b = adaptation velocity; and c = the steady-state value (dotted lines) for each group. Arrows represent the 50% of the initial responses demonstrating faster adaptation in the MK801 group and the break down in the dynamical hierarchy of adaptation.

These data suggest that by blocking the NMDA-R, MK-801 exerted an inhibitory effect on responses, allowing adaptation to occur swiftly, and altering the timing across the hierarchical organization of the auditory system, resulting in the lemniscal thalamus having almost the same adaptation velocity as the non-lemniscal cortex (arrows in Fig. 5b). Furthermore, MK-801 reduces (almost by half) the steady-state plateau in the AC (dotted lines in Fig. 5b; c values for control group: MGBL = 0.0776, MGBNL = 0.2908, ACL = 0.6084 and ACNL = 0.7740; and for MK-801 group: MGBL = 0.1428, MGBNL = 0.2884, ACL = 0.3523 and ACNL = 0.3834).

All these results together support the idea that MK-801 produces a differential effect on adaptation and deviance detection along the thalamocortical axis, providing new evidence of a change in the firing pattern and temporal responses at single neuron level.

Delayed and broader larger-scaled LFP responses

Next, we wanted to check if the single unit responses correlated with larger-scale measurements of neuronal activity. The analysis of local field potentials (LFP) revealed that MK-801 produced significant changes in MGBNL and AC (both in the lemniscal and non-lemniscal portions; Fig. 6). MK-801 shaped broader and longer responses for DEV-LFP and CAS-LFP in the auditory cortex. Although the waveforms of these LFPs were shifted in latency for the MGBNL due to a progressive delay of N1, P1 and N2 (note that this terminology refers to the first negative peak, first positive peak and second negative peak). The N1, P1 and N2 delayed peaks occurs at 8, 14 and 57 ms for DEV-LFP and at 6, 26 and 45 ms for CAS-LFP (DEV-LFP: N1 peak for MK-801 = − 6.6 μV at 20 ms and control = − 1.5 μV at 12 ms; P1 peak for MK-801 = 6.9 μV at 41 ms and control = 6.8 μV at 28 ms; finally, N2 peak for MK-801 = − 5.4 μV at 102 ms and control = − 10.2 μV at 45 ms. CAS-LFP: N1 peak for MK-801 = − 6.5 μV at 18 ms and control = − 1.2 μV at 12 ms; P1 peak for MK-801 = 5.1 μV at 53 ms and control = 10.6 μV at 27 ms; finally, N2 peak for MK-801 = − 5.0 μV at 91 ms and control = − 10.3 μV at 45 ms).

Figure 6
figure 6

 Local Field Potentials for each condition and their differences. (a-c) Population grand-averaged LFP for each condition recorded (CAS, DEV and STD) within each group (controls and MK801). Grey panels under the main LFP representations shows the instantaneous p value (white trace) of corresponding stimulus condition LFP (critical threshold set at 0.05 represented as a horizontal dotted yellow line). The thick black horizontal bars in figure 5a-c highlights the time interval for which the LFP comparison between the control and MK801 groups is significant. (d-f) Population grand-averaged LFP for and neuronal Mismatch (MM-LFP = LFPDEV-LFPSTD), Prediction Error (PE-LFP = LFPDEV-LFPCAS), and Repetition Suppression (RS-LFP = LFPSTD-LFPCAS) respectively Colored horizontal lines denote significative deflections (t-test, FDR corrected). Grey panels show the instantaneous p value (white trace) of corresponding stimulus condition LFP (critical threshold set at 0.05 represented as a horizontal dotted yellow line) and black horizontal lines the time interval in which MK801 and control are statistically different.

Similarly, we also sought significant LFP signals for each computed index (Fig. 6d–f). The horizontal colored lines highlight the time at which significant deflections occur to each index-LFP for control and MK-801 groups independently (light and bright horizontal lines, respectively). Additionally, we compared these LFP indices between groups. The analysis of the MM-LFP shows that MK-801 elicited stronger and broader deflections within all regions (horizontal bright purple lines; Fig. 6) and abolished the late negative component (N2) in the AC (MGBL: N2 = 114–157 ms; MGBNL: N1 = 12–21 ms, P1 = 32–63 ms and N2 = 75–135 ms; ACL: N1 = 10–57 ms and P1 = 60–147 ms; ACNL N1 = 20–53 and P1 = 60–144 ms). Our data also demonstrate that MK-801 produced a higher MM-LFP for virtually the whole LFP response within MGBNL and both AC, while no differences occurred in MGBL. This is consistent with our previous work where we demonstrated that neurons in the MGBL show neither SSA40 nor iMM11. This is also in agreement with the roles of the lemniscal versus non-lemniscal pathways: the lemniscal is used for the transmission of straightforward sensory information, whereas the non-lemniscal pathway integrates this information.

Similar to the spike population analysis, and considering that the PE-LFP and RS-LFP both contribute to the MM-LFP, we also wanted to understand how MK-801 shapes the LFP for prediction error and repetition suppression. In response to MK-801, the PE-LFP waveform was reduced at the early component of the MGBNL, while it was increased and delayed for the AC (orange horizontal lines in Fig. 6e). Moreover, MK-801 also abolished the N2 deflection (MGBNL: N1 = 99–146 ms; ACL: N1 = 30–65 ms and P1 = 87–180 ms; ACNL N1 = 30–67 and P1 = 106–180 ms). When PE-LFP was compared between groups, we only found differences in AC, mainly at the early (50–70 ms) and late components (120–180 ms). In other words, the lemniscal thalamus does not exhibit deviance detection, neither at the single neuron level nor at large-scale responses. Hence PE-LFP confirms single unit population data, where MK-801 produced greater levels of deviance detection in the auditory cortex (Fig. 3e).

Finally, MK-801 had similar effects on RS-LFP to those described above for MM-LFP and PE-LFP, eliciting broader and larger waveforms for MGBNL and AC (Fig. 6f; MGBNL: N1 = 10–28 ms, P1 = 34–63 ms and N2 = 73–108 ms; ACL: N1 = 10–55 ms, P1 = 67–140 ms and N2 = 148–180 ms; ACNL: N1 = 10–51, P1 = 55–132 ms and N2 = 141–180 ms). When differences between groups are considered, the non-lemniscal thalamus exhibited a shift in the waveform between 15 and 100 ms, while for the cortex, responses over virtually the whole temporal window were increased by MK-801.

Discussion

In this study, we demonstrate that the neuronal index of mismatch, derived from single-cell recordings, is profoundly affected along the auditory thalamocortical system in rats treated acutely with NMDA-R antagonist, MK-801. Importantly, we also reveal that the two elements that make up the index of mismatch, i.e., repetition suppression and prediction error, are differentially affected by MK-801 in single neurons at auditory thalamus and cortex. MK-801 increases repetition suppression in auditory thalamus and prediction error in auditory cortex. The increase in repetition suppression is more prominent in lemniscal areas of the auditory thalamus, while the increase in prediction error is more evident in the non-lemniscal areas of the auditory cortex. Furthermore, our results demonstrate that MK-801 alters the dynamics of neuronal adaptation along the thalamocortical axis, becoming faster and stronger especially at thalamic level. The results from single-unit data were confirmed by recordings of large-scale responses, LFPs, as the latter exhibit delayed and broader deflections. In summary, our work demonstrates that the MK-801 increase of the neuronal mismatch in the auditory cortex 60 ms after stimulus onset. This enhancement is due to the combined effect of an increment in the sustained responses to deviant tones and a decrement to standard tones. It should be noted that, in contrast to most previous studies using recording procedures to study neuronal population activity in rodents such as LFPs or EEG via skull screws, we have recorded single-unit activity, a technique that has a much higher resolution level at the single-cell level for revealing patterns of activity underpinning mismatch responses.

Differential effects of MK-801 on repetition suppression and prediction error in the thalamocortical pathway

It is well established that NMDA-R plays a fundamental role in neuronal plasticity, controlling long-term potentiation and depression49. NMDA-R dependent plasticity is believed to underpin the capacity of the brain to adjust internal predictions and use the memory of recent past inputs to anticipate future stimuli50. Further, it is generally accepted that human MMN is reduced after NMDA-R antagonist treatments because NMDA-R antagonist blocks synaptic plasticity, precluding the formation of a memory trace for the standard tones14. As we have seen in our results, MK-801 reduces responses to standard tones thus increasing repetition suppression.

Although this finding supports the hypothesis that NMDA-R antagonists alter sensory-memory formation51, the results that 0.1 mg/kg MK-801 treatment produces a significant increment in response to the deviant tones, in prediction error and hence, an increase in the neuronal mismatch, is in the opposite direction to expected. It is clear that the role of NMDA-R in the generation of MMN is considerably more complex than thought22. There have been suggestions in the literature of precedents for our observations52. Even considering that MK-801 has 160 times the affinity of ketamine to NMDA-R, necessitating higher ketamine doses for similar drug effect53, our results conform with those that report an increment in amplitude and latencies to deviant responses after acute ketamine treatment in rats54 and with a sub-anesthetic dose of ketamine in healthy humans producing larger N100 to deviant tones but not MMN55. Interestingly, a dose–response study of the MK-801 effects on MMN-like responses in male rats showed that while a high dose (0.5 mg/kg) reduced late deviance detection (around 55 ms), a medium dose (0.3 mg/kg) significantly enhanced early deviance detection effects (at about 13 ms) and some evidence of enhanced late effects although not significantly22. We used a single dose of 0.1 mg/kg in female rats, as it has been demonstrated that females are more sensitive to MK-801 than males34 and that this dose is sufficient to induce behavioral effects56. Importantly, memantine, a low-affinity uncompetitive agonist of NMDA-R, has been shown to (i) increase the duration of rodent MMN-like responses30, (ii) increase MMN amplitude in healthy individuals57, and (iii) in persons with schizophrenia58.

The memantine results suggest an interpretation of our findings in terms of the mechanisms underpinning synaptic plasticity59. Partial blockade of NMDA-R (such as mediated by memantine, or low dose MK-801) is also likely to reduce background calcium flux resulting in homeostatic upregulation of NR2B-containing NMDA-Rs leading in turn to the conversion of synapses to a plastic state. That is, while these drugs reduce calcium influx during uncorrelated activity, there is increased calcium influx during correlated activity (produced by physiological stimuli), improved signal to noise, facilitated transmission and increased plasticity60,61,62.

Different excitatory/inhibitory networks may explain different MK-801 effects along the thalamocortical axis

The increased repetition suppression we observed in the medial geniculate body could also be by altered excitatory/inhibitory balance. Although the rat MGB lacks GABAergic neurons, it receives GABAergic input from the thalamic reticular nucleus (TRN) and the inferior colliculus63. The latter is a source of bottom-up inhibitory influences, while the TRN provides the MGB with an indirect and inhibitory feedback activation from auditory cortex64. Cortical stimulation hyperpolarizes TRN neurons and increases their inhibitory output to the MGB65 and furthermore, TRN has been demonstrated to profoundly influence SSA in the MGB66. Changes in the neuronal firing pattern of thalamic neurons into bursts have been suggested to provide an alerting signal to the somatosensory cortex to enhance stimulus detection67. The previous statement supports the idea that following MK-801 treatment we observed a decrement of responses to standard tones, which could result in a reduced responsiveness of standards. As a consequence, MK-801 would enhance the detection rate of a novel stimulus, a deviant (i.e., increased signal to noise). Overall our results match the general concept that when the system is adapted, it is more sensitive to detect changes in the environment68, where a stronger thalamic repetition suppression (or inhibition) supports the increase in the prediction error signals (excitatory) at cortical level or viceversa. It would be interesting to test whether thalamic repetition suppression is correlated with cortical prediction error signals, but this question awaits future experiments.

Other characteristics of the neuronal mechanisms and microcircuitry involving the glutamate NMDA-R system is that NMDA-R are located not only at postsynaptic and presynaptic sites in excitatory neurons, but they are also found at GABAergic inhibitory interneurons in the neocortex69. MK-801 has demonstrated a preferential regulation of the firing rate of cortical GABA interneurons, increasing the firing rate of the majority of pyramidal neurons70 and therefore producing an imbalance in the excitatory/inhibitory networks in the cortices71, 72. It is well known that cortical GABAergic interneurons differentially amplify stimulus-specific adaptation (a similar phenomenon to iMM) in excitatory pyramidal neurons in the auditory cortex73. Moreover, a model of a mutually coupled excitatory/inhibitory network can explain distinct mechanisms that allow cortical inhibitory neurons to enhance the brain's sensitivity to deviant or unexpected sounds74. Further, MK-801 would alter the tonic inhibitory control of NMDA-R in auditory cortical areas leading to the activation of pyramidal neurons by subsequent deviant tones. Thus, the increase in prediction error could arise due to the reciprocal interaction between fast-spiking parvalbumin-positive interneurons and pyramidal neurons. As such, NMDA-R hypofunction of either of these cell types could disturb the excitatory/inhibitory balance that in turn result in the increased or decreased firing of the recorded neurons.

Paradoxical effects of low doses and administration mode of MK-801

Our data are at odds with two previous studies17, 75 using similar recording approaches both of which report reduction in deviant responses with NMDA-R antagonist with no effect on neural responses to standards. The reasons for this discrepancy are unclear, but differences in animal species (rats vs. macaques), in the NMDA-R antagonist used (MK-801 vs ketamine), in drug administration routes (i.p, intramuscular, or local), and/or a combination of these factors may have affected our results. It should also be noted that paradoxical behavioral effects of low doses of MK-801 have been noted previously76 as well as differential effects of low (0.1 mg/kg) vs. high (1.0 mg/kg) MK-801 doses on NMDA-R and neuronal activity in pyramidal neurons vs. interneurons32. It is clear that MK-801 effects on both behavior, mRNA expression and protein of NMDA-R subunits are nonlinear and strongly dose-dependent.

Another explanation for our findings is that MK-801 was administered systemically. Thus, MK-801 would affect not only auditory thalamus and auditory cortex, but likely also other regions such as the prefrontal cortex. Furthermore, it is plausible that NMDA-R blockade will also lead to a reduction in direct top-down excitatory input to pyramidal neurons in the auditory cortex from prefrontal cortex77, 78. As discussed in79, MK-801 increases activity of prefrontal cortex axons targeting primary visual cortex, reducing the evoked-activity in V1 to repetitive stimuli, possibly mediated through specific subclasses of inhibitory neurons, such as vasoactive intestinal peptide (VIP) interneurons or somatostatin (SST) interneurons. A similar explanation would be plausible for an intermediate region in-between the auditory thalamus and cortex such as the TRN, which may, in turn, suppress the bottom-up thalamic auditory input to auditory cortex. Both mechanisms likely contribute to the alterations of neuronal mismatch we observed as both auditory cortex and auditory thalamus responses are affected. It will be important in the future to further distinguish between these possibilities by, e.g., optogenetic and electrophysiological techniques, in order to target different neuronal populations and probe the activity of the different interneuron subtypes in this model.

Additionally, it is unclear if the effects we observe in single neurons following systemic administration of the drug would be similar to those following local administration of MK-801. A recent publication summarized the literature on effects of systemic vs. local administration MK-801. For example, systemic MK-801 increased glutamate release in medial prefrontal cortex, while local MK-801 did not80. Likewise81, have shown that LFP recordings in urethane-anesthetized rats differ for systemic and local MK-801 applications. There are many potential consequences of systemic NMDA-R blockade, ranging from the alteration of the inputs to MGB from IC and/or to auditory cortex from prefrontal cortex (see above) to differential effects on pyramidal cells versus interneurons in cortex. The reasons for our choice of systemic administration as outlined in the introduction was to mimic as closely as possible the human research on the effects of NMDA-R antagonists on MMN. However, it is clear that future animal studies will be needed to reconcile the divergence of results between the effects of using systemic administration of the present study and localized injections of previous studies (e.g.,17).

Admittedly, we have only tackled the functional role of the NMDA-R under a particular experimental manipulation, and we cannot exclude the possibility that higher doses of MK-801 would have generated different results. It is also well known that other neuromodulatory systems such as the dopaminergic, cholinergic and/or cannabinoid systems may be altered and interact with the NMDA-Rs in healthy brain function82,83,84 as well as in schizophrenia71, 85,86,87.

Our study is valuable because it has revealed the involvement of two fundamental mechanisms, i.e., repetition suppression and prediction error; and two different pathways, lemniscal and non-lemniscal, underlying the neuronal mismatch in the thalamocortical hierarchy. Predictive coding theory proposes that the brain constantly tries to minimize the discrepancy between actual sensory input and internal representations of the environment5. What is new in our data is the critical importance of the hierarchical organization of the auditory system in sharing the ‘responsibility’ for generating the representation and detecting the discrepancy, mainly attributable to thalamic and cortical processes. However, our data provide evidence that the NMDA-synaptic plasticity and MMN relationship is not as simple as previously surmised from human studies.

MMN and schizophrenia

What are the implications of our findings for schizophrenia given the somewhat unexpected outcome? Reduced MMN is associated with poor global functioning88 and cognitive deficits89 in schizophrenia. Hence, if a safe drug were available that targeted the relevant NMDA-R subunit, and facilitated neuroplasticity as indexed by increased MMN, it offers opportunities for interventions to remediate cognitive deficits that are a core feature of schizophrenia90. Memantine, which has been shown to increase MMN amplitude in healthy individuals and schizophrenia, has been used as adjunctive therapy in schizophrenia for some time to improve cognition in particular. While the effects of adjunctive therapy are small, a recent meta-analysis suggests that there are improvements in global measures of cognition, but enhancements in more sensitive composite cognitive test scores have not been observed91. To date, there have been no attempts to utilize MMN enhancement to memantine as an index of increased neuroplasticity that could be exploited in remediation studies (but see 92 for a suggested galantamine-memantine combination therapy for enhancing MMN, improving cognition and preventing transition to psychosis in a high risk group). Interestingly, both the moderate affinity antagonist, memantine, and high-affinity antagonist, MK-801, bind to the NR2B subunit of the NMDA-R at very similar binding locations93 but only memantine has been approved for use in humans given evidence of neurotoxic effects of MK-801 in humans94. One avenue of future research is the development of safe compounds for human use that target similar binding locations to memantine and MK-801.