The right inferior frontal gyrus processes nested non-local dependencies in music

Cheung, Vincent K. M.; Meyer, Lars; Friederici, Angela D.; Koelsch, Stefan

doi:10.1038/s41598-018-22144-9

Download PDF

Article
Open access
Published: 28 February 2018

The right inferior frontal gyrus processes nested non-local dependencies in music

Scientific Reports volume 8, Article number: 3822 (2018) Cite this article

6983 Accesses
47 Citations
99 Altmetric
Metrics details

Subjects

Abstract

Complex auditory sequences known as music have often been described as hierarchically structured. This permits the existence of non-local dependencies, which relate elements of a sequence beyond their temporal sequential order. Previous studies in music have reported differential activity in the inferior frontal gyrus (IFG) when comparing regular and irregular chord-transitions based on theories in Western tonal harmony. However, it is unclear if the observed activity reflects the interpretation of hierarchical structure as the effects are confounded by local irregularity. Using functional magnetic resonance imaging (fMRI), we found that violations to non-local dependencies in nested sequences of three-tone musical motifs in musicians elicited increased activity in the right IFG. This is in contrast to similar studies in language which typically report the left IFG in processing grammatical syntax. Effects of increasing auditory working demands are moreover reflected by distributed activity in frontal and parietal regions. Our study therefore demonstrates the role of the right IFG in processing non-local dependencies in music, and suggests that hierarchical processing in different cognitive domains relies on similar mechanisms that are subserved by domain-selective neuronal subpopulations.

Meter enhances the subcortical processing of speech sounds at a strong beat

Article Open access 29 September 2020

Brain-correlates of processing local dependencies within a statistical learning paradigm

Article Open access 12 September 2022

Spatiotemporal brain hierarchies of auditory memory recognition and predictive coding

Article Open access 21 May 2024

Introduction

Complex auditory sequences known as music exist in all human cultures¹, and elements in many musical styles are hierarchically structured^2,3. Examples include harmonic progressions in classical western tonal music⁴ and jazz⁵, as well as transformations of tone rows in twelve-tone serialist compositions⁶. A sequence is said to be hierarchical if the dependencies (rules which bind two elements) between its elements can be represented as a type of mathematical graph called a rooted tree (acyclic graph with a designated root element)^7,8. This means that all elements of a sequence are connected to form an overarching structure⁷, and implies the existence of a sub-/superordinate relationship between elements⁹. We shall refer to this definition of hierarchy throughout the text. Furthermore, a dependency is said to be local if it relates elements that directly follow one another in a sequence, and non-local if the dependency spans over multiple intervening elements. Consider nested sequences, which have the form A_nA_n−1…A₁B₁…B_n−1B_n and contain dependencies that are embedded, or nested, within another dependency: In the nested sequence A₃A₂A₁B₁B₂B₃, the (local) dependency between A₁ and B₁ is embedded within the (non-local) dependency between A₂ and B₂, which is in turn embedded within the (non-local) dependency between A₃ and B₃. Importantly, the ability to relate remote, non-local musical events beyond their immediate temporal sequential order is said to be crucial for successfully processing hierarchical structures in music^7,10,11.

Previous studies on processing hierarchical structures in music argued that humans can differentiate between auditory tone sequences generated according to a hierarchical recursive rule and an iterative rule¹², show priming effects in integrating harmonic contextual information^13,14, and discriminate between grammatical and ungrammatical transformations in serialist music^15,16. Moreover, harmonically irregular chords within a chord sequence were shown to elicit an early right anterior negativity (ERAN) in event-related brain potentials (ERPs) (using EEG^11,17,18 and MEG for the magnetic equivalent¹⁹), which can already be observed in infants²⁰, and in a musical scale previously unheard by participants²¹. Functional MRI (fMRI) studies employing similar violation paradigms have also reported differential responses in the bilateral inferior frontal gyri (sometimes with a right-hemispheric dominance)^{18,22,23,24,25,26,27}, and the anterior insular cortices^22,23,27. Activity in the inferior frontal gyrus (IFG) was suggested to process hierarchical structure in music, given that language studies have implicated the left IFG – particularly the pars opercularis – in syntactic reordering and embedding^28,29,30,31. This is corroborated by interference effects observed between musical and linguistic syntax^32,33,34, which suggest that processing hierarchical structure relies on domain-general neural resources³⁵.

However, the question of whether hierarchical processing was really involved in the previous studies has also been raised. It has been argued that humans rearrange scrambled phrases of music in a way that was grammatically-coherent locally but not globally³⁶, and are insensitive towards transpositions^37,38 and re-orderings³⁹ to sections of classical music pieces. Moreover, violations to the hierarchical structure in previous experiments were not restricted to dependencies between non-local elements, but also violated local dependencies between immediately-adjacent chords¹¹. Although the ERP study by Koelsch and colleagues¹¹ controlled for the possibility of processing musical sequences in a strictly local fashion, the precise neuro-functional basis of processing non-local dependencies in music nevertheless remains unknown.

The current study was thus conducted to assess the functional basis of processing non-local dependencies in music in the human brain, whilst controlling for local transition probabilities. Our approach was a grammaticality judgment task based on an artificial grammar learning paradigm^40,41,42. Musicians learnt a nested atonal grammar of piano-tone sequences (i.e. A_nA_n−1…A₁B₁…B_n−1B_n) before discriminating between novel grammatical and ungrammatical musical sequences during fMRI scanning. In contrast to previous studies, successful completion of the task requires participants to explicitly abstract notes into motifs and to store multiple non-local dependencies in parallel. We moreover manipulated the level of embedding (that is, the number of nested dependencies in a sequence) to dissociate the processing of nested dependencies in music from the effects of increasing working memory demands on processing these dependencies, and to ensure that the observed responses in resolving the nested dependencies generalised to different levels.

We hypothesised that violations to the nested grammar would elicit increased BOLD responses in the bilateral inferior frontal gyri, especially in the right hemisphere. Based on the literature on auditory tonal working memory^43,44,45,46, we also hypothesised increased BOLD responses in the dorsolateral prefrontal cortex and parietal areas with increased levels of embedding due to additional working memory demands.

Materials and Methods

Participants

Twenty musically-trained participants (12 females, 8 males) with normal hearing and at least seven years of training (cf.^47,48,49) in their most experienced instrument (M = 13.30 years, SD = 5.78 years) completed both sessions of the experiment. No participants reported absolute pitch or neurological/psychological disorders. Participants were excluded from further analyses if their hit rates during the fMRI session for grammatical sequences, ungrammatical sequences with category violations, or ungrammatical sequences with state violations (see Stimuli) fell below the 5% significance level of performing above chance according to a binomial test. Two male participants were excluded that way. One female participant was excluded due to incidental findings. Data were analysed for the remaining 17 participants (mean age = 26.29 years, SD = 2.37 years; mean experience in most experienced instrument = 13.76 years, SD = 6.00 years, seven of whom were conservatory-level). They were right-handed⁵⁰, and had a mean score of 93.3 (SD = 13.1) in general music sophistication⁵¹. Informed consent was obtained from all participants and the experiment was approved by the ethics committee of the University of Leipzig in accordance with the Declaration of Helsinki.

Stimuli

Our study employed a 2 × 2 factorial design with factors GRAMMATICALITY (GRAMMATICAL versus UNGRAMMATICAL) and LEVEL OF EMBEDDING (LoE: ONE-LoE versus TWO-LoE) to dissociate the effects of grammaticality and working memory in processing non-local dependencies in music.

Auditory sequences of the nested structure A_nA_n−1…A₁B₁…B_n−1B_n (see Fig. 1A) were generated by concatenating two (ZERO-LoE; structure A₁B₁), four (ONE-LoE; structure A₂A₁B₁B₂), or six musical motifs (TWO-LoE; structure A₃A₂A₁B₁B₂B₃). Each motif consisted of three successive isochronous piano tones (duration = 250 ms per tone) and belonged to one of four categories in one of two states (see Fig. 1A, Table 1 and Supplementary Audio clips SA1–6). The motifs of a sequence were randomly concatenated without replacement so that distinct categories in state A preceded the same categories in state B but in reverse order. This prevented participants from using a counting strategy. Ungrammatical sequences that violated this nested structure were introduced by interchanging either the state (so-called state violations) or category (so-called category violations), but not both, of exactly one B-motif. The replaced motif in a category violation was chosen so that the category was not presented before in the sequence. All sequences contained a pause (duration = 750 ms) between each pair of motifs, and the lowest note of each motif in a sequence was uniformly sampled between all twelve tones in the octave between C4 and B4. This was to avoid participants relying solely on tonal information and accomplishing the task based on matching tones they heard. Sequence durations were thus 2.25 s for ZERO-LoE, 5.25 s for ONE-LoE, and 8.25 s for TWO-LoE.

Table 1 Relative pitches of the four motif categories in semitones apart.

Full size table

Due to the extensive number of unique sequences (see Supplementary Information), ONE-LoE and TWO-LoE sequences were presented only once throughout the entire experiment. Sequences with ZERO-LoE were used without replacement before being reshuffled back into the pool after each session.

Procedure

The experiment was divided into a training session and a scanning session, which were around 3.5 weeks apart (mean = 25.76 days, SD = 7.20 days); only participants who acquired the musical grammar in the training session participated in the scanning session (76.67% of participants successfully acquired the grammar). The experiment was programmed on Presentation 18.1 (Neurobehavioral Systems, Inc., Berkeley, CA, USA). Auditory stimuli (mono, 44,100 Hz sampling rate, 16 bits per sample) were delivered at a comfortable volume through circumaural headphones (with foam earplugs inserted inside the MRI scanner). White text was shown against a black background on a computer screen or viewed using a mirror attached to the head-coil from a back-projected image. Foam pads were placed to reduce head movement before fMRI scanning.

In the training session, participants were instructed to learn the grammar of a new language and extract the rule underlying the sequences. The session resembled the learning phase by Bahlmann and colleagues⁴⁰ (see Supplementary Information).

In the scanning session (see Fig. 1B), participants discriminated inside the MRI scanner the grammaticality of 144 novel sequences, equally divided between the four combinations of the two factors: GRAMMATICALITY (GRAMMATICAL versus UNGRAMMATICAL) and LEVEL OF EMBEDDING (ONE-LoE versus TWO-LoE). Violations in ungrammatical sequences were counterbalanced for violation type (i.e. state versus category) and occurrence amongst the B positions. Stimuli were presented across six runs with a break (25 s) between each run. Participants were notified visually 5 s before the end of each break. Stimuli were pseudo-randomised such that at most two consecutive stimuli shared the same LEVEL OF EMBEDDING and GRAMMATICALITY. Each trial began with a randomly-jittered fixation cross (1.2 s–2.2 s) at the centre of the screen, followed by the presentation of an auditory sequence. After a short break (0.75 s), the letters Y (grammatical) and N (ungrammatical) appeared on the lower left and right sides of the fixation cross, for which participants had a 4 s time-window to decide on the grammaticality of the preceding sequence by pressing either the right index or middle finger on an MR-compatible button box; The letter position was pseudo-randomised. Visual feedback was displayed on the centre of the screen (1 s) to motivate performance (see Data analysis) and the next trial ensued. Additionally, 18 grammatical and 18 ungrammatical 0-LoE sequences were presented as filler sequences and were not analysed.

Data acquisition

Imaging data were collected on a 3 T Magnetom Skyra scanner (Siemens Healthcare, Erlangen, Germany) with a 20-channel head coil. Slices were acquired axially parallel to the AC-PC line for the whole-brain using a gradient EPI sequence (31 slices per volume, slice thickness = 3 mm, inter-slice gap = 1 mm, acquisition order = odd-interleaved ascending, FoV = 192 mm × 192 mm, acquisition matrix = 64 × 64, TR = 2000 ms, TE = 30 ms, flip angle = 90°, bandwidth = 2004 Hz/Px, echo spacing = 0.56 ms). The functional scan time was 42.3 minutes and 1270 volumes were obtained continuously. T1-weighted structural images (voxel size = 1 mm isotropic) of each participant were used to coregister and normalise the functional images to MNI space.

Data analysis

Data were analysed using MATLAB R2016a (The MathWorks, Inc., Natick, MA, USA), JASP 0.7.5.6 (JASP Team), and R 3.3.3 (The R Foundation for Statistical Computing, Vienna, Austria). To measure how well participants discriminated between GRAMMATICAL and UNGRAMMATICAL nested musical sequences, we applied signal detection theory⁵² to dissociate their behavioural sensitivity in detecting deviants in grammaticality from their response bias using the non-parametric sensitivity measure A and associated log-bias ln(b)⁵³. The sensitivity measure gives an estimate of the mean area under the ROC curve, and the dissociation of sensitivity and bias avoids a misrepresentation of performance due to conflated hit rates. A high sensitivity score thus corresponds to a high hit rate and a low false positive rate. As we wanted participants to perform accurately, they were not instructed to respond as quickly as possible and reaction times were not analysed.

After checking for normality using the Shapiro-Wilk test, one-sample t-tests were conducted to compare the overall mean sensitivity against chance-level (0.5), and mean bias against 0, as well as paired t-tests to compare mean sensitivity and bias for the two LEVELS OF EMBEDDING separately. Effect sizes were calculated using Cohen’s d for correlated samples⁵⁴.

Imaging data were analysed using SPM 12.6685 (Wellcome Trust Centre for Neuroimaging, London, UK). Volumes were slice-timing corrected, realigned to the first volume using rigid-body transformation and motion-susceptibility correction, coregistered to the individual’s structural image, resampled to a voxel size of 3 mm × 3 mm × 4 mm and normalised to MNI space, and smoothed with a FWHM Gaussian kernel 2.5 times the voxel size for preprocessing.

For statistical analyses at the first-level, a voxelwise GLM was estimated for each participant. Each sequence was modelled as a boxcar function of the same stimulus duration and convolved with the canonical HRF. One regressor was used to model correctly-responded sequences for each combination of the two factors: GRAMMATICALITY (GRAMMATICAL versus UNGRAMMATICAL) and LEVEL OF EMBEDDING (ONE-LoE versus TWO-LoE). A regressor modelling all remaining sequences, a regressor indicating volumes within breaks, a regressor to remove finger-press artifacts, and six motion regressors were added as regressors of no interest. Finger presses were modelled using a boxcar function with the response-prompt as onset and reaction time as duration and were convolved with the canonical HRF. A high-pass filter (128 s cut-off) and an autoregressive AR(1) model were applied.

To guarantee that the observed effects on grammaticality were not confounded by serial processing, we additionally estimated a refined model at the subject-level where ungrammatical sequences only consisted of correctly-responded category violations at positions B₂ and B₃ post hoc. Sequences with violations in position B₁ or state violations could have been rejected by a strategy which does not require resolving non-adjacent dependencies, as it might have been possible for participants to have detected these violations by only comparing surface features of the violated motif and the preceding motif^55,56. However, it is unlikely that participants relied solely on serial or local processing to reject these sequences, because (1) they could not have known beforehand the grammaticality of a sequence and position of the violation, and so would have had built a nested representation of the motifs to complete the task in case the sequence was grammatical, and (2) only participants who were proficient in detecting category violations were included in the analysis.

For statistical analyses at the group-level, data were modelled by a 2 × 2 flexible factorial model. We assumed independence at the subject and condition levels, equal variance in the former and unequal in the latter. Significant clusters were identified using an a priori-defined voxelwise FWE-corrected threshold of p < 0.05 and an extent of four voxels, determined by rounding above the expected cluster size estimated from the smoothness of the SPM based on Gaussian random-field theory⁵⁷. Anatomical locations were identified using the MNI2TAL tool⁵⁸ and SPM Anatomy Toolbox 2.2c⁵⁹.

We turned to a psychophysiological interaction (PPI) analysis to explore the extent to which brain regions implicated in the current study are functionally related. In brief, PPI measures the change in functional connectivity between two regions under different experimental contexts⁶⁰. This allows us to infer any (undirected) flow of task-relevant information between two brain regions. Given our factorial design, we carried out a generalised PPI analysis⁶¹ as opposed to a traditional PPI analysis⁶². Generalised PPI is more suited to our experiment as the model includes all psychological factors and hence spans the entire experimental space.

Seed regions were defined by drawing spheres (radii = 4.5 mm, chosen to avoid overlap between the regions of interest) around the maxima of each significant cluster of the refined model. They were then multiplied with the group mask to ensure that each seed region only included brain voxels present in all participants. We examined the task-modulated functional connectivity between each seed region and the remaining clusters across the four conditions by comparing the gPPI regressors of each subject in a 2 × 2 flexible factorial model at the group-level. An explicit mask of the seed regions was applied during the estimation of the second-level model, and we adopted the same statistical threshold as before. The analysis was performed using the gPPI toolbox 13.1⁶¹.

To further examine whether psychophysiological interactions predicted behavioural performance, we additionally correlated the mean difference in beta estimates of each significant PPI cluster with participants’ overall sensitivity in detecting grammatical violations. The beta estimates of each cluster were extracted using MarsBaR 0.44⁶³.

Data analysed during this study are available from the corresponding author upon reasonable request.

Results

Behavioural results

Participants’ overall sensitivity (M = 0.95 (out of a maximum score of 1), SD = 0.04, range = 0.88–1.00) in discriminating between grammatical and ungrammatical nested musical sequences was significantly above chance (t(16) = 51.92, p = 1.44 × 10⁻¹⁹, Cohen’s d = 12.59) and correlated with the number of years of training in their most experienced instrument (r = 0.63, p(corrected) = 0.03, see Figure S1 and Supplementary Information), although the response bias (M = −0.19, SD = 0.17) indicated a significant bias towards judging a sequence as grammatical (t(16) = −4.52, p = 3.47 × 10⁻⁴, Cohen’s d = −1.10). The mean sensitivity for ONE-level of embedding (LoE) sequences was also significantly higher than TWO-LoE sequences, but the difference in bias between the two levels of embedding was not significant (see Fig. 2 and Table 2).

Table 2 Mean sensitivity and bias in detecting violations in nested musical sequences.

Full size table

Imaging results

Distinct clusters of significant BOLD-response differences (see Fig. 3A and Table 3) were evaluated using SPM t-contrasts at the whole-brain level for both main effects of GRAMMATICALITY and LEVEL OF EMBEDDING. For the main effect of GRAMMATICALITY (UNGRAMMATICAL > GRAMMATICAL), we found a cluster of increased BOLD response with maxima in the right inferior frontal gyrus (IFG; pars opercularis, triangularis, and orbitalis), right middle frontal gyrus (in the dorsolateral prefrontal cortex), and right anterior insular cortex (AIC). We additionally identified clusters in the pre-supplementary motor area (pre-SMA), right dorsal premotor cortex, and left anterior insular cortex (AIC). The reverse contrast (GRAMMATICAL > UNGRAMMATICAL) yielded increased responses in the bilateral ventromedial prefrontal cortex (vmPFC).

Table 3 Significant clusters showing differential BOLD responses with respect to the GRAMMATICALITY and LEVEL OF EMBEDDING (LoE) of nested musical sequences.

Full size table

For the main effect of LEVEL OF EMBEDDING (TWO-LoE > ONE-LoE), we observed bilaterally clusters in the inferior parietal lobule (IPL), middle frontal gyrus, dorsal premotor cortex, pre-supplementary motor area, and left frontal pole. For the reverse contrast (ONE-LoE > TWO-LoE), we identified clusters in the bilateral middle superior temporal gyri, and left premotor cortex.

No supra-threshold clusters were yielded for the interaction contrast (even at a more lenient cluster-wise FWE-corrected threshold of p < 0.05).

Results of the refined model which only contained category violations in ungrammatical sequences (see Materials and Methods) were furthermore analogous to the original model. The GRAMMATICALITY contrast (UNGRAMMATICAL > GRAMMATICAL) yielded four significant clusters with maxima in the right anterior insular cortex (coordinates in MNI space: [33, 26, 2]), right pars opercularis ([45, 17, 10]), right pars triangularis ([48, 23, 22]), and left anterior insular cortex (AIC; [−33, 20, −6]), whilst the LEVEL OF EMBEDDING contrast (TWO-LoE > ONE-LoE) yielded two significant clusters with maxima in the right inferior parietal lobule (IPL; [60, −49, 30]), and the right middle frontal gyrus ([42, 29, 42]). No significant clusters in the reverse and interaction contrasts were observed at the voxel-corrected statistical threshold (although a more-lenient cluster-corrected threshold nonetheless yielded significant clusters in the same regions as in the original model).

Psychophysiological Interaction (PPI) analysis

For the main effect of GRAMMATICALITY (UNGRAMMATICAL > GRAMMATICAL), we found significant psychophysiological interactions (see Fig. 3B and Table 4) between the right IPL (as seed) and the right pars opercularis and bilateral AIC. Only the psychophysiological interaction of the right IPL on the right AIC significantly correlated with participants’ overall sensitivity in discriminating between grammatical and ungrammatical sequences (see Fig. 3C, r = 0.55, p = 0.03, one-tailed test; corrected using Holm’s method across all significant clusters with the same seed region).

Table 4 Maxima of clusters showing increased psychophysiological interactions during the experimental context of UNGRAMMATICAL versus GRAMMATICAL nested sequences.

Full size table

We also found significant interactions between the right middle frontal gyrus (as seed) and the right pars triangularis/inferior frontal sulcus and bilateral AIC with respect to GRAMMATICALITY Other combinations of contrasts and seed regions did not yield significant results.

Discussion

The present experiment aimed to uncover the functional neural basis underlying the human ability to process non-local dependencies – a key feature of hierarchical structures – in music. By independently manipulating the grammaticality and auditory tonal working memory demands of nested atonal musical sequences, we found that grammatical violations of nested musical dependencies led to increased BOLD responses in the right inferior frontal gyrus (IFG) and bilateral anterior insular cortices (AIC), whilst increased auditory working memory demands led to enhanced responses in the bilateral middle frontal gyri (MFG) and inferior parietal lobules (IPL). This result confirms our hypothesis that the inferior frontal gyrus – especially the right homologue of Broca’s area – is involved in processing non-local dependencies in music. Modulations in functional connectivity between these two distinct functional networks were also associated with discriminating between grammatical and ungrammatical nested sequences. In particular, the task-modulated connectivity between the right AIC and right IPL predicted behavioural performance. These suggest that resolving non-local dependencies in music requires the interplay between brain regions involved in processing hierarchical structures in music and brain regions involved in tonal working memory.

Our findings provide the first evidence that engagement of the right IFG in the neurocognition of music reflects processing of nested non-local dependencies based on internal knowledge of the grammatical rules of musical syntax. Importantly, our refined analysis suggests that the posterior right IFG is sensitive to violations of exclusively non-local dependencies in the absence of any local violation between immediate-adjacent elements. This clarifies prior work that had observed right-lateralised IFG activity (e.g. Koelsch et al., 2005, 2002) but employed paradigms that confounded hierarchical and local irregularity (e.g. a chord other than the tonic following a dominant seventh chord at the end of a chord sequence), such that observed IFG activity could have resulted from to local (serial) processing alone.

Participants’ post-experiment reports moreover suggest that the nested non-local dependencies were processed hierarchically. When prompted to explain what the underlying rule for the sequences were in an open-ended manner, all participants used terms such as ‘mirror’ (seven participants), ‘symmetry’ (four participants), ‘tree’ (one participant), or that the second half is the same as the first but in reversed/inverted order (five participants), to explain how the sequences were arranged. Given that each test sequence was unique and participants were never told what the generative rule was (see Materials and Methods), this demonstrates that participants were able to abstract surface features of the stimuli into a syntactic rule that describes their structure. This representation is moreover hierarchical because the A-motifs (e.g. A₃A₂A₁) are grouped into a superordinate set that is mirror-transformed into another superordinate set containing the B-motifs (e.g. B₁B₂B₃) in the second half. Importantly, participants exploited their understanding of the mirror-symmetric rule to accomplish the task. Participants actively predicted what the next motif could be in each sequence and compared their predictions with the incoming motif. A sequence was deemed grammatical if all predictions were met, and ungrammatical otherwise. This required participants to maintain multiple nested dependencies in parallel and in the correct order, as well as to monitor which superordinate class a motif belonged to (i.e. whether the upcoming motif would belong to the first or second half of a sequence).

Our finding of the right IFG in processing nested non-local dependencies in music thus supports the view that Broca’s area and its right homologue are involved in processing hierarchical structure, although with different hemispheric-weightings across various cognitive domains. While processing linguistic syntax is weighted towards the left hemisphere, we suggest that processing hierarchical structure in music is weighted towards the right: Previous studies have shown that the left IFG is sensitive to violations in nested artificial phoneme sequences⁶⁴, and elicited increased activity for nested hierarchical sequences compared to serial non-hierarchical verbal sequences^30,40,64,65. The pars opercularis of the left IFG in particular showed functional specificity to syntactic information in language²⁹. Therefore, the observed activity in the right IFG – particularly the pars opercularis as highlighted in our refined analysis – likely reflects mechanisms which involve constructing hierarchical representations of the incoming acoustic information⁶⁶.

The present data therefore raise the possibility that at least some aspects of hierarchical processing may rely on lateralised, domain-selective neuronal populations. Despite proposals for an interaction between syntactic processing in music and language^32,34,67,68, our findings are more in line with the proposal that representing hierarchical dependencies could differ in music and language⁶⁹, and thus calls for the existence of domain-specific resources in parsing the hierarchical dependencies. Musso and colleagues⁷⁰, for example suggest that processing syntax in music and language is highly differentiated within the left IFG, but more generally engages in a dual-stream system that connects left frontal, parietal, and temporal regions. Our findings therefore motivate further studies on clarifying the extent to which music and language share common neural resources, and to what extent the observed effects in the right IFG pertain to syntactic processing specifically, or to general mechanisms such as attention⁷¹ or cognitive control^72,73.

Nevertheless, our proposed clear-cut separation between the left IFG processing linguistic syntax and the right IFG processing musical syntax is only tentative: In light of the recent controversy on inflated false-positive rates in fMRI studies⁷⁴, we adopted a conservative correction for multiple comparisons at the expense of statistical sensitivity – significantly increased BOLD responses to violations in musical syntax were indeed also observed in the left IFG at a more lenient cluster-wise corrected statistical threshold. Second, the participants of our study were trained musicians – a group shown to exhibit increased grey-matter volume in the right IFG⁷⁵, higher fibre-tract volumes between the right IFG and the temporal lobe⁷⁶, and differences in BOLD-response patterns^77,78,79, compared to non-musicians. Alternatively, effects of lateralisation could have been driven by the idiosyncratic differences in experimental stimuli. For example, phonemes in speech are typically much shorter in duration than musical notes. It has been argued that auditory information is extracted at different timescales between the two hemispheres⁸⁰, and that the left hemisphere is more specialised towards processing temporal features whilst the right towards spectral features of the auditory stream^46,81. This mechanism could also explain how musical stimuli does not consistently engage the right or left IFG.

Furthermore, the role of hierarchy in resolving nested dependencies in artificial grammar studies has also been challenged. It has been argued that sequential processing is more parsimonious⁸², and that there is no intrinsic super-/subordinate relation across elements in a sequence⁸³. In light of this, we are cautious in interpreting how processing nested non-local dependencies in our sequences extended towards hierarchical processing of music, and only suggest the role of the right IFG in the hierarchical processing of music based on participants’ self reports.

In addition to our experimental manipulation of grammaticality in a nested musical syntax, we also manipulated the demands of auditory tonal working memory by varying the number of embedding levels. When contrasting sequences with two levels of embedding against one, we observed decreased behavioural performance and increased BOLD responses in the inferior parietal lobule (IPL) and the middle frontal gyrus (MFG) of the dorsolateral prefrontal cortex (dlPFC). These effects likely reflect the established role of the IPL and dlPFC in tonal working memory, given the additional dependency induced by an additional pair of motifs to be held in memory. In accordance with this interpretation, activity in the MFG and IPL were shown to be involved in same/difference tasks in musical melodies^46,84, and n-back tasks using chord sequences⁸⁵. Similar responses have also been observed during the maintenance of pitch information^43,45,46,86.

However, because motifs were matched with their transposed conjugates, participants had to maintain interval relations between the motifs and not their absolute pitch classes. Consequently, the contrast between sequences with two levels of embedding against one also reflects manipulation – on top of encoding and maintenance – of the incoming acoustic signals in tonal working memory. Previous studies have suggested that dlPFC activity may reflect modulatory top-down control signals to information represented elsewhere in the neocortex^87,88,89,90, whilst the IPL was shown to be involved in integrating acoustic melodic information within a tonal context⁹¹, as well as comparing original and transposed melodies^84,92. According to state-based models of working memory^87,93,94, information held in working memory are distributed across the cortex and represented in increasing levels of abstractness: from relatively raw and unprocessed in the sensory cortices to highly abstract in the frontal cortex⁹³. The observed increased BOLD response in the dlPFC may therefore reflect increased attention in maintaining the additional pair of motifs that is represented in a transposed and processed format in the IPL (please also see the Supplementary Information for the discussion on some brain areas for which we did not have clear a priori hypotheses).

Extending the functional main effects that we have just discussed, our psychophysiological interaction (PPI) analysis suggests that the behavioural sensitivity to violations of musical syntax requires the interaction of a system that processes nested hierarchical information (i.e. IFG and AIC), and an auditory working memory system (i.e. MFG and IPL). We thus propose that the ability to resolve nested hierarchies in music depends on how salient the constituent musical elements are represented in working memory. With this interpretation, BOLD response modulation in the AIC reflects participants’ awareness to violations that motivates the appropriate motor preparation and response. In addition to the involvement of the AIC in music cognition^{22,25,27,43,45,46,95,96}, the right AIC is a key node of the salience network that detects behaviourally relevant stimuli⁹⁷, supports the translation of affective signals into specific actions^98,99 and is associated with perceptual decision-making, interoception, and emotional awareness^{98,100,101,102,103}. A second mathematically equivalent interpretation of our PPI result is that processing violations in musical syntax instead drives functional connectivity between the IFG and AIC. We suggest that this is less plausible, because discriminating between grammatical and ungrammatical sequences required similar demands in maintaining the musical motifs in tonal working memory.

A limitation in comparing sequences with two levels of embedding against one is the inclusion of an extra pair of motifs that necessitates a longer sequence duration, which confounds distance and item effects. Although this limitation is present in other studies using similar artificial grammar-like nested sequences^64,104, the effects of length were typically not observed, possibly because the differences in duration in those studies were around 1 s compared to 3 s in the current experiment. This difference could explain the supposed activity of the supplementary motor area when comparing sequences with one level of embedding against two – which most likely reflects motor preparation for the ensuing button response after the stimuli. Moreover, although we were able to infer from participants’ reports that they were sensitive to the order and pairing between A and B motifs, further studies could for example include a reordering violation (e.g. A₃A₂A₁B₃B₁B₂) to verify that participants were indeed sensitive to the order of motifs presented.

In conclusion, processing hierarchical structure in music may involve two functionally segregated but nonetheless interacting systems in the right hemisphere: the IFG and AIC in resolving nested non-local dependencies between musical elements, and the MFG and IPL in auditory tonal working memory. The resemblance of the right-lateralised activation profile in processing non-local dependencies in music with the left-hemispheric system in language moreover suggests that processing hierarchical structures in music and language involves qualitatively similar mechanisms subserved by domain-specific neural subpopulations.

References

Schellenberg, E. G. Expectancy in melody: Tests of the implication-realization model. Cognition 58, 75–125 (1996).
Article CAS PubMed Google Scholar
Lerdahl, F. & Jackendoff, R. A Generative Theory of Tonal Music. (The MIT Press, 1983).
Rohrmeier, M., Zuidema, W., Wiggins, G. A. & Scharff, C. Principles of structure building in music, language and animal song. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 370, 20140097 (2015).
Article PubMed PubMed Central Google Scholar
Rohrmeier, M. Towards a generative syntax of tonal harmony. J. Math. Music 5, 35–53 (2011).
Article Google Scholar
Steedman, M. J. A Generative Grammar for Jazz Chord Sequences. Music Percept. 2, 52–77 (1984).
Article Google Scholar
Babbitt, M. Twelve-tone invariants as compositional determinants. Music. Q. 46, 246–259 (1960).
Article Google Scholar
Fitch, W. T. & Martins, M. D. Hierarchical processing in music, language, and action: Lashley revisited. Ann. N. Y. Acad. Sci. 1316, 87–104 (2014).
Article ADS PubMed PubMed Central Google Scholar
Fitch, W. T. Toward a computational framework for cognitive biology: unifying approaches from cognitive neuroscience and comparative cognition. Phys. Life Rev. 11, 329–64 (2014).
Article ADS PubMed Google Scholar
Badre, D. Cognitive control, hierarchy, and the rostro-caudal organization of the frontal lobes. Trends Cogn. Sci. 12, 193–200 (2008).
Article PubMed Google Scholar
Jeon, H.-A. Hierarchical processing in the prefrontal cortex in a variety of cognitive domains. Front. Syst. Neurosci. 8, 223 (2014).
Article PubMed PubMed Central Google Scholar
Koelsch, S., Rohrmeier, M., Torrecuso, R. & Jentschke, S. Processing of hierarchical syntactic structure in music. Proc. Natl. Acad. Sci. USA 110, 15443–8 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Martins, M. D., Gingras, B., Puig-Waldmueller, E. & Fitch, W. T. Cognitive representation of ‘musical fractals’: Processing hierarchy and recursion in the auditory domain. Cognition 161, 31–45 (2017).
Article PubMed PubMed Central Google Scholar
Tillmann, B. & Bigand, E. Global context effect in normal and scrambled musical sequences. J. Exp. Psychol. Hum. Percept. Perform. 27, 1185–96 (2001).
Article CAS PubMed Google Scholar
Bigand, E. & Pineau, M. Global context effects on musical expectancy. Percept. Psychophys. 59, 1098–1107 (1997).
Article CAS PubMed Google Scholar
Kuhn, G. & Dienes, Z. Implicit Learning of Nonlocal Musical Rules: Implicitly Learning More Than Chunks. J. Exp. Psychol. Learn. Mem. Cogn. 31, 1417–1432 (2005).
Article PubMed Google Scholar
Dienes, Z. & Longuet-Higgins, C. Can musical transformations be implicitly learned? Cogn. Sci. 28, 531–558 (2004).
Article Google Scholar
Garza Villarreal, E. A., Brattico, E., Leino, S., Østergaard, L. & Vuust, P. Distinct neural responses to chord violations: A multiple source analysis study. Brain Res. 1389, 103–114 (2011).
Article CAS PubMed Google Scholar
Koelsch, S., Gunter, T., Friederici, A. D. & Schröger, E. Brain indices of music processing: ‘nonmusicians’ are musical. J. Cogn. Neurosci. 12, 520–541 (2000).
Article CAS PubMed Google Scholar
Maess, B., Koelsch, S., Gunter, T. C. & Friederici, A. D. Musical syntax is processed in Broca’s area: an MEG study. Nat. Neurosci. 4, 540–545 (2001).
Article CAS PubMed Google Scholar
Jentschke, S., Friederici, A. D. & Koelsch, S. Neural correlates of music-syntactic processing in two-year old children. Dev. Cogn. Neurosci. 9, 200–208 (2014).
Article PubMed Google Scholar
Loui, P., Wu, E. H., Wessel, D. L. & Knight, R. T. A Generalized Mechanism for Perception of Pitch Patterns. J. Neurosci. 29, 454–459 (2009).
Article CAS PubMed PubMed Central Google Scholar
Koelsch, S., Fritz, T., Schulze, K., Alsop, D. & Schlaug, G. Adults and children processing music: An fMRI study. Neuroimage 25, 1068–1076 (2005).
Article PubMed Google Scholar
Tillmann, B., Janata, P. & Bharucha, J. J. Activation of the Inferior Frontal Cortex in Musical Priming. Ann. N. Y. Acad. Sci. 999, 209–211 (2003).
Article ADS PubMed Google Scholar
Tillmann, B. et al. Cognitive priming in sung and instrumental music: Activation of inferior frontal cortex. Neuroimage 31, 1771–1782 (2006).
Article CAS PubMed Google Scholar
Vuust, P., Roepstorff, A., Wallentin, M., Mouridsen, K. & Østergaard, L. It don’t mean a thing…. Neuroimage 31, 832–841 (2006).
Article CAS PubMed Google Scholar
Minati, L. et al. FMRI/ERP of musical syntax: comparison of melodies and unstructured note sequences. Neuroreport 19, 1381–5 (2008).
Article PubMed Google Scholar
Koelsch, S. et al. Bach speaks: A cortical ‘language-network’ serves the processing of music. Neuroimage 17, 956–966 (2002).
Article PubMed Google Scholar
Meyer, L., Obleser, J., Anwander, A. & Friederici, A. D. Linking ordering in Broca’s area to storage in left temporo-parietal regions: The case of sentence processing. Neuroimage 62, 1987–1998 (2012).
Article PubMed Google Scholar
Goucha, T. & Friederici, A. D. The language skeleton after dissecting meaning: A functional segregation within Broca’s Area. Neuroimage 114, 294–302 (2015).
Article PubMed Google Scholar
Makuuchi, M., Bahlmann, J., Anwander, A. & Friederici, A. D. Segregating the core computational faculty of human language from working memory. Proc. Natl. Acad. Sci. USA 106, 8362–7 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Uddén, J., Ingvar, M., Hagoort, P. & Petersson, K. M. Broca’s region: A causal role in implicit processing of grammars with crossed non-adjacent dependencies. Cognition 164, 188–198 (2017).
Article PubMed Google Scholar
Kunert, R., Willems, R. M. & Hagoort, P. Language influences music harmony perception: effects of shared syntactic integration resources beyond attention. R. Soc. Open Sci. 3, 150685 (2016).
Article PubMed PubMed Central Google Scholar
Slevc, L. R., Rosenberg, J. C. & Patel, A. D. Making psycholinguistics musical: Self-paced reading time evidence for shared processing of linguistic and musical syntax. Psychon. Bull. Rev. 16, 374–381 (2009).
Article PubMed Google Scholar
Koelsch, S., Gunter, T. C., Wittfoth, M. & Sammler, D. Interaction between syntax processing in language and in music: an ERP Study. J. Cogn. Neurosci. 17, 1565–1577 (2005).
Article PubMed Google Scholar
Patel, A. D. Language, music, syntax and the brain. Nat. Neurosci. 6, 674–681 (2003).
Article CAS PubMed Google Scholar
Tillmann, B., Bigand, E. & Madurell, F. Local versus global processing of harmonic cadences in the solution of musical puzzles. Psychol. Res. 61, 157–174 (1998).
Article Google Scholar
Marvin, E. W. & Brinkman, A. The Effect of Modulation and Formal Manipulation on Perception of Tonic Closure by Expert Listeners. Music Percept. An Interdiscip. J. 16, 389–407 (1999).
Article Google Scholar
Cook, N. The Perception of Large-Scale Tonal Closure. Music Percept. 5, 197–205 (1987).
Article Google Scholar
Konecni, V. J. In Advances in Pschology: Cognitive Processes in the Perception of Art (eds. Crozier, W. R. & Chapman, A. J.) 19, 71–93 (Elsevier Science Publishers B.V., 1984).
Bahlmann, J., Schubotz, R. I. & Friederici, A. D. Hierarchical artificial grammar processing engages Broca’s area. Neuroimage 42, 525–534 (2008).
Article PubMed Google Scholar
Lai, J., Krahmer, E. & Sprenger, J. The learnability of Auditory Center-embedded Recursion. Proc. 37th Annu. Meet. Cogn. Sci. Soc. 1237–1242 (2015).
Lai, J. & Poletiek, F. H. The impact of adjacent-dependencies and staged-input on the learnability of center-embedded hierarchical structures. Cognition 118, 265–273 (2011).
Article PubMed Google Scholar
Koelsch, S. et al. Functional architecture of verbal and tonal working memory: An fMRI study. Hum. Brain Mapp. 30, 859–873 (2009).
Article PubMed Google Scholar
Kumar, S. et al. A Brain System for Auditory Working Memory. J. Neurosci. 36, 4492–4505 (2016).
Article CAS PubMed PubMed Central Google Scholar
Schulze, K., Zysset, S., Mueller, K., Friederici, A. D. & Koelsch, S. Neuroarchitecture of verbal and tonal working memory in nonmusicians and musicians. Hum. Brain Mapp. 32, 771–783 (2011).
Article PubMed Google Scholar
Zatorre, R. J., Evans, A. C. & Meyer, E. Neural mechanisms underlying melodic perception and memory for pitch. J. Neurosci. 14, 1908–1919 (1994).
Google Scholar
Wu, H. et al. Musical experience modulates categorical perception of lexical tones in native Chinese speakers. Front. Psychol. 6 (2015).
Steele, C. J., Bailey, J. A., Zatorre, R. J. & Penhune, V. B. Early Musical Training and White-Matter Plasticity in the Corpus Callosum: Evidence for a Sensitive Period. J. Neurosci. 33, 1282–1290 (2013).
Article CAS PubMed Google Scholar
Bernardi, L. Cardiovascular, cerebrovascular, and respiratory changes induced by different types of music in musicians and non-musicians: the importance of silence. Heart 92, 445–452 (2005).
Article PubMed PubMed Central Google Scholar
Oldfield, R. C. The Assessment and Analysis of Handedness: The Edinburgh Inventory. Neuropsychologia 9, 97–113 (1971).
Article CAS PubMed Google Scholar
Müllensiefen, D., Gingras, B., Musil, J. & Stewart, L. The musicality of non-musicians: An index for assessing musical sophistication in the general population. PLoS One 9 (2014).
Green, D. M. & Swets, J. A. Signal detection theory and psychophysics. (John Wiley & Sons, 1966).
Zhang, J. & Mueller, S. T. A note on ROC analysis and non-parametric estimate of sensitivity. Psychometrika 70, 145–154 (2005).
Article MathSciNet MATH Google Scholar
Lakens, D. Calculating and reporting effect sizes to facilitate cumulative science: a practical primer for t-tests and ANOVAs. Front. Psychol. 4, 1–12 (2013).
Article Google Scholar
de Vries, M. H., Monaghan, P., Knecht, S. & Zwitserlood, P. Syntactic structure and artificial grammar learning: The learnability of embedded hierarchical structures. Cognition 107, 763–774 (2008).
Article PubMed Google Scholar
Perruchet, P. & Rey, A. Does the mastery of center-embedded linguistic structures distinguish humans from nonhuman primates? Psychon. Bull. Rev. 12, 307–313 (2005).
Article PubMed Google Scholar
Friston, K. J., Holmes, a, Poline, J. B., Price, C. J. & Frith, C. D. Detecting activations in PET and fMRI: levels of inference and power. Neuroimage 4, 223–235 (1996).
Article CAS PubMed Google Scholar
Lacadie, C. M., Fulbright, R. K., Rajeevan, N., Constable, R. T. & Papademetris, X. More accurate Talairach coordinates for neuroimaging using non-linear registration. Neuroimage 42, 717–725 (2008).
Article PubMed PubMed Central Google Scholar
Eickhoff, S. B. et al. A new SPM toolbox for combining probabilistic cytoarchitectonic maps and functional imaging data. Neuroimage 25, 1325–1335 (2005).
Article PubMed Google Scholar
O’Reilly, J. X. et al. Tools of the trade: psychophysiological interactions and functional connectivity. Soc. Cogn. Affect. Neurosci. 7, 604–9 (2012).
Article PubMed PubMed Central Google Scholar
McLaren, D. G., Ries, M. L., Xu, G. & Johnson, S. C. A generalized form of context-dependent psychophysiological interactions (gPPI): A comparison to standard approaches. Neuroimage 61, 1277–1286 (2012).
Article PubMed PubMed Central Google Scholar
Friston, K. J. et al. Psychophysiological and modulatory interactions in neuroimaging. Neuroimage 6, 218–29 (1997).
Article CAS PubMed Google Scholar
Brett, M., Anton, J.-L., Valabregue, R. & Poline, J.-B. Region of interest analysis using the MarsBar toolbox for SPM 99. Neuroimage 16, S497 (2002).
Google Scholar
Friederici, A. D., Bahlmann, J., Heim, S., Schubotz, R. I. & Anwander, A. The brain differentiates human and non-human grammars: functional localization and structural connectivity. Proc. Natl. Acad. Sci. USA 103, 2458–63 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Friederici, A. D., Makuuchi, M. & Bahlmann, J. The role of the posterior superior temporal cortex in sentence comprehension. Neuroreport 20, 563–568 (2009).
Article PubMed Google Scholar
Dehaene, S., Meyniel, F., Wacongne, C., Wang, L. & Pallier, C. The Neural Representation of Sequences: From Transition Probabilities to Algebraic Patterns and Linguistic Trees. Neuron 88, 2–19 (2015).
Article CAS PubMed Google Scholar
Fedorenko, E., Patel, A. D., Casasanto, D., Winawer, J. & Gibson, E. Structural integration in language and music: evidence for a shared system. Mem. Cognit. 37, 1–9 (2009).
Article PubMed Google Scholar
Kunert, R., Willems, R. M., Casasanto, D. & Patel, A. D. Music and Language Syntax Interact in Broca’ s Area: An fMRI Study. 1–16, https://doi.org/10.1371/journal.pone.0141069 (2015).
Patel, A. D. & Morgan, E. Exploring Cognitive Relations Between Prediction in Language and Music. Cogn. Sci. 1–18, https://doi.org/10.1111/cogs.12411 (2016).
Musso, M. et al. A single dual-stream framework for syntactic computations in music and language. Neuroimage 117, 267–283 (2015).
Article PubMed Google Scholar
Perruchet, P. & Poulin-Charronnat, B. Challenging prior evidence for a shared syntactic processor for language and music. Psychon. Bull. Rev. 20, 310–7 (2013).
Article PubMed Google Scholar
Slevc, L. R. & Okada, B. M. Processing structure in language and music: a case for shared reliance on cognitive control. Psychon. Bull. Rev. 22, 637–52 (2015).
Article PubMed Google Scholar
Fedorenko, E., Behr, M. K. & Kanwisher, N. Functional specificity for high-level linguistic processing in the human brain. Proc. Natl. Acad. Sci. USA 108, 16428–16433 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Eklund, A., Nichols, T. E. & Knutsson, H. Cluster failure: Why fMRI inferences for spatial extent have inflated false-positive rates. Proc. Natl. Acad. Sci. 113, 201602413 (2016).
Article Google Scholar
Sato, K., Kirino, E. & Tanaka, S. A Voxel-Based Morphometry Study of the Brain of University Students Majoring in Music and Nonmusic Disciplines. Behav. Neurol. 2015 (2015).
Halwani, G. F., Loui, P., Rüber, T. & Schlaug, G. Effects of Practice and Experience on the Arcuate Fasciculus: Comparing Singers, Instrumentalists, and Non-Musicians. Front. Psychol. 2, 1–9 (2011).
Article Google Scholar
Vuust, P. et al. To musicians, the message is in the meter. Neuroimage 24, 560–564 (2005).
Article PubMed Google Scholar
Schlaug, G., Jancke, L., Huang, Y. & Steinmetz, H. In vivo evidence of structural brain asymmetry in musicians. Science (80-.). 267, 699–701 (1995).
Article ADS CAS Google Scholar
Jäncke, L., Shah, N. J. & Peters, M. Cortical activations in primary and secondary motor areas for complex bimanual movements in professional pianists. Cogn. Brain Res. 10, 177–183 (2000).
Article Google Scholar
Poeppel, D. The analysis of speech in different temporal integration windows: Cerebral lateralization as ‘asymmetric sampling in time’. Speech Commun. 41, 245–255 (2003).
Article Google Scholar
Zatorre, R. J. Cerebral correlates of human auditory processing; perception of speech and musical sounds. Acoust. Signal Process. Cent. Audit. Syst. 453–468 (1997).
Frank, S. L., Bod, R. & Christiansen, M. H. How hierarchical is language use? Proc. R. Soc. B Biol. Sci. 279, 4522–4531 (2012).
Article Google Scholar
Ojima, S. & Okanoya, K. The Non-Hierarchical Nature of the Chomsky Hierarchy-Driven Artificial-Grammar Learning. BIOLINGUISTICS 8, 163–180 (2014).
Google Scholar
Jerde, T. A., Childs, S. K., Handy, S. T., Nagode, J. C. & Pardo, J. V. Dissociable systems of working memory for rhythm and melody. Neuroimage 57, 1572–1579 (2011).
Article PubMed Google Scholar
Pallesen, K. J. et al. Cognitive Control in Auditory Working Memory Is Enhanced in Musicians. PLoS One 5, e11120 (2010).
Article ADS PubMed PubMed Central Google Scholar
Gaab, N., Gaser, C., Zaehle, T., Jancke, L. & Schlaug, G. Functional anatomy of pitch memory - An fMRI study with sparse temporal sampling. Neuroimage 19, 1417–1426 (2003).
Article PubMed Google Scholar
D’Esposito, M. & Postle, B. R. The Cognitive Neuroscience of Working Memory. Annu. Rev. Psychol. 66, 115–142 (2015).
Article PubMed Google Scholar
Curtis, C. E. & D’Esposito, M. Persistent activity in the prefrontal cortex during working memory. Trends Cogn. Sci. 7, 415–423 (2003).
Article PubMed Google Scholar
Riggall, A. C. & Postle, B. R. The Relationship between Working Memory Storage and Elevated Activity as Measured with Functional Magnetic Resonance Imaging. J. Neurosci. 32, 12990–8 (2012).
Article CAS PubMed PubMed Central Google Scholar
Wager, T. D. & Smith, E. E. Neuroimaging studies of working memory: a meta-analysis. Cogn. Affect. Behav. Neurosci. 3, 255–274 (2003).
Article PubMed Google Scholar
Royal, I. et al. Activation in the right inferior parietal lobule reflects the representation of musical structure beyond simple pitch discrimination. PLoS One 11, 1–14 (2016).
Google Scholar
Foster, N. E. V. & Zatorre, R. J. A role for the intraparietal sulcus in transforming musical pitch information. Cereb. Cortex 20, 1350–1359 (2010).
Article PubMed Google Scholar
Christophel, T. B., Klink, P. C., Spitzer, B., Roelfsema, P. R. & Haynes, J.-D. The Distributed Nature of Working Memory. Trends Cogn. Sci. 21, 111–124 (2017).
Article PubMed Google Scholar
Ma, W. J. et al. Changing concepts of working memory. Nat. Neurosci. 17, 347–56 (2014).
Article CAS PubMed PubMed Central Google Scholar
Platel, H. et al. The structural components of music perception. A functional anatomical study. Brain 120(Pt 2), 229–243 (1997).
Article PubMed Google Scholar
Lappe, C., Steinsträter, O. & Pantev, C. Rhythmic and melodic deviations in musical sequences recruit different cortical areas for mismatch detection. Front. Hum. Neurosci. 7, 260 (2013).
Article PubMed PubMed Central Google Scholar
Uddin, L. Q. Salience processing and insular cortical function and dysfunction. Nat. Rev. Neurosci. 16, 55–61 (2014).
Article PubMed Google Scholar
Craig, A. D. (Bud). How do you feel — now? The anterior insula and human awareness. Nat. Rev. Neurosci. 10, 59–70 (2009).
Article CAS PubMed Google Scholar
Wager, T. D. & Barrett, L. F. From affect to control: Functional specialization of the insula in motivation and regulation. Emotion 129, 2865 (2004).
Google Scholar
Nieuwenhuys, R. The insular cortex. A review. Progress in Brain Research 195, (Elsevier B.V., 2012).
Chang, L. J., Yarkoni, T., Khaw, M. W. & Sanfey, A. G. Decoding the role of the insula in human cognition: Functional parcellation and large-scale reverse inference. Cereb. Cortex 23, 739–749 (2013).
Article PubMed Google Scholar
Kurth, F., Zilles, K., Fox, P. T., Laird, A. R. & Eickhoff, S. B. A link between the systems: functional differentiation and integration within the human insula revealed by meta-analysis. Brain Struct. Funct. 1–16, https://doi.org/10.1007/s00429-010-0255-z (2010).
Augustine, J. R. Circuitry and functional aspects of the insular lobe in primates including humans. Brain Res. Rev. 22, 229–244 (1996).
Article CAS PubMed Google Scholar
Bahlmann, J., Schubotz, R. I., Mueller, J. L., Koester, D. & Friederici, A. D. Neural circuits of hierarchical visuo-spatial sequence processing. Brain Res. 1298, 161–70 (2009).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the Max Plank Society (V.K.M.C., L.M., A.D.F., S.K.) and the Croucher Foundation (V.K.M.C.). We also thank all participants in the study, Chiao-Yi Wu, Mauricio Martins, Tomás Goucha, and Thomas C. Gunter for their helpful discussions, as well as the two anonymous reviewers for their contributions in improving this paper.

Author information

Authors and Affiliations

Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
Vincent K. M. Cheung, Lars Meyer, Angela D. Friederici & Stefan Koelsch
Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway
Stefan Koelsch

Authors

Vincent K. M. Cheung
View author publications
You can also search for this author in PubMed Google Scholar
Lars Meyer
View author publications
You can also search for this author in PubMed Google Scholar
Angela D. Friederici
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Koelsch
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

V.K.M.C., S.K. and A.D.F. developed the paradigm. V.K.M.C. conducted the experiment. V.K.M.C. and L.M. analysed the data. V.K.M.C., L.M., A.D.F., and S.K. wrote and reviewed the manuscript.

Corresponding author

Correspondence to Vincent K. M. Cheung.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplemental Information

Supplementary Audio SA1

Supplementary Audio SA2

Supplementary Audio SA3

Supplementary Audio SA4

Supplementary Audio SA5

Supplementary Audio SA6

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cheung, V.K.M., Meyer, L., Friederici, A.D. et al. The right inferior frontal gyrus processes nested non-local dependencies in music. Sci Rep 8, 3822 (2018). https://doi.org/10.1038/s41598-018-22144-9

Download citation

Received: 24 October 2017
Accepted: 16 February 2018
Published: 28 February 2018
DOI: https://doi.org/10.1038/s41598-018-22144-9

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

Participants

Stimuli

Procedure

Data acquisition

Data analysis

Results

Behavioural results

Imaging results

Psychophysiological Interaction (PPI) analysis

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links