Enhanced activations in the dorsal inferior frontal gyrus specifying the who, when, and what for successful building of sentence structures in a new language

Umejima, Keita; Flynn, Suzanne; Sakai, Kuniyoshi L.

doi:10.1038/s41598-023-50896-6

Download PDF

Article
Open access
Published: 02 January 2024

Enhanced activations in the dorsal inferior frontal gyrus specifying the who, when, and what for successful building of sentence structures in a new language

Keita Umejima¹,
Suzanne Flynn² &
Kuniyoshi L. Sakai¹

Scientific Reports volume 14, Article number: 54 (2024) Cite this article

3943 Accesses
30 Altmetric
Metrics details

Subjects

Abstract

It has been argued that the principles constraining first language acquisition also constrain second language acquisition; however, neuroscientific evidence for this is scant, and even less for third and subsequent languages. We conducted fMRI experiments to evaluate this claim by focusing on the building of complex sentence structures in Kazakh, a new language for participants having acquired at least two languages. The participants performed grammaticality judgment and subject-verb matching tasks with spoken sentences. We divided the participants into two groups based on the performance levels attained in one of the experimental tasks: High in Group I and Low in Group II. A direct comparison of the two groups, which examined those participants who parsed the structures, indicated significantly stronger activations for Group I in the dorsal left inferior frontal gyrus (L. IFG). Focusing on Group I, we tested the contrast between the initial and final phases in our testing, which examined when the structures were parsed, as well as the contrast which examined what structures were parsed. These analyses further demonstrated focal activations in the dorsal L. IFG alone. Among the individual participants, stronger activation in the dorsal L. IFG, measured during the sentence presentations, predicted higher accuracy rates and shorter response times for executing the tasks that followed. These results cannot be explained by task difficulty or memory loads, and they, instead, indicate a critical and consistent role of the dorsal L. IFG during the initial to intermediate stages of grammar acquisition in a new target language. Such functional specificity of the dorsal L. IFG provides neuroscientific evidence consistent with the claims made by the Cumulative-Enhancement model in investigating language acquisition beyond target second and third languages.

The language network as a natural kind within the broader landscape of the human brain

Article 12 April 2024

Memorability shapes perceived time (and vice versa)

Article 22 April 2024

EEG is better left alone

Article Open access 09 February 2023

Introduction

The Cumulative-Enhancement model (CEM) is hypothesized to account for how multiple languages are acquired; in this model, knowledge of any previously acquired languages can facilitate subsequent language acquisition^1,2,3,4. The CEM coheres with the claim that the principles of the biological endowments that constrain the first language (L1) acquisition process also constrain the second language (L2) bilingual acquisition process⁵. In a recent functional magnetic resonance imaging (fMRI) study⁶, we obtained neuroscientific support for this model, in that the same syntax-related brain regions were activated for both multilinguals [L1: Japanese; L2: English; third language (L3): typically Spanish] and bilinguals [L1: Japanese; L2: English], while acquiring sentence constructions in a new subsequent language Kazakh, i.e., in an L3 for bilinguals and in a fourth language (L4) for multilinguals. Moreover, both syntax-related and domain-general brain networks were more enhanced for multilinguals than for bilinguals. Direct comparisons between the multilinguals and bilinguals showed significantly enhanced activations for the multilinguals in the ventral left inferior frontal gyrus (L. IFG) and right lingual gyrus (R. LG). In addition, activations of the multilinguals in the bilateral frontal and temporal regions, including the lateral premotor cortex (LPMC) and superior/middle temporal gyri (STG/MTG), were maintained at a higher level than the initial level during new, subsequent grammar conditions, while activations of the bilinguals in the basal ganglia/thalamus and cerebellum returned to the initial level at the start of each condition. While the above regions were identified as the neural substrates for multilingualism, previous studies investigating L1 and L2 acquisition have shown that the dorsal L. IFG is commonly recruited for syntactic processing in L1/L2^7,8,9; the dorsal L. IFG is identified as the “grammar center”¹⁰. It should be noted that activations in the dorsal L. IFG were eliminated by the most stringent group comparisons of multilinguals versus bilinguals in the above mentioned study. On the other hand, it has been suggested that the L. IFG and L. STG/MTG form the core language regions^11,12. In the present study, we hypothesized that the most critical syntactic processes involved in the acquisition of the grammars of the L3, L4, …, Ln should continuously involve the same core region of the dorsal L. IFG, consistent with the CEM accounts for language acquisition. Regarding other theories and hypotheses, see our previous paper⁶. In the present paper, we aim to specify which of the cortical regions (especially dorsal L. IFG, ventral L. IFG, or L. STG/MTG) reflect the building of sentence structures in L3/L4s.

The present study is a sequel to our previous Kazakh experiments. We further elucidate the neural processes involved in successfully acquiring construction-dependent grammatical features. Both Kazakh and Japanese are agglutinative languages with a modifier-head (i.e., head-final) word order, and with a subject-object-verb (SOV) word order for declarative sentences¹³; the word orders of Kazakh sentences thus generally match those of the Japanese sentences. However, it is interesting to note that the participants in our previous and present studies reported no knowledge concerning the match in word orders; the participants were not informed about these linguistic facts during the experiments. On the other hand, subject-verb (SV) agreement (i.e., verb suffix in agreement with the person and number of the subject) is absent in Japanese¹⁴, but SV agreement is present in Kazakh, similar to English and Spanish. The participants were not informed of this syntactic difference either.

In order to understand the acquisition process described above, we used three step-wise Grammar conditions in our previous study to gradually familiarize participants with the syntactic structures in Kazakh: G1, G2 (after acquiring G1), and G3 (after acquiring G1 and G2) [see Supplementary Table S4 of Umejima et al. (2021) for sentence examples under G1-G3]. Those participants who could not reach criteria for each condition did not proceed to subsequent steps or levels. Under the G1 condition, we presented a conjoined sentence with al (“and”), consisting of [[N₁ V₁] al [N₂ V₂]]. We examined whether the SV agreement could be acquired for each of [N₁ V₁] and [N₂ V₂] structures (the same set of indices defines an SV pair). Under the G2 condition, we presented a nested sentence made up of two simple sentences with dep (“that”). We examined the construction of [N₁ [N₂ V₂] dep V₁], where SV agreement is applied to each of the [N₁ V₁] and [N₂ V₂] pairs, just as in G1. Under the G3 condition, we presented a sentence involving a relative clause with kezde (“when” or a locative form of “time”). We examined the construction of [[N₂ V₂] kezde N₁ V₁] or [N₁ V₁, [N₂ V₂] kezde]. In Kazakh, the suffix on an adjectival participle (V₂ in this case) is always fixed regardless of the person and number of the corresponding subject (N₂). In other words, SV agreement holds for the [N₁ V₁] pair for the main clause, but not for the [N₂ V₂] pair for the relative clause under the G3 condition. The participants, who were highly proficient and successful in passing the G1-G3 conditions in the previous study, proceeded onto the next G4 condition described here in the present study (see Supplementary Table S1).

Under the G4 condition, we tested sentences that consisted of a main clause and a subordinate relative clause, including nouns as objects (basically marked with the suffix -di/-dï in Kazakh). Each of the stimulus sentences in the G4 condition had either adamdï (an accusative form of man) or adam (a nominative form of man). For example, an adjectival participle with adamdï builds the structure of “N₁ [N V₂ adamdï] V₁” (English example: “We (N₁) recognized (V₁) [a man who knew (V₂) John (N)]”). An N without an index represents an object hereafter. With adam, for example, the structure of “[N₂ V₂ adam] N V₁” is constructed (English example: “[A man whom John (N₂) knew (V₂)] recognized (V₁) us (N)”). Just as in the G3 condition, SV agreement is mandatory for the [N₁ V₁] pair in the main clause, but not for the [N₂ V₂] pair in the relative clause.

For each sentence used in the G4 condition, its syntactic structure is primarily determined by a two-by-two factorial design (Fig. 1): the head position (either Object or Subject) in the main clause, and the gap position (either Subject or Object) in the relative clause. In total, four types of sentence structures were presented to the participants in randomized order, which in turn made the G4 condition more demanding than the G1-G3 conditions. With regards to the CEM, it is of great interest concerning whether the common computational system, underlying the acquisition of any language specific grammar, can be shown to be critically involved under such demanding grammatical conditions as those exemplified in the G4 condition.

In English, a head-initial language, an object relative clause with the head “the man” and the gap indicated by an empty category (e), such as (i) “the man [whom John knew e],” has the meaning of “John knew the man.” Brain imaging studies using English sentences with relative clauses have indicated that such object relatives carry increased loads for grammatical processing, i.e., higher parsing loads, than subject relatives, like (ii) “the man [who e knew John]”^15,16. This increased syntactic load for the object relatives has been explained in terms of the surface structure “distance” between the head and the gap, which are both structurally and linearly farther apart in object relatives [see (i)] than in subject relatives [see (ii)]. In head-final languages of both Kazakh and Japanese, the distance between the head and gap becomes structurally farther apart (i.e., the gap being more deeply embedded), although linearly closer, in object relatives (gap position: Object) than it is in subject relatives (gap position: Subject); compare the red zigzagging arrows with the straight ones in Fig. 1. Example sentences with object relative clauses are shown as (3) and (5) in Table 1; those with subject relative clauses are shown as (1) and (7). The structural account for the higher syntactic load required for object relative clauses has been experimentally confirmed for Japanese¹⁷.

Table 1 Sentence structures under four construct conditions.

Full size table

The structural distance between adam and the main verb (V₁) as defined by an underlying tree structure, as well as the linear distance in the surface structure of the sentence, is greater than that between adamdï and V₁. Therefore, the former (head position: Subject) is hypothesized to involve a higher syntactic load than the latter (head position: Object; see Fig. 1). Example sentences with adamdï are shown as (1) and (3) in Table 1; those with adam are shown as (5) and (7). Combining these loads together, the Subject-Object [SO; see (5)] construction with adam (head position: Subject) and an object relative clause (gap position: Object) presents the learner with potentially the highest syntactic load among the four constructions investigated. In contrast, the Object-Subject [OS; see (1)] construction with adamdï (head position: Object) and a subject relative clause (gap position: Subject) presents the learner with the lowest syntactic load. If the OS and SO constructions (see the main diagonal of the two-by-two matrix in Fig. 1) are accurately distinguished from the other constructions, then we can reasonably assume that the learner accumulated linguistic knowledge regarding the head and gap positions.

It should be noted that non-linguistic factors other than these syntactic loads, may affect task difficulty, along with constraints on short-term memory loads (including “working memory”). If so, this could reduce the accuracy rates and increase the response times (RTs) as well. It is well-known that the number of “distractors” (non-targets) influences the processing of a serial search¹⁸. The Subject-Subject [SS; see (7) in Table 1] construction was the most difficult to cope with regarding the tasks themselves, because there were two direct objects (the nouns shown in gray in Fig. 1), which were distractors in the tasks that involved the correct identification of subject-verb pairs (see below). In contrast, the Object-Object [OO; see (3)] construction was the easiest to cope with as there was no such distractor. The OS and SO constructions, in which there was one distractor, presented a potentially intermediate level of difficulty between the SS and OO constructions for the participants.

With respect to the relationship between L1/L2 acquisition and consequent brain activations, we had suggested earlier the possibility that “cortical activations increase initially at the onset of acquisition, followed by the maintenance of the activations and then [followed by] a fall in activations during consolidation of linguistic competence”¹⁰. These multiphase changes are associated with the initial, intermediate, and final stages of grammar acquisition, respectively. Multiphase changes might occur rapidly, because we observed dynamic changes in the activations for multilinguals during the time course throughout the G1-G3 conditions⁶. In the present study, we focused on the initial to intermediate stages of grammar/language acquisition, in which cortical activations should increase, for learners acquiring new syntactic knowledge in the G4 condition. Recall that this condition utilizes sentence stimuli that consisted of OS, OO, SO, and SS sentence constructions (denoted hereafter as construct conditions), completely mixed as in a natural language acquisition setting.

Experimental design

In the present study, we essentially followed the design of our previous study⁶. In the design slightly modified here, we alternated between eight demo and eight task trials, in order that linguistic knowledge acquired during the demo trials would be tested in the subsequent task trials. We did not provide any explicit information about syntactic structures or rules of the grammar in Kazakh, but instead presented visual signs (either + or −) during the demo trials, where each sign indicated the status of a sentence: its grammaticality and SV correspondence. For each sentence with main and relative clause structures, three nouns and two verbs were presented, controlling for length of the sentence and number of syllables (nouns: 1–3 syllables, verbs: 2–4 syllables; see Supplementary Methods, The Kazakh vocabulary used in this study). With regards to the words used in the G4 condition, the participants were already familiar with these words from the previous G1-G3 experimental trials.

Before the magnetic resonance (MR) scanning was initiated, the participants were given an instruction sheet (written in Japanese) stating that, “The following examples are English translations of sentences that you will hear. There are four sentence types; each will be presented one at a time. Please note that there are objects in the sentence structure. In all the sentences, the third-person nouns he, John, Dan, or man represent different persons [thus avoiding ambiguous coreference between nouns in a sentence, but without affecting the syntactic structures].

Example 1: The man, whom you understood, knew John,
Example 2: The man, who knew Dan, recognized him,
Example 3: We understood the man, whom John knew, and
Example 4: Dan recognized the man, who knew you”.

In the demo trials, each participant using headphones heard a sentence (“Sentence” capitalized here), and then a noun–verb pair (“NV pair”) extracted from the Sentence of the same trial (Fig. 2a). The noun in an NV pair was optionally a direct object in the Sentence; the noun and verb in an NV pair were presented without any inflectional suffixes. For each stimulus Sentence and NV pair, a visual sign (either + or −) was simultaneously presented on the video goggles used by each participant. The + / − sign that appeared on the goggle screen and was associated with each Sentence indicated whether the sentence was grammatical (+) or ungrammatical (−); an ungrammatical sentence always included an error in the verb suffix. The + / − sign associated with each NV pair indicated whether the pair matched (+) or did not match (−) the SV pairs in the sentence structure.

In the task trials, both + and − signs were presented (Fig. 2b), and the participants chose one for the Sentence, then chose another one for the NV pair (see above). These tasks have been named the grammaticality task (GR task) and the subject-verb task (SV task), respectively. While the GR task required participants to make a grammatical judgment about the Sentence, the SV task required participants to judge whether the NV pair was matched with one of the two SV pairs in the sentence structure or not, where the participants should identify an SV pair in each of the main and relative clauses. The SV task further required a syntactic analysis of the abstract empty category (see Fig. 1).

We observed a bimodal distribution for the accuracy rates in the SV task with the transition point at 60% between the two peaks under each of the OS and SO conditions (see Supplementary Figure S1). On the basis of these results, we set criteria for this experiment, such that the participants had to reach accuracy rates higher than 60% in the SV task for both the OS and SO conditions (see the above explanation for the main diagonal in Fig. 1). We separated the participants into two groups: Group I, consisting of those who had reached the criterial level, and Group II, consisting of those who did not. These two groups were formed from all of right-handed participants who had reached G4 from the previous experiment of G1-G3. This division between the two groups is no longer based on bilinguals versus multilinguals (see Supplementary Table S1), but on proficiency levels in the new language.

During the presentation of the Sentence to the participants, the processes at the lexical level involved discrimination of the nominative form of the nouns (e.g., adam and John) from the accusative form (e.g., adamdï and Johndï), as well as discrimination of the verb suffixes necessary for SV agreement. Syntactic processes were also critically involved in constructing phrase-level structures where the construction also involved the integration of both phonological and semantic information. During the presentation of the NV pair, in contrast, identification of an SV pair was required in each of the main and relative clauses. Focusing on the fMRI activations during either the Sentence or NV pair event, common and specific syntactic processes should be revealed.

According to the CEM hypothesis, if learners had more experience with their L2/L3s, higher proficiency levels would be evident in the L4 in the overall group effects, overriding individual differences. The participants with less experience in their L2/L3s would eventually become as proficient in the L4 as those participants who had sufficient experience in their L2/L3s. However, with inevitable differences in the length and/or depth of exposure to the L2/L3s, we would expect marked differences in the performances and activations in the L4. Likewise, different proficiency levels in the L4 between Groups I and II (the “who”) may also reflect differences in exposure to the L2/L3s as well, consistent with the CEM. Among the multi-stages involved in the acquisition of a new grammar, it is also necessary to clarify whether the initial, intermediate, and final stages (the “when”) are relevant for the CEM. By comparing the associated brain activations between Groups I and II, as well as between the initial and intermediate stages (i.e., initial and final phases, respectively, in our testing), we hypothesize that enhanced activations will be observed in the most crucial region among the syntax-related networks^19,20.

Results

Overall proficiency improvement in Kazakh

There were large differences among the participants with respect to improving their proficiency levels in Kazakh; this made the number of task blocks for the participants variable, depending on how well they performed on the tasks (see the Tasks section). We divided the task blocks for each participant into four phases as equally as possible. If there were five blocks, for example, the four phases consisted of 1, 1, 1, and 2 blocks, with more blocks for the latter phases. We then averaged the accuracy rates for each quarter among all the participants (combining Groups I and II).

We evaluated language proficiency levels in the L2 and L3 using the Listening Comprehension sub-test of the Avant STAMP 4S (Standards-based Measurement of Proficiency—4 Skills; Avant Assessment, Eugene, OR, USA), as the scores of 1–9 [Novice (1–3), Intermediate (4–6), and Advanced (7–9)] (Supplementary Table S1). With respect to the fourth quarter, i.e., the final phase in our testing, Avant scores in the L2 and L3 were significantly correlated with accuracy rates in the GR task (Spearman’s correlation test, r_s = 0.53, p = 0.02; Supplementary Fig. S2). This result directly supports the CEM hypothesis (see the Introduction), regardless of large individual differences in proficiency levels, in that the more proficient the bilinguals and multilinguals were in their L2/L3s, the higher their performance became in their L4.

With respect to the GR task, the accuracy rates for all participants steadily increased throughout the block quarters from chance level at 50% to 70% under the OS condition (Fig. 3a, left). Here, we regarded OS as a reference for comparison for the construct conditions (gray bars in Fig. 3). A one-way repeated-measures analysis of variance (rANOVA) indicated a significant main effect of the quarters (F(3, 90) = 6.9, p = 0.0003), and paired t-tests indicated significantly higher rates during the fourth quarter than the first quarter (t(30) = 4.8, p < 0.0001). With respect to the fourth quarter, the accuracy rates in the GR task reached 60–70% under all the construct conditions (Fig. 3a, right). These rates were significantly above chance level (one-sample t-tests, p < 0.01, Holm corrected), confirming the above criterial level of 60% for distinguishing Groups I and II. An rANOVA did not indicate a significant difference among the four conditions (F(3, 90) = 1.0, p = 0.4).

With regards to the SV task, the accuracy rates for all participants also indicated an increase throughout the quarters under the OS condition (Fig. 3b, left). An rANOVA indicated a significant main effect of the quarters (F(3, 90) = 6.3, p = 0.0006), and paired t-tests indicated significantly higher rates during the fourth quarter than the first quarter (t(30) = 4.3, p = 0.0002). During the fourth quarter, the rates reached 60% except under the SS condition (Fig. 3b, right). An rANOVA indicated a significant main effect of the conditions (F(3, 90) = 9.1, p < 0.0001), and the rate under the SS condition was significantly lower than that under the OS condition (t(30) = 4.8, p < 0.0001). Recall that SS was hypothesized to be the most difficult condition (see the Introduction). The RTs were comparable for all the construct conditions for each task (rANOVA, p > 0.1; Fig. 3c). The accuracy rates in the SV task were most sensitive to reveal performance differences among the construct conditions.

In order to obtain a more robust estimation of performances in both tasks, we employed “the signal detection theory,” which is generally used in order to discriminate the distribution of a signal source that has noise from the distribution of a noise source alone²¹. In doing this, we obtained d′-values as a Z-value of the “hit” rate (i.e., correct detection of ungrammatical and mismatched stimuli in our study) minus that of the “false-alarm” rate (i.e., incorrect responses to grammatical and matched stimuli). In order to examine any significant deviation from chance level (d′ = 0), we estimated variances of d′-values²². Although d′-values in the GR task were consistent with the accuracy rates (see Fig. 3a), d′-values in the SV task were significant under the OS and OO conditions alone (p < 0.05, Holm corrected for each task). This result was due to the larger variance, i.e., larger individual differences (Fig. 3d). Given that the criterial level noted above (see the Experimental design section) was not met by some participants, we divided the participants into Groups I and II according to this criteria during the fourth quarter.

Group differences in proficiency levels

Consistent with the results for all participants (see Fig. 3b), the accuracy rates for Group I significantly increased from the first to fourth quarters in the SV task under the OS and SO conditions (Fig. 3e), as well as under the OO condition (p < 0.05, Holm corrected). In contrast, for Group II, the accuracy rates did not significantly change from the first to fourth quarters under the OS or SO condition (Fig. 3f), nor under the OO or SS condition (p > 0.05). The accuracy rates under the SO condition were above 60% during the first quarter. However, this was an exception for Group II, and this tendency immediately dropped to chance level after the second quarter. We confirmed that the actual numbers for the task blocks were comparable for Groups I and II (t(29) = 0.6, p = 0.5). Considering the notable progress for Group I, we focused on the fourth quarter for subsequent analyses.

For Group I, the d′-values in the GR task were significantly above value 0 under all four conditions, and the d′-values in the SV task were significant under the OS, OO, and SO conditions (p < 0.05, Holm corrected for each task; Fig. 3g). In contrast, for Group II, the d′-values under none of the four conditions in the GR or SV task were significantly different from chance level (p > 0.05; Fig. 3h). These results confirmed successful building of the sentence structures under the OS, OO, and SO conditions by the Group I participants.

Event-related group differences in brain activations

To obtain the overall activation patterns for the entire brain, we compared activations during the “Sentence” event with those during the “Lexical list” event (a list of five words; see Fig. 2b). The Lexical list controlled auditory recognition of individual words used in the stimulus Sentences, as well as for lexico-semantic processing. For all participants, the [Sentence − Lexical list] contrast indicated consistent results under all four construct conditions, revealing bilateral activations in the LPMC, dorsal IFG, insula, superior/middle/inferior temporal gyri (STG/MTG/ITG), angular/supramarginal gyri (AG/SMG), and cerebellum VI/Crus I (Supplementary Fig. S3a, for OS and SO). Medial activations were also observed in the supplementary motor area (SMA), anterior cingulate cortex (ACC), basal ganglia, thalamus, precuneus, and calcarine/LG. When Groups I and II were analyzed separately (Fig. 4a), the overall activation patterns for both groups were similar to those for all participants, but the extent of the significant activations was more restricted for Group II in the bilateral LPMC, dorsal IFG, insula, STG/MTG, SMG, and cerebellum, as well as in the medial SMA/ACC, basal ganglia, thalamus, precuneus, and calcarine/LG.

Next, we focused on the NV pair events prior to the SV responses (see Fig. 2b). The extent of activations during the NV pair events was narrowed down in the [NV pair − Lexical list] contrast from those in the [Sentence − Lexical list] contrast for all participants (Supplementary Fig. S3b). When Groups I and II were analyzed separately (Fig. 4b), the overall activation patterns for Group I were similar to those for all participants. We observed activations in the bilateral LPMC, dorsal left IFG, bilateral STG/MTG, and L. AG/SMG, as well as in the medial SMA/ACC and thalamus (more than 20 voxels) for Group I (Fig. 4b, for OS and SO; see Table 2 for the list of activated regions under OS). The overall activation patterns were also similar for Group II, but the extent of significant activations was more restricted (see Table 2). These results for both groups confirmed the involvement of language areas and supporting networks during syntactic, semantic, and phonological processes.

Table 2 Regions with significant activations related to groups, phases, and conditions.

Full size table

Following qualitative comparisons between the two groups, we performed a direct group comparison, i.e., directly obtaining the functional map of the [Group I − Group II] contrast. We focused on the Sentence events, averaged among the four conditions. Significant activations were observed in the dorsal L. IFG and bilateral STG/MTG (Fig. 4c, left), as well as in the medial precuneus. These regions were subsections of the activated regions for Group I in the [Sentence − Lexical list] contrast (see Fig. 4a).

Given that the participants in Group I performed better in the OS, OO, and SO conditions than in the SS condition (see Fig. 3g), we next focused on the former three conditions. We repeated the same group comparison and observed more localized activations in the dorsal L. IFG and L. STG/MTG alone (Fig. 4c, right; Table 2).

Condition-specific temporal activation changes

Following the direct group comparisons shown above, we next focused on Group I. To determine which of the above-mentioned regions were critical for the final phase in our testing, we directly compared the activations between the initial and final phases, i.e., the [4th quarter − 1st quarter] contrast. We focused on the OS, OO, and SO conditions in the [NV pair − Lexical list] contrast. We observed focal activation in the dorsal L. IFG alone (Fig. 4d, Table 2), indicating temporal activation changes occurring continuously from the initial to final phases.

We further examined the activations specific to the successful construct conditions. For the contrast [(OS + OO + SO) − SS], focal activation was observed also in the dorsal L. IFG (Fig. 4e, Table 2). This region mainly consisted of Brodmann’s areas (BA) 44/45, and included more BA 45 than the region shown in Fig. 4d. With regards to the three regions of the dorsal L. IFG, observed in the separate analyses (see Fig. 4c–e), we confirmed an overlap of eight significant voxels among these clusters. For Group II, neither of the [4th quarter − 1st quarter] or [(OS + OO + SO) − SS] contrasts showed any significant activation. These results further confirmed the central role of the dorsal L. IFG in successful structure building processes.

We also conducted two-way [groups × quarters] analyses of covariance (rANCOVAs) under the OS, OO, and SO conditions. We observed focal activations in the dorsal L. IFG for the main effect of quarters in the [NV pair − Lexical list] contrast (Supplementary Fig. S3c), replicating the results of Group I (Fig. 4d) for all participants. Moreover, we also found a significant interaction of groups by quarters in the [Sentence − Lexical list] contrast (Supplementary Fig. S3d).

Brain activations related to the subsequent task performance

For all the participants, we conducted additional region of interest (ROI) analyses for the cluster of the dorsal L. IFG activations, identified by the [(OS + OO + SO) − SS] contrast in Fig. 4e. We focused on the signal changes in the [Sentence − Lexical list] contrast, in order to examine whether those reliably enhanced activations affected the subsequent processes required by a grammatical judgment about the Sentence, and by the correct identification of the SV pairs. Averaged among the four conditions, significantly positive correlations were observed between accuracy rates and dorsal L. IFG activations for the GR (Fig. 5a) and SV (Fig. 5b) tasks (both, r = 0.51, p = 0.003).

We also focused on the OS condition for Group I, and observed significantly negative correlations between RTs and dorsal L. IFG activations for the GR (Fig. 5c) and SV (Fig. 5d) tasks (GR task: r = − 0.50, p < 0.05; SV task: r = − 0.56, p = 0.02). These results demonstrate that higher signal changes in the dorsal L. IFG measured during the Sentence events actually predicted higher accuracy rates and shorter RTs for the subsequent experimental tasks.

Discussion

In our previous study⁶, we examined the initial acquisition of Kazakh sentences under three step-wise grammar conditions with distinct sentence structures: a conjoined sentence, a nested sentence, and a sentence involving a relative clause. As a next step in the present study, participants were presented with sentences that consisted of main and relative clauses that both included nouns as objects. The inclusion of objects was newly introduced in this phase of the design, which made the tested conditions very demanding. Moreover, the four types of sentence structures (OS, OO, SO, and SS; Fig. 1) were presented to the participants in a completely randomized order. By simply alternating demo and task trials (Fig. 2), we were able to test participants on their abilities to build sentence structures in a new language without the explicit teaching of grammatical rules.

For the analyses with respect to the behavioral and functional data, we divided the participants into Groups I and II based on the levels attained on the subject-verb matching task. We obtained the following three results. First, consistent with successful building of complex sentence structures under the OS, OO, and SO conditions for Group I (Fig. 3g), but not for Group II (Fig. 3h), the contrast of [Group I − Group II], which examined those participants who parsed the structures, indicated that the dorsal L. IFG and L. STG/MTG, the core language areas, were significantly activated under the OS, OO, and SO conditions (Fig. 4c). Secondly, focusing on Group I, the contrast of [4th quarter − 1st quarter], which examined when the structures were parsed (Fig. 4d), as well as that of [(OS + OO + SO) − SS], which examined what structures were parsed (Fig. 4e), additionally demonstrated focal activations in the dorsal L. IFG alone. Thirdly, among the individual participants, stronger activation in the dorsal L. IFG, measured during the Sentence events, predicted higher accuracy rates and shorter RTs for the execution of each of the tasks that followed (Fig. 5). These results cannot be explained by task difficulty or memory loads, and they, instead, indicate a critical and consistent role of the dorsal L. IFG during the initial to intermediate stages of grammar acquisition in a new target language. Such functional specificity of the dorsal L. IFG, i.e., the grammar center, provides neuroscientific evidence consistent with the claims made by the CEM in investigating language acquisition beyond target L2/L3s.

The results observed with respect to “what structures were parsed” support the claim that the dorsal L. IFG played an essential role in syntactic processing as used for successful building of sentence structures in a new target language. While left ventral BA 44 has been associated with syntactic structure building²³, left dorsal BA 44 and the inferior frontal sulcus have been suggested to be linked to memory loads^24,25. In the present study, however, we demonstrated that activations in the dorsal L. IFG were free from task difficulty or memory loads, because larger activations were observed in the participants indicating less difficulty in both GR and SV tasks (Fig. 5). Moreover, when the dorsal L. IFG, together with the L. LPMC, was damaged by a glioma, we have already reported clear evidence of agrammatic comprehension^19,26. In those studies, we used a picture-sentence matching task that involved no memory load. It should be also noted that there were wide individual variations in the extent of BA 44, as reported in a previous anatomical study, which stated that “the volumes of area 44 differed across subjects by up to a factor of 10”²⁷. Therefore, we did not separate the dorsal L. IFG into BAs 44, 45 and 6. The eight voxels overlapped among the three clusters (Fig. 4c–e) were located in BAs 44/6.

Regarding the “who,” the overall activations were weaker and spatially more restricted for Group II than for Group I. In Group II, the proficiency improvement from the initial phase was absent during the experimental testing, while it was present in Group I (Fig. 3e–h). Moreover, brain activations reflected individual differences in proficiency levels in terms of the two groups (Fig. 4a–c). This might represent a prior developmental phase for those participants in Group II or might be a developmental “delay” in comparison to Group I, although the explanation for these hypotheses is beyond the scope of this paper. Another possibility is that the significant group differences may originate from difficulty in the processing of the phonology/phonetics and formal semantics simultaneously. This possibility is supported by the bilateral STG/MTG activations in the direct group comparison (Fig. 4c), which were stronger in the L. STG/MTG (included in Wernicke’s area), the core region for phonological processes¹⁰. However, activations in the dorsal L. IFG (included in Broca’s area), the core region for syntactic processes, also indicate that the crucial factor was constructing phrase-level structures, substantiating “the basic property of language”²⁸. The constraints and mechanisms involved in language acquisition, accompanied by substantial individual differences, require further elucidation.

We observed bilateral activations in the LPMC during the Sentence events, which were stronger for Group I than for Group II (Fig. 4a). Moreover, more localized activations in the L. LPMC were evident during the NV pair events, also stronger for Group I (Fig. 4b). Our previous fMRI studies on grammatical judgments have consistently reported activations in the dorsal L. IFG and L. LPMC^{6,10,19,29,30,31}. Our recent fMRI study explicitly tested subject-predicate correspondence for sentences in L1, and revealed critical activations in the bilateral LPMCs for processing dependencies solely determined by hierarchical structures, when compared with those based on linear sequences of words³². These findings provide additional support for the results of other neuroimaging experiments^7,33,34,35.

A number of case studies on aphasic patients have commonly and simplistically identified the crucial roles of Wernicke’s area as pertaining to input/comprehension alone, and those of Broca’s area as output/production alone³⁶. However, if we assume that these regions are responsible for not only loss but also acquisition of a language specific grammar, in line with the present study, it is necessary to revise the classical notions of Wernicke’s and Broca’s areas with respect to language processing³⁷. Moreover, neuroimaging studies have identified the dorsal L. IFG as a core hub for the computation of linguistic information for both signed and verbal languages—each of which uses a different modality for externalization^38,39,40. Given the clear distinction between the core language system and external sensory-motor systems²⁸, we conclude that the core system of the dorsal L. IFG is independent from both sensory input and motor output.

In line with proposals that the same principles constrain L1 and L2 acquisition⁵, as well as for L3⁴¹ and subsequent L4, …, Ln, we predicted that the dorsal L. IFG, the grammar center, becomes functional during acquisition of a language-specific grammar for any new language, suggesting an essential and universal property about human linguistic capacity, which enables both unlimited acquisition and use of multiple languages. To conclude, the current neuroscientific evidence for a grammar acquisition of an L4, together with the CEM which is hypothesized to account for (i.e., the “why”) the mechanisms that universally underlie language acquisition, provides further essential insight critical for clarification concerning what is involved in successful building of sentence structures in a new language.

Materials and methods

For more details, see the Supplementary Methods.

Participants

Volunteers, who were native speakers of Japanese, were recruited from multiple sources for this study; these included the LEX Institute (Hippo Family Club), the University of Tokyo, and Sophia University. Thirty-three participants, in total, met the criteria set for the G1-G3 conditions (as described in the Introduction) and reached G4. Right-handedness was estimated as a laterality quotient (LQ) according to the Edinburgh inventory⁴². Because of their left-handedness (i.e., negative LQ), two participants were eliminated from the analyses. We divided the resultant 31 participants into two groups (see the Introduction for the criteria): Group I [16 participants; nine multilinguals and seven bilinguals] and Group II [15 participants; eight multilinguals and seven bilinguals] (see Supplementary Table S2). There was no group difference in duration of exposure (DOE) to English, Avant score (i.e., language proficiency level) in English, and LQ (p > 0.2). The mean age was significantly lower for Group II (t(29) = 2.3, p = 0.03). Note that Groups I and II included one and five participants under the age of 19, respectively; for those participants above age 19, age was not significantly different (t(23) = 1.7, p = 0.1). Age was thus used as a nuisance factor in the activation analyses (see Supplementary Methods, fMRI data analyses). None of the participants in the study had neurological or psychiatric disorders.

Prior to their participation in the study, the nature and possible consequences of the study were explained to each participant and written informed consent was obtained immediately after this introduction. Approval for the experiments was obtained from the ethical review board of experimental studies on human subjects at Graduate School of Arts and Sciences, the University of Tokyo (No. 464). All research was performed in accordance with the Declaration of Helsinki, Singapore Statement on Research Integrity, and relevant guidelines/regulations in Japan (Science Council of Japan, and Japan Society for the Promotion of Science). This clinical trial has been registered in a publicly accessible primary register at Japan Registry of Clinical Trials (jRCT) on 25/12/2020 (No. jRCT1030200294).

Stimuli

Auditory stimuli in Kazakh consisted of 76 sentences (44 grammatical and 32 ungrammatical) with a limited number of lexical items shown in Supplementary Table S3. The stimulus sentences were recorded by a male native speaker of Kazakh, and individual words were also separately recorded. Both grammatical and ungrammatical sentences were articulated at a somewhat slower pace for the participants, who were not familiar with Kazakh. By using the Wavelab 8 software (Steinberg Media Technologies GmbH, Hamburg, Germany), we digitized the stimuli (16 bit, 44.1 kHz, stereo), where the maximum volume of each stimulus was equally set to − 1 dBFS. The duration of all the stimulus sentences was adjusted to 4.75 s (see Fig. 2b), maintaining the original pitch of the sentences. As a lexical reference, five words (“Lexical list”) used in the sentence were presented auditorily at the beginning of each task trial; individual words translated into English were visually presented (see Fig. 2b). The English translations provided a clue to meanings and parts of speech for the Kazakh words. During the MR scans, the participants wore a set of MRI-compatible headphones (Resonance Technology Inc., Northridge, CA), a pair of earmuffs (3 M Peltor, St. Paul, MN), and a pair of earplugs (Earasers, Persona Medical, Casselberry, FL) to reduce the high-frequency noises (> 1 kHz) of the scanner.

Tasks

Each of demo and task blocks consisted of two trials each for four construct conditions (OS, OO, SO, and SS) in a randomized order. We used four trials per demo block for the G1-G3 conditions but increased the number of trials to eight for the most demanding G4 condition. Different sets of sentences were used for the demo and task blocks to avoid simple memorization of the stimuli sentences by the participants. Although the participants were inside the scanner, we conducted a demo block when there was no scanning being conducted; during this exposure period, the auditory stimuli were presented without the loud noise involved in MR scanning. In a demo block, the first two sentences were always grammatical, and the remaining six sentences consisted of four grammatical and two ungrammatical sentences presented in a randomized order. In a task block when there was scanning, both grammatical and ungrammatical sentences were completely randomized. Regarding the NV pairs in the demo block, the first two pairs were always matched; a mismatched pair never followed an ungrammatical sentence. Otherwise, matched, and mismatched pairs were randomized. In a task block, both matched and mismatched pairs were completely randomized.

During the experiments, we used the same criteria as those used for G1-G3 for the G4 condition, such that each participant correctly performed at least six out of the eight task trials in each of two blocks (not necessarily consecutive) for both the GR and SV tasks. With one or two days of experimentation, 15 out of the 31 participants reached these criteria. Among the remaining 16, five out of ten participants who correctly performed at least six out of the eight task trials in one block for both the GR and SV tasks were further tested on another day. Three out of those five participants reached the criteria on the third day. In the end, there were between 5 and 28 task blocks (i.e., 40–224 task trials) depending on the participant, and thus between 2 and 7 task blocks for the fourth quarter. After each block of task trials, the participants were informed of the number of their correct responses (e.g., 6 out of 8) separately for the GR and SV tasks. In each scanning run, we added five task trials under the Words condition (see Supplementary Methods, Tasks).

MRI data acquisition and analyses

The MRI scans were conducted in a 3.0 T scanner (Signa HDxt; GE Healthcare, Milwaukee, WI) with a bird-cage head coil. Each participant was in a supine position, and his or her head was immobilized inside the coil. With respect to the structural images, high-resolution T1-weighted images of the whole brain [136 axial slices, 1 × 1 × 1 mm³] were acquired with a three-dimensional fast spoiled gradient-echo (3D FSPGR) acquisition [repetition time (TR) = 8.6 ms, echo time (TE) = 2.6 ms, flip angle (FA) = 25°, field of view (FOV) = 256 × 256 mm²]. With respect to the fMRI time-series data we used a gradient-echo echo-planar imaging (EPI) sequence [TR = 2 s, TE = 30 ms, FA = 78°, FOV = 192 × 192 mm², resolution = 3 × 3 mm²]. We scanned a set of 30 axial slices that were 3-mm thick with a 0.5-mm gap, covering the range of −38.5 to 66 mm from the line of the anterior commissure to posterior commissure (AC-PC). In a single scanning session, we obtained 145 volumes, and dropped the initial four volumes from analyses due to MR signal increases. The fMRI data were analyzed in a standard manner using SPM12 statistical parametric mapping software (Wellcome Trust Center for Neuroimaging, http://www.fil.ion.ucl.ac.uk/spm)⁴³ implemented on MATLAB (Math Works, Natick, MA). For the fMRI data analyses, we used all trials including both correctly and incorrectly answered trials in order that we would be able to examine the activations that reflected accuracy rates directly for the tasks (see Fig. 5a, b); all conditions tested were equally weighted regarding the number of trials administered. See the Supplementary Methods for details.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Flynn, S., Foley, C. & Vinnitskaya, I. The cumulative-enhancement model for language acquisition: Comparing adults’ and children’s patterns of development in first, second and third language acquisition of relative clauses. Int. J. Multiling. 1, 3–16. https://doi.org/10.1080/14790710408668175 (2004).
Article Google Scholar
Flynn, S. & Berkes, É. Toward a new understanding of syntactic CLI: Evidence from L2 and L3 acquisition. In L3 Syntactic Transfer: Models New Developments and Implications Bilingual Processing and Acquisition (eds Angelovska, T. & Hahn, A.) 35–61 (John Benjamins, 2017).
Chapter Google Scholar
Flynn, S. Microvariation in multilingual situations: The importance of property-by-property acquisition: Pros and cons. Second Lang. Res. 37, 481–488. https://doi.org/10.1177/0267658320945761 (2021).
Article Google Scholar
Fernández-Berkes, É. & Flynn, S. Vindicating the need for a principled theory of language acquisition. Linguist. Approaches Biling. 11, 30–36. https://doi.org/10.1075/lab.20095.fer (2021).
Article Google Scholar
Epstein, S. D., Flynn, S. & Martohardjono, G. Second language acquisition: Theoretical and experimental issues in contemporary research. Behav. Brain Sci. 19, 677–714. https://doi.org/10.1017/S0140525X00043521 (1996).
Article Google Scholar
Umejima, K., Flynn, S. & Sakai, K. L. Enhanced activations in syntax-related regions for multilinguals while acquiring a new language. Sci. Rep. 11, 7296. https://doi.org/10.1038/s41598-021-86710-4 (2021).
Article CAS PubMed PubMed Central Google Scholar
Musso, M. et al. Broca’s area and the language instinct. Nat. Neurosci. 6, 774–781. https://doi.org/10.1038/nn1077 (2003).
Article CAS PubMed Google Scholar
Sakai, K. L., Miura, K., Narafu, N. & Muraishi, M. Correlated functional changes of the prefrontal cortex in twins induced by classroom education of second language. Cereb. Cortex 14, 1233–1239. https://doi.org/10.1093/cercor/bhh084 (2004).
Article PubMed Google Scholar
Tatsuno, Y. & Sakai, K. L. Language-related activations in the left prefrontal regions are differentially modulated by age, proficiency, and task demands. J. Neurosci. 25, 1637–1644. https://doi.org/10.1523/jneurosci.3978-04.2005 (2005).
Article CAS PubMed PubMed Central Google Scholar
Sakai, K. L. Language acquisition and brain development. Science 310, 815–819. https://doi.org/10.1126/science.1113530 (2005).
Article CAS PubMed ADS Google Scholar
Zaccarella, E., Schell, M. & Friederici, A. D. Reviewing the functional basis of the syntactic Merge mechanism for language: A coordinate-based activation likelihood estimation meta-analysis. Neurosci. Biobehav. Rev. 80, 646–656. https://doi.org/10.1016/j.neubiorev.2017.06.011 (2017).
Article PubMed Google Scholar
Matchin, W. & Hickok, G. The cortical organization of syntax. Cereb. Cortex 30, 1481–1498. https://doi.org/10.1093/cercor/bhz180 (2020).
Article PubMed Google Scholar
Muhamedowa, R. Kazakh: A Comprehensive Grammar (Routledge, 2016).
Google Scholar
Fukui, N. & Sakai, H. The visibility guideline for functional categories: Verb raising in Japanese and related issues. Lingua 113, 321–375. https://doi.org/10.1016/s0024-3841(02)00080-3 (2003).
Article Google Scholar
Caplan, D., Chen, E. & Waters, G. Task-dependent and task-independent neurovascular responses to syntactic processing. Cortex 44, 257–275. https://doi.org/10.1016/j.cortex.2006.06.005 (2008).
Article PubMed Google Scholar
Ohta, S., Fukui, N. & Sakai, K. L. Computational principles of syntax in the regions specialized for language: Integrating theoretical linguistics and functional neuroimaging. Front. Behav. Neurosci. 7(204), 1–13. https://doi.org/10.3389/fnbeh.2013.00204 (2013).
Article Google Scholar
Ueno, M. & Garnsey, S. M. An ERP study of the processing of subject and object relative clauses in Japanese. Lang. Cogn. Process. 23, 646–688. https://doi.org/10.1080/01690960701653501 (2008).
Article Google Scholar
Treisman, A. M. & Gelade, G. A feature-integration theory of attention. Cogn. Psychol. 12, 97–136. https://doi.org/10.1016/0010-0285(80)90005-5 (1980).
Article CAS PubMed Google Scholar
Kinno, R., Ohta, S., Muragaki, Y., Maruyama, T. & Sakai, K. L. Differential reorganization of three syntax-related networks induced by a left frontal glioma. Brain 137, 1193–1212. https://doi.org/10.1093/brain/awu013 (2014).
Article PubMed Google Scholar
Tanaka, K., Kinno, R., Muragaki, Y., Maruyama, T. & Sakai, K. L. Task-induced functional connectivity of the syntax-related networks for patients with a cortical glioma. Cereb. Cortex Commun. 1(tgaa061), 1–15. https://doi.org/10.1093/texcom/tgaa061 (2020).
Article Google Scholar
Peterson, W. W., Birdsall, T. G. & Fox, W. C. The theory of signal detectability. Trans. IRE Prof. Group Inf. Theory 4, 171–212. https://doi.org/10.1109/TIT.1954.1057460 (1954).
Article Google Scholar
Gourevitch, V. & Galanter, E. A significance test for one parameter isosensitivity functions. Psychometrika 32, 25–33. https://doi.org/10.1007/BF02289402 (1967).
Article CAS PubMed Google Scholar
Zaccarella, E. & Friederici, A. D. Merge in the human brain: A sub-region based functional investigation in the left pars opercularis. Front. Psychol. 6(1818), 1–9. https://doi.org/10.3389/fpsyg.2015.01818 (2015).
Article Google Scholar
Makuuchi, M., Bahlmann, J., Anwander, A. & Friederici, A. D. Segregating the core computational faculty of human language from working memory. Proc. Natl. Acad. Sci. USA 106, 8362–8367. https://doi.org/10.1073/pnas.0810928106 (2009).
Article PubMed PubMed Central ADS Google Scholar
Iwabuchi, T., Nakajima, Y. & Makuuchi, M. Neural architecture of human language: Hierarchical structure building is independent from working memory. Neuropsychologia 132(107137), 1–9. https://doi.org/10.1016/j.neuropsychologia.2019.107137 (2019).
Article Google Scholar
Kinno, R. et al. Agrammatic comprehension caused by a glioma in the left frontal cortex. Brain Lang. 110, 71–80. https://doi.org/10.1016/j.bandl.2009.05.001 (2009).
Article PubMed Google Scholar
Amunts, K. et al. Broca’s region revisited: Cytoarchitecture and intersubject variability. J. Comp. Neurol. 412, 319–341. https://doi.org/10.1002/(sici)1096-9861(19990920)412:2%3c319::aid-cne10%3e3.0.co;2-7 (1999).
Article CAS PubMed Google Scholar
Chomsky, N. Minimalism: Where are we now, and where can we hope to go. Gengo Kenkyu 160, 1–41. https://doi.org/10.11435/gengo.160.0_1 (2021).
Article Google Scholar
Kinno, R., Kawamura, M., Shioda, S. & Sakai, K. L. Neural correlates of noncanonical syntactic processing revealed by a picture-sentence matching task. Hum. Brain Mapp. 29, 1015–1027. https://doi.org/10.1002/hbm.20441 (2008).
Article PubMed Google Scholar
Ohta, S., Fukui, N. & Sakai, K. L. Syntactic computation in the human brain: The degree of merger as a key factor. PLOS ONE 8(e56230), 1–16. https://doi.org/10.1371/journal.pone.0056230 (2013).
Article CAS Google Scholar
Tanaka, K. et al. Merge-generability as the key concept of human language: Evidence from neuroscience. Front. Psychol. 10(2673), 1–16. https://doi.org/10.3389/fpsyg.2019.02673 (2019).
Article CAS Google Scholar
Umejima, K. et al. Differential networks for processing structural dependencies in human language: Linguistic capacity vs. memory-based ordering. Front. Psychol. https://doi.org/10.3389/fpsyg.2023.1153871 (2023).
Article Google Scholar
Caplan, D., Alpert, N. & Waters, G. PET studies of syntactic processing with auditory sentence presentation. NeuroImage 9, 343–351. https://doi.org/10.1006/nimg.1998.0412 (1999).
Article CAS PubMed Google Scholar
Price, C. J. The anatomy of language: Contributions from functional neuroimaging. J. Anat. 197, 335–359. https://doi.org/10.1046/j.1469-7580.2000.19730335.x (2000).
Article PubMed PubMed Central Google Scholar
Zaccarella, E., Meyer, L., Makuuchi, M. & Friederici, A. D. Building by syntax: The neural basis of minimal linguistic structures. Cereb. Cortex 27, 411–421. https://doi.org/10.1093/cercor/bhv234 (2017).
Article PubMed Google Scholar
Rutten, G.-J. Broca-Wernicke theories: A historical perspective. In Handbook of Clinical Neurology Vol. 185 (eds Hillis, A. E. & Fridriksson, J.) 25–34 (Elsevier, 2022).
Google Scholar
Friederici, A. D., Chomsky, N., Berwick, R. C., Moro, A. & Bolhuis, J. J. Language, mind and brain. Nat. Hum. Behav. 1, 713–722. https://doi.org/10.1038/s41562-017-0184-4 (2017).
Article PubMed Google Scholar
Sakai, K. L., Tatsuno, Y., Suzuki, K., Kimura, H. & Ichida, Y. Sign and speech: Amodal commonality in left hemisphere dominance for comprehension of sentences. Brain 128, 1407–1417. https://doi.org/10.1093/brain/awh465 (2005).
Article PubMed Google Scholar
Inubushi, T. & Sakai, K. L. Functional and anatomical correlates of word-, sentence-, and discourse-level integration in sign language. Front. Hum. Neurosci. 7(681), 1–13. https://doi.org/10.3389/fnhum.2013.00681 (2013).
Article Google Scholar
Trettenbrein, P. C., Papitto, G., Friederici, A. D. & Zaccarella, E. Functional neuroanatomy of language without speech: An ALE meta-analysis of sign language. Hum. Brain Mapp. 42, 699–712. https://doi.org/10.1002/hbm.25254 (2021).
Article PubMed Google Scholar
Cenoz, J. The additive effect of bilingualism on third language acquisition: A review. Int. J. Biling. 7, 71–87. https://doi.org/10.1177/13670069030070010501 (2003).
Article Google Scholar
Oldfield, R. C. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia 9, 97–113. https://doi.org/10.1016/0028-3932(71)90067-4 (1971).
Article CAS PubMed Google Scholar
Friston, K. J. et al. Statistical parametric maps in functional imaging: A general linear approach. Hum. Brain Mapp. 2, 189–210. https://doi.org/10.1002/hbm.460020402 (1995).
Article Google Scholar

Download references

Acknowledgements

We would like to thank Elizabeth White from LEX Language Project (https://www.lexlrf.org/), as well as Kenshi Suzuki, Kazutake Hiraoka, and Genya Hiraoka from the LEX Institute (https://www.lexhippo.gr.jp/english/), for coordinating the Multilingual-Brain project, Aidos Sultankulov for contributions in Kazakh, Atora Yamada for contributions in experimental designs and MR scanning, Run Chen for contributions in language background assessments, Naoko Komoro for technical assistance, and Hiromi Matsuda for administrative assistance.

Funding

The authors declare that this study received funding from the LEX Institute. The funder was not involved in any aspect of the study: the study design, data collection, analysis or interpretation of data, the writing of this article, or the decision to submit it for publication. This research was also supported by the Grant-in-Aid for Challenging Research (Pioneering) (Nos. 21K18115) from the Ministry of Education, Culture, Sports, Science, and Technology of Japan.

Author information

Authors and Affiliations

Department of Basic Science, Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro-Ku, Tokyo, 153-8902, Japan
Keita Umejima & Kuniyoshi L. Sakai
Department of Linguistics and Philosophy, Massachusetts Institute of Technology, 77 Massachusetts Avenue, 32-D808, Cambridge, MA, 02139, USA
Suzanne Flynn

Authors

Keita Umejima
View author publications
You can also search for this author in PubMed Google Scholar
Suzanne Flynn
View author publications
You can also search for this author in PubMed Google Scholar
Kuniyoshi L. Sakai
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.U., S.F., and K.L.S. planned the research; K.U. performed the experiments; K.U. and K.L.S. analyzed the data; K.U., S.F., and K.L.S. wrote the paper.

Corresponding author

Correspondence to Kuniyoshi L. Sakai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Umejima, K., Flynn, S. & Sakai, K.L. Enhanced activations in the dorsal inferior frontal gyrus specifying the who, when, and what for successful building of sentence structures in a new language. Sci Rep 14, 54 (2024). https://doi.org/10.1038/s41598-023-50896-6

Download citation

Received: 16 October 2023
Accepted: 27 December 2023
Published: 02 January 2024
DOI: https://doi.org/10.1038/s41598-023-50896-6

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.