A prime-masked ERP investigation on phonology in visual word processing among bilingual speakers of alphasyllabic and alphabetic orthographies

In this study, we experimentally manipulated the phonology of the cross-script prime-target dyads in an ERP-coupled masked priming paradigm to explore the role phonology plays in visual word processing. The written characters of certain bilingual dyads seldom show any visual/orthographic similarity, yet have the same phonological representation. While the Bilingual Interactive Activation (BIA) model relies on the orthographic similarity between the languages in a bilingual dyad, its revised version (BIA + model) additionally banks on the phonological (and semantic) similarity between the words in such dyads. Thus, there exists the need to investigate the role of phonological (and semantic) similarity between the words of a bilingual dyad, especially in the absence of orthographic similarity. Borrowed words from one language to another provide a suitable avenue to explore this question. Cross-orthographic (or cross-script) bilingual participants of this study performed the semantic judgment of visually presented words in a masked priming paradigm in each of their languages while we simultaneously collected the event-related potentials (ERPs). The primes were either translations (different phonology & orthography: P–O–; phonologically incongruent) or transliterations (same phonology & different orthography: P + O–; phonologically congruent) of the target. Overall, the results showed no difference between the two prime conditions. We discuss our findings in light of the BIA and BIA + models of bilingual visual word processing and discuss the relevance of the former model in orthographically distinct bilingual language dyads.

www.nature.com/scientificreports/ rare targets in L1 and another button for similar targets in their L2. While the participants performed this task, the ERPs associated with the nonwords (non-targets) that, in turn, appeared like targets were measured. They hypothesized that the ERP indices of infrequent stimuli in an oddball paradigm (e.gs. N2 & P3 components) would be present on the target-like nonwords only if their orthographic cues are processed in the lexical decision task. Both behavioral and ERP findings from this study provided evidence for the early (i.e., pre-lexical) usage of orthographic cues. The target-like non-words (i.e., orthographically similar to the targets) received more false alarms compared to those that did not show such similarity with the targets. As predicted, they observed the N2 and P3 components suggestive of early processing of the orthographic cues. Finally, a very recent study 7 that compared the cross-script (Japanese-English) and same-script (Spanish-English) bilinguals on a pictureword interference paradigm provided further evidence for the perceptual differences between orthographies as a language cue which can facilitate faster processing of words in the target language. As stated earlier, according to the BIA models, the visual word processing commences from the early identification of the letter features [8][9][10] . In light of these models, those cross-script studies that proposed the activation of the phonology of the non-target language while processing words in the target language indicate that the orthography of the former language is activated by the target language. However, other studies maintain that cross-script primes conveniently limit the visual word processing to the relevant (target) language. A primary question per the BIA models, here, is that how do the features of letters in the target language that do not share any visual similarity with letters in the non-target language activate the latter. One way to explore this question is to use the cross-script cognate translation (e.g., borrowed words: henceforth, transliteration) and cross-script non-cognate translation (henceforth, translation) primes during visual word processing. The transliterations are phonologically similar but orthographically dissimilar to each other (P + O-). The borrowed words from one language to another (that do not have a unique translation in the latter language) qualify these conditions. Such words are written with the script of the recipient language, yet spoken akin to the lender language. The translations, on the other hand, are neither phonologically nor orthographically similar (P-O-). That is, such words have a unique word form in each language. Both transliterations and translations share the same semantics (S +) between the languages.

Current study
In the current study, we administered the masked priming paradigm coupled with ERP on a group of Malayalam-English bilinguals. Malayalam (the primary language spoken in Kerala, a southern state in India) belongs to the alphasyllabic (or semi-syllabic) orthography, like most other Indian languages. Consonantal graphemes in this language are always syllabic, whereas canonical vowel graphemes are phonemic in pronunciation. Further, the visuospatial organization of graphemes in this language is complex compared to English (alphabetic orthography) 33 . More importantly, due to historical reasons, Malayalam, like many languages in India, borrowed several words from English, thus providing us an opportunity to compare the transliteration (i.e., semantically and phonologically similar, but orthographically dissimilar) and translation words as primes to explore the relevance of orthography in cross-orthographically distinct language dyads.
In this study, we hypothesize that if the phonology influenced the visual word processing in cross-script bilingual contexts, the transliteration trials (P +O-, e.g., CAR-{ka:r}) shall yield faster response time (RT) compared to the translation trials (P-O-, e.g., TABLE-{me:ʃa}). This, in turn, would support the BIA + model 10 . On the other hand, a lack of difference in RT between the two conditions would support the BIA model 8,9 , and raise concern over the influence of phonology (and thus, on the BIA + model 10 ) in cross-script bilingual visual word processing contexts. Further, such a lack of difference could augment the argument on the role of orthography in cross-script bilingual visual word processing contexts.
Regarding the ERPs, as the primes and targets are always in different languages, a comparison between the translation and transliteration trials in the early time window (100-200 ms), where the visual/orthographic features are extracted, shall not yield any difference (i.e., no N/P150 effect). More importantly, if phonology influenced the mapping of orthography to phonology in bilingual visual word processing, the N250 effect could be expected (within 200-350 ms) due to phonological mismatch between transliteration trials (P +O-) and translation trials (P-O-). Finally, as both the translation and transliteration trials share the same semantic concepts, we do not expect a difference between translation and transliteration trials in the later time window (i.e., no N400 effect).

Results
Behavioral measures. The reaction times and error rates are shown in Fig. 1. We performed separate repeated measures ANOVA on the reaction time and error rates. Though the Transliteration trials elicited faster response time (mean difference = 26 ms) compared to the Translation trials, this difference was not statistically significant (F(1, 17) = 3.46, p = 0.08, ŋ p 2 = 0.16). Similarly, we did not observe a significant main effect of target language (F(1, 17) = 0.26, p = 0.61, ŋ p 2 = 0.01), though the participants were 9 ms faster while processing English targets compared to Malayalam targets. And, the interaction between prime congruency and target language was not significant (F(1, 17) = 0.2, p = 0.66, ŋ p 2 = 0.01). The results of error data analysis were different from that of the RTs. Participants committed more errors on Translation trials compared to the Transliteration trials (F(1, 17) = 5.42, p = 0.03, ŋ p 2 = 0.24). However, neither the main effect of the target language (p = 0.16) nor the interaction between prime congruency and the target language (p = 0.64) were statistically significant.   Data-driven analysis. The cluster-based ANOVA that included all 30 electrodes and all time points between 0 to 600 ms did not show any significant main effects. However, the interaction between Target language and Congruency was significant between 140 and 180 ms (cluster p value = 0.041; Cohen's d = 0.83). The difference in the ERP between translated and transliterated trials was more positive for the Malayalam targets than for the English targets between 140 and 180 ms (Fig. 2).

Discussion
In the present study, we compared the behavioral and electrophysiological data from a group of Malayalam-English bilinguals while they processed the prime-target pairs that were either translations or transliterations. The participants performed the semantic judgment of prime-masked target words separately in each language. The two languages considered in this study belonged to entirely different orthographies having minimal or no visual/orthographic similarity between them. As mentioned in the Introduction, while the transliterations share the same phonology and semantics between the languages but not the orthography, the translations share only semantics but not the phonology and orthography. Thus, the primary difference between these two types of words lies in their phonology, and this allowed us to precisely manipulate the phonology between the primes and targets.
The reaction time of the target words preceded by transliterated (i.e., congruent: P+ O-) primes was shorter (i.e., faster) by 26 ms compared to that of the targets preceded by translation (i.e., incongruent: P-O-) primes. However, the difference was not significant. This primarily indicates that the phonological similarity in the case of transliterated primes did not have an influence on the target word processing. This finding is in contrast to the earlier reports on phonological facilitation 15,18 . Note that, in the current study, the orthography of primes always differed from that of the targets. Further, the prime and target had the same semantic representation (i.e., translation & transliteration). While the transliterated primes were phonologically congruent with the target word, the translated primes were phonologically incongruent with the target words. Therefore, the behavioral findings from the current study show that in the absence of orthographic similarity, the phonological similarity between primes and targets did not have any influence on the target word processing. That is, the possible phonological facilitation effect reported elsewhere 34 could be annulled at subliminal prime display times when the prime's orthography differs from that of the target. The findings from this study may, therefore, strengthen the arguments on the plausible role of orthography in bilingual visual word processing, at least, in orthographically distinct languages 6 .
The ERP findings of this study mostly endorsed the behavioral data. The only ERP effect observed was between 100 and 200 ms, where (1) the prime congruency had an effect on Malayalam targets, but not on English targets, and (2) the Target language had an effect on Translation (P-O-) trials, but not on Transliteration (P + O-) trials. A closer look at these results reveals that both effects arose from significantly higher ERP amplitude of the Malayalam translations (4.78 µV) over Malayalam transliterations (3.9 µV) and English translations (2.78 µV). These effects were also present in the fully data-driven cluster-based permutation analysis. We did not predict an effect in this time window as the dissimilarities between prime and target were equivalent. What could have caused the rise in the amplitude of Malayalam translation trials? Is this an effect of phonology?
The N/P150 effect observed in the 100-200 ms time window could be attributed to the effect of phonology, as we manipulated this variable in the current study. Though N/P150 effect has been reported in a cross-script  35 ), and the earlier time window shows (e.g., 150-250 ms) the orthographic effects 25,35,36 . The unidirectional difference in ERP amplitude from English-to-Malayalam translation prime-target dyads indicates that certain additional factor(s) might have influenced the ERP amplitude in this time window. In the following section, we discuss the possible reasons for these observations. One possible explanation for this effect is that, in the current study, the loaner language was always English. That is, borrowed words were always Malayalam transliterations of English words, but not in the reverse direction (as such words are extremely meager in this bilingual dyad). Thus, the Malayalam translations had a unique lexical form in each language (e.g., /me:ʃa/-TABLE). However, this was not the case with Malayalam transliterations as these words had only one word form (in the lending language: here, English) shared with the borrowing language (e.g., CAR-{ka:r}). Further, the Malayalam targets were always preceded by English primes. Thus, in the case of translated Malayalam targets, the English primes would have pre-activated their word forms (e.g., TABLE) followed by the word forms in Malayalam (e.g., /me:ʃa/-table) by the targets after 500 ms (i.e., duration of the backward mask). These two distinct word forms would have led to certain conflict, giving rise to an increased ERP amplitude (note that there exists ERP evidence for the early access to lexical features (i.e., 100-200 ms) such as word frequency and lexicality while processing visually presented letter strings 37 ). On the other hand, in the case of transliteration trials, both prime and target activate the same word form, giving rise to no such conflicts, thus leading to no difference in ERP amplitude. However, this sort of an elevation in ERP amplitude happened only with translated Malayalam targets, but not when they were presented as primes (followed by English unique targets). This indicates that the difference in the word form similarity between the transliteration and translation trials alone cannot explain the observed ERP effect in the 100-200 ms time window. Below, we discuss additional factors that could have possibly influenced the ERP amplitude in this time window.
First, it may be noted that, behaviorally, the Malayalam targets showed a non-significantly longer RT compared to English targets. Second, Malayalam is orthographically (and visually) more complex compared to English 33 . Finally, the participants in this study (applicable to the majority of the youngsters in India who are enrolled in English medium schools), despite their native fluency in speaking and reading Malayalam, had limited exposure to the written form of this language compared to written English early from the kindergarten level. Note that, at the time of their recruitment to this study, the participants were immersed in a fully English dominant (academic) environment for a minimum of 2-6 years, where formal exposure to written Malayalam was absent. The increased error rates with Malayalam targets in our participants (see Results) substantiate this reasoning. Considering all these points above, we believe that the briefly presented English primes might have been processed more effectively, compared to Malayalam primes, at subliminal levels, giving rise to the ERP effect in the case of translated English-Malayalam prime-target pairs in this time window. However, this needs confirmation with additional investigations.
The ERPs in the subsequent time windows (i.e., 200-350 ms & 350-550 ms) also failed to reveal any significant effect of either the prime congruency or the target language. That is, the ERPs failed to show both the N250 and N400 effects. The former effect is considered as an index of orthography-to-phonology mapping processing, whereas the latter is believed to represent semantic processing 25 . In the current study, we hypothesized that as both translation and transliteration prime-target dyads share the same semantic representation, a comparison would yield null difference, and thus, an absent N400 effect. The ERPs in the final time window (350-550 ms) supported our prediction. However, more interesting is the ERP findings in the middle time window (200-350 ms), where the N250 effect was expected. Importantly, and in accordance with the behavioral data from this study, the middle time window did not show any significant effect (i.e., an absent N250 effect), thus, failing to show any influence of phonological similarity between the primes and targets in transliterations compared to translations in each language. While the earlier studies 26,27 attributed the N250 effect to sub-lexical processing of orthographic stimuli, finding from the 200-350 ms time window in current study is indicative of the non-phonological nature of this effect. Further, it may be noted that those studies that showed an N250 effect have either compared semantically congruent and incongruent prime-target pairs 28,29 or only used semantically incongruent prime-target pairs 31 . Therefore, it may be the case that semantic (and not phonological) congruency needed to be manipulated to observe an N250 effect.
Consolidating the observations, thus far, it becomes apparent that the RT failed to show any influence of phonology, which in turn was corroborated by an absent N250 ERP effect. These findings, in general, did not support the influence of phonology in bilingual visual word processing context. Based on the behavioral and electrophysiological findings from the current study, we argue that in the absence of orthographic overlap between the primes and targets, phonological (or combined phonological + semantic) similarity may not influence the visual word processing, at least, in orthographically distinct bilingual dyads, such in the current study. These findings could augment the arguments that early orthographic cues might guide visual word processing in a resourceeconomic, language-selective manner 6,7,32 . Finally, the ERP difference in the early time window (100-200 ms) with translated Malayalam targets and the overall increased error rates with Malayalam targets in this study could possibly be due to the difference in participants' exposure between written forms of Malayalam and English. www.nature.com/scientificreports/ In light of these observations, we recommend that future studies on visual word processing in bilinguals shall consider the participants' exposure to the written forms of the languages under consideration.
Implication on BIA models. The findings from this study provide certain insights on the BIA models. As delineated in the Introduction, the BIA model 8,9 relied primarily on the orthographic similarities between the languages in a bilingual lexicon. The findings from the current study are explainable based on this model. For instance, the orthographic features of the target language are initially identified. According to the BIA model 8,9 , the orthographic features of the target language inhibit the neighboring, less-activated units of the non-target language and the activation of the former progresses to the word and then to the language nodes. These language nodes subsequently inhibit the non-target words in the bilingual lexicon. Behaviorally, the absence of the main effect of prime congruency supports the model's prediction as neither transliterated nor translated primes (both in different orthographies with reference to the target word) influenced the target word processing. However, this finding from the current study raises certain concerns on the phonological similarity between the prime and target words, as proposed in the BIA + model 10 . That is, according to the BIA + model, in addition to the orthographic overlap, the phonological and semantic similarities between the languages also contribute to bilingual visual word processing. Behavioral results from the current study show that, in the absence of orthographic similarity, the phonological overlap between the prime and target may not exert an influence on bilingual visual word processing. For instance, in the case of a transliterated (P + O-) prime-target dyad, both the prime and target shared the same phonology and semantics, differing only in their orthography. However, such primetarget dyads failed to reveal any response time (RT) advantage over the translation (P-O-) dyads, thus, raising concerns over the role of phonology in bilingual visual word processing, especially in the cross-script contexts. Similarly, from the ERP data, we expected an N250 effect arising from the difference in phonology between the two types of prime-target dyads used in this study. However, such an effect was not observed. Thus, based on both behavioral and electrophysiological data, we consider that in the absence of the orthographic overlap (such as in the current study), the phonological (and semantic) similarity between the primes and targets fail to show any processing advantage. The findings from this study, thus, support the BIA 8,9 model, but not the BIA + model 10 in cross-script visual word processing contexts.

Participants. Eighteen Malayalam-English bilingual adults between 18 and 25 years of age (Mean = 21.4 years)
were recruited to this study from the university community. Malayalam was their native language (L1) and English was their second language (L2). All participants started learning English before the age of 5 years with the commencement of kindergarten, and rated their proficiency in this language between 5 and 7 on a scale of 1-7, indicating 'Good' to 'Native-like' proficiency in speaking, listening, reading, and writing. None of them had any history or complaint of neurological, auditory, or language disorders. All of them had a normal or correctedto-normal vision. The sample size is comparable with the studies that used similar methodology to investigate phonological processing using EEG 5,14,15,23,29,37 . The sample size was also adequate in detecting differences in ERPs between two conditions in a within-subject design with 80% power (using data simulation for clusterbased permutation tests 38 ). The study was approved by the ethics committee at the College of Health Professions, Manipal Academy of Higher Education. Informed consent was obtained from all participants. All methods were performed in accordance with the relevant guidelines and regulations.

Stimuli.
A set of 100 English words (names of objects/items) were chosen for this study, of which 50 had unique translation equivalents in Malayalam (e.g., Table-/me:ʃa/), and the remaining did not have such translations in Malayalam (e.g., CAR-/car/). That is, the latter set of English words was borrowed to Malayalam such that they were written in Malayalam orthography, though spoken akin to English pronunciation (i.e., transliterated words: e.g., -CAR). The English words were 3-11 letters in length and their transliterated and translated Malayalam counterparts were 2-5 letters in length. This difference in character length is expected as English is an alphabetic language, whereas, Malayalam is an alphasyllabary (more like a syllabic orthography). The translated and transliterated English words (to Malayalam) did not differ in their frequency (t = − 0.38, p = 0.7), word length (t = − 0.97, p = 0.33), bigram frequency (t = − 1.3, p = 0.19), as well as on the orthographic neighborhood density (t = − 1.01, p = 0.31) (English Lexicon Project). As the Malayalam words did not have information on these variables, we obtained the familiarity rating (on a scale of 5: 1-highly familiar, 2-familiar, 3-unfamiliar, 4-highly unfamiliar, & 5-never heard) of an initial corpus of Malayalam 170 words by five native speakers. From their ratings, we selected 50 words each that received either 'highly familiar' or 'familiar' rating from all the participants (who rated), to the translation and transliteration trials. These two sets of Malayalam words did not differ from each other in their familiarity rating (t = − 0.8, p = 0.42). The stimuli used in this study as well as their details are provided in the Supplementary material.
In this study, we specifically explored the influence of phonological similarity/dissimilarity of the primes on the target word processing. Therefore, we did not introduce any semantic controls (e.g., non-words). Thus, all the primes and targets were true words. We used four types of prime-stimulus relations such as: a) L2-L1 Phonologically congruent (i.e., English Loan Word Prime -Malayalam Transliteration Word Target : e.g.,car-{ka:r} ), b) L2-L1 Phonologically incongruent (i.e., English Unique Word Prime -Malayalam Translation Word Target : e.g., Procedure. The participants were seated ~ 100 cm away from the computer monitor (with a 60 Hz refresh rate) in a comfortable chair. Both the prime and target words were displayed in the center of a 24″ monitor as black letters on a white background in the Baraha font (Baraha 7.0). The presentation scheme of a trial is depicted in Fig. 3. Each trial began with the presentation of a fixation cross ( +) on the center of the screen for 500 ms which was then followed by a forward mask (######) for 500 ms. The prime was presented for 48 ms which was then replaced by a backward mask (######) for 500 ms. The target followed the backward mask and remained on the screen for 1500 ms or until the participant made a response. English words in the prime position were in lowercase (Block 1), and in the target position, were presented in upper case (Block 2). After the target word disappeared, an inter-trial interval in the form of a blank screen of 1000 ms was presented which in turn was followed by a 'blink screen' (*|*) for 500 ms where participants were allowed to blink, if desired. Their task was to press '1' for the name words of natural objects (e.g., earth), and '2' for man-made objects (e.g., computer) on the right-sided number keys of a standard computer keyboard. All participants performed the semantic judgment on the translated and transliterated prime-target pairs (N = 50 each) in both languages (total number of trials = 200). Half of the participants received Malayalam words as targets (with English primes) initially and the remaining received English targets first (with Malayalam primes). The experiment was designed and deployed with E-Prime experiment software (E-prime 2.0).
EEG recording procedure. Continuous EEG was acquired with 30 Ag-AgCl scalp electrodes mounted in an electrode cap (Quick-Cap, Compumedics Neuroscan 4.5) in accordance with the standard 10/20 system. The ground electrode was positioned between FPz and Fz. Electrical activity from both the mastoids was recorded, with the left mastoid (M1) as the reference. Two electrodes positioned above and below the left eye measured the vertical eye movement (VEOG), and another electrode pair placed on the outer canthi of each eye measured the horizontal eye movements (HEOG). The impedance at each electrode was maintained below 5 kΩ. The signal from the scalp electrodes was amplified 20,000 times (SynAmps 2 amplifier, Compumedics), sampled at 1000 Hz, and low pass filtered at 100 Hz online.
Data analysis. We used the EEGLAB 39 , ERPLAB 40 , and fieldtrip 41 toolboxes running on MATLAB 2020a (Mathworks, Natik, MA, USA), for the data analysis. The continuous EEG data was (band-pass) filtered with windowed-sync finite impulse response (FIR) filter between 0.1 and 30 Hz with a roll-off of 12 dB/Octave. EEG was then divided into epochs of -100 to 800 ms duration relative to the target onset using ERPLAB. There were 4 conditions: (1) Phonology-congruent English Prime -Malayalam Target (i.e., Malayalam transliterated words); (2) Phonology-incongruent English Prime -Malayalam Target (i.e., Malayalam translated words); (3) Phonology-congruent Malayalam Prime -English Target (i.e., English loan words); and (4) Phonology-incongruent Malayalam Prime -English Target (i.e., English unique words). These epochs were baseline-corrected between -100 to 0 ms. The EEG data were then subjected to independent component analysis using EEGLAB. Subsequently, we used the automated classification tool (ICLabel 42 ) to identify the noisy components in the EEG data. Components with > 80% threshold for 'eye blinks' , 'muscle' , and 'channel noise' were eliminated from each participant's EEG data. Additionally, we removed all epochs across the participants that exceeded the amplitude of ± 80 µV at any time point (artifacts). Each participant had at least 80% artifact-free trials in each condition. Each participant's artifact-free epochs belonging to each of these four conditions were averaged. Finally, we separately averaged the epochs of the four conditions across the participants to obtain the grand average of the four conditions.
Statistical analyses of ERP data. The statistical analysis was done in two stages. The first analysis was literature-driven, whereby the electrodes and time ranges of interest were selected based on previous research 27,28,43,44 . The ERPs were averaged across 9 electrodes around Cz (CP4, C4, CP3, CPz, Cz, FC4, C3, FCz, FC3) and the amplitudes were calculated from three time-windows: 100-200 ms, 200-350 ms, and 350-550 ms. www.nature.com/scientificreports/ These amplitudes were subjected to a 2 × 2 repeated measures analysis of variance (ANOVA) with the factors Target language (English, Malayalam) and prime congruency (translated, transliterated). The second analysis was completely data-driven as the literature-driven analysis in the previous step was limited only to the known ERP effects. To this end, non-parametric cluster-based permutation tests were used 45 .
We computed a repeated measures 2 × 2 cluster-based permutation ANOVA with Target language (English, Malayalam) and prime congruency (translated, transliterated) as factors. The main effects and interactions were tested separately. The analysis included all the time points between 0 to 600 ms and all electrodes. First, a series of t-tests were computed at each electrode and time point. From these time points, where the waveforms differed significantly from each other was identified (p < 0.05, two-tailed) for each channel. Clusters were then formed by connecting the time points that showed a significant effect based on temporal and spatial adjacency. This was done separately for data points that showed positive and negative t values. Cluster level statistics were computed by adding together all t values within a cluster (mass t score). To control Type I errors due to multiple comparisons (30 channels × 600 time-points), a permutation approach was used. For this, a data-driven null hypothesis distribution was created by randomly swapping the stimuli labels within participants 5,000 times and computing mass t scores for each randomization. The mass t scores obtained in the first step were then compared with the null hypothesis distribution. The cluster was determined to be significant if it fell in the top 2.5 or bottom 2.5 percentile of the null hypothesis distribution.