Representational similarity analysis reveals task-dependent semantic influence of the visual word form area

Wang, Xiaosha; Xu, Yangwen; Wang, Yuwei; Zeng, Yi; Zhang, Jiacai; Ling, Zhenhua; Bi, Yanchao

doi:10.1038/s41598-018-21062-0

Download PDF

Article
Open access
Published: 14 February 2018

Representational similarity analysis reveals task-dependent semantic influence of the visual word form area

Xiaosha Wang^1,2,3^na1,
Yangwen Xu^2,3^na1,
Yuwei Wang^4,5,
Yi Zeng^4,5,
Jiacai Zhang¹,
Zhenhua Ling⁶ &
…
Yanchao Bi^2,3

Scientific Reports volume 8, Article number: 3047 (2018) Cite this article

6032 Accesses
21 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Access to semantic information of visual word forms is a key component of reading comprehension. In this study, we examined the involvement of the visual word form area (VWFA) in this process by investigating whether and how the activity patterns of the VWFA are influenced by semantic information during semantic tasks. We asked participants to perform two semantic tasks - taxonomic or thematic categorization - on visual words while obtaining the blood-oxygen-level-dependent (BOLD) fMRI responses to each word. Representational similarity analysis with four types of semantic relations (taxonomic, thematic, subjective semantic rating and word2vec) revealed that neural activity patterns of the VWFA were associated with taxonomic information only in the taxonomic task, with thematic information only in the thematic task and with the composite semantic information measured by word2vec in both semantic tasks. Furthermore, the semantic information in the VWFA cannot be explained by confounding factors including orthographic, low-level visual and phonological information. These findings provide positive evidence for the presence of both orthographic and task-relevant semantic information in the VWFA and have significant implications for the neurobiological basis of reading.

Word frequency and reading demands modulate brain activation in the inferior frontal gyrus

Article Open access 11 October 2023

The visual word form area (VWFA) is part of both language and attention circuitry

Article Open access 06 December 2019

Visual and linguistic semantic representations are aligned at the border of human visual cortex

Article 28 October 2021

Introduction

The left posterior occipitotemporal sulcus is a key region in the neural circuitry of reading. It is consistently activated by visual words across various writing systems^1,2, adapts to repeated presentation of words^3,4,5 and captures orthographic similarity among words^6,7,8. Its sensitivity to visual words develops with reading acquisition^9,10 and decreases upon damage^11,12. All these lines of evidence indicate the involvement of this region in orthographic representation and justify its name as the “visual word form area” (VWFA)¹³.

A central goal of reading is mapping word forms onto word meanings for comprehension¹⁴. Because of its structural and functional connections with higher-order language regions^9,15,16, the VWFA is commonly assumed to play an essential role in such mapping^17,18. The exact mechanism of the form-meaning mapping and whether the VWFA activity is modulated by word semantic properties remain inconclusive. Significant semantic priming effects were found in this region in a visual lexical decision task¹⁹, but not in a similar paradigm with a semantic oddball detection task³ or a naming task⁴. The effects that semantically related word pairs showed more similar activity patterns than semantically unrelated ones in the VWFA were found to be marginally significant in one recent study⁷, but not in an earlier study²⁰.

One possible explanation for these inconsistent findings of semantic effects in the VWFA might be related to the multidimensional organization of the semantic space^21,22,23. Previous studies examined semantic effects by contrasting conditions of strong and weak semantical relatedness, with relatedness quantified from subjective association strength³ or computational linguistics⁷. Nevertheless, concepts can be related to each other in very different ways. For example, taxonomic and thematic relations are two dissociable types of relations in the semantic system²⁴, with the former based on shared features (e.g. teacher-doctor) and the latter on frequent co-occurrence in events (e.g. teacher-classroom). A single dimension of semantic relatedness may entangle various relations and dilute semantic effects to be observed.

In this study, to examine whether and how the VWFA activity is influenced by semantic processing in reading comprehension, we tested multiple types of semantic relations in the VWFA in tasks requiring explicit semantic access. Specifically, we asked participants to perform taxonomic or thematic categorization tasks on the 45 words that fell into nine conditions arising from the combinations of three taxonomic categories (people, manmade objects, and locations) and three thematic categories (school, medicine, and sports), while obtaining the blood-oxygen-level-dependent (BOLD) fMRI responses to each word. We constructed semantic representational dissimilarity matrices (RDMs) based on four types of semantic relations: taxonomic, thematic, subjective semantic rating and word2vec²⁵ (a computational linguistic measure based on word co-occurrence patterns in a large language corpus) (Figure 1, top panel). Taxonomic and thematic RDMs targeted at specific dimensions of the semantic space by indicating whether the two words belonged to the same taxonomic or thematic category. The subjectively rated semantic distance and word2vec RDMs measured composite semantic relationships, which may integrate multiple dimensions of relatedness. The representational similarity analysis (RSA) approach²⁶ was adopted to evaluate the representational content of the VWFA by correlating the semantic RDMs with the neural RDMs derived from the word-word correlation distance embedded in neural patterns in each task. Orthographic, low-level visual and phonological RDMs were constructed and controlled for in further analyses to rule out the possibility that any semantic effects may be driven by these non-semantic information types in this region.

Results

Relationships among the theoretical/behavioral RDMs

Figure 1 illustrates eight theoretical/behavioral RDMs constructed by the pairwise relations of the 45 words, including four semantic RDMs (top panel, see Introduction) and four non-semantic RDMs (bottom panel, see Methods). The logographeme, pixelwise and phonological RDMs characterized word-word dissimilarity in orthographical, low-level visual and phonological information, respectively. The co-occurrence RDM measured how likely the two words would appear together in a 5-word window in texts, which might reflect the co-occurrence statistics in the visual field.

The Spearman correlation coefficients among these RDMs are shown in Table 1. Among the four semantic RDMs, the taxonomic and thematic RDMs were not correlated due to careful selection of stimuli. The two composite semantic RDMs (subjective rating and word2vec) were significantly correlated with each other (r = 0.441, P < 10⁻¹⁰) and differed in how they related to the taxonomic and thematic RDMs. The subjectively rated semantic distance was strongly correlated with the thematic RDM (r = 0.798, P < 10⁻¹⁰), not with the taxonomic RDM (r = 0.017, P = 0.584), whereas the word2vec distance showed significant correlations with both taxonomic and thematic RDMs (rs > 0.422, Ps < 10⁻¹⁰). For the relations between semantic and non-semantic RDMs, all the semantic RDMs were significantly correlated with the co-occurrence RDM (rs > 0.189, Ps < 10⁻⁸), implying that visual co-occurrence may be a confounding variable in any observed semantic effects. The word2vec distance was also significantly correlated with the logographeme and pixelwise RDMs (Ps < 10⁻⁴), which is consistent with the notion that this algorithm captures multiple dimensions of similarity²⁵. The significant correlation between taxonomic and logographeme RDMs (r = 0.107, P = 0.0007) is likely to be due to the fact that the majority of Chinese characters are so-called composite characters, containing a semantic radical and a phonological radical. Characters belonging to the same taxonomic category sometimes share the same semantic radical. For instance, many animal words in Chinese share the semantic logographeme “”, e.g. “”(cat), “”(dog), “” (wolf), “” (fox)). The correlations between pixelwise and co-occurrence (r = 0.166, P < 10⁻⁶), pixelwise and phonological (r = −0.105, P = 0.0009) RDMs are less straightforward to interpret and might be epiphenomenal in Chinese written language given the prevalence of orthographic neighbors/homophones of Chinese characters.

Table 1 Spearman correlation coefficients among theoretical/behavioral representational dissimilarity matrices.

Full size table

Behavioral results in the fMRI experiment

In the scanner, participants were presented with the 45 words that fell into nine conditions arising from the combinations of three taxonomic categories (people, manmade objects, and locations) and three thematic categories (school, medicine, and sports), with five words per condition (see Supplementary Table S1). In different runs, they were asked to categorize each word either by taxonomic or thematic memberships. They performed the two tasks with equally high accuracy (taxonomic task, mean = 96%, standard deviation (SD) = 3%; thematic task, mean = 96%, SD = 3%; task difference, paired t₁₈ = 0.28, P = 0.78) and with comparable reaction times (taxonomic task, mean = 1513.50 ms, SD = 415.33 ms; thematic task, mean = 1497.03 ms, SD = 437.34 ms; task difference, paired t₁₈ = 0.46, P = 0.65).

Orthographic representation in the VWFA

We defined the VWFA in both anatomical and functional ways. To verify orthographic representation in the anatomically defined VWFA and to functionally localize voxels sensitive to orthography, we examined the correspondence between the logographeme RDM and the neural RDMs based on the overall functional data (i.e. the collapsed dataset of the taxonomic and thematic tasks, see Methods).

Anatomically defined VWFA

The anatomical mask was defined as a box covering the left posterior occipitotemporal sulcus with cerebellum voxels excluded (Figure 2A)²⁷. One-sample t tests (one-tailed) revealed significantly positive correlation for logographeme information in this region of interest (group-averaged Fisher-z-transformed Spearman r (mean r) = 0.025; t₁₈ = 3.112, P = 0.003), but not for pixelwise, co-occurrence, or phonological information (Ps > 0.41; Figure 2B, left panel). Comparing correlation coefficients of logographeme information with each of these control variables revealed significant differences between logographeme and pixelwise information (paired t₁₈ = 3.380, P = 0.003), between logographeme and co-occurrence information (paired t₁₈ = 4.024, P < 0.001). The difference between logographeme and phonological information showed a nonsignificant trend toward significance (paired t₁₈ = 1.788, P = 0.091).

Functionally defined VWFA

For the functional VWFA localization, a whole-brain searchlight RSA with the logographeme RDM (cluster-level FWE corrected P < 0.05, voxelwise Z > 3.09) revealed one single cluster in the left posterior occipitotemporal cortex (peak MNI coordinates xyz = −46, −64, −18; for the other two significant regions see Table 2), which partially overlapped with the anatomically defined VWFA (Figure 2A). We then identified functional VWFA in individual subjects using the same way (see Methods) and examined its encoding of pixelwise, co-occurrence and phonological information (Figure 2B, right panel). One-sample t tests (one-tailed) revealed that none of the three types of information was significantly associated with the neural activity patterns of the functional VWFA (Ps > 0.30).

Table 2 Brain regions whose activity patterns encoded logographeme information of Chinese words in a whole-brain searchlight analysis (cluster-level FWE corrected P < 0.05 with voxelwise Z > 3.09).

Full size table

Task modulation effects

We investigated whether semantic demands could modulate the orthographic representation in the VWFA by comparing the correlations of the logographeme and neural RDMs between the two tasks. We first looked at logographeme representation in the VWFA in each task and found that while the neural activity patterns of the functional VWFA significantly associated with orthographic information in both tasks (taxonomic task: mean r = 0.030, t₁₈ = 6.014, P < 0.001; thematic task: mean r = 0.030, t₁₈ = 5.130, P < 0.001), the anatomically defined VWFA showed weaker representations (taxonomic task: mean r = 0.008, t₁₈ = 1.463, P = 0.080; thematic task: mean r = 0.009, t₁₈ = 1.028, P = 0.159), possibly due to the imprecise localization of orthography-sensitive voxels in individuals. Nevertheless, in both VWFA masks, paired t tests comparing logographeme information between the two tasks revealed no significant differences (Ps > 0.922), indicating task-independent orthographic representation in this region.

Semantic information in the VWFA

We then examined how various types of semantic information were encoded in the VWFA in each task and how they were modulated by task demands (Figure 2C).

Taxonomic information

In the taxonomic task, the taxonomic RDM showed significantly positive correlation with the neural RDM in the VWFA regardless of the mask definition (anatomical mask: mean r = 0.029, t₁₈ = 4.166, P < 0.001; functional mask: mean r = 0.030, t₁₈ = 3.890, P = 0.001). In the thematic task, the taxonomic information was not associated with the activity patterns of the VWFA in either mask (Ps > 0.115). Significant task differences were found in both the anatomical (paired t₁₈ = 2.302, P = 0.033) and functional (paired t₁₈ = 2.790, P = 0.012) VWFA masks.

Thematic information

In the taxonomic task, the thematic RDM was not correlated with the neural RDM of the VWFA in either mask (Ps > 0.338). In the thematic task, the thematic information showed significantly positive correlation with the neural RDM in the VWFA (anatomical mask: mean r = 0.022, t₁₈ = 3.773, P < 0.001; functional mask: mean r = 0.023, t₁₈ = 3.242, P = 0.005). Significant task differences were found in both the anatomical (paired t₁₈ = 3.261, P = 0.004) and functional (paired t₁₈ = 3.051, P = 0.007) VWFA masks.

Subjective semantic rating

In the taxonomic task, the subjectively rated semantic RDM was not associated with the neural RDM of either the anatomical or functional VWFA masks (Ps > 0.737). In the thematic task, the presence of this information in the VWFA approached significance (anatomical mask: mean r = 0.012, t₁₈ = 1.468, P = 0.080; functional mask: mean r = 0.015, t₁₈ = 1.645, P = 0.059). Direct comparison between the two tasks did not reveal significant differences (Ps > 0.154).

Word2vec

The word2vec RDM was significantly associated with the neural RDM of the VWFA in both the taxonomic task (anatomical mask: mean r = 0.029, t₁₈ = 3.541, P = 0.001; functional mask: mean r = 0.034, t₁₈ = 3.701, P < 0.001) and the thematic task (anatomical mask: mean r = 0.040, t₁₈ = 4.731, P < 0.001; functional mask: mean r = 0.044, t₁₈ = 5.591, P < 0.001). No significant task differences were observed (Ps > 0.311).

Semantic information encoded in the VWFA: Controlling for non-semantic confounding variables

To test whether semantic information can explain the variance of the neural activity patterns in the VWFA over and above non-semantic factors, we computed partial correlations between the neural and semantic RDMs, controlling for the logographeme, pixelwise, co-occurrence and phonological RDMs. As shown in Table 3, task-relevant semantic information and word2vec distance remained significant in these partial correlation analyses. The subjectively rated semantic distance was (marginally) significant in the thematic task when the co-occurrence RDM was included as a nuisance variable. Linear regression analyses were then carried out to examine the unique contribution of orthography and semantic information to the neural RDM of the functionally defined VWFA (the anatomically defined VWFA was not analyzed here due to the insignificant orthographic representation in each task). Taking the group-averaged neural RDM as the dependent variable and the logographeme, taxonomic and thematic RDMs as the independent variables, we found that in the taxonomic task the logographeme (β = 0.126, P = 0.003) and taxonomic (β = 0.105, P = 0.008) information were significant predictors of the neural RDM, whereas in the thematic task the logographeme (β = 0.135, P = 0.001) and thematic (β = 0.086, P = 0.028) information were significant predictors. These results suggest the joint presence of orthographic and task-relevant semantic information in the VWFA in semantic tasks.

Table 3 Group-averaged Fisher-z-transformed Spearman correlation coefficients for semantic information in the VWFA, controlling for non-semantic confounding variables.

Full size table

Discussion

The aim of this study was to investigate whether the VWFA activity encodes semantic information in explicit semantic tasks. Using RSA, we computed the correlations between RDMs derived from neural activity patterns in the VWFA with various types of semantic RDMs in two semantic categorization tasks–taxonomic and thematic categorization. We found that the VWFA activity patterns were modulated by the semantic tasks, with words’ neural RDMs showing significant association with semantic dimensions that were relevant for the specific task being performed. That is, words that are taxonomically related (e.g. teacher-doctor) had more similar VWFA activity patterns under the taxonomic categorization task (people, objects, or locations) and those that are thematically related (e.g. teacher-classroom) had more similar VWFA activity patterns under the thematic categorization task (school, medicine, or sports). The composite semantic similarity measure derived from the advanced natural language processing algorithm (i.e., word2vec) together with big-data language corpora showed significant effects in both semantic categorization tasks, so did the orthographic similarity (the logographeme RDM). These findings provide positive evidence that both orthographic and semantic information was encoded in the VWFA during semantic processing and that the semantic effect dimensions change with task goals.

We first verified that the activity pattern in the left posterior occipitotemporal cortex is sensitive to the orthographic similarity of Chinese words. By constructing an orthographic RDM based on the overlap of logographemes–the basic functional unit in Chinese characters²⁸–between words, we found that the logographeme RDM showed significantly positive correlations with the neural RDM in the pre-defined anatomical mask and localized a cluster in the same region in the whole-brain searchlight analysis. Together with previous findings of orthographic representations in this region using RSA^6,7,8, this line of evidence echoes neuroimaging studies with conventional univariate approaches^3,5,19 and lesion studies¹¹ in supporting the central role of the VWFA in the orthographic representation. Orthographic computation appears to be an inherent property of the VWFA, because of either its sensitivity to specialized orthographic inputs^17,29 and/or synthesis of bottom-up inputs and top-down predictions¹⁸ and is thus robust regardless of tasks.

The effects of semantics in the VWFA are more complex. We did find positive semantic effects, but the effects varied by the type of semantic dimensions. For specific dimensions including thematic and taxonomic relations, the organization was tuned according to that particular dimension being judged. For subjectively rated semantic similarity measure, we did not see any significant effects, except for a trend in the thematic categorization task. For the semantic similarity derived from large-scale text using statistical learning models (word2vec), the effects were present in both semantic tasks. Worth noting is that semantic effects in the VWFA cannot be explained by the orthographic, low-level visual, first-order co-occurrence and phonological effects. Among these variables that were excluded from explaining the semantic effects, the first-order co-occurrence RDM is of particular interest. This model can be considered as an extended version of orthographic representation by characterizing how likely two word forms would co-occur in a local visual context (five words) during natural reading. Semantically related words (in both specific and composite semantic RDMs) tend to visually co-occur (Table 1), raising the possibility that semantic effects could be ascribed to visual co-occurrence in reading. Nevertheless, RSA results showed that words with greater first-order co-occurrences did not evoke more similar activation patterns in the VWFA (Figure 2B) and, more importantly, semantic effects remained significant when the first-order co-occurrence measure was controlled for. That is, the semantic effects in the VWFA we observed are not explained by these non-semantic properties we tested.

Why are there task-sensitive dimension-specific semantic effects in the VWFA and why are the word2vec effects present in both tasks? One possibility is that the VWFA contains neuronal populations sensitive to both taxonomic and thematic organizations. Attention boosts task-relevant information and/or tune down task-irrelevant information so that only task-relevant information is observed in the VWFA activity³⁰. Such semantic information, even if present, seems to be subtle or redundantly coded in other regions, as lesion/disruption to the VWFA had minimal influence on object recognition and language comprehension abilities¹². Another scenario consistent with the broader empirical findings is that the VWFA itself does not store semantic information, but inherits activation patterns in the higher-order semantic regions via top-down feedback. In semantic judgment tasks, when a reader sees the word “teacher”, the visual input activates its orthographic representation (likely to be in the VWFA) and then the corresponding word meaning representation (stored somewhere else in the semantic neural system). The semantic representations that are related to the target meaning (e.g. “doctor” or “classroom”) along various dimensions are also activated through spreading of activation due to overlapping features or associations. The types of neighboring meanings receiving more activation are dependent on the task at hand – when the judgment is about taxonomic classes, the taxonomic neighbors are more strongly activated; when the judgment is about thematic relations, the thematic neighbors are more strongly activated. Such activated semantic neighboring representations in turn feeds back to their own orthographic representations in the VWFA, resulting in more similar VWFA activity patterns for items sharing that semantic dimension. Such feedback does not seem simply epiphenomenal, but may contribute to orthographic identification³¹ and overall task performance. Given the distributed neural basis of semantic memory^23,32, future studies are warranted to uncover the specific mechanisms of modulation between semantic regions and the VWFA using approaches that are optimized to study task-specific functional connectivity patterns.

Our study highlights the importance of taking the multidimensional and dynamic nature of semantic information into account when investigating the neural correlates of semantic processing. Previous studies that used subjective semantic relatedness have reported null results for semantic effects in the VWFA^3,4. Our rating results showed that the subjectively rated semantic RDM tended to be more similar to the thematic RDM than the taxonomic RDM, indicating that in our free rating context, the group-level subjectively perceived semantic distance is biased towards thematic relations. This is consistent with a similar preference for thematic thinking in the matching or free association tasks and accords with the impact of thematic relations on word similarity judgment³³. Thus, the semantic effects based on such measures may not be detectable in semantic tasks that do not rely on such dimension, e.g., detecting certain taxonomic categories³. In comparison, the word2vec distance was found to correlate with both the taxonomic and thematic RDMs, indicating that this composite semantic space is a multidimensional one that captures both taxonomic and thematic relations, thus explaining the results that the word2vec RDM correlated with the VWFA neural activity in both tasks. This is consistent with the marginally significant effect of the LSA distance–another composite measure containing both types of relations^34,35. The significant effects of word2vec in our study may be because word2vec captures richer semantic information than LSA²⁵.

The significant semantic effects observed here, in comparison to previous studies, are also likely to be driven by the explicit semantic tasks we used. For tasks where (deep) semantic processing was not necessary such as lexical decision, semantic effects tended not to be consistent in the VWFA^3,4,7. To our knowledge, there was only one study reporting both orthographic and semantic effects in the primed lexical decision task in the posterior fusiform gyrus¹⁹. In that study, the target word was presented 1300 ms, a period long enough for participants to explicitly associate it with the visible prime (presented for 150 ms). This is in contrast with other priming studies using very short stimulus representation time that emphasizes bottom-up input properties (e.g. 300 ms^3,4). Therefore, it seems that explicit and detailed semantic processing, as well as the consistency between semantic contents and task demands, would be required for robust semantic effects in the VWFA.

To conclude, by including multiple types of semantic distance measures and different task demands, we demonstrate that in explicit semantic tasks the activity patterns of the VWFA also contain task-relevant semantic information of written words in addition to orthographic information. Future studies are warranted to examine how semantic processing in the VWFA interacts with orthographic representations to support fluent reading.

Methods

Subjects

Twenty young healthy adults recruited from Peking University participated in this study (10 males; aged 18–27 years). They were all right handed, native Chinese speakers, with normal or corrected-to-normal vision. The study was approved by the Human Subject Review Committee at Peking University. All the experiments were performed in accordance with relevant guidelines and regulations. Informed consent was obtained from all participants. One participant was excluded from data analysis due to recoding errors of button press.

Stimuli and fMRI procedure

The stimuli set contained 45 Chinese words (see Supplementary Table S1) that belonged to nine conditions arising from the combinations of 3 taxonomic categories (people, manmade objects, and locations) and 3 thematic categories (school, medicine, and sports), with five exemplar words per condition. Three out of five words were bisyllabic (two characters) and the other two trisyllabic (three characters). Before scanning, participants were shown pictures of the intended meaning of each word to reduce word meaning ambiguity when words are presented alone.

The condition-rich rapid event-related design was adopted for the fMRI scan²⁶, with each word as an experimental condition. Lasting 260 s, each run started and ended with a 10 s blank screen and included 45 word trials, with each word presented exactly once. Each word trial started with a fixation cross on the center of a gray background for 500 ms, followed by the word (Song bold font, 36 point in font size) for 500 ms and a blank screen with varying lengths between 3 and 13 s. The duration of the blank screen as well as the stimulus sequence (organized as nine conditions) were determined using the optseq 2 optimization algorithm³⁶. Five words within each condition were randomly presented and run orders were further randomized across participants. There were 10 runs in total.

Two semantic categorization tasks were adopted. In half of the runs, a taxonomic judgment task was performed, in which participants were asked to categorize each word into three taxonomic categories (people, objects, and locations) by pressing three buttons with their right middle finger, right index finger and left index finger, respectively. In the other half of the runs, participants performed a thematic judgment task in which they categorized words into three thematic categories (school, medicine, and sports) using the same fingers and buttons in the taxonomic task. The run order of taxonomic and thematic tasks was randomized across participants.

fMRI acquisition and preprocessing

The fMRI results were reanalyses of data that were collected for another study investigating the neural basis of semantic relations. The acquisition and preprocessing procedures are as follows. Whole-brain imaging was performed on a 3 T Siemens MRI Scanner (MAGNETOM Prisma) at the Center for MRI Research, Peking University. Functional images were acquired using the multi-band echo-planar sequence [repetition time (TR) = 2000 ms, echo time (TE) = 30 ms, flip angle (FA) = 90°, matrix size = 112 × 112, 64 axial slices, voxel size = 2 × 2 × 2.2 mm, multi-band factor = 2]. High-resolution three-dimensional T1-weighted images were acquired using the magnetization-prepared rapid gradient-echo sequence (TR = 2530 ms, TE = 2.98 ms, inversion time = 1100 ms, FA = 7°, matrix size = 448 × 512, 192 sagittal slices, voxel size = 0.5 × 0.5 × 1 mm).

The images were preprocessed using SPM12 (Wellcome Trust Center for Neuroimaging, http://www.fil.ion.ucl.ac.uk/spm/software/spm12/). For each participant data, after discarding the first five volumes of each run, functional images were corrected for slice timing and head motion. The resulting un-smoothed and un-normalized images were entered into the general linear model (GLM) for further analysis. The structure image was co-registered to the mean functional images and segmented into different tissues. The deformation fields for spatial normalization of native space to the Montreal Neurological Institute (MNI) space and reverse normalization were also obtained in this step.

fMRI data analysis

The whole-brain activation maps for each word in individual subjects were obtained via GLM in the first-level analysis. Two GLMs were built, differing on whether to include task-specific regressors for each word. The first GLM included 45 regressors for each run, one for each word and and the second GLM included 90 regressors, two for each word with one for the taxonomic task and the other for the thematic task. Trial-level differences in reaction time (RT) were controlled for by convolving each trial with a boxcar equal to the length of its reaction time³⁷. Six head motion parameters and a global mean predictor for each run were also included in GLMs. A high-pass filter cut-off was set at 128 s. The subsequent word versus baseline contrast produced a whole-brain t map for each word and for each word under each task, which was used for the following activation pattern analyses.

Representational similarity analysis

RSA is a widely used approach to characterize the correspondence between brain activity patterns and theoretical/behavioral measurement³⁸. This method consists of constructing representational dissimilarity matrices (RDMs) for both measures and calculating the correlation between them. An RDM is a symmetric n × n matrix, where n is the number of experimental conditions (n = 45 in this study) and the off-diagonal values indicate the dissimilarity (or distance) for each pair of conditions in a certain aspect.

Theoretical/behavioral RDMs

Four semantic RDMs were constructed to investigate the potential semantic information embedded in the activity patterns of the VWFA. The taxonomic RDM was a binary RDM, assigning 0 to word pairs that belong to the same taxonomic category (e.g. teacher-doctor, chalk-bandage, classroom-hospital) and 1 to the remaining cells. The thematic RDM was also a binary RDM, assigning 0 to word pairs that belong to the same thematic category (e.g. teacher-student, teacher-chalk, teacher-classroom) and 1 to the remaining cells. The subjectively rated semantic RDM was based on pairwise ratings of semantic distance. Eighteen healthy college students (nine females, mean age = 23.5 years, range = 18–27 years) were recruited to rate how close two words were in meaning using a 7-point Likert scale (7 for the closest). Ratings for a total of 990 word pairs (pairwise combination of the 45 words) were collected and the RDM was computed as seven minus the averaged rating scores of 18 participants for each word pair, which resulted in a symmetric 45 × 45 matrix. The word2vec RDM was based on continuous vector representations of words generated by the skip-gram architecture²⁵. For the Baidu encyclopedia corpus containing approximately one billion word tokens, a vocabulary of the most frequent 249,222 words was first obtained through the Stanford parser. The word2vec tool was then used to train vector representations of words (https://code.google.com/p/word2vec/) with the following parameters: window size = 5, sub-sampling rate = 10⁻⁴, negative sample number = 5, learning rate = 0.025, dimension number = 300. The word distance was measured as one minus the cosine angle between feature vectors of each word.

To validate that the VWFA activity patterns are sensitive to orthographic information, we constructed a logographeme RDM to characterize orthographic dissimilarity between words. The logographeme has been proposed to be the basic unit of Chinese characters^28,39. The logographeme RDM was measured by one minus the proportion of shared logographemes between two words regardless of position. For instance, the word “” (campus) is composed of seven logographemes (“); the word “”(tampon) is composed of five logographemes (“”). They shared one logographeme (“”) and therefore the dissimilarity is 1-(1/12) = 0.917. Three control RDMs were constructed. The visual pixelwise RDM measured the pixelwise overlap of the binary silhouette images of word pairs^7,38. The co-occurrence RDM measured how likely the two words would appear together in a 5-word window in texts and was based on summed counts of co-occurrence frequency for each word pair within five words in the Chinese Web 5-gram Corpus (https://catalog.ldc.upenn.edu/LDC2010T06), which contains about 883 billion word tokens extracted from Chinese Web pages. The co-occurrence counts were log-transformed using ln (f + 40), where f is the raw summed counts and 40 is the lowest n-gram counts kept in the database, and then reversed to construct the co-occurrence RDM. The phonological RDM was calculated as one minus the proportion of shared sub-syllabic (initials or finals) units and tones regardless of position.

VWFA localization

The VWFA was defined in both anatomical and functional ways. An a priori anatomical mask covering the posterior occipitotemporal sulcus was adopted²⁷. This mask ranged from −54 < x < −30, −70 < y < −45 and −30 < z < −4 in the MNI space and voxels in the cerebellum according to the automated anatomical labeling template were excluded⁴⁰. This mask was reverse-normalized into each subject’s native space for further analysis.

To localize functional VWFA, a whole-brain searchlight RSA⁴¹ was first performed to identify brain regions sensitive to logographeme information. For each voxel in native space, we built a spherical region of interest (ROI, radius 6 mm) centering on the voxel, extracted t values in this ROI to each of the 45 words and calculated one minus Spearman rank correlations of all word pairs within this ROI to construct a neural RDM. The relationship between the neural RDM and the logographeme RDM was then assessed using partial Spearman correlation with the visual pixelwise RDM being controlled for (to ensure that orthographic representation was not contaminated by low-level visual similarity), which produced a correlation coefficient for this voxel. Moving the searchlight center throughout the cortex, we obtained a whole-brain r-map in the native space. Note that the searchlight analysis was restricted to the voxels with a probability higher than 1/3 in the native gray matter image generated from the segmentation step. For a group-level random-effects analysis, the r maps in the native space were Fisher-z-transformed, normalized to the MNI space using the forward deformation field and spatially smoothed using a 6 mm full-width at half maximum Gaussian kernel. The permutation-based statistical non-parametric mapping (SnPM; http://go.warwick.ac.uk/tenichols/snpm) was used (no variance smoothing, 10,000 permutations) to test for significance of positive correlations between the neural and logographeme RDMs across participants. Clusters surviving the cluster-level FWE correction at P < 0.05 with a voxelwise threshold of Z > 3.09 were reported. A single cluster was found in the left posterior occipitotemporal cortex, partially overlapping with the anatomical mask of the VWFA and was defined as the group-level functional VWFA (see Figure 2A and results).

For each subject, we then identified the voxels in the anatomical mask of the VWFA whose neural RDMs showed a significantly positive correlation with the logographeme RDM in the above-mentioned searchlight analysis (one-tailed P < 0.05, uncorrected; mean number of voxels across participants, 108, range: 10–302 voxels). These voxels together with their adjacent voxels within a 6-mm-radius sphere were considered as individual subjects’ functional VWFA (mean number of voxels: 1362, range: 349–2590 voxels).

RSA procedures for the VWFA

For both anatomical and functional VWFA masks, we calculated the neural RDMs as one minus Spearman’s rank correlation between each pair of words. To validate that the activation patterns of the VWFA showed some specificity of orthographic information, we first calculated the Spearman correlation between the neural RDM and the logographeme, visual pixelwise, co-occurrence and phonological RDMs for each ROI (Note that the logographeme effect of the functional ROI was shown for illustration purposes). We then investigated the semantic information in the VWFA in detail. Specifically, the neural RDMs for each task were compared with four semantic RDMs using the Spearman rank correlation. Partial correlations were also performed to control for logographeme, visual pixelwise, co-occurrence and phonological RDMs. The resulting correlation coefficients were Fisher-z-transformed and statistically inferred across participants. One-sample t tests were used to test whether the correlation was significantly greater than zero. Paired t tests were used to compare different information types and the same information type in different tasks.

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding authors on reasonable request.

References

Bolger, D. J., Perfetti, C. A. & Schneider, W. Cross-Cultural Effect on the Brain Revisited: Universal Structures Plus Writing System Variation. Hum. Brain Mapp. 104, 92–104 (2005).
Article Google Scholar
Liu, C. et al. The Visual Word Form Area: Evidence from an fMRI study of implicit processing of Chinese characters. Neuroimage 40, 1350–1361 (2008).
Article PubMed Google Scholar
Glezer, L. S., Jiang, X. & Riesenhuber, M. Evidence for Highly Selective Neuronal Tuning to Whole Words in the ‘Visual Word Form Area’. Neuron 62, 199–204 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kherif, F., Josse, G. & Price, C. J. Automatic Top-Down Processing Explains Common Left Occipito-Temporal Responses to Visual Words and Objects. Cereb. Cortex 21, 103–114.
Dehaene, S. et al. Cerebral Mechanisms of Word Masking and Unconscious Repetition Priming. Nat. Neurosci. 4, 752–758 (2001).
Article CAS PubMed Google Scholar
Zhao, L. et al. Orthographic and Phonological Representations in the Fusiform Cortex. Cereb. Cortex 27, 5197–5210 (2017).
Fischer-Baum, S., Bruggemann, D., Gallego, I. F., Li, D. S. P. & Tamez, E. R. Decoding levels of representation in reading: A representational similarity approach. Cortex 90, 88–102 (2017).
Article PubMed Google Scholar
Baeck, A., Kravitz, D., Baker, C. & Op de Beeck, H. P. Influence of lexical status and orthographic similarity on the multi-voxel response of the visual word form area. Neuroimage 111, 321–328 (2015).
Article PubMed PubMed Central Google Scholar
Saygin, Z. M. et al. Connectivity precedes function in the development of the visual word form area. Nat. Neurosci. 19, 1250–1255 (2016).
Article CAS PubMed PubMed Central Google Scholar
Dehaene, S. et al. How Learning to Read Changes the Cortical Networks for Vision and Language. Science. 330, 1359–1364 (2010).
Article ADS CAS PubMed Google Scholar
Hirshorn, E. A. et al. Decoding and disrupting left midfusiform gyrus activity during word reading. Proc Natl Acad Sci USA 113, 8162–8167 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gaillard, R. et al. Direct intracranial, fMRI and lesion evidence for the causal role of left inferotemporal cortex in reading case study. Neuron 50, 191–204 (2006).
Article CAS PubMed Google Scholar
Cohen, L. et al. The visual word form area Spatial and temporal characterization of an initial stage of reading in normal subjects and posterior split-brain patients. Brain 123, 291–307 (2000).
Article PubMed Google Scholar
Coltheart, M., Rastle, K., Perry, C., Langdon, R. & Ziegler, J. C. DRC: a dual route cascaded model of visual word recognition and reading aloud. Psychol. Rev. 108, 204–256 (2001).
Article CAS PubMed Google Scholar
Wang, X., Caramazza, A., Peelen, M. V., Han, Z. & Bi, Y. Reading Without Speech Sounds: VWFA and its Connectivity in the Congenitally Deaf. Cereb. Cortex 25, 2416–2426 (2015).
Article PubMed Google Scholar
Yeatman, J. D., Rauschecker, A. M. & Wandell, B. A. Anatomy of the visual word form area: Adjacent cortical circuits and long-range white matter connections. Brain Lang 125, 146–155 (2012).
Article PubMed PubMed Central Google Scholar
Dehaene, S. & Cohen, L. The Unique Role of the Visual Word Form Area in Reading. Trends Cogn. Sci. 15, 254–262 (2011).
Price, C. J. & Devlin, J. T. The Interactive Account of Ventral Occipitotemporal Contributions to Reading. Trends Cogn. Sci. 15, 246–253 (2011).
Gold, B. T. et al. Dissociation of Automatic and Strategic Lexical-Semantics: Functional Magnetic Resonance Imaging Evidence for Differing Roles of Multiple Frontotemporal Regions. J. Neurosci. 26, 6523–6532 (2006).
Article CAS PubMed Google Scholar
Baeck, A., Kravitz, D., Baker, C. & Op de Beeck, H. P. Influence of lexical status and orthographic similarity on the multi-voxel response of the visual word form area. Neuroimage 111, 321–328 (2015).
Article PubMed PubMed Central Google Scholar
Yee, E. & Thompson-schill, S. L. Putting concepts into context. Psychon. Bull. Rev. 23, 1015–1027 (2016).
Article PubMed PubMed Central Google Scholar
Binder, J. R. In Defense of Abstract Conceptual Representations. Psychon. Bull. Rev. 1096–1108 (2016).
Lambon Ralph, M. A., Jefferies, E., Patterson, K. & Rogers, T. T. The neural and computational bases of semantic cognition. Nat. Rev. Neurosci. 18, 1–14 (2017).
Google Scholar
Mirman, D. et al. Taxonomic and Thematic Semantic Systems. Psychol. Bull. 143, 499–520 (2017).
Mikolov, T., Chen, K., Corrado, G. & Dean, J. Efficient Estimation of Word Representations in Vector Space. Proc. Int. Conf. Learn. Represent. (ICLR 2013) 1–12, https://doi.org/10.1162/153244303322533223 (2013).
Kriegeskorte, N., Mur, M. & Bandettini, P. Representational similarity analysis - connecting the branches of systems neuroscience. Front. Syst. Neurosci. 2, 4 (2008).
Article PubMed PubMed Central Google Scholar
Twomey, T., Kawabata, K. J., Price, C. J. & Devlin, J. T. Top-down modulation of ventral occipito-temporal responses during visual word recognition. Neuroimage 55, 1242–1251 (2011).
Article PubMed PubMed Central Google Scholar
Han, Z., Zhang, Y., Shu, H. & Bi, Y. The orthographic buffer in writing Chinese characters: Evidence from a dysgraphic patient. Cogn. Neuropsychol. 24, 431–450 (2007).
Article PubMed Google Scholar
Dehaene, S., Cohen, L., Morais, J. & Kolinsky, R. Illiterate to literate: behavioural and cerebral changes induced by reading acquisition. Nat. Rev. Neurosci. 16, 234–244 (2015).
Article CAS PubMed Google Scholar
Nastase, S. A. et al. Attention Selectively Reshapes the Geometry of Distributed Semantic Representation. Cereb. Cortex 27, 4277–4291 (2017).
Article PubMed Google Scholar
Kim, A. & Lai, V. Rapid Interactions between Lexical Semantic and Word Form Analysis during Word Recognition in Context: Evidence from ERPs. J. Cogn. Neurosci. 24, 1104–1112 (2012).
Article PubMed Google Scholar
Binder, J. R. & Desai, R. H. The neurobiology of semantic memory. Trends in Cognitive Sciences 15, 527–536 (2011).
Article PubMed PubMed Central Google Scholar
Estes, Z., Golonka, S. & Jones, L. L. Thematic Thinking: the Apprehension and Consequences of Thematic Relations. In The Psychology of Learning and Motivation (ed. Ross, B.) 249–294 (Academic Press, 2011).
Binder, J. R. et al. Toward a brain-based componential semantic representation. Cogn. Neuropsychol. 33, 130–174 (2016).
Article PubMed Google Scholar
Jackson, A. F. & Bolger, D. J. Using a High-dimensional Graph of Semantic Space to Model Relationships among Words. Front. Psychol. 5, 385 (2014).
Article CAS Google Scholar
Dale, A.M. Optimal experimental design for event-related fMRI. Hum. Brain Mapp. 8, 109–114 (1999).
Grinband, J., Wager, T. D., Lindquist, M., Ferrera, V. P. & Hirsch, J. Detection of time-varying signals in event-related fMRI designs. Neuroimage 43, 509–520 (2008).
Article PubMed PubMed Central Google Scholar
Kriegeskorte, N. et al. Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey. Neuron 60, 1126–1141 (2008).
Article CAS PubMed PubMed Central Google Scholar
Law, S. P. & Leung, M.T. Structural Representations of Characters in Chinese Writing: Evidence from a Case of Acquired Dysgraphia. Psychologia 43, 67–83 (2000).
Google Scholar
Tzourio-Mazoyer, N. et al. Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain. Neuroimage 15, 273–289 (2002).
Article CAS PubMed Google Scholar
Kriegeskorte, N., Goebel, R. & Bandettini, P. Information-based functional brain mapping. Proc. Natl. Acad. Sci. USA 103, 3863–3868 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Weiwei Men and Jia-Hong Gao for technical support and Xiaoying Wang for helpful discussion. This project is funded by the National Basic Research Program of China Program (2013CB837300 and 2014CB846100), National Natural Science Foundation of China (31521063, 31700943, 61375116 and 91520202), the China Postdoctoral Science Foundation (2017M610791), the Fundamental Research Funds for the Central Universities (WK2350000001; 2017XTCX04), Beijing Brain Project (Z16110100020000, Z161100000216124, Z161100000216125), the Interdiscipline Research Funds of Beijing Normal University (Y.B.), National Program for Special Support of Top-notch Young Professionals (Y.B.) and Beijing Advanced Innovation Center for Future Education (BJAICFE2016IR-003).

Author information

Xiaosha Wang and Yangwen Xu contributed equally to the work.

Authors and Affiliations

College of Information Science and Technology, Beijing Normal University, Beijing, 100875, China
Xiaosha Wang & Jiacai Zhang
National Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China
Xiaosha Wang, Yangwen Xu & Yanchao Bi
Beijing Key Laboratory of Brain Imaging and Connectomics, Beijing Normal University, Beijing, 100875, China
Xiaosha Wang, Yangwen Xu & Yanchao Bi
Research Center for Brain-inspired Intelligence & National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, 100190, China
Yuwei Wang & Yi Zeng
Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, 200031, China
Yuwei Wang & Yi Zeng
National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, 230027, China
Zhenhua Ling

Authors

Xiaosha Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yangwen Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yuwei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Jiacai Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhenhua Ling
View author publications
You can also search for this author in PubMed Google Scholar
Yanchao Bi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.B., X.W. and Y.X. designed research; Y.X. collected data; X.W., Z.L., Y.W., Y.Z. and J.Z. analyzed data; Y.B. and X.W. wrote the paper.

Corresponding authors

Correspondence to Jiacai Zhang or Zhenhua Ling.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Xu, Y., Wang, Y. et al. Representational similarity analysis reveals task-dependent semantic influence of the visual word form area. Sci Rep 8, 3047 (2018). https://doi.org/10.1038/s41598-018-21062-0

Download citation

Received: 09 November 2017
Accepted: 29 January 2018
Published: 14 February 2018
DOI: https://doi.org/10.1038/s41598-018-21062-0

This article is cited by

Intersecting distributed networks support convergent linguistic functioning across different languages in bilinguals
- Shujie Geng
- Wanwan Guo
- Jianfeng Feng
Communications Biology (2023)
The Brain Connectome for Chinese Reading
- Wanwan Guo
- Shujie Geng
- Jianfeng Feng
Neuroscience Bulletin (2022)
A data-driven framework for mapping domains of human neurobiology
- Elizabeth Beam
- Christopher Potts
- Amit Etkin
Nature Neuroscience (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Word frequency and reading demands modulate brain activation in the inferior frontal gyrus

The visual word form area (VWFA) is part of both language and attention circuitry

Visual and linguistic semantic representations are aligned at the border of human visual cortex

Introduction

Results

Relationships among the theoretical/behavioral RDMs

Behavioral results in the fMRI experiment

Orthographic representation in the VWFA

Anatomically defined VWFA

Functionally defined VWFA

Task modulation effects

Semantic information in the VWFA

Taxonomic information

Thematic information

Subjective semantic rating

Word2vec

Semantic information encoded in the VWFA: Controlling for non-semantic confounding variables

Discussion

Methods

Subjects

Stimuli and fMRI procedure

fMRI acquisition and preprocessing

fMRI data analysis

Representational similarity analysis

Theoretical/behavioral RDMs

VWFA localization

RSA procedures for the VWFA

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Intersecting distributed networks support convergent linguistic functioning across different languages in bilinguals

The Brain Connectome for Chinese Reading

A data-driven framework for mapping domains of human neurobiology

Comments

Search

Quick links