Brain networks underlying the processing of sound symbolism related to softness perception

Kitada, Ryo; Kwon, Jinhwan; Doizaki, Ryuichi; Nakagawa, Eri; Tanigawa, Tsubasa; Kajimoto, Hiroyuki; Sadato, Norihiro; Sakamoto, Maki

doi:10.1038/s41598-021-86328-6

Download PDF

Article
Open access
Published: 01 April 2021

Brain networks underlying the processing of sound symbolism related to softness perception

Ryo Kitada^1,2,
Jinhwan Kwon³,
Ryuichi Doizaki⁴,
Eri Nakagawa⁵,
Tsubasa Tanigawa^5,6,
Hiroyuki Kajimoto⁴,
Norihiro Sadato^5,6 &
…
Maki Sakamoto⁴

Scientific Reports volume 11, Article number: 7399 (2021) Cite this article

2055 Accesses
6 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Unlike the assumption of modern linguistics, there is non-arbitrary association between sound and meaning in sound symbolic words. Neuroimaging studies have suggested the unique contribution of the superior temporal sulcus to the processing of sound symbolism. However, because these findings are limited to the mapping between sound symbolism and visually presented objects, the processing of sound symbolic information may also involve the sensory-modality dependent mechanisms. Here, we conducted a functional magnetic resonance imaging experiment to test whether the brain regions engaged in the tactile processing of object properties are also involved in mapping sound symbolic information with tactually perceived object properties. Thirty-two healthy subjects conducted a matching task in which they judged the congruency between softness perceived by touch and softness associated with sound symbolic words. Congruency effect was observed in the orbitofrontal cortex, inferior frontal gyrus, insula, medial superior frontal gyrus, cingulate gyrus, and cerebellum. This effect in the insula and medial superior frontal gyri was overlapped with softness-related activity that was separately measured in the same subjects in the tactile experiment. These results indicate that the insula and medial superior frontal gyrus play a role in processing sound symbolic information and relating it to the tactile softness information.

Sound symbolism processing is lateralized to the right temporal region in the prelinguistic infant brain

Article Open access 17 September 2019

Action sound–shape congruencies explain sound symbolism

Article Open access 29 July 2020

Sound symbolic congruency detection in humans but not in great apes

Article Open access 03 September 2019

Introduction

In modern linguistics, it is widely assumed that the acoustic features of a word are arbitrarily associated with its meaning^1,2. This assumption is supported by the fact that the same concept is expressed by different sounds in different languages. However, an increasing number of studies have shown that a non-arbitrary association between sound and meaning can exist in some words called sound symbolic words^3,4,5,6,7,8. For instance, a curvy round shape and spiky angular shape can be associated with the nonsense word “maluma” or “bouba” and with the nonsense word “takete” or “kiki”, respectively (the “bouba/kiki effect”)⁸. However, the neural mechanisms underlying this non-arbitrary mapping between sound and its semantic dimensions are not well understood.

Because sound symbolic words are associated with specific semantic dimensions, the processing of such words can involve the neural substrates for processing the corresponding semantic dimensions. For instance, if the sound symbolic words are associated with object shape and size, then the processing of these words can involve the neural substrates that associate the sound with the sensory processing of object shape and size. This idea is supported by findings which showed that congenitally blind individuals show weaker sound-shape associations^9,10,11 and a study that found that prelingual auditory deprivation reduced the bouba–kiki effect, although they performed above chance level¹². These findings raise the possibility that the processing of sound symbolic words associated with object properties involves sensory-modality dependent brain networks.

Previous neuroimaging studies have examined the brain networks that are involved in the processing of sound symbolic words^{7,13,14,15,16,17,18,19}. Among them, more recent neuroimaging studies have examined brain activity when the subjects conducted matching tasks between sound symbolic words and visually presented objects such as: matching between Japanese mimetic words and body gestures¹⁶; matching between the size of visual stimuli and sound “bobo”/”pipi”¹⁸; and matching between sound “bouba”/”kiki” and the spikiness/roundness of visual stimulus¹⁹. Two of these studies showed that the matching between sound symbolic words and visually presented objects involves the region in and around the posterior part of the superior temporal sulcus (e.g., the middle temporal gyrus¹⁸), though the exact location of activation varied among the studies^16,18. Kanero et al.¹⁶ proposed that this region is a part of the unique neural networks that process the sound symbolism. On the other hand, several studies also reported the activation of sensory-dependent networks in the processing of sound symbolic words. For instance, a study also reported that regions in the occipital cortex can be sensitive to incongruency between the visually presented object size and its sound¹⁸. Another study showed that the imagery of unpleasantness from pain-related mimetic words evoked activation of several brain regions including the anterior cingulate cortex, a part of the network of pain processing¹⁴. These findings partially support the hypothesis that the processing of sound symbolic words can involve sensory-dependent and sensory-independent networks in the brain.

The majority of the aforementioned neuroimaging studies have focused on the mapping between visually presented objects and sound symbolic words. Thus, it is unclear if the same neural substrates are involved in the matching between sound symbolic words and objects presented in other sensory modalities. In the present study, we focused our investigation on the neural correlates of association between sound symbolic words and object properties perceived by touch.

Previous neuroimaging studies have shown that specific sets of brain regions are engaged in tactile processing of object properties. Tangible object properties are organized into two major categories: macro-spatial and material properties²⁰. The former category, which includes the perception of shape, orientation, and location, needs some form of a spatial reference system (spatial coding)²¹. By contrast, the latter category, which includes roughness, softness, and temperature, is expressed as intensity (intensity coding). It has been demonstrated that distinct but overlapping brain networks are involved in the processing of macro-spatial and material properties^22,23,24. Specifically, activity in the parietal operculum (including the secondary somatosensory cortex, PO), insula, and occipital cortex is greater for texture perception than for perception of shape²³ and of dot location on a cardboard²⁴. The activation pattern in the parietal operculum and insula is related to the magnitude of perceived roughness^25,26 and temperature²⁷, though the ascending pathways for mechanical and thermal inputs differ. A recent functional magnetic resonance imaging (fMRI) study found that softness magnitude perceived by touch is represented in the network including the posterior insula, anterior insula, parietal operculum, and medial superior frontal gyrus²⁸. Collectively, these studies indicate that the insula and PO are key nodes of the network for tactile perception of material properties^29,30. Thus, if the sensory-dependent network plays a key role in the understanding sound symbolic information, the insula and PO can be also involved in the processing of sound symbolic words for material properties. Though a previous behavioral study showed that high pitched sounds are matched to haptically perceived angular shape and softness¹⁰, the underlying neural substrates have not been investigated. Thus, it remains unknown whether this network involving the insula is also associated with the mapping between tactile material information and sound symbolic words.

Another issue regarding the sound symbolic word is the familiarity with such words. If conventional sound symbolic words evoke brain regions that are related to sensory information, it can be based on the learned arbitrary rules or inherent non-arbitrary associations. Previous functional MRI studies used either conventional words^14,16 or unfamiliar words^18,19. However, to the best of our knowledge, the effect of familiarity of sound symbolic words has not been investigated. If a specific sound is associated with specific object properties, we can expect common brain networks involved in the processing of sound symbolic information, regardless of the familiarity.

In this study, we tested the hypothesis that the mapping between sound symbolic words and tactually perceived softness involves the insula/PO and medial superior frontal gyrus as the softness-dependent regions²⁸ and the region in and around the posterior superior temporal sulcus (pSTS) as the sensory modality independent region^16,18. To this end, we conducted a functional MRI study that involves the three tasks: matching task (main task), tactile task, and word softness-judgment task (word task).

The purpose of the matching task was to examine the neural correlates for the mapping between sound symbolic words and associated softness information. This design was adopted from previous studies on sound symbolic words^16,18,19 and visuo-haptic association of material information³¹. Specifically, the task design includes two factors: the match between tactile and sound symbolic information and familiarity of sound symbolic words (Fig. 1). We then compared the conditions in which tactually perceived softness is matched to sound symbolic words (congruent conditions) with the conditions in which tactually perceived softness and sound symbolic words are mismatched (incongruent conditions). The assumption is that the interaction of the tactile information and sound symbolic information occurs to compare the two types of information in the hypothesized brain regions. We manipulated the familiarity of sound symbolic words by using genetic algorithms³².

Additionally, the two supplementary experiments (tactile and word judgment tasks) were conducted to examine signals when only tactile or sound symbolic information was given to the subjects. We assumed that activity of the hypothesized regions shows response even when only one type of the inputs was given.

Results

Thirty-two healthy volunteers participated in the study. The matching task included five conditions: congruent pairs of tactile stimuli and familiar words, congruent pairs of tactile stimuli and unfamiliar words, incongruent pairs containing familiar words, incongruent pairs including unfamiliar words, and the low-level control condition. In the low-level control condition, only pseudowords were presented without tactile stimulation. The list of the sound symbolic words is available in the Supplementary material (Supplementary Table 1).

Task performance

Matching task

Table 1 shows the congruency ratings and response times in the matching task. Two-way repeated-measures analysis of variance (ANOVA) (2 levels of matching × 2 levels of familiarity) for rating revealed a significant main effect of matching [F(1, 31) = 1629.3, p < 0.001] and a significant main effect of familiarity [F(1, 31) = 6.02, p = 0.02]. The same analysis also showed a significant interaction [F(1, 31) = 105.8, p < 0.001] with the effect of matching on ratings was greater for the familiar condition than the unfamiliar condition. Nevertheless, the post-hoc paired t-tests with Bonferroni correction confirmed that the rating was greater for congruency than for incongruency pairs, regardless of the familiarity (p values < 0.001).

Table 1 Behavioral results.

Full size table

The same ANOVA for response time showed only a significant main effect of familiarity [F(1, 31) = 14.67, p = 0.001]. A significant main effect was observed neither for matching nor its interaction. Post-hoc paired t-tests with Bonferroni correction showed greater response time for pairs with unfamiliar words than for those with familiar words (p values < 0.05).

Word judgment task

Two-way repeated-measures ANOVA (2 levels of softness × 2 levels of familiarity) on rating showed significant main effects of softness [F(1, 30) = 2498.62, p < 0.001] and familiarity [F(1, 30) = 9.08, p = 0.005]. However the same ANOVA also showed a significant interaction between the two factors [F(1, 30) = 268.41, p < 0.001] with the difference of rating between “soft” and “hard” words in the familiar condition being greater than the same difference of rating in the unfamiliar condition. Paired t-tests with Bonferroni correction confirmed that softness ratings for words associated with softness were greater than those related to hardness, regardless of the familiarity (p values < 0.001) (Supplementary Table 2).

Tactile task

The behavioral ratings for the tactile task were shown in a previous study¹⁸. Specifically, we confirmed that the rating for the stimulus with higher compliance was greater than the stimulus with lower compliance in all pairs of stimuli.

fMRI results

Matching task

Main effects of matching

As hypothesized, the contrast of congruent conditions with incongruent conditions (congruency effect) revealed regions of significant activation in both the bilateral insula and medial superior frontal gyrus. Additionally, the same contrast revealed activation in the bilateral inferior frontal gyrus, left orbitofrontal cortex, bilateral anterior insula, right cingulate gyrus, and right cerebellum (Fig. 2, Supplementary Table 3). A significant effect was not observed in the posterior temporal regions (e.g., middle temporal gyrus). The opposite contrast revealed no significant activation.

Main effects of familiarity

The contrast of conditions with familiar words against conditions with unfamiliar words revealed regions of significant activation bilaterally in the angular gyrus, cuneus, insula, lingual gyrus, middle temporal gyrus, precuneus, superior frontal gyrus, superior occipital gyrus, superior temporal gyrus, and supramarginal gyrus. Moreover, the same contrast showed activation in the left cingulate gyrus, left fusiform gyrus, left parahippocampal gyrus, left postcentral gyrus, left precentral gyrus, right middle occipital gyrus, and right parietal operculum (Fig. 3A, Supplementary Table 4).

The opposite contrast revealed regions of significant activation in the bilateral angular gyrus, bilateral inferior occipital gyrus, bilateral middle frontal gyrus, bilateral middle occipital gyrus, bilateral superior frontal gyrus, bilateral superior occipital gyrus, bilateral superior parietal lobule, left inferior frontal gyrus, left insula, left supramarginal gyrus, right precuneus, and bilateral cerebellum (Fig. 3B, Supplementary Table 4).

Interaction between matching and familiarity effects

No significant activation was observed.

Tactile task

The contrast for parametric-modulation revealed activation in the left insula, parietal operculum, and medial superior frontal gyrus. As shown in Fig. 4A, the congruency effect in the matching task was overlapped with the softness-related activation in the tactile task in the anterior insula and medial superior frontal gyrus. The conjunction analysis (with conjunction null) using an inclusive masking procedure^33,34 confirmed significant activation in the left anterior insula and medial superior frontal gyrus (FWE corrected p values < 0.05). Figure 4B shows the data of three representative subjects.

Word judgment task

The contrast of judgment of sound symbolic words against baseline revealed multiple regions of significant activation over the whole brain, including the bilateral insula, medial superior frontal gyrus, and pSTS (Supplemental Fig. 1A). The contrast of the interaction between softness impression and familiarity [(Familiar_Soft–Familiar_Hard) − (Unfamiliar_Soft–Unfamiliar_Hard)] revealed no significant activation in the hypothesized regions; instead it revealed regions of significant activation in the bilateral lingual gyrus, bilateral middle occipital gyrus, bilateral cuneus, and left inferior occipital gyrus. Given the presence of significant interaction, we evaluated the effect of softness in each level of familiarity. As compared to the hard unfamiliar word condition, the soft unfamiliar word condition revealed greater activation in the bilateral lingual gyrus, bilateral middle occipital gyrus, bilateral cuneus, and left inferior occipital gyrus (Supplementary Fig. 1B–E, Supplementary Table 5). By contrast, no brain region showed greater activation in the soft familiar word condition than in the hard familiar word condition. Collectively, no significant effect of softness-hardness impressions in the hypothesized regions was observed in the whole brain analysis.

VOI analysis

Next, we conducted volume-of-interests (VOI) analysis to examine the activation patterns in the hypothesized regions (insula and the medial parts of the superior frontal gyrus) across the three tasks (Fig. 5). To minimize the selection bias that using the same dataset for selection and selective analysis cause (“double-dipping” problem)³⁵, we used two independent data sets. One data set originated from our previous study²⁸ and was used to localize peak coordinates in these hypothesized regions. Then, we examined activity in these coordinates in the other data set that included the tasks in the present study.

Matching task

Two-way repeated-measures ANOVA (2 levels of matching × 2 levels of familiarity) on the contrast estimates (relative to the control) revealed significant main effects of matching, with the congruent condition showing greater contrast estimates than the incongruent condition in both regions [F(1,31) = 6.2, p = 0.019 for the left insula; F(1, 31) = 25.2, p < 0.001 for the superior frontal gyrus]. The same analysis showed significant main effects of familiarity, with unfamiliar conditions showing greater activation than familiar conditions in both regions [F(1,31) = 5.7, p = 0.023 for the insula; F(1, 30) = 11.8, p = 0.002 for the superior frontal gyrus]. None of the ROIs showed a significant interaction effect (p values > 0.4).

Word judgment task

Two-way repeated-measures ANOVA (2 levels of softness × 2 levels of familiarity) on parameter estimates revealed significant main effect of softness with hard words producing greater activity than soft words in the insula [F(1, 30) = 6.3, p = 0.018]; main effects of familiarity with unfamiliar words producing greater activity in the all ROIs [F(1, 30) = 23.7, p < 0.001 for the left insula; F(1, 30) = 14.5, p = 0.001 for the superior frontal gyrus]. None of the regions showed significant interaction (p values > 0.1).

Tactile task

Two-way repeated-measures ANOVA (4 levels of softness) on the contrast estimates (relative to the control) revealed a significant main effect in all regions [F(3, 93) = 4.7, p = 0.004 for the insula; F(3, 93) = 7.1, p < 0.001 for the superior frontal gyrus]. Pairwise comparisons with Bonferroni correction showed that the contrast estimates for the hardest stimulus were lower than those for the second-hardest (Med-Hard) stimulus in both regions (p values < 0.01), lower than those for the softest stimulus in the left insula (p = 0.033), and lower than those for the second-softest stimulus (Med-Soft) in the superior frontal gyrus (p = 0.007). These results confirm that the insula and superior frontal gyrus are affected by the magnitude of tactually perceived softness.

Multi-voxel pattern analysis

The univariate anlaysis showed the matching effect in the insula nad medial superior frontal gyrus, whereas no such effect was found in the pSTS. To further examine whether the pSTS contains information about sound symbolic information and its matching effect with tactile information, we conducted a multi-voxel pattern analysis (MVPA) (Fig. 6). Based on our previous studies^16,18,28 the insula/PO, medial part of superior frontal gyrus, and pSTS were chosen as ROIs. All ROIs in all tasks showed significantly greater accuracy than chance level (p values < 0.05 Bonferroni corrected).

Discussion

In this study, we examined the brain networks engaged in the processing of sound symbolic information that are associated with object’s softness and hardness. While a distributed set of brain regions was affected by the congruency between perceived softness and sound symbolic words, the congruency effect was observed in or close to the region that showed the graded response to perceived softness in the anterior insula and medial superior frontal gyrus. This effect was similarly observed regardless of the familiarity (i.e., no interaction effect between familiarity and matching). The MVPA showed that the PO/insula contained information on congruency, the magnitude of softness perceived by touch, and the magnitude of softness associated with sound symbolic words.

This study used different words between conditions. The matching task involved the same sets of words and tactile stimuli between the congruency and incongruency conditions. The subtraction of the incongruency effect from the congruency effect should cancel out the difference of these inputs. Thus, it is unlikely that the observed congruency effect is explained merely by the difference in stimuli. We used two independent data sets for the VOI and ROI analysis: one data set was used to localize the brain regions for tactile softness perception in VOI and ROI analysis from our previous study²⁸ and the other data set which consist of the three tasks (matching task, tactile task, and word judgement task) were used to evaluate activity in these identified regions. This procedure follows the policy of selective analysis³⁵ as used in previous studies^18,36,37. Thus, we have minimized selection bias in our procedure.

To the best of our knowledge, this is the first neuroimaging study that depicted the interaction of brain activity between sound symbolic words and material properties perceived by touch. More specifically, previous neuroimaging studies highlighted the involvement of the pSTS in the matching between visually presented body action and mimetic words¹⁶ and between the size of visual stimuli and auditorily presented onomatopoeia¹⁸. In contrast, the present study revealed that the insula and medial superior frontal gyrus are associated with the congruency effect between tactile softness perception and sound symbolic words. This network contained information on the perceived magnitude of softness, as well as the congruency effect. This result indicates that the brain network that processes information of perceived softness is also engaged in processing the associated sound symbolic words. This finding is consistent with the idea that the processing of sound symbolic information involves the sensory-modality dependent regions. For instance, the onomatopoeia of pain can evoke brain activation in the anterior cingulate cortex, which is considered a part of the pain processing network¹⁴. The comparison of visual sizes of stimuli with sound symbolic words activated the occipital lobe as well as the pSTS¹⁸.

Within the regions showing the congruency effect, the left insula contained some information about sound symbolic words, as evidenced by the VOI and MVPA analyses on the word judgment task. Thus, it is possible that the insula receives the information about softness perceived by touch and softness associated with sound symbolic words, causing the interaction between the two types of information. This speculation is fit to the concept of multisensory integration that occurs in a region with signals of each sensory modality^38,39.

We also found the congruency effect in other prefrontal regions, consistent with the previous finding of an intermodal interaction effect in the prefrontal cortex^6,7. In our stimuli, the sounds /k/ and /g/ were associated with words related to hard referents (e.g., kada-kada and gai-gai), whereas the sounds /p/ and /f/ were associated with words related to soft objects (e.g., funo-funo and yapu-yapu). To confirm the associated tactile softness, some subjects reported that they covertly produced the sounds of the presented words. Thus, activation in these regions may be partly associated with such heuristics. This speculation is consistent with the perspective that the anterior insula and inferior frontal gyrus are associated with covert articulation⁴⁰, as well as with previous findings that this region is also sensitive to matching between picture and words that do not contain sound symbolism^41,42,43. The congruency between the impression of softness from covertly-produced sounds and tactile information may become more salient, leading to activation in the insula and the medial areas in the superior frontal gyrus. This idea is in accord with the view that the anterior insula and medial prefrontal cortex form the network for salient stimuli (saliency network) ^44,45.

The MVPA also showed that the pSTS contains information about congruency. This indicates that the posterior temporal region is also a part of the network in matching between tactile stimuli and sound symbolic information. However, contrary to the findings in previous studies^16,18, this region showed a nonsignificant effect in the mass-univariate analysis. One interpretation for this weak effect is the difference in the stimuli. Again, the comparison of sensory stimulus with the sound symbolic information should occur in the regions that receive inputs from each related network. For instance, Kanero et al.¹⁶ used body actions as the referents of mimetic words. The pSTS is a key node of the network of action understanding^46,47,48 as well as language processing^48,49; hence, a suitable area for the matching effect. Likewise, other studies have shown an interaction effect in a distributed set of the occipital and temporal cortex, which is ideal for auditory-visual integration^18,19. By contrast, our study involved tactile softness perception, which involves a network of frontal regions rather than the posterior temporal region²⁸. Thus, it is possible that the pSTS plays a minor role in the matching between tactile stimuli and sound symbolic words, although it can be a part of the engaged network. Thus, further research is necessary to examine to what extent the pSTS plays a modality-independent role in processing sound symbolic information.

Familiar and unfamiliar sound symbolic words caused different patterns of brain activation in the matching between tactile and symbolic word information. Thus, our results indicate that, although mapping between tactile and sound symbolic words occurs in the same frontal regions, the familiarity affects the activation of cortical networks. The familiar sound symbolic words evoked activation in the brain regions including the infero-medial prefrontal cortex, angular gyrus, and posterior cingulate gyrus. This network for the recognition of familiar sound symbolic words is highly similar to that in the results of a meta-analysis on familiarity⁵⁰. On the other hand, unfamiliar sound symbolic words showed stronger activation in a different set of brain regions. Our behavioral results showed that the effect of matching on ratings was greater for the familiar condition than the unfamiliar condition, and the response time for unfamiliar words was greater than that for familiar words. These findings indicates that it requires greater task demand to interpret unfamiliar words than familiar words. Thus, it is possible that greater activation in the unfamiliar condition may be due to the increased task demand to process novel information.

Finally, it is worth noting two interpretational issues. First, the word judgment task showed a higher signal for hardness than for softness in the left anterior insula, whereas activity in the tactile task was positively graded response to softness. Even though this region contains information about softness associated with the sound symbolic word, the similarities and differences between tactile and sound symbolic representation of softness in this region remain unclear. Thus, further research is necessary to address this point. Second, it is not clear to what extent our result can be generalized to tactile tasks, since we used a limited set of sound symbolic words and a single tactile stimuli. This is because fMRI studies with stringent control of tactile stimuli are technically challenging and it is difficult to use multiple object properties in one experimental setup (e.g., both roughness and softness). Future studies should examine to what extent the matching effect is generalized by using other sound symbolic words and tactile stimuli with perceptual dimensions other than softness.

In conclusion, the insula and the medial parts of the superior frontal gyrus, which were associated with softness magnitudes perceived by touch, showed a congruency effect between softness perceived by touch and softness associated with sound symbolic words. This result indicates that these regions constitute nodes of the network for mapping sound symbolic information onto tactile material information. In contrast to the previous findings on the neural correlates for sound symbolic words, our finding highlights the role of the nodes in the prefrontal cortex for the network of the matching between sound symbolic information and tactile material information.