Abstract
We examined theories of cross-cultural differences in cognitive style on a sample of 242 participants representing five cultural groups (Czechia, Ghana, eastern and western Turkey, and Taiwan). The experiment involved immersive virtual environments consisting of two salient focal objects and a complex background as stimuli, which were presented using virtual reality headsets with integrated eye-tracking devices. The oculomotor patterns confirmed previous general conclusions that Eastern cultures have a more holistic cognitive style, while Western cultures predominantly have an analytic cognitive style. The differences were particularly noticeable between Taiwan and the other samples. However, we found that the broader cultural background of each group was perhaps just as important as geographical location or national boundaries. For example, observed differences between Eastern (more holistic style) and Western Turkey (more analytic style), suggest the possible influence of varying historical and cultural characteristics on the cognitive processing of complex visual stimuli.
Similar content being viewed by others
Introduction
Past research in cross-cultural psychology suggests the existence of systematic differences in perceptual and cognitive domains across nations and cultural groups. This area of research covers a variety of specific cognitive processes, such as object categorization, thinking about contradictions, object-background separation, selective attention to objects and background in complex scenes, processing of global and local features of objects, attention to changes in the visual field, and many others1.
Following this area of research, a theory of holistic and analytic cognition has been developed2. The theory distinguishes two relatively stable modes of cognitive processing, referred to as holistic and analytic cognitive styles, and states that certain individuals should process information from the world prevalently in one of these two distinctive ways. According to this theory, holistic individuals (compared to their analytic counterparts) (a) use more intuitive and less rule-based strategies in object categorization3, 4, (b) use dialectical thinking instead of rules of formal logic, (c) are less effective in separating objects from the background5 (d) focus more on the background and the relationships between objects and less on the salient (focal) objects and their attributes6, (e) focus more on the global features than the local features of objects7, and (g) are more sensitive to contextual changes than focal object changes8.
Our current research focuses on one of these processes: selective attention to objects and their surroundings (background), sometimes also referred to as context sensitivity. Other authors have previously examined cross-cultural differences in context sensitivity using natural scenes (free-viewing paradigm) in combination with eye-movement recording9,10,11. Some research has found distinct differences in eye-movement patterns between Chinese and Americans9 and Chinese and Africans10, whereas other studies have supported a null hypothesis on the lack of any systematic cultural differences in people viewing scenes11.
In the present study, we sought to replicate the results of previous experiments on cultural differences in context sensitivity in complex visual environments and to extend this design by transferring the experimental procedure into immersive virtual reality with 3D compositions of virtual environments. This type of adaptation also has two potential benefits over the presentation of 2D scenes: (1) hypotheses on the third dimension of the visual world (depth) can be formulated, and (2) given the three-dimensional nature of the visual world, such research can be considered more ecologically valid for everyday visual experience. In this manner, we are responding to the challenge posed by Elvio Blini and his colleagues, who argued that “depth is a fundamental, yet overlooked, dimension of human perception” and that “future studies in vision and perception should be depth aware”12.
We created a series of static 3D scenes, each consisting of two focal objects placed in a semantically congruent background environment. Because the focal objects were designed to be visually salient, we hypothesized that they would be of relatively greater interest to analytic individuals. In accordance with i.a. Brouwer13, Cryns14, and Pontius15, we therefore expected that these participants would spend relatively more time gazing at the focal objects (focal object dwell time) than they would on the background and less salient objects located within the background, whereas holistic individuals would spend relatively less time on the focal objects.
To more closely investigate the possible differences in perception of depth, one of the focal objects was located nearer to the viewer in approximately half of the 3D scenes (for more information, see the “Methods” section). In these scenes, we hypothesized that the more distant focal object would likely blend more with the background for the analytic participants and that they would therefore dwell less on the distant focal object than the nearer focal object.
A relatively intricate part of the present study was determining a priori which individuals from which specific countries could be expected to exhibit predominantly holistic or analytic patterns in context sensitivity. Most of the research in the field has focused on comparisons between “Western” (USA, Europe) and “Eastern” (Japan, China) samples, with Western samples being relatively consistently more analytic than Eastern samples (for a review, see1). However, while this conceptualization might be understandable when comparing two culturally distant samples, it doesn't have to be necessarily understood as dichotomous when comparing multiple less distinct cultures.
Quite the contrary, the East–West dichotomy seems to be rather reductionist and potentially distorting. Moreover, it ignores other potentially influential factors such as characteristics of the environment, socioeconomic status, religion, or other individual-level factors. This is especially valid for large and culturally heterogeneous countries or countries that are not confined to North America or Asia. For example, Chatterjee16, demonstrates this in reference to India which is commonly perceived as “Eastern” and collectivistic. She argues that India is a very diverse country where some areas tend toward collectivism and others toward individualism. In order to reflect this approach we included two purposefully selected samples from Turkey, the geographically largest and the most populous of the countries included in this research (for details, see the “Research sample” section).
Furthermore, in line with suggestions made by Matsumoto and Yoo17, we explored the possible influence of individual-level variables such as the size of the municipality the participant lives in (measured by population), gender, age, experience with living in other countries, number of siblings, socioeconomic status, and study area to “unpack” their possible influence on dependent variables.
In summary, even though we reflect the limits of the original theoretical foundations within the analytic-holistic paradigm, we still find the above-mentioned cross-cultural theories of cognitive style as viable and able to detect differences in cognitive processes even at a more detailed level and with a wider spectrum of cultures.
Our research sample was composed of five cultural sub-samples from Czechia, Taiwan, Ghana, eastern Turkey, and western Turkey (Fig. 1). Based on previous research, we expected the Czech participants to be, on average, the most analytic of all the samples6, 18, whereas the Taiwanese, as East Asians, were the most holistic9, 10. The other three cultural samples should have fallen somewhere between these two samples in terms of context sensitivity. However, in the absence of any eye-tracking research that has applied complex scenes with a free-viewing paradigm to directly compare samples from the Middle East and Sub-Saharan Africa, we were unable to form any strong hypotheses about the holistic/analytic patterns of perception in these three samples.
When we included Turkey in the research, we were careful not to neglect the cultural variance within the country’s borders; much data on the socioeconomic development index (data from the Ministry of Development in19), fertility rate20 and ethnic distribution21 according to province indicate significant cultural differences between eastern and western Turkey. Since research examining other research samples and methods22, 23 has also suggested that participants from eastern Turkey are more holistic than their fellow citizens originating from western Turkey, we considered these two groups as two separate samples that exhibit the above-mentioned expected differences.
Methods
Experimental procedure
Research ethics
The Research Ethics Committee of Masaryk University reviewed the application to conduct the research project and has approved this project (Proposal No.: 0257/2018) to be conducted on 13 March 2019. The nature of methods used, and their administration were performed in accordance with the relevant guidelines and regulations of Masaryk University and national laws. The necessary approvals and permissions to conduct this research were obtained from participating Universities in Ghana (University of Ghana), Turkey (Adnan Menderes University), and Taiwan (National Chengchi University). All local guidelines were followed during the data collection. Informed consent was obtained in writing from all participants.
Virtual reality and eye-tracking hardware and software: HTC Vive Pro head-mounted displays were used with Pupil Labs binocular add-ons to collect eye-tracking data with an estimated accuracy of 0.84° and precision of 0.16°24. The sampling frequency of the eye-tracker was set to 90 Hz to correspond to the default refresh rate of the head-mounted display. The experiment (stimuli sequence, presentation timing) was controlled with a set of utility scripts in the Toggle Toolkit25.
Procedure
Data gathering was conducted by a trained administrator under the supervision of one of the members of the research team. The experiment was conducted in several steps (Fig. 2), as follows: (1) participants read information about the research and signed the informed consent form; (2) using the Hypothesis software26, participants were asked to fill in a sociodemographic questionnaire (size of the municipality participant lives in, gender, age, experience with living in other countries, number of siblings, socioeconomic status, study area, and history of ADHD; for details see Supplement 1); (3) participants underwent a 6-min-long virtual environment adaptation session in a form of stereoscopic video clip to become accustomed to the potentially new experience in virtual reality and to learn the basics of controlling the virtual environment (e.g., moving their heads, an action some participants commonly avoid doing during the first few minutes in the virtual environment; for more information about this topic, see27); (4) the experiment’s session in virtual reality. Both the adaptation and experimental sessions were conducted with participants seated, and with no extra movement, constraints were imposed on them.
Each session of the experiment started with the calibration of the eye-tracker using the Pupil Labs native calibration application (6-point calibration with three levels of depth). Although the native calibration application provides feedback to the experiment’s administrator about successful calibration, we added a validation procedure to let us check the accuracy of eye movement during validation. The administrator instructed participants to fixate objects on the validation screen and in the case of insufficient accuracy the eye-tracker was recalibrated. If a particular participant repeatedly failed to pass the validation process, data collection for that participant was discontinued. Furthermore, during the data cleaning stage, we incorporated the validation data to ensure the integrity of our dataset (for more details, see the “Data analysis and cleaning procedure” section). After successful calibration and validation, the participants were presented with 30 trials of static virtual environments. The participants were instructed to “view the scenes freely” and informed that “they will be asked for an evaluation of some of the scenes afterward”. Before the presentation of each scene (4 s per trial), a fixation cross was displayed for 500 ms. After the presentation of all trials, respondents were asked to a) describe in their own words, and b) evaluate how much they liked (very pretty–very ugly) in a subset of the presented scenes. The responses are not used in this study.
Materials
To ensure cross-cultural equivalence and to avoid biases on item level caused by inappropriate adaptation28, all verbal parts of the experiment (i.e. the questionnaire, the instructions) were back-translated by two independent native speakers29 from various cultural backgrounds30. The instructions were as simple as possible to prevent or at least minimize any possible administration biases.
The stimulus material consisted of 30 three-dimensional virtual environments (two for training and 28 for testing). Trials were presented in pseudo-random order (a priori randomized and presented in the same order to all participants and groups). The virtual environments were created in Unity Engine using freely available 3D assets31. Each virtual environment consisted of two visually and semantically salient focal objects (e.g., cars and car wrecks, barrels, vases, motorcycles) placed in a complex background (typical background components included sky, horizon, vegetation, rocks, and other less salient man-made objects). Both the general virtual environment (background) and the objects were validated in cooperation with the local experts with the aim to minimize possible cultural influences such as familiarity with the scenes and the objects—all parts of the scene had to be commonly present in all cultures included in this research. Such a step of involving scholars from local cultures should increase the construct equivalence32,33,34.
In accordance with Brouwer’s note13 that too much detail in stimuli may distract the observers’ attention, we deliberately opted for simpler graphics than the utilized immersive virtual reality technology allows for (e.g., photorealistic digital twins of buildings35). Virtual environments also lacked locally specific distinct landscape features and culturally specific objects that could be completely unfamiliar to some cultural groups. On the contrary, the design aimed at commonly used objects.
We used two general virtual environment types in the experiment: (1) scenes in which both focal objects were placed at the same distance from the viewer (Fig. 3 upper part; 16 trials); (2) scenes in which one of the focal objects was placed farther from the viewer (Fig. 3 lower part; 12 trials). Each stimulus was presented for four seconds.
Eye-tracking data processing
Using Unity, we defined object colliders for each focal object since the experiment involved analysis of dwell times on various objects (focal objects and background). These colliders are the 3D equivalents of regions of interest in 2D eye-tracking studies. To correctly classify gaze points, colliders were defined as convex meshes to 110% of the size of the objects (i.e., the colliders did not directly occupy the object’s contours but filled the small space around the objects, Fig. 4). Gaze points on all other objects were coded as background gazes. Other research teams (e.g.,36) have previously adopted a similar procedure for studying object hits in virtual reality eye-tracking and provided reliable data while also eliminating the potential inaccuracy encountered with virtual reality eye-trackers.
In the next step, we executed a data cleaning script that filtered corrupt or inaccurate data. Since 50 ms is commonly recognized as a threshold for any cognitive recognition37, we considered gazing at an object for more than 50 ms a visit and rejected gazes below this limit. If a visit was followed by a very short hit (less than 32 ms) on another object followed by a return to the original object, the duration of the gaze on this object before and after the interruption was merged into one visit to prevent data distortion caused by measurement inaccuracy. Dwell times were calculated for each object category (focal object 1, focal object 2, background) as the sum of the duration of all visits. Data loss (blinks, closed eyelids, eye-tracking desynchronization, etc.) was computed per stimulus in addition to focal object and background dwell time.
Data analysis and cleaning procedure
Before data collection, we conducted a power analysis for one-way ANOVA in G*Power (v3.1.9.738) with the following settings: f = 0.25, α = 0.05, 1 − β = 0.80, number of groups = 5. The effect size was determined according to previous research by Duan10, who reported large effect sizes for eye-movement data (ηp2 ranging between 0.12 and 0.77). We decided on a relatively more conservative strategy at the medium effect size threshold. The results specified a minimum sample size of 200 participants (40 per group) to achieve 80% statistical power.
As mentioned above, we gathered virtual reality data from five cultural samples: Czechia, Ghana, Taiwan, eastern Turkey, and western Turkey. All participants were students enrolled at local universities. The participation was advertised in university courses and via student unions. The only exclusion criterion was the presence of severe uncorrected eye defects. Considering the potential experimental mortality typical of this kind of research (e.g., poor data quality and data loss, not all participants completing the experiment), we obtained 297 participants in total. The data were collected as part of a larger test battery that investigated cross-cultural differences in visual perception and included other tasks such as Navon Figures based methods7, and complex 2D scenes6. Results of those methods have not been published yet. For various reasons, some participants from each country could not be measured or included in the analysis, mainly because of impracticable pupil detection or nausea during the experiment. In this stage, 10 participants were excluded because of technical issues in the data collection process.
To ensure good data quality, we performed a data cleaning procedure. In the first step, we calculated the average data loss from the data loss per stimulus (% of missing eye-tracking records). The threshold for removing a participant was set arbitrarily at 5% of missing records. Twelve participants were removed because of high data loss. The average data loss in the remainder of the sample was 0.6%. The second step involved a qualitative assessment of the validation procedure by two independent evaluators (researchers with high expertise in eye-tracking technology) who rated the quality of each individual validation. The records from the validation scene of each participant were exported by a technical worker and subjected to visual inspection with an emphasis on the accuracy of gaze points on target objects and the absence of undesirable artifacts. Only in the case of agreement on the sufficient quality of the recording were the participants' data accepted for analysis. An additional 21 cases were removed based on poor validation. In the final step, we removed 12 participants based on their reported history of ADHD. The entire cleaning procedure resulted in a dataset of 242 participants (for more details see the next section).
To test the first hypothesis, we performed a one-way ANOVA. First, we summed the duration of visits on both focal objects to obtain a single aggregate focal object dwell time per stimulus; we then calculated the mean focal object dwell time across all 28 test trials. Then, we verified the ANOVA assumptions. Because Levene’s test indicated unequal variances across groups (F = 5.44, p < 0.001) and the Shapiro–Wilk test suggested normally distributed residuals (W = 0.99, p = 0.65), the Welch’s degrees of freedom correction for one-way ANOVA was conducted. We also applied Games–Howell post-hoc tests with Tukey HSD correction.
To test the second hypothesis, we performed a mixed ANOVA. Only the twelve stimuli with focal objects at varying distances from the viewer were subjected to this analysis. We calculated the mean dwell time for both types of focal objects. Regarding ANOVA assumptions, the data indicated unequal variance across groups (F = 6.82, p < 0.001), sufficient sphericity (p > 0.999), and non-normal distribution of residuals (W = 0.99, p < 0.001). Three cases (one from Ghana, one from eastern Turkey, and one from western Turkey) were identified as outliers based on a visual inspection of the quantile–quantile plots and consequently removed from the analysis. A repeated check of the ANOVA assumptions suggested unequal variance across groups (F = 6.86, p < 0.001), spherical data (p > 0.999), and normal distribution of residuals (W = 9.99, p = 0.111). Since Welch's correction is not implemented for mixed ANOVA and at the same time ANOVA is relatively robust for violation of equality of variances, we conducted mixed ANOVA with focal object dwell time as a dependent variable, cultural group as a between-subject factor and type of object (type: closer, further) as a within-subject factor. To overcome issues with inequality of variance, omega squared effect size is reported. All p-values were adjusted using the Holm–Bonferroni method.
We analyzed the data in R using the following packages: afex, effsize39, ggforce, ggdist40, gghalves, lme4, lsr, performance41, psych42, rstatix, rmisc, and tidyverse. The data matrices and R code containing the data cleaning procedure are available online at the URL https://osf.io/2kqts/.
Research sample
The entire data cleaning procedure resulted in a dataset composed of 242 participants representing five cultural samples: Czechia (42), Ghana (39), Taiwan (51), Turkey East (56), and Turkey West (54). See Table 1 for per-country descriptives of gender and age. For detailed sample descriptives (e.g., socio-economic status, number of siblings, size of municipality of birth, study area), see Supplement 1.
Results
Overall average dwell on focal objects
In the eye-movement analysis, we focused on visits and the respective dwell time on focal objects. Figure 5 shows per-country distributions of the mean focal object dwell time. The means and confidence intervals for each country are summarized in Table 2.
Next, we tested for differences between cultural groups. ANOVA revealed a statistically significant effect of the cultural group on the mean of focal object dwell time with a large effect size, F(4, 108.84) = 73.93, p < 0.001, ω2 = 0.386. Post hoc tests (Table 3) indicated that the Taiwanese spent significantly less time observing focal objects than participants from all the other countries. Participants from western Turkey spent significantly more time observing focal objects than all the reported groups apart from Czechia. No significant difference was observed between participants from eastern Turkey and Czechia or Ghana. Participants from Czechia spent significantly more time observing focal objects than participants from Ghana. All significant pairwise differences showed large effect sizes and insignificant comparisons indicated weak to medium effect sizes. Thus, this hypothesis was partially supported. See the Supplement for comparison of dwell time on focal objects and background.
Dwell time on closer and further objects
In the next step, we examined the potential role of depth in relative focus on focal objects placed closer or further away from the viewer. See Fig. 6 and Table 4 for details.
We performed a mixed ANOVA. The main effect of the country was statistically significant, with a large effect size F(4, 234) = 34.72, p < 0.001, ω2 = 0.103, therefore supporting the results of the previous one-way ANOVA. The main effect of the type of object (i.e., object distance from the viewer) was also statistically significant, with a medium effect size F(1, 234) = 8.84, p < 0.001, ω2 = 0.008, suggesting that if we investigate all the participants regardless of their country, on average they spend more time on objects which are closer than further. Finally, the interaction between cultural group and object type was also significant, with medium effect size F(4, 234) = 4.71, p < 0.001, ω2 = 0.009.
Since the interaction was significant, a more detailed analysis of the results was conducted via simple main effects. The simple main effect of cultural group was significant for both closer, F(4, 234) = 28.00, p < 0.001, η2g = 0.324, and further objects F(4, 234) = 26.30, p < 0.001, η2g = 0.310. For these differences, however, we did not state any hypotheses, and therefore we did not perform additional post-hoc tests. The simple main effect of the focal object position was significant for Turkey East and Turkey West participants, with medium to large effect sizes. In both of these cultural groups, the participants gazed at the closer focal objects for a longer time. No significant difference was evident in the other three cultural groups, with negligible effect sizes (see Table 5 for details). Since the type of focal object was a binary variable, no post-hoc tests were performed. Overall, the hypothesis concerning systematic cross-cultural differences in depth perception was not fully supported since they were observed only among participants from both Turkish groups.
Dwell on objects and other individual-level variables
In the last step, we performed an exploratory analysis of possible influence of other variables on the dwell time on focal objects. We performed ANCOVA with dwell time on focal objects as the dependent variable, culture as the independent variable, and size of the municipality the participant lives in, gender, age, experience with living in another country, number of siblings, socioeconomic status, and study area as covariates. Apart from a small to moderate effect of gender (p = 0.010, ηp2 = 0.051), no other variable was in a significant relationship with the dependent variable. The main effect of gender is associated with a decrease in the average dwell for females compared to males. The model with covariates showed the same pattern of differences in focal object dwell time across countries as the basic model (for full results, comparison of models and descriptives by country and gender, see Supplement 1).
Discussion
The paper presents the results we obtained from a study of the cognitive processing of complex visual stimuli in a sample of five cultural groups native to Czechia, Ghana, Taiwan, eastern Turkey, and western Turkey. We used immersive virtual environments as stimuli and an eye-tracking device to measure the participants’ oculomotor behaviors to assess the distribution of attention during a presentation of several virtual environments. Recent similar studies using 2D stimuli were inconclusive on the cross-cultural variance in focusing on salient objects (analytic cognitive style) or on the scene in its entire complexity (holistic cognitive style)6, 7, as originally reported by Masuda and Nisbett8.
2D images have been previously questioned as suitable proxies for real objects43), whereas 3D representations in an immersive virtual environment appear to provide a potential alternative for further research. Sharing the optimism of de la Rosa and Breidt44 on the potential of using virtual reality to enhance the quality of psychological research, we transferred two-dimensional scenes into a three-dimensional virtual world to provide greater ecological validity45, 46 and the option to include the aspect of depth make virtual reality a suitable tool for examining the aforementioned theories of cross-cultural differences.
In the first presented analysis, which examined the distribution of attention between visually salient focal objects and complex background, the results partially supported the geographic distribution of the cognitive styles we expected according to the assumptions by Masuda and Nisbett8. Taiwanese participants spent significantly less time on focal objects than on the rest of the scene, an observation that supports their position as typical representatives of holistic cognitive style. This difference was convincing, especially compared to participants from Czechia, or western Turkey. Interestingly, however, western Turkey was distinct, proving to be the most analytic sample.
We achieved the aim of capturing the possible cultural differences within the Turkish sample: the observed differences in oculomotor behaviour between both Turkish samples agreed with the previously detected within-country differences in cognitive style4, 23, 47. These differences might be explained by the above-mentioned present-day socio-demographic differences in Turkey and other historical factors. Asia Minor has been a border between Mediterranean and Eastern influences for centuries, and although the western area was under the continuous influence of the Byzantine Empire until 1453, the eastern area experienced significant influence from Persian, Arab, and Seljuk expansion. Differences can also be observed in the second half of the twentieth century, when a large portion of the workforce heading for West Germany was recruited from West Anatolia, Istanbul, and Ankara48. A large portion of these workers and their second-generation offspring remigrated back to their homeland which might have reciprocally influenced their home culture.
Surprisingly, the Czech Republic, which was hypothesized as the most analytical, was not distinct from the other studied samples. The object/background attentional patterns for the Czech sample were very similar to the patterns of the sample from Turkey. Whether forced collectivization or other historical phenomena may have caused this variance in central and western European countries in cognitive processes as suggested by Varnum49, the underlying perception and evaluation of complex stimuli remains a question. The present study, however, reports some of the first research in the field, comparing the oculomotor patterns of participants beyond commonly studied samples using diverse samples from central Europe, western and eastern Asia, and Africa, and applying methods in a 3D virtual environment50. It supports the original theory stating that cultural background affects visual perception, and that culture may be understood as a condensed experience that determines individuals’ perceptual schema51, 52 that drives the exploration activity in the top-down direction53. In the second analysis concerning the differences in processing scenes with focal objects at varying distances, both Turkish samples spent relatively more time looking at closer focal objects. No statistically significant difference of this kind was observed in samples from Czechia, Ghana, and Taiwan.
In summary, we can draw three main conclusions from the present study and the use of virtual reality combined with eye-tracking. A person’s cultural group plays a role in visual attention distribution in virtual environments, resp. in their relative attention to focal objects. Second, the findings of significant differences within both Turkey samples underscore the importance of not thinking about countries as culturally monolithic phenomena but being aware that aggregation of data at the national level may camouflage meaningful differences at the within-culture level (see54).
Finally, third, there are notable differences in perception of depth (and respectively relative attention to closer and further objects) across the countries, but they are not systematic and easily interpretable at this point. Whereas in Taiwan, Czechia, and Ghana there were practically no differences in attention to closer and further objects detected (differences between 5 to 25 ms), in the case of both Turkish samples they were distinct (145, resp. 174 ms). We find this difference in the patterns of observation of closer and further objects quite interesting, moreover, both cohorts significantly differ in the overall attention to focal object vs. background.
Our research is not without limitations. Generalization of the results is limited by the novelty of the technology: to eliminate the differences in digital literacy of the participants in the study, we selected a student population in which we expected less pronounced differences in digital skills. This particular generation is less likely to have technology-related biases, however, at the same time is most globalized and probably demonstrates a lower level of the culturally based differences the study was aiming at.
The scenes, used as visual stimuli, contain a variety of smaller and larger objects, making them complex in terms of content. However, it is important to distinguish between the complexity of the scene's content and the photorealistic quality of its rendering. In our study, we focused on the former, emphasizing the diversity and arrangement of visual elements rather than striving for photorealism. Manipulation with the level of photorealism of 3D environments55,56,57 and observation of the effects of this manipulation on the participants' perception and its cross-cultural specifics could offer intriguing prospects for further research.
Apart from this, further cross-cultural eye-tracking research should focus on more controlled manipulations with the objects within the virtual visual field related to monocular and binocular depth cues58. For example, on perception and interpretation of conflicting cues, changes in spatial position, and perception of moving objects.
Finally, an important limitation of this study concerns the operationalization of culture within our cross-cultural research framework. We acknowledge that the issue of equating location with culture is a persistent challenge in quasi-experimental laboratory studies of this nature. While exploring more refined cultural subgroups or local cultures within each studied region might offer valuable insights, it would necessitate larger samples and the inclusion of established culture measurement scales, such as individualism/collectivism or independent/interdependent self-construal, as explanatory variables. However, our previous attempts at validating these methods encountered difficulties28, 59, and we found it necessary to prioritize further developments in culture operationalization methods for more accurate cross-cultural comparisons.
Data availability
The dataset, data-analytic scripts (in R), and supplementary material are accessible in the Open Science Foundation (OSF) repository at the following URL: https://osf.io/2kqts/. This study was not preregistered.
References
Nisbett, R. E. & Masuda, T. Culture and point of view. PNAS 100(19), 11163–11170 (2003).
Nisbett, R. E., Peng, K., Choi, I. & Norenzayan, A. Culture and systems of thought: Holistic versus analytic cognition. Psychol. Rev. 108(2), 291–310 (2001).
Chiu, L. H. A cross-cultural comparison of cognitive styles in Chinese and American children. Int. J. Psychol. 7(4), 235–242 (1972).
Norenzayan, A., Smith, E. E., Kim, B. J. & Nisbett, R. E. Cultural preferences for formal versus intuitive reasoning. Cogn. Sci. 26(5), 653–684 (2002).
Kühnen, U. et al. Cross-cultural variations in identifying embedded figures: Comparisons from the United States, Germany, Russia, and Malaysia. J. Cross. Cult. Psychol. 32(3), 366–372 (2001).
Čeněk, J., Tsai, J. L. & Šašinka, Č. Cultural variations in global and local attention and eye-movement patterns during the perception of complex visual scenes: Comparison of Czech and Taiwanese university students. PLoS ONE 15(11), 1–22 (2020).
McKone, E. et al. Asia has the global advantage: Race and visual attention. Vis. Res. 50(16), 1540–1549 (2010).
Masuda, T. & Nisbett, R. E. Culture and change blindness. Cogn. Sci. 30(2), 381–399 (2006).
Chua, H. F., Boland, J. E. & Nisbett, R. E. Cultural variation in eye movements during scene perception. PNAS. 102(35), 12629–12633 (2005).
Duan, Z., Wang, F. & Hong, J. Culture shapes how we look: Comparison between Chinese and African university students. J. Eye Mov. Res. 9(6), 1–10 (2016).
Evans, K., Rotello, C. M., Li, X. & Rayner, K. Scene perception and memory revealed by eye movements and receiver-operating characteristic analyses: Does a cultural difference truly exist?. Q. J. Exp. Psychol. 62(2), 276–285 (2009).
Blini, E. et al. Mind the depth: Visual perception of shapes is better in peripersonal space. Psychol. Sci. 29(11), 1868–1877 (2018).
Brouwer, H. Communicating with pictures: The role of pictures in health education in outpatient clinics of rural African hospitals. Visual Stud. 10(1–2), 15–27 (1995).
Cryns, A. G. J. African intelligence: A critical survey of cross-cultural intelligence research in Africa South of the Sahara. J. Soc. Psychol. 57(2), 283–301 (1962).
Pontius, A. A. In similarity judgments hunter-gatherers prefer shapes over spatial relations in contrast to literate groups. Percept. Motor Skill. 81(3), 1027–1041 (1995).
Chatterjee, S. Culture and perception. In Understanding Cross-Cultural Neuropsychology: Science, Testing, and Challenges (eds Fernández, A. L. & Evans, J.) (Routledge, 2022).
Matsumoto, D. & Yoo, S. H. Toward a new generation of cross-cultural research. Perspec. Psychol. Sci. 1(3), 234–250 (2006).
Lacko, D., Šašinka, Č, Čeněk, J., Stachoň, Z. & Lu, W. L. Cross-cultural differences in cognitive style, individualism/collectivism and map reading between Central European and East Asian university students. Stud. Psychol. 62(1), 23–43 (2020).
Atun, R. et al. Universal health coverage in Turkey: Enhancement of equity. Lancet. 382(9886), 65–99 (2013).
Birth Statistics, 2020. TURKSTAT Data Portal. (2021). (Accessed 2 June 2022) https://data.tuik.gov.tr/Bulten/Index?p=Birth-Statistics-2020-37229.
Mutlu, S. Population of Turkey by Ethnic Groups and Provinces. New Perspectives on Turkey. 12, 33–60 (1995).
Alotaibi, A., Underwood, G. & Smith, A. D. Cultural differences in attention: Eye movement evidence from a comparative visual search task. Conscious. Cogn. 55, 254–265 (2017).
Uskul, A. K., Kitayama, S. & Nisbett, R. E. Ecocultural basis of cognition: Farmers and fishermen are more holistic than herders. PNAS. 105(25), 8552–8556 (2008).
MacInnes, J. J., Iqbal, S., Pearson, J., & Johnson, E. N. Wearable eye-tracking for research: Automated dynamic gaze mapping and accuracy/precision comparisons across devices. BioRxiv 299925 (2018).
Ugwitz, P., Šašinková, A., Šašinka, Č, Stachoň, Z. & Juřík, V. Toggle toolkit: A tool for conducting experiments in unity virtual environments. Behav. Res. Methods. 53(4), 1581–1591 (2021).
Šašinka, Č., Morong, K. & Stachoň, Z. The Hypothesis platform: An online tool for experimental research into work with maps and behavior in electronic environments. ISPRS Int. J. Geo-Inf. 6(12), 407, 1–22 (2017).
Hu, B., Johnson-Bey, I., Sharma, M. & Niebur, E. Head movements during visual exploration of natural images in virtual reality. In 51st Annual Conference on Information Sciences and Systems. 1–6 (IEEE, 2017).
Lacko, D. et al. The necessity of testing measurement invariance in cross-cultural research: Potential bias in cross-cultural comparisons with individualism–collectivism self-report scales. Cross. Cult. Res. 56(2–3), 228–267 (2022).
Brislin, R. W. Back-translation for cross-cultural research. J. Cross. Cult. Psychol. 1(3), 187–216 (1970).
Byrne, B. M. et al. A critical analysis of cross-cultural research and testing practices: Implications for improved education and training in psychology. Train. Educ. Prof. Psychol. 3(2), 94–105 (2009).
Fitz, J. Cross-Cultural Research in Immersive Virtual Reality: Possibilities of Eyetracking Usage [Bachelor's thesis]. (Masaryk University, 2020).
Čeněk, J. & Urbánek, T. The adaptation and equivalence of test methods: An inspiration for psychological assessment of minorities in the Czech Republic. Czechoslovak Psychol. 63(1), 42–54 (2019).
Davidov, E., Meuleman, B., Cieciuch, J., Schmidt, P. & Billiet, J. Measurement equivalence in cross-national research. Annu. Rev. Sociol. 40(1), 55–75 (2014).
Van de Vijver, F. Towards a theory of bias and equivalence. In Cross-Cultural Survey Equivalence (ed. Harkness, J.) 41–65 (Zentrum für Umfragen, Methoden und Analysen, 1998).
Stachoň, Z. et al. The possibilities of using virtual environments in research on wayfinding [Preprint, 2022]. Under Review. https://doi.org/10.21203/rs.3.rs-2045842/v.
Stokes, J. D., Rizzo, A., Geng, J. J. & Schweitzer, J. B. Measuring attentional distraction in children with ADHD using virtual reality technology with eye-tracking. Front. Virtual Real. https://doi.org/10.3389/frvir.2022.855895 (2022).
Galley, N., Betz, D. & Biniossek, C. Fixation durations—Why are they so highly variable?. Das Ende von Rational Choice? Zur Leistungsfähigkeit der Rational-Choice-Theorie 93, 83–106 (2015).
Faul, F., Erdfelder, E., Lang, A. G. & Buchner, A. G* Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behav. Res. Methods 39(2), 175–191 (2007).
Torchiano, M. effsize: Efficient Effect Size Computation. https://doi.org/10.5281/zenodo.1480624, R package version 0.8.1, https://CRAN.R-project.org/package=effsize (2020).
Kay, M. ggdist: Visualizations of Distributions and Uncertainty. https://doi.org/10.5281/zenodo.3879620, R package version 3.1.1, https://mjskay.github.io/ggdist/ (2022).
Lüdecke, D., Ben-Shachar, M., Patil, I., Waggoner, P. & Makowski, D. Performance: An R package for assessment, comparison and testing of statistical models. J. Open Source Softw. 6(60), 3139 (2021).
Revelle, W. psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern University, Evanston, Illinois. R package version 2.2.5, https://CRAN.R-project.org/package=psych (2022).
Gomez, M. A., Skiba, R. M. & Snow, J. C. Graspable objects grab attention more than images do. Psychol. Sci. 29(2), 206–218 (2018).
de la Rosa, S. & Breidt, M. Virtual reality: A new track in psychological research. Br. J. Psychol. 109(3), 427–430 (2018).
Matheis, R. J. et al. Is learning and memory different in a virtual environment?. Clin. Neuropsychol. 21(1), 146–161 (2007).
Grewe, P. et al. Real-life memory and spatial navigation in patients with focal epilepsy: Ecological validity of a virtual reality supermarket task. Epilepsy Behav. 31, 57–66 (2014).
Masuda, T., Wang, H., Ishii, K. & Ito, K. Do surrounding figures’ emotions affect judgment of the target figure’s emotion? Comparing the eye-movement patterns of European Canadians, Asian Canadians, Asian international students, and Japanese. Front. Integr. Neurosci. 6(72), 1–9 (2012).
Völker, G. E. Turkish labour migration to Germany: Impact on both economies. Middle Eastern Stud. 12(1), 45–72 (1976).
Varnum, M., Grossmann, I., Katunar, D., Nisbett, R. & Kitayama, S. Holism in a European cultural context: Differences in cognitive style between Central and East Europeans and westerners. J. Cogn. Cult. 8(3–4), 321–333 (2008).
Masuda, T., Batdorj, B. & Senzaki, S. Culture and attention: Future directions to expand research beyond the geographical regions of WEIRD cultures. Front. Psychol. 11, 1–15 (2020).
Neisser, U. Cognition and Reality: Principles and Implications of Cognitive Psychology (WH Freeman and Company, 1976).
Chun, M. M. & Wolfe, J. M. Visual attention. In Blackwell Handbook of Perception (ed. Goldstein, B.) 272–310 (Blackwell, 2001).
Senzaki, S., Masuda, T. & Ishii, K. When is perception top-down and when is it not? Culture, narrative, and attention. Cogn. Sci. 38(7), 1493–1506 (2014).
Cohen, A. B. Many forms of culture. Am. Psychol. 64(3), 194–204 (2009).
Handali, J. P., Schneider, J., Gau, M., Holzwarth, V., & Brocke, J. Visual complexity and scene recognition: How low can you go? In 2021 IEEE Virtual Reality and 3D User Interfaces (VR), (2021).
Huang, J. & Klippel, A. The effects of visual realism on spatial memory and exploration patterns in virtual reality. In Proceedings of the 26th ACM Symposium on Virtual Reality Software and Technology (VRST '20). Association for Computing Machinery 1–11 (2020).
Han, E. et al. People, places, and time: A large-scale, longitudinal study of transformed avatars and environmental context in group interaction in the metaverse. J. Comput. Mediat. Commun. 28(2), zmac031 (2023).
Howard, I. P. & Rogers, B. J. Perceiving in Depth (Oxford University Press, 2012).
Lacko, D., Čeněk, J. & Urbánek, T. Psychometric properties of the independent and interdependent self-construal questionnaire: Evidence from the Czech Republic. Front. Psychol. 12, 564011 (2021).
Acknowledgements
We would like to thank Aneta Pelánová, Natálie Káčová, Berna Eren, Alaattin Arikan, Albert Ahenkan, Scholastica Pahs, Tzu-Lan Yu, and Rou-Ting Wang for enabling a challenging data collection procedure. We would also like to thank the HUME Lab–Experimental Humanities Laboratory, Masaryk University, for providing us with the necessary machine time and equipment. This research was supported by the Czech Science Foundation (GC19-09265J: The Influence of Socio-Cultural Factors and Writing Systems on the Perception and Cognition of Complex Visual Stimuli; https://gacr.cz/en/ and by the Taiwan Ministry of Science and Technology (projects MOST 108-2923-H-004-001-MY2 and MOST 107-2410-H-004-204 -MY3). The funders had no role in the study’s design, data collection and analysis, the decision to publish, or preparation of the manuscript.
Author information
Authors and Affiliations
Contributions
Š.A.: conceptualization, methodology, investigation, writing—original draft, writing—review and editing, visualization; Č.J.: formal analysis, data curation, writing—original draft, writing—review and editing, visualization; U.P.: methodology, formal analysis, resources, data curation, writing—original draft; T.J.-L., S.Z.: investigation, writing—review and editing; G.I.: investigation, writing—review and editing; L.D.: formal analysis, writing—review and editing; F.J.: methodology, writing—review and editing; Š.Č.: conceptualization, methodology, investigation, writing—review and editing, visualization, supervision, funding acquisition.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Šašinková, A., Čeněk, J., Ugwitz, P. et al. Exploring cross-cultural variations in visual attention patterns inside and outside national borders using immersive virtual reality. Sci Rep 13, 18852 (2023). https://doi.org/10.1038/s41598-023-46103-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-46103-1
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.