Introduction

Japanese literacy and kanji abilities

The Japanese writing system is composed of two types of script, i.e., kana and kanji. The former is phonographic, in which a letter is mapped onto a unit of sound, as in English. The latter is logographic, in which a character is mapped onto a unit of meaning, as in Chinese. Like alphabetic orthographies with 24–36 letters1, the former consists of only 71 letters, which are visually simple, and all kana letters are acquired by the first year of elementary school. In contrast, the latter contains more than 5,000 characters, which vary in visual complexity, as is the case with Chinese orthography2. Kanji characters are acquired gradually based on school grade throughout Japan, in strict accordance with the school curriculum guidelines developed by the Ministry of Education, Culture, Sports, Science and Technology3,4. In the first grade, children learn 80 characters (from a total of 2,136 daily-use characters). They are expected to master 1,006 characters by the end of the sixth grade, and to be able to read all daily-use kanji, including visually very complex characters such as the kanji 鬱 (u-tsu, depression), and to be able to write most of them by hand before high school graduation3,4. In Japanese documents, most nouns, and the roots of most verbs, adjectives, and adverbs are generally written with kanji characters, whereas kana letters are used adjunctively for function words and inflectional endings. Although the Japanese language can be written exclusively with kana letters, many homophones in the Japanese language are represented with the same kana letters; therefore, texts written with only these letters are generally much less readable. Japanese kanji characters make homophones discriminable, and mastery of a larger number of kanji characters and words guarantees that one will be able to read and write more complex sentences with a higher proportion of abstract words.

Literacy acquisition in any language is underpinned by basic cognitive skills and can support the development of higher-level language skills. This means that literacy can be a mediator between early cognitive skills and intellectual abilities in adulthood. Japanese literacy is considered to depend largely on skills for managing kanji words, which are acquired gradually over time and are used in preference to kana. However, the cognitive-developmental bases of Japanese kanji abilities are unclear, whereas it has been repeatedly reported that kana acquisition is strongly predicted by phonological processing5,6,7, as is also the case with English8,9. The cognitive underpinnings or demands of kanji acquisition that lasts until adulthood are considered to involve more slowly developing functions than, or at least to be partly different from, those of kana acquisition. Given the higher prevalence of problems in literacy acquisition in kanji than in kana among Japanese children10, identifying the cognitive underpinnings of Japanese kanji abilities is important for reinforcing therapeutic or educational strategies for these problems. In addition, literacy on the word level is one of the essential bases supporting knowledge acquisition at school or elsewhere, as well as the development of reading/writing skills on the text level. Therefore, from the perspective of the development of language skills, this study investigated the relationship of kanji abilities with their cognitive underpinnings as well as with higher-level language skills in Japanese university students who had completed 12 school years of kanji education.

Cognitive underpinnings of literacy

First, to identify the cognitive predictors of kanji abilities, it is necessary to consider multiple and interacting risk/protective factors of literacy problems whose etiology is complex and multifactorial, such as developmental dyslexia11,12. Cross-cultural investigations have established the importance of phonological processing as one of the key predictors of literacy acquisition in alphabetic orthographies (Dutch, German, English, Finnish, French, Hungarian, and Portuguese)13,14, as well as in logographic Chinese15. A single-deficit phonological theory of literacy problems was formerly predominant; however, emerging evidence suggests the possible contribution of deficits in visuospatial processing, including visuomotor processing speed, to literacy problems in alphabetic8,9,16,17,18,19 and Chinese orthographies20. Additionally, it has long been hypothesized that naming speed is a predictor of reading acquisition8,9,21, and it has also been reported to be related to spelling ability in alphabetic orthographies13. These cognitive domains have been shown to contribute differently to reading and writing13,23.

With reference to Japanese kanji abilities, a few studies replicated results showing significant relationships between the three cognitive domains mentioned above and early acquisition of Japanese kanji reading/writing accuracy, with a relatively strong contribution of visuospatial processing6,10,22. However, to the best of our knowledge, no study has yet investigated the cognitive underpinnings of the semantic dimension of Japanese literacy skills, presumably because of the young age of the participants in the previous studies. The three-dimensional view of kanji abilities, in which semantic comprehension interacts with but is distinct from accuracy in reading and writing, can be supported by neuropsychological evidence24,25,26, as well as the results of a factor-analytic investigation into a large database of test-takers’ performance details on the Japan Kanji Aptitude Test (Nihon Kanji Noryoku Kentei: Kanken)27. As suggested by the results of previous studies of alphabetic and Chinese orthographies examining the cognitive underpinnings of the abilities to read and comprehend sentences or a text28,29, syntactic processing is considered to be a likely predictor of the semantic dimension of kanji acquisition in Japanese. Thus, we investigated the predictive relationships between basic cognitive skills, including phonological, visuospatial, and syntactic processing, and naming speed, and the dimension of semantic comprehension as well as reading/writing accuracy in Japanese kanji abilities. As might be expected, the writing dimension of literacy was assessed in this study using tasks that require participants to write words accurately by hand rather than to type them, in common with the studies referred to above.

Impact of literacy on higher-level language skills

Second, regarding the outcomes of later linguistic development, empirical evidence shows that literacy skills can predict the acquisition of verbal knowledge during the school years. For example, the Matthew effect suggests that good readers gain vocabulary, as well as reading skills, more rapidly30,31,32. In addition, it has been reported that scores on the Information subtest of the Wechsler intelligence scales in children/adults with developmental dyslexia are significantly lower than those in typically developing people33,34. Furthermore, literacy skills have been reported to be related to idea density, which represents the linguistic complexity of written or spoken output35. Idea density has been the subject of widespread attention in the context of its possible function as a protective factor against age-related cognitive decline and Alzheimer’s disease in late life. The seminal “Nun study” retrospectively analyzed autobiographies that had been written by nuns in their young adulthood (mean age of 22 years) and showed that higher idea density predicted intact cognition in later life, regardless of the presence or absence of Alzheimer’s disease lesions36,37. This relationship between higher idea density in young adulthood and intact cognition in later life was replicated in a study that analyzed medical school admission essays (mean age of 22 years)38. These findings suggest that attainment of higher language proficiency before and during young adulthood may protect against cognitive decline and dementia in late life, and in addition, seem to validate the notion that idea density can be a measure of highly sophisticated skills. From a developmental perspective, higher-level language skills such as idea density are considered to be partly underpinned by basic literacy skills, or, more specifically, idea density may be encouraged by the accumulation of verbal knowledge, which is based on literacy acquisition.

In recent decades, the rapid spread of digital writing devices such as computers and smartphones39 has drastically reduced the frequency of handwriting40, resulting in a dimension-specific reduction of kanji writing accuracy in Japanese adults27. When using these devices, we first write with kana letters and then convert them into kanji words by choosing one option from many homophones. When doing so, we recognize, not recall, the form of kanji characters. In addition, we can write Japanese words with kana whenever we cannot write kanji characters, which are large in number and visually complex, correctly. Therefore, less frequent handwriting in Japanese can easily lead to less correct handwriting of kanji. Considering these effects of environmental changes, it seems important to examine the impacts of each dimension of those abilities on higher-level language skills.

Aim of the study

This study aimed to clarify the relationships among Japanese kanji abilities, their cognitive underpinnings, and higher-level language skills from the perspective of the development of language skills. Considering the dimension-specificity of kanji acquisition, this cross-sectional study started by identifying the risk/protective factors that account for the individual differences in each of the three dimensions of kanji abilities, i.e., reading accuracy, writing accuracy, and semantic comprehension, in Japanese university students. Then, we examined the contributions of each dimension of kanji acquisition to higher-level language skills, i.e., acquired knowledge and idea density. The hypotheses tested in this study were that (1) multidimensional kanji abilities can be predicted by multiple cognitive factors; (2) those cognitive skills predict kanji abilities in a dimension-specific manner; (3) the relationships between kanji abilities and acquired knowledge are not correlations but predictions from the former to the latter; (4) kanji abilities mediate the relationships between their cognitive predictors and higher-level language skills; (5) kanji abilities predict idea density dimension-specifically and indirectly via acquired knowledge.

Methods

Participants

The present study included 30 university students (28 undergraduates and 2 graduates; mean age ± SD = 19.87 ± 1.25 years; 15 females and 15 males) recruited from several universities in Kyoto and Osaka, Japan. The sample size was calculated a priori using the GPower 3.1.9.2 (Kiel University, Germany) based on an α = 0.05, power of 80%, and a large effect size (f2 = 0.35) for identifying two predictors using a multiple linear regression analysis. The full-scale, verbal, and performance intelligence quotients (IQ) of all participants were measured using the Japanese version of the Wechsler Adult Intelligence Scale, third edition (WAIS-III)41,42. The IQs of all participants were normal or higher (IQs ≥ 72), and the variance and width of the distribution of the scores were substantially large (SDs ≥ 11.95; width of the IQ ranges ≥ 52), as expected. Moreover, to estimate the literacy achievement level of the participants, we used the reading and handwriting subtests of the Japanese version of the Kaufman Assessment Battery for Children, second edition (KABC-II)43. The KABC-II is standardized for children aged 2–18 years. Thus, we applied scaled scores for 18-year-olds to all participants, even though this may have meant slightly overestimating the achievement level of the participants. The scores for reading and handwriting were normal or higher (scaled scores ≥ 8), and the variance and width of the distribution of the scores were substantially large (SDs ≥ 2.11; width of the scaled score ranges ≥ 7). The demographic characteristics and the scores on WAIS-III and KABC-II of the participants are provided in Table 1.

Table 1 Demographic characteristics, intelligence, and literacy achievement level of participants.

All procedures in this study were approved by the Ethics Committee of Kyoto University Graduate School and Faculty of Medicine and were performed in accordance with the ethical standards in the 1964 Declaration of Helsinki and its later amendments. All participants provided written informed consent to participate in the study.

Measures

All cognitive tasks and neuropsychological tests were administered individually by a clinical psychologist trained in standardized testing procedures. The written exams, including a kanji exam and an essay task for measurement of idea density, were also conducted individually and overseen by a clinical psychologist or one of two undergraduate research assistants to prevent examinees from using their smartphones during the exams. Considering that fatigue could influence task performance, the measures described below were divided into two parts and implemented over two consecutive or non-consecutive days within 21 days, with the exception of one individual who participated with an interval of 51 days between the two parts of the test.

Kanji abilities

To measure kanji abilities, we used the 2016 past examination papers for the Kanken, whose three-dimensional structure, including reading accuracy, writing accuracy, and semantic comprehension, was established in our factor analytical study27. The Kanken was first offered in 1975 and is the most popular kanji exam in Japan, with approximately 2 million people taking it voluntarily or semi-voluntarily in 2016. From the 12 levels of difficulty provided by Kanken, ranging from the easiest (Level 10) to the most difficult (Level 1, including Pre-2 and Pre-1), we employed the Level Pre-2 exam paper (mastery of 1951 daily-use kanji: high school level). Additionally, considering the possible ceiling effects in the Reading subtest27, we also conducted that subtest from the Level 2 exam paper (mastery of all 2136 daily-use kanji; high school graduation level). The results showed that only one participant achieved a perfect score on the Level 2 Reading subtest, whereas six participants did so on that of Level Pre-2. Thus, we eventually analyzed the scores on the Level 2 Reading subtest, in addition to the other subtests of the Level Pre-2 exam papers. All participants completed those paper-and-pencil exams within 60 min (mean ± SD = 32.50 ± 7.09 min; the range was from 19 to 46 min), which was the time limit for the official Kanken exam.

Reading accuracy

The dimension of reading accuracy was assessed using the Reading subtest. This subtest requires examinees to write the correct pronunciation of a marked kanji word (i.e., convert it to kana) appearing in each of 30 sentences, taking context into consideration. As previously discussed, a kanji word can be written alternatively with kana letters, which have highly regular letter–sound correspondences. Thus, this conversion from kanji characters to kana letters is usually used in kanji education in Japan. Each correct item was given a score of 1, adding up to a maximum of 30 (0–30)27.

Writing accuracy

The dimension of writing accuracy was assessed using the following five subtests. We used the sum of the scores on these subtests as the score for this dimension (0–110). The Antonyms/Synonyms subtest required the participants to choose an antonym or synonym for each of ten two-character kanji compounds from kana words and write it correctly in kanji. Ten options of kana words were prepared. Each correct item was given a score of 2, adding up to a maximum of 20. In the Homophones subtest, the participants were required to differentially write two homophones of kanji words that were written as underlined kana letters in each of five pairs of sentences. Each correct item was given a score of 2, adding up to a maximum of 20. The Error Correction subtest required the participants to identify a homophonic error in a kanji character in each of five sentences and write the correct one. Each correct item was given a score of 2, adding up to a maximum of 10. In the Kana Suffixes subtest, the participants were required to write a correct kanji character and a kana suffix accompanying it based on marked kana letters in each of five sentences. Each correct item was given a score of 2, adding up to a maximum of 10. The Writing subtest required the participants to write a correct kanji word that was written as marked kana letters in each of 25 sentences. Each correct item was given a score of 2, adding up to a maximum of 5027.

Semantic comprehension

The dimension of semantic comprehension was assessed using the following four subtests. We used the sum of the scores on these subtests as the score for this dimension (0–60). The Radicals subtest required the participants to extract a radical from each of ten kanji characters. Radicals are the visual components of kanji characters, most of which represent the semantic category, e.g., the left part of the kanji 海 (umi or kai, sea) is regarded as the radical 氵 (sanzui), meaning “water” or “fluid.” In general dictionaries of Japanese kanji, 214 radicals are used for classifying kanji characters and each kanji is assigned one radical. Each correct item was given a score of 1, adding up to a maximum of 10. In the Compounds Structure subtest, the participants were required to classify ten two-character kanji compounds into five categories based on their structure. The categories included cases where the two characters have a similar meaning, the two characters have the opposite meaning, the former modify the latter, the latter is an object/complement of the former, and the former deny the meaning of the latter. Each correct item was given a score of 2, adding up to a maximum of 20. The Compounds Completion subtest required the participants to complete ten four-character kanji compounds by choosing one that precedes or follows each of ten two-character kanji compounds from kana words and converting kana to kanji. Ten options of kana words were prepared. Each correct item was given a score of 2, adding up to a maximum of 20. In the Compounds Meaning subtest, the participants were required to choose one option that represented the meaning of each of five sentences from ten four-character kanji compounds in subtest 4. Each correct item was given a score of 2, adding up to a maximum of 1027.

Cognitive skills

Drawing on the findings of previous studies6,8,9,10,13,14,15,16,17,18,19,20,21,22,23,28,29, we selected and measured cognitive skills in the domains of phonological processing, naming speed, visuospatial processing, and syntactic processing. To assess basic cognitive skills that are substantially independent of each other and likely underpin literacy acquisition, we employed simple tasks whose targets were clear.

Phonological processing

We used the Checking Sounds subtest of the Comprehensive Assessment of Reading Domains (CARD)44 to measure phonological awareness. In this task, the participants listened to two voices that read out each of 30 words (3–5 morae) in sequence, with one of the morae not read by the second voice; they were then asked to identify the location of the mora that was canceled. In this test, which was standardized for elementary school children, at normal speed, the audio stimuli that sequentially present words are played over about 215 s. However, in the present study, we played the stimuli at double speed to make the task more challenging for university students based on the results of our preliminary experiments. Taking the number of correct answers as a measure of accuracy, the high level of accuracy (mean = 97.23%) of the participants’ performance on this subtest appeared to support the assumption that this modification would not diminish the capability of this test.

To measure sound–letter decoding, we used the Listening subtest of the CARD44. In this task, the participants were asked to listen to a voice that read out each of 30 non-words (3–5 morae) and to choose one non-word that represented the sound heard from five options, each of which was written with 3–5 kana letters. In this test, which was standardized for elementary school children, the audio stimuli that sequentially present non-words are played over about 155 s at normal speed. However, to make the task more challenging for university students, we played the stimuli at double speed in the present study based on the results of our preliminary experiments. Taking the number of correct answers as a measure of accuracy, the high level of accuracy (mean = 92.43%) of the participants’ performance on this subtest appeared to support the assumption that the modification would not diminish the capability of this test.

To measure phonological short-term memory, the forward condition of the Digit Span subtest of the WAIS-III41,42 was used. In this task, participants were required to repeat a sequence of digits presented orally by the examiner. The measure used was the raw score.

Naming speed

We used the Rapid Automatized Naming tests45 to measure naming speed. In this task, the participants were asked to name a random sequence of symbols (i.e., objects, colors, numbers, and alphabetical letters) as quickly and accurately as possible. We administered this task in Japanese and used the sum of the time (in seconds) taken to name all the symbols in the Object, Color, and Number subtests as the measure. Considering the time needed to correct errors spontaneously, we adjusted the score by adding the number of errors overlooked to the time.

To measure letter–sound decoding, the Non-word Reading task46 was used. In this task, participants were required to read 30 non-words written with four kana letters (3–4 morae) aloud as quickly and accurately as possible. The time (in seconds) taken to read all the non-words was used as the measure, and we adjusted the score by adding the number of errors overlooked to the time.

Visuospatial processing

We used the Kana subtest of the Visual Cancellation task of the Clinical Assessment of Attention47 to measure visual attention. This task asked participants to look through a random sequence of kana letters only once and cross out a target letter that appeared repeatedly within the sequence as quickly and accurately as possible. The measure used was the time (in seconds) taken to complete the task. Considering the shorter completion time of the 17 participants who made more than one error (mean ± SD = 96.82 ± 20.38) compared with the 13 participants who made no errors (mean ± SD = 105.54 ± 22.05) and the number of errors in the former (mean ± SD = 2.06 ± 1.38), we adjusted the score by adding ten times the number of errors to the time (consequently, mean ± SD score in the participants who made more than one error = 118.59 ± 26.58).

To measure visual short-term memory, the forward condition of the Visuospatial Span subtest of the Japanese version of the Wechsler Memory Scale, Revised (WMS-R)48,49 was used. In this task, after the examiner had tapped the cubes in a predetermined sequence, participants were asked to repeat the sequence. The raw score was used as the measure.

To measure visual long-term memory, we used the Rey Complex Figure Test50. This task requires participants to draw a design relying on memory 3 min (immediate recall) and 30 min (delayed recall) after completing a trial copy. The measure used was the raw score of the delayed recall.

To measure visuospatial perception, we used the unsegmented condition of the Block Design task developed by Shah and Frith51. In this task, participants were asked to replicate designs using four blocks as quickly as possible, as with the Block Design subtest of the WAIS-III. The total time (in seconds) to construct all six designs was used as the measure.

To measure visuomotor processing speed, the Digit Symbol subtest of the WAIS-III41,42 was used. This task requires participants to copy symbols paired with digits as quickly as possible in the empty boxes below a random sequence of digits within 120 s. The measure used was the raw score.

In addition, we used the Purdue Pegboard Test52 to measure manual dexterity. This task requires participants to insert small metal rods into a row of holes using their right, left, or both hands. Because all of the participants were righthanded, the number of rods inserted within 30 s using their right hand was used as a measure. In this study, we related the demands of eye—hand coordination in this task, which could have an impact on literacy acquisition, to visuospatial processing.

Syntactic processing

The Sentence Reading Part 2 subtest of the CARD44 was used to measure syntactic processing. In this task, the participants were asked to read a sentence and then judge whether the five sentences following that sentence were semantically identical with it. Those five sentences differed in their syntactic structures and function words from the sentence read first. The measure used was the time (in seconds) taken to answer all 50 questions (ten sets of five sentences). Considering the time for the participants to complete the task (mean ± SD = 188.00 ± 48.20), we adjusted the score by adding five times the number of errors to the time.

Higher-level language skills

Acquired knowledge

Following the method suggested by Bannatyne53, we employed the sum of the scaled scores on the Vocabulary, Arithmetic, and Information subtests of the WAIS-III as the measure of acquired knowledge. The Vocabulary subtest asked participants to explain orally the meaning of words presented both orally and in written form. This test is similar to the semantic subtests of kanji abilities in requiring semantic knowledge but differs in terms of not requiring literacy skills from those subtests. In fact, the scores on the Vocabulary subtest in dyslexic children are not always lower than those in control children34,54,55, suggesting that vocabulary and literacy skills are not identical. In the Arithmetic subtest, participants mentally solved arithmetic questions presented orally. This test assumes not only working memory but also knowledge of the rules of arithmetic. The Information subtest asked for knowledge generally acquired at school.

Idea density

We calculated idea density scores in essays written by participants using the CPIDR 5.1.4637.21009 (Computerized Propositional Idea Density Rater, pronounced “spider”) software program56, which automatically determines the idea density of an English text based on part-of-speech tags and rules for adjusting the count of propositions, i.e., verbs, adjectives, adverbs, prepositions, and conjunctions (for more details, see the CPIDR 5.1 User Manual)56. In short, idea density was measured as the ratio of expressed propositions to the total number of words in an English text36. Before calculating idea density, in accordance with a suggestion from a previous study that demonstrated the feasibility of machine translation–based measurement of idea density in Japanese texts57, we translated the Japanese text written by participants into English using Google Translate. The participants were asked to write an essay concerning “general activities and interests” in 600–800 characters within 30 min using Microsoft Word on an offline PC. The thesis of an essay was determined according to the method used in a previous study that calculated idea density in essays written by university students38. All but four participants completed this task within 30 min, and the time was extended up to 35 min when not completed (mean ± SD = 23.73 ± 5.97 min; the range was from 10 to 35 min).

Statistical analyses

The data were analyzed in three steps. All statistical analyses were conducted using the R version 3.5.358 software environment. The statistical testing was two-tailed, and α was set at 0.05.

First, to identify the cognitive factors accounting for the individual differences in multidimensional kanji abilities, we performed simple linear regression analyses between all measures of cognitive skills as independent variables (predictors) and each dimension of kanji abilities as the dependent variable (outcome). Then, to identify the best predictive models for each dimension of kanji abilities, multiple linear regression analyses were conducted between significant predictors and kanji abilities.

Second, we assessed goodness of fit for each of the three models of relationships among each dimension of kanji abilities, their cognitive predictors, and acquired knowledge or idea density with path analyses using maximum likelihood estimation. The first model assumed the predictions from kanji abilities to acquired knowledge, whereas the second model assumed the correlations between them. The third model hypothesized that each of the kanji abilities could directly predict idea density. The following indices of model fit were employed: the traditional χ2 test, the root mean square error of approximation (RMSEA) with a 90% confidence interval, the comparative fit index (CFI), the Tucker-Lewis index (TLI), the standardized root mean square residual (SRMR), and Akaike’s information criteria (AIC). RMSEA values < 0.05 suggest a good fit, and values < 0.08 are considered acceptable. We also calculated p-values for the test of the close-fit hypothesis that RMSEA ≤ 0.05. This one-sided null hypothesis should be adopted (i.e., p close ≥ 0.05) for a good fit59. The CFI and TLI values should be > 0.95, the SRMR values should be < 0.08 for a good fit, and lower AICs indicate relatively better fit60,61,62. In addition, we conducted a bootstrapping method to test the mediation path in the models with a good fit.

Then, for testing the direct and indirect effect (via knowledge acquisition) of kanji abilities on idea density, we performed linear regression analyses with a bootstrapping method. An estimate of the indirect effect was the mean computed using 1000 bootstrap samples.

Results

Cognitive predictors of kanji abilities

Means, standard deviations, and range of scores on all measures are shown in Table 2, and the results of simple linear regression analyses between cognitive skills as predictors and three dimensions of kanji abilities as outcomes are presented in Table 3. These analyses showed that better performances on sound–letter decoding (β = 0.57, t(28) = 3.66, p = 0.001), rapid automatized naming (β = − 0.46, t(28) = − 2.77, p = 0.010), letter–sound decoding (β = − 0.45, t(28) = − 2.68, p = 0.012), and syntactic processing (β = − 0.54, t(28) = − 3.43, p = 0.002) significantly predicted higher scores in reading accuracy kanji ability. However, higher scores in writing accuracy were found to be predicted by better performance on phonological awareness (β = 0.44, t(28) = 2.62, p = 0.014), sound–letter decoding (β = 0.57, t(28) = 3.63, p = 0.001), visual short-term memory (β = 0.45, t(28) = 2.65, p = 0.013), visuospatial perception (β = − 0.53, t(28) = − 3.27, p = 0.003), and syntactic processing (β = − 0.47, t(28) = − 2.80, p = 0.009). Higher scores on semantic comprehension were predicted by better performance on phonological short-term memory (β = 0.41, t(28) = 2.35, p = 0.026), visuospatial perception (β = − 0.43, t(28) = − 2.53, p = 0.017), and syntactic processing (β = − 0.50, t(28) = − 3.07, p = 0.005). No other relationship between cognitive skills and kanji abilities was significant (all p ≥ 0.06). Additionally, simple linear regression analyses showed that sex, age, and years of education did not predict any dimension of kanji abilities, as well as acquired knowledge and idea density (all p ≥ 0.12).

Table 2 Scores on language and cognitive measures in participants.
Table 3 Results of simple linear regression analyses including cognitive skills as predictors and kanji abilities as outcomes.

Then, we administered step-wise linear regression analyses that included all significant predictors of each dimension of kanji abilities, which were identified in simple linear regressions. The final model demonstrated that the combination of sound–letter decoding (β = 0.34, t(26) = 2.02, p = 0.054), rapid automatized naming (β = − 0.23, t(26) = − 1.44, p < 0.163), and syntactic processing (β = − 0.33, t(26) = − 2.06, p = 0.049) significantly predicted the scores on reading accuracy of kanji abilities, which accounted for 41% of the variance in this dimension (F(3,26) = 7.84, p < 0.001, adjusted R2 = 0.41; see Fig. 1). When letter–sound decoding (β = 0.02, t(25) = 0.11, p < 0.915) was added to this model as a predictor, the variance accounted for was reduced to 39% (F(4,25) = 5.66, p = 0.002, adjusted R2 = 0.39). Regarding writing accuracy, the combination of sound–letter decoding (β = 0.45, t(27) = 3.09, p = 0.005) and visuospatial perception (β = − 0.40, t(27) = − 2.71, p = 0.012) was found to account for 43% in this dimension (F(2,27) = 11.75, p < 0.001, adjusted R2 = 0.43). When phonological awareness (β = 0.10, t(26) = 0.64, p < 0.526), visual short-term memory (β = 0.09, t(26) = 0.47, p < 0.644), or syntactic processing (β = 0.03, t(26) = 0.15, p < 0.884) was added to this model as a predictor, the variance accounted for was reduced to 41% or less (F(3,26) = 7.80, p < 0.001, adjusted R2 = 0.41; F(3,26) = 7.68, p < 0.001, adjusted R2 = 0.41; F(3,26) = 7.55, p < 0.001, adjusted R2 = 0.40). Regarding semantic comprehension, step-wise linear regression analyses demonstrated that syntactic processing (β = − 0.50, t(28) = − 3.07, p = 0.005) accounted for 22% in this dimension (F(1,28) = 9.39, p = 0.005, adjusted R2 = 0.22). However, the combination of phonological short-term memory (β = 0.23, t(27) = 1.30, p = 0.206) and syntactic processing (β = − 0.40, t(27) = − 2.24, p = 0.034) accounted for 24% of the variance of semantic dimension of kanji abilities (F(2,27) = 5.65, p = 0.009, adjusted R2 = 0.24).

Figure 1
figure 1

Illustration of the dimension-specific models of Japanese kanji acquisition: the results of regressions in path analyses that include each dimension of kanji abilities, their cognitive underpinnings, and acquired knowledge. Numbers on straight arrows indicate standardized regression coefficients, whereas those on curved double-headed arrows indicate correlation coefficients. The results of F tests and adjusted R2 are shown on the right of the figures. SLD sound–letter decoding, RAN rapid automatized naming, SP syntactic processing, VP visuospatial perception, PM phonological short-term memory.

Relationships between kanji acquisition and higher-level language skills

The results of path analyses that include each dimension of kanji abilities, and their cognitive predictors, and acquired knowledge or idea density are shown in Table 4. All goodness-of-fit indices demonstrated better fits of the models that assumed predictions from each of three dimensions of kanji abilities to acquired knowledge than the models that assumed correlations between them (see Fig. 1). The χ2 statistics and RMSEA estimates were both significantly low for the predictive models (all p > 0.05; all p close > 0.05) but not for the correlative ones (all p ≤ 0.006; all p close ≤ 0.009). The AIC values for the former were lower than those for the latter, and the CFI, TLI, and SRMR values also indicated good fits only for the former. In the prediction models, each dimension of kanji abilities predicted acquired knowledge (reading accuracy: β = 0.71, t(28) = 5.28, p < 0.001; writing accuracy: β = 0.74, t(28) = 5.84, p < 0.001; semantic comprehension: β = 0.79, t(28) = 6.71, p < 0.001), and accounted for 48% or larger portion of the variances in that score (reading accuracy: F(1,28) = 7.84, p < 0.001, adjusted R2 = 0.48; writing accuracy: F(1,28) = 34.15, p < 0.001, adjusted R2 = 0.53; semantic comprehension: F(1,28) = 45.00, p < 0.001, adjusted R2 = 0.60). Regarding the correlations among the cognitive predictors, in the model of reading accuracy, better performance on sound–letter decoding was associated with those on rapid automatized naming (r = − 0.39, p = 0.04) and syntactic processing (r = − 0.42, p = 0.03), whereas the significant correlation between the latter two was not shown (r = 0.30, p = 0.11). In the model of semantic comprehension, better performance on syntactic processing was associated with that on phonological short-term memory (r = − 0.42, p = 0.03). In the model of writing accuracy, the significant correlation between sound–letter decoding and visuospatial perception was not found (r = − 0.27, p = 0.14). Additionally, mediation analyses by a bootstrapping method showed significant indirect effects of all cognitive predictors on acquired knowledge via each dimension of kanji abilities (sound–letter decoding via reading: β = 0.41, z = 3.43, p = 0.001; rapid automatized naming via reading: β = − 0.34, z = − 2.68, p = 0.007; syntactic processing via reading: β = − 0.31, z = − 2.54, p = 0.012; sound–letter decoding via writing: β = 0.44, z = 2.60, p = 0.009; visuospatial perception via writing: β = − 0.36, z = − 2.31, p = 0.021; phonological short-term memory via semantic comprehension: β = 0.29, z = 2.37, p = 0.018; syntactic processing via writing: β = − 0.34, z = − 2.28, p = 0.023).

Table 4 Results of path analyses that test models including kanji abilities, their cognitive underpinnings, and higher-level language skills.

However, the predictive models that included each dimension of kanji abilities, their cognitive predictors, and idea density were not statistically supported (see Fig. 1). The χ2 statistics (all p ≤ 0.008) and RMSEA estimates (all p close ≤ 0.011), as well as other indices, did not show good fits for these models.

Indirect effects of kanji abilities on idea density via acquired knowledge

Finally, we administered linear regression analyses with a bootstrapping method to examine the validity of the mediation models that included each dimension of kanji abilities as a predictor, acquired knowledge as a mediator, and idea density as an outcome (see Fig. 2). The results of these analyses are shown in Table 5. Reading accuracy of kanji abilities did not predict idea density directly (β = 0.33, t(28) = 1.85, p = 0.075) or indirectly via acquired knowledge (β = 0.28, z = 1.57, p = 0.116). Writing Accuracy also did not predict idea density directly (β = 0.31, t(28) = 1.74, p = 0.093) but did do so indirectly (β = 0.33, z = 2.18, p = 0.029; see Fig. 2). However, semantic comprehension predicted idea density directly (β = 0.38, t(28) = 2.17, p = 0.039) but not indirectly (β = 0.27, z = 1.31, p = 0.189). When controlling for the effects of acquired knowledge, the direct effects of any dimension of kanji abilities, as well as acquired knowledge, were not significant (all p ≥ 0.10). Although acquired knowledge accounted for 16% of the variance in idea density by itself (F(1,28) = 6.34, p = 0.018, adjusted R2 = 0.16; β = 0.43, t(28) = 2.52, p = 0.018), whenever each dimension of kanji abilities was added to this model as a predictor, the variances accounted for were slightly reduced to 13% or less (reading accuracy: F(2,27) = 3.09, p = 0.062, adjusted R2 = 0.13; writing Accuracy: F(2,27) = 3.06, p = 0.063, adjusted R2 = 0.12; semantic comprehension: F(2,27) = 3.15, p = 0.059, adjusted R2 = 0.13).

Figure 2
figure 2

Illustration of the mediation model that includes writing accuracy of kanji abilities (predictor), knowledge acquisition (mediator), and idea density (outcome): Numbers on arrows indicate standardized regression coefficients. Black arrows represent the direct effects. Red arrows represent the indirect effects.

Table 5 Testing for indirect effects of kanji abilities on idea density via acquired knowledge.

Discussion

The present study tested hypotheses regarding the predictive relationships among kanji abilities, their cognitive underpinnings, and higher-level language skills in Japanese university students. Our results support all five hypotheses that (1) combinations of multiple cognitive skills predict each dimension of Japanese kanji abilities, i.e., reading accuracy, writing accuracy, and semantic comprehension; (2) whereas phonological processing is a common factor, the other three domains predicted kanji acquisition in a dimension-specific manner; (3) kanji abilities predict acquired knowledge; (4) each dimension of kanji abilities mediates the predictive relationship between its cognitive predictors and acquired knowledge; (5) the writing dimension of kanji abilities predicts idea density indirectly via acquired knowledge. To the best of our knowledge, these findings represent the first reported evidence of dimension-specific relationships of the three dimensions of Japanese kanji abilities with their cognitive predictors, as well as with higher-level language skills.

Cognitive underpinnings of Japanese kanji abilities

A series of linear regressions identified the multifactorial models of reading, writing, and semantic dimensions of Japanese kanji acquisition (see Fig. 1). Our results showed that the combinations of common and dimension-specific factors account for 24 to 43% of the variances in each dimension of Japanese kanji acquisition, which were equivalent to those reported for children6,10. These results support the methodological validity of investigating the cognitive predictors of Japanese kanji abilities in university students.

Phonological processing measures, including sound–letter decoding and phonological short-term memory, were found to predict all the dimensions of kanji abilities. The contribution of this universal factor of literacy acquisition8,9,63 has been reported to be relatively small for Japanese kanji reading/writing6,10,22, as well as for Chinese15. Our results are consistent with those previous findings. Nonetheless, we add a new finding that phonological processing plays a substantial role in the mastery of kanji, with its impact on all of the three dimensions of kanji abilities.

As dimension-specific factors, the following functions were identified: naming speed for reading accuracy, visuospatial processing for writing accuracy, and syntactic processing for reading and semantic comprehension. First, the contribution of naming speed to reading acquisition, which is smaller than that of phonological processing, has been reported across languages8,9,64, including Japanese10. Therefore, it is recommended that testing naming speed on kana letters and non-words, as well as kana words and sentences, should be employed in the diagnostic process for reading disorders in Japanese children46. Consistent with these findings, the significant effects of naming speed measures were found only on the reading dimension in this study. Second, regarding visuospatial processing, visual memory has been reported as a predictor of spelling or writing acquisition in alphabetic orthographies19,65, as well as in Japanese22. Although this study employed measures of visual short- and long-term memory, referring to those previous findings, their predictive power was slightly smaller than that of visuospatial perception measured by using a simple block design task. These results suggest the language-specificity of this dimension-specific factor underpinning the ability to write visually complex kanji characters accurately. Third, regarding syntactic processing, our results provided the first evidence of a significant relationship with Japanese literacy. Before now, the studies on alphabetic and Chinese orthographies reported that syntactic processing contributed to reading comprehension on the sentence or text level28,29. The present study expands these findings to reading accuracy and semantic comprehension on the word level and to Japanese orthography. In addition, these dimension-specific predictors seem to be evidence supporting the multidimensional nature of Japanese kanji abilities.

The multifactorial models identified in this study suggest the possibility that atypical characteristics in phonological processing cause general difficulties in kanji acquisition, whereas atypicalities in naming speed, visuospatial processing, or syntactic processing cause difficulties in specific dimensions of kanji abilities. However, on the other side of the coin, these models suggest that each of the multiple factors could work as a protective factor to compensate for the difficulties in Japanese literacy caused by other factors. Our results imply the necessity of therapeutic or educational strategies according to the assessment of the dimensions and causes of difficulties in Japanese kanji acquisition.

Effects of kanji acquisition on higher-level language skills

Path analyses showed the contributions of all the dimensions of kanji acquisition to knowledge acquisition. These results are consistent with both our intuitive expectations and the empirical evidence30,31,32,33,34. In addition, mediation analyses showed the indirect effects of all the cognitive predictors on acquired knowledge via kanji abilities, suggesting that, whatever their cause, literacy achievement/problems can affect knowledge acquisition during the school years.

A direct contribution of any dimension of kanji acquisition to idea density was not established in the present study. The direct effect of semantic comprehension on idea density was significant but attenuated to a non-significant level when controlling for acquired knowledge. However, as for the contributions of kanji abilities to idea density via acquired knowledge, a significant indirect effect was demonstrated for the dimension of writing (see Fig. 2). These results suggest that handwriting skills on the word level, rather than reading or semantic skills acquired during school years, can promote the development of writing abilities on the text level. We measured idea density from essays that were typed by participants, but not from handwritten ones. Therefore, the impact of handwriting acquisition does not seem to be confined to the process of handwriting per se but to be generalized to higher-level language skills. This finding may also be true for people using other orthographies. Alternatively, this may only be applicable to Japanese people because there are differences between Japanese and other languages, not only in the writing system but also with respect to many other facets66. The effects of the three dimensions of literacy acquisition, i.e., reading accuracy, writing accuracy, and semantic comprehension, on the development of higher-level language skills in non-Japanese people should be examined in the future.

In the present day, digital writing devices are increasingly replacing handwriting across the world. In Japan, the frequency of handwriting has decreased39, resulting in a dimension-specific reduction of kanji writing skills in adults27. Considering the possible impact on not only handwriting acquisition but also the development of language skills, we must carefully discuss the controversial issue of whether these technologies should be applied to early literacy education67,68. The ease of typing or other supportive capabilities of digital tools is considered to be advantageous for literacy learning, particularly in children with underdeveloped motor skills69 or reading/writing difficulties70, e.g., difficulties related to visuospatial processing. However, there is growing evidence demonstrating that the coupling of motor action and perception during handwriting can facilitate literacy acquisition71,72,73, perhaps especially in cases of superior visuospatial processing. The relationship between the acquisition of handwriting on the word level, not reading and semantic comprehension, and higher-level language skills on the text level shown in this study represents evidence supporting the latter view.

Finally, it is also worth mentioning again that idea density, which is employed in this study as a measure of higher-level language skills, has attracted wide attention in the context of its possible function as a protective factor against Alzheimer’s disease36,37,38. The Nun study demonstrated that young adults with higher idea density tended to preserve cognitive function in later life even when Alzheimer’s disease affected their brains36,37. This finding could be explained by the “cognitive reserve” hypothesis. This means that attainment of higher language proficiency in early life may increase the brain’s capacity to make efficient use of available cognitive/neural resources and protect against cognitive decline and dementia in later life74,75,76. Incorporating these findings into the interpretation of our current study, we may describe a lifelong trajectory of the effect of literacy acquisition, from basic cognitive skills through basic literacy (especially writing accuracy) acquired during school years to higher-level language skills (culminating in idea density) throughout adulthood and to the “cognitive reserve” as a protective factor against cognitive decline in late life. This theoretical framework implies that basic literacy acquisition may be a good predictor of later cognitive decline/preservation. Before now, most studies of English-speaking elders have focused on reading accuracy in late life and reported that higher performance in this area predicts cognitive preservation77,78. Our current results, which showed the impact of basic handwriting skills on higher-level language skills, might suggest that the domain of writing is also worth investigating, going back to the original Nun study which analyzed autobiographies handwritten by nuns36.

Limitations

First, the results of this cross-sectional study do not exactly examine longitudinal predictions. To clarify the predictive relationships among basic cognitive skills assessed in early childhood, literacy and knowledge acquisition during school years, and idea density in young adults, longitudinal investigations following participants over about one and a half decades are needed. Second, the participants in this study were university students with normal or higher literacy achievement. To generalize the interpretation of our results to those with linguistic difficulties, investigations of a population with a wider range of linguistic abilities are required. Third, the relatively small sample size allowed us to administer statistical analyses that included only one dimension of kanji abilities and was not large enough to support modeling to include all dimensions. Large-sample investigations into more complex relationships among the variables focused on in this study are expected in the future. Fourth, the ceiling effects of a few cognitive measures possibly influenced the results of the statistical analyses. The high accuracy (mean = 97.23%; half of the participants achieved a perfect score) of the measure of phonological awareness, which is seen as the key predictor of literacy acquisition in alphabetic orthographies, suggests the possibility of underestimating the effect of this skill. For examining the effects of phonological awareness on higher-level language skills, e.g., text reading/writing, the development of measures that can assess individual differences in this skill in adults is required. Finally, this study did not measure basic writing skills with digital devices. In the future, it is expected that tests, especially standardized tests like the KABC-II43, to measure basic digital writing acquisition will be developed to clarify the relationship between digital writing skills and basic literacy, including handwriting, or between digital writing skills and higher-level language skills.

Conclusion

The current study revealed the relationships of Japanese literacy with its cognitive underpinnings as well as with higher-level language skills based on the three-dimensional view of kanji abilities, i.e., reading accuracy, writing accuracy, and semantic comprehension. First, among basic cognitive skills, naming speed, visuospatial processing, and syntactic processing underpin kanji abilities in a dimension-specific manner, whereas phonological processing plays the role of a common factor. Second, although all the dimensions of kanji abilities predicted acquired knowledge equally, higher-level writing skills on the text level measured as idea density were predicted by only the writing dimension indirectly via acquired knowledge. These results imply that writing acquisition during school years has a key role in the development of language skills through to adulthood. These findings warrant further research on the effect of handwriting practices, such as those investigating the younger population with linguistic difficulties, as well as those investigating the potential of the cognitive reserve to protect against the development of dementia in later life.