Abstract
For normally sighted readers, word neighborhood size (i.e., the total number of words that can be formed from a single word by changing only one letter) has a facilitator effect on word recognition. When reading with central field loss (CFL) however, individual letters may not be correctly identified, leading to possible misidentifications and a reverse neighborhood size effect. Here we investigate this inhibitory effect of word neighborhood size on reading performance and whether it is modulated by word predictability and reading proficiency. Nineteen patients with binocular CFL from 32 to 89 years old (mean ± SD = 75 ± 15) read short sentences presented with the self-paced reading paradigm. Accuracy and reading time were measured for each target word read, along with its predictability, i.e., its probability of occurrence following the two preceding words in the sentence using a trigram analysis. Linear mixed effects models were then fit to estimate the individual contributions of word neighborhood size, predictability, frequency and length on accuracy and reading time, while taking patients’ reading proficiency into account. For the less proficient readers, who have given up daily reading as a consequence of their visual impairment, we found that the effect of neighborhood size was reversed compared to normally sighted readers and of higher amplitude than the effect of frequency. Furthermore, this inhibitory effect is of greater amplitude (up to 50% decrease in reading speed) when a word is not easily predictable because its chances to occur after the two preceding words in a specific sentence are rather low. Severely impaired patients with CFL often quit reading on a daily basis because this task becomes simply too exhausting. Based on our results, we envision lexical text simplification as a new alternative to promote effective rehabilitation in these patients. By increasing reading accessibility for those who struggle the most, text simplification might be used as an efficient rehabilitation tool and daily reading assistive technology, fostering overall reading ability and fluency through increased practice.
Similar content being viewed by others
Introduction
Individuals with Central Field Loss (CFL) induced by maculopathy experience severely impaired functional vision, leading to major incapacitating reading deficits1,2,3. Reading speed being a strong determinant of overall quality of life in low-vision patients of all ages, there is a real societal need to help promote their reading performance through rehabilitation and assistive reading technology4,5. Such initiative requires a better characterization of the underlying factors involved in their overall reading deficit, which remain poorly understood. The influence of psycholinguistic factors, for instance, have been generally overlooked in the context of low vision. In 2016, Calabrèse et al. investigated eye movement characteristics when low-vision patients with CFL read short sentences and showed that specific disruptions in fixation patterns occurred in the presence of low-frequency words6. This result alone suggests that psycholinguistic factors may be an important determinant of reading performance for low vision, opening the way for further investigations and potential use of text simplification for the visually impaired.
Word frequency and word neighborhood size are some of the most important lexical factors known to affect reading in normal vision7,8. However, whether findings from normally sighted readers can be applied to low-vision individuals is not obvious. In the case of CFL, visual input is deteriorated because of blur, distortion and/or occluded letters, and access to text is only partial (Fig. 1)9. For instance, when reading the word “father” with a central blind spot, some letters are totally or partially occluded by the scotoma. In addition, complete letters that fall out of the scotoma must be identified with eccentric portions of the retina, yielding degraded letter identification due to low acuity and strong crowding10. Therefore, incomplete and/or eccentric letters may not be correctly identified, leading to many possible misidentifications (“farmer”, “feather”, “halter”, etc.). Since bottom-up visual input is not reliable, patients must rely much more on top-down linguistic inference than normally sighted readers11,12,13. For this reason, the effect of lexical factors on reading performance should be rather different in visually impaired readers than reported before with normally sighted individuals.
A first attempt to characterize this effect has been made recently by Stolowy et al. who inspected the effect of word frequency on the reading performance of CFL individuals reading sentences in French14. The clear-cut frequency effects they reported on word reading time validated the hypothesis that, as for normally sighted readers, low-frequency words tend to decrease reading speed with CFL patients. However, the amplitude of this effect was much larger (differences in the range of seconds) than it had been reported before with normal vision (range of milliseconds). More interestingly, the same effect of frequency was not observed for all pairs of synonyms. For instance, the high-frequency word “utiliser” [use in English] was read on average more slowly than its lower frequency synonym “employer” [employ]. Such observation suggests that frequency cannot be the only predictive linguistic factor of reading speed with CFL. This reasoning was a cornerstone for the present work that aims at investigating two other psycholinguistic factors extensively studied in normal reading: orthographic similarity and word predictability.
The most common measure of orthographic similarity in the psychological literature is Coltheart’s orthographic neighborhood size metric ‘N’15. N measures the number of close orthographical neighbors of a stimulus word and is defined as the total number of words that can be formed by changing one letter, while preserving letter positions. For example, “shore” has many neighbors, including chore, score and share, while “neighbor” has 0 neighbors. For skilled readers with normal vision, N (i.e., word neighborhood size) seems to have a facilitator effect on word recognition: the more neighbors, the faster a word is identified—although this effect is now often assumed to be task dependent and language dependent (see16,17 for reviews). It is possible that the visual constraint imposed by the presence of a central scotoma, hiding portions of the text (i.e., letters) and forcing to use eccentric vision (low resolution), may lead to a reverse effect of neighborhood size. The lack of high resolution coupled with missing visual information, would lead CFL readers to confuse one word with its orthographic neighbors, creating even more uncertainty for words with large neighborhood size. Therefore, we hypothesize that, unlike normal vision, word neighborhood size has a negative effect on reading speed with CFL (Hypothesis 1). This first hypothesis has recently received support through some preliminary work of ours18.
In addition to frequency and orthographic similarity, word predictability counts as one of the most influential processing factors during word recognition with normal vision19. The predictability of a given word in a sentence (based on the context induced by the preceding words) has been shown to influence processing speed by enabling readers to make forward inferences (i.e. predictions about the probable upcoming word). Thus, as shown by eye movement studies, highly predictable words: (1) are skipped more often, (2) are more likely to be identified within a single fixation and (3) yield overall shorter fixations during sentence reading20,21. In the context of CFL, word predictability should also play an additional role by reducing uncertainty when identifying a word with many confusable neighbors. In the sentence “You should go for a walk along the shore” for instance, even if the word “shore” has many potentially confusable neighbors, most of them (such as chore, score and share) may be easily ruled out based on semantic context and forward inferences. Therefore, we hypothesize that the amplitude of the neighborhood size effect is influenced by word predictability: the more predictable a word is (thanks to sentence context), the smaller the effect of neighborhood size should be (Hypothesis 2).
Finally, for normally sighted adult readers, word predictability does influence reading processing differently depending on individual reading skills21,22. Simply put, less skilled readers rely more on context for word identification than highly skilled readers do. Based on this result, we assume that results of CFL individuals should also be influenced by their reading proficiency. Therefore, we hypothesize that the interaction between neighborhood size and word predictability (see Hypothesis 2) depends on patients’ reading proficiency. Our prediction is that this interaction will be more pronounced for less proficient readers than for proficient ones (Hypothesis 3).
In short, the purpose of this work is twofold: (1) to test the hypothesis that word neighborhood size exerts an inhibitory effect on reading performance with CFL (Hypothesis 1; Analyses 1 and 2) and (2) to test whether this effect is modulated by word predictability and reading proficiency (Hypothesis 2 and 3; Analysis 3).
Methods
Participants
19 patients (13 females) were recruited from the Low-Vision Clinic of the La Timone Hospital (Marseille, France) between March and June 2017. All presented a bilateral central scotoma with a monocular acuity of 0.4 logMAR (i.e., 20/50 or 4/10) or worse in their better eye. Patients with ophthalmologic disorders other than maculopathy (e.g. glaucoma), cognitive disorders or reading disorders present before their visual impairment were not included. The following information was collected for each participant: age, etiology, lens status, disease onset (duration of maculopathy in years), field loss (central only vs. central + peripheral), low-vision rehabilitation history. Additional information was collected regarding their reading habits, including: current daily reading time (in minutes), current/former profession, whether they considered themselves heavy readers before their impairment. Recruited participants ranged in age from 32 to 89 years (mean ± SD = 75 ± 15). Mean best-corrected visual acuity was 0.81 ± 0.28 logMAR. Details of the participants’ demographic, visual and reading characteristics are given in Table 1. The research was approved by the Ethics Committee of the French Society of Ophthalmology (IRB 00008855 Société Française d’Ophtalmologie IRB#1) and carried out in accordance with the Code of Ethics of the World Medical Association (Declaration of Helsinki). Informed consent was obtained from all participants after complete explanation of the nature and possible consequences of the study.
Apparatus and stimuli
Experiments were run at 60 Hz with an LCD HP LE2201W monitor (full display area: 1680 × 1050 pixels; 47.4 × 29.6 cm). Stimuli (i.e. sentences) were generated with the PsychoPy library23,24 and presented on a 1400 × 1050 pixel window that subtended 56° × 42° at 40 cm. Sentences were aligned to the left and displayed in Courier (non-proportional font) in black on a white background. Print size was chosen optimally for each participant as the value of his/her critical print size, measured before testing with a French computerized version of MNREAD25,26. Reading was monocular (eye with better visual acuity) with an appropriate correction for near vision (wide-field Metrovision lenses). Monocular vision allows to better control for individual eye characteristics and was shown to yield similar reading performance compared to binocular vision in patients with binocular AMD27.
Reading material
Reading material was created in French using ReSyf, a French lexicon with disambiguated and graded synonyms28 and Lexique3, a lexical database with word frequencies of standard written and oral French29. First, we created a pool of target words (Fig. 2—Step 1), by selecting 32 pairs of synonyms matching the following criteria: (1) equal length within a pair (from 3 to 8 characters); (2) difference in number of orthographic neighbors (N) comprised between 5 and 10 within a pair; (3) frequency ratio between a large neighborhood word and a its small neighborhood synonym could be either < 1 or > 1. Second, 32 pairs of matching sentences were created so that each word from a pair could fit within either sentence of the corresponding sentence pair (Fig. 2—Step 2). Three criteria were used: (1) all sentences had similar length (42 to 65 characters; mean ± SD = 54 ± 6), with a maximum difference of 5 characters within a pair; (2) pairs of sentences were specifically designed to fit the single and most frequent common sense of a synonym pair; (3) within each sentence, comprised of ‘n’ words, the target word could be located in any of these three locations: ‘n − 1’, ‘n − 2’, or ‘n − 3’. Target words were never presented in position ‘n’ to avoid any interference from a possible wrap-up effect14,30. At last, we created our final reading material by combining sentence pairs with their matching synonym pairs (Fig. 2—Step 3). In Condition 1, the first word of a pair was assigned to the first sentence of the corresponding pair, while the second word was assigned to the second sentence, thus creating 64 full sentences. In Condition 2, the “sentence—word” pairing was reversed to create a different set of 64 full sentences. These two experimental conditions allowed us to counterbalance any potential effect of the sentence structure and complexity (by randomly assigning participants to Condition 1 or 2), while providing two different measures of predictability for a single target word.
Reading procedure and experimental design
Sentences were presented within 4 blocks of 16 trials (8 pairs of sentences) each. Participants were randomly assigned to Condition 1 or 2 and read between two to four blocks, depending on their reading speed and level of fatigue. Sentences were displayed randomly within each block with non-cumulative self-paced reading, where sentences appear as a whole but with all words masked by strings of “ × ”31,32,33 (Fig. 3). As opposed to other reading paradigms specifically designed for low vision, such as “word mode”34, self-paced reading allows to present words individually while still maintaining a whole sentence view and therefore to remain closer to the visual constraints of natural reading (e.g., crowding). Participants were instructed to read each sentence aloud as quickly and accurately as possible while revealing each word one at a time using keyboard presses, with the possibility to unmask backward as many times as they wanted. When participants considered they had finished reading the sentence accurately (no matter which word was unmasked at that moment), they said the word “stop” and the experimenter stopped the trial. Prior testing, a training phase with short French proverbs was performed to familiarize the participant with the task and protocol. Reading accuracy (correct vs. incorrect) and total reading time (in seconds) were recorded for each target word. For words unmasked several times, reading time was obtained by summing the duration of all the unmasked periods.
Measure of word predictability
For each target word, we estimated its percentage of occurrence following the two preceding words in the sentence based on a large corpus of French texts. To do so, we selected all series of three consecutive words (i.e., 3-grams or trigrams) in our reading material that ended with a target word. Using the Google Books Ngram Viewer resource35, we extracted for each of these 128 trigrams (64 sentences × 2 conditions) its percentage of occurrence in the ‘French 2012’ corpus, a corpus of 792 118 digitized books published in French between 1800 and 2008. This measure will be referred to as the ‘trigram occurrence’.
Measures of reading proficiency
Prior to testing, each patient was asked to report the total duration of reading they performed on a typical day (in minutes per day; cf. Table 1). This value would include reading for both work and leisure, with or without visual aid systems, on all types of display (i.e., print, screen) and any kind of reading material (e.g., book, magazine, tag label, mail, recipe). Because of the large proportion of patients who reported to read 0 min/day (37%), this variable’s distribution was highly skewed and not suited to be used as a continuous variable. Therefore, we transformed it into the binary variable “Daily reading” (yes or no). Patients who reported to read at least a few minutes each day were categorized as Daily reading—yes; The other patients who reported to read 0 min daily were categorized as Daily reading—no. In addition to this measure of current reading proficiency, patients were also grouped based on their self-reported literacy level prior impairment through the variable “Former heavy reader” (yes or no).
Statistical analysis
Statistical analyses were carried out using R, a free software environment for statistical computing and graphics36. Reading accuracy (i.e. binary variable) was analyzed by fitting a generalized linear mixed-effects model (GLME; Analysis 1), while reading time (i.e. continuous variable) was analyzed with linear mixed-effects models (LME; Analyses 2 and 3), both allowing the modeling of experimental designs with unbalanced repeated measures37,38. Models were constructed with either target word accuracy or target word reading time as the dependent variable. Several factors of interest (Table 2) were included as independent variables to inspect their effect on accuracy and reading time, as well as their potential interaction with each other. The random structure of all models included a random intercept for participants, assuming a different “baseline” performance level for each individual, as well as random intercept for each target word. Before analysis, variables of interest were inspected and transformed as follow to satisfy the assumptions of parametric statistical tests39,40: reading time and word frequency were transformed in natural logarithm (ln) units, trigram occurrence was transformed with the ordered quantile normalization and context amount was square-root transformed. Word frequency and word length were centered around their mean. Optimal model structures were assessed using the Akaike Information Criterion (AIC) and likelihood-ratio tests41. Significance of the fixed effects was estimated using z-values for the GLME model and t-values for the LME models. Z- and t-values larger than 2 were considered significant, corresponding to a 5% significance level in a two-tailed test42,43. In the Results section fixed-effects estimates are reported along with their z- and t-values and 95% confidence intervals44.
Results
Analysis 1: Effect of neighborhood size on reading accuracy
On average, target words were read accurately 94% of the time, with individual variations ranging from 62 to 100% depending on patients. When all implemented in a GLME model, N, word frequency, word length and word predictability (expressed as trigram occurrence) showed no significant effect on accuracy (Table 3). As estimated by the model, percentage of accuracy for patients who continue reading on a daily basis was 99.1% (exp(4.716)/(1 + exp(4.716)) * 100) (z = 7.694; p < 0.001; 95%CI = [3.65; 6.17]). For patients who quit daily reading activities, percentage of accuracy was 97.3% (exp(4.716 − 1.126)/(1 + exp(4.716 − 1.126)) * 100). This 1.8% difference barely reached significance (z = − 2.064; p = 0.039; 95%CI = [− 2.31; − 0.03]). Figure 4 shows the null effect of N on accuracy, for these two groups of participants.
Analysis 2: Effect of neighborhood size on reading time
In this first LME model (Table 4), the respective effects of neighborhood size, word predictability and reading proficiency are estimated individually, without any interaction term, to test our Hypothesis 1. According to this simple model, words with zero neighbors were read on average in 2.3 s (exp(0.841)). Increasing the number of neighbors by 1 did increase reading time significantly, but moderately, by a factor of 1.01 (exp(0.013); t = 2.507, p = 0.013, 95%CI = [0.003; 0.023]; Fig. 5). In other words, increasing neighborhood size from 0 to 6 (the mean value in our pool of target words), increases reading time by a factor of 1.08 (exp(0.013)6), i.e. an 8% increase. Similarly, increasing neighborhood size from 0 to 10 (where most of our values lie), increases reading time by a factor of 1.14 (exp(0.013)10), i.e. a 14% increase. Word predictability (expressed as trigram occurrence) also showed a significant effect on reading time (t = − 4.129, p < 0.001, [− 0.17; − 0.06]). Age, acuity, disease onset, type of field loss and former heavy reader (which were dropped from the final model) showed no significant effect on reading time and no significant interaction with word neighborhood size.
Analysis 3: Effect of the interaction between neighborhood size, word predictability and reading proficiency on reading time
In this second LME model (Table 5), a 3-way interaction between neighborhood size, word predictability and reading proficiency was added to test our Hypotheses 2 and 3. According to this complex model, when trigram occurrence is at 0 (implying a highly infrequent trigram and low predictability; Fig. 6A), words with zero neighbors were read on average in 3.0 s (exp(1.099)) by patients who practice reading daily (Fig. 6A—blue dashed line). For this same group of readers, increasing the number of neighbors did not have a significant effect on reading time (t = − 1.043, p = 0.298; 95%CI = [− 0.02; 0.01]; Fig. 6A—blue dashed line). For patients who quit daily reading activities, average reading time of words with zero neighbors was 3.14 s (exp(1.099 + 0.044)) and was not significantly different from the ‘daily reading’ group estimate (t = 0.188, p = 0.853, 95%CI = [− 0.42; 0.52]; Fig. 6A—red solid line). However, for these participants who stopped reading on a daily basis, increasing the number of neighbors by 1 did increase reading time significantly by a factor of 1.07 (exp(0.07); t = 5.22, p < 0.001, 95%CI = [0.04; 0.10]; Fig. 6A—red solid line). In other words, for low predictability, increasing neighborhood size from 0 to 6 (the mean value in our pool of target words), increases reading time by a factor of 1.52 (exp(0.07)6), i.e. a 52% increase. Similarly, increasing neighborhood size from 0 to 10 (where most of our values lie), increases reading time by a factor of 2.01 (exp(0.07)10), i.e. a 101% increase.
As trigram occurrence increases to an average value (Fig. 6B) and a high value (Fig. 6C), the effect of neighborhood size remains null for the ‘daily reading’ group (t = 1.85; p = 0.066; 95%CI = [− 4e−04; 0.02]; blue dashed lines). However, for the other group of patients who stopped practicing reading daily (red solid lines), the amplitude of the neighborhood size effect is significantly reduced by a factor of 1.02 (exp(0.017); t = − 2.03; p = 0.043; 95%CI = [− 0.03; − 8e−04]) every time trigram occurrence increases by one unit. As given by a slopes post-hoc analysis, the amplitude of the effect for the group who quit daily reading (red solid line) was of 1.05 (exp(0.05); t = 5.39, p < 0.001) for average trigram occurrence values (Fig. 6B) and of 1.03 (exp(0.03); t = 2.77, p = 0.01) for highly frequent trigrams (Fig. 6C). In other words, for fairly frequent trigrams yielding average predictability, increasing neighborhood size from 0 to 10, increases reading time significantly by a factor of 1.65 (exp(0.05)10), i.e. a 65% increase, while the same increase in neighborhood size for highly frequent trigrams yielding high predictability, increases reading time significantly by a factor of only 1.35 (exp(0.03)10), i.e. a 35% increase.
Besides this significant 3-way interaction, word frequency also had a significant effect on reading time with a regression coefficient estimate of − 0.046 (t = − 3.39, p < 0.001, 95%CI = [− 0.07; − 0.02]). As both reading time and frequency are expressed in natural log units, multiplying frequency (in original units) by 10 multiplies reading time (in original units) by 0.90 (10−0.046), i.e. a 10% decrease. We found no significant interaction between frequency and the ‘daily reading’ factor, nor between frequency and the number of neighbors. Word length had no significant effect on reading time (t = 0.48, p = 0.633, 95%CI = [− 0.04; 0.07]). Similarly, age, acuity, disease onset, type of field loss and former heavy reader (all dropped from the final model) showed no significant effect on reading time and no significant interaction with word neighborhood size.
Discussion
The first goal of the present work was to test the hypothesis that the visual constraint imposed by the presence of a central scotoma leads to an inhibitory effect of neighborhood size during sentence reading. Therefore, we assessed the effect of word neighborhood size (N) on the reading performance of 19 patients with CFL, namely reading accuracy (Analysis 1) and reading time (Analysis 2). Our results show that N has no significant effect on accuracy, which ceils around 94% overall. On the other hand, we found a moderate inhibitory effect of N on reading time, with a 14% increase in word reading time (i.e., a 12% decrease in reading speed) when N goes from 0 to 10 neighbors (i.e., the range where most of our values lie). This result confirms our first hypothesis and builds up the recent report of a reversed neighborhood size effect for visually impaired individuals compared to normally sighted readers18.
Effects of orthographic neighbors on word identification have been extensively explored in readers with normal vision. Despite the many contradictory findings accumulated over the years, it is now accepted that the neighborhood size effect depends on the task and is modulated by the frequency of the neighbors themselves (i.e. the neighborhood frequency effect). Indeed, large neighborhood has consistently been reported to facilitate responses in a variety of tasks such as word naming45 and lexical decision46,47, but this facilitative effect seems to be restricted to low-frequency neighbors48. Despite their great interest to help understand the underlying mechanisms of lexical processing, these results are difficult to interpret in the context of our work with natural reading since they are restrained to isolated word identification.
Because semantic context may help decide between visually similar words, it is necessary to examine the effects of neighborhood (i.e., size and frequency) during natural reading in context to assess their influence on reading performance. To this end, Pollatsek et al. examined eye movement patterns of normally sighted readers during both a lexical decision task and silent reading, when target words varying in neighborhood size were embedded in neutral sentences49. Their overall conclusion was that, for silent reading, increasing the number of higher frequency neighbors had a clear inhibitory effect on word identification, whereas increasing the number of lower frequency neighbors may have a weak facilitative effect. In the present work, reading material was created without controlling for neighborhood frequency, but a post-hoc analysis revealed that most of our target words (82%) had a majority of low-frequency neighbors (from 60 to 100%; mean = 83 ± 14%). Based on this distribution, results from normal vision would predict a weak facilitative effect of neighborhood size on reading performance49. On the contrary, we found a reverse effect of neighborhood size, with a weak inhibitory effect. This result confirms our assumption that under degraded visual conditions, the lack of complete stimulus information will have more of an effect on words that are visually similar to many others than on words with few neighbors. It is likely that in the interest of time performance, readers may infer a word by guessing one of its high-frequency neighbors. However, because of the incongruousness between this guess and the overall sentence meaning, processing would persist until a better match, that fits both the visually identified letters and the meaning, is found. This would explain why reading accuracy remained very high among our participants, at the expense of reading time.
The second goal of the present work was to test the hypotheses that, with CFL, the effect of neighborhood size is modulated by both word predictability and reading proficiency. Therefore, we assessed the word neighborhood size effect through a 3-way interaction, including a measure of predictability (trigram occurrence) and proficiency (daily reading—yes or no) (Analysis 3). First, we found that the inhibitory effect of N is modulated by word predictability: the more familiar a sequence of words is, the weaker the effect of neighborhood size on the last word identification time. This result confirms our second hypothesis that the amplitude of the neighborhood size effect is influenced by word predictability.
However, we found that this is only true for the less proficient readers, who have stopped reading on a daily basis, confirming our third hypothesis that the interaction between neighborhood size and word predictability depends on patients’ reading proficiency. In short, we found that for the less proficient readers reaching a low-predictable word in a sentence, reading speed decreases by up to 50% (101% increase in reading time) when the number of neighbors goes from 0 to 10. As predictability increases, the amplitude of this effect lessens gradually, with 39% decrease in reading speed for average predictability (65% increase in reading time) and 26% decrease in reading speed for high predictability (35% increase in reading time). For proficient readers who reported to retain a daily leisure activity of reading however, even for a few minutes each day, the effect of N remains null, regardless of the word predictability. This result, close to what has been reported with normally sighted readers49, leads us to the conclusion that practice can help minimize the adverse effect of ambiguity induced by orthographic similarity when visual input is degraded and access to text is only partial because of maculopathy.
These results are particularly relevant in the context of low-vision rehabilitation, as they reinforce the need to provide patients with individualized readaptive care of functional vision, in order to help maintain daily reading practice. More importantly, our results suggest that text simplification might be a powerful way to leverage text accessibility for low-vision patients. Text simplification is the process of reducing the linguistic complexity of a text, while still retaining the original information and meaning50,51. Its main goal is to make a text more accessible to people with low literacy52 or individuals with reading disorders (e.g., aphasia53, dyslexia54). For the first time, our results show that it could also be used efficiently to improve low-vision rehabilitation, especially for the most impaired patients, who have stopped reading on a daily basis (36.8% in our population sample). For these less proficient readers, substituting complex words (i.e., words with many orthographic neighbors) with synonyms that have less neighbors and equal or higher frequency should reduce reading difficulty. Therefore, using simplified texts with increased accessibility as rehabilitation training material might help (1) improve overall reading ability and fluency, while (2) fostering the long-term motivation necessary to resume daily reading practice. Text simplification could then be used daily as an efficient reading aid (made available through tablets, e-readers, or web plugins) to keep practicing reading at home and enhance everyday reading performance.
As a side note, we would like to point to the fact that our results were obtained using the occurrence percentage of trigrams (3-grams, i.e., sequence of three words) in the French literature. It is worth mentioning that we also ran the analysis with other n-gram values, namely 2- and 4-grams and that neither of them showed significant effect. Since most of our target words were common nouns (89%; against 9.4% of adjectives and 1.6% of verbs), they were most likely preceded by an article that did not convey meaningful information. Therefore, an analysis based on 2-gram prediction was not likely to be meaningful. On the other hand, we expected 4-gram analysis to be highly significant. However, because we created our reading material so that target words were not too predictable, 77% of our 4-gram were highly infrequent (less than 40 occurrences across the ‘French 2012’ corpus) and rated at 0 percent occurrence by Google Ngram. We suspect that the absence of significant effect with 4-grams is due to this highly skewed distribution towards 0.
Overall, the present work presents some limitations that should be considered in future investigations of neighborhood effects on low vision. First, the range of participants’ age and disease onset should be expanded to better represent the full spectrum of adaptation exhibited following early and late onset CFL. It is possible that the absence of effect reported here may be due to our highly skewed distribution, with 16 participants between 70 and 89, against only 3 young individuals (aged 32, 48 and 59). Second, future investigations should also include measures of microperimetry (size and shape of the scotoma, fixation eccentricity, etc.) to take into account individual vision loss characteristics. Third, neighborhood frequency should be gauged thoroughly when designing the reading material in order to better control for its effect when assessing the effect of neighborhood size. Second, the definition of orthographic neighbor used in this work is letter-position-specific and length-dependent15. Given that letter position uncertainty is a crucial factor limiting peripheral word recognition, and reading without central vision in general55, Coltheart’s definition should probably be extended to include letter transposition (e.g., trial and trail), addition (e.g. trial and tribal) and deletion (e.g. trial and rial) in order to encompass a larger number of highly similar words. Finally, the results presented here should be interpreted cautiously in the context of reading under natural conditions, since they were obtained with a paradigm that does not allow word skipping, forcing participants to read each single word of a sentence56. Additionally, reading performance was measured with monocular vision, allowing to control for specific eye characteristics (e.g., lens status). Such approach is critical in research settings, but may not always mimic actual clinical conditions, where patients would read with one or two eyes, based on their own preference.
References
Brown, J. C. et al. Characterizing functional complaints in patients seeking outpatient low-vision services in the United States. Ophthalmology 121, 1655–1662 (2014).
Kanonidou, E. Reading performance and central field loss. Hippokratia 15, 103–108 (2011).
Chung, S. T. L. Reading in the presence of macular disease: a mini-review. Ophthal. Physiol Opt. 40, 171–186 (2020).
Murro, V. et al. Reading ability and quality of life in stargardt disease. Eur. J. Ophthalmol. 27, 740–745 (2017).
Pondorfer, S. G. et al. association of vision-related quality of life with visual function in age-related macular degeneration. Sci. Rep. 9, 15326 (2019).
Calabrèse, A., Bernard, J.-B., Faure, G., Hoffart, L. & Castet, E. Clustering of eye fixations: a new oculomotor determinant of reading speed in maculopathy. Invest. Ophthalmol. Vis. Sci. 57, 3192–3202 (2016).
Leroy, G. & Kauchak, D. The effect of word familiarity on actual and perceived text difficulty. J. Am. Med. Inform. Assoc. 21, e169–e172 (2014).
Adelman, J. S. & Brown, G. D. A. Phonographic neighbors, not orthographic neighbors, determine word naming latencies. Psychon. Bull. Rev. 14, 455–459 (2007).
Taylor, D. J., Edwards, L. A., Binns, A. M. & Crabb, D. P. Seeing it differently: self-reported description of vision loss in dry age-related macular degeneration. Ophthal. Physiol. Opt. 38, 98–105 (2018).
Chung, S. T. L. Learning to identify crowded letters: does it improve reading speed?. Vis. Res. 47, 3150–3159 (2007).
Bullimore, M. A. & Bailey, I. L. Reading and eye movements in age-related maculopathy. Optom. Vis. Sci. 72, 125–138 (1995).
Fine, E. M. & Peli, E. The role of context in reading with central field loss. Optom. Vis Sci. 73, 533–539 (1996).
Legge, G. E., Klitz, T. S. & Tjan, B. S. Mr. Chips: an ideal-observer model of reading. Psychol. Rev. 104, 524–553 (1997).
Stolowy, N. et al. The influence of word frequency on word reading speed when individuals with macular diseases read text. Vis. Res. 155, 1–10 (2019).
Coltheart, M., Davelaar, E., Jonasson, J. E. & Besner, D. Access to the internal lexicon. In Attention and Performance VI (ed. Dornio, S.) 535–555 (Academic Press, London, 1977).
Andrews, S. The effect of orthographic similarity on lexical retrieval: resolving neighborhood conflicts. Psychon. Bull. Rev. 4, 439–461 (1997).
Perea, M. & Martínez, E. The effects of orthographic neighborhood in reading and laboratory word identification tasks. Psicológica 21(3), 327–340 (2000).
Sauvan, L. et al. Text simplification to help individuals with low vision to read more fluently. In Workshop Tools and Resources to Empower People with Reading Difficulties (READI) at International conference on Language Resources and Evaluation. 27–32 (2020).
Rayner, K. Eye movements in reading and information processing: 20 years of research. Psychol. Bull. 124, 372–422 (1998).
Balota, D. A., Pollatsek, A. & Rayner, K. The interaction of contextual constraints and parafoveal visual information in reading. Cogn. Psychol. 17, 364–390 (1985).
Hawelka, S., Schuster, S., Gagl, B. & Hutzler, F. On forward inferences of fast and slow readers. An eye movement study. Sci. Rep. 5, 8432 (2015).
Ashby, J., Rayner, K. & Clifton, C. Eye movements of highly skilled and average readers: differential effects of frequency and predictability. Q. J. Exp. Psychol. A 58, 1065–1086 (2005).
Peirce, J. W. PsychoPy–psychophysics software in Python. J. Neurosci. Methods 162, 8–13 (2007).
Peirce, J. W. Generating stimuli for neuroscience using PsychoPy. Front. Neuroinform. 2, 10 (2009).
Calabrèse, A., Bernard, J.-B., Faure, G., Hoffart, L. & Castet, E. Eye movements and reading speed in macular disease: the shrinking perceptual span hypothesis requires and is supported by a mediation analysis. Invest. Ophthalmol. Vis. Sci. 55, 3638–3645 (2014).
Calabrèse, A., Mansfield, J. S., & Legge, G. E. mnreadR, an R Package to Analyze MNREAD Data. version 2.1.3 (accessed December 2020). https://CRAN.R-project.org/package=mnreadR (2019).
Kabanarou, S. A. & Rubin, G. S. Reading with central scotomas: is there a binocular gain? Optom. Vis. Sci. 83, 789–796 (2006).
Billami, M., François, T., & Gala, N. ReSyf: a French lexicon with ranked synonyms. In Proceedings of the 27th Conference on Computational Linguistics (COLING 2018), Santa Fe, USA, 2570–2581 (accessed December 2020). https://cental.uclouvain.be/resyf/ (2018).
New, B., Ferrand, L., Pallier, C. & Brysbaert, M. Reexamining the word length effect in visual word recognition: new evidence from the English Lexicon Project. Psychon. Bull. Rev. 13, 45–52 (2006).
Just, M. A. & Carpenter, P. A. A theory of reading: from eye fixations to comprehension. Psychol. Rev. 87, 329–354 (1980).
Aaronson, D. & Scarborough, H. S. Performance theories for sentence coding: some quantitative evidence. J. Exp. Psychol. Hum. Percept. Perform. 2, 56–70 (1976).
Mitchell, D. C. & Green, D. W. The effects of context and content on immediate processing in reading. Q. J. Exp. Psychol. 30, 609–636 (1978).
Just, M. A., Carpenter, P. A. & Woolley, J. D. Paradigms and processes in reading comprehension. J. Exp. Psychol. Gen. 111, 228–238 (1982).
Wallis, S., Yang, Y. & Anderson, S. J. Word Mode: a crowding-free reading protocol for individuals with macular disease. Sci. Rep. 8, 1241 (2018).
Michel, J.-B. et al. Quantitative analysis of culture using millions of digitized books. Science 331, 176–182 (2011).
R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (accessed December 2020). https://www.R-project.org/ (2018).
Bolker, B. M. et al. Generalized linear mixed models: a practical guide for ecology and evolution. Trends Ecol. Evol. 24(3), 127–135 (2009).
Cheng, J., Edwards, L. J., Maldonado-Molina, M. M., Komro, K. A. & Muller, K. E. Real longitudinal data analysis for real people: building a good enough mixed model. Stat. Med. 29, 504–520 (2010).
Tabachnick, B. G., Fidell, L. S. & Ullman, J. B. Using Multivariate Statistics (Pearson, Boston, 2007).
Howell, D. C. Statistical Methods for Psychology (Cengage Learning, Boston, 2009).
Zuur, A. F., Ieno, E. N. & Elphick, C. S. A protocol for data exploration to avoid common statistical problems. Methods Ecol. Evol 1, 3–14 (2010).
Baayen, R. H., Davidson, D. J. & Bates, D. M. Mixed-effects modeling with crossed random effects for subjects and items. J. Mem. Lang. 59, 390–412 (2008).
Gelman, A. & Hill, J. Data Analysis Using Regression and Multilevel/Hierarchical Models (Cambridge University Press, Cambridge, 2007).
Bates, D., Mächler, M., Bolker, B. & Walker, S. fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Andrews, S. Frequency and neighborhood effects on lexical access: activation or search? J. Exp. Psychol. Learn. Mem. Cognit. 15, 802–814 (1989).
Sears, C. R., Hino, Y. & Lupker, S. J. Neighborhood size and neighborhood frequency effects in word recognition. J. Exp. Psychol. Hum. Percept. Perform. 21, 876–900 (1995).
Forster, K. I. & Shen, D. No enemies in the neighborhood: absence of inhibitory neighborhood effects in lexical decision and semantic categorization. J. Exp. Psychol. Learn. Mem. Cogn. 22, 696–713 (1996).
Carreiras, M., Perea, M. & Grainger, J. Effects of orthographic neighborhood in visual word recognition: cross-task comparisons. J. Exp. Psychol. Learn. Mem. Cogn. 23, 857–871 (1997).
Pollatsek, A., Perea, M. & Binder, K. S. The effects of “neighborhood size” in reading and lexical decision. J. Exp. Psychol. Hum. Percept. Perform. 25, 1142–1158 (1999).
Siddharthan, A. A survey of research on text simplification. Int. J. Appl. Linguist. 165, 259–298 (2014).
Saggion, H. Automatic text simplification. Synth. Lect. Hum. Lang. Technol. 10, 1–137 (2017).
Candido, Jr., A. et al. Supporting the adaptation of texts for poor literacy readers: a text simplification editor for Brazilian Portuguese. In Proceedings of the Fourth Workshop on Innovative Use of NLP for Building Educational Applications 34–42 (2009).
Carroll, J. et al. Simplifying text for language-impaired readers. In Proceedings of the 9th Conference of the European Chapter of the Association for Computational Linguistics (EACL) 269–270 (1999).
Rello, L., et al. DysWebxia 2.0!: more accessible text for people with Dyslexia. In Proceedings of the 10th International Cross-Disciplinary Conference on Web Accessibility 25, 1–2 (2013).
Bernard, J.-B. & Castet, E. The optimal use of non-optimal letter information in foveal and parafoveal word recognition. Vis. Res. 155, 44–61 (2019).
Albrengues, C., Lavigne, F., Aguilar, C., Castet, E. & Vitu, F. Linguistic processes do not beat visuo-motor constraints, but they modulate where the eyes move regardless of word boundaries: evidence against top-down word-based eye-movement control during reading. PLoS ONE 14, 1–47 (2019).
Acknowledgements
This work was supported by the Fondation de France and by the Belgian FNRS.
Author information
Authors and Affiliations
Contributions
L.S., T.F. and N.G. prepared the linguistic material used for data collection. C.A. developed the program used for data collection. L.S. and N.S. collected the data. A.C. performed the data analysis and wrote the main manuscript text. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sauvan, L., Stolowy, N., Aguilar, C. et al. The inhibitory effect of word neighborhood size when reading with central field loss is modulated by word predictability and reading proficiency. Sci Rep 10, 21792 (2020). https://doi.org/10.1038/s41598-020-78420-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-78420-0
This article is cited by
-
Clustering analysis of factors affecting academic career of university students with dyslexia in Italy
Scientific Reports (2022)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.