Intact word processing in developmental prosopagnosia

Burns, Edwin J.; Bennetts, Rachel J.; Bate, Sarah; Wright, Victoria C.; Weidemann, Christoph T.; Tree, Jeremy J.

doi:10.1038/s41598-017-01917-8

Download PDF

Article
Open access
Published: 10 May 2017

Intact word processing in developmental prosopagnosia

Scientific Reports volume 7, Article number: 1683 (2017) Cite this article

4203 Accesses
45 Citations
3 Altmetric
Metrics details

Subjects

Abstract

A wealth of evidence from behavioural, neuropsychological and neuroimaging research supports the view that face recognition is reliant upon a domain-specific network that does not process words. In contrast, the recent many-to-many model of visual recognition posits that brain areas involved in word and face recognition are functionally integrated. Developmental prosopagnosia (DP) is characterised by severe deficits in the recognition of faces, which the many-to-many model predicts should negatively affect word recognition. Alternatively, domain-specific accounts suggest that impairments in face and word processing need not go hand in hand. To test these possibilities, we ran a battery of 7 tasks examining word processing in a group of DP cases and controls. One of our prosopagnosia cases exhibited a severe reading impairment with delayed response times during reading aloud tasks, but not lexical decision tasks. Overall, however, we found no evidence of global word processing deficits in DP, consistent with a dissociation account for face and word processing.

Normal colour perception in developmental prosopagnosia

Article Open access 02 July 2021

Normal recognition of famous voices in developmental prosopagnosia

Article Open access 12 November 2020

Both identity and non-identity face perception tasks predict developmental prosopagnosia and face recognition ability

Article Open access 19 March 2024

Introduction

The recent many-to-many model of visual recognition proposes that specialised brain regions for the processing of faces and words are functionally integrated^1,2,3; for example, areas specialised to recognise faces will also, to a lesser extent, contribute towards the recognition of words. The many-to-many model predicts that as a group, those with deficits in one area (e.g., face processing) should also show deficits in the other⁴ (e.g., word processing). Evidence for this view comes from patients with acquired prosopagnosia (AP), a disorder characterised by an inability to recognise faces following some form of trauma to the brain regions specialised for face processing; these cases have been shown to exhibit subtle word processing deficits³. Furthermore, individuals with alexia, a disorder associated with word processing deficits after damage to the brain areas specialised for processing words (typically the visual word form area: VWFA), have also been found to exhibit signs of face recognition impairment^{3, 5}. Taken together, these findings give prima facie support to the many-to-many model’s proposal that word and face recognition are functionally integrated.

In general, however, evidence of associated deficits is not as compelling as evidence of a dissociation⁶. Numerous studies have shown AP and alexia cases, with unilateral damage, to be spared in their respective word and face processing abilities^7,8,9,10,11. It has been suggested that the discrepancy in these results where a dissociation was found between word and face processing^7,8,9,10,11, and other work that identified associative deficits between the two domains³, is due to the latter’s testing of AP cases that also suffered from object recognition impairments: these cases were likely impaired at an earlier stage of visual processing, or had damage to cortical areas that not only processed faces, but also contributed towards the recognition of words. The obvious conclusion from these results is that face and word recognition are reliant upon specialised processes that do not overlap.

Prosopagnosia can also be developmental (DP) in nature, occurring in individuals with no history of brain damage^12,13,14. DP cases have been shown to exhibit reduced matter density and abnormal neural responses to faces throughout the brain’s face processing regions^{15,16,17,18,19}. Typically, the Warrington Recognition Memory Test for Words²⁰ has shown no evidence of word processing impairment in DP^21,22,23, although it only comprises a single study-test cycle and thus may be too crude to detect subtle reading impairments. More recently it has been shown that DP cases are apparently unimpaired when reading aloud words of various lengths^{24, 25} and single letters²⁴. These studies, however, comprised basic reading tasks which did not fully test word processing under a broad set of linguistic and perceptual demands.

Alexia cases exhibit abnormally slower reading latencies as word length increases, otherwise known as the word length effect (WLE)^{26, 27}. However, these impairments are directly linked to damage in the VWFA and the confusability of a word’s constituent letters, that is, how perceptually similar (confusable) each letter in the word is to other letters in the alphabet²⁶. For example, O is highly confusable due its similarity with C, G and Q, by contrast, X is low in confusability because of its dissimilarity to other letters²⁸. When a word’s summed confusability is controlled for across words of different lengths, the WLE is abolished in alexia²⁶. This suggests that alexia cases only exhibit abnormal WLEs due to the increasing confusability of a word’s constituent letters, rather than its actual length per se.

In this respect, it is maybe not surprising that DP cases evinced neurotypical reading abilities in recent studies where confusability was not controlled for^{24, 25}; those with alexia only exhibit an abnormal WLE as confusability increases with increasing word length. This fact suggests the need for DP cases to be thoroughly tested on a battery of tasks where confusability is carefully controlled for. If alexia and DP cases share similar deficits in their early perceptual processing of faces and words, then those with DP should show similarly elevated WLEs when the sum confusability of a word increases with word length. Conversely, we should also see those with DP exhibit neurotypical WLEs when asked to read aloud words where sum confusability is held constant as word length increases.

In addition to reading aloud, lexical decision tasks, where participants are asked to quickly decide whether a presented string of letters constitutes a valid word or not, are a popular tool to test word recognition²⁹. While neuropsychological evidence has shown that damage to the VWFA impairs reading aloud, lexical decision making is spared³⁰, suggesting a dissociation between these two tasks. However, despite reading words aloud and alexia being directly linked to the VWFA, neuroimaging research has suggested that reading relies more on the dorsal pathway, whereas lexical decisions are associated with a stronger involvement of the occipito-temporal cortex^31,32,33 which includes many of the face related cortical regions. A case could therefore be made that lexical decision tasks, rather than simple reading aloud tasks, might be better suited to testing the many-to-many model’s predictions of common word and face processing deficits in DP.

Similarly, DP cases are characterised by their very inability to retrieve confirmation that a face has been encountered before. We hypothesise that performance in lexical decision tasks, rather than naming tasks, might be more diagnostic of the common difficulties DP cases experience when judging facial identity. When participants see a word during a lexical decision task, they need to access the semantic memory system which stores facts about the world to confirm that they know that this word is a word³⁴. Recognition memory models typically posit that recognition works the same way for different types of stimuli, with words and faces both able to elicit a familiarity signal on which a recognition decision is based³⁵. There are a series of stages at which this type of recognition can fail for face stimuli in DP. In some cases, those with DP may fail to match the presented face to a previously stored representation due to poor perceptual processing of the face’s attributes. Other DP cases, however, are thought to be successful in perceptually activating this stored representation of the presented face³⁶. Instead, this success in perceptual processing somehow fails to connect downstream to the semantic memory store to confirm familiarity or to episodic memory where information such as when and where the face was previously encountered is stored. In this respect, if the face recognition system is integrated with word recognition, then we should expect to see those with DP exhibiting similar failures, either through mistakenly judging the lexicality of a visually presented word or non-word, or being slower in confirming word familiarity due to degraded perceptual processing. It should be noted that individuals with DP are generally able to confirm that a celebrity’s name is known to them. For example, after a famous faces test the experimenter will check whether the DP case has failed to recognise a particular face because of their face recognition problems, or simply because they do not know who the celebrity is. While this may indicate that DP cases are unimpaired at processing the familiarity of non-face stimuli, no study has yet confirmed this fact with a lexical decision task.

Confusability and word length place distinct perceptual demands upon the visual recognition system, however, this system can also be tested in its ability to process words of changing linguistic complexity. For example, the mere frequency of a word appearing in written language can crudely index one’s level of visual experience with that word. If DP is associated with deficits in their sensitivity to experience, then such deficits should not only impact their ability to identify famous faces, but also impair performance on word processing tasks where word frequency is varied. Similarly, the age at which one acquires a word has also been shown to affect reading performance³⁷. Age of Acquisition (AoA), however, is linked to word frequency, and both variables should therefore be examined jointly. Finally, a word’s orthographic neighbourhood is comprised of all other words that can be derived by changing one of its constituent letters (the size of a word’s neighbourhood is denoted by N)³⁸; for example, lob has the orthographic neighbours mob, log, lot and lab. Intriguingly, activity in the brain’s right hemisphere, which exhibits many of the neural abnormalities in DP^{15,16,17,18,19}, appears to be sensitive to N³⁹. Under the assumptions of the many-to-many model, one might therefore expect any linguistic deficits in DP to vary with N.

We tested the many-to-many model’s account of visual recognition by examining the performance of a group of DP cases on a comprehensive battery of 7 behavioural word recognition experiments. We label tasks where we vary word length across conditions as testing the role of perceptual information in word processing due to the fact that such a manipulation varies the physical length of our stimuli between trials. By contrast, any task that maintains the physical length of words while varying linguistic properties, such as frequency or AoA, will be labelled as testing the processing of linguistic information. We should add a caveat, however, that this classification is rather crude and is only meant to facilitate discussion of the different tasks. While the many-to-many model broadly predicts word processing impairments in prosopagnosia, it may be the case that these impairments only manifest themselves when demands are placed upon perceptual, rather than linguistic, processing. If DP cases were also impaired in linguistic processing, then it might indicate a much more basic, low level visual problem where words and faces are processed prior to functionally specialised regions. We therefore wanted to examine whether this was the case across perceptual and linguistic tasks. Each set of tasks consisted of one lexical decision task and a number of word reading tests.

Methods

Participants

The 11 DP cases that participated in the behavioural tasks were aged 20–73 years old (Mean = 41.55 years, 3 males). The 37 controls comprised of 2 groups: a younger group of 18 participants aged between 20–33 years (Mean = 23 years, 6 males) and an older group of 19 participants ranging from 56–77 years (Mean = 66 years, 7 male) to be roughly comparable to DP cases aged 32 years and younger or 52 years and older respectively. Due to the small numbers in each group, they were collapsed together for our analyses. All participants had normal or corrected to normal vision and were native English speakers. All controls and DP cases were either studying at, or had completed, university education. None of the controls reported difficulties in recognising faces, a fundamental criterion for prosopagnosia, and none of the participants had dyslexia. It should be noted that due to time constraints, not all DP cases completed all 7 behavioural tasks but their data is still included where possible. The study was given ethical approval by the Swansea University Research Ethics Committee. All methods were carried out in accordance with approved guidelines and required informed consent to be obtained from all participants.

Figure 1 lists the DP cases that participated in the experiments and their neuropsychological tests of face processing impairment, which included: a shortened Famous Faces Test⁴⁰ (FFT), the Cambridge Face Memory⁴¹ (CFMT), and the Cambridge Face Perception Test⁴² (CFPT), with further details found in the citations. We collected data for the shortened FFT from 164 participants (101 female) to ascertain normative means and SDs for the general population in the local geographical area (M = 94.6%, SD = 6.23). Normative scores for the CFMT and CFPT were taken from the cited literature. As can be seen from Fig. 1, all of our DP participants scored more than 2 SD below the control mean on the FFT and CFMT, with 4 showing impaired performance on the CFPT. As with previous DP research^43,44,45, our criteria for identifying DP cases required impairment on both the CFMT and FFT.

General Procedure

The seven experiments were completed in a random sequence for each participant. We analysed our data using mixed model ANOVAs, the purpose of which was to test the prediction that individuals with prosopagnosia should, as a population, exhibit word processing deficits⁴. To this end, we only report main effects or interactions involving the factor Group (controls vs. DP cases), with any follow up comparisons Bonferroni corrected. All response times were for correct responses and all group analyses two-tailed. Bayesian analyses were also performed to test the weight of evidence for the null hypothesis (Supplementary Information). Slope values for the word length effect²⁷ were calculated by regressing the response times and errors, with individual DP cases’ WLEs reported in the Supplementary Information. Additionally we used the Crawford’s t-test⁴⁶ to detect any abnormalities in individual DP cases’ performance. As we were testing the many-to-many model’s prediction that DP cases would exhibit global deficits in word recognition, we used a one-tailed test with 18 degrees of freedom to produce a critical t-value of 1.737 for the older DP cases; any individual with a t-value above this score will be identified as impaired. The critical t-value for the younger group with 17 degrees of freedom was 1.743. Any variables (e.g., bigram frequency) that were matched across conditions on any given task were confirmed as not being statistically different from one another. All word lists are provided on Scientific Reports’ website.

Impact of Perceptual Information (Word Length)

Lexical Decision: Length (word confusability not controlled)

Lexical decision tasks should reveal any difficulties DP cases may have in confirming word familiarity under perceptually demanding conditions of varying word length. Stimuli comprised 120 words and non-words. The 120 words consisted of 3 groups of 3-, 5- or 7-letters in length. Groups were matched for CELEX frequency, AoA (Bristol Norms) and bigram frequency. Mean bigram frequency merely means the frequency with which any pairs of adjacent letters found in a word occur within the printed English language. It was not possible to control for N across the 3 different letter length groups due to the inverse relationship between N and word length: 3-letter words avg. 13 neighbours, 5-letter words avg. 2.25 neighbours, 7-letter words avg. 0.2 neighbours. The 160 non-words were taken from the ARC Non-Word Database⁴⁷. Non-words were pseudowords matched with the respective word stimuli for string length, orthographic neighbours, and bigram frequency. Examples include treaps, grauds and guites.

Each trial began with a centrally presented black fixation cross for 2000 ms against a white background. Then one of the 160 word or 160 non-word targets was presented in black, replacing the fixation cross. Participants were required to judge as quickly and accurately as they could, whether each target was a word or non-word by pressing the appropriate response keys on a keyboard. Immediately after their response, an asterisk (*) appeared onscreen for 500 ms before the beginning of the next trial. Presentation of the stimuli was randomised and controlled using SuperLab Pro. Stimuli were presented in 24 point, lower-case Arial font. Prior to the experiment, participants were required to complete 12 practice trials (6 words and 6 non-words).

Reading Aloud: Length (word confusability not controlled)

Alexia cases are impaired when asked to read aloud words of different lengths where confusability is not controlled for. To test whether DP cases exhibit similar impairment, we designed the present task to mimic such conditions. Word stimuli were the same as in the previous task, however, the non-words were not used; all of our reading aloud stimuli lists comprised real words alone. Each trial was exactly the same as described for the previous task apart from the following details: instead of responding word or non-word by pressing response keys, participants were required to read the word aloud when it was presented on the screen. The targets remained on the screen until the participant responded. Vocal responses were detected using an SV-1 voice key (Cedrus Software). Due to the fact that the voice key could be triggered by any sound, participants’ responses were also checked for accuracy from separate recordings using a digital voice recorder. As for the lexical decision task, participants initially completed 6 practice trials.

Reading Aloud: Length (sum confusability maintained across words)

Alexia cases are spared when reading words of different lengths where the sum confusability of all words is maintained. To examine whether DP cases exhibit similar performance, we controlled the sum confusability of all words in this task so confusability was the same for each word regardless of word length. Stimuli were comprised of 120 words taken from prior work on summed confusability so that any abnormal performance by our DP cases could be interpreted with respect to their alexia cases²⁶. Words were matched on N, summed letter confusability and frequency while varying word length, with the 120 items comprising equal numbers of 5-, 6-, and 7-letter long words. The procedure was the same as the previous reading aloud task. Participants had to complete 6 practice trials prior to the experiment.

Reading Aloud: Length (average letter confusability maintained across words)

As mentioned, alexia cases are impaired when confusability increases across words of different lengths. We decided to better control this variable than in the second length task by maintaining the average letter confusability across words of different lengths. This will have the effect of increasing the average word confusability in a linear fashion as word length increases. If DP cases have similar difficulties in reading as those with alexia, then they should exhibit elevated WLE when attempting to read words in this condition. Stimuli comprised of 120 words again taken from prior work²⁶, and were matched on N, average letter confusability, and frequency. Length was varied with equal numbers of our 120 stimuli comprised of 5-, 6-, and 7-letter long words. The procedure was the same as for the previous reading aloud task and included 6 practice trials.

Impact of Linguistic Information

Lexical Decision: Frequency x Age of Acquisition (AoA)

Word frequency crudely indexes our visual experience with different words. As AoA influences performance where frequency is varied, we tested participants across words that varied in AoA too. Stimuli comprised 160 words and 160 non-words. The 160 words were divided into four orthogonal conditions according to AoA (early/late) and frequency of use (high/low): half of the words were early acquired (Mean = 5.37 years of age, earliest word = 3.7 years of age, latest word = 8.3 years of age), with the remainder acquired late (Mean = 9.29 years of age, earliest word = 6.7 years of age, latest word = 12.6 years of age; Bristol Norms⁴⁸). While there was some overlap between the highest and lowest AoA groups, this was necessary to still enable us to have distinct high and low frequency conditions. Within each condition (early/late AoA), half were high-frequency words and the remaining half were low-frequency. High frequency words had a word frequency score of >240 per million whereas low frequency words were <30 per million (CELEX database⁴⁹). Words were matched across all groups for length (in letters), number of orthographic neighbours (N) and mean bigram frequency. The 160 non-words were taken from the ARC Non-Word Database⁴⁷. Non-words were also assigned into four groups and matched with the respective word stimuli for string length, orthographic neighbours, and bigram frequency. The procedure was exactly the same as the previous lexical decision task.

Reading Aloud: Frequency x AoA

We used the word, but not non-word, stimuli from the above lexical decision task crossing word frequency with AoA in a reading aloud task using the same procedures as described for previous word naming tasks.

Reading Aloud: N Confusability

N has been shown to modulate reading performance in alexia cases when letter confusability is varied⁵⁰. We therefore examined reading performance across different levels of N and letter confusability. It should be noted that this task does place considerable perceptual demands upon the visual recognition system, so may not be as exclusively testing linguistic processing as our previous linguistic tasks. Stimuli consisted of 200, 4-letter long words taken from prior work on alexia cases so that our results could be comparable if our DP cases were abnormal⁵⁰. The words were varied by letter confusability and N. The words were split into 4 groups: 50 high confusability high N, 50 high confusability low N, 50 low confusability high N, and 50 low confusability low N. The cutoffs were: low N < 5, high N > 8, low confusability <0.45, and high confusability >0.53. Participants had to complete 10 practice trials prior to the main task. Procedure was the same as previous reading aloud tasks.

Results

Impact of Perceptual Information

Lexical Decision: Length (word confusability not controlled)

Figure 2 presents the results for the lexical decision task where word length was varied. To test for any possible effects of lexicality between the groups, response times were subjected to a mixed model ANOVA with Stimuli (words vs. non-words) as a within subject factor and with Group (controls vs. DP) as a between subject factor. No significant effect for the factor Group [F (1, 45) = 1.47, MSE = 68909, p = 0.23] nor any significant Group x Stimuli interaction was found [F (1, 45) = 0.001, MSE = 22, p = 0.98]. A similar ANOVA performed on the errors also revealed no effect for Group [F (1, 45) = 0.75, MSE = 12, p = 0.39], nor any Group x Stimuli [F (1, 45) = 0.15, MSE = 2, p = 0.7] interaction.

To examine any possible differences between response times across length of word stimuli, an ANOVA was performed with factors of Length (3-, 5- and 7-letters) as a within subject factor and Group (controls vs. DP) as between subject factors. No significant Group [F (1, 45) = 3.6, MSE = 107057, p = 0.064] effect was found, nor any significant Length x Group interaction [F (2, 90) = 1.76, MSE = 2039, p = 0.18]. Between group comparisons on the WLE slopes also indicated that there were no significant response time (DP: M = −16.44 ms/letter; Controls: M = 2.58 ms/letter, [t(45) = 1.32, p = 0.2]) or error related (DP: M = −.75 errors/letter; Controls: M = −1.23 errors/letter, [t(45) = 1.09, p = 0.28]) WLE differences between the groups. The same ANOVA performed on the errors yielded no significant effect of Group [F (1, 45) = 1.15, MSE = 6.25, p = 0.29] nor a Group x Length interaction [F (2, 90) = 0.79, MSE = 0.81, p = 0.46]. In summary, our analyses revealed that those with DP do not exhibit any deficits in lexical decisions as word length is varied.

Reading Aloud: Length (word confusability not controlled)

Figure 3 displays the results for the reading aloud task where word length was varied. A mixed model ANOVA was performed on the response times with Length (3-, 5- and 7-letters) as a within subject factor and with Group (controls vs. DP) as a between subject factor. No significant effect was found for Group [F (1, 45) = 0.41, MSE = 13187, p = 0.53] nor any significant Group x Length interaction [F (2, 90) = 0.39, MSE = 421, p = 0.68] either. A mixed model ANOVA performed on the errors revealed no main effect for Group [F (1, 45) = 0.8, MSE = 3.41, p = 0.38] nor any significant Group x Length interaction [F (2, 90) = 0.72, MSE = 0.49, p = 0.49]. Independent samples t-tests on the WLE slopes for the response times (DP: M = 14.64 ms/letter; Controls: M = 10.79 ms/letter, [t(45) = 0.38, p = 0.7]) and errors (DP: M = −0.05 errors/letter; Controls: M = 0.18 errors/letter, [t(45) = 1.02, p = 0.31]) found no significant WLE differences between the groups. In summary, the DP group exhibited no impairment in their performance when reading words of different lengths.

Reading Aloud: Length (sum confusability of letters maintained across words)

Figure 4 displays the results for the reading task where sum confusability was kept constant across varying word lengths. A mixed model ANOVA was performed on the response times with Length (5-, 6- and 7-letters) as a within subject factor and with Group (controls vs. DP) as a between subject factor. We found no significant Group effect [F (1, 45) = 0.11, MSE = 5343, p = 0.75] nor any Group x Length [F (2, 90) = 0.07, MSE = 62, p = 0.93] interaction. A similar ANOVA performed on the errors also produced no effect of Group [F (1, 45) = 1.35, MSE = 8.35, p = 0.25] nor any Group x Length [F (2, 90) = 1.01, MSE = 0.84, p = 0.37] interaction. Between group comparisons on the slopes showed no significant differences between the groups in their response time (DP: M = 20.33 ms/letter; Controls: M = 17.58 ms/letter, [t(45) = 0.31, p = 0.76]) nor error rate related (DP: M = 0.55 errors/letter; Controls: M = 0.23 errors/letter, [t(45) = 1.61, p = 0.11]) WLE. In summary, those with DP appear to have no impairment in their reading abilities across words of different lengths when controlling for sum confusability.

Reading Aloud: Length (average letter confusability maintained across words)

Figure 5 shows the results for the reading task where average confusability was kept constant as word length was varied. A mixed model ANOVA was performed on the response times with Length (5-, 6- and 7-letters) as a within subject factor and with Group (controls vs. DP) as a between subject factor. No significant main effect of Group was found [F (1, 45) = 0.96, MSE = 30264, p = 0.33], nor any significant Group x Length interaction [F (2, 90) = 0.83, MSE = 609, p = 0.44]. A similar ANOVA performed on the errors revealed no significant main effect for Group [F (1, 45) = 0.54, MSE = 4.33, p = 0.47]. The Group x Length interaction was not significant either [F (2, 90) = 1.91, MSE = 2.5, p = 0.15]. Between group comparisons on the slope values yielded no significant differences between the groups in their response time WLE (DP: M = 25.41 ms/letter; Controls: M = 18.36 ms/letter, [t(45) = 0.95, p = 0.35]) but the DP group exhibited an abnormal trend in their error related WLE (DP: M = 0.6 errors/letter; Controls: M = 0.1 errors/letter, [t(45) = 1.84, p = 0.073]). Visual inspection of Fig. 5 shows that this was due to the DP cases evincing superior performance in the 5- and 6-letter long word conditions, but comparable performance to the controls in the 7-letter condition. This suggests no apparent abnormalities in the DP group despite their elevated WLE. Overall, the DP cases did not exhibit any deficits when reading words of different lengths where average letter confusability was kept constant, as shown by their neurotypical response times and errors made.