Introduction

Infants born with dense bilateral cataracts lack patterned vision until their sight is restored by cataract removal surgery. When surgery is performed late, i.e. beyond the first few weeks from birth, these individuals have been reported to suffer from permanent visual and multisensory impairments1,2,3,4,5. These impairments are hypothesized to be a behavioral consequence of neural system changes, resulting from aberrant sensory input within a sensitive period of development6,7. One way to assess visual and multisensory functional recovery in cataract reversal individuals is to test their susceptibility to well-known perceptual illusions. Perceptual illusions are typically extremely robust, suggesting that they arise from automatic processing principles8. Thus, the lack (or reduced likelihood) of perceiving a visual or multisensory illusion is indicative of impaired visual or multisensory processing, respectively.

Putzar et al.9 tested the automatic detection of illusory contours in congenital cataract reversal individuals who had undergone cataract surgery between 1 and 17 months of age. They observed deficits in individuals who experienced more than 5–6 months of visual deprivation, and interpreted this result as evidence for impairments in the automatic binding of visual features—i.e. visual feature binding. This finding was extended by McKyton et al.10 to a population of cataract reversal individuals who had undergone surgery only after the age of 5 years. These results suggested that early visual experience is required to acquire the neural circuits necessary for visual feature binding, providing evidence in favor of the sensitive period hypothesis. While this conclusion is compatible with the long developmental trajectory of illusory contour perception11, other studies replicated this finding for individuals treated for monocular12 (but not binocular) congenital cataracts.

Gandhi et al. tested the Ponzo and Müller-Lyer illusions, wherein equally long lines are perceived to be of different lengths depending on their surroundings13. Forty-eight hours after cataract surgery, they found a full recovery of this illusion13, despite the fact that sight was partially restored only at 8 years of age or later. Since both these illusions were thought to arise from the interpretation of two-dimensional perspective cues as three-dimensional depth, they concluded that this process did not depend on early childhood vision.

Illusions have additionally been used to investigate the extent of multisensory recovery following cataract surgery. In an initial study, Putzar et al.14 employed an audio-visual temporal capture effect, characterized as follows: if an auditory stimulus is presented with a short temporal offset with respect to a visual stimulus, the visual stimulus is often perceived as temporally shifted towards the time point when the sound was presented. This effect was significantly reduced in congenital cataract reversal individuals14. The residual visual impairments would have, according to the inverse efficiency rule of multisensory integration, predicted a larger capture in this group, suggesting that the multisensory binding process was impaired15. Similarly, cataract reversal individuals had a significantly reduced likelihood of showing the McGurk effect16. In this illusion, auditory speech, presented concurrently with an incongruent visual lip movement, produces a percept that matches neither the auditory nor the visual input. The absence of the McGurk effect in congenital cataract reversal individuals was subsequently shown to be related to a lack of multisensory enhancement in superior temporal brain regions, known to be essential for audio-visual speech perception17,18,19. Finally, visual motion after-effects (perception of stationary stimuli as moving in the direction opposite to previously presented moving stimuli) were found after auditory motion adaptation in cataract-reversal individuals20, i.e., a cross-modal after-effect which has not been found in normally sighted individuals. These results converge to the conclusion that multisensory binding processes do not fully recover after sight restoration in individuals with a history of congenital cataracts. Although recent studies with cataract reversal individuals have demonstrated recovery of multisensory redundancy effects2,21 and partial recovery for auditory-visual simultaneity judgements22, multisensory binding based on more complex features, such as speech14, seems to depend on early visual input.

It is currently unclear to what degree visuo-haptic and visuo-motor processing recovers after a transient phase of congenital visual deprivation. A developmental study with children found that visuo-haptic integration reaches adult-like performance only by 10 years of age23. Prior to that, children show signs of either vision or touch dominating visuo-tactile perception. Evidence from studies in a small number of individuals who had dense (but not necessarily congenital) bilateral cataracts suggested a quick emergence of visuo-haptic interactions after surgery, when these individuals were tested in vision-to-touch object matching tasks (n = 124; n = 525). However, later single case studies pointed towards impaired spatial representations for visuo-tactile localization after at least two years of visual deprivation26,27. Further, a study testing sight recovery individuals on an automatic imitation task, which mapped vision to motoric performance, found performance deficits even two years after surgery28.

A phenomenon observed in typical visuo-haptic development is the perception of the size-weight illusion (henceforth referred to as the SWI), which has been described as “immutable”29. The SWI is an illusion perceived when two unequally sized objects of the same weight are compared—the smaller object is perceived as being heavier than the larger one.

The “classical” SWI is assumed to require the integration of visual and haptic input (for review, see Dijker)30. Though the SWI has been documented for a long time, there is still a debate on whether it occurs due to conflicting sensorimotor input, or is a purely cognitive effect due to a mismatch in expectations31. At present, the contribution of early visual experience to the occurrence of the illusion is unclear.

Several lines of evidence have suggested a crucial role of ongoing visual input in the perception of the SWI. First, the SWI was reported to disappear when visual cues were not presented to normally sighted participants, even if they were allowed to access them beforehand, suggesting a crucial role of continued visual perception for the illusion, and providing a strong argument that the SWI reflects visuo-haptic integration32. Additionally, the SWI was observed to increase with an increase in visual disparity between sizes; if two objects of the same weight had a greater difference between their visually perceived sizes, the illusion perceived was stronger33. This was found to be true even in the absence of haptically perceived size differences, when visually perceived size was varied using objects with adjustable heights but constant surface area34. Finally, in a rapid adaptation study using functional MRI with an SWI task, the ventral premotor area (PMv) responded more when the SWI was perceived, i.e. when participants compared the weights of two objects of different sizes and the same weight, than when participants compared objects of the same size and weight35. Importantly, the PMv did not show an independent adaptation to size and weight properties, but adapted to the combination of these properties, therefore providing neural evidence for the integration of concurrent but separate sensory input. These results suggest that early visual deprivation might affect the SWI, if the ability to integrate visual and haptic cues does not recover.

The SWI was observed to occur when size information was perceived exclusively visually (i.e. using a string set-up to weigh objects, preventing individuals from using haptic cues to estimate size) or exclusively haptically (i.e. blindfolding individuals to prevent them from accessing visual estimates of size)36,37. However, both these studies reported that the SWI was smaller when the task presented sighted individuals with size cues that were exclusively visual, as compared to when the task was haptic or visuo-haptic. Moreover, congenitally blind individuals have been reported to experience the SWI as well37,38. These results would lead us to hypothesize that the SWI emerges independent of visual input, and thus, should manifest unimpaired in sight-recovery individuals too—at least if the objects are perceived exclusively haptically. However, there might be some competition between belatedly available visual input and other sensory modalities39,40, which might worsen visual performance14,20. Consistent with this assumption, a reduced visuo-haptic SWI was reported in visually impaired individuals with some residual visual capabilities41. Therefore, it could be hypothesized that the visuo-haptic SWI, both with and without haptic size information, is reduced in individuals who recover from transient congenital or developmental visual deprivation.

The second account for the occurrence of the SWI has proposed that this illusion is a result of violated expectations, and originates from top-down rather than bottom up processes42,43,44,45. If the SWI is a result of exclusively visual statistics gathered over the lifespan—i.e. an increased amount of force on the larger object because of the belief that smaller objects should weigh less—then individuals who have not had access to vision early in life, when tested with and without haptic size information, should either not manifest the illusion, or perceive it to a lower degree.

In order to test for the dependence of visuo-haptic interactions on early visual input, we performed two experiments using an SWI paradigm employed by Buckingham and Goodale, wherein subjects were asked to rate cubes across three sizes (Small, Medium, Large) and two weights (350 g, 700 g) on how heavy they thought they were42. The first experiment tested the “classic” visuo-haptic SWI, that is the SWI when simultaneously receiving visual and haptic size information. A group of dense bilateral congenital cataract reversal individuals (CC) as well as a group of individuals who had suffered developmental cataracts (DC) were tested, and compared to a group of normally sighted controls (SC). We additionally ran and separately analyzed data from two congenitally blind (CB) individuals to pilot whether we would replicate the findings from Ellis and Lederman37. Such an illusion would necessarily be based on exclusively haptic estimates of size. The second experiment used a string set-up to ensure only visual size cues were available to participants, in order to isolate the visual contribution to the SWI36,37,46. A group of congenital cataract reversal individuals (CC) and a group of normally sighted controls (SC) were compared. We hypothesized that in Experiment 1, the SWI would be impaired in CC compared to SC and DC individuals, due to aberrant visual experience interfering with multisensory integration39,40, and that in Experiment 2, CC individuals would not experience the SWI when deprived of haptic size information37,46.

Results

Sight-recovery individuals show an intact visuo-haptic SWI (Experiment 1)

Experiment 1 allowed participants to access both visual and haptic estimates of size (Fig. 1A). When z-scored weight ratings were assessed in a size-by-weight-by-group analysis of variance (ANOVA with repeated measures), all groups performed the task in a principled manner, as indicated by a main effect of weight, wherein the 700 g weight was rated as heavier than the 350 g weight (F(1,23) = 639.15, p < 0.001, ηg2 = 0.85).

Figure 1
figure 1

(A) Large-sized cube stimulus placed in the dominant hand, demonstrating the procedure for Experiment 1. (B) The six stimuli used in the study (front to back: Small, Medium, Large, 350 g and 700 g respectively), with attachments for Experiment 2. (C) String set-up of Experiment 2. A smooth ribbon is used to lift the cube during each trial, ensuring that only visual size cues are available to the participant.

There was a main effect of size, demonstrating the presence of the SWI (F(2,46) = 147.11, p < 0.001, ηg2 = 0.74), i.e. participants perceived smaller sized objects of the same weight to be heavier. Crucially, the group-by-size (F(4,46) = 0.96, p = 0.419, ηg2 = 0.03), group-by-weight (F(2,23) = 2.63, p = 0.094, ηg2 = 0.04), and size-by-weight-by-group interactions (F(4,46) = 1.64, p = 0.181, ηg2 = 0.05) were all not significant. Therefore, CC individuals displayed an indistinguishable SWI from DC and SC individuals (Fig. 2).

Figure 2
figure 2

Experiment 1 (with visual and haptic size information); z-scored weight ratings of all individuals in the Sighted Control (SC, yellow), the Developmental Cataract group (DC, blue), the Congenital Cataract group (CC, orange), and two Congenitally Blind (CB, red) individuals, t (A) for the 350 g and (B) for 700 g weights (Experiment 1).

We found a size-by-weight interaction, indicating that the illusion was stronger for the 700 g than the 350 g weight (F(2,46) = 11.82, p = 0.001, ηg2 = 0.15). In order to follow up on this effect, two post-hoc 3 (size) × 3 (group) repeated measures ANOVAs were conducted, separated by the weight condition. There was a main effect of size for both the 350 g (F(2,46) = 59.08, p < 0.001, ηg2 = 0.66) and the 700 g (F(2,46) = 108.24, p < 0.001, ηg2 = 0.79) weights, demonstrating that the SWI was highly significant for both weights in all groups. Again, we found no group-by-size interaction (350 g: F(4,46) = 0.57, p = 0.63, ηg2 = 0.04; 700 g: F(4,46) = 1.74, p = 0.179, ηg2 = 0.11), indicating that the degree to which the three groups experienced the SWI was indistinguishable (Fig. 3).

Figure 3
figure 3

Experiment 1 (with visual and haptic size information); average z-scored weight ratings for the Small (S), Medium (M) and Large (L) cubes across the Sighted Control (SC), Developmental Cataract (DC) and Congenital Cataract (CC) groups, for the 350 g and 700 g weights (Experiment 1). Error bars depict standard error of mean (SEM).

Paired t-tests across the three groups confirmed that participants experienced the smaller cube as heavier than the medium (350 g: t(24) = 8.28, p < 0.001; 700 g: t(24) = 7.64, p < 0.001) and large cube (350 g: t(24) = 9.95, p < 0.001; 700 g: t(24) = 11.52, p < 0.001) of the same weight, and the medium sized cube as heavier than the large cube of the same weight (350 g: t(24) = 5.97, p < 0.001; 700 g: t(24) = 8.49, p < 0.001).

Both CB individuals tested showed an impressively clear SWI (Fig. 2). We found that their z-scores (Fig. 2) fell within the core range of the remaining groups. Thus, these two participants replicated prior results37,46.

SWI Index

In order to obtain a measure of illusion strength for each individual, we calculated an SWI Index by subtracting the mean z-scored weight judgment of the largest cube from that of the smallest cube, separately for each weight.

We conducted separate one way ANOVAs for the 350 g and 700 g weights to assess the effect of group on the calculated SWI Index. The CC, DC, and SC groups did not differ in the strength of the illusion—neither for the 350 g (F(2,23) = 0.72, p = 0.497, ηg2 = 0.06) nor the 700 g weights (F(2,23) = 0.74, p = 0.487, ηg2 = 0.06) (Fig. 3). We further tested these null findings against effect sizes obtained from studies in the literature. Based on a comparable study and task found for the 700 g weight, we used equivalence analyses47 to confirm that the SWI index was equivalent in the CC, DC and SC groups—i.e. the SWI indices were within the bounds obtained by Buckingham and Goodale42 (the presence of any meaningful effect of group was rejected with all p’s < 0.024, see Supporting Information S1).

The two CB individuals tested showed SWI indices within the range of the other groups (350 g: − 1.145 and − 1.034; 700 g: − 1.507 and − 0.646), demonstrating a full strength SWI.

Sight recovery individuals show an SWI with exclusively visual size estimates (Experiment 2)

Experiment 2 was performed as a follow up to the results from Experiment 1, in order to assess the occurrence of the SWI in the absence of haptic size cues (Fig. 1B,C). Since we found no group differences in Experiment 1, and our a priori hypothesis was restricted to the effects of transient congenital patterned visual deprivation on the development of the SWI, we ran two groups: CC and SC individuals. Further, given the limitations on recruitment of special populations, as our goal with this experiment was to isolate the visual contribution to the full-sized SWI observed in CC individuals, DC individuals were not tested. In light of the occurrence of a full-sized SWI with (necessarily) exclusively haptic size cues in CB individuals in Experiment 1 and prior studies37,46, we did not repeat an additional haptics-only condition.

In a group-by-weight-by-size ANOVA performed for Experiment 2, we obtained a main effect of weight (F(1,11) = 110.80, p < 0.001, ηg2 = 0.833), indicating that participants performed the task in a principled manner. Additionally, the main effect of size was significant (F(2,22) = 8.82, p = 0.009, ηg2 = 0.203), confirming an SWI with this task across groups (Fig. 4). The CC participants did not differ from the SC group in their performance on this task, as evidenced by the lack of a significant main effect of group (F(1,11) = 0.158, p = 0.999, ηg2 < 0.001), and of significant group-level interactions (group-by-size: F(2,22) = 0.048, p = 0.861, ηg2 = 0.002; group-by-weight: F(1,11) = 1.866, p = 0.199, ηg2 = 0.077; group-by-size-by-weight: F(2,22) = 0.556, p = 0.509, ηg2 = 0.009).

Figure 4
figure 4

Experiment 2 (with only visual size information); z-scored weight ratings of all individuals in the Sighted Control group (SC, yellow) and Congenital Cataract group (CC, orange) groups (A) for the 350 g and (B) for 700 g weights (Experiment 2).

Across groups, participants experienced the smaller cube as heavier than the medium cube (350 g: t(12) = 2.03, p = 0.056; 700 g: t(12) = 2.02, p = 0.066) as well as the large cube (350 g: t(12) = 2.74, p = 0.018; 700 g: t(12) = 2.86, p = 0.014) of the same weight, and the medium sized cube as heavier than the large cube of the same weight (350 g: t(12) = 2.62, p = 0.023; 700 g: t(12) = 3.03, p = 0.010), confirming the presence of the SWI with this paradigm (Fig. 5).

Figure 5
figure 5

Experiment 2 (with only visual size information); average z-scored weight ratings for the Small (S), Medium (M) and Large (L) cubes for the Sighted Control group (SC) and Congenital Cataract group (CC), for the 350 g and 700 g weights. Error bars depict standard error of mean (SEM).

SWI Index

The CC and SC groups did not differ in the strength of the illusion for either the 350 g (F(1,11) = 0.169, p = 0.688, ηg2 = 0.015), or the 700 g weights (F(1,11) = 0.001, p = 0.971, ηg2 < 0.001) (Fig. 5). When compared using equivalence testing for the 700 g weight, conducted based on available effect sizes in the literature, we found the SWI indices to be equivalent, significantly rejecting any effect of group (both p’s < 0.006, see Supporting Information S1).

The visual contribution to the SWI is identical in sight recovery and sighted individuals

We compared the SC and CC groups of Experiment 1 and 2 in a group-by-weight-by-experimental task ANOVA, in order to assess whether the groups differed between how they performed when both visual and haptic size estimates were available, compared to when only visual size information was available. We confirmed that the SWI was significantly stronger in Experiment 1 compared to Experiment 2 (main effect of Experimental Task: F(1,26) = 56.396, p < 0.001, ηg2 = 0.511), and stronger for the 700 g than the 350 g weight (main effect of Weight: F(1,26) = 9.089, p = 0.006, ηg2 = 0.153) (Fig. 6). The group-by-experiment interaction (F(1,26) = 0.357, p = 0.555, ηg2 = 0.006), group-by-weight interaction (F(1,26) = 1.083, p = 0.307, ηg2 = 0.021), and group-by-weight-by-experiment interaction were all non-significant (F(1,26) = 0.631, p = 0.424, ηg2 = 0.012), confirming that in both experiments CC individuals perceived an indistinguishably strong SWI from SC individuals, suggesting that the “visual” contribution to the SWI did not differ in the two groups.

Figure 6
figure 6

Size-Weight Illusion Indices across groups and for the two CB individuals for Experiment 1 (with visual and haptic size information, black bars, labelled VH) and Experiment 2 (with only visual size information, gray bars, labelled V), for the 350 g and 700 g weights. The y-Axis depicts the difference in z-scores between the large and small cubes of each weight, averaged for each group (CC, DC, and SC). Error bars depict the SEM.

Relationship between strength of the SWI and the duration of visual deprivation

To test for a possible effect of duration of patterned visual deprivation on the strength of the SWI, we calculated the correlation between age at surgery and the average SWI Index (across 350 g and 700 g) for CC individuals. This correlation was not significant neither for Experiment 1 (r = 0.05, t(4) = 0.10, p = 0.463), nor did the correlation reach significance for Experiment 2 (r = − 0.76, t(4) = − 2.357, p = 0.078). Additionally, no correlation was observed between illusion size and age at time tested in either group (see Supporting Information S4.3).

Discussion

The present study investigated whether the manifestation of the size-weight illusion (SWI) depends on patterned visual experience after birth. We tested sight recovery individuals with a history of dense bilateral congenital cataracts, and compared this group to sight recovery individuals with a history of developmental cataracts, as well as a group of normally sighted controls. Our results demonstrated a significant “classical” SWI (with visual and haptic size information available) in all groups; indeed, the size of the SWI was indistinguishable between the three groups. Furthermore, we replicated previous results from Ellis and Lederman37 and showed that two permanently congenitally blind individuals experienced the SWI to a degree that fell within the range of all the other participants.

We additionally used a string set-up in Experiment 2 to test whether the CC group used visual size cues, rather than relying only on haptic size cues in Experiment 146. As in Experiment 1, the SWI experienced with exclusively visual size information was equivalent across sight recovery individuals with a history of congenital cataracts, and sighted controls. Together, these results suggest that the visuo-haptic SWI is resilient to atypical visual experience after birth.

No sensitive period effects for visuo-haptic integration as tested by the SWI

A previous study of individuals who were operated upon for dense bilateral congenital cataracts reported that in an object matching task conducted two days-post-surgery, while unimodal tactile and visual performance was observed to be at ceiling, tactile to visual mapping was found to be severely impaired (Held et al., n = 5)48. However, the authors observed that this ability rapidly improved over the next five days. A subsequent case study of sight recovery suggested that visuo-tactile processing recovers in object recognition and object matching tasks within three days of sight restoration, despite a lack of visual experience after birth (Chen et al., n = 1)24. As these studies tested participants closer to the date of surgery, and given that the sight recovery individuals in the present study were all tested one year from surgery (in order to exclude acute but transient effects of surgery) our findings are consistent with these existing studies on visuo-haptic object recognition. However, both Held et al. and Chen et al. tested visuo-haptic transfer through object matching tasks. By contrast, we provide evidence for the recovery of visuo-haptic integration (i.e. a unified percept by fusing input from both sensory modalities) despite early patterned visual deprivation, therefore extending these studies40,49. Our findings might be considered surprising in light of two prospective studies, which suggested a protracted developmental pathway for the SWI. The first study observed that the SWI increased in size after the age of 5 years43. They related this increase to the development of abstract reasoning skills. However, abstract reasoning explained no more than about 10% of the effect, and the SWI existed even in the youngest group. A second study showed that typically developing children did not optimally integrate visuo-haptic input in an adult-like manner until the age of 10 years23. However, optimal integration is typically defined as optimal cue integration as predicted by forced fusion models. These models weight individual cues according to their relative reliability, to derive a multisensory outcome. It has more recently been demonstrated that in situations where it is ambiguous whether or not to integrate sensory information, the data from children as young as 5 years of age, like those of adults, are better explained by causal inference models50. Given that some (n = 4) of our CC participants had been older than 10 years of age at the time of surgery, our results suggest that patterned vision during this period of multisensory development was not crucial for the typical manifestation of the SWI, with either visuo-haptic or only visual size information.

These results showing an indistinguishable SWI in sight recovery individuals, both with a congenital as well as a developmental history of transient blindness, provide evidence that the multisensory processes underlying the SWI are resilient to atypical visual experience. First, participants of both cataract groups still suffered visual impairments at the time of testing. Nevertheless, the SWI was not smaller in magnitude in either group, compared to normally sighted individuals. A smaller SWI in sight recovery individuals would have been expected from the aforementioned reliability-based optimal integration account51. Second, neither years of blindness nor the timing of the transient phase of blindness (developmental vs. congenital) had a significant influence on the size of the SWI. Finally, the extent of the “visual” contribution to the SWI was indistinguishable between sight recovery and sighted individuals. This identical behavioral performance, regardless of atypical visual history across groups and tasks, provides strong evidence that visuo-haptic processing, as assessed with the SWI, does not rely on sensitive period plasticity to develop normally.

It is possible that different underlying neural mechanisms support the identically sized SWI in sight recovery individuals52. As sensitive periods are properties of neural circuits, further neuroimaging studies need to confirm whether visuo-haptic processing develops normally in the absence of typical visual experience6,53. Additionally, the absence of sensitive period effects in one tested behavior does not contradict the general role of sensitive period plasticity5,53,54,55,56. Disengaging functions which do and do not develop within sensitive periods will, in the long run, be essential to uncovering the general principles of functional brain development.

To the best of our knowledge, the present study is the first to conclusively show the manifestation of a full-strength SWI in sight recovery individuals, both for the condition with visual and haptic size information, as well as the condition with only visual size information. Further, strict criteria were used for the inclusion of sight recovery participants, in order to ensure high homogeneity of etiology within the CC and DC group. We chose a retrospective, developmental approach with individuals who underwent cataract reversal surgery before the age of 23 years, and were older than 8 years of age at the time of testing. After the age of 8 years, no further increase in the SWI had been observed in prospective studies43. The present study did not find any difference in the size of the SWI, neither for the visuo-haptic nor for the visual experiment. A significant, equivalent SWI was consistently perceived by individual participants across all groups, despite the fact that in the cataract groups, age at surgery, time since surgery and visual acuity at time of testing varied, potentially increasing between group differences. Our stringent inclusion criteria restricted the sample size within a special population, however, all individual subjects showed the SWI with established paradigms in a consistent pattern (Figs. 2,3), allowing us to interpret the lack of group differences and confirming them with equivalence testing. We consider these results to be highly robust evidence against the dependence of the SWI on early visual input, therefore suggesting the lack of a sensitive period for the development of the SWI.

Mechanisms of the SWI

What are the possible mechanisms by which the SWI could manifest in CC individuals? While our study does not allow us to disentangle the models explaining the occurrence of the SWI, interpreting our results in light of these hypotheses can shed light on the mechanisms of the SWI. Below, we engage with the dominant models of the SWI.

On one hand, it could be assumed that the SWI is a purely haptic process, as was consistent with earlier reports of the SWI manifesting in congenitally blind adults, and replicated in two congenitally bind adults in the present study37,38. However, a purely haptic account of the SWI would have predicted the absence of the SWI when haptic size cues are unavailable to sight recovery individuals. In fact, prior studies employing a string set-up in congenitally blind individuals, as expected, did not observe an SWI37,46. Instead, in the present study, sight recovery individuals perceived an SWI even when only visual size cues were available. Additionally, a recent study observed that congenitally blind individuals experience the SWI without haptic size cues, but when size estimates were obtained through echolocation46. Together, this evidence strongly argues against an exclusively haptic account of the SWI.

On the other hand, it has been suggested that the SWI is a multisensory phenomenon occurring due to a conflict between concurrent visual (size) and haptic (weight) sensory input30,33,57. Within this framework, our data suggest that while the SWI can develop through haptic input alone, it might nevertheless be modulated by visual input, due to the recovery of visuo-haptic processing despite atypical visual experience22,24,25. Indeed, full or partial recovery of visuo-tactile functions  has been reported, depending on the task. While prior studies have shown that in a simultaneity judgement task designed to test a unified multisensory percept, visuo-tactile performance was unimpaired despite a lack of early visual experience21,22, sight recovery individuals did not show normal visuo-tactile temporal order biases27,58. Additionally, our findings of a larger SWI when both visual and haptic size information was available than when only a visual size estimate was possible, in both sight recovery and sighted individuals, fit with a multisensory framework for the occurrence of the SWI36,37.

Conclusions

The occurrence of the Size-Weight Illusion (SWI), both when visual and haptic size information was available, as well as when only visual size information was assessable, was resilient to atypical visual experience within the first months and years of life. These results provide strong evidence that the visuo-haptic processes underlying the SWI do not require typical visual experience within a sensitive period for normal development. Further studies are needed to explore whether the SWI is supported by the same neural mechanisms in typical and atypical development, by employing neuroscience techniques35.

Methods

Ethical approval

All participants, as well as their legal guardians in case of minors, provided written and informed consent. Testing was conducted after obtaining ethical approval from the German Psychological Society (DGP) and the local ethics board of the Hyderabad Eye Research Foundation. All methods and tests were performed in accordance with the relevant guidelines and regulations of both collaborating institutions.

Experiment 1

Participants

We tested three groups of individuals. The first group consisted of seven individuals born with dense bilateral cataracts, who subsequently underwent cataract removal surgery (referred to as CC individuals: 3 females, 4 males; Age = 8–35 years, M = 21.2 years, SD = 9.8, Table 1). The CC individuals were diagnosed by ophthalmologists and optometrists at the LV Prasad Eye Institute (LVPEI) in Hyderabad (India). They were tested by the some of the authors, partially with the help of a translator, in English, Hindi or Telugu. The data from four additional CC individuals were excluded due to unwillingness to cooperate (n = 2) or a documented developmental delay (n = 2). Individuals were categorized as part of this group based on the presence of dense bilateral cataracts at birth, a pre-surgery visual acuity of counting fingers at 1 m or less (barring absorption of lenses), presence of nystagmus, occlusion of fundus/retina, and immediate family members who had also been diagnosed with dense bilateral congenital cataracts. Duration of blindness was calculated by subtracting the date of birth from the date of the first eye surgery (M = 13.21 years, SD = 8.24, Range = 2–23.05 years). One individual did not have his precise date of surgery information available (operated after 6 months of age), and was excluded from duration calculations. Visual acuity pre-surgery in the better eye ranged from a minimum of light perception (PL +) to a maximum of counting fingers close to the face (CFCF). One participant had been able to count fingers at a distance of 3 m pre-surgery. We included this participant due to clearly partially absorbed lenses (OD Visual Acuity: counting fingers at 1.5 m, OS Visual Acuity: counting fingers at 3 m). All other criteria such as nystagmus and family history pointed towards the presence of dense bilateral cataracts at birth. Visual acuity post-surgery in the better eye in this group ranged from a minimum of counting fingers at a distance of 1 m to a maximum of 20/40. All individuals included in this group lacked patterned vision at birth, in accordance with the criteria set by the WHO59.

Table 1 Participant characteristics for all sight recovery participants in Experiments 1 and 2.

The second group consisted of nine individuals who had either partial congenital cataracts or developmental cataracts, and were subsequently operated upon to remove the cataracts (referred to as DC individuals: 2 females, 7 males; Age = 8–37 years, M = 14.8 years, SD = 9.2, Table 1). The testing procedure was the same as that of the CC group. Comparing this group with the CC individuals allowed us to isolate effects caused by transient patterned visual deprivation  at birth from effects due to general visual impairments caused by a changed periphery. Visual acuity pre-surgery in the better eye ranged from following light to a maximum of 20/80. Visual acuity post-surgery in the better eye ranged from 20/1200 to 20/20. All individuals included in this group did not lack patterned vision at birth, but suffered from degraded visual input for some or all of their early childhood, therefore providing a control group for the possibility that any observed impairments of the CC group were not specific to visual input at birth, but due to degraded visual input at any stage of life.

The third group consisted of 10 individuals with normal or corrected-to-normal vision and who had no history of visual deficits or eye injuries (referred to as SC group: 7 females 3 males; Age = 19–36 years, M = 25.8 years, SD = 5.3). SC individuals were tested at the University of Hamburg, Hamburg, Germany, in German.

In addition to these three groups, we ran two congenitally blind individuals who had no more than light perception since birth and at the time of the study (referred to as CB individuals: 1 female, 1 male; Ages = 33 and 44 years). They were tested at the University of Hamburg, Germany, using German. Their data were analyzed separately due to the small group size. The purpose of including these two participants was to replicate the presence of the SWI in individuals who totally lack vision since birth, but not for statistical comparisons between groups37,46.

All individuals included in the data analysis reported no history of neurological or cognitive impairments. All participants were right handed, and used that hand to perform the task.

Stimuli and apparatus

Participants were tested using a free-rating, absolute-magnitude-estimation procedure: they freely chose a rating scale of their preference and estimated how much an object weighed on that scale42. This scale was adjustable during the course of the experiment.

Participants rated one of 6 gray plastic cubes that were placed on their palms. The cubes were small (5 cm side length), medium (7.5 cm side length) or large (10 cm side length), and had one of two different weights (either 350 g or 700 g) (Fig. 1). The weight was invisibly fixed in one corner, with the rest of the inside being hollow to ensure the same distribution of weight in each cube. Participants were instructed to hold the dominant arm bent at 90 degrees, and for each trial, the cube was placed on the palm of their dominant hand such that it was clearly visible. Participants were required to lift each cube for approximately 15 s, and to judge its weight. Upon the rating response, the experimenter removed the cube from the participants’ hand.

We used random orders, and in one run, each cube was lifted 5 times. We repeated this across two runs, leading to a total of 60 trials. For participants who did not complete 60 trials, we used only the completed run of 30 trials (CC: n = 1; DC: n = 3). Participants were instructed not to rotate the arm for additional sensory cues, and not to throw the cube in the air and catch it again. This procedure took around 30 minutes in total.

Data analysis

In order to compare subjective judgements across participants, rating scores were z-transformed within each participant. This was done by subtracting the individual’s mean weight judgement from each weight judgement, and dividing by the standard deviation42. Due to this z-scoring, all weight judgements reflect deviations from the same mean (zero), with higher z values indicating heavier weight judgements. In order to be included in the data analysis, participants had to consistently rate the 350 g cubes as lighter than the 700 g cubes, to exclude the possibility of a response bias as opposed to a principled difference in perceived weight due to size. This was true of all participants in Experiment 1.

We used frequentist statistics to analyze the data. Z-scores across participants were submitted to a mixed ANOVA. Our model considered two within-group factors—namely, weight (2 levels: 350 g, 700 g) and size (3 levels: small, medium, large), and one between-group factor—group (3 levels: CC, DC, and SC) in a repeated measures ANOVA. Levene’s test for Homogeneity of Variance was performed on the z-score data to ensure that the scores do not violate the assumption of equal variance for parametric testing, due to unequal sample sizes between groups (F(2,23) = 0.27, p = 0.764). Post-hoc ANOVAs and t-tests were performed according to the resulting interactions, and post-hoc equivalence testing was conducted to confirm the results (Supplementary Information S1).

All analyses were conducted in R (version 3.3.2), using the ez-package (https://github.com/mike-lawrence/ez). This package corrects for violations of sphericity when there are more 2 levels in the within subject variable (size) via the Greenhouse–Geisser correction. All effect sizes reported are generalized eta squared (ηg2) values.

This study was not pre-registered, and sample sizes were limited by strict inclusion criteria within a special population.

Experiment 2

Data for this experiment was collected at LVPEI, Hyderabad, India, by the some of the authors, partially with the help of a translator, in English, Hindi or Telugu.

Participants

The CC group consisted of six individuals defined and classified the same way as in Experiment 1 ( 1 female, 5 males; Age = 17–44.7 years, M = 27.67 years, SD = 12.37; Duration of blindness = 2–22.01 years, M = 12.98 years, SD = 7.28, Table 1). An additional CC participant was excluded as they did not consistently rate the 350 g cubes as less heavy than the 700 g cubes, indicating that they were not performing the task in a principled manner, possibly due to translation issues (see Supporting Information S2). For four out of six included participants, visual acuity pre-surgery in the better eye ranged from a minimum of counting fingers at 1 m to a maximum of 20/300, with a history of partially absorbed lenses in all four of them. All other criteria, such as presence of nystagmus and family history, pointed towards the presence of dense bilateral cataracts at birth. For the remaining two participants, visual acuity pre-surgery was unknown, but based on the combination of a family history of dense congenital cataracts, very poor visual acuity post-surgery, nystagmus and esotropia, these participants were classified as having dense bilateral congenital cataracts. Visual acuity post-surgery in the better eye in this group ranged from a minimum of 20/400 to a maximum of 20/125.

The SC group consisted of seven individuals with normal or corrected-to-normal vision, with no history of eye injuries or abnormalities (5 females, 2 males; Age = 21–29 years, M = 24.13, SD = 3.08).

Stimuli and apparatus

Participants used a white, smooth ribbon to hold and lift one of the same six cubes used in Experiment 1, by pulling it with their dominant hand. The ribbon was wrapped around a metal ring fixed to a wall, in order to minimize friction that could possibly affect weight judgements, and allow participants to estimate the weight of the cube while it was suspended at eye level (Fig. 1). An important experimental consideration for the use of a string/handle set-up to test the SWI in sight recovery individuals was to control for the precision of the visual cues provided, due to residual visual impairments in visually impaired individuals5. Therefore, viewing distance was determined by each CC and SC participant based on how comfortable they were seeing the cube. However, viewing distance was not significantly different between groups (t(4) = 1.656, p = 0.137; CC: Mean = 58 cm, SD = 8.69 cm, Range = 50–70 cm; SC: Mean = 64.71 cm, SD = 5.19 cm, Range = 60–73 cm). This was done to minimize the possibility of potential differences in illusion size being confounded with differences in visual acuity, due to viewing at a fixed distance. Participants were instructed identically to Experiment 1 described above, and the experimenter placed and removed the cubes from the apparatus for each trial to prevent the participant from having any haptic contact with the stimuli.

Participants were not permitted to haptically handle the cubes at any time, before or during the course of the task, and were naïve to how many sizes and weights were presented. A post-study questionnaire recorded their estimates for how many weights and sizes were presented (see Supporting Information S3).

The same random trial orders used in Experiment 1 were used for Experiment 2, and 60 trials were completed per participant.

Data analysis

Z-scores were calculated using a procedure identical to the one described for Experiment 1 above.

The ANOVA model comprised two within-group factors (Size: 3 levels, Weight: 2 levels) and one between-group factor with 2 levels (CC, SC). Post-hoc ANOVAs and t-tests were performed according to the resulting interactions.

Additionally, a cross-experiment ANOVA with group and experimental task as between- subject factors and weight as a within- subject factor (group: 2 levels for SC and CC; task: 2 levels for Experiment 1 and 2) was performed.