Third-party punishment by preverbal infants

Kanakogi, Yasuhiro; Miyazaki, Michiko; Takahashi, Hideyuki; Yamamoto, Hiroki; Kobayashi, Tessei; Hiraki, Kazuo

doi:10.1038/s41562-022-01354-2

Article
Open access
Published: 09 June 2022

Nature Human Behaviour volume 6, pages 1234–1242 (2022)

Abstract

Third-party punishment of antisocial others is unique to humans and seems to be universal across cultures. However, its emergence in ontogeny remains unknown. We developed a participatory cognitive paradigm using gaze-contingency techniques, in which infants can use their gaze to affect agents displayed on a monitor. In this paradigm, fixation on an agent triggers the event of a stone crushing the agent. Throughout five experiments (total N = 120), we show that eight-month-old infants punished antisocial others. Specifically, infants increased their selective looks at the aggressor after watching aggressive interactions. Additionally, three control experiments excluded alternative interpretations of their selective gaze, suggesting that punishment-related decision-making influenced looking behaviour. These findings indicate that a disposition for third-party punishment of antisocial others emerges in early infancy and emphasize the importance of third-party punishment for human cooperation. This behavioural tendency may be a human trait acquired over the course of evolution.

Main

Third-party punishment is a disposition of individuals to punish transgressors or norm violators who have not harmed them directly, and it seems to be universal across cultures¹. The dominant explanation is that this disposition is a mechanism for maintaining cooperation^2,3,4,5,6. Third-party punishment is unique to humans^7,8 and has been well documented in adults^1,2,3. However, debates about its evolved propensity⁹ and motivations¹⁰ are ongoing, and its point of emergence in ontogeny remains unknown.

Previous research asserts that even 19-month-old toddlers are willing to punish antisocial individuals in third-party contexts by taking treats away from them¹¹. Young children are willing to incur a cost to avoid interacting with wrongdoers¹², intervene against or tattle on moral transgressions¹³, and seem to expect antisocial actions to be punished¹⁴. Moreover, children not only punish wrongdoers but also prioritize helping the victim¹⁵. For example, they return a resource to the victim rather than remove a resource from a thief when they have options to punish or help. By age six, children engage in costly third-party punishment; they sacrifice their own resources to punish a transgressor who has acted unfairly^16,17 and punish moral transgressors to satisfy both consequentialist and retributive motives¹⁸. However, to our knowledge, little to no work has investigated third-party punishment in preverbal infants, and thus its point of emergence in ontogeny remains unknown.

We focused on physical aggression, which is assumed to be salient to preverbal infants. It may therefore function as an intuitive form of punishment and be the most basic form of aggression that infants prefer to intervene against. We specifically focused on the hitting action¹⁹ and hitting interactions between agents^20,21,22. Infants can discriminate between caressing (positive) and hitting (negative) interactions involving two agents²⁰, and the latter interactions are assumed to be negative from the infants’ viewpoint²³. Moreover, not only do infants infer dominance hierarchies (the strong and the weak) from body size²⁴, social interactions²⁵ and relative height²⁶, but they can also discriminate the aggressor from the victim in hitting interactions²¹. More importantly, infants show aversion to the aggressor²¹, approve of agents who disturb (that is, act negatively towards) the aggressor and assume that the aggressor should be hit by other agents²². On the basis of current evidence, these types of actions might function as punitive behaviour, and infants might regard the interaction as worth intervening in.

This study aimed to reveal the developmental origins of third-party punishment in early infancy and determine whether and how preverbal infants punish antisocial agents who have not harmed them directly. We developed a participatory cognitive paradigm by adopting gaze-contingency techniques^27,28,29, in which infants can use their gaze to affect agents displayed on a monitor. In this paradigm, fixation on an agent triggers the event of a stone crushing the agent. Prior research that used the same hitting interaction employed in the current study has demonstrated that infants over six months old regard this interaction as negative^20,21,22,23 and that eight-month-olds can act on objects on a monitor by their gaze^28,29. We therefore chose eight-month-olds as participants in this study. We familiarized infants with a gaze-contingent association between looking at one of two objects or agents and a subsequent punitive event (for example, stones falling and crushing one of the objects or agents; Fig. 1a, Experiment 1). We then compared their tendency to look at each agent before and after the aggressive interaction between agents (Fig. 1b and Supplementary Video 1). If, as a third party, infants are disposed to punish a transgressor, they will increase their selective gaze at the aggressor after watching an aggressive interaction.

**Fig. 1: Schema of Experiments 1 to 4.**

Results

In Experiment 1, 24 eight-month-old infants were familiarized with gaze-contingent events. When the infants fixated on a single object (for example, a red sphere or a blue sphere) or either of two objects presented side by side (for example, red and blue spheres), the contingent event (for example, a square stone falling and crushing the object) occurred in the practical phase. Subsequently, the infants experienced ten identical gaze-contingent events, except that the target objects were two geometric agents with eyes (for example, green and orange geometric shapes) (pretest; Fig. 1b). After watching an aggressive interaction between the geometric agents (one was the aggressor, and the other was the victim), the infants again experienced ten gaze-contingent events identical to the pretest (posttest; Fig. 1b). If the infants sought to punish the transgressor, it is likely that they would increase their selective looks at the aggressor in the posttest phase.

We conducted a generalized linear mixed model (GLMM) analysis with a binomial error structure and a logit link function to assess whether watching the aggressive interaction influenced selective looks in the posttest phase. The response variable was infant selective looks at the aggressor (= 1) or the victim (= 0) in the pretest or posttest. The explanatory variables included test type (pretest or posttest) and trial number. We compared models on the basis of the Bayes factor (BF). The model candidates were (1) the null model, (2) a model with the main effect of test type, (3) a model with the main effect of trial number, (4) a model with the main effects of test type and trial number, and (5) a model with the main effect of test type, the main effect of trial number and the interaction between test type and trial number. All models were compared with the null model, and we computed the BF (BF₁₀)—namely, the relative evidence in favour of each model over the null model. We assumed that the prior model probability was uniform, and we evaluated the degree to which the data had changed the prior model odds for each model. We also computed the inclusion BF (ref. ³⁰) (BF_incl) for each effect to evaluate how probable the data were under models that included the effect compared with models that excluded the effect. To report BF₁₀ and BF_incl, we set the Cauchy distribution with location 0 and scale 1/√2 as a prior distribution for a coefficient parameter³¹. BFs are sensitive to the prior distribution for model parameters. It is therefore important to check whether the inferences from the data are robust to different prior specifications. We conducted a sensitivity analysis for BF_incl, following recommendations made by previous studies regarding Bayesian analysis^32,33.
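
The relationship between BF₁₀, posterior model probabilities and BF_incl described above can be sketched in a few lines of Python. This is an illustration only, not the authors' analysis code: the function names are hypothetical, and the inclusion BF is assumed to be computed across all five candidate models as posterior inclusion odds divided by prior inclusion odds. The worked example uses only the values reported for Experiment 1 in this article (P(M|data) = 0.590 and BF₁₀ = 2.473 for the test-type model).

```python
def posterior_model_probs(bf10):
    """Posterior model probabilities from BF10 values.

    Assumes a uniform prior over the candidate models, so each posterior
    probability is the model's BF10 divided by the sum of all BF10 values
    (the null model itself has BF10 = 1).
    """
    total = sum(bf10.values())
    return {name: bf / total for name, bf in bf10.items()}


def inclusion_bf(post_incl, prior_incl):
    """Inclusion BF: posterior inclusion odds over prior inclusion odds.

    post_incl and prior_incl are the summed posterior and prior probabilities
    of all models that contain the effect (assumed definition).
    """
    posterior_odds = post_incl / (1.0 - post_incl)
    prior_odds = prior_incl / (1.0 - prior_incl)
    return posterior_odds / prior_odds


# Values reported for Experiment 1 (model with the main effect of test type).
p_best = 0.590
bf10_best = 2.473

# With a uniform model prior, P(M | data) / P(null | data) equals BF10,
# so the posterior probability of the null model can be recovered.
p_null = p_best / bf10_best

# Trial number appears in candidate models (3), (4) and (5), that is, in
# every model other than the null model and the test-type-only model.
p_trial_incl = 1.0 - p_best - p_null
bf_incl_trial = inclusion_bf(p_trial_incl, prior_incl=3 / 5)

print(f"P(null | data) = {p_null:.3f}")
print(f"BF_incl for trial = {bf_incl_trial:.3f}")
```

Under these assumptions, the recovered inclusion BF for trial lands close to the value of 0.139 reported for Experiment 1, with the small gap attributable to rounding in the published figures.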

According to Lee and Wagenmakers³⁴, a BF of 1–3 is ‘anecdotal evidence’ or ‘can be considered’, 3–10 is ‘moderate evidence’, 10–30 is ‘strong evidence’ and 30–100 is ‘very strong evidence’ for the alternative hypothesis or model. In contrast, a BF of 1/3–1 is ‘anecdotal evidence’, 1/10–1/3 is ‘moderate evidence’, 1/30–1/10 is ‘strong evidence’ and 1/100–1/30 is ‘very strong evidence’ for the null hypothesis or model. A BF of 1 is ‘no evidence’ in favour of either the alternative hypothesis (model) or the null hypothesis (model).
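
The verbal categories above amount to a simple lookup, sketched below. The function name `classify_bf` is hypothetical, and two choices are assumptions rather than part of the quoted scheme: shared boundary values (3, 10 and 30) are assigned to the stronger category, and values beyond 100 (or below 1/100) are flagged as outside the quoted range.

```python
def classify_bf(bf):
    """Map a Bayes factor to (favoured hypothesis, verbal evidence label).

    Follows the ranges quoted in the text. A BF below 1 is inverted so the
    same thresholds can be reused for evidence in favour of the null.
    """
    if bf <= 0:
        raise ValueError("A Bayes factor must be positive.")
    if bf == 1:
        return ("neither", "no evidence")

    favoured = "alternative" if bf > 1 else "null"
    strength = bf if bf > 1 else 1.0 / bf

    if strength < 3:
        label = "anecdotal"
    elif strength < 10:
        label = "moderate"
    elif strength < 30:
        label = "strong"
    elif strength <= 100:
        label = "very strong"
    else:
        # Assumption: the text quotes no label beyond 100 (or below 1/100).
        label = "beyond the quoted range"
    return (favoured, label)


# Bayes factors reported in this article.
for value in (2.473, 24.362, 0.139, 0.032):
    print(value, classify_bf(value))
```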

The model comparison results demonstrated that the data were best represented by the model with the main effect of test type (Table 1). The posterior model probability of the model with the main effect of test type was the largest in the candidate models (P(M|data) = 0.590). The BF₁₀ was 2.473, which indicated anecdotal evidence in favour of this model compared with the null model. Table 2 shows the inclusion probability and BF_incl for each effect. On average, the data anecdotally supported the model including the main effect of test type (BF_incl = 1.748) and moderately supported the model excluding the main effect of trial (BF_incl = 0.139) and the interaction term (BF_incl = 0.161). The results of the sensitivity analysis (Fig. 2) robustly supported the model including the main effect of test type against reasonable change in the Cauchy prior width for the effect size, although the evidence was anecdotal. However, the model excluding the main effect of trial and the interaction term was more likely to be supported as the prior width became large. Note that when the Cauchy prior width is zero, the BF equals 1—irrespective of the data. Infants’ selective looks at the aggressor increased in the posttest phase compared with the pretest for the best model. The effect of test type relative to the pretest had a 0.988 probability of being positive (test: posterior median, 0.742; 95% credible interval (CI), (0.102, 1.431); odds ratio (OR), 2.101; Supplementary Table 1). In summary, we found that eight-month-olds were more likely to look selectively towards the aggressor in the posttest than in the pretest; however, this result was inconclusive, as the evidence was anecdotal (Fig. 3a).
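
Because the model uses a logit link, the reported odds ratio is the exponential of the coefficient on the logit scale. The sketch below checks that relationship for the posterior median of 0.742 and, as a purely hypothetical illustration, converts it to a posttest probability under the assumption that pretest looks were exactly at chance (0.5). The baseline is not an estimate from the study, and the function names are invented for this example.

```python
import math


def odds_ratio(logit_coef):
    """Odds ratio implied by a coefficient on the logit scale."""
    return math.exp(logit_coef)


def posttest_probability(pretest_prob, logit_coef):
    """Probability after multiplying the baseline odds by the odds ratio."""
    baseline_odds = pretest_prob / (1.0 - pretest_prob)
    new_odds = baseline_odds * odds_ratio(logit_coef)
    return new_odds / (1.0 + new_odds)


# Posterior median of the test-type effect reported for Experiment 1.
test_effect = 0.742
print(f"Odds ratio: {odds_ratio(test_effect):.3f}")

# Hypothetical baseline: pretest looks at the aggressor at chance level.
print(f"Posttest probability: {posttest_probability(0.5, test_effect):.3f}")
```

An odds ratio of about 2.1 means the odds of looking at the aggressor roughly doubled from pretest to posttest; how large the corresponding change in probability is depends on the baseline.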

Table 1 Results of model comparison from Experiments 1 to 5

Table 2 Inclusion probability and BF_incl for each effect in Experiments 1 to 5

**Fig. 2: Results of the sensitivity analysis for BF_incl for each effect across experiments (Experiments 1 to 5).**

**Fig. 3: Results from Experiments 1 to 5.**

We subsequently considered three alternative parsimonious interpretations of selective looks at the aggressor before concluding that looking behaviours involved decision-making regarding punishment. First, the increase in infant selective looks at the aggressor could be due to mere visual preference for said aggressor (for example, preference for a causer of action). To exclude this possibility, in Experiment 2, we tested another group of infants (N = 24) who experienced aggressive interactions identical to those in Experiment 1 but with less negative gaze-contingent events in the pretest and posttest phases. Specifically, materials fell onto an object or agent more softly than in Experiment 1 (Fig. 1a, Experiment 2). If selective looks were driven by preference for the aggressor after watching aggressive interactions, infants would more likely selectively look at the aggressor at posttest even though the gaze-contingent event is less negative. However, if selective looks at the aggressor involved a sense of punishment, then infants would not selectively look at the aggressor at posttest because they have no means to punish the agent. In support of this latter prediction, the model comparison demonstrated that the data were best represented by the null model (Table 1). The posterior model probability of the null model was the largest in the candidate models (P(M|data) = 0.651). The BF₁₀ was 1.000 since the null model was compared with itself. On average, the data moderately supported the model excluding the main effects of test type (BF_incl = 0.190) and trial type (BF_incl = 0.136), and very strongly supported the model excluding the interaction term (BF_incl = 0.032) (Table 2). The results of the sensitivity analysis (Fig. 2) robustly supported the model excluding the two main effects and the interaction term against reasonable change in the Cauchy prior width for the effect size. 
The model excluding each effect was more likely to be supported as the prior width became large. In the null model, the proportion of an infant’s selective looks at the aggressor was not different from that at the chance level (intercept: posterior median, 0.067; 95% CI, (−0.242, 0.381); OR = 1.070; Supplementary Table 2). In summary, the data moderately supported the idea that eight-month-olds did not change the proportion of selective looks towards the aggressor between the pretest and the posttest (Fig. 3b). We therefore excluded the alternative parsimonious explanation that the increase in infant selective looks at the aggressor was due to a mere visual preference for the aggressor rather than a selective choice for punishment.

A second possible explanation for the increase in infant selective looks at the aggressor in Experiment 1 is a mere expectation that the aggressor would be punished³⁵ as opposed to a sense that punitive action is the consequence of the infants’ intentions. To understand this, in Experiment 3, we decreased the strength of the gaze-contingent association. Specifically, we changed the reinforcement probability between looking at a specific agent and a subsequent punitive event from 100% (Experiment 1) to 50% (chance level) (Fig. 1a, Experiment 3). If infants looked at the aggressive agent because of a mere expectation that the agent would be punished, they would selectively look at the agent at posttest even without a sense of self-agency. However, if infants looked at the aggressive agent due to a sense that the punitive action is a consequence of their intentions (in other words, an understanding of their own causal efficacy), they would not selectively look at the aggressive agent at posttest when they lacked a sense of self-agency. Consistent with this latter prediction, the model comparison demonstrated that the data were best represented by the null model (Table 1). The posterior model probability of the null model was the largest in the candidate models (P(M|data) = 0.705). The BF₁₀ was 1.000 since the null model was being compared with itself. On average, the data moderately supported the model excluding the main effect of test type (BF_incl = 0.167), strongly supported the model excluding the main effect of trial type (BF_incl = 0.095) and very strongly supported the model excluding the interaction term (BF_incl = 0.022) (Table 2). The results of the sensitivity analysis (Fig. 2) robustly supported the model excluding the two main effects and the interaction term against reasonable change in the Cauchy prior width for the effect size. The model excluding each effect was more likely to be supported as the prior width became large. 
In the null model, the proportion of an infant’s selective looks at the aggressor was not different from that at the chance level (intercept: posterior median, 0.020; 95% CI, (−0.201, 0.238); OR = 1.020; Supplementary Table 3). In summary, the data moderately supported the idea that eight-month-olds did not change the proportion of selective looks towards the aggressor between the pretest and the posttest (Fig. 3c). We therefore excluded the alternative parsimonious explanation that the increase in selective looks at the aggressor was due to a mere expectation that the agent would be punished.

A previous study proposed that infants may consider collisions between geometric figures to be merely negative physical events rather than social interactions²³. If this was the case in the present study, infants may have regarded geometric agents as the cause of a negative physical event rather than as aggressors. In Experiment 4, we tested this possibility by recruiting additional eight-month-old infants (N = 24), who were familiarized with the same gaze-contingent events as in Experiment 1 but watched a modified version of the aggressive interactions. We tested these infants using geometric figures with perceivable ‘animacy or agency’ removed by eliminating their eyes, ability to self-propel and distortion upon contact (Fig. 1a, Experiment 4). If selective looks were driven by infant perception of a geometric figure causing an unpleasant physical event in Experiment 1, then infants would probably selectively look at the causer of physical collisions at posttest. However, if selective looks were driven by infant perception of an aggressive interaction (that is, infants want to punish the agents in Experiment 1), they would not selectively look at the causer of physical collisions at posttest. Consistent with this latter prediction, model comparison demonstrated that the data were best represented by the null model (Table 1). The posterior model probability of the null model was the largest in the candidate models (P(M|data) = 0.544). The BF₁₀ was 1.000 as the null model was being compared with itself. On average, the data anecdotally supported the model excluding the main effect of test type (BF_incl = 0.404), moderately supported the model excluding the main effect of trial type (BF_incl = 0.109) and strongly supported the model excluding the interaction term (BF_incl = 0.062) (Table 2). The sensitivity analysis results (Fig. 2) robustly supported the model excluding the two main effects and the interaction term against reasonable change in the Cauchy prior width for the effect size.
The exclusion of each effect was more likely to be supported as the prior width became large; however, the strength of the evidence for excluding the main effect of test type was anecdotal when the prior width was relatively small. In the null model, the proportion of an infant’s selective looks at the causer was not different from that at the chance level (intercept: posterior median, −0.012; 95% CI, (−0.447, 0.427); OR = 0.988; Supplementary Table 4). In summary, the data anecdotally supported the idea that eight-month-olds did not change the proportion of selective looks towards the causer between the pretest and the posttest (Fig. 3d). We thus excluded the non-social explanation that the Experiment 1 results were due to perceiving geometric figures as causing a negative physical event rather than as aggressors.

Finally, we performed Experiment 5 to replicate Experiment 1 for the following reasons. First, the evidence in Experiment 1 was too weak to be conclusive, as we used a new experimental paradigm. Second, there is increasing concern over the lack of replication in psychology research³⁶. We therefore tested another infant group (N = 24) with identical procedures and the same sample size used in Experiment 1. The model comparison results demonstrated that the data were best represented by the model with the main effect of test type (Table 1). The posterior model probability of the model with the main effect of test type was the largest in the candidate models (P(M|data) = 0.795). The BF₁₀ was 24.362, indicating strong evidence in favour of this model compared with the null model. On average, the data strongly supported the model including the main effect of test type (BF_incl = 16.179) and moderately supported the model excluding the main effect of trial (BF_incl = 0.139) and the interaction term (BF_incl = 0.188). The sensitivity analysis results (Fig. 2) robustly supported the model including the main effect of test type in a wide range of the Cauchy prior on the effect size. However, the model excluding the main effect of trial and the interaction term was more likely to be supported as the prior width became large. In the best model, infants’ selective looks at the aggressor increased during the posttest phase compared with the pretest. The effect of test type relative to the pretest had a 0.999 probability of being positive (test: posterior median, 0.870; 95% CI, (0.362, 1.424); OR = 2.387; Supplementary Table 5). In summary, the data strongly supported the idea that eight-month-olds increased the proportion of selective looks towards the aggressor in the posttest compared with the pretest (Fig. 3e). This result indicates the potential of the findings to reflect robust psychological phenomena in early infancy.

The analyses reported above demonstrate that compared with the pretest phase, infants increased selective looking at an aggressor at the posttest phase in Experiment 1 and Experiment 5, but not in the other experiments. However, employing a contrast between the pretest and posttest phases for each experiment did not necessarily elucidate the differences in effect sizes of test type between the experiments³⁷. Therefore, to compare the effect size of the test type for each experiment, we combined all experiment data and estimated the interaction effects between test type (pretest or posttest) and experiment (Experiment 1, Experiment 2, Experiment 3, Experiment 4 or Experiment 5) by using GLMM.

We conducted comparisons of the effect size of test type for each experiment (Supplementary Fig. 1). We calculated the effect size difference of test type between experiments from estimates of the interaction between the test type and the experiment. We checked whether the 95% CIs of the effect size difference excluded zero. The effect size of test type in Experiment 1 was larger than in Experiment 3, and the 95% CI of the effect size difference excluded zero (Exp.1 − Exp.3: posterior median, 0.756; 95% CI, (0.030, 1.498)). However, the 95% CIs of the effect size difference included zero when we compared the test type effect in Experiment 1 with that in Experiments 2, 4 and 5 (Exp.1 − Exp.2: posterior median, 0.581; 95% CI, (−0.157, 1.307); Exp.1 − Exp.4: posterior median, 0.285; 95% CI, (−0.453, 1.023); Exp.1 − Exp.5: posterior median, −0.165; 95% CI, (−0.915, 0.568); see also Supplementary Table 6). The effect size of test type in Experiment 5 was larger than that in Experiments 2 and 3, and the 95% CIs of the effect size difference did not include zero (Exp.5 − Exp.2: posterior median, 0.744; 95% CI, (0.010, 1.488); Exp.5 − Exp.3: posterior median, 0.920; 95% CI, (0.196, 1.656)). However, the 95% CI of the effect size difference included zero when we compared the test type effect in Experiment 5 with that in Experiment 4 (Exp.5 − Exp.4: posterior median, 0.449; 95% CI, (−0.294, 1.198); see also Supplementary Table 6). The 95% CIs of the effect size difference included zero for the pairs in Experiments 2, 3 and 4 (Exp.4 − Exp.2: posterior median, 0.293; 95% CI, (−0.436, 1.037); Exp.4 − Exp.3: posterior median, 0.468; 95% CI, (−0.253, 1.211); Exp.3 − Exp.2: posterior median, −0.175; 95% CI, (−0.904, 0.549); see also Supplementary Table 6). In the above model, the increase in selective looks at an aggressor after the movie phase was larger in Experiments 1 and 5 than in Experiments 2 and 3. 
However, the increase in selective looks after the movie phase in Experiment 4 was not different from the increase in the main experiments (Experiments 1 and 5) or the other control experiments (Experiments 2 and 3).
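
The pairwise comparisons above apply one rule: a difference is treated as credible when its 95% CI excludes zero. The sketch below tabulates the posterior medians and intervals reported in this section and applies that rule. The data structure and function name are illustrative, not taken from the authors' analysis code.

```python
# (contrast, posterior median, lower 95% CI bound, upper 95% CI bound),
# copied from the effect-size differences reported in the text.
CONTRASTS = [
    ("Exp.1 - Exp.3", 0.756, 0.030, 1.498),
    ("Exp.1 - Exp.2", 0.581, -0.157, 1.307),
    ("Exp.1 - Exp.4", 0.285, -0.453, 1.023),
    ("Exp.1 - Exp.5", -0.165, -0.915, 0.568),
    ("Exp.5 - Exp.2", 0.744, 0.010, 1.488),
    ("Exp.5 - Exp.3", 0.920, 0.196, 1.656),
    ("Exp.5 - Exp.4", 0.449, -0.294, 1.198),
    ("Exp.4 - Exp.2", 0.293, -0.436, 1.037),
    ("Exp.4 - Exp.3", 0.468, -0.253, 1.211),
    ("Exp.3 - Exp.2", -0.175, -0.904, 0.549),
]


def interval_excludes_zero(lower, upper):
    """True when the interval lies entirely above or entirely below zero."""
    return lower > 0 or upper < 0


credible = [
    name
    for name, _median, lower, upper in CONTRASTS
    if interval_excludes_zero(lower, upper)
]
print(credible)
```

Only the contrasts of the main experiments against Experiments 2 and 3 survive this check (with Exp.1 − Exp.2 being the exception), which matches the summary given in the text.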

Discussion

We investigated a disposition for third-party punishment of antisocial others in early infancy. After watching an aggressive interaction, infants as young as eight months old selectively looked at the aggressor more often with the apparent intent to punish (Experiment 1). Three control experiments excluded alternative parsimonious interpretations of these increases in selective looks at the aggressor: mere preferential looking at agents (Experiment 2), mere expectation that the agent would be punished (Experiment 3) and perceiving collisions as a negative physical event rather than aggression (Experiment 4). Finally, we replicated Experiment 1 to confirm that our findings indicated robust psychological phenomena (Experiment 5). Importantly, between-experiment differences were not attributable to variation in attention in the movie phase, as the Bayesian one-way analysis of variance results moderately supported the idea that there was no difference in looking time during the movie phase between the experiments (BF₁₀ = 0.23; Supplementary Table 7). In addition, we found that in the main experiments (Experiment 1 and Experiment 5), selective looks at the aggressor after the movie phase tended to increase compared with the control experiments except for Experiment 4. Overall, infants as young as eight months old seem to punish antisocial others in third-party contexts by using their gaze, indicating that third-party punishment emerges much earlier than previously thought^11,14,15,16,17,18,19.

Although many developmental studies have revealed that infants can evaluate the moral actions of others^11,21,22,38, preverbal infants’ moral behaviour towards others has not been previously investigated. Our findings draw a connection between moral evaluation and moral behaviour among preverbal infants, bringing us closer to elucidating morality in early ontogeny. Furthermore, our findings imply that the primary motivations of punishment are probably intrinsic, rather than extrinsic results of cultural learning⁹ or higher-order desires to attain benefits for the self (for example, enhancing one’s reputation)¹⁰. This outcome might provide crucial evidence for ongoing debates regarding the motivations and evolved propensity underlying third-party punishment. The tendency towards third-party punishment may be engrained in preverbal infants’ minds and may have evolved only in humans.

One might doubt that the selective looks of infants reflect decision-making regarding punishment. Gaze-contingent techniques have been broadly used to investigate decision-making in patients with impaired limbs, such as those with amyotrophic lateral sclerosis³⁹. However, similarity in the underlying mechanism of gaze control between infants and these patients is not evident. Nonetheless, previous research using gaze-contingency techniques demonstrated that infants of the same age showed gaze behaviours for intentional control on the monitor^27,29. Furthermore, the three control experiments implied that selective looking behaviour involves punishment-related decisions; infants increased their selective looks at the specific agent (that is, aggressor) only when their gaze was associated with a negative event (that is, punishment; Experiment 2) that consistently occurred (that is, 100% reinforcement; Experiment 3) and when the event provided social information about the agents (that is, who was the aggressor or victim; Experiment 4). In other words, infants changed their behaviour to accomplish their goal only when they perceived the means to punish, had a sense of self-agency for punitive behaviour and were in a situation that called for punishment. They did not change their behaviour if any of these three elements were lacking. Consequently, infant looking behaviours were probably decisions made with the intention to punish.

A point to note is whether the gaze–action association learned during the pretest phase is preserved until the posttest phase even though the movie phase is inserted between the tests. During the movie phase, no contingent events occurred when infants gazed at an agent. It is thus possible that the gaze–action association was not preserved until the posttest phase. However, the increase in selective looks after the movie phase differed between the experiments in which infants could learn the association (Experiments 1 and 5) and the experiment in which they could not (Experiment 3). In addition, if infants were motivated to punish the aggressor but the learned association had not been maintained at the beginning of the posttest phase, the punishment rate would be at the chance level at the beginning of the posttest phase and would increase as the posttest trials progressed. However, the observed data moderately supported the model excluding the interaction between test type and trial as well as the main effect of trial in Experiments 1 and 5, suggesting that the punishment rate for the aggressor in the posttest phase remained unchanged across trials. We can therefore assume that the association between gaze and contingent event was preserved until the posttest phase.

There are some limitations worth noting. First, infants might think that the victim received a squeeze and thus the other actor should be squeezed as well; previous studies have indicated that infants expect equal treatment of others^40,41. However, previous studies demonstrated that infants showed aversiveness to an agent who hit another agent²¹, affirmed an agent who disturbed the aggressor, and assumed that the aggressor should be hit by other agents²². It therefore seems plausible that infants regarded an agent who hit another agent as negative, thus expecting the aggressor to be punished, and consequently punishing the aggressor with their gaze. However, it may be slightly theory-laden to assert the psychological process of this punitive behaviour. Future studies are needed to identify associated underlying mechanisms. For example, because the aggressive interactions in this study involved multiple behaviours (for example, following the agent around and bumping), explorations on what exactly infants pick up as the critical cue or whether they need to see multiple cues to view interactions between agents as truly aggressive would be valuable.

Second, although our data supported the idea that infants did not change their selective looks between the pretest and posttest in Experiment 4, the evidence for this was weak. This is consistent with the results comparing the effect size of test type between Experiment 4 and the main experiments. These results might be due to individual differences in animacy perception for the objects in Experiment 4. Although we removed the aspect of perceivable ‘animacy or agency’ in Experiment 4 on the basis of a previous study²², the objects seemed to move autonomously to some extent, and thus some infants might have perceived the objects as animate beings or agents. Finally, although infants showed intentional use of gaze for their decision-making in our study, we do not conclusively know whether the infants were aware that they punished the agent with their gaze. In other words, it is unclear whether the infants looked at the agent with a consciousness of punishment. A previous study proposed a multi-level framework in which self-agency is based on complex mechanisms on several levels, ranging from implicit to explicit⁴². It would be interesting to examine the levels of self-agency involved in the punishment behaviours in the current study.

The presented paradigm in which infants can exhibit decision-making in a social context on a monitor might enable new infant cognitive research. Largely owing to limited methodologies as well as immature motor and verbal abilities in infants, most previous studies on infant cognition examined their perception and understanding of events from the viewpoint of a third party—that is, passive responses to physical⁴³ and social^24,25,26 events. In contrast, recent research using the gaze-contingent technique has revealed active infant responses to contingent events^27,28,29. We incorporated such techniques to investigate behaviour accompanying decision-making regarding others and determined that we can measure infants’ moral behaviour towards others. The application of this paradigm could reveal undiscovered cognitive abilities in preverbal infants.

Methods

This study was approved by Otsuma Women’s University’s Life Sciences Research Ethics Committee (no. 28-015) and the Behavioral Research Ethics Committee of the Osaka University School of Human Sciences (no. HB020-032).

Experiment 1

Participants

The participants were 24 full-term eight-month-old infants (12 boys and 12 girls; mean age, 8 months 13 days; range, 7 months 13 days to 9 months 27 days). The sample size was determined on the basis of prior infant morality studies^11,21,22,38. Eleven additional infants were tested but excluded owing to distress or fussiness (N = 4), or side-looking bias (N = 7, left = 7, right = 0; see the details of the criteria below). The parents provided written informed consent before the experiment and were financially compensated for participation.

Apparatus and stimuli

Infant gaze movements were measured using a Tobii TX300 near-infrared eye tracker (Tobii Technology), integrated with a 23-inch computer display (1,280 × 720 pixels). The sampling rate was 120 Hz. The tasks were programmed in Visual Basic 2015 Express (Microsoft Corp.) with the Tobii SDK (Tobii Technology). In all tasks, when an eye gaze was detected at a point on the display, a translucent red circle with a radius of 25 pixels appeared at that point (Fig. 1a) to facilitate gaze control²⁹. While a contingent event was playing, however, the red circle disappeared so that the infants could focus on the event. The display background was aqua in colour.

The participants’ faces were monitored and recorded with a video camera (Panasonic HC-WX990M). Images on the PC screen (presented to the participants) and images of the participants were synthesized (Picture in Picture) using a video mixer device (Roland, V-1600HD) and recorded on a laptop PC (HP, Elite Book 8570w/CT) with a monitor-capturing device (Avermedia, AVT-C875).

In the practical phase, the first six trials subjected the infants to gaze-contingent events in which fixation on a single object (a red or blue circle positioned alternately on the left or right) for 500 ms resulted in a stone falling and crushing the object. This phase was set to reduce side-looking bias. In four subsequent trials, the infants were presented with two objects side by side (a red circle and a blue circle) instead of a single object. When the infants fixated on either of the two objects for 500 ms, a stone fell and crushed it. The presented position of each object or pair of objects was fixed among the participants.
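The gaze-contingent trigger described above can be pictured as a dwell-time check on the stream of gaze samples. The following Python sketch is illustrative only and is not the authors' Visual Basic and Tobii SDK implementation. The function names, the rectangular target regions, the pixel coordinates and the requirement that the 500 ms of fixation be uninterrupted are all assumptions; only the 500 ms criterion and the 120 Hz sampling rate come from the text.

```python
from typing import Dict, Iterable, Optional, Tuple

SAMPLE_RATE_HZ = 120  # sampling rate reported in the text
DWELL_MS = 500        # fixation duration that triggers the falling stone

Point = Tuple[float, float]
Rect = Tuple[float, float, float, float]  # (left, top, right, bottom), pixels


def hit_target(x: float, y: float, targets: Dict[str, Rect]) -> Optional[str]:
    """Return the name of the target region containing (x, y), if any."""
    for name, (left, top, right, bottom) in targets.items():
        if left <= x <= right and top <= y <= bottom:
            return name
    return None


def first_dwell_trigger(
    samples: Iterable[Optional[Point]],
    targets: Dict[str, Rect],
    sample_rate_hz: int = SAMPLE_RATE_HZ,
    dwell_ms: int = DWELL_MS,
) -> Optional[str]:
    """Return the first target fixated for `dwell_ms` without interruption.

    `samples` holds one gaze point per tracker sample; None marks a lost
    sample. Assumption: any off-target or lost sample resets the dwell count.
    """
    needed = round(dwell_ms * sample_rate_hz / 1000)  # 60 samples at 120 Hz
    current: Optional[str] = None
    run = 0
    for point in samples:
        hit = hit_target(point[0], point[1], targets) if point is not None else None
        if hit is not None and hit == current:
            run += 1
        else:
            current = hit
            run = 1 if hit is not None else 0
        if current is not None and run >= needed:
            return current  # this object would be crushed by the stone
    return None


# Hypothetical layout on the 1,280 x 720 display (coordinates are invented).
TARGETS: Dict[str, Rect] = {
    "left": (200.0, 260.0, 440.0, 460.0),
    "right": (840.0, 260.0, 1080.0, 460.0),
}
```

Under these assumptions, a gaze that rests on the left object for 59 samples, is lost for one sample and then rests on the right object for 60 samples triggers the event for the right object, because the interruption resets the count.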

In the following pretest, the infants experienced gaze-contingent events identical to those in the practical phase except that the targets were two geometric agents with eyes (for example, green and orange squares; pretest in Fig. 1a). The presented position of the geometric agents (left or right) was counterbalanced across participants but consistent between the pretest and posttest within participants.

In the movie phase, the infants were presented with an aggressive interaction animation (20 s in duration) depicting one geometric figure hitting and crashing into another geometric figure^20,21,22 (Fig. 1b and Supplementary Video 2). The roles of the geometric figures (aggressor or victim) were counterbalanced between participants. Following the movie phase, the infants completed the posttest phase with gaze-contingent events identical to those of the pretest.

Procedure

The infants were fastened in a baby carrier to prevent them from standing up and were placed on their mothers’ laps approximately 60 cm from the monitor. Nine-point calibration was used. The parents were instructed not to watch the monitor and not to talk or interact with their children during the experiment.

The infants experienced ten gaze-contingent events in the practical phase. Then, the infants experienced ten gaze-contingent events in the pretest. In the movie phase, the infants were presented with animated movies of aggressive interactions three times. Finally, the infants experienced ten gaze-contingent events in the posttest. Attractive animated clips (a rotating oval checkerboard) with sound were inserted between trials if infants did not pay attention to the monitor.

Data analysis

We excluded data from further analysis if infants showed a side-looking bias, which was defined as looking to one side in more than 12 of the 14 gaze-contingent events (the last four trials of the practical phase and the ten trials of the pretest) (Bayesian binomial test, two-tailed, BF₁₀ = 8.11, moderate evidence in favour of the alternative hypothesis; traditionally, the binomial test gives a P value below 0.05). To compare the proportion of infant selective looks at agents between pretest and posttest, we used GLMMs with a binomial error structure and a logit link function. The response variable was infant selective looks at the aggressor (= 1) or the victim (= 0) in the pretest or posttest. The explanatory variables (fixed effects) were test type (pretest or posttest) and trial number. We set participant identity as a random intercept. To keep the random effects structure “maximal”⁴⁴, we also included all possible random slopes within participants and correlations.
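For readers who want to check the frequentist side of the side-looking criterion, the exact two-tailed binomial P value for a given number of same-side looks out of 14 can be computed directly. This Python sketch is an illustration, not the authors' analysis code: the function name is hypothetical, the chance level of 0.5 is an assumption, and the sketch does not attempt to reproduce the reported Bayes factor, which depends on a prior that is not specified in this section.

```python
from math import comb


def binomial_two_tailed_p(k: int, n: int) -> float:
    """Exact two-tailed binomial P value for k same-side looks out of n.

    Assumes a chance level of 0.5, so the null distribution is symmetric
    and the two-tailed value is twice the more extreme tail (capped at 1).
    """
    extreme = max(k, n - k)
    upper_tail = sum(comb(n, i) for i in range(extreme, n + 1)) / 2 ** n
    return min(1.0, 2 * upper_tail)


if __name__ == "__main__":
    # 14 events: last four practical-phase trials plus ten pretest trials.
    for same_side in (11, 12, 13, 14):
        print(same_side, round(binomial_two_tailed_p(same_side, 14), 4))
```

With 14 events, 13 same-side looks give P ≈ 0.0018 and 12 give P ≈ 0.0129, whereas 11 give P ≈ 0.0574, which is above the conventional 0.05 threshold.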

We compared models on the basis of the BF. The model candidates were (1) the null model, (2) a model with the main effect of test type, (3) a model with the main effect of trial number, (4) a model with the main effects of test type and trial number, and (5) a model with the main effect of test type, the main effect of trial number and the interaction between test type and trial number. All models were compared with the null model, and we computed the BF (BF₁₀), which quantifies the relative evidence in favour of each model over the null model (Table 1). We assumed that the prior model probability was uniform and evaluated the degree to which the data had changed the prior model odds for each model. We also computed BF_incl (ref. ³⁰) for each effect to evaluate how much more likely the data were under models that included the effect than under models that excluded it (Table 2). BF_incl was computed on the basis of inclusion probabilities (that is, the sum of the model probabilities for the models that included the effect) across all models. For reporting BF₁₀ and BF_incl, we set the Cauchy distribution with location 0 and scale 1/√2 as a prior distribution for a coefficient parameter³¹. We also set the default prior (a t distribution with degrees of freedom 3 and scale 2.5) of brms as the prior distribution of an intercept and the standard deviation of random effects. To check whether the main conclusions from the data were robust to different priors, we conducted a sensitivity analysis for BF_incl (Fig. 2). In this analysis, we computed BF_incl for each effect while varying the scale parameter of the Cauchy prior for the effect size from 0.05 to 1.5 in increments of 0.05.
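The step from per-model BF₁₀ values to BF_incl can be sketched as follows. This is a minimal Python illustration of the inclusion-probability calculation under a uniform prior over the five candidate models, not the authors' R code. The function names, the model labels and especially the BF₁₀ values are hypothetical placeholders, not results from the paper.

```python
from typing import Dict, FrozenSet

# Effects contained in each of the five candidate models described above.
MODEL_TERMS: Dict[str, FrozenSet[str]] = {
    "null": frozenset(),
    "test": frozenset({"test_type"}),
    "trial": frozenset({"trial"}),
    "test+trial": frozenset({"test_type", "trial"}),
    "full": frozenset({"test_type", "trial", "interaction"}),
}


def posterior_model_probs(bf10: Dict[str, float]) -> Dict[str, float]:
    """With a uniform model prior, posterior probability is proportional
    to each model's BF10 against the null (the null itself has BF10 = 1)."""
    total = sum(bf10.values())
    return {model: bf / total for model, bf in bf10.items()}


def inclusion_bf(bf10: Dict[str, float], effect: str) -> float:
    """Posterior inclusion odds divided by prior inclusion odds."""
    post = posterior_model_probs(bf10)
    prior_incl = sum(effect in MODEL_TERMS[m] for m in bf10) / len(bf10)
    post_incl = sum(p for m, p in post.items() if effect in MODEL_TERMS[m])
    prior_odds = prior_incl / (1 - prior_incl)
    post_odds = post_incl / (1 - post_incl)
    return post_odds / prior_odds


# HYPOTHETICAL BF10 values, chosen only to make the arithmetic easy to follow.
EXAMPLE_BF10 = {"null": 1.0, "test": 8.0, "trial": 0.5, "test+trial": 4.0, "full": 1.5}

# Grid of Cauchy prior scales used in the sensitivity analysis (30 values).
CAUCHY_SCALES = [round(0.05 * i, 2) for i in range(1, 31)]

if __name__ == "__main__":
    for effect in ("test_type", "trial", "interaction"):
        print(effect, round(inclusion_bf(EXAMPLE_BF10, effect), 3))
```

In a sensitivity analysis of the kind described, the BF₁₀ values would be recomputed for every entry of `CAUCHY_SCALES` and passed through `inclusion_bf` again; that refitting step is outside the scope of this sketch.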

We estimated the posterior distributions of the model parameters and checked the posterior predictive distribution for an infant’s selective looks towards the aggressor for the best model in the model comparison results (Supplementary Fig. 2a). For this estimation, we set an improper prior distribution for each coefficient parameter. Additionally, we set the default prior (a t distribution with degrees of freedom 3 and scale 2.5) of brms as the prior distribution of the intercept and the standard deviation of random effects. The posterior median and a 95% CI were calculated for each parameter.

All analyses were conducted using freely available packages in the R environment for statistical computing. The analysis codes are shared publicly on GitHub (https://github.com/dororo1225/PunishmentStudy).

References

1. Henrich, J. et al. Costly punishment across human societies. Science 312, 1767–1770 (2006).
2. Fehr, E. & Gächter, S. Altruistic punishment in humans. Nature 415, 137–140 (2002).
3. Fehr, E. & Fischbacher, U. Third-party punishment and social norms. Evol. Hum. Behav. 25, 63–87 (2004).
4. Boyd, R., Gintis, H., Bowles, S. & Richerson, P. J. The evolution of altruistic punishment. Proc. Natl Acad. Sci. USA 100, 3531–3535 (2003).
5. Fowler, J. H. Altruistic punishment and the origin of cooperation. Proc. Natl Acad. Sci. USA 102, 7047–7049 (2005).
6. Hauert, C., Traulsen, A., Brandt, H., Nowak, M. A. & Sigmund, K. Via freedom to coercion: the emergence of costly punishment. Science 316, 1905–1907 (2007).
7. Riedl, K., Jensen, K., Call, J. & Tomasello, M. No third-party punishment in chimpanzees. Proc. Natl Acad. Sci. USA 109, 14824–14829 (2012).
8. Raihani, N. J., Thornton, A. & Bshary, R. Punishment and cooperation in nature. Trends Ecol. Evol. 27, 288–295 (2012).
9. Guala, F. Reciprocity: weak or strong? What punishment experiments do (and do not) demonstrate. Behav. Brain Sci. 35, 1–15 (2012).
10. Jordan, J. J., Hoffman, M., Bloom, P. & Rand, D. G. Third-party punishment as a costly signal of trustworthiness. Nature 530, 473–476 (2016).
11. Hamlin, J. K., Wynn, K., Bloom, P. & Mahajan, N. How infants and toddlers react to antisocial others. Proc. Natl Acad. Sci. USA 108, 19931–19936 (2011).
12. Tasimi, A. & Wynn, K. Costly rejection of wrongdoers by infants and children. Cognition 151, 76–79 (2016).
13. Vaish, A., Missana, M. & Tomasello, M. Three-year-old children intervene in third-party moral transgressions. Br. J. Dev. Psychol. 29, 124–130 (2011).
14. Kenward, B. & Östh, T. Enactment of third-party punishment by 4-year-olds. Front. Psychol. 3, 373 (2012).
15. Riedl, K., Jensen, K., Call, J. & Tomasello, M. Restorative justice in children. Curr. Biol. 25, 1–5 (2015).
16. Jordan, J. J., McAuliffe, K. & Warneken, F. Development of in-group favoritism in children’s third-party punishment of selfishness. Proc. Natl Acad. Sci. USA 111, 12710–12715 (2014).
17. McAuliffe, K., Jordan, J. J. & Warneken, F. Costly third-party punishment in young children. Cognition 134, 1–10 (2015).
18. Marshall, J., Yudkin, D. A. & Crockett, M. J. Children punish third parties to satisfy both consequentialist and retributive motives. Nat. Hum. Behav. 5, 361–368 (2021).
19. Marshall, J., Gollwitzer, A., Wynn, K. & Bloom, P. The development of corporal third-party punishment. Cognition 190, 221–229 (2019).
20. Premack, D. & Premack, A. J. Infants attribute value± to the goal-directed actions of self-propelled objects. J. Cogn. Neurosci. 9, 848–856 (1997).
21. Kanakogi, Y., Okumura, Y., Inoue, Y., Kitazaki, M. & Itakura, S. Rudimentary sympathy in preverbal infants: preference for others in distress. PLoS ONE 8, e65292 (2013).
22. Kanakogi, Y. et al. Preverbal infants affirm third-party interventions that protect victims from aggressors. Nat. Hum. Behav. 1, 0037 (2017).
23. Scarf, D., Imuta, K., Colombo, M. & Hayne, H. Social evaluation or simple association? Simple associations may explain moral reasoning in infants. PLoS ONE 7, e42698 (2012).
24. Thomsen, L., Frankenhuis, W. E., Ingold-Smith, M. & Carey, S. Big and mighty: preverbal infants mentally represent social dominance. Science 331, 477–480 (2011).
25. Mascaro, O. & Csibra, G. Representation of stable social dominance relations by human infants. Proc. Natl Acad. Sci. USA 109, 6862–6867 (2012).
26. Meng, X., Nakawake, Y., Nitta, H., Hashiya, K. & Moriguchi, Y. Space and rank: infants expect agents in higher position to be socially dominant. Proc. R. Soc. B 286, 20191674 (2019).
27. Wang, Q. et al. Infants in control: rapid anticipation of action outcomes in a gaze-contingent paradigm. PLoS ONE 7, e30884 (2012).
28. Deligianni, F., Senju, A., Gergely, G. & Csibra, G. Automated gaze-contingent objects elicit orientation following in 8-month-old infants. Dev. Psychol. 47, 1499–1503 (2011).
29. Miyazaki, M., Takahashi, H., Rolf, M., Okada, H. & Omori, T. The image-scratch paradigm: a new paradigm for evaluating infants’ motivated gaze control. Sci. Rep. 4, 5498 (2014).
30. Hinne, M., Gronau, Q. F., van den Bergh, D. & Wagenmakers, E. J. A conceptual introduction to Bayesian model averaging. Adv. Methods Pract. Psychol. Sci. 3, 200–215 (2020).
31. Morey, R. D., Rouder, J. N. & Jamil, T. BayesFactor: Computation of Bayes factors for common designs. R package version 0.9.12-4.2 https://CRAN.R-project.org/package=BayesFactor (2015).
32. Tendeiro, J. N. & Kiers, H. A. A review of issues about null hypothesis Bayesian testing. Psychol. Methods 24, 774–795 (2019).
33. Kruschke, J. K. Bayesian analysis reporting guidelines. Nat. Hum. Behav. 5, 1282–1291 (2021).
34. Lee, M. D. & Wagenmakers, E. J. Bayesian Cognitive Modeling: A Practical Course (Cambridge Univ. Press, 2014).
35. Mendes, N., Steinbeis, N., Bueno-Guerra, N., Call, J. & Singer, T. Preschool children and chimpanzees incur costs to watch punishment of antisocial others. Nat. Hum. Behav. 2, 45–51 (2018).
36. Open Science Collaboration. Estimating the reproducibility of psychological science. Science 349, aac4716 (2015).
37. Nieuwenhuis, S., Forstmann, B. U. & Wagenmakers, E. J. Erroneous analyses of interactions in neuroscience: a problem of significance. Nat. Neurosci. 14, 1105–1107 (2011).
38. Hamlin, J. K., Wynn, K. & Bloom, P. Social evaluation by preverbal infants. Nature 450, 557–559 (2007).
39. Spataro, R., Ciriacono, M., Manno, C. & La Bella, V. The eye-tracking computer device for communication in amyotrophic lateral sclerosis. Acta Neurol. Scand. 130, 40–45 (2014).
40. Meristo, M. & Surian, L. Do infants detect indirect reciprocity? Cognition 129, 102–113 (2013).
41. Sloane, S., Baillargeon, R. & Premack, D. Do infants have a sense of fairness? Psychol. Sci. 23, 196–204 (2012).
42. Synofzik, M., Vosgerau, G. & Newen, A. I move, therefore I am: a new theoretical framework to investigate agency and ownership. Conscious. Cogn. 17, 411–424 (2008).
43. Téglás, E., Vul, E., Girotto, V., Gonzales, M. & Tenenbaum, J. B. Pure reasoning in 12-month-old infants as probabilistic inference. Science 332, 1054–1059 (2011).
44. Barr, D. J., Levy, R., Scheepers, C. & Tily, H. J. Random effects structure for confirmatory hypothesis testing: keep it maximal. J. Mem. Lang. 68, 255–278 (2013).
45. Bürkner, P.-C. Brms: an R package for Bayesian multilevel models using Stan. J. Stat. Softw. 80, 1–28 (2017).
46. Bürkner, P.-C. Advanced Bayesian multilevel modeling with the R package brms. R J. 10, 395–411 (2018).
47. R Core Team. R: A Language and Environment for Statistical Computing Version 4.0.3 (R Foundation for Statistical Computing, 2020).
48. Carpenter, B. et al. Stan: a probabilistic programming language. J. Stat. Softw. 76, 1–32 (2017).
49. Margoni, F. & Surian, L. Infants’ evaluation of prosocial and antisocial agents: a meta-analysis. Dev. Psychol. 54, 1445–1455 (2018).

Acknowledgements

This research was supported by a Grant-in-Aid for Scientific Research (B) to Y.K. (no. 20H04495); a Grant-in-Aid for Young Scientists (B) to M.M. (no. 16K21341); and CREST, JST (no. JPMJCR18A4), CAO (sip) and JSPS (no. 20H05555) grants to K.H. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. We thank M. Ishikawa for technical advice.

Author information

These authors contributed equally: Yasuhiro Kanakogi, Michiko Miyazaki, Hideyuki Takahashi.

Authors and Affiliations

Graduate School of Human Sciences, Osaka University, Suita, Japan
Yasuhiro Kanakogi & Hiroki Yamamoto
Faculty of Social Information Studies, Otsuma Women’s University, Chiyoda-ku, Japan
Michiko Miyazaki
Graduate School of Engineering Science, Osaka University, Toyonaka, Japan
Hideyuki Takahashi
Graduate School of Letters, Kyoto University, Kyoto, Japan
Hiroki Yamamoto
NTT Communication Science Laboratories, Seika, Japan
Tessei Kobayashi
Graduate School of Arts and Sciences, The University of Tokyo, Meguro-ku, Japan
Kazuo Hiraki

Contributions

Y.K., M.M. and H.T. developed the study concept and design, which was supervised by T.K. and K.H. Y.K. and M.M. performed the experiments. Y.K. and H.Y. analysed the data. Y.K. drafted the paper. All authors discussed the results, commented on the final manuscript and approved its submission.

Corresponding author

Correspondence to Yasuhiro Kanakogi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Human Behaviour thanks Francesco Margoni, Thorsten Kolling, Michael Frank, Jorge Tendeiro and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Figs. 1–3 and Tables 1–13.

Reporting Summary

Peer Review File

Supplementary Video 1

Examples of the sequence (pretest, movie phase and posttest) in Experiment 1 (and 5) (MP4, 6.71 MB). Permission to use the participant’s image was obtained by M.M.

Supplementary Video 2

Aggressive interaction (MP4, 3.43 MB) between two geometric-cube agents in Experiments 1, 2, 3 and 5.

Supplementary Video 3

Physical (inanimate) interaction (MP4, 2.70 MB) between two geometric cubes in Experiment 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Kanakogi, Y., Miyazaki, M., Takahashi, H. et al. Third-party punishment by preverbal infants. Nat Hum Behav 6, 1234–1242 (2022). https://doi.org/10.1038/s41562-022-01354-2

Received: 16 January 2021
Accepted: 01 April 2022
Published: 09 June 2022
Issue Date: September 2022
DOI: https://doi.org/10.1038/s41562-022-01354-2
