Social interactions are often complicated by conflicts of interest. Humans and other animals adopt diverse strategies to resolve such disputes. Stronger individuals can often secure their interests at the expense of weaker individuals, but this strategy can be costly if it requires aggression. Strategies that are more cooperative and egalitarian can also develop among kin1 or individuals who reciprocate in repeated interactions2. Theoretical and experimental studies suggest that cooperation depends on cognitive control processes that override the impulse to acquire tangible rewards3. This theory now finds support from Choe et al.4, writing in Nature Communications. The authors demonstrate that pairs of mice can learn to coordinate their behaviour to achieve an egalitarian distribution of rewards — but only when rewards are delivered directly to the brain, rather than through food.
Mice are flexible in their social behaviour. At low population densities, they establish and aggressively defend territories, whereas at higher population densities, they develop strict hierarchies in which a single male dominates several subordinates5. Neither of these strategies whiffs of cooperation. For male mice, as for many other animals, size, aggressiveness and persistence strongly determine social rank. Mice decide whether to compete by comparing potential costs and benefits on the basis of perceived asymmetries in these qualities. These computations rely on a neural circuit that connects two brain regions — the mediodorsal thalamus and the dorsomedial prefrontal cortex6.
Choe et al. set out to investigate whether mice have the capacity to override their natural tendencies towards dominance-based conflict resolution. To do this, they developed a clever coordination task. They trained mice to enter a central start zone in a three-chambered box, and then to follow a visual cue to either the left or right chamber of the box to receive a reward. Next, they paired trained animals to take the trial together. When both mice occupied the start zone, a trial was initiated (Fig. 1a). The first mouse to enter the correct chamber received a reward of either food pellets or wireless brain stimulation (WBS) of the medial forebrain bundle — a region that, when stimulated, can override all other rewards, including food, water and sex7 (Fig. 1b). In the WBS trials, the reward was terminated if the second mouse entered the chamber (Fig. 1c), although this was not possible in the food trial.
As expected, when mice were rewarded with food pellets, dominant ones coerced their subordinate partners into the start zone to enable the trial to begin, and then monopolized the rewards. By contrast, Choe and colleagues found that most animals that were rewarded with WBS developed and maintained a simple alternate-side-allocation rule: each mouse in a pair monopolized only one reward chamber and avoided the other (Fig. 1d). As a result, one mouse gained rewards in trials when the left-hand chamber was the reward chamber, and the other gained rewards when the right-hand chamber was the reward chamber. By following this rule, mice increased both the total amount of reward received and the equality with which that reward was divided.
Remarkably, WBS seemed to override the hierarchical, despotic behaviour that developed over food rewards. Rule-following mice in WBS trials displayed very little aggression, and the limited aggression observed had minimal impact on choice behaviour. Asymmetries in the sizes of the paired animals, which are a key determinant of social status, also had no effect on WBS-induced cooperation. Even when the authors reshuffled the mice into new pairs in which both animals had the same side preference (both monopolizing the left chamber in their previous trials, for instance), the animals rapidly re-established the alternate-side-allocation rule — thus demonstrating remarkable flexibility.
Choe and colleagues’ experiments indicate that certain factors can put natural limitations on cooperation. These include food deprivation8 and the presence of a powerful appetitive stimulus, the food pellet, which was clearly visible in the food-reward trials, and was presumably aromatic, too. By contrast, although WBS was associated with a light cue, it was otherwise not obvious to the unrewarded animal. These findings resonate with previous studies showing that the physical presence of tangible rewards impairs delayed gratification in blue jays9, complex rule-following by monkeys10 and chimps11, and cooperation in humans12.
The current study raises several questions. First, is social coordination by rule-following supported by the same neural circuit between the mediodorsal thalamus and dorsomedial prefrontal cortex that underlies status-based conflict resolution? If not, perhaps WBS overrides this circuit by triggering different circuits that stamp in a more ‘cognitive’ strategy.
Second, what role do internal states, such as hunger, have in strategy selection? The mice in Choe and colleagues’ WBS trials were not food-deprived, and it would be interesting to determine how hunger would affect their behaviour.
And third, to what extent is rule-based coordination social at all? Determining to what extent this coordination depends on physical similarity between partners, transmission of social signals, or the implementation of a similar computational routine could provide clues to this question. If WBS could elicit the same type of coordination between a mouse and a robot, for example, this would demonstrate that the behaviour observed in the current study does not involve any sort of attribution of agency or strategic thinking, and instead arises from pure associative learning. Could WBS drive cooperation between the cartoon characters Tom the cat and Jerry the mouse, or might it just stop them from fighting?
Choe et al. have provided a compelling demonstration of a transition from aggressive to more-egalitarian interactions, at a time when examples of cooperation between animals in the laboratory are controversial and rare8. Crucially, they have done so in mice, an animal model that will allow the whole range of powerful techniques in the neuroscience toolbox, from behaviour tracking to molecular-genetic tools such as optogenetics to electrophysiology, to be brought to bear on the these tricky but important social questions.
Nature 553, 284-285 (2017)