Reputation management promotes strategic adjustment of service quality in cleaner wrasse

Adjusting one’s behaviour in response to eavesdropping bystanders is considered a sophisticated social strategy, yet the underlying mechanisms are not well studied. Cleaner wrasse, Labroides dimidiatus, cooperate by eating ectoparasites off “client” fishes, or cheat (i.e. bite) and eat client mucus. Image scoring by bystander clients generally causes cleaners from socially-complex (i.e. high cleaner and client abundance; high client species richness) habitats to increase levels of cooperation. However, some individuals may periodically provide tactile stimulation to small resident clients, which attract bystanders close that are bitten, a form of tactical deception. Cortisol injection can reproduce this pattern. Here, we tested whether cleaners from socially-complex versus simple habitats respond differently to cortisol injections in terms of their cleaning interactions with clients. We found that only cleaners from the socially-complex habitat respond to cortisol injection with strategies functioning as tactical deception: i.e. increased tactile stimulation to small clients and increased cheating of large clients relative to small ones. At the socially-simple site, where reputation management is less important, cortisol-treated fish increased their overall levels of cheating, especially of small clients. Thus, strategic adjustments to cooperative behaviour and tactical deception are likely context-dependent, forming part of general reputation management abilities in cleaner wrasse.

it was initially proposed that tactical deception requires complex cognitive processes, including a 'theory of mind' (the ability to perceive other individuals as agents with own desires and beliefs 12 ). Thus, tactical deception was seen as an integral part of the Machiavellian intelligence hypothesis and the social brain hypothesis, which both propose that the complexity of life in social groups selected for large brains 11,13 . Evidence of tactical deception is, nonetheless, found in a diverse range of taxa. For instance, Amazonian flycatchers (Lanio versicolor and Thamnomanes schistogynus) and capuchin monkeys (Cebus apella nigritus) occasionally give predator alarm calls in the absence of a threat to monopolize food resources from con-and/or heterospecifics 14,15 . Other forms of tactical deception have been documented in birds [16][17][18] , primates 8,19,20 , antelopes 21 , fish 22,23 , and cephalopods 24 . Such evidence makes studying the mechanisms underlying various social strategies a key prerequisite for understanding the links between behaviour, cognition and brain evolution. Although humans use theory of mind for tactical deception, other species may learn to tactically deceive through operant conditioning 7 . Alternatively, changes in physiological states may lead to context-dependent tactical strategies 23,25 . However, few studies have empirically investigated the physiological or cognitive mechanisms underlying when and in what contexts such strategies should be employed, but see refs 23, 26. Interspecific cleaning mutualisms involving the bluestreak cleaner wrasse, Labroides dimidiatus, represent an ideal opportunity to explore the drivers of variation in levels of cooperation and tactical deception across contexts. Cleaners provide a service by eating ectoparasites off the surfaces of heterospecific "client" fishes. Although these interactions are generally mutualistic [27][28][29] , a variety of potential conflicts of interest exist between L. dimidiatus and its clients. Most notably, L. dimidiatus prefers eating the mucus of clients over ectoparasites 30,31 . Such an action constitutes cheating on behalf of the cleaner as mucus serves as a protective layer for fish against abrasions, ultraviolet radiation and pathogen infection 32,33 . Cheating by cleaners is correlated with the clients' reaction of a full-body "jolt" of movement in response to surface contact with a cleaner mouth (a bite) 34 . Clients can employ a variety of partner control mechanisms to dissuade cheating depending on their strategic options. For instance, visitor clients with access to several cleaning stations may switch to a different cleaner for their next inspection if cheated, whereas resident clients with access to the local cleaner use punishment through aggressive chasing to incite honest service [35][36][37][38] . Furthermore, visitor fish arriving at a cleaning station will eavesdrop to extract information about the local cleaner's service quality and will only respond with invitation for inspection if the cleaner is seen behaving cooperatively 22,36 . Despite these disincentives, the temptation to cheat remains high: L. dimidiatus gain not only additional calories, but also essential amino acids by eating client mucus 39,40 . As a result, there is pressure on cleaners to make strategic decisions about how often and which clients they should bite under which circumstances in order to maximize their gain.
Service quality can vary greatly between and within individual cleaners. For example, normally cooperative individuals can temporarily become 'biting cleaners' that selectively cheat large non-predatory clients, in particular visitors 22 . Such individuals can experience reputational problems as cheating can be observed by bystanders, which will not typically allow a cleaner to approach after witnessing a cheating event. Biting cleaners can improve their reputation by providing small, resident clients with tactile stimulation (TS) from their pelvic and pectoral fins 22 , an action that reduces stress in clients 28 . Functionally, provisioning of TS to small clients in this circumstance works as tactical deception as is used outside of its normal context: TS is typically used for reconciliation and manipulation of current client decisions 41 . When performed on small residents, TS incites passing larger clients to invite inspection by cleaners, often to their own detriment and to the biting cleaners' advantage. As tactical deception is generally employed by females, mostly in the spawning season, it is likely used to maximize energy gain when energetic demands are high 42 . Indeed, a physiological basis for cleaner's switch from cooperation to cheating has recently been demonstrated experimentally. Soares et al. 23 injected hydrocortisone (cortisol) into female cleaners to simulate the fish's stress response, which is the outcome of the activation of the hypothalamic-pituitary-interrenal (HPI) axis, and induce a state of high energy demand 43 . Hormone-treated cleaners provided more TS to small clients and more bites to large clients, consistent with what is expected if cleaners are employing tactical deception 23 . Yet, whether cleaners choose to employ this strategy based on learned decision rules, or simply respond behaviourally to a physiological state of energetic need, is unclear.
To better understand the mechanisms underlying strategic adjustments to levels of cooperation in cleaner wrasse, we took advantage of recent findings suggesting that the interspecific social complexity of a habitat predicts the extent to which 'normal' cleaners care about their reputation (i.e. take the audience into account 44,45 ). Earlier research demonstrating that cleaners respond to the presence of image scoring bystanders with increased levels of cooperation had invariably used cleaners caught from reef habitats with high levels of social complexity 36,37 . In contrast, Wismer et al. 44 found that cleaners from a low social complexity reef, characterized by low client density and richness as well as low competition between cleaners over access to clients, typically did not show such audience effects. Although explanations for these observed differences in strategic behaviour are lacking, some evidence suggests that both the opportunity and need to learn are lower in cleaners from habitats with low social complexity (Wismer et al. unpublished data). In any case, these observed differences allowed us to test whether cortisol injections cause patterns of strategic behavioural adjustment consistent with tactical deception independently or as a function of everyday reputation management strategies. If reputation management is a prerequisite for such strategies, we hypothesize that strategic adjustments to baseline levels of cooperation should be employed by cortisol-injected cleaners in high complexity habitats only. If cleaners are engaging in tactical deception, we predict that cortisol-injected cleaners from high complexity habitats should provide more TS to small clients, and more bites to large clients. Conversely, cortisol-injected cleaners from low-complexity habitats should not adjust their behaviour based on client size, but should cheat more overall (i.e. give more bites to all clients) in response to their increased state of stress.

Results
Reef surveys. Our two study sites differed in four measures of social complexity (Fig. 1). The high-complex-   Table S1): Cortisol-treated cleaners gave more TS to small clients than saline-treated cleaners but only in high social complexity habitats (Supplementary Figure S3). Results from parametric bootstrapping showed that the three-way interaction term explained only a marginally significant proportion of the residual deviance when compared to the full model lacking this term (PBtest: LRT = 3.766, p = 0.052, 1000 simulations). We therefore report significant lower-order terms as well. There was a significant two-way interaction between client size and site (χ 2 = 8.61, p = 0.003, Supplementary Table S1): large clients receive more TS than small clients in sites with low social complexity. The main effect of treatment on the occurrence of TS was significantly different between cortisol and saline-injected fish (χ 2 = 14.29, p = 0.0002) with cortisol-treated cleaners giving TS to clients in a higher proportion of interactions than saline-treated fish. There was also a positive relationship between interaction duration and the occurrence of TS (χ 2 = 81.217, p < 0.0001).
Jolts. Fixed factors in our model explained approximately 11% of the variation in number of client jolts observed (pseudo R 2 = 0.109). There was a significant three-way interaction among client size, treatment and site (N = 2 180 client interactions; GLMM: χ 2 = 7.238, p = 0.007; Fig. 3, Supplementary Table S2). In the high social complexity site, cortisol-treated cleaners caused large clients to jolt more often than small clients ( Supplementary Fig. S4). In contrast, cortisol-treated cleaners caused small clients to jolt more than larger clients in the low complexity site ( Supplementary Fig. S5). Although there was a tendency at the high complexity site for larger clients to receive more jolts from cortisol-injected than saline-injected cleaners, this difference was not significant (Fig. 3). Results from parametric bootstrapping showed that the three-way interaction term explained a significant proportion of the residual deviance when compared to the full model lacking this term (PBtest: LRT = 7.578, p = 0.0059, 1000 simulations). Jolt frequency increased significantly with interaction duration (GLMM: χ 2 = 28.73, p < 0.0001, Supplementary Table S2).

Discussion
The rationale of our experiments were based on three previous studies: the documentation of tactical deception of clients by cleaner wrasse 22 , the demonstration that injection of cortisol can trigger increased occurrence of tactical deception 23 , and the observation that cleaners from high versus low social complexity habitats differ in their reputation management strategies 44 . By subjecting cleaners from both habitats to cortisol injections, we could ask whether cortisol invariably triggers behavioural changes that are functionally tactical deception, or whether a cleaner's response to increased cortisol depends on other variables. Our results show that only cleaners from the habitat with high social complexity respond to cortisol injection with strategies functioning as tactical deception, i.e. increased tactile stimulation to small clients and increase cheating of large clients relative to small ones. This was achieved through a reduction in the number of bites given to small clients rather than an increase in the overall number of bites given to large clients as we had predicted. The differences between cleaners from the high and low social complexity site cannot be explained by different basal cortisol levels leading to different end concentrations of cortisol due our injections ( Supplementary Fig. S2). Thus, there is no direct causal link between cortisol concentrations and the occurrence of selective changes in service quality in cleaner wrasse.  Figure S3 for model predictions and effects plots depicting the significant difference between the proportion of interactions with tactile stimulations given by cortisol and saline treated cleaners to small clients at high-complexity sites.
Instead, cortisol injections signal a change of energetic need and individuals must decide how to respond to meet these requirements. We hypothesized that reputation management strategies may affect the behavioural response of cleaners to cortisol injection. Indeed, our results are broadly consistent with this hypothesis: only cleaners from the high social complexity site, where cleaners typically respond to image scoring clients with audience effects (i.e. increased service quality) 44,45 , displayed patterns consistent with tactical deception when injected with cortisol. In contrast, cleaners from the low social complexity habitat, where evidence for audience effects under normal conditions is lacking 45 , did not strategically adjust TS or cheating rates to different client types, but cheated more frequently, especially on smaller clients ( Supplementary Fig. S4). We note that cortisol injections at the high-complexity site seemed to reduce jolts in small clients more than it increased jolts in large clients (Fig. 3). Thus, it is not clear from our data that the changes in behaviour observed in cortisol-injected cleaners represent a cost to large clients relative to the baseline levels of cheating occurring in the controls. Furthermore, we could not explicitly test the frequency with which TS given to small residents is immediately followed by a jolt given to a large client due to the short time window for observations following manipulations (45 min per cleaner). The increased proportion of TS given to small clients at the high complexity site may, indeed, entice large clients to approach the cortisol-injected cleaners. However, this strategy may simply be a way for cleaners to access image-scoring clients in situations when energy needs are high rather than to deceive per se. Nonetheless, the significant three-way interaction confirms the differences in strategic adjustments between these two sites.
It is important to realize that the observed link between reputation management and strategic behavioural adjustments are only correlational. Furthermore, we only compare two sites. Although we interpret the behavioural differences observed between these sites to be due to differences in their social complexity, other explanations exist. For instance, cleaners from the two sites may differ in various other ways, including physiologically (i.e. different amount of gluccocorticod receptors in their brains and/or responsiveness to increases in cortisol). Differences could also be directly caused by intrinsic habitat differences (i.e. water currents, habitat quality). For example, Wismer et al. 44 reported a lower overall frequency of interactions, especially with large visitor clients, in habitats with low social complexity. Thus, situations in which strategic behavioural adjustments yield benefits (i.e. a large potential client arriving at the cleaning station and inviting inspection because it sees a small client receiving high quality service) may simply be too infrequent at these sites for cleaners to benefit much by adjusting to them. Although we did not observe differences in the number of interactions with small and large clients between sites or injection treatments (Supplementary Table S3), evidence from field observations suggests that relevant situations, such as cleaners having to choose between residents and visitors seeking cleaning simultaneously, are generally much rarer when social complexity is low (Wismer et al. unpublished data). However, laboratory experiments suggest that cleaners from low social complexity habitats are not capable of adjusting to image scoring even when it would be beneficial to do so 44,45 . Also, the circumstances leading to increased service quality or to tactical deception in cleaners from high complexity habitats are similar: both involve image scoring bystander clients and cleaners showing behavioural adjustments in response. We therefore propose that cleaners from high complexity habitats learn to respond to image scoring in a flexible way, i.e. as a function of their  Figure S4 for model predictions and effects plots depicting the significant difference between jolts received by small and large clients at high-complexity sites by cortisol-injected cleaners and S5 for the significant difference in jolts received by small clients from cortisol treated cleaner wrasse at sites of low and high social complexity. internal physiological state. A 'normal' physiological state leads to overall increased levels of cooperation while high energetic demands lead to strategies which may function as tactical deception. Mathematical modelling shows that such state dependent decision making can be adaptive 25 .
We currently lack an understanding of the mechanisms underlying audience effects, including behaviours qualifying functionally as tactical deception. Our study provides insights into the mechanisms behind context-dependent strategic behavioural adjustment in cleaner wrasse: we can exclude the simplest potential explanation, i.e. a physiological agent directly causing the behavioural patterns. Instead, while previous studies have suggested that energetic needs alone seem to cause more cheating, individual cleaners appear to decide how to implement cooperative behaviours versus cheating with a variety of clients within a communication network. In this context, the general ability to manage one's own reputation emerges as a prime candidate for the expression of strategic behavioural adjustment that may function as tactical deception. Its expression is likely due to individual flexibility in decision making. While the current study cannot address whether cleaners understand how their reputation works, nor the direct costs of deceptive strategies to receivers, our results do show that cleaner wrasse are able to fine-tune their behaviour in sophisticated ways to both internal and external circumstances.  44,46 . One scuba diver swam with a transect tape across the reef at a constant speed, and recorded the number and species identity of all individuals greater than 10 cm in length within 2.5 m on either side of the tape (large clients; 150 m 2 ). A second diver followed behind the first to verify transect length. Once 30 m was reached, the two divers then swam back along the tape recording the abundance and species identity of all individuals smaller than 10 cm within 50 cm on either side of the tape (small clients; 30 m 2 ). Having the second diver lying out the transect tape closely behind the first has been shown to reduce diver effects 47 . Ten transects were recorded per reef site covering all representative microhabitats at each site (1 500 m 2 of habitat surveyed per site). All transects were conducted by the same diver team (OR and SW) on relatively calm weather days between 9:00 and 15:00 in July and August when visibility was good to standardize the fish counts as much as possible. Total client and cleaner wrasse (L. dimidiatus) abundance was estimated for each transect and standardized to an area of 100 m 2 .

Methods
Baseline cortisol levels in cleaner wrasse. We compared whole body cortisol levels from adult female cleaner wrasse collected from sites differing in social complexity around Lizard Island. Seven adult female cleaners from patches at Watson Bay were collected in July-August 2015. As fish collection for the purpose of tissue extraction is not permitted at the Big Vickie's site, we collected 14 individuals from nearby socially-complex reefs that have been used in earlier studies documenting audience effects. Fish were collected using a small monofilament barrier net (2 m H × 1 m L, 10 mm stretch mesh) and a hand net, and transported directly to a waiting researcher (ZT) on the boat for immediate processing. Fish were sacrificed by cervical transection and the bodies were kept on ice during transport back to the LIRS facilities. Samples were then frozen at −20 °C, and shipped to the University of Neuchâtel in Switzerland for further processing.
In Neuchâtel, fish bodies were weighed and placed in a volume of 100% methanol equivalent to nine times the tissue weight. The mixture was then homogenized using a VWR ® 200 Homogenizer, and 1 mL of the homog- length (TL) was measured to the nearest cm using a flexible ruler to determine injection volume based on known length-mass relationship for this species (see Supplementary Fig. S1). Fish were then injected intramuscularly with either saline solution or cortisol solution corresponding to 1 µg per g of body mass. Injection volumes ranged from 20 µl to 40 µl. Treatment order was counterbalanced within and between sites. Fish were released following injection and filmed by a diver for 45 min with Canon G15 cameras from a distance of approximately 2 m. No fish suffered from detectable injury or death after the injections or behavioural observations. All methods of fish capture, handling, injection and observations were approved by the Government of Queensland's animal ethics committee.
Video analyses. Video recordings were analyzed by the same researcher (OR), who was blind to the site and treatment of each video at the time of analysis (videos were renamed by a third party prior to analysis). For each video, OR recorded a) client fish family; b) client fish size (TL estimated visually to the nearest cm); c) the duration (seconds) of each interaction d) whether tactile stimulation (when a cleaner touches the body of the client with its ventral region and pelvic fins in the absence of feeding) was provided; and e) the number of jolts (whole-body shudders in response to cleaner mouth contact) performed by the client during each interaction. Jolts are good correlates of biting (cheating) in clients as they typically occur when cleaners ingest client mucus and/or scales as opposed to ectoparasites 34 . We distinguished two client size categories as in Soares et al. 23 . Small clients were <14 cm TL and consisted mostly of resident individuals from the following families: Chaetodontidae, Pomacentridae, and Labridae. Large clients measured >14 cm TL and were predominantly Acanthuridae, Caesionidae, Labridae and Nemipteridae, which tend to be roving visitors at cleaner stations. Client size has previously been used as a proxy of cleaner wrasse food value with larger clients considered higher value 48, 49 . Statistical analyses. We performed statistical analyses using R 3.1.2. From our reef transect surveys, we compared the abundance and diversity of client fish and the abundance of cleaner wrasse between our two study sites using Student's t-tests. Baseline levels of cortisol between reef types (high vs. low social complexity) were also analyzed using a Student's t-test. We tested for homogeneity of variances and normality using Fisher's F-tests and Shapiro-Wilkes tests. Client abundance and whole body cortisol levels were log10-transformed to meet model assumptions.
Differences in large and small client species richness between the two study sites were tested using generalized linear models (GLM) with a poisson error distribution using the glm function in the R package "lme4" 50 . We tested for overdispersion using the dispersiontest function in the R package "AER" 51 . Normality of residuals were verified using qqplots. We calculated a pseudo R 2 by comparing the residual deviance by the null deviance of the model (pseudo R 2 = 1 − (residual deviance / null deviance)).
We recorded 2180 interactions from 36 cleaners. From these videos, we scored all occurrences of cheating behaviour and tactile stimulations. We used a generalized linear mixed model (GLMM) with a binomial error distribution (glmer function in the package lme4 50 ) to test whether the occurrence of tactile stimulations was affected by hormone treatment, habitat social complexity (site) and/or client fish size (fixed factors) while controlling for cleaner identity and client family (random factors). Interaction duration was centered and Z-standardized using the "scale" function 52 , and included as a covariate. We excluded interaction durations longer than 100 seconds as these occurred infrequently (3 occurrences in 2180 interactions). Excluding these values did not qualitatively change the model results, but improved diagnostics. We included two-way interactions between client size, treatment and site and a three-way interaction among these factors. There was no significant difference ( p > 0.10) between models with and without 2-way interactions between main effects and duration, and the model without interactions between main effects and duration had a lower AIC score than models which included these additional two-way interactions. Normality of residuals for random factors were verified using qqplots. We calculated the marginal R 2 (variance explained by the fixed factors; R 2 GLMM(m) ) and conditional R 2 (variance explained by the fixed and random factors; R 2 GLMM(c) ) following Nakagawa and Schielzeth 53 using the "MuMIn" package 54 . We used the "effects" function from the R package "effects" 55 to visualize interactions. We also performed a parametric bootstrapping model comparison (100 bootstrapped samples) using the function "PBmodcomp" from the package "pbkrtest" 56 to assess the significance of the three-way interaction (Site X Treatment X Size) with a likelihood ratio test.
We used a GLMM with a negative binomial error distribution (glmer.nb function in the R package "lme4") to test whether cheating rate (number of client jolts) was affected by hormone treatment, habitat social complexity (site) and/or client fish size (fixed factors) while controlling for cleaner identity and client family (random factors). Scaled interaction duration was included as a covariate (all 2180 data points included). As above, we only included two-way interactions between client size, treatment and site and a three-way interaction among these factors because there was no significant difference (p > 0.66) between models with and without 2-way interactions between main effects and duration, and the model without interactions between main effects and duration had a lower AIC score. Normality of residuals for random factors were verified using qqplots. We calculated a pseudo R 2 by comparing the residual deviance by the null deviance of the model (pseudo R 2 = 1 − (residual deviance/null deviance)). We used the "effects" function from the R package "effects" 55 to visualize interactions. As above, we performed a parametric bootstrapping model comparison to assess the significance of the three-way interaction (Site X Treatment X Size) with a likelihood ratio test. Data Availability. The data for this study will be publicly archived in the repository figshare upon acceptance of the manuscript (doi:10.6084/m9.figshare.3817392) following best practices 57 . Reviewers can access the data here: https://figshare.com/s/28421593a0d49d4bb2c5.