Introduction

Sensitivity to reward is modulated by prior experiences, particularly during early developmental periods [1,2,3,4,5,6,7,8,9,10,11]. In the United States, over 30% of children experience some type of early-life adversity (ELA) related to poverty, trauma, and chaotic environments [12] and these events have been associated with poor cognitive and emotional health [13,14,15,16]. Anhedonia, the loss of pleasure or lack of reactivity to previously rewarding stimuli, is another consequence associated with ELA. Adversity during childhood has been shown to alter (in rodents) or predict (in humans) the development of the reward system and impair reward responsiveness later in life [17,18,19,20,21,22,23]. Important questions remain, however, regarding causality in human studies, and the mechanisms and salient signals of adversity in both humans and animal models.

Blunted reactivity to previously rewarding activities is also commonly associated with major depressive disorder [24], bipolar disorder [25], schizophrenia [26], post-traumatic stress disorder [27, 28], and substance use disorders [29]. Given the prevalence of anhedonia across diverse neuropsychiatric conditions and the absence of effective pharmacotherapeutic treatment options, an active research domain has emerged dedicated to examining the features and mechanisms of this highly heterogeneous construct [30]. For decades, clinical assessments of anhedonia have relied heavily on self-report questionnaires such as the Snaith–Hamilton Pleasure Scale [31], Chapman Physical and Social Anhedonia Scales [32], and Fawcett-Clark Pleasure Capacity Scale [33]. In rodents, assays such as the sucrose preference task and intracranial self-stimulation have been commonly employed to assay anhedonic-like phenotypes [34]. However, they depart significantly from diagnostic instruments used in the clinic. Therefore, recent work has focused on the development and empirical validation of coordinated laboratory assays in clinical patients and laboratory animals designed to quantify more objectively responsiveness to reinforcement contingencies as a means to identify deficits. Here, the expectation is that bi-directional alignment in methodologies across species will advance our understanding of the mechanisms involved in disorders in which anhedonia is prominent and accelerate the discovery of treatment options.

One such approach that has been chosen as a recommended task for the reward learning subdomain in the latest revision [35] of the RDoC matrix [36] is the Probabilistic Reward Task (PRT) [37, modified after 38]. In this computerized task, human participants make rapid discriminations between two lines that differ in length, represented as the mouth on a cartoon face (see Fig. 1). Unbeknownst to the participants, asymmetric 3:1 probabilistic reinforcement contingencies are programmed such that correct responses to one line length are rewarded 60% of the time, whereas correct responses to the other line length are rewarded only 20% of the time. As predicted by signal detection theory [39,40,41], healthy participants reliably develop a response bias toward the more richly rewarded stimulus; however, participants with anhedonia consistently exhibit a blunted response bias that has been found to correlate with current and predict future anhedonia in depression and other neuropsychiatric conditions [42,43,44,45,46].

Fig. 1: Study overview.
figure 1

Experimental timeline across postnatal (P) days and task schematics for the ELA limited bedding and nesting paradigm, sucrose preference test, and touchscreen-based PRT.

The ability of the PRT to objectively quantify reward responsiveness in clinical populations with anhedonia has inspired reverse translation of this task for rodents [47, 48] and nonhuman primates [49]. Studies in laboratory animals have already elucidated functions related to neurochemical signaling systems [50], revealed expected reductions in reward responsiveness following social defeat stress [51], and documented increases in sensitivity following treatment with drugs known to enhance hedonic tone [48, 49]. In order to examine the relationship between ELA and anhedonic phenotypes, the present studies employed the touchscreen-based PRT in rats to evaluate whether the limited bedding and nesting protocol (a model of chronic ELA as a result of unpredictable maternal care) would be associated with a blunting in reward responsiveness that persists into adulthood. PRT outcomes were also compared to traditional sucrose preference measures and entropy metrics of dam/pup interactions as a means to determine specificity in blunted reward responsiveness with predictability in maternal interaction.

Methods

Subjects

A total of 32 male Sprague Dawley rats across two cohorts and born from six timed-pregnant dams (delivered embryonic day 15; Envigo, Livermore, CA) served in the present studies. Cohort 1 was comprised of 6 control and 6 ELA rats and Cohort 2 was comprised of 11 control and 9 ELA rats. The experimental timeline and task schematics are summarized in Fig. 1. Following standard or ELA rearing protocols, entropy measurements, and subsequent sucrose preference testing at the University of California-Irvine detailed below, Cohort 1 was sent via 1-day shipping from California to Massachusetts in December 2019 and Cohort 2 was sent using the same carrier in December 2020. Upon arrival at McLean Hospital, Cohort 1 was quarantined for 60 days in an isolated climate-controlled vivarium bay with unrestricted access to rodent chow and water. Cohort 2 was quarantined under identical conditions but, due to COVID-19-related delays in sentinel testing, was not released for approximately 90 days. Following clearance from quarantine, subjects were transferred to a larger vivarium with otherwise identical housing conditions. Subjects continued to have unrestricted access to water in their home cage; however, to establish sweetened condensed milk as a reinforcer, subjects were food-restricted via daily post-session portions of 10–15 g of rodent chow. All vivarium technicians and research assistants responsible for animal husbandry duties and conducting the behavioral studies were blinded to the subjects’ group assignment (i.e., control or ELA). See Table 1 for additional birth date and developmental weight data across these studies. The protocol was approved by the Institutional Animal Care and Use Committee at McLean Hospital and in accordance with guidelines from the Committee on Care and Use of Laboratory Animals of the Institute of Laboratory Animals Resources, Commission on Life Sciences (2011) [52].

Table 1 Subject details by cohort, including the number of pups per dam, date of birth, shipping date from California to Massachusetts, and mean (±SEM) body weight (g) during P10, P21, P62, and P78, following clearance from quarantine, and during the PRT testing condition.

Early-life adversity paradigm

Six dams across two cohorts (two in Cohort 1, four in Cohort 2) were randomly assigned as either control or ELA on P2. Pups were cross-fostered between litters born within 12 h to create litters of 12 with equal numbers of sexes where possible. ELA was imposed using the limited bedding and nesting paradigm, which consists of limiting nesting and bedding materials in cages between P2 and P9 as described previously [53,54,55]. For the ELA group, a plastic-coated mesh platform was placed 2.5 cm above the floor of a standard cage. Cob bedding was reduced to cover the cage floor sparsely, and one-half of a single paper towel was provided for nesting material on the platform. Control dams and litters resided in standard cages containing ample cob bedding and one whole paper towel which dams shred to create a nest. See Fig. 1 for a representative photograph of control and ELA rearing conditions. Both control and ELA cages were undisturbed during P2-P9, housed in temperature- and humidity-controlled rooms. On P10, all groups were transferred to standard cages. Rats were weaned on P21 and then group housed. Male rats were used in these experiments and the females were the subjects of other studies aimed at understanding female-specific effects of ELA [7].

Characterization of maternal unpredictability

Maternal behaviors were assessed via home cage observation for 60-min periods during P3–P6. Mothers were identified as performing one of seven different behaviors: licking/grooming pups, carrying pups, eating, nursing (in high- or low-arched posture), nest building, off pups, or self-grooming. Behaviors were recorded if lasting for a minimum of 3 s. Coding of maternal behaviors was performed live and from video recordings by two experimenters blinded to the group assignment. Recorded videos were manually coded using Behavioral Observation Research Interactive Software [56]. Entropy is a natural summary measure of randomness or unpredictability of a probability distribution and the entropy rate can be used to measure the unpredictability of a sequence of observed behaviors. To calculate the maternal entropy for each dam, the dam’s sequence of behaviors was characterized using the empirical transition matrix <pij>i,j = 1…7 of conditional probabilities of moving from one behavior (behavior i) to another (behavior j). The entropy rate of each dam was calculated from the transition matrix as previously described [8, 57] and detailed below.

Sucrose preference testing

When animals were two months of age, a preference for a weak sucrose solution (1.5% in filtered water) in comparison to water was performed in a limited two-bottle choice setting as described previously [3, 8]. Rats were acclimated to individual housing for 7 days with ad libitum food and given free access to two bottles containing 50 ml of filtered water and were handled daily for approximately 2 min. On day 8, rats were provided with two bottles, one containing 50 ml of filtered water and the other 50 ml of 1.5% sucrose solution during a 1-week period. The left/right position of the bottles was interchanged daily to avoid effects from a side-bias. Fluid (water and sucrose) consumption was recorded each morning, correcting for evaporation/spillage. Following completion of the task, rats were returned to their original housing groups.

PRT training and testing

Following arrival at McLean Hospital, subsequent clearance from quarantine, and implementation of food restriction conditions, PRT training and testing protocols were initiated. Empirical validation and task optimization of the touchscreen-based rat PRT can be found in [48]. Details of the rat touch-sensitive experimental chamber can be found in [58] and a task schematic and photograph are presented in Fig. 1.

Line length discrimination training

Discrete trials began with the presentation of a white line on a black background, with its lower edge presented 3 cm above the 5 × 5 cm left and right blue response boxes. The length of the line was either 600 × 120 px (31.5 × 6.5 cm: long line) or 200 × 60 px (10.5 × 3.25 cm: short line). Long and short line length trial types varied in a quasi-random manner across 100-trial sessions such that there were exactly 50 trials of each type, but a given trial type would not be presented more than five times in a row. Subjects were trained to respond to the left or right response box depending on the length of the white line (long line: respond left, short line: respond right, or vice versa). Response box designation was counterbalanced across subjects. During line length discrimination training, each correct response was reinforced with 0.1 ml of 30% sweetened condensed milk, paired with an 880-ms yellow screen flash and a 440 Hz tone, and followed by a 5-s blackout period, whereas each incorrect response immediately resulted in a 10-s blackout period. A correction procedure was implemented during initial discrimination training in which each incorrect trial was repeated until a correct response was made [59], and was discontinued after <10 repeats of each trial type occurred in two consecutive sessions. Discrimination training sessions continued without correction until accuracies for both line length trial types were ≥80% correct for two consecutive sessions, concordant with the performance criteria in previous human PRT studies [37, 43, 44]. After this criterion was met, PRT testing commenced.

PRT testing

Subjects were exposed to a 5-session protocol using 3:1 probabilistic reinforcement contingencies such that a correct response to one of the line lengths (long or short) was reinforced 60% of the time (rich stimulus), whereas a correct response to the other line length was reinforced 20% of the time (lean stimulus). Incorrect responses were never reinforced. The line length associated with the rich and lean contingency was determined for each subject during their final two line length discrimination training sessions by examining their accuracies and designating the line length with a higher mean accuracy as the stimulus to be rewarded on the lean schedule. This approach was expressly designed to examine the effects of ELA on response bias generated by responsivity to asymmetrical probabilistic contingencies, rather than the amplification of a preexisting inherent bias that is a function of uncontrolled variables.

Data analysis

Maternal entropy

Entropy was calculated using R (version 4.0.3; code available at https://github.com/bvegetabile/entropyrate). Specifically, the entropy rate of each dam was derived from the transition matrix [8, 57] as follows:

$$E = - \mathop {\sum }\limits_{i = 1}^M \pi _i\mathop {\sum }\limits_{j = 1}^M p_{ij}{{{{{\rm{log}}}}}}\left( {p_{ij}} \right)$$

where pij is the conditional probability that maternal behavior j is observed next after a mother is observed performing behavior i, πi is the frequency with which behavior i is observed, and M (=7) is the number of different behaviors. Entropy across P3-P6 was analyzed using two-way repeated-measures ANOVA, with time as a repeated factor. Average entropy was taken across the 4 days and tested for significance across the two conditions using Student’s t-test (p < 0.05) for differences between means.

Sucrose preference

Preference for the sucrose solution was calculated as (sucrose consumption [ml] / total fluid consumption [ml]) × 100 and averaged across the 6 days. Due to missing values, preference over time was analyzed using a mixed-effects model (Restricted Maximum Likelihood), with time as a repeated factor. Average preference over 6 days was tested for significance using Student’s t-test (p < 0.05) for differences between means.

PRT

The implementation of probabilistic contingencies yields two primary dependent measures: response bias and discriminability, which can be quantified using equations derived from signal detection theory [39,40,41] by examining the number of Correct and Incorrect responses for Rich and Lean trial types. Response bias is calculated using the following log b equation:

$$\log b = 0.5 \ast {{{{{{{\mathrm{log}}}}}}}}\left( {\frac{{\left( {{{{{{\mathrm{Rich}}}}}}_{{{{{{\mathrm{Correct}}}}}}} + 0.5} \right) \ast \left( {{{{{{\mathrm{Lean}}}}}}_{{{{{{\mathrm{Incorrect}}}}}}} + 0.5} \right)}}{{\left( {{{{{{\mathrm{Rich}}}}}}_{{{{{{\mathrm{Incorrect}}}}}}} + 0.5} \right) \ast \left( {{{{{{\mathrm{Lean}}}}}}_{{{{{{\mathrm{Correct}}}}}}} + 0.5} \right)}}} \right)$$

High bias values are produced by high numbers of correct responses for rich trials and incorrect responses for lean trials. Discriminability is calculated using the following log d equation:

$$\log d = 0.5 \ast {{{{{{{\mathrm{log}}}}}}}}\left( {\frac{{\left( {{{{{{\mathrm{Rich}}}}}}_{{{{{{\mathrm{Correct}}}}}}} + 0.5} \right) \ast \left( {{{{{{\mathrm{Lean}}}}}}_{{{{{{\mathrm{Correct}}}}}}} + 0.5} \right)}}{{\left( {{{{{{\mathrm{Rich}}}}}}_{{{{{{\mathrm{Incorrect}}}}}}} + 0.5} \right) \ast \left( {{{{{{\mathrm{Lean}}}}}}_{{{{{{\mathrm{Incorrect}}}}}}} + 0.5} \right)}}} \right)$$

High discriminability values are produced by high numbers of correct responses for both rich and lean trials. (0.5 is added to all parameters in both equations to avoid instances where no errors are made on a given trial type, thus making log transforms impossible.) The utility of these equations has been repeatedly confirmed in prior studies in humans [37, 42,43,44, 46, 60, 61] and laboratory animals [47,48,49,50,51]. In addition, accuracy (percent correct) and reaction time (latency from line presentation to response) were calculated and presented as individual subject values and session-wide group means (±SEM) for rich and lean trials. Unpaired t-tests and Cohen’s d were used to evaluate, respectively, group differences and effect sizes (control vs. ELA) within a cohort or with both cohorts combined in log b, log d, reaction time, sessions to meet discrimination training criteria, body weight, sucrose preference, and entropy. Accuracy during rich and lean trial types during the 5-session PRT testing condition was analyzed using a three-way repeated-measures ANOVA, with “session” and “stimulus” as repeated factors and “experience” as a between subject’s factor. Relationships among log b, sucrose preference, and entropy were evaluated via Pearson’s correlation coefficient. The criterion for significance was set at p < 0.05. All statistical analyses were conducted using GraphPad Prism 9 Software (San Diego, CA, USA).

Results

Reward responsiveness in the PRT is blunted in ELA rats

Figure 2 summarizes group mean (±SEM) and individual subject data across PRT outcomes in control (filled symbols) and ELA subjects (unfilled symbols) from Cohort 1 (left panels), Cohort 2 (middle panels), and both cohorts combined (right panels). Critically, and in line with our hypothesis, significant group differences were observed when examining response biases in favor of the rich stimulus averaged across the 5-session PRT testing condition (upper panels), with subjects in the ELA group exhibiting smaller log b values relative to control subjects in Cohort 1 (t[10] = 3.50, p = 0.006; d = 2.02), Cohort 2 (t[18] = 3.20, p = 0.005; d = 1.48), and when examining both cohorts combined (t[30] = 4.50, p < 0.0001; d = 1.62). In contrast to the response bias findings, groups did not differ in their discriminability values (log d) during the same 5-session PRT testing condition (middle row) in Cohort 1 (t[10] = 0.13, p = 0.90; d = 0.07), Cohort 2 (t[18] = 0.88, p = 0.39; d = 0.38), or when examining both cohorts combined (t[30] = 0.62, p = 0.54; d = 0.22). Likewise, all subjects in both groups completed all 100-trial PRT testing sessions with reaction times from line length stimulus onset to touchscreen response (lower row) that did not differ between groups in Cohort 1 (t[10] = 1.18, p = 0.27; d = 0.68), Cohort 2 (t[18] = 0.62, p = 0.54; d = 0.28), or when examining both cohorts combined (t[30] = 0.01, p = 0.99; d = 0.004).

Fig. 2: PRT outcomes by group and cohort.
figure 2

Effects of control (filled symbols) and ELA (unfilled symbols) rearing conditions in Cohort 1 (left panels; circles; control, n = 6, ELA, n = 6), Cohort 2 (middle panels; diamonds; control, n = 11, ELA, n = 9), and both cohorts combined (right panels; control, n = 17, ELA, n = 15) on response bias (upper row), task discriminability (middle row), and reaction time (lower row). Horizontal lines represent group mean (±SEM) and data points represent values for individual subjects. **p < 0.01, ***p < 0.0001.

To further interrogate the group differences in response bias, Fig. 3 presents the development of log b values (top panels) and accuracy by rich/lean trial type (bottom panels) in both cohorts combined across the 5-session PRT testing condition. As the top panels of Fig. 3 show, log b values in the control group were approximately 0.3 in the first test session and increased across subsequent test sessions, reaching an asymptote of approximately 0.6, whereas log b values in the ELA group remained stagnant at approximately 0.2–0.3 throughout the 5-session testing condition. As the baseline (BL) values plotted in the lower panels of Fig. 3 show, discrimination accuracy was nearly identical between groups immediately prior to exposure to the 3:1 rich:lean probabilistic contingencies. Following exposure to asymmetric probabilistic contingencies, accuracy for the rich trial type was higher than lean accuracy in both groups. However, the disparity between accuracy for rich and lean trial types was larger in the control group. That is, across the 5-session testing condition, the control group was characterized by consistently higher rich accuracies and lower lean accuracies, relative to the ELA group. Indeed, these early-life experiences produced significant alterations in the development of the expected response bias. For brevity, only effects involving “experience” are reported here. There was no interaction of session × stimulus × experience (F[4,120] = 1.76, p = 0.14); however, there was a significant stimulus × experience interaction (F[1,30] = 19.25, p = 0.0001). Moreover, the disparity between accuracy for rich (94.46 ± 1.14) and lean (58.78 ± 2.56) trial types was larger in the control group compared to that of rich (88.00 ± 1.22) and lean (71.07 ± 2.62) in the ELA group. These PRT accuracy outcomes illustrate how log b values in the control group were significantly higher than the ELA group, despite the fact that log d values were similar between groups (cf. Fig. 2). Importantly, as described above, the line length stimulus (long and short) for each subject that was designated as rich and lean was defined by assigning the line length with higher baseline accuracy as lean. Therefore, the response biases observed in both groups during the subsequent 5-session PRT testing condition were a product of the subject’s responsivity to the asymmetric probabilistic contingencies arranged, rather than simply an inflation of an inherent bias.

Fig. 3: Development of PRT outcomes by group.
figure 3

Group mean (±SEM) log b (upper panels) and accuracy (lower panels) during rich (triangles) and lean (inverted triangles) stimulus trial types during the last two sessions of baseline (BL) discrimination training and across the five sessions of PRT testing in both cohorts of control (left panels; n = 17) and ELA (right panels; n = 15) subjects.

Sucrose preference is blunted in ELA rats

Preference for sucrose solution was measured in adult animals reared in standard control (filled symbols) or ELA conditions (unfilled symbols). As the top panel of Fig. 4 illustrates, ELA rearing conditions significantly blunted sucrose preference across the 6-day condition (F[1,30] = 9.74, p = 0.004). The middle panel of Fig. 4 displays individual subject data in Cohort 1 (circles) and Cohort 2 (diamonds) using the mean preference across the 6-session condition and reveals significantly lower sucrose preference in ELA subjects relative to control (t[30] = 3.24, p = 0.002; d = 1.14). There were no significant differences (t[30] = 0.04, p = 0.97) in mean daily total fluid consumption between control (61.28 ± 0.88) and ELA (61.22 ± 1.28) subjects. The bottom panel of Fig. 4 presents the relationship between sucrose preference and log b in the control subjects (filled symbols) and ELA subjects (unfilled symbols) in Cohort 1 (circles) and Cohort 2 (diamonds). In the control subjects, there was a modest negative correlation between the two measures; however, it did not meet statistical criteria (r = –0.31, R2 = 0.10, p = 0.23). In the ELA subjects, there was no correlation between sucrose preference and log b (r = –0.12, R2 = 0.02, p = 0.66).

Fig. 4: Sucrose preference outcomes by group and cohort.
figure 4

Top panel: effects of control (filled symbols; n = 17) and ELA (unfilled symbols; n = 15) rearing conditions on mean (±SEM) sucrose preference across the 6-day testing condition. Middle panel: condition-wide mean sucrose preference values in control (filled symbols) and ELA (unfilled symbols) subjects in Cohort 1 (circles) and Cohort 2 (diamonds). Horizontal lines represent group mean (±SEM) and data points represent values for individual subjects. Bottom panel: relationship between sucrose preference and log b values in control (filled symbols) and ELA subjects (unfilled symbols) in Cohort 1 (circles) and Cohort 2 (diamonds).

Probing the salient aspects of ELA that result in reward sensitivity deficits

ELA in humans and rodents is multifaceted. It is characterized by stress and elevation of stress hormones, but it is unclear whether the activation of the hypothalamic-pituitary adrenal stress axis is directly responsible for enduring behavioral deficits. More recently, evidence has emerged suggesting the maturation of the reward circuit after ELA may be disrupted by aberrant, unpredictable environmental signals during ELA [3, 8, 62]. Indeed, the need for appropriate signals for circuit development has been shown for both visual and auditory systems [63, 64]. To test whether unpredictable, chaotic signals from the dam to the pups might contribute to the reward–response deficits, we computed an entropy rate, as described previously [8, 57]. This involves examining a matrix of conditional probabilities that a given maternal behavior “A” is followed by a second behavior “B”. In this context, higher entropy values are indicative of increased unpredictability. As shown in the top panel of Fig. 5, across P3–P6, entropy values were consistently higher in the ELA group relative to control, confirmed by a main effect of ELA (F[1,4] = 22.11, p = 0.009) with no effect of time (F[2.52,10.08] = 0.1, p = 0.94) or a time x ELA interaction (F[3,12] = 0.32, p = 0.81). As there was no significant change in entropy over time, the mean entropy was calculated to create a single value for each litter in Cohort 1 and 2 as shown in the bottom panel of Fig. 5 which also revealed significant differences between control and ELA subjects (t[4] = 4.71, p = 0.009).

Fig. 5: Maternal unpredictability by group, cohort, and litter.
figure 5

Top panel: effects of control (filled symbols; n = 17) and ELA (unfilled symbols; n = 15) rearing conditions on mean (±SEM) entropy values across P3–P6. Bottom panel: condition-wide mean entropy values in individual control (filled symbols) and ELA (unfilled symbols) dam/pup interactions in Cohort 1 (circles) and Cohort 2 (diamonds). Horizontal lines represent group mean (±SEM), n = 3 litters per group.

Addressing potential confounders

The studies above employed two cohorts that had modest variations. In addition, it is conceivable that other variables such as body weight might confound the results. Therefore, Table 1 presents subject details by cohort, including the number of pups per dam, date of birth, shipping date from California to Massachusetts, and mean (±SEM) body weight (g) P10, P21, P62, and P78, following clearance from quarantine, and during the 5-session PRT testing condition. Body weights, between Cohort 1 and 2 and between control and ELA subjects, were highly similar throughout developmental stages from rat infancy to adulthood. Most critically, average body weight during the food-restricted 5-session PRT testing conditions was also similar (t[30] = 1.17, p = 0.25; d = 0.41) between control (363.56 [±4.29] g) and ELA (372.38 [±6.44] g) subjects. There were, however, several notable differences between cohorts. First, subjects in Cohort 1 originated from 1 control and 1 ELA dam that each contributed 6 pups to the study, whereas subjects in Cohort 2 originated from 2 control dams contributing 6 and 5 pups and 2 ELA dams contributing 5 and 4 pups. Second, Cohort 1 was 17 days older than Cohort 2 when they were shipped from California to Massachusetts. Third, despite being younger at ship date, because of delays related to COVID-19 in sentinel testing, Cohort 2 spent approximately 30 more days in quarantine than Cohort 1 that made them slightly older during PRT training and testing. Despite these differences, all subjects learned to touch the screen and discriminate long vs. short lines. The control and ELA groups met discrimination criteria of ≥80% correct during both long and short line length trial types for two consecutive sessions following, respectively, 12.24 (±0.76) and 14.80 (±1.65) training sessions, a group difference that did not meet statistical criteria (t[30] = 1.47, p = 0.15; d = 0.51).

Discussion

The present studies identify the PRT as a powerful translational tool that can advance understanding of mechanisms underlying reward responsiveness across species. Using this assay, we identified enduring effects of ELA on reward responsiveness during adulthood. Specifically, blunted development of a response bias in the PRT was observed, analogous to previous studies in human participants with anhedonia. Anhedonia-related phenotypes were also observed in these rats when tested in the traditional sucrose preference test. The level of unpredictability in maternal care was significantly higher in subjects that would later display blunted reward responsiveness in both the PRT and the sucrose preference test during adulthood, highlighting predictability early in life as a salient signal for reward system development.

Maternal patterns and signals are an important factor in the regulation of brain circuits [63, 64]. Recent work in humans has verified that unpredictable maternally derived signals early in development are correlated with worse cognitive outcomes later in life [6, 65]. In rodents, chronic unpredictable and fragmented maternal care causes disruption to the sensory signals received by the pups (without causing physical stress [66] and minimal effects on other aspects of maternal care [8, 53, 67,68,69], resulting in enduring cognitive and emotional deficits [8, 55, 68]). Evidence from the literature also suggests that predictable maternal care is required for reward responsiveness. Previous studies using maternal separation as an early-life stressor have demonstrated anhedonia-related phenotypes in adulthood are observed only after a secondary stressor is experienced [70, 71]. However, unpredictability in maternal care has been shown to directly induce anhedonia-related behavior in adulthood as measured by sucrose preference and social play [3, 8]. Exposure to a secondary stressor can accelerate the effects of unpredictable maternal care on learning and memory [72]. Therefore, it is conceivable that rats experiencing ELA are more vulnerable to stress, such as those caused by social manipulations during testing and in transit. The present studies add to this research domain by identifying the direct correlations between the magnitude of maternal entropy and reward responsiveness later in life.

Interestingly, although group differences were observed in both assays, their respective primary metrics (i.e., preference for the sucrose bottle and log b) did not correlate with each other among individual subjects. Reasons for this are currently unclear; while this study is only powered to identify large effect sizes, anhedonia is a construct with high heterogeneity and neither sucrose preference nor the PRT captures its full spectrum. In fact, they likely evaluate different aspects of reward responsiveness (reward consumption vs. reward learning). Therefore, the lack of correlation between assay outcomes is perhaps not surprising but, rather, consistent with an RDoC-like framework that requires a diversity of tasks to investigate multidimensional psychological systems [35, 36]. These novel findings, which utilize a reverse-translated assay used with clinical patients to quantify deficits in reward responsiveness, are also highly consistent with long-standing literature on childhood traumas associated with subsequent vulnerability to anhedonia that can persist throughout life [18,19,20,21,22]. They are similarly concordant with rodent studies describing altered reward system function and development following ELA [3, 8, 62].

There are a few caveats that warrant consideration. First, although significant group differences were observed across two cohorts of subjects, both sucrose preference and PRT outcomes were each examined at only one timepoint—early and middle adulthood, respectively. Although it would be interesting to detail the time course of how quickly this blunting in reward responsiveness emerges and whether it persists until death, it is nevertheless remarkable that its effects are sustained well into rat adulthood with a magnitude that is clearly distinguishable from control subjects. Moreover, the systematic replication of these findings across 2 years, two cohorts, 32 rats, and 6 litters under somewhat varying conditions supports the robust and reproducible nature of these ELA effects. Second, it is important to note that these studies were conducted on male subjects only. Given the fact that diagnoses of mood disorders in which anhedonia is prominent are more prevalent in women [73] and the known sex-dependent effects of ELA on reward [7], it will be important to determine the extent to which these effects are also observed in female rats.

In conclusion, the PRT is a valuable translational assay for elucidating the mechanisms by which ELA impacts the reward circuit in both humans and laboratory animals. The present data confirm a relationship between maternal entropy during early-life and reward responsiveness in adulthood. Moreover, these findings corroborate previous work implicating maternal signals during early-life as modulators of reward circuit signaling specifically and cognitive development generally.