Introduction

Different individuals of the same species do not behave the same as has already been reported for many species (see1,2,3,4). Particularly songbirds and rats have been studied the most in this regard5. Even though inter-individual behavioural differences are more pronounced in wild and wild-derived animals1,6,7, some differences may also be found between laboratory animals1. In laboratory animals, however, the focus is on the differences in personality traits between various strains8. These are crucial for designing pharmacological experiments, where it is customary to select a single strain based on its behaviour in various experimental procedures. Different individuals of the same strain, on the other hand, are considered rather identical and in some research areas, it is customary to conduct experiments on fewer than ten individuals (e.g.,9,10,11), because very limited variability of behaviour is expected. Nevertheless, in this paper we would like to point out that there are also inter-individual differences within a single strain which are not negligible. The interpretation of results obtained from studies with a low number of individuals should therefore be more cautious.

Personality

Inter-individual behavioural differences, consistent in time and throughout contexts, are of special research interest. Although terms referring to this variability may differ depending on the field of the study, “personality” is the term most widely used in the behavioural sciences, with synonyms such as temperament, behavioural profile, or others4,12. In human and primate studies, personality is described by a 5-axis paradigm (with each axis representing a continuum of behaviours) derived from The Big Five and NEO personality questionnaires13,14,15, while fewer axes are used in most of the animal taxa, see for example2,3,4. Individual scores on these axes should comprehensively describe animal behaviour in any experimental or real-life situation. Nevertheless, most studies do not test the full spectrum of animal behaviour, but rather focus on traits corresponding to only one or two axes (see a meta-analysis6 or a review2; see also Table S1 in Supplementary materials for experiments commonly used for extraction of personality axes). Correlation of any two axes is referred to as a behavioural syndrome, which is manifested when the behaviour of an animal in one context is dependent on its behaviour in another situation, possibly reflecting limited behavioural plasticity16,17.

Behavioural consistency

Although it is assumed that inter-individual differences in behaviour are relatively stable in time, it does not mean the behaviour of an individual animal cannot change6,18. Nevertheless, relative differences among individuals should remain consistently stable throughout different contexts19,20,21.

When testing for behavioural consistency, two types of repeatability estimates can be distinguished. The first one, agreement repeatability (RA), expresses the reproducibility of the absolute scores the of behaviour of the animals tested, therefore it does not account for changes of behaviour in time6,22,23. In general, repeatability estimates vary from 0 (no repeatability) to 1 (maximum repeatability), with the average RA estimate of personality traits of 0.371. If there is a shift in behaviour between experimental trials (for example due to habituation), the agreement repeatability estimate would underestimate the “real” repeatability of behaviour18. In contrast, the second type of repeatability estimate, consistency repeatability (RC), takes changes in time (e.g., habituation) into account and therefore enables researchers to assess the relative scores of individual animals (e.g., by comparing the ranks of individual animals in several tasks) concerning a behavioural trait6,18.

Methods used for personality testing

Personality is defined as stable inter-individual differences in behaviour. There are, therefore, a myriad of possibilities to test for such differences and diverse methods have become commonly used when testing various species (for details on birds, fish, rodents, or wild animals see5). In rodents, three axes are studied most often—activity, exploration, and boldness6. Activity may be measured in any experimental setup and it is quite easily assessed by simply measuring the path the animal travels (for references see Table S1). The exploration—avoidance axis represents the reaction to a new (not necessarily risky) situation4 and in rodents, it is usually assessed in experiments such as Open field test (empty arena), Hole board test (arena with holes in the ground), or in more complex novel environment designs (for references see6 or Table S1). The third most commonly tested axis is boldness, which corresponds to the reaction of the individual to a risky situation4. In rodents, this trait is usually assessed by the Elevated plus maze (elevated X-shaped apparatus with two open and two closed arms), Novel object test, or Startle test (for references see Table S1). The boundaries between these three dimensions (or axes) are, however, rather blurred as can be seen by the disagreement on the interpretations of the behavioural traits recorded in these procedures (e.g.,24,25,26,27,28). It is, for example, quite intriguing to decide whether behaviour in a brightly lit Open field test represents exploration or boldness and whether, on the other hand, behaviour in a dimly lit Elevated plus maze might represent exploration, rather than boldness. Moreover, in studies that do not consider personality, these procedures are used to measure the anxiety of the animals (e.g.,29,30). But how does anxiety translate into personality dimensions? Is it analogous to boldness or to the exploration-avoidance axis? In addition, activity is an inherent and inseparable feature of most of the behaviours recorded in these procedures, and therefore it is rather interesting to differentiate between the various personality axes. Furthermore, subsequent exposures to the same experiment always present a challenge in the interpretation of the behaviour (e.g.,31,32). Especially when considering personality and deciding whether the situation is new and potentially risky to the animal or whether it remembers the apparatus and therefore, the experiment is not a dangerous new threat any more.

In this study we used three standard experimental procedures—Open field test, Elevated plus maze test, and a modified version of the T-maze test. The Open field test is an empty apparatus with walls which prevent the animal from escaping it. At first, this apparatus was used to test the emotional reactivity of animals, nowadays it is mostly used to assess activity, anxiety, or exploration (reviewed in29, for more details see the Methods section). The Elevated plus maze test is an elevated X-shaped apparatus with two closed and two open arms (the same arms are located opposite each other, for more details see the Methods section). This experiment is used to test the exploration-avoidance axis and risk assessment (reviewed in30). The T-maze (or its alternative, the Y-maze) is used to study spontaneous alternation33, anxiety or exploration-avoidance axis34, and working memory35 (for more details see the Methods section). Even though these procedures test equivalent behavioural traits, studies adopting a multivariate analytical approach often show that the behaviours measured in these procedures create separate axes and therefore should not be interpreted as corresponding, e.g.,34,36,37.

The aim of this study was to examine inter-individual differences in behaviour in the Long Evans strain of laboratory rats. To this end, we utilized a battery of standard experimental procedures consisting of the Open field test, Elevated plus maze test38,39,40, and a modified version of the T-maze test34. These procedures are usually interpreted as tests of activity (Open field test), exploration (Open field test, Elevated plus maze test, T-maze test), or boldness (Elevated plus maze test). Since exploration and boldness are the most widely studied personality axes in rodents6, we decided to focus on these procedures and observe the behaviour laboratory rats exhibit in these tests. We were especially interested in whether the behaviours which are usually recorded in these experiments will be intercorrelated as it is often expected that these procedures measure the same behavioural traits. Moreover, there seems to be a disagreement as to how to interpret the behaviours we actually see during the procedure (this is especially true for the Elevated plus maze test, where various researchers interpret the same behavioural trait quite differently—see Discussion) and how the repeated exposure to the same procedures affects the behaviour of the animals. In addition, these experiments are widely used in behavioural pharmacology (e.g.,9,41,42), which means that this study might hold important implications not only for the study of the personality of laboratory animals but also when interpreting the results of various pharmacological studies.

The specific aims of the study were to (i) check if the behaviours exhibited in the experiments meet a basic criterion of personality—repeatability and (ii) examine correlations between the behaviours exhibited in these tests—a behavioural syndrome.

Methods

Animals

For the purpose of this study, we obtained 36 male laboratory rats of the Long Evans strain from the breeding facilities within the Czech Academy of Sciences. Experimental animals were divided into two groups, each group consisted of 18 animals from three litters (36 animals from 6 litters in total). The litters were manipulated (female pups and surplus male pups were removed) and we obtained the animals when they were two days old. The adult animals were housed in a box (25 × 25 × 50 cm) in groups consisting of two or three individuals. Throughout the experiment, the rats were kept in standard laboratory conditions—stable temperature (21 °C) and 12-h light–dark-cycle. All animals had ad libitum access to food pellets (Altromin diet 1314) and water.

General procedure

During ontogeny, the animals were weighed and tested for motor skills (data are not shown in this study). The experimental testing battery began when the rats reached three months of age. Personality tests featured multiple sessions in order to assess repeatability of behaviour—the Open field test and Elevated plus maze test were repeated three times (in both procedures, the inter-test intervals were five days between the first and second experiment and fourteen days between the second and the third experiment), the T-maze test consisted of four consecutive trials administered on the same day (see Fig. 1). The rats were also tested for aggression in a social interaction test. Analysis of this experiment, however, did not reveal any aggressive behaviour, the results of these experiments are therefore not given here. As a part of a subsequent study, the animals were also tested in cognitive experiments, the results of which will be reported elsewhere. The sequence of the experiments was planned with the main goal of recording and inspecting repeatability estimates from the experiments.

Figure 1
figure 1

A time axis illustrating the experimental design of this study. Each point on the axis represents one day, bold points represent testing days, dotted points represent days with no experiments. OF—Open field test, EPM—Elevated plus maze test, T-maze—modified T-maze test. Cognitive tasks are shown in grey: AAPA—Active Allothetic Place Avoidance task43, MWM—Morris Water Maze task44.

The apparatuses were thoroughly cleaned with Sterilium to eliminate odour cues before the start of each experiment. All experiments were recorded and behavioural parameters scored were selected on the basis of previous studies, e.g.,7,34,36,45,46, for the list of parameters recorded and analysed in each procedure see the description of the experiments below. Parameters with minimum frequencies (e.g. freezing) were omitted. All experiments except for the T-maze test were performed without the presence of the experimenter in the room.

All experiments were conducted at the Institute of Physiology of the Czech Academy of Sciences (the department of Neurophysiology of Memory). All animal manipulations were approved by the Ministry of Agriculture committee and done according to the approved project of experiments no. 136/2013. The procedures complied with the Animal Protection Code of Czechia and the appropriate directive of the European Union (2010/63/EC).

Open field test (OF)

This experiment was invented by Calvin Hall with the aim of studying emotional reactivity in rodents. Individuals that did not move around the arena so much, but defecated a lot, were labeled emotional46. Even though the Open field test was originally created for rodents29, it benefits from its simplicity and nowadays various species are tested in this paradigm and the Open field test is used for assessing emotionality, activity, and anxiety8,29,39.

In our study the apparatus was a brightly illuminated (standard indoor working space illumination—more than 250 lx) white square box (80 × 80 cm) with high walls (80 cm). At the beginning of the experiment, we put the animal into the middle of the apparatus. The open field test was conducted three times (see Fig. 1—experimental design), each trial lasted 10 min.

For the purpose of the statistical analysis, we divided the surface of the apparatus into three zones (see Fig. 2). The centre zone covered 50% of the base, the rest of the arena was divided into a wall and corner zone (all four corners were put together to create one zone called “corner zone”, the same was done for the walls).

Figure 2
figure 2

Zones in the Open field test. The centre zone covered 50% of the base of the arena, the rest of the arena was divided into a wall zone and a corner zone.

Parameters recorded in this experiment were: distance travelled, number of rears, time spent grooming, time spent sitting, and defecation (number of events). Number of rears, grooming and sitting were also analysed for each zone separately as well as number of visits to each zone and time spent there. Video recordings were analysed in EthoVision software platform (Noldus, 10.1 version).

Elevated plus maze test (EPM)

The Elevated plus maze apparatus is a non-transparent cross-shaped maze, with two open arms (without walls) and two closed arms (with walls), the same arms are situated opposite to each other. This apparatus is used in various areas of research, usually for assessment of exploratory and avoidance behaviours or to study anxiety8,30,39. The arms of the maze we used were 50 cm long and had 1 cm high rims on the edges (EPM apparatus used for pharmacological experiments). At the beginning of each trial, the individual animal was placed on an open arm, facing the centre of the maze. The Elevated plus maze was repeated three times (see Fig. 1—scheme of experiments), each trial lasted 10 min.

Parameters recorded in this test: number of entries in the three areas of the maze (open and closed arms, centre zone) and time spent in the three areas of the maze (open and closed arms, centre zone). The recordings were analysed in the Track Analysis program (Biosignal Group, US).

T-maze test

Variations of this test are used in behavioural testing, cognitive studies and pharmacology, see47,48,49. We used the T-maze to study the animal’s tendency to explore a novel environment. When put into the Y- or T-shaped maze, the animal enters one of the closed arms (to explore or hide in). In the next trial, the animal usually enters the other (unexplored) arm. This behaviour is called spontaneous alternation33.

In our variant of this experiment, the rat was placed at the end of an open arm of a T-maze (adapted from the Elevated plus maze apparatus by blocking one open arm) with its head facing the middle of the maze. Its behaviour was recorded until it chose one of the closed arms and went inside, the rat was then quickly retracted from the maze and put into the open arm again (experimental procedure and analysis was adopted similarly to34). This procedure was repeated in four subsequent trials during one day.

Parameters recorded in this experiment were: laterality (whether the individual chose the right or the left arm), latency to enter one of the closed arms.

Analysis and statistics

All parameters were checked for normal distribution and transformed when necessary—count, duration, and latency measures were square root, square root arcsin, and natural logarithm transformed, respectively.

In order to check whether inter-individual differences in behaviour are not solely attributable to differences between litters or the two testing groups (in this study we were not interested in the causes that lead to emergence of inter-individual differences in behaviour, but rather the differences themselves), we performed Linear models with hierarchically organised random effects (lme4 package50), using group, litter, and individual identity as random factors. Since most of the variability in behaviour could not be easily attributed to either individual identity, litter, or group (see Table S4 in Supplementary materials), we pooled all animals together for the purpose of this study. In the Results section, we therefore present only the results of pooled analyses.

Firstly, we computed repeatability estimates (both agreement and consistency) for all parameters we used R 3.6.1 (R Core Team 202051) and RStudio (RStudio Team 202052) using the rptR package53 (lmm method of repeatability estimation, setting the Gaussian data type and ID as a single random effect).

Secondly, we analysed the inner structure of personality tests (Open field test, Elevated plus maze test) using the Principal Component Analysis in Statistica 9.1 (StatSoft Inc.,201054). For each experiment, we also computed a Parallel analysis in SPSS 20 (IBM Analytics55) to assess how many axes or parameters should be used for further analyses in order not to increase the probability of the type I error.

Parallel analysis revealed that only one axis should be used for further analyses. Exploratory analyses revealed original parameters, which expressed the behaviour of the animals extremely well (loading on the first axis of the Principal component analysis was at least 0.79, but mostly over 0.9). Therefore we decided to use the original behavioural variables in subsequent analyses instead of using axes extracted from multivariate analyses. If we were to use the axes derived from the multivariate analyses, the subsequent results would be slightly more accurate but difficult to interpret because we can easily imagine the original behaviour and deduce its meaning. It is, however, increasingly more difficult to do so with synthetic axes representing behaviour. We, therefore, use the primary analyses only to choose the parameters which are then entering the overall multivariate analysis.

After having analysed behaviour in each experiment, we performed an overall analysis of the behaviours measured in the experiments. An elegant way to analyse such data would be a MCMCglmm method56,57,58,59,60, however, our sample size did not allow us to use this approach58. Instead we performed a Factor Analysis of all personality tests put together so we could investigate relationships between different personality experiments. For this analysis, we used only one parameter from each personality test. The parameters used in the overall analysis were selected according to these three rules: (1) The variable should be repeatable. (2) The variable should represent other parameters from the test—it should comprise as much variability from the first axis of PCA as possible. (3) The variable should be either easily measurable or a standard parameter in the test.

Results and discussion

Repeatability of behaviour in laboratory rats

We detected several parameters with significant repeatability. For the Open field test, the parameters with the highest repeatability were: distance travelled, number of supported rears in the “corner zone” and time spent in active movement (see Fig. 3a and 3b). For the Elevated plus maze test, both time spent in closed and open arms were repeatable, although their repeatability estimates were higher, when changes in time were considered (consistency repeatability). Latency to enter a closed arm in the T-maze test was also repeatable (for all significant repeatability estimates see Table 1).

Figures 3
figure 3

(a,b) Distance travelled in the Open field test and the number of rears (both supported at the walls and unsupported) in the Open field tests, shown for each replication of the task. Distance travelled did not change across replications, whereas number of rears decreased, probably reflecting habituation to the apparatus.

Table 1 Parameters from all personality tests with significant repeatability. Variables with higher repeatability scores than 0.3 are in bold. RA = agreement repeatability, RC = consistency repeatability.

Correlations across tests

From each test, we selected one parameter for further multivariate analyses: distance travelled for the Open field test, time spent in the open arms of the maze for Elevated plus maze test, and latency to enter one of the closed arms for the T-maze test. Distance travelled in the Open field test is usually interpreted as activity, time spent or the number of visits into open arms is usually interpreted as boldness (see Table S1 in Supplementary and its references).

The first factor of a rotated (Varimax normalized) Factor Analysis (F1) explained 19.9% of variability and best correlated with time spent in the open arms in the first (− 0.98) and second (− 0.77) Elevated plus maze test. The second factor (F2) explained 15.5% of variability in the data and correlated with distance travelled in the second Open field test (− 0.93, see Table 2). The T-maze test did not significantly contribute to characterizing and explaining inter-individual differences in our study.

Table 2 Factor loadings of variables from personality tests (Open field, Elevated plus maze, and T-maze test). Variables with factor loading 0.7 or higher are in bold.

Discussion

Consistency of behaviour

The first aim of this study was to look for parameters reflecting personality in the Long Evans strain of laboratory rats. As we have shown in Table 1, we detected several parameters with significant repeatability. However, the consistency of the rats’ behaviour was not very high—the highest repeatability estimate was 0.34 for agreement repeatability (distance travelled in the Open field test) and 0.36 for consistency repeatability (time spent in the open arms of the Elevated plus maze test). Most of the estimates we found were lower than 0.30. According to a meta-analysis concerning the general repeatability of animal behaviour, mean repeatability estimates should be around 0.371.

Despite thorough interbreeding and selection programmes that give rise to laboratory strains (in order to create animals as uniform as possible), studies demonstrate that laboratory animals show behavioural variability (for details on laboratory strains of mice and rats see8), it is, however, less pronounced than in wild or wild-derived animals1,6,7. The lower repeatability estimates in our data might, therefore, be a explained by the fact that we worked with laboratory animals and we did not manipulate environmental effects (the manipulation was identical for all tested animals) during this study61,62,63,64. Nevertheless, even though we used a rather uniform set of individuals (belonging to the same laboratory strain) the inter-individual differences in behaviour proved to be significant and not easily negligible.

Moreover, low agreement repeatability and absence of behavioural syndrome could signal habituation to the apparatus when animals are tested repeatedly65,66,67. However, when we calculated consistency repeatability (corrected for systematic changes in behaviour over time), the repeatability estimates did not improve much for most parameters derived from the Open field test. On the other hand, for boldness measured in the Elevated plus maze test, estimates of consistency repeatability were markedly higher than agreement repeatability estimates (see Table 1). Therefore, we can say that we detected a consistent change in boldness across trials in the Elevated plus maze test, probably due to ongoing habituation to the apparatus.

Behavioural syndrome

Our second aim was to inspect a behavioural syndrome in the Long Evans strain of laboratory rats. However, we found no link between behaviour in the tests we adopted and therefore we did not confirm the existence of a behavioural syndrome using these experimental procedures. Several previous studies have, however, also found significant repeatability of behaviour (personality) without a behavioural syndrome, e.g.,20,68.

Even though the Open field test and Elevated plus maze test held the highest predictive value for assessing differences in behaviour (Table 2), these two experimental procedures represented separate orthogonal axes of behaviour in our data. The fact that behaviour in the Open field test is not related to behaviour in the Elevated plus maze test, however, is rather intriguing as these two experiments are sometimes used interchangeably when researchers study exploration or anxiety (see references under the Introduction or Methods sections, or Table S1 in Supplementary materials and its references). In contrast, our results show that these experiments should not be regarded as synonymous, because they seem to test different aspects of behaviour.

The finding that behaviour in the Open field test is not correlated to behaviour in the Elevated plus maze test suggests that interpretation of behavioural parameters from these two experiments should be considered carefully. This finding should be especially interesting for pharmacological studies as they use these procedures quite often and generally consider them interchangeable. In contrast, our results indicate that the behaviour recorded in these two procedures is not equivalent and should be interpreted more cautiously.

Even though behaviour in the T-maze (adapted from EPM test in our case) did not significantly contribute to the explanation of inter-individual behavioural differences of our experimental animals (see Table 1), the result of this experiment is still worth mentioning in light of another study on laboratory rats, where it explained more than 14% of variability in the data34. Latency to enter one of the closed arms is also a basis for selection in a selectively bred strain of prematurely aging mice. In this strain of mice, a lower curiosity level (higher latency to enter a closed arm) is positively associated with rapid aging8. Results from our experimental setting, however, do not support the notion that motivation to explore (or curiosity) represents a personality trait.

Systematic changes in behaviour, interpreting the behaviour in the experiments

Another interesting result is that behaviour in these tests changes over time. Subsequent trials of these experiments (especially Open field test and Elevated plus maze test) correlated rather poorly. This finding is in accordance with the results of previous studies25,31,32, which also show systematic changes in behaviour in these experiments. These result, however, come from rather older studies and they are often neglected when planning, conducting and interpreting experiments with laboratory rats. Nevertheless, our understanding is that these insights are crucial as overlooking them may cause misinterpretation of the behaviour of the animals.

In the Open field test, a clear pattern has been shown for many species (especially rodents). During the trial, distance travelled in the arena decreases31. At first, the animal runs around trying to find a way to escape from the apparatus; towards the end of the trial, the individual ceases to explore the environment and rather engages in grooming, sitting, or rearing31,69. This change in behaviour is detectable even across trials69. A detailed analysis of repeated exposure to the Open field test, therefore, suggests that the first exposure encompasses both activity and exploration, whereas succeeding exposures reflect activity and habituation to the apparatus6,25,31,63,70.

In our data, however, distance travelled did not decrease across trials. Only the number of rears decreased. Therefore, in the second and third trial of the Open field test, the rats moved through the arena, but probably did not try to find a way to escape it. Distance travelled in the Open field test had high loading on the first factor in all three trials of the test (see Table S2 in Supplementary materials) and probably represents activity as has been shown in many other studies (mice26,41; rats27,34). The slight discrepancy between our results and the previous studies may have been caused by the fact that the animals we tested were not naïve and were accustomed to handling and testing in several other procedures.

Similarly, the first exposure to the Elevated plus maze differs from the succeeding trials30,32. Contrary to previous studies (reviewed in30), we observed that time spent in the open arms increased across trials, which can be interpreted as the increasing boldness of the tested animals. We also observed systematic changes in explorative behaviours. The distance travelled in the maze and number of entries into the arms (both open and closed) decreased across trials. Therefore, rats showed habituation to the apparatus in this test. The increasing boldness could also be attributed to the fact that our animals were not naïve.

When analysing behaviour in the Elevated plus maze with multivariate methods, the first factor is usually represented by the number of entries into the open arms of the maze or time spent there. This is almost universal among experimental studies and it was also the case for our data (see Table S3 in Supplementary materials). However, the interpretation of this factor and therefore interpretation of the experiment as a whole still seems to be an issue. Various researchers use different interpretations of behaviour recorded in the Elevated plus maze test. Some authors interpret it as “approach/avoidance towards aversive stimuli”28,71, others label it as “anxiety”27,72, “emotional reactivity”26 or “impulsivity”73.

Conclusions

Even though strains of laboratory rats are bred to be homogenous, there are inter-individual differences in behaviour between the animals, which are apparent when the idnividuals are tested in standard behavioural experiments. In this study we confirmed the existence of consistent inter-individual differences (personality) in behaviour in the Long Evans strain of laboratory rats. In several tasks, we found parameters with significant repeatability (ranging from 0.18 to 0.36). This variability is, therefore, rather limited (repeatability estimates were quite low), but still it should not be overlooked when designing and interpreting experiments.

Moreover, repeated exposure to the Elevated plus maze test revealed a consistent systematic change of behaviour among all animals—increasing boldness. Nevertheless, we were not able to confirm a behavioural syndrome among the behaviours measured in the Open field test, Elevated plus maze test, and the T-maze test. However, the absence of a behavioural syndrome in our results is in accordance with several previous studies and suggests that these experimental procedures measure different dimensions of behaviour and are suitable for assessing behaviour corresponding to several personality axes. The results from these procedures should, therefore, be interpreted with caution and these experimental designs should not be used interchangeably.