A simulated comparison of behavioural observation sampling methods

Brereton, James Edward; Tuke, Jonathan; Fernandez, Eduardo J.

doi:10.1038/s41598-022-07169-5

Article
Open access
Published: 23 February 2022

James Edward Brereton¹,
Jonathan Tuke² &
Eduardo J. Fernandez³

Scientific Reports volume 12, Article number: 3096 (2022)

Abstract

Behavioural research requires the use of sampling methods to document the occurrence of responses observed. Sampling/recording methods include ad libitum, continuous, pinpoint (instantaneous), and one-zero (interval) sampling. Researchers have questioned the utility of each sampling method under different contexts. Our study compared computerized simulations of both pinpoint and one-zero sampling to continuous recordings. Two separate computer simulations were generated, one for response frequency and one for response duration, with three different response frequencies (high, medium, or low) and response durations (short, medium, and long) in each simulation, respectively. Similarly, three different observation intervals (5, 50, and 500 s) were used to record responses as both pinpoint and one-zero sampling methods in the simulations. Under both simulations, pinpoint sampling outperformed one-zero sampling, with pinpoint sampling producing less statistical bias in error rates under all frequencies, durations, and observation intervals. As observation intervals increased, both mean error rates and variability in error rates increased for one-zero sampling, while only variability in error rate increased for pinpoint sampling. The results suggest that pinpoint sampling techniques are effective for measuring both frequency (event) and duration (state) behaviours, and that pinpoint sampling is a less statistically biased behavioural observation method than one-zero sampling.

Introduction

Behavioural studies are a valuable tool for the scientific study of both human and non-human animal performance. In animal research, behavioural observations are used more often than other welfare indicators such as glucocorticoid analysis^1,2. Behavioural research may also be used to investigate the prevalence of positive behaviours, like foraging, or negative behaviours, such as stereotypies^3,4,5. Studies of behaviour are also frequently conducted on wild animal populations to better understand natural histories or investigate the impact of human disturbance^2,6. Research on animal behaviour is now so well recognised that there are numerous journals dedicated to its study, for instance Animal Behaviour, Applied Animal Behaviour Science, and Ethology.

The methods used in behavioural research can be traced back to laboratory studies. Scientists during the mid-twentieth century often used a mixture of both human and animal models to answer questions in the field of behavioural psychology^7,8. Based on the range of different techniques that were generated by earlier studies, Altmann⁹ summarised the methods available. This paper became fundamentally important to those interested in behavioural research and remains a keystone paper for researchers. Other authors, such as Bateson and Martin¹⁰, further refined the behavioural methods and their definitions. Bateson and Martin¹⁰ distinguish between sampling and recording rules, which detail differences between the number of subjects observed (i.e. sampling rules; focal [1 subject] vs. scan [> 1 subject]), or the observation method used to sample behaviour (i.e. recording rules; see below). However, little distinction is made between sampling and recording rules outside of this text, and for the purposes of our paper, we will refer to recording rules as sampling methods or techniques.

Types of sampling (recording) methods

Since Altmann’s⁹ review, some behavioural sampling techniques have become increasingly popular in the research literature, whereas others are rarely used. Several behaviour measurement techniques have received criticism in terms of their repeatability¹¹. For example, ad libitum (qualitative) sampling may be useful for developing ethograms and for pilot studies but has methodological flaws with regard to its lack of standardisation^12,13. However, ad libitum sampling is still used in studies of behaviour, with a review by Mann¹⁴ identifying that between 53 and 59% of cetacean studies published in Marine Mammal Science used this sampling technique.

Continuous recording is considered the gold standard for behavioural sampling, as this method records all occurrences of behaviour and their durations^6,15. In the past, this often made continuous recording challenging for researchers: for instance, an active animal that rapidly changed behaviour would have been difficult to observe and record¹⁶. Similarly, measurement of multiple animals using a continuous method would have been incredibly challenging to document accurately, hence the method is often considered synonymous with the focal sampling of one individual^9,12. Use of modern technology has in part ameliorated some of these issues by allowing behaviour to be recorded and analysed later¹⁷. However, continuous recording may remain a challenge, especially where large amounts of data are being recorded or direct comparisons of response frequencies and durations are made, and as such there is a need for alternative methods. As a result, several sampling (recording) methods have been developed that allow multiple animals and behaviours to be measured at one time (scan sampling), as well as in a non-continuous fashion.

Pinpoint sampling, also referred to as instantaneous or momentary time sampling, is a commonly used method for observational study^6,18,19,20. With pinpoint sampling, one or more responses are recorded at preselected moments in time (e.g., every 15 s for an hour). The benefits of pinpoint sampling are that it is less intensive than continuous sampling, and therefore may be more feasible for researchers to conduct^12,21,22. The methods are also more versatile, allowing researchers to decide how long the intervals between samples should be. For example, some researchers might choose to use 15-s intervals, particularly when studying an active animal or when conducting observations of key times, such as when enrichment is provided^23,24. On the other hand, observers might choose to use much longer intervals, such as one-, two- or five-minute intervals when their subjects are inactive or if they are observing for long time periods^25,26. Shorter intervals tend to result in values that match more closely the continuous behaviour scores but require more recording effort²⁷.

One-zero or interval sampling involves choosing specific intervals of time, like pinpoint sampling, but instead recording whether one or more responses occur (or conversely, do not occur) within that interval of time^6,28,29. While popular with both human and non-human primate research, one-zero sampling seems to receive less representation than pinpoint sampling in most animal behaviour studies and has been criticised by previous researchers^9,30. However, one-zero sampling has some of the same benefits of instantaneous sampling in that interval length can be tailored in line with the requirements of the study. Additionally, one-zero sampling has the potential to collect more behaviours during a predefined period, as multiple behaviours can be recorded during each interval⁹. Leger³¹ identified good agreement with continuous behaviour measures when using one-zero sampling at 15-s intervals for chimpanzees (Pan troglodytes). Likewise, Rhine and Flanigon³² found similar levels of occurrence when comparing continuous, pinpoint, and one-zero sampling methods with a colony group of stumptail macaques (Macaca arctoides). As noted above, one-zero (interval) sampling is also frequently used in studies on human behaviour, for example in the classroom^33,34.
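The two scoring rules described above can be illustrated on a toy second-by-second record. The following is a minimal sketch and not the authors' code: the function names, the 60 s trace, and the choice to take the pinpoint sample at the last second of each interval are all assumptions made for illustration.

```python
from typing import List


def pinpoint_sample(trace: List[bool], interval: int) -> List[bool]:
    """Score the behaviour only at the instant that closes each interval (assumed convention)."""
    return [trace[end - 1] for end in range(interval, len(trace) + 1, interval)]


def one_zero_sample(trace: List[bool], interval: int) -> List[bool]:
    """Score an interval as 1 if the behaviour occurs at any point within it."""
    starts = range(0, len(trace) - interval + 1, interval)
    return [any(trace[start:start + interval]) for start in starts]


def percent_scored(scores: List[bool]) -> float:
    """Percentage of scored units (seconds or intervals) in which the behaviour was present."""
    return 100.0 * sum(scores) / len(scores)


# Hypothetical 60 s record: behaviour present during seconds 10-14 and 40-41.
trace = [False] * 60
for second in list(range(10, 15)) + list(range(40, 42)):
    trace[second] = True

continuous = percent_scored(trace)                      # true share of time, about 11.7%
pinpoint = percent_scored(pinpoint_sample(trace, 15))   # 1 of 4 sample points -> 25.0%
one_zero = percent_scored(one_zero_sample(trace, 15))   # 2 of 4 intervals -> 50.0%
print(continuous, pinpoint, one_zero)
```

With only four sample points, a single toy record says nothing about average accuracy; it only shows how differently the two rules score the same observed seconds.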

Both pinpoint and one-zero sampling overcome some of the issues associated with continuous recording by reducing the amount of input required by the researcher, while still aiming to keep the sample representative of the subject’s behavioural repertoire^35,36,37,38. However, one key question is how closely these techniques correlate with continuous recording. A major concern focuses on distinguishing between the frequency vs duration of some response, with behaviours of short duration typically referred to as “events”, while behaviours of long durations are called “states”. Pinpoint sampling loses information in terms of the duration of any response and is potentially less likely to pick up any behaviours of short duration (events)^12,39. By contrast, one-zero sampling is better at recording all observable behaviours, but both behavioural frequency and duration could be easily misrepresented: there is no way to identify whether a behaviour recorded as present for one interval was seen once or thirty times during that time period⁴⁰.

Sampling method simulations

Researchers in various fields have compared differences between pinpoint and one-zero sampling methods. Early simulations lacked the precision and/or ability to run extensive repetitions of their simulations to accurately assess sampling method differences^41,42,43,44. Other researchers have attempted to make similar methodological comparisons via the data collection of actual behavioural occurrences^31,32,45,46,47,48,49. While the results of differences in sampling methods for real occurrences of behaviour varied, most studies found pinpoint sampling to be more accurate than one-zero sampling, at least with respect to duration (state) behaviours. Nonetheless, caution should be used in making determinations of the validity of any result based on specific examples, as exceptions to any rule can and do occur.

Only three recent studies, all conducted by behaviour analysts interested in observations for applied, behaviour change purposes with human populations, have attempted to simulate data sets and compare some aspect of pinpoint and one-zero sampling methods^50,51,52. In two of these studies^50,51, limited simulations were produced via the rolling of dice and pinpoint sampling was compared to a type of one-zero sampling, Partial Interval Recording (PIR), in which the response need only occur at any point during an observation interval to be recorded. In both studies, pinpoint sampling generally outperformed one-zero sampling for the detection of duration responses, with some variation in the ability of PIR to accurately detect frequency responses compared to pinpoint sampling and continuous recordings. Wirth et al.⁵² is the only study to date to use extensive computer-generated simulations to examine differences between pinpoint and one-zero sampling methods. Their study utilised both PIR and Whole Interval Recording (WIR), where the duration response must occur during the entire observation interval to be recorded. Overall, they found that pinpoint sampling outperformed one-zero sampling methods on most measures.
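The difference between the PIR and WIR rules defined above can be made concrete with a short sketch. It is illustrative only and is not taken from any of the cited studies; the function names and the 40 s record are hypothetical.

```python
from typing import List


def partial_interval(trace: List[bool], interval: int) -> List[bool]:
    """PIR: an interval is scored if the response occurs at any point within it."""
    starts = range(0, len(trace) - interval + 1, interval)
    return [any(trace[start:start + interval]) for start in starts]


def whole_interval(trace: List[bool], interval: int) -> List[bool]:
    """WIR: an interval is scored only if the response fills the entire interval."""
    starts = range(0, len(trace) - interval + 1, interval)
    return [all(trace[start:start + interval]) for start in starts]


def percent(scores: List[bool]) -> float:
    return 100.0 * sum(scores) / len(scores)


# Hypothetical 40 s record with one 20 s bout (seconds 5-24), i.e. 50% of the time.
trace = [5 <= second < 25 for second in range(40)]

true_percent = percent(trace)                        # 50.0
pir_percent = percent(partial_interval(trace, 10))   # 3 of 4 intervals -> 75.0
wir_percent = percent(whole_interval(trace, 10))     # 1 of 4 intervals -> 25.0
print(true_percent, pir_percent, wir_percent)
```

In this toy record the bout straddles interval boundaries, so PIR overstates and WIR understates the true 50% share of time.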

The following study proposes to compare computer-simulated occurrences of low/short, medium, and high/long frequency/duration behaviours, as well as similar observation intervals for pinpoint and one-zero sampling methods. Different durations of behaviour were used to provide generalised situations researchers may encounter: some behaviours are normally short (e.g. sneezing), medium (e.g. feeding) or long (e.g. resting) in their duration. We hypothesised that: (1) one-zero sampling would be more accurate (less statistically biased) for detecting the occurrence of low frequency (event) behaviours, particularly when comparing less frequent pinpoint and one-zero observation methods (e.g., 500 s observation intervals), and (2) pinpoint sampling would provide a more accurate representation of percentages of occurrence for low, medium, and high duration (state) behaviours than one-zero sampling.

Results

Response frequency

The mean error rate for both pinpoint and one-zero sampling was calculated for each interval length and each of the three behavioural frequencies (see Fig. 1).

The mean error for pinpoint sampling was minimal for all interval lengths and behavioural frequencies. However, variance for the pinpoint sampling increased as interval length increased. For one-zero sampling, error rates increased as the interval length increased, with the 500 s interval showing the largest error rates irrespective of behavioural frequency.

Overall, mean error rates were consistently lower for the pinpoint sampling method in comparison to the one-zero sampling method (χ² = 9, df = 1, p = 0.0027, W = 1) (see Table 1). Post-hoc tests for all 9 comparisons (3 frequencies × 3 recording intervals) were p < 0.001.
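The reported values are what a Friedman test yields when one of two treatments ranks lower in every one of nine blocks. The sketch below checks that arithmetic by hand with the standard Friedman formula; it is not the authors' analysis code, and the premise that pinpoint sampling had the lower mean error in all nine blocks is an assumption inferred from W = 1.

```python
import math
from typing import List, Tuple


def friedman_two_treatments(first_lower_in_block: List[bool]) -> Tuple[float, float, float]:
    """Friedman chi-square, Kendall's W and p-value for k = 2 treatments with no ties.

    Each entry says whether the first treatment had the lower value in that block.
    """
    n = len(first_lower_in_block)   # number of blocks
    k = 2                           # number of treatments
    rank_sum_first = sum(1 if lower else 2 for lower in first_lower_in_block)
    rank_sum_second = 3 * n - rank_sum_first   # ranks in each block sum to 1 + 2
    chi_sq = (12.0 / (n * k * (k + 1))) * (rank_sum_first ** 2 + rank_sum_second ** 2) \
        - 3.0 * n * (k + 1)
    kendall_w = chi_sq / (n * (k - 1))
    # Survival function of a chi-square distribution with df = 1.
    p_value = math.erfc(math.sqrt(chi_sq / 2.0))
    return chi_sq, kendall_w, p_value


# Assumed pattern: pinpoint has the lower mean error in all 9 blocks.
chi_sq, kendall_w, p_value = friedman_two_treatments([True] * 9)
print(round(chi_sq, 4), round(kendall_w, 4), round(p_value, 4))
```

Under that assumption the rank sums are 9 and 18, which gives χ² = 9, W = 1 and p ≈ 0.0027 with df = 1.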

Table 1 Mean error rates for each sampling method under 5 s, 50 s and 500 s interval lengths for the response frequency simulation.

Response duration

The accuracy of both pinpoint and one-zero sampling was calculated for each interval length and all three behavioural durations (short, medium, and long) (see Fig. 2).

For all simulation durations, pinpoint sampling was less statistically biased, with minimal error rates. By contrast, mean error rates were much higher for one-zero sampling, and these increased as interval length increased. For both pinpoint and one-zero sampling, the variance in error increased with interval length.

The pinpoint sampling method consistently produced lower error rates than the one-zero method (χ² = 9, df = 1, p = 0.0027, W = 1) (see Table 2). Post-hoc tests for all 9 comparisons (3 durations × 3 recording intervals) were p < 0.001.

Table 2 Mean error rates for each sampling method under 5 s, 50 s and 500 s interval lengths for the response duration simulation.

Discussion

Our study tested two hypotheses: (1) one-zero sampling would be more accurate (less statistical error or bias) for detecting the occurrence of low frequency (event) behaviours, particularly when comparing less frequent pinpoint and one-zero observation methods, and (2) pinpoint sampling would provide a more accurate representation of percentages of occurrence for low, medium, and high duration (state) behaviours than one-zero sampling. The first hypothesis was not supported, as pinpoint sampling was better able to detect frequency responses than one-zero sampling, even when events occurred less frequently and when recording intervals were longer. The second hypothesis was supported in that pinpoint sampling had lower error margins than one-zero sampling for detecting duration behaviours. One-zero sampling was similarly capable of detecting duration behaviours of any length at low (5 s) or medium (50 s) recording intervals. At longer recording intervals (500 s), pinpoint sampling substantially outperformed one-zero sampling for the detection of duration (state) behaviours. Finally, for both sampling methods, increasing the recording interval length appeared to increase the variability in error rates for both frequency and duration responses. As the recording interval increased, one-zero sampling became less accurate (more statistically biased), as observed by an increase in mean error rate. Increased recording intervals also increased variability in the mean error rate for one-zero sampling of duration responses. Pinpoint sampling maintained low error rates regardless of the recording interval length; however, as the recording interval increased, pinpoint sampling showed greater variability in the mean error rate for both frequency and duration responses.

As noted in the Introduction, Wirth et al.⁵² is the only other study to date to use extensive computer-generated simulations to examine differences between pinpoint and one-zero sampling methods, in their case both partial interval recording (PIR) and whole interval recording (WIR) methods. Like our study, they generated 100 simulations, and found pinpoint sampling to be more accurate (less statistically biased) than PIR or WIR, which overestimated and underestimated cumulative event durations, respectively. One limitation of their simulation was that it used a truly randomized structure for the simulated responses, rather than the block structure used in ours, which limits the applicability of their simulation to real-world behaviours (behaviour is rarely, if ever, truly random). Regardless, their results were similar to our study in that pinpoint sampling was generally more accurate than one-zero sampling methods.

Taken together, the results of our study and previous simulations suggest that pinpoint sampling is more accurate in detecting responses than one-zero sampling. Below we consider these implications, as well as factors that should influence the selection of behavioural sampling methods.

Which sampling method is most appropriate for my study?

Pinpoint sampling has not been recommended for measuring frequency (event) responses, particularly those of low occurrence^6,9. However, in our simulation this method was able to accurately detect low occurrence (< 1%) frequencies. Therefore, the use of pinpoint sampling to measure any event responses, regardless of their frequency of occurrence, appears to be a viable option if large amounts of behavioural data are collected.

One-zero sampling methods are often preferred as an observational method because of the ease with which behaviours can be observed, recorded, and assessed for Interobserver Agreement (IOA)^53,54. The same can also be said for pinpoint sampling, which provides an equally user-friendly research method when compared to continuous (focal) recordings. In addition, researchers attempting to account for under- or over-estimates of one-zero recordings have devised different sampling methods, including partial, whole, occurrence, and non-occurrence interval (one-zero) recordings. Still, the difficulty here is that, if pinpoint sampling provides a more accurate representation of behavioural occurrence, then the solution should be to adopt this method rather than adjusting to a less accurate one-zero recording method.

An added benefit of using either pinpoint or one-zero sampling methods over continuous recordings is that frequency (event) versus duration (state) behaviours can be compared more clearly. For instance, if a researcher were assessing the impact of pacing on the welfare of an animal, measuring pacing as an event or state would result in different data being generated. Lehner⁶ suggests that the former could be assessed as a bout of event responses, but it is still not clear how to evaluate the difference between a bout of responses and less frequent but longer duration behaviours. Pinpoint and one-zero sampling methods avoid this problem by only recording whether the response occurred during some observation period, regardless of the frequency or duration of the recorded response. This makes these observation methods valuable in circumstances where presence or absence of a particular behaviour is more important than the measurement of its frequency or duration, such as in studies of courtship or reproduction^10,12.

There may remain several valuable uses for one-zero sampling as a tool for researchers. For example, one-zero sampling may still be the most useful technique when a specific, important behaviour occurs very rarely and is of short duration. The value of one-zero sampling would be further enhanced in studies where smaller amounts of data are collected. Examples could include courtship displays, where the behaviours may occur only a handful of times per individual per year for some species¹². The chance of the behaviour being recorded by pinpoint sampling may be minimal, yet the value of identifying the behaviour may be disproportionately high. However, caution is still warranted in the application of one-zero sampling methods to record rare, short duration responses, as it is not clear whether such interval recording methods would produce an accurate representation of such low occurrence responses.

Sampling method selection and laboratory lore

Historically, a major factor in determining behavioural observation methodology has been the prevalence of that sampling method within some field or observational species. For instance, Mann¹⁴ found that over half of all cetacean studies in their review used ad libitum sampling, even though such sampling methods are recognized to be both less quantitative and systematic. Likewise, one-zero sampling methods are typically used by primatologists and behaviour analysts for the study of non-human primate and human behaviour, respectively^30,31,32,33,34,35,36,49,53,54,55,56,57. The concept of using methodology passed down from previous studies and labs has been referred to as “laboratory lore” and is an asset to the cultural transmission of scientific knowledge^58,59. Nonetheless, the selection of behavioural observation methods, like all aspects of scientific research, should be based on the efficacy of the methodology used. In the case of selecting between pinpoint or one-zero sampling methods to estimate behavioural occurrences, our study indicates that pinpoint sampling outperforms one-zero sampling on all frequency (event) and duration (state) measures simulated. Thus, laboratory lore aside, pinpoint sampling seems to be the better option for measuring some aspect of behavioural prevalence when compared to one-zero sampling methods.

Methods

For all simulations, patterns of behaviour were computer generated for both frequency of occurrence (how often the behaviour appeared) and percentage of occurrence (the percentage of time that the behaviour occurred). On these simulated patterns of behaviour, two different non-continuous sampling methods were directly compared: pinpoint (instantaneous) and one-zero (interval) sampling. Two sets of simulations were produced: response frequency (to measure the ability of both behaviour methods to detect short, event behaviours at different rates of occurrence) and response duration (to measure the ability of the methods in assessing state behaviours of different lengths). Three levels for response frequency and response duration were determined, based on a level of frequency/duration: 3 s, 30 s, and 300 s. These three durations were selected because they are reflective of different durations of behaviour in published studies^10,12. The interval lengths for both pinpoint and one-zero sampling were set at 5 s, 50 s, and 500 s, in order to compare the effect of interval length on sample accuracy. These three interval lengths were chosen to reflect some of the common sampling lengths (frequent, regular and infrequent) used in human and animal research^10,12.

Simulations

All the simulations were done in the R computing language version 3.6.3 using the GUI RStudio (code publicly available at https://github.com/jonotuke/animal_simulation_2020)⁶⁰. For both sets of simulations, observation periods were set to a length of one hour, or 3600 s, as this time length is often set in observational studies⁶¹. A total of 1800 h of simulated data were generated across the response frequency and duration conditions.

Response frequency

This simulation focused on the recording of event behaviours: behaviours of short duration¹⁰. For the simulation, the duration of all event behaviours was set to exactly one second. Next, three different frequencies of event behaviour were selected: high (3 s), medium (30 s) and low (300 s) frequency of occurrence, in order to reflect different types of behaviour that occur very frequently, less frequently, or infrequently⁶² (Fig. 3). The observation period was one hour in length (3600 s). A total of 100 simulated data sets were generated for each of the three response frequencies. The exact time that each event occurred within the 3, 30 or 300 s period was randomised within the predefined blocks (i.e. the behaviour occurred exactly once within its 3, 30 or 300 s period).
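The block-randomised structure described above can be sketched as follows. This is a Python illustration written for this text, not the authors' R code from the linked repository; the function name, the per-second boolean representation, and the seed are assumptions.

```python
import random
from typing import List, Optional


def simulate_event_trace(block_length: int,
                         period: int = 3600,
                         rng: Optional[random.Random] = None) -> List[bool]:
    """Return a per-second record with one 1 s event placed at random inside each block."""
    if rng is None:
        rng = random.Random()
    trace = [False] * period
    for block_start in range(0, period, block_length):
        offset = rng.randrange(block_length)   # random second within this block
        trace[block_start + offset] = True
    return trace


rng = random.Random(2020)   # fixed seed so the sketch is reproducible
event_counts = {}
for label, block_length in [("high", 3), ("medium", 30), ("low", 300)]:
    trace = simulate_event_trace(block_length, rng=rng)
    event_counts[label] = sum(trace)
print(event_counts)   # one event per block: 1200, 120 and 12 events per hour
```

Because exactly one event falls in each block, the event count per hour is fixed and only the timing within each block varies between repetitions.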

The real (continuous) occurrence of each simulated response frequency was determined by calculating the number of seconds of each event that were possible in a simulated hour of data (observation period divided by frequency of occurrence; high frequency = 1200 s; medium frequency = 120 s; and low frequency = 12 s). The event behaviour seconds were then transformed into a percentage of total time (as is often shown in behaviour studies in the form of an activity budget), as well as frequency of occurrence. Thus, high frequency (3 s) responses occurred 33% of the hour, medium frequency (30 s) responses occurred 3.3%, and the low frequency (300 s) responses occurred 0.3% of the time.
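The arithmetic behind these continuous values is reproduced below as a small sketch; the variable names are illustrative.

```python
PERIOD_SECONDS = 3600   # one simulated hour
EVENT_SECONDS = 1       # every event lasts exactly one second

true_occurrence = {}
for label, block_length in [("high", 3), ("medium", 30), ("low", 300)]:
    events_per_hour = PERIOD_SECONDS // block_length   # one event per block
    seconds_of_behaviour = events_per_hour * EVENT_SECONDS
    percent_of_time = 100.0 * seconds_of_behaviour / PERIOD_SECONDS
    true_occurrence[label] = (seconds_of_behaviour, percent_of_time)

for label, (seconds, percent) in true_occurrence.items():
    print(f"{label}: {seconds} s of behaviour, {percent:.2f}% of the hour")
```

The unrounded percentages are 33.33%, 3.33% and 0.33%, which the text reports as 33%, 3.3% and 0.3%.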

To compare against this real (continuous) measurement, pinpoint and one-zero sampling were used on the simulated data sets. One-zero sampling recorded an event if it occurred at any point during the observation interval, also commonly referred to as partial interval recording (PIR). The three interval lengths (5, 50, and 500 s) were used for both pinpoint and one-zero sampling. This resulted in 900 data sets (nine combinations of simulation parameters and sampling parameters, each combination simulated 100 times) being developed.

Response duration

This simulation was developed for longer duration or state behaviours. In the literature, state behaviours can be of variable length, lasting anywhere from seconds (e.g. scratching) to minutes (e.g. preening) or hours (e.g. resting). To accommodate this, three levels of behavioural duration were selected. These durations were set as short (3 s), medium (30 s) and long (300 s) durations of occurrence (Fig. 4). Each of these states was treated separately (only short, medium, or long behaviours occurred in each simulation). As per the Response Frequency investigation, the observation period was set to one hour in length (3600 s). Each duration simulation was repeated 100 times.

The chosen behaviour occurred once per 600 s period. The exact time that each behaviour occurred within its respective 600 s period was selected at random (though the behaviour was not allowed to slip into the next period of 600 s). Continuous data sets were developed by using the raw, simulated data and transforming this into percentages. This meant that each behaviour occurred six times during each one-hour simulation, with the short duration (3 s) responses occurring 0.5% of the hour, the medium duration (30 s) responses occurring 5%, and the long duration (300 s) responses occurring 50% of the time.
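A sketch of this state-behaviour simulation is given below. It is an illustrative Python reading of the description above, not the authors' R code; the function name and the uniform choice of start time among all positions that keep the bout inside its block are assumptions.

```python
import random
from typing import List, Optional


def simulate_state_trace(duration: int,
                         block_length: int = 600,
                         period: int = 3600,
                         rng: Optional[random.Random] = None) -> List[bool]:
    """Place one bout of `duration` seconds at a random position inside each block."""
    if rng is None:
        rng = random.Random()
    trace = [False] * period
    for block_start in range(0, period, block_length):
        # Latest permitted start keeps the bout from slipping into the next block.
        offset = rng.randrange(block_length - duration + 1)
        for second in range(block_start + offset, block_start + offset + duration):
            trace[second] = True
    return trace


rng = random.Random(2020)
continuous_percent = {}
for label, duration in [("short", 3), ("medium", 30), ("long", 300)]:
    trace = simulate_state_trace(duration, rng=rng)
    continuous_percent[label] = 100.0 * sum(trace) / len(trace)
print(continuous_percent)   # 0.5, 5.0 and 50.0 percent of the hour
```

Six bouts per hour give 18 s, 180 s and 1800 s of behaviour, which matches the 0.5%, 5% and 50% stated above regardless of where each bout falls.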

Each of the three behaviour durations (short, medium, and long) was measured using one-zero (PIR) and pinpoint sampling. Three interval lengths were recorded, again consisting of 5 s, 50 s and 500 s, as had been selected for the Response Frequency investigations. These interval lengths were used for both the pinpoint and one-zero sampling. Once complete, the results were transformed into percentages and compared to the continuous data to determine the level of error.
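The comparison step can be sketched end to end for one behaviour duration. This is a hedged illustration rather than the published pipeline, and it rests on several assumptions: error is taken as the sampled percentage minus the continuous percentage, pinpoint samples are taken at the last second of each interval, and only complete intervals are scored (500 s does not divide 3600 s evenly, and the text does not say how the remainder was handled). Its output should not be read as reproducing Tables 1 or 2.

```python
import random
import statistics

PERIOD = 3600  # one simulated hour, in seconds


def simulate_state_trace(duration, rng, block_length=600):
    """One bout of `duration` seconds at a random position inside each block."""
    trace = [False] * PERIOD
    for block_start in range(0, PERIOD, block_length):
        offset = rng.randrange(block_length - duration + 1)  # bout stays in its block
        for second in range(block_start + offset, block_start + offset + duration):
            trace[second] = True
    return trace


def pinpoint_percent(trace, interval):
    """Percentage of sample points (last second of each interval) with the behaviour present."""
    scores = [trace[i] for i in range(interval - 1, len(trace), interval)]
    return 100.0 * sum(scores) / len(scores)


def one_zero_percent(trace, interval):
    """Percentage of complete intervals in which the behaviour occurred at any point (PIR)."""
    starts = range(0, len(trace) - interval + 1, interval)
    scores = [any(trace[s:s + interval]) for s in starts]
    return 100.0 * sum(scores) / len(scores)


def error_summary(duration, interval, repetitions=100, seed=0):
    """Mean and standard deviation of (sampled % - continuous %) for each method."""
    rng = random.Random(seed)
    errors = {"pinpoint": [], "one_zero": []}
    for _ in range(repetitions):
        trace = simulate_state_trace(duration, rng)
        continuous = 100.0 * sum(trace) / len(trace)
        errors["pinpoint"].append(pinpoint_percent(trace, interval) - continuous)
        errors["one_zero"].append(one_zero_percent(trace, interval) - continuous)
    return {method: (statistics.mean(values), statistics.stdev(values))
            for method, values in errors.items()}


summaries = {interval: error_summary(duration=30, interval=interval)
             for interval in (5, 50, 500)}
for interval, summary in summaries.items():
    for method, (mean_error, sd_error) in summary.items():
        print(f"{interval:>3} s {method:<8} mean error {mean_error:6.2f}  SD {sd_error:5.2f}")
```

Under these assumptions the sketch shows the same qualitative pattern the paper describes: the one-zero error is positive and grows with interval length, while the pinpoint error stays near zero but becomes more variable.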

Statistical analysis

Statistical analyses were conducted on the mean error scores for pinpoint and one-zero sampling at each respective interval length. The Friedman test was used to investigate whether there was a statistically significant effect of sampling method on the estimation error. The sampling/simulation combination was used as a blocking factor. The non-parametric Friedman test was used due to the non-normality of the errors and the observed heteroscedasticity. When significant differences were found, paired Wilcoxon tests were used to compare the treatments. To compensate for multiple comparisons, we used an FDR adjustment. A method is described as statistically unbiased when it gives estimates of the true parameters that are correct (i.e. neither over- nor under-estimated).
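The text does not name the FDR procedure used. A common choice is the Benjamini-Hochberg adjustment, sketched below as an assumption; the function name and the four p-values are hypothetical and are not results from the study.

```python
from typing import List


def benjamini_hochberg(p_values: List[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])   # indices from smallest to largest p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity of the adjusted values.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted


# Hypothetical raw p-values from four paired comparisons.
raw = [0.01, 0.04, 0.03, 0.005]
adjusted = benjamini_hochberg(raw)
print([round(p, 4) for p in adjusted])   # [0.02, 0.04, 0.04, 0.02]
```

Each raw p-value is scaled by the number of comparisons divided by its rank, and then capped so that adjusted values never decrease as the raw values increase.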

References

1. Fraser, D. Animal behaviour, animal welfare and the scientific study of affect. Appl. Anim. Behav. Sci. 118, 108–117. https://doi.org/10.1016/j.applanim.2009.02.020 (2009).
2. Sands, J. & Creel, S. Social dominance, aggression and faecal glucocorticoid levels in a wild population of wolves, Canis lupus. Anim. Behav. 67, 387–396. https://doi.org/10.1016/j.anbehav.2003.03.019 (2004).
3. Carlstead, K., Seidensticker, J. & Baldwin, R. Environmental enrichment for zoo bears. Zoo Biol. 10, 3–16. https://doi.org/10.1002/zoo.1430100103 (1991).
4. Fernandez, E. J. & Timberlake, W. Mutual benefits of research collaborations between zoos and academic institutions. Zoo Biol. 27, 470–487. https://doi.org/10.1002/zoo.20215 (2008).
5. Ward, S. J., Sherwen, S. & Clark, F. E. Advances in applied zoo animal welfare science. J. Appl. Anim. Welf. Sci. 21, 23–33. https://doi.org/10.1016/0003-3472(79)90016-2 (2018).
6. Lehner, P. N. Handbook of Ethological Methods (Cambridge University Press, 1998).
7. Domjan, M. The Principles of Learning and Behavior (Nelson Education, 2014).
8. Pierce, W. D. & Cheney, C. D. Behavior Analysis and Learning (Psychology Press, 2013).
9. Altmann, J. Observational study of behavior: Sampling methods. Behaviour 49, 227–266. https://doi.org/10.1163/156853974X00534 (1974).
10. Bateson, M. & Martin, P. Measuring Behaviour: An Introductory Guide 4th edn. (Cambridge University Press, 2021).
11. Bernstein, I. S. An empirical comparison of focal and ad libitum scoring with commentary on instantaneous scans, all occurrence and one-zero techniques. Anim. Behav. 42, 721–728. https://doi.org/10.1016/S0003-3472(05)80118-6 (1991).
12. Martin, P. & Bateson, P. Recording methods. In ‘Measuring Behaviour: An Introductory Guide’ (Cambridge University Press, 2007).
13. Rhine, R. J. & Ender, P. B. Comparability of methods used in the sampling of primate behavior. Am. J. Primatol. 5, 1–15. https://doi.org/10.1002/ajp.1350050102 (1983).
14. Mann, J. Behavioral sampling methods for cetaceans: A review and critique. Mar. Mamm. Sci. 15, 102–122. https://doi.org/10.1111/j.1748-7692.1999.tb00784.x (1999).
15. Hämäläinen, W. et al. Measuring behaviour accurately with instantaneous sampling: A new tool for selecting appropriate sampling intervals. Appl. Anim. Behav. Sci. 180, 166–173. https://doi.org/10.1016/j.applanim.2016.04.006 (2016).
16. Tyler, S. Time-sampling: A matter of convention. Anim. Behav. 27, 801–810. https://doi.org/10.1016/0003-3472(79)90016-2 (1979).
17. Amato, K. R., Van Belle, S. & Wilkinson, B. A comparison of scan and focal sampling for the description of wild primate activity, diet and intragroup spatial relationships. Folia Primatol. 84, 87–101. https://doi.org/10.1159/000348305 (2013).
18. Fernandez, E. J., Kinley, R. C. & Timberlake, W. Training penguins to interact with enrichment devices for lasting effects. Zoo Biol. 38, 43–49. https://doi.org/10.1002/zoo.21510 (2019).
19. Sanders, K. & Fernandez, E. J. Behavioral implications of enrichment for golden lion tamarins: A tool for ex situ conservation. J. Appl. Anim. Welf. Sci. 1, 1–10 (2020).
20. Stevens, J., Thyssen, A., Laevens, H. & Vervaecke, H. The influence of zoo visitor numbers on the behaviour of harbour seals (Phoca vitulina). J. Zoo Aquar. Res. 1, 31–34. https://doi.org/10.19227/jzar.v1i1.20 (2013).
21. Grenier, D., Barrette, C. & Crête, M. Food access by white-tailed deer (Odocoileus virginianus) at winter feeding sites in Eastern Québec. Appl. Anim. Behav. Sci. 63, 323–337. https://doi.org/10.1016/S0168-1591(99)00017-9 (1999).
22. Gilby, I. C., Pokempner, A. A. & Wrangham, R. W. A direct comparison of scan and focal recording rules for measuring wild chimpanzee feeding behaviour. Folia Primatol. 81, 254–264. https://doi.org/10.1159/000322354 (2010).
23. Fernandez, E. J., Ramirez, M. & Hawkes, N. C. Activity and pool use in relation to temperature and water changes in zoo hippopotamuses (Hippopotamus amphibious). Animals 10, 1022 (2020).
24. Fernandez, E. J. & Timberlake, W. Foraging devices as enrichment in captive walruses (Odobenus rosmarus). Behav. Proc. 168, 103943. https://doi.org/10.1016/j.beproc.2019.103943 (2019).
25. Shora, J. A., Myhill, M. N. G. & Brereton, J. E. Should zoo foods be coati chopped. J. Zoo Aquar. Res. 6, 22–25. https://doi.org/10.19227/jzar.v6i1.309 (2021).
26. Teixeira, D. L., Machado Filho, L. C. P., Hötzel, M. J. & Enríquez-Hidalgo, D. Effects of instantaneous stocking rate, paddock shape and fence with electric shock on dairy cows’ behaviour. Livestock Sci. 198, 170–173. https://doi.org/10.1016/j.livsci.2017.01.007 (2017).
27. Pullin, A. N. et al. Instantaneous sampling intervals validated from continuous video observation for behavioral recording of feedlot lambs. J. Anim. Sci. 95, 4703–4707. https://doi.org/10.2527/jas2017.1835 (2017).
28. Bailey, J. S. & Burch, M. R. Research Methods in Applied Behavior Analysis (Routledge, 2017).
29. Bakeman, R. & Quera, V. Behavioral observation. In APA Handbooks in Psychology APA Handbook of Research Methods in Psychology. Foundations Planning Measures and Psychometrics Vol. 1 (eds Cooper, P. M. et al.) 207–225 (American Psychological Association, 2012). https://doi.org/10.1037/13619-013.
Chapter Google Scholar
Kraemer, H. C. One-zero sampling in the study of primate behavior. Primates 20, 237–244 (1997).
Article Google Scholar
Brecht, K. F. et al. The status and value of replications in animal behavior science. Anim. Behav. Cogn. 8, 97–106. https://doi.org/10.26451/abc.08.02.01.2021 (2021).
Article Google Scholar
Rhine, R. J. & Flanigon, M. An empirical comparison of one-zero, focal-animal, and instantaneous methods of sampling spontaneous primate social behavior. Primates 19, 353–361. https://doi.org/10.1007/BF02382803 (1978).
Article Google Scholar
Dunkerton, J. Should classroom observation be quantitative?. Educ. Res. 23, 144–151. https://doi.org/10.1080/0013188810230208 (1981).
Article Google Scholar
Omark, D. R., Fiedler, M. L. & Marvin, R. S. Dominance hierarchies: Observational techniques applied to the study of children at play. Instruct. Sci. 5, 403–423. https://doi.org/10.1007/BF00051807 (1976).
Article Google Scholar
Rhine, R. J. & Linville, A. K. Properties of one-zero scores in observational studies of primate social behavior: The effect of assumptions on empirical analyses. Primates 21, 111–122. https://doi.org/10.1007/BF02383828 (1980).
Article Google Scholar
Rhine, R. J., Norton, G. W., Wynn, G. M. & Wynn, R. D. Weaning of free-ranging infant baboons (Papio cynocephalus) as indicated by one-zero and instantaneous sampling of feeding. Int. J. Primatol. 6, 491–499. https://doi.org/10.1007/BF02735572 (1985).
Article Google Scholar
Mitlöhner, F. M. et al. Behavioral sampling techniques for feedlot cattle. J. Anim. Sci. 79, 1189–1193. https://doi.org/10.1016/S0168-1591(99)00017-9 (2001).
Article PubMed Google Scholar
Simpson, M. J. A. & Simpson, A. E. One-zero and scan methods for sampling behaviour. Anim. Behav. 25, 726–731. https://doi.org/10.1016/0003-3472(77)90122-1 (1977).
Article Google Scholar
Xiao, J., Wang, K. & Wang, D. Diurnal changes of behavior and respiration of Yangtze finless porpoises (Neophocaena phocaenoides asiaeorientalis) in captivity. Zoo Biol. 24, 531–541. https://doi.org/10.1002/zoo.20070 (2005).
Article Google Scholar
Saibaba, P., Sales, G. D., Stodulski, G. & Hau, J. Behaviour of rats in their home cages: Daytime variations and effects of routine husbandry procedures analysed by time sampling techniques. Lab. Anim. 30, 13–21. https://doi.org/10.1258/002367796780744875 (1996).
Article CAS PubMed Google Scholar
Griffin, B. & Adams, R. A parametric model for estimating prevalence, incidence, and mean bout duration from point sampling. Am. J. Primatol. 4, 261–271 (1983).
Article CAS Google Scholar
Harrop, A. & Daniels, M. Methods of time sampling: A reappraisal of momentary time sampling and partial interval recording. J. Appl. Anim. Behav. Anal. 19, 73–77 (1986).
Article CAS Google Scholar
Repp, A. C. et al. A comparison of frequency, interval, and time-sampling methods of data collection. J. Appl. Behav. Anal. 9, 501–508. https://doi.org/10.1901/jaba.1976.9-501 (1976).
Article CAS PubMed PubMed Central Google Scholar
Suen, H. K. & Ary, D. Variables influencing one-zero and instantaneous time sampling outcomes. Primates 25, 89–94. https://doi.org/10.1007/BF02382298 (1984).
Article Google Scholar
Gardenier, N. C., MacDonald, R. & Green, G. Comparison of direct observational methods for measuring stereotypic behavior in children with autism spectrum disorders. Res. Devel. Disabilit. 25, 99–118 (2004).
Article Google Scholar
Meany-Daboul, M. G., Roscoe, E. M., Bourret, J. C. & Ahearn, W. H. A comparison of momentary time sampling and partial-interval recording for evaluating functional relations. J. Appl. Behav. Anal. 40, 501–514 (2007).
Article Google Scholar
Murphy, M. J. & Harrop, A. Observer error in the use of momentary time sampling and partial interval recording. Br. J. Psych. 85, 169–179. https://doi.org/10.1111/j.2044-8295.1994.tb02517.x (1994).
Article Google Scholar
Radley, K. C., O’Handley, R. D. & Labrot, Z. C. A comparison of momentary time sampling and partial-interval recording for assessment of effects of social skills training. Psych. Schools 52, 363–378 (2015).
Article Google Scholar
Rapp, J. T. et al. Interval recording for duration events: A re-evaluation. Behav. Interv. 22, 319–345 (2007).
Article Google Scholar
Devine, S. L. et al. Detecting changes in simulated events using partial-interval recording and momentary time sampling III: Evaluating sensitivity as a function of session length. Behav. Interv. 26, 103–124 (2011).
Article Google Scholar
Rapp, J. T. et al. Detecting changes in simulated events using partial-interval recording and momentary time sampling. Behav. Interv. 23, 237–269 (2008).
Article Google Scholar
Wirth, O., Slaven, J. & Taylor, M. A. Interval sampling methods and measurement error: A computer simulation. J. Appl. Behav. Anal. 47, 83–100. https://doi.org/10.1002/jaba.93| (2014).
Article PubMed Google Scholar
Cooper, J. O., Heron, T. E. & Heward, W. L. Applied Behavior Analysis 3rd edn. (Pearson Publishing, 2020).
Book Google Scholar
Poling, A., Methot, L. L. & LeSage, M. G. Fundamentals of Behavior Analytic Research (Springer, 1995).
Book Google Scholar
Doran, D. M. Comparison of instantaneous and locomotor bout sampling methods: A case study of adult male chimpanzee locomotor behavior and substrate use. Am. J. Phys. Anthropol. 89, 85–99. https://doi.org/10.1002/ajpa.1330890108| (1992).
Article CAS PubMed Google Scholar
Merrell, K. Assessment of children’s social skills: Recent developments, best practices, and new directions. Exceptionality 9, 3–18 (2001).
Article Google Scholar
Seyfarth, R. M., Cheney, D. L. & Marler, P. Vervet monkey alarm calls: Semantic communication in a free-ranging primate. Anim. Behav. 28, 1070–1094 (1980).
Article Google Scholar
Buskist, W. & Johnston, J. M. Laboratory lore and research practices in the experimental analysis of human behavior. The Behavior Analyst 11, 41–42 (1988).
Article CAS Google Scholar
Johnston, J. M. & Pennypacker, H. S. Strategies and Tactics of Behavioral Research (Routledge, 2010).
Book Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2020). https://www.R-project.org/.
Leger, D. W. An empirical evaluation of instantaneous and one-zero sampling of chimpanzee behavior. Primates 18, 387–393. https://doi.org/10.1007/BF02383116 (1977).
Article Google Scholar
Farrar, B. G., Voudouris, K. & Clayton, N. S. Replications, comparisons, sampling and the problem of representativeness in animal cognition research. Anim. Behav. Cogn. 8, 273–295. https://doi.org/10.26451/abc.08.02.14.2021 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors would like to thank Mrs S. Brereton for proofreading the manuscript. As noted in the text, the simulation code is publicly available, so anyone can adjust the variables and rerun the simulations and comparisons themselves: https://github.com/jonotuke/animal_simulation_2020.

Author information

Authors and Affiliations

University Centre Sparsholt, Sparsholt College, Westley Lane, Sparsholt, Winchester, SO21 2NF, Hampshire, UK
James Edward Brereton
School of Mathematical Sciences, The University of Adelaide, Adelaide, SA, 5005, Australia
Jonathan Tuke
School of Animal and Veterinary Sciences, The University of Adelaide, Adelaide, SA, 5005, Australia
Eduardo J. Fernandez

Contributions

E.F. and J.E.B. wrote the main manuscript text and S.T. ran the simulations and statistical tests. All authors reviewed the manuscript.

Corresponding author

Correspondence to James Edward Brereton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Brereton, J.E., Tuke, J. & Fernandez, E.J. A simulated comparison of behavioural observation sampling methods. Sci Rep 12, 3096 (2022). https://doi.org/10.1038/s41598-022-07169-5

Received: 05 May 2021
Accepted: 21 January 2022
Published: 23 February 2022
DOI: https://doi.org/10.1038/s41598-022-07169-5
