Behavioural research requires the use of sampling methods to document the occurrence of responses observed. Sampling/recording methods include ad libitum, continuous, pinpoint (instantaneous), and one-zero (interval) sampling. Researchers have questioned the utility of each sampling method under different contexts. Our study compared computerized simulations of both pinpoint and one-zero sampling to continuous recordings. Two separate computer simulations were generated, one for response frequency and one for response duration, with three different response frequencies (high, medium, or low) and response durations (short, medium, and long) in each simulation, respectively. Similarly, three different observation intervals (5, 50, and 500 s) were used to record responses as both pinpoint and one-zero sampling methods in the simulations. Under both simulations, pinpoint sampling outperformed one-zero sampling, with pinpoint sampling producing less statistical bias in error rates under all frequencies, durations, and observation intervals. As observation intervals increased, both mean error rates and variability in error rates increased for one-zero sampling, while only variability in error rate increased for pinpoint sampling. The results suggest that pinpoint sampling techniques are effective for measuring both frequency (event) and duration (state) behaviours, and that pinpoint sampling is a less statistically biased behavioural observation method than one-zero sampling.
Behavioural studies are a valuable tool for the scientific study of both human and non-human animal performance. In animal research, behavioural observations are used more often than other welfare indicators such as glucocorticoid analysis1,2. Behavioural research may also be used to investigate the prevalence of positive behaviours, like foraging, or negative behaviours, such as stereotypies3,4,5. Studies of behaviour are also frequently conducted for wild animal populations and to better understand natural histories or investigate the impact of human disturbance2,6. Research on animal behaviour is now so well recognised that there are numerous journals dedicated to its study, for instance: Animal Behaviour, Applied Animal Behaviour Science, and Ethology.
The methods used in behavioural research can be traced back to laboratory studies. Scientists during the mid-twentieth Century often used a mixture of both human and animal models to answer questions in the field of behavioural psychology7,8. Based on the range of different techniques that were generated by earlier studies, Altmann9 summarised the methods available. This paper became fundamentally important to those interested in behavioural research and remains a keystone paper for researchers. Other authors, such as Bateson and Martin10 further refined the behavioural methods and their definitions. Bateson and Martin10 distinguish between sampling and recording rules, which detail differences between the number of subjects observed (i.e. sampling rules; focal [1 subject] vs. scan [> 1 subjects]), or the observation method used to sample behaviour (i.e. recording rules; see below). However, little distinction is made between sampling and recording rules outside of this text, and for the purposes of our paper, we will refer to recording rules as sampling methods or techniques.
Types of sampling (recording) methods
Since Altmann’s9 review, some behavioural sampling techniques have become increasingly popular in the research literature, whereas others are rarely used. Several behaviour measurement techniques have received criticism in terms of their repeatability11. For example, ad libitum (qualitative) sampling may be useful for developing ethograms and for pilot studies but has methodological flaws with regards to its lack of standardisation12,13. However, ad libitum sampling is still used in studies of behaviour, with a review by Mann14 identifying that between 53 and 59% of cetacean studies published in Marine Mammal Science used this sampling technique.
Continuous recording is considered the gold standard for behavioural sampling, as this method records all occurrences of behaviour and their durations6,15. In the past, this made continuous recording often challenging for researchers: for instance, an active animal that rapidly changed behaviour would have been difficult to observe and record16. Similarly, measurement of multiple animals using a continuous method would have been incredibly challenging to document accurately, hence the method is often considered synonymous with the focal sampling of one individual9,12. Use of modern technology has in part ameliorated some of these issues by allowing behaviour to be recorded and analysed later17. However, continuous recording may remain a challenge, especially where large amounts of data are being recorded or direct comparisons of response frequencies and durations are made, and as such there is a need for alternative methods. As a result, several sampling (recording) methods have been developed that allow multiple animals and behaviours to be measured at one time (scan sampling), as well in a non-continuous fashion.
The use of pinpoint sampling, also referred to as instantaneous or momentary time sampling, is a commonly used method for observational study6,18,19,20. With pinpoint sampling, one or more responses are recorded at preselected moments in time (e.g., every 15 s for an hour). The benefits of pinpoint sampling are that it is less intensive than continuous sampling, and therefore may be more feasible for researchers to conduct12,21,22. The methods are also more versatile, allowing researchers to make decisions as to how long intervals should be spaced. For example, some researchers might choose to use 15-s intervals, particularly when studying an active animal or when conducting observations of key times, such as when enrichment is provided23,24. On the other hand, observers might choose to use much longer intervals, such as one-, two- or five-minute intervals when their subjects are inactive or if they are observing for long time periods25,26. Shorter intervals tend to result in values that match more closely the continuous behaviour scores but require more recording effort27.
One-zero or interval sampling involves choosing specific intervals of time, like pinpoint sampling, but instead recording whether one or more responses occur (or conversely, do not occur) within that interval of time6,28,29. While popular with both human and non-human primate research, one-zero sampling seems to receive less representation than pinpoint sampling in most animal behaviour studies and has been criticised by previous researchers9,30. However, one-zero sampling has some of the same benefits of instantaneous sampling in that interval length can be tailored in line with the requirements of the study. Additionally, one-zero sampling has the potential to collect more behaviours during a predefined period, as multiple behaviours can be recorded during each interval9. Leger31 identified good agreement with continuous behaviour measures when using one-zero sampling at 15-s intervals for chimpanzees (Pan troglodytes). Likewise, Rhine and Flanigon32 found similar levels of occurrence when comparing continuous, pinpoint, and one-zero sampling methods with a colony group of stumptail macaques (Macaca arctoides). As noted above, one-zero (interval) sampling is also frequently used in studies on human behaviour, for example in the classroom33,34.
Both pinpoint and one-zero sampling overcome some of the issues associated with continuous recording by reducing the amount of input required by the researcher, while still aiming to keep the sample representative of the subject’s behavioural repertoire35,36,37,38. However, one key question is how closely these techniques correlate with continuous recording? A major concern focuses on distinguishing between the frequency vs duration of some response, with behaviours of short duration typically referred to as “events”, while behaviours of long durations are called “states”. Pinpoint sampling loses information in terms of the duration of any response and is potentially less likely to pick up any behaviours of short duration (events)12,39. By contrast, one-zero sampling is better at recording all observable behaviours, but both behavioural frequency and duration could be easily misrepresented: there is no way to identify whether a behaviour recorded as present for one interval was seen once or thirty times during that time period40.
Sampling method simulations
Researchers in various fields have compared differences between pinpoint and one-zero sampling methods. Early simulations lacked the precision and/or ability to run extensive repetitions of their simulations to accurately assess sampling method differences41,42,43,44. Other researchers have attempted to make similar methodological comparisons via the data collection of actual behavioural occurrences31,32,45,46,47,48,49. While the results of differences in sampling methods for real occurrences of behaviour varied, most studies found pinpoint sampling to be more accurate than one-zero sampling, at least with respect to duration (state) behaviours. Nonetheless, caution should be used in making determinations of the validity of any result based on specific examples, as exceptions to any rule can and do occur.
Only three recent studies, all conducted by behaviour analysts interested in observations for applied, behaviour change purposes with human populations, have attempted to simulate data sets and compare some aspect of pinpoint and one-zero sampling methods50,51,52. In two of these studies50,51, limited simulations were produced via the rolling of die and pinpoint sampling was compared to a type of one-zero sampling, Partial Interval Recording (PIR), in which the response only need occur at any point during an observation interval to be recorded. In both studies, pinpoint sampling generally outperformed one-zero sampling for the detection of duration responses, with some variation in the ability of PIR to accurately detect frequency responses compared to pinpoint sampling and continuous recordings. Wirth et al.52 is the only study to date to use extensive computer-generated simulations to examine differences between pinpoint and one-zero sampling methods. Their study utilised both PIR and Whole Interval Recording (WIR), where the duration response must occur during the entire observation interval to be recorded. Overall, they found that pinpoint sampling outperformed one-zero sampling methods on most measures.
The following study proposes to compare computer simulated occurrences of both low/short, medium, and high/long frequency/duration behaviours, as well as similar observation intervals for pinpoint and one-zero sampling methods. Different durations of behaviour were used to provide generalised situations researchers may encounter: some behaviours are normally short (e.g. sneezing), medium (e.g. feeding) or long (e.g. resting) in their duration. We hypothesised that: (1) one-zero sampling would be more accurate (less statistically biased) for detecting the occurrence of low frequency (event) behaviours, particularly when comparing less frequent pinpoint and one-zero observation methods (e.g., 500 s observation intervals), and (2) pinpoint sampling would provide a more accurate representation of percentages of occurrence for both low, medium, and high duration (state) behaviours than one-zero sampling.
The mean error rate for both pinpoint and one-zero sampling was calculated for each interval length and each of the three behavioural frequencies (see Fig. 1).
The mean error for pinpoint sampling was minimal for all interval lengths and behavioural frequencies. However, variance for the pinpoint sampling increased as interval length increased. For one-zero sampling, error rates increased as the interval length increased, with the 500 s interval showing the largest error rates irrespective of behavioural frequency.
Overall, mean error rates were consistently lower for the pinpoint sampling method in comparison to the one-zero sampling method (χ2 = 9, df = 1, p = 0.0027, W = 1) (see Table 1). Post-hoc tests for all 9 comparisons (3 frequencies × 3 recording intervals) were p < 0.001.
The accuracy of both pinpoint and one-zero sampling was calculated for each interval length and all three behavioural durations (short, medium, and long) (see Fig. 2).
For all simulation frequencies, pinpoint sampling was less statistically biased, with minimal error rates. By contrast, mean error rates were much higher for one-zero sampling, and these increased as interval length increased. For both pinpoint and one-zero sampling, the variance in error increased with interval length.
The pinpoint sampling method consistently produced lower error rates than the one-zero method (χ2 = 9, df = 1, p = 0.0027, W = 1) (see Table 2). Post-hoc tests for all 9 comparisons (3 durations × 3 recording intervals) were p < 0.001.
Our study attempted to answer two hypotheses: (1) one-zero sampling would be more accurate (less statistical error or bias) for detecting the occurrence of low frequency (event) behaviours, particularly when comparing less frequent pinpoint and one-zero observation methods, and (2) pinpoint sampling would provide a more accurate representation of percentages of occurrence for both low, medium, and high duration (state) behaviours than one-zero sampling. The first hypothesis was not supported, as pinpoint sampling was better able to detect frequency responses than one-zero sampling, even when events occurred less frequently, and when recording intervals were longer. The second hypothesis was supported in that pinpoint sampling had lower error margins than one-zero sampling for detecting duration behaviours. One-zero sampling was similarly capable at detecting duration behaviours of any length at low (5 s) or medium (50 s) recording intervals. At longer recording intervals (500 s), pinpoint sampling substantially outperformed one-zero sampling for the detection of duration (state) behaviours. Finally, for both sampling methods, increasing the interval recording length appeared to increase the variability in error rates for both frequency and duration responses. As the recording interval increased, one-zero sampling became less accurate (more statistically biased), as observed by an increase in mean error rate. Increased recording intervals also increased variability in the mean error rate for one-zero sampling of duration responses. Pinpoint sampling maintained low error rates regardless of the recording interval length, however, as the recording interval increased, pinpoint sampling showed greater variability in the mean error rate for both frequency and duration responses.
As noted in the Introduction, Wirth et al.52 is the only other study to date to use extensive computer-generated simulations to examine differences between pinpoint and one-zero sampling methods, in their case both partial interval recording (PIR) and whole interval recording (WIR) methods. Like our study, they generated 100 simulations, and found pinpoint sampling to be more accurate (less statistically biased) than PIR or WIR, which overestimated and underestimated cumulative event durations, respectively. One limitation of their simulation was that it used a truly randomized rather than block structure for the simulated responses, as ours did, which more directly limits the applicability of their simulation to real-world behaviours (behaviour is rarely, if ever, truly random). Regardless, their results were similar to our study in that pinpoint sampling was generally more accurate than one-zero sampling methods.
Taken together, the results of our study and previous simulations suggest that pinpoint sampling is more accurate in detecting responses than one-zero sampling. Below we consider these implications, as well as factors that should influence the selection of behavioural sampling methods.
Which sampling method is most appropriate for my study?
Pinpoint sampling has not been recommended for measuring frequency (event) responses, particularly those of low occurrence6,9. However, in our simulation this method was accurately able to detect low occurrence (< 1%) frequencies. Therefore, the use of pinpoint sampling to measure any event responses, regardless of their frequency of occurrence, appears to be a viable option if large amounts of behavioural data are collected.
One-zero sampling methods are often preferred as an observational method because of the ease with which behaviours can be observed, recorded, and assessed for Interobserver Agreement (IOA)53,54. The same can also be said for pinpoint sampling, which provides an equally user-friendly research method when compared to continuous (focal) recordings. In addition, researchers attempting to account for under- or over-estimates of one-zero recordings have devised different sampling methods, including partial, whole, occurrence, and non-occurrence interval (one-zero) recordings. Still, the difficulty here is that, if pinpoint sampling provides a more accurate representation of behavioural occurrence, then the solution should be to adopt this method rather than adjusting to a less accurate one-zero recording method.
An added benefit of using either pinpoint or one-zero sampling methods over continuous recordings are that frequency (event) versus duration (state) behaviours can be compared more clearly. For instance, if a researcher were assessing the impact of pacing on the welfare of an animal, measuring pacing as an event or state would result in different data being generated. Lehner6 suggests that the former could be assessed as a bout of event responses, but it is still not clear how to evaluate the difference between about of responses to less frequent but longer duration behaviours. Pinpoint and one-zero sampling methods avoid this problem by only recording whether the response occurred during some observation period, regardless of the frequency or duration of the recorded response. This makes these observation methods valuable in circumstances where presence or absence of a particular behaviour is more important than the measurement of its frequency or duration, such as in studies of courtship or reproduction10,12.
There may remain several valuable uses for one-zero sampling as a tool for researchers. For example, one-zero sampling may still be the most useful technique when a specific, important behaviour occurs very rarely and is of short duration. The value of one-zero sampling would be further enhanced in studies where smaller amounts of data are collected. Examples could include courtship displays, where the behaviours may occur only a handful of times per individual per year for some species12. The chance of the behaviour being recorded by pinpoint sampling may be minimal, yet the value of identifying the behaviour may be disproportionately high. However, caution is still warranted in the application of one-zero sampling methods to record rare, short duration responses, as it is not clear whether such interval recording methods would produce an accurate representation of such low occurrence responses.
Sampling method selection and laboratory lore
Historically, a major factor in determining behavioural observation methodology has been the prevalence of that sampling method within some field or observational species. For instance, Mann14 found that over half of all cetacean studies in their review used ad libitum sampling, even though such sampling methods are recognized to be both less quantitative and systematic. Likewise, one-zero sampling methods are typically used by primatologists and behaviour analysts for the study of non-human primate and human behaviour, respectively30,31,32,33,34,35,36,49,53,54,55,56,57. The concept of using methodology passed down from previous studies and labs has been referred to as “laboratory lore” and is an asset to the cultural transmission of scientific knowledge58,59. Nonetheless, the selection of behavioural observation methods, like all aspects of scientific research, should be based on the efficacy of the methodology used. In the case of selecting between pinpoint or one-zero sampling methods to estimate behavioural occurrences, our study indicates that pinpoint sampling outperforms one-zero sampling on all frequency (event) and duration (state) measures simulated. Thus, laboratory lore aside, pinpoint sampling seems to be the better option for measuring some aspect of behavioural prevalence when compared to one-zero sampling methods.
For all simulations, patterns of behaviour were computer generated for both frequency of occurrence (how often the behaviour appeared) and percentage of occurrence (the percentage of time that the behaviour occurred). On these simulated patterns of behaviour, two different non-continuous sampling methods were directly compared: pinpoint (instantaneous) and one-zero (interval) sampling. Two sets of simulations were produced: response frequency (to measure the ability of both behaviour methods to detect short, event behaviours at different rates of occurrence) and response duration (to measure the ability of the methods in assessing state behaviours of different lengths). Three levels for response frequency and response duration were determined, based on a level of frequency/duration: 3 s, 30 s, and 300 s. These three durations were selected because they are reflective of different durations of behaviour in published studies10,12. The interval lengths for both pinpoint and one-zero sampling were set at 5 s, 50 s, and 500 s, in order to compare the effect of interval length on sample accuracy. These three interval lengths were chosen to reflect some of the common sampling lengths (frequent, regular and infrequent) used in human and animal research10,12.
All the simulations were done in the R computing language version 3.6.3 using the GUI RStudio (code publicly available at https://github.com/jonotuke/animal_simulation_2020)60. For both sets of simulations, observation periods were set to a length of one hour, or 3600 s, as this time length is often set in observational studies61,. A total of 1800 h of simulated data were generated across the response frequency and duration conditions.
This simulation focused on the recording of event behaviours: behaviours of short duration10. For the simulation, the duration of all event behaviours was set to exactly one second. Next, three different frequencies of event behaviour were selected: high (3 s), medium (30 s) and low (300 s) frequency of occurrence, in order to reflect different types of behaviour that occur very frequently, less frequently, or infrequently62 (Fig. 3). The observation period was one-hour in length (3600 s). A total of 100 simulated data sets were generated for each of the three response frequencies. The exact time that each event occurred within the 3, 30 or 300 s period was randomised within the predefined blocks (e.g. the behaviour exactly once within its 3, 30 or 300 s period).
The real (continuous) occurrence of each simulated response frequency was determined by calculating the number of seconds of each event that were possible in a simulated hour of data (observation period divided by frequency of occurrence; high frequency = 1200 s; medium frequency = 120 s; and low frequency = 12 s). The event behaviour seconds were then transformed into a percentage of total time (as is often shown in behaviour studies in the form of an activity budget), as well as frequency of occurrence. Thus, high frequency (3 s) responses occurred 33% of the hour, medium frequency (30 s) responses occurred 3.3%, and the low frequency (300 s) responses occurred 0.3% of the time.
To compare against this real (continuous) measurement, pinpoint and one-zero sampling were used on the simulated data sets. One-zero sampling recorded an event if it occurred at any point during the observation period, also commonly referred to as partial interval recording (PIR). The three interval lengths (5, 50, and 500 s) were used for both pinpoint and one-zero sampling. This resulted in nine-hundred data sets (nine combinations of simulation parameters and sampling parameters, each combination simulated 100 times) being developed.
This simulation was developed for longer duration or state behaviours. In the literature, state behaviours can be of variable length, lasting anywhere from seconds (e.g. scratching) to minutes (e.g. preening) or hours (e.g. resting). To accommodate this, three levels of behavioural duration were selected. These durations were set as short (3 s), medium (30 s) and long (300 s) durations of occurrence (Fig. 4). Each of these states were treated separately (only short, medium, or long behaviours occurred in each simulation). As per the Response Frequency investigation, the observation period was set to one-hour in length (3600 s). Each duration simulation was repeated 100 times.
The chosen behaviour occurred once per 600 s period. The exact time that each behaviour occurred within its respective 600 s period was selected at random (though the behaviour was not allowed to slip into the next period of 600 s). Continuous data sets were developed by using the raw, simulated data and transforming this into percentages. This meant that each behaviour occurred six times during each one-hour simulation, with the short duration (3 s) responses occurring 0.5% of the hour, the medium duration (30 s) responses occurring 5%, and the long duration (300 s) responses occurring 50% of the time.
Each of the three behaviour durations (short, medium, and long) were measured using one-zero (PIR) and pinpoint sampling. Three interval lengths were recorded, again consisting of 5 s, 50 s and 500 s, as had been selected for the Response Frequency investigations. These interval lengths were used for both the pinpoint and one-zero sampling. Once complete, the results were then transformed into percentages and compared to the continuous data to determine the level of error.
Statistical analyses were conducted on the mean error scores for pinpoint and one-zero sampling at each respective interval length. The Friedman test was used to investigate whether there was a statistically significant effect of sampling method on the estimation error. The sampling/simulation combination was used as a blocking factor. The non-parametric Friedman test was used due to the non-normality of the errors and the observed heteroscedascity. When significant differences were found, paired Wilcoxon tests were used to compare the treatments. To compensate for multiple comparisons, we used an FDR adjustment. The method is statistically not-biased, meaning the process is giving an estimate of the true parameters that are correct (i.e. not over- nor under-estimated).
Fraser, D. Animal behaviour, animal welfare and the scientific study of affect. Appl. Anim. Behav. Sci. 118, 108–117. https://doi.org/10.1016/j.applanim.2009.02.020 (2009).
Sands, J. & Creel, S. Social dominance, aggression and faecal glucocorticoid levels in a wild population of wolves, Canis lupus. Anim. Behav. 67, 387–396. https://doi.org/10.1016/j.anbehav.2003.03.019 (2004).
Carlstead, K., Seidensticker, J. & Baldwin, R. Environmental enrichment for zoo bears. Zoo Biol. 10, 3–16. https://doi.org/10.1002/zoo.1430100103 (1991).
Fernandez, E. J. & Timberlake, W. Mutual benefits of research collaborations between zoos and academic institutions. Zoo Biol. 27, 470–487. https://doi.org/10.1002/zoo.20215 (2008).
Ward, S. J., Sherwen, S. & Clark, F. E. Advances in applied zoo animal welfare science. J. Appl. Anim. Welf. Sci. 21, 23–33. https://doi.org/10.1016/0003-3472(79)90016-2 (2018).
Lehner, P. N. Handbook of Ethological Methods (Cambridge University Press, 1998).
Domjan, M. The Principles of Learning and Behavior (Nelson Education, 2014).
Pierce, W. D. & Cheney, C. D. Behavior Analysis and Learning (Psychology Press, 2013).
Altmann, J. Observational study of behavior: Sampling methods. Behavior 49, 227–266. https://doi.org/10.1163/156853974X00534 (1974).
Bateson, M. & Martin, P. Measuring Behaviour: An Introductory Guide 4th edn. (Cambridge University Press, 2021).
Bernstein, I. S. An empirical comparison of focal and ad libitum scoring with commentary on instantaneous scans, all occurrence and one-zero techniques. Anim. Behav. 42, 721–728. https://doi.org/10.1016/S0003-3472(05)80118-6 (1991).
Martin, P. & Bateson, P. Recording methods. In ‘Measuring Behaviour: An Introductory Guide’ (Cambridge University Press, 2007).
Rhine, R. J. & Ender, P. B. Comparability of methods used in the sampling of primate behavior. Am. J. Primatol. 5, 1–15. https://doi.org/10.1002/ajp.1350050102 (1983).
Mann, J. Behavioral sampling methods for cetaceans: A review and critique. Mar. Mamm. Sci. 15, 102–122. https://doi.org/10.1111/j.1748-7692.1999.tb00784.x (1999).
Hämäläinen, W. et al. Measuring behaviour accurately with instantaneous sampling: A new tool for selecting appropriate sampling intervals. Appl. Anim. Behav. Sci. 180, 166–173. https://doi.org/10.1016/j.applanim.2016.04.006 (2016).
Tyler, S. Time-sampling: A matter of convention. Anim. Behav. 27, 801–810. https://doi.org/10.1016/0003-3472(79)90016-2 (1979).
Amato, K. R., Van Belle, S. & Wilkinson, B. A comparison of scan and focal sampling for the description of wild primate activity, diet and intragroup spatial relationships. Folia Primatol. 84, 87–101. https://doi.org/10.1159/000348305 (2013).
Fernandez, E. J., Kinley, R. C. & Timberlake, W. Training penguins to interact with enrichment devices for lasting effects. Zoo Biol. 38, 43–49. https://doi.org/10.1002/zoo.21510 (2019).
Sanders, K. & Fernandez, E. J. Behavioral implications of enrichment for golden lion tamarins: A tool for ex situ conservation. J. Appl. Anim. Welf. Sci. 1, 1–10 (2020).
Stevens, J., Thyssen, A., Laevens, H. & Vervaecke, H. The influence of zoo visitor numbers on the behaviour of harbour seals (Phoca vitulina). J. Zoo Aquat. Res. 1, 31–34. https://doi.org/10.19227/jzar.v1i1.20 (2013).
Grenier, D., Barrette, C. & Crête, M. Food access by white-tailed deer (Odocoileus virginianus) at winter feeding sites in Eastern Québec. Appl. Anim. Behav. Sci. 63, 323–337. https://doi.org/10.1016/S0168-1591(99)00017-9 (1999).
Gilby, I. C., Pokempner, A. A. & Wrangham, R. W. A direct comparison of scan and focal recording rules for measuring wild chimpanzee feeding behaviour. Folia Primatol. 81, 254–264. https://doi.org/10.1159/000322354 (2010).
Fernandez, E. J., Ramirez, M. & Hawkes, N. C. Activity and pool use in relation to temperature and water changes in zoo hippopotamuses (Hippopotamus amphibious). Animals 10, 1022 (2020).
Fernandez, E. J. & Timberlake, W. Foraging devices as enrichment in captive walruses (Odobenus rosmarus). Behav. Proc. 168, 103943. https://doi.org/10.1016/j.beproc.2019.103943 (2019).
Shora, J. A., Myhill, M. N. G. & Brereton, J. E. Should zoo foods be coati chopped. J. Zoo Aquar. Res. 6, 22–25. https://doi.org/10.19227/jzar.v6i1.309 (2021).
Teixeira, D. L., Machado Filho, L. C. P., Hötzel, M. J. & Enríquez-Hidalgo, D. Effects of instantaneous stocking rate, paddock shape and fence with electric shock on dairy cows’ behaviour. Livestock Sci. 198, 170–173. https://doi.org/10.1016/j.livsci.2017.01.007 (2017).
Pullin, A. N. et al. Instantaneous sampling intervals validated from continuous video observation for behavioral recording of feedlot lambs. J. Anim. Sci. 95, 703–4707. https://doi.org/10.2527/jas2017.1835 (2017).
Bailey, J. S. & Burch, M. R. Research Methods in Applied Behavior Analysis (Routledge, 2017).
Bakeman, R. & Quera, V. Behavioral observation. In APA Handbooks in Psychology APA Handbook of Research Methods in Psychology. Foundations Planning Measures and Psychometrics Vol. 1 (eds Cooper, P. M. et al.) 207–225 (American Psychological Association, 2012). https://doi.org/10.1037/13619-013.
Kraemer, H. C. One-zero sampling in the study of primate behavior. Primates 20, 237–244 (1997).
Brecht, K. F. et al. The status and value of replications in animal behavior science. Anim. Behav. Cogn. 8, 97–106. https://doi.org/10.26451/abc.08.02.01.2021 (2021).
Rhine, R. J. & Flanigon, M. An empirical comparison of one-zero, focal-animal, and instantaneous methods of sampling spontaneous primate social behavior. Primates 19, 353–361. https://doi.org/10.1007/BF02382803 (1978).
Dunkerton, J. Should classroom observation be quantitative?. Educ. Res. 23, 144–151. https://doi.org/10.1080/0013188810230208 (1981).
Omark, D. R., Fiedler, M. L. & Marvin, R. S. Dominance hierarchies: Observational techniques applied to the study of children at play. Instruct. Sci. 5, 403–423. https://doi.org/10.1007/BF00051807 (1976).
Rhine, R. J. & Linville, A. K. Properties of one-zero scores in observational studies of primate social behavior: The effect of assumptions on empirical analyses. Primates 21, 111–122. https://doi.org/10.1007/BF02383828 (1980).
Rhine, R. J., Norton, G. W., Wynn, G. M. & Wynn, R. D. Weaning of free-ranging infant baboons (Papio cynocephalus) as indicated by one-zero and instantaneous sampling of feeding. Int. J. Primatol. 6, 491–499. https://doi.org/10.1007/BF02735572 (1985).
Mitlöhner, F. M. et al. Behavioral sampling techniques for feedlot cattle. J. Anim. Sci. 79, 1189–1193. https://doi.org/10.1016/S0168-1591(99)00017-9 (2001).
Simpson, M. J. A. & Simpson, A. E. One-zero and scan methods for sampling behaviour. Anim. Behav. 25, 726–731. https://doi.org/10.1016/0003-3472(77)90122-1 (1977).
Xiao, J., Wang, K. & Wang, D. Diurnal changes of behavior and respiration of Yangtze finless porpoises (Neophocaena phocaenoides asiaeorientalis) in captivity. Zoo Biol. 24, 531–541. https://doi.org/10.1002/zoo.20070 (2005).
Saibaba, P., Sales, G. D., Stodulski, G. & Hau, J. Behaviour of rats in their home cages: Daytime variations and effects of routine husbandry procedures analysed by time sampling techniques. Lab. Anim. 30, 13–21. https://doi.org/10.1258/002367796780744875 (1996).
Griffin, B. & Adams, R. A parametric model for estimating prevalence, incidence, and mean bout duration from point sampling. Am. J. Primatol. 4, 261–271 (1983).
Harrop, A. & Daniels, M. Methods of time sampling: A reappraisal of momentary time sampling and partial interval recording. J. Appl. Anim. Behav. Anal. 19, 73–77 (1986).
Repp, A. C. et al. A comparison of frequency, interval, and time-sampling methods of data collection. J. Appl. Behav. Anal. 9, 501–508. https://doi.org/10.1901/jaba.1976.9-501 (1976).
Suen, H. K. & Ary, D. Variables influencing one-zero and instantaneous time sampling outcomes. Primates 25, 89–94. https://doi.org/10.1007/BF02382298 (1984).
Gardenier, N. C., MacDonald, R. & Green, G. Comparison of direct observational methods for measuring stereotypic behavior in children with autism spectrum disorders. Res. Devel. Disabilit. 25, 99–118 (2004).
Meany-Daboul, M. G., Roscoe, E. M., Bourret, J. C. & Ahearn, W. H. A comparison of momentary time sampling and partial-interval recording for evaluating functional relations. J. Appl. Behav. Anal. 40, 501–514 (2007).
Murphy, M. J. & Harrop, A. Observer error in the use of momentary time sampling and partial interval recording. Br. J. Psych. 85, 169–179. https://doi.org/10.1111/j.2044-8295.1994.tb02517.x (1994).
Radley, K. C., O’Handley, R. D. & Labrot, Z. C. A comparison of momentary time sampling and partial-interval recording for assessment of effects of social skills training. Psych. Schools 52, 363–378 (2015).
Rapp, J. T. et al. Interval recording for duration events: A re-evaluation. Behav. Interv. 22, 319–345 (2007).
Devine, S. L. et al. Detecting changes in simulated events using partial-interval recording and momentary time sampling III: Evaluating sensitivity as a function of session length. Behav. Interv. 26, 103–124 (2011).
Rapp, J. T. et al. Detecting changes in simulated events using partial-interval recording and momentary time sampling. Behav. Interv. 23, 237–269 (2008).
Wirth, O., Slaven, J. & Taylor, M. A. Interval sampling methods and measurement error: A computer simulation. J. Appl. Behav. Anal. 47, 83–100. https://doi.org/10.1002/jaba.93| (2014).
Cooper, J. O., Heron, T. E. & Heward, W. L. Applied Behavior Analysis 3rd edn. (Pearson Publishing, 2020).
Poling, A., Methot, L. L. & LeSage, M. G. Fundamentals of Behavior Analytic Research (Springer, 1995).
Doran, D. M. Comparison of instantaneous and locomotor bout sampling methods: A case study of adult male chimpanzee locomotor behavior and substrate use. Am. J. Phys. Anthropol. 89, 85–99. https://doi.org/10.1002/ajpa.1330890108| (1992).
Merrell, K. Assessment of children’s social skills: Recent developments, best practices, and new directions. Exceptionality 9, 3–18 (2001).
Seyfarth, R. M., Cheney, D. L. & Marler, P. Vervet monkey alarm calls: Semantic communication in a free-ranging primate. Anim. Behav. 28, 1070–1094 (1980).
Buskist, W. & Johnston, J. M. Laboratory lore and research practices in the experimental analysis of human behavior. The Behavior Analyst 11, 41–42 (1988).
Johnston, J. M. & Pennypacker, H. S. Strategies and Tactics of Behavioral Research (Routledge, 2010).
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2020). https://www.R-project.org/.
Leger, D. W. An empirical evaluation of instantaneous and one-zero sampling of chimpanzee behavior. Primates 18, 387–393. https://doi.org/10.1007/BF02383116 (1977).
Farrar, B. G., Voudouris, K. & Clayton, N. S. Replications, comparisons, sampling and the problem of representativeness in animal cognition research. Anim. Behav. Cogn. 8, 273–295. https://doi.org/10.26451/abc.08.02.14.2021 (2021).
The authors would like to thank Mrs S Brereton for proofreading the manuscript. As noted in the text, the code has been made publicly accessible. This allows anyone to adjust all variables and run the simulations and comparisons themselves: https://github.com/jonotuke/animal_simulation_2020.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Brereton, J.E., Tuke, J. & Fernandez, E.J. A simulated comparison of behavioural observation sampling methods. Sci Rep 12, 3096 (2022). https://doi.org/10.1038/s41598-022-07169-5
This article is cited by
Social touch in the age of computational ethology: Embracing as a multidimensional and complex behaviour
Current Psychology (2022)
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.