Statistical Analysis of Zebrafish Locomotor Behaviour by Generalized Linear Mixed Models

Liu, Yiwen; Ma, Ping; Cassidy, Paige A.; Carmer, Robert; Zhang, Gaonan; Venkatraman, Prahatha; Brown, Skye A.; Pang, Chi Pui; Zhong, Wenxuan; Zhang, Mingzhi; Leung, Yuk Fai

doi:10.1038/s41598-017-02822-w

Download PDF

Article
Open access
Published: 07 June 2017

Statistical Analysis of Zebrafish Locomotor Behaviour by Generalized Linear Mixed Models

Yiwen Liu¹,
Ping Ma¹,
Paige A. Cassidy²,
Robert Carmer^2,3,
Gaonan Zhang²,
Prahatha Venkatraman²,
Skye A. Brown²,
Chi Pui Pang⁴,
Wenxuan Zhong¹,
Mingzhi Zhang⁵ &
…
Yuk Fai Leung^2,6,7,8

Scientific Reports volume 7, Article number: 2937 (2017) Cite this article

4214 Accesses
25 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Upon a drastic change in environmental illumination, zebrafish larvae display a rapid locomotor response. This response can be simultaneously tracked from larvae arranged in multi-well plates. The resulting data have provided new insights into neuro-behaviour. The features of these data, however, present a challenge to traditional statistical tests. For example, many larvae display little or no movement. Thus, the larval responses have many zero values and are imbalanced. These responses are also measured repeatedly from the same well, which results in correlated observations. These analytical issues were addressed in this study by the generalized linear mixed model (GLMM). This approach deals with binary responses and characterizes the correlation of observations in the same group. It was used to analyze a previously reported dataset. Before applying the GLMM, the activity values were transformed to binary responses (movement vs. no movement) to reduce data imbalance. Moreover, the GLMM estimated the variations among the effects of different well locations, which would eliminate the location effects when two biological groups or conditions were compared. By addressing the data-imbalance and location-correlation issues, the GLMM effectively quantified true biological effects on zebrafish locomotor response.

Emergence of consistent intra-individual locomotor patterns during zebrafish development

Article Open access 20 September 2019

Jennifer A. Fitzgerald, Krishna Tulasi Kirla, … Colette M. vom Berg

BonZeb: open-source, modular software tools for high-resolution zebrafish tracking and analysis

Article Open access 14 April 2021

Nicholas C. Guilbeault, Jordan Guerguiev, … Tod R. Thiele

Automated analysis of activity, sleep, and rhythmic behaviour in various animal species with the Rtivity software

Article Open access 09 March 2022

Rui F. O. Silva, Brígida R. Pinho, … Jorge M. A. Oliveira

Introduction

Zebrafish are widely used in neurobehavioural research because this model confers several unique advantages. For example, zebrafish have high fecundity and routinely lay hundreds of embryos when mated in pairs. These embryos are also small and develop quickly into freely-swimming larvae in three to four days, which makes simultaneous tracking of their locomotor behavior under different experimental conditions straightforward. This approach has indeed generated data that provide new insights into neurobiology^{1,2,3,4,5,6,7,8,9,10,11,12,13,14,15}, pharmacology^{3, 5,6,7, 9,10,11,12, 16,17,18} and toxicology^{19,20,21,22,23,24,25,26,27}. Nonetheless, the resulting data are high-dimensional and complex, and require new methods of statistical analysis to unveil critical information about the underlying neurobehaviour.

To illustrate the analytical challenges, we will focus on one popular approach for high-throughput behavioural analysis: the visual motor response (VMR). This is an instantaneous locomotor response displayed by zebrafish larvae upon drastic light onset or offset^{4, 28,29,30}. In a typical VMR experiment, zebrafish are arranged in a 96-well plate, isolated from environmental light in a lightproof chamber, and stimulated by controlled white light. Their activities are recorded and summarized as the number of pixels moved in successive frames or as absolute displacement³¹. The resulting VMR data have two major features. First, the distribution of the larval activity would likely deviate from a Gaussian distribution because many larvae display little or no movement. This deviation creates data imbalance, and may pose challenges to statistical analysis since most traditional methods rely on the assumption of a Gaussian distribution. Second, the larval activities are observed repeatedly over time and in groups, such as larvae from the same location of the plate. Different locations in a well plate may have different effects on larval activity. Treating all location equally ignores not only the variations among those location effects, but also the correlations of larvae in the same location of the plate. This variation accounts for the unobserved heterogeneity of the data, while the correlation between larvae in the same wells would result in correlated samples. These repeated observations must be properly handled during data analysis.

These data features pose challenges to analyzing VMR or similar locomotor data by traditional approaches. For example, the t-test and analysis of variance (ANOVA) are often used to compare data between two groups, and three or more groups respectively. These tests have been implemented in analyzing similar locomotor data³² despite several limitations: the t-test has a higher Type I error rate when more comparisons are performed, whereas both t-test and ANOVA do not handle the time-dependency issue as commonly observed in time-series data. This time-dependency issue is often tackled by repeated-measures ANOVA^{13, 21, 33, 34}, a variant of ANOVA that can handle dynamical changes in behavior and repeatedly measured samples that are correlated in time. This analysis, however, assumes that the variances of the differences between group combinations are equal, an assumption that is hardly satisfied in behavioural data. To address these analytical issues, we recently introduced the Hotelling’s T-squared test and multivariate analysis of variance (MANOVA; a multivariate analog of ANOVA) for analyzing locomotor data³¹. Hotelling’s T-squared test not only reduces the Type I error rate compared to the t-test, but it also takes into account the time dependency between repeated measures; whereas MANOVA considers the time dependency and quantifies the effect sizes of variables that contribute to locomotor behaviour. These two methods, however, still treat samples collected in the same location of the plate as independent measurements and do not consider the correlations between them. They also do not address the data-imbalance issue, where the larval activity does not satisfy normality assumption. Several zebrafish behavioural studies used nonparametric tests such as Kruskal-Wallis test and Wilcoxon signed-rank test when the normality test indicated the data were not normally distributed^{35,36,37,38,39,40}. However, simple non-parametric tests have their disadvantages. For example, they cannot make quantitative statements about the difference between two groups. Moreover, non-parametric tests such as Wilcoxon signed-rank also suffer from loss of information, since they only utilize the ranks of the data. They are also less sensitive to differences between groups and require a larger sample size to achieve the same power as parametric tests.

To address these analytical issues, we present an alternative approach for the analysis of locomotor behaviour: the generalized linear mixed model (GLMM). This approach handles binary response variables and can be used to estimate the probability of the binary response based on multiple predictors⁴¹. It also assumes that the conditional distribution of the response variable is a Bernoulli distribution rather than a Gaussian distribution. When this approach is used to analyze locomotor data, the activity values are transformed into binary responses and encoded as 0 (no movement) and 1 (otherwise). This transformation renders the data less imbalanced. Moreover, different from displacement, the GLMM characterizes the larval activity by the proportion of larvae moved at each second, which represents how active the whole group is. Furthermore, it also treats group-level terms, such as location, as random effects. By controlling for these unobserved covariates, the approach can efficiently estimate the coefficients of other variables. It also adjusted for the lack of independence among the multiple observations for each location. The GLMM was used in this study to analyze a standard VMR dataset that was used previously to develop the Hotelling’s T-squared test³¹. Our results indicate that the GLMM efficiently handled the complex structure of high-throughput behavioral data. This GLMM approach complements the Hotelling’s T-squared test for analyzing VMR data, and these approaches together establish a framework that can be used to analyze behavioural data with a similar structure.

Materials and Methods

Experimental data

All experimental data were previously collected, reported³¹ and are accessible from the Harvard Dataverse (http://dx.doi.org/10.7910/DVN/HTXXKW). A summary of the data will be provided here. These data were collected from VMR experiments on three wild-type (WT) zebrafish strains: AB, TL and TLAB. Their VMR were analyzed daily from 3 days post-fertilization (dpf) to 9 dpf, using a standard VMR experimental scheme^{4, 14, 15, 18, 28, 31}. In this scheme, the larvae were arrayed in a 96-well plate and dark adapted for 3.5 hrs. They were then subjected to three consecutive trials of light onset (Light-On) and light offset (Light-Off). Each trial session lasted for 30 mins. We also controlled other experimental variables that might affect the results³¹. For example, only healthy larvae were included in the final analysis, and the same type of 96-well plate was used for all experiments. We also ran all strains separately. The reasons of this experimental design are as follows. Firstly, to quantify the variations among wells (locations), the biological groups (i.e. strains) should not be confounded with the technical groups (i.e. locations). In other words, there was no interaction between strain and location under such circumstance. We could then assume larvae at the same location would have the same location effect, regardless of their biological groups. Secondly, we also conducted biological replicates of each experiment. The estimated strain effect is the “mean effect” of all replicates, which alleviated the variation due to running all strains separately and ensured a valid biological interpretation of the results.

Statistical Analysis

Activity summarization

The VMR dataset used in this study summarized the larval activity as Burst Duration, the fraction of frames in each second that a larva moved³¹. Each frame was compared with the previous one. A larva would be declared moving in a frame if it moved more than a preset threshold. However, these summarized Burst-Duration values were imbalanced since a large number of zebrafish larvae displayed little or no movement at all. This data-imbalance issue was handled by transforming the Burst-Duration values into binary responses: all non-zero values were transformed to 1 or 0 otherwise.

Data modeling and statistical inference

In the VMR experiment, zebrafish larvae would display very drastic movement after sudden light change. We previously used their activity data from the 30-second period after each light change to develop analytical tools for VMR data³¹. In this study, we used the same period of time for analysis. All explanatory variables in our models are introduced in section Explanatory variables. The effects of these variables are analyzed by GLMM and introduced in section GLMM. All statistical analyses were performed using R software version 3.2.3 (https://www.r-project.org). The analysis scripts are available in the Supplementary file.

Explanatory variables

The explanatory variables used in our models include: (1) Strain: AB, TL and TLAB; (2) Stage: 3–9 dpf; (3) Light stimulus: light-onset sessions (Light-On) and light-offset sessions (Light-Off); (4) Time: 1–30 seconds after the light change; (5) Location: The well position in a 96-well plate; and (6) Interactions between the variables (1–5).

GLMM

Assume that y _ij is the observation of the j th zebrafish larva in group i for j = 1, …, n _i, with y _ij = 1 representing an active zebrafish larva and y _ij = 0 otherwise, and x _ij as a column vector of values of explanatory variables for this larva. Then, the GLMM⁴¹ has the following form:

$$logit[\,P(\,{y}_{ij}=1|{\gamma }_{i}\,)]={{\boldsymbol{x}}}_{ij}^{T}{\boldsymbol{\beta }}+{\gamma }_{i},$$

(1)

where $logit[\,P({y}_{ij}=1|{\gamma }_{i})\,]=\,\mathrm{log}\,[\frac{P({y}_{ij}=1|{\gamma }_{i})}{1-P\,({y}_{ij}=1|{\gamma }_{i}\,)}]$, and β represents the fixed-effect model parameters. γ _i is the random effect of group i, and {γ _i} are independent N(0, σ ²). In our studies, the larval location in the 96-well plate was modeled as the random effect. Zebrafish larvae at different locations (i.e. wells) were independent from each other, whereas zebrafish larvae at the same location had the same effect size. Other explanatory variables included the strain and stage of the zebrafish, time, and light stimulus.

Results

In this study, we used the GLMM to resolve the aforementioned data-analysis issues that were not handled by both traditional analyses and Hotelling’s T-squared test³¹. We focused on the 1^st Light-On session (i.e. 1^st technical repeat) in this study whenever possible. This selection simplified the analysis, as the 1^st Light-On session (i.e. 1^st technical repeat) was qualitatively different from the 2^nd and 3^rd (p < 0.05) due to a difference in the length of the prior dark adaptation³¹. By using one technical repeat, we could effectively compare the analyses of the Hotelling’s T-squared test and GLMM, and illustrate the potential of the GLMM in VMR data analysis. Three examples will be shown below.

Example 1: Difference in activities of different WT strains during the same time interval

To determine this difference, a model (1) was built on the VMR data of larvae at 6 dpf from 1 to 30 s with the following variables: strain (categorical; S), time and its squared term (continuous; t and t ²), their interactions, and random effect of location (γ _i). The time variable was centered to have a mean of zero to reduce the degree of multicollinearity. The squared term of time was included since the log odds of moving larvae proportion were not linear across time. For each strain m (AB, TL, TLAB), we denoted ${\beta }_{s}^{(m)},{\beta }_{{I}_{1}}^{(m)}$, and ${\beta }_{{I}_{2}}^{(m)}$ as the coefficients of strain, and its interactions with t and t ² respectively. Each level of the categorical variables was compared with a reference level. For example, AB was the reference level for the variable strain, and its corresponding coefficient was set to zero (i.e. β _s satisfied the constraint ${\beta }_{s}^{(AB)}=0)$. The results of the model were interpreted as log odds of activeness (LOA), defined as

$$LOA=\,\mathrm{log}\,\frac{P({y}_{ij}=1|{\gamma }_{i})}{1-P({y}_{ij}=1|{\gamma }_{i})}.$$

(2)

The LOA describes the likelihood that the larva in a particular location would move. Its value might be different in different strains. The LOA difference between different strains (dLOA) was deduced by the following formula, using the comparison between TL and AB strains in the same location as an example:

$$LOA(TL)-LOA(AB)={\beta }_{s}^{(TL)}-{\beta }_{s}^{(AB)}+({\beta }_{{I}_{1}}^{(TL)}-{\beta }_{{I}_{1}}^{(AB)})t+({\beta }_{{I}_{2}}^{(TL)}-{\beta }_{{I}_{2}}^{(AB)}){t}^{2}.$$

Since the other basic effects were the same between the two strains at time t, they canceled each other. The random effects (γ _i) also canceled each other for the same location. The first part of the formula, ${\beta }_{s}^{(TL)}-{\beta }_{s}^{(AB)}$, describes the average shift in LOA between TL and AB independent of time; whereas the second and third parts indicate how dLOA changed over time.

The fitting results are shown in Table 1, and the corresponding data are plotted in the left panel of Fig. 1. The effect of strain on LOA was decomposed into two parts. The first part was in the main effect. The LOAs of TL and TLAB strains increased by 1.2551 (p < 0.0001) and 0.5601 (p < 0.0001) respectively when compared with that of AB. Thus, more TL and TLAB larvae tended to move when exposed to a Light-On stimulus. In terms of instant behaviour $({\rm{i}}.{\rm{e}}.\,{\rm{at}}\,t=1)$, there was no significant difference between TL and TLAB larvae. Their LOAs increased by 0.6076 and 0.8342 respectively when compared with that of AB. This can be due to the SNPs in the TL line were associated with dominant genes for higher locomotor activities. The hybrid TLAB line would therefore still display the dominant active phenotype. The second part of strain effect on LOA was in its interaction with time. The dLOA between TL and TLAB had a significant linear pattern over time (2.1698, p < 0.0001); whereas the dLOA between TLAB and AB strains had significant linear (−3.0496, p < 0.0001) and quadratic patterns (−7.9245, p = 0.0330) over time. This trend was also shown by the predicted probability of WT strains that moved during the same time interval (Fig. 1, middle panel). When TL and TLAB strains were exposed to a Light-On stimulus, more of them tended to move compared to AB. TLAB larvae, however, returned to the baseline activity faster than AB and TL larvae, as the proportion of moving TLAB larvae decreased faster than that of AB and TL.

Table 1 The GLMM results of Light-On VMR from 1 to 30s for different WT strains at 6 dpf.

Full size table

This comparison unveiled new information compared to that obtained from the Hotelling’s T-squared test performed on the same data (Table 2; also plotted accordingly in the right panel of Fig. 1)³¹. In our previous work, the Hotelling’s T-squared test showed that the average activity of every strain across time was different from the others (Table 2; p-values for all pairwise comparison were <0.0001). As a complementary analysis, the GLMM further improves on the explanation of this difference: (1) TL had a larger strain effect (Table 1, main effects); (2) the LOA of TLAB strain decreased faster than that of TL strain during the 30s period; and (3) the LOAs of all strains decreased in significant nonlinear patterns across time (Table 1, interactions with time).

Table 2 The Hotelling’s T-squared test of Light-On VMR data used in Table 1.

Full size table

Difference in activity of the same strain

Example 2: Difference in activity of the same strain during light onset and offset

The larvae displayed substantially different activities by the Light-On and Light-Off stimuli. This difference was quantitatively evaluated by GLMM, using VMR data of AB at 6 dpf from 1 to 30s (i.e. after light change) with the following variables: light stimulus (categorical; L), time and its squared term (continuous; t and t ²), their interactions, and random effect of location (γ _i). For light stimulus l (ON or OFF), we denoted ${\beta }_{L}^{(l)},\,{\beta }_{{I}_{1}}^{(l)}$ and ${\beta }_{{I}_{2}}^{(l)}$ as the coefficients of light stimulus and its interactions with t and t ² respectively.

The fitting results are summarized in Table 3, and the corresponding data are plotted in the left panel of Fig. 2. In main effect, the LOA of larvae upon Light-On stimulus was significantly smaller than that upon Light-Off stimulus (−3.0963; p < 0.0001). In other words, fewer zebrafish larvae moved upon Light-On stimulus. The larvae also displayed different decreasing LOA patterns during light onset and offset, as evidenced by the significant dLOAs in the interaction between light stimulus and time (linear term: −2.6759, p < 0.0001; quadratic term: 8.4131, p < 0.0043).

Table 3 The GLMM results of VMR data from 1 to 30s for AB larvae at 6 dpf.

Full size table

The GLMM results were more informative compared to those obtained from the Hotelling’s T-squared test on the same data. Previously we showed that the larval activity differed upon light onset and offset by Hotelling’s T-squared test (p < 0.0001). We further explained this difference using the results of the GLMM (Fig. 2, middle panel; Table 3, main effects), which showed that the proportion of active larvae was larger during light onset than offset. The GLMM also revealed that larval activity decreased at a different rate upon light onset and offset (Table 3, interaction with time). The results from the two analyses therefore complemented each other and described different aspects of the larval activity.

Example 3: Difference in activities of larvae at different developmental stages

The VMR data were collected from 3 to 9 dpf, when the larvae were developing. Their maturation would alter the locomotor behaviour^{14, 31}. This developmental difference was modeled by the GLMM, using the VMR data of AB larvae. The model focused on the 1^st technical repeat of the Light-On stimulus from 1 to 30 s. It comprised the following variables: stage (categorical; G), time and its squared term (continuous; t and t ²), their interaction s, and random effect of location (γ _i). For stage k (3, …, 9), we denoted ${\beta }_{G}^{(k)},\,{\beta }_{{I}_{1}}^{(k)}$ and ${\beta }_{{I}_{2}}^{(k)}$ as coefficients of stage and its interactions with t and t ² respectively.

The fitting results of 3, 6 and 9 dpf are summarized in Table 4, and the corresponding data are plotted in the left panel of Fig. 3. The effect of stage on LOA was decomposed into two parts. The first part was the main effect. The LOA of AB larvae was significantly larger at 9 dpf than 3 and 6 dpf (0.9693, p < 0.0001; 1.1838, p < 0.0001). Thus, fewer zebrafish larvae tended to move upon light onset at 3 and 6 dpf than at 9 dpf (Fig. 3, middle panel). The second part of stage effect on LOA was in the interactions of stage effect with time. The LOAs of larvae at all stages decreased in nonlinear patterns, and were quite different from each other (Table 4, interactions with time). At 3 dpf, the LOA gradually increased upon light onset, and then gradually decreased (Table 4, β _t = −5.6808; ${\beta }_{{t}^{2}}$ = −23.7371). At 6 dpf, however, the LOA drastically increased upon light onset and then gradually decreased (Table 4, ${\beta }_{t}+{\beta }_{{I}_{1}}^{(6)}$= −2.4029; ${\beta }_{{t}^{2}}+{\beta }_{{I}_{2}}^{(6)}$= 12.5912). At 9 dpf, the LOA decreased in both linear and nonlinear pattern over time, and its nonlinear term was significantly different than that at 3 dpf (Table 4, ${\beta }_{{I}_{2}}^{(9)}-{\beta }_{{I}_{2}}^{(3)}$=14.7728, p < 0.0001).

Table 4 The GLMM results of Light-On VMR from 1 to 30s for AB larvae at different stages.

Full size table

These results again unveiled new information and complemented the findings from the Hotelling’s T-squared test that were thoroughly discussed in our previous study³¹. For example, in our previous work, the Hotelling’s T-squared test showed a significant difference in the activities of larvae from different stages of development upon the same Light-On stimulus (Table 5); whereas the results of GLMM in this study further explained the details of these differences. First, the LOAs were different between larvae at 3 and 6 dpf due to significant interactions with time (Table 4; ${\beta }_{{I}_{1}}^{(6)}-{\beta }_{{I}_{1}}^{(3)}$: 3.2779; ${\beta }_{{I}_{2}}^{(6)}-{\beta }_{{I}_{2}}^{(3)}$: 36.3283). This indicates the LOAs of these larvae were changing differently over time, as also shown by the left panel of Fig. 3. Second, the LOAs were different between larvae at 3 dpf and 9dpf, both at the main-effect level (Table 4; ${\beta }_{G}^{(9)}-{\beta }_{G}^{(3)}$:0.9693), and the interactions-with-time level (${\beta }_{{I}_{2}}^{(9)}-{\beta }_{{I}_{2}}^{(3)}$: 14.7728). The significant main effect suggests the LOA curves between these larvae should be similar in shape, as demonstrated in the left panel of Fig. 3. The main difference between these curves is that the one for 9-dpf larvae was generally shifted upward (${\beta }_{G}^{(9)}-{\beta }_{G}^{(3)}$: 0.9693). Third, the LOAs were different between larvae at 6 dpf and 9 dpf in both the main effect (Table 4; ${\beta }_{G}^{(9)}-{\beta }_{G}^{(6)}$: 1.1838) and the interactions with time (${\beta }_{{I}_{1}}^{(9)}-{\beta }_{{I}_{1}}^{(6)}$: −2.3585; ${\beta }_{{I}_{2}}^{(9)}-{\beta }_{{I}_{2}}^{(6)}$: −21.5533). This indicates that the LOA curve of the 9-dpf larvae was shifted upward and was changing in a nonparallel pattern compared with that of the 6-dpf larvae, as showed in the left panel of Fig. 3.

Table 5 The Hotelling’s T-squared test of Light-On VMR data used in Table 4.

Full size table

Discussion

The locomotor behaviour of zebrafish larvae has been widely used to study neurobehaviour. One reason for this popularity is that these larvae are small and are amenable for high-throughput collection of data from multiple larvae arranged in multi-well plates. The larval activities collected from this arrangement, however, present challenges to statistical analysis. These values are not only measured repeatedly over time, but they are also imbalanced and correlated in time and by location. These statistical issues cannot be dealt with by traditional methods including the t-test and ANOVA. In a previous study, we addressed the time-dependency issue by the Hotelling’s T-squared test³¹. In this investigation, we addressed the data-imbalance problem and location-correlation issue using the GLMM.

The GLMM modeled the relationship between binary responses and explanatory predictors with both fixed and random effects⁴¹. This approach offered at least two advantages in analyzing VMR data: First, it reduced the degree of imbalance in zebrafish responses by transforming the activity values into binary responses. All non-zero values were transformed to 1, or zero otherwise. For example, all larvae in the right panel of Fig. 1 had mean activities less than 0.075. Many of them actually did not move during any second and had a zero in their response value. This phenomenon resulted in more zero values in responses. When these larval activities were transformed into binary responses, the maximum proportions of moving larvae were close to 0.5, comparable to the proportion of zero values. Second, the GLMM considered the location effect introduced by repeated measurements as a random effect, and explicitly quantified the variations among different locations by estimating the variance of random effect. For example, the variance of random effect in Example 1 was estimated to be 0.1149, indicating that the variations among different wells was 0.1149.

The GLMM had some limitations in analyzing VMR data. First, it only handled binary responses and quantified the probability of larval movement; it could not handle larval displacement. Second, the LOAs from the model might show nonlinear patterns across time (Fig. 2, left and right panel), and the linear and quadratic terms in the model could not capture such patterns. To address the first limitation, we propose that the VMR data should be analyzed by both the GLMM and the Hotelling’s T-squared test. The GLMM quantifies the probability of moving (i.e. how many larvae moved), whereas the Hotelling’s T-squared test defines whether the mean activities (i.e. how much the larvae moved) between two groups are different. Combining these observations would provide a better interpretation of the larval movement. The Hotelling’s T-squared test would also facilitate building a GLMM. When the test finds larval activities significantly affected by certain variables, these variables can be used to build the GLMM. To address the second limitation of GLMM, further analysis should be focused on using smoothing spline ANOVA, a nonparametric model to characterize the nonlinear pattern across time.

To conclude, this study has implemented the GLMM to solve the data-imbalance and location-correlation issues in VMR data analysis. This approach also complements the Hotelling’s T-squared test. Together, they reveal distinctive aspects of locomotor output of a group of larvae induced by light and by different experimental perturbations. This information would facilitate the analysis of activation circuitry that drives locomotor behaviour in zebrafish⁴². Such knowledge may aid translating the interesting findings from neurobiology^{1,2,3,4,5,6,7,8,9,10,11,12,13,14,15}, pharmacology^{3, 5,6,7, 9,10,11,12, 16,17,18} and toxicology^{19,20,21,22,23,24,25,26,27} to humans. We recommend the following general data-analysis workflow: (1) Compare the average larval activities of different groups with the Hotelling’s T-squared test; (2) Select significant variables as candidate predictors and apply the GLMM to model the relationship between binary responses and candidate predictors; and (3) Combine the results from (1 & 2) to interpret larval activities. These two statistical approaches therefore have established a statistical framework for VMR analysis that can be generalized to other locomotor behavioural data with similar data structure. This framework is expected to provide new insights into neurobehavioural studies.

References

Prober, D. A., Rihel, J., Onah, A. A., Sung, R.-J. & Schier, A. F. Hypocretin/orexin overexpression induces an insomnia-like phenotype in zebrafish. J. Neurosci. 26, 13400–13410 (2006).
Article CAS PubMed Google Scholar
Kokel, D. et al. Rapid behavior-based identification of neuroactive small molecules in the zebrafish. Nat. Chem. Biol. 6, 231–237 (2010).
Article CAS PubMed PubMed Central Google Scholar
Rihel, J. et al. Zebrafish behavioral profiling links drugs to biological targets and rest/wake regulation. Science 327, 348–351 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Emran, F. et al. OFF ganglion cells cannot drive the optokinetic reflex in zebrafish. Proc. Natl. Acad. Sci. USA 104, 19126–19131 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Bruni, G. et al. Zebrafish behavioral profiling identifies multitarget antipsychotic-like compounds. Nat. Chem. Biol. 12, 559–66 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hoffman, E. J. et al. Estrogens Suppress a Behavioral Phenotype in Zebrafish Mutants of the Autism Risk Gene, CNTNAP2. Neuron 89, 725–33 (2016).
Article CAS PubMed PubMed Central Google Scholar
Kokel, D. et al. Photochemical activation of TRPA1 channels in neurons and animals. Nat. Chem. Biol. 9, 257–63 (2013).
Article CAS PubMed PubMed Central Google Scholar
Rennekamp, A. J. et al. σ1 receptor ligands control a switch between passive and active threat responses. Nat. Chem. Biol. 12, 552–8 (2016).
Article CAS PubMed PubMed Central Google Scholar
Nath, A. K. et al. Chemical and metabolomic screens identify novel biomarkers and antidotes for cyanide exposure. FASEB J. 27, 1928–38 (2013).
Article CAS PubMed PubMed Central Google Scholar
Woods, I. G. et al. Neuropeptidergic signaling partitions arousal behaviors in zebrafish. J. Neurosci. 34, 3142–60 (2014).
Article CAS PubMed PubMed Central Google Scholar
Dinday, M. T. & Baraban, S. C. Large-Scale Phenotype-Based Antiepileptic Drug Screening in a Zebrafish Model of Dravet Syndrome(1,2,3). eNeuro 2 (2015).
Baxendale, S. et al. Identification of compounds with anti-convulsant properties in a zebrafish model of epileptic seizures. Dis. Model. Mech. 5, 773–84 (2012).
Article CAS PubMed PubMed Central Google Scholar
Fernandes, A. M. et al. Deep Brain Photoreceptors Control Light-Seeking Behavior in Zebrafish Larvae. Curr. Biol. 22, 2042–2047 (2012).
Article CAS PubMed PubMed Central Google Scholar
Gao, Y. et al. Computational classification of different wild-type zebrafish strains based on their variation in light-induced locomotor response. Comput. Biol. Med. 69, 1–9 (2016).
Article ADS PubMed Google Scholar
Gao, Y. et al. A high-throughput zebrafish screening method for visual mutants by light-induced locomotor response. IEEE/ACM Trans. Comput. Biol. Bioinforma. 11, 693–701 (2014).
Article Google Scholar
Rihel, J. & Schier, A. F. Behavioral screening for neuroactive drugs in zebrafish. Dev. Neurobiol. 72, 373–385 (2012).
Article CAS PubMed Google Scholar
Laggner, C. et al. Chemical informatics and target identification in a zebrafish phenotypic screen. Nat. Chem. Biol. 8, 144–6 (2011).
Article PubMed PubMed Central Google Scholar
Zhang, L. et al. A Naturally-Derived Compound Schisandrin B Enhanced Light Sensation in the pde6c Zebrafish Model of Retinal Degeneration. PLoS One 11, e0149663 (2016).
Article PubMed PubMed Central Google Scholar
MacPhail, R. C. et al. Locomotion in larval zebrafish: Influence of time of day, lighting and ethanol. Neurotoxicology 30, 52–58 (2009).
Article CAS PubMed Google Scholar
Ali, S., Champagne, D. L. & Richardson, M. K. Behavioral profiling of zebrafish embryos exposed to a panel of 60 water-soluble compounds. Behav. Brain Res. 228, 272–283 (2012).
Article CAS PubMed Google Scholar
de Esch, C. et al. Locomotor activity assay in zebrafish larvae: influence of age, strain and ethanol. Neurotoxicol. Teratol. 34, 425–433 (2012).
Article PubMed Google Scholar
Ali, S., van Mil, H. G. J. & Richardson, M. K. Large-scale assessment of the zebrafish embryo as a possible predictive model in toxicity testing. PLoS One 6, e21076 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Deeti, S., O’Farrell, S. & Kennedy, B. N. Early safety assessment of human oculotoxic drugs using the zebrafish visualmotor response. J. Pharmacol. Toxicol. Methods 69, 1–8 (2014).
Article CAS PubMed Google Scholar
Rudin-Bitterli, T. S. et al. Combining motion analysis and microfluidics–a novel approach for detecting whole-animal responses to test substances. PLoS One 9, e113235 (2014).
Article ADS PubMed PubMed Central Google Scholar
Hua, J., Vijver, M. G., Richardson, M. K., Ahmad, F. & Peijnenburg, W. J. G. M. Particle-specific toxic effects of differently shaped zinc oxide nanoparticles to zebrafish embryos (Danio rerio). Environ. Toxicol. Chem. 33, 2859–68 (2014).
Article CAS PubMed Google Scholar
Hua, J., Vijver, M. G., Ahmad, F., Richardson, M. K. & Peijnenburg, W. J. G. M. Toxicity of different-sized copper nano- and submicron particles and their shed copper ions to zebrafish embryos. Environ. Toxicol. Chem. 33, 1774–82 (2014).
Article CAS PubMed Google Scholar
Akhtar, M. T. et al. Developmental effects of cannabinoids on zebrafish larvae. Zebrafish 10, 283–93 (2013).
Article CAS PubMed Google Scholar
Emran, F., Rihel, J. & Dowling, J. E. A behavioral assay to measure responsiveness of zebrafish to changes in light intensities. J. Vis. Exp. pii:923, doi:10.3791/923 (2008).
Gao, Y. et al. A high-throughput zebrafish screening method for visual mutants by light-induced locomotor response. IEEE/ACM Trans Comput Biol Bioinform Under review (2013).
Zhang, L. et al. Drug Screening to Treat Early-Onset Eye Diseases: Can Zebrafish Expedite the Discovery? Asia-Pac J Ophthalmol 1, 374–383 (2012).
Article CAS Google Scholar
Liu, Y. et al. Statistical analysis of zebrafish locomotor response. PLoS One 10 (2015).
Scott, C. A., Marsden, A. N. & Slusarski, D. C. Automated, high-throughput, in vivo analysis of visual function using the zebrafish. Dev. Dyn. 245, 605–613 (2016).
Article PubMed PubMed Central Google Scholar
Vignet, C. et al. Systematic screening of behavioral responses in two zebrafish strains. Zebrafish 10, 365–375 (2013).
Article PubMed Google Scholar
Kopp, R., Legler, J. & Legradi, J. Alterations in locomotor activity of feeding zebrafish larvae as a consequence of exposure to different environmental factors. Environ. Sci. Pollut. Res., doi:10.1007/s11356-016-6704-3 (2016).
Magno, L. D. P., Fontes, A., Gonçalves, B. M. N. & Gouveia, A. Pharmacological study of the light/dark preference test in zebrafish (Danio rerio): Waterborne administration&quot. Pharmacol. Biochem. Behav. 139, 141–148 (2015).
Article CAS PubMed Google Scholar
Zac Stephens, W. et al. The composition of the zebrafish intestinal microbial community varies across development. ISME J. 10, 644–654 (2016).
Article CAS PubMed Google Scholar
Nordgreen, J., Tahamtani, F. M., Janczak, A. M., Horsberg, T. E. & Canavello, P. Behavioural Effects of the Commonly Used Fish Anaesthetic Tricaine Methanesulfonate (MS-222) on Zebrafish (Danio rerio) and Its Relevance for the Acetic Acid Pain Test. PLoS One 9, e92116 (2014).
Article ADS PubMed PubMed Central Google Scholar
Hallare, A., Nagel, K., Kohler, H. R. & Triebskorn, R. Comparative embryotoxicity and proteotoxicity of three carrier solvents to zebrafish (Danio rerio) embryos. Ecotoxicol Env. Saf 63, 378–388 (2006).
CAS Google Scholar
Watkins, J., Miklósi, A. & Andrew, R. J. Early asymmetries in the behaviour of zebrafish larvae. Behav. Brain Res. 151, 177–183 (2004).
Article PubMed Google Scholar
Fernandes, Y., Tran, S., Abraham, E. & Gerlai, R. Embryonic alcohol exposure impairs associative learning performance in adult zebrafish. Behav. Brain Res. 265, 181–187 (2014).
Article CAS PubMed PubMed Central Google Scholar
Agresti, A. Categorical data analysis. Wiley series in probability and mathematical statistics. Applied probability and statistics (Wiley, 1990).
Ganzen, L., Venkatraman, P., Pang, C. P., Leung, Y. F. & Zhang, M. Utilizing Zebrafish Visual Behaviors in Drug Screening for Retinal Degeneration. In revision (2017).

Download references

Acknowledgements

Yiwen Liu, Ping Ma and Wenxuan Zhong were partially supported by National Science Foundation (Grant No. DMS-1440037, 1440038, 1438957) and NIH (R01GM113242, R01GM122080). Robert Carmer was partially supported by a Howard Hughes Medical Institute Undergraduate Research Experience Program from Purdue University. Gaonan Zhang was partially supported by a William H. Phillips Summer Research Internship from Purdue University. Prahatha Venkatraman was partially supported by a Faculty for the Future Fellowship by the Schlumberger Foundation. Chi Pui Pang was partially supported by a Direct grant (Grant No. 2041771) from the Medical Panel, The Chinese University of Hong Kong, and a General Research Fund (Grant No. 2140694) from the Research Grants Council of Hong Kong. Mingzhi Zhang was partially supported by the National Scientific Foundation of China (Grant No. 81486126), the Provincial Natural Scientific Foundation of China (Grant No. 8151503102000019), and the Ministry of Health program of Public Welfare (Grant No. 201302015). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

Department of Statistics, University of Georgia, 101 Cedar St, Athens, GA, 30602, USA
Yiwen Liu, Ping Ma & Wenxuan Zhong
Department of Biological Sciences, Purdue University, 915 W. State Street, West Lafayette, IN, 47907, USA
Paige A. Cassidy, Robert Carmer, Gaonan Zhang, Prahatha Venkatraman, Skye A. Brown & Yuk Fai Leung
Department of Statistics, 250 N University Street, Purdue University, West Lafayette, IN, 47907, USA
Robert Carmer
Department of Ophthalmology and Visual Sciences, Chinese University of Hong Kong, Hong Kong, Hong Kong
Chi Pui Pang
Joint Shantou International Eye Center, Shantou University & the Chinese University of Hong Kong, Shantou, China
Mingzhi Zhang
Department of Biochemistry and Molecular Biology, Indiana University School of Medicine Lafayette, 625 Harrison Street, West Lafayette, IN, 47907, USA
Yuk Fai Leung
Purdue Institute for Integrative neuroscience, 610 Purdue Mall, Purdue University, West Lafayette, IN, 47907, USA
Yuk Fai Leung
Purdue Institute for Drug Discovery, 610 Purdue Mall, Purdue University, West Lafayette, IN, 47907, USA
Yuk Fai Leung

Authors

Yiwen Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ping Ma
View author publications
You can also search for this author in PubMed Google Scholar
Paige A. Cassidy
View author publications
You can also search for this author in PubMed Google Scholar
Robert Carmer
View author publications
You can also search for this author in PubMed Google Scholar
Gaonan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Prahatha Venkatraman
View author publications
You can also search for this author in PubMed Google Scholar
Skye A. Brown
View author publications
You can also search for this author in PubMed Google Scholar
Chi Pui Pang
View author publications
You can also search for this author in PubMed Google Scholar
Wenxuan Zhong
View author publications
You can also search for this author in PubMed Google Scholar
Mingzhi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yuk Fai Leung
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceived and designed the experiments: Y.L., P.M., C.P.P., W.Z., M.Z., Y.F.L. Performed the experiments: Y.L., R.C., G.Z., P.V., S.A.B., Y.F.L. Analyzed the data: Y.L., P.M., W.Z., Y.F.L. Wrote the paper: Y.L., P.M., P.A.C., W.Z., Y.F.L.

Corresponding authors

Correspondence to Wenxuan Zhong, Mingzhi Zhang or Yuk Fai Leung.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

R-Script

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Liu, Y., Ma, P., Cassidy, P.A. et al. Statistical Analysis of Zebrafish Locomotor Behaviour by Generalized Linear Mixed Models. Sci Rep 7, 2937 (2017). https://doi.org/10.1038/s41598-017-02822-w

Download citation

Received: 04 January 2017
Accepted: 19 April 2017
Published: 07 June 2017
DOI: https://doi.org/10.1038/s41598-017-02822-w

This article is cited by

An updated review on animal models to study attention-deficit hyperactivity disorder
- Daegeon Kim
- Dhananjay Yadav
- Minseok Song
Translational Psychiatry (2024)
Key HPI axis receptors facilitate light adaptive behavior in larval zebrafish
- Han B. Lee
- Soaleha Shams
- Karl J. Clark
Scientific Reports (2024)
Cannabidiol improves haloperidol-induced motor dysfunction in zebrafish: a comparative study with a dopamine activating drug
- Akihiro Hasumi
- Hideyuki Maeda
Journal of Cannabis Research (2023)
l-Carnitine ameliorates congenital myopathy in a tropomyosin 3 de novo mutation transgenic zebrafish
- Po-Jui Hsu
- Horng-Dar Wang
- Chiou-Hwa Yuh
Journal of Biomedical Science (2021)
Automatic multiple zebrafish larvae tracking in unconstrained microscopic video conditions
- Xiaoying Wang
- Eva Cheng
- Donald Wlodkowic
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

Experimental data

Statistical Analysis

Activity summarization

Data modeling and statistical inference

Explanatory variables

GLMM

Results

Example 1: Difference in activities of different WT strains during the same time interval

Difference in activity of the same strain

Example 2: Difference in activity of the same strain during light onset and offset

Example 3: Difference in activities of larvae at different developmental stages

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links