A generalised framework for detailed classification of swimming paths inside the Morris Water Maze

Vouros, Avgoustinos; Gehring, Tiago V.; Szydlowska, Kinga; Janusz, Artur; Tu, Zehai; Croucher, Mike; Lukasiuk, Katarzyna; Konopka, Witold; Sandi, Carmen; Vasilaki, Eleni

doi:10.1038/s41598-018-33456-1

Download PDF

Article
Open access
Published: 10 October 2018

A generalised framework for detailed classification of swimming paths inside the Morris Water Maze

Scientific Reports volume 8, Article number: 15089 (2018) Cite this article

4113 Accesses
27 Citations
7 Altmetric
Metrics details

Subjects

Abstract

The Morris Water Maze is commonly used in behavioural neuroscience for the study of spatial learning with rodents. Over the years, various methods of analysing rodent data collected during this task have been proposed. These methods span from classical performance measurements to more sophisticated categorisation techniques which classify the animal swimming path into behavioural classes known as exploration strategies. Classification techniques provide additional insight into the different types of animal behaviours but still only a limited number of studies utilise them. This is primarily because they depend highly on machine learning knowledge. We have previously demonstrated that the animals implement various strategies and that classifying entire trajectories can lead to the loss of important information. In this work, we have developed a generalised and robust classification methodology to boost classification performance and nullify the need for manual tuning. We have also made available an open-source software based on this methodology.

Quantifying free behaviour in an open field using k-motif approach

Article Open access 27 December 2019

A framework to identify structured behavioral patterns within rodent spatial trajectories

Article Open access 11 January 2021

Strategies discovery in the active allothetic place avoidance task

Article Open access 25 July 2022

Introduction

The Morris Water Maze (MWM), designed by Richard Morris, was first described in 1981 in a study regarding the spatial localisation of rats¹. The MWM quickly became popular and by the end of the eighties a large number of published work using the MWM had been reported². For instance, the review work of D’Hooge and Deyn mentions more than 2000 publications related to the MWM task within the decade 1990–2001³. More recently, virtual forms of the MWM have been used directly on human subjects and this generalisation made it possible to comparatively assess human and rodent place navigation⁴, compare spatial learning between sexes⁵ and directly study how certain factors (e.g. stimuli, age, etc.) affect spatial navigation and how certain areas of the brain perform under their effects^6,7,8,9.

In a typical MWM experiment the rodent is placed inside a circular pool filled with water and is tasked with finding a hidden platform, which is placed in one of the four quadrants of the pool. Since the animal is unable to see the platform, it has to rely on external visual cues in order to navigate within the pool and find the platform. After a number of trials, it is expected that the animal will have learned the location of the platform and will therefore be able to find it in less time than in the beginning of the trials¹⁰.

Most of the studies using the MWM experiment utilise several measurements of performance in order to assess learning and memory. Many of these measurements have also been used to ensure that the animal groups have equal skills and abilities (e.g. swimming ability, speed, ‘understanding’ of the escape mechanism)^11,12. Common measurements include the time that the animal spends inside each quadrant of the pool, the latency of finding the platform in each trial, the directionality and the total swimming distance in each trial^2,10,13. There are also a number of more sophisticated measurements such as the body temperature of the animals throughout the experiment¹⁴ or the cumulative distance to platform, which is the distance between the animal location and the platform location calculated a number of times with a specific sampling rate^15,16.

These simplistic measurements and statistics have been criticised as being insufficient to capture all of the different animal behaviours that are observed during MWM experiments^16,17. For this reason researchers started to study the various behaviours that the animals were expressing inside the pool, which are known as exploration strategies. Notable are the studies of Wolfer et al., who computed a large amount of measures for each swimming path inside the maze in order to categorise the various strategies^18,19,20. Other studies include the automatic classification procedures of Graziano et al.²¹ and Garthe et al.²², both of which specified regions of interest inside the arena. The categorisation method of Graziano et al. was based on a number of path measures while in the work of Garthe et al. a hierarchical classification algorithm was used and the categorisation of each swimming path was primarily based on the amount of time that the animal spent in each region of the arena. The latter method was also used in more recent studies^23,24 (Illouz et al.²⁵ proposed a classification technique based on support vector machines (SVM)²⁶. While they trained their algorithm to detect nine behavioural strategies, their method was generic enough to detect these behavioural strategies on a variety of different MWM experiments. However, similar to previously proposed classification methods, it did not have the ability to detect mixed animal behaviours within the same trial but assigns the whole swimming path of the animal during the trial into one behavioural class¹⁷.

In our previous work¹⁷ we argued that animals employ several behavioural strategies during each trial in order to find the platform and assigning whole animal trajectories to single behavioural classes results in the loss of important information. For this reason, we proposed a more sophisticated automatic quantification methodology capable of classifying and presenting the various animal behaviours in much more detail during each trial. According to this approach, the animal swimming path is first split into segments and then the segments are classified into behavioural strategies. In this way, changes in the animal behaviour within each trial can be detected and the animal swimming path, as a whole, falls under more than one strategy, revealing how the animal behaviour evolves within the trial.

For the classification of the segments we used a semi-supervised classification procedure which requires manual classification (labelling) of a small amount of data. An advantage of this procedure is that our classification is based on a clustering algorithm which is able to detect patterns in the data. Therefore, the behavioural classes didn’t necessarily have to be defined a priori. On the other hand, the method developed in our previous study required a certain degree of knowledge about machine learning methods, which prevented the direct application to other datasets.

In this work, we present an automatic boosted classification procedure based on majority voting, which improves on the classification error, and a validation framework which leads to conclusions with a high degree of confidence. Majority voting refers to the fact that more than one classifier are used in order to assign a segment into a class. We have implemented this framework into a fully working software capable of performing all of our analyses, without requiring machine learning knowledge from the user. This software is called RODA (ROdent Data Analytics)²⁷ and is focused on the MWM experiment. It provides an easy to use graphical user interface (GUI) for loading the data and defining the experimental specifications. It also supports automatic segmentation and semi-automatic classification, and produces quality figures which can be exported into various image formats.

Results

Trajectory Segmentation Analysis (TSA) & the RODA Software

We have developed a framework that allows Morris Water Maze trajectory segmentation analysis that requires little input from the user. Trajectories are divided into overlapping segments, a percentage of which (8–12%) are labelled by an expert user as belonging to one of eight different behavioural strategies. Multiple labels can also be used for a segment (see Methods for more information about the behavioural classes). The remaining segments are automatically classified via a semi-supervised clustering algorithm to one of the user-defined strategies, and via a smoothing procedure are mapped back to the full trajectories. This procedure allows us to identify multiple strategies in a single trial.

The user, in addition to providing labels, needs to define the segmentation length and overlap. For the segmentation parameters, appropriate regimes have been identified for the MWM with dimensions from 2 up to 2.5 times the arena radius (see Results: Robustness across different segmentation configurations).

In order to reduce the required tuning from the user (an issue of our previous work of Gehring et al.¹⁷) and improve the objectivity of the classification, we employ ensembles of classifiers that vote to assign the segment to a strategy according to a simple majority voting rule.

A software, called RODA²⁷ (shown in Fig. 1), has been developed in order for our proposed framework to be available for usage by the scientific community. The software is available on the github repository https://github.com/RodentDataAnalytics/mwm-ml-gen under the GNU General Public License version 3 (GPL-3.0). A manual of the corresponding software can be found under the wiki section of the repository (https://github.com/RodentDataAnalytics/mwm-ml-gen/wiki) details on the methodology and technical information about RODA can be found under Materials and Methods.

Advantages of Trajectory Segmentation Analysis (TSA)

Our methodology finds quantitative behavioural differences beyond those identified by standard metrics on the full swimming paths of the animals. It is able to detect additional significant differences between the behavioural strategies employed by the two or more animal groups in comparison to the categorisation of the whole animals trajectories. In more detail, from a manual behavioural analysis of the whole swimming paths of the animals the strategies thigmotaxis, incursion, scanning, self oriented, target scanning and direct finding are detected and analysed. By using TSA the additional behavioural classes of focused search, chaining response and scanning surroundings were able to be identified and analysed (for more details on the aforementioned classes of behaviour refer to Methods and Fig. 7).

We applied our framework to the dataset of Gehring et al.¹⁷ composed of two rodent groups (stressed and control rats). The reason for selecting the same dataset was because we wanted a benchmark for our improved method and the ability to demonstrate its robustness and generality. The two animal groups differ on the strategies of Thigmotaxis (Friedman test p-value = 0.004, Q = 8.516, k = 2), Incursion (Friedman test p-value = 0.009, Q = 6.811, k = 2) and Chaining Response (Friedman test p-value = 0.007, Q = 7.220, k = 2) in favour of the stressed group meaning that stressed animals implement these strategies more often than the control group. In addition, stressed animals tend to transit between different strategies more often than the control animals (Friedman test p-value = 0.037, Q = 4.340, k = 2). For relevant results refer to Fig. 2.

Commonly used measurements of learning (animal speed, escape latency and path length) suggest that there is a significant difference among the two animal groups in the sense that the stressed animals are faster (Friedman test p-value = 2 × 10⁻⁸, Q = 31.510, k = 2) and swap longer paths (Friedman test p-value = 0.002, Q = 9.836, k = 2) within the trials but they still fail to find the platform in less time than the control animals (Friedman test p-value = 0.154, Q = 2.030, k = 2). For relevant results refer to Fig. 3 for the relevant results). Manual classification of the full swimming paths to different behavioural strategies was performed due to the small amount of data; this analysis suggests that the reason for this phenomenon is because stressed animals tend to use the low level strategy of Thigmotaxis more than the control group (Friedman test p-value = 0.015, Q = 5.888, k = 2), which lowers their chances of finding the platform since they spent most of the time close to the arena periphery (refer to Fig. 4 for the relevant results). TSA agrees on that conclusion but it is also able to detect that stressed animals tend to use a series of low level strategies, both Thigmotaxis (Friedman test p-value = 0.004, Q = 8.516, k = 2) and Incursion (Friedman test p-value = 0.009, Q = 6.811, k = 2, refer to Fig. 2), which lower their chances of finding the platform since they spent most of the time on or close to the arena periphery. In addition, stressed animals implement the Chaining Response strategy more often than the control animals (Friedman test p-value = 0.007, Q = 7.220, k = 2, refer to Fig. 2), which implies that they haven’t memorised the location of the platform but its distance to the wall¹⁸; so they swim at that distance in hope to find it by chance; a behaviour that is again, on average, time consuming. Furthermore, TSA allows us to detect that stressed animals change their behaviour inside the arena more often than the control animals (Friedman test p-value = 0.037, Q = 4.340, k = 2, refer to Fig. 2). These results are relevant to studies such as^28,29,30 which suggest that high levels of stress lead to weak attention and frequent behavioural switches.

Robustness across different segmentation configurations

It is expected that the segmentation length affects the results, i.e. a full trajectory will not reveal more than one strategy or a very small segment will not have enough information for mapping it onto a strategy. We therefore focus on segmentation lengths between 2 times and 3 times the arena radius in order to investigate the robustness of our process.

The animal swimming paths inside the maze were segmented using four different segmentation configurations (different segment length and/or segment overlap). For each segmentation, we provided labels to approximately 10% of the segments (refer to Table 1 for a summary of our different configurations) and we use our framework to classify the rest. We based our conclusions on both the ensemble classification result as well as the percentage of classifiers in an ensemble that agree to this result (95% binomial confidence intervals clearly above 50%). Three out of four segmentation configurations (with segment lengths 2 and 2.5 times the arena radius) led to the conclusion that the two animal groups (stressed and control) have significant difference on the strategies of Thigmotaxis, Incursion and Chaining Response and strategy transitions (Friedman test p-value < 0.05 and 95% binomial confidence intervals clearly above 50%, see Fig. 5 for detailed statistics) in favour of the stressed group meaning that stressed animals implement these strategies and transit between different strategies more often than the control animals. One out of four segmentations (segment length of 3 times the arena radius) failed to capture significant difference in the Chaining Response strategy and a probable reason is that the segment length is too large, thus strategies that are rarer and significantly smaller are overshadowed by more common ones (e.g., Chaining Response may be overshadowed by Scanning Surrounding or Thigmotaxis, refer to Table 3). This is an issue introduced already during the labelling procedure. For example, the Segmentation 1 only 0.67% of the samples were single-labelled as chaining response vs 1.58%, 0.72%, 1.06% for the Segmentations 2 to 4 correspondingly. The larger segment makes it more difficult for the human expert to distinguish rare classes that are adjoint to frequent ones.

Table 1 Parameters for the classification of four different segmentation configurations with variable segment lengths and overlaps.

Full size table

Discussion

Methodologies that classify swimming paths in MWM to behavioural classes can reveal different stages of learning in animal groups. However, up to now, there are very few examples of earlier research that have made use of machine learning techniques to automatically detect animal behaviours. Most of them have proposed methods that are difficult to generalise and require machine learning knowledge. In our previous study¹⁷ we addressed limitations of the previous techniques by focusing on the fact that forcing whole swimming paths into a single class of behaviour can be suboptimal as each trajectory incorporates a number of different behaviours. Our methodology of detailed trajectory classification can reveal additional behavioural differences between two groups of animals and can be used even when small amount of trajectory data are available since the segmentation process, due to overlapping, typically creates a significant amount of data. Nevertheless, our previously proposed method of segmented trajectories classification required a certain degree of machine learning knowledge to be used correctly, and allowed an amount of subjectivity when choosing classifiers.

In this work, we address these issues by proposing to improve the robustness of the technique via majority voting. Our results are no longer based on a single classification tuning (classifier) but on the agreement of many. This technique alleviated the subjective assignment of the swimming path segments to classes since, in practice, many classifiers that seemingly perform equally well in validation, have relatively high disagreement, and how to best chose among them might be unclear. Here, we systematically investigate different segmentations to identify what are the bounds under which our method produces meaningful results. The bounds refer to the minimum and maximum segmentation length and the number of labels that needs to be provided. Furthermore, the binomial confidence intervals on the ensemble of the classifiers are informative regarding the quality of our results.

The dataset from our previous work¹⁷ was used as a benchmark of our new methodology and also as a way to demonstrate its robustness and generality. We report that it leads to results similar to our earlier work¹⁷ but with one difference; we do not detect any significant difference for the scanning strategy which we had detected before based on the result of a single classifier. This is due to a number of factors: (i) the use of only one classifier, which results in higher error (see also the confidence intervals in Fig. 5), (ii) the merging of three different segmentations that resulted in classifications that didn’t fully agree with each other. Here we base our conclusions on the majority voting of many classifiers that are shown to have an improved performance versus the simple classifiers, and therefore lead to more reliable results.

One important point that should be mentioned is that despite the fact that for each segmentation the ensemble formed has extremely low to zero error percentage, the largest segmentation failed to indicate difference on the Chaining Response strategy. We have identified as cause of this issue the difficulty involved when labelling large segments; in this case the chaining response can be masked by more dominant classes such as Thigmotaxis. It is worth noting that the smoothing function, which is used to map the segments back to the whole trajectories, again do not affect the conclusions formed based on the strategies. Even without the smoothing function, again, three segmentations agree on the differences between the two animal groups on the Thigmotaxis, Incursion and Chaining Response strategies while the segmentation with the more lengthy segments cannot capture the difference on Chaining Response (refer to the Supplementary material for the non-smoothed classification results). For this reason, the criterion for correct classification cannot be based on the classification error alone. We also require consistent results within a reasonable variation of the segmentation length, in our case 200–250 cm, i.e., 2 R and 2.5 R, with R being the radius of the maze. We have also verify that these segmentation parameter arranges are directly applicable to other MWM experiments (results not shown).

To facilitate the use of this methodology by the scientific community, we provide a complete software incorporating of our framework which includes a Graphical User Interface (GUI) to guide the user throughout all the analysis stages and allows for the manual configuration of each procedure.

Though our proposed framework is able to detect behavioural information in much detail we should highlight that it has a substantial limitation; the segmentation length will inevitably have an effect on the behavioural strategy length. For instance, it is not able to detect behavioural strategies of lengths shorter than $2\cdot R$ (refer to the supplementary material, where we provide the average length of each behavioural class for all the segmentation tunings). Thus if groups have different strategy lengths, e.g. if only one group is having lengths more than $2\cdot R$ then our current method is still applicable but it will not provide much information about behavioural differences because most of the segments will be classified as Direct Finding. Main reasons for this limitation are the following: (a) it is difficult to put manual labels to segments with lengths below $2\cdot R$ and (b) trajectories with lengths below the length of the segmentation tuning will be automatically classified as Direct Finding. To alleviate this limitation we gave the user the ability to provide labels to trajectories shorter than the specified segmentation tuning. However, for other experimental data if the strategies are of average length below $2\cdot R$, the tuning procedure we presented here must be repeated to identify appropriate values.

A proposed future application of our current framework is to address differences in sequence of behavioural strategies. As it was suggested in the literature^31,32, strategies within one trial occur in reliable sequences. Since the output of our method are sequences of strategies, one could potentially apply a Markovian analysis³³ on the sequences and detect differences between animal groups on the probabilities of transition between behavioural strategies. A more detailed analysis on the different kinds of transition has the potential to reveal additional differences among animal groups. Such analysis can be viewed as a Markov model where each behavioural strategy is a state and when the animal is in a particular state it can either remain in the same state, i.e. repeat the same behaviour, or transit to another behaviour.

Finally, it should be noted that the work we present here can generalise to other species of rodents inside the MWM (e.g. mice) as well as other experiments similar to the MWM (e.g. open field tasks, place avoidance). Two main significant changes to be made are the strategy definitions and the trajectory features. In our recent work³⁴ we addressed the issue of pre-defined strategies by using a fully unsupervised procedure to find patterns of behaviour in the active allothetic place avoidance task. In that experiment there is no previous knowledge of animal behaviours thus supervised or semi-supervised techniques cannot be applied. However, we mentioned that our classification depends on the trajectory features that we used. A combined work of the classification boosting technique, an unsupervised methodology³⁴, and the engineering of trajectory features that not linked to a specific experiment has the potential to lead to a robust generalised framework of trajectory analysis for many different animal species used in experimental procedures (e.g. octopus³⁵ and zebrafish³⁶).

Methods

Analysis Overview

In our proposed analysis method, the swimming paths of the animals inside the Morris Water Maze are divided into segments of approximately equal length and a fixed overlap percentage. For each segment a set of eight features is computed (refer to the Supplementary material for a short description of each feature). The features are then used in the classification procedure. Finally, a small portion of the segments needs also to be assigned manually to a specific strategy (labelling); this information is used as prior knowledge to guide the classification procedure.

Our classification procedure, which assigns segments to classes of behaviour, is based on a semi-supervised clustering algorithm called Metric Pairwise Constrained K-Means (MPCK-Means)³⁷. This algorithm incorporates the two main approaches of semi-supervised clustering: metric learning (the measuring of similarity, ‘distance’, between data) and constrained-based learning (the use of labels or constraints that produce a better grouping of the data). To turn the algorithm into a classifier, the labelled data were used not only to guide the clustering process but also to assign clusters to classes (see Supplementary material).

A common issue with many clustering algorithms, including MPCK-Means, is that a predefined number of target clusters needs to be provided; this number indicates the number of clusters into which the data will be partitioned. Determining the optimal number of target clusters is challenging and, although many different quality measures were proposed over time³⁸, this value will depend on the specific clustering method and data at hand.

In this work, instead of searching for an optimal number of clusters and attempting to generate an optimal classifier, we select to generate a pool of ‘strong’ classifiers whose ‘goodness’ is assessed based on the 10-fold cross validation error. The strong classifiers generated in that way are then used to form an ‘ensemble’ which uses majority voting to reach a classification decision. The two conditions of having both strong and diverse classifiers are essential in majority voting in order to reach an optimal classification solution^39,40. This will be discussed in more detail later. In order to assess the labelling procedure (if enough and consistent labels have been provided) the criterion of having a minimum of 40 strong classifiers has been added prior to majority voting. Finally, the classification result of the ensemble is expected to have a low percentage of unclassified segments (less than 3%) because, since the classifiers are diverse, they will have different errors or will fail to classify different segments. Thus if they work together and form an ensemble, the individual errors will be compensated by the correct responses of the other members of the ensemble³⁹. A diagram of the procedure is illustrated in Fig. 6.

Trajectories Segmentation and Partial Labelling

To assign one trajectory to multiple classes, we earlier proposed the division of the full animal swimming paths into segments¹⁷. In our method each segment overlaps significantly with its previous one (percentages of 70% and 90% have been performed on this analysis) to make sure that important information is not lost due to an unfavourable segmentation. The segment length was empirically selected to be equal to, or slightly longer than, one arena diameter. If the segment length is too short it might be difficult to identify to which class segments belong; if it’s too long it might happen that more than one class of behaviour is represented. The latter case can be seen in our results, where the large segment length (3 times the arena radius) causes some classes to be overshadowed by the more common classes (refer to Fig. 5).

In this study, nine predefined strategies were adopted (see Classes of Behaviour). We have found empirically that the amount of data that needs to be labelled should be roughly between 8% to 12% of the total segment number but the exact value depends greatly on the dataset under investigation. As a rule of thumb, if fewer labels are provided then the classification results will be poor in the sense that a lot of segments will remain unclassified or fall under the wrong class. Since the labelling procedure is prone to error and subjectivity a number of validation criteria have been implemented throughout our analysis.

Classification Boosting with Majority Voting

The classification boosting is an ensemble technique that is based on the idea that many weak learners can be converted to a strong learner⁴¹. In machine learning terms an ensemble of weak classifiers (classifiers that make mistakes) can be used to form a strong classifier (classifier that makes fewer mistakes) by combining each individual’s opinion^42,43. This approach has been used in various classification tasks (see Oza et al.⁴⁴ for a survey) and in addressing complex real-world problems, when single algorithmic classification solutions are unable to achieve high performance⁴⁵.

One way to perform classification boosting is through majority voting⁴²: many classifiers form an ensemble, vote for the class of each datapoint and the class with the most votes wins. The output of the ensemble is expected to have improved accuracy since individual errors of each classifier are compensated by the correct responses of the other members of the ensemble³⁹. In order to achieve such an outcome, the classifiers need to at least be diverse in the sense that they should not share the same errors^42,46. It should be noted, however, that diversity alone is insufficient to ensure that randomly selected, arbitrarily weak, classifiers will achieve high classification accuracy³⁹ and⁴⁷. Individual Classifiers have to also be strong meaning that they should be sufficiently accurate on their own⁴⁷ (and⁴⁸ indicate an accuracy of at least 50%).

Majority Voting Implementation

In our framework, we need to classify different trajectory segments into animal behavioural classes (strategies) having only a partial set of labelled data. The classification is parametrized by the target number of clusters of the clustering algorithm, a value that is difficult to estimate in advance. In order to overcome this problem we generate a number of classifiers by providing different numbers of target clusters in succession; At the end of this process a pool of classifiers is generated. We then use the 10-fold cross validation⁴⁹ process to evaluate different numbers of target clusters (10 to 100). Only classifiers with a validation error lower than 25% are used to form an ensemble (for more information about the 10-fold cross validation procedure refer to the Supplementary material). We set the minimum number of required classifiers that fulfill this criteria to 40. The reasoning behind this process is that we require a sufficient number of ‘strong’ classifiers. For the majority voting, we adopt the simple scheme where the vote of each classifier has the same weight^50,51 and that in case of a tie the datapoint (segment) is marked as undefined.

Framework Validation

We thoroughly validated every procedure of our framework in terms of robustness and results consistency. Overall, we performed four different segmentations with the aim to find the bounds (error margins) for the segment length between which we have consistent analysis conclusions. As discussed in the results section, it is expected that the segmentation length affects the results and we showed that consistency for the MWM can be achieved with segmentation lengths between 2 times and 2.5 times the arena radius. For more information refer to Fig. 5 where we show that longer segment lengths fails to capture the difference between the two groups of the Chaining Response behavioural strategy.

For each of the four different segmentations we compared the performance of the classifiers, the ensemble and multiple ensembles formed by random sample of ‘strong’ classifiers. Table 2 shows the relevant results of the last stage of analysis, where the overlapped segments have been mapped back to the original swimming paths. For the latter the smoothing function is applied on the segments (refer to Methods: Mapping Segment Classes to the Full Swimming Paths) and this detail is important because the smoothing procedure increases the performance of the classifiers (for the statistical analysis prior to the smoothing function refer to the Supplementary material). As expected, ensembles have higher accuracy, a lower percentage of unclassified segments and a higher percentage of agreement among them in comparison to individual classifiers. However, since in our method the cross validation was used for both tuning and testing, additionally we manually assess the error of the ensembles on two out of the four segmentations (see Supplementary material for more information about the manual assessment).

Table 2 Classification statistics (average) for the four segmentation configurations of Table 1 and benefits of majority voting.

Full size table

Classifier Diversity

To evaluate the diversity of the classifiers, we assess the percentage of their agreement for the class of each segment. The result is a symmetric matrix with rows and columns representing the classifiers where each element shows the percentage of segments for which two classifiers agree on the assigned class. The diagonal values of this matrix equal to 100 as each classifier is in 100% agreement with itself (refer to the Supplementary material for an example of an agreement matrix). An overall agreement can be computed by averaging the upper or lower triangular of the matrix. In addition, we consider the average cross validation error (accuracy) over the classifiers. In order for the classifiers to be both diverse and strong it is expected that they should have an average percentage of agreement well below 100% (in our case around 60%) and low cross validation error (refer to Table 2).

As has been previously reported³⁹, ensembles have far less variance in comparison with individual classifiers thus it is expected to have much higher agreement. To demonstrate this observation, we generated a number of ensembles by picking classifiers at random from the pool. Afterwards we performed the same statistical measurement of agreement for the ensembles, similar to the one described for the classifiers. In contrast to the classifiers, the ensembles have high agreements among them (more than 80%) and nearly nullify the cross validation error of the classifiers (see Table 2). However, since in our method the cross validation was used for both tuning and testing¹⁷, additionally we manually assess the error of the ensembles on two out of the four segmentations (see the Supplementary material for the manual assessment results).

Percentage of unclassified segments

A useful measure for the quality of the classification is the percentage of unclassified segments. For certain segments, it is expected that none of the classifiers in the ensemble will be able to determine a class, or that there could be a draw for segments that transit between classes (refer to Table 3 on the Results section). This, however, does not have an impact on the consistency of results (see Fig. 5).

Table 3 Percentage of segments falling under each class for the four segmentation configurations of Table 1.

Full size table

Mapping Segment Classes to the Full Swimming Paths

The classification has been performed on overlapping segments of the animals’ swimming paths, we therefore need to map them back to the whole trajectories.

As a first approach, we considered the classified segments as continuous parts of the trajectories ignoring the overlap percentage. This method provides consistent results on the significant differences of the strategies but fails to detect differences on strategy transition between groups (refer to the Supplementary material for the relevant result). The reason for this is that sparse segments within each swimming path fall under different classes thus viewing them as a sequence leads to an overestimation of transitions (a transition occurs when a segment falls under a different class after a sequence of segments that fall under the same class).

To address this limitation, we use a smoothing technique with parameters independent of segmentation choice. This was done for two reasons: (i) to avoid subjective conclusions based on a specific segmentation configuration and (ii) to be able to directly compare different segmentations. In more detail, given that R equals to the radius of the arena, the swimming paths are now divided into intervals of length R. Each of the intervals is assigned to a certain class based on a weighed voting of all the overlapping segments. The mathematical expression for this operation is shown in equation 1,

$${C}_{{T}_{i}}\equiv ar{g}_{{c}_{k}}\,max\,\sum _{(\begin{array}{c}{S}_{j}\in {c}_{k}\\ {T}_{i}\cap {S}_{j}\ne \varnothing \end{array})}\,{w}_{k}\cdot {e}^{-\frac{{d}_{ij}^{2}}{2\cdot {\sigma }^{2}}}$$

(1)

where T_i is the i_th interval, d_i,j is the distance from the centre of the j_th segment (S_j) overlapping with the i_th interval to the centre of the i_th interval, c_k is the k_th segment class and w_k is a class weight normalised so that $\sum \,{w}_{k}=1$. The sum is to be taken over the segments intersecting with the interval T_i, belong to class c_k (unclassified segments are excluded) and fulfill the threshold requirement ${e}^{-\frac{{d}_{ij}^{2}}{2\cdot {\sigma }^{2}}} > \,=0.14$, where σ is the variance of the Gaussian and the value 0.14 is obtained when ${d}_{ij}=2\cdot \sigma $. The reason for the latest requirement is to create a cutoff for the segments that are too far away from the centre of the interval. The parameter σ controls the weight of the vote of each segment based on its distance from the interval and in our analysis it was set equal to R in order to achieve proportionality with the arena dimensions (other values have also been tested, refer to the Supplementary material). Finally, the class weight w_k was defined as ${w}_{k}=\frac{1}{P({c}_{k})}$, where P(c_k) is the percentage of segments belonging to class k. The intuition for setting the class weights inversely proportional to the amount of segments that fall under each class was to prevent rare classes from being overshadowed by common ones. To prevent having too small or too large class weights the bounds of [0.01 0.5] were set, which means that if less than 1% or more than 50% of the segments fall under a certain category then this class will receive weight equal to 0.01 or 0.5 respectively.

Statistics

The non-parametric Friedman test⁵² was used for the analysis of variance of each strategy between the two animal groups. This test was selected because the data are not normally distributed and because of its ability to control the variability among subjects over the different observations⁵³.

For our analysis the null hypothesis is that there in no difference between the two animal groups (stressed and control) over each one of the strategies (refer to Methods: Classes of Behaviour) as well as over the number of times that the animals change their behaviour within single trials (strategy transitions). Small p-values (<0.05) generated by the Friedman test lead us to discard the null hypothesis that the results are identical and that any differences are only due to chance (random sampling). When the test is used we report the Friedman test p-value and the Friedman’s chi-square statistic (Q)⁵⁴. Since we compare two animal groups, stress and control, we have k = 2 variables and the degrees of freedom are equal to df = 1.

In addition to the Friedman test, the 95% confidence intervals of a binomial distribution⁵⁵ are being used, where the significance of a specific classification, as judged by each of the classifiers that form the ensemble, is viewed as a random process generating one (significant differences) or zero (non-significant differences). In more detail, the confidence intervals indicate our confidence that the classifiers forming the ensemble are on average pointing to the same conclusion as the ensemble (i.e. the majority agrees that there is significant difference over strategies or strategies transitions). Given that the Friedman test can have two outcomes, we hypothesise the outcomes to be the result of a binomial distribution. We require that the 95% confidence intervals to be clearly above 0.5 (or 50%) in order to be confident that the result in not due to chance^56,57.

The RODA Software

RODA²⁷ consists of a series of graphical user interfaces (GUIs) which offer straightforward analysis of trajectory data extracted from the Noldus Ethovision System⁵⁸. Every stage of the process can be tuned to meet the user’s needs. The generated figures can be exported into a variety of different image formats (JPEG, TIFF, etc.) while the numerical data depicted in the figures are also saved in Comma Separated Values (CSV) file format in case the user wishes to generate the figures using a different software (e.g. Microsoft Excel).

The software is entirely written in MATLAB⁵⁹ and uses a modified version of the WEKA library⁶⁰ written in Java which is known as WekaUT (for more information refer to http://www.cs.utexas.edu/users/ml/risc/code/).

The code of RODA is open-source and available on the github repository https://github.com/Rodent-DataAnalytics/mwm-ml-gen. The code requires the MATLAB’s Statistics Toolbox⁶¹ to be installed. Compiled versions of the software are also available for Windows and MAC OS (see the releases tab of the repository https://github.com/RodentDataAnalytics/mwm-ml-gen/releases).

Classes of Behaviour and Strategy Transitions

The choice of the classes of behaviours (strategies) in our analysis is motivated by previous studies (e.g.^19,21,62) which have observed and reported stereotypical animal behaviours inside the MWM (for an example of each strategy refer to Fig. 7).

Thigmotaxis (TT)

The animal moves exclusively on the periphery of the arena and most of the time it touches the walls of the arena.

Incursion (IC)

The animal starts to distant itself from the arena periphery with visible inward movements.

Scanning (SC)

A behaviour associated with random searches focused in the centre of the pool. Another characteristic of this behaviour is that the animal rapidly turns away from the arena walls if it touches them²¹.

Focused Search (FS)

This behaviour is also associated with random searches but here the animal actively searches a particular small region of the arena.

Chaining Response (CR)

A behaviour first observed in the study of Wolfer et al.¹⁸ where the animal appears to have memorised the distance to the platform from the arena wall and swims circularly in order to find it.

Self Orienting (SO)

The animal performs a loop and orients itself inside the arena²¹.

Scanning Surroundings (SS)

The animal crosses a region very close to the platform of the arena but moves away¹⁷.

Scanning Target (ST)

The animal actively searches for the arena by swapping paths around it.

Direct Finding (DF)

The animal navigates straight to the platform.

Strategy Transitions (tr)

In addition to the behavioural strategies, we have analysed the number of times that the animals change their behaviour within single trials.

Morris Water Maze Experiment and Data Properties

The data have been collected from experiments performed at the Laboratory of Behavioural Genetics, EPFL at Lausanne, Switzerland. All procedures were conducted in conformance with the Swiss National Institutional Guidelines on Animal Experimentation and approved by a license from the Swiss Cantonal Veterinary Office Committee for Animal Experimentation.

The water maze had a diameter of 200 cm with a submerged platform of diameter 12 cm. The recordings of the animals trajectories were performed by using the tracking software, Noldues EthoVision⁵⁸ version 3.1. The dataset contains 57 rats, 30 of which were inducted into stress at peripubertal age⁶³ and 27 of which were the control group. A total of 12 trials were performed per animal divided into 3 consecutive days with 4 trials per day. The timeout of each trial was 90 seconds and if the animal failed to find the platform within the time limit it was guided to it. The inter-trial interval between the trials of the same day was only a few minutes. The starting position of the animals was altered between trials.

Data Availability

The data used in this work are available in the same GitHub repository that hosts RODA (https://github.com/Rodent-DataAnalytics/mwm-ml-gen). RODA has a demo function embedded for importing the data and reproducing the results of this work (see the Wiki section of the repository).

References

Morris, R. G. Spatial localization does not require the presence of local cues. Learning and motivation 12, 239–260 (1981).
Article Google Scholar
Brandeis, R., Brandys, Y. & Yehuda, S. The use of the morris water maze in the study of memory and learning. International Journal of Neuroscience 48, 29–69 (1989).
Article CAS Google Scholar
D’Hooge, R. & De Deyn, P. P. Applications of the morris water maze in the study of learning and memory. Brain research reviews 36, 60–90 (2001).
Article Google Scholar
Schoenfeld, R., Schiffelholz, T., Beyer, C., Leplow, B. & Foreman, N. Variations of the morris water maze task to comparatively assess human and rodent place navigation. Neurobiology of Learning and Memory (2017).
Astur, R. S., Ortiz, M. L. & Sutherland, R. J. A characterization of performance by men and women in a virtual morris water task: A large and reliable sex difference. Behavioural brain research 93, 185–190 (1998).
Article CAS Google Scholar
Cornwell, B. R., Johnson, L. L., Holroyd, T., Carver, F. W. & Grillon, C. Human hippocampal and parahippocampal theta during goal-directed spatial navigation predicts performance on a virtual morris water maze. Journal of Neuroscience 28, 5983–5990 (2008).
Article CAS Google Scholar
Daugherty, A. M., Bender, A. R., Yuan, P. & Raz, N. Changes in search path complexity and length during learning of a virtual water maze: Age differences and differential associations with hippocampal subfield volumes. Cerebral Cortex bhv061 (2015).
Piber, D. et al. Mineralocorticoid receptor stimulation effects on spatial memory in healthy young adults: A study using the virtual morris water maze task. Neurobiology of Learning and Memory 136, 139–146 (2016).
Article CAS Google Scholar
Korthauer, L., Nowak, N., Frahmand, M. & Driscoll, I. Cognitive correlates of spatial navigation: Associations between executive functioning and the virtual morris water task. Behavioural brain research 317, 470–478 (2017).
Article CAS Google Scholar
Morris, R. Developments of a water-maze procedure for studying spatial learning in the rat. Journal of neuroscience methods 11, 47–60 (1984).
Article CAS Google Scholar
Vorhees, C. V. & Williams, M. T. Assessing spatial learning and memory in rodents. ILAR Journal 55, 310–332 (2014).
Article CAS Google Scholar
Maei, H. R., Zaslavsky, K., Teixeira, C. M. & Frankland, P. W. What is the most sensitive measure of water maze probe test performance? Frontiers in integrative neuroscience 3, 4 (2009).
PubMed PubMed Central Google Scholar
Lindner, M. D. Reliability, distribution, and validity of age-related cognitive deficits in the morris water maze. Neurobiology of learning and memory 68, 203–220 (1997).
Article CAS Google Scholar
Lindner, M. D. & Gribkoff, V. K. Relationship between performance in the morris water task, visual acuity, and thermoregulatory function in aged f-344 rats. Behavioural brain research 45, 45–55 (1991).
Article CAS Google Scholar
Gallagher, M., Burwell, R. & Burchinal, M. R. Severity of spatial learning impairment in aging: development of a learning index for performance in the morris water maze. Behavioral neuroscience 107, 618 (1993).
Article CAS Google Scholar
Dalm, S., Grootendorst, J., De Kloet, E. R. & Oitzl, M. S. Quantification of swim patterns in the morris water maze. Behavior Research Methods, Instruments, & Computers 32, 134–139 (2000).
Article CAS Google Scholar
Gehring, T. V., Luksys, G., Sandi, C. & Vasilaki, E. Detailed classification of swimming paths in the morris water maze: multiple strategies within one trial. Scientific reports 5 (2015).
Wolfer, D. P. & Lipp, H.-P. Dissecting the behaviour of transgenic mice: is it the mutation, the genetic background, or the environment? Experimental physiology 85, 627–634 (2000).
Article CAS Google Scholar
Wolfer, D. P., Stagljar-Bozicevic, M., Errington, M. L. & Lipp, H.-P. Spatial memory and learning in transgenic mice: fact or artifact? Physiology 13, 118–123 (1998).
Article Google Scholar
Wolfer, D. P., Madani, R., Valenti, P. & Lipp, H.-P. Extended analysis of path data from mutant mice using the public domain software wintrack. Physiology & Behavior 73, 745–753 (2001).
Article CAS Google Scholar
Graziano, A., Petrosini, L. & Bartoletti, A. Automatic recognition of explorative strategies in the morris water maze. Journal of neuroscience methods 130, 33–44 (2003).
Article Google Scholar
Garthe, A., Behr, J. & Kempermann, G. Adult-generated hippocampal neurons allow the flexible use of spatially precise learning strategies. PloS one 4, e5464 (2009).
Article ADS Google Scholar
Rogers, J., Churilov, L., Hannan, A. J. & Renoir, T. Search strategy selection in the morris water maze indicates allocentric map formation during learning that underpins spatial memory formation. Neurobiology of learning and memory 139, 37–49 (2017).
Article Google Scholar
Yeshurun, S. et al. Elevated paternal glucocorticoid exposure modifies memory retention in female offspring. Psychoneuroendocrinology (2017).
Illouz, T., Madar, R., Louzon, Y., Griffioen, K. J. & Okun, E. Unraveling cognitive traits using the morris water maze unbiased strategy classification (must-c) algorithm. Brain, behavior, and immunity 52, 132–144 (2016).
Article Google Scholar
Cortes, C. & Vapnik, V. Support-vector networks. Machine learning 20, 273–297 (1995).
MATH Google Scholar
Vouros, A., Gehring, T. V., Croucher, M. & Vasilaki, E. RodentDataAnalytics/mwm-ml-gen: Version 4.0.3-beta RodentDataAnalytics/mwm-ml-gen: Version 4.0.3-beta, https://doi.org/10.5281/zenodo.1117837 (2017).
Aston-Jones, G., Rajkowski, J. & Cohen, J. Locus coeruleus and regulation of behavioral flexibility and attention. Progress in brain research 126, 165–182 (2000).
Article CAS Google Scholar
Luksys, G., Gerstner, W. & Sandi, C. Stress, genotype and norepinephrine in the prediction of mouse behavior using reinforcement learning. Nature neuroscience 12, 1180–1186 (2009).
Article CAS Google Scholar
Luksys, G. & Sandi, C. Neural mechanisms and computations underlying stress effects on learning and memory. Current opinion in neurobiology 21, 502–508 (2011).
Article CAS Google Scholar
Whishaw, I. Q. & Mittleman, G. Visits to starts, routes, and places by rats (rattus norvegicus) in swimming pool navigation tasks. Journal of Comparative Psychology 100, 422 (1986).
Article CAS Google Scholar
Hamilton, D. A., Rosenfelt, C. S. & Whishaw, I. Q. Sequential control of navigation by locale and taxon cues in the morris water task. Behavioural brain research 154, 385–397 (2004).
Article Google Scholar
Gagniuc, P. A. Markov Chains: From Theory to Implementation and Experimentation (John Wiley & Sons, 2017).
Gehring, T. V., Wesierska, M. J., Wójcik, D. K. & Vasilaki, E. Analysis of behaviour in the active allothetic place avoidance task based on cluster analysis of the rat movement motifs. bioRxiv 157859 (2017).
Boal, J. G., Dunham, A. W., Williams, K. T. & Hanlon, R. T. Experimental evidence for spatial learning in octopuses (octopus bimaculoides). Journal of Comparative Psychology 114, 246 (2000).
Article CAS Google Scholar
Gerlai, R. Zebrafish and relational memory: Could a simple fish be useful for the analysis of biological mechanisms of complex vertebrate learning? Behavioural Processes (2017).
Article Google Scholar
Bilenko, M., Basu, S. & Mooney, R. J. Integrating constraints and metric learning in semi-supervised clustering. In Proceedings of the twenty-first international conference on Machine learning, 11 (ACM, 2004).
Kovács, F., Legány, C. & Babos, A. Cluster validity measurement techniques. In 6th International symposium of hungarian researchers on computational intelligence (2005).
Sharkey, A. & Sharkey, N. Diversity, selection, and ensembles of artificial neural nets. Neural Networks and their Applications (NEURAP’97) 205–212 (1997).
Zhou, Z.-H., Wu, J. & Tang, W. Ensembling neural networks: many could be better than all. Artificial intelligence 137, 239–263 (2002).
Article MathSciNet Google Scholar
Kearns, M. J. & Valiant, L. G. Cryptographic limitations on learning boolean formulae and finite automata. In Machine Learning: From Theory to Applications, 29–49 (Springer, 1993).
Gerecke, U., Sharkey, N. E. & Sharkey, A. J. Common evidence vectors for self-organized ensemble localization. Neurocomputing 55, 499–519 (2003).
Article Google Scholar
Jurek, A., Bi, Y., Wu, S. & Nugent, C. Classification by cluster analysis: A new meta-learning based approach. In International Workshop on Multiple Classifier Systems, 259–268 (Springer, 2011).
Oza, N. C. & Tumer, K. Classifier ensembles: Select real-world applications. Information Fusion 9, 4–20 (2008).
Article Google Scholar
Acharya, A., Hruschka, E. R., Ghosh, J. & Acharyya, S. C 3e: A framework for combining ensembles of classifiers and clusterers. In International Workshop on Multiple Classifier Systems, 269–278 (Springer, 2011).
Schapire, R. E. The strength of weak learnability. Machine learning 5, 197–227 (1990).
Google Scholar
Zhu, M. Use of majority votes in statistical learning. Wiley Interdisciplinary Reviews: Computational Statistics 7, 357–371 (2015).
Article MathSciNet Google Scholar
Ruta, D. & Gabrys, B. A theoretical analysis of the limits of majority voting errors for multiple classifier systems. Pattern Analysis and Applications 5, 333–350 (2002).
Article MathSciNet Google Scholar
Varma, S. & Simon, R. Bias in error estimation when using cross-validation for model selection. BMC bioinformatics 7, 91 (2006).
Article Google Scholar
Liaw, A. & Wiener, M. Classification and regression by randomforest. R news 2, 18–22 (2002).
Google Scholar
Bouziane, H., Messabih, B. & Chouarfia, A. Profiles and majority voting-based ensemble method for protein secondary structure prediction. Evolutionary bioinformatics online 7, 171 (2011).
CAS PubMed PubMed Central Google Scholar
Siegel, S. Nonparametric statistics for the behavioral sciences. Nonparametric Statistics for the Behavioral Sciences (1956).
Theodorsson-Norheim, E. Friedman and quade tests: Basic computer program to perform nonparametric two-way analysis of variance and multiple comparisons on ranks of several related samples. Computers in biology and medicine 17, 85–99 (1987).
Article CAS Google Scholar
Hollander, M. & Wolfe, D. A. Nonparametric statistical methods. Wiley Series in Probability and Statistics (1999).
Wallis, S. Binomial confidence intervals and contingency tests: mathematical fundamentals and the evaluation of alternative methods. Journal of Quantitative Linguistics 20, 178–208 (2013).
Article Google Scholar
Brown, L. D., Cai, T. T. & DasGupta, A. Interval estimation for a binomial proportion. Statistical science 101–117 (2001).
Peck, R. Statistics: the exploration and analysis of data. (Brooks/Cole, Cengage Learning, Australia United States, 2012).
Google Scholar
Noldus, L. P., Spink, A. J. & Tegelenbosch, R. A. Ethovision: a versatile video tracking system for automation of behavioral experiments. Behavior Research Methods, Instruments, & Computers 33, 398–414 (2001).
Article CAS Google Scholar
MATLAB. version 9.0 (R2016a) (The MathWorks Inc., Natick, Massachusetts, 2016).
Frank, E., Hall, M. A. & Witten, I. H. The WEKA Workbench (Online Appendix for “Data Mining: Practical Machine Learning Tools and Techniques”, Morgan Kaufmann, Fourth Edition, 2016).
MATLAB. Matlab statistics toolbox (2016).
Wolfer, D. P. & Lipp, H.-P. A new computer program for detailed off-line analysis of swimming navigation in the morris water maze. Journal of neuroscience methods 41, 65–74 (1992).
Article CAS Google Scholar
Márquez, C. et al. Peripuberty stress leads to abnormal aggression, altered amygdala and orbitofrontal reactivity and increased prefrontal maoa gene expression. Translational psychiatry 3, e216 (2013).
Article Google Scholar

Download references

Acknowledgements

E.V. acknowledges the grant number 264872 by EC-FP7-PEOPLE from the NAMASEN Marie-Curie Initial Training Network. C.S. acknowledges the grant number 31003A_176206 from the Swiss National Science Foundation Project. K.L. acknowledges the grant number W19/7.PR/2014 from the Polish Ministry of Science and Education and the grant number 602102 from the FP7-HEALTH project (EPITARGET).

Author information

Authors and Affiliations

Department of Computer Science, University of Sheffield, Sheffield, UK
Avgoustinos Vouros, Tiago V. Gehring, Zehai Tu & Eleni Vasilaki
Laboratory of Epileptogenesis, Nencki Institute of Experimental Biology, Warsaw, Poland
Kinga Szydlowska & Katarzyna Lukasiuk
Neurobiology Center, Nencki Institute of Experimental Biology, Warsaw, Poland
Artur Janusz & Witold Konopka
School of Computing, University of Leeds, Leeds, UK
Mike Croucher
Laboratory of Behavioral Genetics, Brain Mind Institute, EPFL, Lausanne, Switzerland
Carmen Sandi

Authors

Avgoustinos Vouros
View author publications
You can also search for this author in PubMed Google Scholar
Tiago V. Gehring
View author publications
You can also search for this author in PubMed Google Scholar
Kinga Szydlowska
View author publications
You can also search for this author in PubMed Google Scholar
Artur Janusz
View author publications
You can also search for this author in PubMed Google Scholar
Zehai Tu
View author publications
You can also search for this author in PubMed Google Scholar
Mike Croucher
View author publications
You can also search for this author in PubMed Google Scholar
Katarzyna Lukasiuk
View author publications
You can also search for this author in PubMed Google Scholar
Witold Konopka
View author publications
You can also search for this author in PubMed Google Scholar
Carmen Sandi
View author publications
You can also search for this author in PubMed Google Scholar
Eleni Vasilaki
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.V. and E.V. designed the methodology and wrote the manuscript. A.V. developed the stand-alone RODA software and performed the experiments. T.V.G. and E.V. designed the initial methodology. T.V.G. developed the original software tools. Z.T., K.S., A.J., K.L. and W.K. validated and tested the analysis and the software. M.C. contributed to the software development. C.S. provided the MWM experimental data and assisted with the interpretation of the results. All authors provided feedback to the manuscript.

Corresponding authors

Correspondence to Avgoustinos Vouros or Eleni Vasilaki.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Vouros, A., Gehring, T.V., Szydlowska, K. et al. A generalised framework for detailed classification of swimming paths inside the Morris Water Maze. Sci Rep 8, 15089 (2018). https://doi.org/10.1038/s41598-018-33456-1

Download citation

Received: 28 February 2018
Accepted: 28 September 2018
Published: 10 October 2018
DOI: https://doi.org/10.1038/s41598-018-33456-1

Keywords

This article is cited by

Geometry-based navigation in the dark: layout symmetry facilitates spatial learning in the house cricket, Acheta domesticus, in the absence of visual cues
- Bartosz Baran
- Michał Krzyżowski
- Mateusz Hohol
Animal Cognition (2023)
Search strategy analysis of Tg4-42 Alzheimer Mice in the Morris Water Maze reveals early spatial navigation deficits
- Nadine Curdt
- Franziska W. Schmitt
- Yvonne Bouter
Scientific Reports (2022)
Strategies discovery in the active allothetic place avoidance task
- Avgoustinos Vouros
- Tiago V. Gehring
- Eleni Vasilaki
Scientific Reports (2022)
A framework to identify structured behavioral patterns within rodent spatial trajectories
- Francesco Donnarumma
- Roberto Prevete
- Giovanni Pezzulo
Scientific Reports (2021)
Behavioral characteristics as potential biomarkers of the development and phenotype of epilepsy in a rat model of temporal lobe epilepsy
- Karolina Nizinska
- Kinga Szydlowska
- Katarzyna Lukasiuk
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Trajectory Segmentation Analysis (TSA) & the RODA Software

Advantages of Trajectory Segmentation Analysis (TSA)

Robustness across different segmentation configurations

Discussion

Methods

Analysis Overview

Trajectories Segmentation and Partial Labelling

Classification Boosting with Majority Voting

Majority Voting Implementation

Framework Validation

Classifier Diversity

Percentage of unclassified segments

Mapping Segment Classes to the Full Swimming Paths

Statistics

The RODA Software

Classes of Behaviour and Strategy Transitions

Thigmotaxis (TT)

Incursion (IC)

Scanning (SC)

Focused Search (FS)

Chaining Response (CR)

Self Orienting (SO)

Scanning Surroundings (SS)

Scanning Target (ST)

Direct Finding (DF)

Strategy Transitions (tr)

Morris Water Maze Experiment and Data Properties

Data Availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Comments

Search

Quick links