Introduction

A major goal in behavioral neuroscience is the correlation of behavioral expressions with neuronal activity. Ideally, however, the behavior should be identified in real time, allowing for instantaneous feedback, i.e., closed-loop manipulation based on the current behavioral expression1,2. Currently, such experimental systems often rely on specialized, purpose-built setups, including intricate beam break designs, treadmills, and virtual reality setups to approximate the movement of the investigated animal in a given environment and then react accordingly3,4,5,6,7,8,9,10,11.

Classic manipulations of neuronal activity such as lesions, transgenic alterations, and pharmacological injections result in long-lasting, and sometimes chronic changes in the tested animals, which can make it difficult to interpret behavioral effects. In recent years, there has been a shift towards techniques that allow for fast, short-lived manipulation of neuronal activity. Optogenetic manipulation, for example, offers high temporal precision, enabling the manipulation of experience during experimental tasks that test mechanisms of learning and memory12,13,14, perception15,16, and motor control17,18. Such techniques offer a temporal resolution precise enough that the neuronal manipulation can match the timescale of either behavioral expression or neuronal computation.

Recent developments in the field of behavioral research have made offline pose estimation of several species possible using robust deep learning-based markerless tracking19,20,21. DeepLabCut (DLC)19, for example, uses trained deep neural networks to track the position of user-defined body parts and provides motion tracking of freely moving animals. Additionally, sophisticated computational approaches have allowed for disentangling the complex behavioral expressions of animals into patterns of reoccurring modules22,23,24,25,26. In vivo single-unit recording27, along with recent advances in in vivo voltage imaging28 and miniaturized calcium imaging techniques29,30,31, facilitate real-time measurements of neuronal activity in freely moving mice. Together, these techniques provide a platform for correlating recorded neuronal activity and complex behavior.

We here introduce DeepLabStream (DLStream), a multi-purpose software solution that enables markerless, real-time tracking and neuronal manipulation of freely moving animals during ongoing experiments. Its core capability is the orchestration of closed-loop experimental protocols that stream posture-dependent feedback to a range of input and output devices. We modified state-of-the-art pose estimation based on DLC19 to track the postures of mice in real time. To demonstrate the software’s capabilities, we conducted a classic, multilayered, freely moving conditioning task, as well as a head direction-dependent optogenetic stimulation experiment using a neuronal activity-dependent, light-induced labeling system (Cal-Light)1. Finally, we discuss the versatility of DLStream to adapt to different experimental conditions and hardware configurations.

Results

DLStream enables closed-loop stimulations directly dependent on the actually expressed behavioral postures. Our solution is fully autonomous and requires no additional tracking, trigger, or timing devices. All experiments can be conducted without restricting the animal’s movement, and each experimental session runs fully autonomously after the initial setup. Initially, we trained DLC-based pose estimation networks offline for each experimental environment and then integrated them into DLStream (see “Methods” section). Briefly, frames were taken from a video camera stream and analyzed using an integrated deep neural network trained with the DLC framework. Next, the resulting pose estimation was converted into postures and transferred to an additional process that supervises the ongoing experiment and outputs feedback to connected devices (Fig. 1). As a result, experiments run by DLStream comprise a sequence of modules (Fig. 2c) depending on the underlying experimental protocol. Basic modules, such as timers and stimulations, are posture-independent and control fundamental aspects of the experiment. Timers keep track of the time passing during frame-by-frame analysis and act as a gate for posture-based triggers and stimulations (e.g., inter-stimulus time). Stimulations specify which devices are triggered and how each device is controlled once it is triggered (e.g., reward delivery). Posture-based triggers are sets of defined postures (e.g., position, head direction, etc.) that initialize a predefined cascade (stimulation) once detected within an experiment (see Fig. 2 for examples). As an experiment is conducted, DLStream records and subsequently exports all relevant information, including posture tracking, experimental status, and response latency, in a table-based file. During any experiment, posture tracking is visualized on a live video stream, directly enabling the monitoring of the conducted experiment and tracking quality. Additionally, the raw video camera stream is timestamped and recorded, allowing high-framerate recording with lower-framerate closed-loop posture detection to save processing power (Fig. 1).
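To illustrate how trigger, timer, and stimulation modules can interlock, the following is a minimal, hypothetical Python sketch of this gating logic. It is not DLStream’s actual API; the class, function, and parameter names (Timer, RegionTrigger, deliver_stimulus, the "nose" key) are placeholders chosen for illustration only.

```python
import time


class Timer:
    """Gate that is 'open' once a set duration has elapsed since the last reset
    (e.g., an inter-stimulus timer)."""
    def __init__(self, duration_s: float):
        self.duration_s = duration_s
        self._start = None

    def reset(self):
        self._start = time.perf_counter()

    def elapsed(self) -> bool:
        return self._start is None or (time.perf_counter() - self._start) >= self.duration_s


class RegionTrigger:
    """Posture-based trigger: fires when a tracked body part enters a circular ROI."""
    def __init__(self, center, radius_px):
        self.center = center
        self.radius_px = radius_px

    def check(self, point) -> bool:
        dx, dy = point[0] - self.center[0], point[1] - self.center[1]
        return (dx * dx + dy * dy) ** 0.5 <= self.radius_px


def process_frame(posture, trigger, inter_stimulus_timer, deliver_stimulus):
    """One frame of a closed-loop protocol: trigger -> timer gate -> stimulation."""
    if trigger.check(posture["nose"]) and inter_stimulus_timer.elapsed():
        deliver_stimulus()               # e.g., reward delivery or light pulse
        inter_stimulus_timer.reset()     # blocks further stimulation for the set interval
```

In DLStream itself, the equivalent modules are combined into experiment protocols as described in the tutorials; the sketch only conveys the control flow of a trigger gated by a timer.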

Fig. 1: A visual representation of DLStream.
figure 1

Visual representation of workflow in DLStream. Initially, an experimental protocol is designed using a sequence of modules (puzzle pieces) and a trained DLC network is integrated into DLStream. Afterward, DLStream provides three different outputs for every experiment. 1. Experiments can be monitored on a live stream. 2. The experimental protocol is run based on posture detection. 3. Recorded video and experimental data are exported after the experiment is done.

Fig. 2: Experimental design using DLStream.
figure 2

a, b Schematic design of an experimental protocol with a posture-based trigger. Manipulation can be turned “Conditional OFF” (a) and “Conditional ON” (b) based on the mouse’s behavior. Combining several modules allows sophisticated experimental protocols to be built. For example, the timer module can be utilized to design inter-trial and inter-stimulus timers (b), minimum stimulation (b), or delayed triggers (e). c Description of available modules in a and b. d Application of the above-described design in an optogenetic experiment. The stimulation is triggered dependent on the head direction angle (orange arrow, α) relative to a reference point (red line) within the target window (blue arc). e Application of the above-described design in a classical conditioning task. The mouse is shown an image when looking at the screen (left) and the reward is removed if it does not move into the reward location within a predefined timeframe (right, green zone). The mouse’s posture is shown with orange dots.

Classical second-order conditioning using DLStream

To comprehensively test DLStream, we first designed a semi-automated classical second-order conditioning task (Fig. 3a–e). Using DLStream, mice were trained to associate two unknown odors (rose and vanillin) with two visual stimuli, which were initially associated with either a reward or an aversive tone (Fig. 3a). We subsequently tested the conditioned mice in an odor preference task. In the first conditioning stage, DLStream triggered trials when a mouse was facing the screen. For this, a trigger module was designed that utilizes the general head direction of mice, activating stimulation modules only when mice were looking towards the screen within a 180° window. The mice were conditioned to associate two unknown visual stimuli (high-contrast black-and-white images) with a reward or an aversive tone (Fig. 3a) using combinations of predefined stimulation modules. In the positive trial, DLStream delivered a liquid reward at a fixed reward location by triggering the corresponding stimulation module and withdrew it if it was not collected within a preset time period monitored with a timer module. In the negative trial, DLStream delivered only the aversive tone (Fig. 3a). All mice (n = 10) were trained for 13 days and were selected based on whether they individually reached the success criterion (85% reward collection within one session; n = 6 mice). Within our experimental protocol, we limited sessions to 1 h or 40 trials per day. Note that no mouse needed more than 45 min to complete a session. During the subsequent second-order conditioning, the mice were presented with two novel odors (rose and vanillin), placed in petri dishes in front of the screen (Fig. 3b). Each visual stimulus had been paired with one of the odors, and the pairing was kept constant throughout all experiments. Upon exploration of one of the two presented odors, DLStream showed the mice the paired, previously conditioned visual stimulus (Fig. 3b). The session was completed when DLStream detected that the mice had explored both odors at least 10 times, or after 10 min had passed. Second-order conditioning was conducted in two stages. The first stage required the mouse to be in direct contact with the odor location (petri dish), while the second depended on the proximity of the mouse to one of the locations and its head direction (Fig. 3b). For this, trigger modules designed to detect proximity and the heading direction of mice were used (a sketch of such a combined condition is given below). Each stage was repeated twice with exchanged odor locations.
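The stage-two condition (close to an odor location and facing it) can be expressed as a simple geometric check on the tracked body parts. The following is an illustrative sketch only; the distance and angle thresholds (max_dist_px, max_angle_deg) are hypothetical placeholders and do not correspond to the exact parameters used in the experiment.

```python
import numpy as np


def near_and_facing(nose, neck, odor_location, max_dist_px=80.0, max_angle_deg=45.0):
    """True if the nose is within max_dist_px of the odor location and the head
    direction (vector from neck to nose) points toward it within max_angle_deg.
    All coordinates are (x, y) pixel positions; thresholds are illustrative."""
    nose = np.asarray(nose, dtype=float)
    neck = np.asarray(neck, dtype=float)
    target = np.asarray(odor_location, dtype=float)

    to_target = target - nose
    if np.linalg.norm(to_target) > max_dist_px:
        return False

    head_vec = nose - neck
    cos_angle = np.dot(head_vec, to_target) / (
        np.linalg.norm(head_vec) * np.linalg.norm(to_target) + 1e-9
    )
    angle = np.degrees(np.arccos(np.clip(cos_angle, -1.0, 1.0)))
    return angle <= max_angle_deg
```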

Fig. 3: Closed-loop conditioning task.
figure 3

a Conditioning. When a trial is triggered by the mouse facing the screen (green triangle and ring), the mouse is shown a visual stimulus (yellow lightning bolt). Mice not facing the screen do not receive the stimulus (red x). In the positive trial (green lightning bolt, green line), a reward is delivered (blue drop, arrow down) and withdrawn (blue drop, arrow up) if not collected within a preset time period. In the negative trial (blue lightning bolt, blue line) only a loud tone (red polygon) is delivered. b 2nd Order conditioning. Upon exploration of either odor location (colored black circle) the mouse is shown one of the previously conditioned visual stimuli on the screen (yellow lightning bolt). Conditioning was conducted in two stages. The first stage (Stage 1) consisted of direct contact with the odor location, while the second (Stage 2) was dependent on the proximity of the mouse to one of the locations (black arrow) and the mouse facing towards it. c Odor preference task. The mouse was set in an open field arena with one odor in each of the quarters (colored circles). The total investigation time of each odor source was measured. d Investigation time during odor preference task in odor location: ROIs encircling the odor location. The bar graph shows the STD and individual data points. p < 0.05 (*) one-tailed paired t-test; R/V p = 0.0395, R/VA p = 0.0497, R/A p = 0.0311; n = 11 trials (2 trials per mouse, 1 trial excluded, 6 mice total; see also Supplementary Data 1). Error bars represent standard deviation. V = Vanillin (S+), R = Rose (S−), VA = Valeric acid, A = Acetophenone.

We then tested for successful second-order conditioning by conducting an offline odor preference task (Fig. 3c). Mice (n = 6) were placed in an open field arena with one odor in each of the quarters. In addition to the two conditioned odors, two novel odors (acetophenone and valeric acid) were presented. Mice were given two 10-min trials to explore, and total investigation time was measured within circular regions around the odor locations. Mice showed a preference towards the positively conditioned (S+) odor compared to the negatively conditioned (S−) odor by spending significantly more time at the S+ odor location than at the S− odor location (Fig. 3d). While the investigation time of the S− odor was significantly lower than that of both novel odors, we found no significant difference between the S+ odor and the novel odors.

Optogenetic, head direction-dependent labeling of neurons using DLStream

As a second example of DLStream’s applicability, we tested the possibility of optogenetically labeling active neurons in the anterior dorsal nucleus of the thalamus (ADN) dependent on the mouse’s head direction, using the neuronal activity-dependent labeling system Cal-Light1. Activity within ADN neurons is known to be modulated by the angle of head direction27. Within a stable environment, the angular tuning curve of an ADN neuron remains constant, facilitating experimental designs that span several days32. To label ADN ensembles, we utilized DLStream to deliver light stimuli within precisely defined head direction angles (target window) (Fig. 4). Timing was controlled by designated timer modules governing the onset and offset of light stimulation once the stimulation module was triggered. Mice were placed in a circular white arena with a single black cue at one side and allowed to investigate the arena in one 30-min session per day for four consecutive days. During each session, mice were stimulated via a chronically implanted optical fiber with blue light (488 nm) triggered by their head direction angle. Mice were able to freely move their heads in all directions, but stimulation was limited to periods when they oriented their head within the designated head direction target window (60° around the reference point, Fig. 4b, c and Supplementary Fig. 4). Each stimulation lasted 1–5 s, depending on the time spent orienting towards the target window (60°), with a minimum inter-stimulus time of 15 s. In the case of the inter-stimulus timer, the module blocked the link between the trigger module and the stimulation module when activated, disabling posture-dependent stimulation for its designated duration.
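The stimulation timing just described (a minimum of 1 s, up to 5 s while the head remains in the target window, and a 15 s refractory period) can be summarized as a small state machine. The sketch below, which assumes per-frame updates, is purely illustrative: HeadDirectionStimulator and the laser_on/laser_off callbacks are hypothetical names, not DLStream modules, and actual laser control in the experiments went through the hardware described in the “Methods” section.

```python
import time


class HeadDirectionStimulator:
    """Illustrative timing logic: once triggered, the light stays on for at least
    MIN_ON seconds, up to MAX_ON seconds while the head stays in the target window,
    followed by a REFRACTORY period during which no new stimulation can start."""

    MIN_ON, MAX_ON, REFRACTORY = 1.0, 5.0, 15.0  # seconds, as described in the text

    def __init__(self, laser_on, laser_off):
        self.laser_on, self.laser_off = laser_on, laser_off
        self.on_since = None
        self.off_since = -float("inf")

    def update(self, in_target_window: bool):
        """Call once per analyzed frame with the current trigger state."""
        now = time.perf_counter()
        if self.on_since is None:
            # Off: start stimulation only if triggered and the refractory time has passed
            if in_target_window and now - self.off_since >= self.REFRACTORY:
                self.laser_on()
                self.on_since = now
        else:
            on_time = now - self.on_since
            # On: stop after the maximum duration, or after the minimum duration
            # once the head has left the target window
            if on_time >= self.MAX_ON or (on_time >= self.MIN_ON and not in_target_window):
                self.laser_off()
                self.on_since, self.off_since = None, now
```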

Fig. 4: Optogenetic labeling of head direction-dependent neuronal activity.
figure 4

a Left: Stereotactic delivery of Cal-Light viruses into the ADN and fiber ferrule placement. Middle: Infected neurons (red) are stimulated with blue light (488 nm) controlled by DLStream. Right: Infected neurons are only labeled (yellow) when they are active (black arrow) during light stimulation (middle). b Schematic drawing of the circular arena with the visual cue (thick black arc) and the target window (thick blue arc) around the reference point (red circle). DLStream-triggered stimulation is strictly dependent on the correct head direction (blue arc). c Left: Representative example radial histogram (see also Supplementary Data 2) of all head directions during stimulation (red) within one session (normalized to the maximum value). The mean resultant vector length is indicated by r. Right: Radial histogram of all head directions during the whole session (gray) and during stimulation (red) (normalized to the maximum value of the entire session). Rings represent quantiles in 20% steps. d Left: Representative random sample of the whole session simulating stimulation without DLStream control at random time points during the session (normalized to the maximum value). The mean resultant vector length is indicated by r. For each session, random distributions were calculated 1000 times. Right: For one session, the distribution of mean resultant vector lengths generated by random sampling (n = 1000). The red line denotes the actual mean resultant vector length during stimulation in the session. The dotted black line represents the p < 0.01 cutoff. e Representative example of the mouse’s position (gray) over time during the first 5 min of the session in c. The stimulation events are shown in blue. f Heatmaps representing the relative occupancy of the mouse within the arena during the whole session (top) and stimulation (bottom) in c. Cue and target window are shown in their relative position. g Example of Cal-Light expression in an experimental mouse. Left: tdTomato expression (red) indicating expression of Cal-Light viruses with nucleus staining (DAPI, blue). Right: Activity-dependent and light-induced eGFP expression (green). The white box represents the zoomed-in region in h. The bar represents 200 µm. h Close-up from g vs. a similar region in an animal that was not stimulated with light (no light). Left: tdTomato expression (red). Right: Activity-dependent and light-induced eGFP expression (green). The bar represents 50 µm. Note that control mice show no eGFP expression. i Average light stimulation during each session (40 total) corresponding to head direction (60° bins) with target window (blue) indicating the DLStream-triggered stimulation onset (see also Supplementary Data 3). Paired Student’s t-test: p < 0.001. n = 10 mice. Error bars represent standard deviation. j Ratio between infected neurons (tdTom+) and activity-dependent labeled neurons (eGFP+/tdTom+) in mice matching selection criteria (see “Methods” section). n = 2 mice.

The resulting average light stimulation per session (48 ± 10 s) occurred selectively in the target angle window across all experimental animals (Fig. 4i). Note that stimulation at outside-target head direction angles can result from individual stimulations having a chosen minimum duration of 1 s, during which the mouse could theoretically sweep its head away from the target window. The average total stimulation time across all four sessions was 357 ± 53 s (n = 10 mice). As a control, a yoked group of mice was run such that each mouse, regardless of its actual head direction, received the exact same temporal stimulus as a paired experimental mouse. Therefore, in the yoked group, light stimuli were decoupled from head direction (Supplementary Fig. 1a).

Mice explored the entire arena during the task, and the resulting light stimulation was not dependent on the animal’s position in the arena, as animals could angle their head in the target orientation from any position within the arena (Fig. 4e, f). Randomly sampling angles equal in number to the stimulated angles revealed a nonspecific distribution of angles, i.e., mice oriented in all directions (Fig. 4d, left). Note that for each individual mouse, the mean resultant vector length for stimulated angles was significantly larger than would be expected by random sampling (see “Methods” section, n = 1000 samples, p < 0.01) (Fig. 4d, right).

Next, we quantified the percentage of ADN neurons that were labeled in three different groups (experimental, no light, and yoked). Only mice that matched the selection criteria (correct fiber ferrule placement as well as injection placement) were taken into account when quantifying Cal-Light conversion (see “Methods” section and Supplementary Fig. 2 for details). Cal-Light infected neurons showed a 46% conversion within the ADN (Fig. 4j, n = 2 mice), while mice that received no light stimulation but underwent the same sessions showed no light-induced labeling (Fig. 4g–j). Furthermore, within the yoked group, only a very low percentage of labeling (~4%, n = 2 mice) was observed (Supplementary Fig. 1b, c), indicating that the repeated pairing between light stimulation and head direction-triggered activity was essential for Cal-Light-mediated fluorescent labeling.

Computational performance of DLStream

A reality of any closed-loop system is that there are temporal delays between real-time detection of particular postures and stimulus output. To address this challenge, we first rigorously quantified the variance of the behavioral parameters we measured. To estimate the spatiotemporal resolution of postures that can be detected using our integrated network configuration, we compared the pose estimation error of our networks and the correlated parameter changes between frames. Note that, due to the inherent individual network performances, DLStream’s effective accuracy in posture detection is heavily influenced by the prior training of the utilized networks. Nevertheless, if performance is not sufficient for the executed experiment, DLC networks can always be retrained using the tools provided by DLC. In our hands, the trained network used during optogenetic experiments resulted in an estimated average pose estimation error of 4 ± 12 pixels (px) for the neck point, 3.3 ± 4.4 px for the nose, and 3.3 ± 2.0 px for the tail root (n = 597 images) when compared to a human annotator labeling the same data set (mice without tail were ~60 px long in our 848 × 480 px recordings). Body part estimation resulted in an average head direction variance of 3.6 ± 9.6° (tested in 80 sessions, 1000 frames per session) between consecutive frames, with an estimated average error of 7.7 ± 15.1° compared to human annotation (n = 597, ground truth) per frame. The frame-by-frame variance is a product of performance errors and the inhomogeneous movement of the animal during experiments, while the difference between network pose estimation and human annotation is most likely a result of inaccurate tracking, which can be reduced by additional training and/or larger training sets. Note that depending on the mixture of episodes of fast and slow movements during sessions, the variance might change. We next manually evaluated posture detection accuracy during optogenetic experiments and found a false-positive rate of 11.8%. In the evaluated sessions, most false-positive events were anomalies in mouse behavior, such as spontaneous jumping, which can likely be further reduced by additional network training if necessary. Additionally, we estimated the general false-positive/false-negative rate for our head direction trigger based on a human-labeled data set and found a false-negative rate of 11.1 ± 4.1%, while false-positive rates were 11.6 ± 4.8% (n = 597; see Supplementary Fig. 3 for additional data).
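Head direction and its frame-to-frame variance or error relative to annotation can be computed directly from the tracked body parts. The following is a minimal sketch under the assumption that head direction is taken as the angle of the neck-to-nose vector in image coordinates; the function names and example coordinates are purely illustrative.

```python
import numpy as np


def head_direction_deg(neck, nose):
    """Head direction as the angle (in degrees) of the vector from neck to nose."""
    vec = np.asarray(nose, dtype=float) - np.asarray(neck, dtype=float)
    return np.degrees(np.arctan2(vec[1], vec[0]))


def angular_difference_deg(a, b):
    """Smallest signed difference between two angles in degrees (wrap-around safe)."""
    return (a - b + 180.0) % 360.0 - 180.0


# Example: per-frame error between a network estimate and a human annotation
estimated = head_direction_deg(neck=(100, 100), nose=(120, 95))
annotated = head_direction_deg(neck=(101, 99), nose=(121, 96))
print(abs(angular_difference_deg(estimated, annotated)))
```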

During optogenetic experiments (n = 80), DLStream reached an average processing time of 33.32 ± 0.19 ms per frame, matching the average camera framerate of 30 Hz (33.33 ms), including posture detection and computation of the resulting experimental protocols up to output generation. We also measured the hardware latency to estimate the time between posture detection and triggered stimulation during optogenetic sessions from three different mice (n = 164 stimulation events). Here, the resulting light stimulation occurred within 5 frames (4.8 ± 1.1 frames at 30 fps; ≈150 ms). It is important to consider that the total latency critically depends on the individual setup and the intrinsic parameters of the connected components. To evaluate the limits of DLStream, we tested different hardware configurations and investigated performance levels and response times. First, average performance was measured over 10,000 frames in two different configurations with two different camera settings (30 fps and 60 fps at 848 × 480 px resolution) using the same camera used in our experiments. With the standard 30 fps camera setting, the advanced configuration (Intel Core i7-9700K @ 3.60 GHz, 64 GB DDR4 RAM, and NVidia GeForce RTX 2080 Ti (12 GB) GPU) achieved reliable 30 fps (33.33 ms per frame) real-time tracking with an average analysis time of 30 ± 7 ms, while the other system (Intel Core i7-7700K CPU @ 4.20 GHz, 32 GB DDR4 RAM, and NVidia GeForce GTX 1050 (2 GB) GPU) only reached an average analysis time of 91 ± 10 ms. Using a higher framerate input from the camera (60 fps; 16.66 ms per frame), the overall performance did not change considerably (24 ± 9 and 90 ± 9 ms, respectively). Second, we tested a different camera (Basler acA1300-200um), which lacks the depth capabilities of the Intel RealSense camera but offers a higher framerate, on the advanced configuration with different image resolutions (ranging from 1280 × 1024 to 320 × 256 px) to benchmark DLStream’s upper-performance limits with more standardized cameras and resolutions. While we initially used DLC-trained ResNet5033,34 networks during experiments, we additionally evaluated the capabilities of the other available models (ResNet10133,34, MobileNetv235) and also a higher number of body parts (3, 9, and 13 body parts). In our hands, DLStream’s processing rate reached a maximum of 130 ± 6 fps (ca. 8 ms per frame) with the MobileNetv2 architecture at 320 × 256 px resolution, while the ResNet50 network reached its upper limit at 94 ± 6 fps (ca. 10 ms per frame) at the same resolution (see Supplementary Table 1 for more details).
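Per-frame analysis times like those reported above can be measured with a simple timing loop around the frame-processing step. The sketch below is a generic illustration rather than DLStream’s built-in benchmark (which is invoked via the command given in the “Methods” section); grab_frame and analyze_frame are placeholders standing in for the camera acquisition and the pose estimation plus protocol step.

```python
import time

import numpy as np


def benchmark(grab_frame, analyze_frame, n_frames=10_000):
    """Measure the mean and standard deviation of per-frame analysis time in ms.
    grab_frame() returns the next camera frame; analyze_frame(frame) runs pose
    estimation and the experimental protocol for that frame (placeholders)."""
    durations_s = []
    for _ in range(n_frames):
        frame = grab_frame()
        t0 = time.perf_counter()
        analyze_frame(frame)
        durations_s.append(time.perf_counter() - t0)
    durations_ms = np.asarray(durations_s) * 1000.0
    return durations_ms.mean(), durations_ms.std()
```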

Discussion

There has been a recent revolution in markerless pose estimation using deep neural networks. However, the intrinsic design of these systems delays analysis until after the end of the experiment owing to their heavy computational load. Here we take advantage of the power of DLC’s offline body part tracking to train a neural network and integrate it into our real-time, closed-loop solution.

As observers, experimenters often record and interpret an animal’s behavior by taking its movement as an approximation of the underlying intention or state of mind. Building on this generalization, behavior can be defined, categorized, and even sequenced by examining estimations of the animal’s movement23,24,36,37. Classified periods of behavior, so-called behavior modules, are commonly used for offline quantification (e.g., phenotyping). In addition, behavior modules are also very promising in closed-loop approaches that react specifically to complex behavior. Such an analysis yields the prospect of predicting behavior, for example by matching initial elements of a uniquely arranged behavioral sequence. With DLStream, a combination of triggers based on the animal’s posture or a sequence of postures can be integrated into experimental designs. Example triggers include the center-of-mass position, direction, and speed of an animal, although multiple individual tracking points can also be utilized, such as the position and trajectory of multiple, user-defined body parts. This allows the design of advanced triggers that include head direction, kinematic parameters, and even specific behavior motifs (e.g., rearing, grooming, or sniffing). Out of the box, DLStream supports triggers based on single-frame as well as sequential postural information, although complex behavior modules could also be utilized once behavior based on collected posture data has been classified, modeled, and integrated as custom trigger modules into DLStream. The challenge of manually designing triggers for relevant behavior is similar to the challenges faced in offline analysis, where this has already been addressed for a variety of relevant read-outs, as described in VAME25, B-SOID26, and SIMBA38. While this is relatively simple when only single-frame posture detection with low-level features is utilized (e.g., head direction angle), defining sequential changes of features to capture more complex changes in the animal’s movement requires the careful exploration and extraction of relevant features. Once feature extraction is established, however, the behavior of interest can be detected and implemented as a custom trigger in DLStream. Promising approaches for machine-guided classification are being actively developed using DLC-based pose estimation as input25,26,38, which should increase the range of available triggers considerably. The integration of fast behavior classifiers, for example, would enable the design of a trigger that reacts to complex behaviors without the need for a strict, manual description of relevant feature changes. To facilitate the design of custom experiments and triggers, we offer several tutorials and guides with our DLStream code (https://github.com/SchwarzNeuroconLab/DeepLabStream). Additionally, easy-to-use, GUI-based toolkits such as SimBA38 facilitate the generation and open-source distribution of robust classification models.
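As an illustration of what a custom trigger amounts to conceptually, the skeleton below implements a simple speed-based trigger on sequential postures. It is a simplified, hypothetical sketch and does not reproduce DLStream’s actual trigger interface; the class name, method name, and threshold are placeholders, and the tutorials on the GitHub page describe the real implementation path.

```python
import numpy as np


class SpeedTrigger:
    """Simplified sketch of a posture-based trigger using sequential information:
    fires when a chosen body part moves faster than a threshold (px per frame)."""

    def __init__(self, bodypart: str, threshold_px_per_frame: float):
        self.bodypart = bodypart
        self.threshold = threshold_px_per_frame
        self._last_position = None

    def check_posture(self, posture: dict) -> bool:
        """posture maps body-part names to (x, y) pixel coordinates for the current frame."""
        point = np.asarray(posture[self.bodypart], dtype=float)
        fired = False
        if self._last_position is not None:
            fired = np.linalg.norm(point - self._last_position) >= self.threshold
        self._last_position = point
        return fired


# Example usage: trigger when the nose moves more than 10 px between consecutive frames
trigger = SpeedTrigger(bodypart="nose", threshold_px_per_frame=10.0)
```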

Two of the most considerable limitations of any real-time application are the latency with which the system reacts to a given input and the rate at which meaningful data are obtained. While the latency depends on the computational complexity, the rate depends on several factors, hardware constraints in particular. A researcher might only need the broadest movements or behavioral states to understand an animal’s basic behavior, or might require fast, accurate posture sequences to classify behavioral modules on a sub-second scale24,36. Considering that animals behave in a highly complex manner, a freely moving approach is favorable, since restricting movement likely reduces the read-out of the observable behavioral spectrum.

DLStream is designed as a universal solution for freely moving applications and can, therefore, be used to investigate a wide range of organisms. DLC networks already have the innate capability to track a variety of animals across different species39, which can be directly translated to experiments within DLStream. Additionally, its architecture was designed for short to mid-length experiments (minutes to hours). There are no built-in limitations preventing long-lasting experiments (days to weeks), but DLStream currently lacks the capability to automatically process the large amounts of raw video data, and other utilities that become necessary when recording over longer periods of time. One possible solution would be to remove the raw video output and only save the experimental data, including posture information, which would considerably reduce the necessary data storage space.

With regard to latency, the currently fully tested closed-loop timescale enables the tracking and manipulation of a wide range of activities a rodent might perform during a task. Very fast movements, however, like whisker movement40,41 and pupil contraction42,43, might not be fully detected using the 30 Hz configuration from our experiments, but might be detectable using a lower camera resolution and a different network architecture (e.g., MobileNetv2; Supplementary Table 1). Most freely moving applications lack the resolution to visualize whiskers and pupils while maintaining an overview of the animal’s movement in a large arena. Note that raw, higher-framerate videos can still be recorded for offline analysis if desired: DLStream is able to take frames from a higher-framerate stream while maintaining a lower, loss-less closed-loop processing rate. On a side note, developments in alternative, non-video-based, specialized tracking (e.g., eye-tracking44) might provide a solution for researchers interested in capturing truly holistic behavioral data.

Using posture-dependent conditioning, mice were able to successfully learn an association between a visual stimulus and a reward, thus demonstrating DLStream’s capabilities with respect to the automatization of classical learning tasks. Second-order conditioning resulted in an odor preference between the conditioned odors. Importantly, mice did not need any previous training apart from initial habituation to the reward delivery system to perform this task. Interestingly, mice investigated the novel odors at the same level as the positively reinforced odor, suggesting a novelty component that influences the animal’s investigative behavior. It is likely that previous habituation to the neutral odors would reduce this effect. Importantly, possible applications are not limited to classical conditioning tasks. Many other behavioral tasks, operant conditioning for example, could also be accomplished by setting a specific posture or sequence of postures as a trigger to reward, punish, or manipulate freely behaving animals during an experimental session.

To dissect and better understand the neuronal correlates of complex behaviors, knowledge of the actively participating neuronal assemblies is desirable. Techniques that can bridge connectomics, electrophysiology, and ethology hold the potential to reveal how computations are realized in the brain and subsequently implemented to form behavioral outcomes. For instance, by utilizing neuronal activity-dependent labeling systems such as Cal-Light1, Flare2, or CaMPARI45, it is already possible to visualize active neurons during episodes of behaviors of interest. However, the identification of repetitive/reoccurring episodes and the subsequent activation of a specific trigger is currently restricted by a lack of dynamic closed-loop systems. With DLStream, we show that the real-time detection of specific behaviors in freely moving mice can be combined with neuronal activity-dependent labeling systems (Cal-Light) to investigate the neuronal correlates of behavior. We here delivered light stimuli to the ADN to label neural ensembles active during specific head directions. Within the selected experimental animals, labeling of active neurons was successful and resulted in the labeling of a subset of cells (ca. 46%). Our goal was to demonstrate that DLStream can potentially label such specific ensembles of active neurons during relevant behavioral expressions. Direct optogenetic activation and inhibition46,47,48,49 of neuronal populations based on posture detection might also be possible with DLStream, although our stimulation setup had delays of ~150 ms between detection and manipulation, which may be too slow for certain applications. In our hands, delays were still short enough to allow the targeting of activity-triggered calcium dynamics by the Cal-Light system1,50. Using a solution like DLStream, the range of detectable behaviors would increase substantially, and applications for action- and posture-dependent labeling and subsequent manipulation of different freely moving species are wide-ranging. Additionally, optimizing the setup might allow faster feedback times, as our hardware limited the effective use of the underlying software performance of DLStream.

Comparative tests between our available computer configurations suggest that GPU power is responsible for the major performance gains in real-time tracking with DLStream. CPU power is also important, since several parallel processes need to be maintained during complex experimental protocols and the processing of pose estimation. DLStream is able to analyze new frames as soon as the current frame is fully processed; therefore, a higher framerate does not slow down DLStream but rather enables it to work at its upper speed limit (Supplementary Table 1). At this stage, the full utilization of higher framerates will heavily depend on the hardware configuration and the experimenter’s resolution requirements. From a pure performance perspective, the use of faster neural network architectures (e.g., MobileNetv235) trained within the DLC framework already increases the available framerate by a factor of four (30–130 fps, Supplementary Table 1), which is in line with the recent large-scale benchmark tests run by DLC51,52 and other publications11.

DLStream is compatible with old and new versions of DLC. Although originally developed for DLC 1.11 (Nature Neuroscience Version19), we have successfully tested the newest DLC version (DLC 2.x19) without encountering problems. Networks trained on either version can be fully integrated into DLStream and used as needed. Additionally, DLStream is in principle able to support positional information from other pose estimation networks20,21 but would currently require some customization by the user as these networks have different input/output formats that would need to be adapted to the current workflow. An experimental implementation of additional pose estimation sources can be found on our GitHub page (https://github.com/SchwarzNeuroconLab/DeepLabStream). This includes the implementation of models exported by DeepPoseKit21 (LEAP20, StackedHourglass53, StackedDenseNet21, DLC) and DLC-Live51 (DLC) as well as multiple animal DLC (maDLC). However, the performance speed of such network implementations needs to be evaluated and compared to established pose estimation speed. Recent developments by DLC regarding online pose estimation reported real-time network performances for architectures used by DLC51.

With the recent advances in markerless, multiple animal tracking (e.g., maDeepLabCut, SLEAP20,54, id-tracker55), an adaptation of DLStream to include multiple animal-based triggers would further enhance its versatility. In theory, such an adaptation should be similar to using multiple body parts. The challenge will most likely be the precise definition of social triggers and the design of relevant experiments using closed-loop stimulation. We briefly tested closed-loop multiple animal tracking using a pair of differently colored mice (standard DLC), as well as a maDLC-trained network on mice with same-colored fur, and were able to confirm that DLStream can utilize the pose estimation output of both. However, full verification of this implementation within an experiment is yet to be done.

Notably, DLStream could also be upgraded to use 3D posture detection as for example implemented recently by EthoLoop10. To achieve this, two reasonable approaches exist that allow 3D tracking of animals based on video analysis. A DLC native approach would be the use of multiple camera angles to triangulate the animal’s position (see ref. 39 for further information). An alternative approach would be the use of depth cameras to estimate the distance of an animal to the camera and thereby generate a 3D representation.

DLStream is a highly versatile, closed-loop software solution for freely moving animals. While we show its applicability in posture-dependent learning tasks and optogenetic stimulation using mice, we see no obvious limitations to the applicability of DLStream on different organisms and other experimental paradigms.

Methods

Mice

C57BL/6 mice were purchased from Charles River (Sulzfeld, Germany) and maintained on a 12-h light/12-h dark cycle with food and water always available. All experiments were carried out in accordance with the German animal protection law (TierSchG) and FELASA guidelines and were approved by the animal welfare committee of the University of Bonn.

AAV production

AAV pseudo-typed vectors (virions containing a 1:1 ratio of AAV1 and AAV2 capsid proteins with AAV2 ITRs) were generated as described57,58. Briefly, human embryonic kidney 293 (HEK293) cells were transfected with the AAV cis plasmid, and the helper plasmids by standard calcium phosphate transfection. Forty-eight hours after transfection the cells were harvested and the virus purified using heparin affinity columns (Sigma, St. Louis, MO)59. Purification and integrity of the viral capsid proteins (VP1-3) were monitored on a Coomassie-stained SDS/protein gel. The genomic titers were determined using the ABI 7700 real-time PCR cycler (Applied Biosystems) with primers designed to WPRE.

Surgical procedure

Viral injections were performed under aseptic conditions in 2-month-old C57BL/6 mice. Mice were initially anesthetized with an oxygen/isoflurane mixture (2–2.5% in 95% O2), fixed on the stereotactic frame, and kept under a constant stream of isoflurane (1.5–2% in 95% O2) to maintain anesthesia. Analgesia (0.05 mg/kg of buprenorphine; Buprenovet, Bayer, Germany) was administered intraperitoneally prior to the surgery, and Xylocaine (AstraZeneca, Germany) was used for local anesthesia. Stereotactic injections and implantations of light fiber ferrules were performed using a stereotactic frame (WPI Benchmark/Kopf) and a microprocessor-controlled minipump (World Precision Instruments, Sarasota, Florida). The viral solution (1:1:2; AAV-TRE-EGFP, Addgene #89875; AAV-M13-TEV-C-P2A-TdTomato, Addgene #92391; AAV-TM-CaM-NES-TEV-N-AsLOV2-TEVseq-tTA, Addgene plasmid #92392) was injected unilaterally into the ADN. Viruses were produced as previously described. To reduce swelling, animals were given Dexamethasone (0.2 mg/kg). For implantation, the skin on the top of the scalp was removed and the skull cleared of soft tissue. Light fiber ferrules were implanted and fixed using a socket of dental cement. Loose skin around the socket was fixed to the socket using tissue glue (3M Vetbond). Directly after the surgery, animals were administered 1 ml of 5% Glucosteril solution. To prevent wound pain, analgesia was administered on the following three days. Animals were left to rest for at least 1 week before handling started. Experiments were conducted 3 weeks after surgery.

Perfusion

Mice were anesthetized with a mixture of Xylazine (10 mg/kg; Bayer Vital, Germany) and ketamine (100 mg/kg; Bela-pharm GmbH & Co. KG, Germany). Using a peristaltic pump (Laborschlauchpumpe PLP33, Mercateo, Germany), the mice were transcardially perfused with 1× PBS followed by 4% paraformaldehyde (PFA) in PBS. Brains were removed from the skull and post-fixed in 4% PFA overnight (ON) at +4 °C. After fixation, the brains were moved into PBS containing 0.01% sodium azide and stored at +4 °C until sectioning. Fixed brains were sectioned coronally (70 or 100 μm) using a vibratome (Leica VT1000 S) and stored in PBS containing 0.01% sodium azide at +4 °C.

Conditioning task

Mice were placed in an open field arena (70 × 70 cm). Each session lasted 1 h or a maximum of 40 trials. A session consisted of a random sequence of trials. Additionally, if an animal successfully finished 20 positive trials, the session was ended. A trial was initiated when the animal was facing the screen. Each trial lasted 20 s with an inter-trial interval of 30 s. At the beginning of each trial, a visual stimulus was shown on the screen for 10 s. In the positive trial, a reward was delivered at the end of the visual stimulus and withdrawn if not collected within 7 s. In the negative trial, a loud tone (100 dB) was delivered and no reward was given. After at least five sessions, animals that learned the association successfully (>85% success rate in positive trials) were transferred to the next stage. We did not evaluate the success rate of negative trials, since the aversive stimulus was delivered regardless of the animal’s behavior.

The visual stimulus was a high-contrast black-and-white image of an X or + spanning the whole screen. The screen was the same size as the arena wall against which it was placed.

Second-order conditioning task

Animals were placed in the open-field arena. Two Petri dishes filled with fresh bedding were placed on the wall facing the screen. Two odorants (10 µl on filter paper) were placed in one of the Petri dishes each. A pair of an odorant and visual stimulus (negative or positive) was chosen and kept throughout the experiments. Upon exploration of an odor location, the animal was shown the corresponding visual stimulus. The session was completed after the animal explored both odors for at least 10 individual times or after 10 min. Conditioning was conducted in two stages. Both stages were repeated with switched odor positions, resulting in a total of four repetitions per animal. The first stage consisted of direct contact with the odor location, while the second was dependent only on the proximity of the animal to a location and the animal facing towards it.

Preference task

The mouse was placed in a different open-field arena (70 × 40 cm) with one odor in each of the quarters. In addition to the conditioned odors, two neutral odors were presented. The mouse was given 10 min twice to explore the arena with an inter-trial time of 10 min in between. Total investigation time was measured with circular ROIs, corresponding to the odor location, above each petri dish. Trials in which the mice did not investigate any odor source were excluded (1 trial out of 12; n = 6 mice).

Head direction-dependent optogenetic stimulation

Mice were put in a cylindrical white arena with a single cue (a black vertical bar). The arena was enclosed by a black curtain. A random point was chosen to act as a reference for head direction (0°). The reference point was kept constant between experimental sessions and animals but was not visible to the animal. To habituate the animal to the arena, the animal was put into the arena for 30 min for 2 days and reward pellets were placed randomly inside the arena at the 0, 10, and 20 min mark.

Experimental group: During the experiment, light stimulation (488 nm, 15 mW; Laser OBIS LX/LS, controlled by OBIS LX/LS Single Laser Remote, Coherent Inc., Santa Clara, CA, USA) was initiated whenever the animal’s head direction was within a 60° window around the reference point. Stimulation lasted 1 s or as long as the head direction was maintained in the window up to a maximum of 5 s. After each stimulation, further stimulation was discontinued for at least 15 s to avoid overheating of brain tissue and in line with the originally published Cal-Light experiments1. The animal was allowed to investigate the arena over four consecutive days for 30 min sessions each day during which the animal was stimulated. Animals were perfused 1 day after the last session.

Yoked group: In the yoked control group, animals were paired beforehand with an animal from the experimental group. Each control animal received the exact same temporal stimulus as its paired experimental animal, decoupled from its own head direction. In all other aspects, animals were treated and run through the experiment in the same way as the experimental group.

No light group: In the no-light control group, animals ran the experiment as all other groups but received no light stimulation.

Head direction analysis

The analysis was performed using custom Python scripts. To determine whether light stimulation was precisely targeted to a particular window of angles, we calculated the mean resultant vector length for the distribution of stimulated angles, which measures the concentration of angles in a distribution. Lengths vary from 0 (the underlying distribution is uniform) to 1 (all angles in the underlying distribution are exactly the same). Thus, for stimulated angles, we expect non-zero lengths close to 1. It is possible that the distribution of stimulated angles could be determined simply by a bias in the animals’ behavior (i.e., the animal by chance always faces the direction we have chosen as the target window). To test against this possibility, we generated null distributions by randomly sampling angles from the full distribution of angles explored by the animal. The number of samples was set to equal the number of stimulation angles. Angles were randomly sampled in this way 1000 times, and each time a mean resultant vector length was calculated. The null distribution comprised these 1000 means (note that null distributions were centered near 0). For each session, the mean resultant vector length was well above the 99% cutoff of the null distribution, indicating that our stimulation angle precision was a result of accurate posture detection rather than a bias in animal behavior.
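A minimal sketch of this analysis, assuming angles are provided in degrees, is given below. It is an illustrative reimplementation consistent with the description above rather than the original custom scripts; for example, the sampling here is done without replacement, and the variable names are placeholders.

```python
import numpy as np


def mean_resultant_length(angles_deg):
    """Mean resultant vector length of a set of angles (0 = uniform, 1 = identical)."""
    a = np.radians(np.asarray(angles_deg, dtype=float))
    return np.hypot(np.cos(a).mean(), np.sin(a).mean())


def null_distribution(session_angles_deg, n_stimulated, n_samples=1000, seed=0):
    """Sample n_stimulated angles from the full session distribution n_samples times
    and return the resulting mean resultant vector lengths (the null distribution)."""
    rng = np.random.default_rng(seed)
    session_angles_deg = np.asarray(session_angles_deg, dtype=float)
    return np.array([
        mean_resultant_length(rng.choice(session_angles_deg, size=n_stimulated, replace=False))
        for _ in range(n_samples)
    ])


# Usage: the observed length of the stimulated angles should exceed the 99th percentile
# observed = mean_resultant_length(stimulated_angles_deg)
# cutoff = np.percentile(null_distribution(session_angles_deg, len(stimulated_angles_deg)), 99)
```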

Imaging of brain sections

Brain sections were DAPI labeled (0.2 µg/ml) and overview images were acquired using a widefield microscope (Zeiss AxioScan.Z1). Based on the overall expression and fiber placement, selected sections were additionally imaged with a spinning disk microscope (VisiScope CSU-W1). Acquired z-stacks were used for quantification using FIJI60. Selection criteria for the quantification of Cal-Light labeling included the correct placement of the fiber ferrule above the target region as well as correct injection placement (Supplementary Fig. 2). Mice that did not match the criteria were only included in the evaluation and quantification of DLStream performance.

Experimental setup

The corresponding arenas were placed in a closable compartment, isolated from external light sources. A light source was placed next to the setup so that the arena was evenly lit. The camera was placed directly above the arena. During experiments, the compartment was closed to minimize any disruptive influences from outside. All devices were triggered using an NI 6341 data-acquisition board (National Instruments Germany GmbH, Munich) in combination with the Python nidaqmx library, connected via USB 3.0 to a PC (Intel Core i7-9700K @ 3.60 GHz, 64 GB DDR4 RAM, and NVidia GeForce RTX 2080 Ti (12 GB) GPU). For all experiments, we used the Intel RealSense Depth Camera D435 (Intel Corp., Santa Clara, CA, USA) at 848 × 480 px and 30 Hz to enable reliable streaming at all times. Although the camera is capable of 60 Hz and higher resolution, we found that these settings gave a reliable framerate and the optional addition of depth data.
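Device triggering through the nidaqmx library boils down to writing a digital output line on the DAQ board. The snippet below is a generic sketch, not the code used in the experiments; the device and channel name ("Dev1/port0/line0") is a placeholder that depends on the local NI-DAQmx configuration.

```python
import nidaqmx  # Python bindings for NI-DAQmx (requires the NI-DAQmx driver)


def set_digital_line(channel="Dev1/port0/line0", state=True):
    """Set a single digital output line on the DAQ board to HIGH (True) or LOW (False).
    The channel name is a placeholder for the locally configured device/port/line."""
    with nidaqmx.Task() as task:
        task.do_channels.add_do_chan(channel)
        task.write(state)


# Example: switch a connected device (e.g., a laser trigger input) on, then off
# set_digital_line(state=True)
# set_digital_line(state=False)
```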

We have successfully installed and tested DLStream on Windows 10 and Ubuntu 18.04.05 OS. DLStream was developed in the open-source programming language Python. Python includes open-source libraries for most available devices or desired functions, which allows DLStream to utilize and control a wide range of devices. Virtually any webcam/camera can be used with any framerate and resolution considering hardware requirements and limitations.

Hardware latency and detection accuracy during optogenetic stimulation

The latency between posture detection and optogenetic stimulation was estimated by manually annotating videos of sessions from three different mice. For this, the recorded video was analyzed frame-by-frame, and the frames between the event start (posture detection leading to stimulation), taken from the table-based output file, and the visible onset of the laser in the video were counted. To evaluate the false-positive detection rate during experiments, we manually annotated all stimulation events during the above sessions. A detection was counted as false-positive when the annotator judged the posture of the animal (head direction) to be outside the head direction window at the exact time of detection. Note that the accuracy of the pose estimation network is a major source of false detections; however, inaccurate event definitions can also lead to unintended stimulation events. Additional training of the network can increase the accuracy of the triggered stimulation.

Reward delivery and acoustic stimulation

The liquid reward was delivered via a custom-built reward delivery system using a peristaltic pump (Takasago Electric, Inc.). A nozzle connected to the pump was placed in the center of the northern arena wall (where the screen was located). The animal was briefly habituated to the reward during handling before continuing with habituation to the delivery system. For this, mice were first habituated to the arena and then received pretraining for reward consumption for 3 days, during which they were presented with a reward at random time points. The liquid reward consisted of diluted sweetened condensed milk (1:10 with Aqua dest.) and was delivered in a volume of ca. 4–6 µl. If not collected, the reward was withdrawn again. The aversive tone (ca. 100 dB) was delivered via a custom-built piezo alarm tone generator. The device was placed above the arena.

Pose estimation using DLC

In all experiments, we used 3-point tracking to estimate the position, direction, and angle of the animal, using the head, neck, and tail root as body parts of interest. Networks were trained using the DLC 1.11 framework. First, 300 images of relevant behavior in the corresponding arena were annotated, and 95% were used for the DLC network training sets. Note that, with such a small number of test images (5%, i.e., 15 images), further evaluation of the trained network might be required in some cases to guarantee sufficient accuracy and generalization. Second, we used a ResNet50-based neural network33,34 with default parameters for 500k training iterations and evaluated network performance. For each experiment type, a different network was trained using the same approach.

For benchmarking DLStream’s upper-performance limits, we used 300 labeled images with relevant behavior (95% training set), labeling either 9 or 13 body parts. The same training set was used to train several neural networks based on different architectures or depths (ResNet50, ResNet10133,34, MobileNetv235) available through the DLC 2 framework, with default parameters for 500k training iterations. After training, the networks were benchmarked within DLStream using a DLStream function (python deeplabstream.py --dlc-enabled --benchmark-enabled) with 3000 consecutive frames. Data were collected, and the average framerate, as well as the standard deviation, was calculated for 4 different image resolutions (1280 × 1024, 640 × 512, 416 × 341, 320 × 256 px) available to the Basler acA1300-200um camera (Basler AG, Germany), which acquired frames at a rate of 172 Hz.

Posture detection in DLStream

We extracted the raw score maps from the deep neural network analysis and used them for posture detection. First, body part estimation, similar to the DLC approach, was conducted by local maxima detection using custom image analysis scripts. The resulting pose estimation was then converted into postures. For this, each possible combination of body parts was investigated and filtered using a closest-distance approach. DLStream detects estimated postures and compares them to the relevant trigger modules for closed-loop control of experiments. To evaluate our own DLC-trained networks, we measured the pose estimation error and compared it to a human-labeled data set (labeled by a single human annotator). For this, we extracted a new image set from our optogenetic experiment sessions (n = 597) and measured the average difference (Euclidean distance) between human annotation and pose estimation in position, as well as in the resulting head direction angle. Additionally, we calculated the false-positive/false-negative rate of hypothetical head direction triggers with differently sized angle windows (60°, 50°, 40°, 30°, 20°, and 10°). To counter any non-uniform distribution of head direction angles, we averaged the rates for multiple ranges per bin (e.g., 0–60°, 60–120°, 120–180°) and calculated the standard deviation. See Supplementary Fig. 3 for details.
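The body-part estimation step can be illustrated with a simplified version of peak detection on score maps. The sketch below takes only the global maximum of each body part’s score map and scales it back to image coordinates; it omits the location-refinement offsets that DLC-style networks additionally predict, and the stride and confidence threshold are illustrative assumptions.

```python
import numpy as np


def body_parts_from_scoremaps(scoremaps, stride=8.0, min_confidence=0.5):
    """Simplified body-part estimation from network score maps.
    scoremaps: array of shape (height, width, n_bodyparts), values in [0, 1].
    Returns a list of (x, y) image coordinates, or None where confidence is too low."""
    positions = []
    for k in range(scoremaps.shape[-1]):
        score_map = scoremaps[..., k]
        y, x = np.unravel_index(np.argmax(score_map), score_map.shape)
        if score_map[y, x] >= min_confidence:
            # scale the score-map peak back to (approximate) image pixel coordinates
            positions.append((x * stride, y * stride))
        else:
            positions.append(None)
    return positions
```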

DLStream output and adaptability

DLStream stores posture detection and experiment information in a table-based file (see also Supplementary Data 3) that can be opened by any standard software. The file is indexed by the frame ID from the camera stream and provides information on the estimated position of all tracked body parts, the status of the experiment (whether it is active or not), and a “trial” column that is used to give event/trial-specific information during experiments (e.g., negative or positive trial during conditioning, or stimulation active/not active during optogenetic experiments). The table also includes a “time” column where experimenters can see the exact inference time between frames and the actual time that passed during the experiment.
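Because the output is a plain table, it can be loaded directly for downstream analysis. The following is a hypothetical sketch; the file name and exact column labels are placeholders and will depend on the individual session, but the layout follows the description above (frame index, body-part coordinates, experiment status, “trial”, and “time”).

```python
import pandas as pd

# Placeholder file name; a real session file is produced by DLStream after each experiment
df = pd.read_csv("session_output.csv", index_col=0)  # indexed by camera frame ID

# Example: inspect the logged experiment status and trial information over time
# (column labels here are illustrative and may differ in the actual output file)
print(df[["Experiment", "Trial", "Time"]].head())
```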

DLStream experiments are not limited to the body parts used in our experiments and can utilize any combination of pose estimated body parts. DLStream’s posture detection is stored as a “skeleton” (a set of named body parts) which is directly taken from the DLC network. Each body part or a set of body parts can be selected for the design of user-defined experiments.

DLStream users are not limited to the triggers and experiments used for the experiments in this paper but can either use provided modules or design their own modules with the help of our in-depth tutorials (https://github.com/SchwarzNeuroconLab/DeepLabStream). Currently available triggers include speed (Supplementary Movie 1), head direction, and ROI-based detection.

For a further step-by-step explanation, we included a guide on our GitHub page.

Statistics and reproducibility

Paired t-tests were used for statistical comparisons of data. All data presented in the text are shown as the mean ± standard deviation. Uncorrected alpha (desired significance level) was set to 0.05 (* <0.05, ** <0.01, *** <0.001). Sample sizes and numbers are indicated in detail in each figure caption and main text. Exclusion criteria, if applied, are specified in each corresponding method section.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.