Introduction

Maternal behavior during early life in mammals ensures offspring survival by supporting the physical needs of offspring, including the transfer of nutrients and warmth to young and protection from predators and the environment. Beyond ensuring survival, mother–offspring interactions also guide the emotional, social, and cognitive development of offspring. Maternal behaviors provide stimulation to offspring through vestibular, auditory, tactile, and visual modalities during sensitive periods of elevated plasticity early in life. This sensory input directs the development of pup physiology and has lasting effects on offspring brain and behavior across the lifespan1,2,3. Intergenerational patterns of maternal behavior are observed in several mammalian species; thus, the effects of these early social experiences can be perpetuated across generations4.

Factors in the broader ecological environment, such as chemical exposure, distal stressors, resource availability, and social context, can affect offspring development in early life through alterations to maternal offspring-directed behavior5,6. Maternal behavior can moderate environmental effects on offspring, buffering offspring from environmental insults or increasing susceptibility when maternal care is disrupted. Maternal effects signal to the infant the conditions of the distal environment and may modify offspring physiology to prepare the offspring for the environment it will encounter later in life7,8. Early environmental exposures are also potential contributors to vulnerability for neurodevelopmental disorders, which can be moderated by maternal–offspring interactions9.

The extensive, enduring effects of mother–offspring interactions, and the potential for these interactions to be disrupted by external influences, make maternal behavior an important measure for developmental studies. Laboratory rodent models are a preferred choice for studying the impact of early life exposures on offspring development because environmental conditions can be precisely controlled, rodents have relatively short lifespans, the hormonal and neural mechanisms governing maternal behavior overlap with those of humans, and the molecular and physiological outcomes of early life exposures can be examined in ways not possible or ethical in human research10,11,12,13,14. As in humans, rodent maternal behavior is influenced by the external environment and by offspring physiological and behavioral cues15,16,17. Measures of rodent pup-directed maternal behaviors typically include nest attendance, anogenital and body licking and grooming of pups, nursing (sometimes further specified into blanket, low-arched back, high-arched back, and passive nursing), nest building, and retrieval of pups when they are displaced from the nest18. Natural variations in home-cage maternal behaviors in Long-Evans rats are well characterized, and variation in these maternal behaviors leads to shifts in offspring developmental trajectories19,20,21,22,23,24.

Scoring of home-cage maternal behavior live or from video recordings provides detailed information on the dam-pup interactions that guide pup development25. However, between-study methodological variation in the acquisition and coding of these data limits comparisons between studies. Time-sampling methods, in which point observations of maternal behavior are made throughout the day, permit analyses of behavior across the circadian cycle, but this approach has poor temporal resolution relative to focal observations and prevents accurate quantification of behavior durations26,27. Frequency, duration, and bout length of maternal behaviors are increasingly used in analyses, particularly measures of “entropy” that capture the consistency of dam behavior patterns28. However, these measurements are notoriously time-intensive to score from videos and require substantial training in coding maternal behavior. To address these issues, more sophisticated tools are needed to quantify maternal behavior over longer periods of time.

Recent advances in machine learning tools for behavior analysis make it possible to build standardized behavior pipelines that produce high-quality, highly reproducible measurements from video data29. Using open-source tools, we developed the Automated Maternal Behavior during Early life in Rodents (AMBER) pipeline for scoring home-cage maternal behavior. AMBER uses DeepLabCut30,31 to extract pose estimation data for rat dams and pups from home-cage recordings within standard cages. Side-view recordings permit tracking of key body points on dams and pups, unlike top-down recordings in which the dam occludes pups during nest attendance. AMBER then uses Simple Behavior Analysis32 (SimBA) to run behavior classifiers for seven maternal behaviors: nest attendance, licking and grooming, active nursing, passive nursing, self-directed grooming, dam eating, and dam drinking. We applied the AMBER pipeline to a set of 242 videos from postnatal day (P) 1–10 to evaluate maternal phenotype. All scripts and models used in the pipeline are publicly available (https://github.com/lapphe/AMBER-pipeline; https://osf.io/e3dyc/).

Results

Video recording

Home cage behavior was recorded from Long-Evans dams and litters on postnatal days (P) 1–14 with Raspberry Pi 3B+ minicomputers equipped with Raspberry Pi Module 1 NoIR cameras. One-hour and 24-h recordings captured video during the light and dark phases under infrared LED lights.

Dam pose estimation model

Pose estimation for dams was achieved using single-animal DeepLabCut30. A total of 4,710 frames were extracted from 255 videos. Thirty-two dam body points were labeled, providing adequate coverage of the dam regardless of partial body occlusion or body orientation relative to the camera (Fig. 1C). Two percent of labeled frames were reserved as a test set for model evaluation and the remaining 98% were included in the training set. The dam model was trained for 650,000 iterations with ResNet-50 and a batch size of 8, saving a snapshot every 50,000 iterations. Model performance was evaluated at all snapshots, and loss, a quantification of the error between predictions and true values (user labels) during training, was calculated every 1,000 iterations (Fig. 2A). Snapshot 10 (550,000 iterations) had the best performance (comparing model-predicted key point locations to user annotation locations), with an average error of 14.39 pixels (4.1–10 mm depending on animal depth in frame) on test frames and 6.36 pixels on training frames after filtering to points above the 0.5 likelihood cutoff threshold (Fig. 2B). Visual inspection of labeled held-out videos confirmed model performance.
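
This kind of training run uses the standard DeepLabCut Python API; below is a minimal sketch of the calls involved, assuming a placeholder config path, with iteration and snapshot values taken from the text above. It is illustrative rather than the authors' exact script.

```python
import deeplabcut

# Placeholder path to the DeepLabCut project config for the dam model
config_path = "path/to/dam_project/config.yaml"

# Build the training dataset from the labeled frames (the 98%/2% train/test
# split is controlled by the TrainingFraction field in config.yaml)
deeplabcut.create_training_dataset(config_path, net_type="resnet_50")

# Train for 650,000 iterations, saving a snapshot every 50,000 iterations;
# loss is reported every 1,000 iterations (batch size is set in pose_cfg.yaml)
deeplabcut.train_network(
    config_path,
    maxiters=650_000,
    saveiters=50_000,
    displayiters=1_000,
)

# Evaluate on train/test frames; with snapshotindex set to "all" in config.yaml,
# every saved snapshot is evaluated, as was done to select snapshot 10
deeplabcut.evaluate_network(config_path, plotting=False)
```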

Figure 1

(A) AMBER pipeline overview. Videos can be recorded with any device that provides sufficient resolution to visualize the dam and pups clearly. Videos can optionally be preprocessed to convert them to greyscale, reduce their resolution, or crop them to reduce downstream computational processing time. After video recording, pose estimation is performed with trained networks that detect dam and pup key points using single-animal DeepLabCut and multi-animal DeepLabCut, respectively. Next, coordinate pose estimation data for dams and pups are joined in one csv file. SimBA behavioral classifiers trained to detect seven maternal behaviors are run on the pose estimation data to produce frame-level behavior annotations. (B) Recording setup. Raspberry Pi cameras are pictured set up at the nest end of the cage to capture side-view recordings. (C) Dam and pup pose estimation key points. The dam pose estimation model is trained to detect 32 key points on dams. The pup multi-animal model is trained to identify nine key points on each visible pup.

Figure 2

(A) Dam pose estimation and (C) Pup pose estimation model training statistics. Loss, a measure of model performance during training, decreased with each training iteration. (B) Dam and (D) Pup pose estimation model snapshot evaluations. Average root mean square error (RMSE) in pixels across all key points is plotted for training and test sets with and without a probability cutoff of 0.5 at each snapshot during model training.

Pup pose estimation model

The pup pose estimation model was developed using multi-animal DeepLabCut31. A total of 1,712 frames from 238 videos were extracted using k-means clustering and uniform sampling. Nine pup points from the nose to tail base were labeled on each visible pup (Fig. 1C). Two percent of labeled frames were reserved for model evaluation and the remaining 98% were included in the training set. The pup model was trained for 200,000 iterations with DLCRNet_ms5 and a batch size of 8, saving a model snapshot every 5,000 iterations. Model loss was calculated every 1,000 iterations (Fig. 2C). Pup pose estimation model performance was evaluated at all snapshots (Fig. 2D). The final snapshot had the best performance, with an average error of 4.67 pixels on training frames and 11.81 pixels (3.44–7.87 mm depending on the proximity of the pup to the camera) on test frames after filtering to points above the 0.5 likelihood cutoff threshold. Root mean square error (RMSE) for individual pups and for each body point was similar (Supplemental Fig. S1). Visual inspection of labeled hold-out videos confirmed accurate tracking of individual pup body parts. While pups remained in the nest the majority of the time, pups outside the nest were also tracked, provided a sufficient portion of their body was visible in the frame.

Pose estimation data postprocessing

Pups are often huddled together in the nest in the home cage, with many pups fully or partially occluded by other pups, the dam, or nesting material at any given time. This presents a challenge for assigning pup body part detections to individual pup identities during individual assembly (tracklet creation and tracklet stitching) across frames in the second half of the multi-animal DeepLabCut workflow. Although pup detections achieved good performance, we observed substantial loss of pup points after individual assembly, where only pups with a majority of tracked body points visible had reliable tracking. A primary goal of tracking pups in this pipeline is to identify the nest location, so the litter can be treated as a unit rather than as individual pups. Therefore, pup pose estimation used all pup key point detections obtained at the midpoint of the multi-animal workflow (by running the DeepLabCut function analyze_videos with auto_track = False) rather than the individual pup tracks obtained at the end of the workflow. This ensured that all detected key point coordinates were kept for the entire litter, with the drawback of not knowing which points belong to specific pups. A custom Python script (pheno_pickle_raw.py) was used to convert the pup detections pickle file to a csv file containing pup key point detections. Key points assigned to individual pups after conversion to csv therefore do not necessarily belong to the assigned pup.
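
A minimal sketch of this midpoint extraction step is shown below; the config path and video directory are placeholders, and only the analyze_videos call reflects the step described above.

```python
import deeplabcut

pup_config = "path/to/pup_project/config.yaml"  # placeholder project config
videos = ["path/to/videos/"]                    # directory of home-cage recordings

# Stop the multi-animal workflow after key point detection: with
# auto_track=False, analyze_videos writes the raw detections
# (a *_full.pickle file per video) instead of stitched individual tracklets.
deeplabcut.analyze_videos(pup_config, videos, videotype=".mp4", auto_track=False)

# The AMBER script pheno_pickle_raw.py then converts each *_full.pickle file
# into a csv of pup key point detections for downstream use.
```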

Next, the unfiltered dam and pup pose estimation files were joined by frame number using a custom script (join_dam_pup.py). Column headers were reformatted to match the input format expected by SimBA.
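
Conceptually, the join is a column-wise concatenation of the two pose estimation tables aligned on frame number. The following simplified pandas sketch illustrates the idea; file names are placeholders, the exact header layout of the converted pup file is assumed, and the real join_dam_pup.py additionally rewrites headers into the format SimBA expects.

```python
import pandas as pd

# DeepLabCut csv output uses multi-row headers (scorer / bodyparts / coords);
# the frame number is the first column in both files.
dam = pd.read_csv("video1_dam.csv", header=[0, 1, 2], index_col=0)
pup = pd.read_csv("video1_pup_detections.csv", header=[0, 1, 2], index_col=0)

# Join dam and pup key points frame by frame (rows align on frame number)
combined = pd.concat([dam, pup], axis=1)
combined.to_csv("video1_dam_pup.csv")
```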

Behavior classifier project set up

Simple Behavior Analysis (SimBA) was used to generate seven behavior classifiers from the pose estimation data of a subset of videos following the standard workflow, except where noted below (https://github.com/sgoldenlab/simba)32. We used a single-animal SimBA project configuration with user-defined body points consisting of all dam and pup key points. The width of the wire cage top at its lowest point (approximately half the depth of the cage) was used to define pixels per mm during the Video Settings step (Supplemental Fig. S3). The outlier correction step of SimBA was skipped because it relies on body-length distances across frames, which are strongly affected by the dramatic differences in apparent body length when the dam is near the front versus the back of the cage. Instead, low-probability key points are accounted for during feature extraction by weighting calculations by key point probabilities.

Classifier feature extraction

SimBA behavior classifiers train on features derived from pose estimation data. Dam and pup features were extracted from the pose estimation data using a custom script to create dam-specific and pup-specific features (Supplemental Table S1). Features were derived from pup pose estimation data (19 features), dam pose estimation data (172 features), or both dam and pup data (27 features). Features can also be broken down into categories: dam location (e.g. y coordinate of dam centroid), dam areas, dam key point angles, dam key point probabilities, dam movement, pup area, pup probabilities, and dam-pup distances. Features also included summary statistics (mean, sum, standard deviation) across rolling windows of 0.1 s, 1 s, and 2 s. In addition, 30 min and 60 min rolling windows for the pup centroid were calculated and used for some dam-pup distance features to account for longer periods when pups are mostly or completely occluded (e.g. by bedding or the dam).
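
As an illustration, at 30 fps the 0.1 s, 1 s, and 2 s windows correspond to 3-, 30-, and 60-frame rolling windows. The following pandas sketch shows how such summary statistics can be computed per frame; the column and feature names are illustrative, not the exact names used in the AMBER extraction script.

```python
import pandas as pd

FPS = 30
WINDOWS_S = [0.1, 1, 2]  # rolling window lengths in seconds

def add_rolling_stats(df: pd.DataFrame, col: str) -> pd.DataFrame:
    """Append rolling mean, sum, and standard deviation of one per-frame feature."""
    for w in WINDOWS_S:
        frames = max(1, int(w * FPS))  # window length in frames
        roll = df[col].rolling(frames, min_periods=1)
        df[f"{col}_mean_{w}s"] = roll.mean()
        df[f"{col}_sum_{w}s"] = roll.sum()
        df[f"{col}_std_{w}s"] = roll.std()
    return df

# Example usage with an illustrative per-frame feature column:
# features = add_rolling_stats(features, "dam_pup_centroid_distance")
```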

Random forest behavior classifiers

Classifier training videos were carefully annotated for seven maternal behaviors using BORIS33 and then imported into SimBA. A total of 3,366,254 frames (31.1 h of recording at 30 fps) from 28 one-hour videos and 10 additional shorter clips (1.5–22 min each) were annotated for nest attendance, active nursing, passive nursing, licking and grooming, self-directed grooming, eating, and drinking. Behavior definitions are provided in Fig. 3 and a detailed ethogram of these behaviors can be found at https://github.com/lapphe/AMBER-pipeline16,17. Because of the large number of frame annotations, the set of frames used for training classifiers was reduced by taking every other frame from each video. Adjacent frames are likely to have similar features and behavior annotations, so this allowed a reduction in data set size while maintaining diversity of the training set within and across videos. Twenty percent of the remaining frames were used as a test set to evaluate model performance. The frequency of behaviors in the training and test sets ranged from 2.5% of frames (passive nursing) to 47.8% of frames (nest attendance). Random forest models were run in SimBA with the following hyperparameters: 100–1500 trees, minimum leaf node = 1 or 2, RF_criterion = gini, RF_max_features = sqrt, test size = 20%, and no sampling adjustment (Supplemental Table S2).
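
For readers who want to reproduce a comparable model outside SimBA, a minimal scikit-learn sketch using the hyperparameters reported above is shown below; the feature matrix X and per-frame labels y are assumed to already exist, and this is not the authors' training code.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

# X: per-frame feature matrix; y: per-frame 0/1 annotations for one behavior
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

clf = RandomForestClassifier(
    n_estimators=1500,    # 100-1500 trees were used across classifiers
    min_samples_leaf=1,   # minimum leaf node = 1 or 2
    criterion="gini",
    max_features="sqrt",
    n_jobs=-1,
)
clf.fit(X_train, y_train)

print(classification_report(y_test, clf.predict(X_test)))
```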

Figure 3

Brief behavior descriptions for each of the seven maternal behaviors annotated to train the SimBA behavioral classifiers. Additional ethogram information can be accessed at https://github.com/lapphe/AMBER-pipeline.

An additional four one-hour recordings (30 fps) were manually scored as a hold-out video validation set. Because no frames from the hold-out videos were used for model training, this set allows evaluation of model generalizability.

Behavioral classifier evaluation

Behavior classifiers were evaluated by calculating precision (the fraction of true positives among all frames scored as positive), recall (the fraction of true positives retrieved out of all true positives in the data set), and F1 scores (the harmonic mean of precision and recall) for all models (Supplemental Table S2). All behavior classifiers achieved good accuracy, at or above 0.886, on the test fraction of frames. Discrimination thresholds, the probabilities above which a behavior is classified as present, were determined using precision-recall curves and by visually inspecting behavior predictions in videos (Fig. 4; Supplemental Table S2). Classifier performance on the hold-out video set remained high for nest attendance (F1 = 0.990), active nursing (F1 = 0.828), and licking and grooming (F1 = 0.766). Performance of the self-directed grooming, eating, and drinking classifiers was substantially lower (F1 = 0.534–0.550). Lower performance for eating and drinking was partly due to overlap in false positives and false negatives between these two classifiers. Considering eating and drinking as a single behavior improved performance (eating or drinking precision = 0.71, recall = 0.69, F1 = 0.70). Passive nursing did not occur in the hold-out video set, precluding performance evaluation for that classifier.
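
These metrics and threshold curves can be computed directly with scikit-learn; the sketch below assumes held-out frame labels y_true and per-frame classifier probabilities y_prob (variable names are illustrative).

```python
import numpy as np
from sklearn.metrics import f1_score, precision_recall_curve, precision_score, recall_score

# y_true: manual 0/1 annotations for one behavior; y_prob: classifier probabilities per frame
threshold = 0.5  # candidate discrimination threshold
y_pred = (y_prob >= threshold).astype(int)

print("precision:", precision_score(y_true, y_pred))
print("recall:   ", recall_score(y_true, y_pred))
print("F1:       ", f1_score(y_true, y_pred))

# Precision-recall pairs across all candidate thresholds, used to guide the
# choice of discrimination threshold for each behavior classifier
precision, recall, thresholds = precision_recall_curve(y_true, y_prob)
f1 = 2 * precision * recall / (precision + recall + 1e-9)
best_threshold = thresholds[np.argmax(f1[:-1])]  # threshold with the highest F1
print("best F1 threshold:", best_threshold)
```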

Figure 4

Accuracy-discrimination threshold curves for SimBA classifiers. Discrimination thresholds are cutoff values for determining whether a behavior is present. Precision (accounts for false positives), recall (accounts for false negatives), and F1 scores (harmonic mean of precision and recall) at different discrimination thresholds are shown for all behavior classifiers.

Pipeline validation for maternal phenotype

A set of 242 one-hour home cage recordings taken on P1–10 from 49 dams was analyzed with the AMBER pipeline workflow as shown in Fig. 1A. Thresholds used for the behavior classifiers are noted in Supplemental Table S2. Total duration, percent time, bout number, and mean bout duration were calculated in SimBA for each behavior in each video (Fig. 5 and Supplemental Fig. S5). Changes in behavior durations over time were analyzed using linear mixed models in R with the lmerTest package, with litter ID included as a random effect36. Nest attendance (β = − 179.61, t = − 10.12, p < 0.001), licking and grooming (β = − 18.37, t = − 4.11, p = 0.01), active nursing (β = − 120.29, t = − 7.93, p < 0.001), and passive nursing (β = − 2.15, t = − 2.17, p = 0.03) significantly decreased with litter age. Drinking (β = 33.64, t = 7.15, p < 0.001) and eating (β = 18.68, t = 3.05, p < 0.01) significantly increased with litter age, and self-directed grooming (β = 9.649, t = 1.42, p = 0.15) did not significantly change with litter age (Fig. 5). The mean percent time engaged in each behavior across all videos was 39.3% for nest attendance (min = 0.3%, max = 100%, median = 32.7%), 10.1% for licking and grooming (min = 0, max = 32.2%, median = 9.4%), 21.6% for active nursing (min = 0, max = 98.4%, median = 16.59%), 0.2% for passive nursing (min = 0, max = 11.1%, median < 1%), 16.85% for self-directed grooming (min = 0, max = 66.9%, median = 15.6%), 9.5% for eating (min = 0, max = 59.7%, median = 7.1%), and 7.0% for drinking (min = 0, max = 43.2%, median = 5.4%).
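
The reported mixed models were fit with lmerTest in R; for readers working in Python, an approximately equivalent random-intercept model can be sketched with statsmodels. Column names below are illustrative and this is not the authors' analysis code (lmerTest additionally supplies Satterthwaite-based p-values, which statsmodels does not).

```python
import pandas as pd
import statsmodels.formula.api as smf

# df columns (illustrative names): per-video behavior duration, litter postnatal
# day, and litter ID as the random-effect grouping factor
# df = pd.read_csv("amber_behavior_durations.csv")

model = smf.mixedlm("duration ~ postnatal_day", data=df, groups=df["litter_id"])
result = model.fit()
print(result.summary())  # the fixed-effect slope corresponds to the reported beta and t values
```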

Figure 5

AMBER-scored maternal behavior measures. (A) Total behavior duration over postnatal days. Total duration of nest attendance, licking and grooming, and active nursing during the one-hour recordings decreased as litter age increased. Eating and drinking increased as litter age increased. (B) Distribution of percent time engaged in each behavior for each video. Percent time dams spent on the nest, licking and grooming, and engaging in self-directed grooming was relatively normally distributed.

AMBER pipeline deployment

Although AMBER relies on the capabilities of the DeepLabCut and SimBA software, it deviates significantly from their standard workflows and involves additional steps. To improve the user experience and reduce the barrier to entry for inexperienced programmers, we provide materials that simplify the workflow and reduce user burden for large video sets (Fig. 1A). First, provided that DeepLabCut is installed, all pose estimation steps can be performed automatically with a single command-line call to the AMBER_pose_estimation.py runner script (e.g. python AMBER_pose_estimation.py path/to/videos). This program performs dam and pup pose estimation, converts pup detections, combines dam and pup data, and formats the data for all videos in the indicated video directory. Alternatively, users can perform the same steps or modify the code using the provided Jupyter notebook. Next, the pre-configured AMBER SimBA project can be used to perform the behavior classification steps. Instructions for implementing the AMBER pipeline are available at: https://github.com/lapphe/AMBER-pipeline.

Post-hoc explainability metrics for behavior classifiers

While not part of the AMBER pipeline, explainability metrics offer interpretable descriptions of how model decisions are made from feature values34. Feature importance permutations estimate the information lost when a feature is replaced with randomly shuffled values drawn from the same distribution as the original feature data. Feature importance permutations were calculated for each behavior classifier with the eli5 Python library in SimBA. The relative importance of each feature within each model was determined by ranking features from most important (rank 1) to least important (rank 218) based on feature importance score, allowing between-model comparison. The average rank for each feature category was calculated for heatmap visualization (Fig. 6). Dam location features were the most important feature category (lowest average rank) for active nursing, drinking, licking and grooming, and passive nursing; dam-pup distance features were most important for nest attendance; and dam key point distances were most important for self-directed grooming. Dam movement was the least important feature category on average (highest rank) for all models except nest attendance.
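
Permutation importance of this kind can also be reproduced outside SimBA. The sketch below uses scikit-learn's permutation_importance as an illustrative equivalent of the eli5-based routine SimBA runs internally; clf, X_test, y_test, and feature_names are assumed to exist.

```python
import numpy as np
from sklearn.inspection import permutation_importance

# clf: a trained behavior classifier; X_test, y_test: held-out frames and labels
result = permutation_importance(clf, X_test, y_test, n_repeats=10, random_state=0, n_jobs=-1)

# Rank features from most important (rank 1) to least important (rank 218)
order = np.argsort(result.importances_mean)[::-1]
feature_ranks = {feature_names[i]: rank + 1 for rank, i in enumerate(order)}
```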

Figure 6

Relative importance of feature categories as determined by feature importance permutations. All features were ranked from most important (rank 1) to least important (rank 218) based on the mean feature importance score for each behavior classifier. Tile color corresponds to the average rank of all features belonging to each category. Yellow tiles indicate more important feature categories (rank closer to 1) and dark blue tiles indicate less important feature categories (rank closer to 218).

Shapley Additive Explanations (SHAP) is another explainability metric; it uses a game-theoretic approach that can be applied to tree-based machine learning models to allocate the contributions of individual features to the final behavior probability in a frame based on the magnitude of feature attributions34,35. SHAP analysis was run in SimBA on 150 random frames with the behavior present and 150 random frames with the behavior absent for each model to calculate individual feature contributions to the overall frame behavior probability (https://github.com/slundberg/shap). Figure 7A shows the top six features with the largest absolute SHAP scores for each behavior classifier, where the solid black line is the base rate for the behavior (the probability of a given frame containing the behavior by chance), each individual point reflects the change in behavior probability relative to the base rate (SHAP score) for that feature in one frame, and the color of the point reflects the z-score of the actual feature value for that frame. Consequently, the relationship between the shift in behavior probability and the actual feature value can be read as positive or negative. For the nest attendance classifier, features with high SHAP scores include distances between the dam centroid and pup centroid and dam convex hull features. Likewise, classifiers for other on-nest behaviors (active nursing, licking and grooming, passive nursing) also include dam-pup distance features and dam convex hull features among the top SHAP features. Top features for the self-directed grooming and licking and grooming classifiers include ear movement and dam distances. The sums of SHAP scores for all features by feature category (Fig. 7B) show that dam movement features did not have a substantial impact on nest attendance, but had a moderate influence on increasing behavior probabilities in the remaining models and a particularly large effect on licking and grooming, self-directed grooming, and active nursing. Pup probability and pup area features had little effect on behavior probabilities.
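
A sketch of the underlying shap calls for a tree-based classifier is shown below; the frame sampling and variable names are illustrative, and SimBA wraps these steps in its own SHAP routine.

```python
import shap

# clf: trained random forest behavior classifier
# X_sample: features for 150 frames with the behavior present and 150 with it absent
explainer = shap.TreeExplainer(clf)
shap_values = explainer.shap_values(X_sample)  # per-feature contributions for each frame

# expected_value is the model's base rate before any feature information
# (for a classifier, one value per class); the SHAP values for the
# "behavior present" class shift each frame's probability above or below it.
base_rate = explainer.expected_value
```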

Figure 7

SHAP results. (A) Top six features with the highest SHAP scores for all models. SHAP scores with the largest average absolute value across all tested frames are shown for each behavior classifier. Each point denotes the SHAP score for one of the 150 frames included in the SHAP analysis. The solid black line shows the base rate for each classifier, or the probability that the behavior occurs given the frequency of the behavior in the training data set before consideration of feature information. Base rates vary by classifier and are shown in Supplemental Table S2. SHAP values greater than the base rate indicate positive SHAP scores for that frame, increasing the behavior probability. SHAP scores to the left of the black line indicate scores that reduce the behavior probability in that frame. Each point is colored by the z-scored value of the feature for that specific frame, indicating the relationship between feature values and shifts in behavior probabilities. (B) Sum of SHAP scores of all features in each feature category for all classifiers. Positive SHAP scores indicate the magnitude of a feature's influence on increasing the behavior probability when the behavior is present, and negative SHAP values indicate influence on reducing the behavior probability in frames where the behavior is absent. Zero represents the base rate of behavior in the training dataset.

Discussion

Continuous home-cage monitoring is an optimal approach for assessing dam-pup interactions in a laboratory setting, but the burden of manual scoring limits its implementation. We present a pipeline that automates scoring of rodent dam-pup home-cage video recordings to produce frame-level annotations of seven maternal behaviors with high accuracy. Classifiers for pup-directed maternal behaviors performed particularly well on the hold-out video set, with F1 scores of 0.990 (nest attendance), 0.828 (active nursing), and 0.766 (licking and grooming). AMBER uses open-source software and standard rat housing equipment and does not require any specialized recording hardware or animal identification markers. When paired with automated recording equipment, home-cage behavior can be collected from an entire cohort of animals simultaneously over long time periods while avoiding the effects of experimenter presence or bias on behavior. Maternal behavior affects a variety of developmental outcomes, and AMBER eliminates the reproducibility, training, and inter-rater reliability drawbacks of manually scoring home cage maternal behavior, allowing assessment of maternal behavior regardless of the experimenter's expertise in behavioral coding.

The validation set of recordings shows that AMBER-scored videos produce expected patterns of maternal behavior over the first ten postnatal days. The duration of pup-directed behaviors was high on P1 and declined over the first week, while the durations of eating and drinking increased. This is consistent with previous work using manually coded behavior that reports declines in dam-pup contact and licking and grooming from P1–1037. Licking and grooming, nest attendance, and self-directed grooming were relatively normally distributed, with the range of percent time for licking and grooming similar to frequencies observed in time-sampling studies of Long-Evans rats37. Active nursing was not normally distributed, although this difference may be attributed to the one-hour observation during the early dark phase in the present study versus sampled observations throughout the dark and light cycle used in previous work38. The number of bouts for some behaviors was very high in a few videos (Supplemental Fig. S3) and mean bout duration was lower than expected based on manually scored data from previous studies. This difference is explained by the frame-level behavior measurements, where a single frame labeled with the behavior present (or absent) is sufficient to define a “bout”. Smoothing methods, such as enforcing a minimum bout duration, can be applied directly in SimBA to filter the data.
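
As an illustration of the idea, a minimum bout duration filter over frame-level predictions can be implemented as a simple post-processing step; this sketch is not SimBA's own smoothing code, and the names are illustrative.

```python
import numpy as np

def enforce_min_bout(labels: np.ndarray, min_frames: int) -> np.ndarray:
    """Remove behavior bouts shorter than min_frames from a 0/1 frame-level vector."""
    labels = labels.copy()
    start = None
    for i, val in enumerate(labels):
        if val == 1 and start is None:
            start = i                      # bout begins
        elif val == 0 and start is not None:
            if i - start < min_frames:
                labels[start:i] = 0        # drop bouts that are too short
            start = None
    if start is not None and len(labels) - start < min_frames:
        labels[start:] = 0                 # handle a bout running to the last frame
    return labels

# e.g. at 30 fps, require bouts to last at least 1 s (30 frames):
# smoothed = enforce_min_bout(predicted_labels, min_frames=30)
```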

Using the AMBER pipeline as presented in Fig. 1A, users obtain frame-level resolution of seven maternal behaviors. However, the individual components of AMBER (dam pose estimation, pup pose estimation, and behavior classification) can also be used for other applications. First, the dam and pup pose estimation models may be used separately in other contexts to track adult rats and pups in any side-view recording, a view that is compatible with most standard rodent home cages. Second, the pose estimation data may be used with compatible software to perform unsupervised behavior clustering39,40,41. Third, specific features extracted from the pose estimation data, e.g. the convex hull area of the pups, the dam centroid-pup centroid distance, or the degree of dam back curvature during nursing, may be informative in concert with behavior annotations. These data are calculated during feature extraction and are readily available for further analysis. Finally, the small size of neonatal pups and the large number of pups in a litter make manual behavior scoring for pups very difficult. While the primary purpose of the pup pose estimation model in the AMBER pipeline is to determine pup and nest location and dam-pup distances, the pup pose estimation data could be used separately to evaluate pup behavior in relation to dam behavior.

Model explainability metrics shed light on the “black box” of behavioral classification by providing articulated descriptions of how features influence model decisions, allowing users to critique model construct validity and compare different models beyond overall performance34. Feature importance permutation results showed low importance of dam movement features. The majority of features in the dam movement category are movements of individual dam body points, so this may suggest that such information is less informative or reliable for behavior classifier predictions than features that use information from multiple body points. SHAP analysis revealed several intuitive relationships between dam pup-directed behaviors and changes in behavior probability in our classifiers: (1) the convex hull areas of the dam get larger as she moves toward the nest (closer to the camera) at the front of the cage; (2) dam-pup Euclidean distances decrease when the dam is interacting with pups and increase when the dam is off the nest; (3) dam movement features are informative for classifiers whose behaviors can be operationally defined by specific body movements (e.g. licking and grooming); (4) the water bottle is located at the top of the cage, so the angle of the dam's back is informative for identifying drinking behavior.

AMBER is a substantial improvement over manual scoring methods for dam-pup home cage behavior, but it also has some notable limitations. Classifier performance for pup-directed behaviors may be compromised in recordings where pups are occluded for the duration of the video, since pup coordinate information and dam-pup distances are important features for several classifiers. This shortcoming could be circumvented by manually adding nest location information in place of pup tracking. Furthermore, the behavior classifiers presented here are trained on tracking information from side-view recordings and will not generalize to top-view recordings. We chose side-view recordings to allow better pup tracking and to eliminate the need for any specialized home-cage equipment, as many standard home cages contain food and water in the cage lid. In addition, F1 scores for the self-directed grooming, eating, and drinking classifiers were lower on the hold-out video set than on the test set. The improvement in performance when combining eating and drinking behaviors suggests that information about the location of the food and water could improve the models. Finally, the pose estimation models at present are optimized for detecting key points in Long-Evans rats and are unlikely to generalize well to visually different rodent species without training on additional frames. Likewise, differences in camera angle, bedding material, enrichment objects, cage layout, or lighting relative to the training videos may interfere with model transferability, requiring some additional labeled frames and pose estimation model retraining42. We are currently expanding the model training sets to include frames from videos of Sprague–Dawley rats, C57BL/6 mice, and CD1 mice in different home cages to make the pose estimation models more robust and able to perform well for a wider variety of rodent developmental studies. These models will be made publicly available on the AMBER repositories. Despite these current limitations, the AMBER pipeline is a significant step forward for the analysis of home cage dam-pup interactions, providing standardized, detailed behavioral data likely to yield new insights in developmental studies.

Materials and methods

Animal husbandry and breeding

All animal protocols were approved by the IACUC at the University of Texas at Austin, were performed in accordance with IACUC guidelines and regulations, and are reported in accordance with the ARRIVE guidelines. Animals were housed in polycarbonate cages (19″ × 10.5″ × 8″) with standard wire tops and were kept on a 12:12 h light cycle (lights off at 10 am EST). All dams were provided with aspen shavings (Nepco) as bedding material, which dams can manipulate to construct nests; no additional nesting material was provided. All animals were fed standard chow (Lab Diet 5LL2) and water ad libitum through bottles held at approximately a 45-degree angle in the wire tops. Eighty-eight adult P60–70 Long-Evans females and 35 adult Long-Evans males were purchased from Charles River Labs and acclimated to the vivarium for at least two weeks before breeding. During breeding, P75–85 females were screened daily for receptive behavior and housed with a breeder male overnight on the day lordosis was observed. All dams were socially housed throughout pregnancy until they were separated into individual cages a few days before giving birth. Day of birth was considered P0.

Video recording

Home cage behavior recording was conducted with Raspberry Pi 3B+ minicomputers running Debian Bullseye with the Raspberry Pi Desktop and equipped with Raspberry Pi Module 1 NoIR cameras. One Raspberry Pi was placed perpendicular to the short end of the cage closest to the nest location for each cage (Fig. 1). Cages were set up on wire racks with the water bottle spout facing away from the wall and the camera on the side closest to the wall, as rats typically prefer to place their nest near the wall and away from the water bottle. In the event that the dam moved the nest to the opposite end of the cage, the camera side was also switched at the first opportunity. Instances of dams moving the nest to the opposite end during a recording were rare, and those videos were not included. Raspberry Pis were held in place using phone mounts attached to magic arms clamped to the rack and were positioned to capture the width of the front of the cage with a view of the entire cage (except when occluded by excessive bedding or the dam). Pi distance from the cage was not standardized and thus varied slightly between recordings (Supplemental Fig. S2). Raspberry Pis were programmed to record for one hour starting an hour after lights-off, at 30 fps in greyscale at 1280 × 780 or 920 × 550 resolution, on postnatal days (P) 0–13. Two infrared LED strip lights (940 nm; LED Lights World) were attached to the bottom of the wire shelf above the cages and set to turn on and off automatically for each recording with a digital timer. An additional 156 videos were recorded over 24 h at 2 fps and 920 × 550 resolution to capture footage during both the light and dark phases. Frames from these videos were used to improve the pose estimation models, but only videos recorded at 30 fps were used to build the behavior classifiers. Raspberry Pis were headless and accessed remotely to prevent disruption of home cage behavior by experimenter presence before or during recordings. Following recording, videos were automatically converted to mp4 format with MP4Box and uploaded to cloud storage. Raspberry Pi recording setup instructions and recording scripts are available at https://github.com/lapphe/raspberry_rat.
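
The full recording scripts live in the raspberry_rat repository; the sketch below only illustrates the kind of timed, greyscale recording and MP4Box conversion described above, assuming the legacy picamera library and MP4Box are installed (file names and settings shown are placeholders, not the authors' script).

```python
import subprocess
from picamera import PiCamera

# One-hour greyscale recording at 30 fps, matching the dark-phase settings above
camera = PiCamera(resolution=(1280, 780), framerate=30)
camera.color_effects = (128, 128)        # record in greyscale
camera.start_recording("homecage.h264")
camera.wait_recording(60 * 60)           # record for one hour
camera.stop_recording()
camera.close()

# Wrap the raw h264 stream into an mp4 container with MP4Box
subprocess.run(["MP4Box", "-add", "homecage.h264", "homecage.mp4"], check=True)
```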

Pose estimation models

Pose estimation for dams was achieved using single-animal DeepLabCut30. A total of 4,710 frames were extracted from 255 videos. Thirty-two dam body points were selected from 60 candidate body points for labeling based on the ability of users to label the key points consistently and on providing adequate coverage of the dam regardless of partial body occlusion or orientation relative to the camera (Fig. 1C). Only body points that were visible in the frame were labeled. For example, if only the left side of the body was visible, points on the right arm, leg, and ventrum were not labeled. Two percent of labeled frames were used as a test set for model evaluation. The dam model was trained for 650,000 iterations using ResNet-50 with a batch size of 8.

The pup pose estimation model was developed using multi-animal DeepLabCut31. A total of 1,712 frames from 238 videos of pups between postnatal days 0–10 were extracted using k-means clustering and uniform sampling. Nine pup points from the nose to tail base were labeled on each pup visible in the frame (Fig. 1C). Two percent of labeled frames were used as a test set. The pup model was trained for 200,000 iterations using DLCRNet_ms5 with a batch size of 8. Pup detections, rather than individual tracks, obtained by running the DeepLabCut function analyze_videos with auto_track = False, were used for pup pose estimation. A custom Python script (pheno_pickle_raw.py) was used to convert the pup detections pickle file (ending in “full.pickle”) to a csv file containing the pup key point detections. Unfiltered dam and pup pose estimation files were joined by frame number using a custom Python script (join_dam_pup.py). Column headers were reformatted to match the input format expected by SimBA for single-animal DeepLabCut pose estimation.

Behavior classifier development

A SimBA single-animal project configuration with user-defined body points consisting of all dam and pup key points was used for creating behavior classifiers. Because AMBER uses a custom feature extraction script specific to dam and pup points, the only difference between single-animal and multi-animal SimBA projects is the expected format of the pose estimation data imported into SimBA. The width of the wire cage top at its lowest point, which corresponds to approximately the center of the long side of the cage, was used to define pixels per mm during the Video Settings step (see Supplemental Fig. S2). Because of the side camera view, the actual pixel/mm distance changes with the dam and pup location in the cage, but setting this distance helps account for differences in cage distance from the camera and frame resolution between recordings. The outlier correction step of SimBA was skipped.

Dam and pup features were extracted from the pose estimation data using a custom script to calculate 218 features (Supplemental Table S1). Because outlier correction is skipped and a large number of occlusions is expected in each frame for dam key points (e.g. points on the right side are not visible when the left side of her body faces the camera) and pups (often partially or fully occluded by each other, bedding, or the dam), the majority of feature calculations involve weighting by key point probabilities or applying a minimum probability threshold to exclude occluded points. One feature requires installation of the circle-fit Python package to first fit a circle through the dam's back points and then calculate the angle between the first back point, the center of the circle, and the last back point. All other requirements are satisfied by SimBA dependencies.
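
A sketch of this back-curvature feature is shown below, assuming the circle-fit package's least_squares_circle function and an array of the dam's back key point coordinates for a single frame; the function and variable names are illustrative, not the AMBER extraction script itself.

```python
import numpy as np
from circle_fit import least_squares_circle

def back_arch_angle(back_points: np.ndarray) -> float:
    """Angle (degrees) at the fitted circle's center between the first and last back points."""
    xc, yc, r, sigma = least_squares_circle(back_points)  # fit circle through back key points
    center = np.array([xc, yc])
    v1 = back_points[0] - center
    v2 = back_points[-1] - center
    cos_angle = np.dot(v1, v2) / (np.linalg.norm(v1) * np.linalg.norm(v2))
    return float(np.degrees(np.arccos(np.clip(cos_angle, -1.0, 1.0))))

# back = np.array([[x1, y1], [x2, y2], ...])  # dam back key points for one frame
# angle = back_arch_angle(back)
```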

Classifier training videos were carefully annotated for seven maternal behaviors using BORIS33 and then imported into SimBA. A total of 3,366,254 frames (31.1 h of recording at 30 fps) from 28 one-hour videos and 10 additional shorter video clips (1.7–22 min each) were annotated for nest attendance, active nursing, passive nursing, licking and grooming, self-directed grooming, eating, and drinking (Supplemental Table S2). Clear definitions for inclusion in behavior scoring were developed by incorporating existing definitions for these behaviors, establishing clear rules for the precise start and end of behaviors, and adding and modifying rules until intra-rater and inter-rater reliability was high (> 0.96) across an initial set of 10 videos for each behavior. To avoid bias toward selecting only the most obvious examples of behaviors, the entire recording was scored for all behaviors in each annotated video. The videos selected for manual scoring included a range of different dams and pups of different ages, and did not include any videos where the nest was at the end of the cage opposite the camera or where the dam moved the nest during the recording. The 10 additional video clips were selected to provide more examples of infrequent behaviors (i.e. eating, drinking, passive nursing). A detailed ethogram guide that includes behavior definitions, instructions for scoring, and example images is available at https://github.com/lapphe/AMBER-pipeline.

Random forest models were run in SimBA with the following hyperparameters: 100–1500 trees, minimum leaf node = 1 or 2, RF_criterion = gini, RF_max_features = sqrt, test size = 20%, and no sampling adjustment (Supplemental Table S2). Twenty percent of frames were excluded from training and used as a test set to evaluate model performance.

Pipeline validation for maternal phenotype

Four one-hour recordings were manually scored as described above and used as a hold-out data set to assess model generalizability (these recordings were not included in the training or test sets). The hold-out videos were of four different dams with pups of different ages (P2–9). Passive nursing is an infrequent behavior in Long-Evans rats provided with sufficient bedding material and, unfortunately, it did not occur in any of the four videos, precluding evaluation of the passive nursing classifier in the hold-out video set.

A set of 242 one-hour home cage recordings, taken beginning one hour after lights-off on P1–10 from 49 dams, was used in the AMBER pipeline workflow as shown in Fig. 1A to assess overall patterns of maternal behavior. This set includes the 28 one-hour videos used to create the behavior classifiers. Thresholds used for the behavior classifiers are noted in Supplemental Table S2. Total duration, percent time, bout number, and mean bout duration were calculated in SimBA for each behavior in each video (Fig. 5 and Supplemental Fig. S5). Changes in behavior durations over time were analyzed using linear mixed models in R with the lmerTest package, with litter ID included as a random effect36.

Explainability metrics for behavior classifiers

Feature importance permutations were calculated for each behavior classifier in SimBA. SHAP analysis was run in SimBA on 150 random frames with behavior present and 150 random frames with behavior absent for each model. Full results files for feature importance permutations and SHAP analysis are available on our OSF repository: https://osf.io/e3dyc/.

Computer hardware and software for machine learning models

All models were trained on a Dell Precision 7920 Tower with dual Intel Xeon Gold 5122 3.6 GHz processors, 64 GB RAM, a Windows 10 operating system, and an NVIDIA Quadro P5000 video card. Pose estimation models were trained using DeepLabCut version 2.3. Behavioral classifiers were generated using SimBA version 1.65.5. Python was used for pickle file conversion and for joining the pose estimation data.