Automated classification of estrous stage in rodents using deep learning

Wolcott, Nora S.; Sit, Kevin K.; Raimondi, Gianna; Hodges, Travis; Shansky, Rebecca M.; Galea, Liisa A. M.; Ostroff, Linnaea E.; Goard, Michael J.

doi:10.1038/s41598-022-22392-w

Download PDF

Article
Open access
Published: 21 October 2022

Automated classification of estrous stage in rodents using deep learning

Nora S. Wolcott¹,
Kevin K. Sit²,
Gianna Raimondi³,
Travis Hodges^4,8,
Rebecca M. Shansky⁵,
Liisa A. M. Galea^4,6,
Linnaea E. Ostroff³ &
…
Michael J. Goard^1,2,7

Scientific Reports volume 12, Article number: 17685 (2022) Cite this article

3604 Accesses
1 Citations
11 Altmetric
Metrics details

Subjects

Abstract

The rodent estrous cycle modulates a range of biological functions, from gene expression to behavior. The cycle is typically divided into four stages, each characterized by distinct hormone concentration profiles. Given the difficulty of repeatedly sampling plasma steroid hormones from rodents, the primary method for classifying estrous stage is by identifying vaginal epithelial cell types. However, manual classification of epithelial cell samples is time-intensive and variable, even amongst expert investigators. Here, we use a deep learning approach to achieve classification accuracy at expert level. Due to the heterogeneity and breadth of our input dataset, our deep learning approach (“EstrousNet”) is highly generalizable across rodent species, stains, and subjects. The EstrousNet algorithm exploits the temporal dimension of the hormonal cycle by fitting classifications to an archetypal cycle, highlighting possible misclassifications and flagging anestrus phases (e.g., pseudopregnancy). EstrousNet allows for rapid estrous cycle staging, improving the ability of investigators to consider endocrine state in their rodent studies.

Machine learning reveals the control mechanics of an insect wing hinge

Article 17 April 2024

Inferring gene regulatory networks from single-cell multiome data using atlas-scale external data

Article Open access 12 April 2024

Simultaneous single-cell three-dimensional genome and gene expression profiling uncovers dynamic enhancer connectivity underlying olfactory receptor choice

Article Open access 15 April 2024

Introduction

With the broad incorporation of female animals into previously all-male studies^1,2 we are at a critical juncture for the interpretation of endocrine physiology. In naturally cycling humans, the menstrual cycle lasts 28 days and is characterized by defined peaks in steroid hormones such as estradiol and progesterone^{3,4,5,6,7,8,9}. In female rats and mice, the analogous cycle lasts only 4–5 days and is known as the estrous cycle¹⁰. The estrous cycle was first described over a century ago¹¹, yet the criteria for tracking this cycle remain subjective and variable between experimenters¹². Determining the stage of estrous is critical to evaluating the state of the hypothalamic-pituitary-ovarian axis, which has implications in a myriad of factors including gene expression^13,14, neuronal structure and connectivity^3,15, and pharmacological efficacy¹⁶. In addition, correct interpretation of estrous stage is useful for timed pregnancy in rodents, and changes in cycle regularity can be used as a proxy for changes in other critical hormones such as corticosterone^17,18.

The estrous cycle can be divided into four stages: diestrus, proestrus, estrus, and metestrus^{19,20,21,22,23}. While techniques such as vaginal opening evaluation, vaginal wall impedance, and urine biochemistry have all been used as methods for determining estrous stage^20,21, epithelial cell cytology remains the most common and reliable strategy^10,22,23. Classification using vaginal cytology is typically performed by manually counting or estimating the relative prevalence of epithelial cell types, including leukocytes, cornified epithelial, and nucleated epithelial cells, and using the proportionality of these subtypes to determine stage^10,19.

Despite the prevalence of this method, there are several limitations of epithelial cell cytology for estrous stage classification: (1) it requires extensive training for which no standardized training set exists. (2) it lacks generalizability; even expert classifiers may have trouble generalizing across rodent species, stains, and subjects. (3) it is inconsistent between labs, as classification can vary widely between human examiners¹². Here, we address these challenges using a novel deep learning algorithm that can generate estrous stage classifications in a fully automated and standardized manner.

Convolutional neural networks (CNNs) have outperformed human experts in diagnosing retinal disease²⁴, skin cancer²⁵, syndromic genetic diseases²⁶, and a host of other medical conditions²⁷. These networks are broadly useful for their speed and reliability. Although CNNs are difficult to train from scratch, requiring massive training data sets for accurate classification, transfer learning can exploit the multilayered architectures of pretrained networks to classify complex biological images^28,29.

Here, we have compiled a large-scale multi-laboratory dataset of cytology images (“EstrousBank”). We then used EstrousBank to train a deep learning algorithm (“EstrousNet”) to effectively recognize structural markers of the estrous cycle in a manner generalizable across subjects, stains, and rodent species. The resulting classifications are not significantly different than expert human examiners in any stage surveyed. The predictions generated by EstrousNet can be enhanced by using sequentially collected data to fit cytological samples with an archetypal estrous cycle. Cycle fitting, along with training, classification, and output, are operated through an interactive graphical user interface (GUI). Taken together, these results show that our deep learning approach is capable of rapid and accurate classification of estrous stage.

Results

EstrousBank: an open resource for analysis of vaginal cytology images

A major barrier to the development of software to analyze the estrous cycle is a data-poor environment that requires experimenters to collect their own cytology images. In our efforts to make the EstrousNet algorithm generalizable across groups, we have compiled the largest known image bank of estrous cytology images. EstrousBank currently spans five labs, five stains, two magnifications, and multiple rodent species (Fig. 1A–C, Supplementary Table S1). The complete image bank comprises 12,719 vaginal cytology images and is freely available for analysis by outside laboratories. We will continue to add images to the bank from our and other groups as more samples become available. Cytological samples across labs were collected using a standard lavage or swabbing procedure (See Methods). Briefly, epithelial cells were exfoliated from the superficial vaginal cavity via sterile saline and transferred to a glass microscope slide. Samples were allowed to dry for up to 24 h before staining with one of several compounds, and images were collected using brightfield microscopy at a range of magnifications (Supplementary Table S1). Once cytological images were taken, the sample was classified into a given estrous stage based on agreement from two or more expert examiners. This was used as a proxy for correct classification, in lieu of ground truth measurements of plasma steroid hormone concentration.

EstrousBank contains images from all four stages of the estrous cycle, which were classified by experts according to classical cytology parameters, which are as follows^20,21,22,23: mouse diestrus is characterized by an abundance of small leukocytes, a sharp decrease in proportions of keratinized anucleated epithelial cells, and lower numbers of both small and large nucleated epithelial cells (Fig. 1A–C). Mucosal secretions appear thick and stringy when present. Proestrus is a more transient stage characterized by a uniform spread of small rounded basophilic nucleated epithelial cells, and low proportions of anucleated cornified epithelial cells (Fig. 1A–C). Estrus is typically identified by the high proportion of large anucleated cornified epithelial cells, which often form clumps or sheets that become more prominent in late estrus (Fig. 1A–C). Metestrus is a short stage identified by the presence of both nucleated epithelial and cornified epithelial cells, with leukocytes clustered around them, and an elevated level of mucosal secretions (Fig. 1A–C). While others describe metestrus and diestrus as one continuous stage, here we consider metestrus to be its own distinct stage preceding diestrus. We have refrained from breaking diestrus into further substages due to a lack of sequential data, as well as the morphological uniformity of this period. Cytological characterizations are largely consistent between mice and rats, but the following differences have been observed: rats exhibit a higher proportion of large ovular nucleated epithelial cells in late estrus, shorter periods of proestrus/metestrus, and lower proportions of anucleated cornified epithelial cells in metestrus²¹. Given these similarities, we trained EstrousNet on cytology images from several strains of mice and rats to improve generalization across model systems; with 34.1% of the image set from mice and 65.9% from rats.

Although previous studies have used computational methods to analyze vaginal cytology^12,30, the input datasets for these networks have historically been restricted to a single stain. To further enhance generalizability, the training and validation image sets for EstrousNet include samples stained with H&E, Shorr, Giemsa, cresyl violet, and crystal violet stains, at magnifications of 10 × and 20 × (Fig. 1A–C, Supplementary Table S1).

A ResNet-50-based CNN architecture maximizes EstrousNet performance

To predict estrous stage from vaginal cytology images, we developed a classification pipeline using a convolutional deep learning network to detect cell boundaries and recognize endocrine biomarkers within cytological samples. For training and validation, we used consensus classifications (see Methods) to attach an estrous stage label to each image. EstrousNet is trained on subsets of EstrousBank images that are augmented for a greater volume of training data (Fig. 1D). Input images are first segmented into quadrants (Fig. 1D.i, ii), then reflected, rotated, scaled, and translated within the Net (Fig. 1D.iii). The augmented images undergo luminance normalization, then are converted to 3-channel grayscale arrays for more efficient feature extraction (Supplementary Fig. S1). Next, these augmented images are compiled into a large datastore and fed into the ResNet-50 architecture, which consists of four convolutional stages of increasing dimension (Fig. 1E). The convolutional layers of the network converge on a SoftMax classification layer, which outputs probabilistic classification of estrous cycle stage, including a Confidence Index (CI) that directly reports degree of certainty to the user (Fig. 1E, F). This classification is optionally supplemented by fitting the test images to a curve describing the length and phase of the estrous cycle (Fig. 1F). For images in which the cyclicity prediction and net prediction disagree, the interactive graphical user interface (GUI) will ask the user to select which classification is the best fit (Supplementary Fig. S2). For these images, if the CI is below a given threshold, users will be given the option to select a transition stage classification (Supplementary Fig. S2). The composite classifications of the EstrousNet and cyclicity predictions provide the experimenter with an informed estrous stage classification.

Previous studies investigating the efficacy of transfer learning in biological tissue classification have used several CNN architectures^12,28,29. Here, we evaluated four different pretrained networks: VGG-19, Inception v3, MobileNet V2, and ResNet-50, across three training epochs (Fig. 2A)^31,32,33,34. Each base architecture was originally trained on more than one million images from the ImageNet database and retrained on an augmented dataset made up of 80% of EstrousBank images, with 10% of images reserved for validation and 10% reserved for testing (Fig. 2B,C). All base architectures have previously been used for supervised learning in biological classification tasks and achieved accuracy comparable to or exceeding that of human coders^24,25,26,27. The mean validation accuracies averaged over 3 iterations for each architecture are as follows: VGG-19 = 77.5%, Inception v3 = 79.7%, MobileNetV2 = 65.5% and ResNet-50 = 88.9% (Fig. 2A). These accuracies are calculated based on ground truth data defined by benchmark classifications between two or more expert human examiners. Based on these results, we concluded that ResNet-50 was the most effective architecture.

EstrousNet outperforms human coders

The cytology images in our training set were originally sorted into stages by expert human classifiers. These classifications were made using subjective assessments according to established approaches^10,20,21 (see Methods). Unfortunately, human classification is limited by inter-experimenter variability and differences in experience with particular species, strains, and histological stains. In addition, the CNN may be capable of identifying subtle morphological features that are difficult for humans to identify, such as increased cell clumping in estrus and higher mucus content in metestrus and diestrus.

To quantify differences between EstrousNet and human coders, we compared classification performance on a comparison set of 400 randomly selected images (100 from each stage) between EstrousNet and three expert human coders. The randomly selected images span the distribution of species, stains, and laboratories of the EstrousBank training set, and were thus considered representative of the larger image bank. Across the test image set, EstrousNet classified stages significantly more accurately than human examiners (odds ratio = 0.68, 95% confidence interval = 0.55–0.83, p = 2.1 × 10^–4; Fisher’s Exact Test). Breaking down performance by stage, EstrousNet achieved significantly greater accuracy than expert human examiners for diestrus (odds ratio = 0.6791, 95% confidence interval = 0.55–0.83, p = 1.2 × 10^–5), whereas accuracy was higher, but not significantly different than expert examiners, for proestrus (odds ratio = 0.68, 95% confidence interval = 0.55–0.83, p = 0.075), estrus (odds ratio = 0.6791, 95% confidence interval = 0.55–0.83, p = 0.84) and metestrus (odds ratio = 0.68, 95% confidence interval = 0.55–0.83, p = 0.60; Fisher’s Exact Test for all comparisons; Fig. 2D–F). EstrousNet classifications also achieved impressive speed, with an average rate of 0.10 + / − 0.005 s (mean + / − SE) per image.

Expert human staging showed a large degree of variance, with only 275 images, or 68.75% of the total comparison set, shared between all three coders (Fig. 2G). A notable number of classifications, 15.9%, were unique to one human coder (Fig. 2G). Therefore, even amongst expert human classifiers, classifications can vary widely across a generalizable dataset of cytology images.

EstrousNet is generalizable across species, stains, and subjects

To further quantify EstrousNet performance for each estrous stage, we measured the area under the receiver operating characteristic (auROC) for each stage independently. EstrousNet demonstrated auROC values greater than 0.79 for all four estrous stages, with estrus achieving the highest auROC at 0.98 (Fig. 3A). Despite this high performance, there are areas in which EstrousNet shows tendencies towards misclassification. Sensitivity and specificity curves show that EstrousNet is stronger in eliminating false negative results than false positive results, indicating a higher degree of sensitivity than specificity (Fig. 3B). For example, if EstrousNet is given an image of an unknown stage and asked if the sample is from an animal in diestrus, EstrousNet is more likely to classify the sample as diestrus when it is not (false positive), than to classify it as not diestrus when it is (false negative). Therefore, most misclassifications are specificity errors, which could potentially be reduced with further optimization.

In out-of-sample trials in which the CNN was tested on different categories of unseen data, EstrousNet did not show significant differences in test accuracy between any of the given stains it was tested on, including H&E, Shorr, Giemsa, cresyl violet, or crystal violet (Fig. 3C). Additionally, despite cytological differences, images from mice and rats did not show significant differences in testing accuracy (Fig. 3D). Finally, cross-validation across 6 evenly split groups of subjects, including rats and mice of different strains, did not reveal any out-of-sample differences in test accuracy between animals (Fig. 3E).

Using cycle fitting for predictive stage classification

When an experimenter classifies estrous stage from epithelial cytology, they not only consider cell morphology and relative prevalence, but also how images might correspond to a typical estrous cycle. Helpfully, some common confusion errors occur between stages that are temporally distinct. For instance, true metestrus is classified as proestrus at a rate of 24.0% despite being non-adjacent stages of the cycle (Fig. 2D). As a result, we can exploit the natural sequence of the estrous cycle to identify these errors when test images are taken consecutively. To this end, EstrousNet uses a predictive algorithm that fits an archetypal estrous cycle to the labels generated by the net and identifies outliers (Fig. 4A, B; Supplementary Table S2).

A custom cycle waveform was created based on the duration of estrous stages reported from thirteen groups, with a total cycle period of 4.8 days^{10,18,20,21,22,23,30,35,36,37,38} (Fig. 4A; Supplementary Table S2). If more than 4 days of test images are selected (i.e., n > 4*x where x is the sampling frequency per day), the algorithm can fit an archetypal cycle to the data to determine the relative phase that best fits the classification labels. The phase of this periodic waveform is shifted by increments of 0.1 cycles to find the best fit for the input data (Fig. 4B). Additionally, we developed a MATLAB-based GUI that allows experimenters to select which stage to accept in cases where the net prediction and cyclicity predictions do not match, as well as a “transition flag” that suggests an intermediate estrous stage for cases in which classification certainty is low (Supplementary Fig. S2).

Fitting stages to an archetypal cycle also allows us to identify disruptions in the estrous cycle, such as those observed when the rodent enters pseudopregnancy, a condition occasionally induced by vaginal swab or lavage^21,22. Observations of anestrous stages are also useful for those inducing timed pseudopregnancy for reproductive management and embryo transfer^10,39. To address this, EstrousNet will alert the user with a pseudopregnancy warning flag if the animal stays in diestrus for > 50% longer than in previous cycles (Fig. 4C). Manual cell counts from an example cycle in which a mouse was lavaged once a day for 8 consecutive days show a significant increase in the proportion of leukocytes observed once the animal enters pseudopregnancy [Fig. 4D, F(1,6) = 7.44, p = 0.034]. Such persistent diestrus following a cornified swab is consistent with previous observations of chemically or mechanically induced pseudopregnancy, and can be seen in a series of cytological images (Fig. 4E)²².

Additionally, cycle fitting may help to identify stages that do not fall into a traditional category. While here we refer to estrous as consisting of 4 substages, as many as 13 substages have been identified, each corresponding to physiologically distinct steroid hormone concentrations^41,42. For the intermediate period(s) between each stage, manual cell counting of sequential samples revealed cell proportionalities distinct to these stages (Fig. 5). Images from transition stages of the estrous cycle result in greater uncertainty from EstrousNet, as these samples do not belong to a predefined category (Supplementary Fig. S4). To flag potential transition stages in the GUI, classification probabilities are plotted in a bar chart, and the degree of certainty is reported as a Confidence Index (CI) (Fig. 1F, Supplementary Fig. S2C), where a CI of 1 indicates complete certainty of the image belonging to the predicted stage and 0 indicates equal certainty between the top two guesses (Supplementary Fig. S4). In cases where the CI is less than a given threshold and the net and cyclicity predictions do not match, the GUI will give the user the option to select a transition stage as their final classification.

Due to the cyclic and continuous nature of the estrous cycle, characterizing transition stages with deep learning is a principal goal of the EstrousNet project moving forward. Continued contributions of sequentially catalogued data to EstrousBank will be a critical step towards accurate classification of estrous transition stages.

Discussion

Here, we created a deep learning network for automated classification of estrous stage. The 12,719 images that constitute EstrousBank allow us to classify the four stages of estrous in a manner generalizable to stain, subject, and rodent species. EstrousBank is a valuable tool for future developers in the rapidly advancing machine learning field, and the benchmark classifications within the bank provide an intuitive guide for those learning to identify estrous stage. Our EstrousNet GUI additionally makes the CNN easily accessible to untrained users.

We trained EstrousNet on a random 80% subset of EstrousBank using a ResNet-50-based transfer learning algorithm, yielding test accuracy significantly greater than expert human examiners (Figs. 2, 3). Our software incorporates a preloaded trained network for easy adoption, while allowing more advanced users to train their own networks with custom parameters, including as many stages as is desired (Supplementary Fig. S2). This may be helpful for groups that classify diestrus and metestrus as one stage, only want to differentiate proestrus from non-proestrus, or wish to add transition stages to their classification output.

To further improve estrous stage classification, EstrousNet incorporates a cycle fitting algorithm that flags outlier cases in which the deep learning classifications do not line up with an archetypal estrous cycle (Fig. 4). In these cases, the GUI gives the user the option to choose between EstrousNet and cyclicity classifications (Supplementary Fig. S2). In addition, if the CI is below a given threshold, the user will be given a third option: the transition stage that most closely aligns with an archetypal cycle. The user’s selection is then incorporated into the final GUI output.

Despite our progress in estrous stage classification with EstrousNet and EstrousBank, some limitations remain. Because of the heterogeneity of the training image set, we sacrifice some accuracy for the sake of generalizability. Other CNNs trained to distinguish 3 stages using a single dataset therefore exhibit higher validation accuracy within their own dataset, but fail to generalize to the broader EstrousNet dataset (Supplementary Fig. S3)¹². Since the presence of both cornified and nucleated epithelial cells in metestrus causes confusion with proestrus, more data will be useful for training CNNs to differentiate between these two stages. It should be noted that the machine learning approach described here was motivated by our previous attempts to classify estrous cycle stage from cell counting, which we found insufficient to capture changes in epithelial morphology across the cycle. However, cell segmentation is a rapidly advancing field^53,54, and it is possible that in the future these methods may complement machine learning in estrous stage classification.

In its current form, misclassifications by EstrousNet remain significantly lower than human experts in diestrus, and similar to expert human coders in proestrus, estrus, and metestrus (Fig. 2F). The significantly higher accuracy of diestrus classifications will be useful in flagging the diestrus-proestrus transition, during which estradiol levels spike up to 100-fold^41,42. The combination of the easy-to-use software and our highly generalizable algorithm makes EstrousNet an excellent resource for inexperienced classifiers. Our results indicate that human variability remains high even amongst expert coders, highlighting the need for increased inter-lab consistency (Fig. 2G). With many experimenters making the transition to using both sexes in rodent studies, generalizable and automated pipelines for tracking estrous stage will be useful for a range of laboratories.

Although 68.3% of EstrousBank images consist of uniform or semi-uniform stains such as crystal violet and H&E, stains designed specifically for hormonal cytodiagnosis offer an opportunity to identify more nuanced biomarkers of the estrous cycle. For instance, Shorr stain makes it possible to distinguish acidophilic and basophilic epithelial cell subtypes, either of which may be more prevalent in the early or late phase of a given estrous stage⁴⁰. Identifying such graded changes in cell type proportionality will be useful for classifying transition stages of the estrous cycle (Fig. 5). Characterization of substages will be a step forward in reframing our understanding of the estrous cycle as a continuum, instead of a series of discrete stages. This is a primary goal of the EstrousBank project, which we view as a dynamic and continuously growing resource for estrous stage classification. The addition of more sequentially collected cytological data to the open-source image repository will be a crucial step towards building a neural network that can accurately classify transition stages. Furthermore, we hope to add cytology images with additional stains, magnifications, and rodent strains to EstrousBank, including unstained samples. The inclusion of fresh smears, especially, will augment the ability of EstrousNet to classify samples prior to staining. By making the EstrousBank repository open source and encouraging the submission of data from outside researchers, we hope to further increase the generalizability of EstrousNet across laboratories.

It should be noted that currently there is no ground truth data for cytological stage in vivo, as the low concentrations of hormones such as estradiol and progesterone in the bloodstream make daily collection of endocrine data generally intractable in rodents. Although larger rats may have sufficient blood volume for repeated sampling, existing radioimmunoassay techniques are invasive, expensive, and time consuming⁴³. At present, most ground truth data from the estrous cycle is derived from terminal experiments in which animals are sacrificed at staggered timepoints and large volumes of blood are used to determine hormone concentration^18,22,41.

Despite these limitations, advances in biosensors for steroid hormone analysis, including aptamer^44,45, bioaffinity⁴⁶, and magnetic nanoparticle sensors⁴⁷, offer exciting opportunities for repeated estradiol and progesterone measurements. Additionally, physiological characteristics such as temperature⁴⁸, heart rate⁴⁹, uterine impedance²⁰, and blood oxygen content⁵⁰ could be incorporated into estrous stage identification as a proxy for steroid hormone concentrations. As new biomarkers become available, we hope to update EstrousNet to integrate these inputs and further improve classification accuracy.

Ultimately, it is our goal that accessible technologies for cytological classification will help reduce the exclusion of female animals from scientific studies, a disparity that is especially prevalent in fields such as neuroscience and pharmacology, in which significant sex differences have been described^1,2. We hope that by continuing to add new cytology images and metadata into our EstrousBank dataset over time, we will be able to bolster our network to identify biological processes that are modulated by steroid hormones.

Methods

Animals

The images in EstrousBank were collected from 5 different labs. Cytology images from the Goard lab were taken from female Thy1-GFP-M transgenic mice and Slc17a7-IRES2-Cre × TITL2-GC6s-ICL-TTA2 double transgenic mice, neither of which showed strain-specific disruptions to the estrous cycle. Animals were housed in cages of up to 5 animals, and singly housed after being surgically implanted with a headplate and cranial window for corresponding imaging experiments. Animals were given food and water ad libitum and kept on a 12 h light/dark cycle. Samples were taken at 16–40 weeks, with a median age of 30 weeks, using vaginal lavage. All animal procedures were approved by the Institutional Animal Care and Use Committee at University of California, Santa Barbara, protocol number 906.2.

Cytology from the Galea Lab was taken from wild-type female Sprague–Dawley rats. Animals were housed in cages of 2–3, given food and water ad libitum, and kept on a 12 h light/dark cycle. Samples were taken at 8–17 weeks of age using vaginal lavage. Older animals were concomitantly involved in behavioral experiments that may have resulted in elevated stress. All experimental procedures were approved by the University of British Columbia Animal Care Committee and were completed in accordance with the Canadian Council on Animal Care guidelines, protocol number A20-0147.

Cytology from the Ostroff lab was taken from wild-type female Sprague–Dawley rats. Animals were housed in cages of 2, given food and water ad libitum, and kept on either a 12 h or 14:10 light/dark cycle. Cages were filled with autoclaved standard Sani-Chip bedding (Teklad Global, Envigo) and one enrichment device. Samples were taken at 4–14 weeks of age using vaginal swab. All animal protocols were approved by the Institutional Animal Care and Use Committee at the University of Connecticut, protocol number A17-036.

Cytology from the Shansky Lab was taken from wild-type female Long Evans rats. Animals were housed in cages of 2, given food and water ad libitum, and kept on a 12 h light/dark cycle. Samples were taken at average 12–16 weeks using vaginal swab. All animal procedures were approved by the Institutional Animal Care and Use Committee at Northeastern University, protocol number 18-0828R.

Cytology from the Sutoh lab was taken from wild-type female C57BL/6 J mice. Animals were provided food and water ad libitum and kept on a 12 h light/dark cycle. Samples were taken at 5–14 weeks using vaginal swab. All animal-use procedures were in accord with the Guidelines for Animal Experimentation of Chiba University, protocol number 25-134.

Vaginal cytology

EstrousBank samples were collected primarily during the light phase of the cycle, and none were collected under reverse light cycle conditions (see above). Individual swab/lavage timing, as well as intervals between swab/lavage, varied between groups. Samples were taken using either saline lavage (9.2%) or vaginal swab (90.8%). Vaginal lavage samples were collected using a P200 micropipette. 50 µl sterile saline was pipetted into the vaginal opening and aspirated several times to obtain a sufficient cell count. The sample was pipetted onto a gel subbed microscope slide and allowed to dry 24 h before staining. For vaginal swabs, cotton-tipped swabs were soaked in sterile saline and briefly rolled against the superficial vaginal wall. The epithelial cells on the swab were then transferred to a dry gel subbed glass slide.

Gel subbing was performed in-house using standard IHC protocol to coat glass slides in gelatin/CrK(SO₄)₂ solution¹⁹. Staining procedures, including crystal violet, cresyl violet, Giemsa, H&E, and Shorr stain, are as described elsewhere^20,40,41.

EstrousBank curation

The 12,719 images in EstrousBank were contributed from the Goard lab, Ostroff lab, Shansky lab, Galea lab, and Sutoh lab. These labs provided cytology images from a diverse set of histological stains, magnifications, species, and strains (Supplementary Table S1). Initial classifications were made based on traditional cell type proportionality, as determined by the source lab. For cross-group consistency, benchmark classifications were made between the experimenters who provided the cytology images and those compiling EstrousBank. Images were classified into a given stage when 2 or more expert coders agreed on a stage classification, including those from transition stages (Fig. 5). Five total examiners were involved in generating benchmark classifications, each with > 2 years of experience in classifying cytology images. Images containing excessive debris, n < 10 cells, or < 300 pixels were excluded (4.6%).

Due to variability in experimental designs, only two laboratories (Goard and Ostroff) reported cycle timing in their metadata (7,301 images, 57.4% of total images). For these images, the timing of the sample in the context of the cycle was taken into account when classifying the sample. However, since a substantial portion of the input data to EstrousNet did not include sequential sampling, images were only classified into one of the four canonical estrous stages: diestrus, proestrus, estrus, and metestrus. Images suspected to be from a transition stage, or from a substage of a longer stage, like diestrus, were still assigned to one of these four stages via agreement between two or more expert examiners.

Image preprocessing

Input images were normalized by aligning maximum luminance peaks. Images were then converted to greyscale to allow EstrousNet to generalize onto different stains. After normalization, images in both cohorts were randomly divided into 80% training (10,177 images), 10% validation (1271 images), and 10% test sets (1271 images). The training, validation, and test sets were proportionally representative of EstrousBank as a whole, i.e. they contained the same proportions of magnifications, stains, strains, and laboratories. Stages were normalized to contain the same number of images to avoid bias towards any one stage. No seed was used for randomization; images were sampled without replacement in MATLAB. These images were then split into four quadrants within the same directory. Greyscale images were concatenated into 3D arrays to meet input image size requirements. Images were stored in an augmented datastore where each image was resized to 224 × 224 × 3 to meet ResNet-50 input parameters.

EstrousNet augmented the quadrupled dataset with X and Y translation, rotation, reflection, and scaling, according to user parameters in EstrousNetTrainNewNet.mlapp, the network training GUI. EstrousNet users can choose to train their own net using custom augmentation parameters in the EstrousNet GUI or load one of our open-source pretrained networks.

Implementation and training of CNN architectures

The pretrained EstrousNet is based on the ResNet-50 architecture, which yields the highest validation and test accuracy on the EstrousBank images, with a runtime of 2266 min on a Windows 10 Pro PC with Intel(R) Core(TM) i7-6700 CPU processor and 32 GB RAM. However, users can choose to train EstrousNet using VGG-19, MobileNet v2, or Inception v3 architectures, the connected layers of which have been prespecified in our code^31,32,33,34. VGG-19 is a network characterized by highly connected convolutional and fully connected layers which enable efficient feature extraction and use Maxpooling for downsampling, unlike the average pooling layers of ResNet-50³³. Compared to ResNet and VGG networks, Inception v3 uses auxiliary classifiers, asymmetric convolutions, and fewer overall parameters for high computational efficiency and low error rates³¹. Finally, MobileNet v2 is a lighter deep neural net that only uses a regular convolution on the first layer of an input image, designed for users with datasets that desire high accuracy with reduced parameters³².

In the standard ResNet-50 architecture, used here as the base architecture of EstrousNet, nonlinear skip connections and shortcuts are implemented to maintain high performance despite a deep architecture³⁴. The residual block on ResNet-50 is defined as follows:

$$ y = W_{s} x + F\left( {x, \left\{ {W_{i} } \right\}} \right) $$

where $x$ is input layer; $y$ is output layer; the function $F\left( {x, \left\{ {W_{i} } \right\}} \right)$ represents the residual mapping to be learned; and $W_{s}$ is the linear projection performed to match the dimensions of $x$ and $F$.

The architecture of ResNet-50 consists of 5 stages, each with a convolution and identity block made up of 3 convolution layers³⁴. The two initial layers accomplish convolution of size 7 × 7 and max-pooling of size 3 × 3 with a stride of 2³⁴. Input images are resized to 224 × 224 × 3 before undergoing augmentation and training. Training hyperparameters were specified using a Bayesian optimizer, which yielded highest accuracy with an initial learning rate of 1e^-5 and a mini batch size of 80. Several gradient descent optimization algorithms were tested, including RMSprop, adam, and sgdm, all designed to minimize the loss function of the network. RMSprop exceeded the other algorithms in terms of accuracy when combined with a squared gradient decay of 0.99. Due to the breadth of the input images only 3 epochs were necessary to maintain maximum accuracy, with shuffling occurring every epoch, as well as a piecewise learning rate drop factor of 0.1, the step decay algorithm of which is as follows:

$$ l_{r} = l_{r} 0*drop^{floor} \left( {\frac{epoch}{{epochs\_drop}}} \right) $$

where $l_{r}$ is learning rate; $l_{r} 0$ is initial learning rate (here 1e^-5); $drop$ is the factor by which the learning rate is decreased (here 0.1); $floor$ is the minimum learning rate; $epoch$ is the current epoch, and $epochs\_drop$ is the number of epochs after which the step decay will occur (here 1)⁵².

Cycle fitting

Here, a we designed a custom waveform describing the time course of the estrous cycle. The archetypical estrous cycle has a period of 4.8 days, calculated by averaging across temporal data reported in prior publications (Fig. 4A; Supplementary Table S2)^{10,18,20,21,22,23,30,35,36,37,38}. The stage classifications are ordered diestrus > metestrus in increments of 1.0 starting from 0.5, where 0.0 and 4.0 were defined as the transition stage between metestrus and diestrus (Fig. 4A; Supplementary Table S2). We fit these points with a two-term polynomial, calculating the coefficients using the temporal midpoints of each stage of the estrous cycle. The periodic waveform is fit to the input data for EstrousNet by shifting the phase by 0.1 cycles and selecting the phase shift with the maximum Pearson’s correlation coefficient (Fig. 4B).

Cycle fitting also allowed us to detect anestrous stages (i.e., pseudopregnancy), which are occasionally induced by cytology sampling methods such as vaginal swab and lavage. In our algorithm, the user will receive a pseudopregnancy warning message if the animal has been in diestrus 50% longer than in previous cycles, given that the user specifies sequential data sampling in the GUI (Fig. 4C-E). This characterization is consistent with our observation that more than 2 consecutive days of > 90% leukocytes is indicative of an anestrous state (Fig. 4D).

EstrousNet GUI

The EstrousNet GUI was developed in MATLAB 2020b (Mathworks, Inc.) using the App Designer platform. EstrousNet was trained using EstrousNetTrainNewNet.mlapp, classification input was given by EstrousNetGUI.mlapp, and classification output was plotted using EstrousNetPlotting.mlapp. The GUI is also used to tune augmentation parameters and number of stages desired for classification.

When determining stage classification, users of the EstrousNet GUI have the option to select either the raw EstrousNet classification, or a classification that considers the temporal location of the sample in a sequence of images. This decision is aided by our reporting of uncertainty in net predictions using a Confidence Index (CI), which is calculated as follows:

$$ CI = \frac{{P_{max} - P_{max - 1} }}{{P_{max} + P_{max - 1} }} $$

where P_max is the highest probability stage classification, and P_max-1 is the second-highest probability stage classification. If EstrousNet reports 100% probability that a given image falls into one category the CI will be 1, while if EstrousNet gives equal probability for the top two classifications, the CI will be 0. We have demonstrated that images falling within the standard four estrous stage classifications display a higher CI, while images from transition stages or of suboptimal quality display a lower CI (Supplementary Fig. S4).

Additionally, for cases in which the degree of classification certainty is low, the user will be given the option to choose a transition stage classification. This is accomplished using a “transition flag”, which is defined using the following criteria: (1) the EstrousNet and cycle timing classifications differ, (2) the CI falls below a threshold of 0.30. The suggested transition stage is defined as the two nearest classifications in the cycle timing linear regression, for instance “diestrus/proestrus”, or “proestrus/estrus”. If the user chooses to accept the suggested transition stage, the stage will be incorporated into the results structure.

Statistical information

To compare the accuracy of EstrousNet vs trained human examiners, a comparison set of 400 images was created by randomly selecting 100 images from each of the 4 estrous stages (Fig. 2D,E). This 400-image subset was separate from, but representative of, the larger EstrousBank training and validation image sets. Human examiners were expert coders who had each individually classified upwards of 2000 cytology images. EstrousNet was trained on the images in EstrousBank, as described previously, excluding the 400 images in the comparison set. Benchmark classifications, i.e. agreement between two or more expert classifiers, were used as a proxy for ground truth in the absence of intravenous hormone measurements, as described previously. Accuracy was determined by comparing these ground truth classifications to EstrousNet classifications. These comparisons are represented by a confusion matrix generated in MATLAB (Fig. 2D,E). The results of the classification quiz were not factored into the sorting of EstrousBank.

For statistical analysis, net accuracy and human accuracy vectors for each stage were concatenated and bootstrapped across 5000 iterations to create a normal distribution. Violin plots were made using an open-source MATLAB package⁵⁵. We performed the Fisher’s Exact Test within and across stages to test for significance (Fig. 2F).

For out-of-sample testing, three dimensions of sampling were used: stain, species, and subject. For stains and species, each respective category was removed from the training set and set aside for testing. EstrousNet was trained separately for each category on the revised datasets (Fig. 3C–E). It should be noted that multiple dimensions were nested in our framework, i.e., because each lab group used a different stain for their cytology images, removing any species also removed a set of stains. Accuracy was measured by taking the proportion of EstrousNet classifications that were consistent with benchmark classifications, run across 1000 iterations sampled with replacement to generate standard error. For out-of-sample subject testing 36 individual animals were chosen, including 20 WT Sprague Dawley rats and 16 Slc7a7-cre x TITL GCaMP6s B6 mice. k = 6 groups were used for k-fold out of sample cross-validation testing, with 6 subjects in each group. The resulting confusion matrix is an average of the k-fold accuracy results (Fig. 3E).

ROC curves were generated using the perfcurve MATLAB function to yeild a logistic regression, then the integral of each curve was taken to calculate the auROC for each estrous stage (Fig. 3A). For these curves, true positive was defined as an instance where a given positive stage was correctly classified, whereas false positive was defined as the number of negative stages falsely categorized into a given positive stage.

The sensitivity curve was calculated by finding the rate of images in a positive class, i.e., images belonging to a given stage, that were correctly classified as being in that stage (Fig. 3B). The specificity curve was calculated by finding the rate of images in a negative class, i.e., not part of a given stage, that were correctly classified as not belonging to that stage (Fig. 3B). The probability cutoff of 0.26 was defined as the intersection between these two curves (Fig. 3B). Pseudopregnancy cell count significance was determined by a two-way ANOVA (Fig. 4D). This study is reported in accordance with ARRIVE guidelines.

Data availability

All code necessary to run EstrousNet is available at http://github.com/ucsb-goard-lab/EstrousNet. EstrousBank is available in full on BioImage Archive (BIA) at https://www.ebi.ac.uk/biostudies/studies/S-BIAD545.

References

Woitowich, N. C., Beery, A. K. & Woodruff, T. K. A 10-year follow-up study of sex inclusion in the biological sciences. Elife 9, 1–8 (2020).
Article Google Scholar
Shansky, R. M. & Murphy, A. Z. Considering sex as a biological variable will require a global shift in science culture. Nat. Neurosci. 24, 457–464 (2021).
Article CAS PubMed Google Scholar
Pritschet, L. et al. Functional reorganization of brain networks across the human menstrual cycle. Neuroimage 220, 117091 (2020).
Article PubMed Google Scholar
Woolley, C. S. & McEwen, B. S. Roles of estradiol and progesterone in regulation of hippocampal dendritic spine density during the estrous cycle in the rat. J. Comp. Neurol. 336(2), 293–306 (1993).
Article CAS PubMed Google Scholar
Woolley, C. S. & McEwen, B. S. Estradiol mediates fluctuation in hippocampal synapse density during the estrous cycle in the adult rat. J. Neurosci. 12(7), 2549–2554 (1992).
Article CAS PubMed PubMed Central Google Scholar
Kim, J. & Frick, K. M. Distinct effects of estrogen receptor antagonism on object recognition and spatial memory consolidation in ovariectomized mice. Psychoneuroendocrinology 85, 110–114 (2017).
Article CAS PubMed Google Scholar
Galea, L. A. M., Perrot-Sinal, T. S., Kavaliers, M. & Ossenkopp, K. P. Relations of hippocampal volume and dentate gyrus width to gonadal hormone levels in male and female meadow voles. Brain Res. 821(2), 383–391 (1999).
Article CAS PubMed Google Scholar
Hara, Y., Waters, E. M., McEwen, B. S. & Morrison, J. H. Estrogen effects on cognitive and synaptic health over the lifecourse. Physiol. Rev. 95, 785 (2015).
Article CAS PubMed PubMed Central Google Scholar
Frick, K. M., Kim, J., Tuscher, J. J. & Fortress, A. M. Sex steroid hormones matter for learning and memory: Estrogenic regulation of hippocampal function in male and female rodents. Learn. Mem. 22, 472–493 (2015).
Article CAS PubMed PubMed Central Google Scholar
Byers, S. L., Wiles, M. V., Dunn, S. L. & Taft, R. A. Mouse estrous cycle identification tool and images. PLoS ONE 7, e35538 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Long, J. A. & Evans, H. M. The Oestrous Cycle in the Rat and its Associated Phenomena. (University of California Press, 1922).
Sano, K. et al. Deep learning-based classification of the mouse estrous cycle stages. Sci. Rep. 10, 1–8 (2020).
Article ADS Google Scholar
Iqbal, J. et al. Estradiol alters hippocampal gene expression during the estrous cycle. Endocr. Res. 45, 84–101 (2020).
Article CAS PubMed Google Scholar
Vastagh, C. & Liposits, Z. Impact of proestrus on gene expression in the medial preoptic area of mice. Front. Cell. Neurosci. 11, 183 (2017).
Article PubMed PubMed Central Google Scholar
Woolley, C. S., Gould, E., Frankfurt, M. & McEwen, B. S. Naturally occurring fluctuation in dendritic spine density on adult hippocampal pyramidal neurons. J. Neurosci. 10, 4035–4039 (1990).
Article CAS PubMed PubMed Central Google Scholar
Kashuba, A. D. M. & Nafziger, A. N. Physiological changes during the menstrual cycle and their effects on the pharmacokinetics and pharmacodynamics of drugs. Clin. Pharm. 34, 203–218 (2012).
Article Google Scholar
Gong, S. et al. Dynamics and correlation of serum cortisol and corticosterone under different physiological or stressful conditions in mice. PLoS ONE 10, e0117503 (2015).
Article PubMed PubMed Central Google Scholar
Haim, S., Shakhar, G., Rossene, E., Taylor, A. N. & Ben-Eliyahu, S. Serum levels of sex hormones and corticosterone throughout 4- and 5-day estrous cycles in Fischer 344 rats and their simulation in ovariectomized females. J. Endocrinol. Investig. 26, 1013–1022 (2014).
Article Google Scholar
Westwood, F. R. The female rat reproductive cycle: A practical histological guide to staging. Toxicol. Pathol. 36, 375–384 (2008).
Article PubMed Google Scholar
Ajayi, A. F. & Akhigbe, R. E. Staging of the estrous cycle and induction of estrus in experimental rodents: an update. Fertil. Res. Pract. 6, 1–15 (2020).
Article Google Scholar
Cora, M. C., Kooistra, L. & Travlos, G. Vaginal cytology of the laboratory rat and mouse: Review and criteria for the staging of the estrous cycle using stained vaginal smears. Toxicol. Pathol. 43, 776–793 (2015).
Article CAS PubMed Google Scholar
Goldman, J. M., Murr, A. S. & Cooper, R. L. The rodent estrous cycle: Characterization of vaginal cytology and its utility in toxicological studies. Birth Defects Res. B 80, 84–97 (2007).
Article CAS Google Scholar
Paccola, C. C. et al. The rat estrous cycle revisited: A quantitative and qualitative analysis. Anim. Reprod. 10, 677–683 (2018).
Google Scholar
de Fauw, J. et al. Clinically applicable deep learning for diagnosis and referral in retinal disease. Nat. Med. 24, 1342–1350 (2018).
Article PubMed Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Gurovich, Y. et al. Identifying facial phenotypes of genetic disorders using deep learning. Nat. Med. 25, 60–64 (2019).
Article CAS PubMed Google Scholar
Shen, D., Wu, G. & Suk, H. Deep learning in medical image analysis. Annu. Rev. Biomed. Eng. 19, 221–248 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hu, J. et al. Iterative transfer learning with neural network for clustering and cell type classification in single-cell RNA-seq analysis. Nat. Mach. Intell. 2, 607–618 (2020).
Article PubMed PubMed Central Google Scholar
Yao, K., Rochman, N. D. & Sun, S. X. Cell type classification and unsupervised morphological phenotyping from low-resolution images using deep learning. Sci. Rep. 9, 1–13 (2019).
Article ADS Google Scholar
Pantier, L., Li, J. & Christian, C. Estrous cycle monitoring in mice with rapid data visualization and analysis. Bio-Protoc. 9(17), e3354–e3354 (2019).
CAS PubMed PubMed Central Google Scholar
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z. Rethinking the inception architecture for computer vision. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. 2818–2826 (2015).
Sandler, M., Howard, A., Zhu, M., Zhmoginov, A. & Chen, L. C. MobileNetV2: Inverted residuals and linear bottlenecks. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition 4510–4520 (2018).
Simonyan, K. & Zisserman, A. Very deep convolutional networks for large-scale image recognition. in 3rd International Conference on Learning Representations, ICLR 2015 - Conference Track Proceedings (2014).
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 770–778 (2015).
Yoshinaka, K. et al. Effect of different light–dark schedules on estrous cycle in mice, and implications for mitigating the adverse impact of night work. Genes Cells 22, 876–884 (2017).
Article CAS PubMed Google Scholar
van Goethem, N. P. et al. Object recognition testing: Rodent species, strains, housing conditions, and estrous cycle. Behav. Brain Res. 232, 323–334 (2012).
Article PubMed Google Scholar
Caligioni, C. S. Assessing reproductive status/stages in mice. Curr. Protoc. Neurosci. 48, A.4I.1-A.4I.8 (2009).
Article Google Scholar
Spencer, J. L., Waters, E. M., Milner, T. A. & McEwen, B. S. Estrous cycle regulates activation of hippocampal Akt, LIM kinase, and neurotrophin receptors in C57BL/6 mice. Neuroscience 155, 1106–1119 (2008).
Article CAS PubMed Google Scholar
Kiyonari, H. et al. Targeted gene disruption in a marsupial, Monodelphis domestica, by CRISPR/Cas9 genome editing. Curr. Biol. 31, 3956-3963.e4 (2021).
Article CAS PubMed Google Scholar
Shorr, E. A new technic for staining vaginal smears: III, a single differential stain. Science 94, 545–546 (1941).
Article CAS PubMed ADS Google Scholar
McLean, A. C., Valenzuela, N., Fai, S. & Bennett, S. A. L. Performing vaginal lavage, crystal violet staining, and vaginal cytological evaluation for mouse estrous cycle staging identification. JoVE J. Vis. Exp. 67, e4389 (2012).
Google Scholar
Singletary, S. J. et al. Lack of correlation of vaginal impedance measurements with hormone levels in the rat. Contemp. Top. Lab. Anim. Sci./Am. Assoc. Lab. Anim. Sci. 44, 37 (2005).
CAS Google Scholar
Skenandore, C. S., Pineda, A., Bahr, J. M., Newell-Fugate, A. E. & Cardoso, F. C. Evaluation of a commercially available radioimmunoassay and enzyme immunoassay for the analysis of progesterone and estradiol and the comparison of two extraction efficiency methods. Domest. Anim. Endocrinol. 60, 61–66 (2017).
Article CAS PubMed Google Scholar
Jiménez, G. C. et al. Aptamer-based label-free impedimetric biosensor for detection of progesterone. Anal. Chem. 87(2), 1075–1082 (2015).
Article Google Scholar
Nameghi, M. A. et al. An ultrasensitive electrochemical sensor for 17β-estradiol using split aptamers. Anal. Chim. Acta 1065, 107–112 (2019).
Article CAS PubMed Google Scholar
De, S., Macara, I. G. & Lannigan, D. A. Novel biosensors for the detection of estrogen receptor ligands. J. Steroid Biochem. Mol. Biol. 96(3–4), 235–244 (2005).
Article CAS PubMed Google Scholar
Jia, Y. et al. Magnetic nanoparticle enhanced surface plasmon resonance sensor for estradiol analysis. Sens. Actuators B Chem. 254, 629–635 (2018).
Article CAS Google Scholar
Kent, S., Hurd, M. & Satinoff, E. Interactions between body temperature and wheel running over the estrous cycle in rats. Physiol. Behav. 49, 1079–1084 (1991).
Article CAS PubMed Google Scholar
Takezawa, H., Hayashi, H., Sano, H., Saito, H. & Ebihara, S. Circadian and estrous cycle-dependent variations in blood pressure and heart rate in female rats. Am. J. Physiology-Regul. Integr. Comp. Physiol. 267(5), R1250–R1256 (1994).
Article CAS Google Scholar
Mitchell, J. A. Y. J. Intrauterine oxygen tension during the estrous cycle in the rat: its relation to uterine respiration and vascular activity. Endocrinology 83, 701–705 (1968).
Article CAS PubMed Google Scholar
Gronroos, M. & Kauppila, O. Hormonal-cyclic changes in rats under normal conditions and under stress as revealed by vaginal smears after Shorr staining. Acta Endocrinol. 32(II), 261–271 (1959).
Article CAS Google Scholar
Rong, G., Kakade, S., Kidambi, R. & Netrapalli, P. The step decay schedule: A near optimal, geometrically decaying learning rate procedure for least squares. arXiv. https://doi.org/10.48550/arXiv.1904.12838 (2019).
Article Google Scholar
Greenwald, N., Miller, G. & Valen, D. Whole-cell segmentation of tissue images with human-level performance using large-scale data annotation and deep learning. Nat. Biotechnol. 40, 555–565 (2022).
Article CAS PubMed Google Scholar
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. Cellpose: a generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106 (2021).
Article CAS PubMed Google Scholar
Bechtold, B. Violin Plots for Matlab, Github Project. (2016).

Download references

Acknowledgements

We would like to thank Dr. Nina Miolane and Dr. Emily Jacobs for comments on this manuscript, as well as Dr. Chiro Sutoh for contributing data to the EstrousBank. We would like to thank William Castagna, Marie Karpinska, and Emily Youngblood for assistance collecting cytology samples. This work was supported by the Larry Hillblom foundation (M.J.G.).

Author information

Authors and Affiliations

Department of Molecular, Cellular, and Developmental Biology, University of California, Santa Barbara, Santa Barbara, CA, 93106, USA
Nora S. Wolcott & Michael J. Goard
Department of Psychological and Brain Sciences, University of California, Santa Barbara, Santa Barbara, CA, 93106, USA
Kevin K. Sit & Michael J. Goard
Department of Physiology and Neurobiology, University of Connecticut, Storrs, CA, 06269, USA
Gianna Raimondi & Linnaea E. Ostroff
Department of Psychology, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Travis Hodges & Liisa A. M. Galea
Department of Psychology, Northeastern University, Boston, MA, 02115, USA
Rebecca M. Shansky
Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, BC, V6T 1Z3, Canada
Liisa A. M. Galea
Neuroscience Research Institute, University of California, Santa Barbara, Santa Barbara, CA, 93106, USA
Michael J. Goard
Department of Psychology & Education, Mount Holyoke College, South Hadley, MA, 01075, USA
Travis Hodges

Authors

Nora S. Wolcott
View author publications
You can also search for this author in PubMed Google Scholar
Kevin K. Sit
View author publications
You can also search for this author in PubMed Google Scholar
Gianna Raimondi
View author publications
You can also search for this author in PubMed Google Scholar
Travis Hodges
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca M. Shansky
View author publications
You can also search for this author in PubMed Google Scholar
Liisa A. M. Galea
View author publications
You can also search for this author in PubMed Google Scholar
Linnaea E. Ostroff
View author publications
You can also search for this author in PubMed Google Scholar
Michael J. Goard
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.S.W. developed EstrousNet; N.S.W. and K.K.S. analyzed EstrousNet performance; N.S.W. and K.K.S. developed the EstrousNet GUI; G.R., T.H., and N.S.W. classified test images; G.R., T.H., R.M.S, L.A.M.G., L.O., and M.J.G. contributed to EstrousBank curation; N.S.W. and M.J.G. wrote the manuscript; all authors reviewed the manuscript.

Corresponding author

Correspondence to Michael J. Goard.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wolcott, N.S., Sit, K.K., Raimondi, G. et al. Automated classification of estrous stage in rodents using deep learning. Sci Rep 12, 17685 (2022). https://doi.org/10.1038/s41598-022-22392-w

Download citation

Received: 17 March 2022
Accepted: 13 October 2022
Published: 21 October 2022
DOI: https://doi.org/10.1038/s41598-022-22392-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.