EmbryoNet: using deep learning to link embryonic phenotypes to signaling pathways

Čapek, Daniel; Safroshkin, Matvey; Morales-Navarrete, Hernán; Toulany, Nikan; Arutyunov, Grigory; Kurzbach, Anica; Bihler, Johanna; Hagauer, Julia; Kick, Sebastian; Jones, Felicity; Jordan, Ben; Müller, Patrick

doi:10.1038/s41592-023-01873-4

Download PDF

Resource
Open access
Published: 08 May 2023

EmbryoNet: using deep learning to link embryonic phenotypes to signaling pathways

Nature Methods volume 20, pages 815–823 (2023)Cite this article

11k Accesses
7 Citations
64 Altmetric
Metrics details

Subjects

Abstract

Evolutionarily conserved signaling pathways are essential for early embryogenesis, and reducing or abolishing their activity leads to characteristic developmental defects. Classification of phenotypic defects can identify the underlying signaling mechanisms, but this requires expert knowledge and the classification schemes have not been standardized. Here we use a machine learning approach for automated phenotyping to train a deep convolutional neural network, EmbryoNet, to accurately identify zebrafish signaling mutants in an unbiased manner. Combined with a model of time-dependent developmental trajectories, this approach identifies and classifies with high precision phenotypic defects caused by loss of function of the seven major signaling pathways relevant for vertebrate development. Our classification algorithms have wide applications in developmental biology and robustly identify signaling defects in evolutionarily distant species. Furthermore, using automated phenotyping in high-throughput drug screens, we show that EmbryoNet can resolve the mechanism of action of pharmaceutical substances. As part of this work, we freely provide more than 2 million images that were used to train and test EmbryoNet.

Deep learning of cross-species single-cell landscapes identifies conserved regulatory programs underlying cell types

Article 13 October 2022

Jiaqi Li, Jingjing Wang, … Guoji Guo

Machine learning methods to model multicellular complexity and tissue specificity

Article 15 July 2021

Rachel S. G. Sealfon, Aaron K. Wong & Olga G. Troyanskaya

Uncovering developmental time and tempo using deep learning

Article Open access 23 November 2023

Nikan Toulany, Hernán Morales-Navarrete, … Patrick Müller

Main

Early development is governed by a handful of signaling pathways that balance tissue growth, differentiation and morphogenesis^1,2,3. Given their important roles in controlling cell identity and behavior, misregulation of signaling pathways in adult tissues can induce the formation of tumors with embryo-like properties, defective cell proliferation and migration^4,5.

During zebrafish development, seven major signaling pathways orchestrate the formation of the body plan. Bone morphogenetic protein (BMP), retinoic acid (RA), Wnt, fibroblast growth factor (FGF) and Nodal pattern the germ layers and regulate the formation of the orthogonal anterior–posterior and dorsal–ventral axes; Sonic hedgehog (Shh) and planar cell polarity (PCP) signaling, in turn, control the elongation and morphogenesis of the body axis and later shape the formation of tissues^2,3,6,7,8. The ligands activating these signaling pathways are dynamically expressed from specific source tissues in the embryo (Fig. 1a and Supplementary Note 1). Loss of activity in any of these pathways causes characteristic patterning defects, which, however, can be difficult to distinguish (Fig. 1b and Supplementary Videos 1–8). For example, both Nodal and Shh mutants have cyclopic eyes (Fig. 1b and Supplementary Videos 6 and 7), but the defect in Nodal mutants is caused by an early lack of mesoderm⁹, whereas cyclopia in Shh mutants is caused by a late defect in midline patterning¹⁰. Furthermore, while misregulation of the BMP, Wnt, RA, FGF and PCP signaling pathways leads to specific defects, for example, an enlarged head in the case of Wnt mutants^11,12, all of these mutants also have malformed shortened tails^{13,14,15,16,17} (Fig. 1b and Supplementary Videos 2–5, 8). Thus, the phenotypes caused by changes in the activity of different signaling pathways can be easily confused by even the most experienced developmental biologists. Automated and unbiased phenotyping based on a multitude of morphological features would overcome this challenge. Such an approach would rapidly link phenotypes arising from genetic defects, mutants identified in forward and reverse genetic screens, or treatment with small-molecule inhibitors to the relevant signaling pathway. Automated phenotyping of morphological defects would thus enhance both the speed and accuracy of biological and pharmaceutical discovery.

**Fig. 1: The CNN EmbryoNet robustly identifies molecular defects based on phenotype data.**

Advances in deep learning approaches¹⁸ have brought unprecedented breakthroughs in numerous fields ranging from bioimage analysis and visual object recognition¹⁹ to protein structure prediction^20,21,22,23 and earth system science²⁴. Deep learning approaches perform exceptionally well in decoding the content of images^25,26, and convolutional neural networks (CNNs) in particular have been extensively used for bioimage restoration²⁷, cell detection and classification²⁸ and bioimage data segmentation²⁹. Recent studies have also used machine learning approaches to examine embryonic phenotypes^{30,31,32,33,34,35,36}, but these approaches were limited to staging, segmentation and classification of specific embryos and organs without being able to uncover the molecular basis of morphological alterations.

Here, we introduce a deep learning approach, EmbryoNet, that can detect specific defects linked to the seven major vertebrate signaling pathways by automated phenotyping. EmbryoNet was trained on more than 2 million images, comprising thousands of trajectories of normally developing and signaling-defective zebrafish embryos. We found that EmbryoNet identified phenotypes more precisely and often long before human evaluators could detect them. By using the accelerated phenotype classification of EmbryoNet in an automated drug screen, we identified novel teratogenic effects caused by Food and Drug Administration (FDA)-approved substances not previously associated with the regulation of developmental signaling pathways. Finally, we show that EmbryoNet can identify signaling defects in evolutionarily distant species, demonstrating the generalizability of our approach.

Results

Identification of signaling defects in zebrafish embryos

To test whether deep learning approaches can be used for the automatic classification of complex phenotypes caused by the loss of signaling pathways in zebrafish, we combined high-throughput imaging with specific drug-mediated loss-of-function approaches. We started with proof-of-concept experiments focused on Nodal signaling because both the signaling pathway constituents and their functions during the first day of zebrafish embryogenesis are well described³⁷ (Supplementary Note 1 and Fig. 1a). In addition, a specific small molecule targeting the ATP-binding pocket of the receptor kinase is available³⁸ (Supplementary Note 2), facilitating the rapid acquisition of defined developmental phenotypes from a large number of embryos. Indeed, the inhibitor SB-505124 clearly recapitulated the known loss-of-function phenotypes of Nodal signaling pathway components³⁹, that is, cyclopia and loss of all endoderm and head–trunk mesoderm (Fig. 1a–c and Supplementary Video 6). We then acquired bright-field movies of SB-505124-treated and untreated embryos in random orientations, comprising a total of 342,559 images between 2 and 26 hours post-fertilization (h.p.f.). A modified version of the ResNet18 CNN that includes a timestamp of the images (Fig. 1e,f and Methods)⁴⁰ trained with these datasets robustly and correctly identified normal and Nodal-defective embryos, independent of their orientation and whether small molecules (SB-505124) or mutants (maternal zygotic oep mutants, MZoep⁴¹) were used to create Nodal loss-of-function phenotypes (Fig. 1c,d and Extended Data Fig. 1).

We next extended this approach to the seven major signaling pathways that control early development: BMP, RA, Wnt, FGF, Nodal, Shh and PCP (Fig. 1a,b). Using a chemical genetics approach with specific signaling pathway modulators^{38,42,43,44,45} (Supplementary Table 1 and Supplementary Note 2), we created a dataset of more than 2 million images with loss-of-function (or gain-of-function in the case of RA) phenotypes (Supplementary Videos 1–8). The dataset was manually annotated by curators who were informed about the treatment of each embryo. The curators assigned classes appropriate for each treatment (that is, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh and −PCP) at the developmental timepoint when the phenotype first became apparent for a given embryo. The class Unknown was assigned when an image did not contain sufficient information for classification, and the class Dead was assigned if an embryo disintegrated over the course of development. In addition, each image was assigned a timestamp for classification (Fig. 1f). This high-confidence dataset was then used to train a large-scale CNN with accelerated graphics processing unit computing (Methods and Supplementary Tables 2 and 3).

To correct for potential classification errors, we introduced a model transition logic based on our knowledge of developmental changes: in very early embryos, phenotypic differences are not yet apparent because signaling changes result in morphological changes only at later stages^1,2,3. These early embryos, characterized by the phenotype class Unknown, can then transition to another phenotype class (−BMP, +RA, −Wnt, −FGF, −Nodal, −Shh and −PCP) and can also change to Dead at any point in time (Extended Data Fig. 1c). However, certain transitions, for example, from Dead to Normal, are not possible. We therefore assigned a cost to every state transition in an individual embryo track and scored the cost for different models. The transition sequence that achieved the lowest cost was selected for classification. This approach yielded a classification performance of 89%. The deep learning-based classification network, termed EmbryoNet, was able to robustly identify the loss-of-function phenotypes caused by orthogonal approaches such as the injection of messenger RNAs (mRNAs) encoding the Nodal and BMP pathway inhibitors Lefty1 and Chordin, respectively (Fig. 1g–i, Supplementary Note 2 and Extended Data Fig. 1f,g). EmbryoNet’s algorithms for the detection, tracking, manual and automatic classification of embryos are available as easy-to-use, modular and open-source graphical user interface (GUI) software (Extended Data Fig. 1e; http://github.com/mueller-lab/EmbryoNet and http://embryonet.uni-konstanz.de).

EmbryoNet is proficient, fast and accurate

To evaluate EmbryoNet’s performance, we tested its classification speed and accuracy in competition with human assessors. We generated stacks of 98 embryo images, representing the full spectrum of our phenotype classes. These images had not been used previously for the training of EmbryoNet, and information about the specific treatment of each embryo was not disclosed to the assessors.

Random guessing resulted in an accuracy of 9% (F-score = 0.09; Fig. 2a and Supplementary Table 4). The images were then classified by non-experts. These 55 teams, each consisting of two assessors with a biology background, received 1 day of developmental biology training with a focus on developmental defects caused by modulated signaling in zebrafish (Supplementary Videos 1–8, Fig. 2b and Extended Data Figs. 2 and 3). We encouraged the assessors to discuss the phenotypes to make the best classification choice. On average, non-experts confidently identified the class Dead but identified signaling defects with an overall accuracy of only 53% (F-score = 0.52; Fig. 2b, Extended Data Fig. 2 and Supplementary Tables 5 and 6), even when temporal information about the developmental stage was provided (accuracy of 54%, F-score = 0.52; Fig. 2c, Extended Data Fig. 3 and Supplementary Tables 5 and 7). The images were next classified by an expert assessor, an experienced developmental biologist with several years of relevant background in early zebrafish embryogenesis. The expert confidently identified embryonic phenotypes across classes with an overall accuracy of 79% (F-score = 0.78; Fig. 2d and Supplementary Tables 5 and 8). Strikingly, EmbryoNet outperformed both expert and non-expert human assessors on these images: it completed the task in a few seconds and with an overall accuracy of 91% (F-score = 0.90; Fig. 2e and Supplementary Tables 5 and 9), comparable to the performance across the entire validation dataset (see above).

**Fig. 2: Classification of 98 single embryo images by non-expert teams, experienced researchers and EmbryoNet.**

To test whether context-dependent information could improve human classification performance, we asked two human experts to classify additional time-series experiments. Information about the specific treatment for each embryo was not disclosed to the assessors, but they were aware that all embryos in one video received the same treatment. Human classification performance slightly increased to an overall accuracy of 83% (F-score = 0.84). EmbryoNet still outperformed the human experts with an accuracy of 90% (F-score = 0.90), especially for the classification of the most difficult and subtle phenotypes (−Shh and −PCP) with F-scores of 0.54 (−PCP) and 0.72 (−Shh) compared with the human F-scores of 0.04 and 0.36, respectively (Extended Data Fig. 4a–d and Supplementary Tables 10–15). In addition, EmbryoNet accurately identified phenotypes that were not fully penetrant, such as weaker BMP¹⁴ and Nodal defects⁴⁶ (Extended Data Fig. 5 and Supplementary Tables 16–19).

Given EmbryoNet’s performance in identifying subtle phenotypes, we hypothesized that we could leverage artificial intelligence to detect very early embryonic defects before they would be recognized by human experts. We therefore retrained EmbryoNet by moving the relevant developmental timepoint corresponding to each treatment class to 4 hours earlier, before the phenotype was obvious to a human annotator (Fig. 2f). Strikingly, the resulting network, EmbryoNet-Prime, was able to identify Nodal loss-of-function phenotypes at the beginning of gastrulation, several hours before human annotators could confidently recognize them (Extended Data Fig. 6), with an accuracy of 90% (F-score = 0.93; Fig. 2h–j, Extended Data Fig. 4e and Supplementary Tables 10 and 20–24). Similarly, the network detected the −BMP, +RA, −Wnt, −Shh and −PCP phenotypes on average 2–3 hours earlier (Fig. 2g and Extended Data Fig. 6), consistent with the known expression and activation profiles of the signaling molecules (Supplementary Note 1).

EmbryoNet recognizes known and latent defect features

What could be the features that are detected by EmbryoNet-Prime, which facilitate earlier classification compared with human assessors? To address this question, we leveraged class activation map (CAM) visualization⁴⁷, which can be used to perform object localization without additional annotation by projecting the probability of the trained classes onto an input image. The resulting CAM should show the discriminative image regions used by the CNN to identify a class: positively activated domains should highlight image regions that support a particular class, whereas negative domains should show regions that oppose that class (Fig. 3 and Supplementary Note 3).

**Fig. 3: Embryo features activating the neural network.**

To evaluate the utility of CAM visualization, we first examined steep and sudden switches in classification. For example, BMP-inhibited embryos frequently disintegrate (Supplementary Video 2), switching from −BMP to Dead in terms of classification. Indeed, this classification switch can be observed in EmbryoNet-Prime’s CAMs. Once −BMP embryos disintegrate, their CAMs in the −BMP channel immediately show negative activation accompanied by a positive activation in the CAMs for the Dead class (Supplementary Videos 11 and 12). These results indicate that CAM visualization can provide insight into the logic of phenotype classification.

Using this approach, we found that EmbryoNet-Prime identified known defects caused by the disruption of signaling pathways, but also detected latent features at an earlier developmental stage compared with human assessors. For example, Wnt mutants are known to exhibit prominent tail bud and head defects at 24 h.p.f.^11,12. EmbryoNet-Prime was indeed activated in these regions at late stages (Fig. 3i). Strikingly, during early segmentation the whole body-axis already showed activation (Supplementary Videos 15 and 16), and the detection of head and tail defects also occurred as early as the bud stage (Fig. 3d, Extended Data Fig. 6 and Supplementary Videos 15 and 16). Thus, −Wnt embryos were detected earlier by EmbryoNet-Prime than by human assessors (Fig. 2g and Extended Data Fig. 6). Similarly, late-stage classification of −Nodal embryos by EmbryoNet-Prime relied on well-known defects in the ectodermal thickening (Fig. 3j), head, tail and trunk regions (Fig. 3k). Surprisingly, however, EmbryoNet-Prime was also able to classify early-stage −Nodal embryos (~6 h.p.f.; Supplementary Video 19) based on latent defects. The detection started with activation at the margin (Fig. 3f and Supplementary Videos 19 and 20) and continued with activation spots at the border between yolk and blastoderm, directly adjacent to the embryo proper. Although this fits well with known regions of Nodal expression and activity^37,48,49, it will be interesting to determine how these molecular signatures manifest in latent cellular and morphological features.

EmbryoNet can identify novel signaling modulators

High-content image-based drug screens can be used to identify novel compounds affecting cellular phenotypes. However, large-scale drug screens assessing whole phenomes with rich biological information⁵⁰ are currently hampered by the need to visually assess a very large number of images, and are further complicated by the potential ambiguity of defects and variability between assessors. To determine whether EmbryoNet could be used to link chemical manipulations to signaling-based phenotypic defects, we performed a large-scale zebrafish screen using FDA-approved and bioactive compounds (Fig. 4a).

**Fig. 4: Applications of EmbryoNet in drug screening and other species.**

We screened approximately 1,000 compounds using 96-well plates containing four to five zebrafish embryos per well. The screen spanned 2–26 h.p.f. We first tested well-known viability modulators with characterized mechanisms of action such as aphidicolin, bafilomycin A1, blebbistatin, brefeldin A, cycloheximide, cytochalasin B, latrunculin B, staurosporin, trichostatin A, tunicamycin and vinblastine. EmbryoNet reliably classified embryos treated with these substances as Dead, while classifying mock-treated embryos as Normal (Supplementary Tables 25–35, Extended Data Figs. 7 and 8 and Supplementary Videos 25–35). EmbryoNet also correctly detected known modulators of signaling pathways, such as the RA agonists all-trans-retinoic acid and TTNPB (Extended Data Fig. 7 and Supplementary Table 26).

Importantly, for some small molecules we identified previously unrecognized effects on signaling pathways in embryos. This group includes several hydroxymethylglutaryl-coenzyme A reductase inhibitors, a class of compounds also known as statins^51,52,53,54. Interestingly, embryos treated with several statins, including simvastatin, atorvastatin and lovastatin, were classified as −FGF by EmbryoNet (Fig. 4b, Supplementary Videos 36–38 and Supplementary Tables 29, 32 and 35). Consistent with the −FGF classification, embryos treated with these drugs showed defects in dorsal–ventral patterning^51,52 and had loss of posterior tissues typical of −FGF embryos^13,55,56,57 (Fig. 4c). Strikingly, the activity of the FGF signal transducer pErk was also reduced in statin-treated compared with untreated embryos (Fig. 4d,e), possibly due to dampened isoprenylation of the upstream small GTPase Ras⁵⁸. According to patient information regarding side-effects and databases of potentially embryotoxic teratogenic therapeutics, the intake of selected hydroxymethylglutaryl-coenzyme A reductase inhibitors such as lovastatin is not recommended during pregnancy and lactation (Supplementary Note 4). Notably, simvastatin is recommended as a substitute, although EmbryoNet detected the same defects in response to this drug. However, the bioavailability in zebrafish compared with human cells is currently unclear.

In summary, our drug screen shows that EmbryoNet can be used to identify teratogenic effects caused by bioactive compounds and to associate them with signaling pathways.

Generalization of EmbryoNet to other species

To test the generalizability of our approach, we next applied EmbryoNet to identify signaling defects in the evolutionarily distant species medaka (Oryzias latipes) and three-spined stickleback (Gasterosteus aculeatus). These fish diverged from zebrafish hundreds of millions of years ago^59,60. We adjusted the imaging length of our recordings to match the slower developmental speed of both species^61,62 and modified species-specific parameters such as temperature, number of embryos per well and drug concentration as needed.

We found that in both medaka and stickleback, wild-type animals had well-formed somites (Fig. 4f,g, black arrows) and eyes (Supplementary Videos 39 and 41), while Nodal-inhibited embryos showed a loss of somites (Fig. 4f,g, red arrows) concomitant with severe central nervous system defects and frequent cyclopia (Fig. 4f, red arrowhead; Supplementary Videos 40 and 42). After training with these datasets, EmbryoNet robustly identified wild-type and Nodal-inhibited individuals in both species (Fig. 4f,g). These results support the broad applicability of EmbryoNet in identifying signaling-based complex phenotypic defects in different species.

Discussion

Phenome refers to the entire set of phenotypes of an organism over time, and phenomics has emerged as a promising approach for connecting these phenotypes with the underlying genotypes and environmental influences⁵⁰. A quantitative understanding of how the phenome changes in response to genetic mutations and environmental stimuli would be highly informative, but phenomics requires the processing of large amounts of high-dimensional data^{36,63,64,65,66}. Computer vision and machine learning techniques are therefore promising approaches for advancing this field and indeed are increasingly being applied in plant and crop phenomics⁶⁷. Here, we present a machine learning-assisted method for the robust phenomic analysis of developmental defects during vertebrate embryogenesis.

The automated phenotyping tool that we developed, EmbryoNet, is based on CNNs. Strikingly, EmbryoNet outcompeted human assessors in terms of speed, accuracy and sensitivity. Assessing zebrafish embryos, EmbryoNet was able to quickly and accurately link phenotypes to major signaling pathways, including classifying incompletely penetrant phenotypes. We were also able to retrain EmbryoNet for assessing other fish species separated from zebrafish by hundreds of millions of years in evolution, enabling the analysis of high-dimensional phenomic data in different taxa. EmbryoNet may thus be able to accelerate the characterization of developmental mutants in multiple species. Finally, in a proof-of-concept drug screen with two drug libraries, we showed that EmbryoNet correctly associated compounds with signaling functions. We therefore believe that this approach can be used to understand the signaling effects of various compounds and medications, thus opening up the possibility of applying drugs to new therapeutic contexts and applications.

While EmbryoNet offers significant advantages in identifying phenotypes at earlier developmental stages, there are some caveats and weaknesses to consider. It remains uncertain whether EmbryoNet can outperform humans in detecting very mild phenotypes, such as those caused by low drug concentrations. Additionally, its reliance on a library of manual annotations limits its ability to classify novel phenotypes, particularly those arising from the combinatorial disruption of signaling pathways. The rapid development of deep learning technologies could be leveraged to enhance EmbryoNet’s capabilities and help address EmbryoNet’s current limitations. By building on these technological breakthroughs, it may become possible in the future to bridge the genotype–phenotype gap and tackle the long-standing question of how diverse body plans are genetically encoded⁶⁸.

We provide EmbryoNet as open-source software, with Python packages, a GitHub repository and GUIs for labeling data and phenotype classification (http://github.com/mueller-lab/EmbryoNet). We also provide the training, testing and the drug screen imaging data as a resource to the community (http://embryonet.uni-konstanz.de, http://github.com/mueller-lab/EmbryoNet). Due to its modular open-source nature, EmbryoNet can be easily adapted to a variety of purposes, including embryos of other species and organoids, in which automated phenotyping will expedite biological and pharmaceutical discovery.

Methods

Embryo preparation

The experiments were performed exclusively with embryos and larvae that were not yet freely feeding. All procedures and zebrafish, medaka and stickleback husbandry were carried out in accordance with the guidelines of the European Union directive 2010/63/EU and the German Animal Welfare Act as approved by the local authorities represented by the Regierungspräsidium Tübingen and the Regierungspräsidium Freiburg (Baden-Württemberg, Germany).

Zebrafish embryos of the TE strain were collected from batch crosses within 1 h after fertilization. Fertilized embryos were manually selected using a glass Pasteur pipette. At this timepoint, zebrafish embryos were between the 2- and 8-cell stages. A total of 10–20 embryos were pipetted into each well of a 24- or 48-well plate in 1 ml zebrafish embryo medium⁴⁸. Small-molecule signaling pathway agonists and antagonists were added by pipetting them into the filled well with the final concentrations listed in Supplementary Table 1. To obtain PCP phenotypes, 1 ng vangl2-targeting morpholino⁶⁹ was injected at the one-cell stage. Shh treatment was carried out either by cyclopamine incubation or gli3R-GFP mRNA injection. For overexpression of signaling antagonists, 10 pg lefty1 mRNA⁴⁸ together with 0.1 ng of 10 kDa Alexa647-coupled dextran (Invitrogen D22914) or 75 pg chordin mRNA⁷⁰ together with 0.1 ng of 10 kDa Alexa488-coupled dextran (Invitrogen D22910) were injected into one-cell stage wild-type zebrafish embryos. To validate phenotypes, swirl homozygous mutants⁷⁰ and maternal zygotic homozygous oep mutants⁴¹ were used. All zebrafish embryos were between 0 h.p.f. and 48 h.p.f.

Medaka eggs of the Cab strain were collected from standard crosses and separated with forceps. They were incubated at 28 °C in medaka embryo medium (17 mM NaCl, 0.4 mM KCl, 0.27 mM CaCl₂, 0.65 mM MgSO₄) and distributed into 24-well plates. Small-molecule signaling pathway antagonists were added at the early blastula stage to a concentration of 7.5 µM. Embryos were imaged from 8 h.p.f. until 45 h.p.f. with intervals of 5 min at 28 °C. Approximately 500 Medaka embryos between 0 h.p.f. and 48 h.p.f. were used for training and testing.

Stickleback embryos of wild-derived marine strains from Little Campbell River (Canada) and Tyne River (Scotland) were obtained by in vitro fertilization and incubated until 20 h.p.f. at 16 °C in stickleback embryo medium (3.5 g l⁻¹ instant ocean salt in reverse osmosis water). The eggs were separated using brushes and distributed into 48-well dishes. The small-molecule signaling pathway antagonists were applied at a concentration of 15 µM, and embryos were imaged for 120 h with intervals of 5 min at 15–18 °C. Approximately 200 stickleback embryos between 0 h.p.f. and 140 h.p.f. were used.

Image acquisition

Images were acquired using an ACQUIFER Imaging Machine (DITABIS AG) with a white light-emitting diode (LED) for bright-field imaging and a scientific complementary metal oxide semiconductor 2,048 × 2,048 camera (Hamamatsu sCMOS 2k × 2k) in a single plane with a ×2 Plan UW numerical aperture 0.06 objective (Nikon) using the Imaging Machine software (v4.00.21). The integration time was fixed at a 110 ms exposure time and 100% relative LED intensity in the bright-field channel. Imaging was performed at 28 °C with 720 iterations at intervals of 120 s. Images were stored as 12-bit TIFF files at a size of 2,048 × 2,048 pixels and 0.31 pixels μm⁻¹ and converted to JPEG or PNG files for further phenotype analyses.

To generalize the method independently of the microscope, a Keyence BZ-X810 microscope equipped with a ×2 apochromat objective, a 3.7 W LED lightsource and the BZ-X800 viewer software (Keyence, v01.03.00.01) was also used to acquire embryo images (Supplementary Tables 2, 14, 15, 23 and 24 and Extended Data Fig. 4d,e). The exposure time was 0.1 ms with 50% relative intensity. The images were stored as 8-bit JPEG files at a size of 1,920 × 1,440 pixels.

Medaka and stickleback eggs contain large lipid droplets and, compared to zebrafish, have a larger yolk in relation to the embryo proper. Additionally, medaka eggs are surrounded by adhesive filaments. Given that these features are visually very prominent, the embryos were required to be imaged until the late segmentation stages to robustly detect morphological differences.

Embryo detection

A dataset of manually annotated embryos was generated using the GUI FishLabeler (http://github.com/mueller-lab/EmbryoNet). The dataset was split into two subsets: 90% of the images were used for training and 10% for validation. Additionally, an independent manually annotated dataset was generated for testing.

Individual embryos were automatically detected at each image frame of the acquired movies using a standard object-detection algorithm based on the Hough transform⁷¹. The location of individual embryos was computationally determined using bounding boxes. The range of embryo radii in pixels was provided according to the microscope acquisition parameters for each experiment independently. As output, a set of JSON files containing the information about the bounding boxes of individual embryos was generated. The Hough transform-based embryo detector can be replaced by other object recognition methods (such as watershed segmentation) to detect non-spherical species (for example, Drosophila melanogaster).

Embryo tracking

To obtain information about the whole developmental path of each individual, the embryos identified at individual frames were grouped using an object-tracking approach. Detections of the same embryo in consecutive frames were confirmed using the DeepSort algorithm without re-identification⁷².

Manual labeling of training datasets

All embryos were initially set as class Unknown. Then, each embryo track was manually annotated with its specific phenotypic class (that is, Normal = Wild type, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP) from the timepoint when the phenotype could be observed by an experienced annotator. Additionally, embryos that disintegrated were labeled as Dead. Embryos that were only partially in the image or that showed an unspecific phenotype were annotated as Cut and were excluded from the training, validation and test datasets. Additionally, the −BMP and −Nodal classes were subclassified into severity levels: weak, ~30% phenotype severity; intermediate, ~60% phenotype severity; and severe, ~100% phenotype severity. For Nodal phenotypes, the percentage bins were determined by the concentration of the inhibitor, with 100% corresponding to the minimum concentration that led to full penetrance. The drug concentration was then directly used as a binned fraction of the fully penetrant dose as ground truth for the severity bins, accepting a certain spread of phenotypes. Binning of BMP severity levels was done based on previous classification schemes¹⁴, with the class C3 corresponding to 30%, C4 to 60%, and C5 to 100% severity. Altogether 14 classes were obtained for the classification process. The annotator had previous knowledge about the treatment of the respective embryo, and the expertise to recognize all of the phenotypes.

To train EmbryoNet-Prime for an earlier detection, the original manual annotations were used to determine the timepoint when the majority of the embryos were classified correctly. The appropriate class was then assigned 120 frames (4 h) earlier, when the phenotype could not be identified by eye. Cut and Dead embryos were not changed.

Model training and embryo classification

The use of embryo images as the only input could lead to misclassifications between embryo phenotypes, which have a similar appearance at different developmental stages. To increase classification performance, the developmental timepoint was added as a second input to the classification algorithm. In total, four channels were used as input for model training. The first three channels correspond to a standard RGB image, and the remaining one is a ‘timestamp’ channel representing the time that has passed from the beginning of the experiment. The size of the images was 224 × 224 pixels. The timestamp was linearly mapped from the real developmental time to the domain [0, 1], where 0 corresponds to ~2 h.p.f. and 1 to ~26 h.p.f. Given that the input classes were imbalanced (Supplementary Table 2), the overrepresented class Unknown was undersampled and the 13 remaining classes replicated⁷³.

For the classification task, a modified version of the widely used ResNet18⁴⁰ architecture was selected. The network architecture was chosen due to its easy and fast convergence in image classification tasks. The ResNet18 model was modified by using a time channel as additional input and thus feeding four instead of three channels, and by replacing the last classification layer with the current classification layer. Time was also used as input to the last fully connected layer. The parameters of the neural network weights, unlike the neural network architecture (that is, the mathematical function structure describing an artificial neural network), were changed during the training procedure via a back-propagation algorithm.

The CNN model was trained using the supervised back-propagation training method⁷⁴, a common algorithm for training neural networks. The Adam optimizer was used, which is a back propagation-based optimization algorithm that determines the value change of neural network weights based on the loss function gradient. Softmax cross-entropy was used as the loss function (that is, the penalty for a poor prediction, indicating how bad the model’s prediction was on a single example):

$$L = - 1/n{\sum} {ln\;p_i}$$

where p_i is the index of the correct probability of the i-th image, and n is the number of images in the batch. In the case in which the model’s prediction is perfect, the loss would be zero; otherwise, the loss is greater. Cross-entropy loss is a common loss function used in machine learning and it measures the expected negative logarithm’s value for the correct classification probability.

The Albumentations library⁷⁵ was used to increase the amount of training images by adding slightly modified copies of the existing images. Augmentations were applied during the training process, including random horizontal and vertical flips, rotations in the range of 1–90° with steps of 1°, crops and salt-and-pepper noise (Supplementary Table 3). During the training process a random augmentation from each group was picked and applied to the input image.

Given that the selected CNN model did not converge when all datasets and augmentations were used from the beginning of the training, a progressive training design involving different levels of difficulty was developed. In brief, the training was performed sequentially by dividing it into several steps with progressive addition of data and augmentations. At each step extra data were added as input and new augmentations applied at each epoch, that is, at each pass over the entire training dataset during the training procedure. The initial learning rate was set to 10⁻³ and it decayed by a factor of 0.1 after each epoch. The learning rate, that is, the parameter by which the loss gradient value is multiplied during each iteration, was restored to the value of 10⁻³ at the beginning of each iteration. The model was trained using eight steps with 10–20 epochs per step, resulting in a total of 152 epochs. For the whole training, a batch size (that is, the number of training examples used in one iteration) of n = 350 was used, and the training was performed on an NVIDIA RTX 3090 card in Ubuntu 20.04.4 LTS.

Rotation- and mirror-invariance of the embryo appearance was exploited to boost the classification performance by running the trained network for each detected embryo eight times: once with the original embryo image, once with the image flipped horizontally, then flipped vertically, then mirrored diagonally, and then with each of these samples rotated by 90°. Following this step, the classification probabilities were averaged. Each embryo was assigned to the class that had the maximum probability.

Model transition logic

To further improve the results of the classification, the information from embryo tracking as well as previous knowledge about transitions between phenotypes was incorporated into the classification task. In brief, first the classification results of the CNN for each embryo track were collected and transitions between classes identified. The only biologically possible transitions in an embryo track were set as follows: from Unknown to a phenotype class, from a phenotype class to Dead, or from Unknown to Dead. Any other transition in an embryo track was penalized in the model prediction. The quality of the whole track model prediction was evaluated by computing the number of frames between transition points with the class expected by the model being analyzed. The transition sequence that achieved the least cost was considered to be the correct one. The outliers were then ignored in the track history. Nodal and BMP severity classifications were similarly corrected by selecting the severity class that was most frequently observed over the timecourse.

For medaka, a semi-supervised training method was used by assigning a classification transition point from which the phenotypes were easy to distinguish for a human and automatically applying this to all training data. Given that medaka embryos disintegrated if they were treated with Nodal inhibitor before the blastula stages, the medaka experiments did not start at cleavage but at blastula stages. This opportunity was used to set the transition point to timepoint 1, such that the Unknown class did not have to be used at all. This did not reduce the training or classification efficiency (Fig. 4f), showing that an Unknown state for early stages is not unconditionally required.

Evaluation of classification efficiency

For the performance measure of classification, subset accuracy was computed. Subset accuracy is the fraction of images n that were classified correctly:

$${{{\mathrm{Accuracy}}}} = \frac{1}{n}\mathop {\sum }\limits_{i = 1}^n I\left( {{\check{y}_i} = y_i} \right)$$

F-scores were calculated as

$${{{\mathrm{F-score = 2}}}} \times \frac{{{{{\mathrm{Precision}}}} \times {{{\mathrm{Recall}}}}}}{{{{{\mathrm{Precision + Recall}}}}}}$$

with

$$\begin{array}{rcl}{\mathrm{precision}} & = & \frac{{{\mathrm{true}}\;{\mathrm{positives}}}}{{{\mathrm{true}}\;{\mathrm{positives}} + {\mathrm{false}}\;{\mathrm{positives}}}}\,{\mathrm{and}}\\ {\mathrm{recall}} & = & \frac{{{\mathrm{true}}\;{\mathrm{positives}}}}{{{\mathrm{true}}\;{\mathrm{positives + false}}\;{\mathrm{negatives}}}}\end{array}$$

In the confusion matrices, the class Unknown was not taken into account. The numerical data for the confusion matrices including the class Unknown are provided in Supplementary Tables 4, 9, 11–15 and 17–24, and this class was also included in the overall metrics of accuracy and F-score.

The evaluation dataset for Fig. 2a–e was generated by compiling three stacks of 98 images each selected from the full test dataset (Fig. 2h–j and Extended Data Fig. 4). To evaluate the performance of random guessing, the function randi from MATLAB R2022a was used for the generation of a pseudo-random scalar integer between 1 and n_c, where n_c is the number of classes. The image stacks were then labeled with the classes corresponding to the pseudo-random numbers and evaluated for performance by calculating accuracy and F-score. The non-expert teams received one randomly selected image stack for their assessment task. The experienced developmental biologist assessed all three image stacks, and average performance is shown in Fig. 2d.

CAMs

To visualize the regions of images that influenced the model to make classification decisions, CAMs were used. To visualize the CAMs generated by EmbryoNet, the weights of the final output layer in a fully connected layer were projected using global max pooling, as previously proposed⁴⁷. This approach enabled the visualization of regions positively or negatively activated for a particular class. CAMs were calculated for all classes, and their values were normalized so that the minimum and maximum values for all classes correspond to −1 and +1, respectively. To improve the visualization of areas with large positive or negative values (that is, relevant regions for the decision), the CAMs were remapped using the following function:

$$\tilde V_{{\mathrm{CAM}}} = {{{\mathrm{sgn}}}}(V_{{\mathrm{CAM}}}) \times \sqrt {V_{{\mathrm{CAM}}}}$$

where $V_{{\mathrm{CAM}}}$ are the normalized values of the CAMs, ${{{\mathrm{sgn}}}}\left( \cdot \right)$ is the sign function and $\tilde V_{{\mathrm{CAM}}}$ are the remapped CAM values. Finally, the values of the CAMs were mapped to 8-bit and visualized with the jet colormap.

Drug screening

Plates of the Screen-Well ICCB Known Bioactives Library BML-2840-0100 and the FDA-approved drug library BML-2843 were defrosted at 22 °C for 1 h and centrifuged at 1,890 ×g for 2 min (Eppendorf 5810 R). Six 96-well microtiter plates were pre-filled with 96 μl cell culture-grade PBS (Gibco). From each library plate, 4 μl per well were transferred, resulting in a 1:25 dilution. Blank wells were filled with 4 μl cell culture-grade PBS. Zebrafish embryos were collected as described above, but selected embryos were washed three times with 200 ml embryo medium and transferred to a 96-well plate (Greiner Bio-One), three to five embryos per well. Each well was filled with embryo medium to a volume of 135 μl. Subsequently, 15 μl solution were transferred from the 1:25 intermediate dilution plates to each well containing embryos. Plates were covered with transparent foil, and a plastic lid was placed on the plate.

Screening plates were placed in the ACQUIFER Imaging Machine as described above with an imaging interval between 135 s and 192 s. Image files were converted to JPEG files for further phenotype analyses. The images from the 96-well screening plates were sorted into separate directories related to respective wells using a custom Python script (Drug screen script 1; http://github.com/mueller-lab/EmbryoNet/tree/main/Train_Eval/tools/DrugScreen). The data files were read into the custom FishClassifier software and evaluated for detected phenotypes. For each image file, phenotype detections were stored as a separate JSON file. The JSON files were read using a custom Python script (Drug screen script 2; http://github.com/mueller-lab/EmbryoNet/tree/main/Train_Eval/tools/DrugScreen). Evaluated phenotypes were linked with corresponding treatments and finally stored as Excel files, containing the number of images for each class in the time series. These files were used to generate charts for predicted phenotypes resulting from each treatment (Drug screen script 3; http://github.com/mueller-lab/EmbryoNet/tree/main/Train_Eval/tools/DrugScreen). The majority phenotype for each well was determined as the class to which the highest number of embryo images was assigned.

Retest and characterization of statins in FGF signaling

Zebrafish embryos were treated with 20 µM simvastatin in embryo medium (Enzo Life Science BML-G244-0050, final concentration of DMSO solvent: 0.2%), 40 µM atorvastatin (Sigma PHR1422, final concentration of DMSO solvent: 0.4%) or 0.4 µM lovastatin (PHR1285, final concentration of DMSO solvent: 0.04%) starting at 1.5–2 h.p.f. or were left untreated and incubated at 28 °C.

Live embryos were imaged at 28 h.p.f. with a bright-field microscope (Leica M205 FCA). For close-up images, embryos were manually dechorionated using precision forceps and embedded in 2% methylcellulose in embryo medium.

For pErk immunostainings, untreated and statin-treated embryos were fixed at the shield stage with 4% formaldehyde in PBS overnight at 4 °C and then stepwise (25%, 50%, 75% methanol in PBST (PBS containing 0.1% Tween-20)) dehydrated. After an overnight incubation in 100% methanol at −20 °C, embryos were rehydrated in three steps (75%, 50%, 25% methanol in PBST). After permeabilization with ice-cold acetone for 20 min at −20 °C and additional washing steps with PBST, samples were blocked in 10% FBS in PBST for 2 h and incubated in 1:5,000 mouse anti-pERK antibody (Sigma, M8159) in 10% FBS in PBST overnight at 4 °C. Embryos were then washed at least 12 times with PBST, followed by another blocking step for 2 h with 10% FBS in PBST and overnight incubation with 1:5,000 donkey anti-mouse HRP-coupled secondary antibody (Jackson ImmunoResearch, 715-035-150) in 10% FBS in PBST at 4 °C. After washing at least 12 times with PBST and once with TSA 1x amplification buffer, embryos were incubated in 75 µl 1:75 Cy3-TSA in 1x amplification buffer for 45 min, protected from light. After washing for at least four times with PBST, embryos were incubated in 0.3 µM DRAQ7 (Invitrogen, D15106) in PBST for 30 min and then washed at least three times with PBST. Before imaging, stained embryos were wrapped in aluminum foil and stored overnight at 4 °C.

Fixed and stained embryos were mounted in 1.5% low-melting point agarose (Lonza, 50080) using a glass capillary (50 µl, Brand 701908) and imaged with a ZEISS Lightsheet Z.1 microscope using ZEN 3.1 Black Edition acquisition software. The imaging chamber was filled with water, and filters and lightsheets were auto-aligned prior to imaging⁷⁶. Embryos were positioned with the brightest pErk signal pointing towards the imaging objective (presumptive dorsal view). For each embryo, z-slices with 5 μm between each slice were acquired. All images were acquired with dual lightsheet illumination using a W Plan-Apochromat ×10 objective at ×0.9 zoom, with laser powers of 2% and 6% for pErk and nuclei, respectively.

To measure spatial intensity profiles from the margin to the animal pole, maximum intensity projections of 75 z-slices were generated using Fiji⁷⁷, and pErk intensity profiles were calculated as follows. First, a rectangular region of interest with a width of 300 pixels was manually drawn from the margin of the blastoderm to the animal pole. Only images of embryos that were oriented with the dorsal side facing the camera were used for the analysis. The dorsal side could be identified after generating maximum intensity projections from image stacks. Embryos with tilted dorso-ventral axes were excluded. Then, the average intensity along the profile was calculated using the function Measure in Fiji. The background intensity of pErk was estimated as the median intensity value of the profiles of untreated embryos at the animal pole (between 250 µm and 280 µm from the margin) and subtracted from the intensity profiles using MATLAB 2022a (https://doi.org/10.48606/55).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Training and evaluation datasets for EmbryoNet are available from http://embryonet.uni-konstanz.de and https://doi.org/10.48606/15. The drug screen data are available from https://doi.org/10.48606/37, https://doi.org/10.48606/38 and https://doi.org/10.48606/41. Additional data that support the findings of this study are available from https://doi.org/10.48606/53 and https://doi.org/10.48606/55.

Code availability

The source code for EmbryoNet is available from http://github.com/mueller-lab/EmbryoNet (https://doi.org/10.5281/zenodo.7531593). Additional custom scripts used for data analysis in this study are available from https://doi.org/10.48606/15.

References

Marlow, F. L. Setting up for gastrulation in zebrafish. Curr. Top. Dev. Biol. 136, 33–83 (2020).
CAS PubMed Google Scholar
Schier, A. F. & Talbot, W. S. Molecular genetics of axis formation in zebrafish. Annu. Rev. Genet. 39, 561–613 (2005).
CAS PubMed Google Scholar
Heisenberg, C. P. & Solnica-Krezel, L. Back and forth between cell fate specification and movement during vertebrate gastrulation. Curr. Opin. Genet. Dev. 18, 311–316 (2008).
CAS PubMed PubMed Central Google Scholar
Manzo, G. Similarities between embryo development and cancer process suggest new strategies for research and therapy of tumors: a new point of view. Front. Cell Dev. Biol. 7, 20 (2019).
PubMed PubMed Central Google Scholar
Nusse, R. & Clevers, H. Wnt/beta-catenin signaling, disease, and emerging therapeutic modalities. Cell 169, 985–999 (2017).
CAS PubMed Google Scholar
Gray, R. S., Roszko, I. & Solnica-Krezel, L. Planar cell polarity: coordinating morphogenetic cell behaviors with embryonic polarity. Dev. Cell 21, 120–133 (2011).
CAS PubMed PubMed Central Google Scholar
Huang, P. & Schier, A. F. Dampened Hedgehog signaling but normal Wnt signaling in zebrafish without cilia. Development 136, 3089–3098 (2009).
CAS PubMed PubMed Central Google Scholar
Woods, I. G. & Talbot, W. S. The you gene encodes an EGF-CUB protein essential for Hedgehog signaling in zebrafish. PLoS Biol. 3, e66 (2005).
PubMed PubMed Central Google Scholar
Schier, A. F. & Talbot, W. S. Nodal signaling and the zebrafish organizer. Int. J. Dev. Biol. 45, 289–297 (2001).
CAS PubMed Google Scholar
Nasevicius, A. & Ekker, S. C. Effective targeted gene ‘knockdown’ in zebrafish. Nat. Genet. 26, 216–220 (2000).
CAS PubMed Google Scholar
Hino, H. et al. Roles of maternal wnt8a transcripts in axis formation in zebrafish. Dev. Biol. 434, 96–107 (2018).
CAS PubMed Google Scholar
Lekven, A. C., Thorpe, C. J., Waxman, J. S. & Moon, R. T. Zebrafish wnt8 encodes two wnt8 proteins on a bicistronic transcript and is required for mesoderm and neurectoderm patterning. Dev. Cell 1, 103–114 (2001).
CAS PubMed Google Scholar
Rohner, N. et al. Duplication of fgfr1 permits Fgf signaling to serve as a target for selection during domestication. Curr. Biol. 19, 1642–1647 (2009).
CAS PubMed Google Scholar
Kishimoto, Y., Lee, K. H., Zon, L., Hammerschmidt, M. & Schulte-Merker, S. The molecular nature of zebrafish swirl: BMP2 function is essential during early dorsoventral patterning. Development 124, 4457–4466 (1997).
CAS PubMed Google Scholar
Shinya, M., Eschbach, C., Clark, M., Lehrach, H. & Furutani-Seiki, M. Zebrafish Dkk1, induced by the pre-MBT Wnt signaling, is secreted from the prechordal plate and patterns the anterior neural plate. Mech. Dev. 98, 3–17 (2000).
CAS PubMed Google Scholar
Begemann, G., Schilling, T. F., Rauch, G. J., Geisler, R. & Ingham, P. W. The zebrafish neckless mutation reveals a requirement for raldh2 in mesodermal signals that pattern the hindbrain. Development 128, 3081–3094 (2001).
CAS PubMed Google Scholar
Stainier, D. Y. & Fishman, M. C. Patterning the zebrafish heart tube: acquisition of anteroposterior polarity. Dev. Biol. 153, 91–101 (1992).
CAS PubMed Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
CAS PubMed Google Scholar
Russakovsky, O. et al. ImageNet large scale visual recognition challenge. Int. J. Comput. Vis. 115, 211–252 (2015).
Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021).
CAS PubMed PubMed Central Google Scholar
Jumper, J. & Hassabis, D. Protein structure predictions to atomic accuracy with AlphaFold. Nat. Methods 19, 11–12 (2022).
CAS PubMed Google Scholar
Senior, A. W. et al. Improved protein structure prediction using potentials from deep learning. Nature 577, 706–710 (2020).
CAS PubMed Google Scholar
Tunyasuvunakool, K. et al. Highly accurate protein structure prediction for the human proteome. Nature 596, 590–596 (2021).
CAS PubMed PubMed Central Google Scholar
Reichstein, M. et al. Deep learning and process understanding for data-driven Earth system science. Nature 566, 195–204 (2019).
CAS PubMed Google Scholar
Hallou, A., Yevick, H. G., Dumitrascu, B. & Uhlmann, V. Deep learning for bioimage analysis in developmental biology. Development 148, dev199616 (2021).
CAS PubMed PubMed Central Google Scholar
Moen, E. et al. Deep learning for cellular image analysis. Nat. Methods 16, 1233–1246 (2019).
CAS PubMed PubMed Central Google Scholar
Weigert, M. et al. Content-aware image restoration: pushing the limits of fluorescence microscopy. Nat. Methods 15, 1090–1097 (2018).
CAS PubMed Google Scholar
McQuin, C. et al. CellProfiler 3.0: next-generation image processing for biology. PLoS Biol. 16, e2005970 (2018).
PubMed PubMed Central Google Scholar
Stringer, C., Wang, T., Michaelos, M. & Pachitariu, M. Cellpose: a generalist algorithm for cellular segmentation. Nat. Methods 18, 100–106 (2021).
CAS PubMed Google Scholar
Naert, T. et al. Deep learning is widely applicable to phenotyping embryonic development and disease. Development 148, dev199664 (2021).
CAS PubMed PubMed Central Google Scholar
Tyagi, G., Patel, N. & Ishwar, S. A fine-tuned convolution neural network based approach for phenotype classification of zebrafish. Procedia Computer Science 126, 1138–1144 (2018).
Google Scholar
Jeanray, N. et al. Phenotype classification of zebrafish embryos by supervised learning. PLoS ONE 10, e0116989 (2015).
Khosravi, P. et al. Deep learning enables robust assessment and selection of human blastocysts after in vitro fertilization. npj Digit. Med. 2, 21 (2019).
Baris Atakan, H., Alkanat, T., Cornaglia, M., Trouillon, R. & Gijs, M. A. M. Automated phenotyping of Caenorhabditis elegans embryos with a high-throughput-screening microfluidic platform. Microsyst. Nanoeng. 6, 24 (2020).
CAS PubMed PubMed Central Google Scholar
Suryanto, M. E. et al. Using DeepLabCut as a real-time and markerless tool for cardiac physiology assessment in zebrafish. Biology 11, 1243 (2022).
Tills, O. et al. A high-throughput and open-source platform for embryo phenomics. PLoS Biol. 16, e3000074 (2018).
PubMed PubMed Central Google Scholar
Shen, M. M. Nodal signaling: developmental roles and regulation. Development 134, 1023–1034 (2007).
CAS PubMed Google Scholar
DaCosta Byfield, S., Major, C., Laping, N. J. & Roberts, A. B. SB-505124 is a selective inhibitor of transforming growth factor-beta type I receptors ALK4, ALK5, and ALK7. Mol. Pharmacol. 65, 744–752 (2004).
PubMed Google Scholar
Hagos, E. G. & Dougan, S. T. Time-dependent patterning of the mesoderm and endoderm by Nodal signals in zebrafish. BMC Dev. Biol. 7, 22 (2007).
PubMed PubMed Central Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 (2016).
Gritsman, K. et al. The EGF-CFC protein One-eyed pinhead is essential for Nodal signaling. Cell 97, 121–132 (1999).
Wang, X. et al. The development of highly potent inhibitors for porcupine. J. Med. Chem. 56, 2700–2704 (2013).
CAS PubMed PubMed Central Google Scholar
Sun, L. et al. Design, synthesis, and evaluations of substituted 3-[(3- or 4-carboxyethylpyrrol-2-yl)methylidenyl]indolin-2-ones as inhibitors of VEGF, FGF, and PDGF receptor tyrosine kinases. J. Med. Chem. 42, 5120–5130 (1999).
Cuny, G. D. et al. Structure-activity relationship study of bone morphogenetic protein (BMP) signaling inhibitors. Bioorg. Med. Chem. Lett. 18, 4388–4392 (2008).
CAS PubMed PubMed Central Google Scholar
Incardona, J. P., Gaffield, W., Kapur, R. P. & Roelink, H. The teratogenic Veratrum alkaloid cyclopamine inhibits sonic hedgehog signal transduction. Development 125, 3553–3562 (1998).
CAS PubMed Google Scholar
Dougan, S. T., Warga, R. M., Kane, D. A., Schier, A. F. & Talbot, W. S. The role of the zebrafish nodal-related genes squint and cyclops in patterning of mesendoderm. Development 130, 1837–1851 (2003).
CAS PubMed Google Scholar
Zhou, B., Khosla, A., Lapedriza, A., Oliva, A. & Torralba, A. Learning deep features for discriminative localization. Preprint at https://arxiv.org/abs/1512.04150 (2015).
Müller, P. et al. Differential diffusivity of Nodal and Lefty underlies a reaction–diffusion patterning system. Science 336, 721–724 (2012).
PubMed PubMed Central Google Scholar
van Boxtel, A. L. et al. A temporal window for signal activation dictates the dimensions of a Nodal signaling domain. Dev. Cell 35, 175–185 (2015).
PubMed PubMed Central Google Scholar
Bilder, R. M. et al. Phenomics: the systematic study of phenotypes on a genome-wide scale. Neuroscience 164, 30–42 (2009).
CAS PubMed Google Scholar
Campos, L. M. et al. Alterations in zebrafish development induced by simvastatin: comprehensive morphological and physiological study, focusing on muscle. Exp. Biol. Med. 241, 1950–1960 (2016).
Campos, L. M. et al. Structural analysis of alterations in zebrafish muscle differentiation induced by simvastatin and their recovery with cholesterol. J. Histochem. Cytochem. 63, 427–437 (2015).
CAS PubMed PubMed Central Google Scholar
Maerz, L. D. et al. Pharmacological cholesterol depletion disturbs ciliogenesis and ciliary function in developing zebrafish. Commun. Biol. 2, 31 (2019).
PubMed PubMed Central Google Scholar
Thorpe, J. L., Doitsidou, M., Ho, S. Y., Raz, E. & Farber, S. A. Germ cell migration in zebrafish is dependent on HMGCoA reductase activity and prenylation. Dev. Cell 6, 295–302 (2004).
CAS PubMed Google Scholar
Leerberg, D. M., Hopton, R. E. & Draper, B. W. Fibroblast growth factor receptors function redundantly during zebrafish embryonic development. Genetics 212, 1301–1319 (2019).
CAS PubMed PubMed Central Google Scholar
Draper, B. W., Stock, D. W. & Kimmel, C. B. Zebrafish fgf24 functions with fgf8 to promote posterior mesodermal development. Development 130, 4639–4654 (2003).
CAS PubMed Google Scholar
Economou, A. D., Guglielmi, L., East, P. & Hill, C. S. Nodal signaling establishes a competency window for stochastic cell fate switching. Dev. Cell 57, 2604–2622 (2022).
CAS PubMed Google Scholar
Piotrowski, P. C. et al. Statins inhibit growth of human endometrial stromal cells independently of cholesterol availability. Biol. Reprod. 75, 107–111 (2006).
CAS PubMed Google Scholar
Pfister, P., Randall, J., Montoya-Burgos, J. I. & Rodriguez, I. Divergent evolution among teleost V1r receptor genes. PLoS ONE 2, e379 (2007).
Wittbrodt, J., Shima, A. & Schartl, M. Medaka: a model organism from the far East. Nat. Rev. Genet. 3, 53–64 (2002).
CAS PubMed Google Scholar
Iwamatsu, T. Stages of normal development in the medaka Oryzias latipes. Mech. Dev. 121, 605–618 (2004).
CAS PubMed Google Scholar
Swarup, H. Stages in the development of the stickleback Gasterosteus aculeatus (L.). J. Embryol. Exp. Morphol. 6, 373–383 (1958).
CAS PubMed Google Scholar
Houle, D., Govindaraju, D. R. & Omholt, S. Phenomics: the next challenge. Nat. Rev. Genet. 11, 855–866 (2010).
CAS PubMed Google Scholar
Brown, S. D. M. et al. High-throughput mouse phenomics for characterizing mammalian gene function. Nat. Rev. Genet. 19, 357–370 (2018).
CAS PubMed PubMed Central Google Scholar
D’Orazio, M. et al. Machine learning phenomics (MLP) combining deep learning with time-lapse-microscopy for monitoring colorectal adenocarcinoma cells gene expression and drug-response. Sci. Rep. 12, 8545 (2022).
PubMed PubMed Central Google Scholar
Watson, C. J. et al. Phenomics-based quantification of CRISPR-induced mosaicism in zebrafish. Cell Syst. 10, 275–286 (2020).
CAS PubMed PubMed Central Google Scholar
Nabwire, S., Suh, H. K., Kim, M. S., Baek, I. & Cho, B. K. Review: application of artificial intelligence in phenomics. Sensors 21, 4363 (2021).
Čapek, D., Ünalan, M. & Müller, P. Wie Tiere sich selbst konstruieren. Biospektrum 27, 473–477 (2021).
Google Scholar
Williams, B. B. et al. VANGL2 regulates membrane trafficking of MMP14 to control cell polarity and migration. J. Cell Sci. 125, 2141–2147 (2012).
CAS PubMed PubMed Central Google Scholar
Pomreinke, A. P. et al. Dynamics of BMP signaling and distribution during zebrafish dorsal–-ventral patterning. eLife 6, e25861 (2017).
Duda, R. O. & Hart, P. E. Use of the Hough transformation to detect lines and curves in pictures. Communications of the ACM 15, 11–15 (1972).
Google Scholar
Wojke, N., Bewley, A. & Paulus, D. Simple online and realtime tracking with a deep association metric. In 2017 IEEE International Conference on Image Processing (ICIP) 3645–3649 (2017).
Spelmen, V. S. & Porkodi, R. A review on handling imbalanced data. In 2018 International Conference on Current Trends towards Converging Technologies (ICCTCT) 1–11 (2018).
Rumelhart, D. E., Hinton, G. E. & Williams, R. J. Learning representations by back-propagating errors. Nature 323, 533–536 (1986).
Google Scholar
Buslaev, A. et al. Albumentations: fast and flexible image augmentations. Information 11, 125 (2020).
Rogers, K. W., ElGamacy, M., Jordan, B. M. & Müller, P. Optogenetic investigation of BMP target gene expression diversity. eLife 9, e58641 (2020).
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
CAS PubMed Google Scholar

Download references

Acknowledgements

This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No. 863952 (ACE-OF-SPACE) to P.M.). This work was also funded by the EMBO Young Investigator Programme (P.M.), Max Planck Society (P.M., F.J.), the FWF (Project J-4507, D.Č.), the IZKF of the Medical Faculty of the University of Tübingen (P.M., N.T.), and the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy—EXC 2117—422037984 (P.M.). We thank K.W. Rogers, A. Schauer, K. Sarieva, I. Carmi and M. Rössler for scientific input, consulting and illustration support. We also thank O. Aust for initial drug screenings and A. Baccini for technical assistance with compound retesting. For participating in the classification of single embryo images, we thank M. Akyüz, L. Amann, A. Balb, A. Bangnowski, S. Baumgärtner, A.-S. Becker, S. Berber, S. Bergemann, T. Berger, L. Betz-Jung, L. Beuten, M. Brückner, L. Budig, N. Bürgers, P. Buslaps, D. Casaburi, L. Dangel, J. Davia, T. Decker, A. Eiberle, J. Engler, C. Feldmann, M. Franz, E. Frese, A. Fronius, B. Goldschmidt, C. Gomes, D. Gaßebner, L. Haas, L. Haßfeld, L. Heger, L. Helten, S. Hillman, S. Hinte, L. Huber, J. Iffelsberger, I. Jorzik, J. Jung, L. Kammerer, J. Klein, E. Kleinke, H. Klenk, V. Kneipp, M. Kölle, M. Kröner, V. Kuhn, P. Kukofka, J. Küpfer, Y. Lan, K. Land, C. Lewin, M. Lohmer, J. Lüders, X. Lyu, H. Mahl, R. Manukjan, M. Martini, A. Maslonka, P. Matijas-Graf, N. Meier, T. Morell, F. Natale, M. Nyesö, F. Piehler, A. Pirker, S. Rampp, V. Raupp, K.M. Reagan, A. Reiß, G. Rösler, F. Roßmann, J. Roylands, L. Ruf, J. Schiele, R. Schmidt, A. Schneider, M. Schön, M. Schröter, A. Schupp, F. Stiller, S. Stöckl, L. Thellmann, M. Thomann, D. Torcuk, Z. Umbach, R. Unsöld, C. Vogl, H.C. von Vegesack, R. Wagner, G. Wallig, M.A. Wannemacher, L. Wanner, F. Welsch, C. Wolfer, V. Zickenberg and M. Ziefle. We also thank T. Thumberger and J. Wittbrodt for providing the medaka Cab strain, and P. Huang for providing the pCS2-zGli3R-EGFP plasmid.

Author information

These authors contributed equally: Daniel Čapek, Matvey Safroshkin, Hernán Morales-Navarrete.

Authors and Affiliations

Systems Biology of Development, University of Konstanz, Konstanz, Germany
Daniel Čapek, Hernán Morales-Navarrete, Nikan Toulany, Anica Kurzbach, Ben Jordan & Patrick Müller
Friedrich Miescher Laboratory of the Max Planck Society, Tübingen, Germany
Daniel Čapek, Hernán Morales-Navarrete, Nikan Toulany, Johanna Bihler, Julia Hagauer, Sebastian Kick, Felicity Jones & Patrick Müller
Computer Vision Studio, Tübingen, Germany
Matvey Safroshkin & Grigory Arutyunov
Centre for the Advanced Study of Collective Behaviour, Konstanz, Germany
Hernán Morales-Navarrete & Patrick Müller

Authors

Daniel Čapek
View author publications
You can also search for this author in PubMed Google Scholar
Matvey Safroshkin
View author publications
You can also search for this author in PubMed Google Scholar
Hernán Morales-Navarrete
View author publications
You can also search for this author in PubMed Google Scholar
Nikan Toulany
View author publications
You can also search for this author in PubMed Google Scholar
Grigory Arutyunov
View author publications
You can also search for this author in PubMed Google Scholar
Anica Kurzbach
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Bihler
View author publications
You can also search for this author in PubMed Google Scholar
Julia Hagauer
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Kick
View author publications
You can also search for this author in PubMed Google Scholar
Felicity Jones
View author publications
You can also search for this author in PubMed Google Scholar
Ben Jordan
View author publications
You can also search for this author in PubMed Google Scholar
Patrick Müller
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.M. and B.J. conceived the study. P.M. supervised the project. D.Č., J.B., M.S., N.T. and H.M.-N. manually annotated images. M.S. and G.A. developed the software for EmbryoNet. M.S., G.A. and H.M.-N. developed software for downstream analysis. N.T. and D.Č. performed the drug screens. N.T. analyzed the drug screen. H.M.-N. analyzed the zebrafish data, M.S. analyzed the medaka data and N.T. analyzed the stickleback data. A.K. performed the retests of the statins, carried out immunostainings and lightsheet microscopy, and acquired time series on the Keyence BZ-X810 microscope. H.M.-N. contributed to lightsheet imaging and performed pErk quantification. D.Č. carried out all other experiments. J.H., S.K. and F.J. provided in vitro fertilized stickleback embryos. D.Č. and P.M. wrote the paper with input from all of the authors.

Corresponding author

Correspondence to Patrick Müller.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Methods thanks Marc Muller and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available. Primary Handling Editor: Rita Strack, in collaboration with the Nature Methods team.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Schematic of the training pipeline for EmbryoNet.

(a) Overview of a training iteration. Augmented embryos were collected into a training batch with known embryo age. After EmbryoNet processed a batch of input images with ages, network outputs were compared with ground truth values. Based on cross-entropy loss, EmbryoNet weights were updated to minimize the loss. (b) Examples of augmentations used. (c) In our model transition logic, embryos have a limited set of allowed class transitions. All start in the class Unknown and can transition to any other class, from where they can go only to Dead, but not to other classes. Other transitions were assigned a cost. The model with the least cost was selected. (d) Schematic of the classification pipeline. (e) Graphical user interface (GUI) of EmbryoNet. (f,g) Comparison of EmbryoNet’s performance to recognize phenotypes induced by signaling modulation using small-molecule inhibitors, overexpression of signaling antagonists or pathway mutants. Nodal phenotypes (f) induced by small-molecule inhibitor treatment (SB-505124, n=33), injection of a pathway antagonist (lefty1 mRNA, n=27) or in a receptor mutant (MZoep, n=27) were all classified by EmbryoNet as –Nodal with similar accuracy. BMP phenotypes (g) induced by small-molecule inhibitor treatment (LDN-193189, n=45), pathway antagonist injection (chordin mRNA, n=26) or in a pathway ligand mutant (swirl^-/-, n=13) were all classified by EmbryoNet as –BMP with similar accuracy. Scale bars: 500 µm.

Extended Data Fig. 2 Complete data of the non-expert teams to assess 98 embryo images without time information.

Confusion matrices show the classification of the respective non-expert teams. The non-expert teams did not have extra information about the age of the displayed embryos.

Extended Data Fig. 3 Complete data of the non-expert teams to assess 98 embryo images with time information.

Confusion matrices show the classification of the respective teams. Embryo age was supplied with the images.

Extended Data Fig. 4 Classification of time-lapse data by experts and EmbryoNet, and classification performance with different microscope data sets.

(a-c) Schematics and confusion matrices show classification of image series of the respective assessors. Human experts (a,b) knew that all embryos within a movie received the same treatment. (d,e) Confusion matrices showing the performance of EmbryoNet (d) and EmbryoNet-Prime (e) if only test data from either microscope 1 (d’,e’, ACQUIFER Imaging Machine) or microscope 2 (d’’,e’’, Keyence BZ-X810) was used.

Extended Data Fig. 5 Classification of mild Nodal and BMP phenotypes.

(a) Schematic of the experiment with lower inhibitor doses. Lower doses of the BMP inhibitor LDN-193189 lead to weaker phenotypes, detectable from late gastrulation onwards. While in the severe cases no clear structures are distinguishable, moderate embryos have a head and somites and display the characteristic BMP loss-of-function phenotype with curled-up tails (Kishimoto et al. 1997). Mild embryos have a largely intact body axis only missing the tail. (b-d) Confusion matrices showing the performance of classification of weaker phenotypes. EmbryoNet (b) had a lower performance on milder compared to severe phenotypes, especially for Nodal-inhibited samples. Human annotators were also less consequent as seen from the confusion matrix between the accepted ground truth and a second labeler (c). EmbryoNet-Prime had better success in detecting weak Nodal phenotypes compared to EmbryoNet, but was less performant on –BMP and on average (d).

Extended Data Fig. 6 Detection time points of all classes by EmbryoNet-Prime.

Plots of the number of detected phenotypes (dotted lines) for each class over time for EmbryoNet (pink diamonds), EmbryoNet-Prime (blue triangles) and human experts (green dots). The error envelopes show standard error of the mean. Solid lines show the fit of a sigmoid curve to the data. Gray boxes show major developmental periods. Different classes could be detected at different time points. –BMP, + RA, –Wnt, –Nodal and –Shh could be classified earlier by EmbryoNet-Prime than by humans. n[Normal]: 74, n[–BMP]: 119, n[+RA]: = 66, n[–Wnt]: 70, n[–FGF]: 74, n[–Nodal]: 110, n[–Shh]: 63, n[–PCP]: 57. Data also shown in Fig. 2g.

Extended Data Fig. 7 Results of the Enzo ScreenWell 2840 library drug screen.

(a–f) Layout of plates 1 (a), 2 (b), 3 (c), 4 (d), 5 (e) and 6 (f) from the BML-2840 library with classifications per well by majority phenotype. See Supplementary Tables 25–30 for details.

Extended Data Fig. 8 Results of the Enzo ScreenWell 2843 library drug screen.

(a–e) Layout of plates 1 (a), 2 (b), 3 (c), 5 (d) and 6 (e) from the BML-2843 library with classifications per well by majority phenotype. See Supplementary Tables 31–35 for details.

Supplementary information

Supplementary Information

Supplementary Notes 1–4 and References.

Reporting Summary

Peer Review File

Supplementary Tables

Supplementary Tables 1–35.

Supplementary Video 1

Representative timecourse of zebrafish Normal wild-type development. The movie spans 24 h with 2 min intervals.

Supplementary Video 2

Representative timecourse of zebrafish −BMP loss-of-function development. Embryos were treated with BMP inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 3

Representative timecourse of zebrafish +RA gain-of-function development. Embryos were treated with retinoic acid starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 4

Representative timecourse of zebrafish −Wnt loss-of-function development. Embryos were treated with Wnt inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 5

Representative timecourse of zebrafish −FGF loss-of-function development. Embryos were treated with FGF inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 6

Representative timecourse of zebrafish −Nodal loss-of-function development. Embryos were treated with Nodal inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 7

Representative timecourse of zebrafish −Shh loss-of-function development. Embryos were treated with Shh inhibitor (cyclopamine) starting at the 4-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 8

Representative timecourse of zebrafish −PCP loss-of-function development. Embryos were injected with a vangl2-targeting morpholino at the 1-cell stage. The movie spans 24 h with 2 min intervals.

Supplementary Video 9

Activation maps for different phenotype classes in wild-type zebrafish embryos. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 10

Activation maps for the phenotype class Normal in 10 wild-type zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 11

Activation maps for different phenotype classes in zebrafish BMP loss-of-function development. Embryos were treated with BMP inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 12

Activation maps for the phenotype class −BMP in 10 BMP loss-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 13

Activation maps for different phenotype classes in zebrafish RA gain-of-function development. Embryos were treated with retinoic acid starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 14

Activation maps for the phenotype class +RA in 10 RA gain-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 15

Activation maps for different phenotype classes in zebrafish Wnt loss-of-function development. Embryos were treated with Wnt inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 16

Activation maps for the phenotype class −Wnt in 10 Wnt loss-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 17

Activation maps for different phenotype classes in zebrafish FGF loss-of-function development. Embryos were treated with FGF inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotypic classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 18

Activation maps for the phenotype class −FGF in 10 FGF loss-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 19

Activation maps for different phenotype classes in zebrafish Nodal loss-of-function development. Embryos were treated with Nodal inhibitor starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 20

Activation maps for the phenotype class −Nodal in 10 Nodal loss-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 21

Activation maps for different phenotype classes in zebrafish Shh loss-of-function development. Embryos were treated with Shh inhibitor (cyclopamine) starting at the 4-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 22

Activation maps for the phenotype class −Shh in 10 Shh loss-of-function zebrafish embryos. The movie spans 2 h with 2 min intervals.

Supplementary Video 23

Activation maps for different phenotype classes in zebrafish PCP loss-of-function development. Embryos were injected with a vangl2-targeting morpholino at the 1-cell stage. The movie spans 24 h with 2 min intervals. Activation maps for different phenotype classes (Unknown, Normal, −BMP, +RA, −Wnt, −FGF, −Nodal, −Shh, −PCP and Dead) are shown.

Supplementary Video 24

Activation maps for the phenotype class −PCP in 10 PCP loss-of-function zebrafish embryos. The movie spans 24 h with 2 min intervals.

Supplementary Video 25

Zebrafish drug screen with ScreenWell Library 2840, plate 1. The movie spans 26 h with 139 s intervals. The embryos were drug-treated according to Supplementary Table 25 starting at the 512-cell stage.

Supplementary Video 26

Zebrafish drug screen with ScreenWell Library 2840, plate 2. The movie spans 24 h with 139 s intervals. The embryos were drug-treated according to Supplementary Table 26 starting at the 128-cell stage.

Supplementary Video 27

Zebrafish drug screen with ScreenWell Library 2840, plate 3. The movie spans 28 h with 138 s intervals. The embryos were drug-treated according to Supplementary Table 27 starting at the 512-cell stage.

Supplementary Video 28

Zebrafish drug screen with ScreenWell Library 2840, plate 4. The movie spans 24 h with 138 s intervals. The embryos were drug-treated according to Supplementary Table 28 starting at the 256-cell stage.

Supplementary Video 29

Zebrafish drug screen with ScreenWell Library 2840, plate 5. The movie spans 26 h with 143 s intervals. The embryos were drug-treated according to Supplementary Table 29 starting at the 256-cell stage.

Supplementary Video 30

Zebrafish drug screen with ScreenWell Library 2840, plate 6. The movie spans 25 h with 144 s intervals. The embryos were drug-treated according to Supplementary Table 30 starting at the 512-cell stage.

Supplementary Video 31

Zebrafish drug screen with ScreenWell Library 2843, plate 1. The movie spans 27 h with 135 s intervals. The embryos were drug-treated according to Supplementary Table 31 starting at the 256-cell stage.

Supplementary Video 32

Zebrafish drug screen with ScreenWell Library 2843, plate 2. The movie spans 24 h with 137 s intervals. The embryos were drug-treated according to Supplementary Table 32 starting at the 32-cell stage.

Supplementary Video 33

Zebrafish drug screen with ScreenWell Library 2843, plate 3. The movie spans 24 h with 138 s intervals. The embryos were drug-treated according to Supplementary Table 33 starting at the 64-cell stage.

Supplementary Video 34

Zebrafish drug screen with ScreenWell Library 2843, plate 5. The movie spans 24 h with 192 s intervals. The embryos were drug-treated according to Supplementary Table 34 starting at the 512-cell stage.

Supplementary Video 35

Zebrafish drug screen with ScreenWell Library 2843, plate 6. The movie spans 24 h with 142 s intervals. The embryos were drug-treated according to Supplementary Table 35 starting at the 512-cell stage.

Supplementary Video 36

Zebrafish development upon exposure to simvastatin. The movie spans 24 h with 137 s intervals. The embryos were treated with 40 µM simvastatin starting at the 32-cell stage.

Supplementary Video 37

Zebrafish development upon exposure to atorvastatin. The movie spans 24 h with 142 s intervals. The embryos were treated with 40 µM atorvastatin starting at the 512-cell stage.

Supplementary Video 38

Zebrafish development upon exposure to lovastatin. The movie spans 24 h with 143 s intervals. The embryos were treated with 20 µg ml⁻¹ (that is, 50 µM) lovastatin starting at the 256-cell stage.

Supplementary Video 39

Medaka normal development. The movie spans 37.5 h with 5 min intervals, starting at stage 11.

Supplementary Video 40

Medaka development with Nodal loss of function. The movie spans 37.5 h with 5 min intervals. The embryos were treated with the chemical Nodal inhibitor SB-505124 at a concentration of 7.5 µM starting at stage 11.

Supplementary Video 41

Stickleback normal development. The movie spans 120 h with 5 min intervals, starting at stage 10.

Supplementary Video 42

Stickleback development with Nodal loss of function. The movie spans 120 h with 5 min intervals. The embryos were treated with the chemical Nodal inhibitor SB-505124 at a concentration of 15 µM, starting from stage 10.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Čapek, D., Safroshkin, M., Morales-Navarrete, H. et al. EmbryoNet: using deep learning to link embryonic phenotypes to signaling pathways. Nat Methods 20, 815–823 (2023). https://doi.org/10.1038/s41592-023-01873-4

Download citation

Received: 26 September 2022
Accepted: 05 April 2023
Published: 08 May 2023
Issue Date: June 2023
DOI: https://doi.org/10.1038/s41592-023-01873-4

This article is cited by

Artificial intelligence assisted patient blood and urine droplet pattern analysis for non-invasive and accurate diagnosis of bladder cancer
- Ramiz Demir
- Soner Koc
- Devrim Gozuacik
Scientific Reports (2024)
Uncovering developmental time and tempo using deep learning
- Nikan Toulany
- Hernán Morales-Navarrete
- Patrick Müller
Nature Methods (2023)
Method of the Year 2023: methods for modeling development

Nature Methods (2023)