Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

The emergent landscape of the mouse gut endoderm at single-cell resolution


Here we delineate the ontogeny of the mammalian endoderm by generating 112,217 single-cell transcriptomes, which represent all endoderm populations within the mouse embryo until midgestation. We use graph-based approaches to model differentiating cells, which provides a spatio-temporal characterization of developmental trajectories and defines the transcriptional architecture that accompanies the emergence of the first (primitive or extra-embryonic) endodermal population and its sister pluripotent (embryonic) epiblast lineage. We uncover a relationship between descendants of these two lineages, in which epiblast cells differentiate into endoderm at two distinct time points—before and during gastrulation. Trajectories of endoderm cells were mapped as they acquired embryonic versus extra-embryonic fates and as they spatially converged within the nascent gut endoderm, which revealed these cells to be globally similar but retain aspects of their lineage history. We observed the regionalized identity of cells along the anterior–posterior axis of the emergent gut tube, which reflects their embryonic or extra-embryonic origin, and the coordinated patterning of these cells into organ-specific territories.

Access options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Fig. 1: Single-cell map of the mouse endoderm, from blastocyst to midgestation.
Fig. 2: Differentiation of EPI into endoderm before gastrulation.
Fig. 3: Spatial pattern emerges within visceral endoderm at the onset of post-implantation development at E5.5.
Fig. 4: Anterior–posterior pseudo-spatial axis of cells that reside within the gut tube at E8.75.
Fig. 5: Spatial patterning and organ identities within the gut tube of the mouse embryo at E8.75.

Data availability

All the generated data including bulk and scRNA-seq data are available through the Gene Expression Omnibus, under accession numbers GSE123046 (scRNA-seq) and GSE123124 (bulk RNA-seq). The data can be explored at, and any other relevant data are available from the corresponding authors upon reasonable request.

Code availability

Harmony is available as a Python module at Palantir is available as a Python module at A Jupyter notebook detailing the usage of Harmony along with sample data is available at


  1. 1.

    Zorn, A. M. & Wells, J. M. Vertebrate endoderm development and organ formation. Annu. Rev. Cell Dev. Biol. 25, 221–251 (2009).

    CAS  Article  Google Scholar 

  2. 2.

    Tremblay, K. D. Formation of the murine endoderm: lessons from the mouse, frog, fish, and chick. Prog. Mol. Biol. Transl. Sci. 96, 1–34 (2010).

    CAS  Article  Google Scholar 

  3. 3.

    Chazaud, C. & Yamanaka, Y. Lineage specification in the mouse preimplantation embryo. Development 143, 1063–1074 (2016).

    CAS  Article  Google Scholar 

  4. 4.

    Nowotschin, S. & Hadjantonakis, A. K. Cellular dynamics in the early mouse embryo: from axis formation to gastrulation. Curr. Opin. Genet. Dev. 20, 420–427 (2010).

    CAS  Article  Google Scholar 

  5. 5.

    Viotti, M., Nowotschin, S. & Hadjantonakis, A. K. SOX17 links gut endoderm morphogenesis and germ layer segregation. Nat. Cell Biol. 16, 1146–1156 (2014).

    CAS  Article  Google Scholar 

  6. 6.

    Viotti, M., Nowotschin, S. & Hadjantonakis, A. K. Afp:mCherry, a red fluorescent transgenic reporter of the mouse visceral endoderm. Genesis 49, 124–133 (2011).

    CAS  Article  Google Scholar 

  7. 7.

    Kwon, G. S., Viotti, M. & Hadjantonakis, A. K. The endoderm of the mouse embryo arises by dynamic widespread intercalation of embryonic and extraembryonic lineages. Dev. Cell 15, 509–520 (2008).

    CAS  Article  Google Scholar 

  8. 8.

    Sherwood, R. I., Chen, T. Y. & Melton, D. A. Transcriptional dynamics of endodermal organ formation. Dev. Dyn. 238, 29–42 (2009).

    CAS  Article  Google Scholar 

  9. 9.

    Hou, J. et al. A systematic screen for genes expressed in definitive endoderm by Serial Analysis of Gene Expression (SAGE). BMC Dev. Biol. 7, 92 (2007).

    Google Scholar 

  10. 10.

    Setty, M. et al. Palantir characterizes cell fate continuities in human hematopoiesis. Preprint at (2018).

  11. 11.

    Setty, M. et al. Characterization of cell fate probabilities in single-cell data with Palantir. Nat. Biotechnol. 37, 451–460 (2019).

    CAS  Article  Google Scholar 

  12. 12.

    Kwon, G. S. et al. Tg(Afp-GFP) expression marks primitive and definitive endoderm lineages during mouse development. Dev. Dyn. 235, 2549–2558 (2006).

    CAS  Article  Google Scholar 

  13. 13.

    Azizi, E. et al. Single-cell map of diverse immune phenotypes in the breast tumor microenvironment. Cell 174, 1293–1308 (2018).

    CAS  Article  Google Scholar 

  14. 14.

    Levine, J. H. et al. Data-driven phenotypic dissection of AML reveals progenitor-like cells that correlate with prognosis. Cell 162, 184–197 (2015).

    CAS  Article  Google Scholar 

  15. 15.

    Amir, E.-a. D. et al. viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia. Nat. Biotechnol. 31, 545–552 (2013).

    CAS  Article  Google Scholar 

  16. 16.

    Setty, M. et al. Wishbone identifies bifurcating developmental trajectories from single-cell data. Nat. Biotechnol. 34, 637–645 (2016).

    CAS  Article  Google Scholar 

  17. 17.

    Ibarra-Soria, X. et al. Defining murine organogenesis at single-cell resolution reveals a role for the leukotriene pathway in regulating blood progenitor formation. Nat. Cell Biol. 20, 127–134 (2018).

    CAS  Article  Google Scholar 

  18. 18.

    Haghverdi, L., Büttner, M., Wolf, F. A., Buettner, F. & Theis, F. J. Diffusion pseudotime robustly reconstructs lineage branching. Nat. Methods 13, 845–848 (2016).

    CAS  Article  Google Scholar 

  19. 19.

    Farrell, J. A. et al. Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis. Science 360, eaar3131 (2018).

    Article  Google Scholar 

  20. 20.

    Plusa, B., Piliszek, A., Frankenberg, S., Artus, J. & Hadjantonakis, A. K. Distinct sequential cell behaviours direct primitive endoderm formation in the mouse blastocyst. Development 135, 3081–3091 (2008).

    CAS  Article  Google Scholar 

  21. 21.

    Chazaud, C., Yamanaka, Y., Pawson, T. & Rossant, J. Early lineage segregation between epiblast and primitive endoderm in mouse blastocysts through the Grb2-MAPK pathway. Dev. Cell 10, 615–624 (2006).

    CAS  Article  Google Scholar 

  22. 22.

    Artus, J., Piliszek, A. & Hadjantonakis, A. K. The primitive endoderm lineage of the mouse blastocyst: sequential transcription factor activation and regulation of differentiation by Sox17. Dev. Biol. 350, 393–404 (2011).

    CAS  Article  Google Scholar 

  23. 23.

    Silva, J. et al. Nanog is the gateway to the pluripotent ground state. Cell 138, 722–737 (2009).

    CAS  Article  Google Scholar 

  24. 24.

    Schrode, N., Saiz, N., Di Talia, S. & Hadjantonakis, A. K. GATA6 levels modulate primitive endoderm cell fate choice and timing in the mouse blastocyst. Dev. Cell 29, 454–467 (2014).

    CAS  Article  Google Scholar 

  25. 25.

    Morgani, S. M. et al. A Sprouty4 reporter to monitor FGF/ERK signaling activity in ESCs and mice. Dev. Biol. 441, 104–126 (2018).

    CAS  Article  Google Scholar 

  26. 26.

    Molotkov, A., Mazot, P., Brewer, J. R., Cinalli, R. M. & Soriano, P. Distinct requirements for FGFR1 and FGFR2 in primitive endoderm development and exit from pluripotency. Dev. Cell 41, 511–526 (2017).

    CAS  Article  Google Scholar 

  27. 27.

    Kang, M., Garg, V. & Hadjantonakis, A. K. Lineage establishment and progression within the inner cell mass of the mouse blastocyst requires FGFR1 and FGFR2. Dev. Cell 41, 496–510 (2017).

    CAS  Article  Google Scholar 

  28. 28.

    Ohnishi, Y. et al. Cell-to-cell expression variability followed by signal reinforcement progressively segregates early mouse lineages. Nat. Cell Biol. 16, 27–37 (2014).

    CAS  Article  Google Scholar 

  29. 29.

    Kang, M., Piliszek, A., Artus, J. & Hadjantonakis, A. K. FGF4 is required for lineage restriction and salt-and-pepper distribution of primitive endoderm factors but not their initial expression in the mouse. Development 140, 267–279 (2013).

    CAS  Article  Google Scholar 

  30. 30.

    Thomas, P. Q., Brown, A. & Beddington, R. S. Hex: a homeobox gene revealing peri-implantation asymmetry in the mouse embryo and an early transient marker of endothelial cell precursors. Development 125, 85–94 (1998).

    CAS  Google Scholar 

  31. 31.

    Meno, C. et al. Mouse Lefty2 and zebrafish Antivin are feedback inhibitors of nodal signaling during vertebrate gastrulation. Mol. Cell 4, 287–298 (1999).

    CAS  Article  Google Scholar 

  32. 32.

    Belo, J. A. et al. Cerberus-like is a secreted factor with neutralizing activity expressed in the anterior primitive endoderm of the mouse gastrula. Mech. Dev. 68, 45–57 (1997).

    CAS  Article  Google Scholar 

  33. 33.

    Arnold, S. J. & Robertson, E. J. Making a commitment: cell lineage allocation and axis patterning in the early mouse embryo. Nat. Rev. Mol. Cell Biol. 10, 91–103 (2009).

    CAS  Article  Google Scholar 

  34. 34.

    Hayashi, S., Lewis, P., Pevny, L. & McMahon, A. P. Efficient gene modulation in mouse epiblast using a Sox2Cre transgenic mouse strain. Mech. Dev. 119, S97–S101 (2002).

    Article  Google Scholar 

  35. 35.

    Kwon, G. S. & Hadjantonakis, A. K. Transthyretin mouse transgenes direct RFP expression or Cre-mediated recombination throughout the visceral endoderm. Genesis 47, 447–455 (2009).

    CAS  Article  Google Scholar 

  36. 36.

    Muzumdar, M. D., Tasic, B., Miyamichi, K., Li, L. & Luo, L. A global double-fluorescent Cre reporter mouse. Genesis 45, 593–605 (2007).

    CAS  Article  Google Scholar 

  37. 37.

    Takaoka, K., Yamamoto, M. & Hamada, H. Origin and role of distal visceral endoderm, a group of cells that determines anterior–posterior polarity of the mouse embryo. Nat. Cell Biol. 13, 743–752 (2011).

    CAS  Article  Google Scholar 

  38. 38.

    Paca, A. et al. BMP signaling induces visceral endoderm differentiation of XEN cells and parietal endoderm. Dev. Biol. 361, 90–102 (2012).

    CAS  Article  Google Scholar 

  39. 39.

    Kruithof-de Julio, M. et al. Regulation of extra-embryonic endoderm stem cell differentiation by Nodal and Cripto signaling. Development 138, 3885–3895 (2011).

    CAS  Article  Google Scholar 

  40. 40.

    Artus, J. et al. BMP4 signaling directs primitive endoderm-derived XEN cells to an extraembryonic visceral endoderm identity. Dev. Biol. 361, 245–262 (2012).

    CAS  Article  Google Scholar 

  41. 41.

    Serra, M. et al. Pluripotent stem cell differentiation reveals distinct developmental pathways regulating lung- versus thyroid-lineage specification. Development 144, 3879–3893 (2017).

    CAS  Article  Google Scholar 

  42. 42.

    Becker, M. B., Zülch, A., Bosse, A. & Gruss, P. Irx1 and Irx2 expression in early lung development. Mech. Dev. 106, 155–158 (2001).

    CAS  Article  Google Scholar 

  43. 43.

    Yang, Y., Akinci, E., Dutton, J. R., Banga, A. & Slack, J. M. Stage specific reprogramming of mouse embryo liver cells to a beta cell-like phenotype. Mech. Dev. 130, 602–612 (2013).

    CAS  Article  Google Scholar 

  44. 44.

    Offield, M. F. et al. PDX-1 is required for pancreatic outgrowth and differentiation of the rostral duodenum. Development 122, 983–995 (1996).

    CAS  PubMed  Google Scholar 

  45. 45.

    Tsai, Y. H. et al. In vitro patterning of pluripotent stem cell-derived intestine recapitulates in vivo human development. Development 144, 1045–1055 (2017).

    CAS  Article  Google Scholar 

  46. 46.

    Deschamps, J. & van Nes, J. Developmental regulation of the Hox genes during axial morphogenesis in the mouse. Development 132, 2931–2942 (2005).

    CAS  Article  Google Scholar 

  47. 47.

    Di Gregorio, A., Bowling, S. & Rodriguez, T. A. Cell competition and its role in the regulation of cell fitness from development to cancer. Dev. Cell 38, 621–634 (2016).

    Article  Google Scholar 

  48. 48.

    Hiramatsu, R. et al. External mechanical cues trigger the establishment of the anterior-posterior axis in early mouse embryos. Dev. Cell 27, 131–144 (2013).

    CAS  Article  Google Scholar 

  49. 49.

    McCauley, H. A. & Wells, J. M. Pluripotent stem cell-derived organoids: using principles of developmental biology to grow human tissues in a dish. Development 144, 958–962 (2017).

    CAS  Article  Google Scholar 

  50. 50.

    van Dijk, D. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729 (2018).

    Article  Google Scholar 

  51. 51.

    Behringer, R. G. M., Nagy, K. V. and Nagy, A. Manipulating the Mouse Embryo: A Laboratory Manual, 4th edn (Cold Spring Harbor Laboratory, Cold Spring Harbor, 2014).

  52. 52.

    Downs, K. M. & Davies, T. Staging of gastrulating mouse embryos by morphological landmarks in the dissecting microscope. Development 118, 1255–1266 (1993).

    CAS  PubMed  Google Scholar 

  53. 53.

    Morgani, S. M. et al. Totipotent embryonic stem cells arise in ground-state culture conditions. Cell Reports 3, 1945–1957 (2013).

    CAS  Article  Google Scholar 

  54. 54.

    Zheng, G. H. et al. Massively parallel digital transcriptional profiling of single cells. Nat. Commun. 8, 14049 (2017).

    ADS  CAS  Article  Google Scholar 

  55. 55.

    Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

    CAS  Article  Google Scholar 

  56. 56.

    Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).

    CAS  Article  Google Scholar 

  57. 57.

    Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).

    Article  Google Scholar 

Download references


We thank K. Anderson, A. Joyner, A. Martinez-Arias and L. Mazutis for discussions; L. Beccari, D. Duboule, L. Sussel, M. Torres, D. Wellik and M. Wilkinson for plasmids; B. Merill for antibodies; and J. Brickman for embryonic stem cells. This work was supported by grants from the NIH (R01-DK084391 and R01-HD094868 to A.-K.H.; DP1-HD084071 and R01-CA164729 to D.P.; P30-CA008748 to C. Thompson), MSKCC Society for Special Projects and Functional Genomics Initiative (to A.-K.H. and D.P.) and NSERC (RGPIN-2018-05018 to P.A.H.). C.S.S. is supported by a NYSTEM postdoctoral training award from the Center for Stem Cell Biology MSKCC.

Reviewer information

Nature thanks Thorsten Boroviak, Valerie Wilson and the other anonymous reviewer(s) for their contribution to the peer review of this work.

Author information




S.N., M.S., D.M.C., P.A.H., A.-K.H. and D.P. conceived the research. S.N., Y.-Y.K. and V.G. collected cells from embryos with assistance from C.S.S., N.S. and A.-K.H. S.N., Y.-Y.K. and R.G. performed FACS experiments. S.N., V.G. and S.C.B. generated scRNA-seq libraries. S.C.B. and D.M.C. performed RNA-seq. M.S. and D.P. conceived and developed the Harmony and Palantir algorithms. M.S., V.L. and R.S. performed computational analyses of sequence data. V.G. performed blastocyst immunofluorescence. S.N. and Y.-Y.K. performed lineage-tracing, immunofluorescence and ISH on post-implantation embryos and gut tubes. S.N., M.S., A.-K.H. and D.P. analysed and interpreted the data, and wrote the manuscript with input from all authors.

Corresponding authors

Correspondence to Anna-Katerina Hadjantonakis or Dana Pe’er.

Ethics declarations

Competing interests

S.C.B. and D.M.C. are employees and shareholders at 10x Genomics.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Endoderm cell representation in mouse embryos, from blastocyst through midgestation, and single-cell collection pipeline.

a, Distribution of extra-embryonic endoderm cells (GFP, green) from blastocyst (E3.5) to midgestation (E8.75, 13ss) demarcated using PdgfraH2B-GFP20 (pre-implantation stages) and Afp-GFP12 (post-implantation stages) reporters. Extra-embryonic endoderm (PrE and visceral endoderm derivatives) cells contribute to the gut tube of the E8.75 embryo. b, Pie charts depicting the fraction of endoderm cells per embryo, for all stages analysed in this study. c, Schematic of protocol used for single-cell collection, with E8.75 gut tube provided as an example. Gut tubes were micro-dissected from embryos, then dissociated into single cells. Single cells of either anterior and posterior halves of gut tubes, or AFP–GFP-positive (visceral endoderm descendants) and AFP–GFP-negative (definitive endoderm descendants) collected using fluorescence-activated cell sorting, were used for single-cell 3′ mRNA library construction on the 10x Genomics Chromium platform. For bulk RNA-seq, whole gut tubes that had been dissociated into single cells and then pooled, whole intact gut tubes and whole gut tubes dissected into quarters were collected for sequencing. d, t-SNE plots of collected libraries for each time point with each dot representing a single cell. Phenograph was used to identify clusters of cells, colour-coded by cell type with annotation based on expression of known markers.

Extended Data Fig. 2 Computational pipeline and comparison of scRNA-seq data with bulk RNA-seq data.

a, Flow chart of computational data processing pipeline. b, Plots showing the Pearson’s correlation between aggregated scRNA-seq data of anterior and posterior halves of the gut tube with bulk RNA-seq of dissociated (and pooled) cells and bulk tissue, respectively. The two rows represent two replicates.

Extended Data Fig. 3 Mutually nearest neighbour augmentation to correct batch effects between time points, and Harmony unified framework for scRNA-seq data analysis.

a, Force-directed layouts for cells of the E3.5, E4.5, E5.5, E6.5, E7.5 and E8.75 (amalgamation of anterior and posterior gut tube halves) time points. Cells are coloured by time point. The graph was generated using an adjacency matrix derived from the standard kNN graph. Differences between consecutive time points represent underlying developmental changes, but are also confounded by technical batch effects—including discontinuity between E3.5 and E.4.5, and the lack of spatial alignment between E6.5 and E7.5. b, E6.5 and E7.5 cells projected along their respective first two diffusion components. These projections reveal a dominant first component with strong spatial signal within individual time points. Cells are coloured by Phenograph clusters. c, Top, the number of edges connecting cells between time points are limited in the kNN graph. Bottom, plots showing the number of mutually nearest neighbours (MNNs) between E6.5 and E7.5 time points. The MNNs are enriched along the boundary between time points, providing support for augmentation of the kNN graph with additional edges between MNNs between the consecutive time points. d, The MNN distances can be converted to affinities on a similar scale as the kNN affinities, using linear regression to determine the relationship between the two scales, kNN and (l < k) MNN distances. e, Example of the augmented MNN affinity matrix construction. Left, kNN affinities for a subset of E6.5 and E7.5 cells. Middle, MNN affinity matrix determined using linear regression (in d) to convert distances E6.5 and E7.5 cells to affinities. Right, augmented affinity matrix: sum of the kNN and MNN affinity matrices. f, Comparison of force-directed layouts. Left, standard kNN affinity matrix. Middle, Harmony augmented affinity matrix. Right, plot generated using mnnCorrect18 for global batch effect correction, leading to ‘over-correction’ and loss in signal between time points. g, Harmony framework starts with the augmented affinity matrix generated as described in Supplementary Note 2. The augmented affinity matrix is used to generate the force-directed graph for visualizing the data. The same augmented matrix is used to compute the diffusion operator for determining the diffusion components, which forms the basis for (1) Palantir trajectory detection and (2) MAGIC imputation. h, Robustness of Harmony. Plots showing the correlation between diffusion components for different values of k, the number of nearest neighbours for kNN graph construction. Visceral endoderm cells in Fig. 4 were used for testing robustness. i, Harmony applied to replicates. Plots showing the Pearson’s correlation between diffusion components without Harmony (x axis) and with Harmony applied between the two replicates of the E8.75 gut tube. Plots shown for 3,512 cells.

Extended Data Fig. 4 Lineage decisions in the mammalian blastocyst.

Results from pooling cells of two replicates at E3.5 and E4.5, followed by Harmony augmentation. a, Force-directed layout of E3.5 and E4.5 cells depicting the relationship between three blastocyst lineages. Cells coloured by time point or annotated cell types. b, Plot showing projection of E3.5 and E4.5 cells along the first two diffusion components. Distances between lineages were computed using multiscale distances. c, Table showing the connectivity between different compartments in a kNN graph of E3.5 cells. Each row represents the fraction of outgoing edges from cells of the respective compartment that connect to cells in the compartments specified in the columns. d, Force-directed layout of E3.5 and E4.5 cells after removal of trophectoderm cells, showing relationships between ICM, EPI and PrE cells. Cells are coloured by time point or Phenograph14 clusters. e, Palantir10,11 determined pseudo-time ordering, differentiation potential and branch probabilities of PrE and EPI cell lineages. f, Plots showing the second derivative of PrE and EPI differentiation potential along pseudo-time, which suggests that changes in differentiation potential—and therefore lineage commitment—in both lineages occur at E3.5. Points of highest changes along pseudo-time represent inferred lineage specification and commitment. g, Distribution of E3.5 lineage cells along pseudo-time. Each distribution represents cells from one Phenograph cluster. h, Histograms showing the distribution of differentiation potential (left), PrE fate probability (middle) and EPI fate probability (right) in the E3.5 ICM clusters. i, Gene expression patterns of parietal and visceral endoderm markers. Each cell is coloured on the basis of its MAGIC50 imputed expression level for the indicated gene. Black and orange arrowheads mark presumptive parietal and visceral endoderm lineages, respectively.

Extended Data Fig. 5 Gene expression trends in EPI, PrE, visceral and parietal endoderm lineages in the blastocyst.

a, Plots comparing gene expression trends along pseudo-time for genes that encode components of the FGF signalling pathway (Fgf4, Fgf5, Fgf8, Fgfr1, Fgfr2 and Spry4), the endoderm-marker transcription factors Gata6, Gata4, Sox7 and Sox17, and Nanog during EPI and PrE lineage specification. Solid line represents the mean expression trend and shaded region represents ± 1 s.d. b, Dynamics of transcription factor ratios as lineages emerge. Ratio between Gata6 and Nanog, and Gata6 and Fgf4 along EPI; and between Nanog and Gata6, and Fgf4 and Gata6 along PrE, compared to changes in differentiation potential (dotted line). Transcription factor ratios were computed for each cell by using the MAGIC50 imputed data for each gene. c, Plots comparing gene expression trends along pseudo-time. Gata and Sox transcription factors, Fgf receptors or Fgf ligands during PrE or EPI specification. Colours at the bottom of each panel represent time point and—where applicable—E3.5 and E4.5 Phenograph clusters. Dashed lines represent branch probabilities in commitment towards respective lineages. d, Gene expression patterns of FGF signalling pathway components, and Gata and Sox transcription factor genes. Orange, black and green arrowheads point to high expression in parietal endoderm, visceral endoderm and EPI, respectively. e, Laser-scanning confocal data depicting TCF7L1 expression at E3.5 (top panel) (n = 14) and E4.5 (bottom panel) (n = 11). SOX2 and GATA6 were used as EPI and PrE lineage markers, respectively. f, Gene expression patterns of Tcf7l1 and Nanog depicting similar expression of Tcf7l1 in EPI as Fgf4 (green arrowhead).

Extended Data Fig. 6 Force-directed layouts of single E5.5 cells reveal relationships between EPI, visceral endoderm and ExE lineages.

a, Force-directed layouts of E5.5 data generated after pooling replicates, showing the relationship between EPI, visceral endoderm and ExE lineages. Cells are coloured by cell type. Black arrowheads mark cells that transdifferentiate from EPI to visceral endoderm. b, Plot showing the projection of EPI, visceral endoderm and ExE cells along the first two diffusion components. Distances between lineages were computed using multiscale distances. c, Plots highlighting the extremes of the diffusion components, serving as the boundaries of the phenotypic space for each lineage identity. d, Plots showing the shortest path-step sizes for paths from EPI to visceral endoderm (left) and EPI to ExE (right). e, Gene expression plots of AVE (Cer1 and Dkk1), visceral endoderm (Eomes, Foxa1 and Ttr), and visceral endoderm and EPI (Nodal) markers along EPI, and PrE and visceral endoderm lineages from E3.5–E5.5. Cells coloured on the basis of marker expression of indicated gene after MAGIC50. f, Laser-scanning confocal images of E5.5 and E6.0 Sox2-creTG/+;Rosa26mT/mG and Ttr-creTG/+;Rosa26mT/mG embryos immunostained for GFP, RFP (red fluorescent protein, membrane-localized tdTomato) and GATA6 (a marker of endoderm identity). Cell nuclei stained with Hoechst, and membranes labelled with RFP. Yellow arrowheads point to cells of EPI origin that are present within the visceral endoderm epithelial layer (n = 10/20 GFP-positive cells in visceral endoderm of Sox2-creTG/+;Rosa26mT/mG embryos; n = 0/27 GFP-positive cells in the EPI of Ttr-creTG/+;Rosa26mT/mG embryos). Results validated in at least three independent experiments. Scale bars, 50 μm (low-magnification images), 20 μm (high-magnification images). g, Laser-scanning confocal images of two E5.5 wild-type 4n embryo <-> H2B-tdTomato embryonic stem cell (ESC) chimaeras. Top two rows and bottom two rows represent low- and high-magnification 2D images, respectively (n = 9/19 embryo chimaeras showed tdTomato-positive cells in the visceral endoderm). Yellow arrowheads indicate EPI cells intercalating into the visceral endoderm layer. Embryos are counterstained with Hoechst to label nuclei, and phalloidin to label F-actin. Scale bars, 20 μm (low-magnification images), 10 μm (high-magnification images).

Extended Data Fig. 7 Emergence of spatial patterning of the embryo at E5.5.

a, Plot showing Palantir pseudo-time versus differentiation potential of visceral endoderm cells from stage E3.5 to E8.75. Drops in differential potential occur at two time points. The first occurs at E5.5, as cells acquire a distal versus proximal fate; the second occurs at E7.5, as cells acquire an anterior versus posterior fate. b, Plots of branch probabilities of commitment towards yolk sac endoderm, and anterior and posterior gut endoderm. c, Marker based (top left) and bulk RNA-seq based (top right) prediction of exVE and emVE at E7.5. Bottom, plots show the Pearson’s correlation between bulk RNA-seq replicates of exVE and emVE. d, Plots show differentially expressed genes between exVE (291 genes) and emVE (2,239 genes) derived using bulk RNA-seq data. e, Plots showing the branch probabilities of E7.5, E6.5 and E5.5 exVE and emVE cells to commit towards yolk sac endoderm (extra-embryonic) and gut tube (embryonic). Cells are labelled as exVE and emVE on the basis of expression of known markers (leftmost plot), match expected Palantir branch probabilities (four plots on the right). The branch probabilities of E5.5 cells in committing towards yolk sac endoderm and gut tube were used to infer putative exVE and emVE identities at E5.5. f, Plot showing pseudo-time versus differentiation potential of endoderm cells at E5.5, coloured by the inferred cell type. This panel is a magnified view of a. g, Heat maps of highly expressed genes specifically in exVE or emVE at E5.5 also distinguish exVE and emVE cells at E6.5 and E7.5. h, ISH of E6.25 embryos showing expression of Lhx1 (n = 3) and Lefty1 (n = 3), which are genes that are specific for emVE, and Apln (n = 3) and Msx1 (n = 3), which are genes that are specific for exVE. Scale bars, 50 μm.

Extended Data Fig. 8 Characterization of E8.75 gut tube anterior–posterior pseudo-space.

a, Force-directed layout as in Fig. 5. Top, plots show the probabilities of anterior–posterior positioning for the AFP–GFP-positive and AFP–GFP-negative cells inferred using the manifold classifier trained on anterior–posterior cells. Bottom, plots show the probabilities of GFP-positive and GFP-negative status for the cells from the anterior–posterior compartment, inferred using the manifold classifier trained on GFP-positive and GFP-negative cells. b, Top, anterior and posterior cells labelled by measured data (leftmost two columns). Anterior and posterior positions of AFP–GFP-positive and AFP–GFP-negative cells inferred (rightmost two columns) using probabilities in a (top panels). Bottom, GFP-positive and GFP-negative cells labelled by measured data (leftmost two panels). GFP-positive and GFP-negative status of the anterior–posterior compartment cells inferred (rightmost two panels) using probabilities in a (bottom panels). c, Top left, plot showing the first diffusion component of the E8.75 cells. Top middle, top right, plots showing the expression of anterior marker Nkx2.1 and posterior marker Hoxb9 in E8.75 cells. Bottom, bulk RNA-seq expression of Nkx2-1 and Hoxb9 in quadrants of the gut tube along the anterior–posterior axis compares with anterior–posterior single-cell expression patterns. d, Plot showing the proportion of anterior and posterior cells in bins along the anterior–posterior pseudo-space. e, Heat map showing Pearson’s correlations between anterior–posterior pseudo-space orderings, determined using a varying number of diffusion components and highlighting the robustness of the ordering. f, Plots comparing the anterior–posterior pseudo-space ordering of GFP-positive and GFP-negative cells (replicate 2, 13,335 cells) generated de novo using only the replicate 2 cells (x axis, left) with the projected ordering from replicate 1 (8,143 cells) (y axis). Right, similar comparison with the pseudo-space ordering determined using cells of both the replicates on the x axis. g, Same as f, for replicates of anterior–posterior cells (replicate 1, 1,821 cells; replicate 2, 1,691 cells). Plots show the Pearson’s correlation. h, ROC for classification of E7.5 visceral and definitive endoderm cells (4,378 cells). i, Plots showing the expression patterns of genes that are best predictive of the definitive endoderm class in the visceral and definitive endoderm classifier (top, definitive endoderm; bottom, visceral endoderm). j, Plots showing the expression patterns of genes in the definitive endoderm that are best predictive of visceral endoderm class in the visceral and definitive endoderm classifier. k, Force-directed layouts following Harmony of E7.5 and E8.75 visceral and definitive endoderm cells, with E7.5 cells coloured in red (definitive endoderm) and blue (visceral endoderm) (left). E7.5 visceral and definitive endoderm cells coloured by the branch probability of anterior localization (middle) and posterior localization (right). Black arrowheads indicate early emergence of anterior–posterior spatial patterning at E7.5, with E7.5 definitive endoderm cells predominantly destined towards the anterior, and visceral endoderm cells predominantly destined towards the posterior. l, Three-dimensional renderings of gut tube, depicting all endoderm cells along the anterior–posterior axis. Nuclei of visceral and definitive endoderm cells are labelled in green and grey, respectively. m, Plots comparing the ranks of proportion of GFP-positive cells along anterior–posterior positioning in the AFP–GFP-embryo-derived Neurolucida reconstructed gut tube replicates (x axis), and the ranks of visceral endoderm cell proportions in bins along the anterior–posterior pseudo-space axis (y axis); the anterior–posterior axis was partitioned into 20 bins; each dot represents the fraction of visceral endoderm cells in that bin.

Extended Data Fig. 9 Spatial patterning of the gut tube at E8.75.

a, Plots showing individual Phenograph cluster densities of the E8.75 gut tube cells ordered along the anterior–posterior pseudo-space (left) and in force-directed layouts (middle). ISH of representative differentially expressed genes in each cluster on whole E8.75 embryos (n > 3 for each gene) or micro-dissected E8.75 gut tubes (n > 3 for each gene) (right). Arrowheads point to expression of representative gene for each particular cluster. Scale bars, 200 μm (except for Nkx2-1, 100 μm). no, notochord. b, Density of E8.75 cells along the anterior–posterior pseudo-space axis. c, Comparison of empirical anterior–posterior pseudo-space axis and the predicted anterior–posterior pseudo-space, using expression of transcription factors. Each dot represents the anterior–posterior pseudo-space of a cell computed by all genes, versus pseudo-space prediction by the selected transcription factors alone. d, Plot showing the ranking of different transcription factors according to their predictive power, on the basis of the regression model. e, Heat map showing the coefficients for the top transcription factors when different proportions of cells are subsampled for the regression (total cells, 24,990). f, Heat map showing the Pearson’s correlation of transcription factor coefficients in e, highlighting the robustness of transcription factor coefficients in regression.

Extended Data Fig. 10 Hox gene expression within the E8.75 gut tube.

a, Force-directed layouts showing Hox genes expressed in gut endoderm cells at E8.75. b, Whole-mount mRNA ISH on whole E8.75 embryos (n > 3 for each gene) and micro-dissected gut tubes (n > 3 for each gene) of Hox genes, depicting the distribution of Hox genes along the anterior–posterior axis. Scale bars, 100 μm (Hoxc10 and Hoxd11), 200 μm (all other panels).

Extended Data Fig. 11 Signalling map of the gut tube of the E8.75 mouse embryo.

Force-directed layouts of context-independent targets of key signalling pathways acting within the endoderm lineage of the embryo. Fibroblast growth factor (FGF); WNT; bone morphogenic protein (BMP); NOTCH; Hedgehog (HH); NODAL and TGFβ signalling (NODAL); JAK and STAT; retinoic acid; and HIPPO.

Supplementary information

Supplementary Information

This file contains Supplementary Notes, Supplementary Figures 6-11, and Supplementary References.

Reporting Summary

Supplementary Figures 1-5.

Supplementary Table 1

List of differentially expressed genes for E3.5 clusters.

Supplementary Table 2

List of differentially expressed genes for E4.5 clusters.

Supplementary Table 3

List of clusters of genes with similar dynamics in EPI and VE differentiation from E3.5–E5.5.

Supplementary Table 4

List of genes that are exVE and emVE specific at E5.5.

Supplementary Table 5

List of genes that are most predictive of VE and DE classes in the E7.5 classifier.

Supplementary Table 6

List of differentially expressed genes in the E8.75 gut tube clusters.

Supplementary Table 7

mRNA in situ hybridization probes.

Supplementary Table 8

mRNA in situ hybridization probes cloned from E8.5 embryo RNA by RT–PCR.

Supplementary Table 9


Supplementary Table 10


Supplementary Table 11


Supplementary Table 12

Numbers of GFP+ and GFP cells in gut tubes of 13ss embryos.

Supplementary Table 13

List of bulk RNA-seq samples.

Supplementary Table 14

Palantir parameters.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Nowotschin, S., Setty, M., Kuo, YY. et al. The emergent landscape of the mouse gut endoderm at single-cell resolution. Nature 569, 361–367 (2019).

Download citation

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing