Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Article
  • Published:

Gene family evolution underlies cell-type diversification in the hypothalamus of teleosts

Abstract

Hundreds of cell types form the vertebrate brain but it is largely unknown how similar cellular repertoires are between or within species or how cell-type diversity evolves. To examine cell-type diversity across and within species, we performed single-cell RNA sequencing of ~130,000 hypothalamic cells from zebrafish (Danio rerio) and surface and cave morphs of Mexican tetra (Astyanax mexicanus). We found that over 75% of cell types were shared between zebrafish and Mexican tetra, which diverged from a common ancestor over 150 million years ago. Shared cell types displayed shifts in paralogue expression that were generated by subfunctionalization after genome duplication. Expression of terminal effector genes, such as neuropeptides, was more conserved than the expression of their associated transcriptional regulators. Species-specific cell types were enriched for the expression of species-specific genes and characterized by the neofunctionalization of expression patterns of members of recently expanded or contracted gene families. Comparisons between surface and cave morphs revealed differences in immune repertoires and transcriptional changes in neuropeptidergic cell types associated with genomic differences. The single-cell atlases presented here are a powerful resource to explore hypothalamic cell types and reveal how gene family evolution and shifts in paralogue expression contribute to cellular diversity.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Integration of zebrafish and Mexican tetra single-cell data reveals extensive conservation of cell types.
Fig. 2: Shared subclusters are highly similar between species and express paralogous genes.
Fig. 3: Paralogue shifts are due to differential divergence after duplication between species.
Fig. 4: Divergence of GRNs underlying neuronal genes.
Fig. 5: Species-specific subclusters are associated with species-specific genes.
Fig. 6: Divergence in subcluster repertoires and transcriptomes across Pachon, Tinaja and Molino cave morphs.

Similar content being viewed by others

Data availability

Processed scRNA-seq counts and metadata, marker gene lists, trinarized gene lists, SI results, SCENIC results, results from genetic analysis and GO lists are available as Supplementary Data. Raw sequencing results are available at the Sequence Read Archive (SRA) under BioProject ID PRJNA754013.

References

  1. Ramón y Cajal, S. Histology of the Human Nervous System of Man and Vertebrates (Maloine, 1911).

  2. Zeng, H. & Sanes, J. R. Neuronal cell-type classification: challenges, opportunities and the path forward. Nat. Rev. Neurosci. 18, 530–546 (2017).

    Article  CAS  PubMed  Google Scholar 

  3. Arendt, D., Bertucci, P. Y., Achim, K. & Musser, J. M. Evolution of neuronal types and families. Curr. Opin. Neurobiol. 56, 144–152 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Ohno, S. Evolution by Gene Duplication (Springer, 1970).

  5. Shafer, M. E. R. Cross-species analysis of single-cell transcriptomic data. Front. Cell Dev. Biol. 7, 175 (2019).

  6. Tosches, M. A. et al. Evolution of pallium, hippocampus, and cortical cell types revealed by single-cell transcriptomics in reptiles. Science 360, 881–888 (2018).

    Article  CAS  PubMed  Google Scholar 

  7. Briggs, J. A. et al. The dynamics of gene expression in vertebrate embryogenesis at single-cell resolution. Science 360, eaar5780 (2018).

  8. La Manno, G. et al. Molecular diversity of midbrain development in mouse, human, and stem cells. Cell 167, 566–580 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  9. Pollen, A. A. et al. Establishing cerebral organoids as models of human-specific brain evolution. Cell 176, 743–756 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Kanton, S. et al. Organoid single-cell genomic atlas uncovers human-specific features of brain development. Nature 574, 418–422 (2019).

    Article  CAS  PubMed  Google Scholar 

  11. Krienen, F. M. et al. Innovations present in the primate interneuron repertoire. Nature 586, 262–269 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Pasquier, J. et al. Gene evolution and gene expression after whole genome duplication in fish: the PhyloFish database. BMC Genomics 17, 368 (2016).

    Article  PubMed  PubMed Central  Google Scholar 

  13. Ravi, V. & Venkatesh, B. The divergent genomes of teleosts. Annu. Rev. Anim. Biosci. 6, 47–68 (2018).

    Article  CAS  PubMed  Google Scholar 

  14. Voldoire, E., Brunet, F., Naville, M., Volff, J.-N. & Galiana, D. Expansion by whole genome duplication and evolution of the sox gene family in teleost fish. PLoS ONE 12, e0180936 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  15. Gross, J. B. The complex origin of Astyanax cavefish. BMC Evol. Biol. 12, 105 (2012).

    Article  PubMed  PubMed Central  Google Scholar 

  16. Keene, A., Yoshizawa, M. & McGaugh, S. E. Biology and Evolution of the Mexican Cavefish (Academic Press, 2015).

  17. Mitchell, R. W., Russell, W. H. & Elliott, W. R. Mexican Eyeless Characin Fishes, Genus Astyanax: Environment, Distribution, and Evolution (Texas Tech Press, 1977).

  18. Herman, A. et al. The role of gene flow in rapid and repeated evolution of cave-related traits in Mexican tetra, Astyanax mexicanus. Mol. Ecol. 27, 4397–4416 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Prober, D. A., Rihel, J., Onah, A. A., Sung, R.-J. & Schier, A. F. Hypocretin/orexin overexpression induces an insomnia-like phenotype in zebrafish. J. Neurosci. 26, 13400–13410 (2006).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Richter, C., Woods, I. G. & Schier, A. F. Neuropeptidergic control of sleep and wakefulness. Annu. Rev. Neurosci. 37, 503–531 (2014).

    Article  CAS  PubMed  Google Scholar 

  21. Xie, Y. & Dorsky, R. I. Development of the hypothalamus: conservation, modification and innovation. Development 144, 1588–1599 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. Denes, A. S. et al. Molecular architecture of annelid nerve cord supports common origin of nervous system centralization in Bilateria. Cell 129, 277–288 (2007).

    Article  CAS  PubMed  Google Scholar 

  23. Liu, J. et al. Evolutionarily conserved regulation of hypocretin neuron specification by Lhx9. Development 142, 1113–1124 (2015).

    CAS  PubMed  PubMed Central  Google Scholar 

  24. Liang, C., Musser, J. M., Cloutier, A., Prum, R. O. & Wagner, G. P. Pervasive correlated evolution in gene expression shapes cell and tissue type transcriptomes. Genome Biol. Evol. 10, 538–552 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Gu, X. Understanding tissue expression evolution: from expression phylogeny to phylogenetic network. Brief. Bioinform. 17, 249–254 (2016).

    Article  PubMed  Google Scholar 

  26. Dunn, C. W., Zapata, F., Munro, C., Siebert, S. & Hejnol, A. Pairwise comparisons across species are problematic when analyzing functional genomic data. Proc. Natl Acad. Sci. USA 115, E409–E417 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Farré, D. & Albà, M. M. Heterogeneous patterns of gene-expression diversification in mammalian gene duplicates. Mol. Biol. Evol. 27, 325–335 (2010).

    Article  PubMed  Google Scholar 

  28. Pratapa, A., Jalihal, A. P., Law, J. N., Bharadwaj, A. & Murali, T. M. Benchmarking algorithms for gene regulatory network inference from single-cell transcriptomic data. Nat. Methods 17, 147–154 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Aibar, S. et al. SCENIC: single-cell regulatory network inference and clustering. Nat. Methods 14, 1083–1086 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Yates, A. D. et al. Ensembl 2020. Nucleic Acids Res. 48, D682–D688 (2020).

    CAS  PubMed  Google Scholar 

  31. Santos, M. E., Bouquin, A. L., Crumière, A. J. J. & Khila, A. Taxon-restricted genes at the origin of a novel trait allowing access to a new environment. Science 358, 386–390 (2017).

    Article  CAS  PubMed  Google Scholar 

  32. Cao, J. & Lv, Y. Evolutionary analysis of the jacalin-related lectin family genes in 11 fishes. Fish Shellfish Immunol. 56, 543–553 (2016).

    Article  CAS  PubMed  Google Scholar 

  33. Burgoyne, R. D. Neuronal calcium sensor proteins: generating diversity in neuronal Ca2+ signalling. Nat. Rev. Neurosci. 8, 182–193 (2007).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Jaggard, J. B. et al. Hypocretin underlies the evolution of sleep loss in the Mexican cavefish. eLife 7, e32637 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  35. Alié, A. et al. Developmental evolution of the forebrain in cavefish, from natural variations in neuropeptides to behavior. eLife 7, e32808 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  36. Peuß, R. et al. Adaptation to low parasite abundance affects immune investment and immunopathological responses of cavefish. Nat. Ecol. Evol. 4, 1416–1430 (2020).

    Article  PubMed  Google Scholar 

  37. Eisenberg, T. et al. Induction of autophagy by spermidine promotes longevity. Nat. Cell Biol. 11, 1305–1314 (2009).

    Article  CAS  PubMed  Google Scholar 

  38. Duffy, C. M., Xu, H., Nixon, J. P., Bernlohr, D. A. & Butterick, T. A. Identification of a fatty acid binding protein4–UCP2 axis regulating microglial mediated neuroinflammation. Mol. Cell. Neurosci. 80, 52–57 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Zhang, H. et al. Polyamines control eIF5A hypusination, TFEB translation, and autophagy to reverse B cell senescence. Mol. Cell 76, 110–125 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Rodrigues, F. T. S. et al. Major depression model induced by repeated and intermittent lipopolysaccharide administration: long-lasting behavioral, neuroimmune and neuroprogressive alterations. J. Psychiatr. Res. 107, 57–67 (2018).

    Article  PubMed  Google Scholar 

  41. Raj, B. et al. Simultaneous single-cell profiling of lineages and cell types in the vertebrate brain. Nat. Biotechnol. 36, 442–450 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Saunders, A. et al. Molecular diversity and specializations among the cells of the adult mouse brain. Cell 174, 1015–1030 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Hughes, L. C. et al. Comprehensive phylogeny of ray-finned fishes (Actinopterygii) based on transcriptomic and genomic data. Proc. Natl Acad. Sci. USA 115, 6249–6254 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Nakatani, M., Miya, M., Mabuchi, K., Saitoh, K. & Nishida, M. Evolutionary history of Otophysi (Teleostei), a major clade of the modern freshwater fishes: Pangaean origin and Mesozoic radiation. BMC Evol. Biol. 11, 177 (2011).

    Article  PubMed  PubMed Central  Google Scholar 

  45. Tarashansky, A. J. et al. Mapping single-cell atlases throughout Metazoa unravels cell type evolution. eLife 10, e66747 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Kondrashov, F. A. & Koonin, E. V. A common framework for understanding the origin of genetic dominance and evolutionary fates of gene duplications. Trends Genet. 20, 287–290 (2004).

    Article  CAS  PubMed  Google Scholar 

  47. Thomson, G. J. et al. Metabolism-induced oxidative stress and DNA damage selectively trigger genome instability in polyploid fungal cells. EMBO J. 38, e101597 (2019).

    Article  PubMed  PubMed Central  Google Scholar 

  48. Gillard, G. B. et al. Comparative regulomics supports pervasive selection on gene dosage following whole genome duplication. Genome Biol. 22, 103 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Konstantinides, N. et al. Phenotypic convergence: distinct transcription factors regulate common terminal features. Cell 174, 622–635 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. True, J. R. & Haag, E. S. Developmental system drift and flexibility in evolutionary trajectories. Evol. Dev. 3, 109–119 (2001).

    Article  CAS  PubMed  Google Scholar 

  51. Hilgers, L., Hartmann, S., Hofreiter, M. & von Rintelen, T. Novel genes, ancient genes, and gene co-option contributed to the genetic basis of the radula, a molluscan innovation. Mol. Biol. Evol. 35, 1638–1652 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Florio, M. et al. Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex. eLife 7, e32332 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  53. Arendt, D. et al. The origin and evolution of cell types. Nat. Rev. Genet. 17, 744–757 (2016).

    Article  CAS  PubMed  Google Scholar 

  54. Viets, K., Eldred, K. C. & Johnston, R. J. Mechanisms of photoreceptor patterning in vertebrates and invertebrates. Trends Genet. 32, 638–659 (2016).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Bowmaker, J. K. Evolution of vertebrate visual pigments. Vis. Res. 48, 2022–2041 (2008).

    Article  CAS  PubMed  Google Scholar 

  56. Chitnis, T. & Weiner, H. L. CNS inflammation and neurodegeneration. J. Clin. Invest. 127, 3577–3587 (2017).

    Article  PubMed  PubMed Central  Google Scholar 

  57. Riddle, M. R. et al. Insulin resistance in cavefish as an adaptation to a nutrient-limited environment. Nature 555, 647–651 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Elipot, Y. et al. A mutation in the enzyme monoamine oxidase explains part of the Astyanax cavefish behavioural syndrome. Nat. Commun. 5, 3647 (2014).

    Article  PubMed  Google Scholar 

  59. Yoshizawa, M. et al. Distinct genetic architecture underlies the emergence of sleep loss and prey-seeking behavior in the Mexican cavefish. BMC Biol. 13, 15 (2015).

    Article  PubMed  PubMed Central  Google Scholar 

  60. Fischer, E. K. & O’Connell, L. A. Modification of feeding circuits in the evolution of social behavior. J. Exp. Biol. 220, 92–102 (2017).

    Article  PubMed  Google Scholar 

  61. Kroeger, D. et al. Galanin neurons in the ventrolateral preoptic area promote sleep and heat loss in mice. Nat. Commun. 9, 4129 (2018).

    Article  PubMed  PubMed Central  Google Scholar 

  62. Yamashita, J. et al. Male-predominant galanin mediates androgen-dependent aggressive chases in medaka. eLife 9, e59470 (2020).

  63. Wee, C. L. et al. Social isolation modulates appetite and defensive behavior via a common oxytocinergic circuit in larval zebrafish. Preprint at bioRxiv https://doi.org/10.1101/2020.02.19.956854 (2021).

  64. Elipot, Y., Legendre, L., Père, S., Sohm, F. & Rétaux, S. Astyanax transgenesis and husbandry: how cavefish enters the laboratory. Zebrafish 11, 291–299 (2014).

    Article  CAS  PubMed  Google Scholar 

  65. Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  66. Zeisel, A. et al. Molecular architecture of the mouse nervous system. Cell 174, 999–1014 (2018).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Sawh, A. N. et al. Lamina-dependent stretching and unconventional chromosome compartments in early C. elegans embryos. Mol. Cell 78, 96–111 (2020).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Strimmer, K. fdrtool: a versatile R package for estimating local and tail area-based false discovery rates. Bioinformatics 24, 1461–1462 (2008).

    Article  CAS  PubMed  Google Scholar 

  70. Fresno, C. & Fernández, E. A. RDAVIDWebService: a versatile R interface to DAVID. Bioinformatics 29, 2810–2811 (2013).

    Article  CAS  PubMed  Google Scholar 

  71. Singh, P. P. & Isambert, H. OHNOLOGS v2: a comprehensive resource for the genes retained from whole genome duplication in vertebrates. Nucleic Acids Res. 48, D724–D730 (2020).

    CAS  PubMed  Google Scholar 

  72. Smith, R. N. et al. InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data. Bioinformatics 28, 3163–3165 (2012).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Wilkinson, S. P. & Davy, S. K. phylogram: an R package for phylogenetic analysis with nested lists. J. Open Source Softw. 3, 790 (2018).

    Article  Google Scholar 

  74. Yu, G. Using ggtree to visualize data on tree-like structures. Curr. Protoc. Bioinformatics 69, e96 (2020).

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

We thank C. J. Tabin for providing A. mexicanus samples and advice on experimental design. We thank members of the Schier laboratory for discussion and advice, including B. Raj, J. Liu, P. Abitua and A. Nichols and the Harvard zebrafish and cavefish facilities staff, including B. Martineau, for technical support. We thank G. Camp, W. Salzburger and N. Jurisch-Yaksi for helpful comments on the manuscript. This work was supported by a postdoctoral fellowship from the Canadian Institutes of Health Research to M.E.R.S., a grant from the Swiss National Science Foundation (SNSF) to M.E.R.S. (SPARK 196313), grants from SNSF (SPARK 195955) and the University of Basel to A.N.S. and a National Institutes of Health grant (DP1HD094764), an ERC Advanced grant (834788), an Allen Discovery Center grant and a McKnight Foundation Technological Innovations in Neuroscience Award to A.F.S.

Author information

Authors and Affiliations

Authors

Contributions

M.E.R.S. and A.F.S. conceived and designed the study. A.N.S. and M.E.R.S. conceived and performed SI analysis. M.E.R.S. performed all other experiments and analysis, including scRNA-seq experiments and all bioinformatic analysis. M.E.R.S., A.N.S. and A.F.S. wrote the manuscript. All authors read and approved of the manuscript.

Corresponding authors

Correspondence to Maxwell E. R. Shafer or Alexander F. Schier.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Ecology & Evolution thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Hypothalamic and POA cell types in zebrafish and Mexican tetra.

(a) UMAP of zebrafish cells coloured and labelled by annotated cell type. (b) UMAP of Mexican tetra surface- and cave-morphs coloured and labelled by annotated cell type. (c) DotPlot of the top 2 maker genes for each zebrafish cluster from (a). (d) DotPlot of the top 2 marker genes for each Mexican tetra cluster from (b). Examples of potentially homologous cell types and their top marker genes share a colour (blue, green, red) in (c) and (d). (e) UMAP of merged but not batch-corrected zebrafish and Mexican tetra single-cell datasets.

Extended Data Fig. 2 Marker genes for cell types shared between zebrafish and Mexican tetra.

(a) DotPlot of the top 5 marker genes for each integrated cluster. (b) Proportion of cells from each cluster by species or species-morph (height of each bar along the x-axis). Width of each bar along the y-axis indicates the proportion of that cluster in the integrated data. Red outlines indicate the Mexican tetra-specific Ciliated cluster, and the integrated Immune clusters which are over-represented in the Mexican tetra dataset. (c) Density plot of the number of subclusters versus the fraction of each subcluster that is either from the zebrafish or Mexican tetra dataset. Subclusters with the majority of cells from the zebrafish dataset are shown in purple, and those with the majority of cells from the Mexican tetra dataset in yellow.

Extended Data Fig. 3 Shared subclusters are highly similar due to paralogous gene expression.

(a) Gene orthology confidence from Ensembl for all marker genes, or those marker genes which were paralogs of a marker gene in the other species. (b) Gene order score from Ensembl for all marker genes, or those marker genes which were paralogs of a marker gene in the other species. (c) The percentage of conserved, species-specific, and species-specific paralogous subcluster marker genes corrected by SCORPiOS synteny-correction. (d) The percentage of morph-specific marker genes for each subcluster which were paralogs of either the conserved or opposite species-specific marker gene for surface- and cave-morphs of Mexican tetra. (e) The odds ratio for the enrichment of paralogs in the species-specific genes for each subcluster for zebrafish and Mexican tetra. (f) The row-scaled ΔSI for all subclusters between zebrafish and Mexican tetra. Yellow indicates the highest ΔSI value between Mexican tetra and zebrafish subclusters. For all boxplots, box bounds represent the first and third quartiles and whiskers 1.5 times the interquartile range, thicker line represents the median.

Extended Data Fig. 4 Paralog shifts are associated with loss of ancestral gene expression patterns.

(a-b) Empirical cumulative distribution function (ECDF) for expression divergence (dT) for paralogous gene pairs. (c-d) ECDF of the number of cell types that have overlapping expression patterns within ancestral cell types for paralogous genes pairs (redundancy score, orange highlight in b). (e-f) ECDF of the number of non-ancestral cell types expressing each individual paralogous gene. Results for c-e are grouped by the age of the duplication inferred from the last common ancestor (LCA) which had both genes - from the oldest (Opisthokonta, yellow), to the most recent common ancestor (Otophysi, red), and to those gene duplicates which are only found in either D. rerio or Astyanax mexicanus (dark red). Results from b, d, and f are filtered and grouped by the originating whole genome duplication event (WGD), either vertebrate (2 R) or teleost (3 R).

Extended Data Fig. 5 Gene regulatory networks identified by GENIE3/SCENIC.

(a) Comparison of the random forest weights for orthologous transcription factors in the zebrafish (y-axis) and Mexican tetra (x-axis) data for example terminal effector genes. Colours indicate whether those transcription factors are in the top 2% of transcription factors for each gene in either zebrafish (blue) and Mexican tetra (red), both (yellow), or none (black).

Extended Data Fig. 6 Species-specific subcluster identities are not dependent on species-specific genes.

(a) tSNEs of cells from clusters containing a species-specific neuronal subcluster coloured by the original subcluster identity. (b) tSNEs of cells from clusters containing a species-specific neuronal subcluster coloured by subcluster identity derived from subclustering without species-specific genes. (c) Sankey diagrams illustrating the relationship between original subcluster identities and identities from subclustering without species-specific genes. Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively. Shaded connections represent cells from species-specific subclusters.

Extended Data Fig. 7 Comparison of subcluster identities between independent and integrated analysis.

(a) Sankey diagram of Mexican tetra surface-morph-specific subclusters and their relationship to integrated subclusters, and zebrafish subclusters. Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively. (d) Sankey diagram of Mexican tetra cave-morph-specific subclusters and their relationship to integrated subclusters, and zebrafish subclusters. Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively. (c) Sankey diagram of the Zebrafish species-specific subclusters (middle) and their relationship to subclusters independently identified in the zebrafish (right) or Mexican tetra datasets (left). Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively. (d) Sankey diagram of the subclusters shared by, (“Shared (147)”) or specific to, surface- and/or cave-morphs (“Cave-specific” or “Surface-specific”). The middle column depicts whether each subcluster is found in all cave-morph samples (“All Caves”), different combinations of multiple caves, or only in the datasets from specific cave-lineages (“Pachon” or “Molino”). Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively.

Extended Data Fig. 8 Comparison of neuropeptides and gene regulatory networks between surface- and cave-morphs.

(a) DotPlot showing expression of galn in the cells from the galn+ cluster (Neuronal_07), and expression of oxt, avp, and ENSAMXG00000021172 in the Neuronal_19 cluster. Cells are grouped by species-morph and cave-lineage. (b) Similarity Index between the transcription factor sets for surface- and cave-morphs of Mexican tetra for neuropeptides, neurotransmitters, synaptic genes, and ion channels. (c-f) Random forest weights for orthologous transcription factors in the Mexican tetra surface-morph (y-axis) and Mexican tetra cave-morph (x-axis) data for the neuropeptides galn, hcrt, oxt, and avp. Colours indicate whether those transcription factors are in the top 2% of transcription factors for each gene in either surface-morphs (green) and cave-morphs (yellow), both (purple), or none (black). For all boxplots, box bounds represent the first and third quartiles and whiskers 1.5 times the interquartile range, thicker line represents the median.

Extended Data Fig. 9 Transcriptional signatures of neuro-inflammation resistance in cave-morphs.

(a) tSNE reduction of immune clusters (Tcells, Bcells, Microglia, Macrophages, Mast cells, Thrombocytes, Neutrophils, and Erythorcytes) from surface- and cave-morph Mexican tetra coloured and labelled by species-morph. (b) tSNE reduction of immune cell types from surface- and cave-morph Mexican tetra coloured by cluster. (c) Marker genes for surface- and cave-morph versions of each immune cell type. Red outlines indicate differential expression of neuro-inflammation associated genes in cave-morph immune cells. Gene expression is quantified by both the percentage of cells which express each gene (dot size) and the average expression in those cells (colour scale). (d) tSNE reduction showing expression of ccr9a in Mexican tetra immune cells. (e) Proportion of cells within each immune subcluster which come from Choy surface-morphs, or Molino, Tinaja, or Pachon cave-morphs.

Extended Data Fig. 10 A permanent stress response in a cave-morph-specific neuronal subcluster.

(a) tSNE reduction of Neuronal_03 cluster from Mexican tetra coloured and labelled by subcluster. (b) tSNE reduction of Neuronal_03 cluster from Mexican tetra coloured by species-morph. (c) DotPlot of the top 5 marker genes for each subcluster of the Neuronal_03 cell type (x-axis), and their expression across all subclusters (y-axis). Gene expression is quantified by both the percentage of cells which express each gene (dot size) and the average expression in those cells (colour scale). (d) Dendrogram of the Neuronal_03 subclusters based on the Variable Features of the Neuronal_03 cluster, and the proportion barplot of cells from each species-morph per subcluster. (e) GO analysis of genes differentially expressed between Neuronal_03_1 and Neuronal_03_4. (f) tSNE reduction of Neuronal_03 cluster from Mexican tetra coloured by hspb1 expression. Neuronal_03_4 subcluster is highlighted by a dotted line. (g) Sankey diagram of the relationships between the Mexican tetra subclusters (left-hand side), integrated subclusters (middle), and zebrafish subclusters (right-hand side). Box heights and line widths are proportional to the number of cells in each subcluster and connection, respectively.

Supplementary information

Supplementary Information

Supplementary Methods/Results, Refs 1–17 and Figs. 1–9.

Reporting Summary.

Supplementary Data

Supplementary data for Cavefish single-cell sequencing publication. This archive contains the supplementary data for the paper “Gene family evolution underlies cell type diversification in the hypothalamus of teleosts”, which includes all of the raw and partially processed data produced by the analyses presented. The archive contains: (1) The raw count data for both the zebrafish (D. rerio) and Mexican tetra (A. mexicanus) single-cell experiments, as compressed.csv files. (2) The Seurat object metadata for the zebrafish (D. rerio), Mexican tetra (A. mexicanus) and integrated Seurat objects, containing sample, species and cell-type cluster labels for each cell. (3) CSVs for all marker gene lists used in the publication. (4) CSVs for all pseudobulk expression data for all cell type labels. (5) The raw data used for calculating the SI for each cluster and subcluster identity in the integrated data. (6) Results from SCENIC/GENIE3 analysis, including the Linklists and tfModules outputs from SCENIC. (7) Results of the weir fst analysis between cave and surface populations, for both INDELs and SNPs. (8) Ensembl biomart export files for determining paralogy relationships between genes within and across species. (9) Results of trinarization of gene expression across all identities and the uniquely expressed genes per identity. These are provided as R object files (.rds). Supplemental_data/3-marker_gene_lists. This folder contains marker gene lists for clusters and subclusters (“.sub”), for the zebrafish (“Drerio”), Mexican tetra (“Amexicanus”) or integrated (“Integrated”) datasets. Supplemental_data/4-pseudobulk_expression. This folder contains psuedobulk expression profiles for clusters (“Clusters”) and subclusters (“Subclusters”), for the zebrafish (“Drerio”), combined Mexican tetra and the surface- and cave-morphs of Mexican tetra, and integrated datasets.

Peer Review Information

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shafer, M.E.R., Sawh, A.N. & Schier, A.F. Gene family evolution underlies cell-type diversification in the hypothalamus of teleosts. Nat Ecol Evol 6, 63–76 (2022). https://doi.org/10.1038/s41559-021-01580-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41559-021-01580-3

This article is cited by

Search

Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing