Y chromosomes underlie sex determination in mammals, but their repeat-rich nature has hampered sequencing and associated evolutionary studies. Here we trace Y evolution across 15 representative mammals on the basis of high-throughput genome and transcriptome sequencing. We uncover three independent sex chromosome originations in mammals and birds (the outgroup). The original placental and marsupial (therian) Y, containing the sex-determining gene SRY, emerged in the therian ancestor approximately 180 million years ago, in parallel with the first of five monotreme Y chromosomes, carrying the probable sex-determining gene AMH. The avian W chromosome arose approximately 140 million years ago in the bird ancestor. The small Y/W gene repertoires, enriched in regulatory functions, were rapidly defined following stratification (recombination arrest) and erosion events and have remained considerably stable. Despite expression decreases in therians, Y/W genes show notable conservation of proto-sex chromosome expression patterns, although various Y genes evolved testis-specificities through differential regulatory decay. Thus, although some genes evolved novel functions through spatial/temporal expression shifts, most Y genes probably endured, at least initially, because of dosage constraints.
Subscribe to Journal
Get full journal access for 1 year
only $3.90 per issue
All prices are NET prices.
VAT will be added later in the checkout.
Rent or Buy article
Get time limited or full article access on ReadCube.
All prices are NET prices.
Gene Expression Omnibus
Sequence Read Archive
RNA and DNA sequencing data as well as reconstructed Y/W sequences have been deposited in the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) and Transcriptome Shotgun Assembly (TSA) Database under the accession codes GSE50747, SRP029216, SRP026469 and PRJNA236159.
Kashimada, K. & Koopman, P. Sry: the master switch in mammalian sex determination. Development 137, 3921–3930 (2010)
Wilson, M. A. & Makova, K. D. Genomic analyses of sex chromosome evolution. Annu. Rev. Genomics Hum. Genet. 10, 333–354 (2009)
Potrzebowski, L. et al. Chromosomal gene movements reflect the recent origin and biology of therian sex chromosomes. PLoS Biol. 6, e80 (2008)
Veyrunes, F. et al. Bird-like sex chromosomes of platypus imply recent origin of mammal sex chromosomes. Genome Res. 18, 965–973 (2008)
Grützner, F. et al. In the platypus a meiotic chain of ten sex chromosomes shares genes with the bird Z and mammal X chromosomes. Nature 432, 913–917 (2004)
Charlesworth, B. & Charlesworth, D. The degeneration of Y chromosomes. Phil. Trans. R. Soc. Lond. B 355, 1563–1572 (2000)
Bachtrog, D. Y-chromosome evolution: emerging insights into processes of Y-chromosome degeneration. Nature Rev. Genet. 14, 113–124 (2013)
Julien, P. et al. Mechanisms and evolutionary patterns of mammalian and avian dosage compensation. PLoS Biol. 10, e1001328 (2012)
Hughes, J. F. et al. Chimpanzee and human Y chromosomes are remarkably divergent in structure and gene content. Nature 463, 536–539 (2010)
Hughes, J. F. et al. Strict evolutionary conservation followed rapid gene loss on human and rhesus Y chromosomes. Nature 483, 82–86 (2012)
Skaletsky, H. et al. The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes. Nature 423, 825–837 (2003)
Li, G. et al. Comparative analysis of mammalian Y chromosomes illuminates ancestral structure and lineage-specific evolution. Genome Res. 23, 1486–1495 (2013)
Marais, G. A., Campos, P. R. & Gordo, I. Can intra-Y gene conversion oppose the degeneration of the human Y chromosome? A simulation study. Genome Biol. Evol. 2, 347–357 (2010)
Murphy, W. J., Pevzner, P. A. & O’Brien, S. J. Mammalian phylogenomics comes of age. Trends Genet. 20, 631–639 (2004)
Ayers, K. L. et al. RNA sequencing reveals sexually dimorphic gene expression before gonadal differentiation in chicken and allows comprehensive annotation of the W-chromosome. Genome Biol. 14, R26 (2013)
Moghadam, H. K., Pointer, M. A., Wright, A. E., Berlin, S. & Mank, J. E. W chromosome expression responds to female-specific selection. Proc. Natl Acad. Sci. USA 109, 8207–8211 (2012)
Ross, M. T. et al. The DNA sequence of the human X chromosome. Nature 434, 325–337 (2005)
Murtagh, V. J. et al. Evolutionary history of novel genes on the tammar wallaby Y chromosome: implications for sex chromosome evolution. Genome Res. 22, 498–507 (2012)
Lahn, B. T. & Page, D. C. Four evolutionary strata on the human X chromosome. Science 286, 964–967 (1999)
Wilson Sayres, M. A. & Makova, K. D. Genome analyses substantiate male mutation bias in many species. Bioessays 33, 938–945 (2011)
Pearks Wilkerson, A. J. et al. Gene discovery and comparative analysis of X-degenerate genes from the domestic cat Y chromosome. Genomics 92, 329–338 (2008)
Chang, T. C., Yang, Y., Retzel, E. F. & Liu, W. S. Male-specific region of the bovine Y chromosome is gene rich with a high transcriptomic activity in testis development. Proc. Natl Acad. Sci. USA 110, 12373–12378 (2013)
Weller, P. A., Critcher, R., Goodfellow, P. N., German, J. & Ellis, N. A. The human Y chromosome homologue of XG: transcription of a naturally truncated gene. Hum. Mol. Genet. 4, 859–868 (1995)
Rodríguez-Delgado, C. L., Waters, P. D., Gilbert, C., Robinson, T. J. & Graves, J. A. Physical mapping of the elephant X chromosome: conservation of gene order over 105 million years. Chromosome Res. 17, 917–926 (2009)
Rens, W. et al. The multiple sex chromosomes of platypus and echidna are not completely identical and several share homology with the avian Z. Genome Biol. 8, R243 (2007)
Warren, W. C. et al. Genome analysis of the platypus reveals unique signatures of evolution. Nature 453, 175–183 (2008)
Cutting, A., Chue, J. & Smith, C. A. Just how conserved is vertebrate sex determination? Dev. Dyn. 242, 380–387 (2013)
Western, P. S., Harry, J. L., Graves, J. A. & Sinclair, A. H. Temperature-dependent sex determination in the American alligator: AMH precedes SOX9 expression. Dev. Dyn. 216, 411–419 (1999)
Hattori, R. S. et al. A Y-linked anti-Mullerian hormone duplication takes over a critical role in sex determination. Proc. Natl Acad. Sci. USA 109, 2955–2959 (2012)
Tsend-Ayush, E. et al. Identification of mediator complex 26 (Crsp7) gametologs on platypus X1 and Y5 sex chromosomes: a candidate testis-determining gene in monotremes? Chromosome Res. 20, 127–138 (2012)
Dalloul, R. A. et al. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 8, e1000475 (2010)
Warren, W. C. et al. The genome of a songbird. Nature 464, 757–762 (2010)
Adolfsson, S. & Ellegren, H. Lack of dosage compensation accompanies the arrested stage of sex chromosome evolution in ostriches. Mol. Biol. Evol. 30, 806–810 (2013)
Nam, K. & Ellegren, H. The chicken (Gallus gallus) Z chromosome contains at least three nonlinear evolutionary strata. Genetics 180, 1131–1136 (2008)
Matunis, M. J., Michael, W. M. & Dreyfuss, G. Characterization and primary structure of the poly(C)-binding heterogeneous nuclear ribonucleoprotein complex K protein. Mol. Cell. Biol. 12, 164–171 (1992)
Wu, Z. et al. Targeted ubiquitination and degradation of G-protein-coupled receptor kinase 5 by the DDB1–CUL4 ubiquitin ligase complex. PLoS ONE 7, e43997 (2012)
Smith, C. A. et al. The avian Z-linked gene DMRT1 is required for male sex determination in the chicken. Nature 461, 267–271 (2009)
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nature Genet. 25, 25–29 (2000)
Zhou, Q. & Bachtrog, D. Sex-specific adaptation drives early sex chromosome evolution in Drosophila. Science 337, 341–345 (2012)
Kondrashov, F. A. & Koonin, E. V. A common framework for understanding the origin of genetic dominance and evolutionary fates of gene duplications. Trends Genet. 20, 287–290 (2004)
Soumillon, M. et al. Cellular source and mechanisms of high transcriptome complexity in the mammalian testis. Cell Rep. 3, 2179–2190 (2013)
Cocquet, J. et al. The multicopy gene Sly represses the sex chromosomes in the male mouse germline after meiosis. PLoS Biol. 7, e1000244 (2009)
Wilson, M. A. & Makova, K. D. Evolution and survival on eutherian sex chromosomes. PLoS Genet. 5, e1000568 (2009)
Berlin, S. & Ellegren, H. Fast accumulation of nonsynonymous mutations on the female-specific W chromosome in birds. J. Mol. Evol. 62, 66–72 (2006)
Hughes, J. F., Skaletsky, H. & Page, D. C. Sequencing of rhesus macaque Y chromosome clarifies origins and evolution of the DAZ (Deleted in AZoospermia) genes. Bioessays 34, 1035–1044 (2012)
Charlesworth, B. Model for evolution of Y chromosomes and dosage compensation. Proc. Natl Acad. Sci. USA 75, 5618–5622 (1978)
Sutton, E. et al. Identification of SOX3 as an XX male sex reversal gene in mice and humans. J. Clin. Invest. 121, 328–341 (2011)
Park, C., Carrel, L. & Makova, K. D. Strong purifying selection at genes escaping X chromosome inactivation. Mol. Biol. Evol. 27, 2446–2450 (2010)
Necsulea, A. et al. The evolution of lncRNA repertoires and expression patterns in tetrapods. Nature 505, 635–640 (2014)
Brawand, D. et al. The evolution of gene expression levels in mammalian organs. Nature 478, 343–348 (2011)
Derrien, T. et al. The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression. Genome Res. 22, 1775–1789 (2012)
Kircher, M., Stenzel, U. & Kelso, J. Improved base calling for the Illumina Genome Analyzer using machine learning strategies. Genome Biol. 10, R83 (2009)
Chen, N., Bellott, D. W., Page, D. C. & Clark, A. G. Identification of avian W-linked contigs by short-read sequencing. BMC Genomics 13, 183 (2012)
Carvalho, A. B., Dobo, B. A., Vibranovski, M. D. & Clark, A. G. Identification of five new genes on the Y chromosome of Drosophila melanogaster. Proc. Natl Acad. Sci. USA 98, 13225–13230 (2001)
Carvalho, A. B. & Clark, A. G. Efficient identification of Y chromosome sequences in the human and Drosophila genomes. Genome Res. 23, 1894–1907 (2013)
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009)
Flicek, P. et al. Ensembl 2012. Nucleic Acids Res. 40, D84–D90 (2012)
Luo, R. et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1, 18 (2012)
Li, R., Li, Y., Kristiansen, K. & Wang, J. SOAP: short oligonucleotide alignment program. Bioinformatics 24, 713–714 (2008)
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature Biotechnol. 29, 644–652 (2011)
Kent, W. J. BLAT–the BLAST-like alignment tool. Genome Res. 12, 656–664 (2002)
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S. L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009)
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004)
Huang, X. & Madan, A. CAP3: A DNA sequence assembly program. Genome Res. 9, 868–877 (1999)
Church, D. M. et al. Lineage-specific biology revealed by a finished genome assembly of the mouse. PLoS Biol. 7, e1000112 (2009)
Untergasser, A. et al. Primer3–new capabilities and interfaces. Nucleic Acids Res. 40, e115 (2012)
Löytynoja, A. & Goldman, N. An algorithm for progressive multiple alignment of sequences with insertions. Proc. Natl Acad. Sci. USA 102, 10557–10562 (2005)
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010)
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010)
Katsura, Y. & Satta, Y. No evidence for a second evolutionary stratum during the early evolution of mammalian sex chromosomes. PLoS ONE 7, e45488 (2012)
Yang, Z. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13, 555–556 (1997)
Yang, Z. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol. Biol. Evol. 15, 568–573 (1998)
Chamary, J. V., Parmley, J. L. & Hurst, L. D. Hearing silence: non-neutral evolution at synonymous sites in mammals. Nature Rev. Genet. 7, 98–108 (2006)
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotechnol. 28, 511–515 (2010)
Gentleman, R. C. et al. Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 5, R80 (2004)
Schriml, L. M. et al. Tyramide signal amplification (TSA)-FISH applied to mapping PCR-labeled probes less than 1 kb in size. Biotechniques 27, 608–613 (1999)
Kortschak, R. D., Tsend-Ayush, E. & Grutzner, F. Analysis of SINE and LINE repeat content of Y chromosomes in the platypus, Ornithorhynchus anatinus. Reprod. Fertil. Dev. 21, 964–975 (2009)
Goto, H., Peng, L. & Makova, K. D. Evolution of X-degenerate Y chromosome genes in greater apes: conservation of gene content in human and gorilla, but not chimpanzee. J Mol Evol. 68, 134–144 (2009)
Kim, H. S. & Takenaka, O. Evolution of the X-linked zinc finger gene and the Y-linked zinc finger gene in primates. Mol. Cells 10, 512–518 (2000)
Whitfield, L. S., Lovell-Badge, R. & Goodfellow, P. N. Rapid sequence evolution of the mammalian sex-determining gene SRY. Nature 364, 713–715 (1993)
Moreira, M. A. SRY evolution in Cebidae (Platyrrhini: Primates). J. Mol. Evol. 55, 92–103 (2002)
Gubbay, J. et al. A gene mapping to the sex-determining region of the mouse Y chromosome is a member of a novel family of embryonically expressed genes. Nature 346, 245–250 (1990)
Ma, K. et al. A Y chromosome gene family with RNA-binding protein homology: candidates for the azoospermia factor AZF controlling human spermatogenesis. Cell 75, 1287–1295 (1993)
Agulnik, A. I. et al. A mouse Y chromosome gene encoded by a region essential for spermatogenesis and expression of male-specific minor histocompatibility antigens. Hum. Mol. Genet. 3, 873–878 (1994)
Mitchell, M. J. et al. Homology of a candidate spermatogenic gene from the mouse Y chromosome to the ubiquitin-activating enzyme E1. Nature 354, 483–486 (1991)
Ehrmann, I. E. et al. Characterization of genes encoding translation initiation factor from X-inactivation and evolution. Hum. Mol. Genet. 7, 1725–1737 (1998)
Strausberg, R. L. et al. Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences. Proc. Natl Acad. Sci. USA 99, 16899–16903 (2002)
Hall, N. M. et al. Usp9y (ubiquitin-specific protease 9 gene on the Y) is associated with a functional promoter and encodes an intact open reading frame homologous to Usp9x that is under selective constraint. Mamm. Genome 14, 437–447 (2003)
Mazeyrat, S. et al. The mouse Y chromosome interval necessary for spermatogonial proliferation is gene dense with syntenic homology to the human AZFa region. Hum. Mol. Genet. 7, 1713–1724 (1998)
Mardon, G. & Page, D. C. The sex-determining region of the mouse Y chromosome encodes a protein with a highly acidic domain and 13 zinc fingers. Cell 56, 765–770 (1989)
Touré, A. et al. Identification of novel Y chromosome encoded transcripts by testis transcriptome analysis of mice with deletions of the Y chromosome long arm. Genome Biol. 6, R102 (2005)
Touré, A. et al. A protein encoded by a member of the multicopy Ssty gene family located on the long arm of the mouse Y chromosome is expressed during sperm development. Genomics 83, 140–147 (2004)
We thank K. Harshman and the Lausanne Genomics Technology Facility for high-throughput sequencing support; I. Xenarios and the Vital-IT computational facility for computational support; S. Pääbo for great ape DNA samples; C. Roos for marmoset DNA samples; P. Jensen for chicken samples; E. Tsend-Ayush for help in determining the complete sequence of the AMHY gene in platypus; P. Gonzalez-Rubio for help with figure designs; M. Cardoso-Moreira, F. Carelli, A. Necsulea and M. Warnefors for comments on the manuscript; the Kaessmann group in general for discussions; and the Marmoset Genome Sequencing Consortium for making the marmoset genome assembly and annotation data available and for granting permission to use them for the analyses described in this study before publication. D.T.F. was supported by the Mexican National Council for Science and Technology (CONACyT). P.D.W. was supported by an ARC fellowship. F.G. was supported by an ARC fellowship. This research was supported by grants from the European Research Council (Starting Independent Grant: 242597, SexGenTransEvolution) and the Swiss National Science Foundation (Grant: 130287) to H.K.
The authors declare no competing financial interests.
Extended data figures and tables
Extended Data Figure 1 Overview of subtraction approach to detect and assemble Y (W) chromosome genes/transcripts.
Extended Data Figure 2 Resampling simulations to evaluate the performance of the Y transcript reconstruction approach.
a–d, Results of one simulation showing the completeness (length of reconstructed sequences with respect to annotated sequences at ≥ 99% identity) of Y protein-coding transcripts using increasing numbers of randomly sampled male reads (million reads, Mr) when performing reconstructions. Simulations were carried out for human, chimpanzee, macaque and mouse where Y chromosomes have been fully sequenced. Twenty randomly selected genes from chicken were included in the simulations to control for false positive reconstructions (the median of the completeness of these control genes is zero). e, Distribution of the completeness of Y transcripts for repeated resampling analyses for different numbers of randomly sampled male reads (50 resampling replicates for each read number). Error bars, maximum and minimum values, excluding outliers.
See Fig. 1 for detailed legend. Numbers indicate the references in which Y and W genes are described.
Extended Data Figure 4 Phylogenetic trees constructed on the basis of alignments of concatenated genes for a given stratum.
a–g, All trees are based on coding nucleotide sequences, except for therian S1 and marsupials S1 trees, which are based on an amino acid alignment, given that coding sequences are highly diverged, leading to a tree that is less well supported at key nodes. Bootstrap values are based on 100 bootstrap replicates. In the specific case of therian S1 and eutherian S3, we included all different gene copies (where these were available from previous work or could be reconstructed) in the analyses for the multi-copy genes (RBMY, RPS4Y, HSFY, Cyorf15) by repeatedly (100 times) and randomly selecting gene copies for the concatenations, and then reconstructing phylogenetic trees (100 bootstrap replicates). Reported bootstrap values at each node thus represent the median value for 100 trees obtained with different combinations of multi-copy sequences. Trees for individual genes are available at ftp://ftp.vital-it.ch/papers/kaessmann/Nature-Cortez/Cortez_etal_Nature_YX_gene_trees_alignments.zip.
a–f, Phylogenetic trees based on synonymous site divergences (dS) of concatenated Y (W) genes for different strata; key branch lengths (dS and inferred corresponding millions of years, Myr) as well as divergence times and inferred strata ages (in green) are indicated. Branch lengths and corresponding age ranges are given as 95% confident intervals calculated with simulated data (Methods). The following genes were used in underlying concatenated alignments (other genes were lacking orthologues in some species and were thus not included): SRY, RBMY and RPS4Y (total alignment length: 2,607 nucleotides (nt)) for therian S1; KDM5D (4,794 nt) for eutherian S2; KDM5D, HCHC1Y, MECP2Y, UBE1Y1 and HUWE1Y (24,657 nt) for marsupial S2; USP9Y, UTY and DDX3Y (12,408 nt) for eutherian S3; AMHY, FEM1CY, AKAP8LY and MED26Y (5,469 nt) for monotreme S1; HNRNPKW and KCMF1W (2,268 nt) for bird S1. g, dS values for the first (most basal) branches that follow different stratification events and lead to Y and X clades, respectively. Statistically significant differences (Mann–Whitney U-test): Benjamini–Hochberg-corrected *P < 0.05, ***P < 0.001. Error bars, maximum and minimum values, excluding outliers. h, Pairwise dS values for Y and X gametologues in platypus. Genes were grouped into strata on the basis of phylogenetic information and dS value clustering (Methods).
a, Male (M) to female (F) genomic read coverage ratio/difference for 23 unassembled contigs (red dots) containing the closest homologues of all platypus Y genes except for RREB1Y (no homologue could be traced) and JUND (low genomic read coverage), fully differentiated part of X5 (yellow), assembled autosomes (black), and autosomal contigs (green), previously experimentally mapped X-linked contigs (blue), and all other unassembled contigs (grey). The density plots describe the distribution of unassembled contigs; the major peak in the y-axis distribution reflects similar coverage in males and females, as expected for autosomal contigs and observed for assembled autosomes. The minor peak reflects a twofold higher coverage in females, as expected for X-linked sequences and observed for the assembled X5 chromosome. b, Coverage profiles for chromosome 1, X5, and the 24 contigs containing gametologues of reconstructed Y genes (RREB1Y contig is missing, as no gametologue could be identified for this gene). Plots are sorted by contig sequence lengths. Consistent with the expectation for X gametologues, 19 out of 24 contigs show an overall twofold higher coverage in females (M:F log2 ratio significantly different from 0 but not from -1; one-sample Wilcoxon signed-rank test; Benjamini–Hochberg-corrected, P < 0.05; Methods and Supplementary Tables 19, 20). The contig containing the ‘Novel Gene’, which has a particularly low M:F ratio, has low coverage but shows twofold higher coverage in females compared to males in regions with mapped reads, consistent with X-linkage. Contigs containing SDHAY and HNRNPKY homologues show a twofold higher coverage in females in the region containing the genes, suggestive of a location in a non-recombinant X region (these genes are likely located close to pseudoautosomal boundaries). The closest homologue of SYCP3Y shows a profile typical of autosomes, which indicates that SYCP3Y was recruited directly to the Y from an autosome. JUND cannot be analysed owing to the limited genomic read coverage.
Extended Data Figure 7 Synonymous and nonsynonymous substitution rates across Y/X (W/Z) branches and physical mapping of AMHY.
a–d, dS, dN, dN/dS values for all Y/X branches (medians across branches for listed Y genes and their gametologues), or the first (most basal) branches that follow different stratification events and lead to Y and X clades, respectively. The Mann–Whitney U-test was used to compare distributions of Y and X rates. Significant (Benjamini–Hochberg-corrected) P values are shown in red. a, Eutherians, 17 Y genes (SRY/SOX3, RBMY/X, RPS4Y1/X, AMELY/X, DDX3Y/X, EIF1AY/X, PRKY/X, TSPY1/X, USP9Y/X, ZFY/X, UTY/X, EIF2S3Y/X, NLGN4Y/X, KDM5D/C, UBE1Y1/UBA, TMSB4Y/X and Cyorf15/TBL1X). b, Marsupials, 13 genes (ATRY/X, HCFC1Y/X, MECP2Y/X, HUWE1Y/X, RBM10Y/X, RPL10Y/X, TFE3Y/X, THOC2Y/X, KLF8Y/X, HMGB3Y/X, PHF6Y/X, KDM5D/C and UBE1Y1/UBA). c, Monotremes, 17 genes (AMHY/X, FEM1CY/X, AKAP8LY/X, PPP4R2Y/X, MED26Y/X, THAP11Y/X, STK11Y/X, RFX1Y/X, KHSRPY/X, ARF4Y/X, CTCFY/X, MAU2Y/X, PPP1R10Y/X, HNRNPKY/X, DAZAP1Y/X, PRPF4BY/X and SSR1Y/X). d, Chicken, 20 genes (NIPBLW/Z, ATP5A1W/Z, BTF3W/Z, C18orf25W/Z, CHD1W/Z, GOLPH3W/Z, HINT1W/Z, KCMF1W/Z, MIER3W/Z, NEDD4LW/Z, RASA1W/Z, RPL17W/Z, SMAD2W/Z, SPIN1W/Z, ST8SIA3W/Z, UBAP2W/Z, VCPW/Z, ZFRW/Z, ZNF532W/Z and HNRNPKW/Z). e, DAPI (4′,6-diamidino-2-phenylindole) staining of male platypus metaphase chromosomes. Chromosome Y5 is minute and the smallest chromosome in platypus. The next smallest chromosomes (Y3, Y4 and X4) are also indicated, for comparison. f, Localization of AMHY gene using FISH. The AMHY PCR probe (green signal) hybridized specifically to chromosome Y5. Error bars, maximum and minimum values, excluding outliers.
Extended Data Figure 8 Expression levels of X gametologues and proto-sex chromosome genes, expression evolution towards testis-specificity, and kinetics of Y gene loss.
a, Expression level characteristics (somatic tissues) of gametologues on current X and proto-sex chromosomes. Expression level distributions for current X gametologues (X), precursors of current X/Y genes on proto-sex chromosomes as inferred from 1:1 autosomal orthologues in outgroup species, all current X-linked genes, all proto-sex chromosomal genes. Similar distributions for chicken Z-linked genes and proto-sex chromosome precursors. Statistically significant differences (Mann–Whitney U-test): Benjamini–Hochberg-corrected *P < 0.05, ***P < 0.001. b, Current (Y) and inferred ancestral (proto-sex) expression levels of Y genes that gained testis-specific expression during evolution (right) and those that did not (left) in somatic tissues and testis (as indicated on x axis). Note: for proto-sex chromosome plots, inferred expression output values were calculated per single gene copy (see Fig. 2 legend). c, Kinetics of ancestral gene decay on the therian Y. Gene numbers are plotted on the y axis (ancestral gene numbers are indicated at the top), and time (in Myr) is plotted on the x axis. Dots indicate minimum inferred or observed gene numbers at different time points of mammalian evolution based on our dS inferences (see Fig. 1 and Extended Data Fig. 5) and known lineage divergence times: 181 Myr (S1 formation), 152 Myr (S2a formation), 115 Myr (S2b and S3), 105 Myr (afrotherian split from other placentals), 80 Myr (American–Australian marsupial split), 25 Myr (Old-World monkey–ape split) and 6 Myr (human–chimp split). Dashed lines represent best-fit curves to data points using each of the decay models as indicated. Ancestral gene numbers and decay rates (K, Myr − 1) were taken from Hughes et al.10 except where alternative rates are indicated. Error bars, maximum and minimum values, excluding outliers.
Extended Data Figure 9 Spermatogenic expression patterns of Y protein-coding genes and along the mouse Y chromosome.
a, Left, expression levels of 4 mouse Y genes with ubiquitous spatial expression profiles across different organs and individual spermatogenic cell types. Right, expression levels of 7 mouse Y genes with testis-specific expression across organs and cell types. b, Top, spermatogenic expression of protein-coding genes located in the Y-conserved region (YCR). Lower panel: Expression of 31 reconstructed Y-linked transcript contigs (including unknown presumably noncoding sequences, the known protein-coding genes SLY and SSTY; all with more than 10 copies) along the ampliconic region of the mouse Y chromosome. Genes and contigs were mapped to 1,452 positions using BLASTn to the assembled mouse Y chromosome from genome version GRCm38 (ref. 65).
Extended Data Figure 10 Nonsynonymous over synonymous substitution rates across Y/X (W/Z) branches and simulated multi-copy genes.
a, Median dN/dS values for all Y/X branches and branches leading to autosomal orthologues in outgroup species with different sex chromosome systems (A). Statistically significant differences (Mann–Whitney U-test): Benjamini–Hochberg-corrected *P < 0.05, **P < 0.01, ***P < 0.001. b, Mean coverage values from 1,000 different complementary DNAs from human (>1 kb <10 kb) that were introduced to a mock genome having different copy-numbers (CN), that is, from 1 copy to 20 identical copies. Error bars, maximum and minimum values, excluding outliers.
This zipped file contains all newly established Y and W sequences, CDS sequences of XY and ZW gametologs, the alignments of former pseudogenes in chimpanzee and relevant scripts. See the "SI Guide.txt" file in the folder for further details. (ZIP 474 kb)
Supplementary Tables 1-3 contain information regarding the RNA-seq and genomic data used in this study. The original file uploaded was corrupt and was replaced on 2 May 2014 (XLS 74 kb)
This Supplementary Table contains information regarding the Y transcript reconstruction pipeline. The original file uploaded was corrupt and was replaced on 2 May 2014 (XLS 23 kb)
Supplementary Tables 5-18 contain information regarding Y and W genes in 13 mammals and chicken. The original file uploaded was corrupt and was replaced on 2 May 2014 (XLS 658 kb)
Supplementary Tables 19 and 20 contain information regarding predicted X chromosome localization for X gametologs and statistical information about X-linked contigs in platypus. The original file uploaded was corrupt and was replaced on 2 May 2014 (XLS 42 kb)
Supplementary Tables 21-23 contain information regarding GO terms enrichment among Y (W) genes and tissue-specificity analyses (statistics). The original file uploaded was corrupt and was replaced on 2 May 2014 (XLS 80 kb)
About this article
Cite this article
Cortez, D., Marin, R., Toledo-Flores, D. et al. Origins and functional evolution of Y chromosomes across mammals. Nature 508, 488–493 (2014). https://doi.org/10.1038/nature13151
Trends in Genetics (2020)
Comparative Transcriptomics Analyses across Species, Organs, and Developmental Stages Reveal Functionally Constrained lncRNAs
Molecular Biology and Evolution (2020)
Evolution of Sex Determination in Amniotes: Did Stress and Sequential Hermaphroditism Produce Environmental Determination?