The little skate genome and the evolutionary emergence of wing-like fins

Marlétaz, Ferdinand; de la Calle-Mustienes, Elisa; Acemel, Rafael D.; Paliou, Christina; Naranjo, Silvia; Martínez-García, Pedro Manuel; Cases, Ildefonso; Sleight, Victoria A.; Hirschberger, Christine; Marcet-Houben, Marina; Navon, Dina; Andrescavage, Ali; Skvortsova, Ksenia; Duckett, Paul Edward; González-Rajal, Álvaro; Bogdanovic, Ozren; Gibcus, Johan H.; Yang, Liyan; Gallardo-Fuentes, Lourdes; Sospedra, Ismael; Lopez-Rios, Javier; Darbellay, Fabrice; Visel, Axel; Dekker, Job; Shubin, Neil; Gabaldón, Toni; Nakamura, Tetsuya; Tena, Juan J.; Lupiáñez, Darío G.; Rokhsar, Daniel S.; Gómez-Skarmeta, José Luis

doi:10.1038/s41586-023-05868-1

Article
Open access
Published: 12 April 2023

Nature volume 616, pages 495–503 (2023)

Abstract

Skates are cartilaginous fish whose body plan features enlarged wing-like pectoral fins, enabling them to thrive in benthic environments^1,2. However, the molecular underpinnings of this unique trait remain unclear. Here we investigate the origin of this phenotypic innovation by developing the little skate Leucoraja erinacea as a genomically enabled model. Analysis of a high-quality chromosome-scale genome sequence for the little skate shows that it preserves many ancestral jawed vertebrate features compared with other sequenced genomes, including numerous ancient microchromosomes. Combining genome comparisons with extensive regulatory datasets in developing fins—including gene expression, chromatin occupancy and three-dimensional conformation—we find skate-specific genomic rearrangements that alter the three-dimensional regulatory landscape of genes that are involved in the planar cell polarity pathway. Functional inhibition of planar cell polarity signalling resulted in a reduction in anterior fin size, confirming that this pathway is a major contributor to batoid fin morphology. We also identified a fin-specific enhancer that interacts with several hoxa genes, consistent with the redeployment of hox gene expression in anterior pectoral fins, and confirmed its potential to activate transcription in the anterior fin using zebrafish reporter assays. Our findings underscore the central role of genome reorganization and regulatory variation in the evolution of phenotypes, shedding light on the molecular origin of an enigmatic trait.

Main

The origin and diversification of vertebrates was accompanied by the appearance of key developmental innovations^2,3. Among them, paired appendages show an exquisite diversity of forms and adaptations not only in tetrapods, but also in chondrichthyans (cartilaginous fish) in which fin structures are considerably diverse². The wing-like appendages of batoid fishes (skates and rays) (Fig. 1a) are fascinating examples, in which the pectoral fins extend anteriorly and fuse with the head. This unique structure creates power for forward propulsion and led to the emergence of swimming mechanisms that enabled skates to colonize the sea floor¹. Transcriptomic analysis of skate developing fins revealed a major reorganization of signalling gradients relative to other vertebrates¹. The redeployment of developmental transcription factors, such as 3′ hox genes, initiates an anterior signalling centre analogous to the posterior apical ectodermal ridge (AER). These changes arose ~286–221 million years ago (Fig. 1b) after the divergence between sharks and skates. Nevertheless, the genomic and regulatory changes underlying these novel expression domains have remained elusive.

**Fig. 1: The little skate morphology and genome evolution.**

Many vertebrate evolutionary innovations were influenced by the substantial genomic reorganizations caused by two rounds of whole-genome duplication (WGD). The ancestral chordate chromosomes were duplicated and rearranged to give rise to the diversity of existing karyotypes in vertebrates⁴. Concomitantly, the pervasive loss of paralogous genes after WGDs produced gene deserts enriched in regulatory elements⁵. Compellingly, those genomic alterations were paralleled by marked changes in gene regulation, contributing to an increase in pleiotropy in developmental genes⁵ and to the complexity of their regulatory landscapes⁶. In vertebrates, regulatory landscapes are spatially organized into topologically associating domains (TADs)^7,8. TADs correspond to large genomic regions with increased self-contact that promote the interaction between cis-regulatory elements (CREs) and cognate promoters to constitute precise transcriptional patterns. While TADs constrain the evolution of gene order⁹, genomic rearrangements that alter these domains can be a source for developmental phenotypes¹⁰ and evolutionary innovation^11,12. Yet the importance of TAD organization for the evolution of gene regulation and the emergence of lineage-specific traits after vertebrate WGDs remains largely unexplored.

To gain insights into the evolution of the jawed vertebrate (gnathostome) karyotypes and of wing-like appendages, we generated a chromosome-scale assembly of the little skate L. erinacea and performed extensive functional characterization of its developing fins. Our analyses revealed a karyotype configuration resembling the gnathostome ancestor, characterized by slower paralogue loss and smaller chromosomes than other jawed vertebrates, which suggests fewer fusion events after the second round (2R) of WGD in the skate lineage. We find evidence that three-dimensional (3D) genome organization in skate arises from an interplay between transcription-based A/B compartments and TADs formed by loop extrusion, as described in mammals¹³. The comparison of the 3D organization of α and β chromosomes after the gnathostome-specific WGD revealed a prominent loss of complete TADs, probably contributing to karyotype stabilization. By combining RNA sequencing (RNA-seq) and assay for transposase-accessible chromatin with sequencing (ATAC–seq) data, we identified the planar cell polarity (PCP) pathway and hox gene regulation as key contributors to skate fin morphology, which we further validated using functional assays in zebrafish and skate. Our study illustrates how comparative multi-omics approaches can be effectively used to elucidate the molecular underpinnings of evolutionary traits.

Genome sequencing and comparative genomics

We assembled the little skate genome at the chromosome scale by integrating long- and short-read genome sequencing with chromatin conformation capture (Hi-C) data. Our assembly includes 40 chromosome-scale (>2.5 Mb) scaffolds, with 19 macrochromosomes (>40 Mb), 14 mesochromosomes (between 20 and 40 Mb) and 7 microchromosomes (<20 Mb) that together represent 91.7% of the 2.2 Gb assembly. This chromosome number is within the range reported for other Rajidae species¹⁴. Despite technical challenges due to high polymorphism levels (1.6% heterozygosity) and a repeat content dominated by recently expanded LINE retrotransposons (Extended Data Fig. 1), our assembly showed a similar or higher degree of completeness with respect to gene content compared to other sequenced chondrichthyans (BUSCO; Supplementary Table 1).
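The size classes above amount to a simple thresholding rule. The following is a minimal sketch, assuming scaffold lengths are given in base pairs. The function name `classify_scaffold`, the label `sub-chromosomal` and the toy lengths are hypothetical and are not taken from the assembly; the handling of scaffolds of exactly 20 Mb or 40 Mb is also an assumption.

```python
from collections import Counter

MB = 1_000_000

def classify_scaffold(length_bp: int) -> str:
    """Assign a size class using the thresholds stated in the text."""
    if length_bp <= 2.5 * MB:
        return "sub-chromosomal"   # hypothetical label: below the >2.5 Mb cut-off
    if length_bp < 20 * MB:
        return "microchromosome"   # <20 Mb
    if length_bp <= 40 * MB:
        return "mesochromosome"    # between 20 and 40 Mb
    return "macrochromosome"       # >40 Mb

# Toy scaffold lengths (illustrative only, not the real assembly).
toy_lengths = [120 * MB, 35 * MB, 8 * MB, 1 * MB]
counts = Counter(classify_scaffold(n) for n in toy_lengths)
```

Applied to the real scaffold lengths, a tally of this kind would be expected to reproduce the 19, 14 and 7 scaffolds reported for the three classes.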

We annotated 26,715 protein-coding genes using extensive transcriptome resources¹⁵, with 23,870 possessing homologues in other species. Using comparative analysis with 20 other sequenced vertebrates we reconstructed the complete set of skate gene evolutionary histories (the phylome) and used it to infer patterns of gene duplication and loss, as well as orthology and paralogy relationships (Supplementary Table 2; resources are available at PhylomeDB and MetaPhoRs^16,17). We used phylogenomic methods to reconstruct jawed vertebrate phylogeny and infer divergence times, finding a more ancient divergence between sharks and skates (around 286 million years ago) than previously estimated¹⁸ (Fig. 1b). Compared with other reported chondrichthyan genomes, L. erinacea displays the lowest number of species-specific gene losses (616 losses; Supplementary Fig. 1). Similar to sharks (selachians)^19,20, the little skate has larger introns than tetrapods (median size, 2,167 bp versus 1,586 bp in human), although these are not enriched in a particular repeat category (Extended Data Fig. 1).

Skate microchromosomes have an overall higher gene density compared with macrochromosomes (Extended Data Fig. 1a–c,g), suggesting that, as in birds, these small chromosomes are prone to GC-biased gene conversion²¹. Skate microchromosomes also show a higher degree of interchromosomal contacts compared with other chromosomes (Fig. 1d,e), as also found in snakes and other tetrapods²².

Chromosome evolution

We surveyed the arrangement of syntenic chromosomal segments derived from ancestral chordate linkage groups (CLGs) in skate, gar and chicken, using amphioxus as an unduplicated outgroup²³, and found that the chromosomal organization of the skate genome closely resembles that of the most recent jawed vertebrate common ancestor (Fig. 2a and Extended Data Fig. 2). By analysing the chromosomal locations of single-copy orthologues, we designated chromosomal segments according to their origin at 1R (1 or 2) or 2R (α or β) vertebrate WGDs²³ (Fig. 2b). The relatively large number of elasmobranch chromosomes (≥40) reflects the ancestral condition among gnathostomes: with the exception of the loss of two ancestral segments in the skate lineage and one secondary fusion on chromosome 1, the skate retains 37 of the 39 ancestral vertebrate linkages (Supplementary Table 3). The reduced chromosome number in osteichthyan (bony fish) lineages is therefore due to subsequent chromosomal fusions.

**Fig. 2: Ancestral linkage and the architecture of early vertebrate genomes.**

The smaller vertebrate chromosomes often show a reciprocal correspondence across species and correspond to a single ancestral gnathostome unit^23,24,25 (10 chromosomes have a 1:1:1 orthology between skate, gar and chicken; Fig. 2b). The trios LER25≡LOC20≡GGA15 and LER28≡LOC22≡GGA19 represent two surviving copies of CLG-G from the 1R event. Other trios such as LER21≡LOC18≡GGA20 and LER29≡LOC19≡GGA28 derive from CLG fusions, and the occurrence of some in all gnathostome genomes implies that they happened between the pan-vertebrate 1R and the gnathostome-specific 2R^23,25 (Fig. 2b).

In many gnathostomes, larger chromosomes also derive from fusions of CLGs. The skate often represents an ancestral state among jawed vertebrates, with subsequent fusions in bony fishes, including in chicken (for example, GGA5), in gar (for example, LOC5) or in their common ancestor (for example, LER2 and LER4; see below). For example, ancestral gnathostome chromosomes resembling skate LER9, LER12 and LER18 fused in different ways to form chromosomes in gar and chicken. Similarly, LER10≡GGA8 and LER23≡GGA18 (≡BFL8) probably represent ancestral units that fused in gar chromosome LOC10 through a centric Robertsonian fusion (Fig. 2b). Notably, these two chromosomes are also preserved in their ancestral condition in the bowfin, the sister group of gar, implying that the fusion occurred specifically in the gar lineage²⁶.

Alternatively, ancestral chromosomes resembling LER2 and LER4 probably fused in the bony fish ancestor to give rise to chicken GGA2, whereas gar LOC9 and LOC11 are secondarily split from this fused ancestral chromosome. This may have involved a Robertsonian fission that split a metacentric chromosome at the centromere into two acro- or telocentric products. We also observe cases in which microchromosomes have been added to macrochromosomes recently by terminal translocation, such as the addition of a chromosome similar to LER35≡GGA22 to the start of LOC1, or a LER12-like chromosome to the end of GGA4 (a recent translocation not found in other birds)²⁷.

The extensive conservation of chromosomal identity and gene order between the little skate and the bamboo shark²⁸, despite around 286 million years of divergence, indicates that most chondrichthyans may share this ancestral chromosomal organization (Fig. 1b,c and Extended Data Fig. 2). Notably, gene order collinearity across cartilaginous fish is more extensively conserved than within clades of comparable divergence, such as mammals and frogs²⁹. By contrast, gene order is heavily disrupted between chondrichthyans (such as skate or shark) and osteichthyans (gar or chicken; Fig. 2a,b and Supplementary Fig. 2).

Evolution of the gene complement

The gene complement of the little skate, as in other chondrichthyans, evolved more slowly than that of Osteichthyes with respect to gene loss (Supplementary Fig. 1). Using species-tree-aware phylogenetic methods, we found that the retention of ohnologues (paralogues derived from vertebrate-specific WGDs) was higher than that observed in bony fishes (Fig. 2c,d and Extended Data Fig. 1h). According to the auto-then-allotetraploidy scenario for jawed vertebrate evolution²³, the chromosomes derived from 2R behave distinctly, with β segments showing increased loss and higher rates of molecular evolution (Fig. 2c,e and Extended Data Fig. 1i).

On the basis of patterns of duplication and loss, we found 68 cases in which one ohnologue was differentially retained in varying jawed vertebrate lineages, 19 genes retained in chondrichthyans but lost in bony fishes, 17 retained in chondrichthyans and coelacanth, and 24 retained in chondrichthyans and actinopterygians (ray-finned fish) but lost in lobe-finned fish (Supplementary Table 3). Some of these retained ancestral ohnologues, including previously characterized genes such as wnt6b²⁰ or novel genes such as chondroitin sulfate proteoglycan 5 (cspg5), show distinct expression patterns among stages and organs (Fig. 2e).

Conservation of 3D regulatory principles

We investigated 3D chromatin organization in skates using Hi-C analysis of developing pectoral fins. We found a type II architecture³⁰ with chromosomes preferentially occupying individual territories within the nucleus (Supplementary Fig. 3), consistent with a complete set of condensin II subunits (smc2, smc4, caph2, capg2 and capd3) in the genome. At higher resolution, skate chromosomes are organized into two distinct compartments, as described in other animals³¹. The A compartment displays higher gene density, chromatin accessibility and gene expression levels compared with the B compartment (Extended Data Fig. 3).

At the sub-megabase scale, the skate genome is organized into TADs with a median size of 800 kb (Extended Data Fig. 4a,b), an intermediate regime between mammals and teleosts (Supplementary Fig. 4). Aggregate analyses revealed that skate TADs are associated with chromatin loops at the upper corner of domains (Fig. 3a). Chromatin accessibility (ATAC–seq) and motif enrichment analysis revealed binding sites for the architectural factor CTCF at skate TAD boundaries (Extended Data Fig. 4c,d), in comparable proportions to mammals and teleosts (Supplementary Fig. 5). These CTCF sites display an orientation bias with motifs oriented towards the interior of TADs, suggesting that these domains are formed by loop extrusion (Fig. 3b and Extended Data Fig. 4c). Notably, the critical genes involved in loop extrusion are present in the skate genome, including ctcf and those encoding cohesin complex subunits (smc1a, smc3, scc1 and two copies of scc3). An example of skate TAD organization can be observed at the hoxa and hoxd clusters (Fig. 3c and Extended Data Fig. 4d), which display the characteristic bipartite TAD configuration of jawed vertebrates³². Manual microsynteny analysis confirmed that the 3′ and 5′ TADs found at both skate hox loci are orthologous to those described in mammals and teleosts. Such deeply conserved 3D organizations reflect the existence of regulatory constraints that influenced TAD evolution across the whole jawed vertebrate clade.
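The orientation bias described above can be made concrete with a small calculation. The following is a minimal sketch, not the analysis used in the study: it assumes a single chromosome, TADs given as (start, end) coordinates and CTCF motifs given as (position, strand), and it treats a motif as pointing inward when it lies on the forward strand just inside a left boundary or on the reverse strand just inside a right boundary. The function name `inward_fraction`, the 20 kb window and the toy coordinates are all hypothetical.

```python
from typing import List, Tuple

Tad = Tuple[int, int]    # (start, end) in bp
Motif = Tuple[int, str]  # (position in bp, strand "+" or "-")

def inward_fraction(tads: List[Tad], motifs: List[Motif],
                    window: int = 20_000) -> float:
    """Fraction of boundary-proximal motifs oriented towards the TAD interior.

    Only motifs within `window` bp on the interior side of a boundary are
    considered, so a motif is assigned to at most one TAD provided that
    `window` is smaller than half the TAD length.
    """
    inward = total = 0
    for start, end in tads:
        for pos, strand in motifs:
            if start <= pos <= start + window:    # just inside the left boundary
                total += 1
                inward += strand == "+"
            elif end - window <= pos <= end:      # just inside the right boundary
                total += 1
                inward += strand == "-"
    return inward / total if total else float("nan")

# Toy data: two adjacent TADs and five motifs, four of which point inward.
toy_tads = [(0, 800_000), (800_000, 1_500_000)]
toy_motifs = [(5_000, "+"), (790_000, "-"), (810_000, "+"),
              (1_495_000, "-"), (1_490_000, "+")]
bias = inward_fraction(toy_tads, toy_motifs)
```

A value well above 0.5 is what a convergent, loop-extrusion-like arrangement would be expected to produce; a value near 0.5 would indicate no orientation preference.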

**Fig. 3: Features of 3D chromatin organization in the little skate.**

To investigate enhancer–promoter interactions, we used Hi-C combined with immunoprecipitation (HiChIP) to associate H3K4me3-rich active promoters with potential regulatory loci in the anterior and posterior skate pectoral fin. Notably, these fin regions display transcriptional signatures that differ from other vertebrates. In particular, several 3′ hoxa and hoxd genes are preferentially expressed in the anterior pectoral fin, whereas 5′ hoxa and hoxd genes are expressed in the posterior pectoral domain. This pattern of expression has been consistently found in other batoid species^1,33. HiChIP analyses revealed 50,601 interactions associated with 7,887 different promoters (6.4 interactions per active promoter). Interactions connecting promoters with distal ATAC–seq peaks (χ², P < 10⁻¹³⁸; Extended Data Fig. 5a) and intra-TAD interactions were enriched (empirical P < 10⁻⁴; Extended Data Fig. 5b). Differential analysis revealed similar looping patterns between tissues (Pearson correlation > 0.96; Extended Data Fig. 5c), with only 9 and 5 interactions statistically enriched in anterior and posterior fins, respectively (Extended Data Fig. 5d). Promoters with differential looping included hoxa and hoxb genes and the transcription factor alx4 (Extended Data Fig. 5e–g), which are involved in limb development. To confirm those interactions, we performed Hi-C in anterior and posterior pectoral fins, finding only minor variations. Compartment differences were subtle and restricted to less than 10% of the genome (Extended Data Fig. 6a–d). TADs were also extremely similar (Fig. 3d,e and Extended Data Fig. 6e), with insulation score correlations of above 0.98 (Extended Data Fig. 6f). Similarly, high correlations were observed for chromatin loops (Extended Data Fig. 6g) and differential analysis revealed a single significantly stronger loop in the posterior pectoral fin (Extended Data Fig. 6h,i). Notably, the differential contacts predicted by HiChIP were not noticeable in the Hi-C data (Fig. 3d,e and Extended Data Fig. 6j). The differences in HiChIP data are therefore probably derived from variations in H3K4me3 occupancy, consistent with the selective activation of the hoxa cluster in anterior fins. Overall, both analyses indicate that 3D chromatin folding is largely maintained in the different pectoral fin territories.

To investigate possible regulatory constraints on TAD evolution, we considered 1,464 microsyntenic pairs of genes (that is, consecutive orthologues) conserved between skate, mouse and gar. In skates, such conserved gene pairs shared TADs more often than other consecutive genes (98% versus 95%, χ², P = 3.7 × 10⁻¹³; Extended Data Fig. 7a). Those pairs were present in 718 out of the 1,678 skate TADs (about 43%), highlighting that individual TADs are constrained but not invariant across deep evolutionary timescales (Extended Data Fig. 7b). TADs containing deeply conserved microsyntenic pairs are significantly larger and contain more distal ATAC–seq peaks and putative promoter–enhancer interactions, as defined on the basis of HiChIP analysis, compared with non-conserved TADs (Extended Data Fig. 7c; Mann–Whitney U-test, P = 1.23 × 10⁻²⁴, 3.81 × 10⁻³⁶ and 1.04 × 10⁻⁴¹, respectively). This suggests that the deep conservation of individual TADs emerges from regulatory constraints (Extended Data Fig. 7d,e).
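The test of whether a gene pair shares a TAD reduces to an interval lookup. The following is a minimal sketch under simplifying assumptions: a single chromosome, sorted non-overlapping TADs with half-open coordinates, and each gene represented by one position (for example, its transcription start site). The names `tad_index` and `fraction_sharing_tad` and all coordinates are hypothetical and are not the procedure used in the study.

```python
from bisect import bisect_right
from typing import List, Optional, Tuple

Tad = Tuple[int, int]  # half-open interval [start, end) in bp

def tad_index(pos: int, tads: List[Tad]) -> Optional[int]:
    """Index of the TAD containing `pos`, or None if it falls in a gap."""
    starts = [s for s, _ in tads]
    i = bisect_right(starts, pos) - 1
    if i >= 0 and tads[i][0] <= pos < tads[i][1]:
        return i
    return None

def fraction_sharing_tad(pairs: List[Tuple[int, int]], tads: List[Tad]) -> float:
    """Fraction of gene pairs whose two positions fall in the same TAD."""
    shared = 0
    for a, b in pairs:
        ia, ib = tad_index(a, tads), tad_index(b, tads)
        shared += ia is not None and ia == ib
    return shared / len(pairs)

# Toy data: three TADs with a gap between the second and the third.
toy_tads = [(0, 800_000), (800_000, 1_600_000), (1_700_000, 2_400_000)]
toy_pairs = [(100_000, 300_000),    # same TAD
             (750_000, 850_000),    # straddles a boundary
             (900_000, 1_500_000),  # same TAD
             (1_650_000, 1_800_000)]  # first gene lies in a gap
frac = fraction_sharing_tad(toy_pairs, toy_tads)
```

Computing this fraction separately for conserved microsyntenic pairs and for other consecutive genes gives the two proportions that a χ² test would then compare.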

Our results suggest that 3D chromatin organization in skates results from the interplay of two mechanisms—compartmentalization driven by transcriptional state and TADs formed by loop extrusion. Such organization is similar in bony fishes/tetrapods, indicating that TAD formation through loop extrusion was present in the gnathostome ancestor. As the appearance of this common ancestor was temporally close to 2R, we explored the regulatory fate of homologous TADs in relation to this duplication event. We found that, although the size and gene density of TADs is similar between α and β chromosomes, there are notably fewer TADs in beta (Fig. 3f,g and Extended Data Fig. 7f). Regulatory landscapes derived from H3K4me3 HiChIP experiments followed a similar trend (Extended Data Fig. 7g,h). We confirmed that the lower number of TADs in beta could not be explained by TAD fusions in beta or boundary gains in α segments (Extended Data Fig. 7i). These results indicate that many TADs disappeared from the early gnathostome genome after 2R, while those that persist are comparable in size (Fig. 3g). Whether losses in beta segments were caused by the deletion of whole redundant TADs or the progressive erosion and pseudogenization of their genes is difficult to ascertain.

PCP pathway as a driver of fin expansion

To examine whether genomic rearrangements could have driven skate pectoral fin evolution through TAD alterations, as reported for traits in mammals¹¹, we identified synteny breaks by aligning six jawed vertebrate genomes (Fig. 4a). As expected, the number of (micro)syntenic changes between species increases with phylogenetic distance (Fig. 4a and Extended Data Fig. 8a), from 18 breaks in L. erinacea that occurred after the split of the two skate lineages to 1,801 between cartilaginous and bony fishes (around 2 breaks per million years).
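The idea of a synteny break can be illustrated on gene order alone. The following is a simplified sketch and not the alignment-based procedure used in the study: it assumes both species are represented by ordered lists of the same single-copy orthologues on one chromosome, and it reports neighbouring genes in one species that are not neighbours in the other. The function name `synteny_breaks` and the gene labels are hypothetical.

```python
from typing import List, Tuple

def synteny_breaks(order_a: List[str], order_b: List[str]) -> List[Tuple[str, str]]:
    """Adjacent gene pairs in `order_a` that are not adjacent in `order_b`."""
    index_b = {gene: i for i, gene in enumerate(order_b)}
    breaks = []
    for g1, g2 in zip(order_a, order_a[1:]):
        # Pairs with a gene missing from species B are skipped, not called.
        if g1 in index_b and g2 in index_b:
            if abs(index_b[g1] - index_b[g2]) != 1:
                breaks.append((g1, g2))
    return breaks

# Toy example: species B carries an inversion of the g4-g6 block.
species_a = ["g1", "g2", "g3", "g4", "g5", "g6"]
species_b = ["g1", "g2", "g3", "g6", "g5", "g4"]
found = synteny_breaks(species_a, species_b)
```

In this toy case only the g3/g4 junction is reported, because the other end of the inverted block coincides with the end of the list. Assigning a break to a lineage additionally requires an outgroup, which this sketch does not attempt.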

**Fig. 4: Skate-specific genomic rearrangements and the PCP pathway.**

As anterior expansion of the pectoral fin is a defining characteristic of skates, we focused on the 123 synteny breaks shared by the little and thorny skate genomes relative to other vertebrates. We found an enrichment of synteny breaks near TAD boundaries—42 breaks occurred within 50 kb of a TAD boundary, compared with 15 expected under a random break model (empirical P < 1 × 10⁻⁴; Fig. 4b). This enrichment supports the hypothesis that genome rearrangements that interrupt TADs are evolutionarily disfavoured owing to deleterious enhancer–promoter rewiring⁹.
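An empirical P value of this kind can be obtained by repeatedly placing breaks at random and asking how often the random sets fall as close to TAD boundaries as the observed set. The following is a minimal sketch of such a random break model. It assumes a single linear chromosome and uniformly placed breaks; the function names, the add-one correction and the toy coordinates are assumptions, not details taken from the study.

```python
import random
from bisect import bisect_left
from typing import List

def count_near(breaks: List[int], boundaries: List[int],
               window: int = 50_000) -> int:
    """Number of breaks within `window` bp of any boundary (boundaries sorted)."""
    n = 0
    for b in breaks:
        i = bisect_left(boundaries, b)
        # Only the boundaries flanking the break can be the nearest ones.
        neighbours = boundaries[max(i - 1, 0): i + 1]
        if any(abs(b - x) <= window for x in neighbours):
            n += 1
    return n

def empirical_p(observed: int, n_breaks: int, boundaries: List[int],
                chrom_len: int, n_sim: int = 10_000,
                window: int = 50_000, seed: int = 0) -> float:
    """One-sided empirical P value for enrichment near boundaries."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_sim):
        sim = [rng.randrange(chrom_len) for _ in range(n_breaks)]
        if count_near(sim, boundaries, window) >= observed:
            hits += 1
    # Add-one correction so that the estimate is never exactly zero.
    return (hits + 1) / (n_sim + 1)

# Toy check of the counting step: only the second break is within 50 kb.
toy_boundaries = [500_000, 1_000_000]
near = count_near([100_000, 460_000, 900_000], toy_boundaries)
```

With 10,000 simulations the smallest attainable value is roughly 10⁻⁴, so the resolution of the estimate is set by the number of simulations.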

Conversely, we hypothesized that the 81 breaks that interrupt TADs could be enriched for enhancer–promoter rewiring associated with gene regulatory changes. Interrupted TADs include 2,041 genes and, by filtering those with interactions across synteny breaks on the basis of anterior fin H3K4me3 HiChIP analysis, we identified 180 genes that are potentially associated with pectoral fin expansion. Signalling pathway analysis revealed enrichment for Wnt/PCP pathway components (Fig. 4c and Extended Data Fig. 8b,c), including the important regulator prickle1 (Fig. 4d) and other potentially relevant genes such as the hox gene activator psip1³⁴ (Extended Data Fig. 8d–g). Among eight candidate genes of which we determined the expression using whole-mount in situ hybridization (WISH), only prickle1 and psip1 exhibited clear anteriorly enriched expression patterns (Fig. 4e and Supplementary Fig. 6).

To test whether alterations in TADs drove changes in gene expression, we performed comparative WISH analysis of prickle1 between skate and chain catshark (Scyliorhinus retifer) embryos at equivalent stages (Fig. 4e). prickle1 expression was higher in the anterior pectoral fin of skates, whereas shark fins showed only weak expression without spatial enrichment (Supplementary Fig. 7). Similarly, we found differential expression for psip1, suggesting a potential involvement of Hox-related pathways in the skate fin phenotype (Extended Data Fig. 8f,g).

Given the specific pattern of prickle1 expression, we examined the function of the PCP pathway in anterior fin expansion using cell shape analysis, and found that anterior mesenchymal cells are more oval than those in the central and posterior regions (Supplementary Fig. 8). Treatment with a Rho-kinase (ROCK) inhibitor from stage 29 to 31 showed that the overall number of fin rays associated with each tribasal bone of the skate fin (propterygium, mesopterygium and metapterygium) was reduced in the ROCK-inhibited embryos compared with controls, with greater losses in the anterior than in the posterior fin region (Fig. 4f,g, Extended Data Fig. 9 and Supplementary Figs. 9 and 10). Despite significant variation across stage and treatment (Extended Data Fig. 9 and Supplementary Fig. 10), geometric morphometric analyses suggest that ROCK-inhibitor-treated embryos showed a less pronounced anterior expansion of the pectoral fin, in contrast to control embryos in which it extends anteriorly towards the eye by stage 31 (Extended Data Fig. 10). To rule out a general delay in body growth, we implanted acrylic beads soaked in ROCK inhibitor into the anterior pectoral fins at stage 29 and investigated fin rays at stage 31 (Extended Data Fig. 11). In contrast to control embryos with DMSO beads, specimens with ROCK inhibitor exhibited aberrant branching, fusion and loss of fin rays near beads or at potential bead implantation sites (6 out of 9 embryos for 100 μM and 6 out of 10 for 1 mM inhibitor beads). Taken together, these findings suggest that TAD rearrangements had a role in recruiting and repurposing genes and pathways during the evolution of the unique batoid fin morphology.

HOX-driven gli3 repression in skate fins

To examine the transcriptional drivers of skate fin morphology, we generated and compared RNA-seq datasets between pectoral fins and pelvic fins, the latter of which exhibit a characteristic tetrapod gene expression pattern¹. We identified 193 and 117 genes preferentially expressed in pectoral and pelvic fins, respectively (Supplementary Table 4), including several transcription factors and components of different signalling pathways. To identify changes in the appendage gene regulatory network, we compared differentially expressed genes in skate fins with corresponding mouse fore- and hindlimb RNA-seq data^35,36 (Fig. 5a and Supplementary Fig. 11a). Key genes in determining anterior and posterior paired appendages, such as tbx5 and tbx4, display a similar expression pattern, suggesting a conserved function across jawed vertebrates³³. However, several genes, including hox genes and pitx1, the master regulator of vertebrate hindlimb specification³⁷, displayed clear differences between skates and mice (Supplementary Figs. 11a and 12), suggesting that altered regulation of appendage-related factors may contribute to skate pectoral fin expansion.

**Fig. 5: Functional experiments in skate fin samples.**

To examine the transcriptional changes associated with skate pectoral fins, we analysed available anterior and posterior pectoral fin RNA-seq data¹. In skates, hox genes show distinctive expression differences between the anterior and posterior pectoral fin (Supplementary Table 5 and Supplementary Fig. 11b). Anterior expression of the hoxa and hoxd genes forms a secondary AER-like organizer that is probably involved in the overgrowth of the skate pectoral fins^1,38,39. Secondary AER formation is associated with changes in the expression of gli3, a key regulator of hedgehog signalling in appendage patterning^40,41. Specifically, gli3 is expressed in the posterior pectoral fin, in contrast to the predominantly anterior expression seen in pelvic fins and in several vertebrate species¹ (Fig. 5b). Recently, it has been shown in the mouse limb that (1) the Hoxa13 and Hoxd13 genes downregulate Gli3 expression for proper thumb formation⁴², (2) HOX13 proteins bind to and repress Gli3 limb enhancers and (3) compound Hox13 mutants cause anterior extension of Gli3 expression⁴². Anterior Hox genes may also have a role in Gli3 transcriptional regulation, as Hoxa2 binds to several enhancers within the Gli3 locus (shown by ChIP–seq data⁴³; Extended Data Fig. 12a). Overexpression of hoxa2 in zebrafish pectoral fins also induces transcription of wnt3 (an AER marker gene), potentially inhibiting gli3 expression¹. Some of these hox genes, including hoxa13 and hoxa2, are strongly expressed in skate anterior pectoral fins (Supplementary Fig. 11b).

On the basis of this evidence, and considering the redundancy between Hoxd13 and Hoxa13 proteins^44,45,46, we explored the Hox–Gli3 relationship using a validated hoxd13a-GR overexpression construct in zebrafish⁴⁷. After dexamethasone treatment, overexpression of Hoxd13a caused increased fin proliferation, distal expansion of chondrogenic tissue and fin fold reduction⁴⁵. Furthermore, 35% of the injected zebrafish embryos showed a decrease in gli3 fin expression (Extended Data Fig. 12b). Moreover, a gli3 loss-of-function mutant in medaka fish shows multiple radials and rays in a pattern similar to the polydactyly of mouse Gli3 mutants, but also to skate pectoral fins⁴⁸. These findings, together with the anterior expression of 3′ hox genes, suggest that repression of gli3 by Hox proteins is a potential mechanism underlying the striking shape of the skate pectoral fin.

A skate-specific hoxa fin enhancer

We hypothesized that the skate-specific anteroposterior expression differences described above could arise from changes in cis-regulation. To identify CREs, we performed ATAC–seq analysis in anterior and posterior pectoral fins, as well as in whole pelvic fins. DNA methylation profiling (Supplementary Fig. 16a) revealed that differentially accessible ATAC peaks are hypomethylated in developing pectoral and pelvic fins and remain hypomethylated in adult fins (Supplementary Fig. 16b,c), suggesting epigenetic memory as reported in other vertebrates^48,49,50. We used our HiChIP datasets to associate CREs with target genes, and identified many differentially accessible ATAC peaks clustered around genes that are critical for appendage patterning, such as tbx5, tbx4, pitx1 and hox genes (Supplementary Tables 6 and 7). Notably, pitx1 displays a similar regulatory landscape in skate pectoral and pelvic fins (Supplementary Figs. 12 and 13), contrasting with the tissue-specific regulation in mouse⁵¹.

To further investigate anterior Hox gene regulation in skate pectoral fins, we integrated our anterior and posterior pectoral fin ATAC–seq data with existing RNA-seq data from these tissues¹. The few differentially accessible CREs were associated with differentially expressed genes relevant for patterning, such as hoxa2, pax9, tbx2 and alx4 anteriorly, as well as chordin, hoxa9, hoxd10, hoxd11, hoxd12 and grem1 in the posterior region (Supplementary Table 5). Notably, a region located between hoxa1 and hoxa2 is more accessible in anterior pectoral than in posterior pectoral or pelvic fins (Fig. 5c). Zebrafish transgenic assays confirmed enhancer activity for this open chromatin region, which drives gene expression in anterior pectoral fins (Fig. 5d). This element is conserved in cartilaginous fishes but not found in bony fishes (Supplementary Fig. 14). Importantly, the orthologous region in catshark does not promote transgene expression in zebrafish (Fig. 5d), suggesting that, although this region is conserved in different chondrichthyan species, only the skate sequence is functionally active during early development. As this potential enhancer lies close to the hoxa2 promoter, we examined whether it is specific for hoxa2 or shared with other hox genes. Using H3K4me3 HiChIP, Hi-C and virtual 4C data, we observed that this enhancer forms robust interactions with most genes of the hoxa cluster in the anterior pectoral fin (Fig. 5c and Supplementary Fig. 15a), including hoxa13, which is located in the 5′ adjacent TAD (Figs. 3c and 5c) and expressed in the anterior pectoral fin (Fig. 5b and Supplementary Fig. 15b). Overall, these results demonstrate the existence of skate-specific CREs that can be linked to the formation of a secondary AER-like domain in the anterior pectoral fin.

Discussion

Here we combined genomic and functional approaches to uncover fundamental principles of genome regulation in the skate lineage and provide a molecular basis for the formation of wing-like batoid fins². The position of skates in the vertebrate evolutionary tree, and their slow rate of genome evolution, revealed new insights into karyotype stabilization after two rounds of WGD. Gene loss and karyotype evolution dynamics have occurred at a different pace across jawed vertebrate lineages. Analysis of the elephant shark genome found a slower rate of evolution and reduced gene loss compared with tetrapods^25,52. Here we showed that skate not only possesses comparably low rates of change, but also retains numerous ancestral gnathostome chromosomes, and that the smaller chromosome numbers of chicken and spotted gar arose by fusion of these ancestral units. This process was accompanied by considerable gene order rearrangement between cartilaginous and bony fishes, despite extensive conservation of TAD gene contents. Conservation of TADs in the absence of a globally colinear gene order emphasizes the impact of regulatory constraints in maintaining gene groupings.

The skate genome is functionally constrained by 3D regulatory mechanisms that parallel those described in bony fishes and tetrapods, including the presence of a CTCF-orientation code and associated loop extrusion¹³. Our findings imply that these mechanisms emerged early in vertebrate evolution, probably influencing the appearance of phenotypic novelties. These mechanisms further constrain genome evolution, as most skate-specific chromosome rearrangements occur at TAD boundaries, resulting in limited effects on gene regulation, as reported in mammals⁵³. Notably, we observed the complete disappearance of TADs in the paralogous regions prone to gene loss after 2R (beta segments), with the remaining β and α TADs having the same average size and gene number. Although asymmetric paralogue loss after WGDs is considered to be a key factor in the emergence of novel gene regulation⁵, the loss of TADs in beta regions indicates that entire paralogous regulatory units can be lost after WGDs and stresses the importance of regulatory constraints in shaping genome organization. It remains to be seen whether the regulatory potential of missing TADs is incorporated into other regulatory landscapes and enhances pleiotropy.

Related to novel skate morphology, we found lineage-specific TAD-disrupting rearrangements affecting genes involved in PCP signalling, an ancient developmental pathway⁵⁴ that is essential for cell orientation and patterning. We found that the main effector of this pathway, prickle1, has anteriorized pectoral fin expression and is also expressed in anterior pelvic fins and in the clasper (Fig. 4e and Supplementary Fig. 6), two structures that also extend laterally and posteriorly during skate development⁵⁵. Importantly, unique pectoral and pelvic fin morphologies evolved simultaneously during batoid diversification, suggesting the deployment of similar or identical genetic cascades during paired fin development⁵⁶, as suggested by the presence of common markers like wnt3 and hoxa11^1,39. The tissue-specific modulation of the PCP pathway through redeployment of a main pathway effector (prickle1) provides a compelling example of how existing gene networks can evolve new functions through genomic rearrangements.

Finally, we implicate altered regulation of 3′ hox genes and their activator psip1 in novel skate pectoral fin development. Although these genes show posterior expression in most vertebrate appendages (including skate pelvic fins), they are notably expressed in the skate anterior pectoral fin. Our hoxd13a overexpression experiments (Extended Data Fig. 12b) suggest that the increased levels of hox gene expression in anterior pectoral fins, together with other regulatory changes, downregulate Gli3, leading to substantially altered morphology and illustrating the plasticity of the Shh–Gli3–Ptch1 pathway in the evolution of vertebrate appendage morphology^46,56,57,58,59. The identified skate-specific hoxa fin enhancer suggests a cis-regulatory basis for altered Shh–Gli3–Ptch1 signalling. Overall, our study shows how changes in CREs and 3D chromatin organization act as essential forces driving adaptive evolution.

Methods

Animal use

All fish work, including experiments with skate embryos, was conducted according to standard protocols approved by the Institutional Animal Care and Use Committee (IACUC) of Rutgers University (protocol number, 201702646), the IACUC of Marine Biological Laboratory (protocol number, 18-36) and the University of Chicago IACUC (protocol number, 71033). Danio rerio embryos were obtained from AB and Tübingen strains, and manipulated according to protocols approved by the Ethics Committee of the Andalusia Government (license number, 182-41106) and the national and European regulation established. Zebrafish procedures were reviewed and approved by the ethical committees from the University Pablo de Olavide, CSIC, and the Andalusian government, and performed in compliance with all relevant ethical regulations.

Genomic DNA extraction and library construction

Skate DNA was isolated using extensive proteinase K digestion and phenol–chloroform extraction from the muscle of a single L. erinacea specimen. For genome assembly, we generated both accurate short reads and noisy long reads. A contiguous long read (CLR) library for Pacbio sequencing was prepared and sequenced at the Vincent J. Coates Genomics Sequencing Laboratory at UC Berkeley. A total of 32 cells were sequenced on the Pacbio Sequel instrument using the V7 chemistry and yielded a total of 10.2 million Pacbio reads totalling 163 Gb with a median size of 10.9 kb and a read N₅₀ of 29 kb.

A paired-end Illumina library with a 600 bp insert was also sequenced for 2 × 250 bp in rapid run mode on the HiSeq 2500 instrument at BGI yielding 641 million reads and 160.3 Gb of sequence.

Genome assembly

Genome size was estimated by analysing a k-mer spectrum with a mer size of 31. By fitting a multimodal distribution using Genomescope 2.0, we estimated a genome size of 2.13 Gb (as well as a heterozygosity of 1.56%)⁶². To take advantage of both short and long reads, we opted for a hybrid assembly strategy. First, we generated de Bruijn graph contigs with megahit (v.1.1.1) using a multi-k-mer approach (31, 51, 71, 91 and 111-mers) and filtering out k-mers with a multiplicity lower than 5 (--min-count 5). We obtained 2,750,419 contigs with an N₅₀ of 1,129 bp representing a total of 2.23 Gb. We then used these contigs to prime the alignment and assembly of the Pacbio reads with dbg2olc (c. 10037fa)⁶³ using a k-mer of 17 (k 17), a threshold on k-mer coverage of 3 (KmerCovTh 3), a minimal overlap of 30 (MinOverlap 30) and an adaptive threshold of 0.01 (AdaptiveTh 0.01) and removing chimeric reads from the dataset (RemoveChimera 1). This assembler generated an uncorrected backbone of overlapping reads with an N₅₀ of 4.96 Mb and a total size of 2.25 Gb. To correct sequencing errors, we subjected this sequence file to two successive rounds of consensus by aligning Pacbio reads with minimap2 (v.2.12, map-pb setting)⁶⁴ and Racon (v.1.3.1) using the default parameters followed by one final round of consensus using the Illumina reads. We evaluated the progress of the polishing process with the BUSCO tool (v.3.0.2) that seeks widely represented single-copy gene families in the assembly⁶⁵. Our final polished assembly contained 95.1% of vertebrate BUSCO genes (Supplementary Table 1). To exclude residual haploid contigs from the assembly, we aligned Illumina reads once more using bwa and computed a distribution of coverage that showed some residual positions at half coverage (31×). We used purge_haplotigs (v.1.0.2)⁶⁶ by defining a coverage threshold between haploid and diploid contigs at 40× (and a minimum of 10× and maximum of 100×).
The filtered assembly has a size of 2.19 Gb, an N₅₀ of 5.35 Mb and 2,595 contigs in total, and the same BUSCO statistics as the unfiltered one (Supplementary Table 1).
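
The N₅₀ values quoted above summarize contiguity: half of the assembled bases lie in sequences at least that long. As an illustration only, a minimal Python sketch of this statistic follows. The function name and the toy lengths are hypothetical and are not part of the assembly pipeline described here.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total number of bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Toy contig lengths in bp (hypothetical; not taken from the skate assembly).
toy_contigs = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_contigs)
```

In practice the lengths would be read from the FASTA index of each assembly stage, so that the contig, backbone and filtered assemblies can be compared with the same definition.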

This assembly was then scaffolded using chromatin-contact evidence obtained from Hi-C sequencing analysis of L. erinacea fins (see below) at Dovetail Genomics using the HiRise pipeline⁶⁷. The accuracy of the resulting scaffolded assembly was verified and proofread by carefully inspecting the contact map in Juicebox⁶⁸ and HiGlass browser⁶⁹. This assembly comprises 50 scaffolds larger than 1 Mb that represent 92% of the assembly size and 39 scaffolds larger than 10 Mb that show mostly internal contacts. Although no karyotyping evidence is directly available for L. erinacea, closely related species show a haploid number of 49 chromosomes, which is consistent with the observed number of chromosomes¹⁴.

As the final assembly size was smaller than the experimentally assessed genome size of 3.5 Gb, we performed gap closing on the final assembly using PBjelly⁷⁰ that proceeds through alignment of the PacBio reads on each gap border and local reassembly. The effect on the assembly statistics was marginal, but we used this assembly as our final one (Supplementary Table 1).

Annotation

RNA-seq reads of strand-specific libraries from five bulk embryonic stages and 13 organs were aligned to the genome using STAR (v.2.5.2b)⁷¹ and each library assembled independently using stringtie (v.1.3.3)⁷². Stringtie assemblies were then merged using TACO (v.0.7.3)⁷³. RNA-seq reads were also assembled de novo using Trinity (v.2.8.4)⁷⁴. Finally, the iso-seq protocol was applied to generate full-length transcripts using Pacbio long-reads. Both Trinity assembled transcripts and iso-seq transcripts were aligned to the genome using GMAP (v.2018-07-04)⁷⁵. Then, both TACO assembled transcripts and aligned de novo transcripts were leveraged using Mikado (v.1.2.1)⁷⁶ to generate one consensus reference transcriptome, while predicting coding loci using Transdecoder (v.5.5.0). Using selected transcripts (2 introns or more, complete CDS, single hit against swissprot), we built an Augustus (v.3.3.3) hidden Markov model (HMM) profile for ab initio gene prediction⁷⁷. We predicted skate genes using this profile and hints derived from (1) the mikado transcript assembly (exon hints); (2) intron hints obtained using bam2hints on a merged bam alignment of the RNA-seq data after filtering spurious junctions with portcullis (v.1.2.0)⁷⁸; and (3) an alignment of human proteins using exonerate (v.2.2.0)⁷⁹.

A repeat library was constructed using Repeatmodeler and repeats were masked in the genome using Repeatmasker (v.4.0.7). We filtered out gene models that overlap massively with mobile elements and obtained 30,489 gene models. For these genes, isoforms and untranslated regions were added by two rounds of reconciliation with an assembled transcriptome using PASA⁸⁰. Our set of coding genes includes 5,800 PFAM domains, a similar value to other well-annotated vertebrate genomes. To further examine the validity of gene models, we assessed (1) whether their coding sequence showed similarity to that of another species using gene family reconstruction (see below); (2) whether they possessed an annotated PFAM domain; and (3) whether they are expressed above 2 FPKM in at least one RNA-seq dataset. These criteria reduced the number of bona fide coding genes to 26,715.

Gene family, synteny and phylogenetic analyses

We performed gene family reconstruction using OMA (v.2.4.1)⁸¹ between selected vertebrate species to identify single-copy orthologues. These orthologues were used to infer gene phylogeny after processing as described previously⁸²: HMM profiles were built for each orthologous gene family and searched against translated transcriptomes using the HMMer tool (v.3.1b2)⁸³. Sequences derived from each orthologue were aligned using MAFFT (v.7.3)⁸⁴, trimmed for misaligned regions using BMGE (v.1.12)⁸⁵ and assembled in a supermatrix. Phylogeny was estimated using IQTREE (v.2.1.1) assuming a C60+R model and divergence times estimated using Phylobayes (v.4.1e)⁸⁶ assuming a CAT+GTR substitution model, a CIR clock model, soft constraints and a birth-death prior on divergence time. Calibrations were taken from previous papers^18,87.

We identified conserved segments across vertebrates, by counting single-copy genes derived from OMA clustering sharing the same set of chromosomal locations in selected species, to identify putative ancestral vertebrate units. We examined conserved syntenic orthology by identifying sets of genes shared by pairs of chromosomes in distinct species using reciprocal best hits computed using MMseqs2⁸⁸. We performed a Fisher test to detect pairs of chromosomes showing significant enrichment, and assigned ancestral linkage groups (ALG) based on comparison with amphioxus and sea scallop. We computed gene family composition and analysed patterns of gene loss and duplications using reconstructed gene trees derived from gene families established with Broccoli⁸⁹ and subjected to species-tree aware gene tree inference using Generax⁹⁰.
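
To make the chromosome-pair enrichment test concrete, the following Python sketch computes a one-sided Fisher exact P value from a 2 × 2 table of orthologue counts. It is a minimal sketch under stated assumptions: the text does not specify whether the test was one- or two-sided or how multiple testing was handled, and the function name and table layout are hypothetical.

```python
from math import comb


def fisher_enrichment_p(a, b, c, d):
    """One-sided Fisher exact test for enrichment (assumed sidedness).

    2 x 2 table for a chromosome A in species 1 and a chromosome B in species 2:
      a: orthologues on A and on B
      b: orthologues on A but not on B
      c: orthologues on B but not on A
      d: orthologues on neither
    Returns P(X >= a) under the hypergeometric null with fixed margins.
    """
    n_total = a + b + c + d
    row = a + b          # orthologues located on chromosome A
    col = a + c          # orthologues located on chromosome B
    denom = comb(n_total, row)
    upper = min(row, col)
    tail = sum(
        comb(col, k) * comb(n_total - col, row - k)
        for k in range(a, upper + 1)
    )
    return tail / denom


# Toy counts (hypothetical): all shared genes fall on the same chromosome pair.
p_enriched = fisher_enrichment_p(10, 0, 0, 10)
```

A pair of chromosomes would then be reported as syntenic when its P value falls below a chosen threshold after correction for the number of pairs tested; that threshold is not given in the text.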

Hi-C

The Hi-C protocol was performed as described previously with minor modifications^91,92,93. Two biological replicates of L. erinacea Stg.30 pectoral fin buds, each consisting of ten fins, were fixed in a final concentration of 1% PFA for 10 min at room temperature. Fixation was stopped by placing the samples on ice and by adding 1 M glycine up to a concentration of 0.125 M. The quenched PFA solution was then removed and the tissue was resuspended in ice-cold Hi-C Lysis Buffer (10 mM pH 8 Tris-HCl, 10 mM NaCl, 0.2% NP-40 and 1× Roche Complete protease inhibitor). The lysis was helped with a Dounce Homogenizer Pestle A on ice (series of 10 strokes in 10 min intervals). Nuclei were then pelleted by centrifugation for 5 min, 750 rcf at 4 °C, washed twice with 500 µl of 1× PBS and finally resuspended with water to final volume of 50 µl. A total of 50 µl of 1% SDS was then added and the sample incubated 10 min at 62 °C. The SDS was then quenched by adding 292 µl water and 50 µl of 10% Triton X-100. Chromatin was then digested by adding 50 µl of 10× DpnII buffer and 8 µl of 50 U µl⁻¹ DpnII (NEB, R0543M) followed by incubation at 37 °C overnight in a ThermoMixer with shaking (800 rpm). DpnII was then heat-inactivated at 65 °C for 20 min with no shaking. Chromatin sticky ends were then filled-in and marked with biotin by adding 50 µl of Fill-in Master Mix (5 µl of 10× NEBuffer2, 1.5 µl of 10 mM mix of dCTP, dGTP and dTTP, 37.5 µl of 0.4 mM biotin-dATP (Thermo Fisher Scientific, 19524016) and 10 µl of 5 U µl⁻¹ Klenow (NEB, M0210)) and incubating for 1 h at 37 °C with rotation. Filled-in chromatin was then ligated by adding 500 µl of ligation master mix (100 µl of 10× NEB T4 DNA ligase buffer with ATP (NEB, B0202), 100 µl of 10% Triton X-100, 10 µl of 10 mg ml⁻¹ BSA and 6.5 µl of 400 U µl⁻¹ of T4 DNA ligase (NEB, M0202)) and incubated 4 h at 16 °C with mixing (800 rpm, 30 s pulses every 4 min). 
Ligated chromatin was then reverse-cross-linked by adding 50 µl of 10 mg ml⁻¹ proteinase K and incubating the sample at 65 °C for 2 h. De-cross-linking was completed by adding 50 µl extra of proteinase K and incubating overnight at 65 °C. DNA from the reverse-cross-linked chromatin was purified using phenol–chloroform extraction and ethanol precipitation. Pelleted DNA was resuspended in 100 µl of TLE. Biotin removal from unligated ends was performed in a final volume of 130 µl with 5 µg of the purified DNA, 13 µl of 10× NEBuffer2.1, 3.25 µl of 1 mM dNTPs, 5 µl of 3 U µl⁻¹ T4 DNA polymerase (NEB, M0203L). The sample was incubated in a thermocycler 4 h at 20 °C and the reaction subsequently stopped by adding EDTA to a final concentration of 10 mM followed by 20 min at 75 °C. A total of 130 µl was used for DNA sonication in a M220 Covaris Sonicator (peak power, 50; duty factor, 20%; cycles/burst, 200; duration, 65 s). After sonication, DNA was size-selected using AMPure XP beads (Agencourt, A63881). In brief, in a first selection, 0.6× bead mix was used and the supernatant was recovered. In the second selection, 1.2× bead mix was used and the bead fraction was recovered. Size-selected DNA was resuspended in 50 µl of TLE and then processed for end repair. End repair was performed by adding 20 µl of the end repair mix (7 µl of 10× NEB ligation buffer, 1.75 µl of 10 mM dNTP mix, 2.5 µl of T4 DNA polymerase (3 U µl⁻¹ NEB M0203), 2.5 µl of T4 PNK (10 U µl⁻¹, NEB M0201) and 0.5 µl of Klenow DNA polymerase (5 U µl⁻¹, NEB, M0210)) and incubating in a thermocycler with the following program: 15 °C for 15 min, 25 °C for 15 min and 75 °C for 20 min. Biotinylated ligation ends were then pulled down using 10 µl of Dynabeads MyOne Streptavidin C1 (Invitrogen, 650.01) per µg of DNA. 
The beads were washed twice with Tween wash buffer (85 mM Tris-HCl pH 8, 0.5 mM EDTA, 1 M NaCl, 0.05% Tween-20) before being resuspended in 400 µl of 2× bead binding buffer (10 mM Tris-HCl pH 8, 1 mM EDTA, 50 mM NaCl) and incubated for 15 min with rotation with 400 µl of the end repaired sample (70 µl of end repair reaction plus 330 µl of TLE). The beads were then washed once with 400 µl of 1× bead binding buffer and once with 100 µl TLE before being finally resuspended in a final volume of 41 µl. A-tailing was then performed in a total volume of 50 µl by adding 5 µl of 10× NEBuffer2.1, 1 µl of 10 mM dATP and 3 µl of 5 U µl⁻¹ Klenow fragment 3′→5′ exo- (NEB, M0212) in the thermocycler with the following program: 37 °C for 30 min then 75 °C for 20 min. A-tailed sample was then washed with 400 µl of 1× T4 ligase buffer and resuspended in 40 µl of the same buffer to prepare it for the adaptor ligation, which was performed by adding 1 µl of 10× T4 ligation buffer, 4 µl of T4 DNA ligase and 5 µl of 15 µM Illumina paired-end pre-annealed adapters. The reaction was incubated for 2 h at room temperature and the beads were then washed twice with 1× NEBuffer2.1. The beads were resuspended in 50 µl of the final library PCR reaction for library generation (25 µl of NEBNext High-Fidelity 2× PCR Mix, 0.5 µl of PE1 primer 25 µM and 0.5 µl of PE2 primer 25 µM plus milliQ water). The PCR was performed in a thermocycler with the following program: 98 °C for 60 s; 5–10 cycles of 98 °C for 10 s, 65 °C for 30 s, 72 °C for 30 s and 72 °C for 5 min. Test PCRs were used to determine the number of cycles. Final single-sided AMPure XP bead purification was performed to eliminate primer-dimers (1.1× proportion). Final libraries were sent for paired-end sequencing.

Hi-C analysis

Hi-C paired-end reads were mapped to the skate genome using BWA⁹⁴. Ligation events (Hi-C pairs) were then detected and sorted, and PCR duplicates were removed using the pairtools package (https://github.com/mirnylab/pairtools). Unligated and self-ligated events (dangling and extra-dangling ends, respectively) were filtered out by removing contacts mapping to the same or adjacent restriction fragments. The resulting filtered pairs file was converted to a .tsv file that was used as input for Juicer Tools 1.13.02 Pre, which generated multiresolution .hic files⁹⁵. These analyses were performed using previously published custom scripts (https://gitlab.com/rdacemel/hic_ctcf-null): the hic_pipe.py script was first used to generate .tsv files with the filtered pairs, and the filt2hic.sh script was then used to generate Juicer .hic files. Normalized Hi-C matrices and other values described below, such as insulation scores, TAD boundaries, aggregate TADs, Pearson’s correlation matrices and eigenvectors, were calculated and visualized using FAN-C⁹⁶ and custom scripts available in the GitLab repository (https://gitlab.com/skategenome). The observed–expected interchromosomal matrix (Fig. 1d) was calculated by counting interchromosomal normalized interactions in the 1 Mb KR normalized matrix (with the two replicates merged). The expected matrix was calculated as if interchromosomal interactions between two given chromosomes were proportional to the total number of interchromosomal interactions of these two chromosomes. A/B compartments were first called in each of the replicates separately using the first eigenvector of the 500 kb KR normalized matrix. Eigenvector correlation was high (r = 0.91, Extended Data Fig. 3b) and the replicates were then merged. The first eigenvector was calculated again and oriented according to open chromatin using the amount of ATAC–seq signal in the anterior pectoral fin sample.
The same strategy was used to look at compartment differences between anterior and posterior fin Hi-C, but this time using 250 kb resolution (Extended Data Fig. 6). ATAC–seq, percentage of GC, gene models and RNA-seq signal overlaps with compartments were calculated using bedtools intersect⁹⁷. Compartment calling and the different overlaps are available in Supplementary Table 8. The saddle plot was calculated using FAN-C. To define TADs, insulation scores were also calculated separately in the 25 kb resolution KR matrices of each of the replicates (using FAN-C and as described previously⁹⁸ with a window size of 500 kb). Again, correlation between insulation scores of both replicates was high (r > 0.94, Extended Data Figs. 4b and 6f). Definitive boundaries and TADs were then calculated in a merged 25 kb matrix with a window size of 500 kb and using a boundary score cut-off of 1 (Supplementary Table 9) or no cut-off for interspecies comparison analyses with mouse and zebrafish. CTCF motifs and their relative orientations were mined inside ChIP–seq peaks in mouse and zebrafish or merged ATAC–seq peaks between the anterior and posterior pectoral fin samples using Clover⁹⁹ or FIMO¹⁰⁰ (MA0139.1 Jaspar PWM, PWM score threshold of 8). They were later overlapped with previously calculated boundaries. Boundary feature heat maps from Supplementary Fig. 5 were generated using profileplyr¹⁰¹ (https://bioconductor.org/packages/release/bioc/html/profileplyr.html) after binning the different signals in 5 kb windowed bigwig files. Chromatin loops were called using HICCUPS⁹⁵ with the default parameters in merged replicates of the anterior and posterior fin Hi-C experiments, and in a megamap merging anterior and posterior fin Hi-C maps. A consensus set of loops was then calculated using hicMergeLoops from the HiCExplorer suite¹⁰² and reads were counted in the different replicate 10 kb resolution Hi-C maps to perform the differential loop analysis with EdgeR¹⁰³. 
Virtual 4C-seqs were plotted from 10-kb-resolution Hi-C matrices using custom scripts.
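
The expected interchromosomal matrix described above can be sketched as follows. This is an illustrative Python sketch, not the custom scripts used in the study: the expected count for a pair of chromosomes is taken as proportional to the product of their total interchromosomal contacts, and the scaling constant (chosen here so that expected and observed totals match) is an assumption, as are the function name and toy matrix.

```python
def observed_over_expected(obs):
    """Observed/expected ratios for inter-chromosomal contacts.

    obs: symmetric square list of lists; obs[i][j] is the number of
    (normalized) contacts between chromosomes i and j. The diagonal
    (intra-chromosomal contacts) is ignored.
    """
    n = len(obs)
    # Total inter-chromosomal contacts made by each chromosome.
    totals = [sum(obs[i][j] for j in range(n) if j != i) for i in range(n)]
    # Unscaled expectation: product of the two chromosome totals.
    raw = [
        [totals[i] * totals[j] if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    raw_sum = sum(sum(row) for row in raw)
    # Assumed scaling: expected and observed matrices share the same total.
    scale = sum(totals) / raw_sum if raw_sum else 0.0
    ratio = [[None] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and raw[i][j] > 0:
                ratio[i][j] = obs[i][j] / (raw[i][j] * scale)
    return ratio


# Toy matrix (hypothetical): three chromosomes contacting each other equally.
toy_obs = [[0, 10, 10], [10, 0, 10], [10, 10, 0]]
toy_ratio = observed_over_expected(toy_obs)
```

Ratios above 1 indicate chromosome pairs that contact each other more often than their overall interchromosomal contact frequencies would predict.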

HiChIP

HiChIP assays were performed as previously described¹⁰⁴, with some modifications. In brief, 10 anterior and posterior pectoral fins of stage 30 skate embryos were fixed in a final concentration of 1% PFA for 10 min at room temperature. Fixation was quenched with 1 M glycine up to a concentration of 0.125 M. The tissue was then resuspended in 5 ml cell lysis buffer and homogenized using a Douncer on ice. After the lysis, nuclei were pelleted by centrifuging at 2,500 rcf, and washed in 500 µl of lysis buffer. Chromatin digestion and ligation, ChIP, tagmentation and library preparation were performed as previously described⁹². The antibody used was a ChIP-grade anti-histone H3 trimethyl K4 from Abcam (ab8580). The total amount of antibody used was 20 µg, at a concentration of 1 µg µl⁻¹.

HiChIP analysis

Paired-end reads from HiChIP experiments were aligned to the skate genome using the TADbit pipeline¹⁰⁵ with the default settings. In brief, duplicate reads were removed, DpnII restriction fragments were assigned to resulting read pairs, valid interactions were kept by filtering out unligated and self-ligated events and multiresolution interaction matrices were generated. Dangling-end read pairs were used to create 1D signal bedfiles that are equivalent to those of ChIP–seq experiments. Coverage profiles were then generated in the bedgraph format using the bedtools genomecov tool (https://bedtools.readthedocs.io/en/latest/content/tools/genomecov.html), and bedgraph to bigwig conversions were also performed for visualization using the bedGraphToBigWig tool from UCSC Kent Utils (https://github.com/ucscGenomeBrowser/kent). 1D signal bedgraph files were used to call peaks with MACS2¹⁰⁶ using the no model and extsize 147 parameters and FDR < 0.01.

FitHiChIP¹⁰⁷ was used to identify ‘peak-to-all’ interactions at 10 kb resolution using HiChIP-filtered pairs and peaks derived from dangling ends. Loops were called using a genomic distance of between 20 kb and 2 Mb, and coverage bias correction was performed to achieve normalization. FitHiChIP loops with q values smaller than 0.1 were retained for further analyses. Further filtering was performed to enrich enhancer–promoter interactions. First, loops established by two H3K4me3 peaks (likely promoter–promoter interactions) or no H3K4me3 peaks (likely enhancer–enhancer and others) were filtered out. Second, loops related to the H3K4me3 peak of the same gene promoter are grouped together into a common ‘regulatory landscape’, composed of a promoter anchor and several distal anchors. Then, regulatory landscapes with only one distal anchor were filtered out. Third, to filter out further spurious interactions, we used the rationale that genomic bins that interact with a given promoter rarely do so in isolation. We therefore calculated a distance cut-off for ‘interaction gaps’ in regulatory landscapes. Regulatory landscapes containing interaction gaps bigger than the distance cut-off were trimmed and the distal anchors beyond the interaction gap were discarded. The cut-off was determined for each sample independently by calculating the distribution of the biggest gaps (calculating the biggest gap for each regulatory landscape) and setting the cut-off to the sum of the third quartile plus twice the interquartile range (classic outlier definition). Overlaps with ATAC–seq peaks in the pectoral fin were calculated using bedtools intersect (Extended Data Fig. 5a). Inter-TAD loops were also calculated using bedtools intersect -c using the TADs and the loops. Loops intersecting more than one TAD were considered inter-TAD loops. Randomized controls were generated shuffling TAD positions before the intersection using bedtools shuffle. 
For differential analysis between the anterior and the posterior fin, filtered distal anchors were fused when closer than 20 kb using GenomicRanges reduce¹⁰⁸. The loops with the merged distal anchors are provided in Supplementary Table 10. To perform the differential analysis, the number of reads supporting the union set of loops was extracted for each of the sample replicates. Correlations shown in Extended Data Fig. 5c and the differential analysis performed using EdgeR¹⁰³ (Extended Data Fig. 5d) were calculated with this table. An FDR cut-off of 0.1 was chosen to consider a loop to be significantly stronger in either the anterior or the posterior fin. Custom code used for enhancer–promoter loop filtering and differential analysis is included in the GitLab repository (https://gitlab.com/skategenome).
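
The interaction-gap filter described above can be sketched in Python as follows. This is a minimal sketch rather than the code in the GitLab repository: anchors are reduced to single coordinates, the distance from the promoter to its first distal anchor is treated as a gap, and the quartiles come from the default method of `statistics.quantiles`. All of these are assumptions, as are the function names.

```python
import statistics


def gap_cutoff(biggest_gaps, iqr_multiplier=2.0):
    """Cut-off = third quartile + multiplier x interquartile range.

    biggest_gaps: the largest interaction gap of each regulatory landscape
    in one sample. The default multiplier of 2 follows the text.
    """
    q1, _, q3 = statistics.quantiles(biggest_gaps, n=4)
    return q3 + iqr_multiplier * (q3 - q1)


def trim_landscape(promoter, anchors, cutoff):
    """Discard distal anchors lying beyond a gap larger than the cut-off.

    Walks outward from the promoter on each side and stops at the first
    gap between consecutive positions that exceeds the cut-off.
    """
    kept = []
    downstream = sorted(a for a in anchors if a >= promoter)
    upstream = sorted((a for a in anchors if a < promoter), reverse=True)
    for side in (downstream, upstream):
        previous = promoter
        for position in side:
            if abs(position - previous) > cutoff:
                break
            kept.append(position)
            previous = position
    return sorted(kept)


# Toy example (hypothetical coordinates): anchors at 300 and 10 are trimmed.
toy_kept = trim_landscape(100, [110, 120, 300, 90, 10], cutoff=50)
```

Because the cut-off is derived from each sample's own gap distribution, the same landscape may be trimmed differently in the anterior and posterior fin datasets.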

RNA-seq

RNA-seq experiments from anterior and posterior pectoral and whole pelvic skate fins were performed as previously described⁶. In brief, two anterior or posterior pectoral and two pelvic fins of stage 31 skate embryos were used for each biological replicate. Total RNA was extracted from each sample using Direct-zol RNA MiniPrep (Zymo Research) and sent for library preparation and sequencing.

RNA-seq analysis

For the RNA-seq data analysis, we used the nf-core/rnaseq pipeline (v.1.4)¹⁰⁹ for read alignment, read count and quality control of the results. After this, we performed a differential gene expression analysis using the DESeq2 R library (v.1.30.1)¹¹⁰. Gene Ontology term enrichment analysis was performed using TopGO R library (v.2.42.0)¹¹¹, with the elim algorithm and Fisher test, retaining terms with P < 0.01.

ATAC–seq

ATAC–seq experiments from anterior and posterior regions of pectoral skate fins and whole pelvic fins were performed as previously described^6,112. After dissecting the pectoral fins, one anterior and one posterior region were used for each biological replicate. In the case of pelvic fins, one fin was used for each biological replicate. Tissue was homogenized using a Pellet Pestle Motor (Kimble) coupled to a plastic pestle, and treated with lysis buffer. Individual cells were counted, and 75,000 cells were tagmented. ATAC–seq libraries were generated by PCR, using 13 cycles of amplification, purified and sent for external sequencing.

ATAC–seq analysis

ATAC–seq data analysis was performed using the nf-core/atacseq pipeline (v.1.0.0)¹⁰⁹, which runs Nextflow (v.19.10.0)¹¹³, for quality controls, read alignment against the new skate assembly, filtering, data visualization, peak calling, read count and differential accessibility analysis. To compare whole pectoral and pelvic fin samples, we merged the anterior and posterior pectoral samples into one single pectoral fin sample.

Microsyntenic pair analysis

The analysis of microsyntenic pairs shared across the gnathostome lineage was based on a previously described analysis¹¹⁴. In brief, we used the genome assembly and annotation presented in this paper for the little skate in combination with public assemblies and annotations for mouse and garfish downloaded from ensembl (www.ensembl.org; Mus musculus: GRCm38v101; Lepisosteus oculatus: LepOcu1v104). Annotations in .gtf format were converted to genepred with gtfToGenePred (UCSC Kent Utils). Then, for each pair of consecutive genes in skates, we determined whether the orthologue pairs of genes in mouse and garfish were also consecutive (allowing 4 intervening gene models as described previously¹¹⁴). The intergenic space between pairs of genes categorized as syntenic and non-syntenic in skates was overlapped with TAD boundaries and with TADs again using bedtools intersect. TADs were categorized according to the presence or absence of conserved microsyntenic pairs and then the overlap between the different TADs with ATAC–seq peaks or HiChIP loops was calculated again using bedtools intersect. A list of conserved microsyntenic pairs is provided in Supplementary Table 11 and the code is available in the GitLab repository (https://gitlab.com/skategenome).
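
The pairwise test at the core of this analysis can be sketched as below. The Python sketch is illustrative and is not the code from the repository: gene order is represented as ranked positions, orthology as a one-to-one dictionary, and the function name and toy identifiers are hypothetical. A pair would be called conserved across gnathostomes when it passes in both mouse and garfish.

```python
def conserved_pairs(skate_order, orthologs, other_positions, max_intervening=4):
    """Find consecutive skate gene pairs whose orthologues are also neighbours.

    skate_order: list of (chromosome, gene) sorted along the skate genome
    orthologs: dict mapping a skate gene to its orthologue in the other species
    other_positions: dict mapping an orthologue to (chromosome, rank), where
        rank is the gene's index along its chromosome
    max_intervening: number of intervening gene models tolerated (4 in the text)
    """
    pairs = []
    for (chrom_a, gene_a), (chrom_b, gene_b) in zip(skate_order, skate_order[1:]):
        if chrom_a != chrom_b:
            continue  # not consecutive on the same skate chromosome
        orth_a, orth_b = orthologs.get(gene_a), orthologs.get(gene_b)
        if orth_a not in other_positions or orth_b not in other_positions:
            continue  # one gene lacks a placed orthologue
        (other_chrom_a, rank_a) = other_positions[orth_a]
        (other_chrom_b, rank_b) = other_positions[orth_b]
        if other_chrom_a == other_chrom_b and 0 < abs(rank_a - rank_b) <= max_intervening + 1:
            pairs.append((gene_a, gene_b))
    return pairs


# Toy data (hypothetical identifiers).
toy_skate = [("chr1", "s1"), ("chr1", "s2"), ("chr1", "s3"), ("chr2", "s4")]
toy_orth = {"s1": "m1", "s2": "m2", "s3": "m3", "s4": "m4"}
toy_mouse = {"m1": ("chr5", 10), "m2": ("chr5", 13), "m3": ("chr9", 2), "m4": ("chr9", 3)}
toy_pairs = conserved_pairs(toy_skate, toy_orth, toy_mouse)
```

Running the function once per comparison species and intersecting the resulting sets gives the pairs shared by all three genomes.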

TAD rearrangements in the skate lineage

To identify skate-specific TAD rearrangements, global alignments were performed with lastz¹¹⁵ against five different gnathostome genomes using as a reference the little skate assembly presented in this study. The chosen species were the thorny skate Amblyraja radiata, two species of shark (the white shark Carcharodon carcharias and the white-spotted bamboo shark Chiloscyllium plagiosum), one chimaera (the elephant shark Callorhinchus milii) and a bony fish (the spotted gar Lepisosteus oculatus). The parameters of lastz were adapted to the phylogenetic distance with skate according to previous recommendations¹¹⁶ (see assemblies, substitution matrices and lastz parameters used in Supplementary Table 12). Syntenic chains and nets were then devised as proposed elsewhere¹¹⁷ and further polished using chainCleaner¹¹⁸. Synteny breaks were then defined as the junctions between syntenic nets of any level, excluding those that were caused by the end of a scaffold for those genome assemblies that were not chromosome grade (white shark, elephant shark). The overlap between synteny breaks of different species was inferred using bedtools multiinter. Breaks that were found to be common in sharks, chimaeras and a bony fish (garfish) were further considered. The distance between candidate synteny breaks and TAD boundaries (Supplementary Table 9) was next determined using bedtools closest -d and breaks that were located closer than 50 kb to a TAD boundary were discarded. Randomized analysis of the overlap between synteny breaks and TAD boundaries (Fig. 4b) was performed, combining bedtools closest and bedtools shuffle. Finally, we selected candidate genes that displayed enhancer–promoter HiChIP interactions in the anterior or the posterior pectoral fin samples that crossed the synteny break, again using bedtools intersect. Enrichment of signalling pathways of candidate genes was performed using the ReactomePA¹¹⁹ and ClusterProfiler¹²⁰ R packages.
A list of the final synteny breaks and candidate genes is provided in Supplementary Table 13, and the exact code used is provided at the GitLab code repository (https://gitlab.com/skategenome).
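The distance and randomization steps can be pictured with a simplified, pure-Python stand-in for bedtools closest -d and bedtools shuffle. The sketch treats breaks and boundaries as single coordinates on one chromosome and places random breaks uniformly; all function names, coordinates and the window size in the example are illustrative assumptions, not the parameters of the published analysis.

```python
import bisect
import random
from typing import List


def nearest_distance(pos: int, boundaries: List[int]) -> int:
    """Distance from pos to the closest boundary (boundaries sorted, non-empty)."""
    i = bisect.bisect_left(boundaries, pos)
    candidates = boundaries[max(i - 1, 0): i + 1]  # neighbours on either side
    return min(abs(pos - b) for b in candidates)


def fraction_near(breaks: List[int], boundaries: List[int], window: int) -> float:
    """Fraction of breaks lying within `window` bp of any boundary."""
    near = sum(nearest_distance(p, boundaries) <= window for p in breaks)
    return near / len(breaks)


def shuffled_fractions(n_breaks: int, chrom_len: int, boundaries: List[int],
                       window: int, n_iter: int = 1000, seed: int = 0) -> List[float]:
    """Null distribution: the same statistic for uniformly placed random breaks."""
    rng = random.Random(seed)
    null = []
    for _ in range(n_iter):
        fake = [rng.randrange(chrom_len) for _ in range(n_breaks)]
        null.append(fraction_near(fake, boundaries, window))
    return null


# Toy example with invented coordinates
tad_boundaries = [100_000, 500_000, 900_000]
synteny_breaks = [520_000, 300_000]
observed = fraction_near(synteny_breaks, tad_boundaries, window=50_000)
null = shuffled_fractions(len(synteny_breaks), 1_000_000, tad_boundaries, 50_000, n_iter=100)
empirical_p = sum(x >= observed for x in null) / len(null)
```

Comparing the observed fraction with the shuffled distribution gives an empirical estimate of how unusual the observed proximity is under random placement.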

WISH

Skate and shark embryos were recovered from egg cases at stages 27 and 30 and fixed in 4% PFA at 4 °C overnight. The next day, the embryos were rinsed three times with PBS-0.1% Tween-20, soaked in 100% methanol and stored at −80 °C. WISH was conducted as previously described¹, except that the embryos and probes were hybridized at 72 °C.

Gain of function analysis

Experiments were performed as previously described⁴⁷. Zebrafish eggs were injected at the one-cell stage with hoxd13a-GR mRNA (70 pg per embryo). Dexamethasone at 10 nM (Sigma-Aldrich, D4902) was added to the medium at 24 h after fertilization, and embryos were fixed at 48 h after fertilization.

RT–qPCR

The pectoral fins of three shark juveniles (S. retifer) were dissected out in DEPC-PBS at stage 30. Three replicates were prepared. Total RNA was extracted separately from each replicate using TRIzol (Invitrogen). cDNA was synthesized from the total RNA using the iScript cDNA Synthesis Kit (Bio-Rad). Then, quantitative reverse transcription PCR (RT–qPCR) analysis of gapdh and prickle1 was conducted using the KAPA HiFi HotStart ReadyMix PCR Kit (Kapa Biosystems) and the Applied Biosystems 7300 Real-Time PCR System. A list of the primers used in this study is provided in Supplementary Table 14. The Ct values obtained from RT–qPCR were converted to arbitrary gene expression values.
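The text does not state how Ct values were converted. One common convention, shown here purely as an assumed illustration, is the 2^−ΔCt transformation with gapdh as the reference gene and an assumed amplification efficiency of about 100%. The Ct values in the example are invented.

```python
from statistics import mean
from typing import List


def relative_expression(ct_target: List[float], ct_reference: List[float]) -> List[float]:
    """Per-replicate 2^-(Ct_target - Ct_reference); assumes ~100% primer efficiency."""
    return [2.0 ** -(t - r) for t, r in zip(ct_target, ct_reference)]


# Invented Ct values for three replicates (not data from the study)
prickle1_ct = [26.0, 27.0, 25.0]
gapdh_ct = [20.0, 20.0, 20.0]

values = relative_expression(prickle1_ct, gapdh_ct)
average = mean(values)  # arbitrary units relative to gapdh
```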

Cell elongation analysis

Pectoral fins were dissected from stage 29 skate embryos and fixed in 4% paraformaldehyde overnight. The next day, the fins were rinsed with PBS including 0.1% Triton X-100 and incubated in the blocking buffer (10% sheep serum and 0.1% BSA in PBS-0.1% Triton X-100) at room temperature for 1 h. The blocking buffer was then replaced with blocking buffer including CellMask Deep Red Plasma Membrane Stain (1:1,000 dilution, Invitrogen) and DAPI (1:4,000 dilution), and incubated at 4 °C overnight. Subsequently, the fins were washed five times for 1 h with PBS-Triton X-100 and mounted onto glass slides. The fins were then scanned using a confocal microscope (Zeiss, LSM510 META). The scanned images were imported into Fiji and cell outlines in fin mesenchyme were manually traced. The cell elongation ratio was automatically calculated by the macro ‘Tissue Cell Geometry Stats’ included in Fiji.

ROCK inhibitor treatment

To test the function of the PCP pathway in pectoral fin development, skate embryos were treated with Y27632, a ROCK inhibitor, from stage 29 to stage 31 and their fin morphology was examined. ROCK inhibitor (500 µl; stock 50 mM, final 50 µM, Selleck Chemicals) or DMSO solution (negative control) was added to 500 ml of artificial saltwater (Instant Ocean), and five skate embryos at stage 29 for each condition were kept submerged in these solutions. Once the negative control embryos reached stage 31, all embryos were fixed in 4% PFA and their total body length was measured under a stereomicroscope. The embryos were stained with Alcian Blue as previously described¹²¹ (n = 5 per condition).

To locally inhibit the PCP pathway with the ROCK inhibitor, beads soaked in the inhibitor solution (100 μM or 1 mM in DMSO) or DMSO were repeatedly implanted into the anterior pectoral fin from stage 29 (one bead per week, three times in total for each embryo). The embryos were raised up to stage 31 in artificial saltwater, fixed in 4% PFA and stained with Alcian Blue (n = 9 or 10 embryos per condition).

Morphometrics analysis of skate fins

Skate embryos at each stage were photographed from the ventral side. A landmark scheme was designed to capture the shape of the pectoral fin (Extended Data Fig. 10). Six homologous landmarks and three curves were assessed in each sample; curves were used to generate sliding semi-landmarks. The samples were digitized in R using the package Stereomorph¹²². Digitized files were then uploaded to ShinyGM¹²³,¹²⁴, in which all downstream analyses were performed. The samples were aligned using a generalized Procrustes analysis to account for shape differences due to differences in specimen size, specimen orientation and scaling. A morphospace was generated using these aligned landmark coordinates; deformation grids were generated for the control stage 31 and ROCK-inhibited stage 31 samples (Extended Data Fig. 10). A linear model was run to test for the effect of length, treatment and stage on shape. Both treatment and stage were significantly associated with shape (P = 0.002 and P = 0.001, respectively); as expected, total length was not significantly associated with the size-corrected shapes (P = 0.711).
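The idea behind generalized Procrustes alignment can be sketched for two-dimensional landmarks using complex numbers (x + iy). This is a teaching sketch under simplifying assumptions, not the ShinyGM implementation: it uses a fixed number of iterations, does not slide semi-landmarks, and all function names are hypothetical.

```python
import cmath
from typing import List

Shape = List[complex]  # one landmark per entry, stored as x + 1j*y


def center_and_scale(shape: Shape) -> Shape:
    """Remove position and size: centroid at the origin, unit centroid size."""
    centroid = sum(shape) / len(shape)
    centered = [z - centroid for z in shape]
    size = sum(abs(z) ** 2 for z in centered) ** 0.5
    return [z / size for z in centered]


def rotate_onto(shape: Shape, reference: Shape) -> Shape:
    """Rotate a centred shape to minimise its squared distance to the reference."""
    s = sum(z * w.conjugate() for z, w in zip(shape, reference))
    rotation = s.conjugate() / abs(s)  # unit complex number = pure rotation
    return [rotation * z for z in shape]


def generalized_procrustes(shapes: List[Shape], n_iter: int = 10) -> List[Shape]:
    """Iteratively align all shapes to their evolving mean shape."""
    aligned = [center_and_scale(s) for s in shapes]
    reference = aligned[0]
    for _ in range(n_iter):
        aligned = [rotate_onto(s, reference) for s in aligned]
        mean_shape = [sum(points) / len(aligned) for points in zip(*aligned)]
        reference = center_and_scale(mean_shape)
    return aligned


# The same triangle, once as is and once moved, enlarged and rotated
triangle = [0 + 0j, 2 + 0j, 0 + 1j]
moved = [3 * cmath.exp(0.7j) * z + (5 - 2j) for z in triangle]
a, b = generalized_procrustes([triangle, moved])
max_gap = max(abs(p - q) for p, q in zip(a, b))  # near zero: only shape remains
```

After alignment, the residual differences between specimens reflect shape alone, which is what the morphospace and the linear model operate on.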

Transgenic enhancer activity assay

Shark and skate hoxa enhancers were amplified by PCR and cloned into the pCR8/GW/TOPO vector (Invitrogen). A list of the primers is provided in Supplementary Table 14. The cloned enhancers were transferred into the pXIG-cfos-EGFP vector using Gateway LR Clonase II (Invitrogen)⁵³. The resulting vectors were injected into one-cell-stage zebrafish eggs with tol2 mRNA as previously described¹²⁵. The injected embryos were observed under a fluorescence stereomicroscope and photographed at 48 h after fertilization.

Phylome reconstruction

The phylome of L. erinacea, meaning the collection of phylogenetic trees for each protein-coding gene in its genome, was reconstructed using an automated pipeline that mimics the steps that one would take to build a phylogenetic tree and is based on the PhylomeDB pipeline¹²⁶. First, a database with the proteomes (that is, full set of protein-coding genes) of 21 species was built that included L. erinacea (a full list of species included is provided in Supplementary Table 1). A BLASTp search was then performed against this database starting from each of the proteins included in the L. erinacea genome. BLAST results were filtered using an e-value threshold of 1 × 10⁻⁵ and a query sequence overlap threshold of 50%. The number of hits was limited to the best 250 hits for each protein. A multiple sequence alignment was performed for each set of homologous sequences. Three different programs were used to build the alignments (Muscle (v.3.8.1551)¹²⁷, mafft (v.7.407)¹²⁸ and kalign (v.2.04)¹²⁹) and the alignments were performed in forward and in reverse, resulting in six different alignments. From this group of alignments, a consensus alignment was obtained using M-coffee from the T-coffee package (v.12.0)¹³⁰. Alignments were then trimmed using trimAl v1.4.rev15 (consistency-score cut-off 0.1667, gap-score cut-off 0.9)¹³¹. IQTREE (v.1.6.9)¹³² was then used to reconstruct a maximum-likelihood phylogenetic tree. Model selection was limited to five models (DCmut, JTTDCMut, LG, WAG, VT) with FreeRate categories set to vary between 4 and 10. The best model according to the BIC was used. Then, 1,000 rapid bootstrap replicates were calculated. A second phylome starting from D. rerio was also reconstructed according to the same approach. All trees and alignments are stored in PhylomeDB¹²⁶ with phylomeIDs 247 for the L. erinacea phylome and 275 for the D. rerio phylome (http://phylomedb.org).
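The homologue-selection step (e-value threshold, query overlap and a cap of 250 hits) could be expressed as follows. This is a sketch rather than the PhylomeDB pipeline code. It assumes hits were exported with the BLAST+ tabular fields qseqid, sseqid, evalue, qstart, qend and qlen, that overlap is measured on a single alignment per hit, that the thresholds are inclusive, and that "best" means lowest e-value.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, NamedTuple


class Hit(NamedTuple):
    query: str
    subject: str
    evalue: float
    qstart: int
    qend: int
    qlen: int


def filter_hits(hits: Iterable[Hit], max_evalue: float = 1e-5,
                min_overlap: float = 0.5, max_hits: int = 250) -> Dict[str, List[Hit]]:
    """Keep hits passing both thresholds, then the best `max_hits` per query."""
    per_query: Dict[str, List[Hit]] = defaultdict(list)
    for h in hits:
        overlap = (h.qend - h.qstart + 1) / h.qlen  # fraction of the query aligned
        if h.evalue <= max_evalue and overlap >= min_overlap:
            per_query[h.query].append(h)
    return {q: sorted(hs, key=lambda h: h.evalue)[:max_hits]
            for q, hs in per_query.items()}


# Invented hits for one query protein
raw = [
    Hit("q1", "a", 1e-30, 1, 100, 100),   # kept
    Hit("q1", "b", 1e-3, 1, 100, 100),    # e-value too high
    Hit("q1", "c", 1e-10, 1, 40, 100),    # covers only 40% of the query
    Hit("q1", "d", 1e-8, 11, 80, 100),    # kept (70% of the query)
]
selected = filter_hits(raw)
subjects = [h.subject for h in selected["q1"]]
```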

Species tree reconstruction

A species tree was reconstructed using a gene concatenation approach. The trimmed alignments of 102 protein families with a single orthologue per species were concatenated into a single multiple-sequence alignment. IQTREE¹³² was then used to reconstruct the species tree using the same parameters as above. The final alignment contained 48,958 positions. The model selected for tree reconstruction was JTTDCMut+F+R5. Moreover, duptree¹³³ was used to reconstruct a second species tree using a super tree method. Duptree searches for the species tree that minimizes the number of duplications inferred when each gene is reconciled with the species tree. All trees built during the phylome reconstruction process were used to reconstruct this species tree. The two topologies were fully congruent.
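The concatenation step amounts to joining the per-family trimmed alignments end to end for each species. A minimal sketch, assuming each alignment is held as a dictionary from species name to aligned sequence and that every family has exactly one sequence per species (function and variable names are hypothetical):

```python
from typing import Dict, List


def concatenate(alignments: List[Dict[str, str]]) -> Dict[str, str]:
    """Join single-copy family alignments into one supermatrix per species."""
    species = sorted(alignments[0])
    parts: Dict[str, List[str]] = {sp: [] for sp in species}
    for aln in alignments:
        if sorted(aln) != species:
            raise ValueError("every family needs exactly one sequence per species")
        if len({len(seq) for seq in aln.values()}) != 1:
            raise ValueError("sequences within an alignment must have equal length")
        for sp in species:
            parts[sp].append(aln[sp])
    return {sp: "".join(chunks) for sp, chunks in parts.items()}


# Two invented toy families for two species
family1 = {"skate": "MK-L", "gar": "MKQL"}
family2 = {"skate": "AV", "gar": "A-"}
supermatrix = concatenate([family1, family2])
n_positions = len(supermatrix["skate"])
```

In the analysis described above, the equivalent of `n_positions` is the reported 48,958 positions across 102 protein families.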

Skate MethylC-seq library preparation

MethylC-seq library preparation was performed as described previously¹³⁴. In brief, 1,000 ng of genomic DNA extracted from the embryonic stage 31 and adult skate pelvic and pectoral fins was spiked with unmethylated λ phage DNA (Promega). DNA was sonicated to ~300 bp fragments using the M220 focused ultrasonicator (Covaris) with the following parameters: peak incident power, 50 W; duty factor, 20%; cycles per burst, 200; treatment time, 75 s. Sonicated DNA was then purified, end-repaired using the End-It DNA End-Repair Kit (Lucigen) and A-tailed using Klenow fragment (3′→5′ exo-) (New England Biolabs) followed by the ligation of NEXTFLEX Bisulfite-Seq Adapters. Bisulfite conversion of adaptor-ligated DNA was performed using the EZ DNA Methylation-Gold Kit (Zymo Research). Library amplification was performed using KAPA HiFi HotStart Uracil+ DNA polymerase (Kapa Biosystems). Library size was determined using the Agilent 4200 Tapestation system. The libraries were quantified using the KAPA library quantification kit (Roche).

Skate methylome data analysis

Embryonic stage 31 and adult skate pelvic and pectoral fin DNA methylome libraries were sequenced on the Illumina HiSeq X platform (150 bp, paired-end). Elephant shark C. milii raw whole genome bisulphite sequencing data (adult liver) were downloaded from NCBI BioProject (PRJNA379367)¹³⁵. Zebrafish D. rerio raw whole genome bisulphite sequencing data (adult liver) were downloaded from the GEO (GSE122723)¹³⁶. Sequenced reads in FASTQ format were trimmed using the Trimmomatic software (ILLUMINACLIP:adapter.fa:2:30:10 SLIDINGWINDOW:5:20 LEADING:3 TRAILING:3 MINLEN:50). Trimmed reads were mapped to the Leri_hhj.fasta genome reference (containing the lambda genome as chrLambda) using WALT¹³⁷ with the following settings: -m 10 -t 24 -N 10000000 -L 2000. Mapped reads in SAM format were converted to BAM format; BAM files were sorted and indexed using SAMtools¹³⁸. Duplicate reads were removed using Picard Tools (v.2.3.0; http://broadinstitute.github.io/picard/). Genotype and methylation bias correction were performed using MethylDackel (MethylDackel extract Leri_hhj_lambda.fasta $input_bam -o $output --mergeContext --minOppositeDepth 5 --maxVariantFrac 0.5 --OT 10,110,10,110 --OB 40,140,40,140) (https://github.com/dpryan79/MethylDackel). Methylated and unmethylated calls at each genomic CpG position were determined using MethylDackel (MethylDackel extract Leri_hhj_lambda.fasta $input_bam -o output --mergeContext). DNA methylation profiles at differentially accessible ATAC–seq peaks between embryonic pelvic and pectoral fin samples were generated using deepTools2 computeMatrix reference-point and plotHeatmap¹³⁹.
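Because the reference includes the unmethylated lambda genome as chrLambda, per-CpG calls can also be used to estimate the bisulfite non-conversion rate. The text does not describe this calculation, so the following is only an illustrative sketch. It assumes a tab-separated bedGraph-style layout with the columns chromosome, start, end, methylation percentage, methylated count and unmethylated count, preceded by a track line; the example lines are invented.

```python
from typing import Iterable


def methylation_fraction(lines: Iterable[str], chrom: str) -> float:
    """Pooled methylated / (methylated + unmethylated) calls on one chromosome."""
    meth = unmeth = 0
    for line in lines:
        if line.startswith("track") or not line.strip():
            continue  # skip the header line and blank lines
        fields = line.rstrip("\n").split("\t")
        if fields[0] != chrom:
            continue
        meth += int(fields[4])    # assumed column: reads supporting methylation
        unmeth += int(fields[5])  # assumed column: reads supporting no methylation
    total = meth + unmeth
    if total == 0:
        raise ValueError(f"no calls found on {chrom}")
    return meth / total


# Invented example lines
calls = [
    "track type=bedGraph",
    "chrLambda\t10\t12\t0\t0\t50",
    "chrLambda\t30\t32\t2\t1\t49",
    "chr1\t100\t102\t90\t45\t5",
]
non_conversion = methylation_fraction(calls, "chrLambda")  # residual signal on unmethylated DNA
genome_level = methylation_fraction(calls, "chr1")
```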

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Raw and processed sequencing data were deposited at the Gene Expression Omnibus (GEO; GSE188980 and GSE190730) and SRA (PRJNA783899). Mouse hindlimb RNA-seq data used for comparative analyses are publicly available at the GEO (GSE104459) and mouse forelimb RNA-seq data at the GEO (GSE136437). Elephant shark and zebrafish bisulphite sequencing data used for comparison were downloaded from NCBI BioProject (PRJNA379367) and the GEO (GSE122723)¹³⁶, respectively. Skate RNA-seq data are publicly available at NCBI BioProject (PRJNA288370 and PRJNA686126).

Code availability

Code used is available at GitLab (https://gitlab.com/skategenome).

References

Nakamura, T. et al. Molecular mechanisms underlying the exceptional adaptations of batoid fins. Proc. Natl Acad. Sci. USA 112, 15940–15945 (2015).
Turner, N. et al. The evolutionary origins and diversity of the neuromuscular system of paired appendages in batoids. Proc. Biol. Sci. 286, 20191571 (2019).
Shimeld, S. M. & Holland, P. W. Vertebrate innovations. Proc. Natl Acad. Sci. USA 97, 4449–4452 (2000).
Simakov, O. et al. Deeply conserved synteny and the evolution of metazoan chromosomes. Sci. Adv. 8, eabi5884 (2022).
Touceda-Suárez, M. et al. Ancient genomic regulatory blocks are a source for regulatory gene deserts in vertebrates after whole genome duplications. Mol. Biol. Evol. https://doi.org/10.1093/molbev/msaa123 (2020).
Marlétaz, F. et al. Amphioxus functional genomics and the origins of vertebrate gene regulation. Nature 564, 64–70 (2018).
Dixon, J. R. et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485, 376–380 (2012).
Nora, E. P. et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature 485, 381–385 (2012).
Berthelot, C., Muffato, M., Abecassis, J. & Roest Crollius, H. The 3D organization of chromatin explains evolutionary fragile genomic regions. Cell Rep. 10, 1913–1924 (2015).
Lupiáñez, D. G. et al. Disruptions of topological chromatin domains cause pathogenic rewiring of gene-enhancer interactions. Cell 161, 1012–1025 (2015).
Real, F. M. et al. The mole genome reveals regulatory rearrangements associated with adaptive intersexuality. Science https://doi.org/10.1126/science.aaz2582 (2020).
Acemel, R. D. & Gómez-Skarmeta, J. L. Reprogramming nuclear architecture: just a TAD. Cell Stem Cell 24, 679–681 (2019).
Rowley, M. J. & Corces, V. G. Organizational principles of 3D genome architecture. Nat. Rev. Genet. 19, 789–800 (2018).
Stingo, V. & Rocco, L. Selachian cytogenetics: a review. Genetica 111, 329–347 (2001).
Hirschberger, C., Sleight, V. A., Criswell, K. E., Clark, S. J. & Gillis, J. A. Conserved and unique transcriptional features of pharyngeal arches in the skate (Leucoraja erinacea) and evolution of the jaw. Mol. Biol. Evol. 38, 4187–4204 (2021).
Chorostecki, U., Molina, M., Pryszcz, L. P. & Gabaldón, T. MetaPhOrs 2.0: integrative, phylogeny-based inference of orthology and paralogy across the tree of life. Nucleic Acids Res. 48, W553–W557 (2020).
Fuentes, D. et al. PhylomeDB V5: an expanding repository for genome-wide catalogues of annotated gene phylogenies. Nucleic Acids Res. 50, D1062–D1068 (2021).
Irisarri, I. et al. Phylotranscriptomic consolidation of the jawed vertebrate timetree. Nat. Ecol. Evol. 1, 1370–1378 (2017).
Hara, Y. et al. Shark genomes provide insights into elasmobranch evolution and the origin of vertebrates. Nat. Ecol. Evol. 2, 1761–1771 (2018).
Kuraku, S. Shark and ray genomics for disentangling their morphological diversity and vertebrate evolution. Dev. Biol. 477, 262–272 (2021).
Duret, L. & Galtier, N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu. Rev. Genomics Hum. Genet. 10, 285–311 (2009).
Perry, B. W., Schield, D. R., Adams, R. H. & Castoe, T. A. Microchromosomes exhibit distinct features of vertebrate chromosome structure and function with underappreciated ramifications for genome evolution. Mol. Biol. Evol. 38, 904–910 (2021).
Simakov, O. et al. Deeply conserved synteny resolves early events in vertebrate evolution. Nat. Ecol. Evol. 4, 820–830 (2020).
Nakatani, Y., Takeda, H., Kohara, Y. & Morishita, S. Reconstruction of the vertebrate ancestral genome reveals dynamic genome reorganization in early vertebrates. Genome Res. 17, 1254–1265 (2007).
Nakatani, Y. et al. Reconstruction of proto-vertebrate, proto-cyclostome and proto-gnathostome genomes provides new insights into early vertebrate evolution. Nat. Commun. 12, 4489 (2021).
Thompson, A. W. et al. The bowfin genome illuminates the developmental evolution of ray-finned fishes. Nat. Genet. 53, 1373–1384 (2021).
Dalloul, R. A. et al. Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo): genome assembly and analysis. PLoS Biol. 8, e1000475 (2010).
Zhang, Y. et al. The white-spotted bamboo shark genome reveals chromosome rearrangements and fast-evolving immune genes of cartilaginous fish. iScience 23, 101754 (2020).
Mitros, T. et al. A chromosome-scale genome assembly and dense genetic map for Xenopus tropicalis. Dev. Biol. 452, 8–20 (2019).
Hoencamp, C. et al. 3D genomics across the tree of life reveals condensin II as a determinant of architecture type. Science 372, 984–989 (2021).
Rowley, M. J. et al. Evolutionarily conserved principles predict 3D chromatin organization. Mol. Cell 67, 837–852 (2017).
Acemel, R. D. et al. A single three-dimensional chromatin compartment in amphioxus indicates a stepwise evolution of vertebrate Hox bimodal regulation. Nat. Genet. 48, 336–341 (2016).
Gibson-Brown, J. J. et al. Evidence of a role for T-box genes in the evolution of limb morphogenesis and the specification of forelimb/hindlimb identity. Mech. Dev. 56, 93–101 (1996).
Pradeepa, M. M., Sutherland, H. G., Ule, J., Grimes, G. R. & Bickmore, W. A. Psip1/Ledgf p52 binds methylated histone H3K36 and splicing factors and contributes to the regulation of alternative splicing. PLoS Genet. 8, e1002717 (2012).
Onimaru, K. et al. Developmental hourglass and heterochronic shifts in fin and limb development. eLife 10, e62865 (2021).
Wang, J. S., Infante, C. R., Park, S. & Menke, D. B. PITX1 promotes chondrogenesis and myogenesis in mouse hindlimbs through conserved regulatory targets. Dev. Biol. 434, 186–195 (2018).
DeLaurier, A., Schweitzer, R. & Logan, M. Pitx1 determines the morphology of muscle, tendon, and bones of the hindlimb. Dev. Biol. 299, 22–34 (2006).
Swenson, J. D., Klomp, J., Fisher, R. A. & Crow, K. D. How the devil ray got its horns: the evolution and development of cephalic lobes in myliobatid stingrays (Batoidea: Myliobatidae). Front. Ecol. Evol. 6, 181 (2018).
Barry, S. N. & Crow, K. D. The role of HoxA11 and HoxA13 in the evolution of novel fin morphologies in a representative batoid (Leucoraja erinacea). Evodevo 8, 24 (2017).
Lopez-Rios, J. et al. GLI3 constrains digit number by controlling both progenitor proliferation and BMP-dependent exit to chondrogenesis. Dev. Cell 22, 837–848 (2012).
Tanaka, M. Fins into limbs: autopod acquisition and anterior elements reduction by modifying gene networks involving 5′Hox, Gli3, and Shh. Dev. Biol. 413, 1–7 (2016).
Bastida, M. F. et al. The formation of the thumb requires direct modulation of Gli3 transcription by Hoxa13. Proc. Natl Acad. Sci. USA 117, 1090–1096 (2020).
Amin, S. et al. Hoxa2 selectively enhances Meis binding to change a branchial arch ground state. Dev. Cell 32, 265–277 (2015).
Fromental-Ramain, C. et al. Hoxa-13 and Hoxd-13 play a crucial role in the patterning of the limb autopod. Development 122, 2997–3011 (1996).
Sheth, R. et al. Distal limb patterning requires modulation of cis-regulatory activities by HOX13. Cell Rep. 17, 2913–2926 (2016).
Nakamura, T., Gehrke, A. R., Lemberg, J., Szymaszek, J. & Shubin, N. H. Digits and fin rays share common developmental histories. Nature 537, 225–228 (2016).
Freitas, R., Gómez-Marín, C., Wilson, J. M., Casares, F. & Gómez-Skarmeta, J. L. Hoxd13 contribution to the evolution of vertebrate appendages. Dev. Cell 23, 1219–1229 (2012).
Letelier, J. et al. The Shh/Gli3 gene regulatory network precedes the origin of paired fins and reveals the deep homology between distal fins and digits. Proc. Natl Acad. Sci. USA 118, e2100575118 (2021).
Bogdanović, O. et al. Active DNA demethylation at enhancers during the vertebrate phylotypic period. Nat. Genet. 48, 417–426 (2016).
Hon, G. C. et al. Epigenetic memory at embryonic enhancers identified in DNA methylation maps from adult mouse tissues. Nat. Genet. 45, 1198–1206 (2013).
Kragesteen, B. K. et al. Dynamic 3D chromatin architecture contributes to enhancer specificity and limb morphogenesis. Nat. Genet. 50, 1463–1473 (2018).
Venkatesh, B. et al. Elephant shark genome provides unique insights into gnathostome evolution. Nature 505, 174–179 (2014).
Krefting, J., Andrade-Navarro, M. A. & Ibn-Salem, J. Evolutionary stability of topologically associating domains is associated with conserved gene regulation. BMC Biol. 16, 87 (2018).
Schenkelaars, Q., Fierro-Constain, L., Renard, E. & Borchiellini, C. Retracing the path of planar cell polarity. BMC Evol. Biol. 16, 69 (2016).
Maxwell, E. E., Fröbisch, N. B. & Heppleston, A. C. Variability and conservation in late chondrichthyan development: ontogeny of the winter skate (Leucoraja ocellata). Anat. Rec. 291, 1079–1087 (2008).
Carrier, J. C., Musick, J. A. & Heithaus, M. R. Biology of Sharks and Their Relatives 2nd edn (CRC Press, 2012).
Kvon, E. Z. et al. Progressive loss of function in a limb enhancer during snake evolution. Cell 167, 633–642 (2016).
Leal, F. & Cohn, M. J. Loss and re-emergence of legs in snakes by modular evolution of Sonic hedgehog and HOXD enhancers. Curr. Biol. 26, 2966–2973 (2016).
Lopez-Rios, J. et al. Attenuated sensing of SHH by Ptch1 underlies evolution of bovine limbs. Nature 511, 46–51 (2014).
Enny, A., Flaherty, K., Mori, S., Turner, N. & Nakamura, T. Developmental constraints on fin diversity. Dev. Growth Differ. 62, 311–325 (2020).
Gehrke, A. R. et al. Deep conservation of wrist and digit enhancers in fish. Proc. Natl Acad. Sci. USA 112, 803–808 (2015).
Ranallo-Benavidez, T. R., Jaron, K. S. & Schatz, M. C. GenomeScope 2.0 and Smudgeplot for reference-free profiling of polyploid genomes. Nat. Commun. 11, 1432 (2020).
Ye, C., Hill, C. M., Wu, S., Ruan, J. & Ma, Z. S. DBG2OLC: efficient assembly of large genomes using long erroneous reads of the third generation sequencing technologies. Sci. Rep. 6, 31900 (2016).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Roach, M. J., Schmidt, S. A. & Borneman, A. R. Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies. BMC Bioinform. 19, 460 (2018).
Putnam, N. H. et al. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 26, 342–350 (2016).
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Kerpedjiev, P. et al. HiGlass: web-based visual exploration and analysis of genome interaction maps. Genome Biol. 19, 125 (2018).
English, A. C. et al. Mind the gap: upgrading genomes with Pacific Biosciences RS long-read sequencing technology. PLoS ONE 7, e47768 (2012).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Niknafs, Y. S., Pandian, B., Iyer, H. K., Chinnaiyan, A. M. & Iyer, M. K. TACO produces robust multisample transcriptome assemblies from RNA-seq. Nat. Methods 14, 68–70 (2017).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Wu, T. D., Reeder, J., Lawrence, M., Becker, G. & Brauer, M. J. GMAP and GSNAP for genomic sequence alignment: enhancements to speed, accuracy, and functionality. Methods Mol. Biol. 1418, 283–334 (2016).
Venturini, L., Caim, S., Kaithakottil, G. G., Mapleson, D. L. & Swarbreck, D. Leveraging multiple transcriptome assembly methods for improved gene structure annotation. Gigascience 7, giy093 (2018).
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–W439 (2006).
Mapleson, D., Venturini, L., Kaithakottil, G. & Swarbreck, D. Efficient and accurate detection of splice junctions from RNA-seq with Portcullis. Gigascience 7, giy131 (2018).
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinform. 6, 31 (2005).
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).
Roth, A. C. J., Gonnet, G. H. & Dessimoz, C. Algorithm of OMA for large-scale orthology inference. BMC Bioinform. 9, 518 (2008).
Marlétaz, F., Peijnenburg, K. T. C. A., Goto, T., Satoh, N. & Rokhsar, D. S. A new spiralian phylogeny places the enigmatic arrow worms among gnathiferans. Curr. Biol. 29, 312–318 (2019).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Katoh, K., Misawa, K., Kuma, K.-I. & Miyata, T. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059–3066 (2002).
Criscuolo, A. & Gribaldo, S. BMGE (Block Mapping and Gathering with Entropy): a new software for selection of phylogenetic informative regions from multiple sequence alignments. BMC Evol. Biol. 10, 210 (2010).
Lartillot, N., Lepage, T. & Blanquart, S. PhyloBayes 3: a Bayesian software package for phylogenetic reconstruction and molecular dating. Bioinformatics 25, 2286–2288 (2009).
Benton, M. J., Donoghue, P. C. J. & Asher, R. J. in The Timetree Of Life (ed. Kumar, S. B. H.) 35–86 (Oxford Univ. Press, 2009).
Steinegger, M. & Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 35, 1026–1028 (2017).
Derelle, R., Philippe, H. & Colbourne, J. K. Broccoli: combining phylogenetic and network analyses for orthology assignment. Mol. Biol. Evol. 37, 3389–3396 (2020).
Morel, B., Kozlov, A. M., Stamatakis, A. & Szöllősi, G. J. GeneRax: a tool for species-tree-aware maximum likelihood-based gene family tree inference under gene duplication, transfer, and loss. Mol. Biol. Evol. 37, 2763–2774 (2020).
Belaghzal, H., Dekker, J. & Gibcus, J. H. Hi-C 2.0: an optimized Hi-C procedure for high-resolution genome-wide mapping of chromosome conformation. Methods 123, 56–65 (2017).
Franke, M. et al. CTCF knockout in zebrafish induces alterations in regulatory landscapes and developmental gene expression. Nat. Commun. 12, 5415 (2021).
Rao, S. S. P. et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell 159, 1665–1680 (2014).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Durand, N. C. et al. Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst. 3, 95–98 (2016).
Kruse, K., Hug, C. B. & Vaquerizas, J. M. FAN-C: a feature-rich framework for the analysis and visualisation of chromosome conformation capture data. Genome Biol. 21, 303 (2020).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Crane, E. et al. Condensin-driven remodelling of X chromosome topology during dosage compensation. Nature 523, 240–244 (2015).
Frith, M. C. et al. Detection of functional DNA motifs via statistical over-representation. Nucleic Acids Res. 32, 1372–1381 (2004).
Grant, C. E., Bailey, T. L. & Noble, W. S. FIMO: scanning for occurrences of a given motif. Bioinformatics 27, 1017–1018 (2011).
Barrows, T. C. A. profileplyr (Bioconductor, 2019); https://doi.org/10.18129/B9.BIOC.PROFILEPLYR
Wolff, J., Backofen, R. & Grüning, B. Loop detection using Hi-C data with HiCExplorer. Gigascience 11, giac061 (2022).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Mumbach, M. R. et al. HiChIP: efficient and sensitive analysis of protein-directed genome architecture. Nat. Methods 13, 919–922 (2016).
Serra, F. et al. Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors. PLoS Comput. Biol. 13, e1005665 (2017).
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
Bhattacharyya, S., Chandra, V., Vijayanand, P. & Ay, F. Identification of significant chromatin contacts from HiChIP data by FitHiChIP. Nat. Commun. 10, 4221 (2019).
Lawrence, M. et al. Software for computing and annotating genomic ranges. PLoS Comput. Biol. 9, e1003118 (2013).
Ewels, P. A. et al. The nf-core framework for community-curated bioinformatics pipelines. Nat. Biotechnol. 38, 276–278 (2020).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Alexa, A. & Rahnenfuhrer, J. topGO (Bioconductor, 2017); https://doi.org/10.18129/B9.BIOC.TOPGO
Fernández-Miñán, A., Bessa, J., Tena, J. J. & Gómez-Skarmeta, J. L. Assay for transposase-accessible chromatin and circularized chromosome conformation capture, two methods to explore the regulatory landscapes of genes in zebrafish. Methods Cell. Biol. 135, 413–430 (2016).
Di Tommaso, P. et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 35, 316–319 (2017).
Irimia, M. et al. Extensive conservation of ancient microsynteny across metazoans due to cis-regulatory constraints. Genome Res. 22, 2356–2367 (2012).
Harris, R. S. Improved Pairwise Alignment of Genomic DNA. PhD thesis, Pennsylvania State Univ. (2007).
Hiller, M. et al. Computational methods to detect conserved non-genic elements in phylogenetically isolated genomes: application to zebrafish. Nucleic Acids Res. 41, e151 (2013).
Kent, W. J., Baertsch, R., Hinrichs, A., Miller, W. & Haussler, D. Evolution’s cauldron: duplication, deletion, and rearrangement in the mouse and human genomes. Proc. Natl Acad. Sci. USA 100, 11484–11489 (2003).
Suarez, H. G., Langer, B. E., Ladde, P. & Hiller, M. chainCleaner improves genome alignment specificity and sensitivity. Bioinformatics 33, 1596–1603 (2017).
Yu, G. & He, Q.-Y. ReactomePA: an R/Bioconductor package for reactome pathway analysis and visualization. Mol. Biosyst. 12, 477–479 (2016).
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Dahn, R. D., Davis, M. C., Pappano, W. N. & Shubin, N. H. Sonic hedgehog function in chondrichthyan fins and the evolution of appendage patterning. Nature 445, 311–314 (2006).
Olsen, A. M. & Westneat, M. W. StereoMorph: an R package for the collection of 3D landmarks and curves using a stereo camera set‐up. Methods Ecol. Evol. 6, 351–356 (2015).
Baken, E. K., Collyer, M. L., Kaliontzopoulou, A. & Adams, D. C. geomorph v4.0 and gmShiny: Enhanced analytics and a new graphical interface for a comprehensive morphometric experience. Methods Ecol. Evol. 12, 2355–2363 (2021).
Adams, D., Collyer, M., Kaliontzopoulou, A. & Baken, E. geomorph: geometric morphometric analyses of 2D/3D landmark data. R package version 4.0.1 (2021).
Suster, M. L., Abe, G., Schouw, A. & Kawakami, K. Transposon-mediated BAC transgenesis in zebrafish. Nat. Protoc. 6, 1998–2021 (2011).
Huerta-Cepas, J., Capella-Gutierrez, S., Pryszcz, L. P., Marcet-Houben, M. & Gabaldon, T. PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome. Nucleic Acids Res. 42, D897–D902 (2014).
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Katoh, K., Kuma, K., Toh, H. & Miyata, T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511–518 (2005).
Lassmann, T. & Sonnhammer, E. L. Kalign—an accurate and fast multiple sequence alignment algorithm. BMC Bioinform. 6, 298 (2005).
Wallace, I. M., O’Sullivan, O., Higgins, D. G. & Notredame, C. M-Coffee: combining multiple sequence alignment methods with T-Coffee. Nucleic Acids Res. 34, 1692–1699 (2006).
Capella-Gutierrez, S., Silla-Martinez, J. M. & Gabaldon, T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Nguyen, L.-T., Schmidt, H. A., von Haeseler, A. & Minh, B. Q. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 32, 268–274 (2015).
Wehe, A., Bansal, M. S., Burleigh, J. G. & Eulenstein, O. DupTree: a program for large-scale phylogenetic analyses using gene tree parsimony. Bioinformatics 24, 1540–1541 (2008).
Urich, M. A., Nery, J. R., Lister, R., Schmitz, R. J. & Ecker, J. R. MethylC-seq library preparation for base-resolution whole-genome bisulfite sequencing. Nat. Protoc. 10, 475–483 (2015).
Peat, J. R., Ortega-Recalde, O., Kardailsky, O. & Hore, T. A. The elephant shark methylome reveals conservation of epigenetic regulation across jawed vertebrates. F1000Research 6, 526 (2017).
Skvortsova, K. et al. Retention of paternal DNA methylome in the developing zebrafish germline. Nat. Commun. 10, 3054 (2019).
Chen, H., Smith, A. D. & Chen, T. WALT: fast and accurate read mapping for bisulfite sequencing. Bioinformatics 32, 3507–3509 (2016).
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Ramírez, F., Dündar, F., Diehl, S., Grüning, B. A. & Manke, T. deepTools: a flexible platform for exploring deep-sequencing data. Nucleic Acids Res. 42, W187–W191 (2014).

Acknowledgements

We thank R. Schneider, D. Sherwood and the staff of the Marine Biological Laboratory Embryology course for providing laboratory space; L. Bertrand and the staff at Leica Microsystems for microscopy support; D. Remsen, S. Bennett, D. Calzarette, and the staff of the Marine Biological Laboratory and MBL Marine Resources Center for technical and animal husbandry assistance; A. Gillis for support and advice with RNA-seq and skate functional experiments; and A. Shindo for technical support with image analysis. T.N., D.N. and A.A. were funded by institutional support from the Rutgers University School of Arts and Sciences and the Human Genetics Institute of New Jersey, a Whitman Center Fellowship (Marine Biological Laboratory) and the National Science Foundation under grant no. 2210072. D.N. was further supported by the NIH-IRACDA funded INSPIRE program at Rutgers University; F.M. and D.S.R. by funding from the Okinawa Institute for Science and Technology; D.S.R. by the Marthella Foskett-Brown Chair in Computational Biology; D.G.L. and R.D.A. by a grant from the Deutsche Forschungsgemeinschaft (LU 242672-1) and by a Helmholtz ERC Recognition Award grant from the Helmholtz-Gemeinschaft (ERC-RA1045 0033); R.D.A. and C.P. by EMBO Postdoctoral Fellowships (EMBO ALTF 537-2020 and ALTF 346-2020, respectively); P.M.M.G. by a postdoctoral fellowship from Junta de Andalucía (DOC_00397); J.J.T. and J.L.G.-S. by the European Research Council (ERC, grant no. 740041) and the Spanish Ministerio de Economía y Competitividad (grant no. PID2019-103921GB-I00); J.D. by the NIH grant HG003143; F.M. by the Royal Society (URF\R1\191161); V.A.S. by a Wolfson College Junior Research Fellowship and Marine Biological Laboratory Whitman Early Career Fellowship; J.L.-R. by the Spanish Ministerio de Ciencia e Innovacion (PID2020-113497GB-I00); and A.V. and F.D. by NIH grants R01DE028599 and R01HG003988. Research conducted at the E.O. Lawrence Berkeley National Laboratory was performed under US Department of Energy contract DE-AC02-05CH11231, University of California. J.D. is an investigator of the Howard Hughes Medical Institute.

Author information

Fabrice Darbellay
Present address: Department of Genetic Medicine and Development, Faculty of Medicine, University of Geneva, Geneva, Switzerland
These authors contributed equally: Ferdinand Marlétaz, Elisa de la Calle-Mustienes, Rafael D. Acemel, Christina Paliou
These authors jointly supervised this work: Tetsuya Nakamura, Juan J. Tena, Darío G. Lupiáñez, Daniel S. Rokhsar, José Luis Gómez-Skarmeta
Deceased: José Luis Gómez-Skarmeta

Authors and Affiliations

Centre for Life’s Origin and Evolution, Department of Genetics, Evolution and Environment, University College London, London, UK
Ferdinand Marlétaz
Molecular Genetics Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan
Ferdinand Marlétaz & Daniel S. Rokhsar
Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas/Universidad Pablo de Olavide/Junta de Andalucía, Seville, Spain
Elisa de la Calle-Mustienes, Rafael D. Acemel, Christina Paliou, Silvia Naranjo, Pedro Manuel Martínez-García, Ildefonso Cases, Lourdes Gallardo-Fuentes, Ismael Sospedra, Javier Lopez-Rios, Juan J. Tena & José Luis Gómez-Skarmeta
Epigenetics and Sex Development Group, Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany
Rafael D. Acemel & Darío G. Lupiáñez
Department of Zoology, University of Cambridge, Cambridge, UK
Victoria A. Sleight & Christine Hirschberger
School of Biological Sciences, University of Aberdeen, Aberdeen, UK
Victoria A. Sleight
Barcelona Supercomputing Centre (BSC-CNS), Barcelona, Spain
Marina Marcet-Houben & Toni Gabaldón
Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
Marina Marcet-Houben & Toni Gabaldón
Department of Genetics, Rutgers the State University of New Jersey, Piscataway, NJ, USA
Dina Navon, Ali Andrescavage & Tetsuya Nakamura
Genomics and Epigenetics Division, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
Ksenia Skvortsova, Paul Edward Duckett, Álvaro González-Rajal & Ozren Bogdanovic
Faculty of Medicine, St Vincent’s Clinical School, University of New South Wales, Sydney, New South Wales, Australia
Ksenia Skvortsova & Álvaro González-Rajal
School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
Ozren Bogdanovic
Department of Systems Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
Johan H. Gibcus, Liyan Yang & Job Dekker
Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Fabrice Darbellay & Axel Visel
US Department of Energy Joint Genome Institute, Berkeley, CA, USA
Axel Visel
School of Natural Sciences, University of California, Merced, CA, USA
Axel Visel
Howard Hughes Medical Institute, Chevy Chase, MD, USA
Job Dekker
Department of Organismal Biology and Anatomy, University of Chicago, Chicago, IL, USA
Neil Shubin
Catalan Institution for Research and Advanced Studies (ICREA), Barcelona, Spain
Toni Gabaldón
CIBER de Enfermedades Infecciosas, Instituto de Salud Carlos III, Madrid, Spain
Toni Gabaldón
Department of Molecular and Cell Biology, University of California, Berkeley, CA, USA
Daniel S. Rokhsar
Chan-Zuckerberg Biohub, San Francisco, CA, USA
Daniel S. Rokhsar

Contributions

J.L.G.-S., F.M., J.J.T., T.N., D.S.R. and D.G.L. conceived the study and designed the experiments. F.M., E.d.l.C.-M., N.S. and J.L.G.-S. coordinated the sequencing of the little skate genome and F.M. assembled and annotated the genome. R.D.A., P.M.M.-G., L.Y., J.H.G., J.D. and D.G.L. performed analyses on 3D chromatin organization. F.M. and D.S.R. designed and performed synteny and comparative analyses. M.M.H., F.M. and T.G. performed phylogenetic and phylogenomic analyses. E.d.l.C.-M., C.P., S.N., R.D.A., J.J.T., I.C., L.G.-F., I.S. and J.L.-R. performed and analysed transgenics, ATAC–seq, RNA-seq and Hi-C experiments. V.A.S. and C.H. performed and analysed additional RNA-seq experiments. F.D. and A.V. performed additional functional assays. K.S., P.E.D., A.G.-R. and O.B. performed and analysed DNA methylation experiments. D.N., A.A. and T.N. conducted embryonic experiments of skates and sharks. J.L.G.-S., J.J.T., F.M., D.S.R. and D.G.L. wrote the manuscript with input from all of the authors.

Corresponding authors

Correspondence to Ferdinand Marlétaz, Tetsuya Nakamura, Juan J. Tena, Darío G. Lupiáñez or Daniel S. Rokhsar.

Ethics declarations

Competing interests

J.D. is on the scientific advisory board of Arima Genomics and of Omega Therapeutics. The other authors declare no competing interests.

Peer review

Peer review information

Nature thanks Chris Amemiya and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Characteristics of skate chromosomes, repeat content and skate genome.

a–c, Characteristics and classification of skate chromosomes according to their size (x-axis) and GC% (a), number of LINE insertions (b) and gene density (c) per 50 kb window. d, Repetitive landscape computed as JC divergence of repeat occurrence toward the consensus element in the repeat library. e–f, Distribution of gene and intron size in selected chordate species: amphioxus (Branchiostoma floridae, Bralan), the cloudy catshark (Chiloscyllium punctatum, Chipun), the little skate (Leucoraja erinacea, Leueri), the zebrafish (Danio rerio, Danrer) and human (Homo sapiens, Homsap). g, Gene size distribution in three chromosomal categories. h, Distribution of retention rates inferred for CLGs in the spotted gar (Lepisosteus oculatus). e–h, Mean (dot) and standard deviation (bar) are indicated within the violin plot area. i, Rates of evolution of genes located in α or β segments estimated as ML distance to the amphioxus outgroup (LG+Γ).

Extended Data Fig. 2 Chromosomal architecture and synteny conservation in cartilaginous and bony fishes.

a, Syntenic orthology relationship between skate and bamboo shark highlighting the conservation of chromosomal architecture among chondrichthyans. b–d, Organisation of segments derived from each CLG in bamboo shark, gar and chicken genome using the same colour code as in Fig. 1. Each bin along chromosomes represents 20 genes.

Extended Data Fig. 3 The skate genome is organized in A/B compartments.

a. 500 kb resolution Pearson matrices of a representative chromosome (Leri_11C) and their associated eigenvectors showing marked compartmentalization in A/B compartments in both replicates. b. Eigenvector correlation between the two replicates. c. Merged Pearson matrix presented together with its eigenvector, the normalized signal for ATAC-seq in anterior pectoral fin, the number of gene models and the percentage of GC content. d. The A compartment in skates correlates with chromatin accessibility and the number of gene models, but no clear correlation was observed with the GC content. e. Saddle plot demonstrating the aggregated enrichment of homotypic A-A and B-B interactions. f. Gene expression in either the A or the B compartment as measured with bulk RNA-seq performed in the anterior and posterior portions of the skate pectoral fin at Stg 30. Top: anterior, bottom: posterior, n = 4046 bins of 500 kb (A compartment n = 2125, B compartment n = 1921). Boxes correspond to the median and the first and third quartiles (Q1 and Q3). Whiskers extend to the last point within 1.5 times the interquartile range below and above Q1 and Q3, respectively.
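The box and whisker convention stated in this legend can be illustrated with a minimal Python sketch. It is not the authors' plotting code: the function name `box_stats`, the sample values and the quartile interpolation rule (the "inclusive" method of the standard library) are assumptions made here for illustration, since the legend does not say how quartiles were estimated.

```python
from statistics import quantiles


def box_stats(values):
    """Return box and whisker statistics for a list of numbers.

    Hypothetical helper: the box spans Q1 to Q3 with the median inside,
    and each whisker ends at the most extreme data point that lies within
    1.5 x IQR of the box.
    """
    data = sorted(values)
    # Quartile rule is an assumption; other interpolation rules give
    # slightly different Q1 and Q3 for small samples.
    q1, med, q3 = quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": med,
        "q3": q3,
        "lower_whisker": min(inside),
        "upper_whisker": max(inside),
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


# Made-up values, not data from the study.
stats = box_stats([1, 2, 3, 4, 5, 6, 7, 8, 100])
print(stats)
```

With these made-up values the upper whisker stops at 8 and the value 100 is reported as an outlier, because it lies more than 1.5 times the interquartile range above Q3.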

Extended Data Fig. 4 Skate chromosomes are organized in TADs flanked by convergent CTCF sites.

a. Hi-C interaction matrices in skate pectoral fins in either of the two replicates and the merge (25 kb resolution). The TAD calling performed in the merged matrix and the associated boundary scores (BS) and insulation scores (IS) are shown below (window size of 500 kb). b. Insulation score correlations between the two replicates. c. From top to bottom, enrichment around TAD boundaries (±250 kb) of ATAC-seq peaks and ATAC-seq peaks containing the CTCF motif regardless of the strand, in the plus and in the minus strand. d. Hi-C matrix around the HoxD locus showing the conserved bipartite configuration in two TADs with HoxD genes located precisely at the boundary. TADs, insulation scores and ATAC-seq peaks containing the CTCF motif are shown. The tendency of having divergent CTCF sites at insulation minima is observable.

Extended Data Fig. 5 H3K4me3 HiChIP unveils the regulatory landscapes of active genes in the anterior and posterior portions of the skate pectoral fin.

a. Proportion of distal loop anchors that also correspond to distal ATAC-seq peaks in the pectoral fin in both the anterior and posterior H3K4me3 HiChIP datasets. b. Proportion of inter-TAD interactions calculated in the anterior and posterior HiChIP datasets compared to a random shuffling of the TADs (grey). c. Spearman correlation of the three valid replicates (1 for anterior and 2 for posterior fins). The correlation between the matrices is limited to the non-redundant set of interactions (union = 50,601 interactions). d. Differential loop analysis derived from read counts in c. logFC vs. logCPM plot with significant differential loops highlighted in red. e,f. Anterior-specific contacts in the Hoxa2 and Alx4 regulatory landscapes (dark blue). g. Posterior-specific contacts in the Hoxb8 regulatory landscape (turquoise).

Extended Data Fig. 6 Preformed 3D chromatin folding in anterior vs. posterior fin.

a. Pearson matrices and eigenvectors showing A/B compartmentalization of the chromosome Leri_12C of skates in the anterior and posterior portions of the pectoral fin. b. Genome-wide eigenvector correlations. c. Quantification of A/B compartment switches between anterior and posterior portions of the fin. d. Comparison of all EV values between anterior and posterior fin. Heatmaps are sorted according to anterior EV values and compartment switches are indicated in the colour bar on top. Most switches are concentrated towards the centre, where EV values are intermediate. e. Comparison of insulation scores and overall TAD structures around the HoxD locus. f. Genome-wide insulation score correlations. g. Correlations of the number of reads found inside a consensus set of loops consisting of the union of the loops (see Methods). h. Differential loop analysis derived from read counts in g. logFC vs. logCPM plot with the only significant differential loop highlighted in red. i. Snapshot of the Hi-C heatmap around the only significant differential loop, located in the Csmd2 locus. Arrowheads indicate the position of the loop. j. Virtual 4C-seq profiles of Hoxa cluster genes derived from the Hi-C experiments. Few differences are apparent, and no differences are evident in contacts between Hoxa2 and the differential loop predicted by HiChIP (purple asterisk).

Extended Data Fig. 7 Conservation of vertebrate TADs after the Whole Genome Duplications.

a. Intergenic spaces between microsyntenic pairs conserved across vertebrates (present in skate and osteichthyes, here mouse and garfish) are devoid of TAD boundaries. Syntenic gene pairs n = 3017, non-syntenic n = 25386. Two-sided χ² p-value = 3.7 × 10⁻¹³. b. 40% of skate TADs contain a deeply conserved microsyntenic pair. Several of them contain more than one association. c. TADs containing deeply conserved microsyntenic associations are bigger, contain more ATAC-seq peaks and more loops as defined using HiChIP (Syntenic TAD n = 718, non-syntenic TAD n = 960). Foxc1/Gmds (d) and Ptch1/Eif2b3 (e) are examples of deeply conserved microsyntenic associations. Microsyntenic area is shaded in grey. Hi-C, TADs, HiChIP and ATAC-seq data are shown along with the gene tracks. f. Gene content of TADs associated to the different paralogous segments of the genome originated after the two rounds of WGD (1 or 2 for the 1R, alpha or beta for the 2R). Boxes correspond to the median and the first and third quartiles (Q1 and Q3). Whiskers extend to the last point within 1.5 times the interquartile range below and above Q1 and Q3, respectively. g. Number of regulatory landscapes (defined as the group of interactions anchored by a single gene promoter) belonging to the different paralogous segments of the genome originated after the two rounds of WGD (1 or 2 for the 1R, alpha or beta for the 2R). h. Regulatory landscape sizes observed in the paralogous segments of f, defined as the genomic space spanning between the two most distal loop anchors anchored to a given promoter. Boxplots defined as in f. i. The fate of the counterparts of alpha TADs was investigated in the beta copy and vice versa. TADs with more than one gene conserved allowed us to infer scenarios of TAD fissions-fusions in either of the genome copies. Asterisks (*) highlight complete TAD losses in beta (yellow bar) and TAD fission events in alpha (blue bars).

Extended Data Fig. 8 Rearranged TADs in the skate lineage involve PCP-related genes.

a. Extended version of the upset plot presented in Fig. 4a with the quantification of synteny breaks detected in different vertebrate species using the skate genome as a reference. The barplot on top shows the quantification of synteny breaks for the species combination indicated by the dots below. The barplot on the left shows the total quantification of synteny breaks for each individual species. b. ReactomePA¹¹⁹ clustering of significant terms found in the set of candidate genes for regulatory rearrangements in the anterior pectoral fin. P-values are BH-corrected p-values obtained with a one-sided Fisher test for term overrepresentation (ReactomePA default). A selection of these terms is shown in Fig. 4c. c. Cnetplot showing the relationship of candidate genes with each of the different enriched terms. d. Candidate rearrangement at the Psmd11 locus, implicated in the PCP pathway. Pectoral fin Hi-C map is shown on top together with the TAD predictions. Below, the synteny blocks shared with the different species studied are shown, with the candidate synteny break highlighted in red. Finally, an arachnogram shows the contacts derived from the anterior fin H3K4me3 HiChIP experiment. e. Same as in d, but for the Notch signalling-related gene Adam10. f. Same as in d and e, but for the Hox activator Psip1. Note that this time the presented H3K4me3 HiChIP is from posterior pectoral fins. g. Whole-mount in situ hybridization against Psip1 in both the little skate L. erinacea and the catshark S. retifer shows species-specific expression of Psip1 in the anterior portion of the skate pectoral fins. n = 5 for skates and sharks. The scale bar corresponds to 2 mm.

Extended Data Fig. 9 Fin ray development in control and ROCK inhibitor-treated skate embryos.

a. Cartilages in control (stages 30 and 31) and ROCK inhibitor-treated embryos (stage 31) were examined by Alcian blue staining. Five replicates for each condition are shown. The whole-mount staining showed that anterior fin ray development is affected by ROCK inhibitor treatment, with some variation. The number of fin rays attached to the propterygium (pro), mesopterygium (meso) and metapterygium (meta) was counted under a stereomicroscope and statistically analysed (Fig. 4). The scale bar is 2 mm. b. Total body length of control (stages 29 to 31) and ROCK inhibitor-treated embryos (stage 31). Note that the body length of ROCK inhibitor-treated embryos is longer than that of stage 30 embryos (* = Bonferroni-corrected two-sided t-test p-value = 0.01232), indicating that the inhibitor-treated embryos developed normally and that the pectoral fin phenotype was not due to overall defects in body development. Five replicates for each condition were examined and body length distributions were assumed to be normal. The minima, maxima and median values of the box and whisker plots are 42, 45 and 44 (stage 29); 49, 51 and 50 (stage 30); 53, 56 and 54 (stage 31); and 51, 55 and 53 (ROCK inhibitor-treated embryos), respectively. Boxes correspond to the median and the first and third quartiles (Q1 and Q3). Whiskers extend to the last point within 1.5 times the interquartile range below and above Q1 and Q3, respectively.
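The Bonferroni correction named in panel b can be sketched in a few lines of Python. This is an illustration of the general procedure only: the legend does not state how many comparisons were made or what the uncorrected p-values were, so the function name `bonferroni` and the input values below are hypothetical.

```python
def bonferroni(p_values):
    """Bonferroni-adjust a list of raw p-values.

    Each raw p-value is multiplied by the number of tests performed and
    capped at 1.0, which controls the family-wise error rate.
    """
    n_tests = len(p_values)
    return [min(1.0, p * n_tests) for p in p_values]


# Hypothetical raw p-values from three pairwise comparisons
# (not values from the study).
raw = [0.004, 0.02, 0.6]
adjusted = bonferroni(raw)
for p_raw, p_adj in zip(raw, adjusted):
    print(f"raw p = {p_raw:.3f} -> adjusted p = {p_adj:.3f}")
```

An adjusted value is compared directly with the chosen significance threshold; a raw p-value that passes the threshold on its own can fail it once the number of comparisons is taken into account.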

Extended Data Fig. 10 Geometric morphometric analyses of the inhibition of the PCP pathway using a rho-kinase (ROCK) inhibitor in stage 31 skate embryos.

a. Schematic of the landmark design used in these analyses, including both landmarks (numbered red points) and semi-landmarks (small red points). b. Principal components analysis shows that specimen shapes cluster by treatment and stage. Points X and Y were used to generate the deformation grids showing the shape changes between the area of the PCA plot dominated by control (c) and ROCK-inhibited specimens (d). Note the inhibition of growth on the anterior region of the pectoral fin in the ROCK-inhibited specimens.

Extended Data Fig. 11 Cartilage staining of DMSO or the ROCK inhibitor-beads implanted skate embryos at stage 31.

The beads were implanted into the anterior part of the right pectoral fin every two weeks (three times in total) from stage 29. Some beads were retained until stage 31 (blue dots), while others fell out during the treatment. The embryos with the ROCK-inhibitor beads exhibited fusion, loss or disorganized patterning of fin rays (arrows, 6/9 for 100 μM and 6/10 for 1 mM). Note that abnormal fin ray patterning was never observed in control animals, indicating that the effects not directly associated with a bead in treated embryos were likely derived from the loss of the bead during the treatment. N = 9 for DMSO, 9 for 100 μM inhibitor, and 10 for 1 mM inhibitor beads. The scale bar is 2 mm.

Extended Data Fig. 12 Genetic interactions among Hox and Gli3 genes.

a. ChIP-seq experiment in mouse embryonic branchial arches performed in Amin et al. 2015, which shows the binding profile of HoxA2 at the Gli3 genomic locus. b. Whole-mount in situ hybridization of gli3 in zebrafish embryos injected with a hoxd13a-GR mRNA. Developing fins are indicated with red arrowheads. In the absence of dexamethasone (left panel), the construct is inactive and the embryos develop normally (50 out of 57, 88%). Upon treatment with dexamethasone (right panel), hoxd13a is activated and causes a reduction of gli3 expression at the developing fin region (mild reduction in 39 out of 93, 42%; strong reduction in 22 out of 93, 24%). Scale bars = 250 µm.

Supplementary information

Supplementary Information

This file contains Supplementary Figs. 1–16 and a guide to the Supplementary Tables.

Reporting Summary

Supplementary Tables

Supplementary Tables 1–5 and 8–14.

Supplementary Tables

Supplementary Tables 6 and 7.

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Marlétaz, F., de la Calle-Mustienes, E., Acemel, R.D. et al. The little skate genome and the evolutionary emergence of wing-like fins. Nature 616, 495–503 (2023). https://doi.org/10.1038/s41586-023-05868-1

Received: 23 March 2022
Accepted: 21 February 2023
Published: 12 April 2023
Issue Date: 20 April 2023
DOI: https://doi.org/10.1038/s41586-023-05868-1

This article is cited by

Convergent gene losses and pseudogenizations in multiple lineages of stomachless fishes
- Akira Kato
- Supriya Pipil
- Yoshio Takei
Communications Biology (2024)

Genomic reconsideration of fish non-monophyly: why cannot we simply call them all ‘fish’?
- Shigehiro Kuraku
- Mana Sato
- Yoshinobu Uno
Ichthyological Research (2024)

Genome reveals how the skate got its wings
- Chris Amemiya
Nature (2023)

Genomic Characteristics of Okamejei kenojei and the Implications to Its Evolutionary Biology Study
- Na Song
- Siyu Ma
- Linlin Zhao
Marine Biotechnology (2023)
