Low mutation rate in epaulette sharks is consistent with a slow rate of evolution in sharks

Sendell-Price, Ashley T.; Tulenko, Frank J.; Pettersson, Mats; Kang, Du; Montandon, Margo; Winkler, Sylke; Kulb, Kathleen; Naylor, Gavin P.; Phillippy, Adam; Fedrigo, Olivier; Mountcastle, Jacquelyn; Balacco, Jennifer R.; Dutra, Amalia; Dale, Rebecca E.; Haase, Bettina; Jarvis, Erich D.; Myers, Gene; Burgess, Shawn M.; Currie, Peter D.; Andersson, Leif; Schartl, Manfred

doi:10.1038/s41467-023-42238-x

Download PDF

Article
Open access
Published: 19 October 2023

Low mutation rate in epaulette sharks is consistent with a slow rate of evolution in sharks

Nature Communications volume 14, Article number: 6628 (2023) Cite this article

5669 Accesses
2 Citations
342 Altmetric
Metrics details

Subjects

Abstract

Sharks occupy diverse ecological niches and play critical roles in marine ecosystems, often acting as apex predators. They are considered a slow-evolving lineage and have been suggested to exhibit exceptionally low cancer rates. These two features could be explained by a low nuclear mutation rate. Here, we provide a direct estimate of the nuclear mutation rate in the epaulette shark (Hemiscyllium ocellatum). We generate a high-quality reference genome, and resequence the whole genomes of parents and nine offspring to detect de novo mutations. Using stringent criteria, we estimate a mutation rate of 7×10⁻¹⁰ per base pair, per generation. This represents one of the lowest directly estimated mutation rates for any vertebrate clade, indicating that this basal vertebrate group is indeed a slowly evolving lineage whose ability to restore genetic diversity following a sustained population bottleneck may be hampered by a low mutation rate.

Shark genome size evolution and its relationship with cellular, life-history, ecological, and diversity traits

Article Open access 17 April 2024

Historical contingency shapes adaptive radiation in Antarctic fishes

Article 10 June 2019

Genomic insights into the historical and contemporary demographics of the grey reef shark

Article 16 March 2022

Introduction

Sharks are members of one of the most basal of vertebrate clades, the Chondrichthyans, that emerged from mass extinction events in the Permian and Jurassic periods to radiate and dominate many marine food webs^1,2. Modern sharks play important functional roles in the regulation and maintenance of a diverse range of marine ecosystems^3,4,5. However, little is known about the evolutionary rate and adaptive potential of shark populations, a fact that has come into sharper focus with the emergence of the dual ecological pressures of overfishing and habitat loss. Specific drivers of overfishing in shark populations are particularly impacting. Firstly, shark populations are severely adversely affected by their incidental capture in fisheries directed at other species, with sharks caught as bycatch in the high seas pelagic longline fisheries being particularly impactful⁶. Secondly, many species are directly targeted by the ‘fin trade’, where shark fins are harvested for human consumption. This removes between 26 and 73 million sharks each year, with more than half of the species being under threat of extinction⁷. Thirdly, for many years specific shark populations have also been harvested by an additional, particularly pernicious, industry which produces shark cartilage extracts as dietary supplements for cancer prevention or treatment. The use of this product is based on the claim that sharks do not get cancer^8,9. The shark cartilage supplement industry persists despite the clinical efficacy of shark cartilage-based treatments of cancer being directly refuted by clinical trials¹⁰. Furthermore, the existence of numerous studies documenting that different types of neoplasms do, in fact, occur in sharks has also failed to halt the use of shark cartilage supplements¹¹.

Exacerbating the intense fishing pressures currently facing shark populations is the extreme nature of the life history characteristics that are exhibited by most shark species. Extant sharks are slow-growing, reach sexual maturity late, and have few offspring. They exhibit some of the longest gestation periods and the highest levels of maternal investment in the animal kingdom¹². This generally results in slow population growth and delayed recovery after population collapse. They are, therefore, particularly sensitive to unsustainable fishing practices and rapid changes in habitats^13,14,15,16. How rapidly shark populations are able to evolve to counteract the mounting ecological threats that face them and rebound from historically low population densities will ultimately be dependent on the genetic diversity within populations, a value that itself is dependent on the germline mutation rate.

Mutations are the fundamental substrates of evolution because they generate variability within populations, enabling evolutionary change. The mutation rate (µ) is a crucial parameter for many calculations and predictive modelling in the fields of ecology and evolution, genetics, and genomics. Despite its importance, experimental determination of mutation rates in vertebrates has been strongly mammal-focused (Supplementary Table 1), including a recent study reporting mutation rates in 68 vertebrate species¹⁷. Synonymous substitution rates for chondrichthyans have been reported to be lower than those of osteichthyans, suggesting a low intrinsic mutation rate¹⁸. In addition, a mitochondrial DNA sequence-based study¹⁹ previously indicated that sharks might be a “slow molecular clock lineage”, which—if true—would have consequences for our understanding of the evolution, ecology, and genomics of this basal vertebrate group.

Here we provide a direct estimate of the de novo mutation rate in a species of shark—Hemiscyllium ocellatum (the epaulette shark)—a small, benthic, oviparous species that inhabits coral reef environments in the waters north-east of Australia (Fig. 1a). The epaulette shark is the most studied member of the genus Hemiscyllium or “walking” sharks for which a recent comprehensive molecular phylogenetic analysis based on whole mitochondrial genome sequences of all nine currently recognised species has been completed²⁰. Our development of captive breeding and pair mating protocols for the epaulette shark allows the development of this species as a general model system for shark research and allows us to genetically evaluate the mutation rate within a shark pedigree. To the best of our knowledge, our analysis defines the lowest directly estimated mutation rate for a vertebrate to date, indicating that this basal vertebrate group is a slowly evolving lineage. These results have the potential to at least partially explain the perception of a low rate of cancer in shark species, and they also illustrate an additional hurdle that sharks face as a clade in maintaining genetic diversity against an ever-increasing ratchet of ecological pressures.

**Fig. 1: Distribution and brood colony of epaulette sharks (*Hemiscyllium ocellatum*).**

Results

Development of husbandry and pedigree procedures for Hemiscyllium ocellatum

We developed infrastructure to house a captive broodstock of epaulette sharks (Fig. 1b). Reproductively mature adults were sourced from wild populations along the northeastern Australian Coast. Within this brood stock, we isolated a single captive female and male breeding pair. Epaulette sharks are oviparous, and females lay two to four eggs per month²¹. To avoid false paternity assignment due to possible sperm storage, the male and female sharks were maintained in isolation for a period of approximately 10 months prior to the onset of egg collection. Genomic DNA from ten F1 offspring was obtained from pre-hatching, whole embryos collected from the isolated breeding pair. Maternal and paternal genomic DNA were obtained from blood samples from each adult.

High-quality assembly of the Hemiscyllium ocellatum genome

A trio from our pedigree comprising the male and the female and one of their progeny was used for the genome assembly. Following the phase 1 pipeline of the Vertebrate Genome Project²², we used the “trio binning” strategy²³ to generate a haplotype-resolved genome assembly. For this method, we generated high coverage (113-135X) Illumina short-read sequences using genomic DNA isolated from maternal and paternal blood samples and 50X sequence coverage from a single F1 male offspring using PacBio Sequel II SMRT sequencing and genomic DNA isolated from tissue. A genome assembly was obtained using Canu²⁴. Chromosome scaffolding was performed by integrating data from Hi-C (Dovetail) and optical mapping (Bionano) data. The strategy resulted in a paternal haplotype assembly (used as the reference assembly; GCA_020745735.1) of 3.98 Gb with 52 autosomes plus X (CM036711.1) and Y (CM036712.1) chromosomes. The assembly is of high quality, with 5667 contigs resolved into 26 large scaffolds and an additional 1937 minor scaffolds. The N50 for the scaffolds is 83.6 Mb, and the scaffold L50 is 17 (Fig. 2, Table 1). A maternal haplotype assembly (GCA_020745765.1) was also assembled with a total length of 4.15 Gb. Karyotype analysis of cultured embryonic fibroblasts confirms the chromosome count from the genome assembly, demonstrating 52 autosome pairs and a pair of XY sex chromosomes (Fig. 3).

Table 1 Statistics of the genome assembly

Full size table

**Fig. 3: Karyotype of the epaulette shark.**

To annotate protein-coding genes, gene evidence from protein homology of other species, RNA-seq transcriptomes from epaulette shark embryos and ab initio predictions were integrated. A total of 18,225 protein-coding genes were annotated. The BUSCO completeness based on the vertebrata_odb9 data set was improved from 89.3% to 96.0% by the annotation process (Table 2). Totally, 1252 (6.8%) genes were annotated as pseudogenes. Of the 18,225 genes, 17,580 (96.5%) have a BLAST hit to the Swiss-Prot/RefSeq database. Of the protein-coding genes, 1275 (7%) are single exon genes. Additionally, 747 tRNA, 35 rRNA, 180 miRNA, and 854 other noncoding RNA genes were annotated (Table 2).

Table 2 Metrics of the genome annotation

Full size table

Identification of de novo mutations

To provide a direct estimate of the de novo mutation rate in the epaulette shark, we generated 10X Genomics linked-reads sequencing data for nine F1 progeny produced during our captive breeding experiment. As identification of de novo mutations requires high sequence coverage, we sequenced each offspring to ~49–82× coverage (Supplementary Table 2). The resulting sequences, along with parental Illumina sequences (~113–135× coverage) previously generated for genome assembly construction, were aligned to our genome assembly and genotypes called at both variant and invariant sites using GATK²⁵ (see “Methods”). High genotype concordance confirmed a single paternity across the pedigree. In a known pedigree, de novo mutations can be identified as variant sites where an offspring carries an allele absent in both parents. However, offspring-parent genotype discordance can also arise via sequencing and alignment errors²⁶. A standard genotype-calling pipeline (e.g., GATK best practices) will typically lead to most novel variants detected being false positives, as has been empirically demonstrated in the Atlantic herring²⁷. Hence, prior to screening for candidate de novo mutations, we applied a strict genotype filtering pipeline²⁸ (Fig. 4, see Methods) designed to identify genomic positions where sample genotypes could be confidently called. This pipeline resulted in 333–457 Mb of sequence available per trio for variant screening (Supplementary Table 2), which represents between 8.0% and 11.0% of the genome. Across these “callable” sites, we identified 12 candidate de novo mutations where offspring genotypes did not meet Mendelian expectations (Fig. 5A). Sanger sequencing/plasmid cloning confirmed four candidate de novo mutations as genuine and seven as false positives (Fig. 5A, Fig. 6), indicating a false positive rate of 63.6%. A similarly high false positive rate (61.4%) has been reported previously²⁹. Consistent with germline mutations, all peak ratios for the two alleles of the confirmed de novo mutations were close to 1:1 (Fig. 5B). All four were transitions, in line with the general observation that transitions are more common than transversions³⁰. Validation of the candidate de novo mutation on scaffold_28_mat (position: 17,411,576 bp) was not possible due to failed Sanger sequencing/difficulty cloning the target region in the focal offspring. Due to the presence of a flanking segregating SNP in the same sequencing reads, we could determine that the de novo mutation on scaffold_8_mat (position: 66,018,494 bp) was of paternal origin (Supplementary Fig. 2).

**Fig. 4: In-house genotype filtering pipeline.**

**Fig. 5: Identification of candidate de novo mutations (DNMs).**

Estimation of de novo mutation rate

To provide a correct estimate of the de novo mutation rate, we estimated the false negative rate. For a single offspring (ind1722), we simulated mutations at 947 invariant sites within high-confidence callable regions using the simulation tool SomatoSim³¹. We then repeated previously described genotype calling, variant filtering, and de novo mutation detection pipelines, compared genotype calls with expected genotypes based on the mutated sites and calculated the false negative rate. Our pipeline detected 910 out of 947 simulated mutations, indicating a low false negative rate of 3.9%. We took a conservative approach when estimating the mutation rate by including the candidate mutation on scaffold_28_mat in our calculations. This was the candidate mutation for which successful Sanger sequencing/plasmid cloning of the offspring carrying the mutation (ind2023) could not be conducted. We estimated the mutation rate per site per generation by dividing the number of de novo mutations identified by 2 x the total number of callable sites screened across the pedigree (5/(2 × 3,691,810,944) = 6.8 × 10⁻¹⁰). By correcting for the estimated false negative rate, we obtain 7 × 10⁻¹⁰ mutations per base pair per generation (95% CI: 1.4–14.1 × 10⁻¹⁰, assuming that the mutations are Poisson distributed). Thus, a newborn epaulette shark carries approximately five single base de novo mutations compared with the corresponding estimate of 50–100 de novo mutations in newborn humans, despite the human genome being 25% smaller.

Estimation of long-term effective population size

The relationship between nucleotide diversity (π), mutation rate (μ) and effective population size (N_e) for diploid organisms is π = 4N_eμ. Given this relationship, we calculated the long-term effective population size as N_e = π/4μ. Using the nucleotide diversity observed in the two parents (mean π = 0.002, Fig. 7) and our estimated mutation rate of 7 × 10⁻¹⁰ substitutions per site per generation, we obtained an estimated long-term N_e of ~710,000 individuals.

Discussion

Given the ecological and economic importance of chondrichthyans, and the relatively few genomic resources available from representatives of this clade³², we sought to establish the epaulette shark as a laboratory model system and generate a high-quality, haplotype-resolved reference genome. We could then use this resource to estimate the de novo mutation rate for a shark species. The epaulette shark was chosen for this purpose because of the possibility of performing captive breeding and thereby ensuring a full-sib family for whole genome resequencing. Our finding of a de novo mutation rate of 7 × 10⁻¹⁰ for the epaulette shark represents the lowest estimated rate yet reported for a vertebrate species, as illustrated in Fig. 8 and taking into account a recent study reporting mutation rates for 68 vertebrate species based on single trios¹⁷. Thus, our results indicate that sharks are a very slow molecular clock lineage.

**Fig. 8: Directly estimated vertebrate de novo mutation rates.**

The estimated mutation rate is 17-fold lower than in humans and an order of magnitude lower than the slowest evolving mammal recorded to date (Supplementary Table 1). However, it should be noted that this estimate reflects the mutation rate in the callable fraction of the genome, which does not include repeat regions (~44%). As replication of repetitive regions tends to be more error-prone, we acknowledge that the true genome-wide mutation rate is likely higher than reported here. However, the decreased ability to accurately call genotypes within repeat regions precludes unbiased screening within these regions. This issue is not unique to the epaulette shark, and as such similar caveats apply when estimating mutation rates in other species, meaning that results should be comparable across species. There is a clear trend for lower mutation rates in poikilothermic vertebrates than in homoeothermic species because the three species with the lowest hitherto reported mutation rates are all fish (Fig. 8). As a correlation between nucleotide substitution rates and metabolism has been documented³³, a possible explanation for the low mutation rate is that the metabolic rate of sharks is up to ten times lower than in mammals of a similar size^34,35. Given that epaulette sharks are restricted to warm tropical waters, even lower de novo mutation rates could be expected in shark species inhabiting cold waters where metabolic rates are likely lower. For example, one of the longest-lived vertebrates is the Greenland shark (Somniosus microcephalus), which inhabits the most extreme latitudes of any shark species, and is exposed to some of the coldest water temperatures on the planet (as low as −1.8 °C)³⁶, and exhibits the lowest mass-specific metabolic rate reported for a shark³⁷.

Sharks do get cancers like all other vertebrates, although this has been suggested to occur at a lower rate than in other vertebrates¹¹. As the shark skeleton is made of cartilage, the hypothesis was put forward that the high amount of cartilage in the shark body prevents the development of cancer. This was inferred from the well-established fact from mammalian cancer research that cartilage, including from sharks, inhibits neovascularization of tumours in vitro and thereby reduces their growth³⁸. Indeed, anti-angiogenic factors have been isolated from mammalian and even shark cartilage³⁹. The active principle behind this phenomenon is that biochemical components of cartilage can adsorb tumour-derived pro-angiogenic factors and thus inactivate them while others directly act as anti-angiogenesis molecules¹⁰.

Following the first reports on anti-angiogenic activity from cartilage, the belief was nurtured that consuming shark cartilage as a “drug” could protect against cancer in humans. A whole industry developed which produces shark cartilage pills. Cartilage companies harvest over 100,000 sharks in US waters per month and up to tens of millions worldwide per year to create their products⁴⁰. Shark cartilage, however, has not been shown to cure or modulate cancer progression in any way. It was ineffective in mouse tumour models⁴¹. This is also the conclusion from at least three randomised, FDA-approved clinical trials^42,43,44. Most cancers originate from spontaneous or induced somatic mutations^45,46. A study has shown that the somatic mutation rate is approximately one order of magnitude higher than the germline mutation rate⁴⁷. Both values correlate, with a lower germline mutation rate being accompanied by a lower somatic rate. Thus, we can also expect the somatic mutation rate in sharks to be low. Sharing their environments with other aquatic animals, which show a higher rate of neoplasms, we infer that the low spontaneous mutation rate of sharks could contribute to the low incidence of tumours suggested for sharks.

Based on the relationship between effective population size, nucleotide diversity, and the mutation rate, we estimated the long-term effective population size for the epaulette shark to be within the order of ~710,000 individuals. Such a large effective population size is unsurprising given that: (1) a previous mark-recapture census has estimated there to be thousands of epaulette sharks inhabiting the reefs surrounding Heron Island alone⁴⁸; and (2) that the species has a broad geographic distribution^20,49,50. While both a large population size and moderate nucleotide diversity (mean π = 0.002) make the epaulette shark likely resilient to loss of diversity following short-term population perturbations, its ultra-low mutation rate means the species’ ability to restore genetic diversity following a sustained population bottleneck would likely be low in particular if the bottleneck affects the entire species population.

Possible explanations for the low mutation rate in epaulette sharks are an intrinsic low mutation rate in poikilothermic species due to their low metabolic rate, combined with efficient purifying selection in a species with a large long-term effective population size that purges slightly deleterious mutations⁵¹, for instance by selecting for efficiency in genes encoding the DNA repair machinery. In line with this suggestion, positive selection for genes involved in the maintenance of genome stability has previously been reported in elasmobranchs⁵². Extrapolating our findings to other shark species that lack the population size stability evident in epaulette sharks suggests a similar low mutation rate may result in long-term negative effects of population bottlenecks in already endangered and overfished species. Our study, therefore, provides compelling evidence for the need to prioritise preservation of the remaining genetic diversity of global shark populations.

Methods

Epaulette shark breeding and sampling

An adult epaulette shark brood stock was maintained in a closed, recirculating marine system, including three 5000 L tanks and a single 2100 L tank housed indoors in the Monash University Aquacore facility. Animals were originally purchased as sexually mature adults from Cairns Marine, which sourced wild-caught epaulettes from a collection area 100 nM south, 200 nM north and 150 nM East of Cairns (Queensland, Australia). Water temperature was maintained at approximately 25 °C, and a graded light cycle was used to mimic sunrise and sunset with a 12-h photoperiod. Sharks were fed a mixed diet that included glassies, pilchard, whiting, pipis and squid four times per week. Epaulette husbandry, breeding, and egg collection were carried out in accordance with approved Monash University Animal Ethics Project ID 30347, and blood samples from adult sharks were collected according to approved Monash University Animal Ethics Project ID 13945. The adult breeding pair used for generating the trio assembly and pedigree analysis was housed in a custom-built 2100 L tank, which allowed the collection of eggs of known parentage. Newly laid eggs collected from the isolated breeding pair were tagged with their date of deposition, transferred to separate glass aquaria, and reared to late pre-hatching stages [Stages 37 and 38 according to refs. ^53,54], flash frozen in liquid nitrogen and stored at −80 °C prior to DNA extraction. The adult tanks and egg-rearing aquaria were maintained on a common marine system and received the same seawater. For blood collection, adults were temporarily anaesthetised with Aqui-S and blood drawn from the caudal vein.

Reference genome assembly and annotation

Sampling

Pre-hatchling sharks were flash frozen, and tissues were later dissected on dry ice. Whole blood from the parents was collected in EDTA-coated tubes and flash frozen and stored at −80 °C prior to DNA extraction.

PacBio sequencing

In total, 25 mg of spleen tissue was used to isolate genomic DNA for PacBio sequencing using the agarose plug Bionano Genomics protocol for cell culture DNA Isolation (#30026F). DNA quality was assessed by Pulsed Field Gel and quantified with a Qubit 2 Fluorometer. A total of 35.1 µg of “ultra” high molecular weight (uHMW) DNA was isolated. 12.4 µg of uHMW DNA was sheared using a 26 G blunt end needle (PacBio protocol PN 101-181-000 Version 05). A large-insert PacBio library was prepared using the Pacific Biosciences Express Template Prep Kit v2.0 (#100-938-900) following the manufacturer’s protocol. The library was then size-selected (>20 kb) using the Sage Science BluePippin Size-Selection System. After size selection, we obtained 2.2 µg of the final library (55.2 ng/µl) with an average size of 54 kb. The PacBio Library was sequenced on 7 PacBio 8 M SMRT cells on the Sequel instrument with the Sequel® II Sequencing Plate 1.0 using the Sequel® II Binding Kit 1.0, capturing 15 h movies with no pre-extension time.

Bionano measurements

In total, 37 mg of heart tissue was used for isolating genomic DNA for PacBio using the agarose plug Bionano Genomics protocol for cell culture DNA Isolation (#30026 F). uHMW DNA quality was assessed by Pulsed Field Gel and quantified with a Qubit 2 Fluorometer. A total of 15.17 µg of uHMW DNA was isolated. uHMW DNA was labelled for Bionano Genomics optical mapping using the Bionano Prep Direct Label and Stain (DLS) Protocol (30206E) and run on one Saphyr instrument flow cell.

Hi–C

was performed by Arima using Arima v1 chemistry (restriction sites: GATC, GANTC) and sequenced on an Illumina NovaSeq 6000 S4 module using 150 bp paired-end (PE) chemistry (~60× coverage).

10× Genomics linked-reads

Unfragmented HMW DNA was used to generate a linked-reads library on the 10× Genomics Chromium platform (Genome Library Kit & Gel Bead Kit v2 PN-120258, Genome Chip Kit v2 PN-120257, i7 Multiplex Kit PN-120262). We sequenced this 10× library on an Illumina NovaSeq S4 module using 150 bp PE chemistry (~60× coverage).

Parental Illumina short-read sequencing

In total, 10 µl of whole blood was used for each parent, and DNA was isolated using the Qiagen QIAmp DNA Blood Kit (Cat. # 51104), DNA was ligated to Illumina adaptors using the Illumina DNA PCR-Free Prep Tagmentation (Cat. # 20027213). We sequenced this library on an Illumina NovaSeq 6000 S4 module using 150 bp PE chemistry (~60× coverage).

Genome assembly

The VGP pipeline v1.6 was used to assemble this genome²². We used TrioCanu v. 2.1²³ (https://canu.readthedocs.io/en/latest/index.html) to assemble the PacBio contigs, and arrow/variantCaller v.2.3.3 was used to polish the contigs with PacBio data. The paternal and maternal assemblies were “purged” to remove false duplicates using purge_dups v.1.2.5⁵⁵ (https://github.com/dfguan/purge_dups). The two haplotypes were then scaffolded separately using scaff10x v.4.2 (https://github.com/wtsi-hpag/Scaff10X) for the 10X data, Bionano Solve v.3.6.1_11162020 for the optical maps and Salsa2 HiC v. 2.2⁵⁶ (https://github.com/marbl/SALSA) for the HiC data. Finally, three rounds of polishing were applied to the two assemblies simultaneously. First, the raw CLR PacBio reads were mapped using the PacBio version of minimap2⁵⁷ (https://github.com/PacificBiosciences/pbmm2) and then polished using variantCaller v.2.3.3. Two rounds of polishing 10× data were mapped to the two haplotypes using Longranger v.2.2.2 (https://github.com/10XGenomics/longranger) and polished using freebayes v.1.3⁵⁸ (https://github.com/freebayes/freebayes). Assemblies were evaluated using Merfin v1.0⁵⁹ (https://github.com/arangrhie/merfin). Assemblies were evaluated with Merqury⁶⁰. The two final haplotype assemblies were curated following the curation process described in ref. ⁶¹. The mitochondrial genome was assembled using mitoVGP⁶² (https://github.com/gf777/mitoVGP).

Repeat masking

For creating a repeat masked assembly (see https://genome.ucsc.edu) WindowMasker was run with the following parameters:

windowmasker -mk_counts true \ -input GCA_020745765.1_sHemOce1.mat.decon.unmasked.fa \ -output wm_counts windowmasker -ustat wm_counts -sdust true \ -input GCA_020745765.1_sHemOce1.mat.decon.unmasked.fa \ -output windowmasker.intervals perl -wpe 'if (s/^>lcl\|(.*)\n$//) {$chr = $1;} \if (/^(\d + ) - (\d + )/) {\$s = $1; $e = $2 + 1; s/(\d + ) - (\d + )/$chr\t$s\t$e/;}' windowmasker.intervals > windowmasker.sdust.bed

The windowmasker.sdust.bed included masking for areas of the assembly that are gaps. The file was ‘cleaned’ to remove those areas of masking in gaps, leaving only the sequence masking. The final result covers 1,838,924,616 bases in the assembly size 4,149,461,884 for a percent coverage of 44.32%.

Annotation

Protein coding genes were annotated by collecting and synthesising the gene evidence from homologous alignment, RNA-seq mapping and ab initio prediction. A pipeline from our previous study⁶³ was used in this process. For homology evidence, 458,466 protein sequences were aligned to the assembly using Exonerate⁶⁴ (https://www.ebi.ac.uk/about/vertebrate-genomics/software/exonerate) and Genewise⁶⁵ (https://www.ebi.ac.uk/Tools/psa/genewise/) for gene structure determination, respectively. Those protein sequences were collected from the vertebrate database of Swiss-Prot (https://www.uniprot.org/statistics/Swiss-Prot), RefSeq database (https://www.ncbi.nlm.nih.gov/refseq/, proteins with ID starting with “NP” from “vertebrate_other”) and the NCBI genome annotation of human (GCF_000001405.39_GRCh38, https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000001405.39/), zebrafish (GCF_000002035.6, https://www.ncbi.nlm.nih.gov/datasets/genome/GCF_000002035.6/), platyfish (GCF_002775205.1, https://ncbi.nlm.nih.gov/datasets/genome/GCF_002775205.1/), medaka (GCF_002234675.1, https://ncbi.nlm.nih.gov/datasets/genome/GCF_002234675.1/), elephant shark (GCF_018977255.1, https://ncbi.nlm.nih.gov/datasets/genome/GCF_018977255.1/), Asian bonytongue (GCF_900964775.1, https://ncbi.nlm.nih.gov/datasets/genome/GCF_900964775.1/), coelacanth (GCF_000225785.1, https://ncbi.nlm.nih.gov/datasets/genome/GCF_000225785.1/) and western clawed frog (GCF_000004195.4, https://ncbi.nlm.nih.gov/datasets/genome/GCF_000004195.4/). For transcriptome evidence, RNA-seq reads from mixed tissue collected from stage 23 and 27 epaulette shark embryos were aligned on the assembly using HISAT⁶⁶ (http://daehwankimlab.github.io/hisat2/). The gene models were determined using StringTie⁶⁷ (https://ccb.jhu.edu/software/stringtie/), and in parallel aligned reads were assembled using Trinity⁶⁸. The resulting transcripts were then aligned to the assembly to determine the gene structure using Splign⁶⁹ (https://www.ncbi.nlm.nih.gov/sutils/splign/splign.cgi). For ab initio prediction, AUGUSTUS⁷⁰ (https://bioinf.uni-greifswald.de/augustus/) was trained using those “good genes” that were determined consistently by Exonerate, Genewise, StringTie and Splign. The trained AUGUSTUS was then run for the ab initio gene prediction with all the gene models obtained above as hints. To synthesise this gene evidence into a final consistent set of annotations, we first clustered overlapped homology gene models and, for each cluster, kept the one best supported by transcriptome evidence. The terminal exons were replaced when they encountered a replacement that was better supported by transcriptome evidence. Genome regions with no homologous gene predicted by ab initio gene models were recruited when they were 100% supported by transcriptome evidence.

The final annotated gene set was blasted through databases of Pfam (https://pfam.xfam.org/), BUSCO⁷¹ (https://busco.ezlab.org/), Swiss-Prot (https://www.uniprot.org) and RefSeq (https://www.ncbi.nlm.nih.gov/refseq/) to check for protein domains, assess annotation completeness and assign gene symbol and name. Genes that are heavily covered by repeat elements, with low homologous coverage, lack transcriptome evidence and/or show no similarity to Pfam/Swiss-prot/RefSeq database were judged as poor quality and were discarded from the final gene set.

Karyotype analysis

Epaulette shark embryos were dissected from egg cases at Stages 32–33 and used to seed fibroblast culture as described⁷², with minor modifications. To prevent contamination, embryos were soaked in povidone-iodine (Betadine) solution for ten seconds and washed in shark PBS containing 1% antibiotic–antimycotic solution (Thermo Fisher Scientific GIBCO). Tissue was then macerated, plated on 24 well plates coated with rat tail collagen I following manufacturer recommendations (Thermo Fisher Scientific-GIBCO) and cultured in LDF media. Cultures were incubated at 26 °C in a humidified atmosphere with 5% CO₂. After 1 week, primary cultured fibroblasts were subcultured using 1.46 U/ml Dipase II (Thermo Fisher Scientific-GIBCO) in PBS supplemented with 299 mM urea and 68 mM NaCl. At maximum proliferation, cells were treated with colcemid (150 ng/ml) for 1.5 to 3 h, harvested and treated with 0.075 M KCl for 40 min. Cells were subsequently fixed in methanol:acetic acid (3:1), and the cell suspension was dropped onto glass slides and air-dried for DAPI banding analysis.

High coverage resequencing of offspring individuals

Muscle tissue was quickly cut from the frozen embryo (kept frozen on dry ice) by Dremel multifunctional tool Model 4000 with EZ SpeedClic Φ38mm at a speed 30,000 rpm and then stored at −80 °C until DNA extraction.

High molecular weight (HMW) genomic DNA (gDNA) was extracted from one tissue section using Nanobind Tissue Big DNA Kit (Pacific Biosciences of California, USA) according to the manufacturer’s protocol [Nanobind Tissue Big DNA Kit Handbook v1.0 (11/2019) -Standard TissueRuptor II HMW Protocol]. The quantity of gDNA was estimated with Qubit (Qubit dsDNA BR assay Kit) and NanoDrop Spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA), while the integrity of the DNA was verified using pulse field gel electrophoresis with the Pippin Pulse^TM device (SAGE Science).

10x genomic linked read sequencing

HMW gDNA was used for 10× genomic linked read sequencing following the manufacturer’s instructions (10× genomics Chromium^TM Reagent Kit v2, revision B). In brief, 1 ng of HMW gDNA was amplified in 10× genome in gel beads (Gel Bead-In-Emulsions = GEM), making use of the ChromiumTM device. Individual gDNA molecules were amplified in these individual GEMS in an isothermal incubation using primers that contain a specific 16 bp 10× barcode and the Illumima® R1 sequence. After breaking the emulsions, pooled amplified barcoded fragments were purified, enriched and went into Illumina sequencing library preparation as described in the protocol. Sequencing was done on a NovaSeq 6000 S1 flow cell using the 2 × 150 cycles paired-end regime plus 8 cycles of i7 index.

Detection of candidate de novo mutations

Read mapping and variant calling

Offspring and parental Illumina sequences were aligned to the maternal haplotype genome assembly using BWA-mem (https://github.com/lh3/bwa) v0.7.17⁷³ (https://github.com/lh3/bwa). Sequence alignments were used to call variants via the GATK²⁵ (https://gatk.broadinstitute.org/) v4.2.0 HaplotypeCaller, which performs simultaneous calling of SNPs and Indels via local de novo assembly of haplotypes (see GATK manual for details). We ran HaplotypeCaller separately for each individual to generate intermediate genomic VCF files (gVCF). Following this, we used the CombineGVCFs and GenotypeGVCFs modules in GATK to merge gVCF records from each individual using the multi-sample joint aggregation step that combines all records, generates correct genotype likelihoods, re-genotypes the newly merged records and reannotates each of the called variants²⁵. Raw variant calls were then filtered using GATK SelectVariants to retain only monomorphic sites and biallelic Single Nucleotide Polymorphisms (SNPs) for downstream analyses.

Genotype filtering

We excluded genotype calls from repetitive regions detected using Repeat Masker (https://github.com/rmhubley/RepeatMasker) v4.1.0 and genomic regions with a mappability score <1. Mappability was calculated using GENMAP⁷⁴ (https://github.com/cpockrandt/genmap) v1.3.0 using a k-mer length of 100 and a maximum of two mismatches. Second, we excluded any genotype call with a genotype quality (GQ) score <20 on the basis that genotype accuracy rapidly declines below this threshold (see GATK manual). Further, we removed sites where either parental genotype was missing, as these are not informative. Following this, we extracted a subset of sites where parents were homozygous for different alleles and all nine offspring were heterozygous (genotype calls were considered homozygous in parents if the minor allele balance was <0.1 and heterozygous in offspring if the minor allele balance was ≥0.25). From these high-confidence heterozygous sites, we extracted the following quality annotations from the VCF INFO field: base quality rank sum; read position rank sum; mapping quality; and quality by depth. In addition, from the VCF FORMAT field, we extracted the depth of coverage annotations for individual genotypes. We then examined their distributions in the high-confidence heterozygous sites and used the 5th and 95th percentiles calculated for each quality annotation as standard cut-offs to filter biallelic and monomorphic sites in our entire dataset. Note: for mapping quality and quality by depth, we only applied the lower cut-off (5th percentile) to prevent penalisation of high-quality sites. Sites that passed our in-house filtering pipeline were considered high-confidence “callable” sites.

De novo mutation calling

From the filtered dataset generated in the previous step, we identified candidate de novo mutations as sites where both parents were homozygous for the same allele and at least one offspring carried a variant allele in the heterozygous state. We conducted secondary genotype calling at these positions, using the mpileup and call functions of bcftools⁷⁵ v1.14 (https://samtools.github.io/bcftools/), and considered sites as true candidates when GATK and bcftools genotypes matched, and when the putative mutation was supported by at least 25% reads, i.e. had a minor allele balance ≥0.25.

Estimation of the false negative rate

We simulated mutations by introducing variants directly into sample BAM files using the Single Nucleotide Variant (SNV) simulation tool SomatoSim v1.0.0³¹ (https://github.com/BieseckerLab/SomatoSim). The advantages of this approach compared to generating synthetic reads from a reference file is that this approach allows for error profiles to be preserved and does not limit variant allele frequencies (VAFs), variant locations, or the number of variants that can be simulated. For a single offspring (ind1722), we simulated mutations at 947 invariant sites within high-confidence callable regions. Each mutated site had its frequency of mutated reads determined by sampling from the observed frequency distribution of callable heterozygous sites in the original dataset. We then repeated previously described genotype calling, genotype filtering and de novo mutation detection pipelines, compared the SNP calls with expected genotypes based on the mutated sites and calculated the false negative rate.

Experimental validation and parental origin of de novo mutations

To confirm the authenticity of candidate de novo mutations, we performed Sanger sequencing of the genomic regions around each candidate in both parents and all nine offspring. To confirm the sequence of the parents at candidate mutation sites, genomic DNA was extracted from blood samples using a PureLink™ Genomic DNA Mini Kit (Thermofisher) and used as a template for PCR amplification. PCR was performed using Phusion® High Fidelity DNA Polymerase (NEB) or PrimeSTAR GXL DNA Polymerase (Takara). Primer pairs are summarised in Supplementary Table 3. Sanger sequencing was performed on amplified fragments with the respective forward and reverse PCR primers, or fragments were cloned into pGEM®Teasy vector (Promega) and sequenced from M13 forward and M13 reverse sites.

For mutations detected in offspring individuals, the same DNA samples used for whole genome sequencing were used. PCR and Sanger sequence-based screen PCR amplification of the region of interest was performed in a total volume of 10 µl making use of the Phusion Flash Mastermix (Thermo Scientific) with 2 µl input of genomic DNA, 0.5% DMSO and 0.5 µM forward and reverse primer. All details on primer sequences (target-seq-primer) and on PCR conditions are listed in Supplementary Table 3. Sanger sequencing was performed either with the respective forward and reverse PCR primers or, if required, with internally located dedicated sequencing primers to cover the region of interest. For offspring ind2046 the target region around the candidate mutation located on scaffold_4_mat (position 13524629) was cloned into pGEM®Teasy vector, and four single plasmid clones sequenced by Plasmidsaurus (www.plasmidsaurus.com), using Oxford Nanopore Technology MinION with coverage that exceeded 200x.

Parental origin of de novo mutations

We attempted to infer the parental offspring of verified de novo mutations based on the occurrence of flanking SNP alleles that segregated between the two parents (i.e., positions where parents were homozygous for different alleles) and occurred within the same Illumina read or mate-pair read. Due to the limited presence of segregating sites, this was only possible for a single de novo mutation.

Estimation of nucleotide diversity and effective population size

For parental samples, we estimated nucleotide diversity (π) in non-overlapping 100 kb windows using pixy⁷⁶. Given the relationship between nucleotide diversity (π), mutation rate (μ) and effective population size (N_e) for diploid organisms is π = 4N_eμ, we extrapolated the effective population size using the formula: N_e = π/4μ.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Data used for genome assembly construction are available at: https://genomeark.github.io/genomeark-all/Hemiscyllium_ocellatum.html. Additional raw Illumina sequencing reads used for the detection of candidate de novo mutations have been deposited at NCBI under the BioProject PRJNA900175. Genome assemblies (paternal and maternal haplotypes) are available from NCBI under GenBank accession numbers GCA_020745735.1 and GCA_020745765.1. Both haplotypes are also available through the UCSC genome browser gateway (https://genome.ucsc.edu/h/GCA_020745735.1 and https://genome.ucsc.edu/h/GCA_020745765.1). Source data are provided in this paper.

Code availability

Custom code used for the detection of candidate de novo mutations is available on GitHub.

References

Compagno, L. J. V. Alternative life-history styles of cartilaginous fishes in time and space. Environ. Biol. Fishes 28, 33–75 (1990).
Article Google Scholar
Kriwet, J., Witzmann, F., Klug, S. & Heidtke, U. H. J. First direct evidence of a vertebrate three-level trophic chain in the fossil record. Proc. Biol. Sci. 275, 181–186 (2008).
PubMed Google Scholar
Ferretti, F., Worm, B., Britten, G. L., Heithaus, M. R. & Lotze, H. K. Patterns and ecosystem consequences of shark declines in the ocean. Ecol. Lett. 13, 1055–1071 (2010).
Article PubMed Google Scholar
Heithaus, M. R., Wirsing, A. J. & Dill, L. M. The ecological importance of intact top-predator populations: a synthesis of 15 years of research in a seagrass ecosystem. Mar. Freshw. Res. 63, 1039–1050 (2012).
Article Google Scholar
Stevens, J. D., Bonfil, R., Dulvy, N. K. & Walker, P. A. The effects of fishing on sharks, rays, and chimaeras (chondrichthyans), and the implications for marine ecosystems. ICES J. Mar. Sci. 57, 476–494 (2000).
Article Google Scholar
Oliver, S., Braccini, M., Newman, S. J. & Harvey, E. S. Global patterns in the bycatch of sharks and rays. Mar. Policy 54, 86–97 (2015).
Article Google Scholar
Clarke, S., Milner-Gulland, E. J. & Bjørndal, T. Social, economic, and regulatory drivers of the shark fin trade. Mar. Resour. Econ. 22, 305–327 (2007).
Article Google Scholar
William Lane, I. & Comac, L. Sharks Still Don’t Get Cancer. (Avery Publishing Group, 1996).
William Lane, I. Sharks Don’t Get Cancer. (Avery Publ., 1992).
Patra, D. & Sandell, L. J. Antiangiogenic and anticancer molecules in cartilage. Expert Rev. Mol. Med. 14, e10 (2012).
Article CAS PubMed Google Scholar
Ostrander, G. K., Cheng, K. C., Wolf, J. C. & Wolfe, M. J. Shark cartilage, cancer and the growing threat of pseudoscience. Cancer Res. 64, 8485–8491 (2004).
Article CAS PubMed Google Scholar
Cortés, E. Life history patterns and correlations in sharks. Rev. Fish. Sci. 8, 299–344 (2000).
Article Google Scholar
Musick, J. A. Life in the Slow Lana: Ecology and Conservation of Long-lived Marine Animals. (American Fisheries Society, Maryland, 1999).
Cortés, E. Incorporating Uncertainty into demographic modeling: application to shark populations and their conservation. Conserv. Biol. 16, 1048–1062 (2002).
Article Google Scholar
García, V. B., Lucifora, L. O. & Myers, R. A. The importance of habitat and life history to extinction risk in sharks, skates, rays and chimaeras. Proc. Biol. Sci. 275, 83–89 (2008).
PubMed Google Scholar
Dulvy, N. K. & Forrest, R. E. Life histories, population dynamics, and extinction risks in chondrichthyans. in Sharks and their relatives II 655–696 (CRC Press, 2010).
Bergeron, L. A. et al. Evolution of the germline mutation rate across vertebrates. Nature 615, 285–291 (2023).
Hara, Y. et al. Shark genomes provide insights into elasmobranch evolution and the origin of vertebrates. Nat. Ecol. Evol. 2, 1761–1771 (2018).
Article PubMed Google Scholar
Martin, A. P., Naylor, G. J. P. & Palumbi, S. R. Rates of mitochondrial DNA evolution in sharks are slow compared with mammals. Nature 357, 153–155 (1992).
Article CAS PubMed ADS Google Scholar
Dudgeon, C. L. et al. Walking, swimming or hitching a ride? Phylogenetics and biogeography of the walking shark genus Hemiscyllium. Mar. Freshw. Res. 71, 1107–1117 (2020).
Article Google Scholar
Heupel, M. R., Whittier, J. M. & Bennett, M. B. Plasma steroid hormone profiles and reproductive biology of the epaulette shark, Hemiscyllium ocellatum. J. Exp. Zool. 284, 586–594 (1999).
Article CAS PubMed Google Scholar
Rhie, A. et al. Towards complete and error-free genome assemblies of all vertebrate species. Nature 592, 737–746 (2021).
Article CAS PubMed PubMed Central ADS Google Scholar
Koren, S. et al. De novo assembly of haplotype-resolved genomes with trio binning. Nat. Biotechnol. https://doi.org/10.1038/nbt.4277 (2018).
Koren, S. et al. Canu: Scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
Article CAS PubMed PubMed Central Google Scholar
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
Yoder, A. D. & Tiley, G. P. The challenge and promise of estimating the de novo mutation rate from whole-genome comparisons among closely related individuals. Mol. Ecol. 30, 6087–6100 (2021).
Article PubMed Google Scholar
Feng, C. et al. Moderate nucleotide diversity in the Atlantic herring is associated with a low mutation rate. Elife 6, e23907 (2017).
Article PubMed PubMed Central Google Scholar
Sendell-Price, A. T. et al. Low mutation rate in epaulette sharks is consistent with a slow rate of evolution in sharks (this paper), in-house genotype filtering pipeline. https://doi.org/10.5281/zenodo.8276020 (2023).
Koch, E. M. et al. De novo mutation rate estimation in wolves of known pedigree. Mol. Biol. Evol. 36, 2536–2547 (2019).
Article CAS PubMed PubMed Central Google Scholar
Vogel, F. & Kopun, M. Higher frequencies of transitions among point mutations. J. Mol. Evol. 9, 159–180 (1977).
Article CAS PubMed ADS Google Scholar
Hawari, M. A., Hong, C. S. & Biesecker, L. G. SomatoSim: precision simulation of somatic single nucleotide variants. BMC Bioinforma. 22, 109 (2021).
Article Google Scholar
Pearce, J., Fraser, M. W., Sequeira, A. M. M. & Kaur, P. State of shark and ray genomics in an era of extinction. Front. Mar. Sci. 8, 744986 (2021).
Martin, A. P. & Palumbi, S. R. Body size, metabolic rate, generation time, and the molecular clock. Proc. Natl Acad. Sci. USA 90, 4087–4091 (1993).
Article CAS PubMed PubMed Central ADS Google Scholar
Whitney, N. M., Lear, K. O., Gaskins, L. C. & Gleiss, A. C. The effects of temperature and swimming speed on the metabolic rate of the nurse shark (Ginglymostoma cirratum, Bonaterre). J. Exp. Mar. Biol. Ecol. 477, 40–46 (2016).
Article Google Scholar
White, C. R. & Seymour, R. S. Allometric scaling of mammalian metabolism. J. Exp. Biol. 208, 1611–1619 (2005).
Article CAS PubMed Google Scholar
MacNeil, M. A. et al. Biology of the Greenland shark Somniosus microcephalus. J. Fish. Biol. 80, 991–1018 (2012).
Article CAS PubMed Google Scholar
Ste-Marie, E., Watanabe, Y. Y., Semmens, J. M., Marcoux, M. & Hussey, N. E. A first look at the metabolic rate of Greenland sharks (Somniosus microcephalus) in the Canadian Arctic. Sci. Rep. 10, 19297 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Langer, R., Brem, H., Falterman, K., Klein, M. & Folkman, J. Isolations of a cartilage factor that inhibits tumor neovascularization. Science 193, 70–72 (1976).
Article CAS PubMed ADS Google Scholar
Lee, A. & Langer, R. Shark cartilage contains inhibitors of tumor angiogenesis. Science 221, 1185–1187 (1983).
Article CAS PubMed ADS Google Scholar
Camhi, M. D., Valenti, S. V., Fordham, S. V., Fowler, S. L. & Gibson, C. The conservation status of pelagic sharks and rays: Report of the IUCN shark specialist group pelagic shark red list workshop. IUCN Species Survival Commission Shark Specialist Group. Newbury, UK (2009).
Horsman, M. R., Alsner, J. & Overgaard, J. The effect of shark cartilage extracts on the growth and metastatic spread of the SCCVII carcinoma. Acta Oncol. 37, 441–445 (1998).
Article CAS PubMed Google Scholar
Miller, D. R., Anderson, G. T., Stark, J. J., Granick, J. L. & Richardson, D. Phase I/II trial of the safety and efficacy of shark cartilage in the treatment of advanced cancer. J. Clin. Oncol. 16, 3649–3655 (1998).
Article CAS PubMed Google Scholar
Lu, C. et al. Chemoradiotherapy with or without AE-941 in stage III non–small cell lung cancer: A randomized phase III trial. J. Natl Cancer Inst. 102, 859–865 (2010).
Article CAS PubMed PubMed Central Google Scholar
Loprinzi, C. L. et al. Evaluation of shark cartilage in patients with advanced cancer: A North Central Cancer Treatment Group trial. Cancer 104, 176–182 (2005).
Article PubMed Google Scholar
Cannataro, V. L., Mandell, J. D. & Townsend, J. P. Attribution of cancer origins to endogenous, exogenous, and preventable mutational processes. Mol. Biol. Evol. 39, msac084 (2022).
Qing, T. et al. Germline variant burden in cancer genes correlates with age at diagnosis and somatic mutation burden. Nat. Commun. 11, 2438 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Milholland, B. et al. Differences between germline and somatic mutation rates in humans and mice. Nat. Commun. 8, 15183 (2017).
Article CAS PubMed PubMed Central ADS Google Scholar
Heupel, M. R. & Bennett, M. B. Estimating abundance of reef-dwelling sharks: a case study of the epaulette shark, Hemiscyllium ocellatum (Elasmobranchii: Hemiscyllidae)1. Pac. Sci. 61, 383–394 (2007).
Article Google Scholar
Springer, V. G., Last, P. R. & Stevens, J. D. Sharks and rays of Australia. Copeia 1994, 1055 (1994).
Article Google Scholar
Allen, G. R., Erdmann, M. V., White, W. T. & Dudgeon, C. L. Review of the bamboo shark genus Hemiscyllium (Orectolobiformes: Hemiscyllidae). J. Ocean Sci. Found. 23, 51–97 (2016).
Google Scholar
Lynch, M. Evolution of the mutation rate. Trends Genet. 26, 345–352 (2010).
Article CAS PubMed PubMed Central Google Scholar
Marra, N. J. et al. White shark genome reveals ancient elasmobranch adaptations associated with wound healing and the maintenance of genome stability. Proc. Natl Acad. Sci. USA 116, 4446–4455 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Onimaru, K., Motone, F., Kiyatake, I., Nishida, K. & Kuraku, S. A staging table for the embryonic development of the brownbanded bamboo shark (Chiloscyllium punctatum). Dev. Dyn. 247, 712–723 (2018).
Article PubMed PubMed Central Google Scholar
Ballard, W. W., Mellinger, J. & Lechenault, H. A series of normal stages for development of Scyliorhinus canicula, the lesser spotted dogfish (Chondrichthyes: Scyliorhinidae). J. Exp. Zool. 267, 318–336 (1993).
Article Google Scholar
Guan, D. et al. Identifying and removing haplotypic duplication in primary genome assemblies. Bioinformatics 36, 2896–2898 (2020).
Article CAS PubMed PubMed Central Google Scholar
Ghurye, J. et al. Integrating Hi-C links with assembly graphs for chromosome-scale assembly. PLoS Comput. Biol. 15, e1007273 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Article CAS PubMed PubMed Central Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv [q-bio.GN] (2012).
Formenti, G. et al. Merfin: improved variant filtering, assembly evaluation and polishing via k-mer validation. Nat. Methods 19, 696–704 (2022).
Article CAS PubMed PubMed Central Google Scholar
Rhie, A., Walenz, B. P., Koren, S. & Phillippy, A. M. Merqury: reference-free quality, completeness, and phasing assessment for genome assemblies. Genome Biol. 21, 245 (2020).
Article CAS PubMed PubMed Central Google Scholar
Howe, K. et al. Significantly improving the quality of genome assemblies through curation. Gigascience 10, giaa153 (2021).
Article PubMed PubMed Central Google Scholar
Formenti, G. et al. Complete vertebrate mitogenomes reveal widespread repeats and gene duplications. Genome Biol. 22, 120 (2021).
Article CAS PubMed PubMed Central Google Scholar
Du, K. et al. Genome biology of the darkedged splitfin, Girardinichthys multiradiatus, and the evolution of sex chromosomes and placentation. Genome Res. 32, 583–594 (2022).
Article PubMed PubMed Central Google Scholar
Slater, G. S. C. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinforma. 6, 31 (2005).
Article Google Scholar
Birney, E., Clamp, M. & Durbin, R. GeneWise and genomewise. Genome Res. 14, 988–995 (2004).
Article CAS PubMed PubMed Central Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kapustin, Y., Souvorov, A., Tatusova, T. & Lipman, D. Splign: algorithms for computing spliced alignments with identification of paralogs. Biol. Direct 3, 20 (2008).
Article PubMed PubMed Central Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic Acids Res. 34, W435–9 (2006).
Article CAS PubMed PubMed Central Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed Google Scholar
Uno, Y. et al. Cell culture-based karyotyping of orectolobiform sharks for chromosome-scale genome analysis. Commun. Biol. 3, 652 (2020).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics 25, 1754–60 (2009).
Article CAS PubMed PubMed Central Google Scholar
Pockrandt, C., Alzamel, M., Iliopoulos, C. S. & Reinert, K. GenMap: ultra-fast computation of genome mappability. Bioinformatics 36, 3687–3692 (2020).
Article CAS PubMed PubMed Central Google Scholar
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Article PubMed PubMed Central Google Scholar
Korunes, K. L. & Samuk, K. pixy: Unbiased estimation of nucleotide diversity and divergence in the presence of missing data. Mol. Ecol. Resour. 21, 1359–1368 (2021).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We gratefully acknowledge the contribution of the Long Read Team of the Dresden Concept Genome Centre. We thank M. Hof for the information and discussion. This study was supported by the DFG Research Infrastructure NGS-CC as part of the Next Generation Sequencing Competence Network (project 423957469) and grants from the Deutsche Forschungsgemeinschaft (SCHA 408/15-1) as part of the DFG Sequencing call to M.S., Vetenskapsrådet (2017-02907) and Knut and Alice Wallenberg Foundation (K.A.W. 2016.0361) to L.A., Australian Research Council Discovery Grant DP220102970 to P.D.C. and F.T., Florida Museum of Natural History to G.P.N., and an NHMRC Fellowship GNT1136567 to P.D.C. This research was supported in part by the Intramural Research Programme of the National Human Genome Research Institute (ZIAHG200386-06). The authors would like to acknowledge the use of computing resources at Uppsala Multidisciplinary Centre for Advanced Computational Science (UPPMAX) in carrying out this work.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors contributed equally: Ashley T. Sendell-Price, Frank J. Tulenko.
These authors jointly supervised this work: Shawn M. Burgess, Peter D. Currie, Leif Andersson, Manfred Schartl.

Authors and Affiliations

Department of Medical Biochemistry and Microbiology, Uppsala University, SE75123, Uppsala, Sweden
Ashley T. Sendell-Price, Mats Pettersson & Leif Andersson
Bioinformatics Research Technology Platform, University of Warwick, Coventry, UK
Ashley T. Sendell-Price
Australian Regenerative Medicine Institute, Monash University, Victoria, 3800, Australia
Frank J. Tulenko, Margo Montandon, Rebecca E. Dale & Peter D. Currie
The Xiphophorus Genetic Stock Center, Department of Chemistry and Biochemistry, Texas State University, San Marcos, TX, 78666, USA
Du Kang
Max-Planck Institute of Molecular Cell Biology and Genetics, 01307, Dresden, Germany
Sylke Winkler, Kathleen Kulb & Gene Myers
Florida Museum of Natural History, University of Florida, Gainesville, FL, 32611, USA
Gavin P. Naylor
Translational and Functional Genomics Branch, National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, 20892, USA
Adam Phillippy & Shawn M. Burgess
Vertebrate Genome Laboratory, Rockefeller University, New York, NY, 10065, USA
Olivier Fedrigo, Bettina Haase & Erich D. Jarvis
Research Center for Genomic and Computational Biology, Duke University, Durham, NC, 27708, USA
Jacquelyn Mountcastle & Jennifer R. Balacco
Cytogenetics and Microscopy Core, National Human Genome Research Institute, National Institutes of Health Bethesda, Bethesda, MD, 20892, USA
Amalia Dutra
Center of Systems Biology Dresden, 01307, Dresden, Germany
Gene Myers
EMBL Australia, Victorian Node, Monash University, Clayton, Victoria, 3800, Australia
Peter D. Currie
Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX77483, USA
Leif Andersson
Developmental Biochemistry, Theodor-Boveri Institute, Biocenter, University of Würzburg, 97074, Würzburg, Germany
Manfred Schartl

Authors

Ashley T. Sendell-Price
View author publications
You can also search for this author in PubMed Google Scholar
Frank J. Tulenko
View author publications
You can also search for this author in PubMed Google Scholar
Mats Pettersson
View author publications
You can also search for this author in PubMed Google Scholar
Du Kang
View author publications
You can also search for this author in PubMed Google Scholar
Margo Montandon
View author publications
You can also search for this author in PubMed Google Scholar
Sylke Winkler
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen Kulb
View author publications
You can also search for this author in PubMed Google Scholar
Gavin P. Naylor
View author publications
You can also search for this author in PubMed Google Scholar
Adam Phillippy
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Fedrigo
View author publications
You can also search for this author in PubMed Google Scholar
Jacquelyn Mountcastle
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer R. Balacco
View author publications
You can also search for this author in PubMed Google Scholar
Amalia Dutra
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca E. Dale
View author publications
You can also search for this author in PubMed Google Scholar
Bettina Haase
View author publications
You can also search for this author in PubMed Google Scholar
Erich D. Jarvis
View author publications
You can also search for this author in PubMed Google Scholar
Gene Myers
View author publications
You can also search for this author in PubMed Google Scholar
Shawn M. Burgess
View author publications
You can also search for this author in PubMed Google Scholar
Peter D. Currie
View author publications
You can also search for this author in PubMed Google Scholar
Leif Andersson
View author publications
You can also search for this author in PubMed Google Scholar
Manfred Schartl
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

L.A. and M.S. conceived and supervised the study; F.T., P.C. and R.E.D. provided the biological material; A.P., O.F., J.M., J.B., B.H., G.P.N., A.D., E.D.J. and S.B. generated the reference genome assembly, A.D. and M.M. prepared the karyotype; F.T. and P.C. provided transcriptome sequence; D.K. performed the annotation; S.W. and G.M. sequenced the offspring; A.T.S.P. and M.P. analysed the pedigree data and identified candidate mutations; S.W., F.T., K.K. and A.T.S.P. validated the mutations; L.A., P.C., S.B. and M.S. interpreted the data; ATSP drafted the paper; S.B., P.C. and L.A. revised the paper with input from other authors; all authors approved the paper prior to submission.

Corresponding authors

Correspondence to Shawn M. Burgess, Peter D. Currie, Leif Andersson or Manfred Schartl.

Ethics declarations

Competing interests

The authors declare no competing interest.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sendell-Price, A.T., Tulenko, F.J., Pettersson, M. et al. Low mutation rate in epaulette sharks is consistent with a slow rate of evolution in sharks. Nat Commun 14, 6628 (2023). https://doi.org/10.1038/s41467-023-42238-x

Download citation

Received: 31 March 2023
Accepted: 03 October 2023
Published: 19 October 2023
DOI: https://doi.org/10.1038/s41467-023-42238-x

This article is cited by

Shark genome size evolution and its relationship with cellular, life-history, ecological, and diversity traits
- Mario Torralba Sáez
- Michael Hofreiter
- Nicolas Straube
Scientific Reports (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.