Genomics of post-bottleneck recovery in the northern elephant seal

Hoelzel, A. Rus; Gkafas, Georgios A.; Kang, Hui; Sarigol, Fatih; Le Boeuf, Burney; Costa, Daniel P.; Beltran, Roxanne S.; Reiter, Joanne; Robinson, Patrick W.; McInerney, Nancy; Seim, Inge; Sun, Shuai; Fan, Guangyi; Li, Songhai

doi:10.1038/s41559-024-02337-4

Download PDF

Article
Open access
Published: 21 February 2024

Genomics of post-bottleneck recovery in the northern elephant seal

Nature Ecology & Evolution volume 8, pages 686–694 (2024)Cite this article

4739 Accesses
85 Altmetric
Metrics details

Subjects

Abstract

Populations and species are threatened by human pressure, but their fate is variable. Some depleted populations, such as that of the northern elephant seal (Mirounga angustirostris), recover rapidly even when the surviving population was small. The northern elephant seal was hunted extensively and taken by collectors between the early 1800s and 1892, suffering an extreme population bottleneck as a consequence. Recovery was rapid and now there are over 200,000 individuals. We sequenced 260 modern and 8 historical northern elephant seal nuclear genomes to assess the impact of the population bottleneck on individual northern elephant seals and to better understand their recovery. Here we show that inbreeding, an increase in the frequency of alleles compromised by lost function, and allele frequency distortion, reduced the fitness of breeding males and females, as well as the performance of adult females on foraging migrations. We provide a detailed investigation of the impact of a severe bottleneck on fitness at the genomic level and report on the role of specific gene systems.

Genetic and demographic history define a conservation strategy for earth’s most endangered pinniped, the Mediterranean monk seal Monachus monachus

Article Open access 11 January 2021

The genetic legacy of extreme exploitation in a polar vertebrate

Article Open access 20 March 2020

Population divergence and gene flow in two East Asian shorebirds on the verge of speciation

Article Open access 12 June 2019

Main

Iconic species are vanishing. Our past and present activities have reduced populations to the point where they may go extinct by demographic processes alone¹. When they survive, inbreeding and genetic drift may reduce the fitness of individuals and the survival potential of populations². Nevertheless, some species survive and apparently thrive. After heavy exploitation led to a severe population bottleneck in 1892, reducing the population to ~20 individuals³, the northern elephant seal (Mirounga angustirostris; hereafter NES) recovered nearly exponentially to over 220,000 today^4,5. Theoretically, a rapid demographic recovery may reduce the negative impact on genetic diversity, but this was not the case for the NES. Census data indicate a rapid recovery^4,5, but years of genetic studies show profoundly reduced genetic diversity⁶. In one of the earliest indications of the impact of population bottlenecks on the health of a species, ref. ⁷ reported the lack of protein (allozyme) diversity in the surviving population of NES. Further studies on genetic diversity at a range of markers (allozymes, mitochondiral (mt)DNA, minisatellite DNA, microsatellite DNA and immune system genes) showed reduced diversity compared with southern elephant seals (Mirounga leonina, a sibling species not impacted by a similar population bottleneck^3,6,8). The loss of diversity was also evident in comparisons of pre- and post-bottleneck NES DNA^9,10. Small population size compounded by extreme polygyny⁸ probably contributed to the loss of genetic diversity. Of 150,388 species reviewed for the IUCN Red List, 42,108 are threatened and 25,615 are endangered or critically endangered (http://www.iucnredlist.org). Many of the endangered or threatened species have previously existed or currently exist as small populations. Since the survival potential of small populations can be influenced by genetic diversity², we wonder whether the loss of genetic diversity for NES has had a measurable effect on their fitness, even though population growth has been robust during the 132 years (~22 generations) since the bottleneck. If so, they could still be vulnerable to some new environmental stress.

Here we sequenced 260 modern and 8 historical genomes, showing that inbreeding, loss of function and the distortion of allele frequencies have reduced the fitness of breeding males and females, as well as the performance of adults on foraging migrations. The loss of fitness associated with inbreeding is well documented¹¹, but the importance of allele frequency distortion and the presence of loss of function (LOF) alleles in specific gene systems is less well understood and provides new inference about the general and lasting impact of population bottlenecks. Ecosystem function depends on biodiversity and the contribution of species within that system. Effective conservation management requires an understanding of the scope of impact from depleted populations on specific functions in ecosystem communities.

Results and Discussion

All of the modern NES samples investigated in our study were from the breeding colony at Año Nuevo, California, USA. They were chosen due to their inclusion in studies on reproductive success or diving profiles. The source of historical samples is given in Supplementary Table 1. We consider diversity across NES genomes, comparing them before and after the bottleneck, followed by inference from the genomic analyses about demographic history. We then consider fitness, first associated with reproductive success, then with diving performance.

Diversity

Most of our modern nuclear genome sequences had greater than 30X read depth (see Extended Data Fig. 1 for full range), while 8 historical genomes with degraded DNA had a broader range of coverage (14.5X ± 12.7X after mapping; Supplementary Table 1). Average heterozygosity per genome was 0.00142 ± 0.00092 (s.d.) before the bottleneck (N = 5) and 0.000176 ± 0.000013 in the modern population (N = 180 adults; Fig. 1). The size range and number of runs of homozygosity (ROH) fragments >100 kb and >1 Mb are shown for male and female modern samples in Extended Data Fig. 2. Coverage was limited for most of the historical samples, restricting the potential for accurately estimating ROH. However, a pairwise comparison for both heterozygosity per sliding 50 kb window and ROH greater than 100 kb is shown in Extended Data Fig. 3, comparing the genomes of two individuals, one from 1884 (Hist8) that sequenced well enough for these analyses and a randomly chosen modern sample from 2009. The individual from 1884 was an adult male. His diversity reflects the population of his parents, who were alive when the species was sufficiently abundant to support a commercial hunt. Each comparison shows a loss of variation after the bottleneck. For this historical genome, the proportion of genotypes homozygous for the LOF allele (among all LOF loci identified by snpEff, see Methods) was 0.0011, while the proportion of heterozygotes was 0.287. Among 180 modern adult genomes, 0.379 ± 0.016 (s.d.) of the identified loci were homozygous for the LOF allele, while 0.193 ± 0.024 were heterozygous.

A mitogenome tree was constructed with the 5 pre-bottleneck samples, compared to 3 from shortly after the bottleneck (1906–1924) and 180 modern genomes (Fig. 1). Two major lineages remained after the bottleneck, and some mutations were gained in the modern population within those two lineages. The pre-bottleneck mitochondrial genomes had an average pairwise genetic distance (uncorrected percentage) between them of 0.00219 ± 0.00146 (s.d.). The two main modern haplotypes remaining (haplotypes 1 and 2; Fig. 1) differ by 0.00232, while diversity within each lineage was 0.0000048 in the haplotype 1 lineage and 0.0000081 in the haplotype 2 lineage. The tree indicates that two mtDNA lineages survived the bottleneck, consistent with earlier reports³.

Demographic history

The demographic history of the species can be estimated in deep time on the basis of coalescent analyses using genomes (with the pairwise sequential Markovian coalescent (PSMC), see Methods). The 14 modern genomes we chose at random (while ensuring 7 males and 7 females, all from the 1980s) all showed essentially the same pattern (Fig. 1). The effective population size (N_e) was ~40,000 during the last interglacial (Eemian; ~130–115 Ka) but fell to ~2,000–4,000 during the last glacial period, reaching a nadir around the last glacial maximum (~20 Ka; Fig. 1). Using the approximate effective to census population size (N_e/N_c)ratio published from a meta-analysis¹² of wildlife species (~0.1), this would suggest a census population of ~20,000, perhaps explaining why the population was so quickly nearly eradicated during the nineteenth century. The population may have been closer to its current size during the Eemian. For an estimate of current N_e (post-bottleneck), we used a method¹³ based on linkage disequilibrium (SNeP; Fig. 1c), which indicated an N_e of 100, suggesting a very low N_e/N_c ratio (~0.00045), consistent with low post-bottleneck diversity and rapid regrowth. This result was replicated with an alternative method (GONE)¹⁴, which indicated the same current N_e (see Extended Data Fig. 4).

Reproductive fitness

The average number of successfully weaned pups per year over a female’s lifetime was known for 40 females in our dataset. This fitness estimate reflects the lifetime contribution to future generations, and numerous studies have considered correlations between heterozygosity and fitness to assess the impact of inbreeding or heterosis (see ref. ¹⁵). We tested for correlation with F_ROH (for ROH > 1Mb; r² = 0.143, F = 6.35, P = 0.016) and with the number of affected alleles across 151 LOF loci (restricted to loss of start or new stop mutations; r² = 0.114, F = 4.88, P = 0.033; Fig. 2). The regression was also significant for all 328 LOF loci found (r² = 0.146, F = 6.49, P = 0.015; Extended Data Fig. 5). The correlation with F_ROH was marginally stronger when the total number of weaned pups over a lifetime was considered (N = 43; r² = 0.19, F = 9.59, P = 0.0035; Extended Data Fig. 5), which does not control for variation in longevity, but longevity itself was not significantly correlated (N = 44; r² = 0.024; F = 1.01; P = 0.32). There was also no significant correlation between F_ROH and the number of pups a female produced (fecundity; N = 44, r² = 0.07, F = 3.16, P = 0.083). We considered possible environmental effects, although females are all from a very similar time range, born between 1981 and 1988, and first weaning between 1985 and 1991. A multiple regression with lifetime reproductive success (weaned per year) as the response variable, and female birth year and year of first weaning as explanatory variables, was not significant (adjusted r² = −0.0096, F = 0.81, P = 0.45). When we included F_ROH as a third explanatory variable, F_ROH was significantly correlated with the response variable (P = 0.03), but none of the interaction factors between explanatory variables were significant (Supplementary Table 2).

A significant correlation for pups weaned, but not for pups produced, suggests that the impairment may be more about the fitness of offspring affected by inbreeding¹⁶ rather than fecundity; however, more data on the genotypes and phenotypes of the relevant pups would be needed to test this further. The linear regression between the number of LOF loci and F_ROH (x axis) was positive and significant (F = 9.78, P = 0.0034, r² = 0.3042; Extended Data Fig. 6), as expected. We also used an additional programme (PROVEAN; https://www.jcvi.org/research/provean) to identify missense mutations and confirm a positive correlation with F_ROH (F = 16.61, P = 0.0002; Extended Data Fig. 6). Three LOF loci were associated with oogenesis (MARF1)¹⁷, oocyte growth (KMT2B)¹⁸ and embryonic development (NBAS)¹⁹, but the profile among these loci was the same for all but three females, and there was no association with fitness. An association with diversity across the genome rather than specific loci would be consistent with the ‘general effect’ hypothesis of heterozygosity–fitness correlations²⁰.

We next consider a possible impact on male reproductive success. In an earlier study, the paternal success of the NES alpha male M12 was low compared with that expected by his frequency of observed copulations²¹. That study had investigated 10 NES harems, all from the same beach and same year, and 6 southern elephant seal (SES) harems. The average paternal success of alpha bulls was significantly lower than copulatory success for NES but not for SES. One NES male stood out as having especially low success (M12). Here we used genomic data to test the paternal success of alpha males at 4 of the same harems, including the harem held by M12. A total of 31 males and 77 pups were included in the paternity tests (including the 4 focal alphas) and there were 51 paternities detected. We found again that only M12 had significantly fewer paternities than expected based on observed copulations (Table 1). M12 also had the highest F_ROH, although all 4 were near the middle of the distribution for all adult NES (see Extended Data Fig. 7). We found a total of 5 loci with LOF (gained a stop codon), known to be associated with male fertility (associated with sperm production or function). These were LRGUK²², MNS1 (ref. ²³), TUBB4B²⁴, SRSF3 (ref. ²⁵) and EZR²⁶. We focused first on homozygotes for the LOF allele in case there is some function from co-dominance. M12 was homozygous for the non-functional version of 4 of these 5 loci, while the other alphas were homozygous for the non-functional version at 1 or 2 loci (Table 1). The negative relationship between paternal success and the frequency of the affected alleles was also seen for 27 non-alpha males (Extended Data Fig. 8 and Table 1). Males gain access to females in this highly polygynous species through competition among males, hence age and overall size are important factors associated with successes. For this reason, a general relationship with inbreeding or loss of function across the genome may not be expected. We tested this by looking for a correlation between non-alpha male paternal success and both F_ROH and relative LOF allele frequency (out of 151 LOF loci found that generated a new start or a stop codon). Neither were significant (F_ROH: r² = 0.031, F = 0.79, P = 0.38; LOF: r² = 0.057, F = 1.5, P = 0.23).

Table 1 Alpha male reproductive success

Full size table

These data suggest that males, including M12 and some of the 27 non-alpha males, were impacted by specific LOF loci, reducing their fertility. Deleterious alleles can be retained after allele frequencies are distorted and purifying selection is weakened by strong genetic drift. The most successful of the non-alpha males (two males with five paternities each; Extended Data Table 1) were homozygous at two or three LOF loci. Both of these males and all alphas apart from M12 had functional alleles at EZR. M12 was homozygous for the LOF allele. This locus is dysfunctional in male humans with asthenozoospermia (reduced sperm motility)²⁶. We compared the number homozygotes for the LOF allele at these five loci (average = 2.14) for all non-alpha males that had at least one offspring with those that had no offspring (average = 2.77). The difference was marginally significant (Mann–Whitney U = −1.72, P = 0.0427).

Diving performance

A critical aspect of elephant seal life history is their extensive deep-diving foraging excursions when they accumulate fat stores to facilitate fasting during the breeding season²⁷. We acquired dive performance data from 92 females with complete dive profile data for up to three foraging trips per female lasting 70–220 d per trip. We generated a ‘performance’ metric as the product of the deepest dive (in metres, averaged over multiple trips) times the proportion of dives deeper than 516 m (median dive depth) times the relative dive duration (relative to the longest duration among individuals). Diving performance was not associated with metrics of genomic diversity (see Extended Data Fig. 9 for a correlation test with F_ROH). We then categorized each seal as having high or low performance (either side of the midpoint). These categories were not differentiated with respect to LOF allele frequency (t = 1.20, d.f. = 90, P = 0.232 for the LOF loci resulting in lost start or new stop codons and t = 0.126, P = 0.90 for all LOF loci). We then compared these two categories at the five variable loci with either non-synonymous mutations or upstream mutations (that may be associated with transcriptional regulation) associated with hypoxia found in our genomes and supported by publications about their relevant function. Two hypoxia loci were variable at non-synonymous sites within the gene: HIF3A (involved in hypoxia gene expression)²⁸ and SETX (protects cells from DNA damage induced during transcription in hypoxia)²⁹. Three other loci had segregating sites upstream of the genes: MB (myoglobin; oxygen provision)³⁰, HIF1A (transcription regulator in response to hypoxia)³⁰ and HYOU1 (cyto-protection during oxygen deprivation)³¹. For all five of these loci, minor allele frequency (MAF) was higher for those individuals that had lower performance scores (Χ² = 9.789, P = 0.0017; Fig. 3). We compared 51 individuals born in the 1980s with 57 born between 2005 and 2015 to see whether there had been a change in MAF at these loci over time. All but one locus had diminished MAF over the intervening four or five generations (Table 2). This suggests that there may be purifying selection against these alleles (comparing MAF for all five loci combined, Χ² = 2.76, P = 0.0483; see Table 2 for details for each locus), but tests involving a longer timeframe would help confirm this. We have no data on the dive performance of the females born in the 1980s, but the range is probably comparable. For our historical genomes, it was possible to genotype eight single-nucleotide polymorphisms (SNPs) among four of these loci (all but HYOU1) for three (4 SNPs|) or four (4 SNPs) seals. MAF ranged from 0.125 to 0.167 (average = 0.151) compared with an average of 0.222 among these loci for modern high-performance and 0.307 for low-performance divers. Thus, for our very limited sample, historical MAF is relatively low, as it is for high-performance modern seals, which suggests that it could be due to purifying selection, although drift remains a possibility. More data would be required to make a strong inference about this.

**Fig. 3: Fitness impact associated with deep diving.**

Table 2 Genotype data for hypoxia loci in seals showing high and low dive performance

Full size table

As a control to test the hypoxia genes results, we used loci of known function (immune system) but for which we expected no correlation with diving performance (upstream SNPs at major histocompatibility complex (MHC) Class 1 B-alpha, CD74, DMA and DPA1, and a non-synonymous change in DQB1). Five MHC loci were found to have either a variable non-synonymous mutation, or a variable mutation in an upstream region. As expected, no significant correlation was found (Χ² = 1.35, P = 0.24; Fig. 3). Stochastic distortion of allele frequencies during a bottleneck can increase the frequency of less-fit minor alleles (although none of these showed LOF). These alleles could be retained in the post-bottleneck population if dominance or co-dominance is protective.

Conclusions

Our results show that the fitness of post-bottleneck northern elephant seals is impacted by stochastic effects and reduced diversity, even though recovery has been rapid, rebounding to a population size comparable to historical maxima. Population bottlenecks are known to distort allele frequency distributions, and distortions have been used to detect bottlenecks³². If alleles with stochastically increased frequency are deleterious, they can be maintained in small populations where purifying selection is relatively weak. This can lead to ‘mutational meltdown’ and extinction in asexual species (Müller’s ratchet³³), but potentially in small populations of sexual species as well³⁴. Genomic studies of endangered species have shown the accumulation of LOF loci at levels comparable to what we have found in the post-bottleneck northern elephant seal. For example, in the pygmy hog (Porcula salvania), of which there are only a few hundred left, there are substantially more frameshift, stop-gained and missense mutations than in related species³⁵.

For the northern elephant seal, we found three categories of post-bottleneck impact. There was a reduction in diversity and an increase in ROH comparing pre- and post-bottleneck genomes, and the loss of diversity was correlated with lower female lifetime reproductive success. Having details on lifetime reproductive success in long-lived animals (as we have here) is rare, but examples of correlations between diversity and fitness proxies are common³⁶. The frequency of LOF alleles was also negatively correlated with female lifetime reproductive success. At specific loci associated with reproductive health, LOF allele frequency was correlated with male reproductive success. However, the lack of a genome-wide heterozygosity–fitness correlation and instead a difference associated with MAF at relevant loci for dive performance was especially striking. There was variation within (non-synonymous change) or upstream of five loci associated with hypoxia, not identified as generating LOF. However, those individuals with higher MAF showed lower dive performance, suggesting an impact from post-bottleneck allele frequency distortion. Historical genomes showed lower average MAF than either high- or low-performance divers, and there was an indication of selection against higher MAF over time. We propose that together, these impacts leave the species vulnerable to environmental stresses (such as climate-induced resource bottlenecks)³⁷ that an uncompromised population may be able to overcome. In conclusion, our data show that despite rapid recovery and apparent stability, the northern elephant seal has reduced fitness impacting their reproductive output and ability to forage efficiently. Important aspects of impact included the stochastic distortion of allele frequencies and the retention of LOF alleles at critical loci.

Methods

Field observation and sampling

Field work was conducted at Año Nuevo State Reserve in California, USA. Details of elephant seal harem observation and tissue sample collection are given in ref. ³. Harem observations were for 6–7 hours per day at all harems. A harem was defined as a group of females associated with a single alpha male. The alpha was the highest-ranking male in the dominance hierarchy associated with the harem. Copulations provide a useful measure of reproductive success³⁸. Observations at night with a photomultiplier video camera revealed the same rate and pattern of copulations by individuals as during the day³⁸. Ronguers were used to take tissue samples from the outer edge of the hind flippers. The samples were stored in 20% dimethylsulfoxide saturated with NaCl³⁹. Samples for estimating reproductive success analyses were collected in 1990 and 1991 and those individuals were tagged and tracked over time (between 1981 and 2005), permitting lifetime reproductive success estimates for a subset of females. Details are provided in ref. ⁴⁰. Paternity testing was possible for females attending a given harem that were sampled the following year (a return rate of 40–60% of tagged females).

Data on diving performance were collected between 2004 and 2018. Details on the tags and data collected are provided in ref. ⁴¹. Tissue samples were again collected from hind flippers and stored in salt/dimethylsulfoxide. All subjects were marked and tagged and recognizable as individuals⁴². Satellite platform transmitter terminals (model ST-6, Telonics) were affixed on the head with marine epoxy, with the antenna angled so that it would be exposed when the seal was at the surface⁴². The transmitter interacted with the ARGOS satellite system to generate locations, which were filtered using standard methods⁴³. Data on position, date and time permitted a record of the distance, track and duration of time spent on the foraging trip. A separate tag was affixed to record the depth profiles (a time-temperature-depth-recorder; the MK7 by Wildlife Computers). Data recorded included the age of the seal, the departure and return dates, the mean depth, the maximum depth, the duration and the mass on departure and return. A single foraging trip was recorded for 8 seals, two trips for 68 seals and three trips for 16 seals. We calculated correlations between genomic diversity and a composite measure of diving behaviour. The composite measure was termed dive ‘performance’ and was generated as the average maximum depth (among all trips recorded) times the proportion of dives deeper than the median depth (516 m) times the relative mean duration (compared to the duration of all other dives from the set of 92 individuals). This metric was devised for this study to account for the various aspects of dive endurance recorded.

Genome sequencing and SNP detection

A total of 260 modern samples were subjected to sequencing. Total genomic DNA was initially extracted using standard phenol/ chloroform extraction and subsequently stored in TE buffer. Short-read libraries for whole-genome sequencing were then constructed as follows. The extracted DNA was sheared into 50–800 bp fragments. Fragments ranging from 150 bp to 500 bp were selected and treated with T4 DNA polymerase (Enzymatics, P7080L) to obtain blunt ends. T-tailed adapters were then ligated to repair these blunt ends. PCR amplification was performed and AMPure XP beads (Agencourt, A63881) were used to purify the PCR products. These short-read libraries were then sequenced on DNBSEQ-T1 sequencers at BGI-Shenzhen, to generate paired-end 100 bp reads. The reference for assembling re-sequencing reads was Mirounga angustirostris from DNA Zoo^44,45 (HI-C, chromosome level from https://www.dnazoo.org/assemblies/Mirounga_angustirostris, total length: 2,366,206,800 bp; scaffold N50: 139,676,048 bp; scaffold N90: 54,920,518 bp, GC content: 41.52%; Supplementary Table 4). The average number of cleaned reads was 1,026.11 million (range: 651.91–7,516.79 million). Percent Q30 was 94.41 on average (range 89.76–97.17; Supplementary Table 5) and average genome coverage was 99.59% (Supplementary Table 6). The average sequencing depth was 33.8X (see Supplementary Fig. 1 for full range). SOAPnuke (v.2.1.5)⁴⁶ was employed for quality control of sequencing reads with parameter ‘-J -l 10 -q 0.1 -n 0.05’ and a config file containing the settings ‘trimBadHead=13,100 trimBadTail=13,100 trim=0,0,0,0 qualSys=2 seqType=0 outQualSys=2’. The clean reads were mapped to the reference genome using BWA⁴⁷ and Realigner in Sentieon⁴⁸, while SNP detection utilized GVCFtyper in Sentieon with default parameters to generate joint-calling of raw SNPs. Sentieon was also used for duplicate read identification and removal (see Supplementary Table 6). The hard filtering of SNPs was performed using VariantFiltration in GATK (v.3.8.1)⁴⁹, with the criterion ‘QD < 2.0 || MQ < 40.0 || FS > 60.0 || ReadPosRankSum < −8.0 || MQRankSum < −12.5 || SOR > 3.0’ as recommended by GATK. Three replicate samples were removed from further analysis.

Historical samples were acquired on loan from the Smithsonian Museum of Natural History, the Harvard Museum of Comparative Zoology and the San Diego Natural History Museum. Samples were bone, tooth or dried dermal tissue. DNA extraction and library construction were done at the Smithsonian Institution’s Center for Conservation Genomics. Work was done in their contained ancient DNA facility using standard precautions to avoid cross-contamination. Extraction followed the protocol in ref. ⁵⁰. Briefly, samples were powdered using a Dremel drill and the calcium extracted from bone and tooth with EDTA (0.5 M, pH 8.0). Samples were then extracted for DNA in extraction buffer (after ref. ⁵⁰ Supplement SD3) overnight at 37 °C with shaking. Solutions were cleaned on Qiagen MinElute columns (28004). DNA quantity was checked on a Qubit 4 fluorometer. DNA libraries were constructed using a KAPA Hyper Prep kit (Roche) following manufacturer protocols. Samples were dual indexed with different pairs of P5 and P7 adapters. Libraries were quantified on Qubit and Tapestation (Agilent) and the results compared.

Historical samples were first sequenced on a MiSeq system (Illumina) to balance quantities on the basis of the number of reads. Reads were mapped against the DNA Zoo NES reference (see above) with Bowtie2 (v.2.3.4.1)⁵¹ using the ‘–very-sensitive’ setting. After balancing quantities, libraries were pooled for sequencing in two Novaseq 6000 (Illumina) S4 lanes (2 × 100 bp). All historical sample sequencing was done at the Baur Core facility at Harvard University. Total reads per sample are shown in Supplementary Table 1. Sequences were trimmed with BBDuk (sourceforge.net/projects/bbmap/) using these settings: ‘k = 25 mink = 15 edist = 1 ktrim = r rcomp = f t = 8 qtrim=rl trimq = 20 maq = 20’. Quality was on average Q36 and the peak fragment size was in the 50–60 bp range, calculated using FastQC (v.0.11.9)⁵². Using clumpify dedupe in BBtools⁵³, duplicate reads were removed. Trimmed and filtered reads were mapped to the NES DNA Zoo reference genome using Bowtie2 with the ‘–very-sensitive’ setting. Mapped reads with a quality score below 10 were removed using SAMtools (v.1.12)^54,55 ‘view’, and the filtered reads were sorted using SAMtools ‘sort’. Variant sites were called using the ‘mpileup’ and ‘call’ commands in BCFtools (v.1.12)⁵⁵. Using SnpSift⁵⁶ ‘filter’, variant calls with a quality score of 20 and depth below 5 were removed. VCF files were further filtered using BCFtools ‘view’ to include only the 17 identified chromosomal scaffolds.

Mitogenomes were generated for 180 modern adults and all 8 historical samples. The reads with low quality, duplications or adaptors were removed using SOAPnuke (v.2.1.5) (https://github.com/BGI-flexlab/SOAPnuke)⁴⁶, leaving clean reads for the final mitogenome assembly. To normalize the samples, randomly resampling the sequences to 40,000,000 reads from the clean reads of each sample was performed using Seqtk (https://github.com/lh3/seqtk). The mitogenome assembly of the NES was carried out using NOVOPlasty v.3.7, which is a de novo seed-extend-based assembler for organelle genomes⁵⁷. The mitochondrial genomes of the NES (Mirounga angustirostris, CM055130.1) and the SES (Mirounga leonine, NC_008422.1) were used as seed sequences for the mitochondria assembly and the assembly parameters were set as follows: PE mode, read length = 100, k-mer = 39, genome range = 12,000–22,000 and type = mito. The mitogenome tree was based on all historical (8) and all modern adult (180) samples and generated in PAUP (https://paup.phylosolutions.com/) by the neighbour-joining method using the Tamura Nei evolution rate correction and 1,000 bootstrap replicates. It was routed with the outgroup, Mirounga leonina.

Genome analysis

A sliding-window analysis using VCFtools⁵⁴ ‘–window-pi’ measured heterozygous sites every 50,000 bp. The total numbers of heterozygous sites across the genome were measured using the command grep on VCF files and heterozygosity determined by dividing by the total number of sites passing the minimum quality and depth filters for each sample. ROHs were measured for runs greater than 100 kb and 1 Mb long using (for 100 kb) the ‘–bed infile.bed–fam infile.fam–bim infile.bim–homozyg–noweb–allow-extra-chr–homozyg-kb 100–out data’ command in PLINK⁵⁸. Before assessing ROH in modern samples, the VCF file was filtered to remove singletons and minor allele frequencies lower than 0.01.

Demographic history was estimated using the coalescent method implemented in PSMC⁵⁹. Aligned mapped reads (BAM files) for 14 samples (7 males and 7 females chosen randomly within sex and among samples from the 1980s) were converted to consensus sequence in FASTQ format using the samtools/bcftools pipeline. First, the samtools ‘mpileup’ command was used to produce the VCF file from the BAM files, and then through bcftools the consensus sequence was generated with the original consensus calling model. Following that, the vcfutils.pl script was used for the FASTQ conversion: bcftools mpileup -Q 30 -q 30 -f NES.fa sample.bam | bcftools call -c | vcfutils.pl vcf2fq -d 10 -D 100 | gzip > diploid_sample.fq.gz. Mapped reads were filtered for a minimum mapping quality (q) of 30 and a minimum base quality (Q) of 30. The minimum (d) and maximum (D) coverages were calculated to allow for vcf2fq, and were set to 10 and 100, respectively (-d value to a third of the average depth and -D value to twice). The FASTQ file was then converted to the input format for PSMC using the following: fq2psmcfa sample.fq.gz > sample.psmcfa. For the final PSMC command, we used 64 atomic time intervals and 28 (=1 + 25 + 1 + 1) free interval parameters: psmc -p ‘4 + 25*2 + 4 + 6’ -o sample.psmc sample.psmcfa. Finally, the PSMC plot was drawn using the command ‘psmc_plot.pl’, with the per-generation mutation rate ‘-u’ and the generation time in years ‘-g’ set to 2.2 × 10⁻⁰⁹ and 2.2 × 10⁻⁰⁶, respectively. One hundred bootstrap replicates were run for each sample and 15/100 samples chosen at random to be included in the plot.

Current effective population size was estimated using the linkage disequilibrium method implemented in SNeP¹³ and GONE¹⁴. SNeP estimates the historical effective population size on the basis of the relationship between r², N_e and c (recombination rate) via linkage disequilibrium estimates using the standard PLINK input file format (.ped and .map files). The squared Pearson’s product-moment correlation coefficient between pairs of loci was used due to the unknown phase of the genotypes. The software uses the physical distance (δ) between two loci as a reference and translates it into linkage distance (d). We used the default values for minimum distance in bp between SNPs to be analysed ‘-mindist’ and maximum distance in bp between SNPs to be analysed ‘-maxdist’ to 50,000 and 4,000,000, respectively. To infer the recombination rate, we used the ‘-svedf’ flag⁵⁸ as a recombination rate modifier and the default MAF < 0.05, as it has been shown that accounting for MAF results in unbiased r² estimates irrespective of sample size^60,61.

The software GONE¹⁴ was used to obtain an additional estimate of N_e based on the linkage disequilibrium method. The VCF file was converted to .ped and .map format using the PLINK software. The parameters for the analysis were set as follows: number of generations 2,000, number of bins 400, no MAF pruning was applied and the maximum value of recombination rate (c) was set to 0.05. The number of internal replicates was set to 40 for the programme to provide the geometric mean of the consensus estimate of the historical N_e out of these replicates.

Correlations between pups weaned per year (a measure of lifetime reproductive success) or diving performance and F_ROH were done using linear regression. A multiple regression was run using the software at https://stats.blue/Stats_Suite/multiple_linear_ regression_calculator.html. F_ROH was calculated as the proportion of the genome represented by ROH at the given filter length. LOF loci were identified using SnpEff and the Mirounga angustirostris annotation GFF file. There were no good alternatives to using the DNA Zoo NES reference, but it is a high-quality and annotated genome, and the results showed credible patterns. Analyses were split between using only those loci that generated a new stop or caused loss of a start codon and all putative LOF loci discovered (including those of minor effect). For graphing, we coded the homozygote of the LOF allele as ‘1’, the heterozygote as ‘0.5’ and unaffected genotypes as ‘0’, and totalled across all identified loci. SnpEff was also used to identify variation at specific loci of interest, such as those associated with reproduction and hypoxia. We used the following pipeline: java -jar snpEff.jar build -gff3 -v NES.reference based on the annotation file GFF. Following that, we used the command ‘ann’ to annotate the variant file. We used the snpSift toolbox as implemented in snpEff to filter the annotated VCF file on the basis of the effect of the genetic variants, for example, LOF or non-synonymous substitutions (missenses). Although we focused on snpEff, which provides specificity on the nature of the change, we also used the Protein Variation Effect Analyzer (PROVEAN; https://github.com/MonashBioinformaticsPlatform/provean) as an alternative method to test the results. PROVEAN calculates the alignment score of a protein variant by comparing a query sequence with the existing database. To run PROVEAN, we configured it with BLAST v.2.2.31+, CD-HIT v.3.1.2 and the NCBI nr database containing 15,006,980 sequences from 15 August 2011. The tool was run with the default parameters, whereby missense variants with PROVEAN scores greater than or equal to −2.5 were classified as neutral, while those with scores less than −2.5 were considered deleterious on the basis of the default threshold.

To assess kinship, we used the relatedness2 function⁵⁴ implemented in VCFtools. We included 77 pups born in the following year to females, most of whom had been in the focal harems, and determined the number of paternities achieved by each of the 31 males in the sample (including the 4 focal alpha males). Manhattan plot comparisons of high- and low-diving-performance females were generated by writing a trait file and generating the plot in R and TASSEL⁶². There were no clear outliers (data not shown). Data relevant to the LRS and diving analyses are provided in Supplementary Tables 7 and 8.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Sequences are deposited at the National Center for Biotechnology Information (NCBI) under Bioproject PRJNA1060307. Whole-genome data were uploaded as raw reads (Sequence Read Archive), including all historical samples and six high-coverage modern samples (Biosamples SAMN39224291–SAMN39224298; SAMN39305437, SAMN39305439, SAMN39305440, SAMN39305442, SAMN39305447 and SAMN39305448). Variant information for all modern sequences is included as a VCF file (European Variation Archive; PRJNA1060307). All raw nuclear genome data for the modern samples have also been deposited to the China National GeneBank Sequence Archive (CNSA) of the China National GeneBank DataBase (CNGBdb) with accession number CNP0005170. Source data are provided with this paper.

References

Collen, B. et al. Predicting how populations decline to extinction. Phil. Trans. R. Soc. B. 366, 2577–2586 (2011).
Article PubMed PubMed Central Google Scholar
Speilman, D., Brook, B. W. & Frankham, R. Most species are not driven to extinction before genetic factors impact them. Proc. Natl Acad. Sci. USA 101, 15261–15264 (2004).
Article Google Scholar
Hoelzel, A. R. et al. Elephant seal genetic variation and the use of simulation models to investigate historical population bottlenecks. J. Hered. 84, 443–449 (1993).
Article CAS PubMed Google Scholar
Abadia-Cardoso, A., Freimer, N. B., Deiner, K. & Garza, J. C. Molecular population genetics of the northern elephant seal Mirounga angustirostris. J. Hered. 108, 618–627 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lowry, M. S. et al. Abundance, distribution and population growth of the northern elephant seal (Mirounga angustirostris) in the United States from 1991 to 2010. Aquat. Mamm. 40, 20–31 (2014).
Article Google Scholar
Hoelzel, A. R. Impact of a population bottleneck on genetic variation and the importance of life history; a case study of the northern elephant seal. Biol. J. Linn. Soc. 68, 23–39 (1999).
Article Google Scholar
Bonnell, M. L. & Selander, R. K. Elephant seals: genetic variation and near extinction. Science 184, 908–909 (1974).
Article Google Scholar
Le Boeuf, B. J. Male–male competition and reproductive success in elephant seals. Am. Zool. 14, 163–176 (1974).
Article Google Scholar
Hoelzel, A. R., Fleischer, R. C., Campagna, C. & Le Boeuf, B. J. Direct evidence for the impact of a population bottleneck on symmetry and genetic diversity in the northern elephant seal. J. Evol. Biol. 15, 567–575 (2002).
Article Google Scholar
Weber, D. S., Stewart, B. S., Garza, J. C. & Lehman, N. An empirical genetic assessment of the severity of the northern elephant seal population bottleneck. Curr. Biol. 10, 1287–1290 (2000).
Article CAS PubMed Google Scholar
Keller, L. F. & Waller, D. M. Inbreeding effects in wild populations. Trends Ecol. Evol. 17, 230–241 (2002).
Article Google Scholar
Frankham, R. Effective population size/adult population size ratios in wildlife: a review. Genet. Res. 66, 95–107 (1995).
Article Google Scholar
Barbato, M., Orozco-ter Wengel, P., Tapio, M. & Bruford, M. W. SNeP: a tool to estimate trends in recent effective population size trajectories using genome-wide SNP data. Front. Genet. 6, 109 (2015).
Article PubMed PubMed Central Google Scholar
Santiago, E. et al. Recent demographic history inferred by high-resolution analysis of linkage disequilibrium. Mol. Biol. Evol. 37, 3642–3653 (2020).
Article CAS PubMed Google Scholar
Szulkin, M., Bierne, N. & David, P. Heterozygosity–fitness correlations: a time for reappraisal. Evolution 64, 1202–1217 (2010).
PubMed Google Scholar
Ralls, K., Ballou, J. D. & Templeton, A. Estimates of lethal equivalents and the cost of inbreeding in mammals. Conserv. Biol. 2, 185–193 (1988).
Article Google Scholar
Su, T.-Q. et al. MARF1 regulates essential oogenic processes in mice. Science 335, 1496–1499 (2012).
Article CAS PubMed PubMed Central Google Scholar
Bilmez, Y., Talibova, G. & Ozturk, S. Dynamic changes of histone methylation in mammalian oocytes and early embryos. Histochem. Cell Biol. 157, 7–25 (2021).
Article PubMed Google Scholar
Anastasaki, C., Longman, D., Capper, A., Patton, E. E. & Ca´ceres, J. F. Dhx34 and Nbas function in the NMD pathway and are required for embryonic development in zebrafish. Nucleic Acids Res. 39, 3686–3694 (2011).
Article CAS PubMed PubMed Central Google Scholar
Hansson, B. & Westerberg, L. On the correlation between heterozygosity and fitness in natural populations. Mol. Ecol. 11, 2467–2474 (2008).
Article Google Scholar
Hoelzel, A. R., Le Boeuf, B. J., Reiter, J. & Campagna, C. Alpha male paternity in elephant seals. Behav. Ecol. Sociobiol. 46, 298–306 (1999).
Article Google Scholar
Liu, Y. et al. LRGUK-1 is required for basal body and manchette function during spermatogenesis and male fertility. PLoS Genet. 11, e1005090 (2015).
Article PubMed PubMed Central Google Scholar
Leslie, J. S. et al. MNS1 variant associated with situs inversus and male infertility. Eur. J. Hum. Genet. 28, 50–55 (2020).
Article CAS PubMed Google Scholar
Feng, M. et al. Tubulin TUBB4B is involved in spermatogonia proliferation and cell cycle processes. Genes 13, 1082 (2022).
Article CAS PubMed PubMed Central Google Scholar
Feng, S. et al. hnRNPH1 recruits PTBP2 and SRSF3 to modulate alternative splicing in germ cells. Nat. Commun. 13, 3588 (2022).
Article CAS PubMed PubMed Central Google Scholar
Salvolini, E. et al. Involvement of sperm plasma membrane and cytoskeletal proteins in human male infertility. Fertil. Steril. 99, 697–704 (2013).
Article CAS PubMed Google Scholar
Le Boeuf, B. J. Elephant Seals: Pushing the Limits on Land and at Sea (Cambridge Univ. Press, 2021).
Ebersole, J. L. et al. Hypoxia-inducible transcription factors, HIF1A and HIF2A, increase in aging mucosal tissues. Immunology 154, 452–464 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ramachandran, S. et al. Hypoxia-induced SETX links replication stress with the unfolded protein response. Nat. Commun. 12, 3686 (2021).
Article CAS PubMed PubMed Central Google Scholar
De Miranda, M. A., Schlater, A. E., Green, T. L. & Kanatous, S. B. In the face of hypoxia: myoglobin increases in response to hypoxic conditions and lipid supplementation in cultured Weddell seal skeletal muscle cells. J. Exp. Biol. 215, 806–813 (2012).
Article PubMed Google Scholar
Rao, S. et al. Biological function of HYOU1 in tumors and other diseases. OncoTargets Ther. 14, 1727–1735 (2021).
Article Google Scholar
Luikart, G., Allendorf, F. W., Cornuet, J.-M. & Sherwin, W. B. Distortion of allele frequency distributions provides a test for recent population bottlenecks. J. Hered. 89, 238–247 (1998).
Article CAS PubMed Google Scholar
Muller, H. J. The relation of recombination to mutational advance. Mutat. Res. 1, 2–9 (1964).
Article Google Scholar
Lynch, M., Conery, J. & Burger, R. Mutational meltdowns in sexual populations. Evolution 46, 1067–1080 (1995).
Article Google Scholar
Liu, L. et al. Genetic consequences of long-term small effective population size in the critically endangered pygmy hog. Evol. Appl. 14, 710–720 (2020).
Article PubMed PubMed Central Google Scholar
Hoffman, J. I. High-throughput sequencing reveals inbreeding depression in a natural population. Proc. Natl Acad. Sci. USA 111, 3775–3780 (2014).
Article CAS PubMed PubMed Central Google Scholar
Maron, M. et al. Climate-induced resource bottlenecks exacerbate species vulnerability: a review. Divers. Distrib. 21, 731–743 (2015).
Article Google Scholar
Le Boeuf, B. J. Sexual behavior in the northern elephant seal, Mirounga angustirostris. Behaviour 41, 1–26 (1972).
Article Google Scholar
Hoelzel, A. R. & Dover, G. A. Molecular techniques for examining genetic variation and stock identity in cetacean species. IWC Spec. Issue 11, 81–120 (1989).
Google Scholar
Le Boeuf, B. J., Condit, R. & Reiter, J. Lifetime reproductive success of northern elephant seals (Mirounga angustirostris). Can. J. Zool. 97, 1203–1217 (2019).
Article Google Scholar
Boedlert, G. W. et al. Autonomous pinniped environmental samplers: using instrumented animals as oceanographic data collectors. J. Atmos. Ocean. Technol. 18, 1882–1893 (2001).
Article Google Scholar
Le Boeuf, B. J. et al. Foraging ecology of northern elephant seals. Ecol. Monogr. 70, 353–382 (2000).
Article Google Scholar
Robinson, P. W. et al. Foraging behavior and success of a mesopelagic predator in the northeast Pacific Ocean: insights from a data-rich species, the northern elephant seal. PLoS ONE 7, e36728 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Article CAS PubMed PubMed Central Google Scholar
Dudchenko, O. et al. The Juicebox Assembly Tools module facilitates de novo assembly of mammalian genomes with chromosome-length scaffolds for under $1000. Preprint at bioRxiv https://doi.org/10.1101/254797 (2018).
Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. GigaScience 7, 1–6 (2018).
Article PubMed PubMed Central Google Scholar
Li, H. & Durban, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Freed, D., Aldana, R., Weber, J. A. & Edwards, J. S. The Sentieon Genomics Tools – a fast and accurate solution to variant calling from next-generation sequencing data. Preprint at bioRxiv https://doi.org/10.1101/115717 (2017).
McKenna, A. et al. The genome analysis toolkit: MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS PubMed PubMed Central Google Scholar
McDonough, M. M., Parker, L. D., McInerney, N. R., Campagna, M. G. & Maldonando, J. E. Performance of commonly requested destructive museum samples for mammalian genomics studies. J. Mammal. 99, 789–802 (2018).
Article Google Scholar
Langmean, B. & Saltzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
Article Google Scholar
Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data (Babraham Bioinformatics, 2010).
Bushnell, B., Rood, J. & Singer, E. BBMerge – accurate paired shotgun read merging via overlap. PLoS ONE 12, e0185056 (2017).
Article PubMed PubMed Central Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article CAS PubMed PubMed Central Google Scholar
Danacek, P. et al. Twelve years of SAMtools and BCFtools. GigaScience 10, 1–4 (2021).
Google Scholar
Cingolani, P. et al. Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Front. Genet. 3, 35 (2012).
Article PubMed PubMed Central Google Scholar
Dierckxsens, N., Mardulyn, P. & Smits, G. NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res. 45, e18 (2017).
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. A. J. Hum. Genet. 81, 559–575 (2007).
Article CAS Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS PubMed PubMed Central Google Scholar
Sved, J. A. & Feldman, M. W. Correlation and probability methods for one and two loci. Theor. Popul. Biol. 4, 129–132 (1973).
Article CAS PubMed Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
Article CAS PubMed PubMed Central Google Scholar
Bradbury, P. J. et al. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23, 2633–2635 (2007).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank S. Edwards for hosting A.R.H. on sabbatical at OEB, Harvard University; R. Fleischer for hosting A.R.H. at the National Zoo, Smithsonian Institution; P. Byerly for great help at the National Zoo lab; A. Gusick and S. Robson at the Natural History Museum, Los Angeles County, M. McGowen at the Smithsonian Institution, and M. Omura and H. Hoekstra at the Museum of Comparative Zoology, Harvard University for all their help in obtaining historical samples; P. Unit for help from the San Diego Natural History Museum; S. Gaughran for helpful comments on the manuscript; C. Hartmann and C. Daly at the Baur Core, Harvard University for helpful bioinformatic advice; Z. Yang, Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences for help in delivering and curating data; R. Condit for help and expertise with the maternity data; and the DNA Zoo team for establishing the Mirounga angustirostris reference genome. Approval for elephant seal handling and instrumentation for performance metrics was provided by the University of California Santa Cruz Institutional Animal Care and Use Committee and under National Marine Fisheries Service permits #786-1463, 87-143, 14636, 14535 and 19108. This work made use of the Hamilton HPC Service of Durham University. We acknowledge funding from the National Natural Science Foundation of China (Grant numbers 42225604 and 41422604) to S.L.; ‘One Belt and One Road’ Science and Technology Cooperation Special Program of the International Partnership Program of the Chinese Academy of Sciences (183446KYSB20200016) to S.L.; and the Specially-Appointed Professor Program of Jiangsu Province to I.S. Field work was supported by the National Science Foundation to BJL from 1970 to 2000, the National Geographic Society, and by the Office of Naval Research (N00014–18-1-2822 and N000014–13-1-0134 to D.P.C. and D. Crocker); the Strategic Environmental Research Development Program (RC20-2–1284 to D.P.C. and D. Crocker); and the Tagging of Pacific Predators Program including support from the Gordon and Betty Moore Foundation, the David and Lucile Packard Foundation, Alfred P Sloan Foundation and the Animal Telemetry Network.

Author information

These authors contributed equally: Georgios A. Gkafas, Hui Kang, Fatih Sarigol.

Authors and Affiliations

Biosciences, Durham University, Durham, UK
A. Rus Hoelzel & Fatih Sarigol
Department of Ichthyology and Aquatic Environment, University of Thessaly, Volos, Greece
Georgios A. Gkafas
Marine Mammal and Marine Bioacoustics Laboratory, Institute of Deep-sea Science and Engineering, Chinese Academy of Sciences, Sanya, China
Hui Kang, Inge Seim & Songhai Li
Innovation Research Center for Aquatic Mammals, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan, China
Hui Kang & Songhai Li
Ecology and Evolutionary Biology, University of California, Santa Cruz, CA, USA
Burney Le Boeuf, Daniel P. Costa, Roxanne S. Beltran, Joanne Reiter & Patrick W. Robinson
Center for Conservation Genomics, National Zoo and Conservation Biology Institute, Smithsonian Institution, Washington, DC, USA
Nancy McInerney
Integrative Biology Laboratory, College of Life Sciences, Nanjing Normal University, Nanjing, China
Inge Seim
BGI Research, Qingdao, China
Shuai Sun & Guangyi Fan

Authors

A. Rus Hoelzel
View author publications
You can also search for this author in PubMed Google Scholar
Georgios A. Gkafas
View author publications
You can also search for this author in PubMed Google Scholar
Hui Kang
View author publications
You can also search for this author in PubMed Google Scholar
Fatih Sarigol
View author publications
You can also search for this author in PubMed Google Scholar
Burney Le Boeuf
View author publications
You can also search for this author in PubMed Google Scholar
Daniel P. Costa
View author publications
You can also search for this author in PubMed Google Scholar
Roxanne S. Beltran
View author publications
You can also search for this author in PubMed Google Scholar
Joanne Reiter
View author publications
You can also search for this author in PubMed Google Scholar
Patrick W. Robinson
View author publications
You can also search for this author in PubMed Google Scholar
Nancy McInerney
View author publications
You can also search for this author in PubMed Google Scholar
Inge Seim
View author publications
You can also search for this author in PubMed Google Scholar
Shuai Sun
View author publications
You can also search for this author in PubMed Google Scholar
Guangyi Fan
View author publications
You can also search for this author in PubMed Google Scholar
Songhai Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.R.H. conceptualized the project, conducted field work, lab work, sample sequencing and analysis, and wrote the original draft. S.L. and G.F. conceptualized the project, acquired funding and revised the manuscript. S.S. and H.K. performed sample sequencing and analysis. I.S. revised the manuscript. G.A.G. and F.S. conducted analysis and revised the manuscript. B.L.B., D.P.C. and J.R. conceptualized the project, and performed field work, analysis and manuscript revision. R.S.B. and P.W.R. collected and analysed data. N.M. performed lab work and sample sequencing.

Corresponding authors

Correspondence to A. Rus Hoelzel, Shuai Sun, Guangyi Fan or Songhai Li.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1

Histogram showing the distribution of read depths for the 260 modern samples.

Source data

Extended Data Fig. 2 ROH number of segments compared to total bp covered.

a) fragments > 1MB, and b) fragments greater than 100KB.

Source data

Extended Data Fig. 3 Comparison of a historical (Hist8) and a modern (sample 1234) nuclear genomes.

a) The number of segments of ROH of different sizes greater than 100Kb. b) The number of heterozygous sites in 50,000 bp sliding windows. For each illustration, the box represented the inter-quartile range (50% of the data), the horizotal line through the box is the median, the whiskers include 75% of the data, and the remaining dots are outside that range.

Source data

Extended Data Fig. 4

Historical demographic profile estimated using the program GONE.

Source data

Extended Data Fig. 5 Fitness correlations with total LOF and all weaned.

a) Correlation between all LOF loci reported from snpEff (weak and strong effect) against weaner success per year. b) Correlation between FROH and the total pups weaned over a lifetime.

Source data

Extended Data Fig. 6 Correlations between LOF and FROH.

Linear regression correlations between FROH ( > 1 MB) and a) loss of function based on the analysis in snpEff, and b) missense mutations estimated in PROVEAN.

Source data

Extended Data Fig. 7

Relative proportion of runs of homozygosity greater than 1 MB in different range categories (in brackets) for all adult individuals, showing level found for male M12 (arrow).

Source data

Extended Data Fig. 8

Relationship between the number of paternities achieved and the frequencuy of loss of function alleles at loci associated with male health in non-alpha males.

Source data

Extended Data Fig. 9

Relationship between the proportion of runs of homozygosity greater than 1 MB and the diving performance metric (no significant correlation).

Source data

Extended Data Table 1 Paternal success of all males

Full size table

Supplementary information

Reporting Summary

Supplementary Tables 1–8

Supplementary Tables 1–8.

Source data

Source Data Figs. 1–3 and Extended Data Figs. 1–9

Source data for all figures in one Excel file.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hoelzel, A.R., Gkafas, G.A., Kang, H. et al. Genomics of post-bottleneck recovery in the northern elephant seal. Nat Ecol Evol 8, 686–694 (2024). https://doi.org/10.1038/s41559-024-02337-4

Download citation

Received: 29 June 2023
Accepted: 19 January 2024
Published: 21 February 2024
Issue Date: April 2024
DOI: https://doi.org/10.1038/s41559-024-02337-4

Subjects

Abstract

Similar content being viewed by others

Main

Results and Discussion

Diversity

Demographic history

Reproductive fitness

Diving performance

Conclusions

Methods

Field observation and sampling

Genome sequencing and SNP detection

Genome analysis

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links