Abstract
Ungulate species have experienced severe declines over the past centuries through overharvesting and habitat loss. Even if many game species have recovered thanks to strict hunting regulation, the genome-wide impacts of overharvesting are still unclear. Here, we examine the temporal and geographical differences in genome-wide diversity in moose (Alces alces) over its whole range in Sweden by sequencing 87 modern and historical genomes. We found limited impact of the 1900s near-extinction event but local variation in inbreeding and load in modern populations, as well as suggestion of a risk of future reduction in genetic diversity and gene flow. Furthermore, we found candidate genes for local adaptation, and rapid temporal allele frequency shifts involving coding genes since the 1980s, possibly due to selective harvesting. Our results highlight that genomic changes potentially impacting fitness can occur over short time scales and underline the need to track both deleterious and selectively advantageous genomic variation.
Similar content being viewed by others
Introduction
Wild species are under increasing pressure from anthropogenic activities with up to 16–33% of all vertebrate species being threatened with extinction1,2. Over the past few centuries, many ungulate species have experienced severe population declines through habitat modification and overharvesting and some of them were nearly extirpated from their native range3. Management programmes combining habitat restoration, sustainable harvesting and reintroductions aided by the development of wildlife management institutions have allowed several game species to recover4, even in regions with high human-wildlife interactions such as Europe5,6. Most management plans recognise the role of ungulates in maintaining ecosystem functions as well as their impact on their ecosystem7,8. Consequently, ungulate management plans include consideration of population dynamics, migration, multi-ungulate competition, interactions with carnivores and humans, genetics, cascading effects on vegetation, and disease management6,7. In practice, these plans often incorporate seasonal hunting limits, quotas and selective harvesting (e.g., trophy hunting)6.
However, while the use of genetic tools in conservation programmes has increased rapidly over the past decades9, their integration in ungulate management7 and in conservation as a whole is still considered weak10,11. This is rather surprising because it is well recognised that genetic processes play a crucial role for the long-term survival of species12,13,14. Furthermore, declining populations are exposed to a number of genetic threats, referred to as genomic erosion15,16. These threats include loss of adaptive potential limiting the ability to adapt to long-term changes in the environment13, but also increase in genetic load (i.e., frequency of harmful mutations16,17,18) through drift and the reduced effect of purifying selection, maladaptation (i.e., mismatch between adaptations and environment), and genetic introgression after hybridisation16,19. Estimation of parameters associated with genomic erosion is thus increasingly considered as a major component of species conservation programmes15,20.
Conservation and management of wild game species have greatly benefited from decreasing costs in sequencing21 and from a growing number of genomic resources being generated for rare and threatened species22. These resources allow to reconstruct species’ demographic fluctuations and to estimate temporal and geographical changes in selection regimes, genome-wide diversity, and the amount of genetic load15,20. Several initiatives have advocated for a better use of genetics in conservation23. For instance, the new Global Biodiversity Framework of the Convention on Biological Diversity (CBD; www.cbd.int) includes goals to maintain genetic diversity of populations of all species to secure their adaptive potential. To secure this goal, the use of DNA-based techniques and the integration of quantitative goals to safeguard the retention of genetic variation in species management programmes are needed23,24,25,26. The new CBD framework also includes a monitoring framework with a Headline indicator for genetic diversity focusing on effective population size (Ne) and Essential Biodiversity Variables (EBVs) that include genome-wide diversity, inbreeding and gene flow, which are becoming increasingly important to monitor trends of genetic diversity 26,27. Estimating those indicators and other genomic parameters in intensively hunted game species would thus allow to assess the genomic impacts associated with natural disturbances or intensive hunting and greatly improve their long-term monitoring and management22,27.
A growing number of empirical studies have investigated the genomic consequences of near-extinction in the wild and have illustrated the complex dynamics of genetic load in small populations (e.g.,28,29,30). For instance, the magnitude and speed of a decline and recovery, as well as life-history traits, will have a strong impact on genetic load and of its components31,32. Theoretical predictions and empirical data indicate that purifying selection is most efficient at purging highly deleterious mutations31 while mildly deleterious mutations, which can have a non-negligible impact on fitness16,33, tend to accumulate. To date, only a few studies have examined the post-bottleneck genomic variation in game species. For instance, the Alpine ibex (Capra ibex), which had declined to ~100 individuals by the early 1900s and now numbers more than 50,000 individuals spread over most of its former range34,35, has lost ~80% of its pre-bottleneck mitochondrial genetic variation36. Moreover, a comparison of whole nuclear genomes of Alpine ibex and other caprinae indicates that after this historical bottleneck, the strong founder effects associated with reintroductions facilitated the purging of its highly deleterious genetic variation whereas mildly deleterious mutations accumulated37.
Reduction in adaptive potential or changes in allele frequency for genes relevant to individual fitness will also impact the likelihood of long-term survival of species9,38. In a rapidly warming world, identifying adaptive variation and tracking its changes through time is essential for species conservation26. For instance, evidence for local adaptation to distinct environments requires the designation of appropriate management units to maximise the long-term maintenance of evolutionary potential of species26,27. This type of information is crucial in the context of translocations, genetic rescue, or assisted gene flow39. Moreover, tracking changes in adaptive variation could inform us on the ability of species to adapt to changing environmental conditions40,41 or on the unintended effects of management or hunting practices21. Genome-scans of selection are thus often used to identify outlier genomic regions that are putatively under positive selection due to local adaptation42.
Moose (Alces alces) populations were severely reduced in Fennoscandia from the 15th century onwards43. Intensification of hunting during the 18th and 19th centuries led to near-extinction of moose in Fennoscandia44. Moose were rare outside South-central Scandinavia at the time and harvesting data from the 1890s onwards suggest that the population numbered in the low thousands45,46. Since the 1930s, increased commercial forestry has benefitted the moose43 and hunting legislation allowed the species to recover and its harvest is now intensively regulated47. The Swedish population currently numbers ~350,000 individuals and ~1/3 of its population is harvested annually48. In spite of this near-extinction event, the Swedish moose population has relatively low inbreeding levels relative to North American populations, which could be explained by rapid recolonisation of the range from isolated and less impacted populations in Norway or Finland after the near-extinction event49. However, Scandinavian moose also has one of the lowest genetic diversity in Europe and shows a high degree of isolation and genetic differentiation relative to other Fennoscandian populations50,51,52. Furthermore, such a severe decline to near -extinction suggests that the pre-decline diversity could have been much higher, and that the moose population may be exposed to genome erosion. Furthermore, there is evidence for geographical variation in body size and antler morphology at the extremes of moose range possibly associated with local adaptation to varying habitat types53,54. The Swedish moose population is thus an ideal model species to develop and test genomic tools for game management and monitoring55.
Here, we use temporally-spaced genomes collected over the past ~200 years across the whole range of moose in Sweden and examine the geographical and temporal differences in genome-wide diversity resulting from the recent human-driven near-extinction. We also estimate recently adopted genomic indicators relevant to the long-term monitoring of genetic diversity in wild populations24 and examine geographical and temporal patterns of genome differentiation. Our data shows limited impact of the recent bottleneck on the modern moose population. However, we found evidence for regional differences in genetic load while genomic indicators raise some concerns for future reduction in gene flow and genetic diversity. Finally, we found candidate regions for positive selection due to local adaptation when comparing the extremes of the range and as well as temporal genetic differentiation in regions containing genes potentially relevant to individual fitness.
Results
De-novo assembly
We improved the previous assembly from Dussex et al.49. (GenBank accession: GCA_015832495.1) from a sample of the same female individual and by adding long-read Hi-C scaffolding. The new moose assembly has a scaffold N50 of 76.6 Mb with 99% of the assembly composed of 34 scaffolds, suggesting that our assembly is at or close to chromosome-level. A total of 8660 out of 8771 single-copy mammalian BUSCOs were found within these 34 scaffolds. A synteny analysis against red deer (Cervus elaphus) allowed us to identify 32 autosomes as well as the X and Y chromosomes (HiC_scaffold_21, HiC_scaffold_24; Supplementary Data 1). In contrast, the red deer assembly had 33 autosomes and two sex chromosomes, suggesting few chromosomal rearrangements in moose.
Population structure and past demography
We sequenced 87 genomes at ~17X depth of coverage (historical: 9X; 1980: 16X; 2019/2020: 22X) from across the whole range of moose in Sweden (Supplementary Data 2). We found evidence for isolation by distance in the form of a cline as shown in Wennerström et al.48. We obtained the highest support for K = 3 (Fig. 1b) distinguishing a North, and South cluster as well as an admixed Transition cluster. However, the Transition cluster became fully apparent for K = 4. Genomes from the island of Öland grouped with the South cluster. Out of seven historical specimens (i.e., 1839–1905), six grouped with a cluster corresponding to their sampling location while one, with unknown sampling location (ND056), grouped with the South cluster (Supplementary Data 2).
Past demographic reconstruction using SMC++ indicated a gradual population decline starting ~100,000 years Before Present (BP) from a Ne of ~10,000 to ~100 by 500 years BP (Supplementary Fig. 1). However, there was no indication of a bottleneck coinciding with the near-extinction in the 1700s. In contrast, GONE identified in all three clusters a decline starting ~400 years BP with Ne being the lowest between 300–140 years BP (Fig. 1c; Supplementary Data 3-4). The island population of Öland showed the lowest Ne (i.e., 40–6) during this period.
Heterozygosity, inbreeding and nucleotide diversity
For modern populations (i.e., samples collected in 1980 and 2019/2020), the data supported a significant increase in inbreeding (FROH) and reduction in heterozygosity (N het. sites/1000 bp) in a southward direction (ANOVA, n = 4, P = 1.09 × 10−10; Fig. 2; Supplementary Data 2), with the island population of Öland showing the highest inbreeding and lowest heterozygosity (Fig. 2a, b; Supplementary Data 2). There was evidence for significant temporal changes in inbreeding within the South cluster including for ROH > 2 Mb between 1800s and 1980s genomes (Tukey’s HSD test, n = 2, P = 4.3 × 10−06; Fig. 2a) and in heterozygosity within the North cluster (Tukey’s HSD test, n = 3, P = 1.5 × 10−04 – 1.8 × 10−04; Fig. 2b).
The majority of ROH were <2 Mb long (Fig. 2). Mean ROH length was similar among modern populations (mean length Mb: North = 3.5; Transition = 3.4; South = 3.6; Öland = 3.9; Supplementary Fig. 2), whereas maximal ROH value increased from the north to the south of the range (max. ROH Mb: North= 15.4; Transition = 14.8; South = 30.6; Öland = 23.9; Supplementary Fig. 2). Based on the distribution of ROH length, the majority of inbreeding events in modern genomes (i.e., < 2 Mb) date back to >168 years BP. In contrast, large ROH \(\ge\)23 Mb in the South cluster and on Öland indicate that inbreeding among individuals with shared ancestry could have occurred as recently as 11 years BP. For historical genomes, most inbreeding events also date back to >168 years before sampling. Thus for samples collected in 1839-1905, the inbreeding events occurred ca. 1670–1740 CE. Moreover, a small number of ROH \(\ge\)5–10 Mb correspond to inbreeding events occurring between 67 and 33 years before sampling (e.g., for ND044 collected in 1843, the inbreeding event occurred ca. 1780–1800 CE ; Supplementary Fig. 2).
Nucleotide diversity (π) was significantly different among modern populations, with the highest diversity found in the North and Transition clusters (t test; n = 4, P = 2.2 × 10−16 < 0.001; Supplementary Data 3). We also found significant temporal changes in π within all clusters (t test, n = 2-3, P = 2.2 × 10−16 – 4.37 × 10−12; Supplementary Data 3). Within the Northern cluster, π increased between the 1800s and 2019/2020 whereas π decreased within the Transition cluster between 1800s-2019/2020 and in the South cluster between 1800s–1980.
Genetic load
We quantified genetic load by identifying High and Moderate impacts variants in coding regions. There were significant geographical differences in the total counts of both High (ANOVA, n = 4, P = 1.28 × 10−5) and Moderate (ANOVA, n = 4, P < 2 × 10−16) impact variants among modern populations, with a general reduction in total load towards the south the range (Fig. 3; Supplementary Figs. 3, 4; Supplementary Data 2). While there was no change in genetic load over the past 40 years, we found significant increase in the North and South clusters for the total number of both High (Tukey’s HSD test, n = 2-3, P < 9.23 × 10−5 – 7.7 × 10−3; Fig. 3a) and Moderate (Tukey’s HSD test, n = 2-3, P < 1 × 10−7 – 2 × 10−7; Fig. 3a) impact variants since the 1800s.
When considering the frequency of variants in each population using Rxy, there was also evidence for a reduction in load towards the South of the range (Supplementary Fig. 4). Öland showed a reduction in the frequency of High impact but an excess in frequency of Moderate impact variants relative to the South cluster (Supplementary Fig. 4). Furthermore, modern genomes showed an excess in LoF and Moderate impact variation relative to historical ones, indicating a temporal increase in the frequency of deleterious alleles (Supplementary Fig. 4). There were 41 more High impact and 1,339 more Moderate impact variants in modern genomes compared to historical ones, corresponding to 1.7-fold and 1.8-fold increases in those variant categories, respectively (Supplementary Data 5).
We only found significant differences in realised load across the modern range of moose for Moderate impacts variants, with a highest load in the South cluster and on Öland (ANOVA, n = 4, P = 1.5 × 10−14; Fig. 3b). Moreover, realised load estimates indicated significant temporal reductions only in the South cluster for High impact variants (Tukey’s HSD test, n = 4, P = 0.046; Fig. 3a) and in the North cluster for Moderate impact variants (Tukey’s HSD test, n = 4, P = 1.8 × 10−2 – 2.8 × 10−2; Fig. 3b).
Among the genes including High and Moderate impact variants in modern genomes, we identified genes associated with male fertility, hair and skin morphology, bone and eye development, embryogenesis (Supplementary Data 5, 6).
Genomic indicators
To assess the status of moose according to CBD recommendations which aim at maximising the retention of genetic variation, we estimated three genetic indicators following Andersson et al.24. Diversity within the northern cluster was classified as Acceptable for both the ΔH and Ne indicators (Fig. 4; Supplementary Data 7). In the Transition cluster, indicator ΔH showed increased heterozygosity and decreased FROH deemed Acceptable, yet an Alarm reduction in nucleotide diversity (π). The Ne indicator was classified as Acceptable. The South cluster showed an Acceptable reduction in heterozygosity whereas changes in π and FROH and Ne indicators were classified as Warning (Fig. 4; Supplementary Data 7).
A statistically significant temporal increase in FST was found between the North and the Transition cluster and between the North and South cluster (Wilcoxon signed-ranked test, n = 2, P = 2 × 10−16; Fig. 4, Supplementary Data 8). When translating changes in FST to gene flow between clusters, both values exceeded threshold values and were classified as Alarm. This indicator was classified as Acceptable with regard to the maintenance of distinct populations over time, since no populations went extinct during the sampling period.
Genome scans of selection and gene ontology
Genome scans of selection based on Z(FST) values identified 206 candidate genes within outlier windows when comparing the North and South clusters (Fig. 5a; Supplementary Data 9, 10). We identified genes associated with skin pigmentation (e.g., KLHL5, HPS5, RPS21, TEP1; Supplementary Data 9, 10) including three keratin genes (i.e., KRT80, KRT82, KRT84) as well as 14 genes expressed in skin and horns of sheep and goat (e.g., KRT80, KLK15, SIGLEC1, CAPN12) including one gene (NGFR) highly expressed in roe and sika deer antlers56. We also found two additional genes from the KLK gene family (KLK1, KLK6) as well as other genes (HIVEP1, PHACTR1, SEMA6D) known to be associated with the cardiovascular system. Finally, we identified outlier genes associated with retinal development (SEMA6D) as well as nervous system (HEXA; Supplementary Data 9, 10).
When testing for association between outlier windows and climate using latitude as a proxy for temperature, 71 out of the 206 candidate genes had variants associated with latitude and included skin pigmentation and antler development genes (e.g., HPS5, FCER2, KLK15, CAPN12, KRT84; Supplementary Data 9, 10).
Temporal comparison between the 1980 and 2019/2020 periods for the North cluster identified 212 candidate genes within outlier windows (Fig. 5b; Supplementary Data 9, 10). Among those, seven genes were associated with antler development (Supplementary Data 9, 10). Among the top outliers windows, we found genes associated with male fertility (e.g., ELMO1, Clip1, Sgo2, GMNC, Dnah8), insulin or fat metabolism (e.g., M6PR, SLC27A5, IGF2BP2), cardiovascular function (e.g., ZNF398, HDAC9, Vps33a, RERE, SENP2) and body weight (e.g., TLL2) (Supplementary Data 9, 10).
Discussion
We analysed 87 modern and historical genomes over the whole range of moose in Sweden to examine the genome-wide impacts of near-extinction in the 18th century. We found some regional increase in inbreeding and in genetic load since the 1800s, although more samples would be needed to confirm this trend. Importantly, we found a slight but significant southward increase in inbreeding and a reduction in total genetic load but little change in these estimates over the past 40 years. In spite of the weak change in inbreeding and load, we estimated a significant annual reduction in nucleotide diversity over the same period, as well as indications of reduced gene flow highlighting the need for continuous monitoring. Also, effective population size estimate for the modern (1980s) South cluster is below the CBD threshold value of 500, which is cause for concern and calls for an assessment of the present Ne of this region. Finally, genome scans of selection identified regions putatively under positive selection due to local adaptation and temporal changes involving genes associated with antler development and other biological functions potentially impacting fitness. These results suggest that the moose population in Sweden is not at immediate risk of exposure to genomic erosion. However, future reduction in gene flow and increases in inbreeding could result in the expression of this genetic load16 and harvesting practices could impact functional variation and population fitness21.
The population structure described here is consistent with a previous study based on microsatellite data which identified a North-South genetic cline48. While this cline could have resulted from colonisation from both the North and South, sharing of rare European mitochondrial haplotypes distinct from the Finnish lineage in the North and South of its range50,57 supports a colonisation from the South after the Last Glacial Maximum. Moreover, our data indicates that the island population of Öland belongs to the South cluster and suggests ongoing gene flow. However, even though moose can swim the short distance (~10 km) between the mainland and Öland, non-invasive DNA data suggests that gene flow is probably limited58.
Demographic reconstructions supports gradual decline starting ca. 100,000 BP consistent with a previous demographic reconstruction using the PSMC49. This decline, which is congruent with a lack of moose fossil remains from Europe during the Last Glacial Maximum ~25,000 years BP59, could indicate shifts in the preferred moose habitat characterised by boreal forests and taiga during the cold periods49. However, the SMC++ did not detect any reduction in Ne associated with hunting intensification of the 18th and 19th centuries. In contrast, the linkage disequilibrium approach in GONE, which outperforms other methods for recent time frames60, showed strong support for a demographic decline at that time, as previously shown by Wennerström et al.48. We note that the signature of decline on Öland could also correspond to a founder effect.
The overall decline observed here is consistent with a reduction in Major Histocompatibility Complex (MHC) diversity in modern moose populations61,62 but is in stark contrast with previous studies on Norwegian and Finnish moose that did not find genetic evidence for reduction in Ne63,64 associated with intensive hunting. This is most likely due to local variation in hunting intensity or alternatively to the lower resolution of markers used in these studies (i.e., microsatellite loci). Interestingly, Öland as well as the Transition and the North clusters show a recent (i.e., ~150 years BP) increase in Ne, coinciding with the reduction in hunting pressure and the implementation of new hunting regulation in the 1930s47.
Estimates of inbreeding were relatively low and ROH distributions were in majority of a length <10 Mb, indicating only a moderate impact of the recent decline. However, the southward increase in FROH in modern moose and maximum ROH length in the South cluster is consistent with a more intense bottleneck in that part of the range resulting from either a founder effect after recolonisation of this area at the beginning of the 20th century or a strong hunting pressure over the past decades65. In contrast, while the Öland population is small (i.e., 95% CI: 115–156) and the most inbred population in Sweden, the management strategy and reduced number of individuals harvested ranging from 0 to 43 individuals per year, suggests that the main causes of the high inbreeding are the founder effect and reduced gene flow58.
We found little evidence for temporal changes in FROH or heterozygosity since the 1800s, except in the South and North clusters. We caution that more genomes would be required to confirm the trend observed between the 1800s and 1980s. Yet, inbreeding estimates based on single genomes can be highly informative in inferring the timing and magnitude of past bottlenecks (e.g., Woolly rhinoceros, Coelodonta antiquitatis66). Since recombination breaks ROH into smaller fragments, large fragments reflect recent inbreeding events, whereas smaller ones reflect older inbreeding events67. Here, the distribution of ROH indicates that most inbreeding events occurred >168 years prior sampling of historical and modern genomes, and thus date back to ca. 1670–1850, a period coinciding with intensive hunting. However, inbreeding among closely-related individuals may have occurred as recently as 10–20 years ago in the South cluster and on Öland. Consequently the low proportion of large ROH suggests that overharvesting did not impact inbreeding levels substantially or that the population recovered rapidly after the introduction of the new hunting regulation. Alternatively, gene flow and immigration from populations less impacted by hunting in eastern Europe or other parts of Scandinavia may have compensated for the strong bottleneck effect associated with overhunting49. While uncertainty around the recombination rate for moose can bias inference of the timing of inbreeding events, our estimate is consistent with the period of intense hunting of the 18th to 19th centuries and also reveals that contemporary moose populations may be potentially exposed to the negative effects of high inbreeding.
Additional genomes are also required to further examine the temporal dynamics of load in Swedish moose. Nevertheless, two competing hypotheses can explain the pattern observed here. In contrast with heterozygosity and inbreeding, modern genomes had a higher load than historical ones, suggesting that drift may have facilitated the increase in frequency of deleterious alleles during the near-extinction event. Alternatively, if an increase really occurred, drift alone may not have been sufficient to increase genetic load. We thus cannot exclude the possibility that new deleterious variation was introduced via gene flow from genetically distinct subpopulations during the recovery stage of the population.
When considering modern populations only, we observed an overall reduction in High and Moderate impact variants towards the South of the range. Higher inbreeding may have facilitated the exposure of deleterious variation in homozygous state (i.e., higher realised load) and their removal through purifying selection32,33. The moose population on Öland, which showed the highest inbreeding and a long-term small Ne over the past 2,000 years, did not have a significantly different total genetic load compared to the South cluster. However, we found a reduction in the frequency of High impact variants and an accumulation of Moderate impact ones. This result is consistent with theory and recent genomic studies (e.g.,28,37,68) showing that inbreeding facilitates the removal of highly deleterious variation through purifying selection, whereas mildly deleterious variation, with lower impact on fitness, tends to accumulate16,32. Temporal data would be required to formally examine the dynamics of purging on Öland.
Recent data supports a lower fitness on Öland, with calf survival rate being lower (~0.22) compared to other populations (~0.7)69. It has been hypothesised that poor body condition in females associated with suboptimal foraging habitat as well as a temporal mismatch between resource availability and parturition could explain this difference. However, our data suggest that higher exposure of deleterious alleles (i.e., higher inbreeding and realised load) and an excess in Moderate impact variants relative to the South cluster may also contribute to lower female conditions and lower calf survival. Furthermore, while Öland and the mainland may be connected by occasional gene flow58, which could lead to a genetic rescue effect, there is also a risk of the introduction of new deleterious variation from less inbred and more genetically diverse individuals from the mainland70. For instance, the migration of a single male into the highly inbred Isle Royal wolf (Canis lupus) population induced a fitness reduction associated with the introduction of new deleterious alleles71.
It is worth noting that various metrics can be used as proxies for genetic load and that these need to be interpreted carefully16. While the realised load affects individual fitness in the current generation, recessive deleterious mutations in heterozygote (i.e., masked load) state do not16. However, deleterious variants in heterozygous state are likely to be expressed in future generations if inbreeding increases, whereas outcrossing could mask homozygous variants into heterozygous state. The interpretation of these metrics is further complicated by the lack of information on selection (s) and dominance (h) coefficients, epistasis as well as variation in non-coding (e.g., regulating) regions31.
Genomic indicators raise concerns for the genetic health of Swedish moose. While Ne in the North and Transition clusters remain above acceptable levels, the South cluster exhibits NeLD < 500, interpreted as a Warning due to the risk of losing adaptive and evolutionary potential. Additionally, inbreeding has increased and nucleotide diversity decreased over the past 150 years. However, additional historical and modern samples in this part of the range are required to confirm the observed temporal diversity patterns. Furthermore, population divergence has increased over time between the North and South clusters and between the North cluster and the Transition cluster, in both cases to such a degree that an Alarm is warranted for the FST-indicator. This suggests that migration has decreased between genetic clusters, possibly due to higher hunting pressure and conflicts with humans (e.g., increased urbanisation and traffic-related deaths), presenting a risk of genetic erosion if populations become more isolated, especially in the South cluster.
Assessing the functional variation in wild populations is highly relevant to species monitoring9,38. Our genome scan analyses identified a number of outlier windows containing 206 genes under putative selection due to local adaptation, with 71 of those showing a significant association with latitude. The 206 candidate genes included functions associated with skin pigmentation, keratin synthesis, cardiovascular and nervous system. We also found 14 genes previously that are expressed in skin and horn of ungulates, including roe and sika deer antlers56. There is evidence for regional and habitat differences in moose antlers in Eurasia72 and North America73. For instance, in North America, moose from open tundra habitat have larger antlers than those from boreal forest (i.e., taiga)73. Yet, the relative genetic or diet type basis for such association is unclear73. Moreover, moose from mountain habitat in North America (A. a. shirasi) and Asia (A. a. cameloides) both have small body and antler size74. In Europe, moose from Sweden and Finland, distributed from open tundra habitat in the North to boreal forest in the South also show latitudinal variation in growth pattern and body size53 as well as an association between habitat and antler morphology, with the ‘palmated’ (i.e., webbed) type most prevalent in the North and the ‘cervina’ (i.e., deer-like) type most prevalent in the South54. Such association between habitat and morphology may thus indicate local adaptation and genes within the outlier windows identified here could thus be candidates for positive selection75.
Selective or trophy hunting may also induce allelic shifts involving genes associated with antler morphology as well as a reduction in antler size or even body size over time76,77,78. For instance, antler size in Alaskan moose has been reduced with increased harvest intensity79. Moreover, the Finnish moose population, which has been subjected to similar hunting pressures as the Swedish moose80,81, has experienced a rapid reduction in the ‘palmated’ type as a result of intensive hunting and/or fitness differences between antler types in managed forests between 1976 and 199954. However, selective hunting alone may not be the only cause for horn size reduction and a combination of environmental factors, inbreeding depression and selective hunting may induce changes in male ornamentation (e.g.,82). When comparing 1980 and 2019/2020 genomes for the North cluster, our genome scan analysis identified a number of outlier windows containing genes associated with antler development56 as well as other functions (e.g., fertility, insulin and fat metabolism, cardiovascular system, body weight). Thus, over a short period of time (i.e., ca. five generations, assuming a generation time of seven years49), hunting pressure may not only have induced allelic changes in the vicinity of genes associated with male ornamentation but may have also impacted other types of functional variation with potential important fitness effects. Yet, strong drift during the bottlenecks can also induce changes in the frequency of certain alleles and thus result in genome-wide differentiation which could be interpreted as selection83. For instance, it has been suggested that the high number of outlier regions among populations of three-spine stickleback (Gasterosteus aculeatus) from different environments may have arisen due to repeated bottlenecks and strong drift from small lake populations83,84.
In conclusion, our study suggests that while the near-extinction of moose in Sweden did not severely impact its genome-wide diversity, at least in terms of inbreeding, our modern data raises concerns that reduction in gene flow, sudden declines and hunting practices may negatively impact the functional variation of moose populations. At present, the indicators proposed by the CBD do not include estimates of adaptive or deleterious variation26,27. We thus advocate for the inclusion of metrics estimating both deleterious and selectively advantageous variation in genetic monitoring programmes. Doing so will also allow the incorporation of genetic metrics with more relevance to species survival within the IUCN Red List15,85 and Green Status of Species86.’ Our study thus illustrates the need for continued genomic monitoring using genomic indicators and to incorporate estimates of deleterious and adaptive variation for the effective management of wild species.
Methods
Ethical statement
The moose samples used for the assembly generation were obtained from frozen tissue banks collection from 1980 maintained by L.L. and N.R. at Stockholm University and from 2019/2020 maintained by G.E. and G.S. at Umeå University. No Ethics approval was required.
Sampling, DNA extraction and library preparation
We extracted DNA from a total of 87 genomes (57 from 1980, 23 from 2019/2020, and seven historical genomes from 1839–1905; Supplementary Data 2) representing the whole geographical distribution of moose in Sweden (Fig. 1a). Modern DNA was extracted from ~20 mg of muscle tissue using a DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) and historical DNA from antlers or jawbone following Yang et al.87.
Genomic library preparation from modern DNA extracts was performed using a PCR-free protocol at the Science for Life Laboratories (SciLifeLab), Stockholm. Genomic libraries for historical DNA were prepared as described in Dussex et al.88, following Meyer & Kircher89 and including USER treatment to remove deaminated bases90 (and thus sequencing error) and five independent PCRs were run to increase complexity. Libraries were sequenced on a NovaSeq S4 flowcell using a 2x150bp and 2x100bp setup for modern and historical libraries, respectively.
De-novo assembly
We improved the assembly from Dussex et al.49 (GenBank accession: GCA_015832495.1) by using Hi-C scaffolding. In brief, 36 mg of fresh frozen muscle tissue from the same female individual from central Sweden (Province of Gävleborg) was ground to a fine powder and used as input for Omni-C™ Proximity Ligation Assay, Mammalian Samples Protocol version 1.2 from Cantata Bio according to the specifications from the manufacturer. The resulting library was pooled and sequenced to approximately 25% lane capacity on an Illumina NovaSeq 6000 instrument using S4 Reagent Kit v1.5 (300 cycles). The scaffolding was performed following Dudchenko et al.91, which includes Juicer (v1.6) for read-mapping and filtering, 3D-DNA (v180922) for draft scaffolding and Juicebox (v1.11.08) for manual curation of Hi-C scaffolds. For assembly evaluation we used QUAST (v5.0.2)92 and BUSCO (v. 5.3.1)93 with the “mammalia_odb10” lineage dataset.
We identified repeats using repeatmodeller v1.0.11 and repeatmasker v4.0.7 [Smit, A.F.A. and Hubley, R. (2008–2015) RepeatModeler Open-1.0, http://www.repeatmasker.org/] and CpG sites using the GenErode bioinformatics pipeline94. Finally, we used Minimap295 to perform a synteny analysis against red deer (Cervus elaphus; Genbank ID: GCA_002197005.1) and identify sex-linked scaffolds. Since we also mapped the data to O. hemionus for the SNPeff and genome scan analyses, we used the same approach to identify the X chromosomes (HiC_scaffold_35) in O. hemionus.
Modern and historical data mapping
Mapping and variant calling of modern and historical genomes was done using the GenErode bioinformatics pipeline94. Briefly, adapter trimming was done with fastp v0.22.096 for modern data while we used Seqprep v1.1 (https://github.com/jstjohn/SeqPrep) to trim and merge forward and reverse reads for historical data. Reads for modern and historical data were then mapped using BWA v0.7.17 mem and aln algorithms, respectively, while sorting and removing PCR duplicates were done using SAMtools v1.12. Finally, reads were realigned around indels using GATK IndelRealigner v3.4.097.
Variant calling was done with the mpileup command of bcftools v1.898,99. We used a minimum depth of coverage (DP4) of ~1/3 (i.e., 5X) of the average depth of coverage, and filtered SNPs by base quality QV ≥ 30 and those within 5 bp of indels. Sites with SNPs in heterozygous state were filtered out if the allele frequency fell outside an allelic balance (i.e., number of reads displaying the reference allele/depth) of <0.2 and >0.8 in order to avoid biases caused by contamination, mapping or sequencing error. After merging all individual vcf files, we masked CpG sites using BEDtools v2.27.1100 to limit possible biases from DNA damage in historical genomes. We also masked repeat sites with BEDtools, and excluded scaffolds linked to X and Y chromosomes. Finally, we examined DNA damage patterns for the seven historical genomes using MapDamage101. The USER treatment removed the majority of typical post-mortem damage (Supplementary Fig. 5).
We obtained a total of 4,265,381 high-quality SNPs across our 87 genomes. For all downstream analyses involving population comparisons, we used PLINK v2102 to retain only SNPs called in all 87 individuals, making for a total of 617,661 SNPs.
For analyses that required the calling of variants relative to an ancestral allele (e.g., SnpEff, PBS), we mapped the data as described above against mule deer Odocoileus hemionus; Odocoileus_hemionus_HiC, https://www.dnazoo.org/assemblies/Odocoileus_hemionus). After filtering for missing data and excluding the sex chromosome, we obtained a total of 5,305,454 SNPs.
Population structure
We performed an ADMIXTURE v1.3.0103 analysis to estimate individual-based ancestry and identify genetic clusters (K = 1–5). This approach assumes that individuals are unrelated and uses a cross-validation procedure to determine the best number of possible genetic groups present in the dataset.
We estimated pairwise FST104 between the North and South clusters as well as between the North and Transition clusters for each time period (i.e., historical, 1980, 2019/2020) with VCFTools v0.1.16105 using non-overlapping windows of size 50 kb (see also Genomic indicators section).
Past demography
To estimate the recent past demography (i.e., <10,000 years BP), we used the SMC++ v.1.15.2106. Like PSMC107, this approach relies on Sequential Markov Coalescent (SMC) simulations from unphased genome data from multiple genomes instead of one single diploid genome, thereby increasing the number of recent coalescent events to estimate the effective population size (Ne). Using the same substitution rate and generation time as for the PSMC from Dussex et al.49 (i.e., 7 years; 7 × 10e−9), past demography was estimated using the ‘cross validation’ approach with --em-iterations 5000, and --thinning 1300 and --regularisation-penalty 6.
We also examined the past demography of moose over the past 100–200 generations using GONE60 which estimates changes in NeLD calculated as the geometric mean over 40 independent estimates from the observed spectrum of linkage disequilibrium (LD; https://github.com/esrud/GONE). We only retained the 32 largest autosomal chromosomes and used the following parameters: PHASE = 2; cMMb=1; DIST = 1; NGEN = 2000; NBIN = 400; MAF = 0.0; ZERO = 1; maxNCHROM=85; maxNSP = 50000; hc = 0.05; REPS = 40; threads = −99. For both of these analyses, historical genomes were excluded. We performed this analysis separately for the three genetic clusters identified by Wennerström et al.48 and the island population of Öland to avoid any underestimation caused by population substructure. We also used a generation time of seven years49.
Heterozygosity, genome-wide diversity and inbreeding
We estimated heterozygosity using mlRho v2.7108 which estimates the individual mutation rate (θ). Under the infinite sites model, θ approximates the genome-wide heterozygosity measured as the number of heterozygous sites per 1,000 bp. We downsampled each genome to the average coverage of the genome with lowest coverage (i.e., 7X), filtered out bases with quality (-Q) < 30, mapped sequencing reads with mapping quality (-q) <30 and positions with root-mean-square mapping quality (MQ) < 30 from the historical and modern bam files. We then filtered out sites with depth <1/3 (i.e., 5X) of the average depth of coverage. We also estimated nucleotide diversity (π) per population and time point using VCFTools v0.1.16 and with 50 kb non-overlapping windows.
Inbreeding coefficients (FROH) were estimated based on Runs of Homozygosity (ROH) that were identified using the sliding-window approach implemented in PLINK v1.9. We used a set of strict parameters to avoid overestimation and overrepresentation of long ROH: a sliding window size of 100 (homozyg-window-snp 100); no more than 1 site per window to assume a window as homozygous (homozyg-window-het 1); at least 5% of all windows including a given SNP to define the SNP as being in a homozygous segment (homozyg-window-threshold 0.05); a homozygous segment was defined as a ROH if the segment included ≥25 SNPs (homozyg-snp 25) and covered ≥100 kb (homozyg-kb 100); the minimum SNP density was one SNP per 50 kb (homozyg-density 50); and the maximum distance between two neighbouring SNPs was ≤1,000 kb (homozyg-gap 1,000). Finally, we set the value at 750 heterozygous sites within ROH (homozyg-het 750) in order to prevent sequencing errors to cut ROH. The inbreeding coefficient FROH was then estimated as the overall proportion of the genome (autosomes only) comprising ROH. We statistically tested for differences in heterozygosity and FROH among modern populations using ANOVAs and used Tukey’s HSD tests to test for pairwise differences among time periods for each population and among modern populations in R109.
Using the length of ROH, we also estimated the timing of inbreeding events using:
where g corresponds to the number of generations, L to the length of ROH in Mb, and r to the recombination rate67. For the latter we used an estimate of 1.04 cM/Mb estimated in red deer110. Based on these parameters and using a generation time of seven years49, ROH ≥ 0.5, ≥2, ≥5, ≥10 and ≥30 correspond to inbreeding events occurring ca. 96 (670 years), 24 (168 years), 9.6 (67 years), 4.8 (33 years) and 1.6 generations (11 years) before present (BP), respectively.
Genetic load
We used SnpEff v4.3111 to annotate synonymous and non-synonymous nucleotide substitutions in coding regions. All genomes were mapped against mule deer (Odocoileus hemionus; Odocoileus_hemionus_HiC) to reduce reference mapping and annotation bias. After removing gene models with in-frame STOP codons from the annotation (Odocoileus_hemionus_HiC.fasta_v2.functional.gff3), missing START and terminal STOP codons (-J option) and genes labelled as pseudogenes (--no-pseudo option) with Cufflinks v2.2.1112, we obtained a total of 22,732 genes.
Next, we generated a database for O. hemionus using the protein sequences extracted from the annotation. We first identified putative deleterious variants by allocating variants to three different impact categories as defined in the SnpEff manual: a) Low/Synonymous: mostly harmless or unlikely to change protein behaviour; b) Moderate: non-disruptive variants that might change protein effectiveness; c) High: variants assumed to have high (disruptive) impact on protein, probably causing protein truncation, loss of function or triggering nonsense-mediated decay and including stop gained codons, splice donor variant and splice acceptor, start codon lost111. We also excluded intergenic (-no-intergenic) and intron (-no-intron) variants. We estimated individual genetic load in two ways. First, after removing variants fixed in all individuals, we estimated the total individual load by summing the number of variants of each category i. We also corrected for potential mapping biases dividing the number of each category by the total number of synonymous SNPs, following Xue et al.113. Secondly, we estimated the individual realised load (i.e., total number of homozygous variants of category i divided by twice the total number of segregating sites for category i114). By taking into account the mode of dominance, this estimate allows us to estimate the proportion of potential load that is realised or expressed in each individual. We statistically tested for differences in genetic load among modern populations using ANOVAs and used Tukey’s HSD tests to test for pairwise differences among time periods for each population and among modern populations in R109.
To take into account the frequency of variants in each population, we also calculated the Rxy ratio of Moderate and High impact variants for modern population pairs and for temporal comparisons following von Seth et al.115. An Rxy equal to 1 corresponds to no change in frequency between two populations, whereas Rxy < 1 or >1 corresponds to a decrease/deficiency or an increase/excess in frequency in population x relative to population y, respectively. We used a jack-knife procedure in R109 to estimate the variance in the ratio.
Genomic indicators
Three indicators to monitor trends in genetic diversity have recently been introduced in Swedish monitoring work following science-management elaborations24. The indicators are based on standard population genetic metrics for within and between population variation and agree with Essential Biodiversity Variables (EBVs) for genetic diversity26 and are also relevant for the indicator assessments required by the new monitoring framework of the Convention on Biological Diversity (CBD)27. Two indicators assess diversity within populations (ΔH and Ne)20,24. The third indicator (ΔFST) reflects temporal changes in genetic diversity among populations as well as the number of populations maintained over time.
The ΔH indicator uses several measures of within population diversity and was estimated for the three clusters, with each cluster stratified into separate time points based on available samples. For the ΔH indicator, we used ΔHET, Δπ, and ΔFROH based on our estimates of heterozygosity, nucleotide diversity, and inbreeding (see above). For the North cluster, three temporal stratifications were possible; 1800s, 1980, and 2019/2020. Two stratifications were made for each of the remaining two clusters; 1980 and 2019/2020 for the Transition cluster and for the South cluster 1800s and 1980. For the Ne indicator, we used estimates from the GONE analysis from the most recent generation (i.e., 1 generation BP; Supplementary Data 3). The ΔFST indicator requires FST between at least two populations at two points in time to be calculated and was thus only possible for two temporal comparisons: FST between North and South clusters in the 1800s and in 1980, and between the North cluster and the Transition zone in 1980 and 2019/2020.
Next, for each indicator, threshold values reflecting specific rates of change over time were used to assess trends in genetic diversity of populations (see Fig. 3 in Andersson et al.24). Three indicator signals (i.e., Alarm / Warning / Acceptable) were given. For the ΔH indicator, and in cases of significant temporal change, the difference in diversity between two time points over the whole sampling period (between 40 and 170 years for our different comparisons) was translated into an annual change. The threshold values for annual reduction (≤0.05%; Acceptable, 0.06–0.3%; Warning, >0.3%; Alarm) were applied to the three measures of genomic diversity (i.e., heterozygosity, π, and FROH). For the Ne indicator, NeLD ≥ 500 is classified as Acceptable116, 50< NeLD < 500 Warning, and NeLD < 50 Alarm based on116,117.
The ΔFST indicator was classified as Acceptable if no statistical change was detected. In cases of a significant temporal change, FST was converted to gene flow between clusters (i.e., expected number of migrants24). A ΔFST reflecting an increase or decrease in migration was classified as follows: reduced migration by 25% and 50% was considered Warning and Alarm, respectively and increased migration of 50% and 100% classified as Warning and Alarm, respectively. This indicator was also classified as Alarm if any population went extinct over the monitoring period.
Only statistically significant changes in genetic metrics were considered24 and statistical testing of indicators was performed in R109 using t-tests and Wilcoxon signed-rank tests. Only results from the t-test are reported in cases where these two tests are concordant with each other.
Genomic differentiation and genome scans of selection
To identify candidate regions under putative positive selection and potentially due to local adaptation, we estimated pairwise FST between the North (n = 30) and South (n = 30) clusters in ANGSD118. We first calculated the sample allele frequency (saf) for the data mapped to O. hemionus with the following arguments: -doSaf 1; -gl 1; -minMapQ 30; -minQ 30. We used O. hemionus as reference for the ancestral state to estimate the unfolded Site Frequency Spectrum (SFS). Next, we calculated the SFS using the realSFS command to calculate 2d sfs for the North vs South populations comparison using the following options: -P 10; -r. We then used the realSFS fst index command to calculate Fst binary files. We used the realSFS command to extract Fst values and a sliding with the following options: -win 50,000; -step 10000. We performed a Z-transformation of FST values and retained FST-outlier windows with a Z-score >5 (i.e., values outside 5 standard deviations from the mean119). Z(FST) was calculated as Z(FST) = (FST – μ FST)/σ FST, where FST is the FST value in a window, μ FST the average FST over all windows, and σ FST the standard deviation of FST values over all windows. Finally, we extracted those Z(FST) outlier windows as a bed file and cross-referenced them with the annotation for O. hemionus (Odocoileus_hemionus_HiC.fasta_v2.functional.gff3) using BEDtools intersect100.
In order to test for association between the identified outlier windows and climate, we used a Latent Factor Mixed Model (LFMM) implemented in the LEA R package120. We used BEDtools intersect to extract the variants that were overlapping with Z(FST) outlier windows from our 80 modern genomes vcf file (data mapped to O. hemionus). Latitude was used as a proxy for temperature and as the explanatory variable in the model. We ran the model with K = 3 and for 20 repetitions. Significance was applied using an alpha value of 0.001 and a Bonferroni correction for multiple testing. We then extracted candidate SNPs and cross-referenced them with the annotation for O. hemionus using BEDtools intersect.
Finally, we estimated pairwise FST between the 1980 (n = 17) and 2019/2020 (n = 17) periods for the North cluster to identify recent temporal genomic differentiation as described above. This analysis was only performed for this cluster since it was the only one for which sample size was large enough for a temporal comparison and since substructure over the moose range could bias our results.
Gene ontology
For all genes identified with the genome scans analyses, we used the Mouse Genome Informatics database (www.informatics.jax.org) to manually retrieve gene ontologies and mammalian phenotype information for each candidate gene. We also compared our list of candidate genes with that of Wang et al.56.
Statistics and reproducibility
The research sample included 80 modern and 7 historical moose samples collected across the whole range of the species in Sweden. All statistical tests were conducted using publicly available programs and packages as described in the methodological sections above. Reproducibility can be accomplished by following the laboratory and analytical protocols described above and by following. The author’s GitHub (https://github.com/ndussex/Moose_genomics) contains details on the code and analyses used in this study.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
Assembly: Genbank (BioProject: PRJNA668262; Accession number: GCA_015832495.2; NRM_Aalces_2_0.fsa; JADEYB000000000). Resequencing data: PRJEB60841. See Supplementary Data 2 for estimates of Heterozygosity, inbreeding and genetic load.
Code availability
The full code used for the population genomic analyses is available on: https://github.com/ndussex/Moose_genomics.
References
Ceballos, G., Ehrlich, P. R. & Raven, P. H. Vertebrates on the brink as indicators of biological annihilation and the sixth mass extinction. Proc. Natl Acad. Sci. USA 117, 13596–13602 (2020).
Ceballos, G., Ehrlich, P. R. & Dirzo, R. Biological annihilation via the ongoing sixth mass extinction signaled by vertebrate population losses and declines. Proc. Natl Acad. Sci. USA 114, E6089–E6096 (2017).
Ripple, W. J. et al. Collapse of the world’s largest herbivores. Sci. Adv. 1, e1400103 (2015).
Linnell, J. D. C. et al. The challenges and opportunities of coexisting with wild ungulates in the human-dominated landscapes of Europe’s Anthropocene. Biol. Conserv. 244, 108500 (2020).
Linnell, J. D. C. & Zachos, F. E. Status and distribution patterns of European ungulates: genetics, population history and conservation. Ungulate Manag. Europe 12–53 (2010).
van Beeck Calkoen, S. T. S. et al. Ungulate management in European national parks: Why a more integrated European policy is needed. J. Environ. Manag. 260, 110068 (2020).
Apollonio, M. et al. Challenges and science-based implications for modern management and conservation of European ungulate populations. Mammal Res. 62, 209–217 (2017).
Demarais, S. et al. Ungulate management in national parks of the United States and Canada. Wildl. Soc. Tech. Rev. 12, 1–68 (2012).
Holderegger, R. et al. Conservation genetics: Linking science with practice. Mol. Ecol. 28, 3848–3856 (2019).
R. Taylor, H., Taylor, H. R., Dussex, N. & van Heezik, Y. Bridging the conservation genetics gap by identifying barriers to implementation for conservation practitioners. Global Ecol. Conserv. 10, 231–242 (2017).
Laikre, L. et al. Post-2020 goals overlook genetic diversity. Science 367, 1083–1085 (2020).
Hohenlohe, P., Chris Funk, W. & Rajora, O. Population genomics for wildlife conservation and management. Mol. Ecol. 30, 62–82 (2021).
Kardos, M. et al. The crucial role of genome-wide genetic variation in conservation. Proc. Natl Acad. Sci USA. 118, e2104642118 (2021).
DeWoody, J. A., Harder, A. M., Mathur, S. & Willoughby, J. R. The long-standing significance of genetic diversity in conservation. Mol. Ecol. 30, 4147–4154 (2021).
van Oosterhout, C. et al. Genomic erosion in the assessment of species extinction risk and recovery potential. Preprint at https://doi.org/10.1101/2022.09.13.507768.
Bertorelle, G. et al. Genetic load: genomic estimates and applications in non-model animals. Nat. Rev. Genet. 23, 492–503 (2022).
van Oosterhout, C. Mutation load is the spectre of species conservation. Nat. Ecol. Evol. 4, 1004–1006 (2020).
Wallace, B. Genetic Load, Its Biological and Conceptual Aspects. (Prentice Hall, 1970).
Charlesworth, B. Effective population size and patterns of molecular evolution and variation. Nat. Rev. Genetics 10, 195–205 (2009).
Díez-Del-Molino, D., Sánchez-Barreiro, F., Barnes, I., Gilbert, M. T. P. & Dalén, L. Quantifying Temporal genomic erosion in endangered species. Trends Ecol. Evol. 33, 176–185 (2018).
Allendorf, F. W., England, P. R., Luikart, G., Ritchie, P. A. & Ryman, N. Genetic effects of harvest on wild animal populations. Trends Ecol. Evol. 23, 327–337 (2008).
Formenti, G. et al. The era of reference genomes in conservation genomics. Trends Ecol. Evol. 37, 197–202 (2022).
Hoban, S. et al. Genetic diversity targets and indicators in the CBD post-2020 global biodiversity framework must be improved. Biological Conservation 248, 108654 (2020).
Andersson, A., Karlsson, S., Ryman, N. & Laikre, L. Monitoring genetic diversity with new indicators applied to an alpine freshwater top predator. Mol. Ecol. 31, 6422–6439 (2022).
Thurfjell, H., Laikre, L., Ekblom, R., Hoban, S. & Sjögren-Gulve, P. Practical application of indicators for genetic diversity in CBD post-2020 Global Biodiversity Framework implementation. Ecol. Indicators 142, 109167 (2022).
Hoban, S. et al. Global genetic diversity status and trends: towards a suite of Essential Biodiversity Variables (EBVs) for genetic composition. Biol. Rev. Camb. Philos. Soc. 97, 1511–1538 (2022).
CBD. Convention on Biological Diversity. Decisions adopted by the Conference of the Parties to the Convention on Biological Diversity. Fifteenth meeting – Part II. in The Kunming-Montreal Global Biodiversity Framework and the Monitoring framework for the Kunming-Montreal Global Biodiversity Framework. 130–130 (2022).
van der Valk, T., Díez-Del-Molino, D., Marques-Bonet, T., Guschanski, K. & Dalén, L. Historical Genomes Reveal the Genomic Consequences of Recent Population Decline in Eastern Gorillas. Curr. Biol. 29, 165–170.e6 (2019).
Robinson, J. A. et al. The critically endangered vaquita is not doomed to extinction by inbreeding depression. Science 376, 635–639 (2022).
von Seth, J. et al. Genomic insights into the conservation status of the world’s last remaining Sumatran rhinoceros populations. Nat. Commun. 12, 2393. (2021).
Robinson, J., Kyriazis, C. C., Yuan, S. C. & Lohmueller, K. E. Deleterious variation in natural populations and implications for conservation genetics. Annu Rev. Anim. Biosci. 11, 93–114 (2023).
Dussex, N., Morales, H. E., Grossen, C., Dalén, L. & van Oosterhout, C. Purging and accumulation of genetic load in conservation. Trends Ecol. Evol. 38, 961–969 (2023).
Hedrick, P. W. & Garcia-Dorado, A. Understanding inbreeding depression, purging, and genetic rescue. Trends Ecol. Evol. 31, 940–952 (2016).
Stüwe, M. & Nievergelt, B. Recovery of alpine ibex from near extinction: the result of effective protection, captive breeding, and reintroductions. Appl. Animal Behav. Sci. 29, 379–387 (1991).
Brambilla, A., Von Hardenberg, A., Nelli, L. & Bassano, B. Distribution, status, and recent population dynamics of Alpine ibex Capra ibex in Europe. Mammal Rev. 50, 267–277 (2020).
Robin, M. et al. Ancient mitochondrial and modern whole genomes unravel massive genetic diversity loss during near extinction of Alpine ibex. Mol. Ecol. 31, 3548–3565 (2022).
Grossen, C., Guillaume, F., Keller, L. F. & Croll, D. Purging of highly deleterious mutations through severe bottlenecks in Alpine ibex. Nat. Commun. 11, 1001 (2020).
Ralls, K. et al. Call for a Paradigm Shift in the Genetic Management of Fragmented Populations. Conservation Lett. 11, e12412 (2018).
Flanagan, S. P., Forester, B. R., Latch, E. K., Aitken, S. N. & Hoban, S. Guidelines for planning genomic assessment and monitoring of locally adaptive variation to inform species conservation. Evol. Appl. 11, 1035–1052 (2018).
Lamichhaney, S. et al. Population-scale sequencing reveals genetic differentiation due to local adaptation in Atlantic herring. Proc. Natl Acad. Sci. USA 109, 19345–19350 (2012).
Han, F. et al. Ecological adaptation in Atlantic herring is associated with large shifts in allele frequencies at hundreds of loci. Elife 9, e61076 (2020).
Booker, T. R., Jackson, B. C. & Keightley, P. D. Detecting positive selection in the genome. BMC Biol. 15, 1–10 (2017).
Markgren, G. The moose in Fennoscandia. Le. Naturaliste Canadien 101, 185–194 (1974).
Strandgaard, S. Factors Affecting the Moose Population in Sweden During the 20th Century with Special Attention to Silviculture. (1982).
Lykke, J. Selective harvest management of a norwegian moose population. Alces 41, 9–24 (2005).
Lykke, J. Moose management in Norway and Sweden. Nat. Can. 10, 723–735 (1974).
Sylvén, S. Management and regulated harvest of moose (Alces alces) in Sweden. PhD thesis. 371, Swedish University of Agricultural Sciences. (2003).
Wennerström, L. et al. Genetic landscape with sharp discontinuities shaped by complex demographic history in moose (Alces alces). J. Mammal. 97, 1–13. (2016).
Dussex, N. et al. Moose genomes reveal past glacial demography and the origin of modern lineages. BMC Genomics 21, 854 (2020).
Niedziałkowska, M. et al. Spatial structure in European moose (Alces alces): genetic data reveal a complex population history. J. Biogeogr. 41, 2173–2184 (2014).
Niedziałkowska, M., Jędrzejewska, B., Danyłow, J. & Niedziałkowski, K. Diverse rates of gene flow and long-distance migration in two moose Alces alces subpopulations in Europe. Mammal. Res. 61, 171–178 (2016).
Niedziałkowska, M. et al. The contemporary genetic pattern of European moose is shaped by postglacial recolonization, bottlenecks, and the geographical barrier of the Baltic Sea. Biol. J. Linn. Soc. Lond. 117, 879–894 (2016).
Sand, H., Cederlund, G. & Danell, K. Geographical and latitudinal variation in growth patterns and adult body size of Swedish moose (Alces alces). Oecologia 102, 433–442 (1995).
Nygrén, T., Pusenius, J., Tiilikainen, R. & Korpelainen, J. Moose antler type polymorphism: age and weight dependent phenotypes and phenotype frequencies in space and time. Ann. Zool. Fennici 44, 445–461 (2007).
Posledovich, D., Ekblom, R. & Laikre, L. Mapping and monitoring genetic diversity in Sweden - SWEDISH ENVIRONMENTAL PROTECTION AGENCY. (2021).
Wang, Y. et al. Genetic basis of ruminant headgear and rapid antler regeneration. Science 364, eaav6335 (2019).
Hundertmark, K. J. et al. Mitochondrial phylogeography of moose (Alces alces): late pleistocene divergence and population expansion. Mol. Phylogenet. Evol. 22, 375–387 (2002).
Blåhed, I.-M., Ericsson, G. & Spong, G. Noninvasive population assessment of moose (Alces alces) by SNP genotyping of fecal pellets. Eur. J. Wildlife Res. 65, 1–11 (2019).
Meiri, M., Lister, A., Kosintsev, P., Zazula, G. & Barnes, I. Population dynamics and range shifts of moose (Alces alces) during the Late Quaternary. J. Biogeogr. 47, 2223–2234 (2020).
Santiago, E. et al. Recent Demographic History Inferred by High-Resolution Analysis of Linkage Disequilibrium. Mol. Biol. Evol. 37, 3642–3653 (2020).
Ellegren, H., Mikko, S., Wallin, K. & Andersson, L. Limited polymorphism at major histocompatibility complex (MHC) loci in the Swedish moose A. alces. Mol. Ecol. 5, 3–9 (1996).
Mikko, S. & Andersson, L. Low major histocompatibility complex class II diversity in European and North American moose. Proc. Natl Acad. Sci. USA 92, 4259–4263 (1995).
Haanes, H., Røed, K. H., Solberg, E. J., Herfindal, I. & Sæther, B.-E. Genetic discontinuities in a continuously distributed and highly mobile ungulate, the Norwegian moose. Conserv. Genetics 12, 1131–1143 (2011).
Kangas, V.-M., Kvist, L., Laaksonen, S., Nygrén, T. & Aspi, J. Present genetic structure revealed by microsatellites reflects recent history of the Finnish moose (Alces alces). Eur. J. Wildlife Res. 59, 613–627 (2013).
Charlier, J., Laikre, L. & Ryman, N. Genetic Structure and Evidence of a Local Bottleneck in Moose in Sweden. J. Wildlife Manag. 72, 411–415 (2008).
Lord, E. et al. Pre-extinction Demographic Stability and genomic signatures of adaptation in the woolly rhinoceros. Curr. Biol. 30, 3871–3879.e7 (2020).
Pemberton, T. J. et al. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 91, 275–292 (2012).
Dussex, N. et al. Population genomics of the critically endangered kākāpō. Cell Genomics 1, 100002 (2021).
Allen, A. M. et al. Habitat-performance relationships of a large mammal on a predator-free island dominated by humans. Ecol. Evol. 7, 305–319 (2017).
Bell, D. A. et al. The Exciting Potential and Remaining Uncertainties of Genetic Rescue. Trends Ecol. Evol. 34, 1070–1079 (2019).
Robinson, J. A. et al. Genomic signatures of extensive inbreeding in Isle Royale wolves, a population on the threshold of extinction. Sci. Adv. 5, eaau0757 (2019).
Kolesnikov, V. V. & Kozlovskii, I. S. Geographical variation of antler morphology of moose (Alces alces) in Russia. Acta Theriologica 59, 443–448 (2014).
Terry Bowyer, R., Stewart, K. M., Pierce, B. M., Hundertmark, K. J. & Gasaway, W. C. Geographical variation in antler morphology of Alaskan moose: Putative effects of habitat and genetics. Alces 38, 155–165 (2002).
Geist, V. Deer of the World: Their Evolution, Behavior and Ecology. 52 (Stackpole Books, Mechanicsburg, Pennsylvania, USA, 1998).
Anderson, S. J., Côté, S. D., Richard, J. H. & Shafer, A. B. A. Genomic architecture of phenotypic extremes in a wild cervid. BMC genomics. 23, 1–13 (2022).
Hundertmark, K. J. & Bowyer, R. T. Genetics, evolution, and phylogeography of moose. Alces 40, 103–122 (2004).
Coltman, D. W. et al. Undesirable evolutionary consequences of trophy hunting. Nature 426, 655–658 (2003).
Hundertmark, K. J., Thelen, T. H. & Schwartz, C. C. Population and genetic effects of selective harvest strategies in moose: a modeling approach. Alces 29, 225–234 (1993).
Schmidt, J. I., Ver Hoef, J. M. & Terry Bowyer, R. Antler size of alaskan moose Alces alces gigas: effects of population density, hunter harvest and use of guides. Wildlife Biol. 13, 53–65. (2007).
Nygren, T. & Pesonen, M. Moose harvest and production of moose meat in Finland 1964-87. Suomen Riista (Finland) 139–140 (1989).
Luoma, A. Moose hunting in Finland — management of a heavily harvested population. https://helda.helsinki.fi/bitstream/handle/10138/22172/moosehun.pdf?sequence=2 (2002).
Hedrick, P. W. Rapid decrease in horn size of bighorn sheep: environmental decline, inbreeding depression, or evolutionary response to trophy hunting? J. Hered. 102, 770–781 (2011).
Ferchaud, A.-L. & Hansen, M. M. The impact of selection, gene flow and demographic history on heterogeneous genomic divergence: three-spine sticklebacks in divergent environments. Mol. Ecol. 25, 238–259 (2016).
Barton, N. H. Genetic hitchhiking. Philos. Trans. R. Soc. Lond. B Biol. Sci. 355, 1553–1562 (2000).
Schmidt, C., Hoban, S., Hunter, M., Paz-Vinas, I. & Garroway, C. J. Genetic diversity and IUCN Red List status. Conserv. Biol. https://doi.org/10.1111/cobi.14064 (2023).
Grace, M. K. et al. Testing a global standard for quantifying species recovery and assessing conservation impact. Conserv. Biol. 35, 1833–1849 (2021).
Yang, D. Y., Eng, B., Waye, J. S., Dudar, J. C. & Saunders, S. R. Technical note: improved DNA extraction from ancient bones using silica-based spin columns. Am. J. Phys. Anthropol. 105, 539–543 (1998).
Dussex, N. et al. Complete genomes of two extinct New Zealand passerines show responses to climate fluctuations but no evidence for genomic erosion prior to extinction. Biol. Lett. 15, 20190491 (2019).
Meyer, M. & Kircher, M. Illumina sequencing library preparation for highly multiplexed target capture and sequencing. Cold Spring Harb. Protoc. 2010, db.prot5448 (2010).
Briggs, A. W. et al. Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucl. Acids Res. 38, e87 (2010).
Dudchenko, O. et al. De novo assembly of the genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
Mikheenko, A., Prjibelski, A., Saveliev, V., Antipov, D. & Gurevich, A. Versatile genome assembly evaluation with QUAST-LG. Bioinformatics 34, i142–i150 (2018).
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A. & Zdobnov, E. M. BUSCO Update: Novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Mol. Biol. Evol. 38, 4647–4654 (2021).
Kutschera, V. E. et al. GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species. BMC bioinformatics. 23, 1–17 (2022).
Li, H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34, 3094–3100 (2018).
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
McKenna, A. et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
Jónsson, H., Ginolhac, A., Schubert, M., Johnson, P. L. F. & Orlando, L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics 29, 1682–1684 (2013).
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience 4, s13742–015 (2015).
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
Weir, B. S. & Clark Cockerham, C. Estimating F-statistics for the analysis of population structure. Evolution 38, 1358 (1984).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
Terhorst, J., Kamm, J. A. & Song, Y. S. Robust and scalable inference of population history from hundreds of unphased whole genomes. Nat. Genetics 49, 303–309 (2017).
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Haubold, B., Pfaffelhuber, P. & Lynch, M. mlRho - a program for estimating the population mutation and recombination rates from shotgun-sequenced diploid genomes. Mol. Ecol. 19, 277–284 (2010).
R Core Team. R: A language and environment for statistical computing. (R Foundation for Statistical Computing, 2020).
Johnston, S. E., Huisman, J., Ellis, P. A. & Pemberton, J. M. A high density linkage map reveals sexual dimorphism in recombination landscapes in red deer (Cervus elaphus). G3: Genes, Genomes, Genetics 7, 2859–2870 (2017).
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 6, 80–92 (2012).
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protocols. 7, 562–578 (2012).
Xue, Y. et al. Mountain gorilla genomes reveal the impact of long-term population decline and inbreeding. Science 348, 242–245 (2015).
Mathur, S. & DeWoody, J. A. Genetic load has potential in large populations but is realized in small inbred populations. Evol. Appl. 14, 1540–1557 (2021).
von Seth, J. et al. Genomic trajectories of a near-extinction event in the Chatham Island black robin. BMC Genomics 23, 747 (2022).
Jamieson, I. G. & Allendorf, F. W. How does the 50/500 rule apply to MVPs? Trends Ecol. Evol. 27, 578–584 (2012).
Allendorf, F. W., Chris Funk, W., Aitken, S. N., Byrne, M. & Luikart, G. Conservation and the Genomics of Populations. (Oxford University Press, 2022).
Korneliussen, T. S., Albrechtsen, A. & Nielsen, R. ANGSD: Analysis of Next Generation Sequencing Data. BMC Bioinforma. 15, 356 (2014).
Willoughby, J. R., Harder, A. M., Tennessen, J. A., Scribner, K. T. & Christie, M. R. Rapid genetic adaptation to a novel environment despite a genome-wide reduction in genetic diversity. Mol. Ecol. 27, 4041–4051 (2018).
Frichot, E. & François, O. LEA: An R package for landscape and ecological association studies. Methods Ecol. Evol. https://doi.org/10.1111/2041-210X.12382 (2015).
Acknowledgements
We thank Daniela Kalthoff (NRM, Stockholm) for access to historical samples, and Kristin Löwenborg at the Swedish Environmental Protection Agency for collaborations in the Swedish program for monitoring genetic diversity. The data (SLU) collection was funded by the thematic program Wildlife and Forestry, the Swedish University of Agricultural Sciences, the Swedish Environmental Protection Agency, the EU Interreg II program Kvarken-MittSkandia, the Swedish Association for Hunting and Wildlife Management, the Kempe Foundation, and the counties of Västerbotten and Norrbotten. N.D. and L.L. acknowledge support and financing of sequencing (208-20-009) of the Swedish Environmental Protection Agency (Naturvårdsverket). N.D. acknowledges support of the Carl Tryggers Foundation (Grant CTS 19: 257). L.L. acknowledges support from the Swedish Research Council Formas (2020-01290) and the Swedish Research Council (2019-05503). The authors acknowledge support from the National Genomics Infrastructure in Stockholm funded by Science for Life Laboratory, the Knut and Alice Wallenberg Foundation and the Swedish Research Council, and SNIC/Uppsala Multidisciplinary Center for Advanced Computational Science for assistance with massively parallel sequencing and access to the UPPMAX computational infrastructure. We thank the three anonymous reviewers for their constructive comments.
Funding
Open access funding provided by Stockholm University.
Author information
Authors and Affiliations
Contributions
N.D. and L.L. conceived the study. N.D. performed the laboratory work, analysed the data with the assistance of S.K. and wrote the main body of the manuscript. L.L., N.R., G.S. and G.E. provided modern tissue samples. R.A.O. and L.D. generated the de-novo reference genome. R.E. commented on the manuscript. All authors contributed to writing the manuscript. All authors have read and approved the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Biology thanks Morten Olsen and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: John Mulley and Christina Karlsson Rosenthal.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Dussex, N., Kurland, S., Olsen, RA. et al. Range-wide and temporal genomic analyses reveal the consequences of near-extinction in Swedish moose. Commun Biol 6, 1035 (2023). https://doi.org/10.1038/s42003-023-05385-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42003-023-05385-x
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.