Puma genomes from North and South America provide insights into the genomic consequences of inbreeding

Saremi, Nedda F.; Supple, Megan A.; Byrne, Ashley; Cahill, James A.; Coutinho, Luiz Lehmann; Dalén, Love; Figueiró, Henrique V.; Johnson, Warren E.; Milne, Heather J.; O’Brien, Stephen J.; O’Connell, Brendan; Onorato, David P.; Riley, Seth P. D.; Sikich, Jeff A.; Stahler, Daniel R.; Villela, Priscilla Marqui Schmidt; Vollmers, Christopher; Wayne, Robert K.; Eizirik, Eduardo; Corbett-Detig, Russell B.; Green, Richard E.; Wilmers, Christopher C.; Shapiro, Beth

doi:10.1038/s41467-019-12741-1

Download PDF

Article
Open access
Published: 18 October 2019

Puma genomes from North and South America provide insights into the genomic consequences of inbreeding

Nedda F. Saremi ORCID: orcid.org/0000-0003-2023-1212¹^na1,
Megan A. Supple²^na1,
Ashley Byrne³,
James A. Cahill ORCID: orcid.org/0000-0002-7145-0215²^nAff16,
Luiz Lehmann Coutinho⁴,
Love Dalén⁵,
Henrique V. Figueiró⁶,
Warren E. Johnson ORCID: orcid.org/0000-0002-5954-186X⁷^nAff17,
Heather J. Milne²,
Stephen J. O’Brien⁸,
Brendan O’Connell¹^nAff18,
David P. Onorato ORCID: orcid.org/0000-0002-4716-6847⁹,
Seth P. D. Riley^10,11,
Jeff A. Sikich¹⁰,
Daniel R. Stahler¹²,
Priscilla Marqui Schmidt Villela¹³,
Christopher Vollmers¹,
Robert K. Wayne¹¹,
Eduardo Eizirik ORCID: orcid.org/0000-0002-9658-0999⁶,
Russell B. Corbett-Detig¹,
Richard E. Green¹,
Christopher C. Wilmers¹⁴ &
…
Beth Shapiro^2,15

Nature Communications volume 10, Article number: 4769 (2019) Cite this article

16k Accesses
42 Citations
117 Altmetric
Metrics details

Subjects

An Author Correction to this article was published on 21 November 2019

This article has been updated

Abstract

Pumas are the most widely distributed felid in the Western Hemisphere. Increasingly, however, human persecution and habitat loss are isolating puma populations. To explore the genomic consequences of this isolation, we assemble a draft puma genome and a geographically broad panel of resequenced individuals. We estimate that the lineage leading to present-day North American pumas diverged from South American lineages 300–100 thousand years ago. We find signatures of close inbreeding in geographically isolated North American populations, but also that tracts of homozygosity are rarely shared among these populations, suggesting that assisted gene flow would restore local genetic diversity. The genome of a Florida panther descended from translocated Central American individuals has long tracts of homozygosity despite recent outbreeding. This suggests that while translocations may introduce diversity, sustaining diversity in small and isolated populations will require either repeated translocations or restoration of landscape connectivity. Our approach provides a framework for genome-wide analyses that can be applied to the management of similarly small and isolated populations.

Genome-wide genetic variation coupled with demographic and ecological niche modeling of the dusky-footed woodrat (Neotoma fuscipes) reveal patterns of deep divergence and widespread Holocene expansion across northern California

Article 15 December 2020

Native American gene flow into Polynesia predating Easter Island settlement

Article 08 July 2020

Introgression and disruption of migration routes have shaped the genetic integrity of wildebeest populations

Article Open access 12 April 2024

Introduction

The ancestors of the puma, Puma concolor, also known as the mountain lion or cougar, colonized North America approximately 6 million years ago (mya)^1,2,3. Although their Pliocene fossil record is sparse and felid fossil assignments have been difficult, previous mitochondrial analyses suggested that the ancestral puma lineage diverged from the extinct North American cheetah-like cat Miracinonyx around 3.2 mya⁴. The geographic origin of P. concolor remains contested, however. At sites across North America, the oldest puma fossils date to the Rancholabrean land mammal age⁵, ~200 thousand years ago (kya)⁶. Analyses of mitochondrial and microsatellite data, however, estimated that the common ancestor of North American pumas lived within the last 20,000 years^7,8 and that the genetic diversity of all modern pumas traces to eastern South America⁷. The combination of genetic and fossil data were interpreted as reflecting a North American origin of the puma lineage followed by local extinction in North America during the late Pleistocene and subsequent recolonization from South America as the climate warmed after the last ice age^7,8. Recently, however, an unequivocal puma fossil was discovered in Argentina that dates to 1.2–0.8 mya⁹. This discovery pushes back the age of the puma lineage by more than 500,000 years, and suggests that the ancestor of all living pumas may have evolved in South America rather than North America.

Today, pumas are among the most widely distributed mammals in the Western hemisphere, ranging from Canada’s Yukon to the southern tip of South America (Fig. 1)^10,11. During the 19th and 20th centuries, bounty hunting reduced, and in some cases extirpated, puma populations across North America¹⁰, restricting them to the North American West and the southern tip of Florida. By the middle of the 20th century, hunting quotas and some outright bans¹² allowed puma populations to increase and recolonize parts of their former range. Although some puma populations today are large and well-connected¹³, others are small and fragmented (e.g., Santa Ana, CA¹⁴; Santa Monica Mountains, CA¹⁵), and/or critically endangered (e.g., Florida¹⁶). Many populations are experiencing increased isolation with the expansion of highways, residential developments, and agriculture^14,15.

The consequences of geographic isolation on puma genetic diversity and fitness have been well documented, particularly in Florida, where they are a federally protected subspecies commonly called the Florida panther. By the 1990s, the canonical Florida panther population in Big Cypress National Preserve was suffering from reproductive failure and phenotypic defects associated with inbreeding^16,17. To rescue the Florida panthers from extinction, eight female pumas from Texas were released in South Florida in 1995, of which five successfully produced offspring. By 2008, the occurrence of phenotypic defects had significantly declined, survival measures had improved, and the population size increased almost threefold^16,18. All Florida panthers genotyped since 2012 show ancestry that includes admixture with the introduced lineages¹⁹.

Florida panthers in Everglades National Park are partially isolated from the core canonical population that persisted in Big Cypress National Preserve by a semipermeable barrier associated with hydrologic fluctuations of the Everglades. Intriguingly, during the 1990s, the Everglades panthers did not show the same high incidence of inbreeding-associated phenotypes as in the Big Cypress population. The absence of observed phenotypic defects in the Everglades population may be attributable to the release during the 1950s and 1960s of captive-bred Florida panthers with mixed Central American ancestry into Everglades National Park. The introduced individuals’ ancestry was unclear at the time of release, although it was known that the captive population had greater reproductive success than wild Florida panthers²⁰. The admixed ancestry of the Everglades population and potential explanation for the reproductive success of the captive population was later discovered through genotyping²¹.

Genetics has a long history as a tool in wildlife conservation²². In traditional conservation genetics, researchers sequence a small number of genetic markers across a large sampling of the species of interest. Advances in sequencing technologies have made it possible to sequence whole genomes of non-model organisms, including species of conservation concern. While the cost of sequencing continues to decrease, sequencing whole genomes will undoubtedly remain more costly than sequencing only a handful of genetic loci. This presents a choice: whole genome data sets exchange spatial resolution for finer-scale genomic resolution, allowing researchers to test different hypotheses. Each whole genome contains a multitude of largely-independent genealogies, which provides increased power to infer past events^23,24. In particular, the dense haplotype information provided by whole genomes is necessary to examine the very short timescales^25,26 relevant to conservation efforts.

Here, we reconstruct the last two million years of puma demographic history by generating and analyzing a draft genome from an individual sampled in the Santa Cruz Mountains (California, USA), along with nine resequenced genomes from pumas from North and South America. We confirm the recent maternal ancestry of North American pumas and describe genomic diversity in the sampled populations. We use shared tracts of homozygosity to predict the effectiveness of assisted gene flow in restoring lost genetic diversity. Finally, we analyze the genome of a Florida panther with admixed ancestry that was collected 30 years after the first release of Central American admixed pumas into the Everglades. This genome allows us to assess the long-term efficacy of inter-population admixture as a means to rescue small and isolated populations from the deleterious effects of inbreeding.

Results

Genome assembly and variant calling

We assembled a de novo nuclear genome for a wild male puma (SC36) from the Santa Cruz Mountains using a combination of shotgun Illumina (47× coverage), long-range linking Illumina, and Oxford Nanopore Technology (ONT) (1.2× coverage)²⁷ data (see Methods section). Our PumCon1.0 assembly has a BUSCO²⁸ score of 93.04%, a scaffold N50 of 100 Mb, and 87.6% of the genome represented on 26 autosomal scaffolds, each larger than 20 Mb (Supplementary Tables 1 and 2). Although our ONT coverage was only 1.2×, the use of these data for gap-filling recovered an additional 5.74% of the genome sequence, which we error-corrected by re-mapping the Illumina reads (Supplementary Table 1).

We obtained 27×−55× coverage whole-genome resequencing data from nine additional pumas from locations in North and South America (Fig. 1 and Supplementary Tables 3 and 4), and aligned the data to our reference assembly (PumCon1.0) for variant calling. We produced three final call sets: the first containing 8 million variable sites using the 10 pumas, the second decreased to 166,037 variable sites after filtering the first call set for linkage disequilibrium (LD), and the final call set containing 557,741 SNPs after LD filtering using the 10 pumas and the African cheetah (see Methods section).

Demographic history

We reconstructed puma demographic history using both mitochondrial and nuclear genomes. Analyses of mitochondrial DNA estimate the most recent common maternal ancestor of all sampled pumas ~300 kya (Fig. 2a). North American mitochondrial haplotypes cluster together, sharing an inferred common maternal ancestor 31–11 kya. The North American mitochondrial clade excludes the Florida Everglades puma (EVG21), which has a mitochondrial ancestry that is distinct from the rest of North America, consistent with the reported mixed ancestry of this individual¹⁶.

The nuclear genomic data revealed a similar demographic history to that inferred from the mitochondrial data, and allowed us to estimate changes in effective population size over time. Pairwise sequentially Markovian coalescent (PSMC) modeling²⁹ of the nuclear genomic data suggested that two puma lineages, one represented by the two Brazilian individuals and the other represented by all individuals sampled in North America, diverged by 300–100 kya (Fig. 2b and Supplementary Fig. 5), similar to the age of the oldest puma fossils in North America. Populations on both continents were largest around 130 kya, during the warmest part of Marine Isotope Stage (MIS) 5, and then declined throughout the colder MIS 4–2, with populations reaching their current small sizes by the peak of the last ice age 25–20 kya.

Our North American pumas showed a continued increase in effective population size between 500 and 200 kya, whereas the effective population size of the Brazil pumas stabilized. This increase may reflect an increase in numbers during colonization of unoccupied habitats in Central and North America, but may also be attributable to PSMC modeling overestimating effective population size when a species has divided into subpopulations³⁰. To test whether this observed peak was the result of population structure, we modeled pseudo-diploid individuals using the X chromosomes of our male pumas (see Methods section). We found evidence of a cessation of gene flow between all Brazil–North America pseudo-diploid male puma pairs by at least 100 kya, as shown by the sharp increase in the inferred effective population size (N_e), signifying no coalescent events occurred more recently than the estimated divergence time (Supplementary Fig. 7). The divergence dates obtained from the pseudo-diploid X chromosome PSMC analysis overlapped the time at which North and South American inferred N_e began to differ in the autosomal PSMC model. Thus, structure alone was not the reason behind the observed increase in N_e during this time. The spike in effective population size observed for EVG21 probably does not reflect the coalescent process within a single population, but is instead consistent with mixed ancestry comprised of two divergent lineages³¹.

Population structure

We used the nuclear genomic data to characterize genetic structure among puma populations (Fig. 3). We performed principal component analysis (PCA) of 166,037 LD-filtered SNPs and found evidence of a geographic pattern (Fig. 3a). The first two axes of the PCA, which explain 52% of the genetic variance, separated North and South America and revealed a gradient of relatedness from east to west across North America. The Everglades puma (EVG21) fell between the Big Cypress and Brazil populations, consistent with this individual’s known history of admixture. Pumas sampled from the same population clustered together.

We estimated a consensus nuclear phylogeny from 557,741 SNPs from the LD-filtered data set that included ten pumas and the African cheetah. This analysis found further evidence of structure, with the highest likelihood tree including a single migration event from a South (or Central) American lineage into the Everglades lineage (Fig. 3b and Supplementary Figs. 8 and 9). Finally, our cluster assignment tests based on the puma only LD-filtered SNPs also partitioned the data geographically, first separating out the two California populations at K = 2, and then the Florida and Brazil populations at K = 3 (Fig. 3c and Supplementary Fig. 10). Notably, EVG21 shares ancestry with both Florida and Brazil at all K values (Supplementary Fig. 11).

We note that the discrete populations identified in these analyses could simply reflect the spatial sampling of our data set. Spatially structured sampling can cause analyses to report distinct populations even when no discrete population structure exists³². This artifact is particularly likely to occur when geographically widespread samples are taken from well connected species where limited dispersal results in the accumulation of local genetic variants, resulting in genetic isolation by distance³³. However, the observed geographic structure could also be the result of discrete genetic structure due to population isolation. Some puma populations have experienced persecution and degradation of their habitat, resulting in limited gene flow between populations^16,34,35,36. These isolated populations would show increased divergence over time, resulting in geographic structure.

Heterozygosity and inbreeding

To examine the extent of inbreeding in our puma samples, we estimated for each individual average genome-wide heterozygosity and identified runs of homozygosity (ROH) across the 26 largest autosomal scaffolds (Fig. 4, see Methods section). We focused our analyses on ROH > 2 Mb, as we were able to call these longer tracts with high confidence. Although genetic drift is a dominant evolutionary force in small populations, the strong correlation among linked sites that is characteristic of ROH > 2 Mb requires close inbreeding, and would not be observed due to genetic drift alone^37,38. The distribution of ROH across the genome varied among scaffolds and individuals (Fig. 4a and Supplementary Fig. 12), as did average genome-wide heterozygosity and proportion of the genome in ROH (Fig. 4b). The two pumas from Brazil were the least inbred, with the highest heterozygosity and smallest proportions of their genomes in ROH. Conversely, the Big Cypress panthers sampled prior to the 1995 genetic rescue were the most inbred, with the lowest heterozygosity and the largest proportions of their genomes in ROH, consistent with the phenotypic defects recorded in these individuals¹⁶. The other North American pumas fell between these two extremes. Of the two individuals from the Santa Monica Mountains, SMM12 appeared to be less inbred than SMM22, with higher heterozygosity and a lower proportion of its genome in ROH. This is consistent with their origins, as genetic analysis suggests that SMM22 was likely born in the small and more isolated Santa Monica Mountains population south of US 101 freeway, whereas SMM12 was first observed in the larger and more connected population north of US 101 and dispersed into the Santa Monica Mountains as a subadult¹⁵.

EVG21, the admixed Florida panther from the Everglades population, was an outlier in the general correlation between heterozygosity and proportion of the genome in ROH. The proportion of EVG21’s genome in ROH was high relative to the expectation based on its average genome-wide heterozygosity. This is consistent with both ancestral admixture resulting in a more diverse genetic background and close inbreeding leading to long tracts of homozygosity (Supplementary Fig. 13).

To better explore inbreeding history, we examined the distribution of ROH tract lengths in each puma. We correlated those lengths with the expected number of generations since the individual’s maternal and paternal lineages shared a common ancestor using an estimated average recombination rate from the domestic cat of 1.1 cM per Mb³⁹ and the equation g = 100/(2rL), where g is the time in generations, r is the recombination rate, and L is the length of the ROH tract in Mb^26,40 (Fig. 4c). Long ROH (>15.2 Mb) occur due to close inbreeding (a common ancestor <3 generations ago). Short ROH (<5.7 Mb) occur due to shared ancestors further back in time (>8 generations ago). All North American pumas sampled had a large number of short ROH, indicating that these populations were small in the recent past (8–23 generations ago). The puma from Yellowstone had mostly short ROH and a small number of intermediate and long ROH, consistent with a population that was small in the recent past, but that does not suffer from a considerable amount of close inbreeding in recent generations. The pumas from the Santa Cruz and Santa Monica Mountains had patterns similar to the Yellowstone puma, except they had additional long ROH, suggesting that these populations are experiencing close inbreeding. The Big Cypress panthers each had many long ROH, which we estimated to reflect shared ancestors within the last three generations.

The admixed Everglades panther, EVG21, had a small number of short ROH, similar to the Brazilian pumas, but had mostly long ROH, similar to the more inbred Florida individuals. This combination can be attributed to EVG21’s complex history of admixture and inbreeding. EVG21 has historic admixture, and is the offspring of an inbreeding event—the sire of EVG21 was also EVG21’s half brother¹⁶ (Supplementary Fig. 4). The peak of the ROH length distribution for EVG21 occurs at 5.7–9.1 Mb, indicating that EVG21’s maternal and paternal lineages shared a common ancestor as far back as 5–8 generations, shortly after the admixture event that occurred 6–9 generations prior¹⁶.

Although the sampled North American pumas all have long ROH, these tracts were generally not identical by descent (IBD) between individuals (Fig. 4d). Long ROH that are also shared IBD between individuals are concerning because they represent regions of the genome with no genetic diversity in the four haplotypes analyzed. Of the pumas sequenced, only the two individuals from Big Cypress (CYP47 and CYP51) shared a considerable proportion (36%) of their genomes in ROH that are IBD between two individuals. The pumas from the Santa Cruz Mountains (SC29 and SC36) shared 12% of their genomes in IBD ROH, whereas the pumas that originate from different areas in and near the Santa Monica Mountains (SMM12 and SMM22) shared only 4%. Individuals from the Santa Cruz and Santa Monica Mountains shared between 3% and 5%. While most sampled North American populations show signs of close inbreeding, different populations are fixed for different variants and considerable genetic variation still exists when considering the species as a whole.

Discussion

We present a draft assembly of a puma genome, which we use to reconstruct the demographic history of the species and measure genome-wide heterozygosity and ROH, the latter of which is less practical with lower-quality or reference-guided genome assemblies. Our assembly strategy combined short-read Illumina data with long-read data from ONT to generate a scaffold N50 of 100 Mb, making this one of the most contiguous wild felid genomes assembled to date.

Our analyses of ten complete puma genomes revealed the dynamic history of a once widespread species whose population size is now reduced across much of its range. We showed that extant North American pumas are descended from a population that dispersed northward from South America by at least 200 kya, consistent with the age of the oldest puma fossils in North America. Previously, the incomplete fossil record paired with divergence estimates based on rapidly evolving microsatellites and partial mitochondrial genomes led to the hypothesis of a North American origin of the species, followed by a late Pleistocene local extinction in North America and then a recolonization from South America within the last 20,000 years^7,8. Our results using complete nuclear and mitochondrial genomes are consistent with previous genetic analyses in that we show that North American pumas represent a subset of puma genetic diversity. However, the nuclear genomic data suggest that the lineage leading to North American pumas diverged from South American pumas ~300–100 kya, considerably older than the 20 kya inferred previously. While we are unable to exclude the possibility of a local late Pleistocene extinction in North America followed by a recolonization from an unsampled lineage elsewhere in South or Central America, we argue that this nuclear genomic data in combination with a recently identified puma fossil in South America that dates to 1.2–0.8 mya⁹ supports a simpler demographic hypothesis in which the puma lineage originates in South America, disperses into North America by 300–100 kya and persists there to the present day. We note that new fossils or genomic data from late Pleistocene aged pumas or pumas from other locations in South and Central America will be necessary to test this demographic hypothesis.

If true, the new model for puma demographic history means that pumas would have been present in North America for at least one complete glacial/interglacial cycle, indicating that pumas were capable of surviving in a broad range of habitats and environments. This hypothesis is supported by data from living pumas, which, despite a preference in North America for mountainous habitats, are also known to occupy grassland habitats in South America, such as Patagonia¹⁰. Differences in habitat selection between the two continents probably reflect a long history of competition with a diverse carnivore guild on both continents. For example, jaguars are better adapted than pumas to living in habitats that flood periodically⁴¹, and predation by wolves in North America probably precludes pumas living in open habitat without escape terrain⁴².

Intriguingly, North American pumas share a common maternal ancestor around the peak cold period of the last ice age, ~20 kya. This period is associated with a reduction of available habitat across the continental United States, as the coalesced Laurentide and Cordilleran glaciers covered much of present-day Canada and the Upper Midwestern United States⁴³. Forests would have been reduced significantly at that time, as would available habitat for the smaller prey preferred by pumas, providing a potential mechanism for a reduction in puma population size around that time.

The recent history of pumas is marked by human persecution and encroachment on their habitat, resulting in small and isolated populations that are susceptible to loss of genetic diversity and predisposed to inbreeding. Over many generations, without the input of novel variation from migrants, isolated populations can accumulate local genetic variation while losing overall genetic diversity. Loss of genetic diversity may be a common situation for top predators, as their population densities are usually low and successful migrants are infrequent. Consequently, even moderate levels of fragmentation will affect their genomic diversity. While pumas in South America currently experience less habitat degradation than pumas in North America, pumas in South America will likely face further habitat loss and fragmentation as rapid human population growth and land development continues on the continent¹³. The result may be small, isolated populations in South America similar to those currently seen in North America. Thus conservation efforts and findings taken from isolated populations in North America may need be applied in the future to other parts of the puma range.

In North America, pumas were hunted extensively, resulting in low population densities in many areas of their range¹⁰. Hunting was so severe until regulations were put in place during the mid 20th century that pumas likely experienced a population bottleneck. All North American pumas sampled in this study exhibit short ROH that date to approximately the early 20th century, indicative of small effective population sizes during the time when hunting was severe.

In many areas of North America, including California and Florida, large-scale hunting was followed by shrinking habitat availability, resulting in small, isolated populations^15,44,45. Our sampling focused on populations in North America that are known to be isolated and, as such, our results highlight the genomic consequences of this isolation—reduced diversity and signatures of close inbreeding. Pumas in the isolated populations of Big Cypress (CYP), Santa Monica Mountains (SMM), and Santa Cruz Mountains (SC) all have many ROH of all length categories, indicating ongoing inbreeding as a result of continued small population sizes. In contrast, the Yellowstone individual had a similar number of short ROH to these more isolated populations, but fewer long ROH. This pattern is consistent with the known history of hunting and habitat availability in the Yellowstone area. Pumas in the Yellowstone area were hunted to low densities into the mid 20th century¹⁰, but today Yellowstone National Park is a large protected area surrounded by wildlands. This connectivity between the Park and wildlands facilitated the recovery and maintenance of genetic diversity in the local puma population once hunting pressures were reduced.

Florida panthers are among the most well-studied populations of pumas, especially with regard to the phenotypic manifestations of isolation and inbreeding. The 1995 introduction of pumas from Texas, the most geographically proximate population to the Florida panthers, is widely regarded as a successful genetic rescue via translocation. However, Florida panther genetic diversity in the Everglades population had been bolstered several decades earlier, when seven individuals were released into Everglades National Park from a captive facility where pumas from Central America had been included in the breeding population²⁰. One Florida panther that we sequenced, EVG21, is admixed, having both Floridian and Central American ancestry. Her genome is a combination of regions with comparatively high heterozygosity, similar to that observed in the Brazilian pumas, and long ROH, similar to the highly inbred Florida panthers. The distribution of the lengths of ROH suggest that her maternal and paternal lineages shared a common ancestor that lived shortly after the release of the admixed pumas into the Everglades population. This suggests that the genomic consequences of inbreeding happen quickly, with much of the gains from the genetic rescue being quickly erased. EVG21’s genome provides evidence that when the population is small, it is likely that an individual’s parental lineages will share a very recent common ancestor, even after genetic rescue through admixture (Supplementary Fig. 13). Thus, a consistent effort is required to maintain the benefits of translocation.

In many areas of the current puma range, human land use has reduced the connectivity that is critical to recovery and maintenance of healthy populations. Despite these barriers, gene flow among neighboring populations can be facilitated by enhancing landscape connectivity through coordinated land use planning and by adding bridges or underpasses across freeways⁴⁶. Although pumas are capable of traveling long distances, large roads are a major barrier to their movement^14,47. A model of population dynamics in the Santa Monica Mountains that incorporated landscape connectivity and its effects on genetic diversity predicted a high probability of extinction (99.7%) within 50 years after survival rates first began to decrease due to inbreeding, unless connectivity was increased⁴⁸. Our genomic analyses of the samples from the Santa Monica Mountains also support the effectiveness of population connectivity. The two pumas sequenced from the region (SMM12 and SMM22) both currently reside in the small subpopulation south of US 101 freeway. However SMM12 migrated into the subpopulation from north of US 101¹⁵, a larger area that shows greater connectivity to surrounding regions. Migrations between these two areas are now rare, but the two subpopulations were probably part of a larger panmictic population prior to the existence of US 101. The genomic analysis of ROH highlighted that SMM22 had an increased number of large ROH relative to SMM12, consistent with SMM22 originating in a population that is smaller and more isolated. The examination of IBD ROH between SMM12 and SMM22 showed that only 4% of their genomes are in ROH that are IBD between the two individuals. In contrast, individuals that originated from the same population have a much larger proportion of their genomes in IBD ROH (e.g., 12% for SC29 and SC36). This indicates that while inbreeding has reduced diversity in a considerable proportion of the genomes of individuals within small populations, these low diversity regions are generally not shared between populations. Thus, reconnecting the populations on either side of US 101, as currently proposed via a wildlife crossing over the freeway, would help restore the lost genetic diversity.

Genome-scale data sets have the potential to inform conservation planning. Our results highlight how whole genome data can provide new insights when compared to traditional conservation genetic techniques. For instance, measures of average heterozygosity are the most commonly used metrics to characterize the genetic health of a species, as estimates are relatively simple to generate and are easily comparable among organisms. However, average heterozygosity provides only a narrow insight into the health and genetic potential of a species⁴⁹. While in some species average genome-wide heterozygosity is highly correlated with the level of inbreeding estimated using ROH²⁶, in systems with admixture, average heterozygosity estimates can be deceptive, as demonstrated with our admixed Everglades puma (EVG21). The heterozygosity of EVG21 is almost as high as the Brazilian pumas, but EVG21 has a large portion of her genome in ROH. We would infer two very different genetic conditions when considering each metric separately, and thus both heterozygosity and proportion of the genome in ROH should be considered in assessing genomic health. Finally, knowledge of shared ROH, an analysis which can only be done with very high density markers across the genome, is critical when designing mitigation plans, as this analysis predicts whether enhancing connectivity would restore lost genetic diversity and helps identify potential candidates for translocation. In this context, this study can serve as a template for future conservation genomic research targeting species living in small, isolated populations.

Methods

Assembly and annotation of the puma reference genome

We captured and drew blood from a wild, male puma (SC36) who lived in the Santa Cruz Mountains in California, USA in accordance with guidelines and regulations of local governing bodies (Supplemental Methods). We extracted DNA and generated a combination of short-read paired-end, proximity-ligation, and long-read data (Supplemental Methods). We assembled a de novo shotgun assembly using trimmed paired-end short reads, and scaffolded the assembly using proximity-ligation data⁵⁰ (Supplementary Fig. 1). We performed gap-filling on the scaffolded genome assembly using long-read data and corrected the newly gap-filled sequence using the short-read paired-end libraries (Supplementary Methods and Supplementary Table 1). Given that the puma used for the shotgun assembly was a male, we identified three X chromosome scaffolds in a female genome assembly (SMM13) and added these scaffolds (scaffolds X1, X2, and X3) to the assembly for SC36 (Supplementary Fig. 2 and Supplementary Methods).

We assessed this final version of the genome (PumCon1.0) by alignment to the domestic cat genome (GCA_000181335.4) (Supplementary Fig. 3). We used the genome assessment tool BUSCO²⁸ (version 2.0.1) to evaluate genome completeness based on a set of conserved single-copy orthologous genes (human gene set; n = 4104). In the PumCon1.0 genome, 93.0% of these genes are complete and present in a single copy only (Supplementary Table 2). The final genome assembly is 2,432,985,507 bp in length with an N50 of 100.53 Mb, 178,994 gaps, and 114,069,924 Ns. We focused further analyses on the 87.6% of the genome that is represented on 26 autosomal scaffolds, each larger than 20 Mb.

We generated and sequenced a cDNA library from whole blood collected from a wild puma (SC85) from the Santa Cruz Mountains (Supplementary Methods). The PumCon1.0 genome was annotated by NCBI according to the NCBI Eukaryotic Genome Annotation Pipeline⁵¹ using our cDNA data and a publicly available data set generated from a wild puma from Arizona (SAMN02885420, SRX633288).

Additional puma genomes

We generated genomic data for a total of 11 pumas (Supplementary Tables 3 and 4). We used data from one female from the Santa Monica Mountains to assemble the X chromosome (SMM13). The other ten pumas, including the individual used for the genome assembly (SC36), were used in a panel for analysis of demographic history, population structure, and inbreeding. The ten pumas that formed our panel were: two pumas from the Santa Cruz Mountains in Northern California (SC29, SC36), one puma from Yellowstone National Park (YNP198), two pumas from the Big Cypress National Preserve that were part of the canonical (pre-Texas admixture) Florida panther population (CYP47, CYP51), one puma from the population that lived in Everglades National Park in Florida that was the admixed descendent of a canonical Florida panther and a puma of Central American ancestry that was released into the Everglades decades prior to the Texas panther introduction¹⁶ (EVG21), two pumas from the Santa Monica Mountains in Southern California (SMM12, SMM22), and two pumas from eastern Brazil (BR406, BR338). Capture, handling, and sampling of all pumas involved in this study were approved by the appropriate governing bodies (Supplementary Methods). We generated ~30× coverage of short-read data for the 11 pumas described above (Supplementary Methods), and downloaded shotgun sequencing data for the African cheetah⁵² (SRR2737512-SRR2737518) to use as the outgroup for our analyses.

To perform variant calling and filtering for the puma genomes, we mapped adapter-trimmed resequencing data and cheetah SRA data to the PumCon1.0 genome, including the mitochondrial scaffold (Supplementary Methods). Due to the high number of nuclear mitochondrial DNA segments (NUMTs) in felids⁵³, we sought to decrease mismappings of authentic mitochondrial DNA in our data to NUMTs. We generated three sets of genotypes: two sets comprised the ten pumas (one set was LD filtered and the other was not LD filtered), and a third included the ten pumas plus the cheetah (LD filtered). For all variant files, we masked or removed sites that were not biallelic SNPs, and did not pass our filtering criteria (Supplementary Methods). We removed mitochondrial and X chromosome related scaffolds, and used only autosomal scaffolds for further analyses (scaffold Mt, X1, X2, X3, 869, 1862) (Supplementary Methods).

The non-LD filtered puma-only variant file contained 8,212,535 SNPs. The final LD-filtered puma-only variant file contained 166,037 SNPs. The LD-filtered puma and cheetah variant file contained 557,741 SNPs. The larger number of variants in the puma and cheetah file is due to sites where the cheetah carries two of the alternate allele while all pumas carry the reference allele. Using the non-LD filtered SNP calls from the puma-only data set, we generated a fasta file for each sample, masking both failed SNP sites and failed individual genotypes to Ns (Supplementary Methods).

Mitochondrial genome assemblies and phylogeny inference

We assembled an initial mitochondrial sequence for SC36 using short and long read data that mapped to the available puma reference mitochondrial sequence (KP202261.1) (Supplementary Methods). We then used adapter-trimmed Illumina shotgun data to assemble the mitochondrial genome sequences of the remaining nine pumas. We used the iterative assembler mia⁵⁴, with the SC36 mitochondrial sequence as the reference. The coverages of these mitochondrial assemblies ranged from 35× to 138×. We annotated the mitochondrial genomes using MITOS⁵⁵.

We constructed a maximum likelihood phylogeny using a single partition data set, and a GTR + GAMMA substitution model using the program RAxML⁵⁶ (Supplementary Methods), including our ten assembled puma mitochondrial genomes, the available puma reference mitochondrial sequence (KP202261.1), and a cheetah mitochondrial sequence (KP202271.1) as the outgroup. We estimated divergence times using a prior composite estimate of the feline mitochondrial divergence rate of 1.15% bp per million years^7,57 (Supplementary Methods).

Demographic history

We used the pairwise sequentially Markovian coalescent (PSMC) model²⁹ to estimate the historical effective population sizes of puma populations (Supplementary Methods). We performed one hundred replicate bootstraps for each individual per the software instructions (Supplementary Fig. 5). We also ran the PSMC tool on outbred regions of the genome, identified by being void of ROH, and saw no considerable difference from the full genome results (Supplementary Methods and Supplementary Fig. 6). In addition, we investigated the divergence time between our North and South American male pumas by running PSMC modeling of X chromosome pseudo-diploid sequences of each male North American puma with that of either of the two male Brazilian pumas (Supplementary Fig. 7).

Population structure

We ran principal component analysis on the LD-filtered variant file for the ten pumas, which consisted of 166,037 SNPs (Supplementary Methods). We constructed a tree to show population splits, both with and without the admixed sample EVG21, using the 557,741 SNPs in the LD-filtered variant file that included the cheetah outgroup (Supplementary Methods). We inferred population structure of the pumas using the LD filtered variant file with 10 pumas and 166,037 SNPs (Supplementary Methods and Supplementary Figs. 10 and 11).

Genome-wide heterozygosity and runs of homozygosity

We calculated genome-wide heterozygosity using three different methods: two reference-based and one non reference-based (Supplementary Methods and Supplementary Table 4).

We used a hidden Markov model (HMM) to identify ROH by identifying transitions between inbred and outbred regions of the genome (https://github.com/russcd/Heterozygosity_HMM). We estimated HMM model parameters from the data and used the filtered fasta files with IUPAC codes as input (Supplementary Methods and Supplementary Table 5). We converted the ROH tract lengths to generations using an estimated average recombination rate from the domestic cat (Supplementary Methods).

We used the sliding window approach in PLINK⁵⁸ (version 1.90b4.4) to identify ROH for comparison with our ROH HMM. Even with relaxed parameters, PLINK still tended to break up long tracts (Supplementary Fig. 14). Since accurate estimates of tract lengths were key to our inbreeding analysis, we used the ROH called by our ROH HMM program for further analyses.

We observed a low frequency of short ROH in the genome of the admixed Everglades panther (EVG21) relative to the other Florida panthers. We hypothesized that, because an individual cannot have a shared maternal and paternal ancestor that dates to before the admixture event, admixture in previous generations may have prevented the formation of short ROH. To test our hypothesis, we used an HMM to classify tracts of ancestry in the IUPAC coded fasta file of EVG21 into three types: pure Central/South American ancestry, pure Floridian ancestry, and mixed Central/South American and Floridian ancestry. The genome of EVG21 was composed of 21.98% Central/South American ancestry, 28.24% Floridian ancestry, and 49.58% mixed ancestry based on the HMM (Supplementary Fig. 15). Using ROH greater than 2 Mb that we identified with the ROH HMM, we classified each ROH as one of the three ancestry types. The results of this analysis classified all ROH as either pure Florida or pure Central/South American ancestry. We saw no ROH that were classified as being of mixed ancestry. Thus, admixture effectively prevents the formation of mixed ancestry ROH (Supplementary Figs. 15 and 16 and Supplementary Methods).

We estimated the proportion of the ROH that are shared between pairs of pumas by finding genomic regions where ROH overlap between pairs of samples (Supplementary Methods). For each pair of pumas we calculated the proportion of the genome that occurs in ROH that are IBD (Supplementary Table 6).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets generated for this study are available in public repositories. Sequence data used for the genome assembly have been deposited in the SRA with the accession numbers SRR7148342-SRR7148354 [https://www.ncbi.nlm.nih.gov/Traces/study/?acc=SAMN08662999]. The PumCon1.0 genome is available on GenBank with the accession number GCF_003327715.1. The RNA-Seq data is available on the SRA with the accession number SRX4067841. Sequencing reads for the panel of pumas have been deposited in the SRA with the accession numbers SRR7639695, SRR7639696, SRR7542886-SRR754288, SRR7660678-SRR7660679, SRR7664677-SRR7664678, SRR7956993-SRR7956994, SRR7610940-SRR7610941, SRR7661934-SRR7661935, SRR7690239-SRR7690240, SRR7543017-SRR7543018, SRR7537344-SRR7537345, and SRR7148342-SRR7148354. Annotated mitochondrial assemblies for the ten pumas are available on GenBank with the accession numbers MH807447, MH814703, MH814704, MH814705, MH814706, MH814707, MH818219, MH818220, MH818221, and MH818222. All other relevant data is available upon request.

Change history

21 November 2019
An amendment to this paper has been published and can be accessed via a link at the top of the paper.

References

Van Valkenburgh, B., Grady, F. & Kurtén, B. The Plio-Pleistocene cheetah-like cat Miracinonyx inexpectatus of North America. J. Vert. Paleontol. 10, 434–454 (1990).
Article Google Scholar
Martin, L. D., Gilbert, B. M. & Adams, D. B. A cheetah-like cat in the north american pleistocene. Science 195, 981–982 (1977).
Article ADS CAS PubMed Google Scholar
Johnson, W. E. et al. The late Miocene radiation of modern Felidae: a genetic assessment. Science 311, 73–77 (2006).
Article ADS CAS PubMed Google Scholar
Barnett, R. et al. Evolution of the extinct Sabretooths and the American cheetah-like cat. Curr. Biol. 15, R589–R590 (2005).
Article CAS PubMed Google Scholar
Bell, C. J. et al. 7. The Blancan, Irvingtonian, and Rancholabrean Mammal Ages. In Late Cretaceous and Cenozoic Mammals of North America (Columbia University Press, 2004).
Froese, D. et al. Fossil and genomic evidence constrains the timing of bison arrival in North America. Proc. Natl Acad. Sci. USA 114, 3457–3462 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Culver, M., Johnson, W. E., Pecon-Slattery, J. & O’Brien, S. J. Genomic ancestry of the American puma (Puma concolor). J. Hered. 91, 186–197 (2000).
Article CAS PubMed Google Scholar
Matte, E. M. et al. Molecular evidence for a recent demographic expansion in the puma (Puma concolor) (Mammalia, Felidae). Genet. Mol. Biol. 36, 586–597 (2013).
Article CAS PubMed PubMed Central Google Scholar
Chimento, N. R. & Dondas, A. First record of Puma concolor (Mammalia, Felidae) in the early-middle pleistocene of South America. J. Mamm. Evol. 25, 381–389 (2017).
Sunquist, M. & Sunquist, F. Wild cats of the world (University of Chicago Press, 2017).
Nowell, K., Jackson, P. & IUCN/SSC Cat Specialist Group. Wild Cats: Status Survey and Conservation Action Plan (World Conservation Union, 1996).
Feldhamer, G. A., Thompson, B. C. & Chapman, J. A. Wild Mammals of North America: Biology, Management, and Conservation. (JHU Press, 2003).
Hornocker, M. & Negri, S. Cougar: Ecology and Conservation. (University of Chicago Press, 2009).
Vickers, T. W. et al. Survival and mortality of Pumas (Puma concolor) in a fragmented, urbanizing landscape. PLoS One 10, e0131490 (2015).
Article PubMed PubMed Central CAS Google Scholar
Riley, S. P. D. et al. Individual behaviors dominate the dynamics of an urban mountain lion population isolated by roads. Curr. Biol. 24, 1989–1994 (2014).
Article CAS PubMed Google Scholar
Johnson, W. E. et al. Genetic restoration of the Florida panther. Science 329, 1641–1645 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Barone, M. A. et al. Reproductive characteristics of male Florida Panthers: comparative studies from Florida, Texas, Colorado, Latin America, and North American Zoos. J. Mammal. 75, 150–162 (1994).
Article Google Scholar
Onorato, D. et al. The Biology and Conservation of Wild Felids. 453–470 (Oxford University Press, 2010).
van de Kerk, M., Onorato, D. P., Hostetler, J. A., Bolker, B. M. & Oli, M. K. Dynamics, persistence, and genetic management of the endangered florida panther population. Wildl. Monogr. 203, 3–35 (2019).
Article Google Scholar
O’Brien, S. J. Tears of the Cheetah: the genetic secrets of our animal ancestors. (St. Martin’s Griffin, 2015).
O’Brien, S. J. et al. Genetic Introgression within the Florida Panther (Felis concolor coryi). Nat. Geogr. Res. 6, 484–494 (1990).
Google Scholar
Frankham, R. Conservation genetics. Annu. Rev. Genet. 29, 305–327 (1995).
Article CAS PubMed Google Scholar
Hudson, R. R. Gene genealogies and the coalescent process. in Oxford Surveys in Evolutionary Biology (eds. Futuyma, D. & Antonovics, J.) Vol. 7, 1–44 (Oxford University Press, 1991).
Schraiber, J. G. & Akey, J. M. Methods and models for unravelling human evolutionary history. Nat. Rev. Genet. 16, 727–740 (2015).
Article CAS PubMed Google Scholar
Palamara, P. F., Lencz, T., Darvasi, A. & Pe’er, I. Length distributions of identity by descent reveal fine-scale demographic history. Am. J. Hum. Genet. 91, 809–822 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kardos, M. et al. Genomic consequences of intensive inbreeding in an isolated wolf population. Nat. Ecol. Evol. 2, 124–131 (2018).
Article PubMed Google Scholar
Jain, M., Olsen, H. E., Paten, B. & Akeson, M. The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol. 17, 239 (2016).
Article PubMed PubMed Central CAS Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed CAS Google Scholar
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mazet, O., Rodríguez, W., Grusea, S., Boitard, S. & Chikhi, L. On the importance of being structured: instantaneous coalescence rates and human evolution–lessons for ancestral population size inference? Heredity 116, 362–371 (2016).
Article CAS PubMed Google Scholar
Cahill, J. A., Soares, A. E. R., Green, R. E. & Shapiro, B. Inferring species divergence times using pairwise sequential Markovian coalescent modelling and low-coverage genomic data. Philos. Trans. R. Soc. Lond. B Biol. Sci. 371, 20150138 (2016).
Article PubMed PubMed Central Google Scholar
Schwartz, M. K. & McKelvey, K. S. Why sampling scheme matters: the effect of sampling scheme on landscape genetic results. Conserv. Genet. 10, 441–452 (2009).
Article Google Scholar
Wright, S. Isolation by distance. Genetics 28, 114–138 (1943).
Article CAS PubMed PubMed Central Google Scholar
Ernest, H. B., Vickers, T. W., Morrison, S. A., Buchalski, M. R. & Boyce, W. M. Fractured genetic connectivity threatens a southern california puma (Puma concolor) population. PLoS One 9, e107985 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Riley, S. P. D. et al. A southern California freeway is a physical and social barrier to gene flow in carnivores. Mol. Ecol. 15, 1733–1741 (2006).
Article CAS PubMed Google Scholar
McRae, B. H., Beier, P., Dewald, L. E., Huynh, L. Y. & Keim, P. Habitat barriers limit gene flow and illuminate historical events in a wide-ranging carnivore, the American puma. Mol. Ecol. 14, 1965–1977 (2005).
Article CAS PubMed Google Scholar
Ceballos, F. C., Joshi, P. K., Clark, D. W., Ramsay, M. & Wilson, J. F. Runs of homozygosity: windows into population history and trait architecture. Nat. Rev. Genet. 19, 220–234 (2018).
Article CAS PubMed Google Scholar
Pemberton, T. J. et al. Genomic patterns of homozygosity in worldwide human populations. Am. J. Hum. Genet. 91, 275–292 (2012).
Article CAS PubMed PubMed Central Google Scholar
Dumont, B. L. & Payseur, B. A. Evolution of the genomic rate of recombination in mammals. Evolution 62, 276–294 (2008).
Article CAS PubMed Google Scholar
Thompson, E. A. Identity by descent: variation in meiosis, across genomes, and in populations. Genetics 194, 301–326 (2013).
Article CAS PubMed PubMed Central Google Scholar
Schaller, G. B. & Crawshaw, P. G. Movement patterns of Jaguar. Biotropica 12, 161 (1980).
Article Google Scholar
Elbroch, L. M. & Kusler, A. Are pumas subordinate carnivores, and does it matter? PeerJ 6, e4293 (2018).
Article PubMed PubMed Central Google Scholar
Dyke, A. S. et al. The Laurentide and Innuitian ice sheets during the last glacial maximum. Quat. Sci. Rev. 21, 9–31 (2002).
Article ADS Google Scholar
Wang, Y., Allen, M. L. & Wilmers, C. C. Mesopredator spatial and temporal responses to large predators and human development in the Santa Cruz Mountains of California. Biol. Conserv. 190, 23–33 (2015).
Article Google Scholar
Sweanor, L. L., Logan, K. A. & Hornocker, M. G. Cougar dispersal patterns, metapopulation dynamics, and conservation. Conserv. Biol. 14, 798–808 (2000).
Article Google Scholar
Gustafson, K. D., Winston Vickers, T., Boyce, W. M. & Ernest, H. B. A single migrant enhances the genetic diversity of an inbred puma population. Royal Soc. Open Sci. 4, 170115 (2017).
Article ADS Google Scholar
Beier, P. Dispersal of Juvenile Cougars in fragmented habitat. J. Wildl. Manag. 59, 228 (1995).
Article Google Scholar
Benson, J. F., Sikich, J. A. & Riley, S. P. D. Individual and population level resource selection patterns of Mountain Lions Preying on Mule Deer along an Urban-Wildland Gradient. PLoS One 11, e0158006 (2016).
Article PubMed PubMed Central CAS Google Scholar
Robinson, J. A. et al. Genomic flatlining in the endangered island Fox. Curr. Biol. 26, 1183–1189 (2016).
Article CAS PubMed Google Scholar
Putnam, N. H. et al. Chromosome-scale shotgun assembly using an in vitro method for long-range linkage. Genome Res. 26, 342–350 (2016).
Article CAS PubMed PubMed Central Google Scholar
Françoise, T.-N. P., Alexander, S. P., Terence, M. P. & Dicuccio M. & Kitts, P. Eukaryotic Genome Annotation Pipeline (National Center for Biotechnology Information (US), 2013).
Dobrynin, P. et al. Genomic legacy of the African cheetah, Acinonyx jubatus. Genome Biol. 16, 277 (2015).
Article PubMed PubMed Central Google Scholar
Lopez, J. V., Yuhki, N., Masuda, R., Modi, W. & O’Brien, S. J. Numt, a recent transfer and tandem amplification of mitochondrial DNA to the nuclear genome of the domestic cat. J. Mol. Evol. 39, 174–190 (1994).
CAS PubMed Google Scholar
Green, R. E. et al. A complete Neandertal mitochondrial genome sequence determined by high-throughput sequencing. Cell 134, 416–426 (2008).
Article CAS PubMed PubMed Central Google Scholar
Bernt, M. et al. MITOS: improved de novo metazoan mitochondrial genome annotation. Mol. Phylogenet. Evol. 69, 313–319 (2013).
Article PubMed Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article CAS PubMed PubMed Central Google Scholar
Lopez, J. V., Culver, M., Stephens, J. C., Johnson, W. E. & O’Brien, S. J. Rates of nuclear and cytoplasmic mitochondrial DNA sequence divergence in mammals. Mol. Biol. Evol. 14, 277–286 (1997).
Article CAS PubMed Google Scholar
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, (2015).
U.S. Geological Survey. Gap Analysis Project, 2017, Cougar (Puma concolor) mCOUGx_CONUS_2001v1 Habitat Map. https://doi.org/10.5066/F79885C6 (2017).
Nielsen, C., Thompson, D., Kelly, M. & Lopez-Gonzalez, C. A. Puma concolor (errata version published in 2016). https://doi.org/10.2305/IUCN.UK.2015-4.RLTS.T18868A50663436.en (2015).
Cho, Y. S. et al. The tiger genome and comparative analysis with lion and snow leopard genomes. Nat. Commun. 4, 2433 (2013).
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8, e1002967 (2012).
Falush, D., Stephens, M. & Pritchard, J. K. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164, 1567–1587 (2003).
Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).

Download references

Acknowledgements

We thank Paul Houghtaling for helping to collect samples, R. Miotto, E. Amorim, J. May, CENAP/ICMBio/Brazil and AMC/Brazil for access to samples, and S. Webber and C. Scelfo-Dalbey for assistance in generating sequencing data. The authors would like to acknowledge support from Science for Life Laboratory, the National Genomics Infrastructure, and UPPMAX for providing assistance in massive parallel sequencing and computational infrastructure. Sequencing was also performed by the Laboratório de Biotecnologia Animal at the Universidade de São Paulo in Brazil, UC San Diego Institute for Genomic Medicine Genomics Center, UC Berkeley Vincent J. Coates Genomics Sequencing Laboratory, and UC Santa Cruz Ancient and Degraded Processing Center. Funding was provided by the Blue Foundation, and by a grant to C.C.W. from the Gordon and Betty Moore Foundation. C.C.W. was funded in part by NSF grants 1255913 and 0963022. B.S., M.A.S., N.F.S., and R.K.W. were funded by a grant from the University of California Office of the President. N.F.S. was funded in part by T32 HG008345/HG/NHGRI NIH HHS/United States. L.D. was funded by Formas grant 2015-676. D.R.S. was funded in part by Yellowstone Forever. R.B.C.-D. was funded by NIH-R35GM128932. B.S. and R.E.G. were funded in part by NSF DEB-1754551. E.E., L.L.C., and H.V.F. were supported by funds from CNPq/Brazil and INCT-EECBio/Brazil. Portions of this manuscript were prepared while W.E.J held a National Research Council Research Associateship Award at the Walter Reed Army Institute of Research and the published material reflects the views of the authors and should not be construed to represent those of the Department of the Army or the Department of Defense.

Author information

James A. Cahill
Present address: Laboratory of Neurogenetics of Language, Rockefeller University, 1230 York Avenue, New York, NY, 10065, USA
Warren E. Johnson
Present address: Walter Reed Biosystematics Unit, Smithsonian Institution, 4210 Silver Hill Road, Suitland, MD, 20746, USA
Brendan O’Connell
Present address: Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, 3181 S.W. Sam Jackson Park Road, Portland, OR, 97239-3098, USA
These authors contributed equally: Nedda F. Saremi, Megan A. Supple.

Authors and Affiliations

Department of Biomolecular Engineering, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA, 95064, USA
Nedda F. Saremi, Brendan O’Connell, Christopher Vollmers, Russell B. Corbett-Detig & Richard E. Green
Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA, 95064, USA
Megan A. Supple, James A. Cahill, Heather J. Milne & Beth Shapiro
Department of Molecular, Cell, and Developmental Biology, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA, 95064, USA
Ashley Byrne
Laboratório de Biotecnologia Animal, Departamento de Zootecnia, ESALQ, Universidade de São Paulo, Caixa Postal 09, Piracicaba, SP, 13418-900, Brazil
Luiz Lehmann Coutinho
Department of Bioinformatics and Genetics, Swedish Museum of Natural History, P.O. Box 50007, Stockholm, 10405, Sweden
Love Dalén
Escola de Ciências, Pontifical Catholic University of Rio Grande do Sul, Avenida Ipiranga, 6681-Partenon, Porto Alegre-RS, 90619-900, Brazil
Henrique V. Figueiró & Eduardo Eizirik
Smithsonian Conservation Biology Institute, Smithsonian Institution, 600 Maryland Avenue SW, Washington, DC, 20002, USA
Warren E. Johnson
Theodosius Dobzhansky Center for Genome Bioinformatics, Saint Petersburg State University, 41 Sredniy Prospekt, Saint Petersburg, 199004, Russia
Stephen J. O’Brien
Fish and Wildlife Research Institute, Florida Fish and Wildlife Conservation Commission, 298 Sabal Palm Road, Naples, FL, 34114, USA
David P. Onorato
Santa Monica Mountains National Recreation Area, 401 West Hillcrest Drive, Thousand Oaks, CA, 91360, USA
Seth P. D. Riley & Jeff A. Sikich
Department of Ecology and Evolutionary Biology, University of California, Los Angeles, 610 Charles E. Young Drive South, Los Angeles, CA, 90095-1601, USA
Seth P. D. Riley & Robert K. Wayne
Yellowstone Center for Resources, P.O. Box 168, Yellowstone National Park, WY, 82190, USA
Daniel R. Stahler
EcoMol Consultoria e Projetos, Avenida Limeira, 1131- Areiao, Piracicaba-SP, Brazil
Priscilla Marqui Schmidt Villela
Environmental Studies Department, University of California, Santa Cruz, 1156 High Street, Santa Cruz, CA, 95064, USA
Christopher C. Wilmers
Howard Hughes Medical Institute, 400 Jones Bridge Road, Chevy Chase, MD, 20815, USA
Beth Shapiro

Authors

Nedda F. Saremi
View author publications
You can also search for this author in PubMed Google Scholar
Megan A. Supple
View author publications
You can also search for this author in PubMed Google Scholar
Ashley Byrne
View author publications
You can also search for this author in PubMed Google Scholar
James A. Cahill
View author publications
You can also search for this author in PubMed Google Scholar
Luiz Lehmann Coutinho
View author publications
You can also search for this author in PubMed Google Scholar
Love Dalén
View author publications
You can also search for this author in PubMed Google Scholar
Henrique V. Figueiró
View author publications
You can also search for this author in PubMed Google Scholar
Warren E. Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Heather J. Milne
View author publications
You can also search for this author in PubMed Google Scholar
Stephen J. O’Brien
View author publications
You can also search for this author in PubMed Google Scholar
Brendan O’Connell
View author publications
You can also search for this author in PubMed Google Scholar
David P. Onorato
View author publications
You can also search for this author in PubMed Google Scholar
Seth P. D. Riley
View author publications
You can also search for this author in PubMed Google Scholar
Jeff A. Sikich
View author publications
You can also search for this author in PubMed Google Scholar
Daniel R. Stahler
View author publications
You can also search for this author in PubMed Google Scholar
Priscilla Marqui Schmidt Villela
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Vollmers
View author publications
You can also search for this author in PubMed Google Scholar
Robert K. Wayne
View author publications
You can also search for this author in PubMed Google Scholar
Eduardo Eizirik
View author publications
You can also search for this author in PubMed Google Scholar
Russell B. Corbett-Detig
View author publications
You can also search for this author in PubMed Google Scholar
Richard E. Green
View author publications
You can also search for this author in PubMed Google Scholar
Christopher C. Wilmers
View author publications
You can also search for this author in PubMed Google Scholar
Beth Shapiro
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.S., C.C.W., and R.E.G. conceived and designed the study. C.C.W., C.V., D.P.O., D.R.S., E.E., H.V.F., J.A.S., L.D., L.L.C., P.M.S.V., R.K.W., S.J.O., S.P.D.R., and W.J. provided samples or data for this work. A.B., B.O., H.J.M. and P.M.S.V. performed laboratory work. B.S., R.B.C.-D., and R.E.G. supervised the analysis. J.A.C., M.A.S., N.F.S., and R.B.C.-D analyzed the data. B.S., C.C.W., E.E., M.A.S., N.F.S., R.B.C.-D., R.E.G., R.K.W., S.J.O., and W.J. interpreted the results. B.S., M.A.S., and N.F.S. wrote the paper. All authors edited the paper.

Corresponding author

Correspondence to Beth Shapiro.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Reporting Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Saremi, N.F., Supple, M.A., Byrne, A. et al. Puma genomes from North and South America provide insights into the genomic consequences of inbreeding. Nat Commun 10, 4769 (2019). https://doi.org/10.1038/s41467-019-12741-1

Download citation

Received: 14 March 2019
Accepted: 26 September 2019
Published: 18 October 2019
DOI: https://doi.org/10.1038/s41467-019-12741-1

This article is cited by

Comparative genomics and genome-wide SNPs of endangered Eld’s deer provide breeder selection for inbreeding avoidance
- Vichayanee Pumpitakkul
- Wanna Chetruengchai
- Vorasuk Shotelersuk
Scientific Reports (2023)
PumaPlex100: an expanded tool for puma SNP genotyping with low-yield DNA
- John A. Erwin
- Robert R. Fitak
- Melanie Culver
Conservation Genetics Resources (2021)
Transcriptomic and genomic variants between koala populations reveals underlying genetic components to disorders in a bottlenecked population
- R. E. Tarlinton
- J. Fabijan
- R. D. Emes
Conservation Genetics (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Genome assembly and variant calling

Demographic history

Population structure

Heterozygosity and inbreeding

Discussion

Methods

Assembly and annotation of the puma reference genome

Additional puma genomes

Mitochondrial genome assemblies and phylogeny inference

Demographic history

Population structure

Genome-wide heterozygosity and runs of homozygosity

Reporting summary

Data availability

Change history

21 November 2019

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links