Introduction

Loss of biodiversity is one of the critical challenges faced both by our planet and our species, as many plants and animals have been eradicated from human-dominated landscapes or remain in small populations that face a serious threat of extinction (UNEP, 1992). Conservation of these remaining populations may, in the long run, critically depend on genetic factors (Allendorf and Luikart, 2007; Frankham, 2009). Genetic diversity indicates a population’s fitness and evolutionary potential, and consequently its adaptive potential and resilience to environmental change (Reed and Frankham, 2003; Allendorf and Luikart, 2007), which makes it a critical issue for conservation. Increased accessibility and decreasing costs are making the use of genetics in biodiversity conservation more attractive than ever, and increasingly large amounts of genetic data are available for species of conservation concern. Comparing these data between different populations along the range of a species would be useful for understanding and evaluating their genetic health and assessing the risk of inbreeding depression. However, genetic diversity of different populations is often evaluated using different methods and markers, making such comparisons difficult (see Swenson et al., 2011).

We propose a simple approach for calibrating genetic diversity of different populations, reported by different studies, to the same scale relative to a reference population. By using this one well-studied population as a ‘yardstick’, we can perform large-scale comparisons of genetic diversity across a species range using the existing data. We demonstrate the utility of this concept using the brown bear (Ursus arctos), a widely distributed carnivore species that has been extensively studied using genetic methods.

Throughout most of its global range, the brown bear is suffering from habitat loss and overharvest, and more than 50% of its range and numbers have been lost since the mid-1800s (Servheen et al., 1999). Large populations remain in Northeastern and Northwestern Russia, Alaska and Canada, but only smaller isolated populations remain in the rest of the bear’s former range in Europe, the contiguous United States and the southern portions of the range in Asia. Although genetic diversity of different brown bear populations has been well documented, different studies typically use different types or panels of markers, making the results difficult to compare (Swenson et al., 2011).

Centuries of persecution wiped out bears from most of Western Europe, and by the mid-20th century only a few isolated remnant populations remained in the Apennine Mountains, Italian Alps, Cantabrian Mountains and Pyrenees (Zedrosser et al., 2001). Bears in Central, Eastern and Northern Europe fared somewhat better, with indigenous populations remaining in the Dinaric Mountains, the Carpathians and Northern Europe, but most of these populations were much smaller than today (Zedrosser et al., 2001). This situation started to change in the second half of the 20th century, when many remaining populations recovered and expanded owing to conservation and management efforts (Zedrosser et al., 2001). The last decade of that century also marked the beginning of reintroductions of this species to Western Europe (Clark et al., 2002). Although the overall situation is improving, many populations are still critically small (Linnell et al., 2005). Understanding genetic diversity both within and between European brown bear populations is therefore particularly important, as it can facilitate selection of the most appropriate source for reintroductions or population augmentations, and help identify the populations that need assistance.

We studied the Northern Dinaric bear population and used it as the reference population in this case study. This population stretches from Slovenia through Croatia and Bosnia and Herzegovina into Western Serbia and Montenegro (Zedrosser et al., 2001), and has an effective population size of approximately 280 bears (Skrbinšek et al., 2012). It is part of the larger Alps-Dinara-Pindos population, which spans 11 countries, is thought to number approximately 2100–2500 individuals, and is considered stable over most of its range (Zedrosser et al., 2001).

In this paper, we (1) introduce the reference population approach for calibrating and comparing genetic diversity reported by different studies of different populations, (2) survey the baseline genetic diversity of the bears in the Northern Dinaric Mountains and (3) use the reference population approach, with the bears in the Northern Dinaric Mountains as the reference population, to calibrate and compare genetic diversity reported by different studies of bear populations across the range of the species.

Materials and methods

Comparing genetic diversity using the reference population approach

Different studies of genetic diversity typically vary in the number of samples and the sets of genetic markers they apply. Although this limits the degree to which the reported diversity indices are directly comparable, we can calculate the genetic diversity indices relative to the diversity indices of a single well-studied population (large sample size, a large number of loci) that we use as a ‘yardstick’ (the reference population).

For each pairwise comparison of a population with the reference, the genetic marker sets of both the reference and the compared population are reduced to the loci they have in common. To correct for differences in sample size, individual genotypes from the larger data set (typically the reference population) are randomly resampled with replacement many times (1000) to the sample size of the smaller data set (Leberg, 2002). Average allelic richness, expected heterozygosity and their standard errors are then calculated over all random subsamples. The standard errors are calculated as the mean of the standard errors of the individual subsamples. Finally, the heterozygosity ratio (Her) and allelic diversity ratio (Art) indices are calculated for the compared population as Her=Hex/HesR and Art=Ax/AsR, where Hex and Ax are the expected heterozygosity and allelic diversity of the compared population, and HesR and AsR are the subsampling-corrected values of these indices in the reference population (assuming that the reference population had more samples). Standard errors of the Her and Art indices are calculated as the standard error (s.e.) of division,

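$$\mathrm{s.e.}(\mathrm{Her}) = \mathrm{Her}\,\sqrt{\left(\frac{\mathrm{s.e.}(\mathrm{Hex})}{\mathrm{Hex}}\right)^{2} + \left(\frac{\mathrm{s.e.}(\mathrm{HesR})}{\mathrm{HesR}}\right)^{2}}$$

and

$$\mathrm{s.e.}(\mathrm{Art}) = \mathrm{Art}\,\sqrt{\left(\frac{\mathrm{s.e.}(\mathrm{Ax})}{\mathrm{Ax}}\right)^{2} + \left(\frac{\mathrm{s.e.}(\mathrm{AsR})}{\mathrm{AsR}}\right)^{2}}$$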
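
The calibration steps described in this section can be sketched in Python. This is an illustrative sketch only and not the authors' R package deposited at Dryad. It assumes that a genotype is a dictionary mapping locus names to allele pairs, that expected heterozygosity is computed as 1 − Σp² without a small-sample correction, that standard errors are taken across loci, and that the s.e. of a ratio follows the usual first-order formula for a quotient with independent errors. All function names are hypothetical.

```python
import math
import random
from collections import Counter
from statistics import mean, stdev


def locus_stats(pop, locus):
    """Return (expected heterozygosity, number of alleles) at one locus."""
    alleles = [a for ind in pop for a in ind[locus]]
    counts = Counter(alleles)
    total = len(alleles)
    he = 1.0 - sum((c / total) ** 2 for c in counts.values())
    return he, len(counts)


def diversity(pop, loci):
    """Mean He and A over loci, each followed by its s.e. across loci.

    Needs at least two loci, because stdev() is undefined for one value.
    """
    he_vals, a_vals = zip(*(locus_stats(pop, locus) for locus in loci))
    root = math.sqrt(len(loci))
    return (mean(he_vals), stdev(he_vals) / root,
            mean(a_vals), stdev(a_vals) / root)


def subsampled_reference(reference, loci, n, reps=1000, rng=random):
    """Resample the reference with replacement to size n, reps times.

    Means and standard errors are both averaged over the subsamples.
    """
    runs = [diversity(rng.choices(reference, k=n), loci) for _ in range(reps)]
    return tuple(mean(column) for column in zip(*runs))


def ratio_with_se(x, se_x, ref, se_ref):
    """Ratio x/ref and its s.e. (first-order propagation, independent errors)."""
    ratio = x / ref
    return ratio, ratio * math.sqrt((se_x / x) ** 2 + (se_ref / ref) ** 2)


def compare(population, reference, reps=1000, rng=random):
    """Her and Art (each with s.e.) of a population relative to the reference."""
    # Step 1: keep only the loci genotyped in both data sets.
    loci = sorted(set(population[0]) & set(reference[0]))
    # Step 2: indices of the compared population on the shared loci.
    he_x, se_he_x, a_x, se_a_x = diversity(population, loci)
    # Step 3: reference indices at the sample size of the compared population.
    he_r, se_he_r, a_r, se_a_r = subsampled_reference(
        reference, loci, len(population), reps, rng)
    # Step 4: ratios with propagated standard errors.
    return {"Her": ratio_with_se(he_x, se_he_x, he_r, se_he_r),
            "Art": ratio_with_se(a_x, se_a_x, a_r, se_a_r)}
```

Passing a seeded generator, for example `compare(pop, ref, rng=random.Random(1))`, makes the subsampling reproducible. The sketch assumes the reference is the larger data set, as in the case study.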

Genetic diversity of brown bears in Northern Dinaric Mountains—the reference population

Tissue and blood samples were collected from 2003 to 2008 from 505 dead bears and 8 bears captured for translocation (to France in 2006) or telemetry in the northernmost part of the Northern Dinaric population, in Slovenia (Figure 1). We analyzed 22 microsatellite loci for these 513 bears in three multiplex PCRs. Locus names, primer sequences, dyes, primer concentrations, and the analytic and quality assurance protocols used are detailed in Appendix 1. Further analytic protocols used for these loci are described in Skrbinšek et al. (2010). We randomly selected 10% of samples and repeated the genotyping to estimate error rates, as suggested by Pompanon et al. (2005). The actual number of repeats was considerably higher, as the entire multiplex was repeated if the genotype at any locus was unclear. We used the methods recommended by Broquet and Petit (2004) to estimate the frequency of allelic dropouts and false alleles, and the program Micro-Checker (Van Oosterhout et al., 2004) to check the data for null alleles and for scoring errors due to stuttering and dropout of large alleles.

Figure 1
Alps-Dinara-Pindos bear population and sampling area. Shaded areas show brown bear range. (a) Alps-Dinara-Pindos population NW, NW Dinaric Mountains; (b) Alps-Dinara-Pindos population SE; (2) Carpathian population (after Zedrosser et al., 2001). Rectangle—sampling area.

We used the R statistical environment (R Development Core Team, 2011) and the ‘adegenet’ package (Jombart, 2008) for data handling and calculation of genetic diversity indices: observed heterozygosity (Ho), expected heterozygosity (He) and allelic diversity (A). Probability of identity (PI) and probability of identity of siblings (PIsib) were calculated according to Waits et al. (2001). We used the procedure described in Guo and Thompson (1992), with 1 000 000 steps in the Markov chain and 10 000 dememorization steps, to detect significant per-locus departures from Hardy–Weinberg equilibrium using the program Arlequin (Excoffier and Lischer, 2010). The Holm–Bonferroni correction with a threshold of α=0.05 was used to correct for multiple testing.
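
For readers who want to see the per-locus indices spelled out, the following Python sketch computes Ho, He, A, PI and PIsib from a list of diploid genotypes at a single locus. It is not the adegenet code used in the study. The PI and PIsib expressions are the single-locus formulas given by Waits et al. (2001), He is computed as 1 − Σp² without a small-sample correction (an assumption), and the function names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(genotypes):
    """Allele frequencies from a list of (allele1, allele2) tuples."""
    counts = Counter(a for g in genotypes for a in g)
    total = sum(counts.values())
    return {allele: c / total for allele, c in counts.items()}


def observed_heterozygosity(genotypes):
    """Ho: proportion of individuals carrying two different alleles."""
    return sum(a != b for a, b in genotypes) / len(genotypes)


def expected_heterozygosity(freqs):
    """He: 1 minus the sum of squared allele frequencies."""
    return 1.0 - sum(p ** 2 for p in freqs.values())


def allelic_diversity(freqs):
    """A: number of distinct alleles observed at the locus."""
    return len(freqs)


def probability_of_identity(freqs):
    """PI for unrelated individuals: sum p_i^4 + sum_{i<j} (2 p_i p_j)^2."""
    p = list(freqs.values())
    return (sum(x ** 4 for x in p)
            + sum((2 * x * y) ** 2 for x, y in combinations(p, 2)))


def probability_of_identity_sibs(freqs):
    """PIsib: 0.25 + 0.5*sum(p^2) + 0.5*sum(p^2)^2 - 0.25*sum(p^4)."""
    s2 = sum(x ** 2 for x in freqs.values())
    s4 = sum(x ** 4 for x in freqs.values())
    return 0.25 + 0.5 * s2 + 0.5 * s2 ** 2 - 0.25 * s4
```

Multilocus PI and PIsib are obtained by multiplying the single-locus values across loci, which assumes that the loci are independent.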

Using the reference population approach to explore differences in genetic diversity of brown bear populations across species range

We compared genetic diversity of different brown bear populations across the species range using the bears in the Northern Dinaric Mountains as the reference. The details of the included studies are presented in Appendix 2. The marker set we used for the reference population included the majority or all of the markers used in any other study, allowing for a large panel of loci in most comparisons. As our data set also included several times the number of samples analyzed in any other study, we always used it as the larger data set for resampling. We made 1000 random subsamples for each comparison. Finally, we calculated the Her and Art indices, and used these to compare genetic diversity of bear populations across the species range.
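
The bookkeeping behind these comparisons can be sketched as follows. For each published study, the sketch determines the loci shared with the reference panel and the sample size to which the reference is subsampled. The function name and the input layout are hypothetical and chosen only for illustration.

```python
def plan_comparisons(reference_loci, reference_n, studies):
    """Work out shared loci and subsample size for each study.

    studies maps a study label to (list of locus names, sample size).
    """
    plans = {}
    for name, (loci, n) in studies.items():
        shared = sorted(set(reference_loci) & set(loci))
        plans[name] = {
            "shared_loci": shared,
            # Share of the study's panel that the reference covers.
            "panel_coverage": len(shared) / len(set(loci)),
            # The larger data set is resampled down to the smaller one.
            "subsample_size": min(n, reference_n),
            "reference_is_larger": reference_n >= n,
        }
    return plans
```

A study whose panel has little or no overlap with the reference shows up with a low `panel_coverage`, which flags comparisons that rest on very few loci.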

The R code required to run comparisons between populations using the reference population approach (in the form of an R package with user manual and a user-friendly vignette), as well as the genetic data from the Dinaric bear population used for this study, are accessible in the Dryad repository (doi:10.5061/dryad.qt3j5).

Results

Genotyping

No loci showed evidence of long allele dropout or scoring errors due to stuttering. Locus Mu26 had null alleles (estimated frequency using detected null homozygotes=0.117) and was excluded from downstream analyses. Locus G10H did not provide reliable genotyping results and was also excluded. Locus Mu23 had an irregular repeat pattern, as two of the eight alleles had a single-base deletion in the region flanking the (CA)n microsatellite, making their size differ by a single base pair from the neighbouring alleles. We were able to score these alleles reliably, so the locus could in principle be included in the analyses. However, as other studies may have missed this polymorphism, or used primers that did not include the region containing it, using this locus could bias the genetic diversity estimates for the reference population upward. We therefore excluded this locus from the reference population data.

On average, 66% of per-locus genotype analyses were repeated more than once (this varied between multiplexes: A=69%, B=71%, C=51%). Median allelic dropout rate was 0.19% (0.00–0.70%). We detected false alleles only at locus G10P (0.19%). Taking into account the number of loci, per-locus error rates, the number of samples genotyped and the number of times analyses of each sample were repeated, we expect approximately 10 (9.6) single-locus errors to remain in the data set. This puts the estimated remaining per-locus error rate in the entire data set at 9.36 × 10⁻⁴.
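
As a rough consistency check, the reported rate can be reproduced by dividing the expected 9.6 remaining errors by the number of single-locus genotypes in the final data set. The denominator used here, 513 bears times the 20 retained loci, is an assumption about how the figure was derived and is not stated in the text.

```latex
\[
\frac{9.6}{513 \times 20} = \frac{9.6}{10\,260} \approx 9.36 \times 10^{-4}
\]
```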

Genetic diversity of bears in Northern Dinaric Mountains (Slovenia)

Average heterozygosities using the 20 remaining loci were 0.731 (He) and 0.738 (Ho). All these loci fit Hardy–Weinberg expectations after Holm–Bonferroni multiple test correction at P=0.05. Average allelic diversity was 6.75 (s.d.=1.77). Per-locus results are summarized in Table 1.
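
The step-down correction applied to the per-locus Hardy–Weinberg tests can be sketched in a few lines of Python. This is a generic illustration of the Holm procedure, not the authors' code, and the function name is hypothetical.

```python
def holm_bonferroni(p_values, alpha=0.05):
    """Return one boolean per test: True where the null is rejected.

    P-values are visited from smallest to largest. The k-th smallest
    (k starting at 0) is compared with alpha / (m - k), and testing
    stops at the first p-value that exceeds its threshold.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    rejected = [False] * m
    for rank, i in enumerate(order):
        if p_values[i] > alpha / (m - rank):
            break
        rejected[i] = True
    return rejected
```

With 20 loci, the smallest p-value would have to fall at or below 0.05/20 = 0.0025 for any locus to be flagged as departing from Hardy–Weinberg expectations.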

Table 1 Genetic diversity indices for brown bears in Northern Dinaric Mountains

Comparison of genetic diversity of brown bear populations across the range of the species

The results of the range-wide comparison of genetic diversity in brown bears are summarized in Table 2, and show considerable differences between populations. On one extreme, the most diverse is the Carpathian population in Romania, followed by large populations in Canada and Alaska. At the other extreme, the lowest levels of diversity are observed for island populations and very small populations of high conservation concern (Gobi Desert, Cantabrian Mountains—Spain, Kodiak Island—Alaska).

Table 2 Comparison of genetic diversity between bear populations using bears in NW Dinaric Mountains (Slovenia, population Alps-Dinara-Pindos NW in bold face) as a reference to correct for different panels of loci and sample sizes

Discussion

The reference population approach provides a simple and easy-to-implement method of comparing genetic diversity between different populations of a species that were analysed in different studies using different loci, while collecting no or only minimal additional data. We demonstrate the application of this approach by evaluating the global distribution of genetic diversity of brown bears. Typically, there are two obstacles to comparing genetic diversity reported by different studies of the same species: different panels of genetic markers and differences in sample sizes. The standard approach to addressing this problem is to shrink the genetic marker set to the loci common to all studies, and to use the smallest sample size in any population to correct for unequal sampling (El Mousadik and Petit, 1996; Leberg, 2002). This approach works only if similar sets of markers were used to study all populations or if marker sets are very large, which is often not the case. Also, using a very small sample size to correct for unequal sampling greatly reduces the power to detect differences in allelic richness, which decreases the power of all comparisons (Leberg, 2002).

The reference population approach overcomes many of these issues with a simple solution of scaling the genetic diversity of each considered population relative to the genetic diversity of a single well-studied population, effectively using this reference population as a calibration ‘yardstick’. Its main advantage is the ability to compare studies that would be otherwise impossible to compare—for example, studies that have no common genetic markers—if the markers they used are also used in the study of the reference population. The problem of low power of comparison will still remain when a study with a small sample size is compared, but this would not affect the power of pairwise comparisons of other populations.

Technical considerations, application and limitations of the reference population approach

Application of this method requires a reference population with a large sample size and a large number of genotyped loci. It is beneficial if a large population with high genetic diversity is used as a reference. If a study is designed specifically to provide reference population data, the panel of loci chosen should cover all or the majority of the loci used in other populations of interest. As more journals require genotype-level data to be deposited in online data repositories, reference population data should be increasingly easy to obtain. When suitable reference population data are available, it is straightforward to compare genetic diversity estimated in any new study of the same species with the existing data, provided that a large enough proportion of the marker set matches the marker set of the reference population.

We used multiple subsampling (Leberg, 2002) to correct for unequal sample sizes in different studies. Although it is argued that allelic diversity is a better predictor of a population’s evolutionary potential than heterozygosity (Allendorf, 1986), it is also much more sensitive to sample size, and corrections for unequal sampling must be applied to calculate allelic richness if studies with different sample sizes are being compared (El Mousadik and Petit, 1996; Leberg, 2002). The most commonly used method is the rarefaction approach suggested by El Mousadik and Petit (1996). Simulations done by Leberg (2002) suggest that the multiple subsampling approach we used provides marginally better precision, but both methods perform adequately and without bias.
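
For comparison with the subsampling approach, rarefaction gives the expected number of distinct alleles in a random sample of n gene copies drawn without replacement from the observed allele counts. The sketch below uses the standard hypergeometric rarefaction formula. The function name is hypothetical, and n counts gene copies, that is, twice the number of diploid individuals.

```python
from math import comb


def rarefied_allelic_richness(allele_counts, n):
    """Expected number of distinct alleles among n sampled gene copies.

    allele_counts holds the observed count of each allele at one locus.
    Each term is the probability that the allele appears at least once
    when n copies are drawn without replacement.
    """
    total = sum(allele_counts)
    if not 0 < n <= total:
        raise ValueError("n must be between 1 and the number of gene copies")
    return sum(1 - comb(total - c, n) / comb(total, n) for c in allele_counts)
```

For example, with allele counts of 9 and 1 and a sample of two gene copies, the common allele is certain to be drawn and the rare one has a 0.2 chance of appearing, so the expected richness is 1.2.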

There was considerable variation in resampled allelic richness for the reference population (Table 2). This is a consequence of both subsampling to a smaller sample size, as rare alleles will get missed (see Leberg, 2002), as well as of the differences in locus panels that were subsampled to match the panels in the compared populations. The related standard error shows the standard error of allelic richness at the subsample size, providing the basis for comparison with the population of interest. Comparing calibrated expected heterozygosity to the values reported in original studies, it is clear that we would draw similar inferences using either the reported He or the calibrated indices (Table 2). Brown bears are studied with a relatively standard set of microsatellite markers, so all the studies included in this comparison had considerable overlap in markers. Although the reference population approach provides a formal framework for the bear case study, it should be even more useful in a species studied with a more diverse set of markers.

A logical precondition of the reference population approach is that the same type of genetic marker is used in all studies that are to be compared. We implemented the approach using microsatellite data; however, the general idea of using a ‘yardstick’ reference population could be transferred to other types of markers suitable for measuring genetic diversity (for example, single-nucleotide polymorphisms). Another potential problem for application is that sometimes only summary genetic diversity data are reported for a population, without any estimate of standard errors. Although such data are still useful, they do not allow hypotheses about the statistical significance of the observed differences between populations to be tested. This shows the importance of publishing standard error estimates in all genetic diversity studies, even if only a single population was studied. However, with recent changes to published data accessibility policies, such cases should become increasingly rare.

The brown bear case study

The dramatic range of genetic diversity in brown bears that was observed by Paetkau et al. (1998b) in North America is also evident at the global scale (Table 2). Most of the observed patterns are expected—high genetic diversity in large populations (Alaska, Canada, Carpathians, Dinaric Mountains) and very low levels of genetic diversity in populations that have been isolated for a long time or have passed through severe demographic bottlenecks. The demographic history of many of these populations shows a large decline and a questionable future: the Gobi population in Mongolia (McCarthy et al., 2009), Cantabrian population in Spain (Perez et al., 2009) and the population in the Apennines in Italy (Ciucci and Boitani, 2008).

However, the genetic diversity in these populations is higher than the diversity of Kodiak Island bears in Alaska. This latter population is relatively large (>2500) and healthy, with low genetic diversity attributed to a long period of isolation from the bears on the continent (Paetkau et al., 1998a, 1998b). On the other hand, the demographic history of the other populations with low genetic diversity is presumed to be one of a recent contraction and isolation. For example, the Apennine population is estimated at around 50 remaining animals (Gervasi et al., 2008) and has been isolated for at least 400–600 years (Ciucci and Boitani, 2008). The story is similar with the Cantabrian bears in Spain, where the population suffered a dramatic decline in recent centuries and is now threatened with extinction (Perez et al., 2009).

Despite evidence from Kodiak bears that a brown bear population can exist and even prosper at very low levels of genetic diversity measured at neutral markers, this should not be generalized to the small populations that live in human-dominated landscapes. An island population may stabilize in a mutation-drift equilibrium at very low levels of genetic diversity, but it is possible that these bears survived against all odds through many generations of reduced fitness, all the time purging strongly deleterious alleles (Paetkau et al., 1998b). Although this may be a plausible scenario in the Alaskan wilderness with favourable habitat and low human densities, the risk of inbreeding depression is likely to increase due to increased stress in degraded and human-dominated landscapes (Armbruster and Reed, 2005). For these populations, it is quite possible that they will need genetic rescue or restoration (Tallmon et al., 2004; Hedrick, 2005), or face extinction.

The highest genetic diversity levels were observed in the Carpathian brown bears. The population is relatively large, estimated to number around 8100 animals (Zedrosser et al., 2001), which may explain the high diversity. Another possible explanation for such high diversity might be historical mixing of animals from Eastern and Western glacial refugia, as suggested by mitochondrial DNA data (Zachos et al., 2008). It would be interesting to compare genetic diversity levels of large bear populations in the Russian Far East, but unfortunately there is no published research that would enable these comparisons.

Conclusions

Genetic diversity is a key component of long-term population viability (Allendorf and Ryman, 2002; Keller and Waller, 2002; O’Grady et al., 2006). By calibrating previously incompatible studies through comparisons with a reference population, we were able to directly compare neutral genetic diversity of brown bears from all previously studied populations. This method can easily be applied to other species and to test hypotheses about variables that influence genetic diversity across the range of a species. The method will also be helpful for identifying populations with low levels of diversity that have the greatest need for direct conservation actions, and can aid in providing the scientific justification needed to gain management and public support. The simplicity of the reference population approach should make it useful in future comparisons of genetic diversity estimates between previously incompatible studies and in improving our understanding of how genetic diversity is distributed along a species range.

Data archiving

Data and all R code have been deposited at Dryad: doi:10.5061/dryad.qt3j5.