Introduction

The relative importance of genetic and non-genetic factors in extinction liability has been extensively debated since Lande's (1988) landmark study suggested that demographic factors, such as disturbance-induced life history changes or habitat destruction, were more important predictors of extinction than inbreeding (Frankham, 2005; O'Grady et al., 2004). A recent meta-analysis by Spielman et al. (2004) has shown that critically endangered species usually have reduced genetic variability relative to non-threatened sister species. This has been interpreted as giving renewed support to the idea that genetic factors are important (deSalle, 2005), although clearly the two (demographic and genetic changes) must be intimately related. Very few studies have succeeded in dissentangling genetic and other causes of extinction (Saccheri et al., 1998).

It seems unambiguous that genetic variability must be important to the restoration of extinct or near-extinct species into the wild. More genetically heterogeneous stocks are likely to have a better chance of establishing themselves in an environment, which may be different in some respects to the original environment, although quantitative data on this are lacking. Ark projects should be managed to maintain genetic variability in stocks of endangered species, and considerable effort has been put into this. However, it is likely that stocks of species, which are being maintained for potential reintroduction, will previously have been subject to a level of inbreeding in the natural habitat, either owing to natural expansion-contraction cycles or more recent anthropogenic factors before their maintenance in captivity. The influence of historical bottlenecks on effective population size and genetic variability can usually only be inferred indirectly. Although population bottlenecks must be important in some well-known cases of inbred natural populations (Merola, 1994; Crnokrak and Roff, 1998), other studies imply that even severe historical bottlenecks can sometimes have remarkably little effect on reductions in genetic variation at neutral markers (Groombridge et al., 2000; Nichols et al., 2001) and quantitative genetic variation for fitness traits (Howard, 1993; Saccheri et al., 1996), two measures of genetic depletion that need not be correlated (Britten, 1996; Butlin and Tregenza, 1998).

Goodeid fish have often been reported as extinct or extirpated from the wild (De la Vega-Salazar et al., 2003a). This is a family of North American Cyprinodontid topminnows with a continental distribution in areas of increased geological and historical desertification (Webb et al., 2004). One species (Empetrichthys merriami) in the subfamily Empetrichthyinae (consisting of the genera Crenichthys and Empetrichthys, originally found in Nevada, USA) became extinct in the last century, and two are classified as vulnerable (IUCN, 2006). The more numerous and ecologically diversified subfamily, the viviparous Goodeinae, occupies most basins of Central Mexico. Of the 37 recognized species two are reported extinct (Allotoca meeki and A. catarinae, and in addition Characodon garmani was described from a single specimen), one is extirpated from the wild (Skiffia francesae) and two have been found in small natural populations despite being repeatedly reported as extinct (De la Vega-Salazar et al., 2003b). Ameca splendens and Zoogoneticus tequila are native to the Teuchitlán River, in the Ameca basin (western Mexico), which has experienced massive changes following the building of a dam in 1955. Captive populations of both species have been kept by zoological institutions and dedicated aquarists, and it was from such populations and museum specimens that Z. tequila was described (Webb and Miller, 1998). In 2001, a wild population of Z. tequila was discovered (Figure 1) within a very small spring. This was composed of only a handful of adult fish and a few tens of juveniles (De la Vega-Salazar et al., 2003a, 2003b), and represents an extreme case of vulnerability to extinction. In contrast, relatively large populations of A. splendens are found in spa springs (De la Vega-Salazar et al., 2003a) within its historical geographic range. Here, we compare the genetic variability, using microsatellite loci, of wild samples of Z. tequila (of adults from springs in the Teuchitlán area and their offspring) and A. splendens with samples from aquarists stocks maintained in the UK for up to 30 years.

Figure 1
figure 1

Female (above) and male Z. tequila from the only existing natural population. This was discovered in 2001 in a minute (4 × 4 m in diameter) spring near Teuchitlán, the species was deemed extinct in the wild when it was described in 1998 from museum and aquaria specimens.

Materials and methods

Sampling

Tail fin clips were taken from four populations of A. splendens and three populations of Z. tequila (see Table 1). One wild population of A. splendens was sampled from its native range in the headwaters of the Teuchitlán river, and samples were also obtained from three captive aquaria populations in the UK. The recently discovered wild population (also in the headwaters of the Teuchitlán river) of Z. tequila was also sampled. This is the only known wild sample of Z. tequila and appears to have been isolated in an extremely small spring (ca. 4 m in diameter) for many generations (De La Vega-Salazar et al., 2003a, 2003b). To increase the number of wild alleles sampled without further interference with the population, we included the F1 offspring of the original population (it was expected that some females collected from the wild were carrying young from previous matings). Fin clips from a captive population of Z. tequila were obtained from Chester Zoo. Both sexes were represented in all populations sampled.

Table 1 Populations of A. splendens and Z. tequila sampled

DNA extraction and microsatellite analysis

Genomic DNA was extracted from ethanol-preserved fin clips using the PureGene protocol (Gentra Systems, Minneapolis, MN, USA). We tested eight microsatellites that had been previously developed from various Goodeid species (Hamill et al., 2007): As2, As5, Ca6, Ca10, Iw193, Iw196, Xc18 and Xc25, and a further five from the literature: Zt1.2, Zt1.3, Zt1.6, Zt1.7 and Zt1.9 (Boto and Doadrio, 2003). Each 15 μl PCR reaction contained 0.2 mM each dNTP, 0.20–0.33 μ M each fluorescently labeled primer (Sigma-Proligo, Proligo, France SAS, Paris, France), 1.5 μl 10 × NH4 reaction buffer, 1.5 mM MgCl2, 0.1 μl Biotaq DNA polymerase (Bioline UK Ltd., 16 The Edge Business Centre, London, UK) and 10–300 ng DNA template. The PCR profile for all reactions was (1) 5 min at 94°C initial denaturation, (2) 15 s at 94°C denaturation, (3) 10 s at 60°C annealing (55°C for Ca10 and 62°C for Ca6), (4) 1 min at 72°C extension, (5) 39 repeats of steps 2 to 4 and (6) 10 min at 72°C final extension. PCR products were checked on 1.5% agarose gels. All samples were analyzed on a Beckman Coulter 3000 automated sequencer using 0.5–2.0 μl PCR product plus 0.5–1.0 μl 400 or 600 bp DNA ladder (Beckman–Coulter (UK) Ltd., Biomedical Research, High Wycombe, Buckinghamshire, England).

Four loci, Ca6, Ca10, Iw193 and Xc25 were monomorphic in both species and were not included in the analysis. Loci Zt1.3 and Zt1.7 did not amplify cleanly despite attempts to re-optimize, so they were also excluded. Locus Zt1.2 was monomorphic in A. splendens but polymorphic in Z. tequila.

Data analysis

Hardy–Weinberg equilibrium (HWE) and pairwise population differentiation were tested using Genepop v.3. 4 (Raymond and Rousset, 1995). Within-population measures of mean heterozygosity and mean number of alleles per locus are often presented as proxies for genetic diversity; however, we estimated allelic richness to correct for bias owing to sample size. We also calculated FIS for each population at each locus, and for each population across all loci combined, to compare levels of inbreeding, and between-population pairwise FST for each species. Allelic richness and FIS were calculated with FSTAT v.2.9.3 (Goudet, 2001), and FST was calculated with Genepop v.3.4 (Raymond and Rousset, 1995). In all the above analyses, we conservatively treated the parental and the F1 samples of Z. tequila separately, as these samples may differ, for instance, if we collected a different sample of males from that which fathered the offspring.

We used Migrate v.1.7.3 (Beerli, 1997–2002), part of the package LAMARC, to obtain maximum likelihood estimates of Ne of each population of A. splendens and Z. tequila. Although Migrate is normally used for inferring gene flow between populations, it can be used to draw inferences about single populations (e.g. Kuhner et al., 1998). This uses a coalescent approach coupled with a Monte–Carlo Markov chain technique to estimate θ (θ=4 Neμ, where Ne is the effective population size and μ is the mutation rate) for each population. Migrate uses a random seed to initiate calculations, so replicate estimates of θ vary. Ten trials were performed for each species and results were compared using one-way ANOVAs. An alternative approach (Vitalis and Couvet, 2001a, 2001b) estimates F (averaged one-locus identity probabilities; a within-population measure of Fst) and Ne for each population based on two locus gene identity probabilities. We used ESTIM (Vitalis and Couvet, 2001c) to generate estimates and confidence intervals (from bootstrapping) of F and Ne to compare with the results we obtained using Migrate.

These analyses are useful for estimating contemporary effective population sizes, but are limited because they cannot detect whether a population has been historically declining, expanding or stable. Like Migrate, the programme msvar (Beaumont, 1999) is based on a MCMC technique, but it uses a coalescent approach (assuming a step-wise mutation process) to provide joint likelihood estimates for two additional population demographic parameters of interest to this study. The first is r, where r=N0/N1, N0 is the current size of the sampled population and N1 is the size of a single stable population before decline or expansion. A declining population is expected to give log10(r) <0 and an expanding one log10(r) >0. A stable population size would give log10(r)=0. The second parameter is tf, which is the time (in generations) since the stable ancestral population began to decline or expand (ta), scaled by the current population size (N0), giving tf=ta/N0. We estimated the parameters of interest for A. splendens and Z. tequila on the basis of field observations and historical data (De la Vega-Salazar et al., 2003a) and on data from captive-breeding programmes at UNAM, to ensure that they were consistent with the starting parameter bounds used in the program. For A. splendens: N0≈4000, N1≈10 000 and ta≈50. For Z. tequila: N0≈90, N1≈4500 and ta≈50.

We hypothesized that the wild populations of A. splendens and Z. tequila have experienced recent population declines and therefore used an exponentially varying population model in the analysis. Only wild populations from each species were included in this analysis, and the data from Mexico Parents and Mexico Offspring in Z. tequila were pooled as a single wild population. As the MCMC chain may require a period of burn-in, we discarded the first 10 000 update steps out of 109 thinned update steps. Additionally, starting seeds and scale values were varied for three independent runs per species to check for convergence. Our parameter bounds on log10(r) and log10(tf) were (−4,−0) and (−2,−1), respectively. Increasing the bounds beyond these reduced convergence, we limited the main bounds used to our biologically derived estimates. Similar bounds were used by Beaumont (1999), and encompass a range of parameter values that we considered to be biologically sensible given our estimates of N0, N1 and ta for both species. For example, for r, N0 was allowed to be–at the lower bound–four orders of magnitude smaller than N1, and–at the upper bound–equal in size to N1. We plotted the joint posterior distribution of log10(r) and log10(tf) following the procedure of Beaumont (1999) and Storz and Beaumont (2002).

Results

HWE and population differentiation

We did not detect deviations from HWE in any population of Z. tequila at any locus, nor across all loci combined, after Bonferroni correction. This test was applied to detect potential null alleles. In this situation, a Bonferroni correction reduces the stringency of the test, that is, the likelihood that departures from HWE will be detected (Moran, 2003). The use of Bonferroni correction in this case is appropriate to help distinguish loci that are consistently and markedly out of HWE from ‘false positives’ owing to multiple testing. In A. splendens locus As5 was out of HWE in all populations after Bonferroni correction, indicating the potential presence of null alleles, and was therefore discarded from all subsequent analyses. Following this, no A. splendens populations showed departures from HWE across the remaining five loci after Bonferroni correction. Allele frequencies are given in Supplementary Tables 1 and 2.

Population differentiation and FST

Pairwise tests of differentiation showed that all Z. tequila populations differed significantly in allele frequency, except Mexico parents and Mexico offspring (χ142=11.056, NS), and the only A. splendens populations that did not differ significantly were Chester Zoo and Chester Zoo Jaguar House (χ28=11.086, NS). Similarly, pairwise FST estimates (Table 2) show that the captive populations of A. splendens are more similar to each other than they are with the wild population, whereas the wild and captive Z. tequila show a substantial degree of divergence. Perhaps unsurprisingly, there are substantial changes in gene frequency between the different stocks.

Table 2 Population pairwise FST estimates for (a) A. splendens and (b) Z. tequila Negative FST values are given as zero.

FIS

Table 3 shows FIS values for all populations at each locus, and across all loci (excluding As5 in A. splendens). Captive populations typically show larger values, suggesting that they have experienced inbreeding (heterozygous deficit).

Table 3 FIS values for each population (and locus) of A. splendens and Z. tequila, and θ values per population generated using Migrate

Genetic diversity

In both A. splendens and Z. tequila, allelic richness was significantly higher in the wild Mexican populations than in the captive populations (Figure 2a and b, respectively). Other diversity indices such as mean heterozygosity, mean number of alleles per locus and total number of alleles per locus showed similar patterns. Averaged single-locus gene identities (F) show the same pattern (Table 4).

Figure 2
figure 2

Wild Mexican populations show greater allelic richness in both (a) A. splendens (Kruskal–Wallis: H3,16=9.66, P<0.02) and (b) Z. tequila (Kruskal–Wallis: H2,18=12.69, P=0.002). Median values and the first and third inter-quartile ranges are plotted.

Table 4 Estimates (and 95% confidence intervals generated by bootstrapping) obtained using ESTIM for F and Ne (effective population size) in both species. Ne estimates are given for an upper and lower range of mutation rates (μ)

Theta (θ) estimates

In A. splendens, mean θ across 10 trials was significantly higher in the wild Mexican population (by a factor of four) than in the captive populations (one-way ANOVA: F3,36=224.51, P<0.001, Tukey's comparison significant at 3.81). However, in Z. tequila, the mean estimates of θ of the wild populations were about 20% smaller than that obtained for the captive population, although the difference was not significant (one-way ANOVA: F2,27=2.85, P=0.075; Table 3).

Comparison of Ne over a range of mutation rates (μ)

The central aim of our analysis was to draw a comparison of genetic variation between wild and captive populations; however, these data do allow an inference of relative effective population size. Any such estimate relies on knowledge of the mutation rate (μ) of the genetic markers used in the analysis. The analysis performed in Migrate assumes a constant–and equal–mutation rate across all microsatellite loci, but estimates of microsatellite mutation rates range widely depending on both the organism and the locus examined (Weber and Wong, 1993; Schug et al., 1998; DeWoody and Avise, 2000; Schlötterer, 2000). DeWoody and Avise (2000) used microsatellite mutation rates between 10−2 and 10−4 (based on studies in humans and Drosophila melanogaster) to estimate Ne in freshwater fish. As with the θ estimates, average Ne of the wild A. splendens population is highest, varying from 19 to 4867 within the range of mutation rates from 10−2 to 10−4. Average Ne of both wild and captive Z. tequila populations are in the same order of magnitude as that of the captive A. splendens, in the lowest instance (CZJH) only reaching 1164 for a mutation rate of 10−4. Values of Ne calculated using ESTIM are consistently smaller (Table 4) but showed similar trends. ESTIM produced more cases of undetermined Ne (Ne=) when ns (the two-locus averaged identity disequilibrium within subsamples relative to the total sample) was negative. Nevertheless, similar conclusions can be drawn from the use of these different methodological approaches; both populations of Z. tequila appear to be genetically very small, and are comparable to the captive populations of A. splendens.

Detecting population decline

Both A. splendens and Z. tequila show evidence of exponential population decline (Figure 3), although the evidence in Z. tequila is stronger. The more severe decline in Z. tequila may have occurred over a longer period of time, as the 10% highest posterior density (HPD) contour for tf is about one and a half orders of magnitude greater than that of A. splendens, indicating that the time since decline for A. splendens may have been much shorter. Contours for the HPD limits indicate the region within which the probability of the parameter values is a critical value (Beaumont, 1999). For example, the 90% HPD contour gives the region within which the densest 90% of sample points lie, and can provide a measure of confidence in the parameter estimation. Convergence (i.e. consistency in the 90% HPD contours) occurred in both species over multiple long runs of 109 thinned update steps using variable starting seeds and parameter scale values. However, the presence of multiple 10% HPD peaks in each run of A. splendens suggests that convergence in this species may be weaker (see Figure 3). The 90% HPD limits on log10(r) never cross zero for either A. splendens or Z. tequila in any of the runs, however, which provides consistent support for exponential population decline in both species.

Figure 3
figure 3

Joint posterior distribution of log10(r) and log10(tf) for wild A. splendens population (dotted line) and wild Z. tequila population (solid line) showing 10, 50 and 90% HPD contours.

The stronger convergence in Z. tequila, that is, the consistency of the 10% HPD contour across several runs, suggests that the population size of this species before decline was roughly three orders of magnitude larger than the present size. Convergence of 10% HPD contours in A. splendens is weaker. However, if we consider a range of values corresponding to the two peaks in Figure 3 (which were also consistent across runs), a predecline population could have been between 1.5 and three orders of magnitude larger than the present size.

Discussion

We have found that a wild population of A. splendens is much more genetically diverse than stocks, which have been maintained in captivity for up to 10 years. Its effective population size is more than an order of magnitude greater. However, Z. tequila (endemic to the basin of the small Teuchitlán River) provides a striking contrast. Although a wild population contains more alleles than the captive stock, the calculated θ, an index of effective population size is not significantly greater than that of the captive population. The allelic richness of the wild Z. tequila population is lower (around half) than that of wild A. splendens, and across the family Goodeidae as a whole allelic richness is more like that seen in wild A. splendens or greater (Hamill et al., 2007). The variability of allele lengths in Z. tequila suggests that the population decline has been stronger and over a longer period than that in A. splendens, which is consistent with their current conservation status. Our interpretation is that the wild Z. tequila population, which is the only known extant population (De la Vega-Salazar et al., 2003a, 2003b), shows signs of declining genetic variability over its probable ancestral levels of diversity. A previous report using fewer markers (Boto and Doadrio, 2003) found that Z. tequila had a lower heterozygosity than its common and widespread sister species (Z. quitzeoensis).

Our results appear to be consistent with the hypothesis of Spielman et al. (2004) that inbreeding drives extinction. However, interspecific comparisons fail to disentangle the potentially confounding problem of causality (i.e. whether species go extinct because they become inbred or whether species that are vulnerable to extinction are also, for unrelated reasons, prone to have reduced genetic variability), so such evidence is far from conclusive (Frankham, 2005). Lande (1988) argued that demographic factors are paramount in driving extinction, a view supported by some recent reports of substantial demographic recovery of genetically impoverished populations (e.g. the Oryx, Oryx leucoryx; Mésochina et al. (2003); the elephant seal, Mirounga angustirostrus; Hoelzel et al., 1993), or others where population reduction was not clearly associated with a decrease in genetic variability (the Mauritius kestrel, Falco punctatus; Groombridge et al., 2000). However, Saccheri et al. (1998) found that extinction risk in butterflies was better predicted by genetic than ecological factors. It is even uncertain whether small population size and isolation necessarily lead to the loss of genetic diversity (e.g. the Mauritius kestrel, Falco punctatus; Groombridge et al., 2000 and the butterfly Erebia epiphron silesiana; Schmitt et al., 2005). It appears that sometimes, if the impact of anthropogenic factors is severe, populations may go extinct without undergoing genetic bottlenecks. However, seemingly genetically impoverished populations can thrive, perhaps because inbreeding measured from neutral markers is often only loosely (if at all) related to quantitative fitness-related traits (Britten, 1996; Butlin and Tregenza, 1998; Slate et al., 2004).

Although it is extremely unlikely that all species become extinct due to inbreeding, the loss of genetic diversity can be a risk factor (O'Grady et al., 2004), and maintaining diversity is a major aim of ark projects. Estimates of the effect of inbreeding on fitness based on experimental manipulations (for instance, the superiority of cross- over self-pollinated Silene douglasii; Kephart, 2004) may not be applicable to naturally endangered species, which might have an evolutionary history of coping with low heterozygosity. Indeed, outcrossing may be detrimental for naturally inbred populations (e.g. outcrossed inbred males produce malformed tadpoles in Rana temporaria; Sagvik et al., 2004), particularly when they are outcrossed with captive populations, as demonstrated by the loss, rather than gain of genetic diversity in the Western mosquito fish, Gambusia affinis following the release of fish from captive stocks (Stockwell et al., 1996). There is also evidence from captive studies that establishing populations from multiple stocks can lead to outbreeding depression in the related G. holbrooki (Leberg, 1993).

There are no estimates of demographic parameters such as mean litter size, age at first reproduction or growth rate for either A. splendens or Z. tequila, yet recently established stocks derived from these wild parents are currently thriving in outdoor ponds. Thus, there is no obvious evidence of inbreeding depression, perhaps because deleterious alleles have been purged. We propose that, evidence of low θ values for Z. tequila notwithstanding, the viability of wild stocks should not be compromised by attempts to increase their genetic diversity in the absence of evidence of inbreeding depression. In the case of A. splendens, introducing fish from captive stocks cannot substantially increase genetic diversity, and may promote the spread of deleterious alleles adapted to captivity (Lynch and O'Hely, 2001). Indeed, there is evidence (Kelley et al., 2006) that captivity has led to a higher, potentially maladaptive aggressive behavior in this species. The case of Z. tequila is more complex. Both captive and wild populations have private alleles and thus can potentially enrich the genetic diversity of each other. However, the wild population has survived in isolation, at small numbers, for at least 15 years (De la Vega-Salazar et al., 2003b), which is probably long enough to have undergone some level of local adaptation. In this case, we face a poignant trade-off: to risk breaking down a locally adapted genome (and one purged of deleterious alleles) by introducing alleles adapted to captivity, or to risk local extinction due to stochastic environmental changes to which this genetically impoverished population cannot adapt.

The solution to such trade-offs partly depends upon our definition of a population. If we accept that the small deme of Z. tequila still holding out at a tiny spring in Teuchitlán is unique, we would be endorsing the value of its genetic identity. This would preclude rescue attempts such as that conducted with the population of adders (Vipera berus) at Smygehuk, Sweden (Madsen et al., 1999), where population decline prompted an exercise in genetic rescue that led not only to a demographic recovery, but also produced a population that is genotypically different from the original one. If the objective was to save the unique population of adders at Smygehuk, the attempt failed, but if the aim was to keep alive a population of adders at Smygehuk (with some of its original genes), it was an unqualified success. We think that the unique population of Z. tequila at Teuchitlán is beyond the point of no return unless we manage to recover a substantial portion of its original habitat and thus allow it to increase its numbers. But we can preserve some of its genes, we can combine them with those of the captive stocks and we can design management programes that minimize the negative impact of captivity by incorporating selective agents as similar as possible to those experienced in the wild. But, in the long run, husbandry leads to domestication. Skiffia francesae, a relative of these species, is completely extinct in the wild and we can detect no allelic variation at these loci in captive populations (personal observation; see also Boto and Doadrio, 2003). Ultimately, we must restore the environment to maintain critically endangered populations of species. In the mean time, efforts should be made to understand the relationship between quantitative fitness traits and inbreeding at neutral loci, so as to be able to assess the potential for survival of small populations when released in recovered environments.