Introduction

In a fragmented habitat with restricted migration, genetic drift will, together with natural selection and mutation, lead to local genetic differentiation. In a continuous habitat, however, where no obvious fragmentation seems to prevent individual movement, genetic differentiation can also be present. Genetic drift occurs locally and may create genetic structure, a process called ‘isolation by distance’ (Wright, 1943). Dispersal among neighbouring clusters of related individuals leads to local gene flow, a process approximated by the stepping-stone model (Kimura & Weiss, 1964). The resulting pattern of local similarity and increasing differentiation with distance can, over considerable periods of time, extend to large geographical areas. At such large distances, historical events such as population bottlenecks, range expansion or colonization may, in addition to individual movement, shape the observable spatial structure (e.g. Avise, 1994). Here, we investigate the genetic structure of an abundant population of Siberian lemmings (Lemmus sibiricus Kerr 1792) living in a continuous habitat. Specifically, we ask to what extent the ‘isolation-by-distance’ model is appropriate for understanding the processes generating the observed spatial structure.

The choice of sampling units is crucial in order to investigate population structures, and may influence the conclusions of a study. No natural habitat is truly continuous, and at some scale a patchy distribution of individuals will typically be observed. It may, however, be difficult to define discrete subunits. The genetic structure of populations in approximately continuous habitats has been described for a variety of organisms. Such studies have, however, commonly been limited to some particular sampling scales. Some studies focus on very small scales and compare individual genotypes in order to obtain information about spatial clustering of relatives (e.g. Ishibashi et al., 1997a) or local groups of similar genotypes (e.g. van Staden et al., 1996). Other studies, addressing the consequences of habitat fragmentation, compare samples from habitat fragments of a certain size to samples taken from a continuous habitat on a similar scale (e.g. Knutsen et al., 2000). Few studies have attempted to compare levels of genetic differentiation at different geographical scales (e.g. Leblois et al., 2000). This we do here.

The Siberian lemming is a microtine rodent that is widespread on the tundra of the Russian Arctic, from the White Sea to north-eastern Siberia (e.g. Stenseth & Ims, 1993: chap. 3). In largely undisturbed regions, such as the northern part of Taimyr Peninsula, the tundra is a relatively homogeneous environment; suitable habitat for Siberian lemmings is predominant in the lowlands of this area. As in most of the Siberian Arctic, the Siberian lemming is the most common small rodent species on northern Taimyr (Sdobnikov, 1957; Chernyavski & Tkachev, 1982) and may be considered a good model of an abundant species living in a continuous habitat. Their numbers vary, however, with great amplitude as they experience large, regular fluctuations in population densities typical for northern microtine rodents (Sdobnikov, 1957; Stenseth & Ims, 1993: chap. 4).

Only fragmentary information is available on the spatial biology of lemmings. Mass movements similar to those described for the Norwegian lemming (L. lemmus) in Fennoscandia have not been reported in Siberian lemmings (Stenseth & Ims, 1993: chap. 8). For this species, local migratory movements during snow melt in spring seem typical. These movements rarely exceed one kilometre but may be quite conspicuous in years of high population densities (Chernyavski & Tkachev, 1982). During spring, lemmings have been observed on snow and ice. They have also been reported to cross a one kilometre wide frozen river on the Taimyr Peninsula and to swim over 30 m of open water (Orlov, 1980). Male lemmings appear to move more than females (e.g. Stenseth & Ims, 1993: chap. 20), as is typical for most microtine rodents (e.g. Gaines & McClenaghan, 1980). Home ranges are larger for males with an average estimate of 1.3 ha vs. 0.7 ha for females in L. sibiricus (Chernyavski & Tkachev, 1982) — estimates which are in the same order of magnitude as for other Lemmus species (Banks et al., 1975; C. Gower, personal communication). Sex-biased dispersal is expected to be reflected in the genetic data as contrasting female vs. male patterns (cf. Favre et al., 1997).

Here we aim to determine the scale at which genetic differentiation is evident in Siberian lemmings, and we ask in particular whether isolation by distance is found in the more or less continuous tundra habitat on western Taimyr. We analysed the spatial distribution of individual genotypes, as well as discrete population samples separated by different geographical scales. Sex-specific patterns of genetic differentiation were also addressed. We found a ‘patchy’ genetic structure characterized by negligible local differentiation up to about 8 km or more, no pattern of isolation by distance among such patches and a tendency for related genotypes to be clustered at the smallest scale. We interpreted these patterns as populations not yet having reached an equilibrium structure after a major mixing event — an interpretation supported by the lack of concordance between mitochondrial DNA phylogeny and the geographical distribution of haplotypes. We furthermore concluded that considerable male-biased gene flow occurs in Siberian lemmings.

Materials and methods

Study area and sample collection

Western Taimyr was chosen for its extensive continuous tundra areas. The landscape along the coast in this region consists of gently rolling hills up to 50 m high, with elevation gradually increasing inland. Numerous rivers flow into the sea. Most of the study area lies within the southern part of the arctic tundra subzone, and the four southernmost localities belong to the typical tundra subzone (Chernov & Matveyeva, 1997). The habitat remains wet throughout the summer in low-lying depressions, whereas on the hills it becomes drier; along rivers a meadow-like vegetation is found. Siberian lemmings prefer wet areas, where the vegetation is characterized by sedges, cotton grass and mosses (e.g. Chernyavski & Tkachev, 1982). During the summer, from July to mid-September, lemmings inhabit wet slopes or areas where the micro-relief allows them to establish dry burrows and feed in the wet meadows. The distribution of winter nests and runways indicates that in winter they concentrate in the lowest parts of the landscape, where a thick snow cover accumulates (Sdobnikov, 1957; D.E., personal observations). Distances between potential summer and winter habitats are up to 100 m.

Sampling was carried out at three geographical scales. (1) At the smallest scale, 223 lemmings were collected more or less continuously over a linear transect 11 km long (referred to as ‘the transect’); the geographical coordinates for each individual were determined using GPS. (2) At an intermediate scale, lemming samples were collected at 12 mainland localities separated by 10–30 km along the coast (Fig. 1). Sampling was carried out in the summer of 1998 (a year of rather low lemming densities) and 1999 (a peak density year). In 1998, lemmings were caught with snap-traps set in runways or at the entrance of burrows. In 1999, most lemmings were captured or dug out of their burrows by a dog. In order to avoid sampling close relatives, we generally collected only one individual from each burrow system. (3) At the largest scale, specimens from two remote localities on northern Taimyr (1 and 2; Fig. 1) were obtained from the Swedish–Russian Tundra Ecology Expedition 1994 (Grönlund & Melander, 1995).

Figure 1
figure 1

Map of the study area. The small square in the upper insert indicates the geographical location of the main map. Locality names are as follows: (1) Cheliuskin, (2) north-western Taimyr, (3) Morzhovaya, (4) Zeledeevo, (5) Dva Medvedya, (6) East Golomo, (7) West Golomo, (8) Dikson, (9) Medusa, (10) Budenovets, (11) Brazhnikova, (12) Osipovka, (13) Omulevaya and (14) Shirokaya. The shaded rectangle between localities 7 and 8 represents the small-scale transect: on the enlarged view (lower insert) dots represent individual lemmings and dotted circles indicate the 10 subsamples referred to as A to J.

The specimens collected were killed and liver, kidney or muscle tissue was preserved in 90% ethanol. Total genomic DNA was isolated using proteinase K digestion, NaCl precipitation of proteins and DNA precipitation with isopropanol (Miller et al., 1988).

Genetic analysis

Two types of genetic marker were used: the control region (CR) of the mitochondrial genome (mtDNA), and four autosomal microsatellite loci.

mtDNA

A 380-bp fragment of the 5′ hypervariable part of the mtDNA CR was sequenced for a total of 346 Siberian lemmings. For PCR amplification we used primers designed from a Clethrionomys glareolus mtDNA sequence (GeneBank Accession no. Y07543): 5′-TCCCCCACCATCAGCACCC-3′ and 5′-CATCCCGAAAATTAAAAAATACC-3′. Amplification was carried out in a total volume of 50 μL, which contained 0.5 μM of each primer, a small amount of template DNA, 75 mM Tris-HCl (pH 8.8), 20 mM (NH4)2SO4, 0.2 mM dNTP, 2.5 mM MgCl2, and 0.5 units DyNAzyme II DNA polymerase (Finnzymes OY, Espoo, Finland). The PCR programme was 5 min at 94°C, followed by 35 cycles of 30 s at 94°C, 30 s at 54°C and 1 min at 72°C, then a final 5 min step at 72°C. PCR fragments were purified using the PCR Product Pre-Sequencing Kit (Amersham Life Sciences, Cleveland, OH). Sequences were determined manually using the Thermo Sequenase Radiolabelled Terminator Cyclic Sequencing Kit (Amersham Life Sciences) and electrophoretic separation of the radioactively labelled fragments on polyacrylamide gels. Sequencing was also performed on an ABI Applied Biosystems 373A automated DNA sequencer using the Dye Terminator Cycle Sequencing Kit (PE Biosystems, Warrington, UK) according to the manufacturer’s instructions. The primers used for sequencing were the same as used for PCR amplification. Sequences of 332 bp (corresponding to positions 15 436–15 770 in Mus musculus; Bibb et al., 1981) were obtained for all individuals (GeneBank Accession Nos AF355404–AF355454, excluding AF355341 and AF355412).

Microsatellites

Initially, we tested 21 microsatellite primer pairs published for different species of microtine rodents. Four of them (MSCRB-5A, MSCRB-6, Ishibashi et al., 1997b; AV-13, Stewart et al., 1998; and MSCg-9, Gockel et al., 1997) detected polymorphic products and could be reliably scored (Table 1). The PCR amplification, for which one primer was end-labelled with radioactive α32P-ATP, and electrophoresis of the fragments on polyacrylamide gels was carried out as described in Ehrich et al. (2001). Annealing temperatures and MgCl2 concentrations for each locus are given in Table 1.

Table 1 Species of origin and reference, repeat motif, PCR conditions (annealing temperature and MgCl2 concentration) and total number of identified alleles for the four microsatellite loci that were reliably amplified in Lemmus sibiricus. The repeat motif is given as described for the source species

Statistical analysis

For most analyses, we pooled samples from the intermediate (2) and the large-scale (3) localities. For the small-scale transect data, two types of analysis were used: an individual-based analysis (including 140 individual mtDNA CR sequences and 204 genotypes at four microsatellite loci) and a group-based analysis for which the transect was divided a posteriori into 10 discrete geographical subsamples (A to J) by excluding individuals from intermediate locations (Fig. 1).

Levels of mtDNA sequence polymorphism were estimated as unbiased haplotype diversity (H) and nucleotide diversity using ARLEQUIN 1.1 software (Schneider et al., 1997). Levels of microsatellite polymorphism were estimated as average expected heterozygosity (HE; Nei, 1987: p.178). Inbreeding coefficients (FIS) were calculated for each locality using the program FSTAT 2.8 (Goudet, 1995). GENEPOP 3.1 (Raymond & Rousset, 1995) was used to conduct exact tests for deviations from Hardy–Weinberg and for linkage equilibrium for each locality and locus or pair of loci, respectively. For both series of exact tests the significance level was adjusted using a Bonferroni correction.

At the individual level, population structure was analysed using the mtDNA data by plotting the proportion of identical pairs of haplotypes per distance class. Spatial autocorrelation in the individual microsatellite data was examined by calculating Smouse & Peakall’s (1999) autocorrelation coefficient. In both cases, 300 m distance classes were chosen, corresponding roughly to twice an upper estimate of the diameter of Lemmus home ranges (Chernyavski & Tkachev, 1982). Significant departures of spatial autocorrelation patterns from random distribution of genotypes was tested for by plotting the 95% null hypothesis confidence intervals. These confidence intervals were estimated by resampling, using 5000 permutations of individuals (four-locus genotypes for microsatellites) among geographical locations.

From the individual microsatellite data, we also estimated Wright’s ‘neighbourhood size’ 4Dπσ2, where D is the population density and σ2 is the variance of the axial distance between parents and offspring under the assumption of isotropic dispersal in two dimensions. Because in this case the average distance is equal to zero σ2 also corresponds to the average of the squared distances. The quantity σ thus gives an indication about the average gene flow distance per generation (Rousset, 1997). The neighbourhood size was estimated as the inverse of the slope of the regression line of ar (a parameter analogous to FST/(1 − FST) which can be estimated for pairs of individuals) vs. the logarithm of geographical distance (ln-distance) according to Rousset (2000). This approximate linear relationship is expected to hold less well at very short distances (less than σ) and to progressively depart from linearity at large distances, depending on the mutation rate (μ) of the genetic marker used. Distances recommended for estimation range from σ to 0.5σ /√2μ (Rousset, 1997). Assuming a mutation rate of μ=5 × 10–4 for microsatellite loci (e.g. Goldstein & Schlötterer, 1999) and arbitrarily choosing σ=150 m, the recommended upper limit is in the order of 2.5 km, whereas for σ=500 m it is around 8 km. Because independent estimates of σ are lacking for Siberian lemmings, we excluded comparisons between individuals separated by less than 150 m and estimated the regression slope for maximal distances ranging from 1 km to 11 km (total transect). The significance of the slopes was tested by a Mantel test (5000 permutations of individuals among geographical locations; Mantel, 1967).

The genetic population structure among discrete sampling localities (the 10 subsamples of the transect as well as the larger scales) was characterized from haplotype or allele frequencies by FST-values estimated in an analysis of molecular variance (AMOVA) using ARLEQUIN 1.1. Our study focused on the effects of local genetic drift vs. migration in shaping population structure at a spatial and temporal scale at which the influence of mutation is assumed to be negligible. Therefore, sequence differences among mtDNA haplotypes or differences in number of repeats among microsatellite alleles were ignored. Private mtDNA haplotypes occurring within one local sample were pooled for the frequency analysis. Locality 5, where only four individuals were captured, was excluded from the AMOVA. Confidence intervals were estimated for microsatellite values as the second smallest and the second largest value of all 35 possible ‘bootstrap’ resampling arrangements across the four analysed loci. In addition, FST was calculated for pairs of adjacent localities. All FST-estimates were tested for significant difference from zero by means of a permutation test exchanging individual genotypes between populations (using 10 000 permutations or 1000 permutations for pairwise estimates). Isolation by distance was analysed by regressing pairwise estimates of FST/(1 − FST) against ln-distance between sites (Rousset, 1997). A Mantel test was performed to test the correlation between genetic differentiation and geographical distance using GENEPOP 3.1 (10 000 permutations).

The degree to which the observed microsatellite allele frequencies were characteristic of each sampling locality was examined by reassigning each individual to the sample where its genotype was most likely to occur (assignment test; Waser & Strobeck, 1998). The calculations were carried out with the ASSIGNMENT CALCULATOR program (J. Brzustowski; available at http://gause.biology.ualberta.ca/jbrzusto/Doh.html), choosing the option to adjust all allele frequencies in every population in order to avoid zero frequencies. Significant departure from a random distribution of genotypes was tested for by 10 000 randomizations, which were created by resampling existing individuals with replacement from the combined pool of populations. A corrected assignment index (AIc; cf. Favre et al., 1997) was calculated for each individual by subtracting population means after log-transforming the probability of occurrence of its genotype in its original locality. The AIc-values were compared between the sexes (excluding juveniles) to address sex-specific patterns (Favre et al., 1997).

A mtDNA CR cladogram was constructed according to the phylogeny reconstruction algorithm described by Templeton et al. (1992) using the software TCS 1.0 (Clement et al., 2000). This method, which was specifically developed for intraspecific phylogeny, estimates a network with connections having a probability of more than 0.95 of being parsimonious. The maximum number of differences between sequences allowing parsimonious connections was estimated using the program PARSPROB 1.1 (D. Posada; available at http://bioag.byu.edu/zoology/crandall_lab/programs.htm). When parsimony cannot be justified at the 0.95 probability level, both parsimonious and non-parsimonious connections are allowed until the cumulative probability of the connections exceeds 0.95 (Templeton et al., 1992). The algorithm thus explicitly allows for ambiguities in the cladogram.

Results

Gene diversity

The mtDNA CR sequences were highly variable. A total of 49 different haplotypes was found among the 346 individuals analysed (see Appendix 1). At the two larger geographical scales, the average sample size (n) per locality was 16 individuals (ranging from 8 to 24, excluding locality 5) and haplotype diversity ranged from 0.55 to 0.91 (average H=0.82, SD=0.11). Among the 140 individuals of the transect, 20 different haplotypes were found and haplotype diversity ranged from 0.69 to 0.92 (average H=0.86, SD=0.07) in the 10 subsamples (average n=10, ranging from 8 to 12). Thirty-nine nucleotide positions were variable in the analysed fragment (Appendix 2). The haplotypes differed by up to 17 bp (5.15%) and the mean number of pairwise differences was 7.7 (SD=3.6), resulting in an overall nucleotide diversity of 2.33% (SD=1.20%). Nucleotide diversity per locality ranged from 0.87% to 2.60% at the larger scales, with an average of 1.99% (SD=0.45%). It was slightly higher for the 10 subsamples of the transect for which the average was 2.31% (SD=0.43%, ranging from 1.76% to 2.75%). No geographical pattern in mtDNA diversity was evident. There was, however, considerable heterogeneity in the geographical distribution of the haplotypes (Appendix 1). Some haplotypes were common and occurred in almost all localities (e.g. haplotype 10), whereas others had a more limited geographical distribution. Apparently private haplotypes were found in most localities. Among these, 10 haplotypes were observed exclusively in one of the two remote northern localities, six at Cheliuskin (1) and four at north-western Taimyr (2).

Although the four microsatellite loci were polymorphic in all local samples, genetic variability differed considerably among loci. The number of alleles per locus ranged from four (with one highly frequent allele in all local populations) up to 33 (Table 1) and between two and 17 alleles were observed in samples from a single locality. Genetic diversity (HE) differed considerably among loci. Average values were similar for localities at the different scales with average HE=0.65 (SD=0.04, ranging from 0.57 to 0.71) for localities of the two larger geographical scales (average n=19, ranging from 8 to 29) and average HE=0.67 (SD=0.04, ranging from 0.61 to 0.74) for the subsamples of the transect (average n=18, ranging from 11 to 21). No consistent geographical pattern in microsatellite diversity was observed.

A significant deviation from Hardy–Weinberg proportions was observed only for locus AV-13 at north-western Taimyr (2). There was, however, a tendency for a deficit of heterozygotes at all loci and scales, and estimated inbreeding coefficients (FIS) were positive for all local samples except Zeledeevo (4). This tendency was significant at the two larger sampling scales (sign test among all single locus FIS-values corresponding to a significant deviation from Hardy–Weinberg proportions at α=0.05: P=0.003). Because the observed heterozygote deficiency was not particular to one locus, it cannot be attributed to a null allele. The most likely explanation seems subdivision in some of the local populations (i.e. a Wahlund effect). No significant linkage disequilibrium occurred at any locality, nor was it detected between any pair of loci for the total transect data (n=204). Hence, independence among loci was assumed in the further analyses.

Transect: individual-based analysis

For the mtDNA data, a plot of the proportion of identical haplotypes per 300 m distance class showed a clear trend with distance. In particular, we observed a higher similarity than expected under random distribution among individuals in the first three classes, and significantly lower similarity in some classes beyond 8 km (Fig. 2a). For the microsatellite data on the other hand, no obvious pattern was apparent from the multilocus spatial autocorrelogram (Fig. 2b). There was, however, a slight tendency for a decrease in similarity with distance up to about 2.5 km, with a single significantly positive autocorrelation coefficient in the second distance class (300–600 m between individuals). The same statistics were plotted for each sex separately (not shown). For both genetic markers, a marginally significant tendency for higher similarity at short distances (below 1 km for mtDNA and below 3 km for microsatellites) was apparent for females but not for males.

Figure 2
figure 2

Genetic similarity between individual Siberian lemmings on the 11 km transect vs. geographical distance. A 95% null hypothesis confidence interval was estimated from 5000 permutations of individuals among geographical locations and is indicated by dotted lines. (a) Proportion of identical pairs of mtDNA CR haplotypes per 300 m distance class. (b) Spatial autocorrelogram estimated from multilocus microsatellite genotypes for 300 m distance classes.

For all pairs of individuals separated by more than 150 m, the slope of the regression of individual genetic differentiation (ar; microsatellite data) against ln-distance was close to zero (estimated slope: 0.0003). The slope was, however, significantly positive when only parts of the transect were included in the analysis and pairs of individuals separated by the largest distances (more than 7.5 km) were excluded (Fig. 3). Estimates of the slope for this part were between 0.004 and 0.01, corresponding to neighbourhood sizes between 250 and 100.

Figure 3
figure 3

Genetic differentiation at four microsatellite loci between individual Siberian lemmings on the 11 km transect. The insert shows pairwise estimates of ar plotted against ln-distance, as well as two examples of regression lines including pairs of individuals separated by distances up to 1.5 km and 5 km. On the main figure the slopes of such regressions are plotted against the maximal distance taken into account for the estimation. The arrows indicate the location of the two example lines from the insert. Null hypothesis 95% confidence intervals were estimated from 5000 permutations of individuals among geographical locations and are indicated by dotted lines.

Transect: subsample analysis

The mtDNA sequences showed moderate, but significant, differentiation among the 10 subsamples of the transect with an FST of 0.058 (Table 2). Pairwise FST-values between neighbouring subsamples revealed that this differentiation was mainly due to an abrupt differentiation between subsamples H and I (Table 3). Dividing the transect at this position resulted in non-significant differentiation on each side of this ‘border’ (A–H: FST=0.025, P=0.09; I–J: FST=0.061, P=0.11). In order to examine the extent of the observed areas of no differentiation, we included the surrounding sampling localities in the analysis. Dikson (8) was negligibly differentiated from A–H (FST=0.026, P=0.07). West Golomo (7) was less differentiated from A–H (FST=0.027, P=0.07) than from I–J (FST=0.1, P=0.01), and East Golomo (6) was not differentiated from I–J (FST=0.021, P=0.23). The pattern emerging from these observations is that there are two distinct patches, within which mtDNA haplotype frequencies show no differentiation.

Table 2 Genetic differentiation among local samples of Siberian lemmings at different geographical scales
Table 3 Pairwise FST-estimates among neighbouring subsamples of Siberian lemmings on the small-scale transect

For the microsatellite data differentiation was lower than for mtDNA with an FST of 0.014 (but still significantly greater than zero; Table 2). Pairwise FST-values between neighbouring subsamples showed that most differentiation was found around subsample B (Table 3). Differentiation among the subsamples C–J was not significant (FST=0.007, P=0.08). As above, surrounding sampling localities were included in the AMOVA: C–J were not differentiated from West and East Golomo (6 and 7; FST=0.008, P=0.07) and B was not differentiated from Dikson (8; FST=0.015, P=0.12), indicating areas with homogeneous distributions of allele frequencies comparable to those revealed by the mtDNA data. In agreement with the F-statistics, an assignment test showed weak genetic structuring of microsatellites at this scale: only 14% of the individuals were correctly assigned to their source sample and only subsample B was significantly characterized by the observed allele frequencies (P=0.035). Mean AIc-estimates were slightly lower for males than for females (mean AIc=−0.059 for males and 0.048 for females), but the difference between sexes was not significant (Mann–Whitney U-test).

Pairwise estimates of FST/(1 − FST) from the mtDNA data increased significantly with ln-distance (Fig. 4; Mantel test P=0.042). This positive correlation was, however, largely due to the fact that samples from opposite sides of the ‘border’ were, on average, both genetically more differentiated and geographically more distant, and thus cannot readily be interpreted as a progressive pattern of increased differentiation with distance on this scale. No correlation of genetic differentiation with geographical distance was observed for the microsatellite data (Fig. 4).

Figure 4
figure 4

Genetic differentiation with distance for mtDNA and microsatellite data among pairs of subsamples of the transect (small scale) and pairs of localities (intermediate and large scale): pairwise estimates of FST/(1 − FST) are plotted against ln-distance. In the first panel, the fitted regression is FST/(1 − FST)=−0.11 + 0.02 ln(distance); full circles represent pairs of subsamples within one genetic patch, whereas open circles represent pairs from different patches.

Intermediate and large scale

Both markers showed clear genetic differentiation among localities at the intermediate sampling scale. For the mtDNA data, the FST-estimate of 0.18 was considerably higher than within the transect (Table 2). Only a slight additional increase in FST was observed for both markers when including the two remote northern sites (cf. Table 2). For mtDNA, pairwise FST-estimates revealed significant differentiation between all neighbouring localities (Table 4). For microsatellites, differentiation was weaker and significant only for four pairs of neighbouring localities (Table 4). In fact, when excluding localities 7–10 from the AMOVA, differentiation among the remaining localities of the intermediate sampling scale was not significant (FST=0.009, P=0.08).

Table 4 Pairwise FST-estimates among neighbouring samples of Siberian lemmings at an intermediate and large scale. The dotted lines represent the larger geographical distances to the first two localities (cf. Fig. 1)

The assignment test for microsatellites revealed a somewhat clearer characterization of local samples at the intermediate sampling scale. Among all individuals, 34% were correctly assigned to their source locality, and the number of correctly assigned individuals was significantly higher than under random distribution of genotypes for eight of the 11 sampling localities. Characterization of the local samples was weaker in the southern part of the study area and not significant for Brazhnikova (11), Omulevaya (13) and Shirokaya (14). As indicated at the small scale, AIc-estimates at the intermediate scale were significantly more negative for males than for females (mean AIc=−0.023 for males and 0.028 for females; Mann–Whitney U-test: 0.01 < P < 0.02).

No pattern of increased genetic differentiation with geographical distance was observed for either genetic marker at the intermediate scale (Fig. 4). Pairwise estimates of FST/(1 − FST) were somewhat higher only when including the northernmost locality, Cheliuskin (1). For microsatellites, the lack of concordance between genetic similarity and geography was confirmed by an UPGMA cluster diagram (not shown) based on Nei’s unbiased genetic distance among localities (Nei, 1987), which showed no clear clustering of adjacent localities (mtDNA see below). Pairwise values of FST/(1 − FST) were, however, generally higher at the intermediate scale than at the small scale of the transect (Fig. 4).

mtDNA CR phylogeography

A cladogram allowed us to identify distinct groups of related mtDNA haplotypes, despite a considerable amount of homoplasy in the dataset reflected by several unresolved loops (Fig. 5). Seven-bp differences were estimated as the maximum for parsimonious connections at a 0.95 probability level. Group B, whose haplotypes differ from all the others by at least 9 bp, could therefore not be connected in a parsimonious way to the rest of the network. In addition to the cladogram, a neighbour-joining (NJ) tree based on Kimura’s two-parameter distance among haplotypes (not shown) was estimated using the program MEGA 1.0 (Kumar et al., 1993). The clades of the NJ tree correspond to the groups appearing in the cladogram. Statistical support of the clades was assessed by 5000 bootstrap replicates of the NJ tree. Two of the larger haplotype groups correspond to clades with noteworthy bootstrap support. Hence, three major lineages could be identified: A (66% bootstrap support), B (90% bootstrap support), and the remaining haplotypes.

Figure 5
figure 5

Network showing the relationships among the 49 mtDNA CR haplotypes observed in Siberian lemmings. Numbers in circles identify the haplotypes and sizes of the circles are proportional to the number of individuals with each haplotype. Dark shading indicates haplotypes found only at Cheliuskin (1) and light shading indicates haplotypes found only at north-western Taimyr (2). Dotted lines surround clades supported by more than 65% (thin) or 90% (bold) bootstrap support in a neighbour-joining tree showing the same structure as the present network. A and B designate two of the three major lineages which could be identified; the rest of the network was considered as the third.

No associations of the three major lineages or other clades could be observed with geographical locations. Thus, haplotypes specific to the two remote northern localities (1 and 2) were spread over most of the cladogram and occurred in all phylogenetic groups, except in the clade surrounding haplotype 7 (Fig. 5). All three major lineages were geographically widespread on the scale of our study and closely related haplotypes were often found in distant localities (Appendix 1).

Discussion

Patchy genetic structure

The genetic structure of the Siberian lemming populations was characterized by patches of genetic similarity, among which no larger scale pattern was obvious. There was, notably, no progressive increase in differentiation with geographical distance as expected under the isolation-by-distance model. This general pattern was particularly clear for mtDNA. In the transect, most of the genetic differentiation could be attributed to a division into two distinct groups of subsamples, within which haplotype distribution was relatively homogeneous. Localities 7 and 8, sampled in 1998 adjacent to the transect, which was sampled in 1999, could be attributed to either patch, showing that the pattern was stable over at least two years. Despite the abrupt genetic division into two groups, we could not identify any corresponding physical barrier to movement in the field (such as a large river or a cliff), which could explain the genetic division between subsamples H and I. The first genetic patch (including subsamples A–H) covered at least 8 km, whereas the second might extend over as much as 17 km, as subsamples I and J were not significantly differentiated from the more remote East Golomo (6). On the intermediate scale, where distances between sampling localities were in the order of magnitude of the identified patches or larger, genetic differentiation was clear among all neighbouring samples, but did not increase with geographical distance. An increase in genetic differentiation was observed only at the large scale for pairs of localities including Cheliuskin (1; the northernmost locality).

The pattern revealed by microsatellites was largely consistent with the patchy structure of mtDNA variation. In the transect, the distribution of genetic variability might be interpreted as two patches of relative homogeneity comparable to those described for mtDNA, but the border between them occurred at a different location. This microgeographic discrepancy, together with the absence of visible geographical barriers at either genetic border, suggests that the observed pattern might be created by gene flow in the absence of habitat constraints. Because the mtDNA data reflect only female gene flow (and male dispersal within generations) whereas the autosomal microsatellite data reflect gene flow by both sexes, we might expect structures resulting from local gene flow and genetic drift to be different for the two types of markers, if gene flow is sex biased (see below). As for mtDNA, no pattern of progressive increase of microsatellite differentiation with geographical distance was observed at any scale.

The lower genetic differentiation (FST-estimates) revealed by the microsatellite data as compared to mtDNA at all scales may be explained by the four times higher effective number of genes at the diploid autosomal loci than for the haploid mtDNA. As differentiation due to genetic drift is proportional to the effective number of genes, assuming equal sex ratios, an approximately four times lower FST may be expected (Birky et al., 1983). In addition, FST-estimates are affected by gene diversity, which in turn is determined by the mutation rate and the effective population size (Hedrick, 1999). The surprising lack of genetic differentiation among the eastern and the southern localities of the intermediate sampling scale thus cannot readily be attributed to long-distance gene flow, but is as likely due to high mutation rates in relatively large populations. The measurable effect of genetic drift may be additionally reduced for microsatellite loci, as mutations at these loci (e.g. Goldstein & Schlötterer, 1999) may often not create distinguishable new alleles, but alleles identical to those present or having been present in the population.

Isolation by distance

The absence of an increase of genetic differentiation with distance might be explained, also at larger scales, by gene flow and random long-range dispersal of individual lemmings. This would imply assuming an island model (Wright, 1943) with a certain, quite low, number of migrants equally likely to be exchanged among adjacent and more distant localities. At the intermediate geographical scale of the present study, the number of animals migrating over distances from 10 to 170 km would have to be considered equal. We consider such a scenario highly unlikely. Siberian lemmings have in fact been reported to move typically over distances below one km (Orlov, 1980; Chernyavski & Tkachev, 1982) and in the present study we infer considerable gene flow at a local scale of hundreds of metres to a few km (see below).

Alternatively, the absence of an observable pattern of isolation by distance may suggest that a species has recently colonized the region (Slatkin, 1993). After the initial establishment of a population devoid of structure, local genetic drift should first result in a detectable increase of genetic differentiation with distance in the short distance range. This pattern will be weaker for more ongoing gene flow and will only slowly extend to larger geographical areas (Slatkin, 1993; P. E. Jorde, unpublished). The patchy genetic structure we observed in Siberian lemmings may thus be interpreted as a system not yet having reached drift–migration equilibrium after the establishment of the populations in the region or after a major mixing event. Whereas gene frequencies in our local samples did not show any pattern of isolation by distance, the analysis of individual genotypes revealed a slight tendency for decreasing genetic similarity with distance. In addition to considerable gene flow, immigration from adjacent areas, random long-distance dispersal (notably of males, see below) or high mutation rates (in the case of microsatellites) might explain the weakness of spatial autocorrelation at the small scale of the transect (Hardy & Vekemans, 1999).

Interpreting the observed genetic structure as resulting from recently established populations, is supported by the phylogeographic analysis of the mtDNA haplotypes. The lack of concordance between the phylogeny and the geographical distribution of haplotypes does indeed indicate that the high observed mtDNA diversity did not arise in situ, but was present before the populations were established, or before some possible major mixing event. Fedorov et al. (1999) studied the phylogeography of Lemmus over the entire Eurasian Arctic and found no clear relatedness among geographically co-distributed mtDNA haplotypes (RFLP data) within considerable regions of the major phylogenetic groups. Notably among three localities on the Taimyr Peninsula (our localities 1 and 2 and a sample from north-eastern Taimyr), pairs of haplotypes from the same population had no less divergent sequences than pairs from different populations. Fedorov et al. (1999) suggest that the observed lack of population structure in mtDNA within regions resulted from distribution shifts of populations with high ancestral haplotype diversity during the Holocene, a hypothesis which is consistent with the interpretation of our data as resulting from a recent establishment or large-scale mixing of populations. It is worth noticing Sdobnikov (1957) who wrote that the arctic explorer Middendorf in 1843 (a peak lemming year) observed Siberian lemmings only as far north as 74°N on the Taimyr Peninsula (cf. Fig. 1), whereas nowadays they are distributed over the entire Peninsula. A general northward expansion of the distribution in such recent times might have resulted in a major mixing event, after which isolation by distance may slowly build up a genetic structure, creating the observed patchy structure.

Neighbourhood size

The regressions of ar vs. ln-distance allowed us to estimate 4Dπσ2 as the inverse of the regression slope. Although the detailed interpretation of Wright’s neighbourhood concept is controversial, it has been suggested that this composite parameter may provide valuable information about population ecology (Rousset, 1997; 2000). We therefore attempt to interpret our genetic estimates of the neighbourhood size for Siberian lemmings, which were between 100 and 250 for significantly positive slopes, in terms of σ and population density.

No estimates of the absolute population density of Siberian lemmings are available from the study area, and little is known about natal dispersal distances in this species. The effective population density, however, is clearly quite low, as the average effective size in populations periodically fluctuating in numbers is close to the size during the low-density phase (e.g. Motro & Thomson, 1982). Lemmings (Lemmus sp.) are indeed well known for their particularly low numbers in some years (e.g. Boonstra et al., 1998). In addition, the genetically effective size of a population is typically smaller than the actual number of individuals (e.g. Frankham, 1995). Low-phase Siberian lemming densities have been estimated to be between 0 and 50 individuals per ha in the arctic tundra of north-eastern Taimyr (Underhill et al., 1993), and one individual or less per ha on Wrangel Island (Chernyavski & Tkachev, 1982). Low effective density estimates would correspond to quite large gene flow per generation: An average effective population density of one lemming per ha would for example result in σ between 300 and 450 m. Considerable gene flow could in turn explain the observed lack of local genetic differentiation within patches extending over up to 8 km or more, and the weakness of spatial autocorrelation for individual genotypes on the small-scale transect.

Male-biased dispersal

Males had significantly more negative AIc values than females at the intermediate scale; the same tendency, although not significant, was observed for the 10 subsamples of the transect. This result from the microsatellite analysis constitutes indirect evidence for male-biased dispersal (Favre et al., 1997), and is consistent with the general opinion that in lemmings males are the sex which moves more (e.g. Banks et al., 1975). A difference between sexes was observed also in the analysis of individual genotypes: for both markers, a trend for higher similarity among individuals up to one or a few km was apparent only for females. The relatively large average gene flow per generation deduced above, is thus likely to represent mostly male dispersal. At the larger geographical scales, male-biased dispersal may also be a factor contributing to the somewhat larger than expected difference observed between FST-estimates from mtDNA haplotypes, reflecting only the female population structure, and microsatellite estimates, reflecting the whole population.

Conclusions

We have documented in this study a patchy genetic structure within continuously distributed populations of Siberian lemmings, with little genetic differentiation in areas extending over 8 km or more. Locally, the described structure allowed us to infer a considerable amount of gene flow resulting mostly from male dispersal. A tendency for local spatial autocorrelation among individual genotypes was apparent for females, which seem more sedentary. There was no larger-scale pattern of isolation by distance among patches. These findings disagree considerably with the expectations of the isolation-by-distance model and thus point to particularities of these lemming populations. The weakness of spatial autocorrelation at the small scale may be due to considerable gene flow. At the larger scales, however, the observed pattern is unlikely to result from random long-distance dispersal. It may rather be explained by the recent history of Siberian lemmings on the Taimyr. The interpretation of our observations as a system not yet having reached a migration–drift equilibrium structure provides insight into how contemporary ecological processes like dispersal interact with population history to shape the extant geographical pattern of genetic variation.