Introduction

Phylogeographic patterns reported for numerous mammalian species in Europe are often interpreted as the consequences of the Quaternary climatic fluctuations, and the southern-European peninsulas (Iberian, Italian or Balkan regions) have been identified as refugia where flora and fauna survived during glacial phases (Taberlet et al., 1998; Hewitt, 2000, 2004). If a number of European taxa conformed to this now classical interpretation (see review in Hewitt, 2004; Randi, 2007), several studies indicated that this ‘southerly refugial model’ (Bhagwat and Willis, 2008) is probably too simplistic and not satisfactory for some taxa. Alternative concepts have been developed involving the existence of northern refugia (Stewart and Lister, 2001; Kotlik et al., 2006; Bhagwat and Willis, 2008; Tougard et al., 2008), microrefugia (small favourable areas outside the main refugium; Rull, 2009) or nunatak vs lowland refugia (specific to mountain species; Holderegger and Thiel-Egenter, 2009). According to Bhagwat and Willis (2008), populations not originating from southern refugia present common biogeographical traits, such as a small body size, a present-day northerly distribution and they are cold-tolerant animals. Among European mammals, several species of Mustelidae display such characteristics that would have allowed them to persist in northerly areas. However, phylogeographic studies conducted on Mustelidae are few and mostly inconclusive. The study of Davison et al. (2001) on Martes martes and Mustela putorius and the work of Ferrando et al. (2004) on Lutra lutra showed a lack of structuring and ancient lineages explained by an expansion from a single, but not localized, European refugium. For M. erminea, low genetic differentiation and structure were observed among continental Eurasian populations, with only one lineage from Europe (Ireland excepted) to Japan up to Alaska whose origin is not specified (Fleming and Cook, 2002; Kurose et al., 2000, 2005; Martinkova et al., 2007). Finally, the phylogeographical analysis of Meles meles across Eurasia (Marmi et al., 2006) evidenced four genetic groups, but only one lineage in Europe possibly resulting from postglacial recolonization from several refugia.

In this study, phylogeographic variations in the western-Palaearctic region and genetic diversification related to Quaternary climatic changes are investigated for the weasel (Mustela nivalis, Linnaeus 1766). This species has a wide distribution covering nearly the entire Holarctic area (Europe, North Africa, northern Asia and North America). It is widespread throughout the entire western-Palaearctic region, with the exception of several Atlantic islands (including Ireland, Iceland and the Canary Islands). The species displays considerable variability across this wide geographic range. Consequently, there is a wealth of literature concerning the identification and description of different weasel lineages in Europe (three or four subspecies are described in Europe and North Africa), mainly based on morphological variations (Frank, 1985; Meia and Mermod, 1992; Zyll De Jong, 1992; Abramov and Baryshnikov, 2000) and chromosomal patterns (Zima and Cenevova, 2002). In contrast, molecular studies are scarce and mainly concern Japanese and Eurasian regions (Kurose et al., 1999, 2005; Saarma and Tumanov, 2006). In western Europe, two lineages have been recognized (Lebarbenchon et al., 2006), one in the mainland (France, Italy) and one in Corsica. However, these studies are based on short sequences (partial cytochrome b (cytb) or control region (ctr)) and include only a few individuals. This precludes a clear estimation of the number of genetic groups as well as their possible origin and relationships. Finally, the study of Lebarbenchon et al. (2006) raised the question of the origin of the two genetic lineages observed for M. nivalis in a broader phylogeographic context at the European scale.

The second aim of our study concerns the colonization of Mediterranean islands. Several mammalian species have been studied to identify the genetic link between insular and mainland populations (Santucci et al., 1998; Pierpaoli et al., 1999; Cosson et al., 2005; Dubey et al., 2007a, 2008). From these and other studies, it can be concluded that modalities of island colonization are very diverse according to the island and/or species under study, leading to a complex pattern of mammalian composition representing endemic, native or introduced species (Masseti, 1998; Sarà, 1998).

M. nivalis is one of the most common mustelids occurring in the islands of the Palaearctic region, inhabiting all Mediterranean islands larger than 240 km2 with the exception of Ibiza and Cyprus (Masseti, 1995; De Marinis and Masseti, 2003). As is often reported for small mammals (Blondel and Vigne, 1993; Alder and Levins, 1994; Michaux et al., 2002), insular and mainland populations of M. nivalis differ morphologically. These variations include larger body size and darker coat colouration in insular populations (Beaucournu and Grulich, 1968; Alcover and Jaume, 1983) and led to the description of potentially one subspecies per island (Beaucournu and Grulich, 1968; De Marinis, 1996). For M. nivalis, the prevalent hypothesis is a colonization through human introduction (Masseti, 1995; Bover and Alcover, 2008). Until now, no genetic study has been performed to identify the geographic origin of insular weasel populations or to determine the impact of human movements on the presence of this species in the Mediterranean islands.

We used mitochondrial partial ctr and complete cytb sequences to investigate the genetic variation and phylogeographic pattern of the weasel in the western-Palaearctic region (from Morocco to Finland) including five Mediterranean islands (Minorca, Corsica, Sardinia, Sicily and Crete). On the basis of this dataset, specific objectives concerning the phylogeography of the weasel were addressed: (1) evaluate the impact of the Quaternary glacial cycles on the genetic diversity of weasels by dating the origin of the phylogenetic groups, (2) infer the locations of potential glacial refugia, (3) assess the demographic history of phylogenetic clades, (4) determine whether colonization of Mediterranean islands occurs through human intervention or natural migration and (5) compare current taxonomic classification with genetic groups.

Materials and methods

Sampling

We sequenced a total of 88 weasels distributed in the western-Palaearctic region (Figure 1). Our samples come from road-killed animals, museums or personal collections (Table 1). We added published sequences from the European Molecular Biology Laboratory Database for weasels of the western-Palaearctic region. Thus, six ctr and nine cytb (two complete and seven partial sequences; see Table 1) have been added to our dataset. Moreover, five other species of Mustela were used as outgroups: the western polecat M. putorius, the steppe polecat M. eversmanni, the European mink M. lutreola, the Japanese weasel M. itatsi and the stoat M. erminea (see Table 1 for accession numbers and references).

Figure 1
figure 1

Sample localities of M. nivalis (see Table 1 for details). An asterisk (*) denotes samples for which precise locality is unknown. Samples for which only the control region or the cytochrome b is available are annotated ‘a’ and ‘b’, respectively. White and black symbols correspond to phylogenetic clades identified in Figure 2. The dotted line indicates the approximate distribution of M. nivalis and the inset map shows the entire species range.

Table 1 Sampling location, sample symbols, tissue origin and sequence accession numbers

DNA sequencing and alignment

DNA was extracted from skin samples from the ear (frozen or preserved in ethanol) using the QIAamp DNA Mini Kit (Qiagen, Courtaboeuf, France). The first 550 nucleotides of the 5′ portion of the mitochondrial ctr were amplified using the L0Mni 5′-GCC CCR CCA TCA GCA CCC AAA GC-3′ (this study from weasel ctr alignment) and MSD (Kurose et al., 1999) primers. In case DNA was degraded, we used two internal primers (L1Mni 5′-ACC TCT TCT CGC TCC GGG CCC ATC A-3′; this study from weasel ctr alignment and E3 (Arnason et al., 1997) to amplify ctr in two overlapping fragments of 306 and 363 nucleotides. The mitochondrial DNA cytb gene was amplified entirely using L7 and H6 primers (Montgelard et al., 2002) or in two overlapping fragments (767 and 742 bp) with the internal primers H2 and L2 (Hassanin et al., 1998). All weasels have been sequenced for the ctr (65 from this study and 23 from an earlier study by Lebarbenchon et al., 2006). For the cytb, 68 samples have been sequenced, representing at least one individual from each geographic area and for each ctr haplotype. A total of 133 new sequences were obtained and deposited in the European Molecular Biology Laboratory Database with accession numbers given in Table 1.

For both genes, polymerase chain reaction was performed using an initial denaturation of 3 min at 94 °C, followed by 35 cycles of 45 s denaturation at 94 °C, 1 min annealing at 50 °C, 2 min extension at 72 °C and a 10 min final extension at 72 °C. Polymerase chain reaction products were purified with a QIAquick polymerase chain reaction Gel Extraction Kit (Qiagen). Sequencing was carried out with an ABI Prism Big Dye Terminator Cycle Sequencing Ready Reaction Kit on an ABI Prism 310 Genetic Analyser (Applied Biosystems, Courtaboeuf, France). Electrophoregrams were read and aligned manually using Sequence Navigator software (Applied Biosystems).

The ctr and cytb sequences were aligned by hand using ED editor of the MUST package (Philippe, 1993). The cytb was translated into protein and did not reveal any stop codons, suggesting that functional sequences were obtained. Indels present in the ctr alignment were considered as missing characters. The number of variable and informative sites was calculated using DNASP 4.50.2 software (Rozas et al., 2003).

Genetic variability and population structure

Genetic variability was estimated for both phylogenetic and geographic groups. The number of different haplotypes (nH), haplotype diversity (h) and nucleotide diversity (π) as well as the average number of nucleotide differences (k) within groups were estimated using DNASP program. The hypothesis of recent population growth was tested using different statistical tests. Deviation from neutrality of mutations was tested with Fu's Fs (Fu, 1997), Tajima's D statistic (Tajima, 1989) and R2 test (Ramos-Onsins and Rozas, 2002) implemented in DNASP. The significance of deviation was evaluated with 10 000 boostrap replicates (constant population size for P-values >0.05). Pairwise mismatch distributions of substitution differences were used to test for demographic expansion. Observed and expected values were compared with Arlequin 2.0 (Schneider et al., 2000) with 100 bootstrap replicates (demographic expansion rejected for P-values <0.05). Relationships between unique haplotypes were described by a parsimony network constructed with the program TCS 1.21 (Clement et al., 2000).

Saturation and phylogenetic analyses

Saturation of nucleotide substitutions was evaluated according to the procedure of Philippe et al. (1994). The inferred number of substitutions in maximum likelihood (programs PHYML version 2.4.4; Guindon and Gascuel, 2003 and Treeplot of MUST) was plotted against the pairwise number of observed differences between sequences (programs Comp_mat and Treeplot of MUST). The slope of the linear regression and the coefficient of correlation were then used to evaluate the level of saturation: the slope decreases from one as saturation increases.

Phylogenetic trees were reconstructed in maximum parsimony with PAUP (version 4.0b10; Swofford, 2002), in maximum likelihood using PHYML and with the Bayesian inference using MRBAYES (version 3.1.2; Ronquist and Huelsenbeck, 2003). The search for the maximum parsimony trees was performed with the option stepwise addition with simple addition sequence and stability of nodes was assessed with 1000 boostrap replications. For the two probabilistic methods, the optimal model of sequence evolution was determined with MODELGENERATOR (version 0.82; Keane et al., 2006), using the majority model indicated by the four tests (LRT, AIC1, AIC2 and BIC). As PHYML does not allow for data partition, the model of sequence evolution was inferred by MODELGENERATOR on whole genes (separately or in combination) and stability of nodes was tested with 1000 bootstrap replications. With MRBAYES, a mixed-model analysis was performed on partitioned genes using different models (selected by MODELGENERATOR) and parameters for four partitions (ctr and each codon position for the cytb). MRBAYES was run using four chains of four millions of generations sampled for each 100 generations, with the first 10 000 trees discarded as burn-in after checking stationary of log-likelihood values.

Dating analysis

Divergence dates were estimated by Bayesian coalescent analysis with the program BEAST (version 1.4.6; Drummond and Rambaut, 2007) using the complete combined dataset (68 weasels sequenced for both genes and 6 outgroups). We used the HKY model of nucleotide substitution with a proportion of invariable site (I) and a Gamma distribution (G). Three coalescent priors (constant size, exponential growth and expansion growth) as well as three molecular clock models (strict, relaxed exponential and lognormal) were evaluated and tested using the Bayes factor (BF) as implemented in the program TRACER (version 1.4; Rambaut and Drummond, 2007). BF significance was determined from the values of 2LnBF as described in Brandley et al. (2005). Chains were run twice independently for 30-million generations each, sampled every 1000 iterations with the first 10% of trees discarded as burn-in. The program TRACER was used to check for convergence of MCMC chains to stationarity (effective sample size >600). The results of the two independent analyses were combined using the program Logcombiner to calculate the time of divergence from the most recent common ancestor (TMRCA) and 95% highest posterior density (HPD) intervals.

Molecular dating was derived using as calibration point the age of the first occurrence of M. nivalis, which is dated at 300 kiloyear (kyr) in North America and 100 kyr in Eurasia (The PaleoBiology Database, http://paleodb.org). A lognormal distribution suitable for modelling fossil data (Ho, 2007) was used as prior with parameter values of 100 kyr as the minimum age (lower bound parameter), 300 kyr as the mean and the standard deviation of the distribution was chosen to 2, so that 95% of the HPD for the time of divergence of the M. nivalis clade lies in the interval 100–500 kyr (late-middle Pleistocene; Sheffield and King, 1994).

The expansion time was estimated from the mismatch distribution as t=τ /(2μ), where t is the time since expansion in generation, τ is the mode of the mismatch distribution and μ is the mutation rate per nucleotide multiplied by the sequence length (Rogers, 1995). We assumed one generation per year, and for the substitution rate, we used the mean and 95% HPD estimations obtained from BEAST analyses to calculate the expansion time with 95% confidence interval (CI).

Results

Phylogenetic analyses

Ninety-four ctr weasel sequences have been analysed for 550 nucleotides (including 7 indels), which yielded 49 haplotypes showing 43 (7.8%) variable and 33 (6%) informative sites. Intraspecific sequence variation between haplotypes varies from 0.18 to 3.85%. The slope of the regression and correlation coefficient of the saturation analysis were 0.77 and 0.83, respectively. These values indicate that the ctr was little affected by saturation, although values were slightly lower than for the cytb (see below). The complete ctr dataset includes 100 animals (94 weasels and 6 outgroups) and HKY+I+G was selected as the best-fitting model of sequence evolution. On the whole, the ctr did not reveal strong phylogeographic structure with the exception of a clade including samples from western Europe (clade I in Supplementary Appendix S1a).

Thirty-six different haplotypes have been identified for 70 weasel complete cytb sequences (Table 1), which showed 82 (7.2%) variable sites among which 42 (3.7%) were informative. Intraspecific sequence variation between haplotypes varied from 0.088 to 2.28%. The slope of the regression and correlation coefficient of the saturation analysis was 0.85 and 0.98, respectively, indicating that the cytb was not affected by saturation.

Phylogenetic analyses were performed on 83 cytb when partial sequences and outgroups were included. The best model of sequence evolution was TrNef+I, HKY and TrN, for the first, second and third position, respectively, whereas the HKY+G model was selected for the whole cytb. In Bayesian analysis, we used the GTR model instead of TrNef and TrN, which are not available in MRBAYES. The resulting trees revealed two moderately supported clades (I and II in Appendix S1b). Two groups can be identified in clade I. Subclade Ia contained all weasels from the western part of Europe from Spain to Finland among which a uniform phylogeographic pattern was observed. Subclade Ib was weakly supported and included seven samples (two individuals from Poland, two from Finland and one from central Italy, Estonia and Russia each). In contrast, group II was clearly structured in five rather well-supported subclades including individuals from (i) Greece; (ii) Crete, Minorca, Sardinia and Serbia; (iii) Poland, Romania, Slovenia and Serbia; (iv) Corsica, north Italy and Sicily and (v) Morocco and Sardinia.

The combined analysis included 74 samples (68 weasels and 6 outgroups) for 1690 nucleotides, and HKY+I+G was selected as the best model of sequence evolution. The phylogenetic tree (Figure 2a) strengthened the arrangement in two main clades (I and II) observed with the cytb alone (Appendix S1b), and the same five subclades were strongly supported in group II. This indicates that while less information was brought by the ctr, there was little conflict with the structure given by the cytb. The phylogenetic analysis of the western-Palaearctic weasels thus reveals an opposition between the clade I unstructured and including only one strong group (subclade Ia) and the clade II split into five well-supported subgroups.

Figure 2
figure 2figure 2

(a) Bayesian phylogram derived from the combined control region and cytochrome b for 68 M. nivalis (see Table 1 for sample names). Bayesian posterior probabilities and bootstrap proportions in maximum likelihood and maximum parsimony are shown at nodes from left to right, respectively. With the exception of clade Ib, only nodes supported by at least 90% support for at least one method are indicated. Supports for main clades and subclades are indicated in bold. (b) Minimum spanning network for the same 68 sequences. Black dots represent missing haplotypes. Unless specified, the number of substitution between haplotypes is one.

The minimum spanning network (Figure 2b) performed on the whole combined dataset (68 weasels) showed the same general pattern with clades I and II separated by 18 mutational steps. The absence of structure inside subclade Ia was also apparent, along with a star-like topology indicative of a demographic expansion (stronger evidence with cytb alone, data not shown).

Genetic structure

The mean divergence, estimated from combined markers by the uncorrected p distance (percentage of divergence), was low within each clade (0.49±0.3% for clade I and 0.6±0.31% for clade II), whereas the differentiation between the two groups was <2% (1.8±0.18%). These estimates are not uncommon for carnivores and, for example, are in agreement with the mean percentage divergence observed for stoat (M. erminea; Fleming and Cook, 2002; Martinkova et al., 2007).

Genetic diversity was calculated for phylogenetic (Figure 2a) and geographic (each Mediterranean island, Morocco and eastern-continental individuals) groups. Haplotype diversity was high (h>0.80) for all clades, whereas nucleotide diversity was heterogeneous between different groups (Table 2), with lower values recorded for groups including insular samples. This analysis, combined with the average number of nucleotide differences (k in Table 2), highlights that subclades IIa and IId, and especially samples from Corsica and Sicily, present the lowest genetic diversities.

Table 2 Genetic polymorphism and neutrality tests

Three statistical tests (Fu's Fs, Tajima's D statistic, R2 test; Table 2) as well as mismatch distributions were performed to infer demographic histories (stability or expansion of population). The Tajima's D statistic was the most restrictive test with 10 out of 13 groups tested showing a probability above 0.05 (thus rejecting the hypothesis of expansion), whereas the two other tests reject this hypothesis (P>0.05) for seven groups. In contrast, mismatch distributions accepted the hypothesis of recent population growth for all populations except one (subclade IIc). Discrepancies between tests can be explained by differences in their statistical power or in the sequence information used (Ramos-Onsins and Rozas, 2002). Therefore, we considered a population in expansion only if results from at least three tests were significant. According to these constraints, five phylogenetic clades (clade I, subclade Ia, clades II, IIb and IId) were considered as expanding groups.

Molecular dating

The difference in likelihood between the strict (−5038.26), relaxed lognormal (−5025.55) and relaxed exponential (−5015.01) molecular clock reveals that the exponential model was significantly more adapted to our dataset (2LnBF>10). Similarly, the test of three demography priors indicated that the constant size (−5015.28) hypothesis cannot be rejected (2LnBF<2) as compared with the exponential (−5014.84) and expansion (−5015.99) growth coalescents. Consequently, the relaxed exponential clock and constant population size prior were used to estimate the age of the TMRCA for different phylogenetic and geographic groups (Table 3; and Figure 4).

Table 3 Estimated time (in kiloyears) to the most recent common ancestor (TMRCA) and expansion time for different groups of interest using a Bayesian coalescent analysis (BEAST program) and the mismatch distributions

Three periods of diversification can be recognized for weasels: (i) 116 kyr for the TMRCA of the whole western-Palaearctic lineage (clade I), (ii) 50–60 kyr for the timing of the eastern-Palaearctic continental and western (subclade Ia) populations and (iii) 18–28 kyr for the TMRCA of the Moroccan, Sicilian and Corsican populations. For the outgroups, the split between M. lutreola and M. eversmanni+M. putorius was 133 kyr, which is the same timing as the divergence of clade I. The divergence of M. itatsi from other mustelids was dated at 389 kyr.

The mean mutation rate was evaluated at 112 substitutions per site per kyr (95% HPD: 8.9–233 kyr) by BEAST analyses. This value and 95% HPD interval were used to infer time of expansion from mismatch analysis for different clades (Table 3). Values were rather different, notably for clades I and Ia.

Discussion

Differentiation in northern refugia?

On the basis of extensive sampling in the western-Palaearctic region and the use of two mitochondrial genes (1690 bp), we have shown for the first time a clear phylogeographic pattern for the weasel, revealing a split into two major groups (clades I and II in Figure 2). Clade I includes all individuals sampled in the western-Palaearctic region, from Spain to Finland and including the United Kingdom (white symbols in Figure 1), and clade II encompasses samples from eastern Europe to the Black Sea as well as all Moroccan and insular weasels (black symbols in Figure 1).

Group I contained a strongly supported clade (Ia) including all western-European weasels (except It18) and a weakly supported clade (Ib) including five individuals located at the edge of the distribution area of clade I (Figure 1). As these haplotypes are the oldest among clade I and are mostly located in Finland and Poland, it can be inferred that clade I probably originated in northeastern Europe. Several explanations can be put forward to account for these individuals. First, they could carry ancestral polymorphisms because of incomplete lineage sorting (Maddison and Knowles, 2006). There is, however, little support for this hypothesis because samples are geographically localized, which is not consistent with the retention of ancestral variation. Second, these individuals could result from admixture with another genetic group not identified in our study because it is located farther east. To check for the existence of another group in the eastern part of Eurasia, we added all complete weasel cytb sequences available in European Molecular Biology Laboratory. Only two sequences from Japan and one from Taiwan were found. The resulting tree (not shown) indicates that these sequences also clustered at the base of clade I, thus suggesting that clade I might effectively include other lineages expanding to the eastern part of the Palaearctic region. Third, a replacement event can be proposed by which these intermediate samples would represent a residual clade I (formerly covering the whole western-Palaearctic region) displaced by range expansion of subclade Ia. This hypothesis is supported by the observation that subclade Ia is nestled within clade I, and, therefore, group Ia likely originated from group I. Such a replacement process has been used to explain the different lineages observed in Scotland and England/Wales for the water vole (Arvicola terrestris, Piertney et al., 2005) or in Ireland and Britain for the stoat (Martinkova et al. 2007).

The subclade Ia showed no phylogeographic structure and has characteristics consistent with sudden expansion: a bell-shaped curve of the mismatch distribution (Figure 3a), rejection of the hypothesis of neutrality (Table 2) and a star-like network (particularly for the cytb). These features suggest some contraction of population size in the past, likely during the last glacial cycles as indicated by the time of expansion (13 kyr; 95% CI: 7–167 kyr). For this clade, our data do not support the hypothesis of refugia located in the Mediterranean peninsulas because observation of the network (Figure 2b) reveals that individuals from Spain or Italy are mostly located at the periphery of the network, whereas central nodes are occupied by individuals from Denmark (Dk97), Switzerland (Sw26) or Germany (Ge114). These results thus lead to the assumption that clade Ia originated from clade I in northeastern Europe and then spread to southwestern regions. The interpretation of northern-European refugium would also be consistent with the ecology of weasels, which are presently found from the southern and northern extremes of Europe and in mountains up to 3000 m. These characteristics suggest that, as cold-tolerant animals, weasels would have been able to survive at different latitudes (to the boundary of the ice sheet) during successive glaciations. The presence of weasel subfossils recorded during the pleniglacial age (15–75 kyr) of central Europe (Germany, Poland; Sommer and Benecke, 2004) is also consistent with this hypothesis. Of course, more samples from northern countries (Sweden, Norway) to the Extreme Orient would be needed to support this scenario.

Figure 3
figure 3

Observed (dot line) and expected (solid line) mismatch distributions under a population growth-decline model for different M. nivalis clades (see Figure 2a) of interest: (a) clade Ia, (b) clade II east-continental samples (after exclusion of insular and Moroccan samples), (c) clade IId and (d) clade IIe.

Clade II showed a topology with five well-supported subgroups (Figure 2). When samples from clade II are restricted to continental individuals (nine weasels from Greece, Serbia, Romania, Italy, Poland), there is no indication of recent population growth: the mismatch distribution curve is multimodal (Figure 3b) and close to the rejection threshold (P=0.08; see also subclade IIc). Moreover, the failure to reject the neutrality hypothesis (Table 2) rather suggests constant population size. In contrast to western-European weasels, eastern-continental populations seem to have been more stable during the ice ages and would not have undergone reduction in population size. Although based on a small sampling, our data do not support the hypothesis of a refugium located in the Balkan area. This pattern of more stable populations in the Balkans has already been described for the bi-coloured shrew (Crocidura leucodon; Dubey et al., 2007b) and the yellow-necked fieldmouse (Apodemus flavicollis; Michaux et al., 2005). This might be explained by milder climatic conditions prevailing in places farther from the ice sheet. A more comprehensive sampling from northern and middle-eastern regions (Carpathian, Balkan, Black Sea until Caspian areas) would help to delineate the geographic distribution of clade II and to better identify refugia potentially located in more northern (Carpathian) or eastern (Caucasus) areas (see Jaarola and Searle, 2002; Brunhoff et al., 2003).

Molecular dating indicates that the divergence of subclade Ia and clade II (continental samples) occurred 62 and 50 kyr ago, respectively (95% HPD: 5–158 kyr; see Table 3; Figure 4). The proximity of these dates suggests that the two clades originated from the same event and fall during the last glacial period, namely the Würm glaciation (10–80 kyr). Thus, we can reasonably assume that the last glacial period was responsible for the emergence of these two distinct genetic groups, but with quite different consequences. Whereas the western-Palaearctic group has likely undergone all consequences of a distribution restricted to glacial refugia, the eastern-continental group does not express any characteristic of a refugium zone. The molecular dating obtained for the clade I (extended western Palaearctic) is dated approximately 116 kyr ago (95% CI: 126–287 kyr), which corresponds to the end of the Riss glaciation.

Figure 4
figure 4

Bayesian posterior density plot showing the distribution of the age of the most recent common ancestor for clade I (----), clade Ia (—) and Corsican (—..—), Sicilian (……), Moroccan (— — —) and east-continental (— . — .) populations of M. nivalis (see Table 3 for estimated ages).

Colonization of Mediterranean islands

The question of how (natural or introduced populations) and when (during the Pleistocene or at human scale) the current mammalian species colonized the different Mediterranean islands is still open to debate (Vigne, 1999; Lister, 2004; Marra, 2005). During the Holocene, the replacement of endemic mammalian species by a poorly diversified introduced fauna occurred on all of the larger Mediterranean islands (Blondel and Vigne, 1993). For M. nivalis, there is no clear explanation concerning the modality of island introduction. However, the lack of fossil records (Vigne and Alcover, 1985; Masseti, 1995; Vigne, 1999) suggests the absence of M. nivalis not only on most islands, but also in the western part of North Africa (Morocco, Algeria and Tunisia) during the Pleistocene and early Holocene ages. Hence, colonization through human intervention (voluntary or not) seems to be the prevailing hypothesis (Vigne, 1999; Bover and Alcover, 2008).

According to our results, it is noticeable that, with the exception of Crete, all Mediterranean islands do not belong to the same genetic group as the nearest continent. Corsica, Sardinia, Sicily and Minorca are included in the genetic clade II, whereas their corresponding closest mainland regions (France, Italy and Spain) are in the genetic group I (Figures 1 and 2). It can thus be inferred that islands have not been colonized from the nearest continent, as already shown from a smaller sampling (Lebarbenchon et al., 2006). Rather, the clustering of insular samples with weasels from Serbia, Greece and Romania suggests that islands were likely colonized by individuals originating from an eastern-European group. Owing to this common origin, our genetic data argue in favour of a human introduction, which is also congruent with the early trade routes that have been established in eastern civilizations since the Bronze age (De Marinis and Masseti, 2003; Costa, 2004). Moreover, a common eastern origin through natural colonization across land masses seems to be unlikely because of the high bathymetry of the Mediterranean Sea: large islands (Corsica, Sardinia and Crete) are surrounded by deep marine pits and only coastal islands were connected to the continents during glacial ages (Shackleton et al., 1984).

In our analysis, Sicilian and Corsican samples are clustered in the same subclade (IId; Figure 2) suggesting that both islands were colonized from the same eastern genetic group. Moreover, the low nucleotide diversity and high haplotype variability (Table 2) observed for this grouping as well as the unimodal mismatch distribution (Figure 3c) are indicative of population growth from a small number of individuals (founder effect). These results thus support the hypothesis of a recent introduction on both islands, which is compatible with the hypothesis of colonization through human activity. Similar to the ferret (Mustela putorius furo), which has been used for centuries for hunting purposes in western Europe, the weasel may be considered as an ‘anthropophilous’ species (Vigne, 1999 and references herein). As a matter of fact, it was used as a house animal (to preserve food from small rodents; Masseti, 1995) as well as for food, fur and even traditional medicine as is still the case in Morocco. The weasel was thus part of human activities and as such was involved in the Holocene immigration wave that colonized Mediterranean islands with man. It is also possible that in Sicily, introduced weasels replaced some autochthonous lineage because the remains from M. nivalis are documented in the Terminal Pleistocene (Masseti, 1995).

As supported by Mesolithic sites in Sardinia and Corsica, the early human settlements on large Mediterranean islands are dated approximately 11 000 before present (Vigne, 1999). However, the main wave of island colonization started with the Neolithic age, that is from the seventh millennium, while the permanent colonization of Corsica was realized during the sixth millennium. These dates are more recent than the 18 kyr (95% HPD: 1.2–48 kyr) we obtained from analyses of individuals from Corsica and Sicily (Table 3). In spite of a large CI, dating obtained for the time of expansion (10 kyr, 95% HPD: 5–120 kyr) remains more compatible with an early human intervention than by a colonization initiated by the Phoenician or Greek civilizations that started about 3 kyr ago (De Marinis and Masseti, 2003; Costa, 2004) in the whole Mediterranean region.

Colonization of North Africa

Weasels from Morocco are included in clade II (subclade IIe; Figure 2a) with insular and eastern-European samples and thus do not belong to the same group as western-European (Iberian, French or Italian) weasels. This result suggests that, like insular individuals, those in Morocco originated from the eastern part of Europe. Different routes have been proposed to explain colonization of North Africa (Dobson and Wright, 2000). The absence of genetic similarity between Iberian and Moroccan samples strongly suggests that North African weasels do not come from western Europe (route 1 in Dobson and Wright, 2000). Colonization through this route is, however, advocated for other mammals, including the woodmouse (Apdodemus sylvaticus), which colonized Morocco from the Iberian peninsula (Michaux et al., 2003).

Neutrality tests (Table 2; Figure 3d) performed on Moroccan samples are not indicative of recent population expansion, and nucleotide diversity is at least twice that seen in insular samples (Table 2). Although the number of analysed samples is low, these results suggest that North African populations did not undergo a recent bottleneck. With dating analyses, we obtained 28 kyr (95% HPD: 1.3–78 kyr) for the time of divergence of the Moroccan population, which is slightly older than estimations obtained for Sicilian and Corsican populations (18 kyr, 95% HPD: 1.4–49 kyr). However, considering the large CI, it is difficult to determine whether colonization occurred through human movements or naturally from the middle-eastern area along the northern coast of Africa (route 2 in Dobson and Wright, 2000). To corroborate the hypothesis of an eastern route of colonization, it would be very interesting to include weasel samples from Egypt (from the Nile delta). These weasels comprise the most eastern African population and are described as a different subspecies (M. n. subpalmata; Reig, 1997) or even a distinct species (M. subpalmata; Zyll De Jong, 1992), representing a glacial relict (Dayan and Tchernov, 1988).

Taxonomic considerations

The extreme variability of weasel morphological characteristics (body size, pattern of colouration, but also sexual, seasonal and geographic variation) led to the description of numerous taxonomic forms (and thus subspecies). Most recent studies on weasel taxonomy recognize 3–4 subspecies in the western-Palaearctic region (Frank, 1985; Zyll De Jong, 1992; Abramov and Baryshnikov, 2000; Zima and Cenevova, 2002), but there is no consensus about the geographic range or the evolutionary history of these taxa. Instead of arbitrarily adopting one particular taxonomic classification, we preferred to revert to the type locality (Sheffield and King, 1994) in which the different subspecies were originally described: (i) M. nivalis nivalis: 1776 ‘Province of Vesterbotten, Sweden’; (ii) M. nivalis vulgaris: 1777 ‘near Leipzig, Germany’; (iii) M. nivalis boccamela: 1800 ‘Sardinia’ and (iv) M. nivalis numidica: 1855 ‘Tangier, Morocco’. Our mitochondrial data lead to the recognition of two main phylogenetic groups in the western-Palaearctic region (I and II on Figure 2) and the attempt to compare them with assumed subspecies constitutes only a preliminary approach to the revision of weasel taxonomic classification. As individuals from Germany fall within group I, we might assign this clade to the vulgaris group, which would thus include the whole western Europe. There is no genetic evidence for the occurrence of two subspecies in Morocco (numidica) and Sardinia (boccamela) because these samples belong to the same genetic clade (II). Finally, the identification of the nivalis subspecies would necessitate analysing weasels from Scandinavia, and in particular Sweden. In any case, linking morphotypes to genetic lineages would require concurrent analyses of the same individuals for both morphological and molecular characteristics (including nuclear markers). Such studies remain unfortunately rare in the literature, especially for carnivores, because of the inherent logistical complexities therein.