Introduction

Speciation involves the establishment of reproductive isolation between organisms. Reproductive isolation may be achieved through various mechanisms acting singly or together at the pre- and/or post-zygotic stage, such as assortative mating or genetic incompatibilities (Turelli and Begun, 1997; Coyne and Orr, 2004). Incomplete reproductive isolation is often characterized by sex-specific processes, and in particular intrinsic genetic incompatibilities often more strongly affect hybrids of the heterogametic sex, a pattern known as Haldane’s rule (Haldane, 1922; Schilthuizen et al., 2011). One expectation from Haldane’s rule is that the extent of gene flow between incipient species may strongly differ if a gene is paternally or maternally transmitted (Turelli and Begun, 1997; Coyne and Orr, 2004; Macholan et al., 2007).

Hybrid zones offer the possibility to study the extent and potential sex-specificity of gene flow between incipient species in their natural environment. Here the consistency of introgression of genetic markers may provide valuable information on the degree of reproductive isolation between organisms (Barton and Hewitt, 1989; Raufaste et al., 2005; Schilthuizen et al., 2011). For example, in a hybrid zone with a high degree of reproductive isolation, Haldane’s rule might lead to very low gene flow for male-specific Y-chromosomal markers in mammals, whereas no impediments for gene flow might exist for maternally transmitted mitochondrial markers (for example, Beysard et al., 2012). Despite the recognition of the crucial information carried by the Y-chromosome in mammals (Petit et al., 2002), its investigation in hybrid zones of non-model organisms remains restricted to very few systems (for example, Jaarola et al., 1997; Hellborg et al., 2005; Yannic et al., 2008), most probably because of technical limitations.

The rich technical and genomic resources available for the laboratory mouse have eased the development of the secondary contact zone between the house mouse subspecies Mus musculus musculus and Mus musculus domesticus, running through Europe to a potential model system for speciation research in mammals. The exploration of this contact zone has shown, for example, that the subspecies may mate assortatively (Ganem et al., 2008), hybrids may differ from parentals in parasite load (see Baird et al., 2012) or male hybrids may be sterile (reviewed in Britton-Davidian et al, 2005; and also Mihola et al., 2009). The observation of male sterility combined with the detection of abrupt Y- and X-chromosome marker clines (Dod et al., 1993; Macholan et al., 2007, 2008, 2011) relative to other markers across the hybrid zone are consistent with the hypothesis that sex chromosomes and in particular factors on the X-chromosome have an important role in causing genetic incompatibilities in mouse hybrids (Coyne and Orr, 2004). Nevertheless, there is extensive geographical and individual variation in male sterility (Britton-Davidian et al., 2005; Good et al., 2008; Turner et al., 2012), and sex-chromosomal marker clines may be not coincident with other genetic markers in particular geographical regions, potentially due to asymmetrical processes (Macholan et al., 2008; Jones et al., 2010). For example, the Y-chromosome was found non-coincident with other markers in an area of about 330 km2 in the Czech–Bavarian part of the mouse hybrid zone (Macholan et al., 2008). This may be explained by two non-mutually exclusive scenarios: first, the hybrid zone may have moved and left the M. m. musculus Y-chromosome in the M. m. domesticus territory or second, the M. m. musculus Y-chromosome might have escaped from a static hybrid zone centre comprising autosomal and X-chromosomal markers and advanced up to 22 km into the M. m. domesticus territory (Macholan et al., 2008). It was proposed that a genetic conflict between the sexes may act together with a distortion of the sex-ratio to promote unidirectional sex-specific gene flow between the subspecies (Macholan et al., 2008). The variability in marker cline patterns stresses the importance of replicated analyses for hybrid zone analyses. In the present study, we collected information on multiple hybrid zones to assess the properties of sex-specific marker clines in a speciation-prone European small mammal.

Voles in the genus Microtus have been undergoing the fastest radiation known in mammals and diversified into at least 65 species across the world within <2 million years (Jaarola et al., 2004; Fink et al., 2010). Despite very little morphological differentiation, karyotypic variation and differentiation into deep genetic lineages is relatively common within recognized species, suggesting the presence of cryptic species or ongoing speciation processes (Jaarola et al., 2004; Heckel et al., 2005; Hellborg et al., 2005; Bastos-Silveira et al., 2012). In the common vole (Microtus arvalis), intraspecific divergence has resulted in four main, phenotypically cryptic evolutionary lineages in Europe (Western, Central, Italian and Eastern lineages; Heckel et al., 2005). Divergence between these lineages based on population samples of mitochondrial DNA (mtDNA) sequences and autosomal information was estimated to have coincided with or predated the last glacial maximum depending on the lineage, resulting, for example, in more than 50 000 generations of divergence between the Western and Central lineages (Heckel et al., 2005). An initial analysis of phylogeographic patterns in the region of secondary contact between lineages in the Alps demonstrated clear-cut parapatry of mtDNA lineages (Braaker and Heckel, 2009). Multi-locus nuclear microsatellite markers allow the allocation of individuals to genetic clusters whose distributions are highly consistent with mtDNA lineages—except for first evidence of cytonuclear discordance in a few individuals close to the potential contact zones (Braaker and Heckel, 2009). The detected patterns are consistent with male-biased dispersal in the species (Schweizer et al., 2007; Hahne et al., 2011) and relatively complex processes of genetic erosion after colonization of the region (Braaker and Heckel, 2009). However, they may also be explained by an ongoing speciation process and the effect of sex-specific gene flow associated with yet undocumented partial reproductive isolation between the highly divergent evolutionary lineages in M. arvalis. With male-biased dispersal in the species, a selectively neutral secondary contact would show broader paternal marker clines than maternal ones. In a case involving selection, for example, a form of Haldane’s rule, paternal marker clines would not be expected to be broader than maternal ones because they are transmitted by the heterogametic sex, and thus gene flow would be expected to be restricted.

Here we fine-map the geographic location of the secondary contact zones between all three evolutionary lineages (Western, Central, Italian) of M. arvalis, occurring in the region of the central Alps. We performed cline analyses of Y-chromosomal and mtDNA markers across these contact zones to quantify the relative contribution of sex-specific processes and the importance of non-neutral evolution linked to potential incipient speciation in the probably most abundant wild European mammal.

Materials and methods

Sampling of transects

We investigated the different secondary contact zones in the region of the Alps between three evolutionary lineages of the common vole with a total number of 706 voles in transects of 50–160 km length consisting of 23–40 localities each (Figure 1, Table 2, Supplementary Information Table 1). The first transect (Jura) was sampled perpendicular to the Jura mountain ridge, where the Western and the Central lineages meet. The second transect (Grisons) was sampled along the Rhine valley, a contact zone of the Central and the Italian lineages. The third transect (Valais) comprises transitions from the Central to the Western and then to the Italian lineage along the Rhone valley. Samples were obtained as described in Braaker and Heckel (2009).

Figure 1
figure 1

Map of Switzerland showing the sampling localities of Microtus arvalis samples included in cline analyses of the three transects. Transects are described in Table 1. Localities are designated according to the cytb lineage they harbour (white=Central, grey=Western, black=Italian). The Valais transect along the Rhone valley contains a double transition from the Central through the Western to the Italian lineage. Mixed populations (Figure 2) are indicated by bi-coloured circles. Localities closer than two kilometres to each other were merged for this figure.

Table 1 Haplotypes and polymorphic positions in the Y-chromosomal marker (intron SMCY 11; 730 bp) in the studied transects across contact zones in M. arvalis

Molecular markers, restriction fragment length polymorphism analysis and sequencing

We first determined a diagnostic Y-chromosome marker for the Italian, Western and Central lineages as follows: 730 bp of intron 11 of the SMCY gene were sequenced (see below for amplification and sequencing protocol) in 71 voles from across Europe, covering the range of the Western (Spain, France, and Belgium), Central (Switzerland, Germany and Austria) and Italian (Italy, Switzerland) lineages. Six haplotypes were found across Europe with a phylogeographic structure mirroring the parapatric distribution of mitochondrial lineages in M. arvalis (Heckel et al., 2005). These were termed Western Y (three haplotypes), Central Y (two haplotypes) and Italian Y (one haplotype) analogous to the mitochondrial lineages and genetic clusters in multi-locus nuclear markers (Heckel et al, 2005; Braaker and Heckel, 2009). Three of these lineage-diagnostic Y haplotypes defined by four substitutions and one insertion/deletion were found in the transects studied here (Tables 1 and 2).

Table 2 Summary of the three transects across contacts between evolutionary lineages in M. arvalis

SMCY11 fragments were amplified in a reaction volume of 25 μl using the primers SMCY11-f (5′-CTGCCCTGYRCCATGCAT-3′) and SMCY11-r (5′-TCCACCTGTTSMAGRACAT-3′) from Hellborg and Ellegren (2003) developed for M. agrestis. For PCR amplification, an initial step of denaturation at 95 °C for 2 min was followed by 40 cycles of denaturation at 95 °C for 30 s, annealing at 49 °C for 1 min and extension at 72 °C for 90 s. A final step at 72 °C for 10 min was added to complete primer extension. All PCR amplifications were performed in thermal cyclers PCR System 9700 (GeneAmp, Applied Biosystems, Rotkreuz, Switzerland) or PTC-100TM (MJ Research, Watertown, MA, USA). For sequencing, PCR products were cleaned with a GenElute PCR clean-up kit (Sigma-Aldrich, St Louis, MO, USA) and dissolved in 40 μl bi-distilled water. Cycle sequencing reactions were carried out using the Terminator Ready Reaction Mix ‘Big Dye’ 3.1 from Applied Biosystems and the PCR primers. The reactions (10 μl) were performed with the following conditions: denaturation at 95 °C for 50 s, 30 cycles of 96 °C for 10 s, 57 °C for 10 s and 60 °C for 4.5 min. PCR products were cleaned by precipitation according to the Applied Biosystems manual, and separated and detected on an Applied Biosystems Prism 3100 Genetic Analyser.

The mitochondrial Cytochrome b (cytb) gene was amplified following Fink et al. (2004; 2006). The allocation of cytb amplification products to one of the three lineages present in the study region was performed by enzyme digestion with AFLIII and BsrI and restriction fragment length polymorphism analysis. Samples from the Jura transect were analyzed with AFLIII, which cuts the Central, Italian and Eastern lineage cytb but not the Western lineage. Samples from the Grisons transect were analyzed with BsrI, an enzyme cutting cytb fragments of the Western, Central and Eastern lineage twice, whereas Italian samples are only cut once. The samples from the Valais transect were analyzed subsequently with both enzymes to enable allocation to one of the three lineages present. As a control, 20–40 samples per transect were additionally sequenced (Fink et al., 2007; Braaker and Heckel, 2009) to verify lineage allocation by restriction fragment length polymorphism. No disagreements were found. For completeness, published cytb data of individuals from the study region (Fink et al., 2004; Heckel et al., 2005; Braaker and Heckel 2009) were integrated into our analyses. Sequences were aligned using the Clustal W algorithm (Thompson et al., 1997) implemented in BioEdit 7.0.5 (Hall, 1999) and revised manually.

Cline analyses

Contact zones can be described by a sigmoid cline in allele frequencies (p) with a function of cline centre (c) and width (w; defined as the inverse of the maximum slope) following

with x equal to the position of the population along the cline (Szymura and Barton, 1986). Cline-fitting analysis was performed by reducing the location data to one-dimensional transects. The frequency of a lineage in a population was plotted against distance (km) along the transects from the north-westernmost (Jura), the northernmost (Grisons) or the westernmost (Valais) locality. Tanh curves were then fitted for the cytb and SMCY11 data separately using the ‘Fit 1D cline’ operation in the programme ANALYSE version 1.3 (Barton and Baird, 2002). Model parameters (centre and width) were computed using 2000 iterations from 20 different starting points. Likelihood profiles and two log-likelihood support limits (corresponding to 95% confidence intervals; Edwards, 1972) were explored using the ‘cross-section’ option. Cline coincidence (centres at the same position) and concordance (equal cline widths) were explored with a likelihood ratio test (LRT) following the procedure used by Phillips et al. (2004) and adapted by Leache and Cole (2007).

Results

Lineage assignment and marker mismatches

Dedicated sampling along transects allowed the precise localization of the intraspecific contact zones in M. arvalis and a general refinement of the distribution of mtDNA lineages compared with a previous study (Braaker and Heckel, 2009). The distribution of Y-chromosomal lineages was in general agreement with the mtDNA contact zones but with highly significant differences in each transect (Figure 2; Table 3).

Figure 2
figure 2

Plots of lineage frequencies in populations versus position (km) along the transects Jura, Grisons and Valais for mtDNA (white circles) and Y-chromosomal (grey triangles) markers. Sizes of symbols are proportional to the log number of samples. Bold lines are fitted clines for mtDNA and dashed lines for Y-chromosomal markers. Despite a sampling gap in the Valais transect, an admixed population was detected at km 78, pointing to at least partial contact between the Western and the Central lineage in this transect. See Table 2 for cline properties and statistical comparisons.

Table 3 Contact zone cline parameters and LRT results of the null hypothesis that cline centres and widths do not differ significantly between the maternal and parental marker for a given transect

In the Jura transect, 34 of 126 males showed a mismatch between the maternal and the paternal marker of which all but one had a Central Y-chromosome and Western mtDNA. In the Grisons transect, a mismatch between the maternal and paternal marker was detected in 41 of 104 males, all with Italian mtDNA and a Central Y-chromosome. In the Valais transect, our analyses showed a double transition between lineages along the Rhone valley for both markers. Despite extensive sampling attempts, no samples or evidence of the presence of M. arvalis could be obtained between kilometres 18 and 75 of the transect, separating the Central and Western mtDNA (Figure 1). Suitable habitats for the common vole are very scarce in this region due to intensive human usage. A mismatch between the maternal and paternal marker was detected in 28 out of 55 males sampled along the Valais transect. Two males had Western mtDNA and a Central Y-chromosome, whereas 26 had Western mtDNA and an Italian Y-chromosome. In the Valais transect, the Western Y-chromosome was found exclusively in combination with Western mtDNA.

Cline analyses

Cline shape analyses revealed significant cline centre shifts (non-coincidence) in all three transects and differences in cline widths (non-concordance) in two of the three transects (Figure 2; Table 3). In the Jura transect, the LRT revealed a significant north-westward shift of the Y-chromosomal cline relative to the mtDNA cline (χ12=129.398; for df=1 the critical value at P=0.01 is 6.635). Moreover, the mtDNA cline was significantly wider (χ12=34.342). The Grisons transect also displayed non-coincident (χ12=69.990) and non-concordant (χ12=56.774) clines. The Y-chromosomal cline was narrower than the mtDNA cline and the centre shifted to the South. In the Valais transect, analyses were performed separately for the two transition zones (Central–Western and Western–Italian). Centres and widths of the paternal and maternal cline of the Central–Western contact zone did not differ significantly in this part of the transect containing mostly unsuitable vole habitats. However, several males with a Central Y-chromosome were detected higher up in the Rhone valley at kilometre 78 in a population with only Western mtDNA, which was several dozens of kilometres away from the nearest known Central Y males along the transect (Figure 2). Coincidence of the Western–Italian contact zone was highly significantly rejected (χ12=52.890), whereas there was no evidence that the widths of the clines differed (χ12=0.000). Further analyses according to age classes of voles provided no evidence of differences in the position of centres or widths of clines between adults and juveniles in any transect (details not shown).

Discussion

Our analyses of sex-specific genetic markers across multiple contact zones revealed strong evidence for partial reproductive isolation between highly divergent evolutionary lineages within the common vole, M. arvalis. Cline analyses strongly suggest sex-specific processes acting in the contact zones consistent with the effect of a unidirectional manifestation of Haldane’s rule.

Evidence for selection

Very narrow clines for the Y-chromosome in comparison to mtDNA suggest the action of selection on common vole males. The width of a cline across a hybrid zone is theoretically shaped by dispersal, which tends to widen it, and selection, which may narrow it (Barton and Hewitt, 1985). In M. arvalis, dispersal is male-biased like in most mammals (Hamilton et al., 2005; Schweizer et al., 2007; Hahne et al., 2011). Thus, without the action of selection, paternally inherited markers are expected to show wider clines across the contact zones than maternally inherited ones. At odds with this neutral expectation, Y-chromosome clines were significantly narrower or at most equal to mtDNA clines, suggesting selective forces acting on males. At present, the mechanisms leading to this pattern are unknown (see below), but it is compatible with Haldane’s rule predicting negative fitness effects for the heterogametic sex due to recessive X-linked mutations or epistatic interactions between the Y-chromosome and another part of the genome (Haldane, 1922; Turelli and Orr, 2000; Coyne and Orr, 1998, 2004).

These patterns in M. arvalis are similar to other small mammal systems where speciation is ongoing or was recently completed, but where evolutionary divergence is much older and phenotypic differentiation between the taxa has been established. For example, in the European hybrid zone between M. m. musculus and M. m. domesticus (diverged >700 000 generations ago; She et al., 1990) the transition of sex chromosomes is similar in width to M. arvalis (for example, <4 km) and mtDNA clines are also wider (for example, Dod et al., 1993; Raufaste et al., 2005; Macholan et al., 2007, 2008). This is likely a consequence of Haldane’s rule in place in the hybrid zone, which may—among other factors—prevent the mouse Y-chromosome cline width to increase (Britton-Davidian et al., 2005; but see Turner et al., 2012). The rapidly speciating Microtus genus harbours several other examples of restricted hybridization and evidence for selection acting prevalently on males—typically associated with morphological and karyotypic differences between the involved sibling species or subspecies (Meier et al., 1996; Jaarola et al., 1997; Bulatova et al., 2010; Bastos-Silveira et al., 2012; Beysard et al., 2012). Among the cryptic evolutionary lineages in M. arvalis however, partial reproductive isolation appears to be additionally acting in an unusual unidirectional manner.

Asymmetrical sex-specific processes

The second sex-specific process across the contact zones is the shift between maternal and paternal clines, leading to an asymmetrical pattern of mismatches. These mismatches may be the result of two main processes: introgression of mtDNA and/or introgression of the Y-chromosome. First, a range expansion of one lineage may have left behind a trail of introgressed mtDNA in the other lineage (Buggs, 2007 for a review). Simulation studies have shown that a replacement of one taxon by another may result in the introgression of selectively neutral local genes into the invading taxon, and that markers transmitted by the least dispersing sex would be most introgressed (Currat et al., 2008; Petit and Excoffier, 2009). In our case, this scenario would have resulted in asymmetrical introgression of mtDNA. However, this scenario would require a general fitness advantage of one lineage over the other, for which there is currently no support given the apparent absence of phenotypic differences between the lineages. mtDNA might also introgress owing to a selective advantage into other taxa (examples in Currat et al., 2008 and Petit and Excoffier, 2009), but there is no evidence of mtDNA introgression or of a selective advantage of particular mtDNA lineages (Fink et al., 2004; Hamilton et al., 2005; Braaker and Heckel, 2009; Fischer et al., 2011). In addition, very high diversity of mtDNA haplotypes in populations and lineages (Heckel et al., 2005; Braaker and Heckel, 2009) argues against a recent mtDNA sweep, but additional analyses together with the nuclear genome will be necessary for dedicated testing of mtDNA introgression in the hybrid zones.

The alternative main explanation for asymmetrical mismatches is the potential existence of asymmetric pre- or post-zygotic mechanisms, leading to an introgression of the Y-chromosome in the opposite direction. Pre-zygotic mechanisms might consist for example, in mating preferences of females from both lineages for a certain Y-chromosomal lineage or in specific male–male interactions (for example, aggressive behaviour; Roubertoux et al., 1994). A potential post-zygotic mechanism in M. arvalis would be a unidirectional manifestation of Haldane’s rule with incompatibilities occurring only in backcrosses, involving hybrid males with one sex chromosome type but not the other. Mating and breeding experiments involving these M. arvalis lineages will allow to test for behavioural mechanisms or genetic incompatibilities acting as selective forces.

A possible unidirectional manifestation of Haldane’s rule was suggested by Gavrilets (1997) based on theoretical work. Non-coincident marker clines similar to the ones in M. arvalis in a particular region of the European house mouse hybrid zone are compatible with such a process as well (Macholan et al., 2008). The M. m. musculus Y-chromosome and to a lesser extent X-chromosomal markers and mtDNA are here more introgressed into the M. m. domesticus territory than the rest of the genome (Macholan et al., 2008; 2011). The authors suggested that this unusual pattern resulted from genetic conflict between the sexes, which may promote unidirectional sex-specific gene flow between subspecies. To date, this pattern has only been detected in one of the studied transects and the general importance of stochastic vs deterministic effects in the house mouse hybrid zone deserves further investigation (Macholan et al., 2011; Turner et al., 2012). A potentially promising similar case was found in Scandinavia, where 15 mice from seven M. m. domesticus populations were introgressed with the M. m. musculus Y-chromosome (Jones et al., 2010). Our analyses, however, display a highly consistent pattern of very sharp Y-chromosomal clines shifted relative to mtDNA clines in several hybrid zones between the morphologically cryptic lineages of M. arvalis. At present, it is unclear if similar forces are acting in the different contact zones, but additional X-chromosomal and autosomal markers will allow a more detailed characterization of introgression patterns in the future. This may also reveal particular differences between hybrid zones, potentially depending on the evolutionary lineages involved and their level of divergence.

Conclusion

Our analyses of multiple hybrid zones provide evidence for sex-specific processes, which are compatible with strong selection acting at the contact between evolutionary lineages in the common vole. The mechanisms contributing to partial reproductive isolation and the extent of actual hybridization in natural populations remain currently unknown. However, more detailed population genetic analyses covering the nuclear genome widely combined with classical experimental crosses and mate-choice experiments, and the analysis of less-diverged lineages (for example, Eastern) have the potential to inform us about causes and consequences of asymmetric gene flow in hybrid zones of the probably most abundant wild European mammal.

Data archiving

The SMCY11 sequences as well as novel cytb haplotypes have been deposited in GenBank with accession numbers KC283238–KC283240 and KC349131- KC349136, respectively.