Introduction

Natural hybridization has long raised the interest of evolutionists as it represents a unique source of information about the mechanisms of reproductive isolation, which can lead to speciation (Coyne and Orr, 2004). In particular, the maintenance of species integrity in the face of gene flow remains a central outcome in evolutionary biology (Sites and Marshall, 2003; Hey, 2006; Baack and Rieseberg, 2007; Petit and Excoffier, 2009). The two main models proposed to describe the dynamics of the hybridization process, the Tension zone model and the Mosaic zone model, account for many situations observed in nature.

Barton and Hewitt (1985) have championed the concept of Tension zone in which clines are maintained by a balance between parental effective migration and selection against hybrids. The low fitness of hybrids is caused by the disruption of intragenomic co-adaption (that is, endogenous selection). The tension zone model performs well in situations of parapatry and when zone width is small relative to the breeding range of the two taxa (Endler, 1977). In contrast, the mosaic zone model (Harrison, 1986; Howard, 1986) assumes that two species segregate between two habitat types. Differential selection caused by ecological factors (that is, exogenous selection) maintains species integrity by limiting the possibility of successful dispersal into the other habitat.

Although the tension zone and the mosaic models describe a wide range of situations encountered in animals and plants, they are not universal and their assumptions are not always met (Harrison, 1993). Predictions about hybridization dynamics become less clear (i) when the zone of sympatry covers a large fraction of the breeding ranges of focal species, and (ii) when species are frequently found in syntopy. In other words, classical models are not suitable for cases wherein species exhibit broad geographical and ecological overlap. A plausible reason why these situations did not raise strong interest in previous studies is that hybridization is expected to occur at a very low rate between broadly sympatric taxa and that reproductive or other barriers are strong enough to maintain species integrity. Hybridization events can thus be viewed as having no biological significance in these cases. However, a broad overlap zone presents a large interface surface between two species so that even if hybridization events are locally rare, they are not necessarily so at the zone scale. In addition, exogenous selection against hybrids is expected to be negatively related to the degree of ecological overlap between species. The more the two species ecologically overlap, the less environmental components of fitness will favour parental forms over hybrids. In such situations, the mechanisms responsible for the maintenance of species integrity in mixed populations are of particular interest.

Several broad hybrid zones have been studied (Roques et al., 2001; Höbel and Gerhardt, 2003; Borge et al., 2005; Turner et al., 2005; Cooley et al., 2006; Duvernell et al., 2007; Fuller, 2008; Jang et al., 2009). However, in these studies attention was largely given to cases of invasion by an allochthonous or clonal species into the distribution of an indigenous/wild species (Rubidge and Taylor, 2004; Duputié et al., 2007) or a recent range expansion (Martin and Cruzan, 1999). Studies investigating cases of ancient contact between autochthonous taxa remain rare. Such studies would bring new insights to our understanding of the mechanisms involved in the maintenance of species integrity in setups that differ from the classical tension or mosaic zones.

We investigated hybridization and introgression between the palmate newt Lissotriton helveticus and the smooth newt Lissotriton vulgaris. These related taxa diverged approximately 44 million years ago (mya) (Steinfartz et al., 2007) to less than 20 mya (Weisrock et al., 2001). The last secondary contact probably formed after the last glaciation (Spurway and Callan, 1960). The zone of sympatry extends over a thousand kilometres in longitude and latitude in Europe (IUCN, 2008) and is broadly relative to the breeding range of each species (Figure 1) (72% of L. helveticus range and 11% of L. vulgaris range is sympatric). Despite both species having different ecological requirements (Denoël and Ficetola, 2007, 2008), they frequently breed in the same water bodies (Arntzen, 1986; Griffiths, 1987) where extensive overlap in microhabitat selection, diet and breeding period is observed (Griffiths, 1986, 1987; Chadwick et al., 2006). Heterospecific courtship is relatively common in the lab, which suggests that premating barriers are not strong (Wambreuse and Bels, 1984; Roberts, 1990; Secondi et al., 2010).

Figure 1
figure 1

Sampled sites, distribution of Lissotriton helveticus and L. vulgaris and location of the zone of sympatry (from IUCN, 2008). The outlet shows the study site where we investigated patterns of introgression at a local scale (BD CARTHAGE IGN, Paris, France 2005). White circles indicate sampled sites where only L. helveticus were found and black triangles indicate sampled sites with only L. vulgaris. Grey rectangles indicate syntopic sites. Stars indicate the locations of hybrids as reported in the literature (a, Griffiths et al., 1987; b,c, Arntzen et al., 1998; d,e, Beebee et al., 1999; f, Schlüpmann et al., 1999).

Hybrids have been found in several locations (Griffiths et al., 1987; Arntzen et al., 1998; Beebee et al., 1999; Schlüpmann et al., 1999) (see Figure 1). Laboratory experiments suggested that if hybrids were less viable and fertile than the progeny of homospecific crosses (Spurway and Callan, 1960; Cogălniceanu, 1994), they could backcross and produce a viable offspring (Benazzi, 1957). An earlier study detected introgression in three areas of the sympatric zone (Arntzen et al., 1998). Only F1 hybrid males have been detected in wild adults so far (Griffiths et al., 1987; Arntzen et al., 1998; Beebee et al., 1999; Schlüpmann et al., 1999). The fact is striking as hybrid females are expected to be more viable than hybrid males (Haldane, 1922), males being the heterogametic sex in Lissotriton (Hillis and Green, 1990). Our current knowledge of the hybridization process between the two species may be strongly biased because of the difficulty in identifying phenotypic hybrids in one sex. In contrast to males that are readily identified by secondary sexual traits, females of each species look alike, and overlap for several traits such as colour pattern of underparts (Johanet et al., 2009b). Thus, female hybrids are difficult to identify and female backcrosses are almost impossible to distinguish from parental species (Naisbit et al., 2003) by morphological criteria alone. Moreover, in another study of hybridization between the small-bodied newts L. vulgaris and L. montandoni, even introgressed males were often indistinguishable from parental species (Babik et al., 2003).

By conducting a large scale study on hybridization between L. helveticus and L. vulgaris we addressed several questions that arise from two main objectives. First, we revaluated introgression and compared the efficiency of hybrid detection from morphometric and genotypic methods. We expected genotypic data to be more discriminant than morphometric ones. We also tested the effect of sex and species on introgression level. According to Haldane's rule (Haldane, 1922), we expected females to be more introgressed than males, as males are the heterogametic sex in Lissotriton (Hillis and Green, 1990). Second, we explored the maintenance of species integrity by comparison of introgression levels at different biogeographical scales. (i) We compared introgression levels inside the breeding ranges of the two species by sampling distant areas with different syntopic contexts (allopatry vs sympatry). We expected that limited introgression outside sympatry would maintain species integrity. (ii) To analyse introgression dynamics at the local scale (allotopy vs syntopy), we selected an area located at the southern edge of the sympatric zone. This area represented a unique configuration because L. vulgaris is always encountered syntopically with L. helveticus in floodplain valleys; the latter species is also found in allotopy in valley or plateau sites. The nesting of L. vulgaris habitat within L. helveticus habitat is an extreme case of ecological overlap between two hybridizing species. In this study area, the occurrence ratio is biased towards L. helveticus, a situation that tends to maximize hybridization rate (Hubbs, 1955; Randler, 2002; Secondi et al., 2006). Here again, limited introgression outside syntopic sites is expected.

Materials and methods

Sampling strategy

In order to estimate the overall hybridization rate, we collected DNA samples from 37 ponds. One pond, Liré, was sampled two years in a row (2007 and 2008). Five sites, from three and two distinct areas for L. helveticus and L. vulgaris, respectively, were located in allopatry. In addition, 32 ponds from 10 distinct areas were sampled in the sympatric zone. Among these, 10 ponds were allotopic and 22 were syntopic. In total, 928 L. helveticus and 344 L. vulgaris were sampled (Figure 1 and Table 1).

Table 1 Locality, context (syntopy, allotopy), sample size, sex-ratio, observed and expected unbiased heterozygosities for Lissotriton helveticus and L. vulgaris

In order to analyse genotypic and phenotypic introgression at a local scale, we increased our sampling effort in one area. Twenty out of the 32 sympatric ponds were located in the Loire river valley and adjacent plateaus between Saumur and Nantes (Figure 1 and Table 1).

Genetic analysis

Microsatellite analysis

We analysed DNA from the 928 L. helveticus and 344 L. vulgaris individuals. Individuals were toe- or tailed-clipped according to the capture permit provided for each sampling area. DNA was extracted following Chelex procedures (Walsh et al., 1991). We used six microsatellite markers to estimate introgression level: Th09, Th14, Th27, ThCa14, Tv3Ca9, Tv3Ca19 (Johanet et al., 2009a). PCR amplifications were performed using fluorescent-labelled forward primers and 10-μl reaction volumes containing 10 ng genomic DNA, 0.4 μM of each primer, 0.1 mM of each deoxynucleotide triphosphate, 2.0 mM MgCl2, 0.2 units Taq DNA polymerase (Promega, Charbonnières-les-Bains, France). Reactions were performed in a MJ Research PT200-Tetrad thermal cycler (GMI, Ramsey, MN, USA): 4 min at 95 °C, 35 cycles consisting of 30 s at 95 °C, 30 s at Tm °C (see Johanet et al., 2009a), 30 s at 72 °C, then 5 min at 72 °C. PCR products were separated on an ABI Prism 3130xl DNA Analyser.

We used MICRO-CHECKER to check microsatellite data for null alleles (Oosterhout et al., 2004). Genetic differentiation between species was calculated using FST estimates (Weir and Cockerham, 1984), and tested by permutation test using GENETIX (Montpellier, France, Belkhir et al., 2004). We ran a factorial correspondence analysis (CA) using GENETIX (Belkhir et al., 2004) on ponds with no a priori species identity. CA is particularly well suited for hybridization studies as it generally extracts between-species differentiation on the first axis, and within-species population differentiation on further axes (Daguin et al., 2001). Interestingly, CA simultaneously expresses the genetic variation in the data set and the respective contributions of each allele to the variation.

Hybrid identification

We used two Bayesian-Markov Chain Monte Carlo (MCMC) methods to detect hybrids, NEWHYBRIDS (Anderson and Thompson, 2002, avalaible at http://ib.berkeley.edu/labs/slatkin/eriq/software/software.htm) and STRUCTURE (Pritchard et al., 2000, avalaible at http://pritch.bsd.uchicago.edu/structure.html). Neither of them requires a priori assignment of alleles to taxonomic entities and both use markers that are not necessarily fixed in either parental species. Moreover, each method can accommodate null alleles.

STRUCTURE (Pritchard et al., 2000) was used to estimate the most likely number of genetic entities (that is, clusters) in the sample and then individual admixture proportions. The number of clusters in our data set was estimated by pooling all samples and calculating the probability that each individual belong to one of K clusters. We assumed that K ranged from one to six clusters, considering the fact that pure parents and various classes of hybrids (F1, F2 and backcrosses) can represent distinct genetic entities. Individual admixture proportions (that is, ancestry estimates) were calculated by assuming an admixture model (that is, allowing the genetic composition of individuals to be a mixture from different populations), using 10 000 iterations for burn-in and 10 000 iterations for parameter estimation. This was repeated 20 times at each K-value. The most likely K-value was estimated using the method proposed by Evanno et al. (2005). We used a threshold of 90% to assign an individual to one parental group or another (Duputié et al., 2007).

Additionally, we used NEWHYBRIDS (Anderson and Thompson, 2002) to estimate the posterior probability that an individual falls into one of the six following categories: pure parental (L. helveticus or L. vulgaris), F1, F2 and backcrosses to one parental species or another. The whole data set was analysed after 10 000 iterations. We assigned an individual to one of the parental groups when its inferred proportion of genes belonging to the group exceeded 90% (Smulders et al., 2008). Otherwise, individuals were considered as hybrids of first or further generation.

We tested for the effect of species and sex on hybrid frequency using log-linear analyses (StatSoft, 2007). This method analyses multifactor cross-tabulations by partitioning the variation in frequency data into the different factors, testing for interactions in a way similar to the analysis of variance.

Mitochondrial analysis of putative hybrids

We genotyped mitochondrial DNA of putative hybrids to identify their maternal parents. We used BioEdit (Hall, 1999) to align sequences from the control region of Lissotriton boscai (Martínez-Solano et al., 2006) and Primer 3 (Rozen and Skaletsky, 2000) to design primers for the amplification of an approximately 300 bp sequence. PCR conditions were the same as for Lissotriton boscai, with 0.4 μM of each primer (DloopboscaiF: CAGCTGGTATGAATCTATGA, DloopboscaiR: CTTAGAAAAACTAGCTCCGG). The annealing temperature was 51 °C. PCR products were then separated on a non-denaturing polyacrylamide gel to screen single-strand conformational polymorphism (Fujita and Silver, 1994). Amplified fragments were mixed with an equal volume of sample buffer, denatured at 95 °C for 5 min, and cooled on ice. A volume of 10 μl of the mix was deposited on a 8% non-denaturing gel containing 14 ml of 37.5% polyacrylamide-1% bisacrylamide mixture, 7 ml of glycerol 0.5 X, 7 ml of TBE 5 X (450 mM Tris-borate, 10 mM EDTA at pH 8) and 42 ml of H2O. Electrophoresis ran at a constant voltage of 8 W for 16 h in l X TBE electrophoresis buffer. Silver staining revelation followed Creste et al. (2001). Gels were washed in 2 l cold (4 °C) fixing solution (10% absolute ethanol, 1.5% acetic acid) for 10 min. Washed gels were soaked for 3 min in 2 l cold solution with 1.5% nitric acid, then washed and impregnated for 10 min in a 2 l cold solution of 1 g l−1AgNO3. Gels were rinsed quickly (20 s) once with 2 l distilled H2O and developed in a 30 g l−1 Na2CO3 solution with 40% formaldehyde until the bands appeared with a sufficient intensity.

Introgression analysis

Even low hybridization, as expected in Lissotriton (Arntzen et al., 1998), can lead to substantial introgression (Goodman et al., 1999). Here we used a method that is appropriate when hybridization is rare. It separates the contribution of ancestral polymorphism from current hybridization to apparent introgression, allowing no strictly diagnostic markers (Barton, 2000). Microsatellite alleles at a single locus (excluding those with null alleles) were clustered in two groups according to their first axis coordinate on the CA. We defined two species-specific synthetic alleles: Lh, characteristic of L. helveticus-like individuals and Lv, characteristic of L. vulgaris-like individuals. They also corresponded to the alleles inferred by STRUCTURE to be characteristic of clusters 1 and 2, respectively, when we fixed K=2 clusters. Synthetic alleles were used to compute between-genome deviations from Hardy–Weinberg equilibrium (κ1,1), and within-genome pairwise associations (κ0,2, linkage disequilibrium) following the method described by Barton (2000), using MATHEMATICA 5.0 (Wolfram Research, Inc, Champaign, IL, USA). We expected between-genome and within-genome pairwise associations to be at their maximum values when two syntopic taxa do not hybridize. As recommended by Barton (2000), the likelihood of different nested models was estimated. We assumed first that the population is in Hardy–Weinberg and linkage equilibria, then allowed for pairwise associations between-genomes (Hardy–Weinberg disequilibrium), within-genomes (linkage disequilibrium) or both, and finally for higher-order associations. The same sequence of models was repeated allowing different contributions of each locus to the association. The most appropriate model was chosen using the Akaike information criterion under which model 2 is more likely than model 1 if log(L2)-2n >log(L1), wherein n is the number of additional parameters used in model 2 relative to model 1. A 95% confidence interval was estimated using a random walk algorithm with parameter T (temperature of the simulated annealing) initially set up to 1 (Barton, 2000). This method does not need to use reference populations that represent parental gene pools.

We calculated for each pond the difference between the observed value for Hardy–Weinberg disequilibrium and linkage disequilibrium compared with their theoretical maximum value by the formula κmaximum−κobservedmaximum. In order to detect an influence of sex-ratio on interspecific matings, we then tested for a relation between Hardy–Weinberg and linkage equilibria and sex-ratio in each population using a linear model using R 2.6.1 (R Development Core Team, 2007, http://www.r-project.org/).

Phenotypic analysis

In order to phenotypically detect hybridization, we measured a sample of 560 adults from the Loire river area composed of 389 L. helveticus (181 males and 208 females) and 171 L. vulgaris (59 males and 112 females). Adult newts were caught in ponds from March to May in the year 2006 and 2007 using dipnets (capture and release permit provided by Préfectures du Maine-et-Loire and Loire-Atlantique). All newts were anaesthetized before manipulation by immersion in Tricaine methane sulphonate (0.2 g l−1), MS222, for duration of 5–10 min. Measurements were taken the same day or the day after capture. Four characters associated with body size and secondary sexual trait development were measured to the nearest 0.01 mm using a digital calliper: snout-to-vent length, maximum tail height, cloaca width and filament length (for details on the measurement method, see Secondi et al., 2007). Characters were measured twice and the mean value of the two measurements was recorded. We also scored the throat pattern by coding ‘0’ for a clear throat (typical of L. helveticus) to ‘4’ for a completely marbled throat (typical of L. vulgaris). These traits are known or likely to discriminate the two species (Griffiths and Mylotte, 1988; Johanet et al., 2009b). Individuals were subsequently released to their original capture sites.

Principal component and discriminant analyses were performed using STATISTICA (StatSoft, 2007) to determine whether morphological clusters could be detected without any a priori hypothesis of group membership. Sexes were treated separately because of sexual dimorphism for all traits. To check for a relationship between genotype and phenotype we tested for a correlation between individuals scores in principal component analyses (phenotypic score) and CA (genotypic score) for each sex and species using linear mixed-effects models with population as a random effect in R 2.6.1.

Results

Genetic analysis

Hybrid identification

Polymorphism levels of the six microsatellite markers ranged from 12 to 35 alleles per locus with an average of 25 alleles in both species. Two microsatellite loci, Th27 and Th14, were fully diagnostic. The proportion of private alleles was 41% in L. helveticus and 39% in L. vulgaris. After analysis with MICRO-CHECKER two loci showed null alleles (Th27 and Th09). Genetic differentiation was significant (ST=0.263, P=0.000) between the two taxa. Consistently, factorial CA on microsatellite data supported the genetic distinctiveness of L. helveticus and L. vulgaris by separating two clusters along the first axis (Figure 2). The large eigenvalue (analogous to a partial FST estimate) of the first axis, 25.13%, was indicative of a strong differentiation among samples. Samples from each species formed a scatterplot; however, the two 99% ellipsoids partially overlaid. A few individuals of each species, according to their phenotype, were located on the edge or within the ellipsoid of the other species. Interestingly, variation on both axes was much larger in L. vulgaris (x-values ranging from 0 to 2200 approximately) than in L. helveticus (x-values ranging from −800 to 0).

Figure 2
figure 2

Scatterplot of the two first axes of a factorial correspondence analysis based on multilocus analysis of 927 Lissotriton helveticus (open circles) and 343 L. vulgaris (black filled triangles) sampled over the breeding ranges of the two species. The dotted and plain ellipsoids represent 99% confidence intervals for L. helveticus and L. vulgaris, respectively. Grey open squares represent L. helveticus-like hybrids and grey filled squares L. vulgaris-like hybrids. The two asterisks indicate the L. helveticus-like hybrid with a L. vulgaris mitochondrial DNA and the L. vulgaris-like hybrid with a L. helveticus mitochondrial DNA.

Cluster analysis using STRUCTURE revealed that the most likely number of populations K in our sample was two (Appendix 1). 98.1% of all individuals (1248/1272) were classified as L. helveticus or L. vulgaris. The remaining individuals (24/1272) were considered as hybrids. Figure 3 illustrates the distribution of individuals according to their ancestry estimates (posterior probability of belonging to L. vulgaris). It shows a clear segregation of allopatric individuals. All individuals with intermediate genotypes come from the sympatric zone, 21 from syntopic sites and three from allotopic sites.

Figure 3
figure 3

Distribution of ancestry estimates from STRUCTURE analysis (proportion of individual genomes derived from Lissotriton vulgaris) in L. helveticus and in L. vulgaris in different ecological contexts.

According to our classification criterion and in agreement with STRUCTURE assignments, NEWHYBRIDS assigned 98.3% of all individuals (1251/1272) to a parental group, L. helveticus or L. vulgaris, and 1.7% (21/1272) of all individuals to the remaining four groups. NEWHYBRID assigned none of these individuals to any one of the hybrid classes with a high probability. For instance, no F1 was detected and five individuals were assigned with a probability higher than 30% to the F2 class, all of them possessing private alleles from the two species. Results may reflect the inability of NEWHYBRID to identify the correct hybrid class, a known weakness of this method when several generations of backcross occur (Anderson and Thompson, 2002). Focusing on the two diagnostic loci, five hybrid individuals clearly showed two private alleles for one species at one diagnostic locus and one L. helveticus allele plus one L. vulgaris allele for the other diagnostic locus, suggesting that recombination occurred.

Outputs given by the two Bayesian clustering methods were congruent. According to our criterion, a total of 25 hybrids were identified by NEWHYBRIDS and STRUCTURE. Twenty of them were identified by both methods using the same assignment threshold for posterior probabilities. Among the five remaining individuals, four were identified by STRUCTURE only and one by NEWHYBRIDS only. These five individuals were detected by both methods when we raised the assignment threshold to 95%.

We used these data to compute overall hybridization rate and its geographical variation. All hybrids were captured in sympatry from five distinct areas. Hybridization rate did not vary largely between areas: 2.4% (18/743 in 12 ponds) in Pays-de-la-Loire, 3.4% (1/29 in one pond) in Pas-de-Calais, 1.0% in Oise (1/97 in one pond), 2.6% (1/39 in one pond) in Sussex and 3.8% (1/26 in one pond) in Lorraine. All but three hybrids originated from syntopic ponds. Overall hybridization rate was 1.7%. It was 2.9% if we only consider syntopic sites. Note that we excluded from this computation individuals from site Liré 2008 (three hybrids were detected) that is a replicate of Liré 2007 (three hybrids were detected the year before).

Introgression analysis

Models assuming pairwise associations between genomes (Hardy–Weinberg disequilibrium) and within genomes (linkage disequilibrium) were the most appropriate. Pairwise associations of alleles at a locus between genomes, κ1,1 and across loci within a genome, κ0,2 were presented as a function of synthetic Lv allele mean frequency in Figure 4. κ1,1 and κ0,2 could not be estimated for all 13 allotopic L. helveticus sites, the Swedish allopatric L. vulgaris site and six syntopic sites because at least one allele was fixed in these sites. Theoretical values for these parameters were used in Figure 4. No departure from maximum disequilibrium was observed among L. vulgaris from allopatry (Norwegian population). Sixteen sites showed significant deviation both from Hardy–Weinberg, κ1,1 and linkage, κ0,2 disequilibrium maximum values, indicating that interspecific gene exchanges occurred. All of them were syntopic. Each sample of the site of Liré (2007 and 2008) was treated as separate populations and both showed significant introgression. Introgression was thus detected in 73% (16/22) of all syntopic sites.

Figure 4
figure 4

Pairwise associations within-genomes, κ0,2 (corresponding to an average pairwise linkage disequilibrium) and between-genomes, κ1,1 (corresponding to an average Hardy–Weinberg disequilibrium) estimated following the method of organized as a function of Lissotriton vulgaris allele frequency. Samples of L. helveticus in allotopy are represented by white circles, sample of L. vulgaris in allotopy are represented by black triangle. Samples in syntopy are represented by grey rectangulars for sites in Pays-de-Loire and by grey diamonds for sites elsewhere in Europe. Theoretical maximum values of κ0,2 and κ1,1 are represented by curves. Vertical bars represent 95% confidence intervals.

Sex, species and hybridization

Among the 25 newts identified as hybrids, nine were phenotypically classified as L. helveticus (six males and three females) and 16 as L. vulgaris (5 males and 11 females). Hybrid frequency was 4.8 times lower in L. helveticus-like individuals (1.0%) than in L. vulgaris-like individuals (4.7%). We used a log-linear model to test the effect of species and sex on hybrid frequency. The full model (model with both effects included) gave a significantly better fit to the data than the reduced model (model with no effect) (ΔLog-likelihood=7.8484, χ2=15.6968, df=3, P<0.0013). We observed more hybrids in L. vulgaris-like than in L. helveticus-like individuals (χ2=16.569, df=1, P<0.0001). No effect of sex (χ2=0.325, df=1, P=0.568) and no interaction between species and sex (χ2=0.499, df=1, P=0.478) were detected.

Interestingly, we observed significant contrasting sex-ratio values between our main study area (Loire river valley) and the other sites (Appendix 2). Sex-ratios were more significantly female biased in the Loire valley than elsewhere in Europe for L. helveticus (pMann−Whitney=0.001) and for L. vulgaris (pMann−Whitney=0.004). The difference was particularly strong for L. vulgaris. These differences are independent of the stages of the breeding season as our sampling scheme is composed of several spatio-temporal replicates. We tested the relationship between deviations from maximum values of κ1,1 and κ0,2, and site sex-ratio. Deviation from κ0,2 was correlated with sex-ratio in L. helveticus (F=5.019, P=0.047), whereas deviation from κ1,1 was correlated with sex-ratio in L. vulgaris (F=5.440 P=0.042). According to the coefficient signs (−0.550 and −1.038), the more interspecific gene exchanges occurred in the population (that is, the more linkage or Hardy–Weinberg disequilibrium decreased), the more sex-ratio was biased to the detriment of males.

Mitochondrial DNA analysis

Mitochondrial DNA analysis was carried out on 168 allopatric individuals and the 25 hybrids. Based on the 168 reference samples, we found two species-specific haplotypes for each species. Mitochondrial DNA matched phenotype for all allopatric individuals and for most hybrids. Mismatch between phenotype and mitochondrial DNA was observed only in one out of nine L. helveticus-like hybrids, and one out of 16 L. vulgaris-like hybrids (see Figure 2). These two individuals were males. Mismatch rate did not differ between species (Fisher exact test, P=0.581).

Phenotypic analysis

The Principal component analyses based on morphological traits (Figure 5) clearly separated males and females of the two species as 99% confidence ellipses did not overlap. Note that species discrimination was supported by Axis 1 for males (47.28% of phenotypic variation) and Axis 2 for females (axis 1=48.96%, axis 2=31.27%). Species discrimination was significant for each sex (males: Wilk’s-Lambda=0.0758, P<0.0000; females: Wilk’s-Lambda=0.0240 P<0.0000). Discriminant function analysis classified all males and females to the initial group they have been assigned to by observers.

Figure 5
figure 5

Scatterplot of the two first axes of a principal component analysis based on phenotypic traits measured in (a) 240 males and (b) 320 females of Lissotriton helveticus (white circles) and L. vulgaris (black triangles) sampled in the Loire valley. Note that dotted and plain ellipsoids represent 99% confidence intervals for L. helveticus and L. vulgaris, respectively.

We tested for the relationship between phenotype and genotype using principal component analyses -scores and CA-scores of individuals on axes responsible for species discrimination. We observed neither significant relationship in male and female L. helveticus, nor in male L. vulgaris. In contrast, we detected a significant positive relationship in female L. vulgaris (coefficient=225.4640, standard error (SE)=72.2812, df=4, F=9.7298, P=0.0024). This result means that the more genetically intermediate L. vulgaris females were, the more they resembled to L. helveticus females.

Discussion

In a situation of large geographical and ecological overlap, we quantified introgression between two newt species, tested for the effect of sex and species and sought to explain the maintenance of taxonomic integrity by studying different areas and syntopic contexts across their distribution.

Hybrid frequency

We used two assignment methods, NEWHYBRIDS and STRUCTURE, that identified hybrids left undetected by morphometry. No F1 hybrids were found but different levels of admixture were observed. Because of the low number of diagnostic loci (Vähä and Primmer, 2006) and the limits of the Bayesian models (only recent hybrids can be found, Anderson and Thompson, 2002), assignment methods are not expected to detect all hybrids. Nevertheless, results given by NEWHYBRIDS and STRUCTURE were congruent and the same individuals were identified as hybrids provided assignment threshold was slightly elevated for NEWHYBRIDS.

Our estimation of hybrid frequency in adults was two to three times higher (1.7%) than the frequency of alien genes (0.07%) observed earlier in larvae (Arntzen et al., 1998). The discrepancy between the two studies may reflect either better survival of hybrids to metamorphosis or properties of the markers used (microsatellites vs allozymes). The first hypothesis is not confirmed by experimental crossings (Spurway and Callan, 1960). The second hypothesis is more plausible. A high variance among markers in levels of differentiation between species is expected when the presence of semipermeable genetic barriers to gene exchange is suspected (Bierne et al., 2003). The small sample size of markers used in both studies (two enzymatic diagnostic markers and six microsatellites) may explain this high variance between markers and therefore the apparent discrepancy.

We found no morphological evidence of hybridization between the two newts. Phenotype distribution was strongly bimodal and discriminant analysis classified all individual in the group they have been assigned to in the field. It seems unlikely that morphology alone would be enough to survey the hybridization process as it is often difficult to discriminate introgressed from pure individuals in the wild (Naisbit et al., 2003; Thomas et al., 2008). Results highlighted the fact that most hybrids remain phenotypically undetected. Males’ sexual morphology is highly differentiated whereas females are very similar. It is thus not surprising that only hybrid males have been observed by others in the field (Griffiths et al., 1987; Arntzen et al., 1998; Beebee et al., 1999; Schlüpmann et al., 1999), even if both species are among the most common newts in Europe.

Species and sex asymmetric introgression

Mitochondrial analysis revealed that both maternal lineage L. helveticus and L. vulgaris occurred in hybrids. Consistently, nuclear gene flow was bidirectional between the two species. However, we found a significant asymmetric pattern of nuclear introgression as the proportion of hybrids or introgressed individuals was five times higher in L. vulgaris than in L. helveticus. The range of scores on the first CA axis was also much larger for L. vulgaris than for L. helveticus, which is consistent with the fact that a larger proportion of L. vulgaris was involved in heterospecific crosses than L. helveticus. Several ecological factors (relative abundance, reproductive phenology) and selective factors could explain this pattern but none of them have been rigorously tested here. Still, we observed that relative abundance was often biased as we found more L. helveticus than L. vulgaris in our study area. Unbalanced abundances facilitate the formation of heterospecific pairs and favour asymmetrical heterospecific mating rates (Arnold et al., 1993; Randler, 2002). This factor may have a major effect on the dynamics of the hybrid zone. We also observed a genotype–phenotype relationship in L. vulgaris females but not in L. helveticus females or males of both species. The rationale for this pattern is not straightforward as it could result from an actual asymmetric relationship if phenotypes of both species responded differentially to hybridization, or simply reflects a higher frequency of L. vulgaris-like hybrids. In any case, the current situation suggests that hybrid L. vulgaris-like females would more frequently backcross with L. vulgaris males than with L. helveticus males and therefore contribute to maintain an asymmetrical hybridization pattern.

The negative relationships between disequilibria deviations from their maximum values (κ) and sex-ratio for each species indicates that in sites where hybridization occurred, sex-ratio was significantly female-biased. Even if we randomized temporal sampling across the different sites, we cannot discard the possibility that temporal variation inside the breeding season biased sex-ratio estimates (Chadwick et al., 2006). However, the positive correlation between genotypic and phenotypic scores found in L. vulgaris females and not in males is also suggestive of selection against hybrid males. Haldane's rule could apply here as males are the heterogametic sex in Lissotriton (Hillis and Green, 1990). Sex-specific postmating barriers may be associated with a decrease in viability and in fitness of hybrid males. However this remains to be rigorously tested. Another way to interpret the negative relationship between sex-ratio and introgression level in populations is that the more biased the sex-ratio is, the more mismating may occur. According to Randler (2002), scarcity of conspecifics facilitates hybridization in general, no matter which sex is the rare one. Operational sex-ratio could also affect hybridization; mismating may be higher early in the season as female newts are less selective in early breeding (Gabor and Halliday, 1997).

Extent of introgression

Overall, about 1.7% of adults had probabilities of admixture with each species. Hybrid frequency fluctuated around this value in several areas of the sympatric zone. Sampling effort differed between those areas but is unlikely to explain such low values. Thus, hybridization appears as a phenomenon occurring over a large proportion of species’ breeding ranges. Tests for multilocus Hardy–Weinberg and linkage equilibrium provided additional evidence for the existence of mixing between the two taxa. We found significant deviations from theoretical maximum values in about 70% of syntopic sites. According to our data, introgression was thus common in syntopy but did not leak out into allotopy. The result is striking as allotopic and syntopic sites in the Loire river study area alternate along the valley on a short geographical scale (Johanet et al., 2009b). We may not have had enough power to accurately detect all hybrids as suggested by NEWHYBRIDS and STRUCTURE output. Method for estimating disequilibria is also awkwardly sensitive to allele fixation as only one fixed allele impairs computation. Thus, we may have underestimated actual hybrid frequency and introgression. Nevertheless, both approaches consistently indicate that introgression is present in sites where the two species physically co-occur. We cannot discard the occurrence of introgression in allotopy but its level is certainly lower than in syntopy. Overall, our results are consistent with the maintenance of species integrity, as observed on phenotypic traits, in spite of recurrent interspecific gene flow.

Introgression dynamics in broad hybrid zones

Our study showed that a moderate level of hybridization does not compromise genetic integrity of species with broad geographic overlap that are regularly found in syntopy. The contact zone between the two newts is old (that is, after the last glaciation, Spurway and Callan, 1960) and yet both taxa remain well genetically differentiated in the large fraction of the contact zone that we sampled. The apparent restriction of hybrids to syntopic sites raises questions about the factors that impede the diffusion of alleles to allotopy and more largely about the dynamics of introgression in broad hybrid zones. Owing to the age of these taxa, it is possible that most neutral or advantageous alleles have already spread into the other species distribution and that markers currently showing some degree of diagnosticity are subject to selection, directly or via strong linkage to another gene.

However, one has to consider introgression in the general context of intraspecific gene flow for each species. In L. helveticus, the number of detected hybrids is small relative to the total population size and allotopic sites are more abundant than syntopic ones. Thus, genetic drift may wipe out introgressed alleles from allotopic sites. A recent study by Petit and Excoffier (2009) proposes that introgression is impaired when intraspecific gene flow is high. Accordingly, net gene flow from allopatric or allotopic areas reduces the frequency of alien alleles that are then more easily lost by chance. Hybrid frequency is not particularly small in the L. helveticusL. vulgaris system. Still, hybridization rate does not seem high enough for introgression to spread. Interestingly, L. helveticus is quite ubiquitous and relatively abundant in our study site. It tends to occupy a wider niche and more breeding sites than L. vulgaris. In such a context, the frequency of hybrid effective migrants needs to be high enough to counter the homogenizing effect of intraspecific gene flow.

The maintenance of species integrity between broadly sympatric species is usually the result of strong post- and pre-zygotic barriers so that the few hybrids produced are considered as evolutionary dead-ends. However, other processes may prevent introgression without the elevation of particularly strong reproductive barriers. The relative abundance of species and the proportion of syntopic sites over the total number of sites (allotopic and syntopic) are two major factors that may strongly condition the demographic component of broad hybrid zones, and set the upper limit to introgression dynamics. The spatial arrangement of allotopic and syntopic sites will determine net relative intra- and inter-specific gene flow and for this reason should also be considered. The strength of the effects generated by these different factors may create demographic barriers that impair the diffusion of alien genes outside syntopic sites. Their efficiency is probably reinforced for aquatic species that breed in discrete habitats such as ponds or lakes. Modelling would likely shed light on the process of genome interaction between taxa hybridizing in broad hybrid zones. The emphasis should be put on the effects and interactions of drift, relative abundance and relative occurrence on introgression dynamics.