Introduction

The theory that new species may evolve despite gene flow has recently gained theoretical support (Schluter, 1998; Tregenza and Bridle, 1998; Dieckmann and Doebeli, 1999; Kondrashov and Kondrashov, 1999; Kirkpatrick and Ravigné, 2002) and has been accompanied by an increasing number of examples of sympatric host races (eg Emelianov et al, 1995; Via, 1999; Groman and Pellmyr, 2000). The influence of ecological factors in speciation has also recently been emphasized (Schluter, 1998; Schilthuizen, 2000; Via, 2001). For example, the selective use of diverse resources may lead to the evolution of ecological specialization and adaptation (Berembaum, 1996; Kawecki, 1997). This has been demonstrated in herbivorous insects feeding on a wide range of host plants: colonization of a new host may induce the selection of adaptive characters and population genetic differentiation (Rice, 1987; Diehl and Bush, 1989). Phenological, phytochemical and morphological differences between host plants may promote genetic isolation following host shifts – a first step toward speciation (Bush, 1994).

Here, we show that the European corn borer (ECB, Ostrinia nubilalis Hübner) may provide an interesting example of genetic divergence of sympatric host-plant populations. ECB is native to Europe, North and West Africa (Mutuura and Munroe, 1970) and was introduced into North America early in the 20th century (Hudon et al, 1989). Owing to its economic importance — ECB is the main pest of maize, causing estimated yield losses of $1 billion per year in the US (Ostlie et al, 1997) — this species has been extensively studied over the last century (review in Hudon et al, 1989).

A few studies have focused on population genetics of the ECB (Harrison and Vawter, 1977; Cardé et al, 1978; Willet and Harrison, 1999; Bourguet et al, 2000a,2000b). However, habitat heterogeneity has been little studied. This is unfortunate because a wide variety of host plants are suitable for ECB oviposition and larval development, including many crops such as snap beans, peppers, potatoes, hops and maize (Caffrey and Worthley, 1927). In a previous study, we investigated the genetic variability of samples collected on mugwort (Artemisia vulgaris L.), hop (Humulus lupulus L.) and maize (Zea mays L.) over a restricted area in northern France (Bourguet et al, 2000b). We found that these populations did not constitute a single panmictic unit, those collected on maize being genetically different from those collected on mugwort and hop. However, it was unclear whether this genetic differentiation extended over a broader geographical scale.

In this study, we investigated further the genetic variability of ECB populations collected on mugwort and maize. We began by determining the geographical distribution of ECB on mugwort in France. We then assessed and compared genetic differentiation within and between samples of O. nubilalis collected on the two host plants throughout this area. Finally, we used a combination of mitochondrial DNA (mtDNA) sequence analysis and allozyme electrophoresis to assess the effects of host association on the genetic structure of populations and to investigate the origin of host association.

Materials and methods

Geographical distribution

The geographical distribution of the ECB, Ostrinia nubilalis, on mugwort Artemisia vulgaris was established by investigating 74 sites throughout France (Figure 1a). At each site, we screened 100–500 stalks of mugwort for the presence of fifth instar larvae.

Figure 1
figure 1

Geographical locations: (a) mugwort sites investigated for the presence of the European corn borer; (b) sampling sites used for the analysis.

Enzyme electrophoresis

Sampling sites

Samples were taken from mugwort growing at 16 sites spread over the geographical distribution (Figure 1b). Individuals were collected as larvae diapausing on plants, their sex was determined and they were directly frozen at −80°C.

Data from former samplings on maize were used to evaluate the impact of host plant on the genetic structure of the population. Nine sites (Ger, Lor, Mes, Pic, Run and Wol, described in Bourguet et al (2000a), and Ave, Sol and Mar, described in Bourguet et al (2000b)) were chosen because their geographical distribution overlapped with the 16 mugwort sampling sites (Figure 1b).

Electrophoresis

The head was removed from each larva for further analysis and the rest of the body was homogenized in 150–200 μl of Tris-EDTA (pH 6.8). The homogenates were subjected to horizontal starch gel electrophoresis in Tris-borate-EDTA (pH 8.6) buffer systems (Pasteur et al, 1987). Six polymorphic enzymes allowing unequivocal genetic interpretation were revealed as described by Bourguet et al (2000a). These enzyme systems were: phosphoglucomutase (PGM, EC8.4.2.2), mannose-6-phosphate isomerase (MPI, EC5.3.1.8), hydroxybutyrate dehydrogenase (HBDH, EC1.1.1.30), glucose-phosphate isomerase (GPI, EC 5.3.1.9), aspartate aminotransferase (AAT, EC2.6.1.1) and triose phosphate isomerase (TPI, EC5.3.1.1). Tpi is located on the Z sex chromosome (Glover et al, 1990). Females are heterogametic (ZW) whereas males are homogametic (ZZ). Thus, at this locus, females are hemiploid whereas males are diploid.

Data analysis

We computed sample allelic frequencies, the mean number of alleles, the observed and expected heterozygosity and f^-values (ie Fis estimates according to Weir and Cockerham (1984)) for the 16 samples collected on mugwort using Fstat 2.8 software (Goudet, 1999). We tested for deviation from Hardy–Weinberg expectations at each locus and calculated genotypic linkage disequilibrium between loci within each sample, with GENEPOP 3.2a software (Raymond and Rousset, 1995). As Tpi is a sex-linked locus, observed heterozygosity and θ̂-values were estimated and tests for deviations from Hardy–Weinberg expectations were carried out for males only.

These results were compared with those obtained for nine samples collected on maize by Bourguet et al, 2000a,2000b. The genetic structure between the 25 samples (16 on mugwort and nine on maize) and between samples collected on each host group (a host group includes all samples collected on a given host plant over the whole area) was analyzed by testing for allelic differentiation using exact tests and by calculating the θ̂ estimator of Fst as described by Weir and Cockerham (1984), using GENEPOP 3.2a (Raymond and Rousset, 1995). In addition, we used TFPGA 1.3 software (Miller, 1997) to carry out hierarchical analyses of population structure to assess the effect of the host plant on popu-lation structure with respect to total genetic variation.

Isolation by distance patterns (Wright, 1943) was also investigated in each case by assessing the independence of geographic and genetic distances ((θ̂/(1−θ̂)) and ln(geographical distance)) as described by Rousset (1997). The null hypothesis that geographic and genetic distances were independent was tested against an alternative hypothesis of a positive correlation, as expected under isolation by distance. We used Spearman's rank correlation coefficient as the test statistic. The calculated correlation coefficient was compared with the distribution of correlation coefficients obtained from Mantel-like permutations as implemented in GENEPOP 3.2a (Raymond and Rousset, 1995).

mtDNA variations and haplotypes

mtDNA sequence variations

Total DNA was extracted from the heads of fifth instar larvae according to the protocol described by Estoup et al (1996). We amplified a 1882 bp mtDNA region starting (position 1) at nucleotide position 267 of the cytochrome oxidase subunit I (COI) (Caterino et al, 2001) and ending (position 1882) at position 531 of the cytochrome oxidase subunit II (COII) (Kim et al, 1999), including the entire tRNAleu. This mtDNA region was amplified for 15 individuals from three geographically separated populations (populations Car, Wol described in Bourguet et al (2000a) and population Chti described in Bourguet et al (2000b)). We identified 15 variable positions (details not shown), four of which affected restriction sites. The corresponding restriction enzymes (Sau3A, Ssp1, Vsp1 and Alu1) were used for screening and comparing mtDNA haplotypes in populations collected on the two host plants.

mtDNA haplotypes

We checked for the presence of restriction sites from position 142 to 431 (region I) for Vsp1, from position 452 to 1882 (region II) for Sau3A and Alu1, and from position 1096 to 1669 (region III) for Ssp1.

For the three regions, the PCR mixtures contained 1×PCR buffer (Eurogentec, Seraing, Belgium), 1.25 mM MgCl2, 0.2 μg/μl−1 BSA, 0.2 mM dTNPs, 1.2 unit Taq polymerase and 1 μM each primer (region I, forward: CTAATATT-CTCATGGAGGT, and reverse: TAAAATAGGATCTCCC-CCTC; region II, forward: TTTTGGACATCCAGAAGTTT, and reverse: CGACCTGGATTAGCATCTAC; for region III, forward: ATACCTCGACGTTATTCAGA, and reverse: TCAATATCATTGATGACCAAT). The final reaction volume was 30 μl including 20 ng of DNA template. Cycling conditions were as follows: 94°C for 5 min, followed by 35 cycles of 94°C for 1 min, annealing temperature for 1 min and 72°C (1 min for regions I and III and 2 min for region II), and then a final 10 min at 72°C. The annealing temperatures were 60, 57 and 51°C for regions I, II and III, respectively.

mtDNA haplotypes and their frequencies were investigated for 10 populations, five from mugwort sites (Kra, Sei, Tou, Ven described in Table 1 and Lef described in Bourguet et al, 2000b) and five from maize sites (Fon, Ger, Lor, Sol, Wol described in Bourguet et al, 2000a).

Table 1 Characteristics of samples of O. nubilalis collected on mugwort, A. vulgaris: location, date of sampling and number (N) of ECBs analyzed

Data analysis

Fst values were generated from mtDNA haplotype variation. The genetic structure of popu-lations was investigated by an analysis of molecular variance (AMOVA) approach (Excoffier et al, 1992), using ARLEQUIN 2.0 (Schneider et al, 2000). AMOVA is essentially similar to other approaches based on analyses of variance of gene frequencies (Weir and Cockerham, 1984), but it takes into account the number of mutations between molecular haplotypes. A hierarchical analysis of variance allowed us to partition total variance into covariance components and therefore to test for population differentiation among host plants. The significance of the fixation indices was assessed with a nonparametric permutation approach described by Excoffier et al (1992) and included in ARLEQUIN 2.0 (Schneider et al, 2000).

Results

Geographical distribution

Figure 1a indicates the presence or absence of ECB on mugwort at the various sites. O. nubilalis larvae were found at 28 sites (of 74) in the northern regions of France, with population densities of 0.1 to 1 larva per mugwort stalk. They were rare in western and eastern parts of France and, when present, population density was generally low (<0.01 larvae per stalk).

Allozyme analysis

For the 16 samples collected on mugwort, the Hbdh locus displayed no polymorphism and was therefore not included in the analysis. The allelic frequencies at the five other loci (Tpi, Gpi, Aat, Mpi and Pgm) for each population are given in the Appendix. We found no significant genotypic association between any pair of loci over the 16 sample sites.

Within samples, the mean number of alleles was 2.20–3.40 for the five loci tested. The observed and expected heterozygosities were almost identical and were 0.20–0.41 (observed) and 0.18–0.41 (expected). These values did not differ significantly from those for samples collected on maize, for which the mean number of alleles and the observed heterozygosities were from 2.20 to 3.20 and from 0.23 to 0.32, respectively (Bourguet et al, 2000a,2000b).

Separately analyzed, the populations showed a weak deficit of heterozygotes (14 of 16 populations), but these deviations from Hardy–Weinberg expectations for the five loci were not significant if multiple testing was taken into account (Bonferroni test, Holm, 1979). The f^-value estimate calculated over the sampling sites indicated a significant deficit of heterozygotes (f^ total=0.065, P=0.0044). This deficit was significant for two of the five loci investigated (Tpi, P=0.0045 and Mpi, P=0.0437).

Tests of allelic differentiation between pairs of samples for each locus (details not shown) identified four samples (Sei, Kra, Mor and Sal) with genetic structures different from the others. Finally, there was significant differentiation between the 16 samples collected on mugwort (P<10−5) but the mean θ̂-value was low (θ̂-within mugwort=0.036; see Table 2). However, the level of differentiation was higher than that for the nine samples collected on maize in the same geographical area (θ̂-within maize=0.009 and P<10−5; see Table 2).

Table 2 Hierarchical Fst -values based on genetic variations at allozyme loci and mtDNA

The overall differentiation between the 25 sites (θ̂-overall) was 0.048, a value significantly different from 0 ( Table 2). Hierarchical analysis indicated that the distribution of genetic variability was significantly influenced by the host plant (θ̂-host=0.044; P<10−5; see Table 2).

Genetic differentiation between the two host-plant groups can be further investigated by comparing within- and between-group isolation by distance patterns (Rousset, 1999a). Mantel-like tests were carried out to assess the independence of geographic and genetic distances, for all loci and then again for all loci except Mpi as this locus showed the highest level of differentiation between the two host plant groups (Figure 2). The isolation by distance pattern was not significant for the maize group (slope=0.0020, P=0.142 when considering all loci; slope=0.0021, P=0.237 if Mpi was excluded from the analysis). Conversely, a significant isolation by distance pattern was found for the mugwort group (slope=0.0163, P=0.004 when considering all loci; slope=0.0212, P=0.001 if Mpi was excluded from the analysis). Outliers were not responsible for this pattern because the regressions remained significant if extreme values were excluded. When considering all loci, between-group differentiation was not correlated with distance (slope=0.0029, P=0.525) and was higher than within-group differentiation at all distances. If Mpi was excluded from the analysis, between-group differentiation remained independent of geographical distance (slope=0.0043, P=0.170) and was still higher than within-group differentiation at short distances (see Figure 2).

Figure 2
figure 2

Regressions of θ̂/(1−θ̂) against ln (geographical distances) (km) for populations collected on Artemisia vulgaris (within mugwort), on Zea mays (within maize) and between populations collected on the two host plants (between-group). Regressions are given for all loci and for all loci except the Mpi locus.

Mitochondrial analysis

We found 14 restriction sites but only four of them (at nucleotide positions 258, 566, 1114 and 1139) were informative. We were able to define six mtDNA haplotypes over the 254 individuals collected at the 10 sampling sites. One of these haplotypes was predominant at all sites and fixed at four (details not shown). The number of haplotypes per site was small and similar for populations collected on mugwort (mean±SE=1.60±0.55) and maize (mean±SE=2.00±1.00). Maximum likelihood analysis of all the unique haplotypes for each population failed to reveal genetic structure corresponding to host affiliation. AMOVA showed that if genetic distances (calculated from pairwise differences) between haplotypes were combined with haplotype frequency, the variation was either distributed throughout all populations or corresponded to within host-plant groups rather than between host-plant groups ( Table 2).

Discussion

Geographical distribution

In Europe, the distribution, host-plant range, parasites and number of generations of Ostrinia nubilalis were studied 80 years ago by Thompson and Parker during a remarkable survey from 1919 to 1926 (Thompson and Parker, 1928). The ECB was found on several host plants but its favored hosts were undoubtedly maize (Z. mays) and mugwort (A. vulgaris). Although widely distributed in Europe, mugwort was not infested in southern France and northern Italy even if the ECB was common on corn; ECB colonization of this host plant was observed mostly in northern France, northern Germany and Belgium.

The results of our year 2000 sampling in France are consistent with a northern European location for ECB on mugwort. All sites located in northern France were infested with ECB. Conversely, mugwort was rarely infested with ECB in western and eastern France and none of the four sites in the south of France was infested. Our results, together with those of Thompson and Parker (1928), suggest that the geographical distribution of ECB on mugwort has been stable for a number of years. The absence of ECB on mugwort in southern Europe is not because of a lower density of the host plant because A. vulgaris is widespread throughout Europe (Thompson and Parker, 1928). Instead, it may be a consequence of environmental conditions having an effect either on the plant or directly on the ECB larvae (hygrometry and temperature).

The geographical distribution of ECB on maize was very different. Maize is widely cultivated in France and infestations, although unequal, have been recorded in all regions except western France. The highest infestations are generally recorded in the southwest, central and eastern regions of France. Conversely, unpredictable but low levels of damage to corn are often reported in northern regions, where mugwort is commonly, if not systematically, infested with ECB.

Population genetics structure and host-plant differentiation

The populations feeding on the two host plants had markedly different genetic structures. Fst analysis of allozyme frequencies showed a higher level of differentiation within the mugwort group (θ̂-within mugwort=0.036) than within the maize group (θ̂-within maize=0.009). In the mugwort group, the mtDNA estimate of population structure (Fst=0.038) was congruent with the nuclear estimate. Conversely, in the maize group, the mtDNA estimate of population structure (Fst=0.039) was greater than the nuclear estimate, possibly reflecting the greater tendency toward drift of mitochondrial genomes as a result of their maternal mode of inheritance and haploid nature.

The genetic differentiation between the two host groups (θ̂-host=0.044), calculated from allozyme data, indicated that the maize and mugwort groups were different ( Table 2). Thus, the differentiation reported by Bourguet et al (2000b) on a regional scale seems to be a general feature over a broader geographical range.

Host-associated genetic differentiation has already been documented in other moth families such as the Noctuidae (Pashley, 1986), Tortricidae (Emelianov et al, 1995) and Prodoxidae (Groman and Pellmyr, 2000). Populations of O. nubilalis differ in terms of pheromone blends (E vs Z) and voltinism (uni- vs bivoltine) (Hudon et al, 1989). Our results show that populations may also be distinguished on the basis of the type of host plant colonized (maize vs mugwort). Some relationships between these biological characteristics have already been observed. In New York State, three races have been described: bivoltine Z (BZ), univoltine Z (UZ) and bivoltine E (BE) (Roelofs et al, 1985). The BE race was found to injure more apple shoots and snap beans than the BZ and UZ races (Straub et al, 1986; Eckenrode and Webb, 1989). McLeod (1981) also noted differential infestations of bivoltine and univoltine populations on sweet pepper, Capsicum frutescens L. Unfortunately the diapausing ECB larvae sampled for the present study were frozen without determining the voltinism and pheromonal status of the populations collected on the two host plants. The precise relationships between biological parameters and host-plant colonization of the populations located in northern France are currently under investigation.

Host-plant-dependent selection

To date, allozyme studies of ECB have revealed that only the Tpi sex-linked locus displays clear differences in allele frequency between the BE, UZ and BZ races (Glover et al, 1991). The distribution of alleles at the Tpi locus is probably moulded by natural selection because this locus is closely linked to genes involved in the male behavioral response to female pheromones and to genes responsible for differences in voltinism (Glover et al, 1990, 1992). The Tpi locus showed no genetic differentiation between host groups but displayed a highly significant deficit in heterozygotes over the 16 sampling sites. This result may be the consequence of (i) a selection pressure acting on this locus or on linked loci, (ii) a Wahlund effect at the Tpi locus notably if E and Z populations of ECB were segregating on mugwort and (iii) the presence of null alleles.

The Mpi locus, which is located in a different region of the genome, also displayed a significant deficit in heterozygotes over the 16 samples collected on mugwort. This locus also showed greater between-group (θ̂-host=0.122) than within-group (θ̂-within mugwort=0.040 and θ̂-within maize=0.013) differentiation. This greater differentiation at the Mpi locus has already been shown for northern France (θ̂-host=0.114 in Bourguet et al, 2000b). Such a consistent allele frequency difference over space and time suggests host-dependent selection acting on Mpi or on linked loci.

Other examples of two populations of the same species inhabiting the same geographical area and yet displaying differentiation include the pea aphid Acyrthosiphon pisum (Via, 1999) and the fruit fly Rhagoletis pomonella (Walsh) (Feder et al, 1988). In this latter species, there is selective maintenance of sympatric host-plant races differing at six allozyme loci mapping to three different regions of the genome (Feder et al, 1997; Filchak et al, 2000).

Gene flow within- and between-host groups

Analysis of the spatial patterns of within- and between-group genetic differentiation can provide an indication as to the extent of gene flow within and between the two host groups. For each group, the slope of the regression between genetic and geographic distances can be used to estimate 1/(4De.πσ2), where De is the effective density of individuals (see Rousset, 1999b) and σ2 is the one-dimensional mean-squared parent offspring distance (Rousset, 1997). In mugwort-feeding populations, the slope of the regression between geographic and genetic distances was 0.0163 (significantly different from 0) giving an estimate of De.σ2 5 moths. Assuming a density of 5 individuals/km2 – which is certainly a conservative assumption – σ equals 1 km/gen1/2. In maize-feeding populations, the isolation by distance was not significant, with a slope of 0.0020. The estimate of De.σ2 is 40 moths. The difference between the two host-plant groups may be because of a difference in De rather than a difference in migration. Indeed, because of the intensive cultivation of maize, ECB density is probably much higher on maize than on mugwort.

When isolation by distance analysis was performed for all pairs of populations, one from each host, the slope was not significantly different from zero; between-group differentiation, which was slightly higher than within-group differentiations, remained constant at all geographical scales (Figure 2). This was not only the consequence of the host-plant-associated differentiation at the Mpi locus since the same pattern was found when this locus was excluded from the analysis (Figure 2). A similar result was reported by Hardy and Vekemans (2001), who studied allozyme variation in diploid and tetraploid knapweeds, Centaurea jacea L. This pattern is not expected if gene flow was homogenizing allele frequencies between neighboring populations on different hosts. We therefore conclude that there must be barriers to gene exchange between populations feeding on the two host plants. Although the actual level of gene flow cannot be evaluated, equation (A5) and the numerical examples in Figure 3 of Rousset (1999a) suggest that a 1% introgression rate per generation between the host races would be sufficient to erode any host-associated differentiation between pairs of populations separated by more than a few times σ (a few kilometers here). Gene flow between the two host groups is not necessarily the same across the genome. Indeed, recently diverged taxa that remain incompletely isolated reproductively may be permeable to introgression over a large portion of their genome (Ting et al, 2000). Hence, although barriers to gene flow may be quite intense for some regions (eg the region surrounding Mpi), substantial gene flow can still occur in other portion of the genome.

Implications for pest management strategy

The ECB is the main pest of maize in Europe and North America (Krattiger, 1997). To control this species, transgenic maize varieties have been developed that produce toxins from the bacterium Bacillus thuringiensis (referred to as Bt maize). The increasing use of these Bt maize varieties generates pressure for the selection of resistance mechanisms in natural populations of ECB (Gould, 1998; Wolfenbarger and Phifer, 2000). This has led to the development of management strategies to delay the appearance of resistance to Bt toxins (eg Alstad and Andow, 1995, but see Ives, 1996; Peck et al, 1999). All these strategies are based on the use of refuges – non-Bt maize planted and maintained in close proximity to Bt maize (Gould, 1998). The principle underlying this system of resistance management is that any resistant insects emerging from Bt maize are more likely to mate with one of the much larger number of susceptible adult pest insects emerging from the refuges than with each other, thereby decreasing the selection of Bt resistance alleles. Gould (1998) suggested that, for some generalist pest species, wild hosts and other crops could serve as part of a larger refuge. However, to be considered as complementary or alternative refuges, these plants must host ECB populations that will randomly mate with those emerging from maize. The geographical distribution and genetic differentiation of ECB growing on mugwort show that adults emerging from this weed are unlikely to delay the evolution of resistance to B. thuringiensis toxins. This does not rule out the possibility that other weeds or crops may be useful refuges. However, in the absence of such data, refuges of non-transgenic maize should be planted, to extend the durability of Bt maize.