Introduction

Reproductive isolation between closely related taxa may occur via multiple isolating barriers that occur either before or after mating1. In allopatric and peripatric speciation, both prezygotic and postzygotic reproductive isolation often evolves following geographic isolation due to divergent selection, genetic drift or combined effect2. For sympatric populations, one of the key mechanisms ensuring reproductive isolation is assortative mating3 which may occur due to ecological, temporal or behavioral isolation4.

Ecological speciation has been posited as an important mechanism creating new species via adaptation to different environments5,6,7. Small host-specific phytophagous insects were suggested to accounting for 25–40% of all animal species8. Some of this diversity may be attributed to differential host plant use that can lead to the formation of host associated forms and, ultimately, speciation. In many herbivore species that both feed and mate on the host plant9,10, reproductively active individuals from populations associated with different host plants species are more likely to encounter potential mates using the same host. In this way, the tendency of adult insects to remain on or return to the larval host plant may result in patterns of host-associated assortative mating that restrict gene flow between populations10. In some cases, distinct host specificity alone was suggested to act as a nearly complete pre-mating isolating barrier among insect populations, resulting in reproductive isolation in the absence of behavioral isolation and any post-mating barriers11,12,13. For these herbivorous insects, pre-mating isolation may have evolved as a by-product of adaptation to different ecological environments14. However, these environment dependent barriers may not be effective over evolutionary time15. In most cases, multiple barriers to reproductive isolation are expected to be needed for speciation16. However, the pre-mating isolation barriers between sympatric sibling specialists were only checked in limited systems11,12,13.

Pre-mating isolation due to host plant use may be occurring in the flea beetle genus Altica, a system that is becoming a model of host plant specialization and ecological speciation17,18,19,20. Many beetles in Altica are specialists, which feed and lay eggs on a limited range of plants. Among them, Altica fragariae Nakane (hereafter ‘AF’) is an oligophagous species with five recorded host plants belonging to the family Rosaceae and Duchesnea indica (Andrews) Focke is the primary host plant in the field. Altica viridicyanea (Baly) (hereafter ‘AV’) is a monophagous species that feeds on Geranium nepalens (Sweet) (Geraniaceae)18,19,20. The beetles do not feed and oviposit on each other's host, even under no choice conditions and the larvae do not develop on alternative host plants, indicating extreme host plant specificity18,19.

These two species occur sympatrically both locally and across broad geographic scales. Previous studies have shown that post-mating isolation between them is incomplete. Under laboratory conditions, the hatch rate of hybrid eggs is as high as 65.8% when AV is the mother (although it is lower than the hatch rates in intraspecific crosses: 89.8% in AF, 89.2% in AV), whereas hatch rate is only 1.19% when AF is the mother. The backcross between AF females and F1 (AV♀ × AF♂) males is inviable, whereas all other three backcrosses (F1♀ × AF♂, AV♀ × F1♂, F1♀ × AV♂) and F2 generation are viable. Furthermore, these viable offspring can develop on one or both host plants with high survival rates18,19,20. Interestingly, hybrid F1 and backcrosses feed on both plants under choice conditions18. Feeding choice tests of the sympatric field-sampled beetles provided no evidence of heterospecific hybridization (at least no F1 hybridization) (Xue HJ, unpublished data). Accordingly we speculate effective pre-mating reproductive isolation has evolved between them. These beetles' life histories and the circadian rhythms of sexual behavior are very similar and their emergence phenology overlaps widely (Xue HJ, personal observation); thus, temporal isolation is likely not an important isolating mechanism. Therefore, habitat isolation, behavioral isolation or their joint action should play a crucial role in preventing interspecific gene flow. In the Beijing area, their host plants often grow side by side and sympatric beetle populations are common, implying that interspecific hybridization may occur in nature. Considering the host-dependent assortative mating may not be perfect, we predicted behavioral isolation evolved besides habitat barriers. To test whether there is an effective pre-mating isolation between these two host specific lineages, we assessed the pattern of divergence and the level of gene flow between AF and AV using a combination of phylogenetic analyses and coalescent simulations. To test whether host-independent behavioral isolation has evolved between them, three kinds of mating tests that eliminated host plant cues were performed.

Results

Population structure

Summary statistics for molecular variation indicated much higher levels of genetic diversity in AV versus AF for mtDNA and EF1a and no variation in ITS2 within both species (Table 1). Uncorrected divergence based on mtDNA between species clusters was 3.01% while within species divergence was only 0.27% (AF) and 0.67% (AV), respectively. Similarly, divergence in EF1α was 2.02% between species, only 0.23% within AV and 0 within AF. Divergence in ITS2 ranged from 0.59% between species to zero divergence within AF and AV. AMOVA analyses showed that most of the genetic variance was attributable to the division between species (83.87% for COI-COII; 97.31% for EF1α and 100% for ITS2) and only a small proportion of the genetic variance was within species (Table 2). The analysis also showed that the genetic variance within populations contributed much more than that of among populations within species (Table 2). Significant Φst values (0.85 for COI-COII, 0.97 for EF1α and 1.00 for ITS2) revealed restricted gene flow between the two species.

Table 1 Summary statistics for molecular variation in COI, EF1a and ITS2
Table 2 Analysis of molecular variance for COI-COII, EF1a and ITS2 from two Altica species, AF and AV

The parsimony network for mitochondrial genes (COI-COII) showed of 45 mtDNA haplotypes in 144 individuals and none these were shared by both species (Fig. 1a). The number of haplotypes within AV was much greater than that of AF (36:9), although the number of sampled individuals of AV and AF were similar (AV:AF = 76:68). In contrast to the mtDNA, we observed only nine haplotypes in EF1α. One of these haplotypes was present in all of the individuals of AF, whereas AV harbored the other eight haplotypes (Fig. 1b). Only two ITS2 haplotypes were recovered: all of the individuals of AV shared one and all of the individuals of AF shared the other (Fig. 1c).

Figure 1
figure 1

Maximum parsimony networks for three loci (COI-COII, EF1α and ITS2).

Altica fragariae (AF) alleles are illustrated in grey and A. viridicyanea (AV) alleles are illustrated in white. The area of each circle is proportional to the number of individuals with that haplotype. Straight lines and small black dots reflect mutations and median vectors, respectively. (a) based on COI-COII; (b) based on EF1α; (c) based on ITS2.

Gene tree estimation

The phylogenetic analysis based on the COI-COII dataset showed that all the individuals of AF formed a well-supported clade that included five individuals of AV (Fig. 2a). The same tree topology was suggested with both phylogenetic reconstruction methods inferred from EF1α and ITS2. AF and AV were reciprocally monophyletic, strongly supported by high bootstrap or posterior probability values in the EF1α dataset (Fig. 2b) and weakly supported in the ITS2 dataset (Fig. 2c).

Figure 2
figure 2

Phylogenetic relationships of two Altica species.

The numbers above the branches indicate Bayesian posterior probabilities and maximum likelihood bootstrap values. (a) Bayesian consensus tree inferred based on COI-COII dataset. The left oval indicates individual number 0957 and the right oval shows the individuals 1032, 1052, 1055 and 1057. (b) Bayesian consensus trees inferred based on EF1α dataset; (c) Bayesian consensus trees inferred based on ITS2 dataset. The outgroup AK and the terminal labels were removed. The nodes of major branches with ≥50% bootstrap value or posterior probability are labeled. The symbol * is used to represent a node in conflict between the different analyses with less than 50% posterior probability or bootstrap support value.

Coalescent simulations

Four-gametic tests suggested that there is no evidence of recombination for both EF1α and ITS2. The independent runs resulted in very similar parameter estimates. Based on mitochondrial genes, significant unidirectional migration was observed from AV to AF (LLR test ≥ 12.217, p < 0.001), but the results did not detect significant migration from AF to AV (LLR test = 0, p ≥ 0.05) (Fig. 3a,b). In addition, the coalescent analysis based on the combined dataset suggested a similar pattern to that obtained with the mitochondrial dataset (AV to AF: LLR test ≥ 9.778, p < 0.01; AF to AV: LLR test = 0.651, p ≥ 0.05) (Fig. 3c,d). However, individual analysis of the nuclear genes could not reject the zero-migration-rate model (LLR test = 0, p ≥ 0.05) (Fig. 3e,f).

Figure 3
figure 3

Migration rates between AF and AV estimates from the IMa2 analysis using three datasets (mitochondrial, combined and nuclear data).

(a) from AF to AV, based on mitochondrial data, (b) from AV to AF, based on mitochondrial data, (c) from AF to AV, based on combined data, (d) from AV to AF, based on combined data, (e) from AF to AV, based on nuclear data, (f) from AV to AF, based on nuclear data. Note difference in scale in the Y-axis.

Strength of behavioral isolation

In the no-choice tests, we primarily observed intraspecific matings. We observed 64 intraspecific matings (39 AF♀-AF♂ and 25 AV♀-AV♂) as compared to 9 interspecific matings (all of them are AV♀ × AF♂) (Table 3). The overall sexual isolation value (Ipsi) was 0.7863 ± 0.0579 (p < 0.001). In male-choice tests, we observed only one interspecific mating (AV♀ × AF♂) as compared to 88 intraspecific mating events (42 AF-AF and 46 AV-AV) (Table 4) (Ipsi = 0.9670 ± 0.0269, p < 0.001). In the multi-choice mating tests, 71 mating events were observed during the trials, 69 of them were intraspecific (42 AF♀-AF♂ and 27 AV♀-AV♂) and only 2 were interspecific (2 AV♀ × AF♂) (Table 5) (Ipsi = 0.9330 ± 0.0407, p < 0.001).

Table 3 Results of no-choice mating test. One male and one female were placed in a 9-cm Petri dish lined with moistened filter papers and given two hours to mate. Four comparisons and 60 replicates for each combination were conducted
Table 4 Results of male-choice mating test. One male and two females were placed in a 9-cm Petri dish arena lined with moistened filter papers for three hours. Two comparisons and 100 replicates for each combination were carried out
Table 5 Results of multi-choice mating test. Five pairs of AF and AV (5AF♂ + 5AF♀ + 5AV♂ + 5AV♀) were placed in a glass jar simultaneously and allowed to mate freely for three hours, 18 replicates were conducted; thus, a total of 180 mating events were possible

Discussion

In contrast to the mito-nuclear concordance observed in a previous study dealing with the same system (but with a third species and small sample sizes, n = 9 for AF and n = 5 for AV)20 and another complex of specialist Altica species21, the present study showed a small discrepancy between mitochondrial and nuclear gene trees. In the mitochondrial analyses, five AV individuals always fell in the AF clade, suggesting some gene migration. However, in the nuclear gene trees (both ITS2 and EF1α), AV and AF were reciprocally monophyletic (Fig. 2a–c), indicative of complete reproductive isolation. Similarly, IMa analysis based on nuclear data suggested there is no gene migration between the beetle species, while the results inferred from mitochondrial and combined data showed unidirectional gene flow (effective gene flow from AV to AF, 2Nm = 0.8027 and 0.09338 respectively). Therefore, phylogenetic conclusions that rely on mitochondrial or nuclear marker alone maybe misleading22,23,24,25. Thus, a combination of markers from both the nuclear and mitochondrial genome provides a more comprehensive view of evolutionary history.

It is difficult to distinguish introgression from incomplete lineage sorting in many studies26,27. It was suggested that lineage sorting should proceed faster in mitochondrial genes than in nuclear genes because of the smaller effective population size and uniparental inheritance of mitochondrial genes28,29. In the present study, summary statistics for molecular variation indicated levels of genetic diversity in mitochondrial DNA are much higher than in nuclear DNA (Table 1). The absence of shared haplotypes (Fig. 1b,c) and distinct clades (Fig. 2b,c) showed that lineage sorting is complete in the two nuclear genes, while paraphyly of AF was detected in mitochondrial gene (Fig. 1a,2a). Furthermore, considering the incomplete post-mating isolation and sympatric distribution of these two species, we presume that introgression of mtDNA is a more reasonable explanation than incomplete lineage sorting. In fact, insect species in nature are often incompletely isolated for millions of years after their formation30,31 and gene flow may persist even between non-sibling species32. Mitochondrial DNA introgression has been reported in a number of recent studies33,34,35.

The mitochondrial genome can be replaced by that of another species as a result of historical hybridization without leaving any trace in the nuclear genome. At least four mechanisms that might result in introgressive hybridization of mtDNA have been suggested: reproductive asymmetry36, differential selection37, chance fixation by genetic drift in small populations38 and paternal leakage in mtDNA39,40. In the two Altica species, significant asymmetry occurred during the pre-mating stage of isolation. Although the heterospecific mating was not detected for AV males, the males of AF occasionally mated with sibling species females under both no-choice and choice conditions in the laboratory (Table 35). Furthermore, previous cross-breeding experiments showed that hybrid F1 with AV mother and three backcrosses (F1♀ × AF♂, F1♀ × AV♂, AV♀ × F1♂) were viable, while F1 with AF mother and one backcross (AF♀ × F1♂) were inviable, suggesting asymmetric post-mating isolation between AF and AV due to cytoplasmic incompatibility19,20. These results are consistent with asymmetric mitochondrial introgression estimated by coalescent analysis. Thus, reproductive asymmetry is the most probable scenario in the present system.

The molecular results (high Φst values estimated by AMOVA and low M values estimated by IMa2) suggested very limited gene flow between these two species. Considering the incomplete post-mating reproductive isolation18,19,20, we suggest near perfect assortative mating has evolved. Differing from many studies in which allochronic factors play an important role in reproductive isolation and speciation41, allochronic isolation contribute little in the present system. As many other herbivorous specialists, Altica flea beetles exhibit high host fidelity19 and the spatial segregation of the host plant species reduces encounters between the two beetles. Therefore, one explanation for the reproductive isolation observed between these beetle species is that the preference of beetles for feeding and mating sites leads to a strong assortative meeting and then assortative mating. However, their host plants often grow side by side and sympatric beetle populations are common, implying that interspecific meeting may occur in nature. Then considerable gene flow between these two beetles should be detected if host independent behavioral isolation contribute negligibly. Both no choice and choice mating tests without host cues showed only limited heterospecific mating. The close, artificial conditions of these experiments were likely an overestimate of the probability of heterospecific mating, further indicating almost perfect behavioral isolation has evolved between these species.

We thus suggested almost perfect assortative mating between these two flea beetles resulted from a combination of reduced encounter rates due to differential host preference and strong sexual isolation. One explanation for the behavioral reproductive isolation is a by-product of adaptation to the different hosts, even in the absence of direct selection42. And another, but more reasonable explanation is, behavioral isolation resulted from host-independent sexual selection acting or indirect effects of host-associated selection on sexually selected traits. For these two species, divergent contact sex pheromones seem to play a pivotal role in mating behavior (Xue HJ, unpublished data). Previous studies have suggested that some phytophagous insects may acquire bioactive chemicals or required chemical precursors of sex pheromones from host plant material43. Therefore, sex pheromone composition may change following a host shift, subsequently leading to host plant mediated sexual isolation44. Further chemical ecological studies are needed to estimate whether and how host plants affect host-associated divergence and speciation in these closely related Altica flea beetles.

Methods

Potential interspecific gene flow estimation

Field collections and identification

Adults of A. fragariae (n = 68) and A. viridicyanea (n = 76) were sampled from five sympatric locations in the Beijing area (Table 6). These two species are too similar to be distinguished by morphological characters, especially for the females. Furthermore, just using host plant during collection as a way to identify the beetle is not safe because there may be some sporadic wandering individuals in the non-host plant and some beetles may jump to the ground under mixed plants when they were disturbed, therefore we identified individuals using a 5-day feeding-choice tests18.

Table 6 Location, sample size and sampling date of the Altica beetles used in population genetic study

DNA extraction, amplification and sequencing

Total genomic DNA was extracted from whole beetles using nondestructive DNA extraction method with the TIANamp Genomic DNA Kit (Tiangen, Shanghai, China). After extraction, beetle specimens were retained as vouchers and were deposited in the Institute of Zoology, Chinese Academy of Sciences, Beijing, China.

Three gene regions were used in the present study: a mitochondrial region spanning parts of cytochrome oxidase I and II (COI-COII, 1368 bp), an internal transcribed spacer of the nuclear ribosomal RNA cluster (ITS2, 354 bp) and a fragment of the nuclear protein-coding gene elongation factor 1-alpha (EF1α, ~1221 bp with a ~525 bp intron). PCR protocols and primers were the same as Xue et al. (2011)20. The PCR products were sent to Beijing Sunbiotech Co. Ltd. for sequencing with both forward and reverse primers. The sequence data were aligned and edited using CodonCode Aligner 3.7.1 (CodonCode, Dedham, MA, USA). Variable sites were identified using the automated mutation detection function in CodonCode Aligner, followed by manual inspection of electropherograms and additional re-sequencing of ambiguous bases. Successful PCR amplification and sequencing of target genes were confirmed by aligning the resulting sequences to other closely related species in the Chrysomelidae, verifying the correct reading frame without stop codons. All EF1α and ITS2 sequences were apparently homozygous, as judged on the basis of a lack of double peaks in chromatograms from both directions.

Population structure

After assembling aligned DNA sequence data matrices, we used DNaSP 5.145 to calculate the following summary statistics for each gene in the total sample of individuals of each species separately: number of haplotypes (h); haplotypic diversity (Hd, Nei 1987); average number of nucleotide differences (K, Tajima 1983); and nucleotide diversity (π, Nei 1987). To estimate genetic divergence within and between two species, we analyzed the genetic distances with haplotypes data using MEGA 4.046. To characterize genetic differentiation between species, among populations within species and within populations and examine how these patterns varied among loci, we also assessed molecular variance (locus-by-locus AMOVA, with 1000 permutations) and calculated Φst values between species using Arlequin 3.547. The distribution of haplotypes between these two beetles was also summarized using statistical parsimony networks inferred in TCS 1.2.148. All haplotypes sequences are deposited in GenBank under the accession numbers KJ803199-KJ803254.

Gene tree estimation

Gene trees were estimated with the following three combinations: COI-COII (along with the intervening leucine tRNA, COI + Leu + COII), EF1α (exon + intron) and ITS2. The best-fit model of nucleotide substitution for each partition or each combination was selected using the Akaike Information Criterion (AIC) in jMODELTEST 0.1.149. Altica koreana was used as the outgroup and the associated sequences are under the following accession numbers in GenBank: DQ865067 (COI-COII), JN903082 (EF1α) and JN903103 (ITS2). Maximum likelihood (ML) analyses were performed with Garli 2.050, 1000 bootstrap replications were used to obtain a set of trees and the bootstrap consensus tree was constructed with PAUP* 4.051. Bayesian analyses were carried out with the program MRBAYES version 3.152,53. The settings were two simultaneous runs (each with two Markov chains) of the Markov chain Monte Carlo (MCMC) for 6–15 × 106 generations (making sure the average standard deviation of split frequencies was less than or approaching 0.01), sampling every 100 generations. The first 25% of the generations were discarded as the burn-in. Log likelihood plots of trees from the Markov chain samples were examined in TRACER version 1.554 to determine convergence to a stable log likelihood value. Posterior probability (PP) values represent the proportion of MCMC samples that contain a particular node. The default branch length priors in MrBayes may lead to unreasonable estimates of substitution rates and the bias of PPs; thus, empirical priors for this parameter were defined based on the mean branch lengths recovered from maximum likelihood topologies55 inferred using Garli 2.0.

Coalescent simulations

We estimated gene flow between AF and AV using IMa2, which allows for analysis of divergence and gene flow between two or more populations56. Because the IM model assumes that all sequences are free from intra-locus recombination, before using IMa2, we tested nuclear genes for recombination57 using DNaSP 5.1. Preliminary simulations were run with a wide range of priors to explore the sensitivity of parameter estimates to different upper bounds. We set the maximum values for q = 1000 in all the final runs. The other two parameters (t, the splitting time parameter; and m, the migration parameter) in the analysis were set to the following values: t = 100 and m = 2 for mitochondrial DNA, t = 600 and m = 2 for nuclear DNA and t = 600 and m = 3 for the total dataset. The HKY model was chosen for each locus. We performed at least five independent runs of twenty million steps (recording every 100 steps), with a burn-in period of two million steps per run, using different random seeds with 10 or 20 Metropolis coupled chains and geometric heating schemes with low heating parameters (a = 0.96, b = 0.9). The analysis was considered to have converged upon a stationary distribution if the independent runs showed effective sample size (ESS) values above 10058. Log-likelihood-ratio (LLR) statistics were used to test whether contemporary gene migration rates were greater than zero59.

Strength of sexual isolation

As in many other beetles, both sexes of Altica species mate multiple times, the courtship behavior is simple and mating typically lasts several hours. Mating is initiated by the male by first antennating a female, then mounting the female's back and finally, attempting to insert his intromittent organ, the aedeagus. As a consequence, male mate choice is an important stage of pre-mating reproductive isolation in Altica species. For sympatric populations, choice mating experiments seem to be more relevant, however, when beetles occur at low densities, a male encountering females of closely-related species alone is also a possible scenario, thus the no-choice mating experiments also should be considered. To gain a comprehensive estimation of the strength of behavioral isolation, we used three types of mating assays that excluded host plants: no-choice, male-choice and multi-choice experiments. We wanted to use both choice and no-choice assays because it has been shown in Drosophila that the strength of behavioral isolation may depend strongly on the experimental design, especially the possibility of choice: stronger sexual isolation was suggested when flies were allowed to choose between conspecific and heterospecific mates than those in no-choice trials60.

All mating tests were carried out between 13:00 and 18:00 hours in an air-condition room at 25–27°C under natural light conditions. In all experiments, a successful mating was scored whenever pairing occurred for longer than 5 minutes.

No-choice mating tests

More than 30 over-wintered adults of A. fragariae and A. viridicyanea were collected from field populations on their host plants at Matao, Mentougou (40.14°N, 115.79°E) and Sijiashui, Mentougou (40.09°N, 115.95°E) of the Beijing area respectively and subsequently maintained in the laboratory. The insects were fed their respective host plants (D. indica fed to AF and G. nepalens fed to AV) in growth chambers at 16:8 LD and 25°C18,19,20. To eliminate the possible effects due to different physiology, virgin captive-bred offspring were used for no-choice tests. Shortly after eclosion, adults were sexed and kept separately for 10 days after eclosion to ensure sexual maturity. One male and one female were placed in a 9-cm Petri dish lined with moistened filter papers and given two hours to mate. Four pairwise comparisons (AF♀ × AF♂; AV♀ × AV♂; AF♀ × AV♂; AV♀ × AF♂) and 60 replicates for each combination were conducted. The number of successfully copulating pairs was recorded. Each individual was tested only once.

Male-choice mating tests

Wild collected adults were used for a male-choice test. In order to increase mating motivation and ensure sexual maturity, the sexes and species were kept in separate glass jars for 10 days before mating tests. Because the species are difficult to distinguish based on appearance, we marked them with different colour enamel paint on their elytra to enable subsequent identification61. A pilot experiment showed that the enamel paint does not alter mating propensity in this system (Xue HJ, unpublished data). Males were provided with a choice between a female from each species. One male and two females (AF♂ + AF♀ + AV♀; AV♂+AF♀+AV♀) were placed in a 9-cm Petri dish arena lined with moistened filter papers. The number of successfully copulating pairs was recorded over a period of three hours. Both combinations were carried out for 100 replicates, respectively.

Multi-choice mating tests

Wild collected adults were also used for multi-choice tests. Five males and five females of each species (20 individuals total) marked with different colored enamel paint on the elytra were placed in a glass jar (11.5 cm tall by 12 cm in diameter) lined with moistened filter papers. The beetles were allowed to mate freely for three hours. Copulating pairs were removed from the Petri dish and the paired combination was recorded. Eighteen replicates were run with different individuals, thus, a total of 180 mating events were possible.

Statistics

To assess the degree of sexual isolation between AF and AV, we used the software JMATING 1.0.862 to estimate IPSI, an estimator proposed by Rolán-Alvarez & Caballero63. The index of sexual isolation, IPSI, ranges from −1 to 1, where −1 is complete disassortative mating, 0 is random mating and 1 is complete assortative mating. Standard deviations (SD) and tests of significance for total IPSI were obtained by bootstrapping with 10000 bootstrap iterations62.