It remains unclear how speciation history might contribute to species-specific variation and affect species delimitation. We examined concordance between cytoplasmic genetic variation and morphological taxonomy in two fir species, Abies chensiensis and A. fargesii, with overlapping distributions in central China. Range-wide genetic variation was investigated using mitochondrial (mt) and plastid (pt) DNA sequences, which contrast in their rates of gene flow. Four mtDNA haplotypes were recovered and showed no obvious species’ bias in terms of relative frequency. In contrast, a high level of ptDNA variation was recorded in both species with 3 common ptDNA haplotypes shared between them and 21 rare ptDNA haplotypes specific to one or other species. We argue that the lack of concordance between morphological and molecular variation between the two fir species most likely reflects extensive ancestral polymorphism sharing for both forms of cytoplasmic DNA variation. It is feasible that a relatively fast mutation rate for ptDNA contributed to the production of many species-specific ptDNA haplotypes, which remained rare due to insufficient time passing for their spread and fixation in either species, despite high levels of intraspecific ptDNA gene flow. Our phylogeographic analyses further suggest that polymorphisms in both organelle genomes most likely originated during and following glacial intervals preceding the last glacial maximum, when species distributions became fragmented into several refugia and then expanded in range across central China.
The repeated shifts in distribution ranges of plant and animal species caused by the Quaternary climatic oscillations have left distinct imprints on their current genetic structures (Hewitt, 2000). Thus, in addition to palaeobotanical data, the present-day distribution of molecular variation within plant species can provide evidence of species or vegetation shifts within a local region during the Pleistocene (for example, see Abbott et al., 2000; Petit et al., 2003). However, resolving the evolutionary history of genetic variation within and between closely related species is often challenging. On the one hand, an allopatric distribution across glacial refugia might promote interspecific genetic divergence through fixing species-specific haplotypes via genetic drift and selection (Avise and Walker, 1998). On the other hand, secondary contact following range expansions of previously isolated species can result in genetic changes through introgression (McKinnon et al., 2004).
DNA sequences are employed increasingly to examine concordance between genetic variation and traditional morphological boundaries among closely related species (for example, see Jaramillo-Correa et al., 2008; Du et al., 2009; Zhou et al., 2010). Such studies suggest that molecular polymorphisms of neutral markers are often widely shared between closely related species, and that fixed genetic differences vary according to the genomes or genes studied (see review by Petit and Excoffier, 2009). In conifer species, plastid (pt) DNA variation, which is mainly paternally inherited and shows high rates of gene flow, is often more species specific than mitochondrial (mt) DNA variation, which is mainly maternally inherited and exhibits lower rates of gene flow (Du et al., 2009; Zhou et al., 2010). Three possible mechanisms may account for this difference, which are often difficult to distinguish in empirical studies. First, high rates of intraspecific gene flow may override introgression and produce more diagnostic genetic differences between species (Petit and Excoffier, 2009), as suggested by recent simulation studies (Currat et al., 2008). Second, high rates of intraspecific gene flow may promote rapid lineage sorting of ancestral polymorphisms, such that alleles exhibiting higher dispersal may rapidly become more species specific (Hoelzer, 1997; Zhou et al., 2010). Finally, mutation-rate heterogeneity between different DNAs (Wolfe et al., 1987) could increase the development of species-specific genotypes in isolation (Sloan et al., 2008). A further complication is that ‘speciation genes’ might spread rapidly across populations of two diverging species due to powerful selection, causing reproductive isolation and morphological delimitation to occur (for example, see Ting et al., 2000) before sufficient time has passed to achieve monophyly of neutral makers (de Queiroz, 2007). Thus, a genetic mosaic might continue to persist for a long time across two species with high genetic divergence between them occurring in and around ‘speciation genes’, whereas low divergence prevails at unlinked or loosely linked neutral markers (Via, 2009). It is evident, therefore, that the sharing and partitioning of neutral genetic variations between species will depend greatly on historical processes occurring during and following their origin, that is, the speciation history (Wiens, 2007).
Here, we report a case study of genetic divergence between two fir species, Abies chensiensis Tieghem and A. fargesii Franchet, on the basis of surveys of mt and ptDNA sequence variation. These two species occur exclusively in central China where many indigenous species shifted their distribution ranges in response to Quaternary climatic oscillations even though the region was never covered by ice sheets during the Quaternary (Chen et al., 2008; Tian et al., 2009). The occurrence of range fragmentation and adaptation to new niches during this period may have promoted speciation in the region (Qian and Ricklefs, 2000). A. chensiensis is an endangered plant that is usually found scattered in small forest fragments at elevations from 2300 to 3000 m, whereas A. fargesii is a dominant member of coniferous forests found at elevations between 1500 and 3900 m (Fu et al., 1999). Morphologically, the two species are distinguished by A. chensiensis having branchlets that are yellowish gray or brownish yellow, mature seed cones that are light brown, inserted bracts, and pale-green leaf needles that are 3.9–4.6 cm long and 2.1–2.6 mm wide, whereas A. fargesii has purplish brown or reddish brown branchlets, dark purple or reddish brown mature seed cones, exserted bracts with much longer cusps, and dark-green leaf needles that are 1.4–1.8 cm long and 1.4–2.0 mm wide (Fu et al., 1999). As in other fir species, the pollen of both species is wind-dispersed and seeds are dispersed by animals (Fu et al., 1999). On the basis of their differences, A. chensiensis and A. fargesii are often grouped into different sections of genus Abies (Liu, 1971; Farjon and Rushforth, 1989). Their exact phylogenetic relationship remains uncertain, although our unpublished phylogenetic analysis of all fir species in China using ptDNA sequence variation suggests that they are closely related, and most likely sister species.
The biogeography and delimitation of fir species outside China have been explored previously using mtDNA and ptDNA markers (for example, see Tsumura and Suyama, 1998; Liepelt et al., 2002, 2010; Jaramillo-Correa et al., 2008). These studies have revealed different phylogeographic structures and/or inconsistent patterns of species delimitation for the two markers, with mtDNA variation tending to reveal stronger intraspecific genetic structure and weaker resolution of species boundaries when compared with ptDNA. In this study, we examined the genetic structure and evolutionary relationships among a total of 254 individuals from 25 populations of A. chensiensis and A. fargesii across their overlapping and allopatric ranges. In doing so, we aimed to address the following questions: (1) how is ptDNA and mtDNA variation distributed within and between the two species? Is ptDNA variation more species specific than mtDNA variation? If so, is this likely to result from reduced introgression or rapid lineage sorting of ptDNA due to a higher rate of intraspecific gene flow of ptDNA, or is it more likely to stem from a fast mutation rate of this genome? (2) Are the biogeographic histories of these two high-altitude tree species consistent with those reported for other species in the region?
Materials and methods
Species and population sampling
We sampled morphologically pure and autochthonous populations of A. chensiensis and A. fargesii from almost the entire range of the two species in China, such that a total of 254 individuals from 25 populations were sampled (Figure 1a; Supplementary Table S1). At most localities, only one species was present (Figure 1b and c). However, at two localities, both species occurred parapatrically (populations 5 and 22 were parapatric at one site, whereas populations 6 and 21 were parapatric at another site). At these two localities, A. chensiensis was particularly abundant at lower elevations, whereas A. fargesii was mainly restricted to higher altitudes (above 2200 m). At altitudes between 2000 m and 2200 m, some trees were morphologically intermediate and possibly of hybrid type. Hence, care was taken at these localities to collect individuals of different species separated by a minimum of 200 m in altitude to avoid sampling possible hybrids. Fresh leaves of 1–19 individuals, spaced at least 100 m apart, were collected from each population and dried in silica gel. For the phylogenetic ptDNA analysis (see below), A. nordmanniana was selected as outgroup on the basis of a previous molecular study showing its sister-group relationship with two studied species (Xiang et al., 2009). Voucher specimens of representatives of all populations of the two species and the outgroup species were collected and stored in the herbarium of Lanzhou university (more details in Supplementary Table S1). We recorded the latitude, longitude and altitude of each location site by Extrex GIS monitor (Garmin, Taiwan).
DNA extraction, amplification and sequencing
Total genomic DNA was extracted from approximately 20 mg of silica gel-dried, leaf needle material per sample, according to a cetytrimethyl ammonium bromide procedure (Doyle and Doyle, 1987). Variation was determined for the mtDNA sequence spanning intron 4 of subunit 5 of the mt NADH dehydrogenase gene (nad5 intron 4) and intron 1 of subunit 7 of the same gene (nad7 intron 1), described by Wu et al. (1998) and Jaramillo-Correa et al. (2004). In addition, ptDNA variation was determined for the trnS–trnG and trnL–trnF fragments following Demesure et al. (1995) and Taberlet et al. (1991). PCR was carried out in a 25-μl volume, containing 10–40 ng plant DNA, 50 mM Tris-HCI, 1.5 mM MgCl2, 250 μg ml−1 bovine serum albumin, 0.5 mM dNTPs, 2 μM of each primer and 0.75 U of Taq polymerase. All reactions were performed with the following program: 4 min at 94 °C, 37 cycles of 40 s at 94 °C, 40 s of annealing at 58 °C for all the mt genes and 60 °C for pt genes, 1.20 min at 72 °C, with a final 7-min extension at 72 °C, and reactions were held at 4 °C until further processing.
We then purified PCR products using a TIANquick Midi Purification Kit following the recommended protocol (TIANGEN, Beijing, China). Sequencing reactions were performed with the PCR primers described above and internal primers for ptDNA trnL–trnF (primer ‘e’: 5′-IndexTermGGTTCAAGTCCCTCTATCCC-3′) using ABI Prism Bigdye Terminator version 3.1 Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA). Forward, reverse or internal sequences were checked carefully by eye for all new mutations. Depending on the quality of sequence and the site of mutation, the forward, the reverse or the internal primers were then used for large-scale sequencing. We aligned DNA sequences with CLUSTAL X version 1.83 (Thompson et al., 1997) and refined the alignments by hand. A matrix of combined sequences was constructed for the 254 individuals examined, in which 4 different mtDNA sequences (mitotypes) and 24 different ptDNA sequences (ptDNA haplotypes) were identified. These mt and pt fragment sequences have been deposited in GenBank under accession numbers: HM851212-HM851248.
We first calculated haplotype gene diversity (h) and nucleotide diversity (π) of mtDNA and ptDNA for each population (hS, πS) and each species (hT, πT) using DnaSP version 5.0 (Librado and Rozas, 2009). To determine phylogenetic relationships among mtDNA and ptDNA haplotypes, we constructed median-joining networks (Bandelt et al., 1999) with NETWORK version 188.8.131.52 (available at http://www.fluxus-engineering.com). Furthermore, we also used PAUP* 4.0b10 (Swofford, 2002) to perform maximum parsimony analyses on ptDNA haplotypes, using A. nordmanniana as outgroup. Maximum parsimony analyses involved a heuristic search strategy with 100 replicates of random addition of sequences, in combination with ACCTRAN character optimization, MULPARS+TBR (tree bisection–reconnection) branch swapping and STEEPEST DESCENT options on. The robustness of maximum parsimony trees was estimated by 1000 bootstrap replicates. Bayesian inference was conducted using MrBayes version 3.1.2 (Huelsenbeck and Ronquist, 2001; Ronquist and Huelsenbeck, 2003), with the best-fitting TrN+I model selected by Modeltest version 3.06 (Posada and Crandall, 1998). Four chains of the Markov chain Monte Carto were run each for 1 000 000 generations retaining every 100th sample, after discarding a burn-in of 30 000 generations. Each insertion/deletion (indel) was considered as a single mutation event, and all indels were therefore coded as single positions in the final alignment. In cases where it was not possible to code the indel variation by a single alignment position (overlapping indels), modified complex indel coding was used to code indels (Simmons and Ochoterena, 2000).
We used SAMOVA (spatial analysis of molecular variation; Dupanloup et al., 2002) to explore the population structure of sampling areas across the range of A. chensiensis and A. fargesii, without a priori hypotheses of the expected structure. This method was used to identify clusters of populations that are geographically homogeneous and genetically differentiated from each other. A simulated annealing process was run to obtain the number of K groups that maximizes FCT (proportion of total genetic variance due to differences among groups of populations). For these analyses, we used 100 simulated annealing processes for each value of K from K=2 to K=8 for both mtDNA and ptDNA haplotypes, and chose either the number of groups (K) that gave the highest FCT or the number at which FCT began to plateau.
We also calculated average gene diversity within populations (HS), total gene diversity (HT) and the coefficients of differentiation GST and NST for each species, separately and combined, for both mtDNA and ptDNA markers, using PERMUT (available at http://www.pierroton.inra.fr/genetics/labo/Software/Permut/). In such analyses, GST considers only haplotype frequencies, whereas NST takes into account both haplotype frequencies and genetic similarity between haplotypes. Phylogeographic structure was inferred by testing whether NST was significantly larger than GST using a permutation test with 1000 random permutations of haplotypes across populations (Pons and Petit, 1996). Taking the different levels of variation obtained from mtDNA and ptDNA markers into account, standardized GST values (G′ST) following Hedrick (2005) were calculated using RecodeData version 0.1 (Meirmans, 2006), in conjunction with FSTAT (Goudet, 2001). Hierarchical partitioning of diversity among species, populations and individuals was estimated on the basis of AMOVA(Excoffier et al., 1992), which considered both haplotype frequency differences and the number of site differences among observed haplotypes using the program Arlequin version 3.5 (Excoffier and Lischer, 2010), with significance tested by a non-parametric permutation procedure with 1000 permutations. A Mantel test assessed the significance of isolation by distance between populations with 1000 random permutations on matrices of pairwise population FST values and the natural logarithm of geographical distances (Rousset, 1997).
In an attempt to infer demographic processes, a mismatch distribution analysis was carried out for each species, with each phylogeographic group being arranged by SAMOVA partitions and all individuals using Arlequin version 3.5 (Excoffier and Lischer, 2010), separately. The shape of the graph of the mismatch distribution is expected to be multimodal in samples drawn from populations at demographic equilibrium, whereas unimodal distributions are generally found in populations that have passed through a recent demographic expansion (Rogers and Harpending, 1992). The sum of squared deviations and raggedness indices (r) between observed and expected mismatch distributions were used as test statistics; their P-values represented the probability of obtaining a simulated sum of squared deviations greater than or equal to the one observed. We also performed Fu's FS test and calculated Tajima's D to confirm evidence of demographic expansion. Estimation and testing were done with Arlequin version 3.5 (Excoffier and Lischer, 2010) using 10 000 bootstrap replicates for both Fu's FS and Tajima's D. Finally, separate haplotype networks for each species and phylogeographic group were constructed using median-joining networks with NETWORK version 184.108.40.206 (Bandelt et al., 1999; available at http://www.fluxus-engineering.com) to obtain additional information about the past demographic history of the two species.
If the hypothesis of rapid expansion were not rejected, the relationship τ=2ut (Rogers and Harpending, 1992) was used to estimate the age of expansion (t), where u is the mutation rate for the DNA sequences. The value for u was calculated as u=μkg, where μ is the substitution rate per nucleotide site per year (s s−1 y−1), k is the average sequence length of the DNA region under study, and g is the generation time in years. We estimated expansions using μ=2.61 × 10−10–4.02 × 10−10 s s−1 y−1, which represent a range of neutral substitution rates for ptDNA in Pinaceae on the basis of different fossil calibrations (Gernandt et al., 2008). This produced a range of values for u and hence also for t. One caveat is that a slight bias of nucleotide substitution rates for ptDNA may exist in Abies. Thus, the substitution rates used here are only approximate estimates, although they are consistent with estimates reported recently for other conifer species (Willyard et al., 2007; Gugger et al., 2010). An average haplotype sequence length of 1111 bp and a minimum estimate of generation time of 25 years in Pinaceae were used in the calculations (Brown et al., 2004). Times of expansion were calculated separately for each species, phylogeographic group and all individuals.
Polymorphism was observed within both mtDNA regions examined: nad5 intron 4 and nad7 intron 1. Different repeats of a 4-bp microsatellite formed three different size variants in intron 4 of nad5, whereas polymorphism in intron 1 of nad7 was produced by a single insertion at position 228 (Supplementary Table S2). When combined, these sequences resulted in identification of four distinct mitotypes ranging in size from 1033 to 1041 bp, with an alignment length of 1045 bp, and distributed as shown in Figure 1b and Supplementary Table S1. The three most frequent mitotypes (M1, M2 and M4) were shared between both species, whereas mitotype M3 was specific to A. fargesii, where it occurred at low frequency in populations 23 and 24 (Figure 1b and d). In all, 21 of the 25 populations surveyed were fixed for a single haplotype, whereas the remaining four polymorphic populations harbored two or three haplotypes. In the mitotype network, the four mitotypes were separated by only one or two mutational changes and, hence, were closely related (Figure 1d).
Estimates of h and π of mtDNA for each population are summarized in Supplementary Table S1. The analysis of spatial genetic structure for mitotype variation using SAMOVA showed that the value of FCT reached a plateau when the K equaled 3 (FCT=0.920). For K⩾3, some groups comprised a single population, indicating the disappearance of geographical structure (Supplementary Table S3). One of the three groups, termed the north-central group, was characterized by a high frequency of mitotype M2 and comprised populations 1–2, 4–7, 13–14, 16–22, 24. A second group, termed the western group, comprised populations 8–10 and 15, which were fixed for mitotype M4. The remaining group, termed the southern group, comprised populations 3, 11, 12 and 23, and was characterized by a high frequency of mitotype M1. Each group exhibited a distinct geographical range that neither overlapped those of other groups nor coincided with taxon identification (Figure 1b).
HT on the basis of mitotype variation across both species was much higher than HS, and consequently, both GST and NST were high in the two species (Table 1). There was also significant phylogeographic structure when populations from both species were considered together (NST=0.934>GST=0.905; P<0.05, Table 1). The G′ST for mtDNA variation was slightly higher than GST for both species (Table 1), whereas AMOVA showed that the percentage of variation explained at each hierarchical level was nearly identical across species (Table 2). Only 1.19% of the total mtDNA variation was attributed to variation between species, whereas 87.81% was attributed to variation among populations within species (Table 2). Mantel tests revealed a significant correlation between genetic divergence and geographical distance between populations combined over species (r=0.224, P<0.05) within A. chensiensis (r=0.521, P<0.05), but not in A. fargesii (r=0.130, P=0.179) (Table 1).
In all, 5 indels and 13 substitutions were detected in the trnS–trnG and trnL–trnF fragments leading to the resolution of 24 different ptDNA haplotypes (H1–H24) among trees surveyed (Supplementary Table S2). Haplotype sizes ranged from 1083 to 1139 bp, with a sequence alignment of 1186bp. The three most frequent ptDNA haplotypes (H1, H2 and H5) were shared between species and located near the center of the haplotype network, while numerous closely related tip haplotypes that were rare and species specific were derived from each of these haplotypes (Figure 1c and e; Supplementary Table S1). Phylogenetic analysis of all ptDNA haplotypes, with A. nordmanniana used as outgroup, showed that phylogenetic relationships among ptDNA haplotypes were poorly resolved (Supplementary Figure S1).
Estimates of haplotype and nucleotide diversity within populations for ptDNA were much higher than those for mtDNA, but varied widely among populations in each species, although averages were similar for both species (Supplementary Table S1). SAMOVA failed to reveal any meaningful phylogeographic groups of ptDNA haplotypes (Supplementary Table S3). In contrast to mtDNA variation, HT on the basis of ptDNA haplotype variation was only slightly higher than HS for both species. This resulted in relatively lower values for GST and NST, and in neither species was NST significantly greater than GST (Table 1). However, GST for ptDNA variation might considerably underestimate the extent of population differentiation when there is high genetic variation within populations (Hedrick, 2005). This appears to be the case here as G′ST, which overcomes this problem, was indeed much higher than GST in both species (G′ST=0.663 in A. chensiensis, G′ST=0.418 in A. fargesii; Table 1). AMOVA confirmed that most ptDNA diversity was distributed within populations (79.08%), with negligible variation present between species (Table 2). A Mantel test showed that ptDNA variation was correlated with geographical distance across the distribution of both species (r=0.158, P<0.05) within A. fargesii (r=0.324, P<0.05), but not in A. chensiensis (r=0.087, P=0.316) (Table 1).
Historical demography on the basis of ptDNA sequence variation
The mismatch distributions for both fir species, for the three mitotype groups identified by SAMOVA, and for all individuals, showed a good fit to the expansion model with curves appearing to be unimodal (Supplementary Figure S2). In addition, the nonsignificant variance (sum of squared deviations) and raggedness index, plus the significant negative values for Fu's FS and Tajima's D in these samples, supported a model of sudden demographic expansion (Supplementary Table S4), as did the star-like patterns resolved in the ptDNA networks for each species and mtDNA group (Supplementary Figure S3). On the basis of recent fossil-calibrated neutral substitution rates for ptDNA observed in Pinaceae, we calculated the demographic expansions to have occurred between 42 000 and 105 000 years ago (kya), according to mismatch analysis (Rogers and Harpending, 1992) (Supplementary Table S4).
In agreement with results from several previous studies on conifers (Liepelt et al., 2002; Du et al., 2009), we recorded low levels of mtDNA polymorphism in the two fir species, A. chensiensis and A. fargesii, which have overlapping distributions in the Tsinling and adjacent mountains of central China. Of the four mitotypes identified, three were shared between both species, with variation distributed mainly between populations rather than within populations or between species. In contrast, a high level of ptDNA polymorphism was recorded in both fir species, due largely to high levels of variation within rather than between populations. Neither mtDNA variation nor ptDNA variation delimited the two species from each other, despite the fact that rare ptDNA haplotypes were species specific. The difference in population structure exhibited by mtDNA relative to ptDNA variation most likely reflects a higher rate of sequence evolution for ptDNA (Wolfe et al., 1987).
Mitotype variation across both species showed significant phylogeographic structure (NST>GST, P<0.05) and SAMOVA-clustered populations into three mitotype groups. Each of these groups was characterized by a particular mitotype and exhibited a distinct geographical distribution; thus, a north-central group was characterized by a high frequency of mitotype M2, a western group was fixed for mitotype M4, and a southern group was characterized by a high frequency of mitotype M1. The occurrence of these groups suggests that at least three independent refugia were maintained for both species (or their common ancestor) possibly during Quaternary glacial periods when their distributions may have fragmented in central China. If this is correct, then the current distribution of populations is likely to have resulted from expansions from these different refugia, with secondary contact occurring in some areas giving rise to populations (for example, populations 2, 11, 23 and 24) polymorphic for mitotypes diagnostic of the different phylogeographic groups. Other surveys of molecular variation in different plant species occurring in central China, such as Taxus wallichiana (Gao et al., 2007), Pinus tabulaeformis (Chen et al., 2008), Ostryopsis davidiana (Tian et al., 2009), Eurycorymbus cavaleriei (Wang et al., 2009), have also indicated that multiple refugia existed in this region during the Quaternary.
Although SAMOVA did not detect a similar population structure for ptDNA variation, recent range expansions of the two fir species were also indicated by population genetic and phylogenetic analyses of ptDNA haplotype variation. Thus, the occurrence of significantly negative values for Fu's FS and Tajima's D, unimodal mismatch distributions, and ‘star-like’ phylogenies of ptDNA haplotypes within and across species, indicated that rapid expansions took place in both species in the relatively recent past (Hwang et al., 2003). Importantly, these signatures of rapid expansion were evident when analyses were restricted to ptDNA haplotypes within population groups representing the three mtDNA groups detected by SAMOVA. This supports the hypothesis of rapid migrations from isolated refugia (see above).
The much higher level of ptDNA haplotype variation relative to mitotype variation recorded across both fir species might reflect a higher mutation rate for ptDNA (see below). A difference in mutation rate has been suggested to be a cause of two very different types of genetic composition for populations that have undergone a recent bottleneck, followed by rapid expansion. In such populations, a single haplotype is likely to be present for sequences exhibiting a low mutation rate, whereas a dominant haplotype with multiple rare alleles, as found for ptDNA haplotype variation in the present study, is more likely to be present for sequences showing a high mutation rate (for example, see Akey et al., 2004).
An indication of when rapid expansions from refugia were most likely to have occurred was obtained for both fir trees from mismatch analyses of ptDNA haplotype variation. These analyses suggested that the species underwent a notable range expansion, approximately 42–105 kya, that is following the penultimate glacial period that occurred in China between approximately 333 and 136 kya (Yi et al., 2005). This range expansion might reflect the most recent demographic shift of conifer forests in the Tsinling and adjacent mountains of central China in response to climatic oscillations during the Pleistocene (Zheng, 2000). Thus, unlike species in North America and Europe, which commonly expanded in range after the last glacial maximum (around 18 kya) (Abbott et al., 2000; Hewitt, 2000; Petit et al., 2003), our results suggested that the last glacial maximum had little impact on the demographic history of the two fir species we studied from central China. This accords with results obtained from recent studies on a few other plants and animals in Asia, such as Dysosma versipellis (Qiu et al., 2009), Allium przewalskianum (Wu et al., 2010) and Leucodioptron canorum (Li et al., 2009), all of which suggest that the last glacial maximum had a less important effect on the demographic histories of species in this region relative to earlier Pleistocene glaciations.
Cytoplasmic DNA variation and species delimitation
AMOVA showed that only ∼0.79% of total ptDNA variation and ∼1.19% of total mtDNA variation was partitioned between the two fir species. Although some ptDNA haplotypes were species specific, the three most common ptDNA haplotypes (H1, H2 and H5), along with the three common mitotypes (M1, M2 and M4), were widely shared across both species (Figure 1). Shared polymorphisms across closely related species appear to be common in fir, pine and spruce genera of the Pinaceae (for example, see Tsumura and Suyama, 1998; Godbout et al., 2008; Jaramillo-Correa et al., 2008; Du et al., 2009). Sharing might result from introgression following secondary contact between long diverged species or, alternatively, to the maintenance of ancestral polymorphisms due to incomplete lineage sorting within recently diverged species (Wiens, 2007; de Queiroz, 2007). Recently, it was shown that in most conifers, ptDNA delimits closely related species more effectively than does mtDNA (Du et al., 2009). This difference in behavior of the pt and mtDNA could be explained in part by differences in the level of intraspecific gene flow, that is, the more the gene flow within species, the less the introgression between species (Currat et al., 2008; Petit and Excoffier, 2009). If introgression were the main cause of sharing of haplotypes between species, sharing would be expected to be more evident between parapatric than allopatric populations (Palmé et al., 2004). However, in the present study, the amount of sharing of mitotypes and ptDNA haplotypes between species showed no bias with regard to either parapatric or allopatric population distributions. Although we cannot fully exclude the possibility that introgression contributed to the observed genetic patterns between the two species, the shallow genealogy and the relative interior position of shared haplotypes in the species networks for both mtDNA and ptDNA suggests that these species may have diverged too recently for ancestral polymorphisms to have been sorted to the stage of reciprocal monophyly (Crandall and Templeton, 1993). It is feasible, therefore, that incomplete sorting of ancestral polymorphisms is the more likely explanation for the sharing of haplotypes between the two fir species.
The finding that all species-specific ptDNA haplotypes originated between 42 and 105 kya indicates that the speciation events that gave rise to A. chensiensis and A. fargesii probably occurred very recently in the Quaternary, as suggested for many endemic species in eastern Asia (Qian and Ricklefs, 2000). It is also likely that the near lack of species-specific mitotypes might stem from a low mutation rate for mtDNA (Wolfe et al., 1987), and that insufficient time has passed to produce species-specific mtDNA mutations that might ultimately become fixed in each species. If this is correct, we might tentatively conclude that it is a faster mutation rate of ptDNA in A. chensiensis and A. fargesii rather than a higher rate of gene flow that caused the development of many rare species-specific ptDNA haplotypes in these two species. Such a difference in mutation rate between the two types of cytoplasmic DNA might also be a cause of ptDNA variation being more species specific in other conifer species pairs (groups) examined (see review by Du et al., 2009).
During the initial stages of divergence, it is to be expected that a large amount of neutral genetic variations would commonly be shared between closely related species, and that sequences with relatively faster mutation rates would become species specific more rapidly (de Queiroz, 2007). The replacement of shared ancestral genotypes by new species-specific genotypes would be accelerated by high rates of intraspecific gene flow enabling new mutant genotypes to expand rapidly throughout a species’ range (Hoelzer, 1997). Thus, for older species pairs (groups) that originated and diverged from each other long ago, it would be expected that genetic markers showing relatively higher rates of mutation and gene flow should be more species specific and more likely to be fixed or occur at high frequency across a species’ range (Zhou et al., 2010). In contrast, in very recently originated species pairs, such as the two fir species examined here, there would have been insufficient time for lineage sorting of ancestral polymorphisms to have been completed or for new markers originating by mutation to have spread widely and become fixed in a species.
It is clear from the comparative studies undertaken to date (Du et al., 2009, Zhou et al., 2010, and the present one), plus the results of recent modeling simulations (Currat et al., 2008), that different patterns of neutral genetic variation can exist within and between closely related species due to different rates of mutation and gene flow, and the length of time that has passed since speciation occurred. These factors are clearly of crucial importance when assessing how effective different genetic markers might be for delimiting species in particular groups.
Abbott RJ, Smith LC, Milne RI, Crawford RM, Wolff K, Balfour J (2000). Molecular analysis of plant migration and refugia in the Arctic. Science 289: 1343–1346.
Akey JM, Eberle MA, Rieder MJ, Carlson CS, Shriver MD, Nickerson DA et al. (2004). Population history and natural selection shape patterns of genetic variation in 132 genes. PloS Biol 2: 1591–1599.
Avise JC, Walker D (1998). Pleistocene phylogeographical effects on avian populations and the speciation process. Philos Trans R Soc Lond B Biol Sci 265: 457–463.
Bandelt HJ, Forster P, Röhl A (1999). Median-joining networks for inferring intraspecific phylogenies. Mol Biol Evol 16: 37–48.
Brown GR, Gill GP, Kuntz RJ, Langley CH, Neale DB (2004). Nucleotide diversity and linkage disequilibrium in loblolly pine. Proc Natl Acad Sci USA 101: 15255–15260.
Chen KM, Abbott RJ, Milne RI, Tian XM, Liu JQ (2008). Phylogeography of Pinus tabulaeformis Carr. (Pinaceae), a dominant species of coniferous forest in northern China. Mol Ecol 17: 4276–4288.
Crandall KA, Templeton AR (1993). Empirical tests of some predictions from coalescent theory with applications to intraspecic phylogeny reconstruction. Genetics 134: 959–969.
Currat M, Manuel R, Petit JR, Excoffier L (2008). The hidden side of invasion: massive introgression by local genes. Evolution 62: 1908–1920.
de Queiroz K (2007). Species concepts and species delimitation. Syst Biol 56: 879–886.
Demesure B, Sodzi N, Petit RJ (1995). A set of universal primers for amplification of polymorphic non-coding regions of mitochondrial and chloroplast DNA in plants. Mol Ecol 4: 129–131.
Doyle JJ, Doyle JL (1987). A rapid DNA isolation procedure for small quantities of fresh leaf material. Phytochem Bull 19: 11–15.
Du KF, Petit RJ, Liu JQ (2009). More introgression with less gene flow: chloroplast vs. mitochondrial DNA in the Picea asperata complex in china, and comparison with other conifers. Mol Ecol 18: 1396–1407.
Dupanloup I, Schneider S, Excoffier L (2002). A simulated annealing approach to define the genetic structure of populations. Mol Ecol 11: 2571–2581.
Excoffier L, Lischer HE (2010). Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour 10: 564–567.
Excoffier L, Smouse PE, Quattro JM (1992). Analysis of molecular variance inferred from metric distance among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics 131: 479–491.
Farjon A, Rushforth KD (1989). A classification of Abies Miller (Pinaceae). Notes R Bot Gard Edinburgh 46: 59–77.
Fu LG, Li N, Elias TS (1999). Pinaceae. In: Wu Z, Raven PH (eds). Flora of China, Vol. 4. Science Press: Beijing, pp 15–90.
Gao LM, MöLLER M, Zhang XM, Hollingsworth ML, Liu J, Mill RR et al. (2007). High variation and strong phylogeographic pattern among cpDNA haplotypes in Taxus wallichiana (Taxaceae) in China and North Vietnam. Mol Ecol 16: 4684–4698.
Gernandt DS, Magalión S, López GG, Flores OZ, Willyard A, Liston A (2008). Use of simultaneous analyses to guide fossil-based calibrations of Pinaceae phylogeny. Int J Plant Sci 169: 1086–1099.
Godbout J, Fazekas A, Hewton CH, Yeh FC, Bousquet J (2008). Glacial vicariance in the Pacific Northwest: evidence from a lodgepole pine mitochondrial DNA minisatellite for multiple genetically distinct and widely separated refugia. Mol Ecol 17: 2463–2475.
Goudet J (2001). FSTAT (version 2.9.3): a program to estimate and test gene diversities and fixation indices. Available from http://www.unil.ch/izea/softwares/fstat.html.
Gugger PF, Sugita S, Cavender-Bares J (2010). Phylogeography of Douglas-fir based on mitochondrial and chloroplast DNA sequences: testing hypotheses from the fossil record. Mol Ecol 19: 1877–1897.
Hedrick PW (2005). A standardized genetic differentiation measure. Evolution 59: 1633–1638.
Hewitt GM (2000). The genetic legacy of the Quaternary ice ages. Nature 405: 907–913.
Hoelzer GA (1997). Inferring phylogenies from mtDNA variation: mitochondrial-gene trees versus nuclear-gene trees revisited. Evolution 51: 622–626.
Huelsenbeck JP, Ronquist F (2001). MrBayes: Bayesian inference of phylogenetic trees. Bioinformatics 17: 754–755.
Hwang SY, Lin TP, Ma CS, Lin CL, Jeng-Der Chung, Yang JC (2003). Postglacial population growth of Cunninghamia konishii (Cupressaceae) inferred from phylogeographical and mismatch analysis of chloroplast DNA variation. Mol Ecol 12: 2689–2695.
Jaramillo-correa JP, Beaulieu J, Bousquet J (2004). Variation in mitochondrial DNA reveals multiple distant glacial refugial in black spruce (Picea mariana) a transcontinental North American conifer. Mol Ecol 13: 2735–2747.
Jaramillo-Correa JP, Aguirre-Planter E, Khasa DP, Eguiarte LE, Pinero D, Furnier GR et al. (2008). Ancestry and divergence of subtropical montane forest isolates: molecular biogeography of the genus Abies (Pinaceae) in southern Mexico and Guatemala. Mol Ecol 17: 2476–2490.
Li SH, Yeung CK, Feinstein J, Han LX, Le MH, Wang CX et al. (2009). Sailing through the Late Pleistocene: unusual historical demography of an East Asian endemic, the Chinese Hwamei (Leucodioptron canorum canorum), during the last glacial period. Mol Ecol 18: 622–633.
Librado P, Rozas J (2009). DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25: 1451–1452.
Liepelt S, Bialozyt R, Ziegenhagen B (2002). Wind-dispersed pollen mediates postglacial gene flow among refugia. Proc Natl Acad Sci USA 99: 14590–14594.
Liepelt S, Mayland-Quellhorst E, Lahme M, Ziegenhagen B (2010). Contrasting geographical patterns of ancient and modern genetic lineages in Mediterranean Abies species. Plant Syst Evol 284: 141–151.
Liu TS (1971). A Monograph of the Genus Abies. Department of Forestry, College of Agriculture, National Taiwan University: Taipei.
McKinnon GE, Jordan GJ, Vaillancourt RE, Steane DA, Potts BM (2004). Glacial refugia and reticulate evolution: the case of the Tasmanian eucalypts. Philos Trans R Soc Lond B Biol Sci 359: 275–284.
Meirmans PG (2006). Using the AMOVA framework to estimate a standardized genetic differentiation measure. Evolution 60: 2399–2402.
Palmé AE, Su Q, Palsson S, Lascoux M (2004). Extensive sharing of chloroplast haplotypes among European birches indicates hybridization among Betula pendula, B. pubescens and B. nana. Mol Ecol 13: 167–178.
Petit RJ, Aguinagalde I, de Beaulieu JL, Bittkau C, Brewer S, Cheddadi R et al. (2003). Glacial refugia: hotspot but not melting pots of genetic diversity. Science 300: 1563–1565.
Petit RJ, Excoffier L (2009). Gene flow and species delimitation. Trends Ecol Evol 24: 386–393.
Pons O, Petit RJ (1996). Measuring and testing genetic differentiation with ordered versus unordered alleles. Genetics 144: 1237–1245.
Posada D, Crandall KA (1998). Modeltest: testing the model of DNA substitution. Bioinformatics 14: 817–818.
Qian H, Ricklefs RE (2000). Large-scale processes and the Asian bias in species diversity of temperate plants. Nature 407: 180–182.
Qiu YX, Guan BC, Fu CX, Comes HP (2009). Did glacials and/or interglacials promote allopatric incipient speciation in East Asian temperate plants? Phylogeographic and coalescent analyses on refugial isolation and divergence in Dysosma versipellis. Mol Phylogenet Evol 51: 281–293.
Rogers AR, Harpending H (1992). Population growth makes waves in the distribution of pairwise genetic differences. Mol Biol Evol 9: 552–569.
Ronquist F, Huelsenbeck JP (2003). MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19: 1572–1574.
Rousset F (1997). Genetic differentiation and estimation of gene flow from F-statistics under isolation by distance. Genetics 145: 1219–1228.
Simmons MP, Ochoterena H (2000). Gaps as characters in sequence based phylogenetic analyses. Syst Biol 49: 369–381.
Sloan DB, Camille MB, Matthew SO, Stephen RK, Douglas RT (2008). Evolutionary rate variation at multiple levels of biological organization in plant mitochondrial DNA. Mol Biol Evol 25: 243–246.
Swofford DL (2002). PAUP*: Phylogenetic Analyses Using Parsimony (* and Other Methods) Version 4 Sinauer Associates: Sunderland, MA.
Taberlet P, Gielly L, Pautou G, Bouvet J (1991). Universal primers for amplification of three non-coding regions of chloroplast DNA. Plant Mol Biol 17: 1105–1109.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG (1997). The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25: 4876–4882.
Tian B, Liu RR, Wang LY, Qiu Q, Chen KM, Liu JQ (2009). Phylogeographic analyses suggest that a deciduous species (Ostryopsis davidiana Decne., Betulaceae) survived in northern China during the last glacial maximum. J Biogeogr 36: 2148–2155.
Ting CT, Tsaur SC, Wu CI (2000). The phylogeny of closely related species as revealed by the genealogy of a speciation gene. Proc Natl Acad Sci USA 97: 5313–5316.
Tsumura Y, Suyama Y (1998). Differentiation of mitochondrial DNA polymorphisms in populations of five Japanese Abies species. Evolution 52: 1031–1042.
Via S (2009). Natural selection in action during speciation. Proc Natl Acad Sci USA 106: 9939–9946.
Wang J, Gao PX, Kang M, Lowe AJ, Huang HW (2009). Refugia within refugia: the case study of a canopy tree (Eurycorymbus cavaleriei) in subtropical China. J Biogeogr 36: 2156–2164.
Wiens JJ (2007). Species delimitation: new approaches for discovering diversity. Syst Biol 56: 875–878.
Willyard A, Syring J, Gernandt DS, Liston A, Cronn R (2007). Fossil calibration of molecular divergence infers a moderate mutation rate and recent radiations for Pinus. Mol Biol Evol 24: 90–101.
Wolfe KH, Li W-H, Sharp PM (1987). Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc Natl Acad Sci USA 84: 9054–9058.
Wu JY, Krutovskii KV, Strauss SH (1998). Abundant mitochondrial genome diversity, population differentiation and convergent evolution in pines. Genetics 150: 1605–1614.
Wu LL, Cui XK, Milne RI, Sun YS, Liu JQ (2010). Mutiple autopolyploidisations and range expansion of Allium przewalskianum Regel. (Allianceae) in the Qinghai-Tibetan Plateau. Mol Ecol 19: 1691–1704.
Xiang QP, Xiang QY, Guo YY, Zhang XC (2009). Phylogeny of Abies (Pinaceae) inferred from nrITS sequence data. Taxon 58: 141–152.
Yi CL, Cui ZJ, Xiong HG (2005). Numerical periods of Quaternary glaciations in China. Quatern Sci 25: 609–619.
Zheng Z (2000). Vegetation and climate since the Late Pleistocene in southern China. J Geosci China 2: 7–20.
Zhou YF, Abbott RJ, Jiang ZY, Du FK, Milne RI, Liu JQ (2010). Gene flow and species delimitation: a case study of two pine species with overlapping distributions in southeast China. Evolution 8: 6342–6352.
This research was supported by grants from the Natural Science Foundation of China (30725004) and a Royal Society-NSF China International Joint Project award 2010/R4 to RJA and JQL.
The authors declare no conflict of interest.
Supplementary Information accompanies the paper on Heredity website
About this article
Cite this article
Wang, J., Abbott, R., Peng, Y. et al. Species delimitation and biogeography of two fir species (Abies) in central China: cytoplasmic DNA variation. Heredity 107, 362–370 (2011). https://doi.org/10.1038/hdy.2011.22
- Abies chensiensis
- A. fargesii
- gene flow
- mutation rate
- species delimitation
Importance of incomplete lineage sorting and introgression in the origin of shared genetic variation between two closely related pines with overlapping distributions
Tree Genetics & Genomes (2017)
Genetic divergence and phylogeographic history of two closely related species (Leucomeris decora and Nouelia insignis) across the 'Tanaka Line' in Southwest China
BMC Evolutionary Biology (2015)
Plant Molecular Biology Reporter (2015)
Evolution of an Ancient Microsatellite Hotspot in the Conifer Mitochondrial Genome and Comparison with Other Plants
Journal of Molecular Evolution (2013)