Introduction

The genus Quercus is frequently designated as crux botanicorum among taxonomists. The systematics of this genus is complicated and controversial, and this is reflected in a different number of species recognized by different authors. Linnaeus (1753) proposed 12 species and since then, the number of oak taxa has increased to several hundred; Schwarz (1964) considered 320 taxa to be separate species, plus many forms, varieties or subspecies.

Among the European white oaks (subgenus Lepidobalanus), the species with the highest economic importance and therefore those which have been most intensively studied are Quercus robur L. and Q. petraea (Matt.) Liebl. Several other taxa have been described and found in natural forests over Europe (mainly eastern and southern Europe): Q. dalechampii Ten., Q. polycarpa Schur., Q. pedunculiflora C. Koch. In many floras (Greece, Rumania, Serbia, etc., even in Flora Europaea; Schwarz, 1964), these taxa are listed as separate species but there is no general agreement about their taxonomic status among botanists, since great intraspecific morphological variation makes them difficult to distinguish. Thus, they are frequently included in either Q. robur or Q. petraea. Even the taxonomic status of these species sensu lato is an issue of controversy, since they are interfertile (Steinhoff, 1997) and the frequency of individuals with intermediate morphology is high. Some authors even conclude that they should be considered ecotypes or subspecies within one composite species of Q. robur (Kleinschmit et al., 1995; Steinhoff, 1997).

The genetics of European white oaks has been studied extensively by means of isozyme and DNA genetic markers. Almost all studies have had to treat the problem of species identity and patterns of intraspecific and interspecific differentiation. In general, isozyme studies failed to find true species-specific alleles. Alleles that are found in samples of one species but absent in samples of the other, are generally of low frequencies, so that their absence may be due to sampling error. There are, however, significant differences in allelic frequencies between both species at several loci (Müller-Starck & Ziehe, 1991; Zanetto et al., 1994). Analysis of total proteins using two-dimensional electrophoresis showed similar results; no species-specific spots, but significant differences in the frequency of some protein fractions between both oak species (Barreneche et al., 1996).

Similarly, in most studies, DNA markers do not appear to differentiate the species. Sessile and pedunculate oaks share the same cytoplasmic genes when they occur in the same stand (Petit et al., 1993). It has even been shown that such local sharing applies to at least seven oak species of the subgenus Lepidobalanus (Dumolin-Lapègue et al., 1997). No difference between Q. robur and Q. petraea in the distribution of rDNA length variants was found by Petit et al. (1993). RAPD fragments also failed to discriminate the species (Moreau et al., 1994; Bodénès et al., 1997).

Despite this general lack of differentiation, the existence of interspecific differences has been shown in some studies based on genetic markers. Bordács & Burg (1997) demonstrated the existence of two species-specific RAPD markers differentiating Q. robur (s.l., including ssp. slavonica) from Q. petraea (s.l., including Q. dalechampii). Recently, Muir et al. (2000) presented microsatellite marker data that discriminated the two species, thus supporting the view that they represent different taxonomical units.

Most of the cited studies are based on western European populations. Samples from eastern Europe appear regularly, but they are much less well represented, especially samples from the eastern-most part of the oak distribution range (Russia). This study aims to fill this information gap by comparing the genetic structure of Russian oak populations with those from central and south-eastern Europe.

Materials and methods

Twenty-five indigenous populations (preferably from nature reserves) of sessile and pedunculate oaks were selected in three geographical regions (Fig. 1.): Slovakia (central Europe), Bulgaria (Balkans) and Republic Mari-El (Russia, near the Volga river, only pedunculate oak). Twigs with dormant buds were taken from 30 to 60 trees in each population. Enzymes were extracted from buds using Tris-HCl buffer pH 7.5 (Longauer, 1996) and separated by means of starch-gel electrophoresis using three buffer systems (Li-borate/tris-citrate pH 8.1, Na-borate pH 8.0/tris-citrate pH 8.7, and tris-citrate/tris-histidine pH 7.0). Eight isoenzyme systems were used (E.C. numbers and controlling loci given in parentheses): fluorescent α-esterase (E.C. 3.1.1.1; Fest-A), menadione reductase (E.C. 1.6.99.2; Mnr-A), malate dehydrogenase (E.C. 1.1.1.37; Mdh-D), isocitrate dehydrogenase (E.C. 1.1.1.42; Idh-B), shikimate dehydrogenase (E.C. 1.1.1.25; Skdh-A), glutamate-oxaloacetate transaminase (E.C. 2.6.1.1; Got-B), phosphoglucose isomerase (E.C. 5.3.1.9; Pgi-B), and a nonspecific NAD-dependent dehydrogenase, able to utilize glucose and sorbitol as a substrate (E.C. 1.1.1.47; Gludh-A). Locus designation conforms with Zanetto et al. (1996) and Müller-Starck et al. (1996), who showed Mendelian inheritance for these systems, except Gludh-A, where the Mendelian inheritance was inferred from the isozyme phenotypes (alternative occurrence of single-banded variants corresponding to putative homozygotes and triple-banded variants corresponding to putative heterozygotes for the case of a dimeric enzyme) and from the observation that putative genotype distributions correspond approximately to Hardy–Weinberg expectations (Gömöry, 2000). Alleles were designated by their relative migration rate as related to the most frequent one. Unfortunately, due to transport distances and drying-out of some samples, the locus Gludh-A could not be reliably scored in all populations and other enzyme systems that were tested failed to give interpretable zymograms.

Figure 1
figure 1

Location of the investigated oak populations.

Allelic frequencies at each locus were calculated based on diploid genotypes. Differences in allelic frequencies between populations, regions, and species, were tested using the probability test (Raymond & Rousset, 1995). Expected Hardy–Weinberg heterozygosity and effective number of alleles were used to characterize the genetic diversity. Since there were big differences in sample sizes among the regions, mean number of alleles was not considered an appropriate measure of the genetic multiplicity. Therefore, we used the measure of allelic richness developed by Petit et al. (1998). Allelic richness A[g] corresponds to the number of different alleles found when g gene copies are sampled at the locus in question. Since Q. robur in the Bulgarian region was represented by only one population, where 40 trees were sampled, g was set to 80.

Genetic distances between populations were calculated following Nei (1978). Principal coordinate analysis (Gower, 1966) was used for the interpretation of the genetic distance matrix. The structure of gene diversity (as measured by expected heterozygosity) within the Q. robur/Q. petraea complex was analysed by subdividing the total gene diversity (HT) into four components: HS, the within-population diversity; DSR, diversity among populations within regions; DRQ, diversity among regions within species; and DQT, diversity among species within the complex, following Nei (1973).

Results

Genetic diversity

Mean number of alleles per locus ranged between 1.8 and 2.6 in pedunculate oak populations and from 2.0 to 3.0 in sessile oak populations, this measure is very sensitive to sample size, and so must be regarded with caution. However, allelic richness (referring to equal sample sizes) indicates the same pattern; slightly higher levels of genetic multiplicity in Q. petraea compared to Q. robur (Table 1). On the other hand, diversity levels, as measured by both effective number of alleles and expected heterozygosity, are slightly higher in Q. robur compared to Q. petraea. There does not seem to be an association between allelic richness and diversity at the regional level, it may even seem to be negative; the Bulgarian pedunculate oak population possesses the smallest number of alleles, but, at the same time, exhibits the highest diversity level. However, this opposite trend was not confirmed at the population level, where a positive correlation between allelic richness and diversity was found: r=0.47* and r=0.61** for ne and HE, respectively (*P ≤ 0.05; **P ≤ 0.01).

Table 1 Characteristics of genetic diversity in the investigated oak species and regions

Amounts and patterns of genetic differentiation

Except for Idh-B/140, no allele proved to be species-specific in the sense that it occurred in all populations (or at least all regions) of one species and was absent in the other species. Some alleles appear to be region-specific (Gludh-A/133, Got-B/80, 70), but almost all these alleles are extremely rare (Table 2). There are, however, significant differences in allele frequencies between the species, as well as among populations within both pedunculate and sessile oaks. Except for Mdh-D, the interspecific differences are significant at all loci. However, there are only two loci within the investigated set, where the differences are apparent and concern frequent alleles (with a frequency over 0.05); Idh-B and Gludh-A. Unfortunately, Gludh-A is the locus that could not be scored in all populations.

Table 2 Mean allelic frequencies in the investigated oak species and regions

Specifically, Gludh-A proved to differentiate the species. Therefore, we made most calculations for those loci which could reliably be scored in all populations (Fest-A, Mnr-A, Mdh-D, Idh-B, Skdh-A, Got-B, Pgi-B), and subsequently with these loci plus Gludh-A set for an incomplete population. As indicated by genetic distances (Table 3), ), if Gludh-A is excluded, the differentiation between species is smaller than within both pedunculate and sessile oaks. However, inclusion of Gludh-A enhances considerably the between-species distances, whereas the differentiation within pedunculate oak is almost unaffected and the distances within the sessile oak population set increase only slightly.

Table 3 Average Nei’s genetic distances between populations within and between species and regions (the diagonal and below) and the number of loci exhibiting significant differences (P < 0.05) in allelic frequencies between regions and species (above the diagonal)

As usual in forest trees, the major proportion of the gene diversity (over 90%) is harboured within populations for almost all loci. The surprising observation is that in general, the component of diversity among populations within regions is higher than the component among regions within species. For such small and geographically distant regions, the opposite would be expected. The interspecific component exceeds 5% only in one case (Idh-B). The only locus deviating from this general picture is Gludh-A. Almost half of the variation at this locus can be attributed to the variation between species, and there are considerable differences among regions as well, mainly in Q. petraea.

Multilocus patterns of the genetic differentiation were expressed by genetic distances. Even when the Gludh-A locus is excluded, both oak species are quite clearly differentiated (Fig. 2a). There are some outliers, but the point fields representing both species are separated. When Gludh-A is taken into account, the oak species are separated even more clearly (Fig. 2b). However, in both cases, geographical trends are difficult to distinguish. Point fields of individual regions overlap considerably, mainly in Q. robur.

Figure 2
figure 2

(a) Scattergram of the principal coordinate analysis of Nei’s genetic distances between oak populations; locus Gludh-A excluded. (b) Scattergram of the principal coordinate analysis of Nei’s genetic distances between oak populations; locus Gludh-A included.

Discussion

The levels of genetic multiplicity found in our study roughly correspond to those reported by previous studies, which range from 2.5 to 3.2 for comparable sample sizes and comparable sets of isozyme loci (Müller-Starck & Ziehe, 1991; Zanetto et al., 1994). On the other hand, diversity values are generally lower by 15–20% in respect of the effective number of alleles and more than 50% in respect of the expected heterozygosity. However, 10–20 loci is not a representative random sample of the genome, so comparing average values of diversity characteristics based on different sets of markers is not very meaningful.

In spite of introgression occurring in mixed stands and the subsequent occurrence of intermediate forms, both species are morphologically and ecologically distinct (Aas, 1993). Taxonomists, as well as geneticists, dealing with European oaks almost all agree that Q. robur and Q. petraea should be considered separate species. On the other hand, the interspecific differentiation found within the Q. robur/Q. petraea complex using nuclear as well as cytoplasmic markers is weak. Genetic distances between populations of different species are generally almost twice as high as within species (Müller-Starck et al., 1993; Zanetto et al., 1994), but a greater difference between intraspecific and interspecific genetic distances could be expected.

For most European tree species, the differentiation patterns are closely associated with the history of the postglacial recolonization. Three main refugia of oaks are supposed to have existed during the last (Würm/Weichsel) glaciation; these were located on the Iberian, Apennine, and Balkan peninsulas. Dumolin-Lapègue et al. (1997, support the hypothetical existence of a fourth oak refugium in the Caucasus. Russian authors even consider the possibility of oak refugia in the hills and mountains east of the Volga river (Obshchij Syrt, Southern Urals) (Denisov, 1980). Among the populations investigated within this study, only the origin of the Bulgarian ones is unambiguous, since they are situated near the putative Balkan refugium. Dumolin-Lapègue et al. (1997, consider the immigration of oak to Russia both from the Balkans and the controversial Caucasus refugium possible. A similar situation applies to Slovak populations; haplotypes of both Apennine and Balkan origins were found in this region. Despite a large geographical gap between Russian populations and the rest, genetic distances within the pedunculate oak population set were very small and did not exhibit any geographical pattern. In sessile oak, genetic distances within both Slovak and Bulgarian regions were small and comparable to distances between regions (except for Gludh-A). Such a lack of geographical patterning would support the hypothesis of a common origin from one refugium, i.e. results confirm neither the assumption of the immigration of Russian pedunculate oak from the Caucasus nor its origin from local refugia. However, the information value of isozyme markers in this respect is much lower than that of neutral maternally inherited cpDNA markers (Petit et al., 1997).

Among the loci we studied, only two exhibited high levels of interspecific differentiation: Idh-B and Gludh-A. The first locus exhibits consistent significant differences in allelic frequencies between species over the whole range (Schroeder, 1989; Müller-Starck et al., 1993; Zanetto et al., 1994). Such differences between oak species are rather the rule than the exception, although they are very rarely consistent over different geographical regions. A phosphoglucomutase locus seems to differentiate both species also. The difference in the frequencies of the most abundant allele varies between 0.2 and 0.5 (Herzog, 1996; Zanetto et al., 1994). Unfortunately, for Gludh-A, such information is not available. Slovakia is the sole region where this locus was analysed in both oak species. However, the magnitude of differentiation and a relative consistency of allelic profiles within species suggest that the observed differentiation at this locus is characteristic for sessile and pedunculate oaks. The coefficient of interspecific genetic differentiation, GQT=47%, at Gludh-A is very much higher than any value reported for this pair of species. Nevertheless, no species-specific alleles were found. Further analyses in the other parts of the oak range would be necessary before using this as a diagnostic locus.

Table 4 Gene diversity components in the Q. robur/ Q. petraea complex