Introduction

The amount and distribution of genetic variation among populations of a forest tree species is usually associated with ecological and geographical variation across its natural distribution. Genetic processes during the recent history of the species, such as genetic drift and gene flow, often contribute to current patterns of genetic variation. Nothofagus nervosa (Phil.) Dimitri & Milano (Lennon et al., 1987) (=N. alpina), one of the most important forest tree species of the southern South American temperate forest, covers, in the Andes mountains of Argentina, a small distributional area (55 000 ha) restricted to the valleys of the west–east glacial lake watersheds. Natural hybridization between N. nervosa and N. obliqua has been described based on morphological traits (Donoso et al., 1990; Gallo, 1995). The hybridization was confirmed using species-specific isozyme markers, and N. nervosa was found to act mainly as the female and N. obliqua as the pollen donor (Gallo et al., 1997a,b). This introgression process may make an important contribution to the genetic variation of N. nervosa, perhaps greater than other factors.

The distribution pattern of N. nervosa and the west–east direction of strong winds, suggest that gene flow will be limited to within lake watersheds or the nearest connected ones. Thus, genetic variation may be associated with latitude and, particularly, with different lake watersheds. Chloroplast DNA markers showed a clear geographical variation between two groups of N. nervosa populations which were distributed south and north of a mountain chain (Marchelli et al., 1998).

Isozymic variation patterns related to geographical distribution have been described among the Fagaceae and the Nothofagaceae (Comps et al., 1990; Müller-Starck & Ziehe, 1991; Haase, 1993; Zanetto & Kremer, 1995; Premoli, 1997). In N. nervosa, isozymes were used to analyse Chilean populations (Carrasco & Eaton, 1997), whereas genetic studies of the species in Argentina have dealt mainly with the determination of isozyme gene markers (Godoy, 1994; Marchelli & Gallo, 2000). As in most other forest tree species, confirmed gene markers (i.e. through the genetic analysis of band segregation) have not been used to date for population genetic studies in N. nervosa. Additionally, isozyme genetic diversity and differentiation are usually estimated using Nei’s (1972, 1973) and Wright’s parameters (Wright, 1978), although strong arguments have been made against these (e.g. Gregorius & Roberds, 1986).

In the present study, genetic variation among and within 11 populations of N. nervosa, distributed along its entire natural Argentinean range was analysed with Gregorius’ parameters using eight isozyme gene markers. The distribution among different lake watersheds and the genetic variation supplied by the natural hybridization were especially taken into account.

Materials and methods

Sampled populations

Seeds from 11 populations of N. nervosa distributed over the entire natural Argentinean range were collected by placing nets below the canopy, ≈1.5 m above the ground (Fig. 1). Collections were made during the seed-fall season (February-April) between 1994 and 1997. The nets were distributed within the stand (2–3 ha each) in order to capture seeds from a minimum of 40 trees in each population. Seeds were kept in plastic tubes at 4°C until needed. A minimum of 100 seeds per population was analysed. Seeds of the ‘full’ mast year 1997 offers a better representation of all genotypes due to the high seed production (Marchelli & Gallo, 1999), thus seeds from this year were used whenever possible (except populations 5, 10, 17 and 19).

Fig. 1
figure 1

Geographical location of the 11 Nothofagus nervosa populations analysed. Scale 1: 900 000.

Electrophoretic analyses

Protein extraction was performed from embryo tissue using Cheliak & Pitel’s (1984) vegetative extraction buffer I with slight modifications and electrophoresis was carried out on the day following protein extraction. Electrophoresis conditions, as well as gene marker determinations, are described in Marchelli & Gallo (2000). Five enzyme systems encoding eight polymorphic loci were scored: Mdh-B, Mdh-C, Idh-A, Adh-A, Got-A, Got-B, Got-C and Pgi-B.

Measurement of genetic variation

Three among-population analyses were performed: (1) the first one corresponded to the 11 populations taken separately; (2) for a second analysis, populations were grouped according to the results of a previous chloroplast haplotype analysis (Marchelli et al., 1998); and (3) a third analysis was carried out comparing the populations grouped according to five geographically separated lake watersheds.

Genetic variation within populations was analysed as the mean number of alleles per locus (AL), the percentage of polymorphic loci (P), observed heterozygosity (Ho) and genetic diversity (ν; Gregorius, 1978), which corresponds to the effective number of alleles. P refers to the total number of polymorphic loci found over all analysed populations. Differences in allele frequencies among populations were statistically tested by means of a Chi-squared goodness of fit test (α=0.05). Allelic genetic distance between populations was estimated using Gregorius’ (1974) genetic distance, although for comparison purposes with other studies Nei’s (1972) genetic distance was also calculated. Differences in the genetic distances were tested (α=0.05) using the program GDA_NT (Genetic Data Analysis and Numerical Tests) with 500 permutations (Degen, unpublished). The amount of genetic differentiation for a given population (Dj), measured as the distance of each population from its complement (remaining pooled populations), and the mean level of differentiation among populations (δ), were estimated for each locus according to Gregorius & Roberds (1986). Population differentiation was measured both with allelic and genotypic data (δge and δgo, respectively); for comparison purposes, FST values (Wright, 1978) were also calculated. Dendrograms were constructed according to the UPGMA method using the SAS statistics program (SAS, 1989–96). Two computational programs were employed for the calculation of the different genetic parameters: GSED (Genetic Structures from Electrophoresis Data, Gillet, 1994) and POPGENE (Yeh & Boyle, 1997).

Results

Genetic diversity

Three of the analysed loci were almost monomorphic (Mdh-C, Adh-A and Pgi-B; Table 1). Mdh-C had one allele fixed in seven of the 11 populations, and the other allele was found only in the heterozygous condition. One allele was fixed at loci Adh-A and Pgi-B in eight and seven populations, respectively. These two loci were found to have species-specific alleles, since one is fixed in N. nervosa and the other is fixed in N. obliqua, with hybrids between these two species having both alleles (Gallo et al., 1997a,b, 2000). Heterozygous seeds for both or only one of these loci were observed among the analysed populations. The frequency of hybrid seeds in population 10 from the Lacar Lake watershed was the greatest if both loci were involved (3.5% for Adh-A and 4.4% for Pgi-B; Table 1). Two seeds from 200 analysed from the Huechulafquen-Currhue watershed were heterozygous for Pgi-B (population 21), although N. obliqua does not occur at this site. These two seeds have the nervosa-type Adh-A pattern (homozygous for allele 2). Similarly, one seed of the 1207 analysed from population 27 (including the trees analysed for the genetic control) was heterozygous for Pgi-B and homozygous for Adh-A. In addition, heterozygous seeds for Adh-A were found to be homozygous for Pgi-B (nervosa-type) in population 17 (Lolog Lake watershed) at a relatively high frequency (16.7%). Among the other five analysed loci, Mdh-B was the most polymorphic. The mean observed heterozygosity for the species was 17.3%.

Table 1 Genic diversity (ν), observed heterozygosity (Ho), number of seeds analysed in each population (n), mean number of alleles per locus (AL) and percentage of polymorphic loci (P) for eight loci in 11 Nothofagus nervosa populations

The mean number of alleles per locus (AL) for all populations was 2.3, whilst genic diversity (ν, effective number of alleles) ranged between 1.177 and 1.362 (Table 1).

Genetic differentiation

Table 2 shows Gregorius’ and Nei’s genetic distances for the gene pool and for Mdh-B (the locus with the highest genetic differentiation). Most pairwise Gregorius’ genetic distances were significantly different (α=0.05) (Table 2). Gregorius’ distances were much higher than Nei’s distances (Table 2). One of the criticisms that Gregorius (1974) made of Nei’s genetic distance is that in some cases the triangular inequality (d(x,z) + d(z,y)d(x,y)) does not hold. Several groups of populations were found where this condition was not fulfilled, e.g. populations 5, 6 and 10 and populations 6, 28 and 30 (Table 2). The greatest genetic distances were observed between populations 17 and 30 (0.113) and between populations 17 and 19 (0.093); the last two from the same lake watershed.

Table 2 Genetic distances among the 11 Nothofagus nervosa populations for the locus with the higher differentiation (Mdh-B) (in the upper right diagonal half) and for the gene pool (in the lower diagonal half) according to Gregorius (1974). Nei’s genetic distances (1972) are given in parentheses for comparison

Differences among population differentiation were shown by the Dj values in the gene pool; population 17 was the most differentiated (Dj=0.073) and population 21 was the least differentiated (Dj=0.029; Table 3). The mean allelic differentiation (δge) for the gene pool was 0.046, with Mdh-B being the most differentiated locus (δge=0.099), indicating the mean proportion of effective numbers of genes by which populations differ from their complements (remaining populations) (Gregorius & Roberds, 1986).

Table 3 Genetic differentiation (Dj and δ) at the genic and genotypic (in parentheses) level in 11 populations of Nothofagus nervosa. Wright’s FST values are given for comparison

Similar values were obtained for δge and FST in the gene pool although values for the different loci were highly variable. In general δge was much greater than FST, except for the almost monomorphic loci (Mdh-C, Adh-A and Pgi-B), where the latter was higher. Genetic differentiation at the genotypic level (δgo) was considerably greater than at the genic level; in some loci values for δgo were more than twice those obtained for δge. Graphic representations of the Dj and δ values (the ‘differentiation snails’) proposed by Gregorius & Roberds (1986) are shown in Fig. 2.

Fig. 2
figure 2

“Differentiation snails” (according to Gregorius & Roberds, 1986). (a) Genetic differentiation among populations of Nothofagus nervosa for the five analysed loci and the gene pool. The radius of the dotted circle in each graph represents the average level of allelic differentiation (δ) for that particular locus. Each triangle represents a population, the radius indicates the level of genetic differentiation (Dj) and the angle denotes sample size. (b) Amplification of the graphs for the three less differentiated loci (Mdh-C: 17 times; Adh: 3 times and Pgi: 9 times).

Genetic variation among regions and watersheds

The analysis of the allelic structure between the two groups of populations bearing different cpDNA haplotypes found by Marchelli et al. (1998), showed significant differences in allele frequencies at five of the eight analysed loci (Mdh-B, Idh-A, Adh-A, Got-B and Got-C; α=0.05). All estimated genetic parameters were higher in the southern compared to the northern region (Table 4).

Table 4 Genetic variation in Nothofagus nervosa in two regions separated by chloroplast DNA (Marchelli et al., 1998) and five different watersheds: 1 (Hermoso Lake); 2 (Lacar Lake); 3 (Lolog Lake) 4 (Huechulafquen–Currhue Lakes); 5 (Tromen–Quillén Lakes). FST values are given in parentheses

Depending on the watershed analysed, homogeneity in allele frequencies between populations within watersheds was found for three (Lacar), four (Lolog and Huechulafquen) or five (Tromen–Quillén) loci. Only Mdh-C and Got-A were homogeneous between populations in all watersheds. Moreover, population differentiation within watersheds (δge) was high in some cases (e.g. Lolog Lake watershed; Table 4). In spite of this partial heterogeneity, populations within watersheds were pooled for all loci in order to analyse the latitudinal variation. Significant differences in the allelic structure of all but one locus (Mdh-C) were found among watersheds (α=0.05). Genetic distances between all pairs of watersheds were also significantly different. The northernmost watershed (Tromen–Quillén) was completely separated from the rest according to the gene pool genetic distance, coinciding with cpDNA haplotypes (Fig. 3).

Fig. 3
figure 3

Clustering of the five lake watersheds according to the UPGMA method using Gregorius’ genetic distances.

Discussion

Most of the genetic variation observed in N. nervosa was found within, rather than among populations. Mean observed heterozygosity for the species was 17.3%, which is slightly lower than values observed in other Fagaceae (e.g. Müller-Starck & Ziehe, 1991; Zanetto & Kremer, 1995). On the other hand, observed heterozygosity in N. nervosa is much higher than values obtained in other South American Nothofagus species [e.g. N. dombeyi (8.2%), N. betuloides (9.8%) and N. nitida (3.6%) (Premoli, 1997)]. The same holds for values observed in New Zealand Nothofagus where they ranged from 2.5 to 10.5% in five species (Haase, 1993). However, it should be noted that in the last two works, both monomorphic and polymorphic loci were included in the analysis, most of which were monomorphic. In this study, monomorphic loci were excluded since they could not be recognized as gene markers in the segregation analysis (Marchelli & Gallo, 2000); only polymorphic loci allow genetic analysis and therefore the identification of gene markers.

As regards the other measures of intrapopulation variation, genic diversity (ν) for the gene pool ranged between 1.177 and 1.362. These values are slightly lower than values observed in Quercus robur and Q. petraea (ν=1.4 in both species; Müller-Starck et al., 1993) and in Fagus sylvatica (ν=1.55; Gregorius et al., 1986). Relatively small genetic diversities denote minor polymorphisms with one allele at high frequency and a few rare alleles (Müller-Starck et al., 1993). The mean number of alleles per locus was 2.25; a value that is similar to those obtained in other species (Müller-Starck et al., 1992). The difference between AL and ν accounts for the presence of alleles at low frequencies.

In most of the literature, Wright’s FST (or Nei’s GST) is used as a measure of genetic differentiation. However, Gregorius & Roberds (1986) have highlighted some conceptual weaknesses of this measure; FST equals 1 only if all populations are genetically fixed, including fixation for the same allele, which contradicts the concept of complete differentiation. In this sense, our data show that FST reaches the highest value for Adh-A (FST=6%, Table 3), which is fixed for the same allele in eight out of the 11 populations (Table 1), thus supporting our choice of the mean level of differentiation among populations (δ; Gregorius & Roberds, 1986) as an estimator of genetic differentiation.

Little population differentiation was observed among the 11 populations studied (δge=4.6%; FST=3.8%), coinciding with the values of δge obtained in Fagus sylvatica and Quercus robur (4.5 and 5.5%, respectively) but lower than that observed in Q. petraea (8.5%; Müller-Starck & Ziehe, 1991). However, this differentiation value could be considered as relatively high, given the very small natural distribution area of N. nervosa analysed in this study. The most differentiated locus was Mdh-B with differences in almost 10% of the effective number of genes by which populations differ from their complements. The general mean was low due to the low differentiation of Mdh-C and the two species-specific loci (Adh-A and Pgi-B). Differences in the mean level of differentiation were observed when comparing genic and genotypic data. Values for mean genotypic differentiation (δgo) were almost twice those found for mean genic differentiation (δge), indicating that organization of gametes into genotypes implies a higher differentiation. Gregorius & Roberds (1986) obtained similar results in Pinus ponderosa (δge=0.061 and δgo=0.113).

The results are in agreement with the distribution of the genetic variation observed in forest tree species, since most FST values were less than 0.100, indicating low genetic differentiation between populations (Comps et al., 1991). Great intrapopulation variation and little population differentiation were observed in beech and oaks (Müller-Starck & Ziehe, 1991), and in South American and New Zealand Nothofagus species (Haase, 1993; Premoli, 1997).

Even though genetic distances among watersheds were rather small (between 0.023 and 0.072), they were significantly different (α=0.05) (Fig. 3). Clustering of the watersheds in the dendrogram showed a greater similarity between Lacar and Huechulafquen-Currhue watersheds than between either of these and the Lolog watershed, which is geographically situated between them (Fig. 1). Analysing the whole distribution, a latitudinal separation can be seen, especially between the northernmost watershed and those situated in the south, since the Tromen–Quillén watershed was clearly separated from the rest (Fig. 3). The highest genetic distance was found between this watershed and the southernmost. This agrees with results obtained with chloroplast DNA markers and supports the idea that there were at least two refugia for N. nervosa during the last glaciation (Marchelli et al., 1998). It could be suggested that the main wind direction during pollination time along the valleys of the watersheds could geographically structure the genetic variation in gene flow corridors along the different lake watersheds. According to the cpDNA results, population 30 (from Quillén Lake) was grouped with populations 27 and 28 (from Tromen Lake). However, in the present allozyme analysis, it showed a higher differentiation level than them, suggesting that probably the Quillén watershed could belong to a different gene flow corridor.

Curiously, some populations belonging to the same watershed were genetically very different, e.g. populations 17 and 19 from the Lolog watershed (d=0.093), whilst others were relatively similar (populations 27 and 28 from Tromen Lake, d=0.013) (Fig. 2). Homogeneity in allele frequencies among populations within watersheds could only be found at some of the loci, showing the occurrence of differences within watersheds. The differentiation among populations within the Lolog watershed suggests the possibility of another refugium, perhaps to the east, from which population 17 could have originated. Analyses of more populations within watersheds would allow a better understanding of patterns of genetic variation in N. nervosa.

It is worth noting that the hybridization between N. nervosa and N. obliqua introduces an important source of genetic variation. When the two species-specific loci (Adh-A and Pgi-B), fixed in many populations, are not considered, all estimated parameters were increased (Ho=23%; ν ranged between 1.25 and 1.49; δge=5.7%). On the other hand, if only these loci (Adh-A and Pgi-B) are considered, which means the variation inherent to N. obliqua, clustering could be different due to the presence of hybrids in populations from the Lacar and Lolog watersheds. If this is done, the Lacar and Lolog watersheds are clearly distinct from the rest (data not shown). The high proportion of hybrid seeds for Adh-A in population 17 is the main contribution for the great distance observed between this population and population 19, and accounts for the high level of differentiation observed between these two populations (δ=0.093). The low frequency of hybrids in population 19 (only one seed among 106 analysed for Adh-A), is strongly related to the distribution of N. obliqua, since pollen could reach population 17 from a small mixed population situated on the other margin of the lake. Gene flow from population 17 to 19 is restricted due to a main west–east wind direction. It is important to point out here that the analysed seeds were collected during the same season, since it is known that frequency of hybridization not only varies in space, but also in time (Gallo et al., 1997b).

Among the analysed populations, the occurrence of hybrid seeds for one of the species-specific loci, but not for the other, was observed, indicating the independence of these two loci and, furthermore, suggesting the presence of adult hybrids that must be fertile. It is worth noting that these hybrid seeds, heterozygous for one locus, showed N. nervosa homozygous type for the other one. The high proportion of hybrid seeds observed for Adh-A in the Lolog watershed (population 17) together with the lack of hybrids for Pgi-B, contrasts with the frequency of hybridization observed in the Lacar lake watershed, where both species are in sympatry. This could indicate, among others, a different evolutionary stage in the hybridization (or introgression) process in each watershed. The Lolog lake watershed could be in a more advanced evolutionary stage than the Lacar lake watershed, since a great proportion of backcrosses should be assumed. The proportion of hybrid adult trees in natural populations is low compared to that observed among offspring of individual trees in progeny tests. Post-zygotic incompatibility is probably taking place through a diminished adaptive capacity of the hybrids. This selection helps to maintain the separation of the two species’ gene pools, at least to a certain degree (Gallo et al., 2000). It would be of great interest to conduct an exhaustive study of Lolog watershed to investigate gene flow distances and hybridization frequencies.

It should be emphasized that introgressive hybridization clearly sharpens the latitudinal differentiation caused by the unidirectional gene flow that takes place along each watershed. The different capacity of both species to overcome high altitude passes during the postglaciation colonization (due to their different thermal responses) has probably affected their current distribution along the different lake watersheds. There are watersheds with both species in sympatry (e.g. Lacar, Quillén) and others with the occurrence of only ‘pure’ (at least phenotypically) species (e.g. Hermoso, Lolog, Huechulafquen). Nevertheless, in the latter group, pollination with N. obliqua pollen coming from populations on the other side of the Andes Mountains cannot be excluded. If this is possible the concept of the genetic population could be extended to a larger regional level and therefore the ‘sympatry area’ of both species should be also increased. Hybrids between N. nervosa and N. obliqua should be taken into account in conservation and management of natural populations, especially of the former species, since it introduces another component into its genetic variation. Silvicultural management of the one-to-one mixed forest could affect the proportion of hybrid seeds, increasing the number of not so well adapted hybrid individuals with detrimental consequences for natural regeneration.