Introduction

Fossil records suggest that Taxodium was widely distributed throughout North America, Europe, and East Asia from the Cretaceous to the Pleistocene (Small, 1931; Florin, 1963). As the Pleistocene seas receded, Taxodium settled into its present distribution in the south-eastern United States and Mexico. Two taxa, bald cypress and pond cypress, occur in the south-eastern United States. The ranges of these taxa overlap in the south-eastern Coastal Plain, with the range of the latter being more restricted.

The taxonomic status of Taxodium has been debated since the early 19th century (Watson, 1985). Britton (1926), Dallimore & Jackson (1966) and Rehder (1940) recognized three species: T. distichum (L.) L. C. Rich. (bald cypress), T. ascendens Brongn. (pond cypress) and T. mucronatum Ten. (Montezuma bald cypress). Godfrey (1988) concurred with this taxonomic treatment of pond cypress and bald cypress, suggesting temporal differences in phenology.

Watson (1983), (1985) evaluated morphological, anatomical, biochemical and cytological aspects of pond cypress and bald cypress, and concluded that the differences between them for the characteristics he considered were minor. He suggested that pond cypress be considered as a variety of bald cypress (T. distichum var. imbricatum (Nutt.) Croom). He supported the hypothesis that pond cypress diverged from bald cypress, via isolated populations of bald cypress throughout the Coastal Plain during the Wisconsin Continental Glaciation. At this time, the genus was displaced southward, in conjunction with directional selection pressure exerted by habitats with reduced nutrient status and increased frequency or intensity of drought (Watson, 1983). Neufeld (1986) reported the ecophysiological implications of tree architecture for the two taxa, concluding that the morphology exhibited by pond cypress includes evolutionary adaptations resulting from climatically induced habitat changes. Miller et al. (1993) also found differences in interstitial soil pH between pond cypress (≈4.5) and bald cypress (≈6–7) populations. In the same study, Miller et al. repeated bioflavonoid analysis of leafy branchlet tissue from one of the populations Watson (1983) had evaluated, and concluded that these chemical ‘fingerprints’ can be altered by the condition, or health of the tree. Therefore, biflavonoids appear to have limited use as evidence of taxonomic status.

In molecular phylogenetic studies, no differences were apparent between the two taxa using PCR-RFLP analysis of six genes of chloroplast DNA (Tsumura et al., 1995). Lickey (1996) also failed to find significant differences between the two taxa after allozyme analysis of young shoot tissue from trees across their geographical range.

Sequence-Tagged Site (STS) primers have been used to generate PCR-based markers in many plant species, including Populus (Bradshaw et al., 1994), Cuphea (Slabaugh et al., 1997), Pinus taeda (Harry et al., 1998) and Picea mariana (Perry & Bousquet, 1998). STS-based cleaved amplified polymorphic sequences (CAPS) (Konieczny & Ausubel, 1993; Drenkard et al., 1997), are more reliable and convenient for genomic mapping and population genetics. The cDNA clones represent expressed genes and are extremely useful for establishing anchor points in genomic mapping and other genetic studies. Recently, we have developed STS markers in Cryptomeria japonica, a species closely related to Taxodium (Brunsfeld et al., 1994; Tsumura et al., 1995). The STS markers were derived from cDNA sequences of C. japonica (Tsumura et al., 1997) and half have been mapped in linkage groups (Mukai et al., 1995). The sequences of PCR primers for STSs therefore are especially well conserved in closely related genera, such as Taxodium and Cryptomeria (Brunsfeld et al., 1994; Tsumura et al., 1995). Tsumura et al. (1997) reported that about half of the STS markers from C. japonica could be amplified by PCR in Taxodium as single fragments.

We have applied the CAPS markers of C. japonica to Taxodium populations to investigate the genetic variation within and between populations, and between the two taxa in question. Using these results, we have evaluated the efficiency of CAPS markers, and the genetic relationship and taxonomic status of the two cypresses using these conserved markers.

Materials and methods

Plant materials

In early September 1994, fresh leafy branchlets were collected from 122 trees representing seven populations of pond cypress, and 130 trees representing six populations of bald cypress, in Georgia and Florida (Fig. 1). Samples were collected from an average of 19.55 trees in each population, with considerable distance between sampled trees to minimize sampling half-sib individuals. Ages of sampled trees were not determined but sizes of sampled trees tended to be similar within each population. All collection locations contained populations of pond cypress and bald cypress in sufficiently close proximity for potential cross-pollination except possibly Highlands Hammock, Tates Hell and Tocoi. Collected plant tissue was stored at −30°C prior to DNA extraction.

Fig. 1
figure 1

Natural distribution of Taxodium in the U.S.A. (Wilhite & Toliver, 1990) and sampling sites. P and B indicate pond cypress and bald cypress populations, respectively. The dashed line indicates the range of pond cypress. The following abbreviations were used for each population: Fargo, FAR; Tocoi, TCI; pond cypress populations in Tosohatchee, TP1 and TP2; bald cypress in Tosohatchee, TB1 and TB2; Loxahatchee, LXH; pond cypress populations in Starkey, SD1 and SD2; Tates Hell, THL; pond cypress in St. Mark, SMP, bald cypress in St. Mark, SMB; and Highland Hammock, HHM.

Laboratory analysis

Total DNA was extracted from each sample using the modified CTAB method of Murray & Thompson (1980). CAPS markers based on sequences of cDNA clones of C. japonica (Tsumura et al., 1997) were used to evaluate the genetic diversity and differentiation of pond cypress and bald cypress. Eighty STS markers of C. japonica were evaluated for use in this study. Basic PCR amplification conditions were as follows: reaction mixtures (100 μL) contained 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 0.1 mM dNTP, 100 pmol of each primer, 50 ng of template DNA, and 2.5 units of Taq polymerase. PCR amplification was carried out for 5 min at 94°C, followed by 36–45 cycles of 40 s at 94°C, 40 s at 58 or 60°C, and 80 s at 72°C, with a final 5-min incubation at 72°C using a PC700 model from Astech Co. Ltd, or an MJ Research Programmable Thermal Controller PTC100. In the survey for polymorphic STS markers in Taxodium, we used four individuals of each Taxodium variety.

Each PCR product was digested with eight four-base-cutter restriction enzymes (AluI, HaeIII, HhaI, MspI, NdeII HinfI, TaqI and RsaI) and electrophoresed in either 2% agarose gels or 7.5% polyacrylamide gels. Ten suitable polymorphic STS markers were identified out of the total of 80 primer gels tested in Taxodium (Table 1).

Table 1 PCR-based codominant markers developed from cDNA clones of Cryptomeria japonica used to assess genetic diversity in Taxodium

Statistical analysis

To estimate within-population variation, we used the following measures calculated from the allele frequencies of all loci analysed: the proportion of polymorphic loci (Pl) at the 95% criterion; the average number of alleles per locus (Na), and the averages of expected heterozygosities (He), where unbiased values for He were also estimated (Nei, 1978).

The fixation indices, FIS (= 1 − Ho/He), FIT and FST, for polymorphic loci and over all loci, were determined (Nei & Chesser, 1983) for comparison of the observed genotype frequencies to the expectations of Hardy–Weinberg equilibrium. Deviations from such expectations were analysed using the chi-squared test (Workman & Niswander, 1970).

Between-population variation was estimated in terms of the following statistics. Nei’s gene diversity statistics (Nei, 1973) were estimated as follows: HT=HP + DPV + DVT, HV=HP + DPV and GPV(T)=DPV/HT, GVT= DVT/HT, where HT is the total population gene diversity, HP and HV are the average gene diversities within populations and variety, respectively, and DVT and DPV are the average gene diversity between variety and between populations within variety, respectively, and GPV(T) and GVT are the relative extent of gene differentiation among populations within variety and between taxa, respectively. These analyses were carried out using the GENESTRUT program (Constantine et al., 1994). To estimate the amount of gene flow between populations, the number of migrants exchanged per generation, Nm, also was calculated indirectly from values of GPV(T) (or GVT) at each locus and the values over all loci by application of Wright’s infinite island formula (Wright, 1931): Nm=(1 − GPV(T) or (GVT)/4GPV(T) or GVT, where N is the effective population size and m is the proportion of migrants exchanged per generation. Finally, Nei’s unbiased genetic distances were calculated for all population pairs (Nei, 1972, 1978). The dendrogram was constructed based on the genetic distance matrix using UPGMA (Sokal & Michener, 1958) and neighbour-joining (NJ) methods (Saito & Nei, 1987). Bootstrap analysis was carried out using the DISPAN computer program (Ota, 1993).

Results

STS-based codominant CAPS markers

In Taxodium, single PCR products were obtained for 40 of the Cryptomeria STSs. These were cleaved with eight different restriction endonucleases. Ten of these 40 markers showed polymorphic restriction patterns, and all patterns appeared to be codominant (Fig. 2). Therefore, these 10 markers were used to evaluate the genetic diversity of Taxodium (Table 1).

Fig. 2
figure 2

Restriction patterns of CAPS markers in Taxodium. (a) CD618/TaqI; (b) CD1894/HhaI; (c) CD620/RsaI; (d) CD1613/HaeIII. Both marginal lanes in each panel show DNA size markers of X174/HincII, and the other lanes indicate the investigated individuals.

Genetic diversity of Taxodium

Genetic diversity of Taxodium was investigated using only polymorphic CAPS markers. Therefore, the average unbiased heterozygosities were relatively high, ranging from 0.325 to 0.432 (Table 3). The average heterozygosities of pond cypress and bald cypress were 0.380 and 0.395, respectively. The average number of alleles per locus was 2.00 for most populations. The CD1237 and CD1179 loci in the Tosohatchee P2 (TP2) population were monomorphic, as was the CD1179 locus of the Loxahatchee (LXH) population. Consequently, the average heterozygosities of these two loci were lower than those of the other eight (Table 2). A clear difference for the average allele frequencies of CD618 between the two cypress taxa was noted. The average frequencies of the a allele at the CD618 locus in pond cypress and bald cypress were 0.875 and 0.390, respectively (Fig. 3). The b allele frequency of CD1179 in the Fargo population (0.940) was greater than for the remaining populations (0.000–0.196).

Table 3 The average heterozygosity (He), the average number of alleles per locus (Na), the proportion of polymorphic loci (Pl, 95% criterion), and the unbiased heterozygosity (unbiased He) in 13 populations of Taxodium
Table 2 Allele frequency at 10 loci in 13 populations of Taxodium
Fig. 3
figure 3

Allele frequencies of 13 populations of Taxodium at the CD618 and CD1179 loci. Open and solid portions of bars show a and b alleles, respectively.

The gene diversity within populations (HP) varied for each locus, ranging from 0.129 for CD1237 to 0.500 for CD1514. The gene diversity in the total populations (HT) also ranged from 0.130 to 0.500 (Table 5). Half of the fixation indices (FIS) showed statistically significant values, at the 5% probability level for CD618 and CD1179 and at the 1% level for CD1514, CD620 and CD402 (Table 4). The chi-squared value of CD1514 had particularly high significance. The FST values showed statistical significance at seven loci, and the value of FST over all loci also was statistically significant.

Table 5 Gene diversity within and between populations and varieties in Taxodium
Table 4 F-statistics for 10 loci in Taxodium

Genetic variation within populations, between populations within taxa, and between taxa was 0.919, 0.049 and 0.032, respectively (Table 5). The numbers of migrants exchanged per generation among populations within and between taxa (Nm) were 4.85 and 7.56, respectively. The unbiased genetic distances between populations ranged from 0.000 to 0.166 and their average was 0.042. There was no clear geographical tendency in the allele frequencies or heterozygosities among populations. The dendrogram using the UPGMA method showed little differentiation between the two taxa of Taxodium (Fig. 4), and the NJ tree showed a very similar dendrogram to the UPGMA tree.

Fig. 4
figure 4

Dendrogram based on genetic distance using UPGMA of 13 populations of two cypresses in Taxodium. Numbers in italics within the phylogenetic tree represent bootstrap values based on 1000 replicates.

Discussion

We have applied CAPS markers from cDNA sequences of C. japonica to investigate the genetic diversity of natural populations of C. japonica and concluded that there is no large difference in results between allozyme and CAPS studies (Tsumura & Tomaru, 1999). Because Taxodium and Cryptomeria are closely related genera, we believe that the CAPS markers we have used are as appropriate for investigating the genetic diversity of Taxodium as allozyme studies.

The polymorphic DNA markers, derived from STS sequences of cDNA from C. japonica, have been used for other coniferous species (Tsumura et al., 1997). When screening for suitable markers in Taxodium, dominant markers and those considered to encode multicopy genes were excluded. The STS markers were presumed to correspond to expressed genes because they were derived from sequences of cDNA clones of C. japonica. These regions were presumed to correspond to genes in Taxodium because cDNA sequences between the closely related taxa (e.g. Taxodium and Cryptomeria) generally are conserved (Brunsfeld et al., 1994; Tsumura et al., 1995). The inheritance of these markers was not investigated in Taxodium because plant material from crosses was not available. However, inheritance of these markers in C. japonica provided the basis for use of these genes as putative loci.

The average unbiased heterozygosities of pond cypress and bald cypress ranged from 0.325 to 0.432, whereas the average heterozygosities were 0.380 and 0.395, respectively. Hamrick & Godt (1989) summarized levels of allozyme variation at the species and population levels for species with different attributes. This allows us to compare the data between isozyme and CAPS marker studies because Hamrick & Godt (1989) also used only polymorphic loci when they calculated their values (HT and HS). The average heterozygosity found for CAPS loci was approximately 30% greater than that of the isozymes.

Ten putative CAPS loci were investigated for populations of two Taxodium taxa that are sympatric in the south-eastern Coastal Plain. Based on our data, only one of these loci (CD618) showed clear differences in allele frequencies between the two taxa (Table 2). All pond cypress populations except the population in Fargo, Georgia (FAR) had high frequencies of the a allele. Bald cypress populations tended to have low frequencies of the same allele.

Five loci were not in Hardy–Weinberg equilibrium according to the FIS values. The mean values of FIS indices across populations varied from locus to locus, except for loci CD1514 and CD620. This observation corresponds with the negative mean values of FIS found across these loci in most populations of the two taxa. These small excesses of heterozygotes may be the result of selection but it remains unclear in these data. The genetic distance between the two taxa was only 0.043, except for the Fargo population of pond cypress, which had a value of 0.071 (Fig. 3). However, the genetic differentiation among the remaining pond cypress populations was very small.

At the CD1179 locus, all populations surveyed had high frequencies of the a allele, except the Fargo population, which was almost fixed for the b allele. The pond cypress population in Fargo was the most northern population investigated in this study, and was in close proximity to the Suwannee River. Unfortunately, plant material could not be collected from a sufficient number of trees from the co-occurring bald cypress population for reliable analysis. Therefore, direct comparison of these sympatric populations of bald cypress and pond cypress could not be carried out.

The genetic diversities of pond cypress and bald cypress were similar (Table 3), with the greatest genetic variation contained within populations (91.9%), 4.9% between populations, and 3.2% between taxa (Table 4). Liu et al. (1990) reported the genetic structure of bald cypress using three isozyme loci. According to their data, the genetic variation within populations was 96.4%, whereas variation between streams was only 0.4%, and between populations within streams was 1.5%. However, their study was confined to a single stream system in South Carolina, and the area had been affected by thermal effluent for many years. Consequently, it is difficult to compare the genetic variation between populations in these two studies. In our study, the genetic differentiation between populations within variety (GPV(T)) was 4.9% and between taxa (GVT) it was 3.2%. These values were high compared to those from the isozyme study (Liu et al., 1990). One of the reasons for this might be the large area of our study (Coastal Plain region of Florida and Georgia, ≈500 km long) compared to the single stream system in South Carolina in the allozyme study (about 15 km). Liu et al. (1990) also estimated the number of migrants per generation to have an Nm value of 8.72. In our study, the Nm values within and between taxa were 4.85 and 7.56, respectively. Nm values between isozyme and CAPS studies were similar. Lickey (1996) reported the genetic variation of 22 populations of two taxa in Taxodium. The heterozygosities of the two taxa were approximately the same, He=0.125 in bald cypress and He=0.122 in pond cypress, and the FST over all populations was 0.203. The pond cypress populations made a cluster when the dendrogram based on Nei’s genetic distance was constructed but the differentiation between pond cypress and bald cypress populations was very small.

The results of this study do not suggest that pond cypress is a species distinct from bald cypress. This is consistent with the findings of Watson (1983) and of Lickey (1996). Watson’s research focused on morphological, anatomical, biochemical and cytological characteristics, whereas Lickey evaluated genetic structure of the two taxa using allozymes. Considering the historical background of classification of this species, we have concluded that the two taxa of Taxodium should be classified as varieties, as suggested by Watson (1983) from the analysis of several characters.

The genetic diversity and differentiation in each species is different depending on its evolutionary history and distribution. Therefore, it is not easy to understand the level of genetic diversity and differentiation in each taxonomic level such as family, genus and species, which means there is no criterion to decide on each taxonomy level using genetic data. However, if we could add genetic data to traditional data such as morphological traits, it may be possible to decided on a suitable status in each species for which there is a taxonomic problem.