Introduction

Apomixis, the asexual propagation of offspring via female function, is a reproductive phenotype which has evolved independently in distantly related taxa (see Bell, 1982; Beukeboom & Vrijenhoek, 1998). Many apomictic taxa are polyploid, and two types of argument have been put forward to explain the association of these traits (Bell, 1982). On the one hand, polyploidy may be a requirement for apomixis, whereby expression and/or transmission of ‘apomixis genes’ is facilitated (Suomalainen, 1950; Mogie, 1986; Asker & Jerling, 1992; Savidan, 2000). Furthermore, duplicated genes in allopolyploid taxa may undergo asynchronous and subsequent autonomous (i.e. apomictic) development (Carman, 1997), a process resulting from differentiated (i.e. allopolyploid) genomes rather than from any specific gene(s). On the other hand, polyploidy may not be essential for the establishment of apomictic reproduction; instead, apomixis may be selectively advantageous in the polyploid over the diploid condition (Bell, 1982; Michod, 1995). Hence, diploid apomixis might be possible, although it might represent a short-lived, transitory state (Suomalainen, 1950; Bell, 1982).

Polyploidy in animals is relatively sporadic, with both even/odd-numbered and allo/autopolyploids known, many of which are apomictic (Bell, 1982; Suomalainen, 1950; but see Beukeboom et al. 2000). Polyploidy is more frequent in plants, and is highly correlated with speciation within the angiosperms (Müntzing, 1936; Ramsey & Schemske, 1998). Even-numbered polyploidy predominates in plants, and of these, sexual forms greatly outnumber apomicts (Ramsey & Schemske, 1998). Apomictic plants are almost exclusively polyploid, with tetraploidy being most prevalent (Savidan, 2000).

Arabis holboellii is a biennial or perennial member of the North American (including Greenland) Brassicaceae which can reproduce sexually or through diplosporous apomixis (i.e. meiosis is retained, Böcher, 1951). Among plants, A. holboellii is anomalous as both diploid and triploid (a rare polyploid condition in plants, see Ramsey & Schemske, 1998) apomicts have been described, in addition to facultative apomixis (Böcher, 1951). The basic chromosome number (x) of A. holboellii is 7, and polyploidy (typically 3x), aneuploidy and B chromosomes (Camacho et al. 2000) are common (Böcher, 1951; Böcher, 1954). This karyotype variability is consistent with other characteristics of apomictic A. holboellii, including large numbers of genetically identifiable clones, fixed heterozygosity, and disjunct apomictic populations (Roy & Rieseberg, 1989; Roy & Bierzychudek, 1993; Roy, 1993), which together imply multiple independent origins of apomixis (Roy, 1995).

Independent origin of multiple polyploid lineages could imply that A. holboellii is predisposed to apomictic development (sensu Mogie, 1992), as it is unlikely that triploidy would be transmitted stably to the offspring of sexual individuals with normal meiosis (the result of unbalanced gamete formation: Bell, 1982; Ramsey & Schemske, 1998). This study thus focuses on biogeographical patterns of polyploidy in the A. holboellii complex, which also includes the closely related A. drummondii (Rollins, 1941). A subsequent study will examine inheritance of molecular markers in diploid and polyploid lineages in order to identify reproductive mode. Interspecific hybridization has also been proposed as a possible mechanism leading to the evolution of apomixis (Böcher, 1954), and thus the present study includes related members of Arabis (Rollins, 1941).

Materials and methods

Seeds from single individuals were collected from six Arabis species and Halimolobus perplexa (phylogenetic outgroup, Koch et al., 1999) from multiple sites encompassing 33 geographical areas (Table 1; Fig. 1). Plants from Montana and adjacent parts of Idaho were identified using keys from Dorn (1984). Other identifications were based on Rollins (1941, 1981), and in addition, species identifications were compared to known herbarium specimens. Five to 10 seeds from each collection site were placed on moist filter paper in Petri plates, and vernalized at 5°C for 2–3 weeks. Upon germination, 3–5 seedlings were transferred to pots (11 × 11 × 13 cm) containing sterilized soil and grown in a controlled environmental growth room. Plants were grown in a 12-h L/D cycle under fluorescent lighting (cool white and GrowLux©), with a daily temperature variation from 22° to 28°C.

Table 1 Species and populations analysed (1see Fig. 1)
Fig. 1
figure 1

Geographical distribution of (a) A. holboellii, (b) A. drummondii and (c) Arabis species analysed (numbers refer to Table 1).

Sequence analysis

DNA was isolated from the leaf tissue of single individuals using a Nucleon (Amersham Pharmacia Biotech Europe GmbH) extraction kit. DNA quality and concentration were assessed by restriction digestion and visualization of 5 μL of the product on a 0.7% TAE-agarose gel. Part of the chloroplast tRNA-Leu (trnL) intron was PCR amplified from each individual using the oligonucleotides trnLf (5′ CGAAATCGGTAG ACGCTACG 3′) and trnLr (5′ GGGGATAGAGGGACTTGAAC 3′), and amplification products were assessed on agarose gels. Successful amplifications were purified using a QIAGEN PCR purification kit, and the products were sequenced directly using the ABI PrismTM BigDye Ready Reaction Terminator Cycle Sequencing Kit on an ABI 3700 genetic analyser. Sequences were generated from at least two separate PCR amplifications per individual, and both strands of all fragments were sequenced.

The program SEQMAN 3.58 (DNASTAR Inc.) was used to align the sequence data. The program DNASP 3.0 (Rozas & Rozas, 1999) was used to calculate nucleotide (ð) and haplotype (gene) diversities (Nei, 1987), and Tajima’s D statistic (Tajima, 1989) from sequence data. For haplotype diversity calculation, 2n was replaced by n in Nei’s (1987) eqns 8.4 and 8.12 (Rozas & Rozas, 1999). Phylogenetic analyses were performed using PHYLIP (version 3.5c, Felsenstein, 1989) and TREECON (version 1.3b, van de Peer & de Wachter, 1994).

Ploidy analyses

Ploidy analyses were performed on a PARTEC CCA-II flow cytometer using their High Resolution Type P nuclei extraction and staining kit (PARTEC GmbH). Samples were run in random order. Young leaf tissue from 5–10 individuals per sampling site were combined, and a single preparation and analysis was run on the pooled sample. If multiple peaks were observed in this pooled sample, every individual was reanalysed separately. A leaf sample from single diploid Brassica oleracea was added to each sample to serve as an internal size standard (Arumuganathan & Earle, 1991).

Results

DNA sequence and haplotype analyses

A 466 bp trnL region was sequenced and analysed from 245 individuals representing six Arabis species and Halimolobus perplexa (Tables 1 and 2; Fig. 1). All sequences have been deposited in GenBank (AF312971, AF312950-AF312963, AF312964-AF312970). Fifteen single nucleotide polymorphisms (SNPs), one 6-bp insertion–deletion polymorphism (indel) and a single region (position 207–211) in which five adjacent nucleotides differed between Arabis spp. and H. perplexa were identified, and this translated into 16 different chloroplast haplotypes (Tables 2 and 3). Interestingly, trnL haplotype C was found in A. holboellii, A. drummondii, A. lyallii and A. fendleri, and haplotype I was found in A. holboellii, A. drummondii, A. fecunda and A. beckwithii (Tables 1 and 3).

Table 2 The trnL sequence haplotypes of H. perplexa and Arabis species.
Table 3 Haplotype and ploidy (C) frequencies in Arabis holboellii and A. drummondii (frequencies were calculated within species)

Arabis holboellii and A. drummondii were characterized by 15 different haplotypes. Both species shared six haplotypes (C, D, E, G, I, L; Table 3), all of which occurred in both diploid and triploid individuals. Haplotype F occurred only in triploid A. drummondii, and haplotypes J, M and O occurred only in triploid A. holboellii (Table 3).

Ploidy distribution and karyotype variation

Flow cytometric analysis of six species of Arabis detected comparable genome size values in the diploid (2C ≈ 0.35) and polyploid ranges (3C ≈ 0.55 and 4C ≈ 0.72, Fig. 2). Both diploid and polyploid individuals of A. drummondii (2C, 3C and 4C) and A. holboellii (2C and 3C) were found (Fig. 2). Single samples of A. lyallii and A. fecunda were diploid, and only triploidy was observed in A. fendleri and A. beckwithii (Fig. 2).

Fig. 2
figure 2

Boxplots of ploidy analyses of different Arabis species (black numbers indicate sample sizes, bars with no numbers are based on a single individual, and circles represent outliers). Vertical axis represents index of fluorescence values obtained from flow cytometric analysis.

The fluorescence values for 2C A. holboellii differed significantly from normality (Kolmogorov–Smirnov test for normality=0.096, d.f.=185, P < 0.001), and appeared bimodal (Fig. 3). We therefore assumed a cut-off point of 0.380 fluorescence units between the lower and higher peaks in further statistical analysis. The 2C distribution for A. drummondii did not deviate from normality (nonsignificant Kolmogorov–Smirnov test for normality with Lilliefors Significance Correction). Interestingly, the lower outlier of the 2C A. drummondii distribution possessed haplotype G, and had a fluorescence value similar to A. holboellii (Fig. 3). The only other G haplotype identified here was a triploid A. holboellii, and furthermore both G haplotypes are from the same location (Table 1). The removal of haplotype G from the A. drummondii distribution had no effect on the significance tests for normality.

Fig. 3
figure 3

Distributions of (a) A. holboellii and (b) A. drummondii diploid fluorescence values.

The distribution of diploidy and triploidy was not restricted to any specific chloroplast haplotype of A. holboellii or A. drummondii (Fig. 4). Both 2C and 3C individuals possessed haplotypes C, D, E and L in A. drummondii (including a single 4x individual of haplotype D), and haplotypes C, D, I, K, L and N in A. holboellii (Fig. 4). However, genome size distributions differed among haplotypes (Figs. 4 and 5). The distribution of 2C fluorescence values in haplotype I was bimodal, and neither peak differed from normality (N=42, Shapiro–Wilk’s W nonsignificant). 2C individuals of haplotype K had a non-normal distribution (Kolmogorov–Smirnov test for normality with Lilliefors Significance Correction=0.43, d.f.=70, P=0.001) which was skewed towards greater fluorescence values. Finally, the distribution of 2C individuals having haplotype N did not deviate from normality (Shapiro-Wilk’s W nonsignificant), although its mode was shifted towards the skewed region of haplotype K (Fig. 5).

Fig. 4
figure 4

Index of fluorescence distributions for (a) Arabis drummondii and (b) A. holboellii. Black numbers over boxplots indicate sample sizes, horizontal lines with no numbers represent single individuals, and white circles represent outliers.

Fig. 5
figure 5

Index of fluorescence distributions for diploid individuals of haplotypes I, K and N (N=42, 67, 37, respectively).

Phylogenetic analysis of ploidy variation

The samples were divided into three geographical groups (Fig. 1). Haplotypes C and L were found in all three regions, and included both 2C and 3C individuals (Figs. 4 and 6). Haplotypes G and M were specific to the south-west, and included both 2C and 3C individuals (Figs. 4 and 6). Haplotypes B, E, F, H, J, and P were specific to the northern group and included 2C and 3C individuals (Fig. 4). No haplotypes were specific to the south-east group. Interestingly, haplotype N was found in samples from both Greenland and Montana (Table 1, Fig. 4).

Fig. 6
figure 6

Neighbour-joining tree calculated from trnL haplotypes, using (Jukes & Cantor, 1969) correction. Bootstrap values (100 replicates) appear on all nodes, and pie charts represent proportions of ploidy types. Not shown are two diploid individuals from Godhaun (Greenland) having haplotype N, and a single tetraploid individual from Saugatuck Dunes (Michigan) having haplotype D. Outgroup (R) is H. perplexa.

We assumed that both the 6 bp indel and site with five adjacent SNPs were each the result of a single mutation event, to yield 17 phylogenetically informative polymorphisms (Table 2). A phylogenetic analysis of the 15 Arabis haplotypes, using H. perplexa as an outgroup, showed little statistical support for any haplotype clade (Fig. 6). Haplotype diversity did not differ between samples grouped by species, region or ploidy (Table 4). Tajima’s D-test statistics were nonsignificant for all groups, and thus none of the polymorphisms deviated from selective neutrality (Table 4).

Table 4 Estimates of haplotype diversity (H), Tajima’s statistic (D), and nucleotide diversity (ð) for subsamples of the dataset

Discussion

Recurrent origins of polyploidy

Polyploid genotypes have arisen repeatedly within the Arabis holboellii complex. Of the seven A. drummondii and 14 A. holboellii haplotypes identified from the chloroplast sequences, four and six (respectively) were characterized by both diploid and polyploid individuals (Figs. 4 and 6, Table 3). Furthermore, low levels of boostrap support for any haplotype clade (Fig. 6) demonstrates that a gene phylogeny cannot be well resolved with our data, and thus the propensity for generating polyploidy is shared between unrelated chloroplast lineages.

Because genome doubling (i.e. autopolyploidization) at the zygote stage cannot result in triploidy, what is the mode of polyploidization in this complex? A newly formed triploid lineage could be formed via two mechanisms, both of which entail the fertilization of an unreduced gamete by a reduced gamete. In the first case, the source of the unreduced gamete is a tetraploid individual which undergoes normal meiosis (Bell, 1982), but the extremely low frequency of tetraploid Arabis precludes this mechanism. A more likely scenario is the occasional production of unreduced gametes through environmental influence. For example, cold-shock has been implicated in polyploidization for a number of temperate taxa (Bretagnolle & Thompson, 1995). The evolutionary first step towards the generation of triploidy thus requires no specific genetic mutation, although its stable transmission to offspring would not be possible unless some compensatory mutation enabled balanced chromosome segregation through meiosis (e.g. apomixis, although see Kadereit, 1991).

Ploidy variation in the Arabis holboellii complex

Polyploid angiosperms are typically even-ploid (Ramsey & Schemske, 1998). In contrast, 3C (triploid) genotypes are prevalent throughout the samples in this study (Fig. 2). Triploid Arabis have previously been documented (Böcher, 1951; Mulligan, 1995), and hence it appears to be a common ploidy type within this group.

The genome size distribution for all diploid A. holboellii together was bimodal (Fig. 3). Karyological analyses of individuals sampled from the lower range showed them to have 14 chromosomes, and thus the lower peak represents true diploid A. holboellii. Arabis holboellii in the diploid range had significantly smaller genome sizes than corresponding 2C A. drummondii (Figs. 3, 4 and 5, T=− 9.49, P < 0.001, N=178). Considering that both species have a haploid chromosome number, n=7 (Koch et al., 1999), genome size differences between them are likely attributable to noncoding DNA (e.g. heterochromatin).

Intraspecific variation in genome size was also detected within both diploid and triploid A. holboellii and A. drummondii (Figs. 3, 4, 5). The bimodal or skewed fluorescence distributions for 2C haplotypes I, K and N are suggestive of aneuploidy for one or two chromosomes, showing two different genome size increments, 28% (relative to haploid genome size) in haplotype I and 14% in haplotypes K and N (Figs 3, 4, 5). Univalent chromosomes, one small and one large, have previously been described from A. holboellii (Böcher, 1951; Roy, 1995), and this is consistent with the genome size variation documented here. Considering previous chromosome studies (Böcher, 1951) these data suggest that aneuploidy may be common (Fig. 4), a phenomenon which has furthermore been confirmed by karyological and microsatellite analyses (unpubl. data). An alternative explanation for variation in genome size could be intraspecific variation for heterochromatin concentration, although this is less consistent with our data and previous karyological studies of the complex.

Population structure

It is interesting that we have identified different Arabis species with the same chloroplast haplotype (Table 3); trnL haplotypes shared between species have also been described for Brassica (Lannér, 1998). Since reproductive isolation has not been tested among named Arabis taxa, one possible explanation may be gene flow between ‘morphospecies’, as has been suggested by Koch et al. (1999). Our data nonetheless demonstrate species-specific differences in genome size between A. holboellii and A. drummondii (see above), and thus there is some degree of genetic isolation between them which prevents genome homogenization through random mating (Price et al., 1983).

The extensive geographical distribution of haplotype N (i.e. in Greenland and Montana; Figs 1, 5 and 6) is noteworthy. Assuming that extant North American Arabis populations were established by post-Pleistocene migrants from the south (Pielou, 1991), the widespread distribution of haplotype N could have resulted from it being a highly successful (and old) colonization genotype. Alternatively, if one considers that haplotype N could have had a wide pre-Pleistocene distribution which encompassed Greenland and North America, then its present-day distribution could be the result of survival in multiple glacial refugia. For example, disjunct populations of A. holboellii have been described from the St. Lawrence River Valley in Québec (Böcher, 1951), which may have been a glacial refugium (Marie-Victorin & Rolland-Germain, 1964). Considering the loose relationships between Brassicaceae species designations and chloroplast sequence variation (Lannér, 1998; Koch et al., 1999), more loci are needed in order to resolve the actual colonization scenario.

Implications for apomixis evolution

Triploids that stably produce triploid offspring and maintain meiosis can only do so via apomictic reproduction (Bell, 1982; Ramsey & Schemske, 1998, but see Kadereit, 1991). Hence, the triploids in our sample analyses are likely to be apomictic (Böcher, 1951; Roy, 1995). Diploid apomixis has also been proposed in Arabis (Böcher, 1951; Roy, 1995), although we have insufficient information to determine the reproductive mode of the diploids analysed here. We suspect that diploid sexual A. holboellii do exist, as evidenced from our unpublished observations on endosperm production (e.g. 2x individuals producing 3x endosperm, sensu Matzk et al., 2000). As these putative sexual individuals have identical haplotypes to triploids (i.e. apomicts), apomictic lineages may have originated repeatedly in Arabis species.

Providing an explanation for the origin of apomixis has proven difficult, as its evolution from a sexual ancestor (Beukeboom & Vrijenhoek, 1998) is thought to entail simultaneous accumulation of at least two genetically unrelated mutations, leading to both unreduced egg production and parthenogenetic development (Bell, 1982; Mogie, 1992). The evolution of either one of these traits in the absence of the other decreases fitness, and thus envisioning two random mutations leading to balanced apomixis has been difficult (Mogie, 1992). One possible solution suggests the existence of a single ‘apomixis factor’ (Mogie, 1992), one which might be recessive and lethal in the diploid heterozygous and homozygous conditions, respectively, but which finds balanced expression and/or penetrance through dosage in the polyploid state. While this model is not widely applicable to the majority of apomictic plant taxa, which are tetraploid (Savidan, 2000), it may provide a working hypothesis to explain recurrent origins of apomixis in the A. holboellii complex.

If we make the parsimonious assumption that a genetic factor responsible for initiating apomixis evolved once during the evolution of Arabis, then widespread polymorphism throughout this group would imply a balance of advantages and disadvantages to its carriers. A recessive apomixis factor could in this way remain invisible to selection, unless occurring in the diploid homozygote condition (lethal) or in a balanced fashion with a dominant wild-type allele through polyploid dosage (e.g. Aaa apomict, sensu Mogie, 1992). Such a model of apomixis evolution is furthermore consistent with stable aneuploidy in A. holboellii, as proper dosage of the ‘apomixis’ chromosome(s) alone could hypothetically lead to the apomictic phenotype. Finally, functional (although disturbed) microsporogenesis in apomictic individuals (Böcher, 1951) would enable low levels of gene flow back into sympatric sexual ancestors, thus assuring the repeated introgression of the recessive apomixis factor into sexual genetic backgrounds. Hybridization between reproductive biotypes could lead to fine-tuning of the ‘apomixis factor’ through recombination, and furthermore may enable this factor to purge itself of the deleterious mutations that are expected to accumulate in asexual lineages.