Introduction

Reproduction is the fundamental challenge in the establishment of colonizing populations (Baker, 1955). The mating system has long been recognized as playing an important role in plant colonization, with a predominance of uniparental reproduction in colonizers (Baker & Stebbins, 1965; Brown & Burdon, 1987). This, and the associated occurrence of bottlenecks of population size, profoundly affects the genetic diversity and structure of colonizing populations, and their capacity to respond to local selection pressures (Hamrick et al., 1979).

The mating systems of many successful colonizers have been inferred from their reproductive features (Fryxell, 1957), and such features are associated with colonization success (Brown & Marshall, 1981). Genetic marker analysis can provide more accurate and detailed information on the mating system parameters. Marker-based estimates of mating systems have been conducted for only a few successful colonizers (Glover & Barrett, 1986; Burdon et al., 1988; Sun & Corke, 1992; Abbott & Forbes, 1993). Such inferences can provide new perspectives; local variation of mating system parameters provides insights into environmental factors affecting the mating system, and differences between central and marginal populations for mating system parameters indicate microevolutionary changes following founder events.

The mating system has traditionally been characterized by selfing rates, which influence patterns of population genetic structure. When patterns of paternity are also included as facets of the mating system, the genetic relatedness among progeny of a plant becomes relevant. The pattern of paternity can be measured by parameters of the ‘correlated mating model’ of Ritland (1989). One parameter is the correlation of outcrossed paternity within progeny arrays, rp (alternatively, the proportion of pairs of progeny which are full-sibs, as opposed to half-sibs; see Ritland, 1989). This correlation arises from repeated matings to the same near-neighbours, or from the tendency of pollinator pollen loads to be derived from single plants. In partially selfing populations, correlations among progeny also arise from variation of selfing rates among plants. This is measured by a second parameter, the correlation of selfing within progeny arrays, rs (Ritland, 1989). In general, correlations among sibs affect local patterns of genetic variation, and the effectiveness of local selection.

Centaurea solstitialis L. (yellow starthistle) is a diploid (2n=16), herbaceous annual of Eurasian origin. Since its accidental introduction to California in the mid-19th century, probably as a contaminant in alfalfa seed, the species has become a naturalized weed of wide distribution and serious economic impact (Maddox & Mayfield, 1985; Maddox et al., 1985). It frequently invades a variety of disturbed habitats, such as parks and rangelands, pastures, hayfields, orchards, vineyards, roadsides and irrigation banks (Robbins et al., 1951; Thomsen et al., 1994). Previous and ongoing research has concentrated on ecological studies in relation to management and biological control (Thomsen et al., 1989; Maddox et al., 1991; Roche, 1991; Callihan et al., 1993). Little is known about the mating system and population genetics of this species.

In this study, mating system parameters of C. solstitialis were estimated, and variation in these parameters assessed in relation to the colonization of North America by this species. Geographically marginal populations were compared with central populations to determine if colonization of marginal sites has led to changes in mating system parameters. Information on the mating system parameters of this species will also help explain the patterns of genetic diversity in North American populations of C. solstitialis.

Materials and methods

A preliminary pollinator-exclusion experiment was conducted within a fenced area at the Agronomy Farm of the University of California, Davis. Eighteen plants at reproductive stage, at least 2 m apart, were used. One to three preanthesis flower heads per plant were covered with a semitransparent, waterproof paper bag; other flower heads at a similar developmental stage on the same plant were used as controls (open-pollinated). After five to six weeks, these flower heads were harvested, the numbers of florets and seeds were counted, and percentages seed-set for the bagged and open-pollinated heads were determined.

Eight natural populations were sampled in 1995 for mating system studies. These populations were Airport, Davis II and San Diego from California; Asotin, Gold Hill, Klickitat and Walla Walla from Washington; and Lenore from Idaho (see Sun, 1997 for site information). They were selected on the basis of regional representation: Davis II and Airport represent populations in the Sacramento Valley area, where yellow starthistle's invasion is particularly prominent; San Diego represents the southern extreme of the distribution in California; Lenore represents an old invasion in Idaho; and the four populations of Washington came from areas with well-recorded histories of colonization from 1900 to the present; Gold Hill also represents the northern extreme of colonization.

On average, 12 seeds per plant and 40 plants per population were germinated to generate progeny arrays used in mating system analyses. Nine polymorphic allozyme loci were used as gene markers: Aco-1, Aco-2, Alp-2, Dia-2, Lap-1, Pgm-1, Pgm-2, Pgi-2 and Skdh-1. Details of electrophoretic procedures are given elsewhere (Sun, 1997). Single-locus, Mendelian inheritance at these loci was verified by the conformity of progeny array patterns to Mendelian predictions, a useful method when controlled crosses are not available (Kephart, 1990). Specifically, one maternal allele was always found in each progeny (no families showed more than two alternative homozygotes, and genotypes of aa and bb were not found with cd within the same family). In addition, these loci have been assayed in a large number of organisms (Kephart, 1990) and generally shown to follow Mendelian ratios.

Mating system parameters were estimated using the MLTR program (available from K. Ritland). The program calculates maximum likelihood estimates of both outcrossing rates (based on the model of Ritland & Jain, 1981) and correlated mating parameters (based on the correlated matings model of Ritland, 1989; see above). The mating system parameters estimated included: the multilocus population outcrossing rate tm, the average single-locus population outcrossing rate ts, the inbreeding coefficient of maternal parents F, the correlated selfing rate rs, and the correlation of paternal genotypes between outcrossed sibs rp.

Because pollen and ovule population allele frequencies did not differ significantly, they were pooled to increase statistical power for estimating mating system parameters. For populations with significant correlated selfing, individual-plant multilocus outcrossing estimates were obtained. The Newton–Raphson method was used to find the maximum likelihood estimates. Standard errors of population estimates, including tmts, were calculated using 100 bootstraps, where the unit of resampling was the progeny array. When several parameters are simultaneously estimated, the potential exists for nonconvergence of the estimates to the true maximum, because of ridges and possibly minor peaks. Although the data have sufficient degrees of freedom for all parameters (Ritland, 1989), the rs vs. rp surface will have a ridge (see fig. 3 in Ritland, 1989), reflecting their negative statistical correlation, and iterations can stall on this ridge if it is sufficiently flat. However, the use of more variable markers, as in this study, minimizes this problem (see fig. 2 in Ritland, 1989). Other parameters of the model are much more statistically independent, implying easier joint numerical maximization. At least, the method of bootstrapping is expected to incorporate errors caused by inaccurate numerical maximization of the likelihood function.

Results

The bagging experiment showed that seed-set is pollinator-dependent: only one of 18 bagged flower heads (capitula) set seed, and it produced only three mature seeds. Unbagged (control) capitula on the same plant always set seed, with an average of 53 seeds each, but some florets within a capitulum did not set seed. The presence of just one ovule per floret gave a seed/ovule ratio for these fertile controls of 0.74 seeds per ovule in contrast to 0.0016 for the bagged capitula. The number of florets per capitulum in the sample ranged from 56 to 100, averaging 75.

Nine enzyme loci were polymorphic in the eight populations (Table 1) and quite high levels of variation were often found, with two to six alleles per locus, and triallelism quite common. Estimates of allele frequencies for maternal parents differed from those for the pollen pool only 13 of 222 times at the 5 per cent level, which is about that expected by chance. For this reason, the two gene pools were combined for subsequent estimations.

Table 1 Allele frequencies in populations of yellow starthistle (pollen and egg frequencies combined; SEs in parentheses; rare alleles excluded at some loci)

All populations showed a predominant outcrossing. Multilocus outcrossing rates ranged from 0.948 (SE=0.039) in San Diego to 0.990 (0.015) in Davis II, with a mean of 0.978 (0.007) for the eight populations (Table 2). Despite the high multilocus population outcrossing rates, genetic substructuring may exist in these populations, leading to some degree of biparental inbreeding. This inbreeding is detected by the difference between multilocus and single-locus selfing rates, as the latter include biparental inbreeding (Ritland & Jain, 1981). Indeed, significant differences between tm and ts were found in nearly all populations, ranging from 0.007 to 0.122, with an average of 0.057. A significantly positive parental inbreeding coefficient was found only in the marginal population, San Diego, where F=0.270.

Table 2 Mating system estimates for the eight populations of yellow starthistle, and their grand means (SEs in parentheses)

The correlation of paternity (probability that sibs shared the same father) ranged from rp=0.047 to 0.636, with a mean of 0.189 in these populations (Table 2). The average ‘paternal mating pool’ was 5–6 plants (Table 2), with the extremes found in San Diego (1–2 plants) and Airport (21–22 plants). This ‘pool’ is calculated as the inverse of rp (Ritland, 1989) and is the number of males that gives rise to this correlation, assuming all males have equal mating probabilities and consecutive matings are independent (Ritland, 1989). If the observed correlation is low, this pool may be larger than the size of the progeny array (in fact, a desirable feature of this statistic is that it is independent of the progeny array sample size). Individual plant estimates revealed significant correlation of selfing (among-plant variation of outcrossing rate) only within the Asotin population (Table 2). A plot of individual outcrossing rates for this population indicates that this correlation probably arises from a few families exhibiting significant selfing (Fig. 1).

Fig. 1
figure 1

Distribution of outcrossing rates in the Asotin population of yellow starthistle.

Discussion

Because colonization frequently involves a small number of founders, a self-incompatibility system is generally disadvantageous for colonizing species, as often no compatible mates are available. Self-compatibility assures reproduction (Baker's rule; Baker, 1955) and, furthermore, allows uniparental reproduction which is not dependent upon pollinators, and which may also rapidly generate and maintain adaptive genotypes following successful establishment (Stebbins, 1957). Outbreeding species are often successful colonizers (Brown & Marshall, 1981), but nearly all are self-compatible. The rare cases of self-incompatible colonizers provide an opportunity to examine the genetics and selective pressures underlying self-incompatibility (Barrett & Richardson, 1985).

Self-incompatibility and reproductive success

The nearly complete outcrossing in most populations of C. solstitialis studied here supports the finding of Maddox et al. (1996) that the species is predominantly self-incompatible. A plant will simultaneously produce many flower heads and many florets per head concurrently at anthesis, and bees often visit several flower heads on a single plant (M. Sun, pers. obs.). High levels of outcrossing occur despite this potential for geitonogamous selfing.

How do colonizing populations overcome the limitation of self-incompatibility and achieve reproductive success? The following factors could be involved: (i) human-mediated migration involving large numbers of seeds; (ii) extended reproductive periods and nonspecialized pollinator requirements; (iii) massive seed production; and (iv) occasional breakdown of self-incompatibility or temporary reversals to self-compatibility (Pandey, 1980; Barrett, 1988).

The wind-assisted dispersal of the pappus-bearing seeds in C. solstitialis is not sufficient for long-distance dispersal (Roche, 1991). Anthropogenic factors, such as man-mediated migrations through transportation of contaminated hay or crop seeds, must have frequently been involved in the spread of this species throughout North America. These introductions probably involved many founders. One other factor limiting reproduction in a self-incompatible species is availability of pollinators at newly colonized sites. Like many species of Compositae, yellow starthistle flowers have a nonspecialized pollination mechanism involving generalist insects. Honeybees (Apis mellifera L.) and bumble bees (Bombus spp.) are the most common pollinators of this species in North America (Maddox et al., 1996; M. Sun, pers. obs.).

The plants also have a prolonged flowering period, from May to September in the Sacramento Valley, and May to October in San Diego (Beauchamp, 1986), although the flowering season is shorter in Washington and Idaho. Reproductive capacity is remarkable for this self-incompatible colonizer. An average of 267 capitula and over 10 000 seeds per plant were recorded in a population in the Sacramento Valley (Maddox, 1981). Additionally, occasional breakdown of self-incompatibility occurs in some individuals of yellow starthistle, which can help ensure reproduction. Maddox et al. (1996) reported that a mean number of 27 seeds per capitulum was produced by a self-fertile yellow starthistle plant following experimental self-pollination. Mutation of incompatibility genes or at modifier loci is thought to underlie the loss of incompatibility (Barrett, 1988).

Variation of self-incompatibility would be evidenced by a significant estimate of the correlation of selfing, and/or significant variation of individual family outcrossing rates. For example, in the self-incompatible tree, Shorea congestiflora, family tm was found to vary from 0.49 to 1.00, and this perhaps resulted from weak self-incompatibility in some individuals (Murawski et al., 1994). We found significant correlation of selfing only in the Asotin population; estimates of individual outcrossing rates indicate that perhaps four families mainly contribute to this correlation (Fig. 1). The parents of these families may have reduced or lost self-incompatibility. However, population substructure, specifically the occurrence of more inbred genetic neighbourhoods, can also cause among-plant variation in outcrossing rate (Ritland & Ganders, 1985). In the Asotin population, the seed collection came from a late-flowering patch, and thus seasonality caused by moisture availability may have contributed to a genetic neighbourhood (C. Roché, pers. comm.). It is also possible that self-incompatability may break down in late-flowering plants.

Biparental inbreeding

Nearly all the single-locus outcrossing estimates were significantly lower than the multilocus estimates, indicating biparental inbreeding (Ritland & Jain, 1981). Such inbreeding is caused by the clustering of related plants combined with limited pollen flow. Indeed, C. solstitialis tends to form clumped, dense stands in some populations, and Roche (1991) found that the majority of seeds fell within 1.2 m of the source plant. Variation in the extent of population substructure and plant density might account for the observed differences in levels of biparental inbreeding among C. solstitialis populations. In self-incompatible populations of Helianthus annuus and Shorea species, increased levels of inbreeding were found at high plant densities (Ellstrand et al., 1978; Murawski et al., 1994). Such biparental inbreeding may be significant for the maintenance of self-incompatibility alleles in populations.

The occurrence of completely outbred (F=0) parents in seven of the eight populations is somewhat puzzling in light of the biparental inbreeding rates. This can be explained by the presence of selection against biparentally inbred progeny, which can occur if genetic load is high.

Correlation of outcrossed paternity

Sharing of paternity within a family, as measured by the correlations of outcrossed paternity, rp, could arise from multiple deposits of pollen from a single male parent, or from mating to a small number of near-neighbours. The extent of correlated paternity between flowers of the same plant in Mimulus guttatus (20 per cent) was consistent with random mating to an effective pool of 1/0.20 or five neighbours (Ritland, 1989). In our study, we found a similar effective mate pool, averaging 8.8 (calculated as the mean of the reciprocals of rp in Table 2), although this average is highly affected by the San Diego population. If this population is excluded, the average number of mates increases to 9.8.

In general, compared with outcrossing rates, estimates of correlated mating show more variation among populations. For example, the among-population variance of multilocus outcrossing estimates was 0.00027 (computed from Table 2), and this was about equal to that expected from random statistical error (0.00032, obtained as the mean variance of estimates). This indicates no ‘actual variance’ of outcrossing among populations. By contrast, the among-population variance of correlated paternity was 0.0354, and the variance attributable to sampling error was only 0.0018, indicating substantial ‘actual variance’ (≈0.034) for correlated paternity. This indicates that correlated matings are potentially more illustrative of ecological factors. These factors include competition within pollen loads, fertility variation, phenological differences, pollen carryover and population substructure (Barrett et al., 1992).

The marginal population San Diego

Several lines of evidence indicated that a relatively high level of inbreeding and population substructure exists in the marginal population, San Diego. This population had the lowest tm (0.948), the highest maternal inbreeding coefficient (0.270), the highest correlation of outcrossed paternity (0.636) and a level of biparental inbreeding (0.095) nearly twice the average. This suggests the presence of small mating neighbourhoods that often consist of siblings. As this population apparently experienced a severe genetic bottleneck at founding (Sun, 1997), reduced allelic diversity at incompatibility loci may have also decreased the number of compatible mates in the neighbourhood. Finally, this population may have also lost some of its inbreeding depression, as the significant F of parents indicates that selfed progeny have survived to maturity. However, it is not known whether current demographic factors may play a part in the higher amount of inbreeding in the marginal population, such as possible lower density-dependent selection against inbred progeny.

Selfing variants of normally outcrossing plant species are often found at the margins of their range (Lloyd, 1980). Gold Hill represents the northern limit of yellow starthistle's distribution in western North America, although this population occurs in an area with a history of yellow starthistle invasion (since 1928) and is not geographically isolated from other populations; no apparent founder effects were detected for either genetic diversity (Sun, 1997) or mating system parameters (Table 2). In contrast, the San Diego site, representing the southern distributional limit, is a geographically isolated, marginal population. High levels of inbreeding and correlated matings could have evolved following the founder event. The shift from outbreeding to self-fertilization has evolved repeatedly following colonization of geographically marginal habitats in Turnera ulmifolia and Eichhornia spp. (Barrett, 1989).

Management implications

Information obtained on the mating system of yellow starthistle is valuable to its management and control. The outcrossing system can maintain high levels of genetic variation within populations, which may contribute to colonizing success of the species. Biological control, which depends on the interaction of the host plant genotype and an insect genotype, may not be very effective in the case of yellow starthistle, because of its high level of diversity and/or potential for rapid evolutionary response. As the species is predominantly self-incompatible, pollinators are the keys to its reproductive success, and limiting pollination at the flowering season may significantly reduce seed output at the sites of invasion. In this light, commercial production of honey based on yellow starthistle plants, as frequently practised in the Sacramento Valley, California, might have contributed to its spread in the area.