Introduction

To be successful in the in situ conservation of intraspecific genetic diversity, populations managers must rely upon descriptive information about genetic structure and reproductive biology of considered species, as well as upon a good understanding of the reproductive mechanisms (Wright, 1965; Brown, 1989; Hedrick, 1990).

The present work on wild Lima bean (Phaseolus lunatus L.) populations located in the Central Valley of Costa Rica is a part of a wider programme aimed at describing population genetic structure, gene flow, genetic variability at microgeographical level, and at understanding mechanisms controlling the genetic structure, the reproductive biology, and the population dynamics. Isozymes electrophoresis data on these wild populations (Zoro Bi, 1999) indicated a genetic variability occurring mainly at the interpopulation level, with low allelic richness, low observed and expected heterozygosity, and low interpopulation gene flow (Zoro Bi, 1999). It was argued that the high number of significantly positive fixation indices observed for 29 populations was related to the occurrence of inbreeding resulting from a high selfing rate and/or the bottleneck effects that characterized the target populations, and/or sampling populations or plant patches that differ in gene frequencies (Wahlund effect). Since it is difficult to distinguish between the three alternatives without a thorough examination of the mating system, the present study aimed to clarify this topic. Thus, we estimated the mating system parameters for 10 populations of wild Lima bean and described patterns of outcrossing using isozyme markers. The relationship between the outcrossing rate and population size was also examined, given that the pattern of this relationship could have an important application for in situ management strategies.

Materials and methods

Plant material

A total of 10 wild Lima bean populations with at least 10 plants bearing mature pods and expressing at least three polymorphic enzyme loci were sampled during the 1995, 1996, 1997, or 1998 first trimestrial period, corresponding to the time of their physiological maturity. Each population was followed for a complete season so that all individuals bearing pods during that season were sampled, resulting in 10–60 plants, sampled per population and four to six racemes per plant. Reproductive individuals can produce several racemes (about 400) with one to 20 pods per raceme, each pod containing one to five seeds. One seed was randomly chosen per raceme for electrophoretic analysis, resulting in sample sizes ranging from 58 to 334 seeds per population. These criteria were those suggested by Ritland and Jain (1981) and Shaw and Brown (1982). According to these authors, for the predominantly autogamous species, the best estimates of outcrossing rate can be obtained from few loci (three to six) and a large number of plants per population. The selected populations were identified by several alpha-numeric codes (Rocha et al, 1997).

Electrophoretic analysis

To estimate the gene frequencies and mating system parameters, we selected six polymorphic loci resolved from six enzymatic systems: alcohol dehydrogenase (ADH, E.C. 1.1.1.1.), diaphorase (DIA, E.C. 1.8.1.4.), fluorimetric esterase (fEST, E.C. 3.1.1.−), malate dehydrogenase (MDH, E.C. 1.1.1.37), phosphoglucomutase (PGM, E.C. 5.4.2.2), and shikimate dehydrogenase (SKDH, E.C. 1.1.1.25). Enzyme extraction was carried out by grinding 5-day-old cotyledon tissues in a potassium phosphate buffer, pH 7.0, containing 20% sucrose (Sigma # S-8501), 5% PVP-40, 0.05% triton X-100 (Sigma #T-8532), 14 mM 2-mercaptoethanol (Sigma #M-6250), and 0.1 M KH2PO4. The pH value was adjusted to 7.0 with a solution of 5 N NaOH. Electrophoresis was performed using a horizontal 10% starch-gel (Sigma #S-4501) containing 3% sucrose. Two buffer systems were employed: continuous histidine-citrate, pH 6.1 for ADH and MDH, and discontinuous lithium-borate, pH 8.1/Tris-citrate, pH 8.4 for DIA, fEST, PGM, and SKDH. The techniques for gel electrophoresis and histochemical staining procedures are those reported elsewhere (Zoro Bi et al, 1999).

Loci were labelled sequentially, with those migrating closest to the anodal end designated as number 1. The Centro Internacional de Agricultura Tropical (CIAT, Cali, Colombia) accession G25221, a Mexican wild form, was used as the control for our analyses. The allozyme from this genotype is designated 100 and all other allozymes are assessed according to their relative migration rate. The genetic control and the quaternary structure of the analysed enzyme systems were discussed previously (Zoro Bi et al, 1999).

Data analysis

We used the multilocus mixed-mating model and the estimation procedure of Ritland and Jain (1981), implemented by Ritland (1990), to estimate the following parameters: average single-locus (ts) and multilocus (tm) outcrossing rate, ovule and pollen allele frequencies, and the inbreeding coefficient of maternal parents (F). The model uses the Newton–Raphson method for joint maximum-likelihood estimation of outcrossing (ts and tm) and F, the expectation-maximisation method for determining maternal and pollen allele frequencies, and the method of Brown and Allard (1970) for inferring maternal plant genotypes. The calculation of outcrossing rate using the multilocus procedure improves the chance of detecting outcrosses when parents are related, so that the difference between tm and ts represents an estimate of the biparental inbreeding (Shaw et al, 1981; Ritland, 1984). Assumptions of this model are: (1) no genetic changes due to mutation or selection following fertilisation; (2) no assortative mating; (3) no heterogeneity in the incorporated pollen pool; and (4) outcrossing is independent of maternal genotypes. Since assumption 3 is often violated in plant species (Godt and Hamrick, 1991), we tested for each population the homogeneity between pollen and ovule allele frequencies using Fischer exact tests (Sokal and Rohlf, 1995). The goodness-of-fit of the observed to the expected frequencies of progeny genotypes under the mixed-mating model assumptions was tested for each locus using a χ2 test generated by MLT (Ritland, 1990). Only data for loci showing a good fit to this model were used to estimate the mating system parameters. This method provides not only greater statistical power, but is also free of the inbreeding equilibrium assumption (Ritland, 1983). Standard errors of mating system estimates, including tmts, were calculated by bootstraping, where the unit of resampling was the maternal family arrays. The mean maternal plant inbreeding coefficient based on progeny genotypes (F) was compared to the expected inbreeding coefficient at equilibrium Fe=(1−t)/(1+t) (Fyfe and Bailey, 1951). If populations are at a genetic equilibrium and genotypic frequencies are determined solely by the mating system, F and Fe are equal (Brown, 1979). Discrepancies between F and Fe reflect the amount by which a population deviates from inbreeding equilibrium. For each population, we tested the significance level of ts and tm by a one-tailed Student's t-test based on the null hypothesis that ts or tm=1 (Sokal and Rohlf, 1995). To verify the importance of the biparental inbreeding in autogamy for wild Lima bean, we also compared tmts to zero using a Student's test. The significance level of F was tested using a χ2 test (Li and Horvitz, 1953), whereas the mean FeF was tested for significant difference from zero using a paired-t-test. Variability of tm among the analysed populations was tested using a χ2 test suggested by Godt and Hamrick (1991). The test was carried out by subtracting each population estimate from the global mean, dividing these differences by the standard error associated with each population outcrossing rate, squaring these quantities, and summing over populations. This statistics was tested as χ2-value, with degree of freedom being one less than the number of populations.

To examine the relationships between population sizes and outcrossing rates (tm), Spearman rank correlation coefficients (r) and their probabilities (P) were calculated using the SAS statistical package version 8.2 (SAS Institute Inc., 1990).

Results

The homogeneity test (two-tailed test with α/2=0.025) indicated that pollen and ovule allele frequencies in the 10 populations of wild P. lunatus were not significantly different for all enzyme loci analysed (Table 1). Such results suggested that maternal individuals contributed equally to the pollen pool. Results of goodness-of-fit tests showed that of the 10 sampled populations, tm and ts could be estimated simultaneously for nine populations (Table 1): E25 with data from four loci, E50 with data from three loci, E76 with data from three loci, E84 with data from two loci, E88 with data from one locus, E100 with data from three loci, G1 with data from two loci, J48 with data from three loci, and KM30 with data from two loci. Significant departure from the expected frequencies of progeny genotypes was detected for the three loci analysed in population TR54.

Table 1 Estimated pollen and ovule allele frequencies at six enzyme loci in 10 populations of wild Phaseolus lunatus, significance levels of exact tests of homogeneity for allele frequencies between pollen and ovule pools (P), and chi-square goodness-of-fit tests for mixed-mating modela

The multilocus and mean single-locus outcrossing estimates for wild Lima bean from the Central Valley of Costa Rica were tm±SE=0.096±0.071 and ts±SE=0.091±0.065, respectively (Table 2). Significant self-fertilization (ie tm and ts <1) was detected in the nine populations in which tm and ts were calculated simultaneously. Accordingly, the means tm and ts were significantly lower than 1 (tm: Student's t=38.05, P<0.001; ts: Student's t=41.96, P<0.001). For the nine selected populations, the differences tmts were not significantly different from zero, indicating the absence of biparental inbreeding. Consequently, the mean tmts was not different from zero (Student's t=1.15; P=0.280).

Table 2 Sample sizes (m is the number of plants and n is the number of seeds), estimates (±SE) of multilocus (tm) and single-locus outcrossing (ts) rates, biparental inbreeding (tmts), inbreeding coefficients of maternal parents (F), expected inbreeding coefficients at equilibrium (Fe), and difference between Fe and F for nine wild Lima bean populations

Inbreeding coefficients based on genotypic frequencies of maternal plants were positive and significantly greater than zero, suggesting an excess of homozygotes in all the populations (Table 2). In all cases, the equilibrium inbreeding coefficients estimated from the multilocus outcrossing rates were greater than the observed maternal inbreeding coefficients (Table 2), a trend that was statistically significant (paired-t=8.71, P<0.001). The mean equilibrium inbreeding coefficient for the nine populations was 0.830. This result indicated the occurrence of inbreeding disequilibrium in the studied populations.

Significant heterogeneity in multilocus outcrossing rates was observed among the nine analysed populations (χ2=23.60, P=0.003).

A nonsignificant correlation was found between the multilocus outcrossing rate and population size expressed as both the number of individuals in population (r=0.092, P=0.814) and the number of seeds collected (r=0.234, P=0.544). Thus, for wild Lima bean, it appeared that population size was not significantly associated with the level of multilocus outcrossing rate.

Discussion

In this study, outcrossing rates ranged from 0. 027 to 0.268 across nine populations. These data were close to those reported from previous study (Harding and Tucker, 1969). Indeed, using cultivated forms of Lima bean and morphological markers, these authors observed that the outcrossing rates ranged from 0.032 to 0.242. P. lunatus could therefore be classified as mixed-mating and predominantly selfing, according to the criterion of Schemske and Lande (1985). The high rates of apparent selfing (up 90%) in populations of wild Lima bean were consistent with the flower morphology (Webster et al, 1979) and the main pollinator foraging behaviour (Hardy et al, 1997). Indeed, using a cultivated form of Lima bean to study the morphogenesis of the reproductive structures, Webster et al (1979) showed that this plant was primarily self-pollinated. Morphological characteristics of the reproductive structures that facilitate self-pollination included the stage of floral development at the time of anther dehiscence and the relative positions of anthers and stigma within the keel at the time of pollen shedding. Coincidental maturation of pollen and receptivity of the stigmatic surface reported for this plant material also enhanced the capacity of self-pollination. In addition, from direct observations made in the valley, Hardy et al (1997) noted that the common bee (Apis mellifera) was the main pollinator of wild Lima. It is well known that individuals of this species tend to forage in restricted areas (Free, 1993). According to Godt and Hamrick (1991), the genetic effect of such pollinator behaviour is to reduce the single-locus outcrossing estimates. In our case, by analysing seeds from different pods per plant, we may have reduced the effect of correlated mating on the estimates of outcrossing.

We used three approaches to get some insight into the nature of outcrossed matings: (i) the estimation of pollen and ovule pool allele frequencies and the test of homogeneity between them, (ii) the magnitude and significance of the difference between multilocus and monolocus outcrossing estimates, and (iii) the comparison of the maternal inbreeding coefficient (F) to the expected inbreeding coefficient at equilibrium (Fe). No difference was found between pollen and maternal allele frequencies at any locus analysed, suggesting that plants had equal pollination success. Comparison between F and Fe was a second way to obtain information about patterns of outcrossing. Indeed, the assumptions that make the Fe estimation model different from that of F are that: (1) the population has reached its equilibrium genotypic composition, and (2) selfing is the only factor that influences the inbreeding coefficient. Consequently, the observed significant difference between the two estimates could arise from (1) the fact that the populations were not yet at equilibrium, and/or (2) the existence of selection in favour and/or against certain genotypes (Ennos, 1981) or additional sources structuring the genetic variation (eg biparental inbreeding). A study to compare the genetic variation at different stage of wild Lima bean life cycle, namely soil seeds bank, seedlings and adults (Cabin, 1996; Cabin et al, 1998) is required to identify which of inbreeding disequilibrium or selection are responsible. Nevertheless, it is worth noting that the founder effects resulting from extinction/recolonisation episodes that characterised the study populations (Rocha et al, 1997) could enhance the effects of subsequent selection by facilitating the establishment and expansion of favoured genotypes. Indeed, in the target area, many wild Lima bean populations are found in coffee plantations, fallow land or along hedges, so that weeding practices contribute to the destruction of plants. Recolonisation of the cleared sites could be due to any nearby plants, to new individuals emerging from the soil seed bank, or to human activities (such as seed transportation over longer distances on shoes or tools). We found that the mean multilocus estimate of selfing (90%) was similar to the mean monolocus estimate (91%). On average, only 1% of the apparent selfing in the populations was due to biparental inbreeding. Thus, in the wild Lima bean, the autogamy could be attributed mainly to the true selfing. The breeding system could explain the high level of homozygosity observed in the studied populations.

This study revealed significant heterogeneity in outcrossing rates among populations of P. lunatus. Interpopulation differences in outcrossing rates are generally attributed to variation in population size, population density, floral morphology, floral phenology, and pollinator availability (Donnelly et al, 1998; Fausto et al, 2001; Neel, 2002). For wild P. lunatus, the heterogeneity observed in outcrossing rates seems unrelated to population size since among the 10 studied populations, we found a nonsignificant correlation. Similar results have been reported for several plant species (van Treuren et al, 1993; Raijmann et al, 1994; Kunin, 1997). Given that wild Lima bean populations have complex shapes in the target site, it was difficult to estimate the density with certainty. However, visual examination of the spatial distribution of plants in these populations indicated apparent differences in densities. Indeed, the study populations were characterised by linear or patchy and gregarious distribution, with different number of plants per patch and sometimes, few flowers per plant. In these conditions, the contribution of density to the observed outcrossing heterogeneity could be expected. There was no variation in floral morphology among the populations of wild Lima bean in the target site so that this hypothesis could be discarded. Presently, an in-depth study on the floral phenology of wild Lima bean populations from the target site is in progress. Results from such studies could allow us to check the relationship occurring between the interpopulation outcrossing rates variation and the possible variation in flower appearance period. Such observations should contribute to checking the origin of the heterogeneity of outcrossing rates among the studied populations.

Sampling populations through several years (1995–1998) could also influence the outcrossing rate homogeneity. This influence is important if only seedling recruitment is temporally or spatially limited (Aspinwall and Christian, 1992; Raijmann et al, 1994). For wild Lima bean, the mean seed germination rate within a year was 78%, and 4% of the germinating seeds reach the matured age the same year (Degreef, 1998), indicating that this hypothesis could also explain the observed tendency. For this study, our sampling efforts were mainly devoted to obtaining samples of at least 10 matured plants and expressing at least three polymorphic loci in order to satisfy the used models assumptions. Since in the target site only 16% of the 400 recorded populations displayed more than five plants, it was difficult to obtain a sufficient number of populations answering the indicated purpose within a year. Further studies should examine how temporal variation in outcrossing rates may interact with temporal and spatial variation in seedling recruitment (Schemske et al, 1994).

From the heterogeneity of outcrossing rates observed among the studied populations and the significant differences between F and Fe, we could also conclude that these populations were at different stages of mating system evolution. Indeed, theoretical models of mating system evolution predict various equilibrium outcomes, ranging from mixed-mating to stable equilibria at either extreme of outcrossing continuum (Lande and Schemske, 1985; Uyenoyama, 1986). The mating system evolution is a complex process which can be affected by a number of biological factors as well as abiotic conditions (Schemske and Lande, 1985; Godt and Hamrick, 1993). Accordingly, for wild Lima bean, a two-pronged study is needed to evaluate the mating system evolution. First, genetic variation in mating system must be quantified in experimental situations in which environmental effects are controlled. Such study should look for the existence of significant levels of genetic variation in relation to outcrossing rates (Kalher et al, 1975). Secondly, environmental effects must be considered through detailed studies of the factors that affect outcrossing rates (eg pollinator abundance and density).