Introduction

The physical and biotic alterations at the edge (boundaries) of fragmented habitats, called edge effects, have serious impacts on ecosystem functionality (Laurence et al., 2007). In a fragmented forest, there is a reduction in the density of reproductive trees and neighbour size, as well as changes in the behaviour of pollinators, all of which affect the mating system of reproductive tree species (Fuchs et al., 2003). The knowledge of how and whether edge effects influence the mating system of tree populations and, consequently, the coancestry and variance effective size within open-pollinated progenies is crucial for the development of seed harvesting, conservation, breeding and environmental restoration strategies (Moraes and Sebbenn, 2011). This is especially important in Brazil because of the intense destruction and fragmentation of its rain forests (Ribeiro et al., 2009) and savannahs (Klink and Machado, 2005). In this context, one important and practical question remains unclear: can we collect seeds at the edge of the remnants for conservation, breeding and environmental restoration purposes or must we collect seeds at the centre of the remnants to maximise the variance effective size of the harvested seed lot? As the genetic diversity, relatedness and variance effective size within open-pollinated progenies is determined by mating patterns (Murawski and Hamrick, 1991), our hypothesis is that habitat boundaries may not be the most suitable place to conduct seed harvesting, although these places provide easy access and low operational costs.

In hermaphroditic tree species, the level of outcrossing depends on the following factors: the reproductive population density or the flowering tree density; the foraging patterns of pollinators within and among cospecific individuals; mechanical, physiological and genetic barriers that prevent selfing; and the selective abortion of selfed fruits and seeds (Murawski and Hamrick, 1991; Goodwillie et al., 2005). Ecological variables, such as spatial isolation (Fuchs et al., 2003; Sork and Smouse, 2006), flowering phenology, plant density (Murawski and Hamrick, 1991) and changes in pollinator activity (Hirao et al., 2006), have been identified as factors that may affect the mating patterns in tree species. Moreover, temporal isolation (differences in flowering phenology) and spatial isolation (low density of flowering trees) have been proposed as causes of high rates of self-pollination in some species of trees (Murawski and Hamrick, 1991; Moraes and Sebbenn, 2011). A reduction in the density of reproductive trees is expected to increase the degree of spatial isolation, which increases selfing (Fuchs et al., 2003; Robledo-Arnuncio et al., 2004; Aguilar et al., 2008; Moraes and Sebbenn, 2011). Despite the increase in selfing that occurs with a decrease in population density, the reported results for correlated mating have conflicted. Some studies have indicated that an increase in correlated mating occurs with a decrease in population density (Fuchs et al., 2003; Moraes and Sebbenn, 2011), while the opposite was observed in another study (Robledo-Arnuncio et al., 2004). Selfing increases inbreeding in the offspring generation, which produces inbreeding depression. Selfing, mating among relatives and correlated mating increase the frequency of identical-by-descent alleles and decrease the variance effective size within open-pollinated progenies (Murawski and Hamrick, 1991).

Genetic markers, such as microsatellite loci, have been used successfully to quantify the mating system in tree species (Ashley, 2010). In maternal open-pollinated seeds, the mating system can be measured as the proportion of seeds that result from outcrossing or selfing events, and the outcrossing fraction can be split into the proportions of seeds that originated from the same pollen donor (full-sibs) and those from different pollen donors (half-sibs; Goodwillie et al., 2005). These mating system events can be quantified using the mixed mating system (Ritland and Jain, 1981) and correlated mating models (Ritland, 1989) that have been implemented in the classical MLTR programme (Ritland, 2002). Complementarily, pollen dispersal studies within continuous and dense populations of plants that do not require sampling potential pollen donors can be investigated using TWOGENER (Austerlitz and Smouse 2001; Smouse et al., 2001) and KINDIST (Robledo-Arnuncio et al., 2007) analyses. These two methods require mother–offspring diploid codominant genotypic data. The TWOGENER analysis estimates the pollen pool structure using the multi-locus male gamete genotypes of single seeds and subjects them to an analysis of molecular variance, yielding the intra-class correlation coefficient of pollen genotypes (Φft), which is a measure of the differentiation within the pollen pool (Austerlitz and Smouse, 2001; Smouse et al., 2001). This method also allows conclusions to be made about the pollen dispersal distance (δ), the effective pollination neighbourhood area (Smouse et al., 2001), the effective number of pollen donors (Nep), the coancestry coefficient (Θ), the effective size within progenies and the number of seed trees required to collect seeds (Bittencourt and Sebbenn, 2008). As an upgrade to the TWOGENER analysis, KINDIST is independent of the effective density of the pollen donors and provides a description of the relationship between among-sibship normalised correlated paternity estimates (Ψ), distance (z) and pollen dispersal distance (δ) (Robledo-Arnuncio et al., 2007).

Here, we investigated the mating system and pollen pool differentiation among seed trees in the insect-pollinated Neotropical tree C. langsdorffii within a continuous savannah woodland (SW) and its edge using a mixed mating system (Ritland and Jain, 1981) and correlated mating models (Ritland, 1989) and TWOGENER (Smouse et al., 2001) and KINDIST (Robledo-Arnuncio et al., 2007) analyses. As the mating system of a tree species may be affected by the population density of reproductive plants (Murawski and Hamrick, 1991; Naito et al., 2008; Moraes and Sebbenn, 2011), the lower density in the edge than in the centre of a forest fragment may result in increased rates of selfing, correlated matings, pollen pool differentiation among seed trees and inbreeding and decrease in genetic diversity and variance effective size in open-pollinated progenies. Thus, we specifically addressed three questions about edge effects: (i) is the average number of alleles, heterozygosity and outcrossing rate higher in progenies of seed trees located within continuous SW than its edge? (ii) Is the pollen pool differentiation lower in the continuous SW than in the edge of the remnant? (iii) Is the variance effective size within progenies higher in seed trees located within the continuous SW or at its edge?

Materials and methods

Copaifera langsdorffii Desf. (Fabaceae)

Copaifera langsdorffii is a hermaphroditic tree with wide distribution in the Brazilian savannah and Atlantic rain forest (Carvalho, 2003). This species produces seeds mainly by outcrossing, but selfing has been detected even in the seedlings of natural regeneration (Sebbenn et al., 2011). Manual crosses revealed that geitonogamy was the only observed selfing phenomenon, indicating that this species is highly dependent on its pollinators (Freitas and Oliveira, 2002). Its bisexual flowers exhibiting herkogamy are pollinated mainly by small native bees, such as Scaptotrigona, Trigona and the naturalised bee Apis mellifera. A high quantity of flowers is produced in several weeks, and the blossoming of the tree varies from annual to supra-annual (Freitas and Oliveira, 2002). In natural populations, the reproductive stage begins between 20 and 30 years because of the tree’s medium to slow growth rate; however, some individuals in experimental plantations with a 6 cm diameter at breast height may flower (Carvalho, 2003). C. langsdorffii seeds are mainly dispersed by gravity and birds. In addition to its ecological role, C. langsdorffii also has economic importance because of its high quality wood and oil with pharmaceutical proprieties that can be extracted directly from its trunk. However, the species is listed as being at risk of extinction in some Brazilian states as a result of the intense destruction and fragmentation of the Atlantic rain forest and savannah environments (reviewed in Carvalho, 2003).

Study site and sampling

The Brazilian savannah, known as ‘cerrado’, covers an area of 2 000 000 km2 from 3°N to 24°S and represents approximately 23% of the country’s land surface (Klink and Machado, 2005). The cerrado consists of a mosaic of physiognomies that vary along a gradient from open grasslands to dense woodland formations controlled by interactions among climatic, edaphic and disturbance factors, such as fire and herbivory. The typical savannah ‘cerrado sensu stricto’ is characterised by open grasslands, shrubs and up to 50% of discontinuous tree cover. On the other hand, SW ‘cerradão’, has a forest physiognomy, where the woody vegetation forms a continuous stratum, typically with >90% coverage, and the average height of the trees varies between 8 and 15 m (Pinheiro and Durigan, 2009). The Brazilian southern savannahs originally covered an area of 50 000 km2 (2.5% of the Brazilian savannah) in the State of São Paulo and parts of Minas Gerais and Paraná. The southern savannah is distinguished from the core savannah domain because it occurs under lower temperatures and has a shorter dry season (Durigan, 2006). Consequently, the southern savannahs form a distinct phytogeographical region, which is divided into two areas: one covered by SW, which represents 70% of the surveyed savannah sites in the State of São Paulo, and another covered by typical savannah. Remarkably, the third most frequently recorded woody species in the surveyed southern savannahs sites in the State of São Paulo is C. langsdorffii (Durigan, 2006).

This study was conducted at the Ecological Station of Assis (ESA), which is a reserve of the State of São Paulo, Brazil (22°33′20′′ to 22°37′41′′S and 50°24′4′′ to 50°21′27′′W). The reserve covers 1760 ha (Figure 1): 1312.18 ha is occupied by native vegetation, and 1115.68 ha is composed of SW. The ESA, one of the largest and among the few protected southern savannah remnants in São Paulo, is isolated in an agricultural landscape. The ESA boundaries are composed of open SW juxtaposed with shrubby savannah, open savannah and an agricultural landscape (Pinheiro and Durigan, 2009) that contains very few, far-between (>1 km) and small (<10 ha) isolated savannah fragments. The absolute density (trees ha−1) of C. langsdorffii individuals with a diameter at breast height >5 cm throughout the ESA in the SW was 296, making C. langsdorffii the most dominant woody species (Pinheiro, 2008). Within the ESA, we focused our study on the edge of the remnant and a pre-existing 10.24 ha permanent plot of SW that is at least 500 m from the edge of the remnant. In the plot, a total of 4009 C. langsdorffii individuals with a diameter at breast height that was >5 cm were found, measured and mapped by the BIOTA-FAPESP Permanent Plot Consortium in 2006; C. langsdorffii was the plot’s dominant species. To address our questions, the following sampling scheme in the ESA was conceived (Figure 1). We sampled seeds from all of the C. langsdorffii trees with a diameter at breast height 25 cm that flowered and fructified within the same period of 2007 that were in the plot (N=17) or that were up to 10 m from the edge of the ESA (N=11). The reproductive tree density was 1.66 trees ha−1 in the plot and 0.66 trees ha−1 at the edge. The open-pollinated offspring (seeds) of the 28 seed trees were directly collected from the crown of the trees. Twenty random offspring per seed tree (here referred as a progeny) were used in our analysis, totalling 560 offspring or 28 progenies.

Figure 1
figure 1

(a) Geographical distribution of C. langsdorffii (light grey) and the Brazilian savannah (dark grey) in Brazil. (b) Location of the city of Assis in São Paulo State. (c) Satellite image of the ESA and its surrounding agricultural landscape. Boundaries of the ESA are represented by continuous white dots, and areas inside the ESA that are not characterised as SW are represented by diagonal white lines. Distribution of the 17 seed trees inside the 10.24 ha plot in the continuous SW () and the 11 seed trees throughout the edge of the ESA (Δ).

Microsatellite genotyping

DNA was extracted from the leaflets of the offspring (seeds germinated in a greenhouse) and the leaves of the seed trees using the CTAB method (Doyle and Doyle, 1990). After extraction, DNA was quantified on 1% agarose gels with a lambda DNA standard and diluted to a concentration of 2.5 ng μl–1. The eight microsatellite loci used in this study and the PCR conditions were reported by Ciampi et al. (2000). The amplifications were performed using an MJ Research PT-100 thermal cycler (MJ Research, Matertown, MA, USA) adjusted to the following conditions: 5 min at 94 °C; 30 cycles of 1 min at 95 °C, 1 min at the specific temperature for each primer, and 1 min at 72 °C; and a final elongation step of 7 min at 72 °C. The amplification products were separated on 5% (w/v) polyacrylamide gels and stained with silver nitrate. As microsatellite genotyping errors can lead to biased results, we double-checked all gels. We used MICRO-CHECKER 2.2.3 (Oosterhout et al., 2004) to test possible stuttering, large allelic dropout and null alleles in the adult individuals, using a 95% confidence interval based on 1000 Monte-Carlo permutations. No evidence for scoring errors because of stuttering or for large allelic dropout was found. Null alleles may be present at loci CL02, CL32 and CL34, as is suggested by the general excess of homozygotes for most allele size classes (P<0.05). However, loci CL02 and CL34 had a mistyping of zero, and locus CL32 had only one mistyping, indicating that the homozygote excess may be due to selfing in the population and not necessarily to null alleles. Another method that may demonstrate that all loci have no null alleles is the check for genotyping errors and null alleles that we performed using MLTR 3.4 (Ritland, 2002) to detect seed tree–offspring mismatches. We obtained zero seed tree–offspring mismatching and an absence of null alleles in the offspring for all loci. In a previous study, Tarazi et al. (2010) demonstrated that the SSR marker of C. langsdorffii exhibited Mendelian inheritance and an absence of linkage in the studied offspring population.

Hypothesis tests

As the seed trees sample sizes were different between the sites, to compare the genetic diversity, mating system and TWOGENER parameters, and to verify their significance, we used a 95% bootstrap percentile confidence interval based on 1000 replications (CI95%). The lower and upper confidence intervals were determined by taking the 2.5 and 97.5 percentiles after ranking the bootstrap values. In all comparisons, we used progenies as replication units. In addition, we tested if sample size influenced our estimates by bootstrapping 1000 times 11 progenies (number of progenies of the edge) from the SW and re-estimated all of the parameters. We also tested if the isolated tree in the western edge of ESA could affect the parameters cited above. This analysis was conducted by comparing the overall 1000 bootstrapped estimates of 11 progenies of the edge with estimates without the isolated tree.

Genetic diversity and fixation index analyses

The genetic diversity of the offspring inside the SW and at the edge was estimated by the number of alleles per locus (k), allelic richness (R), using rarefaction method (El Moussadik and Petit, 1996), total number (ktotal) and average number of alleles per locus as well as the observed (Ho) and expected heterozygosity at Hardy–Weinberg equilibrium (He). The level of inbreeding was estimated using the fixation index (F). All parameters were calculated using the FSTAT programme (Goudet, 1995).

Mating system analyses

The mating system was analysed for site (SW and edge) and for progeny level based on the mixed mating model and a correlated mating model using the MLTR 3.4 programme (Ritland, 2002). Analyses at the population level were carried out using the Newton–Raphson numerical method and at the progeny level using the expectation-maximisation numerical method. The estimated parameters were the multilocus outcrossing rate (tm), the single-locus outcrossing rate (ts), the outcrossing rate among relative trees (tmts), the multilocus paternity correlation (rpm), the single-locus paternity correlation (rps) and the average effective number of pollen donors (Nep=1/rpm). We obtained a 95% bootstrap percentile confidence interval based on 1000 replications. The bootstrap analysis at the population level was carried out using progenies as replication units and at the progeny level using offspring within progenies as replication units.

TwoGener and KinDist analyses

To test the hypothesis that there is higher genetic differentiation in the pollen pool of seed trees from the edge than in the SW, we used TWOGENER (Smouse et al., 2001) and KINDIST (Robledo-Arnuncio et al., 2007) analyses. For these analyses, we used the progeny arrays of the 17 seed trees in the SW, which totalled 340 seeds, and the progeny of the 11 seed trees of the edge of the ESA, which totalled 220 seeds, as well as all of the genotypes and the spatial position of the sampled seed trees. The global differentiation of the pollen pool (Φft) among seed trees, the relationship between among-sibship normalised correlated paternity estimates (Ψ), distance (z) and pollen dispersal distance (δ) was estimated using the Poldisp 1.0 programme (Robledo-Arnuncio et al., 2007). The Φft value can be affected by inbreeding and selfing (Austerlitz and Smouse, 2001; Burczyk and Koralewski, 2005). However, our estimation of inbreeding in the seed trees of both sites was zero (FCSW=0.002 and Fedge=0.000), and consequently, the estimate of Φ′ft was not influenced by the inbreeding of the parental generation: (Austerlitz and Smouse, 2001). Therefore, Φft was only adjusted for selfing (s) as follows: (Burczyk and Koralewski, 2005). From , we also estimated the average effective number of pollen donors (, Smouse et al., 2001). If a significant negative relationship between among-sibship normalised correlated paternity estimates (Ψ) and distance (z) were to be detected in the sites, pairwise Ψ(z) estimates between all seed trees in the population should be calculated to derive the pollen dispersal distance (δ) and the estimated effective density of reproductive trees (de) in the normal and exponential dispersal function (Robledo-Arnuncio et al., 2007).

Estimation of the variance effective size and the number of seed trees for seed harvesting

The minimum number of seed trees (m) required for harvesting seeds for conservation purposes was calculated: m=Ne(reference)/Ne, where Ne(reference) is the reference variance effective size of the seeds to be harvested (500) and Ne is the average variance effective size within progenies (Bittencourt and Sebbenn, 2008). The Ne was estimated as follows: Ne=0.5/{Θ[(n−1)/n]+(1+Fs)/2n} (Cockerham, 1969), where Fs is the inbreeding coefficient of the seeds, Θ is the average coancestry within progenies and n is the sample size (here assumed to be 20). The average coancestry coefficient within progenies was estimated from the mating system parameter by the equation . This estimator corresponds to half of the relatedness coefficient within progenies (Θ=r/2) as derived by Ritland, 1989. The other parameters were previously described. The average coancestry coefficient was also estimated through the global differentiation of the pollen pool (): (Bittencourt and Sebbenn, 2008).

Analyses of seed germination

To evaluate whether there were differences in germination between the seeds collected from the plot in the SW and those from the edge, we performed a germination experiment. Seeds from seed trees were germinated separately in germination boxes using a completely randomised design with three repetitions and 25 seeds per germination box. To test whether there were significant differences in the absolute germination values between the plot in SW and the edge, we used the F test of variance analysis within the SAS programme (SAS Institute Inc., 1999). To test whether there was a significant association between the average progeny germination (absolute values) and selfing, the fixation index, the observed heterozygosity or the expected heterozygosity, we estimated the Spearman rank coefficient of correlation also using the SAS programme (SAS Institute Inc., 1999).

Results

Genetic diversity and inbreeding

The total (ktotal) number of alleles per locus and the observed (Ho) and expected (He) heterozygosities were significantly higher in seeds collected from the SW than in those from the edge (Table 1). Even after sampling 11 progenies from SW, the observed (Ho=0.780; CI95%=0.775–0.802) and expected (He=0.843; CI95%=0.823–0.860) heterozygosities were significantly higher in seeds collected from the SW than in the edge.

Table 1 Genetic diversity parameters and fixation index in the C. langsdorffii offspring from the continuous SW and the edge of the ESA

The removal of the isolated tree in the western edge of ESA from the analysis did not affected the observed (Ho=0.713; CI95%=0.689–0.736) and expected (He=0.796; CI95%=0.756–0.822) heterozygosities. In the seeds collected from the SW, we found a total of 136 different alleles in the eight loci, with an average of 17.0 alleles among loci. In seeds from the edge, we found 131 alleles, with an average among loci of 16.4 alleles. We found 14 private alleles (frequency ranging from 0.002 to 0.042) in the seeds collected from the SW and 10 (frequency ranging from 0.002 to 0.065) from the edge. Although the fixation index (F) was significantly positive in the SW and at the edge (Table 1), which suggested inbreeding, it did not differ between these two sites. No difference was found even after sampling 11 progenies from the SW (F=0.063; CI95%=0.021–0.099) and removing the isolated tree of the edge from the analysis (F=0.104; CI95%=0.026–0.166).

Mating system

The estimated outcrossing rate was significantly lower than unity (1.0) at both sites (Table 2a) and was significantly higher in the SW (tm=0.859) than its edge (tm=0.759). Furthermore, in the multilocus outcrossing rate after sampling 11 progenies from SW (tm=0.861; CI95%=0.810–0.922) and removing the isolated tree of the edge from the analysis (tm=0.749; CI95%=0.645–0.805), no significant differences were found from the previous estimates based on 17 and 11 progenies, respectively. The outcrossing rate among related trees (tmts) was significantly different from zero in the SW and at the edge but was similar between the sites (Table 2a). The multilocus (rpm) and single-locus (rps) paternity correlation was also significantly different from zero, but did not differ between the sites (Table 2a). The effective number of pollen donors (Nep) was similar at both sites, indicating low reproductive dominance (Table 2a).

Table 2 Mating system parameters estimated from the mixed and correlated mating model (a) and TWOGENER analysis (b) from continuous SW and the edge of the ESA in C. langsdorffii

The progeny multilocus outcrossing rate (tm) was variable among seed trees, ranging from 0.47 to 0.99 in the SW and from 0.40 to 1.00 in the edge (Table 3). The multilocus outcrossing rate in the edge was <0.75 in 7 of the 11 open-pollinated progenies (64%), in contrast to 2 of the 17 progenies (12%) sampled in the SW. Individual differences in tmts were also variable among the seed trees, ranging from −0.06 to 0.12 in the SW and from −0.04 to 0.15 in the edge (Table 3). Multilocus paternity correlation (rpm) ranged from 0.02 to 0.11 in the SW and from 0.02 to 0.18 in the edge. Consequently, the effective number of pollen donors (Nep) were also variable among the seed trees, ranging from 9.0 to 45.5 in the SW and from 5.6 to 62.5 in the edge (Table 3).

Table 3 Estimates of mating system parameter for each progeny of the continuous SW and the edge of the ESA

Pollen pool differentiation and pollen dispersal

The global differentiation of the pollen pool (Φft) was significantly different from zero among seed trees in the SW (=0.088) and at the edge (=0.107), however, it did not differ between these two sites (Table 2b). Selfing significantly influenced the estimated value of . Excluding the effect of selfing, Φ′′ft was 19% higher in the SW and 26% higher at the edge compared with Φft (Table 2b). Moreover, for pollen pool differentiation when we sampled 11 progenies from SW (Φft=0.088; CI95%=0.064–0.115 and Φ′′ft=0.106; CI95%=0.070–0.132) or removed the isolated tree of the edge from the analysis (Φft=0.104; CI95%=0.056–0.142 and Φ′′ft=0.128; CI95%=0.058–0.163), no differences were found from the previous estimates based on 17 and 11 progenies, respectively. The multilocus paternity correlation (rp=2Φ′′ft) and Nep estimates (Table 2b) were significantly different than the values estimated using the MLTR programme (Table 2a). No significant decrease in Ψ was detected with the increase in the distance between seed trees in the SW (r=−0.137; P=0.11) and at the edge (r=−0.059; P=0.67). Thus, the KINDIST estimation of the pollen dispersal distance becomes unreliable and, consequently, δ and de were not estimated.

Coancestry, variance effective size and the number of seed trees for seed harvesting

The average coancestry coefficient (Θ) within progenies estimated from mating system parameters and from TWOGENER analysis was significantly lower in the SW (Θ=0.170; Θ=0.182, respectively) than at the edge (Θ=0.199; Θ=0.212, respectively; Table 2). The estimated parameters Θ, Ne and m did not differ between MLTR and TWOGENER procedures (Table 2). In progeny level, the estimated coancestry coefficient (Θ) was variable, ranging from 0.133 to 0.295 in the SW and from 0.128 to 0.320 in the edge (Table 3).

To both estimates from mating system parameters and TWOGENER analysis, the average variance effective size within a progeny was significantly lower than expected for a panmictic population (Ne=4) and was significantly lower at the edge than in the SW (Table 2). These results suggest that seed collected from the edge have generally higher relatedness and lower variance effective size than seeds collected from the SW. Among progenies, the estimate of the variance effective size ranged from 1.65 to 3.39 in the SW and from 1.53 to 3.50 in the edge (Table 3). Therefore, the minimum number of seed trees required to harvest seeds was significantly higher at the edge (=217 for mating system parameters and =229 for TWOGENER analysis) than in the SW (=189 for mating system parameters and =200 for TWOGENER analysis).

Seed germination

The average seed germination value was significantly higher (P<0.024) in seeds from the SW (23.4; s.e.=0.2) than in those from the edge (22.1; s.e.=0.6). However, at the progeny level, we do not find any significant association between germination and selfing, observed heterozygosity, expected heterozygosity and fixation index, using a linear Spearman rank correlation.

Discussion

Genetic diversity, inbreeding and mating system

The results of our study supported our initial hypothesis that seeds collected from the centre of a remnant have higher heterozygosity levels (Ho and He) than seeds collected at the edge (Table 1). This is most likely a consequence of the low population density of reproductive trees at the edge of the remnant, which results in the selfing rate higher than 0.25 in 64% of the open-pollinated progenies at the edge, in contrast to 12% of progenies from in the SW. This provides direct evidence of low genetic recombination in seeds generated at the edge. Selfing is expected at a higher rate in harsh environments (Goodwillie et al., 2005; Lowe et al., 2005) and in low-density populations (Murawski and Hamrick, 1991). Significant levels of inbreeding were detected at both sites, the results suggest that selfing decreased the heterozygosity levels and increase inbreeding in seeds from the edge. Thus, the higher levels of selfing may explain the low observed heterozygosity at the edge. However, in general, selfing in tropical tree species produces inbreeding depression, causing the mortality of selfed seeds and individuals as was observed in Eucalyptus argutifolia (Kennington and James, 1997), Eucalyptus marginata (Millar et al., 2000), Pseudobombax munguba (Gribel and Gibbs, 2002), Platypodium elegans (Hufford and Hamrick, 2003) and Neobalanocarpus heimii (Naito et al., 2005). We observed significant higher germination in the SW than the edge, suggesting that inbreeding depression was higher in seeds from the edge, where selfing was highest. The lack of association between germination and selfing, fixation index, observed heterozygosity or expected heterozygosity may be was caused by the inbreeding depression. However, inbreeding depression occurs in many ontogenic stages and, consequently, the levels of heterozygosity and the observed inbreeding in C. langsdorffii for both sites may substantially change between the seed and adult stages. If inbreeding depression were to affect selfed seeds, the observed levels of inbreeding in the SW and the edge would decrease significantly until the adult stage. Although data regarding inbreeding depression in the C. langsdorffii are not available, this phenomenon most likely also occurs in this tropical tree species. In C. langsdorffii, as in many tropical tree species, outcrossing is important for promoting the maintenance of genetic diversity and the adaptive potential of the species. A high adaptive potential counteracts the effects of environmental stochasticity, especially in harsh environments, such as remnant edges or newly colonised areas.

Pollen pool differentiation

Although, the average global differentiation of the pollen pool (Φ′′ft) seemed to be higher in edge than in the SW, no significant difference was found between sites. The possible reasons for the lack of difference in Φ′′ft values between both sites are related to sample design and edge effects: (1) the spatial distance among seed trees in the edge was larger than in the SW (Figure 1); (2) in the edge, there is a lower reproductive density of trees and a probable scarcity of pollinators. The large distance among the sampled seed trees in the edge, associated to the low reproductive density of trees and the scarcity of pollen donors probably increased pollen pool differentiation in the edge in comparison with the SW, because of the low overlapping pollen pool among seed trees in the edge, but increased also the CI95% due the higher individual variation in outcrossing rate (Table 3). In contrast, the spatial distance among the seed trees of the SW was lower, probably increasing overlapping pollen pool among seed trees and decreasing average global pollen pool differentiation in comparison with the edge. Thus, it is not possible to attribute the higher genetic differentiation in pollen pool only to the edge effects because there is a mixture of sampling effect with edge effects. To separate these effects, a further study must be conducted, based on sampling seed trees in the SW more distant than has been sample in the present study.

The paternity correlation estimate (rpm) and the effective number of pollen donors (Nep) were significantly different between correlated matings model (Ritland, 1989) and TWOGENER analysis (Smouse et al., 2001). The correlated mating model estimated a low paternity correlation and consequently, high number of pollen donors in both the SW and the edge, and no differences were found between the two sites (Table 2a). In contrast, the TWOGENER analysis estimated a significantly higher paternity correlation and a lower number of pollen donors than the correlated mating model for both the SW and the edge (Table 2b). The difference in the results between the two approaches is associated to the differences in the methods of estimation. In the TWOGENER analysis, both parameters rp and Nep were estimated indirectly from the pollen pool differentiation (Φ′′ft), a parameter strongly affected by the sample design. For example, the distances between seed trees in SW were lower than seed trees located in the edge, which resulted in a higher average global pollen pool differentiation in the edge (Table 2) and consequently, resulted in the significant higher rp and lower Nep in the edge than in SW. In contrast, in the correlated mating model, the parameter rp is estimated from the expression, rp=[2/(1+F)]f, where F is the inbreeding in the parental parents and f is the correlation of paternal gametes (Ritland, 1989). Thus, joint estimation of inbreeding levels in the parental and offspring generation are necessary to estimate the paternity correlation (Ritland, 1989), but this estimate is independent of distance among seed trees. In summary, the correlating mating model estimates rp from the inbreeding in the parental parents and the correlation of paternal gametes and does not directly depend on the distance of seed trees as is the case of TWOGENER analysis. This can explain the observed difference between the two methods in the parameters rp and Nep. Thus, as our sample design, based on seed trees located in very different distance among seed trees between the SW and edge does not allow a robust comparison between these two sites. For our specific case, correlated mating model is more robust to compare the estimated parameters.

Although the estimated paternity correlation and the effective number of pollen donors were significantly different between the methods, in general no significant differences were observed for the coancestry coefficient (Θ), the variance effective size within progenies (Ne), or the necessary number of seed trees from which seeds should be collected (m). This absence of differences is caused by the fact that the parameters Ne and m were estimated from the parameter Θ, and this last parameter is mainly affected by the levels of selfing (or outcrossing). As we used the same values for selfing (and outcrossing), which were calculated from a mixed mating model to estimate the parameter Θ, for both models, no differences were detected in the estimate of the three parameters Θ, Ne and m.

Implications for conservation genetics and evolution

Selfing, mating among relatives and correlated matings increase the frequency of identical-by-descent alleles within open-pollinated progenies. In another word, these three cited process increase inbreeding and the relatedness and decrease the heterozygosities within the progenies, as observed comparing the SW and the edge (Table 1). This has strong implications for harvesting seeds for conservation, breeding programs and environmental restoration, as the objective is to achieve maximal genetic diversity and variance effective size in the seed samples. Therefore, seed harvesting must preferably be conducted where selfing, mating among relatives and correlated matings are less frequent. Thus, according to the estimates of genetic diversity, selfing, coancestry within progenies and average variance effective size (Ne), seed harvesting must be performed within the SW. At this site, seeds must be harvested from at least 189 seed trees so that the sampled progeny arrays reach the required Ne of 500. However, it is important to note that some seed trees should be selected at the edge of the remnant to compound the total sample, as these seeds contain private alleles that originated from other demes.

Data archiving

Data deposited in the Dryad repository: doi:10.5061/dryad.73g3d.