Introduction

Geographical patterns of population genetic variation within plant species are shaped by many factors. Some are related to traits intrinsic to the species’ biology (reproduction mechanisms, modes of seed and pollen dispersal, gregariousness), whereas others are linked to disturbances, mainly habitat fragmentation, induced by natural processes (for example, fire, climate change) or human impact (conversion of forested areas into agricultural land, logging). Quaternary climatic fluctuations caused important changes in the distribution ranges of many plant species and thereby strongly affected their large-scale genetic structure (Comes and Kadereit, 1998; Hewitt, 2000). The signature of the Quaternary ice ages in present-day patterns of genetic structure may be especially important for species with long life cycles such as trees, because probably fewer than 100–1000 generations (depending on the species) have elapsed since the last glacial maximum 15 000 radiocarbon (14C) years ago (Walker, 1995). The most recent glacial maxima occurred from 1 60 000 to 1 30 000 and from 24 000 to 12 000 years before present (Maley, 1996). During these episodes, the climate in tropical Africa was drier and cooler, causing an expansion of savannah and a contraction of lowland tropical rainforests into regions still presenting sufficient humidity, as demonstrated by pollen records in lake sediments (Maley, 1996). Hence, fragmentation-expansion cycles of the lowland rain forest habitats likely occurred during the Pleistocene, causing population fragmentation of their constituent species (Flenley, 1998). More recently (around 2500 cal years BP), the rainforest in Africa has also been disturbed by a southwards shift of the intertropical convergence zone, as attested by a high-resolution pollen profile obtained from South West Cameroon (Ngomanda et al., 2009).

Although the impact of the last ice age on the past distribution and the genetic structures of trees from Europe and North America is well documented, there is very little information for tropical tree species, particularly in African rainforests. The latter are mainly concentrated in the Guineo-Congolian phytogeographical region, which is composed of three domains (White, 1979): the Upper-Guinean domain (comprising forests of West Africa), the Lower-Guinean domain (the Atlantic Central Africa, from Nigeria to Congo) and the Congolian domain (comprising the whole Congo basin until the Albertine Rift in the East of the Democratic Congo Republic). The Lower-Guinean domain harbours the richest flora, and several centres of high species diversity and/or endemism for many genera have been identified (for example, in West-Cameroon and in West and Central Gabon; see White, 1979; Sonké et al., 2007). The richness of the Lower-Guinean domain has often been interpreted as the signature of past forest refuges, and the current distribution of plant species has been used to attempt to identify these refuges (Maley, 1996). However, given the paucity of direct evidences, such as pollen records, the intensity of forest fragmentation, the determination of historic localization of past forest fragments, and the degree of isolation between large fragments (gallery forests could constitute corridors or even refuges for various forest-dwelling species) are essentially matters of speculation. Moreover, centres of diversity or endemism are concordant with areas of complex topography and/or with high rainfall so that the effects of history and of environmental gradients on species diversity are generally confounded. As the last climate changes causing forest fragmentation occurred relatively recently, it might be more pertinent to investigate their effects on within-species patterns of diversity rather than on species endemism and richness.

Different forms of genetic signatures of past demographic changes may be exploited. The identification of differentiated and geographically coherent gene pools within species that nowadays show wide ranging and essentially continuous distributions could indicate past population fragmentation, potentially due to past changes in vegetation cover. Recent population expansion or decline may also leave a signature in the distribution of allele frequencies within locus due to mutation-drift disequilibrium (Cornuet and Luikart, 1996). Genetic markers keeping an information on the successive mutation events, such as DNA sequences or microsatellites mutating in a stepwise manner, may also be used to test the occurrence of a phylogeographic signal; for example, (Pons and Petit, 1996; Hardy et al., 2003), giving clues about the relative time depth of observed genetic divergences between populations. Depending on the extent of gene dispersal, the genetic signature of past population fragmentation will progressively vanish once differentiated gene pools begin to exchange genes. Hence, one avenue to improve our understanding of past tropical forest cover changes is to analyse large-scale geographic patterns of genetic variation of forest-dwelling species as well as their gene dispersal distances. Evidence for converging patterns among species would give support for a strong determinism of past vegetation changes. However, nowadays, such data are very rare for central African species (Born et al., 2008).

Gene dispersal in plants is possible only through the dispersal of pollen and seeds. Fairly reliable indirect estimates of gene dispersal distances can be obtained from the spatial genetic structure (SGS) (Fenster et al., 2003; Hardy et al., 2006). Such estimates can give insights into the number of generations during which a genetic signature of past fragmentation should remain. In addition, dispersal capacities may be constrained by peculiar landscape features and demographic parameters. Comparing SGS among sites may provide insights into the range of variation of gene flow within species. For instance, forest fragmentation can increase SGS by breaking up gene flow and increasing local drift because of the reduction of the number of cohorts of many species (Dick et al., 2008). In contrast, long-distance seed dispersal and landscape changes that increase dispersal capacities of species may reduce the stochastic action of drift and lead to a decrease in SGS.

Using the timber tree Distemonanthus benthamianus as a model species displaying, nowadays, a large continuous distribution in the Lower Guinea domain, the aim of this study is to characterize its genetic structure at various spatial scales in Cameroon and Gabon in order to detect possible signs of past population fragmentation and recolonization, and to estimate the extent of gene dispersal. More specifically, using microsatellite markers that were developed for this study, we address the following questions: (1) Are there large-scale geographic discontinuities in the pattern of genetic variation, indicating the occurrence of distinct gene pools? (2) Is there evidence for a contribution of stepwise mutations in the differentiation of gene pools, indicating ancient isolation? (3) Is there any genetic evidence of recent demographic changes? (4) What is the outcrossing rate and the extent of gene dispersal distances according to isolation by distance models? Our results are discussed in the light of possible causal processes and compared with those of a couple of other tree species from Central Africa to assess whether some general patterns emerge.

Understanding the genetic dynamics of tree populations and the origin of geographical patterns of genetic variation can provide key information about dispersal and regeneration for improving population management by logging companies (Lowe et al., 2005). This is all the more important for a heavily exploited timber tree such as D. benthamianus that suffers very limited in situ regeneration in mature forest. Although this species did not receive any IUCN classification yet, it was ranked 7th by the FAO among the 10 species considered potentially most susceptible to loss of genetic variation due to logging in southern Cameroon (Namkoong et al., 2002).

Materials and methods

Biological model and sampling

D. benthamianus Baillon (Caesalpinioideae) is a widely but sparsely distributed tree throughout the high forests of West and Central Africa, from Ivory Coast to Gabon and Congo. It is common in Cameroon and Gabon, where it is known as African satinwood, or more generally as Movingui. One of the big trees (40 m high and >1.20 m trunk diameter) of evergreen, semi-deciduous and secondary forests, D. benthamianus is traditionally used in African medicine to treat bacterial, fungal and viral infections (Nguelefack et al., 2005). Its modern uses include cabinetwork, joinery, flooring and decorative veneers, and it is generally suggested as an oak alternative. Apart from its geographic range and data on its population density in a few forest concessions, little is known about its biology. Flowers are hermaphroditic; the fruit is a flat, thin-walled, indehiscent pod measuring 10 cm by 3.5 cm (Aubréville, 1968). Seeds are dispersed by wind (Hecketsweiler, 1992) to distances of up to 50 m (Cordiez, 2000). At least three monkey species (Lophocebus albigena, Cercopithecus nictitans, Cercopithecus pogonias; Poulsen et al., (2002)) prey on its seeds (Gautier-Hion et al., 1993). Feeding only on immature fruits, they are quite unlikely to contribute to dispersal. Fruiting begins at a diameter at breast height (d.b.h.) of 20 cm, but only trees with a d.b.h. of 40 cm or larger fruit regularly. Trees grow on average between 3 and 4 mm in diameter annually. D. benthamianus is a light-demanding species and its natural regeneration in mature forest is very limited. Very few saplings are observed in closed-canopy forests (Doucet, 2003).

We collected samples (leaves or bits of cambium) from 295 adult D. benthamianus in eight different sites in Central Africa (three in Cameroon, five in Gabon), hereafter referred to as ‘populations’. These are not in fact locally circumscribed groups of individuals isolated from each others, as the species is distributed rather continuously through the whole sampling area (Figure 1a). Samples were collected and immediately dried in silica gel. For spatial analyses, we only used 286 trees with precise GPS localizations obtained with a Garmin GPSmap 60CSx (Garmin, KS, USA) with a 10 m estimated error.

Figure 1
figure 1

(a) The eight sampled populations of D. benthamianus in Central Africa. In Cameroon: 1-AbongMbang, 2-Ebolowa, 3-Sangmelima. In Gabon: 4-CBG, 5-CEB, 6-Libreville, 7-Mitzic, 8-Oyem. (b) Synthetic map of the genetic clustering of D. benthamianus in Central Africa, based on STRUCTURE results. Empty squares: individuals belonging to the cluster ‘North’. Black triangles: cluster ‘West’. Empty circles: cluster ‘South’. The discontinuous lines delimit the three clusters. The thermal equator is figured, as it may be a current obstacle to gene flow by pollen due to the inversion of seasons.

Development of microsatellite markers

Microsatellites were isolated using a fast and cheap protocol, modified from Hamilton et al. (1999) by TC Glenn (Savannah River Ecology Laboratory, University of Georgia, GA, USA). The detailed protocol of microsatellite isolation is available at request from us. Briefly, total genomic DNA was extracted from cambium (providing cleaner DNA than that from leaves) of two adult trees using our own CTAB protocol, modified from Doyle and Doyle (1987). A partial enriched genomic library was then constructed. A total of 192 white transformant clones were hybridized with (AC)12. Around 50% of these clones gave a positive signal and the inserted DNA fragment was sequenced in all of these positive clones. Twenty-seven primer pairs, designed using the software OLIGO 3.3 (Rychlik and Rhoads, 1989), were tested and 17 gave satisfactory amplification patterns (that is, a PCR product of the predicted size, and no supernumerary bands). Protocol for the PCR amplifications is also available on request. We sized PCR products using an automatic ABI PRISM 3100, the 500 LIZ GeneScan size standard (Applied Biosystems, Foster City, CA, USA), and GENEMAPPER v3.0 software (Applied Biosystems). For genotyping we only used 10 out of the 17 microsatellite loci that were isolated from D. benthamianus, chosen so as to easily genotype individuals by multiplexing loci in one genotyping run. These 10 loci are named dis028, dis054, dis069, dis103, dis116, dis125, dis126, dis127, dis138 and dis140, and were given GenBank accession numbers FJ654552, FJ654554, FJ654555 and FJ654557–FJ654563 (Supplementary Table S1).

Genetic data analyses

We first controlled for the absence of heterozygote deficiency or linkage disequilibrium by genotyping 60 individuals, sampled from two populations (population 1 from Cameroon and population 5 from Gabon; Supplementary Table S1).

To infer the impacts of limited gene dispersal, population history and habitat fragmentation on isolation by distance, we followed a stepwise approach consisting of a Bayesian clustering method to detect differentiated gene pools followed by the analysis of kinship-distance curves.

Genetic diversity and testing demographic changes

We used either GENEPOP 3.4 (Raymond and Rousset, 1995) or FSTAT 2.9.3.2 (Goudet, 1995) to compute standard estimators of genetic polymorphism. We computed V, the variance of allele size, and calculated expected (HE) and observed heterozygosity (HO), averaged over loci. We tested for heterozygote deficiency using the Markov chain method. FIS values were calculated for each population and their significance tested by permuting the distribution of alleles within populations. It provides information on the cumulative action of inbreeding, population substructure and potential null alleles. We used the rarefaction method (in FSTAT) to compute allelic richness (rAOL, rarefied mean observed number of alleles per locus) because it gives more weight to rare alleles and thus better takes into account mutation accumulation that may occur in older populations.

To detect past demographic changes, the ‘bottleneck’ statistic T2 (Cornuet and Luikart, 1996) was computed at the population level. It represents an average across loci of the deviation of the actual gene diversity HE from the gene diversity expected from the number of alleles in the population HA assuming mutation-drift equilibrium in a population of constant size. Positive values of T2 reflect a gene diversity excess possibly caused by recent founder events, whereas negative values are consistent with recent population expansion without immigration. T2 was computed with BOTTLENECK 1.2.02 (Piry et al., 1999), according to the Stepwise Mutation Model (T2 SMM). We used T2 under the SMM model because alleles at our loci were separated by multiples of 2 bp with the exception of a single allele at locus dis054, suggesting that stepwise mutation is the norm. T2 was also computed under the Infinite Allele Model to assess the robustness of the results with respect to the mutation model. Significant deviation from equilibrium gene diversity was determined using the Wilcoxon signed-rank test, which is the most appropriate test when only few polymorphic loci are analysed (Piry et al., 1999).

Identification of spatial genetic discontinuities

Historical events may have produced spatial discontinuities in the genetic variation of D. benthamianus that could be detected by Bayesian cluster analyses. We used the model-based clustering method implemented in STRUCTURE 2.2.3 (Pritchard et al., 2000). This method assigns individual multilocus genotypes (data consisting of unlinked markers) probabilistically to a user-defined number K of clusters (or gene pools), in order to approach linkage equilibrium within clusters. We ran STRUCTURE 10 times for each K=1–10, for 1 00 000 iterations after a burn-in period of 40 000 on the data set with precise localizations (n=286 individuals) with no earlier information on the population of origin of each sampled individual. We used the admixture model, in which the fraction of ancestry from each cluster is estimated for each individual. The parameter of individual admixture α was chosen to be the same for all clusters and it was given a uniform prior. The allele frequencies were kept independent among clusters in order to avoid overestimating the number of clusters (Falush et al., 2003).

To take into account the spatial coordinates, we used the spatial Bayesian clustering algorithm implemented in TESS 2.0 (Chen et al., 2007), which assigns individuals to geographical clusters and estimates their probabilities of membership in clusters. As suggested by the authors of the programme, to include more realistic features of spatial structure, we removed the very long edges in the neighbourhood network (those >150 km). In our case, including these very long edges did not change the results, but greatly increased the computation time. We used the no-admixture model with an interaction parameter ψ ranging from 0.6 to 1. The interaction parameter is requested to implement TESS before distribution on cluster configuration and affects the relative weight given to spatial positions and genotypes when assigning an individual to a cluster (authors suggest using ψ in the range (0.5, 1.0)). After preliminary computations up to K=10, the maximum number of clusters was fixed to K=6. We did 500 runs per ψ (ψ=0.6, 0.8 and 1), with a burn-in number of sweeps of 10 000 and 50 000 iterations.

Differences in allelic richness, observed and expected heterozygosities, and FIS between clusters were tested using FSTAT with 5000 permutations.

Characterisation of SGS

Differentiation between populations was analysed based on allele identity with the statistic FST, or on allele size with RST. To characterize polymorphism and among-population differentiation for the species, multilocus estimates of these statistics were obtained. The contribution of stepwise-like mutations to population differentiation, that is, whether RST >FST, was investigated by testing whether the observed RST was larger than its value obtained after permuting allele sizes among allelic states (Hardy et al., 2003).

We assessed SGS following the procedure described by Vekemans and Hardy (2004). We computed pairwise kinship coefficients between individuals using the software SPAGEDI v1.2g (Hardy and Vekemans, 2002) and J Nason's estimator of kinship coefficient (Loiselle et al., 1995). Kinship coefficient values (Fij) were regressed on the natural logarithm of the spatial distance between individuals, ln(dij), providing the regression slopes bLd. Standard errors were assessed by jackknifing data over loci. To visualize SGS, kinship coefficient values were averaged over a set of distance intervals (d), giving F(d), and plotted against geographical distance. Seven classes of pairwise distance intervals were chosen following a near-geometric distribution: the six boundaries were 1, 4, 8, 16, 32 and 64 km (a boundary at 2 km was removed because it caused an interval encompassing too few pairs of individuals). To test for SGS, spatial positions of individuals were permuted 9999 times in order to obtain the frequency distribution of bLd under the null hypothesis that Fij and ln(dij) were uncorrelated.

In four populations (1, 2, 3 and 5) in which at least 40 individuals were sampled, we examined intra-population genetic structure, estimating the statistic Sp (Vekemans and Hardy, 2004; Hardy et al., 2006). To compare the extent of SGS among these populations over the same spatial scale, we calculated bLd60, the regression slopes of Fij on ln(dij) for dij <60 km (the maximum distance that gave a good representation of pairs of individuals for each population, that is, at least 63% of pairwise comparisons were considered in each population). We then calculated Sp, defined as Sp=−bLd60/(1–F(1)), where F(1) is the mean Fij between neighbouring individuals, which was approximated by F(d) for the first distance interval (dij <1 km). Sp was also computed for dij <10 km to allow comparisons with other tropical tree species at comparable spatial scales (Dick et al., 2008), but a serious drop of precision was consequently achieved, as on average only 18% of all pairwise comparisons were considered.

Estimation of gene dispersal distances and outcrossing rate

Wright's neighbourhood size, Nb≈Deσ2g, where De is the effective density and σ2g is half the mean squared parent–offspring distance, can be estimated as Nb=−(1–F(1))/br when the regression slope, br, is computed within the restricted distance range σg >dij >20σg (Hardy et al., 2006). As σg is unknown, an iterative approach can be applied to estimate jointly Nb and σg knowing De, as implemented in the SPAGEDI software. The quality of the joint estimate is obviously highly sensitive to the quality of the value assumed for De (Hardy et al., 2006). We estimated Nb and σg for populations 2, 3 and 5. For population 5 (Gabon), precise species-specific tree density (D) computed from forest inventories was obtained from the CEB, a logging company of the Precious Woods group. For both Cameroonian populations (2 and 3), we estimated D from a still-unpublished atlas map of densities for Cameroonian tree species (C Doumenge, personal communication) and from forest inventories from concessions in the vicinities of our sampling locations. In all cases, D was computed for trees with a minimum d.b.h.40 cm, because in Cameroon these were the sole data available. Fortuitously, this appears to be a biologically relevant threshold size, because D. benthamianus trees only flower regularly when their d.b.h. is more than 40 cm (Doucet, 2003). Moreover, at the study site for population 5, taking into account only trees with a d.b.h. more than 40 cm instead of over 30 cm caused a reduction of only 15% in estimated density (Supplementary Table S2). The slight underestimation of mature tree density will lead to a slight overestimation of σ2g, and that will be minored when computing the square-root to obtain σg. De can be approximated as D × Ne/N, where Ne/N is the ratio of the effective population size to census population size (Nunney, 1993; Vekemans and Hardy, 2004). Demographic studies show that this ratio typically ranges from 0.1 to 0.5 in populations of adult plants (Frankham, 1995). We thus used D/2, D/4 and D/10 as alternative estimates for De. For a fixed De, lower and upper bounds for the 95% confidence interval (CI) of Nb were computed as (F(1)–1)/(br+2SEb) and (F(1)–1)/(br–2SEb), respectively, SEb being the standard error of the br estimates obtained by jackknifing data over each locus. The 95% CI of σg was obtained using an analogous procedure.

Intrapopulation multilocus selfing estimates were computed with the software RMES (David et al., 2007). These estimations of selfing rates are based on the distribution of multilocus heterozygosity and, unlike those derived from FIS, have been shown to be insensitive to technical artifacts such as null alleles (David et al., 2007).

Results

Genetic diversity

In the two populations (1 and 5) used to control the quality of our markers, we observed no significant deviation from Hardy–Weinberg equilibrium nor from linkage equilibrium, using exact tests (GENEPOP 3.4 package; Raymond and Rousset, 1995) after correction for multiple tests with a sequential Bonferroni procedure (Supplementary Table S1).

All loci were polymorphic over the eight sampled populations, with the number of alleles ranging from 6 to 21, except for dis125, which exhibited only two alleles. We thus used a total of 115 different alleles. High levels of variation were found among loci, and observed heterozygosity ranged from 0.00 to 0.93 per locus and per population. All loci were at Hardy–Weinberg and linkage equilibrium over the eight sampled populations.

Although the sampling areas of the eight populations varied greatly (from 1000 to 5000 km2), no relation was found between the extent of the area sampled and the following genetic descriptors (Table 1): rAOL (Spearman's ρ=0.262; P=0.53), V (Spearman's ρ=0.119; P=0.78), HO (Spearman's ρ=0.405; P=0.32), HE (Spearman's ρ=−0.02; P=0.96) and FIS (Spearman's ρ=−0.786; P=0.02). We thus treated populations as equivalent units for further analysis. There was slight but significant genetic differentiation among these populations: overall value of FST was 0.0739.

Table 1 Description of the samples and parameters of genetic variation for the eight studied populations

Under a strict SMM model, all populations showed a negative value of T2 (consistent with recent population expansion without immigration) and a significant heterozygosity deficit (P<0.01). Under an Infinite Allele Model, no populations showed significant heterozygosity excess or deficit (P >0.05), and T2 was negative only in populations 2, 5 and 7.

Macrogeographic genetic discontinuities

STRUCTURE's posterior probabilities of K, log-likelihood of the microsatellite data on 286 D. benthamianus individuals given K clusters, and the corresponding ΔK statistic (Evanno et al., 2005), all showed a peak at K=3, indicating that three genetic clusters was the best solution for K given the data. Only 17 individuals out of the 286 (that is, 6%) could not be assigned to one of the three clusters (Passignation <0.5 to each cluster). These three clusters contained 20, 41 and 39% of all assigned individuals (Table 2). These clusters corresponded quite closely to three geographical groups. The first (the least supported) contained 59% of its individuals from the Gabonese coast (populations 4 and 6), the others mostly coming from the western part of our Cameroonian sample (population 2); the second cluster contained 88% of its individuals coming from the inland of Gabon (populations 5, 7 and 8); and the third cluster contained 97% of its individuals coming from Cameroon (populations 1, 2 and 3), the few others coming from northern Gabon (population 8).

Table 2 Parameters of genetic variation for the three clusters (see Table 1 for signification of symbols) and mean pairwise differentiation between populations within and among clusters based on allele identity (mFST) or allele size (mRST)

We then used the spatial Bayesian clustering algorithm TESS. For three different values of the interaction parameter (1, 0.8 and 0.6), we obtained a likely clustering of our individuals into three to five different clusters. When ψ equalled 1 (influence of spatial data greater than that of genetic data), we obtained three clusters (smallest DIC estimates) that were the Gabonese coast, inland Gabon and the Cameroonian populations. When the value of ψ was decreased to 0.8, a fourth cluster was obtained through the division of Cameroonian samples in western (population 2) and eastern (populations 1 and 3) parts. When ψ=0.6, five clusters was the most likely solution for our data, but the fifth cluster was composed of only a single Gabonese coastal individual separated from the others.

From the STRUCTURE and TESS analyses, we thus concluded that the most likely clustering of our data was a division into three clusters: one called ‘North’, containing all three Cameroonian populations (1–3); ‘West’, containing the two coastal Gabonese populations (4 and 6) and ‘South’ that contained the three populations from inland Gabon (5, 7 and 8). There were no overall significant differences among the three clusters in allelic richness, observed heterozygosity, gene diversity or FIS (Table 2), and no pairwise significant differences (Wilcoxon tests; all P>0.05), except for allelic richness, which showed a tendency to be higher in the ‘North’ cluster than in the ‘South’ cluster (Z=−2.4; P=0.014).

Taking into account all eight populations, isolation by distance is marked (P=0.0133). Isolation by distance may cause fictitious clusters when using clustering method such as the STRUCTURE algorithm. One way to check the reliability of the clusters proposed is to compare a posteriori pairwise FST between populations distinguishing within- and among-cluster pairs. For the same range of geographical distances (that is, <350 km), average pairwise FST (mFST) increased threefold between within-cluster (mFST=0.027) and among-cluster comparisons (mFST=0.086; Figure 2). In addition, there was no FST-distance correlation for the among-cluster pairwise comparisons (slope b≈0, R2=0.005, P=0.76), suggesting that the genetic structure of each cluster was established independently. FST between the eight sampled populations revealed that the highest differentiations occurred between populations of the cluster “South” and the others. According to allele size randomization tests, RST did not exceed significantly FST (Table 2), suggesting that the number of generations since population isolation is lower than the reciprocal of the mutation rates of the locus considered (Hardy et al., 2003).

Figure 2
figure 2

Among- and within-clusters pairwise FST between populations according to geographical distances (in km). Black diamonds: among-cluster comparisons; empty squares: comparisons within the ‘North’ cluster; empty triangles: comparisons within the ‘South’ cluster; empty circles: comparisons within the ‘West’ cluster.

Spatial genetic structure and estimation of gene dispersal and outcrossing rate

In the four populations (1, 2, 3 and 5) with at least 40 individuals, the obtained slope bLd60 of the regression of the kinship coefficient values (Fij) on the spatial distance was significant (all P<0.001). Values of the statistic Sp were relatively similar, ranging from 0.008 to 0.013 (Table 3). Estimation of Sp at a smaller spatial scale (<10 km) gave values ranging from 0.010 to 0.027 (Table 3).

Table 3 Estimates of SGS and parameters of gene dispersal for four populations

We succeeded in estimating σg, the square root of half the mean squared parent–offspring distance, in three populations (2, 3 and 5); the algorithm did not converge for population 1, surely because of the low number of individuals (n=40). Over populations 2, 3 and 5, estimates of the neighbourhood size Nb ranged from 50 to 110 individuals (95% CI: 18-∞) and estimates of σg ranged from 400 to 1200 m (Table 3). CIs of these estimates usually had no upper bound limit. However, in population 3 in which the SGS was marked, we obtained more precise 95% CI for two effective densities (De) estimated from the density of adults (D): for De=D/4, σg=601 m (95% CI: 441–1072) and for De=D/10, σg=746 m (95% CI: 623–1783) (Table 3).

In all populations, both estimations of the multilocus selfing rate, s(ĝ2) and s, were not significantly different from zero, except for the estimate of s(ĝ2) in population 2 (s(ĝ2)=0.12; P=0.006). Globally, there was no among-population significant difference in the selfing rates encountered, and upper bound estimates were always less than 0.23 (Table 1).

Discussion

Macrogeographic genetic structuring

Despite the continuous geographic distribution of D. benthamianus in tropical forests of Central Africa, we detected three distinct genetic clusters of this tree species in Gabon and Cameroon. The distribution of individuals in these three clusters demonstrated a clear geographical pattern (Figure 1b): a genetic discontinuity separated ‘North’ (104 individuals) and ‘South’ (109 individuals) clusters from inland regions, and a less clearly defined ‘West’ cluster (56 individuals) contained mostly individuals from coastal regions, but also encompassed a few individuals with further inland location. According to microsatellite allele size randomization tests (Hardy et al., 2003), stepwise mutations have not contributed to the differentiation among these genetic clusters, which would be due to genetic drift only (RST=FST, no phylogeographic signal). Tests based on the distribution of allele frequencies suggest recent population expansion, at least when a model of stepwise mutations is assumed.

One potential explanation of the existence of several genetic clusters in the range of one species is the presence of barriers to gene flow. Along with historical factors (mainly, colonization history), environmental heterogeneity (for example, rainfall levels (Letouzey, 1968)) and physical barriers (for example, rivers (Anthony et al., 2007)) are key factors that have already been demonstrated in Africa to influence species distribution, and thus their genetic structure, but mostly for animals species. Another potential ecological barrier is due to the annual movement of the intertropical convergence zone, a band of clouds that encircles the globe near the thermal equator and determines the seasonality of precipitations. South and north of the thermal equator, situated 2°N in Africa, seasons are inverted, that is, they are shifted by 6 months, inducing a time-lag in plant flowering phenology that may slow down pollen-mediated gene flow, but should not affect seed-mediated gene flow. As for D. benthamianus, the limit between the ‘North’ and ‘South’ clusters matches the position of the thermal equator, while no physical barriers could explain that differentiation. Whether phenological isolation around the thermal equator really occurs in this species should be investigated: we know that fruiting of D. benthamianus usually occurs at the end of the short dry season (Doucet, 2003), and it should be studied if, for instance, one gene pool is programmed to flower during one dry season, while another gene pool flowers 6 months later. However, this explanation seems unlikely given the high inter-annual variability of seasonality and the ability of plant species to respond phenologically to this variability. Moreover, temporal isolation would not prevent gene flow through seed dispersal and cannot explain the presence of the ‘West’ cluster. Presence of ‘mountains’ (zones of higher elevation) may act as barriers limiting contemporary gene flow in the transition zone between the ‘West’ genetic unit and the others, but the lower elevation of the plain of the Ogooué River (the main river of Gabon) significantly disrupts this potential physical barrier.

Hence, these clusters should rather be linked to the biogeographical history of the region. Forest fragmentation, for example, during cold and dry periods of the Last Maximum Glaciation at the end of the Pleistocene (Maley, 1996; Plana et al., 2004), likely caused the decline of D. benthamianus, which would have survived only in isolated populations, each subject to drift. With the return of more favourable conditions, the species then recolonized the whole region from these relict populations. Being a light-demanding tree species, D. benthamianus may have undergone particularly fast population and range expansion during the early phase of forest recolonization. According to this hypothesis, genetic differentiation of several pools with distinct allelic frequencies could be the signature of expansion of differentiated relict populations following the Last Maximum Glaciation.

Dating past events from population genetics data is notoriously difficult because theoretical inferences are usually relative to a mutation rate or calibration points that are not available. We cannot rely on genetic data to obtain a narrow range of age estimates for past events given all the unknown parameters. We can only check whether the proposed demographic scenario occurred on a timescale compatible with genetic evidences, but this gives at best the order of magnitude of the time elapse since population isolation. In our case, as RST is close to FST, the number of generations of isolation should be less than the reciprocal of the mutation rate (Hardy et al., 2003). Assuming that the latter is close to 10−3, a reasonable value for microsatellite markers, we suggest that populations were isolated less than 1000 generations ago, otherwise RST would be expected to exceed FST. Assuming a mean generation time >100 years, consistent with the facts that in D. benthamianus the minimal d.b.h. of fruiting trees is 20 cm, the mean d.b.h. of fruiting trees is 40–45 cm and the mean annual diameter increment is 3–4 mm (Doucet, 2003), our data seem compatible with fragmentation of populations during the last glacial period (1 10 000 to 9700 years ago, just before the Holocene) of the current Quaternary period. With gene dispersal distances of the order of one kilometre per generation, many generations are needed to erase a genetic signature of past fragmentation throughout a region encompassing 1000 km from one end to the other, such as the one studied. This is again compatible with recolonization beginning early in the Holocene. The limited ability of the clustering software to assign some individuals to one of the three clusters, especially in western Gabon, may be a consequence of partial admixture. Forests in Central Africa have also been affected by major climatic events that occurred more recently, 4000 and 2500 years BP (Maley, 2002), and these might also have caused forest fragmentation. However, these disturbances lasted only for relatively short times, and population sizes in isolated fragments would have to have been very small to account for the observed FST (<150–250 individuals, assuming a generation time of 100 years and very recent population expansion). The intense drift would have resulted in relatively low levels of gene diversity, which is not observed. Hence, these recent events are unlikely to account alone for the observed patterns. The fact that signs of recent population expansion under a strict stepwise mutation model were observed again supports a scenario of relatively recent recolonization after fragmentation occurring during the Last Maximum Glaciation, at the end of the Pleistocene. However, this signature of population growth is dependent on the mutation model assumed.

The observed distribution of clusters (Figure 3a) can be compared with that observed for two other tree species living in the same region and genotyped with nuclear microsatellites: Milicia excelsa (Moraceae; Figure 3b) exhibits one cluster in southern Cameroon and north-eastern Gabon and a second in the rest of Gabon (Daïnou K and Heuertz M, personal communication), whereas Aucoumea klaineana (Burseraceae; Figure 3c) exhibits four differentiated genetic clusters within Gabon (Born, 2007). Hence, within the areas where these three species co-occur, there is no clear geographical matching between their respective genetic clusters. The basic assumption that all species share the same Quaternary refuges and recolonization pathways is invalidated here. Likewise, in Europe, even closely related species living in the same habitats can display highly contrasted phylogeographic patterns (Michaux et al., 2005). This suggests that the species-specific phylogeographic pattern may be a general feature and that the search for the influence of common historical factors to explain the phylogeographic structure of different species may turn out to be very complex but see reference Alvarez et al. (2009). Nevertheless, it is noteworthy that a north-south discontinuity of genetic variation occurring in the vicinity of the border between Cameroon and Gabon is observed both in D. benthamianus and in M. excelsa (Figure 3a and b), as well as in two other tree species using other genetic markers: Irvingia gabonensis according to RAPD (Lowe et al., 2000) and Greenwayodendron suaveolens according to cpDNA sequences (Dauby G, personal communication). Moreover, the northern extension of A. klaineana also matches approximately this limit (Figure 3c). If this trend was confirmed in additional species, it might indicate that forests north and south of the thermal equator were isolated for enough time to cause population differentiation and some species might have survived in only one of them. In D. benthamianus, a (slight) relationship exists between belonging to a genetic cluster and some ‘environmental’ heterogeneity (presence of ‘mountains’ and localization of the thermal equator), contrary to Aucoumea (Born, 2007). The riverine barrier hypothesis, which has been verified with respect to the structure of genetic diversity mainly in primates (Eriksson et al., 2004; Anthony et al., 2007), remains largely untested in plants, notably in African rainforests. Here, it appears that the two main rivers of the sub-region (the Ogooué River in Gabon, and the Sanaga River in Cameroon), that both flow east to west, may not particularly act as physical barriers.

Figure 3
figure 3

Maps of genetic clusters determined for three species of tropical African timber trees. Dashed lines stand for the known range limits of the species. (a) D. benthamianus (this study): the hatched zone in the upper light grey area shows the putative fourth cluster. (b) Milicia excelsa (Daïnou K and Heuertz M, personal communication) with one cluster in southern Cameroon and north-eastern Gabon and one in the rest of Gabon. (c) Aucoumea klaineana (Born, 2007) with four clusters in Gabon and western Congo.

Similarity in SGS between populations

A slight SGS (estimated using the Sp statistic for distances <60 km; Table 3) was detected within four populations, with a maximum Sp statistic of 0.013. The extent of SGS was very similar among populations. Moreover, values of genetic diversity (HE) and inbreeding index (FIS) were relatively homogeneous among all populations. Hence, genetic dynamics appear similar even in regions distant from one another, suggesting that SGS reflects equilibrium isolation by distance patterns. The sole exception was the statistically significant inbreeding index (FIS) for population 1, suggesting the occurrence of selfing and/or biparental inbreeding. This population is located at the margin of the distributional range of D. benthamianus (Vivien and Faure, 1985), but the low density (around one-fifth that observed in other populations) may be a more likely explanation for the higher level of inbreeding.

Considering only distances <10 km gave us doubled values of Sp (except for the population 3), with a maximum Sp statistic of 0.027 (Table 3), a value in the last quartile of SGS estimates for tropical trees in the literature (Dick et al., 2008). Sp values in D. benthamianus are typically of the same order of magnitude as those found in other tropical trees whose seeds are heavy gliders dispersed by the wind (for example, Dicorynia (Fabaceae), Aucoumea (Burseraceae); see Hardy et al., 2006; Dick et al., 2008). This is coherent with some field observations suggesting seed dispersal to distances up to 50 m (Hecketsweiler, 1992; Cordiez, 2000). The doubling of Sp values when considering distances <10 km instead of up to 60 km indicates that the curve of kinship coefficient values (Fij) presents a concave trend, suggesting a ‘fat-tailed’ dispersal kernel of genes (Heuertz et al., 2003). However, the precision obtained when considering distances <10 km is limited because only 7–40% of pairs of individuals were taken into account.

Gene dispersal distance in D. benthamianus

Information about gene dispersal is a prerequisite to understanding the distribution and evolutionary dynamics of species. Direct field measurements or parentage analyses, however, are not easy to perform for tropical species, and indirect approaches are often required (Heuertz et al., 2003; Smouse and Sork, 2004; Hardy et al., 2006), especially in species presenting long-distance pollen and/or seed dispersal. We were able to estimate the scale of gene dispersal based on the strength of SGS in three populations, and we found estimates of σg (0.71 the quadratic mean parent–offspring distance) comprised between 400 and 1200 m—depending on the population and on the assumption made concerning effective density (De). Compared with previous estimates of σg in tropical trees, generally found to be below 500 m (Hardy et al., 2006; Born et al., 2008; but see Bizoux et al., 2009), this dispersal distance is quite large. Nevertheless, it shows that at the geographic scale investigated (1000 km diameter), past population fragmentation should keep a genetic signature in the form of differentiated gene pools for many generations after a secondary contact between fragments. Our indirect estimates of gene flow are thus compatible with a hypothesis that the large scale genetic discontinuities observed in D. benthamianus in the Lower-Guinean domain results from the fragmentation of central African lowland rainforests during the Pleistocene.

Our indirect estimates of gene dispersal distances cannot distinguish between seed and pollen dispersal, and according to our knowledge of seed dispersal vectors, we expect that pollen is the main contributor to long-distance gene dispersal. Nevertheless, only seed dispersal is involved in the dynamic of recolonization and should be quantified more accurately. To distinguish the relative roles of seed and pollen dispersal and elucidate the contributions of dispersal agents and distances, further insights should be obtained by genotyping progeny arrays that should provide contemporary estimates of pollen dispersal distances, by using, for example, a TwoGener approach (Smouse et al., 2001; Robledo-Arnuncio et al., 2006), or by conducting parentage analyses. In addition, sequencing of chloroplast and nuclear genes would complete the characterization of the large-scale genetic structure by a phylogeographic approach that might give additional insights into the history of the species. General conclusions on the impact of past vegetation changes and the role of climate fluctuations on the patterns of genetic variation of forest species will only be reached when a sufficient number of species will be studied in Central Africa.