Introduction

Dynamic management (DM) is an evolutionary approach for the conservation of the genetic resources of crop species (Henry et al., 1991; Goldringer et al., 2001). DM aims at maintaining the genetic diversity of crop species by growing successive generations of genetically diverse populations in several contrasted environments, without intentional human selection. Similar to wild fragmented populations, the mid-term maintenance of the evolutionary potential of the DM populations is a major issue for the sustainable conservation of biodiversity. In isolated populations, drift and selection lead to the reduction of genetic diversity. Reduced genetic diversity might then result in a loss of fitness because of inbreeding depression caused by the accumulation of deleterious mutations (Lynch et al., 1995; Keller and Waller, 2002) and a loss of evolvability, that is, the ability of a population to respond to natural or artificial selection (Houle, 1992). Isolation is not the only threat for natural populations. In the case of connected populations, genetic load seems to be more important in very small populations, as shown theoretically by Theodorou and Couvet (2006) and by crossing experiments by Willi and Fischer (2005).

Introduction of migrants in the threatened populations may allow the restoration of evolutionary potential by introducing new alleles into the population and hence (1) increasing the within-population genetic variability and (2) producing heterosis by masking deleterious recessive mutations or increasing the level of heterozygosity. The potential positive effect of migration to manage natural fragmented populations has often been discussed in the literature (reviewed in Tallmon et al., 2004), but few studies have really applied the so-called ‘genetic rescue’ to real populations and surveyed its consequences. Madsen et al. (1999) applied it to inbred populations of adder species and Vila (2003) applied genetic rescue to Scandinavian wolf populations. In both cases, the target population size increased very quickly, in one or two generations, despite the very few number of migrants used (20 and 1, respectively). Similarly, in plants, Newman and Tallmon (2001) showed that migration increased the fitness of experimental populations of Brassica campestris.

Real application of genetic rescue in conservation strategies is infrequent, probably because of the risk of outbreeding depression. Outbreeding depression is the reduction in fitness of offspring from matings between genetically divergent individuals. Experimental crosses between populations are the most frequent way of measuring the amount of outbreeding depression in a structured population (Edmands, 2007). Several experimental crossing studies have been carried out on fragmented populations and have revealed either outbreeding depression (Fischer and Matthies, 1997; Waser et al., 2000; Willi et al., 2005; Galloway and Etterson, 2005), heterosis between populations (Fenster and Galloway, 2000; Bossuyt, 2007) or both within one species (Physa acuta; Escobar et al., 2008).

In general, two non-exclusive mechanisms may explain outbreeding depression (Schierup and Christiansen, 1996): (1) the ‘ecological mechanism’, which is also called the ‘dilution effect’, in which, due to local adaptation of the populations, individuals carrying migrant genes have a lower fitness because these genes are maladapted to the new environment (Waser, 1993) and (2) the ‘genetic mechanism’ in which positive epistatic interactions within gene complexes might be broken (Lynch, 1991), whereas deleterious epistatic interactions appear. The two mechanisms may have different influences on the different generations of the progeny after the crossing event, depending on the genetic architecture of fitness traits (for example, dominance) and on the mating system. In outcrossing species in which heterozygosity is high, complexes of genes could be selected by dominance-by-additive epistasis. The dominance component of these complexes might be broken during the first generation, leading to outbreeding depression because of the genetic mechanism (Escobar et al., 2008). On the contrary, in selfing species, in which homozygosity is high, mainly additive-by-additive epistasis is expected to be found (Goldringer et al., 1997). In the first generation after a cross, additive-by-additive epistasis complexes are not broken and hence epistasis is not expected to contribute much to outbreeding depression in selfing species. However, in selfing species underdominance might also contribute to outbreeding depression of the first generation. Under both mating systems, the first generation is expected to be further depressed by the ecological mechanism, that is, the additive effects of the migrant maladapted alleles, except in the rare case in which the migrant alleles are fully recessive. Although the ecological mechanism persists after the second generation, genetic incompatibility is expected to be fully expressed as recombination breaks cis parental genes complexes (Lynch, 1991). Other mechanisms could also be involved in outbreeding depression, such as interactions between Wolbachia and a fig wasp (Greeff et al., 2009).

When applying genetic rescue approaches, two components of migration should be studied: the ‘quality’ of the migrants and the quantity of the migrants. Managing the quantity of migrants is well described by Wang (2004), who applied the One-Migrant-per-Generation rule to conservation biology. We will not discuss this component of the genetic rescue but will focus on the ‘quality’ component. Migrant origin should be carefully chosen to benefit from the renewal of genetic variability while limiting outbreeding depression effects. A major issue is to identify the appropriate distance (geographical or genetic) of the migrant individuals. This distance can be described as the optimal crossing distance, and it depends mainly on the genetic structure of the populations, on the importance of local adaptation and on the mating system (Waser, 1993). Most of the studies described above estimate divergence between populations by geographical distances. A few estimate differentiation at the genetic level by kinship coefficient (Willi et al., 2005) or by FST (Escobar et al., 2008) or at the phenotypic level by QST (Escobar et al., 2008).

In this study, we investigated the conditions to set up a genetic rescue approach in an experimental DM system of a self-fertilizing crop species, bread wheat (Triticum aestivum L.). To this aim, genetic diversity and phenotypic variability were analyzed in three DM populations after 12 generations of evolution without conscious human selection. Previous studies on these populations have shown that populations have diverged for different traits, such as disease resistance (Paillard et al., 2000) and earliness (Goldringer et al., 2006), in relation to local conditions. Three levels of crosses (selfing and within-population and between-population crosses) were carried out and phenotypic values of the progeny were analyzed to assess individual inbreeding depression and between-population heterosis or outbreeding depression. Kinship coefficients between the parents of the crosses were estimated using molecular markers, and the relation between the kinship coefficient between parents and the progeny values was analyzed to test for the existence of an optimal genetic distance. These results, together with analyses of the genetic structure of the populations, allowed to improve our understanding of the DM system and to make decisions regarding migration strategies.

As the DM system corresponds to recently diverged populations, the findings of our study can be useful not only for designing optimized dynamic management of genetic resources for crops, but also for improving conservation strategies for wild species experiencing recent habitat fragmentation.

Materials and methods

Plant material

Development of the dynamic management populations

A total of 16 nearly homozygous genotypes (referred to as ‘initial lines’ hereafter) of Triticum aestivum L. representing a wide genetic basis were initially crossed using a pyramidal design (Goldringer et al., 2006). The hybrid seeds obtained were then sown and bulk harvested for three generations of seed multiplication. This composite cross population, G0, was used as the initial population of the DM system. In 1984, seed samples of G0 were distributed to three sites of a French experimental network, Le Moulon (48.4°N, 2.1°E), Rennes (48.1°N, 1.4°W) and Toulouse (43.4°N, 1.3°E), with contrasted environmental conditions. Since then, each population has been cultivated every year without any conscious human selection. Each year, the whole population was harvested in bulk, from which a random sample of at least 5000 seeds was sown in a 100 m2 plot isolated from other wheat cultures to constitute the next generation. Migration between populations has been avoided since the beginning (see Goldringer et al., 2006; Rhoné et al., 2008 for more details on the populations).

Crossing scheme

Because wheat plants are hermaphrodites, plants were considered either as females and the spikes for crosses were emasculated or as males and only their pollen were used. In each of the three populations of the twelfth generation (Le Moulon, Rennes and Toulouse), 30–39 female plants and 35–38 male plants were randomly chosen for the crossing scheme. For each female plant, three types of crosses were made (Figure 1): self-pollination (SELF), within-population crosses (WPC, two distinct samples were used for female plants and male plants in each population) and between-population crosses (BPC, cross between a female from one population and a male from each of the two other populations). The male sample in each population was the same for the two last types of crossing (WPC and BPC). For each cross, different spikes of the female plant were either selfed or pollinated during spring 2003 and harvested during summer 2003. The progeny from the resulting seeds of each spike was called a family. The total number of families was 416: 104 for the SELF type, 104 for the WPC type and 208 for the BPC type.

Figure 1
figure 1

Crossing scheme. Each ellipse represents a population (Le Moulon, Rennes and Toulouse). Cross type codes are SELF for selfing, WPC for within-population cross and BPC for between-population cross. The framed plant is the target female plant for the four crosses symbolized by the arrows.

Phenotypic evaluation

The first-generation progeny of all the crosses were sown in a common-garden experiment at Le Moulon in October 2006 in a two-replicate randomized complete block design. In each block, each family was evaluated with a 1.20 m single row of 15 seeds. The first replicate was cultivated without fungicide and phenotypic measures were recorded on each plant of each row. The second replicate was cultivated with fungicide to ensure seed quality at harvest and only average phenotypic values were recorded for each row.

Six phenotypic traits, related to plant fitness, were measured at maturity during the harvest in summer 2007: plant height (PH), spike number per plant (SNP), kernel number per plant (KNP), kernel number per spike (KNS), grain weight per plant (GWP) and thousand-kernel weight (TKW).

Genotyping of the parents of the crosses

In spring 2003, a leaf sample was taken on each parent used in the crossing scheme for genotyping. The 16 initial lines of the DM populations were sown in 2007 and they were also sampled for genotyping. For each plant, total DNA was extracted from 100 mg of fresh material following a protocol derived from the Dneasy 96 Plant Kit (Qiagen, Hilden, Germany).

In all, 17 microsatellite markers developed by Röder et al. (1998), 1 (wmc231) by Somers and Edwards (2004) and a bi-loci marker (the pair of primers amplifies two individual loci on 2 chromosomes) (cfd71) developed by Guyomarc'h et al. (2002) were used for genotyping the 217 individuals studied. This set of 20 markers covers the 21 chromosomes of bread wheat except the chromosome 1A. PCR protocols were adapted from Röder et al. (1998) and Guyomarc'h et al. (2002). Amplified fragments were separated on an ABI 3130xl semi-automatic sequencer (Applied Biosystems, Foster City, CA, USA) and analyzed using GeneMapper 3.7 (Applied Biosystems).

Analyses

Genetic diversity among parents

We analyzed the genetic diversity of the individuals involved in the crossing scheme and of the 16 parents of the DM populations. Sample sizes for each population (Le Moulon, Rennes and Toulouse) were 67, 70 and 76, respectively. For each population, gene diversity was estimated from the allelic frequencies using the unbiased estimator He (Nei, 1987). Mean average over loci of allele number per locus (A) and within-population fixation index FIS were computed using Genetix 4.03 (Belkhir et al., 2000, Montpellier, France). The 95% confidence interval of FIS was computed using 1000 bootstraps per locus. Differentiation between the three populations was estimated using Weir and Cockerham F-statistics estimator θ (Weir and Cockerham, 1984) with 1000 permutations used for significance tests. We also estimated differentiation between the three populations by the DEST measure (Jost, 2008). In contrast to FST estimator θ, DEST partitions total genetic variance into statistically independent within- and between-population components and therefore avoids underestimation of differentiation by FST-like measures because of high within-population diversity (Jost, 2008). We estimated population genetic effective sizes (e) between the 16 initial lines and the populations at the twelfth generation using temporal variation in allele frequencies estimated by the mean allelic frequency standardized variance (Fc, Nei and Tajima, 1981). We used the second sampling scheme from Waples (1989). We estimated 95% confidence intervals assuming that (nF̂c)/E(c) follows a χ2 with n degree of freedom (total number of alleles). We compared the genetic effective sizes obtained with the demographic sizes corrected for selfing following: Nselfing=(Ndemographic)/(1+FIS) (Pollak, 1987).

To investigate the possibility for an optimal genetic distance for crossing, we computed kinship coefficient for each pair used in the crossing scheme using SPAGeDi (Hardy and Vekemans, 2002, Bruxelles, Belgique), a software program that implements the computation of the kinship coefficient proposed by Loiselle et al. (1995).

Statistical analysis of quantitative traits

Preliminary analysis and correction for spatial effects

In the first replicate, we thought that the position of the plants in the row might influence trait expressions because of differences in light exposure such as, for instance, border effects for plants at the start and end position. To account for such effects, we tested a ‘position’ effect using the following analysis of variance (ANOVA) model that explained phenotypic traits measured on the first replicate: Yij=μ+posi+Famj+Rij where pos is the fixed position effect, Fam is the random family effect and R is the random residual. Family and position effects were significant for all the traits, and hence we corrected each individual trait value in the first replicate for the position effect. In the following analyses of quantitative traits, means of corrected individual values were used for each family in the first replicate. Thus, for each trait, each family was described by two values, one for each replicate. The estimated broad-sense heritability at the design level ranged from 0.25 to 0.86, indicating the high level of genetic variation and the high correlation between the two replicates, although the experimental conditions of the two replicates differed.

Genetic structure at quantitative traits

To describe the genetic structure in parental populations for the quantitative traits, we used the selfed progeny (32 families for Le Moulon, 34 for Rennes and 38 for Toulouse) to assess the QST statistics (Wright, 1969; Spitze, 1993) as follows:

where is the estimated between-population genetic variance, is the estimated within-population genetic variance and FIS is the Wright fixation index. The within- and the between-population genetic variances in the experiment were estimated with the following ANOVA model:

where Pop is the random population effect, Fam is the random family effect, rep is the fixed replicate effect and R is the random residual. was estimated as the variance associated with the Pop effect and as the variance associated with Fam. QST was computed when the Pop effect was significant.

Sources of variation for the progeny phenotypic values

Using all progeny (104 SELF, 104 WPC and 208 BPC), we tested the effect of cross type and the effect of the population of origin of the female plant with the following ANOVA model:

where ctype is the fixed effect of cross type as defined in the crossing scheme (SELF, WPC and BPC), OriginFpl is the random effect of the population of origin of the female plant (Le Moulon, Rennes and Toulouse) and the other effects are the same as described in model (1).

To analyze how the relatedness between the parents of a cross affected the phenotypic value of the progeny, the last model included the kinship coefficient K as covariates as follows:

We included the square of the kinship coefficient (K2) to define the phenotypic value of the progeny as a parabolic function of the kinship coefficient. A significant K2 effect would indicate that there is an optimal kinship coefficient maximizing the progeny value.

To test for local adaptation, we have compared least-square means of WPC progeny from the Le Moulon population (considered as residents as experimentation took place in Le Moulon), with Rennes or Toulouse WPC progeny (considered as migrants) and corresponding BPC (Le Moulon by Rennes and Le Moulon by Toulouse in both direction). Least-square means were estimated using the following model:

where cross is the fixed effect of the cross type taking into account the origin of the two parents and the other effects are the same as described in model (1).

All ANOVA analyses were run using PROC generalized linear model (GLM) in SAS (SAS Institute, 2000, Carry, NC, USA).

Hierarchical heterosis estimation

Finally, applying model (4) to the progeny of all crosses, we estimated heterosis at two levels: (1) within population and (2) between populations. For (1), we estimated within-heterosis for each population (Le Moulon, Rennes and Toulouse) as (YWPC,iYSELF,i)/(YWPC,i) where YWPC,i and YSELF,i are, respectively, the mean trait value for the progeny from within-population crosses and from selfing in population i. For (2), we estimated between-heterosis for each pair of populations (Le Moulon-Rennes, Le Moulon-Toulouse and Rennes-Toulouse) as

where YBPC,ij is the mean trait value for progeny from crosses between population i and population j for both directions. To test whether heterosis values are significantly different from 0, we applied a t-test on heterosis numerator with the ESTIMATE statement of generalized linear model (GLM) procedure in SAS.

Results

Population structure

Table 1 shows diversity statistics for the 16 initial lines of the DM system and for the three parental populations. Although the Rennes population had a slightly lower He, the values for the Nei diversity were quite close in the populations of the twelfth generation and in the 16 initial lines. This result indicated that there was no marked loss of within-population diversity during the 12 generations of DM. This was confirmed by the allele number. A factorial component analysis showed that the initial diversity of the 16 lines was included in the total diversity of the three populations (data not shown). For all populations, the fixation index was rather close to 1, from 0.84 (Rennes population) to 0.98 (16 initial lines), and significantly different from zero, as expected for a highly selfing species such as bread wheat. Genetic effective sizes ranged from 67 (95% confidence interval 46–93) for Rennes population to 173 (95% confidence interval 120–244) for Le Moulon population. They were consistent with the within-population diversity He: Rennes population had the smallest value (He=0.51) and Le Moulon population the highest (He=0.61). Ne values were 15–39 times smaller than demographic sizes corrected for selfing, that is, 2650.

Table 1 Diversity statistics for the three parental populations and for the 16 initial lines of the DM system

Global differentiation over the three populations (θ=0.07 and DEST=0.11) was significantly different from zero (tests performed on θ only). Differentiation (θ) for each pair of populations ranged from 0.05 (0.09 for DEST) between populations from Toulouse and Le Moulon to 0.10 (0.14 for DEST) between populations from Toulouse and Rennes, with an intermediate value of 0.07 (0.10 for DEST) between populations from Le Moulon and Rennes. The three values of θ were significantly different from zero, indicating spatial differentiation in the DM system. DEST values were of the same order than θ while slightly higher. In addition, DEST ranked pairs of populations as θ.

In model (1), the population effect was significant for PH, KNS and GWP. QST values for these traits were 0.29, 0.19 and 0.19, respectively, whereas FST was 0.07 (Table 2). Therefore, differentiation for these quantitative traits was strong, as illustrated by the ratio QST/FST equal to 4.14, 2.71 and 2.71, respectively. Although DEST (0.11) was higher than θ, quantitative differentiation was still higher than neutral differentiation. Plant height differentiation was mainly because of the higher mean value for Rennes progeny (Table 2), whereas Toulouse progeny had significantly lower mean GWP and KNS, leading to significant QST for these two traits.

Table 2 Quantitative differentiation among selfed progeny

Sources of variation for progeny phenotypic values

Model (2) showed significant cross type and family effects for all traits (Table 3). Progeny derived from selfing had significantly lower values than the progeny issued from the two other cross types for KNP, GWP, KNS and TKW, indicating marked individual inbreeding depression (Table 4). For PH and SNP, progeny from selfing had significantly lower mean values than between-population cross progeny and lower but not significantly different mean values than within-population cross progeny. Mean values for traits for progeny from between-population crosses were never significantly different than those of progeny from within-population crosses. The effect of the population of origin of the female plant was significant for all traits except for SNP and KNP. These results were consistent with the high QST values estimated for PH, KNS and GWP on selfed progeny, whereas QST was not significant for TKW. Progeny issued from female plants of the Rennes population were significantly taller and had higher TKW and GWP (Table 5). Progeny derived from female plants of the Toulouse population had a significantly lower KNS. The interaction between cross type and population of origin of the female plant was not significant for any trait except PH.

Table 3 ANOVA table with model (2)
Table 4 Phenotypic least-square means values for the cross type effect (SELF, WPC and BPC) in model (2)
Table 5 Phenotypic least-square mean values for the origin of the female plant (Le Moulon, Rennes and Toulouse) in model (2)

The kinship coefficient K had a significant effect on PH and GWP and a significant interaction effect with the female plant origin for TKW. K2 also had a significant effect on TKW in interaction with the female plant origin. Despite these significant effects of K, model (3) explained only 4–23% of the phenotypic variation compared with 59–88% explained by model (2). The discrete ctype effect explained much more phenotypic variation than the continuous K covariate.

The cross effect of model (4) was significant only for PH, SNP and TKW. For these three traits, phenotypic values of Le Moulon WPC progeny were significantly lower than PH of Rennes WPC progeny but not different than PH of Toulouse WPC progeny (see Figure 2). Reciprocal between-population crosses did not show any asymmetry except for PH between the Toulouse population and the Le Moulon population. In this case, BPC progeny from a female plant from the Le Moulon population had a significantly lower PH than BPC progeny from male plants from the Le Moulon population. We found no indication of a specific maternal effect.

Figure 2
figure 2

Least-square means values for the cross effect in model (4) for resident within-population crosses (Le Moulon × Le Moulon (MM)), migrant within-population crosses (XX, X is Rennes or Toulouse) and between-population crosses (MX for crosses with the female plant from the Le Moulon population and XM for the reciprocal crosses). Black lines represent X parent from the Rennes population, and grey lines represent X parent from the Toulouse population. Horizontal dotted lines represent the mid-parent values between each couple MM and XX. Letters correspond to multiple comparisons between migrant, resident and BPC, lowercase for X=Rennes and uppercase for X=Toulouse.

Within-population heterosis was much greater than between-population heterosis for all the quantitative traits (Figure 3). KNP and GWP showed large and significant heterosis in the Toulouse population (0.17 and 0.21, respectively) and in the Rennes population (0.12 and 0.17, respectively), whereas heterosis was not significant in the Le Moulon population (0.06 and 0.10, respectively). Within-population heterosis was always low but significant for PH (0.03 for the three populations), KNS (0.04–0.08, not significant for the Rennes population) and TKW (0.04–0.06). In general, within-population heterosis for the different traits varied in the same proportions when changing from one population to another, except for heterosis for SNP that was marginally negative in the Le Moulon population and between 0.08 and 0.10 in the two other populations. Between-population heterosis was always low or negative whatever the populations and traits and never significant. There was slight outbreeding depression for SNP, KNS, KNP and GWP for the progeny of the cross between the Le Moulon and Rennes populations. Progeny from the Le Moulon and Toulouse crosses also showed weak outbreeding depression for KNS.

Figure 3
figure 3

Bar chart of levels of within- and between-population heterosis for quantitative traits. GWP, grain weight per plant; KNP, kernel number per plant; KNS, kernel number per spike; PH, plant height; SNP, spike number per plant; TKW, thousand kernel weight. Heterosis level estimates are as follows: BMR, between Le Moulon and Rennes; BMT, between Le Moulon and Toulouse; BRT, between Rennes and Toulouse; WM, within Le Moulon; WR, within Rennes; WT, within Toulouse.

Discussion

Finding rules to apply when designing genetic rescue for threatened fragmented populations is a major issue in conservation biology (Tallmon et al., 2004; Edmands, 2007). In this study we used recently diverged experimental populations to which we applied self-pollination, within- and between-population crosses to investigate the link between the genetic structure of the subdivided population, and fitness-related traits values in the first-generation offspring, by assessing within-population heterosis or inbreeding depression, between-population heterosis or genetic load and outbreeding depression. This approach is complementary and intermediate to analytical and simulation studies (Theodorou and Couvet, 2002; Glemin et al., 2003; Jaquiéry et al., 2009) and experimental crosses in natural populations (Fenster and Galloway, 2000; Newman and Tallmon, 2001; Willi et al., 2005) because much information on the history of the populations (time since divergence, population census sizes, initial diversity and so on) is available.

We found high and significant inbreeding depression within populations for four of six traits with systematic superiority of within-population cross progeny compared with selfed progeny. Moreover, little between-population heterosis was found, with similar values for offspring from between-population and within-population crosses.

Within-population heterosis

Wheat is a mainly selfing species with an outcrossing rate between 2 and 10% (Martin, 1990; Enjalbert et al., 1998; Enjalbert and David, 2000). In a selfing species, a high level of homozygosity is expected at all loci. Recessive deleterious mutations are thus expressed, and selection is expected to purge them efficiently, leading to low inbreeding depression in a selfing population. Unexpectedly, significant inbreeding depression was revealed in each DM population for four of the six studied traits by comparing selfing and within-population cross progeny. Corresponding within-population cross progeny heterosis values ranged between 4 and 21%. These values are surprisingly higher than the reported values for wheat species: Dreisigacker et al. (2005) found heterosis values for grain yield ranging from −15.33 to 14.13% with a mean of 0.5% for crosses between specific inbred lines from a breeding program, whereas Husband and Schemske (1996) estimated that the mean within-population heterosis for 11 selfing species is 5% for seed production. In fact, inbreeding depression can be divided into two parts: one contributed by lethal or sub-lethal mutations that will be purged by inbreeding, independent of population size, unless the mutation is totally recessive (Nei, 1968; Bataillon and Kirkpatrick, 2000), and the second because of mildly deleterious mutations that are only partially recessive. This second type will be greatly sensitive to population size and it will decrease as population size becomes smaller and genetic drift increases (Bataillon and Kirkpatrick, 2000).

The evidence of inbreeding depression that we found within each population, associated with a supposedly low genetic load, was consistent with the theoretical results of Bataillon and Kirkpatrick (2000) for effective sizes of approximately 100 and deleterious effects of mutations >0.05. The researchers showed that small populations (N100) do not efficiently purge their genetic load, and have an overall reduction in fitness because of the accumulation of deleterious mutations. However, inbreeding depression is not visible when comparing crosses between individuals within these populations to self-fertilized individuals because of the fixation of many loci within the population as a whole. On the contrary, larger populations (N100) more efficiently purge their genetic load, and allelic diversity within the population is high enough to allow the detection of inbreeding depression when comparing selfed individuals to crosses between individuals within the population.

However, in our case, the two smallest populations, Rennes and Toulouse, showed stronger inbreeding depression than the largest population (Le Moulon). This discrepancy between theoretical expectations from Bataillon and Kirkpatrick (2000) and our experimental results could be explained by the difference of time scale taken into account in the two approaches. Their model assumes that the studied population is at mutation-selection-drift equilibrium. After 12 generations, we can hardly assume that DM populations reached such equilibrium. Simple simulations allowed us to assess the probability that a mildly deleterious mutation (mutation effect between 0.01 and 0.10) initially existing in the ancestral population would be eliminated after 12 generations and its frequency if instead of being eliminated it remained in the segregating population (see details in the Supplementary Information). For the three populations, the probability of fixation of the deleterious mutation is null (Supplementary Figure S1). The more probable case is that the deleterious mutation could still be segregating in the populations. The expected frequency of the deleterious mutation is lower in the Le Moulon population than in the two others (Supplementary Figure S2), which could explain the lower values of within-population heterosis in this population.

Between-population heterosis or outbreeding depression

Studying the first-generation progeny of crosses between individuals from two populations can reveal two mechanisms linked to the subdivided population structure. First, hybridization between divergent populations may result in loss of fitness, known as outbreeding depression. This decline in fitness can be attributed to the effect of locally maladapted genes, which have been selected in another environment (the ecological mechanism) (Waser, 1993) or to genetic mechanisms such as the break-up of positive epistatic interactions or underdominance effects (Lynch, 1991). Second, masking mildly deleterious mutations differentially fixed in the different populations (genetic load at the within-population level) may lead to between-population heterosis (Fenster and Galloway, 2000).

In our study, between-population progeny were never significantly different from within-population progeny, and heterosis at the between-population level was low (<4%). This may indicate either both low between-population heterosis and little outbreeding depression or compensation between the two phenomena.

Effective population size is expected to strongly influence the accumulation of a genetic load in a population, as small population sizes will increase genetic drift thus increasing fixation of deleterious alleles (Kimura et al. 1963). Simultaneously small population sizes will cause allele frequency variations among populations, leading to strong genetic differentiation. Hence, crosses between smaller populations are expected to show more heterosis than crosses between larger populations (Whitlock, 2000). On the other hand, larger divergence between populations might lead to outbreeding depression because of genetic incompatibilities (Escobar et al., 2008). Finally, selection is expected to be more efficient in larger populations, when genetic drift is limited, allowing for local adaptation to develop in the population. Hence, crosses between large but locally adapted populations might also yield outbreeding depression. In this study, genetic effective sizes of the populations were relatively large (Table 1) compared with those of wild populations (Frankham, 1995, 1999; Siol et al., 2007) and the FST estimated on 20 neutral markers between the three experimental wheat populations were quite low (FST ≈7%), which was consistent with reduced between-population heterosis. The slightly higher heterosis between the Rennes and Toulouse populations might be because of their larger differentiation (FST=0.10) and lower Ne (Table 1).

We found high and significant QST values, associated with QST/FST ratios >1 for PH, KNS and GWP (Table 2). This can be interpreted as the consequence of divergent selection acting on these traits (Whitlock, 1999) as was shown by previous studies on these populations (Rhoné, 2008) and for other traits such as disease resistance (Paillard et al., 2000) or heading date (Goldringer et al., 2006). Non-additive effects such as dominance or epistasis are expected to lower the estimation of QST (Whitlock, 1999; Goudet and Buchi, 2006). Hence, QST/FST ratio of >1 is a conservative indication that, in addition to drift, the main force acting on the DM populations is divergent selection, potentially leading to local adaptation of populations in each contrasted site. Comparison of least-square means of the population origins estimated on selfed progeny gave indications of the differentiation at quantitative traits under divergent selection. For GWP and KNS, which are expected to be tightly correlated with fitness, there were no significant differences between Le Moulon and Rennes progeny. However, Toulouse progeny showed significantly lower values for these fitness-related traits, possibly indicating maladaptation of this population to Le Moulon environment. One possible explanation is that the environments of Le Moulon and Rennes are more similar and hence genes selected for local adaptation in Rennes are also adapted to Le Moulon, whereas locally adapted genes selected in Toulouse are maladapted in Le Moulon. Such a pattern is consistent with adaptation to climatic conditions, as climatic conditions of the two northern sites (Le Moulon and Rennes) are quite similar to each other, whereas very different from conditions in the southern site (Toulouse). We studied an aspect of local adaptation by considering that WPC progeny from the Le Moulon population were residents and WPC progeny from the Rennes and Toulouse populations were migrants (see Figure 2). Results were not the same as for the analysis of selfed progeny except for PH and KNP. Indeed, there were no differences between populations for GWP and KNS (traits with a significant QST), whereas there were significant differences for SNP and TKW (traits without significant QST). This discrepancy could be explained by the fact that phenotypic values are expressions of genetic interactions between male and female alleles in progeny of crosses in addition to simple effect of homozygous female effects as in selfed progeny. This draws attention to the fact that estimation of QST is dependent on the type of family used. Goudet and Buchi (2006) showed that progeny from self-pollination is the most efficient design to estimate QST in the case of self-fertilizing species. Progeny from the Rennes populations showed significantly higher phenotypic values than progeny from the Le Moulon and Toulouse populations for SNP and TKW. This might indicate that the progeny from the Rennes population were more adapted than those from the Le Moulon population. However, differentiated traits for progeny of crosses (SNP and TKW) are less correlated with fitness than GWP because only part of the fitness components are taken into account, whereas GWP integrates both the number of offspring and survival potential of the offspring (heavier seeds are expected to be more viable). SNP and TKW could be therefore taken to represent life history traits and differences rather than different life history strategies.

In addition, the greater plant height of the Rennes progeny in both cases, selfed or progeny of crosses, might give them a competitive advantage when grown side by side with shorter families, even in the case of the ‘local’ families from Le Moulon.

Overall, knowing that the experimental design was not optimal to precisely assess local adaptation, we found only slight indications of local adaptation for the three populations, which was consistent with limited outbreeding depression. Finally, the most likely explanation is that both between-population heterosis and outbreeding depression were low in the DM system after 12 generations of divergent evolution.

Optimal crossing distance

A balance of the effects of heterosis and outbreeding depression as a function of relatedness between crossed individuals is expected to lead to the existence of an optimal genetic distance for genetic rescue strategies (for example, in Willi et al., 2005 and reviewed in Tallmon et al., 2004). In this study, we found no clear relationship (neither linear nor quadratic) between the kinship coefficient for pairs of crossed individuals and progeny phenotypic values. In fact, kinship coefficients averaged for each type of cross were very similar, although high values of kinship were slightly more frequent for within-population pairs of individuals (see Supplementary Figure S3). This result shows that neutral diversity is not as informative as desired for assessing potentially adaptive diversity in the DM system, although linkage disequilibrium is expected to be high in a selfing species (Golding and Strobeck, 1979). However, low linkage disequilibrium in DM has been shown around genes submitted to high selective pressure (Raquin et al., 2008).

Conclusion

On the basis of our results from between-population cross progeny, we think that planned migrations between populations would not bring high heterosis in the DM system, thus reducing its possible positive effect. According to Jaquiéry et al. (2009), information from neutral markers indicating that within-population diversity is still at a sufficient level is a clue that there are no important viability problems within the populations. Neutral marker information did not provide relevant predictive information on phenotypic traits and thus on the evolutionary potential of the populations. This might be a general feature to many experimental crossing experiments. Indeed, complementary information about the subdivided population (for example, presence of local adaptation) is necessary to have a more precise picture of the mechanisms involved in the population under consideration.

Lynch (1991) raised the fact that although heterosis may occur in the first generation after crosses, recombination between parental chromosomes and the break-up of coadapted gene complexes could lead to increased outbreeding depression in the following generations. Genetic rescue effects should then be studied beyond the first generation and these studies are currently in progress with the DM wheat populations. In addition, to further study local adaptation in the DM system, cross progeny have to be studied in each site of parental origin to assess the effects of adaptive genes in the environment in which they have been selected.

Local adaptation could be an important mechanism and is found to be quite frequent in natural populations (Merila and Crnokrak, 2001; McKay and Latta, 2002; Hufford and Mazer, 2003). Only very few theoretical studies take into account local adaptation through an environmental gradient along a species range (Garcia-Ramos and Kirkpatrick, 1997; Alleaume-Benharira et al., 2006). Integrating this kind of selection in theoretical or simulation approaches could be very useful to help conservation biologists to make management decisions.