Decoupling of differentiation between traits and their underlying genes in response to divergent selection

Kremer, A; Le Corre, V

doi:10.1038/hdy.2011.81

Download PDF

Original Article
Published: 14 September 2011

Decoupling of differentiation between traits and their underlying genes in response to divergent selection

A Kremer^1,2 &
V Le Corre³

Heredity volume 108, pages 375–385 (2012)Cite this article

3321 Accesses
69 Citations
1 Altmetric
Metrics details

Abstract

We dissected the relationship between genetic differentiation (Q_ST) for a trait and its underlying genes (G_STq, differentiation for a quantitative locus) in an evolutionary context, with the aim of identifying the conditions in which these two measurements are decoupled. We used two parameters (θ_B and θ_W) scaling the contributions of inter- and intrapopulation allelic covariation between genes controlling the trait of interest. We monitored the changes in θ_B and θ_W, Q_ST and G_STq over successive generations of divergent and stabilizing selection, in simulations for an outcrossing species with extensive gene flow. The dynamics of these parameters are characterized by two phases. Initially, during the earliest generations, differentiation of the trait increases very rapidly and the principal and immediate driver of Q_ST is θ_B. During subsequent generations, G_STq increases steadily and makes an equal contribution to Q_ST. These results show that selection first captures beneficial allelic associations at different loci at different populations, and then targets changes in allelic frequencies. The same patterns are observed when environmental change modifies divergent selection, as shown by the very rapid response of θ_B to the changes of selection regimes. We compare our results with previous experimental findings and consider their relevance to the detection of molecular signatures of natural selection.

Experimental evolution of adaptive divergence under varying degrees of gene flow

Article 11 January 2021

Correlational selection in the age of genomics

Article 15 April 2021

Polygenic adaptation: a unifying framework to understand positive selection

Article 29 June 2020

Introduction

The detection of genomic imprints of natural selection has become a major area of research in ecological genetics (Nielsen, 2005; Wright and Gaut, 2006). One of the approaches used is based on the detection of genes displaying higher levels of population differentiation than would be expected under neutral evolution (the so-called ‘outlier’ loci; Storz, 2005; Holderegger et al., 2008). In most cases, the rationale of ‘outlier’ detection is based on the existence of a high degree of phenotypic differentiation of the trait, as observed in common garden experiments, suggesting similar levels of differentiation at the genomic level. For example, in forest trees, in which extensive common gardens have been established, known as provenance tests, substantial population differentiation has been observed for almost all phenotypic traits assessed (see Wright (1976) and Morgenstern (1996) for reviews on North American species, and König (2005) for a review on European species). Only a few studies have investigated the differentiation of traits and their underlying genes, and these studies have concluded that there is considerable discrepancy between these two measurements (Hall et al., 2007; Luquez et al., 2007; Namroud et al., 2008). Theoretical predictions supported by simulations have indeed indicated that there may be large discrepancies between phenotypic differentiation (Q_ST) and differentiation of the genes controlling the trait (G_STq, for quantitative loci, as suggested by Santure and Wang (2009), to avoid confusion with neutral G_ST or F_ST). Latta (1998, 2004) and McKay and Latta (2002) have shown that this discrepancy stems from the intergenic disequilibria accumulated through selection, which decouples differentiation at the trait and gene levels. In a previous study, we investigated the contribution of intergenic disequilibria to overall phenotypic differentiation, theoretically, in more detail (Le Corre and Kremer, 2003). We confirmed that high levels of differentiation could be observed for a trait, with only low levels of differentiation for the genes controlling the trait, and reviewed various evolutionary scenarios under which this discrepancy would be maintained or increased. In extreme cases, such as outcrossing species with large populations and high gene flow undergoing divergent selection, as in trees, there might be no signature at all at individual loci, despite a high degree of phenotypic differentiation. Recent developments along these lines have considered the contribution of dominance to the decoupling of Q_ST and G_STq, (Goudet and Büchi, 2006; Goudet and Martin, 2007; Santure and Wang, 2009). Under divergent selection, the contrast between these two measurements of differentiation increases with dominance (Santure and Wang, 2009). We therefore consider that the additive model used in this study provides the lower limits of decoupling between Q_ST and G_STq in more complex situations, including gene interactions.

We explore here the progressive decoupling of Q_ST and G_STq under different evolutionary scenarios. Our previous study focused on the equilibrium values of Q_ST and G_STq (Le Corre and Kremer, 2003). Here, we address their transient dynamics over time and consider different genetic architectures of the trait undergoing selection. Interest in the speed of adaptive differentiation in heterogeneous landscapes has increased recently (Kawecki, 2008; Lopez et al., 2008; Bjorklund et al., 2009). Evolutionary responses are often inferred from the divergence or differentiation of adaptive traits observed in a set of populations undergoing divergent selection (Hendry et al., 2001; Crispo, 2008; Rasanen and Hendry, 2008) or from the molecular divergence of genes of adaptive relevance (Kapralov and Filatov, 2006; Nosil et al., 2009). Furthermore there is an increasing concern about the rate of response of natural populations to ongoing climate change (Aitken et al., 2008). It is therefore timely to reconsider the dynamics of differentiation in an evolutionary context. Our main objective is to monitor changes of Q_ST and G_STq and their decoupling in silico by considering different divergent selection regimes, including nonequilibrium situations. We used two parameters (θ_B and θ_W) scaling the contributions of between- and within-population disequilibria to Q_ST. We highlight the way in which divergent selection generates a positive θ_B, which becomes in most circumstances the main driver of Q_ST. We focus on the particular case of species with large populations and high levels of gene flow, such as trees, in which high levels of trait differentiation are maintained in the presence of high levels of gene flow (Savolainen et al., 2007). We aimed to identify the situations in which the traces of natural selection acting on the genes controlling the trait might be blurred by intergenic disequilibria, thereby limiting the power to detect ‘outlier’ loci.

Materials and methods

Decoupling of Q_ST and G_STq

We focus here on the comparative monitoring of population differentiation of a diploid organism for a phenotypic trait and the underlying genes contributing to the trait, assuming that all these genes are known. The trait is assessed in different populations subject to selection and exchanging genes through gene flow. At any time during the evolutionary process, differentiation of the trait and of the underlying genes can be assessed. In a previous study, we derived a relationship between differentiation for a quantitative trait (Q_ST) and the mean differentiation of the n loci controlling the trait (G_STq; Le Corre and Kremer, 2003) under the following assumptions:

Diallelic loci for which the two alleles have symmetric additive effects (+α/2 and −α/2).
Equal contributions of the various loci to the additive value of the trait.
Additive effects of the alleles at each locus, with no consideration of dominance or epistasis.

where

σ_Wi² is the within-population variance of additive effects at locus i contributing to the trait. At the single-locus level, σ_Wi² is also called genic variance. The genic variance is assessed in all populations, and in what follows σ_Wi² is the average value across populations. σ_Bi² is the variance of population effects of locus i.

Cov_Wi,j and Cov_Bi,j are the within- and between-population covariances of the additive and population effects.

θ_W and θ_B are the relative contributions of the genetic covariances between loci to the additive and between-population variances of the trait (V_W and V_B, respectively), with respect to the single-locus variances (σ_Wi²and σ_Bi²).

V_W is the additive variance within populations.

V_B is the between-population variance.

According to (1), Q_ST is a monotone increasing function of G_STq and θ_B, and a monotone decreasing function of θ_W, reflecting the contrasting contributions of the within- and between-population disequilibria to Q_ST. Interpopulation disequilibria between loci tend to increase phenotypic differentiation between populations, whereas intrapopulation disequilibria decrease differentiation.

Relationship (1) holds even when gene flow and selection are not at equilibrium. It is an ‘instantaneous’ relationship describing the architecture of Q_ST in terms of single-locus components (G_STq) and multilocus components (θ_B and θ_W) at any time through an evolutionary scenario. This relationship also holds for all types and strengths of selection within or between populations and contributing to the divergence between populations.

We can now consider the difference between trait differentiation and gene differentiation by rewriting (1) in a different way:

From (4) and (5), we can conclude that:

There is no simple relationship between Q_ST and G_STq in more complex situations, with unequal contributions of loci and multiallelism. However, we used simulations to determine whether relationship (1) was applied in such situations (Le Corre and Kremer, 2003; Figure 3). As expected, the observed differentiation of the trait (Q_ST) in the case of multiallelism and an unequal contribution of loci were larger than that predicted by Equation (1), but the difference was moderate when Q_ST was below 0.40. These comparisons suggest that the decoupling of Q_ST and G_STq would be even more pronounced in these more realistic situations. Hence, relationship (1) is a rather conservative figure, corresponding to the minimum observable discrepancy between Q_ST and G_STq. Similarly, when dominance is taken into account, the difference between the two measurements of differentiation increases with divergent selection and higher levels of dominance (Santure and Wang, 2009). Furthermore, inequalities (6) were also maintained when multiple alleles and unequal contributions of loci were considered in simulations (Le Corre and Kremer, 2003, Figure 4). From the inequalities in (6), we can conclude that the comparison of disequilibria between and within populations can be used to predict the level of decoupling of G_STq and Q_ST.

We further dissected θ_B and θ_W taking into account the genetic architecture of the trait, that is, the number of loci contributing to the trait. Within- and between-population genic variances can be expressed in terms of diversity and differentiation parameters in the case of biallelic loci having symmetrical additive effects (Le Corre and Kremer, 2003):

where H_si is the mean within-population diversity for locus i, F_i the mean within-population fixation index for locus i, and D_STi is the differentiation among populations for locus i (Nei, 1987 D_STi=H_Ti−H_Si, H_Ti being the total diversity in the metapopulation).

Covariances between allelic effects of loci i and j can be decomposed in terms of disequilibria and allelic effects (Weir, 1990; Hamilton and Cole, 2004) as:

where Δ_Wi,j is the mean composite genotypic disequilibrium (within population) between loci i and j according to Weir, (Δ_AB in Weir's notation), and Cov(p_i, p_j) is the covariance between frequencies p_i (allele of positive effect at locus i) and p_j (allele of positive effect at locus j) calculated across all populations.

If we further assume that the α_i and F_i values are the same for all loci, and taking H_S as the mean intrapopulation gene diversity over all loci, then θ_W becomes:

Similarly assuming that D_ST is the mean differentiation across all loci and that allelic effects are the same across loci, then θ_B becomes:

where n is the number of loci, and Δ_W and Δ_B are the mean values of Δ_Wi,j and Δ_Bi,j over all pairs of loci. As a result, θ_B and θ_W are now written in terms of disequilibria and diversity parameters, which can be estimated in natural populations. Expressions (11) and (12) show that θ_B and θ_W are dependent on the number of loci contributing to the trait. The decoupling of Q_ST and G_STq can now be expressed as a function of the number of loci by replacing in (4) θ_W and θ_B by their expressions given by (11) and (12)

where x=2(n−1) and B = Δ_B/D_ST and W = Δ_W/(1+F)H_S It can be shown that (13) is an increasing function of x when B>W (or when θ_B>θ_W) and a decreasing function of x when B<W (or when θ_B<θ_W).

Simulations

We monitored the changes in θ_B, θ_W, G_STq and Q_ST under different evolutionary scenarios, assessing the decoupling of G_STq and Q_ST over successive generations. Individual-based simulations were carried out with large populations and extensive gene flow, subject to divergent selection. Details of the METAPOP simulation model are provided in Le Corre and Kremer (2003). We considered a phenotypic trait controlled by 5, 10 or 30 unlinked loci, each with 20 allelic states. The mutation rate was set at 10⁻⁵ per locus and followed a k-allele model. The genetic value of the trait was the sum of allelic effects over all loci. The additive value at each allele was drawn at random from a Gaussian distribution. The environmental contribution to the phenotypic value of the trait was considered as a Gaussian variable of mean 0 and variance 1. The variance of the distribution of allelic effects was adjusted, so that the initial genetic variance within populations was equal to 5 (implying that the trait's heritability was initially 0.833). These starting values were selected, so that they match with estimates of heritabilities in extant progeny tests (Kremer, 1994); other initial values were tested as well, but the evolution in θ_B, θ_W, G_STq and Q_ST showed similar patterns. Throughout the simulations, covariance components of θ_B and θ_W were numerically calculated as differences between genetic variances (V_B and V_W, respectively) and genic variances (σ_Bi² and σ_Wi² , respectively) according to Equation (3). Hence, they comprise all sources of allelic associations, for example, between different loci arising on a same gamete (linkage disequilibrium stricto sensu) or of different gametic origins (because of Hardy–Weinberg disequilibrium).

We considered 25 populations, each of size 500, with equal gene flow between all pairs of populations (island model with Nm=10). The 25 populations were displayed on a 5 × 5 grid system with one-dimensional environmental variation, such that the optimal phenotypic value Z_OPT(k) for the trait in population k followed a cline of variation across the metapopulation. These settings (initial genetic variance and environmental variance within populations, mutation rate, number of populations, population size and migration rate) were kept constant across all simulations.

We focused on the effects of different selection pressures on population differentiation. We considered Gaussian stabilizing selection within each population k toward the local optimal phenotypic value Z_OPT(k). The fitness W(Z) of an individual with phenotype Z located in population k is given by:

The strength of within-population selection was scaled by ω², the selection intensity. We considered two cases with contrasting reported values of ω² (Roff, 2002): low selection intensity (ω²=50) and high selection intensity (ω²=5).

Divergent selection was introduced into this setting by the linear gradient of Z_OPT values. On the 5 × 5 grid system, this resulted in the assignment of –x, −x/2, 0, x and x/2 as Z_OPT values for the five populations in a single row. The level of divergent selection was scaled by VarZ_OPT, the variance of Z_OPT values between populations (VarZ_OPT=x²/2). Again, we considered two contrasting cases: low levels of divergent selection (VarZ_OPT=1) and moderate divergent selection (VarZ_OPT=5). In total, we investigated four different selection scenarios combining different selection regimes (Table 1a). At the starting point of the simulations (generation 0), we assumed that the 25 populations were at mutation–migration–drift equilibrium (for Nm=10 and G_STq=2.4%). Hence, some population differentiation for the phenotypic trait existed at time zero, purely due to the neutral differentiation of allelic frequencies (Q_ST=2.4%), but θ_B and θ_W were zero at the starting point. The number of alleles initially present at each locus in the metapopulation was fixed at six, according to Ewens’ approximate sampling formula (Ewens, 1972). Overall allelic frequencies at each locus were drawn form a Dirichlet distribution with parameter 1. Allelic frequencies for the different populations were then drawn from Dirichlet distributions with parameters equal to the overall frequency scaled by 4 Nm (Beaumont and Balding, 2004). For each scenario, we considered different genetic architectures of the trait undergoing selection, by imposing that the trait was controlled by 5, 10 or 30 loci. A total of 50 replicates (for each combination scenario—number of loci) were run to account for stochastic variation of the various parameters assessed. These simulations were run up to 3000 generations in order to compare equilibrium values, but their dynamics were monitored in detail during the very first 100 generations.

Table 1 Evolutionary scenarios (a) starting from mutation–migration–drift equilibrium and (b) with changing selection regimes

Full size table

We further investigated additional scenarios aiming at monitoring the decoupling of G_STq and Q_ST under more realistic situations, where selection takes place while some sort of selection was already acting. This was done by subdividing each scenario into two successive phases: phase A from generation 0 to 100, followed by phase B from generation 101 to 200 (Table 1b) Differences in phases A and B consisted essentially in different levels of divergent selection (uniform, moderate divergent and strong divergent). The rationale for subdividing each scenario into two successive phases was twofold:

1)
to evaluate the impact of different initial situations on the dynamics of the parameters (θ_B, θ_W, G_STq and Q_ST ) by considering nonequilibrium situations as initial states and
2)
to monitor how differentiation of traits and genes respond to sudden changes in selection regimes (either increasing levels or decreasing levels of divergent selection).

Results

Genetic architecture and decoupling of G_STq and Q_ST

Asymptotic values observed for G_STq and Q_ST after 3000 generations confirm theoretical predictions that the decoupling between G_STq and Q_ST is inflated by differences between θ_B and θ_W (Figure 1). The largest decoupling was observed under strong divergent and stabilizing selection, whereas low divergent and stabilizing selection contributed to moderate decoupling. One of the most striking findings of the simulations was the contrasting response of θ_W, θ_B and G_STq to variation of the number of loci controlling the trait (Table 2 and Figure 1). By contrast, the genetic parameters V_W, V_B and Q_ST remained constant (Supplementary Table 1), except under relaxed selection regimes (scenario 3), where higher levels of V_W and V_B were maintained when the number of loci was larger. As θ_W, θ_B and G_STq are the only components of Q_ST (Equation (1)), there must be some trade-off between θ_W, θ_B and G_STq to maintain Q_ST constant. As predicted by theory (Equations (11) and (12)), θ_W and θ_B increase with the number of loci (Table 2). Interestingly, the increase of θ_B as a function of the number of loci fits to the prediction in the simplified diallelic locus case (assuming Δ_B and D_ST constant). G_STq decreased with increasing number of loci (Figure 1 and Supplementary Table 1). As predicted by (13), the level of the decoupling (Q_ST/G_STq) increased as the number of loci increased (when θ_B>θ_W). A high degree of decoupling of Q_ST and G_STq was therefore expected when a large number of genes contributed to the trait, as illustrated in Figure 1.

Table 2 Asymptotic values of (a) θ_W and (b) θ_B

Full size table

These observations can be extended to the case in which the number of loci increases indefinitely (infinitesimal model), resulting to very low levels of G_STq, and θ_B becoming the key component of Q_ST. A basic assumption of the infinitesimal model is indeed that the allelic frequencies of quantitative loci do not change across generations and that all the between-population variance stems from the intergenic disequilibrium (θ_B; Bulmer (1980)).

Figure 1 also illustrates the variation of Q_ST and G_STq values between stochastic repetitions of a given evolutionary scenario. As the number of genes contributing to the trait decreases, the variation of G_STq increases over the 50 stochastic repetitions. Q_ST is much less variable than G_STq across stochastic repetitions. Given the high degree of heterogeneity of G_STq values, there may be considerable variation of the decoupling of G_STq and Q_ST for a given setting of VarZ_OPT and ω². However, this variation occurs mostly in situations of small numbers of loci and a high intensity of stabilizing selection (Figure 1).

Dynamics of genetic variances and covariances

We first monitored the dynamics of θ_W, θ_B and corresponding variances under selection regimes following an initial mutation–migration–drift equilibrium (scenarios 1–4), considering a trait controlled by 10 loci (Figure 2). In the first few generations (<10) following the induction of selection, the intergenic linkage disequilibrium component θ_W decreases rapidly. This decrease becomes stronger as the strength of stabilizing selection increases (Figures 2e and g). These trends follow the predictions of Bulmer (1974, 1980), who showed that negative covariances are generated by directional selection in a single population. Our results suggest a similar trend for a subdivided population. Selection induces negative covariance at each generation. However, this covariance is halved by recombination at each generation. Further, the negative covariance component added in each generation decreases steadily because stabilizing selection erodes the genetic variance itself (Figure 3). After reaching its minimum, θ_W begins to increase, possibly even approaching positive values when divergent selection is strong (Figures 2c vs 2a; Figures 2g vs 2e; Table 2a). We suspect that the construction of a positive θ_W results from an interaction between gene flow and divergent selection. As first observed by Nei and Li (1973), when divergent selection operates in a subdivided population such that allelic frequencies differ between subpopulations, migration produces permanent linkage disequilibrium in each subpopulation. Kirkpatrick et al. (2002; formula 32) showed that the disequilibrium component contributed by migration in each population is a function of the product, across loci, of the differences in allele frequencies between the local population and the migrant pool, and increases with increasing migration rate. As strong divergent selection favors different allelic associations in different populations, migration continually assembles different allelic combinations, thus creating positive disequilibrium within populations. Indeed, our simulations show that divergent selection has a positive effect on θ_W due to the restoration of positive allelic covariances (Figures 2c and g and Table 2a), as predicted by Nei and Li (1973) and Kirkpatrick et al. (2002).

θ_B increases very rapidly during the first 5 to 10 generations, particularly under highly divergent selection (Figures 2d and h). θ_B builds up to compensate for the lag between the variance of optimal values VarZ_OPT and the existing between-population variance (V_B). At the very beginning (generation 0), the only source of V_B is the neutral subdivision of the populations induced by gene flow:

where σ₀² is the initial genetic variance in the whole metapopulation and $G_{{ST}_{0}}$ is the neutral differentiation. Thus, in the case of a species with low migration, the between-population variance that exists before the onset of divergent selection may be much lower than the variance of optimal values.

Thereafter, θ_B is maintained at the same level, or decreases slowly under highly divergent selection (Figure 2h). Thus, simulation results show that changes in genetic covariances both within and between populations occur in the first few generations of selection.

Shortly after θ_B reaches its peak value, the genetic variance between populations, V_B, also attains its maximum value (Figure 3), decreasing steadily thereafter. The rate of decrease is stronger when stabilizing selection is weak. It is difficult to detect these dynamics over short time scales (Figure 3), but they are obvious when comparing the asymptotic values of V_B (Supplementary Table 1) with the values of this parameter over the first 100 generations (Figure 3). As alleles are purged from populations, intergenic allelic associations are limited (decay of θ_B; Figures 2f and h) and V_B depends principally on the difference in allele frequency between populations. As can be seen on Figure 3, V_B does not reach VarZ_OPT (the variance of optimal values within populations according to stabilizing selection). The maximum value of the V_B/VarZ_OPT ratio was a function of the strength of stabilizing selection and did not vary with the number of loci: the mean value obtained was 0.67 when ω²=50 and 0.94 when ω²=5. Under extensive gene flow (Nm=10), VarZ_OPT can be reached only if stabilizing selection is strong while under weak selection, the final V_B value is less than half of VarZ_OPT (Supplementary Table 1). No changes to the transient dynamics were observed when the number of loci was changed to 5 or 30 (data not shown). The only change was the overall higher values of θ_W and θ_B when the number of loci increased.

The general trends of variation of θ_B were maintained when the starting conditions of simulations were modified. Indeed, regardless whether populations were at mutation–migration–drift equilibrium, undergoing uniform selection or moderate divergent selection, the responses of θ_B to the induction of strong divergent selection was a very rapid and steep increase (Figures 4a and b). The steepness of the response is mainly dependent on the strength of the ongoing stabilizing selection, and not on the initial conditions. However, the range of increase of θ_B was larger when uniform selection was the starting point (θ_B negative at generation 0), because the lag between the imposed variance of optimal values (VarZ_OPT) and the between-population variance (V_B) at generation 0 was much larger than under other starting conditions. θ_B is the immediate response to compensate the lag. We also monitored the changes in θ_B when the number of loci was 5 and 30. There was no change in the dynamics, but the values after 100 generations differed according to the theoretical predictions (Equation (12)): they ranged from 1.76 to 2.11 with 5 loci, from 3.17 to 4.15 with 10 loci and from 7.74 to 8.36 with 30 loci.

Dynamics of G_STq and Q_ST

As for of θ_W and θ_B, we monitored first the variation of Q_ST and G_STq under scenarios 1–4 following an initial mutation–migration–drift equilibrium (Figure 5). There are striking differences between the variation of Q_ST and G_STq on the one hand and θ_W and θ_B on the other. θ_W and θ_B reach minimum and maximum values during the first few generations, whereas Q_ST and, especially G_STq, increases more steadily. The main driver of G_STq appears to be the strength of stabilizing selection, which opposes the effect of gene flow, as described above.

Throughout the selection scenario, G_STq remained lower than Q_ST, as a straightforward consequence of θ_W being lower than θ_B (inequalities 6). Q_ST increased very rapidly in the first few generations, with a steeper slope under strong stabilizing selection. Subsequently, the dynamics of Q_ST followed that of G_STq, with a steady increase when stabilizing selection was strong (Figures 5c and d), and a smoother or flat course when stabilizing selection was weak (Figures 5a and b). The decoupling of Q_ST and G_STq was not changed when initial conditions were either uniform selection or moderate divergent selection (Figures 5b, d and 6a, b). Whether Q_ST is 0 or already higher than the neutral expectation at generation 0, the time trends during the first 100 generations are similar. The same conclusions hold also for G_STq. Particularly, the very rapid increase of Q_ST during the early generations is created by the steep variation of θ_B, being the main driver of Q_ST.

The patterns depicted in Figures 2, 3, 4, 5 and 6 indicate that the variation of population differentiation induced by selection follows two phases. A detailed dissection of the early phase (comparison of Figures 3 and 5 with Figure 2) suggested that the main driver of the rapid increase in both V_B and Q_ST was the variation of θ_B, rather than that of G_STq. Interestingly, θ_B may reach even higher values when the trait is controlled by a larger number of loci (data not shown, but see asymptotic values in Table 2b), enhancing the capture of intergenic disequilibria, as the larger number of loci provides a larger number of opportunities for favorable allelic associations. The dynamics during this early phase is highly analogous to that of the genetic variance of a trait undergoing selection within a single population (Bulmer, 1974, 1980). Indeed, Bulmer indicated that allele frequencies probably remain constant over the first few generations of selection, whereas the genetic variance of the trait decreases, due mostly to negative intergenic linkage disequilibrium. In our multiple-population context, the Bulmer effect may be thus generalized as follows: the short-term response to selection is characterized by a decrease in within-population variance mediated by negative values of θ_W in each population on one the hand, and an increase in between-population variance mediated by a rapid increase in positive θ_B values on the other hand. This early phase is characterized by the rapid building up of the covariance of allelic effects, but allelic frequencies within populations change very slowly. The increase in covariance is followed by an increase in between-population variance, V_B, which peaks at the end of the first phase. As this point, populations are locally adapted (that is, the between-population variance is closest to the optimal variance). The duration of the early phase varies from a little over 10 to 30 generations when local stabilizing selection is strong, and from 50 to 80 generations when local stabilizing selection is weak .In the second phase, there is a slower steady increase in Q_ST driven by G_STq, with no further increase in θ_B. θ_B remains larger than θ_W, but differences in allelic frequencies gradually increase, contributing to the steady increase in Q_ST.

Changes of selection regimes and decoupling of G_STq and Q_ST

An interesting extension of the comparative in silico monitoring of G_STq and Q_ST is their response to environmental changes, which can be provoked by changing Z_OPT values assigned to the different populations. To do so, we compared two symmetric scenarios: divergent selection to uniform selection (scenarios 5 and 6) versus uniform to divergent selection (scenarios 9 and 10; Figures 7 and 8). The rapidity of the response of θ_B after generation 100 was very similar, whether the level of divergent selection was increasing or decreasing, resulting in almost a symmetric pattern (Figure 7). The same conclusions can be drawn for Q_ST, whereas the changes of selection regimes have only minor impact on G_STq (Figure 8). The same simulations were conducted under moderate divergent selection during phase B (scenario 11 and 12) and by changing the number of loci to 5 and 30, but the dynamics of G_STq and Q_ST following the changing of selection regimes were very similar to those observed in Figure 8 (data not shown).

Discussion

Evidence for the decoupling of differentiation between traits and genes

Our results support earlier findings suggesting that substantial decoupling may occur between Q_ST and G_STq (Latta, 1998, 2004; McKay and Latta, 2002). We explored in greater detail the evolutionary factors enhancing decoupling and showed that strong divergent selection and strong selection intensity within populations were the predominant drivers of the lag between Q_ST and G_STq. Furthermore, this discrepancy increases as the number of loci involved in the trait increases. The conclusions drawn from theoretical analyses can be compared with experimental estimates of Q_ST and G_STq. The genes involved in quantitative traits have not yet been fully identified, but candidate genes have been identified for various traits of adaptive significance, not only in model species. A few recent studies have compared Q_ST with G_STq estimates for candidate genes with G_ST for neutral markers. The traits of interest investigated were resistance to drought stress in Pinus pinaster (Eveno et al., 2008), bud set in P. sylvestris (Pyhäjärvi et al., 2008) and Picea abies (Heuertz et al., 2006), phenological traits (bud burst, length of growing season and leaf abscission) and growth traits (height growth and diameter) in Populus tremula (Hall et al., 2007 and Luquez et al., 2007), bud burst in Quercus petraea (Derory et al., 2010) and growth, phenology and wood characters in Picea glauca (Namroud et al., 2008). In each case, the level of differentiation of the candidate genes was similar to that of neutral markers, whereas Q_ST values were much higher. The selection of candidate genes may not have been stringent enough to identify the only genes causally involved in expression of the trait of interest, but the concordant results obtained in the various studies carried out support our theoretical predictions that differentiation at individual genes may be limited, even for traits displaying high levels of differentiation in common garden experiments. We also suspect that this may be the case in general for trees and other species with large populations, high levels of gene flow and strong divergent selection.

Transient dynamics of population differentiation

Over and above the simple decoupling of Q_ST and G_STq, our results highlight the striking differences in the dynamics of these two differentiation measurements, once selection has been established. The differences observed in the variation of θ_W and θ_B, and of Q_ST and G_STq over successive generations (illustrated in Figures 3, 4, 5, 6, 7 and 8) highlight the differences in trait and gene divergence as a result of selection. The dynamics can clearly be separated into two phases. Very early in the first phase, θ_W and θ_B follow a bell-shaped curve, reaching a maximum for θ_B (or a minimum for θ_W) after <20 generations. The increase in θ_B is followed by an increase in between-population variance (V_B), which peaks at 10–30 generations when local stabilizing selection is strong, and at 50–80 generations when local stabilizing selection is weak. During this period of increase, Q_ST increases very rapidly, whereas G_STq displays a much slower monotone increase. The second phase starts after V_B has reached its maximum value, and is characterized by a steady increase in G_STq. During this second phase, Q_ST increases more slowly, driven mostly by G_STq.

Slight variations of these general patterns were observed for the different selection scenarios, but the overall trends remained the same. In more biological terms, these patterns suggest two important conclusions: (1) that divergent selection first captures beneficial allelic associations at different loci in different populations and then targets changes in allelic frequencies and (2) that allelic associations promote rapid genetic divergence between populations more efficiently than changes in allelic frequencies.

In terms of evolution, this has the immediate consequence of very rapid differentiation at the trait level, once selection is established. The rapid response of Q_ST results from the building up of covariances between allelic effects within and between populations. θ_W decays rapidly because of recombination, whereas θ_B becomes the main driver of Q_ST during the first phase. In their review, Reznick and Ghalambor (2001) identified two principal contexts in which rapid evolution has been reported: new environments recently colonized by new populations and metapopulation structures in heterogeneous environments. In both scenarios, there are recent and new selection pressures, resulting in evolution during these early stages. These conditions match our scenario, in which populations rapidly differentiate. Most authors have tended to conclude that rapid evolution results from drastic demographic changes associated with the changes to the environment. However, our results suggest that there may also be a rapid accumulation of beneficial allelic associations. Several examples supporting rapid differentiation of traits have been reported in European tree species. Norway spruce displays the most recent wave of migration in Europe, spreading throughout southern Scandinavia over the last 2000 years by natural means or through human-mediated dispersion (Bradshaw and Lindbladh, 2005). Common garden experiments established with populations originating from these regions have shown extensive population differentiation for all traits assessed (Hannerz and Westin, 2000; Danusevicius and Gabrilavicius, 2001). Over more recent time scales, the introduction of northern red oak (Quercus rubra) into Europe over the last two centuries has been followed by the rapid divergence of the source population from the natural distribution and the introduced populations (Daubree and Kremer, 1993).

Our results have peculiar relevance in the context of climate change. Although we investigated responses to sudden changes of divergent selection in our simulations, we showed that allelic associations were immediately modified. Whether the transition was from moderate to strong divergent selection or from uniform to strong, the response elapsed during less than 10–20 generations. The lag of the response of θ_B was slightly lower under the former than the latter transition. We suspect that under more gradual changes of environmental conditions, allelic associations would be changed during a shorter period. Evolutionary response to fill the lag between new optimal values induced by climate change and extant values of populations has triggered theoretical research on adaptation to climate change (Lande and Shannon, 1996; Burger and Krall, 2004). Our results suggest that the multilocus structure of complex traits and intergenic disequilibria should be considered in predictive models of adaptive divergence.

Detection of selection signatures at the molecular level

We showed that the decoupling of Q_ST and G_STq is generated by the discrepancy between allelic covariances both within and between populations. At the molecular level, the signature of selection is therefore more visible on multilocus than on monolocus structures of diversity. At worst, during the early phase of selection, there may be no visible trace of selection at single loci, whereas strong correlations between allelic effects are expected. However, these conclusions must be considered with caution, as our analysis at the monolocus level is limited to the mean differentiation across all loci involved in the trait. We previously showed that G_STq values may differ between loci and that heterogeneity increases with gene flow and low stabilizing selection (Le Corre and Kremer, 2003). Under such circumstances, the detection of outlier loci by genome scans would be successful, but only a small subset of genes would be identified. However, selection imprints can be detected at the molecular level by another method, based on the correlation of allelic effects between populations (θ_B). It is difficult to estimate allelic effects, but the correlation of allelic frequencies may be used as a surrogate for θ_B. Correlations between allelic frequencies can be estimated from gene diversity surveys in natural populations, such as G_STq. In this respect, an appropriate initial approach would be to estimate the correlation between outlier loci detected on genome scans. Alternatively, multilocus measurements of population differentiation taking into account disequilibrium between alleles could also be used to detect multilocus structures (Kremer et al., 1997). However, these approaches may be tainted by confounding effects, generating covariances of allelic frequencies, such as demographic or historical effects. One highly predictable confounding effect may result from isolation by distance, and will affect all loci including those involved in the trait of interest. Divergent selection may follow continuous geographic gradients, resulting in an increase in VarZ_OPT along the gradient. Covariances of allelic effects may therefore be generated independently by two different mechanisms. The second constraint on explorations of θ_B or its surrogates is the very high degree of variation observed between repeats of the same scenario, suggesting the existence of a very high associated stochastic evolutionary variance (Figure 1).

Concluding remarks

In contrast to our earlier comparison between Q_ST and G_STq this review is purposely limited to outcrossing species that exhibit large population sizes and extensive gene flow such as forest trees. Indeed, the built up of substantial population differentiation for fitness-related traits that is observed in provenance tests (common garden experiments) in the context of large pollen flow has not been elucidated so far. Here, we show that the pace at which allelic associations are generated can be responsible for the level of divergence observed in extant populations. We suspect that the building up of allelic associations is enhanced by the very large genetic diversity residing within populations, which is sustained by pollen flow. These results need, however, to be confirmed or refined in a more general context beyond the single trait approach and by considering explicitly the contribution of seed versus pollen to gene flow.

References

Aitken SN, Yeaman S, Holliday JA, Wang T, Curtis-McLane S (2008). Adaptation, migration or extirpation: climatic changes outcomes for tree populations. Evol Appl 1: 95–111.
Article PubMed PubMed Central Google Scholar
Beaumont MA, Balding JB (2004). Identifying adaptive genetic divergence among populations from genome scans. Mol Ecol 13: 969–980.
Article CAS PubMed Google Scholar
Björklund M, Ranta E, Kaitala V, Bach LA, Lundberg P, Stenseth NC (2009). Quantitative trait evolution and environmental change. PLOS One 4: e4521.
Article PubMed PubMed Central Google Scholar
Bradshaw RHW, Lindbladh M (2005). Regional spread and stand-scale establishment of Fagus sylvatica and Picea abies in Scandinavia. Ecology 86: 1679–1686.
Article Google Scholar
Burger R, Krall C (2004). Quantitative-genetic models and changing environments. In: Ferriere R, Dieckmann U, Couvet D (eds). Evolutionary Conservation Biology. Cambridge University Press: Cambridge, UK, pp 171–187.
Chapter Google Scholar
Bulmer MG (1974). Linkage disequilibrium and genetic variability. Genet Res 23: 281–289.
Article CAS PubMed Google Scholar
Bulmer MG (1980). The Mathematical Theory of Quantitative Genetics. Clarendon Press: Oxford, 254p.
Google Scholar
Crispo E (2008). Modifying effects of phenotypic plasticity on interactions among natural selection, adaptation and gene flow. J Evol Biol 21: 1460–1469.
Article CAS PubMed Google Scholar
Danusevicius D, Gabrilavicius R (2001). Variation in juvenile growth rhythm among Picea abies provenances from the Baltic States and the adjacent regions. Scand J For Res 16: 305–317.
Article Google Scholar
Daubree JB, Kremer A (1993). Genetic and phenological differentiation between introduced and natural populations of Quercus rubra L. Annales des Sciences Forestières 50 (Suppl 1): 271s–280s.
Article Google Scholar
Derory J, Scotti-Saintagne C, Bertocchi E, Le Dantec L, Graignic N, Jauffres A et al. (2010). Contrasting correlations between diversity of candidate genes and variation of bud burst in natural and segregating populations of European oaks. Heredity 104: 438–448.
Article CAS PubMed Google Scholar
Eveno E, Collada C, Guevara MA, Léger V, Soto A, Díaz L et al. (2008). Contrasting patterns of selection at Pinus pinaster Ait. drought stress candidate genes as revealed by genetic differentiation analyses. Mol Biol Evol 25: 417–437.
Article CAS PubMed Google Scholar
Ewens W (1972). The sampling theory of selectively neutral alleles. Theor Popul Biol 3: 87–112.
Article CAS PubMed Google Scholar
Goudet J, Büchi L (2006). The effects of dominance, regular inbreeding and sampling design on Qst, an estimator of population differentiation for quantitative traits. Genetics 172: 1337–1347.
Article PubMed PubMed Central Google Scholar
Goudet J, Martin G (2007). Under neutrality, Qst>= Fst when there is dominance in an island model. Genetics 176: 1371–1374.
Article PubMed PubMed Central Google Scholar
Hall D, Luquez V, Garcia VM, St Onge KR, Jansson S, Ingvarsson PK (2007). Adaptive population differentiation in phenology across a latitudinal gradient in European Aspen (Populus tremula, L.): a comparison of neutral markers, candidate genes, and phenotypic traits. Evolution 61: 2849–2860.
Article PubMed Google Scholar
Hamilton DC, Cole DEC (2004). Standardizing a composite measure of linkage disequilibrium. Ann Hum Genet 68: 234–239.
Article CAS PubMed Google Scholar
Hannerz M, Westin J (2000). Growth cessation and autumn-frost hardiness in one-year-old Picea abies progenies from seed orchards and natural stands. Scand J For Res 15: 309–317.
Article Google Scholar
Hendry AP, Day T, Taylor EB (2001). Population mixing and the adaptive divergence of quantitative traits in discrete populations: a theoretical framework for empirical tests. Evolution 55: 459–466.
Article CAS PubMed Google Scholar
Heuertz M, De Paoli E, Källmann T, Larsson H, Jurman I, Morgante M et al. (2006). Multilocus patterns of nucleotide diversity, linkage disequilibrium, and demographic history of Norway spruce. Genetics 174: 2095–2105.
Article CAS PubMed PubMed Central Google Scholar
Holderegger R, Herrmann D, Poncet B, Gugerli F, Thuiller W, Taberlet P et al. (2008). Land ahead: using genome scans to identify molecular markers of adaptive relevance. Plant Ecol Divers 1: 273–283.
Article Google Scholar
Kapralov MV, Filatov DA (2006). Molecular adaptation during adaptive radiation in the Hawaiian endemic genus Schiedea. Plos One 1: e8.
Article PubMed PubMed Central Google Scholar
Kawecki TJ (2008). Adaptation to marginal habitats. Annu Rev Ecol Evol Syst 39: 321–342.
Article Google Scholar
Kirkpatrick M, Johnson T, Barton N (2002). General models of multilocus evolution. Genetics 161: 1727–1750.
PubMed PubMed Central Google Scholar
König AO (2005). Provenance research: evaluating the spatial pattern of genetic variation. In: Geburek Th, Turok J (eds). Conservation and Management of Forest Genetic Resources in Europe. Arbora Publishers: Zvolen, Slovakia, pp 275–328.
Google Scholar
Kremer A (1994). Diversité génétique et variabilité des caractères phénotypiques chez les arbres forestiers. Genet Sel Evol 26 (Suppl 1): 105s–123s.
Article Google Scholar
Kremer A, Zanetto A, Ducousso A (1997). Multilocus and multitrait measures of differentiation for gene markers and phenotypic traits. Genetics 145: 1229–1241.
CAS PubMed PubMed Central Google Scholar
Lande R, Shannon S (1996). The role of genetic variation in adaptation and population persistence in a changing environment. Evolution 50: 434–437.
Article PubMed Google Scholar
Latta RG (1998). Differentiation of allelic frequencies at quantitative trait loci affecting locally adaptive traits. Am Nat 151: 283–292.
Article CAS PubMed Google Scholar
Latta RG (2004). Gene flow, adaptive population divergence and comparative population structure across loci. New Phytol 161: 51–58.
Article CAS Google Scholar
Le Corre V, Kremer A (2003). Genetic variability at neutral markers, quantitative trait loci and trait in a subdivided population under selection. Genetics 164: 2005–2019.
Google Scholar
Lopez S, Rousset F, Shaw FH, Shaw RG, Ronce O (2008). Migration load in plants: role of pollen and seed dispersal in heterogeneous landscapes. J Evol Biol 21: 294–309.
Article CAS PubMed Google Scholar
Luquez V, Hall D, Albrectsen B, Karlsson J, Ingvarsson PK, Jansson S (2007). Natural phenological variation in aspen (Populus tremula): the Swedish Aspen Collection. Tree Genet Genome 4: 279–292.
Article Google Scholar
McKay JK, Latta RG (2002). Adaptive population divergence: markers, QTLs and traits. Trends Ecol Evol 17: 185–291.
Article Google Scholar
Morgenstern EK (1996). Geographic Variation in Forest Trees. University of British Columbia Press: Vancouver, Canada, 209p.
Google Scholar
Namroud MC, Beaulieu J, Juge N, Laroche J, Bousquet J (2008). Scanning the genome for single nucleotide polymorphisms involved in adaptive population differentiation in white spruce. Mol Ecol 17: 3599–3613.
Article CAS PubMed PubMed Central Google Scholar
Nei M (1987). Molecular Evolutionary Genetics. Columbia University press: New York, 512p.
Google Scholar
Nei M, Li WH (1973). Linkage disequilibrium in subdivided populations. Genetics 75: 213–219.
CAS PubMed PubMed Central Google Scholar
Nielsen R (2005). Molecular signatures of natural selection. Ann Rev Genet 39: 197–218.
Article CAS PubMed Google Scholar
Nosil P, Funk DJ, Ortiz-Barrientos D (2009). Divergent selection and heterogeneous genomic divergence. Mol Ecol 18: 375–402.
Article PubMed Google Scholar
Pyhäjärvi T, García-Gil R, Knürr T, Mikkonen M, Wachowiak W, Savolainen O (2008). Demographic history has influenced nucleotide diversity in European Pinus sylvestris populations. Genetics 177: 1713–1724.
Article Google Scholar
Rasanen K, Hendry AP (2008). Disentangling interactions between adaptive divergence and gene flow when ecology drives diversification. Ecol Lett 11: 624–636.
Article PubMed Google Scholar
Reznick DN, Ghalambor CK (2001). The population ecology of contemporary adaptations: what empirical studies reveal about the conditions that promote adaptive evolution. Genetica 112–113: 183–198.
Article PubMed Google Scholar
Roff DA (2002). Life History Evolution. Sinauer Associates: Sunderland Massachusetts.
Google Scholar
Santure AW, Wang J (2009). The joint effects of selection and dominance on the Qst –Fst contrast. Genetics 181: 259–276.
Article PubMed PubMed Central Google Scholar
Savolainen O, Pyhajarvi T, Knurr T (2007). Gene flow and local adaptation in trees. Annu Rev Ecol Evol Syst 38: 595–619.
Article Google Scholar
Storz JF (2005). Using genome scans of DNA polymorphism to infer adaptive population divergence. Mol Ecol 14: 671–688.
Article CAS PubMed Google Scholar
Weir BS (1990). Genetic Data Analysis. Sinauer Associates: Sunderland, MA, 377p.
Google Scholar
Wright JW (1976). Introduction to Forest Genetics. Academic Press: New York, NY, USA.
Google Scholar
Wright SI, Gaut BS (2006). Molecular population genetics and the search for adaptive evolution in plants. Mol Biol Evol 22: 506–519.
Article Google Scholar

Download references

Acknowledgements

This research was supported by the European Commission through the directorate General Research within the fifth and sixth framework programmes (Research project TREEESNIPS (QLK3-CT-2002-01973) and Network of Excellence EVOLTREE (FP6 No. 016322)). We are grateful to two anonymous reviewers for their very helpful comments and suggestions.

Author information

Authors and Affiliations

INRA, UMR1202 Biodiversité Gènes et Communautés, Cestas, France
A Kremer
Université de Bordeaux1, UMR1202 Biodiversité Gènes et Communautés, Talence, France
A Kremer
INRA, UMR1210 Biologie et Gestion des Adventices, Dijon F-21065, France,
V Le Corre

Authors

A Kremer
View author publications
You can also search for this author in PubMed Google Scholar
V Le Corre
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to A Kremer.

Ethics declarations

Competing interests

The authors declare no conflict of interest.

Additional information

Supplementary Information accompanies the paper on Heredity website

Supplementary information

Supplementary Table 1 (DOC 39 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kremer, A., Le Corre, V. Decoupling of differentiation between traits and their underlying genes in response to divergent selection. Heredity 108, 375–385 (2012). https://doi.org/10.1038/hdy.2011.81

Download citation

Received: 22 December 2010
Revised: 15 June 2011
Accepted: 27 June 2011
Published: 14 September 2011
Issue Date: April 2012
DOI: https://doi.org/10.1038/hdy.2011.81

Keywords

This article is cited by

Characterization of pollen tube development in distant hybridization of Chinese cork oak (Quercus variabilis L.)
- Meng Ke
- Huayu Si
- Yun Li
Planta (2023)
Adaptive evolution in a conifer hybrid zone is driven by a mosaic of recently introgressed and background genetic variants
- Mitra Menon
- Justin C. Bagley
- Andrew J. Eckert
Communications Biology (2021)
Genomic release-recapture experiment in the wild reveals within-generation polygenic selection in stickleback fish
- Telma G. Laurentino
- Dario Moser
- Daniel Berner
Nature Communications (2020)
Genetic patterns in Neotropical Magnolias (Magnoliaceae) using de novo developed microsatellite markers
- Emily Veltjen
- Pieter Asselman
- Marie-Stéphanie Samain
Heredity (2019)
Low genetic differentiation between two morphologically and ecologically distinct giant-leaved Mexican oaks
- Ana L. Albarrán-Lara
- Remy J. Petit
- Ken Oyama
Plant Systematics and Evolution (2019)