Comparing Bayesian estimates of genetic differentiation of molecular markers and quantitative traits: an application to Pinus sylvestris

Waldmann, P; García-Gil, M R; Sillanpää, M J

doi:10.1038/sj.hdy.6800672

Download PDF

Original Article
Published: 27 April 2005

Comparing Bayesian estimates of genetic differentiation of molecular markers and quantitative traits: an application to Pinus sylvestris

P Waldmann^1,3,
M R García-Gil^2,3 &
M J Sillanpää¹

Heredity volume 94, pages 623–629 (2005)Cite this article

825 Accesses
28 Citations
3 Altmetric
Metrics details

Abstract

Comparison of the level of differentiation at neutral molecular markers (estimated as F_ST or G_ST) with the level of differentiation at quantitative traits (estimated as Q_ST) has become a standard tool for inferring that there is differential selection between populations. We estimated Q_ST of timing of bud set from a latitudinal cline of Pinus sylvestris with a Bayesian hierarchical variance component method utilizing the information on the pre-estimated population structure from neutral molecular markers. Unfortunately, the between-family variances differed substantially between populations that resulted in a bimodal posterior of Q_ST that could not be compared in any sensible way with the unimodal posterior of the microsatellite F_ST. In order to avoid publishing studies with flawed Q_ST estimates, we recommend that future studies should present heritability estimates for each trait and population. Moreover, to detect variance heterogeneity in frequentist methods (ANOVA and REML), it is of essential importance to check also that the residuals are normally distributed and do not follow any systematically deviating trends.

Evidence of local adaptation despite strong drift in a Neotropical patchily distributed bromeliad

Article 05 May 2021

Bárbara Simões Santos Leal, Cleber Juliano Neves Chaves, … Clarisse Palma-Silva

Hybridization and geographic distribution shapes the spatial genetic structure of two co-occurring orchid species

Article 07 August 2019

Patrícia Sanae Sujii, Salvatore Cozzolino & Fábio Pinheiro

Reduced within-population quantitative genetic variation is associated with climate harshness in maritime pine

Article 23 May 2023

Juliette Archambeau, Marta Benito Garzón, … Santiago C. González-Martínez

Introduction

Many species occur in several or at least partly isolated subpopulations that are adapted to local environmental conditions. These populations are influenced by a number of forces that can alter their genetic structure. In general, genetic drift and diversifying selection will differentiate the gene frequencies, whereas gene flow, mutation and unifying selection will counteract these forces. Estimation of genetic differentiation between populations using molecular markers is an important topic in many areas of evolutionary science (Avise, 1993; Goldstein and Schlötterer, 1999). Most studies have used statistical measures derived from Wright's F-statistics (Wright (1951, 1965); see also Excoffier, 2001; Weir and Hill, 2002).

Given an island model, it has been shown that genetic differentiation between populations, expressed as Wright's F_ST, should be the same regardless of whether it is estimated from neutral single-locus marker or from a neutral quantitative trait with an additive genetic basis (Lande, 1992; Lynch, 1994). Whitlock (1999) showed that this procedure can be applied also to other types of population structures. Hence, the amount of differentiation in neutral molecular markers (F_ST) can be used as an expectation for the level of differentiation of the neutral additive genetic variance of a quantitative trait among populations (termed Q_ST by Spitze, 1993). If the value of Q_ST is larger than the corresponding value of F_ST, one can conclude that there is some evidence against the neutrality hypothesis in favour of diversifying selection among the populations. Conversely, if the value of Q_ST is lower than F_ST, one can say that unifying selection has been the prevalent force. This reasoning, of course, assumes that the mutation rates are the same at the loci involved. An increasing number of studies have compared the differentiation in molecular markers and quantitative traits, and it seems that quantitative traits generally are more differentiated than molecular markers (reviewed in Merilä and Crnokrak, 2001; Reed and Frankham, 2001; McKay and Latta, 2002). However, except for Palo et al (2003), Q_ST and F_ST estimates have so far not been compared in a statistically rigorous way.

Bayesian inferential methods have recently started to emerge for different areas in evolutionary biology (Beaumont and Rannala, 2004). They provide a number of advantages compared to classical frequentist approaches. One such advantage is a probabilistic measure of uncertainty in the form of credible intervals (CI), which can be obtained on parameter estimates and even on their functions. By contrast, maximum likelihood methods only provide standard errors around the parameters in the model (from the Fisher information matrix; Lynch and Walsh, 1998) and confidence intervals for functions of these parameters must be obtained with approximate methods (eg Podolsky and Holtsford, 1995; Waldmann and Andersson, 1998). Another important advantage with Bayesian methods is that many complex hierarchical model structures that earlier were intractable can now be easily investigated (Robert, 2001; Gelman et al, 2004). A Bayesian method for estimation of population genetic structure and F_ST on the basis of multilocus molecular markers was recently presented (Corander et al, 2003, 2004). This method considers the number of populations as an unknown quantity and determines the posterior probabilities of the structure configurations. Moreover, rather than conditioning the F_ST estimate on a single structure it provides model-averaged (robust) estimate for F_ST, where individual F_ST estimates from different population structures are weighted according to corresponding posterior probabilities (of the structure). For other approaches, into Bayesian F_ST estimation, see Holsinger (1999) and Balding (2003).

In this study, we develop a Bayesian method for estimation of the differentiation of the additive genetic variance between populations. The method is applicable to single quantitative traits that can be assumed to have an additive genetic basis. The method uses a Gibbs sampling approach (Gelfand et al, 1995) for estimation of posterior distributions, and CIs are therefore easily obtained for any function of the variance parameters. We first estimate the model averaged neutral F_ST estimate (Corander et al, 2003, 2004) in Pinus sylvestris, and then calculate the weighted Q_ST estimate (according to the posterior probabilities of the hidden structures) and compare these two estimates. Based on the results, we also discuss situations where comparison of the F_ST and Q_ST parameters might be problematic.

Materials and methods

Estimation and comparison of F_ST and Q_ST

The recently developed Bayesian method for estimation of molecular marker population genetic structure (Corander et al, 2003, 2004) treats both the number of populations and the allele frequencies of the molecular markers of each population as random variables. The posterior distribution of the population structure is estimated from the expression where all allele frequencies have been analytically integrated out. If needed, the posterior sample for the allele frequencies can then be generated afterwards. The accompanying program (BAPS) performs an exact Bayesian analysis by enumerative calculation when the number of original populations is small (less than nine). A Markov Chain Monte Carlo (MCMC) algorithm is used when there are nine or more original populations. Based on the posterior distribution of the structure parameters, a measure of uncertainty regarding the specified populations is obtained for all pairwise comparisons. BAPS can also generate the MCMC samples for the allele frequencies and the F_ST statistic, estimated under all possible population structures that are considered to be likely in the light of the data (model averaged estimate). The empirical samples in this study come from five different locations (original populations) and the enumerative calculations were therefore used for the structure parameter. The length of the MCMC chain for F_ST statistic estimation in BAPS program was based on 50 000 iterations (after a burn-in of 10 000 had been discarded). For details of this method and the program BAPS, see Corander et al (2003, 2004).

Given a polygenic trait with an additive genetic basis and a neutral island model where populations derive from a common ancestor population, it has been shown that the components of between-population variance (σ_b²) and the within-population variance (σ_w²) can be used to formulate a quantitative trait analog to the molecular F_ST statistic as Q_ST (Prout and Barker, 1989; Spitze, 1993). The molecular F_ST estimate can be used as a neutral drift expectation to which it is possible to compare the Q_ST estimate of a quantitative trait.

Estimation of the F_ST and Q_ST statistics have so far been conducted within the frequentist framework, mainly by using standard methods as ANOVA and REML (an exception is presented in Palo et al, 2003). Rigorous tests for the hypothesis that F_ST and Q_ST are equal are difficult to formulate because these statistics are often estimated from separate analyses conditionally on the original sampling design.

The only relevant way up to now has been to compare an overlap of confidence intervals around the estimates. Confidence intervals can readily be obtained from the ANOVA for balanced designs (Lynch and Walsh, 1998). However, designs are seldom balanced and only approximate confidence intervals can therefore be constructed (eg with the Delta method; Podolsky and Holtsford, 1995; Waldmann and Andersson, 1998). Bootstrap methods can also be used, but the bootstrap can be difficult to implement in multilevel hierarchical designs.

A Bayesian Gibbs sampling approach can be formulated using the hierarchical centering parameterization of Gelfand et al (1995). Consider the following nested random effects linear model

where y_ijk is the observed quantitative trait measurement of individual k belonging to family j at population i, μ the overall mean, p_i the population effect at population i, f_ij the family effect of family j in population i, and e_ijk the residual. These parameters are distributed as

The hierarchical centering parameterization is obtained by replacing p_i with γ_i and f_ij with δ_ij, so that γ_i=μ+p_i and δ_ij=μ+p_i+f_ij. Hence, γ_i is centered around μ and δ_ij is centered around γ_i. This centering has been found to provide a good mixing and convergence properties of the MCMC algorithm (Gelfand et al, 1995). In order to estimate Q_ST, σ_b² is obtained directly from the population variance σ_p², that is (σ_b²=σ_p²), whereas the family variance σ_f² has to be converted into σ_w² by multiplication of a coefficient (c) that depends on the relationship of individuals within families (σ_w²=cσ_f²). For half-sibs, full-sibs and cloned individuals c is 4, 2 and 1 (under the assumption of no dominance and epistasis), respectively. The quantitative trait analog to the molecular F_ST statistic is then estimated as (Prout and Barker, 1989; Spitze, 1993):

We implemented this model using WinBUGS14 (Spiegelhalter et al, 2003) and the code is available from the authors. Prior for μ was taken to be a very flat normal distribution with zero mean and variance 1/10⁻⁶. Recently, it has been argued that the commonly used inverse Gamma prior is not always uninformative for variance parameters (Gelman et al, 2004). Hence, we performed two separate analyses with Gamma (0.001, 0.001) and uniform (10⁺⁶, 10⁻⁶) distributions as priors for the (inverse) of the variances (1/σ_p², 1/σ_f² and 1/σ_e²).

Quantitative and molecular data from P. sylvestris

Seedlings of P. sylvestris were grown in a common garden experiment as described by García-Gil et al (2003). The experiment consisted of five populations along a latitudinal cline (67–40°N). The timing of terminal budset was scored twice per week. A terminal bud was defined as the stage when the stipules of the foliage leaves cover the shoot apex and the youngest foliage leaf are offset from the central axis of the shoot apex. The date of budset was defined as the number of days from sowing to the formation of the bud. The number of families per population was: 22 (Valsaín), 20 (Kolari, Lapinjärvi and Lithuania) and 10 (Puebla de Lillo). In all, 20 individuals per family were scored for the quantitative traits. We assume that the individuals within each family are related as half-sibs (Muona and Harju, 1989; Yang et al, 1996). For the microsatellite work, two individuals per family per population from the common garden experiment were analysed. Ten nuclear microsatellite primers, developed for Pinus taeda, were used to genotype the individuals: PtTX3025, PtTX3013, PtTX2146, PtTX2123 primers (Elsik et al, 2000) and 8846, 4516, 4527, 4528 primers supplied by Dr Auli Karhu. PtTX2146 primer amplified three different polymorphic microsatellite loci named as PtX2146A, PtX2146B and PtX2146C. Out of the ten primers, six microsatellite loci were polymorphic and were used to genotype the 180 individuals. DNA was extracted from needles using Quiagen DNAeasy plant kit. The PCR volume was 25 μl and consisted on 50 ng of genomic DNA template, 0.2 μM of each primer, 0.2 mM of each dNTP, 2.5 μl of 10 × Taq buffer (500 mM KCl, 100 mM Tris-HCl, 1% Triton X-100, Promega), 2 mM of MgCl₂ (Promega) and 2 units of Taq polymerase (Promega). Amplifications were performed using Robocycler gradient 96 (Stratagene). The amplification protocol was: 5 min at 94°C; followed by 35 cycles of 1 min at 94°C, 30 s at 50°C, 1 min at 72°C; and finally one cycle 10 min at 72°C. PCR amplifications were resolved using an ABI 377 DNA sequencer and allele scoring was performed by using the GeneScan 3.1 and Genotyper 2.5 softwares.

Results

P. sylvestris data

The allele frequencies for the original populations are summarized in Table 1. Analysis with BAPS resulted in two clusters, {Kolari, Lapinjärvi and Lithuania} and {Valsaín and Puebla de Lillo}, with a posterior probability of 0.994. The pairwise probabilities, that two populations are equal, are presented in Table 2. The mean of the posterior of the F_ST estimates between the two clusters varied between 0.00650 and 0.0727 for the loci, the overall mean being 0.0316 (95% CI: 0.0193–0.0438). We also estimated the inbreeding coefficient (F_IS) within each cluster with the Hickory 1.0 program (Holsinger and Lewis, 2003). The 95% CI of the posteriors of F_IS overlapped considerably (northern cluster: 0.0405–0.151 and southern cluster: 0.0346–0.200). Consequently, the estimated level of inbreeding was low (corresponding well with the assumed half-sib relations within families) and did not vary between the two clusters.

Table 1 Microsatellite allele frequencies for the five original Pinus sylvestris populations

Full size table

Table 2 Pairwise posterior probabilities for allele frequencies of two populations being the same

Full size table

Given that the two-cluster configuration in the BAPS analysis had a very high posterior probability, the estimation of the Q_ST differentiation of bud set date was carried out conditionally on this structure. Two parallel MCMC chains were run for 550 000 iterations for each type of priors (Gamma, uniform) for variance parameters. The first 50 000 iterations were discarded from each chain as burn-in and the chains were thinned by storing every tenth iteration based on autocorrelation plots (plots not shown). The Gelman-Rubin convergence statistic in WinBUGS1.4 strongly supported the conclusion that the chains had converged (R close to 1) for all variance parameters with both the Gamma and the uniform priors. The two priors seemed to produce identical results, and we will therefore only present the runs based on the uniform priors. The MCMC chains resulted in a posterior Q_ST density that was bimodal with peaks close to 0.2 and 1 (Figure 1). In order to investigate this further, we estimated the family variance σ_f² for each cluster (extracted from the full model). The northern cluster had a posterior mean of σ_f² of 125.9 (95% CI: 110.7–145.2), and the southern cluster a posterior mean of σ_f² of 62.93 (95%CI: 38.14–97.88). Hence, the within-cluster variation was very different for those two groups. The Pr (σ_fNorth²>σ_fSouth²) was estimated by dividing the number of iterations where (σ_fNorth²>σ_fSouth²) by the total number of iterations and found to be 0.999. We also estimated Q_ST based on the original population configuration for comparative purposes. The mean, mode and median were high (0.765, 0.817 and 0.779, respectively), but the CI of the posterior was wide (95% CI: 0.490–0.964).

Discussion

It is shown here how the Q_ST statistic, which is commonly used for estimation of differentiation in quantitative traits, can be formulated in a Bayesian framework. A Gibbs sampling approach with hierarchical centering was used in the estimation, because it has been shown to work well (Gelfand et al, 1995). When the molecular markers from P. sylvestris were subjected to the BAPS program, it was found that the estimated neutral population genetic structure consists of two clusters that correspond very well to the north/south geographic distribution. However, when trying to estimate Q_ST of the bud set data from P. sylvestris, the MCMC-chains produced a bimodal posterior of Q_ST. The evident reason for the bimodality is that the family variances differed considerably between the two clusters because of the strong cline within the northern cluster (García-Gil et al, 2003). An earlier study has found that microsatellites and other molecular markers are hardly differentiated at all, whereas timing of bud set is very different between the original populations of the northern cluster (Karhu et al, 1996). Consequently, the assumption of variance homogeneity at the family level is violated and no comparison between the Q_ST and the F_ST estimates can therefore be made.

Statistical and evolutionary assumptions behind Q_ST

One of the critical assumptions when estimating Q_ST is that the populations are evolving at the same rate, that is, that genetic drift is converting the family variance within populations to the between-population component at the same rate. It is therefore of fundamental importance to investigate a priori whether the additive genetic variance (or the heritability) differs considerably between populations. Several of the studies included in recent review articles (Merilä and Crnokrak, 2001; McKay and Latta, 2002) have reported average heritabilities. Since relatively few authors have reported both Q_ST and population-specific heritability estimates in their studies, it is difficult to evaluate if heterogeneity in within-population variance is a common problem.

Another assumption that is more challenging to verify is that the molecular markers and the genes of the quantitative traits should have the same mutation rates. Although it is inherently difficult to estimate proper mutation rates of both molecular markers and quantitative traits, it has been suggested that different marker types can have considerably dissimilar mutation rates (Balloux et al, 2000). Moreover, theoretical studies have shown that the F_ST statistics is very sensitive to variations in the mutation rate (Fu et al, 2003) and to unequal migration rates between populations (Wilkinson-Herbots and Ettridge, 2004).

The level of inbreeding was low and did not differ between the northern and southern clusters. Thus, the bias should be small for the assumption of no inbreeding when estimating Q_ST. Theoretically, it is possible to derive a Bayesian Q_ST statistic that takes inbreeding into account. Prior information from the level of inbreeding could easily be attained from molecular markers. Unfortunately, it is practically much more difficult to specify a model for the dominance variance components that are introduced by inbreeding (De Boer and Hoeschele, 1993). Hence, we did not try to implement an inbreeding Q_ST for this data set.

In Waldmann and Andersson (1998), heritability estimates varied between populations considerably for some traits (flowering date in Scabiosa canescens varied between 0.098 and 1.49), but not that much for others (flowering date in S. columbaria varied between 0.198 and 0.450). However, the confidence intervals of the heritabilities were wide in that study and overlaps between populations were common. Large variation in population-specific heritability levels were also found for some traits in two rare plants, whereas some traits displayed very similar heritability levels between populations (Petit et al, 2001). A similar result was found by Widen et al (2002) for Brassica cretica. The within-population genetic variance varied between 0.111 and 11.0 for internode length, whereas node number only varied between 12.9 and 64.4 between populations of this species.

Moreover, it should also be noticed that the (frequentist) ANOVA and REML methods produce point estimates for the variance components even in the presence of considerable variance heterogeneity. For example, an REML analysis of the bud set data in this study produced a Q_ST estimate of 0.274 (when estimated conditional on the northern and southern cluster structure). An obvious indication for variance heterogeneity can be obtained by checking that the residuals do not follow a straight line in the Normal Quantile–Quantile plot (Figure 2; see also Pinheiro and Bates, 2000).

Recently, a theoretical study by Lopez-Fanjul et al (2003) showed that Q_ST can be severely biased if there are nonadditive gene actions (dominant and/or epistatic loci contribute to the phenotype). Hence, comparison of Q_ST and F_ST for inference of the relative importance of drift and selection in population differentiation is limited to purely additive traits. In this study, we have assumed that individuals within families are related as half-sibs, which seems to be reasonable when considering the mating system of P. sylvestris (Muona and Harju, 1989; Yang et al, 1996) and the F_IS result attained with Hickory. However, it is possible that a small fraction of full-sibs are present and introduce a small amount of error due to dominance (Lynch and Walsh, 1998).

So far, no theoretical investigation has been undertaken on how different selection regimes within populations influence Q_ST. Although it has been shown that migration (or pollen dispersal) between local populations in which selection favours different trait values can maintain substantial amounts of genetic variation at the between-population level (Barton and Keightley, 2002). Regarding our data, one could suspect that divergent selection is more prevalent within the northern cluster, and that uniform selection (or drift) is the dominating force within the southern cluster where original populations occur in a rather similar environment (latitude). In fact, analyses of bud set date at the within-cluster level (using initial populations) with WinBUGS14 revealed that mean Q_ST was 0.728 for the northern cluster and 0.0979 for the southern cluster (these estimates should only be taken as very approximate because CIs were very wide).

Differentiation in pine and other forest trees

Two recent studies have compared differentiation in molecular markers and quantitative traits in different Pinus species. Yang et al (1996) compared allozyme differentiation and quantitative genetic differentiation in P. contorta ssp. latifolia and found that specific gravity, stem diameter, stem height and branch length had significantly higher Q_ST (between 0.133 and 0.195) than F_ST (0.019) values. In P. pinaster (Gonzalez-Martínez et al, 2002), the Q_ST values were very high for stem form, total height growth and survival at 30 years age (0.973, 0.791 and 0.732, respectively), and significantly higher than the allozyme F_ST (0.048). However, Gonzalez-Martinez et al (2002) also report considerably lower Q_ST estimates (0.12 and 0.20) for height growth of two other pine species from the Mediterranean.

Strong diversifying selection is apparent in several widespread tree species as Q_ST values often are much higher than neutral F_ST values, especially in timing of bud burst (Le Corre and Kremer, 2003). However, the Q_ST values are not always extremely high. Instead, it is the F_ST values that are very low. This is not surprising because many tree species are often wind pollinated and distributed over large areas. For example, a recent study in Picea glauca revealed that Q_ST estimates of 10 traits ranged between 0.035 and 0.246 (Jaramillo-Correa et al, 2001). Of those were only 8 year height (Q_ST=0.082), 13 year height (Q_ST=0.069), total wood density (Q_ST=0.102) and date of budset (Q_ST=0.246) higher than the neutral differentiation of allozymes (G_ST=0.014) and ESTPs (G_ST=0.014). Unfortunately, no heritability estimates were reported for the separate populations.

In two other theoretical studies (Latta, 1998; Le Corre and Kremer, 2003), the differentiation at neutral molecular markers, the QTL behind the trait and the adaptive trait itself was compared under different selection regimes and with different levels of gene flow. Their general conclusion was that it is more common that population differentiation shows pattern of genetic variability that differs between markers, QTL and the adaptive trait, than that the differentiation is the same. Le Corre and Kremer (2003) found that the highest disparity between the three levels occurred under highly diversifying selection and high gene flow, a situation that corresponds very well with the pine biology.

In conclusion, we have presented a Bayesian method for estimation of Q_ST that can utilize the information regarding the actual population structure estimated using neutral molecular markers. The method should work well when the quantitative traits of the populations differentiate at the same rate (ie have similar variances). However, when the heritability differs substantially between populations, the MCMC estimation may result in bimodal posteriors. This problem was illustrated by applying the methods to a data set of date to bud set in P. sylvestris. We also recommend that future studies in addition to presenting Q_ST estimates, also present between-family variance or heritability estimates for each trait and population. Moreover, when using ANOVA and REML methods, it is of particular importance to check that the residuals are normally distributed and do not follow any deviating trends. Finally, many evolutionary forces can potentially bias F_ST and Q_ST comparisons and they should therefore be interpreted with care.

References

Avise JC (1993). Molecular Markers, Natural History and Evolution. Kluwer Academic Publishers: Boston, MA.
Google Scholar
Balding DJ (2003). Likelihood-based inference for genetic correlation coefficients. Theor Popul Biol 63: 221–230.
Article PubMed Google Scholar
Balloux F, Brünner H, Lugon-Moulin N, Hausser J, Goudet J (2000). Microsatellites can be misleading: an empirical and simulation study. Evolution 54: 1414–1422.
Article CAS PubMed Google Scholar
Barton NH, Keightley PD (2002). Understanding quantitative genetic variation. Nat Rev Genet 3: 11–21.
Article CAS PubMed Google Scholar
Beaumont MA, Rannala B (2004). The Bayesian revolution in genetics. Nat Rev Genet 5: 251–261.
Article CAS PubMed Google Scholar
Corander J, Waldmann P, Marttinen P, Sillanpää MJ (2004). BAPS 2: enhanced possibilities for the analysis of genetic population structure. Bioinformatics 20: 2363–2369.
Article CAS PubMed Google Scholar
Corander J, Waldmann P, Sillanpää MJ (2003). Bayesian analysis of genetic differentiation between populations. Genetics 163: 367–374.
CAS PubMed PubMed Central Google Scholar
De Boer IJM, Hoeschele I (1993). Genetic evaluation methods for populations with dominance and inbreeding. Theor Appl Genet 86: 245–258.
Article CAS PubMed Google Scholar
Elsik CG, Minihan VT, Hall SE, Scarpa AM, Williams CG (2000). Low-copy microsatellite markers for Pinus taeda L. Genome 43: 550–555.
Article CAS PubMed Google Scholar
Excoffier L (2001). Analysis of population subdivision. In: Balding DJ, Bishop M, Cannings C (eds) Handbook of Statistical Genetics. Wiley: NY. pp 271–307.
Google Scholar
Fu R, Gelfand AE, Holsinger KE (2003). Exact moment calculations for genetic models with migration, mutation, and drift. Theor Popul Biol 63: 231–243.
Article PubMed Google Scholar
García-Gil MR, Mikkonen M, Savolainen O (2003). Nucleotide diversity at two phytochrome loci along a latitudinal cline in Pinus sylvestris. Mol Ecol 12: 1195–1206.
Article PubMed Google Scholar
Gelfand AE, Sahu SK, Carlin BP (1995). Efficient parametrizations for normal linear mixed models. Biometrika 82: 479–488.
Article Google Scholar
Gelman A, Carlin JB, Stern HS, Rubin DB (2004). Bayesian Data Analysis 2nd edn. Chapman & Hall: London.
Google Scholar
Goldstein DB, Schlötterer C (1999). Microsatellites: Evolution and Applications. Oxford University Press: Oxford, UK.
Google Scholar
Gonzalez-Martínez SC, Alia R, Gil L (2002). Population genetic structure in a Mediterranean pine (Pinus pinaster Ait.): a comparison of allozyme markers and quantitative traits. Heredity 89: 199–206.
Article PubMed Google Scholar
Holsinger KE (1999). Analysis of genetic diversity in geographically structured populations: a Bayesian perspective. Hereditas 130: 245–255.
Article Google Scholar
Holsinger KE, Lewis PO (2003) Hickory. Ver. 1.0. Department of Ecology and Evolutionary Biology, University of Connecticut, CT. Available via http://www.eeb.uconn.edu/.
Jaramillo-Correa JP, Beaulieu J, Bousquet J (2001). Contrasting evolutionary forces driving populations structure at expressed sequence tag polymorphisms, allozymes and quantitative traits in white spruce. Mol Ecol 10: 2729–2740.
Article CAS PubMed Google Scholar
Karhu A, Hurme P, Karjalainen M, Karvonen P, Karkkainen K, Neale D et al (1996). Do molecular markers reflect patterns of differentiation in adaptive traits of conifers? Theor Appl Genet 93: 215–221.
Article CAS PubMed Google Scholar
Lande R (1992). Neutral theory of quantitative genetic variance in an island model with local extinction and colonization. Evolution 46: 381–389.
Article PubMed Google Scholar
Latta RG (1998). Differentiation of allelic frequencies at quantitative trait loci affecting locally adaptive traits. Am Nat 151: 283–292.
Article CAS PubMed Google Scholar
Le Corre V, Kremer A (2003). Genetic variability at neutral markers, quantitative trait loci and trait in a subdivided population under selection. Genetics 164: 1205–1219.
CAS PubMed PubMed Central Google Scholar
Lopez-Fanjul C, Fernandez A, Toro MA (2003). The effect of neutral nonadditive gene action on the quantitative index of population divergence. Genetics 164: 1627–1633.
PubMed PubMed Central Google Scholar
Lynch M (1994). The neutral theory of phenotypic evolution. In: Real L (ed) Ecological Genetics. Princeton University Press: Princeton, NJ. pp 86–108.
Google Scholar
Lynch M, Walsh B (1998). Genetics and Analysis of Quantitative Traits. Sinauer Associates: Sunderland, MA.
Google Scholar
McKay JK, Latta RC (2002). Adaptive population divergence: markers, QTL, and traits. Trends Ecol Evol 17: 285–291.
Article Google Scholar
Merilä J, Crnokrak P (2001). Comparison of genetic differentiation at marker loci and quantitative traits. J Evol Biol 14: 892–903.
Article Google Scholar
Muona O, Harju A (1989). Effective population sizes, genetic variability, and mating system in natural stands and seed orchards of Pinus sylvestris. Silvae Genetica 38: 221–228.
Google Scholar
Palo JU, O'Hara RB, Laugen AT, Laurila A, Primmer CR, Merilä J (2003). Latitudinal divergence of common frog (Rana temporaria) life history traits by natural selection: evidence from a comparison of molecular and quantitative genetic data. Mol Ecol 12: 1963–1978.
Article CAS PubMed Google Scholar
Petit C, Freville H, Mignot A, Colas B, Riba M, Imbert E et al. (2001). Gene flow and local adaptation in two endemic plant species. Biol Cons 100: 21–34.
Article Google Scholar
Pinheiro J, Bates DM (2000). Mixed Effects Models in S and S-Plus. Springer-Verlag: New York.
Book Google Scholar
Podolsky RH, Holtsford TP (1995). Population structure of morphological traits in Clarkia dudleyana. I. Comparison of FST between allozymes and morphological traits. Genetics 140: 733–744.
CAS PubMed PubMed Central Google Scholar
Prout T, Barker JSF (1989). Ecological aspects of the heritability of body size in Drosophila buzzatii. Genetics 123: 803–813.
CAS PubMed PubMed Central Google Scholar
Reed DH, Frankham R (2001). How closely related are molecular and quantitative measures of genetic variation? A meta-analysis. Evolution 55: 1095–1103.
Article CAS PubMed Google Scholar
Robert CP (2001). The Bayesian Choice. Springer-Verlag: New York 2nd edn.
Google Scholar
Spiegelhalter DJ, Thomas A, Best N, Lunn D (2003) WinBUGS. Ver. 1.4 user manual. MRC Biostatistics Unit, Cambridge, U.K. Available viahttp://www.mrc-bsu.cam.ac.uk/bugs.
Spitze K (1993). Population structure in Daphnia obtusa: quantitative genetic and allozyme variation. Genetics 135: 367–374.
CAS PubMed PubMed Central Google Scholar
Waldmann P, Andersson S (1998). Comparison of quantitative genetic variation and allozyme diversity within and between populations of Scabiosa canescens and S. columbria. Heredity 81: 79–86.
Article CAS Google Scholar
Weir BS, Hill WG (2002). Estimating F-statistics. Ann Rev Genet 36: 721–750.
Article CAS PubMed Google Scholar
Whitlock MC (1999). Neutral additive genetic variance in a meta-population. Genet Res 74: 215–221.
Article CAS PubMed Google Scholar
Widen B, Andersson S, Rao G-Y, Widen M (2002). Population divergence of genetic (co)variance matrices in a subdivided plant species, Brassica cretica. J Evol Biol 15: 961–970.
Article Google Scholar
Wilkinson-Herbots HM, Ettridge R (2004). The effect of unequal migration rates on FST . Theor Popul Biol 66: 185–197.
Article PubMed Google Scholar
Wright S (1951). The genetical structure of populations. Ann Eugen 15: 323–354.
Article CAS PubMed Google Scholar
Wright S (1965). The interpretation of population structure by F-statistics with special regard to system mating. Evolution 19: 395–420.
Article Google Scholar
Yang RC, Yeh FC, Yanchuck AD (1996). A comparison of isozyme and quantitative genetic variation in Pinus contorta ssp. latifolia by Fst. Genetics 142: 1045–1052.
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by Academy of Finland (Grant no. 202324) and the Marie Curie fellowship (QLK5-CT-2000-51233) under the 5th Framework Programme. We are grateful to Dr O Savolainen who provided us a P. sylvestris data set, which have been collected under the support of the Environmental and Natural Resources Research Council of Finland (ES1587).

Author information

Authors and Affiliations

Rolf Nevanlinna Institute, University of Helsinki, PO Box 68, FIN-000 14, Finland
P Waldmann & M J Sillanpää
Department of Biology, University of Oulu, University of Oulu, PO Box 3000, FIN-900 14, Finland
M R García-Gil
Department of Forest Genetics and Plant Physiology, SLU, SE-901 83, Umeå, Sweden
P Waldmann & M R García-Gil

Authors

P Waldmann
View author publications
You can also search for this author in PubMed Google Scholar
M R García-Gil
View author publications
You can also search for this author in PubMed Google Scholar
M J Sillanpää
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P Waldmann.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Waldmann, P., García-Gil, M. & Sillanpää, M. Comparing Bayesian estimates of genetic differentiation of molecular markers and quantitative traits: an application to Pinus sylvestris. Heredity 94, 623–629 (2005). https://doi.org/10.1038/sj.hdy.6800672

Download citation

Received: 26 September 2004
Accepted: 17 February 2005
Published: 27 April 2005
Issue Date: 01 June 2005
DOI: https://doi.org/10.1038/sj.hdy.6800672

Keywords

This article is cited by

Stronger genetic differentiation among within-population genetic groups than among populations in Scots pine provides new insights into within-population genetic structuring
- Darius Danusevičius
- Om P. Rajora
- Algirdas Augustaitis
Scientific Reports (2024)
Development and transferability of two multiplexes nSSR in Scots pine (Pinus sylvestris L.)
- Stefana Ganea
- Sonali S. Ranade
- María Rosario García-Gil
Journal of Forestry Research (2015)
Micro-evolutionary patterns of juvenile wood density in a pine species
- Jean-Baptiste Lamy
- Frédéric Lagane
- Sylvain Delzon
Plant Ecology (2012)
Separating Effects of Gene Flow and Natural Selection along an Environmental Gradient
- Sergei Volis
- Yong-Hong Zhang
Evolutionary Biology (2010)
Technological advances in temperate hardwood tree improvement including breeding and molecular marker applications
- Paula M. Pijut
- Keith E. Woeste
- Charles H. Michler
In Vitro Cellular & Developmental Biology - Plant (2007)

Comparing Bayesian estimates of genetic differentiation of molecular markers and quantitative traits: an application to Pinus sylvestris

Abstract

Similar content being viewed by others

Evidence of local adaptation despite strong drift in a Neotropical patchily distributed bromeliad

Hybridization and geographic distribution shapes the spatial genetic structure of two co-occurring orchid species

Reduced within-population quantitative genetic variation is associated with climate harshness in maritime pine

Introduction

Materials and methods

Estimation and comparison of F_ST and Q_ST

Quantitative and molecular data from P. sylvestris