Introduction

Most complex traits show marked heritable variation in natural populations. How evolutionary forces interact to maintain such variation remains a long-standing dilemma in evolutionary biology and quantitative genetics (Johnson and Barton 2005). Although mutation and genetic drift have straightforward roles, generating and eliminating variation respectively, the effect of natural selection is more complicated (Walsh and Lynch 2018). Stabilizing selection, i.e., the selection of intermediate phenotypes, is often strong in natural populations (Hereford et al. 2004). This type of selection is expected to deplete genetic variation (Fisher 1930), either directly on the focal trait or indirectly via pleiotropic effects (Johnson and Barton 2005). Theoretical models based on the balance between mutation, drift and stabilizing selection support this idea, but they suggest lower fitness heritability values than those generally observed in empirical studies (Johnson and Barton 2005). Balancing selection encompasses various evolutionary processes that can maintain greater than neutral genetic variation within populations (Mitchell-Olds et al. 2007), such as temporally or spatially fluctuating selection pressures in subpopulations connected by gene flow (Felsenstein 1976). Since Levene’s archetypal model (1953), a large corpus of single-locus and polygenic models, most often deterministic, have generally concluded that genetic polymorphisms in spatially heterogeneous environments can only be maintained under restrictive conditions (Byers 2005, Spichtig and Kawecki 2004). In this line, McDonald and Yeaman (2018) showed with stochastic individual-based simulations that substantial within-population genetic variation can be maintained in spatially (and to a much lesser extent, temporally) heterogeneous environments, at intermediate migration rates and regardless of population size.

In empirical studies, the causes that maintain genetic variation in populations have been largely studied based on genetic and discrete-trait polymorphisms (reviewed in Hedrick 1986, 2006), but much less via quantitative genetic analysis of complex traits. Lower quantitative genetic variation in populations evolving under strong selection pressures has been found in a number of studies, e.g., for flowering time and size at reproduction in an invasive plant species (Colautti et al. 2010), for desiccation resistance in Drosophila birchii (van Heerwaarden et al. 2009) and for body mass in great tits (Charmantier et al. 2004), but not in others (e.g., Merilä et al. 2004, Stock et al. 2014). Higher genetic variation in populations evolving under spatially or temporally varying selection pressures has been found in Drosophila populations for body mass, survival and the sternopleural chaeta number (Huang et al. 2015, Mackay 1981), but not for wing shape (Yeaman et al. 2010). The lack of general trends in empirical studies can be explained by method-specific pitfalls to accurately estimate quantitative genetic variation, e.g., the genetic and environmental variances are hard to disentangle in the wild, and when estimated in common gardens, their environment-dependent nature does not allow for wide generalization of estimates (Charmantier et al. 2004, Hoffmann and Parsons 1991, Merilä et al. 2001).

Forest trees have specific life-history traits and genomic features making them interesting model species in population and quantitative genetic studies (Petit and Hampe 2006, Savolainen et al. 2007). Compared with crop species, they remain largely undomesticated (Neale and Savolainen 2004). Most forest trees are outcrossing, have high lifetime reproductive output and long generation times. They often display important gene flow among populations through long-distance pollen dispersal (Kremer et al. 2012). They show slow rates of macroevolution (i.e., low nucleotide substitution rates and low speciation rates; Petit and Hampe 2006), generally have large effective population sizes, with distributions often covering a wide range of environmental conditions (Alberto et al. 2013). Extensive work has revealed strong clines at large geographical scales in the population-specific mean values of phenotypic traits (reviewed in Benito Garzón et al. 2019, Savolainen et al. 2007), e.g., phenological traits with latitude or altitude (Alberto et al. 2011, Thibault et al. 2020) or height growth with cold hardiness (Leites et al. 2012, Rehfeldt et al. 1999). Genetic differentiation at microgeographic spatial scales has also been repeatedly observed (reviewed in Jump and Peñuelas 2005, Linhart and Grant 1996, Scotti et al. 2016), suggesting rapid rates of microevolution (Petit and Hampe 2006). Possible explanations include the fact that forest trees have high levels of genetic diversity and that most of their quantitative and neutral genetic variation is within populations (Hamrick 2004). To our knowledge, only two empirical studies investigated the potential causes underlying the maintenance of quantitative trait variation within forest tree populations. Yeaman and Jarvis (2006) showed that 20% of growth genetic variation in lodgepole pine populations was attributable to regional spatial heterogeneity, suggesting an important role of gene flow and varying selection pressures. In the neotropical oak Quercus oleoides, Ramírez-Valiente et al. (2019) found lower quantitative genetic variation of functional traits in harsher environments, but not higher quantitative genetic variation in temporally fluctuating environments. They also suggested only a marginal effect of genetic structure and diversity on the maintenance of within-population genetic variation. More empirical studies on the effects of evolutionary forces on quantitative genetic variation across natural forest tree populations are therefore needed to assess their evolutionary potential, anticipate their responses to ongoing global change, and develop sound adaptive management and conservation strategies that guarantee their future persistence.

In this study, we aimed to test three competing, but not mutually exclusive, hypotheses regarding the potential drivers maintaining or reducing quantitative genetic variation within maritime pine populations, namely: i) the most admixed populations have higher quantitative genetic variation due to exogenous gene flow from other gene pools, and this relationship is proportional to the divergence between sink and source gene pools; ii) quantitative genetic variation is lower in populations that have evolved in harsher environments, as a result of higher selection pressures in these regions; and iii) quantitative genetic variation is higher in populations that have evolved in spatially or temporally heterogeneous environments. Importantly, the last two hypotheses require the action of natural selection, while the first does not. Therefore, under the last two hypotheses, detecting changes in genetic variances will be more likely for traits more closely related to fitness, whereas under the first hypothesis, detecting changes in genetic variances will be equally likely across traits. We used growth (four height measurements), phenological (bud burst and duration of bud burst) and functional (δ13C and specific leaf area, SLA) trait data from three clonal common gardens, consisting of 522 clones (i.e., genotypes) from 33 populations, spanning all known gene pools in the species (Jaramillo-Correa et al. 2015) and genotyped for 5165 SNPs. We calculated ten population-specific indices related to our three hypotheses: two indexes describing the level and origin of admixture in the populations (estimated with SNP markers), two indexes of climate harshness (capturing summer droughts and winter cold temperatures), four indexes describing the environmental spatial heterogeneity in the forested areas surrounding the populations and two indexes describing the climatic temporal heterogeneity (i.e., annual variability in summer droughts and winter cold temperatures). We then used quantitative genetic models in a Bayesian framework to estimate for each trait the linear association among the population-specific total genetic variances (i.e., hereafter referred to as the within-population quantitative genetic variation) and each of the ten population-specific indices described above.

Materials and methods

Maritime pine, a forest tree growing in heterogeneous environments

Maritime pine (Pinus pinaster Ait., Pinaceae) is a wind-pollinated, outcrossing and long-lived tree species with large ecological and economical importance in western Europe and North Africa. Maritime pine is largely appreciated for its wood, for stabilizing coastal and fossil dunes and, as a keystone species, for supporting biodiversity (Viñas et al. 2016). The distribution of maritime pine natural populations is scattered and covers a wide range of environmental conditions. Several studies have provided evidence of genetic differentiation for adaptive traits in this species, suggesting local adaptation (e.g., de Miguel et al. 2022, González-Martínez et al. 2002). Maritime pine can grow in widely different climates: the dry climate along the northern coasts of the Mediterranean Basin (from Portugal to western Italy), the mountainous climates of south-eastern Spain and Morocco, the wetter climate of the Atlantic region (from the Spanish Iberian region to the western part of France) and the continental climate of central Spain. Maritime pine can also grow on a wide range of substrates, from sandy and acidic soils to more calcareous ones (Viñas et al. 2016).

Maritime pine presents a strong population genetic structure with six main gene pools, located in the French Atlantic region, Iberian Atlantic region, central Spain, south-eastern Spain, Corsica and northern Africa (Fig. S10; Alberto et al. 2013, Jaramillo-Correa et al. 2015). These gene pools consist of homogeneous (albeit interconnected by gene flow) genetic clusters, as shown by molecular markers (Jaramillo-Correa et al. 2015), and probably resulted from the expansion of distinct glacial refugia (Bucci et al. 2007).

Phenotypic data

Phenotypic data were collected in three clonal common gardens planted in 2011 (Fig. 1): Asturias (Spain) and Fundão (Portugal) in the Iberian Atlantic region and Bordeaux (France) in the French Atlantic region. As evidenced by the high survival rate at these sites (Table S1), the common gardens are located in environments considered favorable to maritime pine, with mild winters, high annual rainfall and relatively wet summers (Tables S11, S12, and Fig. 1). In each of these common gardens, trees belonging to 522 clones (i.e., genotypes) from 33 populations, covering the six known gene pools in the species (Fig. S10), were planted following a randomized complete block design with 8 blocks. To obtain the clones, trees at least 50 m apart were sampled in natural stands, and one seed per tree was planted in a nursery and vegetatively propagated by cuttings (see Rodríguez-Quilón et al. 2016 for details). Clones were therefore considered unrelated.

Fig. 1: Location of the three common gardens and the 33 populations used in the study.
figure 1

The colors represent the gradients of mean coldest month temperature (MCMT) and summer (May-September) precipitation (SP) over the period 1901-1950 within the maritime pine range. These gradients were obtained by performing a centered and scaled principal component analysis on the MCMT and SP values. The maritime pine distribution combines the EUFORGEN distribution (http://www.euforgen.org/) and 10-km radius areas around the National Forest Inventory plots with this species.

One growth trait, height, was measured in all common gardens and at different tree ages (Table S1). Two phenology-related traits, the mean bud burst date over four years and the mean duration of bud burst over three years, were measured in Bordeaux and were averaged over several years to suppress differences across years and approximate a normal distribution of their trait values (Table S1). Bud burst corresponds to the date of brachyblast emergence in accumulated degree-days (with base temperature 0 C) from the first day of the year to account for between-year variability in temperature. The duration of bud burst corresponds to the number of degree-days between the beginning of bud elongation and the total elongation of the needles (see Hurel et al. 2021). Last, two functional traits, δ13C and the specific leaf area (SLA) were measured in Fundão (Table S1). For each trait and each population, the number of trees and clones used in the analyses, the raw phenotypic means and variances, and histograms of trait distribution are provided in section 1.1 of the Supplementary Information. Prior to analyses, some traits were log-transformed to get closer to normality (Fig. S1) or mean-centered to help model convergence (Table S1).

SNP genotyping and population admixture

The 522 clones planted in the Asturias common garden were genotyped with the Illumina Infinium assay described in Plomion et al. (2016), resulting in 5,165 high-quality polymorphic SNPs. There were on average only 3.3 missing SNP values per genotype (ranging between 0 and 142). For each clone, the proportions of ancestry from each of the six known gene pools were estimated in Jaramillo-Correa et al. (2015) using the Bayesian approach available in Structure (Pritchard et al. 2000), and were then averaged by population (Table S14). For each population, the main gene pool was defined as the gene pool constituting the higher proportion of population ancestry whereas the other gene pools were considered as a product of (past or present) exogenous gene flow. The accuracy of the assignment of each population to a given gene pool was validated by observing that populations with the same main gene pool ancestry were located close to each other. First, we calculated a population admixture score A, as the proportion of ancestry from foreign gene pools (Table S14). Second, we calculated a population admixture score D that considers both the proportion of foreign ancestries and the divergence between the main and foreign gene pools (Table S14). For that, we weighted the proportions of ancestry from foreign gene pools by the sum of the allele frequency divergence of the main and foreign gene pool from the common ancestral one (Fk, which should be numerically similar to FST; Falush et al. 2003). We developed D considering that some gene pools are more divergent than others and thus may bring higher genetic diversity to an admixed population at the same level of introgression. A was highly correlated with D (Pearson correlation coefficient of 0.91; Table S14), and also with a related index that used weights based on pairwise FST (Table S14). A and D were mean-centered and divided by their standard deviation prior to analyses.

Climate harshness and temporal and spatial heterogeneity

To describe the winter temperature and drought summer conditions under which the populations have evolved, we extracted the annual values of the mean coldest month temperature (MCMT) and the summer (May-September) precipitation (SP) at the population location for the period 1901-1950 using the bilinear interpolation of the Climate Downscaling Tool (https://www.ibbr.cnr.it/climate-dt/). SP was selected to better understand how trees adapt in areas subjected to frequent summer droughts, which is key to anticipate tree population responses to ongoing climate change, while MCMT was chosen because marked patterns of cold adaptation have been previously reported in maritime pine (e.g., Grivet et al. 2011). The means of the annual values of MCMT and SP over the period considered were used as two indexes of climate harshness while their variances were used as two indexes of temporal heterogeneity.

In addition, to describe the environmental spatial heterogeneity around the population location, we used raster files of climatic, topographic and soil data at 1-km resolution. Climatic raster data correspond to climatic variables from the ClimateEU database (Marchi et al. 2020) and averaged over the period 1901-1950. Topographic raster data were generated from NASA’s Shuttle Radar Topography Mission (SRTM) and SAGA v 2.3.1 (Conrad et al. 2015), and were used to calculate the topographic ruggedness index (TRI), which quantifies the terrain heterogeneity, i.e., differences in elevation among adjacent cells (Riley et al. 1999). Soil raster data were downloaded from the European Soil Database (Hiederer et al. 2013). All environmental variables used are listed in Table S15.

We extracted raster cell values of the environmental variables within a 20-km radius around each population location, and kept only raster cells that fell within forested areas, to avoid including environmental data from non-suitable areas (e.g., lakes, mountain peaks; section 1.4.2 of the Supplementary Information). We then performed a principal component analysis (PCA) on the set of the selected raster cells and extracted the PC1 and PC2 scores of each cell, accounting for 45.2% and 34.1% of the variance, respectively (Fig. S11). PC1 mainly captured precipitation and maximum temperature conditions in summer, while PC2 primarily represented minimum temperature conditions in winter. To obtain the indexes of spatial environmental heterogeneity, we calculated the variances of the PC1 and PC2 scores in a 20-km and 1.6-km radius around each population location (Figs S12 and S13). The environmental heterogeneity indexes were only very weakly correlated (Pearson correlation coefficients lower than 0.36) with the number of forested cells (i.e., the area considered to calculate the indexes), ensuring that the estimated effects of spatial heterogeneity in further analyses were not due to the area per se (Stein et al. 2014, Triantis et al. 2003).

The population-specific values of the spatial and temporal heterogeneity and climate harshness indexes are shown in Tables S16 and S17. All ten indexes were mean-centered and divided by their standard deviation prior to analyses.

Univariate quantitative genetic models

We constructed ten models for each trait y, that is one model per potential driver of the within-population genetic variance (i.e., the two admixture scores, the six environmental heterogeneity indexes and the two climate harshness indexes). Each model was derived from the following baseline quantitative genetic model:

$${y}_{bpcr}={{{\mathcal{N}}}}\left({\beta }_{0}+{B}_{b}+{P}_{p}+{C}_{c(p)},{\sigma }_{r}^{2}\right)$$
(1)

\({\sigma }_{r}^{2}\) is the residual variance. β0 is the global intercept with a weakly informative prior; \({\beta }_{0} \sim {{{\mathcal{N}}}}(\overline{y},2),\overline{y}\) being the mean of the observed trait values. Bb, Pp and Cc(p) are the block, population and clone (nested within population) varying intercepts; they are distributed as follows:

$$\begin{array}{rcl}\left[\begin{array}{c}{B}_{b}\\ {P}_{p}\end{array}\right]& \sim &{{{\mathcal{N}}}}\left(0,\left[\begin{array}{c}{\sigma }_{B}^{2}\\ {\sigma }_{P}^{2}\\ \end{array}\right]\right)\\ {C}_{c(p)}& \sim &{{{\mathcal{N}}}}(0,{\sigma }_{{C}_{p}}^{2})\end{array}$$
(2)

where \({\sigma }_{B}^{2}\) and \({\sigma }_{P}^{2}\) are the variance among blocks and populations, respectively, and \({\sigma }_{{C}_{p}}^{2}\) are the population-specific variances among clones (i.e., population-specific total genetic variances), which we refer to herein as the within-population quantitative genetic variation. The total variance \({\sigma }_{tot}^{2}\) is partitioned as follows:

$$\begin{array}{lll}{\sigma }_{tot}^{2}&=&{\sigma }_{r}^{2}+{\sigma }_{B}^{2}+\overline{{\sigma }_{{C}_{p}}^{2}}+{\sigma }_{P}^{2}\\ {\sigma }_{r}&=&{\sigma }_{tot}\times \sqrt{\left.\right(}{\pi }_{r}\left.\right)\\ {\sigma }_{B}&=&{\sigma }_{tot}\times \sqrt{\left.\right(}{\pi }_{B}\left.\right)\\ {\sigma }_{P}&=&{\sigma }_{tot}\times \sqrt{\left.\right(}{\pi }_{P}\left.\right)\\ \overline{{\sigma }_{{C}_{p}}}&=&{\sigma }_{tot}\times \sqrt{\left.\right(}{\pi }_{C}\left.\right)\\ 1&=&{\pi }_{r}+{\pi }_{B}+{\pi }_{C}+{\pi }_{P}\\ {\sigma }_{tot}& \sim &{{{{\mathcal{S}}}}}^{* }(0,1,3)\end{array}$$
(3)

where \(\overline{{\sigma }_{{C}_{p}}^{2}}\) is the arithmetic mean of the population-specific among-clone variances (\({\sigma }_{{C}_{p}}^{2}\)), and \(\overline{{\sigma }_{{C}_{p}}}\) is the quadratic mean of the standard deviations. \({\sigma }_{{C}_{p}}\) is expressed as follows:

$${\sigma }_{{C}_{p}} \sim {{{\mathcal{LN}}}}\left(\ln (\overline{{\sigma }_{{C}_{p}}})-\frac{{\sigma }_{K}^{2}}{2}+{\beta }_{X}{X}_{p},{\sigma }_{K}^{2}\right)$$
(4)

\({\sigma }_{K}^{2}\) is the variance of the population-specific among-clone standard deviations \({\sigma }_{{C}_{p}}\), with \({\sigma }_{K} \sim \exp (1)\) (see section 2 in the Supplementary Information for more details). Xp is one of the ten potential drivers and βX stands for the linear association between \({\sigma }_{{C}_{p}}\) and the potential driver considered, with \({\beta }_{X} \sim {{{\mathcal{N}}}}(0,1)\).

The modeling approach described above has two key advantages. First, estimating the total genetic variances for each trait (i.e., \({\sigma }_{{C}_{p}}^{2}\) coefficients) and their association with the ten potential drivers (i.e., βX coefficients) in a one-step quantitative genetic model within a Bayesian framework allowed us to propagate the uncertainty in the parameter estimates. Second, we used weakly informative priors, which are a mean to statistically regularize the parameters and thus avoid getting aberrant values (Lemoine 2019). These priors are particularly useful to deal with small sample sizes and therefore we were able to estimate the total genetic variances for each population despite the small number of clones in some of them. We evaluated the model ability to recover the true parameter values under the assumption that the mathematical model represents the biological processes at stake (i.e., how the real data was generated). For each common garden, we generated 100 simulated height samples with the same sample sizes as the real experimental design (see Tables S3, S4 and S5), and based on the mean and variance of the observed height data. We extracted the mean standard error and bias error of the estimates, and the coverage of the 80% and 95% credible intervals, to compare the model estimates with the true parameter values (see details in section 3 of the Supplementary Information).

It can be noted that the estimates of the total genetic variances may be inflated by C effects (environmental effects common to a group of relatives such as clones, Lerner 1958). However, we expect C effects to inflate the total genetic variances uniformly across populations and are therefore unlikely to bias the βX estimates.

Model specification and fit were performed using the Stan probabilistic programming language (Carpenter et al. 2017), based on the no-U-turn sampler algorithm. Models were run with four chains and 2,500 iterations per chain (including 1,250 warm-up samples not used for the inference). All analyses were undertaken in R version 3.6.3 (R Core Team 2020) and scripts are available at https://github.com/JulietteArchambeau/H2Pinpin.

Validation analysis for height and SLA in a family-based progeny test

To validate our results, we used height and SLA measurements from a progeny test near Asturias (thus in a similar environment to the nearby clonal common garden of our study) which shares 23 populations with the CLONAPIN common garden network (progeny test data kindly provided by Dr. Ricardo Alia, CSIC, Madrid). As the progeny test is based on families, we were able to estimate the additive genetic variance within populations (and not only the total genetic variance) for height at 3 and 6 years and SLA at 5 years (see section 10 of the Supplementary Information for more details).

Results

When comparing the estimated height values to the true values (σK = 0.1 and βX = 0.1) over the 100 simulated samples, the mean standard error was around 0.06 for σK and 0.05 for βX, the mean bias error was around 0.01 for σK and and between −0.004 and 0.006 for βX, the coverage of the 80% credible interval was higher than 93% for σK and higher than 76% for βX, and the coverage of the 95% credible interval was higher than 99% for σK and 96% for βX (Table S19, see also Figs S14S16). These simulations therefore showed that, under the assumption that the statistical model reflects the processes at work, our model displayed a satisfactory accuracy to be used in the following analyses.

The proportion of variance explained by the models (i.e., the sum of the among-population, among-clone and among-block variances) and the variance partitioning varied broadly across traits (see section 5 in the Supplementary Information). More specifically, the models explained between 40% and 50% of the variance for phenology-related traits, between 30% and 40% for functional traits, and from 20% for height in Fundão to almost 60% for height in Bordeaux at 85-month old (Fig. S17). Residual variance explained most of the variance for all traits, except for height in Bordeaux at 85-month old, where ~ 40% of the variance came from variation among populations, ~ 40% from residuals and the remaining ~ 20% from variation among clones (Fig. S29). Variation among populations was higher than variation among clones for height and δ13C (Figs. S21, S25, S29, S33 and S49, but not for SLA and phenology-related traits (Figs. S45, S37 and S41).

We found that populations experiencing colder winter conditions (i.e., lower MCMT) consistently displayed lower quantitative genetic variation for early height growth in the three common gardens (Fig. 2). Holding all other parameters constant, a one-standard deviation decrease in the mean temperature of the coldest month over the period considered was associated, on average, with a 42%, 21% and 25% decrease of \({\sigma }_{{C}_{p}}\) for height in Fundão, Bordeaux at 25-month old and Asturias, respectively (see details of the calculation in the section 4 of the Supplementary Information). Interestingly, this association was not found in Bordeaux when the trees were older (i.e., at 85-month old; Fig. 2). The negative association between colder winter conditions and quantitative genetic variation was also detected in a model combining height data from the three common gardens (except Bordeaux at 85-month old; Fig. S52), and in which the genotype-by-environment interaction explained only 3.7% of the phenotypic variance (Fig. S54). Unexpectedly, populations experiencing colder winter conditions also showed higher genetic variation for SLA (Fig. 2). The detected associations were unlikely to have been biased by unequal sample sizes across clones and populations, as detailed in section 11 of the Supplementary Information.

Fig. 2: Median and 95% credible intervals of the βX coefficients, which stand for the association among within-population quantitative genetic variation and its potential underlying drivers on the x-axis (see equation (4)).
figure 2

A and D are the two admixture scores; mean(MCMT) and mean(SP) are the two indexes of climate harshness and correspond to the mean of the annual values of the mean coldest month temperature (MCMT) and the summer precipitation (SP) over the period 1901–1950, respectively; var(MCMT) and var(SP) are the two indexes of temporal heterogeneity and correspond to the variance of the annual values of MCMT and SP over the period 1901–1950; finally, SH1[20km], SH2[20km], SH1[1.6km] and SH2[1.6km] are the indexes of spatial heterogeneity in a 20-km and 1.6-km radius around the population location calculated as the variance of the first two principal components of a PCA summarizing the environmental variation (see details in section 1.4.2 of the Supplementary Information). Colors stand for the different traits under study and the shapes for the different types of traits, i.e., height (circles), phenology-related traits (triangles) and functional traits (squares).

Importantly, in the validation analysis using a family-based progeny test, we also found that populations experiencing colder winter conditions have lower additive genetic variation for height at 3-year old, but not at 6-year old, and we did not find any association with the other potential drivers (Fig. 3). In addition, the negative association between cold winter temperatures and the within-population quantitative genetic variation for SLA was not replicated in the validation analysis (Fig. 3). Therefore, we consider our results to provide only robust support for a decrease in genetic variation for early height growth under colder winter conditions. The other detected associations for early height growth were not consistently replicated across the three common gardens or in the validation analysis (Figs. 2 and 3), and thus we do not interpret them in the discussion.

Fig. 3: Validation step using height and SLA measurements from a family-based progeny test near Asturias (data kindly provided by Dr. Ricardo Alı́a, CSIC, Madrid).
figure 3

Median and 95% credible intervals of the βX posterior distributions are shown. In the validation analysis, βX coefficients stand for the association among within-population additive genetic variation and its potential underlying drivers on the x-axis. A description of the drivers can be found in the legend of Fig. 2.

Discussion

How quantitative genetic variation is maintained within populations remains a long-standing open question that has been extensively explored in theoretical work but still lacks sufficient empirical evidence (Johnson and Barton 2005). We found that genetic variation for early height growth in maritime pine is lower in populations exposed to colder winters, which is consistent with some theoretical models predicting that quantitative genetic variation in fitness-related traits is lower in populations under strong selection (Fisher 1930). However, we did not find this pattern for later growth, phenology-related traits or functional traits, and we propose below different nonexclusive explanations for this. Moreover, we did not find higher genetic variation in populations located in temporally or spatially variable environments for any trait, which goes against predictions from theoretical or simulation work (McDonald and Yeaman 2018, Walsh and Lynch 2018) and an empirical study in lodgepole pine (Yeaman and Jarvis 2006). Admixed populations did not show higher quantitative genetic variation, suggesting that the observed patterns are not confounded by historical gene flow among distinct gene pools. Empirically-based detection of the potential footprints of natural selection on within-population quantitative genetic variation is much needed to understand how populations are adapted to their current environments and will evolve under changing conditions.

Lower genetic variation for early height growth under colder winters

Genetic variation for early height growth was lower in populations experiencing colder winters (Figs. 2 and S52). This result is in line with the hypothesis that strong stabilizing selection in harsh environments depletes quantitative genetic variation within populations (Fisher 1930) and echoes similar results in another forest tree, Q. oleoides. For this Mesoamerican white oak species, Ramírez-Valiente et al. (2019) found lower genetic variation averaged over functional and growth traits in populations experiencing low precipitation and high temperatures during the dry season. The importance of cold temperatures as a driver of height genetic variation in maritime pine is supported by the association between candidate-gene allele frequency and temperature gradients in other studies (Grivet et al. 2011, Jaramillo-Correa et al. 2015), suggesting a major role of minimum temperatures in the species adaptive evolution. Lower quantitative genetic variation in environments with cold winters may therefore reflect adaptation to local conditions, but it may also alter the ability of populations to respond to the new selection pressures induced by climate change (Lande and Shannon 1996, Storfer 1996) and decrease the fitness of individuals through the expression of inbreeding depression (but see Carley et al. 2022).

Mean coldest month temperatures were highly correlated with altitude in our study (Pearson’s correlation of 0.79), and adaptation patterns along altitudinal gradients are common in forest trees (e.g., Kurt et al. 2012). Therefore, we cannot exclude that the association between genetic variation for early height growth and coldest month temperatures is triggered by more complex environmental factors typical of high altitude conditions (e.g., reduced vapor pressure deficit, higher maximum solar radiation; Körner 1995).

Differences in genetic variation for early height growth among populations were unlikely to originate from the expression of hidden genetic variation in novel environments (i.e., cryptic genetic variation caused by genotype-by-environment interactions; Gibson and Dworkin 2004, Schlichting 2008). First, the lower genetic variation for early height growth in populations experiencing colder winters was consistent across the three common gardens (i.e., independent of their environmental conditions), which was confirmed by the low genotype-by-environment interaction estimated in a complementary analysis (see section 8 of the Supplementary Information; Archambeau et al. 2022). Second, populations with higher quantitative genetic variation for early height growth were not those with larger differences in winter temperatures between their locations of origin and the common garden (i.e., higher climatic transfer distances; Fig. S50). In contrast, the expression of cryptic genetic variation may explain why populations with larger climatic transfer distances for winter temperatures showed higher quantitative genetic variation for SLA in Fundão (see section 6 of the Supplementary Information). From an evolutionary perspective, this pattern requires attention as cryptic genetic variation can enhance the evolutionary potential of populations by generating new phenotypes that are then available for natural selection (Ledón-Rettig et al. 2014), and may ultimately get fixed in the population, a process called ‘genetic assimilation’ (Waddington 1953).

Most importantly, the validation analysis in a family-based progeny test provided further evidence that additive within-population genetic variation for height was lower in populations experiencing colder winters for young trees but not for older trees (Fig. 3). This supports the robustness of our study and suggests that our results were unlikely to be biased by considering the total variance instead of the additive one, which was somehow expected as two previous studies in maritime pine found low non-additive effects for growth (Gaspar et al. 2013), and height and diameter (Lepoittevin et al. 2011).

Potential effects of demographic processes

Historical demographic processes, by generating population structure or changes in effective population size independently from natural selection, can also shape patterns of genetic variation (Lawton-Rauh 2008). Theory predicts that small and isolated populations are likely to lose genetic variation more rapidly than large populations, due to genetic drift and limited gene flow from other populations. In this line, empirical studies have provided evidence that genetic diversity (measured with DNA markers) increases with population sizes (Frankham 1996, Leimu et al. 2006), although the magnitude of differences in population genetic diversity is marginal relative to the magnitude of changes in population size, a paradox which remains unresolved (Buffalo 2021). The relationship between quantitative genetic variation and population size is less clear. In their review, Willi et al. (2006) found that reduced quantitative genetic variation in small populations was supported in laboratory experiments in which populations evolve at different sizes, whereas most field studies do not support this prediction (see also Wood et al. 2015). In addition, individual-based simulations revealed that the additive genetic variance was less altered by a 10-fold change in population size than a reduction in migration (McDonald and Yeaman 2018).

Estimating effective population sizes of large and widespread forest tree populations is particularly challenging due to extensive sampling requirements (i.e., sufficient spatial coverage, sample sizes, and genotyping intensity), and thus current estimates are probably unreliable (Santos-del Blanco et al. 2022). Therefore, although population sizes are normally large in maritime pine, we can not rule out the hypothesis that low effective population sizes in harsh environments, due to stochastic demographic events or recent colonization history, may have caused the lower genetic variation observed for early height growth in our study. However, several observations makes this hypothesis unlikely. First, reduced genetic variation was only observed for early height growth and not for the other traits, which is more likely to occur under the hypothesis that the underlying process is natural selection, rather than demographic factors (under this second hypothesis, all traits would have the same probability of experiencing a decrease in variance). Second, we did not find any association between quantitative genetic variation and genetic diversity estimated with molecular markers (i.e., expected heterozygosity) for any trait (see section 7 of the Supplementary Information; Rodríguez-Quilón et al. 2015). Third, colonization history is unlikely to explain the reduced genetic variation for early height growth in populations experiencing the coldest winter conditions, as these populations are located in central and southeastern Spain, which are probably among the first regions recolonized after the last glaciation (Bucci et al. 2007).

Populations in harsh environments may also be more isolated, and therefore less likely to benefit from increased quantitative genetic variance as a result a gene flow from nearby populations (as shown for instance in Reid and Arcese 2020). In our study, historical gene flow among gene pools most likely had a negligible impact on the expression of quantitative genetic variation in maritime pine populations, as the most admixed populations did not show higher quantitative genetic variation for any trait (even when the divergence between source and sink gene pools was accounted for). However, to our knowledge, evaluating the effect of contemporary gene flow on quantitative genetic variation using quantitative genetic models (such as the one used in our study) was until recently only possible using pedigree data (but see Aase et al. 2022), and therefore we were not able to rule out the hypothesis that recent exogenous gene flow in some populations may have induced an increase in their genetic variance independently of natural selection.

No association with spatial or temporal heterogeneity

Populations from spatially heterogeneous environments did not show higher genetic variation for any trait (Fig. 2), which was also the case in the validation analysis (Fig. 3). This goes against a previous estimate in lodgepole pine suggesting that up to 20% of the genetic variation in growth within populations is explained by spatial heterogeneity (Yeaman and Jarvis 2006). A potential explanation of this discrepancy is the smaller experiment size in our study compared to that of Yeaman and Jarvis (103 populations with an average of 28 planting sites per population). However, in our study, we obtained reasonable credible intervals for most traits (allowing the detection of associations with other drivers) and data simulations suggested that our models have adequate power, rendering this explanation unlikely. The discrepancy with Yeaman and Jarvis (2006) may also stem from the use of younger trees in our study. Indeed, the effect of environmental heterogeneity on the within-population quantitative genetic variation might be age-dependent, as might be the case also for the associations we detected with cold winters (see the section below for potential explanations of this phenomena).

Another explanation is that genetic variation within populations is not affected by the environmental heterogeneity at the regional scale imposed by the 1 × 1 km resolution of our climate dataset but at finer spatial scales (also discussed in Yeaman and Jarvis 2006). Indeed, populations can adapt along microgeographic environmental gradients despite the homogenizing effect of gene flow (Richardson et al. 2014), even for forest tree populations with their long-generation times and large effective population sizes (Scotti et al. 2016). However, a correlation between regional and microgeographic environmental heterogeneity across the maritime pine range is very likely: populations showing the highest environmental heterogeneity in our study were located in mountainous areas in which we also expect higher microgeographic variation, e.g., the Cómpeta population (COM) located in the Tejeda and Almijara mountains (southern Spain), the Arenas de San Pedro population (ARN) located in the Sierra de Gredos (central Spain) or the Pineta population (PIE) located close to the Punta di Forchelli (Corsican mountains), while populations with the lowest environmental heterogeneity were located on flat plateaus, e.g., populations from the Landes plateau and the Atlantic coastal regions in France (HOU, MIM, PET, VER, OLO, STJ, PLE), and populations from the central Spain plateau near to Segovia (CUE, COC, CAR). Thus, even if genetic variation was maintained by migration-selection balance at microgeographic scales, we would have been able to detect the effect of environmental heterogeneity at the regional scale. Last, genetic variation within populations might also be affected by environmental heterogeneity at even greater scales than the one considered in our study, since, for instance, some potential environmental drivers of adaptation (e.g., wind, precipitation) vary at large scales. However, for these factors to affect genetic variation at the local scale substantial gene flow among distant populations would be needed, which does not seem the case in maritime pine, a species characterized by strong genetic differentiation of gene pools.

Finally, a last explanation is related to the different biological features between lodgepole pine and maritime pine. Lodgepole pine has extensive gene flow and low population structure (FST = 0.016 in Yeaman et al. 2016) while maritime pine shows restricted gene flow with strong population structure (at least six distinct gene pools and FST = 0.112; Jaramillo-Correa et al. 2015; our study) and fragmented distribution (Alberto et al. 2013). Pollen dispersal kernels in maritime pine are highly leptokurtic, as for other wind-pollinated pines (Robledo-Arnuncio and Gil 2005, Schuster and Mitton 2000), with estimated mean dispersal distances from 78.4 to 174.4m (de Lucas et al. 2008). Interestingly, McDonald and Yeaman (2018) showed that high levels of quantitative genetic variance can be maintained when a trait is under stabilizing selection only at intermediate levels of migration. Migration rates in maritime pine may therefore not be strong enough to compensate for the purifying effect of natural selection in spatially heterogeneous environments, especially in mountainous areas which may represent barriers to gene flow and where populations are more isolated (see González-Martínez et al. 2007 for maritime pine). Meanwhile, in the homogeneous plateaus of the Landes forest and central Spain, natural selection may be low because conditions are more favorable, and these populations are less isolated, which may maintain genetic variation at levels similar to those of populations in heterogeneous landscapes. Investigating local adaptation and gene flow at microgeographic scales in natural populations of maritime pine located in both homogeneous and heterogeneous environments would be highly valuable to understand why spatial environmental heterogeneity does not seem to play a major role in maintaining genetic variation in this species. Moreover, conducting similar analyses in sister species such as Scots pine, with low population genetic structure and continuous populations (Alberto et al. 2013), could help to determine whether genetic variation in forest tree populations experiencing higher migration rates are more prone to be impacted by spatial environmental heterogeneity.

The lack of robust support for an association between within-population quantitative genetic variation and temporal climatic heterogeneity was more expected than for spatial heterogeneity. Indeed, theory predicts lower levels of genetic variation for fitness under temporal heterogeneity than under spatial heterogeneity, though still higher than under constant environments (Felsenstein 1976, Levene 1953). These predictions were supported by empirical studies (e.g., Huang et al. 2015; but see Mackay 1981) and individual-based simulations showing that, in contrast to spatially heterogeneous environments, temporally heterogeneous environments can only marginally increase the additive genetic variance within populations (McDonald and Yeaman 2018). In forest trees, Ramírez-Valiente et al. (2019) found no association between quantitative genetic variation measured in 1-year-old seedlings of Q. oleoides and temporal environmental heterogeneity. One possible explanation is that long-lived, sessile species, such as forest trees, respond to temporal heterogeneity primarily through phenotypic plasticity, resulting in the selection of the most plastic genotypes in temporally fluctuating environments rather than the selection of a wide range of genotypes adapted to different conditions.

Potential causes underlying contrasting patterns across traits

The consistent association between within-population genetic variation and cold winter temperatures for early height growth was in line with the initial hypothesis that the effect of natural selection on quantitative genetic variation is likely to be stronger (and thus easier to detect) for traits more closely related to fitness. Indeed, height growth is an integrative trait that can be seen as the end-product of multiple ecophysiological processes both genetically and environmentally determined (Grattapaglia et al. 2009). Taller trees compete more efficiently for light, water and nutrients (Morgenstern 1996), and are also more likely to have high fecundity (Aitken and Bemmels 2015, Rehfeldt et al. 1999, Wu and Ying 2004). As such, Younginger et al. (2017) reviewed 170 studies in herbs, shrubs and trees and showed that, for conspecifics of the same age class, larger plants have generally a higher fitness, i.e., they have more reproductive output and are therefore more likely to produce viable offspring. In addition, as maritime pine is a pioneer species, selection pressures might be particularly high in the early growth stages and might be exacerbated by increased competition under harsh conditions (even at the expense of future tree longevity; Bigler and Veblen 2009). Interestingly, no association between within-population genetic variation and climate harshness was found for later height growth. It may result from the weaker selection pressures for later growth as competition for light and nutrients is probably less intense after the removal of most seedlings in the early selection phase.

Furthermore, the different patterns obtained for early and later height growth highlight that ontogenic stage is a key parameter to consider when studying the evolutionary forces driving the quantitative genetic variation in forest tree populations. In this vein, genetic parameters in forest trees have been shown to vary with age; e.g., heritability generally increases with age until reaching a plateau, especially for height-related traits (e.g., Johnson et al. 1997, Kroon et al. 2011). In maritime pine, an increase in heritability with age was found in Costa and Durel (1996) but not in Kusnandar et al. (1998). Interestingly, the expression of age-dependent heritabilities for another growth trait, diameter growth, was associated with spacing among trees in common gardens of Pinus radiata (Lin et al. 2013).

Leaf phenology-related traits exhibit steep adaptation gradients in forest trees and have a relatively high heritability, e.g., 0.15–0.51 for bud burst in pedunculate oak (Scotti-Saintagne et al. 2004), ~0.36 for bud burst and ~0.24 for the duration of bud burst in our study (Table S20). Gauzere et al. (2020) showed that both the mean and the variance of leaf phenology-related traits varied along an altitudinal gradient in natural oak populations, with populations at high altitude having a narrower fitness peak. We might therefore have expected lower genetic variation for leaf phenology-related traits in populations experiencing cold winters (and at higher altitude; see also Alberto et al. 2011). A biological explanation for this could be the weaker link with fitness in young trees for phenology-related traits when compared to early height growth, implying that they are less prone to a decrease in their genetic variance caused by strong selection pressures. Another explanation is that variation in phenology-related traits is likely to be controlled by genotype-by-environment interactions in forest trees (e.g., opposite genetic clines in common gardens and natural populations; Vitasse et al. 2009) and therefore such association may be hidden in the unique environment of one common garden. Last, since phenology-related traits are derived from discrete measurements, their distribution deviates more from a normal distribution than that of height, requiring more statistical power to detect associations.

We might also have expected lower genetic variation in populations experiencing drier summer conditions for traits considered to be related to drought resistance such as SLA (Mitchell et al. 2008; but see Reynolds et al. 2018) and δ13C (Warren et al. 2001). In line with our study, Anderegg et al. (2021) found no differences in phenotypic variances for traits related to leaf and stem robustness and allocation along an aridity gradient in eucalypts. Investigating how the genetic variation of drought-resistance traits is impacted by intense drought conditions is worth addressing in face of climate change, as a lack of association may reflect a limited adaptive potential for these traits. However, genotype-by-environment interactions but also insufficient statistical power is likely to explain the lack of detected associations for these traits in our study since only a small proportion of their variation is genetically determined (Alía et al. 2014, Corcuera et al. 2010, \({H}_{SLA}^{2} \sim 0.09\) and \({H}_{{\delta }_{C}^{13}}^{2} \sim\) 0.09 in our study; Table S20), resulting in smaller effect sizes that are more difficult to detect.

Importantly, theoretical work suggests that much of the genetic variation associated with a trait is likely maintained by pleiotropic effects, which are independent of the selection on that trait, implying that stabilizing selection can only act on a reduced number of independent dimensions in the trait space (Barton 1990, Walsh and Lynch 2018). As we used univariate models, we cannot exclude that the associations detected for early height growth originate from genetic correlation with other traits under selection, or that the lack of association with other traits (notably functional traits such as δ13C) does not originate from genetic constraints (Walsh and Blows 2009). For example, in maritime pine, trait canalization and genetic constraints may explain low quantitative genetic differentiation for hydraulic traits (e.g., P50, the xylem pressure inducing 50% loss of hydraulic conductance; Lamy et al. 2014). In this line, Benavides et al. (2021) found higher trait covariance in Scots pine populations under harsher environments, thus suggesting that strong selection pressures may not only reduce the genetic variance of individual traits but may also generate more coordinated and efficient phenotypes ('tighter' phenotypes), a process that deserves further investigation.

Conclusion

Our manuscript contributes to the current debate on the maintenance of quantitative genetic variation within populations by providing empirical support for the role of natural selection in decreasing genetic variation for a key trait of early survival in forest trees. Indeed, our results consistently showed that genetic variation for early height growth is lower in maritime pine populations experiencing colder winters (i.e., experiencing stronger climate-related selection). Surprisingly, we did not find any association between spatial or temporal environmental heterogeneity and within-population genetic variation for any trait; whether for technical reasons (e.g., sample size, spatial scale considered) or for genuine biological reasons (e.g., too low migration, ontogenic stage), it would be worth further exploration. Importantly, this work contributes to understanding how evolutionary forces shape genetic variation in key adaptive traits within populations and ultimately how populations adapt to their local environment. However, whether the reduction in quantitative genetic variation potentially induced by cold temperatures has significant evolutionary implications for the adaptive capacity of populations remains unknown and would require more prediction-oriented studies.