Genomic selection of juvenile height across a single-generational gap in Douglas-fir

Thistlethwaite, Frances R.; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; Stoehr, Michael U.; El-Kassaby, Yousry A.

doi:10.1038/s41437-018-0172-0

Download PDF

Article
Open access
Published: 10 January 2019

Genomic selection of juvenile height across a single-generational gap in Douglas-fir

Heredity volume 122, pages 848–863 (2019)Cite this article

2830 Accesses
19 Citations
9 Altmetric
Metrics details

Subjects

Abstract

Here, we perform cross-generational GS analysis on coastal Douglas-fir (Pseudotsuga menziesii), reflecting trans-generational selective breeding application. A total of 1321 trees, representing 37 full-sib F₁ families from 3 environments in British Columbia, Canada, were used as the training population for (1) EBVs (estimated breeding values) of juvenile height (HTJ) in the F₁ generation predicting genomic EBVs of HTJ of 136 individuals in the F₂ generation, (2) deregressed EBVs of F₁ HTJ predicting deregressed genomic EBVs of F₂ HTJ, (3) F₁ mature height (HT35) predicting HTJ EBVs in F₂, and (4) deregressed F₁ HT35 predicting genomic deregressed HTJ EBVs in F₂. Ridge regression best linear unbiased predictor (RR-BLUP), generalized ridge regression (GRR), and Bayes-B GS methods were used and compared to pedigree-based (ABLUP) predictions. GS accuracies for scenarios 1 (0.92, 0.91, and 0.91) and 3 (0.57, 0.56, and 0.58) were similar to their ABLUP counterparts (0.92 and 0.60, respectively) (using RR-BLUP, GRR, and Bayes-B). Results using deregressed values fell dramatically for both scenarios 2 and 4 which approached zero in many cases. Cross-generational GS validation of juvenile height in Douglas-fir produced predictive accuracies almost as high as that of ABLUP. Without capturing LD, GS cannot surpass the prediction of ABLUP. Here we tracked pedigree relatedness between training and validation sets. More markers or improved distribution of markers are required to capture LD in Douglas-fir. This is essential for accurate forward selection among siblings as markers that track pedigree are of little use for forward selection of individuals within controlled pollinated families.

Single-step genomic prediction of Eucalyptus dunnii using different identity-by-descent and identity-by-state relationship matrices

Article 18 June 2021

Esteban J. Jurcic, Pamela V. Villalba, … Eduardo P. Cappa

Improving lodgepole pine genomic evaluation using spatial correlation structure and SNP selection with single-step GBLUP

Article 18 February 2022

Eduardo P. Cappa, Blaise Ratcliffe, … Yousry A. El-Kassaby

Adaptation to local climate in multi-trait space: evidence from silver fir (Abies alba Mill.) populations across a heterogeneous environment

Article 10 June 2019

Katalin Csilléry, Otso Ovaskainen, … Felix Gugerli

Introduction

There is a strong drive to incorporate genomic selection (GS) methodologies, as first proposed by Meuwissen et al. 2001, into forest tree selective breeding. With a proliferation of genomic technologies and a steady decline in genotyping costs (Heffner et al. 2010; Thomson 2014), breeders are taking full advantage of the availability of large SNP data sets. With these large SNP sets, it is envisaged that linkage disequilibrium (LD) between the markers and most sources of variation for valued complex phenotypes can be tracked. In doing so, capturing more variance than the well-established marker-assisted selection (MAS), which relies on fewer, large effect quantitative trait loci (QTLs) (El-Kassaby 1982). Genetically complex traits (such as height, growth, and wood quality) are now amenable to selection with the use of dense marker data. Given this statistical advantage, it is anticipated that GS may be implemented into tree selective breeding, as it has been done in livestock breeding (Van Eenennaam et al. 2014), resulting in higher genetic gain per unit time for traits of interest. This will largely be achieved through the reduction of trait evaluation time for such late expressing traits, leading to a faster turn-over in breeding generations, a significant time-sink in current breeding programs (Hayes et al. 2009; Heffner et al. 2010). Furthermore, breeding programs will become more dynamic as they will be able to ensure adaptation to capricious influences such as climate change and biotic disturbance in less time (Grattapaglia 2014).

Early deterministic simulations by Grattapaglia and Resende (2011) of GS that modeled forest tree species reported promising results. Following this, several experimental investigations have built upon this concept with varying success (Bartholomé et al. 2016; Beaulieu et al. 2014a, 2014b; Fuentes-Utrilla et al. 2017; Gamal El-Dien et al. 2015; Grattapaglia 2014; Isik et al. 2016; Müller et al. 2017a; Ratcliffe et al. 2015; Resende et al. 2012a, 2012b, 2012c, 2017; Tan et al. 2017; Thistlethwaite et al. 2017). In general, the following phases are involved in the GS process: (1) the genetic and phenotypic evaluation of a random subset of samples from within a selected population forming the training set from the tree breeding population under investigation; (2) creating a predictive model using this data, in which alleles at all marker loci have their effects simultaneously estimated; (3) implementing a validation or cross-validation process to test the developed models’ robustness; and (4) genomic prediction on a different subset of individuals from the same breeding population and selection of candidates from this population for next-generation breeding based on their genomic estimated breeding values (GEBVs) (Meuwissen et al. 2001; Grattapaglia 2014). The repercussion of this is a paradigm shift, in which the model unit of these breeding analyses shifts from being the line of descent to the allele.

Factors influencing the success of GS are varied, but one, which is entirely at the discretion of the investigator, is the statistical prediction method. Many methods have been proposed for GS and can be differentiated largely by their a priori assumptions of variance distribution. Ridge regression best linear unbiased prediction (RR-BLUP) and generalized ridge regression (GRR) have been selected for their computational efficiency. RR-BLUP assumes marker effects to be normally distributed (mean = 0) and to have equal variance. Conversely, GRR allows for heterogeneous variances and so employs a step that sets marker-specific shrinkage parameters on BLUP. In addition to these two methods, Bayes-B will also be implemented. The former have been shown to be highly sensitive to genetic relationships, leading to the accuracy of GEBV predictions based on markers tracking this relationship rather than LD (Habier et al. 2007; Thistlethwaite et al. 2017). This conclusion led Habier et al. (2007) to recommend the use of Bayes-B as an alternative.

Other features which independently and collectively alter the success of GS include: (1) The extent of LD between markers and QTL, (2) density of marker coverage; (3) training population size; (4) relatedness of samples (Habier et al. 2007), (5) trait heritability; (6) genetic architecture of trait (number of loci and effect size) (Hayes et al. 2009; Grattapaglia 2014); and (7) effective population size (N_e) (Lorenz et al. 2011). Again features 2, 3, and 7 are (to a certain extent) under the control of the experimenter.

Regarding relatedness between individuals (feature 4), the parent average effect on EBVs was removed in this study following Garrick et al. (2009). The deregressed information is used as an alternative to EBVs in genomic selection to reduce the bias and increase reliability. Using the BLUP procedure shrinks both individual and progeny information toward the parent average EBV, and so there is motivation to remove this effect for the following rationale: (1) those records with an EBV but no individual or progeny information do not contribute to genomic estimation. Their EBV is calculated purely on the parental average and therefore do not afford any additional information besides that of the parental EBVs and genotypes and (2) to avoid the shrinking of major effects that are potentially segregating in the parents. Without deregression the EBVs of the offspring will all be shrunk toward the parent average, regardless of whether they inherit the favorable or unfavorable allele (Garrick et al. 2009).

The extent of marker-QTL LD is dictated by the relationship between N_e and the number of markers used (Grattapaglia and Resende 2011). In accordance with established theory, in a population with low N_e, genetic drift has a stronger effect, resulting in an increase in non-random association of markers and QTL (LD). Thus, in this situation fewer markers are required to capture the variation of the trait of interest. In outbreeding populations, recombination negates LD and so, long-ranging LD is lost over generations. Because of this, models must be continually updated according to the distance between generations. Wu et al. (2015) found that increasing the generational gap between training and testing populations, reduced predictive accuracy of GS. Two factors can be garnered from this information, and their interaction is thought to have the highest impact on the success of GS in trees: (1) N_e must dictate the scale of marker density and (2) LD can be controlled through choosing the N_e (Grattapaglia 2014). Larger training population sizes were shown to increase predictive accuracy up to a threshold of around 1000 individuals, beyond which there were negligible gains (Grattapaglia and Resende 2011). Although Meuwissen et al. (2001) reported findings that suggested larger training populations negated somewhat the effects of low trait heritability, this had little effect on GS predictive accuracy of more complex traits (50–100 + QTL) (Grattapaglia and Resende 2011).

Substantial work has been carried out on testing the efficacy of GS in forest tree selective breeding. Validation populations (or cross-validation) have largely been comprised of individuals from within the same generation as the training population, aside from work performed by Isik et al. (2016) and Bartholomé et al. (2016). This practice has resulted in effectively side-stepping the issue of the generational gap. To address this issue, we performed a cross-generational GS analysis on coastal Douglas-fir (Pseudotsuga menziesii Mirb. (Franco)). The training population, on which the predictive models were trained, was collected from the parental generation (F₁). The set was composed of 1321 randomly selected, 38-year-old trees, representing 37 full-sib families with replications over three environments in British Columbia (Canada) (Thistlethwaite et al. 2017). The validation population was collected from the progeny of the parental generation (F₂, n = 136) with shared pedigree to the studied 37 full-sib families. A total of 69,551 SNPs were used in the GS analyses to produce a predictive model for juvenile height, which was compared to pedigree-based (ABLUP) predictions.

Methods

Experimental population

Samples from a 38-year-old replicated coastal Douglas-fir (Pseudotsuga menziesii Mirb. (Franco)) progeny testing population (F₁) was used as the training set to develop the GS predictive models. The breeding program was established by the Ministry of Forests, Lands and Natural Resource Operations of British Columbia (BC), Canada in 1975. A total of 37 of 165 full-sib families were randomly selected for sampling from 3 environments in British Columbia, Canada (Adams (Lat. 50° 24′ 42″ N, Long. 126° 09′ 37″ W, Elev. 576 mas), Fleet River (Lat. 48° 39′ 25″ N, Long. 128° 05′ 05″ W, Elev. 561 mas), and Lost Creek (Lat. 49° 22′ 15″ N, Long. 122° 14′ 07″ W, Elev. 424 mas)). A total of 1321 individuals (N_e ≈ 21) from these 3 environments (Adams: N = 449, Fleet River: N = 441, Lost Creek: N = 431) were selected for genotyping and to train the GS models and ABLUP validation model. The N_e was estimated using a program developed by Dr. Milan Lstiburek (Faculty of Forestry and Wood Sciences, Czech University of Life Sciences Prague, Prague, Czech Republic) based on the status number concept of Lindgren et al. (1997).

The validation population (F₂) is represented by 247 samples from control pollinated offspring derived from the 37 full-sib families described above with offspring from an additional 5 full-sib families selected from the same progeny testing population (F₁) (42 F₂ families total). ABLUP accuracies were derived using these 247 samples. Due to missing genotype information some F₂ samples were discarded for GS analysis. The remaining 136 samples were used as the GS validation population, representing 17 parents located at Jordan River, BC (Lat. 48° 25′ 52.6N, Long. 124° 02′ 46.2W, Elev. 150 mas) and established in 2003.

In order to derive best possible estimates for the EBVs, an increased number of individuals (total N = 36,311) was used to provide as much information as possible when fitting the ABLUP model in ASReml 4.0 (Gilmour et al. 2009). Information was used from 11 environments of the 38-year-old progeny testing population (F₁) (N = 33,931), plus their wild progenitors (N = 108) as described previously (Yanchuk 1996), an ungenotyped replicate of the F₂ Jordan River validation population (N = 2025) (North Arm, Lat. 48° 50′ 41.7″ N, Long. 124° 06′ 34.8″ W), and the Jordan River environment itself (N = 247) were also incorporated into the analysis (total N = 36,311). The EBVs of a genotyped subset were used in the training of all models (Table 1). The “original” EBVs of the genotyped subset from Jordan River were used to validate each model.

Table 1 Summary of the number of individuals per environment, and to which models they contributed

Full size table

Phenotyping, deregression, tissue sampling, DNA extraction, and genotyping

Early-rotation (juvenile height) (1988 for the training population and 2010 for the validation population) height measurements of the studied trees were recorded (HTJ: in cm). EBVs for HTJ were obtained in ASReml 4.0 (Gilmour et al. 2009) and used as phenotypes for the genomic prediction analysis. In addition, the EBVs were deregressed and parental averages removed, using the method “Removing parent average effects” proposed by Garrick et al. (2009). The resulting deregressed estimated breeding values (DEBVs) were used as alternative phenotypes for GS analysis. This type of deregressed data can be obtained by approximating and back-solving the evaluation equations. The following equations were solved for each individual tree:

$$\left[ {\begin{array}{*{20}{c}} {{\mathbf{Z}}\prime _{{\mathrm{PA}}}{\mathbf{Z}}_{{\mathrm{PA}}} + 4\lambda } & { - 2\lambda } \\ { - 2\lambda } & {{\mathbf{Z}}\prime _{i - {\mathrm{PA}}}{\mathbf{Z}}_{i - {\mathrm{PA}}} + 2\lambda } \end{array}} \right]\left[ {\begin{array}{*{20}{c}} {{\mathbf{PA}}} \\ {{\mathbf{EBV}}} \end{array}} \right] = \left[ {\begin{array}{*{20}{c}} {y_{{\mathrm{PA}}}} \\ {y_{i - {\mathrm{PA}}}} \end{array}} \right]{,}$$

(1)

where PA and EBV represent the parental average and estimated breeding value vectors, respectively; $y_{{\mathrm{PA}}}$ and $y_{i - {\mathrm{PA}}}$ represent information equivalent to the right-hand-side elements referring to the PA and individual respectively; λ = (1 − h²)/h²; ${\mathbf{Z}}\prime _{{\mathrm{PA}}}{\mathbf{Z}}_{{\mathrm{PA}}}$ and ${\mathbf{Z}}\prime _{i - {\mathrm{PA}}}{\mathbf{Z}}_{i - {\mathrm{PA}}}$ express the unknown information content of the parental average, and individual effect without parental average, respectively. These latter terms can be equated by solving firstly Eq. (2) and using the result to solve for Eq. (3):

$${\mathbf{Z}}\prime _{{\mathrm{PA}}}{\mathbf{Z}}_{{\mathrm{PA}}} = \lambda \left( {0.5\alpha - 4} \right) + 0.5\lambda \sqrt {\left( {\alpha ^2 + \frac{{16}}{\delta }} \right)}{,}$$

(2)

$${\mathbf{Z}}\prime _{i - {\mathrm{PA}}}{\mathbf{Z}}_{i - {\mathrm{PA}}} = \delta {\mathbf{Z}}\prime _{{\mathrm{PA}}}{\mathbf{Z}}_{{\mathrm{PA}}} + 2\lambda (2\delta - 1){,}$$

(3)

where α = 1/(0.5 − $r_{{\mathrm{PA}}}^2$); and δ = (0.5 − $r_{{\mathrm{PA}}}^2$)/(1 − $r_i^2$). $r_{{\mathrm{PA}}}^2$ is defined as the reliability of the PA for individual i with parents “sire” and “dam”, and can be calculated by: $r_{{\mathrm{PA}}}^2 = \frac{{r_{sire}^2 + r_{dam}^2}}{4}$. While $r_i^2$ the reliability of the EBV, was calculated as the square of the correlation between the true and predicted breeding values (r_i) according to Gilmour et al. (2009):

$$r_i = \sqrt {1 - \frac{{s_i^2}}{{(1 + f_i)\sigma _A^2}}}$$

(4)

where $s_i^2$ is the prediction error variance for individual i; f_i is the inbreeding coefficient for individual i calculated in ASReml 4.0 (Gilmour et al. 2009); and $\sigma _A^2$ is the additive genetic variance.

We can now complete and solve the coefficient matrix (Eq. (1)) using Eqs. (2) and (3), and multiply this by the vectors PA and EBV. The deregressed information regarding the individual without PA effects is obtained using this simplified formula:

$$\frac{{y_{i - {\mathrm{PA}}}}}{{{\mathbf{Z}}\prime _{i - {\mathrm{PA}}}{\mathbf{Z}}_{i - {\mathrm{PA}}}}}$$

(5)

Cambial tissue was collected from the mature trees of the training population; this was an elegant solution to overcome the difficulty of obtaining foliage tissue from older/taller trees. Using a hammer and punch tool (approx. 2 cm diameter) two small circular disks of bark, cambium and developing tissue were removed from each tree. Once separated, the cambial tissue was immediately stored in a 2 ml collection tube with 1 ml of storage buffer (10 mM EDTA pH 8.0, 10 mM Na₂SO₃), these were kept at 4 °C until DNA extraction. Foliage DNA extraction is easier using a standard protocol, therefore leaf bud tissue was collected from the juvenile trees. Two samples from each juvenile tree were taken and stored in the same way as the cambial tissue (in 1 ml of storage buffer and kept at 4 °C). The same DNA extraction protocol was used on both forms of the tissue samples. This was a modified protocol developed by Ivanova et al. 2008 (R. Whetten, unpublished, North Carolina State University, personal communications). Whole exome capture genotyping was carried out in a commercial facility (RAPiD Genomics^©, FL, USA), with probes designed using the available Douglas-fir transcriptome assembly (Howe et al. 2013). For further details on the genotyping process see Thistlethwaite et al. (2017), both the training and validation populations were genotyped at the same time using the same procedure. For more information regarding exome capture, see Neves et al. (2013).

EBV prediction and accuracy

ASReml 4.0 (Gilmour et al. 2009) was used to fit EBVs, using information from the 11 F₁ parental environments, their parents (P₀) and the 2 F₂ juvenile environments (total N = 36,311) (Table 1). As environmental effects are an important consideration in forestry (Cappa et al. 2016), and to account for site (environment) and age differences, a linear mixed model analysis was carried out,

$$y = {\boldsymbol{X}}\beta + {\boldsymbol{Z}}_1a + {\boldsymbol{Z}}_2sa + {\boldsymbol{Z}}_3s(rep) + {\boldsymbol{Z}}_4sf + {\boldsymbol{Z}}_5f + e{,}$$

(6)

where y is the phenotypic trait measurement; β is a vector of fixed effects (i.e., mean, site, and age effects); a is a vector of individual random additive effects following ~N(0, Aσ_a²); sa is a site × additive genetic interaction following ~N(0, Iσ_sa²); s(rep) is a vector of the block effect nested within site following ~N(0, Iσ_s(rep)²); sf is a random effect site × family interaction following ~N(0, Iσ_sf²); f is the effect of family and following ~N(0, Iσ_f²); and e is the random residual effect following ~N(0, Iσ_e²); X and Z_1-5 are incidence matrices assigning fixed and random effects to each observation at vector y; lastly I is the identity matrix and A the average numerator relationship matrix (Wright 1922). We chose to use a common variance for all environments since this is the most parsimonious model when using a large number of environments. This avoids over-fitting the model since the number of parameters increases much faster than the number of environments (Isik et al. 2017). Theoretical accuracy of the EBVs ($\hat r$) was calculated following Dutkowski et al. (2002).

$$\hat r = \sqrt {1 - \frac{{SE_i^2}}{{(1 + F_i)\hat \sigma _a^2}}}{,}$$

(7)

where SE_i is the standard error of breeding value, and F_i is the inbreeding coefficient of the ith individual. Narrow-sense heritability was calculated as h² = σ_a²/(σ_a² + σ_sa² + σ_sf² + σ_f² + σ_e²), where σ_a², σ_sa², σ_sf², σ_f², and σ_e² are the variances of additive genetic, site × additive genetic, site × family, family, and residual effects, respectively.

ABLUP validation was carried out in ASReml R v4.1, predicting the breeding values of the validation population using an expected relationship matrix (A) based on pedigree information. A tenfold validation approach was used. Briefly, samples from the F₁ parental generation at Adams, Fleet River, and Lost Creek (N = 1321) were randomly partitioned into ten training subsets. Nine of these subsets (approximately 90% of the F₁ samples) were used as the training set, on which the ABLUP model would be trained to estimate breeding values. On the basis of this model training, EBVs of the validation set were predicted. The validation set was composed of the 136 individuals from the F₂ Jordan River environment. This was repeated ten times until all F₁ subsets had been included in the training set. Then the whole process was repeated ten times, randomly assigning the training subsets. Prediction accuracy was measured as the correlation between the predicted EBVs from the cross-validation, and their original EBVs calculated using all possible pedigree information. In addition, the predictive ability was calculated as the correlation between predicted EBVs and actual height phenotypes.

Genomic selection analysis

Three statistical methods were used to perform genomic selection: RR-BLUP, GRR, and Bayes-B (Lorenz et al. 2011). Four GS analyses were performed: (1) models were trained on EBVs for juvenile height of the F₁ trees, GEBVs for height of the F₂ validation set were predicted (HTJ EBVs → HTJ GEBVs); (2) models were trained on DEBVs for juvenile height of the F₁ trees, genomic estimates of deregressed breeding values (GDEBVs) for height of the F₂ validation set were predicted (HTJ DEBVS → HTJ GDEBVs); (3) GS models were trained on EBVs of mature height (age 35) of the F₁ samples and GEBVs of the F₂ validation set were predicted and correlated with their juvenile EBVs to ascertain any relationship (HT35 EBVs → HTJ GEBVs); (4) models were trained on DEBVs for mature height of the F₁ samples, GDEBVs of the F₂ validation set were predicted and correlated with their juvenile DEBVs (HT35 DEBVs → HTJ GDEBVs). A tenfold validation process repeated ten times, was again used to randomly select individuals from the F₁ generation to construct the training set of the models. Prediction accuracy for each model in all four analyses was calculated as the mean of the replications of the Pearson product-moment correlation between the original EBVs (as calculated with all pedigree information N = 36,311) for HTJ of the 136 F₂ validation trees from Jordan River and their predicted GEBVs. Alternatively, using DEBVs to train the models, the prediction accuracy is the Pearson product-moment correlation between DEBVs of the validation set and their predicted GDEBVs. Similarly to the ABLUP investigation, the predictive ability was calculated as the correlation between predicted GEBVs and actual height phenotypes.

Ridge regression best linear unbiased predictor

RR-BLUP (Whittaker et al. 2000) was implemented using the R package “bigRR” (Shen et al. 2014). The predicted heights (or GEBVs) are obtained by the summing of all the marker effects of an individual tree. Marker effects were estimated as in Henderson (1976), under the following mixed model:

$${\boldsymbol{y}}_{{\boldsymbol{D}}({\boldsymbol{EBV}})} = \vec 1{\boldsymbol{\mu }} + {\boldsymbol{Zg}} + {\boldsymbol{e}}{,}$$

(8)

where ${\boldsymbol{y}}_{{\boldsymbol{D}}({\boldsymbol{EBV}})}$ is the vector of n tree height records (EBVs or DEBVs in this case), $\vec 1$ is a vector of 1, μ is an intercept, g is the vector of random marker effects, Z is the design matrix for the random marker effects, and e is the residual vector for random effects. In RR-BLUP the residuals and marker effects follow normal distributions with constant variance, i.e., e ~ N (0, I$\sigma _e^2$) and g ~ N(0, I$\sigma _g^2$), where I is an identity matrix. The solution for the marker effects is given by the following equation:

$${\hat{\boldsymbol {g}}} = \left( {{\boldsymbol{Z}}\prime {\boldsymbol{Z}} +{\lambda} {\boldsymbol{I}}} \right)^{ - 1}{\boldsymbol{Z}}\prime {\boldsymbol{y}}$$

(9)

where ʎ = $\sigma _e^2/\sigma _g^2$ is the ridge penalization parameter. An assumption of this method is that all marker effects are distributed equally, and therefore all effects are equally shrunk towards zero.

Generalized ridge regression

GRR was implemented in the R package “bigRR” (Shen et al. 2014). The first step, in this two-step variable selection method, is to use linear mixed models optimizing ʎ, to estimate marker effects (the same as RR-BLUP). Where it differs from RR-BLUP is in a second step. In which an alternative, marker-specific shrinkage parameter is imposed on the BLUP for ${\hat{\boldsymbol g}}.$

In this heterogeneous error model, ʎI becomes diag(ʎ) in Eq. (4):

$${\hat{\boldsymbol g}} = ({\boldsymbol{Z}}\prime {\boldsymbol{Z}} + diag(\lambda))^{ - 1}{\boldsymbol{Z}}\prime {\boldsymbol{y}}$$

(10)

Here ʎ is a vector of p shrinkage parameters. For the kth element: ʎ$_k = \widehat {\sigma _e^2}/\widehat {\sigma _{gk}^2}$, is the parameter, where $\widehat {\sigma _{gk}^2}$ is the variance of marker effect k ($\widehat {\sigma _{gk}^2} = \hat g_k^2/(1 - h_{kk})$). Where $\hat g$ is the BLUP marker effect (from step 1), and $h_{kk}$ is the effect of the dependant variable on the fitted value for observation k. To wit, $h_{kk}$ represents the diagonal element (n + k) of the influence matrix H = T(T’T)⁻¹T’, and

$$T = \left( {\begin{array}{*{20}{c}} {\vec 1} & Z \\ 0 & {diag(\lambda)} \end{array}} \right)$$

(11)

Bayes-B

Bayesian methods, as first proposed by Meuwissen et al. (2001), seek to relax the assumption that genetic effects are evenly distributed across the genome (as in RR-BLUP). In this analysis, we use Bayes-B, another variable selection method, in which there exists a probability that a marker has no effect (π). This would correspond to a situation where the genetic architecture of the trait was such that genetic variance was present at few, major effect loci only (Heffner et al. 2009; Lorenz et al. 2011). Bayes-B is thought to be a more realistic prior since some genomic regions will be absent of QTL (Heffner et al. 2009). An assumption of this model is that marker effects are normally distributed with zero mean and finite variance. The prior distribution of the marker effect variance, is a mixture of two finite prior densities: var (g) = 0, with probability π; and var (g) ~ χ⁻² (v, S), with probability (1 − π) (Lorenz et al. 2011; Gezan et al. 2017). π is assumed known and specified arbitrarily, the default value of 0.5 was used. The Bayes-B analysis was carried out in the R package BGLR v1.0.4 (Perez and de los Campos 2014), with the Gibbs sampler run for 100,000 iterations and a burn-in of 20,000, with a thinning rate of 100. Default rules of the BGLR R package (Perez and de los Campos 2014) were used for the initial hyper-parameter values.

The data sets supporting the results of this article will be available in the Dryad Digital Repository upon acceptance.

Results

Heritability and EBV accuracy

Juvenile height (HTJ) heritability was estimated using a pedigree-based relationship matrix (ABLUP), including individuals from the 11 parental (F₁) environments, their parents (P₀), and the 2 progeny (F₂) environments. A summary of the contribution of each environment to both the ABLUP and GS models is in Table 1, along with environment heritabilities for HTJ which ranged from 0.09 to 0.39. The overall HTJ heritability estimate was 0.14 (SE 0.025). The average theoretical accuracy for the EBVs of the sampled, genotyped individuals (from 3 parental environments: Adams, Fleet River and Lost Creek; and 1 progeny environment: Jordan River) was 0.68, and 0.61 for the validation environment (Jordan River) alone.

Validation in the progeny generation

ABLUP

The average prediction accuracy for Jordan River EBVs derived from ABLUP was 0.92 (SE 0.001) (Table 2), using a pedigree including genotyped samples only, i.e., the validation set and the F₁ samples from the three environments (Adams, Fleet River, and Lost Creek). Using a larger pedigree based on the full 11 F₁ environments (plus their parents), and two progeny environments, the prediction accuracy becomes 0.95 (SE 0.0005). While the pedigree for the ABLUP analyses included multiple generations, phenotypic information from the F₁ generation only was used to predict the validation (F₂) EBVs in the cross-validation. For comparison with the GS analyses, it is these results that we shall concentrate on. The average predictive ability for HTJ using ABLUP was calculated as the correlation between EBV and juvenile height measurements. The predictive ability of ABLUP in the validation set was calculated as 0.71 (SE 0.003) using a pedigree including genotyped samples only, i.e., the validation set from Jordan River (F₂) and the F₁ samples from the three environments: Adams, Fleet River, and Lost Creek (Table 3). Higher than the theoretical accuracy (0.61) calculated for the validation environment (Jordan River) alone.

Table 2 Genomic selection analyses of four models using three GS statistical methods (RR-BLUP, GRR, and Bayes-B)

Full size table

Table 3 The corresponding predictive abilities for GS analyses in Table 2, calculated as the correlation between the raw phenotype (juvenile height: HTJ) and their genomic estimated breeding values (GEBVs) or deregressed genomic estimated breeding values (GDEBVs)

Full size table

GS Scenario 1: HTJ EBVs → HTJ GEBVs

Prediction accuracies for GEBVs derived from RR-BLUP, GRR, and Bayes-B are shown in Fig. 1. In the first cross-generational GS analysis, GS prediction accuracies for validation were determined by the correlation between EBV and GEBVs for the F₂ validation (progeny generation) set at Jordan River. GS prediction accuracies with models trained on F₁ EBVs were very similar over all GS methods used (Table 2), the average was 0.91. Their corresponding predictive abilities were also similar to each other with an average of 0.43, somewhat lower than that of ABLUP despite similar prediction accuracies (Table 3).

As is evident from the Fig. 1a–c, there is a pattern of distinct grouping characterized by those individuals with EBV over 20 and those below. We suspected this was the major factor in causing such high prediction accuracies. To this end, we further analyzed these groups, or clusters, separately, each with 3 different training set combinations represented by indicies 1-3. EBV<20¹, EBV < 20², and EBV < 20³ represent the analysis of validation individuals within the lower cluster, using: ¹all genotyped F₁ individuals (1321 trees from Adams, Fleet River, and Lost Creek) as the training set; ² all genotyped F₁ individuals minus the parents of the high cluster as the training set (total = 1104); and ³using only their parents from the F₁ generation as the training set (total = 132). Similarly EBV > 20¹ represents the analysis of validation individuals within the higher cluster (EBV > 20), using all genotyped F₁ individuals as the training set (total = 1321); EBV > 20² is the analysis of validation individuals within the high cluster, using all genotyped F₁ individuals minus the parents of the low cluster as the training set (total = 1189); and EBV > 20³ is the analysis of validation individuals within the high cluster, using only their parents from the F₁ generation as the training set (total = 217).

We subsequently found more limited correlations between EBVs and GEBVs within these clusters. When analyzed separately the lower cluster, with individuals with EBVs lower than 20 (hereafter referred to as EBV < 20), observed similar yet moderate predictive accuracies across all training set combinations and GS statistical methods (Table 2), obtaining an average of 0.42. GS method RR-BLUP and training set EBV < 20³ performed only slightly better than the other combinations. The higher cluster, the group with individuals with EBV over 20 (hereafter referred to as EBV > 20), observed markedly low prediction accuracies for EBV > 20¹ and EBV > 20² (average: −0.05). Yet with a training set comprised only of the parents of this cluster (EBV > 20³), that rose to an average of 0.68 over all GS methods (Table 2). Bayes-B, in this case, outperformed the other GS methods significantly, and indeed provided the best prediction accuracy in this investigation (0.99), greater than that of the ABLUP model.

GS Scenario 2: HTJ DEBVs → HTJ GDEBVs

The second GS analysis used the correlation between DEBVs and GDEBVs, of the F₂ validation set at Jordan River, as an indication of predictive accuracy. Deregression was carried out using the procedure of Garrick et al. (2009), to account for family means and their effect on EBVs. The resulting deregressed EBVs (DEBVs) contain information regarding individuals only, without influence from parental BVs. Model accuracy fell dramatically when trained on DEBVs of the F₁ training set rather than on EBVs, 0.10 (SE 0.008) for RR-BLUP (Fig. 1d), 0.05 (SE 0.007) for GRR (Fig. 1e), and 0.48 (SE 0.002) for Bayes-B (Fig. 1f). The Bayes-B analysis being noticeably higher than the other two analyses due to the aforementioned clustering structure of the validation set. Similarly, the predictive abilities of these three models were much lower for RR-BLUP and GRR and moderate for Bayes-B: 0.06 (SE 0.008), 0.009 (SE 0.007), and 0.40 (SE 0.003), respectively (Table 3). In this scenario, Bayes-B seemed to out-perform the other two GS models, however, the appearance of clustering in Fig. 1f called for further investigation.

Again, separate analyses were carried out according to the grouping of individuals mentioned previously. Although it is not immediately obvious in Fig. 1d, e, there is the emergence of a grouping pattern in Fig. 1f. The average predictive accuracy for EBV < 20 across all training set combinations and GS methods was −0.09 and the average predictive ability for EBV < 20 was −0.19. For EBV > 20 the average predictive accuracy of the training set combinations EBV > 20¹ and EBV > 20² was 0.17. This contrasts the results using only the parents of this high cluster as the training set, which gives an average predictive accuracy of −0.12 (Table 2). The average predictive ability for EBV > 20 was 0.18 (Table 3).

GS Scenario 3: HT35 EBVs → HTJ GEBVs

The third GS analysis, using EBVs for mature height (age 35) of the F₁ samples as the training set for the models, and juvenile height GEBVs of the F₂ generation at Jordan River as the validation set. The prediction accuracies achieved were similar between all three GS models, with an average of 0.57 (Fig. 1g–i). They were comparable but lower than the result of the ABLUP analysis in which HT35 was used to predict HTJ EBVs (0.60, SE 0.010) (Table 2). Likewise, their predictive abilities were generally quite similar to that of the ABLUP analysis (0.24, SE 0.005) with an average of 0.21 across all GS methods (Table 3):

The appearance of groups again appears in Fig. 1g–i as a result of individual EBV in the validation set. For the group EBV < 20 the prediction accuracy was on average 0.12 for EBV < 20¹ and EBV < 20². However, this increased to an average of 0.37 using a training set made up of the parents of the low cluster only. Prediction accuracy for the EBV > 20 cluster was more consistent across all training sets and GS methods (Table 2), with an average of −0.25. Although in this latter case the predictive accuracy correlation is driven by the presence of two smaller sub-groups within the EBV > 20 group. These subgroups have limited structure on their own.

GS Scenario 4: HT35 DEBVs → HTJ GDEBVs

Finally in the fourth analysis, using DEBVs for mature height of the F₁ samples as the training set for the models, and GDEBVs for juvenile height of the F₂ generation at Jordan River as the validation set; the average prediction accuracy or correlation between DEBVs and GDEBVs was −0.05 across all GS methods (Fig. 1j–l). The average predictive ability for this scenario was −0.09. All results were much lower than those for the ABLUP analysis for HT35 predicting HTJ EBVs (Tables 2 and 3).

The same groupings as before were tested for their prediction accuracies within the fourth analysis. The predictive accuracies for EBV < 20 were generally very low across all training set combinations and GS methods. The average for EBV < 20¹ and EBV < 20² was −0.06. With the direction of the linear relationship changing when only the EBV < 20 parents made up the training set (Table 2). The average for this training set combination over all GS methods was 0.06. Their average predictive abilities were: −0.16 for EBV < 20¹ and EBV < 20²; with a slight drop in predictive ability in EBV < 20³ to −0.04. For EBV > 20 the predictive accuracies were similar in magnitude for all training set combinations and GS methods, with an average of 0.02. However, whilst RR-BLUP and GRR gave slight positive correlations, Bayes-B results were all negative in their direction. All within-cluster results were close to zero for this scenario, although the variation in the sign of the linear relationship causes some doubt as to their reliability. The predictive abilities for EBV > 20 were on average 0.08, predictably with the limited training set of EBV > 20³ having slightly higher predictive abilities than the other training sets (for RR-BLUP and GRR only).

Discussion

Heritability of juvenile height

The estimated heritability of juvenile height was substantially low (0.14), but only slightly lower than previous studies suggest for Douglas-fir given the age of the trees used in the estimation (Yeh and Heaman 1982; Dean and Stonecypher 2006; Ukrainetz et al. 2008; Thistlethwaite et al. 2017). However, it should be noted that information from many more environments was used to estimate this heritability (N = 36,311), increasing the genotype × environmental interaction, effectively shrinking the heritability. Using only the 1321 F₁ trees, the heritability estimate is 0.17. The effect of heritability appears to be minimal in this case, since high predictive accuracies were obtained for validation models for HTJ predicting HTJ: 0.92 for ABLUP, 0.92, 0.91, and 0.91 for RR-BLUP, GRR, and Bayes-B, respectively (Fig. 1a–c), using EBVs as the GS model input; and for HT35 predicting HTJ: 0.60 for ABLUP, 0.57, 0.56, and 0.58 for RR-BLUP, GRR, and Bayes-B, respectively (Fig. 1g–i). The negligible effect of heritability in this instance is likely a consequence of the large sample size and low N_e used in the present investigation (Meuwissen et al. 2001).

Pedigree vs. marker-based models

Although some results were similar, most predictive accuracies and especially predictive abilities for GS models trained on EBVs were still lower than those of ABLUP (Tables 2 and 3), a common issue in forestry (Bartholomé et al. 2016). It is thought that since the Douglas-fir genome is so exceedingly large and complex (Neale et al. 2017), many more markers will be needed in order to track LD between sources of variation and markers (Thistlethwaite et al. 2017). Simulations have yielded evidence that GS prediction accuracy increases markedly with marker density at least up to 8N_e/Morgan (Solberg et al. 2008). In this study, we have been moderately successful in capturing both contemporary and historical pedigree information. Although this is somewhat driven by groupings in the validation generation (F₂) characterized by high and low juvenile height values. Further investigation led to the discovery that the ultimate cause of this clustering was familial grouping. Each cluster contains whole families which can be traced back to distinct parents in the previous generation (F₁). There are no parents who are represented in both groups. In addition to this, the average heights and EBVs of the parents of the “high” cluster (EBV > 20) were 765.04 cm and 60.70, respectively; and are indeed higher than those parents of the “low” cluster (EBV < 20), 709.88 cm and 0.81, respectively.

Analysis of each cluster provided mixed results. For those scenarios in which the GS models were trained on EBV data (scenarios 1 and 3), the within-cluster results were always lower than the overall predictive accuracy (and ABLUP predictive accuracy) with one exception. Convincing evidence that relationship tracking was the major driving force behind the strong overall correlations. Again within these two scenarios, the training set composition had an effect. Unsurprisingly the analyses with training sets comprised of parents of the cluster in question only, generally gave higher prediction accuracies due to the high relationship between the two sets. Indeed the only scenario in which the GS prediction accuracy surpassed the ABLUP prediction accuracy was for the correlation between HTJ EBVs and HTJ GEBVs with the limited training set EBV > 20³ and using Bayes-B as the GS method (0.99 vs. 0.92 for ABLUP). Although in some cases, the effect of training set composition was minimal (scenario 1: EBV < 20 and, Scenario 3: EBV > 20) (Table 2). Taking into account their clustering, predictive abilities for scenarios 1 and 3 fell far short of their ABLUP predictive abilities (Table 3). With the exception of the correlation between HTJ and HTJ GEBVs, which showed moderate success with a value of 0.59. This yet still falling short of the ABLUP value of 0.71. Within-cluster correlations between HTJ and HT35 GEBVs (scenario 3) were all negative, whilst the overall correlations were marginally positive due to the structure of the data as a whole.

Following the removal of family means by deregression (scenarios 2 and 4), the lack of available marker-QTL LD failed to raise the GS prediction accuracies enough to even approximate those of ABLUP. However, Bayes-B notably performed better than both RR-BLUP and GRR in scenario 2 using the correlation between HTJ DEBVs and HTJ GDEBVs of the whole F₂ validation set as an indication of prediction accuracy (Table 2). A fundamental difference between the GS methods used is that, as opposed to RR-BLUP and GRR, Bayes-B gives different variances to each locus (including zero), therefore it allows for more weight to be put on the causative SNPs. As is evident from the results here, this drives up the prediction accuracy (Meuwissen et al. 2001). No such differences were seen between the GS methods subsequent to cluster analysis in these scenarios. When analyzed to their full extent given the aforementioned data groupings, these deregressed analyses showed a dramatic drop in prediction accuracy. Suggesting the strong correlation was merely driven by tracking family means. Meanwhile, their predictive abilities show some striking trends. In both deregression scenarios (2 and 4) the EBV < 20 cluster has only negative prediction abilities, averaging −0.19 and −0.12 for scenarios 2 and 4, respectively. Conversely, the EBV > 20 cluster has only positive prediction abilities (with one exception), averaging 0.18 and 0.08 for scenarios 2 and 4, respectively. The discrepancy in the direction of the linear relationship casts some doubt on the reliability of the within-cluster results. A likely symptom of the reduced sample size necessitated for these analyses.

Although it is disappointing to not have captured enough LD to raise the prediction accuracy above ABLUP in a diverse validation population, in terms of real-world application there is a positive finding. This use of marker-based selection nevertheless reduces the need for time-intensive practices such as performing specific crosses and building a structured pedigree (El-Kassaby and Lstibůrek 2009; El-Kassaby et al. 2011). Potentially quickening the breeding process, and perhaps off-setting costs involved in genotyping.

GS across generations

Once GS becomes a viable option for tree breeders it will likely, in most cases, be deployed to select progeny of the training population without the need for explicit crosses and a lengthy testing phase. Bearing this in mind, it is important to validate models across generations rather than cross-validate within the same generation, as many have done before.

GS relies on LD between markers and causal gene variants. LD breaks down after every breeding cycle due to recombination. This is especially pertinent to breeding within the forestry sector, for forest tree species have lower observed levels of LD (Neale and Kremer 2011). Given these circumstances, large SNP sets will be required to provide dense coverage of the genome to find LD between SNPs and QTL (Jaramillo-Correa et al. 2015). In a simulation study, it was shown that higher marker densities allowed prediction accuracies to be maintained over longer generational gaps (Müller et al. 2017b). Thus, without addressing this issue, predictive accuracy is expected to fall dramatically (Habier et al. 2007; Atefi et al. 2016).

There are a reported 54,830 gene models in the Douglas-fir genome (Neale et al. 2017). With our 69,551 SNPs, using the juvenile height of an F₁ generation as the training set and an F₂ generation as the validation set, we obtained an overall predictive accuracy of 0.92, 0.91, and 0.91 for RR-BLUP, GRR, and Bayes-B, respectively for juvenile height. This is in line with results from a previous study of GS in Douglas-fir (0.79–0.92) using cross-validation within the same generation (Thistlethwaite et al. 2017).

The results we present here did not show a drop in predictive accuracy as one would expect (with a caveat described later). As has been shown before, prediction accuracy increases when the genetic relationship between training and validation sets is closer (Habier et al., 2007; Lorenz et al. 2012; Sallam et al. 2015). Indeed, it is the case here that our validation set is closely related to the training set by virtue of being their (the training set’s) offspring. Although our results are not as disparate, Bartholomé et al. (2016) observed a similar trend. When progeny validation was used, prediction accuracies of GS methods were higher (0.70 using genomic BLUP (GBLUP), and 0.71 using Bayesian LASSO (B-LASSO)) than when within generation cross-validation was carried out (0.66, for both GBLUP and B-LASSO). With that in mind, there may be some benefit in combining phenotypes across generations which helps to infer the Mendelian sampling term.

However, as is evident from Fig. 1a–c, g–i, two distinct data groups had arisen which had caused the high correlations. These two groups were characterized by (a) those individuals with low juvenile height values (mean = 641.81 cm) and (b) those with high juvenile height values (mean = 696.79 cm). They were later found to be the result of familial clusters, with no overlapping parents from group to group. The estimation of breeding values exacerbated this trend and thus two non-overlapping groups can be seen defined as EBV < 20 (individuals with EBVs less than 20) and EBV > 20 (individuals with EBV over 20).

Given this data structure, further analysis showed that prediction accuracies for scenario 1 EBV < 20 were in fact only moderate, with an average of 0.42 for all GS methods compared to ABLUP (0.92). In the case of EBV > 20 scenario 1 results were close to zero with the exception of EBV > 20³. High prediction accuracies, in this case, were driven by close relationships between the training and validation sets and small sample size.

These results might possibly be improved by re-estimation of EBVs using a realized relationship matrix (G-matrix) rather than the estimated relationship A-matrix used here (Munoz et al. 2014).

Prediction accuracies for deregressed values reported here, align with those reported in an earlier investigation also carried out on Douglas-fir. Thistlethwaite et al. (2017) describe obtaining prediction accuracies for GS models, trained on DEBVs of height at age 12, that were also approximately 0. These models were cross-validated with individuals from within the same generation. Here, using similar parameters but using a cross-generational validation process, we have obtained similar results with a few exceptions. Once data grouping was taken into account, correlations between HTJ DEBVs and HTJ GDEBVs (scenario 2) were low-moderately negative, −0.12 to −0.02 for EBV < 20 and −0.23 to 0.26 for EBV > 20.

Although a lower predictive accuracy for F₂ HTJ was obtained when the GS models were trained on F₁ mature height (age 35) (scenario 3), there was still a significant positive correlation between the two (0.57, 0.56, and 0.58 for RR-BLUP, GRR, and Bayes-B, respectively). These results are lower but similar to findings in Thistlethwaite et al. (2017) where a positive time–time correlation between juvenile (age 12) and mature (age 35) height within the same generation was found (0.71 (SE 0.0004) for both RR-BLUP and GRR). However here again, the groups EBV < 20 and EBV > 20 were the driving force behind these correlations. For scenario 3 EBV < 20 results were likewise reduced, averaging 0.12 for EBV < 20¹ and EBV < 20², and 0.37 for EBV < 20³ across all GS methods. The stronger correlation coming from the increased relationship between the training and validation sets, yet falling quite short compared to ABLUP (0.60). In scenario 3 the EBV > 20 accuracy results show a small-moderate but negative correlation averaging −0.25 across all GS methods. Marker-trait associations vary with tree age, and predictably the ABLUP correlation here was found to be lower than in scenario 1 (0.92) at 0.60. The high-positive EBV > 20³ juvenile–juvenile correlation and small-moderate negative EBV > 20 mature-juvenile correlation, may also be symptoms of the effect of tree age on marker-trait associations (Lerceteau et al. 2001) and as seen here, recombination. These results do not provide a sound basis on which to perform selection decisions. However should there be an improvement in GS accuracy with the re-estimation of EBVs, useful information may still be procured from moderate correlations, for input into early selection decisions.

In the final scenario, deregressed breeding values for mature height in the F₁ generation were used as training data for the GS models. Predictive accuracy fell significantly both for the validation set as a whole (−0.15 to 0.11), and for each of EBV < 20 (−0.08 to 0.07) and EBV > 20 (−0.07 to 0.08).

In the absence of average parental information (i.e., deregressed phenotypes), the ability of the markers to predict phenotypes in the next generation was consistently poor (Table 2). None of the GS analyses in these deregressed scenarios (2 and 4) surpassed or even matched the predictive ability of the ABLUP models, despite having very similar overall prediction accuracies in scenarios 1 and 3 (although this itself can be attributed to the effects of clustering in the validation set). Predictive ability similarly dropped dramatically after deregression and re-adjusting for clustering. The opposing signs of coefficients for the high and low clusters after deregression suggest some unreliability of these values. This trend is reflected in throughout, thus any of these within-cluster GS “accuracies” should be treated with caution.

Main factors that affect GS: relatedness and LD

Whilst some of our observations during this study align with previously published work, the high predictive accuracies using EBVs as the model input are a result of tracking family means as opposed to true LD being captured. GS BLUP methods are robust in most circumstances and perform well; however, it is now known that these methods primarily capture marker derived relatedness more readily than actual LD (Habier et al. 2007; Zhong et al. 2009). We base this viewpoint on trends seen in Thistlethwaite et al. (2017) whereby removing family derived means from trait breeding values, caused virtually no prediction accuracies of GS methods. Simulation evidence suggests that pedigree-based relationships contribute to predictive accuracy for a few generations, especially given low N_e (Müller et al. 2017b), which the progeny validation set presented here certainly does have. In empirical studies, there is also evidence that the relatedness of the training and validation sets has an impact on the accuracy of GS (Resende et al. 2012a; Beaulieu et al. 2014a; Bartholomé et al. 2016; Märtens et al. 2016; Varshney et al. 2017). With higher relatedness between sets producing more accurate predictions. Our training and validation set are only a generation apart; it is therefore likely that their relatedness is the main force behind the GS prediction accuracies. Even though the alternative, more computationally intense GS method of Bayes-B was employed to help overcome this (Habier et al. 2007), we have not been able to resolve any additional LD using this method. In order to “reveal” the effect of LD on GS predictive accuracy, empirical investigations on multiple generations further apart will have to be conducted using much denser SNP genotyping.

Apropos of marker density, we carried out analyses concerning the effect of marker density on the prediction accuracy of GS. We carried out a tenfold cross-validation on HT35 EBVs within the genotyped parental (F₁) generation samples using RR-BLUP. Randomly selected marker sets were chosen that had progressive totals from 200 to 50,000 SNPs. These sets were tested and replicated ten times, with random SNP sampling for each repetition. Our investigations led us to the conclusion that an increase in marker density leads to an increase in GS prediction accuracy. A caveat being that the magnitude of prediction accuracy gains falls to a plateau as the number of makers approaches approximately 15,000 SNPs. This also happens to be the point at which the prediction accuracy is similar to the ABLUP prediction accuracy for HT35 EBVs for these samples. When the same method was employed on deregressed HT35 EBVs, the prediction accuracies fell dramatically for all SNP set totals. We believe that relatedness is the driving force behind these additional GS predictive accuracy results for the HT35 EBVs for a couple of reasons. Firstly, the variance within each SNP set total was modest even though the SNPs were randomly selected. Secondly, the prediction accuracies fell dramatically when the EBVs were deregressed to remove the parental average effects, for the purposes of extricating LD from pedigree. Given this insight, in order to capture short-range marker-QTL LD in conifer species, we recommend the use of deregression to remove parental average effects and high-density marker sets.

The effect of capturing marker derived relatedness may lead to sibling coselection due to an increased correlation between EBVs within families (Wray and Thompson 1990). In the long-run, this will lead to a reduction in genetic variation in subsequent generations, and a loss of potential genetic gain (Hallander 2009). A reduction in Mendelian segregation variance due to inbreeding (sibling coselection) will eventually lead to a decrease in the Mendelian sampling term of each individual, and population-wide additive variance. Presenting a problem for breeders in that the expectation of the rate of genetic gain in a population is proportional to the Mendelian sampling term of selection targets (Woolliams and Thompson 1994; Avendaño et al. 2004).

Breeding programs often implement selection methods that optimize the selection differential whilst constrained by a limit on increasing coancestry. Hallander and Waldmann (2009) tested such methods on a diallel progeny trial of Scots pine (Pinus sylvestris L.) investigating height and stem diameter and found optimum contribution (OC) dynamic selection resulted in the highest genetic gain over other methods (standard restricted selection). One of the main drivers for the success is that the selective advantage in the OC method is the Mendelian sapling term. The OC method maximizes the selection differential between families by using the best estimates of the Mendelian sampling term for each tree when calculating the mating contributions (Avendaño et al. 2004). Selection methods must be considered when using GS results as a basis for selection, to avoid inbreeding and to maintain Mendelian segregation variance.

Conclusions

Our cross-generational GS validation of juvenile height in Douglas-fir provided results that almost matched the ABLUP predictive accuracy. However, we believe that the predictive accuracy is driven by the relatedness between the training and validation sets, and even more so in capturing among-family effects. Whilst this relationship is exploited for selection in current breeding programs, the ultimate aim of using GS should be to capture true LD across populations and traits to uncover presently unknown variation, and possibly unknown traits (Beaulieu et al. 2014a; Grattapaglia 2017). Hence, we degregressed EBVs to tease apart LD from familial relationships, which may subsequently be preferential only in advanced breeding programs (Grattapaglia 2017). While the number of SNPs we have derived from sequence capture represents the highest in forest trees GS studies (Grattapaglia 2017), we have yet to observe enough marker-QTL LD in our GS methods. Many more markers may be required for this to be resolved since LD in Douglas-fir survives only over short distances as in other conifer species (Neale and Savolainen 2004). In addition, multi-generational information may be required in order to evaluate LD in this species. This must be explored further before any such incorporation into applied selective breeding programs is undertaken.

Data availability

The data sets supporting the results of this article are available from the Dryad Digital Repository https://doi.org/10.5061/dryad.8n2d374.

References

Atefi A, Shadparvar AA, Ghavi Hossein-Zadeh N (2016) Comparison of whole genome prediction accuracy across generations using parametric and semi parametric methods. Acta Sci Anim Sci 38:447
Article Google Scholar
Avendanõ S, Woolliams J. A. E, Villanueva B (2004) Mendelian sampling terms as a selective advantage in optimum breeding schemes with restrictions on the rate of inbreeding Genetics Research 83:55–64
Article Google Scholar
Bartholomé J, Van Heerwaarden J, Isik F, Boury C, Vidal M, Plomion C, Bouffier L (2016) Performance of genomic prediction within and across generations in maritime pine. BMC Genom 17:604
Article Google Scholar
Beaulieu J, Doerksen T, Clément S, MacKay J, Bousquet J (2014a) Accuracy of genomic selection models in a large population of open-pollinated families in white spruce. Heredity 113:343–352
Article CAS PubMed PubMed Central Google Scholar
Beaulieu J, Doerksen TK, MacKay J, Rainville A, Bousquet J (2014b) Genomic selection accuracies within and between environments and small breeding groups in white spruce. BMC Genom 15:1048
Article Google Scholar
Cappa EP, Stoehr MU, Xie C-Y, Yanchuk AD (2016) Identification and joint modeling of competition effects and environmental heterogeneity in three Douglas-Fir (Pseudotsuga menziesii var. menziesii) trials. Tree Genet Genomes 12:102
Article Google Scholar
Dean CA, Stonecypher RW (2006) Early Selection of Douglas-Fir across South Central Coastal Oregon, USA. Silvae Genet 55:135–141
Article Google Scholar
Dutkowski GW, Silva JCe, Gilmour AR, Lopez GA (2002) Spatial analysis methods for forest genetic trials. Can J Res 32:2201–2214
Article Google Scholar
El-Kassaby YA (1982) Associations between Allozyme Genotypes and Quantitative Traits in Douglas-Fir [PSEUDOTSUGA MENZIESII (Mirb.) Franco]. Genetics 101:103–115
CAS PubMed PubMed Central Google Scholar
El-Kassaby YA, Cappa EP, Liewlaksaneeyanawin C, Klápšte J, Lstiburek M (2011) Breeding without breeding: is a complete pedigree necessary for efficient breeding? PLoS ONE 6:e25737
Article CAS PubMed PubMed Central Google Scholar
El-Kassaby YA, Lstibůrek M (2009) Breeding without breeding. Genet Res 91:111
Article Google Scholar
Fuentes-Utrilla P, Goswami C, Cottrell JE, Pong-Wong R, Law A, A’Hara SW, Lee SJ, Woolliams JA (2017) QTL analysis and genomic selection using RADseq derived markers in Sitka spruce: the potential utility of within family data. Tree Genet Genomes 13:33
Article Google Scholar
Gamal El-Dien O, Ratcliffe B, Klápště J, Chen C, Porth I, El-Kassaby YA (2015) Prediction accuracies for growth and wood attributes of interior spruce in space using genotyping-by-sequencing. BMC Genom 16:370
Article Google Scholar
Garrick DJ, Taylor JF, Fernando RL (2009) Deregressing estimated breeding values and weighting information for genomic regression analyses. Genet Sel Evol 41:55
Article PubMed PubMed Central Google Scholar
Gezan SA, Osorio LF, Verma S, Whitaker VM (2017) An experimental validation of genomic selection in octoploid strawberry. Hortic Res 4:16070
Article PubMed PubMed Central Google Scholar
Gilmour AR, Gogel BJ, Cullis BR, Thompson R (2009) ASReml Use Guide Release 3:0
Google Scholar
Grattapaglia D (2014) Breeding forest trees by genomic selection: current progress and the way forward. In: Tuberosa R, Graner A, Frison E eds. Genomics of plant genetic resources. Springer Netherlands, Dordrecht, pp 651–682
Chapter Google Scholar
Grattapaglia D (2017) Status and perspectives of genomic selection in forest tree breeding. In: Varshney RK, Roorkiwal M, Sorrells ME eds. Genomic selection for crop improvement. Springer International Publishing, Cham, pp 199–249
Chapter Google Scholar
Grattapaglia D, Resende MDV (2011) Genomic selection in forest tree breeding. Tree Genet Genomes 7:241–255
Article Google Scholar
Habier D, Fernando RL, Dekkers JCM (2007) The impact of genetic relationship information on genome-assisted breeding values. Genetics 177:2389–2397
Article CAS PubMed PubMed Central Google Scholar
Hallander, J (2009). Novel methods for improved tree breeding (Doctoral thesis, Swedish University of Agricultural Sciences, Umea). Retrieved from: Epsilon Open Archive. ISBN 978-91-86195-60-1
Hallander J, Waldmann P (2009) Optimum contribution selection in large general tree breeding populations with an application to Scots pine. Theor Appl Genet 118(6):1133–1142
Article PubMed Google Scholar
Hayes BJ, Bowman PJ, Chamberlain AJ, Goddard ME (2009) Invited review: genomic selection in dairy cattle: progress and challenges. J Dairy Sci 92:433–443
Article CAS PubMed Google Scholar
Heffner EL, Sorrells ME, Jannink J-L (2009) Genomic selection for crop improvement. Crop Sci 49:1
Article CAS Google Scholar
Heffner EL, Lorenz AJ, Jannink J-L, Sorrells ME (2010) Plant breeding with genomic selection: gain per unit time and cost. Crop Sci 50:1681
Article Google Scholar
Henderson CR (1976) A simple method for computing the inverse of a numerator relationship matrix used in prediction of breeding values. Biometrics 32:69
Article Google Scholar
Howe GT, Yu J, Knaus B, Cronn R, Kolpak S, Dolan P, Lorenz WW, Dean JF (2013) A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation. BMC Genom 14:137
Article CAS Google Scholar
Isik F, Bartholomé J, Farjat A, Chancerel E, Raffin A, Sanchez L, Plomion C, Bouffier L (2016) Genomic selection in maritime pine. Plant Sci 242:108–119
Article CAS PubMed Google Scholar
Isik F, Holland J, Maltecca C (2017) Genetic data analysis for plant and animal breeding. Springer, Cham
Book Google Scholar
Ivanova NV, Fazekas AJ, Hebert PDN (2008) Semi-automated, membrane-based protocol for DNA isolation from plants. Plant Mol Biol Rep 26:186–198
Article CAS Google Scholar
Jaramillo-Correa JP, Prunier J, Vázquez-Lobo A, Keller SR, Moreno-Letelier A (2015). Molecular signatures of adaptation and selection in forest trees. In: Advances in botanical research, vol. 74 (ed. Plomion, C., and Adam-Blondon, A.), Elsevier, pp 265–306
Lerceteau E, Szmidt AE, Andersson B (2001) Detection of quantitative trait loci in Pinus sylvestris L. across years. Euphytica 121:117–122
Article Google Scholar
Lorenz AJ, Chao S, Asoro FG, Heffner EL, Hayashi T, Iwata H, Smith KP, Sorrells ME, Jannink J-L (2011). Genomic selection in plant breeding. Advances in agronomy, vol. 110 (ed. Sparks, D.L.) Elsevier, pp 77–123
Lorenz AJ, Smith KP, Jannink J-L (2012) Potential anD Optimization of Genomic Selection for Fusarium Head Blight Resistance in Six-row Barley. Crop Sci 52:1609
Article Google Scholar
Lindgren D, Gea LD, Jefferson PA (1997) Status number for measuring genetic diversity. For Genet 4:762–764
Google Scholar
Märtens K, Hallin J, Warringer J, Liti G, Parts L (2016) Predicting quantitative traits from genome and phenome with near perfect accuracy. Nat Commun 7:11512
Article PubMed PubMed Central Google Scholar
Meuwissen TH, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829
CAS PubMed PubMed Central Google Scholar
Müller BSF, Neves LG, de Almeida Filho JE, Resende MFR, Muñoz PR, dos Santos PET, Filho EP, Kirst M, Grattapaglia D (2017a) Genomic prediction in contrast to a genome-wide association study in explaining heritable variation of complex growth traits in breeding populations of Eucalyptus. BMC Genom 18:524
Article Google Scholar
Müller D, Schopp P, Melchinger AE (2017b) Persistency of prediction accuracy and genetic gain in synthetic populations under recurrent genomic selection. G3 (Bethesda) 7:801–811
Article Google Scholar
Munoz PR, Resende MFR, Huber DA, Quesada T, Resende MDV, Neale DB, Wegrzyn JL, Kirst M, Peter GF (2014) Genomic relationship matrix for correcting pedigree errors in breeding populations: impact on genetic parameters and genomic selection accuracy. Crop Sci 54(3):1115–1123
Article Google Scholar
Neale DB, Kremer A (2011) Forest tree genomics: growing resources and applications. Nat Rev Genet 12:111–122
Article CAS PubMed Google Scholar
Neale DB, Savolainen O (2004) Association genetics of complex traits in conifers. Trends Plant Sci 9:325–330
Article CAS PubMed Google Scholar
Neale DB, McGuire PE, Wheeler NC, Stevens KA, Crepeau MW, Cardeno C, Zimin AV, Puiu D, Pertea GM, Sezen UU et al. (2017) The Douglas-Fir genome sequence reveals specialization of the photosynthetic apparatus in pinaceae. G3 (Bethesda) 7:3157–3167
Article CAS Google Scholar
Neves LG, Davis JM, Barbazuk WB, Kirst M (2013) Whole-exome targeted sequencing of the uncharacterized pine genome. Plant J 75:146–156
Article CAS PubMed Google Scholar
Perez P, de los Campos G (2014) Genome-wide regression and prediction with the BGLR statistical package. Genetics 198:483–495
Article PubMed PubMed Central Google Scholar
Ratcliffe B, El-Dien OG, Klápště J, Porth I, Chen C, Jaquish B, El-Kassaby YA (2015) A comparison of genomic selection models across time in interior spruce (Picea engelmannii × glauca) using unordered SNP imputation methods. Heredity 115:547–555
Article CAS PubMed PubMed Central Google Scholar
Resende MDV, Resende MFR, Sansaloni CP, Petroli CD, Missiaggia AA, Aguiar AM, Abad JM, Takahashi EK, Rosado AM, Faria DA et al. (2012a) Genomic selection for growth and wood quality in Eucalyptus: capturing the missing heritability and accelerating breeding for complex traits in forest trees. New Phytol 194:116–128
Article PubMed Google Scholar
Resende MFR, Muñoz P, Acosta JJ, Peter GF, Davis JM, Grattapaglia D, Resende MDV, Kirst M (2012b) Accelerating the domestication of trees using genomic selection: accuracy of prediction models across ages and environments. New Phytol 193:617–624
Article PubMed Google Scholar
Resende MFR, Munoz P, Resende MDV, Garrick DJ, Fernando RL, Davis JM, Jokela EJ, Martin TA, Peter GF, Kirst M (2012c) Accuracy of genomic selection methods in a standard data set of loblolly pine (Pinus taeda L.). Genetics 190:1503–1510
Article PubMed PubMed Central Google Scholar
Resende RT, Resende MDV, Silva FF, Azevedo CF, Takahashi EK, Silva-Junior OB, Grattapaglia D (2017) Assessing the expected response to genomic selection of individuals and families in Eucalyptus breeding with an additive-dominant model. Heredity 119:245–255
Article CAS PubMed PubMed Central Google Scholar
Sallam, AH, Endelman, JB, Jannink, J-L, Smith, KP (2015). Assessing genomic selection prediction accuracy in a dynamic barley breeding population. The Plant Genome 8. https://doi.org/10.3835/plantgenome2014.05.0020
Article Google Scholar
Shen X, Alam M, Ronnegard L (2014). Package “bigRR”: Generalized Ridge Regression (with special advantage for p >> n cases)
Solberg TR, Sonesson AK, Woolliams JA, Meuwissen THE (2008) Genomic selection using different marker types and densities. J Anim Sci 86:2447–2454
Article CAS PubMed Google Scholar
Tan B, Grattapaglia D, Martins GS, Ferreira KZ, Sundberg B, Ingvarsson PK (2017) Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F1 hybrids. BMC Plant Biol 17:110
Article PubMed PubMed Central Google Scholar
Thistlethwaite FR, Ratcliffe B, Klápště J, Porth I, Chen C, Stoehr MU, El-Kassaby YA (2017) Genomic prediction accuracies in space and time for height and wood density of Douglas-fir using exome capture as the genotyping platform. BMC Genom 18:930
Article Google Scholar
Thomson MJ (2014) High-throughput SNP genotyping to accelerate crop improvement. Plant Breed Biotechnol 2:195–212
Article Google Scholar
Ukrainetz NK, Kang K-Y, Aitken SN, Stoehr M, Mansfield SD (2008) Heritability and phenotypic and genetic correlations of coastal Douglas-fir (Pseudotsuga menziesii) wood quality traits. Can J Res 38:1536–1546
Article CAS Google Scholar
Van Eenennaam AL, Weigel KA, Young AE, Cleveland MA, Dekkers JCM (2014) Applied Animal genomics: results from the field. Annu Rev Anim Biosci 2:105–139
Article PubMed Google Scholar
Varshney, RK, Roorkiwal, M, and Sorrells, ME (2017). Genomic selection for crop improvement: new molecular breeding strategies for crop improvement. Springer International Publishing, Switzerland
Chapter Google Scholar
Woolliams, J. A., and Thompson, R. (1994). A theory of genetic contributions. In Proceedings of the 5th World Congress on Genetics Applied to Livestock Production, vol. 19 (ed. Smith, C., Gavora, J. S., Benkel, B., Chesnais, J., Fairfull, W., Gibson, J. P., Kennedy, B. W. and Burnside, E. B.), pp. 127–134. Guelph
Whittaker JC, Thompson R, Denham MC (2000) Marker-assisted selection using ridge regression. Genet Res 75:249–252
Article CAS PubMed Google Scholar
Wray N, Thompson R (1990) Prediction of rates of inbreeding in selected populations Genetical Research 55:41–54
Article CAS PubMed Google Scholar
Wright S (1922) Coefficients of inbreeding and relationship. Am Nat 56:330–338
Article Google Scholar
Wu X, Lund MS, Sun D, Zhang Q, Su G (2015) Impact of relationships between test and training animals and among training animals on reliability of genomic prediction. J Anim Breed Genet 132:366–375
Article CAS PubMed Google Scholar
Yanchuk AD (1996) General and specific combining ability from disconnected partial diallels of coastal Douglas-fir. Silvae Genet 45:37–45
Google Scholar
Yeh FC, Heaman C (1982) Heritabilities and genetic and phenotypic correlations for height and diameter in coastal Douglas-fir. Can J Res 12:181–185
Article Google Scholar
Zhong S, Dekkers JCM, Fernando RL, Jannink J-L (2009) Factors affecting accuracy from genomic selection in populations derived from multiple inbred lines: a barley case study. Genetics 182:355–364
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

We thank British Columbia Ministry of Forests, Lands and Natural Resource Operations, Victoria, BC for data and trials access, MF Resende Jr. and LG Neves of RAPiD Genomics, FL, USA for genotyping.

Funding

This work was supported by Genome British Columbia (User Partnership Program (UPP-001) to YAK and MUS, NSERC Discovery Grant to YAK, TimberWest Forest Corp., Western Forest Products Inc. We declare that the funding agencies did not participate in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author contributions

YAK and MUS conceived and designed the experiment, FRT, BR, and IP collected the data, FRT performed the data analyses and wrote the manuscript, BR, JK, IP, CC, MUS and YAK advised on the data analyses and edited the manuscript.

Author information

Authors and Affiliations

Department of Forest and Conservation Sciences, Faculty of Forestry, The University of British Columbia, 2424 Main Mall, Vancouver, BC, V6T 1Z4, Canada
Frances R. Thistlethwaite, Blaise Ratcliffe, Jaroslav Klápště & Yousry A. El-Kassaby
Scion (New Zealand Forest Research Institute Ltd.), 49 Sala Street, Whakarewarewa, Rotorua, 3046, New Zealand
Jaroslav Klápště
Department of Genetics and Physiology of Forest Trees, Faculty of Forestry and Wood Sciences, Czech University of Life Sciences Prague, Praha 6, 165 21, Czech Republic
Jaroslav Klápště
Département des sciences du bois et de la forêt, Université Laval, G1V 0A6, Québec, QC, Canada
Ilga Porth
Department of Biochemistry and Molecular Biology, Oklahoma State University, Stillwater, OK, 74078-3035, USA
Charles Chen
British Columbia Ministry of Forests, Lands and Natural Resource Operations, Victoria, BC, V8W 9C2, Canada
Michael U. Stoehr

Authors

Frances R. Thistlethwaite
View author publications
You can also search for this author in PubMed Google Scholar
Blaise Ratcliffe
View author publications
You can also search for this author in PubMed Google Scholar
Jaroslav Klápště
View author publications
You can also search for this author in PubMed Google Scholar
Ilga Porth
View author publications
You can also search for this author in PubMed Google Scholar
Charles Chen
View author publications
You can also search for this author in PubMed Google Scholar
Michael U. Stoehr
View author publications
You can also search for this author in PubMed Google Scholar
Yousry A. El-Kassaby
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yousry A. El-Kassaby.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethics approval and consent to participate

Access to progeny test trials was granted by The Ministry of Forests, Lands and Natural Resource Operations of British Columbia, Canada.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Thistlethwaite, F.R., Ratcliffe, B., Klápště, J. et al. Genomic selection of juvenile height across a single-generational gap in Douglas-fir. Heredity 122, 848–863 (2019). https://doi.org/10.1038/s41437-018-0172-0

Download citation

Received: 11 June 2018
Revised: 24 November 2018
Accepted: 26 November 2018
Published: 10 January 2019
Issue Date: June 2019
DOI: https://doi.org/10.1038/s41437-018-0172-0

This article is cited by

Genomic selection: a revolutionary approach for forest tree improvement in the wake of climate change
- Umesh Sharma
- H. P. Sankhyan
- Neeraj Sankhyan
Euphytica (2024)
Genomic prediction in a multi-generation Eucalyptus globulus breeding population
- Geoffrey Haristoy
- Laurent Bouffier
- Jean-Marc Gion
Tree Genetics & Genomes (2023)
Evaluating the accuracy of genomic prediction for the management and conservation of relictual natural tree populations
- Sebastián Arenas
- Andrés J. Cortés
- Juan Pablo Jaramillo-Correa
Tree Genetics & Genomes (2021)
Effect of number of annual rings and tree ages on genomic predictive ability for solid wood properties of Norway spruce
- Linghua Zhou
- Zhiqiang Chen
- María Rosario García-Gil
BMC Genomics (2020)
Assessing the sensitivities of genomic selection for growth and wood quality traits in lodgepole pine using Bayesian models
- Nicholas K. Ukrainetz
- Shawn D. Mansfield
Tree Genetics & Genomes (2020)

Subjects

Abstract

Similar content being viewed by others

Introduction

Methods

Experimental population

Phenotyping, deregression, tissue sampling, DNA extraction, and genotyping

EBV prediction and accuracy

Genomic selection analysis

Ridge regression best linear unbiased predictor

Generalized ridge regression

Bayes-B

Results

Heritability and EBV accuracy

Validation in the progeny generation

ABLUP

GS Scenario 1: HTJ EBVs → HTJ GEBVs

GS Scenario 2: HTJ DEBVs → HTJ GDEBVs

GS Scenario 3: HT35 EBVs → HTJ GEBVs

GS Scenario 4: HT35 DEBVs → HTJ GDEBVs

Discussion

Heritability of juvenile height

Pedigree vs. marker-based models

GS across generations

Main factors that affect GS: relatedness and LD

Conclusions

Data availability

References

Acknowledgments

Funding

Author contributions

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethics approval and consent to participate

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links