Introduction

The effective population size is one of the most important parameters in the definition of selection programmes, because the magnitude of this parameter determines the effects of both random genetic drift and inbreeding (Falconer, 1989; Caballero, 1994). In the absence of selection, all the individuals of a population have the same expected number of offspring, and differences in family size are caused only by random sampling of individuals among families. However, in populations under selection, families do not have equal probabilities of contributing to the next generation because of inherited or noninherited causes. This leads to a reduction in the effective population size. The effective size of selected populations, however, cannot be predicted solely from the variance of the family size at a given generation, because selection affects the variance of the family size not only in the parental generation but also in the ancestral generations (Wray et al., 1990).

There are two approaches to predict the effective size of populations under selection. First, the effective population size (variance effective size) can be predicted from the variance of gene frequency, using the idea of accumulation of selective advantage over generations (Robertson, 1961). Following this approach, Santiago & Caballero (1995) obtained accurate predictions for selected populations with discrete generations. The method has been extended to monoecious populations with partial selfing (Caballero & Santiago, 1996), dioecious populations with overlapping generations (Nomura, 1996), sex-linked loci (Nomura, 1997), linked autosomal loci (Santiago & Caballero, 1998) and nonrandom mating (Wang, 1998). Secondly, the effective population size (inbreeding effective size) can be predicted from the rate of inbreeding, using the concept of long-term genetic contributions (Wray & Thompson, 1990). Wray & Thompson (1990) showed that the rate of inbreeding is proportional to the sum of squared long-term genetic contributions of ancestors, and derived a method to predict rates of inbreeding in populations under mass selection. Their method, however, requires complex iterative computations. In order to overcome this problem, Woolliams et al. (1993) proposed a method for predicting without iterations. Wray et al. (1994) extended the method to index selection with family information. Later, Woolliams & Thompson (1994) corrected this method for some omissions and the equation they obtained has been used to optimize breeding programmes (Villanueva et al., 1996; Villanueva & Woolliams, 1997).

Although the variance and inbreeding effective sizes should be the same under constant population size over generations (Crow & Kimura, 1970; Caballero, 1994), the two approaches to predict the effective size of selected populations give different expressions. It is unclear as yet how these two approaches differ in their assumptions and approximations (Woolliams & Thompson, 1994). In this paper, the author clarifies the meaning of the difference based on the long-term genetic contributions approach. It is shown that the equation of Santiago & Caballero (1995) can also be derived from this approach, if the effect of mates on the long-term genetic contributions is correctly accounted for.

Model and assumptions

Although the theory can be extended to a population with different numbers of male and female parents, for the purpose of this paper we assume a population in which the same numbers of male and female parents are randomly pair-mated to create N/2 families (couples) each generation. Each family contributes n progeny of each sex to candidates of selection. Among the progeny of each sex, the best N/2 individuals are selected on the basis of their phenotypic values. The proportion selected is p=1/n, and the selection intensity is i=z/p, where z is the normal ordinate corresponding to the truncation point of selection (x). The trait under selection is assumed to be determined by an infinite number of additive loci, each with infinitesimal effect (the infinitesimal model; Bulmer, 1980). The variance reduction factor due to the Bulmer effect (Bulmer, 1980) is k=i(ix) and the coefficient of competitiveness defined by Woolliams et al. (1993) is c=(1 − kh2)/2, where h2 is the heritability. The population is assumed to be at the steady-state under the infinitesimal model. Then asymptotic values are used for all genetic parameters. Constant population size and discrete generations are assumed.

The genetic contribution of an ancestor j in generation n1 to descendants in a later generation n2 is defined as the total additive relationship between the ancestor and the descendants (Wray & Thompson, 1990). The long-term contribution is the contribution when n2 is much bigger than n1. In the absence of selection, an ancestor is expected to contribute two offspring to the next generation, each with the additive relationship 1/2. The expected total contribution is therefore 1. This expectation remains constant in the later generations, and the expected long-term contribution of an ancestor is 1. Under selection, genetically superior ancestors are expected to contribute more descendants than genetically inferior ancestors. Thus, ancestors have different expected genetic contributions, depending on their selective advantage. We relate the effective population size to the long-term contribution of ancestors in a given generation [following Wray & Thompson (1990), this generation is referred to as generation 1]. The long-term contribution of ancestor j of sex x (=m or f) to descendants in generation t is denoted by rj(x),t. Note that an ancestor and its mate contribute to their descendants in any generations with exactly the same rate, because there are no half-sib families in our model. Then, the long-term contributions of male and female parents in couple j are rj,t rj(m),t=rj(f),t.

In the works of Wray & Thompson (1990) and Woolliams et al. (1993), the genetic contribution of an ancestor is formulated only with its own breeding value. However, as suggested by Woolliams & Thompson (1994), the expected long-term contribution of an ancestor should be conditional not only on its breeding value but also on the breeding value of mates. Let Aj(m) and Aj(f) be the deviations of breeding values of male and female parents in couple j in generation 1 from their selected contemporaries, respectively, and σP be the phenotypic standard deviation. Defining A*j=(Aj(m) + Aj(f))/2σP, the expectation of long-term contributions (μj,t) of parents in this couple, conditional on Aj*, is:

and the limiting value (μj,∞) is:

where St=∑ti=0 ci and S=∑i=0 ci=(1 – c)–1 (Woolliams et al., 1993). Further, a genealogical pathway descended from this couple is expected to express the relative selective advantage in the selection of generation t:

(Woolliams et al., 1993).

Comparison of equations and discussion

Santiago & Caballero (1995) derived a prediction equation for effective size (Ne) in terms of the variance of change in the frequency of a neutral allele. Under random mating, the equation reduces to:

where ρFS is the intraclass correlation of full-sibs.

An alternative viewpoint for predicting the effective size is in terms of the rate of inbreeding. According to Wray & Thompson (1990), the rate of inbreeding is related to the long-term contribution as:

where E[r2j,∞]=∑N/2j=1r2j,∞/(N/2). Instead of the direct computation of E[r2j,∞], Wray et al. (1994) derived the expression for E[r2j,2], and then evaluated E[r2j,∞] with the expression. The expression for E[r2j,2] is:

where the constant 1 is the mean squared under random selection, σ2r(e),2=1/2 is the sampling variance under random selection (see also Wray & Thompson, 1990), and σ2r(g),2=i2ρFS is the variance attributable to selective advantage of couples. To evaluate E[r2j,∞] with this expression, Wray et al. (1994) took the following aspects into account.

1 Under random selection (i.e. σ2r(g),2=0), the mean of the long-term contributions of the ancestors is the same for t=2 as for t=∞, and so the first term in eqn (3) remains unchanged. This holds also under selection (Woolliams et al., 1993).

2 Under random selection, the sampling variance σ2r(e),2 is increased by a factor of 2 from t=2 to t=∞ (Wray & Thompson, 1990). Thus, σ2r(e),∞=2σ2r(e),2=1. This leads to ΔF=1/2N or Ne=N, as expected.

3 The doubling of the variance terms (σ2r(e),2 + σ2r(g),2) is insufficient for the expression of E[r2j,∞] under selection, because of the cumulative effect of selective advantage. The accumulation of selective advantage can be incorporated by multiplying σ2r(g),2 by S2.

Accounting for these points results in:

Then:

or:

which agrees with eqn (1).

In the original work of Wray et al. (1994), σ2r(e),2= ½ + i2ρy and σ2r(g),2=i2ρx were used, where ρx and ρy are the correlations of full-sibs arising from the breeding values of the ancestors and their mates, respectively, i.e. ρx=ρy=ρFS/2. Applying the rule to evaluate E[r2j,∞], they obtained an expression:

But this expression has an omission, because the accumulation of the selective advantage from mates is not accounted for (Woolliams & Thompson, 1994). To correct this omission, the effect of mates [the second term of eqn (5)] should be accumulated in the same process as the last term of eqn (5). This correction leads to eqn (4).

Recently, Woolliams (1998) showed that eqn (2) can be approximated by:

Noting that E[Aj*]=0 and E[Aj*2]=ρFS,

Then, we obtain again eqn (1).

Using the iterative regression method of Wray & Thompson (1990), Woolliams et al. (1993) derived an alternative expression for E[r2j,∞],

where Pj,t=∏l=tμj,l–1,l and Vt is the sampling variance of long-term genetic contribution arising in generation t. If we neglect the contributions of O(Aj*2) or higher following the suggestion of Woolliams & Thompson (1994), then:

and:

Thus the first term of eqn (6) equals E[μ2j,∞]=1 + S2i2ρFS. The second term of eqn (6) accounts for the accumulation of sampling variance (we denote this term by E[σ2j,∞]). An approximated expression of E[σ2j,∞] may be obtained by using the terms listed in appendix 3 of Woolliams et al. (1993). Using the terms A3.6, A3.10 and A3.16 in Woolliams et al. (1993),

where B=∑j=0(c2/2)j=2/(2 – c2). From this approximation, the prediction equation of Woolliams & Thompson (1994),

is obtained. If we further include term A3.7 in Woolliams et al. (1993), eqn (7) becomes:

leading to:

which is the simplified expression of the equation used by Villanueva & Woolliams (1997). [In these derivations, ρx (the correlation of full-sibs arising from the breeding value of the parent of sex x) in A3.7 and A3.10 should be replaced by ρFS, because the cumulative effect of the selective advantage of the mates is omitted in the work of Woolliams et al. (1993). See also appendix of Woolliams & Thompson (1994).]

In the case of the iterative regression method [eqn (6)], eqn (4) is derived by assuming:

The equality of E[μ2j,∞]=1 + S2i2ρFS was given in this paper, and the equality of E[μ2j,∞]=E[σ2j,∞] is obtained by assuming that the litter size of each family, i.e. the number of scored individuals in each family, follows a Poisson distribution (Woolliams, 1998; J. A. Woolliams & P. Bijma, pers. comm.). As seen from eqns (8) and (9) of Santiago & Caballero (1995), the assumption of a Poisson distribution of litter size has a very small effect on the prediction of Ne (the effect on the prediction is of the order of 1/T, where T is the total number of scored individuals). In fact, eqn (1) is the approximation obtained from a Poisson distribution of litter size (Santiago & Caballero, 1995). On the other hand, in the derivation of eqns (7) and (9), the terms resulting from an approximated expansion of E[σ2j,∞] were listed (Woolliams et al., 1993), and only terms with a larger effect on Ne were included. All the terms of the order of 1/T were also neglected in the derivation. Thus, eqns (8) and (10) may be regarded as approximations to eqn (1). Although simulation studies have shown that the approximated eqns (8) and (10) give satisfactory predictions (Woolliams & Thompson, 1994; Villanueva & Woolliams, 1997), eqn (1) is preferable to these equations because of the neater expression involving less approximation.

In conclusion, an identical equation is obtained from the drift approach (Santiago & Caballero, 1995) and the long-term contribution approach (Wray & Thompson, 1990), as seen from eqns (1) and (4). The two approaches have, however, their respective appealing points. The long-term genetic contribution approach provides a prediction of genetic gain (Woolliams & Thompson, 1994; Woolliams, 1998), and gives an insight into the design of breeding schemes to maximize gain with constraints of inbreeding (Woolliams & Thompson, 1994; Villanueva & Woolliams, 1997). On the other hand, the prediction of effective size from the drift approach can be extended to complex genetic systems, such as sex-linked loci (Nomura, 1997), linked autosomal loci (Santiago & Caballero, 1998) and nonrandom mating populations (Santiago & Caballero, 1995; Wang, 1998). All the extensions seem to be difficult for the long-term genetic contribution approach.