Introduction

Many families contain a mixture of good and bad, a topic of perennial interest. But can we really have, some of us, both altruistic siblings and selfish siblings? Studies of behaviour genetics show that some 30–50% of the variation in willingness to help others is heritable (Rushton et al., 1986; Rushton, 2004; Scourfield et al., 2004; Knafo and Plomin, 2006; Hur and Rushton, 2007; Cesarini et al., 2009; Gregory et al., 2009) but cf. (Krueger et al., 2001). This is known from comparison of the altruistic dispositions of monozygotic and dizygotic twins in both Western and non-Western (Hur and Rushton, 2007) populations. Those who do not help others, help themselves, and so anti-social behaviour has a heritability similar to that of prosocial behaviour (Rhee and Waldman, 2002) but cf. (Krueger et al., 2001). So, depending on the genotypes of our parents, my brothers may all be selfish, all altruistic or a mixture of both. The existence of genetically based variation in altruism and selfishness calls for explanation (Rushton, 2004; Rushton et al., 2008). One possibility is that altruism and selfishness have no fitness effects and the observed polymorphism arose by genetic drift, but we do not regard this as credible. In this study, we show that a mix of altruism and selfishness is an expected outcome of natural selection in a plausible model in which, if several individuals provide help, the fitness benefits to the recipient are subject to a law of diminishing returns.

The modern study of the evolution of altruism began with Hamilton's (Hamilton, 1964) demonstration, that altruistic alleles that cause carriers to help relatives are selected if b>c/r (Hamilton's rule), where b represents a fitness benefit obtained by a relative at a cost c to the helper, and r is the coefficient of relatedness between the relatives. This theory elicited a large quantity of empirical and theoretical work, and the question of how altruistic behaviours evolved through natural selection was until recently considered settled. However, now it is again the subject of debate (Okasha, 2010). The problems stem at root from imprecision in empirical testing of Hamilton's rule. If b, c and r could be measured precisely, then it would be straightforward to check that altruism occurred if and only if Hamilton's rule held. Empirical work has amply confirmed that animals behave more altruistically towards kin than non-kin, and the coefficient of relatedness has often been determined precisely, but estimation of fitness costs and benefits is difficult and has therefore generally been imprecise. Furthermore, only a few studies have attempted quantitatively to evaluate Hamilton's rule in animals. This has left theory with only limited empirical leverage. Theoretical work has instead aimed to understand a rich variety of processes that may be at work in the evolution of social behaviour and several approaches have emerged. Early population genetics modelling was comprehensively reviewed by Michod (1982). Subsequently a ‘direct fitness’ approach was developed using the methods of quantitative genetics (see, for example, Queller, 1985, 1992a, 1992b; Taylor and Frank, 1996; Frank, 1998; Fletcher and Doebeli, 2006), and there are also other strong lines of attack using evolutionary game theory (for example, Sigmund, 1993; Doebeli et al., 2004; Hauert and Doebeli, 2004; Archetti, 2009), and multi-level selection (for example, Nowak et al., 2010). Okasha (2010) considered the relationships between the theoretical approaches to be sometimes ambiguous, and identified disagreement among biologists as to which is most fundamental. However, previous workers have generally agreed that population genetics is the touchstone; other approaches are valid provided they can be translated into population genetics, although with additional assumptions if needed. Recently another powerful line of work studying the evolution of social polymorphisms in bacteria and yeast has emerged (see, for example, Ross-Gillespie et al., 2007; Smith et al., 2010). In these studies, strain-specific strategies were identified (‘co-operator’, ‘cheater’) and the fitness of each was measured directly as the number of spores it produced in the next generation. These studies have shown that each strain outperformed the other when at low density, and the fitness differential between the strains changed monotonically with their frequencies. Thus, a stable polymorphism was unequivocally demonstrated empirically. In this and the theoretical approaches described above, fitnesses can be frequency dependent in the sense that the fitness of an individual playing a specified strategy depends on the numbers of individuals playing that and other specified strategies.

However, in none of these approaches does fitness depend on the genotypes of the individual's forebears. Introducing this dependency is the crucial new ingredient in our model. It is needed because in our model fitness depends on the number of encountered relatives who are altruistic, and this is determined by the alleles they carry, which in turn depends on the genotypes of the individual's forebears.

Hamilton (1964) assumed that the benefits of help remain the same as genes causing altruism spread in populations. In reality, however, as more individuals help the benefits each provides may decrease. Suppose I need shelter when injured, food when starving or rescue in battle. The first individual to help may save my life. Latecomers may pay similar costs in attempting to help, but the additional benefit to me is less. There are diminishing returns because there are limits to how much individuals can be helped. Several possibilities are shown in Figure 1a. In the case of constant returns, the benefits per helper are the same irrespective of the number of helpers (case i in Figure 1a) and cumulative benefits increase linearly with the number who help (Figure 2). Cases ii–iv show linear diminishing returns. In this study, we model cases such as these in which help is subject to a law of diminishing returns. Our interest is in investigating when and where genetic polymorphisms may arise in diploid animals as a result of genes with specified effects, which depend on the genotypes of the carrier's sibs. The principal difficulty is in calculating the relationship between the fitnesses of the genotypes and the frequencies of the genes from the phenotypic relationships illustrated in Figures 1a and 2. This is complicated because the genotypes each individual encounters depend on the genotypes of its forebears (as in Table 2). The calculation is simpler without diminishing returns because then the coefficient of relatedness, r, is sufficient to determine the frequencies of encounters with different phenotypes within sibships, as Hamilton showed. Without diminishing returns, our results reduce to Hamilton's. With diminishing returns, population genetic models are needed to take account of the diploidy and the way this affects the relative frequencies of genotypes in the carrier's sibs. The calculations are not possible using existing direct fitness or environmental stress screening approaches; these start from a presumption that fitness is a known function of the level of altruism.

Figure 1
figure 1

Examples of constant or diminishing returns from helping, and their consequences, for full-sibs (top row), half-sibs (middle row) and first cousins (bottom row). Left-hand figures show benefit per helper in relation to the proportion of relatives who provide help. Benefits are in these diagrams in multiples of the cost of help. In case i, the benefits per helper do not vary with the proportion of relatives who provide help, so the cumulative benefits increase linearly with the number of helpers as shown in Figure 2. Cases ii–iv show diminishing returns. In each case the pattern of diminishing returns is similar for the different kin relationships (full-sibs, half-sibs and first cousins) once the coefficient of relatedness is taken into account. Thus, r·b(0)=½ × 3=1.5 for full-sibs, as r=½ and b(0)=3. Similarly r·b(0)=¼ × 6=1.5 for half-sibs, and 1/8 × 12=1.5 for first cousins. At the other extreme when all relatives help (x=1) the intercept is again similar for the different kin relationships, thus b(1)/b(0)=0.50 in case iv, 0.67 in case iii and 0.82 in case ii, except for full-sibs case ii. In this study, we used the value 0.77 because they lack an internal equilibrium if b(1)/b(0)=0.82. Right-hand figures: fitness consequences of the possibilities shown in left-hand figures. FAFS represents the fitness differential between the altruistic and selfish alleles, calculated from equations (1)–(3) and here normalised by division by Nc. In case i, the altruistic allele spreads to fixation because for all three relationships, FAFS>0, but the fitness differential declines linearly to zero as the frequency of the altruistic allele increases. In cases ii–iv, the altruistic allele increases when rare but reaches a stable equilibrium, marked asterisks (*), where FAFS=0.

Figure 2
figure 2

Cumulative benefits shown in relation to the number of relatives, n, who help, for cases i–iv of Figure 1a when the sibship size is 10. The horizontal axis differs from that in Figure 1 because the cumulative benefits, that is, n × b(x), depend on the number of relatives who help. The number of relatives who help is equal to the sibship size times the proportion of relatives who help.

Table 2 Quantities used in fitness calculations

No previous population genetic model of altruism has found a condition for a genetic polymorphism other than overdominance, which is discussed by Uyenoyama and Feldman (1981) and Uyenoyama et al. (1981). In this study, we used rigorous population genetic models to investigate the evolutionary consequences of diminishing returns, which we show may lead to the stable coexistence in the same population of both altruistic and selfish individuals.

Population genetics model

We assume a standard single-locus two-allele model of selection in a diploid population with discrete generations and random mating so that immediately after conception genotypes occur in Hardy–Weinberg proportions. The alleles are labelled A and S and their properties are described below. The frequencies of the sib genotypes have to be calculated separately for each possible common ancestor or pair of common ancestors. These frequencies then have to be averaged over the possible common ancestors or pairs of common ancestors. As in the derivation of Hamilton's rule, we have had to assume weak selection so that the genotypic frequencies in previous generations of parents at the time of mating are, despite the effects of altruistic selection, in Hardy–Weinberg equilibrium. This amounts to ignoring terms of second order in the effects of altruistic selection. Simulations showed that the effect of this approximation is very small (RM Sibly and RN Curnow, in preparation).

We now calculate the fitness values, FSS, FSA and FAA, of the three genotypes that determine the changes in the frequencies of the genotypes as a result of the altruistic selection. The frequency of each genotype after selection is its frequency before selection multiplied by its fitness, normalised by dividing by the sum of (fitness × frequency before selection) of all three genotypes. The genotypic fitnesses are then used to calculate the fitnesses of the two alleles, FS and FA. The frequency of the A allele after the altruistic selection is its frequency before selection multiplied by its fitness, normalised by dividing by the average of (fitness × frequency before selection) of both alleles. The change in the frequency of the A allele because of the altruistic selection is pq(FAFS) divided by the sum fitness of the two alleles. Thus, the evolution of the two alleles is determined by the sign of FAFS. If there are no diminishing returns, the sign of FAFS is given by Hamilton's rule (that is, by the sign of b/r−c).

Let p be the frequency of a dominant ‘altruistic’ allele, call it A, that causes its carriers to help relatives related by a coefficient of relatedness r, in which case each recipient receives a fitness benefit b(x) from each helper, where x is the proportion of relatives who help, but each helper pays a fitness cost c. b will here always be written as a function of the proportion of relatives who help, that is, as b(x); this is not to be confused with the alternative use of brackets to denote grouping in mathematical expressions. Let the alternative allele at the locus be S, a recessive non-altruistic allele that occurs with frequency q=1−p. Homozygous SS individuals do not help relatives and will be labelled selfish. The net benefits obtained by individuals depend on their phenotypes and on those of their relatives as shown in Table 1.

Table 1 Net benefit to an individual, after the subtraction of costs, of encounters with one of his relatives, classified according to the selfish/altruistic phenotype of each

To calculate the fitness values, we first calculate the proportion of relatives of an individual who are altruistic. This depends on the individual's genotype (SS, SA or AA) and on the genotype(s) of the forebear(s) he shares with his relatives. Using this approach, we derive equations (1)–(3) giving formulae for the difference between the fitnesses FA and FS of the altruistic and selfish alleles, respectively, for each of three types of relationship: half-sibs, full-sibs and first cousins, in terms of the frequencies of the two alleles, the values of the benefits and costs, and the size of the sibship. We begin with half-sibs because their mathematical analysis is simpler.

Half-sibs have one parent in common. The genotype of the common parent determines the proportions of offspring that are of each genotype, and these are shown in Table 2a. The average fitness of an offspring of specified genotype is the sum of the product of the frequencies of its encounters with its relatives of each phenotype, times the benefit minus the cost of each encounter. Costs and benefits depend on phenotype as shown in Table 1. We assume that all pairs of sibs have equal numbers of encounters providing opportunities for help to be received or given. Benefits and costs depend on the proportions of relatives that are selfish or altruistic, and these depend on the genotype of the common parent as specified in Table 2a. Consider offspring of genotype SS with an SS parent. With our assumption that the parental genotypes are in Hardy–Weinberg equilibrium, the frequency of these offspring is q2·q=q3 from Table 2a. A proportion q of their half-sibs sharing the SS parent are selfish and p are altruistic. Encounters with the former are cost and benefit free, encounters with the latter yield benefit b(p) to the SS offspring. So the net fitness benefit of these encounters is q3(q·0+p·b(p))=q3pb(p) multiplied by the size of the sibship, N, which is assumed constant. This gives the first term on the right-hand side of the equation below. Note that sibship size appears in all cost and benefit terms. Extending the method to the other two types of sibship (second and third terms on the right-hand side of the equation below), we can now obtain the average fitness, FSS, of an SS individual from the equation

The right-hand side of this equation, derived above, shows the total of the changes to the fitness of the SS genotype that result from encounters between relatives. To obtain the left-hand side note that these changes accrue to the q2 copies of the SS genotype (both sides of the equation have been normalised by dividing by population size) and result in an increase in their average fitness above its baseline value of 1.

Rearranging the equation we have

Similarly

Finally, the fitness of each allele is the average fitness of the genotypes containing the allele, so FAFS=(q FAS+p FAA)−(q FSS+p FAS). This gives

Equation (1) provides a formula for calculating the difference between the fitnesses FA and FS of the altruistic and selfish alleles in half sibs. We now use the same method to derive equivalent formulae for full sibs and first cousins.

Full-sibs have two parents in common, and there are six possible combinations of parental genotypes. The proportions of offspring of each genotype are shown for each of the six possible parental matings in Table 2b. The procedure used for half-sibs here gives

First cousins have one pair of grandparents in common, and there are six possible combinations of these grandparental genotypes. The proportions of their grandchildren that are of each genotype are shown for each of the six possible grandparental matings in Table 2c in the ‘offspring’ columns. The genotypic frequencies in this case are derived by multiplying the frequencies of the alleles in the common grandparents by the frequencies, p and q, of the random alleles from the unshared grandparents. The same procedure as before here gives

Discussion

Equations (1)–(3) give formulae for the difference between the fitnesses FA and FS of the altruistic and selfish alleles, that is, for the relative fitness, FAFS, of an altruistic allele. These equations show how the relative fitness of an altruistic allele depends on the frequencies of the two alleles, p and q=1−p, the values of the benefits and costs, b(x) and c, and sibship size, N. For given values of c, b(x) and N, the equations show how the relative fitness of an altruistic allele varies with its frequency p in half-sibs, full-sibs and first cousins, respectively. If FAFS >0, the A allele is selected and increases in frequency, whereas the S allele declines. Conversely if FAFS <0, S allele is selected. Thus, a new mutation from an S to an A allele will increase in frequency if FAFS >0 at p=0. Subsequently there are two possibilities:

  1. i)

    The A allele spreads to fixation at p=1 if FAFS continues to be positive as p increases. The selfish allele is then eliminated. Note that when there is no diminution in returns equations (1)–(3), all reduce to Hamilton's rule (Hamilton, 1964) that the A allele invades the population only if b/c>1/r, where r is the coefficient of relatedness. To see this, note that if b(x)=b(0), equation (1) simplifies to FAFS=Nq (−c+b/4). So FAFS>0 if and only if b/4>c, and this is Hamilton's rule for half-sibs. The other cases are obtained similarly. An example of constant returns is illustrated in Figure 1, as case i for each type of relationship. Because returns are constant, b(x)=b(0), and values of b(0) were chosen so that rb(0)=1.5 c for each type of relationship. Thus, Hamilton's rule b/c>1/r holds for all values of p (see Figure 1 caption), and this is why the A allele is selected, that is, FAFS >0 for all values of p, for case i in the panels on the right-hand side of Figure 1.

  2. ii)

    Alternatively, as p increases there comes a point beyond which FAFS is negative. The A allele is then selected against. In between the low frequencies where A is selected and the high frequencies where it is selected against, there is a stable equilibrium where FAFS=0. At the stable equilibrium the population contains both selfish and altruistic individuals. Examples of stable equilibria are given for each type of relationship on the right-hand side of Figure 1 (cases ii–iv). The positions of the equilibria are marked by asterisks. The equilibrium frequencies of the A allele and the corresponding proportions of altruistic individuals are given for each of the three types of sibship in Table 3. Note that as the lines relating b(x) to x get steeper (case i through iv) the stable equilibria occur at lower values of p, and result in higher frequencies of selfish individuals.

    Table 3 Equilibrium values of the frequencies of the altruistic allele

The existence of stable equilibria can be understood intuitively using Hamilton's rule. Altruists are selected when there are few helpers because there is then little diminution of returns from helping, so Hamilton's rule holds provided b(0)>c/r. At the other extreme altruists are abundant and diminution of returns is maximal, so the benefits obtained per individual are reduced. With sufficient diminution, Hamilton's rule no longer holds and selfish individuals are at an advantage. In sum, altruists are selected when rare but selected against when common. The result is an evolutionary equilibrium with some individuals selfish and others altruistic.

Our results reduce to Hamilton's rule if the returns from helping are constant (no diminishing returns). With diminishing returns, our method shows what happens for the realistic case that fitness depends on the number of encountered relatives who are altruistic. Whether a relative is altruistic depends on his genotype, and that depends on the genotypes of his immediate forebears. Without these dependencies a simpler analysis would be feasible. Fitness would then depend only on the coefficient of relatedness and the level of overall altruism in the population, so direct fitness or environmental stress screening models could be used.

The benefit functions b(x) illustrated in Figure 1 are linear, but this is not assumed in deriving equations (1)–(3). The equations can be used whatever form is taken by the b(x) function and stable equilibria may again occur. For full-sibs, for example, the conditions on b(1/2), b(3/4) and b(1) in (2) that lead to FAFS=0 for gene frequencies between 0 and 1 could be used to show the extent of nonlinearity that results in stable equilibria. In deriving equations (1)–(3), we assumed each individual had N relatives and encountered them once each. Changing the frequency of encounters, or making them probabilistic, could be incorporated by modifying the definition of N. Our hope is that empiricists will use equations (1)–(3) and in particular will attempt to quantify costs and benefits. To calculate costs and benefits in practice one would use Charlesworth’s method for calculating genotype fitnesses for iteraparous life histories (Charlesworth, 1994). Fitness of genotype i is then defined as the number, Fi which solves the Euler–Lotka equation where li(x) and ni(x) represent respectively survival from birth to age x, and expected number of offspring then produced, by individuals of genotype i. As an example, help costs 0.007 in fitness units that involves young adults taking a 20% risk of dying, if one assumes their genotype reaches adulthood at age 15 years with chance 0.4, and one baby is then born every four years if the adult survives, which happens with chance 0.95 per year (see Sibly (2002) for calculation details).

We hope our analysis will spur further theoretical investigations and also experimentation to estimate the benefit and cost parameters associated with equilibria in populations containing both selfish and altruistic individuals. Help comes in many forms, presumably with different genetic bases, for example, alloparental care during child rearing (Hawkes et al., 1997; Hrdy, 2009), provisioning during times of need (Allen-Arave et al., 2008), and defence of resources against competitors (Rose and Marshall, 1996). Each and any of these may be subject to laws of diminishing returns, which may be linear as in Figure 1 or nonlinear. The form of the relationship between help received and number of helpers requires quantitative description. From the quantitative form of the relationship the expected mix of selfishness and altruism can be calculated using equations (1)–(3).

The frequencies of the three genotypes calculated for each set of common forebears are exact only when the sibship sizes are infinite but we have shown, by exact calculations, that equilibria can exist even when an individual has only two relatives. It may also be objected that potential helpers may in some situations adjust the amount of help they provide according to the number of other altruists present, thus reducing the cost of helping. This situation is not represented in our simplistic model. However there are other situations where altruists would not be aware of other altruists, for example, because they could not see them. It may also be that the genetic determination of altruism is polygenic, not single locus. Polygenic determination could be represented in a model for an underlying liability that determines altruism or selfishness according to some threshold value on the liability scale. Conditions under which this leads to stable equilibria need investigation. Accepting the simplifications inherent in our model, our results do provide an explanation for the otherwise puzzling finding that prosocial and antisocial behaviours are heritable (Rushton et al., 1986; Rhee and Waldman, 2002; Rushton, 2004; Scourfield et al., 2004; Knafo and Plomin, 2006; Hur and Rushton, 2007; Cesarini et al., 2009; Gregory et al., 2009).

The existence of innately selfish and innately altruistic individuals provides an opening for the evolution of discriminating individuals, who withhold help from selfish individuals and only help those who help others. The basis for this discrimination could be simple observation of which individuals are altruistic and selfish in social groups, such as those in which great apes live. Such discrimination could involve keeping tally of favours returned, as has been observed in man (Hill and Kaplan, 1993; Gurven, 2006). This would be more direct than a system of discrimination based on kin recognition, which has been much discussed (see, for example, Rousset and Roze, 2007). Further analysis is needed to understand under what conditions and to what extent genes for discrimination enter populations at equilibrium between selfishness and altruism.