Introduction

Population genetics theory predicts that genomic regions with low or reduced recombination will exhibit reduced levels of adaptation and variability, caused by the action of selection at closely linked sites (the Hill–Robertson effect) (Hill and Robertson, 1966; Gordo and Charlesworth, 2001). This effect arises from the fact that any locus linked to another that is under directional selection experiences a reduction in effective population size (Ne). Since the efficacy of selection on a mutation is a function of the product of Ne and the effect of the mutation on fitness (Kimura, 1983), such linkage affects the probability of fixation of new mutations; favourable mutations are less likely to reach fixation and the opposite is true for deleterious mutations. Hill–Robertson effects can be caused by the spread to fixation of advantageous mutations (hitchhiking: (Maynard-Smith and Haigh, 1974; Gillespie, 2000)), the elimination of recurrent deleterious mutations (background selection: (Charlesworth et al., 1993)), the stochastic accumulation of deleterious mutations (Muller's ratchet: (Muller, 1964; Kim and Stephan, 2000)) and mutual interference between weakly selected mutations that spend long periods at intermediate frequencies (weak selection Hill–Robertson interference: (McVean and Charlesworth, 2000; Comeron and Kreitman, 2002)).

Recombination reduces the extent of this interference, increasing Ne and hence the efficacy of selection. We therefore expect lower rates of adaptive evolution, and higher rates of fixation of deleterious mutations, in genomic regions with low levels of genetic recombination, compared with more highly recombining regions. In addition, the level of variability at neutral or nearly neutral sites will also be reduced in low recombination genomes, since such variability is proportional to the product of Ne and the mutation rate (Kimura, 1983).

This prediction is consistent with a number of observations on molecular evolution and variation, with evidence for reduced levels of variability and adaptation in genomic regions with low levels of recombination (Gordo and Charlesworth, 2001; Presgraves, 2005). In particular, the degeneration of Y- and W chromosomes (in species with male and female heterogamety, respectively) is thought to result from this effect, since these have evolved as permanently heterozygous, non-recombining units (Charlesworth and Charlesworth, 2000). Support for this hypothesis is provided by data showing accelerated rates of protein sequence evolution and reduced levels of silent site DNA sequence diversity in systems such as the neo-Y chromosome of Drosophila miranda (Bachtrog, 2004), the evolving Y chromosome of Silene latifolia (Nicolas et al., 2005), and the W chromosome of the chicken (Berlin and Ellegren, 2004, 2006).

Here we ask whether this observation holds generally in birds by using the fact that the mitochondrial (mt) genome of birds appears to be strictly maternally transmitted, and is thus completely associated with the W chromosome (Berlin and Ellegren, 2001; Berlin et al., 2004). If Hill–Robertson effects reduce W-chromosome diversity, then mtDNA diversity will also be reduced in birds. We test for such a reduction by comparing bird and mammalian mtDNA diversities, and evaluating possible confounding effects of differences in effective population size and mutation rates.

It has recently been proposed that Hill–Robertson effects are so pervasive in animal mtDNA that its genetic diversity has little relation to population size (Bazin et al., 2006), in accordance with the theoretical proposal of Gillespie (2000). We show below that there are some patterns in DNA sequence variability in both bird and mtDNA that appear to be inconsistent with this idea; any Hill–Robertson effects that are acting seem to be too weak to entirely override the signature of demographic factors that influence effective population size.

Materials and methods

Sequences and statistical analyses

Mammalian and avian mitochondrial cytochrome b sequences were collected from NCBI (http://www.ncbi.nlm.nih.gov/). This gene was chosen because it is the most widely sequenced coding region of the mitochondrial genome. One hundred and four mammalian species from 69 genera and 88 avian species from 61 genera were included in the study. We did not include avian species from the superorder Palaeognathae (ratites and tinnamous) because the Z- and W chromosomes of many of these species have only differentiated to a limited extent (Tsuda et al., 2007). All raw data along with references to each data set can be found as Supplementary Information Tables S1–S4. We generated two data sets each for birds and mammals; one ‘total’ data set and one ‘population’ data set, the latter containing sequences collected from a single location. Thus, each data point in the ‘total’ data set represents sequence information from one species (regardless of sampling location), whereas each data point in the ‘population’ data set is from one population (sequences sampled from one location). If there was more than one species per genus in the ‘total’ data set, the average value of the sequence diversity estimates per genus was used; if there was more than one population per species in the ‘population’ data set, a population average was used, and species averages were then used to get the genus average.

The nucleotide sequences were aligned by hand (no gaps). Intraspecific sequence variation was analysed in DnaSP 4.0 (Rozas et al., 2003). The pairwise nucleotide diversity, π̂ (Nei, 1987), with Jukes Cantor correction (Jukes and Cantor, 1969), and Watterson's estimator (θ^) (Watterson, 1975) were obtained for synonymous (π̂S, θ^S) and non-synonymous sites (π̂A, θ^A) separately for each species or population data set. These two estimates of the population parameter, θ=4, are expected to be the same for neutral sites at statistical equilibrium under mutation and genetic drift (Tajima, 1989). For alignments containing complex codons that differed by multiple substitutions, θ^S and θ^A were manually calculated, using the number of synonymous and replacement substitutions per synonymous and replacement sites respectively (S), from the formula

where a is Watterson's correction factor, given by:

and n is the number of sequences in the sample (Watterson, 1975; Nei, 1987).

The variances of the values of π̂S and π̂A were obtained from the formula:

where k is the number of synonymous or non-synonymous sites, and π̂G is the overall unweighted genus mean (Nei, 1987, p 257). We used the genus mean rather than the sample value, since this should be closer to the expected value of π̂, and hence avoids biases caused by low estimates of π̂ for individual samples (Bartolomé et al., 2005).

The variances of the corresponding θ^ values were obtained from the following formula (Nei, 1987, p 255):

where θ^G is the overall unweighted genus mean θ^ for birds and mammals separately and b is given by:

Variance-weighted values of π̂S, π̂A were then generated as the ratio of the means of π̂/varπ̂ to the mean of 1/varπ̂ (and similarly for θ^), as described by Bartolomé et al. (2005). The standard errors of the mean diversity estimates across a set of genera were estimated as the standard deviation divided by the square root of the sample size (the number of genera).

The ratios π̂A/π̂S and θ^A/θ^S were calculated as the mean values of the diversity estimates for each genus. The variances of these ratios were estimated using the above formulae and the relation:

All estimates were statistically compared by non-parametric Mann–Whitney U-tests, in Mini-Tab 14.20.

Body masses

Species were classified according to body mass, with one category containing species with body masses up to 100 g and another category including species with body masses from 100 to 6175 g (the largest bird). Large mammals with body masses over 6175 g were not used in any comparisons. Body mass measurements were taken from the literature (Dunning, 1993; Silva and Downing, 1995; Morrow and Fricke, 2004; Pitcher et al., 2005). When the literature reported a range of body masses, the means of these were used: π̂S, π̂A, π̂A/π̂S, θ^S, θ^A and θ^A/θ^S were then compared for mammals and birds within each body mass category. These estimates were also compared between passerines and bats. The mean values of π̂S, π̂A, π̂A/π̂S, θ^S, θ^A and θ^A/θ^S were also reported for each order.

Population subdivision

Average FST values per genera for birds and mammals, respectively, were estimated in DnaSP 4.0 (Rozas et al., 2003), using equation 3 in Hudson et al. (1992).

Neutrality index

The neutrality index (NI) (Rand and Kann, 1996) was estimated for sister taxa (more closely related to each other than to any other species in the data set) and then compared between mammals and birds. NI reflects the extent to which the levels of non-synonymous diversity depart from a strict neutral model and can be calculated as follows: (π̂A/π̂S)/(KA/KS). NI=1, indicates neutrality; NI>1, indicate an excess of amino acid replacements within species; and NI<1 indicates an excess of amino acid changes between species.

π̂A and π̂S were estimated in DnaSP, as described above. KA and KS are the non-synonymous and synonymous pairwise sequence divergences between related taxa, calculated using the maximum likelihood method implemented in codeml (runmode=−2, seqtype=1) in PAML 3.15 (Yang, 1997). PAML uses maximum likelihood to fit models of mutational changes among codons. Codeml implements the codon substitution model of Goldman and Yang (1994), which allows for arbitrary frequencies of all possible codons, a transition/transversion mutational bias, and a parameter equivalent to KA/KS that measures the relative rates of non-synonymous and synonymous changes. The substitution rates are estimated by using the nucleotide frequencies at each codon position (the F3 × 4 model of (Yang, 1997)) to obtain the expected codon frequencies at equilibrium under the substitution process.

In DnaSP, the total numbers of synonymous and non-synonymous sites are computed by the method of Nei and Gojobori (1986), with a Jukes–Cantor correction for multiple changes. In contrast to the method of Goldman and Yang (1994), this procedure assumes that all nucleotide changes are equally frequent; two-thirds of changes at twofold degenerate positions are thus assumed to be synonymous. The numbers of non-synonymous and synonymous sites in a sequence are estimated by weighting each site by the frequency with mutations are expected to result in the respective type of change, and the observed numbers of the two types of differences between two sequences are divided by the estimated numbers of sites to obtain the fractions of non-synonymous and synonymous changes, respectively. Despite the fact that the two methods may give differences in the estimates of KA and KS themselves, the ratio KA/KS is usually very similar, as we have established in a study of 22 genes in a comparison of Drosophila americana and D. ezoana (Maside and Charlesworth, unpublished). For either method, differences between species in levels of codon usage bias could result in differences in estimates of KA/KS or π̂A/π̂S, but we found little evidence for a difference in codon usage between birds and mammals for cytochrome b (data not shown).

Results and discussion

Differences in mtDNA sequence diversity between mammals and birds

Mean pairwise diversity per synonymous nucleotide site (π̂S) for mammalian genera ranged from 0 to 0.452 with a mean of 0.086±0.011, and non-synonymous site diversity (π̂A) ranged from 0 to 0.0259 with a mean of 0.0036±0.0005 (Table 1). π̂S in birds ranged between 0 and 0.1374, with a mean of 0.0266±0.0045, and π̂A ranged between 0 and 0.0158 with a mean of 0.0019±0.0003 (Table 1). π̂S was significantly higher in mammals than in birds (Mann–Whitney U-test, W=2840.5, P<0.0001), as was π̂A (Mann–Whitney U-test, W=3169.5, P=0.0001) (Table 1). Similar results are obtained with the θ^ estimates. Overall, therefore, there appears to be highly significantly lower mtDNA variability in birds than in mammals. Similar results are obtained when diversity is compared between bird and mammalian orders; both mean synonymous and non-synonymous diversity are significantly lower for bird orders compared to mammalian orders (π̂S: W=117.0, P=0.017; π̂A: W: 123.0, P=0.044) (Table 2).

Table 1 Total genetic diversity estimates for birds and mammals (±s.e.)
Table 2 Total genetic diversity estimates for orders of birds and mammals (±s.e.)

Generation time effects on genetic diversity

Difference in generation times are likely to affect differences in mtDNA variability between birds and mammals, since the neutral diversity for a given effective population size (Ne) is proportional to mutation rate per generation (Kimura, 1983), which is the product of generation time and mutation rate per year. To examine this effect, we divided the groups of species into two categories with respect to body size (less than 100 g and 100–6175 g) (Table 1). Body size correlates strongly with generation time (Finch, 1990; Charnov, 1993), and reliable data are more widely available for body size than generation time.

We found a significantly negative correlation between the genus averages synonymous diversity and mammalian body mass (Spearman's rank correlation: ρ=−0.499, P<0.0001). The correlation for birds is in the same direction, but not as strong (ρ=−0.295, P=0.034). We also fitted least squares regression lines to the bird and mammals separately, and used bootstrapping to obtain confidence intervals (CI), as described by Haddrill et al. (2005). Figure 1 shows that birds have a lower intercept than mammals (0.030 with 95% CI 0.021/0.040 versus 0.095 with 95% CI 0.071/0.122), but a larger magnitude of regression coefficient (−7.39 × 10−6 with 95% CI −2.21 × 10−5/−4.01 × 10−6 versus −2.94 × 10−7 with 95% CI −1.06 × 10−6 versus −1.66 × 10−7).

Figure 1
figure 1

The mean values of synonymous diversity for genera of birds (open squares) and mammals (filled squares), plotted against the corresponding mean body masses. The regression lines, fitted as described in the text, are also shown.

Overall, the results show that birds have significantly less synonymous site variability than mammals with comparable body sizes. A similar pattern is observed for non-synonymous variability, although the difference is not as pronounced. The most conservative comparison is probably that between small bats and small passerines, which are the species groups with the most similar generation times. Synonymous diversity is significantly lower in the passerines compared to the bats (W=292.0, P=0.0004), and non-synonymous diversity is also lower (W=339.0, P=0.0537) (Table 1). The relation between body size and diversity presumably reflect smaller Ne values for species with larger body size; this casts considerable doubt on the recent proposal that mtDNA is not affected by population size due to the prevalence of Hill–Robertson effects in the non-recombining mtDNA genome (Bazin et al., 2006).

Mutation rate differences between mammals and birds

It is possible that mutation rates in mammalian mtDNA are intrinsically higher than in birds. Since there are no direct estimates of these mutation rates in birds, we have to rely on comparisons of estimates of synonymous or non-coding sequence divergence per year from mammals and birds, assuming these to be neutral (Kimura, 1983). It is currently unclear whether birds or mammals have higher rates of mtDNA sequence divergence (Garcia-Moreno, 2004; Ho et al., 2005; Lynch et al., 2006). A 2% rate of sequence divergence per million years is commonly used for mitochondrial coding genes for both mammals and birds (Brown et al., 1979), but there are many factors that could violate this general rule. Overall, there seems to be no evidence for differences that are large enough to overcome the differences in generation time between otherwise similar birds and mammals, given the fact that birds tend to have much longer generation times than mammals of comparable sizes (Finch, 1990). Other things being equal, this would tend to cause higher levels of variability in birds compared with mammals.

There is also indirect evidence that mutation rate differences cannot explain the difference in synonymous site diversity between birds and mammals that we have described. This comes from the results on non-synonymous variability (Tables 1 and 3). These estimates are much more similar for birds and mammals than the corresponding synonymous site variability estimates. In addition, θ^A is nearly always greater than π̂A, consistent with the action of purifying selection against deleterious amino acid site mutations that has been extensively documented before (Table 1) (Nachman, 1998; Fry, 1999; Weinreich and Rand, 2001). With such selection, a larger Ne has a much smaller effect on π̂A than π̂S, since the former is more dependent on the ratio of mutation rate to selection coefficient than on Ne, whereas neutral or nearly neutral diversity is proportional to Ne (Loewe et al., 2006).

Table 3 Pairwise divergence and diversity estimates for birds

If a difference in mutation rate were involved in the mammal–bird difference, both non-synonymous and synonymous diversity would be affected proportionately. In fact, overall π̂A/π̂S is significantly higher in birds than mammals, and individual comparisons are always nominally higher (Table 1). This pattern is what is expected on the hypothesis of a reduction in the efficacy of selection in birds, due to Hill–Robertson effects that reduce Ne (Charlesworth and Charlesworth, 2000; Gordo and Charlesworth, 2001). Significantly higher NIs in birds (Table 3) (mean: 20.35±12.41) compared to mammals (mean: 3.26±0.57) (Table 4) (W: 205.0, P<0.0015) further support this conclusion. A high NI reflects less efficient selection against slightly deleterious mutations, which therefore become more abundant in the population but still have a low probability of fixation (Charlesworth, 1994; Rand and Kann, 1996). The average KA/KS ratio is non-significantly higher in mammals than in birds (U: 103, P=0.15). The theory of selection against slightly deleterious mutations predicts a much smaller effect of a reduction in Ne on π̂A/π̂S than on KA/KS (Charlesworth, 1994; Loewe et al., 2006), consistent with the lack of a significant difference in KA/KS between birds and mammals.

Table 4 Pairwise divergence and diversity estimates for mammals

Demography and sequence variation

The last possible confounding factor is the operation of demographic factors that cause lower Ne values in birds. Comparisons between nuclear gene diversities for birds and mammals should shed light on this possibility. Unfortunately, there are relatively few data on putatively neutral DNA sequence diversity in birds. The evidence suggests that birds have larger Ne values for nuclear genes than mammals. The ancestral Ne in a passerine species was recently estimated as several hundred thousand (Jennings and Edwards, 2005). The recent effective population size (Ne) in chicken is well above one million, using a mutation rate of 1.2 × 10–9 (based on divergence data between chicken and turkey (Axelsson et al., 2004), a time of divergence of 38 × 106 (Dimcheff et al., 2002; van Tuinen and Dyke, 2004), and a generation time of 1.5 years and average θ=7.7 × 10−3 (Sundstrom et al., 2004)). In contrast, Ne in higher primates has been estimated as 10 000–20 000, and in rodents as 160 000 (Eyre-Walker et al., 2002).

Population subdivision could, however, result in very different total Ne values for mtDNA and nuclear genes, as the Ne that controls diversity among genes sampled randomly from the species as a whole is inversely related to the rate of migration among local populations (Charlesworth et al., 2003). MtDNA is affected only by migration of females, and there is evidence that female mammals mostly disperse less than males, whereas the reverse is true for birds (Greenwood, 1980; Prugnolle and de Meeus, 2002). This difference in dispersal may contribute to the patterns we have reported. In order to examine this possibility, we estimated synonymous and non-synonymous diversities for sequences collected at the same locality. This provides estimates of genetic diversities within local populations, which are expected to be much less sensitive to migration rates than species-wide diversity levels (Charlesworth et al., 2003).

One hundred and thirty-four mammalian populations from 35 genera and 127 bird populations from 36 genera were analysed (Supplementary Material). Mean within-population π̂S for mammalian genera ranged from 0.0017 to 0.0521, with a mean of 0.0117±0.0018; π̂A ranged from 0 to 0.0050 with a mean of 0.0012±0.0002 (Table 5). Mean within-population π̂S in bird genera ranged between 0 and 0.0209, with a mean of 0.0044±0.0007; and π̂A ranged between 0 and 0.0022, with a mean of 0.0005±0.00009. Both synonymous and non-synonymous diversities were significantly lower in birds than in mammals (π̂S: W=955.0, P=0.0001, π̂A: W=1064.0, P=0.0078). We also grouped species by body mass; in most comparisons, π̂S was significantly lower in birds compared to mammals. π̂A was always lower in birds compared to mammals, although these differences were not statistically significant. We also compared passerines with bats, and again found lower π̂S in the passerines compared to the bats.

Table 5 Genetic diversities within populations for birds and mammals (±s.e.)

We compared population structures in the mammals and the birds by estimating FST (Hudson et al., 1992; Charlesworth et al., 2003) for mammalian and bird species (averaging over species within genera). The average mammalian FST was 0.563±0.067 and average bird FST was 0.384±0.068 (P=0.077). These figures suggest that mammals may have a more pronounced population structure than birds with respect to mtDNA, although the difference is not statistically significant. A lower Ne for local populations, as expected if there are more intense Hill–Robertson effects on bird than mammalian mtDNA, would lead to increased FST (Charlesworth et al., 2003), so that this apparent difference is consistent with our hypothesis.

Conclusions

Overall, the finding that birds have lower mtDNA variability than mammals is most easily explained by Hill–Robertson effects acting on the W chromosome. We recognize that this conclusion is only tentative, since this difference could be due to confounding factors for which we have not allowed. Further data on effective population sizes based on autosomal genes in birds and mammals, as well as direct measurements of W-chromosome variability in a range of species, will provide additional data for testing this explanation further. Similar comparisons between other taxonomic groups with male and female heterogamety would also help to explore this question further.