Introduction

High allelic diversity at the major histocompatibility complex (MHC) loci has been reported in many vertebrate species and has been attributed to the vital role MHC proteins play in initiating immune response to parasites and pathogens (Gaudieri et al., 2000). This group of closely linked genes code for proteins that recognize peptides of self or non-self origin and trigger T-cell-mediated specific immune responses. The maintenance of elevated polymorphism at MHC loci is thought to be the consequences of balancing selection of varying nature (Garrigan and Hedrick, 2003). It can take the form of higher fitness of the heterozygote state resulting in homozygote deficit relative to the expectations from neutral theory (Hedrick and Thomson, 1983) and homozygote deficit relative to Hardy–Weinberg proportions (Markow et al., 1993; Black and Hedrick, 1997). There is also evidence that rare-allele-advantage combined with temporally and spatially varying selection, and a modification of this model in which a given allele can confer resistance in either homozygous or heterozygous state (Hedrick et al., 1987) could generate and maintain high levels of MHC diversity (Takahata and Nei, 1990; Slade and McCallum, 1992; Hedrick, 1994; Hill et al., 1994). Recent reviews of the topic list other explanations such as MHC-specific mate choice (Edwards and Hedrick, 1998; Jordan and Bruford, 1998; Garrigan and Hedrick, 2003).

Microsatellite loci located within the MHC region have received increasing attention as proxies for inferring the level of polymorphism at MHC loci (Ammer et al., 1992; Ellegren et al., 1993; Schwaiger et al., 1993, 1994). The analysis of variation at these loci has produced contrasting results. High polymorphism and heterozygote excess relative to neutral expectations were observed in cattle (van Haeringen et al., 1999) and in free living Soay sheep (Paterson, 1998), while no deviation from Hardy–Weinberg equilibrium was found in a sample of Dutch people (Moghaddam et al., 1998). Low levels of variation and no evidence of selection were reported in bighorn sheep (Boyce et al., 1997).

Although the evidence for balancing selection acting on MHC genes is considerable, either from its impact on diversity at the MHC loci themselves or on MHC-linked microsatellites, a common problem in the statistical tests to establish this has been the confounding effect of demographic events. A commonly used method is the Ewens–Watterson test (Ewens, 1972; Watterson, 1978), which compares observed allelic profiles with predictions of the neutral theory assuming the population being tested has maintained a constant size. The sheep breeds we studied will have undergone bottlenecks as a result of active breeding. This can have a confounding effect on the Ewens–Watterson test. Screening other neutral markers in addition to the MHC or the proxy-MHC loci can overcome this effect (Schlotterer et al., 2004). Non-MHC microsatellite loci make good candidates for this purpose. The impact of selection can be distinguished by comparing the results of the test for the neutral markers and for the MHC loci. Selection typically affects specific loci (Garrigan and Hedrick, 2003), while the effect of bottlenecks and other demographic events on diversity is likely to be genome wide.

We have investigated polymorphism at microsatellite loci linked to MHC genes (henceforth referred to as MHC microsatellites) in several breeds of domestic sheep and in one of their wild relatives, the European Mouflon, Ovis orientalis musimon (Corbet and Hill, 1980). This has been compared to variation at other microsatellite loci distributed across the sheep genome (non-MHC microsatellites). The latter data are from a much broader collaborative screening project between us and other researchers on the genetic diversity of European sheep breeds, whose detailed reports are forthcoming. The history of the breeds we studied encompasses varying durations and intensities of artificial selection. This has enabled us to address specific questions about the impact of managed breeding on the genetic diversity of MHC microsatellites relative to non-MHC microsatellites.

Materials and methods

The following sheep breeds were studied: Chios from Greece, Comisana from Italy, Friesland from The Netherlands, Merino from Spain, Sarda from Italy, Skudde from Germany and Soay from Great Britain. Several samples from the wild sheep known as Mouflon (O. o. musimon) in Sardinia and the Caroux–Espinouse Massif in southern France were also included. In Europe, Mouflons are to be found in the wild only in the islands of Sardinia and Corsica. However, descendants of these wild populations have been introduced to isolated mountain regions in the Alps and elsewhere for hunting purposes (Ryder, 1984). Friesland, Chios and Sarda are milk breeds. Comisana and Skudde are meat breeds, while Merino is a wool breed. Soay is a free-living primitive breed. The number of sheep that were genotyped is shown in Table 2. Between one and three flocks were sampled for each breed.

DNA was isolated from white blood cells or tissue by classical phenol/chloroform extraction methods (Sambrook et al., 1989) and from hair using a chelex protocol (Zhang and Hewitt, 1998). The genotypic makeup of the sheep at three microsatellites loci that are closely linked to MHC genes was determined. These are: (1) OLADRB (Schwaiger et al., 1993) located in the second intron of OvarDRB1, a class II MHC gene; (2) OLADRBps located in the intron between the fifth and the sixth exons of OLADRBps (Blattman and Beh, 1992); and (3) OMHC1 in the third intron of the OMHC1 class I locus (Groth and Wetherall, 1994). OLADRB is an interrupted and complex microsatellite with (gt)n(ga)m as the basic motif. OMHC1 has a (ca) motif. OLADRBps has an uninterrupted (ac) motif (Table 1).

Table 1 Locus information

Polymerase chain reaction amplifications were carried out in a 9700 PE thermal cycler. Amplification was in a 10 μl reaction volume containing 2.3 mM MgCl2, 250 μ M dNTPs, 1.8 × AmpliTaq Gold Buffer, 0.18 μ M of each primer and 2.5 U of Amplitaq Gold (Perkin Elmer). The thermal profile had an initial denaturation at 95°C for 5 min, followed by 30 cycles of denaturing at 94°C for 1 min, annealing at 60°C for 1 min (OMHC1 and OLADRB), or 55°C for 30 s (OLADRBps), and extension at 72°C for 1 min. The final stage comprised an extension reaction for 10 min at 72°C.

Summary data pertaining to an additional 12 non-MHC microsatellite loci have been included in this study for the purpose of comparing the allelic and genotypic distributions in MHC and non-MHC microsatellite loci. The loci are: OarFCB48, MAF214, ETH225, ILSTS005, ILSTS011, ILSTS28, SRCRSP1, SRCRSP3, SRCRSP5, SRCRSP7, SRCRSP8, SRCRSP9 as characterized in (Steffen et al., 1993; Arevalo et al., 1994; Bhebhe et al., 1994; Bishop et al., 1994; Crawford et al., 1995; Kemp et al., 1995; Brezinsky et al., 1993a, 1993b). Conditions used for the amplification and genotyping of the non-MHC microsatellites were similar to those for the MHC microsatellite loci. Allele size scoring for both sets of data was carried out in an ABI 377 automated sequencer, using the ABI GeneScan and Genotyper software. An Ovine DNA standard was included in every gel. Allele frequencies and number of alleles (Na) were calculated using Genepop ver.3.1d (Raymond and Rousset, 1995) as were the expected (He) and observed heterozygosities (Ho). Allelic diversity of the MHC and non-MHC microsatellite loci was compared using allelic richness, the number of alleles per locus. Measures of allelic richness were corrected for variation in sample size by using the rarefaction method as implemented in FSTAT (Goudet, 2000). This essentially compares the number of alleles per locus per breed estimated from n diploid genotypes as in El Mousadik and Petit (1996), and Hurlbert (1971), where n is the smallest number of sheep genotyped in any of the breeds. There were 14 OLDRB genotypes available for Skudde and hence all the allelic richness estimates obtained for both the MHC and non-MHC microsatellite loci are per 14 diploid individuals. Slatkin (1994) implementation of the Ewens–Watterson neutrality test (Ewens, 1972; Watterson, 1978) was performed using PyPop (Lancaster et al., 2003).

Ewens (1972) derived a formula for the expected number of neutral alleles at equilibrium between mutation and loss of new alleles due to drift. In a sample of 2N genes, this is given by: ∑i=12N−1θ/θ+i, where θ=4Neμ. Thus, if θ, the product of mutation rate and effective population size is small, the expected number of different alleles approaches 1 and if θ is very large, it approaches 2N (Hedrick, 2000). Watterson (1978) developed a test for neutrality based on this. The test calculates the observed Hardy–Weinberg homozygosity (F) in a sample of 2N genes containing k different alleles as ∑pi2, where pi is the frequency of allele i. This is then compared with the expected homozygosity, , in a similar sample of 2N genes also containing k alleles drawn per Ewens’ expectations assuming neutrality and drift–mutation equilibrium. A significant difference between the two is taken as evidence for selection. If the observed homozygosity is lower, indicating the frequencies of the k alleles in the sample is even more than expected under neutrality, balancing selection can be inferred. Deviation in the opposite direction may be attributed to purifying selection (Hedrick, 2000).

Results

Table 1 shows information on the microsatellites typed, reporting for each locus the motif, size range, number of alleles and allelic richness. Both the MHC and the non-MHC group contain simple and complex repeats, some perfect and others imperfect.

Allelic diversity

Summary statistics of allelic and genotypic distributions of the 3 MHC-embedded microsatellite loci and the 12 non-MHC microsatellite loci are shown in Tables 1 and 2. Focussing on the MHC microsatellites, the average number of alleles per breed, without correcting for differences in sample sizes, is 12.25±4.83 at OLADRB, 8.75±3.41 at OLADRBps and 8.88±2.85 at OMHC1. The total number of alleles across all breeds is 44 for OLADRB, 18 at OLADRBps and 14 at OMHC1; OLADRB is the most polymorphic locus, while OLADRBps and OMHC1 show comparable allelic diversities.

Table 2 Summary statistics of allelic and genotypic distributions of three MHC-embedded microsatellite loci and 12 non-MHC microsatellite loci in European sheep breeds

The average number of alleles per locus in each breed ranges between 15 in Comisana and 6 in Friesland. Sarda, Comisana, Merino and Chios have consistently higher number of alleles in any of the three loci compared to Mouflon, Skudde, Soay and Friesland. This pattern is also obvious in Figures 1 and 2, where the frequency distributions of MHC-microsatellite alleles in the eight sheep breeds are shown as histograms. Thus, based on allelic diversity at the MHC-microsatellite loci, the eight sheep breeds can be grouped into two. A high-diversity group comprising Sarda, Comisana, Merino and Chios with an average of 38.5±6.24 alleles per breed in the three loci combined and a less variable second group comprising Mouflon, Soay, Skudde and Friesland with 21.25±2.25 alleles (Table 2). This grouping also holds for each of the three loci individually. The average number of OLADRB alleles in the first group is 16±2.87 compared to 8.25±1.89 in the second group. Similarly, the comparisons are 11±3.65 versus 6.5±0.58 for OLADRBps and 11.25±1.5 versus 6.5±1.29 for OMHC1.

Figure 1
figure 1

Histograms showing allele frequencies at the OLADRB, OLADRBps and OMHC1 microsatellites in the high-diversity group (Sarda, Comisana, Merino and Chios). Allelic richness estimates (see text) are also shown.

Figure 2
figure 2

Histograms showing allele frequencies at the OLADRB, OLADRBps and OMHC1 microsatellites in the low-diversity group (Mouflon, Skudde, Soay, Friesland). Allelic richness estimates (see text) are also shown.

The above trend is confirmed when allelic richness, corrected for differences in sample sizes, is used as a measure of diversity (Table 1, Figures 1 and 2). Allelic richness of the eight breeds averaged across the three MHC-microsatellite loci is 8.07±1.34. The high- and low-diversity breeds had average allelic richness of 9.95 and 6.19, respectively. The difference between the two was found to be statistically significant (P=0.014) in a bootstrap test of 1000 replicates.

The average number of alleles per locus at the 12 non-MHC microsatellite loci is consistently lower than the number of alleles at any one of the three MHC microsatellites in all the breeds but Mouflon. In Mouflon, six non-MHC loci had number of alleles ranging between seven and nine compared to the seven and six alleles at OLADRBps and OMHC1, respectively. However, averaged across loci, this breed too is more diverse at the MHC-microsatellite loci than the non-MHC loci. Allelic richness of the non-MHC loci, at 5.03±0.86 alleles per locus, is also significantly lower than in the MHC-microsatellite loci (Table 1).

Heterozygosity

Table 2 shows the frequency of Ho at each of the three MHC-microsatellite loci for each breed. Averaged across breeds (0.60±0.22 at OLADRB, 0.73±0.16 at OLADRBps and 0.64±0.15 at OMHC1), it is consistently lower than the average expected heterozygosity (0.82±0.07 at OLADRB, 0.78±0.08 at OLADRBps, 0.76±0.08 at OMHC1). At individual loci in each breed too, this is predominantly the case. This is the expected trend for the sampling scheme of multiple, separately managed flocks per breed that we adopted to be able to assess breed-wide levels of genetic diversity in European sheep.

When the genotypic distributions at the MHC-microsatellite loci were tested for conformity to Hardy–Weinberg proportions, all the breeds except Soay showed significant deficits of heterozygotes in at least one of the three loci (P<0.05, Table 2). This is particularly evident at OLADRB. All the breeds except Soay and Skudde showed highly significant (P<0.01) heterozygote deficit at this locus. In the semiwild Soay, where our sample is effectively from a single flock, there is no deviation from the Hardy–Weinberg proportions at any of the three loci. The opposite is true in the case of the wild species Mouflon. All three loci showed highly significant heterozygote deficit. This sample included sheep belonging to populations from two localities in Sardinia and France. When tested separately, despite the smaller sample sizes, the deviations from Hardy–Weinberg proportions were found to be non significant at all three loci confirming that the homozygote excess in the pooled data was due to the Wahlund effect.

In the 12 non-MHC microsatellite loci, the average Ho across loci and breed was 0.54 (range 0.43–0.71); the range per breed was 0.43–0.63. Hardy–Weinberg tests showed significant heterozygote deficit in at least one out of the 12 non-MHC microsatellite loci in all the breeds with the exception of Soay. All 12 loci were at equilibrium in Soay. In Mouflon, nine non-MHC microsatellite loci revealed significant heterozygote deficit, confirming the Wahlund effect observed in the MHC microsatellites. The average expected and observed heterozygosities are higher in the MHC microsatellites as a group compared to the non-MHC loci primarily due to the allelic richness of the former. This is not the case for Mouflon. Several factors such as breeding practices, flock structure and breed history that may have affected the above allelic and genotypic diversity patterns are discussed below.

Table 3 shows results of the homozygosity tests of neutrality for the three MHC-microsatellite loci in the eight breeds. Till now, as the predominant trend in the tests has revealed few statistically significant differences between the observed Hardy–Weinberg homozygosities and , these results can be interpreted as evidence for neutrality of the MHC-microsatellite loci. However, there is a clear trend of lower Hardy–Weinberg homozygosity compared to neutral expectations. In 19 out of the 24 tests, observed F is lower than indicative of allelic distributions more even than neutral expectation. Among the four tests where the deficit in F compared to is statistically significant, two involve Soay. Thus, in the only breed where the Hardy–Weinberg tests confirmed random mating, there is significant evidence for selection at the OLADRB and OLADRBps loci but not at OMHC1.

Table 3 Observed Hardy–Weinberg homozygosity (F) was calculated as ∑pi2, where pi is the frequency of allele i

Among the five neutrality tests that revealed higher Hardy–Weinberg homozygosity compared to that expected under neutrality, three involve the wild species Mouflon and the semiwild Soay. The remaining two pertain to Chios (Table 3). As with the tests for deviations from Hardy–Weinberg expectations, the Mouflon results appear to be artefacts of sampling of two separate populations from Sardinia and the French Alps. When the 26 Sardinia Mouflons were tested separately, the observed homozygosity was lower than the expected. Similarly, when 15 Chios sheep from a single flock in Athalsa, Cyprus were tested separately, the observed homozygosity at all three MHC microsatellites was lower than the expected. It is thus apparent that, when the effect of multiple sampling is taken into account, all but one (Soay, OMHC1) of the 24 comparisons of F and are indicative of allelic distributions more uniform than expected for neutral loci. However, when tested for statistical significance, only four could be confirmed. Fisher (1932) combined probability test was used on the set of P-values in the 24 tests and produced a highly significant P-value (X2=94.35, d.f.=48, P<0.001).

Table 3 also summarizes the results of tests of neutrality for the 12 non-MHC microsatellite loci in the eight breeds, a total of 96. Observed F was lower in 63 tests, but none were significant. In contrast to the major trend at the MHC microsatellites, the only statistically significant deviations, of which there were three, were all in the direction of higher observed homozygosity. This and the fact that over a third of the tests revealed no deviation from neutrality lead to the conclusion that allelic frequencies at these loci do not show a similar pattern to the MHC microsatellite loci.

Discussion

MHC-embedded microsatellite loci in European sheep breeds were found to be highly polymorphic as is the case in a number of other domestic ungulates (Paterson, 1998; van Haeringen et al., 1999). Both the number of alleles per locus and the genotypic diversity as measured by the expected number of heterozygotes are consistently higher compared to other microsatellite loci dispersed throughout the sheep genome. We infer from this that the loci may have been affected by selection at the neighbouring Mhc genes where diversity is also high. Balancing selection has been proposed as the mechanism responsible for the maintenance of high levels of polymorphism at MHC loci. Other mechanisms, such as assortative mating, maternal–fetal interactions and selective abortions have also been proposed (for a review, see Hedrick, 1994; Jordan and Bruford, 1998; Garrigan and Hedrick, 2003). The hitchhiking effect of the tight linkage of the MHC microsatellites to these genes would lead to higher polymorphism too.

There is indication that repeat type and number of repeats have an influence on microsatellite variability (Ellegren, 2000; Dieringer and Schlotterer, 2003). To detect if this is the case in our two sets of data, we compared all available information on the typed loci. As most of the non-MHC microsatellites used in this study have been derived in cattle and goat mapping projects no information is available on their structure in sheep. This should be born in mind when considering any conclusions drawn from comparison of their structure to the MHC group. Table 1 shows no simple pattern when looking at repeat structure or, type and number of repeats. Both the MHC and non-MHC group contain repeat motifs equally complex, often interrupted and characterized by a high number of repeats. Interestingly, the MHC microsatellite with higher number of alleles also shows a wider range in allele size. This seems to suggest that alleles at this locus follow the pattern observed at MHC genes, where alleles have been retained over large evolutionary periods (Klein, 1987). In support of this view, the basic (gt)n(ga)m motif of OLADRB is preserved in the same location over a large evolutionary span, having been observed in cattle, sheep, goat, rain deer and humans (Schwaiger et al., 1994; Schwaiger and Epplen, 1995).

The eight sheep breeds we studied can be grouped into a high-diversity group and a low-diversity group based on allelic richness at the MHC microsatellite loci. Sarda (milk breed), Comisana (meat), Merino (wool) and Chios (milk) comprise the diverse group. Mouflon (wild), Soay (semiwild), Skudde (meat) and Friesland (milk) are in the group with fewer alleles. Thus, the grouping does not appear to be linked to the purpose for which the breeds were bred and managed. However, this pattern may be associated with the geographic origins of the breeds. The high-diversity group comprises primarily of southern breeds, while the low-diversity group are of northern European origins. This grouping reflects what has been shown by the main population study (Lawson-Handley et al., submitted), where breeds from southern Europe tend to exhibit more variation than northern ones. This has been related to a loss of diversity further away not only from the centre of domestication, but also from southern glacial refugia, from where populations may have recolonized northern areas, as observed in several wild species (Hewitt, 2000).

In all the breeds we studied, there is higher diversity in the MHC microsatellites compared to non-MHC microsatellites, above and beyond that associated with the geographic origin of the breeds. We also report that the observed heterozygosity at sheep MHC microsatellites is lower than that expected under Hardy–Weinberg equilibrium in the majority of the loci and breeds. Three factors will have contributed to this. Because our major aim was to assess the overall genetic diversity of European sheep breeds, we sampled one to three flocks for each breed. Clearly, the Wahlund effect would cause heterozygote deficit in the multiflock samples, where the flocks have been separated long enough. However, heterozygote deficit resulting from the Wahlund effect is expected to show across all loci for each affected breed. The two breeds Mouflon and Soay are worth particular attention in this regard. Both are essentially free ranging wild breeds and both conform to Hardy–Weinberg expectations when the Sardinia and France samples of Mouflon are treated separately. In the other breeds, there is variation between loci within breeds on whether there is heterozygote deficit or not. For example, both Skudde and Merino show higher observed heterozygosity at OLADRBps and lower heterozygosity at OLADRB and OMHC1. From this, we conclude that substructuring resulting from the multiflock-sampling scheme does not fully account for the heterozygote deficit in all the breeds.

The presence of null alleles is another potential cause of deficit in observed heterozygosity. In the breeds, where heterozygote deficit is apparent, most of the loci show it. This is true in the 12 non-MHC microsatellite loci too, regardless of the repeat structure. In addition, the wild Mouflons, when tested separately for the two sampling locations, and the semiwild Soay showed no heterozygote deficit. These observations lead to the conclusion that null alleles are unlikely to have contributed to the observed genotypic distribution.

A third contributing factor is the common use of a single or few rams in many flocks to sire lambs over several generations. This would result in heterozygote deficit relative to Hardy–Weinberg expectations, either because our samples were not from a single generation cohort or because the sires used tend to be more homozygous relative to a male selected at random from the parental generation. That the only two populations that live in wild conditions (semiwild in the case of the Soay from St Kilda Island) showed the least discrepancy between observed and expected heterozygosities argues that the low observed heterozygosity in the other breeds is most likely the consequence of such breeding practices. In striking contrast to its high observed and expected heterozigosity at MHC microsatellites, the Soay breed showed the lowest heterozygosity values in the population study (Lawson-Handley et al., submitted). The Soay population is known to have suffered population crashes (Bancroft et al., 1995; Charbonnel and Pemberton, 2005). This indicates that even populations undergoing bottlenecks are able to retain their MHC variation, as long as random mating is permitted.

The results of the neutrality tests showed that at all the MHC-microsatellite loci and for most of the breeds, the allele distributions are even more than would be expected if the genes were neutral and sampled from populations under drift-mutation equilibrium. However, when tested for statistical significance, this trend is confirmed only in the case of Soay. It is nonetheless noteworthy that the evidence for selection is apparent in Soay, a nonmanaged breed, which does not show Hardy–Weinberg discrepancies. This again seems to support the fact that the lack of statistically significant F-values at MHC microsatellites in the other breeds could be due to the confounding effect of managed breeding, which could reduce heterozygosity while maintaining within breed diversity. Thus, it is apparent that the predominant trend in the test of neutrality of these loci is indicative of allelic distributions more uniform than expected of neutral loci in drift-mutation equilibrium, although statistical significance is attained only in the case of Soay. The Ewens–Watterson test has relatively low statistical power, as is the case with other tests of conformity to population genetic models including the Hardy–Weinberg equilibrium (Hedrick, 2000). The consistent trend of lower observed homozygosity compared to the Ewens–Watterson expectation could reasonably be attributed to the effect of selection. Indeed, Worley et al. (2006) have shown, as in our case, that evidence for selection becomes more widely significant when a global statistics combining multilocus signal is considered as opposed to single loci. While the combining of probabilities following Fisher's method does not exactly address this issue, the fact that a highly significant P-value was obtained is indicative of lack of resolution in the tests statistics used for individual loci. It is also noteworthy that the Ewens–Watterson test of neutrality is a composite test of the combined effect of the observed number of alleles, whether higher or lower than expected under the infinite allele mutation model and, their frequency distribution, whether more even or skewed compared to expectations under neutrality. With respect to the former, Schlotterer et al. (2004) have demonstrated using simulations that, for a broad range of parameter values, the theoretical expectation of the number of alleles from the infinite allele model tends to be an overestimate. Despite this, the Ewen–Watterson tests consistently yielded deviations in the direction of excess compared to theoretical expectations. This further strengthens our inference that the observed number of alleles and their frequency distributions have been affected by selection.

Hitchhiking due to tight linkage with alleles at the MHC loci that are under balancing selection is proposed as the most likely explanation for this pattern of deviation from neutrality and at least for OLADRB there is some evidence for it (Schwaiger and Epplen, 1995). Previous studies have also identified balancing selection as the most likely form of selection to cause deviations from neutrality in the direction observed bearing in mind the role of the MHC loci to which these microsatellites are tightly linked (Paterson, 1998). It should also be noticed that OLADRBps, a microsatellite located in the fifth intron of a pseudogene of the OvarDRB1, shows less variability than OLADRB, located in the second intron of the functional OvarDRB1, again supporting the hitchhiking effect on the level of variation. In conclusion, while these data show evidence of balancing selection at the MHC microsatellite, they also highlight the confounding effect of managed breeding on this pattern and the need for more sophisticated tests to unravel its causes.