Introduction

The grey wolf, Canis lupus, once had a distribution throughout much of the northern hemisphere. However, hunting and eradication programmes in the late nineteenth and throughout the twentieth centuries resulted in its extirpation from most of its original range. In the U.S.A., the remaining wolves have been listed as endangered and the Mexican grey wolf subspecies, C. l. baileyi, was listed as endangered in 1976. Mexican wolves are thought to have been extirpated from the U.S.A., and greatly reduced in Mexico, by 1970. Because there have been no confirmed sightings in over 15 years throughout their ancestral range, they are thought to be extinct in the wild.

The extant Mexican wolves are in captivity, except for approximately 20 wolves that have been reintroduced into east central Arizona over the past few years. A survey of 20 microsatellite loci (Garcia-Moreno et al., 1996; Hedrick et al., 1997) was used as a basis to combine the three captive Mexican wolf lineages — McBride (formerly known as Certified), Ghost Ranch, and Aragon. The combined captive population is now descended from seven founders — three from the McBride, two from Ghost Ranch, and two from Aragon lineages. All of the released wolves to date descend entirely from three founders of the McBride lineage.

The ancestral range of the Mexican wolf subspecies in northern Mexico and south-western U.S.A. is isolated geographically from the other extant subspecies of the grey wolf. Mexican wolves are one of the smaller grey wolf subspecies in North America, but there is great range in size in both Mexican wolves and other subspecies. However, investigations using microsatellite variation clearly differentiated between Mexican and northern grey wolves (Garcia-Moreno et al., 1996; Hedrick et al., 1997).

The major histocompatibility complex (MHC) is one of the most important genetic systems for infectious disease resistance in vertebrates (Hill, 1998; Hedrick & Kim, 2000). The association of disease resistance and MHC variation has often been difficult to document for various reasons (Hedrick & Kim, 2000), but a number of human studies have documented its importance for resistance to malaria (Hill et al., 1991), hepatitis (Thurz et al., 1997), and AIDS (Carrington et al., 1999). In addition, O’Brien & Evermann (1988) suggested that low MHC variation might be related to higher susceptibility to infectious disease. For small populations and endangered species, it is recognized that there may be lowered levels of genetic variation; it also appears that mortality from pathogens and parasites may be a significant extinction threat (Lyles & Dobson, 1993; Laurenson et al., 1998; Murray et al., 1999).

Therefore, in the present contribution we characterize the genetic variation in a highly variable class II MHC gene in the three lineages of the Mexican wolf, and, for comparison, examine samples from Canadian grey wolves, red wolves, and coyotes. We also compare our data to the known dog alleles for this gene.

Materials and methods

Sample sources and DNA techniques

DNA samples were obtained from R. Wayne and J. Leonard, University of California-Los Angeles. The 36 Mexican wolf samples were from the McBride lineage (18 individuals), Ghost Ranch lineage (10), and Aragon lineage (8). In addition, 15 samples of Canadian grey wolves, C. lupus, 12 samples of red wolves, C. rufus, and 10 samples of coyotes, C. latrans, were obtained from the same source.

Primers for amplifying a 228 base pair (bp) fragment of MHC class II DRB 1 exon 2 used for single strand conformation polymorphism (SSCP) analysis and subsequent cloning were designated DM-1 (5′-aag tcc gag tgc tat ttc acc-3′) and DM-2 (5′-tcg ccg ctg cac cgt gaa gct-3′) and were designed based on the dog sequences for the homologous gene summarized by Kennedy et al. (1998). Amplification conditions consisted of 35 cycles of PCR in a 10 μL reaction containing 10 ng of genomic DNA, 0.5 μM of each primer, 3 mM MgCl2, 1× polymerase buffer and 1 U Taq polymerase (Promega, Madison, WI) with an annealing temperature of 65°C. PCR conditions for SSCP analysis were the same but included 1 μCi of 32P in a 5 μL reaction volume. Five μL of stop solution (95% formamide, 20 mM EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol) was added to each sample. Samples were denatured by heating at 95–100°C for 3 min, cooling in ice water for 3 min and then loading on a 6% polyacrylamide gel with 2.6% cross-linking. The SSCP gel was electrophoresed at 50 watts for 3 h 30 min at 4°C, transferred to 3 MM Whatman paper, dried and exposed to X-ray film overnight.

PCR amplification products made with the primers DM-1 and DM-2 were purified on an Ultrafree-MC centrifugal filter unit as recommended by the manufacturer (Millipore, Bedford, MA), and ligated into pUC18-Sma-T overhang vector (Bayou Biolabs, Harahan, LA). Positive colonies were identified by insertional inactivation and picked and boiled in 50 μL HPLC grade water. One μL of this solution was used as template for subsequent PCR. Subclones with the correct size insert were screened by SSCP using DM-1 and DM-2 and clones with band profiles identical to that individual’s genomic SSCP pattern were chosen for sequencing. Heterozygotes were identified with SSCP as the sum of the profiles for two allelic subclones. Sequencing templates were prepared by amplifying with the plasmid primers in a 100 μL reaction and then the products were purified using an Ultrafree-MC centrifugal filter unit. Subclones were sequenced on both strands on an Applied Biosystems 377 automated sequencer (division of Perkin-Elmer, Foster City, CA). Sequences were identical in all replicates and from both strands.

Data analysis

In order to align the sequences, the sequence editor ESEE 3.25 (Eric Cabot, 1998) was used. The MEGA program (Kumar et al., 1993) was used to construct the neighbour-joining tree using the genetic distance of Jukes & Cantor (1969) and to obtain bootstrap confidence intervals (500 replicates). MEGA was also used to calculate the relative rate of nonsynonymous and synonymous substitutions according to Nei & Gojobori (1986) and applying the correction of Jukes & Cantor (1969) for multiple hits. Expected heterozygosity was calculated after Nei (1987) with the small sample size correction. Amino-acid heterozygosity for individual sites was calculated by weighting the amino acids in given sequences by their population frequency as in Hedrick et al. (1991).

Results

For the Mexican wolves, all of the samples successfully amplified except for one sample from the McBride lineage. SSCP gels representing the class II DRB1 exon 2 resulted in band patterns consistent with two alleles in the McBride lineage, one allele in the Ghost Ranch lineage, and three alleles in the Aragon lineage (Table 1). The only overlap in alleles between lineages was the Ghost ranch allele Calu-3, which was also the most frequent allele in the Aragon lineage. The observed heterozygosities for the McBride and Aragon lineages were higher than that expected under Hardy–Weinberg proportions (Table 2). However, the observed heterozygosities in these relatively small samples were not significantly different statistically from that expected (or were they for the grey wolf, red wolf, and coyote samples discussed below).

Table 1 The observed frequencies of the 21 alleles at the DRB1 locus over the four taxa (three lineages of Mexican wolves) where N is the sample size. The abbreviations for the alleles, Calu, Caru, and Cala, indicate the taxa C. lupens, C. rufus, and C. latrans, respectively
Table 2 The observed and expected (using Hardy–Weinberg proportions) heterozygosities at the DRB1 locus for the different lineages and species

For the grey wolves, 13 out of 15 samples amplified, for the red wolves only 3 out of 12 samples amplified, and for the coyotes, all 10 samples amplified. The amount and quality of the red wolf DNA in these samples was not adequate for good amplification. Ten of the amplified samples of the grey wolf were from the Northwest Territories, Canada, and three from Alberta, Canada. The 10 coyote samples were all from Los Angeles County. The SSCP patterns were consistent with seven alleles in the grey wolves (one of which, allele Calu-5, was in the Mexican wolf Aragon lineage and one of which, allele Calu-11, was in red wolves where it is designated as Caru-1), three alleles in the red wolves (allele Caru-1 also found in grey wolves), and eight alleles in coyotes, none of which were found in the other taxa. The observed and expected heterozygosities for the grey wolves, red wolves, and coyotes are given in Table 2. Interestingly, for the three groups of captive animals which were polymorphic, the McBride and Aragon lineages of Mexican wolves and the red wolves, the observed heterozygosity is higher than expected while for the wild caught animals, grey wolves and coyotes, the observed heterozygosity is lower than expected. Although none of these values is statistically significant, a Wahlund effect caused by combining several populations may reduce observed heterozygosity in the wild caught samples and the avoidance of inbreeding may increase observed heterozygosity in the captive samples.

Figure 1 presents a neighbour-joining tree with the 21 DRB 1 sequences (there are actually 24 sequences in the figure because three are in more than one lineage or taxa). The sequences from any one taxa are generally widely dispersed in the tree. Furthermore, in almost all cases, a sequence from another taxa is most closely related to a given sequence. Table 3 presents the nucleotide and amino-acid sequence for the variable positions for the 21 alleles (the complete sequences have been deposited in GenBank, accessions AY009508–AY009530). The order of the sequences in Table 2 from top to bottom is the same as in Fig. 1 for ease of comparison.

Fig. 1
figure 1

A neighbour-joining tree giving the relationships for the 21 sequences found in this study. The triangles indicate the six different Mexican wolf sequences, the vertical lines indicate identical sequences found in different lineages of the Mexican wolf (M, McBride; G, Ghost Ranch; A, Aragon) or taxa, and the numbers indicate bootstrap significance values of 75 or greater.

Table 3 The nucleotide and amino acid sequence for the 21 variable positions for the 21 sequences found in Mexican, grey, and red wolves and coyotes

The five alleles from the Mexican wolf were in three different groups, allele Calu-1, alleles Calu-4 and Calu-5, and alleles Calu-2 and Calu-3, spread throughout the tree. Alleles Calu-4 and Calu-5 differed by only one amino acid (one nucleotide) at position 56. Both of these alleles were found in the Aragon lineage, but allele Calu-5 was also found in the grey wolf sample. Alleles Calu-2 and Calu-3 differed by three amino acids (six nucleotides) spread throughout the exon. Calu-2 was found only in the McBride lineage while Calu-3 was found in both the Ghost Ranch and Aragon lineages. The average number of amino acid differences between these groups was between 10.7 and 11.5 and the average number of nucleotide differences was between 16.8 and 17.3. For four positions, 26, 28, 30, and 37, there were three different amino acids present in the five Mexican wolf alleles.

The other most similar alleles, besides those discussed above, were Cala-1 and Cala-5, differing only by one amino acid (one nucleotide), both found in coyotes, and alleles Calu-1 and Calu-7, differing by one amino acid (one nucleotide) and found in Mexican and grey wolves, respectively. The most divergent alleles were Calu-1 and Calu-6, which differed by 13 amino acids (23 nucleotides), but there were many other pairs of alleles that diverged nearly as much.

Of the 69 amino acids, 19 (27.5%) were variable and of the 207 nucleotides sequenced, 34 (16.4%) were variable. At two amino acid positions, 28 and 37, there were five different amino acids present over the 21 sequences while there were four different amino acids observed at position 71. There were only two codon positions, positions 73 and 77 that had only silent variation. Interestingly, the silent difference from the consensus sequence at position 73 was found in only two sequences, Calu-11 (Caru-1) and Cala-6, which cluster together in the phylogenetic tree, and the silent difference at position 77 was found in five sequences, four of which, Calu-2, Calu-3, Calu-6, and Caru-3, cluster together.

The amino acid positions, documented using X-ray crystallography in humans to be important in the antigen-binding site (ABS) (Brown et al., 1993), are indicated here by an asterisk in Table 2. Of the 13 ABS positions, 11 (84.6%) are variable over the 21 alleles, while for the remaining 56 positions that are not thought to interact with the antigen-binding site, only 10 (17.8%) are polymorphic and two of these are variable only for synonymous variation.

The three lineages of Mexican wolves are being merged upon recommendation of the Genetics Committee of the Mexican Wolf recovery team (Hedrick et al., 1997). A tentative recommended goal was to combine the lineages in proportions 0.5 McBride, 0.25 Ghost Ranch, and 0.25 Aragon. Using these proportions and the frequencies of the sequences in the different lineages, we calculated the overall frequency of amino-acid heterozygosity for the examined 69 codons (Fig. 2). Here two amino-acid positions, 26 and 63, have heterozygosities over 0.6, and 10 positions in the ABS are variable, six of them with heterozygosities greater than 0.4.

Fig. 2
figure 2

The heterozygosity at individual amino-acid positions in the Mexican wolf where the McBride, Ghost Ranch, and Aragon lineages are combined in the proportions 0.5, 0.25, and 0.25, respectively. *indicates sites that are putative antigen binding site positions.

In addition, the estimated rate of nonsynonymous (dN) and synonymous (dS) substitutions, for ABS and non-ABS amino-acid positions are given in Table 4. For the antigen binding site positions. dN (0.374) is significantly greater than dS (0.152) and the ratio dN/dS is 2.46. For the nonantigen binding site positions, the difference is not significant and the ratio is only slightly greater than unity.

Table 4 The estimated rates of nonsynonymous and synonymous substitutions for antigen and nonantigen binding amino-acid positions and their ratio. N is the number of codons in each category and P is the probability that dN and dS are different

Discussion

The samples of Mexican wolves, founded from a total of only seven animals and with a history of inbreeding, have five different alleles for the DRB1 locus. The Ghost Ranch lineage, consistent with the low level of genetic variation found for microsatellite loci (Garcia-Moreno et al., 1996; Hedrick et al., 1997) and high inbreeding coefficient from pedigree analysis (Hedrick et al., 1997) was monomorphic for one allele, Calu-3. Calu-3 was the only allele shared between Mexican wolf lineages. These findings support the recommendation by Hedrick et al. (1997) that combining the three Mexican wolf lineages (adding the Ghost Ranch and Aragon lineages to the McBride lineage) would increase genetic variation in the captive population.

The samples from grey and red wolves, and coyotes had 16 additional alleles. Even the small sample of three red wolves that we were able to examine had three quite different alleles. It has been suggested that red wolves are a recent hybrid from grey wolves and coyotes (Wayne & Jenks, 1991; Roy et al., 1994, 1996) because all the mtDNA haplotypes and microsatellite alleles found in red wolves were also found in either grey wolves or coyotes. In our dataset of MHC sequences, one of the red wolf alleles was found in the grey wolves and none were found in the coyote sample. In other words, two of the alleles were found only in red wolves and not in the two putative ancestral taxa. However, the sample sizes of these three taxa were so small that alleles may have been missed by chance. In addition, the source of the grey wolves (Canada) and coyotes (Los Angeles) may be sufficiently removed geographically from the location of the putative hybridization, south-eastern United States or Texas, that the exact alleles may differ.

Examination of the known dog, C. famililaris, MHC DRB1 sequences in Genbank, showed that two alleles found in this study are identical over the 207 base pairs to dog alleles (Fig. 3). For Mexican wolves, Calu-1, found in the McBride lineage, is identical to dog DRB1*00601 (top of Fig. 3). This is one of the more common alleles in dogs and was found in 10 different breeds at an overall frequency of 0.22 by Kennedy et al. (1998). Allele Calu-11, found in both grey and red wolves (Caru-1), is identical to DRB1*00101 (middle of Fig. 3), another common dog allele that Kennedy et al. (1998) found in seven breeds with an overall frequency of 0.24. Four other alleles differed by only one amino acid (one nucleotide) from dog alleles. These are allele Calu-5, found in both the Aragon lineage and the grey wolf sample, and DRB1*02101, allele Calu-7 in grey wolves and DRB1*00601, allele Cala-8 in coyotes and DRB1*00901, and allele Cala-7 in coyotes and DRB1*02301. In addition, J. Angles (pers. comm.) allowed us to compare our sequences with three unpublished sequences that have been found in Alaskan grey wolves but have been given canid names under the dog major histocompatibility nomenclature (Kennedy et al., 1999). These are DRB1*03101, which differs by two base pairs from our coyote sequence Calu-9, DBR1*03501, identical to our sequence for allele Calu-5, found in both Mexican and grey wolves, and DRB1*03601, which differs by two base pairs from Cala-8, found in our coyote sample (see Fig. 3).

Fig. 3
figure 3

A neighbour-joining tree giving the relationships for the sequences from this study and the available dog sequences indicated only by numbers. The triangles indicate the six different Mexican wolf sequences, the circles the three different red wolf sequences, and the vertical lines indicate identical sequences found in different lineages of the Mexican wolf (M, McBride; G, Ghost Ranch; A, Aragon) or taxa.

The intermingling of MHC alleles from different species within a phylogenetic tree from related taxa has been found in a number of organisms (see Edwards & Hedrick, 1998 for discussion). Often the closest sequence for MHC genes is from another species and not from another allele within the same species. Klein (1987) termed this phenomenon ‘transpecies polymorphism’, and suggested that the variability from an ancestral species is retained even when the species splits into two or more descendant species. The mechanism thought to be responsible for the retention of ancestral variation, is a form of balancing selection that results in alleles in one species having closest ancestry to alleles in another. The higher estimate of nonsynonymous than synonymous substitution, observed in the amino-acid positions involved in antigen binding, is consistent with this type of selection.

From the fossil record, the estimated time of common ancestry of the two most divergent taxa examined here, coyotes and grey wolves, is approximately one million years ago (Wayne et al., 1991). Examining mtDNA sequence from the control region, Vila et al. (1997) found that all the coyote sequences clustered together and are clearly divergent from wolves and dogs. This contrasts with the pattern we found for MHC, in which all the coyote alleles were intermixed with wolf and dog alleles in the phylogenetic tree. In other words, one million years between wolves and coyotes does not appear to be sufficient time to have resulted in divergence into species-specific branches of the MHC tree.

The time of origin of dogs from wolves, based on the archeological record, is approximately 14 000 years before present (Olsen, 1985), but based on an estimate from mtDNA is more than 100 000 years before present (Vila et al., 1997). As mentioned above, although the red wolf is considered a separate species, it appears to be of more recent origin than the divergence of coyotes and wolves. In the phylogenetic tree in Fig. 1, all three of the major branches contain alleles for the grey, Mexican, and red wolves. The known sequences from dogs in Fig. 3 are spread throughout the phylogenetic tree and are also included in the three major branches. This is consistent with a broad origin from wolves, as suggested from the mtDNA data (Vila et al., 1997), retention of the ancestral wolf polymorphism by selection, and, furthermore, may reflect continued gene flow from wolves into dog populations until fairly recent times.