Introduction

Robertsonian (Rb) fusion chromosomes are often assumed to be underdominant because of meiotic nondisjunction in Rb heterozygotes (Gropp & Winking, 1981). Theoretical studies suggest that such polymorphisms can, nonetheless, be fixed in local populations by a variety of factors acting singly or in combination. The finding of chromosomal differences between many closely related species suggests that the fixation of chromosomal variants occurs frequently in evolution. In the case of strict underdominance (equal fitness of both homozygotes and reduced heterozygote fitness), an unstable equilibrium frequency exists below which the mutant is eliminated and above which it progresses to fixation (Wright, 1941). Any factor which causes the frequency of underdominants to fluctuate above the equilibrium can thus promote fixation. Genetic drift and extinction–recolonization events are two such factors (Hedrick, 1981; Lande, 1985). Under extinction–recolonization, the probability of fixation in semi-isolated demes is maximized at intermediate rates of gene flow (Michalakis & Olivieri, 1993). Two other relevant factors are inbreeding, which slows down the progress to fixation under genetic drift, and meiotic drive, which either lowers or eliminates the equilibrium frequency (Hedrick, 1981). Such theoretical considerations apply mainly to subdivided populations because in large, continuous populations fixation is likely only when the mutant homozygote has a substantial selective advantage (Barton & Rouhani, 1991). In this study genetic data were analysed to determine whether the usual structure of house mouse populations displaying Rb polymorphism is influenced by inbreeding and the combined effects of genetic drift and extinction–recolonization.

The western European form of the house mouse, Mus musculus domesticus Rutty, contains many chromosomal races possessing varying numbers of Robertsonian fusion (Rb) chromosomes (Boursot et al., 1993). Such races occur as localized patches (e.g. Hauffe & Searle, 1993) most of which have arisen independently (Nachman et al., 1994). In contrast, the eastern form, M. m. musculus, displays little Rb polymorphism. It has been suggested that population-level factors such as inbreeding and genetic drift cannot be responsible for such widespread Rb polymorphism in M. m. domesticus because population genetic structure is very similar in both forms (Nachman & Searle, 1995). In fact, the paucity of studies involving both forms of house mouse makes this conclusion premature.

This study concerns a well-characterized Rb race of commensal M. m. domesticus in central Belgium (Bauchau et al., 1990). The race is defined by the presence of two Rb chromosomes. Rb(4.12) occurs in a roughly circular area of 7900 km2, and is fixed in some central populations. Rb(5.10) has a much smaller distribution centred within that of Rb(4.12), and is segregating in all populations examined. This race may be expanding because it has no obvious geographical constraints, and because the clines of the different Rb chromosomes are not coincident.

Frequency data on microsatellite and Rb polymorphisms in samples from seven sites from Rb populations and from standard (2n=40) populations from surrounding areas were used to determine whether local populations were random mating or inbred, and to estimate levels of genetic subdivision between populations. Levels of genetic subdivision in central Belgium were compared to those found in the western and eastern European forms of house mice, M. m. domesticus and M. m. musculus, from Jutland (Dallas et al., 1995). Lastly, two aspects of population genetic structure which are indicators of small population sizes were studied: variance effective sizes in one population, and the extent of two-locus disequilibrium in all populations.

The relationship between genetic subdivision and geographical distance was studied using microsatellite data because genetic subdivision for neutral markers is either reduced or eliminated within areas of high gene flow. The form of this relationship was used to differentiate between two migration models for house mice. These models follow from the ecological and behavioural evidence which suggests that both passive (via human transport) and active dispersal occur (Boursot et al., 1993). If passive dispersal occurs sufficiently often throughout a given area, this situation would correspond to the island model in which migration rate does not depend on the distance between subpopulations. Alternatively, if most migration is active, or if passive migration occurs infrequently and over short distances, this situation would favour the stepping-stone model, in which migration occurs only between adjacent subpopulations. This is because mouse habitat is patchy rather than continuous, and because intuition suggests that their ability to travel actively long distances between patches is limited.

Microsatellite polymorphism may contain more signal, and more noise, about population subdivision than other nuclear markers, such as allozymes or restriction fragment length polymorphisms. This is because the sizes of existing microsatellite alleles contain information about prior sizes in the two-phase model proposed for microsatellite mutation (DiRienzo et al., 1994). In this study, subdivision was estimated using two statistics: RST, which incorporates such mutational memory (Slatkin, 1995), and θ (Weir & Cockerham, 1984), which, being based on the infinite alleles model of mutation, has no memory.

Materials and methods

Twelve samples of commensal M. m. domesticus, each consisting of 10–63 individuals, were collected at seven sites in central Belgium between November 1988 and May 1992. The sites were named after the nearest village; two sites nearest the same village were lettered, and successive samples from the same site, were numbered. Four sites were located in the area where the Rb(4.12) and Rb(5.10) chromosomes occur (La Hulpe, Chastre, Monstrieux and Corroy), and three sites were located in an area to the south-east containing only mice of standard karyotype (Spontin a, Spontin b, and Space; Fig. 1). One site, La Hulpe, was sampled six times over a 3-year period (samples LH1–6), and the other sites were sampled once. Mice were live-trapped inside buildings and killed in the laboratory for chromosomal and molecular analyses. The mean sampling period was 24 days (range 2–76). Distances between sites were calculated as straight-line measurements on 1:20,000 maps. Total cellular DNA was extracted from frozen tails as described previously (Dallas et al., 1995). Individual karyotypes were prepared from bone marrow or blood culture, and G-banding was performed according to Evans (1987). The presence of the Robertsonian fusions, Rb(4.12) in all the samples, and of Rb(5.10) in eight of the samples, was scored by light microscopy.

Fig. 1
figure 1

Locations of house mouse sample sites. Small black circles, sites where Rb(4.12) and Rb(5.10) occur; small empty circles, standard karyotype; large grey circles, major cities. The dotted line shows the approximate limit of the Rb(4.12) distribution.

Eight microsatellites with three, four or five base-pair motifs were employed; four (34, 105, 135 and 150) from Hearne et al. (1991), and four (D7Mit18, D11Mit16, D13Mit14, D15Mit16) from Dietrich et al. (1992). Information on motif sizes was provided by Research Genetics, Inc. Microsatellites were amplified by PCR, resolved by nondenaturing polyacrylamide gel electrophoresis (PAGE), and visualized by ethidium bromide staining. Allelic differences of one motif could be resolved, and alleles distinguished from heteroduplex patterns. Alleles of 34, 105, 135, 150 and D15Mit16 were numbered using as standards those identified in a previous study of house mice in Jutland (Dallas et al., 1995). Alleles of D7Mit18, D11Mit16 and D13Mit14 were identified using as standards alleles previously sized in mouse inbred strains (Dietrich et al., 1992), and alleles tentatively identified in the first samples analysed. For D11Mit16, the published allele sizes of inbred strains A/J and AKR/J are inverted as compared to those found (and checked) here. Alleles of D7Mit18 were numbered from the smallest: 1–4, 7–11; putative alleles 5 and 6 of D7Mit18 were not observed. Alleles of D11Mit16 and D13Mit14 were numbered consecutively from 1, the smallest.

Allele frequencies and unbiased heterozygosity averaged over microsatellite loci were calculated for each sample using BIOSYS-1. Hardy–Weinberg and two-locus disequilibria within samples were identified using GENEPOP1.2 (Raymond & Rousset, 1995). Wright's F-statistics were estimated according to Weir & Cockerham (1984) using a modified version of DIPLOID (Weir, 1990; J. Goudet, personal communication). This package estimates FIS as f, FIT as F, FST as θ, and calculates confidence intervals of f, F and θ by performing 1000 bootstrap resamples of the single-locus estimates.

Slatkin's RST was calculated for each microsatellite locus as Excoffier's ΦST using AMOVA 1.55 (L. Excoffier, personal communication), the two statistics being equivalent (Michalakis & Excoffier, 1996). In AMOVA, ΦST is estimated for one locus as:

where V(A) is the between-sample variance in allele size, and V(B) is the within-sample variance in allele size. Multilocus values of RST were calculated by averaging the single-locus values of V(A) and V(B), then calculating the above ratio as:

A GENSTAT5.3.1 program was used to generate 95 per cent confidence intervals for multilocus RST values, as follows. One thousand bootstrap resamples of single-locus pairs of values, V(A) and V(B), were performed, then RST was calculated for each bootstrap replicate. The RST values were ranked in ascending order, and the 25th-lowest and 24th-highest values defined the 95 per cent confidence interval. Associations between geographical distance and values of either θ or RST for pairs of samples were tested for significance by performing two-matrix Mantel tests using the R package (1000 permutations, Legendre & Vaudor, 1991).

The method of Waples (1989) was used to estimate effective population size (Ne) within a sample site. This method provides an estimate of variance Ne based on the variance in allele frequency over a few generations. The data consist of the allele frequencies in two samples taken at different times from the same site, and it is assumed that the only source of variance in allele frequency is genetic drift. It was assumed that sampling occurred before reproduction, and that house mice have two generations per year.

Results

Genotypic proportions of microsatellites and Rb chromosomes within samples were compared with Hardy–Weinberg expectations using Fisher's exact tests (data not shown). The high levels of within-sample polymorphism (heterozygosity of 0.44–0.70, average of eight microsatellite loci) made most markers informative. Heterozygote deficit was observed in two out of 91 tests for microsatellites, and in two out of 11 tests for the Rb chromosomes (for Rb(4.12) in the La Hulpe samples LH1 and LH2, see Bauchau et al., 1990) after sequential Bonferroni correction (α=0.05, K=10, independent tests, Rice, 1989). The virtual lack of heterozygote deficit for microsatellites suggests that null alleles are absent or at low frequency. Overall, mating in the study populations appears to occur mostly at random: both inbreeding and further subdivision are absent.

Levels of genetic subdivision over the study area were estimated using seven samples from different sites (microsatellite data only). Two statistics, θ and RST, were calculated as single-locus values and as multilocus averages. Table 1 shows the values of F, θ and f, which are estimators of Wright's FIS, FST and FIT, respectively (Weir & Cockerham, 1984), and RST (Slatkin, 1995). These samples give significant and virtually identical multilocus values of θ and RST. The single-locus values of RST are more variable than those of θ, so RST has wider confidence intervals generated by bootstrapping over loci. This is not surprising as RST contains information on both allele size and frequency, whereas θ contains only allele frequency information. Bootstrapping over different numbers of loci suggested that to arrive at equal confidence intervals for both statistics twice as many loci would be required for RST as for θ. Consistent with the Hardy–Weinberg test results, the mean f-value is not significantly different from zero. Table 2 shows that the θ value for the seven Belgian samples of M. m. domesticus is significantly higher than those for both four M. m. domesticus samples and seven M. m. musculus samples from either side of the hybrid zone in Eastern Jutland (Dallas et al., 1995). This result is not an artefact of confounding different sets of markers or different scales of geographical distance in the two studies, or of genetic differentiation between standard and Rb populations in Belgium.

Table 1 Spatial genetic subdivision in Belgian house mice
Table 2 Genetic subdivision in house mice from three European locations

The relationship between genetic subdivision and geographical distance was examined. Figure 2a and b shows pairwise multilocus values of θ and RST, respectively, plotted against straight-line distance between sites. Values corresponding to 40–40, 40–Rb, and Rb–Rb pairs of samples are given different symbols. Most pairwise θ and RST values are large, including those for the closest pair of sites, Spontin a and b, which are adjacent houses about 300 m apart. Mantel tests showed that no significant association exists between distance and either θ or RST over the geographical scale studied (P>0.2 or 0.36, respectively). This not surprising given the high scatter of pairwise values of both statistics in these samples: 0.18–0.39 for θ, and 0.07–0.49 for RST. As above, RST values show more variability than θ values. There is no obvious effect of karyotype on levels of subdivision: the θ and RST values for the 40–40, 40–Rb, and Rb–Rb pairs of samples appear to be scattered randomly over the whole ranges.

Fig. 2
figure 2

Values of (a) θ, and (b) RST, for pairs of samples of house mice plotted against distance between sample sites (microsatellite data only).

Temporal stability in genetic composition at one site in the Rb race, La Hulpe, was assessed over a period of 3 years. Values of θ between pairs of samples were calculated for both successive (LH1 vs. LH2, LH2 vs. LH3, etc.) and cumulative (LH1 vs. LH2, LH1 vs. LH3, etc.) time intervals. Correlations between length of time interval and θ were tested for significance using the Mantel test. RST was not used here because the timescale was considered too short for mutation to be relevant. All the successive (θ=0.03−0.12) and cumulative (θ=0.10−0.14) intervals gave significant values (Fig. 3). There was no correlation between θ and interval length in either case (data not shown). These results show that significant temporal changes in genetic composition occurred at La Hulpe in each of the five intervals between sampling, two of which were as short as 3.6 months. However, the degree of change did not increase over longer intervals. The temporal changes in microsatellite allele frequencies at La Hulpe were used to calculate corresponding values of variance Ne (Table 3), using the method of Waples (1989). All Ne values are much smaller than most of the samples obtained at this site, suggesting that the variance Ne of the La Hulpe population is a fraction of the likely census size.

Fig. 3
figure 3

Values of θ for pairs of La Hulpe samples of house mice plotted against time intervals between sampling. Successive and cumulative time intervals were analysed (microsatellite data only).

Table 3 Variance-effective population sizes of house mice at La Hulpe

Cases of within-sample two-locus disequilibrium among all possible pairs of microsatellite and Rb markers were identified using Fisher's exact tests followed by sequential Bonferroni correction (α=0.05, K=28, 36 or 45, nonindependent tests). Not all pairs of loci could be tested because Rb(4.12) and/or Rb(5.10) were monomorphic in some samples, and because Rb(5.10) had not been scored in the last four samples. Overall, 19 out of 402 significant cases were identified (Table 4). These were strongly clustered by locus pair and sample: seven cases involved the two pairs of linked loci 34–135 (15 cM) and 105–D7Mit18 (29 cM), and 11 cases involved the Corroy sample.

Table 4 Cases of two-locus disequilibrium within samples of house mice

Discussion

These results show that the genetic structure of house mice in central Belgium is strongly perturbed by population-level factors: between-population genetic subdivision was marked, genetic composition in one population was temporally unstable, and two-locus disequilibria between loosely or unlinked loci were detected. Two factors could be responsible for the subdivision: (i) genetic drift caused by local populations being isolated and having small effective sizes; and (ii) extinction–recolonization events, in which colonizers have a high probability of originating from the same source population (Whitlock & McCauley, 1990). Both factors could operate in combination in house mice. Although this makes their influence difficult to separate in the present study, it is important to note that both factors can promote the fixation of underdominant Rb chromosomes (see Introduction). The study populations do not appear to be influenced by either inbreeding or further subdivision. Similar results have been found in other M. m. domesticus populations from Europe, the Middle East and North Africa (Britton-Davidian, 1990; Dallas et al., 1995). Thus, inbreeding is unlikely to slow down the fixation of underdominant Rb chromosomes in M. m. domesticus populations (Hedrick, 1981).

The demonstration of random mating conflicts with the view that commensal house mice are subdivided into reproductive units or demes of a few males and several females (Boursot et al., 1993). Although demic structure was not studied directly here, there is nothing to suggest that the Belgian populations were atypical in this respect. On the other hand, absence of homozygote excess is consistent with inbreeding avoidance. There is evidence for inbreeding depression in house mice: disassortative mating for MHC alleles may be one mechanism for its avoidance (Potts et al., 1994). The results of the present study imply that a high proportion of matings and/or recruitment in commensal house mice occurs between demes. Such interdeme exchanges may extend far enough to promote high levels of localized gene flow in commensal house mice. Gene flow in commensal house mice does occur at a very local scale (Baker, 1981).

The average level of subdivision in M. m. domesticus in central Belgium was significantly higher than that found in M. m. domesticus and M. m. musculus from Jutland (Dallas et al., 1995). The former populations contain Rb polymorphism (Nance et al., 1990), whereas the latter form displays very little Rb polymorphism throughout its whole range (Zima et al., 1990). This comparison suggests that subdivision may contribute to maintaining Rb polymorphism in at least some populations of M. m. domesticus. As well as in the present study, the influence of genetic drift on an Rb population on the Island of Faray, Orkney, has been clearly demonstrated (Berry et al., 1992). The Jutland populations of M. m. musculus may be atypically more subdivided than elsewhere because they are seasonally commensal (Carlsen, 1993). Local population genetic structure, especially concerning dispersal among adjacent populations, needs to be further characterized in both forms of house mice.

The changes in the genetic composition of the La Hulpe population observed over the 3-year study period exemplify the kind of stochastic perturbations which may be common in the study populations. Although it is true that mice were removed in sampling this site, perhaps inflating the variance in allele frequencies, the La Hulpe population must have been small to begin with for sampling to have had such an effect. Moreover, the interval between the first and second samples (16 months, Ne=10.6), was sufficiently long for the population size to have recovered. Such temporal genetic instability was also found in three commensal populations of M. m. domesticus and M. m. musculus in Jutland. In that study the intervals between sampling were sufficiently long to allow population recovery (1–7 years, Dallas et al., 1995). Temporal fluctuations in genetic composition may, therefore, be characteristic of populations of commensal house mice. Demographic instability is a potential mechanism causing genetic drift in house mice: marked and released individuals typically have a monthly recapture rate lower than 50 per cent (Baker, 1981).

It is not surprising that the study populations display significant two-locus disequilibria more frequently than Hardy–Weinberg disequilibria. If a population perturbation creates both types of disequilibria simultaneously in a finite population, subsequent random mating will cause Hardy–Weinberg disequilibria to vanish rapidly, whereas two-locus disequilibria will be merely reduced by a factor 1−r−1/(2N) each generation. Genetic drift can create disequilibria when the product of population size (N) and recombination rate (r) is sufficiently small, that is when 4Nr<10 (Hartl & Clarke, 1989). The Ne estimates for La Hulpe (5–20) are consistent with finding the most closely linked pair of microsatellite loci (34 and 135, r=0.15) in significant disequilibrium in four La Hulpe samples (because 4Nr=3–12). A recent recolonization event may explain why 10 pairs of nonsyntenic loci (r=0.5) are in disequilibrium in the Corroy population.

Gene flow appears insufficient to eliminate subdivision at any geographical scale studied here. The two practically adjacent sites (Spontin a and b) are significantly differentiated. In order to learn how gene flow in house mice occurs, further ecological and genetic data on dispersal among such adjacent populations are needed. Gene flow can have a strong influence in maintaining levels of polymorphism in the overall population despite marked transient genetic subdivision created by drift (Nürnberger & Harrison, 1995). The lack of geographical association in the pattern of genetic subdivision tends to favour an island-type migration model over a stepping-stone type, at least over the range of distances studied here (8–60 km, ignoring the closest pair of populations). Thus, passive transport via the movement of agricultural materials may be the dominant means of dispersal over this scale of distance. The large spread of pairwise values observed for θ and RST is probably not compatible with a state of migration–drift equilibrium, so the numbers of migrants were not calculated. Nonetheless, some immigration of mice carrying novel alleles has probably influenced all the study populations because their observed levels of heterozygosity are much higher than those predicted at mutation–drift equilibrium by the variance Ne values for the La Hulpe population (Dallas et al., 1995).

The finding that Robertsonian races have originated independently in different areas of Europe (Nachman et al., 1994) suggests that biological traits of house mice as a whole, or the meiotic drive of individual Rb chromosomes, are responsible for such polymorphism, not any lineage-specific factors. Whatever factors are ultimately responsible for the origin of Rb races in house mice (Nachman & Searle, 1995), this study suggests that the Rb race in central Belgium is influenced by population-level factors theoretically capable of promoting the fixation of underdominant mutations.