Introduction

Biodiversity of the honeybee Apis mellifera was first assessed using morphometrics. Based on an extensive sample collection and multivariate analyses, Ruttner et al. (1978) proposed a classification which was long considered as definitive. These authors hypothesized that north-eastern Africa and the Near East would be the centre of origin of Apis mellifera. They proposed that the species invaded Africa and Europe in three distinct branches, a South and Central African branch (A), a North African and West European branch (M) and a North Mediterranean branch (C). This classification was further refined by the addition of a fourth evolutionary branch, called O, which included the Near and Middle Eastern subspecies (Ruttner, 1988). Data on mitochondrial DNA have confirmed the presence of three lineages A, M, C in Africa, western Europe and south-eastern Europe (Smith & Brown, 1988; Garnery et al., 1992, 1993). The existence of a fourth mitochondrial lineage (O) in the Middle East has recently been confirmed (Arias & Sheppard, 1996; Franck et al., 2000b; Palmer et al., 2000). The same general structure of the species also emerged from microsatellite surveys (Estoup et al., 1995; Franck et al., 2000b). The main discrepancy provided by molecular markers was a clear genetic disruption between the branches M and A in the Iberian Peninsula (Smith et al., 1991; Garnery et al., 1995; Franck et al., 1998). Consequently, honeybees from northern Africa and from western Europe are considered to have followed separate evolutionary histories.

The introduction of the African subspecies A. m. scutellata in Brazil (Kerr, 1957), its subsequent hybridization with previously imported subspecies (Michener, 1975), and the rapid spread of this new hybrid strain (Africanized honeybee) throughout the Neotropics of America (Kerr, 1992) demonstrates how humans can greatly modify the genetic architecture of honeybee populations. Although most honeybee races present in Americas before the Africanization were native to Europe (Smith & Brown, 1988; Hall & Muralidharan, 1989; Hall & Smith, 1991; Sheppard et al., 1991; McMichael & Hall, 1996), there are some clues that African honeybees from other subspecies than scutellata had been introduced into different American regions (Schiff & Sheppard, 1993; Sheppard et al., 1999). It appears necessary to characterize the genetic diversity of honeybee races in their native range (Africa, Europe, and the Near-East). Only with this information will it be possible to accurately identify intrusion of unwanted genotypes and to develop coherent policies for conservation of local honeybee races. The present article focuses on African honeybees.

According to a recent review of the intraspecific nomenclature of Apis mellifera (Engel, 1999), 10 valid subspecies are recognized in Africa: A. m. intermissa (Maa, 1953) and A. m. sahariensis (Baldensperger, 1932) in Maghreb, A. m. lamarckii (Cockerell, 1906) and A. m. jemenitica (Ruttner, 1976) in north-eastern Africa, A. m. monticola and A. m. litorea (Smith, 1961) in south-eastern Africa, A. m. unicolor (Latreille, 1804) in Madagascar, A. m. scutellata (Lepeletier, 1836) and A. m. capensis (Eschscholtz, 1822) in southern Africa, and A. m. adansonii (Latreille, 1804) in western Africa. Hepburn & Radloff (1998) showed that these 10 subspecies have clearly separated morpho-clusters and they precisely delineated their geographical distribution. They also indicated geographical zones of high morphological variance within and between colonies, thus identifying hybrid zones among subspecies. Most African bees currently analysed belong to the A mitochondrial lineage (Smith, 1991; Garnery et al., 1992, 1993; Arias & Sheppard, 1996; De la Rùa et al., 1998). Only two colonies from Egypt have been recognized as belonging to lineage O (Arias & Sheppard, 1996; Franck et al., 2000b). Microsatellite loci are extremely polymorphic in African populations compared to European honeybee populations and this has been interpreted as a consequence of larger effective population sizes in Africa (Estoup et al., 1995; McMichael & Hall, 1996; Franck et al., 1998). African populations would have been less influenced by quaternary ice episodes which are considered to be the main cause of honeybee subspecies differentiation in Europe (Ruttner, 1988). However, the question of a common evolutionary origin of the honeybees from Africa remains unanswered.

In this paper, several new African honeybee populations are investigated. In light of these new data, the genetic variability of the honeybees from the African continent is reviewed by combining analyses of mtDNA and microsatellite polymorphisms.

Materials and methods

Sampling and DNA extraction

The origin of the samples is reported in Fig. 1. All African subspecies represented by a total of 738 honeybee colonies (254 colonies newly investigated) were sampled for use in this study. These colonies come from 64 locations in 21 African countries. In most cases the colonies have been morphometrically identified and several locations have been sampled within the endemic range of each subspecies (see Hepburn & Radloff [1998] for a review of original morphological data). A sample of 622 additional colonies from diverse locations in Europe, the Near East and America was used for comparison. Honeybee samples from each colony were brought back to the laboratory in vials containing 95% ethanol. The DNA was extracted from one leg with a chelex-based protocol (Estoup et al., 1996).

Figure 1
figure 1

Distributions of the COI–COII honeybee mitotypes in 21 African countries and nine other countries used as reference. Pie charts indicate the frequencies of the five mitochondrial lineages M, C, O, Y and A. In the A lineage, the three groups of mitotypes – AI, AII and AIII – are distinguished (see Fig. 4 for the details of mitotypes included within each lineage or group). Values under country names are the numbers of locations (right) and colonies (left) analysed per country (see http://www.ensam.inra.fr/URLB for a complete description of mitotypes observed at each location). The stars refer to the countries in which populations have been analysed using microsatellite markers. The right up corner map refers to the natural distribution of Apis mellifera subspecies studied.

Mitochondrial DNA

The mtDNA region including the tRNAleu gene, the COI–COII intergenic region and the 5′ end of the COII subunit gene was PCR-amplified according to a protocol detailed elsewhere (Garnery et al., 1993). A fraction of the PCR product was run on a 1% agarose gel for total size determination and the remaining was restricted with DraI prior to electrophoresis on 7.5% polyacrylamide gel. The COI–COII mitotypes were determined for 404 new colonies. The newly found mitotypes were sequenced for further characterization as in Franck et al. (2000a). Additional data from De la Rùa et al. (1998), Garnery et al. (1993, 1995, 1998, Franck et al. (1998, 2000a, b) and Moritz et al. (1994) were included in the present study.

Microsatellite loci

Eight African populations for which a sufficient number of nonrelated honeybee individuals were available were scored at six microsatellite loci (A113, A43, A28, A24, A88 and B124). These came from Morocco (Al-Hoceima and Kenitra, A. m. intermissa; Tiznit, A. m. sahariensis), Guinea (Nimba, A. m. adansonii), Malawi (Chelinda, A. m. monticola), and South Africa (Johannesburg and Pretoria, A. m. scutellata; Cape Town, A. m. capensis). They were compared to eight additional populations from France (Valenciennes, A. m. mellifera, branch M), Sweden (Umeo, A. m. mellifera, branch M), Spain (Sevilla, A. m. iberiensis, branch M), Portugal (Porto, A. m. iberiensis, branch M), Italy (Forli, A. m. ligustica, branch C), Greece (Chalkidiki, A. m. macedonica, branch C), Sicily (Favara, A. m. siciliana, branch C), and Lebanon (El-Hermel, A. m. syriaca, branch O). The protocol followed and the original population data are reported in Estoup et al. (1995) and in Franck et al. (1998, 2000b).

Statistical and phylogenetic analysis

Unbiased estimates of gene diversity for microsatellite loci were calculated according to Nei (1978). Exact tests for genetic structure were computed using the GENEPOP package version 3.1 (Raymond & Rousset, 1995). The genetic differentiation between populations was computed using unbiased estimates of FST values provided by GENEPOP and the (dμ)2 microsatellite distance (Goldstein et al., 1995).

A maximum parsimony tree of COI–COII mitotypes was constructed using the PHYLIP package version 3.5c (Felsenstein, 1993). Presence/absence of base pair substitutions and insertions/deletions along the COI–COII intergenic sequence were coded as 1/0. When a component was missing, the corresponding characters were coded as missing data (e.g. insertions/deletions and substitutions within the P sequence in lineage C). A neighbour-joining tree from population samples was calculated from microsatellite data using the chord distance of Cavalli-Sforza & Edwards (1967). Bootstrap values were computed over 2000 replications (Hedges, 1992) re-sampling individuals within population.

Results

Mitochondrial DNA

DraI RFLP of the COI–COII intergenic region provided a total of 42 different mitotypes among the 1359 colonies assayed (Fig. 2). Five mitotypes (A4′, A25, A26, A27, Y1, Y2) were not reported in previous analyses (Garnery et al., 1993, 1995, 1998; De la Rùa et al., 1998; Franck et al., 1998, 2000a, b; Palmer et al., 2000). Their restriction maps, the length of their restriction fragments, and their P sequences are given in Figs. 2 and 3.

Figure 2
figure 2

Restriction maps (left) and restriction fragment sizes (right) of the 42 COI–COII mitotypes. The maps are deduced from DraI restriction pattern and sequences of the COI–COII intergenic mitochondrial region. Restriction sites are numbered from 1 to 10. Deletions and insertions are numbered preceded by characters d and i, respectively (see also Fig. 3). Exponent number refers to the number of equal size fragments.

Figure 3
figure 3

Sequences of the P region of COI–COII mitotypes indicating substitution sites (small characters), DraI restriction sites (bold characters) and insertion/deletion (dashes). Four diagnostic sequences are defined: the supposed ancestral P0 sequence without deletion characterizing most of mitotypes of lineages A and O; the sequence P with the deletion d characterizing mitotypes of lineage M; the sequence P1 with the deletion d1 characterizing mitotypes of lineage A from the Atlantic coast; the sequence P2 with the deletion d2 characterizing mitotypes of lineage Y. Mitotypes C have no sequence P.

A total of 50 characters provided by COI–COII restriction patterns were used to assess the phylogeny of COI–COII mitotypes (Fig. 4). The samples from Ethiopia were characterized by mitotypes which clearly belong to a fifth lineage that we have called Y. All other mitotypes are clearly assigned to their previously described lineages A, M, C, or O (Franck et al., 2000b). The Y lineage has diverged from other lineages by around 2% (Table 1). Within lineage A, most of the mitotypes (group AI) are not differentiated in the phylogenetic analysis (Fig. 4). Only two sublineages are identified. The first one is characterized by mitotypes displaying only restriction site 1 (group AII). The second sublineage is characterized by mitotypes displaying the P1 sequence (group AIII).

Figure 4
figure 4

Consensus tree (level 95%) of the COI–COII mitotypes over the equally parsimonious trees. The trees were established for 50 characters including restrictions sites, insertions and deletions (see Fig. 2). A, M, C, O and Y refer to the five major mitochondrial lineages. AI, AII and AIII refer to the three different groups of mitotypes within the lineage A.

Table 1 Range of divergence percentage within and between lineages using 40 complete DNA sequences of the COI–COII intergenic region (sequence data are fom Cornuet et al., 1991; Garnery et al., 1992; Franck et al.,2000a, b; and this study)

The distribution of COI–COII mitotypes per country is given in Fig. 1. African colonies display mainly mitotypes A1 and A4 (group AI). Both mitotypes are also observed in high proportion in the Africanized bees from Mexico. In the populations from the Indian Ocean islands (A. m. unicolor), the proportion of mitotypes A1 is almost 100%. In continental African populations, the proportion of mitotypes A1 decreases progressively from Guinea toward south-eastern Africa and is replaced by mitotypes A4 in A. monticola, A. m. scutellata and A. m. capensis. From Guinea toward northern Africa, A1 is replaced by mitotypes A8, A9 and A10 (group AII) in A. m. sahariensis and A. m. intermissa. Most of the mitotypes from northern Africa are also observed in A. m. siciliana and A. m. iberiensis populations from southern Europe. Nevertheless, these populations display a high proportion of mitotypes A which are extremely rare in continental Africa. Mitotypes A2 and A3 (group AI) are principally observed in Spain and Sicily, and mitotypes of the AIII group are the most frequent within A. m. iberiensis populations from Portugal and the Canary Islands. Note that the populations from the Canary Islands also contain mitotypes C1 due to recent introduction of A. m. ligustica queens as in A. m. mellifera populations (Fig. 1). In north-eastern Africa, mitotypes belonging to three different lineages are identified (Fig. 1). Mitotypes A27 are observed only in A. m. litorea and they are highly divergent from other A mitotypes (Fig. 2). Mitotypes O and Y are observed only in A. m. lamarckii and A. m. jemenitica, respectively (Fig. 1).

Microsatellite loci

Gene diversity estimates per population and microsatellite locus range from 0.671 (A24, Kenitra) to 0.922 (A88, Chelinda). Mean gene diversities per population are given in Table 2. The samples from Africa display gene diversity higher than those from Europe and Middle East (Mann–Whitney’s U-test, P < 10 −6). Among the African samples, the Moroccan populations show the lowest values (Mann–Whitney’s U-test, P=4 × 10−5).

Table 2 Mean gene diversities and standard deviations for six honeybee microsatellite loci

Fisher exact tests for genic differentiation indicate that all pairs of populations are significantly differentiated (Pmultilocus < 10−5). Among African populations FST values range from 0.0052 (Chelinda/Nimba) to 0.1239 (Kenitra/Cape Town) and (dμ)2 values range from 1.4535 (Pretoria/Nimba) to 12.1512 (Tiznit/Cap Town) (Table 3).

Table 3 Pairwise multilocus unbiaised estimate of FST (below diagonal) and (dμ)2 genetic distance (above diagonal), in honeybee populations

In the neighbour-joining tree (Fig. 5) the populations are separated into four groups {Al-Hoceima, Kenitra, Tiznit, Cape Town, Johannesburg, Pretoria, Nimba and Chelinda}, {Seville, Porto, Valenciennes and Umea}, {Favara, Forli and Chalkidiki}, {El-Hermel} corresponding to the four branches A, M, C and O of Ruttner’s classification (Ruttner, 1988). Within branch A, two sub-branches are identified. North-western African populations (Al-Hoceima, Kenitra and Tiznit) representative of the subspecies A. m. intermissa and A. m. sahariensis are clustered in the first sub-branch. The second sub-branch is composed of A. m. adansonii (Nimba) and A. m. monticola (Chelinda) populations in Equatorial Africa and A. m. capensis (Cape Town) and A. m. scutellata (Johannesburg, Pretoria) populations in southern Africa. Although A. m. iberiensis (Porto, Seville) and A. m. siciliana (Favara) populations display numerous A mitotypes, according to their morphological classification they cluster within branches M and C, respectively (Ruttner, 1988).

Figure 5
figure 5

Neighbour-joining tree based on Cavalli-Sforza and Edwards’ chord distance. Bootstrap values (2000 replications by resampling individuals) are indicated in percentages.

Discussion

The results can be summarized as follows. (1) Honeybees from north-eastern Africa contain three highly divergent mitochondrial lineages A, O, and Y, the third being newly characterized. (2) In the other parts of Africa, honeybees carry only mitotypes of lineage A. Nevertheless, honeybees from north-western Africa and from tropical and southern Africa highly diverge when mitochondrial and microsatellite data are considered. (3) southern European and American populations also display several A mitotypes. In Spain, Portugal and Sicily they have several possible origins, presumably resulting from successive ‘Africanizations’. In Mexico, the mitotypes A observed correspond to those displayed by A. m. scutellata populations from South Africa. (4) Microsatellite data indicate a high level of polymorphism in African populations compared to European ones.

The coexistence of three mitochondrial lineages in north-eastern Africa provides support for Ruttner et al.’s (1978) hypothesis that this area is the probable centre of origin for Apis mellifera. Note, however, that the Horn of Africa and the Rift Valley are the main channels of colonization from Asia to Africa. The presence of the mitochondrial lineage O in Egypt and Somalia and Y in Ethiopia may also result from successive honeybee invasions of Africa from the Middle East.

The quaternary climatic changes including desertification and vegetation shifts are considered quite important in explaining faunal distribution within the tropics (Potts & Behrensmeyer, 1992). They were probably responsible for honeybee subspecies diversification in Africa. The mitochondrial and microsatellite divergences between honeybee subspecies from northern and southern sides of the Sahara (mainly A. m. intermissa and A. m. adansonii) may have occurred during the late Pleistocene (around 15 000 years BP) when the Sahara extended largely southwards into the present Sahelian zone and the extreme north-west of Africa displayed favourable moist conditions (Mediterranean-like vegetation) for honeybees (Hooghiemstra et al., 1992; Lioubimsteva, 1995) (Figs. 1, and 5). In the Middle Holocene (around 8000 years BP), African climates became moister and the Sahara desert almost completely disappeared (Gasse et al., 1990; Hooghiemstra et al., 1992). This change has probably facilitated gene flow between honeybees from the Maghreb and the Sahel. This hypothesis is further strengthened by contact zones along both sides of the Atlas range evidenced by mitochondrial DNA data (Garnery et al., 1995). Another putative refuge for honeybees was the woodland and coastal forest along the borders of the Indian Ocean which persisted during the last arid period (around 15 000 years BP) when eastern Africa was largely characterized by open vegetation and the disappearance of forests at the top of eastern African mountains (Hamilton, 1982; Lovett, 1993a). This may explain the mitochondrial divergence observed between A. m. scutellata and A. m. litorea (Fig. 2) which was also recently reported by Meixner et al. (2000). The montane forest belt from the eastern African arc is rich in endemic species of plants (Lovett, 1993b). However the ecologically isolated subspecies A. m. monticola cannot be differentiated from A. m. scutellata using mitochondrial DNA markers, although some differences can be detected using isoenzyme analyses (Meixner et al., 1994, 2000). This may reflect recurrent introgression between both subspecies in the high tableland from northern Malawi studied here (N. Koeninger, personal communication). The honeybee subspecies from tropical and southern Africa (A. m. adansonii, A. m. monticola, A. m. scutellata, A. m. capensis and A. m. unicolor) are only slightly differentiated at the molecular level. The main difference among these five African subspecies arises from the variation in the proportion of mitotypes A1 and A4. It is noteworthy that both mitotypes only differ in the number of Q sequences (Fig. 2). Thus these differences are probably not evolutionarily significant across a Quaternary time scale. Consequently, the absence of endemic mitotypes and the absence of mitochondrial polymorphism in the A. m. unicolor populations could point to a recent colonization of the Indian Ocean islands by an extremely reduced number of colonies. Compared to A. m. unicolor, the ‘Africanized honeybee’ from Mexico displays both African mitotypes A1 and A4 whereas around 50 A. m. scutellata colonies have been introduced in Brazil in 1956.

The main explanation for the low molecular differentiation among African subspecies is probably a result of their highly migratory behaviour (absconding, swarming), specifically of A. m. adansonii and A. m. scutellata (Fletcher, 1978; Hepburn & Radloff, 1998). Direct genetic evidence of such behaviour is the high level of polymorphism at nuclear markers observed in all African populations assayed thus far (Table 2; Hall, 1992, 1998; Estoup et al., 1995; McMichael & Hall, 1996; Franck et al., 1998). This is also an explanation for the diffusion of mitotypes A outside the African continent into the Americas, the Iberian Peninsula and the Mediterranean Islands (Fig. 1 Hall & Muralidharan, 1989; Smith et al., 1989, 1991; Hall & Smith, 1991; Garnery et al., 1993, 1995; Sheppard et al., 1997; Sinacori et al., 1998). Interestingly, the diffusion of the African mitochondrial genome into southern Europe and the Americas does not necessarily correspond to the ‘Africanization’ of the nuclear genome (Lobo & Krieger, 1992, 2000a; Franck et al., 1998). Conversely, whereas European subspecies such as A. m. mellifera and A. m. ligustica have been repeatedly introduced into northern Africa (Second, 1975), not a single M or C mitotype has been found in north-continental Africa (but see Fig. 1 and De la Rùa et al., 1998 for the Canary Islands). Nevertheless, some evidence, including the sequences of microsatellite alleles (Franck, 1999), suggests nuclear introgression of Moroccan honeybees by some European alleles.

Consequently, mitochondrial data alone are probably insufficient to infer taxonomic and genetic status of honeybee colonies. Extending microsatellite analysis to honeybee subspecies from eastern Africa and the Middle East will be useful in the future for understanding the phylogeography of Apis mellifera and resolving relationships among all the African subspecies.