Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach

Martínez, Amparo; Manunza, Arianna; Delgado, Juan Vicente; Landi, Vincenzo; Adebambo, Ayotunde; Ismaila, Muritala; Capote, Juan; El Ouni, Mabrouk; Elbeltagy, Ahmed; Abushady, Asmaa M.; Galal, Salah; Ferrando, Ainhoa; Gómez, Mariano; Pons, Agueda; Badaoui, Bouabid; Jordana, Jordi; Vidal, Oriol; Amills, Marcel

doi:10.1038/srep38935

Download PDF

Article
Open access
Published: 14 December 2016

Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach

Amparo Martínez¹^na1,
Arianna Manunza²^na1,
Juan Vicente Delgado¹^na1,
Vincenzo Landi¹^na1,
Ayotunde Adebambo³^na1,
Muritala Ismaila³^na1,
Juan Capote⁴^na1,
Mabrouk El Ouni⁵^na1,
Ahmed Elbeltagy⁶^na1,
Asmaa M. Abushady⁷^na1,
Salah Galal⁸^na1,
Ainhoa Ferrando⁹^na1,
Mariano Gómez¹⁰^na1,
Agueda Pons¹¹^na1,
Bouabid Badaoui¹²^na1,
Jordi Jordana⁹^na1,
Oriol Vidal¹³^na1 &
…
Marcel Amills^2,9^na1

Scientific Reports volume 6, Article number: 38935 (2016) Cite this article

2533 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Human-driven migrations are one of the main processes shaping the genetic diversity and population structure of domestic species. However, their magnitude and direction have been rarely analysed in a statistical framework. We aimed to estimate the impact of migration on the population structure of Spanish and African goats. To achieve this goal, we analysed a dataset of 1,472 individuals typed with 23 microsatellites. Population structure of African and Spanish goats was moderate (mean F_ST = 0.07), with the exception of the Canarian and South African breeds that displayed a significant differentiation when compared to goats from North Africa and Nigeria. Measurement of gene flow with Migrate-n and IMa coalescent genealogy samplers supported the existence of a bidirectional gene flow between African and Spanish goats. Moreover, IMa estimates of the effective number of migrants were remarkably lower than those calculated with Migrate-n and classical approaches. Such discrepancies suggest that recent divergence, rather than extensive gene flow, is the main cause of the weak population structure observed in caprine breeds.

Genetic diversity and historical demography of underutilised goat breeds in North-Western Europe

Article Open access 25 November 2023

Genetic characterization and implications for conservation of the last autochthonous Mouflon population in Europe

Article Open access 19 July 2021

Dynamic range expansion leads to establishment of a new, genetically distinct wolf population in Central Europe

Article Open access 12 December 2019

Introduction

During three millenia, cattle, goats and sheep originally domesticated in the Fertile Crescent, followed human Neolithic migrations, probably reaching the Iberian Peninsula and the Maghreb by 7,700 YBP and 7,000 YBP¹, respectively. Cyprus is thought to have been colonized by Northern Levant seafarers, who brought the four major livestock species (cattle, sheep, goats and pigs), approximately 9,000–10,500 YBP¹. In Mediterranean Europe, rather than a gradual transition from the Mesolithic to the Neolithic lifestyles, evidence suggests a sharp demographic decrease of Late Mesolitic cultures and the settlement of Neolithic colonists at previously unhabited coastal locations¹. The current view is that this migratory movement did not follow a constant pace i.e. it took 2,000 years to move from Cyprus to the Aegean, another 500 years to establish in Italy, and 500–600 additional years to reach the Iberian Peninsula¹.

North Africa was also a fundamental corridor for the diffusion of animal species domesticated in the Fertile Crescent^2,3,4. Zooarchaeological evidence suggests that cattle, sheep and goats were raised at the Sahara desert during the African Humid Period². Circa 5,900 YBP, a dramatic climatic shift triggered a process of desertification that forced African agropastoral societies to move southwards^2,3. Nomad populations of Central Saharan herdsmen arrived to the Sahel probably around 4,000 YBP². Subsequently, nomad pastoralists may have entered South Africa through the Tsetse-free corridor down the highland spine of East Central Africa². This migratory wave likely took place 2,000–2,400 YBP².

The analysis of mitochondrial and Y-chromosome^4,5 markers suggests that the African and Iberian caprine genetic pools remained connected by gene flow after their post-domestication split, but this has not been formally demonstrated in a statistical framework yet. In the current work, we have examined the magnitude and direction of migration between goat populations from Africa and Spain by analysing, with classical methods and coalescent genealogy samplers, a dataset of ~34,000 microsatellite genotypes. Our goal was to assess, in a statistical modeling framework, if migration had a relevant role in shaping the patterns of caprine genetic diversity found in Africa and the Iberian Peninsula.

Results

Analysis of genetic diversity and population structure

Most of the continental breeds analysed in the current work displayed high levels of diversity, with H_e values above 0.60 (Supplementary Table S1). With regard to insular populations, H_e estimates were 0.68, 0.63–0.68 and 0.49–0.66 in Cape Verdean, Balearic and Canarian goats, respectively. We also observed a tendency towards an increased diversity in Northwest African and Egyptian goats when compared with the Spanish ones i.e. the three populations with highest H_e (>0.70) were the Baladi, Saidi and Tunisian ones. In contrast, H_e estimates were particularly low in the Palmera (H_e = 0.49) and Tinerfeña del Norte (H_e = 0.57) breeds.

In general, the majority of F_ST coefficients (Supplementary Table S2) were low (F_ST < 0.05) or moderate (F_ST = 0.05–0.15), providing evidence that population structure is moderate (mean F_ST = 0.07) but significant (P < 0.001). There were, however, a few cases in which F_ST values reflected marked genetic differentiation. Indeed, when we compared the South African and Canarian goats with the remaining populations, estimated F_ST coefficients were moderate (F_ST = 0.06–0.14) but highly significant (P < 1.10⁻⁵). The comparison of goats from Spain and Central West Europe (Alpine and Saanen are French and Swiss breeds, respectively) and Northwest Africa (Supplementary Table S2), revealed a low level of differentiation (F_ST ≈ 0.03–0.04 for both comparisons), and the same result was obtained when we compared goat populations from Northwest Africa and Nigeria (F_ST ≈ 0.03). Insular goat breeds showed various patterns of differentiation with mainland populations. For example, Canarian goats appeared equally differentiated from African and Spanish breeds. In contrast, Balearic goats displayed a moderate differentiation with their Spanish (F_ST = 0.01–0.03) and Northwest African (F_ST ≈ 0.05) counterparts, respectively (Supplementary Table S2).

The Principal Coordinates Analysis (PCoA) plot shown in Fig. 1 supported the high differentiation of South African and Palmera goats when compared with the remaining caprine populations. Peninsular Spanish, Balearic and Central West European (Alpine and Saanen) breeds grouped together forming a small cluster clearly distinguishable from the African one. Northwest African and Egyptian breeds were distributed in a scattered cluster that was placed close to the Nigerian and Cape Verdean populations. A third highly differentiated cluster was formed by the Canarian breeds.

Bayesian clustering of the populations with Structure (Fig. 2, Supplementary Fig. S1) was consistent with the main trends outlined above. At all explored K-values (three independent runs were made for each K-value), the Canarian breeds formed a distinctive and homogeneous cluster clearly differentiated from the remaining ones. At K = 3 (Supplementary Fig. S1) and at K = 4 (Fig. 2), we detected a split between Spanish, Balearic and Central West European goats and their African counterparts. Importantly, Spanish breeds displayed the same genetic background as Alpine and Saanen goats even at K = 5 (Supplementary Fig. S1). Moreover, the majority of Spanish breeds were poorly differentiated. The African cluster became evident at K = 3 (Supplementary Fig. S1) and at K = 5 it was further subdivided into (1) South African breeds (Boer and Kalahari Red), (2) Egyptian (Barki, Baladi, Saidi) and Anglo-Nubian goats, and (3) Northwest African and Nigerian populations. The Structure analysis (K = 3–4) also revealed signals of mixed Spanish, Canarian and African genetic backgrounds in the Cape Verdean goats. This Atlantic archipelago remained uninhabited until Portuguese sailors discovered it in the 15^th century. Afterwards, Portuguese, West African and Canarian livestock were imported into Cape Verde as a source of food, a circumstance that may explain our findings.

We also computed the coefficients of membership at the most significant K-value i.e. K = 4 (Supplementary Fig. S3). We found that Spanish and Central West European breeds shared the same genetic background. Moreover, the genetic background more prevalent in Northwest African, Egyptian and Nigerian breeds (Supplementary Fig. S3) was weakly present (~4–6%) in Spanish goats (North Spain, South Spain and Balearic Islands), though it is difficult to assess the significance of this finding because this faint signal might not be due to a post-domestication introgression event but to ancient shared ancestry amongst goat populations. In Africa, there was a clear split between Northwest African and Nigerian goats with regard to those from the Southern part of the continent, although Egyptian breeds shared ancestry with the latter (at K = 4, 35%). The Canarian breeds had a unique genetic background, completely different from that found in other breeds, whilst, as previously discussed, goats from Cape Verde had a mixed Canarian, Spanish and African ancestry.

Estimation of migration rates

We estimated migration rates with the softwares Migrate-n⁶ and IMa⁷. With Migrate-n (Supplementary Table S3), the average Spearman correlation between parameter estimates obtained in two independent runs was 0.98, thus indicating that we had reached convergence. We did not know the mutation rates of the microsatellites employed in our analysis, so we did not make any attempt to convert our parameter estimates into quantities expressed in demographic units. Indeed, the main objectives of our study could be fully achieved without making such conversions. For the four migration routes investigated here, migration rate 95% confidence intervals did not comprise the zero-value.

With regard to IMa parameter estimates (Supplementary Tables S4 and S5 and Supplementary Fig. S4), the effective sample sizes and autocorrelation values obtained plus the comparison between two independent M-mode runs (Supplementary Fig. S4) demonstrated that our analyses had reached convergence. In general, θ-estimates obtained with IMa were larger than those inferred with Migrate-n, and this was particularly true for the Egyptian and Northwest African population. In contrast, migration rates inferred with IMa were much lower and more asymmetric than those calculated with Migrate-n (Supplementary Tables S3 and S4). We also calculated mutation-scaled times of divergence between populations as well as the sizes of the ancestral populations that yielded them (Supplementary Table S5).

The comparison of IMa nested models by using likelihood ratio tests (threshold of significance after correction for multiple testing, P-value = 0.003) showed that, in all cases, the full migration model (bi-directional asymmetric migration) had a much better fit to the data than the reduced model without migration (indicated by significant P-values in Table 1). In contrast, in several cases the full model did not account for the data significantly better than simpler models. For instance, when we analysed data from Northwest and Egyptian goats, the full model did not account for the data better than models with symmetric bidirectional migration or unidirectional migration (Table 1). The calculation of N_em with four methods, namely Wright’s equation⁸, the private-alleles method⁹ and Migrate-n⁶ and IMa⁷ coalescent genealogy samplers, showed remarkable discrepancies (Table 2). By far, the F_ST method yielded the largest N_em estimates, while the method of private-alleles and Migrate-n showed intermediate N_em values and IMa the smallest ones.

Table 1 Statistical support for the full vs reduced models comparison inferred with IMa through likelihood ratio tests.

Full size table

Table 2 Estimates of the effective number of migrants (N_em) calculated with Wright equation⁸, Slatkin method⁹, Migrate-n⁶ and IMa⁷.

Full size table

Discussion

We detected a high level of variation in the majority of African and Spanish goat breeds, with H_e in the range of 0.60–0.70 (Supplementary Table S1). These values were consistent with previous estimates obtained in South East Asian¹⁰ (H_e = 0.30–0.71), European and Near Eastern¹¹ (H_e = 0.69), Indian (H_e = 0.73–0.78)¹² and Chinese¹³ (H_e = 0.61–0.78) goat breeds. We also observed a limited level of genetic differentiation between Northwest African (Morocco, Algeria, Tunisia), Egyptian and Nigerian goat populations (F_ST ≈ 0.03–0.06). In the PCoA plot (Fig. 1), they grouped in relatively close proximity and in the Structure analysis (Fig. 2 and Supplementary Fig. S1) they displayed a similar genetic background. These results may seem paradoxical because North Africa and Nigeria are separated by the Sahara desert, a formidable geographic barrier to human and livestock dispersal. However, the Imazighen people that inhabit the Sahara are pastoral nomads that have traversed the desert during millenia transporting goods and livestock². Moreover, in the Early Holocene (9,000–5,900 YBP) the Sahara was not the hyper-arid desert of present times, but a savanna ecosystem with a benign climate that supported herding activities².

The population structure of African goats was mostly explained by the strong genetic differentiation between South African breeds (Boer and Kalahari Red) and those from Northwest Africa and Nigeria (Figs 1 and 2, Supplementary Fig. S1). Marked genetic differences between goats from South Africa and Mozambique have been observed when comparing them with those from North and West Africa⁴. Similarly, clear differentiation has been demonstrated between Southern African Pafuri and Ndebele breeds with regard to those from West and East Africa¹⁴. It would be worth investigating if the Tsetse fly belt (latitude parallels 15°N to 29°S) has enhanced the genetic differentiation of South African breeds by limiting genetic exchanges with northern areas. In this regard, an analysis of the landscape genetics of Burkina Faso goats provided evidence that the most significant genetic discontinuity between goat populations coincided with the boundary between Tsetse fly infested and free areas¹⁵. Indeed, trypanosomiasis could have affected the patterns of genetic diversity of African goats not only by acting as a biological barrier to the diffusion of trypanosusceptible goats but also because of the long-term selection pressure for trypanotolerance on goats raised in infested areas.

Data presented in Table 2 provided compelling evidence that F_ST coefficients give, in all cases, much higher N_em estimates than those provided by coalescent genealogy samplers. There are various possible explanations for this discrepancy. When the assumptions of the Wright approximation⁸, i.e. infinite number of populations at migration-drift equilibrium, mutation rates much lower than migration rates and absence of selection, are not met, N_em tends to be overestimated. Moreover, this lack of correspondence between F_ST and N_em is exacerbated when F_ST values are small¹⁶. Although F_ST should be considered as an excellent measure of genetic differentiation, it is clear that it does not tell much about the relative weight of its causal factors.

We used two coalescent genealogy samplers to calculate migration rates and N_em (Table 2, Supplementary Tables S3 and S4). The comparison of both parameter estimates made evident that migration rates obtained with IMa are lower than those inferred with Migrate-n. Similar observations were made in previous studies^17,18, where IMa estimates of migration were, at some instances, two orders of magnitude lower than those obtained with Migrate-n. This outcome is probably due to the fact that these two coalescent methods are based on different assumptions and statistical models^6,7. Indeed, divergence times of goat breeds are intrinsically short (<10,000 YBP) because they descend from a single gene pool domesticated in Eastern Anatolia, a feature that is modeled by IMa but not by Migrate-n.

Our assessment is that IMa gives more accurate measurements of migration rates because it takes into account the existence of isolation (i.e. the sharing of alleles is not only due to migration but also to recent divergence). Using this software, we compared full and reduced models (Table 1). For all routes, models with zero migration in both directions had a much poorer fit to the data than the full models. In other words, our results provided evidence that migration is statistically significant for the four migratory routes under analysis.

Regarding the accuracy of our migration estimates, we should acknowledge that our experimental design violates some assumptions of the IMa method, such as the lack of population substructure and the potential existence of unsampled populations. Although IMa has been shown to be quite robust to small to moderate violations of the Isolation with Migration Model, gene flow with unsampled populations could bias upward the splitting time between populations and the effective size of the ancestral population and also increase the asymmetry in the direction of gene flow¹⁹. Despite the fact that our experimental design does not perfectly match all the assumptions of IMa and this could lead to some bias in parameter estimates, we do not expect this circumstance to substantially modify the main conclusions of our study.

Detecting gene flow between Europe and Africa has been a major subject of research in the fields of human and livestock population genetics^20,21,22. The results obtained here suggest the existence of migration between Southern Spain and Northwest Africa (Tables 1 and 2 and Supplementary Tables S3 and S4). This result is consistent with previous data showing the existence of genetic connections between the Maghreb and Iberian caprine gene pools⁴. Moreover, the analysis of ancient bovine remains at the archaeological site of Atapuerca (northern Spain) demonstrated the presence of a mitochondrial variant with a likely African origin²¹. We also found significant gene flow from Central West Europe to Northern Spain, a finding that agrees well with their recent common ancestry³.

The impact of African introgression into Spanish breeds seemed to be quite limited. When we estimated the proportion of Central West European (Alpine and Saanen) vs Northwest African genetic background in the genomes of peninsular Spanish goats by using Structure, we only found weak traces of a putative African ancestry (4–6%, Supplementary Fig. S3). It is difficult to judge the significance of this finding, though it is worth highlighting that a recent analysis of worldwide bovine diversity indicated that the magnitude of African introgression into Iberian cattle was around 7.5%²³. This limited admixture is consistent with the significant genetic differentiation that exists between Southern Spanish and North African both goats⁴ and cattle²³. We can conclude that after dispersal from the Eastern Anatolia domestication center, the caprine Spanish and Northwest African gene pools evolved mostly in an independent manner, though some genetic exchanges took place.

We anticipated the existence of gene flow between the Canary Islands and Northwest Africa because this archipelago was settled by Imazighen peoples around 3,000 YBP, as supported by several lines of archaeological, linguistic and genetic evidence²⁴, and current Canarian goat populations are thought to descend from the ones brought by the first settlers of the archipelago²⁵. We also found significant bidirectional gene flow between Egypt and Northwest Africa (Table 1). In this regard, it is worth highlighting that North Africa has been inhabited over millenia by nomadic pastoralists, such as the Tuaregs and Bedouins, whose economy was mainly based on herding. This involved the seasonal migration of herders and livestock from winter to summer pastures and vice versa (transhumance).

Conclusions

We have detected the existence of gene flow between North African and Spanish goats, suggesting that these two gene pools did not evolve in a completely independent manner after their split ~7,000 YBP³. From a broader perspective, our data show that analysing gene flow in a domestic species as goats, where population splitting times are intrinsically short, without taking into account the contribution of isolation may lead to inflated estimates of migration rates.

Methods

Ethics statement

Blood and hair root samples were collected from goats by trained veterinarians in the context of sanitation campaigns and parentage controls not directly related with our research project. For this reason, permission from the Universitat Autònoma de Barcelona Committee of Ethics in Animal Experimentation was not required. In all instances, veterinarians followed standard procedures and relevant national guidelines to ensure appropriate animal care.

Goat sampling and genotyping

We have used a dataset of 1,472 individuals from 32 goat populations (Supplementary Table S6, Supplementary Fig. S5) native from Northern Spain (N = 106), Southern Spain (N = 303), Balearic Islands (N = 137), Central West Europe (N = 73), Northwest Africa (N = 96), Egypt (N = 156), Nigeria (N = 161), South Africa (N = 93), Canary Islands (N = 310) and Cape Verde (N = 37). Part of this dataset (856 Spanish Peninsular, Balearic and Canarian goats typed with 20 microsatellites) has been recently employed to make inferences about the population structure of Iberian caprine breeds²⁶. Genomic DNA purification from blood and hair samples and microsatellite genotyping were performed as previously described²⁶. The primers employed in microsatellite amplification are reported in Supplementary Table S7. After inspecting microsatellite data visually as well as with Lositan²⁷, Genepop²⁸, and MicroChecker²⁹, that detect selection, linkage disequilibrium and presence of null alleles respectively, six markers were discarded (CSSM66, INRA5, ILSTS19, SRCRSP5, SRCRSP23 and SRCRSP24) and the remaining 23 were used in further analyses.

Genetic diversity analyses

Genetic differentiation among breeds was estimated with Arlequin v3.3³⁰ by calculating the total and pairwise F_ST coefficients. The program GenAlEx v6.5³¹ was utilized to infer expected heterozygosities (H_e) as well as to generate a PCoA plot based on a pairwise F_ST matrix. In order to investigate the population structure of goat breeds, we employed Structure v2.3.4³². We carried out 25 runs with 1 million iterations and 200,000 iterations as burn-in. We considered the allele frequencies as correlated and we used the admixture option, considering a range of K-values from 2 to 6. This analysis was carried out three times in order to assess the repeatability of results. The output of Structure was collated with the web-based program Structure Harvester³³ in order to visualize likelihood scores and to infer the most likely K-value³⁴. Coefficients of membership were obtained by averaging the inferred values of ancestry of individuals for each population.

Inferring gene flow with coalescent genealogy samplers

We used two coalescent genealogy samplers i.e Migrate-n⁶ and IMa⁷ to estimate mutation-scaled population sizes (θ_i = 4N_eμ) and migration rates (M_ij = m_ij/μ). We did not use TreeMix to infer migration events because this software models migration between populations as occurring at a single instantaneous time point³⁵, an assumption that is quite unrealistic for domestic species. Genealogy samplers estimate parameters by using a large collection of genealogies³⁶. They often employ a Markov Chain Monte Carlo (MCMC) approach to explore the parameter space with the goal of identifying those genealogies that best fit the data³⁶. Our coalescent analysis was focused on four potential migratory routes: Egypt vs Northwest Africa, Northwest Africa vs the Canary Islands, Northwest Africa vs Southern Spain and Northern Spain vs Central West Europe. We did not analyse the route linking Northwest Africa with Cape Verde because this archipelago was colonized only 500 YBP, and this time of divergence is too short to allow meaningful migration analyses. We also excluded South African and Nigerian breeds from our analysis because the central and meridional parts of the African continent are poorly sampled in our study.

Migrate-n considers an Island Model where populations have been exchanging migrants at a constant rate for a very long time (island model). In contrast, IMa is built on an Isolation with Migration Model that includes six parameters: three population sizes (q₁, population 1; q₂, population 2, and q_A, ancestral population from which populations 1 and 2 are derived), two migration rates and the splitting time⁷. The Isolation with Migration model implemented in IMa can distinguish between migration and the sharing of ancestral polymorphisms because it explicitly models both factors in a coalescent model where gene trees can coalesce in the ancestral population (common history) and exchange migrants amongst populations (migration) within the same genealogy and with separate parameters that will determine the shape of such genealogy. The more probable genealogy produced by both processes and given the observed data will be accepted. In consequence, significant differences between IMa and Migrate-n estimates (with IMa < Migrate-n) would imply that recent divergence has contributed significantly to allele sharing amongst populations.

Migrate-n analysis

Migrate-n v. 3.6. builds on a n-island model where population sizes and migration rates do not change over time, and considers asymmetrical migration rates and populations with distinct sizes⁶. After several optimization pilot runs, we decided to infer migration rates on a pairwise basis (two populations at a time) because if not (i.e. when considering all populations at the same time) the computational burden was unmanageable with the available resources. We used subsamples of genotypes retrieved from 30 individuals (per population) and 10 microsatellites chosen at random (ETH10, ETH225, MAF65, CRSM60, BM6526, TGLA122, SPS115, OarFCB11, OarFCB304 and McM527) because otherwise the amount of time required to complete the analyses was prohibitive. We initially made two short independent runs, using a Brownian motion mutation model that approximates the classical stepwise model, and a maximum-likelihood inference strategy. We considered variable mutation rates amongst loci, and the M_ij and θ_i starting parameters were based on F_ST and N_e calculations, respectively. The parameter M_ij defines the proportion of genes of population j that come from population i per generation. Each of the replicate runs consisted of 10 short-chains (100,000 visited genealogies, 1,000 recorded steps) and three long-chains (4 million visited genealogies, 40,000 recorded steps), with a burn-in of one million iterations. We used an adaptive heating scheme with 4 chains with temperatures set by default at Migrate-n (swapping interval = 1) to increase the efficiency of the MCMC search. Averaged parameter estimates were used as priors in two subsequent long runs based on an adaptive heating scheme and comprising 10 short-chains (100,000 visited genealogies, 1,000 recorded steps), and three long-chains (8 million visited genealogies, 80,000 recorded steps) with a burn-in of 1 million steps. We made sure that both long runs converged to the same parameter estimates, that were subsequently averaged.

Isolation with Migration analysis (IMa)

IMa is better suited than Migrate-n for analysing populations (populations 1 and 2) that diverged recently from a source population (population A) i.e less than 10,000 years ago. The migration rate m₁ defines the fraction of genes of population 1 that come from population 2 (m₁ would be equivalent to M₂₁ of Migrate-n) per generation, while m₂ refers to the opposite direction (m₂ = M₁₂ of Migrate-n). We carried out pre-analyses of the data to define suitable prior ranges and running conditions. Parameter ranges for uniform priors were chosen on the basis of the highest posterior density intervals (at 90%) of the estimates obtained in these initial runs. Priors for θ-values were used directly as the upper bounds of the corresponding prior distributions and they varied depending on the run under consideration (Supplementary Fig. S4). To ensure an efficient exploration of the parameter space, we employed a geometric heating scheme (heating parameters: g₁ = 0.9, g₂ = 0.8) with 200 Metropolis-coupled Markov chains. We made sure that the mixing was good and that convergence had been achieved by checking estimated sample sizes, autocorrelations, and parameter trendline plots (Supplementary Fig. S4). Once we verified that marginal posterior distributions of the two runs had achieved similar solutions, a total of 150,000 genealogies were analysed by using the nested models option in the “Load Trees Mode” (L-Mode). By using log-likelihood ratio tests, this analysis determined if the fully parameterized IMa model explains the data significantly better than a series of simpler models with fewer parameters.

Calculation of the effective number of migrants (N_em)

The effective number of migrants (N_em, immigration expressed in units of genetically effective individuals) was calculated by using a variety of methods. The first of them was based on F_ST estimates and derives N_em with equation (1)⁸:

In addition, we calculated N_em with the private-alleles method⁹, implemented in Genepop²⁸, that takes into account a correction for population size. Equation (2) assumes that there is a linear relationship between the average frequency of private alleles (p_i) and N_em. Indeed, private alleles (those present in only one subpopulation) may reach high frequencies only when N_em is small. The corresponding equation can be expressed as:

where a and b are the slope and the intercept of the regression of N_em over p_i (both depend on the number of individuals sampled by subpopulation), respectively³⁷. Finally, we also inferred N_em from coalescent-based estimates (θ₁, θ₂, M₂₁, M₁₂) obtained with Migrate-n and IMa. In this context, N_em depends on the product of the effective size of the population that receives the migrants by the corresponding migration rate i.e. the fraction of the recipient population composed by immigrants. The relationship between θ_i and M_ij can be expressed with equation (3) when considering autosomal markers³⁸:

Additional Information

How to cite this article: Martínez, A. et al. Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach. Sci. Rep. 6, 38935; doi: 10.1038/srep38935 (2016).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Zeder, M. A. Domestication and early agriculture in the Mediterranean Basin: Origins, diffusion, and impact. Proc. Natl. Acad. Sci. USA 105, 11597–11604 (2008).
Article CAS ADS PubMed PubMed Central Google Scholar
Smith, A. B. Origins and spread of pastoralism in Africa. Nomad. People 32, 91–105 (1993).
MathSciNet Google Scholar
Pereira, F. & Amorim, A. Origin and spread of goat pastoralism. In Encyclopedia of Life Sciences (ELS) John Wiley & Sons, Ltd., Chichester (2010).
Pereira, F. et al. Tracing the history of goat pastoralism: new clues from mitochondrial and Y chromosome DNA in North Africa. Mol. Biol. Evol. 26, 2765–2773 (2009).
Article CAS PubMed Google Scholar
Pereira, F., Pereira, L., Van Asch, B., Bradley, D. G. & Amorim, A. The mtDNA catalogue of all Portuguese autochthonous goat (Capra hircus) breeds: high diversity of female lineages at the western fringe of European distribution. Mol Ecol. 14, 2313–2318 (2005).
Article CAS PubMed Google Scholar
Beerli, P. & Felsenstein, J. Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach. Proc. Natl. Acad. Sci. USA 98, 4563–4568 (2001).
Article CAS ADS MATH PubMed PubMed Central Google Scholar
Hey, J. & Nielsen, R. Integration within the Felsenstein equation for improved Markov chain Monte Carlo methods in population genetics. Proc. Natl. Acad. Sci. USA 104, 2785–2790 (2007).
Article CAS ADS PubMed PubMed Central Google Scholar
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
CAS PubMed PubMed Central Google Scholar
Slatkin, M. Gene flow in natural populations. Annu. Rev. Ecol., Evol. Syst. 16, 393–430 (1985).
Article Google Scholar
Nomura, K. et al. Microsatellite DNA markers indicate three genetic lineages in East Asian indigenous goat populations. Anim. Genet. 43, 760–7 (2012).
Article CAS PubMed Google Scholar
Cañón, J. et al. Geographical partitioning of goat diversity in Europe and the Middle East. Anim. Genet. 37, 327–334 (2006).
Article PubMed Google Scholar
Rout, P. K. et al. Microsatellite-based phylogeny of Indian domestic goats. BMC Genet. 9, 11 (2008).
Article PubMed PubMed Central Google Scholar
Li, M. H. et al. Genetic relationships among twelve Chinese indigenous goat populations based on microsatellite analysis. Genet. Sel. Evol. 34, 729–744 (2002).
Article CAS PubMed PubMed Central Google Scholar
Chenyambuga, S. W. et al. Genetic characterization of indigenous goats of sub-Saharan Africa using microsatellite DNA markers. Asian-Australas. J. Anim. Sci. 17, 445–452 (2004).
Article CAS Google Scholar
Traoré, A. et al. Ascertaining gene flow patterns in livestock populations of developing countries: a case study in Burkina Faso goat. BMC Genet. 13, 35 (2012).
Article PubMed PubMed Central Google Scholar
Waples, R. S. Separating the wheat from the chaff: patterns of genetic differentiation in high gene flow species. J. Hered. 89, 438–450 (1998).
Article Google Scholar
Pavey, S. A., Nielsen, J. L. & Hamon, T. R. Recent ecological divergence despite migration in sockeye salmon (Oncorhynchus nerka). Evolution 64, 1773–1783 (2010).
Article PubMed PubMed Central Google Scholar
Liao, P. C., Tsai, C. C., Chou, C. H. & Chiang, Y. C. Introgression between cultivars and wild populations of Momordica charantia L. (Cucurbitaceae) in Taiwan. Int. J. Mol. Sci. 13, 6469–6491 (2012).
Article CAS PubMed PubMed Central Google Scholar
Strasburg, J. L. & Rieseberg, L. H. How robust are “isolation with migration” analyses to violations of the IM model? A simulation study. Mol. Biol. Evol. 27, 297–310 (2010).
Article CAS PubMed Google Scholar
Currat, M., Poloni, E. S. & Sanchez-Mazas, A. Human genetic differentiation across the Strait of Gibraltar. BMC Evol Biol. 10, 237 (2010).
Article PubMed PubMed Central Google Scholar
Anderung, C. et al. Prehistoric contacts over the Straits of Gibraltar indicated by genetic analysis of Iberian Bronze Age cattle. Proc. Natl. Acad. Sci. USA 102, 8431–8435 (2005).
Article CAS ADS PubMed PubMed Central Google Scholar
Botigué, L. R. et al. Gene flow from North Africa contributes to differential human genetic diversity in Southern Europe. Proc. Natl. Acad. Sci. USA 110, 11791–11796 (2013).
Article ADS PubMed PubMed Central Google Scholar
Decker, J. E. et al. Worldwide patterns of ancestry, divergence, and admixture in domesticated cattle. PLoS Genet. 10, e1004254 (2014).
Article PubMed PubMed Central Google Scholar
Fregel, R. et al. The maternal aborigine colonization of La Palma (Canary Islands) Eur. J. Hum. Genet. 17, 1314–1324 (2009).
Article ADS PubMed PubMed Central Google Scholar
Ferrando et al. A mitochondrial analysis reveals distinct founder effect signatures in Canarian and Balearic goats. Anim. Genet. 46, 452–456 (2015).
Article CAS PubMed Google Scholar
Martínez, A. M. et al. The Southwestern fringe of Europe as an important reservoir of caprine biodiversity. Genet. Sel. Evol. 47, 86 (2015).
Article PubMed PubMed Central Google Scholar
Antao, T., Lopes, A., Lopes, R. J., Beja-Pereira, A. & Luikart, G. LOSITAN: a workbench to detect molecular adaptation based on a Fst-outlier method. BMC Bioinformatics 9, 323 (2008).
Article PubMed PubMed Central Google Scholar
Rousset, F. Genepop’007: a complete reimplementation of the Genepop software for Windows and Linux. Mol. Ecol. Resour. 8, 103–106 (2008).
Article PubMed Google Scholar
Van Oosterhout, C., Hutchinson, W. F., Wills, D. P. M. & Shipley, P. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes, 4, 535–538 (2004).
Article CAS Google Scholar
Excoffier, L. & Lischer, H. E. L. Arlequin suite ver 3.5: A new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567 (2010).
Article PubMed Google Scholar
Peakall, R. & Smouse, P. E. GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol. Ecol. Notes 6, 288–295 (2006).
Article Google Scholar
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
CAS PubMed PubMed Central Google Scholar
Earl, D. A. & von Holdt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Cons. Genet. Resour. 4, 359–361 (2012).
Article Google Scholar
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Article CAS PubMed Google Scholar
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet. 8, e1002967 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kuhner, M. K. Coalescent genealogy samplers: windows into population history. Trends Ecol. Evol. 24, 86–93 (2009).
Article PubMed Google Scholar
Barton, N. H. & Slatkin, M. A quasi-equilibrium theory of the distribution of rare alleles in a subdivided population. Heredity 56, 409–15 (1986).
Article PubMed Google Scholar
Marko, P. B. & Hart, M. W. The complex analytical landscape of gene flow inference. Trends Ecol. Evol. 26, 448–456 (2011).
Article PubMed Google Scholar

Download references

Acknowledgements

We acknowledge the support of the Spanish Ministry of Economy and Competitivity for the Center of Excellence Severo Ochoa 2016–2019 (SEV-2015-0533) grant awarded to the Center for Research in Agricultural Genomics. Many thanks to Alejandro Sánchez Gracia and Sara Guirao for valuable advice in the IMa analysis and for making a critical revision of the manuscript.

Author information

Martínez Amparo and Manunza Arianna contributed equally to this work.

Authors and Affiliations

Departamento de Genética, Universidad de Córdoba, Córdoba, 14071, Spain
Amparo Martínez, Juan Vicente Delgado & Vincenzo Landi
Department of Animal Genetics, Center for Research in Agricultural Genomics (CSIC-IRTA-UAB-UB), Campus Universitat Autònoma de Barcelona, Bellaterra, 08193, Spain
Arianna Manunza & Marcel Amills
Department of Animal Breeding and Genetics, Federal University of Agriculture, Abeokuta, PMB 2240, Nigeria
Ayotunde Adebambo & Muritala Ismaila
Instituto Canario de Investigaciones Agrarias, La Laguna, 38108, Tenerife, Spain
Juan Capote
Livestock & Wildlife Laboratory, Arid Land Institute Medenine, Médenine, 4119, Tunisia
Mabrouk El Ouni
Department of Animal Biotechnology, Animal Production Research Institute, Dokki, Giza, Egypt
Ahmed Elbeltagy
Genetics Department, Faculty of Agriculture, Ain Shams University, Shubra, 11241, Cairo, Egypt
Asmaa M. Abushady
Animal Production Department, Faculty of Agriculture, Ain Shams University, Abbassia, 11566, Cairo, Egypt
Salah Galal
Departament de Ciència Animal i dels Aliments, Universitat Autònoma de Barcelona, Bellaterra, 08193, Spain
Ainhoa Ferrando, Jordi Jordana & Marcel Amills
Servicio de Ganadería. Diputación Foral de Bizkaia. Avda, Lehendakari Aguirre n° 9-2°, Bilbao, 48014, Spain
Mariano Gómez
Unitat de Races Autòctones, Servei de Millora Agrària, (SEMILLA-SAU), Son Ferriol, 07198, Spain
Agueda Pons
University Mohammed V, Agdal, Faculty of Sciences, 4 Av. Ibn Battota, Rabat, Morocco
Bouabid Badaoui
Departament de Biologia, Universitat de Girona, Girona, 17071, Spain
Oriol Vidal

Authors

Amparo Martínez
View author publications
You can also search for this author in PubMed Google Scholar
Arianna Manunza
View author publications
You can also search for this author in PubMed Google Scholar
Juan Vicente Delgado
View author publications
You can also search for this author in PubMed Google Scholar
Vincenzo Landi
View author publications
You can also search for this author in PubMed Google Scholar
Ayotunde Adebambo
View author publications
You can also search for this author in PubMed Google Scholar
Muritala Ismaila
View author publications
You can also search for this author in PubMed Google Scholar
Juan Capote
View author publications
You can also search for this author in PubMed Google Scholar
Mabrouk El Ouni
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Elbeltagy
View author publications
You can also search for this author in PubMed Google Scholar
Asmaa M. Abushady
View author publications
You can also search for this author in PubMed Google Scholar
Salah Galal
View author publications
You can also search for this author in PubMed Google Scholar
Ainhoa Ferrando
View author publications
You can also search for this author in PubMed Google Scholar
Mariano Gómez
View author publications
You can also search for this author in PubMed Google Scholar
Agueda Pons
View author publications
You can also search for this author in PubMed Google Scholar
Bouabid Badaoui
View author publications
You can also search for this author in PubMed Google Scholar
Jordi Jordana
View author publications
You can also search for this author in PubMed Google Scholar
Oriol Vidal
View author publications
You can also search for this author in PubMed Google Scholar
Marcel Amills
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design: J.V.D., A. Martínez, M.A., J.J., O.V.; Performed the molecular analysis: A. Martínez, V.L., J.V.D.; Performed the population genetic analyses: A. Manunza, M.A., B.B.; Provision of the data: A.A., M.I., J.C., M.E.O., A.E., A.M.A., S.G., M.G., A.P., A.F.; M.A. wrote the manuscript and all authors read, corrected and approved its content.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Martínez, A., Manunza, A., Delgado, J. et al. Detecting the existence of gene flow between Spanish and North African goats through a coalescent approach. Sci Rep 6, 38935 (2016). https://doi.org/10.1038/srep38935

Download citation

Received: 01 February 2016
Accepted: 16 November 2016
Published: 14 December 2016
DOI: https://doi.org/10.1038/srep38935

This article is cited by

The demographic history and adaptation of Canarian goat breeds to environmental conditions through the use of genome-wide SNP data
- Gabriele Senczuk
- Martina Macrì
- Amparo Martínez
Genetics Selection Evolution (2024)
Characterizing the Mitochondrial Diversity of Arbi Goats from Tunisia
- Yosra Ressaissi
- Marcel Amills
- Mohamed Ben Hamouda
Biochemical Genetics (2021)
Differential distribution of Y-chromosome haplotypes in Swiss and Southern European goat breeds
- Oriol Vidal
- Cord Drögemüller
- Marcel Amills
Scientific Reports (2017)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.