Abstract
Sheep farming has been fundamental to many civilizations in the world and is practiced in India since antiquity. Several thousand years of adaptation to local environmental conditions and selective breeding have evolved 44 sheep breeds in India. They are paramount in terms of economic, scientific, and cultural heritage. Genetic characterization information is imperative for sustainable utilization and conservation of ovine heritage. In this study, the genetic diversity, differentiation, and structure of 11 indigenous sheep breeds from three different agro-ecological zones of India were explored with genomic microsatellite loci and mitochondrial DNA (D loop). The estimated diversity parameters indicated that populations retained high levels of genetic diversity (Na = 8.27 ± 0.17; Ho = 0.65 ± 0.01), which provides an optimistic viewpoint for their survival. However, significant inbreeding was also observed in the nine populations. Moderate genetic differentiation existed among the groups (FST = 0.129 ± 0.012), and most likely clusters existing in the dataset are seven. Phylogenetic clustering was in line with the geographical locations of sheep populations. Mitochondrial sequences revealed high haplotype diversity with the existence of maternal haplogroups A, B, and C, and signals of population expansion. Decreased genetic diversity and unique maternal lineage (C) in endangered Tibetan and Bonpala sheep breed, warrant their immediate scientific management. Overall, the quantitative data reported here on the extant variability, and genetic relationships among the Indian sheep breeds, provide critically important inputs that will be valuable for the decision-making process on their management, both for the conservation of endangered breeds, and formulation of breeding programs to check genetic erosion.
Similar content being viewed by others
Introduction
Sheep, Ovis aries, are a highly versatile and adaptable species. From their domestication in the Fertile Crescent, approximately 11,000 years ago, sheep now span the sundry terrains of all inhabited continents and are utilized as a source of food (meat, milk, fat) and clothing (skin, wool)1. India has the second-largest sheep inventory in the world (6.4%) after China, with 74.26 million heads, and sheep number has increased (14.13%) in the recent past (2012 to 2019)2. Sheep are an important species of livestock for India. Sheep contribute immensely to the agrarian economy and to the livelihoods of small and marginal farmers in the harshest areas, where agriculture and dairying are not cost-effective. Mountains from Afganistan to Armenia have various wild varieties of sheep (Ovis orientalis vignai), who were possibly the ancestors of Indian sheep3. Urial and Argali stock are considered to be the progenitor of Indian sheep3. Thousands of years of natural selection for adaptation to varied agro-ecological conditions4, and planned field level breeding for environmental tolerance, behavioral, and commercial traits are reflected in the form of 44 sheep breeds in India (www.nbagr.res.in).
Genetic diversity (the variation of alleles and genotypes present in a population) reflects population size, history, ecology, and adaptability5. It plays a vital role in providing traits responsible for improvement, survival, and adaptation of a species6. Future challenges, such as climate change, emerging diseases, pressure on land and water, and shifting market demands, make it even more important to ensure that livestock genetic resources are conserved and used sustainably7. The vast Indian sheep biodiversity is being eroded rapidly, with more than 50% of sheep breeds currently under threat8. Thus, a better understanding of the indigenous sheep genetic diversity can pave way for devising conservation strategies; facilitate their effective use for future breeding programs and, formulating management policies. Genetic diversity is often assessed by molecular markers, including the nuclear microsatellites, also referred to as the simple sequence repeat markers (nSSR) and mitochondrial DNA (mtDNA)9. Nuclear SSR has been widely accepted as a useful tool for measuring genetic diversity and divergence within and among livestock populations10. Microsatellite markers are consistent with Mendel’s laws of inheritance, are highly polymorphic, and generally have the co-dominant inheritance. Moreover, these are highly reproducible and easy to perform11. Therefore, genetic diversity data obtained by nSSR is used for surveying the genetic variability of livestock species that provides theory references to prevent the loss of genetic diversity, over the time12. These are being used to estimate the diversity of autochthonous sheep breeds all over the world13,14, including India8,15.
Mitochondrial DNA has contributed immensely to elucidating the molecular phylogeny of many livestock species due to a higher mutation rate than the nuclear DNA, maternal inheritance, lack of recombination, and existence as multiple copies. The control region or the displacement region (D-loop) has particularly attracted the attention of researchers, to analyze phylogenetic relationships, and explore domestication events across species16,17. Multiphyletic origins involving 5 phylogenetically divergent mtDNA haplogroups (A–E) have been delineated in sheep, with haplogroups A and B being recorded in many countries and continents across the globe, while for haplogroups C, D, and E, the geographic distribution is relatively less widespread18. Studies on Indian domestic sheep have unveiled that haplogroup A is mainly represented in indigenous breeds, followed by haplogroup B19. Haplogroup C, which is considered a less frequent haplogroup worldwide, has also been seen in < 1% of the sampled animals, in one study on Indian sheep breeds20.
The sheep breeds in India have been classified into four groups, based on agro-ecological regions3, that is (1) North temperate region (NT), (2) North-Western arid and semi-arid region (NWASA), (3) Southern peninsular region (SP), and (4) Eastern region (ET)4. Himalaya mountains constitute the NT region. The NWASA consists of scattered hills in vast alluvial plains and an undulating landscape of the sandy desert. This is the primary region in the country for carpet-wool production and has the second largest population of sheep among the four regions. The SP region has the largest sheep population in the country, and these are maintained primarily for meat. The zone has a hot and humid coastline and semi-arid central peninsular region. Its topography includes sandy tracts, mountain ranges, plateaus, deltas, planes and, even highlands. The ET region is primarily hot and humid, apart from a few sub-temperate and humid areas of eastern states. The landscape is highly variable with alluvial plains along with undulations, plateaus and table-land, and Himalaya hill ranges as well as valleys in the north-eastern area. Here, sheep are primarily maintained for meat and produce extremely coarse and hairy fleece (https://www.dahd.nic.in).
The adaptation of different Indian breeds to a wide range of agroecology offers the necessary variability, that provides the opportunity to meet the future demands for food, and endows with the flexibility to respond to altered markets and needs5. So far, the characterization of sheep diversity in India has largely been explored using microsatellites, with limited studies utilizing mitochondrial DNA. These are mainly restricted to a single population21, with very few studies targeting more number of breeds22,23. Genetic variation and relationships among the majority of recognized sheep breeds of NWASA region have been established using microsatellite markers24. Comprehensive knowledge about the sheep breeds of the remaining three agro-ecological regions of India is still sought after, for designing suitable and sustainable sheep breeding programs and conservation strategies.
The present study was undertaken to assess the genetic diversity, population structure, and phylogeography of 11 unexplored indigenous sheep breeds from three agro-ecological regions (NT, SP, ET) of the country (Fig. 1), using both nSSR and mtDNA markers. Results obtained using two types of markers that have different modes of inheritance, and degrees of polymorphism will provide inclusive insight into the lineage relationship of various Indian sheep breeds, and molecular information on the genetic variability existing within and among them. It will act as a baseline data to devise strategies for sustainable development of precious Indian sheep resources.
Results
Mitochondrial DNA sequence variation and genetic diversity in Indian sheep breeds
In this study, we amplified a 1246 bp fragment of ovine mitochondrial control region encompassing part of 12S rRNA coding region and tRNAPhe gene. Analysis of the amplified sequences was restricted to a 1,059 bp fragment, that corresponds to a consensus region defined by our sequences and those retrieved from the GenBank (10 reference sequences), representing 5 well-defined sheep haplogroups. Sequence analysis yielded a total of 83 novel haplotypes in Indian sheep, with overall haplotype diversity (Hd) value of 0.8502 ± 0.024, and nucleotide diversity (π) equal to 0.00373 ± 0.00029. The sequences were defined by 26 singleton variable sites and 44 parsimony informative sites. The maximum number of haplotypes was observed in Bonpala and Poonchi sheep, and minimum in Tibetan sheep. The Hd values ranged from 0.7526 ± 0.099 in Changthangi to 0.9004 ± 0.056 in Poonchi. The estimates of nucleotide diversity (π) were maximum for Tibetan sheep and minimum for Bandur sheep. Detailed statistics summarizing various genetic diversity parameters are given in Table 1. Out of 83 haplotypes, 16 were shared within breeds, and 10 were common among breeds. The majority of haplotypes (57) were private to individual samples, thus suggesting high variation in the analyzed region of the mtDNA. The most frequent haplotype was present in 84 animals, from all the breeds studied, and the second most frequent haplotype was shared by 14 samples from 8 breeds.
mtDNA based phylogenetic analysis and genetic structure of Indian sheep
A median joining (MJ) network constructed using the haplotypes from Indian sheep breeds along with 10 reference sequences for the 5 known sheep haplogroups, revealed that Indian sheep were undoubtedly separated into three distinct clusters that corresponded to haplogroups A, B, and C. Out of a total of 154 Indian sheep specific haplotypes, 134 grouped with haplogroup A, 18 with haplogroup B and 2 with haplogroup C (Fig. 2). The phylogenetic topology obtained by the construction of neighbor joining (NJ) tree was also similar (Supplementary Fig. S1). The proportion of animals in haplogroups A, B, and C was 88.86%, 10.46%, and 0.66%, respectively. In addition to nine breeds that previously showed conformity to haplogroup B (Marwari, Nali, Kheri, Muzaffarnagri, Sonadi, Chokla, Jaisalmeri, Patanwadi and Garole), this study revealed that seven more breeds had their representation in haplogroup B (Table 1). Interestingly, unlike other breeds, haplogroup B was the most frequent haplogroup in Tibetan sheep. Our study, for the first time, reports clustering of some animals of breeds of the Eastern region i.e. Bonpala and Tibetan with haplogroup C. From the MJ network profile, a star-like shape for the haplogroup A could be appreciated, since the most common haplotype comprising of 166 members was present at the center, and was surrounded by haplotypes that were separated by few mutations only. If we specifically discuss the 11 sheep breeds analyzed in this study, out of a total of 16 haplotypes that were shared within breeds, the number of haplotypes that belonged to haplogroups A, B, and C were 10, 5, and 1, respectively. Among the 10 haplotypes that were common among breeds, 8 conformed to haplogroup A, and 2 represented haplogroup B.
Assessment of mismatch distribution in Indian sheep breeds revealed Raggedness index of 0.012 (P = 0.97), 0.043 (P = 0.85), 0.013 (P = 0.76) and 0.010 (P = 1.00), respectively, in sheep breeds from NT, NWASA, SP and ET regions, hinting at demographic expansion. Additionally, the sum of squared deviations (SSD) between the observed and expected mismatch values also presented non-significant low values, again suggesting population expansion. This observation was also substantiated by the results of Fu’s Fs and Tajima’s D neutrality tests. Significant negative Fu’s Fs values, which are inferred as signs of either purifying selection or demographic expansion were evident in all groups. Similarly, Tajima’s D value was negative, but significant only in the case of sheep from ET and SP zones of India. Details of population demographic parameters based on analysis of the mtDNA D loop region in Indian sheep are presented in Table 2.
Genetic variability in Indian sheep breeds based on microsatellite markers
The genotype data generated in the present study showed that a significant amount of genetic variation is maintained in the Indian sheep populations. All the markers were found to be polymorphic in each of the 11 analyzed populations. Genotype inferring errors due to stuttering, large allele dropout, and high frequency of null alleles were not detected. Non-significant linkage disequilibrium was ascertained between these loci thus, all the loci were retained for diversity and differentiation analysis. The level of variation depicted by the number of alleles at each locus serves as a measure of genetic variability. FAO has specified a minimum of four different alleles per locus, for evaluation of genetic differences between the breeds10. By this criterion, all the 25 microsatellite loci showed ample polymorphism for evaluating within breed genetic variability, and for exploring the genetic differences (Table 3). A total of 517 alleles, with an overall mean of 8.27 ± 0.19 alleles per locus, were observed in the 11 sheep breeds. The most polymorphic loci were OarCP49 and BM1314, with 13 alleles, while BM6506 was observed to be the least polymorphic with 4 alleles across the 11 sheep breeds.
The average observed number of alleles per locus ranged from 5.96 in Tibetan and Bonpala to 9.12 in Kenguri and Rampur Bushair sheep (Table 4). The average effective number of alleles in a population varied from 2.99 (Bonpala) to 4.72 (Shahbadi), with a mean across all the loci of 4.21 ± 0.11. Lower values of the expected number of alleles, as compared to the observed number of alleles in all the populations, suggested that there were many low-frequency alleles in the populations. The private alleles, confined to one population only, were detected, but most of them were rare alleles having less than 5% allele frequencies. All the microsatellite markers were highly polymorphic according to the mean value of Shannon’s information index (I). Estimates of observed heterozygosity across all the loci and populations (0.65 ± 0.01), confirmed the remarkable level of diversity existing in the Indian sheep. Observed heterozygosity was lower than the expected heterozygosity in all the populations, except for the Poonchi and Bonpala. Similarly, analysis of FIS evidenced significant (P < 0.05) heterozygote deficiency in all the populations, except for the Poonchi and Bonpala, which on the contrary depicted heterozygote excess (Table 4).
nSSR based genetic differentiation and population structure
To describe the level of heterogeneity within and between the Indian sheep populations, F-statistics values were determined. Results for each of the 25 loci, across all the populations, are presented in Table 3. The global deficit of heterozygotes across populations (FIT) amounted to 20%. An overall significant (P < 0.05) deficit of heterozygotes (FIS) of 8.5% occurred in the analyzed loci because of inbreeding within the populations. There was a moderate (0.05 < Fst < 0.15) breed differentiation between the 11 populations. The multi-locus FST value of breed differentiation indicated that 12.9% of the total genetic variation was due to the unique allelic differences between the breeds. The remaining 87.1% corresponded to the differences among the individuals of a breed, across the 25 markers. Pair-wise FST coefficients between the two populations are shown in Table 5, above diagonal. All pair-wise FST values were significant (P < 0.05). In summary, it revealed the least differentiation between the Bellary-Hassan populations of the SP region. The highest divergence was observed between Changthangi (NT) and Bandur (SP) and was closely followed by Tibetan (ET) and Bandur (SP). The average estimated effective number of migrants per generation, between pairs of the populations (Supplementary Table S1), was of moderate magnitude (Nm = 2.0 ± 0.16). The highest gene flow was observed among Bellary and Hassan (Nm = 22.55) populations of the SP region. Similarly, genetic variation among the different groups, compared using AMOVA (analysis of molecular variance), revealed that there was an 11.1% variation among the populations (Supplementary Table S2), which is consistent with the FST results (12.9%). The pair-wise fixation index (FST) value provided by AMOVA through 99 permutations differed significantly from zero at P < 0.05. The largest genetic distance (DA) was recorded between Changthangi (NT) and Bandur (SP) sheep breeds, while Bellary and Hassan of the SP region were closest.
Genetic relationship among populations was further confirmed by the construction of phylogenetic tree (NJ), using Nei’s genetic distance (Fig. 3). Tibetan sheep population was most distinct, and separated first. Results illustrated clustering together of geographically close sheep breeds, while Tibetan, Changthangi and Bonpala were placed on separate nodes.
The overall accuracy of population self-assignment was very high (Supplementary Table S3), to the tune of 96%. All the animals, except one of Rampur Bushair, Bandur, and Balangir, seven of Hassan and nine of Bellary breeds were correctly assigned to their respective groups. Clustering using Bayesian approaches was performed on the entire data set, with an increasing number of inferred clusters (K) from 2 to 13. The likely value of K, which best captures the variation present in the data was seven (Supplementary Fig. S2). Changthangi, Bonpala, Tibetan, and Shahabadi formed their clusters. Rampur Bushair and Poonchi animals of NT were partitioned into one cluster. Similarly, breeds of the SP region showed intermixing. Even if K was increased up to 13, individuals of Bellary and Hassan remained admixed.
Discussion
Evolution is the foundation of the enormous variation existing between and within species as well as between and within breeds of a species. Evolution corresponds to the differences in allele frequencies over time, and these differences can be used for the characterization of populations5. Accordingly, the genetic variability of different indigenous sheep breeds in India was investigated.
Molecular diversity studies of farm animal genetic resources using mitochondrial DNA have extensively traced domestication events and proposed multiphyletic origins across different species, including sheep. This study explored the membership of 11 breeds of Indian sheep to the 5 well-defined sheep haplogroups identified across continents18. Three highly diverged haplogroups (A, B, and C) including 84.68%, 13.96%, and 1.35% samples, respectively, were evident in the investigated sheep populations. Haplogroups D and E, which are considered rare globally, were not observed in this study. Our results are in agreement with the previous studies in Indian sheep, involving breeds other than those considered for the present investigation, wherein haplogroup A was observed in 89.6% and haplogroup B in 10.38% of the total samples19. Five breeds of NWASA region namely Marwari, Nali, Kheri, Muzaffarnagri, and Sonadi were components of clade B in the phylogenetic analysis19. Subsequently, haplogroup B was identified in four more breeds (Chokla, Jaisalmeri, Patanwadi, and Garole), representing 15% of total samples. Additionally, Haplogroup C was observed at low frequency (< 1%) in Nali and Jaisalmeri sheep, which are again the breeds of NWASA region20. Interestingly, haplogroup B and C were observed for the first time in the other three major agro-ecological zones of India, in the present study. Our observations of identification of three haplogroups in Indian sheep are in concordance with many international studies on mtDNA that have registered the presence of haplogroups A and B in Asia and predominance of haplogroup B in Europe16,18,25,26. Although haplogroup C is less frequent, however, its presence has been recorded in Turkey, Portugal, the Caucasus, and China25. In Tibetan sheep from China, major haplogroups that have been identified are A, B, and C. However, one animal out of a total of 636 sheep investigated also represented haplogroup D26. Despite the predominance of haplogroup A in India, the presence of 2 more haplogroups (B and C) in all the major agro-ecological regions of India suggests introgression due to gene flow between breeds of Asia and Europe.
The average haplotype diversity value observed in the present investigation was 0.850, which was intermediate of the previous two attempts that have explored the mitochondrial DNA diversity in Indian sheep (0.603 and 0.987), but the nucleotide value was higher than either of these studies19,20. The number of haplotypes ranged from 6–14 across different breeds, and each breed also had its unique haplotypes. All these statistics point towards high maternal genetic diversity in the populations investigated and also substantiates that the correct sampling strategy of including unrelated animals was followed. Signature of population expansion was evident based on Fu’s Fs, Tajima’s D, goodness-of-fit indices, and mismatch distribution assessed under sudden expansion model in sheep breeds from all the four agro-ecological zones. Mismatch distribution analysis revealed a unimodal distribution which suggests that the populations have undergone a rapid expansion in size. Fu’s Fs test which is based on haplotype frequencies as well as Tajima’s D test that relies on the difference between the number of polymorphic sites, and the mean number of nucleotide pairwise differences, also suggested expanding populations in all the major zones and whole of India. Despite vast sheep genetic diversity and huge population, lack of political will and financial support have resulted in poor coverage of indigenous sheep under genetic improvement programs. Therefore, it has not been possible to curtail mating between breeds in neighboring areas because of shared pastures. This inference was also supported by the presence of a large number of haplotypes that differed from the most common haplotype by few variations only.
Based on the microsatellite diversity and differentiation, Indian sheep breeds had sufficient diversity, as evidenced by the number of alleles and heterozygosity per breed. Except for Tibetan and Bonpala, the mean observed number of alleles (Na) across other breeds was more than 8. Similarly, excluding Tibetan, mean observed heterozygosity (Ho) was more than 0.6 in all the populations (Table 4). Genetic variation of similar magnitude, in terms of allele diversity (Na, 6.04–9.95) and observed heterozygosity (Ho, 0.62–0.78), has been reported in 17 Indian sheep breeds22. Estimates were also comparable with that of the Kajali sheep, that belong to the NWASA region21 and four sheep populations of the SP region; Madras Red, Mecheri, and Pattanam27 and Tiruchi Black28. The genetic variability observed in this study was higher than those reported in Jordan sheep29, and sheep breeds of the neighboring country, Pakistan30. However, it was comparable to the values reported across the world, for sheep31,32. Much higher diversity indices were recorded in Turkish sheep breeds that can be attributed to Turkey being considered as a major sheep domestication center33.
The lowest diversity parameters (Na, 5.96; Ho, 0.52) were observed in the Tibetan sheep which may be resulting from its extremely small population size. It is an endangered sheep breed with only 215 animal left8. Genetic variation of similar magnitude (0.46–0.55) has also been reported in the endangered Namaqua Afrikaner sheep, indigenous to South Africa34, whereas, higher heterozygosity has been reported in the two endangered Spanish breeds, Churratensina (0.66) and Churralebrijana (0.67), despite their small population size35. It should be mentioned here that the comparison of diversity estimates should be considered as a suggestive indication of diversity since microsatellite loci used across multiple studies are not uniform. A significant positive FIS value was observed for all the breeds, except very small negative values in Poonchi and Bonpala (Table 4), thereby suggesting a lack of heterozygote in 9 breeds. The highest deficiency (23.4%) was encountered in Tibetan sheep that pointed towards the possibility of inbreeding. However, this could also be attributed to the limited samples or to a shared ancestry of the few samples, which otherwise have been treated as unrelated. A reasonably high FIS value could also be an outcome of sex-biased dispersal, a characteristic of a dwindling population or a consequence of the breeding system. Such an elevated level of inbreeding is a risky proposition as it could lead to genetic diseases. Moreover, it can negatively affect animal health. Higher heterozygote deficiency is generally observed in Indian sheep breeds15,22,36. Results for Poonchi (− 0.007) and Bonpala (− 0.021) can be interpreted as possible signs of outbreeding, most likely due to a recent admixture of two (or more) populations.
Genetic relationship and differentiation in the 11 Indian sheep breeds were found to be correlated with the geographic distance, shared ancestral variation, and historical gene flow, according to the multiple approaches employed in this study. Populations depicted a moderate, but significant genetic differentiation (FST = 0.129 ± 0.012). The Tibetan breed appeared as a distinct breed separated from the others. These results reflected that the within-breed genetic variation was high (87.1%) and this variation could be a valuable tool for genetic improvement and conservation of Indian sheep. Genetic differentiation of similar magnitude (11.1%) has been documented in some other indigenous sheep breeds22. FST values of the examined Indian sheep breeds were comparable to those reported for Baltic, Chiapas, and Akkarman sheep33. However, it was much higher than the FST values reported for several others like Spanish, European, Middle Eastern, Colombian, Pakistani, and Romanian sheep breeds30,32,37. Much lower differentiation (5.6–5.9%) has been reported in Indian sheep breeds of SP region15,27. In the present study also, the breeds of SP region were less differentiated. Bellary and Hassan were least differentiated among them which may be attributed to the considerable exchange of germplasm among the two populations (Nm = 22.55). Differentiation can be an outcome of geography and reproductive isolation, genetic drift, and founder effect, different production goals, and preferences for the specific traits. In the present study, overall moderate genetic differentiation could be due to the fact that breeds were spread across three agro-ecological zones and larger geographical distances, in India. Similarly, visualization of breed relationship through NJ tree showed the clustering of the breeds, mainly in congruence with their geographic location (Fig. 3).
The phylogenetic relationship was substantiated by the cluster analyses. The results of STRUCTURE (Fig. 4) support the proposition that Changthangi, Bonpala, Tibetan, and Shahbadi are genetically distinct breeds, while Rampur Bushair and Poonchi are admixed sheep populations. Similarly, all the four sheep breeds of the SP region had indications of gene flow among them. Distinct clusters were most likely the result of different geographic distributions.
Management practices, ecological factors, lack of breeding policies, and geographic continuity in the absence of prominent natural barriers, might have resulted in intermixing among some of the breeds. Sheep management and husbandry practices vary from temperate to tropical ecology of India. Sheep production in the country is largely a nomadic proposition. It is either a pastoral system, where farmers move from place to place along with their flocks or an extensive system in which flocks graze in community land, pastures, and harvested fields, in nearby areas. The intensive scientific production system is restricted only to organized and government farms. The migratory system is the cornerstone of sheep farming in the NT region that falls under the influence of the Himalayas. This migration may be of short duration to neighboring locations or permanent, to long distances over extensive areas. In the plains of the ET region, most of the flocks are stationary and in its Himalayan region, flocks are migratory. Flocks graze on foothills and in the valleys in winter and move to high altitude forests and alpine meadows in summers. In the SP and NWASA regions, migration takes place during the drought conditions in search of foraging and drinking water resources.
Rampur Bushair and Poonchi of the NT region cluster in a single group. This observation is very well supported by the past and prevailing management practices, as well as, the breeding policies. Firstly, shepherds regularly or seasonally migrate with their flocks in search of grazing areas and pasture lands, causing intermixing among the animals. Secondly, most of the animals in this area have been involved in cross-breeding with exotic fine wool breeds (Merino, Rambouillet), for increasing apparel wool production with the concomitant dilution of these breeds. Geographic continuity, ecological similarity, and cultural connections amongst the major sheep rearing communities of the SP region could be contributing towards the gene flow between different sheep populations.
Following the results, conservation programs need to be undertaken for endangered sheep breeds that are threatened with extinction. These include Changthangi, Rampur Bushair, Poonchi, Bonpala, and Tibetan4. Tibetan and Bonpala should be prioritized for conservation. These two unique gene pools have low genetic diversity. It is worth mentioning that maternal lineage B dominates in Tibetan sheep, unlike other Indian sheep breeds. Lineage C in Tibetan and Bonpala pointed towards their distinct evolutionary history from other Indian sheep breeds. Similarly, Changthangi represents a unique gene pool with good adaptation to the extremely cold and low oxygen conditions. These animals thrive in the high-altitude cold desert, where, the average altitude of the area is around 14,600 m above sea level. Habitat of Tibetan, Bonpala, and Changthangi is common across the Tibetan plateau38. Tibetan and Bonpala sheep migrated to the Eastern agro-ecological region of India with the Tibetan traders, who used them as beasts of burden for transporting various merchandise39. They might share ancestral variation and historical gene flow with the sheep breeds of the high-altitude Himalayan region of China. All the three temperate breeds might carry unique adaptation alleles, therefore, these should be considered as a priority group for conservation. Moreover, sustainable breeding programs and policies are required to prevent indiscriminate gene flow among populations and regions, as well as, crossbreeding with the exotic sheep breeds.
Animal breeding farms and research institutes can play a pivotal role in the planning and execution of conservation programs. The establishment of breed associations and nucleus breeding farms is the first step towards conservation and sustainable utilization of indigenous sheep resources. Frequent exchange of the rams among farms and flocks of the same breed needs to be in place to increase the genetic diversity. Incentives to livestock keepers, extension, and health services to migratory flocks, availability of quality breeding rams for each breed, and community involvement can contribute enormously to this endeavor.
Conclusion
The findings of this study contribute to the knowledge of the genetic diversity and the structure of native Indian sheep breeds. Data generated could be a useful source for breeding management to decrease heterozygote deficiency and breed dilution, and to ensure sustainable management and conservation activities to preserve sheep's genetic heritage. Overall, Tibetan sheep emerged out to be the most interesting gene pool, from both microsatellite and mitochondrial analysis. Genetic diversity indices warrant immediate scientific management and conservation of this invaluable germplasm, that is considered endangered at the moment.
Materials and methods
Animals
In total, 477 animals of 11 breeds (Rampur Bushair, Poonchi, Changthangi, Bonpala, Tibetan, Shahbadi, Balangir, Kenguri, Bandur (Mandya), Hassan and Bellary) were sampled after field survey in their respective distribution areas, spread across Northern, Eastern and Southern India (Fig. 1). Animals included in this study represented the original autochthonous phenotype. Guidelines of Measurement of Domestic Animal Diversity program10 were followed in selecting animals. For each breed, blood samples were collected from unrelated sheep belonging to multiple flocks, and across different villages. Farmers were interviewed in detail to ascertain the unrelatedness of collected samples. Blood sampling was carried out between 2015 and 2019. All the 477 samples were utilized for the microsatellite analysis, whereas the first 20–22 samples of each breed were used for the mitochondrial study. Genomic DNA was extracted from blood using standard phenol–chloroform protocol40. The quantity of DNA was analyzed using a NanoDrop 1000 spectrophotometer (Thermo Scientific).
Ethics statement
Sheep blood (~ 5 ml) was aseptically collected by the trained veterinarians from the jugular vein with the consent of the farmers. The study has the approval of the Institute Animal Ethics Committee of Indian Council of Agricultural Research, National Bureau of Animal Genetic Resources, Karnal (PME Ref. 11/ 2017-18), and their relevant guidelines were followed during blood sample collection.
Amplification and sequencing of mtDNA
DNA samples from all 11 breeds were amplified using previously reported primers and cycling conditions41. Primers, F-AACTGCTTGACCGTACATAGTA, and R-AGAAGGGTATAAAGCACCGCC amplified a 1,246 bp fragment that spans part of the control region, the tRNA-Phe and 12S rRNA coding RNA genes. The amplified fragments were enzymatically purified using Exonuclease I and Antarctic Phosphatase (New England Biolab, USA), followed by sequencing in ABI 3100 Automated DNA Sequencer (Applied Biosystems, USA) using Big Dye Terminator Cycle Sequencing Kit (Applied Biosystems, USA).
Analytical methodologies for mtDNA
The amplified sequences were aligned using the Clustal W algorithm embedded in MEGA version 6.042. The genetic diversity indices such as the number of haplotypes, haplotype diversity, nucleotide diversity, and the number of polymorphic sites for each breed and across all breeds were estimated using DnaSP v543. The analysis of molecular variance within and among agro-ecological zones was done using the AMOVA program implemented in the Arlequin version 3.544. To get a comprehensive insight into the phylogenetic relationship of Indian sheep in the context of global ovine diversity, sequences from previous studies in Indian sheep19 and 10 reference sequences for the 5 known sheep haplogroups identified across the globe were retrieved from NCBI, and a meta-analysis was carried out with the sequences of the present study. The purpose of this exhaustive analysis was to include breeds that represent all the four major agro-ecological regions of India. Departures from neutrality using Fu’s Fs45 and Tajima’s D statistics46, demographic changes based on Harpending's raggedness index, and the sum of squared deviations (SSD) between the observed and expected mismatch, as well as, population pairwise FST were examined using Arlequin version 3.544. Reference sequences of the 5 known sheep mitochondrial haplogroups were retrieved from NCBI, as per the assignment given by Meadows et al.18: Haplogroup A: DQ852105 and DQ852113; Haplogroup B: DQ852157 and DQ852189; Haplogroup C: DQ852252 and DQ852260; Haplogroup D: DQ852274 and DQ852275; and Haplogroup E: DQ852276 and DQ852278. These sequences were included along with the sequences obtained in this study for constructing a Median-joining (MJ) network using NETWORK 10.0.0.047, and Neighbour-joining (NJ) tree using MEGA6, for exploring the association between the haplotypes.
Nuclear microsatellite genotyping
Twenty-five International Society of Animal Genetics and FAO-recommended sheep nuclear microsatellites were analyzed in all the individual animals10. These simple sequence repeat markers (nSSR) were highly polymorphic, spread all over the genome, and with the ability to co-amplify in PCR reactions (Supplementary Table S4). The workflow consisted of (1) amplification using a fluorescent dye (FAM, HEX, NED and PET) labeled primers, as described by Sharma et al.8 (2) multiplex genotyping using an ABI 3100 Automated DNA Sequencer (Applied Biosystems, USA) along with the internal line control (3) electropherogram analysis using GeneMapper software (Applied Biosystems, USA) for allele size estimation, and (4) statistical analysis using different software packages.
Data analysis for nSSR
Allelic polymorphisms at each nSSR locus were evaluated. The program MICRO-CHECKER48 was used to examine large allele dropout, stuttering, and null alleles as potential sources of error. The genotype data were analyzed using GenAlEx 6.5 software49 to calculate allele frequencies at each locus for each population, the average number of allele per population; observed (Na) and effective numbers of alleles (Ne) and heterozygosity values (observed, Ho and expected, He, Shannon information index (I) and heterozygote deficit (FIS) per locus across breeds and markers. Average values were expressed as Mean ± SE from values at each locus. Chi-square tests of deviations from Hardy–Weinberg equilibrium (HWE) were derived.
To study the relationships and the genetic differentiation among tested populations, different F statistics estimates50, analysis of molecular variance (AMOVA), and genetic distances were applied. Nei’s genetic distance51 and Cavalli-Sforza Chord distance were estimated by GenAlEx 6.549. Statistical significance of F estimates was tested using FSTAT, version 2.9.3 software52. The dendrograms of phylogenetic trees were built from different distance matrices and were visualized by MEGA642 using the NJ approach. Genotypic disequilibrium was tested by using FSTAT 2.9.352 for all pairs of loci with 1,000 permutations. Population assignment was performed using multilocus genotypes of individuals. STRUCTURE ver 2.3.4 software53 was used to further assess the degree of population differentiation within and between the 11 populations. Individuals were assigned into clusters using the Bayesian method under an admixture model. The algorithm estimates allele frequencies for each gene pool (cluster) and population memberships for every individual. The Simulation was run ten times for each value of K (2–13), where K is the number of tested clusters. All runs were performed with a 300,000 burnin period and 500,000 MCMC repeats after burnin. Software CLUMPP54 was used to align multiple replicates for each K to facilitate the interpretation of clustering results. The DISTRUCT application55 was used to graphically display the results. To determine the optimal number of groups (K), we utilized both the log-likelihood method of Prichard et al.56 as well as ΔK value of Evanno et al.57 and was calculated and plotted using Structure Harvester application58.
Data availability
Sequences of unique mitochondrial DNA haplotypes of each sheep breed were submitted in GenBank (Accession numbers: MK061173 - MK061283).
References
Meadows, J. R. S. Sheep: domestication. In Encyclopedia of Global Archaeology (ed. Smith, C.) (Springer, New York, 2014).
20th Livestock Census. BAHS-basic animal husbandry statistics: Department of Animal Husbandry, Dairying & Fisheries, Ministry of Agriculture, Government of India. Available online from www.dahd.nic.in (2019).
Acharya, R.M. Sheep and goat breeds of India. FAO Animal Production and Health. Paper, 30. FAO, Rome, Italy (1982).
Bhatia, S. & Arora, R. Biodiversity and conservation of Indian sheep genetic resources-an overview. Asian-Australas. J. Anim. Sci. 18, 1387–1402 (2005).
Sheriff, O. & Alemayehu, K. Genetic diversity studies using microsatellite markers and their contribution in supporting sustainable sheep breeding programs: a review. Cogent Food Agric. 4, 1459062 (2018).
Zhong, Y., Tang, Z., Huang, L., Wang, D. & Lu, Z. Genetic diversity of Procambarusclarkii populations based on mitochondrial DNA and microsatellite markers in different areas of Guangxi, China. Mitochondrial DNA Part A. https://doi.org/10.1080/24701394.2020.1721484 (2020).
FAO. The state of food and agriculture: Climate change, agriculture, and food security. Published by the Food and Agriculture Organization of the United Nations, Rome. ISBN 978-92-5-109374-0. https://www.fao.org/3/a-i6030e.pdf (2016).
Sharma, R. et al. Genetic diversity estimates point to immediate efforts for conserving the endangered Tibetan sheep of India. Meta Gene 8, 14–20 (2016).
Zhai, D. D., Li, W. J., Liu, H. Z., Cao, W. X. & Gao, X. Genetic diversity and temporal changes of an endemic cyprinid fish species, Ancherythroculter nigrocauda, from the upper reaches of Yangtze river. Zool. Res. 40(5), 427–438 (2019).
FAO. Molecular genetic characterization of animal genetic resources. FAO Animal Production and Health Guidelines. No. 9. Rome (2011).
Mason, A. S. SSR genotyping. Methods Mol. Biol. 1245, 77–89 (2015).
Presti, F. T. & Wasko, A. P. A review of microsatellite markers and their application on genetic diversity studies in parrots. Open J. Genet. 4(2), 69–77 (2014).
Ghazy, A. et al. Genetic diversity and distances of three Egyptian local sheep breeds using microsatellite markers. Zoology 3, 1–9 (2013).
Ceccobelli, S. et al. Genetic diversity of Cornigliese sheep breed using STR markers. Small Rumin. Res. 123, 62–69 (2015).
Yadav, D. K., Arora, R. & Jain, A. Classification and conservation priority of five Deccani sheep ecotypes of Maharashtra, India. PLoS ONE 12(9), e0184691 (2017).
Luikart, G. et al. Multiple maternal origins and weak phylogeographic structure in domestic goats. Proc. Natl. Acad. Sci. U.S.A. 98, 5927–5932 (2001).
Bruford, M. W., Bradley, D. G. & Luikart, G. DNA markers reveal the complexity of livestock domestication. Nat. Rev. Genet. 4, 900–910 (2003).
Meadows, J. R. S., Cemal, I., Karaca, O., Gootwine, E. & Kijas, J. W. Five ovine mitochondrial lineages identified from sheep breeds of the near east. Genetics 175, 1371–1379 (2007).
Arora, R., Yadav, D. K. & Mishra, B. P. Mitochondrial DNA diversity in Indian sheep. Livest. Sci. 153, 50–55 (2013).
Singh, S., Kumar, S. Jr., Kolte, A. P. & Kumar, S. Extensive variation and sub-structuring in lineage A mtDNA in Indian sheep: genetic evidence for domestication of sheep in India. PLoS ONE 8(11), e77858 (2013).
Singh, S. et al. Microsatellite based genetic diversity estimation in Kajali sheep and its phylogenetic relationship with other indigenous sheep breeds. Indian J. Anim. Sci. 87, 1097–1101 (2017).
Arora, R., Bhatia, S., Mishra, B. P. & Joshi, B. K. Population structure in Indian sheep ascertained using microsatellite information. Anim. Genet. 42(3), 242–250 (2011).
Arora, R., Bhatia, S., Mishra, B. P., Jain, A. & Prakash, B. Diversity analysis of sheep breeds from Southern peninsular and Eastern regions of India. Trop. Anim. Health Prod. 43, 401–408 (2011).
Arora, R., Bhatia, S., Yadav, D. K. & Mishra, B. P. Current genetic profile of sheep breeds/populations from North-western semi-arid zone of India. Livest. Sci. 135, 193–198 (2011).
Tapio, M. et al. Sheep mitochondrial DNA variation in European, Caucasian, and Central Asian areas. Mol. Biol. Evol. 23, 1776–1783 (2006).
Liu, J. et al. Genetic diversity and phylogenetic evolution of Tibetan sheep based on mtDNA D-Loop sequences. PLoS ONE 11(7), e0159308 (2016).
Ramachandran, A. et al. Microsatellite- based phylogeny of Indian sheep breeds. Indian J. Anim. Sci. 85(11), 1209–1214 (2015).
Kavitha, S. T., Subramanian, A., Sivaselvam, S. N., Thiagarajan, R. & Balasubramanian, S. Genetic characterization of Tiruchy Black sheep of Tamil Nadu using microsatellite markers. Indian J. Anim. Res. 49(3), 320–324 (2015).
Jawasreh, K. I., Ababneh, M. M., Ismail, Z. B., Younes, A. M. B. & Sukhni, I. A. Genetic diversity and population structure of local and exotic sheep breeds in Jordan using microsatellites markers. Vet. World 11, 778–781 (2018).
Naqvi, A. N. et al. Assessment of genetic diversity and structure of major sheep breeds from Pakistan. Small Rumin. Res. 148, 72–79 (2017).
Vahidi, S. M. F. et al. Multilocus genotypic data reveal high genetic diversity and low population genetic structure of Iranian indigenous sheep. Anim. Genet. 47, 463–470 (2016).
Dudu, A. et al. Assessment of genetic diversity in main local sheep breeds from Romania using microsatellite markers. Arch. Anim. Breed. 63, 53–59 (2020).
Ozmen, O., Kul, S. & Gok, T. Determination of genetic diversity of the Akkaraman sheep breed from Turkey. Small Rumin. Res. 182, 37–45 (2020).
Qwabe, S. O., Marle-Köster, E. V. & Visser, C. Genetic diversity and population structure of the endangered Namaqua Afrikaner sheep. J. Anim. Breed. Genet. 45, 511–516 (2013).
Calvo, J. H., Alvarez-Rodriguez, J., Marcos-Carcavilla, A., Serrano, M. & Sanz, A. Genetic diversity in the Churra tensina and Churra lebrijana endangered Spanish sheep breeds and relationship with other Churra group breeds and Spanish mouflon. Small Rumin. Res. 95, 34–39 (2011).
Sodhi, M., Mukesh, M. & Bhatia, S. Characterizing Nalli and Chokla sheep differentiation with microsatellite markers. Small Rumin. Res. 65, 185–192 (2006).
Ocampo, R., Martínez, R., Rocha, J. & Cardona, H. Genetic characterization of Colombian indigenous sheep. Revista Colombiana de Ciencias Pecuarias 30, 116–125 (2017).
Gorkhali, N. et al. Genomic analysis identified a potential novel molecular mechanism for high-altitude adaptation in sheep at the Himalayas. Sci. Rep. 6, 29963 (2016).
Government of Bengal. Proceedings of Bengal Government General. July 1864. H.C.Wake, Superintendent of Darjeeling, to Undersecretary to Govt. of Bengal. 277, 47–57 (1864).
Sambrook, J., Fritsch, E. F. & Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd edn. (Cold Spring Harbour Laboratory Press, New York, 1989).
Meadows, J. R. S. et al. Mitochondrial sequence reveals high levels of gene flow between breeds of domestic sheep from Asia and Europe. J. Hered. 96(5), 494–501 (2005).
Tamura, K., Stecher, G., Peterson, D., Filipski, A. & Kumar, S. MEGA6: Molecular evolutionary genetics analysis version 6.0. Mol. Biol. Evol. 30, 2725–2729 (2013).
Librado, P. & Rozas, J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25, 1451–1452 (2009).
Excoffier, L. & Lischer, H. E. L. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Resour. 10, 564–567 (2010).
Fu, Y. X. Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection. Genetics 147(2), 915–925 (1997).
Tajima, F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123, 585–595 (1989).
Bandelt, H. J., Forster, P. & Rohl, A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16, 37–48 (1999).
Van Oosterhout, C., Hutchinson, W. F., Wills, D. P. M. & Shipley, P. MICROCHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol. Ecol. Notes 4, 535–538 (2004).
Peakall, R. & Smouse, P. E. GenAlEx 6.5: genetic analysis in Excel Population genetic software for teaching and research—an update. Bioinformatics 28(19), 2537–2539 (2012).
Weir, B. & Cockerham, C. Estimating F-statistics for the analysis of population-structure. Evolution 38(6), 1358–1370 (1984).
Nei, M. Genetic distance between populations. Am. Nat. 106(949), 283–292 (1972).
Goudet, J. FSTAT (Version 2.9.3): A program to estimate and test gene diversities and fixation indices. Available online from http:wwwunivch/izea/softwares/fstathtml (2001).
Pritchard, J.K. STRUCTURE Software Ver 2.3.4 Retrieved from: https://web.stanford.edu/group/pritchardlab/structure_software/release_versions/v2.3.4/html/structure.html (2009).
Jakobsson, M. & Rosenberg, N. A. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinformatics 23, 1801–1806 (2007).
Rosenberg, N. A. DISTRUCT: a program for the graphical display of population structure. Mol. Ecol. Resour. 4, 137–138 (2004).
Pritchard, J., Stephens, M. J. & Donnelly, P. J. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Evanno, G., Regnaut, S. & Gould, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
Earl, D. A. & VonHoldt, B. M. Structure Harvester: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
Acknowledgements
This work was supported by the Indian Council of Agricultural Research, under Network Project on Animal Genetic Resources.
Author information
Authors and Affiliations
Contributions
R.S. and S.A. designed the study; M.S.T. managed resource sheep populations and performed sampling; H.S., P.S., and P.P. carried out the experiment; R.S., S.A., and R.A. performed data analysis and R.S. and S.A. wrote the manuscript. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Sharma, R., Ahlawat, S., Sharma, H. et al. Microsatellite and mitochondrial DNA analyses unveil the genetic structure of native sheep breeds from three major agro-ecological regions of India. Sci Rep 10, 20422 (2020). https://doi.org/10.1038/s41598-020-77480-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-77480-6
This article is cited by
-
Maternal genetic diversity and phylogenetic analysis of Indian riverine and swamp buffaloes: insights from complete mitochondrial genomes
Mammalian Genome (2024)
-
Genetic variation and demographic history of Sudan desert sheep reveal two diversified lineages
BMC Genomics (2023)
-
Evidence for independent domestication of sheep mtDNA lineage A in India and introduction of lineage B through Arabian sea route
Scientific Reports (2021)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.