Introduction

The Mongolic-speaking Kalmyks currently inhabiting the steppes to the west of the mouth of the Volga River close to the Caspian Sea coast are the descendants of the Oyrats (or the Oirads, Oyrads, Oyirads) originating from Western Mongolia.1, 2 In the 13th century, the Oyrat confederation was established consisting of the Torguud (or Torgut, Torgaut, Torgoud), Dörwöd (or Dörböd, Derbet), Khoshuud (or Khoshut, Khoshugut) and Khoyd (or Khoyt, Khoit) tribes.1 In the 17th century, the largest Oyrat tribal unions of the Torguuds and Dörwöds migrated to the steppes of Western Siberia, but later they relocated to the left bank of the Volga River. Therefore, for over 380 years (from 1630), all the descendants of the Oyrats (currently classified as Kalmyks) have been living interspersed among other ethnic groups of this region. In the Republic of Kalmykia, the Torguuds and Dörwöds are the numerically dominant subdivisions of the Kalmyks. The Torguuds are located in the eastern and southeastern regions of the Republic. The Dörwöds primarily inhabit the northern and central regions of Kalmykia, and the Khoshuuds, few in number, are located in the Torguud and Dörwöd regions.

There are several genetic studies in which different Kalmyk populations were analyzed. An analysis of biochemical polymorphisms (blood groups, serum proteins and red cell enzymes) has been performed in three tribes of the Kalmyks: the Torguuds, Dörwöds and Buzawa (or Buzavs).3 The Buzawa are descendants of those Torguuds, Dörwöds and Züüngars who migrated to the Don River region and intermingled with other ethnic groups of this region during the 18th century. This genetic study revealed that close genetic affinities exist between the Dörwöds and Buzawa, but both populations differ significantly from the Torguuds. As an ethnic group, however, the Kalmyks genetically resemble the contemporary Buryats of the Baikal region of southeastern Siberia and the Mongols of Mongolia.3 The mitochondrial DNA and Y-chromosome results also indicate a close relationship between the Kalmyks and Mongols4, 5, 6 and Buryats.7, 8 More detailed analysis of mitochondrial DNA variation in Northern Asia has shown that the Kalmyks are clustered together with the Mongolic-speaking populations of the Buryats, Barghuts, Khamnigans, Mongols and Turkic-speaking Sojots, thus suggesting their origin from a common maternal ancestral gene pool.9 The same trend is also evident for some of the Y-chromosome lineages—a relatively high frequency of haplogroup C3d-M407 has been revealed in all populations mentioned above, thus suggesting that this subhaplogroup is characteristic of the Mongolic-speaking populations.10

Meanwhile, population samples studied to date using mitochondrial DNA and Y-chromosome markers represent heterogeneous collections of individuals of different tribal ancestry living in Elista city, the capital of the Kalmyk Republic; therefore, molecular data obtained may simply result from sampling effects. Therefore, it would be reasonable to study descendants from different tribes of the Kalmyks obtained from different locations in Kalmykia. Apparently, such sets of samples may more accurately represent the overall genetic diversity in the Kalmyks. In this article, we explore the genetic polymorphisms of three tribes of the Kalmyks by characterizing the Y-chromosome variation through analysis of high-resolution biallelic markers and short tandem repeat (STR) typing. This approach allowed us to reconstruct the genetic structure of separate Kalmyk subpopulations (tribes of the Torguuds, Dörwöds and Khoshuuds) and the Kalmyks collectively as an ethnic group. The genetic relationships between the Kalmyks and other populations of Central and Eastern Asia as well as South Siberia were examined to clarify the paternal history of the Kalmyks.

Materials and methods

Subjects and DNA typing

Hair root samples of the Kalmyk males were collected from different locations in the Republic of Kalmykia, the autonomous Republic within the Russian Federation, during field expedition conducted in 2007 (Supplementary Figure S1, Supplementary Table S1). All the samples were collected with ethical approval at the Institute of Biological Problems of the North in Magadan, Russia, and informed consent was written in Russian.

DNA was extracted from the hair roots as described elsewhere.11 The Y-chromosome single-nucleotide polymorphism was characterized using PCR primers summarized in Karafet et al.12 Markers M25, M40, M48, M73, M120, M124, M128, M174, M217, M231, M267, M269, M314, M356, M407, M410, M458, MEH2, P31, P36.2 and P43 were analyzed through direct sequencing. All these polymorphisms were typed on ABI 3130 and ABI 3500xL Genetic Analyzers (Applied Biosystems, Foster City, CA, USA). Markers Lly22g, M9, M17, M77, M89, M122, M130, M170, M172, M173, M201, M207, M214, M242, M253, P37.2, RPS4Y, SRY1532, Tat and 92R7 were detected by PCR-restriction fragment length polymorphism analysis.13, 14, 15, 16, 17, 18, 19, 20, 21 An insertion/deletion polymorphism at DYS287 (YAP), 12f2 and MSY2.2 was typed as described elsewhere.13, 22, 23

Samples were additionally typed for 17 Y-STR loci (DYS19, DYS385a, DYS385b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, GATA-H4, DYS448, DYS456, DYS458 and DYS635) using AmpFl-STR YFiler PCR Amplification kit (Applied Biosystems) according to the manufacturer’s instructions. Products of amplification were analyzed on ABI 3500xL Genetic Analyzer. Electrophoresis results were analyzed using GeneMapper Software v. 4.1 (Applied Biosystems).

Statistical analysis

Loci DYS385a and DYS385b were excluded from statistical analysis of population data sets because an unambiguous assignment of the alleles to these loci is impossible without their separate typing. In addition, we excluded DYS19 because it is duplicated in some haplogroups, particularly in haplogroup C3c1-M77. The allele sizes for DYS389II were determined with the subtraction of DYS389I, and both loci were included in the calculations.

Summary statistics were calculated using ARLEQUIN 3.5.24 Genetic differentiation between populations was estimated by means of distance methods based on the number of different alleles of the microsatellite (FST) and the sum of the squared number of repeat differences between two alleles (RST). The measure RST differs from FST in taking explicit account of the mutation process at microsatellite loci, for which a generalized stepwise mutation model appears appropriate.25 The software ARLEQUIN 3.5 was used to perform the analysis of molecular variation. The statistical significance tests for pairwise FST and RST values were performed at 1000 permutations and for analysis of molecular variation at 10 000 permutations.

Our population data sets were compared with previously described populations of Central and Eastern Asia and South Siberia belonging to different groups of Altaic family of languages, for which data for 12 Y-STR loci were available. These populations included Buryats,26 Tibetans,27 Manchu28 and Altaian Kazakhs.29 In addition, the population samples of the Mongols, Khamnigans, Sojots, Southern Altaians, Evenks, Khakassians and Tuvinians (a total of 462 individuals) typed for 12 Y-STR loci (DYS19, DYS385a, DYS385b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438 and DYS439) using PowerPlex Y System (Promega, Madison, WI, USA) were also analyzed. These population samples have been partially published by us,9, 10, 30, 31 but the complete set of Y-STR profiles is included here, in Supplementary Table S2.

Population pairwise FST values were also calculated from the Y-chromosome haplogroup frequencies to assess the genetic similarity of the comparative Kalmyk tribes, the Kalmyks collectively and South Siberian and Central Asian ethnic groups. As there is a scarcity of high-resolution haplogroup information in Central and Eastern Asian populations, we have used only the following South Siberian and Central Asian populations in comparative analysis: Tuvinians, Khakassians, Southern Altaians, Buryats, Yakuts, Altaian Kazakhs and Kyrgyzs from the study by Kharkov32 and additional sample of Altaian Kazakhs from Dulik et al.29

Between-population distances (in the form of FST or RST values) were illustrated by constructing a neighbor-joining tree using the program MEGA 5.0533 and by creating a multidimensional scaling plot using the software package STATISTICA v. 7.1 (StatSoft, Tulsa, OK, USA).

The age of STR variation within haplogroups was estimated as the average squared difference in the number of repeats between all current chromosomes and the founder haplotype (formed by the median values of the repeat scores at each STR locus within the haplogroup), averaged over STR loci and divided by means of a mutation rate.34 The genealogical mutation rate equal to 2.5 × 10−3 per 35 years35 was used in the calculations.

Median network analysis

Median-joining networks of the Kalmyk Y-STR haplotypes were constructed using the Network 4.6 program (http://www.fluxus-engineering.com). Ambiguous loci DYS385a and DYS385b were excluded from median network analysis of individual haplogroups (with the exception of the C3d-M407 haplogroup). In addition, DYS19 locus was excluded from analysis of the C3c1 haplogroup. For the network construction, the weight of each character within the network was assigned to be inversely proportional to the number of mutations at character.36 The weights varied from the default value of 10 at characters with 1 mutation per character to the value of 1 at fast-mutating characters.

For reconstruction of haplogroup R1a1a-M17 phylogeny, published 12-loci STR haplotypes for South Siberians (Southern Altaians, Buryats, Sojots, Evenks, Khakassians, Tuvinians, Shors and Tofalars),7 Central Asians (Mongols,7 Tajiks37 and Pathans from Afghanistan38), East Europeans (Russians),7 Western Asians (Persians and Kurds of Iran)37 and South Asians (Indians)39 were included in the analysis. For median network analysis of haplogroup R2a, Y-STR haplotypes for Buryats,31 Persians, Kurds, Tajiks,37 Pathans38 and Indians39 were used. For haplogroup N1c-Tat analysis, the following haplotypes for South Siberians and Central Asians (the Mongols, Southern Altaians, Buryats, Sojots, Evenks, Khakassians, Tuvinians, Shors, Tofalars, Yakuts, Evens and Koryaks)30 and East Europeans (Russians,30 Udmurts, Maris, Bashkirs, Chuvashes,40 Komi,41, 42 Khanty and Mansi43) were also used.

Results

In total, 23 Y-chromosome haplogroups were identified in the Kalmyks (Table 1). However, more than half (56.1%) of the Kalmyk Y-chromosomes belonged to three C3-M217-derived haplogroups (C3*-M217, C3c1-M77 and C3d-M407). N1c-Tat, O2-P31, O3-M122 and R2a-M124 were also common haplogroups, encompassing about 30% of the total male population. The Kalmyk tribes of the Torguuds, Dörwöds and Khoshuuds are characterized by the presence of haplogroups C3*-M217, C3c1-M77, N1c-Tat, O3-M122 and R2a-M124 haplogroups, but the Khoshuuds had no haplogroup C3d-M407 that is typical of many Mongolic-speaking populations.10 In addition, the Khoshuuds are characterized by a high frequency of haplogroups N1c-Tat and O3-M122, whereas the Dörwöds and Torguuds had a high frequency of haplogroups R2a-M124 and O2-P31, respectively. The fourth tribal group that can be retrieved from our data is the Buzawa (Supplementary Table S1) but the sample size of this group is small (23 individuals). Nevertheless, we can report that this subpopulation also had high frequencies of C3-haplogroups (8.7% of C3*-M217, 39.1% of C3c1-M77 and 8.7% of C3d-M407), N1-haplogroups (8.7% of N1a-M128, 4.3% of N1b-P43 and 4.3% of N1c-Tat) and haplogroup R2a-M124 (17.4%). Haplogroups R1a1a-M17 and G-M201, each having a frequency of 4.3%, were also found in the Buzawa.

Table 1 Y-chromosome haplogroup frequencies (in percentages) in the Kalmyks

Population pairwise FST values calculated from the Y-chromosome haplogroup frequencies among different tribes of the Kalmyks and between them and other comparative groups are summarized in Supplementary Table S3. The relative affinities of populations studied are shown in Supplementary Figure S2. This neighbor-joining tree indicates the Dörwöds and Torguuds, as well as the Kalmyks collectively, are much more similar genetically to each other compared with the Khoshuuds. Comparing the Kalmyks with other Siberian populations, it is surprising that the Kalmyks cluster with Altaian Kazakhs. On the basis of Y-chromosome haplogroup frequencies, the Kalmyks differ from the other comparative groups, especially from the Mongolic-speaking Buryats.

Analysis of 17 fast evolving Y-STRs provided additional details of the relative affinities of the Kalmyk tribes. We reduced the 17-STR profile to a 9-STR profile (loci DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438 and DYS439) to compare the Kalmyk data with published data sets. Population pairwise FST and RST estimates were computed from the Y-STR data (Table 2) and illustrated by creating neighbor-joining trees (Supplementary Figures S3 and S4).

Table 2 Pairwise comparisons among the Kalmyks and other populations of Siberia, Central and Eastern Asia based on Y-chromosome STR variation

The neighbor-joining tree based on the FST values (Supplementary Figure S3) shows that the Kalmyks, as well as the Dörwöds and Torguuds, cluster together with Altaian Kazakhs and Mongols, whereas the Khoshuuds were clearly separated from all of them, gathering with the Manchu and Tibetans. The results of the multidimensional scaling of the FST values presented in Supplementary Figure S4 illustrate the close genetic affinities between the Kalmyks (and the Dörwöds) and Altaian Kazakhs, whereas the Torguuds were separated from all other populations and the Khoshuuds were close to the Mongols. Somewhat different picture emerged from results of the RST values analysis. As shown in the phylogenetic tree (Supplementary Figure S5), populations formed two clusters, with most Mongolic-speaking populations (the Mongols, Kalmyks and their tribes Dörwöds and Torguuds, Buryats, Khamnigans) and genetically close to them samples of Altaian Kazakhs, Tibetans and Sojots, being separated from the other reference populations, such as the Altaians, Evenks, Manchu, Khakassians and Tuvinians, as well as the Khoshuuds. In the multidimensional scaling plot (Figure 1), the RST estimates indicated that the Khoshuuds had genetic affinities with the Evenks, whereas the Kalmyks collectively, as well as their tribes of the Dörwöds and Torguuds, clustered together with the Mongols and Altaian Kazakhs. Therefore, the Kalmyk tribes did not constitute a separate cluster and the Khoshuuds were distinctive among them. Meanwhile, one should note that based on the population pairwise FST and RST values, the Kalmyks shared a smaller distances with the Mongols (Table 2). However, only three pairs of populations exhibited nonsignificant genetic distances: Khoshuuds-Evenks and Evenks-Khakassians for the RST values and Dörwöds-Kalmyks for the FST and RST values (Table 2).

Figure 1
figure 1

Multidimensional scaling (MDS) plot of pairwise RST genetic distances using Y-chromosome STR haplotypes data for the Kalmyks and other populations of Siberia, Central and Eastern Asia. The stress value for the MDS plot is 0.112.

The analysis of molecular variation performed using the 12 populations of South Siberia, Central and Eastern Asia showed that a 15 and 20.2% of genetic differentiation (for FST and RST distances, respectively) are because of differences among ethnic groups. This is consistent with previous studies pointing to a high level of Y-chromosomal differentiation in this geographical region.7, 43, 44, 45, 46 However, our study shows that genetic differentiation is strong even within the Kalmyks as an ethnic group (Table 3). The level of genetic differentiation is higher at the inter-tribal group level than that at the interethnic group level for such closely related ethnic groups as the Kalmyks, Mongols and Altaian Kazakhs (Table 3). In addition, genetic diversity at the tribal level is comparable to that at the level of ethnic group (Table 4). It is noteworthy that this is in agreement with previous studies of Y-chromosome variation in nomadic populations of Central Asia demonstrating that there is no common ancestry at the tribal level, in contrast to the clan and lineage levels.45

Table 3 AMOVA results (in percentages of variation) based on the Y-chromosome STR diversity in the Kalmyk tribes and closely related ethnic groups
Table 4 Molecular diversity indices calculated from the Y-chromosome STR haplotypes in the Kalmyks

In order to compare patterns of distribution of Y-STR haplotypes in different tribes of the Kalmyks, we performed median network analysis of STR haplotypes within their major haplogroups, such as C3c1-M77, C3d-M407, C3*-M217, N1c-Tat, R-M207, O2-P31 and O3-M122 (Supplementary Figure S6). Haplogroup C3c1-M77 has a star-like phylogenetic structure, with the most frequent haplotype found in 69 individuals from different tribes of the Kalmyks. About one-third of the Kalmyk Y-chromosomes (27.9% in the Dörwöds, 26.8% in the Khoshuuds and 37.3% in the Torguuds) are characterized by duplication of DYS19 locus on the background of haplogroup C3c1-M77 (Table 5). The same duplication has been previously observed in different populations of Central Asia and South Siberia.4, 5, 10, 47

Table 5 Frequencies (in percentages) and the age of STR variation of some Y-chromosome haplogroups in the Kalmyks

C3*-M217 paragroup consists of two major haplotype clusters (Supplementary Figure S6). One of them is associated with deletion of the locus DYS448 and has been found at a frequency of 1.2% in the Khoshuuds, 2.4% in the Dörwöds and 7.3% in the Torguuds. Second haplotype cluster corresponded to the ‘star cluster’, which was thought to be carried by likely male-line descendants of Genghis Khan48 and found at the highest frequency (76.5%) in the Kerey clan from Kazakhstan.49 ‘Star cluster’ haplotypes represented only 3% of the Dörwöds and 2.7% of the Torguuds. Another interesting haplogroup is C3d-M407 (Supplementary Figure S6). Two separate haplotype subclusters are seen in the MJ network of this clade. One of them is characterized by allelic combination 11,18 at the DYS385 locus, whereas the overwhelming majority of C3d-M407 haplotypes in the Kalmyks are defined by allelic combination 11,11, which is probably a consequence of a deletion of allele 18 in any one of the two DYS385a and b loci lying on different arms of the Y-chromosome or gene conversion creating identical copies at the two loci during intrachromosomal recombination.50, 51, 52 Quite the contrary, the overwhelming majority of C3d-M407 haplotypes in populations of South Siberia and Central Asia are defined by allelic combination 11,18 (or sometimes 11,17 and 11,19), and the DYS385-11,11 branch is present in the Kalmyks, Mongols, Tuvinians and Altaians.10

High frequency of such unusual mutations as the DYS19 duplication, DYS448 deletion and DYS385-11,11 variant may be related to the founder effects due to the recent migration of the Kalmyk ancestors and their isolation in the North Caspian Steppe. Therefore, one can expect age-synchronous pattern in these haplogroups. However, their ages differed significantly, varying from 1300 years for DYS448-del haplotypes to about 400 years for R2a-M124 haplogroup (Table 5). The latter haplogroup, rare in South Siberian and Central Asian populations, was found at high frequency in the Dörwöds (15.2%). It is noteworthy that the age of haplogroup R2a-M124 shows the best correspondence to the historical chronology of the Kalmyks.

Figure 2 illustrates a median network of R2a-M124 haplotypes derived from 12 Y-STR loci in the Kalmyks and Buryats and populations of Southwestern and South Asia, in which this haplogroup is present at a relatively high frequency.37, 38, 39 Common haplotypes are observed in the Kalmyks and Buryats, whereas there is no haplotype sharing between these and other comparative populations. However, it is important that the R2a-M124 haplotypes found in the Kalmyks and Buryats are derivatives of the Indian haplotypes. This suggests a recent common ancestry and/or expansion of the Kalmyk R2a-M124 lineages from India.

Figure 2
figure 2

A median-joining network based on 12 Y-STR loci within the R2a-M124 haplogroup. Each circle represents a haplotype, defined by a combination of short tandem repeat (STR) markers. Circle size is shown proportional to haplotype frequency and the smallest circle represents one haplotype. The lines between circles represent mutational distance, the shortest distance being a single mutational step. Colors indicate different groups of populations: gray for the Kalmyks and Buryats,31 black for populations of Central and West Asia (Pathans from Afghanistan, Persians and Kurds from Iran and Tajiks)37, 38 and white for South Indians.39

Previous study of the Y-chromosome variability in the Kalmyks indicated that there has been no substantial admixture with Russians during the last 380 years of their neighborhood because the paternal gene pool of the Kalmyks hardly included East European-specific haplogroups, such as R1a1a-M17, N1c-Tat and I-M170.4 In our study, we have found haplotypes belonging to the R1a1a-M17 and N1c-Tat haplogroups at perceptible frequencies (3.3% and 10.1%, respectively); hence, their East European origin is quite possible. To study this question, we reconstructed median networks of haplogroups R1a1a-M17 and N1c-Tat based on STR variation in the Kalmyks, Russians and populations of South Siberia and Central Asia. The latter were taken into analysis because haplogroups R1a1a-M17 and N1c-Tat are typical not only of East Europeans but also of some populations of Siberia and Central Asia.7, 30, 32, 40, 44 In addition, haplogroup R1a1a-M17 is specific for populations of Southwestern and South Asia; therefore, we have also analyzed Y-STR haplotypes for Pathans from Afghanistan,38 Persians and Kurds from Iran,37 Tajiks37 and South Indians.39 Phylogenetic network of haplogroup R1a1a-M17 (Supplementary Figure S7) demonstrates that two haplotypes of the Kalmyks coincide with the haplotypes common among different populations of South Siberia (Tuvinians, Altaians and Sojots) and Central Asia (Tajiks and Pathans), two haplotypes with populations of Pathans, Persians and Indians, and one haplotype with South Siberians (Shors and Tuvinians). Only one common haplotype was detected in the Kalmyks and Russians (Supplementary Figure S7).

Median network analysis of N1c-Tat haplotypes shows the bipartite distribution of the Kalmyk haplotypes (Supplementary Figure S6, Figure 3). Majority of them can be found among a dozen haplotypes that were detected in South Siberians and East Europeans, and the origin of these paternal lineages cannot be established undoubtedly. The Kalmyk haplotypes are in general unique, with only one haplotype revealed in common between the Kalmyks and Tuvinians. Thus, it is possible that all these haplotypes may have originated in South Siberia and Eastern Asia. Second part of the Kalmyk N1c-Tat haplotype network is represented by a separate branch consisting of three haplotypes that were found only in the Khoshuuds—haplotypes 2, 3 and 4 in Figure 3.

Figure 3
figure 3

A median-joining network based on 12 Y-STR loci within the N1c-Tat haplogroup in the Kalmyks, East Europeans and South Siberians. Each circle represents a haplotype, defined by a combination of short tandem repeat (STR) markers. Circle size is shown proportional to haplotype frequency and the smallest circle represents one haplotype. The lines between circles represent mutational distance, the shortest distance being a single mutational step. Colors indicate different groups of populations: gray for the Kalmyks, black for the Volga-Ural populations (Komi, Udmurts, Chuvashes and Bashkirs),40, 41, 42, 43 white for the Russians30 and white circle with black dot inside for aboriginal South Siberian populations.30 Explanations for haplotypes *, 1, 2, 3 and 4 are in the main text.

A search of these STR haplotypes in the YHRD 3.0 database (http://www.yhrd.org; release 44 built on 15 July 2013; 67 430 haplotypes within 503 world populations) does not find matching haplotypes (for loci DYS19, DYS385a, DYS385b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438 and DYS439) in any population. It is noteworthy, however, that earlier haplotype (haplotype 1 in Figure 3) preceding the branch consisting of haplotypes 2–4 is the haplotype revealed in Chuvashes from the Volga region.40 This finding points to possibility of East European origin of the Khoshuud N1c-Tat branch. Rare haplotype 1 also does not occur in the YHRD 3.0 database, but earlier haplotype, indicated by a star (*) in Figure 3, was found frequently only in populations of the Ural region (in the Komi, Mansi and Khanty).41, 42, 53 Coalescence age of the Khoshuud N1c-Tat branch is about 343±58 years, pointing to a possibility that inflow of this East European paternal lineage into the gene pool of the Khoshuuds occurred soon after their appearance in the Volga-Ural region (after 1645 according to historical data2).

Discussion

The pattern of Y-chromosome diversity in a population can provide a clear overview of its origin.4, 29, 48 Haplogroup C3c1-M77 is the most frequent in all Kalmyk tribes studied. It is known that, in addition to the Kalmyks, haplogroup C3c1-M77 is widespread in Tungusic-speaking people of North-Eastern Asia, such as Evens and Evenks, and Turkic-speaking Altaian Kazakhs.54 However, the three Kalmyk tribes studied here exhibited quite different patterns of Y-chromosome haplogroup variation. In the Khoshuuds, N1c-Tat and O3-M122 were the dominant haplogroups and C3d-M407 was absent, although it appeared as a genetic marker for the Mongolic-speaking populations.10 Such composition is interesting because haplogroup O3-M122 is the dominant paternal haplogroup in Sino-Tibetan populations,55 but haplogroup N1c-Tat occurs at high frequency in different populations of Eastern and Central Siberia.30, 40, 56 However, the Torguuds showed another pattern, having high frequency of haplogroup O2-P31 and low frequency of O3-M122. The Dörwöds are characterized by high frequencies of C3d-M407 and R2a-M124. Haplogroup D-M174, albeit rarely found in the Dörwöds and Torguuds (about 2%), reveals genetic affinity to the Tibetan populations where this haplogroup was present probably before the Last Glacial Age.57 Rare N*-M231 haplotype was found in the Khoshuuds. It is suggested that this haplogroup has likely originated in Southern China and now is mostly present in southern Eastern Asian populations, such as Daic, southern Han Chinese, Tibeto-Burman and Hmong-Mien.58 Another rare N1a-M128 haplogroup, which is frequently occurred only among Kazakhs (8.1%) in Central Asia, was also detected in the Kalmyks (in the Buzawa). Haplogroup R2a-M124 is also interesting, notwithstanding it accounted relatively high frequency only in the Dörwöds (15.2%). This haplogroup is very rare in Central Asia and Siberia, being found only in the Buryats (0.3–2.7%), Kyrgyzs (1.3%), Tajiks (12.5–16.7%)31, 32, 37, 59 and also in Chinese populations (in Uygurs, Han, Hui at a frequency of 2–7%).60 Haplogroup R2a-M124 has a predominant distribution in South Asia (in India and Pakistan);38, 39, 60, 61 thus, its presence in the Kalmyks and other Central Asians is likely a result of male influence from the south of Asia, possibly from India.

Analysis of the Y-chromosome STR diversity has shown that the Kalmyks collectively as an ethnic group have close genetic affinities to the Mongols and Altaian Kazakhs, but the individual Kalmyk tribes, especially the Khoshuuds, have differentiated genetically from each other. Recent studies based on Y-STR haplotypes have also demonstrated that the Kalmyks, Mongols, Altaian Kazakhs and Altaians-Kizhi showed a similar haplotype composition, resulting probably from common origin of populations.6, 62 Anthropologists have suggested that in the last 200–300 years, the Kalmyk tribes have experienced some admixture with neighboring East European populations.63 However, comparison of haplotypes belonging to haplogroups R1a1a-M17 and N1c-Tat, which are frequently occurred in East European populations, shows a low level of East European admixture in the Kalmyks. In addition, the lack or very low frequency of European-specific lineages R1a1a7-M458, I1-M253 and I2a-P37.2 in the Kalmyks suggests that paternal gene flow from East Europeans to the Kalmyks is very little. Therefore, intra-ethnic heterogeneity of the Kalmyks is likely maintained by inter-tribal differences because of extensive endogamous practices of the tribal groups. Previous molecular genetic studies have shown that Y-chromosome results are in agreement with ethnological data suggesting that nomadic populations of Central Asia, especially Turkic-speaking ones, are exogamous at the clan level but endogamous at the tribe level.29, 43, 45, 46, 49 Our study shows that there are more differences between the tribes of the Kalmyks than between some ethnic groups, such as the Kalmyks, Mongols and Altaian Kazakhs. This is also consistent with previous studies of the Y-chromosome variation in the Turkic nomadic groups of Central Asia.45, 46

Future research should be conducted with high-resolution Y-chromosome analysis of Mongolian populations throughout Mongolia, especially of the Oyrat groups living in Western Mongolia, which is necessary to answer the questions about genetic affinities between different tribes of the Mongols and Kalmyks. Previous studies of Mongolian populations were performed using low-resolution Y-chromosome analysis and reported conflicting results.64, 65 In addition, analysis of genome-wide autosomal single-nucleotide polymorphism data in the Kalmyks would be a helpful approach to reveal their genetic affinity to populations of Central Asia and Siberia and to estimate the extent of their recent admixture with East Europeans.