Gene pool preservation across time and space In Mongolian-speaking Oirats

Balinova, Natalia; Hudjašov, Georgi; Pankratov, Vasili; Pennarun, Erwan; Reidla, Maere; Metspalu, Ene; Batyrov, Valery; Khomyakova, Irina; Reisberg, Tuuli; Parik, Jüri; Dzhaubermezov, Murat; Aiyzhy, Elena; Balinova, Altana; El’chinova, Galina; Spitsyna, Nailya; Khusnutdinova, Elza; Metspalu, Mait; Tambets, Kristiina; Villems, Richard; Kushniarevich, Alena

doi:10.1038/s41431-024-01588-w

Download PDF

Article
Open access
Published: 11 April 2024

Gene pool preservation across time and space In Mongolian-speaking Oirats

European Journal of Human Genetics (2024)Cite this article

536 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

The Oirats are a group of Mongolian-speaking peoples residing in Russia, China, and Mongolia, who speak Oirat dialects of the Mongolian language. Migrations of nomadic ethnopolitical formations of the Oirats across the Eurasian Steppe during the Late Middle Ages/early Modern times resulted in a wide geographic spread of Oirat ethnic groups from present-day northwestern China in East Asia to the Lower Volga region in Eastern Europe. In this study, we generate new genome-wide and mitochondrial DNA data for present-day Oirat-speaking populations from Kalmykia in Eastern Europe, Western Mongolia, and the Xinjiang region of China, as well as Issyk-Kul Sart-Kalmaks from Central Asia, and historically related ethnic groups from Altai, Tuva, and Northern Mongolia to study the genetic structure and history of the Oirats. Despite their spatial and temporal separation, small current population census, both the Kalmyks of Eastern Europe and the Oirats of Western Mongolia in East Asia are characterized by strong genetic similarity, high effective population size, and low levels of interpopulation structure. This contrasts the fine genetic structure observed today at a smaller geographic scale in traditionally sedentary populations, and is conditioned by high mobility and marriage practices (traditional strict exogamy) in nomadic groups. Conversely, the genetic profile of the Issyk-Kul Sart-Kalmaks suggests a distinct source(s) of genetic ancestry, along with indications of isolation and genetic drift compared to other Oirats. Our results also show that there was limited gene flow between the ancestors of the Oirats and the Altaians during the late Middle Ages.

Phylogenomics and the rise of the angiosperms

Article Open access 24 April 2024

Hybrid speciation driven by multilocus introgression of ecological traits

Article Open access 17 April 2024

Diversity-dependent speciation and extinction in hominins

Article Open access 17 April 2024

Introduction

The Oirats, a group of closely related peoples who speak the Oirat dialect of the western branch of the Mongolian language, now live far apart on the eastern (western regions of Mongolia, China (XUAR)) and western (Republic of Kalmykia of Russian Federation) edges of the Eurasian Steppe [1, 2]. Both groups include multiple tribes with the largest, Khoshuts, Derbets, and Torguts, present in both groups, and Buzav, which formed in Eastern Europe [3]. Some Kalmak groups in Central Asia, including Sart-Kalmaks in Kyrgyzstan (Sunni Muslims), may also be related to the Oirats through some common traditions and the Oirat dialect of the Mongolian language ([4] but see [5] and [6, 7] for alternative views on the Sart-Kalmak’s origin).

The Oirats are historically related to the mobile nomadic groups in the eastern part of the Eurasian Steppe – today’s Altai region, western Mongolia and northwestern China [1]. The pre-Iron Age genetic history of the region is characterized by ancestral ties to Neolithic individuals of the Devils’ Gate cave in the Far East (ancient northeast Asian ancestry, ANA) [8] as well as connections to Mal’ta and Afontova Gora individuals (ancient north Eurasian ancestry, ANE) [9]. The genetic makeup of the region changed as a result of high population mobility during and after the Iron Age [8]. The Oirats were part of the Great Mongol Empire until the 12th century AD. After its collapse in the 14th-15th centuries AD, some nomadic groups moved westward, forming the Kalmyk ethnic group in the Lower Volga region in the 16th century AD [1]. The uniparental [10,11,12,13,14] and autosomal [15,16,17] genetic diversity of the Kalmyks brings them close to present-day Central and East Asian populations.

The formation and dispersion of the Oirats is connected with the Altai-Sayan highlands, which were under the influence of several political formations during 13th–18th centuries AD, including the Oirat Khanate [3]. The common history has left traces in the linguistics [18] as well as in the cultural layers of the Oirats and Altaians [3], but their genetic ties have not yet been studied.

The demographic history of the Oirats is poorly understood. In this light, the genomes of modern Oirats are a valuable resource for studying their past. In this study, we generate genome-wide genotypes and mitochondrial DNA sequences of present-day Oirat-speaking groups and historically related South Siberians and analyze them together in the context of modern and ancient human genomes. We aim to characterize the genetic structure of the Oirats, reconstruct their demographic history and genetic relationships with surrounding populations.

Material

Genome-wide genotypes (InfiniumOmniExpress-24v1.2 and v1.3 array) of 80 and mitochondrial DNA (mtDNA) of 453 individuals from different Oirat groups (Kalmyks, Western Mongols), Sart-Kalmaks and South Siberians (Fig.1) were analyzed for the first time in this study (Table S1, Table S2).

**Fig. 1: Map showing the geographic origin of the new and reference populations studied for genome-wide diversity.**

Methods

Genome-wide data processing and genotype imputation

New samples were pooled with several previously published populations of interest (resulting in N = 645) (Table S1). As these were genotyped on different Illumina genotyping platforms, we first performed imputation to increase the cross-platform single nucleotide polymorphism (SNP) overlap. All samples were prepared in a similar way using PLINK v1.9 [19] (SFile). Imputation and phasing were performed separately for each genotyping array on the TOPMed imputation server (https://imputation.biodatacatalyst.nhlbi.nih.gov/). This was followed by post-imputation quality filtering and merging of the datasets with imputed genotypes, resulting in ~915k SNPs in 645 individuals.

ADMIXTURE and principal component analysis (PCA)

PCA was performed with PLINK v1.9 [19]. The ancestry of each individual was modeled using ADMIXTURE v1.30 [20]. Thirty randomly seeded runs were performed for each number of ancestral populations (k = 2–12), and results within each k were post-processed with CLUMPAK to find the consensus solution [21]. Samples belonging to the largest (most frequent) CLUMPAK cluster were grouped by inferred fineSTRUCTURE (FS) populations (see below). For each k, ancestral population proportions within each FS population were averaged and reported. Cross-validation (CV) scores for each k are shown in Fig. S1.

fineSTRUCTURE, GLOBETROTTER

To investigate genetic clustering between different target groups and other Eurasian populations, we used the ChromoPainter/fineSTRUCTURE (CP/FS) pipeline [22] on our phased and imputed SNP set. FS was run for 3 M Markov chain Monte Carlo (MCMC) iterations (1.5 M burn-in and 1.5 M main iterations) in two parallel runs to assess convergence. The tree-building step was performed as published elsewhere [23] and the run with the highest observed posterior likelihood was used to cluster the samples into genetic groups. The inferred FS groups were further manually inspected and merged into the higher-order FS clusters (Table S1 “FS cluster affiliation”). These FS clusters were used as surrogate populations (Table S1 “GT population name”) to infer admixture proportions and dates with GLOBETROTTER (GT). For additional details on admixture events dating, see SFile and Table S3.

Fst estimation

We used vcftools-0.1.14 [24] on imputed genomes (460k SNPs, SFile) to obtain Weir and Cockerham’s Fst between each pair of populations across all sites.

Outgroup f3 and f4 statistics

Outgroup f3 and f4 statistics were calculated using the qp3Pop and qpDstat methods, respectively, from ADMIXTOOLS 6.0 program [25]. Ancient DNA (aDNA) samples of interest were extracted from the Allen Ancient DNA Resource [25] (Table S4) and merged with the modern data, resulting in 460k overlapping SNPs (details of aDNA data preparation in SFile). Outgroup f3 statistics used in the form f3(Mbuti; modern group, ancient group) [26], where modern groups were Kalmyks (Buzav, Khoshut, Torgut), Western Mongols (Derbert, Torgut), Xinjiang Kalmyks, Sart-Kalmak, Buryat, Tsaatan, Tozhu, as well as Altaians (Chelkan, Kumandin, Tubalar, Teles). f4 statistics followed the form f4(Mbuti, Western Eurasian population; Kalmyk group, Sart-Kalmak). Western Eurasian populations are reported in Table S1.

Detection of segments identical by descent (IBD)

To detect patterns of long IBD segments sharing between individuals and FS clusters (Table S1), we applied IBIS [27] to 645 imputed genotypes. The choice of IBIS was motivated by the fact that (a) IBIS does not use phase information and is therefore not affected by phase errors, (b) IBIS is to some extent tolerant of genotype errors, and (c) IBIS is applicable to datasets with individuals from different populations. Before running IBIS, we filtered positions based on imputation quality (R2 > = 0.99), resulting in 708 k positions. We ran IBIS with the following arguments:

-t 10 -maxDist 0.1 -a 0 -mL 5 -mt 300 -er 0.004 -hbd -mLH 5 -erH 0.008.

This revealed 239,501 IBD segments of 5 centimorgan (cM) or longer. We then used the sum of the genetic length in cM of IBD segments between each pair of individuals as a measure of IBD sharing.

When describing IBD sharing between clusters and/or populations, we removed samples marked “1” in columns “Pruning” in Table S1 (falling into a distinct cluster compared to the majority of samples in that population) and “2” (forming a separate tip within the cluster) from populations and only those marked “2” from clusters.

Runs of homozygosity

Runs of homozygosity (RoHs) were detected using PLINK v1.9 [19] on the same file as was used for IBD detection (see above). The following arguments were used to obtain the number of RoHs and their total length for each individual: --homozyg --homozyg-window-snp 100 --homozyg-snp 50 --homozyg-kb 1500 --homozyg-gap 1000 --homozyg-density 50 --homozyg-window-missing 5 --homozyg-window-het 1.

Estimating the effective population size (Ne) through time

To estimate the effective population size of the Kalmyk and Western Mongol groups, we performed IBDNe [28] on 50 individuals forming the corresponding FS cluster (Table S1). Since the length distribution of IBD segments is fundamental for IBDNe, to avoid segment disruption due to imputation errors, we used only the SNPs that were actually genotyped and not imputed (564k SNPs) for the samples belonging to the cluster that includes the Kalmyks and the Oirats from Western Mongolia. We extracted phase information from our TOPMed imputation results for these positions in these 50 samples. Here, we used the Refined IBD method [28] to detect IBD segments. We then merged segments that were no more than 0.6 cM apart and had no more than one discordant position between two segments to be merged, using the utility provided by the authors of the Refined IBD. Resulting segments longer than or equal to 3 cM were used as input for IBDNe estimation.

Mitochondrial DNA haplogroup determination

MtDNA haplogroups (hg) were determined by DNA sequencing of the hyper variable segment (HVS) I and HVSII (where necessary) and the screening of 73 coding region markers (Table S2) according to the hierarchy of the mtDNA phylogenetic tree [29] (PhyloTree.org – mtDNA tree Build 17 (18 Feb 2016) http://www.phylotree.org/tree/main.htm). The frequencies of the mtDNA hgs of the studied populations were compared with available comparative data (Table S5) from Eastern Eurasia using Correspondence analysis (CA).

Results

Genetic structure of Oirats and South Siberians

To assess the broad genetic profile of the Oirats and South Siberians, we used PCA and Fst. From here on, we will use “Kalmyks” to refer to the Kalmyk groups from Kalmykia and the Xinjiang region of China, and “Oirats of Western Mongolia” (OWM) to refer to the Torgut and Derbet groups from Western Mongolia. Both the PCA results (Fig. 2a, b) and Fst values (Table S6, Fig. S2) show that the Kalmyks are genetically similar to OWM, but are differentiated from their present neighboring populations from Eastern Europe (Fig. 2a, b, Table S6, Fig. S2). The Mongolian-speaking Buryats form their own cluster but are in close proximity to the Kalmyks and OWM. There is no recognizable structure between studied subethnic groups, neither within Kalmykia, nor within the OWM (Fig. 2a, b, Table S6, Fig. S2). The Sart-Kalmaks are distinct from other Oirats: they are grouped together with the Central Asian Kyrgyz, Uyghurs, and Kazakhs (Fig. 2a, b). The Altaians form a cline along the PC1 with northern ethnic groups (Kumandins, Chelkans and Tubalars) shifted towards the West Eurasians, and southern ethnic groups – Teles and Altai-Kizhi – found on the opposite side (Fig. 2a, b). Tsaatans from northern Mongolia lie between Tozhu Tuva and Tuvinians, suggesting varying degrees of admixture between the two groups (Fig. 2a, b).

**Fig. 2: Genetic structure of Oirats and South Siberian populations revealed by PCA.**

At k = 9 of the ADMIXTURE analysis, an East Asian component predominates in the Kalmyks and OWM, followed by a Siberian Yakut and Evenk-like component, while West Eurasian ancestry is minor (Fig. 3a, Fig. S1, Fig. S3). The Sart-Kalmaks are closer to Central Asians due to a slight increase in Western Eurasian and a decrease in Siberian Evenk-like components (Fig. 3a). Siberian ancestry (light orange and light green, maximized in Yakuts and Evenks, respectively) predominates among Tozhu from Tuva and Tsaatans from northern Mongolia. The proportions of ancestral components differ between northern and southern Altaian populations: the former have a predominant Shor-like component (pink), whereas the latter have two components - Evenk-like (light green) and Han-like (dark blue) – in almost equal proportions (Fig. 3a). In addition, the Western Eurasian (European-like) component is increased in the Northern Altaic group.

**Fig. 3: fineSTRUCTURE cluster tree and ancestral components modeled with ADMIXTURE, and total and mean length of runs of homozygosity (RoH) per population.**

Patterns of IBD sharing and homozygosity in Oirats and South Siberians

To examine recent gene flow, we analyzed IBD segments shared within and between populations as well as genetic clusters defined by the FS analysis (an individual’s membership in a particular cluster is given in Table S1: e.g. the Kalmyks and OWM are members of the genetic cluster “Kalmyk-OWM”). The values of IBD sharing within populations and within clusters varied widely in our dataset (Fig. S4, Fig. S5). The lowest values within populations and within clusters were detected in most of the East European, Caucasian, Central Asian, and Oirat populations. In particular, the median pairwise IBD sharing is 25 cM within the Kalmyk-OWM cluster, while extended IBD runs were observed in the South Siberian populations, suggesting low Ne (e.g., the median IBD sharing reaches 200 cM and higher within the Tuvan-Tozhu and Altai North clusters), which is expected for second cousins who share a couple of ancestors from around three generations ago [30] (Fig. S4, Fig. S5).

Individuals forming the Kalmyk-OWM cluster have the highest total IBD length, with East Asians, Central Asians, and South and East Siberians, with Buryats standing out among other Siberian groups. This is similar for Sart-Kalmaks, but the total length of IBD segments is highest with Kyrgyz (Fig. 4, Fig. S6, Fig. S7). IBD-relatedness is different for the Altai North and Altai South clusters: while the former is more localized, the latter shares more IBD with a wide range of Siberian, Central Asian populations and Kalmyks (Fig. 4, Fig. S6, Fig. S7). The Tuvan-Tozhu have more IBD in common with the Siberian populations – the Buryats, Dolgans, Evenks, and Nenets – than with the Tuvans (Fig. 4, Fig. S6, Fig. S7).

**Fig. 4: Population-median IBD sharing with fineSTRUCTURE-defined clusters.**

We have examined the total length of RoH, as a measure of a homozygosity in our dataset (Fig. 3b). Both the Kalmyks and OWM have comparatively low values of total RoH (Fig. 3b). Among the Kalmyk subethnic groups, the lowest RoH values were found among Derbets and Kalmyks from Xinjiang, and the highest values were found among Khoshuts. Notably, the total RoH length in Tsaatans from northern Mongolia, unlike other Mongolian populations studied, is among the highest in our dataset. RoHs in this population are also longer on average than in other populations with similar total RoH length, suggesting low Ne in the relatively recent past.

In contrast to the Kalmyks and OWM, Sart-Kalmaks from Central Asia have both higher total and mean RoH lengths, which may reflect a more recent decrease in population size during the westward migration of some of their ancestors into Kyrgyz territory [5]. Higher levels of homozygosity are also a characteristic of both northern and southern Altaic populations, among which Altaic Chelkans stand out (Fig. 3b). Finally, the Tozhu people of the Republic of Tuva have the highest values of total RoH length among the new populations generated in this study, indicating a small population size of these groups, probably due to geographical isolation or bottleneck (Fig. 3b). In the case of Tozhu, however, the high total RoH length is due to a large number of relatively short segments, suggesting a recent increase in Ne or exogamy [31].

Effective population size dynamics for the Kalmyk-OWM cluster

Modeling of Ne in the Kalmyk-OWM cluster suggests its rapid increase starting about 20 generations ago (if generation = 30 years, then about 600 years ago) (Fig. S8) [27]. However, these results should be treated with caution because (a) the number of samples used is relatively small, and (b) we had to combine the Kalmyk and OWM to achieve a sample size of at least 50 individuals, which may have inflated Ne estimates in the very recent past. Nevertheless, the recent population expansion is consistent with the low levels of IBD sharing observed in the Kalmyk-OWM cluster, as well as in individual Kalmyk and OWM populations.

Exploring admixture events in Oirats and South Siberians

Admixture events in Kalmyks, OWM, Sart-Kalmaks, and South Siberian genetic clusters were modeled using GT and MALDER analyses. Simple one-date admixture events between two source populations dated to the 13th–14th centuries AD are inferred by GT with medium to high certainty (goodness-of-fit R2 > 0.6) in most of the clusters, but Mongolian Tsaatans (Table S7a, b). MALDER results further confirm these findings: with the exception of three groups (Altai South, Tuvan-Tozhu, and Kalmyk Khoshut), it detected an admixture event between the eastern and western Eurasian reference groups. This pattern and admixture dates overlap with those in the GT analysis (see SFile for potential explanation of the observed differences in dates).

The Kalmyk, OWM, and Sart-Kalmak groups could be represented as a mixture of East Eurasian (Daur/Oroqen/Buryat proxies) (65–80%) and West Eurasian (South Russian/Caucasian proxies) sources (20–35%) (Table S8). The proportion of West Eurasian ancestry among Sart-Kalmaks is significantly higher than among Kalmyks and OWM (35%), while Buryat-like ancestry is lower (10%) (Table S8, Table S9). At the level of ethnic subgroups of Kalmyks, a slight variation in West Eurasian ancestry is observed (Table S8). Admixture between Central Asian (Uzbeks/Kazakhs) and South Siberian (Tuvan/Shor) proxies was modeled in the two Altai clusters (Table S8).

Relatedness to ancient human groups

Analysis of the outgroup f3 statistics shows that Kalmyks, OWM, Sart-Kalmaks, Buryats, and Tsaatans from northern Mongolia share a high number of derived alleles with ancient groups whose ancestry is classified as ANA (Ancestral Northeast Asian) (Fig. S10, Table S4, Table S10). These include: (a) pre-Bronze Age (BA) groups from central-eastern Mongolia, Priamurie (Devil’s Gate), Buryatia (Fofonovo site), and the Baikal region (Lokomotiv Early Neolithic site, Shamanka Early BA, Ust Belaya Early BA), in which ANA predominates; (b) Middle-Late BA (MLBA) and Early Iron Age groups from central and eastern Mongolia (Ulaanzuukh site; Slab Grave culture), the Baikal region (mixture of ANA and ANE (Ancestral North Eurasian) ancestry related to Mal’ta/Afontova Gora individuals); (c) Late Xiongnu groups and late medieval individuals (Khitan, Mongol) from Mongolian territory.

Present-day Altaians and Tuva Tozhu are closer to those ancient groups that represent a mixture of ANE and ANA genetic ancestries (Fig. S10, Table S10). These include primarily Neolithic and Early BA individuals from the Baikal region, but also those where ANA is predominant (pre-BA, MLBA, and Early IA individuals from Mongolian territory). Chelkans also share a higher number of derived alleles with West Siberian hunter-gatherers (HG) (Sosnovyj Ostrov, Tumen) and from Karelia (Fig. S10, Table S10).

Mitochondrial DNA diversity in Oirats and South Siberians

We characterized the mtDNA diversity of nine Oirat ethnic groups - four groups of Kalmyks from Kalmykia, three groups from western Mongolia, Kalmyks from the Xinjiang region of China, and Sart-Kalmaks from Kyrgyzstan - together with Mongolian Tsaatans and Tozhu Tuvans, who speak the Tozhu dialect of Tuvan (Table S2). All Kalmyk subethnic groups, as well as OWM and Tsaatans from Mongolia, are characterized by quite diverse maternal gene pools. In contrast, Tozhu Tuvans have a limited number of hgs, typical of a founder event followed by isolation and small population size (Fig. S9a).

East Eurasian hgs (B, F, R9b, A, Y, N9a, C, Z, D, G, M7, M9a, M13) form the largest common component (more than 70%) among all ethnic groups studied (Fig. S9a). The frequency and composition of these hgs differ among the groups. Hg D is the most common among both Kalmyks and OWM. Hg G occurs more frequently in Mongol Khoshuts, Mongol Derbets, Sart-Kalmaks, and Xinjiang Kalmyks than in the other groups studied. Mongol Tsaatans and Tozhu Tuvans differ from other populations by high frequency (> 50%) of hg C and low frequency of hg D.

West Eurasian hgs (HV, R1, R2, JT, U, N1, N2, X) form the minor component in the groups studied, while they are virtually absent in the Tozhu Tuvan and Mongol Khoshut. Western Eurasian hgs constitute > 20% of the maternal gene pool of the Kalmyk Buzav and Torgut, and Sart-Kalmak, but only 8% among Mongol Torguts, Mongol Tsaatans and Xinjiang Kalmyks. However, considering the small sample sizes of some sub-groups, the differences should be taken with caution. In the Eurasian background, the Oirats studied are intermingled with South Siberians and Central Asians and are clearly separated from modern Caucasian populations (Fig. S9b).

Discussion

Genetic ancestry of Oirats across the Eurasian Steppe

The ancestors of the modern Kalmyks and the OWM lived in the region that is now western Mongolia, northwestern China, eastern Kazakhstan, and southern Siberia until the 16th century AD, when a number of tribes began to migrate westward, reaching the lowlands of the Volga and Don rivers in eastern Europe in the 17th century AD [32,33,34,35,36].

The high genetic similarity of Kalmyks and OWM (Fig. 2, Fig. 3a, Table S6, Fig. S7, Table S8) supports a common genetic ancestry for these groups and its high degree of preservation over time and distance. Some factors that would support the preservation of genetic ancestry in Kalmyk ancestors are: (1) high mobility of pastoral nomads from the Eurasian Steppe of the 16th–17th centuries AD – within about 50 years they reached the lower Volga and Don rivers from Dzungaria [3, 37, 38], (2) pastoral nomadic lifestyle maintained until the 20th century in the regions of the lower Volga and Don rivers [3, 37, 38], (3) language (Oirat dialect of the Mongolian language) [3, 37, 39], and (4) religion (Lamaist Buddhists) [3, 37, 39] could limit gene flow with encountered populations with different traditions.

Oirat groups (Kalmyks and OWM), as well as Sart-Kalmaks and Buryats, are closely related to ancient groups that inhabited regions east of the historical land of the Oirats (Fig. S10). This suggests a deep common ancestry among these groups, consistent with the proposed importance of the eastern Transbaikal and Amur regions in the formation of proto-Mongolian tribes [40]. Similarly, genetic links between present-day Oirats and Altaians go back as far as the Neolithic and Bronze Ages, suggesting a common prehistoric ancestry between these populations (Fig. S10). There is little evidence of substantial gene flow between the Oirat and Altai ancestors during the late medieval/early modern period, despite the Oirat political dominance in the region at the time.

The Kalmyks and OWM are likely to have originated from a parental population(s) that has maintained a relatively large effective population size (Ne) throughout its history (Fig. 3b) (also observed earlier on uniparental data by Nasidze et al., 2005) [14]. The Ne curve estimated in the joint cluster that includes Kalmyks and OWM (Fig. S8) suggests a population growth around 600 y.a. This estimate overlaps with the period of expansion of the Mongol Empire, when numerous tribes became a part of this state in the 12th––13th centuries AD [3, 37, 38]. Importantly, IBDNe estimations reflect the population growth itself, rather than a mixture of different populations [41]. This, together with the genetic evidence in our study, may suggest that the expansion of the Mongols and the Oirats in particular, involved not only the absorption of a variety of different populations, but also actual population growth.

The gene pool of the Sart-Kalmaks demonstrates high similarity with Central Asians, signs of founder effect and isolation of their ancestral population for some period of time (Fig. 2a, b, Fig. 3a, b, Fig. S5, Fig. S7, Table S6, Table S9). Distinct genetic sources for Oirats and Issyk-Kul Sart-Kalmaks, and/or intensive admixture would be consistent with the observed genetic differences between the two groups. However, reconstructing the history of the Sart-Kalmaks will require a more comprehensive dataset and modeling of the population genetic history.

Social traditions strongly influence the genetic structure of (ex)nomadic Oirats

Previous studies have demonstrated the presence of genetic structure mirroring the geographic origin of individuals in populations with historically agricultural economies (e.g. Estonians, Poles) [41, 42]. The pattern we observe for the studied Oirats, whose recent ancestors were pastoral nomads, is the opposite: Kalmyk and Western Mongolian groups are genetically very similar to each other, despite the large geographic distance that now separates them (Fig. 2, Fig. 3, Fig. S7, Table S6). Moreover, we reveal little to no differences between tribal groups, both within Kalmyks and OWM (Fig. 2a, b, Fig. 3a, Table S2) [13]. The minor differences we observe between the groups (Caucasian component in the Kalmyk Torguts, slight increase in West Eurasian ancestry in the Buzav (Table S8)) are less likely to be due to genetic drift (low within-group IBD sharing as well as growth in Ne during the last 600 years (Fig. 4, Fig. S8)), but can be explained by limited gene flow from neighboring populations.

The lack of differentiation between Kalmyks and OWM indicates that they have maintained large Ne since their divergence. In addition to recent population growth, exogamy may also contributed to preventing isolation between local ethnic subgroups. Exogamy (“marriage outside the kin group”) is an ancient custom dating back to proto-Mongolian tribes, that served to prevent marriages between related people [37, 43]. Although modified over time (reckoning the number of generations to common ancestor; geographic distance, clan affiliation etc), exogamy has been preserved to the present day and is common among numerous populations living throughout the Great Eurasian Steppe (Mongolians, Oirats, Tuvans, Kazakhs, Kyrgyz, Bashkirs) [37, 43]. Although we have not tested in our study the hypothesis of exogamy explicitly, the genetic patterns we observe are compatible with it - the lower total length and the lower number of RoHs in the present-day Kalmyks (Fig. 3b).

In summary, the present-day Oirats of Western Mongolia and Kalmykia represent a genetic unity that traces its deep ancestry to eastern Transbaikal and the Amur River region. The revealed low intrapopulation structure and relatively high effective population size is consistent with the tradition of exogamy among historically nomadic pastoralists. In contrast, the genetic profile of the Issyk-Kul Sart-Kalmaks may suggest different source(s) of genetic ancestry(ies), as well as indications of isolation and genetic drift, compared to other Oirats. Our study does not provide evidence of significant gene flow between Oirat ancestors and populations from the Altai-Sayan highlands of southern Siberia during the late Middle Ages.

Data availability

The newly generated genome-wide genotype data generated in this study are available at https://www.ncbi.nlm.nih.gov/geo/(accession numbers GSE262748 and GSE262754) and the data depository of the Estonian Biocentre (https://evolbio.ut.ee/).

References

Bakaeva EP, Orlova KV, Muzraeva DN, Sharaeva TI, Balinova NV, Khomyakova IA, et al. Cross-border culture: essays on a comparative study of traditions of the western Mongols and Kalmyks (in Russian). Bakaeva EP, editor. Elista: Kalmyk scientific center of the Russian Academy of Sciences; 2016.
Bitkeeva AN. Cultural and linguistic dynamics of the linguistic community: the Oirats of China. In: Chelyshev EP, editor. Solution of national and linguistic issues in the modern world CIS and Baltic countries (in Russian). Moscow: Azbukovnik; 2010. p. 681–8.
Maksimov KN, editor. The history of Kalmykia from ancient times to the present day. In 3 volumes (in Russian). Elista: Gerel; 2009.
Balinova NV, Khoninov VN. Studying the Issyk-Kul Kalmyk ethnic group (in Russian). Anthropol Archeol Eurasia. 2014;53:47–55.
Article Google Scholar
Zhukovskaya N. Issyk-Kul Kalmyks (Sart-Kalmaks). In: Tolstova RDA, editor. Ethnic processes among the national groups of Central Asia and Kazakhstan (in Russian). Moscow: Science; 1980. p. 155–66.
Mokeev A. New Look at the Question of the Origin of the Sart-Kalmaks (in Russian). In Bishkek, Moscow: KRSU; 2013. p. 98–103.
Nanzatov BZ. Sart-Kalmaks in modern Kyrgyzstan (in Russian). Cultural Herit Peoples Cent Asia 2012;3:28–49.
Google Scholar
Jeong C, Wang K, Wilkin S, Taylor WTT, Miller BK, Bemmann JH, et al. A dynamic 6000-year genetic history of Eurasia’s Eastern steppe. Cell. 2020;183:890–904.e29.
Article CAS PubMed PubMed Central Google Scholar
Raghavan M, Skoglund P, Graf KE, Metspalu M, Albrechtsen A, Moltke I, et al. Upper Palaeolithic Siberian genome reveals dual ancestry of Native Americans. Nature. 2014;505:87–91.
Article PubMed Google Scholar
Roewer L, de Knijff P, Kayser MY. Chromosome STR analysis in forensic practice. 2004. p.13–16.
Derenko M, Malyarchuk B, Grzybowski T, Denisova G, Rogalla U, Perkova M, et al. Origin and post-glacial dispersal of mitochondrial DNA haplogroups C and D in northern Asia. PLoS One. 2010;5:e15214.
Article CAS PubMed PubMed Central Google Scholar
Malyarchuk BA, Derenko M, Denisova G, Woźniak M, Rogalla U, Dambueva I, et al. Y chromosome haplotype diversity in Mongolic-speaking populations and gene conversion at the duplicated STR DYS385a,b in haplogroup C3-M407. J Hum Genet. 2016;61:491–6.
Article CAS PubMed Google Scholar
Balinova N, Post H, Kushniarevich A, Flores R, Karmin M, Sahakyan H, et al. Y-chromosomal analysis of clan structure of Kalmyks, the only European Mongol people, and their relationship to Oirat-Mongols of Inner Asia. Eur J Hum Genet. 2019;27:1466–74.
Article PubMed PubMed Central Google Scholar
Nasidze I, Quinque D, Dupanloup I, Cordaux R, Kokshunova L, Stoneking M. Genetic evidence for the Mongolian ancestry of Kalmyks. Am J Phys Anthropol. 2005;128:846–54.
Article PubMed Google Scholar
Tambets K, Yunusbayev B, Hudjashov G, Ilumäe AM, Rootsi S, Honkola T, et al. Genes reveal traces of common recent demographic history for most of the Uralic-speaking populations. Genome Biol. 2018;19:139.
Article PubMed PubMed Central Google Scholar
Yunusbayev B, Metspalu M, Metspalu E, Valeev A, Litvinov S, Valiev R, et al. The genetic legacy of the expansion of Turkic-Speaking nomads across Eurasia. PLoS Genet. 2015;11:1–24.
Article Google Scholar
Seidualy M, Blazyte A, Jeon S, Bhak Y, Jeon Y, Kim J, et al. Decoding a highly mixed Kazakh genome. Hum Genet. 2020;139:557–68.
Article CAS PubMed PubMed Central Google Scholar
Kassian AS, Starostin G, Egorov IM, Logunova ES, Dybo AV. Permutation test applied to lexical reconstructions partially supports the Altaic linguistic macrofamily. Evolut Hum Sci. 2021;3:e32.
Article Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR, Bender D, et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am J Hum Genet. 2007;81:559–75.
Article CAS PubMed PubMed Central Google Scholar
Alexander DH, Novembre J, Lange K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 2009;19:1655–64.
Article CAS PubMed PubMed Central Google Scholar
Kopelman NM, Mayzel J, Jakobsson M, Rosenberg NA, Mayrose I. Clumpak: a program for identifying clustering modes and packaging population structure inferences across K. Mol Ecol Resour. 2015;15:1179–91.
Article CAS PubMed PubMed Central Google Scholar
Hellenthal G, Busby GBJ, Band G, Wilson JF, Capelli C, Falush D, et al. A genetic atlas of human admixture history. Science. 2014;343:747–51.
Article CAS PubMed PubMed Central Google Scholar
Leslie S, Winney B, Hellenthal G, Davison D, Boumertit A, Day T, et al. The fine-scale genetic structure of the British population. Nature. 2015;519:309–14.
Article CAS PubMed PubMed Central Google Scholar
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–8.
Article CAS PubMed PubMed Central Google Scholar
Mallick S, Micco A, Mah M, Ringbauer H, Lazaridis I, Olalde I, et al. The Allen Ancient DNA Resource (AADR): a curated compendium of ancient human genomes. bioRxiv. 2023; Available from: https://doi.org/10.1101/2023.04.06.535797.
Patterson N, Moorjani P, Luo Y, Mallick S, Rohland N, Zhan Y, et al. Ancient admixture in human history. Genetics. 2012;192:1065–93.
Article PubMed PubMed Central Google Scholar
Seidman DN, Shenoy SA, Kim M, Babu R, Woods IG, Dyer TD, et al. Rapid, phase-free detection of long identity-by-descent segments enables effective relationship classification. Am J Hum Genet. 2020;106:453–66.
Article CAS PubMed PubMed Central Google Scholar
Browning SR, Browning BL. Accurate non-parametric estimation of recent effective population size from segments of identity by descent. Am J Hum Genet. 2015;97:404–18.
Article CAS PubMed PubMed Central Google Scholar
van Oven M, Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum Mutat. 2009;30:E386–94.
Article PubMed Google Scholar
Speed D, Balding DJ. Relatedness in the post-genomic era: is it still useful? Nat Rev Genet. 2015;16:33–44.
Article CAS PubMed Google Scholar
Aiyzhy EV. Tuvans of Russia, Mongolia and China: Forms of Family and Marriage (in Russian). Bulletin of the Kalmyk Institute for Humanities of the Russian Academy of Sciences. 2017;29:59.
Zlatkin IY. History of the dzungar khanate, 1635–1758. 2nd ed. (in Russian). Moscow: Nauka; 1983.
Vladimirtsov BY. Social system of Mongols: Mongolian nomadic feudalism (in Russian). Leningrad: Academy of Sciences Press; 1934.
Derevyanko AP. The Early Iron Age of Priamurye (in Russian). Novosibirsk: Nauka. Siberian Branch; 1973.
Kychanov EI. The History of Ancient and Medieval States Bordering China (From Huns to Manchus). (in Russian). St. Petersburg: St. Petersburg Linguistic Society; 2010.
Avlyaev GO. Origin of Kalmyk people (in Russian). Elista : Kalmyk Book Publisher; 2002.
Bakaeva E, Zhukovskaya N, editors. The Kalmyks (in Russian). Moscow: Science; 2010. (Peoples and Сultures).
Sanchirov V.P. “On the issue of the Durban-Oirat Union” Oriental Studies, N.2, 2013. p. 7–12.
Gruntov I, Mazo O. Lexicostatistical classification of the Mongolic languages (in Russian). J Lang Relatsh. 2016;13:205–56.
Google Scholar
Dashibalov BB. On the Mongol-Turkic borderland: ethno-cultural processes in southeastern Siberia in the middle ages (in Russian). 2005. p. 202.
Pankratov V, Montinaro F, Kushniarevich A, Hudjashov G, Jay F, Saag L, et al. Differences in local population history at the finest level: the case of the Estonian population. Eur J Hum Genet. 2020;28:1580–91.
Article PubMed PubMed Central Google Scholar
Kaja E, Lejman A, Sielski D, Sypniewski M, Gambin T, Dawidziuk M, et al. The Thousand Polish Genomes-a database of Polish variant allele frequencies. Int J Mol Sci. 2022;20;23. Available from: https://doi.org/10.3390/ijms23094532.
Krader L. Indigenous cultures. Science. 1971;174:1184.
Article CAS PubMed Google Scholar
EOSC-Nordic. 2023. Available from: https://doi.org/10.23673/PH6N-0144.

Download references

Acknowledgements

We would like to thank all volunteers who participated in this study. We are grateful to Siiri Rootsi for the discussion on the uniparental genetic systems in Oirat populations. Analyses of the genome-wide data were done on the High-Performance Computing cluster at the University of Tartu (University of Tartu, UT Rocket, 2018) [44]. This work was partly composed during a writing retreat organized by the University of Tartu Institute of Genomics (https://genomics.ut.ee/en).

Funding

This work was supported by: the Estonian Research Council grant PUT (PRG243 to MM) (MM); the European Union through Horizon 2020 research and innovation program under grant no. 810645 and through the European Regional Development Fund project no. MOBEC008 (to MM) (MM, VP, GH); the Estonian Research Council grant PUT (PRG1071 to RV) (MR, EM, RV); the Estonian Research Council grant (PUT1339 to AK) (AK). GH was supported by the EU Digital Europe Programme (DIGITAL) and the Estonian Ministry of Social Affairs under grant GDI (GA no. 101081813). The research was carried out within the framework of the topic Research Institute “Anthropology of Eurasian Populations (biological aspects)” (AAAAA-A 19-119013090163-2) (IKh). The work was supported by a state assignment of the Ministry of Science and Higher Education of the Russian Federation (NB, GE); the funders had no role in the design of the study; in the collection, analysis, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Author information

These authors contributed equally: Natalia Balinova, Georgi Hudjašov, Vasili Pankratov.

Authors and Affiliations

Research Centre for Medical Genetics, Moskvorechye Str. 1, 115522, Moscow, Russia
Natalia Balinova & Galina El’chinova
Core Facility of Genomics, Institute of Genomics, University of Tartu, Riia 23B, 51010, Tartu, Estonia
Georgi Hudjašov, Tuuli Reisberg & Jüri Parik
Institute of Genomics, University of Tartu, Riia 23B, 51010, Tartu, Estonia
Vasili Pankratov, Erwan Pennarun, Maere Reidla, Ene Metspalu & Mait Metspalu
Kalmyk State University named after B. B. Gorodovikov, Pushkina Str. 11, 358000, Elista, Russia
Valery Batyrov
Anuchin Research Institute and Museum of Anthropology, Lomonosov Moscow State University, Mokhovaya Str., 11, 125009, Moscow, Russia
Irina Khomyakova
Institute of Biochemistry and Genetics, Ufa Federal Research Center of the Russian Academy of Sciences, 71 Prospekt Oktyabrya Str., 450054, Ufa, Russia
Murat Dzhaubermezov & Elza Khusnutdinova
Federal State Educational Institution of Higher Education “Ufa University of Science and Technology”, 32 Zaki Validi Str., 450076, Ufa, Russia
Murat Dzhaubermezov & Elza Khusnutdinova
Tuvan State University, Kyzyl, Russian Federation, Lenina Str., 36, 667000, Kyzyl, Republiс of Tuva, Russia
Elena Aiyzhy
Institute of Linguistics, Russian Academy of Sciences, Bolshoi Kislovsky Pereulok, 1, 125009, Moscow, Russia
Altana Balinova
Institute of Ethnology and Anthropology, Russian Academy of Sciences, Leninsky Prospekt, 32 А, 119334, Moscow, Russia
Nailya Spitsyna
Estonian Biocentre, Institute of Genomics, University of Tartu, Riia 23B, 51010, Tartu, Estonia
Kristiina Tambets, Richard Villems & Alena Kushniarevich

Authors

Natalia Balinova
View author publications
You can also search for this author in PubMed Google Scholar
Georgi Hudjašov
View author publications
You can also search for this author in PubMed Google Scholar
Vasili Pankratov
View author publications
You can also search for this author in PubMed Google Scholar
Erwan Pennarun
View author publications
You can also search for this author in PubMed Google Scholar
Maere Reidla
View author publications
You can also search for this author in PubMed Google Scholar
Ene Metspalu
View author publications
You can also search for this author in PubMed Google Scholar
Valery Batyrov
View author publications
You can also search for this author in PubMed Google Scholar
Irina Khomyakova
View author publications
You can also search for this author in PubMed Google Scholar
Tuuli Reisberg
View author publications
You can also search for this author in PubMed Google Scholar
Jüri Parik
View author publications
You can also search for this author in PubMed Google Scholar
Murat Dzhaubermezov
View author publications
You can also search for this author in PubMed Google Scholar
Elena Aiyzhy
View author publications
You can also search for this author in PubMed Google Scholar
Altana Balinova
View author publications
You can also search for this author in PubMed Google Scholar
Galina El’chinova
View author publications
You can also search for this author in PubMed Google Scholar
Nailya Spitsyna
View author publications
You can also search for this author in PubMed Google Scholar
Elza Khusnutdinova
View author publications
You can also search for this author in PubMed Google Scholar
Mait Metspalu
View author publications
You can also search for this author in PubMed Google Scholar
Kristiina Tambets
View author publications
You can also search for this author in PubMed Google Scholar
Richard Villems
View author publications
You can also search for this author in PubMed Google Scholar
Alena Kushniarevich
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design and study coordination: NB, AK, GE, NS, EKh, MM, KT, RV; sample collection and management: NB, EA, IKh, AB; historical and linguistic background: VB, AB; data processing: NB, MD, EM, TR, JP; data analyses: NB, AK, GH, VP, EP, MR, EM, MD. All authors contributed to the interpretation of the results and to the writing of the paper.

Corresponding authors

Correspondence to Natalia Balinova or Alena Kushniarevich.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

This genetic and epidemiological study was approved by the Internal Review Board of the Research Centre for Medical Genetics (Protocol No. 5/2 dated February 9, 2015). Informed consent was obtained from all participants; translation of the consent form into the native language was done where necessary. All experiments were performed in accordance with the relevant guidelines and regulations of the collaborating institutions.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

SFile

Fig.S1

Fig.S2

Fig.S3

Fig.S4

Fig.S5

Fig.S6

Fig.S7

Fig.S8

Fig.S9

Fig.S10 A-O

TableS1.

TableS2.

TableS3.

TableS4.

TableS5.

TableS6.

TableS7ab.

TableS8.

TableS9.

TableS10.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Balinova, N., Hudjašov, G., Pankratov, V. et al. Gene pool preservation across time and space In Mongolian-speaking Oirats. Eur J Hum Genet (2024). https://doi.org/10.1038/s41431-024-01588-w

Download citation

Received: 07 December 2023
Revised: 27 February 2024
Accepted: 04 March 2024
Published: 11 April 2024
DOI: https://doi.org/10.1038/s41431-024-01588-w