Introduction

Copronecrophagous dung beetles (subfamily Scarabaeinae) possess specialized ecological ability and behavior, in that they utilize feces as food, habitat, and breeding ground, and this extraordinary case of ecological specialization has likely contributed to their evolutionary success1. These dung beetles are also thought to play important ecological roles, such as nutrient recycling, bioturbation, plant growth promotion, seed dispersal, and parasite control, in terrestrial ecosystems2,3 and, as a result, have become a focus of both biodiversity and conservation research, especially owing to their broad range of responses to environmental changes and strong ecological interactions with a diverse array of organisms, including plants, vertebrates, and invertebrates4.

Unfortunately, however, both the decreased of population size and local extinction of dung beetle species (extinction-prone) have been reported to occur as a result of natural and anthropogenic disturbances2,3,5,6. It has been suggested that the diversity and abundance of dung beetles are strongly affected by slight changes in land use and mammalian fauna, at even local scales. Overhunting of forest-dwelling mammals, for example, has led to the extirpation of dung beetle species that rely on the dung of particular mammals2,7. In addition, dung beetle species with rolling behavior (i.e., telecoprids) are especially vulnerable to anthropogenic activities, and historical records suggest that the relative frequencies of telecoprid species have been more severely affected than other scarab dung beetles during the 20th century8. Furthermore, the use of antibiotics and parasite treatments (e.g., ivermectin) in commercial livestock management and agriculture can also accelerate the decline and extinction of dung beetle species9,10, thereby profoundly affecting community- and ecosystem-level changes.

Similar to other telecoprid species, Gymnopleurus mopsus (Scarabaeidae), is also under threat of local extinction throughout its distribution in Palearctic regions, including in Asia11 and Europe6,8. Previous studies have reported that both G. mopsus and other roller species are threatened in Italy8 and that populations of G. mopsus on the Iberian Peninsula have declined significantly6. In addition, G. mopsus has also been declared as regionally extinct (last documented in the 1960s) on the Korean Peninsula, which constitutes the southern margin of the species’ range in Northeast Asia. As a result, the National Institute of Biological Resources in South Korea launched a restoration program in 2012, which re-introduced G. mopsus from Mongolia to South Korea, based on the high genetic diversity of the Mongolia population and its genetic similarity to populations in South Korea12.

Because Mongolia is the only region where large G. mopsus populations are currently maintained in the north-eastern Palearctic region11, it has the value as the species’ last “refuge” and, as such, is considered important for future conservation and restoration efforts. However, the present-day population genetic structure shaped by contemporary processes (e.g., ongoing gene flow among populations in relation to anthropogenic habitat fragmentation and distributional changes) as well as demographic (population) history (e.g., dispersal, isolation, population contraction/expansion and bottleneck) of G. mopsus in Mongolia are yet to be investigated.

Such investigations of Mongolian G. mopsus populations would allow an understanding of the various ecological and evolutionary aspects of G. mopsus. The dispersal and migration patterns of insect species have been reported to play essential roles in determining the genetic structure of insect populations13, and the ecological associations of dung beetles and mammals have likely contributed to the evolution of the Scarabaeinae and community structuring of extant dung beetles over the last 40 million years14. Therefore, analysis of both population genetic and phylogeographic structures could elucidate the historic migration, dispersal, and demography, and the G. mopsus populations in Mongolia present a promising model system for understanding the effects of mammal (e.g., cattle, sheep, and camels) abundance and distribution on the animal-mediated dispersal patterns of dung beetles.

The current population genetic structure of a species is shaped by the effects of the population demographic history (i.e., evolutionary history) of a species, as well as the level of contemporary gene flow among populations. Therefore, historical anthropogenic events, such as the domestication of wild animals, could have influenced the population demography of dung beetles. Studying the levels of genetic diversity and population genetics structure of G. mopsus in Mongolia would allow us to infer whether anthropogenic activities, such as moving cattle herds by pasturing, have facilitated the dispersal of dung beetles. Phylogeographic approaches can also reveal the historical and evolutionary origins and demographic processes of G. mopsus, including the genetic footprints of historic fluctuations in population demography, which could also be related to human activities15. Phylogeographic analysis might also elucidate the effects of human-mediated disturbances, such pasture reduction or habitat destruction, on the distribution and size of G. mopsus populations. Pasturing areas in Mongolia have been rapidly decreased by urbanization and changes to pasturing style. Livestock pasturing can prevent the range expansion of some dung beetle species ranges by altering vegetation density, temperature, and moisture levels16. Mongolia can be classified into two large biogeographic regions: northern humid areas (grassland and forest-steppe areas) and southern desert or semi-desert areas that have undergone drastic climate changes seasonally17. Therefore, determining both the current population genetic and phylogeographic structures of G. mopsus may help us to elucidate its historical distribution patterns in an ecological context.

Although G. mopsus currently maintains large populations in Mongolia, future population declines are very likely, owing to rapid urbanization and land development (e.g., mining, road construction etc.) now taking place in Mongolia. Here, we first investigated the population genetic and phylogeographic structures of G. mopsus in Mongolia, in order to understand the species’ evolutionary history (e.g., the pattern of migration, population contraction/expansion) and distribution, as well as to determine its current population genetic status.

Results

Mitochondrial diversity

Among 418 COI sequences (658 bp) that were generated from G. mopsus specimens collected in Mongolia (N = 406), as well 12 specimens from Korea, China, and France, we identified 94 polymorphic sites and a total of 230 haplotypes (Table 1, Fig. 1). Only the 14 populations that were represented by more than ten samples (N > 10) were included in downstream population genetic analyses, whereas the samples from all 27 sampling localities were used for haplotype network analysis, and those from the 24 sampling localities in Mongolia were used for AMOVA (Table 1). Sequence correspond to GenBank accession numbers MF674025–MF674381.

Table 1 Information, localities, and diversity indices of Gymnopleurus mopsus samples analyzed in the present study.
Figure 1
figure 1

Localities of Gymnopleurus mopsus samples collected in Mongolia. The map of Mongolia was modified from a map produced using ArcMap 10.552 (www.esri.com) with the World Terrain Basemap (Esri, USGS, NOAA). Each color represents 24 different sampling localities. The 14 populations (N > 10) used for the present study’s population genetics analyses are indicated in bold.

Estimates of haplotype diversity (h) for the 14 large (N > 10) populations ranged from 0.889 to 1.000 in the ST23 and ST24 populations, respectively, with an average of 0.974, whereas estimates of nucleotide diversity (π) ranged from 0.006 to 0.010, with an average of 0.008 (Table 1). Furthermore, the number of haplotypes in these populations ranged from ten to 29 in the ST23 and ST18 populations, respectively, which were located in the central and southernmost of Mongolia. The most common haplotype (H19) was shared by 47 individuals (11.2%) and was obtained from all but seven of the localities in Mongolia (ST02, ST05, ST08, ST09, ST11, ST12, and ST18), and the next most common haplotypes, H21, H15, H12, H30, and H02, accounted for 4.5, 3.6, 2.9, 2.9, and 2.6% of the sampled individuals, respectively. Meanwhile, 181 of the haplotypes (43.4%) were obtained from single individuals and, thus, classified as unique haplotypes (i.e., singletons). Most of the localities yielded relatively high haplotype richness (HR) values, ranging from 5.616 to 10, with the lowest HR value obtained for the ST23 population, which was located in the center of our sampling localities.

COI haplotype network

A haplotype network of COI sequences was star-shaped, with the most common haplotype (H19) located in the most internal position in the network (Fig. 2), thereby indicating presumably an ancestral haplotype. Furthermore, the ancestral H19 haplotype was detected from all but seven of the sampling localities in Mongolia (ST02, ST05, ST08, ST09, ST11, ST12, and ST18), which indicates its wide distribution. Related haplotypes that radiated from H19 were found repeatedly in different parts of the network, which suggests that local population expansion was followed by radiation from the ancestral haplotype. The haplotypes of individuals from Korea (N = 7, H25, H56, H80, H81, H83, and H84) and China (N = 3, H82 and H65) were located at the edge of the network but with only one or two mutational steps separating them from the haplotypes of the Mongolia populations. Meanwhile, the two haplotypes (H105 and H106) from France (N = 2) were positioned at the tip of the network, with four or five mutational steps separating them from the Mongolia haplotypes.

Figure 2
figure 2

Haplotype network of COI sequences from 14 populations of Gymnopleurus mopsus. Each color represents different localities. Each line in the network denotes a single mutational step, irrespective of its length. Numbers shown either inside or outside of each circle and the accompanying numbers in parentheses denote haplotype identification numbers and the numbers of individuals belonging to each haplotype, respectively. The areas of the circles are also proportional to the number of individuals represented by each haplotype, and the small, filled circles represent intermediate haplotypes that were not present in our samples but are necessary to connect the observed haplotypes to the network.

Geographic population structure

Hierarchical analyses of molecular variance (AMOVA) revealed significant spatial genetic structuring among the 24 Mongolia populations, according to three different biogeographic groupings of (1) two groups, the humid areas (North and Central populations) and desert areas (South and East populations); (2) three groups, the North and Central area (NC), South area (S), and East area (E); or 3) three groups of South populations (S1: ST06, ST07, ST08, ST09, ST10, ST11, and ST18; S2: ST12, ST13, ST19, ST20, and ST21; and S3: ST05, ST14, ST16, and ST17). These groupings were based on our a priori expectation of the spatial population differentiation of G. mopsus, based on geographic proximity to one another and to the region’s main roads (Table 2). We found that 3.06 and 2.17% of the molecular variance was partitioned between and among the groups, respectively (ϕ CT ; P < 0.01), whereas most of the variance occurred within populations (ϕ ST ; 96.74 and 97.60% of the total variation, P < 0.01; Table 2). However, only 0.20 and 0.23% of the molecular variance was partitioned among populations within groups (ϕ SC ; P > 0.05). The greatest between-group difference (3.06%) was observed when the four populations (ST01, ST15, ST22, ST23) in the northern humid area were compared to the 20 desert populations. However, the genetic structure (ϕ CT  = − 0.002) among the three South groups (S1, S2, and S3) was insignificant (P = 0.775), which indicated the occurrence of relatively high levels of either historical or contemporary gene flow among populations in South area.

Table 2 Analysis of molecular variance (AMOVA) of Gymnopleurus mopsus in Mongolia.

Genetic differentiation (Fst) and isolation by distance (IBD)

The genetic differentiation between populations was estimated using pairwise Fst statistics (ϕ ST ), which ranged from −0.019 to 0.079 (Table 3). Most of the pairwise differences (ϕ ST values) were relatively small, even between distant populations (e.g., the southernmost and northernmost sampling localities, which were separated by 670 km), which generally indicated low levels of inter-population genetic differentiation. As a result, after sequential Bonferroni correction applied, only 18 of the 91 comparisons among the 14 large (N > 10) populations indicated significant differentiation (P < 0.05; Table 3). The highest ϕ ST value was observed between the ST23 and ST18 populations (Fst = 0.079, P < 0.01). However, neither the ST07 nor ST10 population was significantly differentiated from any of the other populations, which suggests the occurrence of either historical or ongoing gene flow between these and the other populations. Isolation-by-distance (IBD) analysis detected a significant positive relationship between genetic distance (Fst) and geographic distance (decimal degrees; R2 = 0.087, P = 0.001) between the 14 large Mongolia populations (Fig. 3).

Table 3 Pairwise Fst values of Gymnopleurus mopsus populations in Mongolia.
Figure 3
figure 3

Relationship between the genetic distance (Fst) and geographic distance (decimal degree) of paired populations. These two distance measures are significantly correlated (Mantel R2 = 0.0869 P = 0.001).

Demographic history

Phylogeographic analyses revealed a recent, sudden expansion of G. mopsus populations, based on several lines of evidence, including neutrality tests and the mismatch distribution analysis (Table 1, Fig. 4). For example, both the Tajima’s D and Fu’s Fs statistics for neutrality were negative for all but the ST23 population, which was located in the central area of all the main roads meet in Mongolia (Fig. 1). Moreover, the neutrality test was significantly violated when the all of the 14 populations were pooled as a single data set (Table 1), with Tajima’s D and Fu’s Fs values of −1.76936 (P = 0.005) and −24.64263 (P = 0.001), respectively, which suggests that a sudden historical population expansion occurred following a population bottleneck. These significant negative values also indicate that the hypotheses of population stasis, selective neutrality, and population equilibrium can be rejected.

Figure 4
figure 4

Mismatch distribution of nucleotide differences between pairs of mtDNA haplotypes. There were no significant differences between the observed and expected frequencies (sum of squared deviation = 0.000344, P = 0.75).

A mismatch distribution was clearly smooth and unimodal for all of the 14 large populations, further supporting the model of sudden population expansion18 (Table 1, Fig. 4). The SSD (sum of squared deviations) statistic indicated that the observed frequency distribution fit the distribution expected under the model of demographic expansion (SSD = 0.00037, P = 0.92, raggedness index 0.00604, P = 0.77), and both the non-significant SSD statistic and small raggedness index indicated sudden population expansion, again confirming the results of the neutrality tests, whereas the non-significant raggedness index indicated a good fit to a model of population expansion19 (under the demographic expansion model). Based on the pooled Mongolia populations, the mismatch distribution again fit the model of demographic expansion.

Historical demographic reconstructions, based on Bayesian skyline plot (BSP) analysis, further supported a model of sudden population expansion (Fig. 5). Indeed, the Mongolia populations appear to have undergone a population expansion ~2500 years ago, followed by continuous increases in effective population size for a long period afterward (Fig. 5). This estimate might indicate that the timing of G. mopsus expansion corresponded to both the expansion of human populations and advent of large mammal domestication in Mongolia.

Figure 5
figure 5

Bayesian skyline plot of COI sequences from Gymnopleurus mopsus. The black solid line indicates the median effective population size, and the dashed line indicates the range of 95% highest posterior density (HPD) intervals.

Discussion

To the best of our knowledge, the present study is the first to examine the current population genetic structure and historical population demography of G. mopsus spanning major distributional ranges across Mongolia using mitochondrial COI sequences. The high frequency of unique haplotypes (singletons) and limited nucleotide differences between haplotypes could reflect rapid and recent population expansion, following a population bottleneck. Otherwise, such rare mtDNA variants would be eliminated by genetic drift. The significant negative Tajima’s D and Fu’s Fs statistics of the pooled G. mopsus populations also reject the hypothesis of the constant population size assumed under mutation-drift equilibrium20. In addition, the star-shaped topology of the haplotype network is another line of evidence that suggests the sudden demographic expansion of dung beetles, as in previous studies on flies21,22, moths23, leaf beetles24, and marine gastropods25, as well as domestic cattle and goats26,27. The most common and presumably ancestral G. mopsus haplotype (H19) was located in the center of the haplotype network and was connected to a large number of infrequent haplotypes by only a few mutational steps. This pattern is generally interpreted to indicate a population that has recently expanded in size from a small number of founders following a population bottleneck28. The single bell-shaped mismatch distribution for the COI haplotypes also supported the hypothesis of population range expansion29, whereas the hypothesis of sudden population expansion was further supported by the abrupt increase in effective population sizes, as suggested by BSP analysis (Fig. 5). Thus, all the analyses of COI sequences from G. mopsus in Mongolia indicated sudden population expansion.

The historical demographic expansion of domesticated animals, which has resulted from archeological or anthropogenic events (e.g., domestication), has been well documented in a number of taxa21,27,30,31. We speculate that the recent population expansion observed in G. mopsus is related to the population dynamics (e.g., expansion) of large mammals, such as cattle, horses, and camels, that provide food and breeding ground for G. mopsus and, therefore, limit its abundance and distribution. The expansion of G. mopsus populations is estimated to have occurred ~2500 years ago, based on BSP analysis with estimated substitution rate. Such estimates depend on which molecular clock rates are applied, since different lineages are often subject to different mitochondrial substitution rates32. However, the hypothesis of a recent history of expansion was supported by many other analyses, as well, and the population expansion of G. mopsus was estimated to have occurred a few thousand years ago, based on the BSP analysis that accounts for the variation in our time estimation. Interestingly, our estimated timing of G. mopsus expansion seems to correspond to the advent of livestock domestication in Mongolia. Thus, the domestication of cattle and livestock was a turning point in the histories of both human and dung beetle populations33. The earliest domestication of wild cattle in northeastern Asia, including Mongolia, North China, Korea, and Japan, has been suggested to have occurred between 4000 and 5000 years ago34. The similar population structure of livestock and G. mopsus in Mongolia might further support the hypothesis of livestock-mediated expansion of G. mopsus populations. Recent population expansion in various domesticated animals, including cattle26 and goats27,35, has been supported by bell-shaped mismatch distributions or star-shaped haplotype networks, much like the ones presented for G. mopsus populations in the present study. Furthermore, other population genetics studies of cattle populations have also reported unusually high genetic diversity within domesticated populations in Mongolia35, as well as domestication-driven population expansion34. Moreover, such studies have also observed that the most common and presumably most ancestral haplotypes are located at the centers of the haplotype networks and that a number of rare haplotypes originate from that haplotype with only a few mutational steps.

In a previous study of cattle dung-using Helictopleurus species (H. neoamplicollis and H. marsyas) in Madagascar, researchers used COI sequences to document a rapid population expansion that followed a resource shift36. The resource shift from primate droppings to cattle dung resulted in extended geographic ranges and population expansion with low genetic diversity and was associated with the historic introduction of cattle ~1500 years ago36. Indeed, the two Helictopleurus species exhibited considerably lower COI haplotype diversity than more wild, forest-dwelling dung beetles with more varied diets36. Another common cattle dung-using species, H. quadripunctatus, possesses a more intermediate diet, in that it is less specific, and, compared with forest-dwelling species, exhibits more similarities to the G. mopsus populations in Mongolia, particularly in regards to its high haplotype diversity, low nucleotide diversity, and significant IBD. In fact, like H. quadripunctatus, G. mopsus feeds on a broad range of food sources (e.g., sheep and camel dungs). Thus, a resource shift by G. mopsus from wild animal dung to domesticated cattle droppings during the domestication of cattle in Mongolia could account for the species’ demographic expansion. The contemporary cattle population in Mongolia is known to have originated from hybridization with wild yak populations ~7300 years ago and has probably undergone rapid population expansion for the last 3600 years37.

The dispersal potential (dispersal capacity) of a species is a critical factor in characterizing its present-day population genetic structure. There is evidence that the dispersal and movement patterns of dung beetles are strongly associated with those of mammals13,14. Overall, the weak genetic structuring observed among the G. mopsus populations of Mongolia, as indicated by low Fst values, suggests the occurrence of either contemporary or historical gene flow. In particular, the significant relationship between the genetic and geographic distances of population pairs (IBD) indicates that the geographic proximity of populations significantly influences the spatial genetic structure of G. mopsus, probably resulting from the fact that its dispersal ability is associated with the geographic closeness.

The movement processes of dung beetles might be stochastic since individuals could move short distances between dung pats, as well as long distances between pastures, during foraging. Maximum flight distance of 1.5–2 kilometers per day has been reported in a previous study38. Thus, the dispersal and migration distances of this small beetle are likely limited, especially when considering the long distances between certain populations (e.g., ~670 km). However, if the migration of G. mopsus is promoted by other factors, such as livestock dispersal (i.e., herd movement during pasturing), the high level of connectivity between the G. mopsus populations becomes much more conceivable. Pasturing is a popular type of livestock farming (i.e., nomadic livestock husbandry) in Mongolia, and farmers often move their cattle herds among grasslands on a seasonal basis, following road networks. Such long-distance dispersal, either by long-distance flight or long-distance animal-mediated movement, could explain the low levels of inter-population genetic differentiation observed in the present study.

The AMOVA indicated weak, but detectable genetic structuring among certain G. mopsus population groups (i.e., North and Central, East, South area) that were clustered by the major roads that are seasonally used for moving cattle (Table 2). However, the species’ complicated dispersal system (i.e., short-distance foraging movement and long-distance dispersal), which is the result of dung availability and possibly other biological factors, could lead to variability in the species’ dispersal. Therefore, further investigation of G. mopsus foraging behavior and dispersal potential is needed, especially elucidate the influence of dung beetle dispersal on population connectivity in Mongolia.

In the present study, we also identified an extraordinarily high level of genetic diversity, which could indicate extremely large effective population sizes (Ne) and, possibly, high levels of historical and ongoing gene flow. Large Ne values typically indicate that populations have undergone historical demographic expansion but are now stable, thereby suggesting that G. mopsus originated in Mongolia. Our haplotype network analysis, which included individuals from multiple regions, including South Korea, China, and France, also supports the hypothesis that G. mopsus originated in Mongolia. Indeed, the haplotypes from South Korea and China were connected to those from Mongolia by only one or two mutational steps, and the haplotypes from France were not much more removed. It is possible that Mongolia is the geographic origin of G. mopsus expansion and that the species radiated (or migrated) to both the East and West during the demographic processes of population expansion.

The high genetic diversity of the remnant G. mopsus populations in Mongolia might allow the species to adapt to changes in local environments, including changes in climate, vegetation, and dung availability. Mongolia includes both arid and semi-arid regions, where climate change is generally more drastic than in other regions17. In particular, the largest genetic differentiation was detected between populations from the northern humid areas (grassland or forest-steppe areas) and southern dessert area, which were characterized by different vegetation types, i.e., grassland or forest-steppe areas and desert, respectively. A previous study39 reported that there was substantial year-to-year variation in the insect community of Mongolia. For example, in one year, the number of coleopteran species at a region near our ST23 locality was reduced by 60%39. The ST23 population, which was located at the center of a main road and was the most urbanized of our sampling localities, possessed the lowest genetic diversity of the Mongolia populations and yielded only negative Tajima’s D and Fu’s Fs values. This could indicate urbanization effects, which would indicate that the species is at a high risk of population size reduction in Mongolia. However, the widespread COI haplotypes and the pattern of star-shaped haplotype network may still indicate that the species is highly adaptive. Therefore, in order to understand the relationship between the genetic diversity and ecological adaptive potential of G. mopsus, future studies should investigate the response of G. mopsus to diverse environmental dynamics.

To design effective conservation and restoration management plans for G. mopsus in South Korea, understanding the genetic characteristics (e.g., genetic diversity, genetic structure, and phylogenetic or phylogeographic relationships) of potential source populations, like those of Mongolia, will be crucial40. The National Institute of Biological Resources in South Korea established a restoration program in 2012. The conservation and restoration efforts of endangered and regionally extinct species often require the introduction of individuals from source populations, whose genetic and evolutionary lineages are comparable to those of the recipient populations. Therefore, both the population genetic structure and phylogenetic relationships of source populations should be considered carefully when developing reintroduction programs41. The findings of the present study highlight the value of the Mongolia populations for G. mopsus conservation and suggest that the populations could be used as source populations for restoration efforts in South Korea. Additionally, this species has been suggested to have a very large distributional range in Palearctic region42. Therefore, the observed phylogeographic trends by molecular data in this study need to be corroborated in a wider spectrum of its geographic distribution in future research, and it will also allow for exploring the conservation values of other remnant and potentially important populations of this species.

The genetic data presented by the present study provide valuable information about the phylogeographic structure, evolutionary history, and the genetic structure of G. mopsus populations across Mongolia. Additional ecological studies could further contribute to the effectiveness of dung beetle restoration in South Korea. The results of the present establish a basis for future research regarding the effective conservation and restoration of this highly cherished dung beetle species.

Methods

Sample collection, DNA extraction, and mtDNA COI sequencing

Between 2014 and 2016, a total of 406 adult G. mopsus individuals were collected from 24 localities in Mongolia. Upon collection, the specimens were washed with distilled water to remove dirt and animal dung and then stored in absolute ethanol until subject to DNA extraction. Dried specimens that were collected from South Korea from 1931 to 1964 (N = 7), France in 1971 (N = 2) and China in 2015 (N = 3) were also included (Table 1) because the populations in those regions are now regionally extinct. Especially, the species in the western part of Europe (e.g., France) is almost extinct or in great danger of extinction compared to one in the eastern part of Europe (e.g., Greece)43.

Genomic DNA was isolated from a femur of each specimen using the DNeasy Blood & Tissue Kit (Qiagen, USA), and polymerase chain reaction (PCR) was used to amplify a 658 bp fragment of the mitochondrial cytochrome oxidase I gene (COI), using 20-μl reactions, universal forward (LCO1490: 5-GGT CAA CAA ATC ATA AAG ATA TTG G-3)44 and reverse (C1-N-2191: 5-CCC GGT AAA ATT AAA ATA TAA ACT TC-3)45 primers using AccuPower PCR Premix (Bioneer, Korea), with the following conditions: an initial denaturation at 94 °C for 1 min; followed by 35 cycles of 30 s at 94 °C, 30 s at 50 °C, and 1–2 min at 72 °C; and a final extension step of 72 °C for 7 min. The PCR products were visualized on 1.5% agarose gels using UV light, purified enzymatically using Exonuclease I and Shrimp Alkaline Phosphatase (New England BioLabs, USA), and then sequenced by Macrogen INC Sequencing (Korea) using an ABI PRISM 3130xl Genetic Analyzer (Applied Biosystems, USA). The resulting sequences were deposited in GenBank under accession numbers MF674025–MF674381. Previously published COI sequences collected in South Korea from 1931 to 1964 (KT454107-454111, KX879011-879012) and from Mongolia (KT454096-454106, KX879013-879046, KX879050-879055) and China (KX879047-879049) were also included12.

Haplotype network and genetic diversity analyses

The COI sequences were aligned using Clustal W in BioEdit v.7.0.146 and checked manually, and the haplotypes of the 418 individuals were determined using Neighbor-Joining algorithms in MEGA 7.047. A haplotype network was then inferred using HAPSTAR v0.748.

The total and population genetic diversity indices, including the number of haplotypes (HN), haplotype diversity (h), and nucleotide diversity (π), were estimated using ARLEQUIN v3.549, and Tajima’s D and Fu’s Fs statistics were also analyzed to test for neutrality. Ten populations that had insufficient sample sizes (N < 10) were excluded from all subsequent analyses, except the AMOVA. Haplotype richness was calculated using a rarefaction method in CONTRIB v1.0250 that corrects for unequal sample sizes among populations.

Population genetic differentiation and isolation by distance (IBD)

The fixation index (Fst)51 was calculated for population pairs using ARLEQUIN v3.5, and the geographical distance between sampling localities was measured in decimal degrees using ArcGIS Desktop 10.5 (ESRI, USA)52. The correlation between genetic distance (Fst) and geographic distance (decimal degrees), i.e., the Mantel test (spatial autocorrelation multivariate approach)53, was used to test for isolation by distance (IBD) in GenAlEx v6.50254.

Geographic population structure

A hierarchical analysis of molecular variance (AMOVA) was performed using ARLEQUIN v3.5, in order to assess the spatial population genetic structure of G. mopsus across its main range in Mongolia. Each of the 24 populations (Table 1) was assigned to one of four biogeographic areas, i.e., North (N), Central (C), South (S), or East (E), following the major road networks in Mongolia. For this analysis, the populations were either separated into either (1) two groups, the northern humid areas (North and Central populations) and desert areas (South and East populations); (2) three groups (NC, S, and E), which were classified based on the major roads (N: ST01, ST15, ST22, and ST23 populations; S: ST05, ST06, ST07, ST08, ST09, ST10, ST11, ST12, ST13, ST14, ST16, ST17, ST18, ST19, ST20, and ST21 populations; and E: ST02, ST03, ST04, and ST24 populations); or (3) three groups (South populations only; S1: ST06, ST07, ST08, ST09, ST10, ST11, and ST18 populations; S2: ST12, ST13, ST19, ST20, and ST21 populations; and S3: ST05, ST14, ST16, and ST17 populations; Fig. 1, Table 2). Total molecular variance was partitioned among groups (Fct = ‘inter-group’ genetic variation), populations within groups (Fsc = ‘intra-group’ genetic variation), and populations, regardless of groupings (Fst = ‘inter-population’).

Historical demography

The expansion of G. mopsus populations was tested using mismatch distribution analysis and both Tajima’s D55 and Fu’s Fs neutrality tests56 in ARLEQUIN v3.5. Meanwhile, the demographic history of G. mopsus in Mongolia was determined using Bayesian Skyline Plot (BSP) analysis in BEAST v2.4.557. The GTR + G + I model of evolution was selected as the best model by jModeltest58. MCMC run was performed with 300 M iterations, with sampling every 1000 iterations, the first 10% of generations discarded as “burn-in”, a constant skyline model, and Bayesian skyline tree priors. The molecular clock rate and substitution rates were estimated as a default function. The run was examined using Tracer v.1.659, and all the parameters were analyzed to confirm that the effective sample sizes were sufficient (ESS > 200).