Phylogenomics reveals the evolution, biogeography, and diversification history of voles in the Hengduan Mountains

Wang, XiaoYun; Liang, Dan; Wang, XuMing; Tang, MingKun; Liu, Yang; Liu, ShaoYing; Zhang, Peng

doi:10.1038/s42003-022-04108-y

Download PDF

Article
Open access
Published: 25 October 2022

Phylogenomics reveals the evolution, biogeography, and diversification history of voles in the Hengduan Mountains

Communications Biology volume 5, Article number: 1124 (2022) Cite this article

3042 Accesses
3 Citations
10 Altmetric
Metrics details

Subjects

Abstract

The Hengduan Mountains (HDM) of China are a biodiversity hotspot whose temperate flora and fauna are among the world’s richest. However, the origin and evolution of biodiversity in the HDM remain poorly understood, especially in mammals. Given that the HDM shows the highest richness of vole species in the world, we used whole-exome capture sequencing data from the currently most comprehensive sampling of HDM voles to investigate their evolutionary history and diversification patterns. We reconstructed a robust phylogeny and re-estimated divergence times of the HDM voles. We found that all HDM voles could be divided into a western lineage (Volemys, Proedromys, and Neodon) and an eastern lineage (Caryomys and Eothenomys), and the two lineages originated from two migration events from North Eurasia to the HDM approximately 9 Mya. Both vole lineages underwent a significant acceleration of net diversification from 8–5 Mya, which was temporally congruent with the orogeny of the HDM region. We also identified strong intertribal gene flow among the HDM voles and hypothesized that frequent gene flow might have facilitated the speciation burst of the HDM voles. Our study highlights the importance of both environmental and biotic factors in shaping the biodiversity of mammals in mountain ecosystems.

Hybrid speciation driven by multilocus introgression of ecological traits

Article Open access 17 April 2024

Diversity-dependent speciation and extinction in hominins

Article Open access 17 April 2024

Complexity of avian evolution revealed by family-level genomes

Article 01 April 2024

Introduction

Despite covering only approximately one–eighth of the Earth’s land surface, mountain regions harbor one–third of all global terrestrial species^1,2. Why do so many species occur in mountains? A common hypothesis is that mountain uplift drives the rapid speciation of organisms because orogeny creates a complex range of topographies, climates and habitats where species evolve and diversify^3,4,5,6,7. On the other hand, biological factors such as biome changes associated with orogeny and genetic admixture among lineages may also contribute to the speciation process^8,9. Illuminating the contribution of both environmental and biological factors during the development of the remarkable biodiversity in mountains is important for understanding how evolutionary processes interact with changing global environments to shape biodiversity. Among the many mountain ecosystems, the Hengduan Mountains (HDM) are an unusual, enigmatic biodiversity hotspot located in the southeastern corner of the Qinghai–Tibetan Plateau (QTP)^6,10,11 (Fig. 1a). The temperate flora and fauna of the HDM region are among the world’s richest, including approximately 12,000 species of vascular plants and 1,500 terrestrial vertebrates, many of which are endemic¹⁰. Elucidating the evolutionary mechanisms driving the formation of biodiversity in the HDM has long attracted the attention of evolutionary biologists. Central to addressing this question is knowledge of the speciation tempo (rate) and mode (colonization via dispersal or in situ lineage diversification) of the resident lineages of the HDM⁶. To this end, scientists have performed a great deal of work on plants^6,7,12, amphibians¹³, and birds^14,15,16 to determine the relationship between mountain uplift and species diversification in the HDM region. In contrast, although mammals are the flagship group of terrestrial vertebrates, mammals in the HDM region have been the subject of relatively few investigations addressing their diversification tempo and mode.

**Fig. 1: a Geographic location of the Hengduan Mountains. b Global species richness map of Arvicolinae.**

Arvicolinae (Rodentia: Cricetidae) is a subfamily of rodents that includes voles, lemmings, and muskrats. It is a highly diverse, young, fast-evolving rodent group comprising ten tribes, 28 genera and over 150 species^17,18, with new species constantly being discovered and described^19,20,21. Arvicolinae are widespread in various landscapes of the Northern Hemisphere but are mainly concentrated in mountain areas and show several species diversity hotspots, including the North Rocky Mountains, the Mountains of Central Asia, and the HDM (Fig. 1b). Remarkably, the HDM region harbors the world’s richest species diversity of Arvicolinae, including ~35 species of voles, most of which are endemic. Intriguingly, the HDM region exhibits a unique island-like pattern of vole species richness, in which adjacent areas show extremely low species numbers, while the other two species diversity hotspots show gradual declines in species richness toward their adjacent areas (Fig. 1b). The unusual distribution pattern and high richness of vole species in the HDM provide an ideal mammalian model system for exploring the processes that gave rise to the biodiversity of the HDM. When and how did vole species accumulate in the HDM region? To answer this question, a robust phylogenetic framework and evolutionary timescale of Arvicolinae that includes all HDM vole species, is essential.

Although the phylogeny and evolutionary timescale of Arvicolinae have been studied for decades using both morphological and molecular data, there are important issues that remain to be addressed. First, from the perspective of large-scale systematic frameworks of Arvicolinae, the interrelationships among tribes remain unresolved. The poor resolution of many nodes in the Arvicolinae phylogeny is likely a result of the small number of molecular markers used and/or high rates of missing data in these analyses (e.g., eight genes, 9,002 bp with 72.8% missing data or 11 genes, 15,535 bp with 75% missing data)^22,23. Whole mitochondrial genomes with low missing data rates have also been applied²⁴, but mitochondrial genes are genetically linked and highly compositionally heterogenous. Second, a general consensus on the divergence times of the major Arvicolinae lineages is also lacking. The known fossil records suggest that extant arvicolids presumably emerged in the late Miocene—early Pliocene (~8–5 Mya). However, molecular data have produced a wide range of estimated origin times of extant Arvicolinae, from older estimates of ~15.2–20.9 Mya^23,25,26,27 to much younger estimates of ~7 Mya^24,28. Third, due to the difficulty of sample collection, previous studies on the phylogenetic relationships and divergence times of Arvicolinae have typically included only limited taxon sampling of HDM vole species. All of the above issues have prevented the reconstruction of a reliable, comprehensive evolutionary history of HDM vole species, which is crucial for understanding the macroevolutionary and ecological processes that shape their diversity in the HDM region.

The aims of this study were to reconstruct the evolutionary history and diversification process of HDM voles and to discuss the relationships among species biodiversity, genetic exchange, and mountain uplift in the HDM. To do this, we collected 121 rodent specimens including all known HDM vole species and used a whole-exome sequencing technique to generate genome-scale DNA sequence data. The phylogenetic analysis of these data led to a well-supported hypothesis of the relationships among voles that was highly concordant across multiple analytical approaches. With this phylogenetic framework, we further investigated the divergence times, biogeographic history, tempo and mode of diversification, and gene flow of the HDM voles to reveal the environmental and biotic factors that have driven their evolution and diversification.

Results and discussion

Data processing and the datasets

Based on whole-exome capture sequencing, we obtained a total of 391.7 GB of Illumina PE150 sequencing data for 115 Arvicolinae and 6 Cricetinae samples, with an average of 3.2 GB of data per sample. For each sample, ~8.84% of read sequences could be mapped to the reference coding sequences (CDSs). The number of extracted CDSs ranged from 915 (Alticola argentatus 26051) to 18,997 (Myopus schisticolor 09RAP055), with an average of 10,958 CDSs per sample (Supplementary Fig. S1). At the species level, we obtained at least 6000 CDSs per species, except for the species Alticola argentatus. In addition, we extracted 117 mitochondrial genomes (four samples failed) from our capture data. The extracted CDSs for each sample were deposited in the Mendeley Data Repository (https://data.mendeley.com/datasets/mwyj4m963h), and the newly sequenced mitochondrial genomes were deposited in GenBank (for accession numbers, see Supplementary Table S1). These CDS and mitochondrial genome data will be a useful and convenient resource for future studies on the biology, classification, and adaptive evolution of Arvicolinae.

We constructed three datasets from these CDS and mitochondrial genome data. The individual-level nuclear dataset contained 6517 CDSs generated entirely within this study, comprising sequences from 115 Arvicolinae and 7 Cricetinae individuals (12,041,474 nt in length and 48.2% complete), which were used to provide a basic framework for the phylogeny of Arvicolinae, emphasizing the voles in the HDM. The species-level nuclear dataset was obtained by merging the data of all individuals from each nominal species and included 6078 CDSs from 58 Arvicolinae and three Cricetinae species (10,788,858 nt in length and 69.6% complete), with the aim of reducing missing data. Then, the mitochondrial genome dataset contained the 117 mtDNA sequences newly generated in this study and 98 published mtDNA sequences, comprising sequences from 197 Arvicolinae taxa and 18 Cricetinae outgroup taxa (15,160 nt in length and 97.9% complete). It included more tribes than the nuclear datasets and can provide a more comprehensive framework for the phylogeny of Arvicolinae.

Phylogeny of arvicolids, with emphasis on voles in the HDM

The concatenated maximum likelihood (IQ-TREE) analysis of the individual-level nuclear dataset produced a well-resolved arvicolid phylogeny, with 117 of the 119 nodes showing UFBS values = 100% (Fig. 2). The species tree analysis (ASTRAL) of this dataset produced an identical phylogeny, with 87.4% of nodes showing bootstrap values = 100% (Fig. 2). The phylogenetic trees of the species-level nuclear dataset were completely congruent with those of the individual-level nuclear dataset and showed a higher resolution: 92% of nodes had UFBS values = 100% and ASTRAL bootstrap values = 100% (Supplementary Fig. S2). To investigate whether the branches with high nodal support were robust, we estimated the gene concordance factors (gCFs) and site concordance factors (sCFs) of each branch of both the individual and species-level phylogenies. The spatial correlations between the gCF, sCF and bootstrap values showed that high bootstrap values always coincided with high gCF and sCF values, which suggested that the phylogenetic signals of the two nuclear datasets were strong and congruent and that the resulting phylogenies were robust (Supplementary Fig. S3 and Supplementary Table S2). Finally, the ML tree inferred from the mitochondrial genome dataset was essentially congruent with the results of the nuclear datasets, but the support for the deep nodes (among tribes) was not strong (UFBS < 95%) (Supplementary Fig. S4).

**Fig. 2: The phylogeny of Arvicolinae with a focus on the HDM taxa.**

The placement of the genus Arvicola is one of the most controversial issues concerning the phylogenetic relationships of Arvicolinae. Traditionally, Arvicola has been considered a member of tribe Arvicolini¹⁸. Molecular studies based on a few mitochondrial and nuclear genes either supported^29,30,31 or opposed^28,32 this classification, but none of them received strong support. Based on mitochondrial genomes, Abramson et al. ²⁴. found that Arvicola clustered with Lagurini, but they considered this result to be a phylogenetic artifact related to nucleotide composition bias or a long-branch attraction effect. Our mitochondrial genome tree did not robustly resolve the placement of Arvicola (Supplementary Fig. S4), implying insufficient mtDNA information to answer this question. In comparison, our nuclear tree based on thousands of nuclear genes strongly recovered Arvicola as the sister group of all other members of Arvicolini (Fig. 2), supporting the traditional classification. However, because the branch separating Arvicola and other members of Arvicolini was rather long and the genetic distances from Arvicola, Lagurini and Ellobiusini to the other members of Arvicolini were similar (Fig. 2), we suggested that the traditional Arvicolini be split into two tribes (Arvicolini and Microtini), following the suggestion of Liu et al. ^32,33. The type genera of Arvicolini and Microtini should be Arvicola and Microtus, respectively.

Regarding the basal lineages of Arvicolinae, our mitochondrial genome tree showed that North American Ondatrini is the sister group of all other Arvicolinae taxa and that Eurasian Lemmini is the sister group of a clade containing Myodini, Microtini, Arvicolini, Ellobiusini, Lagurini, and Pliomyini, albeit with only moderate support (Supplementary Fig. S4). These results are different from those of Abramson et al. ²⁴. based on 13 mitochondrial protein-coding genes, suggesting that Lemmini is the sister group of all other Arvicolinae tribes. In our mitochondrial data analysis, we included RNA genes (two rRNA and twenty-two tRNA genes) in addition to the 13 protein-coding genes, which may be one of the possible reasons for the discordant results. On the other hand, our nuclear tree strongly indicated that Ondatrini branched earlier than Lemmini, lending support to our mitochondrial results, although the nuclear datasets included fewer tribes than the mitochondrial dataset (Fig. 2). In addition, our nuclear data analyses robustly resolved all phylogenetic relationships among genera. In contrast, previous attempts to resolve phylogenetic relationships within the Arvicolinae subfamily using morphological characters, combinations of mitochondrial and nuclear genes, or complete mitochondrial genomes all yielded weakly supported and conflicting topologies^{23,24,28,30,31,34}. This demonstrated that the phylogenomic nuclear gene dataset was highly effective for phylogenetic reconstruction within Arvicolinae. Therefore, a comprehensive, robust phylogeny of Arvicolinae requires further phylogenomic analyses of nuclear genes from more taxa in the future.

All the vole species distributed in the HDM region could be placed into two tribes, Myodini and Microtini (Fig. 2 and Supplementary Fig. S4). Tribe Myodini included five genera split into two major lineages. One lineage included Myodes, Alticola and Craseomys. Their species are widespread across the Northern Hemisphere. Another lineage included Caryomys and Eothenomys. Caryomys is mainly distributed in the monsoon areas of Eastern Asia, and Eothenomys is mainly distributed in the HDM region. Tribe Microtini contained three HDM genera, Volemys, Proedromys, and Neodon, but these three genera did not form a clade. Volemys and Proedromys were more closely related and were located near the base of Microtini (Fig. 2 and Supplementary Fig. S4). The genus Neodon was deeply nested within the tribe Microtini and was more closely related to the northern China genus Lasiopodomys. These phylogenetic relationships implied that the origin of the HDM voles was polyphyletic, possibly resulting from multiple migration events.

Timing and migration route of HDM vole origination

Our mitochondrial genome dataset (analyzed at the genus level) and species-level nuclear dataset produced similar divergence time estimates for Arvicolinae evolution (Supplementary Figs. S5 and S6). The major difference between the two estimates was that the mitochondrial times tended to be slightly younger than the nuclear times at shallower nodes. Both mitochondrial and nuclear time trees showed that the most recent common ancestor (MRCA) of extant arvicolids occurred in the middle Miocene at ~14.92 Mya (mitochondrial) or ~14.99 Mya (nuclear) (Fig. 3), immediately after the Miocene Climatic Optimum (17–15 Mya). This time estimate corresponded to the molecular dating results obtained in several previous studies^23,27 but was much older than those estimated by some other authors (~7 Mya)^24,28. The MRCA of the tribe Microtini was dated to 9.71 Mya (95% HPD: 8.49–10.99; Supplementary Fig. S6), and the MRCA of the genera Caryomys and Eothenomys was dated at 9.03 Mya (95% HPD: 7.92–10.1; Supplementary Fig. S6), which corresponded to the origin times of the two major lineages of the HDM voles. Finally, the four vole genera that are largely endemic to the HDM, Eothenomys, Volemys, Proedromys and Neodon, originated ~6–7 Mya.

**Fig. 3: Divergence times and dispersal history of voles.**

We estimated the historical biogeography of the global Arvicolinae based on the mitochondrial genome dataset (genus-level) using BioGeoBEARS. All models with an additional J parameter that modeled long-distance or “jump” dispersal performed significantly better than the original models (Supplementary Fig. S7), suggesting that long-distance dispersal was a common phenomenon during the diversification of arvicolids. The DEC + J model and the DIVALIKE + J model produced similar ancestral range reconstruction results (Supplementary Fig. S7). Because the oldest known Arvicolinae fossil was found in the Palearctic realm and a fossil record of early arvicolids in North America is lacking, it is generally thought that the common ancestor of Arvicolinae first appeared in Eurasia rather than in North America^35,36 and migrated from the Palearctic to the Nearctic²⁴. However, our biogeographic analyses showed that the common ancestor of extant Arvicolinae most likely first appeared in North America (probability 45.2%) (Fig. 3a). The second probable ancestral area of Arvicolinae was a mixed region of North America and North Eurasia (probability ~ 44%). All these results suggest that the possibility that North America was the region of origin cannot be ruled out, and the migration route of early Arvicolinae might have been from North America to Eurasia.

The mitochondrial genome dataset showed that the common ancestor of all vole species in the HDM came from North Eurasia and dispersed southward thereafter (Fig. 3a). The species-level nuclear dataset provided further information on the origin and migration history of HDM voles (Fig. 3b; See detail results in Supplementary Fig. S8). The common ancestor of the first HDM vole lineage (including Volemys, Proedromys, and Neodon) appeared in arid and semiarid areas of Eurasia (probability ~95%; Supplementary Fig. S8) and entered the HDM region 9.71 Ma (Fig. 3b). The current distribution of this lineage is concentrated in the western part of the HDM region, with some species extending to the QTP. The ancestral area of the other HDM lineage (including Caryomys and Eothenomys) was estimated to be the monsoon area of Eurasia (probability ~65%; Supplementary Fig. S8). This lineage migrated to the HDM region 9.03 Ma (Fig. 3b). The current distribution of this lineage is concentrated in the eastern part of the HDM region, with some species extending to eastern monsoon areas. Notably, the basal taxa of the two vole lineages (Proedromys bedfordi and Caryomys eva) were all distributed in the northeastern part of the HDM (see the sample distribution map in Fig. 2 and Supplementary Fig. S9), suggesting that the ancestral voles may have entered the HDM region from the Northeast.

According to these results, we argued that the voles currently distributed in the HDM region likely resulted from two independent colonization events from northern Eurasia. The ancestors of Volemys, Proedromys, and Neodon passed through the arid and semiarid areas of Eurasia and entered the HDM region from the northeast in the late Miocene (~9.71 Mya), after which they migrated southwestward, ultimately reaching the QTP (Fig. 3c). Almost at the same time, the ancestors of Caryomys and Eothenomys started from the northern part of the monsoon areas of Eurasia and entered the HDM region from the northeast (~9.03 Mya). Their decedents further migrated southeastward, with some species spreading out of the HDM region and finally reaching the southern part of the monsoon areas of Eurasia (Fig. 3c).

Orogeny promotes the diversification of voles in the HDM

Orogeny creates a variety of environmental conditions, including the generation of climatic niches, new habitats or food resources and dispersal barriers, that promote the speciation of organisms^3,6,37. The HDM are a geologically young region but possess the highest species richness of voles on a global scale. Is the high species richness of voles related to the recent orogeny in the HDM region?

To answer this question, we first investigated the evolutionary trend of the elevation adaptation of voles. Our ancestral elevation reconstruction results showed that the ancestors of arvicolids lived in a low-elevation region (1000–2000 m) and that the ancestors of the two HDM vole lineages were distributed in middle elevations (2000–2500 m) (Fig. 4a). During their evolution, the two HDM vole lineages have adapted to higher elevation habitats almost continuously (Fig. 4a). The evolutionary trend of high-elevation adaptation was more obvious in the western HDM lineage (Volemys, Proedromys, and Neodon) (purple lines; Fig. 4a) than in the eastern HDM lineage (Caryomys and Eothenomys) (green lines; Fig. 4a), in accord with the terrain features of the HDM region, in which the western part is higher than the eastern part. In contrast, the direction of elevation adaptation among vole species outside the HDM region was random, with the species occupying habitats at either higher or lower elevations (black lines; Fig. 4a). These results indicate that the development of vole biodiversity in the HDM occurred via an uplift-driven diversification process.

**Fig. 4: Diversification mode and tempo of voles in the Hengduan Mountains.**

To further verify this hypothesis, we estimated the speciation mode of all HDM vole species. We identified 44 in situ diversification events in the HDM region and 13 colonization events (Supplementary Fig. S10; Supplementary Table S3). In situ diversification events accounted for over two–thirds of the total speciation events (44/57 = 77.2%), and the rate of in situ diversification was 2–3 times faster than that of colonization (Fig. 4b), suggesting that in situ diversification was the main speciation mode of the HDM voles. The rates of the in situ diversification and colonization of the HDM voles over time showed that the two types of speciation began at similar rates ~10 Mya. After 8 Mya, the rate of in situ diversification accelerated dramatically and reached its peaked rate ~6–5 Mya, while the rate of colonization did not change as greatly (Fig. 4b). Notably, the rates of in situ diversification in the western and eastern vole lineages of the HDM exhibited remarkable synchrony over time (Fig. 4c). The higher diversification rates of voles and the diversification synchronicity of the two vole lineages ~8–4 Mya coincided with the rapid mountain uplift period in the HDM region from the late Miocene to the late Pliocene^38,39,40,41, which strongly suggested a close relationship between the diversification of the HDM voles and the orogenic activity of the HDM.

As part of the continued expansion of QTP orogeny, the HDM underwent recent rapid uplift during the late Miocene, reaching their peak elevation before the late Pliocene⁴⁰. The intensive orogeny of the HDM resulted in extreme ruggedness of the terrain as well as remarkable environmental heterogeneity, creating complex microclimates and fragmented habitat niches^1,42. The terrain and climate condition changes forced the voles to inhabit narrower regions and smaller elevational ranges, which in turn facilitated the speciation of the high-altitude-adapted voles. At the same time, the Asian monsoon intensified and promoted the growth of plants in the HDM region (Fig. 4d), which provided substantial food resources for rodents^5,7. We argue that these conditions might together provide new ecological and evolutionary opportunities for voles, leading to the formation of new species.

Ancient genetic admixture fueled the rapid diversification of voles in the HDM

In addition to environmental factors, speciation can be promoted by biotic factors, such as key traits that allow the exploitation of new niches or gene flow between divergent lineages, leading to faster responses to environmental changes^8,9,43. We noticed that the species richness of different vole genera in the HDM was highly asymmetrical: Neodon (15 species) and Eothenomys (17 species) comprised ~85% of the total vole species in the HDM region, while the other three HDM genera (Volemys, Proedromys, and Caryomys) accounted for only 15%. Why do Neodon and Eothenomys have more species than other HDM genera? Because high-elevation adaptation is a common trait of all HDM voles underlying their continued success in occupying new mountain niches, the high species richness of Neodon and Eothenomys might be related to gene flow. We wish to unravel the gene flow pattern among vole species of the HDM region to test this hypothesis.

Because the HDM voles are all placed in Myodini and Microtini, we hypothesize that intertribal gene flow occurs between the two tribes. In addition, both Myodini and Microtini contain genera that are endemic to the HDM (HDM genera) and genera distributed outside of the HDM (non-HDM genera). If gene flow plays an important role in the rapid diversification of HDM voles, we expected to observe stronger intertribal gene flow in the HDM genera than in the non-HDM genera. Following this idea, we investigated our species-level nuclear dataset for signals of intertribal gene flow by using Patterson’s D test. We performed two separate tests: (A) using Microtini as P₃ group, HDM genera of Myodini as P₁ group and non-HDM genera of Myodini as P₂ group; and (B) using Myodini as P₃ group, HDM genera of Microtini as P₁ group and non-HDM genera of Microtini as P₂ group.

Patterson’s D tests were performed on all possible permutations of three species belonging to each of the three test groups using Mesocricetus auratus as an outgroup, and the gene flow pattern was interpreted from the number of species permutations in which significant gene flow was detected. In test A, there were a total of 3078 species permutations (Supplementary Table S4), and gene flow was detected in 62.7% of the species permutations (Fig. 5a), showing frequent intertribal gene flow. However, the frequency of gene flow between the HDM genera of Myodini and Microtini and that between the non-HDM genera of Myodini and Microtini were quite different, where the former was ~3.5 times higher than the latter (Fig. 5a). This difference in frequency remained stable when we used only the non-HDM genera or the HDM genera of Microtini as the P₃ group (Supplementary Fig. S11). Notably, for the two HDM genera of Myodini, when only the species-rich Eothenomys was used as the P₁ group, the higher gene flow frequency between P₁ and P₃ remained; however, when only species-poor Caryomys was used as the P₁ group, the higher gene flow frequency between P₁ and P₃ disappeared (Fig. 5a). These results suggest that the HDM genera of Myodini show more genetic exchange with Microtini voles than the non-HDM genera of Myodini and that species-rich Eothenomys presents more frequent intertribal gene flow than species-poor Caryomys. Similarly, in test B, when Myodini was used as P₃, and the HDM genera and non-HDM genera of Microtini were used as P₁ and P₂, higher intertribal gene flow frequency in the HDM genera was observed again, and species-rich Neodon presented more frequent intertribal gene flow than species-poor Volemys and Proedromys (Fig. 5b). Because Myodini and Microtini diverged 11–12 million years ago (Fig. 3), the observed intertribal gene flows reflected a kind of ancient genetic admixture among voles.

**Fig. 5: Intertribal gene flow pattern between Microtini and Myodini.**

Based on the observed gene flow patterns, we argue that, to some extent, ancient genetic admixture of different vole lineages increased the rapid diversification of voles in the HDM, which is in line with an admixture variation speciation scenario⁹. This hypothesis explains why Neodon and Eothenomys have many more species than other HDM genera, although all HDM voles are well suited for mountain habitats. According to admixture variation theory, genetic admixture can instantaneously generate novel genetic combinations from standing genetic variation, facilitating subsequent rapid radiation^8,9. Compared with standing genetic variation, which gradually arises from new mutations, the admixture of ancient genetic variation generates a genetic variation pool for selection much more rapidly^9,44. Large amounts of genetic variation increase the potential for phenotypic evolution and extrinsic reproductive isolation, thereby increasing the propensity for ecological speciation given new ecological opportunities^9,45,46. The role of admixture variation in boosting rapid speciation has previously been acknowledged in many vertebrate groups, such as cichlid fishes^43,47, narrow-mouthed frogs⁴⁸, and Darwin’s finches^49,50. The voles of the HDM might likewise have benefitted from such a genetic combinatorial mechanism during their speciation. On the one hand, frequent gene flow provides highly diverse genetic substrates for the speciation of voles in the HDM; on the other hand, the extreme ruggedness of the terrain and the remarkable environmental heterogeneity of the HDM provide an excellent arena in which these admixed genetic materials may give rise to new species. Our results provide a valuable mammalian case supporting the admixture variation speciation theory.

Conclusions

Using whole-exome capture sequencing, we generated over 6,000 nuclear protein-coding genes per sample and 117 new mitochondrial genomes for 115 Arvicolinae and 6 Cricetinae samples, covering ~100% of the known vole species in the HDM. Based on these data, we produced a robust time-calibrated phylogenetic hypothesis for the voles of the HDM. Our tree is currently the most comprehensive tree available in terms of phylogenetic diversity, taxa, and the number of genes. We found that North American Ondatrini is the sister group of all other arvicolids, implying that Arvicolinae might have originated in North America and later spread to Eurasia. The voles of the HDM comprise two major lineages: the western lineage (Volemys, Proedromys, and Neodon) and the eastern lineage (Caryomys and Eothenomys). These two vole lineages likely originated from two independent migration events from North Eurasia to the HDM in the late Miocene. The main speciation mode of the HDM voles is in situ diversification. Both the western and eastern vole lineages of the HDM experienced accelerated diversification from 8–4 Mya and have exhibited remarkable synchrony over time. This diversification tempo was in accordance with the rapid mountain uplift of the HDM from the late Miocene to the late Pliocene, showing that the orogenic activity of the HDM played an important role in driving the diversification of the HDM voles. In addition, we identified strong intertribal gene flow between the two lineages of HDM voles, which suggests that ancient genetic admixture might also have fueled the rapid diversification of voles in the HDM. In summary, our findings reveal the evolutionary history and diversification process of voles in the HDM and contribute to a better understanding of how environmental and biotic factors work together to shape the biodiversity of mountain ecosystems.

Materials and methods

Taxon sampling and data collection

We conducted multiple field survey and collected 115 vole and lemming specimens (~100% of the known Arvicolinae species in the HDM region^33,51) at 55 different localities, representing 7 tribes, 16 genera, and 57 species. Six Cricetulus samples were collected as outgroup taxa. All rodent samples were captured by using rat traps following the American Society of Mammologist guidelines and the laws and regulations of China for the implementation of the protection of terrestrial wild animals^52,53. Hypoxia is used as the method of euthanasia of rodent samples in field. Collection permits were approved by Sichuan Forestry Department. Collecting protocols were approved by the Ethics Committee of the Sichuan Academy of Forestry. All specimens were deposited at the Sichuan Academy of Forestry. Detailed information on these samples, such as the collection locality, latitude, longitude, and altitude, is given in Supplementary Table S1.

For each sample, total genomic DNA was extracted from ethanol-preserved tissues (liver, muscle or fur) using a TIANamp Genomic DNA kit (TIANGEN Inc., Beijing, China). Then, 200 ng of genomic DNA was randomly fragmented to sizes of 200–400 bp with NEBNext dsDNA fragmentase (New England Biolabs) and used for DNA library construction with the NEBNext Illumina DNA Library Prep Kit (New England Biolabs). The SureSelect^xt Mouse All Exon Kit (Agilent Technologies) was used to capture the coding sequences of each sample. The hybridization enrichment experiments were performed using the SureSelect^xt Reagent Kit (G9611A) following the manufacturer’s protocol. After enrichment, the hybridized libraries were captured with Dynabeads MyOne Streptavidin C1 magnetic beads (Invitrogen) and amplified with index primers. The indexed captured libraries were pooled in equal concentrations and sequenced on an Illumina HiSeqX10 sequencer.

Data processing and dataset construction

We applied a “read mapping” strategy similar to that used by Wang et al. ⁵⁴. to process our sequence capture data. Briefly, for each sample, the sequence reads were aligned to the reference coding DNA sequences (CDSs) of Mesocricetus auratus (24,400 CDSs) using BWA⁵⁵ under a stringent setting of -B 6. The alignments were converted to BAM files with SAMtools⁵⁶, and consensus sequences were generated with BCFtools⁵⁷. In this way, we could obtain 24,400 orthologous CDS sequences for each sample. Each CDS orthologous group was aligned using MUSCLE v.3.8⁵⁸ based on codons. From these CDS alignments, we constructed an individual-level nuclear dataset using the following filtering criteria: (i) the percentage of ambiguous bases per CDS must be <0.1%; (ii) the percentage of missing data per CDS must be <60%; (iii) at least 95% of samples must be included per CDS. 6517 CDS alignments passed these criteria and were concatenated. To reduce missing data, we also constructed a species-level nuclear dataset. Based on the data source of the original 24,400 orthologous CDSs for the 122 samples, each orthologous CDS data of all individuals from every nominal species was merged together. Thus, we could obtain 24,400 orthologous CDSs for 61 species. The CDS alignments were generated as described above. The filtering criteria for the species-level nuclear dataset were as follows: (i) the percentage of ambiguous bases per CDS must be <0.1%; (ii) the percentage of missing data per CDS must be <40%; and (iii) 100% of samples must be included for each CDS. Finally, 6078 CDS alignments passed these criteria and were concatenated.

We also constructed a mitochondrial genome dataset. For this purpose, we extracted mitochondrial genomes from our sequencing data for each sample by using MitoZ⁵⁹. To include more Arvicolinae taxa, we searched the GenBank database to collect all published mitochondrial genome sequences of Arvicolinae (stopping in December 2021). When there were more than two mitochondrial genomes for the same species, we retained only representative sequences from different studies in the case of taxonomic revision. Finally, 98 mitochondrial genomes were downloaded and retained for further analysis (their accession numbers are given in Supplementary Table S5). Each mitochondrial gene was aligned using MUSCLE v.3.8⁵⁹ based on either nucleotides or codons depending on its origin, and alignments were refined by GBlocks v.0.91b⁶⁰ with the default settings.

Phylogenetic analyses

Phylogenetic trees were reconstructed based on maximum likelihood inference using IQ-TREE version 2⁶¹. IQ-TREE was also used to select the best-fit models and partitioning schemes according to the Bayesian information criterion. For the individual-level nuclear dataset and the species-level nuclear dataset, the best partitioning scheme involved three separate partitions with separate GTR + F + R2 models for the first and second codon positions, GTR + F + R3 models for the third codon positions; for the mitochondrial genome dataset, the best partitioning scheme involved six separate partitions: all tRNAs combined and the first and second codon positions for all protein-coding genes with the GTR + F + R5 model; 12 S rRNA with TIM2 + F + R5; 16 S rRNA with TIM2 + F + R7; and the third codon positions for all protein-coding genes with TIM3 + F + R9. Branch support was estimated by using the ultrafast bootstrap algorithm (UFBoot) embedded in IQ-TREE with 10,000 replicates. The nuclear and mitochondrial genome datasets and detailed IQTREE running results were deposited in the Mendeley Data Repository (https://data.mendeley.com/datasets/mwyj4m963h).

For the individual-level nuclear dataset and species-level nuclear dataset, we also used coalescence-based ASTRAL-II⁶² to estimate the species tree. We used a data binning strategy to improve multispecies coalescent analyses for handling gene trees with weak phylogenetic signals^63,64. The CDSs of the individual-level nuclear dataset and the species-level nuclear dataset were divided into 200 bins of similar sizes according to their evolutionary rates. Each bin included ~200–300 CDSs. For each data bin, 200 ML bootstrap trees and the final best-scoring ML tree were estimated using RAxML⁶⁵ under the GTR + GAMMA model. These best-scoring ML trees and bootstrapping trees were used as input files for ASTRAL with the option “–i –b” to calculate the final species tree and branch support.

To quantify the genealogical concordance between the resulting trees and the data, we calculated two concordance factors⁶⁶ by using IQ-TREE. The gene concordance factor (gCF) indicates the percentage of decisive gene trees containing a branch, and the site concordance factor (sCF) indicates the percentage of decisive sites supporting a branch in the reference tree. The individual-level nuclear dataset and the species-level nuclear dataset were tested and divided into 200 bins as mentioned above. The concatenated IQ-TREE ML trees were used as reference trees.

Divergence time analyses

Molecular time estimation was performed based on the mitochondrial genome dataset and the species-level nuclear dataset using MCMCTree in the PAML package⁶⁷. The aim of the mitochondrial analysis was only to provide a time framework for the global Arvicolinae, so we compressed the mitochondrial genome dataset to the genus level, retaining only one or two sequences per genus. Nine calibration points were used, which are presented in Supplementary Table S6. The Arvicolinae-Cricetinae split was constrained at 18–48 Ma. The minimum age of Ondatrini, Lagurini, Ellobiusini, Arvicolini and Lemmini were set to 4.13, 2.5, 2.6, 3.6, and 3.2 Ma. The minimum and maximum boundaries of Eothenomys, Myodes and Volemys were constrained at 2.7–8.1, 3.6–6.08, and 5.3–12.2 Ma. For the minimum and maximum boundaries of all calibration points, the default 2.5% tail probability in MCMCTree was used (soft bound strategy). The root age prior was set to 31 Ma. The mitochondrial dataset was divided into six partitions, and the nuclear dataset was divided into 20 gene bins according to gene evolution rates using a partitioning strategy similar to that applied in the phylogenetic analyses. ML estimates of the branch lengths for each partition were obtained by using the BASEML programs (in PAML) under the GTR + G model. rgene_gamma (overall substitution rate) was set as G (1, 0.27975) or G (1, 24.7497) for the mitochondrial genome dataset or the species-level nuclear dataset, respectively. sigma2_gamma (rate-drift parameter) was set as G (1, 4.5, 1). The independent rate model (clock = 2) was used to specify the prior rate change across the tree. Two independent MCMC runs were conducted. In each run, the first 10 million iterations were discarded as burn-in, after which sampling was conducted every 150 iterations until 100,000 samples were collected. The stationary state and convergence of each run were checked in Tracer⁶⁸.

Biogeographic analyses

Ancestral range reconstruction was performed by using BioGeoBEARS⁶⁹. First, based on the global distribution pattern of Arvicolinae, we defined three large geographic areas, North America, North Eurasia and South Eurasia, to estimate the biogeographic history of Arvicolinae using the genus-level time tree of the mitochondrial genome dataset. Second, the distribution of the HDM voles and their Eurasian relatives was further subdivided into four geographic regions (Arid and Semiarid Areas, Monsoon Areas, Qinghai-Tibet Plateau and Hengduan Mountains) to estimate the ancestral range of the HDM voles using the time tree of the species-level nuclear dataset. Two models (dispersal-extinction cladogenesis and dispersal-vicariance analysis) and an additional J (long-distance jumping) parameter were tested. In addition, we performed ancestral elevation reconstruction based on the species elevation information and the time tree of the species-level nuclear dataset using the maximum likelihood approach in Phytools⁷⁰.

To infer the speciation mode of vole species in the HDM region, we defined three distribution types: found only in HDMs, found outside of HDMs, or found in both regions. Ancestral range reconstruction analysis was performed on the species-level nuclear dataset using the DEC + J model. Based on the ancestral range estimation result, the biogeographic events of voles in the HDM region were categorized into two types, in situ diversification and colonization. The identification of biogeographic events largely followed the criteria of Xu et al. ¹³. In brief, in an in situ diversification event, the ancestral node and its descendant node are both distributed in the HDM, whereas in a colonization event, the ancestral node is located outside the HDM but its descendant node is distributed in the HDM. Finally, all in situ diversification and colonization events were sliced to calculate their rates over time, defined as the maximal number of colonization events per 0.1 million years and the maximal number of in situ diversification events per 0.1 million years. These rates were calculated by summing potential colonization or in situ diversification events over time using a sliding window of 0.1 Ma based on the divergence time credibility intervals estimated from the species-level nuclear dataset.

Gene flow analyses

We used Patterson’s D test to estimate gene flow between the vole species within the HDM and outside of the HDM. According to the obtained phylogenetic tree, we performed two gene flow tests between Myodini and Microtini. The D test requires a four-taxon phylogeny (((P₁, P₂), P₃), O). In the first test, the two HDM genera of tribe Myodini (Caryomys and Eothenomys) were considered as P₁, the remaining genera of tribe Myodini distributed outside of the HDM (Alticola, Craseomys and Myodes) were considered as P₂, and all species of tribe Microtini were considered as P₃. In the second test, the three HDM genera of tribe Microtini Neodon, Proedromys and Volemys were considered as P₁, the remaining genera of tribe Microtini (Alexandromys, Lasiopodnmys, and Microtus) were considered as P₂, and all species of tribe Myodini were considered as P₃. Mesocricetus auratus was used as the outgroup in all tests. For every four-taxon phylogeny in each test, we applied the D test exhaustively for all combinations of four species belonging to each of the four groups. For this purpose, we used a custom Python script to extract subsets from the species-level nuclear dataset. These sequence subsets were used by the program D_FOIL⁷¹ to generate an AB-pattern count file and then calculate D statistics by using the default significance cutoff of 0.01.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The raw Illumina sequencing data generated in this paper can be downloaded from the NCBI Sequence Read Archive under the BioProject Accession Number PRJNA820500. The extracted CDS sequences for each sample were deposited in the Mendeley Data Repository (Mendeley Data, V1, doi: 10.17632/mwyj4m963h.1).

References

Rahbek, C. et al. Humboldt’s enigma: what causes global patterns of mountain biodiversity? Science 365, 1108–1113 (2019).
Article PubMed CAS Google Scholar
Spehn, E. M., Rudmann-Maurer, K. & Körner, C. Mountain biodiversity. Plant Ecol. Divers. 4, 301–302 (2012).
Article Google Scholar
Hoorn, C., Mosbrugger, V., Mulch, A. & Antonelli, A. Biodiversity from mountain building. Nat. Geosci. 6, 154–154 (2013).
Article CAS Google Scholar
Change, C. et al. Amazonia through time: andean uplift, climate change, landscape evolution, and biodiversity. Science 330, 927–932 (2010).
Article Google Scholar
Antonelli, A. et al. Geological and climatic influences on mountain biodiversity. Nat. Geosci. 11, 718–725 (2018).
Article CAS Google Scholar
Xing, Y. & Ree, R. H. Uplift-driven diversification in the Hengduan Mountains, a temperate biodiversity hotspot. Proc. Natl Acad. Sci. USA 114, E3444–E3451 (2017).
Article PubMed PubMed Central CAS Google Scholar
Ding, W. N., Ree, R. H., Spicer, R. A. & Xing, Y. W. Ancient orogenic and monsoon-driven assembly of the world’s richest temperate alpine flora. Science 369, 578–581 (2020).
Article PubMed CAS Google Scholar
Seehausen, O. et al. Genomics and the origin of species. Nat. Rev. Genet. 15, 176–192 (2014).
Article PubMed CAS Google Scholar
Marques, D. A., Meier, J. I. & Seehausen, O. A. Combinatorial view on speciation and adaptive radiation. Trends Ecol. Evol. 34, 531–544 (2019).
Article PubMed Google Scholar
Boufford, D. E. Biodiversity hotspot: China’s Hengduan Mountains. Arnoldia 72, 24–35 (2014).
Mi, X. et al. The global significance of biodiversity science in China: an overview. Natl Sci. Rev. 8, nwab032 (2021).
Article PubMed PubMed Central Google Scholar
Ye, X. Y. et al. Rapid diversification of alpine bamboos associated with the uplift of the Hengduan Mountains. J. Biogeogr. 46, 2678–2689 (2019).
Article Google Scholar
Xu, W. et al. Herpetological phylogeographic analyses support a Miocene focal point of Himalayan uplift and biological diversification. Natl Sci. Rev. 8, nwaa263 (2021).
Article PubMed Google Scholar
Liu, Y. et al. Sino-Himalayan mountains act as cradles of diversity and immigration centres in the diversification of parrotbills (Paradoxornithidae). J. Biogeogr. 43, 1488–1501 (2016).
Article Google Scholar
Wu, Y., DuBay, S. G., Colwell, R. K., Ran, J. & Lei, F. Mobile hotspots and refugia of avian diversity in the mountains of south-west China under past and contemporary global climate change. J. Biogeogr. 44, 615–626 (2017).
Article Google Scholar
Cai, T. et al. The role of evolutionary time, diversification rates and dispersal in determining the global diversity of a large radiation of passerine birds. J. Biogeogr. 47, 13823 (2020).
Article Google Scholar
Musser, G. G., Carleton, M. D., Wilson, D. E. & Reeder, D. M. Mammal species of the world: a taxonomic and geographic reference. Johns. Ed. Wilson, Reeder. Dm. Balt. 894, 1531 (2005).
Google Scholar
Wilson, D. E., Lacher, T. E. & Mittermeier, R. A. Handbook of the Mammals of the World: Vol. 7: Rodents II (Lynx Edicions, 2017).
Liu, S., Sun, Z., Zeng, Z. & Zhao, E. A new vole (Cricetidae: Arvicolinae: Proedromys) from the Liangshan mountains of Sichuan province, China. J. Mammal. 88, 1170–1178 (2007).
Article Google Scholar
Liu, S. et al. Phylogeny of oriental voles (Rodentia: Muridae: Arvicolinae): molecular and morphological evidence. Zool. Sci. 29, 610–622 (2012).
Article Google Scholar
Liu, S. et al. Molecular phylogeny and taxonomy of subgenus Eothenomys (Cricetidae: Arvicolinae: Eothenomys) with the description of four new species from Sichuan, China. Zool. J. Linn. Soc. 186, 569–598 (2019).
Article Google Scholar
Martínková, N. & Moravec, J. Multilocus phylogeny of arvicoline voles (Arvicolini, Rodentia) shows small tree terrace size. Folia Zool. 61, 254–267 (2012).
Article Google Scholar
Fabre, P. H., Hautier, L., Dimitrov, D. & Douzery, E. J. P. A glimpse on the pattern of rodent diversification: a phylogenetic approach. BMC Evol. Biol. 12, 1–19 (2012).
Article Google Scholar
Abramson, N. I., Bodrov, S. Y., Bondareva, O. V., Genelt-Yanovskiy, E. A. & Petrova, T. V. A mitochondrial genome phylogeny of voles and lemmings (Rodentia: Arvicolinae): Evolutionary and taxonomic implications. PLoS One 16, e0248198 (2021).
Article PubMed PubMed Central CAS Google Scholar
Fritz, S. A., Bininda‐Emonds, O. R. P. & Purvis, A. Geographical variation in predictors of mammalian extinction risk: big is bad, but only in the tropics. Ecol. Lett. 12, 538–549 (2009).
Article PubMed Google Scholar
Pisano, J. et al. Out of Himalaya: the impact of past Asian environmental changes on the evolutionary and biogeographical history of Dipodoidea (Rodentia). J. Biogeogr. 42, 856–870 (2015).
Article Google Scholar
Álvarez-Carretero, S. et al. A species-level timeline of mammal evolution integrating phylogenomic data. Nature 602, 263–267 (2022).
Article PubMed Google Scholar
Steppan, S. J. & Schenk, J. J. Muroid rodent phylogenetics: 900-species tree reveals increasing diversification rates. PLoS One 12, e0183070 (2017).
Article PubMed PubMed Central Google Scholar
Galewski, T. et al. The evolutionary radiation of Arvicolinae rodents (voles and lemmings): Relative contribution of nuclear and mitochondrial DNA phylogenies. BMC Evol. Biol. 6, 1–17 (2006).
Article Google Scholar
Robovský, J., Řičánková, V. & Zrzavý, J. Phylogeny of Arvicolinae (Mammalia, Cricetidae): utility of morphological and molecular data sets in a recently radiating clade. Zool. Scr. 37, 571–590 (2008).
Article Google Scholar
Abramson, N. I., Lebedev, V. S., Tesakov, A. S. & Bannikova, A. A. Supraspecies relationships in the subfamily Arvicolinae (rodentia, cricetidae): an unexpected result of nuclear gene analysis. Mol. Biol. 43, 834–846 (2009).
Article CAS Google Scholar
Liu, S. et al. Taxonomic position of Chinese voles of the tribe Arvicolini and the description of 2 new species from Xizang, China. J. Mammal. 98, 166–182 (2017).
PubMed Google Scholar
Liu, S., Jin, W. & Tang, M. Review on the taxonomy of Microtini (Arvicolinae: Cricetidae) with a catalogue of species occurring in China. ACTA Theriol. Sin. 40, 290–301 (2020).
Google Scholar
Buzan, E. V., Krystufek, B., HÄNFLING, B. & Hutchinson, W. F. Mitochondrial phylogeny of Arvicolinae using comprehensive taxonomic sampling yields new insights. Biol. J. Linn. Soc. 94, 825–835 (2008).
Article Google Scholar
Martin, R. D. 28. Arvicolidae. Evol. Tert. Mamm. North Am. 2, 480–498 (2008).
Article Google Scholar
Fejfar, O., Heinrich, W. D., Kordos, L. & Maul, L. C. Microtoid cricetids and the Early history of arvicolids (Mammalia, Rodentia). Palaeontol. Electron. 14, 12 (2011).
Google Scholar
He, J., Lin, S., Ding, C., Yu, J. & Jiang, H. Geological and climatic histories likely shaped the origins of terrestrial vertebrates endemic to the Tibetan Plateau. Glob. Ecol. Biogeogr. 30, 1116–1128 (2021).
Article Google Scholar
Favre, A. et al. The role of the uplift of the Qinghai-Tibetan Plateau for the evolution of Tibetan biotas. Biol. Rev. 90, 236–253 (2015).
Article PubMed Google Scholar
Sun, B. N. et al. Reconstructing Neogene vegetation and climates to infer tectonic uplift in western Yunnan, China. Palaeogeogr. Palaeoclimatol. Palaeoecol. 304, 328–336 (2011).
Article Google Scholar
Wang, Y. et al. Cenozoic uplift of the Tibetan Plateau: evidence from the tectonic–sedimentary evolution of the western Qaidam Basin. Geosci. Front. 3, 175–187 (2012).
Article Google Scholar
Su, T. et al. No high Tibetan Plateau until the Neogene. Sci. Adv. 5, eaav2189 (2019).
Article PubMed PubMed Central CAS Google Scholar
Payne, N. L. & Smith, J. A. An alternative explanation for global trends in thermal tolerance. Ecol. Lett. 20, 70–77 (2017).
Article PubMed Google Scholar
Irisarri, I. et al. Phylogenomics uncovers early hybridization and adaptive loci shaping the radiation of Lake Tanganyika cichlid fishes. Nat. Commun. 9, 1–12 (2018).
Article CAS Google Scholar
Abbott, R. et al. Hybridization and speciation. J. Evol. Biol. 26, 229–246 (2013).
Article PubMed CAS Google Scholar
Orr, H. A. The population genetics of adaptation: the distribution of factors fixed during adaptive evolution. Evolution 52, 935–949 (1998).
Article PubMed Google Scholar
Selz, O. M., Thommen, R., Maan, M. E. & Seehausen, O. Behavioural isolation may facilitate homoploid hybrid speciation in cichlid fish. J. Evol. Biol. 27, 275–289 (2014).
Article PubMed CAS Google Scholar
Meier, J. I., Marques, D. A., Wagner, C. E., Excoffier, L. & Seehausen, O. Genomics of parallel ecological speciation in Lake Victoria cichlids. Mol. Biol. Evol. 35, 1489–1506 (2018).
Article PubMed CAS Google Scholar
Alexander, A. M. et al. Genomic data reveals potential for hybridization, introgression, and incomplete lineage sorting to confound phylogenetic relationships in an adaptive radiation of narrow‐mouth frogs. Evolution 71, 475–488 (2017).
Article PubMed Google Scholar
Lamichhaney, S. et al. Evolution of Darwin’s finches and their beaks revealed by genome sequencing. Nature 518, 371–375 (2015).
Article PubMed CAS Google Scholar
Lamichhaney, S. et al. Rapid hybrid speciation in Darwin’s finches. Science 359, 224–228 (2018).
Article PubMed CAS Google Scholar
Tang, M. et al. A summary of phylogenetic systematics studies of Myodini in China (Rodentia: Cricetidae: Arvicolinae). ACTA Theriol. Sin. 41, 71–81 (2021).
Google Scholar
Sikes, R. S. & Gannon, W. L. & Animal Care and Use Committee of the the American Society of Mammalogists. Guidelines of the American Society of Mammalogists for the use of wild mammals in research. J. Mammal. 92, 235–253 (2011).
State Council Decree. Wildlife Protective Enforcement Regulation of The People’s Republic of China (Environmental Investigation Agency, 1992).
Wang, X. Y. et al. Out of tibet: genomic perspectives on the evolutionary history of extant pikas. Mol. Biol. Evol. 37, 1577–1592 (2020).
Article PubMed CAS Google Scholar
Li, R. et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res 20, 265–272 (2010).
Article PubMed PubMed Central CAS Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central Google Scholar
Li, H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27, 2987–2993 (2011).
Article PubMed PubMed Central CAS Google Scholar
Edgar, R. C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32, 1792–1797 (2004).
Article PubMed PubMed Central CAS Google Scholar
Meng, G., Li, Y., Yang, C. & Liu, S. MitoZ: A toolkit for animal mitochondrial genome assembly, annotation and visualization. Nucleic Acids Res 47, e63–e63 (2019).
Article PubMed PubMed Central CAS Google Scholar
Castresana, J. Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol. 17, 540–552 (2000).
Article PubMed CAS Google Scholar
Minh, B. Q. et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic Era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Article PubMed PubMed Central CAS Google Scholar
Mirarab, S. et al. ASTRAL: genome-scale coalescent-based species tree estimation. Bioinformatics 30, i541–i548 (2014).
Article PubMed PubMed Central CAS Google Scholar
Jarvis, E. D. et al. Whole-genome analyses resolve early branches in the tree of life of modern birds. Science 346, 1320–1331 (2014).
Article PubMed PubMed Central CAS Google Scholar
Mirarab, S., Bayzid, M. S., Boussau, B. & Warnow, T. Statistical binning enables an accurate coalescent-based estimation of the avian tree. Science 346, 1250463 (2014).
Article PubMed Google Scholar
Stamatakis, A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30, 1312–1313 (2014).
Article PubMed PubMed Central CAS Google Scholar
Minh, B. Q., Hahn, M. W. & Lanfear, R. New methods to calculate concordance factors for phylogenomic datasets. Mol. Biol. Evol. 37, 2727–2733 (2020).
Article PubMed PubMed Central CAS Google Scholar
Yang, Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol. Biol. Evol. 24, 1586–1591 (2007).
Article PubMed CAS Google Scholar
Rambaut, A., Drummond, A. J. & Suchard, M. Tracer v1. 6 http://beast.bio.ed.ac.uk (2007).
Matzke, N. J. BioGeoBEARS: BioGeography with Bayesian (and Likelihood) Evolutionary Analysis in R Scripts. http://phylo.wikidot.com/biogeobears (2013).
Revell, L. J. Phytools: An R package for phylogenetic comparative biology (and other things). Methods Ecol. Evol. 3, 217–223 (2012).
Article Google Scholar
Pease, J. B. & Hahn, M. W. Detection and polarization of introgression in a five-taxon phylogeny. Syst. Biol. 64, 651–662 (2015).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We thank all our lab members for help with the experiments and data analyses. Dr. Li Yang at Sun Yat-Sen University provided assistance in the species richness analysis. This work was supported by the National Natural Science Foundation of China (32170449, 32071611, 31970399, and 31470110) and the China Postdoctoral Science Foundation (2020TQ0386 and 2020M683040).

Author information

Authors and Affiliations

State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
XiaoYun Wang, Dan Liang & Peng Zhang
Sichuan Academy of Forestry, Chengdu, China
XuMing Wang, MingKun Tang, Yang Liu & ShaoYing Liu
Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai, Guangdong Province, China
Peng Zhang

Authors

XiaoYun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dan Liang
View author publications
You can also search for this author in PubMed Google Scholar
XuMing Wang
View author publications
You can also search for this author in PubMed Google Scholar
MingKun Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yang Liu
View author publications
You can also search for this author in PubMed Google Scholar
ShaoYing Liu
View author publications
You can also search for this author in PubMed Google Scholar
Peng Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.Z., D.L., and S.Y.L. designed the research. S.Y.L., X.M.W., M.K.T. and Y.L. carried out taxon sampling and collection. X.Y.W. performed the DNA sequencing with the help of D.L. X.Y.W. and P.Z. analyzed the data. P.Z., X.Y.W. and S.Y.L. wrote the paper.

Corresponding authors

Correspondence to ShaoYing Liu or Peng Zhang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Biology thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editor: Luke R. Grinham.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Materials

Description of Additional Supplementary Data

Supplementary Tables

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, X., Liang, D., Wang, X. et al. Phylogenomics reveals the evolution, biogeography, and diversification history of voles in the Hengduan Mountains. Commun Biol 5, 1124 (2022). https://doi.org/10.1038/s42003-022-04108-y

Download citation

Received: 07 June 2022
Accepted: 12 October 2022
Published: 25 October 2022
DOI: https://doi.org/10.1038/s42003-022-04108-y

This article is cited by

Past climate cooling and orogenesis of the Hengduan Mountains have influenced the evolution of Impatiens sect. Impatiens (Balsaminaceae) in the Northern Hemisphere
- Fei Qin
- Tiantian Xue
- Shengxiang Yu
BMC Plant Biology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.