Studies along elevational gradients worldwide usually find the highest plant taxa richness in mid-elevation forest belts. Hence, an increase in upper elevation diversity is expected in the course of warming-related treeline rise. Here, we use a time-series approach to infer past taxa richness from sedimentary ancient DNA from the south-eastern Tibetan Plateau over the last ~18,000 years. We find the highest total plant taxa richness during the cool phase after glacier retreat when the area contained extensive and diverse alpine habitats (14–10 ka); followed by a decline when forests expanded during the warm early- to mid-Holocene (10–3.6 ka). Livestock grazing since 3.6 ka promoted plant taxa richness only weakly. Based on these inferred dependencies, our simulation yields a substantive decrease in plant taxa richness in response to warming-related alpine habitat loss over the next centuries. Accordingly, efforts of Tibetan biodiversity conservation should include conclusions from palaeoecological evidence.
Global change affects mountain biodiversity and alters ecosystem functioning, eventually threatening the provision of ecosystem services to human society1. Whether ongoing glacier retreat, treeline rise, and land-use intensification will result in an increase or decrease of high-elevation plant richness is heavily debated2,3,4,5,6. Studies of elevational transects locate the highest plant richness at intermediate elevations, which are mostly found within the forest belt in temperate regions. For example, plant diversity peaks at ~3,600 m a.s.l7. (metres above sea level) on the south-eastern Tibetan Plateau (Hengduan Mountains, Fig. 1a, red dotted line) which harbours one third of the vascular plant flora of China8. However, it is unclear whether this widely observed hump-shaped diversity pattern represents a sampling effect (because mid-elevations are often preferentially sampled)9, an area effect (because mid-elevation areas are mostly overrepresented and thus can support higher plant diversity)9, a mid-domain effect (because species distributions overlap mainly at the geometric centre even without environmental gradients)10, or whether it is really the mild temperatures and/or ecotone effect of forests that support a high richness9. Hence, the question remains as to whether plant diversity at high elevations will increase with temperature-driven treeline advance, as would be expected when simply projecting an upward movement of vegetation belts.
Alpine plants, which typically include many endemic taxa, are threatened by habitat loss when treelines rise11 and are therefore a focus of conservation considerations. It is uncertain whether their preferred habitats12 or a diversity of habitats should be conserved to protect richness13. It is also unclear whether landscape diversity resulting from extensive and traditional land use can, to some extent, compensate for climate-change effects on alpine plant richness14.
These uncertainties mainly originate from a lack of long-term biodiversity records. Most projected mountain diversity changes are based on knowledge obtained from samples taken across a spatial extent and include various artefacts15. The advantages of a time-series approach over the traditional space-for-time approach are that the sampled site is constant (i.e., normalizing for the sampling effect), that sampling elevation always represents the same portion of the investigated mountain range (i.e., normalizing for the area effect) and is always placed at the same relative elevation (i.e., normalizing for the mid-domain effect). Hence, such an approach can well reflect the temporal biodiversity-environment relationship and as such increase the effects of relevant variables when predicting biodiversity change over time16. Mountain lake sediments are historical archives of ecological change, but classical vegetation proxies, such as pollen or macrofossils, are not suitable indicators for plant diversity change17. However, methodological advances in sedimentary ancient DNA (sedaDNA) metabarcoding now allow an assessment of biodiversity at higher taxonomic resolution than traditional approaches through time18.
Here, we reconstruct the taxonomic richness of plants in the catchment area of Lake Naleng (Hengduan Mountains, south-eastern margin of the Tibetan Plateau; Fig. 1a) over the past ~18,000 years. We apply a sedaDNA metabarcoding approach with general plant primers (Methods) to 72 horizons from a lake sediment core. We consider temperature, habitable area, forest shifts, and human impact as potential drivers of changes in plant taxa richness. In particular, we investigate whether richness increases with forest invasion into alpine habitats, recapitulating the modern elevational richness gradient. Finally, we make inferences for plant taxa richness under future climate change by applying the inferred long-term diversity-environmental relationships and provide suggestions for a future plant diversity conservation planning. We find that the total plant taxa richness is highest during late glacial times, when the area was dominated by alpine meadows, and lowest during the early Holocene when forest extent was at its maximum. By analogy to the past, we infer that total plant-taxa richness could decrease in the future due to a warming-induced upward movement of the treeline.
Results and discussion
Plant DNA and taxa richness changes
After bioinformatic filtering of raw sequencing output (Methods), 6,021,603 sequence counts were obtained from 138 PCR (polymerase chain reaction) replicates of 71 investigated sediment horizons. They were assigned to 218 terrestrial seed plant taxa with 100% best identity (Supplementary Data 1). A few PCR replicates, mainly from the late-glacial period, had to be excluded from further analyses because they were without read counts (Supplementary Fig. 1a). Apart from 5 outliers with high read numbers, PCR replicates yielded read counts of a similar order of magnitude (Supplementary Fig. 1b). We found no correlation between read counts and total plant taxa richness (df = 69, rho = 0.014, p = 0.908). From this we conclude that read count has no impact on the inferred diversity signal. Also, neither variations in plant taxa richness nor compositional signals differed between results from single PCR samples or from pooled-PCR samples (Supplementary Fig. 1c, d, Supplementary Table 1 and Supplementary Table 2). Accordingly, we assume that the variations of plant taxa richness over time can be reliably tracked by pooling results from PCR replicates of one horizon.
Overall the sedaDNA record reproduces the compositional vegetation changes (Fig. 2k) inferred from pollen data19,20 (Supplementary Fig. 2) and pollen-based vegetation change agrees with other pollen records from the Tibetan Plateau:21,22 alpine steppe dominated 18–14 ka, alpine meadow 14–10 ka, open Picea forest 10–3.6 ka and alpine meadow after 3.6 ka with the presence of typical land-use indicators such as Sanguisorba officinalis (Fig. 2i, grey area)20 and Nardostachys jatamansi (a traditional Tibetan medicinal plant23) (Fig. 2i, black line). The sedaDNA better captures the vegetation signals within the lake catchment than pollen as it is not impacted by upward plant material transport (Supplementary Fig. 2) and records more taxa at higher taxonomic resolution than pollen spectra (Supplementary Table 3). Accordingly, and because the lake catchment covers the most common elevations in the Hengduan Mountains (~4200 to ~4900 m a.s.l., Fig. 1a, b), we conclude that Lake Naleng archived the main signal of the south-eastern Tibetan alpine ecosystem. This reasoning aligns with a modern study that indicates a non-random vegetation composition in the alpine belt of the Hengduan Mountains and identified phylogenetic clustering of alpine plant taxa in connection with environmental filtering24.
Total plant taxa richness was relatively low before 14 ka, higher between 14 and 10 ka, low again between 10 and 3.6 ka and high after 3.6 ka (Fig. 2a). Similar trends were obtained for taxa richness within important alpine plant families (Fig. 2b–f). Proportional immigration of taxa dominated during 14–10 ka and after 3.6 ka compared to their respective previous time period (Supplementary Fig. 3, Methods). Considering the restricted entries in the taxonomic database (EMBL Nucleotide Database standard sequence release 12725, we redid the sedaDNA analyses using a 95% best identity threshold for taxa assignment which yielded 984 unique sequence types (Supplementary Data 2) indicating the possibility of additional plant taxa. Plant taxa richness patterns based on the 95% best identity are similar to those with 100% best identity (Supplementary Fig. 4), which provides confidence in our results. Furthermore, a similar temporal pattern of total plant taxa richness was obtained when analysing the data before taxonomic assignment and the data containing all terrestrial seed plant sequences (Methods), as indicated by the highly significant correlations of these time-series with total plant taxa richness (Supplementary Table 1). Sample processing-related errors (e.g., PCR and sequencing) may have slightly inflated taxa richness. However, we assume that we rather underestimate taxa richness because of non-specificity of the marker and non-completeness of the reference database, which likely means that the sedaDNA detected taxa number is lower than the absolute taxa number recorded in the flora. Additionally, species-rich families26 in the flora including Asteraceae, Saxifragaceae, and Orobanchaceae have highest richness in our record. Thus, plant taxa richness (total plant taxa richness and taxa richness within dominant alpine families) can be regarded as a semi-quantitative proxy of taxa richness in our study.
Taken together, our applied plant sedaDNA metabarcoding identifies more plant taxa at lower taxonomic level than any other palaeo-approach before, providing a first reliable record of relative plant taxa richness variation on a millennial timescale for the Tibetan Plateau.
Drivers of plant taxa richness changes
Total plant taxa richness shows contrasting correlations with multiple proxy-based temperature reconstructions (Fig. 2g, dotted line) for different periods of the record: a positive correlation for 18–10 ka (rho = 0.225) but negative correlations for 14–3.6 ka (rho = −0.728) and 10–0 ka (rho = −0.932; Table 1). This suggests that temperature is unlikely to be a direct driver of plant taxa richness but may instead trigger different environmental processes that lead to contrasting biodiversity-temperature correlations.
Our analyses reveal a weak positive but statistically non-significant correlation between total plant taxa richness and total habitat area (Fig. 2h, black line) in response to glacier retreat at the end of the last glacial in the catchment of Lake Naleng (rho = 0.257, alpha level = 0.25; Table 1). Thus, we find no evidence to support the idea that total plant taxa richness mainly depends on the available area. Underestimation of this dependency might be related to limitations in our glacier-extent modelling approach that uses a proxy-based temperature reconstruction averaged from the Northern Hemisphere rather than from our study area. Nevertheless, to our knowledge, our study is the first palaeo time-series approach that addresses the extensively debated relationship between taxa richness and area27. Aside from habitable area, further processes related to rapid glacier retreat negatively impacted total plant taxa richness changes during the late glacial (rho = −0.587, alpha level = 0.025; Table 1). This might be attributed to disturbances on unstable slopes restricting vegetation establishment28. The increase in pedogenic minerals (as indicated by sedimentary proxy Mg/Ca ratio from the same record29, Fig. 2j) may have promoted the increase of richness of some alpine families (e.g. Polygonaceae, alpha level = 0.025; Ranunculaceae, alpha level = 0.025; Orobanchaceae, alpha level = 0.05, Supplementary Table 4), supporting the idea that soil development contributes to the coexistence of a large number of plant species30. However, it is not the key driver for the total plant taxa richness (alpha level = 0.25, Table 1).
We find a strong positive relationship between total plant taxa richness and the alpine habitat extent (Fig. 2h, red area; which itself negatively correlates with sedaDNA signals of Picea) in the catchment area after 14 ka (alpha level < 0.05; Table 1). Hence, in contrast to our expectation from the modern elevational plant taxa richness gradients in the Hengduan Mountains that peaks in the upper forest belt (Fig. 1a), early Holocene forest expansion into the catchment of Lake Naleng did not result in a plant taxa richness increase but in a richness decrease (alpha level = 0.01; Table 1). Accordingly, the reconstructed and simulated late-Holocene forest retreat (i.e., alpine area extent) also correlates with a richness increase (alpha level = 0.0005, Table 1). We assume that the retreat of forests is related to late Holocene cooling and weakening of the Asian summer monsoon, not to human impact, and is supported by a lack of late Holocene forest burning20.
Interestingly, sedaDNA results show that relative abundance and plant taxa richness within high-alpine plant families such as Asteraceae, Orobanchaceae, and Saxifragaceae can differ substantially, such that we find high within-family richness but low relative abundance at 14–10 ka (Fig. 2d–f). Of course, the relationship between relative read abundance and relative abundance of the taxon in the vegetation is still poorly understood and previous studies indicate that biases originate from, for example, PCR setup (e.g. preference for short reads and reads with high GC content31). However, studies of modern lake sediments have also shown that the compositional differences among sites are preserved32,33. Similarity in compositional changes between Lake Naleng sedaDNA and the pollen record supports this finding (Supplementary Fig. 2). We speculate that simultaneous high diversity of alpine habitats and maximum alpine habitat extent during 14–10 ka in the catchment of Lake Naleng may have provided habitats for many different plant taxa thereby suppressing domination by a few taxa34, which could, in turn, have created microenvironments that facilitate novel taxa migration.
We assume that the signs for late Holocene grazing intensification in sedaDNA and pollen records (Supplementary Fig. 2) are related to human impact. They indicate that human impact started or substantially increased after 3.6 ka, which aligns with archaeological evidence35. Their positive correlation with total plant taxa richness in the palaeorecord (10–0 ka, rho = 0.939, alpha level = 0.0005, Table 1) is consistent with the findings from experimental studies that moderate land use can increase taxa richness14. However, we find that the positive effect is smaller compared to the negative effect of alpine habitat loss due to forest invasion (10–0 ka, rho = 0.966, alpha level = 0.0005, Table 1).
Potential pattern of future plant taxa richness
In summary, we identified alpine habitat extent as the best predictor variable for total plant taxa richness (explained deviance = 96.04%, Supplementary Table 5), which was then used in a generalized linear model (Fig. 3a). The simulations project extensive alpine habitat loss in the Lake Naleng catchment area over the next 250 years in response to a predicted 2.5 °C warming (Supplementary Fig. 5), leading to a pronounced decrease in total plant taxa richness (Fig. 3b) and restricting cold-adapted taxa to high-mountain regions (Fig. 3c). In particular, taxa of endemic-rich high-alpine families are likely to disappear from the catchment (Supplementary Fig. 6). Because similar habitats will be rare in the surrounding Hengduan Mountains and the amplification of warming in high-elevation central Tibetan areas36 is unfavourable for these taxa, these taxa may become extinct. An upward expansion of montane taxa and a loss of high alpine taxa in the study area agree with predictions from a comprehensive species distribution modelling approach for the Hengduan Mountains4.
Our approach has several shortcomings. It assumes that tree-line change is sensitive to temperature change in the region. Although this assumption is supported by palaeoecological evidence showing that forests expanded into higher elevations under warming during the early- to mid-Holocene and retreated to lower elevations during the late Holocene cooling22, and by modern observations of an upward shifting treeline on the southeastern Tibetan Plateau in the past 100 years37, the pace of treeline response is observed to lag the temperature warming in some mountain regions due to a variety of processes. Such processes include interspecific competition, forest-shrub interactions, dispersal variations, or even extreme climate events37,38. So, our temperature-treeline-richness relationship may therefore be correct on a millennial time scale but may overestimate changes on shorter time scales. Furthermore, our approach considers plant richness as a whole or focus on certain alpine families. Therefore, the habitat gain and loss of individual taxa or specific functional groups cannot be evaluated. Hence, our simulated taxa loss in relation to shrinking alpine habitat extent should be treated as a potential pattern by analogy to the past. It requires confirmation from a more sophisticated species-specific approach that also considers realistic migrational lags.
Implications for conservation of future plant taxa richness
With respect to conservation efforts of the unique Hengduan Mountains diversity hotspot the following conclusions can be derived from our time-series approach (Fig. 4). First, to maintain high total plant richness, nature conservation should focus on alpine habitats. This contrasts with the conclusions based on elevational gradients which suggest prioritizing forests39. Focusing on alpine areas will also help to protect richness within alpine plant families that contain many endemic taxa as a result of Quaternary glaciations and geographical isolation40. Second, areas harbouring extensive alpine habitat and habitat diversity in the current upper alpine zone should be protected to provide space for warming-related upward plant migration. Third, any potential positive effects from grazing management are probably too weak to compensate for climate-change impacts on plant diversity in alpine habitats. Our study indicates that time-series investigations from palaeoecological investigations using sedaDNA can inform decision-making in nature conservation by revealing potential plant responses to changing environments and, when used alongside modelling studies of modern species distributions41, create a fuller picture of plant dynamics.
Lake Naleng (31.10° N, 99.75° E, 4200 m a.s.l.) is fed mainly via a major river channel on the northern side of the lake and drains from the southern margin. Several small streams from the adjacent mountains (up to 4,900 m a.s.l.) drain directly into the lake29. The basin of the lake was formed by glacial activity during the Last Glacial Maximum as indicated by erratic boulders and moraines29. The study area is influenced by the Indian summer monsoon, generating warm and wet conditions. Based on the instrumental data collected from Ganze station (31.62° N, 100.00°E; 3,522 m a.s.l.), the closest meteorological station about 80 km north-east of the lake, mean July temperature (MJT) is 14.3 °C and mean January temperature is −3.9 °C. Annual precipitation is about 620 mm with most falling from May to October. Yak and sheep livestock graze in the catchment during the summer. The vegetation composition has sharp environmental gradients:19,20 (1) montane forests, consisting of conifers (Abies aquamata, A. faxonia, Picea likiangensis, P. purpurea), are found up to 4,400 m a.s.l. and distributed mainly on north-facing slopes; (2) broadleaved plants (Betula, Rhododendron) form a secondary canopy; (3) alpine meadow (e.g. Polygonum, Kobresia) is found above the subalpine ecotone; (4) the high-alpine zone (4,900–5,200 m a.s.l.) is dominated by cushion and rosette plants (e.g. Saussurea).
In total, 72 sediment samples were collected from the core. In the climate chamber at a temperature of −10 °C, about 2 mm of the exposed sediment of frozen samples was removed with a small single-use clean blade and the inner part used for ancient DNA isolation.
Dating and chronology
Dating and an age-depth model are described in detail in a previous publication42. As macrofossils were absent throughout the core, sixteen samples of bulk organic carbon were selected for accelerator mass spectrometer (AMS) 14C dating at the Leibniz Institute Kiel. The determined lake reservoir effect of 1500 years was subtracted from each 14C date, prior to calibration to calendar years (cal yr BP) using CALIB 5.0.143,44.
DNA extraction, amplification, and high throughput next-generation sequencing
All DNA work was carried out in ancient DNA dedicated facilities at Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, using strict ancient DNA precautions and protocols. Each extraction batch included nine samples (3–10 g sample–1) and one extraction control, which was treated with a partially modified protocol of PowerMax® Soil DNA Isolation kit (Mo Bio Laboratories, Inc. USA). The isolation of DNA was first processed by loading 15 mL PowerBead solution, 1.2 mL C1 buffer, 0.8 mg proteinase K (VWR International), 0.5 mL 1 M dithiothreitol (VWR International), and samples into PowerBead tubes. Then, all tubes were vortexed in 10 min and incubated at 56 °C in a rocking shaker overnight under the aluminium foil protection. The subsequent extraction steps followed the manufacturer’s instructions of the kit and were completed on the second day and DNA was eluted in 1.6 ml C6 buffer. For amplification we used plant universal primers g and h targeting the P6 loop of the chloroplast trnL (UAA) intron (Supplementary Table 6)45. To distinguish the samples after sequencing, both primers were modified by adding an 8 bp tag with at least five different base pairs between each to the 5′ end46 and three additional NNNs for improving cluster detection on Illumina sequencing platforms47. Altogether, 25 µL per PCR reaction were prepared with the following reagents: Primers (forward: 5′ NNN(8 bp tag)GGGCAATCCTGAGCCAA 3′, reverse: 5′ NNN(8 bp tag)CCATTGAGTCTCTGCACCTATC 3′) with the final concentration of 0.4 µM, 1× Platinum® Taq DNA Polymerase High Fidelity PCR buffer (Invitrogen, USA), 0.25 mM dNTPS, 0.8 mg Bovine Serum Albumin, 2 mM MgSO4 (Invitrogen, USA), 1 U Platinum® Taq High Fidelity DNA Polymerase (Invitrogen, USA) and 3 µL of sedaDNA template. PCRs (polymerase chain reactions) were run in the Post-PCR area separate from the ancient DNA facilities at 94 °C for 5 min (initial denaturation), followed by 50 cycles of 94 °C for 30 s, 50 °C for 30 s, 68 °C for 30 s and a final extension at 72 °C for 10 min. A no template control (NTC) was included for each PCR batch which included nine DNA extractions and one extraction control. PCR set-ups were conducted under a dedicated UV working station in the detached ancient DNA laboratory physically separated from the workplace of Post-PCR where we did the thermal cycling, purification, and pooling. Each PCR batch was replicated until obtaining two positive PCR replicates for each lake-sediment sample when the associated controls were negative. A qualified positive PCR product was considered only if it matched two conditions: (1) the gene band is evidently longer than that of negative controls; (2) the brightest staining is in the 100–200 bp range. Specifically, the thin/blurry products below 50 bp in corresponding controls are primer dimers and not expected PCR products. PCR products were visualized with 2% agarose gel electrophoresis, purified with the MinElute PCR Purification Kit (Qiagen, Germany), and measured with the ds-DNA BR Assay and the Qubit® 2.0 fluorometer (Invitrogen, USA) using 1 μL of PCR product. All purified PCR products were equimolarly pooled and sent away for sequencing to Fasteris SA, which used the MetaFast library protocol prior to sequencing on an Illumina HiSeq 2500 sequencing platform with paired-end reads of 125 bp length with the mode HiSeq High Output Version 4 by applying the HiSeq SBS Kit v4. Our project was sequenced together with another unknown sequencing project on a full HiSeq 2500 lane and resulted in 9.5 Gb with 37,922,797 generated clusters ≥Q30.
Sequence analysis and taxonomic assignment
The sequence data were analysed using OBITools software48. First, the paired-end DNA sequence reads were aligned using the illuminapairedend program in order to assemble the forward and reverse sequence reads. With the program ngsfilter paired sequences were assigned to samples based on their unique tag combination used for each sample. Afterwards read counts were summarized for unique sequence types using obiuniq, and obigrep was used to discard those sequences with a length <10 bp and a total count <10 reads in whole dataset. Subsequently, obiclean was used to exclude sequence variants which are likely attributed to PCR or sequencing errors by determining the sequences into head, internal, and singletons based on sequence count and similarity within one sample. Finally, sequence types were taxonomically assigned using the ecotag program, which was run on two reference databases: EMBL Nucleotide Database (standard sequence release 12725) and Arctic and Boreal vascular plant and bryophyte reference libraries49,50,51. The applied EMBL database was created by using an in silico PCR52 with the g/h primers allowing five mismatches between primer and the targeted sequences of the EMBL entries to increase the taxonomic breadth.
sedaDNA data quality control
To further denoise, only those sequence types that were assigned to terrestrial seed plants and have a best identity value greater than or equal to 0.95 were kept for the following data processing. Sequence counts <10 in each sample were replaced with 0 using R software53. Subsequently, we excluded assumed contaminations from fruits, cultivars and taxa not occurring in China, which were Musaceae, PACMAD clade, Lycopersicon, BOP clade, Maleae, etc. from the dataset. The extraction and PCR blanks were mostly without any contamination, only in a single NTC (sample ID: NTC6, Supplementary Data 1) were some plant DNA fragments detected. However, we did not remove these sequences from its controlled samples (Supplementary Discussion). In addition, sequences were only considered as genuine if the corresponding taxa could be found in the study area (Hengduan Mountains), which was aided by an international open access database54. We collected all sequence types with the best identity value of 1 and a frequency ≥2, into a dataset named bestid1, while the bestid0.95 dataset included the sequences with best identity ≥0.95 and frequency ≥2. Sample ESL024 was excluded from further statistical analysis because no reads were obtained (Supplementary Data 1 and Data 2).
Glacier-extant and habitat area simulation
Several previous studies have shown that the climate at a millennial time scale on the eastern Tibetan Plateau is strongly impacted by monsoons, particularly the East Asian summer monsoon, which tracks changes in the westerlies and continental warming that are largely a function of mid- to high-latitude changes55,56. Thus, the past climate change in our study was inferred by the synthesized record of Northern Hemisphere (30°–90° N) temperature anomaly since the last deglaciation57,58. The physical surface area is a prerequisite for suitable habitat development and taxa shifts. Hence, we modelled the past glacier-cover changes first (Fig. 1b). We considered the catchment area that is available for plant colonization to be ice-free. Past ice extents were estimated with the numerical ice-flow model GC2D59. We ran simulations on the present-day topography, based on a 90-m resolution SRTM digital elevation model. Climate was imposed through a vertical mass-balance profile that we estimated from present-day conditions. Based on the spatially averaged mean elevation of present-day glaciers in the vicinity60, we estimated an equilibrium line altitude (ELA) of ~5200 m61. We estimated the maximum ice accumulation rate to be 0.25 m yr−1, based on different gridded precipitation data sets (HAR62, GPCC63). Guided by observations from modern Tibetan glaciers64, and by matching the present-day distribution of ice cover in the wider region of our study area, we estimated a mass balance gradient of 0.0115 m yr−1 m−1. Glacier and snow cover through time were interpolated from the corresponding ▵ELA based on an integrated temperature lapse rate of 0.55 °C 100 m−1 in the Hengduan Mountains65 and reconstructed past temperature. To clarify, the ice-flow model only simulated the past flow of ice and any concurrent advance and retreat.
We then modelled the available habitable area within the lake catchment back-in-time (Supplementary Fig. 7) according to the following steps: (1) delineate the catchment using the global 1-arcsecond (90-m) SRTM digital elevation model and downscale to 30-m resolution for simulation; (2) combine the two reconstructed past temperature records;57,58 (3) calculate the relative elevation of each pixel based on the integrated temperature lapse rate of 0.55 °C 100 m−1 the same as for the glacier model65 for the catchment over the past 18 ka for each step of 500 years by the relative temperature change from the constructed temperature series in (2); (4) calculate the elevational range of the catchment over the past 18 ka under the effect of simulated glacier cover; (5) group the elevation values per 100-m elevational band ranging from 1000 m a.s.l. to 6000 m a.s.l. under the effect of simulated glacier cover and sum the pixels in all elevational bands as total habitable area (Fig. 2h, black line); (6) sum up the number of pixels above the modern treeline (~ 4,400 m a.s.l.) to obtain the alpine habitat area (Fig. 2h, red area); and (7) compute the elevational range above the modern treeline in the catchment (2050–2300 CE) under the projections of temperature according to RCP 4.5 emissions scenario (source: http://svn.zmaw.de/svn/cosmos/branches/releases/mpi-esm-cmip5/src/mod) for indicating the loss of alpine habitat under the ongoing climate warming (Supplementary Fig. 5). The modern treeline was calculated based on 49 current treeline points taken from high-resolution satellite images in GoogleEarthTM and open-source data (Supplementary Fig. 8 and Supplementary Table 7). It should be noted that uncertainties in the simulation may arise from species interactions and potentially lagging treeline response to climate warming37.
All statistical calculations were carried out using the R software53. All correlations were computed using the corr.test(method = “spearman”) in the psych package66. The Spearman Rank Correlation (rho), non-adjusted/adjusted probability values (p-value/adjusted p-value) and sample size (n) were obtained. Both p-values are helpful to check the significance of correlation under the unadjusted degrees of freedom, which is equal to sample size (n) minus 2.
As plant taxa richness increases with read counts, we rarefied the data bestid1 to equal counts for each sample based on the minimal total read count occurring in the entire sedaDNA dataset (base count = 11,949) 100 times (Supplementary Code 1), as well as the data bestid0.95 (base count = 13,344).
To investigate potential methodological biases of the sedaDNA-based plant taxa richness, two additional datasets were set up: (1) metabarcoding data before taxonomic assignment (hereafter referred as non-ecotag); (2) all terrestrial seed plant sequences without further sequence filtering (hereafter referred as terSeq data). Both datasets were rarefied to their respective minimal total read count (16,209; 14,645) 100 times. We investigated whether plant taxa richness is correlated to read counts for both datasets and whether it is correlated with total plant taxa richness of bestid1 dataset using corr.test(adjust = “none”).
To test if the plant taxa richness and composition are stable, we first collected the deeply sequenced PCR product for each lake sediment sample from the dataset bestid1 (hereafter referred as single data). Then, we rarefied these data 100 times based on the minimal read count (6,339) across all samples and calculated the plant taxa richness. The correlation between plant taxa richness from data bestid1 and single data was calculated using corr.test(adjust = “none”). Finally, Procrustes, and Protest analyses were applied to check whether sample scores and taxa scores of the first two PCA (principal component analysis) axes of the single data match those of the bestid1 dataset. Only those sequences with a maximum relative read abundance of 0.25% at least were kept. Double-square root transformation was applied before PCA analysis67. The procrustes(), protest() and rda(scale = FALSE) are available in the vegan package68.
In order to calculate the plant taxa richness for each single taxonomic family, we divided sedaDNA sequence data into subsets of taxonomic families and rarefied these subsets to a cut-off value of 100 total read counts to minimize the effect of relative abundance of taxonomic family. Such richness signals of families with low read counts (e.g., Ranunculaceae) could be compared with richness signals of families having higher read counts (e.g., Asteraceae). Furthermore, four distinct vegetation zones were classified using the chclust(method = “CONISS”) in vegan package68 based on the rarefied relative read abundance. We summed up the samples in each zone and computed the mean value of read counts per zone. Then, we rarefied the zonal data to its minimal total read count (40,377) 100 times. Afterwards, we computed the total vegetation turnover (beta diversity) using the turnover() function in the codyn package69 based on the rarefied zonal data (Supplementary Code 2).
To identify the main drivers of plant taxa richness, we calculated the correlation coefficient between plant taxa richness and driver variables using corr.test (adjust = “Bonferroni”) (Supplementary Code 3 and Supplementary Data 3). We separated the complete richness time-series into three time-intervals each consisting of two consecutive vegetation zones (according to CONISS), i.e., 18–10 ka, 14–3.6 ka, 10–0 ka. This approach accounts for variation of driver importance throughout the record and can even reveal sign changes in the relationship between driver variables and plant taxa richness. We used the smoothed data for plant richness and two predictor variables (Mg/Ca ratio and land-use) in the processing of correlation calculation, so that we adjusted the degrees of freedom to get the effective independent variables of the smoothed data. We used gam() for land-use data smoothing as it included a large number of zeros and loess(span = 0.5) to smooth the Mg/Ca ratio and plant taxa richness. Consequently, to obtain the alpha level we compared rho values with exact critical values of Spearman’s rho according to adjusted degrees of freedom. Only an alpha level ≤ 0.05 is considered statistically significant. The results of correlation are summarized in Table 1 and Supplementary Table 4. The related code is available in Supplementary Code 4. A generalized linear model (GLM) was built using glm(family = “gaussian”) for total plant taxa richness and those families that are significantly related to the predictor variables (alpine habitat area and land-use indicator) during 10–0 ka (Supplementary Code 5). This period was selected as it covers the warmest and most modern phase of the record. Moreover, the correlation between total plant taxa richness and alpine habitat is highest in this time interval. The temporal resolution of the correlated time-series was about 250 years. The proportion of deviance explained by the GLM was calculated using Dsquared() in the modEvA package70. We predicted the total richness and within-family richness for 2050–2300 C.E. in 50-year time steps using glm.predict() based on the most important predictor variable (alpine habitat area) in the GLM models (Supplementary Code 5). The variable importance was calculated using varlmp() in caret package71. The within-family sedaDNA data was analysed using the same data processing.
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
The raw NGS sequencing data that support the findings of this study have been archived in NCBI Sequence Read Archive (SRA) with the accession code SRR13957608 and in BioProject PRJNA596631. It also has been deposited in addition to the tag-to-sample matrix and taxonomic reference database that support the NGS sequencing data analysis in Dryad Digital Repository with the identifier https://doi.org/10.5061/dryad.vdncjsxth. The filtered sedaDNA datasets analysed during this study are provided in Supplementary Data 1 and 2. The data used for statistical analyses are available in Supplementary Data 3. The AMS-dating results and calibrated ages of the lake-sediment core can be found at https://doi.org/10.1007/s00334-009-0219-5. The pollen data have been published at https://doi.org/10.1016/j.yqres.2009.12.003 and https://doi.org/10.1016/j.palaeo.2009.12.001. The gridded precipitation data sets are open access and can be downloaded from HAR (https://www.klima.tu-berlin.de/index.php?show=daten_har2&lan=en) and GPCC (https://www.dwd.de/EN/ourservices/gpcc/gpcc.html). Source data are provided with this paper.
The Obitools scripts for NGS sequencing analysis are archived on Dryad Digital Repository with the identifier https://doi.org/10.5061/dryad.vdncjsxth. The MATLAB code for glacier modelling is available at https://csdms.colorado.edu/wiki/Model:Gc2d. Rarefaction v1.0 is available on Zenodo: https://doi.org/10.5281/zenodo.4562708. PastElevationChange v1.0, which includes ice cover, palaeo-temperature, digital elevation models of the lake catchment, 2006–2300 C.E. temperature with RCP4.5 to reproduce the habitat simulation within lake catchment, is available on Zenodo: https://doi.org/10.5281/zenodo.4562675. The R codes for statistical analyses and visualization are available in Supplementary Code 1–5).
Körner, C. Alpine Plant Life: Functional Plant Ecology of High Mountain Ecosystems; with 47 Tables. (Springer Science & Business Media, 2003).
Alexander, J. M. et al. Plant invasions into mountains and alpine ecosystems: current status and future challenges. Alp. Bot. 126, 89–103 (2016).
Xu, J. et al. The Melting Himalayas: cascading effects of climate change on water, biodiversity, and livelihoods. Conserv. Biol. 23, 520–530 (2009).
Liang, Q. et al. Shifts in plant distributions in response to climate warming in a biodiversity hotspot, the Hengduan Mountains. J. Biogeogr. 45, 1334–1344 (2018).
Newbold, T. et al. Global effects of land use on local terrestrial biodiversity. Nature 520, 45–50 (2015).
Dullinger, S. et al. Extinction debt of high-mountain plants under twenty-first-century climate change. Nat. Clim. Change 2, 619–622 (2012).
Zhang, D. C., Zhang, Y. H., Boufford, D. E. & Sun, H. Elevational patterns of species richness and endemism for some important taxa in the Hengduan Mountains, southwestern China. Biodivers. Conserv. 18, 699–716 (2009).
Tang, Z., Wang, Z., Zheng, C. & Fang, J. Biodiversity in China’s mountains. Front. Ecol. Environ. 4, 347–352 (2006).
Lomolino, Mark V. Elevation gradients of species-density: historical and prospective views. Glob. Ecol. Biogeogr. 10, 3–13 (2001).
Colwell, RobertK. & Lees, D. C. The mid-domain effect: geometric constraints on the geography of species richness. Trends Ecol. Evol. 15, 70–76 (2000).
Dirnböck, T., Essl, F. & Rabitsch, W. Disproportional risk for habitat loss of high-altitude endemic species under climate change. Glob. Change Biol. 17, 990–996 (2011).
Su, X., Han, W., Liu, G., Zhang, Y. & Lu, H. Substantial gaps between the protection of biodiversity hotspots in alpine grasslands and the effectiveness of protected areas on the Qinghai-Tibetan Plateau, China. Agric. Ecosyst. Environ. 278, 15–23 (2019).
Zhang, Y. et al. Spatial phylogenetics of two topographic extremes of the Hengduan Mountains in southwestern China and its implications for biodiversity conservation. Plant Divers. (2020) https://doi.org/10.1016/j.pld.2020.09.001.
Hopping, K. A., Knapp, A. K., Dorji, T. & Klein, J. A. Warming and land use change concurrently erode ecosystem services in Tibet. Glob. Change Biol. 24, 5534–5548 (2018).
Trivedi, M. R., Berry, P. M., Morecroft, M. D. & Dawson, T. P. Spatial scale affects bioclimate model projections of climate change impacts on mountain plants. Glob. Change Biol. 14, 1089–1103 (2008).
Gavin, D. G. et al. Climate refugia: joint inference from fossil records, species distribution models and phylogeography. N. Phytol. 204, 37–54 (2014).
Birks, H. J. B. et al. Does pollen-assemblage richness reflect floristic richness? A review of recent developments and future challenges. Rev. Palaeobot. Palynol. 228, 1–25 (2016).
Parducci, L. et al. Ancient plant DNA in lake sediments. N. Phytol. 214, 924–942 (2017).
Kramer, A., Herzschuh, U., Mischke, S. & Zhang, C. Late glacial vegetation and climate oscillations on the southeastern Tibetan Plateau inferred from the Lake Naleng pollen profile. Quat. Res. 73, 324–335 (2010).
Kramer, A., Herzschuh, U., Mischke, S. & Zhang, C. Holocene treeline shifts and monsoon variability in the Hengduan Mountains (southeastern Tibetan Plateau), implications from palynological investigations. Palaeogeogr. Palaeoclimatol. Palaeoecol. 286, 23–41 (2010).
Hou, G., Yang, P., Cao, G., Chongyi, E. & Wang, Q. Vegetation evolution and human expansion on the Qinghai–Tibet Plateau since the Last Deglaciation. Quat. Int. 430, 82–93 (2017).
Chen, F. et al. Climate change, vegetation history, and landscape responses on the Tibetan Plateau during the Holocene: a comprehensive review. Quat. Sci. Rev. 243, 106444 (2020).
Singh, U. M., Gupta, V., Rao, V. P., Sengar, R. S. & Yadav, M. K. A review on biological activities and conservation of endangered medicinal herb Nardostachys jatamansi. Int. J. Med. Arom. Plants 3, 113–124 (2013).
Li, X. H., Zhu, X. X., Niu, Y. & Sun, H. Phylogenetic clustering and overdispersion for alpine plants along elevational gradient in the Hengduan Mountains Region, southwest China: Phylogenetic structure along elevational gradient. J. Syst. Evol. 52, 280–288 (2014).
Kanz, C. et al. The EMBL nucleotide sequence database. Nucleic Acids Res. 33, D29–D33 (2005).
Yu, H. et al. Contrasting floristic diversity of the Hengduan mountains, the Himalayas and the Qinghai-Tibet Plateau Sensu Stricto in China. Front. Ecol. Evol. 8 (2020).
Scheiner, S. M. et al. The underpinnings of the relationship of species richness with space and time. Ecol. Monogr. 81, 195–213 (2011).
Milner, A. M. et al. Glacier shrinkage driving global changes in downstream systems. Proc. Natl Acad. Sci. USA 114, 9770–9778 (2017).
Opitz, S., Zhang, C., Herzschuh, U. & Mischke, S. Climate variability on the south-eastern Tibetan Plateau since the Lateglacial based on a multiproxy approach from Lake Naleng – comparing pollen and non-pollen signals. Quat. Sci. Rev. 115, 112–122 (2015).
Laliberté, E. et al. How does pedogenesis drive plant diversity? Trends Ecol. Evol. 28, 331–340 (2013).
Nichols, R. V. et al. Minimizing polymerase biases in metabarcoding. Mol. Ecol. Resour. 18, 927–939 (2018).
Alsos, I. G. et al. Plant DNA metabarcoding of lake sediments: How does it represent the contemporary vegetation. PLoS ONE 13, e0195403 (2018).
Niemeyer, B., Epp, L. S., Stoof-Leichsenring, K. R., Pestryakova, L. A. & Herzschuh, U. A comparison of sedimentary DNA and pollen from lake sediments in recording vegetation composition at the Siberian treeline. Mol. Ecol. Resour. 17, e46–e62 (2017).
Stein, A., Gerstner, K. & Kreft, H. Environmental heterogeneity as a universal driver of species richness across taxa, biomes and spatial scales. Ecol. Lett. 17, 866–880 (2014).
Chen, F. H. et al. Agriculture facilitated permanent human occupation of the Tibetan Plateau after 3600 B.P. Science 347, 248–250 (2015).
Group, M. R. I. E. W. et al. Elevation-dependent warming in mountain regions of the world. Nat. Clim. Change 5, 424–430 (2015).
Liang, E. et al. Species interactions slow warming-induced upward shifts of treelines on the Tibetan Plateau. Proc. Natl Acad. Sci. USA 113, 4380–4385 (2016).
Alexander, J. M. et al. Lags in the response of mountain plant communities to climate change. Glob. Change Biol. 24, 563–579 (2018).
Zu, K. et al. Altitudinal biodiversity patterns of seed plants along Gongga Mountain in the southeastern Qinghai–Tibetan Plateau. Ecol. Evol. 9, 9586–9596 (2019).
Sun, H., Zhang, J., Deng, T. & Boufford, D. E. Origins and evolution of plant diversity in the Hengduan Mountains, China. Plant Divers. 39, 161–166 (2017).
Wiens, J. A., Stralberg, D., Jongsomjit, D., Howell, C. A. & Snyder, M. A. Niches, models, and climate change: assessing the assumptions and uncertainties. Proc. Natl Acad. Sci. 106, 19729–19736 (2009).
Kramer, A., Herzschuh, U., Mischke, S. & Zhang, C. Late Quaternary environmental history of the south-eastern Tibetan Plateau inferred from the Lake Naleng non-pollen palynomorph record. Veg. Hist. Archaeobotany 19, 453–468 (2010).
Stuiver, M. & Reimer, P. J. Extended 14C Data Base and Revised CALIB 3.0 14C Age Calibration Program. Radiocarbon 35, 215–230 (1993).
Intcal04 Terrestrial Radiocarbon Age Calibration, 0–26 Cal Kyr BP. Radiocarbon 46, 1029–1058 (2004).
Taberlet, P. et al. Power and limitations of the chloroplast trnL (UAA) intron for plant DNA barcoding. Nucleic Acids Res. 35, e14–e14 (2007).
Coissac, E. OligoTag: A Program for Designing Sets of Tags for Next-Generation Sequencing of Multiplexed Samples. in Data Production and Analysis in Population Genomics: Methods and Protocols (eds. Pompanon, F. & Bonin, A.) 13–31 (Humana Press, 2012). https://doi.org/10.1007/978-1-61779-870-2_2.
De Barba, M. et al. DNA metabarcoding multiplexing and validation of data accuracy for diet assessment: application to omnivorous diet. Mol. Ecol. Resour. 14, 306–323 (2014).
Boyer, F. et al. obitools: a unix-inspired software package for DNA metabarcoding. Mol. Ecol. Resour. 16, 176–182 (2016).
SøNstebø, J. H. et al. Using next-generation sequencing for molecular reconstruction of past Arctic vegetation and climate: TECHNICAL ADVANCES. Mol. Ecol. Resour. 10, 1009–1018 (2010).
Willerslev, E. et al. Fifty thousand years of Arctic vegetation and megafaunal diet. Nature 506, 47–51 (2014).
Soininen, E. M. et al. Highly overlapping winter diet in two sympatric lemming species revealed by DNA metabarcoding. PLoS ONE 10, e0115335 (2015).
Ficetola, G. F. et al. Replication levels, false presences and the estimation of the presence/absence from eDNA metabarcoding data. Mol. Ecol. Resour. 15, 543–556 (2015).
R Core Team. R: A Language and Environment for Statistical Computing. (2019).
Brach, A. R. & Song, H. eFloras: New directions for online floras exemplified by the Flora of China Project. TAXON 55, 188–192 (2006).
Zhao, Y. et al. Evolution of vegetation and climate variability on the Tibetan Plateau over the past 1.74 million years. Sci. Adv. 6, eaay6193 (2020).
Herzschuh, U. et al. Position and orientation of the westerly jet determined Holocene rainfall patterns in China. Nat. Commun. 10, 2376 (2019).
Shakun, J. D. et al. Global warming preceded by increasing carbon dioxide concentrations during the last deglaciation. Nature 484, 49–54 (2012).
Marcott, S. A., Shakun, J. D., Clark, P. U. & Mix, A. C. A reconstruction of regional and global temperature for the past 11,300 years. Science 339, 1198–1201 (2013).
Kessler, M. A., Anderson, R. S. & Stock, G. M. Modeling topographic and climatic control of east-west asymmetry in Sierra Nevada glacier length during the Last Glacial Maximum. J. Geophys. Res. Earth Surf. 111, F02002 (2006).
Pfeffer, W. T. et al. The Randolph Glacier Inventory: a globally complete inventory of glaciers. J. Glaciol. 60, 537–552 (2014).
Braithwaite, R. J. From Doktor Kurowski’s Schneegrenze to our modern glacier equilibrium line altitude (ELA). Cryosphere 9, 2135–2148 (2015).
Maussion, F. et al. Precipitation seasonality and variability over the Tibetan Plateau as resolved by the high asia reanalysis. J. Clim. 27, 1910–1927 (2014).
Anja, M.-C. et al. GPCC Climatology Version 2011 at 0.25°: Monthly Land-Surface Precipitation Climatology for Every Month and the Total Year from Rain-Gauges built on GTS-based and Historic Data. https://doi.org/10.5676/DWD_GPCC/CLIM_M_V2011_025.
Yuzhong, Y., Qingbai, W. & Hanbo, Y. Stable isotope variations in the ground ice of Beiluhe Basin on the Qinghai-Tibet Plateau. Quat. Int. 313–314, 85–91 (2013).
Li, X. et al. Near-surface air temperature lapse rates in the mainland China during 1962–2011. J. Geophys. Res. Atmospheres 118, 7505–7515 (2013).
Revelle, W. psych: Procedures for Personality and Psychological Research, Northwestern University, Evanston, Illinois, USA. Northwest. Univ. (2018).
Zimmermann, H. H. et al. Sedimentary ancient DNA and pollen reveal the composition of plant organic matter in Late Quaternary permafrost sediments of the Buor Khaya Peninsula (north-eastern Siberia). Biogeosciences Discuss. 1–50 (2016) https://doi.org/10.5194/bg-2016-386.
Oksanen, J. et al. vegan: Community Ecology Package. (2019).
Hallett, L. M. et al. codyn: An r package of community dynamics metrics. Methods Ecol. Evol. 7, 1146–1151 (2016).
Barbosa, A. M., Brown, J. A., Jimenez-Valverde, A. & Real, R. modEvA: Model Evaluation and Analysis. R package version 1.3.2. (2016).
Kuhn, M. caret: Classification and Regression Training. R package version 6.0–86. (2020).
We thank David E. Boufford for providing some information about modern terrestrial plants in Hengduan Mountains, Yaling Wu for help with sub-sampling, and Cathy Jenks for English proofreading. We also thank Xinghua Li for providing the shapefile of Hengduan Mountains. This study was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation, grants 410561986 to S.K., Mi 730/1-1,2 to S.M.), US National Science Foundation (grants DEB-9705795, DEB-0321846), and China Scholarship Council (grant 201606180048 to S.L.).
Open Access funding enabled and organized by Projekt DEAL.
All authors declare no competing interests.
Peer review information Nature Communications thanks Peter Heintzman and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Liu, S., Kruse, S., Scherler, D. et al. Sedimentary ancient DNA reveals a threat of warming-induced alpine habitat loss to Tibetan Plateau plant diversity. Nat Commun 12, 2995 (2021). https://doi.org/10.1038/s41467-021-22986-4